BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 009153
         (542 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255540793|ref|XP_002511461.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
 gi|223550576|gb|EEF52063.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
          Length = 809

 Score =  911 bits (2355), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 435/576 (75%), Positives = 489/576 (84%), Gaps = 34/576 (5%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGG LPQSW  QQL+LQKKIL R+YELGMNPVLPAFSGNVPAAL+N+FPSAKI 
Sbjct: 234 MGNLHRWGGSLPQSWFFQQLILQKKILARMYELGMNPVLPAFSGNVPAALRNIFPSAKIA 293

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LGNWFSVKSD RWCCTYLLDATDPLFIEIGRAFIEQQL+EYG TSHIYNCDTFDENTPP
Sbjct: 294 RLGNWFSVKSDLRWCCTYLLDATDPLFIEIGRAFIEQQLEEYGSTSHIYNCDTFDENTPP 353

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           VD P+YIS+LGAA++ GMQSGD+DAVWLMQGWLFSYDPFWRPPQMKALL+SVP+G+LVVL
Sbjct: 354 VDDPKYISALGAAVFKGMQSGDNDAVWLMQGWLFSYDPFWRPPQMKALLHSVPVGRLVVL 413

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DLFAEVKPIW++S QFYGVPYIWCMLHNFAGN+EMYGILDSIA GPVEARTSEN+TM   
Sbjct: 414 DLFAEVKPIWTSSYQFYGVPYIWCMLHNFAGNVEMYGILDSIASGPVEARTSENSTMVGV 473

Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                         AWIN YS RRYGRSVP+IQDAW++LYHTVY
Sbjct: 474 GMSMEGIEQNPVVYDLMSEMAFQHKKVDVKAWINLYSTRRYGRSVPSIQDAWDILYHTVY 533

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           NCTDGA DKNRDVIVAFPDV+P   SV++ ++   GKPVS+ AVLK  + SYDHPHLWYS
Sbjct: 534 NCTDGAYDKNRDVIVAFPDVNPFYFSVSQKRHHLNGKPVSRRAVLKENSDSYDHPHLWYS 593

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
           TSEV+ ALELFI SG ELS S+TY YDL+DLTRQALAKY NELFL IIE+YQ ND +GV 
Sbjct: 594 TSEVLHALELFITSGEELSGSSTYSYDLVDLTRQALAKYGNELFLKIIESYQANDGNGVA 653

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
             S++FL+LVEDMD LL CH+GFLLGPWLESAKQLAQ++EQEKQ+EWNARTQITMWFDNT
Sbjct: 654 SRSQKFLDLVEDMDTLLGCHEGFLLGPWLESAKQLAQDQEQEKQFEWNARTQITMWFDNT 713

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
           ++EASLL DYGNKYWSGLL+DYYGPRAAIYFKY+I+SLE+G  F LKDWRREWIKLTN+W
Sbjct: 714 EDEASLLHDYGNKYWSGLLQDYYGPRAAIYFKYLIKSLENGKVFPLKDWRREWIKLTNEW 773

Query: 508 QNGRNVYPVESNGDALITSQWLYNKYLQGTG-VFDH 542
           Q  RN +PV+SNG+ALI S+WLY+KYL+     +DH
Sbjct: 774 QRSRNKFPVKSNGNALIISKWLYDKYLRNPDTTYDH 809


>gi|225457148|ref|XP_002280399.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Vitis vinifera]
          Length = 813

 Score =  911 bits (2355), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 437/578 (75%), Positives = 486/578 (84%), Gaps = 37/578 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPLPQSWLDQQL+LQKKIL R+YELGM PVLPAFSGNVPAAL+ +FPSAKIT
Sbjct: 235 MGNLHGWGGPLPQSWLDQQLLLQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKIT 294

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LGNWF+V  +PRWCCTYLLDATDPLFIEIG+AFI+QQLKEYGRT HIYNCDTFDENTPP
Sbjct: 295 RLGNWFTVGGNPRWCCTYLLDATDPLFIEIGKAFIQQQLKEYGRTGHIYNCDTFDENTPP 354

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           VD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYDPFWRPPQMKALL+SVP+G+LVVL
Sbjct: 355 VDDPEYISSLGAAIFRGMQSGDSNAIWLMQGWLFSYDPFWRPPQMKALLHSVPMGRLVVL 414

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DLFAEVKPIW TS+QFYGVPYIWCMLHNFAGNIEMYGILD++A GPVEARTSEN+TM   
Sbjct: 415 DLFAEVKPIWITSEQFYGVPYIWCMLHNFAGNIEMYGILDAVASGPVEARTSENSTMVGV 474

Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          WI  YS RRYG+SVP IQDAWN+LYHTVY
Sbjct: 475 GMSMEGIEQNPVVYDLMSEMAFQHSKVDVKVWIALYSTRRYGKSVPEIQDAWNILYHTVY 534

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTE----GKYQNYGKPVSKEAVLKSETSSYDHPH 323
           NCTDG+ DKNRDVIVAFPD+DPS I   +    G Y  YGK VS+  VLK  T+S++ PH
Sbjct: 535 NCTDGSYDKNRDVIVAFPDIDPSFIPTPKLSMPGGYHRYGKSVSRRTVLKEITNSFEQPH 594

Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
           LWYSTSEV  AL LFIASG +L  SNTYRYDL+DLTRQALAKYAN+LFL +IEAYQLND 
Sbjct: 595 LWYSTSEVKDALGLFIASGGQLLGSNTYRYDLVDLTRQALAKYANQLFLEVIEAYQLNDV 654

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            G    S++FLELVEDMD LLACHDGFLLGPWLESAKQLAQ+E+QE Q+EWNARTQITMW
Sbjct: 655 RGAACHSQKFLELVEDMDTLLACHDGFLLGPWLESAKQLAQDEQQEIQFEWNARTQITMW 714

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
           FDNT++EASLLRDYGNKYWSGLLRDYYGPRAAIYFKY++ESLE+G+ F LKDWRREWIKL
Sbjct: 715 FDNTEDEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYLLESLETGNEFALKDWRREWIKL 774

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFD 541
           TNDWQN RN YPV S+G+A+ TS+ LYNKYLQ   ++D
Sbjct: 775 TNDWQNSRNAYPVRSSGNAIDTSRRLYNKYLQDPEIYD 812


>gi|224121634|ref|XP_002318632.1| predicted protein [Populus trichocarpa]
 gi|222859305|gb|EEE96852.1| predicted protein [Populus trichocarpa]
          Length = 812

 Score =  903 bits (2334), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 425/575 (73%), Positives = 484/575 (84%), Gaps = 33/575 (5%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NLH WGGPLPQSW DQQLVLQKKIL R+YELGM PVLPAFSGNVPAAL+N+FPSAKIT
Sbjct: 238 MANLHRWGGPLPQSWFDQQLVLQKKILARMYELGMTPVLPAFSGNVPAALRNIFPSAKIT 297

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LGNWFSV+SD RWCCTYLLDATDPLFIEIGRAFIEQQL EYG TSHIYNCDTFDENTPP
Sbjct: 298 RLGNWFSVRSDVRWCCTYLLDATDPLFIEIGRAFIEQQLTEYGSTSHIYNCDTFDENTPP 357

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           VD PEYISSLG +I+ GMQSGDS+AVWLMQGWLFSYDPFWRPPQ KALL+SVP+G+LVVL
Sbjct: 358 VDDPEYISSLGGSIFEGMQSGDSNAVWLMQGWLFSYDPFWRPPQTKALLHSVPIGRLVVL 417

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
           DLFAEVKPIW+TS+QFYGVPYIWCMLHNFAGN+EMYG LDS+A GPVEARTSEN+TM   
Sbjct: 418 DLFAEVKPIWNTSEQFYGVPYIWCMLHNFAGNLEMYGYLDSVASGPVEARTSENSTMVGV 477

Query: 239 -------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          WI+ YS RRYGRSVP IQ+AWN+LYHTVY
Sbjct: 478 GMSMEGIEQNPVVYDLMSEMAFQKNKVDVKEWIDLYSARRYGRSVPTIQNAWNILYHTVY 537

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           NCTDGA DKNRDVIVAFPDV+P+++S+ +G++    K VS+ A L   T SY+HPHLWYS
Sbjct: 538 NCTDGAYDKNRDVIVAFPDVNPNLVSMLQGRHHTDVKLVSRRAALIKNTDSYEHPHLWYS 597

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
           T+EV+RALELFIA G+ELS S+TY YDL+DLTRQ LAKYANELFL +IEAY+L D+HGV 
Sbjct: 598 TTEVVRALELFIAGGDELSGSSTYSYDLVDLTRQVLAKYANELFLKVIEAYRLKDSHGVA 657

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
             S+ FL+LVED+D LLACH+GFLLGPWLESAKQLAQ+EEQ+ Q+EWNARTQITMW+DNT
Sbjct: 658 HQSQMFLDLVEDIDTLLACHEGFLLGPWLESAKQLAQDEEQQIQFEWNARTQITMWYDNT 717

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
           + EASLLRDYGNKYWSGLL+DYYGPRAAIYF ++ +SLE+G GF+LK WRREWIKLTN W
Sbjct: 718 EVEASLLRDYGNKYWSGLLKDYYGPRAAIYFNFLTQSLENGHGFQLKAWRREWIKLTNKW 777

Query: 508 QNGRNVYPVESNGDALITSQWLYNKYLQGTGVFDH 542
           Q  R ++PVESNG+AL  S+WLY+KYL     +DH
Sbjct: 778 QKSRKIFPVESNGNALNISRWLYHKYLGNPDTYDH 812


>gi|356519003|ref|XP_003528164.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Glycine max]
          Length = 812

 Score =  892 bits (2306), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 423/567 (74%), Positives = 479/567 (84%), Gaps = 34/567 (5%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPLPQSW DQQL+LQKKIL R++ELGM PVLPAFSGNVPAAL+++FPSAKIT
Sbjct: 233 MGNLHGWGGPLPQSWFDQQLILQKKILARMFELGMTPVLPAFSGNVPAALKHIFPSAKIT 292

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LGNWFSVK+D +WCCTYLLDATD LF+EIG+AFIE+QL+EYGRTSHIYNCDTFDENTPP
Sbjct: 293 RLGNWFSVKNDLKWCCTYLLDATDSLFVEIGKAFIEKQLQEYGRTSHIYNCDTFDENTPP 352

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           VD PEYISSLGAA + GMQSGD DAVWLMQGWLFSYDPFWRPPQMKALL+SVP+GKLVVL
Sbjct: 353 VDDPEYISSLGAATFKGMQSGDDDAVWLMQGWLFSYDPFWRPPQMKALLHSVPVGKLVVL 412

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DLFAEVKPIW TS+QFYGVPYIWCMLHNFAGNIEMYGILD+IA GP++ARTS N+TM   
Sbjct: 413 DLFAEVKPIWVTSEQFYGVPYIWCMLHNFAGNIEMYGILDAIASGPIDARTSNNSTMVGV 472

Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                         AW++ YS RRYG+++P IQ+ WNVLYHT+Y
Sbjct: 473 GMSMEGIEQNPIVYDLMSEMAFQHKKVDVKAWVDMYSTRRYGQTLPLIQEGWNVLYHTIY 532

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           NCTDGA DKNRDVIVAFPDVDPS+ISV   +  +  KP S   ++K  T S+D PHLWY 
Sbjct: 533 NCTDGAYDKNRDVIVAFPDVDPSLISVQHEQSHHNDKPYSG-TIIKEITDSFDRPHLWYP 591

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
           TSEVI ALELFI SG+ELS  NTYRYDL+DLTRQ LAKYANELF  +IEAYQ +D HG+ 
Sbjct: 592 TSEVIYALELFITSGDELSRCNTYRYDLVDLTRQVLAKYANELFFKVIEAYQSHDIHGMT 651

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
            LS+RFL+LVED+D LLACHDGFLLGPWLESAKQLA NEEQE+Q+EWNARTQITMWFDN+
Sbjct: 652 LLSQRFLDLVEDLDTLLACHDGFLLGPWLESAKQLALNEEQERQFEWNARTQITMWFDNS 711

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
            EEASLLRDYGNKYW+GLL DYYGPRAAIYFKY+ ESLESG+ F+L+ WRREWIKLTN+W
Sbjct: 712 DEEASLLRDYGNKYWNGLLHDYYGPRAAIYFKYLRESLESGEDFKLRGWRREWIKLTNEW 771

Query: 508 QNGRNVYPVESNGDALITSQWLYNKYL 534
           Q  RN++PVES+GDAL TS+WL+NKYL
Sbjct: 772 QKRRNIFPVESSGDALNTSRWLFNKYL 798


>gi|297733843|emb|CBI15090.3| unnamed protein product [Vitis vinifera]
          Length = 846

 Score =  879 bits (2272), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 429/611 (70%), Positives = 482/611 (78%), Gaps = 70/611 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPLPQSWLDQQL+LQKKIL R+YELGM PVLPAFSGNVPAAL+ +FPSAKIT
Sbjct: 235 MGNLHGWGGPLPQSWLDQQLLLQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKIT 294

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LGNWF+V  +PRWCCTYLLDATDPLFIEIG+AFI+QQLKEYGRT HIYNCDTFDENTPP
Sbjct: 295 RLGNWFTVGGNPRWCCTYLLDATDPLFIEIGKAFIQQQLKEYGRTGHIYNCDTFDENTPP 354

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           VD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYDPFWRPPQMKALL+SVP+G+LVVL
Sbjct: 355 VDDPEYISSLGAAIFRGMQSGDSNAIWLMQGWLFSYDPFWRPPQMKALLHSVPMGRLVVL 414

Query: 181 DLFAEVKPIWSTSKQFYGVPYIW--------------------------------CMLHN 208
           DLFAEVKPIW TS+QFYGVPYIW                                CMLHN
Sbjct: 415 DLFAEVKPIWITSEQFYGVPYIWKVTKSGRQQSLKFTNEKCCSFFRSHSPDSEVLCMLHN 474

Query: 209 FAGNIEMYGILDSIAFGPV--EARTSENTTMA---------------------------- 238
           FAGNIEMYGILD++A GP+   A+ +E+  +                             
Sbjct: 475 FAGNIEMYGILDAVASGPILLRAKYAESAVVGVGMSMEGIEQNPVVYDLMSEMAFQHSKV 534

Query: 239 ----WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
               WI  YS RRYG+SVP IQDAWN+LYHTVYNCTDG+ DKNRDVIVAFPD+DPS I  
Sbjct: 535 DVKVWIALYSTRRYGKSVPEIQDAWNILYHTVYNCTDGSYDKNRDVIVAFPDIDPSFIPT 594

Query: 295 TE----GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 350
            +    G Y  YGK VS+  VLK  T+S++ PHLWYSTSEV  AL LFIASG +L  SNT
Sbjct: 595 PKLSMPGGYHRYGKSVSRRTVLKEITNSFEQPHLWYSTSEVKDALGLFIASGGQLLGSNT 654

Query: 351 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 410
           YRYDL+DLTRQALAKYAN+LFL +IEAYQLND  G    S++FLELVEDMD LLACHDGF
Sbjct: 655 YRYDLVDLTRQALAKYANQLFLEVIEAYQLNDVRGAACHSQKFLELVEDMDTLLACHDGF 714

Query: 411 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 470
           LLGPWLESAKQLAQ+E+QE Q+EWNARTQITMWFDNT++EASLLRDYGNKYWSGLLRDYY
Sbjct: 715 LLGPWLESAKQLAQDEQQEIQFEWNARTQITMWFDNTEDEASLLRDYGNKYWSGLLRDYY 774

Query: 471 GPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLY 530
           GPRAAIYFKY++ESLE+G+ F LKDWRREWIKLTNDWQN RN YPV S+G+A+ TS+ LY
Sbjct: 775 GPRAAIYFKYLLESLETGNEFALKDWRREWIKLTNDWQNSRNAYPVRSSGNAIDTSRRLY 834

Query: 531 NKYLQGTGVFD 541
           NKYLQ   ++D
Sbjct: 835 NKYLQDPEIYD 845


>gi|449441031|ref|XP_004138287.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis sativus]
          Length = 808

 Score =  851 bits (2199), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 404/568 (71%), Positives = 468/568 (82%), Gaps = 35/568 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLPQSW DQQL+LQKK++ R++ELGM PVLPAFSGN+PAA + ++P+AKIT
Sbjct: 236 MGNLHKWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKIT 295

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LGNWF+V SDPRWCCTYLLDA DPLF+EIG+AFIEQQ KEYGRTSH+YNCDTFDENTPP
Sbjct: 296 RLGNWFTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPP 355

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           VD  EYISSLG+AI+ GMQ+GDS+AVWLMQGW+FSYDPFWRP QMKALL+SVPLG+LVVL
Sbjct: 356 VDDVEYISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVL 415

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
           DL+AEVKPIW +S+QFYG+PYIWCMLHNFAGN+EMYGILDSIA GP+EAR+S  +TM   
Sbjct: 416 DLYAEVKPIWISSEQFYGIPYIWCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGV 475

Query: 239 -------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          W+ QYSVRRYG  VP+IQDAW+VLYHTVY
Sbjct: 476 GMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTVY 535

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           NCTDGA DKNRDVIVAFPDVDPS I V       +G   S  +V + + +++D PHLWY 
Sbjct: 536 NCTDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDS--SVDRLQDATFDRPHLWYP 593

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
           TSEVI AL+LFIA G++LS+SNTYRYDL+DLTRQALAKY+NELF  I++AYQL+D   + 
Sbjct: 594 TSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMA 653

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
            LS+ FLELV D+D LLACH+GFLLGPWL+SAKQLA++EE+EKQYEWNARTQITMWFDNT
Sbjct: 654 SLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWFDNT 713

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
           +EEASLLRDYGNKYWSGLL DYY PRAAIY K++ ES E+G  F L +WRREWIKLTNDW
Sbjct: 714 EEEASLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLTNDW 773

Query: 508 QNGRNVYPVESNGDALITSQWLYNKYLQ 535
           Q+ R +YPVESNGDAL TS WLYNKYLQ
Sbjct: 774 QSSRKIYPVESNGDALDTSHWLYNKYLQ 801


>gi|326515664|dbj|BAK07078.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 829

 Score =  779 bits (2011), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/573 (63%), Positives = 449/573 (78%), Gaps = 39/573 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+HGWGGPLPQ+WLD QL LQKKIL R+Y  GM+PVLPAFSGN+PAAL+  FPSAK+T
Sbjct: 239 MANMHGWGGPLPQTWLDDQLTLQKKILSRMYAFGMSPVLPAFSGNIPAALKLKFPSAKVT 298

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LGNWF+V S+PRWCCTYLLDA+DPL++EIG+ FIE+Q++EYGRTSH+YNCDTFDENTPP
Sbjct: 299 HLGNWFTVDSNPRWCCTYLLDASDPLYVEIGKLFIEEQIREYGRTSHVYNCDTFDENTPP 358

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P YISSLGAA +  MQSGD+DA+WLMQGWLF+YDPFW PPQMKALL+SVP+G+++VL
Sbjct: 359 LSDPNYISSLGAATFRAMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGRMIVL 418

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DL+AEVKP+W  S QFYGVPYIWCMLHNFA + EMYG+LD++A GP++AR SEN+TM   
Sbjct: 419 DLYAEVKPVWINSDQFYGVPYIWCMLHNFAADFEMYGVLDAVASGPIDARLSENSTMVGV 478

Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          W+  Y  RRYG+SV  +QDAW +L+ T+Y
Sbjct: 479 GMSMEGIEQNPIVYDLMSEMVFHHRQVDLKVWVETYPTRRYGKSVVGLQDAWRILHQTLY 538

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAVLK-SETSSYDHP 322
           NCTDG  DKNRDVIVAFPDV+PS+I  T G Y    +NY   +S+  V+K +   +Y+ P
Sbjct: 539 NCTDGKNDKNRDVIVAFPDVEPSVIQ-TPGLYARTSKNYSTMLSENYVMKDAPNDAYEQP 597

Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
           H+WY T  VI ALELF+ SG+E+S S+T+RYDL+DLTRQALAKYAN++FL II+ Y+ N+
Sbjct: 598 HIWYDTIAVIHALELFLESGDEVSDSSTFRYDLVDLTRQALAKYANQIFLKIIQGYKSNN 657

Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
            + V  L  RFL LV+D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQITM
Sbjct: 658 VNQVTTLCERFLNLVKDLDMLLASHEGFLLGPWLESAKGLARSQEQEIQYEWNARTQITM 717

Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
           WFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIYFK++I SL+  + F L++WRREWI 
Sbjct: 718 WFDNTETKASLLRDYANKYWSGLLRDYYGPRAAIYFKHLISSLKKKEPFALEEWRREWIS 777

Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
           LTN+WQ+ R V+   + GDAL  S+ L+ KYL+
Sbjct: 778 LTNNWQSDRKVFATTATGDALNISRALFTKYLR 810


>gi|222629680|gb|EEE61812.1| hypothetical protein OsJ_16433 [Oryza sativa Japonica Group]
          Length = 1129

 Score =  778 bits (2010), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/573 (63%), Positives = 442/573 (77%), Gaps = 39/573 (6%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M+N+HGWGGPLPQSWLD QL LQKKIL R+Y  GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 537  MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 596

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYG TSH+Y+CDTFDENTPP
Sbjct: 597  HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGGTSHVYSCDTFDENTPP 656

Query: 121  VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            +  P YISSLGAA + GMQSGD DA+WLMQGWLFSYDPFW PPQMKALL+SVP+G+++VL
Sbjct: 657  LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFSYDPFWEPPQMKALLHSVPVGRMIVL 716

Query: 181  DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
            DL+AEVKPIW  S QFYGVPYIWCMLHNFA + EMYG+LD +A GP++AR S N+TM   
Sbjct: 717  DLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEMYGVLDMVASGPIDARLSANSTMVGV 776

Query: 238  ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                           W+  Y  RRYG+S+  +QDAW +LY T+Y
Sbjct: 777  GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVETYPTRRYGKSMVGLQDAWKILYQTLY 836

Query: 268  NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAV-LKSETSSYDHP 322
            NCTDG  DKNRDVIVAFPDV+P +I  T G Y    + Y   +SK  + + +    Y+HP
Sbjct: 837  NCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSSSKTYSTKLSKNYIAVDASNDEYEHP 895

Query: 323  HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
            HLWY T  VIRALELF+  G+E+S SNT+RYDL+DLTRQ LAKYAN++F+ IIE+Y+ N+
Sbjct: 896  HLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVDLTRQTLAKYANQVFVKIIESYKSNN 955

Query: 383  AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
             + V  L + F++LV D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQITM
Sbjct: 956  VNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLESAKGLARDKEQEMQYEWNARTQITM 1015

Query: 443  WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
            WFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIYFKY+I S+E  + F L++WRREWI 
Sbjct: 1016 WFDNTKTKASLLRDYANKYWSGLLRDYYGPRAAIYFKYLILSMEKKEPFALEEWRREWIS 1075

Query: 503  LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
            LTN+WQ+   V+P  + GDAL  S+ LY KYL 
Sbjct: 1076 LTNNWQSDWKVFPTTATGDALNISRTLYKKYLH 1108


>gi|326519955|dbj|BAK03902.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 829

 Score =  778 bits (2009), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/573 (63%), Positives = 448/573 (78%), Gaps = 39/573 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+HGWGGPLPQ+WLD QL LQKKIL R+Y  GM+PVLPAFSGN+PAAL+  FPSAK+T
Sbjct: 239 MANMHGWGGPLPQTWLDDQLTLQKKILSRMYAFGMSPVLPAFSGNIPAALKLKFPSAKVT 298

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LGNWF+V S+PRWCCTYLLDA+DPL++EIG+ FIE+Q++EYGRTSH+YNCDTFDENTPP
Sbjct: 299 HLGNWFTVDSNPRWCCTYLLDASDPLYVEIGKLFIEEQIREYGRTSHVYNCDTFDENTPP 358

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P YISSLGAA +  MQSGD+DA+WLMQGWLF+YDPFW PPQMKALL+SVP+G+++VL
Sbjct: 359 LSDPNYISSLGAATFRAMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGRMIVL 418

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DL+AEVKP W  S QFYGVPYIWCMLHNFA + EMYG+LD++A GP++AR SEN+TM   
Sbjct: 419 DLYAEVKPAWINSDQFYGVPYIWCMLHNFAADFEMYGVLDAVASGPIDARLSENSTMVGV 478

Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          W+  Y  RRYG+SV  +QDAW +L+ T+Y
Sbjct: 479 GMSMEGIEQNPIVYDLMSEMVFHHRQVDLKVWVETYPTRRYGKSVVGLQDAWRILHQTLY 538

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAVLK-SETSSYDHP 322
           NCTDG  DKNRDVIVAFPDV+PS+I  T G Y    +NY   +S+  V+K +   +Y+ P
Sbjct: 539 NCTDGKNDKNRDVIVAFPDVEPSVIQ-TPGLYARTSKNYSTMLSENYVMKDAPNDAYEQP 597

Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
           H+WY T  VI ALELF+ SG+E+S S+T+RYDL+DLTRQALAKYAN++FL II+ Y+ N+
Sbjct: 598 HIWYDTIAVIHALELFLESGDEVSDSSTFRYDLVDLTRQALAKYANQIFLKIIQGYKSNN 657

Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
            + V  L  RFL LV+D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQITM
Sbjct: 658 VNQVTTLCERFLNLVKDLDMLLASHEGFLLGPWLESAKGLARSQEQEIQYEWNARTQITM 717

Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
           WFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIYFK++I SL+  + F L++WRREWI 
Sbjct: 718 WFDNTETKASLLRDYANKYWSGLLRDYYGPRAAIYFKHLISSLKKKEPFALEEWRREWIS 777

Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
           LTN+WQ+ R V+   + GDAL  S+ L+ KYL+
Sbjct: 778 LTNNWQSDRKVFATTATGDALNISRALFTKYLR 810


>gi|218195716|gb|EEC78143.1| hypothetical protein OsI_17702 [Oryza sativa Indica Group]
          Length = 829

 Score =  775 bits (2001), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/573 (63%), Positives = 442/573 (77%), Gaps = 39/573 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+HGWGGPLPQSWLD QL LQKKIL R+Y  GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 237 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 296

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYG TSH+Y+CDTFDENTPP
Sbjct: 297 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGGTSHVYSCDTFDENTPP 356

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P YISSLGAA + GMQSGD DA+WLMQGWLFSYDPFW PPQMKALL+SVP+G+++VL
Sbjct: 357 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFSYDPFWEPPQMKALLHSVPVGRMIVL 416

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DL+AEVKPIW  S QFYGVPYIWCMLHNFA + EMYG+LD +A GP++AR S N+TM   
Sbjct: 417 DLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEMYGVLDMVASGPIDARLSANSTMIGV 476

Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          W+  Y  RRYG+S+  +QDAW +LY T+Y
Sbjct: 477 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVETYPTRRYGKSIVGLQDAWKILYQTLY 536

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAV-LKSETSSYDHP 322
           NCTDG  DKNRDVIVAFPDV+P +I  T G Y    + Y   +SK  + + +    Y+HP
Sbjct: 537 NCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSSSKTYSTKLSKNYIAVDASNDEYEHP 595

Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
           HLWY T  VIRALELF+  G+E+S SNT+RYDL+DLTRQ LAKYAN++F+ IIE+Y+ N+
Sbjct: 596 HLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVDLTRQTLAKYANQVFVKIIESYKSNN 655

Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
            + V  L + F++LV D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQITM
Sbjct: 656 VNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLESAKGLARDKEQEMQYEWNARTQITM 715

Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
           WFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIYFKY+I S+E  + F L++WRREWI 
Sbjct: 716 WFDNTKTKASLLRDYANKYWSGLLRDYYGPRAAIYFKYLILSMEKKEPFALEEWRREWIS 775

Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
           LTN+WQ+   V+P  + GDAL  S+ LY KYL 
Sbjct: 776 LTNNWQSDWKVFPTTATGDALNISRTLYKKYLH 808


>gi|414585092|tpg|DAA35663.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
          Length = 831

 Score =  773 bits (1996), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 366/578 (63%), Positives = 445/578 (76%), Gaps = 39/578 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+HGWGGPLPQ+WLD QLVLQKKIL R+Y  GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 240 MANMHGWGGPLPQTWLDDQLVLQKKILSRMYSFGMFPVLPAFSGNIPAALKSKFPSAKVT 299

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYGRTSHIYNCDTFDENTPP
Sbjct: 300 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFIEEQIREYGRTSHIYNCDTFDENTPP 359

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P YISSLGAA + GMQSGD+DA+WLMQGWLF+YDPFW PPQMKALL+SVP+GK++VL
Sbjct: 360 LSDPNYISSLGAATFRGMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGKMIVL 419

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DL+AEVKP+W  S Q YGVPYIWCMLHNFA + EMYG+LD++A GP++AR S+N+TM   
Sbjct: 420 DLYAEVKPVWINSDQLYGVPYIWCMLHNFAADFEMYGVLDALASGPIDARLSDNSTMVGV 479

Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          W+  Y  RRYG+ V  +QDAW +LY T+Y
Sbjct: 480 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVKTYPTRRYGKPVKGLQDAWWILYRTLY 539

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN----YGKPVSKEAVLKSETS-SYDHP 322
           NCTDG  DKNRDVIVAFPDV+P +I+ T G + N    Y    SK  + K  +S +Y+HP
Sbjct: 540 NCTDGKNDKNRDVIVAFPDVEPFVIA-TPGLHVNTRQMYSTVPSKNYIRKDVSSDAYEHP 598

Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
           HLWY T+ VI ALELF+  G+E+S SNT+RYDL+DLTRQ LAKYAN++FL IIE+Y+ N+
Sbjct: 599 HLWYDTNAVIHALELFLQHGDEVSDSNTFRYDLVDLTRQVLAKYANDVFLKIIESYKSNN 658

Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
            + V  L + FL LV D+D LL+ H+GFLLGPWLESAK LA+N EQE QYEWNARTQITM
Sbjct: 659 MNQVTILCQHFLSLVNDLDTLLSSHEGFLLGPWLESAKGLARNSEQEIQYEWNARTQITM 718

Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
           WFDNT+ +ASLLRDY NKYWSGLL+DYYGPRAAIYFK+++ S+E+   F LK+WRREWI 
Sbjct: 719 WFDNTETKASLLRDYANKYWSGLLQDYYGPRAAIYFKHLLLSMENNAPFALKEWRREWIS 778

Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 540
           LTN+WQ+ R V+   + GD L  SQ LY KYL    + 
Sbjct: 779 LTNNWQSDRKVFSTTATGDPLNISQSLYTKYLSNADLL 816


>gi|38345908|emb|CAE04506.2| OSJNBb0059K02.16 [Oryza sativa Japonica Group]
          Length = 829

 Score =  773 bits (1995), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/573 (63%), Positives = 442/573 (77%), Gaps = 39/573 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+HGWGGPLPQSWLD QL LQKKIL R+Y  GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 237 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 296

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYG TSH+Y+CDTFDENTPP
Sbjct: 297 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGGTSHVYSCDTFDENTPP 356

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P YISSLGAA + GMQSGD DA+WLMQGWLFSYDPFW PPQMKALL+SVP+G+++VL
Sbjct: 357 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFSYDPFWEPPQMKALLHSVPVGRMIVL 416

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DL+AEVKPIW  S QFYGVPYIWCMLHNFA + EMYG+LD +A GP++AR S N+TM   
Sbjct: 417 DLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEMYGVLDMVASGPIDARLSANSTMVGV 476

Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          W+  Y  RRYG+S+  +QDAW +LY T+Y
Sbjct: 477 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVETYPTRRYGKSMVGLQDAWKILYQTLY 536

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAV-LKSETSSYDHP 322
           NCTDG  DKNRDVIVAFPDV+P +I  T G Y    + Y   +SK  + + +    Y+HP
Sbjct: 537 NCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSSSKTYSTKLSKNYIAVDASNDEYEHP 595

Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
           HLWY T  VIRALELF+  G+E+S SNT+RYDL+DLTRQ LAKYAN++F+ IIE+Y+ N+
Sbjct: 596 HLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVDLTRQTLAKYANQVFVKIIESYKSNN 655

Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
            + V  L + F++LV D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQITM
Sbjct: 656 VNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLESAKGLARDKEQEMQYEWNARTQITM 715

Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
           WFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIYFKY+I S+E  + F L++WRREWI 
Sbjct: 716 WFDNTKTKASLLRDYANKYWSGLLRDYYGPRAAIYFKYLILSMEKKEPFALEEWRREWIS 775

Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
           LTN+WQ+   V+P  + GDAL  S+ LY KYL 
Sbjct: 776 LTNNWQSDWKVFPTTATGDALNISRTLYKKYLH 808


>gi|357166414|ref|XP_003580702.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Brachypodium
           distachyon]
          Length = 829

 Score =  771 bits (1990), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/573 (63%), Positives = 449/573 (78%), Gaps = 39/573 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+HGWGGPLPQ+WLD QL LQKKIL R+Y  GM+PVLPAFSG++PAAL++ FPSAK+T
Sbjct: 238 MANMHGWGGPLPQTWLDDQLTLQKKILSRMYAFGMSPVLPAFSGSIPAALKSKFPSAKVT 297

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYGRTSH+YNCDTFDENTPP
Sbjct: 298 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGRTSHVYNCDTFDENTPP 357

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P YISSLGAA + GMQSGD DA+WLMQGWLF+YDPFW PPQMKALL+SVP+G+++VL
Sbjct: 358 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGRMIVL 417

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DL+AEVKP+W  S QFYGVPYIWCMLHNFA + EMYG+LD++A GP++AR SEN+TM   
Sbjct: 418 DLYAEVKPVWINSDQFYGVPYIWCMLHNFAADFEMYGVLDAVASGPIDARLSENSTMVGV 477

Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          W+  Y  RRYG+S+  +QDAW +L+ T+Y
Sbjct: 478 GMSMEGIEQNPIVYDLMSEMVFHHRQVDLQVWVETYPTRRYGKSIVELQDAWRILHQTLY 537

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS----KEAVLKSETS-SYDHP 322
           NCTDG  DKNRDVIVAFPDV+P +I  T G + +  K  S    K  ++K E++ +Y+ P
Sbjct: 538 NCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLHTSASKMFSTMSAKSYLVKDESNDAYEQP 596

Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
           HLWY T+ VIRAL+LF+  G+E+S S+T+RYDL+DLTRQALAKYAN++F  II++Y+ N+
Sbjct: 597 HLWYDTNVVIRALQLFLQYGDEVSDSSTFRYDLVDLTRQALAKYANQIFAKIIQSYKSNN 656

Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
            + V  LS  FL+LV D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQITM
Sbjct: 657 MNQVTTLSECFLDLVNDLDMLLASHEGFLLGPWLESAKGLARDQEQEIQYEWNARTQITM 716

Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
           WFDNT+ +ASLLRDY NKYWSGLL DYYGPRAAIYFKY+I SLE  + F L++WRREWI 
Sbjct: 717 WFDNTETKASLLRDYANKYWSGLLGDYYGPRAAIYFKYLILSLEKKEPFALEEWRREWIS 776

Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
           LTN+WQ+ R V+   + GDAL  ++ LY KYL+
Sbjct: 777 LTNNWQSDRKVFATAATGDALNIARSLYMKYLR 809


>gi|218192858|gb|EEC75285.1| hypothetical protein OsI_11626 [Oryza sativa Indica Group]
          Length = 812

 Score =  692 bits (1785), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 327/570 (57%), Positives = 412/570 (72%), Gaps = 35/570 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLDQQL LQKKIL R+ ELGM PVLP+FSGNVP+  + +FPSA IT
Sbjct: 243 MGNLHGWGGPLSQNWLDQQLTLQKKILSRMIELGMVPVLPSFSGNVPSVFKKLFPSANIT 302

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W +V  DPRWCCTYLLD +D LFI++G+AFI QQ+KEYG  ++IYNCDTF+ENTPP
Sbjct: 303 KLGDWNTVDGDPRWCCTYLLDPSDALFIDVGQAFIRQQMKEYGDITNIYNCDTFNENTPP 362

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            + P YISSLG+AIY  M  G+ DAVWLMQGWLF  D  FW+ PQMKALL+SVP GK++V
Sbjct: 363 TNEPAYISSLGSAIYEAMSRGNKDAVWLMQGWLFYSDAAFWKEPQMKALLHSVPTGKMIV 422

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLFA+VKPIW  S QFYGVPYIWCMLHNF GNIEMYGILDSIA GP++ARTS N+TM  
Sbjct: 423 LDLFADVKPIWQMSSQFYGVPYIWCMLHNFGGNIEMYGILDSIASGPIDARTSHNSTMVG 482

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W+  YS RRYG+S   ++ AW +LYHT+
Sbjct: 483 VGMCMEGIEHNPVVYELMSEMAFRSQKVEVEDWLKIYSYRRYGQSNVEVEKAWGILYHTI 542

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS-SYDHPHLW 325
           YNCTDG  D N+D IV FPD+ P+  S    K +   +       + SE S S  HPHLW
Sbjct: 543 YNCTDGIADHNKDYIVQFPDISPNSFSSDVSKRKAISEVKKHRRFVLSEVSASLPHPHLW 602

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
           YST E I+ALELF+ +GN+LS S TYRYDL+DLTRQ+L+K ANE++L+ + AY+  D++G
Sbjct: 603 YSTKEAIKALELFLNAGNDLSKSLTYRYDLVDLTRQSLSKLANEVYLDAMNAYRKKDSNG 662

Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
           +   +++FLEL+ D+D LLA  D FLLGPWLE AK LA+ E + KQYEWNARTQ+TMW+D
Sbjct: 663 LNFYTKKFLELIVDIDTLLASDDNFLLGPWLEDAKSLARTENERKQYEWNARTQVTMWYD 722

Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
           NT+ E S L DY NK+WSGLL+ YY PRA+ YF  + + L+    F+L++WR++WI  +N
Sbjct: 723 NTKTEQSKLHDYANKFWSGLLKSYYLPRASKYFSRLTKGLQENQSFQLEEWRKDWIAYSN 782

Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
           +WQ+G+ +Y V++ GDAL  S  L+ KY +
Sbjct: 783 EWQSGKELYAVKATGDALAISSSLFKKYFR 812


>gi|413955691|gb|AFW88340.1| hypothetical protein ZEAMMB73_315381 [Zea mays]
          Length = 814

 Score =  689 bits (1777), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 326/570 (57%), Positives = 410/570 (71%), Gaps = 35/570 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLDQQL LQKKIL R+ ELGM PVLP+FSGNVPA    +FPSA IT
Sbjct: 244 MGNLHGWGGPLSQNWLDQQLALQKKILSRMIELGMVPVLPSFSGNVPAIFAKLFPSANIT 303

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W +V ++P+WCCTYLLD +D LFI++G+AFI QQ+KEYG  ++IYNCDTF+ENTPP
Sbjct: 304 RLGDWNTVDANPKWCCTYLLDPSDSLFIDVGQAFIRQQIKEYGDVTNIYNCDTFNENTPP 363

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            D P YISSLG+AIY  M  G+ +AVWLMQGWLF  D  FW+ PQMKALL+SVP+GK++V
Sbjct: 364 TDEPAYISSLGSAIYEAMSRGNKNAVWLMQGWLFYSDAAFWKEPQMKALLHSVPIGKMIV 423

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLFA+VKPIW  S QFYGVPYIWCMLHNF GNIEMYGILDSI+ GP++ARTS N+TM  
Sbjct: 424 LDLFADVKPIWKVSSQFYGVPYIWCMLHNFGGNIEMYGILDSISSGPIDARTSYNSTMIG 483

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W+  YS RRYG++   I+ AW  LYHT+
Sbjct: 484 VGMCMEGIEHNPVVYELMSEMAFHNKKVEVEDWLKTYSCRRYGQANADIEKAWRYLYHTI 543

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSS-YDHPHLW 325
           YNCTDG  D N+D IV FPD+ PS ++    K +      +      SE S     PHLW
Sbjct: 544 YNCTDGIADHNKDYIVEFPDISPSSVTYQVSKRRGMSITRNHRRFFLSEVSGILPQPHLW 603

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
           YST E ++ALELF+ +G+  S S TYRYDL+DLTRQ L+K ANE++L+ I  YQ  D+HG
Sbjct: 604 YSTKEAVKALELFLDAGSTFSESLTYRYDLVDLTRQCLSKLANEVYLDAISLYQKKDSHG 663

Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
           +   +R+FLE++ D+D LLA  D FLLGPWLESAK LA  E++ +QYEWNARTQ+TMW+D
Sbjct: 664 LNAHARKFLEIIVDIDTLLAADDNFLLGPWLESAKSLAITEKERQQYEWNARTQVTMWYD 723

Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
           NT+ E S L DY NK+WSGLL+ YY PRA+ YF Y+  SL+    F+L++WR++WI  +N
Sbjct: 724 NTETEQSKLHDYANKFWSGLLKSYYLPRASKYFAYLTRSLQENRSFQLEEWRKDWISYSN 783

Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
           +WQ+G+ VY V++ GDAL  ++ LY KYL+
Sbjct: 784 EWQSGKEVYAVKATGDALAIARSLYRKYLR 813


>gi|222624949|gb|EEE59081.1| hypothetical protein OsJ_10898 [Oryza sativa Japonica Group]
          Length = 812

 Score =  687 bits (1772), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 326/569 (57%), Positives = 409/569 (71%), Gaps = 35/569 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLDQQL LQKKIL R+ ELGM PVLP+FSGNVP+  + +FPSA IT
Sbjct: 243 MGNLHGWGGPLSQNWLDQQLTLQKKILSRMIELGMVPVLPSFSGNVPSVFKKLFPSANIT 302

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W +V  DPRWCCTYLLD +D LFI++G+AFI QQ+KEYG  ++IYNCDTF+ENTPP
Sbjct: 303 KLGDWNTVDGDPRWCCTYLLDPSDALFIDVGQAFIRQQMKEYGDITNIYNCDTFNENTPP 362

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            + P YISSLG+AIY  M  G+ DAVWLMQGWLF  D  FW+ PQMKALL+SVP GK++V
Sbjct: 363 TNEPAYISSLGSAIYEAMSRGNKDAVWLMQGWLFYSDAAFWKEPQMKALLHSVPTGKMIV 422

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLFA+VKPIW  S QFYGVPYIWCMLHNF GNIEMYGILDSIA GP++ARTS N+TM  
Sbjct: 423 LDLFADVKPIWQMSSQFYGVPYIWCMLHNFGGNIEMYGILDSIASGPIDARTSHNSTMVG 482

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W+  YS RRYG+S   ++ AW +LYHT+
Sbjct: 483 VGMCMEGIEHNPVVYELMSEMAFRSQKVEVEDWLKIYSYRRYGQSNVEVEKAWGILYHTI 542

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS-SYDHPHLW 325
           YNCTDG  D N D IV FPD+ P+  S    K +   +       + SE S S  HPHLW
Sbjct: 543 YNCTDGIADHNNDYIVEFPDISPNSFSSDVSKRKAISEVKKHRRFVLSEVSASLPHPHLW 602

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
           YST E I+ALELF+ +GN+LS S TYRYDL+DLTRQ+L+K ANE++L+ + AY+  D++G
Sbjct: 603 YSTKEAIKALELFLNAGNDLSKSLTYRYDLVDLTRQSLSKLANEVYLDAMNAYRKKDSNG 662

Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
           +   +++FLEL+ D+D LLA  D FLLGPWLE AK LA+ E + KQYEWNARTQ+TMW+D
Sbjct: 663 LNFYTKKFLELIVDIDTLLASDDNFLLGPWLEDAKSLARTENERKQYEWNARTQVTMWYD 722

Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
           NT+ E S L DY NK+WSGLL+ YY PRA+ YF  + + L+    F+L++W ++WI  +N
Sbjct: 723 NTKTEQSKLHDYANKFWSGLLKSYYLPRASKYFSRLTKGLQENQSFQLEEWTKDWIAYSN 782

Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKYL 534
           +WQ+G+ +Y V++ GDAL  S  L+ KY 
Sbjct: 783 EWQSGKELYAVKATGDALAISSSLFKKYF 811


>gi|357112065|ref|XP_003557830.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Brachypodium
           distachyon]
          Length = 809

 Score =  680 bits (1755), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 327/571 (57%), Positives = 416/571 (72%), Gaps = 39/571 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPL Q+WLD QL LQKKIL R+ ELGM PVLP+FSGNVP A + +FPSA IT
Sbjct: 240 MGNLHAWGGPLSQNWLDGQLALQKKILSRMTELGMVPVLPSFSGNVPVAFKKLFPSANIT 299

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG W +V  DPRWCCTY+LD +D LFI++G AFI QQ+KEYG  + IYNCDTF+ENTPP
Sbjct: 300 RLGEWNTVDGDPRWCCTYILDPSDALFIDVGHAFIRQQIKEYGDITSIYNCDTFNENTPP 359

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            + P YISSLG+AIY  M SG+ DAVWLMQGWLF  D  FW+ PQMKALL+SVP+GK++V
Sbjct: 360 TNEPAYISSLGSAIYEAMSSGNKDAVWLMQGWLFYSDAAFWKEPQMKALLHSVPIGKMIV 419

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLFA+VKP+W  S QFYGVPYIWCMLHNF GNIEMYGILDSI+ GP++ARTS  +TM  
Sbjct: 420 LDLFADVKPVWKMSSQFYGVPYIWCMLHNFGGNIEMYGILDSISSGPIDARTSYGSTMVG 479

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W+  YS RRYG+S   I+ AW VLYHT+
Sbjct: 480 VGMTMEGIEHNPVVFELMSEMSFRSQKVEVEDWLKSYSYRRYGQSNVKIEKAWGVLYHTI 539

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA---VLKSETSSYDHPH 323
           YNCTDG  D NRD IV FPD+ PS  S    K +  G P+ ++     L   +++  HPH
Sbjct: 540 YNCTDGIADHNRDYIVEFPDMSPSSFSSHFSKQR--GMPIVRKHPRFFLSEVSANLPHPH 597

Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
           LWYST+E ++ALELF+ +GN+LS S T+RYDL+DLTRQ+L+K AN+++L+ +++Y+  ++
Sbjct: 598 LWYSTNEAVKALELFLNAGNDLSKSLTFRYDLVDLTRQSLSKLANKVYLDAMDSYKNKNS 657

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            G+   +++FLEL+ D+D LLA  D FLLGPWLESAK LA +EE+ KQYEWNARTQ+TMW
Sbjct: 658 SGLNFHTKKFLELIVDIDILLASDDNFLLGPWLESAKSLAMSEEERKQYEWNARTQVTMW 717

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
           +DNT+ E S L DY NK+WSGLL++YY PRA+ YF  +  SL+    F+L++WRR+WI  
Sbjct: 718 YDNTKTEQSHLHDYANKFWSGLLKNYYLPRASKYFTGLSRSLQENRSFQLEEWRRDWISY 777

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
           +N+WQ+G  +YPV++ GDAL  S+ L+ KYL
Sbjct: 778 SNEWQSGEELYPVKAKGDALAISKSLFRKYL 808


>gi|297736304|emb|CBI24942.3| unnamed protein product [Vitis vinifera]
          Length = 868

 Score =  679 bits (1751), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 327/575 (56%), Positives = 408/575 (70%), Gaps = 42/575 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLD+QLVLQK+IL R+ ELGM PVLP+FSGNVP AL+ +FPSA IT
Sbjct: 294 MGNLHGWGGPLSQNWLDEQLVLQKQILCRMLELGMTPVLPSFSGNVPEALKKIFPSANIT 353

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG W +V ++ RWCCTYLLDA+DPLFI+IG+AFI QQ+KEYG  + IYNCDTF+EN+PP
Sbjct: 354 RLGEWNTVDNNTRWCCTYLLDASDPLFIQIGKAFIRQQIKEYGDVTDIYNCDTFNENSPP 413

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            + P YISSLGAAIY  M  GD D+VWLMQGWLF  D  FW+PPQMKALL+SVP GK+VV
Sbjct: 414 TNDPAYISSLGAAIYKAMSQGDKDSVWLMQGWLFYSDSGFWKPPQMKALLHSVPFGKMVV 473

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLFA+ KPIW TS QFYG PYIWCMLHNF GNIEMYGILD+++ GPV+AR S+N+TM  
Sbjct: 474 LDLFADAKPIWRTSSQFYGTPYIWCMLHNFGGNIEMYGILDAVSSGPVDARISKNSTMVG 533

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W+  YS RRYG++V  ++ AW +LY T+
Sbjct: 534 VGMCMEGIEQNPVAYELMSEMAFRSEKVQLVEWLKTYSYRRYGKAVHHVEAAWEILYRTI 593

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSI-----ISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
           YNCTDG  D N D +V FPD DPS+     IS  +   Q       +  +L  ETSS D 
Sbjct: 594 YNCTDGIADHNTDFMVNFPDWDPSLNPSSDISKEQHIIQKILTQTGRRKILFQETSS-DL 652

Query: 322 P--HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 379
           P  HLWYST EV+ AL LF+ +GNELS S+TYRYDL+DLTRQ L+K  N+++L+ + A++
Sbjct: 653 PQSHLWYSTHEVVNALRLFLDAGNELSKSSTYRYDLVDLTRQVLSKLGNQVYLDAVIAFR 712

Query: 380 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 439
             DA      S++F++LV+D+D LLA  D FLLG WLESAK+LA N  + +QYEWNARTQ
Sbjct: 713 QKDAKNFHLHSQKFVQLVKDIDTLLASDDNFLLGTWLESAKKLAVNPREMEQYEWNARTQ 772

Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 499
           +TMWF  T+   S L DY NK+WSGLL +YY PRA++YF Y+ ++L     F+L++WRRE
Sbjct: 773 LTMWFYVTKTNQSKLHDYANKFWSGLLENYYLPRASMYFSYLAKALTENKNFKLEEWRRE 832

Query: 500 WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
           WI  +N WQ G+ +YPV + GD L  S+ LY KY 
Sbjct: 833 WISYSNKWQAGKELYPVRAKGDTLAISRALYEKYF 867


>gi|356534602|ref|XP_003535842.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Glycine max]
          Length = 807

 Score =  679 bits (1751), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 325/573 (56%), Positives = 400/573 (69%), Gaps = 39/573 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLDQQLVLQK+I+ R+ ELGM PVLP+FSGNVPAAL  +FPSAKIT
Sbjct: 230 MGNLHGWGGPLSQNWLDQQLVLQKQIISRMLELGMTPVLPSFSGNVPAALTKIFPSAKIT 289

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W +V  DPRWCCTYLLD +DPLF+EIG AFI +Q+KEYG  + IYNCDTF+EN+PP
Sbjct: 290 RLGDWNTVDGDPRWCCTYLLDPSDPLFVEIGEAFIRKQIKEYGDVTDIYNCDTFNENSPP 349

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
            + PEYIS+LGAA+Y G+  GD DAVWLMQGWLF  D  FW+PPQMKALL+SVP GK++V
Sbjct: 350 TNDPEYISNLGAAVYKGISKGDKDAVWLMQGWLFYSDSSFWKPPQMKALLHSVPFGKMIV 409

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLFA+VKPIW  S QFYG PYIWCMLHNF GNIEMYG LDSI+ GPV+AR S N+TM  
Sbjct: 410 LDLFADVKPIWKNSFQFYGTPYIWCMLHNFGGNIEMYGTLDSISSGPVDARVSANSTMVG 469

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           WI  Y  RRYG+ +  ++ AW +LYHT+
Sbjct: 470 VGMCMEGIEQNPIVYELMSEMAFRDKKVKVSEWIKSYCHRRYGKVIHQVESAWEILYHTI 529

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG----KPVSKEAVLKSETSSYDHP 322
           YNCTDG  D N D IV FPD +PS  SVT G   N       P ++  + +   S     
Sbjct: 530 YNCTDGIADHNHDFIVMFPDWNPSTNSVT-GTSNNQKIYLLPPGNRRYLFQETLSDMPQA 588

Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
           HLWY + +VI+AL+LF+A G  L+ S TYRYDL+DLTRQ L+K AN+++   + +YQ  +
Sbjct: 589 HLWYPSDDVIKALQLFLAGGKNLAGSLTYRYDLVDLTRQVLSKLANQVYHKAVTSYQKKN 648

Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
              +   S +FL+L++D+D LLA  D FLLG WLESAK+LA N  + KQYEWNARTQ+TM
Sbjct: 649 IEALQFHSNKFLQLIKDIDVLLASDDNFLLGTWLESAKKLAVNPSEIKQYEWNARTQVTM 708

Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
           WFD  +   S L DY NK+WSGLL  YY PRA+ YF ++ ESL   D F+L +WR++WI 
Sbjct: 709 WFDTNETTQSKLHDYANKFWSGLLESYYLPRASTYFSHLTESLRQNDKFKLIEWRKQWIS 768

Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
            +N WQ G  +YPV++ GDAL  SQ LY KY Q
Sbjct: 769 QSNKWQEGNELYPVKAKGDALTISQALYEKYFQ 801


>gi|225450036|ref|XP_002273084.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Vitis vinifera]
          Length = 803

 Score =  678 bits (1749), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 327/575 (56%), Positives = 408/575 (70%), Gaps = 42/575 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLD+QLVLQK+IL R+ ELGM PVLP+FSGNVP AL+ +FPSA IT
Sbjct: 229 MGNLHGWGGPLSQNWLDEQLVLQKQILCRMLELGMTPVLPSFSGNVPEALKKIFPSANIT 288

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG W +V ++ RWCCTYLLDA+DPLFI+IG+AFI QQ+KEYG  + IYNCDTF+EN+PP
Sbjct: 289 RLGEWNTVDNNTRWCCTYLLDASDPLFIQIGKAFIRQQIKEYGDVTDIYNCDTFNENSPP 348

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            + P YISSLGAAIY  M  GD D+VWLMQGWLF  D  FW+PPQMKALL+SVP GK+VV
Sbjct: 349 TNDPAYISSLGAAIYKAMSQGDKDSVWLMQGWLFYSDSGFWKPPQMKALLHSVPFGKMVV 408

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLFA+ KPIW TS QFYG PYIWCMLHNF GNIEMYGILD+++ GPV+AR S+N+TM  
Sbjct: 409 LDLFADAKPIWRTSSQFYGTPYIWCMLHNFGGNIEMYGILDAVSSGPVDARISKNSTMVG 468

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W+  YS RRYG++V  ++ AW +LY T+
Sbjct: 469 VGMCMEGIEQNPVAYELMSEMAFRSEKVQLVEWLKTYSYRRYGKAVHHVEAAWEILYRTI 528

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSI-----ISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
           YNCTDG  D N D +V FPD DPS+     IS  +   Q       +  +L  ETSS D 
Sbjct: 529 YNCTDGIADHNTDFMVNFPDWDPSLNPSSDISKEQHIIQKILTQTGRRKILFQETSS-DL 587

Query: 322 P--HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 379
           P  HLWYST EV+ AL LF+ +GNELS S+TYRYDL+DLTRQ L+K  N+++L+ + A++
Sbjct: 588 PQSHLWYSTHEVVNALRLFLDAGNELSKSSTYRYDLVDLTRQVLSKLGNQVYLDAVIAFR 647

Query: 380 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 439
             DA      S++F++LV+D+D LLA  D FLLG WLESAK+LA N  + +QYEWNARTQ
Sbjct: 648 QKDAKNFHLHSQKFVQLVKDIDTLLASDDNFLLGTWLESAKKLAVNPREMEQYEWNARTQ 707

Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 499
           +TMWF  T+   S L DY NK+WSGLL +YY PRA++YF Y+ ++L     F+L++WRRE
Sbjct: 708 LTMWFYVTKTNQSKLHDYANKFWSGLLENYYLPRASMYFSYLAKALTENKNFKLEEWRRE 767

Query: 500 WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
           WI  +N WQ G+ +YPV + GD L  S+ LY KY 
Sbjct: 768 WISYSNKWQAGKELYPVRAKGDTLAISRALYEKYF 802


>gi|224106113|ref|XP_002314048.1| predicted protein [Populus trichocarpa]
 gi|222850456|gb|EEE88003.1| predicted protein [Populus trichocarpa]
          Length = 806

 Score =  667 bits (1720), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/573 (54%), Positives = 404/573 (70%), Gaps = 39/573 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLDQQL LQK+IL R+ ELGM PVLP+FSGNVPAAL+ +FPSA IT
Sbjct: 233 MGNLHGWGGPLSQNWLDQQLCLQKQILSRMLELGMTPVLPSFSGNVPAALKKIFPSANIT 292

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W +V  +PRWCCTYLL+ +DPLF+EIG AFI QQ+KEYG  + IYNCDTF+EN+PP
Sbjct: 293 RLGDWNTVDKNPRWCCTYLLNPSDPLFVEIGEAFIRQQVKEYGDVTDIYNCDTFNENSPP 352

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              P YISSLGAA+Y  M  GD DAVWLMQGWLF  D  FW+PPQM+ALL+SVP GK++V
Sbjct: 353 TSDPAYISSLGAAVYKAMSRGDKDAVWLMQGWLFYSDSAFWKPPQMQALLHSVPFGKMIV 412

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLFAE KPIW  S QFYG PY+WC+LHNF GNIEMYGILD+I+ GPV+AR  EN+TM  
Sbjct: 413 LDLFAEAKPIWKNSSQFYGTPYVWCLLHNFGGNIEMYGILDAISSGPVDARIIENSTMVG 472

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W+  YS RRYG++V  +  AW++LYHT+
Sbjct: 473 VGMCMEGIEHNPVVYELMSEMAFRSGKPQVLEWLKTYSRRRYGKAVRQVVAAWDILYHTI 532

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV-----SKEAVLKSETSSYDH 321
           YNCTDG  D N D IV FPD DPS+ S +    Q+  + +     ++  + +  +S +  
Sbjct: 533 YNCTDGIADHNTDFIVKFPDWDPSLHSGSNISEQDNMRILLTSSGTRRFLFQETSSDFPE 592

Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
            HLWYST EVI+AL LF+ +GN+L+ S TYRYDL+DLTRQ L+K AN+++ + + A++  
Sbjct: 593 AHLWYSTQEVIQALWLFLDAGNDLAGSPTYRYDLVDLTRQVLSKLANQVYRDAMIAFRRK 652

Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
           DA  +    ++FL++++D+D LLA  D FLLG WLESAK+LA +    K YEWNARTQ+T
Sbjct: 653 DARALNLHGQKFLQIIKDIDVLLASDDNFLLGTWLESAKKLAVDPNDMKLYEWNARTQVT 712

Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 501
           MW+D T+   S L DY NK+WSGLL DYY PRA+ YF ++++SLE    F+L +WR+EWI
Sbjct: 713 MWYDTTKTNQSQLHDYANKFWSGLLEDYYLPRASTYFGHLMKSLEENKNFKLTEWRKEWI 772

Query: 502 KLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
             +N WQ    +YPV++ GDAL  ++ LY KY 
Sbjct: 773 AFSNKWQADTKIYPVKAKGDALAIAKALYRKYF 805


>gi|357458267|ref|XP_003599414.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
 gi|355488462|gb|AES69665.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
          Length = 832

 Score =  657 bits (1696), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 320/599 (53%), Positives = 400/599 (66%), Gaps = 66/599 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLDQQLVLQK+I+ R+ ELGM PVLP+FSGNVPAAL  +FPSAKIT
Sbjct: 234 MGNLHGWGGPLSQNWLDQQLVLQKQIISRMLELGMTPVLPSFSGNVPAALTKIFPSAKIT 293

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK-------------------- 100
           +LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +Q+K                    
Sbjct: 294 RLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRKQIKATETIHQESEDLGSLIIMDR 353

Query: 101 ------EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 154
                 EYG  + IYNCDTF+EN+PP   P YIS+LGAA+Y G+  GD DAVWLMQGWLF
Sbjct: 354 AVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLGAAVYQGISKGDKDAVWLMQGWLF 413

Query: 155 SYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 213
             D  FW+PPQMKALL SVP GK++VLDLFA+VKPIW TS QFYG PYIWCMLHNF GNI
Sbjct: 414 YSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIWKTSFQFYGTPYIWCMLHNFGGNI 473

Query: 214 EMYGILDSIAFGPVEARTSENTTMA---------------------------------WI 240
           EMYG+LD+IA GPV+AR SEN+TM                                  W+
Sbjct: 474 EMYGVLDAIASGPVDARVSENSTMVGVGMCMEGIEHNPIVYELMSEMAFRDEKVKINEWL 533

Query: 241 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 300
             YS RRYG+++  +  AW +LYHT+YN TDG  D N D IV  PD DPS  +V  G   
Sbjct: 534 KSYSHRRYGKAIHEVDAAWEILYHTIYNSTDGIADHNHDYIVMLPDWDPS-AAVKSGMSN 592

Query: 301 NYGK-----PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 355
           +  K     P ++  + +   +     HLWY   +VI+AL+LF+A G  L  S TYRYDL
Sbjct: 593 HQKKIYFLPPGNRRYLFQQTPAGMPQAHLWYPPEDVIKALQLFLAGGKNLKGSLTYRYDL 652

Query: 356 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 415
           +DLTRQ L+K+AN++++  I ++Q  +   +   S  FLEL++D+D LLA  D FLLG W
Sbjct: 653 VDLTRQVLSKFANQVYIKAITSFQKKNIDALQLNSHMFLELIKDIDLLLASDDNFLLGTW 712

Query: 416 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 475
           L+SAK+LA N  + KQYEWNARTQ+TMWFD  +   S L DY NK+WSG+L +YY PRA+
Sbjct: 713 LQSAKKLAVNPSELKQYEWNARTQVTMWFDTNETTQSKLHDYANKFWSGILENYYLPRAS 772

Query: 476 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
            YF ++ ESL+  + F L +WR+EWI ++N WQ G  +YPV++ GDAL  SQ LY KY 
Sbjct: 773 TYFSHLSESLKQNEKFNLTEWRKEWIPMSNKWQEGSELYPVKAKGDALTISQALYKKYF 831


>gi|357458271|ref|XP_003599416.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
 gi|355488464|gb|AES69667.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
          Length = 807

 Score =  651 bits (1680), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 318/594 (53%), Positives = 392/594 (65%), Gaps = 81/594 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLDQQLVLQK+I+ R+ ELGM PVLP+FSGNVPAAL  +FPSAKIT
Sbjct: 234 MGNLHGWGGPLSQNWLDQQLVLQKQIISRMLELGMTPVLPSFSGNVPAALTKIFPSAKIT 293

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK-------------------- 100
           +LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +Q+K                    
Sbjct: 294 RLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRKQIKATETIHQESEDLGSLIIMDR 353

Query: 101 ------EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 154
                 EYG  + IYNCDTF+EN+PP   P YIS+LGAA+Y G+  GD DAVWLMQGWLF
Sbjct: 354 AVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLGAAVYQGISKGDKDAVWLMQGWLF 413

Query: 155 SYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 213
             D  FW+PPQMKALL SVP GK++VLDLFA+VKPIW TS QFYG PYIWCMLHNF GNI
Sbjct: 414 YSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIWKTSFQFYGTPYIWCMLHNFGGNI 473

Query: 214 EMYGILDSIAFGPVEARTSENTTMA---------------------------------WI 240
           EMYG+LD+IA GPV+AR SEN+TM                                  W+
Sbjct: 474 EMYGVLDAIASGPVDARVSENSTMVGVGMCMEGIEHNPIVYELMSEMAFRDEKVKINEWL 533

Query: 241 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 300
             YS RRYG+++  +  AW +LYHT+YN TDG  D N D IV  PD DPS    + G  Q
Sbjct: 534 KSYSHRRYGKAIHEVDAAWEILYHTIYNSTDGIADHNHDYIVMLPDWDPSAAVKSAGMPQ 593

Query: 301 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 360
                                 HLWY   +VI+AL+LF+A G  L  S TYRYDL+DLTR
Sbjct: 594 ---------------------AHLWYPPEDVIKALQLFLAGGKNLKGSLTYRYDLVDLTR 632

Query: 361 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 420
           Q L+K+AN++++  I ++Q  +   +   S  FLEL++D+D LLA  D FLLG WL+SAK
Sbjct: 633 QVLSKFANQVYIKAITSFQKKNIDALQLNSHMFLELIKDIDLLLASDDNFLLGTWLQSAK 692

Query: 421 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 480
           +LA N  + KQYEWNARTQ+TMWFD  +   S L DY NK+WSG+L +YY PRA+ YF +
Sbjct: 693 KLAVNPSELKQYEWNARTQVTMWFDTNETTQSKLHDYANKFWSGILENYYLPRASTYFSH 752

Query: 481 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
           + ESL+  + F L +WR+EWI ++N WQ G  +YPV++ GDAL  SQ LY KY 
Sbjct: 753 LSESLKQNEKFNLTEWRKEWIPMSNKWQEGSELYPVKAKGDALTISQALYKKYF 806


>gi|15240689|ref|NP_196873.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
 gi|9758035|dbj|BAB08696.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
 gi|19423948|gb|AAL87291.1| putative alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
 gi|21436231|gb|AAM51254.1| putative alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
 gi|332004545|gb|AED91928.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
          Length = 806

 Score =  647 bits (1669), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/576 (54%), Positives = 400/576 (69%), Gaps = 43/576 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPL ++WLD QL+LQK+IL R+ + GM PVLP+FSGNVP+AL+ ++P A IT
Sbjct: 231 MGNLHAWGGPLSKNWLDDQLLLQKQILSRMLKFGMTPVLPSFSGNVPSALRKIYPEANIT 290

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L NW +V  D RWCCTYLL+ +DPLFIEIG AFI+QQ +EYG  ++IYNCDTF+ENTPP
Sbjct: 291 RLDNWNTVDGDSRWCCTYLLNPSDPLFIEIGEAFIKQQTEEYGEITNIYNCDTFNENTPP 350

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              PEYISSLGAA+Y  M  G+ +AVWLMQGWLFS D  FW+PPQ+KALL+SVP GK++V
Sbjct: 351 TSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSSDSKFWKPPQLKALLHSVPFGKMIV 410

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDL+AEVKPIW+ S QFYG PYIWCMLHNF GNIEMYG LDSI+ GPV+AR S+N+TM  
Sbjct: 411 LDLYAEVKPIWNKSAQFYGTPYIWCMLHNFGGNIEMYGALDSISSGPVDARVSKNSTMVG 470

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W+  Y+ RRY +    I+ AW +LYHTV
Sbjct: 471 VGMCMEGIEQNPVVYELTSEMAFRDEKVDVQKWLKSYARRRYMKENHQIEAAWEILYHTV 530

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS-------KEAVLKSETSSY 319
           YNCTDG  D N D IV  PD DPS  SV +   Q     +S       +  + + +T+  
Sbjct: 531 YNCTDGIADHNTDFIVKLPDWDPSS-SVQDDLKQKDSYMISTGPYETKRRVLFQDKTADL 589

Query: 320 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 379
              HLWYST EVI+AL+LF+ +G++LS S TYRYD++DLTRQ L+K AN+++   + A+ 
Sbjct: 590 PKAHLWYSTKEVIQALKLFLEAGDDLSRSLTYRYDMVDLTRQVLSKLANQVYTEAVTAFV 649

Query: 380 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 439
             D   + QLS +FLEL++DMD LLA  D  LLG WLESAK+LA+N ++ KQYEWNARTQ
Sbjct: 650 KKDIGSLGQLSEKFLELIKDMDVLLASDDNCLLGTWLESAKKLAKNGDERKQYEWNARTQ 709

Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 499
           +TMW+D+     S L DY NK+WSGLL DYY PRA +YF  M++SL     F+++ WRRE
Sbjct: 710 VTMWYDSNDVNQSKLHDYANKFWSGLLEDYYLPRARLYFNEMLKSLRDKKIFKVEKWRRE 769

Query: 500 WIKLTNDWQNGRN-VYPVESNGDALITSQWLYNKYL 534
           WI +++ WQ   + VYPV++ GDAL  S+ L +KY 
Sbjct: 770 WIMMSHKWQQSSSEVYPVKAKGDALAISRHLLSKYF 805


>gi|297807393|ref|XP_002871580.1| alpha-N-acetylglucosaminidase family [Arabidopsis lyrata subsp.
           lyrata]
 gi|297317417|gb|EFH47839.1| alpha-N-acetylglucosaminidase family [Arabidopsis lyrata subsp.
           lyrata]
          Length = 806

 Score =  646 bits (1667), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 318/576 (55%), Positives = 403/576 (69%), Gaps = 43/576 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPL ++WL+ QL+LQK+IL ++ +LGM PVLP+FSGNVP+AL+ ++P A IT
Sbjct: 231 MGNLHTWGGPLSKNWLNDQLILQKQILSQMLKLGMTPVLPSFSGNVPSALRKIYPGANIT 290

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L NW +V  D RWCCTYLL+ +DPLFI+IG AFI+QQ +EYG  ++IYNCDTF+ENTPP
Sbjct: 291 RLDNWNTVDGDSRWCCTYLLNPSDPLFIDIGEAFIKQQPEEYGEITNIYNCDTFNENTPP 350

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              PEYISSLGAA+Y  M  G+ +AVWLMQGWLFS D  FW+PPQMK LL+SVP GK++V
Sbjct: 351 TSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSSDSKFWKPPQMKVLLHSVPFGKMIV 410

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDL+AEVKPIW+TS QFYG PYIWCMLHNF GNIEMYG LDSI+ GPV+AR S+N+TM  
Sbjct: 411 LDLYAEVKPIWNTSAQFYGTPYIWCMLHNFGGNIEMYGALDSISSGPVDARVSKNSTMVG 470

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W+  Y+ RRY +    I+ AW +LYHTV
Sbjct: 471 VGMCMEGIEQNPVVYELISEMAFRDEKVDVQKWLKSYARRRYMKENHQIEAAWEILYHTV 530

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN-----YGKPVSKEAVLKSETSSYDH 321
           YNCTDG  D N D IV  PD DPS     E K+ +      G   +K  VL  + SS D 
Sbjct: 531 YNCTDGIADHNTDFIVKLPDWDPSSSVQDESKHTDSYMISTGPYETKRRVLFQDKSS-DL 589

Query: 322 P--HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 379
           P  HLWYST EVI+AL+LF+ +G+ELS S TYRYD++DLTRQ L+K AN++++  + A+ 
Sbjct: 590 PKAHLWYSTKEVIQALKLFLEAGDELSRSLTYRYDMVDLTRQVLSKLANQVYIEAVTAFV 649

Query: 380 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 439
             D   + QLS +FLEL++D+D LLA  D FLLG WLESAK+LA+N ++ KQYEWNARTQ
Sbjct: 650 KKDIGSLGQLSEKFLELIKDIDVLLASDDNFLLGTWLESAKKLARNGDERKQYEWNARTQ 709

Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 499
           +TMW+D+     S L DY NK WSGLL DYY PRA +YF  M++SL     F+++ W+RE
Sbjct: 710 VTMWYDSKDVNQSKLHDYANKLWSGLLEDYYLPRARLYFNEMLKSLRDKKKFKVEKWQRE 769

Query: 500 WIKLTNDWQNGRN-VYPVESNGDALITSQWLYNKYL 534
           WI +++ WQ   + VYPV++ GDAL  S+ L  KY 
Sbjct: 770 WIMMSHKWQQSSSEVYPVKAKGDALAISKHLLLKYF 805


>gi|242035709|ref|XP_002465249.1| hypothetical protein SORBIDRAFT_01g034960 [Sorghum bicolor]
 gi|241919103|gb|EER92247.1| hypothetical protein SORBIDRAFT_01g034960 [Sorghum bicolor]
          Length = 777

 Score =  641 bits (1653), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/541 (57%), Positives = 397/541 (73%), Gaps = 16/541 (2%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLDQQL LQKK+L R+ ELGM PVLP+FSGNVPA    +FPSA IT
Sbjct: 244 MGNLHGWGGPLSQNWLDQQLALQKKVLSRMIELGMVPVLPSFSGNVPAVFAKLFPSANIT 303

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG+W +V ++P+WCCTYLLD +D LFI++G+AFI QQ+KEYG  ++IYNCDTF+ENTPP
Sbjct: 304 LLGDWNTVDANPKWCCTYLLDPSDSLFIDVGQAFIRQQIKEYGDVTNIYNCDTFNENTPP 363

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            D P YISSLG+AIY  M  G+ +AVWLMQGWLF  D  FW+ PQMKALL+SVP+GK++V
Sbjct: 364 TDEPAYISSLGSAIYEAMSRGNKNAVWLMQGWLFYSDAAFWKEPQMKALLHSVPIGKMIV 423

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAW 239
           LDLFA+VKPIW  S QFYGVPYIWCMLHNF GNIEMYG+LDSI+ GP++ARTS N+TM  
Sbjct: 424 LDLFADVKPIWKMSSQFYGVPYIWCMLHNFGGNIEMYGVLDSISSGPIDARTSYNSTMIG 483

Query: 240 INQYSVRRYGRSVPAIQDAWNVLYHTV-----YNCTDGATDKNRDVIVAFPDVDPSIISV 294
           +        G  +  I+    V+Y  +     +N      D N+D IV FPD+ PS IS 
Sbjct: 484 V--------GMCMEGIEHN-PVVYELMSEMAFHNKKVEVEDHNKDYIVEFPDISPSSISS 534

Query: 295 TEGKYQNYGKPVSKEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRY 353
              K +      +      SE S S  HPHLWYST E I+ALELF+ +G+  S S TYRY
Sbjct: 535 QLSKRRGMSIMRNHRRFFLSEVSGSLPHPHLWYSTKEAIKALELFLDAGSTFSKSLTYRY 594

Query: 354 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 413
           DL+DLTRQ L+K ANE++L+ + +YQ  D++G+   +R+FLE++ D+D LLA  D FLLG
Sbjct: 595 DLVDLTRQCLSKLANEVYLDAMSSYQKKDSNGLNSHTRKFLEIIMDIDTLLAADDNFLLG 654

Query: 414 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
           PWLESAK LA  E++ +QYEWNARTQ+TMW+DNT+ E S L DY NK+WSGLL+ YY PR
Sbjct: 655 PWLESAKSLAITEKERQQYEWNARTQVTMWYDNTETEQSKLHDYANKFWSGLLKSYYLPR 714

Query: 474 AAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
           A+ YF Y+  SL+    F+L++WR++WI  +N+WQ+G+ VY V++ GDAL  ++ LY KY
Sbjct: 715 ASKYFAYLTRSLQENQSFQLEEWRKDWISYSNEWQSGKEVYAVKATGDALAIARSLYRKY 774

Query: 534 L 534
           L
Sbjct: 775 L 775


>gi|449436325|ref|XP_004135943.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis sativus]
          Length = 774

 Score =  632 bits (1630), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 311/568 (54%), Positives = 382/568 (67%), Gaps = 57/568 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL ++WLDQQL LQK+IL R+ ELGM PVLP+FSGNVPA L  +FPSA IT
Sbjct: 229 MGNLHGWGGPLSKNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAGLVEIFPSANIT 288

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LGNW S+ +DP  CCTYLL+ +DPLF++IG AFI QQ+KEYG  ++IY+CDTF+ENTPP
Sbjct: 289 KLGNWNSIDADPSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTNIYSCDTFNENTPP 348

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            +   YISSLGA++Y  M   D DAVWLMQGWLF  D  FW+P QMKALL+SVP GK++V
Sbjct: 349 TNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSDFWKPDQMKALLHSVPFGKMIV 408

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLFA+VKPIW +S QFYG PY+WCMLHNF GNIEMYGILD+I+ GPV+A  SEN+TM  
Sbjct: 409 LDLFADVKPIWKSSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVG 468

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W+  YS  RYG++   +  AWN+LYHT+
Sbjct: 469 VGMCMEGIEHNPVVYELMSEMAFRSKKVQVQEWLKTYSRCRYGKADHYVDAAWNILYHTI 528

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YNCTDG  + N D IV  PD DPS  S  + K     KP                PHLWY
Sbjct: 529 YNCTDGIANHNTDFIVKLPDWDPS--STFDLK-----KP----------------PHLWY 565

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
           ST EVI AL+L +   + L  S TYRYDL+DLTRQ L K ANE +L  + A++  +    
Sbjct: 566 STQEVINALQLLVNVDDNLVHSATYRYDLVDLTRQVLGKLANEEYLKAVTAFRRQNVKAQ 625

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
              S+RF++L+ D+D LLA +  FLLG WLESAK+LA N  + KQYEWNARTQ+TMW+DN
Sbjct: 626 NLHSKRFIQLIRDIDKLLASNSNFLLGTWLESAKKLATNPAEMKQYEWNARTQVTMWYDN 685

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
           T+   S L DY NKYWSGLL  YY PRA  YF Y+ +SL   + F L+DWRREWI  +N 
Sbjct: 686 TKVNQSKLHDYANKYWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWILFSNK 745

Query: 507 WQNGRNVYPVESNGDALITSQWLYNKYL 534
           WQ    +YPV++ G+A+  S+ LY KY 
Sbjct: 746 WQAASELYPVKAEGNAVAISKALYEKYF 773


>gi|449489156|ref|XP_004158231.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
           [Cucumis sativus]
          Length = 567

 Score =  623 bits (1606), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 307/565 (54%), Positives = 378/565 (66%), Gaps = 57/565 (10%)

Query: 4   LHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLG 63
           L  WGGPL ++WLDQQL LQK+IL R+ ELGM PVLP+FSGNVPA L  +FPSA IT+LG
Sbjct: 25  LKEWGGPLSKNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAGLVEIFPSANITKLG 84

Query: 64  NWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDS 123
           NW S+ +DP  CCTYLL+ +DPLF++IG AFI QQ+KEYG  ++IY+CDTF+ENTPP + 
Sbjct: 85  NWNSIDADPSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTNIYSCDTFNENTPPTND 144

Query: 124 PEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDL 182
             YISSLGA++Y  M   D DAVWLMQGWLF  D  FW+P QMKALL+SVP GK++VLDL
Sbjct: 145 TSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSDFWKPDQMKALLHSVPFGKMIVLDL 204

Query: 183 FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA---- 238
           FA+VKPIW +S QFYG PY+WCMLHNF GNIEMYGILD+I+ GPV+A  SEN+TM     
Sbjct: 205 FADVKPIWKSSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGM 264

Query: 239 -----------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVYNC 269
                                        W+  YS  RYG++   +  AWN+LYHT+YNC
Sbjct: 265 CMEGIEHNPVVYELMSEMAFRXQKVQVQEWLKTYSRCRYGKADHYVDAAWNILYHTIYNC 324

Query: 270 TDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTS 329
           TDG  + N D IV  PD DPS  S  + K     KP                PHLWYST 
Sbjct: 325 TDGIANHNTDFIVKLPDWDPS--STFDLK-----KP----------------PHLWYSTQ 361

Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
           EVI AL+L +   + L  S TYRYDL+DLTRQ L K ANE +L  + A++  +       
Sbjct: 362 EVINALQLLVNVDDNLVHSATYRYDLVDLTRQVLGKLANEEYLKAVTAFRRQNVKAQNLH 421

Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
           S+RF++L+ D+D LLA +  FLLG WLESAK+LA N  + KQYEWNARTQ+TMW+DNT+ 
Sbjct: 422 SKRFIQLIRDIDKLLASNSNFLLGTWLESAKKLATNPAEMKQYEWNARTQVTMWYDNTKV 481

Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
             S L DY NKYWSGLL  YY PRA  YF Y+ +SL   + F L+DWRREWI  +N WQ 
Sbjct: 482 NQSKLHDYANKYWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWILFSNKWQA 541

Query: 510 GRNVYPVESNGDALITSQWLYNKYL 534
              +YPV++ G+A+  S+ LY KY 
Sbjct: 542 ASELYPVKAEGNAVAISKALYEKYF 566


>gi|414585093|tpg|DAA35664.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
          Length = 721

 Score =  623 bits (1606), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 295/469 (62%), Positives = 360/469 (76%), Gaps = 39/469 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+HGWGGPLPQ+WLD QLVLQKKIL R+Y  GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 240 MANMHGWGGPLPQTWLDDQLVLQKKILSRMYSFGMFPVLPAFSGNIPAALKSKFPSAKVT 299

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYGRTSHIYNCDTFDENTPP
Sbjct: 300 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFIEEQIREYGRTSHIYNCDTFDENTPP 359

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P YISSLGAA + GMQSGD+DA+WLMQGWLF+YDPFW PPQMKALL+SVP+GK++VL
Sbjct: 360 LSDPNYISSLGAATFRGMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGKMIVL 419

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DL+AEVKP+W  S Q YGVPYIWCMLHNFA + EMYG+LD++A GP++AR S+N+TM   
Sbjct: 420 DLYAEVKPVWINSDQLYGVPYIWCMLHNFAADFEMYGVLDALASGPIDARLSDNSTMVGV 479

Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          W+  Y  RRYG+ V  +QDAW +LY T+Y
Sbjct: 480 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVKTYPTRRYGKPVKGLQDAWWILYRTLY 539

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN----YGKPVSKEAVLKSETS-SYDHP 322
           NCTDG  DKNRDVIVAFPDV+P +I+ T G + N    Y    SK  + K  +S +Y+HP
Sbjct: 540 NCTDGKNDKNRDVIVAFPDVEPFVIA-TPGLHVNTRQMYSTVPSKNYIRKDVSSDAYEHP 598

Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
           HLWY T+ VI ALELF+  G+E+S SNT+RYDL+DLTRQ LAKYAN++FL IIE+Y+ N+
Sbjct: 599 HLWYDTNAVIHALELFLQHGDEVSDSNTFRYDLVDLTRQVLAKYANDVFLKIIESYKSNN 658

Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
            + V  L + FL LV D+D LL+ H+GFLLGPWLESAK LA+N EQE Q
Sbjct: 659 MNQVTILCQHFLSLVNDLDTLLSSHEGFLLGPWLESAKGLARNSEQEIQ 707


>gi|4160292|emb|CAA77084.1| alpha-N-acetylglucosaminidase [Nicotiana tabacum]
          Length = 811

 Score =  614 bits (1583), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 306/580 (52%), Positives = 394/580 (67%), Gaps = 46/580 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPL Q+WL+ QL LQK+IL R+ ELGM PVLP+FSGNVPAAL+ +FPSA IT
Sbjct: 231 MGNLHAWGGPLSQNWLNIQLALQKQILSRMRELGMTPVLPSFSGNVPAALKKIFPSANIT 290

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W +V  DPRWCCT+LL  +DPLFIEIG AFI +Q++EYG  + IYNCDTF+ENTPP
Sbjct: 291 RLGDWNTVNGDPRWCCTFLLAPSDPLFIEIGEAFIRKQIEEYGDITDIYNCDTFNENTPP 350

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWL-MQGWLFSYD-PFWRPPQMKALLNSVPLGKLV 178
            D P YI           Q   +   WL  + WLF  D  +W+ PQM+ALL+SVP GK++
Sbjct: 351 TDDPTYIHLSALLCTKQCQKQITMRCWLNARVWLFYSDSKYWKSPQMEALLHSVPRGKMI 410

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM- 237
           VLDLFA+VKPIW +S QFYG PYIWCMLHNF GNIEMYG+LD++A GP++ARTSEN+TM 
Sbjct: 411 VLDLFADVKPIWKSSSQFYGTPYIWCMLHNFGGNIEMYGVLDAVASGPIDARTSENSTMV 470

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                            W+  YS RRYG+    IQ AW++LYHT
Sbjct: 471 GVGMCMEGIEHNPVVYELMSEMAFREDNFQLQGWLKSYSHRRYGKVNDQIQAAWDILYHT 530

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPS-----IISVTEGKYQNYGKPVS-----KEAVLKSE 315
           +YNCTDG  D N+D IV FPD DPS      IS T+   QN  + ++     +  +   +
Sbjct: 531 IYNCTDGIADHNKDYIVEFPDWDPSGKTGTDISGTDSSSQNRMQKLAGFQWNRRFLFFEK 590

Query: 316 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 375
           +SS   P LWYST +V +AL+LFI +  +LS S TYRYDL+DL+RQ+L+K AN+++L+ I
Sbjct: 591 SSSLPKPRLWYSTEDVFQALQLFIDALKKLSGSLTYRYDLVDLSRQSLSKLANQVYLDAI 650

Query: 376 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ-LAQNEEQEKQYEW 434
            A++  DA  + Q S +FL L++D+D LLA  D FLLG WLE+  Q LA N +++KQYEW
Sbjct: 651 SAFRREDAKPLNQHSPKFLPLLQDIDRLLAADDNFLLGTWLENCPQNLAMNSDEKKQYEW 710

Query: 435 NARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 494
           NARTQITMWFDNT+   S L DY NK+WSGLL  YY PRA+IYF+ + +SL+    F+L+
Sbjct: 711 NARTQITMWFDNTKYNQSQLHDYANKFWSGLLEAYYLPRASIYFELLSKSLKEKVDFKLE 770

Query: 495 DWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
           +WR+EWI  +N WQ    +YPV++ GDAL  +  L+ KY 
Sbjct: 771 EWRKEWIAYSNKWQESTELYPVKAQGDALAIATALFEKYF 810


>gi|168060822|ref|XP_001782392.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666123|gb|EDQ52786.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 801

 Score =  613 bits (1581), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 296/567 (52%), Positives = 376/567 (66%), Gaps = 50/567 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  WGGPLPQ WLDQQL LQ KIL R+ ELGM PVLPAF+GNVPAA+   +PSA++T
Sbjct: 236 MGNLKRWGGPLPQKWLDQQLQLQIKILARMRELGMTPVLPAFAGNVPAAITKKYPSARVT 295

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG W +V  D R+CCT+LLD  DPLF++IG+AFI QQ+KEYG T HIYNCDTF+EN PP
Sbjct: 296 RLGEWNTVNGDTRYCCTFLLDPKDPLFVDIGKAFILQQIKEYGGTQHIYNCDTFNENQPP 355

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P YIS+LG+ +Y  M + D DA+WLMQ    +YD FW+PPQMKALL+SVP+G++VVL
Sbjct: 356 TDDPSYISALGSIVYEAMSAADQDAIWLMQ----AYDKFWKPPQMKALLHSVPVGRMVVL 411

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
           DLFA+VKP+WS S  FYGVPYIWCMLHNF GN+EMYG LD +A  P++A TS N+TM   
Sbjct: 412 DLFADVKPMWSRSDHFYGVPYIWCMLHNFGGNVEMYGRLDVVATAPIQAVTSSNSTMVGV 471

Query: 239 -------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          WI +Y+ RRYG      + AW +L+ ++Y
Sbjct: 472 GMCMEGIEQNPVVYDLMAEMAFHNATVVVEDWIEEYARRRYGELTAGARIAWKMLHESIY 531

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP-HLWY 326
           NC+DG  D N DVIV FPD+DP          Q+ G+ +              HP H+WY
Sbjct: 532 NCSDGIADHNGDVIVEFPDIDPKRSLFQIRPRQSLGQQI------------LGHPQHIWY 579

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
           S  +   AL+  ++S + L  S  YRYD++DLTRQ L+K AN+L   +++ +++ +   +
Sbjct: 580 SPQDAAVALQYLLSSADALGLSKPYRYDVVDLTRQVLSKLANQLHSQVLDQFRMFNVEKM 639

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
             +S R LEL+ DMD LL   + FLLG WLESAK LA ++E+ K YEWNARTQITMWFDN
Sbjct: 640 DNISSRLLELLSDMDDLLGASEEFLLGTWLESAKDLATSDEERKLYEWNARTQITMWFDN 699

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
           T ++ S L DY NK WSGL RDYY PRA+IY KY+ +SL     F  ++WRREWI LTN+
Sbjct: 700 TLDKPSPLHDYANKMWSGLTRDYYLPRASIYIKYLKQSLHENTSFAFQEWRREWIALTNE 759

Query: 507 WQNGRNVYPVESNGDALITSQWLYNKY 533
           WQ   N+YP  + GDAL  +  LY KY
Sbjct: 760 WQVASNLYPTVAKGDALEIATTLYEKY 786


>gi|302786446|ref|XP_002974994.1| hypothetical protein SELMODRAFT_102402 [Selaginella moellendorffii]
 gi|300157153|gb|EFJ23779.1| hypothetical protein SELMODRAFT_102402 [Selaginella moellendorffii]
          Length = 761

 Score =  599 bits (1544), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 290/567 (51%), Positives = 378/567 (66%), Gaps = 37/567 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPLP+ WL+ QL+LQKKIL  +  LGM  VLPAFSGNVP AL+ ++PSA IT
Sbjct: 192 MGNLHGWGGPLPEKWLELQLILQKKILHHMRSLGMIAVLPAFSGNVPRALKILYPSANIT 251

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L +W +V  +P+WCCTYLL   DPLFI+IG+AFIEQQ+KEYG T H+YNCDTF+EN PP
Sbjct: 252 RLPDWNTVDGNPQWCCTYLLQPMDPLFIQIGKAFIEQQVKEYGSTQHVYNCDTFNENLPP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            D P YIS+L A++Y  M   D  A+WLMQGWLFS D  FW+PPQMKALL++VP GK++V
Sbjct: 312 TDDPSYISALAASVYGAMIVADKQAIWLMQGWLFSSDAQFWKPPQMKALLHAVPFGKMIV 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLFAEV+PIWS S  FYGVPYIWCMLHNF GN EMYG LD ++ GPV+A+TS N+TM  
Sbjct: 372 LDLFAEVRPIWSKSSHFYGVPYIWCMLHNFGGNHEMYGRLDVVSSGPVDAKTSANSTMIG 431

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W+N YS RRYG++VP   +AW +L HT+
Sbjct: 432 VGMCMEGIEQNPVVYELMAEMAFRSTRNALKDWVNDYSTRRYGKAVPEALEAWQILSHTL 491

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YNC+DG  D N DVIV FPD++ S ++ T  +Y           +L    +S+ H  LWY
Sbjct: 492 YNCSDGLQDHNTDVIVKFPDLNASSLT-TLSRYLAEEAGTQTRRLLTEGLTSFGH--LWY 548

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             +E   AL   + + + LS   TYRYDL+DLTRQ L K AN++ L  + ++   D   +
Sbjct: 549 RPTEAKVALSYMLNASSSLSNVATYRYDLVDLTRQVLMKLANQIHLQALVSFVKGDLEEL 608

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
            +     + +++D + LL  ++GFLLGPWLESAK+L  N +++  YEWNARTQ+TMWFDN
Sbjct: 609 TKNCDILIGIIKDSELLLRSNNGFLLGPWLESAKKLGTNSDEKHLYEWNARTQVTMWFDN 668

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
           T+   S L DY NK WSGL  DYY PRA++Y K ++++L   + F    WR  WI LTN 
Sbjct: 669 TRSLPSALHDYANKMWSGLFEDYYLPRASLYTKLLVKALHDKEPFPYGSWRSSWILLTNT 728

Query: 507 WQNGRNVYPVESNGDALITSQWLYNKY 533
           +QNG   YP+E+ GD++  ++ L++KY
Sbjct: 729 FQNGTKNYPLEAAGDSIEIAKSLFSKY 755


>gi|302791289|ref|XP_002977411.1| hypothetical protein SELMODRAFT_107285 [Selaginella moellendorffii]
 gi|300154781|gb|EFJ21415.1| hypothetical protein SELMODRAFT_107285 [Selaginella moellendorffii]
          Length = 761

 Score =  597 bits (1539), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 288/567 (50%), Positives = 376/567 (66%), Gaps = 37/567 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPLP+ WL+ QL+LQKKIL  +  LGM  VLPAFSGNVP AL+ ++PSA IT
Sbjct: 192 MGNLHGWGGPLPEKWLELQLILQKKILHHMRSLGMIAVLPAFSGNVPRALKILYPSANIT 251

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L +W +V  +P+WCCTYLL   DPLFI+IG+AFIEQQ+KEYG T H+YNCDTF+EN PP
Sbjct: 252 RLPDWNTVDGNPQWCCTYLLQPMDPLFIQIGKAFIEQQVKEYGSTQHVYNCDTFNENLPP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            D P YIS+L A++Y  M   D  A+WLMQGWLFS D  FW+PPQMKALL++VP GK++V
Sbjct: 312 TDDPSYISALAASVYGAMIVADKQAIWLMQGWLFSSDAQFWKPPQMKALLHAVPFGKMIV 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLFAEV+PIWS S  FYGVPYIWCMLHNF GN EMYG LD ++ GPV+A+TS N+TM  
Sbjct: 372 LDLFAEVRPIWSKSSHFYGVPYIWCMLHNFGGNHEMYGRLDVVSSGPVDAKTSANSTMIG 431

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W++ YS RRYG++VP   +AW +L HT+
Sbjct: 432 VGMCMEGIEQNPVVYELMAEMAFRSTRNALKDWVDDYSTRRYGKAVPEALEAWQILSHTL 491

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YNC+DG  D N DVIV FPD++ S ++         G   ++  + +  TS     HLWY
Sbjct: 492 YNCSDGLQDHNTDVIVKFPDLNASSLTTLSRYLAEEGGTQTRRLLTEGLTS---FGHLWY 548

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             +E   AL   + + + LS   TYRYDL+DLTRQ L K AN++ L  + ++   D   +
Sbjct: 549 RPTEAKVALSYMLNASSSLSNVATYRYDLVDLTRQVLMKLANQIHLQALVSFVKGDLEEL 608

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
            +     + +++D + LL  ++GFLLGPWLESAK+L  N ++   YEWNARTQ+TMWFDN
Sbjct: 609 TKNCDILIGIIKDSELLLRSNNGFLLGPWLESAKKLGTNSDETNLYEWNARTQVTMWFDN 668

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
           T+   S L DY NK WSGL  DYY PRA++Y K ++++L   + F    WR  WI LTN 
Sbjct: 669 TRTLPSALHDYANKMWSGLFEDYYLPRASLYTKLLVKALHDKEPFPYDSWRSSWILLTNT 728

Query: 507 WQNGRNVYPVESNGDALITSQWLYNKY 533
           +QNG   YP+E+ GD++  ++ L++KY
Sbjct: 729 FQNGTKNYPLEAAGDSIEIAKSLFSKY 755


>gi|90399367|emb|CAJ86183.1| H0212B02.15 [Oryza sativa Indica Group]
 gi|116311963|emb|CAJ86322.1| OSIGBa0113E10.5 [Oryza sativa Indica Group]
          Length = 692

 Score =  559 bits (1441), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 272/466 (58%), Positives = 335/466 (71%), Gaps = 44/466 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+HGWGGPLPQSWLD QL LQKKIL R+Y  GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 261 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 320

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYG TSH+Y+CDTFDENTPP
Sbjct: 321 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGGTSHVYSCDTFDENTPP 380

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P YISSLGAA + GMQSGD DA+WLMQGWLFSYDPFW PPQMK  +     G     
Sbjct: 381 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFSYDPFWEPPQMKIGVGMSMEG----- 435

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAWI 240
               E  PI                         +Y ++  +AF        +     W+
Sbjct: 436 ---IEQNPI-------------------------VYDLMSEMAF-----HHRQVDLQVWV 462

Query: 241 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY- 299
             Y  RRYG+S+  +QDAW +LY T+YNCTDG  DKNRDVIVAFPDV+P +I  T G Y 
Sbjct: 463 ETYPTRRYGKSIVGLQDAWKILYQTLYNCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYT 521

Query: 300 ---QNYGKPVSKEAV-LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 355
              + Y   +SK  + + +    Y+HPHLWY T  VIRALELF+  G+E+S SNT+RYDL
Sbjct: 522 SSSKTYSTKLSKNYIAVDASNDEYEHPHLWYDTDAVIRALELFLRYGDEVSDSNTFRYDL 581

Query: 356 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 415
           +DLTRQ LAKYAN++F+ IIE+Y+ N+ + V  L + F++LV D+D LLA H+GFLLGPW
Sbjct: 582 VDLTRQTLAKYANQVFVKIIESYKANNVNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPW 641

Query: 416 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKY 461
           LESAK LA+++EQE QYEWNARTQITMWFDNT+ +ASLLRDYG  +
Sbjct: 642 LESAKGLARDKEQEMQYEWNARTQITMWFDNTKTKASLLRDYGEAH 687


>gi|326521470|dbj|BAK00311.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 428

 Score =  451 bits (1161), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 223/421 (52%), Positives = 287/421 (68%), Gaps = 34/421 (8%)

Query: 149 MQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLH 207
           +QGWLF  D  FW+  QMKALL+SVP+GK++VLDLFA+VKPIW TS QFYGVPYIWCMLH
Sbjct: 8   VQGWLFYSDAVFWKESQMKALLHSVPIGKMMVLDLFADVKPIWQTSSQFYGVPYIWCMLH 67

Query: 208 NFAGNIEMYGILDSIAFGPVEARTSENTTMA----------------------------- 238
           NF GNIEMYG+LDSI+ GPV+ARTS N+TM                              
Sbjct: 68  NFGGNIEMYGVLDSISSGPVDARTSYNSTMVGVGMCMEGIEHNPVVYELMSEMAFRSQKV 127

Query: 239 ----WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
               W+  YS RRYG+S   IQ AW +LYHT+YNCTDG  D N+D IV FPD+ PS  S 
Sbjct: 128 KVEDWLKTYSHRRYGQSNVEIQKAWGILYHTIYNCTDGIADHNKDYIVEFPDMSPSSFSS 187

Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
              K             L   ++S   PHLWYST E I++LELF+ +GN+LS S TYRYD
Sbjct: 188 QYSKRSISLARKHPRFFLSEVSASLPQPHLWYSTEEAIKSLELFLNAGNDLSKSLTYRYD 247

Query: 355 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 414
           L+DLTRQ+L+K AN+++ + I +YQ  D+ G+   ++ FLEL+ D+D LLA  D FLLGP
Sbjct: 248 LVDLTRQSLSKLANKVYHDAISSYQKRDSSGLNFHTKEFLELIVDIDTLLASDDNFLLGP 307

Query: 415 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 474
           WLESAK LA  E++ KQYEWNARTQ+TMW+D+T+ E S L DY NK+WSGLL+ YY PRA
Sbjct: 308 WLESAKSLAMTEDERKQYEWNARTQVTMWYDDTKTEQSKLHDYANKFWSGLLKSYYLPRA 367

Query: 475 AIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
           + YF  +  SL+    F+L++WRR+WI  +N+WQ+G+ +YPV++ GD+L  S+ L+ KY 
Sbjct: 368 SKYFSRLSRSLQENRSFQLEEWRRDWISYSNEWQSGKELYPVKAIGDSLAISRSLFTKYF 427

Query: 535 Q 535
           +
Sbjct: 428 R 428


>gi|449518399|ref|XP_004166229.1| PREDICTED: alpha-N-acetylglucosaminidase-like, partial [Cucumis
           sativus]
          Length = 336

 Score =  446 bits (1147), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 219/324 (67%), Positives = 260/324 (80%), Gaps = 7/324 (2%)

Query: 212 NIEMYGILDSIAFGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTD 271
           N  +Y ++  +AF     + ++     W+ QYSVRRYG  VP+IQDAW+VLYHTVYNCTD
Sbjct: 13  NPVVYDLMSEMAF-----QHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTVYNCTD 67

Query: 272 GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEV 331
           GA DKNRDVIVAFPDVDPS I V       +G   S  +V + + +++D PHLWY TSEV
Sbjct: 68  GANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDS--SVDRLQDATFDRPHLWYPTSEV 125

Query: 332 IRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSR 391
           I AL+LFIA G++LS+SNTYRYDL+DLTRQALAKY+NELF  I++AYQL+D   +  LS+
Sbjct: 126 ISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMASLSQ 185

Query: 392 RFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEA 451
            FLELV D+D LLACH+GFLLGPWL+SAKQLA++EE+EKQYEWNARTQITMWFDNT+EEA
Sbjct: 186 EFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWFDNTEEEA 245

Query: 452 SLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGR 511
           SLLRDYGNKYWSGLL DYY PRAAIY K++ ES E+G  F L +WRREWIKLTNDWQ+ R
Sbjct: 246 SLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLTNDWQSSR 305

Query: 512 NVYPVESNGDALITSQWLYNKYLQ 535
            +YPVESNGDAL TS WLYNKYLQ
Sbjct: 306 KIYPVESNGDALDTSHWLYNKYLQ 329


>gi|414585094|tpg|DAA35665.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
          Length = 1202

 Score =  424 bits (1089), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 221/431 (51%), Positives = 281/431 (65%), Gaps = 58/431 (13%)

Query: 142 DSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPY 201
           DS+  W     L + DP +           V +GK+ + +   E    +  +   Y    
Sbjct: 308 DSNPRWCCTYLLDASDPLF-----------VEIGKMFIEEQIRE----YGRTSHIYN--- 349

Query: 202 IWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA----------------------- 238
            WCMLHNFA + EMYG+LD++A GP++AR S+N+TM                        
Sbjct: 350 -WCMLHNFAADFEMYGVLDALASGPIDARLSDNSTMVGVGMSMEGIEQNPIVYDLMSEMA 408

Query: 239 ----------WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVD 288
                     W+  Y  RRYG+ V  +QDAW +LY T+YNCTDG  DKNRDVIVAFPDV+
Sbjct: 409 FHHRQVDLQVWVKTYPTRRYGKPVKGLQDAWWILYRTLYNCTDGKNDKNRDVIVAFPDVE 468

Query: 289 PSIISVTEGKYQN----YGKPVSKEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASGN 343
           P +I+ T G + N    Y    SK  + K  +S +Y+HPHLWY T+ VI ALELF+  G+
Sbjct: 469 PFVIA-TPGLHVNTRQMYSTVPSKNYIRKDVSSDAYEHPHLWYDTNAVIHALELFLQHGD 527

Query: 344 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGL 403
           E+S SNT+RYDL+DLTRQ LAKYAN++FL IIE+Y+ N+ + V  L + FL LV D+D L
Sbjct: 528 EVSDSNTFRYDLVDLTRQVLAKYANDVFLKIIESYKSNNMNQVTILCQHFLSLVNDLDTL 587

Query: 404 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 463
           L+ H+GFLLGPWLESAK LA+N EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWS
Sbjct: 588 LSSHEGFLLGPWLESAKGLARNSEQEIQYEWNARTQITMWFDNTETKASLLRDYANKYWS 647

Query: 464 GLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL 523
           GLL+DYYGPRAAIYFK+++ S+E+   F LK+WRREWI LTN+WQ+ R V+   + GD L
Sbjct: 648 GLLQDYYGPRAAIYFKHLLLSMENNAPFALKEWRREWISLTNNWQSDRKVFSTTATGDPL 707

Query: 524 ITSQWLYNKYL 534
             SQ LY KYL
Sbjct: 708 NISQSLYTKYL 718



 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 85/110 (77%), Positives = 101/110 (91%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+HGWGGPLPQ+WLD QLVLQKKIL R+Y  GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 240 MANMHGWGGPLPQTWLDDQLVLQKKILSRMYSFGMFPVLPAFSGNIPAALKSKFPSAKVT 299

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYN 110
            LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYGRTSHIYN
Sbjct: 300 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFIEEQIREYGRTSHIYN 349


>gi|156399499|ref|XP_001638539.1| predicted protein [Nematostella vectensis]
 gi|156225660|gb|EDO46476.1| predicted protein [Nematostella vectensis]
          Length = 675

 Score =  423 bits (1087), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 221/569 (38%), Positives = 317/569 (55%), Gaps = 75/569 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+HGWGGPLP +W   +L LQ KIL  +   GM PVLP F+G+VPA L  ++P A ++
Sbjct: 142 MGNMHGWGGPLPSTWYGMKLNLQHKILAAMRNFGMTPVLPGFAGHVPAGLLRLYPKANVS 201

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W +  S   +CCTYLL+ +DPLF +IG AFI++Q  EYG T+HIYN DTF+E  P 
Sbjct: 202 KLGDWGNFNST--YCCTYLLEPSDPLFQKIGTAFIKEQTAEYG-TNHIYNADTFNEMRPR 258

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
              P Y+ +  +A+Y GM  GD DAVWLMQGWLF  + FW+P Q+KALL+ VP G ++VL
Sbjct: 259 SSDPTYLGAASSAVYRGMAGGDPDAVWLMQGWLFVDEGFWKPDQIKALLHGVPQGFMIVL 318

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
           DL+AE  PIWS ++ FYG P+IWCML NF GNI ++G + S++ GP +A  S N+TM   
Sbjct: 319 DLWAENSPIWSRTQSFYGTPFIWCMLLNFGGNIGLFGNIKSVSTGPPKAFQSFNSTMIGT 378

Query: 239 ----------------------------------WINQYSVRRYGRSVPAIQDAWNVLYH 264
                                             WI  Y++RRYG + PAI  AW +L  
Sbjct: 379 GLTMEGIEQNDMMFELMNEMGYRLEPLNPVDLDNWIKDYALRRYGGTNPAIIQAWRLLIR 438

Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           +VY C     D    + V  P +D                               + P+L
Sbjct: 439 SVYQCNGYCADHIHSIFVWKPSLD-------------------------------NKPNL 467

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY   +V  A +   ++  E     T+RYDL+D+TRQAL      ++ ++I AY+   A 
Sbjct: 468 WYDPEDVFNAWDELRSTAAEFMHVETFRYDLVDVTRQALHLRVIPIYNDLISAYKNRSAL 527

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
            V     R LE+ +D+D LL  +  FLLG WL SAK L     +   YE+NAR QIT+W 
Sbjct: 528 NVIHFGSRLLEMFDDLDSLLQTNRNFLLGRWLNSAKALGTTPAEVALYEFNARNQITLWG 587

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
              + E     DY NK WSGL++ YY PR  ++   M+ ++  G+    + ++++ ++  
Sbjct: 588 PRGEIE-----DYANKMWSGLVKAYYKPRWELFIDEMVSAIAQGEELDYEAFKKKLLEQE 642

Query: 505 NDWQNGRNVYPVESNGDALITSQWLYNKY 533
             W +G+  YP + +GD+L  +++L+NK+
Sbjct: 643 TAWTHGKEEYPDQPSGDSLAAAEFLHNKW 671


>gi|384247107|gb|EIE20595.1| hypothetical protein COCSUDRAFT_37819 [Coccomyxa subellipsoidea
           C-169]
          Length = 762

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 226/570 (39%), Positives = 318/570 (55%), Gaps = 64/570 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL G+GGPLPQS++D Q  LQ+KI+ R+ ELGM+PV PAF+G VP AL    P+A+I+
Sbjct: 199 MGNLRGYGGPLPQSYIDDQAELQRKIVRRMRELGMSPVFPAFAGFVPGALARERPAARIS 258

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTS-HIYNCDTFDENTP 119
           +  NW S  +  R+CC +LLD  +PLF EIG AF++   +EYG      Y+ DTF+E TP
Sbjct: 259 RSDNWCSFPA--RYCCVHLLDPLEPLFQEIGSAFVKVLREEYGSDEVGFYSADTFNEMTP 316

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD--PFWRPPQMKALLNSVPLGKL 177
           P   P Y++S+ +AIY+ M + D  A WLMQ WLF YD   FW+PPQ++AL++ VP   L
Sbjct: 317 PSSDPAYLTSVTSAIYNAMAAADPSARWLMQAWLF-YDNQKFWQPPQIQALVSGVPRDAL 375

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-------- 229
           ++LDL+AEV P+W ++K F+G P+I+CMLHNF GNIEMYG L+++A GP E         
Sbjct: 376 IMLDLYAEVFPLWKSTKSFFGAPFIYCMLHNFGGNIEMYGALEAVARGPAEGQIDGVAGL 435

Query: 230 -------------------------RTSENTTMAWINQYSVRRYGRSVPAIQ-DAWNVLY 263
                                    R        WI  Y+ RRYG S P     AW++L 
Sbjct: 436 IGIGMCPEGIEQNPVVYELMSEWAFRRQPVEVEGWIEAYARRRYGNSTPPTALVAWDLLL 495

Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
            +VYN TDG TD +RD+  + P + P+ + +   K                       PH
Sbjct: 496 RSVYNATDGHTDHSRDIPTSRPGLSPAEVGLWGLK-----------------------PH 532

Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
           LWY+  +V+ A  L + S  EL     YRYDL+D+ RQ ++K A +++  + EAY    +
Sbjct: 533 LWYNEQQVVDAWGLLLRSAGELQQVEGYRYDLVDVGRQVISKRATDIWKAVAEAYVDGRS 592

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
             V +   R L+L++D++ LLA + GFLLGP LE A      E + + YEWN R Q+T+W
Sbjct: 593 IVVRREGARLLQLLDDLEELLATNRGFLLGPKLEEASSAGHTEAEARLYEWNLRKQLTVW 652

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
              +    S + DY N+ W+GL+  YY PR A++   +   L  G  +  + WR E +  
Sbjct: 653 -GTSDTGGSEIEDYANREWAGLISSYYKPRWALWLLRLETDLAQGRRYDPEAWRMECLNF 711

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
           T  W   R+  P+   GD    SQ LY  Y
Sbjct: 712 TLGWAYLRDQLPLHPQGDTGGVSQRLYEVY 741


>gi|390348210|ref|XP_785272.3| PREDICTED: alpha-N-acetylglucosaminidase [Strongylocentrotus
           purpuratus]
          Length = 793

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 213/569 (37%), Positives = 304/569 (53%), Gaps = 74/569 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPLPQSW   QL LQ +IL R+ +LGM PVLPAF+G+VP +   VFP+A I+
Sbjct: 220 MGNIDGWGGPLPQSWHTNQLALQHQILKRMRDLGMIPVLPAFAGHVPXSFSKVFPNASIS 279

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG+W   +  P +CCT LLD  DP+F ++G+AFI+   +E+  T HIY+ DTF+EN P 
Sbjct: 280 NLGDW--GRFGPEYCCTSLLDPQDPMFKQVGKAFIDAMSEEFNGTDHIYSADTFNENKPK 337

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
                Y+S+    +Y G+  GD   VWLM GWLF    FW P Q+KALL  VP+G+++VL
Sbjct: 338 SRDSAYLSAASKGVYQGIIEGDPKGVWLMMGWLFQDTGFWGPTQIKALLQGVPIGRMIVL 397

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DL+AE +P + T+  FYG P+IWCMLHNF GN  +YG LD++  GP EAR  +N+TM   
Sbjct: 398 DLYAEARPFYKTTYSFYGQPFIWCMLHNFGGNTGLYGKLDAVNQGPFEARNYDNSTMIGM 457

Query: 238 ------------------------------AWINQYSVRRYG---RSVPAIQDAWNVLYH 264
                                          WI QY+ RRY           +AW +L  
Sbjct: 458 GTTPEGIFQNYVMYNFLTDMTWRSGSTNVSKWIEQYAGRRYSNDPNKSEEATEAWVILKE 517

Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           TVYN T    D                          Y  PV + + + +         +
Sbjct: 518 TVYNNTGTLQD------------------------HQYAVPVRRPSNIMTSP-------V 546

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY  ++V +A E  + +  +L  S  +RYDL+D+TR  L   A +    ++ ++++ +A 
Sbjct: 547 WYDYTKVAKAWEFLLEASTKLGTSPVFRYDLVDVTRNVLQDLAFDFQQKLMVSFRIRNAG 606

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
            V         L+ DMD + + H+ +LLG WLE AK LA N ++E  YE+NA+ QIT+W 
Sbjct: 607 AVGGNGTLLCNLILDMDNITSSHEDWLLGTWLEDAKSLATNNDEESLYEYNAKNQITIW- 665

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
              +EE   + DY NK W GLLR YY  R  +Y +Y+ E ++S   +    +        
Sbjct: 666 -GPKEE---ILDYANKQWGGLLRTYYHRRWQLYVQYLEECIQSHQPYDQNTFNVRSFVAE 721

Query: 505 NDWQNGRNVYPVESNGDALITSQWLYNKY 533
           ++W + +  +P E  GD +  S+ LY KY
Sbjct: 722 SEWTHSKEKFPTEPVGDTMAISKALYVKY 750


>gi|14861378|gb|AAK73654.1| lysosomal alpha-N-acetyl glucosaminidase [Dromaius novaehollandiae]
          Length = 753

 Score =  389 bits (999), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 217/573 (37%), Positives = 316/573 (55%), Gaps = 84/573 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGW GPLP++W  +QL +Q ++L R+  LGM  VLPAF+G+VP  +   FP    T
Sbjct: 212 MGNLHGWAGPLPRAWHLKQLYVQYRVLERMRSLGMITVLPAFAGHVPQGVLRAFPRVNAT 271

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG W     D  + CTYLLD  DP+F  IG  F+++ +KE+G T HIY+ DTF+E  P 
Sbjct: 272 RLGGWSHF--DCTYSCTYLLDPEDPMFQVIGTLFLKELIKEFG-TDHIYSADTFNEMNPL 328

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+S + +A++  M   D  AVWLMQGWLF + P FW+P Q++ALL+ VPLG+++V
Sbjct: 329 SSDPAYLSRVSSAVFRSMTGADPKAVWLMQGWLFQHQPDFWQPAQVRALLHGVPLGRMIV 388

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P++  ++ FYG P+IWCMLHNF GN  ++G +++I  GP  AR   N+TM  
Sbjct: 389 LDLFAESRPVYQWTESFYGQPFIWCMLHNFGGNHGLFGTVEAINHGPFAARRFPNSTMVG 448

Query: 238 -------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                          +W+ +Y+ RRYG    A   AW +L  +V
Sbjct: 449 TGLVPEGIEQNDMVYELMNELGWRQEPLDLPSWVARYAERRYGAPNAAAASAWQLLLRSV 508

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YNCT    + NR  +V  P                          L+ +T       +WY
Sbjct: 509 YNCTGVCVNHNRSPLVRRPS-------------------------LRMDT------EVWY 537

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ------L 380
           + S+V  A  L +++G EL +S T+ YDL D+TRQA  +  +E +L+I +A+Q      L
Sbjct: 538 NKSDVYEAWRLLLSAGAELGSSPTFGYDLADVTRQAAQQLVSEYYLSIRQAFQSRSLPEL 597

Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
             A GV        +L+ ++DGLL+ H  FLLG WLESA+ +A ++ + +QYE NAR Q+
Sbjct: 598 LTAGGVL-----VYDLLPELDGLLSSHRLFLLGRWLESARAVATSDREAEQYELNARNQV 652

Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 500
           T+W  N       + DY NK   GL+ DYYG R +++   ++ESL SG  F    + +  
Sbjct: 653 TLWGPNGN-----ILDYANKQLGGLVLDYYGVRWSLFVSALVESLNSGSPFHQDQFNQAV 707

Query: 501 IKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
            ++   +   +  YP    GD L  S+ ++ KY
Sbjct: 708 FQVERGFIYNKKRYPTAPVGDTLEISKKIFLKY 740


>gi|14861380|gb|AAK73655.1| lysosomal alpha-N-acetyl glucosaminidase [Dromaius novaehollandiae]
          Length = 753

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 217/573 (37%), Positives = 316/573 (55%), Gaps = 84/573 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGW GPLP++W  +QL +Q ++L R+  LGM  VLPAF+G+VP  +   FP    T
Sbjct: 212 MGNLHGWAGPLPRAWHLKQLYVQYRVLERMRSLGMITVLPAFAGHVPQGVLRAFPRVNAT 271

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG W     D  + CTYLLD  DP+F  IG  F+++ +KE+G T HIY+ DTF+E  P 
Sbjct: 272 RLGGWSHF--DCTYSCTYLLDPEDPMFQVIGTLFLKELIKEFG-TDHIYSADTFNEMNPL 328

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+S + +A++  M   D  AVWLMQGWLF + P FW+P Q++ALL+ VPLG+++V
Sbjct: 329 SSDPAYLSRVSSAVFRSMTGADPKAVWLMQGWLFQHQPDFWQPAQVRALLHGVPLGRMIV 388

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P++  ++ FYG P+IWCMLHNF GN  ++G +++I  GP  AR   N+TM  
Sbjct: 389 LDLFAESRPVYQWTESFYGQPFIWCMLHNFGGNHGLFGTVEAINHGPFAARRFPNSTMVG 448

Query: 238 -------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                          +W+ +Y+ RRYG    A   AW +L  +V
Sbjct: 449 TGLVPEGIEQNDMVYELMNELGWRQEPLDLPSWVARYAERRYGAPNAAAASAWXLLLRSV 508

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YNCT    + NR  +V  P                          L+ +T       +WY
Sbjct: 509 YNCTGVCVNHNRSPLVRRPS-------------------------LRMDT------EVWY 537

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ------L 380
           + S+V  A  L +++G EL +S T+ YDL D+TRQA  +  +E +L+I +A+Q      L
Sbjct: 538 NKSDVYEAWRLLLSAGAELGSSPTFGYDLADVTRQAAQQLVSEYYLSIRQAFQSRSLPEL 597

Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
             A GV        +L+ ++DGLL+ H  FLLG WLESA+ +A ++ + +QYE NAR Q+
Sbjct: 598 LTAGGVL-----VYDLLPELDGLLSSHRLFLLGRWLESARAVATSDREAEQYELNARNQV 652

Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 500
           T+W  N       + DY NK   GL+ DYYG R +++   ++ESL SG  F    + +  
Sbjct: 653 TLWGPNGN-----ILDYANKQLGGLVLDYYGVRWSLFVSALVESLNSGSPFHQDQFNQAV 707

Query: 501 IKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
            ++   +   +  YP    GD L  S+ ++ KY
Sbjct: 708 FQVERGFIYNKKRYPTAPVGDTLEISKKIFLKY 740


>gi|375144105|ref|YP_005006546.1| alpha-N-acetylglucosaminidase [Niastella koreensis GR20-10]
 gi|361058151|gb|AEV97142.1| Alpha-N-acetylglucosaminidase [Niastella koreensis GR20-10]
          Length = 735

 Score =  377 bits (967), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 217/568 (38%), Positives = 310/568 (54%), Gaps = 73/568 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  WGGPLP SW+     LQ+KIL R  ELGM PVLPAF+G+VP A +  +P+AK+ 
Sbjct: 201 MGNLDAWGGPLPLSWMKSHKALQEKILQRERELGMKPVLPAFTGHVPPAFKKKYPNAKL- 259

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +  NW +  +D     TY+LD+ DPLF E+G+ F+++Q   +G T H+Y+ DTF+EN PP
Sbjct: 260 KATNWTNGFAD-----TYILDSQDPLFAEMGKRFLQKQTSLFG-TDHLYSADTFNENEPP 313

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
            D P ++S+L A IY GM+  D+ A W+MQGWLF  D  FW+ PQ++ALL +VP  K+++
Sbjct: 314 SDDPAFLSALSARIYEGMKQADTAATWVMQGWLFYSDRKFWKAPQIEALLKAVPDNKMIL 373

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA---------- 229
           LDL AE++P+W  +  FYG P+IW MLHNF GN+ ++G +D +A  P E           
Sbjct: 374 LDLAAEIEPVWKRTDAFYGKPWIWNMLHNFGGNVNLFGRMDGVATQPAETLNDKASGKLW 433

Query: 230 -------RTSENTTM-----------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                     +N  M                 AWI QY + RY  +   + DAW +L  T
Sbjct: 434 GIGLTMEAIEQNPVMYELMTRHTWQTTPVDLDAWIPQYVLNRYRTNNTNLVDAWQILRKT 493

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VYN   GA  +         D   SII+         G+P      + + T     PH  
Sbjct: 494 VYN---GAVIR---------DGAESIIT---------GRPTFDSTTVWTRTKLNYAPH-- 530

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
               E++ A +LF+ +  +   S+ ++YDL+D+TRQ LA YA  L    + A+   D+  
Sbjct: 531 ----ELLPAWDLFVQAAGKGVNSDGFQYDLVDVTRQVLANYAAPLQKKWVTAFNAKDSAA 586

Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
             + S+ FL+L+ DMD LLA    F+LGPWL +A+       ++  YE NAR  IT+W D
Sbjct: 587 FNKYSKAFLQLISDMDLLLASRKDFMLGPWLSAARSNGTTPAEKALYEQNARDLITLWGD 646

Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
                 S L +Y N+ WSGLL D+Y PR   +F  + +SL +G    LK +         
Sbjct: 647 AN----SPLHEYSNRQWSGLLNDFYKPRWQQFFTLLQQSLRTGSTPDLKQFEENIRSWEW 702

Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKY 533
            W N +  YPV  +G+++  +Q LY KY
Sbjct: 703 KWVNTQKAYPVVPSGNSVQVAQMLYKKY 730


>gi|73965663|ref|XP_548088.2| PREDICTED: alpha-N-acetylglucosaminidase [Canis lupus familiaris]
          Length = 747

 Score =  373 bits (958), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 213/570 (37%), Positives = 312/570 (54%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP SW  +QL LQ +IL R+   GM PVLPAFSG+VP AL  VFP   IT
Sbjct: 205 MGNLHTWGGPLPHSWHLKQLYLQHRILDRMRSFGMIPVLPAFSGHVPKALTRVFPQINIT 264

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG+W        + C++LL   DPLF  IG  F+ + ++E+G T+HIY  DTF+E  PP
Sbjct: 265 QLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRELIQEFG-TNHIYGADTFNEMQPP 321

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y++S  A++Y  M + DSDAVWL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 322 SSEPSYLASATASVYQAMITVDSDAVWLLQGWLFQHQPQFWGPAQVKAVLEAVPRGRLLV 381

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 382 LDLFAESQPVYIQTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMLG 441

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW++ ++ RRYG +    + AW +L  +
Sbjct: 442 TGMAPEGIGQNEVVYALMAELGWRKDPVADLEAWVSSFAARRYGVAHRDTEAAWRLLLRS 501

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +  +  NR  +V      PS+  VT                            +
Sbjct: 502 VYNCSGEACSGHNRSPLVR----RPSLQMVTT---------------------------V 530

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S+V  A  L + +   L++S T+RYDL+D+TRQA  +  +  ++    AY   +  
Sbjct: 531 WYNRSDVFEAWRLLLTAAPTLASSPTFRYDLLDVTRQAAQELVSLYYVEARSAYLRKELV 590

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + + +   + EL+  +D +LA    FLLG WLE A+  A +E +   YE N+R Q+T+W
Sbjct: 591 PLLRAAGVLVYELLPALDKVLASDSRFLLGRWLEQARAAAVSEAEAHLYEQNSRYQLTLW 650

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 E ++L DY NK  +GL+ DYY PR  ++ + ++ESL  G  F+   + +   +L
Sbjct: 651 ----GPEGNIL-DYANKQLAGLVADYYTPRWRLFMEMLVESLVQGIPFQQHQFDKNAFQL 705

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +  G   YP + +GD +  ++ L+ KY
Sbjct: 706 EQTFIFGTQRYPSQPDGDTVDLAKKLFIKY 735


>gi|357458269|ref|XP_003599415.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
 gi|355488463|gb|AES69666.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
          Length = 539

 Score =  373 bits (958), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 175/265 (66%), Positives = 203/265 (76%), Gaps = 27/265 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLDQQLVLQK+I+ R+ ELGM PVLP+FSGNVPAAL  +FPSAKIT
Sbjct: 234 MGNLHGWGGPLSQNWLDQQLVLQKQIISRMLELGMTPVLPSFSGNVPAALTKIFPSAKIT 293

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK-------------------- 100
           +LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +Q+K                    
Sbjct: 294 RLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRKQIKATETIHQESEDLGSLIIMDR 353

Query: 101 ------EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 154
                 EYG  + IYNCDTF+EN+PP   P YIS+LGAA+Y G+  GD DAVWLMQGWLF
Sbjct: 354 AVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLGAAVYQGISKGDKDAVWLMQGWLF 413

Query: 155 SYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 213
             D  FW+PPQMKALL SVP GK++VLDLFA+VKPIW TS QFYG PYIWCMLHNF GNI
Sbjct: 414 YSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIWKTSFQFYGTPYIWCMLHNFGGNI 473

Query: 214 EMYGILDSIAFGPVEARTSENTTMA 238
           EMYG+LD+IA GPV+AR SEN+TM 
Sbjct: 474 EMYGVLDAIASGPVDARVSENSTMV 498


>gi|443691318|gb|ELT93213.1| hypothetical protein CAPTEDRAFT_144379, partial [Capitella teleta]
          Length = 718

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 209/549 (38%), Positives = 305/549 (55%), Gaps = 75/549 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPL  +W  QQ++LQ +IL R+ +LGM P LPAF+G+VPA +  +FP  K++
Sbjct: 188 MGNMRGWGGPLSTNWHHQQILLQHRILKRMRDLGMTPALPAFAGHVPANITRLFPRVKVS 247

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W    S   +CCT LLD  DPLF EIG+AFI++  +E+G T H+YN DTF+E TP 
Sbjct: 248 KLGDWGRFNST--YCCTTLLDVEDPLFKEIGKAFIDEYTREFG-TDHVYNTDTFNEMTPA 304

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
              P Y++  G A+YSGM S DS A+WLMQGWLF  D FW+PPQ KALL SVP GK++VL
Sbjct: 305 SSDPSYLTKAGQAVYSGMVSSDSKAIWLMQGWLFLSD-FWKPPQAKALLTSVPQGKMLVL 363

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
           DL++EV P +   + +YG P+IWCMLHNF G + MYG ++S+  GP E R+  N+TM   
Sbjct: 364 DLYSEVNPQYPRLQSYYGQPFIWCMLHNFGGTLPMYGAIESVNQGPFEGRSFVNSTMVGI 423

Query: 239 -------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          W ++Y+ RRY       + AW +   TVY
Sbjct: 424 GLTPEGINQNEVMYEFMMENSFRSQPVELTEWFDKYATRRYASRNANARAAWQIFKRTVY 483

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           NC+DG    N+++ V  P                               S  +   +WY 
Sbjct: 484 NCSDGVKHHNKNIPVCRP-------------------------------SRKNKIDVWYD 512

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
             +  +  +L IA+  E+  S  +RYDL+D++RQAL   +   +  I+ +Y+  +   + 
Sbjct: 513 VEDFFKGWDLMIAASKEVD-SPLFRYDLVDVSRQALQVISITYYNQILTSYKQKNLTSLA 571

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
                 L L++DMD +LA    FLLG W+  A +     E++  YE+NAR Q+T+W    
Sbjct: 572 SSGNDLLHLLDDMDTVLATDSHFLLGAWIAGAHRNGVTPEEKALYEFNARNQVTLW---- 627

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE-WIKLTND 506
             +A++L DY NK W+GL+ DYY  R  ++   + +SLE+   F  K ++++ + K  + 
Sbjct: 628 GPDANIL-DYANKQWAGLVADYYHERWELFIDELKKSLENKTSFDEKKFQKDVFEKAESP 686

Query: 507 WQNGRNVYP 515
           +    NVYP
Sbjct: 687 FTYRTNVYP 695


>gi|149054264|gb|EDM06081.1| rCG33377, isoform CRA_d [Rattus norvegicus]
          Length = 580

 Score =  369 bits (948), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 209/570 (36%), Positives = 307/570 (53%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP+SW  +QL LQ +IL R+   GM PVLPAF+G+VP A+  VFP   + 
Sbjct: 43  MGNLHTWDGPLPRSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAITRVFPQVNVI 102

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLGNW        + C++LL   DPLF  IG  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 103 QLGNWGHFNCS--YSCSFLLAPGDPLFPLIGTLFLRELTKEFG-TDHIYGADTFNEMQPP 159

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 160 FSDPSYLAAATAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIKAVLEAVPRGRLLV 219

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P++S +  F+G P+IWCMLHNF GN  ++G L+ +  GP  AR   N+TM  
Sbjct: 220 LDLFAETQPVYSRTASFHGQPFIWCMLHNFGGNHGLFGALEDVNQGPQAARLFPNSTMVG 279

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW++ ++ RRYG S P    AW +L  +
Sbjct: 280 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLVAWVSSFASRRYGVSQPDAVAAWRLLLRS 339

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +  +  NR                          P+ K   L+  T+      +
Sbjct: 340 VYNCSGEACSGHNR-------------------------SPLVKRPSLQMSTA------V 368

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S+V  A  L + +   L+AS  +RYDL+D+TRQA+ +  +  +     A+   D  
Sbjct: 369 WYNRSDVFEAWRLLLRAAPNLTASPAFRYDLLDVTRQAVQELVSSCYEEARTAFLNQDLD 428

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + +       +L+  +D LLA +  FLLG WL+ A+++A +E + + YE N+R QIT+W
Sbjct: 429 LLLRAGGLLTYKLLPSLDELLASNSHFLLGTWLDQAREVAVSESEAQFYEQNSRYQITLW 488

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 E ++L DY NK  +GL+ DYY PR  ++   +  SL  G  F+   + +    L
Sbjct: 489 ----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGIPFQQHQFEKSVFPL 543

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              + N +  YP++  GD +  S+ ++ K+
Sbjct: 544 EQAFINNKKRYPIQPQGDTVDLSKKIFLKF 573


>gi|255533666|ref|YP_003094038.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
 gi|255346650|gb|ACU05976.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
          Length = 735

 Score =  369 bits (947), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 209/569 (36%), Positives = 310/569 (54%), Gaps = 75/569 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPLP+S +     LQKKIL R    GM P+LPAF+G+VP A ++ FP AK+ 
Sbjct: 199 MGNIDGWGGPLPKSQMLAHEALQKKILERERSFGMTPILPAFTGHVPPAFKDKFPKAKLK 258

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +  NW +  S       Y+LD  D LF  IG+ FIE+++K +G T H+Y  DTF+ENTPP
Sbjct: 259 KT-NWTTFPS------VYILDPEDELFTTIGKRFIEEEVKTFG-TDHLYTADTFNENTPP 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
                Y+S++   +Y  M   D +A W+MQGWLF + + FW+P Q+KALLN++P  K++V
Sbjct: 311 TSDSLYLSNVSKKVYQSMALADPEATWIMQGWLFYHGEKFWKPTQIKALLNAIPNDKMIV 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT---- 235
           LDL++E  P+W  +  +YG P+IW MLHNF GNI +YG +D +A G ++A+ + N+    
Sbjct: 371 LDLWSENHPVWQRTAAYYGKPWIWNMLHNFGGNISLYGRMDEVASGAIKAKQAANSGNMV 430

Query: 236 ------------------------------TMAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  YS +RYG      + AW +LY T
Sbjct: 431 GIGLTPEAIEQNPVMYQLMLDNIWTDEPINVTAWLKNYSRQRYGAQNALAEQAWQILYKT 490

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VY  T G            P    SI++         G+P        +E++    P   
Sbjct: 491 VY--TGG----------ILPGGPESILT---------GRPTM------AESTRSTRPKKN 523

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
           Y  +E+I A E  + +  +LS ++ ++YDL+D+TRQ L  YA+ L     +AYQ  D   
Sbjct: 524 YKPAELIPAWEALLKASQQLS-TDGFKYDLVDVTRQVLVNYADTLQRQFAQAYQGKDGKK 582

Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
             +LS  FL +++D+D LLA    FLLG WL  AK++    E++K+YE NAR  IT+W D
Sbjct: 583 FDRLSGDFLAVMDDVDYLLATRKDFLLGKWLNEAKRMGTTAEEKKRYERNARNLITLWAD 642

Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
               + S L +Y  + WSGL+  +Y PR   +F Y  + L+SG     K +  +  +   
Sbjct: 643 ----QNSSLNEYSCRQWSGLISSFYKPRWQQFFSYAKQQLKSGAKLDQKVFEEKMKRWEW 698

Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKYL 534
           DW N  +V+  + +G+ + T++ LY KY+
Sbjct: 699 DWVNKNDVFTEQPSGNEIKTAESLYKKYI 727


>gi|109491871|ref|XP_001081442.1| PREDICTED: alpha-N-acetylglucosaminidase [Rattus norvegicus]
 gi|392351622|ref|XP_002727861.2| PREDICTED: alpha-N-acetylglucosaminidase [Rattus norvegicus]
 gi|149054262|gb|EDM06079.1| rCG33377, isoform CRA_b [Rattus norvegicus]
          Length = 739

 Score =  368 bits (944), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 209/570 (36%), Positives = 307/570 (53%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP+SW  +QL LQ +IL R+   GM PVLPAF+G+VP A+  VFP   + 
Sbjct: 202 MGNLHTWDGPLPRSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAITRVFPQVNVI 261

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLGNW        + C++LL   DPLF  IG  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 262 QLGNWGHFNCS--YSCSFLLAPGDPLFPLIGTLFLRELTKEFG-TDHIYGADTFNEMQPP 318

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 319 FSDPSYLAAATAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIKAVLEAVPRGRLLV 378

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P++S +  F+G P+IWCMLHNF GN  ++G L+ +  GP  AR   N+TM  
Sbjct: 379 LDLFAETQPVYSRTASFHGQPFIWCMLHNFGGNHGLFGALEDVNQGPQAARLFPNSTMVG 438

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW++ ++ RRYG S P    AW +L  +
Sbjct: 439 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLVAWVSSFASRRYGVSQPDAVAAWRLLLRS 498

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +  +  NR                          P+ K   L+  T+      +
Sbjct: 499 VYNCSGEACSGHNR-------------------------SPLVKRPSLQMSTA------V 527

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S+V  A  L + +   L+AS  +RYDL+D+TRQA+ +  +  +     A+   D  
Sbjct: 528 WYNRSDVFEAWRLLLRAAPNLTASPAFRYDLLDVTRQAVQELVSSCYEEARTAFLNQDLD 587

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + +       +L+  +D LLA +  FLLG WL+ A+++A +E + + YE N+R QIT+W
Sbjct: 588 LLLRAGGLLTYKLLPSLDELLASNSHFLLGTWLDQAREVAVSESEAQFYEQNSRYQITLW 647

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 E ++L DY NK  +GL+ DYY PR  ++   +  SL  G  F+   + +    L
Sbjct: 648 ----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGIPFQQHQFEKSVFPL 702

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              + N +  YP++  GD +  S+ ++ K+
Sbjct: 703 EQAFINNKKRYPIQPQGDTVDLSKKIFLKF 732


>gi|301773566|ref|XP_002922216.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Ailuropoda
           melanoleuca]
          Length = 634

 Score =  367 bits (941), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 206/570 (36%), Positives = 306/570 (53%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP+SW  +QL LQ +IL R+   GM PVLPAF+G+VP AL  VFP   +T
Sbjct: 95  MGNLHTWGGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 154

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG+W        + C++LL   DPLF  IG  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 155 QLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRELTKEFG-TDHIYGADTFNEMQPP 211

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  A++Y  M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 212 SSEPSYLAAATASVYQAMITVDPDAVWLLQGWLFQHQPEFWGPAQVTAVLGAVPRGRLLV 271

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P++  +  F+G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 272 LDLFAESQPVYIRTASFHGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMAG 331

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW++  + RRYG +    + AW +L  +
Sbjct: 332 TGMAPEGIGQNEMVYALMAELGWRKDPVADLEAWVSSSAARRYGVTHKDTEAAWRLLLRS 391

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +  +  NR  +V  P +  +                                 +
Sbjct: 392 VYNCSGEACSGHNRSPLVRRPSLQMATA-------------------------------V 420

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S+V  A  L + +   L+AS ++RYDL+D+TRQA  +  +  +     AY   +  
Sbjct: 421 WYNRSDVFEAWRLLLTAAPTLAASPSFRYDLLDVTRQAAQELVSLYYEEARAAYLNKELV 480

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + + + R + EL+  +D +LA    FLLG WLE A+  A +E + + YE N+R Q+T+W
Sbjct: 481 PLLRAAGRLVYELLPALDKVLASDRRFLLGSWLEQARAAAVSEAEARFYEQNSRYQLTLW 540

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 E ++L DY NK  +GL+ DYY PR  ++ + ++ESL  G  F+   + +   +L
Sbjct: 541 ----GPEGNIL-DYANKQLAGLVADYYAPRWGLFMEMLVESLAQGIPFQQHQFDKNAFQL 595

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +      YP +  GD +  ++ L+ KY
Sbjct: 596 EQAFVFSTQRYPSQPQGDTVDLAKKLFLKY 625


>gi|126307960|ref|XP_001366343.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Monodelphis
           domestica]
          Length = 741

 Score =  367 bits (941), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 199/570 (34%), Positives = 298/570 (52%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP SW  +Q  LQ +IL R+   GM PVLPAF+G++P A   VFP A +T
Sbjct: 202 MGNLHTWGGPLPSSWDLKQSYLQYRILERMRSFGMKPVLPAFAGHIPKAFTRVFPQANVT 261

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W     +  + C+YLL   DPLF  +G  F+ +  KE+G T HIY+ D F+E  PP
Sbjct: 262 NLGMWGHFSCN--YSCSYLLAPEDPLFPVVGSLFLRELTKEFG-TDHIYSADIFNEMDPP 318

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
             +P Y+++  AA+Y  M + D DAVWL QGWLF   P FW+PPQMKA+L +VP G+ ++
Sbjct: 319 SSNPAYLAATTAAVYEAMVAVDVDAVWLFQGWLFQNHPDFWKPPQMKAVLEAVPRGRFLI 378

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P++S +  FYG P+IWCMLHNF GN  ++G+LD++  GP  AR   N+T+  
Sbjct: 379 LDLFAESQPVYSRTNSFYGQPFIWCMLHNFGGNHGLFGVLDAVNRGPSTARLFPNSTIVG 438

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ +RYG   P  + AW +L  +
Sbjct: 439 TGIVPEGINQNEVVYALMAELGWRKDPFPDLGAWVAGFAAQRYGTPHPQAEAAWRLLLRS 498

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +  T  N   +V  P +                                    +
Sbjct: 499 VYNCSWENCTGHNHSPLVKRPSLHLDF-------------------------------SV 527

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S+V  A  L + +  +L+ S+ +RYDL+D+TRQ   +  +  +  +  A++     
Sbjct: 528 WYNRSDVFEAWRLLLEAAPQLATSSAFRYDLLDVTRQVAQELVSLYYGELKTAFEAGSMP 587

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            +       + +L+  +D LL   + FLLG WLE A+++A +E +   YE NAR Q+T+W
Sbjct: 588 ALLSAGGLLVFDLLPSLDELLGTDERFLLGGWLEQAREMAVSEAEAWHYEQNARYQLTLW 647

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                     + DY NK  +GL+  YY PR  ++ + +++SL  G  F    +  E   L
Sbjct: 648 GPTGN-----ILDYANKQLAGLVAGYYAPRWKLFVEMLVKSLAEGTPFHQNQFENEAFLL 702

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              + +GR  +P +  GD +  ++  + KY
Sbjct: 703 GQAFVSGREKFPTQPQGDTVDLARKFFLKY 732


>gi|281344539|gb|EFB20123.1| hypothetical protein PANDA_011160 [Ailuropoda melanoleuca]
          Length = 619

 Score =  367 bits (941), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 206/570 (36%), Positives = 306/570 (53%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP+SW  +QL LQ +IL R+   GM PVLPAF+G+VP AL  VFP   +T
Sbjct: 81  MGNLHTWGGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 140

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG+W        + C++LL   DPLF  IG  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 141 QLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRELTKEFG-TDHIYGADTFNEMQPP 197

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  A++Y  M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 198 SSEPSYLAAATASVYQAMITVDPDAVWLLQGWLFQHQPEFWGPAQVTAVLGAVPRGRLLV 257

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P++  +  F+G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 258 LDLFAESQPVYIRTASFHGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMAG 317

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW++  + RRYG +    + AW +L  +
Sbjct: 318 TGMAPEGIGQNEMVYALMAELGWRKDPVADLEAWVSSSAARRYGVTHKDTEAAWRLLLRS 377

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +  +  NR  +V  P +  +                                 +
Sbjct: 378 VYNCSGEACSGHNRSPLVRRPSLQMATA-------------------------------V 406

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S+V  A  L + +   L+AS ++RYDL+D+TRQA  +  +  +     AY   +  
Sbjct: 407 WYNRSDVFEAWRLLLTAAPTLAASPSFRYDLLDVTRQAAQELVSLYYEEARAAYLNKELV 466

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + + + R + EL+  +D +LA    FLLG WLE A+  A +E + + YE N+R Q+T+W
Sbjct: 467 PLLRAAGRLVYELLPALDKVLASDRRFLLGSWLEQARAAAVSEAEARFYEQNSRYQLTLW 526

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 E ++L DY NK  +GL+ DYY PR  ++ + ++ESL  G  F+   + +   +L
Sbjct: 527 ----GPEGNIL-DYANKQLAGLVADYYAPRWGLFMEMLVESLAQGIPFQQHQFDKNAFQL 581

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +      YP +  GD +  ++ L+ KY
Sbjct: 582 EQAFVFSTQRYPSQPQGDTVDLAKKLFLKY 611


>gi|373953359|ref|ZP_09613319.1| alpha-N-acetylglucosaminidase [Mucilaginibacter paludis DSM 18603]
 gi|373889959|gb|EHQ25856.1| alpha-N-acetylglucosaminidase [Mucilaginibacter paludis DSM 18603]
          Length = 733

 Score =  367 bits (941), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 209/570 (36%), Positives = 296/570 (51%), Gaps = 74/570 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  WGGPLP  W+     LQKKI+ R   LGM PVLPAF+G+VPAA +N +P+AK+ 
Sbjct: 195 MGNMDSWGGPLPLRWMQTHFDLQKKIIARERALGMKPVLPAFTGHVPAAFKNKYPTAKL- 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +  NW +  +D     TY+LD+ DP+F  IG+ F+++Q    G T H+Y+ DTF+EN PP
Sbjct: 254 KTTNWKNGFAD-----TYILDSADPMFARIGQLFLQKQTALLG-TDHLYSADTFNENEPP 307

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
            D PEY+  L   +Y GM   D+ AVW+MQGWLF  D  FW+P Q +ALL +VP  K+++
Sbjct: 308 SDEPEYLGKLSERVYQGMHQADTAAVWVMQGWLFYSDRKFWKPEQTRALLKAVPDDKMII 367

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA---------- 229
           LDL  E++P+W  ++ FYG P+IW ML+NF  N  ++G +DS A GP EA          
Sbjct: 368 LDLATEIEPVWKRTEAFYGKPWIWNMLNNFGANTNLFGRMDSAAKGPAEAYHDPKSGQMK 427

Query: 230 ------------------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                   R        W+ +Y + RYG+     Q AWN+L  T
Sbjct: 428 GIGLTMEGIEQNPVLYDLLTDNTWRNQPINVDEWLPKYVLNRYGKPNAQAQKAWNILRKT 487

Query: 266 VYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
           VY+        D    +I A P  D S                          S +    
Sbjct: 488 VYSVLADRYIRDGAESIIQARPTTDSS--------------------------SRWARTT 521

Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
           L Y    ++ A +  I +  +LS S+ +R+DL+DL+RQ LA YA  L    + A+Q  DA
Sbjct: 522 LNYEPKALLPAWQAMIKASEDLSTSDGFRFDLVDLSRQVLANYAFTLQRRFVLAHQQKDA 581

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
               + S  F+EL++DMD LLA    FLLGPW+  A++      ++  YE NA+  IT+W
Sbjct: 582 AAFKKHSAEFIELIQDMDQLLATRKDFLLGPWVADARRCGATVSEKALYEMNAKDLITLW 641

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
            D    +   L +Y  + WSGLL D+Y PR   YF+ +   L     F  + + R+    
Sbjct: 642 GD----KDCPLNEYACRQWSGLLNDFYKPRWQQYFEQINLDLTGKKPFDKEAFERKIKSW 697

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              W N R  YPV+  GD ++ ++ LY KY
Sbjct: 698 EWQWVNARKDYPVKPQGDPVLEARKLYKKY 727


>gi|440903235|gb|ELR53922.1| Alpha-N-acetylglucosaminidase, partial [Bos grunniens mutus]
          Length = 614

 Score =  364 bits (935), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 213/574 (37%), Positives = 305/574 (53%), Gaps = 84/574 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ +IL R+   GM PVLPAF+G+VP AL  VFP   +T
Sbjct: 77  MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 136

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+GNW        + C++LL   DPLF  +G  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 137 QMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 193

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 194 SSEPSYLAAATTAVYQAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVAAVLGAVPRGRLLV 253

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+S+  GP  AR   N+TM  
Sbjct: 254 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPTTARHFPNSTMVG 313

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ RRYG S    + AW +L  +
Sbjct: 314 TGMAPEGIGQNEVVYALMAELGWKKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 373

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VYNC                       S  E +  N+  P+ +   L+  T+      +W
Sbjct: 374 VYNC-----------------------SGEECRGHNH-SPLVRRPSLQMVTT------VW 403

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 379
           Y+ S+V  A  L +A+ + L++S  +RYDL+D+TRQA+ +  +  +  +  AY       
Sbjct: 404 YNRSDVFEAWRLLLAATSTLASSPAFRYDLVDVTRQAVQELVSLYYEEMRTAYLKKELVP 463

Query: 380 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 439
           L  A G+        EL+  +D +LA    FLLG WLE A+Q A +E +   YE N+R Q
Sbjct: 464 LTRAGGILA-----YELLPALDQVLASDCHFLLGSWLEQARQAAVSETEAHFYEQNSRYQ 518

Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 499
           +T+W      E ++L DY NK  +GL+ DYY PR  ++ + ++ESL  G  F+   + R 
Sbjct: 519 LTLW----GPEGNIL-DYANKQLAGLMADYYAPRWRLFTETLVESLVQGVPFQQHQFDRN 573

Query: 500 WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
             +L   +  G   YP +  GD +   + L+ KY
Sbjct: 574 AFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKY 607


>gi|348533253|ref|XP_003454120.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Oreochromis
           niloticus]
          Length = 845

 Score =  364 bits (934), Expect = 8e-98,   Method: Compositional matrix adjust.
 Identities = 198/570 (34%), Positives = 315/570 (55%), Gaps = 77/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL  + GPLPQSW   QL LQ KIL R+   GM PVLPAFSGN+P  +  ++P A++T
Sbjct: 308 MANLFKFAGPLPQSWHVNQLYLQFKILERMRSFGMIPVLPAFSGNIPKGILRLYPEARVT 367

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG W        + C+ +LD  DPLF  IG  ++ Q LK++G T HIY+ DTF+E TPP
Sbjct: 368 RLGPWSHFNCS--YSCSLVLDPQDPLFHHIGSLYLSQVLKQFG-TDHIYSTDTFNEMTPP 424

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+S++  ++++ M + D  AVWLMQGWLF  D  FW+P Q++ALL+ VPLG+++V
Sbjct: 425 SSDPAYLSAVSRSVFASMTAVDPQAVWLMQGWLFFSDAAFWKPAQIQALLHGVPLGRMIV 484

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLFAE +PI+S ++ FYG P+IWCML NF GN  ++G ++SI  GP +A    N+T+  
Sbjct: 485 LDLFAETEPIFSYTESFYGQPFIWCMLQNFGGNSGLFGTVESINSGPFKALHFPNSTLVG 544

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W++ Y++RRYG +  ++  AW +L+ ++
Sbjct: 545 IGMTPEGIEQNPVTYELMSELAWRKEPVNLAKWVSLYAIRRYGNTQESLTTAWRLLFASI 604

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG-KPVSKEAVLKSETSSYDHPHLW 325
           YNCTD                           Y+N+   P+ +    +  T       LW
Sbjct: 605 YNCTD-------------------------PHYRNHNHSPLVRRPSFQMNTG------LW 633

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
           Y  +++ +A +L + +   L +  T+RYDL+D+TR+ L       + +I +A++  +   
Sbjct: 634 YDPADLYKAWKLIMDAAPSLMSKETFRYDLVDVTREVLQVLTTSFYRDIADAFKKQNLSE 693

Query: 386 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
           +       + +L+ +++ LL+ +  FLLG WLE A+ LA ++++ + Y+ NAR QIT+W 
Sbjct: 694 LLTAGGVLVYDLLPELNRLLSSNRNFLLGAWLERARSLAVDDKEAQLYDMNARNQITLWG 753

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
            + +     + DY +K W GL+ DYY  R  ++ + ++E L SG  F+   + +   ++ 
Sbjct: 754 PSGE-----ILDYASKEWGGLMEDYYAQRWGLFVQTLVECLNSGQPFKQAAFNQAVFQIE 808

Query: 505 NDW-QNGRNVYPVESNGDALITSQWLYNKY 533
             +  NGR  YP +  GD    +  ++ KY
Sbjct: 809 KGFIYNGRK-YPTKPQGDTYEIAYRIFLKY 837


>gi|410981277|ref|XP_003996997.1| PREDICTED: alpha-N-acetylglucosaminidase [Felis catus]
          Length = 857

 Score =  364 bits (934), Expect = 8e-98,   Method: Compositional matrix adjust.
 Identities = 208/570 (36%), Positives = 304/570 (53%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP SW  +QL LQ +IL R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 318 MGNLHTWGGPLPPSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAITRVFPQVNVT 377

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG+W        + C++LL   DPLF  IG  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 378 QLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRELTKEFG-TDHIYGADTFNEMQPP 434

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y++S  A++Y  M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 435 SSEPSYLASATASVYQAMVTVDPDAVWLLQGWLFQHQPQFWGPAQVSAVLGAVPRGRLLV 494

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 495 LDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMVG 554

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ RRYG S    + AW +L  +
Sbjct: 555 TGMAPEGIGQNEVVYALMAELGWRKDPVADLEAWVTGFAARRYGVSHGNTEAAWRLLLRS 614

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +  +  NR  +V  P                          LK  T+      +
Sbjct: 615 VYNCSGEACSGHNRSPLVRRPS-------------------------LKMTTT------V 643

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S+V  A  L + +   L+ S T+RYDL+D+TRQA  +  +  +     AY   +  
Sbjct: 644 WYNRSDVFEAWRLLLTTTPSLATSPTFRYDLLDVTRQAAQELVSLYYGEARTAYLNKELV 703

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + + +   + EL+  +D +LA    FLLG WLE A+  A +E +   YE N+R Q+T+W
Sbjct: 704 PLLRAAGILVYELLPSLDKVLASDSRFLLGSWLEQARAAAVSEAEAHFYEQNSRYQLTLW 763

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 E ++L DY NK  +GL+ DYY PR  ++ + ++ESL  G  F+   + +   +L
Sbjct: 764 ----GPEGNIL-DYANKQLAGLVADYYTPRWRLFMEMLVESLVRGVPFQQHQFDQNAFQL 818

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +      YP + +GD +  ++ L+ +Y
Sbjct: 819 EQTFVLSTQRYPSQPHGDTVDLAKKLFLRY 848


>gi|410930376|ref|XP_003978574.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Takifugu rubripes]
          Length = 751

 Score =  362 bits (929), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 201/568 (35%), Positives = 314/568 (55%), Gaps = 73/568 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  +GGPLPQSW   QL LQ KIL ++   GM PVLPAFSGN+P  +  +FP A++T
Sbjct: 214 MGNMFKFGGPLPQSWHVNQLYLQFKILAQMRSFGMIPVLPAFSGNIPKGILRLFPEARVT 273

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L  W   K +  + C+Y+LD  DPLF  IG  ++ Q +K++G T+HIYN DTF+E TPP
Sbjct: 274 RLEPW--SKFNCSFSCSYILDPRDPLFSRIGSLYLSQVVKQFG-TNHIYNTDTFNEMTPP 330

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+S++  A+++ M + D  AVWLMQGWLF  D  FW+P Q++ALLN VP+G+++V
Sbjct: 331 SSEPTYLSAVSRAVFASMTAVDPQAVWLMQGWLFLSDALFWKPAQIQALLNGVPVGRMIV 390

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLFAE +P++S ++ FYG P+IWCMLHNF GN   +G ++SI  GP +A    N+++  
Sbjct: 391 LDLFAETEPVFSYTESFYGQPFIWCMLHNFGGNGGFFGTVESINTGPFKALHFPNSSLVG 450

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W++ Y  RRYG    ++  AW +L+ +V
Sbjct: 451 IGMTPEGIEQNPVVYELMSELAWRKEPVNLLKWVSLYVTRRYGSMHESVSAAWKILFASV 510

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YNCT              P             Y+N+       + L    S + +  LWY
Sbjct: 511 YNCT-------------LP------------HYRNHN-----HSPLVRRPSFHMNSELWY 540

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             +++ RA +L + +     +  T++YDL+D+TRQ +       + +I++A+Q +    +
Sbjct: 541 DPADLYRAWKLILEAAPSFMSKETFQYDLVDVTRQVMQVLTTSYYQDIVDAFQKHKMQEL 600

Query: 387 FQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
                  L +L+ +++ LL+ +  FLLG WLE A+ LA +E + K Y+ NAR Q+T+W  
Sbjct: 601 LTAGGVLLYDLLPELNRLLSSNHNFLLGTWLEQARSLALDEREAKLYDINARNQLTLWGP 660

Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
           + +     + DY NK W GL++DYY  R  ++   ++E L+SG  F+  ++ +   ++  
Sbjct: 661 SGE-----ILDYANKQWGGLMQDYYAQRWGLFIHTLVECLDSGQPFKQDNFNKVVFQVEK 715

Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKY 533
            +   R  YP +  GD    +  ++ KY
Sbjct: 716 GFIYNRRQYPTKPQGDTFEIAHRIFLKY 743


>gi|326679829|ref|XP_688608.3| PREDICTED: alpha-N-acetylglucosaminidase-like [Danio rerio]
          Length = 757

 Score =  362 bits (928), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 205/569 (36%), Positives = 303/569 (53%), Gaps = 75/569 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  WGGPLPQSW  +QL LQ KIL R+   GM PVLPAFSG VP  +  +FP A +T
Sbjct: 214 MGNLFQWGGPLPQSWHVKQLYLQFKILDRMRSFGMIPVLPAFSGIVPEGITRLFPKANVT 273

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L  W        + C Y+LD  DPLF  IG  F+ Q ++E+G T HIYN DTF+E  P 
Sbjct: 274 KLSPWSHFNCT--YSCAYVLDPRDPLFHRIGALFLTQVIEEFG-TDHIYNTDTFNEMPPA 330

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y++S+  AI++ M S D  A+WLMQGWLF  DP FW+  Q+KALL+ VPLG+++V
Sbjct: 331 SSDPTYLASISRAIFNTMTSVDPQAIWLMQGWLFISDPSFWKADQVKALLHGVPLGRMIV 390

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLFAE  P++S++  FYG P+IWCMLHNF GN  ++G +DSI  GP  A    N+T+  
Sbjct: 391 LDLFAESMPVYSSTNSFYGQPFIWCMLHNFGGNSGLFGTVDSINSGPFNAVRFPNSTLVG 450

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W++ Y++RRYG     +  AW +L+ +V
Sbjct: 451 LGMTPEGIEQNPVIYELMSELAWRKDPVNLYKWVSLYALRRYGSMDENLALAWQLLFRSV 510

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK-PVSKEAVLKSETSSYDHPHLW 325
           YNCT              P            KY+N+ + P+     L  +T       +W
Sbjct: 511 YNCT-------------LP------------KYKNHNRSPLVHRPSLHMQTD------IW 539

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
           Y  ++  RA +L   +   L    T+RYDL+D+TRQAL     E + +I  A+Q      
Sbjct: 540 YDPADFYRAWKLLFEAAPGLVTLETFRYDLVDVTRQALQLLTTEFYKDIKSAFQTQKLSD 599

Query: 386 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
           +       + +L+ ++D +L+ ++ FLLG WL+ A+    +E +   Y+ NAR QIT+W 
Sbjct: 600 LLTAGGVLVYDLLPELDRILSSNEHFLLGAWLQQAQSQGVDEHEAHLYDINARNQITLWG 659

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
            + +     + DY +K W+GL+ DYY  R  ++   ++E L+ G  F+   + +   ++ 
Sbjct: 660 PDGE-----ILDYASKEWAGLVEDYYLQRWGLFVNTLVECLDRGRPFKQDVFNQAVFQVE 714

Query: 505 NDWQNGRNVYPVESNGDALITSQWLYNKY 533
             +   +  YP +  GD    ++ ++ KY
Sbjct: 715 KGFVFNQRKYPTKPLGDTYDIARRIFLKY 743


>gi|350407422|ref|XP_003488083.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bombus impatiens]
          Length = 770

 Score =  361 bits (927), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 209/576 (36%), Positives = 301/576 (52%), Gaps = 75/576 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPL  SW ++ L LQ +IL R+ ELG+ PVLPAF+G+VP A   +FP A +T
Sbjct: 218 MGNIRGFGGPLTSSWHERSLQLQHRILQRMRELGIIPVLPAFTGHVPRAFPRLFPEANVT 277

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +   W S     ++CC YLL+ TDPLF +IG  F+   +KE+G T HIYNCDTF+EN PP
Sbjct: 278 KSATWNSFSD--KYCCPYLLEPTDPLFHKIGDQFLRTYIKEFG-TDHIYNCDTFNENEPP 334

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
               +++ ++G +I+  M S D  A+WLMQGWLF +D  FW  P++KA L SVPLG+L+V
Sbjct: 335 TSELKFLRNVGHSIFQTMLSVDPQAIWLMQGWLFVHDAVFWTEPRIKAFLTSVPLGRLIV 394

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDL +E  P++   K +YG P+IWCMLHNF G + M+G    I     E R  E +TM  
Sbjct: 395 LDLQSEQFPLYGKLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINRRVFEGRNMEGSTMIG 454

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W   Y+ RRYG        AW  L  TV
Sbjct: 455 TGLTPEGINQNYVIYELMNEMAYRQEPVNLDNWFEDYASRRYGAWNEYAVAAWKNLGSTV 514

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YN                       IS   GKY    +P    A L            WY
Sbjct: 515 YNFRG--------------------ISKIRGKYVITRRPSLNLARLT-----------WY 543

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
              +      +F+ + +    S  YR+D++D+TRQAL   A++++  ++E++   D    
Sbjct: 544 DPEKFYSTWYIFLQARHGRKNSTLYRHDVVDITRQALQLKADKIYSVLVESFNQKDVTTF 603

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
              + R LEL +D++ +LA  + FLLG WLE AK LA ++ + K YE+NAR QIT+W   
Sbjct: 604 KLQAGRLLELFDDLEAILASSEDFLLGTWLEMAKNLATDDAESKLYEYNARNQITLWGPR 663

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW-RREWIKLTN 505
            +     +RDY NK WSG++ DY+ PR AI+   +  SL  G    +     R + ++  
Sbjct: 664 GE-----IRDYANKQWSGIVSDYFKPRWAIFLDGLTTSLTKGTSLNITRINERIFKEVEK 718

Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFD 541
            +   R +YP  + GD +  +  + +K+ Q + +FD
Sbjct: 719 PFTLSRKIYPTNATGDCIDIAMRILSKWYQPS-IFD 753


>gi|358419179|ref|XP_003584151.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bos taurus]
          Length = 741

 Score =  360 bits (925), Expect = 8e-97,   Method: Compositional matrix adjust.
 Identities = 213/574 (37%), Positives = 305/574 (53%), Gaps = 84/574 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ +IL R+   GM PVLPAF+G+VP AL  VFP   +T
Sbjct: 204 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+GNW        + C++LL   DPLF  +G  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 264 QMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATAAVYQAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVAAVLGAVPRGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+S+  GP  AR   N+TM  
Sbjct: 381 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPTTARHFPNSTMVG 440

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ RRYG S    + AW +L  +
Sbjct: 441 TGMAPEGIGQNEVVYALMAELGWQKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 500

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VYNC                       S  E +  N+  P+ +   L+  T+      +W
Sbjct: 501 VYNC-----------------------SGEECRGHNH-SPLVRRPSLQMVTT------VW 530

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 379
           Y+ S+V  A  L + + + L++S  +RYDL+D+TRQA+ +  +  +  +  AY       
Sbjct: 531 YNRSDVFEAWRLLLTATSTLASSPAFRYDLVDVTRQAVQELVSLYYEEMRTAYLKKELVP 590

Query: 380 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 439
           L  A G+        EL+  +D +LA    FLLG WLE A+Q A +E +   YE N+R Q
Sbjct: 591 LTRAGGILA-----YELLPALDQVLASDCHFLLGSWLEQARQAAVSETEAHFYEQNSRYQ 645

Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 499
           +T+W      E ++L DY NK  +GL+ DYY PR  ++ + ++ESL  G  F+   + R 
Sbjct: 646 LTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRLFTETLVESLVQGVPFQQHQFDRN 700

Query: 500 WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
             +L   +  G   YP +  GD +   + L+ KY
Sbjct: 701 AFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKY 734


>gi|311267179|ref|XP_003131436.1| PREDICTED: alpha-N-acetylglucosaminidase [Sus scrofa]
          Length = 744

 Score =  360 bits (924), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 210/575 (36%), Positives = 304/575 (52%), Gaps = 86/575 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP+SW  +QL LQ +IL R+   GM PVLPAF+G+VP AL  VFP   +T
Sbjct: 205 MGNLHTWSGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQISVT 264

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+G+W        + C++LL   DPLF  +G  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 265 QMGSWGHFNCS--YSCSFLLAPEDPLFPIVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 321

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 322 SSEPSYLAAATAAVYQAMITVDPDAVWLLQGWLFQHQPQFWGPAQVGAVLGAVPRGRLLV 381

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+S+  GP  AR   N+TMA 
Sbjct: 382 LDLFAESQPVYVRTASFLGQPFIWCMLHNFGGNHGLFGALESVNQGPAAARLFPNSTMAG 441

Query: 239 ---------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                            W+  ++ RRYG S    + AW +L  +
Sbjct: 442 TGMAPEGIGQNEVVYALMAELGWRKDPVADLGTWVTSFAARRYGVSQGDAEAAWRLLLRS 501

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +G T  NR  +V  P +  +                                 +
Sbjct: 502 VYNCSGEGCTGHNRSPLVRRPSLQMATT-------------------------------V 530

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 378
           WY+ S+V  A  L + +   L++S  +RYDL+D+TRQA+ +  +  +     AY      
Sbjct: 531 WYNQSDVFEAWRLLLKATPTLASSPAFRYDLVDITRQAVQELVSLYYEEARTAYLNKELV 590

Query: 379 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 438
            L  A G+        EL+  +D +LA    FLLG WLE A+ +A +E +   YE N+R 
Sbjct: 591 SLMRAGGILA-----YELLPALDKVLASDSHFLLGSWLEQARGVAVSEAEALFYEQNSRY 645

Query: 439 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 498
           Q+T+W      E ++L DY NK  +GL+ DYY PR  ++ + ++ESL  G  F+   + +
Sbjct: 646 QLTLW----GPEGNIL-DYANKQLAGLVADYYTPRWRLFMEMLVESLVQGIPFQQHQFDQ 700

Query: 499 EWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +L   +  G   YP +  GD +  ++ L+ KY
Sbjct: 701 NVFQLEQTFVLGTRRYPSQPQGDTVDLAKKLFLKY 735


>gi|340717403|ref|XP_003397173.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bombus terrestris]
          Length = 770

 Score =  360 bits (923), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 207/570 (36%), Positives = 295/570 (51%), Gaps = 74/570 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPL  SW ++ L LQ KIL R+ ELG+ PVLPAF+G+VP A   +FP A +T
Sbjct: 218 MGNIRGFGGPLTSSWHERSLQLQHKILQRMRELGIIPVLPAFTGHVPRAFPRLFPEANVT 277

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +   W S     ++CC YLL+ TDPLF +IG  F+   +KE+G T HIYNCDTF+EN PP
Sbjct: 278 KSATWNSFSD--KYCCPYLLEPTDPLFHKIGDQFLRTYIKEFG-TDHIYNCDTFNENEPP 334

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
               +++ ++G +I+  M S D  A+WLMQGWLF +D  FW  P++K  L SVPLG+L+V
Sbjct: 335 TSELKFLRNVGHSIFQTMLSVDPQAIWLMQGWLFVHDALFWTEPRIKTFLTSVPLGRLIV 394

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDL +E  P++   K +YG P+IWCMLHNF G + M+G    I     E R  E +TM  
Sbjct: 395 LDLQSEQFPLYGKLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINRRVFEGRNMEGSTMIG 454

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W   Y+ RRYG        AW  L  TV
Sbjct: 455 TGLTPEGINQNYVIYELMNEMAYRQEPVNLDNWFEDYASRRYGAWNEYAVAAWKNLGSTV 514

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YN                       IS   GKY    +P    A L            WY
Sbjct: 515 YNFRG--------------------ISKIRGKYVITRRPSLNLARLT-----------WY 543

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
              +      +F+ + +    S  YR+D++D+TRQAL   A++++  ++E++   D    
Sbjct: 544 DPEKFYSTWYIFLQARHGRQNSTLYRHDVVDITRQALQLKADKIYSALVESFNQKDVTTF 603

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
              + R LEL +D++ +LA  + FLLG WLE AK LA ++ + K YE+NAR QIT+W   
Sbjct: 604 KLQADRLLELFDDLEAILASSEDFLLGTWLEMAKNLATDDAESKLYEYNARNQITLWGPR 663

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW-RREWIKLTN 505
            +     +RDY NK WSG++ DY+ PR AI+   +  SL  G    +     R + ++  
Sbjct: 664 GE-----IRDYANKQWSGIVSDYFKPRWAIFLDALTTSLTKGTSLNITRINERIFKEVEK 718

Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
            +   R +YP    GD +  +  + +K+ Q
Sbjct: 719 PFTLSRKIYPTNVTGDCIDIAMRILSKWHQ 748


>gi|148671928|gb|EDL03875.1| alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB), isoform
           CRA_a [Mus musculus]
          Length = 538

 Score =  360 bits (923), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 202/570 (35%), Positives = 302/570 (52%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP+SW   Q+ LQ +IL R+   GM PVLPAF+G+VP A+  VFP   + 
Sbjct: 1   MGNLHTWDGPLPRSWHLSQVYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVI 60

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W        + C++LL   DP+F  IG  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 61  KLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRELTKEFG-TDHIYGADTFNEMQPP 117

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 118 FSDPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIRAVLEAVPRGRLLV 177

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT--- 236
           LDLFAE  P++  +  F+G P+IWCMLHNF GN  ++G L+ +  GP  AR   N+T   
Sbjct: 178 LDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLFGALEDVNRGPQAARLFPNSTMVG 237

Query: 237 -------------------------------MAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                          MAW++ +++RRYG S P    AW +L  +
Sbjct: 238 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSSFAIRRYGVSQPDAVAAWKLLLRS 297

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +  +  NR                          P+ K   L+  T+      +
Sbjct: 298 VYNCSGEACSGHNRS-------------------------PLVKRPSLQMSTA------V 326

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S+V  A  L + +   L+ S  +RYDL+D+TRQA+ +  +  +     AY   +  
Sbjct: 327 WYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQAVQELVSLCYEEARTAYLKQELD 386

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + +     + +L+  +D LLA    FLLG WL+ A++ A +E + + YE N+R QIT+W
Sbjct: 387 LLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQARKAAVSEAEAQFYEQNSRYQITLW 446

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 E ++L DY NK  +GL+ DYY PR  ++   +  SL  G  F+  ++ +    L
Sbjct: 447 ----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGVPFQQHEFEKNVFPL 501

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +   +  YP +  GD +  S+ ++ KY
Sbjct: 502 EQAFVYNKKRYPSQPRGDTVDLSKKIFLKY 531


>gi|254910995|ref|NP_038820.2| alpha-N-acetylglucosaminidase precursor [Mus musculus]
 gi|20385160|gb|AAM21194.1|AF363242_1 N-acetyl-glucosaminidase [Mus musculus]
 gi|3329361|gb|AAC26842.1| alpha-N-acetylglucosaminidase [Mus musculus]
 gi|33585908|gb|AAH55733.1| Alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB) [Mus
           musculus]
 gi|74211094|dbj|BAE37639.1| unnamed protein product [Mus musculus]
 gi|74218052|dbj|BAE42009.1| unnamed protein product [Mus musculus]
 gi|148671929|gb|EDL03876.1| alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB), isoform
           CRA_b [Mus musculus]
          Length = 739

 Score =  358 bits (918), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 202/570 (35%), Positives = 302/570 (52%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP+SW   Q+ LQ +IL R+   GM PVLPAF+G+VP A+  VFP   + 
Sbjct: 202 MGNLHTWDGPLPRSWHLSQVYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVI 261

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W        + C++LL   DP+F  IG  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 262 KLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRELTKEFG-TDHIYGADTFNEMQPP 318

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 319 FSDPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIRAVLEAVPRGRLLV 378

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT--- 236
           LDLFAE  P++  +  F+G P+IWCMLHNF GN  ++G L+ +  GP  AR   N+T   
Sbjct: 379 LDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLFGALEDVNRGPQAARLFPNSTMVG 438

Query: 237 -------------------------------MAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                          MAW++ +++RRYG S P    AW +L  +
Sbjct: 439 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSSFAIRRYGVSQPDAVAAWKLLLRS 498

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +  +  NR                          P+ K   L+  T+      +
Sbjct: 499 VYNCSGEACSGHNR-------------------------SPLVKRPSLQMSTA------V 527

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S+V  A  L + +   L+ S  +RYDL+D+TRQA+ +  +  +     AY   +  
Sbjct: 528 WYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQAVQELVSLCYEEARTAYLKQELD 587

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + +     + +L+  +D LLA    FLLG WL+ A++ A +E + + YE N+R QIT+W
Sbjct: 588 LLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQARKAAVSEAEAQFYEQNSRYQITLW 647

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 E ++L DY NK  +GL+ DYY PR  ++   +  SL  G  F+  ++ +    L
Sbjct: 648 ----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGVPFQQHEFEKNVFPL 702

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +   +  YP +  GD +  S+ ++ KY
Sbjct: 703 EQAFVYNKKRYPSQPRGDTVDLSKKIFLKY 732


>gi|2660688|gb|AAB88084.1| Naglu [Mus musculus]
          Length = 739

 Score =  358 bits (918), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 202/570 (35%), Positives = 302/570 (52%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP+SW   Q+ LQ +IL R+   GM PVLPAF+G+VP A+  VFP   + 
Sbjct: 202 MGNLHTWDGPLPRSWHLSQVYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVI 261

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W        + C++LL   DP+F  IG  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 262 KLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRELTKEFG-TDHIYGADTFNEMQPP 318

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 319 FSEPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIRAVLEAVPRGRLLV 378

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT--- 236
           LDLFAE  P++  +  F+G P+IWCMLHNF GN  ++G L+ +  GP  AR   N+T   
Sbjct: 379 LDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLFGALEDVNRGPQAARLFPNSTMVG 438

Query: 237 -------------------------------MAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                          MAW++ +++RRYG S P    AW +L  +
Sbjct: 439 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSSFAIRRYGVSQPDAVAAWKLLLRS 498

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +  +  NR                          P+ K   L+  T+      +
Sbjct: 499 VYNCSGEACSGHNRS-------------------------PLVKRPSLQMSTA------V 527

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S+V  A  L + +   L+ S  +RYDL+D+TRQA+ +  +  +     AY   +  
Sbjct: 528 WYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQAVQELVSLCYEEARTAYLKQELD 587

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + +     + +L+  +D LLA    FLLG WL+ A++ A +E + + YE N+R QIT+W
Sbjct: 588 LLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQARKAAVSEAEAQFYEQNSRYQITLW 647

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 E ++L DY NK  +GL+ DYY PR  ++   +  SL  G  F+  ++ +    L
Sbjct: 648 ----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGVPFQQHEFEKNVFPL 702

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +   +  YP +  GD +  S+ ++ KY
Sbjct: 703 EQAFVYNKKRYPSQPRGDTVDLSKKIFLKY 732


>gi|91080563|ref|XP_973259.1| PREDICTED: similar to alpha-N-acetyl glucosaminidase [Tribolium
           castaneum]
          Length = 747

 Score =  356 bits (914), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 197/572 (34%), Positives = 314/572 (54%), Gaps = 79/572 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPL  +W  + LVLQK+IL R+   G+ PVLPAF+G++P A + ++P A ++
Sbjct: 201 MGNMRGFGGPLSPAWHSRSLVLQKQILQRMRAFGIIPVLPAFAGHLPRAFKTLYPDANMS 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W     +  +CC Y LD T+PLF EIG+AF+ +Q+ E+G T H+YNCD+F+EN P 
Sbjct: 261 KMAPWNGF--NDTYCCPYFLDPTEPLFNEIGKAFLSEQISEFG-TDHMYNCDSFNENVPT 317

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
                Y++++G +IY  M   D DAVW+MQGWLF++D F W   + KA+L +VP GK++V
Sbjct: 318 SGDLTYLANVGKSIYKAMTDTDPDAVWVMQGWLFAHDFFYWTRNRAKAILTAVPKGKMIV 377

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDL +E  P +    Q++G PYIWCMLH+F G + M+G    I   P++AR  EN+TM  
Sbjct: 378 LDLQSEQFPQYERLNQYFGQPYIWCMLHDFGGTLGMFGSSTVINEVPIKARHLENSTMIG 437

Query: 238 -------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W  +YS RRYG      ++AW +L  TV
Sbjct: 438 TGLTPEGINQNYVIYELMTETAWRQAPVNLTEWFEKYSTRRYGFPDSDAENAWRILQRTV 497

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y                    D   ++   GKY      ++K   LK +  +      WY
Sbjct: 498 Y--------------------DYQGLNRMRGKY-----AITKSPSLKIKIWT------WY 526

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
           ST++++ A    + + + L A++ Y +DL+D+TRQ L  Y +  +  +++ YQ +D+   
Sbjct: 527 STNDLLEAWTSLLEASDNLGANSGYLHDLVDVTRQVLQVYGDLYYKEMVKNYQSHDSANF 586

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
              S++FLE+++D+D +L+ +  FLLGPWLE+AK+ A +  +E Q+E+NAR QIT+W   
Sbjct: 587 QANSKKFLEILDDLDEILSTNSAFLLGPWLEAAKKAANDSAEEAQFEYNARNQITLWGPR 646

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
            +     + DY NK W+G++  ++ PR  ++  Y+  + +        D  + + ++   
Sbjct: 647 GE-----IMDYANKQWAGVVSHFFAPRWYLFINYLNSTFDGAFNQTYID-AKMFKEVEEP 700

Query: 507 WQNGRNVYPVESNGDAL-----ITSQWLYNKY 533
           +   R  +PVE  GDA+     I  +W   +Y
Sbjct: 701 FTFDRTEFPVEPIGDAVEIAWKIHKKWTSEEY 732


>gi|270005801|gb|EFA02249.1| hypothetical protein TcasGA2_TC007912 [Tribolium castaneum]
          Length = 747

 Score =  356 bits (913), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 196/572 (34%), Positives = 315/572 (55%), Gaps = 79/572 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPL  +W  + LVLQK+IL R+   G+ PVLPAF+G++P A + ++P A ++
Sbjct: 201 MGNMRGFGGPLSPAWHSRSLVLQKQILQRMRAFGIIPVLPAFAGHLPRAFKTLYPDANMS 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W     +  +CC Y LD T+PLF EIG+AF+ +Q+ E+G T H+YNCD+F+EN P 
Sbjct: 261 KMAPWNGF--NDTYCCPYFLDPTEPLFNEIGKAFLSEQISEFG-TDHMYNCDSFNENVPT 317

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQ-MKALLNSVPLGKLVV 179
                Y++++G +IY  M   D DAVWL+QGW+F  D FW+  + ++++L SVPLGK++V
Sbjct: 318 SGDLTYLANVGKSIYKAMTDTDPDAVWLLQGWMFYNDNFWQDTERVRSILTSVPLGKMIV 377

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDL +E  P +    Q++G PYIWCMLH+F G + M+G    I   P++AR  EN+TM  
Sbjct: 378 LDLQSEQFPQYERLNQYFGQPYIWCMLHDFGGTLGMFGSSTVINEVPIKARHLENSTMIG 437

Query: 238 -------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W  +YS RRYG      ++AW +L  TV
Sbjct: 438 TGLTPEGINQNYVIYELMTETAWRQAPVNLTEWFEKYSTRRYGFPDSDAENAWRILQRTV 497

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y                    D   ++   GKY      ++K   LK +  +      WY
Sbjct: 498 Y--------------------DYQGLNRMRGKY-----AITKSPSLKIKIWT------WY 526

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
           ST++++ A    + + + L A++ Y +DL+D+TRQ L  Y +  +  +++ YQ +D+   
Sbjct: 527 STNDLLEAWTSLLEASDNLGANSGYLHDLVDVTRQVLQVYGDLYYKEMVKNYQSHDSANF 586

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
              S++FLE+++D+D +L+ +  FLLGPWLE+AK+ A +  +E Q+E+NAR QIT+W   
Sbjct: 587 QANSKKFLEILDDLDEILSTNSAFLLGPWLEAAKKAANDSAEEAQFEYNARNQITLWGPR 646

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
            +     + DY NK W+G++  ++ PR  ++  Y+  + +        D  + + ++   
Sbjct: 647 GE-----IMDYANKQWAGVVSHFFAPRWYLFINYLNSTFDGAFNQTYID-AKMFKEVEEP 700

Query: 507 WQNGRNVYPVESNGDAL-----ITSQWLYNKY 533
           +   R  +PVE  GDA+     I  +W   +Y
Sbjct: 701 FTFDRTEFPVEPIGDAVEIAWKIHKKWTSEEY 732


>gi|449491231|ref|XP_004174728.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase,
           partial [Taeniopygia guttata]
          Length = 752

 Score =  355 bits (912), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 209/573 (36%), Positives = 312/573 (54%), Gaps = 84/573 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  W GPLP +W  +QL LQ +I+ R+  LGM  VLPAF+G+VP  +  VFP    T
Sbjct: 211 MGNLRRWAGPLPPAWHFKQLYLQYRIVERMRSLGMTTVLPAFAGHVPQGILRVFPRVNAT 270

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W     D  + C YLLD  DP+F  IG  F+++ +KE+G T H+Y+ DTF+E TP 
Sbjct: 271 RLGHWSHF--DCTYSCIYLLDPEDPMFQVIGTLFLKELIKEFG-TDHVYSADTFNEMTPL 327

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+S +  A++  M   D  A+WLMQGWLF + P FW+P Q++ALL+ VPLG+++V
Sbjct: 328 SSDPAYLSRVSNAVFRSMTGADPKALWLMQGWLFQHQPDFWQPAQVRALLHGVPLGRMIV 387

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE KP++  ++ FYG P+IWCMLHNF GN  ++G +++I  GP  AR   N+TM  
Sbjct: 388 LDLFAESKPVYQWTESFYGQPFIWCMLHNFGGNHGLFGTVEAINHGPFAARRFPNSTMVG 447

Query: 238 -------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                          +W+ +Y+ RRYG    A   AW +L  +V
Sbjct: 448 TGLVPEGIEQNDMVYELMNELGWRQEPLDLPSWVTRYAERRYGAPNAAAASAWRLLLRSV 507

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YNCT    + NR                          P+ +   L+ +T       LWY
Sbjct: 508 YNCTGVCVNHNRS-------------------------PLVRRPSLRMDT------ELWY 536

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ------L 380
           + S+V  A  L +++G EL +S  + YDL+D+TRQA  +  +  +L+I +A+Q      L
Sbjct: 537 NASDVFEAWRLLLSAGAELGSSPAFLYDLVDVTRQAAQQLVSHYYLSIRQAFQSHALPEL 596

Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
             A GV        +L+ ++D LL+ H  FLLG WL+SA+ +A ++++ +QYE NAR Q+
Sbjct: 597 LTAGGVL-----VYDLLPELDSLLSSHSLFLLGRWLQSARAVATSDQEAEQYELNARNQV 651

Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 500
           T+W  +       + DY N    GL+ DYY  R +++   ++ESL SG  F    + + +
Sbjct: 652 TLWGPSGN-----ILDYANXQLGGLVLDYYAVRWSLFVSVLVESLNSGRPFHQNQFNQVF 706

Query: 501 IKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
            ++   +   +  YP    GD +  S+ L+ KY
Sbjct: 707 FQVERGFIYNKKRYPAVPFGDTMEISRKLFLKY 739


>gi|1171229|gb|AAC50512.1| alpha-N-acetylglucosaminidase [Homo sapiens]
 gi|1171231|gb|AAC50513.1| alpha-N-acetylglucosaminidase [Homo sapiens]
 gi|1197840|gb|AAB06188.1| alpha-N-acetylglucosaminidase [Homo sapiens]
 gi|1479981|gb|AAB36604.1| alpha-N-acetylglucosaminidase [Homo sapiens]
 gi|32450702|gb|AAH53991.1| N-acetylglucosaminidase, alpha- [Homo sapiens]
 gi|119581237|gb|EAW60833.1| N-acetylglucosaminidase, alpha- (Sanfilippo disease IIIB), isoform
           CRA_b [Homo sapiens]
          Length = 743

 Score =  355 bits (912), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 200/573 (34%), Positives = 304/573 (53%), Gaps = 76/573 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L ++   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ RRYG S P    AW +L  +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQA+ +  +  +     AY   +  
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + +       EL+  +D +LA    FLLG WLE A+  A +E +   YE N+R Q+T+W
Sbjct: 590 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 649

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 E ++L DY NK  +GL+ +YY PR  ++ + +++S+  G  F+   + +   +L
Sbjct: 650 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALVDSVAQGIPFQQHQFDKNVFQL 704

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 536
              +   +  YP +  GD +  ++ ++ KY  G
Sbjct: 705 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKYYPG 737


>gi|301626955|ref|XP_002942650.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Xenopus (Silurana)
           tropicalis]
          Length = 759

 Score =  355 bits (910), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 200/573 (34%), Positives = 299/573 (52%), Gaps = 84/573 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+H WGGPL  SW++++L LQ +I  R+  LGM  VLPAF+G++P  +  VFP   ++
Sbjct: 212 MGNIHTWGGPLSISWMEKRLSLQLQITERMRSLGMITVLPAFAGHIPEGILRVFPKVTVS 271

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG W +      + C+YLLD  DPLF  IG  F+ Q ++ +G T HIY+ DTF+E +P 
Sbjct: 272 RLGGWSNFNCT--YSCSYLLDPEDPLFQWIGELFLSQMVQSFG-TDHIYSADTFNEMSPT 328

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+S++  AI+  M   D DA+WLMQGWLF  +P FWRP Q KALL+  P+G+++V
Sbjct: 329 SSDPGYLSAVSGAIFKSMAKVDPDAIWLMQGWLFINNPSFWRPAQTKALLHGAPIGRIIV 388

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLFAE  P++ T++ FYG P+IWCML+NF GN  ++G ++ +  GP +A    N+TM  
Sbjct: 389 LDLFAETVPVYLTTESFYGQPFIWCMLNNFGGNHGLFGNIEGVNRGPFDAAKFPNSTMVG 448

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           WI+ YS RRYG+S    + AW +L  +V
Sbjct: 449 TGLTPEGIEQNDMIYEFMNEIGWSSQPINLTKWISNYSDRRYGQSNTDARMAWQILLRSV 508

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YNCT    + N   +V  P ++ +                                 + Y
Sbjct: 509 YNCTQILHNHNHSPLVRRPSLNMNT-------------------------------DICY 537

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QL 380
           + +++  A      +   L  S T+ YDL+D+TR+A+ +  +E +L I EAY      QL
Sbjct: 538 NKADIYEAWRFMHNASFALGKSATFLYDLVDITREAVQQLVSEYYLEIKEAYGKKSLQQL 597

Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
             A GV        +L+ ++D LL+   GFLLG WL++AK +A    +   Y+ NAR QI
Sbjct: 598 MTAGGVL-----VYDLLPELDSLLSSQPGFLLGSWLKAAKSMASTPAEAALYDMNARNQI 652

Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 500
           T+W          + DY NK + GL++DYY  R  ++  ++++SL  G+ F    + +  
Sbjct: 653 TLWGPTGN-----ILDYANKQYGGLVQDYYTERWGLFVWFLVQSLNKGEHFNQDKFNKAV 707

Query: 501 IKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
             L  D+      Y     GD L  +  +Y KY
Sbjct: 708 FVLEEDFVYNGKEYMASPTGDTLEIANKIYLKY 740


>gi|1479983|gb|AAB36605.1| alpha-N-acetylglucosaminidase [Homo sapiens]
 gi|119581236|gb|EAW60832.1| N-acetylglucosaminidase, alpha- (Sanfilippo disease IIIB), isoform
           CRA_a [Homo sapiens]
          Length = 639

 Score =  355 bits (910), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 200/573 (34%), Positives = 304/573 (53%), Gaps = 76/573 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L ++   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 100 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 159

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 160 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 216

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 217 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 276

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 277 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 336

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ RRYG S P    AW +L  +
Sbjct: 337 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 396

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 397 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 425

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQA+ +  +  +     AY   +  
Sbjct: 426 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 485

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + +       EL+  +D +LA    FLLG WLE A+  A +E +   YE N+R Q+T+W
Sbjct: 486 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 545

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 E ++L DY NK  +GL+ +YY PR  ++ + +++S+  G  F+   + +   +L
Sbjct: 546 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALVDSVAQGIPFQQHQFDKNVFQL 600

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 536
              +   +  YP +  GD +  ++ ++ KY  G
Sbjct: 601 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKYYPG 633


>gi|66346698|ref|NP_000254.2| alpha-N-acetylglucosaminidase precursor [Homo sapiens]
 gi|317373322|sp|P54802.2|ANAG_HUMAN RecName: Full=Alpha-N-acetylglucosaminidase; AltName:
           Full=N-acetyl-alpha-glucosaminidase; Short=NAG;
           Contains: RecName: Full=Alpha-N-acetylglucosaminidase 82
           kDa form; Contains: RecName:
           Full=Alpha-N-acetylglucosaminidase 77 kDa form; Flags:
           Precursor
          Length = 743

 Score =  354 bits (909), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 199/570 (34%), Positives = 303/570 (53%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L ++   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ RRYG S P    AW +L  +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQA+ +  +  +     AY   +  
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + +       EL+  +D +LA    FLLG WLE A+  A +E +   YE N+R Q+T+W
Sbjct: 590 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 649

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 E ++L DY NK  +GL+ +YY PR  ++ + +++S+  G  F+   + +   +L
Sbjct: 650 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALVDSVAQGIPFQQHQFDKNVFQL 704

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +   +  YP +  GD +  ++ ++ KY
Sbjct: 705 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 734


>gi|355568706|gb|EHH24987.1| Alpha-N-acetylglucosaminidase, partial [Macaca mulatta]
          Length = 711

 Score =  353 bits (907), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 201/570 (35%), Positives = 303/570 (53%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 172 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 231

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 232 KMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRELVKEFG-TDHIYGADTFNEMQPP 288

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
             +P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 289 SSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 348

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 349 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 408

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ +RYG S P    AW +L  +
Sbjct: 409 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLLRS 468

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 469 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------V 497

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S V  A  L + S   L+AS  +RYDL+DLTRQA+ +  +  +     AY   +  
Sbjct: 498 WYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELT 557

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + +       EL+  +D LLA    FLLG WLE A+  A +E +   YE N+R Q+T+W
Sbjct: 558 SLLRAGGVLAYELLPALDELLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 617

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 E ++L DY NK  +GL+ +YY PR  ++ + + +S+  G  F+   + +   +L
Sbjct: 618 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQL 672

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +   +  YP +  GD +  ++ ++ KY
Sbjct: 673 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 702


>gi|402900329|ref|XP_003913130.1| PREDICTED: alpha-N-acetylglucosaminidase [Papio anubis]
          Length = 743

 Score =  353 bits (907), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 201/570 (35%), Positives = 302/570 (52%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRELVKEFG-TDHIYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
             +P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 321 SSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPEAARLFPNSTMVG 440

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ +RYG S P    AW +L  +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLLRS 500

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------V 529

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S V  A  L + S   L+AS  +RYDL+DLTRQA+ +  +  +     AY   +  
Sbjct: 530 WYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELT 589

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + +       EL+  +D LLA    FLLG WLE A+  A +E +   YE N+R Q+T+W
Sbjct: 590 SLLRAGGVLAYELLPALDELLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 649

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 E ++L DY NK  +GL+ +YY PR  ++ + + +S+  G  F+   + +   +L
Sbjct: 650 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQL 704

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +   +  YP +  GD +  ++ ++ KY
Sbjct: 705 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 734


>gi|348562747|ref|XP_003467170.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
           [Cavia porcellus]
          Length = 750

 Score =  353 bits (906), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 199/569 (34%), Positives = 308/569 (54%), Gaps = 74/569 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPLP++W  +QL LQ +IL R+  LGM PVLPAF+G+VP A+  VFP   IT
Sbjct: 211 MGNLHGWGGPLPRTWHLKQLSLQHQILDRMRALGMTPVLPAFAGHVPKAIGRVFPQVNIT 270

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG+W        + C++LL   DPLF  IG  F+ + ++E+G T+HIY  DTF+E  PP
Sbjct: 271 QLGSWGHFNCS--YSCSFLLAPEDPLFPLIGGIFLRELIREFG-TNHIYGADTFNEMQPP 327

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A++  M + DSDAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 328 SSDPAYLAAATEAVFKAMVAVDSDAVWLLQGWLFQHQPEFWGPAQVGAVLGAVPQGRLLV 387

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 388 LDLFAESQPVYTRTASFRGQPFIWCMLHNFGGNHGLFGALEAVNRGPTAARLFPNSTMVG 447

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+++++ RRYG + P  + AW +L  +
Sbjct: 448 TGITPEGIGQNEVVYALMAELGWRKDPVPDLLAWVSRFAERRYGVAQPDAEAAWRLLLRS 507

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VYNC+  A   +                           P+ +   L+  T+      +W
Sbjct: 508 VYNCSGEACRGHNH------------------------SPLVRRPSLQMNTA------VW 537

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
           Y+ S+V  A  L + +  +L+ S T+RYDL+D+TRQAL +  +  +  +  AY   +  G
Sbjct: 538 YNRSDVFEAWRLLLKASPKLTTSPTFRYDLLDVTRQALQELVSLYYEEVRAAYLHQELAG 597

Query: 386 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
           + +       +L+  +D +LA    FLLG WL  A+  A +E + + YE N+R Q+T+W 
Sbjct: 598 LLRAGGVLAYQLLPALDEVLASDHHFLLGSWLAQARAAAASETEARLYEQNSRYQLTLW- 656

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                E ++L DY NK  +GL+  YY PR  ++ + + +SL     F+   + ++   L 
Sbjct: 657 ---GPEGNIL-DYANKQLAGLVAHYYAPRWQLFIESLADSLARAAPFQQHQFDKDVFLLE 712

Query: 505 NDWQNGRNVYPVESNGDALITSQWLYNKY 533
             +      Y  +  GD +  ++ ++ ++
Sbjct: 713 QAFVLSSRRYRSQPQGDTVDLARKVFLRF 741


>gi|114667172|ref|XP_523654.2| PREDICTED: alpha-N-acetylglucosaminidase isoform 2 [Pan
           troglodytes]
 gi|410216584|gb|JAA05511.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
 gi|410258938|gb|JAA17435.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
 gi|410304442|gb|JAA30821.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
 gi|410337929|gb|JAA37911.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
          Length = 743

 Score =  353 bits (905), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 199/570 (34%), Positives = 302/570 (52%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L ++   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ RRYG S P    AW +L  +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQA+ +  +  +     AY   +  
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + +       EL+  +D +LA    FLLG WLE A+  A +E +   YE N+R Q+T+W
Sbjct: 590 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 649

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 E ++L DY NK  +GL+ +YY PR  ++ + + +S+  G  F+   + +   +L
Sbjct: 650 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQL 704

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +   +  YP +  GD +  ++ ++ KY
Sbjct: 705 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 734


>gi|355754184|gb|EHH58149.1| Alpha-N-acetylglucosaminidase, partial [Macaca fascicularis]
          Length = 650

 Score =  352 bits (904), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 201/570 (35%), Positives = 302/570 (52%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 111 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 170

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 171 KMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRELVKEFG-TDHIYGADTFNEMQPP 227

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
             +P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 228 SSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRLLV 287

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 288 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 347

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ +RYG S P    AW +L  +
Sbjct: 348 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLLRS 407

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 408 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------V 436

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S V  A  L + S   L+AS  +RYDL+DLTRQA+ +  +  +     AY   +  
Sbjct: 437 WYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELT 496

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + +       EL+  +D LLA    FLLG WLE A+  A +E +   YE N+R Q+T+W
Sbjct: 497 SLLRAGGVLAYELLPALDELLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 556

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 E ++L DY NK  +GL+ +YY PR  ++ + + +S+  G  F+   + +   +L
Sbjct: 557 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQL 611

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +   +  YP +  GD +  ++ ++ KY
Sbjct: 612 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 641


>gi|426348060|ref|XP_004041658.1| PREDICTED: alpha-N-acetylglucosaminidase [Gorilla gorilla gorilla]
          Length = 743

 Score =  352 bits (903), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 199/570 (34%), Positives = 302/570 (52%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L ++   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIEAVLGAVPRGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ RRYG S P    AW +L  +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQA+ +  +  +     AY   +  
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + +       EL+  +D +LA    FLLG WLE A+  A +E +   YE N+R Q+T+W
Sbjct: 590 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 649

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 E ++L DY NK  +GL+ +YY PR  ++ + + +S+  G  F+   + +   +L
Sbjct: 650 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQL 704

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +   +  YP +  GD +  ++ ++ KY
Sbjct: 705 EQAFVLSKQRYPSQPQGDTVDLAKKIFLKY 734


>gi|397485721|ref|XP_003813989.1| PREDICTED: alpha-N-acetylglucosaminidase [Pan paniscus]
          Length = 682

 Score =  352 bits (903), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 199/570 (34%), Positives = 302/570 (52%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L ++   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 143 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 202

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 203 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 259

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 260 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 319

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 320 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNEGPEAARLFPNSTMVG 379

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ RRYG S P    AW +L  +
Sbjct: 380 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 439

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 440 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 468

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQA+ +  +  +     AY   +  
Sbjct: 469 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 528

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + +       EL+  +D +LA    FLLG WLE A+  A +E +   YE N+R Q+T+W
Sbjct: 529 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 588

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 E ++L DY NK  +GL+ +YY PR  ++ + + +S+  G  F+   + +   +L
Sbjct: 589 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQL 643

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +   +  YP +  GD +  ++ ++ KY
Sbjct: 644 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 673


>gi|432926094|ref|XP_004080826.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Oryzias latipes]
          Length = 882

 Score =  352 bits (903), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 203/574 (35%), Positives = 312/574 (54%), Gaps = 85/574 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N++ +GGPLPQSW   QL LQ +IL R+   GM PVLPAFSGNVP  +  + P A +T
Sbjct: 345 MANMYKFGGPLPQSWHVNQLRLQFRILERMRAFGMIPVLPAFSGNVPKGILKLHPEANVT 404

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG W        + C+Y+LD  DPLF++IG  ++ Q +K++G T HIYN DTF+E TPP
Sbjct: 405 RLGPWAHFNCS--FSCSYVLDPRDPLFLQIGSLYLSQVVKQFG-TDHIYNTDTFNEMTPP 461

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+S++  ++++ M + D  A+WLMQGWLF  D  FW+PPQ++ALL+ VPLG+++V
Sbjct: 462 SSDPAYLSAISRSVFASMTAVDPKAIWLMQGWLFFSDAAFWKPPQIRALLHGVPLGRMIV 521

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLFAE +P++S ++ FYG P+IWCMLHNF GN   +G ++SI  GP +A   +N+TM  
Sbjct: 522 LDLFAETEPVFSYTESFYGQPFIWCMLHNFGGNNGFFGTVESINSGPFKALNFKNSTMVG 581

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W + Y+ RRYG    ++  AW +L+ +V
Sbjct: 582 IGMTPEGIHQNPVIYELMSELAWRKESVNLTKWASLYAARRYGSMHESLSAAWKLLFSSV 641

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YNCT                       V   +  N+  P+ +       T       LWY
Sbjct: 642 YNCT-----------------------VPHYRNHNH-SPLVRRPSFNMNTG------LWY 671

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QL 380
             ++++   +LF+ +   L +  T+RYDL+D+TRQ L       + +I +A+      +L
Sbjct: 672 DPADLLETWKLFMEAAPSLMSKETFRYDLVDVTRQVLQDLTTYFYQDIKDAFHSKKMPEL 731

Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
             + GV        +L  +++ LL     FLLG WLE A+  A +E + + Y+ NAR Q+
Sbjct: 732 LTSGGVL-----IYDLFPELNRLLNSDRNFLLGTWLEQAQSFALDEPEARLYDLNARNQL 786

Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 500
           T+W  + +     + DY NK W GL+ DYY  R +++ + +++ L SG  F+   + +  
Sbjct: 787 TLWGPSGE-----ILDYANKEWGGLVEDYYAQRWSLFVQTLVDCLNSGLPFKQDAFNQAV 841

Query: 501 IKLTNDW-QNGRNVYPVESNGDALITSQWLYNKY 533
            ++   +  NGR  YP +  GD    +  ++ KY
Sbjct: 842 FRVEKGFISNGRK-YPTKPQGDTYEIAHRIFLKY 874


>gi|354485058|ref|XP_003504701.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cricetulus griseus]
 gi|344251941|gb|EGW08045.1| Alpha-N-acetylglucosaminidase [Cricetulus griseus]
          Length = 740

 Score =  352 bits (903), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 204/575 (35%), Positives = 303/575 (52%), Gaps = 86/575 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP+SW  +QL LQ +IL R+   GM PVLPAF+G+VP A+  VFP   + 
Sbjct: 203 MGNLHTWGGPLPRSWHLKQLYLQHRILDRMRAFGMIPVLPAFAGHVPKAITRVFPQVNVF 262

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  P 
Sbjct: 263 QLGSWGHFNCS--YSCSFLLAPGDPVFPLIGSLFLRELIKEFG-TDHIYGADTFNEMQPI 319

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P ++++  AA+Y  M S D DA+WL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 320 SSDPSFLTAATAAVYEAMISVDPDAIWLLQGWLFQHQPQFWGPAQVKAVLQAVPRGRLLV 379

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE  P++  +  FYG P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 380 LDLFAESHPVYMQTASFYGQPFIWCMLHNFGGNHGLFGALEAVNQGPRAARIFPNSTMVG 439

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                            W+++++  RYG S P  + AW +L  +
Sbjct: 440 TGIAPEGIGQNEMVYALMAELGWRKDPVPDLEVWVSRFASHRYGMSHPDAEAAWRLLLRS 499

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC  +     NR  +V  P +  + I                               +
Sbjct: 500 VYNCPGETYNGHNRSPLVKRPSLQINTI-------------------------------V 528

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ----- 379
           WY+ S+V  A  L + +   L+ S  +RYDL+D+TRQ+L +  +  +     A+      
Sbjct: 529 WYNRSDVFEAWRLLLTAAPNLTTSKAFRYDLLDVTRQSLQELVSLFYEEARIAFMKEELD 588

Query: 380 -LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 438
            L  A G+  ++R+   L+  +D LLA    FLLG WL  A+ +A +E++ + YE N+  
Sbjct: 589 LLLRAGGI--ITRK---LLPALDELLASDSRFLLGTWLNQARAMAVSEDEAQFYELNSLY 643

Query: 439 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 498
           Q+T+W      E +++ DY NK  +GL+ DYY PR  ++ + +  SL  G  FR  ++ +
Sbjct: 644 QLTLW----GPEGNIM-DYANKQLAGLVADYYQPRWGLFMEALAHSLARGVPFRQHEFEK 698

Query: 499 EWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
               L   +   +  YP    GD +  S+ L+ KY
Sbjct: 699 NVFPLELAFIINKKRYPSHPQGDTVDLSKKLFLKY 733


>gi|395532374|ref|XP_003768245.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Sarcophilus
           harrisii]
          Length = 726

 Score =  352 bits (902), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 201/570 (35%), Positives = 299/570 (52%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPL  SW  +Q  LQ +IL R+   GM PVLPAF+G+VP A   VFP A +T
Sbjct: 187 MGNLHSWGGPLSSSWHRKQSSLQYQILERMRSFGMKPVLPAFAGHVPKAFTRVFPQAYVT 246

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W        + C+YLL   DPLF  +G  F+ +  +E+G T HIY+ DTF+E  PP
Sbjct: 247 HLGMWGHFNCT--YSCSYLLAPEDPLFPVVGSLFLRELTQEFG-TDHIYSADTFNEMEPP 303

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW+PPQ+KA+L +VPLG+L+V
Sbjct: 304 SSEPAYLAAATAAVYEAMIAVDVDAVWLLQGWLFQHQPDFWKPPQVKAVLKAVPLGRLLV 363

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDL+AE KP++S +  FYG P+IWCMLHNF GN  ++G LD++  GP +A    N+T   
Sbjct: 364 LDLYAESKPVYSRTDSFYGQPFIWCMLHNFGGNHGLFGALDAVNRGPSDAWLFPNSTFVG 423

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ +RYG      + AW +L  +
Sbjct: 424 TGIVPEGINQNEVVYALMAELGWQKGPLPDLGAWVAGFAAQRYGTPHSHAEAAWKLLLQS 483

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ D  T  NR  +V  P +   I                                +
Sbjct: 484 VYNCSGDLCTGHNRSPLVKRPSLHLDI-------------------------------SV 512

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S+V  A  L + +   L++S  +RYDL+D+TRQ   +  +  +  +  A++     
Sbjct: 513 WYNRSDVFEAWRLLLEAAPVLASSPAFRYDLLDVTRQVAQELVSLYYEELRTAFEAGAMP 572

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            +       + +L+  +D LLA  + FLLG WLE A+++A +E +  QY+ NA  Q+T+W
Sbjct: 573 ALLTAGGLLVFDLLPSLDELLASDERFLLGAWLEQAREMAVSEAEAWQYKQNALYQLTLW 632

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                     + DY NK  +GL+  YY PR  ++ + +++SL  G  F    +  E + L
Sbjct: 633 GPTGN-----ILDYANKQLAGLVAGYYAPRWKLFVEMLVKSLAEGTPFHQNQFESEALLL 687

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
             ++  GR  +P +  GD +   +  + +Y
Sbjct: 688 GQNFVLGREKFPTQPQGDTVDLVKKFFLRY 717


>gi|297701096|ref|XP_002827555.1| PREDICTED: alpha-N-acetylglucosaminidase [Pongo abelii]
          Length = 836

 Score =  352 bits (902), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 199/570 (34%), Positives = 301/570 (52%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 297 MGNLHSWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 356

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HI+  DTF+E  PP
Sbjct: 357 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIFGADTFNEMQPP 413

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 414 SSEPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRLLV 473

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 474 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 533

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ RRYG S P    AW +L  +
Sbjct: 534 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 593

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 594 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------V 622

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQA+ +  +  +     AY   +  
Sbjct: 623 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 682

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + +       EL+  +D +LA    FLLG WLE A+  A +E +   YE N+R Q+T+W
Sbjct: 683 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 742

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 E ++L DY NK  +GL+ +YY PR  ++ + + +S+  G  F+   + +   +L
Sbjct: 743 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQL 797

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +   +  YP +  GD +  ++ ++ KY
Sbjct: 798 EQAFVLSKQRYPSQPQGDTVDLAKKIFLKY 827


>gi|256422141|ref|YP_003122794.1| alpha-N-acetylglucosaminidase [Chitinophaga pinensis DSM 2588]
 gi|256037049|gb|ACU60593.1| Alpha-N-acetylglucosaminidase [Chitinophaga pinensis DSM 2588]
          Length = 728

 Score =  351 bits (900), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 197/570 (34%), Positives = 300/570 (52%), Gaps = 77/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  WGGPLPQ W D   VLQ++IL     +GM P+LPAF+G+VP A ++ +P+ +I 
Sbjct: 195 MGNIDAWGGPLPQHWKDSHKVLQQQILAAERSMGMLPILPAFTGHVPPAFKDKYPN-EIV 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +  NW     D  +   Y+LD   P+F +IG+ F+E Q K +G T H Y+ DTF+EN PP
Sbjct: 254 KPTNW-----DAGFPDVYILDPNSPMFDKIGKKFLEAQTKAFG-TDHFYSADTFNENVPP 307

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
                ++ ++   +Y+ M + D  AVW+MQGW+F Y+  +W  PQ++ALLN+VP   ++V
Sbjct: 308 SSDSSFLDAMSRKVYASMAAADPKAVWVMQGWMFHYNASYWHQPQIRALLNAVPDDHMIV 367

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA---------- 229
           LDL++E  P W  ++ +YG P+IW MLHNF GN  M+G +D+ A  P  A          
Sbjct: 368 LDLYSESHPEWRNTQAYYGKPWIWNMLHNFGGNTGMWGGMDAAAHDPATALHDPASGKMS 427

Query: 230 ------------------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                   R        W+  Y+ +RYG    A+  AW +LYHT
Sbjct: 428 GIGLTPEGIEQNPALYQLMIDNVWRDQPINVDTWLQSYAKQRYGAENEAVNKAWQILYHT 487

Query: 266 VY--NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
           VY    T+GA +    +IVA P +D                 ++ E V            
Sbjct: 488 VYIGGPTEGAPES---IIVARPTLD-----------------IAAERV---------KTK 518

Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
           L Y  ++V+ A +LFI +  +L  +  ++YDL+DLTRQ L  YA+ L   +  AY+  D 
Sbjct: 519 LEYDPAKVVPAWDLFINAAAQLKPTEGFKYDLVDLTRQVLGNYASPLQQRVATAYRNKDL 578

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
               Q S +F+ L++DMD LL   +GFLLG W+  A+       ++  YE+NA+  +T+W
Sbjct: 579 AAFKQYSTQFIGLLDDMDMLLGTQEGFLLGKWVSDARSNGITPAEQDLYEFNAKDLVTLW 638

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
            D    + S + +Y N+ W+GL++ +Y PR   +F  +  SL+ G+   LK +  +    
Sbjct: 639 GD----KDSPVHEYSNRQWNGLIKGFYKPRWQQFFTLLESSLKKGETADLKAFEEQVKAF 694

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              W NG + Y V+  GDA+  +  L+ KY
Sbjct: 695 EWKWANGHDKYAVKPQGDAVKAAVQLHKKY 724


>gi|395827009|ref|XP_003786703.1| PREDICTED: alpha-N-acetylglucosaminidase [Otolemur garnettii]
          Length = 756

 Score =  350 bits (899), Expect = 8e-94,   Method: Compositional matrix adjust.
 Identities = 201/569 (35%), Positives = 296/569 (52%), Gaps = 74/569 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP SW  +QL LQ +IL R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 204 MGNLHTWGGPLPFSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QL +W        + C++LL   DP+F  IG  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 264 QLSSWGHFNCS--YSCSFLLAPGDPIFSLIGSLFLRELTKEFG-TDHIYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D DAVWL+QGWLF + P FW P Q+KA+L +VPLG+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMIAVDPDAVWLLQGWLFQHQPQFWGPTQIKAVLRAVPLGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P++S +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 381 LDLFAESQPVYSRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNQGPKAARLFPNSTMVG 440

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ RRYG S    + AW +L  +
Sbjct: 441 TGMAPEGINQNEVVYALMAELGWRKDPVPDLVAWVTSFADRRYGISHGDAEAAWRLLLRS 500

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VYNC+  A   +                           P+ K   L+  T+      +W
Sbjct: 501 VYNCSGEACSGHNH------------------------SPLVKRPSLQMNTT------VW 530

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
           Y+ S+V  A  L + S   L+AS  +RYDL+D+TRQA+ +  +  +     AY   +   
Sbjct: 531 YNRSDVFEAWRLLLTSAPTLAASPIFRYDLLDITRQAIQELVSLYYEKARTAYLNKELVP 590

Query: 386 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
           + +       EL+  +D +LA  + FLLG WL  A+ +A +E +   YE N+R Q+T+W 
Sbjct: 591 LLRAGGLLAYELLPALDEVLASDNHFLLGSWLAQARAVAISEAEANFYEQNSRYQLTLW- 649

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                    + DY NK  +GL+ DYY PR  ++ + +   L  G  F+ + + +    L 
Sbjct: 650 ----GPVGNILDYANKQLAGLVADYYAPRWQLFMQALGNCLAQGIPFQQRQFDKNVFPLE 705

Query: 505 NDWQNGRNVYPVESNGDALITSQWLYNKY 533
             +      YP +  G+ +  ++ ++ KY
Sbjct: 706 QAFVLNSKRYPSQPQGNTMDLAKKIFLKY 734


>gi|295132875|ref|YP_003583551.1| hypothetical protein ZPR_1010 [Zunongwangia profunda SM-A87]
 gi|294980890|gb|ADF51355.1| predicted protein [Zunongwangia profunda SM-A87]
          Length = 750

 Score =  350 bits (897), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 207/578 (35%), Positives = 303/578 (52%), Gaps = 96/578 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GWGGPLPQSW +    LQKKIL R  ELGM PVLPAF+G+VPA+ +  FP A + 
Sbjct: 206 MGNLDGWGGPLPQSWKESHRDLQKKILKRSRELGMKPVLPAFTGHVPASFKKFFPDADLK 265

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +  NW +   D     TY+LDA DPLF EIG+ F+E+Q + +G T H Y  DTF+EN PP
Sbjct: 266 KT-NWGNDFGD-----TYILDAEDPLFAEIGKRFLEKQEEVFG-TDHFYTADTFNENEPP 318

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVV 179
            D P+Y+  L   I+ GM++ D +A W+MQGWLF S+  FW+ PQ+K LL++VP  ++++
Sbjct: 319 SDDPKYLGELSEKIFEGMKAADPEATWVMQGWLFYSHKDFWKTPQIKGLLSTVPDDRMII 378

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA---------- 229
           LDL  E++P+W  ++ FYG  +IW MLHNF GNI M+G ++++A  P  A          
Sbjct: 379 LDLATEIEPVWKQTEAFYGKQWIWNMLHNFGGNISMFGRIETVAEQPALALNDSTSGNLK 438

Query: 230 ------------------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                   R +     +W+  Y+  RYG    +I +AW++L  T
Sbjct: 439 GIGLTMEAIEQNPVLYELMTDNTWRDTPIELKSWLKNYTRNRYGAVNDSILEAWDILVAT 498

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
            YN T    D    +I A P             ++ Y +              +    + 
Sbjct: 499 AYNGT-TIRDGAESIIAARP------------TFEGYRR--------------WARTKMN 531

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
           Y   +++ A +LFI + +    S+ + YDL+DL+RQ LA YA  +   +  AY+ ND   
Sbjct: 532 YDPLDLLPAWDLFIGARDRFKDSDGFAYDLVDLSRQVLANYALPVQQQMRIAYENNDKEA 591

Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF- 444
             + S   L L+ D+D LLA    FLLGPW+  A+      E++  YE NAR  IT+W  
Sbjct: 592 FKKHSEELLTLISDLDRLLATRKDFLLGPWIADARSWGTTPEEKALYERNARDLITLWGG 651

Query: 445 -DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG--------FRLKD 495
            DN       L +Y  + WSG+L D+Y PR    ++  I  +E+  G         ++K+
Sbjct: 652 PDNP------LHEYSCRQWSGVLDDFYKPR----WQQFIADVEANWGDFDQEVFDEKIKE 701

Query: 496 WRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
           W  +W+       N    YP + +GD+   ++ LY+KY
Sbjct: 702 WEWKWV-------NKEEAYPTQPSGDSYKVAKALYDKY 732


>gi|405964692|gb|EKC30145.1| Alpha-N-acetylglucosaminidase [Crassostrea gigas]
          Length = 859

 Score =  349 bits (895), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 195/598 (32%), Positives = 309/598 (51%), Gaps = 70/598 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+HGWGGP+ Q+W+D QL+LQ KIL R+   GM PVLP F+G+VP A    +P A ++
Sbjct: 217 MGNMHGWGGPITQNWIDDQLILQHKILERMRSFGMIPVLPGFAGHVPEATILRYPQANVS 276

Query: 61  QLGNWFSVKSDPRW--------------------CCTYLLDATDPLFIEIGRAFIEQQLK 100
           +L +W        W                    CC YLLD  DPLF++I   FI++   
Sbjct: 277 RLTDWAGFNQSFCWHYPTANVSRLRDWGHFNKTYCCNYLLDFNDPLFMKIAVRFIKEMEN 336

Query: 101 EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW 160
           E+G   H+Y+ DTF+E  P  +S EY++  G  +Y  ++  DS A+WLMQGWLF    FW
Sbjct: 337 EFG-VDHVYSVDTFNEMRPRSNSTEYLALSGRTVYKSLKEADSKAIWLMQGWLFIDGGFW 395

Query: 161 RPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 220
           + PQ+KALL +VP G++++LDL++E+ PI++ ++ +YG P+IWCMLH+F G +E+YG L 
Sbjct: 396 KQPQIKALLTAVPQGEMIILDLYSEIIPIYTQTESYYGQPFIWCMLHDFGGTMELYGALK 455

Query: 221 SIAFGPVEARTSENTTM---------------------------------AWINQYSVRR 247
            I  GP   R   N++M                                  WI++Y + R
Sbjct: 456 LINEGPFNGRAFPNSSMVGLGMTPEGIFQNEVVYEFFTENVWRKAPRDISTWISKYVLNR 515

Query: 248 YGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE---GKYQN--- 301
           YG++   I  AW  L ++VYN +D   D + + I   PD  PS+        G Y N   
Sbjct: 516 YGKTNKFIDLAWQYLKNSVYNNSDNLKDHDSNAI---PDHRPSLSPALHPDLGIYNNTDY 572

Query: 302 -YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 360
            +   ++         +      +WY+  ++  A ++   + +E S S+ + YD++D+TR
Sbjct: 573 LHDNSINIIVTTLPRMTPLIQQDVWYNPEDLYVAWDIMTLNLDEFSNSSLFMYDIVDVTR 632

Query: 361 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 420
            +L   + + + +++ A+   D H V     + L L+ DMD +L     FLLG W+++A 
Sbjct: 633 NSLQILSIKYYTDLVYAFGRGDIHAVESHGNQLLGLLSDMDTVLGSDSHFLLGRWIKAAT 692

Query: 421 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 480
             A + +     ++NAR QIT+W    +     +RDY  K WSGL++DYY PR  I+  Y
Sbjct: 693 DNAMDMQDNWFLQFNARNQITLWGPRGE-----IRDYACKQWSGLIKDYYLPRWEIFVNY 747

Query: 481 MIESLESGDGFRLKDWR-REWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGT 537
            ++ +     +   +     + K+   +    + YP E  GD++   + L+ KY   T
Sbjct: 748 TLDIMAHNKTYNATELDIMIYEKVEFPFSYRLDQYPTEPQGDSVAIVKSLHKKYRPDT 805


>gi|426238065|ref|XP_004012978.1| PREDICTED: alpha-N-acetylglucosaminidase isoform 1 [Ovis aries]
          Length = 748

 Score =  348 bits (893), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 211/581 (36%), Positives = 301/581 (51%), Gaps = 84/581 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ +IL R+   GM PVLPAF+G+VP AL  VFP   +T
Sbjct: 211 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 270

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+G+W        + C++LL   DPLF  +G  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 271 QMGSWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 327

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D  AVWL+QGWLF   P FW P Q+ A+L +VP G+L+V
Sbjct: 328 SSEPSYLAAATAAVYQAMTAVDPGAVWLLQGWLFQNQPEFWGPAQVAAVLGAVPRGRLLV 387

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+S+  GP  AR   N+T+  
Sbjct: 388 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPATARRFPNSTLVG 447

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ RRYG S    + AW +L  +
Sbjct: 448 TGMAPEGIGQNEVVYALMAELGWRKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 507

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VYNC                       S  E +  N+  P+ K   L   T+      +W
Sbjct: 508 VYNC-----------------------SGEECRGHNH-SPLVKRPSLHMVTT------VW 537

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 379
           Y+ S+V  A  L + +   L++S  +RYDL+D+TRQA+ +  +  +  +  AY       
Sbjct: 538 YNRSDVFEAWRLLLTATPTLASSPAFRYDLVDVTRQAVQELVSLYYEEMRTAYLKKELVP 597

Query: 380 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 439
           L  A G+        EL+  +D +LA    FLLG WLE A+  A +E +   YE N+R Q
Sbjct: 598 LMRAGGILA-----YELLPALDQVLASDCHFLLGSWLEQARLAAVSETEAHFYEQNSRYQ 652

Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 499
           +T+W      E ++L DY NK  +GL+ DYY PR  ++ + + ESL  G  F+   + + 
Sbjct: 653 LTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRLFAETLAESLVQGVPFQQHQFDKN 707

Query: 500 WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 540
             +L   +  G   YP +  GD +   + L+ KY    G F
Sbjct: 708 AFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKYYPRAGSF 748


>gi|426238067|ref|XP_004012979.1| PREDICTED: alpha-N-acetylglucosaminidase isoform 2 [Ovis aries]
          Length = 739

 Score =  348 bits (893), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 211/581 (36%), Positives = 301/581 (51%), Gaps = 84/581 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ +IL R+   GM PVLPAF+G+VP AL  VFP   +T
Sbjct: 202 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 261

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+G+W        + C++LL   DPLF  +G  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 262 QMGSWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 318

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D  AVWL+QGWLF   P FW P Q+ A+L +VP G+L+V
Sbjct: 319 SSEPSYLAAATAAVYQAMTAVDPGAVWLLQGWLFQNQPEFWGPAQVAAVLGAVPRGRLLV 378

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+S+  GP  AR   N+T+  
Sbjct: 379 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPATARRFPNSTLVG 438

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ RRYG S    + AW +L  +
Sbjct: 439 TGMAPEGIGQNEVVYALMAELGWRKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 498

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VYNC                       S  E +  N+  P+ K   L   T+      +W
Sbjct: 499 VYNC-----------------------SGEECRGHNH-SPLVKRPSLHMVTT------VW 528

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 379
           Y+ S+V  A  L + +   L++S  +RYDL+D+TRQA+ +  +  +  +  AY       
Sbjct: 529 YNRSDVFEAWRLLLTATPTLASSPAFRYDLVDVTRQAVQELVSLYYEEMRTAYLKKELVP 588

Query: 380 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 439
           L  A G+        EL+  +D +LA    FLLG WLE A+  A +E +   YE N+R Q
Sbjct: 589 LMRAGGILA-----YELLPALDQVLASDCHFLLGSWLEQARLAAVSETEAHFYEQNSRYQ 643

Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 499
           +T+W      E ++L DY NK  +GL+ DYY PR  ++ + + ESL  G  F+   + + 
Sbjct: 644 LTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRLFAETLAESLVQGVPFQQHQFDKN 698

Query: 500 WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 540
             +L   +  G   YP +  GD +   + L+ KY    G F
Sbjct: 699 AFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKYYPRAGSF 739


>gi|444714090|gb|ELW54978.1| Alpha-N-acetylglucosaminidase [Tupaia chinensis]
          Length = 724

 Score =  348 bits (892), Expect = 6e-93,   Method: Compositional matrix adjust.
 Identities = 205/571 (35%), Positives = 303/571 (53%), Gaps = 76/571 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP SW  +QL LQ ++L R+   GM PVLPAF G+VP A+  VFP   +T
Sbjct: 174 MGNLHTWGGPLPHSWHLKQLYLQHRVLDRMRSFGMIPVLPAFPGHVPKAITRVFPQVNVT 233

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG+W        + C++LL   DP+F  IG  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 234 QLGSWGHFNCS--YSCSFLLAPGDPMFPIIGSLFLRELTKEFG-TDHIYGADTFNELQPP 290

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AAIY+ M + D  AVWL+QGW+F + P FW P Q+KA+L +VP G+L+V
Sbjct: 291 SSEPSYLAAATAAIYAAMTAVDPGAVWLLQGWIFQHQPDFWGPAQVKAVLEAVPRGRLLV 350

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P++  +  F G P+IWCMLHNF GN  +YG L+++ +GP  AR   N++M  
Sbjct: 351 LDLFAETRPVYLYTASFLGQPFIWCMLHNFGGNHGLYGTLEAVNWGPKAARLFPNSSMVG 410

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  Y+ RRYG S+   + AW +L  +
Sbjct: 411 TGMAPEGINQNEVVYALMAELGWRKDPVPDLAAWVTSYADRRYGVSLGDAEAAWRLLLRS 470

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+    +  NR                          P+ K   L+  T+      +
Sbjct: 471 VYNCSGQMCSGHNRS-------------------------PLVKRPSLQMNTT------V 499

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S+V  A  L + +   L+AS T+RYDL+D+TRQA+ +  +  +     AY   +  
Sbjct: 500 WYNRSDVFEAWRLLLTAAPTLAASPTFRYDLLDVTRQAVQELVSLYYEEARTAYLNKELV 559

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + +     + EL+ D+D LLA    F+LG WLE A+ +A +E + + YE N+R Q+T+W
Sbjct: 560 SLLRAGGILVYELLPDLDNLLATDGRFMLGSWLEQARAVAVSETEAQFYEQNSRYQLTLW 619

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                     + DY NK  +GL+ DYY PR  ++ + +  SL  G  F+   + +   +L
Sbjct: 620 GPTGN-----ILDYANKQLAGLVADYYAPRWQLFMEMLANSLTQGIPFQQHQFDQNAFQL 674

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
              +      YP +  GD +  ++ ++ KY 
Sbjct: 675 EQAFVLSVERYPSQPQGDTVELAKKIFLKYF 705


>gi|344285558|ref|XP_003414528.1| PREDICTED: alpha-N-acetylglucosaminidase [Loxodonta africana]
          Length = 744

 Score =  346 bits (888), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 205/575 (35%), Positives = 297/575 (51%), Gaps = 86/575 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP+SW  +QL LQ +IL R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 205 MGNLHSWGGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKAVTRVFPQVNVT 264

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+G+W        + C++LL   DP+F  IG  F+ +   E+G T HIY  DTF+E  PP
Sbjct: 265 QMGSWGHFNCS--YSCSFLLAPGDPMFPIIGSLFLRELTTEFG-TDHIYGADTFNEMQPP 321

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G L+V
Sbjct: 322 SSEPSYLAAATAAVYEAMITVDPDAVWLLQGWLFQHQPQFWGPAQVGAVLGAVPRGHLLV 381

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 382 LDLFAETQPVYIRTASFQGQPFIWCMLHNFGGNHGLFGTLETVNQGPAAARLFPNSTMVG 441

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ RRYG      + AW +L  +
Sbjct: 442 TGMAPEGIGQNEVVYALMAELGWRKDPVPDLGAWVASFAARRYGGIHQDAETAWRLLLRS 501

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +  +  NR                          P+ K   L+  T+      +
Sbjct: 502 VYNCSGESCSGHNRS-------------------------PLVKRPSLQMNTT------V 530

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 378
           WY+ S+V  A  L +A+   L+AS  +RYDL+D+TRQA  +  +  +  +  AY      
Sbjct: 531 WYNRSDVFEAWRLLLATTPALAASPAFRYDLLDVTRQAAQELVSFYYGEVRTAYLNKELV 590

Query: 379 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 438
            L  A GV        EL+  +D +LA    FLLG WLE A+  A +E +   +E N+R 
Sbjct: 591 HLLRAGGVLA-----YELLPALDEVLASDSRFLLGSWLEQARVAAVSEAEAHFFEQNSRY 645

Query: 439 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 498
           Q+T+W          + DY NK  +GL+ DYY PR  ++   ++ESL     F+ + +  
Sbjct: 646 QLTLW-----GPVGNILDYANKQLAGLVSDYYTPRWQLFVGALVESLVQDVPFQQRQFDE 700

Query: 499 EWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +L   +      YP +  GD +  ++ L+ KY
Sbjct: 701 NVFQLEQAFVLNTRRYPTQPKGDTVDLAKRLFLKY 735


>gi|156545487|ref|XP_001606979.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Nasonia vitripennis]
          Length = 755

 Score =  346 bits (887), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 201/570 (35%), Positives = 297/570 (52%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N  GWGGPL Q+W +  + LQ  I+ R+ ELG+ PVLPAF+G+VP     VFP A +T
Sbjct: 221 MGNFRGWGGPLSQAWHNHTIQLQHSIVRRMRELGITPVLPAFAGHVPRDFIRVFPEANVT 280

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++ +W   +   ++CC Y LD TDPLF  +GR F++    E+G T+HIYNCD+F+EN P 
Sbjct: 281 KVVSWNGFED--QYCCPYSLDPTDPLFKTVGREFLKAYTDEFG-TNHIYNCDSFNENDPH 337

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
               +Y+S+ G AIYSGM   D DA+WLMQGWLF +   FW  P++KA + SVP+GK+++
Sbjct: 338 TGDLDYLSNTGKAIYSGMTGADPDAIWLMQGWLFVHSEYFWTFPRVKAFVTSVPIGKMII 397

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDL +E  P +     ++G P+IWCMLHNF G + M+G    I  G  EART+  +TM  
Sbjct: 398 LDLQSEQFPQYKRFHSYFGQPFIWCMLHNFGGTLGMFGSAGVINKGVFEARTTNGSTMIG 457

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W   Y+VRRYG++  +I+ +W  L   +
Sbjct: 458 TGLTPEGINQNYVIYEFMNEMSYRKKPVVLDNWFENYAVRRYGQADESIRTSWQELGREL 517

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YN  DG T                      G Y      ++K   L  E      P  WY
Sbjct: 518 YN-YDGKTK-------------------IRGHY-----VITKRPSLNIE------PWYWY 546

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
                +     F+ +GN    +  +++DL+D+TRQAL   A+ ++ +I  AY   +   +
Sbjct: 547 DLKTFLAVWNSFVHAGNGTMKNELFKHDLVDITRQALQITADFIYADIKAAYTQKNLTQL 606

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN--EEQEKQYEWNARTQITMWF 444
              S   L+L +D++  LA    FLLG WLE AK +A        + YE+NAR QIT+W 
Sbjct: 607 QIASSHLLDLFDDLEKNLASSKDFLLGSWLEDAKAIAPEGATRDRENYEFNARNQITLWG 666

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE-WIKL 503
              +     + DY NK WSG++ DY+ PR  IY K + ES+           +R  + ++
Sbjct: 667 PRGE-----IVDYANKQWSGVVADYFKPRWEIYLKELQESIRKQTAVPTAKLKRMIFNQV 721

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +   + +YP +  GD+++ ++ LY K+
Sbjct: 722 ELPFSYSKKLYPTQPKGDSILIAKELYAKW 751


>gi|431890602|gb|ELK01481.1| Alpha-N-acetylglucosaminidase [Pteropus alecto]
          Length = 740

 Score =  345 bits (884), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 200/569 (35%), Positives = 302/569 (53%), Gaps = 74/569 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP SW  +QL LQ +IL R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 201 MGNLHTWGGPLPFSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVT 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+ +W        + C++LL   DPLF  +G  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 261 QMDSWGHFNCS--YSCSFLLAPEDPLFPIVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 317

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 318 SSEPSYLAAATAAVYQAMTTVDPDAVWLLQGWLFQHQPQFWGPAQVGAVLGAVPRGRLLV 377

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 378 LDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNQGPAAARLFPNSTMVG 437

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ RRYG S    + AW +L  +
Sbjct: 438 TGMAPEGIDQNEVVYALMAELGWRKDPVTDLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 497

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VYNC                       S  + +  N+  P+ +   L+  T+      +W
Sbjct: 498 VYNC-----------------------SGEDCRGHNH-SPLVRRPSLQMVTT------VW 527

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
           Y+ S+V  A  + + +   L+ S  + Y+L+D+TRQA+ +  +  +  +  AY   D   
Sbjct: 528 YNQSDVFEAWRMLLTATPTLATSPLFSYELVDITRQAIQELVSLYYEEVRTAYLNKDLVT 587

Query: 386 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
           +F+ +     EL+  +D +LA    FLLG WLE A+  A ++ +   YE N+R Q+T+W 
Sbjct: 588 LFRAAGILAYELLPSLDNILATDSHFLLGSWLEQARAAAVSKAEASFYEQNSRYQLTLW- 646

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                E ++L DY NK  +GL+ +YY PR  ++ + ++ESL  G  F+   + +   +L 
Sbjct: 647 ---GPEGNIL-DYANKQLAGLIANYYTPRWRLFMEMLVESLVQGIPFQQHQFDKNAFQLE 702

Query: 505 NDWQNGRNVYPVESNGDALITSQWLYNKY 533
             +      YP +  GD +  ++ L+ KY
Sbjct: 703 QTFVFSTQRYPNQPQGDTVDLAKKLFLKY 731


>gi|351699889|gb|EHB02808.1| Alpha-N-acetylglucosaminidase, partial [Heterocephalus glaber]
          Length = 652

 Score =  345 bits (884), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 199/559 (35%), Positives = 297/559 (53%), Gaps = 74/559 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPLP +W  +QL LQ ++L R+  LGM PVLPAF+G+VP A+  VFP   +T
Sbjct: 114 MGNLHGWGGPLPHAWHLKQLYLQHRVLDRMRALGMTPVLPAFAGHVPKAVTRVFPQVNVT 173

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG+W        + C++LL   DPLF  IG  F+ +  +E+G T H Y  DTF+E  PP
Sbjct: 174 QLGSWGHFNCS--YSCSFLLAPGDPLFPLIGSLFLRELNREFG-TDHFYGADTFNEMQPP 230

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 231 SSEPAYLAAATAAVYEAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVGAVLGAVPQGRLLV 290

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+T+  
Sbjct: 291 LDLFAENQPVYTRTASFGGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTVVG 350

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+ +++ +RYG + P    AW +L H+
Sbjct: 351 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLSAWVARFAEQRYGVAQPDAVLAWRLLLHS 410

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VYNC+  A   +                           P+ +   L+  T+      +W
Sbjct: 411 VYNCSGEACRGHNH------------------------SPLVRRPSLQMNTT------VW 440

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
           Y+ S+V  A  L + +   L+AS  +RYDL+D+TRQ L +  +  +     AY   +  G
Sbjct: 441 YNRSDVFEAWRLLLKATPNLTASPAFRYDLLDVTRQGLQELVSLYYEEARAAYMRQELEG 500

Query: 386 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
           + +       +L+  +D +LA    FLLG WLE A+ +A +  +   YE N+R Q+T+W 
Sbjct: 501 LLRAGGVLAYKLLPALDEVLASDHRFLLGSWLEQARAVAVSSAEADLYEQNSRYQLTLW- 559

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                E ++L DY NK  +GL+ DYY PR  ++ + +  SL  G  F+ + +  +   L 
Sbjct: 560 ---GPEGNIL-DYANKQLAGLVADYYVPRWRLFVETLASSLARGVPFQQQQFNSDVFLLE 615

Query: 505 NDWQNGRNVYPVESNGDAL 523
             +   R  YP +  GD +
Sbjct: 616 QAFVLSRKRYPSQPQGDTV 634


>gi|307192254|gb|EFN75548.1| Alpha-N-acetylglucosaminidase [Harpegnathos saltator]
          Length = 741

 Score =  344 bits (882), Expect = 8e-92,   Method: Compositional matrix adjust.
 Identities = 193/569 (33%), Positives = 297/569 (52%), Gaps = 79/569 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPL  +W ++ + LQ +IL R+ +LG+ PVLPAF+G+VP A   +FP+A +T
Sbjct: 194 MGNIRGFGGPLSINWHERTVRLQHRILRRMRDLGIVPVLPAFAGHVPRAFARLFPNANMT 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W   K + ++CC YLL+ TDPLF  IG  F+   + E+G T HIYNCDTF+EN P 
Sbjct: 254 KIEPW--NKFEDKYCCPYLLEPTDPLFQTIGEKFLRMYINEFG-TDHIYNCDTFNENEPG 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
                Y+S++G +++  M + D  A+WLMQGWLF +D  FW  P++++ L SVP G+++V
Sbjct: 311 NSELAYLSNVGRSVFQAMSTVDPQAIWLMQGWLFVHDFIFWTEPRVRSFLTSVPTGRMLV 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDL +E  P +   K +YG P+IWCMLHNF G + M+G    I     E R    +TM  
Sbjct: 371 LDLQSEQFPQYGRLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINQRTFEGRHMNGSTMVG 430

Query: 238 -------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                          AW   Y+ RRYG        AW  L  T+
Sbjct: 431 TGLTPEGINQNYVIYELMNEMAYRHEPVDLDAWFESYATRRYGAWNEYAVAAWKHLGRTI 490

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YN       +   VI   P ++ S                               P +WY
Sbjct: 491 YNFVGIERIRGHYVITRRPSLNIS-------------------------------PWVWY 519

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
           +  +      +F+ +      +  YR+D++D+TRQAL   A+ +++N+++ Y+  +  G 
Sbjct: 520 NREDFYHTWNVFLKARYGRGNNTLYRHDVVDITRQALQLMADNIYMNVVDCYKRKNITGF 579

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
              +   L+L +D++ +LA    FLLG WL  AK +A +E++ + YE+NAR QIT+W  N
Sbjct: 580 QSHAAALLDLFDDIEAILASGSNFLLGTWLAQAKDMAVDEKERQSYEYNARNQITLWGPN 639

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR-REWIKLTN 505
            +     +RDY NK WSG++ DY+ PR A + K + +SL       + +   R ++++  
Sbjct: 640 GE-----IRDYANKQWSGVVADYFKPRWAFFLKALEKSLVERTRLNMTEINDRMFLEVEQ 694

Query: 506 DWQNGRNVYPVESNGDAL-----ITSQWL 529
            +     +YPV + GD L     I S+WL
Sbjct: 695 AFTFSTKLYPVGTKGDTLDIAVKIISKWL 723


>gi|380030624|ref|XP_003698943.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
           [Apis florea]
          Length = 769

 Score =  343 bits (881), Expect = 9e-92,   Method: Compositional matrix adjust.
 Identities = 196/584 (33%), Positives = 303/584 (51%), Gaps = 82/584 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPL  +W ++ + LQ +IL R+  LG+ PVLPAF+G+VP A   +FP A +T
Sbjct: 216 MGNMRGFGGPLSSNWHEKSIRLQHRILERMRALGIIPVLPAFAGHVPRAFLRLFPKANVT 275

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +   W +     ++CC YLL+  DPLF +IG+ F++  ++E+G T H+YNCDTF+EN P 
Sbjct: 276 KSAVWNNFSD--KYCCPYLLEPMDPLFKQIGQQFLKTYIEEFG-TDHVYNCDTFNENEPY 332

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
               +++ ++G +I+  M + DS A+WLMQGWLF +D  FW  P+ +  L S+PLG+++V
Sbjct: 333 TSELKFLRNIGHSIFEAMSNVDSKAIWLMQGWLFYHDSVFWTEPRTRTFLTSIPLGRMIV 392

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDL +E  P +     +YG P+IWCMLHNF G + M+G  + I     EAR    +TM  
Sbjct: 393 LDLQSEQFPQYKRLNSYYGQPFIWCMLHNFGGTLGMFGSAEIINHRIFEARNMNGSTMVG 452

Query: 239 --------------------------------WINQYSVRRYG--RSVPAIQDAWNVLYH 264
                                           W   Y+ RRYG  +       AW    +
Sbjct: 453 TGLTPEGINQNYVIYELMNEMAYRKRPVNLDKWFENYANRRYGDTKGNEHTVTAWKGFKN 512

Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           TVYN +D    + +  I   P+++ S                               P  
Sbjct: 513 TVYNFSDTRRIRGKYAITIRPNLNFS-------------------------------PWR 541

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+    I    + + + +    S  YR+D++D+TRQAL   A+E++ ++IE++   +  
Sbjct: 542 WYNKDAFIHYWYMLLQARDLKRNSTLYRHDVVDVTRQALQLIADEIYTDLIESFNKKNID 601

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
              Q ++  L L +D++ +LA  + FLLG WL+ AK LA N+E+E  YE+NAR QIT+W 
Sbjct: 602 LFKQNAKLLLALFDDLEEILASSEDFLLGKWLKMAKDLATNDEEEILYEYNARNQITLW- 660

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK-L 503
                    +RDY NK WSG++ DY+ PR AI+   +  SL +G          +  + +
Sbjct: 661 ----GPLGEIRDYANKQWSGIVADYFKPRWAIFLNELETSLTTGTRVNTTKMNEQIFENV 716

Query: 504 TNDWQNGRNVYPVESNGDAL-----ITSQWLYNKYLQGTGVFDH 542
              +   R +YP ++ GD++     I S+W Y+ YL     F H
Sbjct: 717 EEAFTFSRKIYPTKATGDSIDIAERILSEW-YDPYLPFHKTFRH 759


>gi|328778968|ref|XP_623833.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Apis mellifera]
          Length = 752

 Score =  343 bits (879), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 196/572 (34%), Positives = 303/572 (52%), Gaps = 85/572 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPL  +W D+ + LQ +IL R+  LG+ PVLPAF+G+VP AL  +FP A +T
Sbjct: 198 MGNMRGFGGPLNSNWHDKSIRLQHRILERMRALGIIPVLPAFAGHVPRALLKLFPKANVT 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +   W +     ++CC YLL+ TDPLF +IG+ F++  ++E+G T H+YNCDTF+EN P 
Sbjct: 258 KSAVWNNFSD--KYCCPYLLEPTDPLFKQIGQQFLKTYIEEFG-TDHVYNCDTFNENEPY 314

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
               +++ ++G +I+  M S DS A+WLMQGWLF +D  FW  P+ +  L SVPLG+++V
Sbjct: 315 TSELKFLRNIGHSIFEAMNSVDSKAIWLMQGWLFYHDSVFWTEPRTRTFLTSVPLGRMIV 374

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDL +E  P +     +YG P+IWCMLHNF G + M+G  + I     EAR    +TM  
Sbjct: 375 LDLQSEQFPQYKRLNSYYGQPFIWCMLHNFGGTLGMFGSAEIINHRVFEARNMNGSTMVG 434

Query: 239 --------------------------------WINQYSVRRYG--RSVPAIQDAWNVLYH 264
                                           W   ++ RRYG  +       AW    +
Sbjct: 435 TGLTPEGINQNYVIYELMNEMAYRKKPVNLDKWFENFANRRYGDIKGNEHTVTAWKGFKN 494

Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           TVYN +D    + + VI   P+++                                 P  
Sbjct: 495 TVYNFSDTRRIRGKYVITIRPNLN-------------------------------FFPWR 523

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+    I    + + + +    S  YR+D++D+TRQAL   A+E++ ++IE++   +  
Sbjct: 524 WYNKDAFIYYWYVLLQARDLKRNSTLYRHDVVDVTRQALQLIADEIYTDLIESFNKKNID 583

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
              Q ++  L L +D++ +LA  + FLLG WL+ AK LA ++E+E  YE+NAR QIT+W 
Sbjct: 584 LFKQNAKLLLALFDDLEEILASSEDFLLGKWLKMAKDLATDDEEEILYEYNARNQITLW- 642

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRREWI 501
                    +RDY NK WSG++ DY+ PR AI+   +  SL +G   +  R+   +R + 
Sbjct: 643 ----GPLGEIRDYANKQWSGIVADYFKPRWAIFLNELETSLTTGTRVNTTRIN--KRIFE 696

Query: 502 KLTNDWQNGRNVYPVESNGDAL-----ITSQW 528
            +   +   R +YP ++ GD++     I S+W
Sbjct: 697 NVEKAFTFSRKIYPTKATGDSIDIAERILSEW 728


>gi|291406137|ref|XP_002719212.1| PREDICTED: alpha-N-acetylglucosaminidase [Oryctolagus cuniculus]
          Length = 743

 Score =  343 bits (879), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 207/576 (35%), Positives = 305/576 (52%), Gaps = 86/576 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP+SW  +QL LQ +IL R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 204 MGNLHTWAGPLPRSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAVTRVFPHINVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG+W        + C++LL   DP+F  IG  F+ +  +E+G T H+Y  DTF+E  PP
Sbjct: 264 QLGSWGHFNCS--YSCSFLLAPEDPMFPLIGSLFLRELTREFG-TDHVYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA++  M + D DAVWL+QGWLF + P FW P Q+KA+LN+VP G+L+V
Sbjct: 321 SSEPSYLAAATAAVFEAMIAVDPDAVWLLQGWLFQHQPQFWGPSQVKAVLNAVPRGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 381 LDLFAENQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMVG 440

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ RRYG + P    AW +L  +
Sbjct: 441 TGIAPEGISQNEVVYALMAELGWRKEPVPDLEAWVTSFAGRRYGVAHPDAGAAWRLLLRS 500

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ D     NR  +V  P                          L+  T+      +
Sbjct: 501 VYNCSGDACRGHNRSPLVRRPS-------------------------LQLNTT------V 529

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 378
           WY+ S+V  A  L + +   L++S  +RYDL+D+TRQA+ +  +  +     AY      
Sbjct: 530 WYNRSDVFEAWRLLLKATPTLASSPAFRYDLLDVTRQAVQELVSLYYEEARTAYLHKELA 589

Query: 379 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 438
            L  A GV        EL+  +D +LA    FLLG WLE A+  A +E + + YE N+R 
Sbjct: 590 TLLRAGGVLA-----YELLPALDRVLATDSRFLLGSWLEQARAAAASEAEAQLYEQNSRF 644

Query: 439 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 498
           Q+T+W      E ++L DY NK  +GL+  YY PR  ++ + + +SL  G  F+ + + +
Sbjct: 645 QLTLW----GPEGNIL-DYANKQLAGLVAQYYSPRWQLFLEALADSLARGVPFQQRLFDK 699

Query: 499 EWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
              +L   +      YP +  GD +  +Q ++ KY 
Sbjct: 700 LVFRLEQAFVLSSRRYPTQPQGDTVDLAQKIFLKYF 735


>gi|325103828|ref|YP_004273482.1| alpha-N-acetylglucosaminidase [Pedobacter saltans DSM 12145]
 gi|324972676|gb|ADY51660.1| Alpha-N-acetylglucosaminidase [Pedobacter saltans DSM 12145]
          Length = 738

 Score =  342 bits (876), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 205/578 (35%), Positives = 298/578 (51%), Gaps = 88/578 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  WGGPLPQSW+D    LQKKIL R  ELGM PVLPAF+G+VP +    FP AK+ 
Sbjct: 195 MNNMDAWGGPLPQSWIDSHKDLQKKILARQRELGMIPVLPAFTGHVPKSFVKKFPEAKVD 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
                 SV     +   Y+L+  DP+F +IG  F+++Q +EYG T H Y+ D F+E  PP
Sbjct: 255 ------SVNWQGNFPNIYMLNPNDPMFSKIGEQFLKEQTREYG-TDHYYSSDIFNELNPP 307

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY---DPFWRPPQMKALLNSVPLGKL 177
              P+Y+  +   +YS M+  D  +VW+MQ WLF       FW P +M+A L  VP  KL
Sbjct: 308 SSDPKYLYDISEKVYSSMKKVDPKSVWVMQAWLFVSAHGRKFWTPERMQAFLKPVPDDKL 367

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE---- 233
           ++LDL+ E +P W  ++ +YG  ++W MLHNF GNI ++G   +IA  P    +      
Sbjct: 368 IILDLYTENRPRWKNTEGYYGKKWVWNMLHNFGGNIGLFGKAQTIASEPARVLSDPMKGN 427

Query: 234 ----NTTM--------------------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
                 TM                           W N+Y  RRYG        AW +L 
Sbjct: 428 YSGIGLTMEGIEQNPFIYQLMLDHVWNNEPIELEKWTNKYITRRYGVLDNNAVKAWEILL 487

Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
           +TVY       D N+D          SI+S         G+P        ++ S +    
Sbjct: 488 NTVY------KDNNKD-----QGAPESILS---------GRPTF------AQNSYWTWTD 521

Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
           L+Y   E +RA +  I S ++L  S+ ++YD++D+TRQA+A YA  L   +   Y   D 
Sbjct: 522 LYYDNREFVRAWDYLIKSADKLRNSDGFQYDIVDITRQAMANYATALQRQLAYTYYAGDV 581

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
           +   + SRRFLEL+ D+D LLA    FLLG W++ AK+ A N+ + K YE+NA+  ++MW
Sbjct: 582 NTYEKESRRFLELLSDLDRLLATRKDFLLGIWIDDAKKWATNDAERKLYEFNAKDLVSMW 641

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR-------LKDW 496
                 +   + DY  + WSGL+ +YY  R  I+F   ++ L++ + +        +KDW
Sbjct: 642 ----GHKDITINDYSARQWSGLVENYYKQRWKIFFDQSLQKLKNNEIWDQAEFEKYIKDW 697

Query: 497 RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
             EW     +W N R  YP  + GD +  S+ +YNKY 
Sbjct: 698 --EW-----NWVNRRETYPTNTKGDPVNVSKEMYNKYF 728


>gi|374385255|ref|ZP_09642763.1| hypothetical protein HMPREF9449_01149 [Odoribacter laneus YIT
           12061]
 gi|373226460|gb|EHP48786.1| hypothetical protein HMPREF9449_01149 [Odoribacter laneus YIT
           12061]
          Length = 736

 Score =  338 bits (868), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 198/557 (35%), Positives = 300/557 (53%), Gaps = 52/557 (9%)

Query: 2   SNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
            N+ GW GPLP+SW++    LQKKIL R  ELGM P+LPAF+G+VP   +  FP A++ Q
Sbjct: 199 GNIDGWCGPLPKSWMESHEELQKKILARERELGMTPILPAFTGHVPPTFKEHFPEARLRQ 258

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
           + NW     + R+  TYLL+A DPLF  IG  F+E+Q++ +G T H+Y  DTF+E  PP 
Sbjct: 259 V-NW-----EGRFDDTYLLEADDPLFQTIGNRFMEEQIRTFG-TDHLYGADTFNEMFPPS 311

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVVL 180
           +   Y+  +  A+Y  M + D +AVW+MQGWLF     FW+P QMKA L +VP   L+VL
Sbjct: 312 EDSTYLDGISKAVYQSMAAVDPEAVWVMQGWLFHDKRDFWKPAQMKAYLGAVPDEHLIVL 371

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP--VEARTSENTTMA 238
           DL+ E  PIW  ++ FYG P+IWCMLHNF G   ++G    +A  P  V A  ++   + 
Sbjct: 372 DLWGEEFPIWDRTEAFYGKPWIWCMLHNFGGRNMLFGNALKLAEEPSRVLADPAKGQLLG 431

Query: 239 W------INQYSV------RRYGRSVPAIQDAWNVLY-HTVYNCTDGATDKNRDVIVAFP 285
                  I Q  V          R+     D W   Y  + Y C D A +K  D++    
Sbjct: 432 LGAVPEGIEQNPVIYSLLFSHIWRNTAVELDEWFETYLESRYGCRDEAVEKAWDIL---- 487

Query: 286 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 345
                 +   EG   NY   ++     +   +++ +  + Y+  EVI+A +  + + + L
Sbjct: 488 ---RKTVYANEG---NYESAITARPTFEKH-NNWAYTDIPYNPVEVIKAWKYLLQAADRL 540

Query: 346 SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLA 405
             +  YRYDLI + +Q LA YA  +     E Y+  D     + SR F+EL++DMD L+ 
Sbjct: 541 GENPCYRYDLILVGKQVLANYATIIQQKFGEDYRTKDLPAFTRNSREFMELIDDMDELMG 600

Query: 406 CHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGL 465
            H+ FLLG WLE A+   +   +++ YE NAR QIT+W        ++L DY +K WSGL
Sbjct: 601 THEAFLLGKWLEDARSWGKTASEKQLYEKNARDQITLWGGKD----AVLHDYASKQWSGL 656

Query: 466 LRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREWIKLTNDWQNGRNVYPVES 518
            + +Y  R  ++   + + +++G  +       R++ W  EW+       NG+  YP   
Sbjct: 657 FKGFYKGRWQLFIDEVYDCIKTGRKYDHTASDDRVRSWEWEWV-------NGQEKYPAVP 709

Query: 519 NGDALITSQWLYNKYLQ 535
            GD ++ S+ ++ KY++
Sbjct: 710 QGDPVVVSERMFGKYIK 726


>gi|403304646|ref|XP_003942904.1| PREDICTED: alpha-N-acetylglucosaminidase [Saimiri boliviensis
           boliviensis]
          Length = 754

 Score =  338 bits (866), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 199/569 (34%), Positives = 303/569 (53%), Gaps = 74/569 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP++W  +QL LQ +IL R+   GM PVLPAFSG+VP A+  VFP   +T
Sbjct: 214 MGNLHTWDGPLPRAWHIKQLYLQHRILDRMRSFGMIPVLPAFSGHVPRAINRVFPRVNVT 273

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+G+W        + C++LL   DP+F  +G  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 274 QMGSWGHFNCS--YSCSFLLAPEDPIFPILGSLFLRELTKEFG-TDHIYGADTFNEMQPP 330

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D+DAVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 331 SSEPSYLAAATAAVYEAMIAVDTDAVWLLQGWLFQHQPQFWGPAQVRAVLGAVPRGRLLV 390

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 391 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPEAARLFPNSTMVG 450

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ +RYG S P    AW +L  +
Sbjct: 451 TGMAPEGINQNEVVYSLMAELSWRKDPVPDLAAWVTSFATQRYGVSHPDAGAAWRLLLRS 510

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VYNC+  A   +                           P+ +   L+  T+      +W
Sbjct: 511 VYNCSGEACRGHNH------------------------SPLVRRPSLQMNTT------VW 540

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
           Y+ S+V  A  L +++   L+AS T+RYDL+D+TRQA+ +     +     AY   + H 
Sbjct: 541 YNRSDVFEAWRLLLSAAATLAASPTFRYDLLDVTRQAVQELVGLYYEEARSAYLSKELHS 600

Query: 386 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
           + +       EL+  +D +LA    FLLG WLE A+ +A +E +   YE ++R Q+T+W 
Sbjct: 601 LLRAGGILAYELLPALDEVLASDSHFLLGSWLEQARAVAVSEAEADFYEQSSRYQLTLW- 659

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                E ++L DY NK  +GL+  YY PR  ++ + +  S+  G  F    + +   +L 
Sbjct: 660 ---GPEGNIL-DYANKQLAGLVASYYTPRWRLFLEVLAASVAQGIPFPQHQFDKNVFQLE 715

Query: 505 NDWQNGRNVYPVESNGDALITSQWLYNKY 533
             +   +  YP +  GD +  ++ ++ KY
Sbjct: 716 QAFVLSKQRYPSQPRGDTVDLAKKIFLKY 744


>gi|255553488|ref|XP_002517785.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
 gi|223543057|gb|EEF44592.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
          Length = 360

 Score =  335 bits (858), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 164/359 (45%), Positives = 232/359 (64%), Gaps = 39/359 (10%)

Query: 215 MYGILDSIAFGPVEARTSENTTMA---------------------------------WIN 241
           MYGILDSI+ GP+EAR SEN+TM                                  W+ 
Sbjct: 1   MYGILDSISTGPIEARVSENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSEKVQVLEWLK 60

Query: 242 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 301
            YS RRYG++V  ++ AW +LYHT+YNCTDG  D N D IV FPD DPS+ S ++   Q+
Sbjct: 61  TYSRRRYGKAVHQVEAAWEILYHTIYNCTDGIADHNTDFIVKFPDWDPSVQSGSDTSQQD 120

Query: 302 -----YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 356
                  +  S+  + +   S+    H+WYS  +VI AL+LFI  G+ L+ S TYRYDL+
Sbjct: 121 NKHIFLHRSGSRRFLFEGPNSTLPQAHIWYSIQKVINALQLFIDGGSHLTGSLTYRYDLV 180

Query: 357 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 416
           DLTRQ L+K AN+++++ I A++ NDA  +   S++F++L++D+D LLA  D FL+G WL
Sbjct: 181 DLTRQVLSKLANQVYVDAIIAFRSNDARALNLHSQKFIQLIKDIDVLLASDDNFLIGTWL 240

Query: 417 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 476
           ESAK+LA N  + +QYEWNARTQ+TMW+D T+   S L DY NK+WSGLL DYY PRA+ 
Sbjct: 241 ESAKELALNPSEMRQYEWNARTQVTMWYDTTKTNQSKLHDYANKFWSGLLEDYYLPRAST 300

Query: 477 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNG-DALITSQWLYNKYL 534
           YF ++++SL+  + F+L++WR +WI  +N+WQ G  +YP++ +G DAL  S+ LY+KY 
Sbjct: 301 YFDHLVKSLKQNEKFKLQEWREKWIAFSNEWQAGTKLYPMKGSGDDALAISKALYDKYF 359


>gi|320162905|gb|EFW39804.1| lysosomal alpha-N-acetyl glucosaminidase [Capsaspora owczarzaki
           ATCC 30864]
          Length = 786

 Score =  333 bits (855), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 194/571 (33%), Positives = 297/571 (52%), Gaps = 71/571 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGP+   W+ +Q  LQ  IL R+   GM PVLP+F+G+VP+AL   FP+A IT
Sbjct: 238 MGNIKGWGGPISLEWIYKQRNLQVLILQRMRTFGMTPVLPSFAGHVPSALAQHFPNANIT 297

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q  +W +     ++CC   LDA+DPLF +IG  F+  Q + YG T+H+YNCD F+E TP 
Sbjct: 298 QSSDWNNFPD--QYCCVGFLDASDPLFTQIGAEFLRLQNETYG-TNHLYNCDQFNEMTPA 354

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
                Y+   G A+Y  M + D  AVW+MQGWLF  +  +W   +++ALL+ VP   +++
Sbjct: 355 STDLGYLKQAGMAVYQSMTAYDPAAVWVMQGWLFFNEAAWWSNDRVQALLSGVPDDHMII 414

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLF++V P+W+  + +YG P+IW MLH+F GNI +YGIL SI  GP  A  +   TM  
Sbjct: 415 LDLFSDVTPVWNRLESYYGKPFIWNMLHDFGGNIGLYGILPSINEGPFAALATPGNTMVG 474

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQD-AWNVLYHT 265
                                           W++ +  RRYG S PA+   A+  L  +
Sbjct: 475 IGLTPEGINQNYILYEFMMENMWRSAPVNLPTWVDAFVGRRYGPSTPAVAKLAYQQLLQS 534

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VYNCT+G     + ++   P V+                 +S+   + +        +L+
Sbjct: 535 VYNCTNGQYSVTKSLLEIRPAVN-----------------MSRNGFMPT--------NLY 569

Query: 326 YSTSEVIRALELFIA---SGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
           Y    VI A++  +A   S  +L++   +RYD++D TRQ L+  A +   N+  A     
Sbjct: 570 YDPGHVILAVDHILAAAKSAPQLASVVPFRYDVVDFTRQMLSNLAIDFHSNLTLALTSKQ 629

Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
           AH V    +  + L+ D+D LL     FLLGPWL +A+  ++N   +   E+NAR QIT+
Sbjct: 630 AHLVHLYGQGIVGLIADLDELLVSDAHFLLGPWLAAARSWSENTAAQDLLEFNARNQITL 689

Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
           W  N +     + DY +K W+GL+  YY PR  ++  +   + ES   F    +    ++
Sbjct: 690 WGPNGE-----ITDYASKQWAGLMSSYYRPRWELFVSFASAAAESDLPFNDAAFNAAVLE 744

Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
           +   WQ+  + + V   GD++  +  L  KY
Sbjct: 745 VEKAWQHSHHNFTVTPLGDSIAIATRLRAKY 775


>gi|332018247|gb|EGI58852.1| Alpha-N-acetylglucosaminidase [Acromyrmex echinatior]
          Length = 686

 Score =  333 bits (855), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 190/571 (33%), Positives = 297/571 (52%), Gaps = 74/571 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPL  +W +  + LQ +IL R+ +LG+ PVLPAF+G+VP A   +FP+A +T
Sbjct: 152 MGNIRGFGGPLSSNWHNYTIRLQHQILQRMRDLGIVPVLPAFAGHVPRAFARLFPNANMT 211

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W   K + ++CC YLL+ TDPLF  IG  F++  + E+G T HIYNCDTF+EN P 
Sbjct: 212 KINPW--NKFEDKYCCPYLLEPTDPLFRTIGEKFLQMYIDEFG-TDHIYNCDTFNENEPG 268

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
                Y+ ++G +I+S M + DS A+WLMQ WLF +D  FW   +++A L SVP+G+++V
Sbjct: 269 NTELIYLRNVGHSIFSAMNAVDSKAIWLMQAWLFVHDIMFWTKSRVRAFLTSVPIGRMLV 328

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDL +E  P +   K +YG P+IWCMLHNF G + M+G    I     E R   ++TM  
Sbjct: 329 LDLQSEQFPQYDRLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINQRTFEGRNMNDSTMVG 388

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W   Y+ RRYG        AW  L  TV
Sbjct: 389 TGLTPEGINQNYVIYELMNEMAYRHVPVNLDNWFESYATRRYGAWNEYAVAAWQHLGRTV 448

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YN       +   VI   P ++ S+ +                               WY
Sbjct: 449 YNFIGTQKIRGHYVITRRPSLNISLWT-------------------------------WY 477

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
              +      +F+ +      +  YR+D++D+TRQAL   A+++++ I++ Y+  +    
Sbjct: 478 DRKDFYAMWNMFLKARYGRGNNTLYRHDVVDITRQALQLIADDIYMTILDCYKKKNITAF 537

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
              +   LEL +D++ +LA  + FLLG WL  AK +A NEE+ + YE+NAR QIT+W  N
Sbjct: 538 QSSANALLELFDDLESILASGNNFLLGTWLAQAKDIAVNEEERRSYEYNARNQITLWGPN 597

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR-REWIKLTN 505
            +     +RDY NK WSG++ DY+  R  ++ K + +SL       + +   R + ++  
Sbjct: 598 GE-----IRDYANKQWSGVVADYFKLRWELFLKALEKSLIQRIEPNITEINDRIFHEVER 652

Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKYLQG 536
            +     +YP+E+ GD +  +  + +K+ +G
Sbjct: 653 SFTFSTKLYPIETKGDTIDIAMKIISKWYKG 683


>gi|307168312|gb|EFN61518.1| Alpha-N-acetylglucosaminidase [Camponotus floridanus]
          Length = 737

 Score =  332 bits (851), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 186/555 (33%), Positives = 284/555 (51%), Gaps = 74/555 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPL  +W ++ + LQ +IL R+  LG+ PVLPAF+G+VP A   +FP+A +T
Sbjct: 221 MGNIRGFGGPLSTNWHNRTIHLQHQILRRMRNLGIVPVLPAFAGHVPRAFARLFPNANMT 280

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W + +   ++CC YLL+ TDPLF  IG  F+   + E+G T HIYNCDTF+EN P 
Sbjct: 281 KINPWNNFED--KYCCPYLLEPTDPLFQIIGEKFLRMYINEFG-TDHIYNCDTFNENEPG 337

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
                Y+ ++  A+++ + + DS A+WLMQ WLF +D  FW  P++K+ L SVP+G++++
Sbjct: 338 STELIYLRNVSHAVFAAINAVDSKAIWLMQAWLFVHDFMFWTEPRVKSFLTSVPMGRMLI 397

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDL +E  P +   K +YG P+IWCMLHNF G + M+G    I     E R    +TM  
Sbjct: 398 LDLQSEQFPQYGRLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINQRTFEGRNMNGSTMVG 457

Query: 238 -------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                          AW   Y+ RRYG         W  L  TV
Sbjct: 458 TGLTPEGINQNYVIYELMNEMAYRHEPVDLDAWFQNYATRRYGAWNEYAVTTWQYLGRTV 517

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YN       +   V+   P ++ S+                                +WY
Sbjct: 518 YNFIGSQRIRGHYVVTRRPSLNISL-------------------------------WIWY 546

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
           +          F+ + +    S  YR+D++D+TRQAL    ++L+  I+++Y+  +    
Sbjct: 547 NRKNFYSMWNTFLKARHGRRNSTLYRHDVVDITRQALQLMGDDLYTIILDSYKKRNITAF 606

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
              +   LEL +D++ +LA    FLLG WL  AK +A NEE+ K YE+NA+ QIT+W  N
Sbjct: 607 RSSANALLELFDDLESILASGSNFLLGTWLSQAKDVATNEEERKSYEYNAKNQITLWGPN 666

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE-WIKLTN 505
            +     +RDY NK WSG++ DY+ PR  ++ K + +SL     F + +   + + K+  
Sbjct: 667 GE-----IRDYANKQWSGVMADYFKPRWELFLKALEKSLVENTKFNVTEINNKIFDKVER 721

Query: 506 DWQNGRNVYPVESNG 520
            +      YPVE  G
Sbjct: 722 PFTFSTKFYPVEPKG 736


>gi|196001339|ref|XP_002110537.1| hypothetical protein TRIADDRAFT_54660 [Trichoplax adhaerens]
 gi|190586488|gb|EDV26541.1| hypothetical protein TRIADDRAFT_54660 [Trichoplax adhaerens]
          Length = 757

 Score =  330 bits (847), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 193/567 (34%), Positives = 291/567 (51%), Gaps = 72/567 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  W GPLP  W+++Q+ LQ KIL R+ + GM P+LPAF+GN+P AL  ++P AKI 
Sbjct: 217 MGNIQRWAGPLPHDWINKQITLQVKILDRMRKYGMLPILPAFNGNIPNALTKIYPKAKIV 276

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +   WF      R+  T LLD  D LFI I + FIE+++K YG T H+Y+ D F+E  P 
Sbjct: 277 KSSPWFGFSK--RYGETALLDPRDKLFIVISKLFIEEEIKAYG-TDHLYSLDLFNEIDPQ 333

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
               EY++++  + Y  + S D+ AVW+MQGW+F  D + W   +++A L+ +P G++V+
Sbjct: 334 SKELEYLTAVSKSAYLALNSADTKAVWIMQGWMFYNDNYYWENKRIQAFLSPIPKGRIVI 393

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLFAEV+P +  S  F+G P+IWCML+NF GN  MYG  ++I  G + A   +N+TM  
Sbjct: 394 LDLFAEVEPQYHRSNSFFGHPFIWCMLNNFGGNAGMYGTFETITEGAISAYDMKNSTMIG 453

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W+  Y+ RRYG     I  AW  L  TV
Sbjct: 454 TGMAPEGIGNNYIMYDLMAEMGWRKIAVDVRDWVVVYTERRYGGLDENIIKAWLRLSETV 513

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YNC D      R    A P V PS+    +                           +WY
Sbjct: 514 YNCNDMRQYHCR----ALPAVRPSLKIAND---------------------------VWY 542

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
           S  ++  A E  + + NE  +  T++YD++D+TRQAL + A  ++  + + Y  N+   +
Sbjct: 543 SADDIFFAWEHMLRANNEFISEETFQYDIVDVTRQALQELAFIMYKKVTQCYHDNNQETL 602

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
                  +EL  DMD LL  +  FLLG W+  A Q + N   ++Q  +NA  QIT+W   
Sbjct: 603 KTAGGELIELFTDMDTLLGTNSHFLLGRWVADALQHSNNISIKQQLRFNALNQITLW--- 659

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
                S+L DY NK W+GL+  +Y  R  ++ K + +S+ +   F  + +     K    
Sbjct: 660 -GPSKSILHDYANKMWNGLVDKFYKKRWLMFIKALSDSISNNILFDQQKFNLAVQKFEAA 718

Query: 507 WQNGRNVYPVESNGDALITSQWLYNKY 533
           W +  N Y   S+G ++  S+ L++KY
Sbjct: 719 WASENNTYATTSSGSSVTVSKQLFSKY 745


>gi|390463730|ref|XP_003733088.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
           [Callithrix jacchus]
          Length = 830

 Score =  328 bits (842), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 199/563 (35%), Positives = 293/563 (52%), Gaps = 69/563 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W  PLP +W  +QL LQ  IL R+   GM PVLP F G+VP A+  VFP   +T
Sbjct: 297 MGNLHTWDAPLPHAWHIKQLYLQHWILDRMRSFGMVPVLPMFLGHVPKAITRVFPRVSVT 356

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+G+W        + C++LL   DP+F  +G  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 357 QMGSWGHFNCS--YSCSFLLAPEDPIFPILGSLFLRELTKEFG-TDHIYGADTFNEMQPP 413

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D+DAVWL+QGWLF Y P FW P Q++A+L S P G L+V
Sbjct: 414 SSEPSYLAAATAAVYEAMIAVDTDAVWLLQGWLFQYQPQFWGPAQVRAVLGSAPHGCLLV 473

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 474 LDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLFGALEAMNRGPEAARLFPNSTMVG 533

Query: 239 ------WINQYSVR--------------------RYGRSVPAIQDAWNVLYHTVYNCTDG 272
                  I+Q  V                      YG S P    AW +L  +VYNC+  
Sbjct: 534 TGMAPEGISQNXVVYSLMAELSWXKDPVPDLVAWXYGVSHPDTGAAWRLLLRSVYNCSGE 593

Query: 273 ATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVI 332
           A   +                           P+ +   L+  T+      +WY+ S+V 
Sbjct: 594 ACRGHNH------------------------SPLVRRPSLQMNTT------IWYNQSDVF 623

Query: 333 RALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRR 392
            A  L  ++   L+AS T+RYDL+D+TRQ + +  +  +     AY L+   G    +  
Sbjct: 624 EAWRLLFSAAATLAASPTFRYDLLDVTRQVVQELVSLYYEEARSAY-LSKELGSLLRAGG 682

Query: 393 FL--ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEE 450
            L  EL+  +D +LA    FLLG WLE A+ +A +E +   YE N+R Q+T+W      E
Sbjct: 683 ILAYELLPALDEVLASDSHFLLGSWLEQARAVAVSEAEADFYEQNSRYQLTLW----GPE 738

Query: 451 ASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNG 510
            ++L DY NK  +GL+  YY PR  ++ + +  S+  G  F+   + +   +L   +   
Sbjct: 739 GNIL-DYANKQLAGLVAHYYAPRRRLFLEALAASVAQGIPFQQHQFDKNVFQLEQAFVLS 797

Query: 511 RNVYPVESNGDALITSQWLYNKY 533
           +  YP +  GD +  ++ ++ KY
Sbjct: 798 KQRYPSQPRGDTVDLAKKIFLKY 820


>gi|194216885|ref|XP_001917396.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
           [Equus caballus]
          Length = 744

 Score =  323 bits (829), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 203/575 (35%), Positives = 300/575 (52%), Gaps = 86/575 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPL +SW  +QL LQ +IL R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 205 MGNLHTWDGPLTRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVT 264

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG+W        + C++LL   DPLF  +G  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 265 QLGSWGHFNCS--YSCSFLLAPEDPLFPVVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 321

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF +   FW P Q+ A+L +VP G+L+V
Sbjct: 322 SSEPAYLAAATAAVYQAMTAVDPDAVWLLQGWLFHHQRTFWGPAQVGAVLGAVPRGRLLV 381

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 382 LDLFAESQPMYIRTASFQGQPFIWCMLHNFGGNQGLFGALEAVNRGPAAARLFPNSTMVG 441

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ RRYG S    + AW +L  +
Sbjct: 442 TGMTPEGIGQNEVVYALMAELGWRKDPVADLEAWVTSFAARRYGVSHKDAETAWKLLLRS 501

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VYNC+  A   +                           P+ K   L+  T+      +W
Sbjct: 502 VYNCSAEAYSGHNQ------------------------SPLVKRPSLQMGTT------VW 531

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY-------ANELFLNIIEAY 378
           Y+ S+V  A  L + +   L++S  + YDL+D+TRQA  +        A   +LN  E  
Sbjct: 532 YNRSDVFEAWWLLLTAAPALASSPAFLYDLVDVTRQAAQELISLYYEEARTAYLN-KELV 590

Query: 379 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 438
            L  A G+        EL+  +D +LA    FLLG WL+ A+++A +E +   YE N+R 
Sbjct: 591 PLLRAGGILA-----YELLPALDKVLASDSRFLLGSWLKQAREMAVSEAEAHFYEQNSRY 645

Query: 439 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 498
           Q+T+W      E ++L DY NK  +GL+ DYY PR  ++ + +++SL  G  F+ + + +
Sbjct: 646 QLTLW----GPEGNIL-DYANKQLAGLVADYYTPRWQLFVEMLVQSLAQGVPFQQQQFDK 700

Query: 499 EWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +L   +      YP +  GD +  ++  + KY
Sbjct: 701 NAFELEEAFVLSTRRYPSQPQGDTVDLAKKFFLKY 735


>gi|332260899|ref|XP_003279518.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
           [Nomascus leucogenys]
          Length = 736

 Score =  322 bits (826), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 186/551 (33%), Positives = 288/551 (52%), Gaps = 76/551 (13%)

Query: 20  LVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYL 79
           L LQ ++L R+     +PVLPAF+G+VP A+  VFP   +T++G+W        + C++L
Sbjct: 216 LFLQHRVLDRMRSSAXDPVLPAFAGHVPEAVTRVFPRVNVTKMGSWGHFNCS--YSCSFL 273

Query: 80  LDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQ 139
           L   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP   P Y+++   A+Y  M 
Sbjct: 274 LAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMI 332

Query: 140 SGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYG 198
           + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L+VLDLFAE +P+++ +  F G
Sbjct: 333 AVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRLLVLDLFAESQPVYTRTASFQG 392

Query: 199 VPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--------------------- 237
            P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM                     
Sbjct: 393 QPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMA 452

Query: 238 -------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVA 283
                        AW+  ++ +RYG S P    AW +L  +VYNC+ +     NR  +V 
Sbjct: 453 ELGWRKDPVPDLAAWVTSFAAQRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVR 512

Query: 284 FPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGN 343
            P                          L+  TS      +WY+ S+V  A  L + S  
Sbjct: 513 RPS-------------------------LQMNTS------IWYNRSDVFEAWRLLLTSAP 541

Query: 344 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDG 402
            L+AS  +RYDL+DLTRQA+ +  +  +     AY   +   + +       EL+  +D 
Sbjct: 542 SLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLRKELASLLRAGGVLAYELLPALDE 601

Query: 403 LLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYW 462
           +LA    FLLG WLE A+  A +E +   YE N+R Q+T+W      E ++L DY NK  
Sbjct: 602 VLASDSRFLLGSWLELARAAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQL 656

Query: 463 SGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDA 522
           +GL+ +YY PR  ++ + + +S+  G  F+   + +   +L   +   +  YP +  GD 
Sbjct: 657 AGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDT 716

Query: 523 LITSQWLYNKY 533
           +  ++ ++ KY
Sbjct: 717 VDLAKKIFLKY 727


>gi|321472423|gb|EFX83393.1| hypothetical protein DAPPUDRAFT_301977 [Daphnia pulex]
          Length = 799

 Score =  322 bits (824), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 181/531 (34%), Positives = 285/531 (53%), Gaps = 76/531 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N   WGGPL  +W    L+LQ KIL R+   GM PVLPAF+G+VP A++ V+P+A  T
Sbjct: 212 MGNFRAWGGPLSDNWQQATLILQHKILERMRSFGMTPVLPAFAGHVPRAMERVYPNASYT 271

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            L +W + +   ++CC   L  T+PLF EIG  FI++   E+G + H+YNCD F+E  P 
Sbjct: 272 HLTSWLNFQD--QYCCPLFLQPTEPLFTEIGSRFIKEMALEFG-SDHVYNCDVFNEVRPT 328

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P ++SS+G A+++ M + D DA+WLMQGWLF  D  +W     KALL SVP G++++
Sbjct: 329 QADPVFVSSVGTAVFNAMTTADPDAIWLMQGWLFKSDADYWTADLSKALLTSVPQGRMLI 388

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDL AE+ P +     FYG P+++C+LHNF G + + G +  I+   ++AR   N+TM  
Sbjct: 389 LDLQAELDPQYIRLNSFYGQPFVFCLLHNFGGTLGLNGAIQIISQRVIDARNFPNSTMVG 448

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W ++Y+VRRYG +  A+  AW  L ++V
Sbjct: 449 TGLTMEGIDQNYVVYDKMLEMGWRDKVPNLNQWFDEYTVRRYGVNNTAVMSAWRFLQNSV 508

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YN +   + + + V+V  P +                               +  P +WY
Sbjct: 509 YNDSSRRSFRGQYVLVTRPAL-------------------------------WQLPFVWY 537

Query: 327 STSEVIRALELFIA---SGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
           +  +VI A +  I+   +   LS ++ +R+D++DLTRQ++ +  + L+  ++E Y   ++
Sbjct: 538 NPHDVILAWDHLISGLMTEPLLSNASNFRHDMVDLTRQSMQEIFHLLYSQLLEVYLEKNS 597

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
             +  ++ + ++L++D+D LL     FLLG W+  AK     E ++ QYEWNAR QIT+W
Sbjct: 598 TAIEGIAYKMIDLLQDLDELLQTGKKFLLGKWIADAKSWGTTEGEKLQYEWNARNQITLW 657

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 494
               +     +RDY  K W+G++ DYY PR  ++ + M  SL+    F  K
Sbjct: 658 GPRGE-----IRDYAAKQWAGVVADYYKPRWEVFIREMQMSLDENRAFNKK 703


>gi|301106961|ref|XP_002902563.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
           infestans T30-4]
 gi|262098437|gb|EEY56489.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
           infestans T30-4]
          Length = 684

 Score =  320 bits (820), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 190/577 (32%), Positives = 296/577 (51%), Gaps = 74/577 (12%)

Query: 1   MSNLHG-W-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
           M NL G W  GPLPQ+++D Q  LQ KIL R+ E GM P LPAF+G+VP  ++ +FP+AK
Sbjct: 121 MGNLRGSWVKGPLPQAFIDSQYALQLKILNRMREFGMIPALPAFAGHVPEEMKALFPNAK 180

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
            T+  NW     +  +CC Y+LD +DPL+ +IG+ F+E+Q   Y  TS +Y CDT++E  
Sbjct: 181 FTRSPNWGDFSDE--FCCVYMLDFSDPLYYDIGKTFLEEQRALYDYTSSLYQCDTYNEMD 238

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
           P    P  + +   A+   M + D++AVWL+QGWLF   P +W   ++KA L+ V   K+
Sbjct: 239 PDFTDPAKLQAASRAVIDSMTAADANAVWLIQGWLFENSPDYWTKNRVKAYLDGVSNEKM 298

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LDL++EV+P+WS    ++G  +++C+LHNF GN  M G L ++   PV+A    N TM
Sbjct: 299 IILDLYSEVRPVWSKMDNYFGKSWVYCVLHNFGGNTGMRGDLATLGTAPVQASRDSNGTM 358

Query: 238 ---------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYH 264
                                             W+  ++ +RY       + AW  L  
Sbjct: 359 IGVGLTMEGIYQNYVVYDLTLQMAWVDTPLDMDEWVPSFAAQRYHSQDVHTERAWGFLLQ 418

Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           +VYN T G     + +I   P                + K V    +  S T        
Sbjct: 419 SVYNRTLGFGGVTKSLICLIP----------------HWKLVRDGFMPTSIT-------- 454

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ--LND 382
            Y   ++ RA +  + +G+EL A +TYR+DL+D+TRQ L+ +    +L++ E Y+     
Sbjct: 455 -YDPMDITRAWKELLLAGSELHAVDTYRHDLVDVTRQFLSDHFMAQYLHLKEMYEGKTQP 513

Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ------EKQYEWNA 436
           AH +   + R L  +E MD +LA ++  LLG W+  A+ LA+  E       +  YE+ A
Sbjct: 514 AHQLCAWTERMLLTIERMDEILATNEDSLLGNWIADARALAEESESIESSNLQDYYEYEA 573

Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
           R Q+T W DN  E    + DY  K W+GL++ YY PR  ++   + ++   G     +  
Sbjct: 574 RNQVTRWGDNNSET---IHDYAGKEWAGLVKGYYLPRWRMWLGEVCQAYTQGRTINKEVV 630

Query: 497 RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
           ++  I     WQ     YP  + GDAL+ SQ +Y+++
Sbjct: 631 KKARIAFELKWQLSHEHYPTTTVGDALVVSQRIYDEF 667


>gi|255533286|ref|YP_003093658.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
 gi|255346270|gb|ACU05596.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
          Length = 734

 Score =  320 bits (819), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 189/577 (32%), Positives = 296/577 (51%), Gaps = 92/577 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  WGGP+ ++++ +Q  LQKKIL R   LGM P+LP+F+G+VP + ++ FP  K+ 
Sbjct: 190 MGNLDAWGGPMSKNFMAKQEALQKKILARERALGMTPILPSFTGHVPPSFKDKFPDIKVN 249

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
                + +   P     Y+L+   P+F EIGR F+   +  +G T H+Y+ DTF+E TP 
Sbjct: 250 TQQ--WGINVSP----AYVLNPETPMFKEIGRKFLTALINTFG-TDHLYSADTFNEMTPV 302

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            +   Y++ +   IY  M + D+ AVW+MQGW+F   P FW+P QMKAL ++VP  KL+V
Sbjct: 303 SNDSTYLNGMAKKIYESMAAVDTQAVWIMQGWMFLDRPNFWQPTQMKALFSAVPQDKLIV 362

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMA 238
           LDL +E+ P+WS +  FYG  +IWCMLHNF G + M+G +  I   P  A +  +   M+
Sbjct: 363 LDLNSELNPVWSRTDAFYGEKWIWCMLHNFGGRLSMFGDMSRIGNDPAAALKNDQRGKMS 422

Query: 239 ---------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                            W+  Y+ RRYG+     + AW VL +T
Sbjct: 423 GIGLTMEGIEQNPAIYSLMLEHIWNDKPIDLDNWLKGYAQRRYGKRNSNAEKAWEVLKNT 482

Query: 266 VYNCTD--GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
           VY+     G                 +II+         G+P    A + + T+      
Sbjct: 483 VYSHQPWWGTN---------------TIIT---------GRPTFDAATVWTYTA------ 512

Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
           + YS+ E+++A    + + +EL +S+ ++YDL+D+TRQ LA YAN L  +   +Y+  D 
Sbjct: 513 IPYSSKELMKAWSYLLTASDELKSSDGFQYDLVDVTRQVLANYANVLQQDFASSYKQKDM 572

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
               + S +FLEL++D+D LL     FLLG W+ +AK L  N  ++K +E NAR  IT+W
Sbjct: 573 ATFNKKSAQFLELIDDIDQLLGTRSDFLLGKWINNAKALGDNPAEKKLFERNARDLITLW 632

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDW 496
            D    +   + +Y  K W+G+++ +Y PR   +F  +     +G           +KDW
Sbjct: 633 LD----KDCNIHEYACKEWAGMMKGFYKPRWQQFFDEVRLQASAGKEIDQIKFENTIKDW 688

Query: 497 RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
             +W+       N    Y  +  G+ +  ++ LY KY
Sbjct: 689 EWKWV-------NANEAYTDKPTGNPVTVAKALYAKY 718


>gi|255533285|ref|YP_003093657.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
 gi|255346269|gb|ACU05595.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
          Length = 749

 Score =  317 bits (813), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 193/577 (33%), Positives = 283/577 (49%), Gaps = 93/577 (16%)

Query: 2   SNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
            N+ G  GPLP+SW++    LQKKIL R  ELGM P+LPAFSG+VP   +  FP+A++ +
Sbjct: 207 GNIDGLNGPLPKSWMESHEQLQKKILARERELGMKPILPAFSGHVPPTFKARFPNARVDR 266

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
           L NW     + R+  TY+L   DPLF +I   F+ +Q K +G T H+Y  DTF+E   P 
Sbjct: 267 L-NW-----EGRFADTYVLHPDDPLFQQIADKFMAEQDKAFGNTDHLYGADTFNEMYLPY 320

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVVL 180
               Y+  +G A+Y GM   D +A+W+MQGW+F     FW+P  +K  L+ VP   L++L
Sbjct: 321 TDTAYVRKIGTAVYKGMAKADPEAIWVMQGWMFWDKRDFWKPEVVKNYLSGVPDDNLIML 380

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT----- 235
           DLFA+ +PIW+ ++ F+G  +IWCMLHNF G   +YG L+ I   P E     N      
Sbjct: 381 DLFADEQPIWTKTEAFWGKKWIWCMLHNFGGRNPLYGDLNYIGREPAEMVHDPNRGRLSG 440

Query: 236 -----------------------------TMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                          +W+  Y+ RRYG+  P  + AW +L+ TV
Sbjct: 441 IGLVPEGIEQNPVVYSLMLEHVWNDQVIDVKSWLVNYAQRRYGQRDPQTEKAWQILHQTV 500

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY--GKPVSKEAVLKSETSSYDHPHL 324
           Y                            EG Y+     +P  ++    ++ +  D P  
Sbjct: 501 Y--------------------------AKEGSYETIISARPTHEK---HADWTGTDLP-- 529

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
            Y   +++ A    + + N    ++ Y++DL+ + RQ LA YA  L       ++  +  
Sbjct: 530 -YDGDKLVPAWTYLLNASNRFKNNDCYQFDLVTVGRQVLANYATVLQRLFARDFRNKNLT 588

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
                +  FL L+ DMD L+     FLLG WL  AK+ A NE + + YE NAR  IT+W 
Sbjct: 589 AYRAHTAEFLTLIADMDKLMGTRKDFLLGKWLNDAKKWATNESESRLYEKNARDLITLW- 647

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWR 497
               ++AS L +Y NK W+GL   +YG R   +      +LE G  F       R+KDW 
Sbjct: 648 --GGKDAS-LHEYANKQWAGLFNGFYGKRWQTFIAETSTALEQGKSFDQEAFETRMKDW- 703

Query: 498 REWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
            EW     +W NGR  Y  +  G+ +  S  L+ KY+
Sbjct: 704 -EW-----NWVNGREQYTDKPQGNPVTVSIQLHKKYI 734


>gi|157134500|ref|XP_001656341.1| alpha-n-acetylglucosaminidase [Aedes aegypti]
 gi|108881379|gb|EAT45604.1| AAEL003150-PA [Aedes aegypti]
          Length = 763

 Score =  317 bits (811), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 187/566 (33%), Positives = 296/566 (52%), Gaps = 80/566 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPL  ++++    LQ +++  +  LGM   LPAF+G++P     ++P AK+T
Sbjct: 211 MGNIRGWGGPLTTNFINFSKKLQNQVIDEMRRLGMVLALPAFAGHLPVQFAQLYPEAKLT 270

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            + NW    +  ++     LD  DPLF EIG+ F+ + ++ YG ++HIY CD F+E  P 
Sbjct: 271 PVENWNGFPA--QYASPLFLDPIDPLFQEIGKRFLTKVIERYG-SNHIYFCDPFNEIQPR 327

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
             S +Y+SS  A IY  M   D  AVWL+QGW+F  +P+W    ++A L +VPLG+++VL
Sbjct: 328 SFSAKYLSSASAGIYKAMNDVDPFAVWLLQGWMFVKNPYWSDVAIRAFLQAVPLGRMLVL 387

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DL +E  P +  ++ ++G P+IWCML NF G + M G +D +     + RT+++ TM   
Sbjct: 388 DLQSEQFPQYDRTESYHGQPFIWCMLSNFGGTLGMLGSVDLVFQRIRDVRTNDSMTMIGT 447

Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          W   Y+  RYG     ++DAW++  +TVY
Sbjct: 448 GITPEGINQNYGLYEFALEMGWNPNIDNVEEWFRTYASVRYGTQDKRLKDAWSMFRYTVY 507

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           +  +                      +  GKY    +P  K            HP LWY+
Sbjct: 508 SFKEQ--------------------EMMRGKYTFNRRPSLKL-----------HPWLWYN 536

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
            +     ++L + S    S +  +R D++DLTRQ L   A+ L+LNI+EAY   + + V 
Sbjct: 537 ETLFNAGVQLLLESN---STNTLFRNDVVDLTRQFLQNTADRLYLNIMEAYNTKNPNSVK 593

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
            LS  F +L+EDMD LL     FLLG WLESAK +A+   + ++YE+NAR QIT+W    
Sbjct: 594 YLSILFQKLLEDMDRLLRTDQHFLLGRWLESAKAVAETSLERQKYEYNARNQITLWGPQG 653

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF---RLKDWRREWIKLT 504
           Q     + DY NK W+G+++D++ PR  ++   M + +E        +++D   + ++L 
Sbjct: 654 Q-----IVDYANKQWAGMVQDFFLPRWKLFLTEMTKDVEQNRTLNEGKVRDKIFKMVELP 708

Query: 505 NDWQNGRNVYPVESNGDALITSQWLY 530
               N R  YP+  +GDAL+ ++ L+
Sbjct: 709 FCTSNKR--YPIRPDGDALLVARELF 732


>gi|449675146|ref|XP_002156234.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Hydra
           magnipapillata]
          Length = 646

 Score =  317 bits (811), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 183/567 (32%), Positives = 281/567 (49%), Gaps = 69/567 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAAL-QNVFPSAKI 59
           M NL GWGGPL  SW  +QL LQ+ I+ R+   GM PVLP F G++P AL   +FP++K 
Sbjct: 89  MGNLEGWGGPLSSSWYSKQLQLQQNIISRMRSFGMIPVLPGFGGHIPKALVSRLFPTSKY 148

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
            +L  W   K   ++  T+LLD  DPLF ++G AF+E Q + Y  T H+YN D F+E  P
Sbjct: 149 YKLKPW--NKFTGKYGGTFLLDPQDPLFKKVGAAFVEMQKQLYNGTDHVYNADIFNEMDP 206

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
           P  +  +I++    +Y+ M + DSDAVWLMQGW+F     W+P  ++A L ++P GKL++
Sbjct: 207 PQLTSAFITNTSIGVYNAMLASDSDAVWLMQGWMF-LSSVWKPELVEAWLQAIPYGKLII 265

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT---- 235
           LDL +++ P++  +  FYG P+IWCM+ NF G   +YG L  +  G + AR +  +    
Sbjct: 266 LDLASDIYPLYDQTNAFYGHPFIWCMIENFGGTTRLYGQLTGVMKGVISARKTYKSFMIG 325

Query: 236 -------------------TMAWINQ----------YSVRRYGRSVPAIQDAWNVLYHTV 266
                               M W N+          Y  RRYG     + DAW +L  T+
Sbjct: 326 TGMTPEGINQNDINFELMNEMGWRNEEFNISDWTLSYIKRRYGDYPKMVSDAWLILIDTI 385

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YNC DG  +   D  +  P + P +         N   PV                H+WY
Sbjct: 386 YNCNDGRENGGYDGRI--PVMRPQL---------NAKLPV----------------HMWY 418

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
           S  ++  A +L +   + +   +T+R DL+ L  Q L   +   +  ++  Y       V
Sbjct: 419 SIKDLYNAWKLMVKGSDYMPLIDTFRNDLVRLGTQVLEDLSIVFYTQMVSGYFNKSTLNV 478

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
            +   +   L+ DMD LLA     LLG W++SA+ +     + K  E+NA+ QIT+W  N
Sbjct: 479 EKYGSKITVLLTDMDRLLATDQYSLLGRWIQSARSMGDTLNETKLLEYNAKNQITLWGPN 538

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
            +     +RDY NK W+GL+  +Y  R  ++  ++ +SL+ G  +    +  + ++    
Sbjct: 539 GE-----IRDYANKNWAGLVGSFYFERWNMFINFLSDSLKRGVPYDDSAFVSKLLQFEKK 593

Query: 507 WQNGRNVYPVESNGDALITSQWLYNKY 533
           W N    +  +  GDA   S  L   Y
Sbjct: 594 WNNEIKEFSADPTGDAFGISHQLLRAY 620


>gi|328867411|gb|EGG15793.1| alpha-N-acetylglucosaminidase [Dictyostelium fasciculatum]
          Length = 1501

 Score =  314 bits (804), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 191/587 (32%), Positives = 300/587 (51%), Gaps = 97/587 (16%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M N++GWGGPL   ++  Q  LQ++IL R+ + GM PVLP F+G+VP A  ++FP+A IT
Sbjct: 950  MGNVNGWGGPLDYDFIAGQHDLQQQILERMRQYGMKPVLPGFAGHVPRAFMSLFPTANIT 1009

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            QLG+W +         TY LD +DPLF  + + F++ Q   YG T H Y+ D F+E TPP
Sbjct: 1010 QLGDWRAFNG------TYYLDPSDPLFANVSQTFVKVQTAIYG-TDHYYSFDPFNEITPP 1062

Query: 121  VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
                 Y+ +  +++Y+ +   D  AVW++Q W F  D  FW+PPQ+KA L  VP+G L+V
Sbjct: 1063 SSDAGYLQNSSSSMYNALAYADPQAVWVLQAWFFISDAWFWQPPQVKAFLGGVPIGHLLV 1122

Query: 180  LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA---------- 229
            LD +AE  P W+ + QF G  +IWCMLHNF G   MYG +  I  GP++A          
Sbjct: 1123 LDTWAEESPAWTVTDQFNGHDWIWCMLHNFGGRTGMYGKIPRITAGPIDARKQSPGMKGT 1182

Query: 230  ----------------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                  RT+      WINQY+ RRYG  VP +  AWN L  TVY
Sbjct: 1183 GLTPEAIEQNYIMYDLMSEMSWRTTAPNMTEWINQYTQRRYGVFVPELAQAWNSLASTVY 1242

Query: 268  NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
            N  D + DKN     +F  + P +                               +++Y 
Sbjct: 1243 NAPD-SIDKNPS---SFVGIRPELNMTN---------------------------NIYYD 1271

Query: 328  TSEVIRALELFIASGNE-LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
            +S + +A +L+++  +E + +++TY +D+ ++T QAL+    E  + + +AY+       
Sbjct: 1272 SSIIQKAWQLYLSVTDEYVLSTSTYSFDIAEITIQALSNLFIETEIAMYDAYKTGKGTEF 1331

Query: 387  FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ-----LAQNEEQEK----------Q 431
             + +   L ++ DMD + +     L+G W  +A+Q     L++N+++++          Q
Sbjct: 1332 DEHAMNCLNIITDMDMIASTQQLLLVGTWTANARQWANYNLSRNKDEDRNTDKEQMTIEQ 1391

Query: 432  YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM------IESL 485
            YE+NAR QIT+W        S L DY    WSGLL D+Y  R +++ KY+        + 
Sbjct: 1392 YEFNARNQITLW----GPSNSTLHDYAYHLWSGLLNDFYLARWSLFIKYLDSSLSSSSTN 1447

Query: 486  ESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 532
            ++G GF+ +++  +   L   W      YP    G+A   S+++ N+
Sbjct: 1448 DAGTGFKNQEYINDIESLEESWNLQTYQYPTRPTGNAYQLSKFINNQ 1494


>gi|194759443|ref|XP_001961958.1| GF14678 [Drosophila ananassae]
 gi|190615655|gb|EDV31179.1| GF14678 [Drosophila ananassae]
          Length = 783

 Score =  313 bits (802), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 187/577 (32%), Positives = 298/577 (51%), Gaps = 87/577 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GW GPL   W   QL+LQ++IL     LGM+  LPAF+G+VP AL  + P+   T
Sbjct: 228 MGNIRGWAGPLKPEWRQFQLLLQQEILSAQRNLGMSVALPAFAGHVPRALSRLHPNTSFT 287

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W       R+CC   ++ T+PLF +I   F++  +  YG ++HI+ CD F+E  PP
Sbjct: 288 DVQRWNQFPD--RYCCGLFVEPTEPLFHQIATTFLQSVVTIYG-SNHIFFCDPFNELEPP 344

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           V  PEY+ S  AAI++ M + D +A+WL+QGW+F  +PFW P   +A L +VP G+++VL
Sbjct: 345 VAKPEYMRSTAAAIHNSMTAVDPEAIWLLQGWMFVKNPFWTPDMAEAFLTAVPRGRILVL 404

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DL +E  P +  +  ++G P+IWCMLHNF G + M+G    I  G   AR+  N+++   
Sbjct: 405 DLQSEQFPQYELTHSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEAARSMPNSSIVGT 464

Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                         +W   ++V RYG    ++  AW +L ++VY
Sbjct: 465 GITPEGIGQNYVVYSLTLERGWSRNSIDLDSWFRHFTVTRYGVKDESLAKAWLLLKNSVY 524

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH-PHLWY 326
           +                       +    G+Y    +P            S++H P  WY
Sbjct: 525 SFHG--------------------LQKMRGQYVVTRRP------------SFNHDPFTWY 552

Query: 327 STSEVIRALELFIASGN----ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
           + S+V+ A  L +++      E    + Y +DL+D+TRQ L   A++L++N+  +++   
Sbjct: 553 NASDVLEAWHLLLSARVIIPLEDDRYDVYEHDLVDITRQFLQITADQLYVNLKSSFRKRQ 612

Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
                 LS R L+L +D++ +L+    FLLG WLE AKQ+A + E  K +E+NAR QIT 
Sbjct: 613 LPRFEFLSTRLLQLFDDLELILSSGRNFLLGNWLEQAKQVAPHPEDRKSFEFNARNQITA 672

Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES-----GDGFRLKDWR 497
           W  N Q     + DY  K WSGL++DYY PR +++F  +  +L S     G  F+ K  +
Sbjct: 673 WGPNGQ-----ILDYACKQWSGLVKDYYKPRWSLFFDDVNVALHSQRPFNGSAFKQKVSQ 727

Query: 498 REWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
           R  I+L   + N  ++YP +   +    S  ++ +++
Sbjct: 728 R--IELP--FSNKTDIYPTDPVENVWFISHTIFERWM 760


>gi|195577611|ref|XP_002078662.1| GD22403 [Drosophila simulans]
 gi|194190671|gb|EDX04247.1| GD22403 [Drosophila simulans]
          Length = 778

 Score =  308 bits (789), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 187/569 (32%), Positives = 287/569 (50%), Gaps = 85/569 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GW GPL   W   QL+LQ++I+   + LGM+  LPAF+G+VP AL+ + P +   
Sbjct: 223 MGNIRGWAGPLTPGWRRYQLLLQQEIITAQHNLGMSVALPAFAGHVPRALKRLHPESTFM 282

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W       R+CC   ++ TD LF EI   F++  + +YG ++HI+ CD F+E  PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFVEPTDNLFKEIASRFLQNIITKYG-SNHIFFCDPFNELEPP 339

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           V  PEY+ S  AAIY  M+  D  A+WL+QGW+F  +PFW     +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESMRGIDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DL +E  P +  ++ ++G P+IWCMLHNF G + M+G    I  G  EAR   N+++   
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIDEARRLPNSSLVGT 459

Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                         +W   +S  RYG     ++ AW +L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNTSLDLDSWFTNFSHTRYGVKDERLEQAWLLLKNSVY 519

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           +                       +    G+Y           V+    S    P  WY+
Sbjct: 520 SFRG--------------------LQKMRGQY-----------VVTRRPSFNQEPFTWYN 548

Query: 328 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
            S V+ A  L + S   +   +     Y +DL+D+TRQ L   A++L++N+  AY+    
Sbjct: 549 ASAVLDAWHLLLTSRAIIPLEDDRYEIYEHDLVDITRQFLQISADQLYVNLRSAYRKRQV 608

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
                LS + L+L +DM+ +LA    FLLG WL+ AKQ A N  +++ +E+NAR QIT W
Sbjct: 609 ARFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQAKQAAPNTGEQRNFEFNARNQITAW 668

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES-----GDGFRLKDWRR 498
             + Q     + DY  K WSGL+ DYY PR  ++ + +  +L +     G  F+LK    
Sbjct: 669 GPDGQ-----ILDYACKQWSGLVSDYYRPRWRLFLEDVTVALHAGRPYNGTAFKLK--VS 721

Query: 499 EWIKLTNDWQNGRNVYPVESNGDALITSQ 527
           + I+L   + N  +VYPV   G+  + SQ
Sbjct: 722 QEIELP--FSNKADVYPVTPVGNTWLISQ 748


>gi|383856382|ref|XP_003703688.1| PREDICTED: alpha-N-acetylglucosaminidase [Megachile rotundata]
          Length = 744

 Score =  306 bits (785), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 181/521 (34%), Positives = 278/521 (53%), Gaps = 77/521 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  +GGPL  SW +Q + LQ KIL R+  LG+ PVLP+F+G+VP A   +FP+A +T
Sbjct: 194 MGNIRAFGGPLYPSWHEQSINLQHKILERMRSLGIIPVLPSFAGHVPRAFPRLFPNANVT 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L  W +      +CC YLL  TDPLF +IG+ F++  ++E+G T HIYNCDTF+EN P 
Sbjct: 254 KLAPWNNFPD--VYCCLYLLAPTDPLFQQIGQLFLKTYIEEFG-TDHIYNCDTFNENEPH 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
               +++ ++G + +  M + D DA+WLMQGWLF++D  FW  P+++A L SVP G+++V
Sbjct: 311 TSELKFLRNVGHSTFQAMNAVDPDAIWLMQGWLFTHDKLFWTEPRVEAFLTSVPRGRMIV 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDL +E  P +   K ++G P+IWCMLHNF G + M+G    I     E R  +N+TM  
Sbjct: 371 LDLQSEQFPQYGRLKSYFGQPFIWCMLHNFGGTLGMFGSAQIINQRVFEGRNMKNSTMVG 430

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W   Y+ RRYG        AW  L  TV
Sbjct: 431 TGLTPEGINQNYVIYELMNEMAYRKEPVNLNKWFENYASRRYGVWNEYAVSAWQSLGRTV 490

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YN +     + + VI   P ++ S  +                               WY
Sbjct: 491 YNFSGTRKIRGKYVISRRPSLNLSTWT-------------------------------WY 519

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
               +     +F+ + +    S  YR+D++DLTRQ L   A E++  +I+++   +    
Sbjct: 520 DRDTLYNTWSVFLQARHGRRNSTLYRHDVVDLTRQVLQAKAEEIYPVLIDSFNKKNLTAF 579

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
              S + L+L +D++ +LA    FLLG WL++AK+LA N+E+ + Y+ NA+ QI++W   
Sbjct: 580 KYHSDKLLDLFDDLELILASGKDFLLGKWLDAAKKLASNDEELRLYQVNAKYQISLWGPR 639

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
            +     +RDY NK W+G++ DY+ PR +I+    +ESLE+
Sbjct: 640 GE-----IRDYANKQWAGVVADYFKPRWSIF----LESLEN 671


>gi|194863164|ref|XP_001970307.1| GG23441 [Drosophila erecta]
 gi|190662174|gb|EDV59366.1| GG23441 [Drosophila erecta]
          Length = 778

 Score =  306 bits (785), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 185/551 (33%), Positives = 288/551 (52%), Gaps = 49/551 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GW GPL   W   QL+LQ++I+     LGM+  LPAF+G+VP AL+ + P +   
Sbjct: 223 MGNIRGWAGPLTPEWRRYQLLLQQEIIAAQRNLGMSVALPAFAGHVPRALKRLHPGSTFM 282

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W       R+CC   L+ TD LF EI   F+++ +  YG ++HI+ CD F+E  PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFLEPTDNLFNEIALIFLQKIITAYG-SNHIFFCDPFNELEPP 339

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           V  PEY+ S  AAIY  ++  D  A+WL+QGW+F  +PFW     +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESIRRLDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAWI 240
           DL +E  P +  ++ ++G P+IWCMLHNF G + M+G    I  G  EAR   N+++   
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459

Query: 241 N-------------QYSVRRYGRSVPAIQDAWNVLY-HTVYNCTDGATDKNRDVIVAFPD 286
                          +++ R   + P   D+W   + H  Y   D   ++          
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNRPLDLDSWFTSFSHARYGVKDERLEQ---------- 509

Query: 287 VDPSIISVTEGKYQNYG-KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 345
              + + +    Y  +G + +  + V+    S    P  WY+ S V+ A  L ++S   +
Sbjct: 510 ---AWLQLKNSVYSFHGLQKMRGQYVVTRRPSFKQEPFTWYNASAVLDAWHLLLSSRAII 566

Query: 346 SASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMD 401
              +     Y +DL+D+TRQ L   A++L++N+  AY+         LS + L+L +DM+
Sbjct: 567 PLEDDRYEMYEHDLVDITRQFLQISADQLYVNLRSAYKKRQVSRFEFLSSKLLKLFDDME 626

Query: 402 GLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKY 461
            +LA    FLLG WL+ AKQ A +  +++ YE+NAR QIT W  + Q     + DY  K 
Sbjct: 627 LILASSRNFLLGNWLQQAKQAAPHPGEQRNYEFNARNQITAWGPDGQ-----ILDYACKQ 681

Query: 462 WSGLLRDYYGPRAAIYFKYMIESLES-----GDGFRLKDWRREWIKLTNDWQNGRNVYPV 516
           WSGL+ DYY PR  ++ + +  +L S     G  F+LK    + I+L   + N  +VYPV
Sbjct: 682 WSGLVSDYYRPRWRLFLEDVTVALHSLRPFNGTAFKLK--VSQEIELP--FSNKVDVYPV 737

Query: 517 ESNGDALITSQ 527
              G+    SQ
Sbjct: 738 TPVGNTWFISQ 748


>gi|21356587|ref|NP_652045.1| CG13397, isoform A [Drosophila melanogaster]
 gi|442626853|ref|NP_001260251.1| CG13397, isoform B [Drosophila melanogaster]
 gi|16185856|gb|AAL13967.1| LP03571p [Drosophila melanogaster]
 gi|22945953|gb|AAF52672.2| CG13397, isoform A [Drosophila melanogaster]
 gi|440213562|gb|AGB92787.1| CG13397, isoform B [Drosophila melanogaster]
          Length = 778

 Score =  305 bits (782), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 184/568 (32%), Positives = 286/568 (50%), Gaps = 83/568 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GW GPL  +W   QL+LQ++I+     LGM+  LPAF+G+VP AL+ + P +   
Sbjct: 223 MGNIRGWAGPLTPAWRRYQLLLQQEIITAQRNLGMSVALPAFAGHVPRALKRLNPESTFM 282

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W       R+CC   ++ T+ LF EI   F+   + +YG ++HI+ CD F+E  PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFVEPTENLFKEIASRFLHNIITKYG-SNHIFFCDPFNELEPP 339

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           V  PEY+ S  AAIY  M+  D  A+WL+QGW+F  +PFW     +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESMRGIDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DL +E  P +  ++ ++G P+IWCMLHNF G + M+G    I  G  EAR   N+++   
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459

Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                         +W   +S  RYG     ++ AW +L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNTSLDLDSWFTNFSHSRYGVKDERLEQAWLLLKNSVY 519

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           +                       +    G+Y           V+    S    P  WY+
Sbjct: 520 SFRG--------------------LQKMRGQY-----------VVTRRPSFNQEPFTWYN 548

Query: 328 TSEVIRALELFIASGN----ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
            S V+ A  L +        E +    Y +DL+D+TRQ L   A++L++N+  AY+    
Sbjct: 549 ASAVLDAWHLLLTFRAIIPLEDNRYEIYEHDLVDITRQFLQISADQLYINLRSAYRKRQV 608

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
                LS + L+L +DM+ +LA    FLLG WL+ AKQ A N  Q++ +E+NAR QIT W
Sbjct: 609 SRFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQAKQAAPNTGQQRNFEFNARNQITAW 668

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
             + Q     + DY  K WSGL+ DYY PR  ++ + +  +L +G  F    ++   +K+
Sbjct: 669 GPDGQ-----ILDYACKQWSGLVSDYYRPRWRLFLEDVTVALHAGRPFNGTAFK---LKV 720

Query: 504 TND----WQNGRNVYPVESNGDALITSQ 527
           +++    + N  +VYPV   G+  + SQ
Sbjct: 721 SHEIELPFSNKDDVYPVTPVGNTWLISQ 748


>gi|198476648|ref|XP_001357424.2| GA12255 [Drosophila pseudoobscura pseudoobscura]
 gi|198137793|gb|EAL34493.2| GA12255 [Drosophila pseudoobscura pseudoobscura]
          Length = 767

 Score =  305 bits (780), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 183/571 (32%), Positives = 292/571 (51%), Gaps = 77/571 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPL   +   Q +LQ+ IL    +LG++  LPAF+G++P A++ ++P+   T
Sbjct: 212 MGNIRGWGGPLKPEYQRLQELLQQHILRAQRDLGISVALPAFAGHLPTAMRRIYPNGNYT 271

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W S   DP +CC   +D  DP+F  +   F+ + ++ YG ++HI+ CD F+E  PP
Sbjct: 272 EVERWNSFP-DP-YCCGLFVDPLDPIFDLVAALFLRRVVQRYG-SNHIFFCDPFNELQPP 328

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           V  P+Y+ S  AAI++ M+S D +AVWL+QGW+F  + FW    M+A L +VP+G+L+VL
Sbjct: 329 VAEPDYMRSTAAAIHNSMRSVDPEAVWLLQGWMFVKNIFWTDAMMEAFLTAVPIGRLIVL 388

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DL +E  P +  +  +YG P++WCMLHNF G + M+G  D +  G   AR   N+++   
Sbjct: 389 DLQSEQFPQYQRTDSYYGQPFVWCMLHNFGGTLGMFGSADLVNNGIEAARRMPNSSIVGV 448

Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                         +W   ++  RYG     +Q AW +L  +VY
Sbjct: 449 GITPEGIGQNYVMYSLVLERGWSELPLDLDSWFKHFARTRYGVDDEGLQQAWQLLRRSVY 508

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           +       K R                        G  V++   L  +      P  WY+
Sbjct: 509 SFR--GLQKMRG-----------------------GYTVTRRPALNLD------PFTWYN 537

Query: 328 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
            S+V+ A +L ++S   +   +     Y +DL+D+TRQ L   A++L++N+  AY+    
Sbjct: 538 ASDVLEAWKLLLSSRAIIPLEDDNYAIYEHDLVDITRQYLQISADQLYVNLKSAYRKRQV 597

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
                L  + L+L  D++ +LA    FLLG WL  A++ A N   +  +E+NAR QIT W
Sbjct: 598 ARFEYLGSKLLQLFGDLERILASGSNFLLGTWLADAQRAAPNAADKPNFEFNARNQITAW 657

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR-REWIK 502
             + Q     + DY  K WSGL+ DYY PR A++   +  +L S   F    ++ R   +
Sbjct: 658 GPDGQ-----ILDYACKQWSGLVLDYYRPRWALFLDDVTLALHSNRTFNSTAFKLRVSQE 712

Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
           +   + N  +VYPVE  G+    SQ +Y ++
Sbjct: 713 VELPFSNKSDVYPVEPMGNTWFISQNIYERW 743


>gi|195339231|ref|XP_002036223.1| GM12949 [Drosophila sechellia]
 gi|194130103|gb|EDW52146.1| GM12949 [Drosophila sechellia]
          Length = 778

 Score =  304 bits (779), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 185/569 (32%), Positives = 286/569 (50%), Gaps = 85/569 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GW GPL   W   QL+LQ++I+     LGM+  LPAF+G+VP AL+ + P +   
Sbjct: 223 MGNIRGWAGPLTAGWRRYQLLLQQEIITAQRNLGMSVALPAFAGHVPRALKRLHPESTFM 282

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W       R+CC   ++ T+ LF EI   F++  + +YG ++HI+ CD F+E  PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFVEPTENLFKEIASRFLQNIITKYG-SNHIFFCDPFNELEPP 339

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           V  PEY+ S  AAIY  M+  D +A+WL+QGW+F  +PFW     +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESMRGIDPEAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DL +E  P +  ++ ++G P+IWCMLHNF G + M+G    I  G  EAR   N+++   
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459

Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          W   +S  RYG     ++ AW +L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNTSLDLDGWFTNFSHTRYGVKDERLEQAWLLLKNSVY 519

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           +                       +    G+Y           V+    S    P  WY+
Sbjct: 520 SFRG--------------------LQKMRGQY-----------VVTRRPSFNQEPFTWYN 548

Query: 328 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
            S V+ A  L + S   +   +     Y +DL+D+TRQ L   A++L++N+  AY+    
Sbjct: 549 ASAVLDAWHLLLTSRAIIPLEDDRYEMYEHDLVDITRQFLQISADQLYVNLRSAYRKRQV 608

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
                LS + L+L +DM+ +LA    FLLG WL+ AKQ A N  +++ +E+NAR QIT W
Sbjct: 609 ARFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQAKQAAPNTGEQRNFEFNARNQITAW 668

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES-----GDGFRLKDWRR 498
             + Q     + DY  K WSGL+ +YY PR  ++ + +  +L +     G  F+LK    
Sbjct: 669 GPDGQ-----ILDYACKQWSGLVSNYYRPRWRLFLEDVTVALHAGRPYNGTAFKLK--VS 721

Query: 499 EWIKLTNDWQNGRNVYPVESNGDALITSQ 527
           + I+L   + N  +VYPV   G+  + SQ
Sbjct: 722 QEIELP--FSNKIDVYPVTPVGNTWLISQ 748


>gi|195155652|ref|XP_002018715.1| GL25802 [Drosophila persimilis]
 gi|194114868|gb|EDW36911.1| GL25802 [Drosophila persimilis]
          Length = 767

 Score =  303 bits (777), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 182/571 (31%), Positives = 292/571 (51%), Gaps = 77/571 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPL   +   Q +LQ+ IL    +LG++  LPAF+G++P A++ ++P+   T
Sbjct: 212 MGNIRGWGGPLKPEYQRLQELLQQHILRAQRDLGISVALPAFAGHLPTAMRRIYPNGNYT 271

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W S   DP +CC   +D  DP+F  +   F+ + ++ YG ++HI+ CD F+E  PP
Sbjct: 272 EVERWNSF-PDP-YCCGLFVDPLDPIFDLVAALFLRRVVQRYG-SNHIFFCDPFNELQPP 328

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           V  P+Y+ S  AAI++ M+S D +AVWL+QGW+F  + +W    M+A L +VP+G+L+VL
Sbjct: 329 VAEPDYMRSTAAAIHNSMRSVDPEAVWLLQGWMFVKNIYWTDAMMEAFLTAVPIGRLIVL 388

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DL +E  P +  +  +YG P++WCMLHNF G + M+G  D +  G   AR   N+++   
Sbjct: 389 DLQSEQFPQYQRTDSYYGQPFVWCMLHNFGGTLGMFGSADLVNNGIEAARRMPNSSIVGV 448

Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                         +W   ++  RYG     +Q AW +L  +VY
Sbjct: 449 GITPEGIGQNYVMYSLVLERGWSELPLDLDSWFKHFARTRYGVDDEGLQQAWQLLRRSVY 508

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           +       K R                        G  V++   L  +      P  WY+
Sbjct: 509 SFR--GLQKMRG-----------------------GYTVTRRPALNLD------PFTWYN 537

Query: 328 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
            S+V+ A +L ++S   +   +     Y +DL+D+TRQ L   A++L++N+  AY+    
Sbjct: 538 ASDVLEAWKLLLSSRAIIPLEDDKYAIYEHDLVDITRQYLQISADQLYVNLKSAYRKRQV 597

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
                L  + L+L  D++ +LA    FLLG WL  A++ A N   +  +E+NAR QIT W
Sbjct: 598 ARFEYLGSKLLQLFGDLEHILASGSNFLLGTWLADAQRAAPNAADKPNFEFNARNQITAW 657

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR-REWIK 502
             + Q     + DY  K WSGL+ DYY PR A++   +  +L S   F    ++ R   +
Sbjct: 658 GPDGQ-----ILDYACKQWSGLVLDYYRPRWALFLDDVTLALHSNRTFNSTAFKLRVSQE 712

Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
           +   + N  +VYPVE  G+    SQ +Y ++
Sbjct: 713 VELPFSNKSDVYPVEPMGNTWFISQNIYERW 743


>gi|242011515|ref|XP_002426494.1| alpha-N-acetylglucosaminidase, putative [Pediculus humanus corporis]
 gi|212510620|gb|EEB13756.1| alpha-N-acetylglucosaminidase, putative [Pediculus humanus corporis]
          Length = 1345

 Score =  303 bits (775), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 182/521 (34%), Positives = 282/521 (54%), Gaps = 79/521 (15%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M N+  +   L  +WL QQL+LQ KIL R+ ELG+ PVLP+F G VP + ++ +P AK+ 
Sbjct: 836  MGNVRNFSYGLTNNWLQQQLLLQHKILNRLRELGITPVLPSFCGIVPRSFKDSYPFAKLL 895

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            ++  W     D  +CC YLLD+ DPLF  + R F+++ + E+G T+HIYNCD F+EN P 
Sbjct: 896  EMPKWNKFSRD--YCCPYLLDSNDPLFSVVSRVFLKEYINEFG-TNHIYNCDVFNENKPA 952

Query: 121  VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRP-PQMKALLNSVPLGKLVV 179
             +S +Y+S++ + IY  M S D  A WL+QGW+F  DPFW    ++KA +N+VP G++++
Sbjct: 953  SESLDYLSTISSTIYKAMSSVDPRATWLVQGWMF-IDPFWASLKRVKAFINAVPKGRMLI 1011

Query: 180  LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
            LDL +++ P +   + ++G P+IWC LHNF G + MYG L+ +  G  + R  +N+TM  
Sbjct: 1012 LDLQSDLTPQYKRLQSYFGQPFIWCTLHNFGGQLGMYGHLNRVNLGVFKGRKFKNSTMVG 1071

Query: 239  --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                            WI +Y++RRYG     I DAW +L +T+
Sbjct: 1072 IGIAPEGIDQNYIMYDFTLDLALRTKPVDLDDWITKYALRRYGLIEKNILDAWLILKNTL 1131

Query: 267  YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
            YN                PD +  + S     Y         + +L    S   +   WY
Sbjct: 1132 YNYN--------------PDSNFRLTSSNVKMYTLVKGEHIAKNILTKFPSLRMNEFTWY 1177

Query: 327  STSEVIRALELF-IASGNELSASNT-YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
            + S ++   E F IAS N + ++++ +++DLID+TRQ +                     
Sbjct: 1178 NRSIILDIFEKFQIASSNSILSTSSLFQHDLIDVTRQTI--------------------Q 1217

Query: 385  GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
               + S  FLEL+ ++D +L     FLLG WLESAK +A N+ ++  YE+NAR QIT+W 
Sbjct: 1218 IAIENSNMFLELLNELDMILNTGKKFLLGNWLESAKNMATNKLEKDNYEFNARNQITLWG 1277

Query: 445  DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESL 485
             N +     +RDY  K W+G++ D+Y PR  ++F+ + ES+
Sbjct: 1278 SNGE-----IRDYAAKQWAGMIHDFYKPRWKLFFQALNESI 1313


>gi|170060634|ref|XP_001865888.1| alpha-N-acetyl glucosaminidase [Culex quinquefasciatus]
 gi|167879069|gb|EDS42452.1| alpha-N-acetyl glucosaminidase [Culex quinquefasciatus]
          Length = 761

 Score =  302 bits (773), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 188/571 (32%), Positives = 287/571 (50%), Gaps = 89/571 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPL +S+      LQ K++  +   GM   LPAF+G++P   + +FP AK+ 
Sbjct: 210 MGNIRGWGGPLKESFKTFASDLQAKVVQEMRRFGMILALPAFAGHLPVQFKTLFPQAKLN 269

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W    +  ++     LD  DPLF +IG  F+ + +  YG T HIY  D F+E  P 
Sbjct: 270 PVEVWNGFPA--QYASPLFLDPVDPLFQKIGSKFVAKAIARYG-TDHIYFSDPFNEIQPR 326

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            +S  Y++S  A IY  M   D  AVWL+QGW+   +PFW    +KA   +VP G+++VL
Sbjct: 327 SESARYLASAAAGIYQAMVDVDPLAVWLLQGWMLVKNPFWSDRAIKAFFTAVPNGRMLVL 386

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
           DL +E  P +  ++ +YG P+IWCML NF G + M G +D +     E R++E+ TM   
Sbjct: 387 DLQSEQFPQYVRTQSYYGQPFIWCMLSNFGGTLGMLGSVDLVFERIRETRSNESMTMIGT 446

Query: 239 -------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          W  +Y++ RYG     +QDAW++   TVY
Sbjct: 447 GITPEGINQNYGLYEFALEMGWNPDISDVDNWFTRYAMVRYGNDDKRLQDAWSIFRSTVY 506

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           +                       + +  GKY    +P  K             P +WY+
Sbjct: 507 SFKG--------------------MEMMRGKYTFNRRPSLKL-----------QPWVWYN 535

Query: 328 TSEVIRALELFIA--SGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
            +     +EL +A    NEL     +++D++DLTRQ L   A++L+L I++ Y L +A  
Sbjct: 536 ETRFDEGVELILAVNGSNEL-----FKHDVVDLTRQFLQNTADKLYLTIMDTYTLKNAAA 590

Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
               S  F EL++++D LLA +  FLLG WLESAK LA    + ++YE+NAR QIT+W  
Sbjct: 591 FKHYSNLFKELLQNIDRLLATNTHFLLGRWLESAKSLATTSLERQKYEYNARNQITLWGP 650

Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-----RLKDWRREW 500
             Q     + DY NK WSG+++D++ PR +++ + M  +L +         R K +R+  
Sbjct: 651 QGQ-----IVDYANKQWSGVVQDFFLPRWSLFLQEMELALATNGTINETKVRDKIFRKVE 705

Query: 501 IKLTNDWQNGRNVYPVESNG-DALITSQWLY 530
           +    D    R  YP E++G DAL  ++ LY
Sbjct: 706 LPFNTD----RKKYPAEASGEDALELARELY 732


>gi|348681836|gb|EGZ21652.1| hypothetical protein PHYSODRAFT_247428 [Phytophthora sojae]
          Length = 991

 Score =  301 bits (771), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 183/581 (31%), Positives = 292/581 (50%), Gaps = 78/581 (13%)

Query: 1   MSNLHG-W-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
           M NL G W  GPLPQ+++D Q  LQ +IL R+ E GM P LPAF+G+VP  L+   P+A 
Sbjct: 422 MGNLRGSWVKGPLPQAFIDNQHELQLRILQRMREFGMIPALPAFAGHVPEDLKLTLPNAN 481

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
            T+  NW +     ++CC Y+++ TDPL+ EIG+AF+E+Q   Y  TS +Y CDT+ E  
Sbjct: 482 FTRSPNWGNFTD--QYCCVYMIEPTDPLYREIGKAFLEEQRALYNYTSSLYQCDTYMEMA 539

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
           P       +     A+  GM + D +AVWLMQGW F  DP +W  P++KA L  VP  KL
Sbjct: 540 PEFTDLSELKGAARAVIDGMTAADPNAVWLMQGWPFVDDPHYWTRPRVKAYLEGVPTDKL 599

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LD ++E  PIW+    ++G  +I+ +LHNF GN  M G L ++A  PV+A+   N TM
Sbjct: 600 IILDFYSEAVPIWNKMDNYFGKNWIYSVLHNFGGNTGMRGDLPTLATAPVQAQRDGNGTM 659

Query: 238 A---------------------------------WINQYSVRRYGRSVPAIQDAWNVLYH 264
                                             W+++Y+ RRY      ++ AW+ L  
Sbjct: 660 VGVGLTMEGIFQNYVVYDLTLQMAWEDSPLDVDEWVSKYASRRYHTQNEHVERAWSYLSR 719

Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           +VYN T           +A+  V  S++ +       Y +              +    +
Sbjct: 720 SVYNRT-----------LAYGGVTKSLVCLIPHWRLLYDR--------------FQPTLI 754

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF--LNIIEAYQLND 382
            Y   +++ A +  + +G+EL   +TYR+DL+D+T+Q L+    E +  L +I + +   
Sbjct: 755 KYDPKDIVLAWKELLLAGDELRNVDTYRHDLVDVTKQFLSNKLLEQYQHLKVIYSAKSAP 814

Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN----------EEQEKQY 432
           A+ V +L++  L  +  ++ +LA ++ FLLG W+  A  LA +           + ++ Y
Sbjct: 815 ANEVCELTKTMLTTINRLEEILATNEDFLLGNWVADALNLAGDLNIGGDSVTRTKLQEYY 874

Query: 433 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 492
           E+ AR Q+T W DN  E    + DY  K W+GL++ YY PR  ++   +  +        
Sbjct: 875 EYEARNQVTRWGDNNNEA---IHDYAGKEWAGLVKSYYLPRWTMWLTEVCSAYTDRREMD 931

Query: 493 LKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
            K  ++  I     WQ     YP  + GD+   S+ +Y++Y
Sbjct: 932 EKGLKKRRIAFELKWQLSHEKYPTTTVGDSFSISKRIYSEY 972


>gi|428176410|gb|EKX45295.1| hypothetical protein GUITHDRAFT_51145, partial [Guillardia theta
           CCMP2712]
          Length = 680

 Score =  301 bits (770), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 180/583 (30%), Positives = 297/583 (50%), Gaps = 72/583 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  WGG L QSW+DQQ  LQ KIL R  ELGM PVLPAF+G VP  ++++FP AK T
Sbjct: 121 MINIKAWGGGLTQSWIDQQRDLQLKILARERELGMLPVLPAFAGGVPEGMKSLFPEAKFT 180

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           + GNW         CC  ++D TDPLF++IG+ F+E+    YG ++HIY+CDTF+EN P 
Sbjct: 181 RHGNWGGFAEQH--CCVMMVDPTDPLFLKIGKMFVEEVRAVYG-SNHIYSCDTFNENRPR 237

Query: 121 VDSP----EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLG 175
            +      +++S    A++  M++ D DAVWLMQGWLF  D  FW+  ++ A L+ VP  
Sbjct: 238 SEHGSVGLDFLSHSSRAVFESMRAADPDAVWLMQGWLFMNDARFWQKRELDAYLSGVPED 297

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVP-----YIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
           ++++LDLF +V P+W         P     ++W MLH+F GN  MYG L  I+  PV A+
Sbjct: 298 RMIILDLFTDVFPVWKRRDLQRPTPIEKRRWVWNMLHSFGGNSGMYGRLQVISKDPVVAK 357

Query: 231 TSENT--------------------------------TMAWINQYSVRRYG----RSVPA 254
               T                                 M+W+ +++ RR G    R   A
Sbjct: 358 KESQTMVGVGITTEGIEQNPVVYEMMAEMRWREQEVDVMSWVEKWADRRLGPEASRERKA 417

Query: 255 I-QDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
           + ++AW  L  TVY+C      + + ++ + P +D +   +    +    +   +EA+++
Sbjct: 418 LGEEAWRELASTVYSCPGTQMGQVKSMVESRPRLDLASGWIPNSDFMPIKRHYPEEALVR 477

Query: 314 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 373
           +          W      ++ L       +  + S++  +D+ D+TRQ L+     LF  
Sbjct: 478 A----------W------LKLLRATRGGADGYTCSSSASFDIADVTRQVLSDLFARLFQP 521

Query: 374 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 433
           +    Q   A       +  L ++ DMD ++      LLG W+E A+   +++E+E+  E
Sbjct: 522 LSSFCQTRLAGSAAVRMQTLLGIISDMDKMVGTQPRMLLGKWIEDARAWGKSKEEEEVLE 581

Query: 434 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRL 493
           +NAR  +T+W    +     + DY +K W GLL DYY  R  ++F+++ +++     F  
Sbjct: 582 FNARNLVTLWGPRGE-----IADYASKQWQGLLSDYYMSRWKLFFEHLQQAIRGTRIFSQ 636

Query: 494 KDWRREWIKLTNDWQN-GRNVYPVESNGDALITSQWLYNKYLQ 535
           + +++E +     WQ    + +P    G+A+  +  L++KY++
Sbjct: 637 QRFQQELLVFEQQWQTRTSSSFPSSPEGNAVELAWQLHDKYIK 679


>gi|158300970|ref|XP_320760.4| AGAP011750-PA [Anopheles gambiae str. PEST]
 gi|157013415|gb|EAA00039.4| AGAP011750-PA [Anopheles gambiae str. PEST]
          Length = 770

 Score =  301 bits (770), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 181/572 (31%), Positives = 292/572 (51%), Gaps = 82/572 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPL  S+      LQ +++  +  LGM   LPAF+G++P   + ++P+    
Sbjct: 213 MGNIRGWGGPLTPSFTQFAHTLQVRVVGEMRRLGMAVALPAFAGHLPVQFRTLYPNVSFA 272

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W +    P++     LD T+PLF  IG  F++  +K YG T H+Y  D F+E  P 
Sbjct: 273 NVSVWNNFP--PQYASPLFLDPTEPLFAAIGSRFLQLAIKTYG-TDHVYFSDPFNEIDPT 329

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           + S +Y+SS+  AIYS M   D DA+WL+QGW+F  +PFW    +++ L++VPLG+++VL
Sbjct: 330 LPSGKYLSSVSEAIYSTMVQVDPDAIWLLQGWMFVKNPFWSDRAIRSFLSAVPLGRMLVL 389

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DL +E  P +  +  + G P+IWCML NF G + M G + ++  G  E R +   T+   
Sbjct: 390 DLQSEQYPQYGRTASYAGQPFIWCMLSNFGGTLGMLGSVGNVFRGIRETRDNSTYTLLGT 449

Query: 238 ------------------------------AWINQYSVRRYGR-SVPAIQDAWNVLYHTV 266
                                          W ++Y+V RYG  S    Q AWN+   TV
Sbjct: 450 GITPEGINQNYALYEFALEMGWNAELDSAEQWFSEYAVARYGNDSDERAQQAWNIFLRTV 509

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y               AF  ++     +  GKY    +P SK             P  WY
Sbjct: 510 Y---------------AFEGLE-----LMRGKYTFNRRPSSK-----------IRPWTWY 538

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
                 + LEL ++   E S +   +YDL+D TRQ L   A+ L+L ++++++  D    
Sbjct: 539 DVHTFNQGLELLLSFAEEASCNQLCQYDLVDATRQCLQHTADALYLTLMDSFKKRDLTSF 598

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
              S  FL+L+ D+D LL  ++ FLLGPWLESAK  A+   +  +YE+NAR QIT+W   
Sbjct: 599 RLHSSLFLQLLSDLDVLLRTNEHFLLGPWLESAKAHAETTLERHKYEYNARIQITLWGPQ 658

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKD--WRREWI 501
            Q     + DY NK W+G+++D++ PR  ++   + ++L +    +  +++D  +R   +
Sbjct: 659 GQ-----IVDYANKQWAGMVQDFFLPRWRVFLGELDQALATNGTINDLKIRDKIFRTVEL 713

Query: 502 KLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +D ++    Y  + +GD + T++ LY ++
Sbjct: 714 PFVSDSKH----YATQPSGDTVRTARTLYERW 741


>gi|348681870|gb|EGZ21686.1| hypothetical protein PHYSODRAFT_495971 [Phytophthora sojae]
          Length = 692

 Score =  300 bits (769), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 178/577 (30%), Positives = 286/577 (49%), Gaps = 74/577 (12%)

Query: 1   MSNLHG-W-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
           M NL G W  GPLPQ+++D Q  LQ KIL R+   GM P LPAF+G+VP  L+ ++P+AK
Sbjct: 121 MGNLRGSWVEGPLPQAFIDGQYELQLKILERMRGFGMVPALPAFAGHVPEELKTLYPNAK 180

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
            T+  NW     +  +CC Y+LD  DPL+ EIG+ F+E+Q   Y  TS +Y CDT++E  
Sbjct: 181 FTRSPNWGGFSDE--FCCVYMLDPQDPLYYEIGKTFLEEQRALYDYTSSLYQCDTYNEMD 238

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
           P    P  + +   A+   M + D +AVWL+QGWLF   P +W   +++  L+ VP  K+
Sbjct: 239 PDFTDPAKLQAASRAVIDSMTAADPNAVWLIQGWLFVNSPNYWTKERVQTYLDGVPNDKM 298

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LDL++EV+P+W+    ++G  +I+C+LHNF GN  M G L ++   PV A  + + TM
Sbjct: 299 IILDLYSEVRPVWNKMDNYFGKSWIYCVLHNFGGNTGMRGDLPTLGTAPVLANRASSGTM 358

Query: 238 ---------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYH 264
                                             W+  ++ +RY       + AW  L  
Sbjct: 359 IGMGLTMEGIFQNYVVYDLTLQMAWVDAPLDMDEWVPSFAAQRYHSQDAHTERAWGFLLQ 418

Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           +VYN T G     + ++   P                    + ++  + +  +       
Sbjct: 419 SVYNRTLGYGGVTKSLVCLIPHWK-----------------LVRDGFMPTLIT------- 454

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-- 382
            Y   ++ RA +  + +G+EL A +TYR+DL+D+TRQ L+ +    +L++ + Y   +  
Sbjct: 455 -YDPMDITRAWKELLLAGSELHAVDTYRHDLVDVTRQFLSDHFMAQYLHLEDMYAGKETP 513

Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN------EEQEKQYEWNA 436
           A  +   + R L  +E +D +LA +D FLLG W+  A+ LA           +  YE+ A
Sbjct: 514 ADQLCAWTDRMLVTIEWLDEILATNDDFLLGNWVADARALADEVGAAEVTSLQDYYEYEA 573

Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
           R Q+T W DN  E    + DY  K W+GL+  YY PR  ++   + +S            
Sbjct: 574 RNQVTRWGDNNSES---IHDYAGKEWAGLVSGYYLPRWRMWLTEVCQSYTQKRDVNEAAL 630

Query: 497 RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
           ++  +    +WQ     YP  + GD L  S+ +Y ++
Sbjct: 631 KKARVDFELNWQLSHERYPTTTTGDTLAVSKRIYEEF 667


>gi|195115262|ref|XP_002002183.1| GI17241 [Drosophila mojavensis]
 gi|193912758|gb|EDW11625.1| GI17241 [Drosophila mojavensis]
          Length = 773

 Score =  294 bits (752), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 183/574 (31%), Positives = 290/574 (50%), Gaps = 83/574 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  WGGPLP +    Q +LQ++IL    ELGM+  LPAFSG VP A++ VFP+A  T
Sbjct: 212 MGNLRSWGGPLPPAHRQLQQLLQQRILAAQRELGMSVALPAFSGYVPTAMRRVFPNASFT 271

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q   W     DP +CC   ++  DPLF ++G  F+ + ++ YG ++HIY  D F+E  P 
Sbjct: 272 QSDRWNHF-PDP-YCCVLFVEPQDPLFQQVGAMFLRRVIQVYG-SNHIYFSDPFNEMMPR 328

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           V  P Y+     AIY+ MQ  D+DAVWL+QGW+F    +W    ++A L +VP G+++ L
Sbjct: 329 VREPNYVRYTAKAIYNSMQVVDADAVWLIQGWMFLKSVYWTNDLIEAYLTAVPRGRILAL 388

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
           DL +E  P +  +  +YG P++WCML+NF GN+ ++G    I  G + AR+  N +M   
Sbjct: 389 DLQSEQFPQYERTHSYYGQPFVWCMLNNFGGNLGLFGSAQLIPSGIIAARSMPNGSMVGV 448

Query: 239 -------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          W   +++ RYG +   +   W +L  +VY
Sbjct: 449 GITPEGIGQNYALFALTLEQAWSPDELQLEDWFEYFALTRYGVNDTRLSQVWQLLRESVY 508

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           +                           +G+ +  GK       L    S + +P +WY+
Sbjct: 509 SF--------------------------QGRERMRGK-----YTLNKRPSLHHYPWVWYN 537

Query: 328 TSEVIRALELFIASGNELSASNT----YRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
            + V  A  L + +   +  ++     Y +DL+D+TRQ L    +  ++N+  A +    
Sbjct: 538 VTMVYEAWRLMLEAKETVPLNDNRRAIYEHDLVDITRQCLQLSFDRFYVNLKSACRHKQL 597

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
           + V  L+ + LEL  DM+ +LA  + +LLG WLE+AK+LA +EEQ   YE+NAR Q+T W
Sbjct: 598 NRVEYLAGKLLELFADMERILASGEHYLLGNWLEAAKRLAPSEEQRPIYEFNARNQLTSW 657

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
             N Q     + DY  K WSGL+ DY+ PR  ++ + +I++L++   F   ++++   ++
Sbjct: 658 GPNYQ-----IPDYATKQWSGLMSDYFQPRWNMFLEAVIQALKTQTPFNYSEFKQ---RV 709

Query: 504 TND----WQNGRNVYPVESNGDALITSQWLYNKY 533
            N+    + N    YP    G     S  +Y K+
Sbjct: 710 ENEIELPFSNHTKAYPTSPVGSTWNISHDIYEKW 743


>gi|198433857|ref|XP_002122480.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 880

 Score =  293 bits (750), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 174/455 (38%), Positives = 243/455 (53%), Gaps = 70/455 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPLP  W+  QL+LQ +IL+R+  LGM PVLP F+G++P+A+ N++P A + 
Sbjct: 211 MGNLHGWGGPLPSFWIKSQLILQHQILIRMRSLGMIPVLPGFAGHIPSAILNLYPKADVI 270

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QL +W        + CTYLL   DPLF  IG  FI++Q+ EY  T+HIYN DTF+E TPP
Sbjct: 271 QLSHWSHFNCT--YSCTYLLQPHDPLFNTIGSMFIKEQMLEYNGTNHIYNADTFNEMTPP 328

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+S+   A+Y  M   D DAVWLMQGWLF ++P FW+  Q KALL  VP GK++V
Sbjct: 329 SSDPGYLSNASRAVYDAMAVADPDAVWLMQGWLFHHEPTFWKTAQKKALLTGVPKGKMLV 388

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLF+E  P +     ++G P++WCMLH+F GN+  YG ++++   P  A TS N+TM  
Sbjct: 389 LDLFSESYPQY-LPDWYFGQPFLWCMLHDFGGNMGFYGKINTVNTQPGIALTSVNSTMVG 447

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W+ +Y++RRY  S P     WN+L +T+
Sbjct: 448 TGVTPEGINQNYMIYDFMLETGFTVHSVNVTNWLKEYTMRRYNTSSPEAIKTWNILGNTI 507

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL-- 324
           YN          D    FP    S+I          G PV +  +        D+P L  
Sbjct: 508 YN----------DTKPGFP--SKSLIR---------GSPVKRPTL--------DNPGLPY 538

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY  S +  A + F  S N L    T RYD +D+TRQ L      L+  ++E +      
Sbjct: 539 WYQYSSLALAWDNFSQSLNTLKDLETVRYDAVDITRQMLQAVHRLLYYAMVEEFLWKRDP 598

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 419
           G  +L  + L+L++D D +L     F +G W++ A
Sbjct: 599 G--KLGEQLLDLLDDFDKMLCSDAHFSMGKWIQDA 631



 Score = 88.6 bits (218), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 58/197 (29%), Positives = 103/197 (52%), Gaps = 8/197 (4%)

Query: 338 FIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELV 397
           F  S N L    T RYD +D+TRQ L      L+  ++E +      G  +L  + L+L+
Sbjct: 684 FSQSLNTLKDLETVRYDAVDITRQMLQAVHRLLYYAMVEEFLWKRDPG--KLGEQLLDLL 741

Query: 398 EDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDY 457
           +D D +L     F +G W++ AK L    E++  YE+NAR Q+T+W  N +     + DY
Sbjct: 742 DDFDKMLCSDAHFSMGKWIQDAKILGTTAEEKDLYEYNARIQVTLWGPNGE-----ILDY 796

Query: 458 GNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE-WIKLTNDWQNGRNVYPV 516
            +K+W  L++ YY PR A++  Y+  +  +   F  K +  + +  +   +   R+V+P 
Sbjct: 797 ASKHWCSLVKHYYRPRWALFVSYLNHAYATKSKFDHKAFASDVFTNVEEPFTKDRSVFPS 856

Query: 517 ESNGDALITSQWLYNKY 533
            + G+A+  ++ +Y K+
Sbjct: 857 TATGNAIELAKDMYIKW 873


>gi|195473052|ref|XP_002088810.1| GE10991 [Drosophila yakuba]
 gi|194174911|gb|EDW88522.1| GE10991 [Drosophila yakuba]
          Length = 778

 Score =  292 bits (748), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 182/550 (33%), Positives = 288/550 (52%), Gaps = 47/550 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GW GPL   W   QL+LQ++I+     LGM+  LPAF+G+VP AL+ + P +   
Sbjct: 223 MGNIRGWAGPLTPQWRRYQLLLQQEIIAAQRNLGMSVALPAFAGHVPRALKRLNPDSTFM 282

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W       ++CC   ++  + LF EI   F+++ +  YG ++HI+ CD F+E  PP
Sbjct: 283 EVQRWNQFPD--QYCCGLFVEPKENLFNEIALNFLQKIITIYG-SNHIFFCDPFNELEPP 339

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           V  PEY+ S  AAIY  M+  D  A+WL+QGW+F  +PFW     +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTSAAIYESMRRIDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAWI 240
           DL +E  P +  ++ ++G P+IWCMLHNF G + M+G    I  G  EAR   N+++   
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459

Query: 241 N-------------QYSVRRYGRSVPAIQDAWNVLY-HTVYNCTDGATDKNRDVIVAFPD 286
                          +++ R   + P   D+W   + HT Y   D   ++      A+  
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNKPLDLDSWFTNFSHTRYGVKDERLEQ------AWLQ 513

Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELS 346
           +  S+ S   G  +  G     + V+    S    P  WY  S V+ A  L ++S   + 
Sbjct: 514 LKNSVYSF-RGLQKMRG-----QYVVTRRPSFNQEPFTWYDASAVLDAWHLLLSSRAIIP 567

Query: 347 ASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDG 402
             +     Y +DL+D+TRQ L   A++L++N+  A++         LS + L+L +DM+ 
Sbjct: 568 LEDDRYEMYEHDLVDITRQFLQISADQLYVNLRSAFRKRQVTRFEYLSTKLLKLFDDMEL 627

Query: 403 LLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYW 462
           +LA    FLLG WL+ AK+ A +  ++  +E+NAR QIT W  + Q     + DY  K W
Sbjct: 628 ILASSRNFLLGNWLQQAKRAAPSPGEQTNFEFNARNQITAWGPDGQ-----ILDYACKQW 682

Query: 463 SGLLRDYYGPRAAIYFKYMIESLES-----GDGFRLKDWRREWIKLTNDWQNGRNVYPVE 517
           SGL+ DYY PR  ++ + +  +L S     G  F+LK    + I+L   + +  +VYPV 
Sbjct: 683 SGLVSDYYRPRWRLFLEDVTVALHSRRPFNGTAFKLK--VSQEIELP--FSHKVDVYPVT 738

Query: 518 SNGDALITSQ 527
             G+  + SQ
Sbjct: 739 PVGNTWLISQ 748


>gi|440800773|gb|ELR21808.1| AlphaN-acetylglucosaminidase (NAGLU) [Acanthamoeba castellanii str.
           Neff]
          Length = 800

 Score =  291 bits (744), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 190/592 (32%), Positives = 287/592 (48%), Gaps = 105/592 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  WGGPL +SW + Q  LQKKI+  ++         +      A L+ V+P A IT
Sbjct: 199 MGNVQSWGGPLTKSWREGQAELQKKIVQGVWN---EERAVSVRWARAAGLKRVYPHANIT 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
               W       R    +LLD  DP+F +IG AFI+ Q + YG T HIYN DTF+E  PP
Sbjct: 256 LSPTWAHFTDPYR---VWLLDPFDPIFQKIGTAFIDAQTRVYG-TDHIYNADTFNELDPP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
              P Y+++   A+Y GM + D  A+WLMQGWLF    +W   ++KA L+ V    +++L
Sbjct: 312 SADPTYLAAASNAVYQGMAAADPKALWLMQGWLFR-SVWWSNDRIKAYLSGVKNDNMLIL 370

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
           DL+AEV PIWS ++ ++G P++WCMLH+F GN ++YG L  IA  PV+ART+  +TM   
Sbjct: 371 DLYAEVDPIWSKTESYFGKPFVWCMLHDFGGNRDLYGNLTHIATAPVDARTAPGSTMVGT 430

Query: 239 -------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          W++ Y   RYG   P+ + AW +L+ + Y
Sbjct: 431 GLTMEAIEQNPVIYELMSEMGWRSAHVDVDDWLDHYVSFRYGADSPSAKKAWRLLHQSAY 490

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
                            P +  SI +    ++                 S   H    YS
Sbjct: 491 QN---------------PVIMRSIYTFVPNRH----------------VSRNHH----YS 515

Query: 328 TSEVIRALELFIASGNEL----SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND- 382
              ++ A  L + S  EL      +  + YDL+D+TRQ L    + LF    +AY L D 
Sbjct: 516 PDVLVEAWGLLLQSRLELPNPAQPNGPWEYDLVDVTRQVL----DNLFH---DAYGLLDG 568

Query: 383 AHGVFQLSRR------------FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
           A+  +  +RR             ++++ D+D +LA +  +LLG W E A+  A NEE+++
Sbjct: 569 AYDAYVATRRDPFNQVKTIGAALIQILSDIDTVLATNQNYLLGVWTERARSWATNEEEKR 628

Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
            YE+NAR QIT+W  N +     + DY +K W+GL+  YY PR  I+  Y+ +S+  G  
Sbjct: 629 LYEFNARNQITLWGPNGE-----INDYASKEWAGLVGTYYRPRWQIFVAYLFDSIAKGTV 683

Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFDH 542
                +  + +     W N  N +P ++ G+    SQ LY +Y+    +  H
Sbjct: 684 IDPNKYAADLLLWEQRWNNQTNAFPSQATGNVAEVSQALYARYVSAAELKQH 735


>gi|195398029|ref|XP_002057627.1| GJ18000 [Drosophila virilis]
 gi|194141281|gb|EDW57700.1| GJ18000 [Drosophila virilis]
          Length = 766

 Score =  289 bits (740), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 178/574 (31%), Positives = 294/574 (51%), Gaps = 83/574 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GW GPLP +    Q +LQ+ I+    ELGM+  LPAF+G+VP A++ VFP+A  T
Sbjct: 214 MGNIRGWAGPLPPAHRRLQQLLQQLIVRAQRELGMSVALPAFAGHVPTAMRRVFPNANYT 273

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
               W +     ++CC   ++  DPLF ++G  F+ + ++ YG ++HIY  D F+E  PP
Sbjct: 274 PAERWNNFPD--QYCCDLFVEPHDPLFQQLGAMFLRRVIQVYG-SNHIYFSDPFNEMQPP 330

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P Y+ S   AIY+ M+  D +AVWL+QGW+F  D FW    ++A L +VP G+++VL
Sbjct: 331 LAEPGYMRSTAKAIYNSMREVDGNAVWLLQGWMFLKDIFWTDELIEAFLTAVPRGRILVL 390

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
           DL +E  P +  +  +YG P++WCML+NF G + ++G    I  G   AR   N+++   
Sbjct: 391 DLQSEQFPQYQRTHSYYGQPFVWCMLNNFGGTLGLFGSAQFIGSGIASARIMPNSSLVGV 450

Query: 239 -------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          W + +++ RYG +   +  AW +L   VY
Sbjct: 451 GITPEGIGQNYAIFALTLEQGWSASELQLGDWFDHFALTRYGVNDTRLAQAWQLLRGGVY 510

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           +                       +    GKY            L        +P  WY+
Sbjct: 511 SFHG--------------------LQRMRGKY-----------ALNRRPGLNLNPWTWYN 539

Query: 328 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
            S V  A +L +AS   +  ++     Y +DL+D+TRQ L +  +++++N+  AY+    
Sbjct: 540 GSSVTDAWQLLLASREMVPLTDDRYAIYEHDLVDITRQFLQQSFDQIYVNLRSAYRKEQL 599

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
           + +  L+ + LEL++DM+ +LA    +LLG WLE+AK+LA +++    YE+NAR Q+T W
Sbjct: 600 NRLEYLAGKLLELLDDMERILASGVHYLLGTWLEAAKKLAPSDKLRPLYEFNARNQLTSW 659

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
             N Q     + DY  K WSGL+ DYY PR A++   +  ++++   F   D+++   ++
Sbjct: 660 GPNGQ-----ILDYATKQWSGLMCDYYQPRWAMFLDAVTRAMQTHRPFNATDFKQ---RV 711

Query: 504 TND----WQNGRNVYPVESNGDALITSQWLYNKY 533
            N+    + N   +YP +  G+  + S  +Y K+
Sbjct: 712 ANEIELPFSNLTKMYPTKPMGNTWLISNDIYIKW 745


>gi|330791218|ref|XP_003283691.1| hypothetical protein DICPUDRAFT_26247 [Dictyostelium purpureum]
 gi|325086434|gb|EGC39824.1| hypothetical protein DICPUDRAFT_26247 [Dictyostelium purpureum]
          Length = 712

 Score =  288 bits (736), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 174/574 (30%), Positives = 284/574 (49%), Gaps = 83/574 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++ WGGP+   WL++Q  LQ +IL R+   GM PVLP F+G++P A+Q +FP+A ++
Sbjct: 170 MGNVNNWGGPITMDWLEKQRDLQIQILTRMRAYGMKPVLPGFAGHIPGAIQTLFPTANVS 229

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            L  W           T+ LD +DPLF +I + FI + +  +G T H YN D F+E  PP
Sbjct: 230 ILSTWCEFNG------TFYLDPSDPLFGKITQLFITELIGVFG-TDHYYNFDPFNELAPP 282

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW-LFSYDPFWRPPQMKALLNSVPLGKLVV 179
                ++      +Y+ M + D  AVW++QGW +  Y  FW+  Q +A  + VP+G  +V
Sbjct: 283 SSDLGFLKQTSQQMYNNMLAADPKAVWVLQGWFIVDYPEFWQANQTQAWFSGVPIGGFIV 342

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDL+++V P W+ ++ FYG  ++WCMLHNF G   MYG +  IA  P+ AR+  +  M  
Sbjct: 343 LDLWSDVAPAWNITEYFYGHYWLWCMLHNFGGRSGMYGRIPFIATNPIIARSLSDNMMGT 402

Query: 239 -------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          WI QY+ RRYG+ +P + + W  +  TV+
Sbjct: 403 GLTPEAIEQNVVVYDLMSEMAWRSTAPDLEEWITQYTNRRYGKIMPEVVEVWMSMVDTVF 462

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           N T     +N     +F  + PSI         N+G                   +++Y 
Sbjct: 463 NATAYWARRNMGAPESFIALRPSI---------NFGD------------------NVFYD 495

Query: 328 TSEVIRALELF-IASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
            S +  A  +F + + + + ++ T+++D+ ++T QAL+ +  + + N+I++Y ++D    
Sbjct: 496 PSVMFNAWHVFSLVNDSYVISTETFQFDISEITMQALSNFFMDTYFNLIKSYNVSDIESF 555

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA--QNEEQEKQ---------YEWN 435
            + S   +E +  MD + +      LG W   A+  A   NE    Q         YE+N
Sbjct: 556 QRESITMMETISFMDLIASTQPELQLGVWTYRARLWAYPDNETPSLQNSSNSATLPYEFN 615

Query: 436 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD 495
           AR Q+T+W        S+L DY  K W GL+ D+YGPR  ++ K +++SLE+   F   +
Sbjct: 616 ARNQLTLW----GPSDSVLHDYAFKLWGGLISDFYGPRWNLFLKTLLQSLENRIPFDANN 671

Query: 496 WRREWIKLTNDWQNGRNVYPVESNGDALITSQWL 529
           +      L   W     +YP+   G    TS+++
Sbjct: 672 FISNVQALEQQWVLESTIYPILPFGQGYNTSRYI 705


>gi|195050088|ref|XP_001992825.1| GH13491 [Drosophila grimshawi]
 gi|193899884|gb|EDV98750.1| GH13491 [Drosophila grimshawi]
          Length = 771

 Score =  288 bits (736), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 183/574 (31%), Positives = 299/574 (52%), Gaps = 83/574 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPLP +    Q +LQ++I+    +LGM+  LPAF+G+VP  L  +FP+A  T
Sbjct: 213 MGNIRGWGGPLPPAHRRLQQLLQQRIVQAQRDLGMSVALPAFAGHVPTGLPRIFPTANFT 272

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     DP +CC   ++ +DPLF  +G  F+ + ++ YG ++HIY  D F+E  P 
Sbjct: 273 SVERWNQFP-DP-YCCALFIEPSDPLFQLVGAQFLRRVIQIYG-SNHIYFSDPFNEMQPR 329

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P YISS   AIY+ M+  D D VWL+QGW+F  + +W    ++A L +VP G+++VL
Sbjct: 330 IAEPGYISSTARAIYNSMRMVDKDPVWLLQGWMFLDNAYWSDELIEAFLTAVPRGRMLVL 389

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
           DL +E  P +  +  +YG P++WCML+NF G + M+G    I  G + AR+  N++M   
Sbjct: 390 DLQSEQFPQYQRTFSYYGQPFVWCMLNNFGGTLGMFGSAHLINAGIMAARSMPNSSMVGV 449

Query: 239 -------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          W +Q+++ RYG +   +  AW +L  +VY
Sbjct: 450 GITPEGIGQNYALFALTLEQGWSGSKLELSDWFDQFTLTRYGVNDTDLILAWQLLRGSVY 509

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           +                       +    GKY            L    S    P +WY+
Sbjct: 510 HFHG--------------------LQRMRGKY-----------ALNKRPSFNLKPWIWYN 538

Query: 328 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
            S V+ A +L +A+   +   +     Y++DL+D+TRQ L +  +++++N+  AY+ +  
Sbjct: 539 ASSVVEAWQLLLAANQTIPVEDDRYALYKHDLVDITRQFLQQSFDQVYVNLKSAYRKSQL 598

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
                L+ + LEL+ DM+ +LA  + +LLG WLE+AK+LA + +Q   YE+NAR Q+T W
Sbjct: 599 ARFEYLAAKLLELLADMERILASGEHYLLGNWLEAAKELAPSADQRHIYEFNARNQLTAW 658

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
             + Q     + DY  K WSGL++DYY PR +++   +  ++ S   F    +R+   ++
Sbjct: 659 GPSNQ-----ILDYATKQWSGLMQDYYTPRWSMFLDAVTLAMHSKRPFNATAFRQ---RV 710

Query: 504 TND----WQNGRNVYPVESNGDALITSQWLYNKY 533
            N+    + N   VYP E  G   + SQ +++K+
Sbjct: 711 ANEIELPFSNLTKVYPTEPVGSTWLISQEIHDKW 744


>gi|301107007|ref|XP_002902586.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
           infestans T30-4]
 gi|262098460|gb|EEY56512.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
           infestans T30-4]
          Length = 736

 Score =  287 bits (734), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 183/572 (31%), Positives = 281/572 (49%), Gaps = 87/572 (15%)

Query: 1   MSNLHG-W-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
           M NL G W  GPLPQ+++D Q  LQ +IL R+ E GM P LPAF+G+VP  L+   P+A 
Sbjct: 196 MGNLRGSWVKGPLPQAFIDNQHELQLRILERMREFGMIPALPAFAGHVPEELKLRLPNAH 255

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
            TQ  NW +   +   CC ++++ TD L+ EIG+ F+++Q + Y  TS +Y CDT+ E  
Sbjct: 256 FTQSPNWGNFSEEH--CCVFMIEPTDALYREIGKNFLKEQRELYNYTSSLYQCDTYMEMA 313

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
           P       +     A+  GM + D +AVWLMQGW F  DP FW  P++KA L+ VP  KL
Sbjct: 314 PEFTDLTELEGAARAVIDGMTAADPNAVWLMQGWPFVDDPHFWTKPRVKAYLDGVPTDKL 373

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LD ++E  PIWS    ++G  +I+ +LHNF GN  M G L ++A  PV A  + N TM
Sbjct: 374 IILDFYSESVPIWSKMDNYFGKSWIYSVLHNFGGNTGMRGDLLTLATAPVLANWAGNGTM 433

Query: 238 A---------------------------------WINQYSVRRYGRSVPAIQDAWNVLYH 264
                                             WI QY+ +RY      ++ AW+ L  
Sbjct: 434 VGVGLTMEGIFQNYIVYDLTLQMAWVDNPLDVNTWIPQYAAQRYHTHNEHVEQAWSYLLR 493

Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           +VYN T           +A+  V  S++ +       Y +   +  ++K           
Sbjct: 494 SVYNRT-----------LAYGGVTKSLVCLIPHWRLLYDR--FQPTLIK----------- 529

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA- 383
            Y  ++V+ A +  + + NEL   +TYR+DL+D+T+Q L+    E ++++   Y    A 
Sbjct: 530 -YDPNDVVLAWKELLLAENELRDVDTYRHDLVDVTKQFLSNKLLEQYIHLKGIYNAKKAS 588

Query: 384 -HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
            + V  L++  L  +E ++ +LA ++ FLLG W+                   AR Q+T 
Sbjct: 589 PNEVCGLTKTMLTTMERLEEILATNEDFLLGNWI-------------------ARNQVTR 629

Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
           W DN  E    + DY  K W+GL++ YY PR  ++   +  +         K  + + I 
Sbjct: 630 WGDNNNEA---IHDYAGKEWAGLVKGYYIPRWTMWLSEVCNAYTDKREMNEKALKEKRIA 686

Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
               WQ G   YP  + GDA   S+  YN+Y+
Sbjct: 687 FELKWQLGHESYPTTTVGDAFTISKRFYNEYI 718


>gi|291515668|emb|CBK64878.1| Alpha-N-acetylglucosaminidase (NAGLU) [Alistipes shahii WAL 8301]
          Length = 713

 Score =  286 bits (733), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 187/566 (33%), Positives = 276/566 (48%), Gaps = 72/566 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W GPLP+ W+D QL LQ++IL R  ELGM PVLPAF+G+VP  L+ + P A+IT
Sbjct: 179 MSNIDRWQGPLPEEWIDGQLALQQRILARERELGMKPVLPAFAGHVPQELKRLHPDARIT 238

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W     D R+ C++ LD  DPLF  I R F+ +Q + +G T HIY  D F+E   P
Sbjct: 239 RVSYWGGF--DDRYRCSF-LDPMDPLFAVIQREFLTEQTRLFG-TGHIYGADPFNEIDAP 294

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PE ++ +   IY  M   D +AVWL  GWLF  DP  W    ++A L +VP  +L++
Sbjct: 295 TWDPETLAGMSRHIYESMAEVDPEAVWLQMGWLFYADPTHWTAENIRAFLGAVPQDRLLM 354

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG--------ILDSIAFGPVEAR- 230
           LD F E   IW  +++F+G PY+WC L NF GN  + G        + D+ A G    R 
Sbjct: 355 LDYFCEFTEIWKQTEKFHGQPYLWCYLGNFGGNTMLSGNFHTVSARMEDAFAHGGDNLRG 414

Query: 231 --------------------TSENTTMA---WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                + NT +A   WI + + RR G   PA +  W  L  +VY
Sbjct: 415 VGSTLEGFGVNQFMYEFVLDKAWNTGIADDEWIARLADRRTGFRDPAARTGWRTLCDSVY 474

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
             T  A      +  A P +        EG +    KP          T+ Y  P LW  
Sbjct: 475 --TLPAQTGQSPLTNAHPAL--------EGNWHWTTKP----------TTGYRFPTLW-- 512

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
                R  E  +A  +E    +TYR+D++++ RQ L  Y          AY  +D   + 
Sbjct: 513 -----RVWEELLAVDSE---RDTYRFDVVNIGRQVLGDYFLIERDRFAAAYAQHDRKAMD 564

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
             +RR   L+ D++ L ACH  F L  W+ +A+    +   +  YE NAR  I++W D+ 
Sbjct: 565 AAARRMTGLLADINLLTACHPEFSLERWIAAARGFGSDNASKDYYETNARMLISVWGDSY 624

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
                 L DY ++ WSG++  YY PR  ++ + ++E+  +G  F  + + RE       W
Sbjct: 625 H-----LTDYASRTWSGMISTYYAPRWRLFIERVMEAARTGRMFDHEAFDREIRDFECRW 679

Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
            +  +       GDA+ T++ L +KY
Sbjct: 680 ADASHPLTFPEAGDAVRTARELASKY 705


>gi|288927792|ref|ZP_06421639.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
           (NAG) [Prevotella sp. oral taxon 317 str. F0108]
 gi|288330626|gb|EFC69210.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
           (NAG) [Prevotella sp. oral taxon 317 str. F0108]
          Length = 734

 Score =  286 bits (732), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 182/570 (31%), Positives = 278/570 (48%), Gaps = 78/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W GPLP SW+  Q  LQKKIL R  +LGM PVLPAF+G+VP  L+  +P AKIT
Sbjct: 198 MSNIDHWMGPLPMSWIKNQEKLQKKILRRTRDLGMKPVLPAFAGHVPEILKEKYPKAKIT 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            L  W   +   R    + LD  D LF +I + +I++Q K YG T HIY  D F+E  PP
Sbjct: 258 PLSIWGDFEDQYR---CHFLDPFDSLFTDIQKTYIDEQTKLYG-TDHIYGVDPFNELAPP 313

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              PEY+++  A IY  +++ DS AVWL   W+FSY    W   ++K+ + +VP  K ++
Sbjct: 314 SWEPEYLANASAKIYDVLKNADSKAVWLQMTWMFSYQRKDWTDERIKSYITAVPDKKQIL 373

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSI------AFGPVEARTSE 233
           LD +AE   +W  S+ +Y  P+IWC L NF GN  + G +  +      AF   E+    
Sbjct: 374 LDYYAERTEVWKFSESYYKQPFIWCYLGNFGGNTMIAGNIAEVDRRLNEAFANAESMVGV 433

Query: 234 NTTMA--------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
            +T+                           W  Q++ RR G +    + AW +L   +Y
Sbjct: 434 GSTLEGFDVNPIMYDFVFEKVWHKDGISLHDWTVQWAQRRVGTTDENAEKAWKLLIDKIY 493

Query: 268 N----CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
                CT+G     R          PS+        +N+ K                   
Sbjct: 494 VQYSLCTEGTLTNAR----------PSLTGHGNWTTKNWTK------------------- 524

Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
             Y+  +++ A  L + S  +      Y+YD++++ RQ L  Y   L     +AY+  D 
Sbjct: 525 --YNNRDLLEAWGLLLRS--KAITKIAYKYDIVNIGRQVLGNYFTVLRDEFTQAYERKDI 580

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
             +       L L+ D++ LL     FLLGPWL +A+ + +N E+ + YE NAR  IT W
Sbjct: 581 SALTIKGNEMLSLLNDLEALLYTSPSFLLGPWLTNAQNMGRNMEESRYYEKNARNIITNW 640

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
             +TQ  A  L DYGN+ W+GLL+ YY PR  ++ + +I +++    F  + + ++    
Sbjct: 641 --STQGVA--LNDYGNRTWAGLLQGYYTPRWKMFIEEVISAVKQNKEFNNETFFKKVTDE 696

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              W +    YP+++ GD+ + +   Y+KY
Sbjct: 697 EWQWISKTENYPIQATGDSYLLANKFYHKY 726


>gi|423292430|ref|ZP_17271008.1| hypothetical protein HMPREF1069_06051 [Bacteroides ovatus
           CL02T12C04]
 gi|423294620|ref|ZP_17272747.1| hypothetical protein HMPREF1070_01412 [Bacteroides ovatus
           CL03T12C18]
 gi|392661665|gb|EIY55241.1| hypothetical protein HMPREF1069_06051 [Bacteroides ovatus
           CL02T12C04]
 gi|392675811|gb|EIY69252.1| hypothetical protein HMPREF1070_01412 [Bacteroides ovatus
           CL03T12C18]
          Length = 727

 Score =  284 bits (727), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 173/566 (30%), Positives = 263/566 (46%), Gaps = 70/566 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSNL  W GPLP+ WLD Q  LQK+I+ R  +  M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 186 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 245

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++ +W   +   R   ++ LD  DPLF  I + F+E+Q K +G T HIY  D F+E  PP
Sbjct: 246 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 301

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PE++++    IY  M   D DA WL   WLF  D   W   +++A L +VP  KL++
Sbjct: 302 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQNKLLL 361

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA--FGPVEARTSENTT- 236
           LD + E   +W  + +++G PY+WC L NF GN  + G    +      V     EN + 
Sbjct: 362 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 421

Query: 237 -----------------------------MAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          WI Q + RR G     ++ AW +LY ++Y
Sbjct: 422 LGSTLEGFDVNPFMYEYVFSKAWDCNLPDSVWIEQLADRRIGLRNQQMRRAWKLLYDSIY 481

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
                                P+ +          G  ++    LK   +    P + YS
Sbjct: 482 TA-------------------PAALG--------QGTLMNARPCLKGNGNWTTTPTVAYS 514

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
              +    E+ + +G    +  TY YD++++ RQ L  Y  +L     E Y       + 
Sbjct: 515 NETLFEVWEMLLKAGEHRHS--TYEYDVVNIGRQVLGNYFGKLRDEFAETYSRKQLPLLK 572

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
           Q      +L+ D++ LL+    FLLG W+E A+ L  +E  +  YE NART ++ W D  
Sbjct: 573 QKGAEMKQLLRDVNTLLSTQSSFLLGKWIEDARSLGIDEASKNYYEENARTIVSTWGDKD 632

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
           Q     L DY N+ W GL+  YY PR  ++   +I S+ +   F    + +   +   DW
Sbjct: 633 QS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIRSVSNKQPFNADAFHQRVTQFEIDW 688

Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
                 YP E  G+A+  +  L NKY
Sbjct: 689 VKSHERYPSEPVGNAVEIATLLMNKY 714


>gi|295086519|emb|CBK68042.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
           XB1A]
          Length = 727

 Score =  284 bits (727), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 173/566 (30%), Positives = 263/566 (46%), Gaps = 70/566 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSNL  W GPLP+ WLD Q  LQK+I+ R  +  M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 186 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 245

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++ +W   +   R   ++ LD  DPLF  I + F+E+Q K +G T HIY  D F+E  PP
Sbjct: 246 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 301

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              PE++++    IY  M   D DA WL   WLF  D   W   +++A L +VP  KL++
Sbjct: 302 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQDKLLL 361

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA--FGPVEARTSENTT- 236
           LD + E   +W  + +++G PY+WC L NF GN  + G    +      V     EN + 
Sbjct: 362 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 421

Query: 237 -----------------------------MAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          WI Q + RR G     ++ AW +LY ++Y
Sbjct: 422 LGSTLEGFDVNPFMYEYVFSKAWDCNLPDSVWIEQLADRRIGLRNQQMRRAWKLLYDSIY 481

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
                                P+ +          G  ++    LK   +    P + YS
Sbjct: 482 TV-------------------PAALG--------QGALMNARPCLKGNGNWTTTPTVAYS 514

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
              +    E+ + +G    ++  Y YD++++ RQ L  Y  +L     EAY       + 
Sbjct: 515 NETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVLGNYFGKLRDEFAEAYSRKQLPLLK 572

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
           Q      +L+ D+D LL+    FLLG W+E A+ L  +E  +  YE NART ++ W D  
Sbjct: 573 QKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLGTDEVSKNYYEENARTIVSTWGDKD 632

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
           Q     L DY N+ W GL+  YY PR  ++   +I S+ +   F    + +   +   DW
Sbjct: 633 QS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIRSVSNKQPFNADAFHQRVTQFEIDW 688

Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
                 YP E  G+ +  +  L NKY
Sbjct: 689 VKSHERYPSEPVGNVVEIATLLMNKY 714


>gi|160883168|ref|ZP_02064171.1| hypothetical protein BACOVA_01137 [Bacteroides ovatus ATCC 8483]
 gi|156111393|gb|EDO13138.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
           8483]
          Length = 737

 Score =  284 bits (726), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 173/566 (30%), Positives = 263/566 (46%), Gaps = 70/566 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSNL  W GPLP+ WLD Q  LQK+I+ R  +  M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 196 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++ +W   +   R   ++ LD  DPLF  I + F+E+Q K +G T HIY  D F+E  PP
Sbjct: 256 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PE++++    IY  M   D DA WL   WLF  D   W   +++A L +VP  KL++
Sbjct: 312 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQNKLLL 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA--FGPVEARTSENTT- 236
           LD + E   +W  + +++G PY+WC L NF GN  + G    +      V     EN + 
Sbjct: 372 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 431

Query: 237 -----------------------------MAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          WI Q + RR G     ++ AW +LY ++Y
Sbjct: 432 LGSTLEGFDVNPFMYEYVFSKAWDCNLPDSVWIEQLADRRIGLRNQQMRRAWKLLYDSIY 491

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
                                P+ +          G  ++    LK   +    P + YS
Sbjct: 492 TA-------------------PAALG--------QGTLMNARPCLKGNGNWTTTPTVAYS 524

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
              +    E+ + +G    +  TY YD++++ RQ L  Y  +L     E Y       + 
Sbjct: 525 NETLFEVWEMLLKAGEHRHS--TYEYDVVNIGRQVLGNYFGKLRDEFAETYSRKQLPLLK 582

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
           Q      +L+ D++ LL+    FLLG W+E A+ L  +E  +  YE NART ++ W D  
Sbjct: 583 QKGAEMKQLLRDVNTLLSTQSSFLLGKWIEDARSLGIDEASKNYYEENARTIVSTWGDKD 642

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
           Q     L DY N+ W GL+  YY PR  ++   +I S+ +   F    + +   +   DW
Sbjct: 643 QS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIRSVSNKQPFNADAFHQRVTQFEIDW 698

Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
                 YP E  G+A+  +  L NKY
Sbjct: 699 VKSHERYPSEPVGNAVEIATLLMNKY 724


>gi|66801665|ref|XP_629757.1| hypothetical protein DDB_G0291998 [Dictyostelium discoideum AX4]
 gi|60463162|gb|EAL61355.1| hypothetical protein DDB_G0291998 [Dictyostelium discoideum AX4]
          Length = 798

 Score =  282 bits (721), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 175/573 (30%), Positives = 282/573 (49%), Gaps = 81/573 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++GWGGP+   WL++Q  LQ KIL R+ + GM PVLP F+G++P A+Q +FP A I+
Sbjct: 255 MGNVNGWGGPITLDWLEKQRDLQIKILERMRQYGMKPVLPGFAGHIPGAIQQLFPQANIS 314

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            L  W +         T+ L++TDPLF +I   FI + +  +G T H YN D F+E  PP
Sbjct: 315 VLSTWCNFNG------TFYLESTDPLFAKITTMFIGELIDVFG-TDHFYNFDPFNELEPP 367

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            +  +Y+     ++Y  +   D  AVW++QGW     P FW+  Q +A  + VP+G ++V
Sbjct: 368 SNDTDYLRQTSQSMYENVLLADPKAVWVLQGWFIVDAPEFWQAKQTEAWFSGVPIGGVLV 427

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA---------- 229
           LDL+++V P W+T+  +YG  ++WCMLHNF G   MYG L  I+  P+ A          
Sbjct: 428 LDLWSDVIPGWTTTNYYYGHYWVWCMLHNFGGRSGMYGRLPWISSNPITARGLSPNMVGI 487

Query: 230 ----------------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                 R+ +     W+ QY+ RRYG+ VP I D W  L +TV+
Sbjct: 488 GLTPEAIEQNVVVYDMMSEMSWRSVQPNLTEWVTQYTHRRYGKLVPEIVDVWISLVNTVF 547

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           N T      N     +F  + P +          +G             +S+ +P++ Y+
Sbjct: 548 NATAATARANMGAPESFIALRPQL---------TFGN------------NSFYNPNILYN 586

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
              V   ++        + ++ T+ +D+ + T Q+L+ Y  + +  +IEA+  +D   + 
Sbjct: 587 AWNVFSMVD-----DEYVISTETFEFDISEFTMQSLSNYFMDQYFLLIEAFNASDVQTLS 641

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA---------QNEEQEKQ--YEWNA 436
            +S   L+++  MD + +      LG W   A+  A         QN        YE+NA
Sbjct: 642 TISIELLDIINYMDEIASTQSSLQLGLWTYRARLWAYPTNDIPTLQNSSNSNTAPYEFNA 701

Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
           R  +T+W        S+L DY  K WSGL+ D+Y PR  ++ K +++S+E+   F  + +
Sbjct: 702 RNVLTLW----GPSNSVLHDYAFKLWSGLVSDFYSPRWQLFLKSLVQSVENRKPFNKESF 757

Query: 497 RREWIKLTNDWQNGRNVYPVESNGDALITSQWL 529
            R    L   W   + +YP    G A  TS+++
Sbjct: 758 NRMVENLEEQWVVQQTIYPTVPVGQAYNTSKYI 790


>gi|383114162|ref|ZP_09934927.1| hypothetical protein BSGG_1664 [Bacteroides sp. D2]
 gi|382948607|gb|EFS30964.2| hypothetical protein BSGG_1664 [Bacteroides sp. D2]
          Length = 727

 Score =  281 bits (718), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 174/566 (30%), Positives = 265/566 (46%), Gaps = 70/566 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSNL  W GPLP+ WLD Q  LQK+I+ R  +  M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 186 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 245

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++ +W   +   R   ++ LD  DPLF  I + F+E+Q K +G T HIY  D F+E  PP
Sbjct: 246 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 301

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PE++++    IY  M   D DA WL   WLF  D   W   +++A L +VP  KL++
Sbjct: 302 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQDKLLL 361

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA--FGPVEARTSENTT- 236
           LD + E   +W  + +++G PY+WC L NF GN  + G    +      V     EN + 
Sbjct: 362 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 421

Query: 237 -----------------------------MAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          WI Q + RR G     ++ AW +LY ++Y
Sbjct: 422 LGSTLEGFDVNPFMYEYVFSKAWDCNLPDSVWIEQLADRRIGLRNQQMRRAWKLLYDSIY 481

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
                                    ++ +G   N  +P  K     + TS+     + YS
Sbjct: 482 TAP---------------------AALGQGTLMN-ARPCLKGNGNWTTTST-----VAYS 514

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
              +    E+ + +G    ++  Y YD++++ RQ L  Y  +L     EAY       + 
Sbjct: 515 NETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVLGNYFGKLRDEFAEAYSRKQLPLLK 572

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
           Q      +L+ D+D LL+    FLLG W+E A+ L  +   +  YE NART ++ W D  
Sbjct: 573 QKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLGTDGASKNYYEENARTIVSTWGDKD 632

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
           Q     L DY N+ W GL+  YY PR  ++   +I S+ +   F    + +   +   DW
Sbjct: 633 QS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIRSVSNKQPFNADAFHQRVTQFEIDW 688

Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
                 YP E  G+A+  +  L NKY
Sbjct: 689 VKSHERYPSEPVGNAVEIATLLMNKY 714


>gi|237719130|ref|ZP_04549611.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
 gi|229451509|gb|EEO57300.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
          Length = 737

 Score =  280 bits (716), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 174/566 (30%), Positives = 265/566 (46%), Gaps = 70/566 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSNL  W GPLP+ WLD Q  LQK+I+ R  +  M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 196 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++ +W   +   R   ++ LD  DPLF  I + F+E+Q K +G T HIY  D F+E  PP
Sbjct: 256 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PE++++    IY  M   D DA WL   WLF  D   W   +++A L +VP  KL++
Sbjct: 312 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQDKLLL 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA--FGPVEARTSENTT- 236
           LD + E   +W  + +++G PY+WC L NF GN  + G    +      V     EN + 
Sbjct: 372 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 431

Query: 237 -----------------------------MAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          WI Q + RR G     ++ AW +LY ++Y
Sbjct: 432 LGSTLEGFDVNPFMYEYVFSKAWDCNLPDSVWIEQLADRRIGLRNQQMRRAWKLLYDSIY 491

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
                                    ++ +G   N  +P  K     + TS+     + YS
Sbjct: 492 TAP---------------------AALGQGTLMN-ARPCLKGNGNWTTTST-----VAYS 524

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
              +    E+ + +G    ++  Y YD++++ RQ L  Y  +L     EAY       + 
Sbjct: 525 NETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVLGNYFGKLRDEFAEAYSRKQLPLLK 582

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
           Q      +L+ D+D LL+    FLLG W+E A+ L  +E  +  YE NART ++ W D  
Sbjct: 583 QKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLGIDEASKNYYEENARTIVSTWGDKD 642

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
           Q     L DY N+ W GL+  YY PR  ++   +I S+ +   F    + +   +   DW
Sbjct: 643 QS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIRSVSNKQPFNADAFHQRVTQFEIDW 698

Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
                 YP E   +A+  +  L NKY
Sbjct: 699 VKSHERYPSEPVSNAVEIATLLMNKY 724


>gi|340617022|ref|YP_004735475.1| alpha-N-acetylglucosaminidase [Zobellia galactanivorans]
 gi|339731819|emb|CAZ95084.1| Alpha-N-acetylglucosaminidase, family GH89 [Zobellia
           galactanivorans]
          Length = 747

 Score =  279 bits (714), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 170/550 (30%), Positives = 271/550 (49%), Gaps = 58/550 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G GGPLPQ+W+ Q+  LQ KIL R+  LGM PVL AF+G+VP  L+ ++P A I 
Sbjct: 197 MGNIDGMGGPLPQNWITQRKELQVKILNRMRSLGMKPVLQAFTGHVPQVLKKLYPEANIF 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+ +W  V+       TY LD TD LF +IG AFI++Q + YG T H+Y+ D F E  PP
Sbjct: 257 QIEDWAGVEG------TYFLDPTDELFQKIGTAFIKKQTELYG-TDHLYDADCFIEVDPP 309

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQM-KALLNSVPLGKLVV 179
              P ++  +  ++Y  M+  DS A W++QGW F +   +   +  +A L+ +P  + +V
Sbjct: 310 SKDPAFLKQVSESVYKSMELADSKATWVLQGWFFFFKKDFWTKERGRAFLDGIPKNRAIV 369

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE------ 233
           LDL+ E  P W  +  FYG P+IW ++ N    + M G L+ +     EA TSE      
Sbjct: 370 LDLYGEKNPTWDKTDAFYGQPWIWNVICNEDQKVNMSGDLEEMQRQFQEAYTSEIGNNLK 429

Query: 234 ----------------------------NTTMAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                            WI  Y+  RYG   P+++ AW +L  +
Sbjct: 430 GIGVIPEGLGYNPIVQDFIFEKAWDPQKVNVQEWIEDYATIRYGTKSPSVKKAWQLLGES 489

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VY    G T      ++      P ++   EG  ++         + +++  ++D     
Sbjct: 490 VY----GRTRTMWSPLI----TTPRLMIFEEGSKEDIRHVRKDFKITETDPFAWD----- 536

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
           +   ++ +A  L +   NEL    TY +DL ++ R+ L    ++   ++  AYQ  D   
Sbjct: 537 FDVYKLAKAAGLLLGEANELQDVETYNFDLTNVYRELLFSLTHKSINDVSVAYQEKDRQA 596

Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
           + + ++   +L++D++ +   ++ FLLG WLE AK      E+++ YEWNART +T+W  
Sbjct: 597 LDRSAKSLFKLMDDLEAITGANENFLLGKWLEDAKSWGSTPEEKEYYEWNARTIVTIWQP 656

Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
             +     LRDY  K W+GL   YY PR  ++  ++  SL  G  F  K +  E  ++  
Sbjct: 657 YPE---GGLRDYAGKQWNGLFSGYYKPRWQLFVDHLRRSLTEGVDFDPKAYDAEVREMDY 713

Query: 506 DWQNGRNVYP 515
            W     +YP
Sbjct: 714 KWTRSHQIYP 723


>gi|390334740|ref|XP_003724005.1| PREDICTED: uncharacterized protein LOC100893810 [Strongylocentrotus
            purpuratus]
          Length = 1043

 Score =  278 bits (711), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 164/471 (34%), Positives = 232/471 (49%), Gaps = 74/471 (15%)

Query: 100  KEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF 159
            +E+  T HIYN DTF+EN P  +   Y+S+    +Y G+  GD   VWLMQGWLF    F
Sbjct: 573  EEFNGTDHIYNADTFNENQPRSNDSAYLSAASRGVYQGIVEGDPQGVWLMQGWLFQKTDF 632

Query: 160  WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGIL 219
            W P Q+KALL+ VP+G+++VLDLFAE +PI++ ++ FYG P+IWCMLHNF GN  +YG L
Sbjct: 633  WGPSQIKALLHGVPIGRMIVLDLFAEARPIYNATQSFYGQPFIWCMLHNFGGNTGLYGKL 692

Query: 220  DSIAFGPVEARTSENTTM---------------------------------AWINQYSVR 246
            D++   P EAR   ++TM                                  WI +YS R
Sbjct: 693  DAVNKFPFEARQFNSSTMIGMGLTPEGILQNYVMYNFLTDMTWRSESMNVSKWIEEYSGR 752

Query: 247  RY----GRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 302
            RY    G S  A + AW +L  TVYN T    D       A P V PS          N 
Sbjct: 753  RYSPESGHSEEAAK-AWAILQATVYNNTGIDKDHQH----AVPVVRPS----------NK 797

Query: 303  GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 362
             K V                 +WY  +EV +A    + +   L  S+ +RYDL+D+TR  
Sbjct: 798  TKSV-----------------IWYDYTEVAKAWGFLLQASETLGTSSLFRYDLVDVTRNV 840

Query: 363  LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 422
            L   A + +  I+ ++   +   +         L+ DMD + + H  +LLG WLE AK L
Sbjct: 841  LQDLAFDFYEQIMVSFHAKNITAIRGNGTLLCNLILDMDNITSSHQDWLLGTWLEDAKSL 900

Query: 423  AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 482
            A N ++E  YE+NAR QIT+W    +       DY NK W GLLR YY  R  ++ +++ 
Sbjct: 901  ATNHKEESLYEYNARNQITVWGPRGEH-----LDYANKQWGGLLRSYYYNRWQLFVQFLD 955

Query: 483  ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +E    +    +      +  +W N    +P +  GD +  S+ LY+KY
Sbjct: 956  GCIELHVPYDQSKFDMRSFIMETEWTNSTEKFPTKPVGDTVSISRALYSKY 1006


>gi|440799253|gb|ELR20308.1| alpha-N-acetylglucosaminidase family protein [Acanthamoeba
           castellanii str. Neff]
          Length = 854

 Score =  273 bits (697), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 172/586 (29%), Positives = 274/586 (46%), Gaps = 100/586 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPL  +W   Q  LQKKI+ R    GM P+LP F+G VP  ++ ++P+A +T
Sbjct: 240 MGNIQGWGGPLDPAWRKAQAELQKKIVERQRMFGMLPILPGFAGFVPDGIKRIYPTANLT 299

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +  +W       ++   Y L   D L+  IGR  I +   E+G T HIYN DTF+E +PP
Sbjct: 300 KSADWAGFPH--QYTNVYFLSPLDSLYKTIGRMVIRRVTAEFG-TDHIYNADTFNEMSPP 356

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWR-PPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y GM + D  A+W+MQGW F +D FW    ++++ L+ V    +++
Sbjct: 357 SADPTYLAAASRAVYEGMAAEDPQALWVMQGWSFVFDKFWEDKSRVRSYLSGVSDKDMLI 416

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDL ++  P WS +  ++G  ++WCMLHN  G   +YG L   +  P+ A  +   TM  
Sbjct: 417 LDLASDNNPEWSKTDSYFGKEFVWCMLHNGGGVRGLYGNLTQYSSDPLLALATPGNTMLI 476

Query: 238 ---------------------------------------AWINQYSVRRYGRS--VPAIQ 256
                                                   W+ +Y+ RRYG +  + ++ 
Sbjct: 477 CGTCEQVGVGMTMEAIEQNPVVYELMSEMGWRSEAFDIVEWVQRYAERRYGLAAGLSSVG 536

Query: 257 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 316
           +AW +L    Y         N+ VI      D      T G    YG             
Sbjct: 537 EAWELLREATY---------NQSVI------DYGWFGFTPGLGMGYGGVA---------- 571

Query: 317 SSYDHPHLWYSTSEVIRALELFIASG--NELSASNTYRYDLIDLTRQALAKYANELFLNI 374
                     + ++ + AL LF+ S      + +  ++YD +DLTRQ LA    +++   
Sbjct: 572 ----------NAAKEVEALRLFLQSALTKGYAPNGPWQYDCVDLTRQVLANTFRDIYAQF 621

Query: 375 IEAYQLNDAHGVF------QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 428
             AY    AH  +       L    L L+ D+D +LA +  +LLG W++SA   A   +Q
Sbjct: 622 DAAYSAYAAHKTYTVDQLKSLGSALLTLIGDIDEILATNPNYLLGTWIQSALSWADTPDQ 681

Query: 429 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 488
              Y++NAR QIT+W  + Q     + DY  K+W+ L+R YY PR  ++   +++++ +G
Sbjct: 682 ALHYQFNARNQITLWGPDGQ-----ITDYATKHWADLVRSYYQPRWTLFITSVLQAVYAG 736

Query: 489 DGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
                +++R E ++L   W      Y     G+ L  +  L  KYL
Sbjct: 737 -----REYRGELLQLEQKWNRENTTYATTPTGNTLQVAYKLAAKYL 777


>gi|392588150|gb|EIW77482.1| glycoside hydrolase family 89 protein [Coniophora puteana
           RWD-64-598 SS2]
          Length = 761

 Score =  270 bits (690), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 179/583 (30%), Positives = 283/583 (48%), Gaps = 81/583 (13%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G WGG LP  W+D Q VL K+I+ R+ +LGM PVLPAF+G VP A+ N++P+A I  
Sbjct: 208 NIQGSWGGDLPTQWIDDQFVLGKQIVQRMVDLGMTPVLPAFTGFVPPAMHNLYPNASIVN 267

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
              W      P++     L+  DPLF ++ ++FI +Q   +G  SHIY  D ++EN P  
Sbjct: 268 GSAWNDFA--PQFTNDSFLEPFDPLFAQVQQSFISKQQAAFGNVSHIYTLDQYNENDPYS 325

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVP----LGK 176
             P Y++++ AA +S +++ D DA WLMQGWL FS   FW P +++A L  VP       
Sbjct: 326 GDPSYLTNISAATFSSLRAADPDATWLMQGWLFFSSADFWTPERVEAYLAGVPGDDDGSG 385

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
           +++LDL++E +P W     ++G  +IWC LH++ GN+   G   ++   P+ A  S N +
Sbjct: 386 MLILDLYSEAQPQWQRLSSYFGKRWIWCELHDYGGNMGFEGNFANVTEAPLAALASPNVS 445

Query: 237 M-----------------------AW----INQ------YSVRRY-GRSVPAIQ-DAWNV 261
           M                       AW    IN+      ++ RRY    +P    +AW  
Sbjct: 446 MVGVGLTPEGMEGNEIIYDVLLDQAWSSSPINKTEYAQAWATRRYPADELPECAIEAWQT 505

Query: 262 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
           L  TVY+ TD  +      I+        +++VT                         H
Sbjct: 506 LAATVYSNTDPGSQATVKSILELEPALSGLVNVTG-----------------------HH 542

Query: 322 P-HLWYST-SEVIRALELFIASGN---ELSASNTYRYDLIDLTRQALAKYANELFLNIIE 376
           P H++Y T + ++ AL+  + +G+    L A   YRYDL+DLTRQ L     +L+ +++ 
Sbjct: 543 PTHVFYDTNTTIVPALQQLVQAGHSTPSLLAIPEYRYDLVDLTRQLLVNRFIDLYADLLA 602

Query: 377 AYQLN--DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-E 433
            Y      +  V    +  LELV D+D +L  ++ F L  W ++A+  A        Y E
Sbjct: 603 VYNTTSASSASVSAAGQPMLELVADLDKVLMTNENFQLSRWTDAARSWANGNASYAAYLE 662

Query: 434 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRL 493
           +NAR QIT+W    +     + DY +K W GL+ DYYG R A++ +Y+  S  +G  +  
Sbjct: 663 YNARNQITLWGPKGE-----INDYASKQWGGLVGDYYGKRWAMFIQYLEGSKSNGTVYNA 717

Query: 494 KDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 536
                  + +   W    +   V +    ++   WL  + + G
Sbjct: 718 TAIAGALLDIGEQWSQSTSA--VGARDTMVVGDTWLVVQEMAG 758


>gi|404487028|ref|ZP_11022215.1| hypothetical protein HMPREF9448_02671 [Barnesiella intestinihominis
           YIT 11860]
 gi|404335524|gb|EJZ61993.1| hypothetical protein HMPREF9448_02671 [Barnesiella intestinihominis
           YIT 11860]
          Length = 726

 Score =  270 bits (689), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 179/566 (31%), Positives = 260/566 (45%), Gaps = 76/566 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+ GW GPLP  WL+ Q  LQKKIL R  EL M PVLPAF+G+VPAAL+ + P A I 
Sbjct: 196 MANIDGWNGPLPMQWLESQAELQKKILARERELNMTPVLPAFAGHVPAALKRIHPDANIQ 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R    + L+  +PLF EI ++F+E+Q K +G T HIY  D F+E  PP
Sbjct: 256 YLGKWAGFGDSYR---CHFLNPEEPLFAEIQKSFLEEQEKMFG-TDHIYGVDPFNEVDPP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              PEY++ + + +Y  + + D DAVWL   W+F +D   W  P++KALL  VP  KLV+
Sbjct: 312 SWEPEYLAQVSSDMYKSLAAADPDAVWLQMTWMFYHDRKLWTAPRVKALLTGVPSDKLVL 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG-------ILDSIAFG------- 225
           LD   E   +W ++++F+G PYIWC L NF GN  + G        LD+           
Sbjct: 372 LDYHCENVELWKSTEKFHGQPYIWCYLGNFGGNTTLTGNVKESGDRLDNALINGGDNLKG 431

Query: 226 -------------PVE-----ARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                        P E     A T +     W+ + + R  G    + ++AW +L+    
Sbjct: 432 IGSTLEGLDINQFPYEYIFEKAWTIDVNGQDWVERLADRHVGAVSESAREAWQILFD--- 488

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
                      DV V  P     +         NY K  S E                Y 
Sbjct: 489 -----------DVFVQVPRTLGILPGYRPKLGDNYNKRTSNE----------------YD 521

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
            + ++R  EL +   +     + +  D+I   RQ L  Y  ++       Y+  +  G+ 
Sbjct: 522 NATLLRVWELLLEVPS--CDRDAFEIDVIMTGRQLLGNYFLDVKKEFDGFYKKRNVPGLK 579

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
           + +    E++ D++ L + H+   L  W+E A+ L   +E +  YE NAR  IT W    
Sbjct: 580 EKASEMREILSDLELLNSFHNRASLDKWIEDARSLGDTDELKNYYEKNARNLITTW---- 635

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
                 L DY ++ W+GLL DYY  R  IYF  +I + E G      + +        +W
Sbjct: 636 ---GGSLNDYASRTWAGLLNDYYARRWEIYFDAVIGAAEKGIELDKDELKSRLATFEQEW 692

Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
                   +E NG  L TS+ L  KY
Sbjct: 693 VESTTPVCIERNGTLLDTSRRLLEKY 718


>gi|313203962|ref|YP_004042619.1| alpha-N-acetylglucosaminidase [Paludibacter propionicigenes WB4]
 gi|312443278|gb|ADQ79634.1| Alpha-N-acetylglucosaminidase [Paludibacter propionicigenes WB4]
          Length = 738

 Score =  269 bits (688), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 189/571 (33%), Positives = 273/571 (47%), Gaps = 69/571 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  WGGPLP S+++ Q  LQ+ IL R   LGM P+L AF+G+VP  L+ + PSAKIT
Sbjct: 196 MANMDKWGGPLPISYIEGQKKLQQHILQRSRALGMKPILSAFAGHVPEQLKTLRPSAKIT 255

Query: 61  QLG-NWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
           ++   W  + ++     TY LD TD LF EI + F+  Q K YG T H+Y+ D F+E TP
Sbjct: 256 RIEPGWGGMAAE---YTTYFLDPTDNLFGEIQKRFLTVQQKLYG-TDHLYSADPFNEITP 311

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLV 178
           P   P+Y++++G  IY  M   D +A+W    W F  DP  W  P++ A++++VP GKL 
Sbjct: 312 PSWEPDYLANVGKTIYETMSQVDKEAIWYQMSWTFYNDPTHWTRPRLSAMIHAVPQGKLF 371

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM---------------YG------ 217
            LD   E +  +  S  FYG P+IWC L NF  N  +               YG      
Sbjct: 372 FLDYNCEEEEFFRKSDNFYGAPFIWCYLGNFGANTHLVAPLNKVVNRLGKLTYGSACVGV 431

Query: 218 --ILDSIAFGP--------VEARTSENTTM-AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
              L+ I   P        +  R  E  T    I  Y+ RR G    A+ +AW +L   V
Sbjct: 432 GSTLEGINVNPEIYETVLEMPWRADETVTADTLIRHYAERRAGARDKAVIEAWQLLRQHV 491

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
               D A       +V    V P +  +T   +                     +P + Y
Sbjct: 492 --LVDTAVGIWNHCVVF--QVSP-VTDLTRAFWAT-------------------NPKIPY 527

Query: 327 STSEVIRAL-ELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
              ++  AL  +F AS N    ++ YR+D+++LTRQAL  Y   L+  ++EAY   +   
Sbjct: 528 RNVDLAIALNRMFQASANS-KKTDAYRFDVVNLTRQALGNYGTVLYHKMMEAYSRKNLID 586

Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
             + S  FL+L +++DGLLA    FLLG WL  A+       ++  YE NAR  IT W  
Sbjct: 587 FRKYSGEFLQLGQEIDGLLATRHEFLLGKWLADARSWGTTPAEKAYYERNAREIITTW-- 644

Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
              +    L DY N+ W+GLLR YY PR   +   +  SL +G  +  K +         
Sbjct: 645 --HKAGGGLTDYSNRQWNGLLRSYYLPRWKEFINRLDTSLSTGKDYDDKAFAAWCSAFEQ 702

Query: 506 DWQNG-RNVYPVESNGDALITSQWLYNKYLQ 535
            W +   + Y     GDA+  +  L+ KY Q
Sbjct: 703 HWVDSPSSAYSDTETGDAVKMAFELFGKYKQ 733


>gi|298385999|ref|ZP_06995556.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
           1_1_14]
 gi|298261227|gb|EFI04094.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
           1_1_14]
          Length = 715

 Score =  268 bits (684), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 173/570 (30%), Positives = 272/570 (47%), Gaps = 87/570 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL   GGPLP  W  +Q+VLQKKIL R+ E GM PV   F G VP+ L+  +P A++ 
Sbjct: 194 MGNLENIGGPLPDEWFKEQIVLQKKILARMREYGMKPVFQGFFGMVPSLLKEKYPEARLV 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
           + G W S++  P      +LD  DPLF  + + +  +  K YG+ + ++  D F E   T
Sbjct: 254 EQGLWNSLQRPP------VLDPADPLFERMAKVWYAEYEKLYGK-ADLFGGDLFHEGGKT 306

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D    ++     + + M+  + DA W++Q WL +       P+ K LL  +     +
Sbjct: 307 GGID----VTDAARRVQTAMKRYNPDATWVIQAWLGN-------PK-KELLAGLDRKNTL 354

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART------- 231
           ++DL AE    W   K F G P++W  + N+ GNI ++G LD+IA GPV+ +        
Sbjct: 355 IVDLAAEFWDNWRKRKGFDGFPWLWSHISNYGGNIGLHGRLDAIATGPVDGQKDSAASPS 414

Query: 232 ---------------------------SENTTM-AWINQYSVRRYGRSVPAIQDAWNVLY 263
                                      SE+  +  W+ +YSVRRYG     +++AW + +
Sbjct: 415 MKGTSSTPEGIEVNPVVFDLLNEMRWRSEHLDLDVWLKEYSVRRYGVEDENLKEAWTIFH 474

Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
            T Y    G    +  V  A P +    I+                       S++    
Sbjct: 475 RTAYGTYTGHRRPSESVFCAPPSLKRDKITA----------------------SAWSQCR 512

Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
           ++Y      + + LF+ S + L  ++TY+YD +D  RQ LA    E + N+++AY+  D 
Sbjct: 513 IFYDPELFAQGVGLFLQSADRLKQTSTYQYDAVDFVRQYLADLGRETYYNLVDAYRAKDT 572

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
                 S RFL+L++D + LL+ H+ F +G WL+ A+  ++  E +  YE NAR  I  W
Sbjct: 573 KQFDYWSERFLQLIKDQNELLSTHERFFVGRWLDMARLKSKQPELQDLYEHNARMLIGTW 632

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                E  S +RDY +K W GLL+DYY PR   Y  Y+  +LE G    + D      + 
Sbjct: 633 ----TETLSPVRDYAHKEWGGLLKDYYLPRWTNYIAYLKGTLE-GRSLTVPD----SFQA 683

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              W N  N Y +E+  D + T++ +Y+KY
Sbjct: 684 EKAWVNAHNKYVLEAGVDPVQTAKRMYSKY 713


>gi|297273081|ref|XP_001095618.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Macaca mulatta]
          Length = 691

 Score =  267 bits (682), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 172/517 (33%), Positives = 249/517 (48%), Gaps = 113/517 (21%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
           M NLH W GPLP SW  +QL LQ ++L R+   GM PVLPAF+G+VP A+   +  P A 
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRTSCMPVAS 263

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           +       S+   P                  GR  I                 + +   
Sbjct: 264 LPA-----SLPPSPG-----------------GRKLIH----------------SINLMQ 285

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
           PP  +P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L
Sbjct: 286 PPSSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRL 345

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           +VLDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM
Sbjct: 346 LVLDLFAESQPVYTLTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTM 405

Query: 238 ----------------------------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
                                             AW+  ++ +RYG S P    AW +L 
Sbjct: 406 VGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLL 465

Query: 264 HTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 322
            +VYNC+ +     NR  +V  P                          L+  TS     
Sbjct: 466 RSVYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS----- 495

Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
            +WY+ S V  A  L + S   L+AS  +RYDL+DLTRQA+ +  +  +     AY   +
Sbjct: 496 -VWYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKE 554

Query: 383 AHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
              + +       EL+  +D LLA    FLLG WLE A+  A +E +   YE N+R Q+T
Sbjct: 555 LTSLLRAGGVLAYELLPALDELLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLT 614

Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
           +W      E ++L DY NK  +GL+ +YY PR  ++ 
Sbjct: 615 LW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFL 646


>gi|119480815|ref|XP_001260436.1| alpha-N-acetylglucosaminidase, putative [Neosartorya fischeri NRRL
           181]
 gi|119408590|gb|EAW18539.1| alpha-N-acetylglucosaminidase, putative [Neosartorya fischeri NRRL
           181]
          Length = 748

 Score =  266 bits (679), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 165/524 (31%), Positives = 265/524 (50%), Gaps = 77/524 (14%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G WGG LP SW+D Q  LQKKI+ R+ ELGM PVLPAF+G VP A+  V P+A +  
Sbjct: 197 NIQGSWGGDLPYSWIDSQFELQKKIVRRMVELGMTPVLPAFTGFVPRAISRVLPNATVVN 256

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
              W     D R+     L+  DP F  + R+FI++Q + YG  +H+Y  D ++EN P  
Sbjct: 257 GSRWGGF--DERYTNDTFLEPFDPSFTRLQRSFIQKQQQAYGNITHVYTLDQYNENDPYS 314

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-LVV 179
              +Y+ ++    +  ++S D +AVWLMQGWLF S   FW   ++KA L+ V + + ++V
Sbjct: 315 GDLDYLRNVTRNTWLSLKSADPNAVWLMQGWLFYSNSDFWTDERVKAYLSGVEVDQDMLV 374

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLF+E +P W  ++ +YG P+IWC LH++ GN+ +YG + ++     +A  + ++ +  
Sbjct: 375 LDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGLYGQVMNVTVNATQALAASDSLVGF 434

Query: 238 --------------------AWINQ---------------YSVRRYGRSVP-AIQDAWNV 261
                               AW  Q               YS    G +VP  +  AW++
Sbjct: 435 GLTMEGQEGNEIMYDLLLDQAWSRQPIDTDHYFHNWVKTRYSSGVRGSAVPEELHQAWDI 494

Query: 262 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
           L  TVYN T+  +      I    ++ PSI     G     G                DH
Sbjct: 495 LRTTVYNNTNLTSTAVSKSIF---ELQPSI----SGLLNRTG----------------DH 531

Query: 322 P-HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEA 377
           P  + Y  + +++A +L  ++ ++   L +   + YD++D+TRQ +A     +++N++  
Sbjct: 532 PTTVNYDPAALVQAWQLMDSAASKDRSLWSQPAFLYDMVDITRQVMANAFIPMYINLVST 591

Query: 378 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNAR 437
           YQ      V       ++L+ D+D +L+ +D F L  W+ SA+  A+N+ +   YE+NAR
Sbjct: 592 YQA--GASVSTDGSNLIQLLRDVDSVLSTNDNFRLSTWIRSARSWARNDTEADFYEYNAR 649

Query: 438 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 481
            QI +W          + DY +K W GL+  YY PR   +  Y+
Sbjct: 650 NQIALW-----GPMGEINDYASKQWGGLVSAYYIPRWQTFLHYL 688


>gi|156121099|ref|NP_001095696.1| alpha-N-acetylglucosaminidase precursor [Bos taurus]
 gi|151554244|gb|AAI48148.1| NAGLU protein [Bos taurus]
 gi|296476361|tpg|DAA18476.1| TPA: alpha-N-acetylglucosaminidase [Bos taurus]
          Length = 667

 Score =  265 bits (677), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 148/383 (38%), Positives = 205/383 (53%), Gaps = 68/383 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ +IL R+   GM PVLPAF+G+VP AL  VFP   +T
Sbjct: 204 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+GNW        + C++LL   DPLF  +G  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 264 QMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATAAVYQAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVAAVLGAVPRGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+S+  GP  AR   N+TM  
Sbjct: 381 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPTTARHFPNSTMVG 440

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ RRYG S    + AW +L  +
Sbjct: 441 TGMAPEGIGQNEVVYALMAELGWQKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 500

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VYNC                       S  E +  N+  P+ +   L+  T+      +W
Sbjct: 501 VYNC-----------------------SGEECRGHNH-SPLVRRPSLQMVTT------VW 530

Query: 326 YSTSEVIRALELFIASGNELSAS 348
           Y+ S+V  A  L + + + L++S
Sbjct: 531 YNRSDVFEAWRLLLTATSTLASS 553



 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 60/115 (52%), Gaps = 5/115 (4%)

Query: 419 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
           A   A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ DYY PR  ++ 
Sbjct: 551 ASSPAVSETEAHFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRLFT 605

Query: 479 KYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
           + ++ESL  G  F+   + R   +L   +  G   YP +  GD +   + L+ KY
Sbjct: 606 ETLVESLVQGVPFQQHQFDRNAFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKY 660


>gi|404406438|ref|ZP_10998022.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
          Length = 726

 Score =  265 bits (676), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 179/576 (31%), Positives = 271/576 (47%), Gaps = 82/576 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLPQSW+D QL LQ++I+ R  ELG+ PV  +F+G+VP AL+ +FP A I 
Sbjct: 190 MANIDAWHGPLPQSWIDGQLELQRRIIARERELGIQPVFTSFTGHVPKALKTLFPDADIE 249

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L  W S +   R   +Y L+  +PLF  I +A++++Q + +G +S +Y  D F+E  PP
Sbjct: 250 RLNPWTSFE---RPYNSYYLNPAEPLFNRIQQAYMQEQRRLFGESS-VYGVDPFNELDPP 305

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
              PEY++      Y  +   D DAVWL   W+F +    W P ++KA L +VP GKL++
Sbjct: 306 NWDPEYLARAARLTYESITQFDKDAVWLQMAWVFYHKRRDWTPERLKAYLCAVPDGKLLM 365

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG---------------------- 217
           LD + +   +W +++ FYG P+IW  L NF GN  + G                      
Sbjct: 366 LDYYCDKVELWRSTESFYGQPFIWSYLGNFGGNTMLAGDVKDVSRKLDRAYAEAGRNFVG 425

Query: 218 ---ILDSIAFGPV-------EARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
               L+ +   P         A T       WI++ + R  GR     + AW +LY  +Y
Sbjct: 426 IGCTLEGLDVNPFMYEYVLDRAWTQLYDDAGWIDRLADRHSGRIDVHYRQAWRILYDKIY 485

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
               G    NR   V                     +P  K        S +  PHL Y 
Sbjct: 486 CAPSG----NRSAAVC-------------------ARPNMK------GRSKWSGPHLDYD 516

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
             +++R  E    +  E +AS+  R+D +++ RQ L  Y   L    I A +  D   V 
Sbjct: 517 NRDLLRVWEQLTLARPERTASS--RFDCVNIPRQCLENYFGNLNERCIAACRGGDRETVA 574

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
           +LS R LEL++D+D L+A    FLLG W+  A+++     ++  +E +AR  +T W    
Sbjct: 575 RLSARLLELLDDIDRLVAADAYFLLGKWIADARRMGATPAEKDYFERDARNILTTW---- 630

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF---RLKDWRREWIKLT 504
                 L DY N+ WSGL+ DYY  R   ++  +    E  +      L+D+  EW+   
Sbjct: 631 GGRGYSLNDYANRTWSGLVSDYYKERWRRFYDRLQSDGEPDEDALLQELQDFEWEWVG-- 688

Query: 505 NDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 540
              + GR  +     GDA    + LY KY    G F
Sbjct: 689 ---RKGR--FAERPRGDAFRLCRSLYTKYAAEIGRF 719


>gi|393783261|ref|ZP_10371436.1| hypothetical protein HMPREF1071_02304 [Bacteroides salyersiae
           CL02T12C01]
 gi|392669540|gb|EIY63028.1| hypothetical protein HMPREF1071_02304 [Bacteroides salyersiae
           CL02T12C01]
          Length = 724

 Score =  264 bits (675), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 169/567 (29%), Positives = 267/567 (47%), Gaps = 79/567 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+ GW GPLP  WLD Q+ LQKKIL R  EL M PVLPAF+G+VP AL+ +FP A I 
Sbjct: 198 MANIDGWNGPLPMHWLDSQVELQKKILTRERELNMKPVLPAFAGHVPGALKRIFPEANIQ 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W     + R    + L+  + LF  I + +I++Q + +G T HIY  D F+E  PP
Sbjct: 258 NLGKWAGFAEEYR---CHFLNPEEALFATIQKQYIKEQTRLFG-TDHIYGVDPFNEVDPP 313

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              PEY+S + A +Y  + + D  A W+   W+F +D   W  P++KA+L  VP GK+V+
Sbjct: 314 SWEPEYLSKVSADMYHTLTAADPKAEWMQMTWMFYFDRKDWTAPRVKAMLTGVPQGKMVL 373

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
           LD   E   +W T++ F+G PYIWC L NF GN  + G                      
Sbjct: 374 LDYHCENVELWKTTEHFHGQPYIWCYLGNFGGNTTLTGNVKESGARLDNTLINGGSNFKG 433

Query: 219 -------LDSIAFGPVE-----ARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                  LD + F P E     A T      +W+N  + R  G +   +++AW++L++ V
Sbjct: 434 IGSTLEGLDVMQF-PYEYIFEKAWTLNTDDRSWLNALADRHTGVTSEPVREAWDILFNQV 492

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y             +   P++ P +            KP ++ ++              Y
Sbjct: 493 YVQVP-------RTLAVLPNLRPVM-----------NKPNNRTSIN-------------Y 521

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             + +++A +  + + +     +  R D+I + RQ L  Y   +  +    Y+  D   +
Sbjct: 522 PNTALLQAWQKLLQAPD--CNRDALRLDIITVGRQLLGNYFLTVKDDFDRMYEAKDLPAL 579

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
              +    E++ D++ L A H    L  W+  A++     E +  YE NAR  IT W   
Sbjct: 580 KARAAEMREILNDLERLNAFHSRCSLDKWISDARKYGNTPELKNYYEKNARNLITTW--- 636

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
                  L DY ++ W+GL++DYY  R  +Y   ++ ++E+   F  +    E+    + 
Sbjct: 637 ----GGRLNDYASRTWAGLIKDYYSKRWDMYLDAVVAAVENNREFDQEKLDGEFRLFEDS 692

Query: 507 WQNGRNVYPVESNGDALITSQWLYNKY 533
           W +      V   GD LI +++L NKY
Sbjct: 693 WVSSTRPVEVTPEGDLLIYARFLLNKY 719


>gi|423287380|ref|ZP_17266231.1| hypothetical protein HMPREF1069_01274 [Bacteroides ovatus
           CL02T12C04]
 gi|392672495|gb|EIY65962.1| hypothetical protein HMPREF1069_01274 [Bacteroides ovatus
           CL02T12C04]
          Length = 726

 Score =  264 bits (675), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 170/566 (30%), Positives = 268/566 (47%), Gaps = 70/566 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLP SWL  Q  LQK+I+ R   LGM PVLPAFSG+VPA L+ ++P A IT
Sbjct: 192 MSNVDYWQSPLPLSWLKNQRKLQKQIVDRERLLGMTPVLPAFSGHVPAELKRLYPDAAIT 251

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+  W       R   ++ +D  DPLF +I + ++E+Q K YG T HIY  D F+E   P
Sbjct: 252 QMSQWGGYDKKYR---SHFIDPMDPLFGKIQKRYLEKQTKLYG-TDHIYGIDPFNEVDSP 307

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
               +++ ++   I+  ++  DS A W+   W+F +    W  P++KA LNSVP  KL++
Sbjct: 308 NWDEDFLRTVSDKIFHSIEQVDSLAHWIQMTWMFYHSKDKWSQPRIKAFLNSVPDDKLIL 367

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------- 223
           LD + +   IW  ++Q+YG PYIWC L NF GN  + G +D ++                
Sbjct: 368 LDYYCDSVEIWRETQQYYGKPYIWCYLGNFGGNSMLAGHVDDVSAKLNRLFVEGGKNISG 427

Query: 224 ----------------FGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                           F   +A +   T   W+  +++ R G     I DAW  LY  +Y
Sbjct: 428 VGATLEGLDVNPFMYEFVLEKAWSHTITNADWMKNWALCRGGSKSSHIIDAWQQLYKKIY 487

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
                                  I   T G+       ++   +L+   S   HP ++Y 
Sbjct: 488 -----------------------IHHATAGQ----AVLMNARPMLEGTDSWNTHPDIYYD 520

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
             E+      F+ + N    S+ Y++D+I++ RQ L    ++   +    Y+  +  G+ 
Sbjct: 521 NKELWHIWGKFLEAKN--VDSSGYKFDVINIGRQVLGNLFSDFRDSFTACYRQKNIEGMK 578

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
           + + +   L  D+D LL+C   F +G W++ A+   +N ++++ YE NAR  +T W    
Sbjct: 579 EWAEKMNTLFTDVDRLLSCESSFSIGKWIKDARDWGKNLKEKEYYEQNARCILTTW---- 634

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
            ++A+ L DY N+ W GL   YY  R  ++ +Y I+ +  G     K +     +    W
Sbjct: 635 GQKATQLNDYANRGWGGLTDSYYRKRWELFTQYAIDEMSHGKEIDEKSFYNLITEFEYQW 694

Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
               NVY   S  D +  +  LY KY
Sbjct: 695 TLQTNVYSESSGEDPIRIANLLYIKY 720


>gi|295087651|emb|CBK69174.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
           XB1A]
          Length = 703

 Score =  264 bits (675), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 170/566 (30%), Positives = 268/566 (47%), Gaps = 70/566 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLP SWL  Q  LQK+I+ R   LGM PVLPAFSG+VPA L+ ++P A IT
Sbjct: 169 MSNVDYWQSPLPLSWLKNQRKLQKQIVDRERLLGMTPVLPAFSGHVPAELKRLYPDAAIT 228

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+  W       R   ++ +D  DPLF +I + ++E+Q K YG T HIY  D F+E   P
Sbjct: 229 QMSQWGGYDEKYR---SHFIDPMDPLFGKIQKRYLEKQTKLYG-TDHIYGIDPFNEVDSP 284

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
               +++ ++   I+  ++  DS A W+   W+F +    W  P++KA LNSVP  KL++
Sbjct: 285 NWDEDFLRTVSDKIFHSIEQVDSLAHWIQMTWMFYHSKDKWSQPRIKAFLNSVPDDKLIL 344

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------- 223
           LD + +   IW  ++Q+YG PYIWC L NF GN  + G +D ++                
Sbjct: 345 LDYYCDSVEIWRETQQYYGKPYIWCYLGNFGGNSMLAGHVDDVSAKLNRLFVEGGKNISG 404

Query: 224 ----------------FGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                           F   +A +   T   W+  +++ R G     I DAW  LY  +Y
Sbjct: 405 VGATLEGLDVNPFMYEFVLEKAWSHTITNADWMKNWALCRGGSKSSHIIDAWQQLYKKIY 464

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
                                  I   T G+       ++   +L+   S   HP ++Y 
Sbjct: 465 -----------------------IHHATAGQ----AVLMNARPMLEGTDSWNTHPDIYYD 497

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
             E+      F+ + N    S+ Y++D+I++ RQ L    ++   +    Y+  +  G+ 
Sbjct: 498 NKELWHIWGKFLEAKN--VDSSGYKFDVINIGRQVLGNLFSDFRDSFTACYRQKNIEGMK 555

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
           + + +   L  D+D LL+C   F +G W++ A+   +N ++++ YE NAR  +T W    
Sbjct: 556 EWAEKMNTLFTDVDRLLSCESSFSIGKWIKDARDWGKNLKEKEYYEQNARCILTTW---- 611

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
            ++A+ L DY N+ W GL   YY  R  ++ +Y I+ +  G     K +     +    W
Sbjct: 612 GQKATQLNDYANRGWGGLTDSYYRKRWELFTQYAIDEMSHGKEIDEKSFYNLITEFEYQW 671

Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
               NVY   S  D +  +  LY KY
Sbjct: 672 TLQTNVYSESSGEDPIRIANLLYIKY 697


>gi|423213214|ref|ZP_17199743.1| hypothetical protein HMPREF1074_01275 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392693674|gb|EIY86904.1| hypothetical protein HMPREF1074_01275 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 726

 Score =  264 bits (675), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 170/566 (30%), Positives = 268/566 (47%), Gaps = 70/566 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLP SWL  Q  LQK+I+ R   LGM PVLPAFSG+VPA L+ ++P A IT
Sbjct: 192 MSNVDYWQSPLPLSWLKNQRKLQKQIVDRERLLGMTPVLPAFSGHVPAELKRLYPDAAIT 251

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+  W       R   ++ +D  DPLF +I + ++E+Q K YG T HIY  D F+E   P
Sbjct: 252 QMSQWGGYDEKYR---SHFIDPMDPLFGKIQKRYLEKQTKLYG-TDHIYGIDPFNEVDSP 307

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
               +++ ++   I+  ++  DS A W+   W+F +    W  P++KA LNSVP  KL++
Sbjct: 308 NWDEDFLRTVSDKIFHSIEQVDSLAHWIQMTWMFYHSKDKWSQPRIKAFLNSVPDDKLIL 367

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------- 223
           LD + +   IW  ++Q+YG PYIWC L NF GN  + G +D ++                
Sbjct: 368 LDYYCDSVEIWRETQQYYGKPYIWCYLGNFGGNSMLAGHVDDVSAKLNRLFVEGGKNISG 427

Query: 224 ----------------FGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                           F   +A +   T   W+  +++ R G     I DAW  LY  +Y
Sbjct: 428 VGATLEGLDVNPFMYEFVLEKAWSHTITNADWMKNWALCRGGSKSSHIIDAWQQLYKKIY 487

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
                                  I   T G+       ++   +L+   S   HP ++Y 
Sbjct: 488 -----------------------IHHATAGQ----AVLMNARPMLEGTDSWNTHPDIYYD 520

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
             E+      F+ + N    S+ Y++D+I++ RQ L    ++   +    Y+  +  G+ 
Sbjct: 521 NKELWHIWGKFLEAKN--VDSSGYKFDVINIGRQVLGNLFSDFRDSFTACYRQKNIEGMK 578

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
           + + +   L  D+D LL+C   F +G W++ A+   +N ++++ YE NAR  +T W    
Sbjct: 579 EWAEKMNTLFTDVDRLLSCESSFSIGKWIKDARDWGKNLKEKEYYEQNARCILTTW---- 634

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
            ++A+ L DY N+ W GL   YY  R  ++ +Y I+ +  G     K +     +    W
Sbjct: 635 GQKATQLNDYANRGWGGLTDSYYRKRWELFTQYAIDEMSHGKEIDEKSFYNLITEFEYQW 694

Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
               NVY   S  D +  +  LY KY
Sbjct: 695 TLQTNVYSESSGEDPIRIANLLYIKY 720


>gi|380692804|ref|ZP_09857663.1| putative alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
          Length = 709

 Score =  264 bits (674), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 169/570 (29%), Positives = 267/570 (46%), Gaps = 87/570 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL   GGPLP  W  +Q VLQKKIL R+ E GM PV   F G VP++L+  +P A + 
Sbjct: 188 MGNLENIGGPLPDEWFKEQTVLQKKILARMREYGMKPVFQGFFGMVPSSLKEKYPEAHLV 247

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
           + G W S++  P      +LD  DPLF ++ + +  +  K YG+ + ++  D F E   T
Sbjct: 248 EQGLWNSLQRPP------VLDPADPLFEQMAKVWYTEYEKLYGK-ADLFGGDLFHEGGKT 300

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D    ++     + + M+  + DA W++Q WL +       P+ K LL  +     +
Sbjct: 301 GGID----VTDAARRVQTAMKQYNPDATWVIQAWLGN-------PK-KELLAGLDRKHTL 348

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA 238
           ++DL AE    W   K F G P++W  + N+  NI ++G LD+IA GP++ R     + +
Sbjct: 349 IVDLAAEFWDNWRKRKGFDGFPWLWSHISNYGANIGLHGRLDAIATGPIDGRKDPEASPS 408

Query: 239 -----------------------------------WINQYSVRRYGRSVPAIQDAWNVLY 263
                                              W+ +YSVRRYG     ++ AW + +
Sbjct: 409 MKGTSSTPEGIEVNPVVFDLLNEMRWRSEYLDIDTWLKEYSVRRYGAEDENLKKAWIIFH 468

Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
            T Y    G    +  V  A P +    I+                       S++    
Sbjct: 469 RTAYGTYSGHRRPSESVFCAPPSLKRDKITA----------------------SAWSQCR 506

Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
           ++Y      + + LF+ S + L  ++TY+YD +D  RQ LA    E + N+++AY+  D 
Sbjct: 507 IFYDPDLFAQGVGLFLQSADHLKQTSTYQYDAVDFVRQYLADLGREAYYNLVDAYRAKDT 566

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
                 S RFL+L++D + LL+ H  F +G WL+ A+  ++  E +  YE NAR  I  W
Sbjct: 567 KQFDYWSERFLQLIKDQNELLSTHKCFFVGRWLDMARSKSKQPELQDLYEHNARMLIGTW 626

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                E  S +RDY +K W GLL+DYY PR   Y  Y+  +LE G    + +      ++
Sbjct: 627 ----TETLSPVRDYAHKEWGGLLKDYYLPRWTNYIAYLKGTLE-GRSLTVPN----SFQV 677

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              W N  N Y +E+  D + T++ +Y KY
Sbjct: 678 EKAWVNAHNKYVLETGVDPVETAKRMYRKY 707


>gi|71001188|ref|XP_755275.1| alpha-N-acetylglucosaminidase [Aspergillus fumigatus Af293]
 gi|66852913|gb|EAL93237.1| alpha-N-acetylglucosaminidase, putative [Aspergillus fumigatus
           Af293]
 gi|159129357|gb|EDP54471.1| alpha-N-acetylglucosaminidase, putative [Aspergillus fumigatus
           A1163]
          Length = 756

 Score =  263 bits (671), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 161/520 (30%), Positives = 262/520 (50%), Gaps = 69/520 (13%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G WGG LP SW+D Q  LQKKI+ R+ ELGM PVLPAF+G VP A+  V P+A +  
Sbjct: 205 NIQGSWGGELPYSWIDSQFELQKKIVRRMVELGMTPVLPAFTGFVPRAVSRVLPNATVVN 264

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
              W   + D R+     L+  DP F+ + R+FI++Q + YG  +HIY  D ++EN P  
Sbjct: 265 GSRW--EEFDERYTSDTFLEPFDPSFMRLQRSFIKKQQQAYGNITHIYTLDQYNENAPYS 322

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-LVV 179
              +Y+ ++    +  ++S D +AVWLMQGWLF S   FW   ++KA L+ V + + ++V
Sbjct: 323 GDLDYLHNVTHNTWLSLKSADPNAVWLMQGWLFYSSSGFWTDERVKAYLSGVEVDQDMLV 382

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLF+E +P W  ++ +YG P+IWC LH++ GN+ +YG + ++     +A  + ++ +  
Sbjct: 383 LDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGLYGQVMNVTVNATQALAASDSLVGF 442

Query: 238 --------------------AWINQ---------------YSVRRYGRSVP-AIQDAWNV 261
                               AW  Q               YS    G +VP  +  AW++
Sbjct: 443 GLTMEGQEGNEIMYDLLLDQAWSRQPIDTDHYFHNWAKTRYSSGVRGSAVPEELYQAWDI 502

Query: 262 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
           L  T YN T+  +      I    ++ PSI  +      +              T SYD 
Sbjct: 503 LRITAYNNTNLTSTAVSKSIF---ELQPSISGLLNRTSHH------------PTTVSYDP 547

Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
             L     +  R ++   +    L +   + YD++D+TRQ ++     ++ N++  YQ  
Sbjct: 548 AAL----VQAWRLMDSAASKAPSLWSQPAFLYDMVDITRQVMSNAFIPVYTNLVSTYQAG 603

Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
            +  V       ++L+ D+D +L+ +D F L  W++SA+   +N+ +   YE+NAR Q+T
Sbjct: 604 GS--VSTDGSNLIQLLRDLDSVLSTNDNFRLSTWIQSARSWVRNDTEADFYEYNARNQVT 661

Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 481
           +W    +     + DY +K W GL+  YY PR   +  Y+
Sbjct: 662 LWGPKGE-----INDYASKQWGGLVSSYYIPRWQKFLNYL 696


>gi|237717696|ref|ZP_04548177.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
 gi|229453015|gb|EEO58806.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
          Length = 729

 Score =  261 bits (667), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 167/568 (29%), Positives = 264/568 (46%), Gaps = 71/568 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GW  PLP+ WL  Q  LQ++I+ R  E  M PVLPAF+G+VPAAL+ V+P+ K T
Sbjct: 196 MCNLDGWQSPLPKEWLSSQAALQEQIVAREREFNMRPVLPAFAGHVPAALKRVYPNIKTT 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W       R  CT+ L+  D L+  I + ++ +Q + YG T+HIY  D F+E  PP
Sbjct: 256 RVSEWGGFADQYR--CTF-LNPMDSLYAIIQKEYLTEQTRLYG-TNHIYGIDPFNEIDPP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
               + +  +   IY  + + D +AVWL   WLF  D   W  P++K+ L SVP  +L++
Sbjct: 312 SWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYADIKHWTTPRIKSYLRSVPQDRLIL 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------- 223
           LD F E   IW  +  ++G PY+WC L NF GN  + G ++ ++                
Sbjct: 372 LDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLSGPVNLVSERLADALKNGGSNLKG 431

Query: 224 ----------------FGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                           F   +A     T   W  + + RR G+  P  + AW +L + VY
Sbjct: 432 VGSTLEGIDLNQFMYEFVLDKAWNGGQTDKEWFFKLADRRIGKISPEARKAWEILANKVY 491

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
                              V P+   V +G   N  +P      LK        P + Y 
Sbjct: 492 -------------------VQPA--QVGQGTLTN-ARP-----CLKGNGHWTTKPTIEYQ 524

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
             +++ A  L ++  +     ++Y +DL+++ RQ L  Y N +      AY+  D   + 
Sbjct: 525 PKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAGDIPMMK 582

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
               +  E++ D+D L++CH  F L  W+  A+ +  +   +  YE NAR+ IT+W D+ 
Sbjct: 583 NRGNKMREILADLDKLVSCHPTFSLHKWITDARDMGHDAASKNYYEMNARSLITIWGDSY 642

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
                 L DY N+ W+GL   YY  R   +   +IE+ E    F  +++  +     N+W
Sbjct: 643 H-----LTDYANRSWAGLTNQYYSVRWDHFINEVIEAAEKKKNFDEEEFFNQSRMYENEW 697

Query: 508 QNGRNVYPVESNGDALITSQWLYNKYLQ 535
            N  N       GD +  ++ +Y KY +
Sbjct: 698 VNPSNRISYNEGGDGIKLARQIYKKYAK 725


>gi|299149196|ref|ZP_07042257.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
 gi|298512863|gb|EFI36751.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
          Length = 738

 Score =  261 bits (667), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 167/568 (29%), Positives = 264/568 (46%), Gaps = 71/568 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GW  PLP+ WL  Q  LQ++I+ R  E  M PVLPAF+G+VPAAL+ V+P+ K T
Sbjct: 205 MCNLDGWQSPLPKEWLSSQAALQEQIVAREREFNMRPVLPAFAGHVPAALKRVYPNIKTT 264

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W       R  CT+ L+  D L+  I + ++ +Q + YG T+HIY  D F+E  PP
Sbjct: 265 RVSEWGGFADQYR--CTF-LNPMDSLYAIIQKEYLTEQTRLYG-TNHIYGIDPFNEIDPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
               + +  +   IY  + + D +AVWL   WLF  D   W  P++K+ L SVP  +L++
Sbjct: 321 SWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYADIKHWTTPRIKSYLRSVPQDRLIL 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------- 223
           LD F E   IW  +  ++G PY+WC L NF GN  + G ++ ++                
Sbjct: 381 LDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLSGPVNLVSERLADALKNGGSNLKG 440

Query: 224 ----------------FGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                           F   +A     T   W  + + RR G+  P  + AW +L + VY
Sbjct: 441 VGSTLEGIDLNQFMYEFVLDKAWNGGQTDKEWFFKLADRRIGKISPEARKAWEILANKVY 500

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
                              V P+   V +G   N  +P      LK        P + Y 
Sbjct: 501 -------------------VQPA--QVGQGTLTN-ARP-----CLKGNGHWTTKPTIEYQ 533

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
             +++ A  L ++  +     ++Y +DL+++ RQ L  Y N +      AY+  D   + 
Sbjct: 534 PKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAGDIPMMK 591

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
               +  E++ D+D L++CH  F L  W+  A+ +  +   +  YE NAR+ IT+W D+ 
Sbjct: 592 NRGNKMREILADLDKLVSCHPTFSLHKWITDARDMGHDAASKNYYEMNARSLITIWGDSY 651

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
                 L DY N+ W+GL   YY  R   +   +IE+ E    F  +++  +     N+W
Sbjct: 652 H-----LTDYANRSWAGLTNQYYSVRWDHFINEVIEAAEKKKNFDEEEFFNQSRMYENEW 706

Query: 508 QNGRNVYPVESNGDALITSQWLYNKYLQ 535
            N  N       GD +  ++ +Y KY +
Sbjct: 707 VNPSNRISYNEGGDGIKLARQIYKKYAK 734


>gi|357622373|gb|EHJ73879.1| putative alpha-N-acetyl glucosaminidase [Danaus plexippus]
          Length = 780

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 163/534 (30%), Positives = 257/534 (48%), Gaps = 79/534 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+HGWGGPLPQSW D+Q  +Q+ +   +++LGM PV PAF+G+VP A + +FP+    
Sbjct: 206 MGNVHGWGGPLPQSWHDRQKQIQEVVTDLMFKLGMIPVFPAFNGHVPKAFEKIFPNTTFH 265

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W   K D  +CC   +D  +P F  I + F+ +     G +SHIY  D F+E    
Sbjct: 266 PVETW--NKFDEDYCCNLFVDPREPDFKMISKMFMREITAGLG-SSHIYTADPFNEIKIQ 322

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
             S   +     AI+S +   D DAVWL+Q W+F ++P  W   ++ + L SVP G+++V
Sbjct: 323 PWSTSLVVETAKAIFSSISEYDKDAVWLVQNWMFVHNPLLWPLKRVNSFLTSVPNGRMLV 382

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDL +E  P +   + +YG P+IW MLHNF G + M+G   +I     E R  EN+TM  
Sbjct: 383 LDLQSEQWPQYDLYQMYYGQPFIWSMLHNFGGTLGMFGNTKTINKDVYEVRKRENSTMVG 442

Query: 239 ---------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                            W++ Y+ RRYG +  +I   W  L  +
Sbjct: 443 IGLTPEGINQNYVIYDLMLESAWRKGPVPDLEEWVSDYAERRYGCNATSI--GWKYLLRS 500

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VYN T                     ++   GKY           V+    S    P  W
Sbjct: 501 VYNFTG--------------------LNRIRGKY-----------VMTRRPSFNIRPWAW 529

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
           Y   ++  AL+ F+   N   +++ + +DL+D+TRQAL     ++++N+      N  + 
Sbjct: 530 YKGHDLFEALKNFVYVQNPACSTSGFLHDLVDVTRQALQYKIEQIYMNLQNDRYSN--YM 587

Query: 386 VFQLS-RRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
           VF  +   F++ + DM  +LA    F +  WL SA+ ++    +   Y++NAR QIT+W 
Sbjct: 588 VFNYTISSFIDAMTDMQNILATSSDFKITSWLSSARAISNLPLESSLYDFNARNQITLWG 647

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 498
            N +     + DY  K W+ L + YY PR +I+    +++    + F  K  +R
Sbjct: 648 PNGE-----ISDYACKQWAELFKYYYIPRWSIFLSMALDAKTRNEPFDEKGAQR 696


>gi|449541595|gb|EMD32578.1| glycoside hydrolase family 89 protein [Ceriporiopsis subvermispora
           B]
          Length = 752

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 166/550 (30%), Positives = 275/550 (50%), Gaps = 72/550 (13%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G WGG LP  W++ Q  LQK+IL R+ ELGM PVLPAF+G VP A+  V  +A I  
Sbjct: 198 NIQGSWGGELPMQWVNDQFALQKQILARMTELGMTPVLPAFTGFVPRAMSTVHSNASIVN 257

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTPP 120
              W +    P       L+  DPLF  + ++FI +Q + YG   SHIY  D ++EN P 
Sbjct: 258 GSQW-APGFPPSLTNVSFLEPFDPLFATLQKSFIAKQQEAYGANISHIYTLDQYNENNPF 316

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLG-KLV 178
             +  Y+SS+    ++ +++ D DAVW++QGWL FS + FW   +++A L  VP    ++
Sbjct: 317 SGNLSYLSSISEGTFTSLRAADPDAVWMLQGWLFFSSEAFWTNERIEAYLGGVPTNDSMI 376

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM- 237
           VLDL++E +P W+ +  ++G  ++WC LH++ G I + G LD+I  GP+ A  S  ++M 
Sbjct: 377 VLDLYSEAQPQWNRTSSYFGKQWVWCELHDYGGTIGLEGNLDAITTGPIAALNSPGSSMK 436

Query: 238 ----------------------AW----------INQYSVRRY-GRSVP-AIQDAWNVLY 263
                                 AW          +  +  RRY    +P A Q+AW +L 
Sbjct: 437 GMGLTMEGQEGNEIVYDLLLDQAWSSSPINIASYVKGWVSRRYLVEPLPSAAQEAWRILS 496

Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
            TVYN  D     ++  I    +++P +  +           V++  +L +    YD   
Sbjct: 497 TTVYNNQD---PNSQSTIKNIYELEPVLTGL-----------VNRTGILPT-VIPYD--- 538

Query: 324 LWYSTSEVIRALELFI---ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
              + S ++ AL+L +   A    LS    + +D++D++RQ L+    + +  +I+ Y  
Sbjct: 539 ---TNSTIVPALQLLVKAKAQNAALSTVPEFVHDVVDVSRQLLSNRFIDAYTALIDTYNN 595

Query: 381 ND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-EWNAR 437
            +  +  V +  +  + ++  +D LLA ++ FLL  W+  A+ L+  +E    Y E+NAR
Sbjct: 596 TNVTSDAVIRAGQPLMTILSQLDALLATNENFLLSSWIAQARNLSHGDESYAAYLEYNAR 655

Query: 438 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR 497
            QIT+W  + +     + DY +K W+GL+  YY  R   +  Y+  +      F    + 
Sbjct: 656 NQITLWGPDGE-----INDYASKAWAGLISTYYAARWQTFIDYLASTKRLARPFDTSAFS 710

Query: 498 REWIKLTNDW 507
            + I L  +W
Sbjct: 711 NQMILLGQEW 720


>gi|333031143|ref|ZP_08459204.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
 gi|332741740|gb|EGJ72222.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
          Length = 723

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 176/567 (31%), Positives = 255/567 (44%), Gaps = 79/567 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP+ WLD Q  LQK+IL R  EL M PVLPAF+G+VP+ L+++FP A I 
Sbjct: 197 MANIDSWNGPLPKEWLDHQSDLQKQILKRERELNMKPVLPAFAGHVPSELKHLFPEADIQ 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C + L+  DPLF +I R F+E+Q + +G T HIY  D F+E  PP
Sbjct: 257 HLGKWAGFADKYR--CNF-LNPNDPLFAKIQRLFLEEQTRLFG-TDHIYGVDPFNEVDPP 312

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
              PEY+  + A +Y  +   D  A WL   WLF +    W  P+++ALL  VP  +L +
Sbjct: 313 SWEPEYLKKVAADMYRTLTDVDPKAKWLQMTWLFYHGKKKWTAPRIEALLTGVPQDELYL 372

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
           LD   E   +W T+  F+G PYIWC L NF GN  + G                      
Sbjct: 373 LDYHCENVELWKTTDYFHGQPYIWCYLGNFGGNTTITGNVKESGQRLENTLINGGNNFKG 432

Query: 219 -------LDSIAFGPVE-----ARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                  LD + F P E     A T      +W+   + R  G+   A ++AW +L++ V
Sbjct: 433 IGSTLEGLDVMQF-PYEYIFDKAWTFNMDDNSWVENLADRHLGKKSEAYREAWKILFNDV 491

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y     +       +   P+  P +            KP  +     ++  + D   +W 
Sbjct: 492 YVQVPKS-------LGVLPNFRPEM-----------SKPNKRTV---NDYKNKDLVKVWA 530

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
              EV                 + Y  DLI + RQ L  Y   +     + YQ  D  G+
Sbjct: 531 KLLEVKEC------------TRDAYIIDLITVGRQVLGNYFLVVKNEFDQMYQFKDLPGL 578

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
                +  E++ D++ L A H+   L  W+  A+ L    E +  YE NAR  IT W   
Sbjct: 579 ESRGAKLREILNDLENLTAFHNHCTLEKWISDARALGNTIELKDYYEKNARNLITTW--- 635

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
                  L DY ++ WSGL++DYY  R  +Y   + E+L+    F   +   +   L   
Sbjct: 636 ----GGSLNDYASRTWSGLIKDYYAKRWNLYIDSVTEALKENKKFNQSELNEKLNILEEA 691

Query: 507 WQNGRNVYPVESNGDALITSQWLYNKY 533
           W N          GD L  S++L++KY
Sbjct: 692 WVNKVETVTSYEQGDILELSKYLFDKY 718


>gi|281210062|gb|EFA84230.1| hypothetical protein PPL_03307 [Polysphondylium pallidum PN500]
          Length = 744

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 172/565 (30%), Positives = 272/565 (48%), Gaps = 84/565 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GWGG LPQSW+  Q  LQ KIL R+ E GM+PV P F+G+VP A +  +PSA I 
Sbjct: 214 MGNLDGWGGVLPQSWIKGQHELQIKILKRMSEYGMSPVFPGFAGHVPVAFKQFYPSANIV 273

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHI--YNCDTFDENT 118
           +L +W    +      T  L  TDP++  +   F + Q + YG  + I  ++ D F+E  
Sbjct: 274 ELPSWHGFNA------TNHLLTTDPMYDIVADRFYQVQNEIYGAYAKIDYFSIDPFNELI 327

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
           PP +S ++++   + I++ +   + D+ W++Q W  +   FW   Q+ + L  VP+G+L+
Sbjct: 328 PPSNSSQFLNECSSRIFNAINRFNPDSTWVLQNWFLN-SAFWGDGQVASFLGGVPIGRLI 386

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA 238
           VLDL++E+KP+W+ +  + G  +IW MLHNF G   + G +  IA  P+EA+ S  T + 
Sbjct: 387 VLDLWSELKPLWNRTANYQGHKWIWNMLHNFGGRPTISGRMPIIANEPLEAKASSPTMVG 446

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W++ Y  RRYG ++P ++  W +L +TV
Sbjct: 447 IGLTPEAIEQNVIVYDLMSEMGWRSRSFDLNLWVDAYVTRRYGVNLPNLKPVWKMLAYTV 506

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y                 P+  P+          NY   ++K+  L  +        L+Y
Sbjct: 507 YFS---------------PNRSPA----------NY---IAKKPSLDFQLG------LYY 532

Query: 327 STSEVIRA-LELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
           +   ++ A  EL       + +S TYRYDL ++T QAL+ Y N     + ++Y  +D   
Sbjct: 533 NPVVIVDAWRELLAVDSTIVRSSETYRYDLAEITLQALSNYFNGNLKQLYQSYYASDFQ- 591

Query: 386 VFQLSRRFLEL-VEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
            FQ +R+     +  MD +        LG W   A++ A +  + + YE+NAR QIT+W 
Sbjct: 592 TFQSARQNCSFALRAMDAVADTVQLLKLGKWTADARKWATDNNERELYEYNARNQITLWG 651

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                      DY NK+WSGL+ DYY PR  I+F+++  ++     F         +   
Sbjct: 652 WKDMGNP----DYANKWWSGLIADYYFPRWQIFFEHLEHAIFDKSKFNEHSLAVNTMLHE 707

Query: 505 NDWQNGRNVYP--VESNGDALITSQ 527
             W    N+YP  V SN D    S+
Sbjct: 708 ERWNKQTNIYPSDVNSNDDVHTVSK 732


>gi|449541596|gb|EMD32579.1| glycoside hydrolase family 89 protein [Ceriporiopsis subvermispora
           B]
          Length = 754

 Score =  258 bits (658), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 159/552 (28%), Positives = 276/552 (50%), Gaps = 77/552 (13%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G WGG LP  W++ Q  LQK+IL R+ ELGM P LPAF+G VP A+  ++P+A I  
Sbjct: 201 NIQGSWGGALPMQWVNDQFALQKQILTRMTELGMTPALPAFTGFVPRAMSTLYPNASIVN 260

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTPP 120
              W    +         L+  DPLF  + ++FI +Q + YG   +HIY  D ++EN P 
Sbjct: 261 GSAWSGFPAS--LTNVSFLEPFDPLFSTLQKSFITKQQQAYGTNVTHIYTLDQYNENNPF 318

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLG-KLV 178
             +  Y+SS+ A  ++ +++ D DA+W++QGWL FS + FW   +++A L  VP    ++
Sbjct: 319 SGNISYLSSVSAGTFASLRAADPDAIWMLQGWLFFSSETFWTDERIQAYLGGVPTNDSMI 378

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM- 237
           VLDL++E +P W+ +  ++G  ++WC LH + GN+ + G L++I  GP+ A +S+ ++M 
Sbjct: 379 VLDLYSEAQPQWNRTSSYFGKQWVWCELHGYGGNMGLEGNLNAITAGPIAALSSQGSSMK 438

Query: 238 ----------------------AW----------INQYSVRRYG-RSVP-AIQDAWNVLY 263
                                 AW          +  +  RRY    +P A Q+AW +L 
Sbjct: 439 GMGLTMEGQEGNEIVYDVLLDQAWSSAPIDIASYVKSWVARRYTVEPLPSAAQEAWQILS 498

Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
            TVYN  D     ++  I +  +++P++  +                       +  HP 
Sbjct: 499 TTVYNNQD---PNSQATIKSIYELEPTLTGLVN--------------------RTGHHPT 535

Query: 324 L--WYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 378
           L  + + + V+ AL+L + +  +   L+A   + YD +D++RQ L+    + +  +++ Y
Sbjct: 536 LIPYDTNTTVVPALQLLVKAKEQNAALAAIPEFVYDAVDVSRQLLSNRFIDAYTGLVDTY 595

Query: 379 QLNDA--HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-EWN 435
              +A    V +  +  + ++  +D LLA ++ FLL  W+  A+  +  +E    Y E+N
Sbjct: 596 NNANATSDAVVRAGQPLMVILSQLDALLATNENFLLSSWIAQARNWSHGDESYAAYLEYN 655

Query: 436 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD 495
           AR Q+T+W  + +     + DY +K W+GL+  YY  R   +  Y+  +      F    
Sbjct: 656 ARNQVTLWGPDGE-----INDYASKAWAGLISTYYSSRWQTFVDYLASTKRLSRPFDSSA 710

Query: 496 WRREWIKLTNDW 507
           +  + I L   W
Sbjct: 711 FSSQMILLGQQW 722


>gi|393785791|ref|ZP_10373937.1| hypothetical protein HMPREF1068_00217 [Bacteroides nordii
           CL02T12C05]
 gi|392661410|gb|EIY54996.1| hypothetical protein HMPREF1068_00217 [Bacteroides nordii
           CL02T12C05]
          Length = 727

 Score =  257 bits (657), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 170/566 (30%), Positives = 261/566 (46%), Gaps = 77/566 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+ GW GPLP  WLD Q+ LQKKIL R  EL M PVLPAF+G+VPAAL+ ++P A I 
Sbjct: 198 MANIDGWNGPLPMEWLDNQVELQKKILARERELNMKPVLPAFAGHVPAALKRIYPEANIQ 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R    Y L+  +PLF  I + F+++Q + +G T HIY  D F+E  PP
Sbjct: 258 HLGKWAGFADTYR---CYFLNPEEPLFATIQKHFLQEQTRLFG-TDHIYGVDPFNEVDPP 313

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              PEY+S + + +Y  + + D  A W+   W+F +D   W  P++KALL  VP  K+ +
Sbjct: 314 SWEPEYLSQVSSDMYRTLTAADPKAEWMQMTWMFYHDRKDWTAPRIKALLTGVPQDKMFL 373

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG--------ILDSIAFGPVEART 231
           LD   E   +W  ++ F+G PYIWC L NF GN  + G        + +++  G    R 
Sbjct: 374 LDYHCENVELWKNTEHFHGQPYIWCYLGNFGGNTTLTGNVKESGDRLDNALINGGSNLRG 433

Query: 232 SENT------------------------TMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
             +T                          AW+   + R  G     +++AW++L++ +Y
Sbjct: 434 IGSTLEGLDVMQFPYEYIFEKAWDLNLDNEAWLQNLADRHAGTVSQPVREAWDILFNQIY 493

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
                         V  P         T G   NY +PV  +   ++           YS
Sbjct: 494 --------------VQVPK--------TLGVLPNY-RPVMNKPNRRTVID--------YS 522

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
            + +++A E  + + +     +  R D+I + RQ L  Y   +  +    Y + D  G+ 
Sbjct: 523 NATLLQAWEKLLQATD--CNRDALRLDIITVGRQLLGNYFLIVKDDFDRMYTVKDLPGLK 580

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
             +    E++ D+D L A H    L  WL  A+ L    E +  YE NAR  IT W    
Sbjct: 581 ARAAEMKEILNDLDRLNAFHSRCALDKWLADARALGTTPEVKDYYEKNARNLITTW---- 636

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
                 L DY ++ W+GL++DYY  R  +Y   +I ++E    F  K          + W
Sbjct: 637 ---GGSLNDYASRTWAGLIKDYYSKRWDMYMDAVISAVEGNREFDQKKLDESIKNFEDAW 693

Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
            +  +   V   G+ +  +++L  KY
Sbjct: 694 VDSTDPILVAPQGELMQYARFLLQKY 719


>gi|336417192|ref|ZP_08597519.1| hypothetical protein HMPREF1017_04627 [Bacteroides ovatus
           3_8_47FAA]
 gi|423297818|ref|ZP_17275878.1| hypothetical protein HMPREF1070_04543 [Bacteroides ovatus
           CL03T12C18]
 gi|335936512|gb|EGM98438.1| hypothetical protein HMPREF1017_04627 [Bacteroides ovatus
           3_8_47FAA]
 gi|392664455|gb|EIY57993.1| hypothetical protein HMPREF1070_04543 [Bacteroides ovatus
           CL03T12C18]
          Length = 727

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 166/568 (29%), Positives = 261/568 (45%), Gaps = 71/568 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GW  PLP+ WL  Q  LQ++I+ R  E  M PVLPAF+G+VPAAL+ V+P+ K +
Sbjct: 194 MCNLDGWQSPLPKEWLSSQAELQEQIVAREREFNMQPVLPAFAGHVPAALKRVYPNIKTS 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W       R  CT+L +  D L+  I + ++ +Q + YG T+HIY  D F+E  PP
Sbjct: 254 RVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTEQTRLYG-TNHIYGIDPFNEIDPP 309

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
               + +  +   IY  + + D  A+WL   WLF  D   W  P++K+ L SVP  KL++
Sbjct: 310 SWDTDSLGMMAKHIYESVAAVDPKAIWLQMTWLFYADIKHWTTPRIKSYLRSVPQDKLIL 369

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------- 223
           LD F E   IW  +  ++G PY+WC L NF GN  + G +  ++                
Sbjct: 370 LDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLSGPVKLVSERLADALKNGGSNLKG 429

Query: 224 ----------------FGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                           F   +A  S  T   W  + + RR G+  P  + AW +L   VY
Sbjct: 430 VGSTLEGIDLNQFMYEFVLDKAWNSGQTDKEWFLKLADRRTGKVSPEARKAWEILADKVY 489

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
                              + P+   V +G   N  +P      LK        P + Y 
Sbjct: 490 -------------------IQPA--QVGQGTLTN-ARPC-----LKGNGHWTTKPTIEYQ 522

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
             +++ A  L +   +     ++Y +DL+++ RQ L  Y N +      AY+  D   + 
Sbjct: 523 PKDLVEAWRLLLLVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAGDIMMMK 580

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
               +  E++ D+D L++CH  F L  W+  A+ +  +   +  YE NAR+ IT+W D+ 
Sbjct: 581 NRGDKMREILADLDKLVSCHPTFSLNKWITDARDMGHDATSKNYYEMNARSLITIWGDSY 640

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
                 L DY N+ W+GL   YY  R   +   +I+++E    F  + +  E     N+W
Sbjct: 641 H-----LTDYANRSWAGLTNQYYSVRWDRFINEVIKAVEKKKAFDEEVFFNESRMYENEW 695

Query: 508 QNGRNVYPVESNGDALITSQWLYNKYLQ 535
            N  N       GD +  ++ +Y KY +
Sbjct: 696 VNPSNRINYNEGGDGIKLARQIYKKYAK 723


>gi|126307952|ref|XP_001365931.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Monodelphis
           domestica]
          Length = 481

 Score =  256 bits (654), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 120/238 (50%), Positives = 160/238 (67%), Gaps = 4/238 (1%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP SW  +Q  LQ +IL R+   GM PVLPAF+G++P A   VFP A +T
Sbjct: 205 MGNLHTWGGPLPSSWDLKQSYLQYQILERMRSFGMKPVLPAFAGHIPKAFTRVFPQANVT 264

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L NW        + C+YLL   DPLF  +G  F+ +  KE+G T HIY+ D F+E  PP
Sbjct: 265 KLDNWIDFNCT--YSCSYLLAPEDPLFPVVGSLFLRELAKEFG-TDHIYSADIFNEMDPP 321

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
             +P Y+++  AA+Y  M + D DAVWL QGWLF   P FW+PPQMKA+L +VP G+ ++
Sbjct: 322 SSNPAYLAATTAAVYEAMVAVDVDAVWLFQGWLFQNHPDFWKPPQMKAVLEAVPRGRFLI 381

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           LDLFAE +P++S +  FYG P+IWCMLHNF GN  ++G+LD++  GP  AR   N+T+
Sbjct: 382 LDLFAESQPVYSRTNSFYGQPFIWCMLHNFGGNHGLFGVLDAVNRGPSTARLFPNSTI 439


>gi|195454475|ref|XP_002074254.1| GK18384 [Drosophila willistoni]
 gi|194170339|gb|EDW85240.1| GK18384 [Drosophila willistoni]
          Length = 743

 Score =  256 bits (654), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 154/508 (30%), Positives = 253/508 (49%), Gaps = 76/508 (14%)

Query: 32  ELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIG 91
           ELG++  LPAF+G+VP AL+ +FP A  T+   W    +   +CC   ++  +PLF ++ 
Sbjct: 262 ELGISVALPAFAGHVPRALRRIFPQANFTETERWNRFPN--AYCCDLFVEPQEPLFRQLA 319

Query: 92  RAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 151
             F+ +  + YG ++HI+ CD F+E  PPV   +++ S  AAIY+ M+  D  A+WL+QG
Sbjct: 320 TTFLRRVTQRYG-SNHIFFCDPFNELEPPVSQADFMRSTAAAIYASMREVDPKAIWLLQG 378

Query: 152 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 211
           W+F  + FW    ++A L +VP G L+VLDL +E  P +  +K +YG P++WCMLHNF G
Sbjct: 379 WMFVKNIFWTDELIEAFLTAVPQGNLLVLDLQSEQFPQYQRTKSYYGQPFVWCMLHNFGG 438

Query: 212 NIEMYGILDSIAFGPVEARTSENTTM---------------------------------A 238
            + M G ++ +  G   AR   N++M                                  
Sbjct: 439 TLGMLGSVELVNSGMDLARQMPNSSMVGAGITPEGIGQNYVMYSFALERGWSDRKLDSAG 498

Query: 239 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 298
           W   +++ RYG     +  AW +L  +VY        + +  I   P ++ S        
Sbjct: 499 WFTHFALTRYGVQDERLNQAWQLLRTSVYTFHGLQKMRGKYTITRRPAINLS-------- 550

Query: 299 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT----YRYD 354
                                  P  WY+ + V+ A +L +++ + +   +     Y++D
Sbjct: 551 -----------------------PFTWYNVTHVLEAWQLMLSARSIIPLDDNRYDIYQHD 587

Query: 355 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 414
           L+D+TRQ L   A++L++N+  +Y+         L  + LEL++D++ +L     FLLG 
Sbjct: 588 LVDITRQYLQITADQLYVNLNSSYRKRQLARFVYLGNKLLELLDDLERILGSGSNFLLGT 647

Query: 415 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 474
           WLE+AK LA   E +  +E+NAR QIT W  N +     + DY  K WSG++ DYY PR 
Sbjct: 648 WLEAAKLLAPTVEDQSNFEFNARNQITTWGPNGE-----ILDYACKQWSGMISDYYRPRW 702

Query: 475 AIYFKYMIESLESGDGFRLKDWRREWIK 502
           A +   +  +L+S   F    +++  +K
Sbjct: 703 ARFLDDVTLALQSNQPFNASAYKQHVLK 730


>gi|121698957|ref|XP_001267859.1| alpha-N-acetylglucosaminidase, putative [Aspergillus clavatus NRRL
           1]
 gi|119396001|gb|EAW06433.1| alpha-N-acetylglucosaminidase, putative [Aspergillus clavatus NRRL
           1]
          Length = 671

 Score =  255 bits (652), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 157/485 (32%), Positives = 246/485 (50%), Gaps = 70/485 (14%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G W G LP SW+D Q  LQKKI+ R+ ELGM PVLPAF+G VP A+  V P A +  
Sbjct: 197 NIQGSWHGELPYSWIDAQFELQKKIVRRMVELGMTPVLPAFTGFVPRAITRVLPDATVVN 256

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
              W     D ++     L+  DP F  + R+FI +Q + YG  +HIY  D ++EN P  
Sbjct: 257 GSRWSGF--DEKYTNDTFLEPFDPNFARLQRSFIHKQQQAYGNITHIYTLDQYNENDPYS 314

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-LVV 179
             PEY+ ++    +  ++S D DA+W+MQGWLF S   FW   ++ A L+ V   + ++V
Sbjct: 315 GDPEYLRNVTHNTWQSLKSADPDAIWMMQGWLFYSNSDFWTDERVHAYLSGVETDEDMLV 374

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
           LDLF+E +P W  ++ +YG P+IWC LH++ GN+ +YG + +I     +A    ++ +  
Sbjct: 375 LDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGLYGQVMNITVNATDALAVSDSLVGY 434

Query: 239 -----------------------------------WIN-QYSVRRYGRSVPAIQDAWNVL 262
                                              W+  +YS  R       +  AW++L
Sbjct: 435 GLTMEGQEGNEIVYDLLLDQAWSSRPIDTDSYFHDWVKARYSTARRHNVPHELYQAWDIL 494

Query: 263 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 322
             T YN T+ AT            V  SI  +         +P  K   L ++T    HP
Sbjct: 495 RTTAYNNTNLATATA---------VSKSIFEL---------QP--KLTGLVNQTGH--HP 532

Query: 323 HLW-YSTSEVIRALELFIASGNELSA---SNTYRYDLIDLTRQALAKYANELFLNIIEAY 378
            +  Y  S ++R+ +L +++ +E +A      +RYD++D+TRQ +A     ++LN+   Y
Sbjct: 533 TVVNYEASSLVRSWKLMVSAASESTALWSHPAFRYDMVDVTRQVMANAFIPMYLNVTSTY 592

Query: 379 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 438
           Q      + Q     + L+ D+D +L+ +D F L  W+ESA+  A+N+ +   YE+NAR 
Sbjct: 593 Q--KGGPISQQGDSLIRLLRDLDAVLSTNDNFRLATWIESARTWARNDTEADFYEYNARN 650

Query: 439 QITMW 443
           QIT+W
Sbjct: 651 QITLW 655


>gi|346324333|gb|EGX93930.1| alpha-N-acetylglucosaminidase, putative [Cordyceps militaris CM01]
          Length = 751

 Score =  254 bits (650), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 160/546 (29%), Positives = 269/546 (49%), Gaps = 80/546 (14%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G WGG LPQ+W++ Q  LQKKI+ R+ ELGM P+LPAF G VP  +  V+P+  + +
Sbjct: 203 NIQGSWGGDLPQAWIEDQFELQKKIVKRMIELGMTPILPAFPGFVPENITRVWPNVSLAE 262

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
              W       R+     +   DP F E+ +AF+ +Q + YG  +  +  D F+EN P  
Sbjct: 263 SPIWSGFSG--RFTADKYITPYDPHFAELQKAFLTKQNEAYGNVTSFWTLDQFNENKPAS 320

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK-LVV 179
              +Y+ ++    +  +++ D  AVW+MQGWLF+ D  +W   ++K+ L+ VP+ + +++
Sbjct: 321 GELDYLKNVSHNTWQTLKAADPSAVWVMQGWLFASDKTYWIDDRVKSFLDGVPVNEDMLL 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA---------- 229
           LDLFAE  P W  ++ FYG P+IWC LH++ GN+ +YG ++++    VEA          
Sbjct: 381 LDLFAESTPQWQRTESFYGKPWIWCQLHDYGGNMGLYGQIENVTKNAVEAVQTSKSIVGF 440

Query: 230 ----------------------RTSENTTMAWINQYSVRRYG---RSVPA-IQDAWNVLY 263
                                 R     T  + + +   RYG   + +P  +  AW+ + 
Sbjct: 441 GLSMEGQEGNEIMYDLLLDQAWRKEAIETDKYFSDWVTVRYGADHKEIPENLYTAWDKVR 500

Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP- 322
            TVYN TD +      V  +  ++ PSI  +                       +  HP 
Sbjct: 501 STVYNNTDSSVTA---VTKSIFELAPSISGLV--------------------NRTGHHPT 537

Query: 323 HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 379
            + Y T  +I A     ++G++   L  +  YRYDL D TRQ LA      +  ++E Y+
Sbjct: 538 KITYDTKTLISAWNDMFSAGDQARWLFDNEAYRYDLTDWTRQVLANAFEATYNKLVEKYK 597

Query: 380 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 439
            N+  GV     R   +++ MD +L  +  F L  W+++A++     E    +E+NAR Q
Sbjct: 598 SNNTKGVKCAGDRLQAILQTMDQVLDTNPSFKLSTWIQAARK--SGGEAADFFEYNARNQ 655

Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI-----ESLESGDGFRLK 494
           +T+W  N +     + DY +K W+GL+ +YY  R  ++  Y++     E  ++    +L+
Sbjct: 656 VTLWGPNGE-----IEDYASKQWAGLVGNYYAHRWQMFVDYLVATDPKEYDQNVFKKKLR 710

Query: 495 DWRREW 500
           +W  +W
Sbjct: 711 EWETQW 716


>gi|29348998|ref|NP_812501.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29340905|gb|AAO78695.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 732

 Score =  254 bits (649), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 171/566 (30%), Positives = 267/566 (47%), Gaps = 70/566 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLPQSWL  Q  LQK+IL R  E  M PVLPAF+G+VPA L+ ++P+AKI 
Sbjct: 196 MSNVDFWQSPLPQSWLKDQEELQKRILEREREFDMTPVLPAFAGHVPAELKTIYPNAKIY 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+  W       R   ++ +D  D L+  I R F+E+Q K YG T HIY  D F+E   P
Sbjct: 256 QMSQWGGFDEKYR---SHFIDPMDSLYSIIQRRFLEEQTKVYG-TDHIYGIDPFNEVDSP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
             S ++++++ + IY  +   DS A WL   W+F YD   W  P++++ L +VP  KL++
Sbjct: 312 DWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYDKKKWTQPRIRSFLKAVPDNKLIL 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF--------------- 224
           LD + +   IW  ++++YG PYIWC L NF GN  + G L+ I F               
Sbjct: 372 LDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIAGNLNDIDFKIKRLFKEGGDNVYG 431

Query: 225 ----------GPV-------EARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                      P+       +A     TT  WI  +S+ R G     I  AW  L+  +Y
Sbjct: 432 LGATLEGFDVNPLMYEFVFDQAWDYPVTTDQWITNWSMCRGGNQDANIIKAWRALHQKIY 491

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
             T+ AT     ++ A P        +T  K  N    +            Y +  LW  
Sbjct: 492 --TEHATCGQSVLMNARP-------RLTGTKSWNTNPGIH-----------YANNDLWQI 531

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
             E+++A  +          ++ +R+D+I++ RQ L    +E        Y   D  G+ 
Sbjct: 532 WKELLKARNI---------NNSDFRFDVINIGRQVLGNLFSEYRDQFTACYNRKDTTGMR 582

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
           + S R   L+ D+D LL+C     +G WL+ A+       ++  YE NAR  +T+W    
Sbjct: 583 EWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARNCGATVSEKDYYEENARCILTVW---- 638

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
            ++ + L DY N+ W GL R +Y  R   +   +I ++     F    + ++  +   +W
Sbjct: 639 GQQDTQLNDYANRGWGGLTRSFYRERWKRFTDGVIAAVSEDKPFDEDKFHQDITQFEYNW 698

Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
              ++ +P+ S  D +  +  L  KY
Sbjct: 699 TLQKDSFPIVSEEDPIQIADSLILKY 724


>gi|282877910|ref|ZP_06286719.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
           35310]
 gi|281299911|gb|EFA92271.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
           35310]
          Length = 723

 Score =  254 bits (649), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 174/543 (32%), Positives = 252/543 (46%), Gaps = 78/543 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP SWL+QQ  LQ++IL+R   L M PVLPAFSG+VPA L+ ++P A I 
Sbjct: 199 MANIDKWNGPLPMSWLEQQKELQQRILLRERSLNMKPVLPAFSGHVPAKLKELYPQANIK 258

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W     + R    + L+  DPLF +I + ++E+Q   +G T HIY  D F+E  PP
Sbjct: 259 YLGRWAGFSDNYR---CHFLNPEDPLFAKIQKMYLEEQKALFG-TDHIYGIDPFNEVDPP 314

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
              PEY+  +   IY  + S D  A W+   W+F ++   W P ++KALL  V  GK+ +
Sbjct: 315 SWKPEYLKEISHNIYRTVTSVDPGAEWMQMSWMFYHNKKQWTPKRIKALLTGVSRGKMSL 374

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
           LD   E   +W T+  FYG PYIWC L NF GN  + G                      
Sbjct: 375 LDYHCENVELWKTTNNFYGQPYIWCYLGNFGGNTTITGNVKESGQRLNEALNKKNKNLIG 434

Query: 219 -------LDSIAFGPVE-----ARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                  LD I F P E     A T+      WI+  + R  G S P ++ AW +L++ +
Sbjct: 435 IGSTLEGLDVIQF-PYEYILTQAWTATPADKEWIDNLADRHVGFSSPKLRQAWQILFNDI 493

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y      T   R + +  P + P +     GKYQ     ++       E        +W 
Sbjct: 494 Y------TQIPRSLGI-LPALRPIL-----GKYQERRTEITYPTKRLEE--------VWK 533

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             S+V                 N Y+ DLI + RQ L     +L L +   Y   D  G+
Sbjct: 534 LMSDVSEC------------DRNEYQLDLIAVGRQVLGNKFLKLKLELDSCYVNKDLVGL 581

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
            +      E++ D+D L A +    +G W++ A+    N+ ++  YE NAR  IT W   
Sbjct: 582 QRTGNTMKEVLVDLDYLTAGNSRCSIGKWIDDARAYGNNDLEKAYYEKNARNLITTW--- 638

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
                  L DY N+ WSGL+R YY  R ++Y   +  S+ SG  F  +   +   +   +
Sbjct: 639 ----GGSLNDYANRTWSGLIRTYYVRRWSMYIDELTASVMSGKPFDQQQLDKAIGEFEQN 694

Query: 507 WQN 509
           W N
Sbjct: 695 WVN 697


>gi|242077446|ref|XP_002448659.1| hypothetical protein SORBIDRAFT_06g030930 [Sorghum bicolor]
 gi|241939842|gb|EES12987.1| hypothetical protein SORBIDRAFT_06g030930 [Sorghum bicolor]
          Length = 252

 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 122/185 (65%), Positives = 145/185 (78%)

Query: 353 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 412
           YDL+DLTRQ LAKYAN++FL IIE+Y+ N  + V  L + FL LV D+D LL+ H+GFLL
Sbjct: 51  YDLVDLTRQVLAKYANDVFLKIIESYKSNKMNQVTILCKHFLNLVNDLDTLLSSHEGFLL 110

Query: 413 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 472
           GPWLESAK LA+N EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWSGLLRDYYGP
Sbjct: 111 GPWLESAKGLARNSEQEIQYEWNARTQITMWFDNTETKASLLRDYANKYWSGLLRDYYGP 170

Query: 473 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 532
           RAAIYFK+++ S+E    F L++WRREWI LTN+WQ+ R V+     GD+L  S  LY K
Sbjct: 171 RAAIYFKHLLLSMEKNAPFALEEWRREWISLTNNWQSDRKVFSTTPTGDSLNISWSLYIK 230

Query: 533 YLQGT 537
           YL  T
Sbjct: 231 YLSNT 235


>gi|410100551|ref|ZP_11295511.1| hypothetical protein HMPREF1076_04689 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409215586|gb|EKN08585.1| hypothetical protein HMPREF1076_04689 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 739

 Score =  253 bits (646), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 183/592 (30%), Positives = 271/592 (45%), Gaps = 112/592 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  +GGPLPQSW+D    L ++IL R  ELGM P+L +F+G VP  L+  +P A+I 
Sbjct: 194 MTNIETYGGPLPQSWIDSHQALGQQILERQRELGMTPILQSFTGFVPIKLKEKYPDARI- 252

Query: 61  QLGNWFSVKSDPRWC----CTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
                   K   RWC     T  LD  DPLF E+G+AF+E+Q K YG T+HIY  D F E
Sbjct: 253 --------KDKNRWCNAFTATVQLDPLDPLFKEMGQAFLEEQQKLYG-TNHIYAADPFHE 303

Query: 117 NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
              P +   Y+ ++G  I+      D +AV  MQ W             +A+  + P  +
Sbjct: 304 GAAPSNEKSYLEAVGKVIWEVASGFDPEAVIAMQTWSLR----------EAITRTFPQDR 353

Query: 177 LVVLDLFAEVKPIWSTSK--QFYGVPYIWCMLHNFAGNIEMYGIL--------------- 219
           L++LDL       W+  K   F+  PY+  +LHN+ G + M G L               
Sbjct: 354 LLLLDLGG-----WNVEKFNSFWNYPYVAGVLHNYGGRVYMGGNLALYAKNAHELKQSPK 408

Query: 220 -----------DSIAFGPVEARTSENTT--------MAWINQYSVRRYGRSVPAIQDAWN 260
                      ++I   PV    S   T          WI  Y+  RYG+     +  W 
Sbjct: 409 GGNIQGIGLFPEAIEHNPVVYELSTEITWMQDAPDLQKWITDYARARYGKLPAGAEQGWK 468

Query: 261 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 320
           VL  TVY    G       V+ A P +    ++       +  +P               
Sbjct: 469 VLLETVYGSKAGRLPSTESVMCARPALTIQKVAAN----GDLSRP--------------- 509

Query: 321 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
                YST  +  A++ F+ + N+L  S+TYRYDL+D+ RQ L+  +  L   I EAY  
Sbjct: 510 -----YSTVRLWDAVDHFLQASNDLKKSDTYRYDLVDVMRQCLSDLSLPLQKQITEAYLA 564

Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
            D   + Q   +FL L++D D LL     FLLG W++ A+Q    EE++  YEWNART +
Sbjct: 565 EDNEKLQQAGEQFLALIDDFDRLLGTRSTFLLGKWIKEARQWGTTEEEKALYEWNARTLV 624

Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF---------------KYMIESL 485
           T+W  N    ++ L +Y N+ W+GL++ YY PR   +                +Y+ +SL
Sbjct: 625 TVWGPN--HPSAHLFEYSNRQWAGLMKGYYKPRWEKFISYLKAQPKGEWRYDEQYIRKSL 682

Query: 486 ESGDGFRLKDWRREWIKLTN---DWQNGRNVYPVESNGDALITSQWLYNKYL 534
                    D+   + +LTN   DW   ++VY     G+ +   + LY K+L
Sbjct: 683 AGRPALDASDF---YTRLTNWEYDWAFNKDVYTDTPQGNEIEIVKELYAKWL 731


>gi|404406328|ref|ZP_10997912.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
          Length = 738

 Score =  253 bits (645), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 158/507 (31%), Positives = 246/507 (48%), Gaps = 36/507 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSNL  W  PLPQSWLD Q+ LQK+I+ R  EL M PVLPAF+G+VPA L  ++P AKI+
Sbjct: 206 MSNLDYWQSPLPQSWLDAQVELQKRIVARERELNMKPVLPAFAGHVPAELGEIYPEAKIS 265

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W   +   R   ++ LD  DPLF  I R F+ +Q   +G T HIY  D F+E  PP
Sbjct: 266 RMSKWGGFEDRYR---SHFLDPLDPLFARIQREFLAEQTALFG-TDHIYGADPFNEVDPP 321

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              PE+++ +   IY  M   D +A WL   WLF  D   W   +++A + +VP  K+++
Sbjct: 322 SWEPEFLARVSRTIYDTMTEADPEAEWLQMTWLFYLDRDKWHDDRIEAFVTAVPQDKMLL 381

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAW 239
           LD + E   +W  +  ++G PY WC L NF GN  + G  D ++   ++   +E      
Sbjct: 382 LDYYCENTEVWRQTHSYHGQPYFWCYLGNFGGNTMLVGNFDEVS-KRIDGVLAEGGN--- 437

Query: 240 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNR------DVIVAFPDVD---- 288
               ++R  G ++  + D+   +Y  V+    D   D +R      D  + + D      
Sbjct: 438 ----NLRGLGSTLEGL-DSNPFMYDYVFERAWDFPVDDDRWFDALADRYLGYEDTGYRRA 492

Query: 289 ----PSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE 344
                  + +T  KY  +   ++    L+   +      + Y   E+       I +G+ 
Sbjct: 493 WDALRKNVYITSSKY-GHCPLLNARPTLEGILTGTTDAEIKYDNDELFEVWAKMIDAGD- 550

Query: 345 LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLL 404
            S  +TYRY L+++ RQ L      L      A +  D   + +L    LEL  D++ L 
Sbjct: 551 -SGRDTYRYWLVNVGRQTLGNLFLPLRDGFTAACRAKDLARMKELRSEMLELAADLETLT 609

Query: 405 ACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSG 464
           A H  F +  W++ ++      E+   YE N RT +T W D  Q     + DY N+ WSG
Sbjct: 610 AQHGAFSMQKWIDDSRSFGTTPEERDYYEVNGRTLLTTWGDRAQS----INDYANRTWSG 665

Query: 465 LLRDYYGPRAAIYFKYMIESLESGDGF 491
           L+ DYY  R  ++    + ++E+G  F
Sbjct: 666 LVADYYAERWRMFLDAAVGAVEAGRKF 692


>gi|298386708|ref|ZP_06996263.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 1_1_14]
 gi|298260382|gb|EFI03251.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 1_1_14]
          Length = 732

 Score =  253 bits (645), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 170/566 (30%), Positives = 267/566 (47%), Gaps = 70/566 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLPQSWL  Q  LQK+IL R  E  M PVLPAF+G+VPA L+ ++P+AKI 
Sbjct: 196 MSNVDYWQSPLPQSWLKDQEELQKRILEREREFDMTPVLPAFAGHVPAELKTIYPNAKIY 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+  W       R   ++ +D  D L+  I R F+E+Q K YG T HIY  D F+E   P
Sbjct: 256 QMSQWGGFDEKYR---SHFIDPMDSLYSIIQRRFLEEQTKVYG-TDHIYGIDPFNEVDSP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
             S ++++++ + IY  +   DS A WL   W+F YD   W  P++++ L +VP  KL++
Sbjct: 312 DWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYDKKKWTQPRIRSFLKAVPDNKLIL 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF--------------- 224
           LD + +   IW  ++++YG PYIWC L NF GN  + G L+ I F               
Sbjct: 372 LDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIAGNLNDIDFKIKRLFKEGGDNVYG 431

Query: 225 ----------GPV-------EARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                      P+       +A     TT  WI  +S+ R G     I  AW  L+  +Y
Sbjct: 432 LGATLEGFDVNPLMYEFVFDQAWDYSVTTDQWITNWSMCRGGNQDANIIKAWRALHQKIY 491

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
             T+ AT     ++ A P        +T  K  N    +            Y +  LW  
Sbjct: 492 --TEHATCGQSVLMNARP-------RLTGTKSWNTNPGIH-----------YANNDLWQI 531

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
             E+++A  +          ++ +R+D+I++ RQ L    ++        Y   D  G+ 
Sbjct: 532 WKELLKARNI---------NNSDFRFDVINIGRQVLGNLFSKYRDQFTACYNRKDTTGMR 582

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
           + S R   L+ D+D LL+C     +G WL+ A+       ++  YE NAR  +T+W    
Sbjct: 583 EWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARNCGATVSEKDYYEENARCILTVW---- 638

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
            ++ + L DY N+ W GL R +Y  R   +   +I ++     F    + ++  +   +W
Sbjct: 639 GQQDTQLNDYANRGWGGLTRSFYRERWKRFTDGVIAAVSEDKPFDEDKFHQDITQFEYNW 698

Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
              ++ +P+ S  D +  +  L  KY
Sbjct: 699 TLQKDSFPIVSEEDPIQIADSLILKY 724


>gi|400599317|gb|EJP67021.1| alpha-N-acetylglucosaminidase, putative [Beauveria bassiana ARSEF
           2860]
          Length = 753

 Score =  252 bits (644), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 161/546 (29%), Positives = 263/546 (48%), Gaps = 80/546 (14%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G WGG LPQ+W+D Q  LQ+KI+ R+ ELGM P+LPAF G VP  +  V+P+  + +
Sbjct: 203 NIQGSWGGDLPQAWIDDQFALQRKIIKRMVELGMTPILPAFPGFVPENITRVWPNVSLAE 262

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
              W       R+     +   DP F E+ +AF+ +Q + YG  +  +  D F+EN P  
Sbjct: 263 SPTWSGFSG--RFTADKYITPYDPRFAELQKAFLTKQNEAYGNVTSFWTLDQFNENKPAS 320

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGK-LVV 179
               Y+ ++    +  ++  D  AVW+MQGWLF+ D  +W   ++K+ L+ VP+ + +++
Sbjct: 321 GELGYLRNVSHNTWQTLKDADPSAVWVMQGWLFASDKAYWTDDRVKSFLDGVPVNEDMLL 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE  P W  +  FYG P+IWC LH + GN+ +YG ++++    VEA     + +  
Sbjct: 381 LDLFAESTPQWQRTDSFYGKPWIWCQLHGYGGNMGLYGQIENVTRNAVEAVQKSPSIVGL 440

Query: 238 --------------------AW----------INQYSVRRYG---RSVPA-IQDAWNVLY 263
                               AW           + +   RYG   + +P  +  AW+ + 
Sbjct: 441 GLSMEGQEGNEIMYNLLLDQAWSKEALETDKYFSDWVTVRYGADQKEIPKDLYTAWDKVR 500

Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
            TVYN TD +       + A       ++  T G     G   +K               
Sbjct: 501 STVYNNTDSS-------VTAVAKSIFELVPSTSGLVNRTGHHATK--------------- 538

Query: 324 LWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
           + Y T  +I A      +G++   L  +  Y YDL D TRQ LA      +  ++E Y+ 
Sbjct: 539 ITYDTETLISAWNDMFNAGSQARWLFDNEAYSYDLTDWTRQVLANAFEATYNKLVEKYKS 598

Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
           N+  GV     R   ++  MD +L  +  F L  W+++A++     +    +E+NAR Q+
Sbjct: 599 NNIKGVKCAGSRLQAILRTMDQVLETNVHFRLSTWIQAARK--SGGDAADFFEYNARNQV 656

Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD----GFR--LK 494
           T+W  N +     + DY +K W+GL+ DYY  R  ++  Y++ + + GD     F+  L 
Sbjct: 657 TLWGPNGE-----IEDYASKQWAGLIGDYYAHRWQMFVDYLVAT-DPGDYDQMAFQKTLI 710

Query: 495 DWRREW 500
           +W +EW
Sbjct: 711 EWEKEW 716


>gi|395331391|gb|EJF63772.1| alpha-N-acetylglucosaminidase [Dichomitus squalens LYAD-421 SS1]
          Length = 750

 Score =  252 bits (643), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 172/556 (30%), Positives = 280/556 (50%), Gaps = 74/556 (13%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G WGG LP +W+D Q  LQK+IL R+ ELGM PVLP+F+G VP AL +++P+A I
Sbjct: 195 FGNIQGSWGGDLPVTWVDDQFQLQKQILQRMVELGMTPVLPSFTGFVPRALSSLYPNASI 254

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
                W    +         L+  DPLF  I  +FI +Q + YG  SHIY  D ++EN P
Sbjct: 255 VNGSQWEGFPT--ALTNDSFLEPFDPLFTTIQTSFISKQREAYGNVSHIYALDQYNENDP 312

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLG-KL 177
               P Y++++ A  ++ +++ D DAVWLMQGWL FS   FW   +++A L  VP    +
Sbjct: 313 FSGDPAYLANVTAGTFASLRAADPDAVWLMQGWLFFSSAAFWTNERIEAYLGGVPGNDSM 372

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LDL++E +P W+ +  +YG  ++WC LH + GNI M G LD++   P+ A  +  ++M
Sbjct: 373 IILDLYSEAQPQWNRTSSYYGKQWVWCELHGYGGNIGMEGDLDALTQNPIAALHAPGSSM 432

Query: 238 -----------------------AW----------INQYSVRRYG-RSVP-AIQDAWNVL 262
                                  AW          ++Q+  RRY  R +P +  DAW  L
Sbjct: 433 KGVGLTMEGQEGNELVYDILLDQAWSSAPLNLSSYVDQWVARRYNVRRLPKSALDAWRTL 492

Query: 263 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 322
             TVY+  D  T   +  I +  ++ P++  +T                  + T  +   
Sbjct: 493 ATTVYSNKDSGT---QAAIKSIYELAPALTGMT------------------NRTGHHPTA 531

Query: 323 HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 379
             + + S V+ A +  + + +E   L+    + YD++D+TRQ L+    + +  ++  Y 
Sbjct: 532 IPYDTNSTVLVAAKALLEARSENPLLATIPEFAYDVVDVTRQLLSNRFIDHYNVLVATYN 591

Query: 380 LNDA--HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ---EKQYEW 434
            N      V   +   L L++D+D LLA ++ FLL  W+  AK+     ++    +  E+
Sbjct: 592 SNATAPRNVAAAAGPLLALLDDLDELLATNEHFLLSNWIADAKRWTHGADRAAYARLLEY 651

Query: 435 NARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 494
           NAR QIT+W  + +     + DY +K W+GL+R YY PR   + +Y+ ++ E+G  +   
Sbjct: 652 NARNQITLWGPDGE-----INDYASKAWAGLVRTYYKPRWEAFVEYLAQTKEAGAAYDAH 706

Query: 495 DWRREWIKLTNDWQNG 510
               + I +   W NG
Sbjct: 707 VVSAKMIAIGQQWSNG 722


>gi|242809019|ref|XP_002485282.1| alpha-N-acetylglucosaminidase, putative [Talaromyces stipitatus
           ATCC 10500]
 gi|218715907|gb|EED15329.1| alpha-N-acetylglucosaminidase, putative [Talaromyces stipitatus
           ATCC 10500]
          Length = 755

 Score =  251 bits (640), Expect = 9e-64,   Method: Compositional matrix adjust.
 Identities = 159/527 (30%), Positives = 265/527 (50%), Gaps = 75/527 (14%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G W G LP  W+D Q  LQKKI+ R+ ELGM P+LPAF G VP A+  V P A +  
Sbjct: 203 NIQGSWSGSLPYDWVDSQFDLQKKIVKRMTELGMTPILPAFPGFVPRAITRVLPDADVIN 262

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
              W +  +   +     ++ TDP F EI ++FI +Q++ YG  +  Y  D F+EN P  
Sbjct: 263 GSAWEAFPT--MYTNDTFMEPTDPHFTEIQKSFIAKQIEAYGNVTTFYTLDQFNENNPSS 320

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPL-GKLVV 179
               Y+ ++    +  +++ DS+AVW+MQGWLF S   FW   +++A L  V +   L++
Sbjct: 321 GDLSYLRNVSQGTWKTLKAADSNAVWVMQGWLFTSNSAFWTNDRIEAYLGGVAVDSDLLI 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT---- 235
           LDL +E  P W  +  +YG P+IWC +H++ GN+  YG + +I   P+ A  + ++    
Sbjct: 381 LDLASESSPQWQRTNSYYGKPWIWCEIHDYGGNMGFYGQVMNITNNPIAALHNSSSLVGF 440

Query: 236 ----------------------------TMAWINQYSVRRYG--RSVP-AIQDAWNVLYH 264
                                       T ++ + +   RY   RS+P ++  AW++L  
Sbjct: 441 GLSMEGQEGNEIVYDLLLDQAWNAAPIDTESYFHDWVTARYAGSRSIPSSVYSAWDILRT 500

Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP-H 323
           TVYN T+ A +       A P     +I  T G     G                 HP  
Sbjct: 501 TVYNNTNLAAN-------AVPKAIFELIPSTTGLLNRTGH----------------HPTK 537

Query: 324 LWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
           L Y+T+++++A  LF  S  +   L  +  + +DL+D++RQ LA     ++ N+I  Y  
Sbjct: 538 LNYNTADMVQAWNLFYTSAFKEPSLWLNPAFEFDLVDMSRQVLANAFIPVYENLISTYNT 597

Query: 381 ND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-EWNAR 437
           ++  +  +  +    + +++ +D +LA +  F L  WL +A+  A ++   + + E+NAR
Sbjct: 598 SNPSSTKLQTIGAELIGILQALDTVLATNKNFKLSTWLSAARASAGSQHNIEDFLEYNAR 657

Query: 438 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
            QIT+W    Q     + DY +K W+GL+  YY PR  ++ +Y+I++
Sbjct: 658 NQITLWGPTGQ-----ISDYASKSWAGLVSSYYIPRWKMFVEYLIDT 699


>gi|319900259|ref|YP_004159987.1| alpha-N-acetylglucosaminidase [Bacteroides helcogenes P 36-108]
 gi|319415290|gb|ADV42401.1| Alpha-N-acetylglucosaminidase [Bacteroides helcogenes P 36-108]
          Length = 718

 Score =  251 bits (640), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 178/581 (30%), Positives = 270/581 (46%), Gaps = 102/581 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P  W   Q  LQK+IL R+ E G+ PVLP +SG +PA  +         
Sbjct: 187 MNNLEGWGGPNPDQWYSHQEQLQKRILKRMREYGIEPVLPGYSGMIPANAKE-------- 238

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
           +LG    V    +WC       L  +D  F  I R + ++  + YG+ ++ Y+ D F E 
Sbjct: 239 KLG--LDVADPGKWCGYRRPAFLQPSDKNFRRIARLYYKEMTRLYGKANY-YSMDPFHEG 295

Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW-LFSYDPFWRPPQMKALLNSVPL 174
            NT  VD    + + G +I   M+  +  AVW+ Q W    YD          ++ ++P 
Sbjct: 296 GNTKGVD----LDAAGKSIRDAMKEANPQAVWVAQAWGACPYD---------NMIKNLPE 342

Query: 175 GKLVVLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSI----- 222
           G ++VLDL++E +P W         KQ +G   +I+CML NF GN+ +YG ++ +     
Sbjct: 343 GDMIVLDLYSESRPQWGDPASAWYRKQGFGRHGWIYCMLLNFGGNVGLYGKMEHVIDEFY 402

Query: 223 -----AFGP------VEARTSENTTM------------------AWINQYSVRRYGRSVP 253
                AFG       +    SEN  +                   W+  Y   RYG++ P
Sbjct: 403 KARESAFGGTLQGVGLTMEGSENNPVMYELLCELPWHGRRISKDQWLKSYLKARYGKTTP 462

Query: 254 AIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 311
              +AW  L +T+YN  + +T +  +  V  A P ++          YQ           
Sbjct: 463 QTVEAWLKLSNTIYNSPNASTQQGTHESVFCARPSLEA---------YQ----------- 502

Query: 312 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 371
                SS+     +Y+ +++IRA    I +  E   +N + YDLID+ RQA+A+    ++
Sbjct: 503 ----VSSWSEMKDYYAPADIIRAAGKMIEAAEEFRGNNNFEYDLIDVVRQAVAEKGRLVY 558

Query: 372 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
             ++ AY+  D       S RFLEL+E  D LL     F LG W   A+ + + + Q+  
Sbjct: 559 PIVVSAYKAADKQLFEAASARFLELIELQDKLLGTRREFRLGTWTNYARNMGETDAQKDL 618

Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF 491
           YEWNAR QIT W + T      L DY +K W+GLLRD+Y  R   YF  +  +L +G+  
Sbjct: 619 YEWNARVQITTWGNRTAANEGGLHDYAHKEWNGLLRDFYYMRWKAYFDELRSTL-NGNAP 677

Query: 492 RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 532
           +  D    +  L  +W    N Y  E  GDA   ++ +Y K
Sbjct: 678 KETD----FYTLEENWAGQHNPYSAEPEGDATDIAKEVYGK 714


>gi|383124408|ref|ZP_09945072.1| hypothetical protein BSIG_3565 [Bacteroides sp. 1_1_6]
 gi|251839096|gb|EES67180.1| hypothetical protein BSIG_3565 [Bacteroides sp. 1_1_6]
          Length = 732

 Score =  250 bits (639), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 170/566 (30%), Positives = 266/566 (46%), Gaps = 70/566 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLPQSWL  Q  LQK+IL R  E  M PVLPAF+G+VPA L+ ++P+AKI 
Sbjct: 196 MSNVDYWQSPLPQSWLKDQEELQKRILEREREFDMTPVLPAFAGHVPAELKTIYPNAKIY 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+  W       R   ++ +D  D L+  I R F+E+Q K YG T HIY  D F+E   P
Sbjct: 256 QMSQWGGFDEKYR---SHFIDPMDSLYQVIQRRFLEEQTKVYG-TDHIYGIDPFNEVDSP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
             S ++++++ + IY  +   DS A WL   W+F YD   W  P++++ L +VP  KL++
Sbjct: 312 DWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYDKKKWTQPRIRSFLKAVPDDKLIL 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF--------------- 224
           LD + +   IW  ++++YG PYIWC L NF GN  + G L+ I F               
Sbjct: 372 LDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIAGNLNDIDFKIKRLFKEGGDNVYG 431

Query: 225 ----------GPV-------EARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                      P+       +A     TT  WI  +S+ R G     I  AW  L+  +Y
Sbjct: 432 LGATLEGFDVNPLMYEFVFDQAWDYPVTTDQWITNWSMCRGGDQDANIIKAWRALHQNIY 491

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
             T+ A      ++ A P        +T  K  N    +            Y +  LW  
Sbjct: 492 --TEYAICGQSVLMNARP-------RLTGTKSWNTNPGI-----------HYANNDLWQI 531

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
             E+++A  +          ++ +R+D+I++ RQ L    +E        Y   D  G+ 
Sbjct: 532 WKELLKARNI---------NNSDFRFDVINIGRQVLGNLFSEYRDQFTACYNRKDTTGMR 582

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
           + S R   L+ D+D LL+C     +G WL+ A+       ++  YE NAR  +T+W    
Sbjct: 583 EWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARDCGTTVSEKDYYEENARCILTVW---- 638

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
            ++ + L DY N+ W GL R +Y  R   +   +I ++     F    + ++  +   +W
Sbjct: 639 GQQDTQLNDYANRGWGGLTRSFYRERWKRFTDGVIGAVSKNKPFDEDKFHQDITQFEYNW 698

Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
              ++ +P+ S  D +  +  L  KY
Sbjct: 699 TLQKDSFPIVSEEDPIQIADSLILKY 724


>gi|423346424|ref|ZP_17324112.1| hypothetical protein HMPREF1060_01784 [Parabacteroides merdae
           CL03T12C32]
 gi|409220242|gb|EKN13198.1| hypothetical protein HMPREF1060_01784 [Parabacteroides merdae
           CL03T12C32]
          Length = 718

 Score =  250 bits (638), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 175/578 (30%), Positives = 266/578 (46%), Gaps = 100/578 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ+ LQK+I+ R+ E G+ PV P +SG VP   +       ++
Sbjct: 187 MNNLEGWGGPNPDSWYKQQIALQKRIVKRMREYGIEPVFPGYSGMVPHNAKEKL-GLNVS 245

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W   +          L  TDP F EI   + ++  K YG+  + Y+ D F E    
Sbjct: 246 DPGLWNGYRRPA------FLQPTDPRFEEIASLYYKEMNKLYGKADY-YSMDPFHEGGSV 298

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           V     + + G AI   M+  +  AVW+ Q W  +  P         ++ ++  G L+VL
Sbjct: 299 VGVD--LDAAGKAIMQAMKKNNPKAVWVAQAWQANPRP--------QMIGNLEAGDLIVL 348

Query: 181 DLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSI----------A 223
           DLFAE +P W         K  +G   +I+CML N+ GN+ ++G +  +           
Sbjct: 349 DLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLNYGGNVGLHGKMKHVIDEFYKAKESP 408

Query: 224 FGP------VEARTSENTTMA------------------WINQYSVRRYGRSVPAIQDAW 259
           FG       +    SEN  +                   W+ +Y+V RYG+S P +QDAW
Sbjct: 409 FGKTLKGVGMTMEGSENNPVMFELLTELPWRPQRFDKDQWLREYTVARYGKSNPTVQDAW 468

Query: 260 NVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS 317
            +L +++YNC D  T +  +  V  A P         TE  YQ                S
Sbjct: 469 ILLSNSIYNCPDANTQQGTHESVFCARP---------TEHPYQ---------------VS 504

Query: 318 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEA 377
           S+     +Y  + VIRA  + ++  +E   +N + YDL+D+ RQA+A+    L   ++EA
Sbjct: 505 SWSEMKDYYDPNNVIRAAAMMVSVADEFKGNNNFEYDLVDIVRQAIAE-KGRLTEKVVEA 563

Query: 378 -YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 436
            +   D       S RFL L+   D LLA    F +G W+  A+ L    E+++ YEWNA
Sbjct: 564 AFAAGDKKLYKDASDRFLRLILLQDELLATRPEFKVGTWIARARSLGSTPEEKELYEWNA 623

Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
           R QIT W +    +   LRDY ++ W+G+L+D+Y  R   +F Y           RL D 
Sbjct: 624 RVQITTWGNRLAADEGGLRDYAHREWNGILKDFYYMRWKTWFDYQT---------RLLDG 674

Query: 497 RR----EWIKLTNDWQNGRNVYPVESNGDALITSQWLY 530
           R+    ++  +   W    NVY  E  GD + T + ++
Sbjct: 675 RKTAAIDFYAIEERWTKATNVYSSEPEGDCISTVKRIF 712


>gi|154489986|ref|ZP_02030247.1| hypothetical protein PARMER_00215 [Parabacteroides merdae ATCC
           43184]
 gi|423722990|ref|ZP_17697143.1| hypothetical protein HMPREF1078_01203 [Parabacteroides merdae
           CL09T00C40]
 gi|154089428|gb|EDN88472.1| Alpha-N-acetylglucosaminidase (NAGLU) [Parabacteroides merdae ATCC
           43184]
 gi|409241820|gb|EKN34587.1| hypothetical protein HMPREF1078_01203 [Parabacteroides merdae
           CL09T00C40]
          Length = 718

 Score =  249 bits (637), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 176/580 (30%), Positives = 267/580 (46%), Gaps = 104/580 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ+ LQK+I+ R+ E G+ PV P +SG VP   +       ++
Sbjct: 187 MNNLEGWGGPNPDSWYKQQIALQKRIVKRMREYGIEPVFPGYSGMVPHNAKEKL-GLNVS 245

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W   +          L  TDP F EI   + ++  K YG+  + Y+ D F E  + 
Sbjct: 246 DPGLWNGYRRPA------FLQPTDPRFEEIASLYYKEMNKLYGKADY-YSMDPFHEGGSV 298

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             VD    + + G AI   M+  +  AVW+ Q W  +  P         ++ ++  G L+
Sbjct: 299 AGVD----LDAAGKAIMQAMKKNNPKAVWVAQAWQANPRP--------QMIGNLEAGDLI 346

Query: 179 VLDLFAEVKPIWSTSKQ-------FYGVPYIWCMLHNFAGNIEMYGILDSI--------- 222
           VLDLFAE +P W            F    +I+CML N+ GN+ ++G L  +         
Sbjct: 347 VLDLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLNYGGNVGLHGKLKHVIDEFYKAKE 406

Query: 223 -AFGP------VEARTSENTTMA------------------WINQYSVRRYGRSVPAIQD 257
             FG       +    SEN  +                   W+ +Y+V RYG+S P +QD
Sbjct: 407 SPFGKTLKGVGMTMEGSENNPVMFELLTELPWCPQRFDKDQWLREYTVARYGKSNPTVQD 466

Query: 258 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 315
           AW +L +++YNC D  T +  +  V  A P         TE  YQ               
Sbjct: 467 AWILLSNSIYNCPDANTQQGTHESVFCARP---------TEHPYQ--------------- 502

Query: 316 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 375
            SS+     +Y  ++VIRA  + ++  +E   +N + YDL+D+ RQA+A+    L   ++
Sbjct: 503 VSSWSEMKDYYDPNDVIRAAAMMVSVADEFKGNNNFEYDLVDIVRQAIAE-KGRLTEKVV 561

Query: 376 E-AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEW 434
           E A+   D       S RFL L+   D LLA    F +G W+  A+ L    E+++ YEW
Sbjct: 562 EAAFAAGDKKLYKDASDRFLRLILLQDELLATRPEFKVGTWIARARSLGGTPEEKELYEW 621

Query: 435 NARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 494
           NAR QIT W +    +   LRDY ++ W+G+L+D+Y  R   +F Y           RL 
Sbjct: 622 NARVQITTWGNRLAADEGGLRDYAHREWNGILKDFYYMRWKTWFDYQT---------RLL 672

Query: 495 DWRR----EWIKLTNDWQNGRNVYPVESNGDALITSQWLY 530
           D R+    ++  +   W    NVY  E  GD + T + ++
Sbjct: 673 DGRKTAAIDFYAIEERWTKATNVYSSEPEGDCISTVKRIF 712


>gi|281200618|gb|EFA74836.1| alpha-N-acetylglucosaminidase [Polysphondylium pallidum PN500]
          Length = 469

 Score =  249 bits (637), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 166/519 (31%), Positives = 256/519 (49%), Gaps = 88/519 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++ W G L   W+  Q  LQ +IL R+ + GM  VLP F+G+VP AL++ +P+A IT
Sbjct: 1   MGNVNEWAGNLTLGWMVDQRDLQIQILTRMRQFGMQAVLPGFAGHVPEALKSHYPNANIT 60

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QL +W           T  +  +   F+ I       Q   YG T H YN D F+E  PP
Sbjct: 61  QLSSW---------NMTVYIHQSPNTFMSI-------QQDLYG-TDHFYNFDPFNELEPP 103

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+ +   ++++ + + D   +W++QGWLF YD  FW+PPQ++A L+ VP+GK++V
Sbjct: 104 SSDPAYLKNCSQSMFNNLIAVDPQGIWVLQGWLFVYDTEFWQPPQIEAFLSGVPIGKMIV 163

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART-SEN---- 234
           LDL+A+V   W  +  FYG  +IWCMLHNF G   MYG +  I+  P+EAR+ S N    
Sbjct: 164 LDLWADVDAGWKITNYFYGHNWIWCMLHNFGGRSGMYGKIPFISTNPIEARSLSPNMVGT 223

Query: 235 -----------------TTMA----------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                            + MA          W++QY  RRYG+ +  + D W  L  TV+
Sbjct: 224 GLTPEAIEQNVIVYDLMSEMAWRSTPPDLKEWVDQYVTRRYGKYIEVLADTWYELVGTVF 283

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           NC+                       VT+G        VS    L   TS Y  P +   
Sbjct: 284 NCS----------------------IVTKGPVTIL---VSVRPQLNFTTSLYYDPIV--- 315

Query: 328 TSEVIRALELFIASGN-ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
              + +A   F++  +  +  ++T+ +DL ++T QAL+       L +  A+ LND++  
Sbjct: 316 ---ISKAWSAFLSIDDLHVVNTSTFSFDLTEITTQALSNLFMTTELQMNAAF-LNDSYEE 371

Query: 387 FQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
           F  LS   L +++D++ +++  +  L+G W   A+ L    E  + YE NAR QIT+W  
Sbjct: 372 FSLLSDALLSIIQDINTIVSTQEMLLVGNWTARARALTPANETTELYEMNARNQITLW-- 429

Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
                 S   DY  K W GL  D+Y  R  ++ + + ++
Sbjct: 430 --GPPDSFDHDYAYKLWGGLTEDFYLARWTLFSQSIFKT 466


>gi|329963073|ref|ZP_08300853.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
 gi|328529114|gb|EGF56044.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
          Length = 717

 Score =  249 bits (636), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 177/583 (30%), Positives = 276/583 (47%), Gaps = 106/583 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  Q+  LQK+IL R+ E G+ PVLP +SG VP   +         
Sbjct: 187 MNNLEGWGGPNPDSWYTQREALQKQILKRMREYGIQPVLPGYSGMVPHNAKE-------- 238

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  TDP F EI   + ++  + YG+ +  Y+ D F E
Sbjct: 239 RLG--LNV-SDPGLWCGYPRPAFLQPTDPRFGEIADLYYKEMTRLYGK-ADFYSMDPFHE 294

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             +   VD    +++ G AI+  M+  +  AVW+ Q W  +       P+ K ++ ++P 
Sbjct: 295 GGSIAGVD----LNAAGQAIWGAMKKVNPKAVWVAQAWQAN-------PRQK-MIENIPQ 342

Query: 175 GKLVVLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           G L+VLDLF+E +P W         K+ +G   +++CML N+ GN+ ++G +  +     
Sbjct: 343 GDLIVLDLFSESRPQWGDPASTWYRKEGFGKHDWLYCMLLNYGGNVGLHGKMRHVIDEFY 402

Query: 228 EARTS----------------ENTTMA------------------WINQYSVRRYGRSVP 253
           +A+TS                EN ++                   W+  Y+  RYG++  
Sbjct: 403 KAKTSPFGKTLKGVGMTMEGSENNSVMFELLCELPWRPAQFEKDEWLKNYTAARYGKADA 462

Query: 254 AIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 311
            +Q AW +L +++YNC D  T +  +  V  A P +D          YQ           
Sbjct: 463 TVQQAWLLLSNSIYNCPDANTQQGTHESVFCARPGMD---------VYQ----------- 502

Query: 312 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 371
                SS+     +Y   EVIRA  + +++ +    +N + YDL+D+ RQA+A+    ++
Sbjct: 503 ----VSSWSEMVKYYEPEEVIRAAGILLSAADRFKGNNNFEYDLVDIVRQAVAEKGRLVY 558

Query: 372 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
             +I+A +  +       S+RFL L+   D LLA    F +G W+E A+ L   +E++K 
Sbjct: 559 PIMIDALKAGEKELFAAASQRFLNLILLQDRLLATRPEFKVGTWIEKARNLGTTQEEKKL 618

Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES--GD 489
           YEWNAR QI  W + T  +   LRDY +K W+G+LRD+Y  R  ++       L      
Sbjct: 619 YEWNARVQIATWGNRTAADEGGLRDYAHKEWNGMLRDFYYHRWKLWIDAQTAQLNGAPAQ 678

Query: 490 GFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 532
           GF        W   TND       YP    GD +  ++  Y +
Sbjct: 679 GFDFYAIEEPWTLQTND-------YPSHPEGDVIEVARTAYKE 714


>gi|383122982|ref|ZP_09943669.1| hypothetical protein BSIG_0276 [Bacteroides sp. 1_1_6]
 gi|251841923|gb|EES70003.1| hypothetical protein BSIG_0276 [Bacteroides sp. 1_1_6]
          Length = 730

 Score =  248 bits (634), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 167/558 (29%), Positives = 273/558 (48%), Gaps = 50/558 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLP+SWL+QQ VLQK+IL R  +  M PVLPAFSG+VP  L+ ++P AKI 
Sbjct: 197 MSNVDYWQSPLPKSWLEQQEVLQKQILKRERDFNMTPVLPAFSGHVPKELKAIYPDAKIH 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W    S  R   ++ ++  D LF  I + ++E+Q   YG T HIY  D F+E   P
Sbjct: 257 EMSQWGGYDSKYR---SHFIEPMDSLFNIIQKMYLEEQTAIYG-TDHIYGIDPFNEVDSP 312

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
             + ++++ +   IY  +   D++A WL   W+F +D   W  P++++ L +VP  KL++
Sbjct: 313 NWNEDFLAKVSKKIYESIYQVDAEAKWLQMTWMFYHDQKKWTQPRIRSFLEAVPDDKLIL 372

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP----VEARTSENT 235
           LD + +   IW  ++ +YG PY+WC L NF GN  M G LD +        VE   +   
Sbjct: 373 LDYYCDSTEIWRNTEMYYGKPYMWCYLGNFGGNSMMVGNLDDVDVKIEKLFVEGGENVYG 432

Query: 236 TMAWINQYSVRRY--------GRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDV 287
             A +  + V  +            P   D W         C  G  D  R ++ A+  +
Sbjct: 433 LGATLEGFDVNPFMYEFVFDQAWDYPLTTDQW---IQNWAKCRGGNQD--RHILKAWDSL 487

Query: 288 DPSIISVTEGKYQNYGKPVSKEA--VLKSETS-------SYDHPHLWYSTSEVIRALELF 338
              I      KY   G+ V   A  +L    S       +Y++  LW   +E+++A  + 
Sbjct: 488 HKKIYK----KYATAGQAVLMNARPMLVGTDSWNTYPDITYNNRDLWDIWTEMLKASHI- 542

Query: 339 IASGNELSASNT-YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELV 397
                    +NT YR+D+I++ RQ L    +    +  + Y   D  G+ + + +   L+
Sbjct: 543 ---------NNTGYRFDVINVGRQVLGNLFSSFRDHFTQCYSEKDIDGMKKWADQMDSLL 593

Query: 398 EDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDY 457
            D D LL+C   F +G W++ A+   + E +++ YE NAR  +T+W     ++A+ L DY
Sbjct: 594 IDTDRLLSCETNFSIGKWIDDARSFGKTEAEKEYYEENARCILTVW----GQKATQLNDY 649

Query: 458 GNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE 517
            N+ W GL   YY  R   +   +I +  SG  F  K + +       +W   +  +P+ 
Sbjct: 650 ANRGWGGLTYSYYRERWKRFTTEVITASLSGQKFDEKQFYQSITDFEYEWTLSKEHHPII 709

Query: 518 SNGDALITSQWLYNKYLQ 535
           S  + ++ ++ L  KY+Q
Sbjct: 710 SGENPILLAKTLSEKYMQ 727


>gi|29345848|ref|NP_809351.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29337741|gb|AAO75545.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 730

 Score =  248 bits (633), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 167/558 (29%), Positives = 273/558 (48%), Gaps = 50/558 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLP+SWL+QQ VLQK+IL R  +  M PVLPAFSG+VP  L+ ++P AKI 
Sbjct: 197 MSNVDYWQSPLPKSWLEQQEVLQKQILKRERDFNMTPVLPAFSGHVPKELKAIYPDAKIH 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W    S  R   ++ ++  D LF  I + ++E+Q   YG T HIY  D F+E   P
Sbjct: 257 EMSQWGGYDSKYR---SHFIEPMDSLFNIIQKMYLEEQTAIYG-TDHIYGIDPFNEVDSP 312

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
             + ++++ +   IY  +   D++A WL   W+F +D   W  P++++ L +VP  KL++
Sbjct: 313 NWNEDFLAKVSKKIYESIYQVDAEAKWLQMTWMFYHDQKKWTQPRIRSFLEAVPDDKLIL 372

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP----VEARTSENT 235
           LD + +   IW  ++ +YG PY+WC L NF GN  M G LD +        VE   +   
Sbjct: 373 LDYYCDSTEIWRNTEMYYGKPYMWCYLGNFGGNSMMVGNLDDVDVKIEKLFVEGGENVYG 432

Query: 236 TMAWINQYSVRRY--------GRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDV 287
             A +  + V  +            P   D W         C  G  D  R ++ A+  +
Sbjct: 433 LGATLEGFDVNPFMYEFVFDQAWDYPLTTDQW---IQNWAKCRGGNQD--RHILKAWDSL 487

Query: 288 DPSIISVTEGKYQNYGKPVSKEA--VLKSETS-------SYDHPHLWYSTSEVIRALELF 338
              I      KY   G+ V   A  +L    S       +Y++  LW   +E+++A  + 
Sbjct: 488 HKKIYK----KYATAGQAVLMNARPMLVGTDSWNTYPDITYNNRDLWDIWTEMLKASHI- 542

Query: 339 IASGNELSASNT-YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELV 397
                    +NT YR+D+I++ RQ L    +    +  + Y   D  G+ + + +   L+
Sbjct: 543 ---------NNTGYRFDVINVGRQVLGNLFSSFRDHFTQCYSEKDIDGMKKWADQMDALL 593

Query: 398 EDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDY 457
            D D LL+C   F +G W++ A+   + E +++ YE NAR  +T+W     ++A+ L DY
Sbjct: 594 IDTDRLLSCETNFSIGKWIDDARSFGKTEAEKEYYEENARCILTVW----GQKATQLNDY 649

Query: 458 GNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE 517
            N+ W GL   YY  R   +   +I +  SG  F  K + +       +W   +  +P+ 
Sbjct: 650 ANRGWGGLTYSYYRERWKRFTTEVITASLSGQKFDEKQFYQSITDFEYEWTLSKEHHPII 709

Query: 518 SNGDALITSQWLYNKYLQ 535
           S  + ++ ++ L  KY+Q
Sbjct: 710 SGENPILLAKTLSEKYMQ 727


>gi|429740222|ref|ZP_19273924.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
 gi|429153947|gb|EKX96708.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
          Length = 730

 Score =  248 bits (632), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 167/568 (29%), Positives = 251/568 (44%), Gaps = 79/568 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP+ WL++Q  LQK+IL R     M PVLPAF+G+VPA L+ +FP A I 
Sbjct: 198 MANIDRWNGPLPKEWLEEQRDLQKQILARERAFNMKPVLPAFAGHVPAELKRIFPDANIK 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W     D ++ C + L+  +PLF +I + F+E+Q   +G T HIY  D F+E  PP
Sbjct: 258 SLGKWGGF--DEQYLC-HFLNPGEPLFAKIQKLFLEEQTALFG-TDHIYGVDPFNEGEPP 313

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+  +   +Y  + + D  A W+  GW+F YD   W P ++KA L  VP GK+ +
Sbjct: 314 SWEPAYLKEISKNMYGTLTAVDPKAEWMQMGWMFYYDKKVWTPKRVKAFLTGVPQGKMSL 373

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
           LD   E   +W T+  FYG PYIWC L NF GN  + G                      
Sbjct: 374 LDYHCENVELWKTNDGFYGQPYIWCYLGNFGGNTTLTGNVKETGKRLDAALKAARRNMLG 433

Query: 219 -------LDSIAFGPVEA------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                  LD I F P E         S+     WI++ + R  G + P+++ AW +L+  
Sbjct: 434 VGSTLEGLDVIQF-PYEYVFDKVWTHSDKGNQQWIDELADRHAGFTSPSVRKAWQILFDE 492

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           ++    G                             Y    S+  VL    S        
Sbjct: 493 IFVQVPGT----------------------------YSILPSRSPVLNDNHSERTEIKYP 524

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
               E + +L L +         N  + DLI + RQ L      +      AY   D   
Sbjct: 525 AQRLEEVWSLLLDVPQ----CERNELQVDLIAVGRQVLGNKFLAVKSEFDAAYAAKDITL 580

Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
           + Q +    EL+ D+D L + +    +  W++ A+ L +N E +  YE NAR  IT+W  
Sbjct: 581 LRQKAYEMEELLSDLDCLTSFNTRCTVNKWIDDARALGRNAEMKNYYERNARYLITLW-- 638

Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
                   L DY ++ W GL+  YYG R  +Y   ++ S ++G  F  K +  +  +   
Sbjct: 639 -----GGHLSDYASRAWGGLIGSYYGGRWRLYIHDILASAQTGKPFDQKAFDEKRSQFEQ 693

Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKY 533
            W +      +    D L   + +++KY
Sbjct: 694 TWVHSTTPITLPQRNDLLTFCKMMFSKY 721


>gi|237719043|ref|ZP_04549524.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
 gi|229451821|gb|EEO57612.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
          Length = 713

 Score =  247 bits (631), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 176/577 (30%), Positives = 269/577 (46%), Gaps = 93/577 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 181 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 240

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C + L+  D LF +I + F+++Q K +G T HIY  D F+E  PP
Sbjct: 241 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 296

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PEY+  + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++
Sbjct: 297 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 356

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
           LD   E   +W  ++ F+  PYIWC L NF GN  + G                      
Sbjct: 357 LDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 416

Query: 219 -------LDSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                  LD + F P E    +   +      WI   + R  G    +++DAW  L++ +
Sbjct: 417 IGSTLEGLDVMQF-PYEYILEKAWNLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 475

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y              V  P         T G    Y +P   +   K  ++ Y +  L  
Sbjct: 476 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 510

Query: 327 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
              EV R L       NE S+   + +R DLI + RQ L  Y  ++ +      +  D  
Sbjct: 511 --LEVWRKL-------NEASSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVETKDHQ 561

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
            +     +  E++ D+D L A H    L  W++ A+++  + + +  YE NAR  IT W 
Sbjct: 562 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 620

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE-- 499
                    L DY ++ W+GL+ DYY  R  +Y    I++ E G   D  +L+D  +E  
Sbjct: 621 ------GGSLNDYASRSWAGLISDYYAKRWEVYINTFIKAAEKGVEVDQKQLEDELKEIE 674

Query: 500 --WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 533
             W+  T+     ++V+   S  D L++ S +L++KY
Sbjct: 675 EGWVNATDREDTRKDVH---STTDGLLSFSTFLFSKY 708


>gi|374385779|ref|ZP_09643282.1| hypothetical protein HMPREF9449_01668 [Odoribacter laneus YIT
           12061]
 gi|373225481|gb|EHP47815.1| hypothetical protein HMPREF9449_01668 [Odoribacter laneus YIT
           12061]
          Length = 715

 Score =  246 bits (629), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 165/542 (30%), Positives = 242/542 (44%), Gaps = 94/542 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P+SW ++Q+ LQ +IL R+ E G+ PV P ++G        + P     
Sbjct: 186 MNNLEGWGGPNPESWYERQMQLQHRILNRMREYGIEPVFPGYAG--------MLPHNASE 237

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
           +LG    VK    WC       L   +P F  I   +  +  K +G+ +  Y  D F E 
Sbjct: 238 KLG--IEVKDPGLWCGYQRPAFLYPENPAFKRIAGLYYMEMEKRFGK-AKFYGMDPFHEG 294

Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
            N   +D    +++   ++   M++ + +AVW+MQ W        +      ++ ++  G
Sbjct: 295 GNVQGID----LAAAAQSVLQAMKTANPEAVWVMQAW--------QANPRHEMITALQPG 342

Query: 176 KLVVLDLFAEVKPIWS-------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
            +++LDL +E +P+W          K F G  +++CML NF GN+ MYG +D +  G   
Sbjct: 343 NVLILDLSSENRPMWGDKESVWYREKGFEGQDWLYCMLLNFGGNVGMYGRMDRVINGFYA 402

Query: 229 ARTSEN----------------------------------TTMAWINQYSVRRYGRSVPA 254
           A    N                                  T   W+  Y   RYG+  P 
Sbjct: 403 AVQHPNGASLRGVGKTMEGIENNPVMYELLLELPWRKIPFTKEEWLKGYVKARYGKDDPR 462

Query: 255 IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKS 314
           +Q AW +L    YNC          V  A P                        A   S
Sbjct: 463 LQQAWQILGKAAYNCPVVQEGTTESVFCARP------------------------AEEIS 498

Query: 315 ETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNI 374
             SS+    L+Y+  E  +   LF+    +   +N + YDL D+ RQALA   N L   I
Sbjct: 499 GASSWGTSELYYAPEESKKVAALFLEVSEQYKGNNNFEYDLTDIMRQALADKGNVLQKKI 558

Query: 375 IEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEW 434
            EAY+L D      LSR FL+L+   D LLA    F LG WLE AK   + EE+++ YEW
Sbjct: 559 TEAYRLKDETAFRNLSREFLQLILWQDTLLATRPEFRLGTWLERAKAKGETEEEKRLYEW 618

Query: 435 NARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 494
           NAR QIT W +    +   LRDY ++ W+GLL+D+Y PR   YF  ++E   +G+     
Sbjct: 619 NARVQITTWGNRQAADKGGLRDYSHREWAGLLKDFYYPRWKAYFD-LLEKRLAGEETEDI 677

Query: 495 DW 496
           DW
Sbjct: 678 DW 679


>gi|336412606|ref|ZP_08592959.1| hypothetical protein HMPREF1017_00067 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942652|gb|EGN04494.1| hypothetical protein HMPREF1017_00067 [Bacteroides ovatus
           3_8_47FAA]
          Length = 727

 Score =  246 bits (628), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 175/577 (30%), Positives = 269/577 (46%), Gaps = 93/577 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C + L+  D LF +I + F+++Q K +G T HIY  D F+E  PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PEY+  + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
           LD   E   +W  ++ F+  PYIWC L NF GN  + G                      
Sbjct: 371 LDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430

Query: 219 -------LDSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                  LD + F P E    +   +      WI   + R  G    +++DAW  L++ +
Sbjct: 431 IGSTLEGLDVMQF-PYEYILEKAWNLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 489

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y              V  P         T G    Y +P   +   K  ++ Y +  L  
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524

Query: 327 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
              EV R L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D  
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDHQ 575

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
            +   + +  E++ D+D L A H    L  W++ A+++  + + +  YE NAR  IT W 
Sbjct: 576 ALKACAEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE-- 499
                    L DY ++ W+GL+ DYY  R  +Y    I++ E G   D  +L+D  +E  
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYINTFIKAAEKGVEVDQKQLEDELKEIE 688

Query: 500 --WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 533
             W+  T+     ++V+   S  D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722


>gi|212537509|ref|XP_002148910.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
           18224]
 gi|210068652|gb|EEA22743.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
           18224]
          Length = 768

 Score =  245 bits (625), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 159/552 (28%), Positives = 269/552 (48%), Gaps = 77/552 (13%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G WG PLP +W+D Q  LQKKI+ R+ ELGM P+LPAF G VP A+  V P A +  
Sbjct: 212 NIQGSWGSPLPYAWVDSQFDLQKKIVKRMVELGMTPILPAFPGFVPRAITRVLPDADVIN 271

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
              W +  +   +     ++ TDP F EI ++FI +Q   YG  +  Y  D F+EN P  
Sbjct: 272 GSAWEAFPA--MFTSDTFMEPTDPHFTEIQKSFISKQTAAYGNVTTFYTLDQFNENNPSS 329

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPL-GKLVV 179
               Y+ S+    +  +++ D  AVW+MQGWL FS   FW   +++A L  V +   L+V
Sbjct: 330 GDLNYLRSVSHGTWQALKAADPSAVWVMQGWLFFSNSAFWTNDRVEAYLGGVTVDSDLLV 389

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDL +E +P W  +  ++G P+IWC +H++ GN+  YG + +I   P+ A  +   ++  
Sbjct: 390 LDLASESQPQWQRTNSYFGKPWIWCQIHDYGGNMGFYGQVMNITVNPIAALNNATASLVG 449

Query: 238 ---------------------AW----------INQYSVRRYG--RSVPA-IQDAWNVLY 263
                                AW           + +   RY   +S+P  +  AW++L 
Sbjct: 450 FGLSMEGQEGNEVVYDLLLDQAWSAKPIDTATYFHDWVTARYAGSKSIPTDVYSAWDMLR 509

Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP- 322
            +VYN T+ A++       A P     +I  T G     G                 HP 
Sbjct: 510 TSVYNNTNLASN-------AVPKAIFELIPSTTGLVNRTGH----------------HPT 546

Query: 323 HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 379
            L Y+ +++++A  LF ++  +   L  +  Y +DL+D++RQ LA     ++ ++I A+ 
Sbjct: 547 TLNYNPADMVKAWSLFYSAAFKEPSLWLNPAYEFDLVDMSRQVLANAFIPVYHDLIAAWN 606

Query: 380 LNDAHGVF--QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNAR 437
             +   +    +    + +++ +D +L  ++ F L  W+ +A+  A  +  E   E+NA 
Sbjct: 607 TTNPSTIRIQIIGAELIGILQAIDTILDTNEHFKLSTWISAARTSAGEQSLEDFLEYNAL 666

Query: 438 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR 497
            QIT+W    Q     + DY +K W+GL+  YY PR  ++ +Y++++  +   +    ++
Sbjct: 667 NQITLWGPTGQ-----ISDYASKSWAGLVSSYYIPRWKMFIEYLVDTKPA--QYNQTAFK 719

Query: 498 REWIKLTNDWQN 509
            E +K    WQN
Sbjct: 720 AELLKWELQWQN 731


>gi|325299497|ref|YP_004259414.1| alpha-N-acetylglucosaminidase [Bacteroides salanitronis DSM 18170]
 gi|324319050|gb|ADY36941.1| Alpha-N-acetylglucosaminidase [Bacteroides salanitronis DSM 18170]
          Length = 723

 Score =  244 bits (623), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 175/572 (30%), Positives = 267/572 (46%), Gaps = 95/572 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
           M+NL GWGGP P SW  QQ  LQKKIL R+ E G+ PVLP +SG VP  A Q +  +   
Sbjct: 190 MNNLEGWGGPNPDSWYTQQEALQKKILKRMREYGIEPVLPGYSGMVPHDAHQKLGLNVTE 249

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
            +L N F+  +         L  TD  F EI   + E+Q K +G+ ++ Y+ D F E  N
Sbjct: 250 PELWNGFTRPA--------FLMPTDKRFAEIAALYYEEQEKLFGKANY-YSMDPFHELEN 300

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
              VD      + G A+   M+  +  AVW++QGW  + +P  RP  MK L N    G L
Sbjct: 301 AGEVD----FDAAGKAVMDAMKQVNPKAVWVVQGW--TENP--RPEMMKNLKN----GDL 348

Query: 178 VVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSI-------AF 224
           ++LDLF+E +P      IW   K +    +++CML NF  N+ ++G +D +         
Sbjct: 349 LILDLFSECRPMWGIPSIWKREKGYEQHDWLFCMLENFGANVGLHGRMDQLLNNFYLTKN 408

Query: 225 GPVEA---------------------------RTSENTTMAWINQYSVRRYGRSVPAIQD 257
            P+ A                           R  + T  +W+ +Y   RYG     I+ 
Sbjct: 409 NPLAAHLKGIGLTMEGSENNPVMFELMCELPWRPEKITKESWLKEYLAARYGAKDEKIEQ 468

Query: 258 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 315
           AW +L   +YNC  G   +  +  +    P ++   +S +  K +NY  P S EA     
Sbjct: 469 AWMILADGIYNCPFGNNQQGPHESIFCGRPSMNNFQVS-SWSKMENYYDPTSTEA----- 522

Query: 316 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 375
                             A  L + + ++   +N + YDL+D+ RQALA     ++   I
Sbjct: 523 ------------------AARLMLEAADKFRGNNNFEYDLVDIVRQALADRGRIVYNRAI 564

Query: 376 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWN 435
             ++  D     + S+ FL L+   D LLA    F +G W+  A+ L    E++  YEWN
Sbjct: 565 ADFKSFDKRSYARHSKEFLNLLLAQDRLLATRSEFRVGRWINQARSLGNTPEEKDLYEWN 624

Query: 436 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD 495
           AR QIT W +    +   LRDY +K W+G+L+D+Y  R A +++ +   L   DG  ++D
Sbjct: 625 ARVQITTWGNRECADKGGLRDYAHKEWNGILKDFYYKRWAAWWEMLQGVL---DGGEMQD 681

Query: 496 WRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
              +W  +   W    N Y  E+ GD + T++
Sbjct: 682 --IDWYAMEEPWTLQHNPYKAEAEGDCIETAR 711


>gi|326437768|gb|EGD83338.1| lysosomal alpha-N-acetyl glucosaminidase [Salpingoeca sp. ATCC
           50818]
          Length = 820

 Score =  244 bits (623), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 169/581 (29%), Positives = 255/581 (43%), Gaps = 93/581 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  W  PL + W   Q  LQ KIL R  ELGM   LP F+G+VP A++ +FP A +T
Sbjct: 215 MGNLKYWAAPLDKDWRTSQYNLQLKILSRARELGMVSALPGFAGHVPTAIKRIFPHANLT 274

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q   W +  S   +    LL  TDPLF+++G  F +  +K +G T H++  DT++E  P 
Sbjct: 275 QTAGWANFNS--TYSDVSLLQPTDPLFLQLGTKFYKMLIKAFG-TDHVFQMDTYNEMQPS 331

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
             +   ++     +Y  M + D +AV+LMQGWLF ++ +W P  +K  L+ VP  K+++L
Sbjct: 332 FTNMTLLAESNRVVYQAMANADPEAVYLMQGWLF-HESYWTPEHVKVYLSGVPDDKMIIL 390

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DL  E  P++S +  ++G  +IW ML N+ G   +YG    I+  P+        TM   
Sbjct: 391 DLNTEANPVFSLTSDYFGKLWIWNMLLNYGGRRGLYGNATDISTRPLLDLHRAQGTMDGI 450

Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          WI  Y+  RYG+     Q AW +L   VY
Sbjct: 451 GITPEAIENNPVMFELMLEMGWHATPPDMHDWIAAYASSRYGKRESLTQSAWQLLLEHVY 510

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           +      D +R  +   PD     +S +E +  N                          
Sbjct: 511 D----QPDIDRFHMEMVPD-----LSSSESRNSN-------------------------- 535

Query: 328 TSEVIRALELFI--ASGNELSASNTYRYDLIDLTRQA-----------LAKYANELFLNI 374
           T+ +++A  L +  A    L  +  + YDL+D+ RQA           L  +  E   NI
Sbjct: 536 TTALVQAWRLLVTAAVNGSLPITGPFSYDLVDVGRQALLNLWSDVRGMLVAHVKEYNANI 595

Query: 375 IEAYQLNDAH--GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 432
             +     +H   +  L    L++  D+D LL     +LLG WLESAK  A N ++    
Sbjct: 596 DSSPSTAASHVPAIKSLFTLLLDITSDLDRLLGTDVNYLLGVWLESAKATAANADERATR 655

Query: 433 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 492
           E+NAR QIT+W  + +     + DY  K W GL+ DYY  R  +     + +L S     
Sbjct: 656 EFNARNQITLWGPDGE-----ITDYAAKQWQGLVSDYYVKRWEMMHDATLSALNSSTKID 710

Query: 493 LKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
                ++ +K    W N    YP     D +  S  +  KY
Sbjct: 711 TSA-PKDTLKFEQAWGNENKTYPTAPQADVVKVSAAMLQKY 750


>gi|340520426|gb|EGR50662.1| glycoside hydrolase family 89 [Trichoderma reesei QM6a]
          Length = 747

 Score =  244 bits (623), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 156/518 (30%), Positives = 264/518 (50%), Gaps = 69/518 (13%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G WGG LP+SW+D+Q  LQ KIL R+ ELG+ P+LPAF G VP  +  VFP   ++ 
Sbjct: 203 NIQGSWGGTLPRSWVDEQFSLQLKILKRMEELGITPILPAFPGFVPRNISRVFPDISLST 262

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
              W +  +         ++  DP F ++ + FI +Q + YG  ++ +  D F+EN P  
Sbjct: 263 SPIWSNFGT--TLSADIYINPFDPRFAQLQKLFINKQQELYGNVTNFWTLDQFNENRPLS 320

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGK-LVV 179
              +Y+ ++    ++ +++ D +AVW+MQ WLFS D  FW   +++ALL  VP+ + +++
Sbjct: 321 GDLDYLRNVSHNTWAALKAADPEAVWVMQAWLFSSDSSFWTNDRVEALLGGVPVNQDMLL 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENT--- 235
           LDLFAE  P W  +  FYG P+IWC LHN+ GN+ +YG ++++    ++A R S++    
Sbjct: 381 LDLFAESAPQWQRTDSFYGKPWIWCELHNYGGNMGLYGQIENVTINSMDAVRNSDSIVGF 440

Query: 236 ----------------------------TMAWINQYSVRRYG-RSVPAIQDAWNVLYHTV 266
                                       T  + + +   RYG ++V  +   W +L  TV
Sbjct: 441 GLTMEGQEGNEIMYDLLLDQAWSPKPIDTDTYFHDWVSARYGAKNVKGLYKGWEMLRPTV 500

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           +N T+   +  +  I+   ++ PSI  +  G+   +G  +  +  +  E  S        
Sbjct: 501 FNNTNLTVNAVQKSIL---ELTPSISGLL-GRTGRHGTTIMYDPAVMVEAWS-------- 548

Query: 327 STSEVIRALELFIASGNELSASN--TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
                    ELF A   +L+  N  +Y+YDL+D TRQ L     + + ++++AY  + + 
Sbjct: 549 ---------ELFKAGLQDLTLFNNPSYQYDLVDWTRQVLVNSFEDHYKDLVDAYNKSSSP 599

Query: 385 GVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            V +    + + L++ +D +LA +  F L PW++ A+  A +      +E+NAR QIT+W
Sbjct: 600 TVIRTRGAKLVTLLKTLDAVLATNKNFQLTPWIDRAR--ASSPSSANFFEFNARNQITLW 657

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 481
               Q E     DY +K W+GL+  YY  R   +  Y+
Sbjct: 658 GPQGQIE-----DYASKQWAGLVGTYYAERWQQFVDYL 690


>gi|393236266|gb|EJD43816.1| putative alpha-N-acetylglucosaminidase [Auricularia delicata
           TFB-10046 SS5]
          Length = 778

 Score =  244 bits (623), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 161/560 (28%), Positives = 264/560 (47%), Gaps = 87/560 (15%)

Query: 8   GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFS 67
           G  LP  W+D Q  LQKKI+ R+ ELGM P LP+F+G VP A+  V P A +     W S
Sbjct: 219 GSSLPMEWIDDQFELQKKIVRRMVELGMTPALPSFTGFVPRAISRVLPGASVVNGSRW-S 277

Query: 68  VKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYI 127
              D     T+L +  DP F  + ++FIE+Q+  YG  SH+Y  D ++EN P  +   Y+
Sbjct: 278 GFPDALTRVTFL-EPFDPAFARLQKSFIEKQIAAYGPVSHVYTLDQYNENDPLKNDVGYL 336

Query: 128 SSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLG-KLVVLDLFAE 185
             +  + +  +++ D DA+WLMQGWLF S   FW   +++A L  V     +++LDLF+E
Sbjct: 337 RDVSRSTWQSLKAADPDAIWLMQGWLFYSNRGFWTNARVEAFLGGVEKNDDMLILDLFSE 396

Query: 186 VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT---------- 235
            +P W  +  +YG P+IWC LH++ GN+ +YG + +I    VEA     +          
Sbjct: 397 SEPQWQRTNSYYGKPWIWCQLHDYGGNLGLYGQVMNITLNAVEALEKSPSLVGFGLTMEG 456

Query: 236 ----------------------TMAWINQYSVRRY------GRSVP-AIQDAWNVLYHTV 266
                                 T ++   ++ RRY      G  +P AI +AW++L  TV
Sbjct: 457 QEGNEIMYDLLLSQAWSRKPIDTASYFRSWATRRYNAGGIIGSLLPSAIYNAWDILRTTV 516

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YN T  A++    V  +  ++ P++  +                   +  + +    + Y
Sbjct: 517 YNNTKLASNA---VTKSVFELRPALSGI-------------------ANRTGHHATTITY 554

Query: 327 STSEVIRALELFIASGNELSA----SNTYRYDLIDLTRQALAKYANELFLNIIEAYQ--- 379
            T  +++A +LF  +     A    +  Y +D +D  RQ L+   +  + +++  Y    
Sbjct: 555 DTQALVKAYDLFDKAAIYTPALWFNNPAYEFDNVDFARQVLSNAFSTQYDDLVATYNEIS 614

Query: 380 --------LNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
                   L +A  +      R + ++  +D +L     F L  WL+ A+  A+   +E 
Sbjct: 615 KPGGSGATLAEAAKIIHDKGERMMGVLASLDKVLRTSKHFTLKKWLQDARAWARGGHEE- 673

Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
            +E+NAR QIT+W    Q     + DYG+K W GL+ +YY  R  I+F Y+   + +G  
Sbjct: 674 LFEYNARNQITLWGPTGQ-----INDYGSKAWGGLVSEYYAQRWRIFFTYLESVVAAGQP 728

Query: 491 FRLKDWRREWIKLTNDWQNG 510
           F L     +++    DWQ  
Sbjct: 729 FNLTAVGNQFLAFQLDWQTA 748


>gi|294647264|ref|ZP_06724861.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
 gi|292637401|gb|EFF55822.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
          Length = 733

 Score =  244 bits (622), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 168/566 (29%), Positives = 264/566 (46%), Gaps = 70/566 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLPQSWL  Q  LQK IL R     M P+LPAF+G+VPA L+ ++P AKI 
Sbjct: 196 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W       R   ++ +D  D L+  I R F+E+Q K YG T+HIY  D F+E   P
Sbjct: 256 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
             + E++S++   IY  +Q  DS A WL   W+F +    W  P++K+ LN+VP  KL++
Sbjct: 312 NWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP------------- 226
           LD + +   IW  ++Q+YG PYIWC L NF GN  + G L+ + F               
Sbjct: 372 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 431

Query: 227 -----------------VEARTSENT--TMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                            V  R  +N+     WI  ++  R G     I  AW  LY  +Y
Sbjct: 432 LGVTLEGLDVNPLMYEFVFERAWQNSMPVHQWIANWAQCRGGNVDNHIVKAWKQLYEKIY 491

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
             T  A      ++ A P ++      T   Y                   Y +  LW  
Sbjct: 492 --TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWEI 531

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
             E+++A  ++          + Y +D+I++ RQ L     +      + Y+  D  G  
Sbjct: 532 WKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKDLEGTK 582

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
              +R  +L+ D+D LL C   F +G W++ A+  A NE+++K YE NAR  +T+W    
Sbjct: 583 VWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFAVNEQEQKYYEENARCILTVW---- 638

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
            ++ + L DY N+ W GL R +Y  R   + + +I ++     F  + + ++  +   +W
Sbjct: 639 GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIAAMTRHKNFDEEKFHQDITQFEYEW 698

Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
                 +P+ S  + +  ++ L  KY
Sbjct: 699 TLKNEDFPIISEENPISLAKELILKY 724


>gi|423259033|ref|ZP_17239956.1| hypothetical protein HMPREF1055_02233 [Bacteroides fragilis
           CL07T00C01]
 gi|423263996|ref|ZP_17242999.1| hypothetical protein HMPREF1056_00686 [Bacteroides fragilis
           CL07T12C05]
 gi|387776613|gb|EIK38713.1| hypothetical protein HMPREF1055_02233 [Bacteroides fragilis
           CL07T00C01]
 gi|392706262|gb|EIY99385.1| hypothetical protein HMPREF1056_00686 [Bacteroides fragilis
           CL07T12C05]
          Length = 718

 Score =  244 bits (622), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 171/577 (29%), Positives = 263/577 (45%), Gaps = 103/577 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  +Q+ LQKKIL R++E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMHEYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++ ++  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSI----- 222
           G L++LDL +E +P W  S      K  YG   +I+CML N+ GN+ ++G +D++     
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 223 -------------AFGPVEARTSENTTM-----------------AWINQYSVRRYGRSV 252
                          G        N  M                  W+ +Y   RYG   
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463

Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                 SS+     +Y   EVI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
              +  AY+  D       S +FL+L+   D LL     F +G W+E A+ L    E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619

Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
            YEWNAR QIT W +    +   LRDY +K W+GLL+D+Y  R  +YF ++ + +E    
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRMEGKPP 679

Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
            ++     ++  +   W    N Y  E+ GD +  ++
Sbjct: 680 AKI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711


>gi|294807833|ref|ZP_06766618.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
           CC 1b]
 gi|294444952|gb|EFG13634.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
           CC 1b]
          Length = 703

 Score =  244 bits (622), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 167/566 (29%), Positives = 263/566 (46%), Gaps = 70/566 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLPQSWL  Q  LQK IL R     M P+LPAF+G+VPA L+ ++P AKI 
Sbjct: 166 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 225

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W       R   ++ +D  D L+  I R F+E+Q K YG T+HIY  D F+E   P
Sbjct: 226 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 281

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
             + E++S++   IY  +Q  DS A WL   W+F +    W  P++K+ LN+VP  KL++
Sbjct: 282 NWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 341

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP------------- 226
           LD + +   IW  ++Q+YG PYIWC L NF GN  + G L+ + F               
Sbjct: 342 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 401

Query: 227 -----------------VEARTSENT--TMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                            V  R  +N+     WI  ++  R G     I  AW  LY  +Y
Sbjct: 402 LGVTLEGLDVNPLMYEFVFERAWQNSMPVHQWIANWAQCRGGNVDNHIVKAWKQLYEKIY 461

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
             T  A      ++ A P ++                    E         Y +  LW  
Sbjct: 462 --TSAALCGQAVLMNARPQLE------------------GVEGWNTLPGYDYKNIDLWEI 501

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
             E+++A  ++          + Y +D+I++ RQ L     +      + Y+  D  G  
Sbjct: 502 WKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKDLEGTK 552

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
              +R  +L+ D+D LL C   F +G W++ A+  A NE+++K YE NAR  +T+W    
Sbjct: 553 VWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFAVNEQEQKYYEENARCILTVW---- 608

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
            ++ + L DY N+ W GL R +Y  R   + + +I ++     F  + + ++  +   +W
Sbjct: 609 GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIAAMTRHKNFDEEKFHQDITQFEYEW 668

Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
                 +P+ S  + +  ++ L  KY
Sbjct: 669 TLKNEDFPIISEENPISLAKELILKY 694


>gi|345511813|ref|ZP_08791352.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
 gi|229443748|gb|EEO49539.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
          Length = 720

 Score =  244 bits (622), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 168/566 (29%), Positives = 264/566 (46%), Gaps = 70/566 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLPQSWL  Q  LQK IL R     M P+LPAF+G+VPA L+ ++P AKI 
Sbjct: 183 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 242

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W       R   ++ +D  D L+  I R F+E+Q K YG T+HIY  D F+E   P
Sbjct: 243 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 298

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
             + E++S++   IY  +Q  DS A WL   W+F +    W  P++K+ LN+VP  KL++
Sbjct: 299 NWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 358

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP------------- 226
           LD + +   IW  ++Q+YG PYIWC L NF GN  + G L+ + F               
Sbjct: 359 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 418

Query: 227 -----------------VEARTSENT--TMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                            V  R  +N+     WI  ++  R G     I  AW  LY  +Y
Sbjct: 419 LGVTLEGLDVNPLMYEFVFERAWQNSMPVHQWIANWAQCRGGNVDNHIVKAWKQLYEKIY 478

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
             T  A      ++ A P ++      T   Y                   Y +  LW  
Sbjct: 479 --TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWEI 518

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
             E+++A  ++          + Y +D+I++ RQ L     +      + Y+  D  G  
Sbjct: 519 WKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKDLEGTK 569

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
              +R  +L+ D+D LL C   F +G W++ A+  A NE+++K YE NAR  +T+W    
Sbjct: 570 VWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFAVNEQEQKYYEENARCILTVW---- 625

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
            ++ + L DY N+ W GL R +Y  R   + + +I ++     F  + + ++  +   +W
Sbjct: 626 GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIAAMTRHKNFDEEKFHQDITQFEYEW 685

Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
                 +P+ S  + +  ++ L  KY
Sbjct: 686 TLKNEDFPIISEENPISLAKELILKY 711


>gi|262406058|ref|ZP_06082608.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
 gi|294806855|ref|ZP_06765680.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
           CC 1b]
 gi|345510563|ref|ZP_08790130.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
 gi|262356933|gb|EEZ06023.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
 gi|294445884|gb|EFG14526.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
           CC 1b]
 gi|345454460|gb|EEO49066.2| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
          Length = 727

 Score =  244 bits (622), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 175/577 (30%), Positives = 267/577 (46%), Gaps = 93/577 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C + L+  D LF +I + F+++Q K +G T HIY  D F+E  PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PEY+  + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
           LD   E   +W  ++ F+  PYIWC L NF GN  + G                      
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430

Query: 219 -------LDSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                  LD + F P E    +   +      WI   + R  G     ++DAW  L++ +
Sbjct: 431 IGSTLEGLDVMQF-PYEYILEKAWNLNVDDNKWIECLADRHVGCVSQPVRDAWKRLFNDI 489

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y              V  P         T G    Y +P   +   K  ++ Y +  L  
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524

Query: 327 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
              EV R L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D  
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDHQ 575

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
            +     +  E++ D+D L A H    L  W++ A+++  + + +  YE NAR  IT W 
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE-- 499
                    L DY ++ W+GL+ DYY  R  +Y    I++ E G   D  +L+D  +E  
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYVNTFIKAAEEGVEVDQKQLEDELKEIE 688

Query: 500 --WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 533
             W+  T+     ++V+   S  D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722


>gi|262407713|ref|ZP_06084261.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
 gi|262354521|gb|EEZ03613.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
          Length = 735

 Score =  243 bits (621), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 167/566 (29%), Positives = 263/566 (46%), Gaps = 70/566 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLPQSWL  Q  LQK IL R     M P+LPAF+G+VPA L+ ++P AKI 
Sbjct: 198 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W       R   ++ +D  D L+  I R F+E+Q K YG T+HIY  D F+E   P
Sbjct: 258 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 313

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
             + E++S++   IY  +Q  DS A WL   W+F +    W  P++K+ LN+VP  KL++
Sbjct: 314 NWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 373

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP------------- 226
           LD + +   IW  ++Q+YG PYIWC L NF GN  + G L+ + F               
Sbjct: 374 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 433

Query: 227 -----------------VEARTSENT--TMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                            V  R  +N+     WI  ++  R G     I  AW  LY  +Y
Sbjct: 434 LGVTLEGLDVNPLMYEFVFERAWQNSMPVHQWIANWAQCRGGNVDNHIVKAWKQLYEKIY 493

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
             T  A      ++ A P ++                    E         Y +  LW  
Sbjct: 494 --TSAALCGQAVLMNARPQLE------------------GVEGWNTLPGYDYKNIDLWEI 533

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
             E+++A  ++          + Y +D+I++ RQ L     +      + Y+  D  G  
Sbjct: 534 WKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKDLEGTK 584

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
              +R  +L+ D+D LL C   F +G W++ A+  A NE+++K YE NAR  +T+W    
Sbjct: 585 VWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFAVNEQEQKYYEENARCILTVW---- 640

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
            ++ + L DY N+ W GL R +Y  R   + + +I ++     F  + + ++  +   +W
Sbjct: 641 GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIAAMTRHKNFDEEKFHQDITQFEYEW 700

Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
                 +P+ S  + +  ++ L  KY
Sbjct: 701 TLKNEDFPIISEENPISLAKELILKY 726


>gi|83775903|dbj|BAE66022.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 633

 Score =  243 bits (620), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 160/532 (30%), Positives = 257/532 (48%), Gaps = 80/532 (15%)

Query: 1   MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
           + N+ G WGG  +  +W++ Q  LQKKI+ RI ELGM PVLPAF G VP A++ V P A 
Sbjct: 73  LGNIQGSWGGHGVSIAWIEAQFELQKKIVSRIVELGMTPVLPAFPGFVPPAIKRVRPHAT 132

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           +     W   +   ++     L+  D  F ++ ++ I +Q + +G  +H+Y  D F+E  
Sbjct: 133 VVNGSQWSGFQK--KFTEVSFLNPLDETFAQLQKSVISRQTRAFGNVTHVYALDQFNEIN 190

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP--FWRPPQMKALLNSVPLGK 176
           P      Y+ +L    +  +++ +  AVW+MQGWLF YD   FW P ++ A L+ V    
Sbjct: 191 PASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YDKKDFWDPNRISAYLSGVERND 249

Query: 177 -LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
            +++LDL++E KP W  ++ ++G P+IWC LH+F GN+ MYG + +I   P+EA    ++
Sbjct: 250 DMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMGMYGQIMNITSDPIEALNKSDS 309

Query: 236 --------------------------------TMAWINQYSVRRYGR--SVP-AIQDAWN 260
                                           T A+   +   RY    SVP  +  AW+
Sbjct: 310 LVGFGLTMEGQEGNEIVYDLLLDQAWSAKPIDTRAYFQSWVRSRYSGNFSVPNELYTAWD 369

Query: 261 VLYHTVYNCTDGAT-DKNRDVIVAFPDVDPSIISVTEGKYQ-----NYGKPVSKEAVLKS 314
           +L  TVYN T+  T    + +    PD+   +  V  G Y      NY   V  E     
Sbjct: 370 LLRKTVYNNTNLTTYSLTKSIFEISPDIAGLVGRV--GHYPTPTSINYDPMVLNEVWSLF 427

Query: 315 ETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNI 374
             ++   P LW+S +                     Y YD++D+TRQ +      ++ ++
Sbjct: 428 MNATRKEPSLWHSPA---------------------YEYDMVDITRQLMGNAFVNVYSDL 466

Query: 375 IEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
           I +++    N    V   S R L L+  +D +L+C++ F L  W+ SA+      E +  
Sbjct: 467 ISSWKSETENRTTNVTSQSERLLNLLSAIDKVLSCNENFSLTTWISSARDWGNTTETKDF 526

Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 483
           +E+NAR QIT+W    +     + DY +K W+GL+  YY PR +I+  Y+ E
Sbjct: 527 FEYNARNQITLWGPTGE-----ISDYASKAWAGLISSYYKPRWSIFVDYLGE 573


>gi|53711968|ref|YP_097960.1| alpha-N-acetylglucosaminidase [Bacteroides fragilis YCH46]
 gi|52214833|dbj|BAD47426.1| alpha-N-acetylglucosaminidase precursor [Bacteroides fragilis
           YCH46]
          Length = 718

 Score =  243 bits (620), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 171/577 (29%), Positives = 262/577 (45%), Gaps = 103/577 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  +Q+ LQKKIL R++E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMHEYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++ ++  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSI----- 222
           G L++LDL +E +P W  S      K  YG   +I+CML N+ GN+ ++G +D++     
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 223 -------------AFGPVEARTSENTTM-----------------AWINQYSVRRYGRSV 252
                          G        N  M                  W+ +Y   RYG   
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463

Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                 SS+     +Y   EVI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
              +  AY+  D       S +FL+L+   D LL     F +G W+E A+ L    E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619

Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
            YEWNAR QIT W +    +   LRDY +K W+GLL+D+Y  R  +YF ++ + +E    
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRIEGKTP 679

Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
             +     ++  +   W    N Y  E+ GD +  ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711


>gi|313145188|ref|ZP_07807381.1| glycoside hydrolase family 89 [Bacteroides fragilis 3_1_12]
 gi|313133955|gb|EFR51315.1| glycoside hydrolase family 89 [Bacteroides fragilis 3_1_12]
          Length = 718

 Score =  243 bits (619), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 173/577 (29%), Positives = 262/577 (45%), Gaps = 103/577 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ+ LQKKIL R+ E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTQQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ +  Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ADFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++ ++  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLGA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSI----- 222
           G L++LDL +E +P W  S      K  YG   +++CML N+ GN+ ++G +D++     
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 223 --AFGPVEARTSENTTMA----------------------------WINQYSVRRYGRSV 252
                P    T +   MA                            W+ +Y   RYG   
Sbjct: 404 LAKADPHAGSTLKGVGMAPEGIENNPVMYELVMELPWRAERFTKEEWLKEYVKARYGADD 463

Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTKLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                 SS+     +Y   +VI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQDVIEAARLMVSVADRYKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
              +  AY+  D       S++FL L+   D LL     F +G W+E A+ L    E++ 
Sbjct: 560 QKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTRPEFRVGKWIEEARALGGTSEEKA 619

Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
            YEWNAR QIT W +    +   LRDY +K W+GLL+D+Y  R  +YF  + + LE    
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDSLSQKLEGKTP 679

Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
            ++     ++  +   W    N Y  E+ GD + T++
Sbjct: 680 EKI-----DFYAVEEPWAKATNPYSAEAEGDCIETAK 711


>gi|423248659|ref|ZP_17229675.1| hypothetical protein HMPREF1066_00685 [Bacteroides fragilis
           CL03T00C08]
 gi|423253608|ref|ZP_17234539.1| hypothetical protein HMPREF1067_01183 [Bacteroides fragilis
           CL03T12C07]
 gi|392655237|gb|EIY48880.1| hypothetical protein HMPREF1067_01183 [Bacteroides fragilis
           CL03T12C07]
 gi|392657600|gb|EIY51231.1| hypothetical protein HMPREF1066_00685 [Bacteroides fragilis
           CL03T00C08]
          Length = 718

 Score =  243 bits (619), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 171/577 (29%), Positives = 262/577 (45%), Gaps = 103/577 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  +Q+ LQKKIL R++E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMHEYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++ ++  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSI----- 222
           G L++LDL +E +P W  S      K  YG   +I+CML N+ GN+ ++G +D++     
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 223 -------------AFGPVEARTSENTTM-----------------AWINQYSVRRYGRSV 252
                          G        N  M                  W+ +Y   RYG   
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463

Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                 SS+     +Y   EVI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
              +  AY+  D       S +FL+L+   D LL     F +G W+E A+ L    E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619

Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
            YEWNAR QIT W +    +   LRDY +K W+GLL+D+Y  R  +YF ++ + +E    
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRIEGKTP 679

Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
             +     ++  +   W    N Y  E+ GD +  ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711


>gi|383115203|ref|ZP_09935961.1| hypothetical protein BSGG_2915 [Bacteroides sp. D2]
 gi|313695380|gb|EFS32215.1| hypothetical protein BSGG_2915 [Bacteroides sp. D2]
          Length = 727

 Score =  243 bits (619), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 173/574 (30%), Positives = 261/574 (45%), Gaps = 87/574 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C + L+  D LF +I + F+++Q K +G T HIY  D F+E  PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PEY+  + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
           LD   E   +W  ++ F+  PYIWC L NF GN  + G                      
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430

Query: 219 -------LDSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                  LD + F P E    +   +      WI   + R  G    +++DAW  L++ +
Sbjct: 431 IGSTLEGLDVMQF-PYEYILEKAWNLNADDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 489

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y              V  P         T G    Y +P   +   K  ++ Y +  L  
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524

Query: 327 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
              EV R L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D  
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFFDVKVEFDRMVEAKDYQ 575

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
            +     +  E++ D+D L A H    L  W++ A+++  + + +  YE NAR  IT W 
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                    L DY ++ W+GL+ DYY  R  +Y    I++ E G     K    E  ++ 
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYINTFIKAAEKGVEVDQKQLEDELKEIE 688

Query: 505 NDWQNGRNVYPVE----SNGDALIT-SQWLYNKY 533
             W N  +   V     S  D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDVRKDIHSATDGLLSFSTFLFSKY 722


>gi|336386984|gb|EGO28130.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 738

 Score =  242 bits (618), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 158/536 (29%), Positives = 267/536 (49%), Gaps = 80/536 (14%)

Query: 7   WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNW- 65
           WGG LP+ W++ Q  LQK+I+ R+ ELGM PVLP+F+G VP A+  ++P+A I     W 
Sbjct: 191 WGGDLPEQWINDQFALQKQIISRMVELGMTPVLPSFTGFVPRAMHTLYPNASIVNGSQWN 250

Query: 66  -FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSP 124
            F+++    +     L+  DPLF  +  +FI +Q+  YG  SH+Y  D ++EN+P     
Sbjct: 251 GFTIQ----YTNDSFLEPFDPLFSTLQTSFISKQVAAYGNVSHVYTLDQYNENSPYSGDT 306

Query: 125 EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLG-KLVVLDL 182
            Y++++ AA ++ +++ D  AVWLMQGWLF  D  FW   +++A L  VP    +++LDL
Sbjct: 307 SYLANVTAATFASLRAADPQAVWLMQGWLFYSDSTFWTTERVEAYLGGVPGNDSMIILDL 366

Query: 183 FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM----- 237
           ++E +P W     ++G  +IWC LH++ GN+   G  +++   P++A  +   +M     
Sbjct: 367 YSEAQPQWQRLNSYFGKQWIWCELHDYGGNMGFEGNFENVTTQPIKALATPGNSMVGMGL 426

Query: 238 ----------------------------AWINQYSVRRYGRSVP----AIQDAWNVLYHT 265
                                       A+I+ ++ RRY  +VP    A  +AW +L  T
Sbjct: 427 TMEGQEGNEIIYDVLLDQAWSSTPLNRTAYISAWASRRY--NVPDLPTAALEAWEILGAT 484

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VYN  D  T      I+   ++ PSI                    L + T ++     +
Sbjct: 485 VYNNQDVTTQSTVKSIL---ELSPSITG------------------LVNRTGTHSTKLFY 523

Query: 326 YSTSEVIRALELFIASGNELSA-SNT--YRYDLIDLTRQALAKYANELFLNIIEAY--QL 380
            + + ++ AL+L + +  E SA SN   ++YD++D+TRQ LA    +L+ ++I+ +    
Sbjct: 524 DTNTTIVPALKLLLQARQEASALSNIPEFQYDVVDVTRQLLANRFIDLYTSLIDTFSSTS 583

Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ--NEEQEKQYEWNART 438
           + +  V       L L++D+D +L     FLL  W+ +A+      N       E+NAR 
Sbjct: 584 SSSSAVSAAGAPLLALLQDLDSVLLTDTHFLLARWISAARNWTHGDNATYAAYLEYNARN 643

Query: 439 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 494
           Q+T+W    +     + DY +K W GL+  YY  R   +  Y+  S E+   + + 
Sbjct: 644 QVTLWGPRGE-----VNDYASKQWGGLVGTYYVQRWETFVGYLAGSKENATVYNVS 694


>gi|298480128|ref|ZP_06998327.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
 gi|336404356|ref|ZP_08585054.1| hypothetical protein HMPREF0127_02367 [Bacteroides sp. 1_1_30]
 gi|298273937|gb|EFI15499.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
 gi|335943684|gb|EGN05523.1| hypothetical protein HMPREF0127_02367 [Bacteroides sp. 1_1_30]
          Length = 727

 Score =  242 bits (618), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 172/574 (29%), Positives = 261/574 (45%), Gaps = 87/574 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C + L+  D LF +I + F+++Q K +G T HIY  D F+E  PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PEY+  + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
           LD   E   +W  ++ F+  PYIWC L NF GN  + G                      
Sbjct: 371 LDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430

Query: 219 -------LDSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                  LD + F P E    +   +      WI   + R  G    +++DAW  L++ +
Sbjct: 431 IGSTLEGLDVMQF-PYEYILEKAWNLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 489

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y              V  P         T G    Y +P   +   K  ++ Y +  L  
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524

Query: 327 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
              EV R L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D  
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDHQ 575

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
            +     +  E++ D+D L A H    L  W++ A+++  + + +  YE NAR  IT W 
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                    L DY ++ W+GL+ DYY  R  +Y    I+++  G     K    E  ++ 
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYIDTFIKAVGEGVEVDQKQLEDELKEIE 688

Query: 505 NDWQNGRNVYPVE----SNGDALIT-SQWLYNKY 533
             W N  +   V     S  D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDVRKDVHSTTDGLLSFSTFLFSKY 722


>gi|391873368|gb|EIT82411.1| alpha-N-acetylglucosaminidase [Aspergillus oryzae 3.042]
          Length = 633

 Score =  242 bits (618), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 159/535 (29%), Positives = 262/535 (48%), Gaps = 84/535 (15%)

Query: 1   MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
           + N+ G WGG  +  +W++ Q  LQKKI+ RI ELGM PVLPAF G VP A++ V P A 
Sbjct: 73  LGNIQGSWGGHGVSIAWIEAQFELQKKIVSRIVELGMTPVLPAFPGFVPPAIKRVRPHAT 132

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           +     W   +   ++     L   D  F ++ ++ I +Q++ +G  +H+Y  D F+E  
Sbjct: 133 VVNGSQWSGFQK--KFTEVSFLSPLDRTFADLQKSVISRQMRAFGNITHVYALDQFNEIN 190

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP--FWRPPQMKALLNSVPLGK 176
           P      Y+ +L    +  +++ +  AVW+MQGWLF YD   FW   ++ A L+ V    
Sbjct: 191 PASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YDKKDFWDSNRISAYLSGVERND 249

Query: 177 -LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
            +++LDL++E KP W  ++ ++G P+IWC LH+F GN+ MYG + +I   P+EA    N+
Sbjct: 250 DMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMGMYGQIMNITSDPIEALNKSNS 309

Query: 236 --------------------------------TMAWINQYSVRRYGR--SVP-AIQDAWN 260
                                           T A+   +   RY R  SVP  +  AW+
Sbjct: 310 LVGFGLTMEGQEGNEIVYDLLLDQAWSATPIDTRAYFQSWVRSRYSRNFSVPNELYTAWD 369

Query: 261 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS--KEAVLKSET-- 316
           +L  TVYN T+  T     V  +  ++ P I  +  G+  +Y  P S   + ++ +E   
Sbjct: 370 LLRKTVYNNTNLTT---YSVTKSIFEISPDIAGLV-GRVGHYPTPTSINYDPMVLNEVWS 425

Query: 317 ----SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 372
               ++   P LW++ +                     Y YD++D+TRQ +      ++ 
Sbjct: 426 LFMNATRKEPSLWHNPA---------------------YEYDMVDITRQLMGNAFVNVYS 464

Query: 373 NIIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 429
            +I +++    N    V   S R L L+  +D +L+C++ F L  W+ SA+      E +
Sbjct: 465 VLITSWKSETENRTTKVTSQSERLLNLLSAIDKVLSCNENFSLATWISSARDWGNTTETK 524

Query: 430 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
             +E+NAR QIT+W    +     + DY +K W+GL+  YY PR +I+  Y+ E+
Sbjct: 525 DFFEYNARNQITLWGPTGE-----ISDYASKAWAGLISSYYKPRWSIFVDYLGEN 574


>gi|336374066|gb|EGO02404.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 761

 Score =  242 bits (618), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 158/536 (29%), Positives = 267/536 (49%), Gaps = 80/536 (14%)

Query: 7   WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNW- 65
           WGG LP+ W++ Q  LQK+I+ R+ ELGM PVLP+F+G VP A+  ++P+A I     W 
Sbjct: 214 WGGDLPEQWINDQFALQKQIISRMVELGMTPVLPSFTGFVPRAMHTLYPNASIVNGSQWN 273

Query: 66  -FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSP 124
            F+++    +     L+  DPLF  +  +FI +Q+  YG  SH+Y  D ++EN+P     
Sbjct: 274 GFTIQ----YTNDSFLEPFDPLFSTLQTSFISKQVAAYGNVSHVYTLDQYNENSPYSGDT 329

Query: 125 EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLG-KLVVLDL 182
            Y++++ AA ++ +++ D  AVWLMQGWLF  D  FW   +++A L  VP    +++LDL
Sbjct: 330 SYLANVTAATFASLRAADPQAVWLMQGWLFYSDSTFWTTERVEAYLGGVPGNDSMIILDL 389

Query: 183 FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM----- 237
           ++E +P W     ++G  +IWC LH++ GN+   G  +++   P++A  +   +M     
Sbjct: 390 YSEAQPQWQRLNSYFGKQWIWCELHDYGGNMGFEGNFENVTTQPIKALATPGNSMVGMGL 449

Query: 238 ----------------------------AWINQYSVRRYGRSVP----AIQDAWNVLYHT 265
                                       A+I+ ++ RRY  +VP    A  +AW +L  T
Sbjct: 450 TMEGQEGNEIIYDVLLDQAWSSTPLNRTAYISAWASRRY--NVPDLPTAALEAWEILGAT 507

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VYN  D  T      I+   ++ PSI                    L + T ++     +
Sbjct: 508 VYNNQDVTTQSTVKSIL---ELSPSITG------------------LVNRTGTHSTKLFY 546

Query: 326 YSTSEVIRALELFIASGNELSA-SNT--YRYDLIDLTRQALAKYANELFLNIIEAY--QL 380
            + + ++ AL+L + +  E SA SN   ++YD++D+TRQ LA    +L+ ++I+ +    
Sbjct: 547 DTNTTIVPALKLLLQARQEASALSNIPEFQYDVVDVTRQLLANRFIDLYTSLIDTFSSTS 606

Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ--NEEQEKQYEWNART 438
           + +  V       L L++D+D +L     FLL  W+ +A+      N       E+NAR 
Sbjct: 607 SSSSAVSAAGAPLLALLQDLDSVLLTDTHFLLARWISAARNWTHGDNATYAAYLEYNARN 666

Query: 439 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 494
           Q+T+W    +     + DY +K W GL+  YY  R   +  Y+  S E+   + + 
Sbjct: 667 QVTLWGPRGE-----VNDYASKQWGGLVGTYYVQRWETFVGYLAGSKENATVYNVS 717


>gi|260642393|ref|ZP_05415712.2| alpha-N-acetylglucosaminidase [Bacteroides finegoldii DSM 17565]
 gi|260622285|gb|EEX45156.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides finegoldii DSM
           17565]
          Length = 735

 Score =  242 bits (618), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 168/566 (29%), Positives = 263/566 (46%), Gaps = 70/566 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLPQSWL  Q  LQK IL R     M P+LPAF+G+VPA L+ ++P AKI 
Sbjct: 198 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W       R   ++ +D  D L+  I   F+E+Q K YG T+HIY  D F+E   P
Sbjct: 258 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQHRFLEEQTKVYG-TNHIYGIDPFNEVDSP 313

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
             + E++S++   IY  +QS DS A WL   W+F +    W  P++K+ LN+VP  KL++
Sbjct: 314 NWNEEFLSNVSDKIYKSIQSVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 373

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP------------- 226
           LD + +   IW  ++Q+YG PYIWC L NF GN  + G L+ + F               
Sbjct: 374 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 433

Query: 227 -----------------VEARTSENT--TMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                            V  R  EN+     WI  ++  R G     I  AW  LY  +Y
Sbjct: 434 LGVTLEGLDVNPLMYEFVFERAWENSIPVHQWIANWAQCRGGNVDNHIIKAWKQLYEKIY 493

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
             T  A      ++ A P ++      T   Y                   Y +  LW  
Sbjct: 494 --TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWEI 533

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
             E+++A  ++          + Y +D+I++ RQ L     +      + Y+  D  G  
Sbjct: 534 WKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFADCYRKKDLEGTK 584

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
              +R  +L+ D+D LL C     +G W++ A+  A NE+++K YE NAR  +T+W    
Sbjct: 585 VWGQRMDQLLLDVDRLLCCSPVLSIGKWIKDARDFAVNEQEQKYYEENARCILTVW---- 640

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
            ++ + L DY N+ W GL R +Y  R   + + +I ++     F  + + ++  +   +W
Sbjct: 641 GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIAAMTRHKNFDEEKFHQDITQFEYEW 700

Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
                 +P+ S  + +  ++ L  KY
Sbjct: 701 TLKNEDFPITSGENPISLAKELILKY 726


>gi|423293377|ref|ZP_17271504.1| hypothetical protein HMPREF1070_00169 [Bacteroides ovatus
           CL03T12C18]
 gi|392678320|gb|EIY71728.1| hypothetical protein HMPREF1070_00169 [Bacteroides ovatus
           CL03T12C18]
          Length = 727

 Score =  242 bits (618), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 172/577 (29%), Positives = 267/577 (46%), Gaps = 93/577 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C + L+  D LF +I + F+++Q K +G T HIY  D F+E  PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PEY+  + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
           LD   E   +W  ++ F+  PYIWC L NF GN  + G                      
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430

Query: 219 -------LDSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                  LD + F P E    +   +      WI   + R  G     ++DAW  L++ +
Sbjct: 431 IGSTLEGLDVMQF-PYEYILEKAWNLNVDDNKWIECLADRHVGCVSQPVRDAWKRLFNDI 489

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y                      + +  T G    Y +P   +   K  ++ Y +  L  
Sbjct: 490 Y----------------------AQVPRTLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524

Query: 327 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
              EV R L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D  
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQ 575

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
            +     +  E++ D+D L A H    L  W++ A+++  + + +  YE NAR  IT W 
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE-- 499
                    L DY ++ W+GL+ DYY  R  +Y    I+++  G   D  +L+D  +E  
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYIDTFIKAVGEGVEVDQKQLEDELKEIE 688

Query: 500 --WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 533
             W+  T+     ++V+   S  D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722


>gi|380697007|ref|ZP_09861866.1| alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
          Length = 703

 Score =  242 bits (618), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 169/566 (29%), Positives = 263/566 (46%), Gaps = 70/566 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLPQSWL  Q  LQK IL R     M P+LPAF+G+VPA L+ ++P AKI 
Sbjct: 166 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 225

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W       R   ++ +D  D L+  I R F+E+Q K YG T+HIY  D F+E   P
Sbjct: 226 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 281

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
             + E++S++   IY  +Q  DS A WL   W+F +    W  P++K+ LN+VP  KL++
Sbjct: 282 NWNEEFLSNVSDKIYKSIQDVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 341

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP------------- 226
           LD + +   IW  ++Q+YG PYIWC L NF GN  + G L+ + F               
Sbjct: 342 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 401

Query: 227 -----------------VEARTSENTTMA--WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                            V  R  EN+  A  WI  ++  R G     I  AW  LY  +Y
Sbjct: 402 LGVTLEGLDVNPLMYEFVFERAWENSMPAHQWIANWAQCRGGNVDNHIVKAWKQLYEKIY 461

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
             T  A      ++ A P ++      T   Y                   Y +  LW  
Sbjct: 462 --TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWEI 501

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
             E+++A  ++          + Y +D+I++ RQ L     +      + Y+        
Sbjct: 502 WKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKKLEETK 552

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
              +R  +L+ D+D LL C   F +G W++ AK  A NE+++K YE NAR  +T+W    
Sbjct: 553 VWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDAKDFAVNEQEQKYYEENARCILTVW---- 608

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
            ++ + L DY N+ W GL R +Y  R   + + +I ++     F  + + ++  +   +W
Sbjct: 609 GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIAAMTRHKNFDEEKFHQDITQFEYEW 668

Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
                 +P+ S  + +  ++ L  KY
Sbjct: 669 TLKNEDFPITSEENPISLAKELILKY 694


>gi|403416059|emb|CCM02759.1| predicted protein [Fibroporia radiculosa]
          Length = 705

 Score =  242 bits (618), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 162/577 (28%), Positives = 279/577 (48%), Gaps = 74/577 (12%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G W G LP  W++ Q  LQ++I+ R+ ELGM PVLPAF+G VP A+  ++P+A I  
Sbjct: 158 NIQGSWSGALPTQWINDQWALQQQIVQRMVELGMTPVLPAFTGFVPRAMSTLYPNASIVN 217

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTPP 120
              W    S   +  T  L+  DPLF  + ++FI +Q   YG   SH+Y  D ++EN P 
Sbjct: 218 GSQWEGFPSTLTY--TTFLEPFDPLFTTMQKSFISKQQAAYGANVSHVYTLDQYNENDPY 275

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLG-KLV 178
                Y++++ A  ++ +Q+ D +AVW+MQGWL F+ + FW   ++ A L +VP    ++
Sbjct: 276 SGDVGYLANISAGTFASLQAADPEAVWMMQGWLFFASEAFWTTERIAAFLGAVPSNDSMI 335

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT-- 236
           +LDL++E  P W  +  +YG  +IWC LH+F GN+   G L  +  GP++A ++ ++   
Sbjct: 336 ILDLYSEAAPQWQRTDSYYGKQWIWCELHDFGGNMGFEGNLPELVTGPIQALSNASSMRG 395

Query: 237 --------------------MAW----------INQYSVRRYG-RSVP-AIQDAWNVLYH 264
                                AW          +  +  RRY  + +P A Q+AW +L  
Sbjct: 396 MGLTPEGQEGNEIVYDILLDQAWSSTSIDIASYVEAWVARRYTVQDLPSAAQEAWTILST 455

Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           TVY+ +D             P+   +I S+ E         ++ +    ++ + +    +
Sbjct: 456 TVYSNSD-------------PNTQATIKSIFE---------LAPDLSGLTDRTGHHCTEI 493

Query: 325 WYSTS-EVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
            Y T+  ++ AL+  + +  E   L +   + YD++D+TRQ LA    +++  ++  +  
Sbjct: 494 PYDTNITIVPALQNLVQAATENPLLLSVPEFMYDVVDVTRQLLANRFIDVYNELVSTFYS 553

Query: 381 N--DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-EWNAR 437
               A  V    +  L ++ D+D LL  +D FLL  W+  A  L+ N      Y E+NAR
Sbjct: 554 TGVTAASVKNAGQPLLTILSDVDTLLWTNDNFLLSNWILGAINLSDNNGTYADYLEYNAR 613

Query: 438 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR 497
            QIT+W  + +     + DY +K W+G +  YY  R  ++  Y+ +  ++G  +     +
Sbjct: 614 NQITLWGPDGE-----INDYASKQWAGFVGTYYYDRWNMFITYLEDITQNGTAYNDTAIQ 668

Query: 498 REWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
              +    +W           +GD +     L  K+L
Sbjct: 669 TVMLNFGKEWDTQTYSLSATVSGDTMSIVDSLIQKWL 705


>gi|317158657|ref|XP_001827155.2| alpha-N-acetylglucosaminidase [Aspergillus oryzae RIB40]
          Length = 849

 Score =  242 bits (617), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 160/532 (30%), Positives = 257/532 (48%), Gaps = 80/532 (15%)

Query: 1   MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
           + N+ G WGG  +  +W++ Q  LQKKI+ RI ELGM PVLPAF G VP A++ V P A 
Sbjct: 157 LGNIQGSWGGHGVSIAWIEAQFELQKKIVSRIVELGMTPVLPAFPGFVPPAIKRVRPHAT 216

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           +     W   +   ++     L+  D  F ++ ++ I +Q + +G  +H+Y  D F+E  
Sbjct: 217 VVNGSQWSGFQK--KFTEVSFLNPLDETFAQLQKSVISRQTRAFGNVTHVYALDQFNEIN 274

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP--FWRPPQMKALLNSVPLGK 176
           P      Y+ +L    +  +++ +  AVW+MQGWLF YD   FW P ++ A L+ V    
Sbjct: 275 PASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YDKKDFWDPNRISAYLSGVERND 333

Query: 177 -LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
            +++LDL++E KP W  ++ ++G P+IWC LH+F GN+ MYG + +I   P+EA    ++
Sbjct: 334 DMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMGMYGQIMNITSDPIEALNKSDS 393

Query: 236 --------------------------------TMAWINQYSVRRYGR--SVP-AIQDAWN 260
                                           T A+   +   RY    SVP  +  AW+
Sbjct: 394 LVGFGLTMEGQEGNEIVYDLLLDQAWSAKPIDTRAYFQSWVRSRYSGNFSVPNELYTAWD 453

Query: 261 VLYHTVYNCTDGAT-DKNRDVIVAFPDVDPSIISVTEGKYQ-----NYGKPVSKEAVLKS 314
           +L  TVYN T+  T    + +    PD+   +  V  G Y      NY   V  E     
Sbjct: 454 LLRKTVYNNTNLTTYSLTKSIFEISPDIAGLVGRV--GHYPTPTSINYDPMVLNEVWSLF 511

Query: 315 ETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNI 374
             ++   P LW+S +                     Y YD++D+TRQ +      ++ ++
Sbjct: 512 MNATRKEPSLWHSPA---------------------YEYDMVDITRQLMGNAFVNVYSDL 550

Query: 375 IEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
           I +++    N    V   S R L L+  +D +L+C++ F L  W+ SA+      E +  
Sbjct: 551 ISSWKSETENRTTNVTSQSERLLNLLSAIDKVLSCNENFSLTTWISSARDWGNTTETKDF 610

Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 483
           +E+NAR QIT+W    +     + DY +K W+GL+  YY PR +I+  Y+ E
Sbjct: 611 FEYNARNQITLWGPTGE-----ISDYASKAWAGLISSYYKPRWSIFVDYLGE 657


>gi|423214204|ref|ZP_17200732.1| hypothetical protein HMPREF1074_02264 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392693149|gb|EIY86384.1| hypothetical protein HMPREF1074_02264 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 727

 Score =  242 bits (617), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 170/574 (29%), Positives = 260/574 (45%), Gaps = 87/574 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C + L+  D LF +I + F+++Q K +G T H+Y  D F+E  PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHVYGLDPFNEVDPP 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PEY+  + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
           LD   E   +W  ++ F+  PYIWC L NF GN  + G                      
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGERLENALINGGGNLKG 430

Query: 219 -------LDSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                  LD + F P E    +   +      WI   + R  G     ++DAW  L++ +
Sbjct: 431 IGSTLEGLDVMQF-PYEYILEKAWNLNVDDDKWIECLADRHVGCVSQPVRDAWKRLFNDI 489

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y                      + +  T G    Y +P   +   K  ++ Y +  L  
Sbjct: 490 Y----------------------AQVPRTLGTLPGY-RPALNKNSEKRTSNVYSNIEL-- 524

Query: 327 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
              EV R L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D  
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQ 575

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
            +     +  E++ D+D L A H    L  W++ A+++  + + +  YE NAR  IT W 
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                    L DY ++ W+GL+ DYY  R  +Y    I++ E G     K    E  ++ 
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYIDTFIKAAEKGVEVDQKQLEDELKEIE 688

Query: 505 NDWQNGRNVYPVE----SNGDALIT-SQWLYNKY 533
             W N  +   V     S  D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDVRKDIHSATDGLLSFSTFLFSKY 722


>gi|224025137|ref|ZP_03643503.1| hypothetical protein BACCOPRO_01871 [Bacteroides coprophilus DSM
           18228]
 gi|224018373|gb|EEF76371.1| hypothetical protein BACCOPRO_01871 [Bacteroides coprophilus DSM
           18228]
          Length = 718

 Score =  242 bits (617), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 177/581 (30%), Positives = 274/581 (47%), Gaps = 102/581 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW ++Q  LQK+IL R+ E G+ PVLP +SG VP   ++      + 
Sbjct: 187 MNNLEGWGGPNPDSWYERQEELQKRILKRMREYGIEPVLPGYSGMVPHNAKDRL-GLNVA 245

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      PR      L  TDP F  I   +  +  + YG+ S+ Y+ D F E  NT
Sbjct: 246 DPGRW---NGYPR---PAFLQPTDPQFERIAALYYREMTRLYGKVSY-YSMDPFHEGGNT 298

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             VD    + + G AI+  M+  +  A W++Q W    +P    PQM   + ++P G +V
Sbjct: 299 SGVD----LEAAGKAIWKAMKQANPRAAWVVQAW--GANP---RPQM---IRNLPAGDMV 346

Query: 179 VLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYG-------------- 217
           VLDLF+E +P W         K+ +G   +++CML N+ GN+ ++G              
Sbjct: 347 VLDLFSESRPQWGDPASSWYRKEGFGQHDWLFCMLLNYGGNVGLHGKMAHLIEEFYKAKD 406

Query: 218 ------------ILDSIAFGPV-----------EARTSENTTMAWINQYSVRRYGRSVPA 254
                        ++ I   PV           E R S++    W+  Y   RYG+S   
Sbjct: 407 SSFGKTLKGVGMTMEGIENNPVMYELLCELPWREQRFSKDE---WLEGYLKARYGKSDSQ 463

Query: 255 IQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 312
           +  AW +L +T+YNC   +T +  +  ++ A P      +S                   
Sbjct: 464 VSQAWMLLSNTIYNCPAASTQQGTHESILCARPSWKAYQVSSW----------------- 506

Query: 313 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 372
            SE S Y      Y  ++VIRA  + + +      +N + YDL+D+ RQA+A+    ++ 
Sbjct: 507 -SEMSDY------YDPADVIRAAGMMVDAAERFRGNNNFEYDLVDIVRQAVAEKGRLMYR 559

Query: 373 NIIEAYQLNDAHGVFQLSR-RFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
            +++AY+  D   +F+LS  RFL L+   D LLA    F +G WLESA+ L   EE++  
Sbjct: 560 VLVDAYKAGDRE-LFKLSSDRFLRLILMQDRLLATRSEFKVGRWLESARNLGSTEEEKDW 618

Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF 491
           YEWNAR QIT W +    +   L DY ++ W+GLLRD+Y  R   +    ++S E G   
Sbjct: 619 YEWNARVQITTWGNRVAADDGGLHDYAHREWNGLLRDFYYLRWKTWLDEQLKSFEGGQPK 678

Query: 492 RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 532
            +     ++  L   W    N Y  E+ G+ +  +  +Y +
Sbjct: 679 AI-----DFYALEEPWTLKHNSYASEAEGNPVDIACEIYRE 714


>gi|373461342|ref|ZP_09553084.1| hypothetical protein HMPREF9944_01348 [Prevotella maculosa OT 289]
 gi|371952896|gb|EHO70729.1| hypothetical protein HMPREF9944_01348 [Prevotella maculosa OT 289]
          Length = 731

 Score =  242 bits (617), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 152/491 (30%), Positives = 254/491 (51%), Gaps = 24/491 (4%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++GW GPLPQSW+D Q  LQ++IL R  E GM PVLPAF+G+VP   + + P A+IT
Sbjct: 194 MLNINGWQGPLPQSWIDGQADLQRRILQREREFGMRPVLPAFNGSVPLDYKRLHPEARIT 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G W       R   TY L  TDP F ++ ++F+++Q + +G T H+Y  D+F+E  PP
Sbjct: 254 EVGQWGGFGQAYR---TYFLSPTDPRFGKLQKSFLDEQRRMFG-TDHLYCLDSFNEVQPP 309

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
             SP+ +  L   I++ +   D  +VW+  GWLF  D   W P  ++A L+ +P  + ++
Sbjct: 310 SWSPDTLCMLARHIHASLDKADPQSVWVQMGWLFYNDRKHWTPDVIRAYLSGIPKDRALL 369

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAW 239
           LD + +   +W  ++ FYG PYI C+L NF GN  + G +  ++   ++A  +++  MA 
Sbjct: 370 LDYYIDHTELWRLTESFYGRPYIACVLGNFGGNTMLQGDVGKVS-SRLDAAIAQDGNMAG 428

Query: 240 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVT-EGK 298
           +   ++  +G +     D +  ++   ++C  G TD  RD +    D      S      
Sbjct: 429 VGA-TMEGFGVN----PDFYAFVFDKAWDC--GTTD--RDWLCRMADRHVGFASAAGRTA 479

Query: 299 YQNYGKPVSKEAVLKSETSSYDHPHL---WYSTSEVIRALELFIASGNELSASNTYRYDL 355
           +Q     +    V +S T     P     + +T+     L ++    +  S    + YD+
Sbjct: 480 WQVLFDRIMPSYVNESGTVVCARPSFEARYLNTTYPAELLGVWKMLLDIDSDKREHLYDV 539

Query: 356 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 415
           +++ RQ L  +       +  AY    +  V   +RR  ++++D+D LLAC + F L  W
Sbjct: 540 VNVGRQVLGDFFAFERDGLHRAYLSQRSDSVDYYARRMDKMLDDLDRLLACSEEFSLRKW 599

Query: 416 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 475
           +E A+       ++  YE NART IT+W D+ Q     L DY N+ W+GL+  YY  R  
Sbjct: 600 IEDARGFGATAAEKDYYERNARTLITVWGDSRQ-----LTDYANRTWAGLVSSYYKQRWH 654

Query: 476 IYFKYMIESLE 486
           I+  ++  ++ 
Sbjct: 655 IFTAHVRRAVR 665


>gi|424666301|ref|ZP_18103337.1| hypothetical protein HMPREF1205_02176 [Bacteroides fragilis HMW
           616]
 gi|404573840|gb|EKA78592.1| hypothetical protein HMPREF1205_02176 [Bacteroides fragilis HMW
           616]
          Length = 718

 Score =  242 bits (617), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 174/577 (30%), Positives = 261/577 (45%), Gaps = 103/577 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ+ LQKKIL R+ E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTQQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ +  Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ADFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++ ++  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDS------ 221
           G L++LDL +E +P W  S      K  YG   +++CML N+ GN+ ++G +D+      
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 222 ---------------------IAFGPVEA--------RTSENTTMAWINQYSVRRYGRSV 252
                                I   PV          R    T   W+ +Y   RYG   
Sbjct: 404 LAKADPHAGSTLKGVGMTPEGIENNPVMYELVMELPWRAERFTKEEWLKEYVKARYGADD 463

Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTKLANSIYNSPKNLTQQGTHEAVFCARP---------AEDVYQ---------- 504

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                 SS+     +Y   +VI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQDVIEAARLMVSVADRYKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
              +  AY+  D       S++FL L+   D LL     F +G W+E A+ L    E++ 
Sbjct: 560 QKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTRPEFRVGKWIEEARALGGTSEEKA 619

Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
            YEWNAR QIT W +    +   LRDY +K W+GLL+D+Y  R  +YF  + + LE    
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDSLSQKLEGKTP 679

Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
            ++     ++  +   W    N Y  E+ GD + T++
Sbjct: 680 EKI-----DFYAVEEPWAKATNPYSAEAEGDCIETAK 711


>gi|299144715|ref|ZP_07037783.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
 gi|298515206|gb|EFI39087.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
          Length = 727

 Score =  242 bits (617), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 172/577 (29%), Positives = 268/577 (46%), Gaps = 93/577 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRLYPEADIQ 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C + L+  D LF +I + F+++Q K +G   HIY  D F+E  PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-IDHIYGLDPFNEVDPP 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PEY+  + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++
Sbjct: 311 SFEPEYLRKIVSDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
           LD   E   +W  ++ F+  PYIWC L NF GN  + G                      
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430

Query: 219 -------LDSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                  LD + F P E    +   +      WI   + R  G    +++DAW  L++ +
Sbjct: 431 IGSTLEGLDVMQF-PYEYILEKAWNLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 489

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y                      + +  T G    Y +P   +   K  ++ Y +  L  
Sbjct: 490 Y----------------------AQVPRTLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524

Query: 327 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
              EV R L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D  
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQ 575

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
            +     +  E++ D+D L A H    L  W++ A+++  + + +  YE NAR  IT W 
Sbjct: 576 ALKACGEKMKEILHDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE-- 499
                    L DY ++ W+GL+RDYY  R  +Y    I+++  G   D  +L+D  +E  
Sbjct: 635 ------GGSLNDYASRSWAGLIRDYYAKRWEVYINTFIKAVGEGVEVDQKQLEDELKEIE 688

Query: 500 --WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 533
             W+  T+     ++V+   S  D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722


>gi|423299508|ref|ZP_17277533.1| hypothetical protein HMPREF1057_00674 [Bacteroides finegoldii
           CL09T03C10]
 gi|408473317|gb|EKJ91839.1| hypothetical protein HMPREF1057_00674 [Bacteroides finegoldii
           CL09T03C10]
          Length = 727

 Score =  242 bits (617), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 164/571 (28%), Positives = 268/571 (46%), Gaps = 80/571 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLP+ WL QQ  LQK+IL R  E  M PVLPAF+G+VPA L+ ++P+AKI 
Sbjct: 194 MSNVDYWQSPLPKDWLVQQEELQKRILAREREFNMTPVLPAFAGHVPAELKKIYPNAKIY 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W       R   ++ +D  D L+  I + F+E+Q K YG T HIY  D F+E   P
Sbjct: 254 TMSQWGGFDKQYR---SHFIDPMDSLYSVIQKRFLEEQTKIYG-TDHIYGIDPFNEVDSP 309

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
             + E++S++   IY  + S D +A WL   W+F Y    W P ++K+ L +VP  KL++
Sbjct: 310 DWNEEFLSNVSRKIYESLHSVDPEAQWLQMTWMFYYAKDKWTPSRIKSFLRAVPQDKLIL 369

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG---------------------- 217
           LD + +   IW  ++ +YG PYIWC L NF GN  + G                      
Sbjct: 370 LDYYCDHTEIWKKTEGYYGQPYIWCYLGNFGGNTMLAGNLNDTYEKIHQVLAEGGQNIHG 429

Query: 218 ---ILDSIAFGPV-------EARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
               L++    P+       +A      T  WI  ++  R G++ PA+  AW  L+  +Y
Sbjct: 430 LGVTLEAFDVNPMMYEFVFEQAWEGAQPTDEWIATWAKCRGGQTCPAVLKAWKELHEKIY 489

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
                              + PS+                 +AVL +     +    W +
Sbjct: 490 -------------------IAPSLCG---------------QAVLMNARPQLEGVQGWNT 515

Query: 328 TSEV-IRALELFIASGNELSASNT----YRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
             E      +L++  G+ L   +     + +D++++ RQ L    ++        Y+  D
Sbjct: 516 FPEYKYDNKDLWVIWGSLLQVGSIDKPGHAFDVVNVGRQVLGNLFSDYRAQFTACYKRKD 575

Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
             G  + ++R   L+ D+D LLAC   F +G W++ A+     EE++K YE NAR  +T+
Sbjct: 576 VKGAQEWAQRMDALLLDVDRLLACSPLFSMGKWIQDARDCGTTEEEKKYYEENARCILTI 635

Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
           W     ++ + L DY N+ W+GL + +Y  R   +   ++ ++++   F  K + ++   
Sbjct: 636 W----GQKDTQLNDYANRSWAGLTKGFYRERWKRFTDSVLTAMQANRSFDAKKFHKDITD 691

Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +W      + V S  DA+  +  L+NKY
Sbjct: 692 FEYEWTLQHETFSVSSGEDAVKVANELWNKY 722


>gi|160884062|ref|ZP_02065065.1| hypothetical protein BACOVA_02038 [Bacteroides ovatus ATCC 8483]
 gi|423291477|ref|ZP_17270325.1| hypothetical protein HMPREF1069_05368 [Bacteroides ovatus
           CL02T12C04]
 gi|156110404|gb|EDO12149.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
           8483]
 gi|392663477|gb|EIY57027.1| hypothetical protein HMPREF1069_05368 [Bacteroides ovatus
           CL02T12C04]
          Length = 727

 Score =  241 bits (616), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 174/577 (30%), Positives = 266/577 (46%), Gaps = 93/577 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C + L+  D LF +I + F+++Q K +G T HIY  D F+E  PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PEY+  + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
           LD   E   +W  ++ F+  PYIWC L NF GN  + G                      
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGERLENALINGGGNLKG 430

Query: 219 -------LDSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                  LD + F P E    +   +      WI   + R  G     ++DAW  L++ +
Sbjct: 431 IGSTLEGLDVMQF-PYEYILEKAWNLNVDDDKWIECLADRHVGCVSQPVRDAWKRLFNDI 489

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y              V  P         T G    Y +P       K  ++ Y +  L  
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNRNSEKRTSNVYSNVEL-- 524

Query: 327 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
              EV R L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D  
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKVEFDRMVEAKDHQ 575

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
            +     +  E++ D+D L A H    L  W++ A+++  + + +  YE NAR  IT W 
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE-- 499
                    L DY ++ W+GL+ DYY  R  +Y    I+++  G   D  +L+D  +E  
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYINTFIKAVGEGVEVDQKQLEDELKEIE 688

Query: 500 --WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 533
             W+  T+     ++V+   S  D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722


>gi|423282107|ref|ZP_17260992.1| hypothetical protein HMPREF1204_00530 [Bacteroides fragilis HMW
           615]
 gi|404582594|gb|EKA87288.1| hypothetical protein HMPREF1204_00530 [Bacteroides fragilis HMW
           615]
          Length = 718

 Score =  241 bits (616), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 171/577 (29%), Positives = 261/577 (45%), Gaps = 103/577 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  +Q+ LQKKIL R+ E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++ ++  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSI----- 222
           G L++LDL +E +P W  S      K  YG   +I+CML N+ GN+ ++G +D++     
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 223 -------------AFGPVEARTSENTTM-----------------AWINQYSVRRYGRSV 252
                          G        N  M                  W+ +Y   RYG   
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463

Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                 SS+     +Y   EVI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
              +  AY+  D       S +FL+L+   D LL     F +G W+E A+ L    E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619

Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
            YEWNAR QIT W +    +   LRDY +K W+GLL+D+Y  R  +YF ++ + +E    
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRMEGKAP 679

Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
             +     ++  +   W    N Y  E+ GD +  ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711


>gi|60680169|ref|YP_210313.1| alpha-N-acetylglucosaminidase [Bacteroides fragilis NCTC 9343]
 gi|375357012|ref|YP_005109784.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis 638R]
 gi|383116930|ref|ZP_09937677.1| hypothetical protein BSHG_0978 [Bacteroides sp. 3_2_5]
 gi|60491603|emb|CAH06355.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis NCTC
           9343]
 gi|251947777|gb|EES88059.1| hypothetical protein BSHG_0978 [Bacteroides sp. 3_2_5]
 gi|301161693|emb|CBW21233.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis 638R]
          Length = 718

 Score =  241 bits (616), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 171/577 (29%), Positives = 261/577 (45%), Gaps = 103/577 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  +Q+ LQKKIL R+ E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++ ++  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSI----- 222
           G L++LDL +E +P W  S      K  YG   +I+CML N+ GN+ ++G +D++     
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 223 -------------AFGPVEARTSENTTM-----------------AWINQYSVRRYGRSV 252
                          G        N  M                  W+ +Y   RYG   
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463

Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                 SS+     +Y   EVI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
              +  AY+  D       S +FL+L+   D LL     F +G W+E A+ L    E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619

Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
            YEWNAR QIT W +    +   LRDY +K W+GLL+D+Y  R  +YF ++ + +E    
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRIEGKTP 679

Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
             +     ++  +   W    N Y  E+ GD +  ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711


>gi|423269418|ref|ZP_17248390.1| hypothetical protein HMPREF1079_01472 [Bacteroides fragilis
           CL05T00C42]
 gi|423273021|ref|ZP_17251968.1| hypothetical protein HMPREF1080_00621 [Bacteroides fragilis
           CL05T12C13]
 gi|392701212|gb|EIY94372.1| hypothetical protein HMPREF1079_01472 [Bacteroides fragilis
           CL05T00C42]
 gi|392708585|gb|EIZ01692.1| hypothetical protein HMPREF1080_00621 [Bacteroides fragilis
           CL05T12C13]
          Length = 718

 Score =  241 bits (616), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 171/577 (29%), Positives = 261/577 (45%), Gaps = 103/577 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  +Q+ LQKKIL R+ E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++ ++  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSI----- 222
           G L++LDL +E +P W  S      K  YG   +I+CML N+ GN+ ++G +D++     
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 223 -------------AFGPVEARTSENTTM-----------------AWINQYSVRRYGRSV 252
                          G        N  M                  W+ +Y   RYG   
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463

Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                 SS+     +Y   EVI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
              +  AY+  D       S +FL+L+   D LL     F +G W+E A+ L    E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619

Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
            YEWNAR QIT W +    +   LRDY +K W+GLL+D+Y  R  +YF ++ + +E    
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRMEGKAP 679

Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
             +     ++  +   W    N Y  E+ GD +  ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSTEAEGDCIEVAK 711


>gi|423280158|ref|ZP_17259071.1| hypothetical protein HMPREF1203_03288 [Bacteroides fragilis HMW
           610]
 gi|404584494|gb|EKA89159.1| hypothetical protein HMPREF1203_03288 [Bacteroides fragilis HMW
           610]
          Length = 718

 Score =  241 bits (615), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 174/577 (30%), Positives = 261/577 (45%), Gaps = 103/577 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ+ LQKKIL R+ E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTQQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ +  Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ADFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++ ++  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLGA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDS------ 221
           G L++LDL +E +P W  S      K  YG   +++CML N+ GN+ ++G +D+      
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 222 ---------------------IAFGPVEA--------RTSENTTMAWINQYSVRRYGRSV 252
                                I   PV          R    T   W+ +Y   RYG   
Sbjct: 404 LAKADPHAGSTLKGVGMTPEGIENNPVMYELVMELPWRAERFTKEEWLKEYVKARYGADD 463

Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTKLANSIYNSPKNLTQQGTHESVFSARP---------AEDVYQ---------- 504

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                 SS+     +Y   +VI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQDVIEAARLMVSVADRYKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
              +  AY+  D       S++FL L+   D LL     F +G W+E A+ L    E++ 
Sbjct: 560 QKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTRPEFRVGKWIEEARALGGTSEEKA 619

Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
            YEWNAR QIT W +    +   LRDY +K W+GLL+D+Y  R  +YF  + + LE    
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDSLSQKLEGKTP 679

Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
            ++     ++  +   W    N Y  E+ GD + T++
Sbjct: 680 EKI-----DFYAVEEPWTKATNPYSAEAEGDCIETAK 711


>gi|218258436|ref|ZP_03474815.1| hypothetical protein PRABACTJOHN_00470 [Parabacteroides johnsonii
           DSM 18315]
 gi|423342591|ref|ZP_17320305.1| hypothetical protein HMPREF1077_01735 [Parabacteroides johnsonii
           CL02T12C29]
 gi|218225494|gb|EEC98144.1| hypothetical protein PRABACTJOHN_00470 [Parabacteroides johnsonii
           DSM 18315]
 gi|409217508|gb|EKN10484.1| hypothetical protein HMPREF1077_01735 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 718

 Score =  241 bits (615), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 170/576 (29%), Positives = 262/576 (45%), Gaps = 96/576 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ+ LQ++I+ R+ E G+ PV P +SG VP   +       ++
Sbjct: 187 MNNLEGWGGPNPDSWYKQQIALQQQIVKRMREYGIEPVFPGYSGMVPHNAKEKL-GLNVS 245

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W   +          L  TDP F EI   + ++  K YG+ ++ Y+ D F E  + 
Sbjct: 246 DPGLWNGYRRPA------FLQPTDPRFEEIASLYYKEMNKLYGKANY-YSMDPFHEGGSV 298

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             VD    + + G AI   M+  +  AVW+ Q W  +  P         ++ ++  G L+
Sbjct: 299 AGVD----LDAAGKAIMQAMKKNNPKAVWVAQAWQANPRP--------QMIGNLEAGDLI 346

Query: 179 VLDLFAEVKPIWSTSKQ-------FYGVPYIWCMLHNFAGNIEMYGILDSI--------- 222
            LDLFAE +P W            F    +I+CML N+ GNI ++G +  +         
Sbjct: 347 ALDLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLNYGGNIGLHGKMKHVIDEFYKAKE 406

Query: 223 -AFGP------VEARTSENTTMA------------------WINQYSVRRYGRSVPAIQD 257
             FG       +    SEN  +                   W+  Y+V RYG+S P +QD
Sbjct: 407 SPFGTTLKGVGMTMEGSENNPVMFELLTELPWRPQRFDKDQWLKAYTVARYGKSNPVVQD 466

Query: 258 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 315
           AW +L +++YNC D  T +  +  V  A P         TE  YQ               
Sbjct: 467 AWILLSNSIYNCPDANTQQGTHESVFCARP---------TEHPYQ--------------- 502

Query: 316 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 375
            SS+     +Y  ++VIRA  + ++  ++   +N + YDL+D+ RQA+A+    L   ++
Sbjct: 503 VSSWSEMKDYYDPNDVIRAAAMMVSVSDQFKGNNNFEYDLVDIVRQAIAE-KGRLTEKVV 561

Query: 376 EA-YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEW 434
           EA +   D       S RFL L+   D LLA    F +G W+  A+ L    E++  YEW
Sbjct: 562 EAAFAAGDKKLYKDASDRFLRLILLQDELLATRPEFKVGTWIARARSLGNTSEEKDLYEW 621

Query: 435 NARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 494
           NAR QIT W +    +   LRDY ++ W+G+L+D+Y  R   +F Y    L+      + 
Sbjct: 622 NARVQITTWGNRLAADEGGLRDYAHREWNGILKDFYYMRWKTWFDYQTRLLDGKKTAAI- 680

Query: 495 DWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLY 530
               ++  +   W    N Y  E  GD + T Q ++
Sbjct: 681 ----DFYAIEEPWTKQTNPYSNEPEGDCIPTVQRIF 712


>gi|265765312|ref|ZP_06093587.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_16]
 gi|263254696|gb|EEZ26130.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_16]
          Length = 718

 Score =  241 bits (615), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 171/577 (29%), Positives = 261/577 (45%), Gaps = 103/577 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  +Q+ LQKKIL R+ E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++ ++  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSI----- 222
           G L++LDL +E +P W  S      K  YG   +I+CML N+ GN+ ++G +D++     
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 223 -------------AFGPVEARTSENTTM-----------------AWINQYSVRRYGRSV 252
                          G        N  M                  W+ +Y   RYG   
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463

Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                 SS+     +Y   EVI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
              +  AY+  D       S +FL+L+   D LL     F +G W+E A+ L    E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619

Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
            YEWNAR QIT W +    +   LRDY +K W+GLL+D+Y  R  +YF ++ + +E    
Sbjct: 620 FYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRMEGKAP 679

Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
             +     ++  +   W    N Y  E+ GD +  ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711


>gi|261880010|ref|ZP_06006437.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
 gi|270333326|gb|EFA44112.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
          Length = 719

 Score =  241 bits (614), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 174/572 (30%), Positives = 258/572 (45%), Gaps = 88/572 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP   + +Q  LQ+KIL R   L M PVLPAFSG+VP  ++ ++P + I 
Sbjct: 195 MANIDKWKGPLPYHTVVEQRDLQQKILARERSLNMTPVLPAFSGHVPGQIKQLYPESNIQ 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W +  SD   C  Y +   DPLF +I R ++E+Q   YG T HIY  D F+E  PP
Sbjct: 255 HLGRWAAF-SDQYRC--YFMSPQDPLFAKIQRMYLEEQRAIYG-TDHIYGIDPFNEVDPP 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
              P+Y+  +   IY  +   D  A WL   WLF +    W P ++KAL+  V  GK+V+
Sbjct: 311 SWDPDYLFQISKGIYQTLAHVDPKAEWLQMSWLFYHKKKKWTPERVKALITGVETGKMVL 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG-------ILDS-IAFG------ 225
           LD F +   IW  + +FYG PYIWC L NF GN  + G        LDS +  G      
Sbjct: 371 LDYFCDRNEIWKMTDKFYGQPYIWCYLGNFGGNTTVAGNVKACGAKLDSTLTLGGKNLQG 430

Query: 226 -------------PVE--------ARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYH 264
                        P E          +SEN    WI+  +    G + P+ + AW +LYH
Sbjct: 431 VGLTLEGFDVCQFPYEYILDKVWSGNSSENQ---WIDALADSHVGYASPSFRKAWQLLYH 487

Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
            V+  + G+           P   P + S+                       ++ + H+
Sbjct: 488 DVFVQSAGSNG-------ILPCYRPELNSL-----------------------NWHYTHV 517

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN---IIEAYQLN 381
            Y   ++I A +L     +  S     + DLI   RQ L    NE   +      AY   
Sbjct: 518 DYDRQKLIEAWKLMQHDAD--SKRTAAQLDLIHYGRQVL---GNEFLTHKQLFDSAYAHC 572

Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
           D  G+   +     ++ D+D L A H    L  W++ A+Q+A +      YE NAR+ IT
Sbjct: 573 DLAGMMAQAASMRHIMLDIDTLTAYHPRCTLAGWIDGARQMAPDSVCADYYEDNARSLIT 632

Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 501
            W          L DY  K W+GL+ DYY  R   YF + I ++ +   F  + + +E  
Sbjct: 633 TW-------GGKLNDYACKGWAGLMSDYYLTRWERYFAHAINAVRAHRKFDQQAYDKEIA 685

Query: 502 KLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
           +    W + R++  VE++    +  + +  KY
Sbjct: 686 RFELSWASHRDIPRVETHESLALYCKKIIQKY 717


>gi|281200617|gb|EFA74835.1| alpha-N-acetylglucosaminidase [Polysphondylium pallidum PN500]
          Length = 688

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 162/528 (30%), Positives = 248/528 (46%), Gaps = 72/528 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++ W G L   W+  Q  LQ +IL R+ + GM  VLP F+G+VP AL+  +P A IT
Sbjct: 213 MGNVNEWAGNLTLGWMADQRDLQIQILTRMRQFGMQAVLPGFAGHVPEALETHYPKANIT 272

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG W +         TY L+  DPLF +I +AF+  Q + YG T H YN D F+E  PP
Sbjct: 273 QLGGWGTFSG------TYYLNPDDPLFSKIAQAFVITQNQLYG-TDHFYNFDPFNELEPP 325

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
                Y+ +   ++++ + + D   +W++QGW    DP FW PPQ +A L+ VP+GK++V
Sbjct: 326 SSDLTYLKNCSQSMFNNLIAADPQGIWVLQGWFLVDDPEFWLPPQTEAFLSGVPIGKMIV 385

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAW 239
           LDL+++V P W+++  +YG  +IWCMLHNF G   MYG +  I+  P+EAR+     +  
Sbjct: 386 LDLWSDVIPAWNSTNYYYGHNWIWCMLHNFGGRSGMYGKIPFISTNPIEARSLSPNMVG- 444

Query: 240 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 299
                    G +  AI+   NV+   VY+       ++       PD+   +      +Y
Sbjct: 445 --------TGLTPEAIEQ--NVI---VYDLMSEMAWRSTP-----PDLKEWVDQYVTRRY 486

Query: 300 QNYGKP---VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 356
             +G P   ++  ++  + T S+                                  DL 
Sbjct: 487 GKHGLPFLSINDTSITNTSTFSF----------------------------------DLT 512

Query: 357 DLTRQALAK--YANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 414
           ++T QAL      NEL LN   A+  N      + S   L +++D+  + +  +  L+G 
Sbjct: 513 EITTQALINLFMTNELQLN--SAFLNNSLEEFNKYSEALLSIIQDVYTIASTQEMLLVGH 570

Query: 415 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 474
           W   A+ L    E    YE NAR QIT+W        S + DY  K W GL  D+Y  R 
Sbjct: 571 WTARARALTPANESTNLYEMNARNQITLW----GPTYSDVHDYAYKLWGGLTEDFYLARW 626

Query: 475 AIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDA 522
            ++ K +  SL S   F    ++     +   W      YP    G++
Sbjct: 627 TLFVKELQYSLTSSQPFNSTLFQTNCEAVEEVWNLQTYPYPTIPTGNS 674


>gi|391338146|ref|XP_003743422.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Metaseiulus
           occidentalis]
          Length = 665

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 151/502 (30%), Positives = 241/502 (48%), Gaps = 77/502 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL G+GGPLP SW  QQ +LQK IL R+ + GM PV+P F+G VP A + + P+   +
Sbjct: 188 MGNLRGFGGPLPSSWQLQQQLLQKMILRRMRDFGMTPVVPGFNGFVPRAFERLHPAVSWS 247

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +   W +   +  +     L  T+  F+ +   +I      YG + H+Y+ D F+E TP 
Sbjct: 248 RASRWNNFPDE--YAMLTFLAPTESFFLNVSSLYITMYRSIYG-SDHLYSVDLFNEETPD 304

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
            + P  ++ + + +Y  +   D   +W+MQGWLF +   +W   ++KA L   PLGK++V
Sbjct: 305 TNDPAALAEMSSNVYESIAKADPKGIWVMQGWLFVHGGDYWNHDRVKAFLGGPPLGKMIV 364

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR--------- 230
           LDLF+E  P +     ++G P+IWCMLHN+ G   ++G L+ I   P+  R         
Sbjct: 365 LDLFSEQSPQFPRFSNYFGQPFIWCMLHNYGGVSGLFGNLEWINSEPLNVRRSVPNMIGI 424

Query: 231 ------------------------TSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                   +SEN ++ W+  Y   RYG S P +++AW +L  +V
Sbjct: 425 GIAPEGTGQNEVIYEFMAENSYRDSSENVSL-WLQNYVGARYGLSDPHLENAWELLRKSV 483

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y                         S+T    +N+G       +L         P +WY
Sbjct: 484 Y-------------------------SLTSKSIENHGN-----YILTHRPKLNSTPLIWY 513

Query: 327 STSEVIRALELFIASGN---ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
           + S+VI A    I       EL     +  DL+D+ RQAL    ++ +L ++  ++ N  
Sbjct: 514 NGSDVIGAATELIRGATLHRELCHERLFHQDLVDVVRQALQVRVSDEYLQMMSHFKANSL 573

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ-NEEQEKQYEWNARTQITM 442
               + SRR L  +  +D +L+    FLLG WL  +++ A  + + + Q+E+NAR QIT 
Sbjct: 574 IDFEEHSRRLLHCIRVLDKVLSTDPNFLLGSWLRDSRESAGLDRDLQDQFEFNARNQITR 633

Query: 443 WFDNTQEEASLLRDYGNKYWSG 464
           W  N +     + DY +K W+G
Sbjct: 634 WGPNGE-----IVDYASKMWNG 650


>gi|393782608|ref|ZP_10370791.1| hypothetical protein HMPREF1071_01659 [Bacteroides salyersiae
           CL02T12C01]
 gi|392672835|gb|EIY66301.1| hypothetical protein HMPREF1071_01659 [Bacteroides salyersiae
           CL02T12C01]
          Length = 761

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 161/525 (30%), Positives = 240/525 (45%), Gaps = 84/525 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGPLP+S +D+   L KKI+ R  ELGM P+   FSG VP  L++ +P+A I 
Sbjct: 199 MQNLQSYGGPLPKSVIDRHAALGKKIIARQLELGMQPIQQGFSGYVPRELKDKYPTANIN 258

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q  +W   K   +      LD TD LF  +GR F+E+Q + +G    +Y  D F E+ PP
Sbjct: 259 QQRSWCGFKGAAQ------LDPTDSLFTRMGRVFLEEQARLFG-AHGVYAADPFHESVPP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           VD+PEY+ ++G  I+   +  D  + W MQ W             +A++ +VP   L++L
Sbjct: 312 VDTPEYLKAVGETIHRLFREFDPQSTWAMQSWSLR----------EAIVKAVPKEALLIL 361

Query: 181 DLFAEVKPIWSTSK-QFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           DL        STSK +F+G P +   LHNF G I M+G L  +A          N  +  
Sbjct: 362 DLRGS-----STSKAEFWGYPTVVGNLHNFGGRINMHGDLALLASNQYSKAKRLNPAVCG 416

Query: 238 -------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                          AW+ QY+ RRYG   PA Q AW +L    
Sbjct: 417 SGLFMEAIEQNPVYYELAFEMPCHPDSIDLRAWLKQYATRRYGAFSPATQKAWMLLLEGP 476

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y      T+K+  ++ A P +D                 V K          YD P L  
Sbjct: 477 YRQGTNGTEKS-SIVAARPALD-----------------VKKSGPNAGLEIPYD-PAL-- 515

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
               +IRA  L +   ++LSAS  YR+DL+D+ RQ +      +     EA++  D    
Sbjct: 516 ----IIRAQSLLLEDADKLSASRPYRFDLVDVQRQMMTNLGQLIHRKAAEAFRSKDREAF 571

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
              S RFL ++ DMD LL     +    WL  A+   + EE++ Q E +A + +T+W  +
Sbjct: 572 TLHSGRFLGMLADMDTLLRTRSEYSFDRWLTEARSWGETEEEKNQMERDATSLVTIWGAD 631

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF 491
                  + DY  + W+GL+  YY PR   ++  + + L+ G  +
Sbjct: 632 GDPR---IFDYSWREWAGLINGYYLPRWQKFYTMLQQHLDEGTSY 673


>gi|295085509|emb|CBK67032.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
           XB1A]
          Length = 716

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 171/574 (29%), Positives = 260/574 (45%), Gaps = 87/574 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 184 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 243

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C + L+  D LF +I + F+++Q K +G T HIY  D F+E  PP
Sbjct: 244 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 299

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PEY+  + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++
Sbjct: 300 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 359

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
           LD   E   +W  ++ F+  PYIWC L NF GN  + G                      
Sbjct: 360 LDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 419

Query: 219 -------LDSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                  LD + F P E    +   +      WI   + R  G    +++DAW  L++ +
Sbjct: 420 IGSTLEGLDVMQF-PYEYILEKAWNLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 478

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y              V  P         T G    Y +P   +   K  ++ Y +  L  
Sbjct: 479 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 513

Query: 327 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
              EV R L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D  
Sbjct: 514 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDHQ 564

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
            +     +  E++ D+D L A H    L  W++ A+++  + + +  YE NAR  IT W 
Sbjct: 565 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 623

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                    L DY ++ W+GL+ DYY  R  +Y    I+++        K    E  ++ 
Sbjct: 624 ------GGSLNDYASRSWAGLISDYYAKRWEVYIDTFIKAVGEDVEVDQKQLEDELKEIE 677

Query: 505 NDWQNGRNVYPVE----SNGDALIT-SQWLYNKY 533
             W N  +   V     S  D L++ S +L++KY
Sbjct: 678 EGWVNATDRKDVRKDVHSTTDGLLSFSTFLFSKY 711


>gi|153808241|ref|ZP_01960909.1| hypothetical protein BACCAC_02529 [Bacteroides caccae ATCC 43185]
 gi|423219048|ref|ZP_17205544.1| hypothetical protein HMPREF1061_02317 [Bacteroides caccae
           CL03T12C61]
 gi|149129144|gb|EDM20360.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
           43185]
 gi|392625814|gb|EIY19870.1| hypothetical protein HMPREF1061_02317 [Bacteroides caccae
           CL03T12C61]
          Length = 752

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 169/534 (31%), Positives = 246/534 (46%), Gaps = 87/534 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGPLP+SW+D+ ++L KKI+ R  ELGM P+   FSG VP  L++ +P AKI 
Sbjct: 193 MQNLQSYGGPLPKSWIDKHIILAKKIIDRERELGMTPIQQGFSGYVPRELKDKYPEAKI- 251

Query: 61  QLGNWFSVKSDPRWC---CTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
                   +  P WC       LD TD LF  +GR F+E++ K YG T  IY  D F E+
Sbjct: 252 --------RLQPGWCGFKGAGQLDPTDALFATLGRDFLEEEKKLYG-TYGIYAADPFHES 302

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
            PPV++PEY+S++G AIY  ++  D  A W MQ W        R P +KA    VP   L
Sbjct: 303 APPVNTPEYLSAVGHAIYKLIKDFDPKAKWAMQAWSL------REPIVKA----VPQNDL 352

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG-------------------- 217
           ++LDL  E        K F+G P +   LHNF G I M+G                    
Sbjct: 353 IILDLNGEK---IKGRKGFWGYPAVEGNLHNFGGRINMHGDLRLLASNQYMTALKQYPNV 409

Query: 218 -----ILDSIAFGPV--------EARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYH 264
                 +++I   PV             E     W+ QY+ RRYG   P+ Q A   L  
Sbjct: 410 CGSGLFMEAIEQNPVYYDLAFEMPLHKGEVAIEEWLKQYANRRYGAVSPSAQQAMICLLE 469

Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
             Y      T+  R  I+A     P++     G     G P                   
Sbjct: 470 GPYRPGTNGTE--RSSIIA---ARPALNVKKSGPNAGLGIP------------------- 505

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
            YS   VI+A  L +   ++L  S  YR+D+ID+ RQ +      +     EA+   D  
Sbjct: 506 -YSPLLVIQAEGLLLKDADKLKNSEPYRFDVIDVQRQMMTNMGQVIHKRAAEAFLNRDKE 564

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
                S+RFL+++ED+D LL     F    WL SA+     EE++   E++A + +T+W 
Sbjct: 565 AFALHSKRFLQMLEDVDELLRTRPEFNFDRWLTSARSWGDTEEEKNLLEYDATSLVTIW- 623

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 498
                + S+  DY  + W+GL++ YY PR   ++  + E L++G  +  +  R+
Sbjct: 624 -GADGDPSIF-DYSWREWTGLIKGYYLPRWTKFYAMLQEHLDNGTTYSEEGLRQ 675


>gi|452988463|gb|EME88218.1| glycoside hydrolase family 89 protein [Pseudocercospora fijiensis
           CIRAD86]
          Length = 772

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 155/545 (28%), Positives = 263/545 (48%), Gaps = 85/545 (15%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G WGG LPQSW+D Q  L KKI+ R+ ELGM PVLP F+G VP  +  ++P+A    
Sbjct: 202 NIQGSWGGDLPQSWIDHQFELNKKIVARMVELGMTPVLPCFTGFVPTQISRLYPNASFVN 261

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
              W   +++  +     L+  DPLF  + ++FI +Q++ YG  S IY  D ++EN P  
Sbjct: 262 GSRWNGFQAE--YTNVTFLEPFDPLFTTLQKSFISKQIEAYGNVSSIYTLDQYNENDPFS 319

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVVL 180
               Y+ ++ +     +++ D +A+W +QGWLF S   FW   +++A L  V    +++L
Sbjct: 320 GELAYLKNVTSNTIKSLKAADPEAIWFIQGWLFYSSADFWTDERVEAYLGGVANEDMLIL 379

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DLF+E +P W  +  ++G P+IWC LH++ GN  ++G ++++   PV+A  ++ +TM   
Sbjct: 380 DLFSESQPQWQRTNSYFGKPWIWCQLHDYGGNQGLHGQVENVTINPVQALANKTSTMVGM 439

Query: 238 --------------------AW----------INQYSVRRY-GRSVPA-IQDAWNVLYHT 265
                               AW           + +   RY G  +P+ +  AW+V+  T
Sbjct: 440 GSTMEGQEGNEIIYDILLDQAWSKEPIDSDSYFHDWVTSRYAGSKLPSGLYTAWDVMRQT 499

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VYN TD   +    V  +  +++P+   +   +  +    +    VL S  +        
Sbjct: 500 VYNSTD--IEAAEAVTKSIFELEPNTTGLLNRRGHHSTLILYDPNVLVSAWN-------- 549

Query: 326 YSTSEVIRALELFIASGNELSA--SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
                     +L+ AS +++       Y++DL+D TRQ LA     L+ + + +      
Sbjct: 550 ----------DLYNASNDDIQLWDVKAYQFDLVDTTRQVLANAFYPLYTDFVHSAN-KSV 598

Query: 384 HGVF------QLSRRFLELVEDMDGLLACHDG--FLLGPWLESAKQLAQNEEQEKQ---- 431
            G +      +  +  + L++D+D +L       F L  W+ESA+  A  E+        
Sbjct: 599 QGTYSPTKAEEKGKEMIMLLKDLDSVLEASGNAHFKLSSWIESARLWAPAEDYADDKNTT 658

Query: 432 ------YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESL 485
                 YE+ AR QIT+W  N +     + DY +K W+GL+R YY PR   +  + + S 
Sbjct: 659 AKIADFYEYTARNQITLWGPNGE-----ISDYASKQWAGLIRSYYVPRWQRFVDFTLNST 713

Query: 486 ESGDG 490
            S +G
Sbjct: 714 TSMNG 718


>gi|336408181|ref|ZP_08588675.1| hypothetical protein HMPREF1018_00690 [Bacteroides sp. 2_1_56FAA]
 gi|335939481|gb|EGN01355.1| hypothetical protein HMPREF1018_00690 [Bacteroides sp. 2_1_56FAA]
          Length = 718

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 170/577 (29%), Positives = 260/577 (45%), Gaps = 103/577 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  +Q+ LQKKIL R+ E G+ P+LP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPMLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++  +  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIEDLKA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSI----- 222
           G L++LDL +E +P W  S      K  YG   +I+CML N+ GN+ ++G +D++     
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 223 -------------AFGPVEARTSENTTM-----------------AWINQYSVRRYGRSV 252
                          G        N  M                  W+ +Y   RYG   
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463

Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                 SS+     +Y   EVI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
              +  AY+  D       S +FL+L+   D LL     F +G W+E A+ L    E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619

Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
            YEWNAR QIT W +    +   LRDY +K W+GLL+D+Y  R  +YF ++ + +E    
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRIEGKTP 679

Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
             +     ++  +   W    N Y  E+ GD +  ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711


>gi|392566857|gb|EIW60032.1| alpha-N-acetylglucosaminidase [Trametes versicolor FP-101664 SS1]
          Length = 747

 Score =  239 bits (610), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 171/554 (30%), Positives = 273/554 (49%), Gaps = 73/554 (13%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G WGG LP +W+D Q  LQK++L R+ ELGM PV+P+F+G VP AL  + P+A I
Sbjct: 193 FGNIQGSWGGELPTAWVDDQFALQKRLLPRMVELGMTPVMPSFTGFVPRALAALHPNASI 252

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGR-TSHIYNCDTFDENT 118
                W    +         L+  DPLF  + ++FI +Q   YG   SH+Y  D ++EN 
Sbjct: 253 VTGSQWSGFPTS--LTNDSFLEPFDPLFATLQQSFIAKQQAAYGADISHVYTLDQYNEND 310

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLG-K 176
           P     +Y+ ++ A  ++ +++ D  AVWLMQGWLF  D  FW   ++ A L  VP    
Sbjct: 311 PFSGDLDYLRNVSAGTFASLRAADPAAVWLMQGWLFFSDAVFWTDDRVAAYLGGVPGNDS 370

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
           ++VLDL++E +P W+ +  + G  ++WC LH++ GNI M G LD +   P+ A +S  ++
Sbjct: 371 MIVLDLYSEAQPQWNRTASYSGKQWVWCELHDYGGNIGMEGNLDVLTHAPLTALSSPGSS 430

Query: 237 M-----------------------AW----------INQYSVRRYG-RSVP-AIQDAWNV 261
           M                       AW          ++ +  RRY  + +P A QDAW +
Sbjct: 431 MKGVGLTMEGQEGNEIVYGVLLDQAWSATSLNTSSYVSSWVSRRYPVKPLPKAAQDAWRI 490

Query: 262 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
           L  TVYN  D  T   +  I    ++ P++  +T     +   P S           YD 
Sbjct: 491 LSTTVYNNQDPNT---QATIKGIYELAPALTGMTNRIGHH---PTSIP---------YD- 534

Query: 322 PHLWYSTSEVIRALELFI---ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 378
                + + ++ AL+L +   A    LSA   + YD++D+ RQ L+     L+  +I+ Y
Sbjct: 535 -----TDATMLSALKLLLEARAQHPTLSAVPEFVYDVVDVARQLLSNRFIGLYDTLIQTY 589

Query: 379 QLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-EWN 435
                 A  V    +  L L+ D+D LL+ ++ FLL  W+  A++ A        Y E+N
Sbjct: 590 NSTSSTAQSVSAAGQPLLALLTDLDALLSTNEHFLLSSWIADARKWADGSASYGAYLEYN 649

Query: 436 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD 495
           AR Q+T+W  + +     + DY +K W+GL+  YY PR A +  Y+ E+  +G  +    
Sbjct: 650 ARNQVTLWGPDGE-----INDYASKAWAGLVGTYYKPRWAAFVDYLAETKGTGQAYNATA 704

Query: 496 WRREWIKLTNDWQN 509
            +   + +  +W N
Sbjct: 705 VKSTMLAIGQEWGN 718


>gi|379334158|gb|AFD03088.1| putative alpha-N-acetylglucosaminidase [uncultured bacterium 8]
          Length = 726

 Score =  239 bits (609), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 171/561 (30%), Positives = 261/561 (46%), Gaps = 73/561 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           +++L GW GPLPQSW+D+   L ++IL R   LGM PVL  FSG+VP  L      A+ T
Sbjct: 181 LASLDGWSGPLPQSWIDRHADLGRRILARERALGMRPVLQGFSGHVPQELIAER-GARST 239

Query: 61  QLGNW-FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
            L  W F V          +LD  DPLF E G   + +Q + +G T H+Y  D F E TP
Sbjct: 240 TLPWWDFEVG---------MLDPRDPLFEEFGTTLLTEQTRLFG-TDHLYAADPFIETTP 289

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLV 178
           PV  P  ++ +  A++  M + D  A W++Q W FSY   +W P +  A L+++P   ++
Sbjct: 290 PVSDPADLAQVARAVHGVMTAVDDRATWVLQAWPFSYRSRYWTPERTGAFLDAIPDDGML 349

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART------- 231
           +LDL+AE +P+W  +  +   P++WCMLH+  G   +YG LD IA G   A+        
Sbjct: 350 ILDLWAEHRPVWQRTDGYRKKPWVWCMLHSLGGRPGLYGKLDEIATGAARAQADARGGSL 409

Query: 232 ---------------------------SENTTMAWINQYSVRRYGRSVPAIQDAWNVLYH 264
                                      S +   AW+  ++  RYGR+ P +  AW++L+ 
Sbjct: 410 SGIGASMEAFGGDPVLYELLADVAWQGSVDDVRAWLETWTRARYGRATPGLLRAWDLLHD 469

Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           +VY  ++G       VIV  P +        EG  ++      +  V  ++  S D P  
Sbjct: 470 SVY-ASEGPGPPG-SVIVGRPTL--------EGDLRH------ELPVHLADPPSPDVP-- 511

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
               +       L   +  E SA    R DL D+T Q L   A E      +A    DA 
Sbjct: 512 ---PALAEAWALLADEATQEDSAGPLGR-DLCDVTAQVLTHVACERQWRAADAALARDAD 567

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
           G  + +R  L+ +ED+D LLA      L  WL  A+  A    +   YE +AR  +T+W 
Sbjct: 568 GFQRAARALLDTIEDLDTLLATRPEHRLDGWLADARGWATTPAEADLYETDARRLLTLWG 627

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                  S L DY  ++W+GL+  +Y PR   +++++  +LE+G  +R +++    +   
Sbjct: 628 HTR----SKLHDYSGRHWAGLVGTFYLPRWRSWYEHIARALETGSPYRAEEFEASLLAQE 683

Query: 505 NDWQNGRNVYPVESNGDALIT 525
             W   RN       G A  T
Sbjct: 684 ERWVADRNGPTTPEAGTAGAT 704


>gi|238506383|ref|XP_002384393.1| alpha-N-acetylglucosaminidase, putative [Aspergillus flavus
           NRRL3357]
 gi|220689106|gb|EED45457.1| alpha-N-acetylglucosaminidase, putative [Aspergillus flavus
           NRRL3357]
          Length = 669

 Score =  239 bits (609), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 157/534 (29%), Positives = 257/534 (48%), Gaps = 84/534 (15%)

Query: 1   MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
           + N+ G WGG  +  +W++ Q  LQKKI+ RI ELGM PVLPAF G VP A++ V P A 
Sbjct: 109 LGNIQGSWGGHGVSIAWIEAQFELQKKIVSRIVELGMRPVLPAFPGFVPPAIKRVRPHAT 168

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           +     W   +   ++     L   D  F ++ ++ I +Q++ +G  +H+Y  D F+E  
Sbjct: 169 VVNGSQWSGFQK--KFTEVSFLSPLDRTFADLQKSVISRQMRAFGNITHVYALDQFNEIN 226

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP--FWRPPQMKALLNSVPLGK 176
           P      Y+ +L    +  +++ +  AVW+MQGWLF YD   FW   ++ A L+ V    
Sbjct: 227 PASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YDKKDFWDSNRISAYLSGVERND 285

Query: 177 -LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
            +++LDL++E KP W  ++ ++G P+IWC LH+F GN+ MYG + +I   P+EA    ++
Sbjct: 286 DMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMGMYGQIMNITSDPIEALNKSDS 345

Query: 236 --------------------------------TMAWINQYSVRRYGR--SVP-AIQDAWN 260
                                           T A+   +   RY    SVP  +  AW+
Sbjct: 346 LVGFGLTMEGQEGNEIVYDLLLDQAWSATPIDTRAYFQSWVRSRYSGNLSVPNELYTAWD 405

Query: 261 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS--------KEAVL 312
           +L  TVYN T+  T     V  +  ++ P I  +  G+  +Y  P S         E + 
Sbjct: 406 LLRKTVYNNTNLTT---YSVTKSIFEISPDIAGLV-GRVGHYPTPTSINYDPMVLNEVLS 461

Query: 313 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 372
               ++   P LW++ +                     Y YD++D+TRQ +      ++ 
Sbjct: 462 LFMNATRKEPSLWHNPA---------------------YEYDMVDITRQLMGNAFVNVYS 500

Query: 373 NIIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 429
            +I +++    N    V   S R L L+  +D +L+C++ F L  W+ SA+      E +
Sbjct: 501 VLITSWKSETENRTTKVTSHSERLLNLLSAIDKVLSCNENFSLATWISSARDWGNTTETK 560

Query: 430 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 483
             +E+NAR QIT+W    +     + DY +K W+GL+  YY PR +I+  Y+ E
Sbjct: 561 DFFEYNARNQITLWGPTGE-----ISDYASKAWAGLISSYYKPRWSIFVDYLGE 609


>gi|423241433|ref|ZP_17222546.1| hypothetical protein HMPREF1065_03169 [Bacteroides dorei
           CL03T12C01]
 gi|392641326|gb|EIY35103.1| hypothetical protein HMPREF1065_03169 [Bacteroides dorei
           CL03T12C01]
          Length = 754

 Score =  238 bits (607), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 176/597 (29%), Positives = 268/597 (44%), Gaps = 112/597 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
           M+NL GWGGP P SW +QQ  LQKKIL R+ E GM+PVLP +SG +P+ L       S K
Sbjct: 192 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 251

Query: 59  ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
             +  +  S +S       +       +L   DP F +I   F E+  K YG TS  Y+ 
Sbjct: 252 EKKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTQIANLFYEETEKLYG-TSDYYSI 310

Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
           D F E        ++    G AI   M+  +  AVW++QGW  + +P  RP  MKAL   
Sbjct: 311 DPFHEAKSLPAGLDF-GKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 362

Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGI 218
              G L++LDLF+E +P+W       G+P IW             C+L NF GN+ ++G 
Sbjct: 363 -NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGR 414

Query: 219 LDSIAFG-------PVEA---------------------------RTSENTTMAWINQYS 244
           +D +          P+ A                           R  + T   WI QY 
Sbjct: 415 MDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYI 474

Query: 245 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 304
             RYG    +I+ AW +L + +YNC  G   +             SI           G+
Sbjct: 475 RARYGTDDESIRQAWQILANGIYNCPAGNNQQG---------PHESIFC---------GR 516

Query: 305 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 364
           P    ++   + SS+     +Y  +    A  L ++  ++   +N + YDL+D+TRQA+A
Sbjct: 517 P----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIA 572

Query: 365 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 424
             A  ++   +  ++  D       +R+FLEL+   D LL     F +G W++ A+ L  
Sbjct: 573 DRARIVYNYAVADFKSFDKKNYATHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGI 632

Query: 425 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
             E++  YEWNAR QIT W +    +   LRDY +K W+GLLRD+Y  R   Y++ + + 
Sbjct: 633 TSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQ 692

Query: 485 LE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
           L+              + D   +     +W  L   W   +N+Y   + GD +  ++
Sbjct: 693 LDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNIYAASAEGDCIEVAK 746


>gi|265753065|ref|ZP_06088634.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|263236251|gb|EEZ21746.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
          Length = 750

 Score =  236 bits (603), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 177/597 (29%), Positives = 266/597 (44%), Gaps = 112/597 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
           M+NL GWGGP P SW +QQ  LQKKIL R+ E GM+PVLP +SG +P+ L       S K
Sbjct: 188 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 247

Query: 59  ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
             +  +  S +S       +       +L   DP F  I   F E+  K YG TS  Y+ 
Sbjct: 248 EEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 306

Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
           D F E    + +       G AI   M+  +  AVW++QGW  + +P  RP  MKAL   
Sbjct: 307 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 358

Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGI 218
              G L++LDLF+E +P+W       G+P IW             C+L NF GN+ ++G 
Sbjct: 359 -NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGR 410

Query: 219 LDSIAFG-------PVEA---------------------------RTSENTTMAWINQYS 244
           +D +          P+ A                           R  + T   WI QY 
Sbjct: 411 MDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYI 470

Query: 245 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 304
             RYG    +IQ AW +L + +YNC  G   +             SI           G+
Sbjct: 471 RARYGTDDESIQQAWQILTNGIYNCPAGNNQQG---------PHESIFC---------GR 512

Query: 305 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 364
           P    ++   + SS+     +Y  +    A  L ++  ++   +N + YDL+D+TRQA+A
Sbjct: 513 P----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIA 568

Query: 365 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 424
             A  ++   +  ++  D       +R+FLEL+   D LL     F +G W++ A+ L  
Sbjct: 569 DRARIVYNYAVADFKSFDKKNYNTHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGI 628

Query: 425 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
             E++  YEWNAR QIT W +    +   LRDY +K W+GLLRD+Y  R   Y++ + + 
Sbjct: 629 TPEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQ 688

Query: 485 LE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
           L+              + D   +     +W  L   W   +N Y   + GD +  ++
Sbjct: 689 LDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIEVAK 742


>gi|212695333|ref|ZP_03303461.1| hypothetical protein BACDOR_04880 [Bacteroides dorei DSM 17855]
 gi|212662112|gb|EEB22686.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides dorei DSM 17855]
          Length = 754

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 177/597 (29%), Positives = 265/597 (44%), Gaps = 112/597 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
           M+NL GWGGP P SW +QQ  LQKKIL R+ E GM+PVLP +SG +P+ L       S K
Sbjct: 192 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 251

Query: 59  ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
             +     S +S       +       +L   DP F  I   F E+  K YG TS  Y+ 
Sbjct: 252 EEKTAGDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 310

Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
           D F E    + +       G AI   M+  +  AVW++QGW  + +P  RP  MKAL   
Sbjct: 311 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 362

Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGI 218
              G L++LDLF+E +P+W       G+P IW             C+L NF GN+ ++G 
Sbjct: 363 -NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGR 414

Query: 219 LDSIAFG-------PVEA---------------------------RTSENTTMAWINQYS 244
           +D +          P+ A                           R  + T   WI QY 
Sbjct: 415 MDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYI 474

Query: 245 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 304
             RYG    +IQ AW +L + +YNC  G   +             SI           G+
Sbjct: 475 RARYGTDDESIQQAWQILTNGIYNCPAGNNQQG---------PHESIFC---------GR 516

Query: 305 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 364
           P    ++   + SS+     +Y  +    A  L ++  ++   +N + YDL+D+TRQA+A
Sbjct: 517 P----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIA 572

Query: 365 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 424
             A  ++   +  ++  D       +R+FLEL+   D LL     F +G W++ A+ L  
Sbjct: 573 DRARIVYNYAVADFKSFDKKNYNTHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGI 632

Query: 425 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
             E++  YEWNAR QIT W +    +   LRDY +K W+GLLRD+Y  R   Y++ + + 
Sbjct: 633 TPEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQ 692

Query: 485 LE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
           L+              + D   +     +W  L   W   +N Y   + GD +  ++
Sbjct: 693 LDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIEVAK 746


>gi|345513909|ref|ZP_08793424.1| alpha-N-acetylglucosaminidase [Bacteroides dorei 5_1_36/D4]
 gi|345456132|gb|EEO45798.2| alpha-N-acetylglucosaminidase [Bacteroides dorei 5_1_36/D4]
          Length = 754

 Score =  236 bits (602), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 176/597 (29%), Positives = 266/597 (44%), Gaps = 112/597 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
           M+NL GWGGP P SW +QQ  LQKKIL R+ E GM+PVLP +SG +P+ L       S K
Sbjct: 192 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 251

Query: 59  ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
             +  +  S +S       +       +L   DP F  I   F E+  K YG TS  Y+ 
Sbjct: 252 EEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 310

Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
           D F E    + +       G AI   M+  +  AVW++QGW  + +P  RP  MKAL   
Sbjct: 311 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 362

Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGI 218
              G L++LDLF+E +P+W       G+P IW             C+L NF GN+ ++G 
Sbjct: 363 -NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGR 414

Query: 219 LDSIAFG-------PVEA---------------------------RTSENTTMAWINQYS 244
           +D +          P+ A                           R  + T   WI QY 
Sbjct: 415 MDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYI 474

Query: 245 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 304
             RYG    +I+ AW +L + +YNC  G   +             SI           G+
Sbjct: 475 RARYGTDDESIRQAWQILANGIYNCPAGNNQQG---------PHESIFC---------GR 516

Query: 305 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 364
           P    ++   + SS+     +Y  +    A  L ++  ++   +N + YDL+D+TRQA+A
Sbjct: 517 P----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIA 572

Query: 365 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 424
             A  ++   +  ++  D       +R+FLEL+   D LL     F +G W++ A+ L  
Sbjct: 573 DRARIVYNYAVADFKSFDKKNYATHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGI 632

Query: 425 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
             E++  YEWNAR QIT W +    +   LRDY +K W+GLLRD+Y  R   Y++ + + 
Sbjct: 633 TSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQ 692

Query: 485 LE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
           L+              + D   +     +W  L   W   +N Y   + GD +  ++
Sbjct: 693 LDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIEVAK 746


>gi|237711645|ref|ZP_04542126.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 9_1_42FAA]
 gi|229454340|gb|EEO60061.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 9_1_42FAA]
          Length = 732

 Score =  236 bits (601), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 176/597 (29%), Positives = 266/597 (44%), Gaps = 112/597 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
           M+NL GWGGP P SW +QQ  LQKKIL R+ E GM+PVLP +SG +P+ L       S K
Sbjct: 170 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 229

Query: 59  ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
             +  +  S +S       +       +L   DP F  I   F E+  K YG TS  Y+ 
Sbjct: 230 EEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 288

Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
           D F E    + +       G AI   M+  +  AVW++QGW  + +P  RP  MKAL   
Sbjct: 289 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 340

Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGI 218
              G L++LDLF+E +P+W       G+P IW             C+L NF GN+ ++G 
Sbjct: 341 -NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGR 392

Query: 219 LDSIAFG-------PVEA---------------------------RTSENTTMAWINQYS 244
           +D +          P+ A                           R  + T   WI QY 
Sbjct: 393 MDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYI 452

Query: 245 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 304
             RYG    +I+ AW +L + +YNC  G   +             SI           G+
Sbjct: 453 RARYGTDDESIRQAWQILANGIYNCPAGNNQQG---------PHESIFC---------GR 494

Query: 305 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 364
           P    ++   + SS+     +Y  +    A  L ++  ++   +N + YDL+D+TRQA+A
Sbjct: 495 P----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIA 550

Query: 365 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 424
             A  ++   +  ++  D       +R+FLEL+   D LL     F +G W++ A+ L  
Sbjct: 551 DRARIVYNYAVADFKSFDKKNYATHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGI 610

Query: 425 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
             E++  YEWNAR QIT W +    +   LRDY +K W+GLLRD+Y  R   Y++ + + 
Sbjct: 611 TSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQ 670

Query: 485 LE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
           L+              + D   +     +W  L   W   +N Y   + GD +  ++
Sbjct: 671 LDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIEVAK 724


>gi|404487206|ref|ZP_11022393.1| hypothetical protein HMPREF9448_02854 [Barnesiella intestinihominis
           YIT 11860]
 gi|404335702|gb|EJZ62171.1| hypothetical protein HMPREF9448_02854 [Barnesiella intestinihominis
           YIT 11860]
          Length = 731

 Score =  236 bits (601), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 171/578 (29%), Positives = 259/578 (44%), Gaps = 96/578 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL G+GGP+ Q ++D+Q  LQ+K+L R+ EL M PV   F G VP +L+  FP A I 
Sbjct: 196 MGNLEGFGGPVSQKFIDRQTDLQQKMLRRMRELDMAPVFQGFYGMVPNSLKEKFPEANIK 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
           + G W + +          LD  DPLF +I   + E+Q K +G+  + +  D F E   +
Sbjct: 256 EQGEWQTYQRPA------FLDPNDPLFDKIADIYYEEQEKLFGKAVY-FAGDPFHEGGQS 308

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D    + +    I   M+    +AVW++QGW        R P M+ LL  +  G+ +
Sbjct: 309 EGID----VKAAAKKILKAMRRKTPEAVWIIQGWQ-------RNP-MRDLLEGLEHGEAI 356

Query: 179 VLDLFAEVKPIWSTSKQ--FYGVP------YIWCMLHNFAGNIEMYGILDSIAFGPVEA- 229
           +LDL A  +P W   K   FY         +IWC L NF G   ++G + S A G V A 
Sbjct: 357 ILDLMACERPQWGGIKNSLFYKAEGHMHHDWIWCALPNFGGKTGLHGKMSSYASGVVFAK 416

Query: 230 ---------------------------------RTSENTTMAWINQYSVRRYGRSVPAIQ 256
                                            R        W+NQY+  RYG++ P   
Sbjct: 417 NHPLGKNLCGIGTAPEGIGTIPVVYDMVYDMAWREDSIDIKDWVNQYTQYRYGKADPNCN 476

Query: 257 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 316
            AW +L  T+Y C +         I A P                        +      
Sbjct: 477 RAWEILSKTIYECHNEIGGPVESYICARP------------------------SDTIKHA 512

Query: 317 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 376
           SS+    ++Y  +E++ A E      +E + S TY+YDL+DLTRQ L  YA  L    + 
Sbjct: 513 SSWGTAEIFYDPAEIVTAWECMYNVRHEFAQSETYQYDLVDLTRQVLGDYAKYLHKQAVN 572

Query: 377 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 436
           A+  ND  G    S +FL L+ D D LL+    F +G W+  A+  A   ++++++  NA
Sbjct: 573 AFYRNDLKGFQTYSSKFLVLIRDEDKLLSTRKEFNVGTWINQARNAACTPQEQERFVANA 632

Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
           + QIT W ++     S L DY  K WSGL+RD Y PR   +  Y + +L  G+  +  D 
Sbjct: 633 KRQITTWTNHD----SKLHDYALKEWSGLMRDMYLPRWKAWVDYKL-ALLRGETAQEPD- 686

Query: 497 RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
              + ++  +W +    Y   S G+A+   + +Y KY 
Sbjct: 687 ---YFQIEKNWVDSDTRYDSTSTGNAISAVEEIYKKYF 721


>gi|423230938|ref|ZP_17217342.1| hypothetical protein HMPREF1063_03162 [Bacteroides dorei
           CL02T00C15]
 gi|423244649|ref|ZP_17225724.1| hypothetical protein HMPREF1064_01930 [Bacteroides dorei
           CL02T12C06]
 gi|392630058|gb|EIY24060.1| hypothetical protein HMPREF1063_03162 [Bacteroides dorei
           CL02T00C15]
 gi|392641498|gb|EIY35274.1| hypothetical protein HMPREF1064_01930 [Bacteroides dorei
           CL02T12C06]
          Length = 754

 Score =  236 bits (601), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 176/597 (29%), Positives = 266/597 (44%), Gaps = 112/597 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
           M+NL GWGGP P SW +QQ  LQKKIL R+ E GM+PVLP +SG +P+ L       S K
Sbjct: 192 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 251

Query: 59  ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
             +  +  S +S       +       +L   DP F  I   F E+  K YG TS  Y+ 
Sbjct: 252 EEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 310

Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
           D F E    + +       G AI   M+  +  AVW++QGW  + +P  RP  MKAL   
Sbjct: 311 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 362

Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGI 218
              G L++LDLF+E +P+W       G+P IW             C+L NF GN+ ++G 
Sbjct: 363 -NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGR 414

Query: 219 LDSIAFG-------PVEA---------------------------RTSENTTMAWINQYS 244
           +D +          P+ A                           R  + T   WI QY 
Sbjct: 415 MDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYI 474

Query: 245 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 304
             RYG    +I+ AW +L + +YNC  G   +             SI           G+
Sbjct: 475 RARYGTDDESIRQAWQILANGIYNCPAGNNQQG---------PHESIFC---------GR 516

Query: 305 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 364
           P    ++   + SS+     +Y  +    A  L ++  ++   +N + YDL+D+TRQA+A
Sbjct: 517 P----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIA 572

Query: 365 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 424
             A  ++   +  ++  D       +R+FLEL+   D LL     F +G W++ A+ L  
Sbjct: 573 DRARIVYNYAVADFKSFDKKNYATHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGI 632

Query: 425 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
             E++  YEWNAR QIT W +    +   LRDY +K W+GLLRD+Y  R   Y++ + + 
Sbjct: 633 TSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQ 692

Query: 485 LE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
           L+              + D   +     +W  L   W   +N Y   + GD +  ++
Sbjct: 693 LDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIEVAK 746


>gi|261199246|ref|XP_002626024.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis SLH14081]
 gi|239594232|gb|EEQ76813.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis SLH14081]
          Length = 752

 Score =  235 bits (599), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 151/529 (28%), Positives = 257/529 (48%), Gaps = 73/529 (13%)

Query: 1   MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
             NL G WGG   P  W D Q  LQKKIL R+ ELGM P+LPAF G VP A+  V P A+
Sbjct: 201 FGNLQGSWGGGNTPFKWYDAQFELQKKILARMSELGMTPILPAFPGYVPRAVTRVLPDAQ 260

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           +     W  +  +P++  T  L   DP  + + ++FI + ++ YG  +H Y  D F+E  
Sbjct: 261 VVNASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISKSIEAYGNVTHFYTLDQFNEMI 318

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNS-VPLGK 176
           P    PE++  +       ++S D +A W+MQGWLF  +  +W   +++A L++      
Sbjct: 319 PSSGDPEFLRKVSETTMEAIKSVDPEATWVMQGWLFYIFADYWTTERIEAYLSAGKKFRD 378

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
           +++LDLFAE  P+W  +K F+G  ++WC +  F GN  +YG + +I  GP +A       
Sbjct: 379 MLILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGLYGHVANITEGPAQAMAQHPNM 438

Query: 237 MA--------------------------------WINQYSVRRY---GRSVP-AIQDAWN 260
           +                                 + + +  RRY   GR+VP  + +AW 
Sbjct: 439 VGVGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHDWVTRRYSSHGRTVPNELYEAWQ 498

Query: 261 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 320
           +L  + YN T+         +V  P +  ++ + +           +K  +L  E     
Sbjct: 499 LLRLSAYNNTN---------LVDAPLLPHALFAASPSIN-------AKMPMLFIEG---- 538

Query: 321 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
              L Y  +++++A  L I     L   ++Y+YD++D+TRQ L+     +  ++   Y+ 
Sbjct: 539 ---LLYDPADMLKAWGLMIKGA--LFGDSSYQYDIVDVTRQVLSDAFTLVLQDLKVKYKG 593

Query: 381 NDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ-YEWNART 438
                VF  +  + L +++ +D +L+ ++ F L  W+ +A+  A ++ +    +E NAR 
Sbjct: 594 GAPASVFMPIGDKLLIILKALDAVLSMNENFWLSSWISAARASAGDDSEAADFFEHNARN 653

Query: 439 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
           QIT+W      E  +L DY  K W+GL+  YY PR  ++ +Y+ ++  S
Sbjct: 654 QITIW----GSEVGVLDDYAQKQWAGLVSGYYTPRWRMFLEYLKDTPAS 698


>gi|392584963|gb|EIW74305.1| glycoside hydrolase family 89 protein [Coniophora puteana
           RWD-64-598 SS2]
          Length = 772

 Score =  234 bits (597), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 156/534 (29%), Positives = 255/534 (47%), Gaps = 93/534 (17%)

Query: 8   GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFS 67
           GG LPQ W+D QL LQK+I+ RI ELGM PVLPAF G VP A+  +FP+A I     +  
Sbjct: 221 GGKLPQEWMDAQLALQKQIVPRIVELGMTPVLPAFPGFVPPAMHTLFPNASIVNGSEYPG 280

Query: 68  VKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYI 127
           + +  ++     L   DPL+ ++  +F+ +Q +  G  +H++  D ++EN+P      Y+
Sbjct: 281 IPA--QYSNDSFLAPFDPLYAQLQSSFLAKQTEALGNVTHVWTIDQYNENSPYSGDLTYL 338

Query: 128 SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 186
           +++  + ++ +++ D DA+WLMQGWLF  D PFW   ++ A L+ +P   +++LDLF++V
Sbjct: 339 ANIANSTFASLRAHDPDAIWLMQGWLFFADEPFWTSDRVDAYLDQIPNDGMIILDLFSDV 398

Query: 187 KPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--------- 237
            P W     + G  ++WC +H+F GN+ + G    +  GPV+A  S N++M         
Sbjct: 399 YPQWQRLDSYRGKSWVWCEVHDFGGNMGLEGNFSVVTNGPVDALNSPNSSMKGVGLAMEG 458

Query: 238 ------------------------AWINQYSVRRY-------------GRSVPAIQ-DAW 259
                                   A+   ++ RR+               S+PA   +AW
Sbjct: 459 LEGNEIIYDVLLDQAWSAAPLDRDAYAKAWATRRFHLPTANSSTTTATNTSIPASAIEAW 518

Query: 260 NVLYHTVYNCTD----GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 315
             L  TVY+ T+    GAT               S+I +       Y  P S      + 
Sbjct: 519 QTLASTVYSSTNPNVWGATK--------------SLIELAPSLGGMYSAPSSTIIFYDTN 564

Query: 316 TSSYDHPHLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFL 372
           TS             ++ AL   +A+G     L A + +R D ID+ RQ LA    + + 
Sbjct: 565 TS-------------LVPALRGLVAAGTSAPALWALDEFRTDSIDVARQLLANRFADAYT 611

Query: 373 NIIEAYQLN--DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA---QNEE 427
               AY  +   +  +   + R +++++D+D LL  H+ +LL   + SA+  A    +E 
Sbjct: 612 ATTGAYNASGPGSAALNATAARMMQIIDDLDRLLMTHEPYLLSSRIASARAWAGDGGDEA 671

Query: 428 QEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 481
                E+ AR+Q+T+W        S+L DY +K W GL+  YY  R   + +YM
Sbjct: 672 YADYLEYEARSQVTLW----GPVPSVLNDYASKVWGGLVGTYYRQRWTAFVEYM 721


>gi|336371253|gb|EGN99592.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336384013|gb|EGO25161.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 761

 Score =  234 bits (597), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 159/538 (29%), Positives = 260/538 (48%), Gaps = 80/538 (14%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G WGG LP+ W++ Q VLQK+IL R+ ELGM PVLP+F+G VP A+  ++P+A I  
Sbjct: 208 NIQGSWGGDLPEQWINDQFVLQKQILARMVELGMTPVLPSFTGFVPRAMHTLYPNASIVN 267

Query: 62  LGNW--FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
              W  F+++          L+  DPLF  +  +F+ +    YG  SHIY  D ++E  P
Sbjct: 268 GSQWSTFTIQH----TNDSFLEPFDPLFSTLQTSFMTKYAAAYGNVSHIYTLDQYNEMMP 323

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLG-KL 177
              +  Y+SS+ +A ++ +++ D +AVW+MQGWLF  Y  FW   +++A L  VP    +
Sbjct: 324 YSGNTSYLSSISSATFASLRATDPEAVWMMQGWLFYIYASFWTDERVEAYLGGVPGNDSM 383

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LDLF+E  P W     ++G  +IWC LH+F GN+   G  +++   PV+A  +   TM
Sbjct: 384 IILDLFSEAYPQWQRLNSYFGKQWIWCELHDFGGNMGFEGNFENVTTQPVKALATPGNTM 443

Query: 238 -----------------------AW----------INQYSVRRYGRSVP----AIQDAWN 260
                                  AW          ++ ++ RRY  +VP    A  +AW 
Sbjct: 444 VGMGLTMEGQEGNEIMYDVLFDQAWSPTPINRTSYVSAWTSRRY--NVPNLPTAATEAWE 501

Query: 261 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 320
           +L  TVYN  D         I         ++++T  +    G P               
Sbjct: 502 ILASTVYNNQDPLLQATIKSIFELEPAINGLVNLTVLQ----GIPTG------------- 544

Query: 321 HPHLWYST-SEVIRALELFIASGNELSASN---TYRYDLIDLTRQALAKYANELFLNIIE 376
              L+Y T + ++ AL+  + +  E SA +    ++YD++ + RQ LA    +L+ ++++
Sbjct: 545 ---LFYDTNTTIVPALQSLLQARQESSALDEVPEFQYDVVYIIRQLLANRFIDLYTSLVD 601

Query: 377 AYQLNDAHGVFQL--SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-E 433
            Y    +            + L++D+D +L     FLL  W+ +A+  A +      Y E
Sbjct: 602 TYNSTTSSSSDVSTAGAPLITLLKDVDSVLLTDTHFLLSNWISAARNWAHDNSTYAAYLE 661

Query: 434 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF 491
           +NAR QIT+W    +     + DY +K W GL+  YY  R   +  Y+  S  +G  +
Sbjct: 662 YNARNQITLWGPRGE-----VHDYASKQWGGLVGTYYVQRWEEFVSYLSGSKANGTAY 714


>gi|400595379|gb|EJP63180.1| alpha-N-acetylglucosaminidase [Beauveria bassiana ARSEF 2860]
          Length = 761

 Score =  234 bits (597), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 159/525 (30%), Positives = 250/525 (47%), Gaps = 56/525 (10%)

Query: 1   MSNLHG-WGGP--LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSA 57
             N+HG WGG   L   W++QQ  LQKKI+ R+ ELG+ PVLP F G VPAAL+ + P  
Sbjct: 205 FGNIHGTWGGEGRLSAEWINQQFALQKKIVARMVELGITPVLPGFPGFVPAALKKLRPDV 264

Query: 58  KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
            I +   W  V  +     T  L+ TD  + E+   FI+ Q+KE+G  +++Y  D F+E 
Sbjct: 265 NIAEAPVWVDVPRNN--TATAFLNPTDKTYAELQSLFIKNQIKEFGNVTNVYTVDQFNEI 322

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVP-LG 175
            P     +YI+ + ++ Y G+ + +  A+WLMQGWLF S   FW   ++ A L   P   
Sbjct: 323 NPSSGDTKYITDVSSSTYKGITAANPAAIWLMQGWLFYSSQSFWTQQRVDAYLAGPPGQD 382

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
            +++LDLF+E +P W  ++ ++G P+IWC LH+F GN  ++G + ++         ++N+
Sbjct: 383 DMIILDLFSESQPQWQRTRSYFGRPWIWCELHDFGGNQALHGKITNV---------TQNS 433

Query: 236 TMAWINQYSVRRYGRSVPA-----------IQDAW--------NVLYHTVYNCTDGATDK 276
             A     S+  YG +              +  AW        N       N    A   
Sbjct: 434 VQALKESGSIVGYGLTPEGYEGNEVVYDILLDQAWEGSPIDTANYFRAWARNRYSAAGII 493

Query: 277 NRDVIVAFPDV--------DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 328
             DV  A+  +        D +I SV    YQ +  P  K  V  + T  Y  P      
Sbjct: 494 PEDVFTAWEQLRQHAYDVQDNAIPSVGVSVYQLF--PSLKGLV--NRTGHYPPPTALQYD 549

Query: 329 SEVIRAL-ELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
            +V++ +  LF  S  +   L     +  D +D+TRQ L     +++ +++  +Q     
Sbjct: 550 PKVMKNIWHLFYNSTIDSPGLLQIPAFHLDFVDVTRQVLGNAFIDIYTDLVNQFQATANA 609

Query: 385 GVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            V Q L    L  +ED+D  L  ++ F    WL SA+   Q+        +NAR+Q+T+W
Sbjct: 610 TVIQDLGNSMLSFIEDLDMALNTNEHFTFKKWLNSAESWGQSIGAPDAVAFNARSQVTVW 669

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 488
                 E+  L DY  K WSG+++ YYG R  I+   ++ + E G
Sbjct: 670 ----STESRALDDYAAKAWSGIVKSYYGERWRIFINSLVSAREQG 710


>gi|239615395|gb|EEQ92382.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis ER-3]
          Length = 829

 Score =  234 bits (596), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 151/530 (28%), Positives = 254/530 (47%), Gaps = 79/530 (14%)

Query: 3   NLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           NL G WGG   P  W D Q  LQKKIL R+ ELGM P+LPAF G VP A+  V P A++ 
Sbjct: 223 NLQGSWGGGNTPFKWYDAQFELQKKILARMSELGMTPILPAFPGYVPRAVTRVLPDAQVV 282

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
               W  +  +P++  T  L   DP  + + ++FI + ++ YG  +H Y  D F+E  P 
Sbjct: 283 NASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISKSIEAYGNVTHFYTLDQFNEMIPS 340

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNS-VPLGKLV 178
              P+++  +       ++S D +A W+MQGWLF  +  +W   +++A L++      ++
Sbjct: 341 SGDPKFLRKVSETTMEAIKSVDPEATWVMQGWLFYIFADYWTTERIEAYLSAGKKFRDML 400

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA 238
           +LDLFAE  P+W  +K F+G  ++WC +  F GN  +YG + +I  GP EA       + 
Sbjct: 401 ILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGLYGHVANITEGPAEAMAQHPNMVG 460

Query: 239 --------------------------------WINQYSVRRYG---RSVPA-IQDAWNVL 262
                                           + + +  RRY    R+VP+ + +AW +L
Sbjct: 461 VGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHDWVTRRYSSHERTVPSELYEAWQLL 520

Query: 263 YHTVYNCTD--GATDKNRDVIVAFPDVDPSI-ISVTEGKYQNYGKPVSKEAVLKSETSSY 319
             + YN T+   A      +  A P ++  + +   EG                      
Sbjct: 521 RLSAYNNTNLVDAPLLPHALFAASPSINAKMPMLFIEG---------------------- 558

Query: 320 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 379
               L Y  +++++A  L I     L   ++Y+YD++D+TRQ L+     +  ++   Y+
Sbjct: 559 ----LLYDPADMLKAWGLMIKGA--LFGDSSYQYDIVDVTRQVLSDAFTLVLQDLKVKYK 612

Query: 380 LNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ-YEWNAR 437
                 VF  +  + L +++ +D +L+ ++ F L  W+ +A+  A +E +    +E NAR
Sbjct: 613 GGAPASVFMPIGDKLLIILKALDAVLSMNENFWLSSWISAARASAGDESEAADFFEHNAR 672

Query: 438 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
            QIT+W      E  +L DY  K W+GL+  YY PR  ++ +Y+ ++  S
Sbjct: 673 NQITIW----GSEVGVLDDYAQKQWAGLVSGYYTPRWRMFLEYLKDTPAS 718


>gi|393788286|ref|ZP_10376416.1| hypothetical protein HMPREF1068_02696 [Bacteroides nordii
           CL02T12C05]
 gi|392655959|gb|EIY49600.1| hypothetical protein HMPREF1068_02696 [Bacteroides nordii
           CL02T12C05]
          Length = 757

 Score =  233 bits (595), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 170/586 (29%), Positives = 264/586 (45%), Gaps = 103/586 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGPLP++ +D+   L KKI+ R  ELGM P+   FSG VP  L+  +P+A I 
Sbjct: 196 MQNLQSYGGPLPKTVIDKHAALGKKIISRQLELGMQPIQQGFSGYVPRELKEKYPTANIN 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q  +W   K   +      LD TD LF  +GRAF+E+Q + +G    +Y  D F E+ PP
Sbjct: 256 QQRSWCGFKGAAQ------LDPTDSLFTRMGRAFLEEQARLFG-AHGVYAADPFHESAPP 308

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +D+PEY+ ++G  I+   +  D  + W MQ W    D          ++ +VP   L++L
Sbjct: 309 IDTPEYLKAVGERIHHLFRDFDPHSTWAMQSWSLRED----------IVKAVPKDALLIL 358

Query: 181 DLFAEVKPIWSTSKQ-FYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           DL  +     STSK  F+G   +   LHNF G I M+G L  +A          N  +  
Sbjct: 359 DLNGK-----STSKALFWGYSTVVGNLHNFGGRINMHGDLKLLASNQYSKAKRLNPAVCG 413

Query: 238 -------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                          AW+ QY+ RRYG   PA Q+AW +L +  
Sbjct: 414 SGLFMEAVEQNPVYYELAFEMPCHADSINLQAWLKQYATRRYGAFSPAAQEAWLLLLNGP 473

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y      T+K+  ++ A P +D               K     A L+          + Y
Sbjct: 474 YRRGTNGTEKS-SIVAARPALDV--------------KKSGPNAALE----------IPY 508

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             + VIRA  L +   ++LS S  YR+D++D+ RQ +      +     EA++  D    
Sbjct: 509 DPTLVIRAQSLLLKDIDKLSVSRPYRFDIVDVQRQLMTNLGQLIHRQAAEAFRKKDQCAF 568

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
              S RFLE++ DMD LL     +    WL  A+     +E++   E +A + +T+W  +
Sbjct: 569 TLHSGRFLEMLADMDKLLRTRSEYSFDRWLTEARSWGDTDEEKNLMERDATSLVTIWGAD 628

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD----WRREWIK 502
                  + DY  + WSGL+  YY PR   ++  + + L+ G  +        + RE  +
Sbjct: 629 GDPR---IFDYSWREWSGLISGYYLPRWQKFYAMLQQHLDVGTSYEEAGLPLIYGREAFR 685

Query: 503 LTNDWQN-------------GRNVYPVESNGDALITSQWLYNKYLQ 535
             ND+ N             G+   P+ + GD +I  + L++KYL+
Sbjct: 686 -ANDFYNGLAEWELAYVDTYGKARTPI-TEGDEIIMVKQLFDKYLK 729


>gi|345517325|ref|ZP_08796802.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 4_3_47FAA]
 gi|345457718|gb|EET14396.2| alpha-N-acetylglucosaminidase [Bacteroides sp. 4_3_47FAA]
          Length = 754

 Score =  233 bits (595), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 175/603 (29%), Positives = 262/603 (43%), Gaps = 124/603 (20%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA------------ 48
           M+NL GWGGP P SW  QQ  LQKKIL R+ E GM+PVLP +SG +P+            
Sbjct: 192 MNNLEGWGGPNPDSWYKQQEDLQKKILKRMKEWGMHPVLPGYSGMIPSKLDLGKRIDGGK 251

Query: 49  ---ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRT 105
               L N    +  + L  W             +L   DP F +I   F E+  K YG T
Sbjct: 252 EEKTLSNTSSESAQSTLNKWNGFDRPG------ILLPDDPKFTQIASLFYEETEKLYG-T 304

Query: 106 SHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQM 165
           S  Y+ D F E    + +       G AI   M+  +  AVW++QGW  + +P  RP  M
Sbjct: 305 SDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKANPKAVWVVQGW--TENP--RPEMM 359

Query: 166 KALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGN 212
           KAL      G L++LDLF+E +P+W       G+P IW             C+L NF GN
Sbjct: 360 KAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGN 408

Query: 213 IEMYGILDSIAFG-------PVEA---------------------------RTSENTTMA 238
           + ++G +D +          P+ A                           R  + T   
Sbjct: 409 VGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEE 468

Query: 239 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 298
           WI QY   RYG    +I  AW +L + +YNC  G   +             SI       
Sbjct: 469 WIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQQG---------PHESIFC----- 514

Query: 299 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 358
               G+P    ++   + SS+     +Y  +    A  L ++  ++   +N + YDL+D+
Sbjct: 515 ----GRP----SLNNFQASSWSKMCNYYDPTTTAEAARLMVSVAHKYRGNNNFEYDLVDI 566

Query: 359 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 418
           TRQA+A  A  ++   +  ++  D       +R+FLEL+   D LL     F +G W++ 
Sbjct: 567 TRQAIADRARIVYNYAVADFKSFDKKSYATHTRQFLELLIMQDKLLGTRKEFKVGNWIQQ 626

Query: 419 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
           A+ L    E++  YEWNAR QIT W +    +   LRDY +K W+GLLRD+Y  R   Y+
Sbjct: 627 ARNLGSTSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYW 686

Query: 479 KYMIESLE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALI 524
           + + + L+              + D   +     +W  L   W   +N Y   + GD + 
Sbjct: 687 QVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIE 743

Query: 525 TSQ 527
            ++
Sbjct: 744 VAK 746


>gi|294777713|ref|ZP_06743164.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
 gi|294448781|gb|EFG17330.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
          Length = 752

 Score =  233 bits (595), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 175/603 (29%), Positives = 262/603 (43%), Gaps = 124/603 (20%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA------------ 48
           M+NL GWGGP P SW  QQ  LQKKIL R+ E GM+PVLP +SG +P+            
Sbjct: 190 MNNLEGWGGPNPDSWYKQQEDLQKKILKRMKEWGMHPVLPGYSGMIPSKLDLGKRIDGGK 249

Query: 49  ---ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRT 105
               L N    +  + L  W             +L   DP F +I   F E+  K YG T
Sbjct: 250 EEKTLSNTSSESAQSTLNKWNGFDRPG------ILLPDDPKFTQIASLFYEETEKLYG-T 302

Query: 106 SHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQM 165
           S  Y+ D F E    + +       G AI   M+  +  AVW++QGW  + +P  RP  M
Sbjct: 303 SDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKANPKAVWVVQGW--TENP--RPEMM 357

Query: 166 KALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGN 212
           KAL      G L++LDLF+E +P+W       G+P IW             C+L NF GN
Sbjct: 358 KAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGN 406

Query: 213 IEMYGILDSIAFG-------PVEA---------------------------RTSENTTMA 238
           + ++G +D +          P+ A                           R  + T   
Sbjct: 407 VGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEE 466

Query: 239 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 298
           WI QY   RYG    +I  AW +L + +YNC  G   +             SI       
Sbjct: 467 WIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQQG---------PHESIFC----- 512

Query: 299 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 358
               G+P    ++   + SS+     +Y  +    A  L ++  ++   +N + YDL+D+
Sbjct: 513 ----GRP----SLNNFQASSWSKMCNYYDPTTTAEAARLMVSVAHKYRGNNNFEYDLVDI 564

Query: 359 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 418
           TRQA+A  A  ++   +  ++  D       +R+FLEL+   D LL     F +G W++ 
Sbjct: 565 TRQAIADRARIVYNYAVADFKSFDKKSYATHTRQFLELLIMQDKLLGTRKEFKVGNWIQQ 624

Query: 419 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
           A+ L    E++  YEWNAR QIT W +    +   LRDY +K W+GLLRD+Y  R   Y+
Sbjct: 625 ARNLGSTSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYW 684

Query: 479 KYMIESLE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALI 524
           + + + L+              + D   +     +W  L   W   +N Y   + GD + 
Sbjct: 685 QVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIE 741

Query: 525 TSQ 527
            ++
Sbjct: 742 VAK 744


>gi|319640296|ref|ZP_07995021.1| hypothetical protein HMPREF9011_00618 [Bacteroides sp. 3_1_40A]
 gi|317388071|gb|EFV68925.1| hypothetical protein HMPREF9011_00618 [Bacteroides sp. 3_1_40A]
          Length = 752

 Score =  233 bits (595), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 175/603 (29%), Positives = 262/603 (43%), Gaps = 124/603 (20%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA------------ 48
           M+NL GWGGP P SW  QQ  LQKKIL R+ E GM+PVLP +SG +P+            
Sbjct: 190 MNNLEGWGGPNPDSWYKQQEDLQKKILKRMKEWGMHPVLPGYSGMIPSKLDLGKRIDGGK 249

Query: 49  ---ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRT 105
               L N    +  + L  W             +L   DP F +I   F E+  K YG T
Sbjct: 250 EEKTLSNTSSESAQSTLNKWNGFDRPG------ILLPDDPKFTQIASLFYEETEKLYG-T 302

Query: 106 SHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQM 165
           S  Y+ D F E    + +       G AI   M+  +  AVW++QGW  + +P  RP  M
Sbjct: 303 SDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKANPKAVWVVQGW--TENP--RPEMM 357

Query: 166 KALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGN 212
           KAL      G L++LDLF+E +P+W       G+P IW             C+L NF GN
Sbjct: 358 KAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGN 406

Query: 213 IEMYGILDSIAFG-------PVEA---------------------------RTSENTTMA 238
           + ++G +D +          P+ A                           R  + T   
Sbjct: 407 VGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEE 466

Query: 239 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 298
           WI QY   RYG    +I  AW +L + +YNC  G   +             SI       
Sbjct: 467 WIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQQG---------PHESIFC----- 512

Query: 299 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 358
               G+P    ++   + SS+     +Y  +    A  L ++  ++   +N + YDL+D+
Sbjct: 513 ----GRP----SLNNFQASSWSKMCNYYDPTTTAEAARLMVSVAHKYRGNNNFEYDLVDI 564

Query: 359 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 418
           TRQA+A  A  ++   +  ++  D       +R+FLEL+   D LL     F +G W++ 
Sbjct: 565 TRQAIADRARIVYNYAVADFKSFDKKSYATHTRQFLELLIMQDKLLGTRKEFKVGNWIQQ 624

Query: 419 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
           A+ L    E++  YEWNAR QIT W +    +   LRDY +K W+GLLRD+Y  R   Y+
Sbjct: 625 ARNLGSTSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYW 684

Query: 479 KYMIESLE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALI 524
           + + + L+              + D   +     +W  L   W   +N Y   + GD + 
Sbjct: 685 QVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIE 741

Query: 525 TSQ 527
            ++
Sbjct: 742 VAK 744


>gi|212541222|ref|XP_002150766.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
           18224]
 gi|210068065|gb|EEA22157.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
           18224]
          Length = 787

 Score =  233 bits (595), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 168/526 (31%), Positives = 257/526 (48%), Gaps = 76/526 (14%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
           + N+ G WG PLP  W++ Q  LQKKIL R+ ELG+ PVLP+F+G VP A+  V P+AK+
Sbjct: 203 LGNIQGFWGDPLPNEWIESQFELQKKILARMVELGITPVLPSFTGFVPRAITRVLPNAKV 262

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
                W    S+  + C   L+  D  F  + ++ I +Q   YG  SHIY  D ++EN P
Sbjct: 263 VPGSRWNVFSSN--YTCDTFLEPFDDNFALLQKSTISKQQAYYGNISHIYALDQYNENNP 320

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK-LV 178
              +P+Y+ ++       +++ D DAVWLMQ WLF    FW    + A L+ V     ++
Sbjct: 321 FSSNPDYLRNISRTTSQSLKAADPDAVWLMQSWLFLDATFWNNVTICAYLSGVENNSDML 380

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM- 237
           +LDLFAE +P+W  +  +YG P+IWC +H++ GN+ +YG + +I      A  S  + + 
Sbjct: 381 ILDLFAESQPVWQLTDSYYGKPWIWCQVHDYGGNMGLYGQIMNITENATAALASSGSMVG 440

Query: 238 ---------------------AW------INQY-----SVRRYG-RSVP-AIQDAWNVLY 263
                                AW       +QY     +VR  G + VP  + DAW +L 
Sbjct: 441 FGHTMESQEGNEIVYDLLLDQAWSETPINTSQYFEDWVTVRYAGTQHVPQQLFDAWEILR 500

Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISV--TEGKYQNYGKPVSKEAVLKSETSSYDH 321
            + YN T+ A+      I+   +++PSI  +   EG +                T +YD 
Sbjct: 501 WSAYNNTNLASSSVPKSIL---ELEPSISGLLNREGHHPT--------------TINYD- 542

Query: 322 PHLWYSTSEVIRALEL-FIASGNELSA--SNTYRYDLIDLTRQALAKYANELFLNIIEAY 378
           P L      V+ A  L + A+  ELS   +  + YDLI LTRQ L       +  +I  Y
Sbjct: 543 PEL------VVEAWALTYEAALLELSLWDNPAFNYDLIFLTRQVLVNAFIPRYELLISFY 596

Query: 379 QLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ-NEEQEKQYEWN 435
              +     +    R+ ++L++ +D +L  ++ F L  W+  A   A  N      YE+N
Sbjct: 597 NNENYSVPAIVSAGRQLIDLLQSLDTVLGTNECFQLAQWINKAVSRAHGNTTLAAYYEYN 656

Query: 436 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 481
           AR QIT+W  N +     + DY +K W+GL+  YY PR  I   Y+
Sbjct: 657 ARNQITLWGPNGE-----ISDYASKQWAGLISSYYVPRWQILVDYL 697


>gi|327356744|gb|EGE85601.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis ATCC 18188]
          Length = 752

 Score =  233 bits (594), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 150/532 (28%), Positives = 254/532 (47%), Gaps = 79/532 (14%)

Query: 1   MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
             NL G WGG   P  W D Q  LQKKIL R+ ELGM P+LPAF G VP A+  V P A+
Sbjct: 201 FGNLQGSWGGGNTPFKWYDAQFELQKKILARMSELGMTPILPAFPGYVPRAVTRVLPDAQ 260

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           +     W  +  +P++  T  L   DP  + + ++FI + ++ YG  +H Y  D F+E  
Sbjct: 261 VVNASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISKSIEAYGNVTHFYTLDQFNEMI 318

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNS-VPLGK 176
           P    P+++  +       ++S D +A W+MQGWLF  +  +W   +++A L++      
Sbjct: 319 PSSGDPKFLRKVSETTMEAIKSVDPEATWVMQGWLFYIFADYWTTERIEAYLSAGKKFRD 378

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
           +++LDLFAE  P+W  +K F+G  ++WC +  F GN  +YG + +I  GP EA       
Sbjct: 379 MLILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGLYGHVANITEGPAEAMAQHPNM 438

Query: 237 MA--------------------------------WINQYSVRRYG---RSVPA-IQDAWN 260
           +                                 + + +  RRY    R+VP+ + +AW 
Sbjct: 439 VGVGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHDWVTRRYSSHERTVPSELYEAWQ 498

Query: 261 VLYHTVYNCTD--GATDKNRDVIVAFPDVDPSI-ISVTEGKYQNYGKPVSKEAVLKSETS 317
           +L  + YN T+   A      +  A P ++  + +   EG                    
Sbjct: 499 LLRLSAYNNTNLVDAPLLPHALFAASPSINAKMPMLFIEG-------------------- 538

Query: 318 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEA 377
                 L Y  +++++A  L I     L   ++Y+YD++D+TRQ L+     +  ++   
Sbjct: 539 ------LLYDPADMLKAWGLMIKGA--LFGDSSYQYDIVDVTRQVLSDAFTLVLQDLKVK 590

Query: 378 YQLNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ-YEWN 435
           Y+      VF  +  + L +++ +D +L+ ++ F L  W+ +A+  A ++ +    +E N
Sbjct: 591 YKGGAPASVFMPIGDKLLIILKALDAVLSMNENFWLSSWISAARASAGDDSEAADFFEHN 650

Query: 436 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
           AR QIT+W      E  +L DY  K W+GL+  YY PR  ++ +Y+ ++  S
Sbjct: 651 ARNQITIW----GSEVGVLDDYAQKQWAGLVSGYYTPRWRMFLEYLKDTPAS 698


>gi|323344412|ref|ZP_08084637.1| alpha-N-acetylglucosaminidase [Prevotella oralis ATCC 33269]
 gi|323094539|gb|EFZ37115.1| alpha-N-acetylglucosaminidase [Prevotella oralis ATCC 33269]
          Length = 730

 Score =  233 bits (593), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 167/568 (29%), Positives = 252/568 (44%), Gaps = 79/568 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP+ WL+ Q  LQKKIL R     M PVLPAF+G+VPA L+ +FP A I 
Sbjct: 198 MANIDRWNGPLPKEWLNGQKELQKKILARERAFNMKPVLPAFAGHVPAELKRIFPDANIK 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W   +   ++ C + L   +PLF +I + ++E+Q   +G T HIY  D F+E  PP
Sbjct: 258 SLGKWGGFEE--KYLC-HFLSPEEPLFSKIQKLYLEEQTALFG-TDHIYGVDPFNEVEPP 313

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              P Y+  +   +Y  + + D  A W+  GW+FSYD   W P +++A L  VP GK+ +
Sbjct: 314 SWEPAYLRKVSKNMYGTLTAVDPKAEWMQMGWMFSYDNKHWTPDRVQAFLTGVPKGKMSL 373

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
           LD + E   +W T+  FYG PYIWC L NF GN  + G                      
Sbjct: 374 LDYYCENVELWKTTDGFYGQPYIWCYLGNFGGNTTLMGNVKESGRRLDNALANGQRNMLG 433

Query: 219 -------LDSIAFGPVEARTSENTTMA-----WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                  LD I F P E   ++  + A     WI+  + R YG   P+++ AW++L++ +
Sbjct: 434 AGSTLEGLDVIQF-PYEYLYNKLWSHAVADSRWIDDLADRHYGGVSPSVRKAWHILFNDI 492

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y                       + +  +G   N+ +P             Y    L  
Sbjct: 493 Y---------------------VQVSASMQGVLTNF-RPALNNNYPHRTAIEYPAERL-- 528

Query: 327 STSEVIR-ALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
              EV R  L++     NEL      + D+I + RQ L      +      AY   D   
Sbjct: 529 --EEVWRLLLDVPRCDRNEL------QLDIIAVGRQVLGNRFAVVKTQFDSAYANKDIPR 580

Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
           +   +    EL+ D+D L + +    +  W++ A++L   +E +  YE NAR  IT W  
Sbjct: 581 LKAKACEMEELLGDLDRLTSFNSRCSINRWIDDARKLGSTKELKDYYEKNARNLITTWGG 640

Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
           N       + DY ++ W GL+  YY  R  +Y   ++ + E+   F    +  +  K   
Sbjct: 641 N-------INDYASRTWGGLIGSYYAHRWRLYIDDILAAAEANKEFDQNAFNEKVSKFEQ 693

Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKY 533
            W        V    D L   + L  KY
Sbjct: 694 AWIISTEPITVPKRTDLLTFCRILIQKY 721


>gi|424665881|ref|ZP_18102917.1| hypothetical protein HMPREF1205_01756 [Bacteroides fragilis HMW
           616]
 gi|404574134|gb|EKA78885.1| hypothetical protein HMPREF1205_01756 [Bacteroides fragilis HMW
           616]
          Length = 732

 Score =  233 bits (593), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 168/577 (29%), Positives = 257/577 (44%), Gaps = 96/577 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGP+ Q ++D+Q  LQKK++ R+ E GM PVL  F G VP ++   FP+A I 
Sbjct: 198 MGNLEKFGGPVSQQFIDRQTQLQKKMIDRMREYGMEPVLQGFYGMVPNSMITKFPNADIR 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W + +          L  +DPLF ++ + F E+Q K +G+ S  Y  D F E  N+
Sbjct: 258 DAGKWITYQRPA------FLVPSDPLFAKVAQIFYEEQEKLFGK-SRYYGGDPFHEGGNS 310

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             ++    I+   + IY  M++ + DA+W++QGW    +P +      ALL  +  G+ +
Sbjct: 311 EGIN----ITEAASDIYKAMKANNPDAIWVLQGW--GANPSY------ALLKGLKQGEAL 358

Query: 179 VLDLFAEVKPIW-----STSKQFYGV---PYIWCMLHNFAGNIEMYGILDSIAFGPVEA- 229
           +LDL +  +P W     S S +  G     +IWC L NF G I MYG L S A G + A 
Sbjct: 359 ILDLMSCARPQWGGDPSSQSHREDGYLDHNWIWCALPNFGGRIGMYGKLQSYATGVIRAE 418

Query: 230 ---------------------------------RTSENTTMAWINQYSVRRYGRSVPAIQ 256
                                            RT      +WI  Y+  RYG      +
Sbjct: 419 HHPKGKYVCGVGTTPEGIGTNPIDYDMVYDMAWRTDSIDVKSWIANYTTYRYGSPNNNAK 478

Query: 257 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 316
            A   L  +VYNC   A         A P +                            T
Sbjct: 479 AAMQQLSTSVYNCPWAADGPQESYFCARPSLKID------------------------RT 514

Query: 317 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 376
           SS+   HL+Y    V++ALE  + + NEL   +TYRYD++D+TRQ LA Y   +   I +
Sbjct: 515 SSWGTAHLYYQPINVLQALEHLLKAENELKEIDTYRYDVVDVTRQMLADYGKYIHKCIAD 574

Query: 377 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 436
           AY   D       + +FL+++ D D LL+    FLLG ++  A     N  +++ +  NA
Sbjct: 575 AYYGKDTEKFDFYTSKFLQMISDQDLLLSTRKEFLLGKFIRQADACGSNPMEKRMFINNA 634

Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
           + QIT W        S L +Y +K W+G+L   Y PR   YF Y+   LE  +   +   
Sbjct: 635 KRQITTW----ASVNSSLHEYAHKEWNGILGTLYAPRWKAYFDYLRTKLEGKNPKEI--- 687

Query: 497 RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
             ++  +  DW   +  +        +  ++ +Y+ Y
Sbjct: 688 --DFFTMETDWVESKKEFSAVPIKKEIEIAKTIYHNY 722


>gi|399028591|ref|ZP_10729778.1| Alpha-N-acetylglucosaminidase (NAGLU) [Flavobacterium sp. CF136]
 gi|398073682|gb|EJL64846.1| Alpha-N-acetylglucosaminidase (NAGLU) [Flavobacterium sp. CF136]
          Length = 727

 Score =  232 bits (591), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 161/549 (29%), Positives = 257/549 (46%), Gaps = 82/549 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++   GPLPQ W++++  +QKKIL R+  LGM+PV+PAFSG VP A     P +KI+
Sbjct: 209 MGNINSLEGPLPQEWINKKENVQKKILQRMRALGMHPVVPAFSGYVPKAFAEKHPGSKIS 268

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L +W    S   +  TYLLDA DPLF EIG+ FIE   K YG+ +  Y  D F+E TPP
Sbjct: 269 ELKSW----SGGGFESTYLLDANDPLFKEIGKRFIEIYTKLYGQ-ADFYLADAFNEITPP 323

Query: 121 VDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK 176
           V      E +S  G  I+  +     DA W+MQGWLF  +  FW     KA L+ VP  +
Sbjct: 324 VSKEHKYEELSDYGKTIFETINEASPDATWVMQGWLFGDNKEFWTKEATKAFLSKVPNDR 383

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD---------------- 220
           +++ D   +   +W   + FYG  + +  +HN+ G+  +YG L+                
Sbjct: 384 MMIQDYANDRHKVWEKQEAFYGKQWTYGYVHNYGGSNPVYGDLNFYKNELTHLLGNSNKG 443

Query: 221 -SIAFGPVEARTSENTTM-----------------AWINQYSVRRYGR--SVPAIQDAWN 260
             + +G +    + N+ +                  W+N+Y   RYG+  S P  Q AW 
Sbjct: 444 NVVGYGVMPEGLNNNSIVYEYIYDLPWSQGKESVNDWLNKYLSARYGKNISTPVFQ-AWK 502

Query: 261 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 320
           +L  +VY+     T    D   A+       + +TE K    G P  K+           
Sbjct: 503 LLIESVYSTKYWETRWWDDRAGAYLFFKRPTLKITEFK----GNPGDKQ----------- 547

Query: 321 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
                    ++ +AL++        + ++ Y YDL+D++R   +   ++L +  + AY+L
Sbjct: 548 ---------KLKQALDILKRESKSFNKNSLYFYDLLDMSRHYYSLCIDDLLIECVTAYEL 598

Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
            D     +L ++  +   D+D +L+      L  WL+SA     + E  K Y  NA+T I
Sbjct: 599 KDIKKADELFKKIEKQALDIDNMLSGQPLNSLNNWLKSASDYGSSPEVSKLYVKNAKTLI 658

Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RL 493
           T+W          L DY ++ W G+ + +Y PR  ++ +   ES+ +   F        +
Sbjct: 659 TLWGGEGH-----LNDYASRSWRGMYKGFYWPRWKMFLQAQRESVVNNTSFDELKVRESI 713

Query: 494 KDWRREWIK 502
           K W  +W +
Sbjct: 714 KQWEIKWCQ 722


>gi|383120707|ref|ZP_09941431.1| hypothetical protein BSIG_2292 [Bacteroides sp. 1_1_6]
 gi|382984934|gb|EES68331.2| hypothetical protein BSIG_2292 [Bacteroides sp. 1_1_6]
          Length = 736

 Score =  231 bits (589), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 157/526 (29%), Positives = 244/526 (46%), Gaps = 92/526 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ  LQKKI+ R+ ELG+ PV P ++G VP  +          
Sbjct: 197 MNNLEGWGGPNPDSWYRQQEALQKKIIARMRELGIEPVFPGYAGMVPRNIGE-------- 248

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
           +LG  + +    +WC       L   D  F      + E+  K YG+  + Y+ D F E 
Sbjct: 249 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 305

Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
            NT  VD    ++  G +I S M+  + +AVW+MQ         W+    +A+++++  G
Sbjct: 306 GNTEGVD----LAKAGTSIMSAMKKANPEAVWVMQA--------WQANPREAMVSTLDSG 353

Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG--- 225
            L+VLDL++E  P       +W   K F    +++CML NF GN+ ++G ++ +  G   
Sbjct: 354 DLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMEQLVNGYYN 413

Query: 226 ---PVEART-----------SENTTM-----------------AWINQYSVRRYGRSV-P 253
               V  +T             N  M                 AW+  Y   RYG  + P
Sbjct: 414 ACAHVNGKTLRGVGATPEGIENNPVMFELLYELPWREERFAPDAWLQAYLKARYGNDLSP 473

Query: 254 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
            + +AW  L HTVYN       +             S++    G +Q+            
Sbjct: 474 EVAEAWRALEHTVYNAPKNYQGEG---------TVESLLCARPGFHQD------------ 512

Query: 314 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 373
             TS++ +  L+YS     +A  L ++  ++   +N + YDL+D+ RQ+LA   N L   
Sbjct: 513 -RTSTWGYAKLFYSPDSTAKAARLLLSVADQYKGNNNFEYDLVDVVRQSLADKGNVLLEE 571

Query: 374 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 433
           I ++Y   D     + S++FLEL+   D LL+    F +  WL +A+ L   EE++K YE
Sbjct: 572 ISQSYDRKDKDSFGKQSQQFLELILAQDSLLSTRKEFSVSSWLNAARSLGTTEEEKKLYE 631

Query: 434 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 479
           WNA   IT+W D+       L DY ++ WSG+L+D Y  R   +F+
Sbjct: 632 WNASALITVWGDSIAANRGGLHDYSHREWSGILKDLYYQRWKTFFE 677


>gi|29349767|ref|NP_813270.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29341678|gb|AAO79464.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 744

 Score =  231 bits (589), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 157/526 (29%), Positives = 244/526 (46%), Gaps = 92/526 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ  LQKKI+ R+ ELG+ PV P ++G VP  +          
Sbjct: 205 MNNLEGWGGPNPDSWYRQQEALQKKIIARMRELGIEPVFPGYAGMVPRNIGE-------- 256

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
           +LG  + +    +WC       L   D  F      + E+  K YG+  + Y+ D F E 
Sbjct: 257 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 313

Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
            NT  VD    ++  G +I S M+  + +AVW+MQ         W+    +A+++++  G
Sbjct: 314 GNTEGVD----LAKAGTSIMSAMKKANPEAVWVMQA--------WQANPREAMVSTLDSG 361

Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG--- 225
            L+VLDL++E  P       +W   K F    +++CML NF GN+ ++G ++ +  G   
Sbjct: 362 DLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMEQLVNGYYN 421

Query: 226 ---PVEART-----------SENTTM-----------------AWINQYSVRRYGRSV-P 253
               V  +T             N  M                 AW+  Y   RYG  + P
Sbjct: 422 ACAHVNGKTLRGVGATPEGIENNPVMFELLYELPWREERFAPDAWLQAYLKARYGNDLSP 481

Query: 254 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
            + +AW  L HTVYN       +             S++    G +Q+            
Sbjct: 482 EVAEAWRALEHTVYNAPKNYQGEG---------TVESLLCARPGFHQD------------ 520

Query: 314 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 373
             TS++ +  L+YS     +A  L ++  ++   +N + YDL+D+ RQ+LA   N L   
Sbjct: 521 -RTSTWGYAKLFYSPDSTAKAARLLLSVADQYKGNNNFEYDLVDVVRQSLADKGNVLLEE 579

Query: 374 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 433
           I ++Y   D     + S++FLEL+   D LL+    F +  WL +A+ L   EE++K YE
Sbjct: 580 ISQSYDRKDKDSFGKQSQQFLELILAQDSLLSTRKEFSVSSWLNAARSLGTTEEEKKLYE 639

Query: 434 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 479
           WNA   IT+W D+       L DY ++ WSG+L+D Y  R   +F+
Sbjct: 640 WNASALITVWGDSIAANRGGLHDYSHREWSGILKDLYYQRWKTFFE 685


>gi|189465172|ref|ZP_03013957.1| hypothetical protein BACINT_01517 [Bacteroides intestinalis DSM
           17393]
 gi|189437446|gb|EDV06431.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides intestinalis DSM
           17393]
          Length = 723

 Score =  231 bits (588), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 168/587 (28%), Positives = 266/587 (45%), Gaps = 109/587 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
           M+NL GWGGP P SW  QQ  LQKKIL R+ E G+ PV P +SG VP  A + +  +   
Sbjct: 192 MNNLEGWGGPNPDSWYAQQEALQKKILKRMREYGIKPVFPGYSGMVPHDADEKLGLNLTK 251

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
           + L N F+  +         L  TD  F EI   +  +Q K +G+  + Y+ D F E  N
Sbjct: 252 SDLWNGFTRPA--------FLQPTDARFAEIADLYYREQEKLFGKADY-YSMDPFHEAEN 302

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
              VD      + G AI + M+  +  A W++QGW  + +P  RP  +K + N    G L
Sbjct: 303 AASVD----FDAAGKAIMTAMKKVNPKATWVVQGW--TENP--RPEMIKNMQN----GDL 350

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGILDSI-- 222
           ++LDLF+E +P+W       G+P IW             CML NF GN+ ++G +D +  
Sbjct: 351 LILDLFSECRPMW-------GIPSIWKRDKGYEQHDWLFCMLLNFGGNVGLHGRMDQLLN 403

Query: 223 -----AFGPVEART---------SENTTM------------------AWINQYSVRRYGR 250
                   P+             SEN  M                   W+  Y   RYG 
Sbjct: 404 NFYLTKNNPLATHLKGIGLTMEGSENNAMMFELMCELPWRPEKFTKEEWLKDYLFARYGV 463

Query: 251 SVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSK 308
               I+ AW +L +T+YNC  G   +  +  +    P ++ +  + +  K +NY  P   
Sbjct: 464 RDEKIEQAWTLLANTIYNCPFGNNQQGPHESIFCGRPSLN-NFQASSWSKMKNYYDPTVT 522

Query: 309 EAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYAN 368
           E                        A  L +   ++   +N + YDL+D+ RQ+L+    
Sbjct: 523 E-----------------------EAARLMLEVADKYRGNNNFEYDLVDIVRQSLSDKGR 559

Query: 369 ELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 428
            ++   I  ++  D     + SR+FL+++   D LL     F +G W+E A++L    E+
Sbjct: 560 IVYNRTIADFKSFDKRSFARDSRKFLDILLLQDKLLGTRSEFRVGRWIEQARKLGTTPEE 619

Query: 429 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 488
           +  YEWNAR QIT W +    +   LRDY +K W+G+LRD+Y  R A Y++ + + L+  
Sbjct: 620 KDLYEWNARVQITTWGNRVCADDGGLRDYAHKEWNGILRDFYYKRWAAYWQTLQDQLDGK 679

Query: 489 DGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
              +L     ++  +   W   +N Y   S G+ +  ++  + K  +
Sbjct: 680 PEVKL-----DYYAMEEPWTLAKNPYGSTSEGNCVDVAKEAFEKVFE 721


>gi|423248233|ref|ZP_17229249.1| hypothetical protein HMPREF1066_00259 [Bacteroides fragilis
           CL03T00C08]
 gi|423253182|ref|ZP_17234113.1| hypothetical protein HMPREF1067_00757 [Bacteroides fragilis
           CL03T12C07]
 gi|392657082|gb|EIY50719.1| hypothetical protein HMPREF1067_00757 [Bacteroides fragilis
           CL03T12C07]
 gi|392660340|gb|EIY53954.1| hypothetical protein HMPREF1066_00259 [Bacteroides fragilis
           CL03T00C08]
          Length = 732

 Score =  230 bits (586), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 162/533 (30%), Positives = 239/533 (44%), Gaps = 91/533 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGP+ Q ++D+Q  LQKK+L R+ E GM PVL  F G VP ++   FP+A I 
Sbjct: 198 MGNLEKFGGPVSQQFIDRQTKLQKKMLDRMREYGMEPVLQGFYGMVPNSMITKFPNADIR 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W + +          L  +DPLF ++   F E+Q K +G  S  Y  D F E  N+
Sbjct: 258 NAGKWITYQRPA------FLVPSDPLFAKVAEIFYEEQKKLFGE-SRYYGGDPFHEGGNS 310

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             ++    I+   + IY  M++ + +A+W++QGW  S +P        ALL  +  G+ +
Sbjct: 311 KGIN----ITEAASNIYKAMKTNNPNAIWVLQGW--SGNP------SVALLKGLKHGEAL 358

Query: 179 VLDLFAEVKPIWSTSKQ--------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA- 229
           VLDL A  +P W             F    +IWC L NF G I MYG L S A G ++A 
Sbjct: 359 VLDLMACARPQWGGEPSSSFHREDGFLDHNWIWCALPNFGGRIGMYGKLQSYATGVIKAE 418

Query: 230 ---------------------------------RTSENTTMAWINQYSVRRYGRSVPAIQ 256
                                            RT      +WI  Y+  RYG      +
Sbjct: 419 HHPKGKYVCGIGTTPEGIGTNPINYDMVYDMAWRTDSIDIKSWIANYTTYRYGSENSNAK 478

Query: 257 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 316
            A   L  +VYNC   A         A P +    +                        
Sbjct: 479 AAMLQLSTSVYNCPWAADGPQESYFCARPSLKIDYV------------------------ 514

Query: 317 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 376
           SS+   HL+Y    V++ALE  + +  EL   +TYRYD++D+TRQ LA Y   +   I +
Sbjct: 515 SSWGTAHLYYQPINVLQALEHLLKAEKELGYIDTYRYDVVDITRQMLADYGKYIHKCISD 574

Query: 377 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 436
           AY+  +       + +FL+++ D D LL+    FLLG ++  A     N  +++ +  NA
Sbjct: 575 AYKEKNIKKFDLYTSKFLQMILDQDLLLSTRKEFLLGEYIRQADTCGSNPTEKRMFINNA 634

Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD 489
           + QIT W        S L +Y +K W+G+L   Y PR  +YF Y+   LE  +
Sbjct: 635 KRQITSW----TSVNSSLHEYAHKEWNGILSTLYAPRWKVYFDYLHAKLEGKN 683


>gi|423269877|ref|ZP_17248849.1| hypothetical protein HMPREF1079_01931 [Bacteroides fragilis
           CL05T00C42]
 gi|423272668|ref|ZP_17251615.1| hypothetical protein HMPREF1080_00268 [Bacteroides fragilis
           CL05T12C13]
 gi|392700723|gb|EIY93885.1| hypothetical protein HMPREF1079_01931 [Bacteroides fragilis
           CL05T00C42]
 gi|392708745|gb|EIZ01850.1| hypothetical protein HMPREF1080_00268 [Bacteroides fragilis
           CL05T12C13]
          Length = 732

 Score =  230 bits (586), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 162/533 (30%), Positives = 239/533 (44%), Gaps = 91/533 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGP+ Q ++D+Q  LQKK+L R+ E GM PVL  F G VP ++   FP+A I 
Sbjct: 198 MGNLEKFGGPVSQQFIDRQTKLQKKMLDRMREYGMEPVLQGFYGMVPNSMITKFPNADIR 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W + +          L  +DPLF ++   F E+Q K +G  S  Y  D F E  N+
Sbjct: 258 DAGKWITYQRPA------FLVPSDPLFAKVAEIFYEEQKKLFGE-SRYYGGDPFHEGGNS 310

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             ++    I+   + IY  M++ + +A+W++QGW  S +P        ALL  +  G+ +
Sbjct: 311 KGIN----ITEAASNIYKAMKTNNPNAIWVLQGW--SGNP------SVALLKGLKHGEAL 358

Query: 179 VLDLFAEVKPIWSTSKQ--------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA- 229
           VLDL A  +P W             F    +IWC L NF G I MYG L S A G ++A 
Sbjct: 359 VLDLMACARPQWGGEPSSSFHREDGFLDHNWIWCALPNFGGRIGMYGKLQSYATGVIKAE 418

Query: 230 ---------------------------------RTSENTTMAWINQYSVRRYGRSVPAIQ 256
                                            RT      +WI  Y+  RYG      +
Sbjct: 419 HHPKGKYVCGIGTTPEGIGTNPINYDMVYDMAWRTDSIDIKSWIANYTTYRYGSENSNAK 478

Query: 257 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 316
            A   L  +VYNC   A         A P +    +                        
Sbjct: 479 AAMLQLSTSVYNCPWAADGPQESYFCARPSLKIDYV------------------------ 514

Query: 317 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 376
           SS+   HL+Y    V++ALE  + +  EL   +TYRYD++D+TRQ LA Y   +   I +
Sbjct: 515 SSWGTAHLYYQPINVLQALEHLLKAEKELGYIDTYRYDVVDITRQMLADYGKYIHKCISD 574

Query: 377 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 436
           AY+  +       + +FL+++ D D LL+    FLLG ++  A     N  +++ +  NA
Sbjct: 575 AYKEKNIKKFDLYTSKFLQMILDQDLLLSTRKEFLLGEYIRQADTCGSNPTEKRMFINNA 634

Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD 489
           + QIT W        S L +Y +K W+G+L   Y PR  +YF Y+   LE  +
Sbjct: 635 KRQITSW----TSVNSSLHEYAHKEWNGILSTLYAPRWKVYFDYLHAKLEGKN 683


>gi|299140550|ref|ZP_07033688.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
           (NAG) [Prevotella oris C735]
 gi|298577516|gb|EFI49384.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
           (NAG) [Prevotella oris C735]
          Length = 741

 Score =  229 bits (585), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 157/526 (29%), Positives = 237/526 (45%), Gaps = 80/526 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGPLP SW +QQ  LQKKIL R++E GM PVLP F G +P   +       +T
Sbjct: 186 MNNLEGWGGPLPDSWYNQQEALQKKILKRMHEYGMQPVLPGFCGMMPHDAKAKL-GLNVT 244

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W              L  TD  F +I   +  +  K YG+ ++ Y+ D F E T  
Sbjct: 245 DGGIWNGYTRPAN------LSPTDAHFDKIADLYYAELTKLYGKANY-YSMDPFHE-TND 296

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            ++ +Y S  G  +   M+  +  A W++QGW  + +P  RP  +K + N    G L+VL
Sbjct: 297 DETIDY-SKAGCKVMEAMKRVNPKATWVIQGW--TENP--RPQMIKNMKN----GDLLVL 347

Query: 181 DLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN 234
           DLF+E +P      IW   K +    +++CML NF  N+ ++G +D +       + S  
Sbjct: 348 DLFSECRPMFGIPSIWKREKGYEQHDWLFCMLENFGANVGLHGRMDQLLHNFYSTKQSSP 407

Query: 235 TTMA----------------------------------WINQYSVRRYGRSVPAIQDAWN 260
            T                                    WI  Y   RYG++ P I+ AW 
Sbjct: 408 NTQHLKGIGFTMEGSENNPVMFELMSELPWRTECKKEDWIKGYVKARYGKTSPEIERAWQ 467

Query: 261 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 320
           +L  T+YNC  G   +             SI           G+P      +KS +   +
Sbjct: 468 LLSETIYNCPAGNNQQG---------PHESIFC---------GRPSLNNFQVKSWSKMRN 509

Query: 321 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
           +    Y     + A +L     ++   +N + YDL+D+ RQALA      +L  I  Y  
Sbjct: 510 Y----YDPQATLEAAQLMTGIADQYKGNNNFEYDLVDICRQALADQGRLQYLKTIADYNG 565

Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
                  + + RFLE++   D LL     F LG W E+A++L   ++++  YEWNAR QI
Sbjct: 566 FSRKAFAKDAHRFLEMILLQDKLLGTRTEFRLGHWTEAARKLGTTQQEKDLYEWNARVQI 625

Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 486
           T W +    +   L DY +K W G+L+D+Y  R  I+   + + +E
Sbjct: 626 TTWGNRICADKGGLHDYAHKEWQGILKDFYYKRWKIFMDALAKQME 671


>gi|160887167|ref|ZP_02068170.1| hypothetical protein BACOVA_05183 [Bacteroides ovatus ATCC 8483]
 gi|423295093|ref|ZP_17273220.1| hypothetical protein HMPREF1070_01885 [Bacteroides ovatus
           CL03T12C18]
 gi|156107578|gb|EDO09323.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
           8483]
 gi|392673999|gb|EIY67450.1| hypothetical protein HMPREF1070_01885 [Bacteroides ovatus
           CL03T12C18]
          Length = 711

 Score =  228 bits (580), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 156/526 (29%), Positives = 245/526 (46%), Gaps = 92/526 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ  LQKKI+ R+ ELG+ PV P ++G VP  +          
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVARMRELGIEPVFPGYAGMVPRNIGE-------- 247

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
           +LG  + +    +WC       L   D  F      + E+  K YG+ ++ Y+ D F E 
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKANY-YSMDPFHEG 304

Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
            NT  VD    ++  GA+I + M+  +  AVW++Q W  S       P+ + ++ S+  G
Sbjct: 305 GNTEGVD----LAKTGASIMAAMKKANPKAVWIIQAWQAS-------PR-EEMIASLNQG 352

Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
            L+VLDL++E +P       +W   K F    +++CML NF GN+ ++G ++ +  G  +
Sbjct: 353 DLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412

Query: 229 ARTSENTTM----------------------------------AWINQYSVRRYGRSV-P 253
           A    N  M                                   W+  Y   RYGR V P
Sbjct: 413 ACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWREERFSSDEWLQTYLKARYGREVSP 472

Query: 254 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
            I +AW  L HTVYN      D   +  +       S++    G + +            
Sbjct: 473 EIMEAWRALEHTVYNA---PKDYQGEGTIE------SLLCARPGFHLD------------ 511

Query: 314 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 373
             TS++ +  L+Y+     +A  LF +  ++   +N + YDL+D+ RQ+ A   N L   
Sbjct: 512 -RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNNFEYDLVDIVRQSNADKGNVLLEE 570

Query: 374 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 433
           I ++Y   D     + +++FL+L+   D LL+    F +  WL +A+ L   EE+++ YE
Sbjct: 571 ISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEFSVSSWLNAARSLGTTEEEKRLYE 630

Query: 434 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 479
           WNA   IT+W D+       L DY ++ WSGLL+D Y  R   +F+
Sbjct: 631 WNASALITVWGDSIAANQGGLHDYSHREWSGLLKDLYYQRWKAFFE 676


>gi|340347658|ref|ZP_08670763.1| alpha-N-acetylglucosaminidase [Prevotella dentalis DSM 3688]
 gi|433652542|ref|YP_007296396.1| Alpha-N-acetylglucosaminidase (NAGLU) [Prevotella dentalis DSM
           3688]
 gi|339608852|gb|EGQ13735.1| alpha-N-acetylglucosaminidase [Prevotella dentalis DSM 3688]
 gi|433303075|gb|AGB28890.1| Alpha-N-acetylglucosaminidase (NAGLU) [Prevotella dentalis DSM
           3688]
          Length = 781

 Score =  227 bits (579), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 168/540 (31%), Positives = 237/540 (43%), Gaps = 97/540 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGPLP SW  QQ  LQK+IL R  ELGM PVLP + G +P   +       +T
Sbjct: 207 MNNLEGWGGPLPDSWYRQQEALQKRILQRERELGMEPVLPGYCGMMPHDAKQKL-GLDVT 265

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W        +     L ATDP F EI   +  +Q + YG+ SH Y+ D F E +  
Sbjct: 266 PGGTWNG------YVRPANLSATDPRFDEIADLYYREQTRLYGK-SHYYSMDPFHETSDD 318

Query: 121 VDSPEYI--SSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
           V    YI  +  G  + + M+  +  A W++QGW  +  P        A+ + +P G L 
Sbjct: 319 V----YIDYAQAGRKLMAAMKRENPKANWVIQGWTENPRP--------AMTDGLPAGSLT 366

Query: 179 VLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDS------IAFGP 226
           VLDLF+E +P      IW  ++ +    +++CML NF GN+ ++G +D       +A  P
Sbjct: 367 VLDLFSECRPMFGAPSIWKRAEGYGQHDWLFCMLENFGGNVGLHGRMDQLIGNFRLATSP 426

Query: 227 VEA---------------------------------RTSENTTMA--------WINQYSV 245
                                               RT E    A        W+  Y  
Sbjct: 427 QSPLQQARRHLRGIGFTMEGSENNPIMFELMSELPWRTDEVAQAADARTFRTEWVRGYVK 486

Query: 246 RRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKP 305
            RYG   P  Q AW +L  T+YNC  G   +             SI           G+P
Sbjct: 487 ARYGTDDPHAQQAWQLLAETIYNCPAGNNQQG---------PHESIFD---------GRP 528

Query: 306 VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAK 365
                 +KS +   ++    Y  S  + A  L  A+ + L  +N Y YDL+D+ RQA+  
Sbjct: 529 SLNNFQVKSWSKMRNY----YEPSATLEAARLMAAAADRLKGNNNYEYDLVDIVRQAIDD 584

Query: 366 YANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 425
            A +++L+ I  Y   D     + S RFL L+   D LL     F LG W E+A+ L   
Sbjct: 585 QARQVYLHAIADYNGFDRRAFSRDSARFLGLLLMQDRLLGTRREFRLGRWTEAARSLGTT 644

Query: 426 EEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESL 485
             ++  YEWNAR QIT W +    +   LRDY +K W GLL D+Y  R   Y   +   +
Sbjct: 645 PAEKDLYEWNARVQITTWGNRACADQGGLRDYAHKEWQGLLADFYYMRWHTYLDALSRQM 704


>gi|449299394|gb|EMC95408.1| glycoside hydrolase family 89 protein [Baudoinia compniacensis UAMH
           10762]
          Length = 801

 Score =  227 bits (579), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 153/553 (27%), Positives = 254/553 (45%), Gaps = 96/553 (17%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G WGG LP SW+  Q  L K+I+ R+ ELGM PVLP F G VP  +   +P+A    
Sbjct: 226 NIQGSWGGDLPMSWISSQFTLGKQIVARMVELGMTPVLPCFPGFVPMQIGRYYPNAMYIN 285

Query: 62  LGNWFSVKSDPRWCCTY-LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
              W      PR       L+  DPL+  + ++FI +Q   YG  S IY  D ++EN P 
Sbjct: 286 GSQW---NGFPRQNTNVSFLEPFDPLYTTLQKSFISKQTAAYGNVSSIYTLDQYNENNPY 342

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVV 179
                Y+ ++ A   + +++ D +AVW++QGWLF S   FW    ++A L  V    +++
Sbjct: 343 SADTTYLRNISAGTIAALKAADPNAVWMLQGWLFFSSATFWTDAAIRAYLGGVNNTDMII 402

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLF+E +P W  +  +YG P+IWC LH++ GN+ +YG ++++   P++A  + ++TM  
Sbjct: 403 LDLFSETQPQWQRTNSYYGKPWIWCELHDYGGNMGLYGQVENVTINPIQALNNASSTMVG 462

Query: 238 ---------------------AW----------INQYSVRRYGRSV---PAIQDAWNVLY 263
                                AW           + +   RY  +    P +  AW+ + 
Sbjct: 463 MGLTMEGQEGNEIMYDILLDQAWSSTPLNNSLYFHDWVTSRYHGAASLPPGLYTAWDTMR 522

Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP- 322
            TVYN T  +T +   V  +  ++ P++  +                       +  HP 
Sbjct: 523 QTVYNNTQISTIQ--SVTKSIWELTPNVTGLLN--------------------RTGHHPT 560

Query: 323 HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 379
            + Y+TS ++ A + F  +  +   L  S  Y +DL D+TRQ +A     L+ + + A  
Sbjct: 561 TIQYNTSTLVGAWKQFYGAAAQEPTLWDSPGYLFDLTDVTRQVMANAFYPLYTSFVSASN 620

Query: 380 LNDAHGVFQ------LSRRFLELVEDMDGLLACH--DGFLLGPWLESAK----------- 420
            + A+  +         ++ + L+  +D +LA      F L  W+  A+           
Sbjct: 621 -HSANATYSPGNATIYGQQMVSLLSALDSMLAASPIPYFHLSTWIAEARSWSAPTATLPN 679

Query: 421 ---QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 477
               L  + +    YE+NAR QIT+W    Q     + DY +K W+GL+  YY PR  ++
Sbjct: 680 NATNLTSSSQTASFYEYNARNQITLWGPTGQ-----ISDYASKQWAGLISSYYVPRWQLF 734

Query: 478 FKYMIESLESGDG 490
             Y +    + +G
Sbjct: 735 VNYTLNGTTASNG 747


>gi|423226735|ref|ZP_17213200.1| hypothetical protein HMPREF1062_05386 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392627008|gb|EIY21049.1| hypothetical protein HMPREF1062_05386 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 718

 Score =  227 bits (579), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 165/587 (28%), Positives = 265/587 (45%), Gaps = 109/587 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
           M+NL GWGGP P SW  QQ  LQKKIL R+ E G+ PV P +SG VP  A + +  +   
Sbjct: 189 MNNLEGWGGPNPDSWYVQQEALQKKILKRMREYGIKPVFPGYSGMVPHDADEKLGLNLTK 248

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
           + L N F+  +         L  TD  F EI   + ++Q K +G+  + Y+ D F E  N
Sbjct: 249 SDLWNGFTRPA--------FLQPTDVRFAEIADLYYQEQEKLFGKVDY-YSMDPFHEAEN 299

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
              VD      + G AI + M+  +  A W++QGW  + +P  RP  +K + N    G L
Sbjct: 300 AASVD----FDAAGKAIMAAMKKVNPKATWVVQGW--TENP--RPEMIKNMQN----GDL 347

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGILDSI-- 222
           ++LDLF+E +P+W       G+P IW             CML NF GN+ ++G +D +  
Sbjct: 348 LILDLFSECRPMW-------GIPSIWKRDKGYEQHNWLFCMLLNFGGNVGLHGRMDQLLD 400

Query: 223 -----AFGPVEA---------------------------RTSENTTMAWINQYSVRRYGR 250
                   P+                             R  + T   W+  Y   RYG 
Sbjct: 401 NFYLTKNNPLAVHLKGIGLTMEGAENNPMMFELMCELPWRPEKFTKEEWLKDYLFARYGV 460

Query: 251 SVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSK 308
               I+ AW +L +T+YNC  G   +  +  +    P ++ +  + +  K +NY  P   
Sbjct: 461 RDEKIEKAWTLLANTIYNCPFGNNQQGPHESIFCGRPSLN-NFQASSWSKMKNYYDPTVT 519

Query: 309 EAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYAN 368
           E                        A  L +   ++   +N + YDL+D+ RQ+L+    
Sbjct: 520 E-----------------------EAARLMVEVADKYRGNNNFEYDLVDIVRQSLSDKGR 556

Query: 369 ELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 428
            ++   I  ++  D     + SR+FL+++   D LL     F +G W+E A+ L    E+
Sbjct: 557 IVYNRTIADFKSFDKRSFARDSRKFLDILLLQDKLLGTRSEFRVGRWIEQARNLGTTPEE 616

Query: 429 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 488
           +  YEWNAR QIT W +    +   LRDY +K W+G+LRD+Y  R A Y++ + + L+  
Sbjct: 617 KDLYEWNARVQITTWGNRVCADDGGLRDYAHKEWNGILRDFYYKRWAAYWQTLQDQLDGK 676

Query: 489 DGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
              +L     ++  +   W   +N Y     G  +  ++ ++ K ++
Sbjct: 677 PEVKL-----DYYAMEEPWTLAKNPYSSVPEGSCVDVAKEVFEKAMR 718


>gi|224537466|ref|ZP_03678005.1| hypothetical protein BACCELL_02345 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224520904|gb|EEF90009.1| hypothetical protein BACCELL_02345 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 721

 Score =  227 bits (579), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 165/587 (28%), Positives = 265/587 (45%), Gaps = 109/587 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
           M+NL GWGGP P SW  QQ  LQKKIL R+ E G+ PV P +SG VP  A + +  +   
Sbjct: 192 MNNLEGWGGPNPDSWYVQQEALQKKILKRMREYGIKPVFPGYSGMVPHDADEKLGLNLTK 251

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
           + L N F+  +         L  TD  F EI   + ++Q K +G+  + Y+ D F E  N
Sbjct: 252 SDLWNGFTRPA--------FLQPTDVRFAEIADLYYQEQEKLFGKVDY-YSMDPFHEAEN 302

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
              VD      + G AI + M+  +  A W++QGW  + +P  RP  +K + N    G L
Sbjct: 303 AASVD----FDAAGKAIMAAMKKVNPKATWVVQGW--TENP--RPEMIKNMQN----GDL 350

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGILDSI-- 222
           ++LDLF+E +P+W       G+P IW             CML NF GN+ ++G +D +  
Sbjct: 351 LILDLFSECRPMW-------GIPSIWKRDKGYEQHNWLFCMLLNFGGNVGLHGRMDQLLD 403

Query: 223 -----AFGPVEA---------------------------RTSENTTMAWINQYSVRRYGR 250
                   P+                             R  + T   W+  Y   RYG 
Sbjct: 404 NFYLTKNNPLAVHLKGIGLTMEGAENNPMMFELMCELPWRPEKFTKEEWLKDYLFARYGV 463

Query: 251 SVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSK 308
               I+ AW +L +T+YNC  G   +  +  +    P ++ +  + +  K +NY  P   
Sbjct: 464 RDEKIEKAWTLLANTIYNCPFGNNQQGPHESIFCGRPSLN-NFQASSWSKMKNYYDPTVT 522

Query: 309 EAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYAN 368
           E                        A  L +   ++   +N + YDL+D+ RQ+L+    
Sbjct: 523 E-----------------------EAARLMVEVADKYRGNNNFEYDLVDIVRQSLSDKGR 559

Query: 369 ELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 428
            ++   I  ++  D     + SR+FL+++   D LL     F +G W+E A+ L    E+
Sbjct: 560 IVYNRTIADFKSFDKRSFARDSRKFLDILLLQDKLLGTRSEFRVGRWIEQARNLGTTPEE 619

Query: 429 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 488
           +  YEWNAR QIT W +    +   LRDY +K W+G+LRD+Y  R A Y++ + + L+  
Sbjct: 620 KDLYEWNARVQITTWGNRVCADDGGLRDYAHKEWNGILRDFYYKRWAAYWQTLQDQLDGK 679

Query: 489 DGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
              +L     ++  +   W   +N Y     G  +  ++ ++ K ++
Sbjct: 680 PEVKL-----DYYAMEEPWTLAKNPYSSVPEGSCVDVAKEVFEKAMR 721


>gi|299148671|ref|ZP_07041733.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
           3_1_23]
 gi|383114572|ref|ZP_09935334.1| hypothetical protein BSGG_1257 [Bacteroides sp. D2]
 gi|298513432|gb|EFI37319.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
           3_1_23]
 gi|313693722|gb|EFS30557.1| hypothetical protein BSGG_1257 [Bacteroides sp. D2]
          Length = 711

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 154/526 (29%), Positives = 243/526 (46%), Gaps = 92/526 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ  LQKKI+ R+ ELG+ PV P ++G VP  +          
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVARMRELGIEPVFPGYAGMVPRNIGE-------- 247

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
           +LG  + +    +WC       L   D  F      + E+  K YG+ ++ Y+ D F E 
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKANY-YSMDPFHEG 304

Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
            NT  VD    ++  GA+I + M+  +  AVW++Q         W+    + ++ S+  G
Sbjct: 305 GNTEGVD----LAKTGASIMAAMKKANPKAVWIIQA--------WQANPREEMIASLNQG 352

Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
            L+VLDL++E +P       +W   K F    +++CML NF GN+ ++G ++ +  G  +
Sbjct: 353 DLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412

Query: 229 ARTSENTTM----------------------------------AWINQYSVRRYGRSV-P 253
           A    N  M                                   W+  Y   RYGR V P
Sbjct: 413 ACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWREERFSSDEWLQTYLKARYGREVSP 472

Query: 254 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
            I +AW  L HTVYN      D   +  +       S++    G + +            
Sbjct: 473 EIMEAWRALEHTVYNA---PKDYQGEGTIE------SLLCARPGFHLD------------ 511

Query: 314 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 373
             TS++ +  L+Y+     +A  LF +  ++   +N + YDL+D+ RQ+ A   N L   
Sbjct: 512 -RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNNFEYDLVDIVRQSNADKGNVLLEE 570

Query: 374 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 433
           I ++Y   D     + +++FL+L+   D LL+    F +  WL +A+ L   EE+++ YE
Sbjct: 571 ISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEFSVSSWLNAARSLGTTEEEKRLYE 630

Query: 434 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 479
           WNA   IT+W D+       L DY ++ WSGLL+D Y  R   +F+
Sbjct: 631 WNASALITVWGDSIAANQGGLHDYSHREWSGLLKDLYYQRWKAFFE 676


>gi|380694112|ref|ZP_09858971.1| alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
          Length = 736

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 158/540 (29%), Positives = 245/540 (45%), Gaps = 106/540 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ  LQKKI+ R+ ELG+ PV P ++G VP  +          
Sbjct: 199 MNNLEGWGGPNPDSWYRQQEALQKKIIARMRELGIEPVFPGYAGMVPRNIGE-------- 250

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
           +LG  + +    +WC       L   D  F      + E+  K YG+  + Y+ D F E 
Sbjct: 251 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 307

Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
            NT  VD    ++  G +I   M+  + +AVW+MQ         W+    +A++N++  G
Sbjct: 308 GNTEGVD----LAKAGTSIMGAMKKANPEAVWVMQA--------WQANPREAMVNTLDSG 355

Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
            L+VLDL++E  P       +W   K F    +++CML NF GN+ ++G ++ +  G   
Sbjct: 356 DLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMEQLVNGYYN 415

Query: 229 ARTS----------------ENTTMA------------------WINQYSVRRYGRSV-P 253
           A                   EN  M                   W+  Y   RYG  + P
Sbjct: 416 ACAHINGKTLRGVGATPEGIENNPMMFELLYELPWREERFSPDIWLQGYLKARYGDDLSP 475

Query: 254 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
            + +AW  L HTVYN                               +NY    + E++L 
Sbjct: 476 EVTEAWRALEHTVYNAP-----------------------------KNYQGEGTVESLLC 506

Query: 314 S-------ETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 366
           +        TS++ +  L+YS     +A +L ++  +    +N + YDL+D+ RQ+LA  
Sbjct: 507 ARPGFHLDRTSTWGYAKLFYSPDSTAKAAQLLLSVADRYKGNNNFEYDLVDIVRQSLADK 566

Query: 367 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 426
           AN L   I ++Y   D     + +++FL L+   D LL+    F +  WL +A+ L   E
Sbjct: 567 ANVLLEEISQSYDRKDKDSFRKQTQQFLGLILSQDSLLSTRKEFSVSSWLSAARSLGTTE 626

Query: 427 EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 486
           E++K YEWNA   IT+W D+       L DY ++ WSGLL+D Y  R   +F+   + L+
Sbjct: 627 EEKKLYEWNASALITVWGDSIAANQGGLHDYSHREWSGLLKDLYYQRWNTFFEQKQQELD 686


>gi|322702923|gb|EFY94542.1| alpha-N-acetylglucosaminidase, putative [Metarhizium anisopliae
           ARSEF 23]
          Length = 589

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 161/538 (29%), Positives = 254/538 (47%), Gaps = 85/538 (15%)

Query: 7   WGG--PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGN 64
           WGG   L   W+D Q  LQKKI+ R+ ELG+ PVLPAF G VP A   V P A  T+   
Sbjct: 28  WGGVGNLSSGWIDAQFELQKKIVARMVELGITPVLPAFPGFVPPAFSRVQPDANTTKAPR 87

Query: 65  WFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSP 124
           W  +  D     T+ L   D  +  + +AFI +Q++ +G  ++IY  D F+E  P  + P
Sbjct: 88  WTGLP-DTNTRDTF-LSPLDTSYARLQQAFISKQIEAFGNVTNIYTLDQFNEMPPTSNEP 145

Query: 125 EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG--KLVVLDL 182
            Y+S +    Y  + + +  AVWL+QGWLF     W   ++ A L   P G   ++VLDL
Sbjct: 146 SYLSQVSTYTYKALTAANPAAVWLLQGWLFLNSGLWTEERVTAYLGG-PEGHNSMLVLDL 204

Query: 183 FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTT----- 236
           ++E +P W  +K ++G P+IWC LH+F GN+ MYG +  I    ++A RTS + +     
Sbjct: 205 YSESRPQWQRTKGYFGRPWIWCQLHDFGGNMGMYGQISDITVQSMDALRTSPSLSGFGMT 264

Query: 237 ----------------MAWINQ----------YSVRRYG--RSVPAIQDAWNVLYHTVYN 268
                            AW             Y VRRY       ++  AW++L   +Y+
Sbjct: 265 PEGYEGNEVVYQMLFDQAWTTTPIDTSGYFYGYVVRRYAGVSQTNSLFQAWDILRQNIYD 324

Query: 269 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP-HLWYS 327
                 +K+R V             V  G YQN        + L + T ++  P  ++Y 
Sbjct: 325 ------NKDRQVPC-----------VGVGIYQN----APSLSGLVNRTGNWPPPTKVYYD 363

Query: 328 TSEVIRALELFIASGNELSA---SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
            + + +A  L I + NE+       T++ D++D+TRQ ++   N ++ + ++ +    + 
Sbjct: 364 PATLKKAHSLLIQAANEIPQLWDIPTFQLDVVDVTRQVMSNAFNTMYTDYVQTFNSQLSR 423

Query: 385 GVFQLSRR---------------FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 429
               +S R                L+ + D+D +LA +  F L  WL++A+  A+     
Sbjct: 424 QKSHISNRGGLQRRDDFATKGKQLLDFLTDLDRVLATNQHFRLDSWLDAAQYWAKQTGAN 483

Query: 430 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
               +NAR+QIT W      E+  L DY  K WSGL R YY  R +I+   + ++L S
Sbjct: 484 DLIAFNARSQITTWI----WESEALNDYAVKEWSGLTRSYYRGRWSIFVDGLNKALAS 537


>gi|393784337|ref|ZP_10372502.1| hypothetical protein HMPREF1071_03370 [Bacteroides salyersiae
           CL02T12C01]
 gi|392666113|gb|EIY59630.1| hypothetical protein HMPREF1071_03370 [Bacteroides salyersiae
           CL02T12C01]
          Length = 728

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 151/522 (28%), Positives = 232/522 (44%), Gaps = 87/522 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL G+GGP+   ++ +Q  LQ+K+L R+ ELGM PV   F G VP  L+  +P A+I 
Sbjct: 193 MGNLEGFGGPVSPEFIARQTDLQQKMLKRMRELGMKPVFQGFYGMVPNVLKKKYPDARIK 252

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           + G W + +          LD TDPLF  +   + E+Q K +G  +  +  D F E    
Sbjct: 253 EQGTWQTYQRPA------FLDPTDPLFDRVAAIYYEEQKKLFG-DAEFFGGDPFHEGG-- 303

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
                ++      I   M+  +  AVW++QGW   ++P      +K L++ +  G+ ++L
Sbjct: 304 TSEGIHVKLAAQKILQAMRKVNPKAVWVLQGW--QHNP------VKDLMDGLNPGETIIL 355

Query: 181 DLFAEVKPIWS--TSKQFYGVP------YIWCMLHNFAGNIEMYGILDSIAFGPVEA--- 229
           DL A  +P W   T+  F+         +IWC L NF G   ++G + S A G V A   
Sbjct: 356 DLMACERPQWGGVTTSMFHKPEGHQDHRWIWCALPNFGGKTGLHGKMSSYASGAVFAKEH 415

Query: 230 -------------------------------RTSENTTMAWINQYSVRRYGRSVPAIQDA 258
                                          RT       W+  Y+  RYG        A
Sbjct: 416 PMGRNICGIGTAPEGIGTVPVVYDMVYDMAWRTDSIQIPQWLTNYTYYRYGMEDTNCDKA 475

Query: 259 WNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSS 318
           W +L  TVY C +         I A P                        A      S+
Sbjct: 476 WKILSETVYECHNELGGPVESYICARP------------------------ADTIDHVST 511

Query: 319 YDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 378
           + +  ++Y   +++ A E    S N  +  +TY YDL+D+TRQ L+ YA  L   ++EA+
Sbjct: 512 WGNARIFYEPVKMVEAWEFLYQSRNRFNHCDTYEYDLVDVTRQVLSDYAKYLHKEMVEAF 571

Query: 379 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 438
              + +G  + S  FL++++D D LL+    F+LG WL  A+      E+++++  NA+ 
Sbjct: 572 HQKNENGFMKYSTEFLDVIKDEDRLLSTRKEFMLGTWLTEAENAGCTPEEKRRFVTNAKR 631

Query: 439 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 480
            +T W D   +    L DY NK WSGLL D+Y PR   Y  Y
Sbjct: 632 LVTTWTDRDSD----LHDYANKEWSGLLSDFYLPRWEAYVTY 669


>gi|393788556|ref|ZP_10376683.1| hypothetical protein HMPREF1068_02963 [Bacteroides nordii
           CL02T12C05]
 gi|392654236|gb|EIY47884.1| hypothetical protein HMPREF1068_02963 [Bacteroides nordii
           CL02T12C05]
          Length = 732

 Score =  226 bits (575), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 170/552 (30%), Positives = 248/552 (44%), Gaps = 96/552 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL G+GGP+   ++ +Q  LQ+K+L R+ ELGM PV   F G VP AL+  FP A+I 
Sbjct: 196 MGNLEGFGGPVTPEFIARQTDLQQKMLKRMRELGMKPVFQGFYGMVPNALKEKFPDARIK 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W + +          LD TDPLF ++   + E+Q   +G  +  +  D F E    
Sbjct: 256 DQGIWGTYQRPA------FLDPTDPLFDKLAAIYYEEQKNLFGE-AQFFGGDPFHEG--- 305

Query: 121 VDSPEYISSLGAA--IYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             + E I+   AA  I   M+  +  AVW++QGW   ++P      +K L+  V  G+ +
Sbjct: 306 -GTSEGINVKLAAQKILQAMRKVNPQAVWVLQGW--QHNP------VKELMEGVKPGETI 356

Query: 179 VLDLFAEVKPIWSTSK-QFYGVP-------YIWCMLHNFAGNIEMYGILDSIAFGPVEA- 229
           +LDL A  +P W   K   +  P       +IWC L NF G   ++G + S A GPV A 
Sbjct: 357 ILDLMACERPQWGGVKTSMFHKPEGHWNHQWIWCALPNFGGKTGLHGKMSSYASGPVFAK 416

Query: 230 ---------------------------------RTSENTTMAWINQYSVRRYGRSVPAIQ 256
                                            RT       W++ Y+  RYG       
Sbjct: 417 HHPMGKNICGIGTAPEGIGTIPVVYDMVYDMAWRTDSIHIPQWLDNYTYYRYGTEDNNCN 476

Query: 257 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 316
            AW +L  T+Y C +         I A P       S T      +G  V          
Sbjct: 477 RAWKLLSETIYECHNELGGPVESYICARP-------SDTIQHVSTWGNAV---------- 519

Query: 317 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 376
                  ++Y   +V++A +L   S    + S+TY YDL D+TRQ L+ YA  L   ++ 
Sbjct: 520 -------MFYDPMKVVKAWDLLYQSRKRFNHSDTYEYDLTDVTRQVLSDYAKYLHERMVL 572

Query: 377 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 436
           A+Q  D     + S +FL +++D D LL+    F+LG WL  A++     E+++++  NA
Sbjct: 573 AFQKKDKERFMEYSGKFLNIIKDEDRLLSTRKEFMLGTWLAEAEKAGGTPEEKRRFVTNA 632

Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
           +  IT W D   +    L DY NK WSGLL D+Y PR   Y  Y   SL  G      D+
Sbjct: 633 KRLITTWTDTDSD----LHDYANKEWSGLLIDFYLPRWEAYVTYKT-SLLYGKKLPYPDY 687

Query: 497 RR---EWIKLTN 505
            +   EW+ LTN
Sbjct: 688 SKMEQEWV-LTN 698


>gi|423223006|ref|ZP_17209475.1| hypothetical protein HMPREF1062_01661 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392640582|gb|EIY34381.1| hypothetical protein HMPREF1062_01661 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 755

 Score =  226 bits (575), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 171/589 (29%), Positives = 260/589 (44%), Gaps = 109/589 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  +GGPLP+S +D+ ++L KKIL R  ELGM P+   FSG VP  LQ  +P AKI+
Sbjct: 195 MQNIQSYGGPLPKSVIDKHVILGKKILARQLELGMQPIQQGFSGYVPRELQAKYPQAKIS 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
               W           T  LD TDPLF E+G AF+E+Q K +G +  +Y  D F E+ PP
Sbjct: 255 MKRKWCGFDG------TAQLDPTDPLFHEMGLAFLEEQDKLFG-SYGVYAADPFHESAPP 307

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +D+PEY++ +G  I+   Q+ D+ A+W+MQ W    D          ++ +VP   L++L
Sbjct: 308 IDTPEYLTGVGQTIHKLFQTFDAGALWVMQAWSMRED----------IVKAVPKESLLIL 357

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG---ILDSIAFGPVEAR------- 230
           DL        + +   +G P I   LHNF G I M+G   +L S  +   +AR       
Sbjct: 358 DLNGSK----TAANGGWGYPVIAGNLHNFGGRINMHGDLALLASNQYQKAKARYPNVCGS 413

Query: 231 ------TSENTT-----------------MAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                   +N                    AW+  Y+ RRYG    A   AW  L    Y
Sbjct: 414 GLFMEAIEQNPVYYELAFEMPNHADSIPLQAWLAAYAERRYGAKSAAAGKAWMYLLEGPY 473

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
               G     R  IVA     P++     G     G P                    Y 
Sbjct: 474 R--QGTNGTERSSIVA---ARPALNVKKSGPNAGLGIP--------------------YE 508

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
              VIRA    +   ++L+ S  YR+D++D+ RQ +      +     EA+   D     
Sbjct: 509 PMLVIRAQSQLLKDADKLAFSKPYRFDIVDVQRQMMTNLGQLVHKKAAEAFASKDKAAFV 568

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
             S RFLEL+ DMD LL     +    WL  A+   + +E++   E +A + +T+W  + 
Sbjct: 569 LHSGRFLELLRDMDELLYTRSEYSFDRWLTEARSWGETKEEKDLMERDATSLVTIWGADG 628

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF---------------- 491
                 + DY  + W+GL+  YY PR   ++  +   L++G  +                
Sbjct: 629 DPR---IFDYSWREWAGLINGYYLPRWQKFYTMLQGHLDAGTDYQEEGLSLAYGREDFRA 685

Query: 492 -----RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
                RL +W   ++      Q G+   PV ++GD L+ ++ L++KYL+
Sbjct: 686 NDFYNRLAEWELAYVD-----QTGKARTPV-THGDELVVTRRLFDKYLK 728


>gi|224537227|ref|ZP_03677766.1| hypothetical protein BACCELL_02104 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521150|gb|EEF90255.1| hypothetical protein BACCELL_02104 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 755

 Score =  224 bits (572), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 171/589 (29%), Positives = 260/589 (44%), Gaps = 109/589 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  +GGPLP+S +D+ ++L KKIL R  ELGM P+   FSG VP  LQ  +P AKI+
Sbjct: 195 MQNIQSYGGPLPKSVIDKHVILGKKILARQLELGMQPIQQGFSGYVPRELQAKYPQAKIS 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
               W           T  LD TDPLF E+G AF+E+Q K +G +  +Y  D F E+ PP
Sbjct: 255 MKRKWCGFDG------TAQLDPTDPLFHEMGLAFLEEQDKLFG-SYGVYAADPFHESAPP 307

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +D+PEY++ +G  I+   Q+ D+ A+W+MQ W    D          ++ +VP   L++L
Sbjct: 308 IDTPEYLTGVGQTIHKLFQTFDAGALWVMQAWSMRED----------IVKAVPKESLLIL 357

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG---ILDSIAFGPVEAR------- 230
           DL        + +   +G P I   LHNF G I M+G   +L S  +   +AR       
Sbjct: 358 DLNGSK----TAANGGWGYPVIAGNLHNFGGRINMHGDLALLASNQYQKAKARYPNVCGS 413

Query: 231 ------TSENTT-----------------MAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                   +N                    AW+  Y+ RRYG    A   AW  L    Y
Sbjct: 414 GLFMEAIEQNPVYYELAFEMPNHADSIPLQAWLAAYAERRYGAKSAAAGKAWMYLLEGPY 473

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
               G     R  IVA     P++     G     G P                    Y 
Sbjct: 474 R--RGTNGTERSSIVA---ARPALNVKKSGPNAGLGIP--------------------YE 508

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
              VIRA    +   ++L+ S  YR+D++D+ RQ +      +     EA+   D     
Sbjct: 509 PMLVIRAQSQLLKDADKLAFSKPYRFDIVDVQRQMMTNLGQLVHKKAAEAFASKDKAAFA 568

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
             S RFLEL+ DMD LL     +    WL  A+   + +E++   E +A + +T+W  + 
Sbjct: 569 LHSGRFLELLRDMDELLYTRSEYSFDRWLTEARSWGETKEEKDLMERDATSLVTIWGADG 628

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF---------------- 491
                 + DY  + W+GL+  YY PR   ++  +   L++G  +                
Sbjct: 629 DPR---IFDYSWREWAGLINGYYLPRWQKFYTMLQGHLDAGTDYQEEGLSLAYGREDFRA 685

Query: 492 -----RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
                RL +W   ++      Q G+   PV ++GD L+ ++ L++KYL+
Sbjct: 686 NDFYNRLAEWELAYVD-----QTGKARTPV-THGDELVVTRRLFDKYLK 728


>gi|393786624|ref|ZP_10374756.1| hypothetical protein HMPREF1068_01036 [Bacteroides nordii
           CL02T12C05]
 gi|392657859|gb|EIY51489.1| hypothetical protein HMPREF1068_01036 [Bacteroides nordii
           CL02T12C05]
          Length = 717

 Score =  224 bits (572), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 171/581 (29%), Positives = 264/581 (45%), Gaps = 105/581 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P  W  QQ+ LQKKIL R++E G+ PVLP + G VP   +         
Sbjct: 187 MNNLEGWGGPNPDHWYTQQVSLQKKILKRMHEYGIEPVLPGYCGMVPHNAK--------A 238

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L   D  F EI   + ++  K YG+ ++ Y+ D F E
Sbjct: 239 KLG--LNV-SDPGVWCGYRRPAFLQPDDSRFEEISSLYYKELEKLYGKANY-YSMDPFHE 294

Query: 117 NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
               +D    + ++G A+   M+  +  AVW++Q W  +  P         L+ ++  G 
Sbjct: 295 GGS-IDGVN-LDAVGKAVMKAMKKANPKAVWVIQAWQANPRP--------ELIRNLETGD 344

Query: 177 LVVLDLFAEVKPIWST------SKQFYGVP-YIWCMLHNFAGNIEMYGILDSI------A 223
           L++LDL +E +P W         K  YG   +++CML N+  N+ ++G +D++      A
Sbjct: 345 LLILDLTSECRPQWGDPESEWYRKDGYGKHNWVYCMLLNYGANVGLHGKMDNVIDNYYLA 404

Query: 224 FGPVEARTS-----------EN------------------TTMAWINQYSVRRYGRSVPA 254
              + AR +           EN                  T   W+  Y   RYG+  P 
Sbjct: 405 KENLRARATLKGVGMTPEGIENNPVMYELLMELPWRPERFTKEDWLKGYVKARYGKDEPV 464

Query: 255 IQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 312
           +Q AW  L +++YN     T +  +  V  A P +D          YQ            
Sbjct: 465 LQLAWGKLANSIYNAPKELTQQGTHESVFCARPGLDV---------YQ------------ 503

Query: 313 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 372
               SS+     +Y   EVI A  L ++  +    +  + YDL+D+ RQA+A+    +  
Sbjct: 504 ---VSSWSEMKDYYDPQEVIEAARLMVSVADRYRGNTNFEYDLVDVVRQAIAEKGRLMQK 560

Query: 373 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 432
            +  AY+  D       S++FL L+   D LL     F LG W+ SA+ L    E++  Y
Sbjct: 561 AVTTAYRAGDKELFAMASQKFLNLILLQDQLLGTRTEFRLGRWINSARALGVTPEEKALY 620

Query: 433 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 492
           EWN R Q+T W +    E   LRDY +K W+GLL+D+Y  R  +YF  +   +E G+   
Sbjct: 621 EWNTRVQVTTWGNRNAAERGGLRDYAHKEWNGLLKDFYYMRWKLYFDNLACKME-GETIP 679

Query: 493 LKDW---RREWIKLTNDWQNGRNVYPVESNGDALITSQWLY 530
             D+      W+K TN +Q        E  GD + T++ ++
Sbjct: 680 EIDFYAVEEAWVKRTNPYQ-------AEPEGDCVDTAKLIF 713


>gi|409042145|gb|EKM51629.1| glycoside hydrolase family 89 protein [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 749

 Score =  224 bits (572), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 148/534 (27%), Positives = 256/534 (47%), Gaps = 72/534 (13%)

Query: 6   GWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNW 65
            WGG LP  W+  Q  LQK+IL R+ ELGM P+LPAF+G VP+ +   +P+A I     W
Sbjct: 201 AWGGLLPMQWISDQQALQKQILPRMLELGMTPILPAFTGFVPSNMSAHYPNASIIDGSAW 260

Query: 66  FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPE 125
               S         L+  DPL+ ++ ++FI +Q + YG  +H Y  D ++EN P   +  
Sbjct: 261 SGFPS--TLTNVSFLEPFDPLYPQMQQSFITKQQEAYGNITHFYTLDQYNENNPFSGNDS 318

Query: 126 YISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLG-KLVVLDLF 183
           Y+SS+  +  + +++ D +A W+MQGWL FS + FW   +++A L        +++LDL+
Sbjct: 319 YLSSVSTSTIASLRAADPEATWVMQGWLFFSSETFWTNDRIEAYLGGAQGNDSMLILDLY 378

Query: 184 AEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA----- 238
           +E +P W+ +  ++G  ++WC LH++ GN+ + G L +I  GP+ A  S  ++M      
Sbjct: 379 SEAQPQWNRTDSYFGKQWVWCELHDYGGNMGLEGNLAAITEGPIAALNSNGSSMVGMGLT 438

Query: 239 -----------------------------WINQYSVRRY-GRSVPA-IQDAWNVLYHTVY 267
                                        W+ +++ RRY  +++P  +Q AW +L  T+Y
Sbjct: 439 MEGMEIGNEIVYDILLDQAWSSTPLNVSDWVAKWAARRYLVKTLPTELQQAWTILSTTIY 498

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           N  D             P+   +I S+ E +    G        L + T  +     + +
Sbjct: 499 NNQD-------------PNSQATIKSILELEPATTG--------LVNVTGHHPTEIPYDT 537

Query: 328 TSEVIRALELFI-ASGNELSASNT--YRYDLIDLTRQALAKYANELFLNIIEAYQLND-- 382
            + ++ AL+LF+ AS ++ S      +  D+++L+RQ +     +L+ ++I  +  +   
Sbjct: 538 NTTILHALQLFVNASKSQPSLKQVPEFAVDILELSRQLMVNRFIDLYTDLINTWNSSSST 597

Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-EWNARTQIT 441
           A  V       L L+ D+D LL  ++ +L   W+  AKQ A        Y E+ AR Q T
Sbjct: 598 AQNVTTAGVPLLSLISDLDVLLYTNENYLFSTWIADAKQWAHGNVSYAAYLEYQARNQQT 657

Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD 495
           +W          + DY +K  +GL+ +YY  R   +   + E   SG  +   +
Sbjct: 658 LWGPQGN-----INDYASKQTAGLVGEYYATRWQTFVVMLAEQKTSGQPYNATE 706


>gi|237708859|ref|ZP_04539340.1| glycoside hydrolase family 89 protein [Bacteroides sp. 9_1_42FAA]
 gi|345513372|ref|ZP_08792893.1| glycoside hydrolase family 89 protein [Bacteroides dorei 5_1_36/D4]
 gi|423228941|ref|ZP_17215347.1| hypothetical protein HMPREF1063_01167 [Bacteroides dorei
           CL02T00C15]
 gi|423242228|ref|ZP_17223337.1| hypothetical protein HMPREF1065_03960 [Bacteroides dorei
           CL03T12C01]
 gi|423247755|ref|ZP_17228803.1| hypothetical protein HMPREF1064_05009 [Bacteroides dorei
           CL02T12C06]
 gi|229457285|gb|EEO63006.1| glycoside hydrolase family 89 protein [Bacteroides sp. 9_1_42FAA]
 gi|345456211|gb|EEO47557.2| glycoside hydrolase family 89 protein [Bacteroides dorei 5_1_36/D4]
 gi|392631297|gb|EIY25272.1| hypothetical protein HMPREF1064_05009 [Bacteroides dorei
           CL02T12C06]
 gi|392635177|gb|EIY29082.1| hypothetical protein HMPREF1063_01167 [Bacteroides dorei
           CL02T00C15]
 gi|392639514|gb|EIY33330.1| hypothetical protein HMPREF1065_03960 [Bacteroides dorei
           CL03T12C01]
          Length = 717

 Score =  224 bits (571), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 163/538 (30%), Positives = 252/538 (46%), Gaps = 100/538 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P+SW  +Q  LQKKI+ R+ E G+ PVLP + G VP   +         
Sbjct: 187 MNNLEGWGGPNPESWYIRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 238

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V +DP + C+Y     L   D  F EI   + ++  K YG+T   Y  D F E
Sbjct: 239 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYKELTKLYGKTG-FYAIDPFHE 294

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             +T  V+    + + G AI   M+  + DAVW+ Q W        +     +++  +  
Sbjct: 295 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--------QDNPRTSMIEHLEA 342

Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           G L+VLDL +E +P W         K  YG   +++CML NF GNI ++G +D++  G  
Sbjct: 343 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYCMLLNFGGNIGLHGKMDALINGFY 402

Query: 228 EARTSEN-----------------------------------TTMAWINQYSVRRYGRSV 252
           +A+T  +                                   T   W+  Y   RYG   
Sbjct: 403 DAKTDNHAGKTLCGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 462

Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
            A+Q AW++L + +YN       +  +  V  A P +D          YQ          
Sbjct: 463 EALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 503

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                 SS+     +Y+  +VI A  L ++  ++   +N + +DL+D+ RQALA+    +
Sbjct: 504 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 558

Query: 371 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 429
              +  A++  D   VF+L S+ FL L+   D LL     F +G W+E+A+   Q +E++
Sbjct: 559 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDHLLGTRKEFKVGTWIEAARSAGQTQEEK 617

Query: 430 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
             YEWNAR QIT W +    +   LRDY +K W+G+L+D+Y  R   YF Y+   L+ 
Sbjct: 618 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYLACVLDG 675


>gi|212693694|ref|ZP_03301822.1| hypothetical protein BACDOR_03214 [Bacteroides dorei DSM 17855]
 gi|265755881|ref|ZP_06090348.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_33FAA]
 gi|212663753|gb|EEB24327.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides dorei DSM 17855]
 gi|263233959|gb|EEZ19560.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_33FAA]
          Length = 718

 Score =  224 bits (570), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 163/538 (30%), Positives = 252/538 (46%), Gaps = 100/538 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P+SW  +Q  LQKKI+ R+ E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPESWYIRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V +DP + C+Y     L   D  F EI   + ++  K YG+T   Y  D F E
Sbjct: 240 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYKELTKLYGKTG-FYAIDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             +T  V+    + + G AI   M+  + DAVW+ Q W        +     +++  +  
Sbjct: 296 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--------QDNPRTSMIEHLEA 343

Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           G L+VLDL +E +P W         K  YG   +++CML NF GNI ++G +D++  G  
Sbjct: 344 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYCMLLNFGGNIGLHGKMDALINGFY 403

Query: 228 EARTSEN-----------------------------------TTMAWINQYSVRRYGRSV 252
           +A+T  +                                   T   W+  Y   RYG   
Sbjct: 404 DAKTDNHAGKTLCGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 463

Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
            A+Q AW++L + +YN       +  +  V  A P +D          YQ          
Sbjct: 464 EALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 504

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                 SS+     +Y+  +VI A  L ++  ++   +N + +DL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 559

Query: 371 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 429
              +  A++  D   VF+L S+ FL L+   D LL     F +G W+E+A+   Q +E++
Sbjct: 560 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDHLLGTRKEFKVGTWIEAARSAGQTQEEK 618

Query: 430 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
             YEWNAR QIT W +    +   LRDY +K W+G+L+D+Y  R   YF Y+   L+ 
Sbjct: 619 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYLACVLDG 676


>gi|410096483|ref|ZP_11291470.1| hypothetical protein HMPREF1076_00648 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409226447|gb|EKN19356.1| hypothetical protein HMPREF1076_00648 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 718

 Score =  224 bits (570), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 170/587 (28%), Positives = 273/587 (46%), Gaps = 108/587 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ+ LQ++I+ R+ E G+ PV P +SG VP   +         
Sbjct: 187 MNNLEGWGGPNPDSWYKQQITLQQRIVKRMREYGIEPVFPGYSGMVPHNAKE-------- 238

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  TDP F EI   + ++  K YG+ ++ Y+ D F E
Sbjct: 239 KLG--LNV-SDPGLWCGYHRPAFLQPTDPRFQEIASLYYKELNKLYGK-ANFYSMDPFHE 294

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             +   VD    + + G AI   M+  +  AVW+ Q W        +      ++ ++  
Sbjct: 295 GGSVAGVD----LDAAGKAIMQAMKKNNPKAVWVAQAW--------QANPRSQMIENLKA 342

Query: 175 GKLVVLDLFAEVKPIWSTSKQ-------FYGVPYIWCMLHNFAGNIEMYGILDSI----- 222
           G ++VLDLF+E +P W   +        F    +I+CML N+ GN+ ++G +  +     
Sbjct: 343 GDMIVLDLFSESRPQWGDPESTWHRKDGFGQHDWIYCMLLNYGGNVGLHGKMAHVIDEYY 402

Query: 223 -----AFGP------VEARTSENTTM------------------AWINQYSVRRYGRSVP 253
                +FG       +    SEN  +                   W+  Y+V RYG++ P
Sbjct: 403 KAKESSFGKTLCGVGMTMEGSENNPVMFELLTELPWRPVHFDKNEWLKNYTVARYGKANP 462

Query: 254 AIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 311
            +Q+AW +L +++YNC    T +  +  +  A P   P ++S                  
Sbjct: 463 TVQEAWILLSNSIYNCPPENTQQGTHESIFCARPSDHPYLVSSW---------------- 506

Query: 312 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 371
             SE S Y      Y+  +VIRA  + ++  ++ + +N + YDL+D+ RQA+A+    L 
Sbjct: 507 --SEMSDY------YNPDDVIRAAAMMVSVADQFTGNNNFEYDLVDIVRQAIAE-KGRLV 557

Query: 372 LNIIEA-YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
             ++EA +   D       + RFL+L+   D LL     F +G W+   + L    E++ 
Sbjct: 558 EKVVEASFASGDKQLYNTAANRFLQLLLLQDELLGTRPEFKVGNWIARTRSLGNTPEEKD 617

Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
            YEWNAR QIT W +    +   LRDY +K W+G+L+D+Y  R   +F Y  E L+    
Sbjct: 618 LYEWNARVQITTWGNRNAADKGGLRDYAHKEWNGILKDFYYMRWKTWFDYQNELLDGKKP 677

Query: 491 FRLKDWRRE--WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
             +  +  E  W KLT+        Y  E  GD + T + ++ +  +
Sbjct: 678 TAIDFYALEEPWTKLTDS-------YSSEPEGDCISTVKRIFAEVFE 717


>gi|373460171|ref|ZP_09551927.1| hypothetical protein HMPREF9944_00191 [Prevotella maculosa OT 289]
 gi|371956556|gb|EHO74342.1| hypothetical protein HMPREF9944_00191 [Prevotella maculosa OT 289]
          Length = 742

 Score =  223 bits (569), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 155/529 (29%), Positives = 238/529 (44%), Gaps = 85/529 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGPLP SW  QQ  LQKKIL R++E GM PVLP F G +P   +       +T
Sbjct: 186 MNNLEGWGGPLPDSWYKQQETLQKKILQRMHEYGMEPVLPGFCGMMPHDAKEKL-GLNVT 244

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W              L  TD  F  I   +  +  + YG+ ++ Y+ D F E+   
Sbjct: 245 DGGKWNGYTRPAN------LSPTDSQFNRIADLYYAELTRLYGKANY-YSMDPFHESNDD 297

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D+ +Y    G+ +   M+  +  A W++QGW  + +P  RP  ++ + N    G L++L
Sbjct: 298 -DALDY-GKAGSVMLEAMKRINPKATWVIQGW--TENP--RPRMIQDMKN----GDLLIL 347

Query: 181 DLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA--FGPVEART- 231
           DLF+E +P      +W   K +    +++CML NF  N+ ++G +D +   F   + R+ 
Sbjct: 348 DLFSECRPMFGIPSVWKREKGYEQHDWLFCMLENFGANVGLHGRMDQLIHNFYSTKKRSP 407

Query: 232 --------------SENTTMA------------------WINQYSVRRYGRSVPAIQDAW 259
                         SEN  +                   W+  Y   RYGR    I+ AW
Sbjct: 408 NTQHLKGIGFTMEGSENNPVMFELMSELPWRPEIFKKEDWVRGYVKARYGRKDETIERAW 467

Query: 260 NVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS 317
            +L  T+YNC  G   +  +  V    P ++            N+            +  
Sbjct: 468 LLLAETIYNCPAGNNQQGPHESVFCGRPGLN------------NF------------QVK 503

Query: 318 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEA 377
           S+     +Y     + A  L  +  +    +N + YDLID+ RQALA      +L  I  
Sbjct: 504 SWSKMRNYYDPQATLEAARLMASVSSRYKGNNNFEYDLIDICRQALADQGRLQYLKTIAD 563

Query: 378 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNAR 437
           Y         + ++RFL+++   D LL     F LG W E+A+ L   + ++  YEWNAR
Sbjct: 564 YNGFSRAAFAKDAKRFLDMILLQDRLLGTRKEFRLGHWTEAARSLGTTQAEKDLYEWNAR 623

Query: 438 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 486
            QIT W + T  +   LRDY +K W G+L+D+Y  R  IY   + + +E
Sbjct: 624 VQITTWGNRTCADNGGLRDYAHKEWQGILKDFYYKRWKIYMDALAKQME 672


>gi|427385205|ref|ZP_18881710.1| hypothetical protein HMPREF9447_02743 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727373|gb|EKU90233.1| hypothetical protein HMPREF9447_02743 [Bacteroides oleiciplenus YIT
           12058]
          Length = 719

 Score =  223 bits (569), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 162/586 (27%), Positives = 263/586 (44%), Gaps = 109/586 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
           M+NL GWGGP P SW  QQ VLQKKIL R+ E G+ PV P +SG VP  A + +  +   
Sbjct: 190 MNNLEGWGGPNPDSWYAQQEVLQKKILKRMREYGIKPVFPGYSGMVPHDADEKLGLNLTK 249

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
           + L N F+  +         L  TD  F EI   +  +Q K +G+  + Y+ D F E  N
Sbjct: 250 SDLWNGFTRPA--------FLQPTDTRFAEIANLYYREQEKLFGKADY-YSMDPFHEAEN 300

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
              VD      + G AI   M+  +  A W++QGW  + +P  RP  ++ + N    G L
Sbjct: 301 AASVD----FDAAGKAIMQAMKKVNPKATWVVQGW--TENP--RPEMIENMKN----GDL 348

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGILDSI-- 222
           ++LDLF+E +P+W       G+P IW             CML NF GN+ ++G +D +  
Sbjct: 349 LILDLFSECRPMW-------GIPSIWKRDKGYEQHDWLFCMLLNFGGNVGLHGRMDQLLD 401

Query: 223 ---------------AFGPVEARTSENTTM-----------------AWINQYSVRRYGR 250
                            G     +  N  M                  W+  Y   RYG 
Sbjct: 402 NFYQTKDNPLATHLKGIGLTMEGSENNPVMFELMCELPWRPEKFTKEEWLKDYLFARYGV 461

Query: 251 SVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSK 308
               I+ AW +L +++YNC  G   +  +  +    P ++ +  + +  K +NY  P   
Sbjct: 462 KDEKIEKAWTLLANSIYNCPFGNNQQGPHESIFCGRPSMN-NFQASSWSKMKNYYDPTVT 520

Query: 309 EAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYAN 368
           E                        A  L +   ++   +N + YDL+D+ RQ+L+    
Sbjct: 521 E-----------------------EAARLMLEVADKYRGNNNFEYDLVDIVRQSLSDKGR 557

Query: 369 ELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 428
            ++   I  ++  D     + S++FL+++   D LL     F +G W+E A+ L    E+
Sbjct: 558 IVYNQTIADFKSFDKRSFARDSQKFLDILLLQDRLLGTRSEFRVGRWIEQARNLGTTPEE 617

Query: 429 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 488
           +  YEWNAR QIT W +    +   LRDY +K W+G+LRD+Y  R A Y++ + + L+  
Sbjct: 618 KDLYEWNARVQITTWGNRVCADDGGLRDYAHKEWNGILRDFYYKRWAAYWQTLQDQLDGK 677

Query: 489 DGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
              +L     ++  +   W   +  Y     G+++  ++ ++ K +
Sbjct: 678 PEVKL-----DYYAMEEPWTLAKTPYDSTPEGNSVDVAKEVFEKAM 718


>gi|156046298|ref|XP_001589681.1| hypothetical protein SS1G_09403 [Sclerotinia sclerotiorum 1980]
 gi|154693798|gb|EDN93536.1| hypothetical protein SS1G_09403 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 795

 Score =  223 bits (568), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 164/556 (29%), Positives = 267/556 (48%), Gaps = 102/556 (18%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G WGG LP SW+++Q +LQKKI+ R+ ELG+ PVLPAF+G VP+AL+ + P+A I
Sbjct: 207 FGNIQGSWGGTLPLSWIEEQHLLQKKIVKRMVELGITPVLPAFTGFVPSALRRIAPNANI 266

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
              G+W ++        T+L   TDPLF  +   F+  Q + YG  +HIY  D ++EN P
Sbjct: 267 INGGDWGNIFPVEYSNDTFLY-PTDPLFTTLQHKFLSFQSEYYGNVTHIYTLDQYNENNP 325

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-L 177
                 Y+ ++    Y  +QS D  AVW++QGWLF S   FW   +++A +  VP  + +
Sbjct: 326 ASGDLSYLRNVSRGTYESLQSFDPCAVWMLQGWLFYSLSSFWTQDRIEAYIGGVPKNESM 385

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSEN-- 234
           ++LDLF+E  P W  +  +YG P+IWC L ++ G + +YG + +I    +EA R SEN  
Sbjct: 386 LILDLFSESFPQWERTHYYYGKPWIWCQLRDYGGTLGLYGQIYNITNSLIEAFRESENMV 445

Query: 235 ---TTM------------------------------AWINQ-YSVRRYGRSVPA-IQDAW 259
               TM                              +W+ + Y ++   + +P  I +AW
Sbjct: 446 GVGNTMEGQGGNGLMYELLLDQAWNIDPIDTEDYFKSWVRKRYHIKGAKKRLPGEIYEAW 505

Query: 260 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 319
           ++L  T YN T+     +  V  +  ++ P+I        +N+G+          ++S+ 
Sbjct: 506 DILRRTAYNNTNLTLADS--VPKSLHELQPNIT-------ENHGR--------LGQSSTI 548

Query: 320 DHPHLWYSTSEVIRALELFI---ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 376
           D     Y   ++ RA EL      S  EL     +++D++D+TRQ LA+     ++ +IE
Sbjct: 549 DL----YDPDDLFRAWELLYNASVSVPELWEDKGWKFDMVDITRQVLAERFKLEYVELIE 604

Query: 377 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ--------------- 421
            Y+      +       + ++E +D +L+    F L  W+ +A                 
Sbjct: 605 KYK--KGADISCDGDILIGILESLDDVLSASPHFRLDTWVNAAVSSAPLPASTNCSSTSI 662

Query: 422 -------------LAQNEEQEKQ-YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 467
                        L  N    +Q + +NA  QIT+W    Q     + DY +K W GL+R
Sbjct: 663 NNSSLLFNSSTSILTSNLTPTQQFFAYNAINQITIWGPTGQ-----IDDYASKSWGGLVR 717

Query: 468 DYYGPRAAIYFKYMIE 483
            YY PR  ++ +Y+ E
Sbjct: 718 GYYLPRWKMFLEYIDE 733


>gi|423217398|ref|ZP_17203894.1| hypothetical protein HMPREF1061_00667 [Bacteroides caccae
           CL03T12C61]
 gi|392628557|gb|EIY22583.1| hypothetical protein HMPREF1061_00667 [Bacteroides caccae
           CL03T12C61]
          Length = 707

 Score =  223 bits (568), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 156/557 (28%), Positives = 251/557 (45%), Gaps = 95/557 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ  LQKKI+ R+ ELG+ PV P ++G VP  +          
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVSRMRELGIEPVFPGYAGMVPRNIGE-------- 247

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
           +LG  + +    +WC       L + D  F      + E+  K YG+  + Y+ D F E 
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSSEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 304

Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
            NT  VD    ++  G +I   M+  + +AVW++Q W  +  P        A+++ +  G
Sbjct: 305 GNTEGVD----LAKAGTSIMKAMKKANPEAVWVIQAWQANPRP--------AMIDVLNAG 352

Query: 176 KLVVLDLFAEVKPIWSTS-------KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
            ++VLDL++E +P W  S       K F    +++CML NF GN+ ++G ++ +  G  +
Sbjct: 353 DMLVLDLYSEKRPQWGDSDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412

Query: 229 A----------------------------------RTSENTTMAWINQYSVRRYGRSV-P 253
           A                                  R    +   W+  Y   RYG  + P
Sbjct: 413 ACAHVNGKRMRGVGATPEGIENNPVMFELLYELPWRAERFSPDVWLQGYLKARYGGELSP 472

Query: 254 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
            + +AW  L HTVYN                P   P      EG  ++     ++     
Sbjct: 473 EVMEAWRALEHTVYNA---------------PKNSPG-----EGTLESL--LCARPGFHL 510

Query: 314 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 373
             TS++ +  L+YS     +A +L ++   +    N + YDL+D+ RQ+ A   N L   
Sbjct: 511 DRTSTWGYSKLFYSPDSTSKAADLMLSVAEQYKGDNNFEYDLVDIVRQSNADKGNALLDE 570

Query: 374 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 433
           I ++Y   D     + +++FLEL+   D LL+    F +  WL +A+ L   + ++K YE
Sbjct: 571 ISQSYDRKDKENFRKQTQQFLELILSQDSLLSTRKEFSVSSWLAAARSLGNTDAEKKLYE 630

Query: 434 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE---SGDG 490
           WNA   IT+W D+       L DY ++ WSGLL+D Y  R   +F+   + LE   SG+ 
Sbjct: 631 WNASALITVWGDSIASNQGGLHDYSHREWSGLLKDLYYLRWKTFFEQKQQELEGKASGEV 690

Query: 491 FRLKDWRREWIKLTNDW 507
                  + W +  +++
Sbjct: 691 INFYGMEKAWAEKVHEY 707


>gi|423722278|ref|ZP_17696454.1| hypothetical protein HMPREF1078_00517 [Parabacteroides merdae
           CL09T00C40]
 gi|409242419|gb|EKN35181.1| hypothetical protein HMPREF1078_00517 [Parabacteroides merdae
           CL09T00C40]
          Length = 752

 Score =  223 bits (567), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 169/584 (28%), Positives = 249/584 (42%), Gaps = 102/584 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGPLP+SW+D+ +VL K+I+ R  ELGM P+   FSG VP  L+  +P AKI 
Sbjct: 192 MQNLQSYGGPLPKSWIDKHIVLGKQIIDRELELGMQPIQQGFSGYVPRELKEKYPDAKI- 250

Query: 61  QLGNWFSVKSDPRWC---CTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
                   +  P WC       LD TD LF  IGR F+E++ K YG    +Y  D F E+
Sbjct: 251 --------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEEEKKLYG-AHGVYAADPFHES 301

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
            PPVD+PEY+ ++G AI+      D +++W MQ W        R P +KA    VP   L
Sbjct: 302 QPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSL------REPIVKA----VPKENL 351

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG-------------------- 217
           ++LDL              +G P +   LHNF G I ++G                    
Sbjct: 352 LILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHGDLRLLASNQYVNAVKKNPNV 408

Query: 218 -----ILDSIAFGPV--------EARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYH 264
                 ++SI   PV             E     W+ +Y+ RRYG+       AW  L  
Sbjct: 409 CGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYADRRYGKPSENAHQAWLHLLE 468

Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
             Y    G     R  I+A     P++     G     G P                   
Sbjct: 469 GPYR--PGTNGTERSSIIA---ARPAVNVKKSGPNAGLGIP------------------- 504

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
            YS   V++A  L +     L  S+ YR+D++D+ RQ ++     +     EA++  D  
Sbjct: 505 -YSPLSVVQAEGLLLKDAGRLKGSDPYRFDIVDIQRQLMSNLGQAIHKQAAEAFRKKDKE 563

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
                S RFLE++ D D LL     F    WL  A+    N E++  +E +A   +T+W 
Sbjct: 564 AFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGDNSEEKDLFEKDATALVTVW- 622

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW----RREW 500
               +   L+ DY  + W+GL+  YY  R   ++  + + L++G  +  KD      RE 
Sbjct: 623 --GADGDPLIFDYSWREWTGLIDGYYLKRWEKFYAMLQDHLDAGTNYSEKDLPQTHGRES 680

Query: 501 IKLTN------DWQ-----NGRNVYPVESNGDALITSQWLYNKY 533
            +  +      DW+         V    + GD + T+  LY KY
Sbjct: 681 FRANDFYSTLGDWELQFVSTPDKVRTPITQGDEVETATRLYKKY 724


>gi|423312588|ref|ZP_17290525.1| hypothetical protein HMPREF1058_01137 [Bacteroides vulgatus
           CL09T03C04]
 gi|392688276|gb|EIY81565.1| hypothetical protein HMPREF1058_01137 [Bacteroides vulgatus
           CL09T03C04]
          Length = 717

 Score =  223 bits (567), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 166/532 (31%), Positives = 250/532 (46%), Gaps = 100/532 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P+SW  +Q  LQKKI+ R+ E G+ PVLP + G VP   +         
Sbjct: 187 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 238

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V +DP + C+Y     L   D  F EI   +  +  K YG+T   Y  D F E
Sbjct: 239 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 294

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             +T  V+    + + G AI   M+  + DAVW+ Q W    +P  R P    ++  +  
Sbjct: 295 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 342

Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           G L+VLDL +E +P W         K  YG   +++CML NF GNI ++G +D++  G  
Sbjct: 343 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYCMLLNFGGNIGLHGKMDALIDGFY 402

Query: 228 EA-------RTSENTTMA----------------------------WINQYSVRRYGRSV 252
           +A       RT     M                             W+  Y   RYG   
Sbjct: 403 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 462

Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
            A+Q AW++L + +YN       +  +  V  A P +D          YQ          
Sbjct: 463 EALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 503

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                 SS+     +Y+  +VI A  L ++  ++   +N + +DL+D+ RQALA+    +
Sbjct: 504 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 558

Query: 371 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 429
              +  A++  D   VF+L S+ FL L+   D LL     F +G W+E+A+   Q +E++
Sbjct: 559 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 617

Query: 430 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 481
             YEWNAR QIT W +    +   LRDY +K W+G+L+D+Y  R   YF Y+
Sbjct: 618 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYL 669


>gi|237721435|ref|ZP_04551916.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
 gi|293370838|ref|ZP_06617383.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
           3f]
 gi|229449231|gb|EEO55022.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
 gi|292634054|gb|EFF52598.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
           3f]
          Length = 711

 Score =  222 bits (566), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 151/517 (29%), Positives = 240/517 (46%), Gaps = 92/517 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ  LQKKI+ R+ ELG+ PV P ++G VP  +          
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVARMRELGIEPVFPGYAGMVPRNIGE-------- 247

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
           +LG  + +    +WC       L   D  F      + E+  K YG+ ++ Y+ D F E 
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKANY-YSMDPFHEG 304

Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
            NT  VD    ++  GA+I + M+  + +AVW++Q         W+    + ++ S+  G
Sbjct: 305 GNTEGVD----LAKTGASIMAAMKKANPEAVWIIQA--------WQANPREEMIASLNQG 352

Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
            L+VLDL++E +P       +W   K F    +++CML NF GN+ ++G ++ +  G  +
Sbjct: 353 DLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412

Query: 229 ARTSENTTM----------------------------------AWINQYSVRRYGRSV-P 253
           A    N  M                                   W+  Y   RYGR V P
Sbjct: 413 ACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWREERFSSDEWLQTYLKARYGREVSP 472

Query: 254 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
            I +AW  L +TVYN      D   +  +       S++    G + +            
Sbjct: 473 EIMEAWRALEYTVYNA---PKDYQGEGTIE------SLLCARPGFHLD------------ 511

Query: 314 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 373
             TS++ +  L+Y+     +A  LF +  ++   +N + YDL+D+ RQ+ A   N L   
Sbjct: 512 -RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNNFEYDLVDIVRQSNADKGNVLLEE 570

Query: 374 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 433
           I ++Y   D     + +++FL+L+   D LL+    F +  WL +A+ L   EE+++ YE
Sbjct: 571 ISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEFSVSSWLNAARSLGTTEEEKRLYE 630

Query: 434 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 470
           WNA   IT+W D+       L DY ++ WSGLL+D Y
Sbjct: 631 WNASALITVWGDSIAANQGGLHDYSHREWSGLLKDLY 667


>gi|395804724|ref|ZP_10483959.1| alpha-N-acetylglucosaminidase [Flavobacterium sp. F52]
 gi|395433112|gb|EJF99070.1| alpha-N-acetylglucosaminidase [Flavobacterium sp. F52]
          Length = 722

 Score =  222 bits (565), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 157/547 (28%), Positives = 255/547 (46%), Gaps = 79/547 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++   GPLPQ W  ++  LQKKIL R+  L M+PV+PAFSG VP A     P AKIT
Sbjct: 208 MGNINSLEGPLPQEWFVKKEALQKKILERMKALDMHPVVPAFSGYVPKAFAEKHPEAKIT 267

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L +W    S   +  T+LLD+ DPLF +IG+ FIE   K YG+ S+ Y  D+F+E  PP
Sbjct: 268 ELKSW----SGGGFASTFLLDSKDPLFKQIGKRFIEIYTKMYGK-SNFYLADSFNEIEPP 322

Query: 121 V---DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK 176
           V   +  E +S+ G+A+Y  +      AVW+MQGWLF  +  FW     KA L+ VP  K
Sbjct: 323 VSEHNKYEELSNYGSAVYETIDEAAPGAVWVMQGWLFGDNKEFWTKEATKAFLSKVPNEK 382

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD---------------- 220
           ++V D   +   +W   + FYG  + +  +HN+ G+  +YG L+                
Sbjct: 383 VMVQDYANDRYKVWENQEAFYGKQWTYGYVHNYGGSNPVYGDLNFYKDELASLLKNPNRG 442

Query: 221 -SIAFGPVEARTSENTTM-----------------AWINQYSVRRYGRSVPAIQDAWNVL 262
             + +G +    + N+ +                  W+ +Y   RYG++  ++  AW +L
Sbjct: 443 NIVGYGAMPEGLNNNSIVYEYIYDLPWTKAEQPLNDWMAKYLNARYGQTSESVFHAWELL 502

Query: 263 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 322
             +VYN     T    D   A+       + +TE K    G P  K              
Sbjct: 503 LKSVYNVKYWETRWWNDWAGAYLLFKRPTVKITEFK----GNPGDK-------------- 544

Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
                  ++  AL++      + + +N  +YDLID++R   +   +E  +  I+AYQ  +
Sbjct: 545 ------IKLKEALDILKKEAKKYNKNNLIQYDLIDVSRHYNSLSIDEELIECIKAYQEKN 598

Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
                QL ++  + V + D +++      L  W++SA     + E    Y  NA+T IT+
Sbjct: 599 IAKGDQLFKQIEKQVLETDKMMSGQPLNNLNQWVKSASDYGSSPEVSSLYAKNAKTLITL 658

Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKD 495
           W          L DY ++ W G+ + +Y PR  ++ + + ++  +   F        +K+
Sbjct: 659 WGGEGH-----LNDYASRSWKGMYKGFYWPRWKMFLEALKKAAVTNTSFDENKERESIKN 713

Query: 496 WRREWIK 502
           W   W +
Sbjct: 714 WEINWTE 720


>gi|154492110|ref|ZP_02031736.1| hypothetical protein PARMER_01741 [Parabacteroides merdae ATCC
           43184]
 gi|154087335|gb|EDN86380.1| Alpha-N-acetylglucosaminidase (NAGLU) [Parabacteroides merdae ATCC
           43184]
          Length = 752

 Score =  222 bits (565), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 165/584 (28%), Positives = 249/584 (42%), Gaps = 102/584 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGPLP+SW+D+ +VL K+I+ R  ELGM P+   FSG VP  L+  +P AKI 
Sbjct: 192 MQNLQSYGGPLPKSWIDKHIVLGKQIIDRELELGMQPIQQGFSGYVPRELKEKYPDAKI- 250

Query: 61  QLGNWFSVKSDPRWC---CTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
                   +  P WC       LD TD LF  IGR F+E++ K YG    +Y  D F E+
Sbjct: 251 --------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEEEKKLYG-AHGVYAADPFHES 301

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
            PPVD+PEY+ ++G AI+      D +++W MQ W             ++++ +VP   L
Sbjct: 302 QPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSLR----------ESIVKAVPKENL 351

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG-------------------- 217
           ++LDL              +G P +   LHNF G I ++G                    
Sbjct: 352 LILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHGDLRLLASNQYVNAVKKNPNV 408

Query: 218 -----ILDSIAFGPV--------EARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYH 264
                 ++SI   PV             E     W+ +Y+ RRYG+       AW  L  
Sbjct: 409 CGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYADRRYGKPSENAHQAWLHLLE 468

Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
             Y    G     R  I+A     P++     G     G P                   
Sbjct: 469 GPYR--PGTNGTERSSIIA---ARPAVNVKKSGPNAGLGIP------------------- 504

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
            YS   V++A  L +     L  S+ YR+D++D+ RQ ++     +     EA++  D  
Sbjct: 505 -YSPLSVVQAEGLLLKDAGRLKGSDPYRFDIVDIQRQLMSNLGQAIHKQAAEAFRKKDKE 563

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
                S RFLE++ D D LL     F    WL  A+    N E++  +E +A   +T+W 
Sbjct: 564 AFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGDNSEEKDLFEKDATALVTVW- 622

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW----RREW 500
               +   L+ DY  + W+GL+  YY  R   ++  + + L++G  +  KD      RE 
Sbjct: 623 --GADGDPLIFDYSWREWTGLIDGYYLKRWEKFYAMLQDHLDAGTNYSEKDLPQTHGRES 680

Query: 501 IKLTN------DWQ-----NGRNVYPVESNGDALITSQWLYNKY 533
            +  +      DW+         V    + GD + T+  LY KY
Sbjct: 681 FRANDFYSTLGDWELQFVSTPDKVRTPITQGDEVETATRLYKKY 724


>gi|346323119|gb|EGX92717.1| alpha-N-acetylglucosaminidase, putative [Cordyceps militaris CM01]
          Length = 742

 Score =  221 bits (564), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 151/526 (28%), Positives = 253/526 (48%), Gaps = 75/526 (14%)

Query: 10  PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVK 69
           PL  SW+DQQ  LQK+I+ R+ +LG+ P+LPAF G VP A   + P A + +   W  + 
Sbjct: 195 PLSLSWIDQQFALQKRIVARMVQLGITPILPAFPGFVPDAFARLRPGADLVRAPAWGGLP 254

Query: 70  SDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISS 129
           +D        L   D  + E+ R F+E Q++ YG  +++Y  D F+E  P   + +Y+S+
Sbjct: 255 ADSPNTRALFLSPLDDAYAELQRLFVEAQIEAYGNVTNVYAMDQFNEINPVSGATDYLSA 314

Query: 130 LGAAIYSGMQSGDSDAVWLMQGWLF--SYDPFWRPPQMKALLNSVP-LGKLVVLDLFAEV 186
           +    Y+ + + +  AVWLMQGWLF  S   FW   +++A L        +V+LDLF+E 
Sbjct: 315 VSRRSYAALAAANPAAVWLMQGWLFYLSEGNFWTQERIEAYLRGPEDRAGMVILDLFSET 374

Query: 187 KPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENT---------- 235
            P W  +  + G P+IWC +H+F GN  ++G + +    P+EA R S++           
Sbjct: 375 APQWQRTGSYAGRPWIWCQVHDFGGNQNLFGKITNTTVNPMEALRESDSMVGLGIATEAY 434

Query: 236 ---------------------TMAWINQYSVRRYG--RSVPA-IQDAWNVLYHTVYNCTD 271
                                T+++ + ++ RRY   R +PA +  AW +L  TVY+   
Sbjct: 435 EGNEVLYDLFFDQGWSATPIDTVSYFHDWTTRRYSGVRQLPASLYQAWELLRVTVYDY-- 492

Query: 272 GATDKNRDVI---VAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 328
               +  D+I   V+   ++P++  +       Y     K   L  + ++   P +W   
Sbjct: 493 ----RASDLIGVPVSVYQLEPNLTGL-------YNTTTGKPTALHYDPAAL--PPIW--- 536

Query: 329 SEVIRALELFIASGN---ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
                   LF+A+      L A   +R DL+D+ RQ L+     L+ +++ A+       
Sbjct: 537 -------RLFVAAAAAQPRLWAEPGFRLDLVDVMRQVLSNAFGRLYADLVAAFTGGAPPS 589

Query: 386 -VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
            + Q  +R   ++ D+D LLA    F L  WL +A+   ++  +     + AR+Q+T+W 
Sbjct: 590 EIAQRGQRMRAVLGDVDALLATQPHFSLRRWLNAARAWGESTGENAAIAYEARSQVTIWA 649

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
             T     LL DY  K WSGL+  YY  R  I+   ++++ E+  G
Sbjct: 650 PGT-----LLNDYAAKAWSGLIATYYDERWRIFVDRLVDAAENHGG 690


>gi|198277542|ref|ZP_03210073.1| hypothetical protein BACPLE_03764 [Bacteroides plebeius DSM 17135]
 gi|198270040|gb|EDY94310.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides plebeius DSM
           17135]
          Length = 722

 Score =  221 bits (564), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 163/577 (28%), Positives = 260/577 (45%), Gaps = 95/577 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
           M+NL GWGGP P SW  QQ  LQKKIL R+ E G+ PV P +SG VP  A + +  +   
Sbjct: 191 MNNLEGWGGPNPDSWYTQQEALQKKILKRMREYGIEPVFPGYSGMVPHDANKKLGLNVTE 250

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
             L N F+  +         L  TD  F EI   + ++  K +G+ ++ Y+ D F E  +
Sbjct: 251 PALWNGFTRPA--------FLLPTDSRFNEIASLYYKELEKLFGKANY-YSMDPFHELED 301

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
              VD      + G A+   M++ +  A W++QGW  + +P  RP  +K L N    G +
Sbjct: 302 AGSVD----FDAAGKAVLKAMKNVNPKATWVIQGW--TENP--RPEMIKNLNN----GDI 349

Query: 178 VVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSI-------AF 224
           ++LDLF+E +P      IW   K +    +++CM+ NF GN+ ++G +D +         
Sbjct: 350 LILDLFSECRPMWGIPSIWKREKGYEQHDWLFCMIENFGGNVGLHGRMDQLLNNFYLTKN 409

Query: 225 GPVEA---------------------------RTSENTTMAWINQYSVRRYGRSVPAIQD 257
            P+ A                           R  + T   W+  Y   RYG     I  
Sbjct: 410 NPLAAHLKGIGLTMEGSENNPVMFELMCELPWRPEKFTKEEWLKDYLFARYGVRDEKITQ 469

Query: 258 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 315
           AW++L   +YNC  G   +  +  +    P ++ +  + +  K QNY  P S EA     
Sbjct: 470 AWSILADGIYNCPFGNNQQGPHESIFCGRPGLN-NFQASSWSKMQNYYDPTSTEA----- 523

Query: 316 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 375
                             A  L +   ++   +N + YDL+D+ RQ+L+     ++   I
Sbjct: 524 ------------------AARLMLEVADKYKGNNNFEYDLVDIVRQSLSDRGRIVYNQTI 565

Query: 376 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWN 435
             ++  D       S+ FL ++   D LL     F +G W+E A+ L    E++  YEWN
Sbjct: 566 ADFKSFDKKSFATHSQEFLNILLAQDRLLGTRSEFRVGRWIEQARNLGTTPEEKDLYEWN 625

Query: 436 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD 495
           AR QIT W +        LRDY +K W+GLL+D+Y  R A Y++ + + L+      L  
Sbjct: 626 ARVQITTWGNRVCANDGGLRDYAHKEWNGLLKDFYYKRWAAYWQTLQDVLDGKPMVEL-- 683

Query: 496 WRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 532
              ++  +   W    N Y  +  GD +  ++ ++NK
Sbjct: 684 ---DYYAMEEPWTLAHNPYASQPEGDCVSVAKEVFNK 717


>gi|358378969|gb|EHK16650.1| glycoside hydrolase family 89 protein [Trichoderma virens Gv29-8]
          Length = 748

 Score =  221 bits (563), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 149/520 (28%), Positives = 258/520 (49%), Gaps = 73/520 (14%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G WGG +P+SW+D Q  LQ KIL R+ ELG+ P+LPAF G VP  +  VFP   ++ 
Sbjct: 203 NIQGSWGGSMPRSWVDSQFDLQLKILDRMEELGITPILPAFPGFVPRNISRVFPDISLST 262

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
              W +  ++        ++  DP F ++ + FI +Q + YG  ++ +  D F+EN P  
Sbjct: 263 SPIWSNFGTEL--SADIYINPFDPRFAQLQKLFISKQQELYGNVTNFWTLDQFNENQPLS 320

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGK-LVV 179
               Y+ ++    +S +++ D +AVW+MQ WLFS D  FW   ++++ L  +P+   +++
Sbjct: 321 GDLGYLQNVSHNTWSALKAADPEAVWVMQAWLFSSDSAFWTNDRIESFLGGIPVNSDMLL 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT---- 235
           LDLFAE  P W  +  FYG P+IWC LH++ GN+ +YG ++++    ++A  +  +    
Sbjct: 381 LDLFAESAPQWLRTNSFYGKPWIWCELHDYGGNMGLYGQIENVTINSMDAVRNSGSLVGF 440

Query: 236 ----------------------------TMAWINQYSVRRYG-RSVPAIQDAWNVLYHTV 266
                                       T  + + +   RYG ++V ++   W +L  TV
Sbjct: 441 GLTMEGQEGNEIMYDLLLDQAWSPKPIDTETYFHDWVSTRYGTKNVKSLYTGWELLRPTV 500

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           +N T+   +  +  I+   ++ PS   +  G+  ++G  ++                  Y
Sbjct: 501 FNNTNLTMNAVQKSIL---ELVPSTTGLL-GRVGHHGTTIT------------------Y 538

Query: 327 STSEVIRA-LELFIASGNELS--ASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
           + + ++ A  ELF A   ++    +  Y+YDL+D TRQ L      L+ +++ AY    +
Sbjct: 539 NPAVMVEAWTELFKAGLQDIKLFTNPAYQYDLVDWTRQVLVNSFEGLYKDLVAAYNSAAS 598

Query: 384 HGVFQLSR--RFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
             V + SR  + + L+  +D +LA ++ F L PW+  A+  A +       E+NAR QIT
Sbjct: 599 SSVIK-SRGAKLIALLRTLDAVLATNEHFQLTPWINEAR--ASSPSTADFLEYNARNQIT 655

Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 481
           +W      E     DY +K W+GL+  YY  R   +  Y+
Sbjct: 656 LWGPQGNIE-----DYASKQWAGLVGTYYVERWQQFIDYL 690


>gi|62088640|dbj|BAD92767.1| huntingtin interacting protein-1-related [Homo sapiens]
          Length = 449

 Score =  221 bits (563), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 134/413 (32%), Positives = 207/413 (50%), Gaps = 73/413 (17%)

Query: 142 DSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVP 200
           D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+VLDLFAE +P+++ +  F G P
Sbjct: 18  DTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQP 77

Query: 201 YIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM----------------------- 237
           +IWCMLHNF GN  ++G L+++  GP  AR   N+TM                       
Sbjct: 78  FIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAEL 137

Query: 238 -----------AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFP 285
                      AW+  ++ RRYG S P    AW +L  +VYNC+ +     NR  +V  P
Sbjct: 138 GWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRP 197

Query: 286 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 345
                                     L+  TS      +WY+ S+V  A  L + S   L
Sbjct: 198 S-------------------------LQMNTS------IWYNRSDVFEAWRLLLTSAPSL 226

Query: 346 SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLL 404
           + S  +RYDL+DLTRQA+ +  +  +     AY   +   + +       EL+  +D +L
Sbjct: 227 ATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELASLLRAGGVLAYELLPALDEVL 286

Query: 405 ACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSG 464
           A    FLLG WLE A+  A +E +   YE N+R Q+T+W      E ++L DY NK  +G
Sbjct: 287 ASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAG 341

Query: 465 LLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE 517
           L+ +YY PR  ++ + +++S+  G  F+   + +   +L   +   +  YP +
Sbjct: 342 LVANYYTPRWRLFLEALVDSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQ 394


>gi|410095990|ref|ZP_11290981.1| hypothetical protein HMPREF1076_00159 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409227396|gb|EKN20294.1| hypothetical protein HMPREF1076_00159 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 753

 Score =  221 bits (562), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 173/590 (29%), Positives = 251/590 (42%), Gaps = 114/590 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGPLP+SW+D    L KK++ R  ELGM P+   FSG VP  L+N +P AKI 
Sbjct: 193 MQNLQSYGGPLPKSWIDSHAELGKKVINRQLELGMQPIQQGFSGYVPRELKNKYPDAKI- 251

Query: 61  QLGNWFSVKSDPRWC---CTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
                   +  P WC       LD TD LF   GR F+E++ K +G    +Y  D F E+
Sbjct: 252 --------QLQPSWCGFTGAAQLDPTDSLFSAFGRDFLEEEKKLFG-AHGVYAADPFHES 302

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
            PP+D+PEY+S++G +IY   Q  D  A+W MQ W        R P +KA    VP   L
Sbjct: 303 RPPIDTPEYLSAVGNSIYKLFQDFDPSAIWAMQAWSL------REPIVKA----VPKEHL 352

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG-------------------- 217
           ++LDL        +T    +G P +   LHNF G I ++G                    
Sbjct: 353 LILDLNGGRSRQENTC---WGYPVVAGNLHNFGGRINLHGDLRLLASNQYAVAKQKSPNV 409

Query: 218 -----ILDSIAFGPV--------EARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYH 264
                 ++SI   PV             E     W+  Y+ RRYG +      AW  L  
Sbjct: 410 CGSGLFMESIEQNPVYYDLAFEMPLHADEVDIEEWLGDYAERRYGAASENAHKAWLHLLE 469

Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
             Y    G     R  I+A     P++     G     G P                   
Sbjct: 470 GPYR--PGTNGTERSSIIA---ARPALNVKKSGPNAGLGIP------------------- 505

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
            YS   VI+A  L +   ++L+AS  YR+D++D+ RQ ++     +     EA+   D  
Sbjct: 506 -YSPLLVIQAQGLLLKDADKLNASTPYRFDVVDIQRQLMSNLGQAIHKKAAEAFVKKDKA 564

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
                S RFLE++ D+D LL     F    WL  A+      E++   E +A   +T+W 
Sbjct: 565 AFTLHSNRFLEMLRDVDVLLRTRPEFNFDKWLTDARSWGTTNEEKDLLEKDATALVTVW- 623

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG--------------DG 490
               +   L+ DY  + W+GL+  YY  R   ++  + E L+ G              + 
Sbjct: 624 --GADGDPLIFDYSWREWTGLIDSYYLKRWEKFYAMLQEHLDEGNEYSEKGLPMTHGREA 681

Query: 491 FR-------LKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
           FR       L DW  E++  TN  +      P+ + GD + T+  +Y KY
Sbjct: 682 FRANDFYSELGDWELEFVSRTNKART-----PI-TQGDEIETALKMYKKY 725


>gi|294775488|ref|ZP_06741000.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
 gi|294450633|gb|EFG19121.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
          Length = 712

 Score =  221 bits (562), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 166/538 (30%), Positives = 252/538 (46%), Gaps = 100/538 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P+SW  +Q  LQKKI+ R+ E G+ PVLP + G VP   +         
Sbjct: 182 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 233

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V +DP + C+Y     L   D  F EI   +  +  K YG+T   Y  D F E
Sbjct: 234 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 289

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             +T  V+    + + G AI   M+  + DAVW+ Q W    +P  R P    ++  +  
Sbjct: 290 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 337

Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           G L+VLDL +E +P W         K  YG   +++CML NF GNI ++G +D++  G  
Sbjct: 338 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYCMLLNFGGNIGLHGKMDALIDGFY 397

Query: 228 EARTS-----------------EN------------------TTMAWINQYSVRRYGRSV 252
           +A+                   EN                  T   W+  Y   RYG   
Sbjct: 398 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 457

Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
            A+Q  W++L + +YN       +  +  V  A P +D          YQ          
Sbjct: 458 EALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 498

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                 SS+     +Y+  +VI A  L ++  ++   +N + +DL+D+ RQALA+    +
Sbjct: 499 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 553

Query: 371 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 429
              +  A++  D   VF+L S+ FL L+   D LL     F +G W+E+A+   Q +E++
Sbjct: 554 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 612

Query: 430 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
             YEWNAR QIT W +    +   LRDY +K W+G+L+D+Y  R   YF Y+   L+ 
Sbjct: 613 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYLACVLDG 670


>gi|150004413|ref|YP_001299157.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
 gi|149932837|gb|ABR39535.1| glycoside hydrolase family 89 [Bacteroides vulgatus ATCC 8482]
          Length = 717

 Score =  221 bits (562), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 165/532 (31%), Positives = 249/532 (46%), Gaps = 100/532 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P+SW  +Q  LQKKI+ R+ E G+ PVLP + G VP   +         
Sbjct: 187 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 238

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V +DP + C+Y     L   D  F EI   +  +  K YG+T   Y  D F E
Sbjct: 239 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 294

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             +T  V+    + + G AI   M+  + DAVW+ Q W    +P  R P    ++  +  
Sbjct: 295 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 342

Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           G L+VLDL +E +P W         K  YG   +++CML NF GNI ++G +D++  G  
Sbjct: 343 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYCMLLNFGGNIGLHGKMDALIDGFY 402

Query: 228 EA-------RTSENTTMA----------------------------WINQYSVRRYGRSV 252
           +A       RT     M                             W+  Y   RYG   
Sbjct: 403 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 462

Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
            A+Q  W++L + +YN       +  +  V  A P +D          YQ          
Sbjct: 463 EALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 503

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                 SS+     +Y+  +VI A  L ++  ++   +N + +DL+D+ RQALA+    +
Sbjct: 504 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 558

Query: 371 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 429
              +  A++  D   VF+L S+ FL L+   D LL     F +G W+E+A+   Q +E++
Sbjct: 559 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 617

Query: 430 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 481
             YEWNAR QIT W +    +   LRDY +K W+G+L+D+Y  R   YF Y+
Sbjct: 618 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYL 669


>gi|345519733|ref|ZP_08799147.1| glycoside hydrolase family 89 [Bacteroides sp. 4_3_47FAA]
 gi|345457107|gb|EET15964.2| glycoside hydrolase family 89 [Bacteroides sp. 4_3_47FAA]
          Length = 717

 Score =  220 bits (561), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 166/538 (30%), Positives = 251/538 (46%), Gaps = 100/538 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P+SW  +Q  LQKKI+ R+ E G+ PVLP + G VP   +         
Sbjct: 187 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 238

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V +DP + C+Y     L   D  F EI   +  +  K YG+T   Y  D F E
Sbjct: 239 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 294

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             +T  V+    + + G AI   M+  + DAVW+ Q W    +P  R P    ++  +  
Sbjct: 295 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 342

Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           G L+VLDL +E +P W         K  YG   +++CML NF GNI ++G +D++  G  
Sbjct: 343 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYCMLLNFGGNIGLHGKMDALIDGFY 402

Query: 228 EA-------RTSENTTMA----------------------------WINQYSVRRYGRSV 252
           +A       RT     M                             W+  Y   RYG   
Sbjct: 403 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 462

Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
            A+Q  W++L + +YN       +  +  V  A P +D          YQ          
Sbjct: 463 EALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 503

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                 SS+     +Y+  +VI A  L ++  ++   +N + +DL+D+ RQALA+    +
Sbjct: 504 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 558

Query: 371 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 429
              +  A++  D   VF+L S+ FL L+   D LL     F +G W+E+A+   Q +E++
Sbjct: 559 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 617

Query: 430 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
             YEWNAR QIT W +    +   LRDY +K W+G+L+D+Y  R   YF Y+   L+ 
Sbjct: 618 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYLACVLDG 675


>gi|319643377|ref|ZP_07998003.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_40A]
 gi|317385006|gb|EFV65959.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_40A]
          Length = 718

 Score =  220 bits (561), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 166/538 (30%), Positives = 251/538 (46%), Gaps = 100/538 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P+SW  +Q  LQKKI+ R+ E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V +DP + C+Y     L   D  F EI   +  +  K YG+T   Y  D F E
Sbjct: 240 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             +T  V+    + + G AI   M+  + DAVW+ Q W    +P  R P    ++  +  
Sbjct: 296 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 343

Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           G L+VLDL +E +P W         K  YG   +++CML NF GNI ++G +D++  G  
Sbjct: 344 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYCMLLNFGGNIGLHGKMDALIDGFY 403

Query: 228 EA-------RTSENTTMA----------------------------WINQYSVRRYGRSV 252
           +A       RT     M                             W+  Y   RYG   
Sbjct: 404 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 463

Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
            A+Q  W++L + +YN       +  +  V  A P +D          YQ          
Sbjct: 464 EALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 504

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                 SS+     +Y+  +VI A  L ++  ++   +N + +DL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 559

Query: 371 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 429
              +  A++  D   VF+L S+ FL L+   D LL     F +G W+E+A+   Q +E++
Sbjct: 560 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 618

Query: 430 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
             YEWNAR QIT W +    +   LRDY +K W+G+L+D+Y  R   YF Y+   L+ 
Sbjct: 619 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYLACVLDG 676


>gi|423212382|ref|ZP_17198911.1| hypothetical protein HMPREF1074_00443 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392694828|gb|EIY88054.1| hypothetical protein HMPREF1074_00443 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 705

 Score =  220 bits (561), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 155/527 (29%), Positives = 237/527 (44%), Gaps = 94/527 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ VLQKKI+ R+ ELG+ PV P ++G VP  +       +I 
Sbjct: 196 MNNLEGWGGPNPDSWYRQQEVLQKKIVARMRELGIEPVFPGYAGMVPRNIGEKL-GYQIA 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W S    PR      L   D  F      + E+  K YG+ ++ Y+ D F E  NT
Sbjct: 255 DPGKWCSF---PR---PAFLSTEDEHFESFAAMYYEELEKLYGKANY-YSMDPFHEGGNT 307

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             VD    ++  GA+I + M+  +  AVW++Q         W+    + +++S+  G ++
Sbjct: 308 EGVD----LAKTGASIMAAMKKANPKAVWVIQA--------WQANPREEMISSLNQGDML 355

Query: 179 VLDLFAEVKPIWS-------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
           VLDL++E  P W          K F    +++CML NF  N+ ++G +D +  G  +A  
Sbjct: 356 VLDLYSERLPQWGDPDSKWYREKGFGKHDWLYCMLLNFGANVGLHGRMDLLVNGYYDACA 415

Query: 232 SENTTM----------------------------------AWINQYSVRRYGRSV-PAIQ 256
             N                                      W+  Y   RYG+ V P + 
Sbjct: 416 HANGKTLRGVGATPEGIENNPVMFELLYELPWREERFSPDEWLQGYLKARYGKDVSPEVM 475

Query: 257 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS----KEAVL 312
           +AW  L HTVYN         RD                   YQ  G   S    +    
Sbjct: 476 EAWRALEHTVYNAP-------RD-------------------YQGEGTVESLLCARPGFH 509

Query: 313 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 372
              TS++ +  L+YS     +A  L  +   +   SN + YDL+D+ RQ+ A   N L  
Sbjct: 510 LDRTSTWGYAKLFYSPDSTAKAARLLTSVAKQYEGSNNFEYDLVDIVRQSNADKGNVLLE 569

Query: 373 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 432
           +I ++Y   D     + +++FL+L+   D LL+    F +  WL++A+ L   + ++K Y
Sbjct: 570 DISQSYDRKDKENFRKQTQQFLDLIVSQDSLLSTRKEFSVSTWLDAARSLGTTDAEKKLY 629

Query: 433 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 479
           EWNA   IT+W D+       L DY ++ WSG+L+D Y  R   +F+
Sbjct: 630 EWNASALITVWGDSIASNQGGLHDYSHREWSGILKDLYYQRWKAFFE 676


>gi|153807690|ref|ZP_01960358.1| hypothetical protein BACCAC_01972 [Bacteroides caccae ATCC 43185]
 gi|149130052|gb|EDM21264.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
           43185]
          Length = 707

 Score =  220 bits (561), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 156/564 (27%), Positives = 252/564 (44%), Gaps = 109/564 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ  LQKKI+ R+ ELG+ PV P ++G VP  +          
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVSRMRELGIEPVFPGYAGMVPRNIGE-------- 247

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
           +LG  + +    +WC       L + D  F      + E+  K YG+  + Y+ D F E 
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSSEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 304

Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
            NT  VD    ++  G +I   M+  + +AVW++Q W  +  P        A+++ +  G
Sbjct: 305 GNTEGVD----LAKAGTSIMKAMKKANPEAVWVIQAWQANPRP--------AMVDVLNAG 352

Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
            ++VLDL++E  P       +W   K F    +++CML NF GN+ ++G ++ +  G  +
Sbjct: 353 DMLVLDLYSERLPQWGDPDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412

Query: 229 ARTSENTTM----------------------------------AWINQYSVRRYGRSV-P 253
           A T  N                                      W+  Y   RYG  + P
Sbjct: 413 ACTHANGKTLRGVGTTPEGIENNPVMFELLYELPWRAERFSPDTWLQGYLKARYGGELSP 472

Query: 254 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
            + +AW  L HTVYN                               +NY    + E++L 
Sbjct: 473 EVMEAWRALEHTVYNAP-----------------------------KNYQGEGTVESLLC 503

Query: 314 S-------ETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 366
           +        TS++ +  L+YS     +A +L ++   +   +N + YDL+D+ RQ+ A  
Sbjct: 504 ARPGFHLDRTSTWGYSKLFYSPDSTSKAADLMLSVAEQYKGNNNFEYDLVDIVRQSNADK 563

Query: 367 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 426
            N L   I ++Y   D     + +++FLEL+   D LL+    F +  WL +A+ L   +
Sbjct: 564 GNALLDEISQSYDRKDKENFRKQTQQFLELILSQDSLLSTRKEFSVSSWLTAARSLGNTD 623

Query: 427 EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 486
            ++K YEWNA   IT+W D+       L DY ++ WSGLL+D Y  R   +F+   + LE
Sbjct: 624 AEKKLYEWNASALITVWGDSIASNQGGLHDYSHREWSGLLKDLYYLRWKTFFEQKQQELE 683

Query: 487 ---SGDGFRLKDWRREWIKLTNDW 507
              SG+        + W +  +++
Sbjct: 684 GKASGEVINFYGMEKAWAEKAHEY 707


>gi|358391826|gb|EHK41230.1| glycoside hydrolase family 89 protein [Trichoderma atroviride IMI
           206040]
          Length = 751

 Score =  220 bits (560), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 154/541 (28%), Positives = 262/541 (48%), Gaps = 72/541 (13%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G W G +P +W+D Q  LQ +IL R+ ELG+ P+LPAF G VP  +  VFP   ++ 
Sbjct: 204 NIQGSWNGNMPGNWVDDQFALQLQILDRMKELGITPILPAFPGFVPRNISRVFPGISLST 263

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
              W +   D     TY ++  DP F ++ + FI +Q + YG  +  +  D F+EN P  
Sbjct: 264 SPLWENFAEDLS-ADTY-VNPFDPHFTQLQKLFIGKQQELYGNVTKFWTLDQFNENQPLS 321

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPL-GKLVV 179
               Y+ ++    ++ ++S   DA+W+MQ WLFS D  FW    ++A L  +     +++
Sbjct: 322 SDLGYLRNVSQNTWTALKSASPDAIWVMQAWLFSADSSFWTNDAIEAFLGGITEDSDMLL 381

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT---- 235
           LDLFAE  P W  +  FYG P+IWC LH++ GN+ +YG ++++    ++A  + ++    
Sbjct: 382 LDLFAESAPQWLRTNSFYGKPWIWCELHDYGGNMGLYGQIENVTINAMQAVRNSSSLVGF 441

Query: 236 ----------------------------TMAWINQYSVRRYG-RSVPAIQDAWNVLYHTV 266
                                       T  + + +   RYG  +V ++   W +L  TV
Sbjct: 442 GLTMEGQEGNEIMYDLLLDQAWSPKPIDTETYFHDWVSARYGTENVKSLYTGWELLRPTV 501

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           +N T+   +     I+   ++ P+I  +  G+   +G  +           +YD   +  
Sbjct: 502 FNNTNLTVNAVPKSIL---ELTPNINGLL-GRVGRHGTTI-----------NYDPAVMVD 546

Query: 327 STSEVIRA-LELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN-DAH 384
           + +E+ +A LE     GN       Y+YDL+D TRQ L    + L+ +++ AY  + +A 
Sbjct: 547 AWTELFKAGLEDVKLFGNP-----AYQYDLVDWTRQVLVNSFDGLYKDLVTAYNSSANAA 601

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
            +     +   L++ +D +LA ++ F L  W+ +A+  A N       E+NAR Q+T+W 
Sbjct: 602 EIRSRGSKLTALLKTLDAVLATNENFQLATWIAAAR--ASNPSNTSFLEYNARNQVTLWG 659

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGF--RLKDWRRE 499
              Q E     DY +K W+GL+ DYY  R   +  Y+  +  S      F  +L+ W  +
Sbjct: 660 PTGQIE-----DYASKQWAGLVGDYYLGRWQQFIDYLATTKHSSYNQTAFYHKLQAWEIQ 714

Query: 500 W 500
           W
Sbjct: 715 W 715


>gi|374312699|ref|YP_005059129.1| alpha-N-acetylglucosaminidase [Granulicella mallensis MP5ACTX8]
 gi|358754709|gb|AEU38099.1| Alpha-N-acetylglucosaminidase [Granulicella mallensis MP5ACTX8]
          Length = 754

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 160/551 (29%), Positives = 245/551 (44%), Gaps = 96/551 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++ + GPLPQ +++++ +LQ+++L R+ ELGM PV PAF+G VP   + + P  +  
Sbjct: 221 MGNINHFAGPLPQHFMEEKRILQRQVLNRMRELGMKPVAPAFAGFVPQGFKRLHPEVETF 280

Query: 61  QLGNWF--SVKSDPRWCCTYLLD-ATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
            L  W     K+ PR   T++L      L+ +IG+ FIE+   EYG   + Y  DTF+E 
Sbjct: 281 TL-LWLRKEFKTIPRSTRTFILHPGQQELYRQIGKKFIEEYKAEYGEVEY-YLADTFNEL 338

Query: 118 TPPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVP 173
             PV      E +   G  ++  +Q+GD    W+MQGWLF YD  FW    ++ALL  +P
Sbjct: 339 EVPVREDHRYEDLERFGRTVFESIQAGDPKGTWVMQGWLFVYDSDFWNKESVEALLRGIP 398

Query: 174 LGKLVVLDLFAEVKPI---------WSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
             +++++D   ++ P          W   K F+G P+I  M H F GN  + G L  +A 
Sbjct: 399 NDRMLIIDYANDLAPSVQGKYLPGQWKLQKAFFGKPWINGMAHTFGGNNNIKGNLKLMAT 458

Query: 225 GPVEARTS----------------ENTTM------------------AWINQYSVRRYGR 250
            P     S                EN  +                   WI  Y   RYG 
Sbjct: 459 EPSTVLASPERGNLVGWGMCPEGIENNEVVYELMTDAGWQSEAIDLATWIPAYCRSRYGD 518

Query: 251 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
             PA+Q AW +L  + Y+     T   +    A P V P   SV  G       P  +  
Sbjct: 519 CPPAMQQAWELLLKSAYSSHIWMT---KQAWQAEPSVHPIAASVDAG-------PTFQ-- 566

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                                 RA+ELF++   +L+ S  YR DLI+   QA+    +E 
Sbjct: 567 ----------------------RAVELFLSCAPQLAKSELYRNDLIEFVSQAVGGRVDEA 604

Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
               ++A            + R +E +  +DGL+       L  W+++ +  A+ +++  
Sbjct: 605 LALAVQAGDAKQDEDAVAHAARAVEWMRRIDGLMNLRPDRRLETWMQATRAYAKTDDEAT 664

Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
            Y+ NAR  IT W          L DY ++ WSGL+RDYY  R   +F    ES  +G  
Sbjct: 665 FYDENARLLITTW------GWPELSDYASRVWSGLIRDYYAARWEAWF----ESRHTGRS 714

Query: 491 FRLKDWRREWI 501
           F L  W++ W+
Sbjct: 715 FSLDLWQQTWL 725


>gi|423345423|ref|ZP_17323112.1| hypothetical protein HMPREF1060_00784 [Parabacteroides merdae
           CL03T12C32]
 gi|409223209|gb|EKN16146.1| hypothetical protein HMPREF1060_00784 [Parabacteroides merdae
           CL03T12C32]
          Length = 752

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 168/584 (28%), Positives = 250/584 (42%), Gaps = 102/584 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGPLP+SW+D+ +VL K+I+ R  ELGM P+   FSG VP  L+  +P AKI 
Sbjct: 192 MQNLQSYGGPLPKSWIDKHIVLGKQIIDRELELGMQPIQQGFSGYVPRELKEKYPDAKI- 250

Query: 61  QLGNWFSVKSDPRWC---CTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
                   +  P WC       LD TD LF  IGR F+E++ K YG    +Y  D F E+
Sbjct: 251 --------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEEEKKLYG-AHGVYAADPFHES 301

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
            PPVD+PEY+ ++G AI+      D +++W MQ W        R P +KA    VP   L
Sbjct: 302 QPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSL------REPIVKA----VPKENL 351

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG-------------------- 217
           ++LDL              +G P +   LHNF G I ++G                    
Sbjct: 352 LILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHGDLRLLASNQYVNAVKKNPNV 408

Query: 218 -----ILDSIAFGPV--------EARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYH 264
                 ++SI   PV             E     W+ +Y+ RRYG+       AW  L  
Sbjct: 409 CGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYADRRYGKPSENAHQAWLHLLE 468

Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
             Y      T+  R  I+A     P++     G     G P                   
Sbjct: 469 GPYRPGTNGTE--RSSIIA---ARPAVNVKKSGPNAGLGIP------------------- 504

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
            YS   V++A  L +     L  S+ YR+D++D+ RQ ++     +     +A++  D  
Sbjct: 505 -YSPLSVVQAEGLLLKDAARLEDSDPYRFDIVDIQRQLMSNLGQVIHKQAAKAFRKKDKE 563

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
                S RFLE++ D D LL     F    WL  A+    N E++  +E +A   +T+W 
Sbjct: 564 AFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGDNSEEKDLFEKDATALVTVW- 622

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW----RREW 500
               +   L+ DY  + W+GL+  YY  R   ++  + + L++G  +  KD      RE 
Sbjct: 623 --GADGDPLIFDYSWREWTGLIDGYYLKRWEKFYAMLQDHLDAGTNYSEKDLPQTHGRES 680

Query: 501 IKLTN------DWQ-----NGRNVYPVESNGDALITSQWLYNKY 533
            +  +      DW+         V    + GD + T+  LY KY
Sbjct: 681 FRANDFYSTLGDWELQFVSTPDKVRTPITQGDEVETATRLYKKY 724


>gi|410097657|ref|ZP_11292638.1| hypothetical protein HMPREF1076_01816 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409223747|gb|EKN16682.1| hypothetical protein HMPREF1076_01816 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 740

 Score =  218 bits (555), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 160/584 (27%), Positives = 262/584 (44%), Gaps = 99/584 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  +GGPLP+SW+D  + L K+++ R  ELGM P+   FSG VP  +   FP AKI 
Sbjct: 195 MPNIESFGGPLPKSWIDSHIALGKQVVNRQLELGMTPIQQGFSGAVPRKMMEKFPEAKIQ 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +  +W+  +      C   LD  DPLF E+G+ F+E++ K YG T  +Y  D F E+ PP
Sbjct: 255 KQPDWYGFEG----ICQ--LDPLDPLFTELGKTFLEEEQKLYG-TYGLYAADPFHESKPP 307

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           VD+PEY++++G++I+  M++ D DA+W+MQ W F  D          + + VP   L+VL
Sbjct: 308 VDTPEYLNAVGSSIHKLMKTFDPDALWVMQAWSFRKD----------IASVVPKHDLLVL 357

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG----------------------- 217
            L   +         F    ++   LHNF G + ++G                       
Sbjct: 358 SLNGAL----GGEDHFCNHDFVVGNLHNFGGRVNLHGDLPLVSSNQFMKAKQKTPNVVGS 413

Query: 218 --ILDSIAFGPV------EARTSENTTM--AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
              ++SI   PV      E    +++     W+N+Y+ RRYG    A   AW +L    Y
Sbjct: 414 GLFMESIGQNPVFYELAFEMPVHQDSVKLEEWLNKYAERRYGAFSDAANKAWELLLAGPY 473

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
                  + +  +I A P VD        G    +  P   ++++++E            
Sbjct: 474 RAGTNGVESS-SIICARPAVDVK----KSGPNAGFNIPYDPQSLIEAEVC---------- 518

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
                      +    +L  S  YR+D++D+ RQ ++    E+     EA++  D     
Sbjct: 519 ----------LLQDAEQLKGSGPYRFDIVDVQRQIMSNLGQEIHKKAAEAFKKKDKEAFA 568

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
             S RFLEL++D+D LL     F    WL  A+     +E+   +E NA + +T+W    
Sbjct: 569 LHSGRFLELLKDVDILLRTRTEFNFDQWLTDARAWGTTDEERNLFEKNASSLVTIWGGQV 628

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR---------- 497
                   DY  + W+GL+  YY  R   ++  +   L++G  +R +D +          
Sbjct: 629 DVRQF---DYSWREWTGLIEGYYLQRWKQFYDMLQGHLDNGTIYREEDAKMDLGRQAFRA 685

Query: 498 REWIKLTNDWQ------NGRNVYPVESNGDALITSQWLYNKYLQ 535
            E+     DW+       G+   PV + GD +  ++ + +KY Q
Sbjct: 686 NEFYDSLADWELAFVDRPGKARTPV-TEGDEVAVARRMLDKYKQ 728


>gi|261880159|ref|ZP_06006586.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
 gi|270333130|gb|EFA43916.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
          Length = 772

 Score =  218 bits (555), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 155/543 (28%), Positives = 238/543 (43%), Gaps = 98/543 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGPLP +W  QQ  LQK+IL R  ELGM+PVLP + G +P   +       +T
Sbjct: 193 MNNLEGWGGPLPDAWYAQQEALQKRILKREKELGMSPVLPGYCGMMPHDAKAKL-GLDVT 251

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W              L ATDP F  I   +  +  + YG+  + Y+ D F E+  P
Sbjct: 252 DGGTWNGYTRPAN------LSATDPKFDHIADLYYRELTRLYGKADY-YSMDPFHES--P 302

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D+    +  G  + + M+  +  + W++QGW+ +  P         ++ ++P G +++L
Sbjct: 303 DDASVDYAEAGRKLLAAMKRANGKSNWVIQGWMENPRP--------QMIEALPEGDIIIL 354

Query: 181 DLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD------SIAFGPV- 227
           DLF+E +P      IW   + +    +++CML NF  N+ ++G +D       +A  P  
Sbjct: 355 DLFSECRPMFGAPSIWQRKEGYGRHNWLFCMLENFGANVGLHGRMDQLVHNFKLAASPST 414

Query: 228 ---EART-----------SENTTM---------------------------AWINQYSVR 246
               AR            SEN  +                            W   Y   
Sbjct: 415 PYQNARKHLKGIGFTMEGSENNPIMFELMSELVWRANDLVSAERDRRDFKEGWTRNYVKA 474

Query: 247 RYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGK 304
           RYG   P IQ+AW +L  ++YNC  G   +  +  +    P +D   +  +  K +NY  
Sbjct: 475 RYGIDNPKIQEAWQLLIGSIYNCPVGNNQQGPHESIFNGRPSLDNFQVK-SWSKMRNY-- 531

Query: 305 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 364
                                Y  +  +RA +L  +  +    +N + YDL+D+ RQA+ 
Sbjct: 532 ---------------------YDPNVTLRAAQLMTSVADRYRGNNNFEYDLVDIVRQAMD 570

Query: 365 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 424
             A   +L  I  Y+  D       S RFL ++   D LL     F LG  +E A+ L+ 
Sbjct: 571 DQARLQYLRTIADYKGFDRTAFSADSARFLNMLLLQDKLLGTRQEFRLGTRIEQARSLST 630

Query: 425 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
             E++  YEWNAR QIT W + T      LRDY +K W GLLRD+Y  R   Y   + + 
Sbjct: 631 TLEEKNLYEWNARVQITTWGNRTCANEGGLRDYAHKEWQGLLRDFYFMRWHTYLDALSKQ 690

Query: 485 LES 487
           + +
Sbjct: 691 MTA 693


>gi|262406054|ref|ZP_06082604.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
 gi|294648118|ref|ZP_06725661.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
 gi|294806859|ref|ZP_06765684.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
           CC 1b]
 gi|345510559|ref|ZP_08790126.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
 gi|229443271|gb|EEO49062.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
 gi|262356929|gb|EEZ06019.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
 gi|292636502|gb|EFF54977.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
 gi|294445888|gb|EFG14530.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
           CC 1b]
          Length = 718

 Score =  216 bits (551), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 158/551 (28%), Positives = 263/551 (47%), Gaps = 72/551 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP A     P  +  
Sbjct: 197 MGNLNKWDGPLSDTWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEAFAQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLTEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
           V+A  +        N+ ++  +G +   +++   V+Y  + +    +     D I    D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADMGWSS-----DSI----D 474

Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
           +D  +    E +Y  Y   + +   L  +T   S Y +P   + T               
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVVPDQRRISKIDLSD 534

Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
           + ++A+ L+ +  +EL +S  YR DLI+     LA  A   +   ++    N      + 
Sbjct: 535 DYLQAIRLYASCADELKSSELYRNDLIEFVSYYLAAKAENFYKQALKDDSENRVFAAQRN 594

Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
            ++ ++L+ D+D LLA H  + L  W+E A+      +++  YE NA+  IT W      
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVEFARNSGTTLQEKDAYEANAKRLITSW------ 648

Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
              +  DY  ++WSGL++DYY PR  +YF        + D  ++++W  +WI  T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSN 697

Query: 510 GRNVY--PVES 518
               +  PV++
Sbjct: 698 STTPFDDPVKA 708


>gi|329962235|ref|ZP_08300241.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
 gi|328530343|gb|EGF57220.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
          Length = 726

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 152/517 (29%), Positives = 246/517 (47%), Gaps = 93/517 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP   SW + ++ LQK+IL R+ E G++PVLP +SG +P   +       ++
Sbjct: 188 MNNLEGWGGPNTDSWYEDRIALQKRILKRMREYGIHPVLPGYSGMLPHNAKEKL-GVNVS 246

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
             G W          C Y     L  TD  F EI   + E+  + YG+ +  Y+ D F E
Sbjct: 247 DPGTW----------CGYNRPAFLQPTDTRFGEIAALYYEEMNRLYGK-ADFYSMDPFHE 295

Query: 117 NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
                     + + G AI+  M+    ++VW++Q W  +       P+ + ++ +VP G 
Sbjct: 296 GGKVAGVN--LDAAGQAIWQAMKKNSRNSVWVVQAWGAN-------PRAQ-MIKNVPRGD 345

Query: 177 LVVLDLFAEVKPIWSTSKQ-------FYGVPYIWCMLHNFAGNIEMYGILDSI------- 222
           ++VLDL++E +P W   +        F G  +++CML N+ GN+ ++G +  +       
Sbjct: 346 MLVLDLYSESRPQWGEPESSWYRENGFDGHQWLYCMLLNYGGNVGLHGKMQHVIDAYYKA 405

Query: 223 ---AFG------PVEARTSENTTM------------------AWINQYSVRRYGRSVPAI 255
              +FG       +    SEN  +                   W+  Y   RYG+  P +
Sbjct: 406 SRSSFGNTLKGVGMTMEGSENNPVMYELLCELPWRPSTFSKDEWLEGYIAARYGKCTPRL 465

Query: 256 QDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
           ++AW +L +++YNC   +T +  +  +  A P +                   + +A   
Sbjct: 466 REAWVLLGNSIYNCPPRSTQQGTHESIFCARPSLK------------------AYQASSW 507

Query: 314 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 373
           SE S Y  P       +VIRA  LF+    +   ++ + YDL+D+TRQA+A+    ++  
Sbjct: 508 SEMSDYYRPQ------DVIRAAGLFLEEAGQFKGNDNFEYDLVDITRQAVAEKGRLIYKV 561

Query: 374 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 433
           I  +Y+  D   + Q S RFLEL+   D LLA    F +G W+E A+ L     ++   E
Sbjct: 562 IQASYEAGDKPLLRQASDRFLELLLLQDRLLATRPEFKVGRWIEQARNLGHTPAEKDWLE 621

Query: 434 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 470
           WNAR QIT W + T  +   LRDY +K W+GLL+D+Y
Sbjct: 622 WNARVQITTWGNRTASDRGGLRDYAHKEWNGLLKDFY 658


>gi|422873453|ref|ZP_16919938.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens F262]
 gi|380305838|gb|EIA18115.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens F262]
          Length = 2104

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 167/563 (29%), Positives = 260/563 (46%), Gaps = 80/563 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +   P A+  
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 497 GNLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE    
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 604

Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
                   NT          MAW    IN       Y  RRYG++   I DAWN++  T 
Sbjct: 605 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILDTA 664

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y        K  D    +     SII+   G    +G        +KS  S++ H  + Y
Sbjct: 665 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 701

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   DA   
Sbjct: 702 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNRDAEKF 761

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
             +S +FLEL++  + +L+    FL+G W+E A+ + ++ +   +  +E+NAR  +T W 
Sbjct: 762 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 821

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                +   L+DY N+ WSGL  DYY  R   +   +   L+ G     K    +W K+ 
Sbjct: 822 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFKME 877

Query: 505 NDWQNGRN----VYPVESNGDAL 523
            DW N ++    +YP E++ + L
Sbjct: 878 YDWVNKKSDTDKLYPTEASNENL 900


>gi|423214208|ref|ZP_17200736.1| hypothetical protein HMPREF1074_02268 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392693153|gb|EIY86388.1| hypothetical protein HMPREF1074_02268 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 718

 Score =  216 bits (550), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 158/551 (28%), Positives = 263/551 (47%), Gaps = 72/551 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP A     P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEAFAQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAGYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNAMTGDLDMYASSS 431

Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
           V+A  +        N+ ++  +G +   +++   V+Y  + +        + D I    D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADM-----GWSSDSI----D 474

Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
           +D  +    E +Y  Y   + +   L  +T   S Y +P   + T               
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVVPDQRRISKIDLSD 534

Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
           + ++A+ L+ +  +EL +S  YR DLI+     +A  A   +   ++    N      + 
Sbjct: 535 DYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRN 594

Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
            ++ ++L+ D+D LLA H  + L  W+E A+      +++  YE NA+  IT W      
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648

Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
              +  DY  ++WSGL++DYY PR  +YF        + D  ++++W  +WI  T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSN 697

Query: 510 GRNVY--PVES 518
               +  PVE+
Sbjct: 698 STTPFDDPVEA 708


>gi|429766730|ref|ZP_19298977.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
           celatum DSM 1785]
 gi|429183354|gb|EKY24416.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
           celatum DSM 1785]
          Length = 2284

 Score =  216 bits (549), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 159/563 (28%), Positives = 256/563 (45%), Gaps = 83/563 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  +GGPLP +W + ++ L +++  R+  LG+ PVL  +SG VP   Q   P A+I 
Sbjct: 378 MQNMTSFGGPLPDNWFEDRVELGRQLHERMQTLGIKPVLQGYSGMVPLDFQKKNPDAQIL 437

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ D     F E+   F E+Q + YG  +  Y  D F E  NT
Sbjct: 438 SQGGWCGFDR-PNMLKTYVNDGERDYFQEVADVFYEKQKEVYGDITDYYAVDPFHEGGNT 496

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +DS    + +   I   M   D DA+W++Q W  + D      ++  L N     + +
Sbjct: 497 GGMDS----ARIYGTIQDKMIEHDEDAIWVIQHWQGNPDN----TKLSGLTNK---EQAL 545

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTT- 236
           +LDL +++ P + T      +P++W MLHNF G + + G ++++A    EA  T+EN   
Sbjct: 546 ILDLNSDLNPDY-TRFDNQDIPWVWNMLHNFGGRMGLDGQVETVATSITEALATTENMKG 604

Query: 237 ------------------------------MAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           W+N Y  RRYG       +AW +L  T 
Sbjct: 605 IGITPEALANSPIVYELMGDMIWTRDPINYREWVNNYIERRYGAVNEDAIEAWEILLETA 664

Query: 267 YNCTD----GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 322
           Y  +D    GA +              SII+             ++ A   +  S++ H 
Sbjct: 665 YKTSDYYYQGAAE--------------SIIN-------------ARPATSINSASTWGHS 697

Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
            + Y   E+ RA+ELFI+  +EL  S+ + YD +D+T+Q LA  A E    ++ AY   D
Sbjct: 698 KISYDKKELERAMELFISCYDELKDSDAFVYDFLDVTKQVLANSAQEYHKEMVAAYNSGD 757

Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQI 440
           A    ++S  FL+L+   + +L+    FL+G W+E ++ +  + +   +  +E+NAR  I
Sbjct: 758 AEKFERISEHFLDLIRLQERVLSTSPEFLVGTWIEQSRTMLADADDWTKDLFEFNARALI 817

Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 500
           T W D        L+DY N+ W+GL  D Y  R  ++   +   LE+G      DW +  
Sbjct: 818 TTWGDYKN---GSLKDYSNRQWAGLTEDLYLKRWEMWIDGIRTELETGVTAPSIDWHKVE 874

Query: 501 IKLTNDWQNGRNVYPVESNGDAL 523
            +   +  +  N YP E +G+ L
Sbjct: 875 YEWATEKTDESNAYPTEGSGEDL 897


>gi|326934230|ref|XP_003213195.1| PREDICTED: hypothetical protein LOC100549752 [Meleagris gallopavo]
          Length = 650

 Score =  215 bits (548), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 102/218 (46%), Positives = 143/218 (65%), Gaps = 7/218 (3%)

Query: 33  LGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 92
           LGM  VLPAF+G+VP  +  VFP    T+LGNW     D    C YLL   +P+F  IG 
Sbjct: 4   LGMTTVLPAFAGHVPPGVLRVFPRINATRLGNWSHF--DCTLSCAYLLSPEEPMFQVIGT 61

Query: 93  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 152
            F+++ +KE+G T HIY+ DTF+E +P    P Y++ +  A++  M   D +A WLMQGW
Sbjct: 62  LFLKELIKEFG-TDHIYSADTFNEMSPLSSDPAYLAGITNAVFRAMTGADPEAQWLMQGW 120

Query: 153 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 211
           LF + P FW+PPQ++A+L +VPLG+++VLDLFAE KP++  ++ FYG P+IWCMLHNF G
Sbjct: 121 LFQHQPAFWQPPQVQAVLRAVPLGRMIVLDLFAESKPVYEWTESFYGQPFIWCMLHNFGG 180

Query: 212 NIEMYGILDSIAFGPVEARTSENTTMA---WINQYSVR 246
           N  ++G +++I  GP  AR   N+TM     I  Y +R
Sbjct: 181 NHGLFGAVEAINRGPFVARRFPNSTMVLRELIEPYELR 218


>gi|160884066|ref|ZP_02065069.1| hypothetical protein BACOVA_02042 [Bacteroides ovatus ATCC 8483]
 gi|423291473|ref|ZP_17270321.1| hypothetical protein HMPREF1069_05364 [Bacteroides ovatus
           CL02T12C04]
 gi|156110408|gb|EDO12153.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
           8483]
 gi|392663473|gb|EIY57023.1| hypothetical protein HMPREF1069_05364 [Bacteroides ovatus
           CL02T12C04]
          Length = 718

 Score =  215 bits (548), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 158/551 (28%), Positives = 263/551 (47%), Gaps = 72/551 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP A     P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEAFAQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
           V+A  +        N+ ++  +G +   +++   V+Y  + +        + D I    D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADM-----GWSSDSI----D 474

Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
           +D  +    E +Y  Y   + +   L  +T   S Y +P   + T               
Sbjct: 475 LDDWMTIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVIPDQRRISKIDLSD 534

Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
           + ++A+ L+ +  +EL +S  YR DLI+     +A  A   +   ++    N      + 
Sbjct: 535 DYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRN 594

Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
            ++ ++L+ D+D LLA H  + L  W+E A+      +++  YE NA+  IT W      
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648

Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
              +  DY  ++WSGL++DYY PR  +YF        + D  ++++W  +WI  T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWIN 697

Query: 510 GRNVY--PVES 518
               +  PVE+
Sbjct: 698 STTPFDDPVEA 708


>gi|295085513|emb|CBK67036.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
           XB1A]
          Length = 718

 Score =  215 bits (548), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 158/551 (28%), Positives = 262/551 (47%), Gaps = 72/551 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP       P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +WST +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWSTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
           V+A  +        N+ ++  +G +   +++   V+Y  + +    +     D I    D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADMGWSS-----DSI----D 474

Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
           +D  +    E +Y  Y   + +   L  +T   S Y +P   + T               
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVIPDQRRISKIDLSD 534

Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
           + ++A+ L+ +  +EL +S  YR DLI+     +A  A   +   ++    N      + 
Sbjct: 535 DYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRN 594

Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
            ++ ++L+ D+D LLA H  + L  W+E A+      +++  YE NA+  IT W      
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648

Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
              +  DY  ++WSGL++DYY PR  +YF        + D  ++++W  +WI  T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSN 697

Query: 510 GRNVY--PVES 518
               +  PVE+
Sbjct: 698 STTPFDDPVEA 708


>gi|453081268|gb|EMF09317.1| glycoside hydrolase family 89 protein [Mycosphaerella populorum
           SO2202]
          Length = 784

 Score =  215 bits (548), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 152/532 (28%), Positives = 253/532 (47%), Gaps = 64/532 (12%)

Query: 8   GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFS 67
           GG LPQSW+DQQ  L + I+ R+ ELGM PVLP F+G VP  +  ++P+A       W  
Sbjct: 218 GGDLPQSWIDQQFELNQLIIARMIELGMTPVLPCFTGFVPTQISRLYPNASFVNGSQWNG 277

Query: 68  VKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYI 127
            ++  ++     L+  DPLF  + ++FI +    YG  S +Y  D ++EN P   +  Y+
Sbjct: 278 FQA--QYTNVTFLEPFDPLFTTLQKSFISKLDAAYGNVSSVYTLDQYNENDPFSGNVTYL 335

Query: 128 SSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVVLDLFAEV 186
             + +     +++ D +A+W +QGWLF S   FW   ++KA L  V    +++LDLF+E 
Sbjct: 336 EDVASNTIKSLKAADPEAIWFIQGWLFYSAADFWDEERIKAYLGGVEDKDMLILDLFSES 395

Query: 187 KPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAWINQYSVR 246
           +P W  +  ++G P+IWC LH++ GN  ++G ++++   P+ A  +E +TM  I      
Sbjct: 396 QPQWQRTNSYFGKPWIWCQLHDYGGNQGLHGQVENVTMNPILALANETSTMVGIGLTMEG 455

Query: 247 RYGRSV---PAIQDAW-----------NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSII 292
           + G  +     +  AW           +    + Y+C D      +D+ +A+  +  +I 
Sbjct: 456 QEGNEIIYDILLDQAWTPEPIESAGYFDDWVTSRYHCDDAVAGLPQDLYIAWDMMRQTIY 515

Query: 293 SVTEGKYQNYGKPVSKEAVLKS------------ETSSYDHPHLWYSTSEVIRALELFIA 340
           + T+          + EAV KS            + + +    + Y    ++ A + F +
Sbjct: 516 NNTDID--------TAEAVTKSIFELQPNTTGLLDRTGHHSTRILYDPEILVSAWKHFYS 567

Query: 341 SGNE---LSASNTYRYDLIDLTRQALAKYANEL---FLNIIE--AYQLNDAHGVFQLSRR 392
           +  E   L    +YR+DL+D+TRQ LA     L   F+N+    +   +      Q   R
Sbjct: 568 ASQETPQLWELESYRFDLVDITRQVLANAFYPLYGEFVNMTANSSLPSSSTASAEQTGAR 627

Query: 393 FLELVEDMDGLLACHDG--FLLGPWLESAKQLAQNEEQEKQ------------YEWNART 438
            L L+ D+D +L       F L  W+ SA+  A  E                 YE+NAR 
Sbjct: 628 MLSLLLDLDSVLEASGNAHFSLESWIHSARLWAPTETNAADGDNMTAAAIADFYEYNARN 687

Query: 439 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
           QIT+W    +     + DY +K W+GL++ YY PR   +  + + S  S DG
Sbjct: 688 QITLWGPGGE-----ISDYASKQWAGLIKTYYVPRWERFVHFTLNSSTSADG 734


>gi|146300873|ref|YP_001195464.1| alpha-N-acetylglucosaminidase [Flavobacterium johnsoniae UW101]
 gi|146155291|gb|ABQ06145.1| Candidate alpha-glycosidase; Glycoside hydrolase family 89
           [Flavobacterium johnsoniae UW101]
          Length = 723

 Score =  215 bits (548), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 157/548 (28%), Positives = 251/548 (45%), Gaps = 80/548 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++   GPLPQ W  ++  LQKKIL R+  L M+PV+PAFSG VP A     P AKIT
Sbjct: 208 MGNINSLEGPLPQEWFSKKEELQKKILERMRTLDMHPVVPAFSGYVPKAFAEKHPEAKIT 267

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L +W    S   +  T+LLD+ DPLF +IG+ FIE   K YG+ S+ Y  D+F+E  PP
Sbjct: 268 ELNSW----SGGGFESTFLLDSKDPLFKKIGKRFIEIYTKMYGK-SNFYLADSFNEIEPP 322

Query: 121 V---DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK 176
           V   +  E +++ G+AIY  ++     AVW+MQGWLF  +  FW      A L+ VP  +
Sbjct: 323 VTEHNKYEELANYGSAIYETIEEAAPGAVWVMQGWLFGDNKNFWTKEATSAFLSKVPNDR 382

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGIL----------------- 219
           L+V D   +   +W   + FYG  + +  +HN+ G+  +YG L                 
Sbjct: 383 LMVQDYANDRYKVWENQEAFYGKQWTYGYVHNYGGSNPVYGDLNFYKNELVSLLKNPHRG 442

Query: 220 DSIAFGPVEA-----------------RTSENTTMAWINQYSVRRY-GRSVPAIQDAWNV 261
           + + +G +                      E +   W+  Y   RY  ++  ++  AW +
Sbjct: 443 NVVGYGAMPEGLNNNAIVYEFIYDLPWSKGEQSVKDWLTNYLNARYEQKTSDSVFKAWEL 502

Query: 262 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
           L  +VY+     T    D   A+        ++TE K    G P  K+            
Sbjct: 503 LLESVYSTKYWETRWWNDRAGAYLLFKRPTATITEFK----GNPGDKD------------ 546

Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
                   ++  AL++  A   +    N  +YDLID +R   +   +E  +  ++AYQ  
Sbjct: 547 --------KLKEALDILKAEAKKYDKKNFIQYDLIDASRHYYSLSIDEDLVECVKAYQQK 598

Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
           D     QL ++  + V ++D  ++      L  W++SA +     E  K Y  NA+T IT
Sbjct: 599 DITKGDQLFKKIEKQVLEIDKSMSGQPLNSLNYWVKSASEYGSTPEVSKLYVKNAKTLIT 658

Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESL-------ESGDGFRLK 494
           +W          L DY ++ W G+ + +Y PR  ++     ++        E+ +   +K
Sbjct: 659 LWGGEGH-----LNDYASRSWQGMYKGFYWPRWKMFLTAFKKTAVNNTPFDETKEREEIK 713

Query: 495 DWRREWIK 502
           +W  +W K
Sbjct: 714 NWEIKWTK 721


>gi|293371915|ref|ZP_06618319.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
           3f]
 gi|292633161|gb|EFF51738.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
           3f]
          Length = 718

 Score =  214 bits (545), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 157/551 (28%), Positives = 262/551 (47%), Gaps = 72/551 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP       P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
           V+A  +        N+ ++  +G +   +++   V+Y  + +    +     D I    D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADMGWSS-----DSI----D 474

Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
           +D  +    E +Y  Y   + +   L  +T   S Y +P   + T               
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVISDQRRISKIDLSD 534

Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
           + ++A+ L+ +  +EL +S  YR DLI+     +A  A   +   ++    N      + 
Sbjct: 535 DYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRN 594

Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
            ++ ++L+ D+D LLA H  + L  W+E A+      +++  YE NA+  IT W      
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648

Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
              +  DY  ++WSGL++DYY PR  +YF        + D  ++++W  +WI  T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSN 697

Query: 510 GRNVY--PVES 518
               +  PVE+
Sbjct: 698 STTPFDDPVEA 708


>gi|336404352|ref|ZP_08585050.1| hypothetical protein HMPREF0127_02363 [Bacteroides sp. 1_1_30]
 gi|335943680|gb|EGN05519.1| hypothetical protein HMPREF0127_02363 [Bacteroides sp. 1_1_30]
          Length = 718

 Score =  214 bits (545), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 157/551 (28%), Positives = 262/551 (47%), Gaps = 72/551 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP       P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
           V+A  +        N+ ++  +G +   +++   V+Y  + +    +     D I    D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADMGWSS-----DSI----D 474

Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
           +D  +    E +Y  Y   + +   L  +T   S Y +P   + T               
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVISDQRRISKIDLSD 534

Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
           + ++A+ L+ +  +EL +S  YR DLI+     +A  A   +   ++    N      + 
Sbjct: 535 DYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRN 594

Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
            ++ ++L+ D+D LLA H  + L  W+E A+      +++  YE NA+  IT W      
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648

Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
              +  DY  ++WSGL++DYY PR  +YF        + D  ++++W  +WI  T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSN 697

Query: 510 GRNVY--PVES 518
               +  PVE+
Sbjct: 698 STTPFDDPVEA 708


>gi|299144719|ref|ZP_07037787.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
 gi|298515210|gb|EFI39091.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
          Length = 718

 Score =  214 bits (545), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 157/551 (28%), Positives = 262/551 (47%), Gaps = 72/551 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP       P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKADKEAKYKLLAEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
           V+A  +        N+ ++  +G +   +++   V+Y  + +        + D I    D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADM-----GWSSDSI----D 474

Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
           +D  +    E +Y  Y   + +   L  +T   S Y +P   + T               
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVVPDQRRISKIDLSD 534

Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
           + ++A+ L+ +  +EL +S  YR DLI+     +A  A   +   ++    N      + 
Sbjct: 535 DYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRN 594

Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
            ++ ++L+ D+D LLA H  + L  W+E A+      +++  YE NA+  IT W      
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648

Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
              +  DY  ++WSGL++DYY PR  +YF        + D  ++++W  +WI  T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSN 697

Query: 510 GRNVY--PVES 518
               +  PVE+
Sbjct: 698 STTPFDDPVEA 708


>gi|422345314|ref|ZP_16426228.1| hypothetical protein HMPREF9476_00301 [Clostridium perfringens
           WAL-14572]
 gi|373228039|gb|EHP50349.1| hypothetical protein HMPREF9476_00301 [Clostridium perfringens
           WAL-14572]
          Length = 1842

 Score =  214 bits (544), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 165/563 (29%), Positives = 259/563 (46%), Gaps = 80/563 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +   P A+  
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE    
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 604

Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
                   NT          MAW    IN       Y  RRYG++   I DAWN++  T 
Sbjct: 605 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILDTA 664

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y        K  D    +     SII+   G    +G        +KS  S++ H  + Y
Sbjct: 665 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 701

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +    
Sbjct: 702 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 761

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
             +S +FLEL++  + +L+    FL+G W+E A+ + ++ +   +  +E+NAR  +T W 
Sbjct: 762 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 821

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                +   L+DY N+ WSGL  DYY  R   +   +   L+ G     K    +W K+ 
Sbjct: 822 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFKME 877

Query: 505 NDWQNGRN----VYPVESNGDAL 523
            DW N ++    +YP E++ + L
Sbjct: 878 YDWVNKKSDTDKLYPTEASNENL 900


>gi|298480124|ref|ZP_06998323.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
 gi|298273933|gb|EFI15495.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
          Length = 718

 Score =  214 bits (544), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 157/551 (28%), Positives = 262/551 (47%), Gaps = 72/551 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP       P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAGYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
           V+A  +        N+ ++  +G +   +++   V+Y  + +        + D I    D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADM-----GWSSDSI----D 474

Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
           +D  +    E +Y  Y   + +   L  +T   S Y +P   + T               
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVIPDQRRISKIDLSD 534

Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
           + ++A+ L+ +  +EL +S  YR DLI+     +A  A   +   ++    N      + 
Sbjct: 535 DYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRN 594

Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
            ++ ++L+ D+D LLA H  + L  W+E A+      +++  YE NA+  IT W      
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648

Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
              +  DY  ++WSGL++DYY PR  +YF        + D  ++++W  +WI  T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSN 697

Query: 510 GRNVY--PVES 518
               +  PVE+
Sbjct: 698 STTPFDDPVEA 708


>gi|237719039|ref|ZP_04549520.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
 gi|229451817|gb|EEO57608.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
          Length = 718

 Score =  214 bits (544), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 156/551 (28%), Positives = 262/551 (47%), Gaps = 72/551 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP       P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
           V+A  +        N+ ++  +G +   +++   V+Y  + +        + D I    D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADM-----GWSSDSI----D 474

Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
           +D  +    E +Y  Y   + +   L  +T   S Y +P   + T               
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVIPDQRRISKIDLSD 534

Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
           + ++A+ L+++  +EL  S  YR DLI+     +A  A   +   ++    N      + 
Sbjct: 535 DYLQAIRLYVSCADELKGSELYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRN 594

Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
            ++ ++L+ D+D LLA H  + L  W+E A+      +++  YE NA+  IT W      
Sbjct: 595 LQQTVDLLMDVDKLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648

Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
              +  DY  ++WSGL++DYY PR  +YF        + D  ++++W  +WI  T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSN 697

Query: 510 GRNVY--PVES 518
               +  PV++
Sbjct: 698 STTPFDDPVKA 708


>gi|168216263|ref|ZP_02641888.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens NCTC 8239]
 gi|182381741|gb|EDT79220.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens NCTC 8239]
          Length = 2104

 Score =  214 bits (544), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 165/563 (29%), Positives = 260/563 (46%), Gaps = 80/563 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +   P A+  
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEEEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L      G+ +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKGQAM 545

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE    
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 604

Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
                   NT          MAW    IN       Y  RRYG++   I +AWN++  T 
Sbjct: 605 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILDTA 664

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y        K  D    +     SII+   G    +G        +KS  S++ H  + Y
Sbjct: 665 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 701

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +    
Sbjct: 702 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 761

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
             +S +FLEL++  + +L+    FL+G W+E A+ + ++ +   +  +E+NAR  +T W 
Sbjct: 762 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 821

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                +   L+DY N+ WSGL  DYY  R   +   +   L+ G     K    +W K+ 
Sbjct: 822 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFKME 877

Query: 505 NDWQNGRN----VYPVESNGDAL 523
            DW N ++    +YP E++ + L
Sbjct: 878 YDWVNKKSDTDKLYPTEASNENL 900


>gi|168207628|ref|ZP_02633633.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens E str. JGS1987]
 gi|170661027|gb|EDT13710.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens E str. JGS1987]
          Length = 2104

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 165/563 (29%), Positives = 259/563 (46%), Gaps = 80/563 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +   P A+  
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQEEVFGEVTNFYGVDPFHEGGNT 496

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE    
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 604

Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
                   NT          MAW    IN       Y  RRYG++   I DAWN++  T 
Sbjct: 605 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILDTA 664

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y        K  D    +     SII+   G    +G        +KS  S++ H  + Y
Sbjct: 665 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 701

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +    
Sbjct: 702 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 761

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
             +S +FLEL++  + +L+    FL+G W+E A+ + ++ +   +  +E+NAR  +T W 
Sbjct: 762 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 821

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                +   L+DY N+ WSGL  DYY  R   +   +   L+ G     K    +W K+ 
Sbjct: 822 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFKME 877

Query: 505 NDWQNGRN----VYPVESNGDAL 523
            DW N ++    +YP E++ + L
Sbjct: 878 YDWVNKKSDTDKLYPTEASNENL 900


>gi|182624959|ref|ZP_02952737.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens D str. JGS1721]
 gi|177909756|gb|EDT72174.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens D str. JGS1721]
          Length = 2104

 Score =  213 bits (542), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 165/563 (29%), Positives = 259/563 (46%), Gaps = 80/563 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +   P A+  
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE    
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 604

Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
                   NT          MAW    IN       Y  RRYG++   I DAWN++  T 
Sbjct: 605 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILDTA 664

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y        K  D    +     SII+   G    +G        +KS  S++ H  + Y
Sbjct: 665 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 701

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +    
Sbjct: 702 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 761

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
             +S +FLEL++  + +L+    FL+G W+E A+ + ++ +   +  +E+NAR  +T W 
Sbjct: 762 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 821

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                +   L+DY N+ WSGL  DYY  R   +   +   L+ G     K    +W K+ 
Sbjct: 822 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFKME 877

Query: 505 NDWQNGRN----VYPVESNGDAL 523
            DW N ++    +YP E++ + L
Sbjct: 878 YDWVNKKSDTDKLYPTEASNENL 900


>gi|423293381|ref|ZP_17271508.1| hypothetical protein HMPREF1070_00173 [Bacteroides ovatus
           CL03T12C18]
 gi|392678324|gb|EIY71732.1| hypothetical protein HMPREF1070_00173 [Bacteroides ovatus
           CL03T12C18]
          Length = 718

 Score =  213 bits (542), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 157/551 (28%), Positives = 262/551 (47%), Gaps = 72/551 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP       P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPITPAFAGFVPEGFVQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
           V+A  +        N+ ++  +G +   +++   V+Y  + +    +     D I    D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADMGWSS-----DSI----D 474

Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
           +D  +    E +Y  Y   + +   L  +T   S Y +P   + T               
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVIPDQRRISKIDLSD 534

Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
           + ++A+ L+ +  +EL +S  YR DLI+     +A  A   +   ++    N      + 
Sbjct: 535 DYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRN 594

Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
            ++ ++L+ D+D LLA H  + L  W+E A+      +++  YE NA+  IT W      
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648

Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
              +  DY  ++WSGL++DYY PR  +YF        + D  ++++W  +WI  T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWIN 697

Query: 510 GRNVY--PVES 518
               +  PVE+
Sbjct: 698 STTPFDDPVEA 708


>gi|194695302|gb|ACF81735.1| unknown [Zea mays]
          Length = 173

 Score =  213 bits (542), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 100/152 (65%), Positives = 119/152 (78%)

Query: 389 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 448
           L + FL LV D+D LL+ H+GFLLGPWLESAK LA+N EQE QYEWNARTQITMWFDNT+
Sbjct: 7   LCQHFLSLVNDLDTLLSSHEGFLLGPWLESAKGLARNSEQEIQYEWNARTQITMWFDNTE 66

Query: 449 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQ 508
            +ASLLRDY NKYWSGLL+DYYGPRAAIYFK+++ S+E+   F LK+WRREWI LTN+WQ
Sbjct: 67  TKASLLRDYANKYWSGLLQDYYGPRAAIYFKHLLLSMENNAPFALKEWRREWISLTNNWQ 126

Query: 509 NGRNVYPVESNGDALITSQWLYNKYLQGTGVF 540
           + R V+   + GD L  SQ LY KYL    + 
Sbjct: 127 SDRKVFSTTATGDPLNISQSLYTKYLSNADLL 158


>gi|169346867|ref|ZP_02865815.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens C str. JGS1495]
 gi|169296926|gb|EDS79050.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens C str. JGS1495]
          Length = 2104

 Score =  213 bits (542), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 165/563 (29%), Positives = 259/563 (46%), Gaps = 80/563 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +   P A+  
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE    
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 604

Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
                   NT          MAW    IN       Y  RRYG++   I DAWN++  T 
Sbjct: 605 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILDTA 664

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y        K  D    +     SII+   G    +G        +KS  S++ H  + Y
Sbjct: 665 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 701

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +    
Sbjct: 702 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 761

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
             +S +FLEL++  + +L+    FL+G W+E A+ + ++ +   +  +E+NAR  +T W 
Sbjct: 762 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 821

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                +   L+DY N+ WSGL  DYY  R   +   +   L+ G     K    +W K+ 
Sbjct: 822 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGG----AKAPNIDWFKME 877

Query: 505 NDWQNGRN----VYPVESNGDAL 523
            DW N ++    +YP E++ + L
Sbjct: 878 YDWVNKKSDTDKLYPTEASNENL 900


>gi|18309848|ref|NP_561782.1| alpha-N-acetylglucosaminidase [Clostridium perfringens str. 13]
 gi|18144526|dbj|BAB80572.1| probable alpha-N-acetylglucosaminidase [Clostridium perfringens
           str. 13]
 gi|288872041|dbj|BAI70446.1| alpha-N-acetylglucosaminidase [Clostridium perfringens]
          Length = 2104

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 165/563 (29%), Positives = 258/563 (45%), Gaps = 80/563 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +   P A+  
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F  +   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE    
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 604

Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
                   NT          MAW    IN       Y  RRYG++   I DAWN++  T 
Sbjct: 605 IGITPEAINTNPLAHELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILDTA 664

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y        K  D    +     SII+   G    +G        +KS  S++ H  + Y
Sbjct: 665 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 701

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +    
Sbjct: 702 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 761

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
             +S +FLEL++  + +L+    FL+G W+E A+ + ++ +   +  +E+NAR  +T W 
Sbjct: 762 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 821

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                +   L+DY N+ WSGL  DYY  R   +   +   L+ G     K    +W K+ 
Sbjct: 822 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGG----AKAPNIDWFKME 877

Query: 505 NDWQNGRN----VYPVESNGDAL 523
            DW N ++    +YP E++ + L
Sbjct: 878 YDWVNKKSDTDKLYPTEASNENL 900


>gi|168212494|ref|ZP_02638119.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens CPE str. F4969]
 gi|170716100|gb|EDT28282.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens CPE str. F4969]
          Length = 2104

 Score =  213 bits (541), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 165/563 (29%), Positives = 259/563 (46%), Gaps = 80/563 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +   P A+  
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE    
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 604

Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
                   NT          MAW    IN       Y  RRYG++   I DAWN++  T 
Sbjct: 605 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILDTA 664

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y        K  D    +     SII+   G    +G        +KS  S++ H  + Y
Sbjct: 665 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 701

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +    
Sbjct: 702 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 761

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
             +S +FLEL++  + +L+    FL+G W+E A+ + ++ +   +  +E+NAR  +T W 
Sbjct: 762 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDADDWTKDLFEFNARALVTTWG 821

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                +   L+DY N+ WSGL  DYY  R   +   +   L+ G     K    +W K+ 
Sbjct: 822 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFKME 877

Query: 505 NDWQNGRN----VYPVESNGDAL 523
            DW N ++    +YP E++ + L
Sbjct: 878 YDWVNKKSDTDKLYPTEASNENL 900


>gi|320106778|ref|YP_004182368.1| alpha-N-acetylglucosaminidase [Terriglobus saanensis SP1PR4]
 gi|319925299|gb|ADV82374.1| Alpha-N-acetylglucosaminidase [Terriglobus saanensis SP1PR4]
          Length = 754

 Score =  212 bits (540), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 165/551 (29%), Positives = 243/551 (44%), Gaps = 96/551 (17%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++   GPLP+ +++Q+ VLQ+KIL R+  LGM PV PAFSG VP   + + P A+  
Sbjct: 220 MGNVNNIDGPLPEHFIEQKRVLQRKILDRMRSLGMRPVAPAFSGFVPQGFKRLHPKAETF 279

Query: 61  QLGNWF--SVKSDPRWCCTYLLDATDP-LFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
            L  W     K+ PR   T++L   +  L+  IG+ FIE+   EYG   + Y  DTF+E 
Sbjct: 280 TL-LWLPEEFKTIPRSTRTFILHPGEQDLYRLIGKKFIEEYKAEYGEVQY-YLADTFNEL 337

Query: 118 TPPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
             PV      E +   G  +Y G+ +GD +  W+MQGWLF YD  FW    + ALL  +P
Sbjct: 338 AVPVREEHRFEDLERFGRTVYEGILAGDPNGTWVMQGWLFVYDVAFWNSESVAALLRGIP 397

Query: 174 LGKLVVLDLFAEVKPI---------WSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
             +++++D   ++ P          W T K F+G  +I  M H F GN  + G L  +A 
Sbjct: 398 NDRMLIIDYANDLAPAVKGKYAPGQWKTQKAFFGKQWINGMAHTFGGNNNVKGNLKLMAS 457

Query: 225 GPVEARTS-----------------------ENTTMA-----------WINQYSVRRYGR 250
            P    TS                       E  T A           WI  Y   RYG 
Sbjct: 458 EPASVLTSPERGNLVGWGMCPEGIETNEVVYELMTDAGWQREAIDLKQWIPAYCRSRYGA 517

Query: 251 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
             P + +AW +L  + Y+     T +        P + P+  SV  G       P  +  
Sbjct: 518 CPPVMLEAWTLLMQSAYSAHIWMTHQAWQTE---PSLAPAAASVDAG-------PTFR-- 565

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                                 RA+ LF++   EL     YR DLI+L  QA     ++ 
Sbjct: 566 ----------------------RAVALFLSCAPELGQKELYRNDLIELVVQAAGGSVDQT 603

Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
           F   ++A Q +      + +   L  +  MD LL       L  W+++A+  A+++++  
Sbjct: 604 FSLAVQAGQSHQNEVATEYAAHALGWMGRMDALLNLRPDRRLETWMQAARSYAKSDDEAA 663

Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
            Y+ NAR  IT W          L DY ++ WSGL RDYY  R   +F     SL +G  
Sbjct: 664 YYDENARRLITTW------GWPELSDYASRAWSGLTRDYYASRWEAWFA----SLHAGRP 713

Query: 491 FRLKDWRREWI 501
           F L  W++ W+
Sbjct: 714 FSLDIWQQTWL 724


>gi|383115207|ref|ZP_09935965.1| hypothetical protein BSGG_2911 [Bacteroides sp. D2]
 gi|313695376|gb|EFS32211.1| hypothetical protein BSGG_2911 [Bacteroides sp. D2]
          Length = 718

 Score =  212 bits (539), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 156/551 (28%), Positives = 261/551 (47%), Gaps = 72/551 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP       P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
           V+A  +        N+ ++  +G +   +++   V+Y  + +    +     D I    D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADMGWSS-----DSI----D 474

Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
           +D  +    E +Y  Y   + +   L  +T   S Y +P   + T               
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVVPDQRRISKIDLSD 534

Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
           + ++A+ L+ +  +EL  S  YR DLI+     +A  A   +   ++    N      + 
Sbjct: 535 DYLQAIRLYASCADELKGSELYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRN 594

Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
            ++ ++L+ D+D LLA H  + L  W+E A+      +++  YE NA+  IT W      
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648

Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
              +  DY  ++WSGL++DYY PR  +YF        + D  ++++W  +WI  T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSN 697

Query: 510 GRNVY--PVES 518
               +  PV++
Sbjct: 698 STTPFDDPVKA 708


>gi|440792549|gb|ELR13759.1| peptidase, S8/S53 subfamily protein [Acanthamoeba castellanii str.
           Neff]
          Length = 981

 Score =  211 bits (538), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 146/531 (27%), Positives = 236/531 (44%), Gaps = 104/531 (19%)

Query: 50  LQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIY 109
           ++ ++P+A +T+  +W       ++   Y L   D L+  IG   I    +E+G T HIY
Sbjct: 434 IKRIYPTANLTKSADWAGFPH--QYTNVYFLSPLDSLYKTIGSKVIRLVEEEFG-TDHIY 490

Query: 110 NCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALL 169
           N DTF+E +PP   P Y+++   A+Y GM + D  A+W+MQGW F +DPFW   ++KA L
Sbjct: 491 NADTFNEMSPPSADPTYLAAASRAVYEGMATQDPQALWVMQGWSFVFDPFWTKDRIKAYL 550

Query: 170 NSVPLGKLVVLDLFAEVKPIWSTSKQF----YGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
           + V    +++LDL ++  P W+ + QF    +G  ++WCMLHN  G   +YG L   +  
Sbjct: 551 SGVDNSDMLILDLASDNSPEWNKTGQFRDSYFGKEFVWCMLHNGGGVRGLYGNLTQYSSD 610

Query: 226 PVEARTSENTTMA---------------------------------WINQYSVRRYGRSV 252
           P+ A  +   TM                                  W+ +Y+ RRYG + 
Sbjct: 611 PLIALATPGNTMVGVGMTMEAIEQNPVVYELMSEMGWRSEAFDIVEWVQRYAERRYGLAT 670

Query: 253 PA--IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
            +  + +AW +L    YN +                +D  +          +G   +   
Sbjct: 671 GSSPVGEAWELLREATYNQS---------------GLDAGLFGFAPALGMGHGGTSN--- 712

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNE--LSASNTYRYDLIDLTRQALAKYAN 368
                           +T EV  AL LF+ S      + +  ++YD +DLTRQ LA   N
Sbjct: 713 ----------------ATKEV-EALRLFLQSAQTEGYAPNGPWQYDCVDLTRQVLANTFN 755

Query: 369 ELFLNIIEAY-----QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 423
           +++  +  AY       +D      L+   L ++ D+D LLA +  +LLG W++ A   A
Sbjct: 756 DVYSQLDAAYTSYATNKSDTLPFLPLAAELLGIISDLDRLLATNPNYLLGTWIKDAVSWA 815

Query: 424 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 483
              EQ   Y++NAR QIT+W  + Q     + DY  K+W+GLL               ++
Sbjct: 816 SIPEQALHYQFNARNQITLWGPDGQ-----ISDYATKHWAGLL---------------MK 855

Query: 484 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
           ++ +G  F    +  E ++L   W      YP    GD L  +  +  KYL
Sbjct: 856 AVGAGVMFNSTAYGTELLQLEQKWNQENTTYPTTPTGDTLQVALRISQKYL 906


>gi|168209163|ref|ZP_02634788.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens B str. ATCC 3626]
 gi|170712640|gb|EDT24822.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens B str. ATCC 3626]
          Length = 2104

 Score =  211 bits (538), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 165/563 (29%), Positives = 257/563 (45%), Gaps = 80/563 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +   P A+  
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F  +   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE    
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 604

Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
                   NT          MAW    IN       Y  RRYG++   I DAWN++  T 
Sbjct: 605 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILDTA 664

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y        K  D    +     SII+   G    +G        +KS  S++ H  + Y
Sbjct: 665 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 701

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +    
Sbjct: 702 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 761

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
             +S +FLEL++  + +L+    FL+G W+E A+ + ++ +   +  +E+NAR  +T W 
Sbjct: 762 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDADDWTKDLFEFNARALVTTWG 821

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                    L+DY N+ WSGL  DYY  R   +   +   L+ G     K    +W K+ 
Sbjct: 822 SRNNANGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFKME 877

Query: 505 NDWQNGRN----VYPVESNGDAL 523
            DW N ++    +YP E++ + L
Sbjct: 878 YDWVNKKSDTDKLYPTEASNENL 900


>gi|110801838|ref|YP_698175.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens SM101]
 gi|110682339|gb|ABG85709.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens SM101]
          Length = 2095

 Score =  211 bits (537), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 166/563 (29%), Positives = 259/563 (46%), Gaps = 80/563 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +   P A+  
Sbjct: 369 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 428

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F  +   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 429 SQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 487

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 488 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 536

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE    
Sbjct: 537 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 595

Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
                   NT          MAW    IN       Y  RRYG++   I +AWN++  T 
Sbjct: 596 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNEEILEAWNIILDTA 655

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y        K  D    +     SII+   G    +G        +KS  S++ H  + Y
Sbjct: 656 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 692

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   DA   
Sbjct: 693 DKSEFEKAIEIFSKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNRDAEKF 752

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
             +S +FLEL++  + +L+    FL+G W+E A+ + ++ +   +  +E+NAR  +T W 
Sbjct: 753 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDADDWTKDLFEFNARALVTTWG 812

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                +   L+DY N+ WSGL  DYY  R   +   +   L+ G     K    +W K+ 
Sbjct: 813 SRNNADGGGLKDYSNRQWSGLTGDYYYARWEKWINGLQIELDGG----AKAPNIDWFKME 868

Query: 505 NDWQNGRN----VYPVESNGDAL 523
            DW N ++    +YP E++ + L
Sbjct: 869 YDWVNKKSDTDKLYPTEASNENL 891


>gi|440731409|ref|ZP_20911430.1| N-acetylglucosaminidase, partial [Xanthomonas translucens DAR61454]
 gi|440373101|gb|ELQ09870.1| N-acetylglucosaminidase, partial [Xanthomonas translucens DAR61454]
          Length = 732

 Score =  211 bits (537), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 165/591 (27%), Positives = 247/591 (41%), Gaps = 110/591 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  +  PLPQ W++ +  LQ++IL R+  LGM PVLPAFSG VP A     P A+I 
Sbjct: 168 MGNIEAYDAPLPQQWIEDKYALQQRILQRMRTLGMKPVLPAFSGYVPKAFAQAHPQARIY 227

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF +I + FI+   + YG+ ++ Y  D F+E  PP
Sbjct: 228 RMRAWEGFHE------TYWLDPADPLFTKIAQRFIQLYDRTYGKGTY-YLADAFNEMLPP 280

Query: 121 VDS----------------------PEY--------ISSLGAAIYSGMQSGDSDAVWLMQ 150
           + +                      PE         ++  G A+Y  +   + DAVW+MQ
Sbjct: 281 IAADGSDARLASYGDSTANTAKTAPPEVSPAQRDKRLADYGRALYESIHRANPDAVWVMQ 340

Query: 151 GWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW P  + A L  VP  KL+VLD+  +  P  W  S  F G  +I+  +HN
Sbjct: 341 GWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDRYPGTWKLSDAFDGKQWIYGYVHN 400

Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENT-------TMAWINQ-- 242
           + G+  +YG L                   + FG       +N+       T+AW  Q  
Sbjct: 401 YGGSNPVYGDLAFYRDDLRALLADKDKQQLVGFGAFPEGLHDNSVVYEYMYTLAWGGQQR 460

Query: 243 --------YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
                   Y+  RYG + PA++ AW+ L   V +                    P     
Sbjct: 461 SLQDWLGDYTRARYGHTSPALRAAWDDLQAAVLSTRYWT---------------PRWWRS 505

Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
             G Y  + +P     + + E +  D P L        RAL+  +A   E + +  YRYD
Sbjct: 506 RAGAYLLFKRPTLD--IGEFEGAPGDPPRL-------RRALDQLLALAPEYADAPLYRYD 556

Query: 355 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 414
           L+D  R       +      + AY+  D         R    V+ +DGL+      +L  
Sbjct: 557 LVDFARHYATGRVDAQLQQAVAAYRRGDVAAGDAAFARVQAAVQQLDGLVGGQQE-ILSS 615

Query: 415 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 474
           WL  A+  A+  +    Y  +A+ QI++W          L DY +K W G+  DYY PR 
Sbjct: 616 WLGDAEGDAKTPQDAAYYRRDAKAQISVWGGEGN-----LGDYASKAWQGMYADYYLPRW 670

Query: 475 AIYFKYMIESLESGDGF-------RLKDWRREWIKLTNDWQNGRNVYPVES 518
           A+  + +  +  SG          RL+ W R+W+     +       PV +
Sbjct: 671 ALAMQALRAAAVSGGSVDEAALQQRLRVWERDWVACETPYTRRAPADPVAA 721


>gi|170292392|pdb|2VC9|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
           In Complex With 2-Acetamido-1,2-Dideoxynojirmycin
 gi|170292393|pdb|2VCA|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
           In Complex With Beta-N-Acetyl-D-Glucosamine
 gi|170292394|pdb|2VCB|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
           In Complex With Pugnac
 gi|170292395|pdb|2VCC|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
          Length = 891

 Score =  211 bits (536), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 161/563 (28%), Positives = 257/563 (45%), Gaps = 80/563 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +     A+  
Sbjct: 344 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTI 403

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 404 SQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 462

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 463 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 511

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTT- 236
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE+   
Sbjct: 512 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 570

Query: 237 --------------------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
                               MAW    IN       Y  RRYG++   I +AWN++  T 
Sbjct: 571 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILDTA 630

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y        K  D    +     SII+   G    +G        +KS  S++ H  + Y
Sbjct: 631 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 667

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +    
Sbjct: 668 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 727

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
             +S +FLEL++  + +L+    FL+G W+E A+ + ++ +   +  +E+NAR  +T W 
Sbjct: 728 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 787

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                +   L+DY N+ WSGL  DYY  R   +   +   L+ G     K    +W K+ 
Sbjct: 788 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGG----AKAPNIDWFKME 843

Query: 505 NDWQNGRN----VYPVESNGDAL 523
            DW N ++    +YP E++ + L
Sbjct: 844 YDWVNKKSDTDKLYPTEASNENL 866


>gi|154321596|ref|XP_001560113.1| hypothetical protein BC1G_00945 [Botryotinia fuckeliana B05.10]
          Length = 701

 Score =  211 bits (536), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 150/506 (29%), Positives = 243/506 (48%), Gaps = 81/506 (16%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G WGG +P +W++ Q +LQKKI+ R+ ELG+ PVLPAF+G VP  L+ V P+A I
Sbjct: 185 FGNIQGSWGGTIPLAWIEDQHLLQKKIVQRMVELGITPVLPAFTGFVPRDLRRVAPNANI 244

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
               +W ++        T+L    DPLF  +   F+  Q + YG  SHIY  D F+EN P
Sbjct: 245 INGSDWGNLFPFEYSNDTFLY-PIDPLFKTLQHTFLSLQSEYYGNVSHIYTLDQFNENLP 303

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-L 177
               P Y+ ++    Y  +QS DS+A W++QGWLF +   FW   +++A L  VP  + +
Sbjct: 304 ASGDPLYLGNISRGTYDSLQSFDSNATWMLQGWLFYAASSFWTQDRVEAYLGGVPKNESM 363

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTT 236
           ++LDLF+E  P W  + Q+YG P+IWC LH + G   +YG + +I    +EA R SE   
Sbjct: 364 LILDLFSESFPEWENTHQYYGKPWIWCQLHGYGGTPGIYGQIYNITNSSIEAFRNSEKMV 423

Query: 237 MAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 296
                                        + N  +G       +  +  ++ P+I     
Sbjct: 424 ----------------------------GMGNTMEGQDGNGLILPKSLLEMQPNIT---- 451

Query: 297 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI---ASGNELSASNTYRY 353
              +N+G+      + +S T    +P      +++ RA  L      S  EL   N +++
Sbjct: 452 ---ENHGR------LGQSLTIDLFNP------ADLFRAWGLLYNASISVPELWYDNGWKF 496

Query: 354 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELV-EDMDGLLACHDGFLL 412
           D++D+TRQ L++     ++++IE Y    A   F+ +   L ++  ++D +L+    F L
Sbjct: 497 DMVDVTRQVLSERFKLEYVDLIEKYT---AEIDFEATSENLSMILRELDDILSTSPHFRL 553

Query: 413 GPWLESAKQLAQNE-----------------EQEKQYEWNARTQITMWFDNTQEEASLLR 455
             W+ +A   + N                  + +  + +NA  QIT+W    Q     + 
Sbjct: 554 DTWINAAIASSPNSSTYPIPSSDGSSELNITQTQHLFAYNAINQITIWGPTGQ-----IN 608

Query: 456 DYGNKYWSGLLRDYYGPRAAIYFKYM 481
           DY +K W GL+R YY  R  I+  Y+
Sbjct: 609 DYASKSWGGLVRGYYLKRWEIFLDYI 634


>gi|322703040|gb|EFY94656.1| alpha-N-acetylglucosaminidase, putative [Metarhizium anisopliae
           ARSEF 23]
          Length = 774

 Score =  211 bits (536), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 151/541 (27%), Positives = 255/541 (47%), Gaps = 84/541 (15%)

Query: 5   HGWGGP--LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQL 62
             WGG   LP ++++QQ  LQK+I+ R+ ELG+ PVLPAF G VP +++ V P+A +T  
Sbjct: 209 RSWGGKGDLPLAFIEQQFELQKQIVTRMVELGITPVLPAFPGFVPESIKKVRPNANLTVS 268

Query: 63  GNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVD 122
            NWF+   D ++     LD  D  + E+ + F+ +Q+  +G  +++Y  D F+E +P   
Sbjct: 269 PNWFAPAPD-KYTRDLFLDPLDDTYAELQKLFVTKQIDAFGNVTNVYTLDQFNELSPASG 327

Query: 123 SPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLGK-LVVL 180
              Y+  +    Y+G+ + +  AVWL+QGWL FS   FW  P++ A L  V   + ++VL
Sbjct: 328 DTAYLRGIARNTYAGLTAANPAAVWLLQGWLFFSSRNFWTQPRIDAYLGGVEDHQGMLVL 387

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT----- 235
           DL++EV P W  +  + G P+IWC LH+F GN+ + G + ++   P++A     +     
Sbjct: 388 DLYSEVNPQWQRTNSYSGKPWIWCQLHDFGGNMALEGRVQTLTSAPIDALAQSKSLVGFG 447

Query: 236 ---------------------------TMAWINQYSVRRYG--RSVPA-IQDAWNVLYHT 265
                                      T A+   +  +RY    S+P+ +  AW +L   
Sbjct: 448 LTPEAYEGNEVVYDILLDQAWSATPLDTQAYFASWVTKRYAGISSIPSELYRAWEILRTD 507

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VY+ T   TD  + V VA   + P++  +                   + T  + HP   
Sbjct: 508 VYSNTR--TDIPQ-VPVATYQLRPALSGIA------------------NRTGHFPHPTAL 546

Query: 326 YSTSEVIRA-----LELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
           +    V++      LE     G+ L     ++ D +D++RQ L+   + L+ +++ AY+ 
Sbjct: 547 HYDPLVLQGVWKLMLEALTRQGS-LWKVPAFQLDFVDVSRQMLSNQFDVLYADLVNAYKC 605

Query: 381 NDAHG-------------VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 427
           +   G             V     R L L+  +D  L     F L  W+++A    +   
Sbjct: 606 STGAGGSRELRSNTPNCDVKAAGARLLFLLSTLDLTLLTSRHFALQSWVDAASAWGKAAG 665

Query: 428 QEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
            E  + +NAR+Q+T+W    Q  A+ L DY  K W GL+  YY  R +I+   ++ +  S
Sbjct: 666 NEDLFTFNARSQVTVW----QVNATNLNDYAAKAWGGLVGSYYKGRWSIFVDALVAASSS 721

Query: 488 G 488
           G
Sbjct: 722 G 722


>gi|336412611|ref|ZP_08592964.1| hypothetical protein HMPREF1017_00072 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942657|gb|EGN04499.1| hypothetical protein HMPREF1017_00072 [Bacteroides ovatus
           3_8_47FAA]
          Length = 718

 Score =  210 bits (535), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 156/551 (28%), Positives = 261/551 (47%), Gaps = 72/551 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP       P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
           V+A  +        N+ ++  +G +   +++   V+Y  + +    +     D I    D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADMGWSS-----DSI----D 474

Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
           +D  +    E +Y  Y   + +   L  +T   S Y +P   + T               
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVIPDQRRISKIDLSD 534

Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
           + ++A+ L+ +  +EL  S  YR DLI+     +A  A   +   ++    N      + 
Sbjct: 535 DYLQAIRLYASCADELKNSELYRNDLIEFVSYYVAAKAEIFYKQALKDDSENRVLAAQRN 594

Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
            ++ ++L+ D+D LLA H  + L  W+E A+      +++  YE NA+  IT W      
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648

Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
              +  DY  ++WSGL++DYY PR  +YF        + D  ++++W  +WI  T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSN 697

Query: 510 GRNVY--PVES 518
               +  PV++
Sbjct: 698 STTPFDDPVKA 708


>gi|380512475|ref|ZP_09855882.1| N-acetylglucosaminidase [Xanthomonas sacchari NCPPB 4393]
          Length = 785

 Score =  210 bits (534), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 158/574 (27%), Positives = 242/574 (42%), Gaps = 110/574 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W++ +  LQ +IL R+  LGM PVLPAF+G VP A     P A+I 
Sbjct: 221 MGNIEGYDAPLPQQWIEDKHALQLRILQRMRALGMKPVLPAFAGYVPKAFAQAHPQARIY 280

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE---- 116
           ++  W           TY LD  DPLF +I + FI+   + YG+ ++ Y  D F+E    
Sbjct: 281 RMRAWEGFHE------TYWLDPADPLFAQIAQRFIQLYDRTYGKGTY-YLADAFNEMLPP 333

Query: 117 --------------------------NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
                                       PPV   + +++ G A+Y+ +   + DAVW+MQ
Sbjct: 334 IAADGSDARLASYGDSTANTAKTKPPEVPPVQRDKRLAAYGRALYASIHRANPDAVWVMQ 393

Query: 151 GWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW P  + A L  VP  KL+VLD+  +  P  W  S  F G  +I+  +HN
Sbjct: 394 GWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDRYPGTWKLSDAFDGKQWIYGYVHN 453

Query: 209 FAGNIEMYGIL-----------------DSIAFG--PVEARTS---------------EN 234
           + G+  +YG L                   + FG  P    T+               + 
Sbjct: 454 YGGSNPVYGDLAFYREDLRALLADKDKQQLVGFGAFPEGLHTTSVVYEYMYALAWGAQQR 513

Query: 235 TTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
               W++ Y+  RYG + PA++ AW+ L  +V +                    P     
Sbjct: 514 PLQDWLDDYTRARYGHTSPALRAAWDDLQASVLSTRYWT---------------PRWWRS 558

Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
             G Y  + +P     + + E +  D P L        RAL+  +A   E + +  YRYD
Sbjct: 559 RAGAYLLFKRPTLD--IGEFEGAPGDPPRL-------RRALQQLLALAPEYADAPLYRYD 609

Query: 355 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 414
           L+D  R       +      + AY+  D       + R  E V  +D L+       L  
Sbjct: 610 LVDFARHYATGRVDVQLQQAVAAYRRGDVAAGDAATARVREAVTQLDSLVGGQQD-TLSS 668

Query: 415 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 474
           WL++A   A   +    Y  +A+ Q+++W          L DY +K W G+  DYY PR 
Sbjct: 669 WLDAAAGYATTPQDAAYYRRDAKAQVSVWGGEGN-----LGDYASKAWQGMYADYYLPRW 723

Query: 475 AIYFKYMIESLESGDGF-------RLKDWRREWI 501
            +  + + E+  +G          RL+ W R+W+
Sbjct: 724 TLALQMLSEAAVAGGSVDEAQLQQRLRAWERDWV 757


>gi|294674521|ref|YP_003575137.1| alpha-N-acetylglucosaminidase [Prevotella ruminicola 23]
 gi|294472030|gb|ADE81419.1| putative alpha-N-acetylglucosaminidase [Prevotella ruminicola 23]
          Length = 754

 Score =  209 bits (533), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 153/532 (28%), Positives = 237/532 (44%), Gaps = 87/532 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGPLP SW  +Q  LQK+IL R+ +LGM+PVLP + G VP   +       + 
Sbjct: 186 MNNLEGWGGPLPTSWYARQEKLQKQILARMKQLGMHPVLPGYCGMVPHDAKEKL-GLNVA 244

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT-- 118
             G W   +          L  TD  F EI   +  +  K +G+  + Y+ D F E+   
Sbjct: 245 DAGLWNGFQRPAN------LLPTDARFSEIATLYYNELTKLFGKADY-YSMDPFHESNDD 297

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
           P +D     +  G A+   M+  +  AVW++QGW  + +P       +A+++ +  G L+
Sbjct: 298 PNID----YAKAGQAMMQAMKRVNPKAVWVIQGW--TENP------REAMVDDMKTGDLL 345

Query: 179 VLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD------------ 220
           VLDLF+E +P      IW   + +    +++C+L NF  N+ ++G +D            
Sbjct: 346 VLDLFSECRPMFGIPSIWKREQGYKQHQWLFCLLENFGANVGLHGRMDQLLDNFYMLQSS 405

Query: 221 ----------SIAF-------GPVEA--------RTSENTTMAWINQYSVRRYGRSVPAI 255
                      I F        PV          R  + T   W+  Y   RYG    AI
Sbjct: 406 KFQAQSSKLKGIGFTMEGSENNPVMFELMSELPWRPEKFTKEQWVKNYVKARYGVEDEAI 465

Query: 256 QDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 315
           + AW  L  ++YNC  G   +             SI           G+P     +   +
Sbjct: 466 EKAWLTLAKSIYNCPAGNNQQG---------PHESIFC---------GRPT----LNNFQ 503

Query: 316 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 375
            SS+     +Y  +   +A +L  +   +   +N + YDL+D+TRQALA  A   +   I
Sbjct: 504 ASSWSKMKNYYDPAMTKKAAKLMNSVAEKYRGNNNFEYDLVDITRQALADQARLQYQKTI 563

Query: 376 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWN 435
             Y+        + + RFL+++   D LL     F +G W + A       E++K YEWN
Sbjct: 564 ADYKAFSRKQFDRDAERFLKMLLLQDKLLGTRTEFRVGHWTQDAVNAGNTAEEKKLYEWN 623

Query: 436 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
           AR QIT W +    +   LRDY +K W GLL+D+Y  R   YF  +   +++
Sbjct: 624 ARVQITTWGNRYCADTGGLRDYAHKEWQGLLKDFYYVRWKSYFDALAAQMKA 675


>gi|383280354|pdb|4A4A|A Chain A, Cpgh89 (E483q, E601q), From Clostridium Perfringens, In
           Complex With Its Substrate Glcnac-Alpha-1,4-Galactose
          Length = 914

 Score =  209 bits (532), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 160/563 (28%), Positives = 257/563 (45%), Gaps = 80/563 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +     A+  
Sbjct: 367 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTI 426

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F +  NT
Sbjct: 427 SQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHQGGNT 485

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 486 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 534

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTT- 236
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE+   
Sbjct: 535 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 593

Query: 237 --------------------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
                               MAW    IN       Y  RRYG++   I +AWN++  T 
Sbjct: 594 IGITPQAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILDTA 653

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y        K  D    +     SII+   G    +G        +KS  S++ H  + Y
Sbjct: 654 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 690

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +    
Sbjct: 691 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 750

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
             +S +FLEL++  + +L+    FL+G W+E A+ + ++ +   +  +E+NAR  +T W 
Sbjct: 751 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 810

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                +   L+DY N+ WSGL  DYY  R   +   +   L+ G     K    +W K+ 
Sbjct: 811 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGG----AKAPNIDWFKME 866

Query: 505 NDWQNGRN----VYPVESNGDAL 523
            DW N ++    +YP E++ + L
Sbjct: 867 YDWVNKKSDTDKLYPTEASNENL 889


>gi|224027030|ref|ZP_03645396.1| hypothetical protein BACCOPRO_03789 [Bacteroides coprophilus DSM
           18228]
 gi|224020266|gb|EEF78264.1| hypothetical protein BACCOPRO_03789 [Bacteroides coprophilus DSM
           18228]
          Length = 837

 Score =  209 bits (532), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 160/552 (28%), Positives = 233/552 (42%), Gaps = 106/552 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI- 59
           M N+ G  GPL   W   QL LQ KIL R+  LGM P+ P F G +P A + ++P   I 
Sbjct: 193 MGNVSGIDGPLNPDWHAGQLALQHKILDRMRALGMKPICPGFPGFIPEAFKRIYPDLHIV 252

Query: 60  -TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
            T  G  F           +++  T+PLF +I  AFI++  KE+G+  + Y  D+F+E  
Sbjct: 253 ETHWGGAFH---------NWMISPTEPLFAKISEAFIKEWEKEFGKCDY-YLVDSFNEMD 302

Query: 119 PPVDSP------EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNS 171
            P          E  +S G  +YS ++  + DAVW+MQGW+F Y    W    + AL++ 
Sbjct: 303 IPFPEKGNPARYEMAASYGEKVYSSIKRANKDAVWVMQGWMFGYQRHIWDYETLGALVSR 362

Query: 172 VPLGKLVVLDL-------FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
           VP  K+++LDL       F   +  W   K FY   +++ ++ N  G   M G+LD  A 
Sbjct: 363 VPDDKMLLLDLAVDYNRHFWHSEVNWEYYKGFYNKQWVYSVIPNMGGKTGMTGVLDFYAN 422

Query: 225 GPVEARTSENT----------------------------------TMAWINQYSVRRYGR 250
           G +EA +S N                                      W+ QYS+ RYG+
Sbjct: 423 GHLEALSSSNRGNLVAHGLAPEGIENNEVLYELVTDAGWSDHRMDVRDWLKQYSINRYGK 482

Query: 251 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
           +   +  AW+ L  +VY                F D          G  +N    +S   
Sbjct: 483 APAQLMKAWDYLLKSVYG--------------TFTDHPRFNWQFRPGLVKNGSINIS--- 525

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
                              +  + LE F+A+  EL  S  Y  DL ++T   L   A  L
Sbjct: 526 ------------------DDYFKGLESFVAASEELKDSPYYLTDLCEMTAHYLGSKAEIL 567

Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
              I + Y L D      L  RF   +  MD +L+ H    L  W+  A + A+ E Q K
Sbjct: 568 TRQIDQEYLLGDTLQAHFLQSRFETFMLGMDRILSQHPTLRLDRWVSFASKAARTEAQRK 627

Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
           QYE NAR  +T+W          + DY  + WSGL+  YY  R   Y+K      +SG  
Sbjct: 628 QYEMNARRIVTVW-------GPPVDDYSARMWSGLVGSYYLGRWKEYYK----GRDSGKS 676

Query: 491 FRLKDWRREWIK 502
             L  W R+W++
Sbjct: 677 ADLSSWERKWVE 688


>gi|374384144|ref|ZP_09641670.1| hypothetical protein HMPREF9449_00056 [Odoribacter laneus YIT
           12061]
 gi|373228751|gb|EHP51054.1| hypothetical protein HMPREF9449_00056 [Odoribacter laneus YIT
           12061]
          Length = 835

 Score =  208 bits (530), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 152/549 (27%), Positives = 248/549 (45%), Gaps = 103/549 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+    GP+P  W   Q+ LQ KIL R+  LGM P+ PAF+G VP AL+ ++P  KI 
Sbjct: 192 MGNISQIDGPMPVEWHSDQVELQHKILKRMKLLGMKPICPAFAGFVPLALKRLYPDVKII 251

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT-- 118
           +   W    +       ++L   + LF  IG+ FIE+  KE+G+ +  Y  D+F+E    
Sbjct: 252 ET-TWAGFHN-------WMLSPEEELFTRIGQLFIEEWEKEFGK-NDFYLADSFNEMDVP 302

Query: 119 -PPVDSPEYISSL---GAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
            PP+ + E    L   G  +Y G+++G+ DAVW+MQGW+F Y    W    ++AL++ VP
Sbjct: 303 FPPIGTKERYDMLAFYGEQVYKGIKAGNPDAVWVMQGWMFGYQRDIWDYETLQALVSKVP 362

Query: 174 LGKLVVLDLFAEV-KPIWSTS------KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K+++LDL A+  K +W         K F+   +++ ++ N  G     GIL   A G 
Sbjct: 363 DDKMMLLDLAADYNKNVWGNGMNWEFYKGFFNKLWVYSVIPNMGGKTGATGILSFYANGH 422

Query: 227 VEA----------------------------------RTSENTTMAWINQYSVRRYGRSV 252
           +EA                                   +SE     W+  YS+ RYG++ 
Sbjct: 423 LEALNSPNRGRLFGFGMAPEGTENNEVVYEMICDAGWSSSEIDVKQWLKDYSLCRYGKTC 482

Query: 253 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 312
           P + + W  L  +VY                F D    +  +  G+        S +  +
Sbjct: 483 PEMDEVWEGLCKSVYG--------------TFTDHPRFLWQLRPGR--------SGKGTV 520

Query: 313 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 372
            ++++ Y             RA+E       +++ S  ++ D +++T   L      L  
Sbjct: 521 NTDSNFY-------------RAVEKMAECAPKMTESPLFKADFLEMTAFYLGGKMEALAS 567

Query: 373 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 432
            I ++Y   +     ++ ++F EL E +D LL  H  + L  W++ A++    E+ +  Y
Sbjct: 568 AIGKSYLYGNTADALKMQQQFEELGEGLDSLLESHPVYRLQRWIDFARKHGDTEKLKDYY 627

Query: 433 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 492
           E NAR  +T+W          + DY  K WSGL+RDYY PR   YF+      E+G  + 
Sbjct: 628 EMNARRIVTIW-------GPPVSDYACKLWSGLIRDYYLPRWREYFR----CKETGSKYD 676

Query: 493 LKDWRREWI 501
           L  W  +W+
Sbjct: 677 LASWESDWV 685


>gi|110800516|ref|YP_695309.1| alpha-N-acetylglucosaminidase [Clostridium perfringens ATCC 13124]
 gi|110675163|gb|ABG84150.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens ATCC 13124]
          Length = 2095

 Score =  208 bits (530), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 163/563 (28%), Positives = 258/563 (45%), Gaps = 80/563 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +     A+  
Sbjct: 369 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTI 428

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 429 SQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 487

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 488 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 536

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE    
Sbjct: 537 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 595

Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
                   NT          MAW    IN       Y  RRYG++   I +AWN++  T 
Sbjct: 596 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILDTA 655

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y        K  D    +     SII+   G    +G        +KS  S++ H  + Y
Sbjct: 656 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 692

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +    
Sbjct: 693 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 752

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
             +S +FLEL++  + +L+    FL+G W+E A+ + ++ +   +  +E+NAR  +T W 
Sbjct: 753 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 812

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                +   L+DY N+ WSGL  DYY  R   +   +   L+ G     K    +W K+ 
Sbjct: 813 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGG----AKAPNIDWFKME 868

Query: 505 NDWQNGRN----VYPVESNGDAL 523
            DW N ++    +YP E++ + L
Sbjct: 869 YDWVNKKSDTDKLYPTEASNENL 891


>gi|404487024|ref|ZP_11022211.1| hypothetical protein HMPREF9448_02667 [Barnesiella intestinihominis
           YIT 11860]
 gi|404335520|gb|EJZ61989.1| hypothetical protein HMPREF9448_02667 [Barnesiella intestinihominis
           YIT 11860]
          Length = 722

 Score =  208 bits (530), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 163/570 (28%), Positives = 244/570 (42%), Gaps = 110/570 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+GW GPL   W  +Q+ LQ KIL R+ ELGM+P+ PAF+G VP A     P  +  
Sbjct: 200 MGNLNGWDGPLTNGWQKEQIKLQHKILNRMRELGMDPIAPAFAGFVPTAFAERHPEIQFK 259

Query: 61  QLGNW--FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
            L  W  F  K +      Y+L    P F EIG+ FIE+  KE+G+ ++ Y  D+F+E  
Sbjct: 260 HL-EWGGFDEKYN-----AYVLPPETPYFKEIGKLFIEEWEKEFGKNTY-YLSDSFNEMK 312

Query: 119 PPV------DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNS 171
            PV         + ++  G +IY  + +G+ DAVW+ QGW F Y   FW    ++ALL+ 
Sbjct: 313 LPVAEGDDDGKHKLLAQYGESIYHSIAAGNPDAVWVTQGWTFGYQHDFWDKASLQALLSR 372

Query: 172 VPLGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
           VP  K++++DL  +        +  W     FYG  +I+  + NF G   M G L   A 
Sbjct: 373 VPDDKMIIIDLGNDYPKWVWGTEQTWKNHDGFYGKKWIFSYVPNFGGKTPMTGDLQMYAT 432

Query: 225 GPVEARTSENTTM----------------------------------AWINQYSVRRYGR 250
              EA  S N                                     +W+  Y   RYG 
Sbjct: 433 SSAEALHSANAGNLVGFGSAPEGLENNEVVYELLADMGWTADSIDLDSWLPVYCKARYGG 492

Query: 251 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
              A+  AW     T Y+            +V  PD                 + +SK  
Sbjct: 493 CPAAMDSAWQRFKETAYSSLYSYPRFTWQTVV--PDT----------------RRISKLD 534

Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
           V  S                 ++ +ELF++  + L +S  Y  D I+     LA  A++ 
Sbjct: 535 VSDS----------------FLQGVELFLSCADSLESSPLYVNDAIEYASYYLAAKADDC 578

Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
           +   ++   L +     Q   R +E++ D+D LLA H  + L  W++ A+   + + ++ 
Sbjct: 579 YKRALKEDSLGNRVAAMQQLDRSVEILLDVDKLLASHPLYRLEEWVDMARDWGKTDLEKD 638

Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
            YE NA+  IT W            DY  ++WSGL++DYY PR  +YF     S +  D 
Sbjct: 639 AYEANAKRLITTW-------GGFQEDYAARFWSGLIKDYYIPRMKLYF-----SEQRAD- 685

Query: 491 FRLKDWRREWIKLTNDWQNGRNVY--PVES 518
             L  W   WIK    W N    +  P++S
Sbjct: 686 --LDRWEENWIKAP--WHNTSTSFEDPLQS 711


>gi|187734575|ref|YP_001876687.1| alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
           BAA-835]
 gi|187424627|gb|ACD03906.1| Alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
           BAA-835]
          Length = 848

 Score =  208 bits (529), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 157/536 (29%), Positives = 255/536 (47%), Gaps = 69/536 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+    GPLP SW  +Q+ LQ +IL R+  LGM P+ PAFSG VP  +  ++P AK+ 
Sbjct: 202 MGNIVNHDGPLPASWHKEQIALQHRILHRMKSLGMTPICPAFSGFVPRGILRLYPEAKLH 261

Query: 61  QLG--NWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           +LG   W      P+    + L   +PLF++IGR ++++  KE+G+ ++ +  D+F+E  
Sbjct: 262 RLGWGGW------PQKNHAHFLSPEEPLFLKIGRLYMQEWQKEFGKNTY-FLADSFNEME 314

Query: 119 PPVDSP------EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNS 171
            P +          +SSLG  IY  + S + DAVW+MQGW+F Y    W    +KALL+ 
Sbjct: 315 LPENKGGVEARNNMLSSLGEQIYRSISSTNPDAVWVMQGWMFGYQRNIWNADTLKALLSK 374

Query: 172 VPLGKLVVLDLFAEVKPI-------WSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
           VP  K+++LDL A+           W   K F+  P+++ ++ N  G   M G++D  A 
Sbjct: 375 VPDDKMLLLDLAADYNKTFWRNGMNWDVFKGFFNKPWVYSVVPNMGGKCAMTGVMDFYAN 434

Query: 225 GPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAF 284
           G +EA  S +       +  +   G +   I++  +V+Y  +   TD A  +NR      
Sbjct: 435 GHLEALNSSS-------RGRLSGMGMAPEGIENN-DVIYELI---TDAAW-RNRQ----- 477

Query: 285 PDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY----DHPHLWY------------ST 328
            +V+  + +    +Y NY   + KEA      ++Y    DHP   +            ++
Sbjct: 478 ENVEQYLENYCRARYGNYPDSM-KEAWNLFRRTAYSNLKDHPRFNWQMKPGTRGCSVNTS 536

Query: 329 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 388
            + ++ L LF+ +   L  S  +R D +++    L    NE      EA    D     +
Sbjct: 537 EDFLKGLSLFVNT-RGLEQSPLFRQDAVEMAVHYLGIRMNEAIRAAQEALDEQDQENAEK 595

Query: 389 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 448
               F +     D LL  H  + L  W+  A+    + E++ +YE NAR  +T W     
Sbjct: 596 CMAYFRKYALLADSLLEGHPTWRLSRWISFARSHGTSPEEKNKYEQNARRLVTRW----- 650

Query: 449 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
                + DY  K WSGL+RDYY PR    +++ I+S  S     +  W  +W++ T
Sbjct: 651 --GPPVDDYAAKIWSGLIRDYYLPR----WEHFIQSRLSEKNPDMGAWEEKWVRST 700


>gi|340514474|gb|EGR44736.1| glycoside hydrolase family 89 [Trichoderma reesei QM6a]
          Length = 762

 Score =  207 bits (528), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 146/525 (27%), Positives = 249/525 (47%), Gaps = 74/525 (14%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           NL G W   LP  W+D Q  LQKKI+ R+ ELG+ P+LPAF G VP A   V P A++  
Sbjct: 209 NLQGSWSSSLPFEWVDDQFALQKKIVKRMVELGITPILPAFPGFVPRAAPRVLPDARLLH 268

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
              W        +     LD  DPLF ++ R+FI +Q + YG  ++ Y  D F+E  PP 
Sbjct: 269 SIQWAGFPE--IFTEDTFLDPVDPLFAQMQRSFITKQKQAYGNVTNFYTLDQFNEMIPPS 326

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPL-GKLVV 179
               Y+ ++ +  +  ++S D +A+W+ Q WLF+ +  FW   +++A L  V     +++
Sbjct: 327 GDVAYLRNVSSNTWKALKSADPNAIWVFQAWLFAQNTTFWTNERIEAYLGGVTADSDMLI 386

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT---- 235
           LD+++E  P W  ++ +YG P+IWC L N+   I +YG + ++   P+ A     +    
Sbjct: 387 LDIWSESMPQWQRAQSYYGKPWIWCELQNYGATINLYGQIQNVTNSPILALQESTSLSGF 446

Query: 236 ----------------------------TMAWINQYSVRRYG--RSVPAIQDAWNVLYHT 265
                                       T A+ + ++  RY   +    I DAW  +  T
Sbjct: 447 GLSMEGQQNNEIVYDLLLAQAWSSEPLDTEAYFHNWASARYSSDQRPGFIHDAWETVRTT 506

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VY+ T+         +   P V  SII +           V + + + ++ +      L 
Sbjct: 507 VYDNTN---------LTLMPSVPKSIIEL-----------VPRTSNM-ADITGILGTKLP 545

Query: 326 YSTSEVIRALELFIASG---NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
           Y  + ++ A +    +G     L  ++ Y+YDL+D TRQ LA     ++ NI++ Y  ++
Sbjct: 546 YDPAVMVSAWKQLYHAGLQDTSLFNNSAYQYDLVDWTRQVLANAFIPIYKNIVDIYYNSN 605

Query: 383 AHGVFQLSR------RFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 436
                ++ R      +  +L+  +D +L+ +  F L  WL +A+  A +      +E+ A
Sbjct: 606 QTAGSRIQRLKAQGQQVTKLLLSLDLVLSSNRNFRLSTWLSAARSSAPSPAYVDSFEYEA 665

Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 481
           R QIT+W  + Q     L DY +K WSGL++ Y+  R  ++ +Y+
Sbjct: 666 RNQITLWGPSGQ-----LIDYASKAWSGLMKTYHLKRWQMFVEYL 705


>gi|322699924|gb|EFY91682.1| alpha-N-acetylglucosaminidase, putative [Metarhizium acridum CQMa
           102]
          Length = 775

 Score =  207 bits (528), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 156/565 (27%), Positives = 260/565 (46%), Gaps = 84/565 (14%)

Query: 5   HGWGGP--LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQL 62
             WGG   LP ++++ Q  LQKKI+ R+ ELG+ PVLPAF G VP +++ V P   +T  
Sbjct: 211 RSWGGKGDLPLAFIELQFELQKKIVARMVELGITPVLPAFPGFVPESIKKVRPDVNLTVS 270

Query: 63  GNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVD 122
            NWF+   D ++     LD  D  + E+ R F+ +Q+  +G  ++IY  D F+E +P   
Sbjct: 271 PNWFAPAPD-KYTRDLFLDPLDDTYAELQRLFVSKQMDAFGNVTNIYTLDQFNELSPASG 329

Query: 123 SPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLGK-LVVL 180
              Y+  +    Y+G+ + +  AVWL+QGWL FS   FW  P++ A L  V   + ++VL
Sbjct: 330 DTAYLRGIARNTYAGLTAANPAAVWLLQGWLFFSSRRFWTQPRIDAYLGGVEDDQGMLVL 389

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT----- 235
           DL++E  P W  +  + G P+IWC LH+F GN+ + G + ++   P++A     +     
Sbjct: 390 DLYSEANPQWQRTNSYSGKPWIWCQLHDFGGNMALEGRVQTLTSAPIDALAQSESLVGFG 449

Query: 236 ---------------------------TMAWINQYSVRRYG--RSVPA-IQDAWNVLYHT 265
                                      T  +   +  +RY    S+P+ +  AW +L   
Sbjct: 450 LTPEAYEGNEVVYDILLDQAWSATPLDTQTYFASWVTKRYAGVSSIPSELYRAWEMLRTD 509

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           VY+ T   TD  + V VA   + P++  +                   + T  + HP   
Sbjct: 510 VYSNTR--TDIPQ-VPVATYQLRPALSGIA------------------NRTGHFPHPTAL 548

Query: 326 YSTSEVIR-----ALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
           +    V++      LE     G+ L     ++ D +D++RQ L+   + L+ +++ AY+ 
Sbjct: 549 HYDPLVLQEAWKLMLEAMTRQGS-LWKVPAFQLDFVDVSRQMLSNQFDVLYADLVNAYKC 607

Query: 381 NDAHGVFQL------------SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 428
           + A G  +L              R L L+  +D  L     F L  W+++A    +    
Sbjct: 608 SAAGGSRELRSSAPSCDVEAAGARLLSLLSTLDLTLLTSRHFTLQSWVDAAGSWGKAAGN 667

Query: 429 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 488
           E  + +NAR+Q+T+W    Q +A+ L DY  K W GL+  YY  R +I+   ++ +  SG
Sbjct: 668 EDLFTFNARSQVTVW----QVDATNLNDYAAKAWGGLVGSYYKGRWSIFVDALVAASNSG 723

Query: 489 DGFRLKDWRREWIKLTNDWQNGRNV 513
                    RE      +WQ G + 
Sbjct: 724 -SLDEGALTRELEAFEAEWQAGEHA 747


>gi|429740221|ref|ZP_19273923.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
 gi|429153946|gb|EKX96707.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
          Length = 721

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 160/551 (29%), Positives = 241/551 (43%), Gaps = 103/551 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W  QQ+ LQ KIL R+  LGM+P+ PAF+G VP     + P  ++ 
Sbjct: 197 MGNLNTWNGPLSANWHSQQIALQHKILERMRLLGMHPITPAFAGFVPEGFVKLHPEVRVK 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
                F      +    Y+L    P F++IG+ FIE+  KE+ + ++ Y  D+F+E   P
Sbjct: 257 H----FEWGGFDKSLNAYMLPPDSPYFLQIGKLFIEEWEKEFSKNTY-YLSDSFNEMELP 311

Query: 121 VDSPE-------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSV 172
           V SP+        +S  G AIY  + +G+ +AVW+ QGW F Y   FW    ++ALL  V
Sbjct: 312 V-SPDDTDGKHRLLSKYGEAIYQSIVAGNPNAVWITQGWTFGYQHRFWDKESLQALLERV 370

Query: 173 PLGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDS---- 221
           P  KL+++DL  +        +  W T K FYG  +I   + NF G   + G L+     
Sbjct: 371 PNDKLIIVDLANDYPKWVWKTEQTWKTHKGFYGKRWILSYVPNFGGKTLLTGDLNLYASC 430

Query: 222 -------------IAFGPVEARTSENTT-------MAWINQ----------YSVRRYGRS 251
                        I FG        N         M W NQ          Y   RYG  
Sbjct: 431 SAEALAHPDKGRLIGFGSAPEGLENNEVVYELLADMGWQNQPIDLDHWLIEYCRSRYGSC 490

Query: 252 VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 311
             A+Q AW  L  +VY+                     S+ S     +Q     V  + +
Sbjct: 491 PNAMQKAWKGLCRSVYS---------------------SLYSYPRFTWQT----VIPDTL 525

Query: 312 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 371
            KS+   YD    ++      RA+E F+    +L  S  YR D +    Q +   A+ L+
Sbjct: 526 RKSK---YDFNDTYF------RAVEDFLLCAPQLKDSPLYRSDALLFAAQYIGAKADNLY 576

Query: 372 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
              ++A  + +     QL  + ++L+   D LLA H    L  W+++A+  A   ++  Q
Sbjct: 577 RKALQAKAVGNRARAKQLVDKVIQLLLQADKLLASHPTDRLSRWVDAARTAAATPQERMQ 636

Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF 491
           YE +A+  IT W         + +DY  +YWSGL++ YY PR  +YF             
Sbjct: 637 YEMDAKRLITSW-------GGIQQDYAARYWSGLIKTYYVPRIKLYF-------AGSKKK 682

Query: 492 RLKDWRREWIK 502
            L +W   W+K
Sbjct: 683 ELNNWEENWLK 693


>gi|294648124|ref|ZP_06725667.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
 gi|292636508|gb|EFF54983.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
          Length = 499

 Score =  205 bits (521), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 156/543 (28%), Positives = 244/543 (44%), Gaps = 93/543 (17%)

Query: 35  MNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAF 94
           M PVLPAF+G+VPA L+ ++P A I  LG W       R  C +L +  D LF +I + F
Sbjct: 1   MKPVLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLF 57

Query: 95  IEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 154
           +++Q K +G T HIY  D F+E  PP   PEY+  + + +Y+ + + D  A W+   W+F
Sbjct: 58  LDEQKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMF 116

Query: 155 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 213
            +D   W   +MKALL  VP  K+++LD   E   +W  ++ F+  PYIWC L NF GN 
Sbjct: 117 YFDKDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNT 176

Query: 214 EMYGI----------------------------LDSIAFGPVEARTSENTTM-----AWI 240
            + G                             LD + F P E    +   +      WI
Sbjct: 177 TLTGNVKESGARLENALINGGGNLKGIGSTLEGLDVMQF-PYEYILEKAWNLNVDDNKWI 235

Query: 241 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 300
              + R  G     ++DAW  L++ +Y              V  P         T G   
Sbjct: 236 ECLADRHVGCVSQPVRDAWKRLFNDIY--------------VQVP--------RTLGTLP 273

Query: 301 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDL 358
            Y +P   +   K  ++ Y +  L     EV R L       NE  +   + +R DLI +
Sbjct: 274 GY-RPALNKNSEKRTSNVYSNVELL----EVWRKL-------NEAPSDRRDAFRLDLITV 321

Query: 359 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 418
            RQ L  Y  ++ +      +  D   +     +  E++ D+D L A H    L  W++ 
Sbjct: 322 GRQVLGNYFLDVKMEFDRMVEAKDHQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDD 381

Query: 419 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
           A+++  + + +  YE NAR  IT W          L DY ++ W+GL+ DYY  R  +Y 
Sbjct: 382 ARKMGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYV 434

Query: 479 KYMIESLESG---DGFRLKDWRRE----WIKLTNDWQNGRNVYPVESNGDALIT-SQWLY 530
              I++ E G   D  +L+D  +E    W+  T+     ++V+   S  D L++ S +L+
Sbjct: 435 NTFIKAAEEGVEVDQKQLEDELKEIEEGWVNATDRKDTRKDVH---STTDGLLSFSTFLF 491

Query: 531 NKY 533
           +KY
Sbjct: 492 SKY 494


>gi|393783265|ref|ZP_10371440.1| hypothetical protein HMPREF1071_02308 [Bacteroides salyersiae
           CL02T12C01]
 gi|392669544|gb|EIY63032.1| hypothetical protein HMPREF1071_02308 [Bacteroides salyersiae
           CL02T12C01]
          Length = 723

 Score =  204 bits (520), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 155/554 (27%), Positives = 248/554 (44%), Gaps = 70/554 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL   W   Q+ LQ +I+ R+ ELGM P+ PAF+G VP A     P  K  
Sbjct: 199 MGNLNTWDGPLSDEWQKSQIELQHQIINRMRELGMQPIAPAFAGFVPMAFAEKHPDIKFK 258

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            L  W            Y+L    P F EIG+ F+++  KE+G+ ++ Y  D+F+E   P
Sbjct: 259 HL-KWGGFDDK---FNAYVLPPDSPFFEEIGKRFVKEWEKEFGKNTY-YLSDSFNEMELP 313

Query: 121 VDSPEY------ISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           V   +       ++  G +IY  + +G+ DA+W+ QGW F Y   FW    ++ALL+ VP
Sbjct: 314 VAKDDVEGKHKLLAQYGESIYRSITAGNPDAIWVTQGWTFGYQHDFWDKASLQALLSHVP 373

Query: 174 LGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +        +  W     FYG  +I+  + NF G   + G L   A   
Sbjct: 374 DDKMIIIDLGNDYPKWVWGTEQTWKVHDGFYGKKWIFSYVPNFGGKTPLTGDLQMYATSS 433

Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNC--TDGATDKNRDVIVAF 284
            EA  + +         ++  +G +   +++   V+Y  + +   TD A D  + +    
Sbjct: 434 AEALKAPS-------HGNLIGFGSAPEGLENN-EVVYELLADMGWTDQAIDPEQWM---- 481

Query: 285 PDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST--------------SE 330
               PS  +   G Y    K   +     + +S Y +P   + T               +
Sbjct: 482 ----PSYCTARYGAYPESMKNAWELFRKTAYSSLYSYPRFTWQTVIPDQRRISKIDVSDD 537

Query: 331 VIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLS 390
            +  +ELF+AS + L+ S  Y  D I+     +A  A++L+   +           +Q  
Sbjct: 538 FLHGIELFLASADSLNRSKLYVNDAIEFASYYIAAQADKLYKQALTEDTAGKPVAAYQHL 597

Query: 391 RRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEE 450
            + ++L+ ++D LLA H  + L  W+E A+       ++  YE NA+  IT W       
Sbjct: 598 NQAIDLLLNVDKLLASHPLYRLEEWVELARNSGTTPAEKDAYEANAKRLITTW------- 650

Query: 451 ASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNG 510
                DY  ++WSGL++DYY PR  +YF     S + GD   L +W  EWI+    W N 
Sbjct: 651 GGFQEDYAARFWSGLIKDYYIPRLKLYF-----SKQRGD---LDNWEEEWIR--TPWHN- 699

Query: 511 RNVYPVESNGDALI 524
               P E   D  I
Sbjct: 700 -TTTPFEKPLDMAI 712


>gi|224026593|ref|ZP_03644959.1| hypothetical protein BACCOPRO_03350 [Bacteroides coprophilus DSM
           18228]
 gi|224019829|gb|EEF77827.1| hypothetical protein BACCOPRO_03350 [Bacteroides coprophilus DSM
           18228]
          Length = 635

 Score =  203 bits (517), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 144/476 (30%), Positives = 211/476 (44%), Gaps = 78/476 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGPLP+SW+D+ +VL ++I+ R  ELGM P+   FSG VP  L+  +P AKI 
Sbjct: 196 MQNLQSYGGPLPKSWIDKHVVLGQQIIRRELELGMKPIQQGFSGYVPRELKEKYPEAKIQ 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
              +W   K   +      LD TD LF  IGR F+E++ K +G    +Y  D F E+ PP
Sbjct: 256 PQPSWCGFKGAAQ------LDPTDSLFQVIGRDFLEEEKKLFG-AHGVYAADPFHESRPP 308

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           VD+PEY+S++G +I++  Q  D  ++W MQ W        R P +KA    VP   L++L
Sbjct: 309 VDTPEYLSAVGRSIHTLFQEFDPYSLWAMQAWSL------REPIVKA----VPEEHLLIL 358

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DL        +     +G P +   LHNF G I M+G L  +A    EA  S +  +   
Sbjct: 359 DLNGSK---CTQRNACWGYPVVAGNLHNFGGRINMHGDLPLLAGNQYEAAVSLSPNVCGS 415

Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                          W+ +Y++RRYG        A  +L    Y
Sbjct: 416 GLFMEGIEQNPLYYELAFEMPLQKGKVELDGWLKEYALRRYGSKWENTHKALLLLLEGPY 475

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
                 T+ +  +I A     P++     G     G P                    YS
Sbjct: 476 RPGTNGTELS-SIIAA----RPALHVKKSGPNAGLGIP--------------------YS 510

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
              +I A    +     L  S  YR+D++DL RQ +      +     +A++  D  G  
Sbjct: 511 PWLLIEAQAFMLKDAGILKTSEAYRFDIMDLQRQIMTNLGQAIHKEAAKAFEAGDEKGFE 570

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
             SRR+LEL+ D+D LL     F    WL  A+     EE++ Q+E NA   +T+W
Sbjct: 571 LHSRRYLELLTDVDTLLRTRPEFNFDRWLADARSWGDTEEEKNQFERNATALVTIW 626


>gi|282881077|ref|ZP_06289764.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella timonensis CRIS
           5C-B1]
 gi|281304881|gb|EFA96954.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella timonensis CRIS
           5C-B1]
          Length = 688

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 146/522 (27%), Positives = 231/522 (44%), Gaps = 79/522 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GWGGP+ Q  +D +  LQ KIL R+ +LG+ PV+  F G VP+ L + +P A + 
Sbjct: 164 MGNLEGWGGPMSQQMIDDRYKLQIKILRRMRQLGIEPVVQGFPGIVPSFLHDKYPKACVV 223

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W   +         +L     LF  + +A+ +   + YG     +  D F E  N 
Sbjct: 224 SQGKWNGFQRPS------ILLPQSQLFYCMAKAYYDNMKRYYGTDLRYFGGDLFHEGGNA 277

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             VD    +SS  + +   M S   DA W++QGW  +  P        ALL  +    ++
Sbjct: 278 KGVD----LSSTASKVQKCMLSHFPDAKWVLQGWNGNPSP--------ALLAGLDKKHVL 325

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN---- 234
           +++L  E+   W  S +F   P+IW  +++F G  +M G L  +   P  A  +      
Sbjct: 326 LINLAGEIDASWKQSDEFGQTPWIWGSVNHFGGKTDMGGQLPVLVEQPHRALAASQHGRL 385

Query: 235 --------------------TTMAW----------INQYSVRRYGRSVPAIQDAWNVLYH 264
                                  AW          + QY   RYG     +  AW +L  
Sbjct: 386 KGLGILPEGIHTNPVVYDLALQTAWSDTVPSVDHLLRQYIWYRYGTWNDDLYRAWQLLAS 445

Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           +VY                            EG Y++     ++ ++  S  S++    +
Sbjct: 446 SVYG---------------------EFEVKGEGTYESVF--CARPSLHVSSVSTWGPKKM 482

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
            Y   ++++AL LF  +      S TY YDL+DL RQ +A  A  ++  ++ AY   D+ 
Sbjct: 483 QYQPEKLLQALVLFRKAAVHFKGSETYEYDLVDLARQVMANNARNVYNQVVHAYNEKDSL 542

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
            + + S  FL L++  D LL+ +  FLLG WL++A+Q  +NE+ ++Q   NART I+ W 
Sbjct: 543 ALNRYSSTFLHLIDLQDSLLSTNKFFLLGKWLQAARQYGENEQDQRQALVNARTLISYW- 601

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 486
               +  + L DY NK W+GLL+ YY PR   +F  +   L 
Sbjct: 602 -GPDDATTRLHDYANKEWAGLLKQYYAPRWRAFFAMLAGQLR 642


>gi|210611122|ref|ZP_03288736.1| hypothetical protein CLONEX_00926, partial [Clostridium nexile DSM
            1787]
 gi|210152109|gb|EEA83116.1| hypothetical protein CLONEX_00926 [Clostridium nexile DSM 1787]
          Length = 1662

 Score =  202 bits (514), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 156/577 (27%), Positives = 253/577 (43%), Gaps = 102/577 (17%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M+NL G+GGP+  SW  ++  L +K  + + +LGM PVL  +SG VP  + +  PSA++ 
Sbjct: 685  MANLSGFGGPVHDSWFTERTELARKNQLIMRKLGMQPVLQGYSGMVPVDITDKDPSAQVI 744

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
            + G W S +         +L      F +  + F + Q + YG  S  Y  D F E  NT
Sbjct: 745  KQGTWCSFQRPS------MLKTDSETFDKYAQLFYKVQKEVYGDVSDYYATDPFHEGGNT 798

Query: 119  PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK-- 176
              + SP  I+     + + M   D + +W++Q W        +     ALL  +   +  
Sbjct: 799  GGM-SPTVIAE---KVLANMMEADENGIWIIQSW--------QGNPSTALLQGLDAARDH 846

Query: 177  LVVLDLFAEVKPIWSTSK-----------QFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
             +VLDL+AE  P W+ +            +F   P+++CML+NF G + ++G +++   G
Sbjct: 847  ALVLDLYAEKTPHWNETDPGSYGGAEGGGEFLNTPWVYCMLNNFGGRLGLHGHIENFVNG 906

Query: 226  PVEARTSENTTMA-------------------------------------WINQYSVRRY 248
              +A    N                                         W   Y+ RRY
Sbjct: 907  VAQAAAQANHMAGIGITPEASVNNPVLYDLFFETIWSDDGENLSAINLDEWFKDYTTRRY 966

Query: 249  GRSVPAIQDAWNVLYHTVYNCTDGATDKN--RDVIVAFPDVDPSIISVTEGKYQNYGKPV 306
            G    +  +A  +L  TVYN       +     V+ A P +D        G    +G  V
Sbjct: 967  GAESQSAYEAMQILNDTVYNPEMNMKGQGAPESVVNARPGLDI-------GAASTWGNAV 1019

Query: 307  SKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 366
                             + Y  +E+ +A  L +   ++L  S  Y+YDL ++  Q L+  
Sbjct: 1020 -----------------IDYDKAELEKAAALLLKDYDKLKDSAGYQYDLANVLEQVLSNT 1062

Query: 367  ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 426
            A E    + +A++  DA    ++S  FLE++  ++ +    + F+LG WLESAK LA+N 
Sbjct: 1063 AQEYQKKMADAFREGDAEKFEKMSNSFLEIITKVEEVTGTQEEFMLGTWLESAKALAKNA 1122

Query: 427  EQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
            +   ++ YE NAR  IT W    Q  +  L DY N+ WSGL  DYY PR   + K++ E 
Sbjct: 1123 DDFTKELYELNARGLITTWGSIEQANSGGLIDYSNRQWSGLTSDYYKPR---WEKWIAER 1179

Query: 485  LESGDGFRLKDWR-REWIKLTNDWQNGRNVYPVESNG 520
             +   G   K++   +W ++   W    N YP ++NG
Sbjct: 1180 KKELAGEESKNYSAADWFEMEWAWARSNNEYPTKANG 1216


>gi|160914140|ref|ZP_02076362.1| hypothetical protein EUBDOL_00149 [Eubacterium dolichum DSM 3991]
 gi|158433951|gb|EDP12240.1| hypothetical protein EUBDOL_00149 [Eubacterium dolichum DSM 3991]
          Length = 2150

 Score =  202 bits (514), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 141/514 (27%), Positives = 242/514 (47%), Gaps = 81/514 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ +GGPLP  W +Q++ L +K+  R+   G++PV+  F G VP +       A +T
Sbjct: 387 MQNLYSFGGPLPDDWFEQRVELGRKMHDRMQAFGIDPVIQGFCGQVPMSFVEKNEGAVLT 446

Query: 61  QLGNWFSVKSDPRWCCTYLLD-----ATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
            +  W S  + P    TYL            F ++ + F E+Q   +G  S  Y  D F 
Sbjct: 447 PIDEWPSF-TRPAMIKTYLSQEEIAAGKKDYFKDVAKTFYEKQKNVFGDVSDYYASDPFH 505

Query: 116 E--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP 173
           E  NT  +D    ++++   +   M   ++DA+W+MQ W  + D      ++  L   V 
Sbjct: 506 EGGNTQGLD----VTNIFKTVQEEMLKSNADAIWVMQQWQGNLD----HAKLSGL---VK 554

Query: 174 LGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS- 232
             + + LDL +++ P  S+  +  G+ +IWCMLHNF G + + G ++ IA  P  A ++ 
Sbjct: 555 PEQALALDLQSDMNP--SSVMENEGISWIWCMLHNFGGRMGLDGEVEVIAKEPAIAASNN 612

Query: 233 -------------ENTTM------------------AWINQYSVRRYGRSVPAIQDAWNV 261
                        EN+ +                  AW+++Y+ RR G S  ++Q+AW++
Sbjct: 613 QYMKGIGITPEALENSPIVYEMLFDMTWSKDPIDYQAWVDKYATRRAGGSSDSLQEAWDM 672

Query: 262 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
           L  T Y          +D  + +     ++I+   G   N+           S  S++ H
Sbjct: 673 LLETAY----------KDKGIYYQGAGETVINARPG--TNF-----------SSASTWGH 709

Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
            ++ Y   E+ + L L I + +  +AS  YRYDL D+  Q L   A E    +++A    
Sbjct: 710 SNILYDKEELDKVLSLLIENYDAFAASEAYRYDLADVAEQVLCNAAIEYHALMVQALNNK 769

Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQ 439
           D+    ++S  FLEL++  D +L   + F+LG W+  A+++  N +   +  +E+NAR  
Sbjct: 770 DSAEFKRISTHFLELIDLSDRILGSSEEFMLGTWIHDAREMLDNADDWTKDLFEFNARAV 829

Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
           +T W     E +  L+DY N+ W+GL   +Y  R
Sbjct: 830 VTTW---GGERSGSLKDYSNRKWAGLTSSFYKER 860


>gi|126347839|emb|CAJ89559.1| putative alpha-N-acetylglucosaminidase [Streptomyces ambofaciens
           ATCC 23877]
          Length = 740

 Score =  202 bits (513), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 144/556 (25%), Positives = 234/556 (42%), Gaps = 77/556 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGP+ +  L+ +  L ++I  R+ +LGM PVLP + G VP       P   + 
Sbjct: 213 MQNMSGFGGPVSERLLEDRADLGRRIADRLRQLGMTPVLPGYYGTVPPGFTERNPVGPVV 272

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G+W   +  P W     LD    +F  +  AF   Q + +G TS +Y  D   E   P
Sbjct: 273 PQGDWVGFER-PDW-----LDPRSAVFPRVAAAFYRHQRELFG-TSTMYKMDLLHEGGRP 325

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            + P  +     A+   +Q+    AVW + GW  +       P  + +++++   +L+++
Sbjct: 326 GNVP--VRDAAQAVMKALQTARPGAVWTLIGWQNN-------PSTQ-IIDAIDKRRLLIV 375

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG------------------ILDSI 222
           D  ++          ++G PY +  + NF G+  M                     L  I
Sbjct: 376 DGLSDRYDGLDREATWHGAPYAFGTIPNFGGHTTMGANTAVWAERFDQWRTKAGSALAGI 435

Query: 223 AFGP---------------VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
           A+ P               +  RT       W  +Y+ RRYG + P    AW +L    Y
Sbjct: 436 AYMPEGTGGNPVAYELFTELAWRTEPVDQRKWFAEYAQRRYGGADPHAASAWELLRSGPY 495

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           +   G   +++D +                         ++  +  +  +S+    + Y 
Sbjct: 496 STPSGTWSESQDSLF-----------------------TARPRLTATNAASWSPGAMRYD 532

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
              V RAL   +     L A++ YR+DL+D+ RQ LA  +  L   I  AY   D     
Sbjct: 533 PGTVRRALTELVRVAPALRATDAYRFDLVDVARQVLANRSRTLLPQIKAAYDAEDLPRFR 592

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
             +  +   +  +D LLA    FLLGPWLE AK   + E +    E++AR+ +T W   +
Sbjct: 593 ARAAEWKNCLSLLDRLLATDARFLLGPWLEDAKSWGRTEAERAAAEFDARSILTTWGHRS 652

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
             +A  LRDY N+ WSGL+ D+Y  R   Y   +  +L +G      D    W  L +DW
Sbjct: 653 GSDAGGLRDYANREWSGLVSDFYAMRWTKYLDSLDTALVTGRPPVAID----WFALEDDW 708

Query: 508 QNGRNVYPVESNGDAL 523
              R+ YPV  +GD +
Sbjct: 709 NRQRDGYPVRPSGDPV 724


>gi|365104185|ref|ZP_09333846.1| hypothetical protein HMPREF9428_02927 [Citrobacter freundii
           4_7_47CFAA]
 gi|363644798|gb|EHL84079.1| hypothetical protein HMPREF9428_02927 [Citrobacter freundii
           4_7_47CFAA]
          Length = 1049

 Score =  202 bits (513), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 150/555 (27%), Positives = 242/555 (43%), Gaps = 74/555 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  +GGPLPQSW  Q+  L +KI  R+   G+ PV P F+G VP       P A++ 
Sbjct: 361 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 420

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W      P    TY+    D  F ++   + +     +G  SH Y  D F E    
Sbjct: 421 DQGEWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQTLKTTFGDISHYYAVDPFHEGGNR 478

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D    +  +   + + M   D DAVW++Q W        +     A LN +     ++L
Sbjct: 479 ADLD--MVKVAQTVQNKMLEHDKDAVWIIQNW--------QENPTDAFLNGLKKDHALIL 528

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------FG 225
           DL+A+ KP  +   +F   P+IW MLH F G +   G+ + +A                G
Sbjct: 529 DLYADNKPNHAMRHEFSNTPWIWNMLHAFGGRMGFSGMPEVLAQEIPQSLAESKKMKGVG 588

Query: 226 PVEARTSEN-----------------TTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 268
                   N                 ++ A+I+ +   RYG   P I+ AW+++  T Y+
Sbjct: 589 VTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWLTSRYGAQSPEIEQAWDIMVKTAYH 648

Query: 269 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 328
                 D+ R         + SII    G    +G  V++     +    YD        
Sbjct: 649 RR---KDRQR--------AEDSIIDAKPG----FG--VTRACTYYTALIDYDK------- 684

Query: 329 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 388
           +E  + L L+++  +   A+  Y++DL+D+TRQ LA  + E +    +A+   D     Q
Sbjct: 685 AEFEKILPLYLSVYDHFKANPAYQHDLVDITRQVLANASYEYYRAFEDAWIAKDYSAFNQ 744

Query: 389 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL--AQNEEQEKQYEWNARTQITMWFDN 446
           LS +FL L++  D +L+    F+LG W+ SA+ +    ++    Q+E+NAR  +T W   
Sbjct: 745 LSGKFLRLIKLQDQVLSTRPEFMLGTWINSARTMLDGMDDWTRDQFEFNARAMVTTWGTE 804

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
              +A  LRDY N+ W GL  D+Y  R A + + +  +  +G   +    +  W  L   
Sbjct: 805 QAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQALKSAAATGQ--KQDAIKVNWFPLEYR 861

Query: 507 WQN-GRNVYPVESNG 520
           W N   N YP + +G
Sbjct: 862 WVNQSGNGYPTQPSG 876


>gi|296115989|ref|ZP_06834611.1| alpha-N-acetylglucosaminidase [Gluconacetobacter hansenii ATCC
           23769]
 gi|295977458|gb|EFG84214.1| alpha-N-acetylglucosaminidase [Gluconacetobacter hansenii ATCC
           23769]
          Length = 758

 Score =  201 bits (512), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 150/579 (25%), Positives = 252/579 (43%), Gaps = 91/579 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  +GGP+P+  ++++ V  ++I+ R+ ELGM PVLP F G VP      FP A + 
Sbjct: 222 MANMCCYGGPVPRELIEKRAVSAQQIIGRMRELGMRPVLPGFYGMVPDDFGKRFPQAHVI 281

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W   +  P W     LD  DP+F ++   + ++Q K +G  + +Y+   F E   P
Sbjct: 282 GQGEWNRFRR-PAW-----LDPRDPMFAKVAAIYYDEQKKLFG-DAPVYDIQPFQEGGTP 334

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  ++  G  I   + +    A+W++  W    D          +L  V   +L ++
Sbjct: 335 GDVP--LADAGQGIQKALDTAHPGAMWMLMAWYEEPD--------ARMLAGVDRKRLFIV 384

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG---PVEARTSENT-- 235
           DL    +   +    F G P+++  L +F G   + G   S  +G   P   RT +N   
Sbjct: 385 DLEQNTRVRENRDADFQGAPFLYGGLWDFGGRTSLGG--SSYDYGVRLPGLWRTQKNMIG 442

Query: 236 -----------------------------TMAWINQYSVRRYGRSVPA-IQDAWNVLYHT 265
                                        T  W   Y+ RRYG+      + AW++L H+
Sbjct: 443 TAVFPEGMDNNPYIFDLFTEAAWRRDGVDTTQWTRDYADRRYGQPGDVHARKAWDLLLHS 502

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKP-VSKEAVLKSETSSYDHPHL 324
            +                         S      Q++G+   + +++  ++ S   H   
Sbjct: 503 AF-------------------------SYRATGIQDFGEASAAPDSLFNAQPSLDTHSAA 537

Query: 325 W-------YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEA 377
           W       Y    V  A+   + + +   A+  YRYDL+D+TRQA+A  A  +   I +A
Sbjct: 538 WNGMKVLPYDPHLVEAAMAELLQASDATRATEAYRYDLVDVTRQAVANQARAMLPQIGDA 597

Query: 378 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNAR 437
           +   D   +  L+ R+LEL++  D LLA +  F +G WL   +  + +  Q K  +++AR
Sbjct: 598 FAARDRAKLHALTTRWLELMDRQDSLLATNTFFRVGTWLSWPQAWSDDPAQRKLMDYDAR 657

Query: 438 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR 497
             +T W   T  +   LRDY NK W+GL +DYY  R  ++F  +  SL +G   R  D  
Sbjct: 658 VILTNWGGRTASQVGHLRDYANKDWAGLTKDYYRVRWQLFFDSLETSLATGRPPREID-- 715

Query: 498 REWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 536
             W K+  +W +   VY     GD+   ++ +++   QG
Sbjct: 716 --WYKVGEEWCHNGRVYSPTPEGDSYTVARDIHDYLTQG 752


>gi|315500594|ref|YP_004089396.1| Alpha-N-acetylglucosaminidase [Asticcacaulis excentricus CB 48]
 gi|315418606|gb|ADU15245.1| Alpha-N-acetylglucosaminidase [Asticcacaulis excentricus CB 48]
          Length = 765

 Score =  201 bits (511), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 154/572 (26%), Positives = 231/572 (40%), Gaps = 103/572 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  P+PQ+W+ ++  LQ +IL R+ ELGM P+LPAF G VP A     P A+I 
Sbjct: 202 MGNIEGYLAPVPQAWIQKKHKLQSRILGRMKELGMTPILPAFGGYVPKAFAQKHPQARIY 261

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W           TY LD  DPLF +I   FI    + YG   + Y  D+F+E  PP
Sbjct: 262 PMRPWEGFHE------TYWLDPADPLFAKIAARFIALYTETYGEGRY-YLADSFNEMLPP 314

Query: 121 VD-----------------------------SPEYISSLGAAIYSGMQSGDSDAVWLMQG 151
           +                                E +++ G AIY  ++    DAVW MQG
Sbjct: 315 ISHDGSDVKNAKYGDSTANTKETETVVDPAVKAERLAAYGKAIYDSIRQARPDAVWTMQG 374

Query: 152 WLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHNF 209
           WLF  D  FW P  + A L  VP  KL++LD+  +  P +W +S  F G P+I+  +HN+
Sbjct: 375 WLFGADKHFWTPDAIGAFLRDVPQDKLMILDIGNDRYPGVWQSSNAFQGKPWIYGYVHNY 434

Query: 210 AGNIEMYGIL-----------------DSIAFGPVEARTSEN-----------------T 235
             +  +YG L                 D   FG        N                 T
Sbjct: 435 GASNPVYGDLGFYRDDIRGLLARKDTGDLKGFGLFPEGLHNNSIVYEYAYDLAWGQANQT 494

Query: 236 TMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVT 295
              W+  Y   RYG+  PA+  AW+      ++    +               P      
Sbjct: 495 VTEWLTTYLKSRYGQVTPALILAWSTYVEAAFSTRYWS---------------PRWWRSK 539

Query: 296 EGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 355
            G Y    +P +     +       HP       ++ RA++  + S      S  YR+D+
Sbjct: 540 AGAYLLCKRPTADMVEFEG------HPG---DRKKLRRAIDALL-SLKGFGGSALYRHDV 589

Query: 356 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 415
           ID  R  +++  ++  +  ++AY+  D      L    + LV  +D L+       L  W
Sbjct: 590 IDAVRHLVSEEIDDRLIAAMKAYKSGDVKTGDGLREEVIALVTQVDTLMGAQPD-TLASW 648

Query: 416 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 475
           ++ A       E++  Y  NA+ Q+T+W          L DY +K W GL +D+Y PR  
Sbjct: 649 IDEASAYGDTSEEKAYYVMNAKAQVTVWGGKGN-----LNDYASKAWQGLYKDFYLPRWM 703

Query: 476 IYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
                +  S   G  F  K + RE I     W
Sbjct: 704 KLLAALRASASGGAPFDQKTFTRELIDWEQAW 735


>gi|393785795|ref|ZP_10373941.1| hypothetical protein HMPREF1068_00221 [Bacteroides nordii
           CL02T12C05]
 gi|392661414|gb|EIY55000.1| hypothetical protein HMPREF1068_00221 [Bacteroides nordii
           CL02T12C05]
          Length = 724

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 149/549 (27%), Positives = 234/549 (42%), Gaps = 102/549 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL   W + Q+ LQ +I+ R+ ELGM+P+ PAF+G VP A     P  K  
Sbjct: 199 MGNLNTWDGPLSDEWQEGQIQLQHQIINRMRELGMSPIAPAFAGFVPMAFAEKHPDIKFK 258

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            L  W            Y+L    P F EIG+ F+++  KE+G+ ++ Y  D+F+E   P
Sbjct: 259 HL-KWGGFDDK---FNAYVLPPDSPFFEEIGKRFVKEWEKEFGKNTY-YLSDSFNEMELP 313

Query: 121 VDSPEY------ISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           V   +       ++  G +IY  + +G+ DA+W+ QGW F Y   FW    ++ALL+ VP
Sbjct: 314 VAKDDVEGKHKLLAQYGESIYRSITAGNPDAIWVTQGWTFGYQHSFWDKASLQALLSHVP 373

Query: 174 LGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +        +  W     FYG  +I+  + NF G   M G L   A   
Sbjct: 374 DDKMIIIDLGNDYPKWVWGTEQTWKVHDGFYGKKWIFSYVPNFGGKTPMTGDLQMYASSS 433

Query: 227 VEARTSEN------------------------TTMAWINQ----------YSVRRYGRSV 252
            EA  SE+                          M W +Q          Y + RYG   
Sbjct: 434 AEALQSESHGNLIGFGSAPEGLENNEVVYELLADMGWTDQAIDLDKWMPSYCMARYGAYP 493

Query: 253 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 312
             ++DAW++   T Y+            ++  PD                     K  + 
Sbjct: 494 ETMKDAWDLFRKTAYSSLYSYPRFTWQTVI--PD---------------------KRRIS 530

Query: 313 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 372
           K + S            + +  +ELF+ S + L  S  Y  D I+     +A  A++L+ 
Sbjct: 531 KIDVSD-----------DFLHGVELFLNSADSLKNSKLYVNDAIEFASYYIAAKADKLYG 579

Query: 373 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 432
             +    +  +    Q   + ++++ ++D LLA H  + L  W+  A+       ++  Y
Sbjct: 580 KALAEDTVGRSAVAQQYLNQTIDMLLNVDKLLASHPLYRLEEWVNFARNSGTTPAEKDAY 639

Query: 433 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 492
           E NA+  IT W            DY  ++WSGL++DYY PR  IYF     SL+      
Sbjct: 640 EINAKRLITTW-------GGFQEDYAARFWSGLIKDYYIPRLKIYFSKQRGSLD------ 686

Query: 493 LKDWRREWI 501
             +W  EWI
Sbjct: 687 --NWEEEWI 693


>gi|291086028|ref|ZP_06354661.2| alpha-N-acetylglucosaminidase family protein [Citrobacter youngae
           ATCC 29220]
 gi|291069185|gb|EFE07294.1| alpha-N-acetylglucosaminidase family protein [Citrobacter youngae
           ATCC 29220]
          Length = 1014

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 150/555 (27%), Positives = 241/555 (43%), Gaps = 74/555 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  +GGPLPQSW  Q+  L +KI  R+   G+ PV P F+G VP       P A++ 
Sbjct: 326 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 385

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G+W      P    TY+    D  F ++   + +     +G  SH Y  D F E    
Sbjct: 386 DQGDWVGFVRPPM-LRTYVKQGAD-YFSKVADVYYQTLKTTFGDISHYYAVDPFHEGGNR 443

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D    +  +   + + M   D DAVW++Q         W+     A LN +     ++L
Sbjct: 444 ADLD--MVKVAQTVQNKMLEHDKDAVWIIQN--------WQENPTDAFLNGLKKDHALIL 493

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------FG 225
           DL+A+ KP  +   +F   P+IW MLH F G +   G+ + +A                G
Sbjct: 494 DLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSGMPEVLAQEIPQSLAESKYMKGVG 553

Query: 226 PVEARTSEN-----------------TTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 268
                   N                 ++ A+I+ +   RYG   P I+ AW+++  T Y+
Sbjct: 554 VTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWLTSRYGAQSPEIEQAWDIMVKTAYH 613

Query: 269 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 328
                 D+ R         + SII    G    +G  V++     +    YD        
Sbjct: 614 RR---KDRQR--------AEDSIIDAKPG----FG--VTRACTYYTALIDYDK------- 649

Query: 329 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 388
           +E  + L L+++  +    +  Y++DL+D+TRQ LA  + E +    +A+   D     Q
Sbjct: 650 AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLANASYEYYRAFEDAWMAKDYSAFNQ 709

Query: 389 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL--AQNEEQEKQYEWNARTQITMWFDN 446
           LS +FL L++  D +L     F+LG WL SA+ +    ++    Q+E+NAR  +T W   
Sbjct: 710 LSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTMLDGMDDWTRDQFEFNARAMVTTWGIE 769

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
              +A  LRDY N+ W GL  D+Y  R A + + +  +  +G   +    +  W  L   
Sbjct: 770 QAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQALKNAAATGQ--KQDAIKVNWFPLEYR 826

Query: 507 WQNGR-NVYPVESNG 520
           W N   N YP + +G
Sbjct: 827 WVNQTGNGYPTQPSG 841


>gi|225875033|ref|YP_002756492.1| alpha-N-acetylglucosaminidase [Acidobacterium capsulatum ATCC
           51196]
 gi|225793771|gb|ACO33861.1| alpha-N-acetylglucosaminidase [Acidobacterium capsulatum ATCC
           51196]
          Length = 800

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 154/558 (27%), Positives = 243/558 (43%), Gaps = 78/558 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +  P+ +S LD+++   ++I+ R+ ELG+ PV P + G VP       P A + 
Sbjct: 241 MGNLCCFDEPISRSLLDRRIRSAQQIIRRLRELGITPVFPGYFGMVPEDFARRHPGAHVI 300

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             GNW   +  P W     LD  DPLF  +  +F + Q + +G +S IY+ + F E    
Sbjct: 301 PQGNWNGFRR-PAW-----LDPRDPLFAAVAASFYKHQQELFGDSS-IYDIELFQEGGSA 353

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  +SS   AI   +      A+W+   W    +P       +ALL++V    L+V+
Sbjct: 354 ADVP--VSSAAKAIQKALLRAHPQAMWMTLAW--QNNP------SRALLSAVDRSHLLVV 403

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA----------------- 223
           D+     P  +  + F G  Y++  L +F G   +   L   A                 
Sbjct: 404 DIDQGRTPHENRERDFMGAAYLFGGLWDFGGRTTLGANLYDYAVRLPRMGLRAGSTMKGT 463

Query: 224 ----------------FGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                           F  +  RTS      W  +Y+ RRYG   P  + AW +L  T Y
Sbjct: 464 ALFSEGLDNNPAAFDLFTEMAWRTSPVDLRTWSREYARRRYGMDDPHTRRAWRILMETAY 523

Query: 268 NC-TDGATDKN-RDVIV-AFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
               DG ++   RD    +  D  PS+ +V+                     SS+    L
Sbjct: 524 GTRADGVSNHGERDAPPESLFDAQPSLDAVS--------------------ASSWSPDRL 563

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
            Y   +   AL   + +   +    TY+YDL+D+ RQ LA ++ +    I +AY      
Sbjct: 564 RYDPKKFEAALTELLQAPPGMREMPTYQYDLVDVARQTLANWSRKTLPEIKDAYDHRHEA 623

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
               L +++L ++   D LLA +  F++GPWL +    A    ++++ +++AR+ +T W 
Sbjct: 624 RFETLEKQWLCMMMLQDKLLATNTSFMVGPWLNAVSPWAATATEQRRLDYDARSILTTWG 683

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
           + T  EA  LRDYGNK W+GL RDYY  R  IYF  +  SL++G      D    W    
Sbjct: 684 NRTASEAG-LRDYGNKDWAGLTRDYYYRRWQIYFNDLDRSLKTGTPPHPID----WFAFG 738

Query: 505 NDWQNGRNVYPVESNGDA 522
             W   +  Y  ++ GD+
Sbjct: 739 EKWNRAQTHYATQARGDS 756


>gi|424795356|ref|ZP_18221218.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
 gi|422795515|gb|EKU24196.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
          Length = 1105

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 153/542 (28%), Positives = 229/542 (42%), Gaps = 103/542 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W++ +  LQ++IL R+  LGM PVLPAF+G VP A     P A+I 
Sbjct: 164 MGNIEGYDAPLPQQWIEDKHALQQRILQRMRALGMKPVLPAFAGYVPKAFAQAHPQARIY 223

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF +I + FI+   + YG+ ++ Y  D F+E  PP
Sbjct: 224 RMRAWEGFHE------TYWLDPADPLFAKIAQRFIQLYDRTYGKGTY-YLADAFNEMLPP 276

Query: 121 VDS----------------------PEY--------ISSLGAAIYSGMQSGDSDAVWLMQ 150
           + +                      PE         ++  G A+Y  +   + DAVW+MQ
Sbjct: 277 IAADGSDARLASYGDSTANTANTAPPEVSPAQRDKRLADYGRALYESIHRANPDAVWVMQ 336

Query: 151 GWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW P  + A L  VP  KL+VLD+  +  P  W  S  F G  +I+  +HN
Sbjct: 337 GWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDRYPGTWKLSDAFDGKQWIYGYVHN 396

Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
           + G+  +YG L                   + FG        N+ +              
Sbjct: 397 YGGSNPVYGDLAFYRDDLRALLADKDKQQLVGFGAFPEGLHTNSVVYEYMYALAWGGQQR 456

Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
               W+  Y+  RYG S PA++ AW+ L  +V +                    P     
Sbjct: 457 SLQDWLGDYTRARYGHSSPALRAAWDDLQASVLSTR---------------YWTPRWWRS 501

Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
             G Y  + +P     + + E +  D P L        RAL+  +A   E + +  YRYD
Sbjct: 502 RAGAYLLFKRPTLD--IGEFEGAPGDPPRL-------RRALDQLLALAPEYADAPLYRYD 552

Query: 355 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 414
           L+D  R       +      + AY+  D         R    V  +DGL+       L  
Sbjct: 553 LVDFARHYATGRVDTQLQQALAAYKRGDVAAGDAAFARVQAAVRQLDGLVGGQQE-TLSS 611

Query: 415 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 474
           WL++A+  A+  +    Y  +A+ Q+++W          L DY +K W G+  DYY PR 
Sbjct: 612 WLDAAEGDAKTPQDAAYYRRDAKAQVSVWGGEGN-----LGDYASKAWQGMYADYYLPRW 666

Query: 475 AI 476
           A+
Sbjct: 667 AL 668


>gi|433678127|ref|ZP_20510026.1| alpha-N-acetylglucosaminidase [Xanthomonas translucens pv.
           translucens DSM 18974]
 gi|430816763|emb|CCP40478.1| alpha-N-acetylglucosaminidase [Xanthomonas translucens pv.
           translucens DSM 18974]
          Length = 691

 Score =  198 bits (504), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 160/592 (27%), Positives = 241/592 (40%), Gaps = 112/592 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PL Q W++ +  LQ++IL R+  LGM PVLPAF G VP A     P A+I 
Sbjct: 127 MGNIEGYDAPLQQQWIEDKHALQQRILQRMRTLGMKPVLPAFVGYVPKAFAQAHPQARIY 186

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF +I   FI+   + YG+ ++ Y  D F+E  PP
Sbjct: 187 RMRAWEGFHE------TYWLDPADPLFAKIALRFIQLYDRTYGKGTY-YLADAFNEMLPP 239

Query: 121 VDS----------------------PEY--------ISSLGAAIYSGMQSGDSDAVWLMQ 150
           + +                      PE         ++  G A+Y  +   + DAVW+MQ
Sbjct: 240 IAADGSDARLASYGDSTANTAKTAPPEVSPAQRDKRLADYGRALYESIHRANPDAVWVMQ 299

Query: 151 GWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW P  + A L  VP  KL+VLD+  +  P  W  S  F G  +I+  +HN
Sbjct: 300 GWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDRYPGTWKLSDAFDGKQWIYGYVHN 359

Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
           + G+  +YG L                   + FG       +N+ +              
Sbjct: 360 YGGSNPVYGDLAFYRDDLRALLADKDKQQLVGFGAFPEGLHDNSVVYEYMYALAWGGQQR 419

Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
               W+  Y   RYG + PA++ AW+ L   V +                    P     
Sbjct: 420 SLQDWLGDYIRARYGHTSPALRAAWDDLQAAVLSTRYWT---------------PRWWRS 464

Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
             G Y  + +P     + + E +  D P L        RAL+  +A   E + +  YRYD
Sbjct: 465 RAGAYLLFKRPTLD--IGEFEGAPGDPPRL-------RRALDQLLALAPEYADAPLYRYD 515

Query: 355 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 414
           L+D  R       +      + AY+  D         R    V+ +DGL+       L  
Sbjct: 516 LVDFARHYATGRVDAQLQQAVAAYRRGDVAAGDAAFARVQAAVQQLDGLVGGQQE-TLSS 574

Query: 415 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 474
           WL  A+  A+  +    Y  +A+ Q+++W          L DY +K W G+  DYY PR 
Sbjct: 575 WLGDAEGDAKTPQDAAYYRRDAKAQVSVWGGEGN-----LGDYASKAWQGMYADYYLPRW 629

Query: 475 AIYFKYMIESLESGDGF--------RLKDWRREWIKLTNDWQNGRNVYPVES 518
           A+  +  + +   G G         RL+ W  +W+K    +       PV +
Sbjct: 630 ALAMQ-ALRAAAVGSGSVDEAALQQRLRAWELDWVKRETPYTRQAPADPVAA 680


>gi|210631701|ref|ZP_03296968.1| hypothetical protein COLSTE_00853, partial [Collinsella stercoris
           DSM 13279]
 gi|210159960|gb|EEA90931.1| F5/8 type C domain protein, partial [Collinsella stercoris DSM
           13279]
          Length = 1906

 Score =  198 bits (504), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 154/535 (28%), Positives = 241/535 (45%), Gaps = 73/535 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+  GGPLP SW +Q++ L ++I  R+   G++PV+  F G VP   Q   P++   
Sbjct: 354 MQNLYSVGGPLPDSWFEQRVELARRIHDRMQTYGIDPVIQGFGGQVPTDFQQKNPNSVAA 413

Query: 61  QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
             G+W S  + P    TYL DA      +  F ++G  F E Q + +G+ SH Y  D F 
Sbjct: 414 SSGSW-SGFARPYMIKTYLTDADRAAGKEDYFQKVGTTFYEAQERIFGKVSHFYAVDPFH 472

Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
           E    V     I  +   +   M   D  AVW+MQ W +  D          L       
Sbjct: 473 EGG-TVPQGFNIVDIYRTVQQKMLDYDPQAVWVMQQWQWGIDE-------NKLSGLAKKE 524

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS--- 232
           + +VLDL ++++   ++  +   VP++W MLHNF G + M G+ + +A    +A  S   
Sbjct: 525 QSLVLDLQSDLRS-QASPMENQQVPWVWNMLHNFGGRMGMDGVPEVLAIKIPQAYNSNRY 583

Query: 233 -----------ENTTM------------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
                      +N+ +                  AW   Y  RRYG +   IQ+AW++L 
Sbjct: 584 MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRAWTRSYIERRYGGTDAKIQEAWDILL 643

Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
            T Y   DG           +     SI++          +P   +       S++ H  
Sbjct: 644 DTAYKHVDGEY---------YQGASESIMNA---------RPSDNKI---GSASTWGHSD 682

Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
           + Y   E  RA +LFI S +    S  +RYD +D+ RQ LA    E      +AY+  DA
Sbjct: 683 IDYDKKEFERAAQLFIESYDTYKDSEAFRYDFVDVMRQVLANAFQEYQPLAGDAYKQRDA 742

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQIT 441
                L+ + LE+++  D +L+    F+LG W+E+A+ L ++ +      +E NAR+ IT
Sbjct: 743 ERFELLANQMLEMLDAQDRMLSTSSDFMLGTWIENARTLLEDADDWTADLFELNARSLIT 802

Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
            W    ++  SL+ DY N+ WSGL   YY PR   +     ++LE G   +  +W
Sbjct: 803 TW--GLEKNGSLI-DYSNRQWSGLTGSYYKPRWESWANARKKALEDGGSAQDLNW 854


>gi|432896403|ref|ZP_20107613.1| hypothetical protein A13U_00343 [Escherichia coli KTE192]
 gi|433031274|ref|ZP_20219108.1| hypothetical protein WIA_04388 [Escherichia coli KTE109]
 gi|431432398|gb|ELH14169.1| hypothetical protein A13U_00343 [Escherichia coli KTE192]
 gi|431538475|gb|ELI14460.1| hypothetical protein WIA_04388 [Escherichia coli KTE109]
          Length = 1049

 Score =  198 bits (503), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 148/555 (26%), Positives = 240/555 (43%), Gaps = 74/555 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  +GGPLPQSW  Q+  L +KI  R+   G+ PV P F+G VP       P A++ 
Sbjct: 361 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 420

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G+W      P    TY+    D  F ++   + +     +G  SH Y  D F E    
Sbjct: 421 DQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQTLKTTFGDISHYYAVDPFHEGGNR 478

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D    +  +   + + M   D +AVW++Q W        +       LN +     ++L
Sbjct: 479 ADLD--MVKVAQTVQNKMLEHDKNAVWIIQNW--------QENPTDDFLNDLKKDHALIL 528

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------FG 225
           DL+A+ KP  +   +F   P+IW MLH F G +   G+ + +A                G
Sbjct: 529 DLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSGMQEVLAQEIPQSLAESKYMKGVG 588

Query: 226 PVEARTSEN-----------------TTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 268
                   N                 ++ A+I+ +   RYG   P I+ AW+++  T Y+
Sbjct: 589 VTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWLTSRYGAQSPEIEQAWDIMVKTAYH 648

Query: 269 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 328
                 D+ R         + SII    G    +G  V++     +    YD        
Sbjct: 649 RR---KDRQR--------AEDSIIDAKPG----FG--VTRACTYYTALIDYDK------- 684

Query: 329 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 388
           +E  + L L+++  +    +  Y++DL+D+TRQ LA  + E +    +A+   D     Q
Sbjct: 685 AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLANASYEYYRAFEDAWMAKDYSAFNQ 744

Query: 389 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL--AQNEEQEKQYEWNARTQITMWFDN 446
           LS +FL L++  D +L     F+LG WL SA+ +    ++    Q+E+NAR  +T W   
Sbjct: 745 LSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTMLDGMDDWTRDQFEFNARAMVTTWGTE 804

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
              +A  LRDY N+ W GL  D+Y  R A + + +  +  +G   +    +  W  L   
Sbjct: 805 QAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQTLKSAAATGQ--KQDAIKVHWFPLEYR 861

Query: 507 WQNGR-NVYPVESNG 520
           W N   N YP + +G
Sbjct: 862 WVNQTGNGYPTQPSG 876


>gi|355706271|gb|AES02588.1| N-acetylglucosaminidase [Mustela putorius furo]
          Length = 333

 Score =  198 bits (503), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 129/372 (34%), Positives = 185/372 (49%), Gaps = 83/372 (22%)

Query: 142 DSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVP 200
           D DAVWL+QGWLF + P FW P Q++A+L +VP G+L++LDLFAE +P++  +  F+G P
Sbjct: 3   DPDAVWLLQGWLFQHQPQFWGPAQVRAVLGAVPRGRLLILDLFAESQPVYLRTASFHGQP 62

Query: 201 YIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM----------------------- 237
           +IWCMLHNF GN  ++G L+++  GP  AR   N+TM                       
Sbjct: 63  FIWCMLHNFGGNHGLFGALEAVNQGPAAARLFPNSTMVGTGMAPEGIGQNEVVYALMAEL 122

Query: 238 -----------AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFP 285
                      AW+  ++ RRYG      + AW +L  +VYNC+ +  T  NR  +V   
Sbjct: 123 GWRKDPVADLEAWVTSFAARRYGVDSKETEVAWRLLLGSVYNCSGEACTGHNRSPLVR-- 180

Query: 286 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 345
              PS+  VT                            +WY+ S V  A  L +A+   L
Sbjct: 181 --RPSLQMVTT---------------------------VWYNRSAVFEAWRLLLAAAPTL 211

Query: 346 SASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVED 399
           + S T+RYDL+D+TRQA  +  +  +     AY       L  A G+        EL+  
Sbjct: 212 AKSPTFRYDLLDVTRQAAQELVSLYYTEARTAYLNKELVPLMRAAGIL-----VYELLPA 266

Query: 400 MDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGN 459
           +DG+LA    FLLG WLE A+ +A +E   + YE N R Q+T+W      E ++L DY N
Sbjct: 267 LDGVLASDSRFLLGTWLEQARAVAVSETDARFYEQNGRYQLTLW----GPEGNIL-DYAN 321

Query: 460 KYWSGLLRDYYG 471
           K  +GL+  YY 
Sbjct: 322 KQLAGLVAGYYA 333


>gi|62318937|dbj|BAD94027.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
          Length = 182

 Score =  197 bits (502), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 94/181 (51%), Positives = 128/181 (70%), Gaps = 1/181 (0%)

Query: 355 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 414
           ++DLTRQ L+K AN+++   + A+   D   + QLS +FLEL++DMD LLA  D  LLG 
Sbjct: 1   MVDLTRQVLSKLANQVYTEAVTAFVKKDIGSLGQLSEKFLELIKDMDVLLASDDNCLLGT 60

Query: 415 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 474
           WLESAK+LA+N ++ KQYEWNARTQ+TMW+D+     S L DY NK+WSGLL DYY PRA
Sbjct: 61  WLESAKKLAKNGDERKQYEWNARTQVTMWYDSNDVNQSKLHDYANKFWSGLLEDYYLPRA 120

Query: 475 AIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRN-VYPVESNGDALITSQWLYNKY 533
            +YF  M++SL     F+++ WRREWI +++ WQ   + VYPV++ GDAL  S+ L +KY
Sbjct: 121 RLYFNEMLKSLRDKKIFKVEKWRREWIMMSHKWQQSSSEVYPVKAKGDALAISRHLLSKY 180

Query: 534 L 534
            
Sbjct: 181 F 181


>gi|331660873|ref|ZP_08361805.1| alpha-N-acetylglucosaminidase family protein [Escherichia coli
           TA206]
 gi|422369309|ref|ZP_16449711.1| f5/8 type C domain protein [Escherichia coli MS 16-3]
 gi|315298924|gb|EFU58178.1| f5/8 type C domain protein [Escherichia coli MS 16-3]
 gi|331051915|gb|EGI23954.1| alpha-N-acetylglucosaminidase family protein [Escherichia coli
           TA206]
          Length = 1052

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 148/555 (26%), Positives = 240/555 (43%), Gaps = 74/555 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  +GGPLPQSW  Q+  L +KI  R+   G+ PV P F+G VP       P A++ 
Sbjct: 364 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 423

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G+W      P    TY+    D  F ++   + +     +G  SH Y  D F E    
Sbjct: 424 DQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQTLKTTFGDISHYYAVDPFHEGGNR 481

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D    +  +   + + M   D +AVW++Q W        +       LN +     ++L
Sbjct: 482 ADLD--MVKVAQTVQNKMLEHDKNAVWIIQNW--------QENPTDDFLNGLKKDHALIL 531

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------FG 225
           DL+A+ KP  +   +F   P+IW MLH F G +   G+ + +A                G
Sbjct: 532 DLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSGMQEVLAQEIPQSLAESKYMKGVG 591

Query: 226 PVEARTSEN-----------------TTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 268
                   N                 ++ A+I+ +   RYG   P I+ AW+++  T Y+
Sbjct: 592 VTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWLTSRYGAQSPEIEQAWDIMVKTAYH 651

Query: 269 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 328
                 D+ R         + SII    G    +G  V++     +    YD        
Sbjct: 652 RR---KDRQR--------AEDSIIDAKPG----FG--VTRACTYYTALIDYDK------- 687

Query: 329 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 388
           +E  + L L+++  +    +  Y++DL+D+TRQ LA  + E +    +A+   D     Q
Sbjct: 688 AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLANASYEYYRAFEDAWMAKDYSAFNQ 747

Query: 389 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL--AQNEEQEKQYEWNARTQITMWFDN 446
           LS +FL L++  D +L     F+LG WL SA+ +    ++    Q+E+NAR  +T W   
Sbjct: 748 LSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTMLDGMDDWTRDQFEFNARAMVTTWGTE 807

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
              +A  LRDY N+ W GL  D+Y  R A + + +  +  +G   +    +  W  L   
Sbjct: 808 QAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQTLKSAAATGQ--KQDAIKVHWFPLEYR 864

Query: 507 WQNGR-NVYPVESNG 520
           W N   N YP + +G
Sbjct: 865 WVNQTGNGYPTQPSG 879


>gi|161505009|ref|YP_001572121.1| hypothetical protein SARI_03139 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:- str. RSK2980]
 gi|160866356|gb|ABX22979.1| hypothetical protein SARI_03139 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:-]
          Length = 1014

 Score =  197 bits (500), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 146/555 (26%), Positives = 242/555 (43%), Gaps = 74/555 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  +GGPLPQSW  Q+  L +KI  R+   G+ PV P F+G VP       P A++ 
Sbjct: 326 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 385

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G+W      P    TY+    D  F ++   + +     +G  SH Y  D F E    
Sbjct: 386 DQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQTLKTTFGDISHYYAVDPFYEGGNR 443

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D    +  +   + + M   D DAVW++Q         W+     A LN +     ++L
Sbjct: 444 ADL--NMVKVAQTVQNKMLEHDKDAVWIIQN--------WQENPTDAFLNGLKKDHALIL 493

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------FG 225
           DL+A+ KP  +   +F   P+IW MLH F G +   G+ + +A                G
Sbjct: 494 DLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSGMPEVLAQEIPQSLAESKYMKGVG 553

Query: 226 PVEARTSEN-----------------TTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 268
                   N                 ++ A+I+ +   RYG   P I+ AW+++  T Y+
Sbjct: 554 VTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHNWLTSRYGAQSPEIEQAWDIMVKTAYH 613

Query: 269 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 328
                 D+ R         + SII    G    +G          +   +Y +  + Y  
Sbjct: 614 RR---KDRQR--------AEDSIIDAKPG----FG---------VTRACTYYNALIDYDK 649

Query: 329 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 388
           +E  + L L+++  +    +  Y++DL+D+TRQ LA  + E +    +A+   D     Q
Sbjct: 650 AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLANASYEYYRAFEDAWMAQDYSAFNQ 709

Query: 389 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL--AQNEEQEKQYEWNARTQITMWFDN 446
           LS +FL L++  D +L+    F+LG W+ +++ +    ++    Q+E+NAR  +T W   
Sbjct: 710 LSGKFLRLIKLQDKVLSTRPEFMLGNWINNSRTMLDGMDDWTRDQFEFNARAMVTTWGTE 769

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
              +A  LRDY N+ W GL  D+Y  R A + + +  +  +G   +    +  W  L   
Sbjct: 770 QAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQALKTAAATGQ--KQDAIKVSWFPLEYR 826

Query: 507 WQNGR-NVYPVESNG 520
           W N   N YP + +G
Sbjct: 827 WVNQTGNGYPTQPSG 841


>gi|421734750|ref|ZP_16173809.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum LMG 13195]
 gi|407077324|gb|EKE50171.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum LMG 13195]
          Length = 1919

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 152/537 (28%), Positives = 240/537 (44%), Gaps = 77/537 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+  GGPLP +W +Q++ L ++I  R+   G+ PV+  F G VPA  Q   P++   
Sbjct: 362 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 421

Query: 61  QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
             G W      P    TYL DA      +  F ++G  F + Q   +G+ S+ Y  D F 
Sbjct: 422 SSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVGDTFYKAQENVFGKVSNYYAVDPFH 480

Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
           E     D  + I  +   +   M   D  AVW+MQ W +  D      ++  L +    G
Sbjct: 481 EGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 532

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS--- 232
           + +VLDL ++++   +++ +  GVP++W MLHNF G + + G+ + I+    +A  S   
Sbjct: 533 QTLVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 591

Query: 233 -----------ENTTM------------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
                      +N+ +                  +W  +Y+ RRYG +   I+ AW++L 
Sbjct: 592 MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDILL 651

Query: 264 HTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
            T Y  TDG      +  +I A P  D +I S                       S++ H
Sbjct: 652 DTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWGH 688

Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
             + Y   +  +A  LF  + +    S  +RYD +D+ RQ LA    E      +AY+  
Sbjct: 689 SDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKSG 748

Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQ 439
           D      LS R L++++  D LL+  D FL+G W++ A+ +    +      +E NAR  
Sbjct: 749 DLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTMLDGADDWTADLFELNARAL 808

Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
           +T W  N  +  SL+ DY N+ W+GL  DYY  R   Y    +  LE G  F   DW
Sbjct: 809 VTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWKTYVDNRLNKLEHGTDFTDPDW 862


>gi|313140918|ref|ZP_07803111.1| alpha-N-acetylglucosaminidase family protein [Bifidobacterium
           bifidum NCIMB 41171]
 gi|313133428|gb|EFR51045.1| alpha-N-acetylglucosaminidase family protein [Bifidobacterium
           bifidum NCIMB 41171]
          Length = 2005

 Score =  196 bits (497), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 152/537 (28%), Positives = 240/537 (44%), Gaps = 77/537 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+  GGPLP +W +Q++ L ++I  R+   G+ PV+  F G VPA  Q   P++   
Sbjct: 448 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 507

Query: 61  QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
             G W      P    TYL DA      +  F ++G  F + Q   +G+ S+ Y  D F 
Sbjct: 508 SSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGDTFYKAQESVFGKVSNYYAVDPFH 566

Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
           E     D  + I  +   +   M   D  AVW+MQ W +  D      ++  L +    G
Sbjct: 567 EGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 618

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS--- 232
           + +VLDL ++++   +++ +  GVP++W MLHNF G + + G+ + I+    +A  S   
Sbjct: 619 QALVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 677

Query: 233 -----------ENTTM------------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
                      +N+ +                  +W  +Y+ RRYG +   I+ AW++L 
Sbjct: 678 MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDILL 737

Query: 264 HTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
            T Y  TDG      +  +I A P  D +I S                       S++ H
Sbjct: 738 DTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWGH 774

Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
             + Y   +  +A  LF  + +    S  +RYD +D+ RQ LA    E      +AY+  
Sbjct: 775 SDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKSG 834

Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQ 439
           D      LS R L++++  D LL+  D FL+G W++ A+ +    +      +E NAR  
Sbjct: 835 DLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTMLDGADDWTADLFELNARAL 894

Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
           +T W  N  +  SL+ DY N+ W+GL  DYY  R   Y    +  LE G  F   DW
Sbjct: 895 VTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWKTYVDNRLNKLEHGTDFTDPDW 948


>gi|311064845|ref|YP_003971571.1| beta-N-hexosaminidase [Bifidobacterium bifidum PRL2010]
 gi|310867165|gb|ADP36534.1| Beta-N-hexosaminidase [Bifidobacterium bifidum PRL2010]
          Length = 1923

 Score =  195 bits (496), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 152/537 (28%), Positives = 240/537 (44%), Gaps = 77/537 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+  GGPLP +W +Q++ L ++I  R+   G+ PV+  F G VPA  Q   P++   
Sbjct: 366 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 425

Query: 61  QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
             G W      P    TYL DA      +  F ++G  F + Q   +G+ S+ Y  D F 
Sbjct: 426 SSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGDTFYKAQENVFGKVSNYYAVDPFH 484

Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
           E     D  + I  +   +   M   D  AVW+MQ W +  D      ++  L +    G
Sbjct: 485 EGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 536

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS--- 232
           + +VLDL ++++   +++ +  GVP++W MLHNF G + + G+ + I+    +A  S   
Sbjct: 537 QALVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 595

Query: 233 -----------ENTTM------------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
                      +N+ +                  +W  +Y+ RRYG +   I+ AW++L 
Sbjct: 596 MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDILL 655

Query: 264 HTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
            T Y  TDG      +  +I A P  D +I S                       S++ H
Sbjct: 656 DTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWGH 692

Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
             + Y   +  +A  LF  + +    S  +RYD +D+ RQ LA    E      +AY+  
Sbjct: 693 SDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKSG 752

Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQ 439
           D      LS R L++++  D LL+  D FL+G W++ A+ +    +      +E NAR  
Sbjct: 753 DLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTMLDGADDWTADLFELNARAL 812

Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
           +T W  N  +  SL+ DY N+ W+GL  DYY  R   Y    +  LE G  F   DW
Sbjct: 813 VTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWKTYVDNRLNKLEHGTDFTDPDW 866


>gi|118370728|ref|XP_001018564.1| alpha-N-acetylglucosaminidase precursor [Tetrahymena thermophila]
 gi|89300331|gb|EAR98319.1| alpha-N-acetylglucosaminidase precursor [Tetrahymena thermophila
           SB210]
          Length = 879

 Score =  195 bits (496), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 166/551 (30%), Positives = 254/551 (46%), Gaps = 93/551 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL G+GGP+ Q+++D Q  LQKKIL R+  LGM P+L  F G VP +L+  FP +KI 
Sbjct: 218 MGNLEGYGGPVTQAYIDGQYNLQKKILKRMRNLGMQPILQGFYGMVPNSLKAKFPLSKIY 277

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN--T 118
              +W   +          LDA D LF  I   F  +  K YGR +  Y  D F E    
Sbjct: 278 GDQSWLGFRRPA------FLDANDELFSNIANIFYSESEKLYGR-AKFYGGDPFHEGAIV 330

Query: 119 PPVDSPEYISSLGAAIYSGMQ----SGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
           P ++    ++S   +IY  MQ      D    W++Q W  +      P Q   LL  +  
Sbjct: 331 PGLN----LTSQAQSIYRAMQYTDNPKDEKVKWILQSWQEN------PSQQ--LLQGLQN 378

Query: 175 GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN 234
            + ++LDL AE +  W T+  F G  ++W  L NF   I  YG+++     P  A + +N
Sbjct: 379 DECIILDLMAEARSKWQTND-FSGHDFLWTSLPNFGLRIGQYGMIEQYVSQPPLAYSIKN 437

Query: 235 TTM-----------------------AWI--NQ----------------YSVRRYG-RSV 252
           +TM                       AWI  NQ                +   RYG ++ 
Sbjct: 438 STMKGIGSIPEGILTNVLDYEILFDKAWIQPNQDTNLTPRQQVLQYLGDFIRYRYGEQNN 497

Query: 253 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA-- 310
             +  AW++L +++YN T+     +  V++A P      +S     Y  +      EA  
Sbjct: 498 KNLFSAWSLLTNSIYNSTNPWDGPSESVMLARPASYIDKVSSWGTSYIYWNTTNVLEAWK 557

Query: 311 ----VLKSETSSYDHPHLWYSTSEVIRAL-------ELFIA-SGNELSA--SNTYRYDLI 356
                +K +       HL     E+ + L       E F+  S NE      +T+ YDL+
Sbjct: 558 LFTNYVKEKKQKNRSQHL-QKLEEINKKLGRSDDDMEAFVEISQNEERNIFKDTFLYDLV 616

Query: 357 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 416
           D+ RQ LA Y+  L+  ++ A+   D       S++FLEL++D D LL+    F+LG +L
Sbjct: 617 DVARQNLASYSYLLYNKVMLAFNQTDTIKFALYSQQFLELIKDQDQLLSSRKEFMLGYYL 676

Query: 417 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 476
           ES  +L   +++++ +    + QIT+W D      S L DY NK W+G+L+D+Y PR  +
Sbjct: 677 ESVSKLGTTDQEKQNFIEQIKRQITVWSDF----PSDLHDYANKEWNGILKDFYLPRWEL 732

Query: 477 YFK----YMIE 483
           YFK    Y++E
Sbjct: 733 YFKSLQSYIVE 743


>gi|289667570|ref|ZP_06488645.1| N-acetylglucosaminidase [Xanthomonas campestris pv. musacearum
           NCPPB 4381]
          Length = 798

 Score =  195 bits (495), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 162/575 (28%), Positives = 243/575 (42%), Gaps = 113/575 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W+D + VLQ++IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQQQILTRMRELGMQPVLPAFAGYVPKAFAQAHPHARIY 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG T   Y  D F+E  PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFMELYTQAYG-TGEFYLADAFNEMLPP 313

Query: 121 -------VDSPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
                  V + +Y  S+                       G A+Y  +   +  A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+P  + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 374 GWLFGADREFWQPQAIAAFLGKVPDARLLVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433

Query: 209 FAGNIEMYG------------ILDS-----IAFGPVEARTSENTTM-------------- 237
           +  +  +YG            + DS       FG        N+ +              
Sbjct: 434 YGASNPLYGDFAFYRQDLQALLADSEKRNLRGFGIFPEGLHSNSVVYEYLYALAWEGPQQ 493

Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
               W+ QY   RYGRS  A+  AW+ L   +Y     +               P   + 
Sbjct: 494 PWSQWLTQYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRWWNK 538

Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
             G Y  + +P +   ++K +    D   L        RA++  +      + +  YRYD
Sbjct: 539 RAGAYLLFKRPTAD--IVKFDDRPGDPQRL-------RRAIDALLQQAERYADAPLYRYD 589

Query: 355 LIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGFLLG 413
           LI+  R  L+  A+     +++AY   D A G  QL+ R  +LV+ +D L+       L 
Sbjct: 590 LIEDARHYLSLQADRQLQAVVQAYNAGDFARGDVQLA-RITQLVQGLDALVGGQHE-TLA 647

Query: 414 PWLESAKQLAQNEEQ-EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 472
            W   A   A N+    + Y  NAR Q+++W  +       L DY +K W G+  D+Y  
Sbjct: 648 DWTGQAAAAAGNDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFYLQ 702

Query: 473 RAAIYFKYMIESLESGDGF-------RLKDWRREW 500
           R   +      + ++G  F       +L  W R W
Sbjct: 703 RWTRFLSAYRAARKAGTPFEAAAVDQQLATWERHW 737


>gi|390937398|ref|YP_006394957.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum BGN4]
 gi|389891011|gb|AFL05078.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum BGN4]
          Length = 1957

 Score =  195 bits (495), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 152/537 (28%), Positives = 239/537 (44%), Gaps = 77/537 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+  GGPLP +W +Q++ L ++I  R+   G+ PV+  F G VPA  Q   P++   
Sbjct: 400 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 459

Query: 61  QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
             G W      P    TYL DA      +  F ++G  F + Q   +G+ S+ Y  D F 
Sbjct: 460 SSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGDTFYKAQENVFGKVSNYYAVDPFH 518

Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
           E     D  + I  +   +   M   D  AVW+MQ W +  D      ++  L +    G
Sbjct: 519 EGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 570

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS--- 232
           + +VLDL ++++   ++  +  GVP++W MLHNF G + + G+ + I+    +A  S   
Sbjct: 571 QALVLDLQSDLRS-QASPMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 629

Query: 233 -----------ENTTM------------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
                      +N+ +                  +W  +Y+ RRYG +   I+ AW++L 
Sbjct: 630 MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDILL 689

Query: 264 HTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
            T Y  TDG      +  +I A P  D +I S                       S++ H
Sbjct: 690 DTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWGH 726

Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
             + Y   +  +A  LF  + +    S  +RYD +D+ RQ LA    E      +AY+  
Sbjct: 727 SDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKSG 786

Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQ 439
           D      LS R L++++  D LL+  D FL+G W++ A+ +    +      +E NAR  
Sbjct: 787 DLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTMLDGADDWTADLFELNARAL 846

Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
           +T W  N  +  SL+ DY N+ W+GL  DYY  R   Y    +  LE G  F   DW
Sbjct: 847 VTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWKTYVDNRLNKLEHGTDFTDPDW 900


>gi|169351448|ref|ZP_02868386.1| hypothetical protein CLOSPI_02228 [Clostridium spiroforme DSM 1552]
 gi|169291670|gb|EDS73803.1| LPXTG-motif cell wall anchor domain protein [Clostridium spiroforme
           DSM 1552]
          Length = 1990

 Score =  194 bits (493), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 152/562 (27%), Positives = 253/562 (45%), Gaps = 77/562 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  +GG LP +W ++++ L +K+  R+   G+ PVL  FSG VP   ++ +   +  
Sbjct: 360 MQNMTSYGGKLPNNWFEERVELARKMHDRMQTYGITPVLSGFSGQVPTNFKDKYQDVQYV 419

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G+W   +  P    TY+ +     F ++   F + Q   +G  ++IY  D F E    
Sbjct: 420 AQGSWCGYER-PDMLRTYVDNGGTDYFSQMADVFYKAQRDIFGDVTNIYAVDPFHEGGKI 478

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK--LV 178
            D     + +   +   M   D DA+WL+Q W  S      P ++      + L K  ++
Sbjct: 479 GDMN--YTKVYETVQKKMMENDEDAIWLIQEWSGSIAS--NPSKL------INLDKEHVI 528

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
           VLDLF+EV P  +++ +    P+IW MLHNF G + +    + ++   P   + SE    
Sbjct: 529 VLDLFSEVSP-RNSALEAADTPWIWNMLHNFGGRMGLDANPEKVSQNIPNTYQNSEHMVG 587

Query: 234 ---------NTTMA------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                    N+ MA                  W   Y+ R YG +   I++ WN+L  T 
Sbjct: 588 IGMTPEAIENSPMAYELLWDMTWTKDPIDFRQWCQDYAKRIYGGTNEDIEEVWNILLDTG 647

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           YN  D            +     S+I+          +P +      +  SS+ H  + Y
Sbjct: 648 YNRKDNY----------YQGAPESVIN---------ARPTTN----FTSASSWGHSTINY 684

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
              E+ RA+ L   + +E   S  + YDL D+TRQ ++  A E    ++ AYQ  +    
Sbjct: 685 DKEELERAVYLMAKNYDEFKDSPAFIYDLSDITRQLISNSAQEYHKAMVNAYQAGNLSEF 744

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
             LS +FLE++   D +L+ +  FL+G W+E A+ + ++ +   +  +E+NAR  IT W 
Sbjct: 745 EVLSDKFLEMILLQDQILSTNSDFLVGKWIEQARTMIEDSDDWTKDLFEFNARDLITTWG 804

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW---RREWI 501
                    LRDY N+ W+GL +DYY PR   +   + E++++       DW     EW 
Sbjct: 805 GLKNANGGGLRDYSNRQWAGLTKDYYYPRWQKWINDVKEAMKNNTAVPSTDWFLMEWEWA 864

Query: 502 KLTNDWQNGRNVYPVESNGDAL 523
            L +D  N    Y +E++  AL
Sbjct: 865 NLKSDEGNE---YSIEASNLAL 883


>gi|288927801|ref|ZP_06421648.1| putative alpha-N-acetylglucosaminidase
           (N-acetyl-alpha-glucosaminidase) (NAG) [Prevotella sp.
           oral taxon 317 str. F0108]
 gi|288330635|gb|EFC69219.1| putative alpha-N-acetylglucosaminidase
           (N-acetyl-alpha-glucosaminidase) (NAG) [Prevotella sp.
           oral taxon 317 str. F0108]
          Length = 723

 Score =  194 bits (493), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 152/550 (27%), Positives = 238/550 (43%), Gaps = 102/550 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++G  GPL   W   Q+ LQ KIL R+ +L M+P+ P F+G VP AL+ ++P+A I 
Sbjct: 198 MGNIYGIDGPLSNQWHQDQIALQHKILDRMRKLDMHPICPGFAGFVPEALKELYPTADI- 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT-- 118
           Q   W     +      Y+L   DPLF +IG  FI++  KE+GR    Y  D+F+E    
Sbjct: 257 QYTTWEKAFHN------YILSPADPLFHKIGVMFIQEWEKEFGRCD-FYLIDSFNEMDIP 309

Query: 119 -PPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVP 173
            PP D P   E+++  G  +Y  ++  +  A W+MQGW+F Y P  W    + AL++ VP
Sbjct: 310 FPPKDDPKRYEFMADFGKKVYQCIKEANPSATWVMQGWMFGYQPEIWDYKTLNALVSQVP 369

Query: 174 LGKLVVLDLFAEV-KPIWSTS------KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K+++LDL A+  K +W T       K F G  +I+ ++ N  G   + G LD  A G 
Sbjct: 370 DNKMIMLDLAADYNKFLWKTPFNWDFYKGFCGKQWIYSVIPNMGGKSALTGALDFYAKGH 429

Query: 227 VEARTSENT----------------------------------TMAWINQYSVRRYGRSV 252
           +EA  S+N                                      W+  Y+  RYG   
Sbjct: 430 LEALNSQNRGKLIGFGFAPEGIENNEVVYELLCDAGWAKQGVELRPWLRNYTYSRYGCYP 489

Query: 253 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 312
             ++  WN +  +VY                             G ++++ +   +    
Sbjct: 490 IGMEQYWNEMIQSVY-----------------------------GSFKSHPRFNWQFRPG 520

Query: 313 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 372
           K +  S D  + +Y   E++  + L    GN+L     +  D  ++    L      L  
Sbjct: 521 KEKYGSVDLDNHFYHAVEIMAGM-LSQMKGNKL-----FEADFKEMAANYLGGKVEILVR 574

Query: 373 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 432
            I +AY+  D     QL  RF  L+  MD +L  H    +  W++ A+    +  +   Y
Sbjct: 575 QIDKAYESQDTINANQLETRFYRLMTGMDLVLQGHPTKDMQKWIDYARARGVSYNKADCY 634

Query: 433 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 492
           E NAR  +T+W          + DY  + W+GL+RDYY PR   YF        SG  F 
Sbjct: 635 ESNARRIVTVW-------GPPIDDYSARIWAGLIRDYYLPRWKHYF----NQKRSGKPFD 683

Query: 493 LKDWRREWIK 502
              W  ++++
Sbjct: 684 FSTWELDFVE 693


>gi|293402122|ref|ZP_06646261.1| alpha-N-acetylglucosaminidase family protein [Erysipelotrichaceae
           bacterium 5_2_54FAA]
 gi|291304514|gb|EFE45764.1| alpha-N-acetylglucosaminidase family protein [Erysipelotrichaceae
           bacterium 5_2_54FAA]
          Length = 2295

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 144/535 (26%), Positives = 247/535 (46%), Gaps = 88/535 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ +GGPLP +W +Q+  L +K+  R+   G++PV+  FSG VP       P+A IT
Sbjct: 398 MQNLYSYGGPLPDNWFEQRTELARKMHDRMQTYGISPVVQGFSGQVPDNFDKKQPTALIT 457

Query: 61  QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
           ++ +W    + P     Y+ ++      + L+ ++ + F + Q   +G  ++ Y  D F 
Sbjct: 458 EMKDWVGY-TRPSIIQPYITESDAAKGKENLYPQVAKDFYDAQKNVFGNVTNYYATDPFH 516

Query: 116 ENTPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP 173
           E   P  +D  E        + + M   +  AVW+MQ W  + D          L   V 
Sbjct: 517 EGGNPSGLDFAETFKQ----VQTEMLKANEKAVWVMQQWQGNLDA-------TKLSGLVK 565

Query: 174 LGKLVVLDLFAEVKP---IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
             + + LDL  ++ P   +   S+     P++WCMLHNF G + M G L ++A  P  A 
Sbjct: 566 PSQALALDLQTDLNPQNGVMENSE----TPWLWCMLHNFGGRMGMDGNLPNVAKNPAIAM 621

Query: 231 TS--------------ENTTMA------------------WINQYSVRRYGRSVPAIQDA 258
                           EN+ +A                  WI +Y+ RR G +   +Q+A
Sbjct: 622 NESKYMKGIGITPEALENSPVAYELLFDMTWTKDPIDEDAWIAKYAQRRAGGTSEKLQEA 681

Query: 259 WNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSS 318
           W +L  T Y    GA  +      ++     +II+ T                 +S  S+
Sbjct: 682 WKILNETAY----GAKQE------SYQGAAETIINAT------------PRDSFRS-AST 718

Query: 319 YDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 378
           + H ++ Y   E  +AL+L I + ++  AS  YRYDL D+  Q L   A E    +++A 
Sbjct: 719 WGHSNITYDKKEFEKALQLLIDNYDDFKASPAYRYDLADVANQVLCNVAIEYHSLMVKAK 778

Query: 379 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNA 436
             ++A    + S++FLE+++  D +L   + F++G W+  A+ +  + +   +  +E+NA
Sbjct: 779 NESNADDFRKYSKKFLEIIDLSDEILGSSEEFMVGNWINDARNMMSDGDDWTKDLFEFNA 838

Query: 437 RTQITMWFDNTQEEASL--LRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD 489
           R  +T W   + E +SL  L DY N+ W+GL +D+YG R  ++ +     L+  D
Sbjct: 839 RAMVTTW---SGERSSLNNLNDYSNRKWNGLTKDFYGKRWKVWIENRQAELDGKD 890


>gi|333031147|ref|ZP_08459208.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
 gi|332741744|gb|EGJ72226.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
          Length = 721

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 149/568 (26%), Positives = 244/568 (42%), Gaps = 106/568 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL   W   Q+ LQ KIL R+ EL M P+ PAF+G VP A     P     
Sbjct: 199 MGNLNKWDGPLSDEWHTSQIELQHKILDRMRELEMKPIAPAFAGFVPMAFAEKHPDINFK 258

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     DP +   Y+L    P F EIG+ FIE+   E+G  ++ Y  D+F+E   P
Sbjct: 259 HM-RWGGF--DPEYNA-YVLPPDSPFFEEIGKLFIEEWENEFGSNTY-YLSDSFNEMELP 313

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       +   G +IY  + +G+ +A+W+ QGW F Y   FW    ++ALL++VP
Sbjct: 314 IDKDDTEGKYRLLRQYGESIYKSISAGNPEAIWVTQGWTFGYQHSFWDTTSLQALLSNVP 373

Query: 174 LGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +        +  W     FYG  +I+  + NF G   M G +   A   
Sbjct: 374 NEKMIIIDLGNDYPKWVWNTEQTWKVQNGFYGKGWIFSYVPNFGGKTTMTGDMQMYATSS 433

Query: 227 VEAR---------------------------------TSENTTM-AWINQYSVRRYGRSV 252
            EA                                  TSE+  +  W+  Y + RYG   
Sbjct: 434 AEALASPNKGNLIGFGSAPEGLENNEVIYELLADMGWTSESINLDEWMQSYCLSRYGGYP 493

Query: 253 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 312
             +Q AW +   TVY+            + ++P                Y      E  L
Sbjct: 494 ENVQKAWELFRKTVYSN-----------LYSYP---------------RYTWQTVVEDTL 527

Query: 313 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 372
           +    +        ++ E +  +ELF+++ NEL  S  Y  DLI+ +    A  A++++ 
Sbjct: 528 RINKIN--------TSDEFLIGVELFVSAVNELKDSELYVNDLIEFSSFYAAAKADKIYK 579

Query: 373 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 432
             +  ++  +      L  + ++++  +D LLA H  + L  W++ A+       ++  +
Sbjct: 580 EALILFERGNKKEARSLLNQSIQILLKVDKLLASHPIYRLEEWVKYARNSGSTVAEKDAF 639

Query: 433 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 492
           E NA+  IT W         +  DY  ++WSGL++DYY PR  + F        S +   
Sbjct: 640 EANAKRLITTW-------GGIQDDYAARFWSGLIKDYYIPRMELNF--------SSERNS 684

Query: 493 LKDWRREWIKLTNDWQNGRNVY--PVES 518
           L+ W   W+  +  W N    +  P+E+
Sbjct: 685 LRQWEENWV--STPWNNPTQPFDNPIEA 710


>gi|373451393|ref|ZP_09543318.1| hypothetical protein HMPREF0984_00360, partial [Eubacterium sp.
           3_1_31]
 gi|371968665|gb|EHO86120.1| hypothetical protein HMPREF0984_00360, partial [Eubacterium sp.
           3_1_31]
          Length = 2190

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 144/535 (26%), Positives = 250/535 (46%), Gaps = 88/535 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ +GGPLP +W +Q+  L +K+  R+   G++PV+  FSG VP       P+A IT
Sbjct: 390 MQNLYSYGGPLPDNWFEQRTELARKMHDRMQTYGISPVVQGFSGQVPDNFDKKQPTALIT 449

Query: 61  QLGNWFSVKSDPRWCCTYLLD-----ATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
           ++ +W    + P     Y+ +       + L+ ++ + F + Q   +G  ++ Y  D F 
Sbjct: 450 EMKDWVGY-TRPSIIQPYITENDAAKGKENLYPQVAKDFYDAQKNVFGNVTNYYATDPFH 508

Query: 116 ENTPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP 173
           E   P  +D  E        + + M   +  AVW+MQ W  + D      ++  LL    
Sbjct: 509 EGGNPSGLDFAETFKQ----VQTEMLKANEKAVWVMQQWQGNLDA----TKLSGLLKP-- 558

Query: 174 LGKLVVLDLFAEVKP---IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
             + + LDL  ++ P   +   S+     P++WCMLHNF G + M G L ++A  P  A 
Sbjct: 559 -SQALALDLQTDLNPQNGVMENSE----TPWLWCMLHNFGGRMGMDGNLPNVAKNPAIAM 613

Query: 231 TS--------------ENTTMA------------------WINQYSVRRYGRSVPAIQDA 258
                           EN+ +A                  WI +Y+ RR G +   +Q+A
Sbjct: 614 NESKYMKGIGITPEALENSPVAYELLFDMTWTKDPIDEDAWIAKYAQRRAGGTSEKLQEA 673

Query: 259 WNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSS 318
           W +L  T Y    GA  +      ++     +II+ T            +++   +  S+
Sbjct: 674 WKILNETAY----GAKQE------SYQGAAETIINAT-----------PRDSFRSA--ST 710

Query: 319 YDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 378
           + H ++ Y   E  +AL+L I + ++  AS  YRYDL D+  Q L   A E    +++A 
Sbjct: 711 WGHSNITYDKKEFEKALQLLIDNYDDFKASPAYRYDLADVADQVLCNVAIEYHSLMVKAK 770

Query: 379 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNA 436
             ++A    + S++FLE+++  D +L   + F++G W+  A+ +  + +   +  +E+NA
Sbjct: 771 NESNADDFRKYSKKFLEIIDLSDEILGSSEEFMVGNWINDARNMMSDGDDWTKDLFEFNA 830

Query: 437 RTQITMWFDNTQEEASL--LRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD 489
           R  +T W   + E +SL  L DY N+ W+GL +D+YG R  I+ +     L+  D
Sbjct: 831 RAMVTTW---SGERSSLNNLNDYSNRKWNGLTKDFYGKRWKIWIENRQAELDGKD 882


>gi|289663931|ref|ZP_06485512.1| N-acetylglucosaminidase [Xanthomonas campestris pv. vasculorum
           NCPPB 702]
          Length = 798

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 162/575 (28%), Positives = 243/575 (42%), Gaps = 113/575 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W+D + VLQ++IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQQQILTRMRELGMQPVLPAFAGYVPKAFAQAHPHARIY 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG T   Y  D F+E  PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFMELYTQAYG-TGEFYLADAFNEMLPP 313

Query: 121 -------VDSPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
                  V + +Y  S+                       G A+Y  +   +  A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+P  + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 374 GWLFGADREFWQPQAIAAFLGKVPDARLLVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433

Query: 209 FAGNIEMYG------------ILDS-----IAFGPVEARTSENTTM-------------- 237
           +  +  +YG            + DS       FG        N+ +              
Sbjct: 434 YGASNPLYGDFAFYRQDLQALLADSEKRNLRGFGIFPEGLHSNSVVYEYLYALAWEGPQQ 493

Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
               W+ QY   RYGRS  A+  AW+ L   +Y     +               P   + 
Sbjct: 494 PWSQWLMQYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRWWNK 538

Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
             G Y  + +P +   ++K +    D   L        RA++  +      + +  YRYD
Sbjct: 539 RAGAYLLFKRPTAD--IVKFDDRPGDPQRL-------RRAIDALLQQAERYADAPLYRYD 589

Query: 355 LIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGFLLG 413
           LI+  R  L+  A+     +++AY   D A G  QL+ R  +LV+ +D L+       L 
Sbjct: 590 LIEDARHYLSLQADRQLQAVVQAYNAGDFARGDVQLA-RITQLVQGLDALVGGQHE-TLA 647

Query: 414 PWLESAKQLAQNEEQ-EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 472
            W   A   A N+    + Y  NAR Q+++W  +       L DY +K W G+  D+Y  
Sbjct: 648 DWTGQAAAAAGNDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFYLQ 702

Query: 473 RAAIYFKYMIESLESGDGF-------RLKDWRREW 500
           R   +      + ++G  F       +L  W R W
Sbjct: 703 RWTRFLSAYRAARKAGTPFDAAAVDQQLATWERHW 737


>gi|410634789|ref|ZP_11345419.1| alpha-N-acetylglucosaminidase [Glaciecola arctica BSs20135]
 gi|410145665|dbj|GAC22286.1| alpha-N-acetylglucosaminidase [Glaciecola arctica BSs20135]
          Length = 750

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 158/587 (26%), Positives = 268/587 (45%), Gaps = 108/587 (18%)

Query: 1   MSNLHGWGG--PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
           M NL  W G   LP+S+ D+Q+ L  KIL R+  LGM P++ AF+G VP A   +FP A+
Sbjct: 211 MGNLITWDGGDKLPESYFDEQIALNHKILKRLRSLGMTPIVHAFAGFVPPATSELFPEAQ 270

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE-N 117
           I +L +W      P     YLL   +PLF++IG+ +IE+  KE+G+  + Y  D+F+E +
Sbjct: 271 IRRL-SWGG--GLPESTYGYLLSPENPLFVKIGKMYIEEWQKEFGKNEY-YLADSFNEMD 326

Query: 118 TPPVDS-PEYISSL---GAAIYSGMQSGDSDAVWLMQGWLFSYDP------FWRPPQMKA 167
            PP D+  E ++ L   G  +Y  +++ + DA W+MQGW F Y        FW P ++ A
Sbjct: 327 VPPADTEAELLTELAGYGDRVYQSIKAANPDATWVMQGWTFPYHKDENRQLFWTPERLHA 386

Query: 168 LLNSVPLGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 220
           L++ VP  KL++LDL  E       + P W     F+   +I+  + N  G   + G  D
Sbjct: 387 LVSKVPDDKLLILDLANEYNKLWWKIDPSWKMYSGFFNKKWIYSFIPNMGGKTPLNGRFD 446

Query: 221 SIAFGPVEARTSEN------------------------TTMAW------INQ----YSVR 246
             A  P++A   ++                        T MAW      ++Q    Y+++
Sbjct: 447 IYAELPIDALNYKDKGNLIGFGFAPEGIENNEMIYELLTDMAWQRKAIDVDQWQAKYAMQ 506

Query: 247 RYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV 306
           RYG    +++ A++ L              N+  + +F  VD  I       Y+N   P 
Sbjct: 507 RYGAYPGSLEKAFSYL--------------NKSALGSF--VDHPIHRFQLRPYRN---PE 547

Query: 307 SKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 366
             E          DH  + + + + I+A  LF+ +  +L  +  Y++D +++T   L+  
Sbjct: 548 GVE----------DHATV-HESEDFIKATGLFLQASEQLKDNKLYQHDAMEITTLFLSLV 596

Query: 367 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 426
            + L    +        + V   +   + ++  MD LLA H    L  W++ A+      
Sbjct: 597 TDNLLTKFLAKDVEQRDYSVLDEA---ISVMHTMDKLLAEHPNHQLVTWVDYARTWGSTT 653

Query: 427 EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 486
            ++  YE NA+  +T W  +       + DY  + WSGL+ +YY PR   Y     ++++
Sbjct: 654 AEKDYYESNAKRLLTTWGGDP------VNDYAGRVWSGLIGNYYAPRWQSYH----DAVK 703

Query: 487 SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
           +   F ++ W   W+     ++N    Y      D +  +Q +Y KY
Sbjct: 704 NNQTFDVRQWEENWV--MTPYKNTSTAYQ-----DPVRVAQAMYFKY 743


>gi|281424178|ref|ZP_06255091.1| N-acetylglucosaminidase [Prevotella oris F0302]
 gi|281401447|gb|EFB32278.1| N-acetylglucosaminidase [Prevotella oris F0302]
          Length = 723

 Score =  192 bits (489), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 153/534 (28%), Positives = 246/534 (46%), Gaps = 70/534 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++G  GPL   W   Q+ LQ KIL R+ +L M+P+ P F+G VP AL+ ++P+A I 
Sbjct: 198 MGNIYGIDGPLSNQWHQDQIALQHKILDRMRKLDMHPICPGFAGFVPEALKELYPTADI- 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT-- 118
           Q   W     +      Y+L   DPLF +IG  FI++  KE+GR    Y  D+F+E    
Sbjct: 257 QYTTWEKAFHN------YILSPADPLFHKIGVMFIQEWEKEFGRCD-FYLIDSFNEMDIP 309

Query: 119 -PPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVP 173
            PP D P   E+++  G  +Y  ++  +  A W+MQGW+F Y P  W    + AL++ VP
Sbjct: 310 FPPKDDPKRYEFMADFGKKVYQCIKEANPSATWVMQGWMFGYQPEIWDYKTLNALVSQVP 369

Query: 174 LGKLVVLDLFAEV-KPIWSTS------KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K+++LDL  +  K +W T       K F G  +I+ ++ N  G   + G LD  A G 
Sbjct: 370 DNKMIMLDLAVDYNKFLWKTPFNWDFYKGFCGKQWIYSVIPNMGGKSALTGALDFYAKGH 429

Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
           +EA  S+N          +  +G +   I++   V+Y  +  C  G   +         +
Sbjct: 430 LEALNSQNRG-------KLIGFGFAPEGIENN-EVVYELL--CDAGWAKQG-------VE 472

Query: 287 VDPSIISVTEGKYQNYGKPVSKE----AVLKSETSSY-DHPHL-W--------YSTSEV- 331
           + P + + T  +Y  Y  P+  E     +L+S   S+  HP   W        Y + ++ 
Sbjct: 473 LRPWLRNYTYSRYGCY--PIGMEQYWNEMLQSVYGSFKSHPRFNWQFRPGKEKYGSVDLD 530

Query: 332 ---IRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 388
                A+E+     +++  +  +  D  ++    L      L   I +AY+  D     Q
Sbjct: 531 NHFYHAVEIMAGMLSQMKGNKLFEADFKEMAANYLGGKVEILVRQIDKAYESQDTINANQ 590

Query: 389 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 448
           L  RF  L+  MD +L  H    +  W++ A+    +  +   YE NAR  +T+W     
Sbjct: 591 LETRFYRLMTGMDLVLQGHPTKDMQKWIDYARARGVSYNKADCYESNARRIVTVW----- 645

Query: 449 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
                + DY  + W+GL+RDYY PR   YF        SG  F    W  ++++
Sbjct: 646 --GPPIDDYSARIWAGLIRDYYLPRWKHYF----NQKRSGKPFDFSTWELDFVE 693


>gi|310287970|ref|YP_003939229.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum S17]
 gi|309251907|gb|ADO53655.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum S17]
          Length = 1923

 Score =  192 bits (488), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 151/537 (28%), Positives = 238/537 (44%), Gaps = 77/537 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+  GGPLP +W +Q++ L ++I  R+   G+ PV+  F G VPA  Q   P++   
Sbjct: 366 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 425

Query: 61  QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
             G W      P    TYL DA      +  F ++G  F + Q   +G+ S+ Y  D F 
Sbjct: 426 SSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVGDTFYKAQESVFGKVSNYYAVDPFH 484

Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
           E     D  + I  +   +   M   D  AVW+MQ W +  D      ++  L +    G
Sbjct: 485 EGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 536

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS--- 232
           + +VLDL ++++   ++  +  GVP++W MLHNF G + + G+ + I+    +A  S   
Sbjct: 537 QALVLDLQSDLRS-QASPMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 595

Query: 233 -----------ENTTM------------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
                      +N+ +                  +W  +Y+ RRYG +   I+ AW++L 
Sbjct: 596 MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDILL 655

Query: 264 HTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
            T Y   DG      +  +I A P  D +I S                       S++ H
Sbjct: 656 DTAYKHMDGEYYQGASESIINARPS-DNTIGSA----------------------STWGH 692

Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
             + Y   +  +A  LF  + +    S  +RYD +D+ RQ LA    E      +AY+  
Sbjct: 693 SDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKSG 752

Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQ 439
           D      LS R L++++  D LL+  D FL+G W++ A+ +    +      +E NAR  
Sbjct: 753 DLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTMLDGADDWTADLFELNARAL 812

Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
           +T W  N  +  SL+ DY N+ W+GL  DYY  R   Y    +  LE G  F   DW
Sbjct: 813 VTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWKTYVDNRLNKLEHGTDFTDPDW 866


>gi|345014586|ref|YP_004816940.1| alpha-N-acetylglucosaminidase [Streptomyces violaceusniger Tu 4113]
 gi|344040935|gb|AEM86660.1| alpha-N-acetylglucosaminidase [Streptomyces violaceusniger Tu 4113]
          Length = 1044

 Score =  192 bits (488), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 148/564 (26%), Positives = 240/564 (42%), Gaps = 78/564 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + N+  +GGP+  + LD++  L ++I  R+ ELGM PV P + G VP    +  P A+  
Sbjct: 216 LQNMSEYGGPVSTALLDKRTELGRRIADRLRELGMRPVFPGYFGTVPDGFADRNPEARTV 275

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G+W  ++  P W     LD     F ++  AF   Q + +G  + ++  D   E   P
Sbjct: 276 PQGDWNGLRR-PDW-----LDPRTESFRKVAAAFYRHQRELFGE-AGLFKMDLLHEGGDP 328

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  +     A+ + +++    A+W++ GW    +P       + LL++V   +++V+
Sbjct: 329 GDVP--VPDAARAVETALRTARPGAIWVILGW--QENP------RRDLLDAVDHDRMLVV 378

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI------------------LDSI 222
           D  +++  +    K +  VPY +  + NF G   +                     L   
Sbjct: 379 DGLSDLDTVTDREKDWGAVPYAFGTIPNFGGRTTIGAKTHMWTKRFTVWRDKPGSKLVGT 438

Query: 223 AFGPVEA---------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
           A+ P  A               R        W   Y+  RYG      ++A+  L  T Y
Sbjct: 439 AYMPEAAERDPAAFELFSELAWREEAVDRAEWFRSYAEMRYGGRDAKAREAFAALRDTAY 498

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
             +      +  V  A P       S+T     NY           + T ++D       
Sbjct: 499 EISSKDGRPHDSVFAARP-------SLTARSGTNYA----------THTPAFDPAGF--- 538

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
             +V  A  L + +G  L  S+ YR+DL D+ RQALA  + +L   + +AY   D     
Sbjct: 539 --DVAFAALLGVRAG--LRDSDAYRHDLTDIARQALANRSWQLIPQLQDAYDRKDRTAFR 594

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
            L+R +L+L+   D +   H  FLLGPWLE AK++A  +E+  + E  ART IT W D  
Sbjct: 595 TLARLWLKLMRLSDDMTGAHRRFLLGPWLEDAKRMASGDEESARLERAARTLITTWADRA 654

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
             +   L +Y N+ WSGL+ D++ P+   Y   + ++L      R  D    W  +   W
Sbjct: 655 TADGGKLANYANRDWSGLIADFHLPQWQSYLDELEDALAENRPPRAFD----WFAVEEPW 710

Query: 508 QNGRNVYPVESNGDALITSQWLYN 531
              R  YPV    DA  T+Q +Y 
Sbjct: 711 TRERTSYPVRPTTDAHRTAQRVYE 734


>gi|384417770|ref|YP_005627130.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzicola BLS256]
 gi|353460684|gb|AEQ94963.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzicola BLS256]
          Length = 798

 Score =  192 bits (487), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 154/574 (26%), Positives = 239/574 (41%), Gaps = 111/574 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 201 MGNIEGYRAPLPQQWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPKAFAQAHPHARIY 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D F+E  PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQAYG-AGEFYLADAFNEMLPP 313

Query: 121 -------VDSPEY-----------------------ISSLGAAIYSGMQSGDSDAVWLMQ 150
                  V + +Y                       +++ G A+Y  +   +  A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAAYGQALYRSIAQVNPKATWVMQ 373

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+P  + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 374 GWLFGADRAFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433

Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
           +  +  +YG +                 +   FG        N+ +              
Sbjct: 434 YGASNPLYGDVAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVVYEYLYALAWEGPQH 493

Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
               W+ +Y   RYGRS  A+  AW  L   +Y     +               P   + 
Sbjct: 494 PWSQWLARYLRARYGRSDAALLSAWTDLEAGIYQTRYWS---------------PRWWNT 538

Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
             G Y  + +P +            D P        + RA++  +   +  + +  YRYD
Sbjct: 539 HAGAYLLFKRPTADIVNFD------DRPG---DPQRLRRAIDALLQQADRYADAPLYRYD 589

Query: 355 LIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGFLLG 413
           LI+  R  L+  A+     +++AY   D A G  QL+R   +LV+ +D L+      L  
Sbjct: 590 LIEDARHYLSLQADRQLQTVVQAYNAGDFARGDAQLART-TQLVQGLDALVGGQHETLAA 648

Query: 414 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
              ++A     +    + Y  NAR Q+++W  +       L DY +K W G+  D+Y  R
Sbjct: 649 WTGQAAAAAGNDARLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFYLQR 703

Query: 474 AAIYFKYMIESLESGDGF-------RLKDWRREW 500
              +      + ++G  F       +L  W R+W
Sbjct: 704 WTRFLSAYRAARKAGTPFDAQTVDQQLATWERQW 737


>gi|260910505|ref|ZP_05917173.1| periplasmic beta-glucosidase [Prevotella sp. oral taxon 472 str.
           F0295]
 gi|260635347|gb|EEX53369.1| periplasmic beta-glucosidase [Prevotella sp. oral taxon 472 str.
           F0295]
          Length = 1566

 Score =  192 bits (487), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 150/559 (26%), Positives = 254/559 (45%), Gaps = 83/559 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GWGGP+ +S +  +L  Q+++L R+ +LG+ PV+  F G VP   +  FP A+I 
Sbjct: 199 MGNLEGWGGPMSESLIALRLQQQRQMLQRMRQLGIQPVVQGFPGIVPTFFKERFPQARII 258

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN--T 118
           + G W S +         LL   D +F ++  A+ +   K +G        D F E   T
Sbjct: 259 EQGKWGSFQRP-----AVLLPNNDGVFEKVAEAYYQSLTKLFGTDFEFLGGDLFHEGGIT 313

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             VD    + S+ A +   M      A W++QGW  + +P    PQ+  +L+       +
Sbjct: 314 TGVD----VGSVAAQVQRQMLRFFPRAKWVLQGW--NKNP---SPQLLRVLDK---RHTL 361

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA--------- 229
           +++L  E+   W +S +F G P++W  +++F G  +M G L  I   P  A         
Sbjct: 362 LVNLSGEIAASWESSDEFGGTPWLWGSVNHFGGKTDMGGQLPVIVTEPHRALALTVDSVM 421

Query: 230 -------------------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYH 264
                                     T+     + + QY   RYG   P +  AW ++  
Sbjct: 422 QGIGILPEGIGTNPVVYDLALKTAWHTATPDVDSMLVQYLGYRYGEVHPDLLAAWRIMLK 481

Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           +VY                            EG +++     ++ ++  +  S++    +
Sbjct: 482 SVYG---------------------EFAIKGEGTFESVF--CARPSLRVTSVSTWGPKQM 518

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
            Y  +++ RAL LF+ +  +L  S TY+YDL+DL RQ+LA YA   + ++++AY+  +A 
Sbjct: 519 QYQPADLYRALGLFLKAAPKLRDSETYQYDLVDLARQSLANYARTAYADVVKAYEAKNAE 578

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
            + Q ++RF  L+   D LL  +  FLLG WL+ A Q A NE   +    NA+T I+ W 
Sbjct: 579 QLQQATQRFERLIVLQDSLLLTNRHFLLGNWLQQATQYAPNEADRQLCLHNAQTLISYW- 637

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
               E  + + DY NK W+G+L  YY PR   +F+ +  S+ +G+   +     ++ ++ 
Sbjct: 638 -GPDEPTTKVHDYANKEWAGMLSTYYLPRWQAFFRVLQASINTGNPPAI-----DFFEME 691

Query: 505 NDWQNGRNVYPVESNGDAL 523
             W N       +  GDA+
Sbjct: 692 KRWANTPQPINTKPQGDAV 710


>gi|403512485|ref|YP_006644123.1| alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain protein
           [Nocardiopsis alba ATCC BAA-2165]
 gi|402798758|gb|AFR06168.1| alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain protein
           [Nocardiopsis alba ATCC BAA-2165]
          Length = 718

 Score =  191 bits (486), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 139/509 (27%), Positives = 222/509 (43%), Gaps = 71/509 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  + GP+P+SW++    L +++L R   LGM PVLP F+G+VP +L      ++  
Sbjct: 160 MGNLDHFAGPMPRSWIEGHRELGRRVLERQRALGMTPVLPGFTGHVPPSLAPGRTGSRTW 219

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q               T++L  TDPL+  +    +E Q KE   T H Y  D F E  P 
Sbjct: 220 Q------------GLVTHVLVPTDPLYTTLCAEIVETQ-KELFDTDHQYAIDPFIEMIPV 266

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P +   +  A   G+   D  AVW +Q W FSY   FW P +++A L+++P   L +
Sbjct: 267 DSDPGFPGLVARATIEGLTRADPRAVWFLQTWPFSYQSDFWSPERVEAFLDAIPDDHLHL 326

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI----EMYGILDSIAFGPVEARTSENT 235
           LDL+AE  P WS    F G P+ WC L NF G      ++ G  D I      A   E  
Sbjct: 327 LDLWAEYDPQWSRFHAFGGTPWTWCALLNFGGRTDPMADLQGAADRIGAAKDSAHPPEGI 386

Query: 236 TMA----------------------------WINQYSVRRYGRSV-PAIQDAWNVLYHTV 266
            ++                            W+  +  +RYG    PA+ + W  L  TV
Sbjct: 387 GLSMEATRNNPAFFELVVDQAWTRTGRVEEEWLPDFVAQRYGPGHDPALLEGWRGLLRTV 446

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
              +          +  FP+    ++++     + + + +   + L++E ++     +WY
Sbjct: 447 LGASG---------VRIFPEQFNGVLTL-----RPHYRHLEDSSALRAEVTAL----VWY 488

Query: 327 STSEVIRALELFIASG--NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
              +++ A E  +A    + L+      +DL+D+    L++ A+  +L ++E    +   
Sbjct: 489 PWPDLLAAWERLVAGAETDPLAVEGPLGHDLVDVAMAVLSRVADHRYLEMVEHLDHHPEL 548

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
               L  RFLE+ +D+D LL     +    W   A   A   E  +    NAR  +T+W 
Sbjct: 549 PEGDLE-RFLEVFDDLDALLETRPEYRYRTWEAKATSWATGTEDHRVLTDNARRILTVW- 606

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
             T  +   L DY  + WSGL+  YY PR
Sbjct: 607 --TTLDDPRLDDYAGRLWSGLVGGYYRPR 633


>gi|375146756|ref|YP_005009197.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Niastella
           koreensis GR20-10]
 gi|361060802|gb|AEV99793.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Niastella
           koreensis GR20-10]
          Length = 1147

 Score =  191 bits (485), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 146/552 (26%), Positives = 228/552 (41%), Gaps = 80/552 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGP+PQS +D + +L +K++ R+  LG+ PV+P F G VP    N    A++ 
Sbjct: 191 MGNIEGWGGPMPQSQIDSRKILVQKMIARMQALGIEPVMPGFYGMVPHNF-NTKSKARVI 249

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             GNW +      +    +LD TD  F  +   F E+  K YGR    ++ D F E    
Sbjct: 250 TQGNWGA------FIRPAILDPTDTAFDRVAGIFYEETKKLYGRNIRFFSGDPFHEGG-- 301

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           + +   +   GA I   MQ     A+W++QG        W+    K LL       L++ 
Sbjct: 302 ITNGVNLGKAGANIQKAMQQYFPGAIWVLQG--------WQDNPKKELLAETDKSALLIQ 353

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFA------GNIEMY-GILDSIAFGPVEA---- 229
           +LF E    W T   + G P+IWC ++NF       G +E Y G +   A GP       
Sbjct: 354 ELFGENTNNWETRNGYEGTPFIWCCVNNFGERPGLNGKLERYAGEVYRAATGPFREYMKG 413

Query: 230 -----------------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                        T  WIN Y   RYG++   I  AW +   T+
Sbjct: 414 VGIMPEGINNNPASYDLVLELGWHNQPVETGKWINDYVKARYGKANDQIATAWTLFLQTI 473

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y+          +++ A P                        A+     SS+      Y
Sbjct: 474 YSNPGYQEGPPENILCARP------------------------ALQVKSVSSWGKLKKGY 509

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
            T+   + ++ F A+      S TY+ DLI+ TRQ L+  A+ +F +++ AY+  +    
Sbjct: 510 DTALFEKGVQAFAAAAPLFGNSETYKIDLINFTRQVLSNRADTVFASLVTAYKEENTVAF 569

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
              +  FL L    + LL  H  + L  + + A +      + K    NA   IT W +N
Sbjct: 570 NAAAEAFLSLHALTNELLNSHSYYRLTSYQQQALRSGNTPIERKNNLHNAMMLITYWGEN 629

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD---WRREWIKL 503
            ++E   L +Y  K W G++  +Y  R  +YF Y+  +L +G      D   W REW+  
Sbjct: 630 NRQE-DYLHEYAYKEWGGMMTTFYQQRWKLYFDYLRNNL-AGKSVTPPDFFAWEREWVTQ 687

Query: 504 TNDWQNGRNVYP 515
               ++    YP
Sbjct: 688 NEQVKSEVQPYP 699


>gi|408676293|ref|YP_006876120.1| Alpha-N-acetylglucosaminidase [Streptomyces venezuelae ATCC 10712]
 gi|328880622|emb|CCA53861.1| Alpha-N-acetylglucosaminidase [Streptomyces venezuelae ATCC 10712]
          Length = 855

 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 146/556 (26%), Positives = 229/556 (41%), Gaps = 81/556 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + NL G+ GP+ +  ++ +  L  +I   +  LGM PVLP + G VP       P A   
Sbjct: 327 LQNLSGFAGPVSEQLIEARAALGARIARHLRSLGMTPVLPGYFGTVPPDFTARNPGAHTV 386

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W      P W     LD T P+F  +   +   Q + +G  S ++  D   E   P
Sbjct: 387 PQGRWVGF-GRPDW-----LDPTGPVFARLAAVYYRHQRQRFG-DSDMFKMDLLHEGGAP 439

Query: 121 --VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             VD    +S+   A+   +++    A W+M GW  +  P        ALL+ V   +L+
Sbjct: 440 GTVD----VSAAAGAVQRALEAARPGATWVMLGWQLNPTP--------ALLHGVDRRRLL 487

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG------------------ILD 220
           ++D  ++         ++ G PY +  + NF G+  +                     L 
Sbjct: 488 IVDGLSDRYDELDRETRWGGTPYAFGTIPNFGGHTSIGANTGAWVSRFHAWLAKPDSALR 547

Query: 221 SIAFGPVEARTSEN-----TTMAW----INQ------YSVRRYGRSVPAIQDAWNVLYHT 265
            IA+ P    T+       T +AW    I+Q      Y+ RRYG +      AW  L   
Sbjct: 548 GIAYLPEATGTNPVAFGLFTELAWQPGPIDQQRWFAGYAARRYGGADRHAAAAWEALRLG 607

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
            Y+   G+  + +D + A     PS+ + T  ++                        + 
Sbjct: 608 PYSMRTGSWSEPQDSLFA---ARPSLTASTAARWSPKA--------------------MR 644

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
           Y  + V RAL   +     L  S+ YR+D++D+ RQAL   A  L   I  AY+  D   
Sbjct: 645 YDAATVERALAELLRVAPRLRTSDAYRFDVVDVARQALTNRARVLLPRIRAAYEARDLDA 704

Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
              L R +    E +  L+     FL+GPWL +A+    +  +  + E++AR+ +T W D
Sbjct: 705 FRALVREWGAAEELLGRLVGSDRRFLVGPWLAAARSWGADPAERDRLEYDARSILTTWAD 764

Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
               E+  L DY N+ WSGL+RD Y PR A YF  +  +L +G      D    W    +
Sbjct: 765 RVPSESGGLHDYANREWSGLVRDVYAPRWAAYFASLDRALVNGTEPVAID----WFARDD 820

Query: 506 DWQNGRNVYPVESNGD 521
            W  G   YP   +GD
Sbjct: 821 AWARGHRSYPTLPSGD 836


>gi|32564213|ref|NP_496948.2| Protein K09E4.4 [Caenorhabditis elegans]
 gi|25814792|emb|CAB70170.2| Protein K09E4.4 [Caenorhabditis elegans]
          Length = 715

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 136/522 (26%), Positives = 227/522 (43%), Gaps = 81/522 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GG L  + +     L K+I+ R+ ELG+ P+LP F+G VP  L+ +FP++K  
Sbjct: 210 MGNLKAYGGGLSDAQMLNDHNLAKRIIDRLLELGITPILPTFAGFVPDHLETLFPASKFN 269

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTP 119
           +L  W +  S+    C   +   DPLF +IG  F+  Q K +G   +++Y+ D F+E  P
Sbjct: 270 RLPRWNNFTSET--SCMLSVSPFDPLFQKIGSTFLRHQKKMFGGDVTNMYSADPFNEILP 327

Query: 120 PVDS---PEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
              +    +++     AI +  +  D + VW++Q W F+YD  W    +K+ L+++P+G 
Sbjct: 328 SESAKFDAKFVKQTAQAIMNSCKKVDKNCVWVLQSWSFTYDQ-WPAWAIKSFLSAIPVGN 386

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
           L++LDL+AEV P W  +  F G  ++WC+LHNF G+ E+ G L  I  G   A     + 
Sbjct: 387 LLILDLYAEVVPAWQMTSSFQGHHFVWCLLHNFGGSRELRGNLQKIDKGYQLALMKAGSN 446

Query: 237 MA---------------------------------WINQYSVRRYGRSVPAIQDAWNVLY 263
           +                                  W+  YS  RY       Q  W +L 
Sbjct: 447 LVGAGLSMEAIDQNYVVYQFMIDRMWSPEPLPLNNWLKAYSESRYSADFKVAQKFWTLLA 506

Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
            T YN  +        V +                   Y +P     +            
Sbjct: 507 GTFYNQPEKWGTPRFSVFL-------------------YHRPGFGRKI-----------E 536

Query: 324 LWYSTSEVI-RALELFIASGNELSASNTYRYDLIDLTRQALA-KYANELFLNIIEAYQLN 381
            W+   E   R  EL  A  + L     +R DL D+ R+    +  NE  L++ EA+ + 
Sbjct: 537 YWFPVEETFSRFRELLPALVHVLGEHPLFREDLNDVMREMTQFEMGNEAALSMSEAFLME 596

Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
           D   V       +E+ + ++     +    +  W+E+AK +A   E+ + +   A   +T
Sbjct: 597 DKQQVGASCEMLMEMFQKLES----YSNRDVRQWIENAKSIAPTSEERQVFPVTAGDILT 652

Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 483
           +W    Q       DY ++ W+GL+  YYG R   +  +++E
Sbjct: 653 VWGPTGQN-----LDYAHREWAGLMSGYYGRRWQYFCDWILE 689


>gi|421736727|ref|ZP_16175487.1| alpha-N-acetylglucosaminidase, partial [Bifidobacterium bifidum
           IPLA 20015]
 gi|407295984|gb|EKF15606.1| alpha-N-acetylglucosaminidase, partial [Bifidobacterium bifidum
           IPLA 20015]
          Length = 1044

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 150/537 (27%), Positives = 236/537 (43%), Gaps = 77/537 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+  GGPLP +W +Q + L ++I  R+   G+ PV+  F G VPA  Q   P++   
Sbjct: 338 MQNLYSVGGPLPAAWFEQCVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 397

Query: 61  QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
             G W      P    TYL DA      +  F ++   F + Q   +G+ S+ Y  D F 
Sbjct: 398 SSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVCDTFYKAQENVFGKVSNYYAVDPFH 456

Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
           E     D  + I  +   +   M   D  AVW+MQ W +  D      ++  L +    G
Sbjct: 457 EGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 508

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS--- 232
           + +VLDL ++++   ++  +  GVP++W MLHNF G + + G+ + I+    +A  S   
Sbjct: 509 QTLVLDLQSDLRSQ-ASPMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 567

Query: 233 -----------ENTTM------------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
                      +N+ +                  +W  +Y+ RRYG +   I+  W++L 
Sbjct: 568 MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKVWDILL 627

Query: 264 HTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
            T Y  TDG      +  +I A P  D +I S                       S++ H
Sbjct: 628 DTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWGH 664

Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
             + Y   +  +A  LF  + +    S  +RYD +D+ RQ LA    E      +AY+  
Sbjct: 665 SDIDYDKRQFEKAAALFEQAYDSYKNSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKSG 724

Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQ 439
           D      LS R L++++  D LL+  D FL+G W++ A+ +    +      +E NAR  
Sbjct: 725 DLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTMLDGADDWTADLFELNARAL 784

Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
           +T W  N  +  SL+ DY N+ W+GL  DYY  R   Y    +  LE G  F   DW
Sbjct: 785 VTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWKTYVDNRLNKLEHGTDFTDPDW 838


>gi|341892319|gb|EGT48254.1| hypothetical protein CAEBREN_28412 [Caenorhabditis brenneri]
          Length = 713

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 140/503 (27%), Positives = 233/503 (46%), Gaps = 43/503 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GG L  + +   L L K+I+ R+ ELG+ P+LP F+G VP  L+ +FPS+K T
Sbjct: 208 MGNLKAYGGGLSDAQMLNDLNLAKRIINRLLELGITPILPTFAGFVPDQLEKLFPSSKFT 267

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTP 119
           +L  W +  S+    C   +   DPLF +IG  F+  Q K +G   +++Y+ D F+E  P
Sbjct: 268 RLPCWNNFTSET--SCLLSVSPFDPLFQKIGSLFLRHQKKMFGGDITNLYSADPFNEILP 325

Query: 120 PVDS---PEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
              +    +++     AI +  +  D + +W++Q W F+YD  W    +K+ L++VP+G 
Sbjct: 326 SDSAKFDAKFVKQTAQAIMNSCRKVDKNCIWVLQSWSFTYDE-WPSWAIKSFLSAVPIGN 384

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT- 235
           L++LDL++EV P W ++  F+G  YIWCMLH+F G+ E+ G L  +  G   A     + 
Sbjct: 385 LLILDLYSEVVPAWQSTSSFHGHNYIWCMLHSFGGSRELRGNLQKVDKGYQLALMKGGSN 444

Query: 236 ------TMAWIN------QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVA 283
                 TM  I+      Q+ V R   S P   + W   Y       D         ++A
Sbjct: 445 LIGAGLTMEAIDQNYVIYQFMVDRMWSSEPLPLNTWIKSYSESRYSADFKVSHKFWTLLA 504

Query: 284 FPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIAS-G 342
           F   +         + + +G P      L    +       W+   E    L+  I S  
Sbjct: 505 FSFYN---------QPEKWGNP-RFSVFLYHRPAFGKKIEYWFPVEETFGHLQSLIPSLI 554

Query: 343 NELSASNTYRYDLIDLTRQALAKY--ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDM 400
           + L     ++ DL D+ R A+ ++   N+  L + EA+ + D   +       +    DM
Sbjct: 555 HVLGDHPLFKEDLNDVMR-AITQFEVGNDAALTLTEAFLMEDKQQIGSTCENLM----DM 609

Query: 401 DGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNK 460
              L  +    +  W+E +K +A   E+ + +   A   +T+W    Q       DY ++
Sbjct: 610 FLKLESYSNRDMKHWIEDSKSIAATSEERQVFPATAADILTVWGPEGQN-----LDYAHR 664

Query: 461 YWSGLLRDYYGPRAAIYFKYMIE 483
            W GLL  YYG R   +  +++E
Sbjct: 665 EWEGLLSGYYGRRWQYFCDWILE 687


>gi|147860882|emb|CAN83148.1| hypothetical protein VITISV_031934 [Vitis vinifera]
          Length = 562

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 86/100 (86%), Positives = 94/100 (94%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPLPQSWLDQQL+LQKKIL R+YELGM PVLPAFSGNVPAAL+ +FPSAKIT
Sbjct: 235 MGNLHGWGGPLPQSWLDQQLLLQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKIT 294

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK 100
           +LGNWF+V  +PRWCCTYLLDATDPLFIEIGRAFI+QQLK
Sbjct: 295 RLGNWFTVGGNPRWCCTYLLDATDPLFIEIGRAFIQQQLK 334



 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 82/92 (89%), Positives = 89/92 (96%)

Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
           DTFDENTPPVD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYDPFWRPPQMKALL+S
Sbjct: 429 DTFDENTPPVDDPEYISSLGAAIFKGMQSGDSNAIWLMQGWLFSYDPFWRPPQMKALLHS 488

Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW 203
           VP+G+LVVLDLFAEVKPIW TS+QFYGVPYIW
Sbjct: 489 VPMGRLVVLDLFAEVKPIWITSEQFYGVPYIW 520


>gi|390989490|ref|ZP_10259787.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
 gi|372555759|emb|CCF66762.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
          Length = 798

 Score =  189 bits (481), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 153/574 (26%), Positives = 232/574 (40%), Gaps = 111/574 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D F+E  PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 313

Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
           V                                   ++  G A+Y  +   +  A W+MQ
Sbjct: 314 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+   + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 374 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433

Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
           +  +  +YG                   +   FG        N+ +              
Sbjct: 434 YGASNPLYGDFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWEGPQQ 493

Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
               W+  Y   RYGRS  A+  AW+ L   +Y     +               P   + 
Sbjct: 494 SWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRWWNK 538

Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
             G Y  + +P +            D P        + RA++  +   N  + +  YRYD
Sbjct: 539 RAGAYLLFKRPTADIVDFD------DRPG---DPQRLRRAIDALLRQANRYADAPLYRYD 589

Query: 355 LIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGFLLG 413
           LI+  R  L+  A+     +++AY   D A G  QL+R   +LV  +D L+      L  
Sbjct: 590 LIEDARHYLSLQADRQLQAVVQAYNAGDFARGDAQLART-TQLVRGLDALVGGQHETLAD 648

Query: 414 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
              ++A     +    + Y  NAR Q+++W  +       L DY +K W G+  D+Y  R
Sbjct: 649 WTGQAAAATGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFYLQR 703

Query: 474 AAIYFKYMIESLESGDGF-------RLKDWRREW 500
              +      + ++G  F       +L  W R+W
Sbjct: 704 WTRFLSAYRAARKAGTPFDAVAVDHQLATWERQW 737


>gi|197302378|ref|ZP_03167435.1| hypothetical protein RUMLAC_01107 [Ruminococcus lactaris ATCC 29176]
 gi|197298557|gb|EDY33100.1| F5/8 type C domain protein [Ruminococcus lactaris ATCC 29176]
          Length = 1655

 Score =  189 bits (480), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 156/580 (26%), Positives = 249/580 (42%), Gaps = 108/580 (18%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M+NL G+GGP+  +W  ++  L +K  + + +LGM PVL  +SG VP  + +  PSA++ 
Sbjct: 680  MANLSGYGGPVHDTWFTERTELARKNQLIMRKLGMQPVLQGYSGMVPVDITSKDPSAEVI 739

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
            + G W S +         +L      F +    F + Q + YG ++H Y  D F E  NT
Sbjct: 740  KQGTWCSFQRPS------MLRTDSESFTKYAALFYKVQKEVYGDSAHYYATDPFHEGGNT 793

Query: 119  PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK-- 176
              +DS    + +   + + M + D  A W++Q         W+     ALL  +   +  
Sbjct: 794  GGMDS----AVISQKVLASMMTADPHATWVIQS--------WQGNPTTALLQGLGDNRDH 841

Query: 177  LVVLDLFAEVKPIWSTSK-----------QFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
             +VLDL+AE  P W+ +            +F   P+++CML+NF G + ++G +D+   G
Sbjct: 842  ALVLDLYAEKTPHWNETNPGYYGGAEGGGEFLNTPWVYCMLNNFGGRLGLHGHIDNYVEG 901

Query: 226  PVEARTSENTTMA--------------------------------------WINQYSVRR 247
             V A + +   MA                                      W   Y  RR
Sbjct: 902  IVNA-SKQAEHMAGIGITPEASVNNPVLYDLFFETIWADDGNNLQKINLDEWFKNYVTRR 960

Query: 248  YGRSVPAIQDAWNVLYHTVYNCTDGATDKN--RDVIVAFPDVDPSIISVTEGKYQNYGKP 305
            YG    +   A  +L+ TVYN       +     V+ A P +D        G    +G  
Sbjct: 961  YGADSDSAYQAMEILHDTVYNPAYNMKGQGAPESVVNARPGLDI-------GAASTWGNA 1013

Query: 306  VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAK 365
            V                 + Y   ++ +A EL +A  ++L  S  Y+YDL ++  Q L+ 
Sbjct: 1014 V-----------------VDYDKKKLEKAAELLLADYDKLKNSAGYQYDLANVLEQVLSN 1056

Query: 366  YANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 425
             A E    +  A++  DA     LS +FL +++ ++ +      FL+G W+  AK+LA+N
Sbjct: 1057 TAQEYQKKMAAAFRSGDAEEFSTLSDKFLSIIDMVEKVTGTQKEFLVGTWINGAKKLAKN 1116

Query: 426  EEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 483
             +   ++ YE NAR+ IT W    Q  +  L DY N+ W+GL  DYY  R   +     +
Sbjct: 1117 SDDFTKELYELNARSLITTWGSYDQAISGGLIDYSNRQWAGLTNDYYKMRWEKWITERKK 1176

Query: 484  SL--ESGDGFRLKDW-RREWIKLTNDWQNGRNVYPVESNG 520
             L  ES   +  +DW   EW      W  G N Y    NG
Sbjct: 1177 ELAGESYTNYSAQDWFEMEWA-----WARGTNKYSGTPNG 1211


>gi|294812279|ref|ZP_06770922.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
           27064]
 gi|294324878|gb|EFG06521.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
           27064]
          Length = 1086

 Score =  189 bits (479), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 148/564 (26%), Positives = 237/564 (42%), Gaps = 87/564 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + N+  +GGP+  + LD+++ L ++I+ R+  LGM PV+P + G VP       P A++ 
Sbjct: 257 LQNMSEYGGPVSPALLDRRIELGQRIVTRMRRLGMRPVVPGYFGTVPDGFVARNPGARVI 316

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W  +   P W     LD   P+F EI  A+   Q + +G   H +  D   E   P
Sbjct: 317 PQGVWNGLPR-PDW-----LDPRTPVFAEIAAAYYRHQEELFGEIDH-FKMDLLHEGGTP 369

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  +     A+ + +++    A W++ GW  +  P        ALL+++   K++++
Sbjct: 370 GDVP--VPDAARAVETALRAARPAATWVILGWQSNPRP--------ALLDAIDTSKVLIV 419

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD------------------SI 222
           D  +++  +     ++ G PY +  + NF G   +    D                    
Sbjct: 420 DGLSDLDTVRDREAEWGGAPYAFGTIPNFGGRTTIGANTDRWTEKFTAWRDKPNSALVGT 479

Query: 223 AFGPVEA---------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
           A+ P  A               R  +    AW   Y+  RYG   PA ++A+  L  T Y
Sbjct: 480 AYMPEAADRDPAALELFTELAWRREKIDRSAWFAGYAQFRYGAKDPAAEEAFAALAGTAY 539

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
             T   T   R +   F                     + + ++  S  +++D       
Sbjct: 540 QLT---TTDGRPIDSLF---------------------LRRPSMSSSVATAFDQAAFDRG 575

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
            + ++R  E       EL  S+ YRYDL DL RQALA  +  L L +  AY   D     
Sbjct: 576 FAALLRVNE-------ELRGSDAYRYDLTDLARQALALRSRTLQLALRAAYATKDVTAFR 628

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
            ++  +L L+   D +  CH  FLLGPWLE AK+ A + E+  + E  AR  IT W D  
Sbjct: 629 GVAALWLRLMRLADTVAGCHKAFLLGPWLEEAKRFATSTEEAVELERTARVLITTWGD-- 686

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
           +  A  L +Y N+ W GL+ D + P+   YF  +  +L  G   +  D    W      W
Sbjct: 687 RAAAVELSNYANRDWQGLIGDVHVPQWEQYFTEVATALAEGRAPKAID----WYPGEETW 742

Query: 508 QNGRNVYPVESNGDALITSQWLYN 531
              R  YPV   GD    +Q +++
Sbjct: 743 TKDRRPYPVRPTGDVHKVAQRVHD 766


>gi|326440885|ref|ZP_08215619.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
           27064]
          Length = 1038

 Score =  188 bits (478), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 148/564 (26%), Positives = 237/564 (42%), Gaps = 87/564 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + N+  +GGP+  + LD+++ L ++I+ R+  LGM PV+P + G VP       P A++ 
Sbjct: 209 LQNMSEYGGPVSPALLDRRIELGQRIVTRMRRLGMRPVVPGYFGTVPDGFVARNPGARVI 268

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W  +   P W     LD   P+F EI  A+   Q + +G   H +  D   E   P
Sbjct: 269 PQGVWNGLPR-PDW-----LDPRTPVFAEIAAAYYRHQEELFGEIDH-FKMDLLHEGGTP 321

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  +     A+ + +++    A W++ GW  +  P        ALL+++   K++++
Sbjct: 322 GDVP--VPDAARAVETALRAARPAATWVILGWQSNPRP--------ALLDAIDTSKVLIV 371

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD------------------SI 222
           D  +++  +     ++ G PY +  + NF G   +    D                    
Sbjct: 372 DGLSDLDTVRDREAEWGGAPYAFGTIPNFGGRTTIGANTDRWTEKFTAWRDKPNSALVGT 431

Query: 223 AFGPVEA---------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
           A+ P  A               R  +    AW   Y+  RYG   PA ++A+  L  T Y
Sbjct: 432 AYMPEAADRDPAALELFTELAWRREKIDRSAWFAGYAQFRYGAKDPAAEEAFAALAGTAY 491

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
             T   T   R +   F                     + + ++  S  +++D       
Sbjct: 492 QLT---TTDGRPIDSLF---------------------LRRPSMSSSVATAFDQAAFDRG 527

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
            + ++R  E       EL  S+ YRYDL DL RQALA  +  L L +  AY   D     
Sbjct: 528 FAALLRVNE-------ELRGSDAYRYDLTDLARQALALRSRTLQLALRAAYATKDVTAFR 580

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
            ++  +L L+   D +  CH  FLLGPWLE AK+ A + E+  + E  AR  IT W D  
Sbjct: 581 GVAALWLRLMRLADTVAGCHKAFLLGPWLEEAKRFATSTEEAVELERTARVLITTWGD-- 638

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
           +  A  L +Y N+ W GL+ D + P+   YF  +  +L  G   +  D    W      W
Sbjct: 639 RAAAVELSNYANRDWQGLIGDVHVPQWEQYFTEVATALAEGRAPKAID----WYPGEETW 694

Query: 508 QNGRNVYPVESNGDALITSQWLYN 531
              R  YPV   GD    +Q +++
Sbjct: 695 TKDRRPYPVRPTGDVHKVAQRVHD 718


>gi|268533054|ref|XP_002631655.1| Hypothetical protein CBG20846 [Caenorhabditis briggsae]
          Length = 712

 Score =  188 bits (478), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 131/523 (25%), Positives = 234/523 (44%), Gaps = 83/523 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL G+GG L  + +     L K+I+ R+ ELG+ P+LP FSG VP  L+ +FP++K  
Sbjct: 207 MGNLKGYGGGLSDAQMLNDFNLAKRIINRLLELGITPILPTFSGFVPDRLEKLFPTSKFN 266

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTP 119
           +L  W +  S+    C   +   DPLF +IG +F+  Q K  G   +++Y+ D F+E  P
Sbjct: 267 RLPCWNNFTSET--SCLLSVSPFDPLFQKIGSSFLRHQKKMLGGDITNLYSADPFNEVLP 324

Query: 120 PVDS---PEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
              +    +++     AI +  +  D + +W++Q W F+YD  W    +K+ L++VP+G+
Sbjct: 325 SDSAKFDAKFVKQTAQAIMNSCRKVDKNCIWVLQSWSFTYDQ-WPNWAIKSFLSAVPIGQ 383

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
           +++LDL++EV P W  +  F+G  ++WCMLHNF G+ E+ G +  +  G   A     + 
Sbjct: 384 MLILDLYSEVVPAWQMTSSFHGHNFVWCMLHNFGGSRELRGNVQKVDKGYQLALMKAGSN 443

Query: 237 M---------------------------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
           +                                 +W+  YS  RY          W +L 
Sbjct: 444 LVGAGLSMEAIDQNYMMYQFMIDRMWTQEPIPLNSWLKSYSESRYSADFKVAHKFWTILA 503

Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
            + YN  +   +    V +                   Y +P   + +            
Sbjct: 504 GSFYNQPEKWGNPRFSVFL-------------------YHRPAFGKKI-----------E 533

Query: 324 LWYSTSEVIRALE-LFIASGNELSASNTYRYDLIDLTRQALAKY--ANELFLNIIEAYQL 380
            W+   E    LE L ++  + L     ++ DL D+ R A+ ++   NE  L++ EA+ +
Sbjct: 534 YWFPVEETFTHLESLVLSLLHILGDHPLFKEDLNDVMR-AITQFEIGNEAALSLTEAFLM 592

Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
            D   +       + + + ++     +    +  W+E AK +A   E+ + +  +A   +
Sbjct: 593 EDKQQIGTTCENLMGMFQKLEP----YSNRDVRDWIEDAKSIAPTTEEREVFPISASDIL 648

Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 483
           T+W    Q       DY ++ W+GLL  YYG R   +  +++E
Sbjct: 649 TVWGPTGQN-----LDYAHREWAGLLSGYYGRRWQYFCDWILE 686


>gi|331092442|ref|ZP_08341267.1| hypothetical protein HMPREF9477_01910 [Lachnospiraceae bacterium
            2_1_46FAA]
 gi|330401285|gb|EGG80874.1| hypothetical protein HMPREF9477_01910 [Lachnospiraceae bacterium
            2_1_46FAA]
          Length = 1598

 Score =  187 bits (476), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 144/574 (25%), Positives = 248/574 (43%), Gaps = 111/574 (19%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M+NL G+GGP+  SW +++  L +K  + +  LGM PVL  +SG VP  ++    SA++ 
Sbjct: 680  MANLSGFGGPIHDSWFEERTELARKNQLSMRRLGMQPVLQGYSGMVPTNIREKDSSAEVI 739

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN--- 117
            + G W S +         +L      F +  + F + Q + YG ++H Y  D F E    
Sbjct: 740  EQGTWCSFRRPD------MLKTDSASFDKYAKLFYQAQKEVYGESAHYYATDPFHEGGDT 793

Query: 118  ---TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
                P V        +   +   M   D D +W++Q         W+     ALL  +  
Sbjct: 794  GGLNPTV--------IAGKVLDAMLEADKDGIWIIQS--------WQGNPTTALLKGLEG 837

Query: 175  GK--LVVLDLFAEVKPIWSTSK-------QFYGVPYIWCMLHNFAGNIEMYGILDSIA-- 223
             K   +VLDL+AE  P W+ +        +F   P+++CML+NF G + ++G LD++A  
Sbjct: 838  RKEHALVLDLYAEKTPHWNETNPNEYGGGEFNDTPWVFCMLNNFGGRLGLHGHLDNLAKN 897

Query: 224  -------------FGPVEARTSENTTM-----------------------AWINQYSVRR 247
                          G     +  N  +                        W+  Y+ RR
Sbjct: 898  IPAALNSAKHMEGIGITPEASVNNPLLYDFLFETVWTDNAKEKLPVIDLDKWLKDYAKRR 957

Query: 248  YGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 302
            YG+   +  +A  ++  TVY         GA +    V+ A P +D    S        +
Sbjct: 958  YGKESQSAYEALLIMKDTVYKAELNMKGQGAPES---VVNARPALDIGAAST-------W 1007

Query: 303  GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 362
            G  V                 + Y  +++ +A EL +   ++L  S+ Y YDL  + +Q 
Sbjct: 1008 GNAV-----------------ISYDKAKLEKAAELLLKDYDKLKDSDGYMYDLATMLQQV 1050

Query: 363  LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 422
            L+  A E    +  A++ N+       + +FL +++ M+ + +    +LLG W+E AK L
Sbjct: 1051 LSNSAQEYQRKMANAFKENNKEEFNTYADKFLSIIDSMEKVTSTSKYYLLGTWVEQAKAL 1110

Query: 423  AQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 480
            A+N +   +  YE+NA+  +T W    Q E   L+DY N+ WSGLL+D+Y  R   + + 
Sbjct: 1111 AKNADDFTKDLYEFNAKALVTTWGSINQAEGGGLKDYSNRQWSGLLKDFYKVRWQKWIQA 1170

Query: 481  MIESLESGDGFRLK--DWRREWIKLTNDWQNGRN 512
              + L+      +   +W  +W++   ++ N  N
Sbjct: 1171 RNDELDGKQPENINWFEWEWKWVRENTEYTNTPN 1204


>gi|374990497|ref|YP_004965992.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
 gi|297161149|gb|ADI10861.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
          Length = 1001

 Score =  187 bits (476), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 148/564 (26%), Positives = 232/564 (41%), Gaps = 78/564 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + N+ G+GGP+    L +++ L +KI  R+ ELGM PV P + G VP    +  P A+  
Sbjct: 174 LQNMSGYGGPVSPELLAKRIALGQKIAERLRELGMRPVYPGYFGTVPDGFVDRNPGARTV 233

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W  + + P W     LD     F ++  AF   Q + +G    ++  D   E    
Sbjct: 234 PQGTWNGL-ARPDW-----LDPRTESFGQVAAAFYRHQQELFGECD-LFKMDLLHEGGAA 286

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  ++    A+ + +Q+    A W++ GW  +          + LL++V    ++V+
Sbjct: 287 GDVP--VADAARAVETALQTARPGATWVILGWQAN--------PRRELLDAVNHDHMLVV 336

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI----------------------EMYGI 218
           D  +++  I    + +  VPY +  + NF G                        ++ G 
Sbjct: 337 DGLSDLDSIGDREQDWGSVPYAFGTIPNFGGRTTIGAKTHIWARRFTQWRDKPGSKLVGT 396

Query: 219 L--------DSIAFGPVEARTSENTTM---AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                    D  AF         NT +    W   Y+  R G      +DA+  L  T Y
Sbjct: 397 AYMAEAVGRDPAAFELFSELAWRNTAVDRDEWFRTYADVRLGGRDERARDAYAALRDTAY 456

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
             T      +  V  A PD       VT     NY   +                   + 
Sbjct: 457 QITSSDGRPHDSVFSARPD-------VTARSGTNYATRIPA-----------------FD 492

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
            ++   AL   +     L  S+ YR+DL D+ RQALA  +  L  ++ +AY+  D     
Sbjct: 493 LADFDPALAALLDVRPSLRDSDAYRHDLTDIARQALADRSWTLIPHLHDAYERKDLEAFR 552

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
            L+R +L+L+   D +   H GFLLGPWLE AK+LA +E +    E  ART IT W D  
Sbjct: 553 TLARLWLKLMRLSDDMTGAHRGFLLGPWLEDAKRLASDEAEAAHLEHLARTLITTWADRV 612

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
             +   L +Y N+ W+GL+ D++ P+   Y   + ++L  G   R  D    W  +   W
Sbjct: 613 TADTGKLANYANRDWNGLIGDFHLPQWQSYLDELEDALAEGREPRDFD----WFAVEEPW 668

Query: 508 QNGRNVYPVESNGDALITSQWLYN 531
              R  YPV    DA  T + +Y 
Sbjct: 669 TRERKSYPVRPTTDAHRTGRRVYE 692


>gi|282877909|ref|ZP_06286718.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
           35310]
 gi|281299910|gb|EFA92270.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
           35310]
          Length = 717

 Score =  187 bits (476), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 148/550 (26%), Positives = 232/550 (42%), Gaps = 104/550 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W  QQ+ LQ KI+ R+ ELGM+P+ PAF+G VP A     P     
Sbjct: 195 MGNLNQWDGPLSDAWHKQQITLQHKIISRMRELGMHPIAPAFAGFVPKAFAKKHPEINFK 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            L  W            Y+L      F ++G+ FIE+  +E+G  ++ Y  D+F+E   P
Sbjct: 255 HL-RWGGFADS---LNAYVLPPESSYFKQLGKLFIEEWEREFGENTY-YLSDSFNEMKLP 309

Query: 121 V------DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           V      +    ++  G AIY  + +G+  A+W+ QGW F Y   FW    + ALL+ VP
Sbjct: 310 VNPNDEEEKCRLLAEYGKAIYQSINAGNPHAIWVTQGWTFGYQHDFWNRKSLSALLSQVP 369

Query: 174 LGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFA------GNIEMYGILD 220
             +++++DL  +        +  W     FYG  +I+  + NF       G++EMY    
Sbjct: 370 NDRMIIIDLGNDYPKWVWHTEQTWKRHNGFYGKQWIFSYVPNFGGKTLLTGDLEMYATDA 429

Query: 221 SIAFGPVE---------------------------ARTSENTTM-AWINQYSVRRYGRSV 252
           S+A                                A T +   +  WI  Y + RYG+  
Sbjct: 430 SLALSAANKGNLVGIGSAPEGLENNEVVYELLSDAAWTDKGINLDEWIANYCMARYGKYP 489

Query: 253 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVD-PSIISVTEGKYQNYGKPVSKEAV 311
             ++ AWN    +VY+            + ++P     ++I  T  K             
Sbjct: 490 DKMKAAWNGFRKSVYSS-----------LYSYPRFTWQTVIPDTRRK------------- 525

Query: 312 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 371
                S +D    ++      +A+E F++  +EL  +  Y+ D I    Q L   A+  +
Sbjct: 526 -----SRHDLNETYF------KAVEDFLSCADELGGAKFYQDDAILFAAQYLGAKADIYY 574

Query: 372 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
            N +    LN      +   + +EL+   D +LA H    L  W+  A+      +++ Q
Sbjct: 575 ENALRYGSLNKHVEANKQLSKAIELLLFADKILASHPTDRLDVWIAKARSQGHTPQEKNQ 634

Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF 491
           YE NA+  IT W  + +       DY  + WSGL++DYY PR  IYF        S    
Sbjct: 635 YEANAKRLITTWGGHQE-------DYAARCWSGLIKDYYIPRIQIYF--------SNQRK 679

Query: 492 RLKDWRREWI 501
            L  W   WI
Sbjct: 680 MLDQWEENWI 689


>gi|21241480|ref|NP_641062.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. citri str. 306]
 gi|21106823|gb|AAM35598.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. citri str. 306]
          Length = 798

 Score =  187 bits (476), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 153/574 (26%), Positives = 232/574 (40%), Gaps = 111/574 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D F+E  PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 313

Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
           V                                   ++  G A+Y  +   +  A W+MQ
Sbjct: 314 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+   + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 374 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASRAFDNKQWIYGYVHN 433

Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
           +  +  +YG                   +   FG        N+ +              
Sbjct: 434 YGASNPLYGDFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWEGPQQ 493

Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
               W+  Y   RYGRS  A+  AW+ L   +Y     +               P   + 
Sbjct: 494 SWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRWWNK 538

Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
             G Y  + +P +            D P        + RA++  +   N  + +  YRYD
Sbjct: 539 RAGAYLLFKRPTADIVDFD------DRPG---DPQRLRRAIDALLRQANRYADAPLYRYD 589

Query: 355 LIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGFLLG 413
           LI+  R  L+  A+     +++AY   D A G  QL+R   +LV  +D L+      L  
Sbjct: 590 LIEDARHYLSLQADRQLQAVVQAYDAGDFARGDAQLART-TQLVRGLDALVGGQHETLAD 648

Query: 414 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
              ++A     +    + Y  NAR Q+++W  +       L DY +K W G+  D+Y  R
Sbjct: 649 WTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFYLQR 703

Query: 474 AAIYFKYMIESLESGDGF-------RLKDWRREW 500
              +      + ++G  F       +L  W R+W
Sbjct: 704 WTRFLSAYRAARKAGTPFDAVAVDHQLATWERQW 737


>gi|317501265|ref|ZP_07959469.1| hypothetical protein HMPREF1026_01412 [Lachnospiraceae bacterium
            8_1_57FAA]
 gi|316897332|gb|EFV19399.1| hypothetical protein HMPREF1026_01412 [Lachnospiraceae bacterium
            8_1_57FAA]
          Length = 1847

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 142/533 (26%), Positives = 235/533 (44%), Gaps = 103/533 (19%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M+NL G+GGP+  SW +++  L +K  + + +LGM PVL  +SG VP  + +   +A++ 
Sbjct: 658  MANLSGFGGPVHDSWFEERTELARKNQLIMRKLGMQPVLQGYSGMVPTNIHDYDKNAEVI 717

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            + G W S +         +L  T   F +  + F + Q + YG  S+ Y  D F E    
Sbjct: 718  EQGEWCSFQR------PTMLKTTSSTFEKYAKKFYQCQKEVYGDVSNYYATDPFHEG--G 769

Query: 121  VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP-----LG 175
            +      S +   + + M + D DAVW++Q W        +     ALLN +        
Sbjct: 770  ITGGMNASDISEKVLTEMITADKDAVWIIQSW--------QGNPTTALLNGLDRVEKGTD 821

Query: 176  KLVVLDLFAEVKP----------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA-- 223
              ++LDL+AE  P           +   ++F   P+++CML+NF G + ++G LD++A  
Sbjct: 822  HALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWLFCMLNNFGGRLGLHGHLDNLANN 881

Query: 224  -------------FGPVEARTSENTTM-----------------------AWINQYSVRR 247
                          G     +  N  +                        W++ Y+ RR
Sbjct: 882  IPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIWQDDASQKMEVIDLDTWLDDYATRR 941

Query: 248  YGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 302
            YG    +   AW++L  TVY  +      GA +    V+ A P++       T G    +
Sbjct: 942  YGAESESANQAWDILKETVYKASLNGLGQGAPES---VVNARPNL-------TIGAASTW 991

Query: 303  GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 362
            G  V                 + Y   ++  A  L +A  ++L  S  Y+YDL ++ +Q 
Sbjct: 992  GNAV-----------------ISYEKGDLEEAAALLLADYDKLKDSAGYQYDLANVLQQV 1034

Query: 363  LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 422
            L+  A E    +  A+   D       S +F+ ++EDM+ +    + FLLG W+E AK L
Sbjct: 1035 LSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIEDMEKVTGTSEYFLLGRWVEQAKAL 1094

Query: 423  AQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
            A N +   ++ YE+NA+  +T W    Q E   L+DY N+ WSGL+ D+Y  R
Sbjct: 1095 ANNADDFTKELYEFNAKALVTTWGSKNQAEKGGLKDYSNRQWSGLIGDFYKAR 1147


>gi|418515337|ref|ZP_13081518.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
 gi|410708056|gb|EKQ66505.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
          Length = 782

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 154/574 (26%), Positives = 232/574 (40%), Gaps = 111/574 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 185 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 244

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D F+E  PP
Sbjct: 245 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 297

Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
           V                                   ++  G A+Y  +   +  A W+MQ
Sbjct: 298 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 357

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+   + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 358 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 417

Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
           +  +  +YG                   +   FG        N+ +              
Sbjct: 418 YGASNPLYGDFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWEGPQQ 477

Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
               W+  Y   RYGRS  A+  AW+ L   +Y     +               P   + 
Sbjct: 478 SWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRWWNK 522

Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
             G Y  + +P +  A         D P        + RA++  +   N  + +  YRYD
Sbjct: 523 RAGAYLLFKRPTADIADFD------DRPG---DPQRLRRAIDALLQQANRYADAPLYRYD 573

Query: 355 LIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGFLLG 413
           LI+  R  L+  A+     +++AY   D A G  QL+R   +LV  +D L+      L  
Sbjct: 574 LIEDARHYLSLQADRQLQAVVQAYDAGDFARGDAQLART-TQLVRGLDALVGGQYETLAD 632

Query: 414 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
              ++A     +    + Y  NAR Q+++W  +       L DY +K W G+  D+Y  R
Sbjct: 633 WTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFYLQR 687

Query: 474 AAIYFKYMIESLESGDGF-------RLKDWRREW 500
              +      +  +G  F       +L  W R+W
Sbjct: 688 WTRFLSAYRAARMAGTPFDAVAMDHQLATWERQW 721


>gi|418520969|ref|ZP_13087015.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410702945|gb|EKQ61442.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
          Length = 798

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 154/574 (26%), Positives = 232/574 (40%), Gaps = 111/574 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D F+E  PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 313

Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
           V                                   ++  G A+Y  +   +  A W+MQ
Sbjct: 314 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+   + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 374 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433

Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
           +  +  +YG                   +   FG        N+ +              
Sbjct: 434 YGASNPLYGDFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWEGPQQ 493

Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
               W+  Y   RYGRS  A+  AW+ L   +Y     +               P   + 
Sbjct: 494 SWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRWWNK 538

Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
             G Y  + +P +  A         D P        + RA++  +   N  + +  YRYD
Sbjct: 539 RAGAYLLFKRPTADIADFD------DRPG---DPQRLRRAIDALLQQANRYADAPLYRYD 589

Query: 355 LIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGFLLG 413
           LI+  R  L+  A+     +++AY   D A G  QL+R   +LV  +D L+      L  
Sbjct: 590 LIEDARHYLSLQADRQLQAVVQAYDAGDFARGDAQLART-TQLVRGLDALVGGQYETLAD 648

Query: 414 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
              ++A     +    + Y  NAR Q+++W  +       L DY +K W G+  D+Y  R
Sbjct: 649 WTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFYLQR 703

Query: 474 AAIYFKYMIESLESGDGF-------RLKDWRREW 500
              +      +  +G  F       +L  W R+W
Sbjct: 704 WTRFLSAYRAARMAGTPFDAVAMDHQLATWERQW 737


>gi|153814573|ref|ZP_01967241.1| hypothetical protein RUMTOR_00787 [Ruminococcus torques ATCC 27756]
 gi|331089988|ref|ZP_08338878.1| hypothetical protein HMPREF1025_02461 [Lachnospiraceae bacterium
            3_1_46FAA]
 gi|145848067|gb|EDK24985.1| F5/8 type C domain protein [Ruminococcus torques ATCC 27756]
 gi|330402902|gb|EGG82468.1| hypothetical protein HMPREF1025_02461 [Lachnospiraceae bacterium
            3_1_46FAA]
          Length = 1863

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 142/533 (26%), Positives = 235/533 (44%), Gaps = 103/533 (19%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M+NL G+GGP+  SW +++  L +K  + + +LGM PVL  +SG VP  + +   +A++ 
Sbjct: 674  MANLSGFGGPVHDSWFEERTELARKNQLIMRKLGMQPVLQGYSGMVPTNIHDYDKNAEVI 733

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            + G W S +         +L  T   F +  + F + Q + YG  S+ Y  D F E    
Sbjct: 734  EQGEWCSFQR------PTMLKTTSSTFEKYAKKFYQCQKEVYGDVSNYYATDPFHEG--G 785

Query: 121  VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP-----LG 175
            +      S +   + + M + D DAVW++Q W        +     ALLN +        
Sbjct: 786  ITGGMNASDISEKVLTEMITADKDAVWIIQSW--------QGNPTTALLNGLDRVEKGTD 837

Query: 176  KLVVLDLFAEVKP----------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA-- 223
              ++LDL+AE  P           +   ++F   P+++CML+NF G + ++G LD++A  
Sbjct: 838  HALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWLFCMLNNFGGRLGLHGHLDNLANN 897

Query: 224  -------------FGPVEARTSENTTM-----------------------AWINQYSVRR 247
                          G     +  N  +                        W++ Y+ RR
Sbjct: 898  IPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIWQDDASQKMEVIDLDTWLDDYATRR 957

Query: 248  YGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 302
            YG    +   AW++L  TVY  +      GA +    V+ A P++       T G    +
Sbjct: 958  YGAESESANQAWDILKETVYKASLNGLGQGAPES---VVNARPNL-------TIGAASTW 1007

Query: 303  GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 362
            G  V                 + Y   ++  A  L +A  ++L  S  Y+YDL ++ +Q 
Sbjct: 1008 GNAV-----------------ISYEKGDLEEAAALLLADYDKLKDSAGYQYDLANVLQQV 1050

Query: 363  LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 422
            L+  A E    +  A+   D       S +F+ ++EDM+ +    + FLLG W+E AK L
Sbjct: 1051 LSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIEDMEKVTGTSEYFLLGRWVEQAKAL 1110

Query: 423  AQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
            A N +   ++ YE+NA+  +T W    Q E   L+DY N+ WSGL+ D+Y  R
Sbjct: 1111 ANNADDFTKELYEFNAKALVTTWGSKNQAEKGGLKDYSNRQWSGLIGDFYKAR 1163


>gi|336439030|ref|ZP_08618649.1| hypothetical protein HMPREF0990_01043 [Lachnospiraceae bacterium
            1_1_57FAA]
 gi|336017072|gb|EGN46842.1| hypothetical protein HMPREF0990_01043 [Lachnospiraceae bacterium
            1_1_57FAA]
          Length = 1863

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 142/533 (26%), Positives = 235/533 (44%), Gaps = 103/533 (19%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M+NL G+GGP+  SW +++  L +K  + + +LGM PVL  +SG VP  + +   +A++ 
Sbjct: 674  MANLSGFGGPVHDSWFEERTELARKNQLIMRKLGMQPVLQGYSGMVPTNIHDYDKNAEVI 733

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            + G W S +         +L  T   F +  + F + Q + YG  S+ Y  D F E    
Sbjct: 734  EQGEWCSFQR------PTMLKTTSSTFEKYAKKFYQCQKEVYGDVSNYYATDPFHEG--G 785

Query: 121  VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP-----LG 175
            +      S +   + + M + D DAVW++Q W        +     ALLN +        
Sbjct: 786  ITGGMNASDISEKVLTEMITADKDAVWIIQSW--------QGNPTTALLNGLDRVEKGTD 837

Query: 176  KLVVLDLFAEVKP----------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA-- 223
              ++LDL+AE  P           +   ++F   P+++CML+NF G + ++G LD++A  
Sbjct: 838  HALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWLFCMLNNFGGRLGLHGHLDNLANN 897

Query: 224  -------------FGPVEARTSENTTM-----------------------AWINQYSVRR 247
                          G     +  N  +                        W++ Y+ RR
Sbjct: 898  IPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIWQDDASQKMEVIDLDTWLDDYATRR 957

Query: 248  YGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 302
            YG    +   AW++L  TVY  +      GA +    V+ A P++       T G    +
Sbjct: 958  YGAESESANQAWDILKETVYKASLNGLGQGAPES---VVNARPNL-------TIGAASTW 1007

Query: 303  GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 362
            G  V                 + Y   ++  A  L +A  ++L  S  Y+YDL ++ +Q 
Sbjct: 1008 GNAV-----------------ISYEKGDLEEAAALLLADYDKLKDSAGYQYDLANVLQQV 1050

Query: 363  LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 422
            L+  A E    +  A+   D       S +F+ ++EDM+ +    + FLLG W+E AK L
Sbjct: 1051 LSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIEDMEKVTGTSEYFLLGRWVEQAKAL 1110

Query: 423  AQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
            A N +   ++ YE+NA+  +T W    Q E   L+DY N+ WSGL+ D+Y  R
Sbjct: 1111 ANNADDFTKELYEFNAKALVTTWGSKNQAEKGGLKDYSNRQWSGLIGDFYKAR 1163


>gi|381169859|ref|ZP_09879021.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
 gi|380689629|emb|CCG35508.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
          Length = 798

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 153/574 (26%), Positives = 232/574 (40%), Gaps = 111/574 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D F+E  PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 313

Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
           V                                   ++  G A+Y  +   +  A W+MQ
Sbjct: 314 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+   + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 374 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433

Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
           +  +  +YG                   +   FG        N+ +              
Sbjct: 434 YGASNPLYGDFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWESPQQ 493

Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
               W+  Y   RYGRS  A+  AW+ L   +Y     +               P   + 
Sbjct: 494 SWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRWWNK 538

Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
             G Y  + +P +            D P        + RA++  +   N  + +  YRYD
Sbjct: 539 RAGAYLLFKRPTADIVDFD------DRPG---DPQRLRRAIDALLRQANRYADAPLYRYD 589

Query: 355 LIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGFLLG 413
           LI+  R  L+  A+     +++AY   D A G  QL+R   +LV  +D L+      L  
Sbjct: 590 LIEDARHYLSLQADRQLQAVVQAYNAGDFARGDAQLART-TQLVRGLDALIGGQYETLAD 648

Query: 414 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
              ++A     +    + Y  NAR Q+++W  +       L DY +K W G+  D+Y  R
Sbjct: 649 WTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFYLQR 703

Query: 474 AAIYFKYMIESLESGDGF-------RLKDWRREW 500
              +      + ++G  F       +L  W R+W
Sbjct: 704 WTRFLSAYRAARKAGTPFDAVAVDHQLATWERQW 737


>gi|294667089|ref|ZP_06732314.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
           str. ICPB 10535]
 gi|292603099|gb|EFF46525.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
           str. ICPB 10535]
          Length = 798

 Score =  186 bits (472), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 158/576 (27%), Positives = 236/576 (40%), Gaps = 115/576 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W D + VLQK+IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 201 MGNIEGYRAPLPQHWTDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D F+E  PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQTYG-AGEFYLADAFNEMLPP 313

Query: 121 -------VDSPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
                  V + +Y  S+                       G A+Y  +   +  A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+   + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 374 GWLFGADREFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433

Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
           +  +  +YG                   +   FG        N+ +              
Sbjct: 434 YGASNPLYGDFAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVIYAYLYALAWEGPQQ 493

Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
               W+  Y   RYGRS  A+  AW  L   +Y     +               P   + 
Sbjct: 494 SWSQWLTHYLRARYGRSDAALLGAWADLEAGIYQTRYWS---------------PRWWNK 538

Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
             G Y  + +P +            D P        + RA++  +   N  + +  YRYD
Sbjct: 539 RAGAYLLFKRPTADIVDFD------DRPG---DPQRLRRAIDALLQQANRYADAPLYRYD 589

Query: 355 LIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLA-CHDGFLL 412
           LI+  R  L+  A+     +++AY   D A G  QL+R   +LV  +D L+   HD   L
Sbjct: 590 LIEDARHYLSLQADRQLQAVVQAYNAGDFARGDAQLART-TQLVRGLDALVGDQHD--TL 646

Query: 413 GPWL-ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 471
             W  ++A     +    + Y  NAR Q+++W  +       L DY +K W G+  D+Y 
Sbjct: 647 ADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFYL 701

Query: 472 PRAAIYFKYMIESLESGDGF-------RLKDWRREW 500
            R   +      + ++G  F       +L  W R+W
Sbjct: 702 QRWTRFLSAYRAARKAGTPFDAVTVDHQLAAWERQW 737


>gi|374985456|ref|YP_004960951.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
 gi|297156108|gb|ADI05820.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
          Length = 1039

 Score =  185 bits (470), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 156/560 (27%), Positives = 233/560 (41%), Gaps = 88/560 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + N+ G+GGP+ +  +DQ+  L KKI+ R+ ELGM PVLP + G VP       P A + 
Sbjct: 214 LQNMSGFGGPVSKHLIDQRAALAKKIINRVRELGMTPVLPGYYGTVPDDFLAKNPGASLV 273

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W + K  P W     LD    LF E+  AF   Q + YG +S +Y  D   E   P
Sbjct: 274 AQGTWGAFKR-PDW-----LDPRTDLFAEVAAAFYRHQRERYGDSS-MYKMDLLHEGGNP 326

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  +     A+ + +Q   + AVW + G        W+    + +L +V    ++V+
Sbjct: 327 GDVP--VGEAAKAVEAALQKAHAGAVWAILG--------WQTNPSREILGAVDKSMMLVV 376

Query: 181 DLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGN----------IEMY--------GILDS 221
           D  ++    +      + G PY +  + NF G+          +E Y          L  
Sbjct: 377 DGLSDRYTTVIDRESDWDGTPYAFGSIWNFGGHTPIGANAPDWVEQYPKWRDKTGSALTG 436

Query: 222 IAFGPVEARTSENTTMA----------------WINQYSVRRYGRSVPAIQDAWNVLYHT 265
           IA  P E   +    MA                W   Y+V RYG   P    AW  +  T
Sbjct: 437 IAMMP-EGADNNPAAMALFTDLAWTPGAIGLDDWFASYAVSRYGGEDPHAVAAWKAIRDT 495

Query: 266 VYNCTDGATDKNRDVIVAFPD----VDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
            YN +        D     PD      PS+               +K A    E   YD 
Sbjct: 496 AYNMS------RADAWSEAPDGLFGARPSL-------------GANKAAAWGPEADRYDT 536

Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
                + +E+++     +A G  L  S+ Y YDL D+ RQ L+  +  L   I  AY+  
Sbjct: 537 TAFDAALTELLQ-----VAPG--LRDSSAYAYDLADVARQVLSNRSRVLLPQIKTAYEAG 589

Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
           D     +L++ +L  ++ MD +LA     LLG WL  A+       ++ Q E++AR+ IT
Sbjct: 590 DRGRFDRLTKTWLSWMKLMDKVLATSGQHLLGRWLADARSWGATRAEKDQLEYDARSIIT 649

Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 501
            W      E   L DY N+ WSGLL   Y  R   YF  +  +L +G     +    +W 
Sbjct: 650 TWGGRASSEEG-LHDYANREWSGLLGGLYHLRWKTYFDELSTALAAG----RQPAGIDWF 704

Query: 502 KLTNDWQNGRNVYPVESNGD 521
            L + W    + YPV ++GD
Sbjct: 705 ALEDHWARRHDSYPVRTSGD 724


>gi|345881765|ref|ZP_08833275.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
 gi|343918424|gb|EGV29187.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
          Length = 1552

 Score =  185 bits (469), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 139/525 (26%), Positives = 242/525 (46%), Gaps = 78/525 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GWGGP+ ++ ++ +  LQ+K+L R+  LG+ PV+  F G VP+  +  FP+A++ 
Sbjct: 196 MGNLEGWGGPMSEALIEARYQLQRKMLQRMQALGIQPVVQGFPGLVPSFFKERFPAAQLV 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P      LL +   LF ++ +A+ E  ++ YGR       D F E  NT
Sbjct: 256 LQGRWGHFNRPP-----MLLPSDKDLFQQVAKAYYESLIRCYGRDFKFLGGDLFHEGGNT 310

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             VD     +++   +     S    A W++QGW  +  P         LL+ +    ++
Sbjct: 311 KGVDVAATAAAVQQTMLRYFPS----AKWVLQGWNNNPSP--------TLLSKLDKQHVL 358

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART-SENTTM 237
           +++L  E+   W +S +F G P++W  +++F G  +M G L  +   P  A + ++N  M
Sbjct: 359 LINLSGEIAASWESSNEFGGTPWLWGSVNHFGGKTDMGGQLPVLVAEPHRAFSQTKNGVM 418

Query: 238 -----------------------AW----------INQYSVRRYGRSVPAIQDAWNVLYH 264
                                  AW          +  Y   RYG    ++  AW++L H
Sbjct: 419 QGIGILPEGINSNPVVYDLALKTAWYTTTPDLDRLLRDYIAYRYGHVDESLVQAWHILSH 478

Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           +VY                            EG +++     ++  +  +  S++    +
Sbjct: 479 SVYG---------------------EFKIKGEGTFESIF--CARPGLHVTSVSTWGPKQM 515

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
            Y+  ++ +AL LF    ++   S TY+YDL+DL RQ +A +A +++   ++AY+  DA 
Sbjct: 516 QYNPKDLEKALGLFRRVADQYKGSATYQYDLVDLARQVMANHARDVYAAAMQAYRNKDAA 575

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
            + +  + F+ L++  D LL     FLLG WL  A         ++Q   NA+  IT W 
Sbjct: 576 LLHEKGQEFMHLLQLQDRLLQTDTHFLLGNWLAQAANYGVTAADKQQALHNAKMLITYWG 635

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD 489
            ++   A+ + DY NK W+GLL+ YY PR   +F  + +S+ +G+
Sbjct: 636 PDS--AATRVHDYANKEWAGLLKSYYEPRWQKFFYALYQSVNTGE 678


>gi|406693970|gb|EKC97309.1| alpha-N-acetylglucosaminidase, putative [Trichosporon asahii var.
           asahii CBS 8904]
          Length = 781

 Score =  184 bits (468), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 149/574 (25%), Positives = 244/574 (42%), Gaps = 74/574 (12%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+HG W G     WL+ Q  LQK+IL R  E GM PVLP F G VP  L N         
Sbjct: 238 NIHGNWHGTTTWQWLEGQHNLQKQILARQREFGMTPVLPGFCGFVPPELHNYIGGPDFKT 297

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
              W S  ++  +     +D     +  +  AF+ +Q + YG TS  Y  D F E+ P  
Sbjct: 298 YPTWMSFPAE--YTKVRAIDPEWDTWNVVQSAFLRKQKELYGFTSDYYMVDLFTESKPTS 355

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVL 180
             P Y+  +  A+   + +   +A W+MQGW+F  DP  W     KA L+      L+VL
Sbjct: 356 TDPTYLKGIATAVRESIHAVAPNATWIMQGWIFVNDPKSWTETASKAFLDGAG-ESLLVL 414

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN------ 234
           DL AE  P W   K F+G  ++WC L N+  N  +YG LD      ++A+ +        
Sbjct: 415 DLAAESYPQWKRLKNFFGRRWLWCTLINYGQNDGLYGALDKWNHDIMDAKANGGRLSGMG 474

Query: 235 ----------------TTMAWINQ----------YSVRRY-GRSVPAIQDAWNVLYHTVY 267
                           T   W +Q          +  RRY G+++   Q AW +L ++VY
Sbjct: 475 IVPEGINNNEHLFELATDQGWSSQAIDLKQWTQNWVKRRYRGQNLDLAQKAWELLDNSVY 534

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
              + A       ++   D+ P++                  + L   T +Y    + Y 
Sbjct: 535 KSNNTALKCTTRSLI---DLRPAV------------------SGLIGTTGNYLATAITYE 573

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
             +V+ AL+  + S +  +    + YDL+D+ RQ     A  ++  +I A+  ++     
Sbjct: 574 PRDVVAALDNLLQSWSG-AGGQQFDYDLVDVARQVFVNAAIPIYQAMINAWNGSNKADTE 632

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
           +  R  + L+ D+D L+A    F L  W+  A+  AQ+   +   E+ AR Q+ +W   T
Sbjct: 633 KYGRELVGLINDIDRLMATSRHFRLESWVGDARNWAQDAGAKDDMEFQARNQLILWGAAT 692

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI------ 501
                L R Y  K+W G++ + Y     + ++ ++++         K W +         
Sbjct: 693 FAPWPLDR-YAAKHWHGIMSEVYAKGWELLYQNLLKT-------EPKAWNKTAFASELME 744

Query: 502 KLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
           K+   W+N ++       GD++   + L  KY Q
Sbjct: 745 KVEKPWENVKSGGVQGPQGDSVAVIRELREKYKQ 778


>gi|401885538|gb|EJT49648.1| alpha-N-acetylglucosaminidase, putative [Trichosporon asahii var.
           asahii CBS 2479]
          Length = 781

 Score =  184 bits (468), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 149/574 (25%), Positives = 241/574 (41%), Gaps = 74/574 (12%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+HG W G     WL+ Q  LQK+IL R  E GM PVLP F G VP  L N         
Sbjct: 238 NIHGNWHGTTTWQWLEGQHNLQKQILARQREFGMTPVLPGFCGFVPPELHNYIGGPDFKT 297

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
              W S  ++  +     +D     +  +  AF+ +Q + YG TS  Y  D F E+ P  
Sbjct: 298 YPTWMSFPAE--YTKVRAIDPEWDTWNVVQSAFLRKQKELYGFTSDYYMVDLFTESKPTS 355

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVL 180
             P Y+  +  A+   + +   +A W+MQGW+F  DP  W     KA L+      L+VL
Sbjct: 356 TDPTYLKGIATAVRESIHAVAPNATWIMQGWIFVNDPKSWTETASKAFLDGAG-ESLLVL 414

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDS---------------IAFG 225
           DL AE  P W   K F+G  ++WC L N+  N  +YG LD                   G
Sbjct: 415 DLAAESYPQWKRLKNFFGRRWLWCTLINYGQNDGLYGALDKWNHDIMDAKANGGRLSGMG 474

Query: 226 PVEARTSENTTM-----------------AWINQYSVRRY-GRSVPAIQDAWNVLYHTVY 267
            V    + N  +                  W   +  RRY G+++   Q AW +L ++VY
Sbjct: 475 IVPEGINNNEHLFELATDQGWSSQAIDLKQWTQNWVKRRYRGQNLDLAQKAWELLDNSVY 534

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
              + A       ++   D+ P++                  + L   T +Y    + Y 
Sbjct: 535 KSNNTALKCTTRSLI---DLRPAV------------------SGLIGTTGNYLATAITYE 573

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
             +V+ AL+  + S +  +    + YDL+D+ RQ     A  ++  +I A+  ++     
Sbjct: 574 PRDVVAALDNLLQSWSG-AGGQQFDYDLVDVARQVFVNAAIPIYQAMINAWNGSNKADTE 632

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
           +  R  + L+ D+D L+A    F L  W+  A+  AQ+   +   E+ AR Q+ +W   T
Sbjct: 633 KYGRELVGLINDIDRLMATSRHFRLESWVGDARNWAQDAGAKDDMEFQARNQLILWGPAT 692

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI------ 501
                L R Y  K+W G++ + Y     + ++ ++++         K W +         
Sbjct: 693 FAPWPLDR-YAAKHWHGIMSEVYAKGWELLYQNLLKT-------EPKAWNKTAFASELME 744

Query: 502 KLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
           K+   W+N ++       GD++   + L  KY Q
Sbjct: 745 KVEKPWENVKSGGVQGPQGDSVAVIRELREKYKQ 778


>gi|423219557|ref|ZP_17206053.1| hypothetical protein HMPREF1061_02826 [Bacteroides caccae
           CL03T12C61]
 gi|392624762|gb|EIY18840.1| hypothetical protein HMPREF1061_02826 [Bacteroides caccae
           CL03T12C61]
          Length = 715

 Score =  184 bits (468), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 143/540 (26%), Positives = 222/540 (41%), Gaps = 81/540 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGP+PQS +D +  L +K+L R+  LG+ P++P F G VP+ L+N    A I 
Sbjct: 195 MGNIEGWGGPMPQSQIDSRKKLVQKMLKRMKSLGIEPLMPGFYGMVPSNLKNK-SKAHII 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W +           +LD  DP F  +   F ++  + YG     ++ D F E    
Sbjct: 254 PQGTWGAFTRPD------ILDPMDPEFDRVAAIFYDETRRLYGSDIRFFSGDPFHEGG-- 305

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
                 +   G AI   MQ     ++W++QGW  +  P         LL  +    ++V 
Sbjct: 306 ATDGVALGDAGRAIQKTMQKHFPGSIWVLQGWQDNPKP--------GLLEKLDKRYVLVQ 357

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
           +LF E    W T K + G P+IW  + NF     + G L   A     A  SE       
Sbjct: 358 ELFGENTNNWETRKGYEGTPFIWATVTNFGERPGINGKLQRFADEVYRASNSEYAKYMKG 417

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           WI  Y   RYGR    I+ AW ++  ++
Sbjct: 418 VGILPEGINNNPVTYELLLELVWHKDRVDVDQWIESYVTARYGRITDEIRTAWKMMLKSI 477

Query: 267 YNCTDGATD-KNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           Y+   G  +    +++ A P ++   +S      + Y + + K+A               
Sbjct: 478 YSSEVGYQEGPPENILCARPALELKSVSSWGRLAKKYDRDLYKKAAF------------- 524

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
                      LF  +  E +   TYR DLI   RQ +A  A+ +F ++I AYQ      
Sbjct: 525 -----------LFAKAMPEFNEVRTYRIDLIHFLRQVIANEADSVFYDMITAYQEKKVEK 573

Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
             Q   +FL +++  + LLA    F L  W + AK       ++K    N    IT W +
Sbjct: 574 FEQEVSKFLMMIDTENELLAQDPFFRLSTWQQQAKDAGNTAAEKKNNFHNLMMLITYWGE 633

Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD---WRREWIK 502
           +   E + L DY  K W+G++  YY  R  +YF Y+  +L  G+  +  D   W REW++
Sbjct: 634 HVTSEDN-LHDYAYKEWAGMMNTYYKERWLVYFDYL-RALLRGEEAKAPDYFHWEREWVE 691


>gi|153806010|ref|ZP_01958678.1| hypothetical protein BACCAC_00255 [Bacteroides caccae ATCC 43185]
 gi|149130687|gb|EDM21893.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
           43185]
          Length = 715

 Score =  184 bits (468), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 143/540 (26%), Positives = 222/540 (41%), Gaps = 81/540 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGP+PQS +D +  L +K+L R+  LG+ P++P F G VP+ L+N    A I 
Sbjct: 195 MGNIEGWGGPMPQSQIDSRKKLVQKMLKRMKSLGIEPLMPGFYGMVPSNLKNK-SKAHII 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W +           +LD  DP F  +   F ++  + YG     ++ D F E    
Sbjct: 254 PQGTWGAFTRPD------ILDPMDPEFDRVAAIFYDETRRLYGSDIRFFSGDPFHEGG-- 305

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
                 +   G AI   MQ     ++W++QGW  +  P         LL  +    ++V 
Sbjct: 306 ATDGVALGDAGRAIQKTMQKHFPGSIWVLQGWQDNPKP--------GLLEKLDKRYVLVQ 357

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
           +LF E    W T K + G P+IW  + NF     + G L   A     A  SE       
Sbjct: 358 ELFGENTNNWETRKGYEGTPFIWATVTNFGERPGINGKLQRFADEVYRASNSEYAKYMKG 417

Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
                                           WI  Y   RYGR    I+ AW ++  ++
Sbjct: 418 VGILPEGINNNPVTYELLLELVWHKDRVDVDQWIESYVTARYGRITDEIRTAWKMMLKSI 477

Query: 267 YNCTDGATD-KNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           Y+   G  +    +++ A P ++   +S      + Y + + K+A               
Sbjct: 478 YSSEVGYQEGPPENILCARPALELKSVSSWGRLAKKYDRDLYKKAAF------------- 524

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
                      LF  +  E +   TYR DLI   RQ +A  A+ +F ++I AYQ      
Sbjct: 525 -----------LFAKAMPEFNEVRTYRIDLIHFLRQVIANEADSVFYDMITAYQEKKVEK 573

Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
             Q   +FL +++  + LLA    F L  W + AK       ++K    N    IT W +
Sbjct: 574 FEQEVSKFLMMIDTENELLAQDPFFRLSTWQQQAKDAGNTAAEKKNNFHNLMMLITYWGE 633

Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD---WRREWIK 502
           +   E + L DY  K W+G++  YY  R  +YF Y+  +L  G+  +  D   W REW++
Sbjct: 634 HVTSEDN-LHDYAYKEWAGMMNTYYKERWLVYFDYL-RALLRGEEAKAPDYFHWEREWVE 691


>gi|302526099|ref|ZP_07278441.1| alpha-N-acetylglucosaminidase [Streptomyces sp. AA4]
 gi|302434994|gb|EFL06810.1| alpha-N-acetylglucosaminidase [Streptomyces sp. AA4]
          Length = 860

 Score =  184 bits (467), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 140/561 (24%), Positives = 232/561 (41%), Gaps = 85/561 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAAL------QNVF 54
           + N+  + GP+    LD ++ + KK++ R+ +LGM PVLP + G VP          +  
Sbjct: 200 LQNMASFTGPVSPQLLDARVAMAKKVITRLKDLGMTPVLPGYFGTVPRGFADKSKKADAS 259

Query: 55  PSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTF 114
             A++   G W      P W     LD     + ++  AF + Q   +G TS +Y  D  
Sbjct: 260 SDARVIGQGTWVGFDR-PDW-----LDPRTSSYRKVAAAFYQAQHDLFGDTS-MYKMDLL 312

Query: 115 DENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
            E     D P  +      + + +Q+    A W++ GW  +      PP  +A++++V  
Sbjct: 313 HEGGKSGDVP--VGDAARGVMTALQTARPGATWVLLGWQNN------PP--RAIVDAVDK 362

Query: 175 GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG----------------- 217
            KL V+D  ++         Q+   PY +  ++NF G+  +                   
Sbjct: 363 SKLFVVDGLSDRYGQRDPDSQWNNTPYAFGTIYNFGGHTTIGANTGVWTQRFPQWRTKQG 422

Query: 218 -ILDSIAFGPVEARTSENT---------------TMAWINQYSVRRYGRSVPAIQDAWNV 261
             L  IA+ P    T+                    AW   Y+ RRYG        AW++
Sbjct: 423 SALTGIAYLPEGTGTNPAAFELFTELAWRQTPIHQAAWFADYASRRYGGPDTRAATAWDL 482

Query: 262 LYHTVYNC-TDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 320
           L  T Y+    G ++    +  A P++D                         +  +++ 
Sbjct: 483 LRQTAYSMPASGWSEAQDSLYAARPNLD------------------------AATAATWS 518

Query: 321 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
              L Y  +   +AL+  +     L  ++ YR+DL+D+ RQAL   +  L   I  AY  
Sbjct: 519 PASLRYQQATFGKALDELLNVDPALRGTDAYRFDLVDVARQALTNTSRTLLPQIKTAYTN 578

Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
            D      L+ R++  +  +D LLA    FLLGPWLE+AK  A  + ++ + E++AR+ I
Sbjct: 579 RDRTQFTTLTSRWMSNMTLLDKLLATDSRFLLGPWLEAAKSWAGTDTEQARLEYDARSLI 638

Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 500
           T W      +   L DY N+ WSGL+ D+Y  R   YF  +  ++ +G     +    +W
Sbjct: 639 TTWGPRAGSDDGRLHDYANREWSGLVSDFYAKRWKQYFDSLNTAMNTGG----QPASIDW 694

Query: 501 IKLTNDWQNGRNVYPVESNGD 521
               + W   RN YP    GD
Sbjct: 695 FAAEDGWAKQRNPYPTTPAGD 715


>gi|329851961|ref|ZP_08266642.1| alpha-N-acetylglucosaminidase NAGLU family protein [Asticcacaulis
           biprosthecum C19]
 gi|328839810|gb|EGF89383.1| alpha-N-acetylglucosaminidase NAGLU family protein [Asticcacaulis
           biprosthecum C19]
          Length = 731

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 149/568 (26%), Positives = 232/568 (40%), Gaps = 109/568 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLP SW+ ++  LQK+IL  + ELGM P+LPAF+G VP A     P A+I 
Sbjct: 182 MGNIEGYQAPLPLSWIVKKRELQKRILGAMRELGMEPILPAFAGYVPKAFAESHPQARIY 241

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++   F++   + YG+    Y  D F+E  PP
Sbjct: 242 RMRAWEGFHE------TYWLDPADPLFAKLAGRFLDLYDQTYGK-GRFYLADAFNEMLPP 294

Query: 121 V-DSP------------------------EYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 155
           V D P                        E +++ G  ++  ++S   DAVW+MQGWLF 
Sbjct: 295 VGDGPVEGGYGDSTANKEAVAEVDPAVKAERLAAYGQRLHDSIRSARPDAVWVMQGWLFG 354

Query: 156 YDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNI 213
            D  FW    + A L +VP   L+VLD+  +  P +  T++ F+G  +I+  +HN+  + 
Sbjct: 355 ADQGFWTGDAIAAFLRNVPDDGLMVLDIGNDRYPKVRQTAQAFHGKGWIYGYVHNYGASN 414

Query: 214 EMYGILD-------SIAFGPVEAR---------------------------TSENTTMAW 239
            +YG L        +I   P   R                               +   W
Sbjct: 415 PIYGDLGFYRRDMAAITSDPARGRLQGFGVFPEGLDSNSIVYAYLYDLAWNGGTKSLSDW 474

Query: 240 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 299
           +  Y+  RYG S P +  AW  +   VY          R    A+       I++ + + 
Sbjct: 475 LAGYTRARYGISSPEVVTAWLDIVKGVYGTRYWTPRWWRSTAGAYLLCKRPDIAMADFEG 534

Query: 300 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 359
               +   +  + +     +D P L                           RYD+I+ T
Sbjct: 535 APGDRAALRAGLARLAAIRHDSPLL---------------------------RYDVIEFT 567

Query: 360 RQALAKYANELFLNIIEAYQLNDAHGVFQLS---RRFLELVEDMDGLLACHDGFLLGPWL 416
           R   + + + L    + AY+  D     + +   RR    ++D+ G   CH    L  W+
Sbjct: 568 RHLASLHLDNLIRTALVAYRDGDVAAGDRSATEVRRVTIAIDDLMGAQPCH----LAGWI 623

Query: 417 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 476
           E A+       ++  YE NAR Q+T+W          L DY +K W GL RD+Y PR  +
Sbjct: 624 EQARAYGDTATEKPYYERNARAQVTVWGGKGN-----LHDYASKAWQGLYRDFYLPRWEM 678

Query: 477 YFKYMIESL--ESGDGFRLKDWRREWIK 502
            F  +       +    RL  W   W++
Sbjct: 679 LFAALRTGTYNPAATTERLIAWENAWVE 706


>gi|294627661|ref|ZP_06706243.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
           str. ICPB 11122]
 gi|292598013|gb|EFF42168.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
           str. ICPB 11122]
          Length = 798

 Score =  183 bits (465), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 156/576 (27%), Positives = 238/576 (41%), Gaps = 115/576 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+   LPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 201 MGNIEGYRASLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D F+E  PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQTYG-AGEFYLADAFNEMLPP 313

Query: 121 -------VDSPEY-----------------------ISSLGAAIYSGMQSGDSDAVWLMQ 150
                  V + +Y                       ++  G A+Y  +   +  A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSVANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+   + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 374 GWLFGADREFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433

Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
           +  +  +YG                   +   FG        N+ +              
Sbjct: 434 YGASNPLYGDFAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVIYEYLYALAWEGPQQ 493

Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
               W+  Y   RYGRS  A+  AW  L   +Y     +               P   + 
Sbjct: 494 SWSQWLTHYLRARYGRSDAALLGAWADLEAGIYQTRYWS---------------PRWWNK 538

Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
             G Y  + +P +   ++  +    D   L        RA++  +   N  + +  YRYD
Sbjct: 539 RAGAYLLFKRPTAD--IVDFDDCPGDPQRL-------RRAIDALLQQANRYADAPLYRYD 589

Query: 355 LIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLAC-HDGFLL 412
           LI+  R  L+  A+     +++AY   D A G  QL+R   +LV  +D L+   HD   L
Sbjct: 590 LIEDARHYLSLQADRQLQAVVQAYNAGDFARGDAQLART-TQLVRGLDALVGGQHD--TL 646

Query: 413 GPWL-ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 471
             W  ++A     +    + Y  NAR Q+++W  +       L DY +K W G+  D+Y 
Sbjct: 647 ADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFYL 701

Query: 472 PRAAIYFKYMIESLESGDGF-------RLKDWRREW 500
            R   +      + ++G  F       +L  W R+W
Sbjct: 702 QRWTRFLSAYRAARKAGTPFDAVAVDHQLAAWERQW 737


>gi|373461651|ref|ZP_09553390.1| hypothetical protein HMPREF9944_01654 [Prevotella maculosa OT 289]
 gi|371951955|gb|EHO69797.1| hypothetical protein HMPREF9944_01654 [Prevotella maculosa OT 289]
          Length = 713

 Score =  182 bits (462), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 141/509 (27%), Positives = 211/509 (41%), Gaps = 86/509 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GWGGP+ Q ++D Q  L ++IL R+  LG+ PVL  F G V  ++++ +P+A + 
Sbjct: 194 MGNLEGWGGPVSQDFIDAQSRLGRRILDRMATLGIQPVLQGFYGMVSRSIRDRYPNAVMP 253

Query: 61  QLGNW-FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
           Q G W F  + D       +L  T+ LF EI   +  +  K YG   H +  D F E   
Sbjct: 254 Q-GMWGFFERPD-------ILKPTEKLFDEIADTYYREIKKHYGTGFHYFGGDLFHEGGQ 305

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
                  ++  G A+   MQ     + W++QGW  + +P         LL  +   K++V
Sbjct: 306 --TGTLNVADCGLAVQQAMQRNFPGSTWVLQGWSGNPNPL--------LLTKLDREKVLV 355

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           +DLF E    W+ +K + G P++WC++ NF     MYG L  IA    + R S+      
Sbjct: 356 VDLFGENDEAWNRTKAYQGTPFLWCIVSNFGEQCGMYGKLQRIALQIDKVRKSDYKAYLK 415

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  Y   RYG     I  AW +   T
Sbjct: 416 GVGIMPEGINNNPVVYDMVLHAPLTDRKINVEAWLKSYITYRYGSYNADIYAAWLIFLQT 475

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS----KEAVLKSETSSYDH 321
           +Y                         SV E     YG P S    +  V  ++TSS+  
Sbjct: 476 IY------------------------ASVPE----KYGLPESVFCARPGVKVTQTSSWGV 507

Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
              +Y        + LF+ +      S TY YD+ DL RQ  +   N ++ ++I A    
Sbjct: 508 RARYYDMDFFKEGVRLFLKAKTSFEDSETYAYDMFDLLRQVQSDKGNRVYDDMIAAIDAK 567

Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
           + +   Q S RFL  +   D LLA   GF L  WL  A +  +          NA+ Q+T
Sbjct: 568 NPNRFEQTSDRFLHELLRQDTLLAQSKGFTLERWLGQASRFGKTVYDRDLALKNAKMQLT 627

Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYY 470
            W  +     + + DY  K W+G+LR  Y
Sbjct: 628 FWGPDWNPTTT-VHDYAAKEWAGMLRTLY 655


>gi|325922205|ref|ZP_08183992.1| Alpha-N-acetylglucosaminidase (NAGLU) [Xanthomonas gardneri ATCC
           19865]
 gi|325547324|gb|EGD18391.1| Alpha-N-acetylglucosaminidase (NAGLU) [Xanthomonas gardneri ATCC
           19865]
          Length = 807

 Score =  182 bits (461), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 150/579 (25%), Positives = 234/579 (40%), Gaps = 117/579 (20%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W+D + VLQ +IL R+ ELGM PVLPAF+G VP A     P+A+I 
Sbjct: 202 MGNIEGYRAPLPQQWIDSKRVLQTQILTRMRELGMQPVLPAFAGYVPKAFAQAHPNARIY 261

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D F+E  PP
Sbjct: 262 RMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQTYG-AGEFYLADAFNEMLPP 314

Query: 121 -------VDSPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
                  V + +Y  S+                       G A+Y  +   +  A W+MQ
Sbjct: 315 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPQATWVMQ 374

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+P  + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 375 GWLFGADREFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 434

Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
           +  +  +YG                   +   FG        N+ +              
Sbjct: 435 YGASNPLYGDFAFYRQDLQALLADPDKRNLRGFGVFPEGLHSNSVVYEYLYALAWEGPQQ 494

Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
               W+ QY+  RYG S  A+  AW+                         D+D  I   
Sbjct: 495 SWSQWLTQYTRARYGHSDAALLQAWS-------------------------DLDAGIYQT 529

Query: 295 TEGKYQNYGKPVSKEAVLKSETSSY----DHPHLWYSTSEVIRALELFIASGNELSASNT 350
                + + K      + K  T+      D P        + RA++  +   +  + +  
Sbjct: 530 RYWSLRWWNKRAGAYLLFKRPTADIVGFDDRPG---DPQRLRRAIDALLQQADRYADAPL 586

Query: 351 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 410
           YRYDLI+  R  L+ +A+     +++AY   D      L  R   LV+ +D L+      
Sbjct: 587 YRYDLIEDARHYLSLHADRQLQAVVQAYGTGDFARGDALLARTTRLVQGLDALVGGQHET 646

Query: 411 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 470
           L     ++A     +    + Y  NAR Q+++W  +       L DY +K W G+  ++Y
Sbjct: 647 LADWTDQAAAAAGDDAALRRVYVGNARAQVSVWGGDGN-----LADYASKAWQGMYAEFY 701

Query: 471 GPRAAIYFKYMIESLESGDGF-------RLKDWRREWIK 502
             R   +      + ++G  F       +L  W R+W +
Sbjct: 702 LQRWTRFLSAYRAARKAGTPFDEAAFNKQLAAWERQWAE 740


>gi|372221472|ref|ZP_09499893.1| alpha-N-acetylglucosaminidase [Mesoflavibacter zeaxanthinifaciens
           S86]
          Length = 712

 Score =  181 bits (460), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 95/224 (42%), Positives = 131/224 (58%), Gaps = 9/224 (4%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++G  GPLPQ W+ ++  LQKKIL ++ +LGM PV+PAFSG +PAAL   FP+AKI+
Sbjct: 200 MGNINGHAGPLPQEWITKKAKLQKKILSKMRDLGMKPVVPAFSGYIPAALAEKFPNAKIS 259

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L  W     D     TYLLD  DPLF EIG+ FIE   +EYG+  + Y  D+F+E TPP
Sbjct: 260 ELNGWSGGGFD----STYLLDPKDPLFKEIGKRFIELYNQEYGKAEY-YLADSFNEVTPP 314

Query: 121 VDSPEYISSL---GAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK 176
           V +   +  L   G  IY  +      A W+MQGWLF +D  FW    + A L+ VP  K
Sbjct: 315 VSTENKLDELAAYGQVIYETLNEAAPGATWVMQGWLFGHDAYFWEKDAVIAFLSKVPNDK 374

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 220
           L++ D   +   +W     FYG  + +  +HN+ G+  +YG  D
Sbjct: 375 LIIQDFGNDRYKVWEKQDAFYGKQWTYGYVHNYGGSNPIYGDFD 418


>gi|404403947|ref|ZP_10995531.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
          Length = 828

 Score =  180 bits (456), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 157/528 (29%), Positives = 244/528 (46%), Gaps = 62/528 (11%)

Query: 1   MSNLHGW-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
           M N+ G  G P PQ W + Q+ LQ +I+ R+  LGM PV   F+G VP A++ + P   +
Sbjct: 184 MGNMSGLDGAPTPQ-WHEAQIALQHRIIDRMEALGMTPVYQGFAGFVPPAMKRIHPETTL 242

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE-NT 118
           T+   W   K+       ++L   DPLF EIG AF+    +E+G+  + Y  D+F+E + 
Sbjct: 243 TET-KWSGFKN-------WMLSPLDPLFSEIGTAFVRAWEEEFGKGKY-YLIDSFNEMDV 293

Query: 119 P--PVDSPEYISSL---GAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSV 172
           P  P  SPE  ++L   G  IY  +   + DAVW+MQGW+F Y    W P  ++ALL   
Sbjct: 294 PFGPKGSPERAATLRHYGETIYRSLAEANPDAVWVMQGWMFGYQRNSWDPESVRALLEGA 353

Query: 173 PLGKLVVLDLFAEVKP-IWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
           P G++++LDL  +    IW + K       F+G  +I+  + NF G   + G L+  A G
Sbjct: 354 PDGRMMILDLAVDFNNFIWRSEKSWNHLQGFFGREWIYSTVPNFGGRTALIGNLEFYANG 413

Query: 226 PVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFP 285
            +EA +S        N+  +  YG S   ++    ++Y  +     G +D   D+     
Sbjct: 414 HLEALSSP-------NRGRLTGYGTSPEGVESN-EIVYEII--AAAGWSDDRIDLKKFLH 463

Query: 286 DVDPS--------IISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSE-VIRALE 336
           D   +        I     G  Q+     +  A  + +   Y H       +E    A+E
Sbjct: 464 DYSAARYGGCPEGIDRFWSGMLQSSYNECTNNARYRWQLRPYSHRMPTMGINENYYTAIE 523

Query: 337 LFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG----VFQLSRR 392
            F+A   EL  +  YR D I      LA  A+ L    +EA    D +G     +  + R
Sbjct: 524 QFLACAGELGGNELYRTDAIQYAALYLASKADML----LEAANWADLYGAREEAYDCAMR 579

Query: 393 FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEAS 452
             EL+ D D LLA H    L  W   A++    EE+++++   +R  I++W   +     
Sbjct: 580 IEELLLDADRLLASHPLLRLDRWSGMARKAGCTEEEKERFVGESRRLISVWGGPS----- 634

Query: 453 LLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 500
            L DY  + WSG++RDYY PR   Y    +E+   G  F  + W  +W
Sbjct: 635 -LSDYSARVWSGVIRDYYVPRLNKY----LEAKTDGTVFDFRTWDEQW 677


>gi|383643231|ref|ZP_09955637.1| N-acetylglucosaminidase [Sphingomonas elodea ATCC 31461]
          Length = 778

 Score =  179 bits (453), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 143/594 (24%), Positives = 230/594 (38%), Gaps = 115/594 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PL  +W++++ VLQ++IL R+  LGM P+LPAFSG VP A     P AKI 
Sbjct: 209 MGNIAGYRAPLSANWIEKKRVLQRQILARMRSLGMKPILPAFSGYVPEAFAKAHPEAKIY 268

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+  W           TY LD +DPLF  +   F++     YG     Y  D F+E  PP
Sbjct: 269 QMRQWEGFPG------TYWLDPSDPLFARLAARFLQLYTATYG-PGEYYLADAFNEMVPP 321

Query: 121 VDS-------------------------PEYI-----SSLGAAIYSGMQSGDSDAVWLMQ 150
           +                           P+ +     ++ G  +Y  + +   +A W+MQ
Sbjct: 322 IAEDGSDARAATYGDAIANTAATRAAALPKEVRDARLAAYGERLYRSITAAAPNATWVMQ 381

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW P  + A L+ VP  ++++LD+  +  P IW+ ++ FYG  + +  +HN
Sbjct: 382 GWLFGADKAFWTPDAIAAFLSKVPDERMLILDIGNDRYPGIWNATRAFYGKGWAYGYVHN 441

Query: 209 FAGNIEMYGILD---------------------------------------SIAFGPVEA 229
           + G+  +YG L                                         +A+G ++A
Sbjct: 442 YGGSNPVYGDLAFYRSDITAALANPGHGRMRGFGLFPEGLHSNGIAYAYAYDLAWGEIDA 501

Query: 230 RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 289
                   AWI  Y+  RYG++ PA+  AW+      Y                     P
Sbjct: 502 TGKARPLDAWIGDYTRARYGKTSPALVAAWDKAIAGAYTTR---------------YWTP 546

Query: 290 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 349
                  G Y  +  P    A         D+P      + +   +E  +A   +     
Sbjct: 547 RWWHEQAGGYLFFKFPSLDGA---------DYPAAPGDPAALRAGIEALLAQAPQHGGEP 597

Query: 350 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 409
            Y YD++DL R   +   ++     + AY+  D     + +     L   +D  LA +  
Sbjct: 598 LYTYDVVDLVRHYASVQLDDRLKTAVAAYKAGDLAAGDRATAAAERLARHIDA-LAGNQQ 656

Query: 410 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 469
             LG WL  A        ++  +   A+  +T+W          L DY ++ W GL   Y
Sbjct: 657 ETLGSWLADAAAYGDTPAEKAAFVEQAKAVVTVWGGTGH-----LSDYASRAWQGLYAGY 711

Query: 470 YGPRAAIYFKYMIESLESGDGF-------RLKDWRREWIKLTNDWQNGRNVYPV 516
           Y PR   +      +  +   F        ++ W+  W+K    W   R   P+
Sbjct: 712 YWPRWQRFLAAQRAAAAAHTPFDAKATSDAIRTWQAAWLKDGRMWPRQRPAAPL 765


>gi|261880009|ref|ZP_06006436.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
 gi|270333325|gb|EFA44111.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
          Length = 722

 Score =  179 bits (453), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 143/534 (26%), Positives = 228/534 (42%), Gaps = 70/534 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ KI+ R+  LGM+P+ PAF+G VP       P  ++ 
Sbjct: 194 MGNLNSWNGPLTDAWQQGQITLQHKIIDRMRALGMHPIAPAFAGFVPEQFVEAHPGLQVK 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L  W     D R    Y+L    P F +IGR F+E+  KE+G+ +  Y  D+F+E   P
Sbjct: 254 KL-TWGGF--DDR-LNAYVLSPESPYFKQIGRLFVEEWEKEFGKNT-FYQSDSFNEMEIP 308

Query: 121 VDSPEYISS------LGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           V+  + I         G  IY  +   + DAVW+ QGW F Y    W    ++ALL  VP
Sbjct: 309 VEPGDSIGKWKLLEQYGDVIYRSIAEANPDAVWVTQGWTFGYQHKMWDSKSLQALLRHVP 368

Query: 174 LGKLVVLDL-------FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL         + +  W     +YG  +++  + NF G     G +   A   
Sbjct: 369 DDKMLIIDLANDYPKWIWKTQQTWKVQHGYYGKQWVFSYVPNFGGKTLPTGDMQMYASAS 428

Query: 227 VEA-RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFP 285
            EA   SE   M          +G +   I++  +V+Y  + +   G TDK         
Sbjct: 429 AEALHHSERGNMV--------GFGSAPEGIENN-DVIYELLADM--GWTDKA-------V 470

Query: 286 DVDPSIISVTEGKYQNYGKPVSK--EAVLKSETSS-YDHPHLWYST-------------- 328
           D+D  I    E +Y  Y   + K  + +L+S   S Y +P   + T              
Sbjct: 471 DLDLWIKDYCEARYGGYPSDMQKAWQCMLRSVYGSLYSYPRFTWQTVTPDSRRVSTHALN 530

Query: 329 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 388
              +  +  F+    +L +S  YR D I L    L   A+  +   ++            
Sbjct: 531 DTFLSGVAHFLRCARQLGSSPLYRSDAISLASLYLGTKADRHYTKALDLKASGKQQAASA 590

Query: 389 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 448
              + ++L+   D LLA H    L  W++ A+       ++ +YE +A+  IT+W     
Sbjct: 591 ELHQTIDLLTKADRLLASHPTHRLDRWIQFARNHGITTAEKNRYESDAKRLITIW----- 645

Query: 449 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
                  DY  ++W+GL+  YY PR   YF +   +        L  W  +W+K
Sbjct: 646 --GGFQEDYAARFWNGLIAHYYIPRIRYYFDHGRPA--------LMQWEEQWVK 689


>gi|302546018|ref|ZP_07298360.1| LOW QUALITY PROTEIN: putative alpha-N-acetylglucosaminidase
           [Streptomyces hygroscopicus ATCC 53653]
 gi|302463636|gb|EFL26729.1| LOW QUALITY PROTEIN: putative alpha-N-acetylglucosaminidase
           [Streptomyces himastatinicus ATCC 53653]
          Length = 679

 Score =  179 bits (453), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 147/550 (26%), Positives = 226/550 (41%), Gaps = 82/550 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  WGGP+  + L+++  L ++I  R+ ELGM PV P + G VP    +  P A   
Sbjct: 177 MQNMSEWGGPVSTALLEKRTDLGRRIADRLRELGMRPVFPGYFGTVPDGFADRNPGAHTV 236

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G+W  ++  P W     LD     F E+  AF   Q   +G    ++  D   E  N 
Sbjct: 237 PQGDWNGLRR-PDW-----LDPRTDAFHEVAAAFYRHQHDLFG-ACDLFKMDLLHEGGNA 289

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             V  P+       A+   +Q+    A+W++ GW  +          + LL++V    ++
Sbjct: 290 GDVSVPDAAR----AVEKALQTSRPGAIWVILGWQSN--------PRRDLLDAVDHDHML 337

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI----------EMYGI--------LD 220
           V+D  +++  I    K +  VPY +  + NF G            E + +        L 
Sbjct: 338 VVDGLSDLDTITDREKDWGSVPYAFGTIPNFGGRTTIGAKTHMWTERFTVWRDKPGSKLV 397

Query: 221 SIAFGP--VEA-------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
             A+ P  VE              R +     AW   Y+  RYG      ++A+  L  T
Sbjct: 398 GTAYMPEAVERDPAAYELFSELAWRDTAVDRDAWFRDYADVRYGARDAKAREAFAALRDT 457

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
            Y  +      +  V  A P       S+T     NY           + T ++D     
Sbjct: 458 AYQISSKDGRPHDSVFAARP-------SLTARSGTNYA----------THTPAFD----- 495

Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
              +    AL   +     L  S+ YRYDL D  RQALA  + +L   + +AY   D   
Sbjct: 496 --PARFDAALAALLGVRAGLRDSDAYRYDLADTARQALANRSWQLIGQLADAYARKDLDT 553

Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
              LSR +L+L+   D +   H   LLGPWLE AK++A   E+  Q E+ AR  IT W D
Sbjct: 554 FRALSRLWLKLMRLSDDITGTHRLLLLGPWLEDAKRMASGAEESAQLEFAARALITTWAD 613

Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
               +   L +Y N+ W+GL+ D++ P+   Y   + ++L  G   R  D    W  +  
Sbjct: 614 RGAADPGKLANYANRDWNGLIGDFHVPQWQTYLDELEDALAEGRAPRTFD----WYTVEE 669

Query: 506 DWQNGRNVYP 515
            W   R  YP
Sbjct: 670 PWTRERKSYP 679


>gi|293369246|ref|ZP_06615836.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
           3f]
 gi|292635671|gb|EFF54173.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
           3f]
          Length = 521

 Score =  178 bits (451), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 102/300 (34%), Positives = 151/300 (50%), Gaps = 37/300 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GW  PLP+ WL  Q  LQ++I+ R  E  M PVLPAF+G+VPAAL+ V+P+ K T
Sbjct: 196 MCNLDGWQSPLPKEWLSSQAALQEQIVAREREFNMRPVLPAFAGHVPAALKRVYPNIKTT 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W       R  CT+ L+  D L+  I + ++ +Q + YG T+HIY  D F+E  PP
Sbjct: 256 RVSEWGGFADQYR--CTF-LNPMDSLYAIIQKEYLTEQTRLYG-TNHIYGIDPFNEIDPP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
               + +  +   IY  + + D +AVWL   WLF  D   W  P++K+ L SVP  +L++
Sbjct: 312 SWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYADIKHWTTPRIKSYLRSVPQDRLIL 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------- 223
           LD F E   IW  +  ++G PY+WC L NF GN  + G ++ ++                
Sbjct: 372 LDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLSGPVNLVSERLADALKNGGSNLKG 431

Query: 224 ----------------FGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                           F   +A     T   W  + + RR G+  P  + AW +L + VY
Sbjct: 432 VGSTLEGIDLNQFMYEFVLDKAWNGGQTDKEWFFKLADRRIGKISPEARKAWEILANKVY 491


>gi|295690503|ref|YP_003594196.1| alpha-N-acetylglucosaminidase [Caulobacter segnis ATCC 21756]
 gi|295432406|gb|ADG11578.1| Alpha-N-acetylglucosaminidase [Caulobacter segnis ATCC 21756]
          Length = 770

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 153/589 (25%), Positives = 233/589 (39%), Gaps = 111/589 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLP +W+D++  LQ +IL R+  LGM P+LPAF G VP A     P A+I 
Sbjct: 202 MGNIEGYRAPLPTNWIDKKKDLQVQILGRMRSLGMTPILPAFGGYVPKAFAQKNPKARIY 261

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF +I   F+    + YG T   Y  D+F+E  PP
Sbjct: 262 RMRPWEGFHE------TYWLDPADPLFAKIAGRFLALYTQTYG-TGTYYLADSFNEMLPP 314

Query: 121 VD-------------------------------SPEYISSLGAAIYSGMQSGDSDAVWLM 149
           ++                                 + +++ G AIY  ++    DAVW+M
Sbjct: 315 INADGADARDAAYGDGAANTAATKTKVEVDPALKAQRLAAYGKAIYDSIRQARPDAVWVM 374

Query: 150 QGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLH 207
           QGWLF  D  FW P  + A L+ VP  KL++LD+  +  P +W  +K F G P+I+  +H
Sbjct: 375 QGWLFGADSHFWDPTAISAYLSLVPDDKLMILDIGNDRYPAVWKNAKAFGGKPWIYGYVH 434

Query: 208 NFAGNIEMYGILD-------SIAFGPVEAR---------------------------TSE 233
           N+ G+  +YG LD       +IA  P   +                              
Sbjct: 435 NYGGSNPVYGDLDYYRRDIPAIAANPEAGKLAGFGMFPEGLHNNSIVYDAVYDLAWGAGR 494

Query: 234 NTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 293
            +  AW++ Y+  RYG++ P +  A   L    Y+    +               P    
Sbjct: 495 ESLSAWLSTYARARYGKTSPELDAALGQLVEAAYSTRYWS---------------PRWWK 539

Query: 294 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRAL-ELFIASGNELSASNTYR 352
              G Y  + +P +             HP         ++AL  L  A  NE      + 
Sbjct: 540 SKAGAYLFFKRPTATIGEFPP------HPGDRAKLEAAVKALTALAPAYANE----PLFV 589

Query: 353 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 412
            DL D TR       ++L    + AY+  D     Q       L   +D LL       L
Sbjct: 590 LDLTDATRHLATMKIDDLLQAAVAAYRRGDVASGDQARVEIAALALSIDKLLGVQPE-TL 648

Query: 413 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 472
             W++ A+           Y  NA+ Q+T+W          L DY +K W GL R +Y P
Sbjct: 649 ATWIDDARAYGDTPADAAAYVANAKAQVTVWGGEGN-----LNDYASKAWQGLYRGFYLP 703

Query: 473 RAAIYFKYM----IESLESGDGFRLK-DWRREWIKLTNDWQNGRNVYPV 516
           R +++   +      + +     R    W R W+     ++  +   PV
Sbjct: 704 RWSMFLDALKAAGTGTFDEPAAVRASIAWERAWVDAEVAYRREKPADPV 752


>gi|187735714|ref|YP_001877826.1| alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
           BAA-835]
 gi|187425766|gb|ACD05045.1| Alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
           BAA-835]
          Length = 852

 Score =  176 bits (446), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 141/576 (24%), Positives = 244/576 (42%), Gaps = 85/576 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ-NV-FPSAK 58
           M NL G GGPL Q  +++   + ++I+ R+ +LGM PVL  + G VP+  Q NV     K
Sbjct: 198 MGNLEGHGGPLSQQQINKMAQMGRRIVSRMEQLGMTPVLQGYVGFVPSDFQENVRIDGLK 257

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           +   G W + +        +++D T   F ++   + +   K YG    ++  D F E  
Sbjct: 258 LIPQGEWVNFRR------PWVVDPTCEAFPKLAADWYKALRKVYGIPGKMFGGDLFHEGG 311

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
              D    ++     +   MQ     A W++Q W    +P       + LL+ +   + +
Sbjct: 312 RKGDID--VTQAAQEVQKAMQKASPGAFWVIQAW--GGNP------TRELLSGLDPERAL 361

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG--------------------- 217
           VL L  ++       + F G+P++WC L NF GN  MYG                     
Sbjct: 362 VLQLTKDMANGGKNLRTFNGIPWVWCELANFGGNTGMYGGVPLLSRLGSELSGYKDKGLV 421

Query: 218 ----ILDSIAFGPVEAR-------TSENTTM-AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
               + + +   P+          T E+ ++  W+ +Y+ +RYG +  A+  A  VL  +
Sbjct: 422 GMGTLSEGLETNPLHYALFSDRLWTREDISVREWLGKYARQRYGFAPKAVVKALEVLSFS 481

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
           +YN           +I A P  +                          + S++     +
Sbjct: 482 IYNPVRSQEGCTESIICARPSWNV------------------------RKASTWSSGERY 517

Query: 326 YSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
           Y   ++++A   ++ + N+   L    T+RYDL+D+ RQALA  A      +  A+   D
Sbjct: 518 YHLGDIVKAARGYLKAANDQPNLVKKETFRYDLVDVVRQALADAAFYQLQQVRSAFDSGD 577

Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
                +  +RFL L+ DMD LLA    FLLG W + A     + +++   + +A+  IT 
Sbjct: 578 LAAYRKQVKRFLSLISDMDALLATDSQFLLGTWQKRALDWGDSRQEKALMDKSAKMLITT 637

Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
           W D        L DY N+ W+GL+ D+Y PR   +F++ ++ L +G   R         K
Sbjct: 638 WIDQVPRS---LNDYSNRQWAGLVSDFYLPRWKNFFEFQMDVL-TGKKTRDAAHAAFMDK 693

Query: 503 LTND---WQNGRNVYPVESNGDALITSQWLYNKYLQ 535
           +  D   +     +Y V+  GD L  +  + N + +
Sbjct: 694 MVRDELAFAGNGKIYSVKPAGDTLAVANRVMNTHRE 729


>gi|229818803|ref|YP_002880329.1| alpha-N-acetylglucosaminidase [Beutenbergia cavernae DSM 12333]
 gi|229564716|gb|ACQ78567.1| Alpha-N-acetylglucosaminidase [Beutenbergia cavernae DSM 12333]
          Length = 751

 Score =  176 bits (446), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 162/562 (28%), Positives = 252/562 (44%), Gaps = 66/562 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M +   +GGPLP SW +++  L ++IL R  ELGM  VLPAF G+VP  L          
Sbjct: 194 MGSTSSFGGPLPDSWFERRAELGRRILERQRELGMRAVLPAFGGHVPDGLGA-------- 245

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
                   ++  +   T LL   D  F  +   F  QQ + +G T H+Y  D F E+ PP
Sbjct: 246 ------GARTHWQGFSTALLGPDDDAFAVVAAEFARQQRELFG-TDHLYAADPFIESVPP 298

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              PE +++  AA Y+GM + D +A W+MQ W F Y   FW   ++ A+ ++VP  +L++
Sbjct: 299 SGEPEDLAAFAAATYAGMSAADPEATWVMQAWPFHYHRRFWTAERIAAVTDAVPRDRLLL 358

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA--FGPV---EARTSEN 234
           LDL+AE  P+W   +      ++WC +HNF G   ++G L  +A   G V    ART   
Sbjct: 359 LDLWAEHAPVWDDGRGIAEHQWLWCAVHNFGGRFSVHGDLHGLARDLGGVLDDGARTGGF 418

Query: 235 TTM----------------------------AWINQYSVRRYG-----RSVPAIQDAWNV 261
           T +                            AW+ ++  +RYG      +  A+  AW +
Sbjct: 419 TGVGMAMEALENNPVFYELLTDLVWERPDVDAWVGRFVDQRYGFADGTAARDAVHGAWAI 478

Query: 262 LYHTVYNCTDGATDKNRDVIVAFPD--VDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 319
           L  T+Y    G T      ++A P   V P       G++ +   PV   A + +E    
Sbjct: 479 LLRTLYGP--GMTRSIPSPVIARPADVVAPFHTQRLAGEFLDPDAPVIVSANIDAEAD-- 534

Query: 320 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 379
             P +     E+ RA  L   +     A     +DL DL    +A+        I+ A +
Sbjct: 535 --PRVEGDLPEIARAAALLREAAGSSDAGGPLAHDLADLLTHVVAQRTRAPIRAIVAAAR 592

Query: 380 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 439
             DA  V          + D+D + A     LLG WL +A++ A ++ + +    +AR Q
Sbjct: 593 AGDADAVRANGALLAAAIADLDAVAATQPDRLLGTWLAAAQRWADDDGERRVLLRDARRQ 652

Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 499
           +T+W     E+ S L DY  ++WSGLL  +Y PR  ++  ++ E+ ESG     ++ RR 
Sbjct: 653 LTVW----GEQTSGLHDYSGRHWSGLLGGFYAPRWQLWVDWLAEAAESGSEPDPQELRRA 708

Query: 500 WIKLTNDWQNGRNVYPVESNGD 521
            + L   W       P +  GD
Sbjct: 709 VVALEESWVARDETGPTDPAGD 730


>gi|308480701|ref|XP_003102557.1| hypothetical protein CRE_04113 [Caenorhabditis remanei]
 gi|308261289|gb|EFP05242.1| hypothetical protein CRE_04113 [Caenorhabditis remanei]
          Length = 718

 Score =  175 bits (444), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 128/526 (24%), Positives = 228/526 (43%), Gaps = 86/526 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQN---VFPSA 57
           M NL  +GG L  + +     L K+I+ R+ ELG+ P+LP F+G VP  L+    +FP++
Sbjct: 210 MGNLKAYGGGLSDAQMLNDFNLAKRIINRLLELGIVPILPTFAGFVPDQLEKDFRLFPTS 269

Query: 58  KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDE 116
           K  +L  W +  S+    C   +   DPLF +IG  F+  Q K  G   +++Y+ D F+E
Sbjct: 270 KFNRLPCWNNFTSET--SCLLSVSPFDPLFQKIGSTFLRHQKKMLGGDITNLYSADPFNE 327

Query: 117 NTPPVDSPEYISSL----GAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSV 172
              P DS ++ +S       +I +  +  D + +W++Q W F+YD  W    +K+ L++V
Sbjct: 328 -ILPSDSSKFDASFMKQTAQSIMNSCRKVDKNCIWVLQSWSFTYDQ-WPNWAIKSFLSAV 385

Query: 173 PLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS 232
           P+G L++LDL++EV P W  +  F+G  ++WC+LHNF G+ E+ G L  +  G   A   
Sbjct: 386 PIGNLLILDLYSEVVPAWQMTSSFHGHNFVWCLLHNFGGSRELRGNLQKVDKGYQLALMK 445

Query: 233 ENTTMA---------------------------------WINQYSVRRYGRSVPAIQDAW 259
             + +                                  W+  YS  RY          W
Sbjct: 446 AGSNLVGAGLSMEAIDQNYVVYQFMIDRMWSQEPIPLNNWLKSYSESRYSADFKVSHKFW 505

Query: 260 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 319
            +L  + Y+  +   +    V +                   Y +P   + +        
Sbjct: 506 TILAGSFYSQPEKWGNPRFSVFL-------------------YHRPAFAKKI-------- 538

Query: 320 DHPHLWYSTSEVIRALELFIAS-GNELSASNTYRYDLIDLTRQALA-KYANELFLNIIEA 377
                W+   E    L+  + S  + L     ++ DL D+ R  +  +  NE  L++ EA
Sbjct: 539 ---EYWFPVEETFNHLQSLMPSLMHVLGDHPLFKEDLNDVMRAVIQFEIGNEAALSLTEA 595

Query: 378 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNAR 437
           + + D   +       +++ + ++     +       W+E +K +A   E+ + +   A 
Sbjct: 596 FLMEDKQQIGASCENLMDMFQKLES----YSNRDFKEWIEDSKSIAPTSEERQVFPVTAS 651

Query: 438 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 483
             +T+W    Q       DY ++ W+GLL  YYG R   +  +++E
Sbjct: 652 DILTVWGPTGQN-----LDYAHREWAGLLSGYYGRRWQYFCDWILE 692


>gi|453051703|gb|EME99203.1| alpha-N-acetylglucosaminidase [Streptomyces mobaraensis NBRC 13819
           = DSM 40847]
          Length = 763

 Score =  174 bits (442), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 140/564 (24%), Positives = 233/564 (41%), Gaps = 83/564 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  +GGP+ ++ LD++  L ++I  R+ ELG+ PVLP ++G VP         A+  
Sbjct: 239 MQNMSAFGGPVSRALLDRRTALAQRITRRLRELGITPVLPGYAGTVPPDFTRRNKGARTV 298

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G+W      P W     LD     F  + R +   Q + YG  S +Y  D   E   P
Sbjct: 299 PQGDWAGFPR-PDW-----LDPRTAHFARVARTYYRVQRELYG-ASSMYKIDLLHEGGTP 351

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
              P  + +   A+   +++   DA W + GW  +          + +L++V   K++VL
Sbjct: 352 --GPVPVGAAAKAVEKALRAAHPDATWAILGWQTN--------PRREILDAVDRSKMLVL 401

Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYG------------------ILDS 221
           D   +  P +    K + G PY +  + NF G+  M                     L  
Sbjct: 402 DGIPDHYPRVTDREKDWGGTPYAFGTIWNFGGHTAMGANTQDWVSLFHRWRTKKGSALRG 461

Query: 222 IAFGPVEARTSENTTMA----------------WINQYSVRRYGRSVPAIQDAWNVLYHT 265
           IA  P EA  +    +A                W  ++ V+RYG + P  + AW+VL  T
Sbjct: 462 IALMP-EAADNNPAALALFSDLAWTEGRLDLKDWFARWPVQRYGAADPNARRAWDVLRRT 520

Query: 266 VYNCT--DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
            Y  T  DG ++    +  A PD+                      AV  +  +++    
Sbjct: 521 AYGTTRADGWSEAADGLFGARPDL----------------------AV--NRAAAWSPRQ 556

Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
           L Y  +    AL   +A    L  S+ YR DL D+ RQ ++  +  L   I  AY   D 
Sbjct: 557 LRYDAAAFDEALPALLAVAPALRGSSAYRCDLTDVARQCVSNRSRLLLPRIKAAYDAGDR 616

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
                L+R++L+ +  ++  +A  +  LLG W+  A+       +  + E +A + +T+W
Sbjct: 617 TRFRTLTRQWLDWMTLLEETVATSERHLLGRWIAEARAWGGTAAERDRLEHDAVSLLTVW 676

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 +   L DY N+ W+GL+   Y  R   YF  +  +L +    R K    +W  L
Sbjct: 677 GPRASADGGKLHDYANREWAGLVGGLYRLRWKTYFTELEAALTA----RRKPKPIDWYAL 732

Query: 504 TNDWQNGRNVYPVESNGDALITSQ 527
            + W   R  YP + +GD +  ++
Sbjct: 733 EDRWTRKRPAYPAKPSGDIVAVAR 756


>gi|291302495|ref|YP_003513773.1| alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
           44728]
 gi|290571715|gb|ADD44680.1| Alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
           44728]
          Length = 696

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 151/574 (26%), Positives = 246/574 (42%), Gaps = 95/574 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M  + G+GG + +  ++++  L ++I  R+ ELG+ PVLP F+G VP    ++  +A I 
Sbjct: 178 MGCMCGFGG-VSRRLVEERAELGRRITDRMRELGIEPVLPGFAGLVPG---DIGDTAAIP 233

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
           Q G WF     P W  T     T   + E+   F  +Q +  G T      D   E   +
Sbjct: 234 Q-GQWFGFDR-PAWLPT-----TTRAYAEVAEVFYAKQTERLGAT-RAQAVDLLHEGGTS 285

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             VD  +    + AA    M+    D +W++Q W       W  P  + L  +       
Sbjct: 286 GGVDLADATRGIAAA----MERAHDDYLWVLQAW-------WDNPLPEVLAAT----DSD 330

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM- 237
            L L       W  +K ++G P+    L NF G   ++G L  IA  P      + +++ 
Sbjct: 331 HLLLLDLTGEGWRKTKGWHGKPWARGSLTNFGGRTVLFGGLPEIAELPSLKDDPKASSLV 390

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+ +Y   RYG++ P    AW+ L  T
Sbjct: 391 GTALVEEAWQVNPVVWSLFTQTSWADGDIDLNAWVPEYVAARYGKAHPRAVRAWHGLLAT 450

Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH-L 324
            Y   DG       ++ A P +D                         ++ +S + PH L
Sbjct: 451 AYRSMDGRPGGAESLLCAMPSLD-------------------------ADRASMNGPHSL 485

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
            Y    +  A    +A+   L  ++T+R+DL+D+TRQ ++  A  L   +  AY + +  
Sbjct: 486 PYPAEALEVAWRDLLAAREALGGADTFRFDLVDVTRQVISNRARPLLPLLRTAYAMKELD 545

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
               LS  F++L E +D +LA  + FL+G WL  A+ LA +E++    E++ART IT W 
Sbjct: 546 RFIALSHSFIDLFELLDPVLATREEFLVGRWLADARALAADEDEADALEFDARTIITTWG 605

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
           D+ +  A+L+ DY N  W+GL+ DYY PR   Y K +   L  G      D+  +     
Sbjct: 606 DSPESSATLI-DYANHEWAGLIADYYRPRWEKYLKSLETELREGKPAEPIDFYAD----A 660

Query: 505 NDWQNGRNVYPVESNGDALITSQWLYNK--YLQG 536
             W    + YP E +GDA+ + + +++   Y +G
Sbjct: 661 AAWARSHDTYPTEPSGDAVSSCRAVHHALPYFEG 694


>gi|418473272|ref|ZP_13042874.1| putative alpha-N-acetylglucosaminidase, partial [Streptomyces
           coelicoflavus ZG0656]
 gi|371546106|gb|EHN74664.1| putative alpha-N-acetylglucosaminidase, partial [Streptomyces
           coelicoflavus ZG0656]
          Length = 716

 Score =  171 bits (434), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 136/548 (24%), Positives = 224/548 (40%), Gaps = 77/548 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+ GP+ +  ++Q+  L ++I  R+ ELGM PVLP + G VP       P   + 
Sbjct: 213 MQNMSGFAGPVSERLIEQRAALGRRIANRLRELGMTPVLPGYYGTVPPDFTARNPGGTVV 272

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W   +  P W     LD    +F  +  +F   Q + +G  S +Y  D   E   P
Sbjct: 273 PQGQWVGFER-PDW-----LDPRTGVFSRVAASFYRHQRELFG-DSTMYKMDLLHEGGRP 325

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            + P  +     A+ + +Q+    AVW + GW  +       P  + ++++V   +L+++
Sbjct: 326 GNVP--VGDAARAVMNALQTARPGAVWTLIGWQNN-------PSTQ-IIDAVDKSRLLIV 375

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI------------------EMYGILDSI 222
           D  ++          ++G PY +  + NF G+                   E    L  I
Sbjct: 376 DGLSDRYDGLDRETAWHGAPYAFGTIPNFGGHTTVGANTAVWAERFDRWRTEPGSALAGI 435

Query: 223 AFGP---------------VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
           A+ P               +  RT       W   Y+ RRYGR  P    AW +L    Y
Sbjct: 436 AYLPEGTGGNPVAYELFTELAWRTEPVDHSGWFAAYAERRYGRPDPHAARAWELLRTGPY 495

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           +   G   + +D +                         ++  +  +  +S+    + Y 
Sbjct: 496 SMPSGTWSEAQDSLF-----------------------TARPRLTATSAASWSPGAMRYD 532

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
              V  AL   +     L  ++ YR+DL+D+ RQALA  +  L   I  AY   D     
Sbjct: 533 PDTVRAALAELLKVAPALRTTDAYRFDLVDVARQALANRSRSLLPEIKAAYDAGDLSRFR 592

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
             +  + + ++ +D LLA    FLLGPWL  A+   +   ++   E++AR+ +T W   +
Sbjct: 593 AGAAEWKDDLDLLDRLLATDSRFLLGPWLADARSWGRTAAEKDAAEFDARSLLTTWGHRS 652

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
             +A  LRDY N+ WSGL+ D+Y  R   Y   +  +L +G      D    W  L N W
Sbjct: 653 GSDAGGLRDYANREWSGLVSDFYAMRWTTYLDSLDTALVTGRPPAAID----WFSLENAW 708

Query: 508 QNGRNVYP 515
               + YP
Sbjct: 709 NQRHDDYP 716


>gi|257067709|ref|YP_003153964.1| Alpha-N-acetylglucosaminidase (NAGLU) [Brachybacterium faecium DSM
           4810]
 gi|256558527|gb|ACU84374.1| Alpha-N-acetylglucosaminidase (NAGLU) [Brachybacterium faecium DSM
           4810]
          Length = 768

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 145/583 (24%), Positives = 235/583 (40%), Gaps = 91/583 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M   H  G  L    L+ +  L ++I  R  ELGM  VLP F G +PA L     + ++ 
Sbjct: 201 MGITHDLGAALTDEALEARAELGRRIAERERELGMTVVLPGFGGQLPAELVG---TERMI 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
               W +  + P           DPLF E   +    Q +  G T H Y  D + E+ PP
Sbjct: 258 DWQGWHNALAAP----------GDPLFAEAAASLHRHQRQLLG-TDHHYAVDPYIESLPP 306

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
             SP+ ++    AI++ M+  D  AVW++QGW F Y   +W   ++ +LL+ VP  +L++
Sbjct: 307 TTSPQQLAEHAEAIFTAMRDADPQAVWILQGWPFHYRAAYWTEERVHSLLSRVPEDRLIL 366

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSI----------------- 222
           LDL+ E  P+W  +   YG  ++WC+ H F G   ++G L ++                 
Sbjct: 367 LDLWGEHAPMWHRTAAMYGRRWLWCLAHTFGGRFGLFGDLAALDDDLRGLRTAAEAGTRG 426

Query: 223 ---AFGPVEARTSENTTM----------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
               FG       +N  +                 W+ ++ +RRYG + P +Q AW V+ 
Sbjct: 427 RLEGFGITSEALDDNAVVYELATRALWSPMPPRERWLEEHIIRRYGTAAPEVQQAWQVIA 486

Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
           HT+Y    G T      ++A P                 G P + + +        D P 
Sbjct: 487 HTLYGP--GRTRSTPSPLIARP--------------WTRGLPFASQRLAGEALPDADGPP 530

Query: 324 LWYSTSE--------------VIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANE 369
                +E               +R+L   + SG      +    DL  L     A+ A  
Sbjct: 531 SANIDAENDAEMLGALAPLAHAVRSLLPVLRSGEH---RDALARDLAQLAIHVGAQSARA 587

Query: 370 LFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ- 428
               I+ A    D   +   +     L+  +D + A     L+G W+  A+  A  +E+ 
Sbjct: 588 PLRAIVAAAAEADGERLRAEASTLEALLRAVDAVAATRPDMLVGRWIADARAGAGTDERL 647

Query: 429 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES- 487
               E +AR+ I++W   TQ+  S L DY  ++WSG L D +  R   +  ++  + E  
Sbjct: 648 ADALERDARSLISVW--GTQD--SGLHDYSARHWSGSLTDLHLARWRAWTDWLARTAEEP 703

Query: 488 GDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD-ALITSQWL 529
                L+    +   +  DW++    YP    G+ A   SQ L
Sbjct: 704 STPPDLEQLHAQIRGIEEDWRDSTAPYPTTPRGEPAAAISQLL 746


>gi|16124795|ref|NP_419359.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus CB15]
 gi|221233511|ref|YP_002515947.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus NA1000]
 gi|13421729|gb|AAK22527.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus CB15]
 gi|220962683|gb|ACL94039.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus NA1000]
          Length = 770

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 138/540 (25%), Positives = 217/540 (40%), Gaps = 104/540 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLP +W+D++  LQ KIL R+  LGM P+LPAF G VP A     P A+I 
Sbjct: 202 MGNIEGYKAPLPTAWIDKKKDLQVKILGRMRSLGMTPILPAFGGYVPKAFAEKNPKARIY 261

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF +I   F+    + +G  ++ Y  D+F+E  PP
Sbjct: 262 RMRPWEGFHE------TYWLDPADPLFAKIAARFLALYTETFGAGTY-YLADSFNEMLPP 314

Query: 121 VD-------------------------------SPEYISSLGAAIYSGMQSGDSDAVWLM 149
           ++                                 + +++ G AIY  ++    DAVW+M
Sbjct: 315 INADGADARDAAYGDGTANTAVTKTKVEVDPALKAQRLAAYGKAIYDSIRQTRPDAVWVM 374

Query: 150 QGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLH 207
           QGWLF  D  FW P  + A L+ VP  KL++LD+  +  P +W  +K F G P+I+  +H
Sbjct: 375 QGWLFGADSHFWDPAAISAYLSLVPDDKLMILDIGNDRYPNVWKNAKAFGGKPWIYGYVH 434

Query: 208 NFAGNIEMYGILD-------SIAFGPVEARTS---------------------------E 233
           N+ G+  +YG L        +IA  P   + +                           +
Sbjct: 435 NYGGSNPVYGDLGFYRQDIPAIAANPDAGKLAGFGMFPEGLHNNSIVYEAVYDLAWSEGQ 494

Query: 234 NTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 293
            +   W+ +Y+  RYG++ PA+  A   L    ++    +               P    
Sbjct: 495 ASPATWLTRYARARYGKTSPALDAALGQLVEAAFSTRYWS---------------PRWWK 539

Query: 294 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRY 353
              G Y  + +P +            D P      +++  A++   A          +  
Sbjct: 540 SKAGAYLFFKRPTATVG---------DFPQHPGDRAKLEAAVKALTALAPTYGQEPLFVL 590

Query: 354 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 413
           DL D TR       ++L    + AY+  D             L   +D LL       L 
Sbjct: 591 DLTDATRHLATMKIDDLLQVAVAAYRRGDTAAGDAARVEIEALALSIDKLLGVQPD-TLA 649

Query: 414 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
            W++ A+           Y  NA+ Q+T+W          L DY +K W GL + +Y PR
Sbjct: 650 TWIDEARAYGDTPADAAAYVANAKAQVTIWGGEGN-----LNDYASKAWQGLYKSFYLPR 704


>gi|329934959|ref|ZP_08285000.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
 gi|329305781|gb|EGG49637.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
          Length = 1017

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 136/562 (24%), Positives = 223/562 (39%), Gaps = 81/562 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPS-AKI 59
           + NL+G+GGPL    + ++  L ++I  R+  LGM PVLP + G+VP    +     A +
Sbjct: 191 LQNLYGYGGPLSAELIARRAALGRRIADRLRALGMRPVLPGYYGHVPKDFADRRGGDAHV 250

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
              G W      P W     LD     F E+  +F   Q   +G     +  D   E   
Sbjct: 251 VPQGTWHGFDR-PSW-----LDPRTDAFAEVAASFYRHQEDVFGPAGD-FKMDLLHEGGT 303

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
             D P   ++ G  +   +++    A W++ GW  +         +  LL++V   ++++
Sbjct: 304 AGDVPVPDAARG--VEKALRAARPGATWVILGWEAN--------PLPELLDAVDKKRMLI 353

Query: 180 LDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG------------------ILD 220
           +D  ++    +    + + G PY +  + NF G   +                     L 
Sbjct: 354 VDGVSDRYTSVTDREEDWGGTPYAFGTIPNFGGRTTIGARTHIWREKFFAWRDKPGSALA 413

Query: 221 SIAFGPVEAR--------------TSENTTMA-WINQYSVRRYGRSVPAIQDAWNVLYHT 265
             A+ P  A               T E    A W   Y+  RYG      + AW  L+ T
Sbjct: 414 GTAYLPEAADRDPAAFELFSELAWTDEPVDRARWFTGYADFRYGGRDAGARRAWRALHDT 473

Query: 266 VYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
            Y    +  +D +  +  A PD+                          +  + Y    L
Sbjct: 474 AYQQHANERSDPHDSLFCARPDL------------------------AATRAARYAPAAL 509

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
            Y  +    AL   +A          YRYDL+D+ RQALA  + +    +  A+   DA 
Sbjct: 510 TYDPARFDAALSGLLAVAAHRRGGAAYRYDLVDVARQALAHRSRQYLPQLKAAFDREDAA 569

Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
               L+ ++L L+   + +   H  FLLGPW+E A+++A N  +  ++E  A+  +T+W 
Sbjct: 570 TFKALATQWLTLMRLSEDITGTHPAFLLGPWIEDARRMATNPRERAEFERTAKALVTVWG 629

Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
           D    +A  L +YGN+ W GLL D+Y PR   +     ++L +G      D    W    
Sbjct: 630 DRATSDAGNLHEYGNREWHGLLSDFYLPRWQKWLDACEDALATGTAPAAVD----WFAFE 685

Query: 505 NDWQNGRNVYPVESNGDALITS 526
             W   R  YP+   GDA  T+
Sbjct: 686 EPWTRERKDYPLRPVGDAYRTA 707


>gi|333023613|ref|ZP_08451677.1| putative alpha-N-acetylglucosaminidase [Streptomyces sp. Tu6071]
 gi|332743465|gb|EGJ73906.1| putative alpha-N-acetylglucosaminidase [Streptomyces sp. Tu6071]
          Length = 741

 Score =  168 bits (426), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 147/560 (26%), Positives = 241/560 (43%), Gaps = 90/560 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + NL  +  P+    ++Q+  L  +I+ R+ ELGM+PVLP + G VPA   +  P AK  
Sbjct: 215 LQNLSSFPEPVTARLIEQRAALGARIVGRLRELGMSPVLPGYFGTVPAGFADRNPGAKTV 274

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W    + P W     LD    LF E+  AF E Q + YGR + +Y  D   E    
Sbjct: 275 PQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEIQEELYGRGT-LYKMDLLHEGGSA 327

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            + P   ++ G  +   +++   DAVW++ GW  +      PP  K ++ +     ++V+
Sbjct: 328 GNVPVGDATRG--VQRALRAARPDAVWVILGWQKN------PP--KEVVAAADREAMLVV 377

Query: 181 D----LFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN----------IEMY--------GI 218
           D     F+EV         + G PY +  + NF G+          +++Y          
Sbjct: 378 DGLSDRFSEVN---DRESDWQGTPYAFGSIWNFGGHTALGANARDWVDLYPRWRDRSGSR 434

Query: 219 LDSIAFGPVEARTSENTTMA------------------WINQYSVRRYGRSVPAIQDAWN 260
           L  IA  P  A   +N   A                  W  +Y+  RYG S    + AW+
Sbjct: 435 LSGIALMPEAA---DNNPAAFELFAELPWTEGPVDLTDWFREYARVRYGGSDAHAEAAWD 491

Query: 261 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 320
           +L  T Y       D+  +         P++ +V+ GK+            L+   +S++
Sbjct: 492 ILRTTAYGTRR--DDRWSEPADGLFGARPALDAVSAGKW--------SPKALRYPAASFE 541

Query: 321 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
                        AL+  +A   EL  S TYR DL+D+ RQALA  +  L   +  AYQ 
Sbjct: 542 P------------ALDELLAVRAELRDSATYRRDLLDVARQALANRSRTLLPRLAAAYQA 589

Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
            +     +L RR++ L++ ++ L+A  +  LLG W+ESA+    +  ++ Q +++A + +
Sbjct: 590 KNQAEFARLGRRWIALMDLLEQLVATDENHLLGRWVESARAWGGSAREKSQLQYDALSLL 649

Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 500
           T W      +A  LRDY N+ WSGL+   Y  R + Y   +  +L+ G     K    +W
Sbjct: 650 TTWGTRQGADAG-LRDYANREWSGLVGGLYRLRWSTYIDELSAALKEG----RKPVAVDW 704

Query: 501 IKLTNDWQNGRNVYPVESNG 520
             L + W         +  G
Sbjct: 705 FALEDRWTRNPGALATQPRG 724


>gi|318078904|ref|ZP_07986236.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SA3_actF]
          Length = 719

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 140/533 (26%), Positives = 235/533 (44%), Gaps = 80/533 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + NL  +  P+    ++Q+  L  +I+ R+ ELGM+PVLP + G VPA   +  P AK  
Sbjct: 193 LQNLSSFPEPVTARLIEQRAALGARIVGRLRELGMSPVLPGYFGTVPAGFADRNPGAKTV 252

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W    + P W     LD    LF E+  AF E Q + YGR + +Y  D   E    
Sbjct: 253 PQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEIQEELYGRGT-LYKMDLLHEGGSA 305

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            + P   ++ G  +   +++   DAVW++ GW  +      PP  K ++ +     ++V+
Sbjct: 306 GNVPVGDATRG--VQRALRAARPDAVWVILGWQKN------PP--KEVVAAADREAMLVV 355

Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGN----------IEMY--------GILDS 221
           D  ++  P +      + G PY +  + NF G+          +++Y          L  
Sbjct: 356 DGLSDRFPEVNDRESDWQGTPYAFGSIWNFGGHTALGANTRDWVDLYPRWRDRSGSRLSG 415

Query: 222 IAFGPVEARTSENTTMA------------------WINQYSVRRYGRSVPAIQDAWNVLY 263
           IA  P  A   +N   A                  W  +Y+  RYG S    + AW++L 
Sbjct: 416 IALMPEAA---DNNPAAFELFAELPWTEGPVDLTDWFREYARVRYGGSDAHAEAAWDILR 472

Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
            T Y       D+  +         P++ +V+ GK+            L+   +S++   
Sbjct: 473 TTAYGTRR--DDRWSEPADGLFGARPALDAVSAGKW--------SPKALRYPAASFEP-- 520

Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
                     AL+  ++   EL  S TYR DL+D+ RQALA  +  L   +  AY+  + 
Sbjct: 521 ----------ALDELLSVRAELRDSATYRRDLLDVARQALANRSRTLLPRLAAAYKAKNQ 570

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
               +L RR++ L++ ++ L+A  +  LLG W+ESA+    +  ++ Q +++A + +T W
Sbjct: 571 AEFARLGRRWIALIDLLEQLVATDENHLLGRWVESARAWGGSAREKNQLQYDALSLLTTW 630

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
                 +A  LRDY N+ WSGL+   Y  R + Y   +  +L+ G      DW
Sbjct: 631 GTRQGADAG-LRDYANREWSGLVGGLYRLRWSTYIDELSAALKEGRKPVAVDW 682


>gi|429201402|ref|ZP_19192867.1| Tat pathway signal sequence domain protein [Streptomyces ipomoeae
           91-03]
 gi|428663010|gb|EKX62401.1| Tat pathway signal sequence domain protein [Streptomyces ipomoeae
           91-03]
          Length = 1042

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 136/559 (24%), Positives = 222/559 (39%), Gaps = 83/559 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAA-LQNVFPSAKI 59
           + NL G+GGPL    +D++  L ++I  R+ ELGM+PVLP + G+VP   ++     A +
Sbjct: 212 LQNLSGYGGPLSPELIDRRAALGRRIADRLRELGMSPVLPGYYGHVPKEFVERNGGDAHV 271

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
              G W   +  P W     LD     F ++  +F   Q   +G  +H +  D   E   
Sbjct: 272 VPQGVWHGFER-PDW-----LDPRTDSFAKVAASFYGHQEDVFGEAAH-FKMDLLHEGGT 324

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
             D P  +      +   +Q     A W++ GW    +P      +  LL+++   ++++
Sbjct: 325 AGDVP--VPGAAQGVERALQKARPGATWVILGW--QENP------LPELLDAIDKSRMLI 374

Query: 180 LDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG------------------ILD 220
           +D  ++    +    + + G PY +  + NF G   +                     L 
Sbjct: 375 VDGVSDRYTSVTDRERDWGGTPYCFGTIPNFGGRTTIGARAHIWNEKFFAWRDKANSALA 434

Query: 221 SIAFGPVEARTSENTTM----------------AWINQYSVRRYGRSVPAIQDAWNVLYH 264
             AF P EA   +                    AW + Y+  RYG    + + AW  L+ 
Sbjct: 435 GTAFMP-EATDRDPAAFELFSELAWTPTKIDRAAWFSAYADYRYGARDDSARRAWRALHD 493

Query: 265 TVYNCTD-GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
           T Y       +D +  +  A PD+                             + Y    
Sbjct: 494 TAYQQRAVERSDPHDSLFCARPDL------------------------AADRAAEYAPRA 529

Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
           L Y       AL   +     L  S  Y+YD++D+ RQALA  + +    +  AYQ  D 
Sbjct: 530 LTYDPGRFDAALAGLLGVAGGLRGSAAYKYDVVDVARQALAHRSRQYLPQLRAAYQRKDL 589

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
                LS  +L L+   D +   +  FLLGPW+  A+ LA N+ +  ++E  A+  IT+W
Sbjct: 590 ATFRALSTLWLRLMRLSDEVTGANSAFLLGPWVNDARLLATNDAERAEFERTAKVLITVW 649

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 +A  L +YGN+ W GL+ D+Y PR   +   + ++L +G      D    W   
Sbjct: 650 GGRATSDAGDLHEYGNREWHGLMADFYVPRWEKWLDTLEDALATGTAPAAVD----WFAF 705

Query: 504 TNDWQNGRNVYPVESNGDA 522
              W   R  Y +   GDA
Sbjct: 706 EEPWTRERKDYALRPVGDA 724


>gi|318057780|ref|ZP_07976503.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SA3_actG]
          Length = 741

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 143/557 (25%), Positives = 241/557 (43%), Gaps = 84/557 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + NL  +  P+    ++Q+  L  +I+ R+ ELGM+PVLP + G VPA   +  P AK  
Sbjct: 215 LQNLSSFPEPVTARLIEQRAALGARIVGRLRELGMSPVLPGYFGTVPAGFADRNPGAKTV 274

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W    + P W     LD    LF E+  AF E Q + YGR + +Y  D   E    
Sbjct: 275 PQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEIQEELYGRGT-LYKMDLLHEGGSA 327

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            + P   ++ G  +   +++   DAVW++ GW  +      PP  K ++ +     ++V+
Sbjct: 328 GNVPVGDATRG--VQRALRAARPDAVWVILGWQKN------PP--KEVVAAADREAMLVV 377

Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGN----------IEMY--------GILDS 221
           D  ++  P +      + G PY +  + NF G+          +++Y          L  
Sbjct: 378 DGLSDRFPEVNDRESDWQGTPYAFGSIWNFGGHTALGANTRDWVDLYPRWRDRSGSRLSG 437

Query: 222 IAFGPVEARTSENTTMA------------------WINQYSVRRYGRSVPAIQDAWNVLY 263
           IA  P  A   +N   A                  W  +Y+  RYG S    + AW++L 
Sbjct: 438 IALMPEAA---DNNPAAFELFAELPWTEGPVDLTDWFREYARVRYGGSDAHAEAAWDILR 494

Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
            T Y       D+  +         P++ +V+ GK+            L+   +S++   
Sbjct: 495 TTAYGTRR--DDRWSEPADGLFGARPALDAVSAGKW--------SPKALRYPAASFEP-- 542

Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
                     AL+  ++   EL  S TYR DL+D+ RQALA  +  L   +  AY+  + 
Sbjct: 543 ----------ALDELLSVRAELRDSATYRRDLLDVARQALANRSRTLLPRLAAAYKAKNQ 592

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
               +L RR++ L++ ++ L+A  +  LLG W+ESA+    +  ++ Q +++A + +T W
Sbjct: 593 AEFARLGRRWIALIDLLEQLVATDENHLLGRWVESARAWGGSAREKNQLQYDALSLLTTW 652

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 +A  LRDY N+ WSGL+   Y  R + Y   +  +L+ G     K    +W  L
Sbjct: 653 GTRQGADAG-LRDYANREWSGLVGGLYRLRWSTYIDELSAALKEG----RKPVAVDWFAL 707

Query: 504 TNDWQNGRNVYPVESNG 520
            + W         +  G
Sbjct: 708 EDRWTRNPGTLATQPRG 724


>gi|429198382|ref|ZP_19190217.1| alpha-N-acetylglucosaminidase (NAGLU) [Streptomyces ipomoeae 91-03]
 gi|428665917|gb|EKX65105.1| alpha-N-acetylglucosaminidase (NAGLU) [Streptomyces ipomoeae 91-03]
          Length = 747

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 139/555 (25%), Positives = 233/555 (41%), Gaps = 77/555 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + NL  +  P+ +  LD +  L ++I+ R+ ELGM PV P + G VP       P A+  
Sbjct: 220 LQNLSSFPSPVSRQLLDARAALGRRIVGRLRELGMTPVFPGYFGTVPPGFAERNPGARTV 279

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G+W    + P W     LD     F  +  AF   Q + +G  S +Y  D   E   P
Sbjct: 280 PQGDWMGF-ARPDW-----LDPRTNEFKRVAAAFYRAQDELFGGPSTLYKMDLLHEGGDP 333

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  ++     +   +++   DA W++ GW  +      PP  +A++++V   +++V+
Sbjct: 334 GDVP--VADAAKGVERALRAAHPDATWVILGWQHN------PP--RAIVDAVDKKRMLVV 383

Query: 181 DLFAEVKPIWSTSKQFYG-VPYIWCMLHNFAGNI----------EMY--------GILDS 221
           D  ++  P     +  +G  PY +  + NF G+           E+Y          L  
Sbjct: 384 DGLSDRFPTVIDREADWGDTPYAFGSIWNFGGHTALGANTPVWAELYEKWRTKDGSKLRG 443

Query: 222 IAFGPVEA---------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
           IA  P  A               R  E     W ++++  RYG   P  + AW++L  T 
Sbjct: 444 IALMPEAADNNPAAFALFSELAWRKDELDLKTWFSEWAHARYGARDPHAEAAWDILRRTA 503

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y    G T  +R                +EG    +G   S+ A+     + +    L Y
Sbjct: 504 Y----GTTRADR---------------WSEGADGLFG---SRPALNTVRAARWSPKQLRY 541

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             +E   AL   ++    L +S+ YR DL+D+ RQ L+  +  L   I  AY   D    
Sbjct: 542 DAAEFEPALGELLSVRPGLRSSSAYRRDLLDVARQTLSNRSRVLLPRIRGAYDARDTARF 601

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
            +L+  +L L++ +D LLA     LLG W+  A+    ++ + ++  ++  + +T+W   
Sbjct: 602 DELTGTWLSLMDLLDRLLATDSAHLLGRWVADARAWGASDAERERLAYDNLSLLTVWGTR 661

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
              +A  LRDY N+ W+GL+   Y  R + YF+ +  +L  G   +  D    W  L + 
Sbjct: 662 KGADAG-LRDYANREWAGLVGGLYRLRWSTYFEELRAALREGRTPKKID----WFALEDR 716

Query: 507 WQNGRNVYPVESNGD 521
           W         E  GD
Sbjct: 717 WTRAPGRLATEPTGD 731


>gi|291301158|ref|YP_003512436.1| alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
           44728]
 gi|290570378|gb|ADD43343.1| Alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
           44728]
          Length = 734

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 136/554 (24%), Positives = 229/554 (41%), Gaps = 79/554 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  + GP+ Q  LD +  L ++I  R+ ELG+ PVLP + G +P         A+  
Sbjct: 209 MQNLSAFPGPISQHLLDSRAELARRIRTRMAELGIRPVLPGYFGTIPGGFAKRNQQARTV 268

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W+   S P W     LD T   F ++  +F   Q +  G  + +Y  D   E   P
Sbjct: 269 PQGVWYGF-SRPDW-----LDPTGNEFAKVAASFYRHQAQLLGE-ADMYKMDLMHEGGDP 321

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
              P   ++ G A+   +Q     A W+M GW        R      +L  +   +++++
Sbjct: 322 GGIPIPDAAKGVAL--ALQRARPGATWVMLGW--------RKNPRTDILTDIDTSRVLIV 371

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
           D  ++          + G PY +  + NF G+  +       A    + RT+ ++ ++  
Sbjct: 372 DGISDRFDDLDREHTWPGTPYAFGTIPNFGGHTTIGANAKVWAKRFGQWRTAPDSAVSGI 431

Query: 239 ------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVYN 268
                                         W   Y+ RRYG +    + AW+ L  + Y 
Sbjct: 432 AWMPEGAGRDPAAFELFAELAWRDSIDLGEWFADYADRRYGGADDNARTAWDALRRSAYA 491

Query: 269 CTDGATDKNRDVIV-AFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
              G   +  D +  A P +D     VT   Y                   +    L Y 
Sbjct: 492 MPSGRWAEAADGLFGARPGLD-----VTHADY-------------------FSPEFLRYD 527

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
            +   +AL   +     L  ++ YR+DL+D+ RQ+L     EL   +  A+   +     
Sbjct: 528 AAVFAQALPALLDVDKSLH-NDAYRFDLVDVARQSLVNAGRELLPRVKSAFVNQNKKQFD 586

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
           + +R +L+ +  +D LL     FLLGPWLE+A++ A+  ++ K  E++ART +++W   +
Sbjct: 587 KHTRTWLDWMRLLDRLLETDRRFLLGPWLEAARRSARTADEAKDLEYDARTIVSVWGHRS 646

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
             +   L DY N+  +GL+ D Y  R   YF  + ESL+SG   +  D    W  L ++W
Sbjct: 647 GSDEGRLHDYANRELAGLVSDLYAMRWRRYFDSLAESLDSGQAPQHID----WFALEHEW 702

Query: 508 QNGRNVYPVESNGD 521
            +  + +  E  GD
Sbjct: 703 ASKTDDHATEPKGD 716


>gi|365876979|ref|ZP_09416485.1| alpha-N-acetylglucosaminidase [Elizabethkingia anophelis Ag1]
 gi|442587289|ref|ZP_21006107.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Elizabethkingia
           anophelis R26]
 gi|365755253|gb|EHM97186.1| alpha-N-acetylglucosaminidase [Elizabethkingia anophelis Ag1]
 gi|442562959|gb|ELR80176.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Elizabethkingia
           anophelis R26]
          Length = 712

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 140/551 (25%), Positives = 227/551 (41%), Gaps = 87/551 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GWGGP+    + QQ  LQKKIL R+ ELG+ PVL  F G VP  L+N    AK+ 
Sbjct: 193 MGNLEGWGGPVSMDMMKQQAELQKKILKRMKELGIEPVLQGFYGMVPHDLKNKISEAKVI 252

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
           + G W      P      +LD T  LF +I   +  +    YG   H +  + F E   T
Sbjct: 253 EQGKWAGEFQRPG-----ILDPTTKLFSKIADTYYTEMKNLYGEDIHYFGGEPFHEGGKT 307

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D    + S    I + MQ    ++ W++QG        W+      LL  +     +
Sbjct: 308 NGLDLKNVVES----IQTSMQKSYPNSTWVLQG--------WQQNPSDGLLAGLKKENTL 355

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGIL------------------- 219
           +++LF E    W   K + G  +IW  + NF     +YG L                   
Sbjct: 356 IIELFGENTANWEKRKGYGGTSFIWSNVSNFGEKNGLYGKLQRFIDEVFRAKESIYGANL 415

Query: 220 -------DSIAFGPVE-------ARTSENTTM-AWINQYSVRRYGRSVPAIQDAWNVLYH 264
                  + I   PV        A  SE   +  W+ +Y+  RYG+    +  AW     
Sbjct: 416 KGIGIIPEGIFNNPVAYDLMLDIAWYSEKPILDQWLTEYTKYRYGKENQDVIQAWKEFAQ 475

Query: 265 TVYNCTDGATD-KNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
           T+Y+  D   +  +  +  A P ++ + +S    + +NY +   KEAV            
Sbjct: 476 TIYSSPDVYQEGPSESIYCARPSLNVNPVSSWGTRKRNYDQSRFKEAV------------ 523

Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
                       ++F+ +  +   S TY+ D  D  RQ  A   + ++  +I+A      
Sbjct: 524 ------------KVFVKADTDFKDSETYQTDKTDFLRQVWANKGDVVYDELIKAIHEKKT 571

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
             + +   +FLE++   + LL  +  F L   L+ A+   +     +   +NA++Q+T W
Sbjct: 572 TKIQKSGHQFLEMISIQNMLLGNNRYFTLNRLLKEAEHFGEKLPDAQNVMFNAKSQLTYW 631

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
             +   +   LRDY +K W+GLL   Y  R    +K  IE  +SG    +      +  +
Sbjct: 632 GPDNNPKTD-LRDYAHKEWNGLLSSLYYNR----WKVFIEQAQSG----IITAPEVFYNM 682

Query: 504 TNDWQNGRNVY 514
             +W  G+N+Y
Sbjct: 683 EVEWSKGKNMY 693


>gi|290956360|ref|YP_003487542.1| alpha-N-acetylglucosaminidase [Streptomyces scabiei 87.22]
 gi|260645886|emb|CBG68977.1| putative alpha-N-acetylglucosaminidase [Streptomyces scabiei 87.22]
          Length = 732

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 146/556 (26%), Positives = 229/556 (41%), Gaps = 80/556 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + NL G+  P+ +  LD + VL ++I  R  ELGM PV P + G VPA      P A+  
Sbjct: 206 LQNLSGFPSPVSRQLLDARAVLGRRIADRARELGMIPVFPGYFGTVPAGFAERVPGARTV 265

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W    + P W     LD     F  +  AF   Q + +G  S +Y  D   E   P
Sbjct: 266 PQGRWMGF-ARPDW-----LDPRTDEFARVAAAFYRTQDEMFG-PSALYKMDLLHEGGDP 318

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  ++     +   +Q     A W+M GW  +      PP  +A++++V    ++V+
Sbjct: 319 GDVP--VADAAKGVERALQRAHPGATWVMLGWQHN------PP--RAIVDAVDKQHMLVV 368

Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYG------------------ILDS 221
           D  ++  P +      + G PY +  + NF G+  +                     L  
Sbjct: 369 DGLSDRFPTVTDREADWGGTPYAFGSIWNFGGHTALGANTPDWAALYEKWRTKDGSTLHG 428

Query: 222 IAFGPVEA---------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
           IA  P  A               R  E     W  +++  RYG   P  + AW++L  T 
Sbjct: 429 IALMPEAADNNPAAFALFSELAWREGELDLETWFAEWAHARYGARDPHAEAAWDILRRTA 488

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y  T                      S +EG    +G   S+ A+       +    L Y
Sbjct: 489 YGTT-------------------RADSWSEGADGLFG---SRPALTAVRAGRWSPKQLRY 526

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
           + ++   AL   +    EL AS+ YR DL+D+ RQAL+  +  +   +  AY   DA  +
Sbjct: 527 NAADFEPALGEMLKVRPELRASSAYRRDLLDVARQALSNRSRVMLPQLKAAYDAKDAARL 586

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
            + SR +L L++ +D L+A     LLG W+  A+  A    +  +  ++A + +T+W   
Sbjct: 587 AKGSRDWLSLMDLLDELVATDSRHLLGRWVADARSWAVGSTERTELAYDALSLLTVW--G 644

Query: 447 TQEEASL-LRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
           T+E A   LRDY N+ W+GL+   Y  R A YF+ +  +L  G   +  D    W  L +
Sbjct: 645 TREGADAGLRDYANREWAGLVGGLYRLRWATYFEELRAALAEGRAPKKID----WFALED 700

Query: 506 DWQNGRNVYPVESNGD 521
            W         E  GD
Sbjct: 701 RWARNPGTLATEPAGD 716


>gi|440695019|ref|ZP_20877582.1| Tat pathway signal sequence domain protein [Streptomyces
           turgidiscabies Car8]
 gi|440282912|gb|ELP70302.1| Tat pathway signal sequence domain protein [Streptomyces
           turgidiscabies Car8]
          Length = 1050

 Score =  164 bits (416), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 147/569 (25%), Positives = 237/569 (41%), Gaps = 85/569 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAAL--QNVFPSAK 58
           + N+ G+GGP+ +  ++++  L  KI  R+ ELGM PVLP + G VP     +N   +A 
Sbjct: 216 LQNMSGFGGPVSRRLIEKRADLAAKITERVRELGMTPVLPGYFGTVPDEFVARNGGDAAV 275

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           + Q G+W + K  P W     LD     F E+  AF + Q + +G  S +Y  D   E  
Sbjct: 276 VPQ-GDWGAFKR-PDW-----LDPRTTAFGEVAAAFYQAQSERFG-DSTMYKMDLLHEGG 327

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
            P D P  +     A+ + ++     AVW + GW  +       P  + +L++V   ++ 
Sbjct: 328 NPGDVP--VGRAAQAVEAALRKAHPGAVWAILGWQNN-------PSGE-ILDAVDKSRMF 377

Query: 179 VLDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGN----------IEMY--------GIL 219
           V+D  ++    +      + G PY +  + NF G+          +E Y          L
Sbjct: 378 VVDGLSDRYTTVTDRESDWGGTPYAFGSIWNFGGHTPMGANAPDWVEQYPKWRDKEDSAL 437

Query: 220 DSIAFGPVEARTSENTTMA---------------WINQYSVRRYGRSVPAIQDAWNVLYH 264
             IA  P  A  +                     W   Y+V RYG   P    AW ++  
Sbjct: 438 AGIAAMPEAADNNHAALALLTDLAWTPGTIDLDDWFASYAVSRYGAEDPHALAAWKIIGD 497

Query: 265 TVYNCT--DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 322
           T Y  +  DG ++    +  A P +                   +K A    E   YD  
Sbjct: 498 TAYGMSRADGWSEAPDGLFGARPSLG-----------------ANKAAAWGPEADRYD-- 538

Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
                T+    AL   +     L  ++ YRYDL D+ RQ L+  +  L   I  AY   D
Sbjct: 539 -----TTAFDLALTELLQVAPALRGNSAYRYDLADVARQVLSNRSRMLLPQIRAAYDTAD 593

Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
                +L+  +L+ +  MD +LA     LLG WL  A+       ++ Q E++AR+ IT 
Sbjct: 594 RVRFDELTGVWLDWMRLMDKVLATSGQHLLGRWLADARSWGATRGEKDQLEYDARSIITT 653

Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
           W      E   L DY N+ WSGL+   Y  R  +YF+ +  +L      +  D    W  
Sbjct: 654 WGGRASSEEG-LHDYANREWSGLVGGLYLTRWTLYFRELSRALRQNRPPKTVD----WFT 708

Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYN 531
           L +DW +  + +P +++GD    ++ ++N
Sbjct: 709 LEDDWAHRHDSHPTKTSGDVHKLARRVHN 737


>gi|398786493|ref|ZP_10549210.1| alpha-N-acetylglucosaminidase [Streptomyces auratus AGR0001]
 gi|396993639|gb|EJJ04702.1| alpha-N-acetylglucosaminidase [Streptomyces auratus AGR0001]
          Length = 1048

 Score =  161 bits (407), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 132/565 (23%), Positives = 225/565 (39%), Gaps = 80/565 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + N+ G+GGP     + ++  L ++I  R+ ELGM+PVLP + G VP       P A+  
Sbjct: 206 LQNMSGYGGPTSSELIAKRAELGQRITGRLRELGMHPVLPGYFGTVPGGFAARNPGARTV 265

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W  + + P W     LD    +F +   AF   Q    G   H +  D   E   P
Sbjct: 266 PQGTWSGL-ARPDW-----LDPRTEVFAKTAAAFYRHQEHLLGPADH-FKMDLLHEGGDP 318

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  +     A+   +++    A W++ GW  +          + LL++V   +++++
Sbjct: 319 GDVP--VPDAARAVEKALRTARPGATWVILGWQNN--------PRRDLLDAVDHDRMLIV 368

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI------------------LDSI 222
           D  ++++ +    + + GVPY +  + NF G   +                     L   
Sbjct: 369 DGLSDLETVTDRERDWGGVPYAFGSIPNFGGRTTIGAKTHVWAERFPAWRDKPGSRLAGT 428

Query: 223 AFGPVEA---------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
           A+ P  A               R       AW + Y+  RYG      + A+  L  + Y
Sbjct: 429 AYMPEAAERDPAAFELFSELAWRERPVDRAAWFDGYADLRYGARDKGARAAFAALGTSAY 488

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
             +      +  V  A PD+                       V  + T ++D       
Sbjct: 489 EISSKDGRPHDSVFAARPDLA-----------------ARSGTVYATHTPAFD------- 524

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
            +    A    +     L  S+ YR DL D  RQALA  + +L   + +AY+  D     
Sbjct: 525 PAAFDTAFAALLTVRPALRGSDAYRRDLTDTARQALANRSWQLIGQLQDAYRRKDRATFR 584

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
            LS  +L L+   + +   H  FLLGPWL  A+ +A   E+E + E +AR  +T W D  
Sbjct: 585 ALSGLWLHLMRLSEDVTGAHRQFLLGPWLTDARAMASGPEEEARLEHSARALLTTWADRP 644

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW-RREWIKLTND 506
             +   L +Y N+ W GL+ + + P+   Y   + ++L +    +  DW  RE       
Sbjct: 645 TADGGSLANYANRDWHGLIGEVHLPQWQAYLGELADALAADRPPKPFDWYARE-----EP 699

Query: 507 WQNGRNVYPVESNGDALITSQWLYN 531
           W + R   P+    +A  T++ +++
Sbjct: 700 WTHERTTPPLHPTTEAYRTARRVHD 724


>gi|29828556|ref|NP_823190.1| alpha-N-acetylglucosaminidase [Streptomyces avermitilis MA-4680]
 gi|29605660|dbj|BAC69725.1| putative alpha-N-acetylglucosaminidase [Streptomyces avermitilis
           MA-4680]
          Length = 728

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 133/555 (23%), Positives = 222/555 (40%), Gaps = 77/555 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + NL  +  P+ Q  LD +  L ++I  R+ ELGM PV P + G VP    +    A   
Sbjct: 201 LQNLSAFPDPVSQQLLDARAALGRRIANRLRELGMTPVFPGYFGTVPPGFADRNAGAHTV 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W    + P W     LD     F  +  AF   Q + +G  S  Y  D   E   P
Sbjct: 261 PQGTWMGF-ARPDW-----LDPRTEHFTRVAAAFYRIQDEMFGGASTRYKMDLLHEGGSP 314

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  +      +   +++    AVW++ GW  +      PP  +A++++V   +++V+
Sbjct: 315 GDVP--VGDAAKGVERALRAAHPGAVWVILGWQHN------PP--RAIVDAVDKDRMLVV 364

Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYG------------------ILDS 221
           D   +  P +      ++G PY +  + NF G+  +                     L  
Sbjct: 365 DGLCDRFPKVTDREADWHGTPYAFGSIWNFGGHTTLGANTPDWASLYERWRTRPGSTLRG 424

Query: 222 IAFGPVEA---------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
           +A  P  A               R  +    AW  +++  RYG   P  + AW++L  T 
Sbjct: 425 VALLPEAADNNPAAFALFSELAWREGDLDLRAWFARWARSRYGGRDPHAEAAWDILRRTA 484

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y  T                      S +EG    +G   ++ ++  ++ +S+    L Y
Sbjct: 485 YGTT-------------------RADSWSEGADGLFG---ARPSLAATKAASWSPKRLRY 522

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
              E   AL   +     L  S+ YR DL+D+ RQAL+  +  L   I  AY+  D    
Sbjct: 523 RPEEFEPALGELLKVRPGLRGSSAYRRDLLDVARQALSNRSRVLLPQIRTAYEAKDTARF 582

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
            +L+  +L L++ ++ LLA     LLG W+  A+    +  +  +  ++A + +T+W   
Sbjct: 583 DRLTGVWLALMDLLEALLATDSRHLLGRWVADARAWGASAAERDRLAYDALSLLTVWGTR 642

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
              +A  LRDY N+ W+GL+   Y  R + YF  +  +   G   +  D    W  L + 
Sbjct: 643 AGADAG-LRDYANREWAGLVGGLYRLRWSTYFAELRSASREGRTPKKTD----WFALEDR 697

Query: 507 WQNGRNVYPVESNGD 521
           W            GD
Sbjct: 698 WTRNPGGLATRPTGD 712


>gi|456388164|gb|EMF53654.1| alpha-N-acetylglucosaminidase [Streptomyces bottropensis ATCC
           25435]
          Length = 732

 Score =  158 bits (399), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 141/556 (25%), Positives = 228/556 (41%), Gaps = 80/556 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + NL  +  P+ +  LD + VL ++I  R+ ELGM PV P + G VPA      P A+  
Sbjct: 206 LQNLSAFPSPVSRQLLDARAVLGRRIADRVRELGMTPVFPGYFGTVPAGFAERVPGARTV 265

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W    + P W     LD     F  +  AF   Q + +G +S +Y  D   E   P
Sbjct: 266 PQGEWMGF-ARPDW-----LDPRTDDFARVAAAFYRVQEEMFGPSS-LYKMDLLHEGGDP 318

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  ++     +   ++     A W++ GW  +      PP  +A++++V    ++V+
Sbjct: 319 GDVP--VADAAKGVERALRRSRPGATWVILGWQHN------PP--RAIVDAVDKQHMLVV 368

Query: 181 DLFAEVKPIWSTSKQFYG-VPYIWCMLHNFAGNIEMYG------------------ILDS 221
           D  ++  P  +  +  +G  PY +  + NF G+  +                     L  
Sbjct: 369 DGLSDRFPTVTDREADWGDTPYAFGSIWNFGGHTALGANTPDWAALYEKWRTKDGSRLHG 428

Query: 222 IAFGPVEA---------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
           IA  P  A               R  E     W  +++  RYG   P  + AW++L  T 
Sbjct: 429 IALMPEAADNNPAAFALFSELAWREGELDLKTWFAEWAHARYGGRDPHAEAAWDILRRTA 488

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y  T                      S +EG    +G   S+ A+       +    L Y
Sbjct: 489 YGTT-------------------RADSWSEGADGLFG---SRPALNAVRAGRWSPKQLRY 526

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
             ++   AL   +    EL AS+ YR DL+D+ RQAL+  +  +   I  AY   DA  +
Sbjct: 527 DAADFEPALGEMLRVRPELRASSAYRRDLLDVARQALSNRSRVMLPQIKAAYDAKDATRL 586

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
              SR +L L++ +D L+A     LLG W+  A+       +  +  ++  + +T+W   
Sbjct: 587 AAASRDWLSLMDLLDELVATDSRHLLGRWVADARSWGAGAAERTELGYDNLSLLTVW--G 644

Query: 447 TQEEASL-LRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
           T+E A   LRDY N+ W+GL+   Y  R + YF+ +  +L +G   +  D    W  L +
Sbjct: 645 TREGADAGLRDYANREWAGLVGGLYRLRWSTYFEELRAALAAGRAPKKID----WFALED 700

Query: 506 DWQNGRNVYPVESNGD 521
            W         E  GD
Sbjct: 701 RWARNPGPLATEPTGD 716


>gi|297194750|ref|ZP_06912148.1| alpha-N-acetylglucosaminidase [Streptomyces pristinaespiralis ATCC
           25486]
 gi|297152431|gb|EFH31740.1| alpha-N-acetylglucosaminidase [Streptomyces pristinaespiralis ATCC
           25486]
          Length = 816

 Score =  155 bits (393), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 139/541 (25%), Positives = 230/541 (42%), Gaps = 68/541 (12%)

Query: 16  LDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPS--AKITQLGNWFSVKSDPR 73
           ++++  L ++I  R+ ELGM+PVLP + G VP       P   A++   G W      P 
Sbjct: 6   IERRTELGRRITDRLRELGMHPVLPGYFGTVPDDFPGHNPGSDARVIPQGTWGGGMRRPD 65

Query: 74  WCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAA 133
           W     LD     F ++  AF   Q + +G  SH +  D   E     D P  +     A
Sbjct: 66  W-----LDPRTQAFSDVAAAFYRHQGELFGDVSH-FKMDLLHEGGTAGDVP--VPDAARA 117

Query: 134 IYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS 193
           + + +Q+    A W++ GW  +  P         +L+S+   +++++D  +++  +    
Sbjct: 118 VETSLQTARPGATWVILGWQSNPRPV--------MLDSIDTSRVLIVDGLSDLDTVTDRE 169

Query: 194 KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAWINQYSVRRYGRSV- 252
             + G PY +       G I  +G   +I  G    R +E  T AW ++      G +  
Sbjct: 170 ADWGGAPYAF-------GTIPNFGGRTTI--GANTDRWTEKFT-AWRDKPGSALVGTAYM 219

Query: 253 -------PAI---------------QDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPS 290
                  PA                ++AW   Y  +     G  D +     A       
Sbjct: 220 PEAAERDPAALELFSELAWREEKIDREAWFAEYAQIRY---GGVDHSAREAFAALAATAY 276

Query: 291 IISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 350
            ++ T+G+   Y    S+   L +   +   P      +   RA    +A    L  S+ 
Sbjct: 277 KLTSTDGR--PYDSLFSRRPSLTTAIGTAFDP------AGFDRAFAALLAVRAPLRDSDA 328

Query: 351 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 410
           YR+DL D+ RQALA  +  L L +  AY+  D      +S  +L+++   D +  CH  F
Sbjct: 329 YRHDLTDVARQALANRSRTLQLALRAAYRNKDVATFRAVSALWLKVMRLSDTMAGCHRQF 388

Query: 411 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 470
           LLGPWLE AK+LA + E+  Q E  ART IT W D  +  A+ L +Y N+ W GL+ D +
Sbjct: 389 LLGPWLEDAKRLATSPEEAVQLERTARTLITTWAD--RPTANSLSNYANRDWQGLMADVH 446

Query: 471 GPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLY 530
            P+   +     +++ +G   +  DW  +       W   R+ YPV   GDA  T+  ++
Sbjct: 447 VPQWEAFLTEQADAMAAGRAPKSFDWYPQ----EEAWTQERHTYPVRPTGDAYSTALRVF 502

Query: 531 N 531
           +
Sbjct: 503 D 503


>gi|29832531|ref|NP_827165.1| alpha-N-acetylglucosaminidase [Streptomyces avermitilis MA-4680]
 gi|29609651|dbj|BAC73700.1| putative alpha-N-acetylglucosaminidase, secreted [Streptomyces
           avermitilis MA-4680]
          Length = 1038

 Score =  155 bits (392), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 133/563 (23%), Positives = 223/563 (39%), Gaps = 83/563 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAA-LQNVFPSAKI 59
           + NL G+GGPL    + ++  L ++I  R+  LGM PVLP + G+VP   ++     A +
Sbjct: 212 LQNLSGYGGPLSPELIAERAGLGRRICDRLRALGMAPVLPGYYGHVPKGFVERNGGDAHV 271

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
              G W   +  P W     LD     F  + ++F   Q   +G+ +H +  D   E   
Sbjct: 272 VPQGIWHGFER-PDW-----LDPRTASFAAVAKSFYRHQKDVFGKAAH-FKMDLLHEGGT 324

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
             D P  +      +   +Q+    A W++ GW    +P      + ALL+++   K+++
Sbjct: 325 AGDVP--VPGAARGVEKALQAAHPGATWVILGW--EANP------LPALLDAIDKKKMLI 374

Query: 180 LDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG------------------ILD 220
           +D  ++    +    K + G PY +  + NF G   +                     L 
Sbjct: 375 VDGVSDRYTSVTDREKDWGGTPYAFGTIPNFGGRTTIGARAHLWNEKFFAWRDKAGSALA 434

Query: 221 SIAFGPVEA---------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
             A+ P  A                  +    AW + Y+  RYG    + Q AW  L+ T
Sbjct: 435 GTAYLPEAADRDPAAFELFSELAWSAGKIDRAAWFSSYADFRYGGRDASAQKAWRALHDT 494

Query: 266 VYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
            Y       +D +  +  A PD+                          +  + Y    L
Sbjct: 495 AYQQHAVERSDAHDSLFCARPDL------------------------AANRAAEYAPRAL 530

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
            Y       AL   +     L  S  Y YDL+D+ RQALA  + + +L ++ A       
Sbjct: 531 TYDPGRFDAALSGLLGVAGGLRGSAAYTYDLVDVARQALAHRSRQ-YLPLLRAAYARKDA 589

Query: 385 GVF-QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
             F  L+  +L L+   D +   H  FLLGPW+  A+ LA +  +  ++E  A+  +T+W
Sbjct: 590 AAFTSLATLWLRLMGLSDEVTGTHPAFLLGPWINDARLLATDAGERAEFERTAKVLLTVW 649

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 +A  L +Y  + W+GL+ D+Y PR   +   + ++L +G      D    W  +
Sbjct: 650 GGRATSDAGDLHEYAGREWNGLMADFYLPRWKKWLDALADALATGTPPAAVD----WFAV 705

Query: 504 TNDWQNGRNVYPVESNGDALITS 526
              W   R  YP+   GD   T+
Sbjct: 706 EEPWTRERKDYPLRPVGDPYRTA 728


>gi|409097333|ref|ZP_11217357.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Pedobacter agri
           PB92]
          Length = 724

 Score =  152 bits (383), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 137/557 (24%), Positives = 219/557 (39%), Gaps = 83/557 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQN---VFPSA 57
           M NL GWGG      +  Q  +QKK+L R+ EL ++P+L  F G VP  L         A
Sbjct: 195 MGNLEGWGGTNSLQLMQLQSNIQKKVLSRMKELEIDPILQGFYGMVPHDLNKKVAALKDA 254

Query: 58  KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
           +I   GNW   +    +    +L  T+  F  +   +  +  K YG     +  + F E 
Sbjct: 255 QIIDQGNWVFTE----FIRPAILAPTNDKFNTVADVYYSELKKLYGSDIKFFGGEPFHEG 310

Query: 118 TPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
                VD    I+++  ++   MQ    ++ W++QGW        +     ALL  +   
Sbjct: 311 GKKGGVD----ITAVAKSVQDVMQKNFPNSTWVLQGW--------QNNPADALLAGLKKE 358

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGIL---------------- 219
             ++++LF E    W   K + G  +IW  + NF     +YG L                
Sbjct: 359 NTLIIELFGENTSNWEQRKGYGGTNFIWSNVSNFGEKNGLYGRLQRFLDEVYRIKQSPYK 418

Query: 220 ----------DSIAFGPVEA--------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNV 261
                     + I   PV          R  +     WI  Y+  RYG     + DAW V
Sbjct: 419 DYLKGVGIIPEGINNNPVAYDLMLDIAWRNEKPPLDKWITDYTTYRYGSYNKDVADAWKV 478

Query: 262 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
              TVY+      +K + V    P         +E  Y       ++ ++  +  SS+  
Sbjct: 479 FTETVYSSP--VNEKGKIVYQEGP---------SESIY------CARPSLKVNPVSSWGT 521

Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
               Y T    +A+ LFI +  +   S TY+ D  D  RQ +A   ++ +  +I A Q  
Sbjct: 522 RKRNYDTKLFKQAVALFIKAETQFKNSETYQTDKTDFLRQVMADKGDQAYDELINAIQAK 581

Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
           D + + +    FL ++   D LL  +  F L  WL  A  L +     K   +NA+ QIT
Sbjct: 582 DKNAIKEKGNHFLTMILQQDSLLNNNHFFTLNRWLNQAVALGKGLPDAKNILFNAKAQIT 641

Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI-ESLESGDGFRLKDWRREW 500
            W  +   + + LRDY +K W GLL   Y  R  ++    + + + S   F         
Sbjct: 642 FWGPDNNPKTT-LRDYAHKEWGGLLSSLYYNRWKLFIDDALNDKITSASTF--------- 691

Query: 501 IKLTNDWQNGRNVYPVE 517
             +   W    N+YP++
Sbjct: 692 YDMEVKWSKDSNLYPIK 708


>gi|329940646|ref|ZP_08289927.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
 gi|329300707|gb|EGG44604.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
          Length = 798

 Score =  149 bits (377), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 133/530 (25%), Positives = 210/530 (39%), Gaps = 83/530 (15%)

Query: 10  PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVK 69
           P+    LD + VL +++  R+ ELGM PVLP + G VP         A+    G W    
Sbjct: 227 PVSTQLLDARAVLGRRLADRLRELGMVPVLPGYFGTVPPGFAARNRGARTVPQGTWMGFD 286

Query: 70  SDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISS 129
             P W     LD    LF  +  AF   Q + +G ++H Y  D   E     D P  +  
Sbjct: 287 R-PDW-----LDPRTDLFARVAAAFYRVQGELFGASTH-YKMDLLHEGGTAGDVP--VGE 337

Query: 130 LGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG---------KLVVL 180
               +   ++    DAVW++ GW  +      PP  +A+L++V  G         +L+V+
Sbjct: 338 AAKGVERALRRARPDAVWVLLGWRHN------PP--RAILDAVASGGPDGAAGRERLLVV 389

Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYG------------------ILDS 221
           D  ++  P +      + GVPY +  + NF G+  +                     L  
Sbjct: 390 DGLSDRFPTVTDREADWGGVPYAFGSIWNFGGHTTLGANTPDWARLYEAWRTKEGSALRG 449

Query: 222 IAFGPVEA---------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
           IA  P  A                  E    AW  +++  RYG      + AW+VL  T 
Sbjct: 450 IALLPEAADNNPAAFALFSELPWHEGELDLKAWFARWARSRYGAYDAHAEAAWDVLRRTA 509

Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
           Y  T                      S +EG    +G   ++ ++     +S+    L Y
Sbjct: 510 YGTT-------------------RADSWSEGADGLFG---ARPSLTARRAASWSPKELRY 547

Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
              E  RAL+  +     L  S+ YR DL+D+ RQ L+  +  L   I  A    D    
Sbjct: 548 DAHEFERALDELLKVRPGLRESSAYRRDLLDVARQCLSNRSRALLPRIARACAARDVKAF 607

Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
              S  +L L++ ++ L+      LLG W   A+    +E +  + +++A + +T+W   
Sbjct: 608 DAASGDWLSLMDLLERLVGTDARHLLGRWTAQARAWGADEAERDRLQYDALSLLTVWGTR 667

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
              EA  LRDY N+ W+GL+   Y  R + YF  +  +L  G      DW
Sbjct: 668 QGAEAG-LRDYANREWAGLVGGLYRLRWSTYFTELRAALTEGRAPAAVDW 716


>gi|456390168|gb|EMF55563.1| alpha-N-acetylglucosaminidase [Streptomyces bottropensis ATCC
           25435]
          Length = 1042

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 135/563 (23%), Positives = 227/563 (40%), Gaps = 83/563 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAA-LQNVFPSAKI 59
           + NL G+GGPL    + ++  L ++I  R+ ELGM+PVLP + G+VP   ++     A +
Sbjct: 212 LQNLSGYGGPLSPQLIARRAGLGRRITDRLRELGMSPVLPGYYGHVPKQFVERNGGDAHV 271

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
              G W   +  P W     LD     F  +  +F       +G  +H +  D   E   
Sbjct: 272 VPQGLWHGFER-PDW-----LDPRTDSFARVAASFYGHVRDVFGAAAH-FKMDLLHEGGT 324

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
             D P  +      +   +     DA+W++ GW    +P      +  LL+++   ++++
Sbjct: 325 AGDVP--VPDAARGVERALHKAHPDAIWVILGW--QENP------LPELLDAIDRSRMLI 374

Query: 180 LDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG------------------ILD 220
           +D  ++    +    + + G PY +  + NF G   +                     L 
Sbjct: 375 VDGVSDRYASVTDRERDWGGTPYCFGTIPNFGGRTTIGARAHLWTDKFFAWRDKPDSALV 434

Query: 221 SIAFGPVEARTSENTTM----------------AWINQYSVRRYGRSVPAIQDAWNVLYH 264
             A+ P EA   +                    AW + Y+  RYG    A + AW  L+ 
Sbjct: 435 GTAYMP-EATDRDPAAFELFSELAWTPGKIDRAAWFSAYADFRYGGRDDAARAAWRALHE 493

Query: 265 TVYNCTD-GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
           T Y       +D +  +  A PD+                    + A     T +YD   
Sbjct: 494 TAYQQRAVERSDPHDSLFCARPDL-----------------AADRAAEYAPRTLTYDPGR 536

Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
                 +   A  L +A G   + +  YRYD++DL RQALA  + +    +  A++  D 
Sbjct: 537 F-----DAAFAGLLDVAGGRRRNPA--YRYDVVDLARQALAHRSRQYLPQLRAAHRRKDL 589

Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
                LS  +L L+   D +      FLLGPW+  A+ LA ++ +  ++E  A+  IT+W
Sbjct: 590 TTFRALSTLWLRLMRLSDEVTGTDGAFLLGPWVNDARLLATDDAERAEFERTAKVLITVW 649

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 +   L +YGN+ W+GL+ D+Y PR   +   + ++L +G      D    W   
Sbjct: 650 GGRATSDTGDLHEYGNREWNGLMADFYVPRWQKWLDALEDALATGTAPAAVD----WFAF 705

Query: 504 TNDWQNGRNVYPVESNGDALITS 526
              W   R  YP+   GDA  T+
Sbjct: 706 EEPWTRERKDYPLRPVGDAYRTA 728


>gi|380804373|gb|AFE74062.1| alpha-N-acetylglucosaminidase precursor, partial [Macaca mulatta]
          Length = 265

 Score =  138 bits (347), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 64/145 (44%), Positives = 92/145 (63%), Gaps = 3/145 (2%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 124 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 183

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 184 KMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRELVKEFG-TDHIYGADTFNEMQPP 240

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDA 145
             +P Y+++   A+Y  M + D++A
Sbjct: 241 SSAPSYLAAATTAVYEAMIAVDTEA 265


>gi|339238239|ref|XP_003380674.1| GDP-L-fucose synthetase [Trichinella spiralis]
 gi|316976398|gb|EFV59699.1| GDP-L-fucose synthetase [Trichinella spiralis]
          Length = 1203

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 110/449 (24%), Positives = 190/449 (42%), Gaps = 54/449 (12%)

Query: 89   EIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWL 148
             +G   + + L+ Y    H Y+ D F+E  P      ++ ++  AIY+ M S D  +VW+
Sbjct: 801  HVGNEVVWKSLENYFGLFHAYSADPFNEMVPNTFDVMFLRNVSFAIYNVMLSVDPKSVWV 860

Query: 149  MQGWLFSYDPFWRPPQ-MKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLH 207
            +Q W+F     W   +  K  L +VP G ++V+DL+AE  P++     FY  P+IWCMLH
Sbjct: 861  LQSWMFLSSERWLENENAKHFLTAVPTGSILVVDLYAEEYPLYEKFSGFYNQPFIWCMLH 920

Query: 208  NFAGNIEMYGILDSIAFGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
            NF G   +YG L  I     +  T  N  M           G S+  I   + V+Y    
Sbjct: 921  NFGGVQGLYGNLARINQKLADVSTVSNINMVGT--------GLSMEGIDQNY-VVYQMAL 971

Query: 268  NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
            +      ++  D+   +  +    + +T+  Y  +G                        
Sbjct: 972  DRFWSPNNQKVDLAAWYIYIHLG-VGITKSIYTAWGA----------------------- 1007

Query: 328  TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
                      F+ S      +  Y  DL++LT+ AL     +L+  +  +Y         
Sbjct: 1008 ----------FLQSSRTCQENEIYINDLVELTKHALMLTGAKLYEKLQASYIRKCGQEFL 1057

Query: 388  QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
            + +    +++ D++ +   H   +L  W+E A+   +   Q  Q E N R Q+T+W    
Sbjct: 1058 ENAAAVEQVLSDLEWISKTHSRSMLSKWIEIARANGKTAAQSDQLEENLRMQVTIW--GP 1115

Query: 448  QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM---IESLESGDGFRLKDWRREWIKLT 504
            Q E   + DY  K W+ L  +YY PR  ++F ++   I  LE+ +   L       I+L 
Sbjct: 1116 QGE---IVDYARKQWAALFSEYYLPRWRLFFAHLYADILQLETFNQTLLNSRLFHEIELP 1172

Query: 505  NDWQNGRNVYPVESNGDALITSQWLYNKY 533
               Q   N+   +  G+ ++ S+ LY++Y
Sbjct: 1173 FALQKIPNI--DQPTGNTVVVSKILYSRY 1199


>gi|212722968|ref|NP_001131519.1| uncharacterized protein LOC100192858 [Zea mays]
 gi|194691748|gb|ACF79958.1| unknown [Zea mays]
          Length = 114

 Score =  132 bits (333), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 61/99 (61%), Positives = 75/99 (75%)

Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 501
           MWFDNT+ +ASLLRDY NKYWSGLL+DYYGPRAAIYFK+++ S+E+   F LK+WRREWI
Sbjct: 1   MWFDNTETKASLLRDYANKYWSGLLQDYYGPRAAIYFKHLLLSMENNAPFALKEWRREWI 60

Query: 502 KLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 540
            LTN+WQ+ R V+   + GD L  SQ LY KYL    + 
Sbjct: 61  SLTNNWQSDRKVFSTTATGDPLNISQSLYTKYLSNADLL 99


>gi|328867426|gb|EGG15808.1| alpha-N-acetylglucosaminidase [Dictyostelium fasciculatum]
          Length = 992

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 95/321 (29%), Positives = 150/321 (46%), Gaps = 43/321 (13%)

Query: 212 NIEMYGILDSIAFGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTD 271
           N  MY +++ +A+     RT+      WINQY+ RRYG  VP +  AWN+L  TV+N T 
Sbjct: 232 NYMMYDLMNEMAW-----RTTAPNMTEWINQYTQRRYGVFVPELAQAWNLLIPTVFNATL 286

Query: 272 GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEV 331
           G                             YG P S   +      + D   L+Y  S V
Sbjct: 287 GY----------------------------YGPPSSFVGMRPQLNMTND---LYYDPSVV 315

Query: 332 IRALELFIASGNE-LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLS 390
            +A +L++   +E + ++ T+ +D+ ++T QAL+    +  + + +AY  N +    + +
Sbjct: 316 QQAWQLYLGVTDEYVLSTATFSFDVSEITLQALSNLFMDTQMAMYDAYLTNQSTVFEERA 375

Query: 391 RRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE--QEKQYEWNARTQITMWFDNTQ 448
              L ++ DMD + A     L+G W  +A+Q A N    +   +E+NAR QIT+W     
Sbjct: 376 TSCLNIITDMDTIAATQQMLLVGTWTANARQWALNTSSGETAPFEFNARNQITLW----G 431

Query: 449 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQ 508
              S L DY    WSGLL D+Y  R A++ KYM  SL +   F   D+  +   L   W 
Sbjct: 432 PPNSSLHDYAYHLWSGLLNDFYFARWALFIKYMDTSLSTNTTFNNTDYTNDIESLEESWN 491

Query: 509 NGRNVYPVESNGDALITSQWL 529
           N    YP    G+A + S+++
Sbjct: 492 NQNYQYPTLPTGNAYLLSKFI 512


>gi|402824586|ref|ZP_10873940.1| N-acetylglucosaminidase, partial [Sphingomonas sp. LH128]
 gi|402261896|gb|EJU11905.1| N-acetylglucosaminidase, partial [Sphingomonas sp. LH128]
          Length = 486

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 78/252 (30%), Positives = 123/252 (48%), Gaps = 39/252 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL G+  PL   W++++  LQ +IL R+  LGM PVLPAF+G VP A     P A+I 
Sbjct: 223 MGNLAGYRAPLSSGWIEKKHQLQLRILARMRALGMKPVLPAFAGYVPEAFAKAHPKARIY 282

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD +DPLF ++   F+    + YG   + Y  D F+E  PP
Sbjct: 283 KMRAWEGFPP------TYWLDPSDPLFTQLAARFVTLYNRTYGEGEY-YLADAFNEMIPP 335

Query: 121 V-------DSPEY-----------------------ISSLGAAIYSGMQSGDSDAVWLMQ 150
           +        + EY                       +++ G  +Y  + +    A W+MQ
Sbjct: 336 IAEDGSDAAAAEYGDSIANTAATRAAALPPAVRDARLAAYGERLYGSITAAAPKATWVMQ 395

Query: 151 GWLFSYDPFWRPPQ-MKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  +R P+ + A L+ VP  ++++LD+  +  P IW  +  F G  + +  +HN
Sbjct: 396 GWLFGADKAFRTPEAIAAFLSRVPDDRMLILDIGNDRYPGIWQKTDAFDGKAWTYGYVHN 455

Query: 209 FAGNIEMYGILD 220
           + G+  +YG L+
Sbjct: 456 YGGSNPVYGDLE 467


>gi|84625358|ref|YP_452730.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
 gi|84369298|dbj|BAE70456.1| putative N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae
           MAFF 311018]
          Length = 590

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 124/521 (23%), Positives = 202/521 (38%), Gaps = 111/521 (21%)

Query: 54  FPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDT 113
            P A+I ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D 
Sbjct: 46  LPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQAYG-AGEFYLADA 98

Query: 114 FDENTPPVD-------SPEY-----------------------ISSLGAAIYSGMQSGDS 143
           F+E  PPV        + +Y                       +++ G A+Y  +   + 
Sbjct: 99  FNEMLPPVADDGSDVAAAKYGDSIANFDAARAKAVPPAQRDARLAAYGQALYRSIAQVNP 158

Query: 144 DAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPY 201
            A W+MQGWLF  D  FW+P  + A L  VP  +L+VLD+  +  P  W  S+ F    +
Sbjct: 159 KATWVMQGWLFGADCAFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQW 218

Query: 202 IWCMLHNFAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM------- 237
           I+  +HN+  +  +YG +                 +   FG        N+ +       
Sbjct: 219 IYGYVHNYGASNPLYGDVAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVVYEYLYAL 278

Query: 238 ----------AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDV 287
                      W+ QY   RYGRS  A+  AW  L   +Y     +              
Sbjct: 279 AWEGPQHPWSQWLAQYLRARYGRSDAALLSAWTDLGAGIYQTRYWS-------------- 324

Query: 288 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 347
            P   +   G Y  + +P +            D P        +  A++  +   +  + 
Sbjct: 325 -PRWWNTHAGAYLLFKRPTADIVNFD------DRPG---DPQRLRSAIDALLQQADRYAD 374

Query: 348 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLAC 406
           +  YRYDLI+  R  L+  A+     +++AY   D A G  QL+R   +LV+ +D L+  
Sbjct: 375 APLYRYDLIEDARHYLSLQADRQLQTVVQAYNAGDFARGDAQLART-TQLVQGLDALVGG 433

Query: 407 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 466
               L     ++A  +  +    + Y  NAR Q+++W  +       L DY +K W G+ 
Sbjct: 434 QHETLAAWTGQAAAAVGNDARLLRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMY 488

Query: 467 RDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 500
            D+Y  R   +      + ++G  F       +L  W R+W
Sbjct: 489 ADFYLQRWTRFLSAYRAARKAGTPFDAQTVDQQLATWERQW 529


>gi|293371910|ref|ZP_06618314.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
 gi|292633156|gb|EFF51733.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
          Length = 411

 Score =  127 bits (319), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 114/449 (25%), Positives = 188/449 (41%), Gaps = 89/449 (19%)

Query: 129 SLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVK 187
            + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++LD   E  
Sbjct: 3   KIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMILLDYHCENV 62

Query: 188 PIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI----------------------------L 219
            +W  ++ F+  PYIWC L NF GN  + G                             L
Sbjct: 63  ELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKGIGSTLEGL 122

Query: 220 DSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGAT 274
           D + F P E    +   +      WI   + R  G    +++DAW  L++ +Y       
Sbjct: 123 DVMQF-PYEYILEKAWNLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDIY------- 174

Query: 275 DKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRA 334
                  V  P         T G    Y +P   +   K  ++ Y +  L     EV R 
Sbjct: 175 -------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVELL----EVWRK 214

Query: 335 LELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRR 392
           L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D   +     +
Sbjct: 215 L-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQALKACGEK 267

Query: 393 FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEAS 452
             E++ D+D L A H    L  W++ A+++  + + +  YE NAR  IT W         
Sbjct: 268 MKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW-------GG 320

Query: 453 LLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE----WIKLTN 505
            L DY ++ W+GL+ DYY  R  +Y    I+ +  G   D  +L+D  +E    W+  T+
Sbjct: 321 SLNDYASRSWAGLISDYYAKRWEVYINTFIKVVGEGVEVDQKQLEDELKEIEEGWVNATD 380

Query: 506 DWQNGRNVYPVESNGDALIT-SQWLYNKY 533
                ++V+   S  D L++ S +L++KY
Sbjct: 381 RKDTRKDVH---STTDGLLSFSTFLFSKY 406


>gi|58583545|ref|YP_202561.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae KACC 10331]
 gi|58428139|gb|AAW77176.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae KACC 10331]
          Length = 753

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 122/521 (23%), Positives = 200/521 (38%), Gaps = 111/521 (21%)

Query: 54  FPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDT 113
            P A+I ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D 
Sbjct: 209 LPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQAYG-AGEFYLADA 261

Query: 114 FDENTPPV------------------------------DSPEYISSLGAAIYSGMQSGDS 143
           F+E  PPV                                   +++ G A+Y  +   + 
Sbjct: 262 FNEMLPPVADDGSDVAAAKYGDSIANFDAARAKAVPPAQRDARLAAYGQALYRSIAQVNP 321

Query: 144 DAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPY 201
            A W+MQGWLF  D  FW+P  + A L  VP  +L+VLD+  +  P  W  S+ F    +
Sbjct: 322 KATWVMQGWLFGADCAFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQW 381

Query: 202 IWCMLHNFAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM------- 237
           I+  +HN+  +  +YG +                 +   FG        N+ +       
Sbjct: 382 IYGYVHNYGASNPLYGDVAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVVYEYLYAL 441

Query: 238 ----------AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDV 287
                      W+ QY   RYGRS  A+  AW  L   +Y                    
Sbjct: 442 AWEGPQHPWSQWLAQYLRARYGRSDAALLSAWTDLGAGIYQTR---------------YW 486

Query: 288 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 347
            P   +   G Y  + +P +   ++  +    D          +  A++  +   +  + 
Sbjct: 487 SPRWWNTHAGAYLLFKRPTAD--IVNFDDRPGD-------PQRLRSAIDALLQQADRYAD 537

Query: 348 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLAC 406
           +  YRYDLI+  R  L+  A+     +++AY   D A G  QL+ R  +LV+ +D L+  
Sbjct: 538 APLYRYDLIEDARHYLSLQADRQLQTVVQAYNAGDFARGDAQLA-RTTQLVQGLDALVGG 596

Query: 407 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 466
               L     ++A  +  +    + Y  NAR Q+++W  +       L DY +K W G+ 
Sbjct: 597 QHETLAAWTGQAAAAVGNDARLLRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMY 651

Query: 467 RDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 500
            D+Y  R   +      + ++G  F       +L  W R+W
Sbjct: 652 ADFYLQRWTRFLSAYRAARKAGTPFDAQTVDQQLATWERQW 692


>gi|47212645|emb|CAF95026.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 121

 Score =  121 bits (303), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 56/121 (46%), Positives = 83/121 (68%), Gaps = 3/121 (2%)

Query: 20  LVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYL 79
           L LQ KIL ++   GM PVLPAFSGNVP  +  ++P A++T+LG W   K +  + C+Y+
Sbjct: 4   LSLQFKILEQMRSFGMTPVLPAFSGNVPKGILRLYPEARVTRLGPW--SKFNCSFSCSYI 61

Query: 80  LDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQ 139
           LD  DPLF+ IG  ++ Q +K++G T+HIYN DTF+E TPP   P Y+S++  A+++ M 
Sbjct: 62  LDPRDPLFLRIGSLYLAQVVKQFG-TNHIYNTDTFNEMTPPSSEPNYLSAVSRAVFAAMT 120

Query: 140 S 140
           +
Sbjct: 121 A 121


>gi|281423204|ref|ZP_06254117.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
 gi|281402540|gb|EFB33371.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
          Length = 291

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 70/240 (29%), Positives = 112/240 (46%), Gaps = 22/240 (9%)

Query: 247 RYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV 306
           RYG++ P I+ AW +L  T+YNC  G   +             SI           G+P 
Sbjct: 4   RYGKTSPEIERAWQLLSETIYNCPAGNNQQG---------PHESIFC---------GRP- 44

Query: 307 SKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 366
              ++   +  S+     +Y     + A +L     ++   +N + YDL+D+ RQALA  
Sbjct: 45  ---SLNNFQVKSWSKMRNYYDLQATLEAAQLMTGIADQYKGNNNFEYDLVDICRQALADQ 101

Query: 367 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 426
               +L  I  Y         + + RFLE++   D LL     F LG W E+A++L   +
Sbjct: 102 GRLQYLKTIADYNGFSRKAFAKDAHRFLEMILLQDKLLGTRTEFRLGHWTEAARKLGTTQ 161

Query: 427 EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 486
           +++  YEWNAR QIT W +    +   L DY +K W G+L+D+Y  R  I+   + + +E
Sbjct: 162 QEKDLYEWNARVQITTWGNRMCADKGGLHDYAHKEWQGILKDFYYKRWKIFMDALAKQME 221


>gi|169351438|ref|ZP_02868376.1| hypothetical protein CLOSPI_02218 [Clostridium spiroforme DSM 1552]
 gi|169291660|gb|EDS73793.1| F5/8 type C domain protein [Clostridium spiroforme DSM 1552]
          Length = 1762

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 141/564 (25%), Positives = 227/564 (40%), Gaps = 111/564 (19%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M N+   GGP+   W+  +L + ++       LGM  VL  ++G VP    + +   +I 
Sbjct: 816  MDNMEVIGGPVSDEWVKGRLEMARENQRWKNSLGMQTVLQGYAGMVPNNFTD-YQDVEIL 874

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP- 119
            + GNW  V   PR     ++     L+ +  + F E Q   +G+TS+ Y  D F E    
Sbjct: 875  EQGNWCGV---PR---PDMIRTDGELYDQYAKLFYEAQEWAFGKTSNYYAVDPFHEGGKR 928

Query: 120  PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKAL--LNSVPLGKL 177
            P D  + +  +   + + +   D +AVW++Q W       W  P    L  +       +
Sbjct: 929  PSDLTDDV--ISREVLNSLLEYDQEAVWMVQAW-------WSNPTNDLLKGMGDDREDHV 979

Query: 178  VVLDLFA------------EVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG----ILDS 221
            ++LDL              E       S +F    ++WCML N+ GN  M G    I++ 
Sbjct: 980  IILDLNGLNDAYDSYWDKTEYNGTVLESDEFNSTSWVWCMLENYGGNPSMDGRPKEIINR 1039

Query: 222  IAFGPVEAR-------TSENT-----------TMA----------WINQYSVRRYGRSVP 253
            I     +A         SE T            MA          W+++Y +RRYG    
Sbjct: 1040 INKASTQAEHMKGIGFISEATYDNPMIYELLLDMAWQQDTIDLDDWLDEYVLRRYGDYSE 1099

Query: 254  AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
            +  +AW++L  TVY+ +   TD     ++A    DPS++         YG P        
Sbjct: 1100 SAGEAWDILLKTVYSRSGKTTD-----VIARS--DPSLVQ--------YGLP-------- 1136

Query: 314  SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 373
                        Y+ SE+  ALEL     ++LSAS  YRYDL ++ RQ +  YA     +
Sbjct: 1137 ------------YTASELEEALELLYKDYDKLSASEAYRYDLTEIMRQVVNNYAVVRLGD 1184

Query: 374  IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK-QY 432
            +  AY   +      L  ++L  ++ ++ +       L+G W+  A   A++   +   Y
Sbjct: 1185 LKTAYDAKEIDNFKSLKEQYLNAIDLLNEVCGTQQDLLIGEWVGRAVDWAKDTNSDDFAY 1244

Query: 433  E---WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD 489
            +    NA+T IT+W  +T      L  Y  + + G++ D Y      Y     E LE G 
Sbjct: 1245 DSMIINAKTLITVWAPSTT-----LGTYAYRNYEGMINDIYKVIWQAYLDQSEEILEFGS 1299

Query: 490  G----FRLKDWRREWIKLTNDWQN 509
                     D   +WI    D QN
Sbjct: 1300 AKTNLVNYHDLCMDWIYADWDLQN 1323


>gi|281423203|ref|ZP_06254116.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
 gi|281402539|gb|EFB33370.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
          Length = 450

 Score =  118 bits (295), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 78/226 (34%), Positives = 113/226 (50%), Gaps = 24/226 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGPLP SW +QQ  LQKKIL R++E GM PVLP F G +P   +       +T
Sbjct: 186 MNNLEGWGGPLPDSWYNQQEALQKKILKRMHEYGMQPVLPGFCGMMPHDAKAKL-GLNVT 244

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W              L  TD    +I   +  +    YG+ ++ Y+ D F E    
Sbjct: 245 DGGIWNGYTRPAN------LSPTDAHSDKIADLYYAELTNLYGKANY-YSMDPFHETND- 296

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D     S  G  +   M+  + +A W++QGW  + +P  RP  +K + N    G L+VL
Sbjct: 297 -DEAIDYSKAGRKVMEAMKRVNPNATWVIQGW--TENP--RPQMIKNMKN----GDLLVL 347

Query: 181 DLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 220
           DLF+E +P      IW   K +    +++CML NF  N+ ++G +D
Sbjct: 348 DLFSECRPMFGIPSIWKREKGYEQHDWLFCMLENFGANVGLHGRMD 393


>gi|255079272|ref|XP_002503216.1| GH family 89 protein [Micromonas sp. RCC299]
 gi|226518482|gb|ACO64474.1| GH family 89 protein [Micromonas sp. RCC299]
          Length = 1260

 Score =  117 bits (293), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 68/179 (37%), Positives = 96/179 (53%), Gaps = 7/179 (3%)

Query: 58  KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
           K  +LG +   K D      + LD +D LF  +G AF +Q ++++G T H+Y  DTF E 
Sbjct: 396 KDAELGKY--AKKDDSVRSVHFLDPSDALFQSLGAAFTKQLVEDFG-TDHLYLADTFREI 452

Query: 118 TPPVD--SPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPL 174
             P D  S  ++  +GAA  + M+S D  A W+ Q   F  +P FW   +  ALL SV +
Sbjct: 453 RDPNDDFSETHVVRVGAATLAAMRSADPRATWVFQSDAFRRNPRFWNEGRRGALLRSVDI 512

Query: 175 GKLVVLDLFAEVKPIW-STSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS 232
           G ++VLD  AE  P +      F G P++WC+ HN  GN+ M G L +IA GP  A  S
Sbjct: 513 GDMLVLDSAAETDPYYLREPVHFAGQPFVWCVKHNHGGNLGMRGRLSAIATGPAAAMDS 571



 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 65/220 (29%), Positives = 98/220 (44%), Gaps = 34/220 (15%)

Query: 256 QDAWNVLYHTVYNCTDGATDKN--RDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
           ++AW +L  TVY       D++  RD     P +    +S     Y +  K V       
Sbjct: 727 REAWEILGKTVYGARAKGEDEDHVRDACSWQPSLRADELSP---DYFDAAKVVD------ 777

Query: 314 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 373
                       Y+   +I A     A+G    A     YD++D+ RQ LA+ +N L   
Sbjct: 778 ------------YAFKPLIDAAPTLRANG----AGTRVDYDIVDVGRQLLARQSNVLATQ 821

Query: 374 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA---QNEEQEK 430
           I ++   N+A        + LEL++DMD LL  H GFLLG ++ESAK  A     E  E 
Sbjct: 822 IRDSLNSNNASEAKMYGTQMLELLDDMDALLRSHKGFLLGNYIESAKSWAGKRNKESDEA 881

Query: 431 QYEWNARTQITMWFDNTQEEASLL----RDYGNKYWSGLL 466
             E +AR+ I+ +  +  +  + L     DY N+ WSG+L
Sbjct: 882 NLERSARSLISGFGPSGSKLGAPLGHPMHDYSNRQWSGML 921



 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 38/61 (62%)

Query: 7   WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWF 66
           W G  P+ WL +Q  LQ+  +  + + GM PVLP F+G+VP A+   FP AK+ ++ NW 
Sbjct: 217 WTGGRPKKWLKRQWDLQRDAVKLMRDFGMTPVLPGFNGHVPPAIARRFPEAKLRRVENWL 276

Query: 67  S 67
           +
Sbjct: 277 T 277


>gi|293402299|ref|ZP_06646437.1| putative alpha-N-acetylglucosaminidase [Erysipelotrichaceae bacterium
            5_2_54FAA]
 gi|291304406|gb|EFE45657.1| putative alpha-N-acetylglucosaminidase [Erysipelotrichaceae bacterium
            5_2_54FAA]
          Length = 2330

 Score =  116 bits (290), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 122/532 (22%), Positives = 213/532 (40%), Gaps = 97/532 (18%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M N+  +GGP+P  ++  ++ L +        LGMN VL  ++G VP       P+  +T
Sbjct: 670  MQNIETFGGPIPDQYVVDRVELARTTQRWKNSLGMNTVLQGYAGMVPTNFNEFQPNVPLT 729

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
               +W  + + P    T       P + E  + F E Q   YG TS  Y  D + E    
Sbjct: 730  AQKSWGGL-ARPSMIPT-----DSPYYDEYAKLFYEAQEYIYGATSDYYAVDPYHEGGT- 782

Query: 121  VDSPEYIS--SLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSV---PLG 175
               PE +S  ++   + + +   D DAVW++Q         W+      LLN +      
Sbjct: 783  --RPEGLSDETVAREVLNSLLDYDKDAVWVVQA--------WQSNPTDGLLNGMGEYREN 832

Query: 176  KLVVLDLFAEVKPIWSTSK----QFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
             ++++DL     PI S +K    +F G  + W +L  F GN  M G + ++    ++   
Sbjct: 833  HVLIVDLIK--YPIKSWTKYNKSEFKGTSWAWGLLGGFGGNPTMNGEMQTM-VNDIQTAK 889

Query: 232  SENTTMA---------------------------------WINQYSVRRYGRSVPAIQDA 258
             E T MA                                 W+N+Y  RRYG +    ++A
Sbjct: 890  KERTHMAGLGIISEAQYDNPVLYDLIFDLAWVDDDFSLDQWLNKYIERRYGGTSDNAKEA 949

Query: 259  WNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSS 318
            W ++ +  YN             V F      +  +     Q+YGK              
Sbjct: 950  WKIMKNANYNHG-----------VRFT---AQVYGMKGKSPQDYGK-------------- 981

Query: 319  YDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 378
                ++ Y   ++  A  L I   ++   S  YRYDL ++ RQ ++ Y+   + N+I+A 
Sbjct: 982  ---QNISYGADKLETAFRLLIEDYDKFKDSECYRYDLTEIMRQMVSNYSTLTYNNVIDAR 1038

Query: 379  QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNA 436
            +  +     +   +FL+  + ++ +       L G W+  A+  A + +   +  +E NA
Sbjct: 1039 EDKNIEKFKEEKAKFLKSFDVLNDIQETQVDQLAGEWIGKAQDRAADYDDFAKDAFEMNA 1098

Query: 437  RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 488
            +  IT W   ++  A  L+DY  + + G+  D Y      Y   +  +LE+G
Sbjct: 1099 KALITSWA--SRSSAGGLKDYAWRNYQGMFIDLYKQNWIDYLDQVEANLENG 1148


>gi|260821254|ref|XP_002605948.1| hypothetical protein BRAFLDRAFT_132235 [Branchiostoma floridae]
 gi|229291285|gb|EEN61958.1| hypothetical protein BRAFLDRAFT_132235 [Branchiostoma floridae]
          Length = 673

 Score =  115 bits (289), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 83/289 (28%), Positives = 122/289 (42%), Gaps = 66/289 (22%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPLPQSW   QL LQ KIL R+              N  + L +++    + 
Sbjct: 368 MGNIRGWGGPLPQSWHQNQLELQHKILARMR-------------NFDSTLMHLY----LD 410

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G     ++    C T  +                             +C    E    
Sbjct: 411 YSGGDLKTRTVAHTCWTLRI-----------------------------HCFLTLEECLL 441

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P Y+S  GAA+Y+GM +GD  A+WLMQGWLF    FW+P Q KALL SVP G  +  
Sbjct: 442 LSEPNYLSKAGAAVYAGMLAGDPQAIWLMQGWLFQARDFWQPAQTKALLQSVPEGPFL-- 499

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCML--HNFAGNIEMYGILDSIAFGPVEARTSENTTMA 238
                       ++++ G   +   L       N  MY +++ +A+ P   +  +N    
Sbjct: 500 ------------ARKYLGSTMVGTGLTPEGIDQNYIMYELMNEVAWMPQPFQILDN---- 543

Query: 239 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDV 287
           W + Y+  RYG         W +L  +VY+C +G  D    V+V  PD+
Sbjct: 544 WASDYAWSRYGVKNSNASLGWQILLKSVYDCENGFKDHCDSVVVHRPDL 592


>gi|386386798|ref|ZP_10071901.1| alpha-N-acetylglucosaminidase [Streptomyces tsukubaensis NRRL18488]
 gi|385665738|gb|EIF89378.1| alpha-N-acetylglucosaminidase [Streptomyces tsukubaensis NRRL18488]
          Length = 1033

 Score =  115 bits (288), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 67/193 (34%), Positives = 99/193 (51%), Gaps = 6/193 (3%)

Query: 334 ALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRF 393
           A    +A+G EL  S+TYR+DL +L RQALA  +  L   +  A    D      +S  +
Sbjct: 531 AFAALLAAGPELRDSDTYRHDLTELARQALANRSRTLQFALRAARASKDVAAFRGVSALW 590

Query: 394 LELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASL 453
           L+L+   D +  CH  FLLGPWLE AK+LA +  +  + E  AR  IT W D  +  A+ 
Sbjct: 591 LKLMRLADTMAGCHRSFLLGPWLEDAKRLATSPAEAVELERTARALITTWAD--RPAANA 648

Query: 454 LRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNV 513
           L +Y N+ W+GL+ D + P+   +   + ++LE+G   +  DW  +       W   R V
Sbjct: 649 LSNYANRDWNGLIADVHVPQWDAFLTEVADALEAGRAPKSFDWYPQ----EEAWTKDRRV 704

Query: 514 YPVESNGDALITS 526
           YP    GD   T+
Sbjct: 705 YPSAPTGDPYATA 717



 Score = 96.7 bits (239), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 57/215 (26%), Positives = 102/215 (47%), Gaps = 17/215 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + N+  +GGPL ++ LD +  L +KI  R+ ELGM PVLP + G VP    +  P A++ 
Sbjct: 213 LQNMSEYGGPLSKTLLDARAELGRKITARLRELGMRPVLPGYFGTVPDGFADRNPGARVV 272

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W  ++  P W     LD    +F ++  AF   Q K +G    ++  D   E    
Sbjct: 273 AQGLWNGLRR-PDW-----LDPRTTVFPKVAAAFYRHQTKLFG-ACDLFKMDLLHEGGNA 325

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  +     A+   +++   +AVW++ GW  +          +ALL++V   +++++
Sbjct: 326 GDVP--VPDAARAVEKALRTARPNAVWVILGWQSN--------PRRALLDAVDKRRMLIV 375

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 215
           D  +++        ++ G PY +  + NF G   +
Sbjct: 376 DGLSDLDTTGDRESEWGGTPYAFGTIPNFGGRTTL 410


>gi|297723521|ref|NP_001174124.1| Os04g0650900 [Oryza sativa Japonica Group]
 gi|255675839|dbj|BAH92852.1| Os04g0650900, partial [Oryza sativa Japonica Group]
          Length = 128

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 49/64 (76%), Positives = 56/64 (87%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+HGWGGPLPQSWLD QL LQKKIL R+Y  GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 64  MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 123

Query: 61  QLGN 64
            LGN
Sbjct: 124 HLGN 127


>gi|302522684|ref|ZP_07275026.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SPB78]
 gi|302431579|gb|EFL03395.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SPB78]
          Length = 355

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 72/258 (27%), Positives = 122/258 (47%), Gaps = 23/258 (8%)

Query: 239 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 298
           W  +Y+  RYG S    + AW++L  TVY       D+  +         P++ +V+ GK
Sbjct: 84  WFREYARVRYGGSDAHAEAAWDILRTTVYGTRR--DDRWSEPADGLFGARPALDAVSAGK 141

Query: 299 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 358
           +            L+   +S++             AL+  ++   EL  S TYR DL+D+
Sbjct: 142 WS--------PKALRYPAASFEP------------ALDELLSVRAELRDSATYRRDLLDV 181

Query: 359 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 418
            RQALA  +  L   +  AY+  +     +L RR++ L++ ++ L+A  +  LLG W+ES
Sbjct: 182 ARQALANRSRTLLPRLAAAYKAKNQAEFARLGRRWIALIDLLEQLVATDENHLLGRWVES 241

Query: 419 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
           A+    +  ++ Q +++A + +T W      +A  LRDY N+ WSGL+   Y  R   Y 
Sbjct: 242 ARAWGGSAREKSQLQYDALSLLTTWGTRQGADAG-LRDYANREWSGLVGGLYRLRWGTYI 300

Query: 479 KYMIESLESGDGFRLKDW 496
             +  +L+ G      DW
Sbjct: 301 DELSAALKEGRKPVAVDW 318


>gi|293369245|ref|ZP_06615835.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
 gi|292635670|gb|EFF54172.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
          Length = 221

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 57/214 (26%), Positives = 102/214 (47%), Gaps = 7/214 (3%)

Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
           P + Y   +++ A  L ++  +     ++Y +DL+++ RQ L  Y N +      AY+  
Sbjct: 11  PTIEYQPKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAG 68

Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
           D   +     +  E++ D+D L++CH  F L  W+  A+ +  +   +  YE NAR+ IT
Sbjct: 69  DIPMMKNRGNKMREILADLDKLVSCHPTFSLHKWITDARDMGHDAASKNYYEMNARSLIT 128

Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 501
           +W D+       L DY N+ W+GL   YY  R   +   +IE+ E    F  +++  +  
Sbjct: 129 IWGDSYH-----LTDYANRSWAGLTNQYYSVRWDHFINEVIEAAEKKKNFDEEEFFNQSR 183

Query: 502 KLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
              N+W N  N       GD +  ++ +Y KY +
Sbjct: 184 MYENEWVNPSNRISYNEGGDGIKLARQIYKKYAK 217


>gi|293371911|ref|ZP_06618315.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
 gi|292633157|gb|EFF51734.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
          Length = 289

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 46/98 (46%), Positives = 62/98 (63%), Gaps = 3/98 (3%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 184 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 243

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQ 98
            LG W       R  C + L+  D LF +I + F+++Q
Sbjct: 244 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQ 278


>gi|321458423|gb|EFX69492.1| hypothetical protein DAPPUDRAFT_35389 [Daphnia pulex]
          Length = 132

 Score = 92.8 bits (229), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 45/137 (32%), Positives = 78/137 (56%), Gaps = 5/137 (3%)

Query: 355 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 414
           ++DLTRQ++ +  + L+  ++E Y   ++  +  ++ + + L++D+D L+     FLLG 
Sbjct: 1   MVDLTRQSMQEIFHLLYSKLLEVYLEKNSTAIEGIAYKMINLLQDLDELIQTGKTFLLGK 60

Query: 415 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 474
           W+  AK     E ++ QYEWNAR QIT+W    +     +RDY  K W+G++ DYY P  
Sbjct: 61  WIADAKSWGTTEGEKLQYEWNARNQITLWGPRGE-----IRDYAAKKWAGVVADYYKPHW 115

Query: 475 AIYFKYMIESLESGDGF 491
            ++ + M  SL+    F
Sbjct: 116 EVFIREMQMSLDENRAF 132


>gi|342731751|ref|YP_004770590.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
           SFB-mouse-Japan]
 gi|342329206|dbj|BAK55848.1| alpha-N-acetylglucosaminidase family protein [Candidatus
           Arthromitus sp. SFB-mouse-Japan]
          Length = 898

 Score = 91.7 bits (226), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 115/527 (21%), Positives = 209/527 (39%), Gaps = 105/527 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+   GG L   W + +  L   I  R+ E+G+ P+   F G  P           + 
Sbjct: 352 MGNISAVGGELTPKWFEDRAKLSIDIQKRMLEVGIEPIHQMFIGYFPYKEN---SGVNVI 408

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W  +K   R      LD  +     I   + E+Q +  G++ + +  D F E  N 
Sbjct: 409 NGGYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANL 461

Query: 119 PPVDSPEYISSLGAAIYSGMQSGD-SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
              D+ E    L   + S +++    D+VW++Q W          P  +++ N +    +
Sbjct: 462 YGYDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNI 509

Query: 178 VVLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           ++LDL +++   W            +K+F    +I+ +L+NF G   +YG  + +     
Sbjct: 510 LILDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFY 569

Query: 228 EAR-----------TSENTTM--------------------AWINQYSVRRYGRSVPAIQ 256
           +A+           TSE                         ++ +Y   RYG+S   + 
Sbjct: 570 DAKYNSDYLSGIANTSEGVGFNNFIDELSTELIFSDEVNMDEFVKRYLKNRYGKSDRDLL 629

Query: 257 DAWNVLYHTVYN-CTD----GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 311
            A+N+L  TVYN  TD    GA++    VI A P ++ +                     
Sbjct: 630 VAFNILLDTVYNPVTDIYHEGASES---VINARPSLEIN--------------------- 665

Query: 312 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 371
                S +   H  Y + ++ R +E++I+  +E   +  Y  DLID+  + +   A+E +
Sbjct: 666 ---SASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASEYY 722

Query: 372 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
             I E Y   +   +  +S++FL L+     +L+ +D   L   +     L  ++  +  
Sbjct: 723 QIIQEYYNNGNIKYLQLISKKFLNLILLQANILSYNDKKSLQKIINKLDALDYDDYFKDT 782

Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
            ++N +  +T W+D    E   LRDY N        D+Y     +Y+
Sbjct: 783 LKYNKKMILTTWYDKLVSEDGGLRDYANT-------DFYDIVGTLYY 822


>gi|417965571|ref|ZP_12607078.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
           SFB-4]
 gi|380336329|gb|EIA26351.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
           SFB-4]
          Length = 685

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 111/523 (21%), Positives = 208/523 (39%), Gaps = 97/523 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+   GG L   W + +  L   I  R+ E+G+ P+   F G  P           + 
Sbjct: 146 MGNISAVGGELTPKWFEDRAKLSIDIQKRMLEVGIEPIHQMFIGYFPYKEN---SGVNVI 202

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W  +K   R      LD  +     I   + E+Q +  G++ + +  D F E  N 
Sbjct: 203 NGGYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANL 255

Query: 119 PPVDSPEYISSLGAAIYSGMQSGD-SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
              D+ E    L   + S +++    D+VW++Q W          P  +++ N +    +
Sbjct: 256 YGYDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNI 303

Query: 178 VVLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           ++LDL +++   W            +K+F    +I+ +L+NF G   +YG  + +     
Sbjct: 304 LILDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFY 363

Query: 228 EAR-----------TSENTTM--------------------AWINQYSVRRYGRSVPAIQ 256
           +A+           TSE                         ++ +Y   RYG+S   + 
Sbjct: 364 DAKYNSDYLSGIANTSEGVGFNNFIDELSTELIFSDEVNMDEFVKRYLKNRYGKSDRDLL 423

Query: 257 DAWNVLYHTVYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 315
            A+N+L  TVYN  TD          +       S+I+             ++ ++  + 
Sbjct: 424 VAFNILLDTVYNPVTD----------IYHEGASESVIN-------------ARPSLGINS 460

Query: 316 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 375
            S +   H  Y + ++ R +E++I+  +E   +  Y  DLID+  + +   A+E +  I 
Sbjct: 461 ASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASEYYQIIQ 520

Query: 376 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWN 435
           E Y   +   +  +S++FL L+     +L+ +D   L   +     L  ++  +   ++N
Sbjct: 521 EYYNNGNIKYLQLISKKFLNLILLQANILSYNDKKSLQKIINKLDALDYDDYFKDTLKYN 580

Query: 436 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
            +  +T W+D    E   LRDY N        D+Y     +Y+
Sbjct: 581 KKMILTTWYDKLVSEDGGLRDYANT-------DFYDIVGTLYY 616


>gi|417967717|ref|ZP_12608785.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
           SFB-co]
 gi|380340884|gb|EIA29424.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
           SFB-co]
          Length = 741

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 111/523 (21%), Positives = 208/523 (39%), Gaps = 97/523 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+   GG L   W + +  L   I  R+ E+G+ P+   F G  P           + 
Sbjct: 195 MGNISAVGGELTPKWFEDRAKLSIDIQKRMLEVGIEPIHQMFIGYFPYKEN---SGVNVI 251

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W  +K   R      LD  +     I   + E+Q +  G++ + +  D F E  N 
Sbjct: 252 NGGYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANL 304

Query: 119 PPVDSPEYISSLGAAIYSGMQSGD-SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
              D+ E    L   + S +++    D+VW++Q W          P  +++ N +    +
Sbjct: 305 YGYDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNI 352

Query: 178 VVLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           ++LDL +++   W            +K+F    +I+ +L+NF G   +YG  + +     
Sbjct: 353 LILDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFY 412

Query: 228 EAR-----------TSENTTM--------------------AWINQYSVRRYGRSVPAIQ 256
           +A+           TSE                         ++ +Y   RYG+S   + 
Sbjct: 413 DAKYNSDYLSGIANTSEGVGFNNFIDELSTELIFSDEVNMDEFVKRYLKNRYGKSDRDLL 472

Query: 257 DAWNVLYHTVYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 315
            A+N+L  TVYN  TD          +       S+I+             ++ ++  + 
Sbjct: 473 VAFNILLDTVYNPVTD----------IYHEGASESVIN-------------ARPSLGINS 509

Query: 316 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 375
            S +   H  Y + ++ R +E++I+  +E   +  Y  DLID+  + +   A+E +  I 
Sbjct: 510 ASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASEYYQIIQ 569

Query: 376 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWN 435
           E Y   +   +  +S++FL L+     +L+ +D   L   +     L  ++  +   ++N
Sbjct: 570 EYYNNGNIKYLQLISKKFLNLILLQANILSYNDKKSLQKIINKLDALDYDDYFKDTLKYN 629

Query: 436 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
            +  +T W+D    E   LRDY N        D+Y     +Y+
Sbjct: 630 KKMILTTWYDKLVSEDGGLRDYANT-------DFYDIVGTLYY 665


>gi|384455191|ref|YP_005667784.1| alpha-N-acetylglucosaminidase family protein [Candidatus
           Arthromitus sp. SFB-mouse-Yit]
 gi|418016862|ref|ZP_12656425.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
           SFB-mouse-NYU]
 gi|418371995|ref|ZP_12964091.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
           SFB-mouse-SU]
 gi|345505596|gb|EGX27892.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
           SFB-mouse-NYU]
 gi|346983532|dbj|BAK79208.1| alpha-N-acetylglucosaminidase family protein [Candidatus
           Arthromitus sp. SFB-mouse-Yit]
 gi|380342872|gb|EIA31299.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
           SFB-mouse-SU]
          Length = 898

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 115/527 (21%), Positives = 208/527 (39%), Gaps = 105/527 (19%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+   GG L   W + +  L   I  R+ E+G+ P+   F G  P           + 
Sbjct: 352 MGNISAVGGELTPKWFEDRAKLSIDIQKRMLEVGIEPIHQMFIGYFPYKEN---SGVNVI 408

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W  +K   R      LD  +     I   + E+Q +  G++ + +  D F E  N 
Sbjct: 409 NGGYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANL 461

Query: 119 PPVDSPEYISSLGAAIYSGMQSGD-SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
              D+ E    L   + S +++    D+VW++Q W          P  +++ N +    +
Sbjct: 462 YGYDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNI 509

Query: 178 VVLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           ++LDL +++   W            +K+F    +I+ +L+NF G   +YG  + +     
Sbjct: 510 LILDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFY 569

Query: 228 EAR-----------TSENTTM--------------------AWINQYSVRRYGRSVPAIQ 256
           +A+           TSE                         ++ +Y   RYG+S   + 
Sbjct: 570 DAKYNSDYLSGIANTSEGVGFNNFIDELSTELIFSDEVNMDEFVKRYLKNRYGKSDRDLL 629

Query: 257 DAWNVLYHTVYN-CTD----GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 311
            A+N+L  TVYN  TD    GA++    VI A P +  +                     
Sbjct: 630 VAFNILLDTVYNPVTDIYHEGASES---VINARPSLGIN--------------------- 665

Query: 312 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 371
                S +   H  Y + ++ R +E++I+  +E   +  Y  DLID+  + +   A+E +
Sbjct: 666 ---SASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASEYY 722

Query: 372 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
             I E Y   +   +  +S++FL L+     +L+ +D   L   +     L  ++  +  
Sbjct: 723 QIIQEYYNNGNIKYLQLISKKFLNLILLQANILSYNDKKSLQKIINKLDALDYDDYFKDT 782

Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
            ++N +  +T W+D    E   LRDY N        D+Y     +Y+
Sbjct: 783 LKYNKKMILTTWYDKLVSEDGGLRDYANT-------DFYDIVGTLYY 822


>gi|322792283|gb|EFZ16267.1| hypothetical protein SINV_02225 [Solenopsis invicta]
          Length = 87

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 42/92 (45%), Positives = 61/92 (66%), Gaps = 5/92 (5%)

Query: 393 FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEAS 452
            LEL +D++ +LA    FLLG WL  AK++A NEE+ + YE+NAR QIT+W  N +    
Sbjct: 1   LLELFDDLESILASGSNFLLGTWLTQAKEMADNEEERRSYEYNARNQITLWGPNGE---- 56

Query: 453 LLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
            +RDY NK WSG++ DY+ PR  ++ K + +S
Sbjct: 57  -IRDYANKQWSGVVADYFKPRWELFLKALEKS 87


>gi|315131339|emb|CBM69278.1| venom protein Ci-120 [Chelonus inanitus]
          Length = 165

 Score = 88.2 bits (217), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 79/131 (60%), Gaps = 7/131 (5%)

Query: 357 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPW 415
           D+TRQ+L   A  ++L + +++   D   VF+     L +L  D++ +L+ +  FL+G W
Sbjct: 1   DVTRQSLQLIAEHVYLKLQQSFHQKDL-AVFKAHANLLMQLFSDLESILSTNKHFLVGKW 59

Query: 416 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 475
           +++A+ L  N +++K YE NAR QIT+W  N +     +RDY NK W+G++  Y+G R +
Sbjct: 60  IKNARSLGTNVQEQKLYELNARNQITLWGPNGE-----IRDYANKQWAGVMSQYFGARWS 114

Query: 476 IYFKYMIESLE 486
           +Y   +  +LE
Sbjct: 115 LYLSVLEFALE 125


>gi|347541919|ref|YP_004856555.1| alpha-N-acetylglucosaminidase family protein [Candidatus
           Arthromitus sp. SFB-rat-Yit]
 gi|346984954|dbj|BAK80629.1| alpha-N-acetylglucosaminidase family protein [Candidatus
           Arthromitus sp. SFB-rat-Yit]
          Length = 912

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 116/544 (21%), Positives = 213/544 (39%), Gaps = 100/544 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+   GG L   W + +  L   I  R+ E G+ P+   F G  P           + 
Sbjct: 364 MGNISSIGGELTPKWFEDRAKLSIDIQTRMIEFGIEPIHQMFIGYFPYKEN---SGVNVI 420

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
           +   W  +K   R      LD  +     I   + ++Q + +G + + +  D F E  N 
Sbjct: 421 RGSYWSKIKGPDR------LDFNNNDVEFISSVYYKKQKELFGESKY-FAGDLFHEGNNL 473

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
              D  E  + +   +   + +   +++W++Q W  S       P  + + N +     +
Sbjct: 474 YGYDPVELSNKVLKLL---IDNNGENSIWIIQSWSHS-------PSSETIEN-LNRNNTL 522

Query: 179 VLDLFAEVKPIWSTSKQFYGVPY----------IWCMLHNFAGNIEMYGILDSIAFGPVE 228
           +LDL +++   W    +F  + +          I+ +L+NF G   +YG    +     +
Sbjct: 523 ILDLHSQLNTRWKGISKFNNMSWKDREFDRSNWIFGVLNNFGGRSGLYGHTRHLLNQFYD 582

Query: 229 ARTSENTTMA-------------------------------WINQYSVRRYGRSVPAIQD 257
           A+ + N                                   ++++Y   RYG+S   +  
Sbjct: 583 AKYNSNYLKGVAHTSEGIGFNNFIDELVTEIIFSDKLDIDEFVSRYLRNRYGKSDNDLLK 642

Query: 258 AWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 312
           A+N+L  TVYN       +GA++    VI A P +D                       +
Sbjct: 643 AFNILLDTVYNPVINIYHEGASES---VINARPSLD-----------------------V 676

Query: 313 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 372
           KS  S +   H  Y++ ++  AL ++ +  NE   S  Y  DLID+  + +   +NE + 
Sbjct: 677 KS-ASKWGSIHKNYNSEKLEEALRIYFSKYNEFKDSKGYMTDLIDIASEVIINLSNEYYK 735

Query: 373 NIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
           N+ + Y  N     F+L S+RFL ++     +L  ++   L   ++    L  ++  E  
Sbjct: 736 NLQDYYN-NGEIEFFKLNSQRFLNMILLQANILYYNERKSLQKLIDKLDDLNYDDYFEDT 794

Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG--D 489
              N +T +T W+D    E   LRDY N  +  ++   Y  R   +F  + E+  +G  D
Sbjct: 795 LIINKKTILTTWYDKQVSEDDGLRDYANTDFYDIVGTLYYNRWKRFFDNIQENAVNGFYD 854

Query: 490 GFRL 493
            +R 
Sbjct: 855 DYRF 858


>gi|440799252|gb|ELR20307.1| AlphaN-acetylglucosaminidase, putative, partial [Acanthamoeba
           castellanii str. Neff]
          Length = 389

 Score = 85.1 bits (209), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 50/132 (37%), Positives = 68/132 (51%), Gaps = 19/132 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAA----------- 49
           M N+ GWGGPL  +W   Q  LQKKI+ R    GM PVLPAF+G VP A           
Sbjct: 261 MGNIQGWGGPLDLAWRLAQAELQKKIVERQRMFGMLPVLPAFAGFVPEASVKFTLGRGGG 320

Query: 50  -----LQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGR 104
                ++ ++P+A +T+  +W       ++   Y L   D L+  IG   I    +E+G 
Sbjct: 321 CGEQGIKRIYPTANLTKSADWAGFPH--QYTNVYFLSPLDSLYKTIGSKVIRLVEEEFG- 377

Query: 105 TSHIYNCDTFDE 116
           T HIYN DTF+E
Sbjct: 378 TDHIYNADTFNE 389


>gi|326435733|gb|EGD81303.1| alpha-N-acetylglucosaminidase [Salpingoeca sp. ATCC 50818]
          Length = 696

 Score = 79.0 bits (193), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 47/116 (40%), Positives = 61/116 (52%), Gaps = 18/116 (15%)

Query: 6   GWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNW 65
           G GGPLP  W  QQ  LQ+ I+ R  ELG+  +LPAF GNVPAAL  ++P A I+    W
Sbjct: 246 GVGGPLPSQWYKQQWELQRAIVQRQTELGIGSLLPAFQGNVPAALAQLYPHANISN--GW 303

Query: 66  FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDT-FDENTPP 120
                         LD  DPLF  I    +++ + ++G T H Y  D  FD +T P
Sbjct: 304 --------------LDGLDPLFATIADLTMQELIADFGAT-HFYQADGFFDHSTGP 344



 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/116 (32%), Positives = 57/116 (49%), Gaps = 7/116 (6%)

Query: 134 IYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST 192
           +Y+ M   D  A+W+ QGW+    D       M    ++VP G+LV+LD+ AE   IW+ 
Sbjct: 500 VYTTMTKRDPHAIWVYQGWIWLDLDNAQGFSFMSGFTSAVPRGRLVILDMEAEFDEIWAW 559

Query: 193 SKQFYGVPYIWCMLHNFAGNIEMYGILDSI------AFGPVEARTSENTTMAWINQ 242
           S+ F+   +IW  + NF GN  MYG +  +       F   +A      TM  I+Q
Sbjct: 560 SQSFFNTTFIWAAMDNFGGNNGMYGDIQLVFDRTRRVFAQSDAVVGVGITMEGIDQ 615


>gi|84625359|ref|YP_452731.1| hypothetical protein XOO_3702 [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|84369299|dbj|BAE70457.1| truncated N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae
           MAFF 311018]
          Length = 369

 Score = 75.5 bits (184), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 33/62 (53%), Positives = 43/62 (69%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 201 MGNIEGYRAPLPQQWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPKAFAQAHPHARIY 260

Query: 61  QL 62
           ++
Sbjct: 261 RM 262


>gi|323456608|gb|EGB12475.1| hypothetical protein AURANDRAFT_20306 [Aureococcus anophagefferens]
          Length = 243

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 36/64 (56%), Positives = 47/64 (73%), Gaps = 1/64 (1%)

Query: 5   HGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT-QLG 63
           HG GGPLP+++ D QL L K+IL R+  LG+ PVLP+F GNVP AL+++FP A IT Q  
Sbjct: 120 HGVGGPLPRTFADAQLALAKRILARMRGLGIVPVLPSFQGNVPPALKDLFPEANITVQAP 179

Query: 64  NWFS 67
           +W S
Sbjct: 180 HWTS 183


>gi|390353486|ref|XP_003728120.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Strongylocentrotus
           purpuratus]
          Length = 385

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 32/50 (64%), Positives = 38/50 (76%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAAL 50
           M N+ GWGGP+PQSW   QL LQ KIL R+ ELGM PVLPAF+G+VP + 
Sbjct: 218 MGNIDGWGGPIPQSWHTNQLALQHKILKRMRELGMIPVLPAFAGHVPKSF 267


>gi|358381741|gb|EHK19415.1| hypothetical protein TRIVIDRAFT_224650 [Trichoderma virens Gv29-8]
          Length = 217

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 40/158 (25%), Positives = 76/158 (48%), Gaps = 14/158 (8%)

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPL-G 175
           TPP     Y+ +  +  +  ++S D +A+W+ Q WLF+ +  FW   +++     + +  
Sbjct: 2   TPPSGELNYLRNASSNTWKALKSADPEAIWVFQAWLFAQNTTFWTNDRIEVYPGGITIDS 61

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
            +++LD++ E    W  ++ +Y  P+IWC L N+   I MYG + ++   P+ A      
Sbjct: 62  DMLILDIWLESMSQWQCAQSYYSKPWIWCELQNYGATINMYGQIQNLTKSPILALQESQ- 120

Query: 236 TMAWINQYSVRRYGRSVPAIQD---AWNVLYHTVYNCT 270
                   S+   G S+ A Q     +++L    +NCT
Sbjct: 121 --------SLVGLGLSMEAQQSNEIVFDLLLSQAWNCT 150


>gi|224135741|ref|XP_002322149.1| predicted protein [Populus trichocarpa]
 gi|222869145|gb|EEF06276.1| predicted protein [Populus trichocarpa]
          Length = 173

 Score = 40.4 bits (93), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 21/60 (35%), Positives = 33/60 (55%), Gaps = 2/60 (3%)

Query: 477 YFKYMIESLESGD--GFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
           ++ + I++ E G    F +K   +   K +      R + PVESNG+AL  S+WL+ KYL
Sbjct: 97  HYTFQIQNTEFGKWPRFPVKGLEKRMDKASKQLAESRKIIPVESNGNALNISRWLFYKYL 156


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.136    0.432 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,295,306,757
Number of Sequences: 23463169
Number of extensions: 401806658
Number of successful extensions: 854199
Number of sequences better than 100.0: 514
Number of HSP's better than 100.0 without gapping: 500
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 850859
Number of HSP's gapped (non-prelim): 1141
length of query: 542
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 394
effective length of database: 8,886,646,355
effective search space: 3501338663870
effective search space used: 3501338663870
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)