BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 012032
         (472 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255540793|ref|XP_002511461.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
 gi|223550576|gb|EEF52063.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
          Length = 809

 Score =  819 bits (2115), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 383/464 (82%), Positives = 423/464 (91%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGG LPQSW  QQL+LQKKIL R+YELGMNPVLPAFSGNVPAAL+N+FPSAKI 
Sbjct: 234 MGNLHRWGGSLPQSWFFQQLILQKKILARMYELGMNPVLPAFSGNVPAALRNIFPSAKIA 293

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LGNWFSVKSD RWCCTYLLDATDPLFIEIGRAFIEQQL+EYG TSHIYNCDTFDENTPP
Sbjct: 294 RLGNWFSVKSDLRWCCTYLLDATDPLFIEIGRAFIEQQLEEYGSTSHIYNCDTFDENTPP 353

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           VD P+YIS+LGAA++ GMQSGD+DAVWLMQGWLFSYDPFWRPPQMKALL+SVP+G+LVVL
Sbjct: 354 VDDPKYISALGAAVFKGMQSGDNDAVWLMQGWLFSYDPFWRPPQMKALLHSVPVGRLVVL 413

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DLFAEVKPIW++S QFYGVPYIWCMLHNFAGN+EMYGILDSIA GPVEARTSEN+TMVGV
Sbjct: 414 DLFAEVKPIWTSSYQFYGVPYIWCMLHNFAGNVEMYGILDSIASGPVEARTSENSTMVGV 473

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           GMSMEGIEQNPVVYDLMSEMAFQH+KVDVKAWIN YS RRYGRSVP+IQDAW++LYHTVY
Sbjct: 474 GMSMEGIEQNPVVYDLMSEMAFQHKKVDVKAWINLYSTRRYGRSVPSIQDAWDILYHTVY 533

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           NCTDGA DKNRDVIVAFPDV+P   SV++ ++   GKPVS+ AVLK  + SYDHPHLWYS
Sbjct: 534 NCTDGAYDKNRDVIVAFPDVNPFYFSVSQKRHHLNGKPVSRRAVLKENSDSYDHPHLWYS 593

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
           TSEV+ ALELFI SG ELS S+TY YDL+DLTRQALAKY NELFL IIE+YQ ND +GV 
Sbjct: 594 TSEVLHALELFITSGEELSGSSTYSYDLVDLTRQALAKYGNELFLKIIESYQANDGNGVA 653

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
             S++FL+LVEDMD LL CH+GFLLGPWLESAKQLAQ++EQEKQ
Sbjct: 654 SRSQKFLDLVEDMDTLLGCHEGFLLGPWLESAKQLAQDQEQEKQ 697


>gi|225457148|ref|XP_002280399.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Vitis vinifera]
          Length = 813

 Score =  810 bits (2092), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 382/468 (81%), Positives = 418/468 (89%), Gaps = 4/468 (0%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPLPQSWLDQQL+LQKKIL R+YELGM PVLPAFSGNVPAAL+ +FPSAKIT
Sbjct: 235 MGNLHGWGGPLPQSWLDQQLLLQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKIT 294

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LGNWF+V  +PRWCCTYLLDATDPLFIEIG+AFI+QQLKEYGRT HIYNCDTFDENTPP
Sbjct: 295 RLGNWFTVGGNPRWCCTYLLDATDPLFIEIGKAFIQQQLKEYGRTGHIYNCDTFDENTPP 354

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           VD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYDPFWRPPQMKALL+SVP+G+LVVL
Sbjct: 355 VDDPEYISSLGAAIFRGMQSGDSNAIWLMQGWLFSYDPFWRPPQMKALLHSVPMGRLVVL 414

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DLFAEVKPIW TS+QFYGVPYIWCMLHNFAGNIEMYGILD++A GPVEARTSEN+TMVGV
Sbjct: 415 DLFAEVKPIWITSEQFYGVPYIWCMLHNFAGNIEMYGILDAVASGPVEARTSENSTMVGV 474

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           GMSMEGIEQNPVVYDLMSEMAFQH KVDVK WI  YS RRYG+SVP IQDAWN+LYHTVY
Sbjct: 475 GMSMEGIEQNPVVYDLMSEMAFQHSKVDVKVWIALYSTRRYGKSVPEIQDAWNILYHTVY 534

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTE----GKYQNYGKPVSKEAVLKSETSSYDHPH 356
           NCTDG+ DKNRDVIVAFPD+DPS I   +    G Y  YGK VS+  VLK  T+S++ PH
Sbjct: 535 NCTDGSYDKNRDVIVAFPDIDPSFIPTPKLSMPGGYHRYGKSVSRRTVLKEITNSFEQPH 594

Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
           LWYSTSEV  AL LFIASG +L  SNTYRYDL+DLTRQALAKYAN+LFL +IEAYQLND 
Sbjct: 595 LWYSTSEVKDALGLFIASGGQLLGSNTYRYDLVDLTRQALAKYANQLFLEVIEAYQLNDV 654

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
            G    S++FLELVEDMD LLACHDGFLLGPWLESAKQLAQ+E+QE Q
Sbjct: 655 RGAACHSQKFLELVEDMDTLLACHDGFLLGPWLESAKQLAQDEQQEIQ 702


>gi|224121634|ref|XP_002318632.1| predicted protein [Populus trichocarpa]
 gi|222859305|gb|EEE96852.1| predicted protein [Populus trichocarpa]
          Length = 812

 Score =  808 bits (2086), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/464 (81%), Positives = 420/464 (90%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NLH WGGPLPQSW DQQLVLQKKIL R+YELGM PVLPAFSGNVPAAL+N+FPSAKIT
Sbjct: 238 MANLHRWGGPLPQSWFDQQLVLQKKILARMYELGMTPVLPAFSGNVPAALRNIFPSAKIT 297

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LGNWFSV+SD RWCCTYLLDATDPLFIEIGRAFIEQQL EYG TSHIYNCDTFDENTPP
Sbjct: 298 RLGNWFSVRSDVRWCCTYLLDATDPLFIEIGRAFIEQQLTEYGSTSHIYNCDTFDENTPP 357

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           VD PEYISSLG +I+ GMQSGDS+AVWLMQGWLFSYDPFWRPPQ KALL+SVP+G+LVVL
Sbjct: 358 VDDPEYISSLGGSIFEGMQSGDSNAVWLMQGWLFSYDPFWRPPQTKALLHSVPIGRLVVL 417

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DLFAEVKPIW+TS+QFYGVPYIWCMLHNFAGN+EMYG LDS+A GPVEARTSEN+TMVGV
Sbjct: 418 DLFAEVKPIWNTSEQFYGVPYIWCMLHNFAGNLEMYGYLDSVASGPVEARTSENSTMVGV 477

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           GMSMEGIEQNPVVYDLMSEMAFQ  KVDVK WI+ YS RRYGRSVP IQ+AWN+LYHTVY
Sbjct: 478 GMSMEGIEQNPVVYDLMSEMAFQKNKVDVKEWIDLYSARRYGRSVPTIQNAWNILYHTVY 537

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           NCTDGA DKNRDVIVAFPDV+P+++S+ +G++    K VS+ A L   T SY+HPHLWYS
Sbjct: 538 NCTDGAYDKNRDVIVAFPDVNPNLVSMLQGRHHTDVKLVSRRAALIKNTDSYEHPHLWYS 597

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
           T+EV+RALELFIA G+ELS S+TY YDL+DLTRQ LAKYANELFL +IEAY+L D+HGV 
Sbjct: 598 TTEVVRALELFIAGGDELSGSSTYSYDLVDLTRQVLAKYANELFLKVIEAYRLKDSHGVA 657

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
             S+ FL+LVED+D LLACH+GFLLGPWLESAKQLAQ+EEQ+ Q
Sbjct: 658 HQSQMFLDLVEDIDTLLACHEGFLLGPWLESAKQLAQDEEQQIQ 701


>gi|356519003|ref|XP_003528164.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Glycine max]
          Length = 812

 Score =  797 bits (2058), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 372/464 (80%), Positives = 417/464 (89%), Gaps = 1/464 (0%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPLPQSW DQQL+LQKKIL R++ELGM PVLPAFSGNVPAAL+++FPSAKIT
Sbjct: 233 MGNLHGWGGPLPQSWFDQQLILQKKILARMFELGMTPVLPAFSGNVPAALKHIFPSAKIT 292

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LGNWFSVK+D +WCCTYLLDATD LF+EIG+AFIE+QL+EYGRTSHIYNCDTFDENTPP
Sbjct: 293 RLGNWFSVKNDLKWCCTYLLDATDSLFVEIGKAFIEKQLQEYGRTSHIYNCDTFDENTPP 352

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           VD PEYISSLGAA + GMQSGD DAVWLMQGWLFSYDPFWRPPQMKALL+SVP+GKLVVL
Sbjct: 353 VDDPEYISSLGAATFKGMQSGDDDAVWLMQGWLFSYDPFWRPPQMKALLHSVPVGKLVVL 412

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DLFAEVKPIW TS+QFYGVPYIWCMLHNFAGNIEMYGILD+IA GP++ARTS N+TMVGV
Sbjct: 413 DLFAEVKPIWVTSEQFYGVPYIWCMLHNFAGNIEMYGILDAIASGPIDARTSNNSTMVGV 472

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           GMSMEGIEQNP+VYDLMSEMAFQH+KVDVKAW++ YS RRYG+++P IQ+ WNVLYHT+Y
Sbjct: 473 GMSMEGIEQNPIVYDLMSEMAFQHKKVDVKAWVDMYSTRRYGQTLPLIQEGWNVLYHTIY 532

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           NCTDGA DKNRDVIVAFPDVDPS+ISV   +  +  KP S   ++K  T S+D PHLWY 
Sbjct: 533 NCTDGAYDKNRDVIVAFPDVDPSLISVQHEQSHHNDKPYSG-TIIKEITDSFDRPHLWYP 591

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
           TSEVI ALELFI SG+ELS  NTYRYDL+DLTRQ LAKYANELF  +IEAYQ +D HG+ 
Sbjct: 592 TSEVIYALELFITSGDELSRCNTYRYDLVDLTRQVLAKYANELFFKVIEAYQSHDIHGMT 651

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
            LS+RFL+LVED+D LLACHDGFLLGPWLESAKQLA NEEQE+Q
Sbjct: 652 LLSQRFLDLVEDLDTLLACHDGFLLGPWLESAKQLALNEEQERQ 695


>gi|297733843|emb|CBI15090.3| unnamed protein product [Vitis vinifera]
          Length = 846

 Score =  772 bits (1994), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/501 (74%), Positives = 411/501 (82%), Gaps = 37/501 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPLPQSWLDQQL+LQKKIL R+YELGM PVLPAFSGNVPAAL+ +FPSAKIT
Sbjct: 235 MGNLHGWGGPLPQSWLDQQLLLQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKIT 294

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LGNWF+V  +PRWCCTYLLDATDPLFIEIG+AFI+QQLKEYGRT HIYNCDTFDENTPP
Sbjct: 295 RLGNWFTVGGNPRWCCTYLLDATDPLFIEIGKAFIQQQLKEYGRTGHIYNCDTFDENTPP 354

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           VD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYDPFWRPPQMKALL+SVP+G+LVVL
Sbjct: 355 VDDPEYISSLGAAIFRGMQSGDSNAIWLMQGWLFSYDPFWRPPQMKALLHSVPMGRLVVL 414

Query: 181 DLFAEVKPIWSTSKQFYGVPYIW--------------------------------CMLHN 208
           DLFAEVKPIW TS+QFYGVPYIW                                CMLHN
Sbjct: 415 DLFAEVKPIWITSEQFYGVPYIWKVTKSGRQQSLKFTNEKCCSFFRSHSPDSEVLCMLHN 474

Query: 209 FAGNIEMYGILDSIAFGPVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 267
           FAGNIEMYGILD++A GP+  R     + +VGVGMSMEGIEQNPVVYDLMSEMAFQH KV
Sbjct: 475 FAGNIEMYGILDAVASGPILLRAKYAESAVVGVGMSMEGIEQNPVVYDLMSEMAFQHSKV 534

Query: 268 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 327
           DVK WI  YS RRYG+SVP IQDAWN+LYHTVYNCTDG+ DKNRDVIVAFPD+DPS I  
Sbjct: 535 DVKVWIALYSTRRYGKSVPEIQDAWNILYHTVYNCTDGSYDKNRDVIVAFPDIDPSFIPT 594

Query: 328 TE----GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 383
            +    G Y  YGK VS+  VLK  T+S++ PHLWYSTSEV  AL LFIASG +L  SNT
Sbjct: 595 PKLSMPGGYHRYGKSVSRRTVLKEITNSFEQPHLWYSTSEVKDALGLFIASGGQLLGSNT 654

Query: 384 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 443
           YRYDL+DLTRQALAKYAN+LFL +IEAYQLND  G    S++FLELVEDMD LLACHDGF
Sbjct: 655 YRYDLVDLTRQALAKYANQLFLEVIEAYQLNDVRGAACHSQKFLELVEDMDTLLACHDGF 714

Query: 444 LLGPWLESAKQLAQNEEQEKQ 464
           LLGPWLESAKQLAQ+E+QE Q
Sbjct: 715 LLGPWLESAKQLAQDEQQEIQ 735


>gi|449441031|ref|XP_004138287.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis sativus]
          Length = 808

 Score =  759 bits (1960), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 352/464 (75%), Positives = 409/464 (88%), Gaps = 2/464 (0%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLPQSW DQQL+LQKK++ R++ELGM PVLPAFSGN+PAA + ++P+AKIT
Sbjct: 236 MGNLHKWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKIT 295

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LGNWF+V SDPRWCCTYLLDA DPLF+EIG+AFIEQQ KEYGRTSH+YNCDTFDENTPP
Sbjct: 296 RLGNWFTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPP 355

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           VD  EYISSLG+AI+ GMQ+GDS+AVWLMQGW+FSYDPFWRP QMKALL+SVPLG+LVVL
Sbjct: 356 VDDVEYISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVL 415

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL+AEVKPIW +S+QFYG+PYIWCMLHNFAGN+EMYGILDSIA GP+EAR+S  +TMVGV
Sbjct: 416 DLYAEVKPIWISSEQFYGIPYIWCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGV 475

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           GMSMEGIEQNPVVYDLMSEMAFQH KVDVK W+ QYSVRRYG  VP+IQDAW+VLYHTVY
Sbjct: 476 GMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTVY 535

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           NCTDGA DKNRDVIVAFPDVDPS I V       +G   S  +V + + +++D PHLWY 
Sbjct: 536 NCTDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDS--SVDRLQDATFDRPHLWYP 593

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
           TSEVI AL+LFIA G++LS+SNTYRYDL+DLTRQALAKY+NELF  I++AYQL+D   + 
Sbjct: 594 TSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMA 653

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
            LS+ FLELV D+D LLACH+GFLLGPWL+SAKQLA++EE+EKQ
Sbjct: 654 SLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQ 697


>gi|326515664|dbj|BAK07078.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 829

 Score =  704 bits (1816), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 319/469 (68%), Positives = 393/469 (83%), Gaps = 6/469 (1%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+HGWGGPLPQ+WLD QL LQKKIL R+Y  GM+PVLPAFSGN+PAAL+  FPSAK+T
Sbjct: 239 MANMHGWGGPLPQTWLDDQLTLQKKILSRMYAFGMSPVLPAFSGNIPAALKLKFPSAKVT 298

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LGNWF+V S+PRWCCTYLLDA+DPL++EIG+ FIE+Q++EYGRTSH+YNCDTFDENTPP
Sbjct: 299 HLGNWFTVDSNPRWCCTYLLDASDPLYVEIGKLFIEEQIREYGRTSHVYNCDTFDENTPP 358

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P YISSLGAA +  MQSGD+DA+WLMQGWLF+YDPFW PPQMKALL+SVP+G+++VL
Sbjct: 359 LSDPNYISSLGAATFRAMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGRMIVL 418

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL+AEVKP+W  S QFYGVPYIWCMLHNFA + EMYG+LD++A GP++AR SEN+TMVGV
Sbjct: 419 DLYAEVKPVWINSDQFYGVPYIWCMLHNFAADFEMYGVLDAVASGPIDARLSENSTMVGV 478

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           GMSMEGIEQNP+VYDLMSEM F H +VD+K W+  Y  RRYG+SV  +QDAW +L+ T+Y
Sbjct: 479 GMSMEGIEQNPIVYDLMSEMVFHHRQVDLKVWVETYPTRRYGKSVVGLQDAWRILHQTLY 538

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAVLK-SETSSYDHP 355
           NCTDG  DKNRDVIVAFPDV+PS+I  T G Y    +NY   +S+  V+K +   +Y+ P
Sbjct: 539 NCTDGKNDKNRDVIVAFPDVEPSVIQ-TPGLYARTSKNYSTMLSENYVMKDAPNDAYEQP 597

Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
           H+WY T  VI ALELF+ SG+E+S S+T+RYDL+DLTRQALAKYAN++FL II+ Y+ N+
Sbjct: 598 HIWYDTIAVIHALELFLESGDEVSDSSTFRYDLVDLTRQALAKYANQIFLKIIQGYKSNN 657

Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
            + V  L  RFL LV+D+D LLA H+GFLLGPWLESAK LA+++EQE Q
Sbjct: 658 VNQVTTLCERFLNLVKDLDMLLASHEGFLLGPWLESAKGLARSQEQEIQ 706


>gi|326519955|dbj|BAK03902.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 829

 Score =  702 bits (1811), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 319/469 (68%), Positives = 392/469 (83%), Gaps = 6/469 (1%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+HGWGGPLPQ+WLD QL LQKKIL R+Y  GM+PVLPAFSGN+PAAL+  FPSAK+T
Sbjct: 239 MANMHGWGGPLPQTWLDDQLTLQKKILSRMYAFGMSPVLPAFSGNIPAALKLKFPSAKVT 298

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LGNWF+V S+PRWCCTYLLDA+DPL++EIG+ FIE+Q++EYGRTSH+YNCDTFDENTPP
Sbjct: 299 HLGNWFTVDSNPRWCCTYLLDASDPLYVEIGKLFIEEQIREYGRTSHVYNCDTFDENTPP 358

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P YISSLGAA +  MQSGD+DA+WLMQGWLF+YDPFW PPQMKALL+SVP+G+++VL
Sbjct: 359 LSDPNYISSLGAATFRAMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGRMIVL 418

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL+AEVKP W  S QFYGVPYIWCMLHNFA + EMYG+LD++A GP++AR SEN+TMVGV
Sbjct: 419 DLYAEVKPAWINSDQFYGVPYIWCMLHNFAADFEMYGVLDAVASGPIDARLSENSTMVGV 478

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           GMSMEGIEQNP+VYDLMSEM F H +VD+K W+  Y  RRYG+SV  +QDAW +L+ T+Y
Sbjct: 479 GMSMEGIEQNPIVYDLMSEMVFHHRQVDLKVWVETYPTRRYGKSVVGLQDAWRILHQTLY 538

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAVLK-SETSSYDHP 355
           NCTDG  DKNRDVIVAFPDV+PS+I  T G Y    +NY   +S+  V+K +   +Y+ P
Sbjct: 539 NCTDGKNDKNRDVIVAFPDVEPSVIQ-TPGLYARTSKNYSTMLSENYVMKDAPNDAYEQP 597

Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
           H+WY T  VI ALELF+ SG+E+S S+T+RYDL+DLTRQALAKYAN++FL II+ Y+ N+
Sbjct: 598 HIWYDTIAVIHALELFLESGDEVSDSSTFRYDLVDLTRQALAKYANQIFLKIIQGYKSNN 657

Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
            + V  L  RFL LV+D+D LLA H+GFLLGPWLESAK LA+++EQE Q
Sbjct: 658 VNQVTTLCERFLNLVKDLDMLLASHEGFLLGPWLESAKGLARSQEQEIQ 706


>gi|414585093|tpg|DAA35664.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
          Length = 721

 Score =  702 bits (1811), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 323/470 (68%), Positives = 392/470 (83%), Gaps = 6/470 (1%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+HGWGGPLPQ+WLD QLVLQKKIL R+Y  GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 240 MANMHGWGGPLPQTWLDDQLVLQKKILSRMYSFGMFPVLPAFSGNIPAALKSKFPSAKVT 299

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYGRTSHIYNCDTFDENTPP
Sbjct: 300 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFIEEQIREYGRTSHIYNCDTFDENTPP 359

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P YISSLGAA + GMQSGD+DA+WLMQGWLF+YDPFW PPQMKALL+SVP+GK++VL
Sbjct: 360 LSDPNYISSLGAATFRGMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGKMIVL 419

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL+AEVKP+W  S Q YGVPYIWCMLHNFA + EMYG+LD++A GP++AR S+N+TMVGV
Sbjct: 420 DLYAEVKPVWINSDQLYGVPYIWCMLHNFAADFEMYGVLDALASGPIDARLSDNSTMVGV 479

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           GMSMEGIEQNP+VYDLMSEMAF H +VD++ W+  Y  RRYG+ V  +QDAW +LY T+Y
Sbjct: 480 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVKTYPTRRYGKPVKGLQDAWWILYRTLY 539

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN----YGKPVSKEAVLKSETS-SYDHP 355
           NCTDG  DKNRDVIVAFPDV+P +I+ T G + N    Y    SK  + K  +S +Y+HP
Sbjct: 540 NCTDGKNDKNRDVIVAFPDVEPFVIA-TPGLHVNTRQMYSTVPSKNYIRKDVSSDAYEHP 598

Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
           HLWY T+ VI ALELF+  G+E+S SNT+RYDL+DLTRQ LAKYAN++FL IIE+Y+ N+
Sbjct: 599 HLWYDTNAVIHALELFLQHGDEVSDSNTFRYDLVDLTRQVLAKYANDVFLKIIESYKSNN 658

Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
            + V  L + FL LV D+D LL+ H+GFLLGPWLESAK LA+N EQE QV
Sbjct: 659 MNQVTILCQHFLSLVNDLDTLLSSHEGFLLGPWLESAKGLARNSEQEIQV 708


>gi|222629680|gb|EEE61812.1| hypothetical protein OsJ_16433 [Oryza sativa Japonica Group]
          Length = 1129

 Score =  701 bits (1810), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 318/469 (67%), Positives = 388/469 (82%), Gaps = 6/469 (1%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M+N+HGWGGPLPQSWLD QL LQKKIL R+Y  GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 537  MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 596

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYG TSH+Y+CDTFDENTPP
Sbjct: 597  HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGGTSHVYSCDTFDENTPP 656

Query: 121  VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            +  P YISSLGAA + GMQSGD DA+WLMQGWLFSYDPFW PPQMKALL+SVP+G+++VL
Sbjct: 657  LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFSYDPFWEPPQMKALLHSVPVGRMIVL 716

Query: 181  DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
            DL+AEVKPIW  S QFYGVPYIWCMLHNFA + EMYG+LD +A GP++AR S N+TMVGV
Sbjct: 717  DLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEMYGVLDMVASGPIDARLSANSTMVGV 776

Query: 241  GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
            GMSMEGIEQNP+VYDLMSEMAF H +VD++ W+  Y  RRYG+S+  +QDAW +LY T+Y
Sbjct: 777  GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVETYPTRRYGKSMVGLQDAWKILYQTLY 836

Query: 301  NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAV-LKSETSSYDHP 355
            NCTDG  DKNRDVIVAFPDV+P +I  T G Y    + Y   +SK  + + +    Y+HP
Sbjct: 837  NCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSSSKTYSTKLSKNYIAVDASNDEYEHP 895

Query: 356  HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
            HLWY T  VIRALELF+  G+E+S SNT+RYDL+DLTRQ LAKYAN++F+ IIE+Y+ N+
Sbjct: 896  HLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVDLTRQTLAKYANQVFVKIIESYKSNN 955

Query: 416  AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
             + V  L + F++LV D+D LLA H+GFLLGPWLESAK LA+++EQE Q
Sbjct: 956  VNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLESAKGLARDKEQEMQ 1004


>gi|414585092|tpg|DAA35663.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
          Length = 831

 Score =  701 bits (1808), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 322/469 (68%), Positives = 391/469 (83%), Gaps = 6/469 (1%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+HGWGGPLPQ+WLD QLVLQKKIL R+Y  GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 240 MANMHGWGGPLPQTWLDDQLVLQKKILSRMYSFGMFPVLPAFSGNIPAALKSKFPSAKVT 299

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYGRTSHIYNCDTFDENTPP
Sbjct: 300 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFIEEQIREYGRTSHIYNCDTFDENTPP 359

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P YISSLGAA + GMQSGD+DA+WLMQGWLF+YDPFW PPQMKALL+SVP+GK++VL
Sbjct: 360 LSDPNYISSLGAATFRGMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGKMIVL 419

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL+AEVKP+W  S Q YGVPYIWCMLHNFA + EMYG+LD++A GP++AR S+N+TMVGV
Sbjct: 420 DLYAEVKPVWINSDQLYGVPYIWCMLHNFAADFEMYGVLDALASGPIDARLSDNSTMVGV 479

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           GMSMEGIEQNP+VYDLMSEMAF H +VD++ W+  Y  RRYG+ V  +QDAW +LY T+Y
Sbjct: 480 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVKTYPTRRYGKPVKGLQDAWWILYRTLY 539

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN----YGKPVSKEAVLKSETS-SYDHP 355
           NCTDG  DKNRDVIVAFPDV+P +I+ T G + N    Y    SK  + K  +S +Y+HP
Sbjct: 540 NCTDGKNDKNRDVIVAFPDVEPFVIA-TPGLHVNTRQMYSTVPSKNYIRKDVSSDAYEHP 598

Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
           HLWY T+ VI ALELF+  G+E+S SNT+RYDL+DLTRQ LAKYAN++FL IIE+Y+ N+
Sbjct: 599 HLWYDTNAVIHALELFLQHGDEVSDSNTFRYDLVDLTRQVLAKYANDVFLKIIESYKSNN 658

Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
            + V  L + FL LV D+D LL+ H+GFLLGPWLESAK LA+N EQE Q
Sbjct: 659 MNQVTILCQHFLSLVNDLDTLLSSHEGFLLGPWLESAKGLARNSEQEIQ 707


>gi|218195716|gb|EEC78143.1| hypothetical protein OsI_17702 [Oryza sativa Indica Group]
          Length = 829

 Score =  697 bits (1800), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 317/469 (67%), Positives = 388/469 (82%), Gaps = 6/469 (1%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+HGWGGPLPQSWLD QL LQKKIL R+Y  GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 237 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 296

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYG TSH+Y+CDTFDENTPP
Sbjct: 297 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGGTSHVYSCDTFDENTPP 356

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P YISSLGAA + GMQSGD DA+WLMQGWLFSYDPFW PPQMKALL+SVP+G+++VL
Sbjct: 357 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFSYDPFWEPPQMKALLHSVPVGRMIVL 416

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL+AEVKPIW  S QFYGVPYIWCMLHNFA + EMYG+LD +A GP++AR S N+TM+GV
Sbjct: 417 DLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEMYGVLDMVASGPIDARLSANSTMIGV 476

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           GMSMEGIEQNP+VYDLMSEMAF H +VD++ W+  Y  RRYG+S+  +QDAW +LY T+Y
Sbjct: 477 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVETYPTRRYGKSIVGLQDAWKILYQTLY 536

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAV-LKSETSSYDHP 355
           NCTDG  DKNRDVIVAFPDV+P +I  T G Y    + Y   +SK  + + +    Y+HP
Sbjct: 537 NCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSSSKTYSTKLSKNYIAVDASNDEYEHP 595

Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
           HLWY T  VIRALELF+  G+E+S SNT+RYDL+DLTRQ LAKYAN++F+ IIE+Y+ N+
Sbjct: 596 HLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVDLTRQTLAKYANQVFVKIIESYKSNN 655

Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
            + V  L + F++LV D+D LLA H+GFLLGPWLESAK LA+++EQE Q
Sbjct: 656 VNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLESAKGLARDKEQEMQ 704


>gi|38345908|emb|CAE04506.2| OSJNBb0059K02.16 [Oryza sativa Japonica Group]
          Length = 829

 Score =  697 bits (1798), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 318/469 (67%), Positives = 388/469 (82%), Gaps = 6/469 (1%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+HGWGGPLPQSWLD QL LQKKIL R+Y  GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 237 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 296

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYG TSH+Y+CDTFDENTPP
Sbjct: 297 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGGTSHVYSCDTFDENTPP 356

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P YISSLGAA + GMQSGD DA+WLMQGWLFSYDPFW PPQMKALL+SVP+G+++VL
Sbjct: 357 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFSYDPFWEPPQMKALLHSVPVGRMIVL 416

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL+AEVKPIW  S QFYGVPYIWCMLHNFA + EMYG+LD +A GP++AR S N+TMVGV
Sbjct: 417 DLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEMYGVLDMVASGPIDARLSANSTMVGV 476

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           GMSMEGIEQNP+VYDLMSEMAF H +VD++ W+  Y  RRYG+S+  +QDAW +LY T+Y
Sbjct: 477 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVETYPTRRYGKSMVGLQDAWKILYQTLY 536

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAV-LKSETSSYDHP 355
           NCTDG  DKNRDVIVAFPDV+P +I  T G Y    + Y   +SK  + + +    Y+HP
Sbjct: 537 NCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSSSKTYSTKLSKNYIAVDASNDEYEHP 595

Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
           HLWY T  VIRALELF+  G+E+S SNT+RYDL+DLTRQ LAKYAN++F+ IIE+Y+ N+
Sbjct: 596 HLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVDLTRQTLAKYANQVFVKIIESYKSNN 655

Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
            + V  L + F++LV D+D LLA H+GFLLGPWLESAK LA+++EQE Q
Sbjct: 656 VNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLESAKGLARDKEQEMQ 704


>gi|357166414|ref|XP_003580702.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Brachypodium
           distachyon]
          Length = 829

 Score =  695 bits (1793), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/469 (67%), Positives = 394/469 (84%), Gaps = 6/469 (1%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+HGWGGPLPQ+WLD QL LQKKIL R+Y  GM+PVLPAFSG++PAAL++ FPSAK+T
Sbjct: 238 MANMHGWGGPLPQTWLDDQLTLQKKILSRMYAFGMSPVLPAFSGSIPAALKSKFPSAKVT 297

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYGRTSH+YNCDTFDENTPP
Sbjct: 298 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGRTSHVYNCDTFDENTPP 357

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P YISSLGAA + GMQSGD DA+WLMQGWLF+YDPFW PPQMKALL+SVP+G+++VL
Sbjct: 358 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGRMIVL 417

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL+AEVKP+W  S QFYGVPYIWCMLHNFA + EMYG+LD++A GP++AR SEN+TMVGV
Sbjct: 418 DLYAEVKPVWINSDQFYGVPYIWCMLHNFAADFEMYGVLDAVASGPIDARLSENSTMVGV 477

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           GMSMEGIEQNP+VYDLMSEM F H +VD++ W+  Y  RRYG+S+  +QDAW +L+ T+Y
Sbjct: 478 GMSMEGIEQNPIVYDLMSEMVFHHRQVDLQVWVETYPTRRYGKSIVELQDAWRILHQTLY 537

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS----KEAVLKSETS-SYDHP 355
           NCTDG  DKNRDVIVAFPDV+P +I  T G + +  K  S    K  ++K E++ +Y+ P
Sbjct: 538 NCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLHTSASKMFSTMSAKSYLVKDESNDAYEQP 596

Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
           HLWY T+ VIRAL+LF+  G+E+S S+T+RYDL+DLTRQALAKYAN++F  II++Y+ N+
Sbjct: 597 HLWYDTNVVIRALQLFLQYGDEVSDSSTFRYDLVDLTRQALAKYANQIFAKIIQSYKSNN 656

Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
            + V  LS  FL+LV D+D LLA H+GFLLGPWLESAK LA+++EQE Q
Sbjct: 657 MNQVTTLSECFLDLVNDLDMLLASHEGFLLGPWLESAKGLARDQEQEIQ 705


>gi|218192858|gb|EEC75285.1| hypothetical protein OsI_11626 [Oryza sativa Indica Group]
          Length = 812

 Score =  639 bits (1649), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 298/466 (63%), Positives = 365/466 (78%), Gaps = 2/466 (0%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLDQQL LQKKIL R+ ELGM PVLP+FSGNVP+  + +FPSA IT
Sbjct: 243 MGNLHGWGGPLSQNWLDQQLTLQKKILSRMIELGMVPVLPSFSGNVPSVFKKLFPSANIT 302

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W +V  DPRWCCTYLLD +D LFI++G+AFI QQ+KEYG  ++IYNCDTF+ENTPP
Sbjct: 303 KLGDWNTVDGDPRWCCTYLLDPSDALFIDVGQAFIRQQMKEYGDITNIYNCDTFNENTPP 362

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            + P YISSLG+AIY  M  G+ DAVWLMQGWLF  D  FW+ PQMKALL+SVP GK++V
Sbjct: 363 TNEPAYISSLGSAIYEAMSRGNKDAVWLMQGWLFYSDAAFWKEPQMKALLHSVPTGKMIV 422

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFA+VKPIW  S QFYGVPYIWCMLHNF GNIEMYGILDSIA GP++ARTS N+TMVG
Sbjct: 423 LDLFADVKPIWQMSSQFYGVPYIWCMLHNFGGNIEMYGILDSIASGPIDARTSHNSTMVG 482

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VGM MEGIE NPVVY+LMSEMAF+ +KV+V+ W+  YS RRYG+S   ++ AW +LYHT+
Sbjct: 483 VGMCMEGIEHNPVVYELMSEMAFRSQKVEVEDWLKIYSYRRYGQSNVEVEKAWGILYHTI 542

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS-SYDHPHLW 358
           YNCTDG  D N+D IV FPD+ P+  S    K +   +       + SE S S  HPHLW
Sbjct: 543 YNCTDGIADHNKDYIVQFPDISPNSFSSDVSKRKAISEVKKHRRFVLSEVSASLPHPHLW 602

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
           YST E I+ALELF+ +GN+LS S TYRYDL+DLTRQ+L+K ANE++L+ + AY+  D++G
Sbjct: 603 YSTKEAIKALELFLNAGNDLSKSLTYRYDLVDLTRQSLSKLANEVYLDAMNAYRKKDSNG 662

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
           +   +++FLEL+ D+D LLA  D FLLGPWLE AK LA+ E + KQ
Sbjct: 663 LNFYTKKFLELIVDIDTLLASDDNFLLGPWLEDAKSLARTENERKQ 708


>gi|222624949|gb|EEE59081.1| hypothetical protein OsJ_10898 [Oryza sativa Japonica Group]
          Length = 812

 Score =  638 bits (1646), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 298/466 (63%), Positives = 364/466 (78%), Gaps = 2/466 (0%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLDQQL LQKKIL R+ ELGM PVLP+FSGNVP+  + +FPSA IT
Sbjct: 243 MGNLHGWGGPLSQNWLDQQLTLQKKILSRMIELGMVPVLPSFSGNVPSVFKKLFPSANIT 302

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W +V  DPRWCCTYLLD +D LFI++G+AFI QQ+KEYG  ++IYNCDTF+ENTPP
Sbjct: 303 KLGDWNTVDGDPRWCCTYLLDPSDALFIDVGQAFIRQQMKEYGDITNIYNCDTFNENTPP 362

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            + P YISSLG+AIY  M  G+ DAVWLMQGWLF  D  FW+ PQMKALL+SVP GK++V
Sbjct: 363 TNEPAYISSLGSAIYEAMSRGNKDAVWLMQGWLFYSDAAFWKEPQMKALLHSVPTGKMIV 422

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFA+VKPIW  S QFYGVPYIWCMLHNF GNIEMYGILDSIA GP++ARTS N+TMVG
Sbjct: 423 LDLFADVKPIWQMSSQFYGVPYIWCMLHNFGGNIEMYGILDSIASGPIDARTSHNSTMVG 482

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VGM MEGIE NPVVY+LMSEMAF+ +KV+V+ W+  YS RRYG+S   ++ AW +LYHT+
Sbjct: 483 VGMCMEGIEHNPVVYELMSEMAFRSQKVEVEDWLKIYSYRRYGQSNVEVEKAWGILYHTI 542

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS-SYDHPHLW 358
           YNCTDG  D N D IV FPD+ P+  S    K +   +       + SE S S  HPHLW
Sbjct: 543 YNCTDGIADHNNDYIVEFPDISPNSFSSDVSKRKAISEVKKHRRFVLSEVSASLPHPHLW 602

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
           YST E I+ALELF+ +GN+LS S TYRYDL+DLTRQ+L+K ANE++L+ + AY+  D++G
Sbjct: 603 YSTKEAIKALELFLNAGNDLSKSLTYRYDLVDLTRQSLSKLANEVYLDAMNAYRKKDSNG 662

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
           +   +++FLEL+ D+D LLA  D FLLGPWLE AK LA+ E + KQ
Sbjct: 663 LNFYTKKFLELIVDIDTLLASDDNFLLGPWLEDAKSLARTENERKQ 708


>gi|413955691|gb|AFW88340.1| hypothetical protein ZEAMMB73_315381 [Zea mays]
          Length = 814

 Score =  628 bits (1620), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 292/466 (62%), Positives = 360/466 (77%), Gaps = 2/466 (0%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLDQQL LQKKIL R+ ELGM PVLP+FSGNVPA    +FPSA IT
Sbjct: 244 MGNLHGWGGPLSQNWLDQQLALQKKILSRMIELGMVPVLPSFSGNVPAIFAKLFPSANIT 303

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W +V ++P+WCCTYLLD +D LFI++G+AFI QQ+KEYG  ++IYNCDTF+ENTPP
Sbjct: 304 RLGDWNTVDANPKWCCTYLLDPSDSLFIDVGQAFIRQQIKEYGDVTNIYNCDTFNENTPP 363

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            D P YISSLG+AIY  M  G+ +AVWLMQGWLF  D  FW+ PQMKALL+SVP+GK++V
Sbjct: 364 TDEPAYISSLGSAIYEAMSRGNKNAVWLMQGWLFYSDAAFWKEPQMKALLHSVPIGKMIV 423

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFA+VKPIW  S QFYGVPYIWCMLHNF GNIEMYGILDSI+ GP++ARTS N+TM+G
Sbjct: 424 LDLFADVKPIWKVSSQFYGVPYIWCMLHNFGGNIEMYGILDSISSGPIDARTSYNSTMIG 483

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VGM MEGIE NPVVY+LMSEMAF ++KV+V+ W+  YS RRYG++   I+ AW  LYHT+
Sbjct: 484 VGMCMEGIEHNPVVYELMSEMAFHNKKVEVEDWLKTYSCRRYGQANADIEKAWRYLYHTI 543

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSS-YDHPHLW 358
           YNCTDG  D N+D IV FPD+ PS ++    K +      +      SE S     PHLW
Sbjct: 544 YNCTDGIADHNKDYIVEFPDISPSSVTYQVSKRRGMSITRNHRRFFLSEVSGILPQPHLW 603

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
           YST E ++ALELF+ +G+  S S TYRYDL+DLTRQ L+K ANE++L+ I  YQ  D+HG
Sbjct: 604 YSTKEAVKALELFLDAGSTFSESLTYRYDLVDLTRQCLSKLANEVYLDAISLYQKKDSHG 663

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
           +   +R+FLE++ D+D LLA  D FLLGPWLESAK LA  E++ +Q
Sbjct: 664 LNAHARKFLEIIVDIDTLLAADDNFLLGPWLESAKSLAITEKERQQ 709


>gi|297736304|emb|CBI24942.3| unnamed protein product [Vitis vinifera]
          Length = 868

 Score =  624 bits (1610), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 296/472 (62%), Positives = 363/472 (76%), Gaps = 9/472 (1%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLD+QLVLQK+IL R+ ELGM PVLP+FSGNVP AL+ +FPSA IT
Sbjct: 294 MGNLHGWGGPLSQNWLDEQLVLQKQILCRMLELGMTPVLPSFSGNVPEALKKIFPSANIT 353

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG W +V ++ RWCCTYLLDA+DPLFI+IG+AFI QQ+KEYG  + IYNCDTF+EN+PP
Sbjct: 354 RLGEWNTVDNNTRWCCTYLLDASDPLFIQIGKAFIRQQIKEYGDVTDIYNCDTFNENSPP 413

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            + P YISSLGAAIY  M  GD D+VWLMQGWLF  D  FW+PPQMKALL+SVP GK+VV
Sbjct: 414 TNDPAYISSLGAAIYKAMSQGDKDSVWLMQGWLFYSDSGFWKPPQMKALLHSVPFGKMVV 473

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFA+ KPIW TS QFYG PYIWCMLHNF GNIEMYGILD+++ GPV+AR S+N+TMVG
Sbjct: 474 LDLFADAKPIWRTSSQFYGTPYIWCMLHNFGGNIEMYGILDAVSSGPVDARISKNSTMVG 533

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VGM MEGIEQNPV Y+LMSEMAF+ EKV +  W+  YS RRYG++V  ++ AW +LY T+
Sbjct: 534 VGMCMEGIEQNPVAYELMSEMAFRSEKVQLVEWLKTYSYRRYGKAVHHVEAAWEILYRTI 593

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSI-----ISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
           YNCTDG  D N D +V FPD DPS+     IS  +   Q       +  +L  ETSS D 
Sbjct: 594 YNCTDGIADHNTDFMVNFPDWDPSLNPSSDISKEQHIIQKILTQTGRRKILFQETSS-DL 652

Query: 355 P--HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
           P  HLWYST EV+ AL LF+ +GNELS S+TYRYDL+DLTRQ L+K  N+++L+ + A++
Sbjct: 653 PQSHLWYSTHEVVNALRLFLDAGNELSKSSTYRYDLVDLTRQVLSKLGNQVYLDAVIAFR 712

Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
             DA      S++F++LV+D+D LLA  D FLLG WLESAK+LA N  + +Q
Sbjct: 713 QKDAKNFHLHSQKFVQLVKDIDTLLASDDNFLLGTWLESAKKLAVNPREMEQ 764


>gi|225450036|ref|XP_002273084.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Vitis vinifera]
          Length = 803

 Score =  624 bits (1609), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 296/472 (62%), Positives = 363/472 (76%), Gaps = 9/472 (1%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLD+QLVLQK+IL R+ ELGM PVLP+FSGNVP AL+ +FPSA IT
Sbjct: 229 MGNLHGWGGPLSQNWLDEQLVLQKQILCRMLELGMTPVLPSFSGNVPEALKKIFPSANIT 288

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG W +V ++ RWCCTYLLDA+DPLFI+IG+AFI QQ+KEYG  + IYNCDTF+EN+PP
Sbjct: 289 RLGEWNTVDNNTRWCCTYLLDASDPLFIQIGKAFIRQQIKEYGDVTDIYNCDTFNENSPP 348

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            + P YISSLGAAIY  M  GD D+VWLMQGWLF  D  FW+PPQMKALL+SVP GK+VV
Sbjct: 349 TNDPAYISSLGAAIYKAMSQGDKDSVWLMQGWLFYSDSGFWKPPQMKALLHSVPFGKMVV 408

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFA+ KPIW TS QFYG PYIWCMLHNF GNIEMYGILD+++ GPV+AR S+N+TMVG
Sbjct: 409 LDLFADAKPIWRTSSQFYGTPYIWCMLHNFGGNIEMYGILDAVSSGPVDARISKNSTMVG 468

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VGM MEGIEQNPV Y+LMSEMAF+ EKV +  W+  YS RRYG++V  ++ AW +LY T+
Sbjct: 469 VGMCMEGIEQNPVAYELMSEMAFRSEKVQLVEWLKTYSYRRYGKAVHHVEAAWEILYRTI 528

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSI-----ISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
           YNCTDG  D N D +V FPD DPS+     IS  +   Q       +  +L  ETSS D 
Sbjct: 529 YNCTDGIADHNTDFMVNFPDWDPSLNPSSDISKEQHIIQKILTQTGRRKILFQETSS-DL 587

Query: 355 P--HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
           P  HLWYST EV+ AL LF+ +GNELS S+TYRYDL+DLTRQ L+K  N+++L+ + A++
Sbjct: 588 PQSHLWYSTHEVVNALRLFLDAGNELSKSSTYRYDLVDLTRQVLSKLGNQVYLDAVIAFR 647

Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
             DA      S++F++LV+D+D LLA  D FLLG WLESAK+LA N  + +Q
Sbjct: 648 QKDAKNFHLHSQKFVQLVKDIDTLLASDDNFLLGTWLESAKKLAVNPREMEQ 699


>gi|357112065|ref|XP_003557830.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Brachypodium
           distachyon]
          Length = 809

 Score =  620 bits (1598), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 292/468 (62%), Positives = 368/468 (78%), Gaps = 6/468 (1%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPL Q+WLD QL LQKKIL R+ ELGM PVLP+FSGNVP A + +FPSA IT
Sbjct: 240 MGNLHAWGGPLSQNWLDGQLALQKKILSRMTELGMVPVLPSFSGNVPVAFKKLFPSANIT 299

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG W +V  DPRWCCTY+LD +D LFI++G AFI QQ+KEYG  + IYNCDTF+ENTPP
Sbjct: 300 RLGEWNTVDGDPRWCCTYILDPSDALFIDVGHAFIRQQIKEYGDITSIYNCDTFNENTPP 359

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            + P YISSLG+AIY  M SG+ DAVWLMQGWLF  D  FW+ PQMKALL+SVP+GK++V
Sbjct: 360 TNEPAYISSLGSAIYEAMSSGNKDAVWLMQGWLFYSDAAFWKEPQMKALLHSVPIGKMIV 419

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFA+VKP+W  S QFYGVPYIWCMLHNF GNIEMYGILDSI+ GP++ARTS  +TMVG
Sbjct: 420 LDLFADVKPVWKMSSQFYGVPYIWCMLHNFGGNIEMYGILDSISSGPIDARTSYGSTMVG 479

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VGM+MEGIE NPVV++LMSEM+F+ +KV+V+ W+  YS RRYG+S   I+ AW VLYHT+
Sbjct: 480 VGMTMEGIEHNPVVFELMSEMSFRSQKVEVEDWLKSYSYRRYGQSNVKIEKAWGVLYHTI 539

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA---VLKSETSSYDHPH 356
           YNCTDG  D NRD IV FPD+ PS  S    K +  G P+ ++     L   +++  HPH
Sbjct: 540 YNCTDGIADHNRDYIVEFPDMSPSSFSSHFSKQR--GMPIVRKHPRFFLSEVSANLPHPH 597

Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
           LWYST+E ++ALELF+ +GN+LS S T+RYDL+DLTRQ+L+K AN+++L+ +++Y+  ++
Sbjct: 598 LWYSTNEAVKALELFLNAGNDLSKSLTFRYDLVDLTRQSLSKLANKVYLDAMDSYKNKNS 657

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
            G+   +++FLEL+ D+D LLA  D FLLGPWLESAK LA +EE+ KQ
Sbjct: 658 SGLNFHTKKFLELIVDIDILLASDDNFLLGPWLESAKSLAMSEEERKQ 705


>gi|356534602|ref|XP_003535842.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Glycine max]
          Length = 807

 Score =  619 bits (1597), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 297/470 (63%), Positives = 358/470 (76%), Gaps = 8/470 (1%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLDQQLVLQK+I+ R+ ELGM PVLP+FSGNVPAAL  +FPSAKIT
Sbjct: 230 MGNLHGWGGPLSQNWLDQQLVLQKQIISRMLELGMTPVLPSFSGNVPAALTKIFPSAKIT 289

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W +V  DPRWCCTYLLD +DPLF+EIG AFI +Q+KEYG  + IYNCDTF+EN+PP
Sbjct: 290 RLGDWNTVDGDPRWCCTYLLDPSDPLFVEIGEAFIRKQIKEYGDVTDIYNCDTFNENSPP 349

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            + PEYIS+LGAA+Y G+  GD DAVWLMQGWLF  D  FW+PPQMKALL+SVP GK++V
Sbjct: 350 TNDPEYISNLGAAVYKGISKGDKDAVWLMQGWLFYSDSSFWKPPQMKALLHSVPFGKMIV 409

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFA+VKPIW  S QFYG PYIWCMLHNF GNIEMYG LDSI+ GPV+AR S N+TMVG
Sbjct: 410 LDLFADVKPIWKNSFQFYGTPYIWCMLHNFGGNIEMYGTLDSISSGPVDARVSANSTMVG 469

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VGM MEGIEQNP+VY+LMSEMAF+ +KV V  WI  Y  RRYG+ +  ++ AW +LYHT+
Sbjct: 470 VGMCMEGIEQNPIVYELMSEMAFRDKKVKVSEWIKSYCHRRYGKVIHQVESAWEILYHTI 529

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN---YGKPVSKEAVLKSETSSYDHP- 355
           YNCTDG  D N D IV FPD +PS  SVT G   N   Y  P      L  ET S D P 
Sbjct: 530 YNCTDGIADHNHDFIVMFPDWNPSTNSVT-GTSNNQKIYLLPPGNRRYLFQETLS-DMPQ 587

Query: 356 -HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
            HLWY + +VI+AL+LF+A G  L+ S TYRYDL+DLTRQ L+K AN+++   + +YQ  
Sbjct: 588 AHLWYPSDDVIKALQLFLAGGKNLAGSLTYRYDLVDLTRQVLSKLANQVYHKAVTSYQKK 647

Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
           +   +   S +FL+L++D+D LLA  D FLLG WLESAK+LA N  + KQ
Sbjct: 648 NIEALQFHSNKFLQLIKDIDVLLASDDNFLLGTWLESAKKLAVNPSEIKQ 697


>gi|297807393|ref|XP_002871580.1| alpha-N-acetylglucosaminidase family [Arabidopsis lyrata subsp.
           lyrata]
 gi|297317417|gb|EFH47839.1| alpha-N-acetylglucosaminidase family [Arabidopsis lyrata subsp.
           lyrata]
          Length = 806

 Score =  609 bits (1571), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 291/472 (61%), Positives = 363/472 (76%), Gaps = 9/472 (1%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPL ++WL+ QL+LQK+IL ++ +LGM PVLP+FSGNVP+AL+ ++P A IT
Sbjct: 231 MGNLHTWGGPLSKNWLNDQLILQKQILSQMLKLGMTPVLPSFSGNVPSALRKIYPGANIT 290

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L NW +V  D RWCCTYLL+ +DPLFI+IG AFI+QQ +EYG  ++IYNCDTF+ENTPP
Sbjct: 291 RLDNWNTVDGDSRWCCTYLLNPSDPLFIDIGEAFIKQQPEEYGEITNIYNCDTFNENTPP 350

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              PEYISSLGAA+Y  M  G+ +AVWLMQGWLFS D  FW+PPQMK LL+SVP GK++V
Sbjct: 351 TSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSSDSKFWKPPQMKVLLHSVPFGKMIV 410

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL+AEVKPIW+TS QFYG PYIWCMLHNF GNIEMYG LDSI+ GPV+AR S+N+TMVG
Sbjct: 411 LDLYAEVKPIWNTSAQFYGTPYIWCMLHNFGGNIEMYGALDSISSGPVDARVSKNSTMVG 470

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VGM MEGIEQNPVVY+L+SEMAF+ EKVDV+ W+  Y+ RRY +    I+ AW +LYHTV
Sbjct: 471 VGMCMEGIEQNPVVYELISEMAFRDEKVDVQKWLKSYARRRYMKENHQIEAAWEILYHTV 530

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN-----YGKPVSKEAVLKSETSSYDH 354
           YNCTDG  D N D IV  PD DPS     E K+ +      G   +K  VL  + SS D 
Sbjct: 531 YNCTDGIADHNTDFIVKLPDWDPSSSVQDESKHTDSYMISTGPYETKRRVLFQDKSS-DL 589

Query: 355 P--HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
           P  HLWYST EVI+AL+LF+ +G+ELS S TYRYD++DLTRQ L+K AN++++  + A+ 
Sbjct: 590 PKAHLWYSTKEVIQALKLFLEAGDELSRSLTYRYDMVDLTRQVLSKLANQVYIEAVTAFV 649

Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
             D   + QLS +FLEL++D+D LLA  D FLLG WLESAK+LA+N ++ KQ
Sbjct: 650 KKDIGSLGQLSEKFLELIKDIDVLLASDDNFLLGTWLESAKKLARNGDERKQ 701


>gi|224106113|ref|XP_002314048.1| predicted protein [Populus trichocarpa]
 gi|222850456|gb|EEE88003.1| predicted protein [Populus trichocarpa]
          Length = 806

 Score =  606 bits (1563), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 282/462 (61%), Positives = 354/462 (76%), Gaps = 6/462 (1%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLDQQL LQK+IL R+ ELGM PVLP+FSGNVPAAL+ +FPSA IT
Sbjct: 233 MGNLHGWGGPLSQNWLDQQLCLQKQILSRMLELGMTPVLPSFSGNVPAALKKIFPSANIT 292

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W +V  +PRWCCTYLL+ +DPLF+EIG AFI QQ+KEYG  + IYNCDTF+EN+PP
Sbjct: 293 RLGDWNTVDKNPRWCCTYLLNPSDPLFVEIGEAFIRQQVKEYGDVTDIYNCDTFNENSPP 352

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              P YISSLGAA+Y  M  GD DAVWLMQGWLF  D  FW+PPQM+ALL+SVP GK++V
Sbjct: 353 TSDPAYISSLGAAVYKAMSRGDKDAVWLMQGWLFYSDSAFWKPPQMQALLHSVPFGKMIV 412

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE KPIW  S QFYG PY+WC+LHNF GNIEMYGILD+I+ GPV+AR  EN+TMVG
Sbjct: 413 LDLFAEAKPIWKNSSQFYGTPYVWCLLHNFGGNIEMYGILDAISSGPVDARIIENSTMVG 472

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VGM MEGIE NPVVY+LMSEMAF+  K  V  W+  YS RRYG++V  +  AW++LYHT+
Sbjct: 473 VGMCMEGIEHNPVVYELMSEMAFRSGKPQVLEWLKTYSRRRYGKAVRQVVAAWDILYHTI 532

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV-----SKEAVLKSETSSYDH 354
           YNCTDG  D N D IV FPD DPS+ S +    Q+  + +     ++  + +  +S +  
Sbjct: 533 YNCTDGIADHNTDFIVKFPDWDPSLHSGSNISEQDNMRILLTSSGTRRFLFQETSSDFPE 592

Query: 355 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
            HLWYST EVI+AL LF+ +GN+L+ S TYRYDL+DLTRQ L+K AN+++ + + A++  
Sbjct: 593 AHLWYSTQEVIQALWLFLDAGNDLAGSPTYRYDLVDLTRQVLSKLANQVYRDAMIAFRRK 652

Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
           DA  +    ++FL++++D+D LLA  D FLLG WLESAK+LA
Sbjct: 653 DARALNLHGQKFLQIIKDIDVLLASDDNFLLGTWLESAKKLA 694


>gi|15240689|ref|NP_196873.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
 gi|9758035|dbj|BAB08696.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
 gi|19423948|gb|AAL87291.1| putative alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
 gi|21436231|gb|AAM51254.1| putative alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
 gi|332004545|gb|AED91928.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
          Length = 806

 Score =  605 bits (1560), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 286/472 (60%), Positives = 357/472 (75%), Gaps = 9/472 (1%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPL ++WLD QL+LQK+IL R+ + GM PVLP+FSGNVP+AL+ ++P A IT
Sbjct: 231 MGNLHAWGGPLSKNWLDDQLLLQKQILSRMLKFGMTPVLPSFSGNVPSALRKIYPEANIT 290

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L NW +V  D RWCCTYLL+ +DPLFIEIG AFI+QQ +EYG  ++IYNCDTF+ENTPP
Sbjct: 291 RLDNWNTVDGDSRWCCTYLLNPSDPLFIEIGEAFIKQQTEEYGEITNIYNCDTFNENTPP 350

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              PEYISSLGAA+Y  M  G+ +AVWLMQGWLFS D  FW+PPQ+KALL+SVP GK++V
Sbjct: 351 TSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSSDSKFWKPPQLKALLHSVPFGKMIV 410

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL+AEVKPIW+ S QFYG PYIWCMLHNF GNIEMYG LDSI+ GPV+AR S+N+TMVG
Sbjct: 411 LDLYAEVKPIWNKSAQFYGTPYIWCMLHNFGGNIEMYGALDSISSGPVDARVSKNSTMVG 470

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VGM MEGIEQNPVVY+L SEMAF+ EKVDV+ W+  Y+ RRY +    I+ AW +LYHTV
Sbjct: 471 VGMCMEGIEQNPVVYELTSEMAFRDEKVDVQKWLKSYARRRYMKENHQIEAAWEILYHTV 530

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS-------KEAVLKSETSSY 352
           YNCTDG  D N D IV  PD DPS  SV +   Q     +S       +  + + +T+  
Sbjct: 531 YNCTDGIADHNTDFIVKLPDWDPS-SSVQDDLKQKDSYMISTGPYETKRRVLFQDKTADL 589

Query: 353 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
              HLWYST EVI+AL+LF+ +G++LS S TYRYD++DLTRQ L+K AN+++   + A+ 
Sbjct: 590 PKAHLWYSTKEVIQALKLFLEAGDDLSRSLTYRYDMVDLTRQVLSKLANQVYTEAVTAFV 649

Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
             D   + QLS +FLEL++DMD LLA  D  LLG WLESAK+LA+N ++ KQ
Sbjct: 650 KKDIGSLGQLSEKFLELIKDMDVLLASDDNCLLGTWLESAKKLAKNGDERKQ 701


>gi|357458267|ref|XP_003599414.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
 gi|355488462|gb|AES69665.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
          Length = 832

 Score =  598 bits (1541), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 287/496 (57%), Positives = 354/496 (71%), Gaps = 33/496 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLDQQLVLQK+I+ R+ ELGM PVLP+FSGNVPAAL  +FPSAKIT
Sbjct: 234 MGNLHGWGGPLSQNWLDQQLVLQKQIISRMLELGMTPVLPSFSGNVPAALTKIFPSAKIT 293

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK-------------------- 100
           +LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +Q+K                    
Sbjct: 294 RLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRKQIKATETIHQESEDLGSLIIMDR 353

Query: 101 ------EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 154
                 EYG  + IYNCDTF+EN+PP   P YIS+LGAA+Y G+  GD DAVWLMQGWLF
Sbjct: 354 AVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLGAAVYQGISKGDKDAVWLMQGWLF 413

Query: 155 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 213
             D  FW+PPQMKALL SVP GK++VLDLFA+VKPIW TS QFYG PYIWCMLHNF GNI
Sbjct: 414 YSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIWKTSFQFYGTPYIWCMLHNFGGNI 473

Query: 214 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 273
           EMYG+LD+IA GPV+AR SEN+TMVGVGM MEGIE NP+VY+LMSEMAF+ EKV +  W+
Sbjct: 474 EMYGVLDAIASGPVDARVSENSTMVGVGMCMEGIEHNPIVYELMSEMAFRDEKVKINEWL 533

Query: 274 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 333
             YS RRYG+++  +  AW +LYHT+YN TDG  D N D IV  PD DPS  +V  G   
Sbjct: 534 KSYSHRRYGKAIHEVDAAWEILYHTIYNSTDGIADHNHDYIVMLPDWDPS-AAVKSGMSN 592

Query: 334 NYGK-----PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 388
           +  K     P ++  + +   +     HLWY   +VI+AL+LF+A G  L  S TYRYDL
Sbjct: 593 HQKKIYFLPPGNRRYLFQQTPAGMPQAHLWYPPEDVIKALQLFLAGGKNLKGSLTYRYDL 652

Query: 389 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 448
           +DLTRQ L+K+AN++++  I ++Q  +   +   S  FLEL++D+D LLA  D FLLG W
Sbjct: 653 VDLTRQVLSKFANQVYIKAITSFQKKNIDALQLNSHMFLELIKDIDLLLASDDNFLLGTW 712

Query: 449 LESAKQLAQNEEQEKQ 464
           L+SAK+LA N  + KQ
Sbjct: 713 LQSAKKLAVNPSELKQ 728


>gi|357458271|ref|XP_003599416.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
 gi|355488464|gb|AES69667.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
          Length = 807

 Score =  592 bits (1525), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 285/491 (58%), Positives = 346/491 (70%), Gaps = 48/491 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLDQQLVLQK+I+ R+ ELGM PVLP+FSGNVPAAL  +FPSAKIT
Sbjct: 234 MGNLHGWGGPLSQNWLDQQLVLQKQIISRMLELGMTPVLPSFSGNVPAALTKIFPSAKIT 293

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK-------------------- 100
           +LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +Q+K                    
Sbjct: 294 RLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRKQIKATETIHQESEDLGSLIIMDR 353

Query: 101 ------EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 154
                 EYG  + IYNCDTF+EN+PP   P YIS+LGAA+Y G+  GD DAVWLMQGWLF
Sbjct: 354 AVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLGAAVYQGISKGDKDAVWLMQGWLF 413

Query: 155 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 213
             D  FW+PPQMKALL SVP GK++VLDLFA+VKPIW TS QFYG PYIWCMLHNF GNI
Sbjct: 414 YSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIWKTSFQFYGTPYIWCMLHNFGGNI 473

Query: 214 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 273
           EMYG+LD+IA GPV+AR SEN+TMVGVGM MEGIE NP+VY+LMSEMAF+ EKV +  W+
Sbjct: 474 EMYGVLDAIASGPVDARVSENSTMVGVGMCMEGIEHNPIVYELMSEMAFRDEKVKINEWL 533

Query: 274 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 333
             YS RRYG+++  +  AW +LYHT+YN TDG  D N D IV  PD DPS    + G  Q
Sbjct: 534 KSYSHRRYGKAIHEVDAAWEILYHTIYNSTDGIADHNHDYIVMLPDWDPSAAVKSAGMPQ 593

Query: 334 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 393
                                 HLWY   +VI+AL+LF+A G  L  S TYRYDL+DLTR
Sbjct: 594 ---------------------AHLWYPPEDVIKALQLFLAGGKNLKGSLTYRYDLVDLTR 632

Query: 394 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
           Q L+K+AN++++  I ++Q  +   +   S  FLEL++D+D LLA  D FLLG WL+SAK
Sbjct: 633 QVLSKFANQVYIKAITSFQKKNIDALQLNSHMFLELIKDIDLLLASDDNFLLGTWLQSAK 692

Query: 454 QLAQNEEQEKQ 464
           +LA N  + KQ
Sbjct: 693 KLAVNPSELKQ 703


>gi|449436325|ref|XP_004135943.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis sativus]
          Length = 774

 Score =  577 bits (1486), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 278/465 (59%), Positives = 339/465 (72%), Gaps = 24/465 (5%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL ++WLDQQL LQK+IL R+ ELGM PVLP+FSGNVPA L  +FPSA IT
Sbjct: 229 MGNLHGWGGPLSKNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAGLVEIFPSANIT 288

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LGNW S+ +DP  CCTYLL+ +DPLF++IG AFI QQ+KEYG  ++IY+CDTF+ENTPP
Sbjct: 289 KLGNWNSIDADPSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTNIYSCDTFNENTPP 348

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            +   YISSLGA++Y  M   D DAVWLMQGWLF  D  FW+P QMKALL+SVP GK++V
Sbjct: 349 TNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSDFWKPDQMKALLHSVPFGKMIV 408

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFA+VKPIW +S QFYG PY+WCMLHNF GNIEMYGILD+I+ GPV+A  SEN+TMVG
Sbjct: 409 LDLFADVKPIWKSSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVG 468

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VGM MEGIE NPVVY+LMSEMAF+ +KV V+ W+  YS  RYG++   +  AWN+LYHT+
Sbjct: 469 VGMCMEGIEHNPVVYELMSEMAFRSKKVQVQEWLKTYSRCRYGKADHYVDAAWNILYHTI 528

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           YNCTDG  + N D IV  PD DPS  S  + K     KP                PHLWY
Sbjct: 529 YNCTDGIANHNTDFIVKLPDWDPS--STFDLK-----KP----------------PHLWY 565

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
           ST EVI AL+L +   + L  S TYRYDL+DLTRQ L K ANE +L  + A++  +    
Sbjct: 566 STQEVINALQLLVNVDDNLVHSATYRYDLVDLTRQVLGKLANEEYLKAVTAFRRQNVKAQ 625

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
              S+RF++L+ D+D LLA +  FLLG WLESAK+LA N  + KQ
Sbjct: 626 NLHSKRFIQLIRDIDKLLASNSNFLLGTWLESAKKLATNPAEMKQ 670


>gi|449489156|ref|XP_004158231.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
           [Cucumis sativus]
          Length = 567

 Score =  567 bits (1461), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 274/462 (59%), Positives = 335/462 (72%), Gaps = 24/462 (5%)

Query: 4   LHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLG 63
           L  WGGPL ++WLDQQL LQK+IL R+ ELGM PVLP+FSGNVPA L  +FPSA IT+LG
Sbjct: 25  LKEWGGPLSKNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAGLVEIFPSANITKLG 84

Query: 64  NWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDS 123
           NW S+ +DP  CCTYLL+ +DPLF++IG AFI QQ+KEYG  ++IY+CDTF+ENTPP + 
Sbjct: 85  NWNSIDADPSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTNIYSCDTFNENTPPTND 144

Query: 124 PEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDL 182
             YISSLGA++Y  M   D DAVWLMQGWLF  D  FW+P QMKALL+SVP GK++VLDL
Sbjct: 145 TSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSDFWKPDQMKALLHSVPFGKMIVLDL 204

Query: 183 FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGM 242
           FA+VKPIW +S QFYG PY+WCMLHNF GNIEMYGILD+I+ GPV+A  SEN+TMVGVGM
Sbjct: 205 FADVKPIWKSSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGM 264

Query: 243 SMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNC 302
            MEGIE NPVVY+LMSEMAF+ +KV V+ W+  YS  RYG++   +  AWN+LYHT+YNC
Sbjct: 265 CMEGIEHNPVVYELMSEMAFRXQKVQVQEWLKTYSRCRYGKADHYVDAAWNILYHTIYNC 324

Query: 303 TDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTS 362
           TDG  + N D IV  PD DPS  S  + K     KP                PHLWYST 
Sbjct: 325 TDGIANHNTDFIVKLPDWDPS--STFDLK-----KP----------------PHLWYSTQ 361

Query: 363 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 422
           EVI AL+L +   + L  S TYRYDL+DLTRQ L K ANE +L  + A++  +       
Sbjct: 362 EVINALQLLVNVDDNLVHSATYRYDLVDLTRQVLGKLANEEYLKAVTAFRRQNVKAQNLH 421

Query: 423 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
           S+RF++L+ D+D LLA +  FLLG WLESAK+LA N  + KQ
Sbjct: 422 SKRFIQLIRDIDKLLASNSNFLLGTWLESAKKLATNPAEMKQ 463


>gi|242035709|ref|XP_002465249.1| hypothetical protein SORBIDRAFT_01g034960 [Sorghum bicolor]
 gi|241919103|gb|EER92247.1| hypothetical protein SORBIDRAFT_01g034960 [Sorghum bicolor]
          Length = 777

 Score =  557 bits (1436), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 273/466 (58%), Positives = 337/466 (72%), Gaps = 39/466 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLDQQL LQKK+L R+ ELGM PVLP+FSGNVPA    +FPSA IT
Sbjct: 244 MGNLHGWGGPLSQNWLDQQLALQKKVLSRMIELGMVPVLPSFSGNVPAVFAKLFPSANIT 303

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG+W +V ++P+WCCTYLLD +D LFI++G+AFI QQ+KEYG  ++IYNCDTF+ENTPP
Sbjct: 304 LLGDWNTVDANPKWCCTYLLDPSDSLFIDVGQAFIRQQIKEYGDVTNIYNCDTFNENTPP 363

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
            D P YISSLG+AIY  M  G+ +AVWLMQGWLF  D  FW+ PQMKALL+SVP+GK++V
Sbjct: 364 TDEPAYISSLGSAIYEAMSRGNKNAVWLMQGWLFYSDAAFWKEPQMKALLHSVPIGKMIV 423

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFA+VKPIW  S QFYGVPYIWCMLHNF GNIEMYG+LDSI+ GP++ARTS N+TM+G
Sbjct: 424 LDLFADVKPIWKMSSQFYGVPYIWCMLHNFGGNIEMYGVLDSISSGPIDARTSYNSTMIG 483

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VGM MEGIE NPVVY+LMSEMAF ++KV+V+                             
Sbjct: 484 VGMCMEGIEHNPVVYELMSEMAFHNKKVEVE----------------------------- 514

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS-SYDHPHLW 358
                   D N+D IV FPD+ PS IS    K +      +      SE S S  HPHLW
Sbjct: 515 --------DHNKDYIVEFPDISPSSISSQLSKRRGMSIMRNHRRFFLSEVSGSLPHPHLW 566

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
           YST E I+ALELF+ +G+  S S TYRYDL+DLTRQ L+K ANE++L+ + +YQ  D++G
Sbjct: 567 YSTKEAIKALELFLDAGSTFSKSLTYRYDLVDLTRQCLSKLANEVYLDAMSSYQKKDSNG 626

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
           +   +R+FLE++ D+D LLA  D FLLGPWLESAK LA  E++ +Q
Sbjct: 627 LNSHTRKFLEIIMDIDTLLAADDNFLLGPWLESAKSLAITEKERQQ 672


>gi|4160292|emb|CAA77084.1| alpha-N-acetylglucosaminidase [Nicotiana tabacum]
          Length = 811

 Score =  557 bits (1435), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 271/477 (56%), Positives = 347/477 (72%), Gaps = 13/477 (2%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPL Q+WL+ QL LQK+IL R+ ELGM PVLP+FSGNVPAAL+ +FPSA IT
Sbjct: 231 MGNLHAWGGPLSQNWLNIQLALQKQILSRMRELGMTPVLPSFSGNVPAALKKIFPSANIT 290

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W +V  DPRWCCT+LL  +DPLFIEIG AFI +Q++EYG  + IYNCDTF+ENTPP
Sbjct: 291 RLGDWNTVNGDPRWCCTFLLAPSDPLFIEIGEAFIRKQIEEYGDITDIYNCDTFNENTPP 350

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWL-MQGWLFSYD-PFWRPPQMKALLNSVPLGKLV 178
            D P YI           Q   +   WL  + WLF  D  +W+ PQM+ALL+SVP GK++
Sbjct: 351 TDDPTYIHLSALLCTKQCQKQITMRCWLNARVWLFYSDSKYWKSPQMEALLHSVPRGKMI 410

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
           VLDLFA+VKPIW +S QFYG PYIWCMLHNF GNIEMYG+LD++A GP++ARTSEN+TMV
Sbjct: 411 VLDLFADVKPIWKSSSQFYGTPYIWCMLHNFGGNIEMYGVLDAVASGPIDARTSENSTMV 470

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           GVGM MEGIE NPVVY+LMSEMAF+ +   ++ W+  YS RRYG+    IQ AW++LYHT
Sbjct: 471 GVGMCMEGIEHNPVVYELMSEMAFREDNFQLQGWLKSYSHRRYGKVNDQIQAAWDILYHT 530

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPS-----IISVTEGKYQNYGKPVS-----KEAVLKSE 348
           +YNCTDG  D N+D IV FPD DPS      IS T+   QN  + ++     +  +   +
Sbjct: 531 IYNCTDGIADHNKDYIVEFPDWDPSGKTGTDISGTDSSSQNRMQKLAGFQWNRRFLFFEK 590

Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
           +SS   P LWYST +V +AL+LFI +  +LS S TYRYDL+DL+RQ+L+K AN+++L+ I
Sbjct: 591 SSSLPKPRLWYSTEDVFQALQLFIDALKKLSGSLTYRYDLVDLSRQSLSKLANQVYLDAI 650

Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ-LAQNEEQEKQ 464
            A++  DA  + Q S +FL L++D+D LLA  D FLLG WLE+  Q LA N +++KQ
Sbjct: 651 SAFRREDAKPLNQHSPKFLPLLQDIDRLLAADDNFLLGTWLENCPQNLAMNSDEKKQ 707


>gi|168060822|ref|XP_001782392.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666123|gb|EDQ52786.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 801

 Score =  551 bits (1419), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 259/464 (55%), Positives = 331/464 (71%), Gaps = 17/464 (3%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  WGGPLPQ WLDQQL LQ KIL R+ ELGM PVLPAF+GNVPAA+   +PSA++T
Sbjct: 236 MGNLKRWGGPLPQKWLDQQLQLQIKILARMRELGMTPVLPAFAGNVPAAITKKYPSARVT 295

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG W +V  D R+CCT+LLD  DPLF++IG+AFI QQ+KEYG T HIYNCDTF+EN PP
Sbjct: 296 RLGEWNTVNGDTRYCCTFLLDPKDPLFVDIGKAFILQQIKEYGGTQHIYNCDTFNENQPP 355

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P YIS+LG+ +Y  M + D DA+WLMQ    +YD FW+PPQMKALL+SVP+G++VVL
Sbjct: 356 TDDPSYISALGSIVYEAMSAADQDAIWLMQ----AYDKFWKPPQMKALLHSVPVGRMVVL 411

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DLFA+VKP+WS S  FYGVPYIWCMLHNF GN+EMYG LD +A  P++A TS N+TMVGV
Sbjct: 412 DLFADVKPMWSRSDHFYGVPYIWCMLHNFGGNVEMYGRLDVVATAPIQAVTSSNSTMVGV 471

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           GM MEGIEQNPVVYDLM+EMAF +  V V+ WI +Y+ RRYG      + AW +L+ ++Y
Sbjct: 472 GMCMEGIEQNPVVYDLMAEMAFHNATVVVEDWIEEYARRRYGELTAGARIAWKMLHESIY 531

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP-HLWY 359
           NC+DG  D N DVIV FPD+DP          Q+ G+ +              HP H+WY
Sbjct: 532 NCSDGIADHNGDVIVEFPDIDPKRSLFQIRPRQSLGQQI------------LGHPQHIWY 579

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
           S  +   AL+  ++S + L  S  YRYD++DLTRQ L+K AN+L   +++ +++ +   +
Sbjct: 580 SPQDAAVALQYLLSSADALGLSKPYRYDVVDLTRQVLSKLANQLHSQVLDQFRMFNVEKM 639

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
             +S R LEL+ DMD LL   + FLLG WLESAK LA ++E+ K
Sbjct: 640 DNISSRLLELLSDMDDLLGASEEFLLGTWLESAKDLATSDEERK 683


>gi|302786446|ref|XP_002974994.1| hypothetical protein SELMODRAFT_102402 [Selaginella moellendorffii]
 gi|300157153|gb|EFJ23779.1| hypothetical protein SELMODRAFT_102402 [Selaginella moellendorffii]
          Length = 761

 Score =  550 bits (1418), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 259/463 (55%), Positives = 335/463 (72%), Gaps = 4/463 (0%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPLP+ WL+ QL+LQKKIL  +  LGM  VLPAFSGNVP AL+ ++PSA IT
Sbjct: 192 MGNLHGWGGPLPEKWLELQLILQKKILHHMRSLGMIAVLPAFSGNVPRALKILYPSANIT 251

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L +W +V  +P+WCCTYLL   DPLFI+IG+AFIEQQ+KEYG T H+YNCDTF+EN PP
Sbjct: 252 RLPDWNTVDGNPQWCCTYLLQPMDPLFIQIGKAFIEQQVKEYGSTQHVYNCDTFNENLPP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            D P YIS+L A++Y  M   D  A+WLMQGWLFS D  FW+PPQMKALL++VP GK++V
Sbjct: 312 TDDPSYISALAASVYGAMIVADKQAIWLMQGWLFSSDAQFWKPPQMKALLHAVPFGKMIV 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAEV+PIWS S  FYGVPYIWCMLHNF GN EMYG LD ++ GPV+A+TS N+TM+G
Sbjct: 372 LDLFAEVRPIWSKSSHFYGVPYIWCMLHNFGGNHEMYGRLDVVSSGPVDAKTSANSTMIG 431

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VGM MEGIEQNPVVY+LM+EMAF+  +  +K W+N YS RRYG++VP   +AW +L HT+
Sbjct: 432 VGMCMEGIEQNPVVYELMAEMAFRSTRNALKDWVNDYSTRRYGKAVPEALEAWQILSHTL 491

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           YNC+DG  D N DVIV FPD++ S ++ T  +Y           +L    +S+   HLWY
Sbjct: 492 YNCSDGLQDHNTDVIVKFPDLNASSLT-TLSRYLAEEAGTQTRRLLTEGLTSFG--HLWY 548

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
             +E   AL   + + + LS   TYRYDL+DLTRQ L K AN++ L  + ++   D   +
Sbjct: 549 RPTEAKVALSYMLNASSSLSNVATYRYDLVDLTRQVLMKLANQIHLQALVSFVKGDLEEL 608

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
            +     + +++D + LL  ++GFLLGPWLESAK+L  N +++
Sbjct: 609 TKNCDILIGIIKDSELLLRSNNGFLLGPWLESAKKLGTNSDEK 651


>gi|302791289|ref|XP_002977411.1| hypothetical protein SELMODRAFT_107285 [Selaginella moellendorffii]
 gi|300154781|gb|EFJ21415.1| hypothetical protein SELMODRAFT_107285 [Selaginella moellendorffii]
          Length = 761

 Score =  549 bits (1414), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 257/462 (55%), Positives = 333/462 (72%), Gaps = 4/462 (0%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPLP+ WL+ QL+LQKKIL  +  LGM  VLPAFSGNVP AL+ ++PSA IT
Sbjct: 192 MGNLHGWGGPLPEKWLELQLILQKKILHHMRSLGMIAVLPAFSGNVPRALKILYPSANIT 251

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L +W +V  +P+WCCTYLL   DPLFI+IG+AFIEQQ+KEYG T H+YNCDTF+EN PP
Sbjct: 252 RLPDWNTVDGNPQWCCTYLLQPMDPLFIQIGKAFIEQQVKEYGSTQHVYNCDTFNENLPP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            D P YIS+L A++Y  M   D  A+WLMQGWLFS D  FW+PPQMKALL++VP GK++V
Sbjct: 312 TDDPSYISALAASVYGAMIVADKQAIWLMQGWLFSSDAQFWKPPQMKALLHAVPFGKMIV 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAEV+PIWS S  FYGVPYIWCMLHNF GN EMYG LD ++ GPV+A+TS N+TM+G
Sbjct: 372 LDLFAEVRPIWSKSSHFYGVPYIWCMLHNFGGNHEMYGRLDVVSSGPVDAKTSANSTMIG 431

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VGM MEGIEQNPVVY+LM+EMAF+  +  +K W++ YS RRYG++VP   +AW +L HT+
Sbjct: 432 VGMCMEGIEQNPVVYELMAEMAFRSTRNALKDWVDDYSTRRYGKAVPEALEAWQILSHTL 491

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           YNC+DG  D N DVIV FPD++ S ++         G   ++  + +  TS     HLWY
Sbjct: 492 YNCSDGLQDHNTDVIVKFPDLNASSLTTLSRYLAEEGGTQTRRLLTEGLTS---FGHLWY 548

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
             +E   AL   + + + LS   TYRYDL+DLTRQ L K AN++ L  + ++   D   +
Sbjct: 549 RPTEAKVALSYMLNASSSLSNVATYRYDLVDLTRQVLMKLANQIHLQALVSFVKGDLEEL 608

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            +     + +++D + LL  ++GFLLGPWLESAK+L  N ++
Sbjct: 609 TKNCDILIGIIKDSELLLRSNNGFLLGPWLESAKKLGTNSDE 650


>gi|357458269|ref|XP_003599415.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
 gi|355488463|gb|AES69666.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
          Length = 539

 Score =  425 bits (1092), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 199/296 (67%), Positives = 231/296 (78%), Gaps = 27/296 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPL Q+WLDQQLVLQK+I+ R+ ELGM PVLP+FSGNVPAAL  +FPSAKIT
Sbjct: 234 MGNLHGWGGPLSQNWLDQQLVLQKQIISRMLELGMTPVLPSFSGNVPAALTKIFPSAKIT 293

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK-------------------- 100
           +LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +Q+K                    
Sbjct: 294 RLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRKQIKATETIHQESEDLGSLIIMDR 353

Query: 101 ------EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 154
                 EYG  + IYNCDTF+EN+PP   P YIS+LGAA+Y G+  GD DAVWLMQGWLF
Sbjct: 354 AVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLGAAVYQGISKGDKDAVWLMQGWLF 413

Query: 155 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 213
             D  FW+PPQMKALL SVP GK++VLDLFA+VKPIW TS QFYG PYIWCMLHNF GNI
Sbjct: 414 YSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIWKTSFQFYGTPYIWCMLHNFGGNI 473

Query: 214 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDV 269
           EMYG+LD+IA GPV+AR SEN+TMVGVGM MEGIE NP+VY+LMSEMAF+ EKV +
Sbjct: 474 EMYGVLDAIASGPVDARVSENSTMVGVGMCMEGIEHNPIVYELMSEMAFRDEKVKI 529


>gi|156399499|ref|XP_001638539.1| predicted protein [Nematostella vectensis]
 gi|156225660|gb|EDO46476.1| predicted protein [Nematostella vectensis]
          Length = 675

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 203/459 (44%), Positives = 281/459 (61%), Gaps = 37/459 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+HGWGGPLP +W   +L LQ KIL  +   GM PVLP F+G+VPA L  ++P A ++
Sbjct: 142 MGNMHGWGGPLPSTWYGMKLNLQHKILAAMRNFGMTPVLPGFAGHVPAGLLRLYPKANVS 201

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W +  S   +CCTYLL+ +DPLF +IG AFI++Q  EYG T+HIYN DTF+E  P 
Sbjct: 202 KLGDWGNFNST--YCCTYLLEPSDPLFQKIGTAFIKEQTAEYG-TNHIYNADTFNEMRPR 258

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
              P Y+ +  +A+Y GM  GD DAVWLMQGWLF  + FW+P Q+KALL+ VP G ++VL
Sbjct: 259 SSDPTYLGAASSAVYRGMAGGDPDAVWLMQGWLFVDEGFWKPDQIKALLHGVPQGFMIVL 318

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL+AE  PIWS ++ FYG P+IWCML NF GNI ++G + S++ GP +A  S N+TM+G 
Sbjct: 319 DLWAENSPIWSRTQSFYGTPFIWCMLLNFGGNIGLFGNIKSVSTGPPKAFQSFNSTMIGT 378

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHE---KVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
           G++MEGIEQN ++++LM+EM ++ E    VD+  WI  Y++RRYG + PAI  AW +L  
Sbjct: 379 GLTMEGIEQNDMMFELMNEMGYRLEPLNPVDLDNWIKDYALRRYGGTNPAIIQAWRLLIR 438

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           +VY C     D    + V  P +D                               + P+L
Sbjct: 439 SVYQCNGYCADHIHSIFVWKPSLD-------------------------------NKPNL 467

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
           WY   +V  A +   ++  E     T+RYDL+D+TRQAL      ++ ++I AY+   A 
Sbjct: 468 WYDPEDVFNAWDELRSTAAEFMHVETFRYDLVDVTRQALHLRVIPIYNDLISAYKNRSAL 527

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
            V     R LE+ +D+D LL  +  FLLG WL SAK L 
Sbjct: 528 NVIHFGSRLLEMFDDLDSLLQTNRNFLLGRWLNSAKALG 566


>gi|384247107|gb|EIE20595.1| hypothetical protein COCSUDRAFT_37819 [Coccomyxa subellipsoidea
           C-169]
          Length = 762

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 210/467 (44%), Positives = 296/467 (63%), Gaps = 30/467 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL G+GGPLPQS++D Q  LQ+KI+ R+ ELGM+PV PAF+G VP AL    P+A+I+
Sbjct: 199 MGNLRGYGGPLPQSYIDDQAELQRKIVRRMRELGMSPVFPAFAGFVPGALARERPAARIS 258

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTS-HIYNCDTFDENTP 119
           +  NW S  +  R+CC +LLD  +PLF EIG AF++   +EYG      Y+ DTF+E TP
Sbjct: 259 RSDNWCSFPA--RYCCVHLLDPLEPLFQEIGSAFVKVLREEYGSDEVGFYSADTFNEMTP 316

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD--PFWRPPQMKALLNSVPLGKL 177
           P   P Y++S+ +AIY+ M + D  A WLMQ WLF YD   FW+PPQ++AL++ VP   L
Sbjct: 317 PSSDPAYLTSVTSAIYNAMAAADPSARWLMQAWLF-YDNQKFWQPPQIQALVSGVPRDAL 375

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LDL+AEV P+W ++K F+G P+I+CMLHNF GNIEMYG L+++A GP E +      +
Sbjct: 376 IMLDLYAEVFPLWKSTKSFFGAPFIYCMLHNFGGNIEMYGALEAVARGPAEGQIDGVAGL 435

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQ-DAWNVLY 296
           +G+GM  EGIEQNPVVY+LMSE AF+ + V+V+ WI  Y+ RRYG S P     AW++L 
Sbjct: 436 IGIGMCPEGIEQNPVVYELMSEWAFRRQPVEVEGWIEAYARRRYGNSTPPTALVAWDLLL 495

Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
            +VYN TDG TD +RD+  + P + P+ + +   K                       PH
Sbjct: 496 RSVYNATDGHTDHSRDIPTSRPGLSPAEVGLWGLK-----------------------PH 532

Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
           LWY+  +V+ A  L + S  EL     YRYDL+D+ RQ ++K A +++  + EAY    +
Sbjct: 533 LWYNEQQVVDAWGLLLRSAGELQQVEGYRYDLVDVGRQVISKRATDIWKAVAEAYVDGRS 592

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
             V +   R L+L++D++ LLA + GFLLGP LE A      E + +
Sbjct: 593 IVVRREGARLLQLLDDLEELLATNRGFLLGPKLEEASSAGHTEAEAR 639


>gi|326521470|dbj|BAK00311.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 428

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 193/317 (60%), Positives = 237/317 (74%), Gaps = 1/317 (0%)

Query: 149 MQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLH 207
           +QGWLF  D  FW+  QMKALL+SVP+GK++VLDLFA+VKPIW TS QFYGVPYIWCMLH
Sbjct: 8   VQGWLFYSDAVFWKESQMKALLHSVPIGKMMVLDLFADVKPIWQTSSQFYGVPYIWCMLH 67

Query: 208 NFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 267
           NF GNIEMYG+LDSI+ GPV+ARTS N+TMVGVGM MEGIE NPVVY+LMSEMAF+ +KV
Sbjct: 68  NFGGNIEMYGVLDSISSGPVDARTSYNSTMVGVGMCMEGIEHNPVVYELMSEMAFRSQKV 127

Query: 268 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 327
            V+ W+  YS RRYG+S   IQ AW +LYHT+YNCTDG  D N+D IV FPD+ PS  S 
Sbjct: 128 KVEDWLKTYSHRRYGQSNVEIQKAWGILYHTIYNCTDGIADHNKDYIVEFPDMSPSSFSS 187

Query: 328 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 387
              K             L   ++S   PHLWYST E I++LELF+ +GN+LS S TYRYD
Sbjct: 188 QYSKRSISLARKHPRFFLSEVSASLPQPHLWYSTEEAIKSLELFLNAGNDLSKSLTYRYD 247

Query: 388 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 447
           L+DLTRQ+L+K AN+++ + I +YQ  D+ G+   ++ FLEL+ D+D LLA  D FLLGP
Sbjct: 248 LVDLTRQSLSKLANKVYHDAISSYQKRDSSGLNFHTKEFLELIVDIDTLLASDDNFLLGP 307

Query: 448 WLESAKQLAQNEEQEKQ 464
           WLESAK LA  E++ KQ
Sbjct: 308 WLESAKSLAMTEDERKQ 324


>gi|14861378|gb|AAK73654.1| lysosomal alpha-N-acetyl glucosaminidase [Dromaius novaehollandiae]
          Length = 753

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 199/471 (42%), Positives = 289/471 (61%), Gaps = 46/471 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGW GPLP++W  +QL +Q ++L R+  LGM  VLPAF+G+VP  +   FP    T
Sbjct: 212 MGNLHGWAGPLPRAWHLKQLYVQYRVLERMRSLGMITVLPAFAGHVPQGVLRAFPRVNAT 271

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG W     D  + CTYLLD  DP+F  IG  F+++ +KE+G T HIY+ DTF+E  P 
Sbjct: 272 RLGGWSHF--DCTYSCTYLLDPEDPMFQVIGTLFLKELIKEFG-TDHIYSADTFNEMNPL 328

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+S + +A++  M   D  AVWLMQGWLF + P FW+P Q++ALL+ VPLG+++V
Sbjct: 329 SSDPAYLSRVSSAVFRSMTGADPKAVWLMQGWLFQHQPDFWQPAQVRALLHGVPLGRMIV 388

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++  ++ FYG P+IWCMLHNF GN  ++G +++I  GP  AR   N+TMVG
Sbjct: 389 LDLFAESRPVYQWTESFYGQPFIWCMLHNFGGNHGLFGTVEAINHGPFAARRFPNSTMVG 448

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            G+  EGIEQN +VY+LM+E+ ++ E +D+ +W+ +Y+ RRYG    A   AW +L  +V
Sbjct: 449 TGLVPEGIEQNDMVYELMNELGWRQEPLDLPSWVARYAERRYGAPNAAAASAWQLLLRSV 508

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           YNCT    + NR  +V  P                          L+ +T       +WY
Sbjct: 509 YNCTGVCVNHNRSPLVRRPS-------------------------LRMDT------EVWY 537

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ------L 413
           + S+V  A  L +++G EL +S T+ YDL D+TRQA  +  +E +L+I +A+Q      L
Sbjct: 538 NKSDVYEAWRLLLSAGAELGSSPTFGYDLADVTRQAAQQLVSEYYLSIRQAFQSRSLPEL 597

Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
             A GV        +L+ ++DGLL+ H  FLLG WLESA+ +A ++ + +Q
Sbjct: 598 LTAGGVL-----VYDLLPELDGLLSSHRLFLLGRWLESARAVATSDREAEQ 643


>gi|390348210|ref|XP_785272.3| PREDICTED: alpha-N-acetylglucosaminidase [Strongylocentrotus
           purpuratus]
          Length = 793

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 188/465 (40%), Positives = 271/465 (58%), Gaps = 36/465 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPLPQSW   QL LQ +IL R+ +LGM PVLPAF+G+VP +   VFP+A I+
Sbjct: 220 MGNIDGWGGPLPQSWHTNQLALQHQILKRMRDLGMIPVLPAFAGHVPXSFSKVFPNASIS 279

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG+W   +  P +CCT LLD  DP+F ++G+AFI+   +E+  T HIY+ DTF+EN P 
Sbjct: 280 NLGDW--GRFGPEYCCTSLLDPQDPMFKQVGKAFIDAMSEEFNGTDHIYSADTFNENKPK 337

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
                Y+S+    +Y G+  GD   VWLM GWLF    FW P Q+KALL  VP+G+++VL
Sbjct: 338 SRDSAYLSAASKGVYQGIIEGDPKGVWLMMGWLFQDTGFWGPTQIKALLQGVPIGRMIVL 397

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL+AE +P + T+  FYG P+IWCMLHNF GN  +YG LD++  GP EAR  +N+TM+G+
Sbjct: 398 DLYAEARPFYKTTYSFYGQPFIWCMLHNFGGNTGLYGKLDAVNQGPFEARNYDNSTMIGM 457

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG---RSVPAIQDAWNVLYH 297
           G + EGI QN V+Y+ +++M ++    +V  WI QY+ RRY           +AW +L  
Sbjct: 458 GTTPEGIFQNYVMYNFLTDMTWRSGSTNVSKWIEQYAGRRYSNDPNKSEEATEAWVILKE 517

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           TVYN T    D                          Y  PV + + + +         +
Sbjct: 518 TVYNNTGTLQD------------------------HQYAVPVRRPSNIMTSP-------V 546

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
           WY  ++V +A E  + +  +L  S  +RYDL+D+TR  L   A +    ++ ++++ +A 
Sbjct: 547 WYDYTKVAKAWEFLLEASTKLGTSPVFRYDLVDVTRNVLQDLAFDFQQKLMVSFRIRNAG 606

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
            V         L+ DMD + + H+ +LLG WLE AK LA N ++E
Sbjct: 607 AVGGNGTLLCNLILDMDNITSSHEDWLLGTWLEDAKSLATNNDEE 651


>gi|14861380|gb|AAK73655.1| lysosomal alpha-N-acetyl glucosaminidase [Dromaius novaehollandiae]
          Length = 753

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 199/471 (42%), Positives = 289/471 (61%), Gaps = 46/471 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGW GPLP++W  +QL +Q ++L R+  LGM  VLPAF+G+VP  +   FP    T
Sbjct: 212 MGNLHGWAGPLPRAWHLKQLYVQYRVLERMRSLGMITVLPAFAGHVPQGVLRAFPRVNAT 271

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG W     D  + CTYLLD  DP+F  IG  F+++ +KE+G T HIY+ DTF+E  P 
Sbjct: 272 RLGGWSHF--DCTYSCTYLLDPEDPMFQVIGTLFLKELIKEFG-TDHIYSADTFNEMNPL 328

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+S + +A++  M   D  AVWLMQGWLF + P FW+P Q++ALL+ VPLG+++V
Sbjct: 329 SSDPAYLSRVSSAVFRSMTGADPKAVWLMQGWLFQHQPDFWQPAQVRALLHGVPLGRMIV 388

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++  ++ FYG P+IWCMLHNF GN  ++G +++I  GP  AR   N+TMVG
Sbjct: 389 LDLFAESRPVYQWTESFYGQPFIWCMLHNFGGNHGLFGTVEAINHGPFAARRFPNSTMVG 448

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            G+  EGIEQN +VY+LM+E+ ++ E +D+ +W+ +Y+ RRYG    A   AW +L  +V
Sbjct: 449 TGLVPEGIEQNDMVYELMNELGWRQEPLDLPSWVARYAERRYGAPNAAAASAWXLLLRSV 508

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           YNCT    + NR  +V  P                          L+ +T       +WY
Sbjct: 509 YNCTGVCVNHNRSPLVRRPS-------------------------LRMDT------EVWY 537

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ------L 413
           + S+V  A  L +++G EL +S T+ YDL D+TRQA  +  +E +L+I +A+Q      L
Sbjct: 538 NKSDVYEAWRLLLSAGAELGSSPTFGYDLADVTRQAAQQLVSEYYLSIRQAFQSRSLPEL 597

Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
             A GV        +L+ ++DGLL+ H  FLLG WLESA+ +A ++ + +Q
Sbjct: 598 LTAGGVL-----VYDLLPELDGLLSSHRLFLLGRWLESARAVATSDREAEQ 643


>gi|443691318|gb|ELT93213.1| hypothetical protein CAPTEDRAFT_144379, partial [Capitella teleta]
          Length = 718

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 192/462 (41%), Positives = 277/462 (59%), Gaps = 36/462 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPL  +W  QQ++LQ +IL R+ +LGM P LPAF+G+VPA +  +FP  K++
Sbjct: 188 MGNMRGWGGPLSTNWHHQQILLQHRILKRMRDLGMTPALPAFAGHVPANITRLFPRVKVS 247

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W    S   +CCT LLD  DPLF EIG+AFI++  +E+G T H+YN DTF+E TP 
Sbjct: 248 KLGDWGRFNST--YCCTTLLDVEDPLFKEIGKAFIDEYTREFG-TDHVYNTDTFNEMTPA 304

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
              P Y++  G A+YSGM S DS A+WLMQGWLF  D FW+PPQ KALL SVP GK++VL
Sbjct: 305 SSDPSYLTKAGQAVYSGMVSSDSKAIWLMQGWLFLSD-FWKPPQAKALLTSVPQGKMLVL 363

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL++EV P +   + +YG P+IWCMLHNF G + MYG ++S+  GP E R+  N+TMVG+
Sbjct: 364 DLYSEVNPQYPRLQSYYGQPFIWCMLHNFGGTLPMYGAIESVNQGPFEGRSFVNSTMVGI 423

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ EGI QN V+Y+ M E +F+ + V++  W ++Y+ RRY       + AW +   TVY
Sbjct: 424 GLTPEGINQNEVMYEFMMENSFRSQPVELTEWFDKYATRRYASRNANARAAWQIFKRTVY 483

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           NC+DG    N+++ V  P     I                                +WY 
Sbjct: 484 NCSDGVKHHNKNIPVCRPSRKNKI-------------------------------DVWYD 512

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
             +  +  +L IA+  E+  S  +RYDL+D++RQAL   +   +  I+ +Y+  +   + 
Sbjct: 513 VEDFFKGWDLMIAASKEVD-SPLFRYDLVDVSRQALQVISITYYNQILTSYKQKNLTSLA 571

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
                 L L++DMD +LA    FLLG W+  A +     E++
Sbjct: 572 SSGNDLLHLLDDMDTVLATDSHFLLGAWIAGAHRNGVTPEEK 613


>gi|326679829|ref|XP_688608.3| PREDICTED: alpha-N-acetylglucosaminidase-like [Danio rerio]
          Length = 757

 Score =  374 bits (959), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 196/464 (42%), Positives = 279/464 (60%), Gaps = 37/464 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  WGGPLPQSW  +QL LQ KIL R+   GM PVLPAFSG VP  +  +FP A +T
Sbjct: 214 MGNLFQWGGPLPQSWHVKQLYLQFKILDRMRSFGMIPVLPAFSGIVPEGITRLFPKANVT 273

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L  W        + C Y+LD  DPLF  IG  F+ Q ++E+G T HIYN DTF+E  P 
Sbjct: 274 KLSPWSHFNCT--YSCAYVLDPRDPLFHRIGALFLTQVIEEFG-TDHIYNTDTFNEMPPA 330

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y++S+  AI++ M S D  A+WLMQGWLF  DP FW+  Q+KALL+ VPLG+++V
Sbjct: 331 SSDPTYLASISRAIFNTMTSVDPQAIWLMQGWLFISDPSFWKADQVKALLHGVPLGRMIV 390

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE  P++S++  FYG P+IWCMLHNF GN  ++G +DSI  GP  A    N+T+VG
Sbjct: 391 LDLFAESMPVYSSTNSFYGQPFIWCMLHNFGGNSGLFGTVDSINSGPFNAVRFPNSTLVG 450

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +GM+ EGIEQNPV+Y+LMSE+A++ + V++  W++ Y++RRYG     +  AW +L+ +V
Sbjct: 451 LGMTPEGIEQNPVIYELMSELAWRKDPVNLYKWVSLYALRRYGSMDENLALAWQLLFRSV 510

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK-PVSKEAVLKSETSSYDHPHLW 358
           YNCT              P            KY+N+ + P+     L  +T       +W
Sbjct: 511 YNCT-------------LP------------KYKNHNRSPLVHRPSLHMQTD------IW 539

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
           Y  ++  RA +L   +   L    T+RYDL+D+TRQAL     E + +I  A+Q      
Sbjct: 540 YDPADFYRAWKLLFEAAPGLVTLETFRYDLVDVTRQALQLLTTEFYKDIKSAFQTQKLSD 599

Query: 419 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
           +       + +L+ ++D +L+ ++ FLLG WL+ A+    +E +
Sbjct: 600 LLTAGGVLVYDLLPELDRILSSNEHFLLGAWLQQAQSQGVDEHE 643


>gi|348533253|ref|XP_003454120.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Oreochromis
           niloticus]
          Length = 845

 Score =  373 bits (958), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 190/471 (40%), Positives = 290/471 (61%), Gaps = 47/471 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL  + GPLPQSW   QL LQ KIL R+   GM PVLPAFSGN+P  +  ++P A++T
Sbjct: 308 MANLFKFAGPLPQSWHVNQLYLQFKILERMRSFGMIPVLPAFSGNIPKGILRLYPEARVT 367

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG W        + C+ +LD  DPLF  IG  ++ Q LK++G T HIY+ DTF+E TPP
Sbjct: 368 RLGPWSHFNCS--YSCSLVLDPQDPLFHHIGSLYLSQVLKQFG-TDHIYSTDTFNEMTPP 424

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+S++  ++++ M + D  AVWLMQGWLF  D  FW+P Q++ALL+ VPLG+++V
Sbjct: 425 SSDPAYLSAVSRSVFASMTAVDPQAVWLMQGWLFFSDAAFWKPAQIQALLHGVPLGRMIV 484

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +PI+S ++ FYG P+IWCML NF GN  ++G ++SI  GP +A    N+T+VG
Sbjct: 485 LDLFAETEPIFSYTESFYGQPFIWCMLQNFGGNSGLFGTVESINSGPFKALHFPNSTLVG 544

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +GM+ EGIEQNPV Y+LMSE+A++ E V++  W++ Y++RRYG +  ++  AW +L+ ++
Sbjct: 545 IGMTPEGIEQNPVTYELMSELAWRKEPVNLAKWVSLYAIRRYGNTQESLTTAWRLLFASI 604

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG-KPVSKEAVLKSETSSYDHPHLW 358
           YNCTD                           Y+N+   P+ +    +  T       LW
Sbjct: 605 YNCTDP-------------------------HYRNHNHSPLVRRPSFQMNTG------LW 633

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 412
           Y  +++ +A +L + +   L +  T+RYDL+D+TR+ L       + +I +A+      +
Sbjct: 634 YDPADLYKAWKLIMDAAPSLMSKETFRYDLVDVTREVLQVLTTSFYRDIADAFKKQNLSE 693

Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
           L  A GV        +L+ +++ LL+ +  FLLG WLE A+ LA ++++ +
Sbjct: 694 LLTAGGVL-----VYDLLPELNRLLSSNRNFLLGAWLERARSLAVDDKEAQ 739


>gi|375144105|ref|YP_005006546.1| alpha-N-acetylglucosaminidase [Niastella koreensis GR20-10]
 gi|361058151|gb|AEV97142.1| Alpha-N-acetylglucosaminidase [Niastella koreensis GR20-10]
          Length = 735

 Score =  369 bits (946), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 192/455 (42%), Positives = 278/455 (61%), Gaps = 36/455 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  WGGPLP SW+     LQ+KIL R  ELGM PVLPAF+G+VP A +  +P+AK+ 
Sbjct: 201 MGNLDAWGGPLPLSWMKSHKALQEKILQRERELGMKPVLPAFTGHVPPAFKKKYPNAKL- 259

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +  NW +  +D     TY+LD+ DPLF E+G+ F+++Q   +G T H+Y+ DTF+EN PP
Sbjct: 260 KATNWTNGFAD-----TYILDSQDPLFAEMGKRFLQKQTSLFG-TDHLYSADTFNENEPP 313

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
            D P ++S+L A IY GM+  D+ A W+MQGWLF  D  FW+ PQ++ALL +VP  K+++
Sbjct: 314 SDDPAFLSALSARIYEGMKQADTAATWVMQGWLFYSDRKFWKAPQIEALLKAVPDNKMIL 373

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMV 238
           LDL AE++P+W  +  FYG P+IW MLHNF GN+ ++G +D +A  P E    + +  + 
Sbjct: 374 LDLAAEIEPVWKRTDAFYGKPWIWNMLHNFGGNVNLFGRMDGVATQPAETLNDKASGKLW 433

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           G+G++ME IEQNPV+Y+LM+   +Q   VD+ AWI QY + RY  +   + DAW +L  T
Sbjct: 434 GIGLTMEAIEQNPVMYELMTRHTWQTTPVDLDAWIPQYVLNRYRTNNTNLVDAWQILRKT 493

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VYN   GA  +         D   SII+         G+P      + + T     PH  
Sbjct: 494 VYN---GAVIR---------DGAESIIT---------GRPTFDSTTVWTRTKLNYAPH-- 530

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
               E++ A +LF+ +  +   S+ ++YDL+D+TRQ LA YA  L    + A+   D+  
Sbjct: 531 ----ELLPAWDLFVQAAGKGVNSDGFQYDLVDVTRQVLANYAAPLQKKWVTAFNAKDSAA 586

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
             + S+ FL+L+ DMD LLA    F+LGPWL +A+
Sbjct: 587 FNKYSKAFLQLISDMDLLLASRKDFMLGPWLSAAR 621


>gi|373953359|ref|ZP_09613319.1| alpha-N-acetylglucosaminidase [Mucilaginibacter paludis DSM 18603]
 gi|373889959|gb|EHQ25856.1| alpha-N-acetylglucosaminidase [Mucilaginibacter paludis DSM 18603]
          Length = 733

 Score =  367 bits (942), Expect = 7e-99,   Method: Compositional matrix adjust.
 Identities = 188/458 (41%), Positives = 272/458 (59%), Gaps = 37/458 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  WGGPLP  W+     LQKKI+ R   LGM PVLPAF+G+VPAA +N +P+AK+ 
Sbjct: 195 MGNMDSWGGPLPLRWMQTHFDLQKKIIARERALGMKPVLPAFTGHVPAAFKNKYPTAKL- 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +  NW +  +D     TY+LD+ DP+F  IG+ F+++Q    G T H+Y+ DTF+EN PP
Sbjct: 254 KTTNWKNGFAD-----TYILDSADPMFARIGQLFLQKQTALLG-TDHLYSADTFNENEPP 307

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
            D PEY+  L   +Y GM   D+ AVW+MQGWLF  D  FW+P Q +ALL +VP  K+++
Sbjct: 308 SDEPEYLGKLSERVYQGMHQADTAAVWVMQGWLFYSDRKFWKPEQTRALLKAVPDDKMII 367

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMV 238
           LDL  E++P+W  ++ FYG P+IW ML+NF  N  ++G +DS A GP EA    ++  M 
Sbjct: 368 LDLATEIEPVWKRTEAFYGKPWIWNMLNNFGANTNLFGRMDSAAKGPAEAYHDPKSGQMK 427

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           G+G++MEGIEQNPV+YDL+++  ++++ ++V  W+ +Y + RYG+     Q AWN+L  T
Sbjct: 428 GIGLTMEGIEQNPVLYDLLTDNTWRNQPINVDEWLPKYVLNRYGKPNAQAQKAWNILRKT 487

Query: 299 VYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
           VY+        D    +I A P  D S                          S +    
Sbjct: 488 VYSVLADRYIRDGAESIIQARPTTDSS--------------------------SRWARTT 521

Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
           L Y    ++ A +  I +  +LS S+ +R+DL+DL+RQ LA YA  L    + A+Q  DA
Sbjct: 522 LNYEPKALLPAWQAMIKASEDLSTSDGFRFDLVDLSRQVLANYAFTLQRRFVLAHQQKDA 581

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 454
               + S  F+EL++DMD LLA    FLLGPW+  A++
Sbjct: 582 AAFKKHSAEFIELIQDMDQLLATRKDFLLGPWVADARR 619


>gi|73965663|ref|XP_548088.2| PREDICTED: alpha-N-acetylglucosaminidase [Canis lupus familiaris]
          Length = 747

 Score =  366 bits (939), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 195/465 (41%), Positives = 280/465 (60%), Gaps = 38/465 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP SW  +QL LQ +IL R+   GM PVLPAFSG+VP AL  VFP   IT
Sbjct: 205 MGNLHTWGGPLPHSWHLKQLYLQHRILDRMRSFGMIPVLPAFSGHVPKALTRVFPQINIT 264

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG+W        + C++LL   DPLF  IG  F+ + ++E+G T+HIY  DTF+E  PP
Sbjct: 265 QLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRELIQEFG-TNHIYGADTFNEMQPP 321

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y++S  A++Y  M + DSDAVWL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 322 SSEPSYLASATASVYQAMITVDSDAVWLLQGWLFQHQPQFWGPAQVKAVLEAVPRGRLLV 381

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM+G
Sbjct: 382 LDLFAESQPVYIQTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMLG 441

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D++AW++ ++ RRYG +    + AW +L  +
Sbjct: 442 TGMAPEGIGQNEVVYALMAELGWRKDPVADLEAWVSSFAARRYGVAHRDTEAAWRLLLRS 501

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +  +  NR  +V      PS+  VT                            +
Sbjct: 502 VYNCSGEACSGHNRSPLVR----RPSLQMVTT---------------------------V 530

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
           WY+ S+V  A  L + +   L++S T+RYDL+D+TRQA  +  +  ++    AY   +  
Sbjct: 531 WYNRSDVFEAWRLLLTAAPTLASSPTFRYDLLDVTRQAAQELVSLYYVEARSAYLRKELV 590

Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            + + +   + EL+  +D +LA    FLLG WLE A+  A +E +
Sbjct: 591 PLLRAAGVLVYELLPALDKVLASDSRFLLGRWLEQARAAAVSEAE 635


>gi|410930376|ref|XP_003978574.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Takifugu rubripes]
          Length = 751

 Score =  365 bits (938), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 192/465 (41%), Positives = 289/465 (62%), Gaps = 35/465 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  +GGPLPQSW   QL LQ KIL ++   GM PVLPAFSGN+P  +  +FP A++T
Sbjct: 214 MGNMFKFGGPLPQSWHVNQLYLQFKILAQMRSFGMIPVLPAFSGNIPKGILRLFPEARVT 273

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L  W   K +  + C+Y+LD  DPLF  IG  ++ Q +K++G T+HIYN DTF+E TPP
Sbjct: 274 RLEPW--SKFNCSFSCSYILDPRDPLFSRIGSLYLSQVVKQFG-TNHIYNTDTFNEMTPP 330

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+S++  A+++ M + D  AVWLMQGWLF  D  FW+P Q++ALLN VP+G+++V
Sbjct: 331 SSEPTYLSAVSRAVFASMTAVDPQAVWLMQGWLFLSDALFWKPAQIQALLNGVPVGRMIV 390

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++S ++ FYG P+IWCMLHNF GN   +G ++SI  GP +A    N+++VG
Sbjct: 391 LDLFAETEPVFSYTESFYGQPFIWCMLHNFGGNGGFFGTVESINTGPFKALHFPNSSLVG 450

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +GM+ EGIEQNPVVY+LMSE+A++ E V++  W++ Y  RRYG    ++  AW +L+ +V
Sbjct: 451 IGMTPEGIEQNPVVYELMSELAWRKEPVNLLKWVSLYVTRRYGSMHESVSAAWKILFASV 510

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           YNCT              P             Y+N+       + L    S + +  LWY
Sbjct: 511 YNCT-------------LP------------HYRNH-----NHSPLVRRPSFHMNSELWY 540

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
             +++ RA +L + +     +  T++YDL+D+TRQ +       + +I++A+Q +    +
Sbjct: 541 DPADLYRAWKLILEAAPSFMSKETFQYDLVDVTRQVMQVLTTSYYQDIVDAFQKHKMQEL 600

Query: 420 FQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
                  L +L+ +++ LL+ +  FLLG WLE A+ LA +E + K
Sbjct: 601 LTAGGVLLYDLLPELNRLLSSNHNFLLGTWLEQARSLALDEREAK 645


>gi|270005801|gb|EFA02249.1| hypothetical protein TcasGA2_TC007912 [Tribolium castaneum]
          Length = 747

 Score =  365 bits (936), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 182/465 (39%), Positives = 287/465 (61%), Gaps = 35/465 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPL  +W  + LVLQK+IL R+   G+ PVLPAF+G++P A + ++P A ++
Sbjct: 201 MGNMRGFGGPLSPAWHSRSLVLQKQILQRMRAFGIIPVLPAFAGHLPRAFKTLYPDANMS 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W     +  +CC Y LD T+PLF EIG+AF+ +Q+ E+G T H+YNCD+F+EN P 
Sbjct: 261 KMAPWNGF--NDTYCCPYFLDPTEPLFNEIGKAFLSEQISEFG-TDHMYNCDSFNENVPT 317

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQ-MKALLNSVPLGKLVV 179
                Y++++G +IY  M   D DAVWL+QGW+F  D FW+  + ++++L SVPLGK++V
Sbjct: 318 SGDLTYLANVGKSIYKAMTDTDPDAVWLLQGWMFYNDNFWQDTERVRSILTSVPLGKMIV 377

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL +E  P +    Q++G PYIWCMLH+F G + M+G    I   P++AR  EN+TM+G
Sbjct: 378 LDLQSEQFPQYERLNQYFGQPYIWCMLHDFGGTLGMFGSSTVINEVPIKARHLENSTMIG 437

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            G++ EGI QN V+Y+LM+E A++   V++  W  +YS RRYG      ++AW +L  TV
Sbjct: 438 TGLTPEGINQNYVIYELMTETAWRQAPVNLTEWFEKYSTRRYGFPDSDAENAWRILQRTV 497

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                    D   ++   GKY      ++K   LK +  +      WY
Sbjct: 498 Y--------------------DYQGLNRMRGKY-----AITKSPSLKIKIWT------WY 526

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
           ST++++ A    + + + L A++ Y +DL+D+TRQ L  Y +  +  +++ YQ +D+   
Sbjct: 527 STNDLLEAWTSLLEASDNLGANSGYLHDLVDVTRQVLQVYGDLYYKEMVKNYQSHDSANF 586

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
              S++FLE+++D+D +L+ +  FLLGPWLE+AK+ A +  +E Q
Sbjct: 587 QANSKKFLEILDDLDEILSTNSAFLLGPWLEAAKKAANDSAEEAQ 631


>gi|91080563|ref|XP_973259.1| PREDICTED: similar to alpha-N-acetyl glucosaminidase [Tribolium
           castaneum]
          Length = 747

 Score =  365 bits (936), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 183/465 (39%), Positives = 286/465 (61%), Gaps = 35/465 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPL  +W  + LVLQK+IL R+   G+ PVLPAF+G++P A + ++P A ++
Sbjct: 201 MGNMRGFGGPLSPAWHSRSLVLQKQILQRMRAFGIIPVLPAFAGHLPRAFKTLYPDANMS 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W     +  +CC Y LD T+PLF EIG+AF+ +Q+ E+G T H+YNCD+F+EN P 
Sbjct: 261 KMAPWNGF--NDTYCCPYFLDPTEPLFNEIGKAFLSEQISEFG-TDHMYNCDSFNENVPT 317

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
                Y++++G +IY  M   D DAVW+MQGWLF++D F W   + KA+L +VP GK++V
Sbjct: 318 SGDLTYLANVGKSIYKAMTDTDPDAVWVMQGWLFAHDFFYWTRNRAKAILTAVPKGKMIV 377

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL +E  P +    Q++G PYIWCMLH+F G + M+G    I   P++AR  EN+TM+G
Sbjct: 378 LDLQSEQFPQYERLNQYFGQPYIWCMLHDFGGTLGMFGSSTVINEVPIKARHLENSTMIG 437

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            G++ EGI QN V+Y+LM+E A++   V++  W  +YS RRYG      ++AW +L  TV
Sbjct: 438 TGLTPEGINQNYVIYELMTETAWRQAPVNLTEWFEKYSTRRYGFPDSDAENAWRILQRTV 497

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                    D   ++   GKY      ++K   LK +  +      WY
Sbjct: 498 Y--------------------DYQGLNRMRGKY-----AITKSPSLKIKIWT------WY 526

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
           ST++++ A    + + + L A++ Y +DL+D+TRQ L  Y +  +  +++ YQ +D+   
Sbjct: 527 STNDLLEAWTSLLEASDNLGANSGYLHDLVDVTRQVLQVYGDLYYKEMVKNYQSHDSANF 586

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
              S++FLE+++D+D +L+ +  FLLGPWLE+AK+ A +  +E Q
Sbjct: 587 QANSKKFLEILDDLDEILSTNSAFLLGPWLEAAKKAANDSAEEAQ 631


>gi|348562747|ref|XP_003467170.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
           [Cavia porcellus]
          Length = 750

 Score =  362 bits (929), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 189/466 (40%), Positives = 281/466 (60%), Gaps = 36/466 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPLP++W  +QL LQ +IL R+  LGM PVLPAF+G+VP A+  VFP   IT
Sbjct: 211 MGNLHGWGGPLPRTWHLKQLSLQHQILDRMRALGMTPVLPAFAGHVPKAIGRVFPQVNIT 270

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG+W        + C++LL   DPLF  IG  F+ + ++E+G T+HIY  DTF+E  PP
Sbjct: 271 QLGSWGHFNCS--YSCSFLLAPEDPLFPLIGGIFLRELIREFG-TNHIYGADTFNEMQPP 327

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A++  M + DSDAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 328 SSDPAYLAAATEAVFKAMVAVDSDAVWLLQGWLFQHQPEFWGPAQVGAVLGAVPQGRLLV 387

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 388 LDLFAESQPVYTRTASFRGQPFIWCMLHNFGGNHGLFGALEAVNRGPTAARLFPNSTMVG 447

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            G++ EGI QN VVY LM+E+ ++ + V D+ AW+++++ RRYG + P  + AW +L  +
Sbjct: 448 TGITPEGIGQNEVVYALMAELGWRKDPVPDLLAWVSRFAERRYGVAQPDAEAAWRLLLRS 507

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VYNC+  A   +                           P+ +   L+  T+      +W
Sbjct: 508 VYNCSGEACRGHNH------------------------SPLVRRPSLQMNTA------VW 537

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
           Y+ S+V  A  L + +  +L+ S T+RYDL+D+TRQAL +  +  +  +  AY   +  G
Sbjct: 538 YNRSDVFEAWRLLLKASPKLTTSPTFRYDLLDVTRQALQELVSLYYEEVRAAYLHQELAG 597

Query: 419 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
           + +       +L+  +D +LA    FLLG WL  A+  A +E + +
Sbjct: 598 LLRAGGVLAYQLLPALDEVLASDHHFLLGSWLAQARAAAASETEAR 643


>gi|410981277|ref|XP_003996997.1| PREDICTED: alpha-N-acetylglucosaminidase [Felis catus]
          Length = 857

 Score =  362 bits (928), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 193/465 (41%), Positives = 273/465 (58%), Gaps = 38/465 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP SW  +QL LQ +IL R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 318 MGNLHTWGGPLPPSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAITRVFPQVNVT 377

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG+W        + C++LL   DPLF  IG  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 378 QLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRELTKEFG-TDHIYGADTFNEMQPP 434

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y++S  A++Y  M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 435 SSEPSYLASATASVYQAMVTVDPDAVWLLQGWLFQHQPQFWGPAQVSAVLGAVPRGRLLV 494

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 495 LDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMVG 554

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D++AW+  ++ RRYG S    + AW +L  +
Sbjct: 555 TGMAPEGIGQNEVVYALMAELGWRKDPVADLEAWVTGFAARRYGVSHGNTEAAWRLLLRS 614

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +  +  NR  +V  P                          LK  T+      +
Sbjct: 615 VYNCSGEACSGHNRSPLVRRPS-------------------------LKMTTT------V 643

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
           WY+ S+V  A  L + +   L+ S T+RYDL+D+TRQA  +  +  +     AY   +  
Sbjct: 644 WYNRSDVFEAWRLLLTTTPSLATSPTFRYDLLDVTRQAAQELVSLYYGEARTAYLNKELV 703

Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            + + +   + EL+  +D +LA    FLLG WLE A+  A +E +
Sbjct: 704 PLLRAAGILVYELLPSLDKVLASDSRFLLGSWLEQARAAAVSEAE 748


>gi|432926094|ref|XP_004080826.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Oryzias latipes]
          Length = 882

 Score =  360 bits (925), Expect = 7e-97,   Method: Compositional matrix adjust.
 Identities = 191/470 (40%), Positives = 285/470 (60%), Gaps = 45/470 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N++ +GGPLPQSW   QL LQ +IL R+   GM PVLPAFSGNVP  +  + P A +T
Sbjct: 345 MANMYKFGGPLPQSWHVNQLRLQFRILERMRAFGMIPVLPAFSGNVPKGILKLHPEANVT 404

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG W        + C+Y+LD  DPLF++IG  ++ Q +K++G T HIYN DTF+E TPP
Sbjct: 405 RLGPWAHFNCS--FSCSYVLDPRDPLFLQIGSLYLSQVVKQFG-TDHIYNTDTFNEMTPP 461

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+S++  ++++ M + D  A+WLMQGWLF  D  FW+PPQ++ALL+ VPLG+++V
Sbjct: 462 SSDPAYLSAISRSVFASMTAVDPKAIWLMQGWLFFSDAAFWKPPQIRALLHGVPLGRMIV 521

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++S ++ FYG P+IWCMLHNF GN   +G ++SI  GP +A   +N+TMVG
Sbjct: 522 LDLFAETEPVFSYTESFYGQPFIWCMLHNFGGNNGFFGTVESINSGPFKALNFKNSTMVG 581

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +GM+ EGI QNPV+Y+LMSE+A++ E V++  W + Y+ RRYG    ++  AW +L+ +V
Sbjct: 582 IGMTPEGIHQNPVIYELMSELAWRKESVNLTKWASLYAARRYGSMHESLSAAWKLLFSSV 641

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           YNCT                       V   +  N+  P+ +       T       LWY
Sbjct: 642 YNCT-----------------------VPHYRNHNH-SPLVRRPSFNMNTG------LWY 671

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QL 413
             ++++   +LF+ +   L +  T+RYDL+D+TRQ L       + +I +A+      +L
Sbjct: 672 DPADLLETWKLFMEAAPSLMSKETFRYDLVDVTRQVLQDLTTYFYQDIKDAFHSKKMPEL 731

Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
             + GV        +L  +++ LL     FLLG WLE A+  A +E + +
Sbjct: 732 LTSGGVL-----IYDLFPELNRLLNSDRNFLLGTWLEQAQSFALDEPEAR 776


>gi|149054264|gb|EDM06081.1| rCG33377, isoform CRA_d [Rattus norvegicus]
          Length = 580

 Score =  360 bits (923), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 192/467 (41%), Positives = 277/467 (59%), Gaps = 38/467 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP+SW  +QL LQ +IL R+   GM PVLPAF+G+VP A+  VFP   + 
Sbjct: 43  MGNLHTWDGPLPRSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAITRVFPQVNVI 102

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLGNW        + C++LL   DPLF  IG  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 103 QLGNWGHFNCS--YSCSFLLAPGDPLFPLIGTLFLRELTKEFG-TDHIYGADTFNEMQPP 159

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 160 FSDPSYLAAATAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIKAVLEAVPRGRLLV 219

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++S +  F+G P+IWCMLHNF GN  ++G L+ +  GP  AR   N+TMVG
Sbjct: 220 LDLFAETQPVYSRTASFHGQPFIWCMLHNFGGNHGLFGALEDVNQGPQAARLFPNSTMVG 279

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            G++ EGI QN VVY LM+E+ ++ + V D+ AW++ ++ RRYG S P    AW +L  +
Sbjct: 280 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLVAWVSSFASRRYGVSQPDAVAAWRLLLRS 339

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +  +  NR                          P+ K   L+  T+      +
Sbjct: 340 VYNCSGEACSGHNR-------------------------SPLVKRPSLQMSTA------V 368

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
           WY+ S+V  A  L + +   L+AS  +RYDL+D+TRQA+ +  +  +     A+   D  
Sbjct: 369 WYNRSDVFEAWRLLLRAAPNLTASPAFRYDLLDVTRQAVQELVSSCYEEARTAFLNQDLD 428

Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
            + +       +L+  +D LLA +  FLLG WL+ A+++A +E + +
Sbjct: 429 LLLRAGGLLTYKLLPSLDELLASNSHFLLGTWLDQAREVAVSESEAQ 475


>gi|255533666|ref|YP_003094038.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
 gi|255346650|gb|ACU05976.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
          Length = 735

 Score =  360 bits (923), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 190/466 (40%), Positives = 278/466 (59%), Gaps = 38/466 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPLP+S +     LQKKIL R    GM P+LPAF+G+VP A ++ FP AK+ 
Sbjct: 199 MGNIDGWGGPLPKSQMLAHEALQKKILERERSFGMTPILPAFTGHVPPAFKDKFPKAKLK 258

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +  NW +  S       Y+LD  D LF  IG+ FIE+++K +G T H+Y  DTF+ENTPP
Sbjct: 259 KT-NWTTFPS------VYILDPEDELFTTIGKRFIEEEVKTFG-TDHLYTADTFNENTPP 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
                Y+S++   +Y  M   D +A W+MQGWLF + + FW+P Q+KALLN++P  K++V
Sbjct: 311 TSDSLYLSNVSKKVYQSMALADPEATWIMQGWLFYHGEKFWKPTQIKALLNAIPNDKMIV 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMV 238
           LDL++E  P+W  +  +YG P+IW MLHNF GNI +YG +D +A G ++A+ + N+  MV
Sbjct: 371 LDLWSENHPVWQRTAAYYGKPWIWNMLHNFGGNISLYGRMDEVASGAIKAKQAANSGNMV 430

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           G+G++ E IEQNPV+Y LM +  +  E ++V AW+  YS +RYG      + AW +LY T
Sbjct: 431 GIGLTPEAIEQNPVMYQLMLDNIWTDEPINVTAWLKNYSRQRYGAQNALAEQAWQILYKT 490

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VY  T G            P    SI++         G+P        +E++    P   
Sbjct: 491 VY--TGG----------ILPGGPESILT---------GRPTM------AESTRSTRPKKN 523

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
           Y  +E+I A E  + +  +LS ++ ++YDL+D+TRQ L  YA+ L     +AYQ  D   
Sbjct: 524 YKPAELIPAWEALLKASQQLS-TDGFKYDLVDVTRQVLVNYADTLQRQFAQAYQGKDGKK 582

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
             +LS  FL +++D+D LLA    FLLG WL  AK++    E++K+
Sbjct: 583 FDRLSGDFLAVMDDVDYLLATRKDFLLGKWLNEAKRMGTTAEEKKR 628


>gi|281344539|gb|EFB20123.1| hypothetical protein PANDA_011160 [Ailuropoda melanoleuca]
          Length = 619

 Score =  359 bits (922), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 188/467 (40%), Positives = 275/467 (58%), Gaps = 38/467 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP+SW  +QL LQ +IL R+   GM PVLPAF+G+VP AL  VFP   +T
Sbjct: 81  MGNLHTWGGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 140

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG+W        + C++LL   DPLF  IG  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 141 QLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRELTKEFG-TDHIYGADTFNEMQPP 197

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  A++Y  M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 198 SSEPSYLAAATASVYQAMITVDPDAVWLLQGWLFQHQPEFWGPAQVTAVLGAVPRGRLLV 257

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++  +  F+G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM G
Sbjct: 258 LDLFAESQPVYIRTASFHGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMAG 317

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN +VY LM+E+ ++ + V D++AW++  + RRYG +    + AW +L  +
Sbjct: 318 TGMAPEGIGQNEMVYALMAELGWRKDPVADLEAWVSSSAARRYGVTHKDTEAAWRLLLRS 377

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +  +  NR  +V  P +  +                                 +
Sbjct: 378 VYNCSGEACSGHNRSPLVRRPSLQMATA-------------------------------V 406

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
           WY+ S+V  A  L + +   L+AS ++RYDL+D+TRQA  +  +  +     AY   +  
Sbjct: 407 WYNRSDVFEAWRLLLTAAPTLAASPSFRYDLLDVTRQAAQELVSLYYEEARAAYLNKELV 466

Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
            + + + R + EL+  +D +LA    FLLG WLE A+  A +E + +
Sbjct: 467 PLLRAAGRLVYELLPALDKVLASDRRFLLGSWLEQARAAAVSEAEAR 513


>gi|109491871|ref|XP_001081442.1| PREDICTED: alpha-N-acetylglucosaminidase [Rattus norvegicus]
 gi|392351622|ref|XP_002727861.2| PREDICTED: alpha-N-acetylglucosaminidase [Rattus norvegicus]
 gi|149054262|gb|EDM06079.1| rCG33377, isoform CRA_b [Rattus norvegicus]
          Length = 739

 Score =  359 bits (922), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 192/467 (41%), Positives = 277/467 (59%), Gaps = 38/467 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP+SW  +QL LQ +IL R+   GM PVLPAF+G+VP A+  VFP   + 
Sbjct: 202 MGNLHTWDGPLPRSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAITRVFPQVNVI 261

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLGNW        + C++LL   DPLF  IG  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 262 QLGNWGHFNCS--YSCSFLLAPGDPLFPLIGTLFLRELTKEFG-TDHIYGADTFNEMQPP 318

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 319 FSDPSYLAAATAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIKAVLEAVPRGRLLV 378

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++S +  F+G P+IWCMLHNF GN  ++G L+ +  GP  AR   N+TMVG
Sbjct: 379 LDLFAETQPVYSRTASFHGQPFIWCMLHNFGGNHGLFGALEDVNQGPQAARLFPNSTMVG 438

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            G++ EGI QN VVY LM+E+ ++ + V D+ AW++ ++ RRYG S P    AW +L  +
Sbjct: 439 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLVAWVSSFASRRYGVSQPDAVAAWRLLLRS 498

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +  +  NR                          P+ K   L+  T+      +
Sbjct: 499 VYNCSGEACSGHNR-------------------------SPLVKRPSLQMSTA------V 527

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
           WY+ S+V  A  L + +   L+AS  +RYDL+D+TRQA+ +  +  +     A+   D  
Sbjct: 528 WYNRSDVFEAWRLLLRAAPNLTASPAFRYDLLDVTRQAVQELVSSCYEEARTAFLNQDLD 587

Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
            + +       +L+  +D LLA +  FLLG WL+ A+++A +E + +
Sbjct: 588 LLLRAGGLLTYKLLPSLDELLASNSHFLLGTWLDQAREVAVSESEAQ 634


>gi|301773566|ref|XP_002922216.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Ailuropoda
           melanoleuca]
          Length = 634

 Score =  359 bits (921), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 188/467 (40%), Positives = 275/467 (58%), Gaps = 38/467 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP+SW  +QL LQ +IL R+   GM PVLPAF+G+VP AL  VFP   +T
Sbjct: 95  MGNLHTWGGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 154

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG+W        + C++LL   DPLF  IG  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 155 QLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRELTKEFG-TDHIYGADTFNEMQPP 211

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  A++Y  M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 212 SSEPSYLAAATASVYQAMITVDPDAVWLLQGWLFQHQPEFWGPAQVTAVLGAVPRGRLLV 271

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++  +  F+G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM G
Sbjct: 272 LDLFAESQPVYIRTASFHGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMAG 331

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN +VY LM+E+ ++ + V D++AW++  + RRYG +    + AW +L  +
Sbjct: 332 TGMAPEGIGQNEMVYALMAELGWRKDPVADLEAWVSSSAARRYGVTHKDTEAAWRLLLRS 391

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +  +  NR  +V  P +  +                                 +
Sbjct: 392 VYNCSGEACSGHNRSPLVRRPSLQMATA-------------------------------V 420

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
           WY+ S+V  A  L + +   L+AS ++RYDL+D+TRQA  +  +  +     AY   +  
Sbjct: 421 WYNRSDVFEAWRLLLTAAPTLAASPSFRYDLLDVTRQAAQELVSLYYEEARAAYLNKELV 480

Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
            + + + R + EL+  +D +LA    FLLG WLE A+  A +E + +
Sbjct: 481 PLLRAAGRLVYELLPALDKVLASDRRFLLGSWLEQARAAAVSEAEAR 527


>gi|295132875|ref|YP_003583551.1| hypothetical protein ZPR_1010 [Zunongwangia profunda SM-A87]
 gi|294980890|gb|ADF51355.1| predicted protein [Zunongwangia profunda SM-A87]
          Length = 750

 Score =  359 bits (921), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 188/464 (40%), Positives = 275/464 (59%), Gaps = 36/464 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GWGGPLPQSW +    LQKKIL R  ELGM PVLPAF+G+VPA+ +  FP A + 
Sbjct: 206 MGNLDGWGGPLPQSWKESHRDLQKKILKRSRELGMKPVLPAFTGHVPASFKKFFPDADLK 265

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +  NW +   D     TY+LDA DPLF EIG+ F+E+Q + +G T H Y  DTF+EN PP
Sbjct: 266 KT-NWGNDFGD-----TYILDAEDPLFAEIGKRFLEKQEEVFG-TDHFYTADTFNENEPP 318

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVV 179
            D P+Y+  L   I+ GM++ D +A W+MQGWLF S+  FW+ PQ+K LL++VP  ++++
Sbjct: 319 SDDPKYLGELSEKIFEGMKAADPEATWVMQGWLFYSHKDFWKTPQIKGLLSTVPDDRMII 378

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMV 238
           LDL  E++P+W  ++ FYG  +IW MLHNF GNI M+G ++++A  P  A   S +  + 
Sbjct: 379 LDLATEIEPVWKQTEAFYGKQWIWNMLHNFGGNISMFGRIETVAEQPALALNDSTSGNLK 438

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           G+G++ME IEQNPV+Y+LM++  ++   +++K+W+  Y+  RYG    +I +AW++L  T
Sbjct: 439 GIGLTMEAIEQNPVLYELMTDNTWRDTPIELKSWLKNYTRNRYGAVNDSILEAWDILVAT 498

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
            YN T    D    +I A P          EG Y+ + +                     
Sbjct: 499 AYNGT-TIRDGAESIIAARP--------TFEG-YRRWARTKMN----------------- 531

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
           Y   +++ A +LFI + +    S+ + YDL+DL+RQ LA YA  +   +  AY+ ND   
Sbjct: 532 YDPLDLLPAWDLFIGARDRFKDSDGFAYDLVDLSRQVLANYALPVQQQMRIAYENNDKEA 591

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
             + S   L L+ D+D LLA    FLLGPW+  A+      E++
Sbjct: 592 FKKHSEELLTLISDLDRLLATRKDFLLGPWIADARSWGTTPEEK 635


>gi|449491231|ref|XP_004174728.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase,
           partial [Taeniopygia guttata]
          Length = 752

 Score =  358 bits (919), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 194/471 (41%), Positives = 286/471 (60%), Gaps = 46/471 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  W GPLP +W  +QL LQ +I+ R+  LGM  VLPAF+G+VP  +  VFP    T
Sbjct: 211 MGNLRRWAGPLPPAWHFKQLYLQYRIVERMRSLGMTTVLPAFAGHVPQGILRVFPRVNAT 270

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W     D  + C YLLD  DP+F  IG  F+++ +KE+G T H+Y+ DTF+E TP 
Sbjct: 271 RLGHWSHF--DCTYSCIYLLDPEDPMFQVIGTLFLKELIKEFG-TDHVYSADTFNEMTPL 327

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+S +  A++  M   D  A+WLMQGWLF + P FW+P Q++ALL+ VPLG+++V
Sbjct: 328 SSDPAYLSRVSNAVFRSMTGADPKALWLMQGWLFQHQPDFWQPAQVRALLHGVPLGRMIV 387

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE KP++  ++ FYG P+IWCMLHNF GN  ++G +++I  GP  AR   N+TMVG
Sbjct: 388 LDLFAESKPVYQWTESFYGQPFIWCMLHNFGGNHGLFGTVEAINHGPFAARRFPNSTMVG 447

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            G+  EGIEQN +VY+LM+E+ ++ E +D+ +W+ +Y+ RRYG    A   AW +L  +V
Sbjct: 448 TGLVPEGIEQNDMVYELMNELGWRQEPLDLPSWVTRYAERRYGAPNAAAASAWRLLLRSV 507

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           YNCT    + NR                          P+ +   L+ +T       LWY
Sbjct: 508 YNCTGVCVNHNRS-------------------------PLVRRPSLRMDT------ELWY 536

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ------L 413
           + S+V  A  L +++G EL +S  + YDL+D+TRQA  +  +  +L+I +A+Q      L
Sbjct: 537 NASDVFEAWRLLLSAGAELGSSPAFLYDLVDVTRQAAQQLVSHYYLSIRQAFQSHALPEL 596

Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
             A GV        +L+ ++D LL+ H  FLLG WL+SA+ +A ++++ +Q
Sbjct: 597 LTAGGVL-----VYDLLPELDSLLSSHSLFLLGRWLQSARAVATSDQEAEQ 642


>gi|340717403|ref|XP_003397173.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bombus terrestris]
          Length = 770

 Score =  358 bits (919), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 189/464 (40%), Positives = 267/464 (57%), Gaps = 35/464 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPL  SW ++ L LQ KIL R+ ELG+ PVLPAF+G+VP A   +FP A +T
Sbjct: 218 MGNIRGFGGPLTSSWHERSLQLQHKILQRMRELGIIPVLPAFTGHVPRAFPRLFPEANVT 277

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +   W S     ++CC YLL+ TDPLF +IG  F+   +KE+G T HIYNCDTF+EN PP
Sbjct: 278 KSATWNSFSD--KYCCPYLLEPTDPLFHKIGDQFLRTYIKEFG-TDHIYNCDTFNENEPP 334

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
               +++ ++G +I+  M S D  A+WLMQGWLF +D  FW  P++K  L SVPLG+L+V
Sbjct: 335 TSELKFLRNVGHSIFQTMLSVDPQAIWLMQGWLFVHDALFWTEPRIKTFLTSVPLGRLIV 394

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL +E  P++   K +YG P+IWCMLHNF G + M+G    I     E R  E +TM+G
Sbjct: 395 LDLQSEQFPLYGKLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINRRVFEGRNMEGSTMIG 454

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            G++ EGI QN V+Y+LM+EMA++ E V++  W   Y+ RRYG        AW  L  TV
Sbjct: 455 TGLTPEGINQNYVIYELMNEMAYRQEPVNLDNWFEDYASRRYGAWNEYAVAAWKNLGSTV 514

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           YN                       IS   GKY    +P    A L            WY
Sbjct: 515 YNFRG--------------------ISKIRGKYVITRRPSLNLARLT-----------WY 543

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              +      +F+ + +    S  YR+D++D+TRQAL   A++++  ++E++   D    
Sbjct: 544 DPEKFYSTWYIFLQARHGRQNSTLYRHDVVDITRQALQLKADKIYSALVESFNQKDVTTF 603

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
              + R LEL +D++ +LA  + FLLG WLE AK LA ++ + K
Sbjct: 604 KLQADRLLELFDDLEAILASSEDFLLGTWLEMAKNLATDDAESK 647


>gi|440903235|gb|ELR53922.1| Alpha-N-acetylglucosaminidase, partial [Bos grunniens mutus]
          Length = 614

 Score =  358 bits (918), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 195/469 (41%), Positives = 274/469 (58%), Gaps = 46/469 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ +IL R+   GM PVLPAF+G+VP AL  VFP   +T
Sbjct: 77  MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 136

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+GNW        + C++LL   DPLF  +G  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 137 QMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 193

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 194 SSEPSYLAAATTAVYQAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVAAVLGAVPRGRLLV 253

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+S+  GP  AR   N+TMVG
Sbjct: 254 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPTTARHFPNSTMVG 313

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  ++ RRYG S    + AW +L  +
Sbjct: 314 TGMAPEGIGQNEVVYALMAELGWKKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 373

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VYNC                       S  E +  N+  P+ +   L+  T+      +W
Sbjct: 374 VYNC-----------------------SGEECRGHNH-SPLVRRPSLQMVTT------VW 403

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 412
           Y+ S+V  A  L +A+ + L++S  +RYDL+D+TRQA+ +  +  +  +  AY       
Sbjct: 404 YNRSDVFEAWRLLLAATSTLASSPAFRYDLVDVTRQAVQELVSLYYEEMRTAYLKKELVP 463

Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
           L  A G+        EL+  +D +LA    FLLG WLE A+Q A +E +
Sbjct: 464 LTRAGGILA-----YELLPALDQVLASDCHFLLGSWLEQARQAAVSETE 507


>gi|350407422|ref|XP_003488083.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bombus impatiens]
          Length = 770

 Score =  358 bits (918), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 189/464 (40%), Positives = 268/464 (57%), Gaps = 35/464 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPL  SW ++ L LQ +IL R+ ELG+ PVLPAF+G+VP A   +FP A +T
Sbjct: 218 MGNIRGFGGPLTSSWHERSLQLQHRILQRMRELGIIPVLPAFTGHVPRAFPRLFPEANVT 277

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +   W S     ++CC YLL+ TDPLF +IG  F+   +KE+G T HIYNCDTF+EN PP
Sbjct: 278 KSATWNSFSD--KYCCPYLLEPTDPLFHKIGDQFLRTYIKEFG-TDHIYNCDTFNENEPP 334

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
               +++ ++G +I+  M S D  A+WLMQGWLF +D  FW  P++KA L SVPLG+L+V
Sbjct: 335 TSELKFLRNVGHSIFQTMLSVDPQAIWLMQGWLFVHDAVFWTEPRIKAFLTSVPLGRLIV 394

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL +E  P++   K +YG P+IWCMLHNF G + M+G    I     E R  E +TM+G
Sbjct: 395 LDLQSEQFPLYGKLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINRRVFEGRNMEGSTMIG 454

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            G++ EGI QN V+Y+LM+EMA++ E V++  W   Y+ RRYG        AW  L  TV
Sbjct: 455 TGLTPEGINQNYVIYELMNEMAYRQEPVNLDNWFEDYASRRYGAWNEYAVAAWKNLGSTV 514

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           YN                       IS   GKY    +P    A L            WY
Sbjct: 515 YNFRG--------------------ISKIRGKYVITRRPSLNLARLT-----------WY 543

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              +      +F+ + +    S  YR+D++D+TRQAL   A++++  ++E++   D    
Sbjct: 544 DPEKFYSTWYIFLQARHGRKNSTLYRHDVVDITRQALQLKADKIYSVLVESFNQKDVTTF 603

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
              + R LEL +D++ +LA  + FLLG WLE AK LA ++ + K
Sbjct: 604 KLQAGRLLELFDDLEAILASSEDFLLGTWLEMAKNLATDDAESK 647


>gi|355568706|gb|EHH24987.1| Alpha-N-acetylglucosaminidase, partial [Macaca mulatta]
          Length = 711

 Score =  358 bits (918), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 193/470 (41%), Positives = 273/470 (58%), Gaps = 48/470 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 172 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 231

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 232 KMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRELVKEFG-TDHIYGADTFNEMQPP 288

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
             +P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 289 SSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 348

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 349 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 408

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  ++ +RYG S P    AW +L  +
Sbjct: 409 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLLRS 468

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 469 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------V 497

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
           WY+ S V  A  L + S   L+AS  +RYDL+DLTRQA+ +  +  +     AY      
Sbjct: 498 WYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELT 557

Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            L  A GV        EL+  +D LLA    FLLG WLE A+  A +E +
Sbjct: 558 SLLRAGGVLA-----YELLPALDELLASDSRFLLGSWLEQARAAAVSEAE 602


>gi|402900329|ref|XP_003913130.1| PREDICTED: alpha-N-acetylglucosaminidase [Papio anubis]
          Length = 743

 Score =  358 bits (918), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 193/470 (41%), Positives = 272/470 (57%), Gaps = 48/470 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRELVKEFG-TDHIYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
             +P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 321 SSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPEAARLFPNSTMVG 440

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  ++ +RYG S P    AW +L  +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLLRS 500

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------V 529

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
           WY+ S V  A  L + S   L+AS  +RYDL+DLTRQA+ +  +  +     AY      
Sbjct: 530 WYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELT 589

Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            L  A GV        EL+  +D LLA    FLLG WLE A+  A +E +
Sbjct: 590 SLLRAGGVLA-----YELLPALDELLASDSRFLLGSWLEQARAAAVSEAE 634


>gi|126307960|ref|XP_001366343.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Monodelphis
           domestica]
          Length = 741

 Score =  357 bits (917), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 183/465 (39%), Positives = 269/465 (57%), Gaps = 38/465 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP SW  +Q  LQ +IL R+   GM PVLPAF+G++P A   VFP A +T
Sbjct: 202 MGNLHTWGGPLPSSWDLKQSYLQYRILERMRSFGMKPVLPAFAGHIPKAFTRVFPQANVT 261

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W     +  + C+YLL   DPLF  +G  F+ +  KE+G T HIY+ D F+E  PP
Sbjct: 262 NLGMWGHFSCN--YSCSYLLAPEDPLFPVVGSLFLRELTKEFG-TDHIYSADIFNEMDPP 318

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
             +P Y+++  AA+Y  M + D DAVWL QGWLF   P FW+PPQMKA+L +VP G+ ++
Sbjct: 319 SSNPAYLAATTAAVYEAMVAVDVDAVWLFQGWLFQNHPDFWKPPQMKAVLEAVPRGRFLI 378

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++S +  FYG P+IWCMLHNF GN  ++G+LD++  GP  AR   N+T+VG
Sbjct: 379 LDLFAESQPVYSRTNSFYGQPFIWCMLHNFGGNHGLFGVLDAVNRGPSTARLFPNSTIVG 438

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            G+  EGI QN VVY LM+E+ ++ +   D+ AW+  ++ +RYG   P  + AW +L  +
Sbjct: 439 TGIVPEGINQNEVVYALMAELGWRKDPFPDLGAWVAGFAAQRYGTPHPQAEAAWRLLLRS 498

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +  T  N   +V  P +                                    +
Sbjct: 499 VYNCSWENCTGHNHSPLVKRPSLHLDF-------------------------------SV 527

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
           WY+ S+V  A  L + +  +L+ S+ +RYDL+D+TRQ   +  +  +  +  A++     
Sbjct: 528 WYNRSDVFEAWRLLLEAAPQLATSSAFRYDLLDVTRQVAQELVSLYYGELKTAFEAGSMP 587

Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            +       + +L+  +D LL   + FLLG WLE A+++A +E +
Sbjct: 588 ALLSAGGLLVFDLLPSLDELLGTDERFLLGGWLEQAREMAVSEAE 632


>gi|114667172|ref|XP_523654.2| PREDICTED: alpha-N-acetylglucosaminidase isoform 2 [Pan
           troglodytes]
 gi|410216584|gb|JAA05511.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
 gi|410258938|gb|JAA17435.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
 gi|410304442|gb|JAA30821.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
 gi|410337929|gb|JAA37911.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
          Length = 743

 Score =  357 bits (915), Expect = 9e-96,   Method: Compositional matrix adjust.
 Identities = 191/470 (40%), Positives = 272/470 (57%), Gaps = 48/470 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L ++   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  ++ RRYG S P    AW +L  +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
           WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQA+ +  +  +     AY      
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589

Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            L  A GV        EL+  +D +LA    FLLG WLE A+  A +E +
Sbjct: 590 SLLRAGGVLA-----YELLPALDEVLASDSRFLLGSWLEQARAAAVSEAE 634


>gi|66346698|ref|NP_000254.2| alpha-N-acetylglucosaminidase precursor [Homo sapiens]
 gi|317373322|sp|P54802.2|ANAG_HUMAN RecName: Full=Alpha-N-acetylglucosaminidase; AltName:
           Full=N-acetyl-alpha-glucosaminidase; Short=NAG;
           Contains: RecName: Full=Alpha-N-acetylglucosaminidase 82
           kDa form; Contains: RecName:
           Full=Alpha-N-acetylglucosaminidase 77 kDa form; Flags:
           Precursor
          Length = 743

 Score =  357 bits (915), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 191/470 (40%), Positives = 272/470 (57%), Gaps = 48/470 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L ++   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  ++ RRYG S P    AW +L  +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
           WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQA+ +  +  +     AY      
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589

Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            L  A GV        EL+  +D +LA    FLLG WLE A+  A +E +
Sbjct: 590 SLLRAGGVLA-----YELLPALDEVLASDSRFLLGSWLEQARAAAVSEAE 634


>gi|1171229|gb|AAC50512.1| alpha-N-acetylglucosaminidase [Homo sapiens]
 gi|1171231|gb|AAC50513.1| alpha-N-acetylglucosaminidase [Homo sapiens]
 gi|1197840|gb|AAB06188.1| alpha-N-acetylglucosaminidase [Homo sapiens]
 gi|1479981|gb|AAB36604.1| alpha-N-acetylglucosaminidase [Homo sapiens]
 gi|32450702|gb|AAH53991.1| N-acetylglucosaminidase, alpha- [Homo sapiens]
 gi|119581237|gb|EAW60833.1| N-acetylglucosaminidase, alpha- (Sanfilippo disease IIIB), isoform
           CRA_b [Homo sapiens]
          Length = 743

 Score =  357 bits (915), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 191/470 (40%), Positives = 272/470 (57%), Gaps = 48/470 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L ++   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  ++ RRYG S P    AW +L  +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
           WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQA+ +  +  +     AY      
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589

Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            L  A GV        EL+  +D +LA    FLLG WLE A+  A +E +
Sbjct: 590 SLLRAGGVLA-----YELLPALDEVLASDSRFLLGSWLEQARAAAVSEAE 634


>gi|355754184|gb|EHH58149.1| Alpha-N-acetylglucosaminidase, partial [Macaca fascicularis]
          Length = 650

 Score =  356 bits (914), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 193/470 (41%), Positives = 272/470 (57%), Gaps = 48/470 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 111 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 170

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 171 KMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRELVKEFG-TDHIYGADTFNEMQPP 227

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
             +P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 228 SSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRLLV 287

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 288 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 347

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  ++ +RYG S P    AW +L  +
Sbjct: 348 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLLRS 407

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 408 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------V 436

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
           WY+ S V  A  L + S   L+AS  +RYDL+DLTRQA+ +  +  +     AY      
Sbjct: 437 WYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELT 496

Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            L  A GV        EL+  +D LLA    FLLG WLE A+  A +E +
Sbjct: 497 SLLRAGGVLA-----YELLPALDELLASDSRFLLGSWLEQARAAAVSEAE 541


>gi|1479983|gb|AAB36605.1| alpha-N-acetylglucosaminidase [Homo sapiens]
 gi|119581236|gb|EAW60832.1| N-acetylglucosaminidase, alpha- (Sanfilippo disease IIIB), isoform
           CRA_a [Homo sapiens]
          Length = 639

 Score =  356 bits (914), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 191/470 (40%), Positives = 272/470 (57%), Gaps = 48/470 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L ++   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 100 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 159

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 160 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 216

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 217 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 276

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 277 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 336

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  ++ RRYG S P    AW +L  +
Sbjct: 337 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 396

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 397 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 425

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
           WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQA+ +  +  +     AY      
Sbjct: 426 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 485

Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            L  A GV        EL+  +D +LA    FLLG WLE A+  A +E +
Sbjct: 486 SLLRAGGVLA-----YELLPALDEVLASDSRFLLGSWLEQARAAAVSEAE 530


>gi|297701096|ref|XP_002827555.1| PREDICTED: alpha-N-acetylglucosaminidase [Pongo abelii]
          Length = 836

 Score =  356 bits (913), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 191/470 (40%), Positives = 271/470 (57%), Gaps = 48/470 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 297 MGNLHSWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 356

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HI+  DTF+E  PP
Sbjct: 357 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIFGADTFNEMQPP 413

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 414 SSEPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRLLV 473

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 474 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 533

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  ++ RRYG S P    AW +L  +
Sbjct: 534 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 593

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 594 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------V 622

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
           WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQA+ +  +  +     AY      
Sbjct: 623 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 682

Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            L  A GV        EL+  +D +LA    FLLG WLE A+  A +E +
Sbjct: 683 SLLRAGGVLA-----YELLPALDEVLASDSRFLLGSWLEQARAAAVSEAE 727


>gi|395827009|ref|XP_003786703.1| PREDICTED: alpha-N-acetylglucosaminidase [Otolemur garnettii]
          Length = 756

 Score =  356 bits (913), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 191/464 (41%), Positives = 271/464 (58%), Gaps = 36/464 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP SW  +QL LQ +IL R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 204 MGNLHTWGGPLPFSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QL +W        + C++LL   DP+F  IG  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 264 QLSSWGHFNCS--YSCSFLLAPGDPIFSLIGSLFLRELTKEFG-TDHIYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D DAVWL+QGWLF + P FW P Q+KA+L +VPLG+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMIAVDPDAVWLLQGWLFQHQPQFWGPTQIKAVLRAVPLGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++S +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 381 LDLFAESQPVYSRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNQGPKAARLFPNSTMVG 440

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  ++ RRYG S    + AW +L  +
Sbjct: 441 TGMAPEGINQNEVVYALMAELGWRKDPVPDLVAWVTSFADRRYGISHGDAEAAWRLLLRS 500

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VYNC+  A   +                           P+ K   L+  T+      +W
Sbjct: 501 VYNCSGEACSGHNH------------------------SPLVKRPSLQMNTT------VW 530

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
           Y+ S+V  A  L + S   L+AS  +RYDL+D+TRQA+ +  +  +     AY   +   
Sbjct: 531 YNRSDVFEAWRLLLTSAPTLAASPIFRYDLLDITRQAIQELVSLYYEKARTAYLNKELVP 590

Query: 419 VFQLSRRF-LELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
           + +       EL+  +D +LA  + FLLG WL  A+ +A +E +
Sbjct: 591 LLRAGGLLAYELLPALDEVLASDNHFLLGSWLAQARAVAISEAE 634


>gi|358419179|ref|XP_003584151.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bos taurus]
          Length = 741

 Score =  356 bits (913), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 196/469 (41%), Positives = 274/469 (58%), Gaps = 46/469 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ +IL R+   GM PVLPAF+G+VP AL  VFP   +T
Sbjct: 204 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+GNW        + C++LL   DPLF  +G  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 264 QMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATAAVYQAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVAAVLGAVPRGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+S+  GP  AR   N+TMVG
Sbjct: 381 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPTTARHFPNSTMVG 440

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ +Q + V D+ AW+  ++ RRYG S    + AW +L  +
Sbjct: 441 TGMAPEGIGQNEVVYALMAELGWQKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 500

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VYNC                       S  E +  N+  P+ +   L+  T+      +W
Sbjct: 501 VYNC-----------------------SGEECRGHNH-SPLVRRPSLQMVTT------VW 530

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 412
           Y+ S+V  A  L + + + L++S  +RYDL+D+TRQA+ +  +  +  +  AY       
Sbjct: 531 YNRSDVFEAWRLLLTATSTLASSPAFRYDLVDVTRQAVQELVSLYYEEMRTAYLKKELVP 590

Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
           L  A G+        EL+  +D +LA    FLLG WLE A+Q A +E +
Sbjct: 591 LTRAGGILA-----YELLPALDQVLASDCHFLLGSWLEQARQAAVSETE 634


>gi|426348060|ref|XP_004041658.1| PREDICTED: alpha-N-acetylglucosaminidase [Gorilla gorilla gorilla]
          Length = 743

 Score =  356 bits (913), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 191/470 (40%), Positives = 272/470 (57%), Gaps = 48/470 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L ++   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIEAVLGAVPRGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  ++ RRYG S P    AW +L  +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
           WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQA+ +  +  +     AY      
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589

Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            L  A GV        EL+  +D +LA    FLLG WLE A+  A +E +
Sbjct: 590 SLLRAGGVLA-----YELLPALDEVLASDSRFLLGSWLEQARAAAVSEAE 634


>gi|397485721|ref|XP_003813989.1| PREDICTED: alpha-N-acetylglucosaminidase [Pan paniscus]
          Length = 682

 Score =  355 bits (912), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 191/470 (40%), Positives = 272/470 (57%), Gaps = 48/470 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L ++   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 143 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 202

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 203 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 259

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 260 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 319

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 320 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNEGPEAARLFPNSTMVG 379

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  ++ RRYG S P    AW +L  +
Sbjct: 380 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 439

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 440 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 468

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
           WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQA+ +  +  +     AY      
Sbjct: 469 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 528

Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            L  A GV        EL+  +D +LA    FLLG WLE A+  A +E +
Sbjct: 529 SLLRAGGVLA-----YELLPALDEVLASDSRFLLGSWLEQARAAAVSEAE 573


>gi|301626955|ref|XP_002942650.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Xenopus (Silurana)
           tropicalis]
          Length = 759

 Score =  353 bits (907), Expect = 8e-95,   Method: Compositional matrix adjust.
 Identities = 181/463 (39%), Positives = 273/463 (58%), Gaps = 46/463 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+H WGGPL  SW++++L LQ +I  R+  LGM  VLPAF+G++P  +  VFP   ++
Sbjct: 212 MGNIHTWGGPLSISWMEKRLSLQLQITERMRSLGMITVLPAFAGHIPEGILRVFPKVTVS 271

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG W +      + C+YLLD  DPLF  IG  F+ Q ++ +G T HIY+ DTF+E +P 
Sbjct: 272 RLGGWSNFNCT--YSCSYLLDPEDPLFQWIGELFLSQMVQSFG-TDHIYSADTFNEMSPT 328

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+S++  AI+  M   D DA+WLMQGWLF  +P FWRP Q KALL+  P+G+++V
Sbjct: 329 SSDPGYLSAVSGAIFKSMAKVDPDAIWLMQGWLFINNPSFWRPAQTKALLHGAPIGRIIV 388

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE  P++ T++ FYG P+IWCML+NF GN  ++G ++ +  GP +A    N+TMVG
Sbjct: 389 LDLFAETVPVYLTTESFYGQPFIWCMLNNFGGNHGLFGNIEGVNRGPFDAAKFPNSTMVG 448

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            G++ EGIEQN ++Y+ M+E+ +  + +++  WI+ YS RRYG+S    + AW +L  +V
Sbjct: 449 TGLTPEGIEQNDMIYEFMNEIGWSSQPINLTKWISNYSDRRYGQSNTDARMAWQILLRSV 508

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           YNCT    + N   +V  P ++ +                                 + Y
Sbjct: 509 YNCTQILHNHNHSPLVRRPSLNMNT-------------------------------DICY 537

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QL 413
           + +++  A      +   L  S T+ YDL+D+TR+A+ +  +E +L I EAY      QL
Sbjct: 538 NKADIYEAWRFMHNASFALGKSATFLYDLVDITREAVQQLVSEYYLEIKEAYGKKSLQQL 597

Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
             A GV        +L+ ++D LL+   GFLLG WL++AK +A
Sbjct: 598 MTAGGVL-----VYDLLPELDSLLSSQPGFLLGSWLKAAKSMA 635


>gi|148671928|gb|EDL03875.1| alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB), isoform
           CRA_a [Mus musculus]
          Length = 538

 Score =  353 bits (905), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 185/467 (39%), Positives = 273/467 (58%), Gaps = 38/467 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP+SW   Q+ LQ +IL R+   GM PVLPAF+G+VP A+  VFP   + 
Sbjct: 1   MGNLHTWDGPLPRSWHLSQVYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVI 60

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W        + C++LL   DP+F  IG  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 61  KLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRELTKEFG-TDHIYGADTFNEMQPP 117

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 118 FSDPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIRAVLEAVPRGRLLV 177

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE  P++  +  F+G P+IWCMLHNF GN  ++G L+ +  GP  AR   N+TMVG
Sbjct: 178 LDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLFGALEDVNRGPQAARLFPNSTMVG 237

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            G++ EGI QN VVY LM+E+ ++ + V D+ AW++ +++RRYG S P    AW +L  +
Sbjct: 238 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSSFAIRRYGVSQPDAVAAWKLLLRS 297

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +  +  NR                          P+ K   L+  T+      +
Sbjct: 298 VYNCSGEACSGHNRS-------------------------PLVKRPSLQMSTA------V 326

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
           WY+ S+V  A  L + +   L+ S  +RYDL+D+TRQA+ +  +  +     AY   +  
Sbjct: 327 WYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQAVQELVSLCYEEARTAYLKQELD 386

Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
            + +     + +L+  +D LLA    FLLG WL+ A++ A +E + +
Sbjct: 387 LLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQARKAAVSEAEAQ 433


>gi|254910995|ref|NP_038820.2| alpha-N-acetylglucosaminidase precursor [Mus musculus]
 gi|20385160|gb|AAM21194.1|AF363242_1 N-acetyl-glucosaminidase [Mus musculus]
 gi|3329361|gb|AAC26842.1| alpha-N-acetylglucosaminidase [Mus musculus]
 gi|33585908|gb|AAH55733.1| Alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB) [Mus
           musculus]
 gi|74211094|dbj|BAE37639.1| unnamed protein product [Mus musculus]
 gi|74218052|dbj|BAE42009.1| unnamed protein product [Mus musculus]
 gi|148671929|gb|EDL03876.1| alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB), isoform
           CRA_b [Mus musculus]
          Length = 739

 Score =  352 bits (903), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 185/467 (39%), Positives = 273/467 (58%), Gaps = 38/467 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP+SW   Q+ LQ +IL R+   GM PVLPAF+G+VP A+  VFP   + 
Sbjct: 202 MGNLHTWDGPLPRSWHLSQVYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVI 261

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W        + C++LL   DP+F  IG  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 262 KLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRELTKEFG-TDHIYGADTFNEMQPP 318

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 319 FSDPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIRAVLEAVPRGRLLV 378

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE  P++  +  F+G P+IWCMLHNF GN  ++G L+ +  GP  AR   N+TMVG
Sbjct: 379 LDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLFGALEDVNRGPQAARLFPNSTMVG 438

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            G++ EGI QN VVY LM+E+ ++ + V D+ AW++ +++RRYG S P    AW +L  +
Sbjct: 439 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSSFAIRRYGVSQPDAVAAWKLLLRS 498

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +  +  NR                          P+ K   L+  T+      +
Sbjct: 499 VYNCSGEACSGHNR-------------------------SPLVKRPSLQMSTA------V 527

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
           WY+ S+V  A  L + +   L+ S  +RYDL+D+TRQA+ +  +  +     AY   +  
Sbjct: 528 WYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQAVQELVSLCYEEARTAYLKQELD 587

Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
            + +     + +L+  +D LLA    FLLG WL+ A++ A +E + +
Sbjct: 588 LLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQARKAAVSEAEAQ 634


>gi|2660688|gb|AAB88084.1| Naglu [Mus musculus]
          Length = 739

 Score =  352 bits (903), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 185/467 (39%), Positives = 273/467 (58%), Gaps = 38/467 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP+SW   Q+ LQ +IL R+   GM PVLPAF+G+VP A+  VFP   + 
Sbjct: 202 MGNLHTWDGPLPRSWHLSQVYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVI 261

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LG+W        + C++LL   DP+F  IG  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 262 KLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRELTKEFG-TDHIYGADTFNEMQPP 318

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 319 FSEPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIRAVLEAVPRGRLLV 378

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE  P++  +  F+G P+IWCMLHNF GN  ++G L+ +  GP  AR   N+TMVG
Sbjct: 379 LDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLFGALEDVNRGPQAARLFPNSTMVG 438

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            G++ EGI QN VVY LM+E+ ++ + V D+ AW++ +++RRYG S P    AW +L  +
Sbjct: 439 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSSFAIRRYGVSQPDAVAAWKLLLRS 498

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +  +  NR                          P+ K   L+  T+      +
Sbjct: 499 VYNCSGEACSGHNRS-------------------------PLVKRPSLQMSTA------V 527

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
           WY+ S+V  A  L + +   L+ S  +RYDL+D+TRQA+ +  +  +     AY   +  
Sbjct: 528 WYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQAVQELVSLCYEEARTAYLKQELD 587

Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
            + +     + +L+  +D LLA    FLLG WL+ A++ A +E + +
Sbjct: 588 LLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQARKAAVSEAEAQ 634


>gi|311267179|ref|XP_003131436.1| PREDICTED: alpha-N-acetylglucosaminidase [Sus scrofa]
          Length = 744

 Score =  352 bits (903), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 191/470 (40%), Positives = 270/470 (57%), Gaps = 48/470 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP+SW  +QL LQ +IL R+   GM PVLPAF+G+VP AL  VFP   +T
Sbjct: 205 MGNLHTWSGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQISVT 264

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+G+W        + C++LL   DPLF  +G  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 265 QMGSWGHFNCS--YSCSFLLAPEDPLFPIVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 321

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 322 SSEPSYLAAATAAVYQAMITVDPDAVWLLQGWLFQHQPQFWGPAQVGAVLGAVPRGRLLV 381

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+S+  GP  AR   N+TM G
Sbjct: 382 LDLFAESQPVYVRTASFLGQPFIWCMLHNFGGNHGLFGALESVNQGPAAARLFPNSTMAG 441

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D+  W+  ++ RRYG S    + AW +L  +
Sbjct: 442 TGMAPEGIGQNEVVYALMAELGWRKDPVADLGTWVTSFAARRYGVSQGDAEAAWRLLLRS 501

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +G T  NR  +V  P +  +                                 +
Sbjct: 502 VYNCSGEGCTGHNRSPLVRRPSLQMATT-------------------------------V 530

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
           WY+ S+V  A  L + +   L++S  +RYDL+D+TRQA+ +  +  +     AY      
Sbjct: 531 WYNQSDVFEAWRLLLKATPTLASSPAFRYDLVDITRQAVQELVSLYYEEARTAYLNKELV 590

Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            L  A G+        EL+  +D +LA    FLLG WLE A+ +A +E +
Sbjct: 591 SLMRAGGILA-----YELLPALDKVLASDSHFLLGSWLEQARGVAVSEAE 635


>gi|444714090|gb|ELW54978.1| Alpha-N-acetylglucosaminidase [Tupaia chinensis]
          Length = 724

 Score =  351 bits (900), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 193/467 (41%), Positives = 277/467 (59%), Gaps = 38/467 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP SW  +QL LQ ++L R+   GM PVLPAF G+VP A+  VFP   +T
Sbjct: 174 MGNLHTWGGPLPHSWHLKQLYLQHRVLDRMRSFGMIPVLPAFPGHVPKAITRVFPQVNVT 233

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG+W        + C++LL   DP+F  IG  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 234 QLGSWGHFNCS--YSCSFLLAPGDPMFPIIGSLFLRELTKEFG-TDHIYGADTFNELQPP 290

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AAIY+ M + D  AVWL+QGW+F + P FW P Q+KA+L +VP G+L+V
Sbjct: 291 SSEPSYLAAATAAIYAAMTAVDPGAVWLLQGWIFQHQPDFWGPAQVKAVLEAVPRGRLLV 350

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++  +  F G P+IWCMLHNF GN  +YG L+++ +GP  AR   N++MVG
Sbjct: 351 LDLFAETRPVYLYTASFLGQPFIWCMLHNFGGNHGLYGTLEAVNWGPKAARLFPNSSMVG 410

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  Y+ RRYG S+   + AW +L  +
Sbjct: 411 TGMAPEGINQNEVVYALMAELGWRKDPVPDLAAWVTSYADRRYGVSLGDAEAAWRLLLRS 470

Query: 299 VYNCTDG-ATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+    +  NR                          P+ K   L+  T+      +
Sbjct: 471 VYNCSGQMCSGHNR-------------------------SPLVKRPSLQMNTT------V 499

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
           WY+ S+V  A  L + +   L+AS T+RYDL+D+TRQA+ +  +  +     AY   +  
Sbjct: 500 WYNRSDVFEAWRLLLTAAPTLAASPTFRYDLLDVTRQAVQELVSLYYEEARTAYLNKELV 559

Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
            + +     + EL+ D+D LLA    F+LG WLE A+ +A +E + +
Sbjct: 560 SLLRAGGILVYELLPDLDNLLATDGRFMLGSWLEQARAVAVSETEAQ 606


>gi|405964692|gb|EKC30145.1| Alpha-N-acetylglucosaminidase [Crassostrea gigas]
          Length = 859

 Score =  350 bits (898), Expect = 9e-94,   Method: Compositional matrix adjust.
 Identities = 177/485 (36%), Positives = 276/485 (56%), Gaps = 31/485 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+HGWGGP+ Q+W+D QL+LQ KIL R+   GM PVLP F+G+VP A    +P A ++
Sbjct: 217 MGNMHGWGGPITQNWIDDQLILQHKILERMRSFGMIPVLPGFAGHVPEATILRYPQANVS 276

Query: 61  QLGNWFSVKSDPRW--------------------CCTYLLDATDPLFIEIGRAFIEQQLK 100
           +L +W        W                    CC YLLD  DPLF++I   FI++   
Sbjct: 277 RLTDWAGFNQSFCWHYPTANVSRLRDWGHFNKTYCCNYLLDFNDPLFMKIAVRFIKEMEN 336

Query: 101 EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW 160
           E+G   H+Y+ DTF+E  P  +S EY++  G  +Y  ++  DS A+WLMQGWLF    FW
Sbjct: 337 EFG-VDHVYSVDTFNEMRPRSNSTEYLALSGRTVYKSLKEADSKAIWLMQGWLFIDGGFW 395

Query: 161 RPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 220
           + PQ+KALL +VP G++++LDL++E+ PI++ ++ +YG P+IWCMLH+F G +E+YG L 
Sbjct: 396 KQPQIKALLTAVPQGEMIILDLYSEIIPIYTQTESYYGQPFIWCMLHDFGGTMELYGALK 455

Query: 221 SIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRR 280
            I  GP   R   N++MVG+GM+ EGI QN VVY+  +E  ++    D+  WI++Y + R
Sbjct: 456 LINEGPFNGRAFPNSSMVGLGMTPEGIFQNEVVYEFFTENVWRKAPRDISTWISKYVLNR 515

Query: 281 YGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE---GKYQN--- 334
           YG++   I  AW  L ++VYN +D   D + + I   PD  PS+        G Y N   
Sbjct: 516 YGKTNKFIDLAWQYLKNSVYNNSDNLKDHDSNAI---PDHRPSLSPALHPDLGIYNNTDY 572

Query: 335 -YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 393
            +   ++         +      +WY+  ++  A ++   + +E S S+ + YD++D+TR
Sbjct: 573 LHDNSINIIVTTLPRMTPLIQQDVWYNPEDLYVAWDIMTLNLDEFSNSSLFMYDIVDVTR 632

Query: 394 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
            +L   + + + +++ A+   D H V     + L L+ DMD +L     FLLG W+++A 
Sbjct: 633 NSLQILSIKYYTDLVYAFGRGDIHAVESHGNQLLGLLSDMDTVLGSDSHFLLGRWIKAAT 692

Query: 454 QLAQN 458
             A +
Sbjct: 693 DNAMD 697


>gi|354485058|ref|XP_003504701.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cricetulus griseus]
 gi|344251941|gb|EGW08045.1| Alpha-N-acetylglucosaminidase [Cricetulus griseus]
          Length = 740

 Score =  350 bits (897), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 187/472 (39%), Positives = 276/472 (58%), Gaps = 48/472 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP+SW  +QL LQ +IL R+   GM PVLPAF+G+VP A+  VFP   + 
Sbjct: 203 MGNLHTWGGPLPRSWHLKQLYLQHRILDRMRAFGMIPVLPAFAGHVPKAITRVFPQVNVF 262

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  P 
Sbjct: 263 QLGSWGHFNCS--YSCSFLLAPGDPVFPLIGSLFLRELIKEFG-TDHIYGADTFNEMQPI 319

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P ++++  AA+Y  M S D DA+WL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 320 SSDPSFLTAATAAVYEAMISVDPDAIWLLQGWLFQHQPQFWGPAQVKAVLQAVPRGRLLV 379

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE  P++  +  FYG P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 380 LDLFAESHPVYMQTASFYGQPFIWCMLHNFGGNHGLFGALEAVNQGPRAARIFPNSTMVG 439

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            G++ EGI QN +VY LM+E+ ++ + V D++ W+++++  RYG S P  + AW +L  +
Sbjct: 440 TGIAPEGIGQNEMVYALMAELGWRKDPVPDLEVWVSRFASHRYGMSHPDAEAAWRLLLRS 499

Query: 299 VYNC-TDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC  +     NR  +V  P +  + I                               +
Sbjct: 500 VYNCPGETYNGHNRSPLVKRPSLQINTI-------------------------------V 528

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ----- 412
           WY+ S+V  A  L + +   L+ S  +RYDL+D+TRQ+L +  +  +     A+      
Sbjct: 529 WYNRSDVFEAWRLLLTAAPNLTTSKAFRYDLLDVTRQSLQELVSLFYEEARIAFMKEELD 588

Query: 413 -LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
            L  A G+  ++R+   L+  +D LLA    FLLG WL  A+ +A +E++ +
Sbjct: 589 LLLRAGGI--ITRK---LLPALDELLASDSRFLLGTWLNQARAMAVSEDEAQ 635


>gi|344285558|ref|XP_003414528.1| PREDICTED: alpha-N-acetylglucosaminidase [Loxodonta africana]
          Length = 744

 Score =  349 bits (896), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 193/470 (41%), Positives = 271/470 (57%), Gaps = 48/470 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP+SW  +QL LQ +IL R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 205 MGNLHSWGGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKAVTRVFPQVNVT 264

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+G+W        + C++LL   DP+F  IG  F+ +   E+G T HIY  DTF+E  PP
Sbjct: 265 QMGSWGHFNCS--YSCSFLLAPGDPMFPIIGSLFLRELTTEFG-TDHIYGADTFNEMQPP 321

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G L+V
Sbjct: 322 SSEPSYLAAATAAVYEAMITVDPDAVWLLQGWLFQHQPQFWGPAQVGAVLGAVPRGHLLV 381

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 382 LDLFAETQPVYIRTASFQGQPFIWCMLHNFGGNHGLFGTLETVNQGPAAARLFPNSTMVG 441

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  ++ RRYG      + AW +L  +
Sbjct: 442 TGMAPEGIGQNEVVYALMAELGWRKDPVPDLGAWVASFAARRYGGIHQDAETAWRLLLRS 501

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +  +  NR                          P+ K   L+  T+      +
Sbjct: 502 VYNCSGESCSGHNRS-------------------------PLVKRPSLQMNTT------V 530

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
           WY+ S+V  A  L +A+   L+AS  +RYDL+D+TRQA  +  +  +  +  AY      
Sbjct: 531 WYNRSDVFEAWRLLLATTPALAASPAFRYDLLDVTRQAAQELVSFYYGEVRTAYLNKELV 590

Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            L  A GV        EL+  +D +LA    FLLG WLE A+  A +E +
Sbjct: 591 HLLRAGGVLA-----YELLPALDEVLASDSRFLLGSWLEQARVAAVSEAE 635


>gi|414585094|tpg|DAA35665.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
          Length = 1202

 Score =  348 bits (894), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 167/269 (62%), Positives = 209/269 (77%), Gaps = 6/269 (2%)

Query: 201 YIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEM 260
           Y WCMLHNFA + EMYG+LD++A GP++AR S+N+TMVGVGMSMEGIEQNP+VYDLMSEM
Sbjct: 348 YNWCMLHNFAADFEMYGVLDALASGPIDARLSDNSTMVGVGMSMEGIEQNPIVYDLMSEM 407

Query: 261 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDV 320
           AF H +VD++ W+  Y  RRYG+ V  +QDAW +LY T+YNCTDG  DKNRDVIVAFPDV
Sbjct: 408 AFHHRQVDLQVWVKTYPTRRYGKPVKGLQDAWWILYRTLYNCTDGKNDKNRDVIVAFPDV 467

Query: 321 DPSIISVTEGKYQN----YGKPVSKEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASG 375
           +P +I+ T G + N    Y    SK  + K  +S +Y+HPHLWY T+ VI ALELF+  G
Sbjct: 468 EPFVIA-TPGLHVNTRQMYSTVPSKNYIRKDVSSDAYEHPHLWYDTNAVIHALELFLQHG 526

Query: 376 NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDG 435
           +E+S SNT+RYDL+DLTRQ LAKYAN++FL IIE+Y+ N+ + V  L + FL LV D+D 
Sbjct: 527 DEVSDSNTFRYDLVDLTRQVLAKYANDVFLKIIESYKSNNMNQVTILCQHFLSLVNDLDT 586

Query: 436 LLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
           LL+ H+GFLLGPWLESAK LA+N EQE Q
Sbjct: 587 LLSSHEGFLLGPWLESAKGLARNSEQEIQ 615



 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 85/110 (77%), Positives = 101/110 (91%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+HGWGGPLPQ+WLD QLVLQKKIL R+Y  GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 240 MANMHGWGGPLPQTWLDDQLVLQKKILSRMYSFGMFPVLPAFSGNIPAALKSKFPSAKVT 299

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYN 110
            LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYGRTSHIYN
Sbjct: 300 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFIEEQIREYGRTSHIYN 349


>gi|395532374|ref|XP_003768245.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Sarcophilus
           harrisii]
          Length = 726

 Score =  348 bits (894), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 189/470 (40%), Positives = 272/470 (57%), Gaps = 38/470 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPL  SW  +Q  LQ +IL R+   GM PVLPAF+G+VP A   VFP A +T
Sbjct: 187 MGNLHSWGGPLSSSWHRKQSSLQYQILERMRSFGMKPVLPAFAGHVPKAFTRVFPQAYVT 246

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W        + C+YLL   DPLF  +G  F+ +  +E+G T HIY+ DTF+E  PP
Sbjct: 247 HLGMWGHFNCT--YSCSYLLAPEDPLFPVVGSLFLRELTQEFG-TDHIYSADTFNEMEPP 303

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW+PPQ+KA+L +VPLG+L+V
Sbjct: 304 SSEPAYLAAATAAVYEAMIAVDVDAVWLLQGWLFQHQPDFWKPPQVKAVLKAVPLGRLLV 363

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL+AE KP++S +  FYG P+IWCMLHNF GN  ++G LD++  GP +A    N+T VG
Sbjct: 364 LDLYAESKPVYSRTDSFYGQPFIWCMLHNFGGNHGLFGALDAVNRGPSDAWLFPNSTFVG 423

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            G+  EGI QN VVY LM+E+ +Q   + D+ AW+  ++ +RYG      + AW +L  +
Sbjct: 424 TGIVPEGINQNEVVYALMAELGWQKGPLPDLGAWVAGFAAQRYGTPHSHAEAAWKLLLQS 483

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ D  T  NR  +V  P +   I                                +
Sbjct: 484 VYNCSGDLCTGHNRSPLVKRPSLHLDI-------------------------------SV 512

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
           WY+ S+V  A  L + +   L++S  +RYDL+D+TRQ   +  +  +  +  A++     
Sbjct: 513 WYNRSDVFEAWRLLLEAAPVLASSPAFRYDLLDVTRQVAQELVSLYYEELRTAFEAGAMP 572

Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVR 466
            +       + +L+  +D LLA  + FLLG WLE A+++A +E +  Q +
Sbjct: 573 ALLTAGGLLVFDLLPSLDELLASDERFLLGAWLEQAREMAVSEAEAWQYK 622


>gi|291406137|ref|XP_002719212.1| PREDICTED: alpha-N-acetylglucosaminidase [Oryctolagus cuniculus]
          Length = 743

 Score =  348 bits (892), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 189/458 (41%), Positives = 268/458 (58%), Gaps = 48/458 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP+SW  +QL LQ +IL R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 204 MGNLHTWAGPLPRSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAVTRVFPHINVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG+W        + C++LL   DP+F  IG  F+ +  +E+G T H+Y  DTF+E  PP
Sbjct: 264 QLGSWGHFNCS--YSCSFLLAPEDPMFPLIGSLFLRELTREFG-TDHVYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA++  M + D DAVWL+QGWLF + P FW P Q+KA+LN+VP G+L+V
Sbjct: 321 SSEPSYLAAATAAVFEAMIAVDPDAVWLLQGWLFQHQPQFWGPSQVKAVLNAVPRGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 381 LDLFAENQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMVG 440

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            G++ EGI QN VVY LM+E+ ++ E V D++AW+  ++ RRYG + P    AW +L  +
Sbjct: 441 TGIAPEGISQNEVVYALMAELGWRKEPVPDLEAWVTSFAGRRYGVAHPDAGAAWRLLLRS 500

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ D     NR  +V  P                          L+  T+      +
Sbjct: 501 VYNCSGDACRGHNRSPLVRRPS-------------------------LQLNTT------V 529

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
           WY+ S+V  A  L + +   L++S  +RYDL+D+TRQA+ +  +  +     AY      
Sbjct: 530 WYNRSDVFEAWRLLLKATPTLASSPAFRYDLLDVTRQAVQELVSLYYEEARTAYLHKELA 589

Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 449
            L  A GV        EL+  +D +LA    FLLG WL
Sbjct: 590 TLLRAGGVLA-----YELLPALDRVLATDSRFLLGSWL 622


>gi|196001339|ref|XP_002110537.1| hypothetical protein TRIADDRAFT_54660 [Trichoplax adhaerens]
 gi|190586488|gb|EDV26541.1| hypothetical protein TRIADDRAFT_54660 [Trichoplax adhaerens]
          Length = 757

 Score =  347 bits (890), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 181/473 (38%), Positives = 269/473 (56%), Gaps = 35/473 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  W GPLP  W+++Q+ LQ KIL R+ + GM P+LPAF+GN+P AL  ++P AKI 
Sbjct: 217 MGNIQRWAGPLPHDWINKQITLQVKILDRMRKYGMLPILPAFNGNIPNALTKIYPKAKIV 276

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +   WF      R+  T LLD  D LFI I + FIE+++K YG T H+Y+ D F+E  P 
Sbjct: 277 KSSPWFGFSK--RYGETALLDPRDKLFIVISKLFIEEEIKAYG-TDHLYSLDLFNEIDPQ 333

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
               EY++++  + Y  + S D+ AVW+MQGW+F  D + W   +++A L+ +P G++V+
Sbjct: 334 SKELEYLTAVSKSAYLALNSADTKAVWIMQGWMFYNDNYYWENKRIQAFLSPIPKGRIVI 393

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAEV+P +  S  F+G P+IWCML+NF GN  MYG  ++I  G + A   +N+TM+G
Sbjct: 394 LDLFAEVEPQYHRSNSFFGHPFIWCMLNNFGGNAGMYGTFETITEGAISAYDMKNSTMIG 453

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            GM+ EGI  N ++YDLM+EM ++   VDV+ W+  Y+ RRYG     I  AW  L  TV
Sbjct: 454 TGMAPEGIGNNYIMYDLMAEMGWRKIAVDVRDWVVVYTERRYGGLDENIIKAWLRLSETV 513

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           YNC D      R    A P V PS+    +                           +WY
Sbjct: 514 YNCNDMRQYHCR----ALPAVRPSLKIAND---------------------------VWY 542

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
           S  ++  A E  + + NE  +  T++YD++D+TRQAL + A  ++  + + Y  N+   +
Sbjct: 543 SADDIFFAWEHMLRANNEFISEETFQYDIVDVTRQALQELAFIMYKKVTQCYHDNNQETL 602

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVRCPYVSQ 472
                  +EL  DMD LL  +  FLLG W+  A Q + N   ++Q+R   ++Q
Sbjct: 603 KTAGGELIELFTDMDTLLGTNSHFLLGRWVADALQHSNNISIKQQLRFNALNQ 655


>gi|351699889|gb|EHB02808.1| Alpha-N-acetylglucosaminidase, partial [Heterocephalus glaber]
          Length = 652

 Score =  347 bits (889), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 185/459 (40%), Positives = 271/459 (59%), Gaps = 36/459 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPLP +W  +QL LQ ++L R+  LGM PVLPAF+G+VP A+  VFP   +T
Sbjct: 114 MGNLHGWGGPLPHAWHLKQLYLQHRVLDRMRALGMTPVLPAFAGHVPKAVTRVFPQVNVT 173

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG+W        + C++LL   DPLF  IG  F+ +  +E+G T H Y  DTF+E  PP
Sbjct: 174 QLGSWGHFNCS--YSCSFLLAPGDPLFPLIGSLFLRELNREFG-TDHFYGADTFNEMQPP 230

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 231 SSEPAYLAAATAAVYEAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVGAVLGAVPQGRLLV 290

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+T+VG
Sbjct: 291 LDLFAENQPVYTRTASFGGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTVVG 350

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            G++ EGI QN VVY LM+E+ ++ + V D+ AW+ +++ +RYG + P    AW +L H+
Sbjct: 351 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLSAWVARFAEQRYGVAQPDAVLAWRLLLHS 410

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VYNC+  A   +                           P+ +   L+  T+      +W
Sbjct: 411 VYNCSGEACRGHNH------------------------SPLVRRPSLQMNTT------VW 440

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
           Y+ S+V  A  L + +   L+AS  +RYDL+D+TRQ L +  +  +     AY   +  G
Sbjct: 441 YNRSDVFEAWRLLLKATPNLTASPAFRYDLLDVTRQGLQELVSLYYEEARAAYMRQELEG 500

Query: 419 VFQLSRRF-LELVEDMDGLLACHDGFLLGPWLESAKQLA 456
           + +       +L+  +D +LA    FLLG WLE A+ +A
Sbjct: 501 LLRAGGVLAYKLLPALDEVLASDHRFLLGSWLEQARAVA 539


>gi|156545487|ref|XP_001606979.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Nasonia vitripennis]
          Length = 755

 Score =  346 bits (888), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 181/457 (39%), Positives = 264/457 (57%), Gaps = 35/457 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N  GWGGPL Q+W +  + LQ  I+ R+ ELG+ PVLPAF+G+VP     VFP A +T
Sbjct: 221 MGNFRGWGGPLSQAWHNHTIQLQHSIVRRMRELGITPVLPAFAGHVPRDFIRVFPEANVT 280

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++ +W   +   ++CC Y LD TDPLF  +GR F++    E+G T+HIYNCD+F+EN P 
Sbjct: 281 KVVSWNGFED--QYCCPYSLDPTDPLFKTVGREFLKAYTDEFG-TNHIYNCDSFNENDPH 337

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
               +Y+S+ G AIYSGM   D DA+WLMQGWLF +   FW  P++KA + SVP+GK+++
Sbjct: 338 TGDLDYLSNTGKAIYSGMTGADPDAIWLMQGWLFVHSEYFWTFPRVKAFVTSVPIGKMII 397

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL +E  P +     ++G P+IWCMLHNF G + M+G    I  G  EART+  +TM+G
Sbjct: 398 LDLQSEQFPQYKRFHSYFGQPFIWCMLHNFGGTLGMFGSAGVINKGVFEARTTNGSTMIG 457

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            G++ EGI QN V+Y+ M+EM+++ + V +  W   Y+VRRYG++  +I+ +W  L   +
Sbjct: 458 TGLTPEGINQNYVIYEFMNEMSYRKKPVVLDNWFENYAVRRYGQADESIRTSWQELGREL 517

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           YN  DG T                      G Y      ++K   L  E      P  WY
Sbjct: 518 YNY-DGKTK-------------------IRGHY-----VITKRPSLNIE------PWYWY 546

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
                +     F+ +GN    +  +++DL+D+TRQAL   A+ ++ +I  AY   +   +
Sbjct: 547 DLKTFLAVWNSFVHAGNGTMKNELFKHDLVDITRQALQITADFIYADIKAAYTQKNLTQL 606

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
              S   L+L +D++  LA    FLLG WLE AK +A
Sbjct: 607 QIASSHLLDLFDDLEKNLASSKDFLLGSWLEDAKAIA 643


>gi|307192254|gb|EFN75548.1| Alpha-N-acetylglucosaminidase [Harpegnathos saltator]
          Length = 741

 Score =  346 bits (888), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 175/464 (37%), Positives = 267/464 (57%), Gaps = 35/464 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPL  +W ++ + LQ +IL R+ +LG+ PVLPAF+G+VP A   +FP+A +T
Sbjct: 194 MGNIRGFGGPLSINWHERTVRLQHRILRRMRDLGIVPVLPAFAGHVPRAFARLFPNANMT 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W   K + ++CC YLL+ TDPLF  IG  F+   + E+G T HIYNCDTF+EN P 
Sbjct: 254 KIEPW--NKFEDKYCCPYLLEPTDPLFQTIGEKFLRMYINEFG-TDHIYNCDTFNENEPG 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
                Y+S++G +++  M + D  A+WLMQGWLF +D  FW  P++++ L SVP G+++V
Sbjct: 311 NSELAYLSNVGRSVFQAMSTVDPQAIWLMQGWLFVHDFIFWTEPRVRSFLTSVPTGRMLV 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL +E  P +   K +YG P+IWCMLHNF G + M+G    I     E R    +TMVG
Sbjct: 371 LDLQSEQFPQYGRLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINQRTFEGRHMNGSTMVG 430

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            G++ EGI QN V+Y+LM+EMA++HE VD+ AW   Y+ RRYG        AW  L  T+
Sbjct: 431 TGLTPEGINQNYVIYELMNEMAYRHEPVDLDAWFESYATRRYGAWNEYAVAAWKHLGRTI 490

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           YN       +   VI   P ++ S                               P +WY
Sbjct: 491 YNFVGIERIRGHYVITRRPSLNIS-------------------------------PWVWY 519

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
           +  +      +F+ +      +  YR+D++D+TRQAL   A+ +++N+++ Y+  +  G 
Sbjct: 520 NREDFYHTWNVFLKARYGRGNNTLYRHDVVDITRQALQLMADNIYMNVVDCYKRKNITGF 579

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
              +   L+L +D++ +LA    FLLG WL  AK +A +E++ +
Sbjct: 580 QSHAAALLDLFDDIEAILASGSNFLLGTWLAQAKDMAVDEKERQ 623


>gi|403304646|ref|XP_003942904.1| PREDICTED: alpha-N-acetylglucosaminidase [Saimiri boliviensis
           boliviensis]
          Length = 754

 Score =  346 bits (888), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 188/464 (40%), Positives = 276/464 (59%), Gaps = 36/464 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP++W  +QL LQ +IL R+   GM PVLPAFSG+VP A+  VFP   +T
Sbjct: 214 MGNLHTWDGPLPRAWHIKQLYLQHRILDRMRSFGMIPVLPAFSGHVPRAINRVFPRVNVT 273

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+G+W        + C++LL   DP+F  +G  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 274 QMGSWGHFNCS--YSCSFLLAPEDPIFPILGSLFLRELTKEFG-TDHIYGADTFNEMQPP 330

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D+DAVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 331 SSEPSYLAAATAAVYEAMIAVDTDAVWLLQGWLFQHQPQFWGPAQVRAVLGAVPRGRLLV 390

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 391 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPEAARLFPNSTMVG 450

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E++++ + V D+ AW+  ++ +RYG S P    AW +L  +
Sbjct: 451 TGMAPEGINQNEVVYSLMAELSWRKDPVPDLAAWVTSFATQRYGVSHPDAGAAWRLLLRS 510

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VYNC+  A   +                           P+ +   L+  T+      +W
Sbjct: 511 VYNCSGEACRGHNH------------------------SPLVRRPSLQMNTT------VW 540

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
           Y+ S+V  A  L +++   L+AS T+RYDL+D+TRQA+ +     +     AY   + H 
Sbjct: 541 YNRSDVFEAWRLLLSAAATLAASPTFRYDLLDVTRQAVQELVGLYYEEARSAYLSKELHS 600

Query: 419 VFQLSRRF-LELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
           + +       EL+  +D +LA    FLLG WLE A+ +A +E +
Sbjct: 601 LLRAGGILAYELLPALDEVLASDSHFLLGSWLEQARAVAVSEAE 644


>gi|431890602|gb|ELK01481.1| Alpha-N-acetylglucosaminidase [Pteropus alecto]
          Length = 740

 Score =  344 bits (883), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 186/465 (40%), Positives = 270/465 (58%), Gaps = 38/465 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP SW  +QL LQ +IL R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 201 MGNLHTWGGPLPFSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVT 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+ +W        + C++LL   DPLF  +G  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 261 QMDSWGHFNCS--YSCSFLLAPEDPLFPIVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 317

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 318 SSEPSYLAAATAAVYQAMTTVDPDAVWLLQGWLFQHQPQFWGPAQVGAVLGAVPRGRLLV 377

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 378 LDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNQGPAAARLFPNSTMVG 437

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI+QN VVY LM+E+ ++ + V D+ AW+  ++ RRYG S    + AW +L  +
Sbjct: 438 TGMAPEGIDQNEVVYALMAELGWRKDPVTDLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 497

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +     N   +V      PS+  VT                            +
Sbjct: 498 VYNCSGEDCRGHNHSPLVR----RPSLQMVTT---------------------------V 526

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
           WY+ S+V  A  + + +   L+ S  + Y+L+D+TRQA+ +  +  +  +  AY   D  
Sbjct: 527 WYNQSDVFEAWRMLLTATPTLATSPLFSYELVDITRQAIQELVSLYYEEVRTAYLNKDLV 586

Query: 418 GVFQLSRRF-LELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            +F+ +     EL+  +D +LA    FLLG WLE A+  A ++ +
Sbjct: 587 TLFRAAGILAYELLPSLDNILATDSHFLLGSWLEQARAAAVSKAE 631


>gi|328778968|ref|XP_623833.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Apis mellifera]
          Length = 752

 Score =  343 bits (881), Expect = 9e-92,   Method: Compositional matrix adjust.
 Identities = 176/465 (37%), Positives = 272/465 (58%), Gaps = 37/465 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPL  +W D+ + LQ +IL R+  LG+ PVLPAF+G+VP AL  +FP A +T
Sbjct: 198 MGNMRGFGGPLNSNWHDKSIRLQHRILERMRALGIIPVLPAFAGHVPRALLKLFPKANVT 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +   W +     ++CC YLL+ TDPLF +IG+ F++  ++E+G T H+YNCDTF+EN P 
Sbjct: 258 KSAVWNNFSD--KYCCPYLLEPTDPLFKQIGQQFLKTYIEEFG-TDHVYNCDTFNENEPY 314

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
               +++ ++G +I+  M S DS A+WLMQGWLF +D  FW  P+ +  L SVPLG+++V
Sbjct: 315 TSELKFLRNIGHSIFEAMNSVDSKAIWLMQGWLFYHDSVFWTEPRTRTFLTSVPLGRMIV 374

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL +E  P +     +YG P+IWCMLHNF G + M+G  + I     EAR    +TMVG
Sbjct: 375 LDLQSEQFPQYKRLNSYYGQPFIWCMLHNFGGTLGMFGSAEIINHRVFEARNMNGSTMVG 434

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPAIQDAWNVLYH 297
            G++ EGI QN V+Y+LM+EMA++ + V++  W   ++ RRYG  +       AW    +
Sbjct: 435 TGLTPEGINQNYVIYELMNEMAYRKKPVNLDKWFENFANRRYGDIKGNEHTVTAWKGFKN 494

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           TVYN +D    + + VI   P+++                                 P  
Sbjct: 495 TVYNFSDTRRIRGKYVITIRPNLNF-------------------------------FPWR 523

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
           WY+    I    + + + +    S  YR+D++D+TRQAL   A+E++ ++IE++   +  
Sbjct: 524 WYNKDAFIYYWYVLLQARDLKRNSTLYRHDVVDVTRQALQLIADEIYTDLIESFNKKNID 583

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
              Q ++  L L +D++ +LA  + FLLG WL+ AK LA ++E+E
Sbjct: 584 LFKQNAKLLLALFDDLEEILASSEDFLLGKWLKMAKDLATDDEEE 628


>gi|426238065|ref|XP_004012978.1| PREDICTED: alpha-N-acetylglucosaminidase isoform 1 [Ovis aries]
          Length = 748

 Score =  343 bits (880), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 192/469 (40%), Positives = 269/469 (57%), Gaps = 46/469 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ +IL R+   GM PVLPAF+G+VP AL  VFP   +T
Sbjct: 211 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 270

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+G+W        + C++LL   DPLF  +G  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 271 QMGSWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 327

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D  AVWL+QGWLF   P FW P Q+ A+L +VP G+L+V
Sbjct: 328 SSEPSYLAAATAAVYQAMTAVDPGAVWLLQGWLFQNQPEFWGPAQVAAVLGAVPRGRLLV 387

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+S+  GP  AR   N+T+VG
Sbjct: 388 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPATARRFPNSTLVG 447

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  ++ RRYG S    + AW +L  +
Sbjct: 448 TGMAPEGIGQNEVVYALMAELGWRKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 507

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VYNC                       S  E +  N+  P+ K   L   T+      +W
Sbjct: 508 VYNC-----------------------SGEECRGHNH-SPLVKRPSLHMVTT------VW 537

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 412
           Y+ S+V  A  L + +   L++S  +RYDL+D+TRQA+ +  +  +  +  AY       
Sbjct: 538 YNRSDVFEAWRLLLTATPTLASSPAFRYDLVDVTRQAVQELVSLYYEEMRTAYLKKELVP 597

Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
           L  A G+        EL+  +D +LA    FLLG WLE A+  A +E +
Sbjct: 598 LMRAGGILA-----YELLPALDQVLASDCHFLLGSWLEQARLAAVSETE 641


>gi|426238067|ref|XP_004012979.1| PREDICTED: alpha-N-acetylglucosaminidase isoform 2 [Ovis aries]
          Length = 739

 Score =  343 bits (879), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 192/469 (40%), Positives = 269/469 (57%), Gaps = 46/469 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ +IL R+   GM PVLPAF+G+VP AL  VFP   +T
Sbjct: 202 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 261

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+G+W        + C++LL   DPLF  +G  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 262 QMGSWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 318

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D  AVWL+QGWLF   P FW P Q+ A+L +VP G+L+V
Sbjct: 319 SSEPSYLAAATAAVYQAMTAVDPGAVWLLQGWLFQNQPEFWGPAQVAAVLGAVPRGRLLV 378

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+S+  GP  AR   N+T+VG
Sbjct: 379 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPATARRFPNSTLVG 438

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  ++ RRYG S    + AW +L  +
Sbjct: 439 TGMAPEGIGQNEVVYALMAELGWRKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 498

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VYNC                       S  E +  N+  P+ K   L   T+      +W
Sbjct: 499 VYNC-----------------------SGEECRGHNH-SPLVKRPSLHMVTT------VW 528

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 412
           Y+ S+V  A  L + +   L++S  +RYDL+D+TRQA+ +  +  +  +  AY       
Sbjct: 529 YNRSDVFEAWRLLLTATPTLASSPAFRYDLVDVTRQAVQELVSLYYEEMRTAYLKKELVP 588

Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
           L  A G+        EL+  +D +LA    FLLG WLE A+  A +E +
Sbjct: 589 LMRAGGILA-----YELLPALDQVLASDCHFLLGSWLEQARLAAVSETE 632


>gi|380030624|ref|XP_003698943.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
           [Apis florea]
          Length = 769

 Score =  342 bits (876), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 173/465 (37%), Positives = 269/465 (57%), Gaps = 37/465 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPL  +W ++ + LQ +IL R+  LG+ PVLPAF+G+VP A   +FP A +T
Sbjct: 216 MGNMRGFGGPLSSNWHEKSIRLQHRILERMRALGIIPVLPAFAGHVPRAFLRLFPKANVT 275

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +   W +     ++CC YLL+  DPLF +IG+ F++  ++E+G T H+YNCDTF+EN P 
Sbjct: 276 KSAVWNNFSD--KYCCPYLLEPMDPLFKQIGQQFLKTYIEEFG-TDHVYNCDTFNENEPY 332

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
               +++ ++G +I+  M + DS A+WLMQGWLF +D  FW  P+ +  L S+PLG+++V
Sbjct: 333 TSELKFLRNIGHSIFEAMSNVDSKAIWLMQGWLFYHDSVFWTEPRTRTFLTSIPLGRMIV 392

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL +E  P +     +YG P+IWCMLHNF G + M+G  + I     EAR    +TMVG
Sbjct: 393 LDLQSEQFPQYKRLNSYYGQPFIWCMLHNFGGTLGMFGSAEIINHRIFEARNMNGSTMVG 452

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPAIQDAWNVLYH 297
            G++ EGI QN V+Y+LM+EMA++   V++  W   Y+ RRYG  +       AW    +
Sbjct: 453 TGLTPEGINQNYVIYELMNEMAYRKRPVNLDKWFENYANRRYGDTKGNEHTVTAWKGFKN 512

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           TVYN +D    + +  I   P+++ S                               P  
Sbjct: 513 TVYNFSDTRRIRGKYAITIRPNLNFS-------------------------------PWR 541

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
           WY+    I    + + + +    S  YR+D++D+TRQAL   A+E++ ++IE++   +  
Sbjct: 542 WYNKDAFIHYWYMLLQARDLKRNSTLYRHDVVDVTRQALQLIADEIYTDLIESFNKKNID 601

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
              Q ++  L L +D++ +LA  + FLLG WL+ AK LA N+E+E
Sbjct: 602 LFKQNAKLLLALFDDLEEILASSEDFLLGKWLKMAKDLATNDEEE 646


>gi|307168312|gb|EFN61518.1| Alpha-N-acetylglucosaminidase [Camponotus floridanus]
          Length = 737

 Score =  341 bits (875), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 174/464 (37%), Positives = 262/464 (56%), Gaps = 35/464 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPL  +W ++ + LQ +IL R+  LG+ PVLPAF+G+VP A   +FP+A +T
Sbjct: 221 MGNIRGFGGPLSTNWHNRTIHLQHQILRRMRNLGIVPVLPAFAGHVPRAFARLFPNANMT 280

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W + +   ++CC YLL+ TDPLF  IG  F+   + E+G T HIYNCDTF+EN P 
Sbjct: 281 KINPWNNFED--KYCCPYLLEPTDPLFQIIGEKFLRMYINEFG-TDHIYNCDTFNENEPG 337

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
                Y+ ++  A+++ + + DS A+WLMQ WLF +D  FW  P++K+ L SVP+G++++
Sbjct: 338 STELIYLRNVSHAVFAAINAVDSKAIWLMQAWLFVHDFMFWTEPRVKSFLTSVPMGRMLI 397

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL +E  P +   K +YG P+IWCMLHNF G + M+G    I     E R    +TMVG
Sbjct: 398 LDLQSEQFPQYGRLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINQRTFEGRNMNGSTMVG 457

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            G++ EGI QN V+Y+LM+EMA++HE VD+ AW   Y+ RRYG         W  L  TV
Sbjct: 458 TGLTPEGINQNYVIYELMNEMAYRHEPVDLDAWFQNYATRRYGAWNEYAVTTWQYLGRTV 517

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           YN       +   V+   P ++ S+                                +WY
Sbjct: 518 YNFIGSQRIRGHYVVTRRPSLNISL-------------------------------WIWY 546

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
           +          F+ + +    S  YR+D++D+TRQAL    ++L+  I+++Y+  +    
Sbjct: 547 NRKNFYSMWNTFLKARHGRRNSTLYRHDVVDITRQALQLMGDDLYTIILDSYKKRNITAF 606

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
              +   LEL +D++ +LA    FLLG WL  AK +A NEE+ K
Sbjct: 607 RSSANALLELFDDLESILASGSNFLLGTWLSQAKDVATNEEERK 650


>gi|256422141|ref|YP_003122794.1| alpha-N-acetylglucosaminidase [Chitinophaga pinensis DSM 2588]
 gi|256037049|gb|ACU60593.1| Alpha-N-acetylglucosaminidase [Chitinophaga pinensis DSM 2588]
          Length = 728

 Score =  341 bits (874), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 177/457 (38%), Positives = 266/457 (58%), Gaps = 40/457 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  WGGPLPQ W D   VLQ++IL     +GM P+LPAF+G+VP A ++ +P+ +I 
Sbjct: 195 MGNIDAWGGPLPQHWKDSHKVLQQQILAAERSMGMLPILPAFTGHVPPAFKDKYPN-EIV 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +  NW     D  +   Y+LD   P+F +IG+ F+E Q K +G T H Y+ DTF+EN PP
Sbjct: 254 KPTNW-----DAGFPDVYILDPNSPMFDKIGKKFLEAQTKAFG-TDHFYSADTFNENVPP 307

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
                ++ ++   +Y+ M + D  AVW+MQGW+F Y+  +W  PQ++ALLN+VP   ++V
Sbjct: 308 SSDSSFLDAMSRKVYASMAAADPKAVWVMQGWMFHYNASYWHQPQIRALLNAVPDDHMIV 367

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMV 238
           LDL++E  P W  ++ +YG P+IW MLHNF GN  M+G +D+ A  P  A     +  M 
Sbjct: 368 LDLYSESHPEWRNTQAYYGKPWIWNMLHNFGGNTGMWGGMDAAAHDPATALHDPASGKMS 427

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           G+G++ EGIEQNP +Y LM +  ++ + ++V  W+  Y+ +RYG    A+  AW +LYHT
Sbjct: 428 GIGLTPEGIEQNPALYQLMIDNVWRDQPINVDTWLQSYAKQRYGAENEAVNKAWQILYHT 487

Query: 299 VY--NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
           VY    T+GA +    +IVA P +D                 ++ E V            
Sbjct: 488 VYIGGPTEGAPES---IIVARPTLD-----------------IAAERV---------KTK 518

Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
           L Y  ++V+ A +LFI +  +L  +  ++YDL+DLTRQ L  YA+ L   +  AY+  D 
Sbjct: 519 LEYDPAKVVPAWDLFINAAAQLKPTEGFKYDLVDLTRQVLGNYASPLQQRVATAYRNKDL 578

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
               Q S +F+ L++DMD LL   +GFLLG W+  A+
Sbjct: 579 AAFKQYSTQFIGLLDDMDMLLGTQEGFLLGKWVSDAR 615


>gi|198433857|ref|XP_002122480.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 880

 Score =  340 bits (871), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 189/455 (41%), Positives = 263/455 (57%), Gaps = 37/455 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPLP  W+  QL+LQ +IL+R+  LGM PVLP F+G++P+A+ N++P A + 
Sbjct: 211 MGNLHGWGGPLPSFWIKSQLILQHQILIRMRSLGMIPVLPGFAGHIPSAILNLYPKADVI 270

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QL +W        + CTYLL   DPLF  IG  FI++Q+ EY  T+HIYN DTF+E TPP
Sbjct: 271 QLSHWSHFNCT--YSCTYLLQPHDPLFNTIGSMFIKEQMLEYNGTNHIYNADTFNEMTPP 328

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+S+   A+Y  M   D DAVWLMQGWLF ++P FW+  Q KALL  VP GK++V
Sbjct: 329 SSDPGYLSNASRAVYDAMAVADPDAVWLMQGWLFHHEPTFWKTAQKKALLTGVPKGKMLV 388

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLF+E  P +     ++G P++WCMLH+F GN+  YG ++++   P  A TS N+TMVG
Sbjct: 389 LDLFSESYPQY-LPDWYFGQPFLWCMLHDFGGNMGFYGKINTVNTQPGIALTSVNSTMVG 447

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            G++ EGI QN ++YD M E  F    V+V  W+ +Y++RRY  S P     WN+L +T+
Sbjct: 448 TGVTPEGINQNYMIYDFMLETGFTVHSVNVTNWLKEYTMRRYNTSSPEAIKTWNILGNTI 507

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL-- 357
           YN          D    FP    S+I          G PV +  +        D+P L  
Sbjct: 508 YN----------DTKPGFP--SKSLIR---------GSPVKRPTL--------DNPGLPY 538

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
           WY  S +  A + F  S N L    T RYD +D+TRQ L      L+  ++E +      
Sbjct: 539 WYQYSSLALAWDNFSQSLNTLKDLETVRYDAVDITRQMLQAVHRLLYYAMVEEFLWKRDP 598

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 452
           G  +L  + L+L++D D +L     F +G W++ A
Sbjct: 599 G--KLGEQLLDLLDDFDKMLCSDAHFSMGKWIQDA 631



 Score = 38.9 bits (89), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 48/96 (50%), Gaps = 2/96 (2%)

Query: 367 ALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRF 426
           A + F  S N L    T RYD +D+TRQ L      L+  ++E +      G  +L  + 
Sbjct: 680 AWDNFSQSLNTLKDLETVRYDAVDITRQMLQAVHRLLYYAMVEEFLWKRDPG--KLGEQL 737

Query: 427 LELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
           L+L++D D +L     F +G W++ AK L    E++
Sbjct: 738 LDLLDDFDKMLCSDAHFSMGKWIQDAKILGTTAEEK 773


>gi|325103828|ref|YP_004273482.1| alpha-N-acetylglucosaminidase [Pedobacter saltans DSM 12145]
 gi|324972676|gb|ADY51660.1| Alpha-N-acetylglucosaminidase [Pedobacter saltans DSM 12145]
          Length = 738

 Score =  340 bits (871), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 182/467 (38%), Positives = 264/467 (56%), Gaps = 37/467 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  WGGPLPQSW+D    LQKKIL R  ELGM PVLPAF+G+VP +    FP AK+ 
Sbjct: 195 MNNMDAWGGPLPQSWIDSHKDLQKKILARQRELGMIPVLPAFTGHVPKSFVKKFPEAKVD 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
                 SV     +   Y+L+  DP+F +IG  F+++Q +EYG T H Y+ D F+E  PP
Sbjct: 255 ------SVNWQGNFPNIYMLNPNDPMFSKIGEQFLKEQTREYG-TDHYYSSDIFNELNPP 307

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY---DPFWRPPQMKALLNSVPLGKL 177
              P+Y+  +   +YS M+  D  +VW+MQ WLF       FW P +M+A L  VP  KL
Sbjct: 308 SSDPKYLYDISEKVYSSMKKVDPKSVWVMQAWLFVSAHGRKFWTPERMQAFLKPVPDDKL 367

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTT 236
           ++LDL+ E +P W  ++ +YG  ++W MLHNF GNI ++G   +IA  P    +      
Sbjct: 368 IILDLYTENRPRWKNTEGYYGKKWVWNMLHNFGGNIGLFGKAQTIASEPARVLSDPMKGN 427

Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
             G+G++MEGIEQNP +Y LM +  + +E ++++ W N+Y  RRYG        AW +L 
Sbjct: 428 YSGIGLTMEGIEQNPFIYQLMLDHVWNNEPIELEKWTNKYITRRYGVLDNNAVKAWEILL 487

Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
           +TVY       D N+D          SI+S         G+P        ++ S +    
Sbjct: 488 NTVY------KDNNKD-----QGAPESILS---------GRPTF------AQNSYWTWTD 521

Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
           L+Y   E +RA +  I S ++L  S+ ++YD++D+TRQA+A YA  L   +   Y   D 
Sbjct: 522 LYYDNREFVRAWDYLIKSADKLRNSDGFQYDIVDITRQAMANYATALQRQLAYTYYAGDV 581

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
           +   + SRRFLEL+ D+D LLA    FLLG W++ AK+ A N+ + K
Sbjct: 582 NTYEKESRRFLELLSDLDRLLATRKDFLLGIWIDDAKKWATNDAERK 628


>gi|332018247|gb|EGI58852.1| Alpha-N-acetylglucosaminidase [Acromyrmex echinatior]
          Length = 686

 Score =  340 bits (871), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 175/464 (37%), Positives = 265/464 (57%), Gaps = 35/464 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPL  +W +  + LQ +IL R+ +LG+ PVLPAF+G+VP A   +FP+A +T
Sbjct: 152 MGNIRGFGGPLSSNWHNYTIRLQHQILQRMRDLGIVPVLPAFAGHVPRAFARLFPNANMT 211

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W   K + ++CC YLL+ TDPLF  IG  F++  + E+G T HIYNCDTF+EN P 
Sbjct: 212 KINPW--NKFEDKYCCPYLLEPTDPLFRTIGEKFLQMYIDEFG-TDHIYNCDTFNENEPG 268

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
                Y+ ++G +I+S M + DS A+WLMQ WLF +D  FW   +++A L SVP+G+++V
Sbjct: 269 NTELIYLRNVGHSIFSAMNAVDSKAIWLMQAWLFVHDIMFWTKSRVRAFLTSVPIGRMLV 328

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL +E  P +   K +YG P+IWCMLHNF G + M+G    I     E R   ++TMVG
Sbjct: 329 LDLQSEQFPQYDRLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINQRTFEGRNMNDSTMVG 388

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            G++ EGI QN V+Y+LM+EMA++H  V++  W   Y+ RRYG        AW  L  TV
Sbjct: 389 TGLTPEGINQNYVIYELMNEMAYRHVPVNLDNWFESYATRRYGAWNEYAVAAWQHLGRTV 448

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           YN       +   VI   P ++ S+ +                               WY
Sbjct: 449 YNFIGTQKIRGHYVITRRPSLNISLWT-------------------------------WY 477

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              +      +F+ +      +  YR+D++D+TRQAL   A+++++ I++ Y+  +    
Sbjct: 478 DRKDFYAMWNMFLKARYGRGNNTLYRHDVVDITRQALQLIADDIYMTILDCYKKKNITAF 537

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
              +   LEL +D++ +LA  + FLLG WL  AK +A NEE+ +
Sbjct: 538 QSSANALLELFDDLESILASGNNFLLGTWLAQAKDIAVNEEERR 581


>gi|449518399|ref|XP_004166229.1| PREDICTED: alpha-N-acetylglucosaminidase-like, partial [Cucumis
           sativus]
          Length = 336

 Score =  334 bits (857), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 163/227 (71%), Positives = 190/227 (83%), Gaps = 2/227 (0%)

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
           VGVGMSMEGIEQNPVVYDLMSEMAFQH KVDVK W+ QYSVRRYG  VP+IQDAW+VLYH
Sbjct: 1   VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH 60

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           TVYNCTDGA DKNRDVIVAFPDVDPS I V       +G   S  +V + + +++D PHL
Sbjct: 61  TVYNCTDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDS--SVDRLQDATFDRPHL 118

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
           WY TSEVI AL+LFIA G++LS+SNTYRYDL+DLTRQALAKY+NELF  I++AYQL+D  
Sbjct: 119 WYPTSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQ 178

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
            +  LS+ FLELV D+D LLACH+GFLLGPWL+SAKQLA++EE+EKQ
Sbjct: 179 TMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQ 225


>gi|255533286|ref|YP_003093658.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
 gi|255346270|gb|ACU05596.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
          Length = 734

 Score =  333 bits (855), Expect = 9e-89,   Method: Compositional matrix adjust.
 Identities = 172/465 (36%), Positives = 264/465 (56%), Gaps = 37/465 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  WGGP+ ++++ +Q  LQKKIL R   LGM P+LP+F+G+VP + ++ FP  K+ 
Sbjct: 190 MGNLDAWGGPMSKNFMAKQEALQKKILARERALGMTPILPSFTGHVPPSFKDKFPDIKVN 249

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
                + +   P     Y+L+   P+F EIGR F+   +  +G T H+Y+ DTF+E TP 
Sbjct: 250 TQQ--WGINVSP----AYVLNPETPMFKEIGRKFLTALINTFG-TDHLYSADTFNEMTPV 302

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            +   Y++ +   IY  M + D+ AVW+MQGW+F   P FW+P QMKAL ++VP  KL+V
Sbjct: 303 SNDSTYLNGMAKKIYESMAAVDTQAVWIMQGWMFLDRPNFWQPTQMKALFSAVPQDKLIV 362

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMV 238
           LDL +E+ P+WS +  FYG  +IWCMLHNF G + M+G +  I   P  A +  +   M 
Sbjct: 363 LDLNSELNPVWSRTDAFYGEKWIWCMLHNFGGRLSMFGDMSRIGNDPAAALKNDQRGKMS 422

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           G+G++MEGIEQNP +Y LM E  +  + +D+  W+  Y+ RRYG+     + AW VL +T
Sbjct: 423 GIGLTMEGIEQNPAIYSLMLEHIWNDKPIDLDNWLKGYAQRRYGKRNSNAEKAWEVLKNT 482

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VY+           +I   P  D + +                          + +  + 
Sbjct: 483 VYSHQPWWG--TNTIITGRPTFDAATV--------------------------WTYTAIP 514

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
           YS+ E+++A    + + +EL +S+ ++YDL+D+TRQ LA YAN L  +   +Y+  D   
Sbjct: 515 YSSKELMKAWSYLLTASDELKSSDGFQYDLVDVTRQVLANYANVLQQDFASSYKQKDMAT 574

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
             + S +FLEL++D+D LL     FLLG W+ +AK L  N  ++K
Sbjct: 575 FNKKSAQFLELIDDIDQLLGTRSDFLLGKWINNAKALGDNPAEKK 619


>gi|320162905|gb|EFW39804.1| lysosomal alpha-N-acetyl glucosaminidase [Capsaspora owczarzaki
           ATCC 30864]
          Length = 786

 Score =  328 bits (841), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 176/467 (37%), Positives = 268/467 (57%), Gaps = 33/467 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGP+   W+ +Q  LQ  IL R+   GM PVLP+F+G+VP+AL   FP+A IT
Sbjct: 238 MGNIKGWGGPISLEWIYKQRNLQVLILQRMRTFGMTPVLPSFAGHVPSALAQHFPNANIT 297

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q  +W +     ++CC   LDA+DPLF +IG  F+  Q + YG T+H+YNCD F+E TP 
Sbjct: 298 QSSDWNNFPD--QYCCVGFLDASDPLFTQIGAEFLRLQNETYG-TNHLYNCDQFNEMTPA 354

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
                Y+   G A+Y  M + D  AVW+MQGWLF  +  +W   +++ALL+ VP   +++
Sbjct: 355 STDLGYLKQAGMAVYQSMTAYDPAAVWVMQGWLFFNEAAWWSNDRVQALLSGVPDDHMII 414

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLF++V P+W+  + +YG P+IW MLH+F GNI +YGIL SI  GP  A  +   TMVG
Sbjct: 415 LDLFSDVTPVWNRLESYYGKPFIWNMLHDFGGNIGLYGILPSINEGPFAALATPGNTMVG 474

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD-AWNVLYHT 298
           +G++ EGI QN ++Y+ M E  ++   V++  W++ +  RRYG S PA+   A+  L  +
Sbjct: 475 IGLTPEGINQNYILYEFMMENMWRSAPVNLPTWVDAFVGRRYGPSTPAVAKLAYQQLLQS 534

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VYNCT+G     + ++   P V+                 +S+   + +        +L+
Sbjct: 535 VYNCTNGQYSVTKSLLEIRPAVN-----------------MSRNGFMPT--------NLY 569

Query: 359 YSTSEVIRALELFIA---SGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
           Y    VI A++  +A   S  +L++   +RYD++D TRQ L+  A +   N+  A     
Sbjct: 570 YDPGHVILAVDHILAAAKSAPQLASVVPFRYDVVDFTRQMLSNLAIDFHSNLTLALTSKQ 629

Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
           AH V    +  + L+ D+D LL     FLLGPWL +A+  ++N   +
Sbjct: 630 AHLVHLYGQGIVGLIADLDELLVSDAHFLLGPWLAAARSWSENTAAQ 676


>gi|390463730|ref|XP_003733088.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
           [Callithrix jacchus]
          Length = 830

 Score =  327 bits (839), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 186/465 (40%), Positives = 262/465 (56%), Gaps = 45/465 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W  PLP +W  +QL LQ  IL R+   GM PVLP F G+VP A+  VFP   +T
Sbjct: 297 MGNLHTWDAPLPHAWHIKQLYLQHWILDRMRSFGMVPVLPMFLGHVPKAITRVFPRVSVT 356

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+G+W        + C++LL   DP+F  +G  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 357 QMGSWGHFNCS--YSCSFLLAPEDPIFPILGSLFLRELTKEFG-TDHIYGADTFNEMQPP 413

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D+DAVWL+QGWLF Y P FW P Q++A+L S P G L+V
Sbjct: 414 SSEPSYLAAATAAVYEAMIAVDTDAVWLLQGWLFQYQPQFWGPAQVRAVLGSAPHGCLLV 473

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 474 LDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLFGALEAMNRGPEAARLFPNSTMVG 533

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+++  + V D+ AW        YG S P    AW +L  +
Sbjct: 534 TGMAPEGISQNXVVYSLMAELSWXKDPVPDLVAWX-------YGVSHPDTGAAWRLLLRS 586

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VYNC+  A   +                           P+ +   L+  T+      +W
Sbjct: 587 VYNCSGEACRGHNH------------------------SPLVRRPSLQMNTT------IW 616

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
           Y+ S+V  A  L  ++   L+AS T+RYDL+D+TRQ + +  +  +     AY L+   G
Sbjct: 617 YNQSDVFEAWRLLFSAAATLAASPTFRYDLLDVTRQVVQELVSLYYEEARSAY-LSKELG 675

Query: 419 VFQLSRRFL--ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
               +   L  EL+  +D +LA    FLLG WLE A+ +A +E +
Sbjct: 676 SLLRAGGILAYELLPALDEVLASDSHFLLGSWLEQARAVAVSEAE 720


>gi|332260899|ref|XP_003279518.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
           [Nomascus leucogenys]
          Length = 736

 Score =  325 bits (833), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 174/446 (39%), Positives = 257/446 (57%), Gaps = 38/446 (8%)

Query: 20  LVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYL 79
           L LQ ++L R+     +PVLPAF+G+VP A+  VFP   +T++G+W        + C++L
Sbjct: 216 LFLQHRVLDRMRSSAXDPVLPAFAGHVPEAVTRVFPRVNVTKMGSWGHFNCS--YSCSFL 273

Query: 80  LDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQ 139
           L   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP   P Y+++   A+Y  M 
Sbjct: 274 LAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMI 332

Query: 140 SGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYG 198
           + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L+VLDLFAE +P+++ +  F G
Sbjct: 333 AVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRLLVLDLFAESQPVYTRTASFQG 392

Query: 199 VPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMS 258
            P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+
Sbjct: 393 QPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMA 452

Query: 259 EMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVA 316
           E+ ++ + V D+ AW+  ++ +RYG S P    AW +L  +VYNC+ +     NR  +V 
Sbjct: 453 ELGWRKDPVPDLAAWVTSFAAQRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVR 512

Query: 317 FPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGN 376
            P                          L+  TS      +WY+ S+V  A  L + S  
Sbjct: 513 RPS-------------------------LQMNTS------IWYNRSDVFEAWRLLLTSAP 541

Query: 377 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRF-LELVEDMDG 435
            L+AS  +RYDL+DLTRQA+ +  +  +     AY   +   + +       EL+  +D 
Sbjct: 542 SLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLRKELASLLRAGGVLAYELLPALDE 601

Query: 436 LLACHDGFLLGPWLESAKQLAQNEEQ 461
           +LA    FLLG WLE A+  A +E +
Sbjct: 602 VLASDSRFLLGSWLELARAAAVSEAE 627


>gi|383856382|ref|XP_003703688.1| PREDICTED: alpha-N-acetylglucosaminidase [Megachile rotundata]
          Length = 744

 Score =  325 bits (833), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 179/472 (37%), Positives = 271/472 (57%), Gaps = 37/472 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  +GGPL  SW +Q + LQ KIL R+  LG+ PVLP+F+G+VP A   +FP+A +T
Sbjct: 194 MGNIRAFGGPLYPSWHEQSINLQHKILERMRSLGIIPVLPSFAGHVPRAFPRLFPNANVT 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L  W +      +CC YLL  TDPLF +IG+ F++  ++E+G T HIYNCDTF+EN P 
Sbjct: 254 KLAPWNNFPD--VYCCLYLLAPTDPLFQQIGQLFLKTYIEEFG-TDHIYNCDTFNENEPH 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
               +++ ++G + +  M + D DA+WLMQGWLF++D  FW  P+++A L SVP G+++V
Sbjct: 311 TSELKFLRNVGHSTFQAMNAVDPDAIWLMQGWLFTHDKLFWTEPRVEAFLTSVPRGRMIV 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL +E  P +   K ++G P+IWCMLHNF G + M+G    I     E R  +N+TMVG
Sbjct: 371 LDLQSEQFPQYGRLKSYFGQPFIWCMLHNFGGTLGMFGSAQIINQRVFEGRNMKNSTMVG 430

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            G++ EGI QN V+Y+LM+EMA++ E V++  W   Y+ RRYG        AW  L  TV
Sbjct: 431 TGLTPEGINQNYVIYELMNEMAYRKEPVNLNKWFENYASRRYGVWNEYAVSAWQSLGRTV 490

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           YN +     + + VI   P ++ S  +                               WY
Sbjct: 491 YNFSGTRKIRGKYVISRRPSLNLSTWT-------------------------------WY 519

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
               +     +F+ + +    S  YR+D++DLTRQ L   A E++  +I+++   +    
Sbjct: 520 DRDTLYNTWSVFLQARHGRRNSTLYRHDVVDLTRQVLQAKAEEIYPVLIDSFNKKNLTAF 579

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK--QVRCPY 469
              S + L+L +D++ +LA    FLLG WL++AK+LA N+E+ +  QV   Y
Sbjct: 580 KYHSDKLLDLFDDLELILASGKDFLLGKWLDAAKKLASNDEELRLYQVNAKY 631


>gi|194216885|ref|XP_001917396.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
           [Equus caballus]
          Length = 744

 Score =  324 bits (831), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 189/470 (40%), Positives = 270/470 (57%), Gaps = 48/470 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPL +SW  +QL LQ +IL R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 205 MGNLHTWDGPLTRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVT 264

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG+W        + C++LL   DPLF  +G  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 265 QLGSWGHFNCS--YSCSFLLAPEDPLFPVVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 321

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF +   FW P Q+ A+L +VP G+L+V
Sbjct: 322 SSEPAYLAAATAAVYQAMTAVDPDAVWLLQGWLFHHQRTFWGPAQVGAVLGAVPRGRLLV 381

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 382 LDLFAESQPMYIRTASFQGQPFIWCMLHNFGGNQGLFGALEAVNRGPAAARLFPNSTMVG 441

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D++AW+  ++ RRYG S    + AW +L  +
Sbjct: 442 TGMTPEGIGQNEVVYALMAELGWRKDPVADLEAWVTSFAARRYGVSHKDAETAWKLLLRS 501

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VYNC+  A   +                           P+ K   L+  T+      +W
Sbjct: 502 VYNCSAEAYSGHNQ------------------------SPLVKRPSLQMGTT------VW 531

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY-------ANELFLNIIEAY 411
           Y+ S+V  A  L + +   L++S  + YDL+D+TRQA  +        A   +LN  E  
Sbjct: 532 YNRSDVFEAWWLLLTAAPALASSPAFLYDLVDVTRQAAQELISLYYEEARTAYLN-KELV 590

Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            L  A G+        EL+  +D +LA    FLLG WL+ A+++A +E +
Sbjct: 591 PLLRAGGILA-----YELLPALDKVLASDSRFLLGSWLKQAREMAVSEAE 635


>gi|374385255|ref|ZP_09642763.1| hypothetical protein HMPREF9449_01149 [Odoribacter laneus YIT
           12061]
 gi|373226460|gb|EHP48786.1| hypothetical protein HMPREF9449_01149 [Odoribacter laneus YIT
           12061]
          Length = 736

 Score =  322 bits (825), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 178/466 (38%), Positives = 258/466 (55%), Gaps = 40/466 (8%)

Query: 2   SNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
            N+ GW GPLP+SW++    LQKKIL R  ELGM P+LPAF+G+VP   +  FP A++ Q
Sbjct: 199 GNIDGWCGPLPKSWMESHEELQKKILARERELGMTPILPAFTGHVPPTFKEHFPEARLRQ 258

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
           + NW     + R+  TYLL+A DPLF  IG  F+E+Q++ +G T H+Y  DTF+E  PP 
Sbjct: 259 V-NW-----EGRFDDTYLLEADDPLFQTIGNRFMEEQIRTFG-TDHLYGADTFNEMFPPS 311

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVVL 180
           +   Y+  +  A+Y  M + D +AVW+MQGWLF     FW+P QMKA L +VP   L+VL
Sbjct: 312 EDSTYLDGISKAVYQSMAAVDPEAVWVMQGWLFHDKRDFWKPAQMKAYLGAVPDEHLIVL 371

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS-ENTTMVG 239
           DL+ E  PIW  ++ FYG P+IWCMLHNF G   ++G    +A  P           ++G
Sbjct: 372 DLWGEEFPIWDRTEAFYGKPWIWCMLHNFGGRNMLFGNALKLAEEPSRVLADPAKGQLLG 431

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G   EGIEQNPV+Y L+    +++  V++  W   Y   RYG    A++ AW++L  TV
Sbjct: 432 LGAVPEGIEQNPVIYSLLFSHIWRNTAVELDEWFETYLESRYGCRDEAVEKAWDILRKTV 491

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                            EG   NY   ++     +   +++ +  + Y
Sbjct: 492 Y--------------------------ANEG---NYESAITARPTFEKH-NNWAYTDIPY 521

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
           +  EVI+A +  + + + L  +  YRYDLI + +Q LA YA  +     E Y+  D    
Sbjct: 522 NPVEVIKAWKYLLQAADRLGENPCYRYDLILVGKQVLANYATIIQQKFGEDYRTKDLPAF 581

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
            + SR F+EL++DMD L+  H+ FLLG WLE A+   +    EKQ+
Sbjct: 582 TRNSREFMELIDDMDELMGTHEAFLLGKWLEDARSWGKT-ASEKQL 626


>gi|321472423|gb|EFX83393.1| hypothetical protein DAPPUDRAFT_301977 [Daphnia pulex]
          Length = 799

 Score =  322 bits (824), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 171/468 (36%), Positives = 273/468 (58%), Gaps = 38/468 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N   WGGPL  +W    L+LQ KIL R+   GM PVLPAF+G+VP A++ V+P+A  T
Sbjct: 212 MGNFRAWGGPLSDNWQQATLILQHKILERMRSFGMTPVLPAFAGHVPRAMERVYPNASYT 271

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            L +W + +   ++CC   L  T+PLF EIG  FI++   E+G + H+YNCD F+E  P 
Sbjct: 272 HLTSWLNFQD--QYCCPLFLQPTEPLFTEIGSRFIKEMALEFG-SDHVYNCDVFNEVRPT 328

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P ++SS+G A+++ M + D DA+WLMQGWLF  D  +W     KALL SVP G++++
Sbjct: 329 QADPVFVSSVGTAVFNAMTTADPDAIWLMQGWLFKSDADYWTADLSKALLTSVPQGRMLI 388

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL AE+ P +     FYG P+++C+LHNF G + + G +  I+   ++AR   N+TMVG
Sbjct: 389 LDLQAELDPQYIRLNSFYGQPFVFCLLHNFGGTLGLNGAIQIISQRVIDARNFPNSTMVG 448

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            G++MEGI+QN VVYD M EM ++ +  ++  W ++Y+VRRYG +  A+  AW  L ++V
Sbjct: 449 TGLTMEGIDQNYVVYDKMLEMGWRDKVPNLNQWFDEYTVRRYGVNNTAVMSAWRFLQNSV 508

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           YN +   + + + V+V  P +                               +  P +WY
Sbjct: 509 YNDSSRRSFRGQYVLVTRPAL-------------------------------WQLPFVWY 537

Query: 360 STSEVIRALELFIA---SGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
           +  +VI A +  I+   +   LS ++ +R+D++DLTRQ++ +  + L+  ++E Y   ++
Sbjct: 538 NPHDVILAWDHLISGLMTEPLLSNASNFRHDMVDLTRQSMQEIFHLLYSQLLEVYLEKNS 597

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
             +  ++ + ++L++D+D LL     FLLG W+  AK     E ++ Q
Sbjct: 598 TAIEGIAYKMIDLLQDLDELLQTGKKFLLGKWIADAKSWGTTEGEKLQ 645


>gi|301106961|ref|XP_002902563.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
           infestans T30-4]
 gi|262098437|gb|EEY56489.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
           infestans T30-4]
          Length = 684

 Score =  320 bits (820), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 173/461 (37%), Positives = 264/461 (57%), Gaps = 32/461 (6%)

Query: 1   MSNLHG-W-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
           M NL G W  GPLPQ+++D Q  LQ KIL R+ E GM P LPAF+G+VP  ++ +FP+AK
Sbjct: 121 MGNLRGSWVKGPLPQAFIDSQYALQLKILNRMREFGMIPALPAFAGHVPEEMKALFPNAK 180

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
            T+  NW     +  +CC Y+LD +DPL+ +IG+ F+E+Q   Y  TS +Y CDT++E  
Sbjct: 181 FTRSPNWGDFSDE--FCCVYMLDFSDPLYYDIGKTFLEEQRALYDYTSSLYQCDTYNEMD 238

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
           P    P  + +   A+   M + D++AVWL+QGWLF   P +W   ++KA L+ V   K+
Sbjct: 239 PDFTDPAKLQAASRAVIDSMTAADANAVWLIQGWLFENSPDYWTKNRVKAYLDGVSNEKM 298

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LDL++EV+P+WS    ++G  +++C+LHNF GN  M G L ++   PV+A    N TM
Sbjct: 299 IILDLYSEVRPVWSKMDNYFGKSWVYCVLHNFGGNTGMRGDLATLGTAPVQASRDSNGTM 358

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
           +GVG++MEGI QN VVYDL  +MA+    +D+  W+  ++ +RY       + AW  L  
Sbjct: 359 IGVGLTMEGIYQNYVVYDLTLQMAWVDTPLDMDEWVPSFAAQRYHSQDVHTERAWGFLLQ 418

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           +VYN T G     + +I   P                + K V    +  S T        
Sbjct: 419 SVYNRTLGFGGVTKSLICLIP----------------HWKLVRDGFMPTSIT-------- 454

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ--LND 415
            Y   ++ RA +  + +G+EL A +TYR+DL+D+TRQ L+ +    +L++ E Y+     
Sbjct: 455 -YDPMDITRAWKELLLAGSELHAVDTYRHDLVDVTRQFLSDHFMAQYLHLKEMYEGKTQP 513

Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
           AH +   + R L  +E MD +LA ++  LLG W+  A+ LA
Sbjct: 514 AHQLCAWTERMLLTIERMDEILATNEDSLLGNWIADARALA 554


>gi|449675146|ref|XP_002156234.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Hydra
           magnipapillata]
          Length = 646

 Score =  320 bits (819), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 166/464 (35%), Positives = 254/464 (54%), Gaps = 31/464 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAAL-QNVFPSAKI 59
           M NL GWGGPL  SW  +QL LQ+ I+ R+   GM PVLP F G++P AL   +FP++K 
Sbjct: 89  MGNLEGWGGPLSSSWYSKQLQLQQNIISRMRSFGMIPVLPGFGGHIPKALVSRLFPTSKY 148

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
            +L  W   K   ++  T+LLD  DPLF ++G AF+E Q + Y  T H+YN D F+E  P
Sbjct: 149 YKLKPW--NKFTGKYGGTFLLDPQDPLFKKVGAAFVEMQKQLYNGTDHVYNADIFNEMDP 206

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
           P  +  +I++    +Y+ M + DSDAVWLMQGW+F     W+P  ++A L ++P GKL++
Sbjct: 207 PQLTSAFITNTSIGVYNAMLASDSDAVWLMQGWMF-LSSVWKPELVEAWLQAIPYGKLII 265

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL +++ P++  +  FYG P+IWCM+ NF G   +YG L  +  G + AR +  + M+G
Sbjct: 266 LDLASDIYPLYDQTNAFYGHPFIWCMIENFGGTTRLYGQLTGVMKGVISARKTYKSFMIG 325

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            GM+ EGI QN + ++LM+EM +++E+ ++  W   Y  RRYG     + DAW +L  T+
Sbjct: 326 TGMTPEGINQNDINFELMNEMGWRNEEFNISDWTLSYIKRRYGDYPKMVSDAWLILIDTI 385

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           YNC DG  +   D  +  P + P +         N   PV                H+WY
Sbjct: 386 YNCNDGRENGGYDGRI--PVMRPQL---------NAKLPV----------------HMWY 418

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
           S  ++  A +L +   + +   +T+R DL+ L  Q L   +   +  ++  Y       V
Sbjct: 419 SIKDLYNAWKLMVKGSDYMPLIDTFRNDLVRLGTQVLEDLSIVFYTQMVSGYFNKSTLNV 478

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
            +   +   L+ DMD LLA     LLG W++SA+ +     + K
Sbjct: 479 EKYGSKITVLLTDMDRLLATDQYSLLGRWIQSARSMGDTLNETK 522


>gi|255533285|ref|YP_003093657.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
 gi|255346269|gb|ACU05595.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
          Length = 749

 Score =  317 bits (811), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 172/464 (37%), Positives = 256/464 (55%), Gaps = 38/464 (8%)

Query: 2   SNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
            N+ G  GPLP+SW++    LQKKIL R  ELGM P+LPAFSG+VP   +  FP+A++ +
Sbjct: 207 GNIDGLNGPLPKSWMESHEQLQKKILARERELGMKPILPAFSGHVPPTFKARFPNARVDR 266

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
           L NW     + R+  TY+L   DPLF +I   F+ +Q K +G T H+Y  DTF+E   P 
Sbjct: 267 L-NW-----EGRFADTYVLHPDDPLFQQIADKFMAEQDKAFGNTDHLYGADTFNEMYLPY 320

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVVL 180
               Y+  +G A+Y GM   D +A+W+MQGW+F     FW+P  +K  L+ VP   L++L
Sbjct: 321 TDTAYVRKIGTAVYKGMAKADPEAIWVMQGWMFWDKRDFWKPEVVKNYLSGVPDDNLIML 380

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMVG 239
           DLFA+ +PIW+ ++ F+G  +IWCMLHNF G   +YG L+ I   P E     N   + G
Sbjct: 381 DLFADEQPIWTKTEAFWGKKWIWCMLHNFGGRNPLYGDLNYIGREPAEMVHDPNRGRLSG 440

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G+  EGIEQNPVVY LM E  +  + +DVK+W+  Y+ RRYG+  P  + AW +L+ TV
Sbjct: 441 IGLVPEGIEQNPVVYSLMLEHVWNDQVIDVKSWLVNYAQRRYGQRDPQTEKAWQILHQTV 500

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y   +G+ +              +IIS          +P  ++    ++ +  D P   Y
Sbjct: 501 Y-AKEGSYE--------------TIISA---------RPTHEK---HADWTGTDLP---Y 530

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              +++ A    + + N    ++ Y++DL+ + RQ LA YA  L       ++  +    
Sbjct: 531 DGDKLVPAWTYLLNASNRFKNNDCYQFDLVTVGRQVLANYATVLQRLFARDFRNKNLTAY 590

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
              +  FL L+ DMD L+     FLLG WL  AK+ A NE + +
Sbjct: 591 RAHTAEFLTLIADMDKLMGTRKDFLLGKWLNDAKKWATNESESR 634


>gi|428176410|gb|EKX45295.1| hypothetical protein GUITHDRAFT_51145, partial [Guillardia theta
           CCMP2712]
          Length = 680

 Score =  315 bits (806), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 173/478 (36%), Positives = 272/478 (56%), Gaps = 35/478 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  WGG L QSW+DQQ  LQ KIL R  ELGM PVLPAF+G VP  ++++FP AK T
Sbjct: 121 MINIKAWGGGLTQSWIDQQRDLQLKILARERELGMLPVLPAFAGGVPEGMKSLFPEAKFT 180

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           + GNW         CC  ++D TDPLF++IG+ F+E+    YG ++HIY+CDTF+EN P 
Sbjct: 181 RHGNWGGFAEQH--CCVMMVDPTDPLFLKIGKMFVEEVRAVYG-SNHIYSCDTFNENRPR 237

Query: 121 VDSP----EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLG 175
            +      +++S    A++  M++ D DAVWLMQGWLF  D  FW+  ++ A L+ VP  
Sbjct: 238 SEHGSVGLDFLSHSSRAVFESMRAADPDAVWLMQGWLFMNDARFWQKRELDAYLSGVPED 297

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVP-----YIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
           ++++LDLF +V P+W         P     ++W MLH+F GN  MYG L  I+  PV A+
Sbjct: 298 RMIILDLFTDVFPVWKRRDLQRPTPIEKRRWVWNMLHSFGGNSGMYGRLQVISKDPVVAK 357

Query: 231 TSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG----RSVP 286
             E+ TMVGVG++ EGIEQNPVVY++M+EM ++ ++VDV +W+ +++ RR G    R   
Sbjct: 358 -KESQTMVGVGITTEGIEQNPVVYEMMAEMRWREQEVDVMSWVEKWADRRLGPEASRERK 416

Query: 287 AI-QDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
           A+ ++AW  L  TVY+C      + + ++ + P +D +   +    +    +   +EA++
Sbjct: 417 ALGEEAWRELASTVYSCPGTQMGQVKSMVESRPRLDLASGWIPNSDFMPIKRHYPEEALV 476

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
           ++          W      ++ L       +  + S++  +D+ D+TRQ L+     LF 
Sbjct: 477 RA----------W------LKLLRATRGGADGYTCSSSASFDIADVTRQVLSDLFARLFQ 520

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
            +    Q   A       +  L ++ DMD ++      LLG W+E A+   +++E+E+
Sbjct: 521 PLSSFCQTRLAGSAAVRMQTLLGIISDMDKMVGTQPRMLLGKWIEDARAWGKSKEEEE 578


>gi|156121099|ref|NP_001095696.1| alpha-N-acetylglucosaminidase precursor [Bos taurus]
 gi|151554244|gb|AAI48148.1| NAGLU protein [Bos taurus]
 gi|296476361|tpg|DAA18476.1| TPA: alpha-N-acetylglucosaminidase [Bos taurus]
          Length = 667

 Score =  314 bits (804), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 166/383 (43%), Positives = 229/383 (59%), Gaps = 35/383 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ +IL R+   GM PVLPAF+G+VP AL  VFP   +T
Sbjct: 204 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+GNW        + C++LL   DPLF  +G  F+ +  KE+G T HIY  DTF+E  PP
Sbjct: 264 QMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++  AA+Y  M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATAAVYQAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVAAVLGAVPRGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++  +  F G P+IWCMLHNF GN  ++G L+S+  GP  AR   N+TMVG
Sbjct: 381 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPTTARHFPNSTMVG 440

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ +Q + V D+ AW+  ++ RRYG S    + AW +L  +
Sbjct: 441 TGMAPEGIGQNEVVYALMAELGWQKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 500

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VYNC                       S  E +  N+  P+ +   L+  T+      +W
Sbjct: 501 VYNC-----------------------SGEECRGHNH-SPLVRRPSLQMVTT------VW 530

Query: 359 YSTSEVIRALELFIASGNELSAS 381
           Y+ S+V  A  L + + + L++S
Sbjct: 531 YNRSDVFEAWRLLLTATSTLASS 553


>gi|328867411|gb|EGG15793.1| alpha-N-acetylglucosaminidase [Dictyostelium fasciculatum]
          Length = 1501

 Score =  311 bits (797), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 170/472 (36%), Positives = 269/472 (56%), Gaps = 46/472 (9%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M N++GWGGPL   ++  Q  LQ++IL R+ + GM PVLP F+G+VP A  ++FP+A IT
Sbjct: 950  MGNVNGWGGPLDYDFIAGQHDLQQQILERMRQYGMKPVLPGFAGHVPRAFMSLFPTANIT 1009

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            QLG+W +         TY LD +DPLF  + + F++ Q   YG T H Y+ D F+E TPP
Sbjct: 1010 QLGDWRAFNG------TYYLDPSDPLFANVSQTFVKVQTAIYG-TDHYYSFDPFNEITPP 1062

Query: 121  VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
                 Y+ +  +++Y+ +   D  AVW++Q W F  D  FW+PPQ+KA L  VP+G L+V
Sbjct: 1063 SSDAGYLQNSSSSMYNALAYADPQAVWVLQAWFFISDAWFWQPPQVKAFLGGVPIGHLLV 1122

Query: 180  LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
            LD +AE  P W+ + QF G  +IWCMLHNF G   MYG +  I  GP++AR  ++  M G
Sbjct: 1123 LDTWAEESPAWTVTDQFNGHDWIWCMLHNFGGRTGMYGKIPRITAGPIDAR-KQSPGMKG 1181

Query: 240  VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
             G++ E IEQN ++YDLMSEM+++    ++  WINQY+ RRYG  VP +  AWN L  TV
Sbjct: 1182 TGLTPEAIEQNYIMYDLMSEMSWRTTAPNMTEWINQYTQRRYGVFVPELAQAWNSLASTV 1241

Query: 300  YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
            YN  D + DKN     +F  + P +                               +++Y
Sbjct: 1242 YNAPD-SIDKNPS---SFVGIRPELNMTN---------------------------NIYY 1270

Query: 360  STSEVIRALELFIASGNE-LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
             +S + +A +L+++  +E + +++TY +D+ ++T QAL+    E  + + +AY+      
Sbjct: 1271 DSSIIQKAWQLYLSVTDEYVLSTSTYSFDIAEITIQALSNLFIETEIAMYDAYKTGKGTE 1330

Query: 419  VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ-----LAQNEEQEKQV 465
              + +   L ++ DMD + +     L+G W  +A+Q     L++N+++++  
Sbjct: 1331 FDEHAMNCLNIITDMDMIASTQQLLLVGTWTANARQWANYNLSRNKDEDRNT 1382


>gi|348681836|gb|EGZ21652.1| hypothetical protein PHYSODRAFT_247428 [Phytophthora sojae]
          Length = 991

 Score =  308 bits (790), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 171/461 (37%), Positives = 264/461 (57%), Gaps = 32/461 (6%)

Query: 1   MSNLHG-W-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
           M NL G W  GPLPQ+++D Q  LQ +IL R+ E GM P LPAF+G+VP  L+   P+A 
Sbjct: 422 MGNLRGSWVKGPLPQAFIDNQHELQLRILQRMREFGMIPALPAFAGHVPEDLKLTLPNAN 481

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
            T+  NW +     ++CC Y+++ TDPL+ EIG+AF+E+Q   Y  TS +Y CDT+ E  
Sbjct: 482 FTRSPNWGNFTD--QYCCVYMIEPTDPLYREIGKAFLEEQRALYNYTSSLYQCDTYMEMA 539

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
           P       +     A+  GM + D +AVWLMQGW F  DP +W  P++KA L  VP  KL
Sbjct: 540 PEFTDLSELKGAARAVIDGMTAADPNAVWLMQGWPFVDDPHYWTRPRVKAYLEGVPTDKL 599

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LD ++E  PIW+    ++G  +I+ +LHNF GN  M G L ++A  PV+A+   N TM
Sbjct: 600 IILDFYSEAVPIWNKMDNYFGKNWIYSVLHNFGGNTGMRGDLPTLATAPVQAQRDGNGTM 659

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
           VGVG++MEGI QN VVYDL  +MA++   +DV  W+++Y+ RRY      ++ AW+ L  
Sbjct: 660 VGVGLTMEGIFQNYVVYDLTLQMAWEDSPLDVDEWVSKYASRRYHTQNEHVERAWSYLSR 719

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           +VYN T           +A+  V  S++ +       Y +              +    +
Sbjct: 720 SVYNRT-----------LAYGGVTKSLVCLIPHWRLLYDR--------------FQPTLI 754

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF--LNIIEAYQLND 415
            Y   +++ A +  + +G+EL   +TYR+DL+D+T+Q L+    E +  L +I + +   
Sbjct: 755 KYDPKDIVLAWKELLLAGDELRNVDTYRHDLVDVTKQFLSNKLLEQYQHLKVIYSAKSAP 814

Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
           A+ V +L++  L  +  ++ +LA ++ FLLG W+  A  LA
Sbjct: 815 ANEVCELTKTMLTTINRLEEILATNEDFLLGNWVADALNLA 855


>gi|194759443|ref|XP_001961958.1| GF14678 [Drosophila ananassae]
 gi|190615655|gb|EDV31179.1| GF14678 [Drosophila ananassae]
          Length = 783

 Score =  307 bits (787), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 166/468 (35%), Positives = 259/468 (55%), Gaps = 40/468 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GW GPL   W   QL+LQ++IL     LGM+  LPAF+G+VP AL  + P+   T
Sbjct: 228 MGNIRGWAGPLKPEWRQFQLLLQQEILSAQRNLGMSVALPAFAGHVPRALSRLHPNTSFT 287

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W       R+CC   ++ T+PLF +I   F++  +  YG ++HI+ CD F+E  PP
Sbjct: 288 DVQRWNQFPD--RYCCGLFVEPTEPLFHQIATTFLQSVVTIYG-SNHIFFCDPFNELEPP 344

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           V  PEY+ S  AAI++ M + D +A+WL+QGW+F  +PFW P   +A L +VP G+++VL
Sbjct: 345 VAKPEYMRSTAAAIHNSMTAVDPEAIWLLQGWMFVKNPFWTPDMAEAFLTAVPRGRILVL 404

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL +E  P +  +  ++G P+IWCMLHNF G + M+G    I  G   AR+  N+++VG 
Sbjct: 405 DLQSEQFPQYELTHSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEAARSMPNSSIVGT 464

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ EGI QN VVY L  E  +    +D+ +W   ++V RYG    ++  AW +L ++VY
Sbjct: 465 GITPEGIGQNYVVYSLTLERGWSRNSIDLDSWFRHFTVTRYGVKDESLAKAWLLLKNSVY 524

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH-PHLWY 359
           +                       +    G+Y    +P            S++H P  WY
Sbjct: 525 SFHG--------------------LQKMRGQYVVTRRP------------SFNHDPFTWY 552

Query: 360 STSEVIRALELFIASGN----ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
           + S+V+ A  L +++      E    + Y +DL+D+TRQ L   A++L++N+  +++   
Sbjct: 553 NASDVLEAWHLLLSARVIIPLEDDRYDVYEHDLVDITRQFLQITADQLYVNLKSSFRKRQ 612

Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
                 LS R L+L +D++ +L+    FLLG WLE AKQ+A + E  K
Sbjct: 613 LPRFEFLSTRLLQLFDDLELILSSGRNFLLGNWLEQAKQVAPHPEDRK 660


>gi|348681870|gb|EGZ21686.1| hypothetical protein PHYSODRAFT_495971 [Phytophthora sojae]
          Length = 692

 Score =  304 bits (779), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 164/461 (35%), Positives = 261/461 (56%), Gaps = 32/461 (6%)

Query: 1   MSNLHG-W-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
           M NL G W  GPLPQ+++D Q  LQ KIL R+   GM P LPAF+G+VP  L+ ++P+AK
Sbjct: 121 MGNLRGSWVEGPLPQAFIDGQYELQLKILERMRGFGMVPALPAFAGHVPEELKTLYPNAK 180

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
            T+  NW     +  +CC Y+LD  DPL+ EIG+ F+E+Q   Y  TS +Y CDT++E  
Sbjct: 181 FTRSPNWGGFSDE--FCCVYMLDPQDPLYYEIGKTFLEEQRALYDYTSSLYQCDTYNEMD 238

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
           P    P  + +   A+   M + D +AVWL+QGWLF   P +W   +++  L+ VP  K+
Sbjct: 239 PDFTDPAKLQAASRAVIDSMTAADPNAVWLIQGWLFVNSPNYWTKERVQTYLDGVPNDKM 298

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LDL++EV+P+W+    ++G  +I+C+LHNF GN  M G L ++   PV A  + + TM
Sbjct: 299 IILDLYSEVRPVWNKMDNYFGKSWIYCVLHNFGGNTGMRGDLPTLGTAPVLANRASSGTM 358

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
           +G+G++MEGI QN VVYDL  +MA+    +D+  W+  ++ +RY       + AW  L  
Sbjct: 359 IGMGLTMEGIFQNYVVYDLTLQMAWVDAPLDMDEWVPSFAAQRYHSQDAHTERAWGFLLQ 418

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           +VYN T G     + ++   P                    + ++  + +  +       
Sbjct: 419 SVYNRTLGYGGVTKSLVCLIPHW-----------------KLVRDGFMPTLIT------- 454

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-- 415
            Y   ++ RA +  + +G+EL A +TYR+DL+D+TRQ L+ +    +L++ + Y   +  
Sbjct: 455 -YDPMDITRAWKELLLAGSELHAVDTYRHDLVDVTRQFLSDHFMAQYLHLEDMYAGKETP 513

Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
           A  +   + R L  +E +D +LA +D FLLG W+  A+ LA
Sbjct: 514 ADQLCAWTDRMLVTIEWLDEILATNDDFLLGNWVADARALA 554


>gi|157134500|ref|XP_001656341.1| alpha-n-acetylglucosaminidase [Aedes aegypti]
 gi|108881379|gb|EAT45604.1| AAEL003150-PA [Aedes aegypti]
          Length = 763

 Score =  303 bits (776), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 166/464 (35%), Positives = 258/464 (55%), Gaps = 37/464 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPL  ++++    LQ +++  +  LGM   LPAF+G++P     ++P AK+T
Sbjct: 211 MGNIRGWGGPLTTNFINFSKKLQNQVIDEMRRLGMVLALPAFAGHLPVQFAQLYPEAKLT 270

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            + NW    +  ++     LD  DPLF EIG+ F+ + ++ YG ++HIY CD F+E  P 
Sbjct: 271 PVENWNGFPA--QYASPLFLDPIDPLFQEIGKRFLTKVIERYG-SNHIYFCDPFNEIQPR 327

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
             S +Y+SS  A IY  M   D  AVWL+QGW+F  +P+W    ++A L +VPLG+++VL
Sbjct: 328 SFSAKYLSSASAGIYKAMNDVDPFAVWLLQGWMFVKNPYWSDVAIRAFLQAVPLGRMLVL 387

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL +E  P +  ++ ++G P+IWCML NF G + M G +D +     + RT+++ TM+G 
Sbjct: 388 DLQSEQFPQYDRTESYHGQPFIWCMLSNFGGTLGMLGSVDLVFQRIRDVRTNDSMTMIGT 447

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ EGI QN  +Y+   EM +     +V+ W   Y+  RYG     ++DAW++  +TVY
Sbjct: 448 GITPEGINQNYGLYEFALEMGWNPNIDNVEEWFRTYASVRYGTQDKRLKDAWSMFRYTVY 507

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           +  +                      +  GKY    +P  K            HP LWY+
Sbjct: 508 SFKEQ--------------------EMMRGKYTFNRRPSLKL-----------HPWLWYN 536

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
            +     ++L + S    S +  +R D++DLTRQ L   A+ L+LNI+EAY   + + V 
Sbjct: 537 ETLFNAGVQLLLESN---STNTLFRNDVVDLTRQFLQNTADRLYLNIMEAYNTKNPNSVK 593

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
            LS  F +L+EDMD LL     FLLG WLESAK +A+   + ++
Sbjct: 594 YLSILFQKLLEDMDRLLRTDQHFLLGRWLESAKAVAETSLERQK 637


>gi|242011515|ref|XP_002426494.1| alpha-N-acetylglucosaminidase, putative [Pediculus humanus corporis]
 gi|212510620|gb|EEB13756.1| alpha-N-acetylglucosaminidase, putative [Pediculus humanus corporis]
          Length = 1345

 Score =  302 bits (774), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 172/465 (36%), Positives = 270/465 (58%), Gaps = 41/465 (8%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M N+  +   L  +WL QQL+LQ KIL R+ ELG+ PVLP+F G VP + ++ +P AK+ 
Sbjct: 836  MGNVRNFSYGLTNNWLQQQLLLQHKILNRLRELGITPVLPSFCGIVPRSFKDSYPFAKLL 895

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            ++  W     D  +CC YLLD+ DPLF  + R F+++ + E+G T+HIYNCD F+EN P 
Sbjct: 896  EMPKWNKFSRD--YCCPYLLDSNDPLFSVVSRVFLKEYINEFG-TNHIYNCDVFNENKPA 952

Query: 121  VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRP-PQMKALLNSVPLGKLVV 179
             +S +Y+S++ + IY  M S D  A WL+QGW+F  DPFW    ++KA +N+VP G++++
Sbjct: 953  SESLDYLSTISSTIYKAMSSVDPRATWLVQGWMF-IDPFWASLKRVKAFINAVPKGRMLI 1011

Query: 180  LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
            LDL +++ P +   + ++G P+IWC LHNF G + MYG L+ +  G  + R  +N+TMVG
Sbjct: 1012 LDLQSDLTPQYKRLQSYFGQPFIWCTLHNFGGQLGMYGHLNRVNLGVFKGRKFKNSTMVG 1071

Query: 240  VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            +G++ EGI+QN ++YD   ++A + + VD+  WI +Y++RRYG     I DAW +L +T+
Sbjct: 1072 IGIAPEGIDQNYIMYDFTLDLALRTKPVDLDDWITKYALRRYGLIEKNILDAWLILKNTL 1131

Query: 300  YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
            YN                PD +  + S     Y         + +L    S   +   WY
Sbjct: 1132 YNYN--------------PDSNFRLTSSNVKMYTLVKGEHIAKNILTKFPSLRMNEFTWY 1177

Query: 360  STSEVIRALELF-IASGNELSASNT-YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            + S ++   E F IAS N + ++++ +++DLID+TRQ +                     
Sbjct: 1178 NRSIILDIFEKFQIASSNSILSTSSLFQHDLIDVTRQTI--------------------Q 1217

Query: 418  GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
               + S  FLEL+ ++D +L     FLLG WLESAK +A N+ ++
Sbjct: 1218 IAIENSNMFLELLNELDMILNTGKKFLLGNWLESAKNMATNKLEK 1262


>gi|195577611|ref|XP_002078662.1| GD22403 [Drosophila simulans]
 gi|194190671|gb|EDX04247.1| GD22403 [Drosophila simulans]
          Length = 778

 Score =  302 bits (774), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 164/467 (35%), Positives = 251/467 (53%), Gaps = 38/467 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GW GPL   W   QL+LQ++I+   + LGM+  LPAF+G+VP AL+ + P +   
Sbjct: 223 MGNIRGWAGPLTPGWRRYQLLLQQEIITAQHNLGMSVALPAFAGHVPRALKRLHPESTFM 282

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W       R+CC   ++ TD LF EI   F++  + +YG ++HI+ CD F+E  PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFVEPTDNLFKEIASRFLQNIITKYG-SNHIFFCDPFNELEPP 339

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           V  PEY+ S  AAIY  M+  D  A+WL+QGW+F  +PFW     +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESMRGIDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL +E  P +  ++ ++G P+IWCMLHNF G + M+G    I  G  EAR   N+++VG 
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIDEARRLPNSSLVGT 459

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ EGI QN V+Y    E  + +  +D+ +W   +S  RYG     ++ AW +L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNTSLDLDSWFTNFSHTRYGVKDERLEQAWLLLKNSVY 519

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           +                       +    G+Y           V+    S    P  WY+
Sbjct: 520 SFRG--------------------LQKMRGQY-----------VVTRRPSFNQEPFTWYN 548

Query: 361 TSEVIRALELFIASGN----ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
            S V+ A  L + S      E      Y +DL+D+TRQ L   A++L++N+  AY+    
Sbjct: 549 ASAVLDAWHLLLTSRAIIPLEDDRYEIYEHDLVDITRQFLQISADQLYVNLRSAYRKRQV 608

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
                LS + L+L +DM+ +LA    FLLG WL+ AKQ A N  +++
Sbjct: 609 ARFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQAKQAAPNTGEQR 655


>gi|195339231|ref|XP_002036223.1| GM12949 [Drosophila sechellia]
 gi|194130103|gb|EDW52146.1| GM12949 [Drosophila sechellia]
          Length = 778

 Score =  300 bits (768), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 163/467 (34%), Positives = 250/467 (53%), Gaps = 38/467 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GW GPL   W   QL+LQ++I+     LGM+  LPAF+G+VP AL+ + P +   
Sbjct: 223 MGNIRGWAGPLTAGWRRYQLLLQQEIITAQRNLGMSVALPAFAGHVPRALKRLHPESTFM 282

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W       R+CC   ++ T+ LF EI   F++  + +YG ++HI+ CD F+E  PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFVEPTENLFKEIASRFLQNIITKYG-SNHIFFCDPFNELEPP 339

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           V  PEY+ S  AAIY  M+  D +A+WL+QGW+F  +PFW     +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESMRGIDPEAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL +E  P +  ++ ++G P+IWCMLHNF G + M+G    I  G  EAR   N+++VG 
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ EGI QN V+Y    E  + +  +D+  W   +S  RYG     ++ AW +L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNTSLDLDGWFTNFSHTRYGVKDERLEQAWLLLKNSVY 519

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           +                       +    G+Y           V+    S    P  WY+
Sbjct: 520 SFRG--------------------LQKMRGQY-----------VVTRRPSFNQEPFTWYN 548

Query: 361 TSEVIRALELFIASGN----ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
            S V+ A  L + S      E      Y +DL+D+TRQ L   A++L++N+  AY+    
Sbjct: 549 ASAVLDAWHLLLTSRAIIPLEDDRYEMYEHDLVDITRQFLQISADQLYVNLRSAYRKRQV 608

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
                LS + L+L +DM+ +LA    FLLG WL+ AKQ A N  +++
Sbjct: 609 ARFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQAKQAAPNTGEQR 655


>gi|21356587|ref|NP_652045.1| CG13397, isoform A [Drosophila melanogaster]
 gi|442626853|ref|NP_001260251.1| CG13397, isoform B [Drosophila melanogaster]
 gi|16185856|gb|AAL13967.1| LP03571p [Drosophila melanogaster]
 gi|22945953|gb|AAF52672.2| CG13397, isoform A [Drosophila melanogaster]
 gi|440213562|gb|AGB92787.1| CG13397, isoform B [Drosophila melanogaster]
          Length = 778

 Score =  299 bits (765), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 163/467 (34%), Positives = 250/467 (53%), Gaps = 38/467 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GW GPL  +W   QL+LQ++I+     LGM+  LPAF+G+VP AL+ + P +   
Sbjct: 223 MGNIRGWAGPLTPAWRRYQLLLQQEIITAQRNLGMSVALPAFAGHVPRALKRLNPESTFM 282

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W       R+CC   ++ T+ LF EI   F+   + +YG ++HI+ CD F+E  PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFVEPTENLFKEIASRFLHNIITKYG-SNHIFFCDPFNELEPP 339

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           V  PEY+ S  AAIY  M+  D  A+WL+QGW+F  +PFW     +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESMRGIDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL +E  P +  ++ ++G P+IWCMLHNF G + M+G    I  G  EAR   N+++VG 
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ EGI QN V+Y    E  + +  +D+ +W   +S  RYG     ++ AW +L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNTSLDLDSWFTNFSHSRYGVKDERLEQAWLLLKNSVY 519

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           +                       +    G+Y           V+    S    P  WY+
Sbjct: 520 SFRG--------------------LQKMRGQY-----------VVTRRPSFNQEPFTWYN 548

Query: 361 TSEVIRALELFIASGN----ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
            S V+ A  L +        E +    Y +DL+D+TRQ L   A++L++N+  AY+    
Sbjct: 549 ASAVLDAWHLLLTFRAIIPLEDNRYEIYEHDLVDITRQFLQISADQLYINLRSAYRKRQV 608

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
                LS + L+L +DM+ +LA    FLLG WL+ AKQ A N  Q++
Sbjct: 609 SRFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQAKQAAPNTGQQR 655


>gi|194863164|ref|XP_001970307.1| GG23441 [Drosophila erecta]
 gi|190662174|gb|EDV59366.1| GG23441 [Drosophila erecta]
          Length = 778

 Score =  298 bits (762), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 163/467 (34%), Positives = 251/467 (53%), Gaps = 38/467 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GW GPL   W   QL+LQ++I+     LGM+  LPAF+G+VP AL+ + P +   
Sbjct: 223 MGNIRGWAGPLTPEWRRYQLLLQQEIIAAQRNLGMSVALPAFAGHVPRALKRLHPGSTFM 282

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W       R+CC   L+ TD LF EI   F+++ +  YG ++HI+ CD F+E  PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFLEPTDNLFNEIALIFLQKIITAYG-SNHIFFCDPFNELEPP 339

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           V  PEY+ S  AAIY  ++  D  A+WL+QGW+F  +PFW     +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESIRRLDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL +E  P +  ++ ++G P+IWCMLHNF G + M+G    I  G  EAR   N+++VG 
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ EGI QN V+Y    E  + +  +D+ +W   +S  RYG     ++ AW  L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNRPLDLDSWFTSFSHARYGVKDERLEQAWLQLKNSVY 519

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           +                       +    G+Y    +P  K+            P  WY+
Sbjct: 520 SFHG--------------------LQKMRGQYVVTRRPSFKQ-----------EPFTWYN 548

Query: 361 TSEVIRALELFIASGN----ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
            S V+ A  L ++S      E      Y +DL+D+TRQ L   A++L++N+  AY+    
Sbjct: 549 ASAVLDAWHLLLSSRAIIPLEDDRYEMYEHDLVDITRQFLQISADQLYVNLRSAYKKRQV 608

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
                LS + L+L +DM+ +LA    FLLG WL+ AKQ A +  +++
Sbjct: 609 SRFEFLSSKLLKLFDDMELILASSRNFLLGNWLQQAKQAAPHPGEQR 655


>gi|90399367|emb|CAJ86183.1| H0212B02.15 [Oryza sativa Indica Group]
 gi|116311963|emb|CAJ86322.1| OSIGBa0113E10.5 [Oryza sativa Indica Group]
          Length = 692

 Score =  296 bits (758), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 133/189 (70%), Positives = 157/189 (83%), Gaps = 3/189 (1%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+HGWGGPLPQSWLD QL LQKKIL R+Y  GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 261 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 320

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYG TSH+Y+CDTFDENTPP
Sbjct: 321 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGGTSHVYSCDTFDENTPP 380

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG---KL 177
           +  P YISSLGAA + GMQSGD DA+WLMQGWLFSYDPFW PPQMK  +     G     
Sbjct: 381 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFSYDPFWEPPQMKIGVGMSMEGIEQNP 440

Query: 178 VVLDLFAEV 186
           +V DL +E+
Sbjct: 441 IVYDLMSEM 449



 Score =  293 bits (750), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 139/232 (59%), Positives = 178/232 (76%), Gaps = 6/232 (2%)

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
           +GVGMSMEGIEQNP+VYDLMSEMAF H +VD++ W+  Y  RRYG+S+  +QDAW +LY 
Sbjct: 427 IGVGMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVETYPTRRYGKSIVGLQDAWKILYQ 486

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAV-LKSETSSY 352
           T+YNCTDG  DKNRDVIVAFPDV+P +I  T G Y    + Y   +SK  + + +    Y
Sbjct: 487 TLYNCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSSSKTYSTKLSKNYIAVDASNDEY 545

Query: 353 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
           +HPHLWY T  VIRALELF+  G+E+S SNT+RYDL+DLTRQ LAKYAN++F+ IIE+Y+
Sbjct: 546 EHPHLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVDLTRQTLAKYANQVFVKIIESYK 605

Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
            N+ + V  L + F++LV D+D LLA H+GFLLGPWLESAK LA+++EQE Q
Sbjct: 606 ANNVNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLESAKGLARDKEQEMQ 657


>gi|198476648|ref|XP_001357424.2| GA12255 [Drosophila pseudoobscura pseudoobscura]
 gi|198137793|gb|EAL34493.2| GA12255 [Drosophila pseudoobscura pseudoobscura]
          Length = 767

 Score =  296 bits (757), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 161/462 (34%), Positives = 257/462 (55%), Gaps = 38/462 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPL   +   Q +LQ+ IL    +LG++  LPAF+G++P A++ ++P+   T
Sbjct: 212 MGNIRGWGGPLKPEYQRLQELLQQHILRAQRDLGISVALPAFAGHLPTAMRRIYPNGNYT 271

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W S   DP +CC   +D  DP+F  +   F+ + ++ YG ++HI+ CD F+E  PP
Sbjct: 272 EVERWNSF-PDP-YCCGLFVDPLDPIFDLVAALFLRRVVQRYG-SNHIFFCDPFNELQPP 328

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           V  P+Y+ S  AAI++ M+S D +AVWL+QGW+F  + FW    M+A L +VP+G+L+VL
Sbjct: 329 VAEPDYMRSTAAAIHNSMRSVDPEAVWLLQGWMFVKNIFWTDAMMEAFLTAVPIGRLIVL 388

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL +E  P +  +  +YG P++WCMLHNF G + M+G  D +  G   AR   N+++VGV
Sbjct: 389 DLQSEQFPQYQRTDSYYGQPFVWCMLHNFGGTLGMFGSADLVNNGIEAARRMPNSSIVGV 448

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ EGI QN V+Y L+ E  +    +D+ +W   ++  RYG     +Q AW +L  +VY
Sbjct: 449 GITPEGIGQNYVMYSLVLERGWSELPLDLDSWFKHFARTRYGVDDEGLQQAWQLLRRSVY 508

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           +       K R                        G  V++   L  +      P  WY+
Sbjct: 509 SFR--GLQKMRG-----------------------GYTVTRRPALNLD------PFTWYN 537

Query: 361 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
            S+V+ A +L ++S   +   +     Y +DL+D+TRQ L   A++L++N+  AY+    
Sbjct: 538 ASDVLEAWKLLLSSRAIIPLEDDNYAIYEHDLVDITRQYLQISADQLYVNLKSAYRKRQV 597

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
                L  + L+L  D++ +LA    FLLG WL  A++ A N
Sbjct: 598 ARFEYLGSKLLQLFGDLERILASGSNFLLGTWLADAQRAAPN 639


>gi|301107007|ref|XP_002902586.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
           infestans T30-4]
 gi|262098460|gb|EEY56512.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
           infestans T30-4]
          Length = 736

 Score =  295 bits (755), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 169/454 (37%), Positives = 258/454 (56%), Gaps = 32/454 (7%)

Query: 1   MSNLHG-W-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
           M NL G W  GPLPQ+++D Q  LQ +IL R+ E GM P LPAF+G+VP  L+   P+A 
Sbjct: 196 MGNLRGSWVKGPLPQAFIDNQHELQLRILERMREFGMIPALPAFAGHVPEELKLRLPNAH 255

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
            TQ  NW +   +   CC ++++ TD L+ EIG+ F+++Q + Y  TS +Y CDT+ E  
Sbjct: 256 FTQSPNWGNFSEEH--CCVFMIEPTDALYREIGKNFLKEQRELYNYTSSLYQCDTYMEMA 313

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
           P       +     A+  GM + D +AVWLMQGW F  DP FW  P++KA L+ VP  KL
Sbjct: 314 PEFTDLTELEGAARAVIDGMTAADPNAVWLMQGWPFVDDPHFWTKPRVKAYLDGVPTDKL 373

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LD ++E  PIWS    ++G  +I+ +LHNF GN  M G L ++A  PV A  + N TM
Sbjct: 374 IILDFYSESVPIWSKMDNYFGKSWIYSVLHNFGGNTGMRGDLLTLATAPVLANWAGNGTM 433

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
           VGVG++MEGI QN +VYDL  +MA+    +DV  WI QY+ +RY      ++ AW+ L  
Sbjct: 434 VGVGLTMEGIFQNYIVYDLTLQMAWVDNPLDVNTWIPQYAAQRYHTHNEHVEQAWSYLLR 493

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           +VYN T           +A+  V  S++ +       Y +   +  ++K           
Sbjct: 494 SVYNRT-----------LAYGGVTKSLVCLIPHWRLLYDR--FQPTLIK----------- 529

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA- 416
            Y  ++V+ A +  + + NEL   +TYR+DL+D+T+Q L+    E ++++   Y    A 
Sbjct: 530 -YDPNDVVLAWKELLLAENELRDVDTYRHDLVDVTKQFLSNKLLEQYIHLKGIYNAKKAS 588

Query: 417 -HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 449
            + V  L++  L  +E ++ +LA ++ FLLG W+
Sbjct: 589 PNEVCGLTKTMLTTMERLEEILATNEDFLLGNWI 622


>gi|158300970|ref|XP_320760.4| AGAP011750-PA [Anopheles gambiae str. PEST]
 gi|157013415|gb|EAA00039.4| AGAP011750-PA [Anopheles gambiae str. PEST]
          Length = 770

 Score =  295 bits (754), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 163/459 (35%), Positives = 248/459 (54%), Gaps = 35/459 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPL  S+      LQ +++  +  LGM   LPAF+G++P   + ++P+    
Sbjct: 213 MGNIRGWGGPLTPSFTQFAHTLQVRVVGEMRRLGMAVALPAFAGHLPVQFRTLYPNVSFA 272

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W +    P++     LD T+PLF  IG  F++  +K YG T H+Y  D F+E  P 
Sbjct: 273 NVSVWNNFP--PQYASPLFLDPTEPLFAAIGSRFLQLAIKTYG-TDHVYFSDPFNEIDPT 329

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           + S +Y+SS+  AIYS M   D DA+WL+QGW+F  +PFW    +++ L++VPLG+++VL
Sbjct: 330 LPSGKYLSSVSEAIYSTMVQVDPDAIWLLQGWMFVKNPFWSDRAIRSFLSAVPLGRMLVL 389

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL +E  P +  +  + G P+IWCML NF G + M G + ++  G  E R +   T++G 
Sbjct: 390 DLQSEQYPQYGRTASYAGQPFIWCMLSNFGGTLGMLGSVGNVFRGIRETRDNSTYTLLGT 449

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR-SVPAIQDAWNVLYHTV 299
           G++ EGI QN  +Y+   EM +  E    + W ++Y+V RYG  S    Q AWN+   TV
Sbjct: 450 GITPEGINQNYALYEFALEMGWNAELDSAEQWFSEYAVARYGNDSDERAQQAWNIFLRTV 509

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y               AF  ++     +  GKY    +P SK             P  WY
Sbjct: 510 Y---------------AFEGLE-----LMRGKYTFNRRPSSK-----------IRPWTWY 538

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
                 + LEL ++   E S +   +YDL+D TRQ L   A+ L+L ++++++  D    
Sbjct: 539 DVHTFNQGLELLLSFAEEASCNQLCQYDLVDATRQCLQHTADALYLTLMDSFKKRDLTSF 598

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
              S  FL+L+ D+D LL  ++ FLLGPWLESAK  A+ 
Sbjct: 599 RLHSSLFLQLLSDLDVLLRTNEHFLLGPWLESAKAHAET 637


>gi|195155652|ref|XP_002018715.1| GL25802 [Drosophila persimilis]
 gi|194114868|gb|EDW36911.1| GL25802 [Drosophila persimilis]
          Length = 767

 Score =  294 bits (753), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 160/462 (34%), Positives = 257/462 (55%), Gaps = 38/462 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPL   +   Q +LQ+ IL    +LG++  LPAF+G++P A++ ++P+   T
Sbjct: 212 MGNIRGWGGPLKPEYQRLQELLQQHILRAQRDLGISVALPAFAGHLPTAMRRIYPNGNYT 271

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W S   DP +CC   +D  DP+F  +   F+ + ++ YG ++HI+ CD F+E  PP
Sbjct: 272 EVERWNSF-PDP-YCCGLFVDPLDPIFDLVAALFLRRVVQRYG-SNHIFFCDPFNELQPP 328

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           V  P+Y+ S  AAI++ M+S D +AVWL+QGW+F  + +W    M+A L +VP+G+L+VL
Sbjct: 329 VAEPDYMRSTAAAIHNSMRSVDPEAVWLLQGWMFVKNIYWTDAMMEAFLTAVPIGRLIVL 388

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL +E  P +  +  +YG P++WCMLHNF G + M+G  D +  G   AR   N+++VGV
Sbjct: 389 DLQSEQFPQYQRTDSYYGQPFVWCMLHNFGGTLGMFGSADLVNNGIEAARRMPNSSIVGV 448

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ EGI QN V+Y L+ E  +    +D+ +W   ++  RYG     +Q AW +L  +VY
Sbjct: 449 GITPEGIGQNYVMYSLVLERGWSELPLDLDSWFKHFARTRYGVDDEGLQQAWQLLRRSVY 508

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           +       K R                        G  V++   L  +      P  WY+
Sbjct: 509 SFR--GLQKMRG-----------------------GYTVTRRPALNLD------PFTWYN 537

Query: 361 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
            S+V+ A +L ++S   +   +     Y +DL+D+TRQ L   A++L++N+  AY+    
Sbjct: 538 ASDVLEAWKLLLSSRAIIPLEDDKYAIYEHDLVDITRQYLQISADQLYVNLKSAYRKRQV 597

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
                L  + L+L  D++ +LA    FLLG WL  A++ A N
Sbjct: 598 ARFEYLGSKLLQLFGDLEHILASGSNFLLGTWLADAQRAAPN 639


>gi|170060634|ref|XP_001865888.1| alpha-N-acetyl glucosaminidase [Culex quinquefasciatus]
 gi|167879069|gb|EDS42452.1| alpha-N-acetyl glucosaminidase [Culex quinquefasciatus]
          Length = 761

 Score =  293 bits (750), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 164/466 (35%), Positives = 249/466 (53%), Gaps = 41/466 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPL +S+      LQ K++  +   GM   LPAF+G++P   + +FP AK+ 
Sbjct: 210 MGNIRGWGGPLKESFKTFASDLQAKVVQEMRRFGMILALPAFAGHLPVQFKTLFPQAKLN 269

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W    +  ++     LD  DPLF +IG  F+ + +  YG T HIY  D F+E  P 
Sbjct: 270 PVEVWNGFPA--QYASPLFLDPVDPLFQKIGSKFVAKAIARYG-TDHIYFSDPFNEIQPR 326

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            +S  Y++S  A IY  M   D  AVWL+QGW+   +PFW    +KA   +VP G+++VL
Sbjct: 327 SESARYLASAAAGIYQAMVDVDPLAVWLLQGWMLVKNPFWSDRAIKAFFTAVPNGRMLVL 386

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL +E  P +  ++ +YG P+IWCML NF G + M G +D +     E R++E+ TM+G 
Sbjct: 387 DLQSEQFPQYVRTQSYYGQPFIWCMLSNFGGTLGMLGSVDLVFERIRETRSNESMTMIGT 446

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ EGI QN  +Y+   EM +  +  DV  W  +Y++ RYG     +QDAW++   TVY
Sbjct: 447 GITPEGINQNYGLYEFALEMGWNPDISDVDNWFTRYAMVRYGNDDKRLQDAWSIFRSTVY 506

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           +                       + +  GKY    +P  K             P +WY+
Sbjct: 507 SFKG--------------------MEMMRGKYTFNRRPSLKL-----------QPWVWYN 535

Query: 361 TSEVIRALELFIA--SGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
            +     +EL +A    NEL     +++D++DLTRQ L   A++L+L I++ Y L +A  
Sbjct: 536 ETRFDEGVELILAVNGSNEL-----FKHDVVDLTRQFLQNTADKLYLTIMDTYTLKNAAA 590

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
               S  F EL++++D LLA +  FLLG WLESAK LA    + ++
Sbjct: 591 FKHYSNLFKELLQNIDRLLATNTHFLLGRWLESAKSLATTSLERQK 636


>gi|440800773|gb|ELR21808.1| AlphaN-acetylglucosaminidase (NAGLU) [Acanthamoeba castellanii str.
           Neff]
          Length = 800

 Score =  292 bits (748), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 175/480 (36%), Positives = 257/480 (53%), Gaps = 67/480 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  WGGPL +SW + Q  LQKKI+  ++         +      A L+ V+P A IT
Sbjct: 199 MGNVQSWGGPLTKSWREGQAELQKKIVQGVWN---EERAVSVRWARAAGLKRVYPHANIT 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
               W       R    +LLD  DP+F +IG AFI+ Q + YG T HIYN DTF+E  PP
Sbjct: 256 LSPTWAHFTDPYR---VWLLDPFDPIFQKIGTAFIDAQTRVYG-TDHIYNADTFNELDPP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
              P Y+++   A+Y GM + D  A+WLMQGWLF    +W   ++KA L+ V    +++L
Sbjct: 312 SADPTYLAAASNAVYQGMAAADPKALWLMQGWLFR-SVWWSNDRIKAYLSGVKNDNMLIL 370

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL+AEV PIWS ++ ++G P++WCMLH+F GN ++YG L  IA  PV+ART+  +TMVG 
Sbjct: 371 DLYAEVDPIWSKTESYFGKPFVWCMLHDFGGNRDLYGNLTHIATAPVDARTAPGSTMVGT 430

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ME IEQNPV+Y+LMSEM ++   VDV  W++ Y   RYG   P+ + AW +L+ + Y
Sbjct: 431 GLTMEAIEQNPVIYELMSEMGWRSAHVDVDDWLDHYVSFRYGADSPSAKKAWRLLHQSAY 490

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
                            P +  SI +    ++                 S   H    YS
Sbjct: 491 QN---------------PVIMRSIYTFVPNRH----------------VSRNHH----YS 515

Query: 361 TSEVIRALELFIASGNEL----SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND- 415
              ++ A  L + S  EL      +  + YDL+D+TRQ L    + LF    +AY L D 
Sbjct: 516 PDVLVEAWGLLLQSRLELPNPAQPNGPWEYDLVDVTRQVL----DNLFH---DAYGLLDG 568

Query: 416 AHGVFQLSRR------------FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
           A+  +  +RR             ++++ D+D +LA +  +LLG W E A+  A NEE+++
Sbjct: 569 AYDAYVATRRDPFNQVKTIGAALIQILSDIDTVLATNQNYLLGVWTERARSWATNEEEKR 628


>gi|330791218|ref|XP_003283691.1| hypothetical protein DICPUDRAFT_26247 [Dictyostelium purpureum]
 gi|325086434|gb|EGC39824.1| hypothetical protein DICPUDRAFT_26247 [Dictyostelium purpureum]
          Length = 712

 Score =  288 bits (737), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 158/458 (34%), Positives = 253/458 (55%), Gaps = 37/458 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++ WGGP+   WL++Q  LQ +IL R+   GM PVLP F+G++P A+Q +FP+A ++
Sbjct: 170 MGNVNNWGGPITMDWLEKQRDLQIQILTRMRAYGMKPVLPGFAGHIPGAIQTLFPTANVS 229

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            L  W           T+ LD +DPLF +I + FI + +  +G T H YN D F+E  PP
Sbjct: 230 ILSTWCEFNG------TFYLDPSDPLFGKITQLFITELIGVFG-TDHYYNFDPFNELAPP 282

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW-LFSYDPFWRPPQMKALLNSVPLGKLVV 179
                ++      +Y+ M + D  AVW++QGW +  Y  FW+  Q +A  + VP+G  +V
Sbjct: 283 SSDLGFLKQTSQQMYNNMLAADPKAVWVLQGWFIVDYPEFWQANQTQAWFSGVPIGGFIV 342

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL+++V P W+ ++ FYG  ++WCMLHNF G   MYG +  IA  P+ AR S +  M+G
Sbjct: 343 LDLWSDVAPAWNITEYFYGHYWLWCMLHNFGGRSGMYGRIPFIATNPIIAR-SLSDNMMG 401

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            G++ E IEQN VVYDLMSEMA++    D++ WI QY+ RRYG+ +P + + W  +  TV
Sbjct: 402 TGLTPEAIEQNVVVYDLMSEMAWRSTAPDLEEWITQYTNRRYGKIMPEVVEVWMSMVDTV 461

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           +N T     +N     +F  + PSI         N+G  V                  +Y
Sbjct: 462 FNATAYWARRNMGAPESFIALRPSI---------NFGDNV------------------FY 494

Query: 360 STSEVIRALELF-IASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
             S +  A  +F + + + + ++ T+++D+ ++T QAL+ +  + + N+I++Y ++D   
Sbjct: 495 DPSVMFNAWHVFSLVNDSYVISTETFQFDISEITMQALSNFFMDTYFNLIKSYNVSDIES 554

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
             + S   +E +  MD + +      LG W   A+  A
Sbjct: 555 FQRESITMMETISFMDLIASTQPELQLGVWTYRARLWA 592


>gi|195473052|ref|XP_002088810.1| GE10991 [Drosophila yakuba]
 gi|194174911|gb|EDW88522.1| GE10991 [Drosophila yakuba]
          Length = 778

 Score =  287 bits (734), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 158/460 (34%), Positives = 245/460 (53%), Gaps = 38/460 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GW GPL   W   QL+LQ++I+     LGM+  LPAF+G+VP AL+ + P +   
Sbjct: 223 MGNIRGWAGPLTPQWRRYQLLLQQEIIAAQRNLGMSVALPAFAGHVPRALKRLNPDSTFM 282

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W       ++CC   ++  + LF EI   F+++ +  YG ++HI+ CD F+E  PP
Sbjct: 283 EVQRWNQFPD--QYCCGLFVEPKENLFNEIALNFLQKIITIYG-SNHIFFCDPFNELEPP 339

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           V  PEY+ S  AAIY  M+  D  A+WL+QGW+F  +PFW     +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTSAAIYESMRRIDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL +E  P +  ++ ++G P+IWCMLHNF G + M+G    I  G  EAR   N+++VG 
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ EGI QN V+Y    E  + ++ +D+ +W   +S  RYG     ++ AW  L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNKPLDLDSWFTNFSHTRYGVKDERLEQAWLQLKNSVY 519

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           +                       +    G+Y           V+    S    P  WY 
Sbjct: 520 SFRG--------------------LQKMRGQY-----------VVTRRPSFNQEPFTWYD 548

Query: 361 TSEVIRALELFIASGN----ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
            S V+ A  L ++S      E      Y +DL+D+TRQ L   A++L++N+  A++    
Sbjct: 549 ASAVLDAWHLLLSSRAIIPLEDDRYEMYEHDLVDITRQFLQISADQLYVNLRSAFRKRQV 608

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
                LS + L+L +DM+ +LA    FLLG WL+ AK+ A
Sbjct: 609 TRFEYLSTKLLKLFDDMELILASSRNFLLGNWLQQAKRAA 648


>gi|297273081|ref|XP_001095618.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Macaca mulatta]
          Length = 691

 Score =  286 bits (733), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 174/472 (36%), Positives = 245/472 (51%), Gaps = 85/472 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
           M NLH W GPLP SW  +QL LQ ++L R+   GM PVLPAF+G+VP A+   +  P A 
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRTSCMPVAS 263

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           +       S+   P                  GR  I                 + +   
Sbjct: 264 LPA-----SLPPSPG-----------------GRKLIH----------------SINLMQ 285

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
           PP  +P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L
Sbjct: 286 PPSSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRL 345

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           +VLDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM
Sbjct: 346 LVLDLFAESQPVYTLTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTM 405

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
           VG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  ++ +RYG S P    AW +L 
Sbjct: 406 VGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLL 465

Query: 297 HTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
            +VYNC+ +     NR  +V  P                          L+  TS     
Sbjct: 466 RSVYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS----- 495

Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY---- 411
            +WY+ S V  A  L + S   L+AS  +RYDL+DLTRQA+ +  +  +     AY    
Sbjct: 496 -VWYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKE 554

Query: 412 --QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
              L  A GV        EL+  +D LLA    FLLG WLE A+  A +E +
Sbjct: 555 LTSLLRAGGVLA-----YELLPALDELLASDSRFLLGSWLEQARAAAVSEAE 601


>gi|195115262|ref|XP_002002183.1| GI17241 [Drosophila mojavensis]
 gi|193912758|gb|EDW11625.1| GI17241 [Drosophila mojavensis]
          Length = 773

 Score =  286 bits (732), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 164/466 (35%), Positives = 258/466 (55%), Gaps = 38/466 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  WGGPLP +    Q +LQ++IL    ELGM+  LPAFSG VP A++ VFP+A  T
Sbjct: 212 MGNLRSWGGPLPPAHRQLQQLLQQRILAAQRELGMSVALPAFSGYVPTAMRRVFPNASFT 271

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q   W     DP +CC   ++  DPLF ++G  F+ + ++ YG ++HIY  D F+E  P 
Sbjct: 272 QSDRWNHF-PDP-YCCVLFVEPQDPLFQQVGAMFLRRVIQVYG-SNHIYFSDPFNEMMPR 328

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           V  P Y+     AIY+ MQ  D+DAVWL+QGW+F    +W    ++A L +VP G+++ L
Sbjct: 329 VREPNYVRYTAKAIYNSMQVVDADAVWLIQGWMFLKSVYWTNDLIEAYLTAVPRGRILAL 388

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL +E  P +  +  +YG P++WCML+NF GN+ ++G    I  G + AR+  N +MVGV
Sbjct: 389 DLQSEQFPQYERTHSYYGQPFVWCMLNNFGGNLGLFGSAQLIPSGIIAARSMPNGSMVGV 448

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ EGI QN  ++ L  E A+  +++ ++ W   +++ RYG +   +   W +L  +VY
Sbjct: 449 GITPEGIGQNYALFALTLEQAWSPDELQLEDWFEYFALTRYGVNDTRLSQVWQLLRESVY 508

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           +       + R+ +               GKY            L    S + +P +WY+
Sbjct: 509 SF------QGRERM--------------RGKY-----------TLNKRPSLHHYPWVWYN 537

Query: 361 TSEVIRALELFIASGNELSASNT----YRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
            + V  A  L + +   +  ++     Y +DL+D+TRQ L    +  ++N+  A +    
Sbjct: 538 VTMVYEAWRLMLEAKETVPLNDNRRAIYEHDLVDITRQCLQLSFDRFYVNLKSACRHKQL 597

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
           + V  L+ + LEL  DM+ +LA  + +LLG WLE+AK+LA +EEQ 
Sbjct: 598 NRVEYLAGKLLELFADMERILASGEHYLLGNWLEAAKRLAPSEEQR 643


>gi|66801665|ref|XP_629757.1| hypothetical protein DDB_G0291998 [Dictyostelium discoideum AX4]
 gi|60463162|gb|EAL61355.1| hypothetical protein DDB_G0291998 [Dictyostelium discoideum AX4]
          Length = 798

 Score =  282 bits (721), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 157/457 (34%), Positives = 253/457 (55%), Gaps = 35/457 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++GWGGP+   WL++Q  LQ KIL R+ + GM PVLP F+G++P A+Q +FP A I+
Sbjct: 255 MGNVNGWGGPITLDWLEKQRDLQIKILERMRQYGMKPVLPGFAGHIPGAIQQLFPQANIS 314

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            L  W +         T+ L++TDPLF +I   FI + +  +G T H YN D F+E  PP
Sbjct: 315 VLSTWCNFNG------TFYLESTDPLFAKITTMFIGELIDVFG-TDHFYNFDPFNELEPP 367

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
            +  +Y+     ++Y  +   D  AVW++QGW     P FW+  Q +A  + VP+G ++V
Sbjct: 368 SNDTDYLRQTSQSMYENVLLADPKAVWVLQGWFIVDAPEFWQAKQTEAWFSGVPIGGVLV 427

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL+++V P W+T+  +YG  ++WCMLHNF G   MYG L  I+  P+ AR   +  MVG
Sbjct: 428 LDLWSDVIPGWTTTNYYYGHYWVWCMLHNFGGRSGMYGRLPWISSNPITAR-GLSPNMVG 486

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G++ E IEQN VVYD+MSEM+++  + ++  W+ QY+ RRYG+ VP I D W  L +TV
Sbjct: 487 IGLTPEAIEQNVVVYDMMSEMSWRSVQPNLTEWVTQYTHRRYGKLVPEIVDVWISLVNTV 546

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           +N T      N     +F  + P +          +G             +S+ +P++ Y
Sbjct: 547 FNATAATARANMGAPESFIALRPQL---------TFGN------------NSFYNPNILY 585

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
           +   V   ++        + ++ T+ +D+ + T Q+L+ Y  + +  +IEA+  +D   +
Sbjct: 586 NAWNVFSMVD-----DEYVISTETFEFDISEFTMQSLSNYFMDQYFLLIEAFNASDVQTL 640

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
             +S   L+++  MD + +      LG W   A+  A
Sbjct: 641 STISIELLDIINYMDEIASTQSSLQLGLWTYRARLWA 677


>gi|440799253|gb|ELR20308.1| alpha-N-acetylglucosaminidase family protein [Acanthamoeba
           castellanii str. Neff]
          Length = 854

 Score =  281 bits (718), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 163/480 (33%), Positives = 247/480 (51%), Gaps = 57/480 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPL  +W   Q  LQKKI+ R    GM P+LP F+G VP  ++ ++P+A +T
Sbjct: 240 MGNIQGWGGPLDPAWRKAQAELQKKIVERQRMFGMLPILPGFAGFVPDGIKRIYPTANLT 299

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +  +W       ++   Y L   D L+  IGR  I +   E+G T HIYN DTF+E +PP
Sbjct: 300 KSADWAGFPH--QYTNVYFLSPLDSLYKTIGRMVIRRVTAEFG-TDHIYNADTFNEMSPP 356

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWR-PPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y GM + D  A+W+MQGW F +D FW    ++++ L+ V    +++
Sbjct: 357 SADPTYLAAASRAVYEGMAAEDPQALWVMQGWSFVFDKFWEDKSRVRSYLSGVSDKDMLI 416

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDL ++  P WS +  ++G  ++WCMLHN  G   +YG L   +  P+ A  +   TM  
Sbjct: 417 LDLASDNNPEWSKTDSYFGKEFVWCMLHNGGGVRGLYGNLTQYSSDPLLALATPGNTMLI 476

Query: 238 ------VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS--VPAIQ 289
                 VGVGM+ME IEQNPVVY+LMSEM ++ E  D+  W+ +Y+ RRYG +  + ++ 
Sbjct: 477 CGTCEQVGVGMTMEAIEQNPVVYELMSEMGWRSEAFDIVEWVQRYAERRYGLAAGLSSVG 536

Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 349
           +AW +L    Y         N+ VI      D      T G    YG             
Sbjct: 537 EAWELLREATY---------NQSVI------DYGWFGFTPGLGMGYGGVA---------- 571

Query: 350 SSYDHPHLWYSTSEVIRALELFIASG--NELSASNTYRYDLIDLTRQALAKYANELFLNI 407
                     + ++ + AL LF+ S      + +  ++YD +DLTRQ LA    +++   
Sbjct: 572 ----------NAAKEVEALRLFLQSALTKGYAPNGPWQYDCVDLTRQVLANTFRDIYAQF 621

Query: 408 IEAYQLNDAHGVF------QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
             AY    AH  +       L    L L+ D+D +LA +  +LLG W++SA   A   +Q
Sbjct: 622 DAAYSAYAAHKTYTVDQLKSLGSALLTLIGDIDEILATNPNYLLGTWIQSALSWADTPDQ 681


>gi|255553488|ref|XP_002517785.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
 gi|223543057|gb|EEF44592.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
          Length = 360

 Score =  279 bits (713), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 135/255 (52%), Positives = 182/255 (71%), Gaps = 5/255 (1%)

Query: 215 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 274
           MYGILDSI+ GP+EAR SEN+TMVGVGM MEGIE NPVVY+LMSEMAF+ EKV V  W+ 
Sbjct: 1   MYGILDSISTGPIEARVSENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSEKVQVLEWLK 60

Query: 275 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 334
            YS RRYG++V  ++ AW +LYHT+YNCTDG  D N D IV FPD DPS+ S ++   Q+
Sbjct: 61  TYSRRRYGKAVHQVEAAWEILYHTIYNCTDGIADHNTDFIVKFPDWDPSVQSGSDTSQQD 120

Query: 335 -----YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 389
                  +  S+  + +   S+    H+WYS  +VI AL+LFI  G+ L+ S TYRYDL+
Sbjct: 121 NKHIFLHRSGSRRFLFEGPNSTLPQAHIWYSIQKVINALQLFIDGGSHLTGSLTYRYDLV 180

Query: 390 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 449
           DLTRQ L+K AN+++++ I A++ NDA  +   S++F++L++D+D LLA  D FL+G WL
Sbjct: 181 DLTRQVLSKLANQVYVDAIIAFRSNDARALNLHSQKFIQLIKDIDVLLASDDNFLIGTWL 240

Query: 450 ESAKQLAQNEEQEKQ 464
           ESAK+LA N  + +Q
Sbjct: 241 ESAKELALNPSEMRQ 255


>gi|121698957|ref|XP_001267859.1| alpha-N-acetylglucosaminidase, putative [Aspergillus clavatus NRRL
           1]
 gi|119396001|gb|EAW06433.1| alpha-N-acetylglucosaminidase, putative [Aspergillus clavatus NRRL
           1]
          Length = 671

 Score =  278 bits (712), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 162/471 (34%), Positives = 259/471 (54%), Gaps = 39/471 (8%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G W G LP SW+D Q  LQKKI+ R+ ELGM PVLPAF+G VP A+  V P A +  
Sbjct: 197 NIQGSWHGELPYSWIDAQFELQKKIVRRMVELGMTPVLPAFTGFVPRAITRVLPDATVVN 256

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
              W     D ++     L+  DP F  + R+FI +Q + YG  +HIY  D ++EN P  
Sbjct: 257 GSRWSGF--DEKYTNDTFLEPFDPNFARLQRSFIHKQQQAYGNITHIYTLDQYNENDPYS 314

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-LVV 179
             PEY+ ++    +  ++S D DA+W+MQGWLF S   FW   ++ A L+ V   + ++V
Sbjct: 315 GDPEYLRNVTHNTWQSLKSADPDAIWMMQGWLFYSNSDFWTDERVHAYLSGVETDEDMLV 374

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLF+E +P W  ++ +YG P+IWC LH++ GN+ +YG + +I     +A    ++ +VG
Sbjct: 375 LDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGLYGQVMNITVNATDALAVSDS-LVG 433

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG----RSVP-AIQDAWNV 294
            G++MEG E N +VYDL+ + A+    +D  ++ + +   RY      +VP  +  AW++
Sbjct: 434 YGLTMEGQEGNEIVYDLLLDQAWSSRPIDTDSYFHDWVKARYSTARRHNVPHELYQAWDI 493

Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
           L  T YN T+ AT            V  SI  +         +P  K   L ++T    H
Sbjct: 494 LRTTAYNNTNLATATA---------VSKSIFEL---------QP--KLTGLVNQTGH--H 531

Query: 355 PHLW-YSTSEVIRALELFIASGNELSA---SNTYRYDLIDLTRQALAKYANELFLNIIEA 410
           P +  Y  S ++R+ +L +++ +E +A      +RYD++D+TRQ +A     ++LN+   
Sbjct: 532 PTVVNYEASSLVRSWKLMVSAASESTALWSHPAFRYDMVDVTRQVMANAFIPMYLNVTST 591

Query: 411 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
           YQ      + Q     + L+ D+D +L+ +D F L  W+ESA+  A+N+ +
Sbjct: 592 YQ--KGGPISQQGDSLIRLLRDLDAVLSTNDNFRLATWIESARTWARNDTE 640


>gi|195398029|ref|XP_002057627.1| GJ18000 [Drosophila virilis]
 gi|194141281|gb|EDW57700.1| GJ18000 [Drosophila virilis]
          Length = 766

 Score =  277 bits (709), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 156/464 (33%), Positives = 256/464 (55%), Gaps = 38/464 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GW GPLP +    Q +LQ+ I+    ELGM+  LPAF+G+VP A++ VFP+A  T
Sbjct: 214 MGNIRGWAGPLPPAHRRLQQLLQQLIVRAQRELGMSVALPAFAGHVPTAMRRVFPNANYT 273

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
               W +     ++CC   ++  DPLF ++G  F+ + ++ YG ++HIY  D F+E  PP
Sbjct: 274 PAERWNNFPD--QYCCDLFVEPHDPLFQQLGAMFLRRVIQVYG-SNHIYFSDPFNEMQPP 330

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P Y+ S   AIY+ M+  D +AVWL+QGW+F  D FW    ++A L +VP G+++VL
Sbjct: 331 LAEPGYMRSTAKAIYNSMREVDGNAVWLLQGWMFLKDIFWTDELIEAFLTAVPRGRILVL 390

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL +E  P +  +  +YG P++WCML+NF G + ++G    I  G   AR   N+++VGV
Sbjct: 391 DLQSEQFPQYQRTHSYYGQPFVWCMLNNFGGTLGLFGSAQFIGSGIASARIMPNSSLVGV 450

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ EGI QN  ++ L  E  +   ++ +  W + +++ RYG +   +  AW +L   VY
Sbjct: 451 GITPEGIGQNYAIFALTLEQGWSASELQLGDWFDHFALTRYGVNDTRLAQAWQLLRGGVY 510

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           +                       +    GKY            L        +P  WY+
Sbjct: 511 SFHG--------------------LQRMRGKY-----------ALNRRPGLNLNPWTWYN 539

Query: 361 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
            S V  A +L +AS   +  ++     Y +DL+D+TRQ L +  +++++N+  AY+    
Sbjct: 540 GSSVTDAWQLLLASREMVPLTDDRYAIYEHDLVDITRQFLQQSFDQIYVNLRSAYRKEQL 599

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
           + +  L+ + LEL++DM+ +LA    +LLG WLE+AK+LA +++
Sbjct: 600 NRLEYLAGKLLELLDDMERILASGVHYLLGTWLEAAKKLAPSDK 643


>gi|195050088|ref|XP_001992825.1| GH13491 [Drosophila grimshawi]
 gi|193899884|gb|EDV98750.1| GH13491 [Drosophila grimshawi]
          Length = 771

 Score =  275 bits (702), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 161/466 (34%), Positives = 262/466 (56%), Gaps = 38/466 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPLP +    Q +LQ++I+    +LGM+  LPAF+G+VP  L  +FP+A  T
Sbjct: 213 MGNIRGWGGPLPPAHRRLQQLLQQRIVQAQRDLGMSVALPAFAGHVPTGLPRIFPTANFT 272

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     DP +CC   ++ +DPLF  +G  F+ + ++ YG ++HIY  D F+E  P 
Sbjct: 273 SVERWNQFP-DP-YCCALFIEPSDPLFQLVGAQFLRRVIQIYG-SNHIYFSDPFNEMQPR 329

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P YISS   AIY+ M+  D D VWL+QGW+F  + +W    ++A L +VP G+++VL
Sbjct: 330 IAEPGYISSTARAIYNSMRMVDKDPVWLLQGWMFLDNAYWSDELIEAFLTAVPRGRMLVL 389

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL +E  P +  +  +YG P++WCML+NF G + M+G    I  G + AR+  N++MVGV
Sbjct: 390 DLQSEQFPQYQRTFSYYGQPFVWCMLNNFGGTLGMFGSAHLINAGIMAARSMPNSSMVGV 449

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ EGI QN  ++ L  E  +   K+++  W +Q+++ RYG +   +  AW +L  +VY
Sbjct: 450 GITPEGIGQNYALFALTLEQGWSGSKLELSDWFDQFTLTRYGVNDTDLILAWQLLRGSVY 509

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           +                       +    GKY            L    S    P +WY+
Sbjct: 510 HFHG--------------------LQRMRGKY-----------ALNKRPSFNLKPWIWYN 538

Query: 361 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
            S V+ A +L +A+   +   +     Y++DL+D+TRQ L +  +++++N+  AY+ +  
Sbjct: 539 ASSVVEAWQLLLAANQTIPVEDDRYALYKHDLVDITRQFLQQSFDQVYVNLKSAYRKSQL 598

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
                L+ + LEL+ DM+ +LA  + +LLG WLE+AK+LA + +Q 
Sbjct: 599 ARFEYLAAKLLELLADMERILASGEHYLLGNWLEAAKELAPSADQR 644


>gi|357622373|gb|EHJ73879.1| putative alpha-N-acetyl glucosaminidase [Danaus plexippus]
          Length = 780

 Score =  274 bits (700), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 158/459 (34%), Positives = 245/459 (53%), Gaps = 41/459 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+HGWGGPLPQSW D+Q  +Q+ +   +++LGM PV PAF+G+VP A + +FP+    
Sbjct: 206 MGNVHGWGGPLPQSWHDRQKQIQEVVTDLMFKLGMIPVFPAFNGHVPKAFEKIFPNTTFH 265

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W   K D  +CC   +D  +P F  I + F+ +     G +SHIY  D F+E    
Sbjct: 266 PVETW--NKFDEDYCCNLFVDPREPDFKMISKMFMREITAGLG-SSHIYTADPFNEIKIQ 322

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
             S   +     AI+S +   D DAVWL+Q W+F ++P  W   ++ + L SVP G+++V
Sbjct: 323 PWSTSLVVETAKAIFSSISEYDKDAVWLVQNWMFVHNPLLWPLKRVNSFLTSVPNGRMLV 382

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL +E  P +   + +YG P+IW MLHNF G + M+G   +I     E R  EN+TMVG
Sbjct: 383 LDLQSEQWPQYDLYQMYYGQPFIWSMLHNFGGTLGMFGNTKTINKDVYEVRKRENSTMVG 442

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           +G++ EGI QN V+YDLM E A++   V D++ W++ Y+ RRYG +  +I   W  L  +
Sbjct: 443 IGLTPEGINQNYVIYDLMLESAWRKGPVPDLEEWVSDYAERRYGCNATSI--GWKYLLRS 500

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VYN T                     ++   GKY           V+    S    P  W
Sbjct: 501 VYNFTG--------------------LNRIRGKY-----------VMTRRPSFNIRPWAW 529

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
           Y   ++  AL+ F+   N   +++ + +DL+D+TRQAL     ++++N+      N  + 
Sbjct: 530 YKGHDLFEALKNFVYVQNPACSTSGFLHDLVDVTRQALQYKIEQIYMNLQNDRYSN--YM 587

Query: 419 VFQLS-RRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
           VF  +   F++ + DM  +LA    F +  WL SA+ ++
Sbjct: 588 VFNYTISSFIDAMTDMQNILATSSDFKITSWLSSARAIS 626


>gi|119480815|ref|XP_001260436.1| alpha-N-acetylglucosaminidase, putative [Neosartorya fischeri NRRL
           181]
 gi|119408590|gb|EAW18539.1| alpha-N-acetylglucosaminidase, putative [Neosartorya fischeri NRRL
           181]
          Length = 748

 Score =  274 bits (700), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 156/472 (33%), Positives = 262/472 (55%), Gaps = 41/472 (8%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G WGG LP SW+D Q  LQKKI+ R+ ELGM PVLPAF+G VP A+  V P+A +  
Sbjct: 197 NIQGSWGGDLPYSWIDSQFELQKKIVRRMVELGMTPVLPAFTGFVPRAISRVLPNATVVN 256

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
              W     D R+     L+  DP F  + R+FI++Q + YG  +H+Y  D ++EN P  
Sbjct: 257 GSRWGGF--DERYTNDTFLEPFDPSFTRLQRSFIQKQQQAYGNITHVYTLDQYNENDPYS 314

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-LVV 179
              +Y+ ++    +  ++S D +AVWLMQGWLF S   FW   ++KA L+ V + + ++V
Sbjct: 315 GDLDYLRNVTRNTWLSLKSADPNAVWLMQGWLFYSNSDFWTDERVKAYLSGVEVDQDMLV 374

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLF+E +P W  ++ +YG P+IWC LH++ GN+ +YG + ++     +A  + + ++VG
Sbjct: 375 LDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGLYGQVMNVTVNATQALAASD-SLVG 433

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-----GRSVP-AIQDAWN 293
            G++MEG E N ++YDL+ + A+  + +D   + + +   RY     G +VP  +  AW+
Sbjct: 434 FGLTMEGQEGNEIMYDLLLDQAWSRQPIDTDHYFHNWVKTRYSSGVRGSAVPEELHQAWD 493

Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
           +L  TVYN T+  +     V  +  ++ PSI     G     G                D
Sbjct: 494 ILRTTVYNNTNLTSTA---VSKSIFELQPSI----SGLLNRTG----------------D 530

Query: 354 HP-HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
           HP  + Y  + +++A +L  ++ ++   L +   + YD++D+TRQ +A     +++N++ 
Sbjct: 531 HPTTVNYDPAALVQAWQLMDSAASKDRSLWSQPAFLYDMVDITRQVMANAFIPMYINLVS 590

Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            YQ      V       ++L+ D+D +L+ +D F L  W+ SA+  A+N+ +
Sbjct: 591 TYQA--GASVSTDGSNLIQLLRDVDSVLSTNDNFRLSTWIRSARSWARNDTE 640


>gi|390334740|ref|XP_003724005.1| PREDICTED: uncharacterized protein LOC100893810 [Strongylocentrotus
           purpuratus]
          Length = 1043

 Score =  274 bits (700), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 144/367 (39%), Positives = 207/367 (56%), Gaps = 36/367 (9%)

Query: 100 KEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF 159
           +E+  T HIYN DTF+EN P  +   Y+S+    +Y G+  GD   VWLMQGWLF    F
Sbjct: 573 EEFNGTDHIYNADTFNENQPRSNDSAYLSAASRGVYQGIVEGDPQGVWLMQGWLFQKTDF 632

Query: 160 WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGIL 219
           W P Q+KALL+ VP+G+++VLDLFAE +PI++ ++ FYG P+IWCMLHNF GN  +YG L
Sbjct: 633 WGPSQIKALLHGVPIGRMIVLDLFAEARPIYNATQSFYGQPFIWCMLHNFGGNTGLYGKL 692

Query: 220 DSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVR 279
           D++   P EAR   ++TM+G+G++ EGI QN V+Y+ +++M ++ E ++V  WI +YS R
Sbjct: 693 DAVNKFPFEARQFNSSTMIGMGLTPEGILQNYVMYNFLTDMTWRSESMNVSKWIEEYSGR 752

Query: 280 RY----GRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 335
           RY    G S  A + AW +L  TVYN T    D       A P V PS          N 
Sbjct: 753 RYSPESGHSEEAAK-AWAILQATVYNNTGIDKDHQH----AVPVVRPS----------NK 797

Query: 336 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 395
            K V                 +WY  +EV +A    + +   L  S+ +RYDL+D+TR  
Sbjct: 798 TKSV-----------------IWYDYTEVAKAWGFLLQASETLGTSSLFRYDLVDVTRNV 840

Query: 396 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
           L   A + +  I+ ++   +   +         L+ DMD + + H  +LLG WLE AK L
Sbjct: 841 LQDLAFDFYEQIMVSFHAKNITAIRGNGTLLCNLILDMDNITSSHQDWLLGTWLEDAKSL 900

Query: 456 AQNEEQE 462
           A N ++E
Sbjct: 901 ATNHKEE 907


>gi|392588150|gb|EIW77482.1| glycoside hydrolase family 89 protein [Coniophora puteana
           RWD-64-598 SS2]
          Length = 761

 Score =  273 bits (697), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 157/471 (33%), Positives = 252/471 (53%), Gaps = 40/471 (8%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G WGG LP  W+D Q VL K+I+ R+ +LGM PVLPAF+G VP A+ N++P+A I
Sbjct: 206 FGNIQGSWGGDLPTQWIDDQFVLGKQIVQRMVDLGMTPVLPAFTGFVPPAMHNLYPNASI 265

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
                W      P++     L+  DPLF ++ ++FI +Q   +G  SHIY  D ++EN P
Sbjct: 266 VNGSAWNDFA--PQFTNDSFLEPFDPLFAQVQQSFISKQQAAFGNVSHIYTLDQYNENDP 323

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPL---- 174
               P Y++++ AA +S +++ D DA WLMQGWLF S   FW P +++A L  VP     
Sbjct: 324 YSGDPSYLTNISAATFSSLRAADPDATWLMQGWLFFSSADFWTPERVEAYLAGVPGDDDG 383

Query: 175 GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN 234
             +++LDL++E +P W     ++G  +IWC LH++ GN+   G   ++   P+ A  S N
Sbjct: 384 SGMLILDLYSEAQPQWQRLSSYFGKRWIWCELHDYGGNMGFEGNFANVTEAPLAALASPN 443

Query: 235 TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVPAIQ-DAW 292
            +MVGVG++ EG+E N ++YD++ + A+    ++   +   ++ RRY    +P    +AW
Sbjct: 444 VSMVGVGLTPEGMEGNEIIYDVLLDQAWSSSPINKTEYAQAWATRRYPADELPECAIEAW 503

Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 352
             L  TVY+ TD  +      I+        +++VT                        
Sbjct: 504 QTLAATVYSNTDPGSQATVKSILELEPALSGLVNVTG----------------------- 540

Query: 353 DHP-HLWYST-SEVIRALELFIASGN---ELSASNTYRYDLIDLTRQALAKYANELFLNI 407
            HP H++Y T + ++ AL+  + +G+    L A   YRYDL+DLTRQ L     +L+ ++
Sbjct: 541 HHPTHVFYDTNTTIVPALQQLVQAGHSTPSLLAIPEYRYDLVDLTRQLLVNRFIDLYADL 600

Query: 408 IEAYQLN--DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
           +  Y      +  V    +  LELV D+D +L  ++ F L  W ++A+  A
Sbjct: 601 LAVYNTTSASSASVSAAGQPMLELVADLDKVLMTNENFQLSRWTDAARSWA 651


>gi|126307952|ref|XP_001365931.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Monodelphis
           domestica]
          Length = 481

 Score =  272 bits (696), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 128/253 (50%), Positives = 170/253 (67%), Gaps = 4/253 (1%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGGPLP SW  +Q  LQ +IL R+   GM PVLPAF+G++P A   VFP A +T
Sbjct: 205 MGNLHTWGGPLPSSWDLKQSYLQYQILERMRSFGMKPVLPAFAGHIPKAFTRVFPQANVT 264

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L NW        + C+YLL   DPLF  +G  F+ +  KE+G T HIY+ D F+E  PP
Sbjct: 265 KLDNWIDFNCT--YSCSYLLAPEDPLFPVVGSLFLRELAKEFG-TDHIYSADIFNEMDPP 321

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
             +P Y+++  AA+Y  M + D DAVWL QGWLF   P FW+PPQMKA+L +VP G+ ++
Sbjct: 322 SSNPAYLAATTAAVYEAMVAVDVDAVWLFQGWLFQNHPDFWKPPQMKAVLEAVPRGRFLI 381

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P++S +  FYG P+IWCMLHNF GN  ++G+LD++  GP  AR   N+T+VG
Sbjct: 382 LDLFAESQPVYSRTNSFYGQPFIWCMLHNFGGNHGLFGVLDAVNRGPSTARLFPNSTIVG 441

Query: 240 VGMSMEGIEQNPV 252
            G+  EGI QN +
Sbjct: 442 TGIVPEGINQNEI 454


>gi|281200618|gb|EFA74836.1| alpha-N-acetylglucosaminidase [Polysphondylium pallidum PN500]
          Length = 469

 Score =  272 bits (696), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 166/466 (35%), Positives = 252/466 (54%), Gaps = 54/466 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++ W G L   W+  Q  LQ +IL R+ + GM  VLP F+G+VP AL++ +P+A IT
Sbjct: 1   MGNVNEWAGNLTLGWMVDQRDLQIQILTRMRQFGMQAVLPGFAGHVPEALKSHYPNANIT 60

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QL +W           T  +  +   F+ I       Q   YG T H YN D F+E  PP
Sbjct: 61  QLSSW---------NMTVYIHQSPNTFMSI-------QQDLYG-TDHFYNFDPFNELEPP 103

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+ +   ++++ + + D   +W++QGWLF YD  FW+PPQ++A L+ VP+GK++V
Sbjct: 104 SSDPAYLKNCSQSMFNNLIAVDPQGIWVLQGWLFVYDTEFWQPPQIEAFLSGVPIGKMIV 163

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL+A+V   W  +  FYG  +IWCMLHNF G   MYG +  I+  P+EAR S +  MVG
Sbjct: 164 LDLWADVDAGWKITNYFYGHNWIWCMLHNFGGRSGMYGKIPFISTNPIEAR-SLSPNMVG 222

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            G++ E IEQN +VYDLMSEMA++    D+K W++QY  RRYG+ +  + D W  L  TV
Sbjct: 223 TGLTPEAIEQNVIVYDLMSEMAWRSTPPDLKEWVDQYVTRRYGKYIEVLADTWYELVGTV 282

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           +NC+                       VT+G        VS    L   TS Y  P +  
Sbjct: 283 FNCS----------------------IVTKGPVTIL---VSVRPQLNFTTSLYYDPIV-- 315

Query: 360 STSEVIRALELFIASGN-ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
               + +A   F++  +  +  ++T+ +DL ++T QAL+       L +  A+ LND++ 
Sbjct: 316 ----ISKAWSAFLSIDDLHVVNTSTFSFDLTEITTQALSNLFMTTELQMNAAF-LNDSYE 370

Query: 419 VFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA-QNEEQE 462
            F  LS   L +++D++ +++  +  L+G W   A+ L   NE  E
Sbjct: 371 EFSLLSDALLSIIQDINTIVSTQEMLLVGNWTARARALTPANETTE 416


>gi|288927792|ref|ZP_06421639.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
           (NAG) [Prevotella sp. oral taxon 317 str. F0108]
 gi|288330626|gb|EFC69210.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
           (NAG) [Prevotella sp. oral taxon 317 str. F0108]
          Length = 734

 Score =  272 bits (695), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 159/468 (33%), Positives = 241/468 (51%), Gaps = 43/468 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W GPLP SW+  Q  LQKKIL R  +LGM PVLPAF+G+VP  L+  +P AKIT
Sbjct: 198 MSNIDHWMGPLPMSWIKNQEKLQKKILRRTRDLGMKPVLPAFAGHVPEILKEKYPKAKIT 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            L  W   +   R C  + LD  D LF +I + +I++Q K YG T HIY  D F+E  PP
Sbjct: 258 PLSIWGDFEDQYR-C--HFLDPFDSLFTDIQKTYIDEQTKLYG-TDHIYGVDPFNELAPP 313

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              PEY+++  A IY  +++ DS AVWL   W+FSY    W   ++K+ + +VP  K ++
Sbjct: 314 SWEPEYLANASAKIYDVLKNADSKAVWLQMTWMFSYQRKDWTDERIKSYITAVPDKKQIL 373

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD +AE   +W  S+ +Y  P+IWC L NF GN  + G +  +     EA  +   +MVG
Sbjct: 374 LDYYAERTEVWKFSESYYKQPFIWCYLGNFGGNTMIAGNIAEVDRRLNEAFANAE-SMVG 432

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VG ++EG + NP++YD + E  +  + + +  W  Q++ RR G +    + AW +L   +
Sbjct: 433 VGSTLEGFDVNPIMYDFVFEKVWHKDGISLHDWTVQWAQRRVGTTDENAEKAWKLLIDKI 492

Query: 300 YN----CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
           Y     CT+G     R          PS+        +N+ K                  
Sbjct: 493 YVQYSLCTEGTLTNAR----------PSLTGHGNWTTKNWTK------------------ 524

Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
              Y+  +++ A  L + S  +      Y+YD++++ RQ L  Y   L     +AY+  D
Sbjct: 525 ---YNNRDLLEAWGLLLRS--KAITKIAYKYDIVNIGRQVLGNYFTVLRDEFTQAYERKD 579

Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
              +       L L+ D++ LL     FLLGPWL +A+ + +N E+ +
Sbjct: 580 ISALTIKGNEMLSLLNDLEALLYTSPSFLLGPWLTNAQNMGRNMEESR 627


>gi|449541596|gb|EMD32579.1| glycoside hydrolase family 89 protein [Ceriporiopsis subvermispora
           B]
          Length = 754

 Score =  271 bits (692), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 147/473 (31%), Positives = 262/473 (55%), Gaps = 38/473 (8%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G WGG LP  W++ Q  LQK+IL R+ ELGM P LPAF+G VP A+  ++P+A I
Sbjct: 199 FGNIQGSWGGALPMQWVNDQFALQKQILTRMTELGMTPALPAFTGFVPRAMSTLYPNASI 258

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENT 118
                W    +         L+  DPLF  + ++FI +Q + YG   +HIY  D ++EN 
Sbjct: 259 VNGSAWSGFPASL--TNVSFLEPFDPLFSTLQKSFITKQQQAYGTNVTHIYTLDQYNENN 316

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLG-K 176
           P   +  Y+SS+ A  ++ +++ D DA+W++QGWLF S + FW   +++A L  VP    
Sbjct: 317 PFSGNISYLSSVSAGTFASLRAADPDAIWMLQGWLFFSSETFWTDERIQAYLGGVPTNDS 376

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
           ++VLDL++E +P W+ +  ++G  ++WC LH + GN+ + G L++I  GP+ A +S+ ++
Sbjct: 377 MIVLDLYSEAQPQWNRTSSYFGKQWVWCELHGYGGNMGLEGNLNAITAGPIAALSSQGSS 436

Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVP-AIQDAWNV 294
           M G+G++MEG E N +VYD++ + A+    +D+ +++  +  RRY    +P A Q+AW +
Sbjct: 437 MKGMGLTMEGQEGNEIVYDVLLDQAWSSAPIDIASYVKSWVARRYTVEPLPSAAQEAWQI 496

Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
           L  TVYN  D     ++  I +  +++P++  +                       +  H
Sbjct: 497 LSTTVYNNQD---PNSQATIKSIYELEPTLTGLVN--------------------RTGHH 533

Query: 355 PHL--WYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
           P L  + + + V+ AL+L + +  +   L+A   + YD +D++RQ L+    + +  +++
Sbjct: 534 PTLIPYDTNTTVVPALQLLVKAKEQNAALAAIPEFVYDAVDVSRQLLSNRFIDAYTGLVD 593

Query: 410 AYQLNDA--HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
            Y   +A    V +  +  + ++  +D LLA ++ FLL  W+  A+  +  +E
Sbjct: 594 TYNNANATSDAVVRAGQPLMVILSQLDALLATNENFLLSSWIAQARNWSHGDE 646


>gi|340617022|ref|YP_004735475.1| alpha-N-acetylglucosaminidase [Zobellia galactanivorans]
 gi|339731819|emb|CAZ95084.1| Alpha-N-acetylglucosaminidase, family GH89 [Zobellia
           galactanivorans]
          Length = 747

 Score =  270 bits (690), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 158/466 (33%), Positives = 255/466 (54%), Gaps = 24/466 (5%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G GGPLPQ+W+ Q+  LQ KIL R+  LGM PVL AF+G+VP  L+ ++P A I 
Sbjct: 197 MGNIDGMGGPLPQNWITQRKELQVKILNRMRSLGMKPVLQAFTGHVPQVLKKLYPEANIF 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+ +W  V+       TY LD TD LF +IG AFI++Q + YG T H+Y+ D F E  PP
Sbjct: 257 QIEDWAGVEG------TYFLDPTDELFQKIGTAFIKKQTELYG-TDHLYDADCFIEVDPP 309

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQM-KALLNSVPLGKLVV 179
              P ++  +  ++Y  M+  DS A W++QGW F +   +   +  +A L+ +P  + +V
Sbjct: 310 SKDPAFLKQVSESVYKSMELADSKATWVLQGWFFFFKKDFWTKERGRAFLDGIPKNRAIV 369

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMV 238
           LDL+ E  P W  +  FYG P+IW ++ N    + M G L+ +     EA TSE    + 
Sbjct: 370 LDLYGEKNPTWDKTDAFYGQPWIWNVICNEDQKVNMSGDLEEMQRQFQEAYTSEIGNNLK 429

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           G+G+  EG+  NP+V D + E A+  +KV+V+ WI  Y+  RYG   P+++ AW +L  +
Sbjct: 430 GIGVIPEGLGYNPIVQDFIFEKAWDPQKVNVQEWIEDYATIRYGTKSPSVKKAWQLLGES 489

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VY    G T      ++      P ++   EG  ++  + V K+  + +ET     P  W
Sbjct: 490 VY----GRTRTMWSPLI----TTPRLMIFEEGSKEDI-RHVRKDFKI-TETD----PFAW 535

Query: 359 -YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            +   ++ +A  L +   NEL    TY +DL ++ R+ L    ++   ++  AYQ  D  
Sbjct: 536 DFDVYKLAKAAGLLLGEANELQDVETYNFDLTNVYRELLFSLTHKSINDVSVAYQEKDRQ 595

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
            + + ++   +L++D++ +   ++ FLLG WLE AK      E+++
Sbjct: 596 ALDRSAKSLFKLMDDLEAITGANENFLLGKWLEDAKSWGSTPEEKE 641


>gi|71001188|ref|XP_755275.1| alpha-N-acetylglucosaminidase [Aspergillus fumigatus Af293]
 gi|66852913|gb|EAL93237.1| alpha-N-acetylglucosaminidase, putative [Aspergillus fumigatus
           Af293]
 gi|159129357|gb|EDP54471.1| alpha-N-acetylglucosaminidase, putative [Aspergillus fumigatus
           A1163]
          Length = 756

 Score =  269 bits (688), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 152/468 (32%), Positives = 257/468 (54%), Gaps = 33/468 (7%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G WGG LP SW+D Q  LQKKI+ R+ ELGM PVLPAF+G VP A+  V P+A +  
Sbjct: 205 NIQGSWGGELPYSWIDSQFELQKKIVRRMVELGMTPVLPAFTGFVPRAVSRVLPNATVVN 264

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
              W   + D R+     L+  DP F+ + R+FI++Q + YG  +HIY  D ++EN P  
Sbjct: 265 GSRW--EEFDERYTSDTFLEPFDPSFMRLQRSFIKKQQQAYGNITHIYTLDQYNENAPYS 322

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-LVV 179
              +Y+ ++    +  ++S D +AVWLMQGWLF S   FW   ++KA L+ V + + ++V
Sbjct: 323 GDLDYLHNVTHNTWLSLKSADPNAVWLMQGWLFYSSSGFWTDERVKAYLSGVEVDQDMLV 382

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLF+E +P W  ++ +YG P+IWC LH++ GN+ +YG + ++     +A  + + ++VG
Sbjct: 383 LDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGLYGQVMNVTVNATQALAASD-SLVG 441

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-----GRSVP-AIQDAWN 293
            G++MEG E N ++YDL+ + A+  + +D   + + ++  RY     G +VP  +  AW+
Sbjct: 442 FGLTMEGQEGNEIMYDLLLDQAWSRQPIDTDHYFHNWAKTRYSSGVRGSAVPEELYQAWD 501

Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
           +L  T YN T+  +      I    ++ PSI  +      +              T SYD
Sbjct: 502 ILRITAYNNTNLTSTAVSKSIF---ELQPSISGLLNRTSHH------------PTTVSYD 546

Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
              L     +  R ++   +    L +   + YD++D+TRQ ++     ++ N++  YQ 
Sbjct: 547 PAAL----VQAWRLMDSAASKAPSLWSQPAFLYDMVDITRQVMSNAFIPVYTNLVSTYQA 602

Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
             +  V       ++L+ D+D +L+ +D F L  W++SA+   +N+ +
Sbjct: 603 GGS--VSTDGSNLIQLLRDLDSVLSTNDNFRLSTWIQSARSWVRNDTE 648


>gi|449541595|gb|EMD32578.1| glycoside hydrolase family 89 protein [Ceriporiopsis subvermispora
           B]
          Length = 752

 Score =  269 bits (688), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 152/471 (32%), Positives = 263/471 (55%), Gaps = 33/471 (7%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G WGG LP  W++ Q  LQK+IL R+ ELGM PVLPAF+G VP A+  V  +A I
Sbjct: 196 FGNIQGSWGGELPMQWVNDQFALQKQILARMTELGMTPVLPAFTGFVPRAMSTVHSNASI 255

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENT 118
                W +    P       L+  DPLF  + ++FI +Q + YG   SHIY  D ++EN 
Sbjct: 256 VNGSQW-APGFPPSLTNVSFLEPFDPLFATLQKSFIAKQQEAYGANISHIYTLDQYNENN 314

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLG-K 176
           P   +  Y+SS+    ++ +++ D DAVW++QGWLF S + FW   +++A L  VP    
Sbjct: 315 PFSGNLSYLSSISEGTFTSLRAADPDAVWMLQGWLFFSSEAFWTNERIEAYLGGVPTNDS 374

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
           ++VLDL++E +P W+ +  ++G  ++WC LH++ G I + G LD+I  GP+ A  S  ++
Sbjct: 375 MIVLDLYSEAQPQWNRTSSYFGKQWVWCELHDYGGTIGLEGNLDAITTGPIAALNSPGSS 434

Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVP-AIQDAWNV 294
           M G+G++MEG E N +VYDL+ + A+    +++ +++  +  RRY    +P A Q+AW +
Sbjct: 435 MKGMGLTMEGQEGNEIVYDLLLDQAWSSSPINIASYVKGWVSRRYLVEPLPSAAQEAWRI 494

Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
           L  TVYN  D     ++  I    +++P +  +           V++  +L +    YD 
Sbjct: 495 LSTTVYNNQD---PNSQSTIKNIYELEPVLTGL-----------VNRTGILPT-VIPYD- 538

Query: 355 PHLWYSTSEVIRALELFIASGNELSASNT---YRYDLIDLTRQALAKYANELFLNIIEAY 411
                + S ++ AL+L + +  + +A +T   + +D++D++RQ L+    + +  +I+ Y
Sbjct: 539 -----TNSTIVPALQLLVKAKAQNAALSTVPEFVHDVVDVSRQLLSNRFIDAYTALIDTY 593

Query: 412 QLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
              +  +  V +  +  + ++  +D LLA ++ FLL  W+  A+ L+  +E
Sbjct: 594 NNTNVTSDAVIRAGQPLMTILSQLDALLATNENFLLSSWIAQARNLSHGDE 644


>gi|393782608|ref|ZP_10370791.1| hypothetical protein HMPREF1071_01659 [Bacteroides salyersiae
           CL02T12C01]
 gi|392672835|gb|EIY66301.1| hypothetical protein HMPREF1071_01659 [Bacteroides salyersiae
           CL02T12C01]
          Length = 761

 Score =  265 bits (678), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 161/466 (34%), Positives = 232/466 (49%), Gaps = 48/466 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGPLP+S +D+   L KKI+ R  ELGM P+   FSG VP  L++ +P+A I 
Sbjct: 199 MQNLQSYGGPLPKSVIDRHAALGKKIIARQLELGMQPIQQGFSGYVPRELKDKYPTANIN 258

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q  +W   K   +      LD TD LF  +GR F+E+Q + +G    +Y  D F E+ PP
Sbjct: 259 QQRSWCGFKGAAQ------LDPTDSLFTRMGRVFLEEQARLFG-AHGVYAADPFHESVPP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           VD+PEY+ ++G  I+   +  D  + W MQ W             +A++ +VP   L++L
Sbjct: 312 VDTPEYLKAVGETIHRLFREFDPQSTWAMQSWSLR----------EAIVKAVPKEALLIL 361

Query: 181 DLFAEVKPIWSTSK-QFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           DL        STSK +F+G P +   LHNF G I M+G L  +A          N  + G
Sbjct: 362 DLRGS-----STSKAEFWGYPTVVGNLHNFGGRINMHGDLALLASNQYSKAKRLNPAVCG 416

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            G+ ME IEQNPV Y+L  EM    + +D++AW+ QY+ RRYG   PA Q AW +L    
Sbjct: 417 SGLFMEAIEQNPVYYELAFEMPCHPDSIDLRAWLKQYATRRYGAFSPATQKAWMLLLEGP 476

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y      T+K+  ++ A P +D                 V K          YD P L  
Sbjct: 477 YRQGTNGTEKS-SIVAARPALD-----------------VKKSGPNAGLEIPYD-PAL-- 515

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
               +IRA  L +   ++LSAS  YR+DL+D+ RQ +      +     EA++  D    
Sbjct: 516 ----IIRAQSLLLEDADKLSASRPYRFDLVDVQRQMMTNLGQLIHRKAAEAFRSKDREAF 571

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
              S RFL ++ DMD LL     +    WL  A+   + EE++ Q+
Sbjct: 572 TLHSGRFLGMLADMDTLLRTRSEYSFDRWLTEARSWGETEEEKNQM 617


>gi|298385999|ref|ZP_06995556.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
           1_1_14]
 gi|298261227|gb|EFI04094.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
           1_1_14]
          Length = 715

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 149/464 (32%), Positives = 243/464 (52%), Gaps = 45/464 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL   GGPLP  W  +Q+VLQKKIL R+ E GM PV   F G VP+ L+  +P A++ 
Sbjct: 194 MGNLENIGGPLPDEWFKEQIVLQKKILARMREYGMKPVFQGFFGMVPSLLKEKYPEARLV 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
           + G W S++  P      +LD  DPLF  + + +  +  K YG+ + ++  D F E   T
Sbjct: 254 EQGLWNSLQRPP------VLDPADPLFERMAKVWYAEYEKLYGK-ADLFGGDLFHEGGKT 306

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D    ++     + + M+  + DA W++Q WL +       P+ K LL  +     +
Sbjct: 307 GGID----VTDAARRVQTAMKRYNPDATWVIQAWLGN-------PK-KELLAGLDRKNTL 354

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR--TSENTT 236
           ++DL AE    W   K F G P++W  + N+ GNI ++G LD+IA GPV+ +  ++ + +
Sbjct: 355 IVDLAAEFWDNWRKRKGFDGFPWLWSHISNYGGNIGLHGRLDAIATGPVDGQKDSAASPS 414

Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
           M G   + EGIE NPVV+DL++EM ++ E +D+  W+ +YSVRRYG     +++AW + +
Sbjct: 415 MKGTSSTPEGIEVNPVVFDLLNEMRWRSEHLDLDVWLKEYSVRRYGVEDENLKEAWTIFH 474

Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
            T Y    G    +  V  A P +    I+                       S++    
Sbjct: 475 RTAYGTYTGHRRPSESVFCAPPSLKRDKITA----------------------SAWSQCR 512

Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
           ++Y      + + LF+ S + L  ++TY+YD +D  RQ LA    E + N+++AY+  D 
Sbjct: 513 IFYDPELFAQGVGLFLQSADRLKQTSTYQYDAVDFVRQYLADLGRETYYNLVDAYRAKDT 572

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
                 S RFL+L++D + LL+ H+ F +G WL+ A+  ++  E
Sbjct: 573 KQFDYWSERFLQLIKDQNELLSTHERFFVGRWLDMARLKSKQPE 616


>gi|346324333|gb|EGX93930.1| alpha-N-acetylglucosaminidase, putative [Cordyceps militaris CM01]
          Length = 751

 Score =  265 bits (677), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 149/465 (32%), Positives = 250/465 (53%), Gaps = 37/465 (7%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G WGG LPQ+W++ Q  LQKKI+ R+ ELGM P+LPAF G VP  +  V+P+  +
Sbjct: 201 FGNIQGSWGGDLPQAWIEDQFELQKKIVKRMIELGMTPILPAFPGFVPENITRVWPNVSL 260

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
            +   W       R+     +   DP F E+ +AF+ +Q + YG  +  +  D F+EN P
Sbjct: 261 AESPIWSGFSG--RFTADKYITPYDPHFAELQKAFLTKQNEAYGNVTSFWTLDQFNENKP 318

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK-L 177
                +Y+ ++    +  +++ D  AVW+MQGWLF+ D  +W   ++K+ L+ VP+ + +
Sbjct: 319 ASGELDYLKNVSHNTWQTLKAADPSAVWVMQGWLFASDKTYWIDDRVKSFLDGVPVNEDM 378

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LDLFAE  P W  ++ FYG P+IWC LH++ GN+ +YG ++++    VEA    + ++
Sbjct: 379 LLLDLFAESTPQWQRTESFYGKPWIWCQLHDYGGNMGLYGQIENVTKNAVEA-VQTSKSI 437

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG---RSVPA-IQDAWN 293
           VG G+SMEG E N ++YDL+ + A++ E ++   + + +   RYG   + +P  +  AW+
Sbjct: 438 VGFGLSMEGQEGNEIMYDLLLDQAWRKEAIETDKYFSDWVTVRYGADHKEIPENLYTAWD 497

Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
            +  TVYN TD +      V  +  ++ PSI  +                       +  
Sbjct: 498 KVRSTVYNNTDSSVTA---VTKSIFELAPSISGLV--------------------NRTGH 534

Query: 354 HP-HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
           HP  + Y T  +I A     ++G++   L  +  YRYDL D TRQ LA      +  ++E
Sbjct: 535 HPTKITYDTKTLISAWNDMFSAGDQARWLFDNEAYRYDLTDWTRQVLANAFEATYNKLVE 594

Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 454
            Y+ N+  GV     R   +++ MD +L  +  F L  W+++A++
Sbjct: 595 KYKSNNTKGVKCAGDRLQAILQTMDQVLDTNPSFKLSTWIQAARK 639


>gi|404406438|ref|ZP_10998022.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
          Length = 726

 Score =  263 bits (673), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 153/457 (33%), Positives = 240/457 (52%), Gaps = 37/457 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLPQSW+D QL LQ++I+ R  ELG+ PV  +F+G+VP AL+ +FP A I 
Sbjct: 190 MANIDAWHGPLPQSWIDGQLELQRRIIARERELGIQPVFTSFTGHVPKALKTLFPDADIE 249

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L  W S +   R   +Y L+  +PLF  I +A++++Q + +G +S +Y  D F+E  PP
Sbjct: 250 RLNPWTSFE---RPYNSYYLNPAEPLFNRIQQAYMQEQRRLFGESS-VYGVDPFNELDPP 305

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
              PEY++      Y  +   D DAVWL   W+F +    W P ++KA L +VP GKL++
Sbjct: 306 NWDPEYLARAARLTYESITQFDKDAVWLQMAWVFYHKRRDWTPERLKAYLCAVPDGKLLM 365

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + +   +W +++ FYG P+IW  L NF GN  + G +  ++     A        VG
Sbjct: 366 LDYYCDKVELWRSTESFYGQPFIWSYLGNFGGNTMLAGDVKDVSRKLDRAYAEAGRNFVG 425

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG++ NP +Y+ + + A+  +  D   WI++ + R  GR     + AW +LY  +
Sbjct: 426 IGCTLEGLDVNPFMYEYVLDRAWT-QLYDDAGWIDRLADRHSGRIDVHYRQAWRILYDKI 484

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y    G    NR   V                     +P  K        S +  PHL Y
Sbjct: 485 YCAPSG----NRSAAVC-------------------ARPNMK------GRSKWSGPHLDY 515

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              +++R  E    +  E +AS+  R+D +++ RQ L  Y   L    I A +  D   V
Sbjct: 516 DNRDLLRVWEQLTLARPERTASS--RFDCVNIPRQCLENYFGNLNERCIAACRGGDRETV 573

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
            +LS R LEL++D+D L+A    FLLG W+  A+++ 
Sbjct: 574 ARLSARLLELLDDIDRLVAADAYFLLGKWIADARRMG 610


>gi|291515668|emb|CBK64878.1| Alpha-N-acetylglucosaminidase (NAGLU) [Alistipes shahii WAL 8301]
          Length = 713

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 159/459 (34%), Positives = 233/459 (50%), Gaps = 36/459 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W GPLP+ W+D QL LQ++IL R  ELGM PVLPAF+G+VP  L+ + P A+IT
Sbjct: 179 MSNIDRWQGPLPEEWIDGQLALQQRILARERELGMKPVLPAFAGHVPQELKRLHPDARIT 238

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W     D R+ C++ LD  DPLF  I R F+ +Q + +G T HIY  D F+E   P
Sbjct: 239 RVSYWGGF--DDRYRCSF-LDPMDPLFAVIQREFLTEQTRLFG-TGHIYGADPFNEIDAP 294

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PE ++ +   IY  M   D +AVWL  GWLF  DP  W    ++A L +VP  +L++
Sbjct: 295 TWDPETLAGMSRHIYESMAEVDPEAVWLQMGWLFYADPTHWTAENIRAFLGAVPQDRLLM 354

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD F E   IW  +++F+G PY+WC L NF GN  + G   +++    +A       + G
Sbjct: 355 LDYFCEFTEIWKQTEKFHGQPYLWCYLGNFGGNTMLSGNFHTVSARMEDAFAHGGDNLRG 414

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VG ++EG   N  +Y+ + + A+     D   WI + + RR G   PA +  W  L  +V
Sbjct: 415 VGSTLEGFGVNQFMYEFVLDKAWNTGIAD-DEWIARLADRRTGFRDPAARTGWRTLCDSV 473

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y  T  A      +  A P +        EG +    KP          T+ Y  P LW 
Sbjct: 474 Y--TLPAQTGQSPLTNAHPAL--------EGNWHWTTKP----------TTGYRFPTLW- 512

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
                 R  E  +A  +E    +TYR+D++++ RQ L  Y          AY  +D   +
Sbjct: 513 ------RVWEELLAVDSE---RDTYRFDVVNIGRQVLGDYFLIERDRFAAAYAQHDRKAM 563

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
              +RR   L+ D++ L ACH  F L  W+ +A+    +
Sbjct: 564 DAAARRMTGLLADINLLTACHPEFSLERWIAAARGFGSD 602


>gi|281210062|gb|EFA84230.1| hypothetical protein PPL_03307 [Polysphondylium pallidum PN500]
          Length = 744

 Score =  262 bits (669), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 159/467 (34%), Positives = 251/467 (53%), Gaps = 48/467 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GWGG LPQSW+  Q  LQ KIL R+ E GM+PV P F+G+VP A +  +PSA I 
Sbjct: 214 MGNLDGWGGVLPQSWIKGQHELQIKILKRMSEYGMSPVFPGFAGHVPVAFKQFYPSANIV 273

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHI--YNCDTFDENT 118
           +L +W    +      T  L  TDP++  +   F + Q + YG  + I  ++ D F+E  
Sbjct: 274 ELPSWHGFNA------TNHLLTTDPMYDIVADRFYQVQNEIYGAYAKIDYFSIDPFNELI 327

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
           PP +S ++++   + I++ +   + D+ W++Q W  +   FW   Q+ + L  VP+G+L+
Sbjct: 328 PPSNSSQFLNECSSRIFNAINRFNPDSTWVLQNWFLN-SAFWGDGQVASFLGGVPIGRLI 386

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
           VLDL++E+KP+W+ +  + G  +IW MLHNF G   + G +  IA  P+EA+ S + TMV
Sbjct: 387 VLDLWSELKPLWNRTANYQGHKWIWNMLHNFGGRPTISGRMPIIANEPLEAKAS-SPTMV 445

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           G+G++ E IEQN +VYDLMSEM ++    D+  W++ Y  RRYG ++P ++  W +L +T
Sbjct: 446 GIGLTPEAIEQNVIVYDLMSEMGWRSRSFDLNLWVDAYVTRRYGVNLPNLKPVWKMLAYT 505

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VY                 P+  P+          NY   ++K+  L  +   Y +P + 
Sbjct: 506 VYFS---------------PNRSPA----------NY---IAKKPSLDFQLGLYYNPVV- 536

Query: 359 YSTSEVIRAL-ELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
                ++ A  EL       + +S TYRYDL ++T QAL+ Y N     + ++Y  +D  
Sbjct: 537 -----IVDAWRELLAVDSTIVRSSETYRYDLAEITLQALSNYFNGNLKQLYQSYYASDFQ 591

Query: 418 GVFQLSRRFLEL-VEDMDGLLACHDGFLLGPWLESAKQLA-QNEEQE 462
             FQ +R+     +  MD +        LG W   A++ A  N E+E
Sbjct: 592 -TFQSARQNCSFALRAMDAVADTVQLLKLGKWTADARKWATDNNERE 637


>gi|380692804|ref|ZP_09857663.1| putative alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
          Length = 709

 Score =  261 bits (668), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 148/464 (31%), Positives = 238/464 (51%), Gaps = 45/464 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL   GGPLP  W  +Q VLQKKIL R+ E GM PV   F G VP++L+  +P A + 
Sbjct: 188 MGNLENIGGPLPDEWFKEQTVLQKKILARMREYGMKPVFQGFFGMVPSSLKEKYPEAHLV 247

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
           + G W S++  P      +LD  DPLF ++ + +  +  K YG+ + ++  D F E   T
Sbjct: 248 EQGLWNSLQRPP------VLDPADPLFEQMAKVWYTEYEKLYGK-ADLFGGDLFHEGGKT 300

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D    ++     + + M+  + DA W++Q WL +       P+ K LL  +     +
Sbjct: 301 GGID----VTDAARRVQTAMKQYNPDATWVIQAWLGN-------PK-KELLAGLDRKHTL 348

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART--SENTT 236
           ++DL AE    W   K F G P++W  + N+  NI ++G LD+IA GP++ R     + +
Sbjct: 349 IVDLAAEFWDNWRKRKGFDGFPWLWSHISNYGANIGLHGRLDAIATGPIDGRKDPEASPS 408

Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
           M G   + EGIE NPVV+DL++EM ++ E +D+  W+ +YSVRRYG     ++ AW + +
Sbjct: 409 MKGTSSTPEGIEVNPVVFDLLNEMRWRSEYLDIDTWLKEYSVRRYGAEDENLKKAWIIFH 468

Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
            T Y    G    +  V  A P +    I+                       S++    
Sbjct: 469 RTAYGTYSGHRRPSESVFCAPPSLKRDKITA----------------------SAWSQCR 506

Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
           ++Y      + + LF+ S + L  ++TY+YD +D  RQ LA    E + N+++AY+  D 
Sbjct: 507 IFYDPDLFAQGVGLFLQSADHLKQTSTYQYDAVDFVRQYLADLGREAYYNLVDAYRAKDT 566

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
                 S RFL+L++D + LL+ H  F +G WL+ A+  ++  E
Sbjct: 567 KQFDYWSERFLQLIKDQNELLSTHKCFFVGRWLDMARSKSKQPE 610


>gi|313203962|ref|YP_004042619.1| alpha-N-acetylglucosaminidase [Paludibacter propionicigenes WB4]
 gi|312443278|gb|ADQ79634.1| Alpha-N-acetylglucosaminidase [Paludibacter propionicigenes WB4]
          Length = 738

 Score =  261 bits (666), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 162/457 (35%), Positives = 241/457 (52%), Gaps = 35/457 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  WGGPLP S+++ Q  LQ+ IL R   LGM P+L AF+G+VP  L+ + PSAKIT
Sbjct: 196 MANMDKWGGPLPISYIEGQKKLQQHILQRSRALGMKPILSAFAGHVPEQLKTLRPSAKIT 255

Query: 61  QLG-NWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
           ++   W  + ++     TY LD TD LF EI + F+  Q K YG T H+Y+ D F+E TP
Sbjct: 256 RIEPGWGGMAAE---YTTYFLDPTDNLFGEIQKRFLTVQQKLYG-TDHLYSADPFNEITP 311

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLV 178
           P   P+Y++++G  IY  M   D +A+W    W F  DP  W  P++ A++++VP GKL 
Sbjct: 312 PSWEPDYLANVGKTIYETMSQVDKEAIWYQMSWTFYNDPTHWTRPRLSAMIHAVPQGKLF 371

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
            LD   E +  +  S  FYG P+IWC L NF  N  +   L+ +       + +  +  V
Sbjct: 372 FLDYNCEEEEFFRKSDNFYGAPFIWCYLGNFGANTHLVAPLNKVV--NRLGKLTYGSACV 429

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQ-HEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
           GVG ++EGI  NP +Y+ + EM ++  E V     I  Y+ RR G    A+ +AW +L  
Sbjct: 430 GVGSTLEGINVNPEIYETVLEMPWRADETVTADTLIRHYAERRAGARDKAVIEAWQLLRQ 489

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
            V    D A       +V    V P +  +T   +                     +P +
Sbjct: 490 HV--LVDTAVGIWNHCVVF--QVSP-VTDLTRAFWAT-------------------NPKI 525

Query: 358 WYSTSEVIRAL-ELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
            Y   ++  AL  +F AS N    ++ YR+D+++LTRQAL  Y   L+  ++EAY   + 
Sbjct: 526 PYRNVDLAIALNRMFQASANS-KKTDAYRFDVVNLTRQALGNYGTVLYHKMMEAYSRKNL 584

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
               + S  FL+L +++DGLLA    FLLG WL  A+
Sbjct: 585 IDFRKYSGEFLQLGQEIDGLLATRHEFLLGKWLADAR 621


>gi|295086519|emb|CBK68042.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
           XB1A]
          Length = 727

 Score =  261 bits (666), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 147/460 (31%), Positives = 229/460 (49%), Gaps = 35/460 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSNL  W GPLP+ WLD Q  LQK+I+ R  +  M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 186 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 245

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++ +W   +   R   ++ LD  DPLF  I + F+E+Q K +G T HIY  D F+E  PP
Sbjct: 246 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 301

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PE++++    IY  M   D DA WL   WLF  D   W   +++A L +VP  KL++
Sbjct: 302 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQDKLLL 361

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + E   +W  + +++G PY+WC L NF GN  + G    +        T+      G
Sbjct: 362 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 421

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG + NP +Y+ +   A+     D   WI Q + RR G     ++ AW +LY ++
Sbjct: 422 LGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQLADRRIGLRNQQMRRAWKLLYDSI 480

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                     P+ +          G  ++    LK   +    P + Y
Sbjct: 481 YTV-------------------PAALGQ--------GALMNARPCLKGNGNWTTTPTVAY 513

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
           S   +    E+ + +G    ++  Y YD++++ RQ L  Y  +L     EAY       +
Sbjct: 514 SNETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVLGNYFGKLRDEFAEAYSRKQLPLL 571

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
            Q      +L+ D+D LL+    FLLG W+E A+ L  +E
Sbjct: 572 KQKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLGTDE 611


>gi|336374066|gb|EGO02404.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 761

 Score =  260 bits (664), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 150/476 (31%), Positives = 259/476 (54%), Gaps = 40/476 (8%)

Query: 7   WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNW- 65
           WGG LP+ W++ Q  LQK+I+ R+ ELGM PVLP+F+G VP A+  ++P+A I     W 
Sbjct: 214 WGGDLPEQWINDQFALQKQIISRMVELGMTPVLPSFTGFVPRAMHTLYPNASIVNGSQWN 273

Query: 66  -FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSP 124
            F+++    +     L+  DPLF  +  +FI +Q+  YG  SH+Y  D ++EN+P     
Sbjct: 274 GFTIQ----YTNDSFLEPFDPLFSTLQTSFISKQVAAYGNVSHVYTLDQYNENSPYSGDT 329

Query: 125 EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLG-KLVVLDL 182
            Y++++ AA ++ +++ D  AVWLMQGWLF  D  FW   +++A L  VP    +++LDL
Sbjct: 330 SYLANVTAATFASLRAADPQAVWLMQGWLFYSDSTFWTTERVEAYLGGVPGNDSMIILDL 389

Query: 183 FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGM 242
           ++E +P W     ++G  +IWC LH++ GN+   G  +++   P++A  +   +MVG+G+
Sbjct: 390 YSEAQPQWQRLNSYFGKQWIWCELHDYGGNMGFEGNFENVTTQPIKALATPGNSMVGMGL 449

Query: 243 SMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVP----AIQDAWNVLYHT 298
           +MEG E N ++YD++ + A+    ++  A+I+ ++ RRY  +VP    A  +AW +L  T
Sbjct: 450 TMEGQEGNEIIYDVLLDQAWSSTPLNRTAYISAWASRRY--NVPDLPTAALEAWEILGAT 507

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VYN  D  T      I+   ++ PSI                    L + T ++     +
Sbjct: 508 VYNNQDVTTQSTVKSIL---ELSPSITG------------------LVNRTGTHSTKLFY 546

Query: 359 YSTSEVIRALELFIASGNELSA-SNT--YRYDLIDLTRQALAKYANELFLNIIEAY--QL 413
            + + ++ AL+L + +  E SA SN   ++YD++D+TRQ LA    +L+ ++I+ +    
Sbjct: 547 DTNTTIVPALKLLLQARQEASALSNIPEFQYDVVDVTRQLLANRFIDLYTSLIDTFSSTS 606

Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVRCPY 469
           + +  V       L L++D+D +L     FLL  W+ +A+     +         Y
Sbjct: 607 SSSSAVSAAGAPLLALLQDLDSVLLTDTHFLLARWISAARNWTHGDNATYAAYLEY 662


>gi|336386984|gb|EGO28130.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 738

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 150/476 (31%), Positives = 259/476 (54%), Gaps = 40/476 (8%)

Query: 7   WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNW- 65
           WGG LP+ W++ Q  LQK+I+ R+ ELGM PVLP+F+G VP A+  ++P+A I     W 
Sbjct: 191 WGGDLPEQWINDQFALQKQIISRMVELGMTPVLPSFTGFVPRAMHTLYPNASIVNGSQWN 250

Query: 66  -FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSP 124
            F+++    +     L+  DPLF  +  +FI +Q+  YG  SH+Y  D ++EN+P     
Sbjct: 251 GFTIQ----YTNDSFLEPFDPLFSTLQTSFISKQVAAYGNVSHVYTLDQYNENSPYSGDT 306

Query: 125 EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLG-KLVVLDL 182
            Y++++ AA ++ +++ D  AVWLMQGWLF  D  FW   +++A L  VP    +++LDL
Sbjct: 307 SYLANVTAATFASLRAADPQAVWLMQGWLFYSDSTFWTTERVEAYLGGVPGNDSMIILDL 366

Query: 183 FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGM 242
           ++E +P W     ++G  +IWC LH++ GN+   G  +++   P++A  +   +MVG+G+
Sbjct: 367 YSEAQPQWQRLNSYFGKQWIWCELHDYGGNMGFEGNFENVTTQPIKALATPGNSMVGMGL 426

Query: 243 SMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVP----AIQDAWNVLYHT 298
           +MEG E N ++YD++ + A+    ++  A+I+ ++ RRY  +VP    A  +AW +L  T
Sbjct: 427 TMEGQEGNEIIYDVLLDQAWSSTPLNRTAYISAWASRRY--NVPDLPTAALEAWEILGAT 484

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VYN  D  T      I+   ++ PSI                    L + T ++     +
Sbjct: 485 VYNNQDVTTQSTVKSIL---ELSPSITG------------------LVNRTGTHSTKLFY 523

Query: 359 YSTSEVIRALELFIASGNELSA-SNT--YRYDLIDLTRQALAKYANELFLNIIEAY--QL 413
            + + ++ AL+L + +  E SA SN   ++YD++D+TRQ LA    +L+ ++I+ +    
Sbjct: 524 DTNTTIVPALKLLLQARQEASALSNIPEFQYDVVDVTRQLLANRFIDLYTSLIDTFSSTS 583

Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVRCPY 469
           + +  V       L L++D+D +L     FLL  W+ +A+     +         Y
Sbjct: 584 SSSSAVSAAGAPLLALLQDLDSVLLTDTHFLLARWISAARNWTHGDNATYAAYLEY 639


>gi|400599317|gb|EJP67021.1| alpha-N-acetylglucosaminidase, putative [Beauveria bassiana ARSEF
           2860]
          Length = 753

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 146/464 (31%), Positives = 242/464 (52%), Gaps = 35/464 (7%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G WGG LPQ+W+D Q  LQ+KI+ R+ ELGM P+LPAF G VP  +  V+P+  +
Sbjct: 201 FGNIQGSWGGDLPQAWIDDQFALQRKIIKRMVELGMTPILPAFPGFVPENITRVWPNVSL 260

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
            +   W       R+     +   DP F E+ +AF+ +Q + YG  +  +  D F+EN P
Sbjct: 261 AESPTWSGFSG--RFTADKYITPYDPRFAELQKAFLTKQNEAYGNVTSFWTLDQFNENKP 318

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGK-L 177
                 Y+ ++    +  ++  D  AVW+MQGWLF+ D  +W   ++K+ L+ VP+ + +
Sbjct: 319 ASGELGYLRNVSHNTWQTLKDADPSAVWVMQGWLFASDKAYWTDDRVKSFLDGVPVNEDM 378

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LDLFAE  P W  +  FYG P+IWC LH + GN+ +YG ++++    VEA   ++ ++
Sbjct: 379 LLLDLFAESTPQWQRTDSFYGKPWIWCQLHGYGGNMGLYGQIENVTRNAVEA-VQKSPSI 437

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG---RSVPA-IQDAWN 293
           VG+G+SMEG E N ++Y+L+ + A+  E ++   + + +   RYG   + +P  +  AW+
Sbjct: 438 VGLGLSMEGQEGNEIMYNLLLDQAWSKEALETDKYFSDWVTVRYGADQKEIPKDLYTAWD 497

Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
            +  TVYN TD +       + A       ++  T G     G   +K            
Sbjct: 498 KVRSTVYNNTDSS-------VTAVAKSIFELVPSTSGLVNRTGHHATK------------ 538

Query: 354 HPHLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEA 410
              + Y T  +I A      +G++   L  +  Y YDL D TRQ LA      +  ++E 
Sbjct: 539 ---ITYDTETLISAWNDMFNAGSQARWLFDNEAYSYDLTDWTRQVLANAFEATYNKLVEK 595

Query: 411 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 454
           Y+ N+  GV     R   ++  MD +L  +  F L  W+++A++
Sbjct: 596 YKSNNIKGVKCAGSRLQAILRTMDQVLETNVHFRLSTWIQAARK 639


>gi|242809019|ref|XP_002485282.1| alpha-N-acetylglucosaminidase, putative [Talaromyces stipitatus
           ATCC 10500]
 gi|218715907|gb|EED15329.1| alpha-N-acetylglucosaminidase, putative [Talaromyces stipitatus
           ATCC 10500]
          Length = 755

 Score =  258 bits (660), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 152/473 (32%), Positives = 252/473 (53%), Gaps = 38/473 (8%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G W G LP  W+D Q  LQKKI+ R+ ELGM P+LPAF G VP A+  V P A +
Sbjct: 201 FGNIQGSWSGSLPYDWVDSQFDLQKKIVKRMTELGMTPILPAFPGFVPRAITRVLPDADV 260

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
                W +  +   +     ++ TDP F EI ++FI +Q++ YG  +  Y  D F+EN P
Sbjct: 261 INGSAWEAFPT--MYTNDTFMEPTDPHFTEIQKSFIAKQIEAYGNVTTFYTLDQFNENNP 318

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPL-GKL 177
                 Y+ ++    +  +++ DS+AVW+MQGWLF S   FW   +++A L  V +   L
Sbjct: 319 SSGDLSYLRNVSQGTWKTLKAADSNAVWVMQGWLFTSNSAFWTNDRIEAYLGGVAVDSDL 378

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LDL +E  P W  +  +YG P+IWC +H++ GN+  YG + +I   P+ A    ++++
Sbjct: 379 LILDLASESSPQWQRTNSYYGKPWIWCEIHDYGGNMGFYGQVMNITNNPIAA-LHNSSSL 437

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVP-AIQDAWNV 294
           VG G+SMEG E N +VYDL+ + A+    +D +++ + +   RY   RS+P ++  AW++
Sbjct: 438 VGFGLSMEGQEGNEIVYDLLLDQAWNAAPIDTESYFHDWVTARYAGSRSIPSSVYSAWDI 497

Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
           L  TVYN T+ A +       A P     +I  T G     G                 H
Sbjct: 498 LRTTVYNNTNLAAN-------AVPKAIFELIPSTTGLLNRTGH----------------H 534

Query: 355 P-HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEA 410
           P  L Y+T+++++A  LF  S  +   L  +  + +DL+D++RQ LA     ++ N+I  
Sbjct: 535 PTKLNYNTADMVQAWNLFYTSAFKEPSLWLNPAFEFDLVDMSRQVLANAFIPVYENLIST 594

Query: 411 YQLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
           Y  ++  +  +  +    + +++ +D +LA +  F L  WL +A+  A ++  
Sbjct: 595 YNTSNPSSTKLQTIGAELIGILQALDTVLATNKNFKLSTWLSAARASAGSQHN 647


>gi|423292430|ref|ZP_17271008.1| hypothetical protein HMPREF1069_06051 [Bacteroides ovatus
           CL02T12C04]
 gi|423294620|ref|ZP_17272747.1| hypothetical protein HMPREF1070_01412 [Bacteroides ovatus
           CL03T12C18]
 gi|392661665|gb|EIY55241.1| hypothetical protein HMPREF1069_06051 [Bacteroides ovatus
           CL02T12C04]
 gi|392675811|gb|EIY69252.1| hypothetical protein HMPREF1070_01412 [Bacteroides ovatus
           CL03T12C18]
          Length = 727

 Score =  258 bits (660), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 146/460 (31%), Positives = 228/460 (49%), Gaps = 35/460 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSNL  W GPLP+ WLD Q  LQK+I+ R  +  M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 186 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 245

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++ +W   +   R   ++ LD  DPLF  I + F+E+Q K +G T HIY  D F+E  PP
Sbjct: 246 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 301

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PE++++    IY  M   D DA WL   WLF  D   W   +++A L +VP  KL++
Sbjct: 302 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQNKLLL 361

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + E   +W  + +++G PY+WC L NF GN  + G    +        T+      G
Sbjct: 362 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 421

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG + NP +Y+ +   A+     D   WI Q + RR G     ++ AW +LY ++
Sbjct: 422 LGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQLADRRIGLRNQQMRRAWKLLYDSI 480

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                     P+ +          G  ++    LK   +    P + Y
Sbjct: 481 YTA-------------------PAALGQ--------GTLMNARPCLKGNGNWTTTPTVAY 513

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
           S   +    E+ + +G    +  TY YD++++ RQ L  Y  +L     E Y       +
Sbjct: 514 SNETLFEVWEMLLKAGEHRHS--TYEYDVVNIGRQVLGNYFGKLRDEFAETYSRKQLPLL 571

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
            Q      +L+ D++ LL+    FLLG W+E A+ L  +E
Sbjct: 572 KQKGAEMKQLLRDVNTLLSTQSSFLLGKWIEDARSLGIDE 611


>gi|160883168|ref|ZP_02064171.1| hypothetical protein BACOVA_01137 [Bacteroides ovatus ATCC 8483]
 gi|156111393|gb|EDO13138.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
           8483]
          Length = 737

 Score =  258 bits (659), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 146/460 (31%), Positives = 228/460 (49%), Gaps = 35/460 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSNL  W GPLP+ WLD Q  LQK+I+ R  +  M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 196 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++ +W   +   R   ++ LD  DPLF  I + F+E+Q K +G T HIY  D F+E  PP
Sbjct: 256 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PE++++    IY  M   D DA WL   WLF  D   W   +++A L +VP  KL++
Sbjct: 312 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQNKLLL 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + E   +W  + +++G PY+WC L NF GN  + G    +        T+      G
Sbjct: 372 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 431

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG + NP +Y+ +   A+     D   WI Q + RR G     ++ AW +LY ++
Sbjct: 432 LGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQLADRRIGLRNQQMRRAWKLLYDSI 490

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                     P+ +          G  ++    LK   +    P + Y
Sbjct: 491 YTA-------------------PAALGQ--------GTLMNARPCLKGNGNWTTTPTVAY 523

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
           S   +    E+ + +G    +  TY YD++++ RQ L  Y  +L     E Y       +
Sbjct: 524 SNETLFEVWEMLLKAGEHRHS--TYEYDVVNIGRQVLGNYFGKLRDEFAETYSRKQLPLL 581

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
            Q      +L+ D++ LL+    FLLG W+E A+ L  +E
Sbjct: 582 KQKGAEMKQLLRDVNTLLSTQSSFLLGKWIEDARSLGIDE 621


>gi|237719130|ref|ZP_04549611.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
 gi|229451509|gb|EEO57300.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
          Length = 737

 Score =  257 bits (657), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 149/460 (32%), Positives = 232/460 (50%), Gaps = 35/460 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSNL  W GPLP+ WLD Q  LQK+I+ R  +  M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 196 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++ +W   +   R   ++ LD  DPLF  I + F+E+Q K +G T HIY  D F+E  PP
Sbjct: 256 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PE++++    IY  M   D DA WL   WLF  D   W   +++A L +VP  KL++
Sbjct: 312 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQDKLLL 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + E   +W  + +++G PY+WC L NF GN  + G    +        T+      G
Sbjct: 372 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 431

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG + NP +Y+ +   A+     D   WI Q + RR G     ++ AW +LY ++
Sbjct: 432 LGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQLADRRIGLRNQQMRRAWKLLYDSI 490

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                     P+ +   +G   N  +P  K     + TS+     + Y
Sbjct: 491 YTA-------------------PAALG--QGTLMN-ARPCLKGNGNWTTTST-----VAY 523

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
           S   +    E+ + +G    ++  Y YD++++ RQ L  Y  +L     EAY       +
Sbjct: 524 SNETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVLGNYFGKLRDEFAEAYSRKQLPLL 581

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
            Q      +L+ D+D LL+    FLLG W+E A+ L  +E
Sbjct: 582 KQKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLGIDE 621


>gi|340520426|gb|EGR50662.1| glycoside hydrolase family 89 [Trichoderma reesei QM6a]
          Length = 747

 Score =  257 bits (657), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 147/461 (31%), Positives = 256/461 (55%), Gaps = 33/461 (7%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G WGG LP+SW+D+Q  LQ KIL R+ ELG+ P+LPAF G VP  +  VFP   +
Sbjct: 201 FGNIQGSWGGTLPRSWVDEQFSLQLKILKRMEELGITPILPAFPGFVPRNISRVFPDISL 260

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
           +    W +  +         ++  DP F ++ + FI +Q + YG  ++ +  D F+EN P
Sbjct: 261 STSPIWSNFGTTL--SADIYINPFDPRFAQLQKLFINKQQELYGNVTNFWTLDQFNENRP 318

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGK-L 177
                +Y+ ++    ++ +++ D +AVW+MQ WLFS D  FW   +++ALL  VP+ + +
Sbjct: 319 LSGDLDYLRNVSHNTWAALKAADPEAVWVMQAWLFSSDSSFWTNDRVEALLGGVPVNQDM 378

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTT 236
           ++LDLFAE  P W  +  FYG P+IWC LHN+ GN+ +YG ++++    ++A R S+  +
Sbjct: 379 LLLDLFAESAPQWQRTDSFYGKPWIWCELHNYGGNMGLYGQIENVTINSMDAVRNSD--S 436

Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVPAIQDAWNVL 295
           +VG G++MEG E N ++YDL+ + A+  + +D   + + +   RYG ++V  +   W +L
Sbjct: 437 IVGFGLTMEGQEGNEIMYDLLLDQAWSPKPIDTDTYFHDWVSARYGAKNVKGLYKGWEML 496

Query: 296 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
             TV+N T+   +  +  I+   ++ PSI  +  G+   +G  +  +  +  E  S    
Sbjct: 497 RPTVFNNTNLTVNAVQKSIL---ELTPSISGLL-GRTGRHGTTIMYDPAVMVEAWS---- 548

Query: 356 HLWYSTSEVIRALELFIASGNELSASN--TYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
                        ELF A   +L+  N  +Y+YDL+D TRQ L     + + ++++AY  
Sbjct: 549 -------------ELFKAGLQDLTLFNNPSYQYDLVDWTRQVLVNSFEDHYKDLVDAYNK 595

Query: 414 NDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
           + +  V +    + + L++ +D +LA +  F L PW++ A+
Sbjct: 596 SSSPTVIRTRGAKLVTLLKTLDAVLATNKNFQLTPWIDRAR 636


>gi|295087651|emb|CBK69174.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
           XB1A]
          Length = 703

 Score =  257 bits (657), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 151/472 (31%), Positives = 239/472 (50%), Gaps = 39/472 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLP SWL  Q  LQK+I+ R   LGM PVLPAFSG+VPA L+ ++P A IT
Sbjct: 169 MSNVDYWQSPLPLSWLKNQRKLQKQIVDRERLLGMTPVLPAFSGHVPAELKRLYPDAAIT 228

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+  W       R   ++ +D  DPLF +I + ++E+Q K YG T HIY  D F+E   P
Sbjct: 229 QMSQWGGYDEKYR---SHFIDPMDPLFGKIQKRYLEKQTKLYG-TDHIYGIDPFNEVDSP 284

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
               +++ ++   I+  ++  DS A W+   W+F +    W  P++KA LNSVP  KL++
Sbjct: 285 NWDEDFLRTVSDKIFHSIEQVDSLAHWIQMTWMFYHSKDKWSQPRIKAFLNSVPDDKLIL 344

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + +   IW  ++Q+YG PYIWC L NF GN  + G +D ++             + G
Sbjct: 345 LDYYCDSVEIWRETQQYYGKPYIWCYLGNFGGNSMLAGHVDDVSAKLNRLFVEGGKNISG 404

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VG ++EG++ NP +Y+ + E A+ H   +   W+  +++ R G     I DAW  LY  +
Sbjct: 405 VGATLEGLDVNPFMYEFVLEKAWSHTITNAD-WMKNWALCRGGSKSSHIIDAWQQLYKKI 463

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                       I   T G+       ++   +L+   S   HP ++Y
Sbjct: 464 Y-----------------------IHHATAGQ----AVLMNARPMLEGTDSWNTHPDIYY 496

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              E+      F+ + N    S+ Y++D+I++ RQ L    ++   +    Y+  +  G+
Sbjct: 497 DNKELWHIWGKFLEAKN--VDSSGYKFDVINIGRQVLGNLFSDFRDSFTACYRQKNIEGM 554

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESA----KQLAQNEEQEKQVRC 467
            + + +   L  D+D LL+C   F +G W++ A    K L + E  E+  RC
Sbjct: 555 KEWAEKMNTLFTDVDRLLSCESSFSIGKWIKDARDWGKNLKEKEYYEQNARC 606


>gi|423287380|ref|ZP_17266231.1| hypothetical protein HMPREF1069_01274 [Bacteroides ovatus
           CL02T12C04]
 gi|392672495|gb|EIY65962.1| hypothetical protein HMPREF1069_01274 [Bacteroides ovatus
           CL02T12C04]
          Length = 726

 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 151/472 (31%), Positives = 239/472 (50%), Gaps = 39/472 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLP SWL  Q  LQK+I+ R   LGM PVLPAFSG+VPA L+ ++P A IT
Sbjct: 192 MSNVDYWQSPLPLSWLKNQRKLQKQIVDRERLLGMTPVLPAFSGHVPAELKRLYPDAAIT 251

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+  W       R   ++ +D  DPLF +I + ++E+Q K YG T HIY  D F+E   P
Sbjct: 252 QMSQWGGYDKKYR---SHFIDPMDPLFGKIQKRYLEKQTKLYG-TDHIYGIDPFNEVDSP 307

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
               +++ ++   I+  ++  DS A W+   W+F +    W  P++KA LNSVP  KL++
Sbjct: 308 NWDEDFLRTVSDKIFHSIEQVDSLAHWIQMTWMFYHSKDKWSQPRIKAFLNSVPDDKLIL 367

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + +   IW  ++Q+YG PYIWC L NF GN  + G +D ++             + G
Sbjct: 368 LDYYCDSVEIWRETQQYYGKPYIWCYLGNFGGNSMLAGHVDDVSAKLNRLFVEGGKNISG 427

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VG ++EG++ NP +Y+ + E A+ H   +   W+  +++ R G     I DAW  LY  +
Sbjct: 428 VGATLEGLDVNPFMYEFVLEKAWSHTITNAD-WMKNWALCRGGSKSSHIIDAWQQLYKKI 486

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                       I   T G+       ++   +L+   S   HP ++Y
Sbjct: 487 Y-----------------------IHHATAGQ----AVLMNARPMLEGTDSWNTHPDIYY 519

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              E+      F+ + N    S+ Y++D+I++ RQ L    ++   +    Y+  +  G+
Sbjct: 520 DNKELWHIWGKFLEAKN--VDSSGYKFDVINIGRQVLGNLFSDFRDSFTACYRQKNIEGM 577

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESA----KQLAQNEEQEKQVRC 467
            + + +   L  D+D LL+C   F +G W++ A    K L + E  E+  RC
Sbjct: 578 KEWAEKMNTLFTDVDRLLSCESSFSIGKWIKDARDWGKNLKEKEYYEQNARC 629


>gi|423213214|ref|ZP_17199743.1| hypothetical protein HMPREF1074_01275 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392693674|gb|EIY86904.1| hypothetical protein HMPREF1074_01275 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 726

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 151/472 (31%), Positives = 239/472 (50%), Gaps = 39/472 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLP SWL  Q  LQK+I+ R   LGM PVLPAFSG+VPA L+ ++P A IT
Sbjct: 192 MSNVDYWQSPLPLSWLKNQRKLQKQIVDRERLLGMTPVLPAFSGHVPAELKRLYPDAAIT 251

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+  W       R   ++ +D  DPLF +I + ++E+Q K YG T HIY  D F+E   P
Sbjct: 252 QMSQWGGYDEKYR---SHFIDPMDPLFGKIQKRYLEKQTKLYG-TDHIYGIDPFNEVDSP 307

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
               +++ ++   I+  ++  DS A W+   W+F +    W  P++KA LNSVP  KL++
Sbjct: 308 NWDEDFLRTVSDKIFHSIEQVDSLAHWIQMTWMFYHSKDKWSQPRIKAFLNSVPDDKLIL 367

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + +   IW  ++Q+YG PYIWC L NF GN  + G +D ++             + G
Sbjct: 368 LDYYCDSVEIWRETQQYYGKPYIWCYLGNFGGNSMLAGHVDDVSAKLNRLFVEGGKNISG 427

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VG ++EG++ NP +Y+ + E A+ H   +   W+  +++ R G     I DAW  LY  +
Sbjct: 428 VGATLEGLDVNPFMYEFVLEKAWSHTITNAD-WMKNWALCRGGSKSSHIIDAWQQLYKKI 486

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                       I   T G+       ++   +L+   S   HP ++Y
Sbjct: 487 Y-----------------------IHHATAGQ----AVLMNARPMLEGTDSWNTHPDIYY 519

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              E+      F+ + N    S+ Y++D+I++ RQ L    ++   +    Y+  +  G+
Sbjct: 520 DNKELWHIWGKFLEAKN--VDSSGYKFDVINIGRQVLGNLFSDFRDSFTACYRQKNIEGM 577

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESA----KQLAQNEEQEKQVRC 467
            + + +   L  D+D LL+C   F +G W++ A    K L + E  E+  RC
Sbjct: 578 KEWAEKMNTLFTDVDRLLSCESSFSIGKWIKDARDWGKNLKEKEYYEQNARC 629


>gi|153808241|ref|ZP_01960909.1| hypothetical protein BACCAC_02529 [Bacteroides caccae ATCC 43185]
 gi|423219048|ref|ZP_17205544.1| hypothetical protein HMPREF1061_02317 [Bacteroides caccae
           CL03T12C61]
 gi|149129144|gb|EDM20360.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
           43185]
 gi|392625814|gb|EIY19870.1| hypothetical protein HMPREF1061_02317 [Bacteroides caccae
           CL03T12C61]
          Length = 752

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 165/465 (35%), Positives = 227/465 (48%), Gaps = 51/465 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGPLP+SW+D+ ++L KKI+ R  ELGM P+   FSG VP  L++ +P AKI 
Sbjct: 193 MQNLQSYGGPLPKSWIDKHIILAKKIIDRERELGMTPIQQGFSGYVPRELKDKYPEAKI- 251

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
                   +  P WC       LD TD LF  +GR F+E++ K YG T  IY  D F E+
Sbjct: 252 --------RLQPGWCGFKGAGQLDPTDALFATLGRDFLEEEKKLYG-TYGIYAADPFHES 302

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
            PPV++PEY+S++G AIY  ++  D  A W MQ W        R P +KA    VP   L
Sbjct: 303 APPVNTPEYLSAVGHAIYKLIKDFDPKAKWAMQAWSL------REPIVKA----VPQNDL 352

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LDL  E        K F+G P +   LHNF G I M+G L  +A         +   +
Sbjct: 353 IILDLNGEKI---KGRKGFWGYPAVEGNLHNFGGRINMHGDLRLLASNQYMTALKQYPNV 409

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
            G G+ ME IEQNPV YDL  EM     +V ++ W+ QY+ RRYG   P+ Q A   L  
Sbjct: 410 CGSGLFMEAIEQNPVYYDLAFEMPLHKGEVAIEEWLKQYANRRYGAVSPSAQQAMICLLE 469

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
             Y      T+  R  I+A     P++     G     G P                   
Sbjct: 470 GPYRPGTNGTE--RSSIIA---ARPALNVKKSGPNAGLGIP------------------- 505

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            YS   VI+A  L +   ++L  S  YR+D+ID+ RQ +      +     EA+   D  
Sbjct: 506 -YSPLLVIQAEGLLLKDADKLKNSEPYRFDVIDVQRQMMTNMGQVIHKRAAEAFLNRDKE 564

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
                S+RFL+++ED+D LL     F    WL SA+     EE++
Sbjct: 565 AFALHSKRFLQMLEDVDELLRTRPEFNFDRWLTSARSWGDTEEEK 609


>gi|281200617|gb|EFA74835.1| alpha-N-acetylglucosaminidase [Polysphondylium pallidum PN500]
          Length = 688

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 126/284 (44%), Positives = 180/284 (63%), Gaps = 9/284 (3%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++ W G L   W+  Q  LQ +IL R+ + GM  VLP F+G+VP AL+  +P A IT
Sbjct: 213 MGNVNEWAGNLTLGWMADQRDLQIQILTRMRQFGMQAVLPGFAGHVPEALETHYPKANIT 272

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           QLG W +         TY L+  DPLF +I +AF+  Q + YG T H YN D F+E  PP
Sbjct: 273 QLGGWGTFSG------TYYLNPDDPLFSKIAQAFVITQNQLYG-TDHFYNFDPFNELEPP 325

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
                Y+ +   ++++ + + D   +W++QGW    DP FW PPQ +A L+ VP+GK++V
Sbjct: 326 SSDLTYLKNCSQSMFNNLIAADPQGIWVLQGWFLVDDPEFWLPPQTEAFLSGVPIGKMIV 385

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL+++V P W+++  +YG  +IWCMLHNF G   MYG +  I+  P+EAR S +  MVG
Sbjct: 386 LDLWSDVIPAWNSTNYYYGHNWIWCMLHNFGGRSGMYGKIPFISTNPIEAR-SLSPNMVG 444

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
            G++ E IEQN +VYDLMSEMA++    D+K W++QY  RRYG+
Sbjct: 445 TGLTPEAIEQNVIVYDLMSEMAWRSTPPDLKEWVDQYVTRRYGK 488


>gi|383114162|ref|ZP_09934927.1| hypothetical protein BSGG_1664 [Bacteroides sp. D2]
 gi|382948607|gb|EFS30964.2| hypothetical protein BSGG_1664 [Bacteroides sp. D2]
          Length = 727

 Score =  256 bits (654), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 148/459 (32%), Positives = 231/459 (50%), Gaps = 35/459 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSNL  W GPLP+ WLD Q  LQK+I+ R  +  M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 186 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 245

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++ +W   +   R   ++ LD  DPLF  I + F+E+Q K +G T HIY  D F+E  PP
Sbjct: 246 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 301

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PE++++    IY  M   D DA WL   WLF  D   W   +++A L +VP  KL++
Sbjct: 302 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQDKLLL 361

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + E   +W  + +++G PY+WC L NF GN  + G    +        T+      G
Sbjct: 362 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 421

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG + NP +Y+ +   A+     D   WI Q + RR G     ++ AW +LY ++
Sbjct: 422 LGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQLADRRIGLRNQQMRRAWKLLYDSI 480

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                     P+ +   +G   N  +P  K     + TS+     + Y
Sbjct: 481 YTA-------------------PAALG--QGTLMN-ARPCLKGNGNWTTTST-----VAY 513

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
           S   +    E+ + +G    ++  Y YD++++ RQ L  Y  +L     EAY       +
Sbjct: 514 SNETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVLGNYFGKLRDEFAEAYSRKQLPLL 571

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
            Q      +L+ D+D LL+    FLLG W+E A+ L  +
Sbjct: 572 KQKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLGTD 610


>gi|391338146|ref|XP_003743422.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Metaseiulus
           occidentalis]
          Length = 665

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 146/460 (31%), Positives = 235/460 (51%), Gaps = 38/460 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL G+GGPLP SW  QQ +LQK IL R+ + GM PV+P F+G VP A + + P+   +
Sbjct: 188 MGNLRGFGGPLPSSWQLQQQLLQKMILRRMRDFGMTPVVPGFNGFVPRAFERLHPAVSWS 247

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +   W +   +  +     L  T+  F+ +   +I      YG + H+Y+ D F+E TP 
Sbjct: 248 RASRWNNFPDE--YAMLTFLAPTESFFLNVSSLYITMYRSIYG-SDHLYSVDLFNEETPD 304

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
            + P  ++ + + +Y  +   D   +W+MQGWLF +   +W   ++KA L   PLGK++V
Sbjct: 305 TNDPAALAEMSSNVYESIAKADPKGIWVMQGWLFVHGGDYWNHDRVKAFLGGPPLGKMIV 364

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLF+E  P +     ++G P+IWCMLHN+ G   ++G L+ I   P+  R S    M+G
Sbjct: 365 LDLFSEQSPQFPRFSNYFGQPFIWCMLHNYGGVSGLFGNLEWINSEPLNVRRSV-PNMIG 423

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G++ EG  QN V+Y+ M+E +++    +V  W+  Y   RYG S P +++AW +L  +V
Sbjct: 424 IGIAPEGTGQNEVIYEFMAENSYRDSSENVSLWLQNYVGARYGLSDPHLENAWELLRKSV 483

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                         S+T    +N+G       +L         P +WY
Sbjct: 484 Y-------------------------SLTSKSIENHGN-----YILTHRPKLNSTPLIWY 513

Query: 360 STSEVIRALELFIASGN---ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
           + S+VI A    I       EL     +  DL+D+ RQAL    ++ +L ++  ++ N  
Sbjct: 514 NGSDVIGAATELIRGATLHRELCHERLFHQDLVDVVRQALQVRVSDEYLQMMSHFKANSL 573

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
               + SRR L  +  +D +L+    FLLG WL  +++ A
Sbjct: 574 IDFEEHSRRLLHCIRVLDKVLSTDPNFLLGSWLRDSRESA 613


>gi|212537509|ref|XP_002148910.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
           18224]
 gi|210068652|gb|EEA22743.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
           18224]
          Length = 768

 Score =  255 bits (652), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 149/474 (31%), Positives = 248/474 (52%), Gaps = 37/474 (7%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G WG PLP +W+D Q  LQKKI+ R+ ELGM P+LPAF G VP A+  V P A +
Sbjct: 210 FGNIQGSWGSPLPYAWVDSQFDLQKKIVKRMVELGMTPILPAFPGFVPRAITRVLPDADV 269

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
                W +  +   +     ++ TDP F EI ++FI +Q   YG  +  Y  D F+EN P
Sbjct: 270 INGSAWEAFPA--MFTSDTFMEPTDPHFTEIQKSFISKQTAAYGNVTTFYTLDQFNENNP 327

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPL-GKL 177
                 Y+ S+    +  +++ D  AVW+MQGWLF S   FW   +++A L  V +   L
Sbjct: 328 SSGDLNYLRSVSHGTWQALKAADPSAVWVMQGWLFFSNSAFWTNDRVEAYLGGVTVDSDL 387

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           +VLDL +E +P W  +  ++G P+IWC +H++ GN+  YG + +I   P+ A  +   ++
Sbjct: 388 LVLDLASESQPQWQRTNSYFGKPWIWCQIHDYGGNMGFYGQVMNITVNPIAALNNATASL 447

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPA-IQDAWNV 294
           VG G+SMEG E N VVYDL+ + A+  + +D   + + +   RY   +S+P  +  AW++
Sbjct: 448 VGFGLSMEGQEGNEVVYDLLLDQAWSAKPIDTATYFHDWVTARYAGSKSIPTDVYSAWDM 507

Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
           L  +VYN T+ A++       A P     +I  T G     G                 H
Sbjct: 508 LRTSVYNNTNLASN-------AVPKAIFELIPSTTGLVNRTGH----------------H 544

Query: 355 P-HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEA 410
           P  L Y+ +++++A  LF ++  +   L  +  Y +DL+D++RQ LA     ++ ++I A
Sbjct: 545 PTTLNYNPADMVKAWSLFYSAAFKEPSLWLNPAYEFDLVDMSRQVLANAFIPVYHDLIAA 604

Query: 411 YQLNDAHGVF--QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
           +   +   +    +    + +++ +D +L  ++ F L  W+ +A+  A  +  E
Sbjct: 605 WNTTNPSTIRIQIIGAELIGILQAIDTILDTNEHFKLSTWISAARTSAGEQSLE 658


>gi|195454475|ref|XP_002074254.1| GK18384 [Drosophila willistoni]
 gi|194170339|gb|EDW85240.1| GK18384 [Drosophila willistoni]
          Length = 743

 Score =  254 bits (650), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 142/435 (32%), Positives = 233/435 (53%), Gaps = 38/435 (8%)

Query: 32  ELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIG 91
           ELG++  LPAF+G+VP AL+ +FP A  T+   W    +   +CC   ++  +PLF ++ 
Sbjct: 262 ELGISVALPAFAGHVPRALRRIFPQANFTETERWNRFPNA--YCCDLFVEPQEPLFRQLA 319

Query: 92  RAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 151
             F+ +  + YG ++HI+ CD F+E  PPV   +++ S  AAIY+ M+  D  A+WL+QG
Sbjct: 320 TTFLRRVTQRYG-SNHIFFCDPFNELEPPVSQADFMRSTAAAIYASMREVDPKAIWLLQG 378

Query: 152 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 211
           W+F  + FW    ++A L +VP G L+VLDL +E  P +  +K +YG P++WCMLHNF G
Sbjct: 379 WMFVKNIFWTDELIEAFLTAVPQGNLLVLDLQSEQFPQYQRTKSYYGQPFVWCMLHNFGG 438

Query: 212 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 271
            + M G ++ +  G   AR   N++MVG G++ EGI QN V+Y    E  +   K+D   
Sbjct: 439 TLGMLGSVELVNSGMDLARQMPNSSMVGAGITPEGIGQNYVMYSFALERGWSDRKLDSAG 498

Query: 272 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 331
           W   +++ RYG     +  AW +L  +VY        + +  I   P ++ S        
Sbjct: 499 WFTHFALTRYGVQDERLNQAWQLLRTSVYTFHGLQKMRGKYTITRRPAINLS-------- 550

Query: 332 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL----SASNTYRYD 387
                                  P  WY+ + V+ A +L +++ + +    +  + Y++D
Sbjct: 551 -----------------------PFTWYNVTHVLEAWQLMLSARSIIPLDDNRYDIYQHD 587

Query: 388 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 447
           L+D+TRQ L   A++L++N+  +Y+         L  + LEL++D++ +L     FLLG 
Sbjct: 588 LVDITRQYLQITADQLYVNLNSSYRKRQLARFVYLGNKLLELLDDLERILGSGSNFLLGT 647

Query: 448 WLESAKQLAQNEEQE 462
           WLE+AK LA   E +
Sbjct: 648 WLEAAKLLAPTVEDQ 662


>gi|452988463|gb|EME88218.1| glycoside hydrolase family 89 protein [Pseudocercospora fijiensis
           CIRAD86]
          Length = 772

 Score =  253 bits (647), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 143/474 (30%), Positives = 251/474 (52%), Gaps = 37/474 (7%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G WGG LPQSW+D Q  L KKI+ R+ ELGM PVLP F+G VP  +  ++P+A  
Sbjct: 200 FGNIQGSWGGDLPQSWIDHQFELNKKIVARMVELGMTPVLPCFTGFVPTQISRLYPNASF 259

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
                W   +++  +     L+  DPLF  + ++FI +Q++ YG  S IY  D ++EN P
Sbjct: 260 VNGSRWNGFQAE--YTNVTFLEPFDPLFTTLQKSFISKQIEAYGNVSSIYTLDQYNENDP 317

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLV 178
                 Y+ ++ +     +++ D +A+W +QGWLF S   FW   +++A L  V    ++
Sbjct: 318 FSGELAYLKNVTSNTIKSLKAADPEAIWFIQGWLFYSSADFWTDERVEAYLGGVANEDML 377

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
           +LDLF+E +P W  +  ++G P+IWC LH++ GN  ++G ++++   PV+A  ++ +TMV
Sbjct: 378 ILDLFSESQPQWQRTNSYFGKPWIWCQLHDYGGNQGLHGQVENVTINPVQALANKTSTMV 437

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVPA-IQDAWNVLY 296
           G+G +MEG E N ++YD++ + A+  E +D  ++ + +   RY G  +P+ +  AW+V+ 
Sbjct: 438 GMGSTMEGQEGNEIIYDILLDQAWSKEPIDSDSYFHDWVTSRYAGSKLPSGLYTAWDVMR 497

Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
            TVYN TD   +    V  +  +++P+   +   +  +    +    VL S  +      
Sbjct: 498 QTVYNSTD--IEAAEAVTKSIFELEPNTTGLLNRRGHHSTLILYDPNVLVSAWN------ 549

Query: 357 LWYSTSEVIRALELFIASGNELSA--SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
                       +L+ AS +++       Y++DL+D TRQ LA     L+ + + +    
Sbjct: 550 ------------DLYNASNDDIQLWDVKAYQFDLVDTTRQVLANAFYPLYTDFVHSAN-K 596

Query: 415 DAHGVF------QLSRRFLELVEDMDGLLACHDG--FLLGPWLESAKQLAQNEE 460
              G +      +  +  + L++D+D +L       F L  W+ESA+  A  E+
Sbjct: 597 SVQGTYSPTKAEEKGKEMIMLLKDLDSVLEASGNAHFKLSSWIESARLWAPAED 650


>gi|29348998|ref|NP_812501.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29340905|gb|AAO78695.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 732

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 153/472 (32%), Positives = 240/472 (50%), Gaps = 39/472 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLPQSWL  Q  LQK+IL R  E  M PVLPAF+G+VPA L+ ++P+AKI 
Sbjct: 196 MSNVDFWQSPLPQSWLKDQEELQKRILEREREFDMTPVLPAFAGHVPAELKTIYPNAKIY 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+  W       R   ++ +D  D L+  I R F+E+Q K YG T HIY  D F+E   P
Sbjct: 256 QMSQWGGFDEKYR---SHFIDPMDSLYSIIQRRFLEEQTKVYG-TDHIYGIDPFNEVDSP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
             S ++++++ + IY  +   DS A WL   W+F YD   W  P++++ L +VP  KL++
Sbjct: 312 DWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYDKKKWTQPRIRSFLKAVPDNKLIL 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + +   IW  ++++YG PYIWC L NF GN  + G L+ I F            + G
Sbjct: 372 LDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIAGNLNDIDFKIKRLFKEGGDNVYG 431

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG + NP++Y+ + + A+ +  V    WI  +S+ R G     I  AW  L+  +
Sbjct: 432 LGATLEGFDVNPLMYEFVFDQAWDY-PVTTDQWITNWSMCRGGNQDANIIKAWRALHQKI 490

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y  T+ AT     ++ A P        +T  K  N    +            Y +  LW 
Sbjct: 491 Y--TEHATCGQSVLMNARP-------RLTGTKSWNTNPGI-----------HYANNDLWQ 530

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              E+++A  +          ++ +R+D+I++ RQ L    +E        Y   D  G+
Sbjct: 531 IWKELLKARNI---------NNSDFRFDVINIGRQVLGNLFSEYRDQFTACYNRKDTTGM 581

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ----LAQNEEQEKQVRC 467
            + S R   L+ D+D LL+C     +G WL+ A+     +++ +  E+  RC
Sbjct: 582 REWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARNCGATVSEKDYYEENARC 633


>gi|395331391|gb|EJF63772.1| alpha-N-acetylglucosaminidase [Dichomitus squalens LYAD-421 SS1]
          Length = 750

 Score =  253 bits (645), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 157/476 (32%), Positives = 259/476 (54%), Gaps = 33/476 (6%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G WGG LP +W+D Q  LQK+IL R+ ELGM PVLP+F+G VP AL +++P+A I
Sbjct: 195 FGNIQGSWGGDLPVTWVDDQFQLQKQILQRMVELGMTPVLPSFTGFVPRALSSLYPNASI 254

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
                W    +         L+  DPLF  I  +FI +Q + YG  SHIY  D ++EN P
Sbjct: 255 VNGSQWEGFPT--ALTNDSFLEPFDPLFTTIQTSFISKQREAYGNVSHIYALDQYNENDP 312

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLG-KL 177
               P Y++++ A  ++ +++ D DAVWLMQGWLF S   FW   +++A L  VP    +
Sbjct: 313 FSGDPAYLANVTAGTFASLRAADPDAVWLMQGWLFFSSAAFWTNERIEAYLGGVPGNDSM 372

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LDL++E +P W+ +  +YG  ++WC LH + GNI M G LD++   P+ A  +  ++M
Sbjct: 373 IILDLYSEAQPQWNRTSSYYGKQWVWCELHGYGGNIGMEGDLDALTQNPIAALHAPGSSM 432

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVP-AIQDAWNVL 295
            GVG++MEG E N +VYD++ + A+    +++ ++++Q+  RRY  R +P +  DAW  L
Sbjct: 433 KGVGLTMEGQEGNELVYDILLDQAWSSAPLNLSSYVDQWVARRYNVRRLPKSALDAWRTL 492

Query: 296 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
             TVY+  D  T   +  I +  ++ P++  +T                  + T  +   
Sbjct: 493 ATTVYSNKDSGT---QAAIKSIYELAPALTGMT------------------NRTGHHPTA 531

Query: 356 HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
             + + S V+ A +  + + +E   L+    + YD++D+TRQ L+    + +  ++  Y 
Sbjct: 532 IPYDTNSTVLVAAKALLEARSENPLLATIPEFAYDVVDVTRQLLSNRFIDHYNVLVATYN 591

Query: 413 LNDA--HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVR 466
            N      V   +   L L++D+D LLA ++ FLL  W+  AK+     ++    R
Sbjct: 592 SNATAPRNVAAAAGPLLALLDDLDELLATNEHFLLSNWIADAKRWTHGADRAAYAR 647


>gi|393788286|ref|ZP_10376416.1| hypothetical protein HMPREF1068_02696 [Bacteroides nordii
           CL02T12C05]
 gi|392655959|gb|EIY49600.1| hypothetical protein HMPREF1068_02696 [Bacteroides nordii
           CL02T12C05]
          Length = 757

 Score =  252 bits (643), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 155/463 (33%), Positives = 230/463 (49%), Gaps = 48/463 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGPLP++ +D+   L KKI+ R  ELGM P+   FSG VP  L+  +P+A I 
Sbjct: 196 MQNLQSYGGPLPKTVIDKHAALGKKIISRQLELGMQPIQQGFSGYVPRELKEKYPTANIN 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q  +W   K   +      LD TD LF  +GRAF+E+Q + +G    +Y  D F E+ PP
Sbjct: 256 QQRSWCGFKGAAQ------LDPTDSLFTRMGRAFLEEQARLFG-AHGVYAADPFHESAPP 308

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +D+PEY+ ++G  I+   +  D  + W MQ W    D          ++ +VP   L++L
Sbjct: 309 IDTPEYLKAVGERIHHLFRDFDPHSTWAMQSWSLRED----------IVKAVPKDALLIL 358

Query: 181 DLFAEVKPIWSTSKQ-FYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           DL  +     STSK  F+G   +   LHNF G I M+G L  +A          N  + G
Sbjct: 359 DLNGK-----STSKALFWGYSTVVGNLHNFGGRINMHGDLKLLASNQYSKAKRLNPAVCG 413

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            G+ ME +EQNPV Y+L  EM    + ++++AW+ QY+ RRYG   PA Q+AW +L +  
Sbjct: 414 SGLFMEAVEQNPVYYELAFEMPCHADSINLQAWLKQYATRRYGAFSPAAQEAWLLLLNGP 473

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y      T+K+  ++ A P +D                 V K     +    YD P L  
Sbjct: 474 YRRGTNGTEKS-SIVAARPALD-----------------VKKSGPNAALEIPYD-PTL-- 512

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
               VIRA  L +   ++LS S  YR+D++D+ RQ +      +     EA++  D    
Sbjct: 513 ----VIRAQSLLLKDIDKLSVSRPYRFDIVDVQRQLMTNLGQLIHRQAAEAFRKKDQCAF 568

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
              S RFLE++ DMD LL     +    WL  A+     +E++
Sbjct: 569 TLHSGRFLEMLADMDKLLRTRSEYSFDRWLTEARSWGDTDEEK 611


>gi|298386708|ref|ZP_06996263.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 1_1_14]
 gi|298260382|gb|EFI03251.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 1_1_14]
          Length = 732

 Score =  252 bits (643), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 152/472 (32%), Positives = 240/472 (50%), Gaps = 39/472 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLPQSWL  Q  LQK+IL R  E  M PVLPAF+G+VPA L+ ++P+AKI 
Sbjct: 196 MSNVDYWQSPLPQSWLKDQEELQKRILEREREFDMTPVLPAFAGHVPAELKTIYPNAKIY 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+  W       R   ++ +D  D L+  I R F+E+Q K YG T HIY  D F+E   P
Sbjct: 256 QMSQWGGFDEKYR---SHFIDPMDSLYSIIQRRFLEEQTKVYG-TDHIYGIDPFNEVDSP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
             S ++++++ + IY  +   DS A WL   W+F YD   W  P++++ L +VP  KL++
Sbjct: 312 DWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYDKKKWTQPRIRSFLKAVPDNKLIL 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + +   IW  ++++YG PYIWC L NF GN  + G L+ I F            + G
Sbjct: 372 LDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIAGNLNDIDFKIKRLFKEGGDNVYG 431

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG + NP++Y+ + + A+ +  V    WI  +S+ R G     I  AW  L+  +
Sbjct: 432 LGATLEGFDVNPLMYEFVFDQAWDYS-VTTDQWITNWSMCRGGNQDANIIKAWRALHQKI 490

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y  T+ AT     ++ A P        +T  K  N    +            Y +  LW 
Sbjct: 491 Y--TEHATCGQSVLMNARP-------RLTGTKSWNTNPGI-----------HYANNDLWQ 530

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              E+++A  +          ++ +R+D+I++ RQ L    ++        Y   D  G+
Sbjct: 531 IWKELLKARNI---------NNSDFRFDVINIGRQVLGNLFSKYRDQFTACYNRKDTTGM 581

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ----LAQNEEQEKQVRC 467
            + S R   L+ D+D LL+C     +G WL+ A+     +++ +  E+  RC
Sbjct: 582 REWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARNCGATVSEKDYYEENARC 633


>gi|392584963|gb|EIW74305.1| glycoside hydrolase family 89 protein [Coniophora puteana
           RWD-64-598 SS2]
          Length = 772

 Score =  251 bits (642), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 149/479 (31%), Positives = 249/479 (51%), Gaps = 53/479 (11%)

Query: 8   GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFS 67
           GG LPQ W+D QL LQK+I+ RI ELGM PVLPAF G VP A+  +FP+A I     +  
Sbjct: 221 GGKLPQEWMDAQLALQKQIVPRIVELGMTPVLPAFPGFVPPAMHTLFPNASIVNGSEYPG 280

Query: 68  VKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYI 127
           + +  ++     L   DPL+ ++  +F+ +Q +  G  +H++  D ++EN+P      Y+
Sbjct: 281 IPA--QYSNDSFLAPFDPLYAQLQSSFLAKQTEALGNVTHVWTIDQYNENSPYSGDLTYL 338

Query: 128 SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 186
           +++  + ++ +++ D DA+WLMQGWLF  D PFW   ++ A L+ +P   +++LDLF++V
Sbjct: 339 ANIANSTFASLRAHDPDAIWLMQGWLFFADEPFWTSDRVDAYLDQIPNDGMIILDLFSDV 398

Query: 187 KPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEG 246
            P W     + G  ++WC +H+F GN+ + G    +  GPV+A  S N++M GVG++MEG
Sbjct: 399 YPQWQRLDSYRGKSWVWCEVHDFGGNMGLEGNFSVVTNGPVDALNSPNSSMKGVGLAMEG 458

Query: 247 IEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-------------GRSVPAIQ-DAW 292
           +E N ++YD++ + A+    +D  A+   ++ RR+               S+PA   +AW
Sbjct: 459 LEGNEIIYDVLLDQAWSAAPLDRDAYAKAWATRRFHLPTANSSTTTATNTSIPASAIEAW 518

Query: 293 NVLYHTVYNCTD----GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
             L  TVY+ T+    GAT               S+I +       Y  P S      + 
Sbjct: 519 QTLASTVYSSTNPNVWGATK--------------SLIELAPSLGGMYSAPSSTIIFYDTN 564

Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFL 405
           TS             ++ AL   +A+G     L A + +R D ID+ RQ LA    + + 
Sbjct: 565 TS-------------LVPALRGLVAAGTSAPALWALDEFRTDSIDVARQLLANRFADAYT 611

Query: 406 NIIEAYQLN--DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
               AY  +   +  +   + R +++++D+D LL  H+ +LL   + SA+  A +   E
Sbjct: 612 ATTGAYNASGPGSAALNATAARMMQIIDDLDRLLMTHEPYLLSSRIASARAWAGDGGDE 670


>gi|403416059|emb|CCM02759.1| predicted protein [Fibroporia radiculosa]
          Length = 705

 Score =  251 bits (641), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 148/470 (31%), Positives = 254/470 (54%), Gaps = 37/470 (7%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G W G LP  W++ Q  LQ++I+ R+ ELGM PVLPAF+G VP A+  ++P+A I
Sbjct: 156 FGNIQGSWSGALPTQWINDQWALQQQIVQRMVELGMTPVLPAFTGFVPRAMSTLYPNASI 215

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENT 118
                W    S   +  T  L+  DPLF  + ++FI +Q   YG   SH+Y  D ++EN 
Sbjct: 216 VNGSQWEGFPSTLTY--TTFLEPFDPLFTTMQKSFISKQQAAYGANVSHVYTLDQYNEND 273

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLG-K 176
           P      Y++++ A  ++ +Q+ D +AVW+MQGWL F+ + FW   ++ A L +VP    
Sbjct: 274 PYSGDVGYLANISAGTFASLQAADPEAVWMMQGWLFFASEAFWTTERIAAFLGAVPSNDS 333

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
           +++LDL++E  P W  +  +YG  +IWC LH+F GN+   G L  +  GP++A  S  ++
Sbjct: 334 MIILDLYSEAAPQWQRTDSYYGKQWIWCELHDFGGNMGFEGNLPELVTGPIQAL-SNASS 392

Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVP-AIQDAWNV 294
           M G+G++ EG E N +VYD++ + A+    +D+ +++  +  RRY  + +P A Q+AW +
Sbjct: 393 MRGMGLTPEGQEGNEIVYDILLDQAWSSTSIDIASYVEAWVARRYTVQDLPSAAQEAWTI 452

Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
           L  TVY+ +D             P+   +I S+ E         ++ +    ++ + +  
Sbjct: 453 LSTTVYSNSD-------------PNTQATIKSIFE---------LAPDLSGLTDRTGHHC 490

Query: 355 PHLWYSTS-EVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEA 410
             + Y T+  ++ AL+  + +  E   L +   + YD++D+TRQ LA    +++  ++  
Sbjct: 491 TEIPYDTNITIVPALQNLVQAATENPLLLSVPEFMYDVVDVTRQLLANRFIDVYNELVST 550

Query: 411 YQLN--DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
           +      A  V    +  L ++ D+D LL  +D FLL  W+  A  L+ N
Sbjct: 551 FYSTGVTAASVKNAGQPLLTILSDVDTLLWTNDNFLLSNWILGAINLSDN 600


>gi|383124408|ref|ZP_09945072.1| hypothetical protein BSIG_3565 [Bacteroides sp. 1_1_6]
 gi|251839096|gb|EES67180.1| hypothetical protein BSIG_3565 [Bacteroides sp. 1_1_6]
          Length = 732

 Score =  249 bits (637), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 152/472 (32%), Positives = 239/472 (50%), Gaps = 39/472 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLPQSWL  Q  LQK+IL R  E  M PVLPAF+G+VPA L+ ++P+AKI 
Sbjct: 196 MSNVDYWQSPLPQSWLKDQEELQKRILEREREFDMTPVLPAFAGHVPAELKTIYPNAKIY 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+  W       R   ++ +D  D L+  I R F+E+Q K YG T HIY  D F+E   P
Sbjct: 256 QMSQWGGFDEKYR---SHFIDPMDSLYQVIQRRFLEEQTKVYG-TDHIYGIDPFNEVDSP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
             S ++++++ + IY  +   DS A WL   W+F YD   W  P++++ L +VP  KL++
Sbjct: 312 DWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYDKKKWTQPRIRSFLKAVPDDKLIL 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + +   IW  ++++YG PYIWC L NF GN  + G L+ I F            + G
Sbjct: 372 LDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIAGNLNDIDFKIKRLFKEGGDNVYG 431

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG + NP++Y+ + + A+ +  V    WI  +S+ R G     I  AW  L+  +
Sbjct: 432 LGATLEGFDVNPLMYEFVFDQAWDY-PVTTDQWITNWSMCRGGDQDANIIKAWRALHQNI 490

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y  T+ A      ++ A P        +T  K  N    +            Y +  LW 
Sbjct: 491 Y--TEYAICGQSVLMNARP-------RLTGTKSWNTNPGIH-----------YANNDLWQ 530

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              E+++A  +          ++ +R+D+I++ RQ L    +E        Y   D  G+
Sbjct: 531 IWKELLKARNI---------NNSDFRFDVINIGRQVLGNLFSEYRDQFTACYNRKDTTGM 581

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ----LAQNEEQEKQVRC 467
            + S R   L+ D+D LL+C     +G WL+ A+     +++ +  E+  RC
Sbjct: 582 REWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARDCGTTVSEKDYYEENARC 633


>gi|410100551|ref|ZP_11295511.1| hypothetical protein HMPREF1076_04689 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409215586|gb|EKN08585.1| hypothetical protein HMPREF1076_04689 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 739

 Score =  248 bits (633), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 158/469 (33%), Positives = 234/469 (49%), Gaps = 56/469 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  +GGPLPQSW+D    L ++IL R  ELGM P+L +F+G VP  L+  +P A+I 
Sbjct: 194 MTNIETYGGPLPQSWIDSHQALGQQILERQRELGMTPILQSFTGFVPIKLKEKYPDARI- 252

Query: 61  QLGNWFSVKSDPRWC----CTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
                   K   RWC     T  LD  DPLF E+G+AF+E+Q K YG T+HIY  D F E
Sbjct: 253 --------KDKNRWCNAFTATVQLDPLDPLFKEMGQAFLEEQQKLYG-TNHIYAADPFHE 303

Query: 117 NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
              P +   Y+ ++G  I+      D +AV  MQ W             +A+  + P  +
Sbjct: 304 GAAPSNEKSYLEAVGKVIWEVASGFDPEAVIAMQTWSLR----------EAITRTFPQDR 353

Query: 177 LVVLDLFAEVKPIWSTSK--QFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS-E 233
           L++LDL       W+  K   F+  PY+  +LHN+ G + M G L   A    E + S +
Sbjct: 354 LLLLDLGG-----WNVEKFNSFWNYPYVAGVLHNYGGRVYMGGNLALYAKNAHELKQSPK 408

Query: 234 NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWN 293
              + G+G+  E IE NPVVY+L +E+ +  +  D++ WI  Y+  RYG+     +  W 
Sbjct: 409 GGNIQGIGLFPEAIEHNPVVYELSTEITWMQDAPDLQKWITDYARARYGKLPAGAEQGWK 468

Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
           VL  TVY    G       V+ A P +    ++       +  +P               
Sbjct: 469 VLLETVYGSKAGRLPSTESVMCARPALTIQKVAAN----GDLSRP--------------- 509

Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
                YST  +  A++ F+ + N+L  S+TYRYDL+D+ RQ L+  +  L   I EAY  
Sbjct: 510 -----YSTVRLWDAVDHFLQASNDLKKSDTYRYDLVDVMRQCLSDLSLPLQKQITEAYLA 564

Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
            D   + Q   +FL L++D D LL     FLLG W++ A+Q    EE++
Sbjct: 565 EDNEKLQQAGEQFLALIDDFDRLLGTRSTFLLGKWIKEARQWGTTEEEK 613


>gi|336371253|gb|EGN99592.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336384013|gb|EGO25161.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 761

 Score =  248 bits (632), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 147/473 (31%), Positives = 252/473 (53%), Gaps = 41/473 (8%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G WGG LP+ W++ Q VLQK+IL R+ ELGM PVLP+F+G VP A+  ++P+A I
Sbjct: 206 FGNIQGSWGGDLPEQWINDQFVLQKQILARMVELGMTPVLPSFTGFVPRAMHTLYPNASI 265

Query: 60  TQLGNW--FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
                W  F+++          L+  DPLF  +  +F+ +    YG  SHIY  D ++E 
Sbjct: 266 VNGSQWSTFTIQH----TNDSFLEPFDPLFSTLQTSFMTKYAAAYGNVSHIYTLDQYNEM 321

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS-YDPFWRPPQMKALLNSVPLG- 175
            P   +  Y+SS+ +A ++ +++ D +AVW+MQGWLF  Y  FW   +++A L  VP   
Sbjct: 322 MPYSGNTSYLSSISSATFASLRATDPEAVWMMQGWLFYIYASFWTDERVEAYLGGVPGND 381

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
            +++LDLF+E  P W     ++G  +IWC LH+F GN+   G  +++   PV+A  +   
Sbjct: 382 SMIILDLFSEAYPQWQRLNSYFGKQWIWCELHDFGGNMGFEGNFENVTTQPVKALATPGN 441

Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVP----AIQDA 291
           TMVG+G++MEG E N ++YD++ + A+    ++  ++++ ++ RRY  +VP    A  +A
Sbjct: 442 TMVGMGLTMEGQEGNEIMYDVLFDQAWSPTPINRTSYVSAWTSRRY--NVPNLPTAATEA 499

Query: 292 WNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSS 351
           W +L  TVYN  D         I         ++++T  +    G P             
Sbjct: 500 WEILASTVYNNQDPLLQATIKSIFELEPAINGLVNLTVLQ----GIPTG----------- 544

Query: 352 YDHPHLWYST-SEVIRALELFIASGNELSASNT---YRYDLIDLTRQALAKYANELFLNI 407
                L+Y T + ++ AL+  + +  E SA +    ++YD++ + RQ LA    +L+ ++
Sbjct: 545 -----LFYDTNTTIVPALQSLLQARQESSALDEVPEFQYDVVYIIRQLLANRFIDLYTSL 599

Query: 408 IEAYQLNDAHGVFQLSRR--FLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
           ++ Y    +      +     + L++D+D +L     FLL  W+ +A+  A +
Sbjct: 600 VDTYNSTTSSSSDVSTAGAPLITLLKDVDSVLLTDTHFLLSNWISAARNWAHD 652


>gi|326437768|gb|EGD83338.1| lysosomal alpha-N-acetyl glucosaminidase [Salpingoeca sp. ATCC
           50818]
          Length = 820

 Score =  248 bits (632), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 153/477 (32%), Positives = 233/477 (48%), Gaps = 54/477 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  W  PL + W   Q  LQ KIL R  ELGM   LP F+G+VP A++ +FP A +T
Sbjct: 215 MGNLKYWAAPLDKDWRTSQYNLQLKILSRARELGMVSALPGFAGHVPTAIKRIFPHANLT 274

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q   W +  S   +    LL  TDPLF+++G  F +  +K +G T H++  DT++E  P 
Sbjct: 275 QTAGWANFNS--TYSDVSLLQPTDPLFLQLGTKFYKMLIKAFG-TDHVFQMDTYNEMQPS 331

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
             +   ++     +Y  M + D +AV+LMQGWLF ++ +W P  +K  L+ VP  K+++L
Sbjct: 332 FTNMTLLAESNRVVYQAMANADPEAVYLMQGWLF-HESYWTPEHVKVYLSGVPDDKMIIL 390

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL  E  P++S +  ++G  +IW ML N+ G   +YG    I+  P+        TM G+
Sbjct: 391 DLNTEANPVFSLTSDYFGKLWIWNMLLNYGGRRGLYGNATDISTRPLLDLHRAQGTMDGI 450

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ E IE NPV+++LM EM +     D+  WI  Y+  RYG+     Q AW +L   VY
Sbjct: 451 GITPEAIENNPVMFELMLEMGWHATPPDMHDWIAAYASSRYGKRESLTQSAWQLLLEHVY 510

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           +      D +R  +   PD     +S +E +  N                          
Sbjct: 511 D----QPDIDRFHMEMVPD-----LSSSESRNSN-------------------------- 535

Query: 361 TSEVIRALELFI--ASGNELSASNTYRYDLIDLTRQA-----------LAKYANELFLNI 407
           T+ +++A  L +  A    L  +  + YDL+D+ RQA           L  +  E   NI
Sbjct: 536 TTALVQAWRLLVTAAVNGSLPITGPFSYDLVDVGRQALLNLWSDVRGMLVAHVKEYNANI 595

Query: 408 IEAYQLNDAH--GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
             +     +H   +  L    L++  D+D LL     +LLG WLESAK  A N ++ 
Sbjct: 596 DSSPSTAASHVPAIKSLFTLLLDITSDLDRLLGTDVNYLLGVWLESAKATAANADER 652


>gi|409042145|gb|EKM51629.1| glycoside hydrolase family 89 protein [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 749

 Score =  247 bits (631), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 143/464 (30%), Positives = 249/464 (53%), Gaps = 37/464 (7%)

Query: 6   GWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNW 65
            WGG LP  W+  Q  LQK+IL R+ ELGM P+LPAF+G VP+ +   +P+A I     W
Sbjct: 201 AWGGLLPMQWISDQQALQKQILPRMLELGMTPILPAFTGFVPSNMSAHYPNASIIDGSAW 260

Query: 66  FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPE 125
               S         L+  DPL+ ++ ++FI +Q + YG  +H Y  D ++EN P   +  
Sbjct: 261 SGFPS--TLTNVSFLEPFDPLYPQMQQSFITKQQEAYGNITHFYTLDQYNENNPFSGNDS 318

Query: 126 YISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLG-KLVVLDLF 183
           Y+SS+  +  + +++ D +A W+MQGWLF S + FW   +++A L        +++LDL+
Sbjct: 319 YLSSVSTSTIASLRAADPEATWVMQGWLFFSSETFWTNDRIEAYLGGAQGNDSMLILDLY 378

Query: 184 AEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMS 243
           +E +P W+ +  ++G  ++WC LH++ GN+ + G L +I  GP+ A  S  ++MVG+G++
Sbjct: 379 SEAQPQWNRTDSYFGKQWVWCELHDYGGNMGLEGNLAAITEGPIAALNSNGSSMVGMGLT 438

Query: 244 MEGIE-QNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVPA-IQDAWNVLYHTVY 300
           MEG+E  N +VYD++ + A+    ++V  W+ +++ RRY  +++P  +Q AW +L  T+Y
Sbjct: 439 MEGMEIGNEIVYDILLDQAWSSTPLNVSDWVAKWAARRYLVKTLPTELQQAWTILSTTIY 498

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP-HLWY 359
           N  D  +      I+        +++VT                         HP  + Y
Sbjct: 499 NNQDPNSQATIKSILELEPATTGLVNVTG-----------------------HHPTEIPY 535

Query: 360 ST-SEVIRALELFI-ASGNELSASNT--YRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
            T + ++ AL+LF+ AS ++ S      +  D+++L+RQ +     +L+ ++I  +  + 
Sbjct: 536 DTNTTILHALQLFVNASKSQPSLKQVPEFAVDILELSRQLMVNRFIDLYTDLINTWNSSS 595

Query: 416 --AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 457
             A  V       L L+ D+D LL  ++ +L   W+  AKQ A 
Sbjct: 596 STAQNVTTAGVPLLSLISDLDVLLYTNENYLFSTWIADAKQWAH 639


>gi|404487028|ref|ZP_11022215.1| hypothetical protein HMPREF9448_02671 [Barnesiella intestinihominis
           YIT 11860]
 gi|404335524|gb|EJZ61993.1| hypothetical protein HMPREF9448_02671 [Barnesiella intestinihominis
           YIT 11860]
          Length = 726

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 148/461 (32%), Positives = 230/461 (49%), Gaps = 38/461 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+ GW GPLP  WL+ Q  LQKKIL R  EL M PVLPAF+G+VPAAL+ + P A I 
Sbjct: 196 MANIDGWNGPLPMQWLESQAELQKKILARERELNMTPVLPAFAGHVPAALKRIHPDANIQ 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R    + L+  +PLF EI ++F+E+Q K +G T HIY  D F+E  PP
Sbjct: 256 YLGKWAGFGDSYR---CHFLNPEEPLFAEIQKSFLEEQEKMFG-TDHIYGVDPFNEVDPP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              PEY++ + + +Y  + + D DAVWL   W+F +D   W  P++KALL  VP  KLV+
Sbjct: 312 SWEPEYLAQVSSDMYKSLAAADPDAVWLQMTWMFYHDRKLWTAPRVKALLTGVPSDKLVL 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD   E   +W ++++F+G PYIWC L NF GN  + G +         A  +    + G
Sbjct: 372 LDYHCENVELWKSTEKFHGQPYIWCYLGNFGGNTTLTGNVKESGDRLDNALINGGDNLKG 431

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG++ N   Y+ + E A+  + V+ + W+ + + R  G    + ++AW +L+   
Sbjct: 432 IGSTLEGLDINQFPYEYIFEKAWTID-VNGQDWVERLADRHVGAVSESAREAWQILFD-- 488

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
                       DV V  P     +         NY K  S E                Y
Sbjct: 489 ------------DVFVQVPRTLGILPGYRPKLGDNYNKRTSNE----------------Y 520

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
             + ++R  EL +   +     + +  D+I   RQ L  Y  ++       Y+  +  G+
Sbjct: 521 DNATLLRVWELLLEVPS--CDRDAFEIDVIMTGRQLLGNYFLDVKKEFDGFYKKRNVPGL 578

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
            + +    E++ D++ L + H+   L  W+E A+ L   +E
Sbjct: 579 KEKASEMREILSDLELLNSFHNRASLDKWIEDARSLGDTDE 619


>gi|317158657|ref|XP_001827155.2| alpha-N-acetylglucosaminidase [Aspergillus oryzae RIB40]
          Length = 849

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 152/479 (31%), Positives = 250/479 (52%), Gaps = 44/479 (9%)

Query: 1   MSNLHG-WGG-PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
           + N+ G WGG  +  +W++ Q  LQKKI+ RI ELGM PVLPAF G VP A++ V P A 
Sbjct: 157 LGNIQGSWGGHGVSIAWIEAQFELQKKIVSRIVELGMTPVLPAFPGFVPPAIKRVRPHAT 216

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           +     W   +   ++     L+  D  F ++ ++ I +Q + +G  +H+Y  D F+E  
Sbjct: 217 VVNGSQWSGFQK--KFTEVSFLNPLDETFAQLQKSVISRQTRAFGNVTHVYALDQFNEIN 274

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP--FWRPPQMKALLNSVPLGK 176
           P      Y+ +L    +  +++ +  AVW+MQGWLF YD   FW P ++ A L+ V    
Sbjct: 275 PASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YDKKDFWDPNRISAYLSGVERND 333

Query: 177 -LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
            +++LDL++E KP W  ++ ++G P+IWC LH+F GN+ MYG + +I   P+EA  +++ 
Sbjct: 334 DMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMGMYGQIMNITSDPIEA-LNKSD 392

Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR--SVP-AIQDAW 292
           ++VG G++MEG E N +VYDL+ + A+  + +D +A+   +   RY    SVP  +  AW
Sbjct: 393 SLVGFGLTMEGQEGNEIVYDLLLDQAWSAKPIDTRAYFQSWVRSRYSGNFSVPNELYTAW 452

Query: 293 NVLYHTVYNCTDGAT-DKNRDVIVAFPDVDPSIISVTEGKYQ-----NYGKPVSKEAVLK 346
           ++L  TVYN T+  T    + +    PD+   +  V  G Y      NY   V  E    
Sbjct: 453 DLLRKTVYNNTNLTTYSLTKSIFEISPDIAGLVGRV--GHYPTPTSINYDPMVLNEVWSL 510

Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
              ++   P LW+S +                     Y YD++D+TRQ +      ++ +
Sbjct: 511 FMNATRKEPSLWHSPA---------------------YEYDMVDITRQLMGNAFVNVYSD 549

Query: 407 IIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
           +I +++    N    V   S R L L+  +D +L+C++ F L  W+ SA+      E +
Sbjct: 550 LISSWKSETENRTTNVTSQSERLLNLLSAIDKVLSCNENFSLTTWISSARDWGNTTETK 608


>gi|237717696|ref|ZP_04548177.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
 gi|229453015|gb|EEO58806.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
          Length = 729

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 144/459 (31%), Positives = 234/459 (50%), Gaps = 35/459 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GW  PLP+ WL  Q  LQ++I+ R  E  M PVLPAF+G+VPAAL+ V+P+ K T
Sbjct: 196 MCNLDGWQSPLPKEWLSSQAALQEQIVAREREFNMRPVLPAFAGHVPAALKRVYPNIKTT 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W       R  CT+L +  D L+  I + ++ +Q + YG T+HIY  D F+E  PP
Sbjct: 256 RVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTEQTRLYG-TNHIYGIDPFNEIDPP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
               + +  +   IY  + + D +AVWL   WLF  D   W  P++K+ L SVP  +L++
Sbjct: 312 SWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYADIKHWTTPRIKSYLRSVPQDRLIL 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD F E   IW  +  ++G PY+WC L NF GN  + G ++ ++    +A  +  + + G
Sbjct: 372 LDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLSGPVNLVSERLADALKNGGSNLKG 431

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VG ++EGI+ N  +Y+ + + A+   + D K W  + + RR G+  P  + AW +L + V
Sbjct: 432 VGSTLEGIDLNQFMYEFVLDKAWNGGQTD-KEWFFKLADRRIGKISPEARKAWEILANKV 490

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                   V P+   V +G   N  +P      LK        P + Y
Sbjct: 491 Y-------------------VQPA--QVGQGTLTN-ARP-----CLKGNGHWTTKPTIEY 523

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              +++ A  L ++  +     ++Y +DL+++ RQ L  Y N +      AY+  D   +
Sbjct: 524 QPKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAGDIPMM 581

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
                +  E++ D+D L++CH  F L  W+  A+ +  +
Sbjct: 582 KNRGNKMREILADLDKLVSCHPTFSLHKWITDARDMGHD 620


>gi|83775903|dbj|BAE66022.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 633

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 152/479 (31%), Positives = 250/479 (52%), Gaps = 44/479 (9%)

Query: 1   MSNLHG-WGG-PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
           + N+ G WGG  +  +W++ Q  LQKKI+ RI ELGM PVLPAF G VP A++ V P A 
Sbjct: 73  LGNIQGSWGGHGVSIAWIEAQFELQKKIVSRIVELGMTPVLPAFPGFVPPAIKRVRPHAT 132

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           +     W   +   ++     L+  D  F ++ ++ I +Q + +G  +H+Y  D F+E  
Sbjct: 133 VVNGSQWSGFQK--KFTEVSFLNPLDETFAQLQKSVISRQTRAFGNVTHVYALDQFNEIN 190

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP--FWRPPQMKALLNSVPLGK 176
           P      Y+ +L    +  +++ +  AVW+MQGWLF YD   FW P ++ A L+ V    
Sbjct: 191 PASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YDKKDFWDPNRISAYLSGVERND 249

Query: 177 -LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
            +++LDL++E KP W  ++ ++G P+IWC LH+F GN+ MYG + +I   P+EA  +++ 
Sbjct: 250 DMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMGMYGQIMNITSDPIEA-LNKSD 308

Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR--SVP-AIQDAW 292
           ++VG G++MEG E N +VYDL+ + A+  + +D +A+   +   RY    SVP  +  AW
Sbjct: 309 SLVGFGLTMEGQEGNEIVYDLLLDQAWSAKPIDTRAYFQSWVRSRYSGNFSVPNELYTAW 368

Query: 293 NVLYHTVYNCTDGAT-DKNRDVIVAFPDVDPSIISVTEGKYQ-----NYGKPVSKEAVLK 346
           ++L  TVYN T+  T    + +    PD+   +  V  G Y      NY   V  E    
Sbjct: 369 DLLRKTVYNNTNLTTYSLTKSIFEISPDIAGLVGRV--GHYPTPTSINYDPMVLNEVWSL 426

Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
              ++   P LW+S +                     Y YD++D+TRQ +      ++ +
Sbjct: 427 FMNATRKEPSLWHSPA---------------------YEYDMVDITRQLMGNAFVNVYSD 465

Query: 407 IIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
           +I +++    N    V   S R L L+  +D +L+C++ F L  W+ SA+      E +
Sbjct: 466 LISSWKSETENRTTNVTSQSERLLNLLSAIDKVLSCNENFSLTTWISSARDWGNTTETK 524


>gi|299149196|ref|ZP_07042257.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
 gi|298512863|gb|EFI36751.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
          Length = 738

 Score =  246 bits (628), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 144/459 (31%), Positives = 234/459 (50%), Gaps = 35/459 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GW  PLP+ WL  Q  LQ++I+ R  E  M PVLPAF+G+VPAAL+ V+P+ K T
Sbjct: 205 MCNLDGWQSPLPKEWLSSQAALQEQIVAREREFNMRPVLPAFAGHVPAALKRVYPNIKTT 264

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W       R  CT+L +  D L+  I + ++ +Q + YG T+HIY  D F+E  PP
Sbjct: 265 RVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTEQTRLYG-TNHIYGIDPFNEIDPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
               + +  +   IY  + + D +AVWL   WLF  D   W  P++K+ L SVP  +L++
Sbjct: 321 SWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYADIKHWTTPRIKSYLRSVPQDRLIL 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD F E   IW  +  ++G PY+WC L NF GN  + G ++ ++    +A  +  + + G
Sbjct: 381 LDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLSGPVNLVSERLADALKNGGSNLKG 440

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VG ++EGI+ N  +Y+ + + A+   + D K W  + + RR G+  P  + AW +L + V
Sbjct: 441 VGSTLEGIDLNQFMYEFVLDKAWNGGQTD-KEWFFKLADRRIGKISPEARKAWEILANKV 499

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                   V P+   V +G   N  +P      LK        P + Y
Sbjct: 500 Y-------------------VQPA--QVGQGTLTN-ARP-----CLKGNGHWTTKPTIEY 532

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              +++ A  L ++  +     ++Y +DL+++ RQ L  Y N +      AY+  D   +
Sbjct: 533 QPKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAGDIPMM 590

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
                +  E++ D+D L++CH  F L  W+  A+ +  +
Sbjct: 591 KNRGNKMREILADLDKLVSCHPTFSLHKWITDARDMGHD 629


>gi|294647264|ref|ZP_06724861.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
 gi|292637401|gb|EFF55822.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
          Length = 733

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 153/472 (32%), Positives = 237/472 (50%), Gaps = 39/472 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLPQSWL  Q  LQK IL R     M P+LPAF+G+VPA L+ ++P AKI 
Sbjct: 196 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W       R   ++ +D  D L+  I R F+E+Q K YG T+HIY  D F+E   P
Sbjct: 256 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
             + E++S++   IY  +Q  DS A WL   W+F +    W  P++K+ LN+VP  KL++
Sbjct: 312 NWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + +   IW  ++Q+YG PYIWC L NF GN  + G L+ + F            + G
Sbjct: 372 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 431

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G+++EG++ NP++Y+ + E A+Q+  + V  WI  ++  R G     I  AW  LY  +
Sbjct: 432 LGVTLEGLDVNPLMYEFVFERAWQN-SMPVHQWIANWAQCRGGNVDNHIVKAWKQLYEKI 490

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y  T  A      ++ A P ++      T   Y                   Y +  LW 
Sbjct: 491 Y--TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWE 530

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              E+++A  ++          + Y +D+I++ RQ L     +      + Y+  D  G 
Sbjct: 531 IWKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKDLEGT 581

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK----QVRC 467
               +R  +L+ D+D LL C   F +G W++ A+  A NE+++K      RC
Sbjct: 582 KVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFAVNEQEQKYYEENARC 633


>gi|345511813|ref|ZP_08791352.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
 gi|229443748|gb|EEO49539.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
          Length = 720

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 153/472 (32%), Positives = 237/472 (50%), Gaps = 39/472 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLPQSWL  Q  LQK IL R     M P+LPAF+G+VPA L+ ++P AKI 
Sbjct: 183 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 242

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W       R   ++ +D  D L+  I R F+E+Q K YG T+HIY  D F+E   P
Sbjct: 243 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 298

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
             + E++S++   IY  +Q  DS A WL   W+F +    W  P++K+ LN+VP  KL++
Sbjct: 299 NWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 358

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + +   IW  ++Q+YG PYIWC L NF GN  + G L+ + F            + G
Sbjct: 359 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 418

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G+++EG++ NP++Y+ + E A+Q+  + V  WI  ++  R G     I  AW  LY  +
Sbjct: 419 LGVTLEGLDVNPLMYEFVFERAWQN-SMPVHQWIANWAQCRGGNVDNHIVKAWKQLYEKI 477

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y  T  A      ++ A P ++      T   Y                   Y +  LW 
Sbjct: 478 Y--TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWE 517

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              E+++A  ++          + Y +D+I++ RQ L     +      + Y+  D  G 
Sbjct: 518 IWKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKDLEGT 568

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK----QVRC 467
               +R  +L+ D+D LL C   F +G W++ A+  A NE+++K      RC
Sbjct: 569 KVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFAVNEQEQKYYEENARC 620


>gi|262407713|ref|ZP_06084261.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
 gi|262354521|gb|EEZ03613.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
          Length = 735

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 153/472 (32%), Positives = 237/472 (50%), Gaps = 39/472 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLPQSWL  Q  LQK IL R     M P+LPAF+G+VPA L+ ++P AKI 
Sbjct: 198 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W       R   ++ +D  D L+  I R F+E+Q K YG T+HIY  D F+E   P
Sbjct: 258 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 313

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
             + E++S++   IY  +Q  DS A WL   W+F +    W  P++K+ LN+VP  KL++
Sbjct: 314 NWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 373

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + +   IW  ++Q+YG PYIWC L NF GN  + G L+ + F            + G
Sbjct: 374 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 433

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G+++EG++ NP++Y+ + E A+Q+  + V  WI  ++  R G     I  AW  LY  +
Sbjct: 434 LGVTLEGLDVNPLMYEFVFERAWQN-SMPVHQWIANWAQCRGGNVDNHIVKAWKQLYEKI 492

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y  T  A      ++ A P ++      T   Y                   Y +  LW 
Sbjct: 493 Y--TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWE 532

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              E+++A  ++          + Y +D+I++ RQ L     +      + Y+  D  G 
Sbjct: 533 IWKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKDLEGT 583

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK----QVRC 467
               +R  +L+ D+D LL C   F +G W++ A+  A NE+++K      RC
Sbjct: 584 KVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFAVNEQEQKYYEENARC 635


>gi|294807833|ref|ZP_06766618.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
           CC 1b]
 gi|294444952|gb|EFG13634.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
           CC 1b]
          Length = 703

 Score =  246 bits (627), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 153/472 (32%), Positives = 237/472 (50%), Gaps = 39/472 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLPQSWL  Q  LQK IL R     M P+LPAF+G+VPA L+ ++P AKI 
Sbjct: 166 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 225

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W       R   ++ +D  D L+  I R F+E+Q K YG T+HIY  D F+E   P
Sbjct: 226 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 281

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
             + E++S++   IY  +Q  DS A WL   W+F +    W  P++K+ LN+VP  KL++
Sbjct: 282 NWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 341

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + +   IW  ++Q+YG PYIWC L NF GN  + G L+ + F            + G
Sbjct: 342 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 401

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G+++EG++ NP++Y+ + E A+Q+  + V  WI  ++  R G     I  AW  LY  +
Sbjct: 402 LGVTLEGLDVNPLMYEFVFERAWQN-SMPVHQWIANWAQCRGGNVDNHIVKAWKQLYEKI 460

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y  T  A      ++ A P ++      T   Y                   Y +  LW 
Sbjct: 461 Y--TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWE 500

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              E+++A  ++          + Y +D+I++ RQ L     +      + Y+  D  G 
Sbjct: 501 IWKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKDLEGT 551

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK----QVRC 467
               +R  +L+ D+D LL C   F +G W++ A+  A NE+++K      RC
Sbjct: 552 KVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFAVNEQEQKYYEENARC 603


>gi|391873368|gb|EIT82411.1| alpha-N-acetylglucosaminidase [Aspergillus oryzae 3.042]
          Length = 633

 Score =  246 bits (627), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 151/481 (31%), Positives = 247/481 (51%), Gaps = 48/481 (9%)

Query: 1   MSNLHG-WGG-PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
           + N+ G WGG  +  +W++ Q  LQKKI+ RI ELGM PVLPAF G VP A++ V P A 
Sbjct: 73  LGNIQGSWGGHGVSIAWIEAQFELQKKIVSRIVELGMTPVLPAFPGFVPPAIKRVRPHAT 132

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           +     W   +   ++     L   D  F ++ ++ I +Q++ +G  +H+Y  D F+E  
Sbjct: 133 VVNGSQWSGFQK--KFTEVSFLSPLDRTFADLQKSVISRQMRAFGNITHVYALDQFNEIN 190

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP--FWRPPQMKALLNSVPLGK 176
           P      Y+ +L    +  +++ +  AVW+MQGWLF YD   FW   ++ A L+ V    
Sbjct: 191 PASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YDKKDFWDSNRISAYLSGVERND 249

Query: 177 -LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
            +++LDL++E KP W  ++ ++G P+IWC LH+F GN+ MYG + +I   P+EA    N 
Sbjct: 250 DMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMGMYGQIMNITSDPIEALNKSN- 308

Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR--SVP-AIQDAW 292
           ++VG G++MEG E N +VYDL+ + A+    +D +A+   +   RY R  SVP  +  AW
Sbjct: 309 SLVGFGLTMEGQEGNEIVYDLLLDQAWSATPIDTRAYFQSWVRSRYSRNFSVPNELYTAW 368

Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS--------KEAV 344
           ++L  TVYN T+  T     V  +  ++ P I  +  G+  +Y  P S         E  
Sbjct: 369 DLLRKTVYNNTNLTT---YSVTKSIFEISPDIAGLV-GRVGHYPTPTSINYDPMVLNEVW 424

Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
                ++   P LW++ +                     Y YD++D+TRQ +      ++
Sbjct: 425 SLFMNATRKEPSLWHNPA---------------------YEYDMVDITRQLMGNAFVNVY 463

Query: 405 LNIIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
             +I +++    N    V   S R L L+  +D +L+C++ F L  W+ SA+      E 
Sbjct: 464 SVLITSWKSETENRTTKVTSQSERLLNLLSAIDKVLSCNENFSLATWISSARDWGNTTET 523

Query: 462 E 462
           +
Sbjct: 524 K 524


>gi|423722278|ref|ZP_17696454.1| hypothetical protein HMPREF1078_00517 [Parabacteroides merdae
           CL09T00C40]
 gi|409242419|gb|EKN35181.1| hypothetical protein HMPREF1078_00517 [Parabacteroides merdae
           CL09T00C40]
          Length = 752

 Score =  245 bits (626), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 154/465 (33%), Positives = 220/465 (47%), Gaps = 51/465 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGPLP+SW+D+ +VL K+I+ R  ELGM P+   FSG VP  L+  +P AKI 
Sbjct: 192 MQNLQSYGGPLPKSWIDKHIVLGKQIIDRELELGMQPIQQGFSGYVPRELKEKYPDAKI- 250

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
                   +  P WC       LD TD LF  IGR F+E++ K YG    +Y  D F E+
Sbjct: 251 --------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEEEKKLYG-AHGVYAADPFHES 301

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
            PPVD+PEY+ ++G AI+      D +++W MQ W        R P +KA    VP   L
Sbjct: 302 QPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSL------REPIVKA----VPKENL 351

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LDL              +G P +   LHNF G I ++G L  +A         +N  +
Sbjct: 352 LILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHGDLRLLASNQYVNAVKKNPNV 408

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
            G G+ ME IEQNPV YDL  EM    ++V+++ W+ +Y+ RRYG+       AW  L  
Sbjct: 409 CGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYADRRYGKPSENAHQAWLHLLE 468

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
             Y      T+  R  I+A     P++     G     G P                   
Sbjct: 469 GPYRPGTNGTE--RSSIIA---ARPAVNVKKSGPNAGLGIP------------------- 504

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            YS   V++A  L +     L  S+ YR+D++D+ RQ ++     +     EA++  D  
Sbjct: 505 -YSPLSVVQAEGLLLKDAGRLKGSDPYRFDIVDIQRQLMSNLGQAIHKQAAEAFRKKDKE 563

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
                S RFLE++ D D LL     F    WL  A+    N E++
Sbjct: 564 AFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGDNSEEK 608


>gi|154492110|ref|ZP_02031736.1| hypothetical protein PARMER_01741 [Parabacteroides merdae ATCC
           43184]
 gi|154087335|gb|EDN86380.1| Alpha-N-acetylglucosaminidase (NAGLU) [Parabacteroides merdae ATCC
           43184]
          Length = 752

 Score =  244 bits (624), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 150/465 (32%), Positives = 220/465 (47%), Gaps = 51/465 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGPLP+SW+D+ +VL K+I+ R  ELGM P+   FSG VP  L+  +P AKI 
Sbjct: 192 MQNLQSYGGPLPKSWIDKHIVLGKQIIDRELELGMQPIQQGFSGYVPRELKEKYPDAKI- 250

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
                   +  P WC       LD TD LF  IGR F+E++ K YG    +Y  D F E+
Sbjct: 251 --------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEEEKKLYG-AHGVYAADPFHES 301

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
            PPVD+PEY+ ++G AI+      D +++W MQ W             ++++ +VP   L
Sbjct: 302 QPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSLR----------ESIVKAVPKENL 351

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LDL              +G P +   LHNF G I ++G L  +A         +N  +
Sbjct: 352 LILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHGDLRLLASNQYVNAVKKNPNV 408

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
            G G+ ME IEQNPV YDL  EM    ++V+++ W+ +Y+ RRYG+       AW  L  
Sbjct: 409 CGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYADRRYGKPSENAHQAWLHLLE 468

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
             Y      T+  R  I+A     P++     G     G P                   
Sbjct: 469 GPYRPGTNGTE--RSSIIA---ARPAVNVKKSGPNAGLGIP------------------- 504

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            YS   V++A  L +     L  S+ YR+D++D+ RQ ++     +     EA++  D  
Sbjct: 505 -YSPLSVVQAEGLLLKDAGRLKGSDPYRFDIVDIQRQLMSNLGQAIHKQAAEAFRKKDKE 563

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
                S RFLE++ D D LL     F    WL  A+    N E++
Sbjct: 564 AFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGDNSEEK 608


>gi|424665881|ref|ZP_18102917.1| hypothetical protein HMPREF1205_01756 [Bacteroides fragilis HMW
           616]
 gi|404574134|gb|EKA78885.1| hypothetical protein HMPREF1205_01756 [Bacteroides fragilis HMW
           616]
          Length = 732

 Score =  243 bits (621), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 157/474 (33%), Positives = 236/474 (49%), Gaps = 54/474 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGP+ Q ++D+Q  LQKK++ R+ E GM PVL  F G VP ++   FP+A I 
Sbjct: 198 MGNLEKFGGPVSQQFIDRQTQLQKKMIDRMREYGMEPVLQGFYGMVPNSMITKFPNADIR 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W + +          L  +DPLF ++ + F E+Q K +G+ S  Y  D F E  N+
Sbjct: 258 DAGKWITYQRPA------FLVPSDPLFAKVAQIFYEEQEKLFGK-SRYYGGDPFHEGGNS 310

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             ++    I+   + IY  M++ + DA+W++QGW    +P +      ALL  +  G+ +
Sbjct: 311 EGIN----ITEAASDIYKAMKANNPDAIWVLQGW--GANPSY------ALLKGLKQGEAL 358

Query: 179 VLDLFAEVKPIW-----STSKQFYGV---PYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
           +LDL +  +P W     S S +  G     +IWC L NF G I MYG L S A G + A 
Sbjct: 359 ILDLMSCARPQWGGDPSSQSHREDGYLDHNWIWCALPNFGGRIGMYGKLQSYATGVIRAE 418

Query: 231 TSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQ 289
                  V GVG + EGI  NP+ YD++ +MA++ + +DVK+WI  Y+  RYG      +
Sbjct: 419 HHPKGKYVCGVGTTPEGIGTNPIDYDMVYDMAWRTDSIDVKSWIANYTTYRYGSPNNNAK 478

Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 349
            A   L  +VYNC   A         A P +                            T
Sbjct: 479 AAMQQLSTSVYNCPWAADGPQESYFCARPSLKID------------------------RT 514

Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
           SS+   HL+Y    V++ALE  + + NEL   +TYRYD++D+TRQ LA Y   +   I +
Sbjct: 515 SSWGTAHLYYQPINVLQALEHLLKAENELKEIDTYRYDVVDVTRQMLADYGKYIHKCIAD 574

Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
           AY   D       + +FL+++ D D LL+    FLLG ++  A     N  +++
Sbjct: 575 AYYGKDTEKFDFYTSKFLQMISDQDLLLSTRKEFLLGKFIRQADACGSNPMEKR 628


>gi|224025137|ref|ZP_03643503.1| hypothetical protein BACCOPRO_01871 [Bacteroides coprophilus DSM
           18228]
 gi|224018373|gb|EEF76371.1| hypothetical protein BACCOPRO_01871 [Bacteroides coprophilus DSM
           18228]
          Length = 718

 Score =  243 bits (621), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 158/475 (33%), Positives = 247/475 (52%), Gaps = 58/475 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW ++Q  LQK+IL R+ E G+ PVLP +SG VP   ++      + 
Sbjct: 187 MNNLEGWGGPNPDSWYERQEELQKRILKRMREYGIEPVLPGYSGMVPHNAKDRL-GLNVA 245

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      PR      L  TDP F  I   +  +  + YG+ S+ Y+ D F E  NT
Sbjct: 246 DPGRW---NGYPR---PAFLQPTDPQFERIAALYYREMTRLYGKVSY-YSMDPFHEGGNT 298

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             VD    + + G AI+  M+  +  A W++Q W    +P    PQM   + ++P G +V
Sbjct: 299 SGVD----LEAAGKAIWKAMKQANPRAAWVVQAW--GANP---RPQM---IRNLPAGDMV 346

Query: 179 VLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
           VLDLF+E +P W         K+ +G   +++CML N+ GN+ ++G +  +     +A+ 
Sbjct: 347 VLDLFSESRPQWGDPASSWYRKEGFGQHDWLFCMLLNYGGNVGLHGKMAHLIEEFYKAKD 406

Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
           S    T+ GVGM+MEGIE NPV+Y+L+ E+ ++ ++     W+  Y   RYG+S   +  
Sbjct: 407 SSFGKTLKGVGMTMEGIENNPVMYELLCELPWREQRFSKDEWLEGYLKARYGKSDSQVSQ 466

Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
           AW +L +T+YNC   +T +  +  ++ A P      +S                    SE
Sbjct: 467 AWMLLSNTIYNCPAASTQQGTHESILCARPSWKAYQVSSW------------------SE 508

Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
            S Y      Y  ++VIRA  + + +      +N + YDL+D+ RQA+A+    ++  ++
Sbjct: 509 MSDY------YDPADVIRAAGMMVDAAERFRGNNNFEYDLVDIVRQAVAEKGRLMYRVLV 562

Query: 409 EAYQLNDAHGVFQLSR-RFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
           +AY+  D   +F+LS  RFL L+   D LLA    F +G WLESA+ L   EE++
Sbjct: 563 DAYKAGDRE-LFKLSSDRFLRLILMQDRLLATRSEFKVGRWLESARNLGSTEEEK 616


>gi|410095990|ref|ZP_11290981.1| hypothetical protein HMPREF1076_00159 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409227396|gb|EKN20294.1| hypothetical protein HMPREF1076_00159 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 753

 Score =  243 bits (619), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 156/465 (33%), Positives = 221/465 (47%), Gaps = 51/465 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGPLP+SW+D    L KK++ R  ELGM P+   FSG VP  L+N +P AKI 
Sbjct: 193 MQNLQSYGGPLPKSWIDSHAELGKKVINRQLELGMQPIQQGFSGYVPRELKNKYPDAKI- 251

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
                   +  P WC       LD TD LF   GR F+E++ K +G    +Y  D F E+
Sbjct: 252 --------QLQPSWCGFTGAAQLDPTDSLFSAFGRDFLEEEKKLFG-AHGVYAADPFHES 302

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
            PP+D+PEY+S++G +IY   Q  D  A+W MQ W        R P +KA    VP   L
Sbjct: 303 RPPIDTPEYLSAVGNSIYKLFQDFDPSAIWAMQAWSL------REPIVKA----VPKEHL 352

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LDL        +T    +G P +   LHNF G I ++G L  +A         ++  +
Sbjct: 353 LILDLNGGRSRQENTC---WGYPVVAGNLHNFGGRINLHGDLRLLASNQYAVAKQKSPNV 409

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
            G G+ ME IEQNPV YDL  EM    ++VD++ W+  Y+ RRYG +      AW  L  
Sbjct: 410 CGSGLFMESIEQNPVYYDLAFEMPLHADEVDIEEWLGDYAERRYGAASENAHKAWLHLLE 469

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
             Y      T+  R  I+A     P++     G     G P                   
Sbjct: 470 GPYRPGTNGTE--RSSIIA---ARPALNVKKSGPNAGLGIP------------------- 505

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            YS   VI+A  L +   ++L+AS  YR+D++D+ RQ ++     +     EA+   D  
Sbjct: 506 -YSPLLVIQAQGLLLKDADKLNASTPYRFDVVDIQRQLMSNLGQAIHKKAAEAFVKKDKA 564

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
                S RFLE++ D+D LL     F    WL  A+      E++
Sbjct: 565 AFTLHSNRFLEMLRDVDVLLRTRPEFNFDKWLTDARSWGTTNEEK 609


>gi|404406328|ref|ZP_10997912.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
          Length = 738

 Score =  243 bits (619), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 146/463 (31%), Positives = 219/463 (47%), Gaps = 35/463 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSNL  W  PLPQSWLD Q+ LQK+I+ R  EL M PVLPAF+G+VPA L  ++P AKI+
Sbjct: 206 MSNLDYWQSPLPQSWLDAQVELQKRIVARERELNMKPVLPAFAGHVPAELGEIYPEAKIS 265

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W   +   R   ++ LD  DPLF  I R F+ +Q   +G T HIY  D F+E  PP
Sbjct: 266 RMSKWGGFEDRYR---SHFLDPLDPLFARIQREFLAEQTALFG-TDHIYGADPFNEVDPP 321

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              PE+++ +   IY  M   D +A WL   WLF  D   W   +++A + +VP  K+++
Sbjct: 322 SWEPEFLARVSRTIYDTMTEADPEAEWLQMTWLFYLDRDKWHDDRIEAFVTAVPQDKMLL 381

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + E   +W  +  ++G PY WC L NF GN  + G  D ++             + G
Sbjct: 382 LDYYCENTEVWRQTHSYHGQPYFWCYLGNFGGNTMLVGNFDEVSKRIDGVLAEGGNNLRG 441

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG++ NP +YD + E A+    VD   W +  + R  G      + AW+ L   V
Sbjct: 442 LGSTLEGLDSNPFMYDYVFERAWDF-PVDDDRWFDALADRYLGYEDTGYRRAWDALRKNV 500

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                       I S   G         + E +L   T +     + Y
Sbjct: 501 Y-----------------------ITSSKYGHCPLLNARPTLEGILTGTTDA----EIKY 533

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              E+       I +G+  S  +TYRY L+++ RQ L      L      A +  D   +
Sbjct: 534 DNDELFEVWAKMIDAGD--SGRDTYRYWLVNVGRQTLGNLFLPLRDGFTAACRAKDLARM 591

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
            +L    LEL  D++ L A H  F +  W++ ++      E+ 
Sbjct: 592 KELRSEMLELAADLETLTAQHGAFSMQKWIDDSRSFGTTPEER 634


>gi|224026593|ref|ZP_03644959.1| hypothetical protein BACCOPRO_03350 [Bacteroides coprophilus DSM
           18228]
 gi|224019829|gb|EEF77827.1| hypothetical protein BACCOPRO_03350 [Bacteroides coprophilus DSM
           18228]
          Length = 635

 Score =  243 bits (619), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 156/464 (33%), Positives = 225/464 (48%), Gaps = 45/464 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGPLP+SW+D+ +VL ++I+ R  ELGM P+   FSG VP  L+  +P AKI 
Sbjct: 196 MQNLQSYGGPLPKSWIDKHVVLGQQIIRRELELGMKPIQQGFSGYVPRELKEKYPEAKIQ 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
              +W   K   +      LD TD LF  IGR F+E++ K +G    +Y  D F E+ PP
Sbjct: 256 PQPSWCGFKGAAQ------LDPTDSLFQVIGRDFLEEEKKLFG-AHGVYAADPFHESRPP 308

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           VD+PEY+S++G +I++  Q  D  ++W MQ W        R P +KA    VP   L++L
Sbjct: 309 VDTPEYLSAVGRSIHTLFQEFDPYSLWAMQAWSL------REPIVKA----VPEEHLLIL 358

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL        +     +G P +   LHNF G I M+G L  +A    EA  S +  + G 
Sbjct: 359 DLNGSKC---TQRNACWGYPVVAGNLHNFGGRINMHGDLPLLAGNQYEAAVSLSPNVCGS 415

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G+ MEGIEQNP+ Y+L  EM  Q  KV++  W+ +Y++RRYG        A  +L    Y
Sbjct: 416 GLFMEGIEQNPLYYELAFEMPLQKGKVELDGWLKEYALRRYGSKWENTHKALLLLLEGPY 475

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
                 T+ +  +I A     P++     G     G P                    YS
Sbjct: 476 RPGTNGTELS-SIIAA----RPALHVKKSGPNAGLGIP--------------------YS 510

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
              +I A    +     L  S  YR+D++DL RQ +      +     +A++  D  G  
Sbjct: 511 PWLLIEAQAFMLKDAGILKTSEAYRFDIMDLQRQIMTNLGQAIHKEAAKAFEAGDEKGFE 570

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
             SRR+LEL+ D+D LL     F    WL  A+     EE++ Q
Sbjct: 571 LHSRRYLELLTDVDTLLRTRPEFNFDRWLADARSWGDTEEEKNQ 614


>gi|423345423|ref|ZP_17323112.1| hypothetical protein HMPREF1060_00784 [Parabacteroides merdae
           CL03T12C32]
 gi|409223209|gb|EKN16146.1| hypothetical protein HMPREF1060_00784 [Parabacteroides merdae
           CL03T12C32]
          Length = 752

 Score =  242 bits (617), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 153/465 (32%), Positives = 220/465 (47%), Gaps = 51/465 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGPLP+SW+D+ +VL K+I+ R  ELGM P+   FSG VP  L+  +P AKI 
Sbjct: 192 MQNLQSYGGPLPKSWIDKHIVLGKQIIDRELELGMQPIQQGFSGYVPRELKEKYPDAKI- 250

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
                   +  P WC       LD TD LF  IGR F+E++ K YG    +Y  D F E+
Sbjct: 251 --------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEEEKKLYG-AHGVYAADPFHES 301

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
            PPVD+PEY+ ++G AI+      D +++W MQ W        R P +KA    VP   L
Sbjct: 302 QPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSL------REPIVKA----VPKENL 351

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LDL              +G P +   LHNF G I ++G L  +A         +N  +
Sbjct: 352 LILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHGDLRLLASNQYVNAVKKNPNV 408

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
            G G+ ME IEQNPV YDL  EM    ++V+++ W+ +Y+ RRYG+       AW  L  
Sbjct: 409 CGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYADRRYGKPSENAHQAWLHLLE 468

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
             Y      T+  R  I+A     P++     G     G P                   
Sbjct: 469 GPYRPGTNGTE--RSSIIA---ARPAVNVKKSGPNAGLGIP------------------- 504

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            YS   V++A  L +     L  S+ YR+D++D+ RQ ++     +     +A++  D  
Sbjct: 505 -YSPLSVVQAEGLLLKDAARLEDSDPYRFDIVDIQRQLMSNLGQVIHKQAAKAFRKKDKE 563

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
                S RFLE++ D D LL     F    WL  A+    N E++
Sbjct: 564 AFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGDNSEEK 608


>gi|336417192|ref|ZP_08597519.1| hypothetical protein HMPREF1017_04627 [Bacteroides ovatus
           3_8_47FAA]
 gi|423297818|ref|ZP_17275878.1| hypothetical protein HMPREF1070_04543 [Bacteroides ovatus
           CL03T12C18]
 gi|335936512|gb|EGM98438.1| hypothetical protein HMPREF1017_04627 [Bacteroides ovatus
           3_8_47FAA]
 gi|392664455|gb|EIY57993.1| hypothetical protein HMPREF1070_04543 [Bacteroides ovatus
           CL03T12C18]
          Length = 727

 Score =  242 bits (617), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 142/459 (30%), Positives = 230/459 (50%), Gaps = 35/459 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GW  PLP+ WL  Q  LQ++I+ R  E  M PVLPAF+G+VPAAL+ V+P+ K +
Sbjct: 194 MCNLDGWQSPLPKEWLSSQAELQEQIVAREREFNMQPVLPAFAGHVPAALKRVYPNIKTS 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W       R  CT+L +  D L+  I + ++ +Q + YG T+HIY  D F+E  PP
Sbjct: 254 RVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTEQTRLYG-TNHIYGIDPFNEIDPP 309

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
               + +  +   IY  + + D  A+WL   WLF  D   W  P++K+ L SVP  KL++
Sbjct: 310 SWDTDSLGMMAKHIYESVAAVDPKAIWLQMTWLFYADIKHWTTPRIKSYLRSVPQDKLIL 369

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD F E   IW  +  ++G PY+WC L NF GN  + G +  ++    +A  +  + + G
Sbjct: 370 LDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLSGPVKLVSERLADALKNGGSNLKG 429

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VG ++EGI+ N  +Y+ + + A+   + D K W  + + RR G+  P  + AW +L   V
Sbjct: 430 VGSTLEGIDLNQFMYEFVLDKAWNSGQTD-KEWFLKLADRRTGKVSPEARKAWEILADKV 488

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                   + P+   V +G   N  +P      LK        P + Y
Sbjct: 489 Y-------------------IQPA--QVGQGTLTN-ARP-----CLKGNGHWTTKPTIEY 521

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              +++ A  L +   +     ++Y +DL+++ RQ L  Y N +      AY+  D   +
Sbjct: 522 QPKDLVEAWRLLLLVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAGDIMMM 579

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
                +  E++ D+D L++CH  F L  W+  A+ +  +
Sbjct: 580 KNRGDKMREILADLDKLVSCHPTFSLNKWITDARDMGHD 618


>gi|282877910|ref|ZP_06286719.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
           35310]
 gi|281299911|gb|EFA92271.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
           35310]
          Length = 723

 Score =  242 bits (617), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 150/460 (32%), Positives = 227/460 (49%), Gaps = 38/460 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP SWL+QQ  LQ++IL+R   L M PVLPAFSG+VPA L+ ++P A I 
Sbjct: 199 MANIDKWNGPLPMSWLEQQKELQQRILLRERSLNMKPVLPAFSGHVPAKLKELYPQANIK 258

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W     + R    + L+  DPLF +I + ++E+Q   +G T HIY  D F+E  PP
Sbjct: 259 YLGRWAGFSDNYR---CHFLNPEDPLFAKIQKMYLEEQKALFG-TDHIYGIDPFNEVDPP 314

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
              PEY+  +   IY  + S D  A W+   W+F ++   W P ++KALL  V  GK+ +
Sbjct: 315 SWKPEYLKEISHNIYRTVTSVDPGAEWMQMSWMFYHNKKQWTPKRIKALLTGVSRGKMSL 374

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD   E   +W T+  FYG PYIWC L NF GN  + G +        EA   +N  ++G
Sbjct: 375 LDYHCENVELWKTTNNFYGQPYIWCYLGNFGGNTTITGNVKESGQRLNEALNKKNKNLIG 434

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG++     Y+ +   A+     D K WI+  + R  G S P ++ AW +L++ +
Sbjct: 435 IGSTLEGLDVIQFPYEYILTQAWTATPAD-KEWIDNLADRHVGFSSPKLRQAWQILFNDI 493

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y      T   R + +  P + P +     GKYQ     ++                  Y
Sbjct: 494 Y------TQIPRSLGI-LPALRPIL-----GKYQERRTEIT------------------Y 523

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
            T  +    +L   S       N Y+ DLI + RQ L     +L L +   Y   D  G+
Sbjct: 524 PTKRLEEVWKLM--SDVSECDRNEYQLDLIAVGRQVLGNKFLKLKLELDSCYVNKDLVGL 581

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
            +      E++ D+D L A +    +G W++ A+    N+
Sbjct: 582 QRTGNTMKEVLVDLDYLTAGNSRCSIGKWIDDARAYGNND 621


>gi|379334158|gb|AFD03088.1| putative alpha-N-acetylglucosaminidase [uncultured bacterium 8]
          Length = 726

 Score =  242 bits (617), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 157/461 (34%), Positives = 238/461 (51%), Gaps = 36/461 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           +++L GW GPLPQSW+D+   L ++IL R   LGM PVL  FSG+VP  L      A+ T
Sbjct: 181 LASLDGWSGPLPQSWIDRHADLGRRILARERALGMRPVLQGFSGHVPQELIAER-GARST 239

Query: 61  QLGNW-FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
            L  W F V          +LD  DPLF E G   + +Q + +G T H+Y  D F E TP
Sbjct: 240 TLPWWDFEVG---------MLDPRDPLFEEFGTTLLTEQTRLFG-TDHLYAADPFIETTP 289

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLV 178
           PV  P  ++ +  A++  M + D  A W++Q W FSY   +W P +  A L+++P   ++
Sbjct: 290 PVSDPADLAQVARAVHGVMTAVDDRATWVLQAWPFSYRSRYWTPERTGAFLDAIPDDGML 349

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART-SENTTM 237
           +LDL+AE +P+W  +  +   P++WCMLH+  G   +YG LD IA G   A+  +   ++
Sbjct: 350 ILDLWAEHRPVWQRTDGYRKKPWVWCMLHSLGGRPGLYGKLDEIATGAARAQADARGGSL 409

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
            G+G SME    +PV+Y+L++++A+Q    DV+AW+  ++  RYGR+ P +  AW++L+ 
Sbjct: 410 SGIGASMEAFGGDPVLYELLADVAWQGSVDDVRAWLETWTRARYGRATPGLLRAWDLLHD 469

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           +VY  ++G       VIV  P +        EG  ++      +  V  ++  S D P  
Sbjct: 470 SVY-ASEGPGPPG-SVIVGRPTL--------EGDLRH------ELPVHLADPPSPDVP-- 511

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
               +       L   +  E SA    R DL D+T Q L   A E      +A    DA 
Sbjct: 512 ---PALAEAWALLADEATQEDSAGPLGR-DLCDVTAQVLTHVACERQWRAADAALARDAD 567

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
           G  + +R  L+ +ED+D LLA      L  WL  A+  A  
Sbjct: 568 GFQRAARALLDTIEDLDTLLATRPEHRLDGWLADARGWATT 608


>gi|238506383|ref|XP_002384393.1| alpha-N-acetylglucosaminidase, putative [Aspergillus flavus
           NRRL3357]
 gi|220689106|gb|EED45457.1| alpha-N-acetylglucosaminidase, putative [Aspergillus flavus
           NRRL3357]
          Length = 669

 Score =  241 bits (616), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 149/481 (30%), Positives = 249/481 (51%), Gaps = 48/481 (9%)

Query: 1   MSNLHG-WGG-PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
           + N+ G WGG  +  +W++ Q  LQKKI+ RI ELGM PVLPAF G VP A++ V P A 
Sbjct: 109 LGNIQGSWGGHGVSIAWIEAQFELQKKIVSRIVELGMRPVLPAFPGFVPPAIKRVRPHAT 168

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           +     W   +   ++     L   D  F ++ ++ I +Q++ +G  +H+Y  D F+E  
Sbjct: 169 VVNGSQWSGFQK--KFTEVSFLSPLDRTFADLQKSVISRQMRAFGNITHVYALDQFNEIN 226

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP--FWRPPQMKALLNSVPLGK 176
           P      Y+ +L    +  +++ +  AVW+MQGWLF YD   FW   ++ A L+ V    
Sbjct: 227 PASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YDKKDFWDSNRISAYLSGVERND 285

Query: 177 -LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
            +++LDL++E KP W  ++ ++G P+IWC LH+F GN+ MYG + +I   P+EA  +++ 
Sbjct: 286 DMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMGMYGQIMNITSDPIEA-LNKSD 344

Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR--SVP-AIQDAW 292
           ++VG G++MEG E N +VYDL+ + A+    +D +A+   +   RY    SVP  +  AW
Sbjct: 345 SLVGFGLTMEGQEGNEIVYDLLLDQAWSATPIDTRAYFQSWVRSRYSGNLSVPNELYTAW 404

Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS--------KEAV 344
           ++L  TVYN T+  T     V  +  ++ P I  +  G+  +Y  P S         E +
Sbjct: 405 DLLRKTVYNNTNLTT---YSVTKSIFEISPDIAGLV-GRVGHYPTPTSINYDPMVLNEVL 460

Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
                ++   P LW++ +                     Y YD++D+TRQ +      ++
Sbjct: 461 SLFMNATRKEPSLWHNPA---------------------YEYDMVDITRQLMGNAFVNVY 499

Query: 405 LNIIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
             +I +++    N    V   S R L L+  +D +L+C++ F L  W+ SA+      E 
Sbjct: 500 SVLITSWKSETENRTTKVTSHSERLLNLLSAIDKVLSCNENFSLATWISSARDWGNTTET 559

Query: 462 E 462
           +
Sbjct: 560 K 560


>gi|260642393|ref|ZP_05415712.2| alpha-N-acetylglucosaminidase [Bacteroides finegoldii DSM 17565]
 gi|260622285|gb|EEX45156.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides finegoldii DSM
           17565]
          Length = 735

 Score =  241 bits (616), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 151/472 (31%), Positives = 236/472 (50%), Gaps = 39/472 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLPQSWL  Q  LQK IL R     M P+LPAF+G+VPA L+ ++P AKI 
Sbjct: 198 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W       R   ++ +D  D L+  I   F+E+Q K YG T+HIY  D F+E   P
Sbjct: 258 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQHRFLEEQTKVYG-TNHIYGIDPFNEVDSP 313

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
             + E++S++   IY  +QS DS A WL   W+F +    W  P++K+ LN+VP  KL++
Sbjct: 314 NWNEEFLSNVSDKIYKSIQSVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 373

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + +   IW  ++Q+YG PYIWC L NF GN  + G L+ + F            + G
Sbjct: 374 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 433

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G+++EG++ NP++Y+ + E A+++  + V  WI  ++  R G     I  AW  LY  +
Sbjct: 434 LGVTLEGLDVNPLMYEFVFERAWEN-SIPVHQWIANWAQCRGGNVDNHIIKAWKQLYEKI 492

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y  T  A      ++ A P ++      T   Y                   Y +  LW 
Sbjct: 493 Y--TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWE 532

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              E+++A  ++          + Y +D+I++ RQ L     +      + Y+  D  G 
Sbjct: 533 IWKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFADCYRKKDLEGT 583

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK----QVRC 467
               +R  +L+ D+D LL C     +G W++ A+  A NE+++K      RC
Sbjct: 584 KVWGQRMDQLLLDVDRLLCCSPVLSIGKWIKDARDFAVNEQEQKYYEENARC 635


>gi|393785791|ref|ZP_10373937.1| hypothetical protein HMPREF1068_00217 [Bacteroides nordii
           CL02T12C05]
 gi|392661410|gb|EIY54996.1| hypothetical protein HMPREF1068_00217 [Bacteroides nordii
           CL02T12C05]
          Length = 727

 Score =  241 bits (616), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 147/461 (31%), Positives = 229/461 (49%), Gaps = 39/461 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+ GW GPLP  WLD Q+ LQKKIL R  EL M PVLPAF+G+VPAAL+ ++P A I 
Sbjct: 198 MANIDGWNGPLPMEWLDNQVELQKKILARERELNMKPVLPAFAGHVPAALKRIYPEANIQ 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R    Y L+  +PLF  I + F+++Q + +G T HIY  D F+E  PP
Sbjct: 258 HLGKWAGFADTYR---CYFLNPEEPLFATIQKHFLQEQTRLFG-TDHIYGVDPFNEVDPP 313

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              PEY+S + + +Y  + + D  A W+   W+F +D   W  P++KALL  VP  K+ +
Sbjct: 314 SWEPEYLSQVSSDMYRTLTAADPKAEWMQMTWMFYHDRKDWTAPRIKALLTGVPQDKMFL 373

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD   E   +W  ++ F+G PYIWC L NF GN  + G +         A  +  + + G
Sbjct: 374 LDYHCENVELWKNTEHFHGQPYIWCYLGNFGGNTTLTGNVKESGDRLDNALINGGSNLRG 433

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG++     Y+ + E A+    +D +AW+   + R  G     +++AW++L++ +
Sbjct: 434 IGSTLEGLDVMQFPYEYIFEKAWDL-NLDNEAWLQNLADRHAGTVSQPVREAWDILFNQI 492

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y              V  P         T G   NY +PV  +   ++           Y
Sbjct: 493 Y--------------VQVPK--------TLGVLPNY-RPVMNKPNRRTVID--------Y 521

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
           S + +++A E  + + +     +  R D+I + RQ L  Y   +  +    Y + D  G+
Sbjct: 522 SNATLLQAWEKLLQATD--CNRDALRLDIITVGRQLLGNYFLIVKDDFDRMYTVKDLPGL 579

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
              +    E++ D+D L A H    L  WL  A+ L    E
Sbjct: 580 KARAAEMKEILNDLDRLNAFHSRCALDKWLADARALGTTPE 620


>gi|423346424|ref|ZP_17324112.1| hypothetical protein HMPREF1060_01784 [Parabacteroides merdae
           CL03T12C32]
 gi|409220242|gb|EKN13198.1| hypothetical protein HMPREF1060_01784 [Parabacteroides merdae
           CL03T12C32]
          Length = 718

 Score =  241 bits (615), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 156/474 (32%), Positives = 241/474 (50%), Gaps = 54/474 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ+ LQK+I+ R+ E G+ PV P +SG VP   +       ++
Sbjct: 187 MNNLEGWGGPNPDSWYKQQIALQKRIVKRMREYGIEPVFPGYSGMVPHNAKEKL-GLNVS 245

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W   +          L  TDP F EI   + ++  K YG+  + Y+ D F E    
Sbjct: 246 DPGLWNGYRRPA------FLQPTDPRFEEIASLYYKEMNKLYGKADY-YSMDPFHEGGSV 298

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           V     + + G AI   M+  +  AVW+ Q W    +P    PQM   + ++  G L+VL
Sbjct: 299 VGVD--LDAAGKAIMQAMKKNNPKAVWVAQAW--QANP---RPQM---IGNLEAGDLIVL 348

Query: 181 DLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE 233
           DLFAE +P W         K  +G   +I+CML N+ GN+ ++G +  +     +A+ S 
Sbjct: 349 DLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLNYGGNVGLHGKMKHVIDEFYKAKESP 408

Query: 234 -NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAW 292
              T+ GVGM+MEG E NPV+++L++E+ ++ ++ D   W+ +Y+V RYG+S P +QDAW
Sbjct: 409 FGKTLKGVGMTMEGSENNPVMFELLTELPWRPQRFDKDQWLREYTVARYGKSNPTVQDAW 468

Query: 293 NVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS 350
            +L +++YNC D  T +  +  V  A P         TE  YQ                S
Sbjct: 469 ILLSNSIYNCPDANTQQGTHESVFCARP---------TEHPYQ---------------VS 504

Query: 351 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE- 409
           S+     +Y  + VIRA  + ++  +E   +N + YDL+D+ RQA+A+    L   ++E 
Sbjct: 505 SWSEMKDYYDPNNVIRAAAMMVSVADEFKGNNNFEYDLVDIVRQAIAE-KGRLTEKVVEA 563

Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
           A+   D       S RFL L+   D LLA    F +G W+  A+ L    E+++
Sbjct: 564 AFAAGDKKLYKDASDRFLRLILLQDELLATRPEFKVGTWIARARSLGSTPEEKE 617


>gi|393236266|gb|EJD43816.1| putative alpha-N-acetylglucosaminidase [Auricularia delicata
           TFB-10046 SS5]
          Length = 778

 Score =  241 bits (615), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 144/480 (30%), Positives = 243/480 (50%), Gaps = 50/480 (10%)

Query: 8   GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFS 67
           G  LP  W+D Q  LQKKI+ R+ ELGM P LP+F+G VP A+  V P A +     W S
Sbjct: 219 GSSLPMEWIDDQFELQKKIVRRMVELGMTPALPSFTGFVPRAISRVLPGASVVNGSRW-S 277

Query: 68  VKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYI 127
              D     T+L +  DP F  + ++FIE+Q+  YG  SH+Y  D ++EN P  +   Y+
Sbjct: 278 GFPDALTRVTFL-EPFDPAFARLQKSFIEKQIAAYGPVSHVYTLDQYNENDPLKNDVGYL 336

Query: 128 SSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLG-KLVVLDLFAE 185
             +  + +  +++ D DA+WLMQGWLF S   FW   +++A L  V     +++LDLF+E
Sbjct: 337 RDVSRSTWQSLKAADPDAIWLMQGWLFYSNRGFWTNARVEAFLGGVEKNDDMLILDLFSE 396

Query: 186 VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSME 245
            +P W  +  +YG P+IWC LH++ GN+ +YG + +I    VEA   ++ ++VG G++ME
Sbjct: 397 SEPQWQRTNSYYGKPWIWCQLHDYGGNLGLYGQVMNITLNAVEA-LEKSPSLVGFGLTME 455

Query: 246 GIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY------GRSVP-AIQDAWNVLYHT 298
           G E N ++YDL+   A+  + +D  ++   ++ RRY      G  +P AI +AW++L  T
Sbjct: 456 GQEGNEIMYDLLLSQAWSRKPIDTASYFRSWATRRYNAGGIIGSLLPSAIYNAWDILRTT 515

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VYN T  A++    V  +  ++ P++  +                   +  + +    + 
Sbjct: 516 VYNNTKLASNA---VTKSVFELRPALSGI-------------------ANRTGHHATTIT 553

Query: 359 YSTSEVIRALELFIASGNELSA----SNTYRYDLIDLTRQALAKYANELFLNIIEAYQ-- 412
           Y T  +++A +LF  +     A    +  Y +D +D  RQ L+   +  + +++  Y   
Sbjct: 554 YDTQALVKAYDLFDKAAIYTPALWFNNPAYEFDNVDFARQVLSNAFSTQYDDLVATYNEI 613

Query: 413 ----------LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
                        A  +     R + ++  +D +L     F L  WL+ A+  A+   +E
Sbjct: 614 SKPGGSGATLAEAAKIIHDKGERMMGVLASLDKVLRTSKHFTLKKWLQDARAWARGGHEE 673


>gi|404487206|ref|ZP_11022393.1| hypothetical protein HMPREF9448_02854 [Barnesiella intestinihominis
           YIT 11860]
 gi|404335702|gb|EJZ62171.1| hypothetical protein HMPREF9448_02854 [Barnesiella intestinihominis
           YIT 11860]
          Length = 731

 Score =  240 bits (613), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 156/477 (32%), Positives = 234/477 (49%), Gaps = 55/477 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL G+GGP+ Q ++D+Q  LQ+K+L R+ EL M PV   F G VP +L+  FP A I 
Sbjct: 196 MGNLEGFGGPVSQKFIDRQTDLQQKMLRRMRELDMAPVFQGFYGMVPNSLKEKFPEANIK 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
           + G W + +          LD  DPLF +I   + E+Q K +G+  + +  D F E   +
Sbjct: 256 EQGEWQTYQRPA------FLDPNDPLFDKIADIYYEEQEKLFGKAVY-FAGDPFHEGGQS 308

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D    + +    I   M+    +AVW++QGW        R P M+ LL  +  G+ +
Sbjct: 309 EGID----VKAAAKKILKAMRRKTPEAVWIIQGWQ-------RNP-MRDLLEGLEHGEAI 356

Query: 179 VLDLFAEVKPIWSTSKQ--FYGVP------YIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
           +LDL A  +P W   K   FY         +IWC L NF G   ++G + S A G V A+
Sbjct: 357 ILDLMACERPQWGGIKNSLFYKAEGHMHHDWIWCALPNFGGKTGLHGKMSSYASGVVFAK 416

Query: 231 TSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQ 289
                  + G+G + EGI   PVVYD++ +MA++ + +D+K W+NQY+  RYG++ P   
Sbjct: 417 NHPLGKNLCGIGTAPEGIGTIPVVYDMVYDMAWREDSIDIKDWVNQYTQYRYGKADPNCN 476

Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 349
            AW +L  T+Y C +         I A P                        +      
Sbjct: 477 RAWEILSKTIYECHNEIGGPVESYICARP------------------------SDTIKHA 512

Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
           SS+    ++Y  +E++ A E      +E + S TY+YDL+DLTRQ L  YA  L    + 
Sbjct: 513 SSWGTAEIFYDPAEIVTAWECMYNVRHEFAQSETYQYDLVDLTRQVLGDYAKYLHKQAVN 572

Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN-EEQEKQV 465
           A+  ND  G    S +FL L+ D D LL+    F +G W+  A+  A   +EQE+ V
Sbjct: 573 AFYRNDLKGFQTYSSKFLVLIRDEDKLLSTRKEFNVGTWINQARNAACTPQEQERFV 629


>gi|156046298|ref|XP_001589681.1| hypothetical protein SS1G_09403 [Sclerotinia sclerotiorum 1980]
 gi|154693798|gb|EDN93536.1| hypothetical protein SS1G_09403 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 795

 Score =  240 bits (612), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 153/465 (32%), Positives = 254/465 (54%), Gaps = 39/465 (8%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G WGG LP SW+++Q +LQKKI+ R+ ELG+ PVLPAF+G VP+AL+ + P+A I
Sbjct: 207 FGNIQGSWGGTLPLSWIEEQHLLQKKIVKRMVELGITPVLPAFTGFVPSALRRIAPNANI 266

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
              G+W ++        T+L   TDPLF  +   F+  Q + YG  +HIY  D ++EN P
Sbjct: 267 INGGDWGNIFPVEYSNDTFLY-PTDPLFTTLQHKFLSFQSEYYGNVTHIYTLDQYNENNP 325

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-L 177
                 Y+ ++    Y  +QS D  AVW++QGWLF S   FW   +++A +  VP  + +
Sbjct: 326 ASGDLSYLRNVSRGTYESLQSFDPCAVWMLQGWLFYSLSSFWTQDRIEAYIGGVPKNESM 385

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTT 236
           ++LDLF+E  P W  +  +YG P+IWC L ++ G + +YG + +I    +EA R SEN  
Sbjct: 386 LILDLFSESFPQWERTHYYYGKPWIWCQLRDYGGTLGLYGQIYNITNSLIEAFRESEN-- 443

Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDV----KAWINQ-YSVRRYGRSVPA-IQD 290
           MVGVG +MEG   N ++Y+L+ + A+  + +D     K+W+ + Y ++   + +P  I +
Sbjct: 444 MVGVGNTMEGQGGNGLMYELLLDQAWNIDPIDTEDYFKSWVRKRYHIKGAKKRLPGEIYE 503

Query: 291 AWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS 350
           AW++L  T YN T+     +  V  +  ++ P+I        +N+G+          ++S
Sbjct: 504 AWDILRRTAYNNTNLTLADS--VPKSLHELQPNIT-------ENHGR--------LGQSS 546

Query: 351 SYDHPHLWYSTSEVIRALELFI---ASGNELSASNTYRYDLIDLTRQALAKYANELFLNI 407
           + D     Y   ++ RA EL      S  EL     +++D++D+TRQ LA+     ++ +
Sbjct: 547 TIDL----YDPDDLFRAWELLYNASVSVPELWEDKGWKFDMVDITRQVLAERFKLEYVEL 602

Query: 408 IEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 452
           IE Y+      +       + ++E +D +L+    F L  W+ +A
Sbjct: 603 IEKYK--KGADISCDGDILIGILESLDDVLSASPHFRLDTWVNAA 645


>gi|154489986|ref|ZP_02030247.1| hypothetical protein PARMER_00215 [Parabacteroides merdae ATCC
           43184]
 gi|423722990|ref|ZP_17697143.1| hypothetical protein HMPREF1078_01203 [Parabacteroides merdae
           CL09T00C40]
 gi|154089428|gb|EDN88472.1| Alpha-N-acetylglucosaminidase (NAGLU) [Parabacteroides merdae ATCC
           43184]
 gi|409241820|gb|EKN34587.1| hypothetical protein HMPREF1078_01203 [Parabacteroides merdae
           CL09T00C40]
          Length = 718

 Score =  239 bits (611), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 158/476 (33%), Positives = 243/476 (51%), Gaps = 58/476 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ+ LQK+I+ R+ E G+ PV P +SG VP   +       ++
Sbjct: 187 MNNLEGWGGPNPDSWYKQQIALQKRIVKRMREYGIEPVFPGYSGMVPHNAKEKL-GLNVS 245

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W   +          L  TDP F EI   + ++  K YG+  + Y+ D F E  + 
Sbjct: 246 DPGLWNGYRRPA------FLQPTDPRFEEIASLYYKEMNKLYGKADY-YSMDPFHEGGSV 298

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             VD    + + G AI   M+  +  AVW+ Q W    +P    PQM   + ++  G L+
Sbjct: 299 AGVD----LDAAGKAIMQAMKKNNPKAVWVAQAW--QANP---RPQM---IGNLEAGDLI 346

Query: 179 VLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
           VLDLFAE +P W         K  +G   +I+CML N+ GN+ ++G L  +     +A+ 
Sbjct: 347 VLDLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLNYGGNVGLHGKLKHVIDEFYKAKE 406

Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
           S    T+ GVGM+MEG E NPV+++L++E+ +  ++ D   W+ +Y+V RYG+S P +QD
Sbjct: 407 SPFGKTLKGVGMTMEGSENNPVMFELLTELPWCPQRFDKDQWLREYTVARYGKSNPTVQD 466

Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
           AW +L +++YNC D  T +  +  V  A P         TE  YQ               
Sbjct: 467 AWILLSNSIYNCPDANTQQGTHESVFCARP---------TEHPYQ--------------- 502

Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
            SS+     +Y  ++VIRA  + ++  +E   +N + YDL+D+ RQA+A+    L   ++
Sbjct: 503 VSSWSEMKDYYDPNDVIRAAAMMVSVADEFKGNNNFEYDLVDIVRQAIAE-KGRLTEKVV 561

Query: 409 E-AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
           E A+   D       S RFL L+   D LLA    F +G W+  A+ L    E+++
Sbjct: 562 EAAFAAGDKKLYKDASDRFLRLILLQDELLATRPEFKVGTWIARARSLGGTPEEKE 617


>gi|358378969|gb|EHK16650.1| glycoside hydrolase family 89 protein [Trichoderma virens Gv29-8]
          Length = 748

 Score =  239 bits (609), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 142/462 (30%), Positives = 253/462 (54%), Gaps = 35/462 (7%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G WGG +P+SW+D Q  LQ KIL R+ ELG+ P+LPAF G VP  +  VFP   +
Sbjct: 201 FGNIQGSWGGSMPRSWVDSQFDLQLKILDRMEELGITPILPAFPGFVPRNISRVFPDISL 260

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
           +    W +  ++        ++  DP F ++ + FI +Q + YG  ++ +  D F+EN P
Sbjct: 261 STSPIWSNFGTEL--SADIYINPFDPRFAQLQKLFISKQQELYGNVTNFWTLDQFNENQP 318

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGK-L 177
                 Y+ ++    +S +++ D +AVW+MQ WLFS D  FW   ++++ L  +P+   +
Sbjct: 319 LSGDLGYLQNVSHNTWSALKAADPEAVWVMQAWLFSSDSAFWTNDRIESFLGGIPVNSDM 378

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LDLFAE  P W  +  FYG P+IWC LH++ GN+ +YG ++++    ++A    + ++
Sbjct: 379 LLLDLFAESAPQWLRTNSFYGKPWIWCELHDYGGNMGLYGQIENVTINSMDA-VRNSGSL 437

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVPAIQDAWNVLY 296
           VG G++MEG E N ++YDL+ + A+  + +D + + + +   RYG ++V ++   W +L 
Sbjct: 438 VGFGLTMEGQEGNEIMYDLLLDQAWSPKPIDTETYFHDWVSTRYGTKNVKSLYTGWELLR 497

Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
            TV+N T+   +  +  I+   ++ PS   +  G+  ++G  ++                
Sbjct: 498 PTVFNNTNLTMNAVQKSIL---ELVPSTTGLL-GRVGHHGTTIT---------------- 537

Query: 357 LWYSTSEVIRA-LELFIASGNE--LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
             Y+ + ++ A  ELF A   +  L  +  Y+YDL+D TRQ L      L+ +++ AY  
Sbjct: 538 --YNPAVMVEAWTELFKAGLQDIKLFTNPAYQYDLVDWTRQVLVNSFEGLYKDLVAAYNS 595

Query: 414 NDAHGVFQLSR--RFLELVEDMDGLLACHDGFLLGPWLESAK 453
             +  V + SR  + + L+  +D +LA ++ F L PW+  A+
Sbjct: 596 AASSSVIK-SRGAKLIALLRTLDAVLATNEHFQLTPWINEAR 636


>gi|423248233|ref|ZP_17229249.1| hypothetical protein HMPREF1066_00259 [Bacteroides fragilis
           CL03T00C08]
 gi|423253182|ref|ZP_17234113.1| hypothetical protein HMPREF1067_00757 [Bacteroides fragilis
           CL03T12C07]
 gi|392657082|gb|EIY50719.1| hypothetical protein HMPREF1067_00757 [Bacteroides fragilis
           CL03T12C07]
 gi|392660340|gb|EIY53954.1| hypothetical protein HMPREF1066_00259 [Bacteroides fragilis
           CL03T00C08]
          Length = 732

 Score =  238 bits (608), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 153/474 (32%), Positives = 232/474 (48%), Gaps = 54/474 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGP+ Q ++D+Q  LQKK+L R+ E GM PVL  F G VP ++   FP+A I 
Sbjct: 198 MGNLEKFGGPVSQQFIDRQTKLQKKMLDRMREYGMEPVLQGFYGMVPNSMITKFPNADIR 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W + +          L  +DPLF ++   F E+Q K +G  S  Y  D F E  N+
Sbjct: 258 NAGKWITYQRPA------FLVPSDPLFAKVAEIFYEEQKKLFGE-SRYYGGDPFHEGGNS 310

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             ++    I+   + IY  M++ + +A+W++QGW  S +P        ALL  +  G+ +
Sbjct: 311 KGIN----ITEAASNIYKAMKTNNPNAIWVLQGW--SGNP------SVALLKGLKHGEAL 358

Query: 179 VLDLFAEVKPIWS--------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
           VLDL A  +P W             F    +IWC L NF G I MYG L S A G ++A 
Sbjct: 359 VLDLMACARPQWGGEPSSSFHREDGFLDHNWIWCALPNFGGRIGMYGKLQSYATGVIKAE 418

Query: 231 TSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQ 289
                  V G+G + EGI  NP+ YD++ +MA++ + +D+K+WI  Y+  RYG      +
Sbjct: 419 HHPKGKYVCGIGTTPEGIGTNPINYDMVYDMAWRTDSIDIKSWIANYTTYRYGSENSNAK 478

Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 349
            A   L  +VYNC   A         A P +    +S                       
Sbjct: 479 AAMLQLSTSVYNCPWAADGPQESYFCARPSLKIDYVS----------------------- 515

Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
            S+   HL+Y    V++ALE  + +  EL   +TYRYD++D+TRQ LA Y   +   I +
Sbjct: 516 -SWGTAHLYYQPINVLQALEHLLKAEKELGYIDTYRYDVVDITRQMLADYGKYIHKCISD 574

Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
           AY+  +       + +FL+++ D D LL+    FLLG ++  A     N  +++
Sbjct: 575 AYKEKNIKKFDLYTSKFLQMILDQDLLLSTRKEFLLGEYIRQADTCGSNPTEKR 628


>gi|218258436|ref|ZP_03474815.1| hypothetical protein PRABACTJOHN_00470 [Parabacteroides johnsonii
           DSM 18315]
 gi|423342591|ref|ZP_17320305.1| hypothetical protein HMPREF1077_01735 [Parabacteroides johnsonii
           CL02T12C29]
 gi|218225494|gb|EEC98144.1| hypothetical protein PRABACTJOHN_00470 [Parabacteroides johnsonii
           DSM 18315]
 gi|409217508|gb|EKN10484.1| hypothetical protein HMPREF1077_01735 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 718

 Score =  238 bits (607), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 156/475 (32%), Positives = 243/475 (51%), Gaps = 58/475 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ+ LQ++I+ R+ E G+ PV P +SG VP   +       ++
Sbjct: 187 MNNLEGWGGPNPDSWYKQQIALQQQIVKRMREYGIEPVFPGYSGMVPHNAKEKL-GLNVS 245

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W   +          L  TDP F EI   + ++  K YG+ ++ Y+ D F E  + 
Sbjct: 246 DPGLWNGYRRPA------FLQPTDPRFEEIASLYYKEMNKLYGKANY-YSMDPFHEGGSV 298

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             VD    + + G AI   M+  +  AVW+ Q W    +P    PQM   + ++  G L+
Sbjct: 299 AGVD----LDAAGKAIMQAMKKNNPKAVWVAQAW--QANP---RPQM---IGNLEAGDLI 346

Query: 179 VLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
            LDLFAE +P W         K  +G   +I+CML N+ GNI ++G +  +     +A+ 
Sbjct: 347 ALDLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLNYGGNIGLHGKMKHVIDEFYKAKE 406

Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
           S   TT+ GVGM+MEG E NPV+++L++E+ ++ ++ D   W+  Y+V RYG+S P +QD
Sbjct: 407 SPFGTTLKGVGMTMEGSENNPVMFELLTELPWRPQRFDKDQWLKAYTVARYGKSNPVVQD 466

Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
           AW +L +++YNC D  T +  +  V  A P         TE  YQ               
Sbjct: 467 AWILLSNSIYNCPDANTQQGTHESVFCARP---------TEHPYQ--------------- 502

Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
            SS+     +Y  ++VIRA  + ++  ++   +N + YDL+D+ RQA+A+    L   ++
Sbjct: 503 VSSWSEMKDYYDPNDVIRAAAMMVSVSDQFKGNNNFEYDLVDIVRQAIAE-KGRLTEKVV 561

Query: 409 E-AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
           E A+   D       S RFL L+   D LLA    F +G W+  A+ L    E++
Sbjct: 562 EAAFAAGDKKLYKDASDRFLRLILLQDELLATRPEFKVGTWIARARSLGNTSEEK 616


>gi|423269877|ref|ZP_17248849.1| hypothetical protein HMPREF1079_01931 [Bacteroides fragilis
           CL05T00C42]
 gi|423272668|ref|ZP_17251615.1| hypothetical protein HMPREF1080_00268 [Bacteroides fragilis
           CL05T12C13]
 gi|392700723|gb|EIY93885.1| hypothetical protein HMPREF1079_01931 [Bacteroides fragilis
           CL05T00C42]
 gi|392708745|gb|EIZ01850.1| hypothetical protein HMPREF1080_00268 [Bacteroides fragilis
           CL05T12C13]
          Length = 732

 Score =  238 bits (607), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 153/474 (32%), Positives = 232/474 (48%), Gaps = 54/474 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GGP+ Q ++D+Q  LQKK+L R+ E GM PVL  F G VP ++   FP+A I 
Sbjct: 198 MGNLEKFGGPVSQQFIDRQTKLQKKMLDRMREYGMEPVLQGFYGMVPNSMITKFPNADIR 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W + +          L  +DPLF ++   F E+Q K +G  S  Y  D F E  N+
Sbjct: 258 DAGKWITYQRPA------FLVPSDPLFAKVAEIFYEEQKKLFGE-SRYYGGDPFHEGGNS 310

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             ++    I+   + IY  M++ + +A+W++QGW  S +P        ALL  +  G+ +
Sbjct: 311 KGIN----ITEAASNIYKAMKTNNPNAIWVLQGW--SGNP------SVALLKGLKHGEAL 358

Query: 179 VLDLFAEVKPIWS--------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
           VLDL A  +P W             F    +IWC L NF G I MYG L S A G ++A 
Sbjct: 359 VLDLMACARPQWGGEPSSSFHREDGFLDHNWIWCALPNFGGRIGMYGKLQSYATGVIKAE 418

Query: 231 TSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQ 289
                  V G+G + EGI  NP+ YD++ +MA++ + +D+K+WI  Y+  RYG      +
Sbjct: 419 HHPKGKYVCGIGTTPEGIGTNPINYDMVYDMAWRTDSIDIKSWIANYTTYRYGSENSNAK 478

Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 349
            A   L  +VYNC   A         A P +    +S                       
Sbjct: 479 AAMLQLSTSVYNCPWAADGPQESYFCARPSLKIDYVS----------------------- 515

Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
            S+   HL+Y    V++ALE  + +  EL   +TYRYD++D+TRQ LA Y   +   I +
Sbjct: 516 -SWGTAHLYYQPINVLQALEHLLKAEKELGYIDTYRYDVVDITRQMLADYGKYIHKCISD 574

Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
           AY+  +       + +FL+++ D D LL+    FLLG ++  A     N  +++
Sbjct: 575 AYKEKNIKKFDLYTSKFLQMILDQDLLLSTRKEFLLGEYIRQADTCGSNPTEKR 628


>gi|380697007|ref|ZP_09861866.1| alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
          Length = 703

 Score =  238 bits (607), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 150/472 (31%), Positives = 234/472 (49%), Gaps = 39/472 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLPQSWL  Q  LQK IL R     M P+LPAF+G+VPA L+ ++P AKI 
Sbjct: 166 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 225

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W       R   ++ +D  D L+  I R F+E+Q K YG T+HIY  D F+E   P
Sbjct: 226 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 281

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
             + E++S++   IY  +Q  DS A WL   W+F +    W  P++K+ LN+VP  KL++
Sbjct: 282 NWNEEFLSNVSDKIYKSIQDVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 341

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + +   IW  ++Q+YG PYIWC L NF GN  + G L+ + F            + G
Sbjct: 342 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 401

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G+++EG++ NP++Y+ + E A+++  +    WI  ++  R G     I  AW  LY  +
Sbjct: 402 LGVTLEGLDVNPLMYEFVFERAWEN-SMPAHQWIANWAQCRGGNVDNHIVKAWKQLYEKI 460

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y  T  A      ++ A P ++      T   Y                   Y +  LW 
Sbjct: 461 Y--TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWE 500

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              E+++A  ++          + Y +D+I++ RQ L     +      + Y+       
Sbjct: 501 IWKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKKLEET 551

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK----QVRC 467
               +R  +L+ D+D LL C   F +G W++ AK  A NE+++K      RC
Sbjct: 552 KVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDAKDFAVNEQEQKYYEENARC 603


>gi|392566857|gb|EIW60032.1| alpha-N-acetylglucosaminidase [Trametes versicolor FP-101664 SS1]
          Length = 747

 Score =  238 bits (606), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 156/467 (33%), Positives = 251/467 (53%), Gaps = 34/467 (7%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G WGG LP +W+D Q  LQK++L R+ ELGM PV+P+F+G VP AL  + P+A I
Sbjct: 193 FGNIQGSWGGELPTAWVDDQFALQKRLLPRMVELGMTPVMPSFTGFVPRALAALHPNASI 252

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGR-TSHIYNCDTFDENT 118
                W    +         L+  DPLF  + ++FI +Q   YG   SH+Y  D ++EN 
Sbjct: 253 VTGSQWSGFPTSL--TNDSFLEPFDPLFATLQQSFIAKQQAAYGADISHVYTLDQYNEND 310

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLG-K 176
           P     +Y+ ++ A  ++ +++ D  AVWLMQGWLF  D  FW   ++ A L  VP    
Sbjct: 311 PFSGDLDYLRNVSAGTFASLRAADPAAVWLMQGWLFFSDAVFWTDDRVAAYLGGVPGNDS 370

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
           ++VLDL++E +P W+ +  + G  ++WC LH++ GNI M G LD +   P+ A +S  ++
Sbjct: 371 MIVLDLYSEAQPQWNRTASYSGKQWVWCELHDYGGNIGMEGNLDVLTHAPLTALSSPGSS 430

Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVP-AIQDAWNV 294
           M GVG++MEG E N +VY ++ + A+    ++  ++++ +  RRY  + +P A QDAW +
Sbjct: 431 MKGVGLTMEGQEGNEIVYGVLLDQAWSATSLNTSSYVSSWVSRRYPVKPLPKAAQDAWRI 490

Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
           L  TVYN  D  T   +  I    ++ P++  +T     +   P S           YD 
Sbjct: 491 LSTTVYNNQDPNT---QATIKGIYELAPALTGMTNRIGHH---PTSIP---------YD- 534

Query: 355 PHLWYSTSEVIRALELFI---ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 411
                + + ++ AL+L +   A    LSA   + YD++D+ RQ L+     L+  +I+ Y
Sbjct: 535 -----TDATMLSALKLLLEARAQHPTLSAVPEFVYDVVDVARQLLSNRFIGLYDTLIQTY 589

Query: 412 QLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
                 A  V    +  L L+ D+D LL+ ++ FLL  W+  A++ A
Sbjct: 590 NSTSSTAQSVSAAGQPLLALLTDLDALLSTNEHFLLSSWIADARKWA 636


>gi|423223006|ref|ZP_17209475.1| hypothetical protein HMPREF1062_01661 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392640582|gb|EIY34381.1| hypothetical protein HMPREF1062_01661 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 755

 Score =  237 bits (605), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 153/462 (33%), Positives = 221/462 (47%), Gaps = 46/462 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  +GGPLP+S +D+ ++L KKIL R  ELGM P+   FSG VP  LQ  +P AKI+
Sbjct: 195 MQNIQSYGGPLPKSVIDKHVILGKKILARQLELGMQPIQQGFSGYVPRELQAKYPQAKIS 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
               W           T  LD TDPLF E+G AF+E+Q K +G +  +Y  D F E+ PP
Sbjct: 255 MKRKWCGFDG------TAQLDPTDPLFHEMGLAFLEEQDKLFG-SYGVYAADPFHESAPP 307

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +D+PEY++ +G  I+   Q+ D+ A+W+MQ W    D          ++ +VP   L++L
Sbjct: 308 IDTPEYLTGVGQTIHKLFQTFDAGALWVMQAWSMRED----------IVKAVPKESLLIL 357

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL        + +   +G P I   LHNF G I M+G L  +A    +   +    + G 
Sbjct: 358 DLNGSK----TAANGGWGYPVIAGNLHNFGGRINMHGDLALLASNQYQKAKARYPNVCGS 413

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G+ ME IEQNPV Y+L  EM    + + ++AW+  Y+ RRYG    A   AW  L    Y
Sbjct: 414 GLFMEAIEQNPVYYELAFEMPNHADSIPLQAWLAAYAERRYGAKSAAAGKAWMYLLEGPY 473

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
               G     R  IVA     P++     G     G P                    Y 
Sbjct: 474 R--QGTNGTERSSIVA---ARPALNVKKSGPNAGLGIP--------------------YE 508

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
              VIRA    +   ++L+ S  YR+D++D+ RQ +      +     EA+   D     
Sbjct: 509 PMLVIRAQSQLLKDADKLAFSKPYRFDIVDVQRQMMTNLGQLVHKKAAEAFASKDKAAFV 568

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
             S RFLEL+ DMD LL     +    WL  A+   + +E++
Sbjct: 569 LHSGRFLELLRDMDELLYTRSEYSFDRWLTEARSWGETKEEK 610


>gi|329963073|ref|ZP_08300853.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
 gi|328529114|gb|EGF56044.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
          Length = 717

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 154/479 (32%), Positives = 250/479 (52%), Gaps = 64/479 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  Q+  LQK+IL R+ E G+ PVLP +SG VP   +         
Sbjct: 187 MNNLEGWGGPNPDSWYTQREALQKQILKRMREYGIQPVLPGYSGMVPHNAKE-------- 238

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  TDP F EI   + ++  + YG+ +  Y+ D F E
Sbjct: 239 RLG--LNV-SDPGLWCGYPRPAFLQPTDPRFGEIADLYYKEMTRLYGK-ADFYSMDPFHE 294

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             +   VD    +++ G AI+  M+  +  AVW+ Q W  +       P+ K ++ ++P 
Sbjct: 295 GGSIAGVD----LNAAGQAIWGAMKKVNPKAVWVAQAWQAN-------PRQK-MIENIPQ 342

Query: 175 GKLVVLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           G L+VLDLF+E +P W         K+ +G   +++CML N+ GN+ ++G +  +     
Sbjct: 343 GDLIVLDLFSESRPQWGDPASTWYRKEGFGKHDWLYCMLLNYGGNVGLHGKMRHVIDEFY 402

Query: 228 EARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVP 286
           +A+TS    T+ GVGM+MEG E N V+++L+ E+ ++  + +   W+  Y+  RYG++  
Sbjct: 403 KAKTSPFGKTLKGVGMTMEGSENNSVMFELLCELPWRPAQFEKDEWLKNYTAARYGKADA 462

Query: 287 AIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
            +Q AW +L +++YNC D  T +  +  V  A P +D          YQ           
Sbjct: 463 TVQQAWLLLSNSIYNCPDANTQQGTHESVFCARPGMDV---------YQ----------- 502

Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
                SS+     +Y   EVIRA  + +++ +    +N + YDL+D+ RQA+A+    ++
Sbjct: 503 ----VSSWSEMVKYYEPEEVIRAAGILLSAADRFKGNNNFEYDLVDIVRQAVAEKGRLVY 558

Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
             +I+A +  +       S+RFL L+   D LLA    F +G W+E A+ L   +E++K
Sbjct: 559 PIMIDALKAGEKELFAAASQRFLNLILLQDRLLATRPEFKVGTWIEKARNLGTTQEEKK 617


>gi|393783261|ref|ZP_10371436.1| hypothetical protein HMPREF1071_02304 [Bacteroides salyersiae
           CL02T12C01]
 gi|392669540|gb|EIY63028.1| hypothetical protein HMPREF1071_02304 [Bacteroides salyersiae
           CL02T12C01]
          Length = 724

 Score =  236 bits (603), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 138/461 (29%), Positives = 228/461 (49%), Gaps = 39/461 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+ GW GPLP  WLD Q+ LQKKIL R  EL M PVLPAF+G+VP AL+ +FP A I 
Sbjct: 198 MANIDGWNGPLPMHWLDSQVELQKKILTRERELNMKPVLPAFAGHVPGALKRIFPEANIQ 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W     + R    + L+  + LF  I + +I++Q + +G T HIY  D F+E  PP
Sbjct: 258 NLGKWAGFAEEYR---CHFLNPEEALFATIQKQYIKEQTRLFG-TDHIYGVDPFNEVDPP 313

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              PEY+S + A +Y  + + D  A W+   W+F +D   W  P++KA+L  VP GK+V+
Sbjct: 314 SWEPEYLSKVSADMYHTLTAADPKAEWMQMTWMFYFDRKDWTAPRVKAMLTGVPQGKMVL 373

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD   E   +W T++ F+G PYIWC L NF GN  + G +            +  +   G
Sbjct: 374 LDYHCENVELWKTTEHFHGQPYIWCYLGNFGGNTTLTGNVKESGARLDNTLINGGSNFKG 433

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG++     Y+ + E A+     D ++W+N  + R  G +   +++AW++L++ V
Sbjct: 434 IGSTLEGLDVMQFPYEYIFEKAWTL-NTDDRSWLNALADRHTGVTSEPVREAWDILFNQV 492

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y             +   P++ P +            KP ++ ++              Y
Sbjct: 493 YVQVP-------RTLAVLPNLRPVM-----------NKPNNRTSIN-------------Y 521

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
             + +++A +  + + +     +  R D+I + RQ L  Y   +  +    Y+  D   +
Sbjct: 522 PNTALLQAWQKLLQAPD--CNRDALRLDIITVGRQLLGNYFLTVKDDFDRMYEAKDLPAL 579

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
              +    E++ D++ L A H    L  W+  A++     E
Sbjct: 580 KARAAEMREILNDLERLNAFHSRCSLDKWISDARKYGNTPE 620


>gi|224537227|ref|ZP_03677766.1| hypothetical protein BACCELL_02104 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521150|gb|EEF90255.1| hypothetical protein BACCELL_02104 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 755

 Score =  236 bits (603), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 153/462 (33%), Positives = 221/462 (47%), Gaps = 46/462 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  +GGPLP+S +D+ ++L KKIL R  ELGM P+   FSG VP  LQ  +P AKI+
Sbjct: 195 MQNIQSYGGPLPKSVIDKHVILGKKILARQLELGMQPIQQGFSGYVPRELQAKYPQAKIS 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
               W           T  LD TDPLF E+G AF+E+Q K +G +  +Y  D F E+ PP
Sbjct: 255 MKRKWCGFDG------TAQLDPTDPLFHEMGLAFLEEQDKLFG-SYGVYAADPFHESAPP 307

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +D+PEY++ +G  I+   Q+ D+ A+W+MQ W    D          ++ +VP   L++L
Sbjct: 308 IDTPEYLTGVGQTIHKLFQTFDAGALWVMQAWSMRED----------IVKAVPKESLLIL 357

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL        + +   +G P I   LHNF G I M+G L  +A    +   +    + G 
Sbjct: 358 DLNGSK----TAANGGWGYPVIAGNLHNFGGRINMHGDLALLASNQYQKAKARYPNVCGS 413

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G+ ME IEQNPV Y+L  EM    + + ++AW+  Y+ RRYG    A   AW  L    Y
Sbjct: 414 GLFMEAIEQNPVYYELAFEMPNHADSIPLQAWLAAYAERRYGAKSAAAGKAWMYLLEGPY 473

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
               G     R  IVA     P++     G     G P                    Y 
Sbjct: 474 R--RGTNGTERSSIVA---ARPALNVKKSGPNAGLGIP--------------------YE 508

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
              VIRA    +   ++L+ S  YR+D++D+ RQ +      +     EA+   D     
Sbjct: 509 PMLVIRAQSQLLKDADKLAFSKPYRFDIVDVQRQMMTNLGQLVHKKAAEAFASKDKAAFA 568

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
             S RFLEL+ DMD LL     +    WL  A+   + +E++
Sbjct: 569 LHSGRFLELLRDMDELLYTRSEYSFDRWLTEARSWGETKEEK 610


>gi|319900259|ref|YP_004159987.1| alpha-N-acetylglucosaminidase [Bacteroides helcogenes P 36-108]
 gi|319415290|gb|ADV42401.1| Alpha-N-acetylglucosaminidase [Bacteroides helcogenes P 36-108]
          Length = 718

 Score =  236 bits (603), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 151/478 (31%), Positives = 238/478 (49%), Gaps = 64/478 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P  W   Q  LQK+IL R+ E G+ PVLP +SG +PA  +         
Sbjct: 187 MNNLEGWGGPNPDQWYSHQEQLQKRILKRMREYGIEPVLPGYSGMIPANAKE-------- 238

Query: 61  QLGNWFSVKSDPRWCCTY---LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
           +LG    V    +WC       L  +D  F  I R + ++  + YG+ ++ Y+ D F E 
Sbjct: 239 KLG--LDVADPGKWCGYRRPAFLQPSDKNFRRIARLYYKEMTRLYGKANY-YSMDPFHEG 295

Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW-LFSYDPFWRPPQMKALLNSVPL 174
            NT  VD    + + G +I   M+  +  AVW+ Q W    YD          ++ ++P 
Sbjct: 296 GNTKGVD----LDAAGKSIRDAMKEANPQAVWVAQAWGACPYD---------NMIKNLPE 342

Query: 175 GKLVVLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           G ++VLDL++E +P W         KQ +G   +I+CML NF GN+ +YG ++ +     
Sbjct: 343 GDMIVLDLYSESRPQWGDPASAWYRKQGFGRHGWIYCMLLNFGGNVGLYGKMEHVIDEFY 402

Query: 228 EARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVP 286
           +AR S    T+ GVG++MEG E NPV+Y+L+ E+ +   ++    W+  Y   RYG++ P
Sbjct: 403 KARESAFGGTLQGVGLTMEGSENNPVMYELLCELPWHGRRISKDQWLKSYLKARYGKTTP 462

Query: 287 AIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
              +AW  L +T+YN  + +T +  +  V  A P ++          YQ           
Sbjct: 463 QTVEAWLKLSNTIYNSPNASTQQGTHESVFCARPSLEA---------YQ----------- 502

Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
                SS+     +Y+ +++IRA    I +  E   +N + YDLID+ RQA+A+    ++
Sbjct: 503 ----VSSWSEMKDYYAPADIIRAAGKMIEAAEEFRGNNNFEYDLIDVVRQAVAEKGRLVY 558

Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
             ++ AY+  D       S RFLEL+E  D LL     F LG W   A+ + + + Q+
Sbjct: 559 PIVVSAYKAADKQLFEAASARFLELIELQDKLLGTRREFRLGTWTNYARNMGETDAQK 616


>gi|400595379|gb|EJP63180.1| alpha-N-acetylglucosaminidase [Beauveria bassiana ARSEF 2860]
          Length = 761

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 146/467 (31%), Positives = 241/467 (51%), Gaps = 29/467 (6%)

Query: 1   MSNLHG-WGGP--LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSA 57
             N+HG WGG   L   W++QQ  LQKKI+ R+ ELG+ PVLP F G VPAAL+ + P  
Sbjct: 205 FGNIHGTWGGEGRLSAEWINQQFALQKKIVARMVELGITPVLPGFPGFVPAALKKLRPDV 264

Query: 58  KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
            I +   W  V  +     T  L+ TD  + E+   FI+ Q+KE+G  +++Y  D F+E 
Sbjct: 265 NIAEAPVWVDVPRNN--TATAFLNPTDKTYAELQSLFIKNQIKEFGNVTNVYTVDQFNEI 322

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVP-LG 175
            P     +YI+ + ++ Y G+ + +  A+WLMQGWLF S   FW   ++ A L   P   
Sbjct: 323 NPSSGDTKYITDVSSSTYKGITAANPAAIWLMQGWLFYSSQSFWTQQRVDAYLAGPPGQD 382

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
            +++LDLF+E +P W  ++ ++G P+IWC LH+F GN  ++G + ++    V+A   E+ 
Sbjct: 383 DMIILDLFSESQPQWQRTRSYFGRPWIWCELHDFGGNQALHGKITNVTQNSVQA-LKESG 441

Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD---AW 292
           ++VG G++ EG E N VVYD++ + A++   +D   +   ++  RY  +    +D   AW
Sbjct: 442 SIVGYGLTPEGYEGNEVVYDILLDQAWEGSPIDTANYFRAWARNRYSAAGIIPEDVFTAW 501

Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 352
             L    Y+  D A      V V+   + PS+  +   +  +Y  P + +   K   + +
Sbjct: 502 EQLRQHAYDVQDNAIPS---VGVSVYQLFPSLKGLVN-RTGHYPPPTALQYDPKVMKNIW 557

Query: 353 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
              HL+Y+++         I S   L     +  D +D+TRQ L     +++ +++  +Q
Sbjct: 558 ---HLFYNST---------IDSPGLLQIP-AFHLDFVDVTRQVLGNAFIDIYTDLVNQFQ 604

Query: 413 LNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
                 V Q L    L  +ED+D  L  ++ F    WL SA+   Q+
Sbjct: 605 ATANATVIQDLGNSMLSFIEDLDMALNTNEHFTFKKWLNSAESWGQS 651


>gi|383122982|ref|ZP_09943669.1| hypothetical protein BSIG_0276 [Bacteroides sp. 1_1_6]
 gi|251841923|gb|EES70003.1| hypothetical protein BSIG_0276 [Bacteroides sp. 1_1_6]
          Length = 730

 Score =  235 bits (600), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 142/473 (30%), Positives = 239/473 (50%), Gaps = 41/473 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLP+SWL+QQ VLQK+IL R  +  M PVLPAFSG+VP  L+ ++P AKI 
Sbjct: 197 MSNVDYWQSPLPKSWLEQQEVLQKQILKRERDFNMTPVLPAFSGHVPKELKAIYPDAKIH 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W    S  R   ++ ++  D LF  I + ++E+Q   YG T HIY  D F+E   P
Sbjct: 257 EMSQWGGYDSKYR---SHFIEPMDSLFNIIQKMYLEEQTAIYG-TDHIYGIDPFNEVDSP 312

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
             + ++++ +   IY  +   D++A WL   W+F +D   W  P++++ L +VP  KL++
Sbjct: 313 NWNEDFLAKVSKKIYESIYQVDAEAKWLQMTWMFYHDQKKWTQPRIRSFLEAVPDDKLIL 372

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + +   IW  ++ +YG PY+WC L NF GN  M G LD +     +        + G
Sbjct: 373 LDYYCDSTEIWRNTEMYYGKPYMWCYLGNFGGNSMMVGNLDDVDVKIEKLFVEGGENVYG 432

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG + NP +Y+ + + A+ +  +    WI  ++  R G     I  AW+ L+  +
Sbjct: 433 LGATLEGFDVNPFMYEFVFDQAWDY-PLTTDQWIQNWAKCRGGNQDRHILKAWDSLHKKI 491

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y     AT     ++ A P +                  V  ++       +Y++  LW 
Sbjct: 492 YK--KYATAGQAVLMNARPML------------------VGTDSWNTYPDITYNNRDLWD 531

Query: 360 STSEVIRALELFIASGNELSASNT-YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
             +E+++A  +          +NT YR+D+I++ RQ L    +    +  + Y   D  G
Sbjct: 532 IWTEMLKASHI----------NNTGYRFDVINVGRQVLGNLFSSFRDHFTQCYSEKDIDG 581

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ----EKQVRC 467
           + + + +   L+ D D LL+C   F +G W++ A+   + E +    E+  RC
Sbjct: 582 MKKWADQMDSLLIDTDRLLSCETNFSIGKWIDDARSFGKTEAEKEYYEENARC 634


>gi|383120707|ref|ZP_09941431.1| hypothetical protein BSIG_2292 [Bacteroides sp. 1_1_6]
 gi|382984934|gb|EES68331.2| hypothetical protein BSIG_2292 [Bacteroides sp. 1_1_6]
          Length = 736

 Score =  235 bits (600), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 149/477 (31%), Positives = 237/477 (49%), Gaps = 59/477 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ  LQKKI+ R+ ELG+ PV P ++G VP  +          
Sbjct: 197 MNNLEGWGGPNPDSWYRQQEALQKKIIARMRELGIEPVFPGYAGMVPRNIGE-------- 248

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
           +LG  + +    +WC       L   D  F      + E+  K YG+  + Y+ D F E 
Sbjct: 249 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 305

Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
            NT  VD    ++  G +I S M+  + +AVW+MQ W        +    +A+++++  G
Sbjct: 306 GNTEGVD----LAKAGTSIMSAMKKANPEAVWVMQAW--------QANPREAMVSTLDSG 353

Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
            L+VLDL++E  P       +W   K F    +++CML NF GN+ ++G ++ +  G   
Sbjct: 354 DLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMEQLVNGYYN 413

Query: 229 ARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
           A    N  T+ GVG + EGIE NPV+++L+ E+ ++ E+    AW+  Y   RYG  + P
Sbjct: 414 ACAHVNGKTLRGVGATPEGIENNPVMFELLYELPWREERFAPDAWLQAYLKARYGNDLSP 473

Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
            + +AW  L HTVYN       +             S++    G +Q+            
Sbjct: 474 EVAEAWRALEHTVYNAPKNYQGEG---------TVESLLCARPGFHQD------------ 512

Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
             TS++ +  L+YS     +A  L ++  ++   +N + YDL+D+ RQ+LA   N L   
Sbjct: 513 -RTSTWGYAKLFYSPDSTAKAARLLLSVADQYKGNNNFEYDLVDVVRQSLADKGNVLLEE 571

Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
           I ++Y   D     + S++FLEL+   D LL+    F +  WL +A+ L   EE++K
Sbjct: 572 ISQSYDRKDKDSFGKQSQQFLELILAQDSLLSTRKEFSVSSWLNAARSLGTTEEEKK 628


>gi|29349767|ref|NP_813270.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29341678|gb|AAO79464.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 744

 Score =  235 bits (600), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 149/477 (31%), Positives = 237/477 (49%), Gaps = 59/477 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ  LQKKI+ R+ ELG+ PV P ++G VP  +          
Sbjct: 205 MNNLEGWGGPNPDSWYRQQEALQKKIIARMRELGIEPVFPGYAGMVPRNIGE-------- 256

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
           +LG  + +    +WC       L   D  F      + E+  K YG+  + Y+ D F E 
Sbjct: 257 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 313

Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
            NT  VD    ++  G +I S M+  + +AVW+MQ W        +    +A+++++  G
Sbjct: 314 GNTEGVD----LAKAGTSIMSAMKKANPEAVWVMQAW--------QANPREAMVSTLDSG 361

Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
            L+VLDL++E  P       +W   K F    +++CML NF GN+ ++G ++ +  G   
Sbjct: 362 DLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMEQLVNGYYN 421

Query: 229 ARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
           A    N  T+ GVG + EGIE NPV+++L+ E+ ++ E+    AW+  Y   RYG  + P
Sbjct: 422 ACAHVNGKTLRGVGATPEGIENNPVMFELLYELPWREERFAPDAWLQAYLKARYGNDLSP 481

Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
            + +AW  L HTVYN       +             S++    G +Q+            
Sbjct: 482 EVAEAWRALEHTVYNAPKNYQGEG---------TVESLLCARPGFHQD------------ 520

Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
             TS++ +  L+YS     +A  L ++  ++   +N + YDL+D+ RQ+LA   N L   
Sbjct: 521 -RTSTWGYAKLFYSPDSTAKAARLLLSVADQYKGNNNFEYDLVDVVRQSLADKGNVLLEE 579

Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
           I ++Y   D     + S++FLEL+   D LL+    F +  WL +A+ L   EE++K
Sbjct: 580 ISQSYDRKDKDSFGKQSQQFLELILAQDSLLSTRKEFSVSSWLNAARSLGTTEEEKK 636


>gi|29345848|ref|NP_809351.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29337741|gb|AAO75545.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 730

 Score =  235 bits (600), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 142/473 (30%), Positives = 239/473 (50%), Gaps = 41/473 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLP+SWL+QQ VLQK+IL R  +  M PVLPAFSG+VP  L+ ++P AKI 
Sbjct: 197 MSNVDYWQSPLPKSWLEQQEVLQKQILKRERDFNMTPVLPAFSGHVPKELKAIYPDAKIH 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W    S  R   ++ ++  D LF  I + ++E+Q   YG T HIY  D F+E   P
Sbjct: 257 EMSQWGGYDSKYR---SHFIEPMDSLFNIIQKMYLEEQTAIYG-TDHIYGIDPFNEVDSP 312

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
             + ++++ +   IY  +   D++A WL   W+F +D   W  P++++ L +VP  KL++
Sbjct: 313 NWNEDFLAKVSKKIYESIYQVDAEAKWLQMTWMFYHDQKKWTQPRIRSFLEAVPDDKLIL 372

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + +   IW  ++ +YG PY+WC L NF GN  M G LD +     +        + G
Sbjct: 373 LDYYCDSTEIWRNTEMYYGKPYMWCYLGNFGGNSMMVGNLDDVDVKIEKLFVEGGENVYG 432

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG + NP +Y+ + + A+ +  +    WI  ++  R G     I  AW+ L+  +
Sbjct: 433 LGATLEGFDVNPFMYEFVFDQAWDY-PLTTDQWIQNWAKCRGGNQDRHILKAWDSLHKKI 491

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y     AT     ++ A P +                  V  ++       +Y++  LW 
Sbjct: 492 YK--KYATAGQAVLMNARPML------------------VGTDSWNTYPDITYNNRDLWD 531

Query: 360 STSEVIRALELFIASGNELSASNT-YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
             +E+++A  +          +NT YR+D+I++ RQ L    +    +  + Y   D  G
Sbjct: 532 IWTEMLKASHI----------NNTGYRFDVINVGRQVLGNLFSSFRDHFTQCYSEKDIDG 581

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ----EKQVRC 467
           + + + +   L+ D D LL+C   F +G W++ A+   + E +    E+  RC
Sbjct: 582 MKKWADQMDALLIDTDRLLSCETNFSIGKWIDDARSFGKTEAEKEYYEENARC 634


>gi|374385779|ref|ZP_09643282.1| hypothetical protein HMPREF9449_01668 [Odoribacter laneus YIT
           12061]
 gi|373225481|gb|EHP47815.1| hypothetical protein HMPREF9449_01668 [Odoribacter laneus YIT
           12061]
          Length = 715

 Score =  235 bits (599), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 153/477 (32%), Positives = 231/477 (48%), Gaps = 62/477 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P+SW ++Q+ LQ +IL R+ E G+ PV P ++G        + P     
Sbjct: 186 MNNLEGWGGPNPESWYERQMQLQHRILNRMREYGIEPVFPGYAG--------MLPHNASE 237

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG    VK DP   C Y     L   +P F  I   +  +  K +G+ +  Y  D F E
Sbjct: 238 KLG--IEVK-DPGLWCGYQRPAFLYPENPAFKRIAGLYYMEMEKRFGK-AKFYGMDPFHE 293

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             N   +D    +++   ++   M++ + +AVW+MQ W  +       P+ + ++ ++  
Sbjct: 294 GGNVQGID----LAAAAQSVLQAMKTANPEAVWVMQAWQAN-------PRHE-MITALQP 341

Query: 175 GKLVVLDLFAEVKPIWS-------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           G +++LDL +E +P+W          K F G  +++CML NF GN+ MYG +D +  G  
Sbjct: 342 GNVLILDLSSENRPMWGDKESVWYREKGFEGQDWLYCMLLNFGGNVGMYGRMDRVINGFY 401

Query: 228 EARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVP 286
            A    N  ++ GVG +MEGIE NPV+Y+L+ E+ ++      + W+  Y   RYG+  P
Sbjct: 402 AAVQHPNGASLRGVGKTMEGIENNPVMYELLLELPWRKIPFTKEEWLKGYVKARYGKDDP 461

Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
            +Q AW +L    YNC          V  A P                        A   
Sbjct: 462 RLQQAWQILGKAAYNCPVVQEGTTESVFCARP------------------------AEEI 497

Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
           S  SS+    L+Y+  E  +   LF+    +   +N + YDL D+ RQALA   N L   
Sbjct: 498 SGASSWGTSELYYAPEESKKVAALFLEVSEQYKGNNNFEYDLTDIMRQALADKGNVLQKK 557

Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
           I EAY+L D      LSR FL+L+   D LLA    F LG WLE AK   + EE+++
Sbjct: 558 ITEAYRLKDETAFRNLSREFLQLILWQDTLLATRPEFRLGTWLERAKAKGETEEEKR 614


>gi|449299394|gb|EMC95408.1| glycoside hydrolase family 89 protein [Baudoinia compniacensis UAMH
           10762]
          Length = 801

 Score =  235 bits (599), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 139/469 (29%), Positives = 239/469 (50%), Gaps = 44/469 (9%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           N+ G WGG LP SW+  Q  L K+I+ R+ ELGM PVLP F G VP  +   +P+A    
Sbjct: 226 NIQGSWGGDLPMSWISSQFTLGKQIVARMVELGMTPVLPCFPGFVPMQIGRYYPNAMYIN 285

Query: 62  LGNWFSVKSDPRWCCTY-LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
              W      PR       L+  DPL+  + ++FI +Q   YG  S IY  D ++EN P 
Sbjct: 286 GSQW---NGFPRQNTNVSFLEPFDPLYTTLQKSFISKQTAAYGNVSSIYTLDQYNENNPY 342

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVV 179
                Y+ ++ A   + +++ D +AVW++QGWLF S   FW    ++A L  V    +++
Sbjct: 343 SADTTYLRNISAGTIAALKAADPNAVWMLQGWLFFSSATFWTDAAIRAYLGGVNNTDMII 402

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLF+E +P W  +  +YG P+IWC LH++ GN+ +YG ++++   P++A  + ++TMVG
Sbjct: 403 LDLFSETQPQWQRTNSYYGKPWIWCELHDYGGNMGLYGQVENVTINPIQALNNASSTMVG 462

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV---PAIQDAWNVLY 296
           +G++MEG E N ++YD++ + A+    ++   + + +   RY  +    P +  AW+ + 
Sbjct: 463 MGLTMEGQEGNEIMYDILLDQAWSSTPLNNSLYFHDWVTSRYHGAASLPPGLYTAWDTMR 522

Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP- 355
            TVYN T  +T +   V  +  ++ P++  +                       +  HP 
Sbjct: 523 QTVYNNTQISTIQ--SVTKSIWELTPNVTGLLN--------------------RTGHHPT 560

Query: 356 HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
            + Y+TS ++ A + F  +  +   L  S  Y +DL D+TRQ +A     L+ + + A  
Sbjct: 561 TIQYNTSTLVGAWKQFYGAAAQEPTLWDSPGYLFDLTDVTRQVMANAFYPLYTSFVSASN 620

Query: 413 LNDAHGVFQ------LSRRFLELVEDMDGLLACH--DGFLLGPWLESAK 453
            + A+  +         ++ + L+  +D +LA      F L  W+  A+
Sbjct: 621 -HSANATYSPGNATIYGQQMVSLLSALDSMLAASPIPYFHLSTWIAEAR 668


>gi|440792549|gb|ELR13759.1| peptidase, S8/S53 subfamily protein [Acanthamoeba castellanii str.
           Neff]
          Length = 981

 Score =  234 bits (597), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 139/425 (32%), Positives = 217/425 (51%), Gaps = 51/425 (12%)

Query: 50  LQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIY 109
           ++ ++P+A +T+  +W       ++   Y L   D L+  IG   I    +E+G T HIY
Sbjct: 434 IKRIYPTANLTKSADWAGFPH--QYTNVYFLSPLDSLYKTIGSKVIRLVEEEFG-TDHIY 490

Query: 110 NCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALL 169
           N DTF+E +PP   P Y+++   A+Y GM + D  A+W+MQGW F +DPFW   ++KA L
Sbjct: 491 NADTFNEMSPPSADPTYLAAASRAVYEGMATQDPQALWVMQGWSFVFDPFWTKDRIKAYL 550

Query: 170 NSVPLGKLVVLDLFAEVKPIWSTSKQF----YGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
           + V    +++LDL ++  P W+ + QF    +G  ++WCMLHN  G   +YG L   +  
Sbjct: 551 SGVDNSDMLILDLASDNSPEWNKTGQFRDSYFGKEFVWCMLHNGGGVRGLYGNLTQYSSD 610

Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
           P+ A  +   TMVGVGM+ME IEQNPVVY+LMSEM ++ E  D+  W+ +Y+ RRYG + 
Sbjct: 611 PLIALATPGNTMVGVGMTMEAIEQNPVVYELMSEMGWRSEAFDIVEWVQRYAERRYGLAT 670

Query: 286 PA--IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
            +  + +AW +L    YN +                +D  +          +G   +   
Sbjct: 671 GSSPVGEAWELLREATYNQS---------------GLDAGLFGFAPALGMGHGGTSN--- 712

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNE--LSASNTYRYDLIDLTRQALAKYAN 401
                           +T EV  AL LF+ S      + +  ++YD +DLTRQ LA   N
Sbjct: 713 ----------------ATKEV-EALRLFLQSAQTEGYAPNGPWQYDCVDLTRQVLANTFN 755

Query: 402 ELFLNIIEAY-----QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
           +++  +  AY       +D      L+   L ++ D+D LLA +  +LLG W++ A   A
Sbjct: 756 DVYSQLDAAYTSYATNKSDTLPFLPLAAELLGIISDLDRLLATNPNYLLGTWIKDAVSWA 815

Query: 457 QNEEQ 461
              EQ
Sbjct: 816 SIPEQ 820


>gi|212541222|ref|XP_002150766.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
           18224]
 gi|210068065|gb|EEA22157.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
           18224]
          Length = 787

 Score =  233 bits (595), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 155/470 (32%), Positives = 242/470 (51%), Gaps = 41/470 (8%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
           + N+ G WG PLP  W++ Q  LQKKIL R+ ELG+ PVLP+F+G VP A+  V P+AK+
Sbjct: 203 LGNIQGFWGDPLPNEWIESQFELQKKILARMVELGITPVLPSFTGFVPRAITRVLPNAKV 262

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
                W    S+  + C   L+  D  F  + ++ I +Q   YG  SHIY  D ++EN P
Sbjct: 263 VPGSRWNVFSSN--YTCDTFLEPFDDNFALLQKSTISKQQAYYGNISHIYALDQYNENNP 320

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK-LV 178
              +P+Y+ ++       +++ D DAVWLMQ WLF    FW    + A L+ V     ++
Sbjct: 321 FSSNPDYLRNISRTTSQSLKAADPDAVWLMQSWLFLDATFWNNVTICAYLSGVENNSDML 380

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
           +LDLFAE +P+W  +  +YG P+IWC +H++ GN+ +YG + +I      A  S   +MV
Sbjct: 381 ILDLFAESQPVWQLTDSYYGKPWIWCQVHDYGGNMGLYGQIMNITENATAALASSG-SMV 439

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVP-AIQDAWNVL 295
           G G +ME  E N +VYDL+ + A+    ++   +   +   RY   + VP  + DAW +L
Sbjct: 440 GFGHTMESQEGNEIVYDLLLDQAWSETPINTSQYFEDWVTVRYAGTQHVPQQLFDAWEIL 499

Query: 296 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISV--TEGKYQNYGKPVSKEAVLKSETSSYD 353
             + YN T+ A+      I+   +++PSI  +   EG +                T +YD
Sbjct: 500 RWSAYNNTNLASSSVPKSIL---ELEPSISGLLNREGHHPT--------------TINYD 542

Query: 354 HPHLWYSTSEVIRALEL-FIASGNELSASNT--YRYDLIDLTRQALAKYANELFLNIIEA 410
            P L      V+ A  L + A+  ELS  +   + YDLI LTRQ L       +  +I  
Sbjct: 543 -PEL------VVEAWALTYEAALLELSLWDNPAFNYDLIFLTRQVLVNAFIPRYELLISF 595

Query: 411 YQLNDAHGV---FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 457
           Y  N+ + V       R+ ++L++ +D +L  ++ F L  W+  A   A 
Sbjct: 596 YN-NENYSVPAIVSAGRQLIDLLQSLDTVLGTNECFQLAQWINKAVSRAH 644


>gi|373461342|ref|ZP_09553084.1| hypothetical protein HMPREF9944_01348 [Prevotella maculosa OT 289]
 gi|371952896|gb|EHO70729.1| hypothetical protein HMPREF9944_01348 [Prevotella maculosa OT 289]
          Length = 731

 Score =  233 bits (594), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 145/473 (30%), Positives = 235/473 (49%), Gaps = 52/473 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++GW GPLPQSW+D Q  LQ++IL R  E GM PVLPAF+G+VP   + + P A+IT
Sbjct: 194 MLNINGWQGPLPQSWIDGQADLQRRILQREREFGMRPVLPAFNGSVPLDYKRLHPEARIT 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G W       R   TY L  TDP F ++ ++F+++Q + +G T H+Y  D+F+E  PP
Sbjct: 254 EVGQWGGFGQAYR---TYFLSPTDPRFGKLQKSFLDEQRRMFG-TDHLYCLDSFNEVQPP 309

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
             SP+ +  L   I++ +   D  +VW+  GWLF  D   W P  ++A L+ +P  + ++
Sbjct: 310 SWSPDTLCMLARHIHASLDKADPQSVWVQMGWLFYNDRKHWTPDVIRAYLSGIPKDRALL 369

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + +   +W  ++ FYG PYI C+L NF GN  + G +  ++   ++A  +++  M G
Sbjct: 370 LDYYIDHTELWRLTESFYGRPYIACVLGNFGGNTMLQGDVGKVS-SRLDAAIAQDGNMAG 428

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VG +MEG   NP  Y  + + A+     D + W+ + + R  G +  A + AW VL+  +
Sbjct: 429 VGATMEGFGVNPDFYAFVFDKAWDCGTTD-RDWLCRMADRHVGFASAAGRTAWQVLFDRI 487

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKS--ETSSYDHPHL 357
                   +++  V+ A P          E +Y N   P     V K   +  S    HL
Sbjct: 488 ---MPSYVNESGTVVCARPSF--------EARYLNTTYPAELLGVWKMLLDIDSDKREHL 536

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
                                       YD++++ RQ L  +       +  AY    + 
Sbjct: 537 ----------------------------YDVVNVGRQVLGDFFAFERDGLHRAYLSQRSD 568

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL----AQNEEQEKQVR 466
            V   +RR  ++++D+D LLAC + F L  W+E A+      A+ +  E+  R
Sbjct: 569 SVDYYARRMDKMLDDLDRLLACSEEFSLRKWIEDARGFGATAAEKDYYERNAR 621


>gi|410097657|ref|ZP_11292638.1| hypothetical protein HMPREF1076_01816 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409223747|gb|EKN16682.1| hypothetical protein HMPREF1076_01816 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 740

 Score =  233 bits (594), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 143/462 (30%), Positives = 225/462 (48%), Gaps = 46/462 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  +GGPLP+SW+D  + L K+++ R  ELGM P+   FSG VP  +   FP AKI 
Sbjct: 195 MPNIESFGGPLPKSWIDSHIALGKQVVNRQLELGMTPIQQGFSGAVPRKMMEKFPEAKIQ 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +  +W+  +      C   LD  DPLF E+G+ F+E++ K YG T  +Y  D F E+ PP
Sbjct: 255 KQPDWYGFEG----ICQ--LDPLDPLFTELGKTFLEEEQKLYG-TYGLYAADPFHESKPP 307

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           VD+PEY++++G++I+  M++ D DA+W+MQ W F  D          + + VP   L+VL
Sbjct: 308 VDTPEYLNAVGSSIHKLMKTFDPDALWVMQAWSFRKD----------IASVVPKHDLLVL 357

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
            L   +         F    ++   LHNF G + ++G L  ++         +   +VG 
Sbjct: 358 SLNGAL----GGEDHFCNHDFVVGNLHNFGGRVNLHGDLPLVSSNQFMKAKQKTPNVVGS 413

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G+ ME I QNPV Y+L  EM    + V ++ W+N+Y+ RRYG    A   AW +L    Y
Sbjct: 414 GLFMESIGQNPVFYELAFEMPVHQDSVKLEEWLNKYAERRYGAFSDAANKAWELLLAGPY 473

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
                  + +  +I A P VD        G    +  P   ++++++E            
Sbjct: 474 RAGTNGVESS-SIICARPAVDVK----KSGPNAGFNIPYDPQSLIEAEVC---------- 518

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
                      +    +L  S  YR+D++D+ RQ ++    E+     EA++  D     
Sbjct: 519 ----------LLQDAEQLKGSGPYRFDIVDVQRQIMSNLGQEIHKKAAEAFKKKDKEAFA 568

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
             S RFLEL++D+D LL     F    WL  A+     +E+ 
Sbjct: 569 LHSGRFLELLKDVDILLRTRTEFNFDQWLTDARAWGTTDEER 610


>gi|261199246|ref|XP_002626024.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis SLH14081]
 gi|239594232|gb|EEQ76813.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis SLH14081]
          Length = 752

 Score =  232 bits (592), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 140/473 (29%), Positives = 242/473 (51%), Gaps = 43/473 (9%)

Query: 1   MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
             NL G WGG   P  W D Q  LQKKIL R+ ELGM P+LPAF G VP A+  V P A+
Sbjct: 201 FGNLQGSWGGGNTPFKWYDAQFELQKKILARMSELGMTPILPAFPGYVPRAVTRVLPDAQ 260

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           +     W  +  +P++  T  L   DP  + + ++FI + ++ YG  +H Y  D F+E  
Sbjct: 261 VVNASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISKSIEAYGNVTHFYTLDQFNEMI 318

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS-YDPFWRPPQMKALLNS-VPLGK 176
           P    PE++  +       ++S D +A W+MQGWLF  +  +W   +++A L++      
Sbjct: 319 PSSGDPEFLRKVSETTMEAIKSVDPEATWVMQGWLFYIFADYWTTERIEAYLSAGKKFRD 378

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
           +++LDLFAE  P+W  +K F+G  ++WC +  F GN  +YG + +I  GP +A  +++  
Sbjct: 379 MLILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGLYGHVANITEGPAQA-MAQHPN 437

Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY---GRSVP-AIQDAW 292
           MVGVG + EG   N +V+ L+ +  +    +D + + + +  RRY   GR+VP  + +AW
Sbjct: 438 MVGVGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHDWVTRRYSSHGRTVPNELYEAW 497

Query: 293 NVLYHTVYNCTD--GATDKNRDVIVAFPDVDPSI-ISVTEGKYQNYGKPVSKEAVLKSET 349
            +L  + YN T+   A      +  A P ++  + +   EG                   
Sbjct: 498 QLLRLSAYNNTNLVDAPLLPHALFAASPSINAKMPMLFIEG------------------- 538

Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
                  L Y  +++++A  L I     L   ++Y+YD++D+TRQ L+     +  ++  
Sbjct: 539 -------LLYDPADMLKAWGLMIKGA--LFGDSSYQYDIVDVTRQVLSDAFTLVLQDLKV 589

Query: 410 AYQLNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            Y+      VF  +  + L +++ +D +L+ ++ F L  W+ +A+  A ++ +
Sbjct: 590 KYKGGAPASVFMPIGDKLLIILKALDAVLSMNENFWLSSWISAARASAGDDSE 642


>gi|239615395|gb|EEQ92382.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis ER-3]
          Length = 829

 Score =  232 bits (592), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 140/473 (29%), Positives = 242/473 (51%), Gaps = 43/473 (9%)

Query: 1   MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
             NL G WGG   P  W D Q  LQKKIL R+ ELGM P+LPAF G VP A+  V P A+
Sbjct: 221 FGNLQGSWGGGNTPFKWYDAQFELQKKILARMSELGMTPILPAFPGYVPRAVTRVLPDAQ 280

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           +     W  +  +P++  T  L   DP  + + ++FI + ++ YG  +H Y  D F+E  
Sbjct: 281 VVNASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISKSIEAYGNVTHFYTLDQFNEMI 338

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS-YDPFWRPPQMKALLNS-VPLGK 176
           P    P+++  +       ++S D +A W+MQGWLF  +  +W   +++A L++      
Sbjct: 339 PSSGDPKFLRKVSETTMEAIKSVDPEATWVMQGWLFYIFADYWTTERIEAYLSAGKKFRD 398

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
           +++LDLFAE  P+W  +K F+G  ++WC +  F GN  +YG + +I  GP EA  +++  
Sbjct: 399 MLILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGLYGHVANITEGPAEA-MAQHPN 457

Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG---RSVPA-IQDAW 292
           MVGVG + EG   N +V+ L+ +  +    +D + + + +  RRY    R+VP+ + +AW
Sbjct: 458 MVGVGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHDWVTRRYSSHERTVPSELYEAW 517

Query: 293 NVLYHTVYNCTD--GATDKNRDVIVAFPDVDPSI-ISVTEGKYQNYGKPVSKEAVLKSET 349
            +L  + YN T+   A      +  A P ++  + +   EG                   
Sbjct: 518 QLLRLSAYNNTNLVDAPLLPHALFAASPSINAKMPMLFIEG------------------- 558

Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
                  L Y  +++++A  L I     L   ++Y+YD++D+TRQ L+     +  ++  
Sbjct: 559 -------LLYDPADMLKAWGLMIKGA--LFGDSSYQYDIVDVTRQVLSDAFTLVLQDLKV 609

Query: 410 AYQLNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            Y+      VF  +  + L +++ +D +L+ ++ F L  W+ +A+  A +E +
Sbjct: 610 KYKGGAPASVFMPIGDKLLIILKALDAVLSMNENFWLSSWISAARASAGDESE 662


>gi|429740222|ref|ZP_19273924.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
 gi|429153947|gb|EKX96708.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
          Length = 730

 Score =  232 bits (592), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 142/463 (30%), Positives = 222/463 (47%), Gaps = 37/463 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP+ WL++Q  LQK+IL R     M PVLPAF+G+VPA L+ +FP A I 
Sbjct: 198 MANIDRWNGPLPKEWLEEQRDLQKQILARERAFNMKPVLPAFAGHVPAELKRIFPDANIK 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W     D ++ C + L+  +PLF +I + F+E+Q   +G T HIY  D F+E  PP
Sbjct: 258 SLGKWGGF--DEQYLC-HFLNPGEPLFAKIQKLFLEEQTALFG-TDHIYGVDPFNEGEPP 313

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              P Y+  +   +Y  + + D  A W+  GW+F YD   W P ++KA L  VP GK+ +
Sbjct: 314 SWEPAYLKEISKNMYGTLTAVDPKAEWMQMGWMFYYDKKVWTPKRVKAFLTGVPQGKMSL 373

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD   E   +W T+  FYG PYIWC L NF GN  + G +         A  +    M+G
Sbjct: 374 LDYHCENVELWKTNDGFYGQPYIWCYLGNFGGNTTLTGNVKETGKRLDAALKAARRNMLG 433

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VG ++EG++     Y+ + +  + H     + WI++ + R  G + P+++ AW +L+  +
Sbjct: 434 VGSTLEGLDVIQFPYEYVFDKVWTHSDKGNQQWIDELADRHAGFTSPSVRKAWQILFDEI 493

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           +    G              + PS   V    +              SE +   +P    
Sbjct: 494 FVQVPGTY-----------SILPSRSPVLNDNH--------------SERTEIKYPA--- 525

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              E + +L L +         N  + DLI + RQ L      +      AY   D   +
Sbjct: 526 QRLEEVWSLLLDVPQ----CERNELQVDLIAVGRQVLGNKFLAVKSEFDAAYAAKDITLL 581

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
            Q +    EL+ D+D L + +    +  W++ A+ L +N E +
Sbjct: 582 RQKAYEMEELLSDLDCLTSFNTRCTVNKWIDDARALGRNAEMK 624


>gi|453081268|gb|EMF09317.1| glycoside hydrolase family 89 protein [Mycosphaerella populorum
           SO2202]
          Length = 784

 Score =  232 bits (591), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 124/398 (31%), Positives = 215/398 (54%), Gaps = 27/398 (6%)

Query: 8   GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFS 67
           GG LPQSW+DQQ  L + I+ R+ ELGM PVLP F+G VP  +  ++P+A       W  
Sbjct: 218 GGDLPQSWIDQQFELNQLIIARMIELGMTPVLPCFTGFVPTQISRLYPNASFVNGSQWNG 277

Query: 68  VKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYI 127
            ++  ++     L+  DPLF  + ++FI +    YG  S +Y  D ++EN P   +  Y+
Sbjct: 278 FQA--QYTNVTFLEPFDPLFTTLQKSFISKLDAAYGNVSSVYTLDQYNENDPFSGNVTYL 335

Query: 128 SSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVVLDLFAEV 186
             + +     +++ D +A+W +QGWLF S   FW   ++KA L  V    +++LDLF+E 
Sbjct: 336 EDVASNTIKSLKAADPEAIWFIQGWLFYSAADFWDEERIKAYLGGVEDKDMLILDLFSES 395

Query: 187 KPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEG 246
           +P W  +  ++G P+IWC LH++ GN  ++G ++++   P+ A  +E +TMVG+G++MEG
Sbjct: 396 QPQWQRTNSYFGKPWIWCQLHDYGGNQGLHGQVENVTMNPILALANETSTMVGIGLTMEG 455

Query: 247 IEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY--GRSVPAI-QD---AWNVLYHTVY 300
            E N ++YD++ + A+  E ++   + + +   RY    +V  + QD   AW+++  T+Y
Sbjct: 456 QEGNEIIYDILLDQAWTPEPIESAGYFDDWVTSRYHCDDAVAGLPQDLYIAWDMMRQTIY 515

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           N TD   D    V  +  ++ P+   + +    +  + +    +L S        H + +
Sbjct: 516 NNTD--IDTAEAVTKSIFELQPNTTGLLDRTGHHSTRILYDPEILVSA-----WKHFYSA 568

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAK 398
           + E  +  EL            +YR+DL+D+TRQ LA 
Sbjct: 569 SQETPQLWEL-----------ESYRFDLVDITRQVLAN 595


>gi|399028591|ref|ZP_10729778.1| Alpha-N-acetylglucosaminidase (NAGLU) [Flavobacterium sp. CF136]
 gi|398073682|gb|EJL64846.1| Alpha-N-acetylglucosaminidase (NAGLU) [Flavobacterium sp. CF136]
          Length = 727

 Score =  231 bits (590), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 149/470 (31%), Positives = 234/470 (49%), Gaps = 37/470 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++   GPLPQ W++++  +QKKIL R+  LGM+PV+PAFSG VP A     P +KI+
Sbjct: 209 MGNINSLEGPLPQEWINKKENVQKKILQRMRALGMHPVVPAFSGYVPKAFAEKHPGSKIS 268

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L +W    S   +  TYLLDA DPLF EIG+ FIE   K YG+ +  Y  D F+E TPP
Sbjct: 269 ELKSW----SGGGFESTYLLDANDPLFKEIGKRFIEIYTKLYGQ-ADFYLADAFNEITPP 323

Query: 121 VDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK 176
           V      E +S  G  I+  +     DA W+MQGWLF  +  FW     KA L+ VP  +
Sbjct: 324 VSKEHKYEELSDYGKTIFETINEASPDATWVMQGWLFGDNKEFWTKEATKAFLSKVPNDR 383

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT- 235
           +++ D   +   +W   + FYG  + +  +HN+ G+  +YG L+           + N  
Sbjct: 384 MMIQDYANDRHKVWEKQEAFYGKQWTYGYVHNYGGSNPVYGDLNFYKNELTHLLGNSNKG 443

Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR--SVPAIQDAWN 293
            +VG G+  EG+  N +VY+ + ++ +   K  V  W+N+Y   RYG+  S P  Q AW 
Sbjct: 444 NVVGYGVMPEGLNNNSIVYEYIYDLPWSQGKESVNDWLNKYLSARYGKNISTPVFQ-AWK 502

Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
           +L  +VY+     T    D   A+       + +TE K    G P  K+           
Sbjct: 503 LLIESVYSTKYWETRWWDDRAGAYLFFKRPTLKITEFK----GNPGDKQ----------- 547

Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
                    ++ +AL++        + ++ Y YDL+D++R   +   ++L +  + AY+L
Sbjct: 548 ---------KLKQALDILKRESKSFNKNSLYFYDLLDMSRHYYSLCIDDLLIECVTAYEL 598

Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
            D     +L ++  +   D+D +L+      L  WL+SA     + E  K
Sbjct: 599 KDIKKADELFKKIEKQALDIDNMLSGQPLNSLNNWLKSASDYGSSPEVSK 648


>gi|358391826|gb|EHK41230.1| glycoside hydrolase family 89 protein [Trichoderma atroviride IMI
           206040]
          Length = 751

 Score =  231 bits (590), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 139/459 (30%), Positives = 246/459 (53%), Gaps = 29/459 (6%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G W G +P +W+D Q  LQ +IL R+ ELG+ P+LPAF G VP  +  VFP   +
Sbjct: 202 FGNIQGSWNGNMPGNWVDDQFALQLQILDRMKELGITPILPAFPGFVPRNISRVFPGISL 261

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
           +    W +   D     TY ++  DP F ++ + FI +Q + YG  +  +  D F+EN P
Sbjct: 262 STSPLWENFAEDLS-ADTY-VNPFDPHFTQLQKLFIGKQQELYGNVTKFWTLDQFNENQP 319

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPL-GKL 177
                 Y+ ++    ++ ++S   DA+W+MQ WLFS D  FW    ++A L  +     +
Sbjct: 320 LSSDLGYLRNVSQNTWTALKSASPDAIWVMQAWLFSADSSFWTNDAIEAFLGGITEDSDM 379

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           ++LDLFAE  P W  +  FYG P+IWC LH++ GN+ +YG ++++    ++A    ++++
Sbjct: 380 LLLDLFAESAPQWLRTNSFYGKPWIWCELHDYGGNMGLYGQIENVTINAMQA-VRNSSSL 438

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVPAIQDAWNVLY 296
           VG G++MEG E N ++YDL+ + A+  + +D + + + +   RYG  +V ++   W +L 
Sbjct: 439 VGFGLTMEGQEGNEIMYDLLLDQAWSPKPIDTETYFHDWVSARYGTENVKSLYTGWELLR 498

Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
            TV+N T+   +     I+   ++ P+I  +  G+   +G  +           +YD   
Sbjct: 499 PTVFNNTNLTVNAVPKSIL---ELTPNINGLL-GRVGRHGTTI-----------NYDPAV 543

Query: 357 LWYSTSEVIRA-LELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN- 414
           +  + +E+ +A LE     GN       Y+YDL+D TRQ L    + L+ +++ AY  + 
Sbjct: 544 MVDAWTELFKAGLEDVKLFGNP-----AYQYDLVDWTRQVLVNSFDGLYKDLVTAYNSSA 598

Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
           +A  +     +   L++ +D +LA ++ F L  W+ +A+
Sbjct: 599 NAAEIRSRGSKLTALLKTLDAVLATNENFQLATWIAAAR 637


>gi|423259033|ref|ZP_17239956.1| hypothetical protein HMPREF1055_02233 [Bacteroides fragilis
           CL07T00C01]
 gi|423263996|ref|ZP_17242999.1| hypothetical protein HMPREF1056_00686 [Bacteroides fragilis
           CL07T12C05]
 gi|387776613|gb|EIK38713.1| hypothetical protein HMPREF1055_02233 [Bacteroides fragilis
           CL07T00C01]
 gi|392706262|gb|EIY99385.1| hypothetical protein HMPREF1056_00686 [Bacteroides fragilis
           CL07T12C05]
          Length = 718

 Score =  231 bits (589), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 154/480 (32%), Positives = 239/480 (49%), Gaps = 65/480 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  +Q+ LQKKIL R++E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMHEYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++ ++  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
           G L++LDL +E +P W  S      K  YG   +I+CML N+ GN+ ++G +D++   F 
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
             +A    + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++   + W+ +Y   RYG   
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463

Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                 SS+     +Y   EVI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
              +  AY+  D       S +FL+L+   D LL     F +G W+E A+ L    E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619


>gi|62088640|dbj|BAD92767.1| huntingtin interacting protein-1-related [Homo sapiens]
          Length = 449

 Score =  231 bits (589), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 130/329 (39%), Positives = 185/329 (56%), Gaps = 45/329 (13%)

Query: 142 DSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVP 200
           D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+VLDLFAE +P+++ +  F G P
Sbjct: 18  DTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQP 77

Query: 201 YIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEM 260
           +IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+
Sbjct: 78  FIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAEL 137

Query: 261 AFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFP 318
            ++ + V D+ AW+  ++ RRYG S P    AW +L  +VYNC+ +     NR  +V  P
Sbjct: 138 GWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRP 197

Query: 319 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 378
                                     L+  TS      +WY+ S+V  A  L + S   L
Sbjct: 198 S-------------------------LQMNTS------IWYNRSDVFEAWRLLLTSAPSL 226

Query: 379 SASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVED 432
           + S  +RYDL+DLTRQA+ +  +  +     AY       L  A GV        EL+  
Sbjct: 227 ATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELASLLRAGGVLA-----YELLPA 281

Query: 433 MDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
           +D +LA    FLLG WLE A+  A +E +
Sbjct: 282 LDEVLASDSRFLLGSWLEQARAAAVSEAE 310


>gi|423248659|ref|ZP_17229675.1| hypothetical protein HMPREF1066_00685 [Bacteroides fragilis
           CL03T00C08]
 gi|423253608|ref|ZP_17234539.1| hypothetical protein HMPREF1067_01183 [Bacteroides fragilis
           CL03T12C07]
 gi|392655237|gb|EIY48880.1| hypothetical protein HMPREF1067_01183 [Bacteroides fragilis
           CL03T12C07]
 gi|392657600|gb|EIY51231.1| hypothetical protein HMPREF1066_00685 [Bacteroides fragilis
           CL03T00C08]
          Length = 718

 Score =  231 bits (589), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 154/480 (32%), Positives = 239/480 (49%), Gaps = 65/480 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  +Q+ LQKKIL R++E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMHEYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++ ++  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
           G L++LDL +E +P W  S      K  YG   +I+CML N+ GN+ ++G +D++   F 
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
             +A    + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++   + W+ +Y   RYG   
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463

Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                 SS+     +Y   EVI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
              +  AY+  D       S +FL+L+   D LL     F +G W+E A+ L    E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619


>gi|53711968|ref|YP_097960.1| alpha-N-acetylglucosaminidase [Bacteroides fragilis YCH46]
 gi|52214833|dbj|BAD47426.1| alpha-N-acetylglucosaminidase precursor [Bacteroides fragilis
           YCH46]
          Length = 718

 Score =  231 bits (589), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 154/480 (32%), Positives = 239/480 (49%), Gaps = 65/480 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  +Q+ LQKKIL R++E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMHEYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++ ++  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
           G L++LDL +E +P W  S      K  YG   +I+CML N+ GN+ ++G +D++   F 
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
             +A    + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++   + W+ +Y   RYG   
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463

Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                 SS+     +Y   EVI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
              +  AY+  D       S +FL+L+   D LL     F +G W+E A+ L    E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619


>gi|374312699|ref|YP_005059129.1| alpha-N-acetylglucosaminidase [Granulicella mallensis MP5ACTX8]
 gi|358754709|gb|AEU38099.1| Alpha-N-acetylglucosaminidase [Granulicella mallensis MP5ACTX8]
          Length = 754

 Score =  231 bits (589), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 149/478 (31%), Positives = 231/478 (48%), Gaps = 53/478 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++ + GPLPQ +++++ +LQ+++L R+ ELGM PV PAF+G VP   + + P  +  
Sbjct: 221 MGNINHFAGPLPQHFMEEKRILQRQVLNRMRELGMKPVAPAFAGFVPQGFKRLHPEVETF 280

Query: 61  QLGNWF--SVKSDPRWCCTYLLD-ATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
            L  W     K+ PR   T++L      L+ +IG+ FIE+   EYG   + Y  DTF+E 
Sbjct: 281 TL-LWLRKEFKTIPRSTRTFILHPGQQELYRQIGKKFIEEYKAEYGEVEY-YLADTFNEL 338

Query: 118 TPPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVP 173
             PV      E +   G  ++  +Q+GD    W+MQGWLF YD  FW    ++ALL  +P
Sbjct: 339 EVPVREDHRYEDLERFGRTVFESIQAGDPKGTWVMQGWLFVYDSDFWNKESVEALLRGIP 398

Query: 174 LGKLVVLDLFAEVKPI---------WSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
             +++++D   ++ P          W   K F+G P+I  M H F GN  + G L  +A 
Sbjct: 399 NDRMLIIDYANDLAPSVQGKYLPGQWKLQKAFFGKPWINGMAHTFGGNNNIKGNLKLMAT 458

Query: 225 GPVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
            P     S E   +VG GM  EGIE N VVY+LM++  +Q E +D+  WI  Y   RYG 
Sbjct: 459 EPSTVLASPERGNLVGWGMCPEGIENNEVVYELMTDAGWQSEAIDLATWIPAYCRSRYGD 518

Query: 284 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
             PA+Q AW +L  + Y+     T   +    A P V P   SV  G       P  +  
Sbjct: 519 CPPAMQQAWELLLKSAYSSHIWMT---KQAWQAEPSVHPIAASVDAG-------PTFQ-- 566

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                                 RA+ELF++   +L+ S  YR DLI+   QA+    +E 
Sbjct: 567 ----------------------RAVELFLSCAPQLAKSELYRNDLIEFVSQAVGGRVDEA 604

Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
               ++A            + R +E +  +DGL+       L  W+++ +  A+ +++
Sbjct: 605 LALAVQAGDAKQDEDAVAHAARAVEWMRRIDGLMNLRPDRRLETWMQATRAYAKTDDE 662


>gi|333031143|ref|ZP_08459204.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
 gi|332741740|gb|EGJ72222.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
          Length = 723

 Score =  231 bits (589), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 145/461 (31%), Positives = 220/461 (47%), Gaps = 39/461 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP+ WLD Q  LQK+IL R  EL M PVLPAF+G+VP+ L+++FP A I 
Sbjct: 197 MANIDSWNGPLPKEWLDHQSDLQKQILKRERELNMKPVLPAFAGHVPSELKHLFPEADIQ 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C +L +  DPLF +I R F+E+Q + +G T HIY  D F+E  PP
Sbjct: 257 HLGKWAGFADKYR--CNFL-NPNDPLFAKIQRLFLEEQTRLFG-TDHIYGVDPFNEVDPP 312

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
              PEY+  + A +Y  +   D  A WL   WLF +    W  P+++ALL  VP  +L +
Sbjct: 313 SWEPEYLKKVAADMYRTLTDVDPKAKWLQMTWLFYHGKKKWTAPRIEALLTGVPQDELYL 372

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD   E   +W T+  F+G PYIWC L NF GN  + G +            +      G
Sbjct: 373 LDYHCENVELWKTTDYFHGQPYIWCYLGNFGGNTTITGNVKESGQRLENTLINGGNNFKG 432

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG++     Y+ + + A+    +D  +W+   + R  G+   A ++AW +L++ V
Sbjct: 433 IGSTLEGLDVMQFPYEYIFDKAWTFN-MDDNSWVENLADRHLGKKSEAYREAWKILFNDV 491

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y     +       +   P+  P +            KP +K  V  ++  + D   +W 
Sbjct: 492 YVQVPKS-------LGVLPNFRPEM-----------SKP-NKRTV--NDYKNKDLVKVWA 530

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              EV                 + Y  DLI + RQ L  Y   +     + YQ  D  G+
Sbjct: 531 KLLEVKEC------------TRDAYIIDLITVGRQVLGNYFLVVKNEFDQMYQFKDLPGL 578

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
                +  E++ D++ L A H+   L  W+  A+ L    E
Sbjct: 579 ESRGAKLREILNDLENLTAFHNHCTLEKWISDARALGNTIE 619


>gi|160887167|ref|ZP_02068170.1| hypothetical protein BACOVA_05183 [Bacteroides ovatus ATCC 8483]
 gi|423295093|ref|ZP_17273220.1| hypothetical protein HMPREF1070_01885 [Bacteroides ovatus
           CL03T12C18]
 gi|156107578|gb|EDO09323.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
           8483]
 gi|392673999|gb|EIY67450.1| hypothetical protein HMPREF1070_01885 [Bacteroides ovatus
           CL03T12C18]
          Length = 711

 Score =  231 bits (588), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 148/477 (31%), Positives = 240/477 (50%), Gaps = 59/477 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ  LQKKI+ R+ ELG+ PV P ++G VP  +          
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVARMRELGIEPVFPGYAGMVPRNIGE-------- 247

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
           +LG  + +    +WC       L   D  F      + E+  K YG+ ++ Y+ D F E 
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKANY-YSMDPFHEG 304

Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
            NT  VD    ++  GA+I + M+  +  AVW++Q W  S       P+ + ++ S+  G
Sbjct: 305 GNTEGVD----LAKTGASIMAAMKKANPKAVWIIQAWQAS-------PR-EEMIASLNQG 352

Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
            L+VLDL++E +P       +W   K F    +++CML NF GN+ ++G ++ +  G  +
Sbjct: 353 DLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412

Query: 229 ARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
           A    N  M+ GVG + EGIE NPV+++L+ E+ ++ E+     W+  Y   RYGR V P
Sbjct: 413 ACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWREERFSSDEWLQTYLKARYGREVSP 472

Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
            I +AW  L HTVYN      D   +  +       S++    G + +            
Sbjct: 473 EIMEAWRALEHTVYNA---PKDYQGEGTIE------SLLCARPGFHLD------------ 511

Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
             TS++ +  L+Y+     +A  LF +  ++   +N + YDL+D+ RQ+ A   N L   
Sbjct: 512 -RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNNFEYDLVDIVRQSNADKGNVLLEE 570

Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
           I ++Y   D     + +++FL+L+   D LL+    F +  WL +A+ L   EE+++
Sbjct: 571 ISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEFSVSSWLNAARSLGTTEEEKR 627


>gi|327356744|gb|EGE85601.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis ATCC 18188]
          Length = 752

 Score =  231 bits (588), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 139/473 (29%), Positives = 242/473 (51%), Gaps = 43/473 (9%)

Query: 1   MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
             NL G WGG   P  W D Q  LQKKIL R+ ELGM P+LPAF G VP A+  V P A+
Sbjct: 201 FGNLQGSWGGGNTPFKWYDAQFELQKKILARMSELGMTPILPAFPGYVPRAVTRVLPDAQ 260

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           +     W  +  +P++  T  L   DP  + + ++FI + ++ YG  +H Y  D F+E  
Sbjct: 261 VVNASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISKSIEAYGNVTHFYTLDQFNEMI 318

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS-YDPFWRPPQMKALLNS-VPLGK 176
           P    P+++  +       ++S D +A W+MQGWLF  +  +W   +++A L++      
Sbjct: 319 PSSGDPKFLRKVSETTMEAIKSVDPEATWVMQGWLFYIFADYWTTERIEAYLSAGKKFRD 378

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
           +++LDLFAE  P+W  +K F+G  ++WC +  F GN  +YG + +I  GP EA  +++  
Sbjct: 379 MLILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGLYGHVANITEGPAEA-MAQHPN 437

Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG---RSVPA-IQDAW 292
           MVGVG + EG   N +V+ L+ +  +    +D + + + +  RRY    R+VP+ + +AW
Sbjct: 438 MVGVGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHDWVTRRYSSHERTVPSELYEAW 497

Query: 293 NVLYHTVYNCTD--GATDKNRDVIVAFPDVDPSI-ISVTEGKYQNYGKPVSKEAVLKSET 349
            +L  + YN T+   A      +  A P ++  + +   EG                   
Sbjct: 498 QLLRLSAYNNTNLVDAPLLPHALFAASPSINAKMPMLFIEG------------------- 538

Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
                  L Y  +++++A  L I     L   ++Y+YD++D+TRQ L+     +  ++  
Sbjct: 539 -------LLYDPADMLKAWGLMIKGA--LFGDSSYQYDIVDVTRQVLSDAFTLVLQDLKV 589

Query: 410 AYQLNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            Y+      VF  +  + L +++ +D +L+ ++ F L  W+ +A+  A ++ +
Sbjct: 590 KYKGGAPASVFMPIGDKLLIILKALDAVLSMNENFWLSSWISAARASAGDDSE 642


>gi|313145188|ref|ZP_07807381.1| glycoside hydrolase family 89 [Bacteroides fragilis 3_1_12]
 gi|313133955|gb|EFR51315.1| glycoside hydrolase family 89 [Bacteroides fragilis 3_1_12]
          Length = 718

 Score =  230 bits (587), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 154/479 (32%), Positives = 236/479 (49%), Gaps = 65/479 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ+ LQKKIL R+ E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTQQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ +  Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ADFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++ ++  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLGA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
           G L++LDL +E +P W  S      K  YG   +++CML N+ GN+ ++G +D++   F 
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
             +A     +T+ GVGM+ EGIE NPV+Y+L+ E+ ++ E+   + W+ +Y   RYG   
Sbjct: 404 LAKADPHAGSTLKGVGMAPEGIENNPVMYELVMELPWRAERFTKEEWLKEYVKARYGADD 463

Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTKLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                 SS+     +Y   +VI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQDVIEAARLMVSVADRYKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
              +  AY+  D       S++FL L+   D LL     F +G W+E A+ L    E++
Sbjct: 560 QKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTRPEFRVGKWIEEARALGGTSEEK 618


>gi|424666301|ref|ZP_18103337.1| hypothetical protein HMPREF1205_02176 [Bacteroides fragilis HMW
           616]
 gi|404573840|gb|EKA78592.1| hypothetical protein HMPREF1205_02176 [Bacteroides fragilis HMW
           616]
          Length = 718

 Score =  230 bits (587), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 154/479 (32%), Positives = 236/479 (49%), Gaps = 65/479 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ+ LQKKIL R+ E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTQQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ +  Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ADFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++ ++  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
           G L++LDL +E +P W  S      K  YG   +++CML N+ GN+ ++G +D++   F 
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
             +A     +T+ GVGM+ EGIE NPV+Y+L+ E+ ++ E+   + W+ +Y   RYG   
Sbjct: 404 LAKADPHAGSTLKGVGMTPEGIENNPVMYELVMELPWRAERFTKEEWLKEYVKARYGADD 463

Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTKLANSIYNSPKNLTQQGTHEAVFCARP---------AEDVYQ---------- 504

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                 SS+     +Y   +VI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQDVIEAARLMVSVADRYKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
              +  AY+  D       S++FL L+   D LL     F +G W+E A+ L    E++
Sbjct: 560 QKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTRPEFRVGKWIEEARALGGTSEEK 618


>gi|237719043|ref|ZP_04549524.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
 gi|229451821|gb|EEO57612.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
          Length = 713

 Score =  230 bits (587), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 145/459 (31%), Positives = 224/459 (48%), Gaps = 42/459 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 181 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 240

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C +L +  D LF +I + F+++Q K +G T HIY  D F+E  PP
Sbjct: 241 HLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 296

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PEY+  + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++
Sbjct: 297 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 356

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD   E   +W  ++ F+  PYIWC L NF GN  + G +         A  +    + G
Sbjct: 357 LDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 416

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG++     Y+ + E A+ +  VD   WI   + R  G    +++DAW  L++ +
Sbjct: 417 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 475

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y              V  P         T G    Y +P   +   K  ++ Y +  L  
Sbjct: 476 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 510

Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
              EV R L       NE S+   + +R DLI + RQ L  Y  ++ +      +  D  
Sbjct: 511 --LEVWRKL-------NEASSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVETKDHQ 561

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
            +     +  E++ D+D L A H    L  W++ A+++ 
Sbjct: 562 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMG 600


>gi|423280158|ref|ZP_17259071.1| hypothetical protein HMPREF1203_03288 [Bacteroides fragilis HMW
           610]
 gi|404584494|gb|EKA89159.1| hypothetical protein HMPREF1203_03288 [Bacteroides fragilis HMW
           610]
          Length = 718

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 154/479 (32%), Positives = 236/479 (49%), Gaps = 65/479 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ+ LQKKIL R+ E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTQQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ +  Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ADFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++ ++  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLGA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
           G L++LDL +E +P W  S      K  YG   +++CML N+ GN+ ++G +D++   F 
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
             +A     +T+ GVGM+ EGIE NPV+Y+L+ E+ ++ E+   + W+ +Y   RYG   
Sbjct: 404 LAKADPHAGSTLKGVGMTPEGIENNPVMYELVMELPWRAERFTKEEWLKEYVKARYGADD 463

Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTKLANSIYNSPKNLTQQGTHESVFSARP---------AEDVYQ---------- 504

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                 SS+     +Y   +VI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQDVIEAARLMVSVADRYKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
              +  AY+  D       S++FL L+   D LL     F +G W+E A+ L    E++
Sbjct: 560 QKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTRPEFRVGKWIEEARALGGTSEEK 618


>gi|423299508|ref|ZP_17277533.1| hypothetical protein HMPREF1057_00674 [Bacteroides finegoldii
           CL09T03C10]
 gi|408473317|gb|EKJ91839.1| hypothetical protein HMPREF1057_00674 [Bacteroides finegoldii
           CL09T03C10]
          Length = 727

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 139/459 (30%), Positives = 228/459 (49%), Gaps = 45/459 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           MSN+  W  PLP+ WL QQ  LQK+IL R  E  M PVLPAF+G+VPA L+ ++P+AKI 
Sbjct: 194 MSNVDYWQSPLPKDWLVQQEELQKRILAREREFNMTPVLPAFAGHVPAELKKIYPNAKIY 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W       R   ++ +D  D L+  I + F+E+Q K YG T HIY  D F+E   P
Sbjct: 254 TMSQWGGFDKQYR---SHFIDPMDSLYSVIQKRFLEEQTKIYG-TDHIYGIDPFNEVDSP 309

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
             + E++S++   IY  + S D +A WL   W+F Y    W P ++K+ L +VP  KL++
Sbjct: 310 DWNEEFLSNVSRKIYESLHSVDPEAQWLQMTWMFYYAKDKWTPSRIKSFLRAVPQDKLIL 369

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + +   IW  ++ +YG PYIWC L NF GN  + G L+       +        + G
Sbjct: 370 LDYYCDHTEIWKKTEGYYGQPYIWCYLGNFGGNTMLAGNLNDTYEKIHQVLAEGGQNIHG 429

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G+++E  + NP++Y+ + E A++  +     WI  ++  R G++ PA+  AW  L+  +
Sbjct: 430 LGVTLEAFDVNPMMYEFVFEQAWEGAQ-PTDEWIATWAKCRGGQTCPAVLKAWKELHEKI 488

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                   + PS+                 +AVL +     +    W 
Sbjct: 489 Y-------------------IAPSLCG---------------QAVLMNARPQLEGVQGWN 514

Query: 360 STSEV-IRALELFIASGNELSASNT----YRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
           +  E      +L++  G+ L   +     + +D++++ RQ L    ++        Y+  
Sbjct: 515 TFPEYKYDNKDLWVIWGSLLQVGSIDKPGHAFDVVNVGRQVLGNLFSDYRAQFTACYKRK 574

Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
           D  G  + ++R   L+ D+D LLAC   F +G W++ A+
Sbjct: 575 DVKGAQEWAQRMDALLLDVDRLLACSPLFSMGKWIQDAR 613


>gi|423282107|ref|ZP_17260992.1| hypothetical protein HMPREF1204_00530 [Bacteroides fragilis HMW
           615]
 gi|404582594|gb|EKA87288.1| hypothetical protein HMPREF1204_00530 [Bacteroides fragilis HMW
           615]
          Length = 718

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 154/480 (32%), Positives = 238/480 (49%), Gaps = 65/480 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  +Q+ LQKKIL R+ E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++ ++  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
           G L++LDL +E +P W  S      K  YG   +I+CML N+ GN+ ++G +D++   F 
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
             +A    + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++   + W+ +Y   RYG   
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463

Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                 SS+     +Y   EVI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
              +  AY+  D       S +FL+L+   D LL     F +G W+E A+ L    E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619


>gi|299148671|ref|ZP_07041733.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
           3_1_23]
 gi|383114572|ref|ZP_09935334.1| hypothetical protein BSGG_1257 [Bacteroides sp. D2]
 gi|298513432|gb|EFI37319.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
           3_1_23]
 gi|313693722|gb|EFS30557.1| hypothetical protein BSGG_1257 [Bacteroides sp. D2]
          Length = 711

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 146/477 (30%), Positives = 238/477 (49%), Gaps = 59/477 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ  LQKKI+ R+ ELG+ PV P ++G VP  +          
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVARMRELGIEPVFPGYAGMVPRNIGE-------- 247

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
           +LG  + +    +WC       L   D  F      + E+  K YG+ ++ Y+ D F E 
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKANY-YSMDPFHEG 304

Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
            NT  VD    ++  GA+I + M+  +  AVW++Q         W+    + ++ S+  G
Sbjct: 305 GNTEGVD----LAKTGASIMAAMKKANPKAVWIIQA--------WQANPREEMIASLNQG 352

Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
            L+VLDL++E +P       +W   K F    +++CML NF GN+ ++G ++ +  G  +
Sbjct: 353 DLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412

Query: 229 ARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
           A    N  M+ GVG + EGIE NPV+++L+ E+ ++ E+     W+  Y   RYGR V P
Sbjct: 413 ACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWREERFSSDEWLQTYLKARYGREVSP 472

Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
            I +AW  L HTVYN      D   +  +       S++    G + +            
Sbjct: 473 EIMEAWRALEHTVYNA---PKDYQGEGTIE------SLLCARPGFHLD------------ 511

Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
             TS++ +  L+Y+     +A  LF +  ++   +N + YDL+D+ RQ+ A   N L   
Sbjct: 512 -RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNNFEYDLVDIVRQSNADKGNVLLEE 570

Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
           I ++Y   D     + +++FL+L+   D LL+    F +  WL +A+ L   EE+++
Sbjct: 571 ISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEFSVSSWLNAARSLGTTEEEKR 627


>gi|60680169|ref|YP_210313.1| alpha-N-acetylglucosaminidase [Bacteroides fragilis NCTC 9343]
 gi|375357012|ref|YP_005109784.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis 638R]
 gi|383116930|ref|ZP_09937677.1| hypothetical protein BSHG_0978 [Bacteroides sp. 3_2_5]
 gi|60491603|emb|CAH06355.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis NCTC
           9343]
 gi|251947777|gb|EES88059.1| hypothetical protein BSHG_0978 [Bacteroides sp. 3_2_5]
 gi|301161693|emb|CBW21233.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis 638R]
          Length = 718

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 154/480 (32%), Positives = 238/480 (49%), Gaps = 65/480 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  +Q+ LQKKIL R+ E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++ ++  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
           G L++LDL +E +P W  S      K  YG   +I+CML N+ GN+ ++G +D++   F 
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
             +A    + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++   + W+ +Y   RYG   
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463

Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                 SS+     +Y   EVI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
              +  AY+  D       S +FL+L+   D LL     F +G W+E A+ L    E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619


>gi|423269418|ref|ZP_17248390.1| hypothetical protein HMPREF1079_01472 [Bacteroides fragilis
           CL05T00C42]
 gi|423273021|ref|ZP_17251968.1| hypothetical protein HMPREF1080_00621 [Bacteroides fragilis
           CL05T12C13]
 gi|392701212|gb|EIY94372.1| hypothetical protein HMPREF1079_01472 [Bacteroides fragilis
           CL05T00C42]
 gi|392708585|gb|EIZ01692.1| hypothetical protein HMPREF1080_00621 [Bacteroides fragilis
           CL05T12C13]
          Length = 718

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 154/480 (32%), Positives = 238/480 (49%), Gaps = 65/480 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  +Q+ LQKKIL R+ E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++ ++  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
           G L++LDL +E +P W  S      K  YG   +I+CML N+ GN+ ++G +D++   F 
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
             +A    + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++   + W+ +Y   RYG   
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463

Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                 SS+     +Y   EVI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
              +  AY+  D       S +FL+L+   D LL     F +G W+E A+ L    E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619


>gi|265765312|ref|ZP_06093587.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_16]
 gi|263254696|gb|EEZ26130.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_16]
          Length = 718

 Score =  230 bits (586), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 154/480 (32%), Positives = 238/480 (49%), Gaps = 65/480 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  +Q+ LQKKIL R+ E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++ ++  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
           G L++LDL +E +P W  S      K  YG   +I+CML N+ GN+ ++G +D++   F 
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
             +A    + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++   + W+ +Y   RYG   
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463

Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                 SS+     +Y   EVI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
              +  AY+  D       S +FL+L+   D LL     F +G W+E A+ L    E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619


>gi|423241433|ref|ZP_17222546.1| hypothetical protein HMPREF1065_03169 [Bacteroides dorei
           CL03T12C01]
 gi|392641326|gb|EIY35103.1| hypothetical protein HMPREF1065_03169 [Bacteroides dorei
           CL03T12C01]
          Length = 754

 Score =  229 bits (585), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 152/478 (31%), Positives = 238/478 (49%), Gaps = 48/478 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
           M+NL GWGGP P SW +QQ  LQKKIL R+ E GM+PVLP +SG +P+ L       S K
Sbjct: 192 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 251

Query: 59  ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
             +  +  S +S       +       +L   DP F +I   F E+  K YG TS  Y+ 
Sbjct: 252 EKKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTQIANLFYEETEKLYG-TSDYYSI 310

Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
           D F E        ++    G AI   M+  +  AVW++QGW  + +P  RP  MKAL   
Sbjct: 311 DPFHEAKSLPAGLDF-GKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 362

Query: 172 VPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
              G L++LDLF+E +P      IW   K +    +++C+L NF GN+ ++G +D +   
Sbjct: 363 -NPGDLLILDLFSECRPMWGIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGRMDQLLHN 421

Query: 226 PVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS 284
               + +     + G+G++MEGIE NPV+++LM E+ ++ EK   + WI QY   RYG  
Sbjct: 422 FYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYIRARYGTD 481

Query: 285 VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
             +I+ AW +L + +YNC  G   +             SI           G+P    ++
Sbjct: 482 DESIRQAWQILANGIYNCPAGNNQQG---------PHESIFC---------GRP----SL 519

Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
              + SS+     +Y  +    A  L ++  ++   +N + YDL+D+TRQA+A  A  ++
Sbjct: 520 NNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIADRARIVY 579

Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
              +  ++  D       +R+FLEL+   D LL     F +G W++ A+ L    E++
Sbjct: 580 NYAVADFKSFDKKNYATHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGITSEEK 637


>gi|265753065|ref|ZP_06088634.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|263236251|gb|EEZ21746.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
          Length = 750

 Score =  229 bits (585), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 153/478 (32%), Positives = 237/478 (49%), Gaps = 48/478 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
           M+NL GWGGP P SW +QQ  LQKKIL R+ E GM+PVLP +SG +P+ L       S K
Sbjct: 188 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 247

Query: 59  ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
             +  +  S +S       +       +L   DP F  I   F E+  K YG TS  Y+ 
Sbjct: 248 EEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 306

Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
           D F E    + +       G AI   M+  +  AVW++QGW  + +P  RP  MKAL   
Sbjct: 307 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 358

Query: 172 VPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
              G L++LDLF+E +P      IW   K +    +++C+L NF GN+ ++G +D +   
Sbjct: 359 -NPGDLLILDLFSECRPMWGIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGRMDQLLHN 417

Query: 226 PVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS 284
               + +     + G+G++MEGIE NPV+++LM E+ ++ EK   + WI QY   RYG  
Sbjct: 418 FYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYIRARYGTD 477

Query: 285 VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
             +IQ AW +L + +YNC  G   +             SI           G+P    ++
Sbjct: 478 DESIQQAWQILTNGIYNCPAGNNQQG---------PHESIFC---------GRP----SL 515

Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
              + SS+     +Y  +    A  L ++  ++   +N + YDL+D+TRQA+A  A  ++
Sbjct: 516 NNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIADRARIVY 575

Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
              +  ++  D       +R+FLEL+   D LL     F +G W++ A+ L    E++
Sbjct: 576 NYAVADFKSFDKKNYNTHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGITPEEK 633


>gi|212695333|ref|ZP_03303461.1| hypothetical protein BACDOR_04880 [Bacteroides dorei DSM 17855]
 gi|212662112|gb|EEB22686.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides dorei DSM 17855]
          Length = 754

 Score =  229 bits (585), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 153/478 (32%), Positives = 236/478 (49%), Gaps = 48/478 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
           M+NL GWGGP P SW +QQ  LQKKIL R+ E GM+PVLP +SG +P+ L       S K
Sbjct: 192 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 251

Query: 59  ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
             +     S +S       +       +L   DP F  I   F E+  K YG TS  Y+ 
Sbjct: 252 EEKTAGDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 310

Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
           D F E    + +       G AI   M+  +  AVW++QGW  + +P  RP  MKAL   
Sbjct: 311 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 362

Query: 172 VPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
              G L++LDLF+E +P      IW   K +    +++C+L NF GN+ ++G +D +   
Sbjct: 363 -NPGDLLILDLFSECRPMWGIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGRMDQLLHN 421

Query: 226 PVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS 284
               + +     + G+G++MEGIE NPV+++LM E+ ++ EK   + WI QY   RYG  
Sbjct: 422 FYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYIRARYGTD 481

Query: 285 VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
             +IQ AW +L + +YNC  G   +             SI           G+P    ++
Sbjct: 482 DESIQQAWQILTNGIYNCPAGNNQQG---------PHESIFC---------GRP----SL 519

Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
              + SS+     +Y  +    A  L ++  ++   +N + YDL+D+TRQA+A  A  ++
Sbjct: 520 NNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIADRARIVY 579

Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
              +  ++  D       +R+FLEL+   D LL     F +G W++ A+ L    E++
Sbjct: 580 NYAVADFKSFDKKNYNTHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGITPEEK 637


>gi|325299497|ref|YP_004259414.1| alpha-N-acetylglucosaminidase [Bacteroides salanitronis DSM 18170]
 gi|324319050|gb|ADY36941.1| Alpha-N-acetylglucosaminidase [Bacteroides salanitronis DSM 18170]
          Length = 723

 Score =  229 bits (585), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 151/474 (31%), Positives = 236/474 (49%), Gaps = 57/474 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
           M+NL GWGGP P SW  QQ  LQKKIL R+ E G+ PVLP +SG VP  A Q +  +   
Sbjct: 190 MNNLEGWGGPNPDSWYTQQEALQKKILKRMREYGIEPVLPGYSGMVPHDAHQKLGLNVTE 249

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
            +L N F+  +         L  TD  F EI   + E+Q K +G+ ++ Y+ D F E  N
Sbjct: 250 PELWNGFTRPA--------FLMPTDKRFAEIAALYYEEQEKLFGKANY-YSMDPFHELEN 300

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
              VD      + G A+   M+  +  AVW++QGW  + +P  RP  MK L N    G L
Sbjct: 301 AGEVD----FDAAGKAVMDAMKQVNPKAVWVVQGW--TENP--RPEMMKNLKN----GDL 348

Query: 178 VVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
           ++LDLF+E +P      IW   K +    +++CML NF  N+ ++G +D +       + 
Sbjct: 349 LILDLFSECRPMWGIPSIWKREKGYEQHDWLFCMLENFGANVGLHGRMDQLLNNFYLTKN 408

Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
           +     + G+G++MEG E NPV+++LM E+ ++ EK+  ++W+ +Y   RYG     I+ 
Sbjct: 409 NPLAAHLKGIGLTMEGSENNPVMFELMCELPWRPEKITKESWLKEYLAARYGAKDEKIEQ 468

Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
           AW +L   +YNC  G   +  +  +    P ++   +S +  K +NY  P S EA     
Sbjct: 469 AWMILADGIYNCPFGNNQQGPHESIFCGRPSMNNFQVS-SWSKMENYYDPTSTEA----- 522

Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
                             A  L + + ++   +N + YDL+D+ RQALA     ++   I
Sbjct: 523 ------------------AARLMLEAADKFRGNNNFEYDLVDIVRQALADRGRIVYNRAI 564

Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
             ++  D     + S+ FL L+   D LLA    F +G W+  A+ L    E++
Sbjct: 565 ADFKSFDKRSYARHSKEFLNLLLAQDRLLATRSEFRVGRWINQARSLGNTPEEK 618


>gi|393784337|ref|ZP_10372502.1| hypothetical protein HMPREF1071_03370 [Bacteroides salyersiae
           CL02T12C01]
 gi|392666113|gb|EIY59630.1| hypothetical protein HMPREF1071_03370 [Bacteroides salyersiae
           CL02T12C01]
          Length = 728

 Score =  229 bits (584), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 140/473 (29%), Positives = 228/473 (48%), Gaps = 50/473 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL G+GGP+   ++ +Q  LQ+K+L R+ ELGM PV   F G VP  L+  +P A+I 
Sbjct: 193 MGNLEGFGGPVSPEFIARQTDLQQKMLKRMRELGMKPVFQGFYGMVPNVLKKKYPDARIK 252

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           + G W + +          LD TDPLF  +   + E+Q K +G  +  +  D F E    
Sbjct: 253 EQGTWQTYQRPA------FLDPTDPLFDRVAAIYYEEQKKLFG-DAEFFGGDPFHEGG-- 303

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
                ++      I   M+  +  AVW++QGW   ++P      +K L++ +  G+ ++L
Sbjct: 304 TSEGIHVKLAAQKILQAMRKVNPKAVWVLQGW--QHNP------VKDLMDGLNPGETIIL 355

Query: 181 DLFAEVKPIWS--TSKQFYGVP------YIWCMLHNFAGNIEMYGILDSIAFGPVEARTS 232
           DL A  +P W   T+  F+         +IWC L NF G   ++G + S A G V A+  
Sbjct: 356 DLMACERPQWGGVTTSMFHKPEGHQDHRWIWCALPNFGGKTGLHGKMSSYASGAVFAKEH 415

Query: 233 E-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDA 291
                + G+G + EGI   PVVYD++ +MA++ + + +  W+  Y+  RYG        A
Sbjct: 416 PMGRNICGIGTAPEGIGTVPVVYDMVYDMAWRTDSIQIPQWLTNYTYYRYGMEDTNCDKA 475

Query: 292 WNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSS 351
           W +L  TVY C +         I A P                        A      S+
Sbjct: 476 WKILSETVYECHNELGGPVESYICARP------------------------ADTIDHVST 511

Query: 352 YDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 411
           + +  ++Y   +++ A E    S N  +  +TY YDL+D+TRQ L+ YA  L   ++EA+
Sbjct: 512 WGNARIFYEPVKMVEAWEFLYQSRNRFNHCDTYEYDLVDVTRQVLSDYAKYLHKEMVEAF 571

Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
              + +G  + S  FL++++D D LL+    F+LG WL  A+      E++++
Sbjct: 572 HQKNENGFMKYSTEFLDVIKDEDRLLSTRKEFMLGTWLTEAENAGCTPEEKRR 624


>gi|345513909|ref|ZP_08793424.1| alpha-N-acetylglucosaminidase [Bacteroides dorei 5_1_36/D4]
 gi|345456132|gb|EEO45798.2| alpha-N-acetylglucosaminidase [Bacteroides dorei 5_1_36/D4]
          Length = 754

 Score =  229 bits (583), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 152/478 (31%), Positives = 237/478 (49%), Gaps = 48/478 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
           M+NL GWGGP P SW +QQ  LQKKIL R+ E GM+PVLP +SG +P+ L       S K
Sbjct: 192 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 251

Query: 59  ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
             +  +  S +S       +       +L   DP F  I   F E+  K YG TS  Y+ 
Sbjct: 252 EEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 310

Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
           D F E    + +       G AI   M+  +  AVW++QGW  + +P  RP  MKAL   
Sbjct: 311 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 362

Query: 172 VPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
              G L++LDLF+E +P      IW   K +    +++C+L NF GN+ ++G +D +   
Sbjct: 363 -NPGDLLILDLFSECRPMWGIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGRMDQLLHN 421

Query: 226 PVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS 284
               + +     + G+G++MEGIE NPV+++LM E+ ++ EK   + WI QY   RYG  
Sbjct: 422 FYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYIRARYGTD 481

Query: 285 VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
             +I+ AW +L + +YNC  G   +             SI           G+P    ++
Sbjct: 482 DESIRQAWQILANGIYNCPAGNNQQG---------PHESIFC---------GRP----SL 519

Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
              + SS+     +Y  +    A  L ++  ++   +N + YDL+D+TRQA+A  A  ++
Sbjct: 520 NNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIADRARIVY 579

Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
              +  ++  D       +R+FLEL+   D LL     F +G W++ A+ L    E++
Sbjct: 580 NYAVADFKSFDKKNYATHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGITSEEK 637


>gi|423230938|ref|ZP_17217342.1| hypothetical protein HMPREF1063_03162 [Bacteroides dorei
           CL02T00C15]
 gi|423244649|ref|ZP_17225724.1| hypothetical protein HMPREF1064_01930 [Bacteroides dorei
           CL02T12C06]
 gi|392630058|gb|EIY24060.1| hypothetical protein HMPREF1063_03162 [Bacteroides dorei
           CL02T00C15]
 gi|392641498|gb|EIY35274.1| hypothetical protein HMPREF1064_01930 [Bacteroides dorei
           CL02T12C06]
          Length = 754

 Score =  229 bits (583), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 152/478 (31%), Positives = 237/478 (49%), Gaps = 48/478 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
           M+NL GWGGP P SW +QQ  LQKKIL R+ E GM+PVLP +SG +P+ L       S K
Sbjct: 192 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 251

Query: 59  ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
             +  +  S +S       +       +L   DP F  I   F E+  K YG TS  Y+ 
Sbjct: 252 EEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 310

Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
           D F E    + +       G AI   M+  +  AVW++QGW  + +P  RP  MKAL   
Sbjct: 311 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 362

Query: 172 VPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
              G L++LDLF+E +P      IW   K +    +++C+L NF GN+ ++G +D +   
Sbjct: 363 -NPGDLLILDLFSECRPMWGIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGRMDQLLHN 421

Query: 226 PVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS 284
               + +     + G+G++MEGIE NPV+++LM E+ ++ EK   + WI QY   RYG  
Sbjct: 422 FYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYIRARYGTD 481

Query: 285 VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
             +I+ AW +L + +YNC  G   +             SI           G+P    ++
Sbjct: 482 DESIRQAWQILANGIYNCPAGNNQQG---------PHESIFC---------GRP----SL 519

Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
              + SS+     +Y  +    A  L ++  ++   +N + YDL+D+TRQA+A  A  ++
Sbjct: 520 NNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIADRARIVY 579

Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
              +  ++  D       +R+FLEL+   D LL     F +G W++ A+ L    E++
Sbjct: 580 NYAVADFKSFDKKNYATHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGITSEEK 637


>gi|237711645|ref|ZP_04542126.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 9_1_42FAA]
 gi|229454340|gb|EEO60061.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 9_1_42FAA]
          Length = 732

 Score =  229 bits (583), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 152/478 (31%), Positives = 237/478 (49%), Gaps = 48/478 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
           M+NL GWGGP P SW +QQ  LQKKIL R+ E GM+PVLP +SG +P+ L       S K
Sbjct: 170 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 229

Query: 59  ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
             +  +  S +S       +       +L   DP F  I   F E+  K YG TS  Y+ 
Sbjct: 230 EEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 288

Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
           D F E    + +       G AI   M+  +  AVW++QGW  + +P  RP  MKAL   
Sbjct: 289 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 340

Query: 172 VPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
              G L++LDLF+E +P      IW   K +    +++C+L NF GN+ ++G +D +   
Sbjct: 341 -NPGDLLILDLFSECRPMWGIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGRMDQLLHN 399

Query: 226 PVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS 284
               + +     + G+G++MEGIE NPV+++LM E+ ++ EK   + WI QY   RYG  
Sbjct: 400 FYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYIRARYGTD 459

Query: 285 VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
             +I+ AW +L + +YNC  G   +             SI           G+P    ++
Sbjct: 460 DESIRQAWQILANGIYNCPAGNNQQG---------PHESIFC---------GRP----SL 497

Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
              + SS+     +Y  +    A  L ++  ++   +N + YDL+D+TRQA+A  A  ++
Sbjct: 498 NNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIADRARIVY 557

Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
              +  ++  D       +R+FLEL+   D LL     F +G W++ A+ L    E++
Sbjct: 558 NYAVADFKSFDKKNYATHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGITSEEK 615


>gi|336412606|ref|ZP_08592959.1| hypothetical protein HMPREF1017_00067 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942652|gb|EGN04494.1| hypothetical protein HMPREF1017_00067 [Bacteroides ovatus
           3_8_47FAA]
          Length = 727

 Score =  228 bits (582), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 144/459 (31%), Positives = 224/459 (48%), Gaps = 42/459 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C +L +  D LF +I + F+++Q K +G T HIY  D F+E  PP
Sbjct: 255 HLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PEY+  + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD   E   +W  ++ F+  PYIWC L NF GN  + G +         A  +    + G
Sbjct: 371 LDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG++     Y+ + E A+ +  VD   WI   + R  G    +++DAW  L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 489

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y              V  P         T G    Y +P   +   K  ++ Y +  L  
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524

Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
              EV R L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D  
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDHQ 575

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
            +   + +  E++ D+D L A H    L  W++ A+++ 
Sbjct: 576 ALKACAEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMG 614


>gi|322702923|gb|EFY94542.1| alpha-N-acetylglucosaminidase, putative [Metarhizium anisopliae
           ARSEF 23]
          Length = 589

 Score =  228 bits (582), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 150/484 (30%), Positives = 242/484 (50%), Gaps = 53/484 (10%)

Query: 1   MSNLHG-WGG--PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSA 57
             N  G WGG   L   W+D Q  LQKKI+ R+ ELG+ PVLPAF G VP A   V P A
Sbjct: 21  FGNTQGSWGGVGNLSSGWIDAQFELQKKIVARMVELGITPVLPAFPGFVPPAFSRVQPDA 80

Query: 58  KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
             T+   W  +  D     T+ L   D  +  + +AFI +Q++ +G  ++IY  D F+E 
Sbjct: 81  NTTKAPRWTGLP-DTNTRDTF-LSPLDTSYARLQQAFISKQIEAFGNVTNIYTLDQFNEM 138

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG-- 175
            P  + P Y+S +    Y  + + +  AVWL+QGWLF     W   ++ A L   P G  
Sbjct: 139 PPTSNEPSYLSQVSTYTYKALTAANPAAVWLLQGWLFLNSGLWTEERVTAYLGG-PEGHN 197

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSEN 234
            ++VLDL++E +P W  +K ++G P+IWC LH+F GN+ MYG +  I    ++A RTS  
Sbjct: 198 SMLVLDLYSESRPQWQRTKGYFGRPWIWCQLHDFGGNMGMYGQISDITVQSMDALRTSP- 256

Query: 235 TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPAIQDAW 292
            ++ G GM+ EG E N VVY ++ + A+    +D   +   Y VRRY       ++  AW
Sbjct: 257 -SLSGFGMTPEGYEGNEVVYQMLFDQAWTTTPIDTSGYFYGYVVRRYAGVSQTNSLFQAW 315

Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 352
           ++L   +Y+      +K+R V             V  G YQN        + L + T ++
Sbjct: 316 DILRQNIYD------NKDRQVPC-----------VGVGIYQN----APSLSGLVNRTGNW 354

Query: 353 DHP-HLWYSTSEVIRALELFIASGNELSA---SNTYRYDLIDLTRQALAKYANELFLNII 408
             P  ++Y  + + +A  L I + NE+       T++ D++D+TRQ ++   N ++ + +
Sbjct: 355 PPPTKVYYDPATLKKAHSLLIQAANEIPQLWDIPTFQLDVVDVTRQVMSNAFNTMYTDYV 414

Query: 409 EAYQLNDAHGVFQLSRR---------------FLELVEDMDGLLACHDGFLLGPWLESAK 453
           + +    +     +S R                L+ + D+D +LA +  F L  WL++A+
Sbjct: 415 QTFNSQLSRQKSHISNRGGLQRRDDFATKGKQLLDFLTDLDRVLATNQHFRLDSWLDAAQ 474

Query: 454 QLAQ 457
             A+
Sbjct: 475 YWAK 478


>gi|320106778|ref|YP_004182368.1| alpha-N-acetylglucosaminidase [Terriglobus saanensis SP1PR4]
 gi|319925299|gb|ADV82374.1| Alpha-N-acetylglucosaminidase [Terriglobus saanensis SP1PR4]
          Length = 754

 Score =  228 bits (582), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 154/478 (32%), Positives = 231/478 (48%), Gaps = 53/478 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++   GPLP+ +++Q+ VLQ+KIL R+  LGM PV PAFSG VP   + + P A+  
Sbjct: 220 MGNVNNIDGPLPEHFIEQKRVLQRKILDRMRSLGMRPVAPAFSGFVPQGFKRLHPKAETF 279

Query: 61  QLGNWF--SVKSDPRWCCTYLLDATDP-LFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
            L  W     K+ PR   T++L   +  L+  IG+ FIE+   EYG   + Y  DTF+E 
Sbjct: 280 TL-LWLPEEFKTIPRSTRTFILHPGEQDLYRLIGKKFIEEYKAEYGEVQY-YLADTFNEL 337

Query: 118 TPPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
             PV      E +   G  +Y G+ +GD +  W+MQGWLF YD  FW    + ALL  +P
Sbjct: 338 AVPVREEHRFEDLERFGRTVYEGILAGDPNGTWVMQGWLFVYDVAFWNSESVAALLRGIP 397

Query: 174 LGKLVVLDLFAEVKPI---------WSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
             +++++D   ++ P          W T K F+G  +I  M H F GN  + G L  +A 
Sbjct: 398 NDRMLIIDYANDLAPAVKGKYAPGQWKTQKAFFGKQWINGMAHTFGGNNNVKGNLKLMAS 457

Query: 225 GPVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
            P    TS E   +VG GM  EGIE N VVY+LM++  +Q E +D+K WI  Y   RYG 
Sbjct: 458 EPASVLTSPERGNLVGWGMCPEGIETNEVVYELMTDAGWQREAIDLKQWIPAYCRSRYGA 517

Query: 284 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
             P + +AW +L  + Y+     T +        P + P+  SV  G       P  +  
Sbjct: 518 CPPVMLEAWTLLMQSAYSAHIWMTHQAWQTE---PSLAPAAASVDAG-------PTFR-- 565

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                                 RA+ LF++   EL     YR DLI+L  QA     ++ 
Sbjct: 566 ----------------------RAVALFLSCAPELGQKELYRNDLIELVVQAAGGSVDQT 603

Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
           F   ++A Q +      + +   L  +  MD LL       L  W+++A+  A+++++
Sbjct: 604 FSLAVQAGQSHQNEVATEYAAHALGWMGRMDALLNLRPDRRLETWMQAARSYAKSDDE 661


>gi|380694112|ref|ZP_09858971.1| alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
          Length = 736

 Score =  228 bits (581), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 147/484 (30%), Positives = 236/484 (48%), Gaps = 73/484 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ  LQKKI+ R+ ELG+ PV P ++G VP  +          
Sbjct: 199 MNNLEGWGGPNPDSWYRQQEALQKKIIARMRELGIEPVFPGYAGMVPRNIGE-------- 250

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
           +LG  + +    +WC       L   D  F      + E+  K YG+  + Y+ D F E 
Sbjct: 251 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 307

Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
            NT  VD    ++  G +I   M+  + +AVW+MQ         W+    +A++N++  G
Sbjct: 308 GNTEGVD----LAKAGTSIMGAMKKANPEAVWVMQA--------WQANPREAMVNTLDSG 355

Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
            L+VLDL++E  P       +W   K F    +++CML NF GN+ ++G ++ +  G   
Sbjct: 356 DLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMEQLVNGYYN 415

Query: 229 ARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
           A    N  T+ GVG + EGIE NP++++L+ E+ ++ E+     W+  Y   RYG  + P
Sbjct: 416 ACAHINGKTLRGVGATPEGIENNPMMFELLYELPWREERFSPDIWLQGYLKARYGDDLSP 475

Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
            + +AW  L HTVYN                               +NY    + E++L 
Sbjct: 476 EVTEAWRALEHTVYNAP-----------------------------KNYQGEGTVESLLC 506

Query: 347 S-------ETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 399
           +        TS++ +  L+YS     +A +L ++  +    +N + YDL+D+ RQ+LA  
Sbjct: 507 ARPGFHLDRTSTWGYAKLFYSPDSTAKAAQLLLSVADRYKGNNNFEYDLVDIVRQSLADK 566

Query: 400 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
           AN L   I ++Y   D     + +++FL L+   D LL+    F +  WL +A+ L   E
Sbjct: 567 ANVLLEEISQSYDRKDKDSFRKQTQQFLGLILSQDSLLSTRKEFSVSSWLSAARSLGTTE 626

Query: 460 EQEK 463
           E++K
Sbjct: 627 EEKK 630


>gi|336408181|ref|ZP_08588675.1| hypothetical protein HMPREF1018_00690 [Bacteroides sp. 2_1_56FAA]
 gi|335939481|gb|EGN01355.1| hypothetical protein HMPREF1018_00690 [Bacteroides sp. 2_1_56FAA]
          Length = 718

 Score =  228 bits (581), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 153/480 (31%), Positives = 237/480 (49%), Gaps = 65/480 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  +Q+ LQKKIL R+ E G+ P+LP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPMLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  +DP F EI   + ++  K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             NT  VD    + + G A+   M+  +  AVW+ Q W  +  P         ++  +  
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIEDLKA 343

Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
           G L++LDL +E +P W  S      K  YG   +I+CML N+ GN+ ++G +D++   F 
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403

Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
             +A    + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++   + W+ +Y   RYG   
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463

Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
           P +Q AW  L +++YN     T +  +  V  A P          E  YQ          
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                 SS+     +Y   EVI A  L ++  +    +N + YDL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559

Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
              +  AY+  D       S +FL+L+   D LL     F +G W+E A+ L    E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619


>gi|295085509|emb|CBK67032.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
           XB1A]
          Length = 716

 Score =  228 bits (581), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 144/459 (31%), Positives = 223/459 (48%), Gaps = 42/459 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 184 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 243

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C +L +  D LF +I + F+++Q K +G T HIY  D F+E  PP
Sbjct: 244 HLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 299

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PEY+  + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++
Sbjct: 300 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 359

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD   E   +W  ++ F+  PYIWC L NF GN  + G +         A  +    + G
Sbjct: 360 LDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 419

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG++     Y+ + E A+ +  VD   WI   + R  G    +++DAW  L++ +
Sbjct: 420 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 478

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y              V  P         T G    Y +P   +   K  ++ Y +  L  
Sbjct: 479 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 513

Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
              EV R L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D  
Sbjct: 514 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDHQ 564

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
            +     +  E++ D+D L A H    L  W++ A+++ 
Sbjct: 565 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMG 603


>gi|237721435|ref|ZP_04551916.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
 gi|293370838|ref|ZP_06617383.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
           3f]
 gi|229449231|gb|EEO55022.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
 gi|292634054|gb|EFF52598.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
           3f]
          Length = 711

 Score =  228 bits (581), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 145/477 (30%), Positives = 239/477 (50%), Gaps = 59/477 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ  LQKKI+ R+ ELG+ PV P ++G VP  +          
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVARMRELGIEPVFPGYAGMVPRNIGE-------- 247

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
           +LG  + +    +WC       L   D  F      + E+  K YG+ ++ Y+ D F E 
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKANY-YSMDPFHEG 304

Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
            NT  VD    ++  GA+I + M+  + +AVW++Q         W+    + ++ S+  G
Sbjct: 305 GNTEGVD----LAKTGASIMAAMKKANPEAVWIIQA--------WQANPREEMIASLNQG 352

Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
            L+VLDL++E +P       +W   K F    +++CML NF GN+ ++G ++ +  G  +
Sbjct: 353 DLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412

Query: 229 ARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
           A    N  M+ GVG + EGIE NPV+++L+ E+ ++ E+     W+  Y   RYGR V P
Sbjct: 413 ACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWREERFSSDEWLQTYLKARYGREVSP 472

Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
            I +AW  L +TVYN      D   +  +       S++    G + +            
Sbjct: 473 EIMEAWRALEYTVYNA---PKDYQGEGTI------ESLLCARPGFHLD------------ 511

Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
             TS++ +  L+Y+     +A  LF +  ++   +N + YDL+D+ RQ+ A   N L   
Sbjct: 512 -RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNNFEYDLVDIVRQSNADKGNVLLEE 570

Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
           I ++Y   D     + +++FL+L+   D LL+    F +  WL +A+ L   EE+++
Sbjct: 571 ISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEFSVSSWLNAARSLGTTEEEKR 627


>gi|298480128|ref|ZP_06998327.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
 gi|336404356|ref|ZP_08585054.1| hypothetical protein HMPREF0127_02367 [Bacteroides sp. 1_1_30]
 gi|298273937|gb|EFI15499.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
 gi|335943684|gb|EGN05523.1| hypothetical protein HMPREF0127_02367 [Bacteroides sp. 1_1_30]
          Length = 727

 Score =  228 bits (581), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 144/459 (31%), Positives = 223/459 (48%), Gaps = 42/459 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C +L +  D LF +I + F+++Q K +G T HIY  D F+E  PP
Sbjct: 255 HLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PEY+  + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD   E   +W  ++ F+  PYIWC L NF GN  + G +         A  +    + G
Sbjct: 371 LDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG++     Y+ + E A+ +  VD   WI   + R  G    +++DAW  L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 489

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y              V  P         T G    Y +P   +   K  ++ Y +  L  
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524

Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
              EV R L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D  
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDHQ 575

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
            +     +  E++ D+D L A H    L  W++ A+++ 
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMG 614


>gi|340347658|ref|ZP_08670763.1| alpha-N-acetylglucosaminidase [Prevotella dentalis DSM 3688]
 gi|433652542|ref|YP_007296396.1| Alpha-N-acetylglucosaminidase (NAGLU) [Prevotella dentalis DSM
           3688]
 gi|339608852|gb|EGQ13735.1| alpha-N-acetylglucosaminidase [Prevotella dentalis DSM 3688]
 gi|433303075|gb|AGB28890.1| Alpha-N-acetylglucosaminidase (NAGLU) [Prevotella dentalis DSM
           3688]
          Length = 781

 Score =  228 bits (581), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 155/478 (32%), Positives = 227/478 (47%), Gaps = 64/478 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGPLP SW  QQ  LQK+IL R  ELGM PVLP + G +P   +       +T
Sbjct: 207 MNNLEGWGGPLPDSWYRQQEALQKRILQRERELGMEPVLPGYCGMMPHDAKQKL-GLDVT 265

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W              L ATDP F EI   +  +Q + YG+ SH Y+ D F E +  
Sbjct: 266 PGGTWNGYVRPAN------LSATDPRFDEIADLYYREQTRLYGK-SHYYSMDPFHETSDD 318

Query: 121 VDSPEYI--SSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
           V    YI  +  G  + + M+  +  A W++QGW  +  P        A+ + +P G L 
Sbjct: 319 V----YIDYAQAGRKLMAAMKRENPKANWVIQGWTENPRP--------AMTDGLPAGSLT 366

Query: 179 VLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDS------IAFGP 226
           VLDLF+E +P      IW  ++ +    +++CML NF GN+ ++G +D       +A  P
Sbjct: 367 VLDLFSECRPMFGAPSIWKRAEGYGQHDWLFCMLENFGGNVGLHGRMDQLIGNFRLATSP 426

Query: 227 VEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA--------WINQYSV 278
                     + G+G +MEG E NP++++LMSE+ ++ ++V   A        W+  Y  
Sbjct: 427 QSPLQQARRHLRGIGFTMEGSENNPIMFELMSELPWRTDEVAQAADARTFRTEWVRGYVK 486

Query: 279 RRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKP 338
            RYG   P  Q AW +L  T+YNC  G   +             SI           G+P
Sbjct: 487 ARYGTDDPHAQQAWQLLAETIYNCPAGNNQQG---------PHESIFD---------GRP 528

Query: 339 VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAK 398
                 +KS +   ++    Y  S  + A  L  A+ + L  +N Y YDL+D+ RQA+  
Sbjct: 529 SLNNFQVKSWSKMRNY----YEPSATLEAARLMAAAADRLKGNNNYEYDLVDIVRQAIDD 584

Query: 399 YANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
            A +++L+ I  Y   D     + S RFL L+   D LL     F LG W E+A+ L 
Sbjct: 585 QARQVYLHAIADYNGFDRRAFSRDSARFLGLLLMQDRLLGTRREFRLGRWTEAARSLG 642


>gi|393788556|ref|ZP_10376683.1| hypothetical protein HMPREF1068_02963 [Bacteroides nordii
           CL02T12C05]
 gi|392654236|gb|EIY47884.1| hypothetical protein HMPREF1068_02963 [Bacteroides nordii
           CL02T12C05]
          Length = 732

 Score =  227 bits (579), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 148/475 (31%), Positives = 231/475 (48%), Gaps = 54/475 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL G+GGP+   ++ +Q  LQ+K+L R+ ELGM PV   F G VP AL+  FP A+I 
Sbjct: 196 MGNLEGFGGPVTPEFIARQTDLQQKMLKRMRELGMKPVFQGFYGMVPNALKEKFPDARIK 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W + +          LD TDPLF ++   + E+Q   +G  +  +  D F E    
Sbjct: 256 DQGIWGTYQRPA------FLDPTDPLFDKLAAIYYEEQKNLFGE-AQFFGGDPFHEG--- 305

Query: 121 VDSPEYISSLGAA--IYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             + E I+   AA  I   M+  +  AVW++QGW   ++P      +K L+  V  G+ +
Sbjct: 306 -GTSEGINVKLAAQKILQAMRKVNPQAVWVLQGW--QHNP------VKELMEGVKPGETI 356

Query: 179 VLDLFAEVKPIWSTSKQ--------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
           +LDL A  +P W   K          +   +IWC L NF G   ++G + S A GPV A+
Sbjct: 357 ILDLMACERPQWGGVKTSMFHKPEGHWNHQWIWCALPNFGGKTGLHGKMSSYASGPVFAK 416

Query: 231 TSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQ 289
                  + G+G + EGI   PVVYD++ +MA++ + + +  W++ Y+  RYG       
Sbjct: 417 HHPMGKNICGIGTAPEGIGTIPVVYDMVYDMAWRTDSIHIPQWLDNYTYYRYGTEDNNCN 476

Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 349
            AW +L  T+Y C +         I A P       S T      +G  V          
Sbjct: 477 RAWKLLSETIYECHNELGGPVESYICARP-------SDTIQHVSTWGNAV---------- 519

Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
                  ++Y   +V++A +L   S    + S+TY YDL D+TRQ L+ YA  L   ++ 
Sbjct: 520 -------MFYDPMKVVKAWDLLYQSRKRFNHSDTYEYDLTDVTRQVLSDYAKYLHERMVL 572

Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
           A+Q  D     + S +FL +++D D LL+    F+LG WL  A++     E++++
Sbjct: 573 AFQKKDKERFMEYSGKFLNIIKDEDRLLSTRKEFMLGTWLAEAEKAGGTPEEKRR 627


>gi|423293377|ref|ZP_17271504.1| hypothetical protein HMPREF1070_00169 [Bacteroides ovatus
           CL03T12C18]
 gi|392678320|gb|EIY71728.1| hypothetical protein HMPREF1070_00169 [Bacteroides ovatus
           CL03T12C18]
          Length = 727

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 142/461 (30%), Positives = 223/461 (48%), Gaps = 42/461 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C + L+  D LF +I + F+++Q K +G T HIY  D F+E  PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PEY+  + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD   E   +W  ++ F+  PYIWC L NF GN  + G +         A  +    + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG++     Y+ + E A+ +  VD   WI   + R  G     ++DAW  L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQPVRDAWKRLFNDI 489

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                      + +  T G    Y +P   +   K  ++ Y +  L  
Sbjct: 490 Y----------------------AQVPRTLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524

Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
              EV R L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D  
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQ 575

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
            +     +  E++ D+D L A H    L  W++ A+++  +
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDS 616


>gi|262406058|ref|ZP_06082608.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
 gi|294806855|ref|ZP_06765680.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
           CC 1b]
 gi|345510563|ref|ZP_08790130.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
 gi|262356933|gb|EEZ06023.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
 gi|294445884|gb|EFG14526.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
           CC 1b]
 gi|345454460|gb|EEO49066.2| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
          Length = 727

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 144/459 (31%), Positives = 222/459 (48%), Gaps = 42/459 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C +L +  D LF +I + F+++Q K +G T HIY  D F+E  PP
Sbjct: 255 HLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PEY+  + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD   E   +W  ++ F+  PYIWC L NF GN  + G +         A  +    + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG++     Y+ + E A+ +  VD   WI   + R  G     ++DAW  L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQPVRDAWKRLFNDI 489

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y              V  P         T G    Y +P   +   K  ++ Y +  L  
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524

Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
              EV R L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D  
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDHQ 575

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
            +     +  E++ D+D L A H    L  W++ A+++ 
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMG 614


>gi|294777713|ref|ZP_06743164.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
 gi|294448781|gb|EFG17330.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
          Length = 752

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 153/493 (31%), Positives = 235/493 (47%), Gaps = 78/493 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA------------ 48
           M+NL GWGGP P SW  QQ  LQKKIL R+ E GM+PVLP +SG +P+            
Sbjct: 190 MNNLEGWGGPNPDSWYKQQEDLQKKILKRMKEWGMHPVLPGYSGMIPSKLDLGKRIDGGK 249

Query: 49  ---ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRT 105
               L N    +  + L  W             +L   DP F +I   F E+  K YG T
Sbjct: 250 EEKTLSNTSSESAQSTLNKWNGFDRPG------ILLPDDPKFTQIASLFYEETEKLYG-T 302

Query: 106 SHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQM 165
           S  Y+ D F E    + +       G AI   M+  +  AVW++QGW  + +P  RP  M
Sbjct: 303 SDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKANPKAVWVVQGW--TENP--RPEMM 357

Query: 166 KALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGN 212
           KAL      G L++LDLF+E +P+W       G+P IW             C+L NF GN
Sbjct: 358 KAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGN 406

Query: 213 IEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 271
           + ++G +D +       + +     + G+G++MEGIE NPV+++LM E+ ++ EK   + 
Sbjct: 407 VGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEE 466

Query: 272 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTE 329
           WI QY   RYG    +I  AW +L + +YNC  G   +  +  +    P ++ +  + + 
Sbjct: 467 WIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQQGPHESIFCGRPSLN-NFQASSW 525

Query: 330 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 389
            K  NY  P                     +T+E  R   L ++  ++   +N + YDL+
Sbjct: 526 SKMCNYYDPT--------------------TTAEAAR---LMVSVAHKYRGNNNFEYDLV 562

Query: 390 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 449
           D+TRQA+A  A  ++   +  ++  D       +R+FLEL+   D LL     F +G W+
Sbjct: 563 DITRQAIADRARIVYNYAVADFKSFDKKSYATHTRQFLELLIMQDKLLGTRKEFKVGNWI 622

Query: 450 ESAKQLAQNEEQE 462
           + A+ L    E++
Sbjct: 623 QQARNLGSTSEEK 635


>gi|345517325|ref|ZP_08796802.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 4_3_47FAA]
 gi|345457718|gb|EET14396.2| alpha-N-acetylglucosaminidase [Bacteroides sp. 4_3_47FAA]
          Length = 754

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 153/493 (31%), Positives = 235/493 (47%), Gaps = 78/493 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA------------ 48
           M+NL GWGGP P SW  QQ  LQKKIL R+ E GM+PVLP +SG +P+            
Sbjct: 192 MNNLEGWGGPNPDSWYKQQEDLQKKILKRMKEWGMHPVLPGYSGMIPSKLDLGKRIDGGK 251

Query: 49  ---ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRT 105
               L N    +  + L  W             +L   DP F +I   F E+  K YG T
Sbjct: 252 EEKTLSNTSSESAQSTLNKWNGFDRPG------ILLPDDPKFTQIASLFYEETEKLYG-T 304

Query: 106 SHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQM 165
           S  Y+ D F E    + +       G AI   M+  +  AVW++QGW  + +P  RP  M
Sbjct: 305 SDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKANPKAVWVVQGW--TENP--RPEMM 359

Query: 166 KALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGN 212
           KAL      G L++LDLF+E +P+W       G+P IW             C+L NF GN
Sbjct: 360 KAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGN 408

Query: 213 IEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 271
           + ++G +D +       + +     + G+G++MEGIE NPV+++LM E+ ++ EK   + 
Sbjct: 409 VGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEE 468

Query: 272 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTE 329
           WI QY   RYG    +I  AW +L + +YNC  G   +  +  +    P ++ +  + + 
Sbjct: 469 WIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQQGPHESIFCGRPSLN-NFQASSW 527

Query: 330 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 389
            K  NY  P                     +T+E  R   L ++  ++   +N + YDL+
Sbjct: 528 SKMCNYYDPT--------------------TTAEAAR---LMVSVAHKYRGNNNFEYDLV 564

Query: 390 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 449
           D+TRQA+A  A  ++   +  ++  D       +R+FLEL+   D LL     F +G W+
Sbjct: 565 DITRQAIADRARIVYNYAVADFKSFDKKSYATHTRQFLELLIMQDKLLGTRKEFKVGNWI 624

Query: 450 ESAKQLAQNEEQE 462
           + A+ L    E++
Sbjct: 625 QQARNLGSTSEEK 637


>gi|319640296|ref|ZP_07995021.1| hypothetical protein HMPREF9011_00618 [Bacteroides sp. 3_1_40A]
 gi|317388071|gb|EFV68925.1| hypothetical protein HMPREF9011_00618 [Bacteroides sp. 3_1_40A]
          Length = 752

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 153/493 (31%), Positives = 235/493 (47%), Gaps = 78/493 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA------------ 48
           M+NL GWGGP P SW  QQ  LQKKIL R+ E GM+PVLP +SG +P+            
Sbjct: 190 MNNLEGWGGPNPDSWYKQQEDLQKKILKRMKEWGMHPVLPGYSGMIPSKLDLGKRIDGGK 249

Query: 49  ---ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRT 105
               L N    +  + L  W             +L   DP F +I   F E+  K YG T
Sbjct: 250 EEKTLSNTSSESAQSTLNKWNGFDRPG------ILLPDDPKFTQIASLFYEETEKLYG-T 302

Query: 106 SHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQM 165
           S  Y+ D F E    + +       G AI   M+  +  AVW++QGW  + +P  RP  M
Sbjct: 303 SDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKANPKAVWVVQGW--TENP--RPEMM 357

Query: 166 KALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGN 212
           KAL      G L++LDLF+E +P+W       G+P IW             C+L NF GN
Sbjct: 358 KAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGN 406

Query: 213 IEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 271
           + ++G +D +       + +     + G+G++MEGIE NPV+++LM E+ ++ EK   + 
Sbjct: 407 VGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEE 466

Query: 272 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTE 329
           WI QY   RYG    +I  AW +L + +YNC  G   +  +  +    P ++ +  + + 
Sbjct: 467 WIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQQGPHESIFCGRPSLN-NFQASSW 525

Query: 330 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 389
            K  NY  P                     +T+E  R   L ++  ++   +N + YDL+
Sbjct: 526 SKMCNYYDPT--------------------TTAEAAR---LMVSVAHKYRGNNNFEYDLV 562

Query: 390 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 449
           D+TRQA+A  A  ++   +  ++  D       +R+FLEL+   D LL     F +G W+
Sbjct: 563 DITRQAIADRARIVYNYAVADFKSFDKKSYATHTRQFLELLIMQDKLLGTRKEFKVGNWI 622

Query: 450 ESAKQLAQNEEQE 462
           + A+ L    E++
Sbjct: 623 QQARNLGSTSEEK 635


>gi|383115203|ref|ZP_09935961.1| hypothetical protein BSGG_2915 [Bacteroides sp. D2]
 gi|313695380|gb|EFS32215.1| hypothetical protein BSGG_2915 [Bacteroides sp. D2]
          Length = 727

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 144/459 (31%), Positives = 222/459 (48%), Gaps = 42/459 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C +L +  D LF +I + F+++Q K +G T HIY  D F+E  PP
Sbjct: 255 HLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PEY+  + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD   E   +W  ++ F+  PYIWC L NF GN  + G +         A  +    + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG++     Y+ + E A+     D K WI   + R  G    +++DAW  L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAWNLNADDNK-WIECLADRHVGCVSQSVRDAWKRLFNDI 489

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y              V  P         T G    Y +P   +   K  ++ Y +  L  
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524

Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
              EV R L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D  
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFFDVKVEFDRMVEAKDYQ 575

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
            +     +  E++ D+D L A H    L  W++ A+++ 
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMG 614


>gi|299140550|ref|ZP_07033688.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
           (NAG) [Prevotella oris C735]
 gi|298577516|gb|EFI49384.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
           (NAG) [Prevotella oris C735]
          Length = 741

 Score =  226 bits (575), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 149/470 (31%), Positives = 229/470 (48%), Gaps = 49/470 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGPLP SW +QQ  LQKKIL R++E GM PVLP F G +P   +       +T
Sbjct: 186 MNNLEGWGGPLPDSWYNQQEALQKKILKRMHEYGMQPVLPGFCGMMPHDAKAKL-GLNVT 244

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W              L  TD  F +I   +  +  K YG+ ++ Y+ D F E T  
Sbjct: 245 DGGIWNGYTRPAN------LSPTDAHFDKIADLYYAELTKLYGKANY-YSMDPFHE-TND 296

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            ++ +Y S  G  +   M+  +  A W++QGW  + +P  RP  +K + N    G L+VL
Sbjct: 297 DETIDY-SKAGCKVMEAMKRVNPKATWVIQGW--TENP--RPQMIKNMKN----GDLLVL 347

Query: 181 DLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN 234
           DLF+E +P      IW   K +    +++CML NF  N+ ++G +D +       + S  
Sbjct: 348 DLFSECRPMFGIPSIWKREKGYEQHDWLFCMLENFGANVGLHGRMDQLLHNFYSTKQSSP 407

Query: 235 TT--MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAW 292
            T  + G+G +MEG E NPV+++LMSE+ ++ E    + WI  Y   RYG++ P I+ AW
Sbjct: 408 NTQHLKGIGFTMEGSENNPVMFELMSELPWRTE-CKKEDWIKGYVKARYGKTSPEIERAW 466

Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 352
            +L  T+YNC  G   +             SI           G+P      +KS +   
Sbjct: 467 QLLSETIYNCPAGNNQQG---------PHESIFC---------GRPSLNNFQVKSWSKMR 508

Query: 353 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
           ++    Y     + A +L     ++   +N + YDL+D+ RQALA      +L  I  Y 
Sbjct: 509 NY----YDPQATLEAAQLMTGIADQYKGNNNFEYDLVDICRQALADQGRLQYLKTIADYN 564

Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
                   + + RFLE++   D LL     F LG W E+A++L   ++++
Sbjct: 565 GFSRKAFAKDAHRFLEMILLQDKLLGTRTEFRLGHWTEAARKLGTTQQEK 614


>gi|423214204|ref|ZP_17200732.1| hypothetical protein HMPREF1074_02264 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392693149|gb|EIY86384.1| hypothetical protein HMPREF1074_02264 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 727

 Score =  226 bits (575), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 141/461 (30%), Positives = 223/461 (48%), Gaps = 42/461 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C + L+  D LF +I + F+++Q K +G T H+Y  D F+E  PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHVYGLDPFNEVDPP 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PEY+  + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD   E   +W  ++ F+  PYIWC L NF GN  + G +         A  +    + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGERLENALINGGGNLKG 430

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG++     Y+ + E A+ +  VD   WI   + R  G     ++DAW  L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDDKWIECLADRHVGCVSQPVRDAWKRLFNDI 489

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                      + +  T G    Y +P   +   K  ++ Y +  L  
Sbjct: 490 Y----------------------AQVPRTLGTLPGY-RPALNKNSEKRTSNVYSNIEL-- 524

Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
              EV R L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D  
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQ 575

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
            +     +  E++ D+D L A H    L  W++ A+++  +
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDS 616


>gi|160884062|ref|ZP_02065065.1| hypothetical protein BACOVA_02038 [Bacteroides ovatus ATCC 8483]
 gi|423291477|ref|ZP_17270325.1| hypothetical protein HMPREF1069_05368 [Bacteroides ovatus
           CL02T12C04]
 gi|156110404|gb|EDO12149.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
           8483]
 gi|392663477|gb|EIY57027.1| hypothetical protein HMPREF1069_05368 [Bacteroides ovatus
           CL02T12C04]
          Length = 727

 Score =  225 bits (574), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 144/461 (31%), Positives = 222/461 (48%), Gaps = 42/461 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C + L+  D LF +I + F+++Q K +G T HIY  D F+E  PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PEY+  + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD   E   +W  ++ F+  PYIWC L NF GN  + G +         A  +    + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGERLENALINGGGNLKG 430

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG++     Y+ + E A+ +  VD   WI   + R  G     ++DAW  L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDDKWIECLADRHVGCVSQPVRDAWKRLFNDI 489

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y              V  P         T G    Y +P       K  ++ Y +  L  
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNRNSEKRTSNVYSNVEL-- 524

Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
              EV R L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D  
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKVEFDRMVEAKDHQ 575

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
            +     +  E++ D+D L A H    L  W++ A+++  +
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDS 616


>gi|423212382|ref|ZP_17198911.1| hypothetical protein HMPREF1074_00443 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392694828|gb|EIY88054.1| hypothetical protein HMPREF1074_00443 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 705

 Score =  225 bits (573), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 149/478 (31%), Positives = 233/478 (48%), Gaps = 61/478 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ VLQKKI+ R+ ELG+ PV P ++G VP  +       +I 
Sbjct: 196 MNNLEGWGGPNPDSWYRQQEVLQKKIVARMRELGIEPVFPGYAGMVPRNIGEKL-GYQIA 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W S    PR      L   D  F      + E+  K YG+ ++ Y+ D F E  NT
Sbjct: 255 DPGKWCSF---PR---PAFLSTEDEHFESFAAMYYEELEKLYGKANY-YSMDPFHEGGNT 307

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             VD    ++  GA+I + M+  +  AVW++Q         W+    + +++S+  G ++
Sbjct: 308 EGVD----LAKTGASIMAAMKKANPKAVWVIQA--------WQANPREEMISSLNQGDML 355

Query: 179 VLDLFAEVKPIWS-------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
           VLDL++E  P W          K F    +++CML NF  N+ ++G +D +  G  +A  
Sbjct: 356 VLDLYSERLPQWGDPDSKWYREKGFGKHDWLYCMLLNFGANVGLHGRMDLLVNGYYDACA 415

Query: 232 SEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-PAIQ 289
             N  T+ GVG + EGIE NPV+++L+ E+ ++ E+     W+  Y   RYG+ V P + 
Sbjct: 416 HANGKTLRGVGATPEGIENNPVMFELLYELPWREERFSPDEWLQGYLKARYGKDVSPEVM 475

Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS----KEAVL 345
           +AW  L HTVYN         RD                   YQ  G   S    +    
Sbjct: 476 EAWRALEHTVYNAP-------RD-------------------YQGEGTVESLLCARPGFH 509

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
              TS++ +  L+YS     +A  L  +   +   SN + YDL+D+ RQ+ A   N L  
Sbjct: 510 LDRTSTWGYAKLFYSPDSTAKAARLLTSVAKQYEGSNNFEYDLVDIVRQSNADKGNVLLE 569

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
           +I ++Y   D     + +++FL+L+   D LL+    F +  WL++A+ L   + ++K
Sbjct: 570 DISQSYDRKDKENFRKQTQQFLDLIVSQDSLLSTRKEFSVSTWLDAARSLGTTDAEKK 627


>gi|323344412|ref|ZP_08084637.1| alpha-N-acetylglucosaminidase [Prevotella oralis ATCC 33269]
 gi|323094539|gb|EFZ37115.1| alpha-N-acetylglucosaminidase [Prevotella oralis ATCC 33269]
          Length = 730

 Score =  224 bits (572), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 141/462 (30%), Positives = 221/462 (47%), Gaps = 39/462 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP+ WL+ Q  LQKKIL R     M PVLPAF+G+VPA L+ +FP A I 
Sbjct: 198 MANIDRWNGPLPKEWLNGQKELQKKILARERAFNMKPVLPAFAGHVPAELKRIFPDANIK 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W   +   ++ C + L   +PLF +I + ++E+Q   +G T HIY  D F+E  PP
Sbjct: 258 SLGKWGGFEE--KYLC-HFLSPEEPLFSKIQKLYLEEQTALFG-TDHIYGVDPFNEVEPP 313

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              P Y+  +   +Y  + + D  A W+  GW+FSYD   W P +++A L  VP GK+ +
Sbjct: 314 SWEPAYLRKVSKNMYGTLTAVDPKAEWMQMGWMFSYDNKHWTPDRVQAFLTGVPKGKMSL 373

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD + E   +W T+  FYG PYIWC L NF GN  + G +         A  +    M+G
Sbjct: 374 LDYYCENVELWKTTDGFYGQPYIWCYLGNFGGNTTLMGNVKESGRRLDNALANGQRNMLG 433

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
            G ++EG++     Y+ +    + H   D + WI+  + R YG   P+++ AW++L++ +
Sbjct: 434 AGSTLEGLDVIQFPYEYLYNKLWSHAVADSR-WIDDLADRHYGGVSPSVRKAWHILFNDI 492

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                       + +  +G   N+ +P             Y    L  
Sbjct: 493 Y---------------------VQVSASMQGVLTNF-RPALNNNYPHRTAIEYPAERL-- 528

Query: 360 STSEVIR-ALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
              EV R  L++     NEL      + D+I + RQ L      +      AY   D   
Sbjct: 529 --EEVWRLLLDVPRCDRNEL------QLDIIAVGRQVLGNRFAVVKTQFDSAYANKDIPR 580

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
           +   +    EL+ D+D L + +    +  W++ A++L   +E
Sbjct: 581 LKAKACEMEELLGDLDRLTSFNSRCSINRWIDDARKLGSTKE 622


>gi|346323119|gb|EGX92717.1| alpha-N-acetylglucosaminidase, putative [Cordyceps militaris CM01]
          Length = 742

 Score =  224 bits (570), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 142/476 (29%), Positives = 240/476 (50%), Gaps = 39/476 (8%)

Query: 10  PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVK 69
           PL  SW+DQQ  LQK+I+ R+ +LG+ P+LPAF G VP A   + P A + +   W  + 
Sbjct: 195 PLSLSWIDQQFALQKRIVARMVQLGITPILPAFPGFVPDAFARLRPGADLVRAPAWGGLP 254

Query: 70  SDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISS 129
           +D        L   D  + E+ R F+E Q++ YG  +++Y  D F+E  P   + +Y+S+
Sbjct: 255 ADSPNTRALFLSPLDDAYAELQRLFVEAQIEAYGNVTNVYAMDQFNEINPVSGATDYLSA 314

Query: 130 LGAAIYSGMQSGDSDAVWLMQGWLF--SYDPFWRPPQMKALLNSVP-LGKLVVLDLFAEV 186
           +    Y+ + + +  AVWLMQGWLF  S   FW   +++A L        +V+LDLF+E 
Sbjct: 315 VSRRSYAALAAANPAAVWLMQGWLFYLSEGNFWTQERIEAYLRGPEDRAGMVILDLFSET 374

Query: 187 KPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEG 246
            P W  +  + G P+IWC +H+F GN  ++G + +    P+EA   E+ +MVG+G++ E 
Sbjct: 375 APQWQRTGSYAGRPWIWCQVHDFGGNQNLFGKITNTTVNPMEA-LRESDSMVGLGIATEA 433

Query: 247 IEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPA-IQDAWNVLYHTVYNCT 303
            E N V+YDL  +  +    +D  ++ + ++ RRY   R +PA +  AW +L  TVY+  
Sbjct: 434 YEGNEVLYDLFFDQGWSATPIDTVSYFHDWTTRRYSGVRQLPASLYQAWELLRVTVYDY- 492

Query: 304 DGATDKNRDVI---VAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
                +  D+I   V+   ++P++  +       Y     K   L  + ++   P +W  
Sbjct: 493 -----RASDLIGVPVSVYQLEPNLTGL-------YNTTTGKPTALHYDPAAL--PPIW-- 536

Query: 361 TSEVIRALELFIASGN---ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
                    LF+A+      L A   +R DL+D+ RQ L+     L+ +++ A+      
Sbjct: 537 --------RLFVAAAAAQPRLWAEPGFRLDLVDVMRQVLSNAFGRLYADLVAAFTGGAPP 588

Query: 418 G-VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVRCPYVSQ 472
             + Q  +R   ++ D+D LLA    F L  WL +A+   ++  +   +     SQ
Sbjct: 589 SEIAQRGQRMRAVLGDVDALLATQPHFSLRRWLNAARAWGESTGENAAIAYEARSQ 644


>gi|299144715|ref|ZP_07037783.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
 gi|298515206|gb|EFI39087.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
          Length = 727

 Score =  224 bits (570), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 141/461 (30%), Positives = 223/461 (48%), Gaps = 42/461 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRLYPEADIQ 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W       R  C + L+  D LF +I + F+++Q K +G   HIY  D F+E  PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-IDHIYGLDPFNEVDPP 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              PEY+  + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++
Sbjct: 311 SFEPEYLRKIVSDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD   E   +W  ++ F+  PYIWC L NF GN  + G +         A  +    + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           +G ++EG++     Y+ + E A+ +  VD   WI   + R  G    +++DAW  L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 489

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y                      + +  T G    Y +P   +   K  ++ Y +  L  
Sbjct: 490 Y----------------------AQVPRTLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524

Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
              EV R L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D  
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQ 575

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
            +     +  E++ D+D L A H    L  W++ A+++  +
Sbjct: 576 ALKACGEKMKEILHDLDKLNAFHPYCSLDKWIDDARKMGDS 616


>gi|423217398|ref|ZP_17203894.1| hypothetical protein HMPREF1061_00667 [Bacteroides caccae
           CL03T12C61]
 gi|392628557|gb|EIY22583.1| hypothetical protein HMPREF1061_00667 [Bacteroides caccae
           CL03T12C61]
          Length = 707

 Score =  223 bits (567), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 144/477 (30%), Positives = 233/477 (48%), Gaps = 59/477 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ  LQKKI+ R+ ELG+ PV P ++G VP  +          
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVSRMRELGIEPVFPGYAGMVPRNIGE-------- 247

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
           +LG  + +    +WC       L + D  F      + E+  K YG+  + Y+ D F E 
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSSEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 304

Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
            NT  VD    ++  G +I   M+  + +AVW++Q W  +  P        A+++ +  G
Sbjct: 305 GNTEGVD----LAKAGTSIMKAMKKANPEAVWVIQAWQANPRP--------AMIDVLNAG 352

Query: 176 KLVVLDLFAEVKPIWSTS-------KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
            ++VLDL++E +P W  S       K F    +++CML NF GN+ ++G ++ +  G  +
Sbjct: 353 DMLVLDLYSEKRPQWGDSDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412

Query: 229 ARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
           A    N   M GVG + EGIE NPV+++L+ E+ ++ E+     W+  Y   RYG  + P
Sbjct: 413 ACAHVNGKRMRGVGATPEGIENNPVMFELLYELPWRAERFSPDVWLQGYLKARYGGELSP 472

Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
            + +AW  L HTVYN                P   P      EG  ++     ++     
Sbjct: 473 EVMEAWRALEHTVYNA---------------PKNSPG-----EGTLESL--LCARPGFHL 510

Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
             TS++ +  L+YS     +A +L ++   +    N + YDL+D+ RQ+ A   N L   
Sbjct: 511 DRTSTWGYSKLFYSPDSTSKAADLMLSVAEQYKGDNNFEYDLVDIVRQSNADKGNALLDE 570

Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
           I ++Y   D     + +++FLEL+   D LL+    F +  WL +A+ L   + ++K
Sbjct: 571 ISQSYDRKDKENFRKQTQQFLELILSQDSLLSTRKEFSVSSWLAAARSLGNTDAEKK 627


>gi|340514474|gb|EGR44736.1| glycoside hydrolase family 89 [Trichoderma reesei QM6a]
          Length = 762

 Score =  223 bits (567), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 142/468 (30%), Positives = 241/468 (51%), Gaps = 38/468 (8%)

Query: 3   NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
           NL G W   LP  W+D Q  LQKKI+ R+ ELG+ P+LPAF G VP A   V P A++  
Sbjct: 209 NLQGSWSSSLPFEWVDDQFALQKKIVKRMVELGITPILPAFPGFVPRAAPRVLPDARLLH 268

Query: 62  LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
              W        +     LD  DPLF ++ R+FI +Q + YG  ++ Y  D F+E  PP 
Sbjct: 269 SIQWAGFPE--IFTEDTFLDPVDPLFAQMQRSFITKQKQAYGNVTNFYTLDQFNEMIPPS 326

Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPL-GKLVV 179
               Y+ ++ +  +  ++S D +A+W+ Q WLF+ +  FW   +++A L  V     +++
Sbjct: 327 GDVAYLRNVSSNTWKALKSADPNAIWVFQAWLFAQNTTFWTNERIEAYLGGVTADSDMLI 386

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD+++E  P W  ++ +YG P+IWC L N+   I +YG + ++   P+ A   E+T++ G
Sbjct: 387 LDIWSESMPQWQRAQSYYGKPWIWCELQNYGATINLYGQIQNVTNSPILA-LQESTSLSG 445

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPAIQDAWNVLYH 297
            G+SMEG + N +VYDL+   A+  E +D +A+ + ++  RY   +    I DAW  +  
Sbjct: 446 FGLSMEGQQNNEIVYDLLLAQAWSSEPLDTEAYFHNWASARYSSDQRPGFIHDAWETVRT 505

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           TVY+ T+         +   P V  SII +           V + + + ++ +      L
Sbjct: 506 TVYDNTN---------LTLMPSVPKSIIEL-----------VPRTSNM-ADITGILGTKL 544

Query: 358 WYSTSEVIRALELFIASG---NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
            Y  + ++ A +    +G     L  ++ Y+YDL+D TRQ LA     ++ NI++ Y  +
Sbjct: 545 PYDPAVMVSAWKQLYHAGLQDTSLFNNSAYQYDLVDWTRQVLANAFIPIYKNIVDIYYNS 604

Query: 415 DAHGVFQLSR------RFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
           +     ++ R      +  +L+  +D +L+ +  F L  WL +A+  A
Sbjct: 605 NQTAGSRIQRLKAQGQQVTKLLLSLDLVLSSNRNFRLSTWLSAARSSA 652


>gi|153807690|ref|ZP_01960358.1| hypothetical protein BACCAC_01972 [Bacteroides caccae ATCC 43185]
 gi|149130052|gb|EDM21264.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
           43185]
          Length = 707

 Score =  222 bits (566), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 144/484 (29%), Positives = 236/484 (48%), Gaps = 73/484 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ  LQKKI+ R+ ELG+ PV P ++G VP  +          
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVSRMRELGIEPVFPGYAGMVPRNIGE-------- 247

Query: 61  QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
           +LG  + +    +WC       L + D  F      + E+  K YG+  + Y+ D F E 
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSSEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 304

Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
            NT  VD    ++  G +I   M+  + +AVW++Q W  +  P        A+++ +  G
Sbjct: 305 GNTEGVD----LAKAGTSIMKAMKKANPEAVWVIQAWQANPRP--------AMVDVLNAG 352

Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
            ++VLDL++E  P       +W   K F    +++CML NF GN+ ++G ++ +  G  +
Sbjct: 353 DMLVLDLYSERLPQWGDPDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412

Query: 229 ARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
           A T  N  T+ GVG + EGIE NPV+++L+ E+ ++ E+     W+  Y   RYG  + P
Sbjct: 413 ACTHANGKTLRGVGTTPEGIENNPVMFELLYELPWRAERFSPDTWLQGYLKARYGGELSP 472

Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
            + +AW  L HTVYN                               +NY    + E++L 
Sbjct: 473 EVMEAWRALEHTVYNAP-----------------------------KNYQGEGTVESLLC 503

Query: 347 S-------ETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 399
           +        TS++ +  L+YS     +A +L ++   +   +N + YDL+D+ RQ+ A  
Sbjct: 504 ARPGFHLDRTSTWGYSKLFYSPDSTSKAADLMLSVAEQYKGNNNFEYDLVDIVRQSNADK 563

Query: 400 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
            N L   I ++Y   D     + +++FLEL+   D LL+    F +  WL +A+ L   +
Sbjct: 564 GNALLDEISQSYDRKDKENFRKQTQQFLELILSQDSLLSTRKEFSVSSWLTAARSLGNTD 623

Query: 460 EQEK 463
            ++K
Sbjct: 624 AEKK 627


>gi|355706271|gb|AES02588.1| N-acetylglucosaminidase [Mustela putorius furo]
          Length = 333

 Score =  222 bits (565), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 129/331 (38%), Positives = 187/331 (56%), Gaps = 45/331 (13%)

Query: 142 DSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVP 200
           D DAVWL+QGWLF + P FW P Q++A+L +VP G+L++LDLFAE +P++  +  F+G P
Sbjct: 3   DPDAVWLLQGWLFQHQPQFWGPAQVRAVLGAVPRGRLLILDLFAESQPVYLRTASFHGQP 62

Query: 201 YIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEM 260
           +IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+
Sbjct: 63  FIWCMLHNFGGNHGLFGALEAVNQGPAAARLFPNSTMVGTGMAPEGIGQNEVVYALMAEL 122

Query: 261 AFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFP 318
            ++ + V D++AW+  ++ RRYG      + AW +L  +VYNC+ +  T  NR  +V   
Sbjct: 123 GWRKDPVADLEAWVTSFAARRYGVDSKETEVAWRLLLGSVYNCSGEACTGHNRSPLVR-- 180

Query: 319 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 378
              PS+  VT                            +WY+ S V  A  L +A+   L
Sbjct: 181 --RPSLQMVTT---------------------------VWYNRSAVFEAWRLLLAAAPTL 211

Query: 379 SASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVED 432
           + S T+RYDL+D+TRQA  +  +  +     AY       L  A G+        EL+  
Sbjct: 212 AKSPTFRYDLLDVTRQAAQELVSLYYTEARTAYLNKELVPLMRAAGIL-----VYELLPA 266

Query: 433 MDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
           +DG+LA    FLLG WLE A+ +A +E   +
Sbjct: 267 LDGVLASDSRFLLGTWLEQARAVAVSETDAR 297


>gi|422873453|ref|ZP_16919938.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens F262]
 gi|380305838|gb|EIA18115.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens F262]
          Length = 2104

 Score =  220 bits (561), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 143/463 (30%), Positives = 232/463 (50%), Gaps = 41/463 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +   P A+  
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 497 GNLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE+  M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
           VG+G++ E I  NP+ Y+L+ +MA+  ++++ + W   Y  RRYG++   I DAWN++  
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           T Y        K  D    +     SII+   G    +G        +KS  S++ H  +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   DA 
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNRDAE 759

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
               +S +FLEL++  + +L+    FL+G W+E A+ + ++ +
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSD 802


>gi|429766730|ref|ZP_19298977.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
           celatum DSM 1785]
 gi|429183354|gb|EKY24416.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
           celatum DSM 1785]
          Length = 2284

 Score =  220 bits (561), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 138/458 (30%), Positives = 230/458 (50%), Gaps = 41/458 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  +GGPLP +W + ++ L +++  R+  LG+ PVL  +SG VP   Q   P A+I 
Sbjct: 378 MQNMTSFGGPLPDNWFEDRVELGRQLHERMQTLGIKPVLQGYSGMVPLDFQKKNPDAQIL 437

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ D     F E+   F E+Q + YG  +  Y  D F E  NT
Sbjct: 438 SQGGWCGFDR-PNMLKTYVNDGERDYFQEVADVFYEKQKEVYGDITDYYAVDPFHEGGNT 496

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +DS    + +   I   M   D DA+W++Q W  + D      ++  L N     + +
Sbjct: 497 GGMDS----ARIYGTIQDKMIEHDEDAIWVIQHWQGNPDN----TKLSGLTNK---EQAL 545

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTM 237
           +LDL +++ P + T      +P++W MLHNF G + + G ++++A    EA  T+EN  M
Sbjct: 546 ILDLNSDLNPDY-TRFDNQDIPWVWNMLHNFGGRMGLDGQVETVATSITEALATTEN--M 602

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
            G+G++ E +  +P+VY+LM +M +  + ++ + W+N Y  RRYG       +AW +L  
Sbjct: 603 KGIGITPEALANSPIVYELMGDMIWTRDPINYREWVNNYIERRYGAVNEDAIEAWEILLE 662

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           T Y  +D            +     SII+             ++ A   +  S++ H  +
Sbjct: 663 TAYKTSD----------YYYQGAAESIIN-------------ARPATSINSASTWGHSKI 699

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            Y   E+ RA+ELFI+  +EL  S+ + YD +D+T+Q LA  A E    ++ AY   DA 
Sbjct: 700 SYDKKELERAMELFISCYDELKDSDAFVYDFLDVTKQVLANSAQEYHKEMVAAYNSGDAE 759

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
              ++S  FL+L+   + +L+    FL+G W+E ++ +
Sbjct: 760 KFERISEHFLDLIRLQERVLSTSPEFLVGTWIEQSRTM 797


>gi|261880010|ref|ZP_06006437.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
 gi|270333326|gb|EFA44112.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
          Length = 719

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 140/460 (30%), Positives = 214/460 (46%), Gaps = 44/460 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP   + +Q  LQ+KIL R   L M PVLPAFSG+VP  ++ ++P + I 
Sbjct: 195 MANIDKWKGPLPYHTVVEQRDLQQKILARERSLNMTPVLPAFSGHVPGQIKQLYPESNIQ 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            LG W +     R    Y +   DPLF +I R ++E+Q   YG T HIY  D F+E  PP
Sbjct: 255 HLGRWAAFSDQYR---CYFMSPQDPLFAKIQRMYLEEQRAIYG-TDHIYGIDPFNEVDPP 310

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
              P+Y+  +   IY  +   D  A WL   WLF +    W P ++KAL+  V  GK+V+
Sbjct: 311 SWDPDYLFQISKGIYQTLAHVDPKAEWLQMSWLFYHKKKKWTPERVKALITGVETGKMVL 370

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD F +   IW  + +FYG PYIWC L NF GN  + G + +         T     + G
Sbjct: 371 LDYFCDRNEIWKMTDKFYGQPYIWCYLGNFGGNTTVAGNVKACGAKLDSTLTLGGKNLQG 430

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VG+++EG +     Y+ + +  +     +   WI+  +    G + P+ + AW +LYH V
Sbjct: 431 VGLTLEGFDVCQFPYEYILDKVWSGNSSE-NQWIDALADSHVGYASPSFRKAWQLLYHDV 489

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           +  + G+           P   P + S+                       ++ + H+ Y
Sbjct: 490 FVQSAGSNG-------ILPCYRPELNSL-----------------------NWHYTHVDY 519

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN---IIEAYQLNDA 416
              ++I A +L     +  S     + DLI   RQ L    NE   +      AY   D 
Sbjct: 520 DRQKLIEAWKLMQHDAD--SKRTAAQLDLIHYGRQVL---GNEFLTHKQLFDSAYAHCDL 574

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
            G+   +     ++ D+D L A H    L  W++ A+Q+A
Sbjct: 575 AGMMAQAASMRHIMLDIDTLTAYHPRCTLAGWIDGARQMA 614


>gi|212693694|ref|ZP_03301822.1| hypothetical protein BACDOR_03214 [Bacteroides dorei DSM 17855]
 gi|265755881|ref|ZP_06090348.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_33FAA]
 gi|212663753|gb|EEB24327.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides dorei DSM 17855]
 gi|263233959|gb|EEZ19560.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_33FAA]
          Length = 718

 Score =  218 bits (556), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 152/480 (31%), Positives = 241/480 (50%), Gaps = 67/480 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P+SW  +Q  LQKKI+ R+ E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPESWYIRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V +DP + C+Y     L   D  F EI   + ++  K YG+T   Y  D F E
Sbjct: 240 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYKELTKLYGKTG-FYAIDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             +T  V+    + + G AI   M+  + DAVW+ Q W        +     +++  +  
Sbjct: 296 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--------QDNPRTSMIEHLEA 343

Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           G L+VLDL +E +P W         K  YG   +++CML NF GNI ++G +D++  G  
Sbjct: 344 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYCMLLNFGGNIGLHGKMDALINGFY 403

Query: 228 EARTSENT--TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
           +A+T  +   T+ GVGM+ EGIE NPV+Y+L+ E+ ++  +     W+  Y   RYG   
Sbjct: 404 DAKTDNHAGKTLCGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 463

Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
            A+Q AW++L + +YN       +  +  V  A P +D          YQ          
Sbjct: 464 EALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 504

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                 SS+     +Y+  +VI A  L ++  ++   +N + +DL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 559

Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
              +  A++  D   VF+L S+ FL L+   D LL     F +G W+E+A+   Q +E++
Sbjct: 560 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDHLLGTRKEFKVGTWIEAARSAGQTQEEK 618


>gi|373460171|ref|ZP_09551927.1| hypothetical protein HMPREF9944_00191 [Prevotella maculosa OT 289]
 gi|371956556|gb|EHO74342.1| hypothetical protein HMPREF9944_00191 [Prevotella maculosa OT 289]
          Length = 742

 Score =  218 bits (556), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 142/472 (30%), Positives = 224/472 (47%), Gaps = 52/472 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGPLP SW  QQ  LQKKIL R++E GM PVLP F G +P   +       +T
Sbjct: 186 MNNLEGWGGPLPDSWYKQQETLQKKILQRMHEYGMEPVLPGFCGMMPHDAKEKL-GLNVT 244

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W              L  TD  F  I   +  +  + YG+ ++ Y+ D F E+   
Sbjct: 245 DGGKWNGYTRPAN------LSPTDSQFNRIADLYYAELTRLYGKANY-YSMDPFHESNDD 297

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D+ +Y    G+ +   M+  +  A W++QGW  + +P  RP  ++ + N    G L++L
Sbjct: 298 -DALDY-GKAGSVMLEAMKRINPKATWVIQGW--TENP--RPRMIQDMKN----GDLLIL 347

Query: 181 DLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA--FGPVEARTS 232
           DLF+E +P      +W   K +    +++CML NF  N+ ++G +D +   F   + R+ 
Sbjct: 348 DLFSECRPMFGIPSVWKREKGYEQHDWLFCMLENFGANVGLHGRMDQLIHNFYSTKKRSP 407

Query: 233 ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAW 292
               + G+G +MEG E NPV+++LMSE+ ++ E    + W+  Y   RYGR    I+ AW
Sbjct: 408 NTQHLKGIGFTMEGSENNPVMFELMSELPWRPEIFKKEDWVRGYVKARYGRKDETIERAW 467

Query: 293 NVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS 350
            +L  T+YNC  G   +  +  V    P ++   +  +  K +NY               
Sbjct: 468 LLLAETIYNCPAGNNQQGPHESVFCGRPGLNNFQVK-SWSKMRNY--------------- 511

Query: 351 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEA 410
                   Y     + A  L  +  +    +N + YDLID+ RQALA      +L  I  
Sbjct: 512 --------YDPQATLEAARLMASVSSRYKGNNNFEYDLIDICRQALADQGRLQYLKTIAD 563

Query: 411 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
           Y         + ++RFL+++   D LL     F LG W E+A+ L   + ++
Sbjct: 564 YNGFSRAAFAKDAKRFLDMILLQDRLLGTRKEFRLGHWTEAARSLGTTQAEK 615


>gi|237708859|ref|ZP_04539340.1| glycoside hydrolase family 89 protein [Bacteroides sp. 9_1_42FAA]
 gi|345513372|ref|ZP_08792893.1| glycoside hydrolase family 89 protein [Bacteroides dorei 5_1_36/D4]
 gi|423228941|ref|ZP_17215347.1| hypothetical protein HMPREF1063_01167 [Bacteroides dorei
           CL02T00C15]
 gi|423242228|ref|ZP_17223337.1| hypothetical protein HMPREF1065_03960 [Bacteroides dorei
           CL03T12C01]
 gi|423247755|ref|ZP_17228803.1| hypothetical protein HMPREF1064_05009 [Bacteroides dorei
           CL02T12C06]
 gi|229457285|gb|EEO63006.1| glycoside hydrolase family 89 protein [Bacteroides sp. 9_1_42FAA]
 gi|345456211|gb|EEO47557.2| glycoside hydrolase family 89 protein [Bacteroides dorei 5_1_36/D4]
 gi|392631297|gb|EIY25272.1| hypothetical protein HMPREF1064_05009 [Bacteroides dorei
           CL02T12C06]
 gi|392635177|gb|EIY29082.1| hypothetical protein HMPREF1063_01167 [Bacteroides dorei
           CL02T00C15]
 gi|392639514|gb|EIY33330.1| hypothetical protein HMPREF1065_03960 [Bacteroides dorei
           CL03T12C01]
          Length = 717

 Score =  218 bits (556), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 152/480 (31%), Positives = 241/480 (50%), Gaps = 67/480 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P+SW  +Q  LQKKI+ R+ E G+ PVLP + G VP   +         
Sbjct: 187 MNNLEGWGGPNPESWYIRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 238

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V +DP + C+Y     L   D  F EI   + ++  K YG+T   Y  D F E
Sbjct: 239 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYKELTKLYGKTG-FYAIDPFHE 294

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             +T  V+    + + G AI   M+  + DAVW+ Q W        +     +++  +  
Sbjct: 295 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--------QDNPRTSMIEHLEA 342

Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           G L+VLDL +E +P W         K  YG   +++CML NF GNI ++G +D++  G  
Sbjct: 343 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYCMLLNFGGNIGLHGKMDALINGFY 402

Query: 228 EARTSENT--TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
           +A+T  +   T+ GVGM+ EGIE NPV+Y+L+ E+ ++  +     W+  Y   RYG   
Sbjct: 403 DAKTDNHAGKTLCGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 462

Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
            A+Q AW++L + +YN       +  +  V  A P +D          YQ          
Sbjct: 463 EALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 503

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                 SS+     +Y+  +VI A  L ++  ++   +N + +DL+D+ RQALA+    +
Sbjct: 504 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 558

Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
              +  A++  D   VF+L S+ FL L+   D LL     F +G W+E+A+   Q +E++
Sbjct: 559 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDHLLGTRKEFKVGTWIEAARSAGQTQEEK 617


>gi|329962235|ref|ZP_08300241.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
 gi|328530343|gb|EGF57220.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
          Length = 726

 Score =  218 bits (556), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 144/471 (30%), Positives = 237/471 (50%), Gaps = 60/471 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP   SW + ++ LQK+IL R+ E G++PVLP +SG +P   +       ++
Sbjct: 188 MNNLEGWGGPNTDSWYEDRIALQKRILKRMREYGIHPVLPGYSGMLPHNAKEKL-GVNVS 246

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
             G W          C Y     L  TD  F EI   + E+  + YG+ +  Y+ D F E
Sbjct: 247 DPGTW----------CGYNRPAFLQPTDTRFGEIAALYYEEMNRLYGK-ADFYSMDPFHE 295

Query: 117 NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
                     + + G AI+  M+    ++VW++Q W  +       P+ + ++ +VP G 
Sbjct: 296 GGKVAGVN--LDAAGQAIWQAMKKNSRNSVWVVQAWGAN-------PRAQ-MIKNVPRGD 345

Query: 177 LVVLDLFAEVKPIWSTSKQ-------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA 229
           ++VLDL++E +P W   +        F G  +++CML N+ GN+ ++G +  +     +A
Sbjct: 346 MLVLDLYSESRPQWGEPESSWYRENGFDGHQWLYCMLLNYGGNVGLHGKMQHVIDAYYKA 405

Query: 230 -RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAI 288
            R+S   T+ GVGM+MEG E NPV+Y+L+ E+ ++        W+  Y   RYG+  P +
Sbjct: 406 SRSSFGNTLKGVGMTMEGSENNPVMYELLCELPWRPSTFSKDEWLEGYIAARYGKCTPRL 465

Query: 289 QDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
           ++AW +L +++YNC   +T +  +  +  A P +                   + +A   
Sbjct: 466 REAWVLLGNSIYNCPPRSTQQGTHESIFCARPSLK------------------AYQASSW 507

Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
           SE S Y  P       +VIRA  LF+    +   ++ + YDL+D+TRQA+A+    ++  
Sbjct: 508 SEMSDYYRPQ------DVIRAAGLFLEEAGQFKGNDNFEYDLVDITRQAVAEKGRLIYKV 561

Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 457
           I  +Y+  D   + Q S RFLEL+   D LLA    F +G W+E A+ L  
Sbjct: 562 IQASYEAGDKPLLRQASDRFLELLLLQDRLLATRPEFKVGRWIEQARNLGH 612


>gi|395804724|ref|ZP_10483959.1| alpha-N-acetylglucosaminidase [Flavobacterium sp. F52]
 gi|395433112|gb|EJF99070.1| alpha-N-acetylglucosaminidase [Flavobacterium sp. F52]
          Length = 722

 Score =  218 bits (556), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 145/465 (31%), Positives = 231/465 (49%), Gaps = 34/465 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++   GPLPQ W  ++  LQKKIL R+  L M+PV+PAFSG VP A     P AKIT
Sbjct: 208 MGNINSLEGPLPQEWFVKKEALQKKILERMKALDMHPVVPAFSGYVPKAFAEKHPEAKIT 267

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L +W    S   +  T+LLD+ DPLF +IG+ FIE   K YG+ S+ Y  D+F+E  PP
Sbjct: 268 ELKSW----SGGGFASTFLLDSKDPLFKQIGKRFIEIYTKMYGK-SNFYLADSFNEIEPP 322

Query: 121 V---DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK 176
           V   +  E +S+ G+A+Y  +      AVW+MQGWLF  +  FW     KA L+ VP  K
Sbjct: 323 VSEHNKYEELSNYGSAVYETIDEAAPGAVWVMQGWLFGDNKEFWTKEATKAFLSKVPNEK 382

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT- 235
           ++V D   +   +W   + FYG  + +  +HN+ G+  +YG L+           + N  
Sbjct: 383 VMVQDYANDRYKVWENQEAFYGKQWTYGYVHNYGGSNPVYGDLNFYKDELASLLKNPNRG 442

Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
            +VG G   EG+  N +VY+ + ++ +   +  +  W+ +Y   RYG++  ++  AW +L
Sbjct: 443 NIVGYGAMPEGLNNNSIVYEYIYDLPWTKAEQPLNDWMAKYLNARYGQTSESVFHAWELL 502

Query: 296 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
             +VYN     T    D   A+       + +TE K    G P  K              
Sbjct: 503 LKSVYNVKYWETRWWNDWAGAYLLFKRPTVKITEFK----GNPGDK-------------- 544

Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
                  ++  AL++      + + +N  +YDLID++R   +   +E  +  I+AYQ  +
Sbjct: 545 ------IKLKEALDILKKEAKKYNKNNLIQYDLIDVSRHYNSLSIDEELIECIKAYQEKN 598

Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
                QL ++  + V + D +++      L  W++SA     + E
Sbjct: 599 IAKGDQLFKQIEKQVLETDKMMSGQPLNNLNQWVKSASDYGSSPE 643


>gi|168216263|ref|ZP_02641888.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens NCTC 8239]
 gi|182381741|gb|EDT79220.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens NCTC 8239]
          Length = 2104

 Score =  218 bits (555), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 141/461 (30%), Positives = 231/461 (50%), Gaps = 41/461 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +   P A+  
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEEEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L      G+ +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKGQAM 545

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE+  M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
           VG+G++ E I  NP+ Y+L+ +MA+  ++++ + W   Y  RRYG++   I +AWN++  
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILD 662

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           T Y        K  D    +     SII+   G    +G        +KS  S++ H  +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +  
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
               +S +FLEL++  + +L+    FL+G W+E A+ + ++
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKD 800


>gi|110801838|ref|YP_698175.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens SM101]
 gi|110682339|gb|ABG85709.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens SM101]
          Length = 2095

 Score =  218 bits (554), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 142/461 (30%), Positives = 230/461 (49%), Gaps = 41/461 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +   P A+  
Sbjct: 369 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 428

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F  +   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 429 SQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 487

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 488 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 536

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE+  M
Sbjct: 537 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 593

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
           VG+G++ E I  NP+ Y+L+ +MA+  ++++ + W   Y  RRYG++   I +AWN++  
Sbjct: 594 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNEEILEAWNIILD 653

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           T Y        K  D    +     SII+   G    +G        +KS  S++ H  +
Sbjct: 654 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 690

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   DA 
Sbjct: 691 VYDKSEFEKAIEIFSKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNRDAE 750

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
               +S +FLEL++  + +L+    FL+G W+E A+ + ++
Sbjct: 751 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKD 791


>gi|168207628|ref|ZP_02633633.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens E str. JGS1987]
 gi|170661027|gb|EDT13710.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens E str. JGS1987]
          Length = 2104

 Score =  218 bits (554), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 141/463 (30%), Positives = 231/463 (49%), Gaps = 41/463 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +   P A+  
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQEEVFGEVTNFYGVDPFHEGGNT 496

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE+  M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
           VG+G++ E I  NP+ Y+L+ +MA+  ++++ + W   Y  RRYG++   I DAWN++  
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           T Y        K  D    +     SII+   G    +G        +KS  S++ H  +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +  
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
               +S +FLEL++  + +L+    FL+G W+E A+ + ++ +
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSD 802


>gi|422345314|ref|ZP_16426228.1| hypothetical protein HMPREF9476_00301 [Clostridium perfringens
           WAL-14572]
 gi|373228039|gb|EHP50349.1| hypothetical protein HMPREF9476_00301 [Clostridium perfringens
           WAL-14572]
          Length = 1842

 Score =  218 bits (554), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 141/463 (30%), Positives = 231/463 (49%), Gaps = 41/463 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +   P A+  
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE+  M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
           VG+G++ E I  NP+ Y+L+ +MA+  ++++ + W   Y  RRYG++   I DAWN++  
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           T Y        K  D    +     SII+   G    +G        +KS  S++ H  +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +  
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
               +S +FLEL++  + +L+    FL+G W+E A+ + ++ +
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSD 802


>gi|182624959|ref|ZP_02952737.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens D str. JGS1721]
 gi|177909756|gb|EDT72174.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens D str. JGS1721]
          Length = 2104

 Score =  217 bits (553), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 141/463 (30%), Positives = 231/463 (49%), Gaps = 41/463 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +   P A+  
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE+  M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
           VG+G++ E I  NP+ Y+L+ +MA+  ++++ + W   Y  RRYG++   I DAWN++  
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           T Y        K  D    +     SII+   G    +G        +KS  S++ H  +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +  
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
               +S +FLEL++  + +L+    FL+G W+E A+ + ++ +
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSD 802


>gi|169346867|ref|ZP_02865815.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens C str. JGS1495]
 gi|169296926|gb|EDS79050.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens C str. JGS1495]
          Length = 2104

 Score =  217 bits (553), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 141/461 (30%), Positives = 230/461 (49%), Gaps = 41/461 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +   P A+  
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE+  M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
           VG+G++ E I  NP+ Y+L+ +MA+  ++++ + W   Y  RRYG++   I DAWN++  
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           T Y        K  D    +     SII+   G    +G        +KS  S++ H  +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +  
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
               +S +FLEL++  + +L+    FL+G W+E A+ + ++
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKD 800


>gi|262406054|ref|ZP_06082604.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
 gi|294648118|ref|ZP_06725661.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
 gi|294806859|ref|ZP_06765684.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
           CC 1b]
 gi|345510559|ref|ZP_08790126.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
 gi|229443271|gb|EEO49062.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
 gi|262356929|gb|EEZ06019.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
 gi|292636502|gb|EFF54977.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
 gi|294445888|gb|EFG14530.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
           CC 1b]
          Length = 718

 Score =  217 bits (553), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 138/468 (29%), Positives = 230/468 (49%), Gaps = 54/468 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP A     P  +  
Sbjct: 197 MGNLNKWDGPLSDTWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEAFAQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLTEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
           V+A R +    ++G G + EG+E N VVY+L+++M +  + +D+  W+  Y   RYG   
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
            A+++AW +   T Y+            + ++P      +   + +       +SK  + 
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVVPDQRR-------ISKIDL- 532

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
                          + + ++A+ L+ +  +EL +S  YR DLI+     LA  A   + 
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYLAAKAENFYK 577

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
             ++    N      +  ++ ++L+ D+D LLA H  + L  W+E A+
Sbjct: 578 QALKDDSENRVFAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVEFAR 625


>gi|160914140|ref|ZP_02076362.1| hypothetical protein EUBDOL_00149 [Eubacterium dolichum DSM 3991]
 gi|158433951|gb|EDP12240.1| hypothetical protein EUBDOL_00149 [Eubacterium dolichum DSM 3991]
          Length = 2150

 Score =  217 bits (553), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 135/465 (29%), Positives = 236/465 (50%), Gaps = 45/465 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ +GGPLP  W +Q++ L +K+  R+   G++PV+  F G VP +       A +T
Sbjct: 387 MQNLYSFGGPLPDDWFEQRVELGRKMHDRMQAFGIDPVIQGFCGQVPMSFVEKNEGAVLT 446

Query: 61  QLGNWFSVKSDPRWCCTYLLD-----ATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
            +  W S  + P    TYL            F ++ + F E+Q   +G  S  Y  D F 
Sbjct: 447 PIDEWPSF-TRPAMIKTYLSQEEIAAGKKDYFKDVAKTFYEKQKNVFGDVSDYYASDPFH 505

Query: 116 E--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP 173
           E  NT  +D    ++++   +   M   ++DA+W+MQ W  + D      ++  L   V 
Sbjct: 506 EGGNTQGLD----VTNIFKTVQEEMLKSNADAIWVMQQWQGNLDH----AKLSGL---VK 554

Query: 174 LGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE 233
             + + LDL +++ P  S+  +  G+ +IWCMLHNF G + + G ++ IA  P  A  S 
Sbjct: 555 PEQALALDLQSDMNP--SSVMENEGISWIWCMLHNFGGRMGLDGEVEVIAKEPAIA-ASN 611

Query: 234 NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWN 293
           N  M G+G++ E +E +P+VY+++ +M +  + +D +AW+++Y+ RR G S  ++Q+AW+
Sbjct: 612 NQYMKGIGITPEALENSPIVYEMLFDMTWSKDPIDYQAWVDKYATRRAGGSSDSLQEAWD 671

Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
           +L  T Y          +D  + +     ++I+   G   N+           S  S++ 
Sbjct: 672 MLLETAY----------KDKGIYYQGAGETVINARPG--TNF-----------SSASTWG 708

Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
           H ++ Y   E+ + L L I + +  +AS  YRYDL D+  Q L   A E    +++A   
Sbjct: 709 HSNILYDKEELDKVLSLLIENYDAFAASEAYRYDLADVAEQVLCNAAIEYHALMVQALNN 768

Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
            D+    ++S  FLEL++  D +L   + F+LG W+  A+++  N
Sbjct: 769 KDSAEFKRISTHFLELIDLSDRILGSSEEFMLGTWIHDAREMLDN 813


>gi|168212494|ref|ZP_02638119.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens CPE str. F4969]
 gi|170716100|gb|EDT28282.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens CPE str. F4969]
          Length = 2104

 Score =  217 bits (553), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 141/461 (30%), Positives = 230/461 (49%), Gaps = 41/461 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +   P A+  
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE+  M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
           VG+G++ E I  NP+ Y+L+ +MA+  ++++ + W   Y  RRYG++   I DAWN++  
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           T Y        K  D    +     SII+   G    +G        +KS  S++ H  +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +  
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
               +S +FLEL++  + +L+    FL+G W+E A+ + ++
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKD 800


>gi|326934230|ref|XP_003213195.1| PREDICTED: hypothetical protein LOC100549752 [Meleagris gallopavo]
          Length = 650

 Score =  217 bits (552), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 100/207 (48%), Positives = 140/207 (67%), Gaps = 4/207 (1%)

Query: 33  LGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 92
           LGM  VLPAF+G+VP  +  VFP    T+LGNW     D    C YLL   +P+F  IG 
Sbjct: 4   LGMTTVLPAFAGHVPPGVLRVFPRINATRLGNWSHF--DCTLSCAYLLSPEEPMFQVIGT 61

Query: 93  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 152
            F+++ +KE+G T HIY+ DTF+E +P    P Y++ +  A++  M   D +A WLMQGW
Sbjct: 62  LFLKELIKEFG-TDHIYSADTFNEMSPLSSDPAYLAGITNAVFRAMTGADPEAQWLMQGW 120

Query: 153 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 211
           LF + P FW+PPQ++A+L +VPLG+++VLDLFAE KP++  ++ FYG P+IWCMLHNF G
Sbjct: 121 LFQHQPAFWQPPQVQAVLRAVPLGRMIVLDLFAESKPVYEWTESFYGQPFIWCMLHNFGG 180

Query: 212 NIEMYGILDSIAFGPVEARTSENTTMV 238
           N  ++G +++I  GP  AR   N+TMV
Sbjct: 181 NHGLFGAVEAINRGPFVARRFPNSTMV 207


>gi|168209163|ref|ZP_02634788.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens B str. ATCC 3626]
 gi|170712640|gb|EDT24822.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens B str. ATCC 3626]
          Length = 2104

 Score =  217 bits (552), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 141/461 (30%), Positives = 229/461 (49%), Gaps = 41/461 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +   P A+  
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F  +   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE+  M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
           VG+G++ E I  NP+ Y+L+ +MA+  ++++ + W   Y  RRYG++   I DAWN++  
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           T Y        K  D    +     SII+   G    +G        +KS  S++ H  +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +  
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
               +S +FLEL++  + +L+    FL+G W+E A+ + ++
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKD 800


>gi|224027030|ref|ZP_03645396.1| hypothetical protein BACCOPRO_03789 [Bacteroides coprophilus DSM
           18228]
 gi|224020266|gb|EEF78264.1| hypothetical protein BACCOPRO_03789 [Bacteroides coprophilus DSM
           18228]
          Length = 837

 Score =  216 bits (551), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 148/481 (30%), Positives = 223/481 (46%), Gaps = 62/481 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI- 59
           M N+ G  GPL   W   QL LQ KIL R+  LGM P+ P F G +P A + ++P   I 
Sbjct: 193 MGNVSGIDGPLNPDWHAGQLALQHKILDRMRALGMKPICPGFPGFIPEAFKRIYPDLHIV 252

Query: 60  -TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
            T  G  F           +++  T+PLF +I  AFI++  KE+G+  + Y  D+F+E  
Sbjct: 253 ETHWGGAFH---------NWMISPTEPLFAKISEAFIKEWEKEFGKCDY-YLVDSFNEMD 302

Query: 119 PPVDSP------EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNS 171
            P          E  +S G  +YS ++  + DAVW+MQGW+F Y    W    + AL++ 
Sbjct: 303 IPFPEKGNPARYEMAASYGEKVYSSIKRANKDAVWVMQGWMFGYQRHIWDYETLGALVSR 362

Query: 172 VPLGKLVVLDL-------FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
           VP  K+++LDL       F   +  W   K FY   +++ ++ N  G   M G+LD  A 
Sbjct: 363 VPDDKMLLLDLAVDYNRHFWHSEVNWEYYKGFYNKQWVYSVIPNMGGKTGMTGVLDFYAN 422

Query: 225 GPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
           G +EA +S N   +V  G++ EGIE N V+Y+L+++  +   ++DV+ W+ QYS+ RYG+
Sbjct: 423 GHLEALSSSNRGNLVAHGLAPEGIENNEVLYELVTDAGWSDHRMDVRDWLKQYSINRYGK 482

Query: 284 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
           +   +  AW+ L  +VY                F D          G  +N    +S   
Sbjct: 483 APAQLMKAWDYLLKSVYG--------------TFTDHPRFNWQFRPGLVKNGSINIS--- 525

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                              +  + LE F+A+  EL  S  Y  DL ++T   L   A  L
Sbjct: 526 ------------------DDYFKGLESFVAASEELKDSPYYLTDLCEMTAHYLGSKAEIL 567

Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
              I + Y L D      L  RF   +  MD +L+ H    L  W+  A + A+ E Q K
Sbjct: 568 TRQIDQEYLLGDTLQAHFLQSRFETFMLGMDRILSQHPTLRLDRWVSFASKAARTEAQRK 627

Query: 464 Q 464
           Q
Sbjct: 628 Q 628


>gi|427385205|ref|ZP_18881710.1| hypothetical protein HMPREF9447_02743 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727373|gb|EKU90233.1| hypothetical protein HMPREF9447_02743 [Bacteroides oleiciplenus YIT
           12058]
          Length = 719

 Score =  216 bits (551), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 142/474 (29%), Positives = 234/474 (49%), Gaps = 57/474 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
           M+NL GWGGP P SW  QQ VLQKKIL R+ E G+ PV P +SG VP  A + +  +   
Sbjct: 190 MNNLEGWGGPNPDSWYAQQEVLQKKILKRMREYGIKPVFPGYSGMVPHDADEKLGLNLTK 249

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
           + L N F+  +         L  TD  F EI   +  +Q K +G+  + Y+ D F E  N
Sbjct: 250 SDLWNGFTRPA--------FLQPTDTRFAEIANLYYREQEKLFGKADY-YSMDPFHEAEN 300

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
              VD      + G AI   M+  +  A W++QGW  + +P  RP  ++ + N    G L
Sbjct: 301 AASVD----FDAAGKAIMQAMKKVNPKATWVVQGW--TENP--RPEMIENMKN----GDL 348

Query: 178 VVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
           ++LDLF+E +P      IW   K +    +++CML NF GN+ ++G +D +     + + 
Sbjct: 349 LILDLFSECRPMWGIPSIWKRDKGYEQHDWLFCMLLNFGGNVGLHGRMDQLLDNFYQTKD 408

Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
           +   T + G+G++MEG E NPV+++LM E+ ++ EK   + W+  Y   RYG     I+ 
Sbjct: 409 NPLATHLKGIGLTMEGSENNPVMFELMCELPWRPEKFTKEEWLKDYLFARYGVKDEKIEK 468

Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
           AW +L +++YNC  G   +  +  +    P ++ +  + +  K +NY  P   E      
Sbjct: 469 AWTLLANSIYNCPFGNNQQGPHESIFCGRPSMN-NFQASSWSKMKNYYDPTVTE------ 521

Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
                             A  L +   ++   +N + YDL+D+ RQ+L+     ++   I
Sbjct: 522 -----------------EAARLMLEVADKYRGNNNFEYDLVDIVRQSLSDKGRIVYNQTI 564

Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
             ++  D     + S++FL+++   D LL     F +G W+E A+ L    E++
Sbjct: 565 ADFKSFDKRSFARDSQKFLDILLLQDRLLGTRSEFRVGRWIEQARNLGTTPEEK 618


>gi|322703040|gb|EFY94656.1| alpha-N-acetylglucosaminidase, putative [Metarhizium anisopliae
           ARSEF 23]
          Length = 774

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 130/426 (30%), Positives = 228/426 (53%), Gaps = 36/426 (8%)

Query: 5   HGWGGP--LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQL 62
             WGG   LP ++++QQ  LQK+I+ R+ ELG+ PVLPAF G VP +++ V P+A +T  
Sbjct: 209 RSWGGKGDLPLAFIEQQFELQKQIVTRMVELGITPVLPAFPGFVPESIKKVRPNANLTVS 268

Query: 63  GNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVD 122
            NWF+   D ++     LD  D  + E+ + F+ +Q+  +G  +++Y  D F+E +P   
Sbjct: 269 PNWFAPAPD-KYTRDLFLDPLDDTYAELQKLFVTKQIDAFGNVTNVYTLDQFNELSPASG 327

Query: 123 SPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLGK-LVVL 180
              Y+  +    Y+G+ + +  AVWL+QGWL FS   FW  P++ A L  V   + ++VL
Sbjct: 328 DTAYLRGIARNTYAGLTAANPAAVWLLQGWLFFSSRNFWTQPRIDAYLGGVEDHQGMLVL 387

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL++EV P W  +  + G P+IWC LH+F GN+ + G + ++   P++A  +++ ++VG 
Sbjct: 388 DLYSEVNPQWQRTNSYSGKPWIWCQLHDFGGNMALEGRVQTLTSAPIDA-LAQSKSLVGF 446

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPA-IQDAWNVLYH 297
           G++ E  E N VVYD++ + A+    +D +A+   +  +RY    S+P+ +  AW +L  
Sbjct: 447 GLTPEAYEGNEVVYDILLDQAWSATPLDTQAYFASWVTKRYAGISSIPSELYRAWEILRT 506

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
            VY+ T   TD  + V VA   + P++  +                   + T  + HP  
Sbjct: 507 DVYSNTR--TDIPQ-VPVATYQLRPALSGIA------------------NRTGHFPHPTA 545

Query: 358 WYSTSEVIRA-----LELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
            +    V++      LE     G+ L     ++ D +D++RQ L+   + L+ +++ AY+
Sbjct: 546 LHYDPLVLQGVWKLMLEALTRQGS-LWKVPAFQLDFVDVSRQMLSNQFDVLYADLVNAYK 604

Query: 413 LNDAHG 418
            +   G
Sbjct: 605 CSTGAG 610


>gi|423312588|ref|ZP_17290525.1| hypothetical protein HMPREF1058_01137 [Bacteroides vulgatus
           CL09T03C04]
 gi|392688276|gb|EIY81565.1| hypothetical protein HMPREF1058_01137 [Bacteroides vulgatus
           CL09T03C04]
          Length = 717

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 154/480 (32%), Positives = 239/480 (49%), Gaps = 67/480 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P+SW  +Q  LQKKI+ R+ E G+ PVLP + G VP   +         
Sbjct: 187 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 238

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V +DP + C+Y     L   D  F EI   +  +  K YG+T   Y  D F E
Sbjct: 239 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 294

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             +T  V+    + + G AI   M+  + DAVW+ Q W    +P  R P    ++  +  
Sbjct: 295 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 342

Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSI--AFG 225
           G L+VLDL +E +P W         K  YG   +++CML NF GNI ++G +D++   F 
Sbjct: 343 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYCMLLNFGGNIGLHGKMDALIDGFY 402

Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
             +A      T+ GVGM+ EGIE NPV+Y+L+ E+ ++  +     W+  Y   RYG   
Sbjct: 403 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 462

Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
            A+Q AW++L + +YN       +  +  V  A P +D          YQ          
Sbjct: 463 EALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 503

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                 SS+     +Y+  +VI A  L ++  ++   +N + +DL+D+ RQALA+    +
Sbjct: 504 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 558

Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
              +  A++  D   VF+L S+ FL L+   D LL     F +G W+E+A+   Q +E++
Sbjct: 559 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 617


>gi|146300873|ref|YP_001195464.1| alpha-N-acetylglucosaminidase [Flavobacterium johnsoniae UW101]
 gi|146155291|gb|ABQ06145.1| Candidate alpha-glycosidase; Glycoside hydrolase family 89
           [Flavobacterium johnsoniae UW101]
          Length = 723

 Score =  216 bits (549), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 146/469 (31%), Positives = 230/469 (49%), Gaps = 35/469 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++   GPLPQ W  ++  LQKKIL R+  L M+PV+PAFSG VP A     P AKIT
Sbjct: 208 MGNINSLEGPLPQEWFSKKEELQKKILERMRTLDMHPVVPAFSGYVPKAFAEKHPEAKIT 267

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L +W    S   +  T+LLD+ DPLF +IG+ FIE   K YG+ S+ Y  D+F+E  PP
Sbjct: 268 ELNSW----SGGGFESTFLLDSKDPLFKKIGKRFIEIYTKMYGK-SNFYLADSFNEIEPP 322

Query: 121 V---DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK 176
           V   +  E +++ G+AIY  ++     AVW+MQGWLF  +  FW      A L+ VP  +
Sbjct: 323 VTEHNKYEELANYGSAIYETIEEAAPGAVWVMQGWLFGDNKNFWTKEATSAFLSKVPNDR 382

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE-ARTSENT 235
           L+V D   +   +W   + FYG  + +  +HN+ G+  +YG L+      V   +     
Sbjct: 383 LMVQDYANDRYKVWENQEAFYGKQWTYGYVHNYGGSNPVYGDLNFYKNELVSLLKNPHRG 442

Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVPAIQDAWNV 294
            +VG G   EG+  N +VY+ + ++ +   +  VK W+  Y   RY  ++  ++  AW +
Sbjct: 443 NVVGYGAMPEGLNNNAIVYEFIYDLPWSKGEQSVKDWLTNYLNARYEQKTSDSVFKAWEL 502

Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
           L  +VY+     T    D   A+        ++TE K    G P  K+            
Sbjct: 503 LLESVYSTKYWETRWWNDRAGAYLLFKRPTATITEFK----GNPGDKD------------ 546

Query: 355 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
                   ++  AL++  A   +    N  +YDLID +R   +   +E  +  ++AYQ  
Sbjct: 547 --------KLKEALDILKAEAKKYDKKNFIQYDLIDASRHYYSLSIDEDLVECVKAYQQK 598

Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
           D     QL ++  + V ++D  ++      L  W++SA +     E  K
Sbjct: 599 DITKGDQLFKKIEKQVLEIDKSMSGQPLNSLNYWVKSASEYGSTPEVSK 647


>gi|410096483|ref|ZP_11291470.1| hypothetical protein HMPREF1076_00648 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409226447|gb|EKN19356.1| hypothetical protein HMPREF1076_00648 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 718

 Score =  216 bits (549), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 147/479 (30%), Positives = 240/479 (50%), Gaps = 66/479 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P SW  QQ+ LQ++I+ R+ E G+ PV P +SG VP   +         
Sbjct: 187 MNNLEGWGGPNPDSWYKQQITLQQRIVKRMREYGIEPVFPGYSGMVPHNAKE-------- 238

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L  TDP F EI   + ++  K YG+ ++ Y+ D F E
Sbjct: 239 KLG--LNV-SDPGLWCGYHRPAFLQPTDPRFQEIASLYYKELNKLYGK-ANFYSMDPFHE 294

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             +   VD    + + G AI   M+  +  AVW+ Q W        +      ++ ++  
Sbjct: 295 GGSVAGVD----LDAAGKAIMQAMKKNNPKAVWVAQAW--------QANPRSQMIENLKA 342

Query: 175 GKLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           G ++VLDLF+E +P        W     F    +I+CML N+ GN+ ++G +  +     
Sbjct: 343 GDMIVLDLFSESRPQWGDPESTWHRKDGFGQHDWIYCMLLNYGGNVGLHGKMAHVIDEYY 402

Query: 228 EARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVP 286
           +A+ S    T+ GVGM+MEG E NPV+++L++E+ ++    D   W+  Y+V RYG++ P
Sbjct: 403 KAKESSFGKTLCGVGMTMEGSENNPVMFELLTELPWRPVHFDKNEWLKNYTVARYGKANP 462

Query: 287 AIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
            +Q+AW +L +++YNC    T +  +  +  A P   P ++S                  
Sbjct: 463 TVQEAWILLSNSIYNCPPENTQQGTHESIFCARPSDHPYLVSSW---------------- 506

Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
             SE S Y      Y+  +VIRA  + ++  ++ + +N + YDL+D+ RQA+A+    L 
Sbjct: 507 --SEMSDY------YNPDDVIRAAAMMVSVADQFTGNNNFEYDLVDIVRQAIAE-KGRLV 557

Query: 405 LNIIEA-YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
             ++EA +   D       + RFL+L+   D LL     F +G W+   + L    E++
Sbjct: 558 EKVVEASFASGDKQLYNTAANRFLQLLLLQDELLGTRPEFKVGNWIARTRSLGNTPEEK 616


>gi|210631701|ref|ZP_03296968.1| hypothetical protein COLSTE_00853, partial [Collinsella stercoris
           DSM 13279]
 gi|210159960|gb|EEA90931.1| F5/8 type C domain protein, partial [Collinsella stercoris DSM
           13279]
          Length = 1906

 Score =  215 bits (548), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 142/465 (30%), Positives = 228/465 (49%), Gaps = 37/465 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+  GGPLP SW +Q++ L ++I  R+   G++PV+  F G VP   Q   P++   
Sbjct: 354 MQNLYSVGGPLPDSWFEQRVELARRIHDRMQTYGIDPVIQGFGGQVPTDFQQKNPNSVAA 413

Query: 61  QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
             G+W S  + P    TYL DA      +  F ++G  F E Q + +G+ SH Y  D F 
Sbjct: 414 SSGSW-SGFARPYMIKTYLTDADRAAGKEDYFQKVGTTFYEAQERIFGKVSHFYAVDPFH 472

Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
           E    V     I  +   +   M   D  AVW+MQ W +  D          L       
Sbjct: 473 EG-GTVPQGFNIVDIYRTVQQKMLDYDPQAVWVMQQWQWGIDE-------NKLSGLAKKE 524

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
           + +VLDL ++++   ++  +   VP++W MLHNF G + M G+ + +A    +A  S N 
Sbjct: 525 QSLVLDLQSDLRS-QASPMENQQVPWVWNMLHNFGGRMGMDGVPEVLAIKIPQAYNS-NR 582

Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
            M G+G++ E I+ +P+VY+L+ +M ++ + VD +AW   Y  RRYG +   IQ+AW++L
Sbjct: 583 YMRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRAWTRSYIERRYGGTDAKIQEAWDIL 642

Query: 296 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
             T Y   DG           +     SI++          +P   +       S++ H 
Sbjct: 643 LDTAYKHVDGEY---------YQGASESIMNA---------RPSDNKI---GSASTWGHS 681

Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
            + Y   E  RA +LFI S +    S  +RYD +D+ RQ LA    E      +AY+  D
Sbjct: 682 DIDYDKKEFERAAQLFIESYDTYKDSEAFRYDFVDVMRQVLANAFQEYQPLAGDAYKQRD 741

Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
           A     L+ + LE+++  D +L+    F+LG W+E+A+ L ++ +
Sbjct: 742 AERFELLANQMLEMLDAQDRMLSTSSDFMLGTWIENARTLLEDAD 786


>gi|423226735|ref|ZP_17213200.1| hypothetical protein HMPREF1062_05386 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392627008|gb|EIY21049.1| hypothetical protein HMPREF1062_05386 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 718

 Score =  215 bits (548), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 142/474 (29%), Positives = 233/474 (49%), Gaps = 57/474 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
           M+NL GWGGP P SW  QQ  LQKKIL R+ E G+ PV P +SG VP  A + +  +   
Sbjct: 189 MNNLEGWGGPNPDSWYVQQEALQKKILKRMREYGIKPVFPGYSGMVPHDADEKLGLNLTK 248

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
           + L N F+  +         L  TD  F EI   + ++Q K +G+  + Y+ D F E  N
Sbjct: 249 SDLWNGFTRPA--------FLQPTDVRFAEIADLYYQEQEKLFGKVDY-YSMDPFHEAEN 299

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
              VD      + G AI + M+  +  A W++QGW  + +P  RP  +K + N    G L
Sbjct: 300 AASVD----FDAAGKAIMAAMKKVNPKATWVVQGW--TENP--RPEMIKNMQN----GDL 347

Query: 178 VVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
           ++LDLF+E +P      IW   K +    +++CML NF GN+ ++G +D +       + 
Sbjct: 348 LILDLFSECRPMWGIPSIWKRDKGYEQHNWLFCMLLNFGGNVGLHGRMDQLLDNFYLTKN 407

Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
           +     + G+G++MEG E NP++++LM E+ ++ EK   + W+  Y   RYG     I+ 
Sbjct: 408 NPLAVHLKGIGLTMEGAENNPMMFELMCELPWRPEKFTKEEWLKDYLFARYGVRDEKIEK 467

Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
           AW +L +T+YNC  G   +  +  +    P ++ +  + +  K +NY  P   E      
Sbjct: 468 AWTLLANTIYNCPFGNNQQGPHESIFCGRPSLN-NFQASSWSKMKNYYDPTVTE------ 520

Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
                             A  L +   ++   +N + YDL+D+ RQ+L+     ++   I
Sbjct: 521 -----------------EAARLMVEVADKYRGNNNFEYDLVDIVRQSLSDKGRIVYNRTI 563

Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
             ++  D     + SR+FL+++   D LL     F +G W+E A+ L    E++
Sbjct: 564 ADFKSFDKRSFARDSRKFLDILLLQDKLLGTRSEFRVGRWIEQARNLGTTPEEK 617


>gi|224537466|ref|ZP_03678005.1| hypothetical protein BACCELL_02345 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224520904|gb|EEF90009.1| hypothetical protein BACCELL_02345 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 721

 Score =  215 bits (548), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 142/474 (29%), Positives = 233/474 (49%), Gaps = 57/474 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
           M+NL GWGGP P SW  QQ  LQKKIL R+ E G+ PV P +SG VP  A + +  +   
Sbjct: 192 MNNLEGWGGPNPDSWYVQQEALQKKILKRMREYGIKPVFPGYSGMVPHDADEKLGLNLTK 251

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
           + L N F+  +         L  TD  F EI   + ++Q K +G+  + Y+ D F E  N
Sbjct: 252 SDLWNGFTRPA--------FLQPTDVRFAEIADLYYQEQEKLFGKVDY-YSMDPFHEAEN 302

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
              VD      + G AI + M+  +  A W++QGW  + +P  RP  +K + N    G L
Sbjct: 303 AASVD----FDAAGKAIMAAMKKVNPKATWVVQGW--TENP--RPEMIKNMQN----GDL 350

Query: 178 VVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
           ++LDLF+E +P      IW   K +    +++CML NF GN+ ++G +D +       + 
Sbjct: 351 LILDLFSECRPMWGIPSIWKRDKGYEQHNWLFCMLLNFGGNVGLHGRMDQLLDNFYLTKN 410

Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
           +     + G+G++MEG E NP++++LM E+ ++ EK   + W+  Y   RYG     I+ 
Sbjct: 411 NPLAVHLKGIGLTMEGAENNPMMFELMCELPWRPEKFTKEEWLKDYLFARYGVRDEKIEK 470

Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
           AW +L +T+YNC  G   +  +  +    P ++ +  + +  K +NY  P   E      
Sbjct: 471 AWTLLANTIYNCPFGNNQQGPHESIFCGRPSLN-NFQASSWSKMKNYYDPTVTE------ 523

Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
                             A  L +   ++   +N + YDL+D+ RQ+L+     ++   I
Sbjct: 524 -----------------EAARLMVEVADKYRGNNNFEYDLVDIVRQSLSDKGRIVYNRTI 566

Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
             ++  D     + SR+FL+++   D LL     F +G W+E A+ L    E++
Sbjct: 567 ADFKSFDKRSFARDSRKFLDILLLQDKLLGTRSEFRVGRWIEQARNLGTTPEEK 620


>gi|18309848|ref|NP_561782.1| alpha-N-acetylglucosaminidase [Clostridium perfringens str. 13]
 gi|18144526|dbj|BAB80572.1| probable alpha-N-acetylglucosaminidase [Clostridium perfringens
           str. 13]
 gi|288872041|dbj|BAI70446.1| alpha-N-acetylglucosaminidase [Clostridium perfringens]
          Length = 2104

 Score =  215 bits (548), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 140/463 (30%), Positives = 230/463 (49%), Gaps = 41/463 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +   P A+  
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F  +   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE+  M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
           VG+G++ E I  NP+ ++L+ +MA+  ++++ + W   Y  RRYG++   I DAWN++  
Sbjct: 603 VGIGITPEAINTNPLAHELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           T Y        K  D    +     SII+   G    +G        +KS  S++ H  +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +  
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
               +S +FLEL++  + +L+    FL+G W+E A+ + ++ +
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSD 802


>gi|189465172|ref|ZP_03013957.1| hypothetical protein BACINT_01517 [Bacteroides intestinalis DSM
           17393]
 gi|189437446|gb|EDV06431.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides intestinalis DSM
           17393]
          Length = 723

 Score =  215 bits (547), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 142/474 (29%), Positives = 233/474 (49%), Gaps = 57/474 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
           M+NL GWGGP P SW  QQ  LQKKIL R+ E G+ PV P +SG VP  A + +  +   
Sbjct: 192 MNNLEGWGGPNPDSWYAQQEALQKKILKRMREYGIKPVFPGYSGMVPHDADEKLGLNLTK 251

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD--EN 117
           + L N F+  +         L  TD  F EI   +  +Q K +G+  + Y+ D F   EN
Sbjct: 252 SDLWNGFTRPA--------FLQPTDARFAEIADLYYREQEKLFGKADY-YSMDPFHEAEN 302

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
              VD      + G AI + M+  +  A W++QGW  + +P  RP  +K + N    G L
Sbjct: 303 AASVD----FDAAGKAIMTAMKKVNPKATWVVQGW--TENP--RPEMIKNMQN----GDL 350

Query: 178 VVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
           ++LDLF+E +P      IW   K +    +++CML NF GN+ ++G +D +       + 
Sbjct: 351 LILDLFSECRPMWGIPSIWKRDKGYEQHDWLFCMLLNFGGNVGLHGRMDQLLNNFYLTKN 410

Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
           +   T + G+G++MEG E N ++++LM E+ ++ EK   + W+  Y   RYG     I+ 
Sbjct: 411 NPLATHLKGIGLTMEGSENNAMMFELMCELPWRPEKFTKEEWLKDYLFARYGVRDEKIEQ 470

Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
           AW +L +T+YNC  G   +  +  +    P ++ +  + +  K +NY  P   E      
Sbjct: 471 AWTLLANTIYNCPFGNNQQGPHESIFCGRPSLN-NFQASSWSKMKNYYDPTVTE------ 523

Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
                             A  L +   ++   +N + YDL+D+ RQ+L+     ++   I
Sbjct: 524 -----------------EAARLMLEVADKYRGNNNFEYDLVDIVRQSLSDKGRIVYNRTI 566

Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
             ++  D     + SR+FL+++   D LL     F +G W+E A++L    E++
Sbjct: 567 ADFKSFDKRSFARDSRKFLDILLLQDKLLGTRSEFRVGRWIEQARKLGTTPEEK 620


>gi|423214208|ref|ZP_17200736.1| hypothetical protein HMPREF1074_02268 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392693153|gb|EIY86388.1| hypothetical protein HMPREF1074_02268 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 718

 Score =  215 bits (547), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 137/468 (29%), Positives = 230/468 (49%), Gaps = 54/468 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP A     P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEAFAQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAGYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNAMTGDLDMYASSS 431

Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
           V+A R +    ++G G + EG+E N VVY+L+++M +  + +D+  W+  Y   RYG   
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
            A+++AW +   T Y+            + ++P      +   + +       +SK  + 
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVVPDQRR-------ISKIDL- 532

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
                          + + ++A+ L+ +  +EL +S  YR DLI+     +A  A   + 
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
             ++    N      +  ++ ++L+ D+D LLA H  + L  W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELAR 625


>gi|160884066|ref|ZP_02065069.1| hypothetical protein BACOVA_02042 [Bacteroides ovatus ATCC 8483]
 gi|423291473|ref|ZP_17270321.1| hypothetical protein HMPREF1069_05364 [Bacteroides ovatus
           CL02T12C04]
 gi|156110408|gb|EDO12153.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
           8483]
 gi|392663473|gb|EIY57023.1| hypothetical protein HMPREF1069_05364 [Bacteroides ovatus
           CL02T12C04]
          Length = 718

 Score =  215 bits (547), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 137/468 (29%), Positives = 230/468 (49%), Gaps = 54/468 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP A     P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEAFAQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
           V+A R +    ++G G + EG+E N VVY+L+++M +  + +D+  W+  Y   RYG   
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMTIYCEARYGGYP 491

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
            A+++AW +   T Y+            + ++P      +   + +       +SK  + 
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKIDL- 532

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
                          + + ++A+ L+ +  +EL +S  YR DLI+     +A  A   + 
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
             ++    N      +  ++ ++L+ D+D LLA H  + L  W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELAR 625


>gi|237719039|ref|ZP_04549520.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
 gi|229451817|gb|EEO57608.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
          Length = 718

 Score =  214 bits (545), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 136/468 (29%), Positives = 229/468 (48%), Gaps = 54/468 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP       P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
           V+A R +    ++G G + EG+E N VVY+L+++M +  + +D+  W+  Y   RYG   
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
            A+++AW +   T Y+            + ++P      +   + +       +SK  + 
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKIDL- 532

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
                          + + ++A+ L+++  +EL  S  YR DLI+     +A  A   + 
Sbjct: 533 ---------------SDDYLQAIRLYVSCADELKGSELYRNDLIEFVSYYVAAKAENFYK 577

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
             ++    N      +  ++ ++L+ D+D LLA H  + L  W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDKLLASHPLYRLEEWVELAR 625


>gi|150004413|ref|YP_001299157.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
 gi|149932837|gb|ABR39535.1| glycoside hydrolase family 89 [Bacteroides vulgatus ATCC 8482]
          Length = 717

 Score =  214 bits (545), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 153/480 (31%), Positives = 238/480 (49%), Gaps = 67/480 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P+SW  +Q  LQKKI+ R+ E G+ PVLP + G VP   +         
Sbjct: 187 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 238

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V +DP + C+Y     L   D  F EI   +  +  K YG+T   Y  D F E
Sbjct: 239 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 294

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             +T  V+    + + G AI   M+  + DAVW+ Q W    +P  R P    ++  +  
Sbjct: 295 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 342

Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSI--AFG 225
           G L+VLDL +E +P W         K  YG   +++CML NF GNI ++G +D++   F 
Sbjct: 343 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYCMLLNFGGNIGLHGKMDALIDGFY 402

Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
             +A      T+ GVGM+ EGIE NPV+Y+L+ E+ ++  +     W+  Y   RYG   
Sbjct: 403 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 462

Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
            A+Q  W++L + +YN       +  +  V  A P +D          YQ          
Sbjct: 463 EALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 503

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                 SS+     +Y+  +VI A  L ++  ++   +N + +DL+D+ RQALA+    +
Sbjct: 504 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 558

Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
              +  A++  D   VF+L S+ FL L+   D LL     F +G W+E+A+   Q +E++
Sbjct: 559 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 617


>gi|294775488|ref|ZP_06741000.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
 gi|294450633|gb|EFG19121.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
          Length = 712

 Score =  214 bits (545), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 153/480 (31%), Positives = 238/480 (49%), Gaps = 67/480 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P+SW  +Q  LQKKI+ R+ E G+ PVLP + G VP   +         
Sbjct: 182 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 233

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V +DP + C+Y     L   D  F EI   +  +  K YG+T   Y  D F E
Sbjct: 234 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 289

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             +T  V+    + + G AI   M+  + DAVW+ Q W    +P  R P    ++  +  
Sbjct: 290 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 337

Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSI--AFG 225
           G L+VLDL +E +P W         K  YG   +++CML NF GNI ++G +D++   F 
Sbjct: 338 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYCMLLNFGGNIGLHGKMDALIDGFY 397

Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
             +A      T+ GVGM+ EGIE NPV+Y+L+ E+ ++  +     W+  Y   RYG   
Sbjct: 398 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 457

Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
            A+Q  W++L + +YN       +  +  V  A P +D          YQ          
Sbjct: 458 EALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 498

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                 SS+     +Y+  +VI A  L ++  ++   +N + +DL+D+ RQALA+    +
Sbjct: 499 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 553

Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
              +  A++  D   VF+L S+ FL L+   D LL     F +G W+E+A+   Q +E++
Sbjct: 554 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 612


>gi|393786624|ref|ZP_10374756.1| hypothetical protein HMPREF1068_01036 [Bacteroides nordii
           CL02T12C05]
 gi|392657859|gb|EIY51489.1| hypothetical protein HMPREF1068_01036 [Bacteroides nordii
           CL02T12C05]
          Length = 717

 Score =  214 bits (545), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 147/477 (30%), Positives = 232/477 (48%), Gaps = 61/477 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P  W  QQ+ LQKKIL R++E G+ PVLP + G VP   +         
Sbjct: 187 MNNLEGWGGPNPDHWYTQQVSLQKKILKRMHEYGIEPVLPGYCGMVPHNAK--------A 238

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V SDP   C Y     L   D  F EI   + ++  K YG+ ++ Y+ D F E
Sbjct: 239 KLG--LNV-SDPGVWCGYRRPAFLQPDDSRFEEISSLYYKELEKLYGKANY-YSMDPFHE 294

Query: 117 NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
               +D    + ++G A+   M+  +  AVW++Q W  +  P         L+ ++  G 
Sbjct: 295 GGS-IDGVN-LDAVGKAVMKAMKKANPKAVWVIQAWQANPRP--------ELIRNLETGD 344

Query: 177 LVVLDLFAEVKPIWST------SKQFYGVP-YIWCMLHNFAGNIEMYGILDSIAFGPVEA 229
           L++LDL +E +P W         K  YG   +++CML N+  N+ ++G +D++      A
Sbjct: 345 LLILDLTSECRPQWGDPESEWYRKDGYGKHNWVYCMLLNYGANVGLHGKMDNVIDNYYLA 404

Query: 230 RTS--ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA 287
           + +     T+ GVGM+ EGIE NPV+Y+L+ E+ ++ E+   + W+  Y   RYG+  P 
Sbjct: 405 KENLRARATLKGVGMTPEGIENNPVMYELLMELPWRPERFTKEDWLKGYVKARYGKDEPV 464

Query: 288 IQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
           +Q AW  L +++YN     T +  +  V  A P +D          YQ            
Sbjct: 465 LQLAWGKLANSIYNAPKELTQQGTHESVFCARPGLDV---------YQ------------ 503

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
               SS+     +Y   EVI A  L ++  +    +  + YDL+D+ RQA+A+    +  
Sbjct: 504 ---VSSWSEMKDYYDPQEVIEAARLMVSVADRYRGNTNFEYDLVDVVRQAIAEKGRLMQK 560

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
            +  AY+  D       S++FL L+   D LL     F LG W+ SA+ L    E++
Sbjct: 561 AVTTAYRAGDKELFAMASQKFLNLILLQDQLLGTRTEFRLGRWINSARALGVTPEEK 617


>gi|345519733|ref|ZP_08799147.1| glycoside hydrolase family 89 [Bacteroides sp. 4_3_47FAA]
 gi|345457107|gb|EET15964.2| glycoside hydrolase family 89 [Bacteroides sp. 4_3_47FAA]
          Length = 717

 Score =  214 bits (545), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 153/480 (31%), Positives = 238/480 (49%), Gaps = 67/480 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P+SW  +Q  LQKKI+ R+ E G+ PVLP + G VP   +         
Sbjct: 187 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 238

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V +DP + C+Y     L   D  F EI   +  +  K YG+T   Y  D F E
Sbjct: 239 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 294

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             +T  V+    + + G AI   M+  + DAVW+ Q W    +P  R P    ++  +  
Sbjct: 295 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 342

Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSI--AFG 225
           G L+VLDL +E +P W         K  YG   +++CML NF GNI ++G +D++   F 
Sbjct: 343 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYCMLLNFGGNIGLHGKMDALIDGFY 402

Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
             +A      T+ GVGM+ EGIE NPV+Y+L+ E+ ++  +     W+  Y   RYG   
Sbjct: 403 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 462

Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
            A+Q  W++L + +YN       +  +  V  A P +D          YQ          
Sbjct: 463 EALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 503

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                 SS+     +Y+  +VI A  L ++  ++   +N + +DL+D+ RQALA+    +
Sbjct: 504 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 558

Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
              +  A++  D   VF+L S+ FL L+   D LL     F +G W+E+A+   Q +E++
Sbjct: 559 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 617


>gi|295085513|emb|CBK67036.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
           XB1A]
          Length = 718

 Score =  214 bits (545), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 137/468 (29%), Positives = 229/468 (48%), Gaps = 54/468 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP       P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +WST +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWSTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
           V+A R +    ++G G + EG+E N VVY+L+++M +  + +D+  W+  Y   RYG   
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
            A+++AW +   T Y+            + ++P      +   + +       +SK  + 
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKIDL- 532

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
                          + + ++A+ L+ +  +EL +S  YR DLI+     +A  A   + 
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
             ++    N      +  ++ ++L+ D+D LLA H  + L  W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELAR 625


>gi|319643377|ref|ZP_07998003.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_40A]
 gi|317385006|gb|EFV65959.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_40A]
          Length = 718

 Score =  214 bits (545), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 153/480 (31%), Positives = 238/480 (49%), Gaps = 67/480 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGP P+SW  +Q  LQKKI+ R+ E G+ PVLP + G VP   +         
Sbjct: 188 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 239

Query: 61  QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
           +LG   +V +DP + C+Y     L   D  F EI   +  +  K YG+T   Y  D F E
Sbjct: 240 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 295

Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
             +T  V+    + + G AI   M+  + DAVW+ Q W    +P  R P    ++  +  
Sbjct: 296 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 343

Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSI--AFG 225
           G L+VLDL +E +P W         K  YG   +++CML NF GNI ++G +D++   F 
Sbjct: 344 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYCMLLNFGGNIGLHGKMDALIDGFY 403

Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
             +A      T+ GVGM+ EGIE NPV+Y+L+ E+ ++  +     W+  Y   RYG   
Sbjct: 404 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 463

Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
            A+Q  W++L + +YN       +  +  V  A P +D          YQ          
Sbjct: 464 EALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 504

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                 SS+     +Y+  +VI A  L ++  ++   +N + +DL+D+ RQALA+    +
Sbjct: 505 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 559

Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
              +  A++  D   VF+L S+ FL L+   D LL     F +G W+E+A+   Q +E++
Sbjct: 560 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 618


>gi|404487024|ref|ZP_11022211.1| hypothetical protein HMPREF9448_02667 [Barnesiella intestinihominis
           YIT 11860]
 gi|404335520|gb|EJZ61989.1| hypothetical protein HMPREF9448_02667 [Barnesiella intestinihominis
           YIT 11860]
          Length = 722

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 147/476 (30%), Positives = 228/476 (47%), Gaps = 58/476 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+GW GPL   W  +Q+ LQ KIL R+ ELGM+P+ PAF+G VP A     P  +  
Sbjct: 200 MGNLNGWDGPLTNGWQKEQIKLQHKILNRMRELGMDPIAPAFAGFVPTAFAERHPEIQFK 259

Query: 61  QLGNW--FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
            L  W  F  K +      Y+L    P F EIG+ FIE+  KE+G+ ++ Y  D+F+E  
Sbjct: 260 HL-EWGGFDEKYN-----AYVLPPETPYFKEIGKLFIEEWEKEFGKNTY-YLSDSFNEMK 312

Query: 119 PPV------DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNS 171
            PV         + ++  G +IY  + +G+ DAVW+ QGW F Y   FW    ++ALL+ 
Sbjct: 313 LPVAEGDDDGKHKLLAQYGESIYHSIAAGNPDAVWVTQGWTFGYQHDFWDKASLQALLSR 372

Query: 172 VPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
           VP  K++++DL  +  K +W T +       FYG  +I+  + NF G   M G L   A 
Sbjct: 373 VPDDKMIIIDLGNDYPKWVWGTEQTWKNHDGFYGKKWIFSYVPNFGGKTPMTGDLQMYAT 432

Query: 225 GPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
              EA  S N   +VG G + EG+E N VVY+L+++M +  + +D+ +W+  Y   RYG 
Sbjct: 433 SSAEALHSANAGNLVGFGSAPEGLENNEVVYELLADMGWTADSIDLDSWLPVYCKARYGG 492

Query: 284 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
              A+  AW     T Y+            +V  PD                 + +SK  
Sbjct: 493 CPAAMDSAWQRFKETAYSSLYSYPRFTWQTVV--PDT----------------RRISKLD 534

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
           V  S                 ++ +ELF++  + L +S  Y  D I+     LA  A++ 
Sbjct: 535 VSDS----------------FLQGVELFLSCADSLESSPLYVNDAIEYASYYLAAKADDC 578

Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
           +   ++   L +     Q   R +E++ D+D LLA H  + L  W++ A+   + +
Sbjct: 579 YKRALKEDSLGNRVAAMQQLDRSVEILLDVDKLLASHPLYRLEEWVDMARDWGKTD 634


>gi|293371915|ref|ZP_06618319.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
           3f]
 gi|292633161|gb|EFF51738.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
           3f]
          Length = 718

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 137/468 (29%), Positives = 228/468 (48%), Gaps = 54/468 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP       P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
           V+A R +    ++G G + EG+E N VVY+L+++M +  + +D+  W+  Y   RYG   
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
            A+++AW +   T Y+            + ++P      +             +S +  +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTV-------------ISDQRRI 527

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
                S D+          ++A+ L+ +  +EL +S  YR DLI+     +A  A   + 
Sbjct: 528 SKIDLSDDY----------LQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
             ++    N      +  ++ ++L+ D+D LLA H  + L  W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELAR 625


>gi|429740221|ref|ZP_19273923.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
 gi|429153946|gb|EKX96707.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
          Length = 721

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 146/480 (30%), Positives = 233/480 (48%), Gaps = 56/480 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W  QQ+ LQ KIL R+  LGM+P+ PAF+G VP     + P  ++ 
Sbjct: 197 MGNLNTWNGPLSANWHSQQIALQHKILERMRLLGMHPITPAFAGFVPEGFVKLHPEVRVK 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
                F      +    Y+L    P F++IG+ FIE+  KE+ + ++ Y  D+F+E   P
Sbjct: 257 H----FEWGGFDKSLNAYMLPPDSPYFLQIGKLFIEEWEKEFSKNTY-YLSDSFNEMELP 311

Query: 121 VDSPE-------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSV 172
           V SP+        +S  G AIY  + +G+ +AVW+ QGW F Y   FW    ++ALL  V
Sbjct: 312 V-SPDDTDGKHRLLSKYGEAIYQSIVAGNPNAVWITQGWTFGYQHRFWDKESLQALLERV 370

Query: 173 PLGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
           P  KL+++DL  +        +  W T K FYG  +I   + NF G   + G L+  A  
Sbjct: 371 PNDKLIIVDLANDYPKWVWKTEQTWKTHKGFYGKRWILSYVPNFGGKTLLTGDLNLYASC 430

Query: 226 PVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS 284
             EA    +   ++G G + EG+E N VVY+L+++M +Q++ +D+  W+ +Y   RYG  
Sbjct: 431 SAEALAHPDKGRLIGFGSAPEGLENNEVVYELLADMGWQNQPIDLDHWLIEYCRSRYGSC 490

Query: 285 VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
             A+Q AW  L  +VY+                     S+ S     +Q     V  + +
Sbjct: 491 PNAMQKAWKGLCRSVYS---------------------SLYSYPRFTWQT----VIPDTL 525

Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
            KS+   YD    ++      RA+E F+    +L  S  YR D +    Q +   A+ L+
Sbjct: 526 RKSK---YDFNDTYF------RAVEDFLLCAPQLKDSPLYRSDALLFAAQYIGAKADNLY 576

Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
              ++A  + +     QL  + ++L+   D LLA H    L  W+++A+  A   ++  Q
Sbjct: 577 RKALQAKAVGNRARAKQLVDKVIQLLLQADKLLASHPTDRLSRWVDAARTAAATPQERMQ 636


>gi|336404352|ref|ZP_08585050.1| hypothetical protein HMPREF0127_02363 [Bacteroides sp. 1_1_30]
 gi|335943680|gb|EGN05519.1| hypothetical protein HMPREF0127_02363 [Bacteroides sp. 1_1_30]
          Length = 718

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 137/468 (29%), Positives = 228/468 (48%), Gaps = 54/468 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP       P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
           V+A R +    ++G G + EG+E N VVY+L+++M +  + +D+  W+  Y   RYG   
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
            A+++AW +   T Y+            + ++P      +             +S +  +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTV-------------ISDQRRI 527

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
                S D+          ++A+ L+ +  +EL +S  YR DLI+     +A  A   + 
Sbjct: 528 SKIDLSDDY----------LQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
             ++    N      +  ++ ++L+ D+D LLA H  + L  W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELAR 625


>gi|299144719|ref|ZP_07037787.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
 gi|298515210|gb|EFI39091.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
          Length = 718

 Score =  213 bits (542), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 136/468 (29%), Positives = 229/468 (48%), Gaps = 54/468 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP       P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKADKEAKYKLLAEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
           V+A R +    ++G G + EG+E N VVY+L+++M +  + +D+  W+  Y   RYG   
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
            A+++AW +   T Y+            + ++P      +   + +       +SK  + 
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVVPDQRR-------ISKIDL- 532

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
                          + + ++A+ L+ +  +EL +S  YR DLI+     +A  A   + 
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
             ++    N      +  ++ ++L+ D+D LLA H  + L  W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELAR 625


>gi|170292392|pdb|2VC9|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
           In Complex With 2-Acetamido-1,2-Dideoxynojirmycin
 gi|170292393|pdb|2VCA|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
           In Complex With Beta-N-Acetyl-D-Glucosamine
 gi|170292394|pdb|2VCB|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
           In Complex With Pugnac
 gi|170292395|pdb|2VCC|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
          Length = 891

 Score =  213 bits (542), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 136/462 (29%), Positives = 229/462 (49%), Gaps = 39/462 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +     A+  
Sbjct: 344 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTI 403

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 404 SQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 462

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 463 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 511

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   +    + +  MV
Sbjct: 512 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLA-TEIPKALANSEHMV 569

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           G+G++ E I  NP+ Y+L+ +MA+  ++++ + W   Y  RRYG++   I +AWN++  T
Sbjct: 570 GIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILDT 629

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
            Y        K  D    +     SII+   G    +G        +KS  S++ H  + 
Sbjct: 630 AYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIV 666

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
           Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +   
Sbjct: 667 YDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEK 726

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
              +S +FLEL++  + +L+    FL+G W+E A+ + ++ +
Sbjct: 727 FKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSD 768


>gi|261880159|ref|ZP_06006586.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
 gi|270333130|gb|EFA43916.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
          Length = 772

 Score =  213 bits (542), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 143/485 (29%), Positives = 229/485 (47%), Gaps = 65/485 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGPLP +W  QQ  LQK+IL R  ELGM+PVLP + G +P   +       +T
Sbjct: 193 MNNLEGWGGPLPDAWYAQQEALQKRILKREKELGMSPVLPGYCGMMPHDAKAKL-GLDVT 251

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W              L ATDP F  I   +  +  + YG+  + Y+ D F E+  P
Sbjct: 252 DGGTWNGYTRPAN------LSATDPKFDHIADLYYRELTRLYGKADY-YSMDPFHES--P 302

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D+    +  G  + + M+  +  + W++QGW+   +P  RP  ++AL    P G +++L
Sbjct: 303 DDASVDYAEAGRKLLAAMKRANGKSNWVIQGWM--ENP--RPQMIEAL----PEGDIIIL 354

Query: 181 DLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD------SIAFGPVE 228
           DLF+E +P      IW   + +    +++CML NF  N+ ++G +D       +A  P  
Sbjct: 355 DLFSECRPMFGAPSIWQRKEGYGRHNWLFCMLENFGANVGLHGRMDQLVHNFKLAASPST 414

Query: 229 ARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF--------QHEKVDVK-AWINQYSVR 279
              +    + G+G +MEG E NP++++LMSE+ +        + ++ D K  W   Y   
Sbjct: 415 PYQNARKHLKGIGFTMEGSENNPIMFELMSELVWRANDLVSAERDRRDFKEGWTRNYVKA 474

Query: 280 RYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGK 337
           RYG   P IQ+AW +L  ++YNC  G   +  +  +    P +D   +  +  K +NY  
Sbjct: 475 RYGIDNPKIQEAWQLLIGSIYNCPVGNNQQGPHESIFNGRPSLDNFQVK-SWSKMRNY-- 531

Query: 338 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 397
                                Y  +  +RA +L  +  +    +N + YDL+D+ RQA+ 
Sbjct: 532 ---------------------YDPNVTLRAAQLMTSVADRYRGNNNFEYDLVDIVRQAMD 570

Query: 398 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 457
             A   +L  I  Y+  D       S RFL ++   D LL     F LG  +E A+ L+ 
Sbjct: 571 DQARLQYLRTIADYKGFDRTAFSADSARFLNMLLLQDKLLGTRQEFRLGTRIEQARSLST 630

Query: 458 NEEQE 462
             E++
Sbjct: 631 TLEEK 635


>gi|110800516|ref|YP_695309.1| alpha-N-acetylglucosaminidase [Clostridium perfringens ATCC 13124]
 gi|110675163|gb|ABG84150.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens ATCC 13124]
          Length = 2095

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 139/463 (30%), Positives = 230/463 (49%), Gaps = 41/463 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +     A+  
Sbjct: 369 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTI 428

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F E  NT
Sbjct: 429 SQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 487

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 488 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 536

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   P     SE+  M
Sbjct: 537 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 593

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
           VG+G++ E I  NP+ Y+L+ +MA+  ++++ + W   Y  RRYG++   I +AWN++  
Sbjct: 594 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILD 653

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           T Y        K  D    +     SII+   G    +G        +KS  S++ H  +
Sbjct: 654 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 690

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +  
Sbjct: 691 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 750

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
               +S +FLEL++  + +L+    FL+G W+E A+ + ++ +
Sbjct: 751 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSD 793


>gi|423293381|ref|ZP_17271508.1| hypothetical protein HMPREF1070_00173 [Bacteroides ovatus
           CL03T12C18]
 gi|392678324|gb|EIY71732.1| hypothetical protein HMPREF1070_00173 [Bacteroides ovatus
           CL03T12C18]
          Length = 718

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 136/468 (29%), Positives = 229/468 (48%), Gaps = 54/468 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP       P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPITPAFAGFVPEGFVQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
           V+A R +    ++G G + EG+E N VVY+L+++M +  + +D+  W+  Y   RYG   
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
            A+++AW +   T Y+            + ++P      +   + +       +SK  + 
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKIDL- 532

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
                          + + ++A+ L+ +  +EL +S  YR DLI+     +A  A   + 
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
             ++    N      +  ++ ++L+ D+D LLA H  + L  W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELAR 625


>gi|298480124|ref|ZP_06998323.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
 gi|298273933|gb|EFI15495.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
          Length = 718

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 136/468 (29%), Positives = 229/468 (48%), Gaps = 54/468 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP       P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAGYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
           V+A R +    ++G G + EG+E N VVY+L+++M +  + +D+  W+  Y   RYG   
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
            A+++AW +   T Y+            + ++P      +   + +       +SK  + 
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKIDL- 532

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
                          + + ++A+ L+ +  +EL +S  YR DLI+     +A  A   + 
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
             ++    N      +  ++ ++L+ D+D LLA H  + L  W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELAR 625


>gi|383115207|ref|ZP_09935965.1| hypothetical protein BSGG_2911 [Bacteroides sp. D2]
 gi|313695376|gb|EFS32211.1| hypothetical protein BSGG_2911 [Bacteroides sp. D2]
          Length = 718

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 136/468 (29%), Positives = 228/468 (48%), Gaps = 54/468 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP       P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
           V+A R +    ++G G + EG+E N VVY+L+++M +  + +D+  W+  Y   RYG   
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
            A+++AW +   T Y+            + ++P      +   + +       +SK  + 
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVVPDQRR-------ISKIDL- 532

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
                          + + ++A+ L+ +  +EL  S  YR DLI+     +A  A   + 
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKGSELYRNDLIEFVSYYVAAKAENFYK 577

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
             ++    N      +  ++ ++L+ D+D LLA H  + L  W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELAR 625


>gi|322699924|gb|EFY91682.1| alpha-N-acetylglucosaminidase, putative [Metarhizium acridum CQMa
           102]
          Length = 775

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 131/430 (30%), Positives = 226/430 (52%), Gaps = 36/430 (8%)

Query: 5   HGWGGP--LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQL 62
             WGG   LP ++++ Q  LQKKI+ R+ ELG+ PVLPAF G VP +++ V P   +T  
Sbjct: 211 RSWGGKGDLPLAFIELQFELQKKIVARMVELGITPVLPAFPGFVPESIKKVRPDVNLTVS 270

Query: 63  GNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVD 122
            NWF+   D ++     LD  D  + E+ R F+ +Q+  +G  ++IY  D F+E +P   
Sbjct: 271 PNWFAPAPD-KYTRDLFLDPLDDTYAELQRLFVSKQMDAFGNVTNIYTLDQFNELSPASG 329

Query: 123 SPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLGK-LVVL 180
              Y+  +    Y+G+ + +  AVWL+QGWL FS   FW  P++ A L  V   + ++VL
Sbjct: 330 DTAYLRGIARNTYAGLTAANPAAVWLLQGWLFFSSRRFWTQPRIDAYLGGVEDDQGMLVL 389

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL++E  P W  +  + G P+IWC LH+F GN+ + G + ++   P++A  +++ ++VG 
Sbjct: 390 DLYSEANPQWQRTNSYSGKPWIWCQLHDFGGNMALEGRVQTLTSAPIDA-LAQSESLVGF 448

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPA-IQDAWNVLYH 297
           G++ E  E N VVYD++ + A+    +D + +   +  +RY    S+P+ +  AW +L  
Sbjct: 449 GLTPEAYEGNEVVYDILLDQAWSATPLDTQTYFASWVTKRYAGVSSIPSELYRAWEMLRT 508

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
            VY+ T   TD  + V VA   + P++  +                   + T  + HP  
Sbjct: 509 DVYSNT--RTDIPQ-VPVATYQLRPALSGIA------------------NRTGHFPHPTA 547

Query: 358 WYSTSEVIRA-----LELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
            +    V++      LE     G+ L     ++ D +D++RQ L+   + L+ +++ AY+
Sbjct: 548 LHYDPLVLQEAWKLMLEAMTRQGS-LWKVPAFQLDFVDVSRQMLSNQFDVLYADLVNAYK 606

Query: 413 LNDAHGVFQL 422
            + A G  +L
Sbjct: 607 CSAAGGSREL 616


>gi|32564213|ref|NP_496948.2| Protein K09E4.4 [Caenorhabditis elegans]
 gi|25814792|emb|CAB70170.2| Protein K09E4.4 [Caenorhabditis elegans]
          Length = 715

 Score =  212 bits (540), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 139/471 (29%), Positives = 224/471 (47%), Gaps = 44/471 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GG L  + +     L K+I+ R+ ELG+ P+LP F+G VP  L+ +FP++K  
Sbjct: 210 MGNLKAYGGGLSDAQMLNDHNLAKRIIDRLLELGITPILPTFAGFVPDHLETLFPASKFN 269

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTP 119
           +L  W +  S+    C   +   DPLF +IG  F+  Q K +G   +++Y+ D F+E  P
Sbjct: 270 RLPRWNNFTSET--SCMLSVSPFDPLFQKIGSTFLRHQKKMFGGDVTNMYSADPFNEILP 327

Query: 120 PVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
              +    +++     AI +  +  D + VW++Q W F+YD  W    +K+ L+++P+G 
Sbjct: 328 SESAKFDAKFVKQTAQAIMNSCKKVDKNCVWVLQSWSFTYDQ-WPAWAIKSFLSAIPVGN 386

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
           L++LDL+AEV P W  +  F G  ++WC+LHNF G+ E+ G L  I  G   A     + 
Sbjct: 387 LLILDLYAEVVPAWQMTSSFQGHHFVWCLLHNFGGSRELRGNLQKIDKGYQLALMKAGSN 446

Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
           +VG G+SME I+QN VVY  M +  +  E + +  W+  YS  RY       Q  W +L 
Sbjct: 447 LVGAGLSMEAIDQNYVVYQFMIDRMWSPEPLPLNNWLKAYSESRYSADFKVAQKFWTLLA 506

Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
            T YN  +        V +                   Y +P     +            
Sbjct: 507 GTFYNQPEKWGTPRFSVFL-------------------YHRPGFGRKI-----------E 536

Query: 357 LWYSTSEVI-RALELFIASGNELSASNTYRYDLIDLTRQALA-KYANELFLNIIEAYQLN 414
            W+   E   R  EL  A  + L     +R DL D+ R+    +  NE  L++ EA+ + 
Sbjct: 537 YWFPVEETFSRFRELLPALVHVLGEHPLFREDLNDVMREMTQFEMGNEAALSMSEAFLME 596

Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
           D   V       +E+ + ++     +    +  W+E+AK +A   E E+QV
Sbjct: 597 DKQQVGASCEMLMEMFQKLES----YSNRDVRQWIENAKSIAPTSE-ERQV 642


>gi|293369246|ref|ZP_06615836.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
           3f]
 gi|292635671|gb|EFF54173.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
           3f]
          Length = 521

 Score =  212 bits (539), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 111/301 (36%), Positives = 172/301 (57%), Gaps = 6/301 (1%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GW  PLP+ WL  Q  LQ++I+ R  E  M PVLPAF+G+VPAAL+ V+P+ K T
Sbjct: 196 MCNLDGWQSPLPKEWLSSQAALQEQIVAREREFNMRPVLPAFAGHVPAALKRVYPNIKTT 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W       R  CT+L +  D L+  I + ++ +Q + YG T+HIY  D F+E  PP
Sbjct: 256 RVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTEQTRLYG-TNHIYGIDPFNEIDPP 311

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
               + +  +   IY  + + D +AVWL   WLF  D   W  P++K+ L SVP  +L++
Sbjct: 312 SWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYADIKHWTTPRIKSYLRSVPQDRLIL 371

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LD F E   IW  +  ++G PY+WC L NF GN  + G ++ ++    +A  +  + + G
Sbjct: 372 LDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLSGPVNLVSERLADALKNGGSNLKG 431

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VG ++EGI+ N  +Y+ + + A+   + D K W  + + RR G+  P  + AW +L + V
Sbjct: 432 VGSTLEGIDLNQFMYEFVLDKAWNGGQTD-KEWFFKLADRRIGKISPEARKAWEILANKV 490

Query: 300 Y 300
           Y
Sbjct: 491 Y 491


>gi|341892319|gb|EGT48254.1| hypothetical protein CAEBREN_28412 [Caenorhabditis brenneri]
          Length = 713

 Score =  211 bits (537), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 136/472 (28%), Positives = 227/472 (48%), Gaps = 46/472 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +GG L  + +   L L K+I+ R+ ELG+ P+LP F+G VP  L+ +FPS+K T
Sbjct: 208 MGNLKAYGGGLSDAQMLNDLNLAKRIINRLLELGITPILPTFAGFVPDQLEKLFPSSKFT 267

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTP 119
           +L  W +  S+    C   +   DPLF +IG  F+  Q K +G   +++Y+ D F+E  P
Sbjct: 268 RLPCWNNFTSET--SCLLSVSPFDPLFQKIGSLFLRHQKKMFGGDITNLYSADPFNEILP 325

Query: 120 PVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
              +    +++     AI +  +  D + +W++Q W F+YD  W    +K+ L++VP+G 
Sbjct: 326 SDSAKFDAKFVKQTAQAIMNSCRKVDKNCIWVLQSWSFTYDE-WPSWAIKSFLSAVPIGN 384

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
           L++LDL++EV P W ++  F+G  YIWCMLH+F G+ E+ G L  +  G   A     + 
Sbjct: 385 LLILDLYSEVVPAWQSTSSFHGHNYIWCMLHSFGGSRELRGNLQKVDKGYQLALMKGGSN 444

Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
           ++G G++ME I+QN V+Y  M +  +  E + +  WI  YS  RY          W +L 
Sbjct: 445 LIGAGLTMEAIDQNYVIYQFMVDRMWSSEPLPLNTWIKSYSESRYSADFKVSHKFWTLLA 504

Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
            + YN  +   +    V +                   Y +P   + +            
Sbjct: 505 FSFYNQPEKWGNPRFSVFL-------------------YHRPAFGKKI-----------E 534

Query: 357 LWYSTSEVIRALELFIASG-NELSASNTYRYDLIDLTRQALAKY--ANELFLNIIEAYQL 413
            W+   E    L+  I S  + L     ++ DL D+ R A+ ++   N+  L + EA+ +
Sbjct: 535 YWFPVEETFGHLQSLIPSLIHVLGDHPLFKEDLNDVMR-AITQFEVGNDAALTLTEAFLM 593

Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
            D   +       +    DM   L  +    +  W+E +K +A   E E+QV
Sbjct: 594 EDKQQIGSTCENLM----DMFLKLESYSNRDMKHWIEDSKSIAATSE-ERQV 640


>gi|383280354|pdb|4A4A|A Chain A, Cpgh89 (E483q, E601q), From Clostridium Perfringens, In
           Complex With Its Substrate Glcnac-Alpha-1,4-Galactose
          Length = 914

 Score =  210 bits (535), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 134/462 (29%), Positives = 229/462 (49%), Gaps = 39/462 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +     A+  
Sbjct: 367 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTI 426

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F +  NT
Sbjct: 427 SQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHQGGNT 485

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D+      +   I + M   D+DAVW++Q W  +       P    L       + +
Sbjct: 486 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 534

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
           VLDLF+EV P W+  ++   +P+IW MLHNF G + M    + +A   +    + +  MV
Sbjct: 535 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLA-TEIPKALANSEHMV 592

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           G+G++ + I  NP+ Y+L+ +MA+  ++++ + W   Y  RRYG++   I +AWN++  T
Sbjct: 593 GIGITPQAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILDT 652

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
            Y        K  D    +     SII+   G    +G        +KS  S++ H  + 
Sbjct: 653 AYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIV 689

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
           Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +   
Sbjct: 690 YDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEK 749

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
              +S +FLEL++  + +L+    FL+G W+E A+ + ++ +
Sbjct: 750 FKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSD 791


>gi|336412611|ref|ZP_08592964.1| hypothetical protein HMPREF1017_00072 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942657|gb|EGN04499.1| hypothetical protein HMPREF1017_00072 [Bacteroides ovatus
           3_8_47FAA]
          Length = 718

 Score =  210 bits (535), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 136/468 (29%), Positives = 228/468 (48%), Gaps = 54/468 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ +IL R+ ELGM P+ PAF+G VP       P  +  
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     +      Y+L    P F EIG+ F+E+  KE+G  ++ Y  D+F+E   P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       ++  G  IY  + +G+ DAVW+ QGW F Y   FW    +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G LD  A   
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431

Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
           V+A R +    ++G G + EG+E N VVY+L+++M +  + +D+  W+  Y   RYG   
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
            A+++AW +   T Y+            + ++P      +   + +       +SK  + 
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKIDL- 532

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
                          + + ++A+ L+ +  +EL  S  YR DLI+     +A  A   + 
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKNSELYRNDLIEFVSYYVAAKAEIFYK 577

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
             ++    N      +  ++ ++L+ D+D LLA H  + L  W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELAR 625


>gi|373461651|ref|ZP_09553390.1| hypothetical protein HMPREF9944_01654 [Prevotella maculosa OT 289]
 gi|371951955|gb|EHO69797.1| hypothetical protein HMPREF9944_01654 [Prevotella maculosa OT 289]
          Length = 713

 Score =  209 bits (533), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 142/464 (30%), Positives = 212/464 (45%), Gaps = 52/464 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GWGGP+ Q ++D Q  L ++IL R+  LG+ PVL  F G V  ++++ +P+A + 
Sbjct: 194 MGNLEGWGGPVSQDFIDAQSRLGRRILDRMATLGIQPVLQGFYGMVSRSIRDRYPNAVMP 253

Query: 61  QLGNW-FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
           Q G W F  + D       +L  T+ LF EI   +  +  K YG   H +  D F E   
Sbjct: 254 Q-GMWGFFERPD-------ILKPTEKLFDEIADTYYREIKKHYGTGFHYFGGDLFHEGGQ 305

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
                  ++  G A+   MQ     + W++QGW  + +P         LL  +   K++V
Sbjct: 306 T--GTLNVADCGLAVQQAMQRNFPGSTWVLQGWSGNPNPL--------LLTKLDREKVLV 355

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMV 238
           +DLF E    W+ +K + G P++WC++ NF     MYG L  IA    + R S+    + 
Sbjct: 356 VDLFGENDEAWNRTKAYQGTPFLWCIVSNFGEQCGMYGKLQRIALQIDKVRKSDYKAYLK 415

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           GVG+  EGI  NPVVYD++        K++V+AW+  Y   RYG     I  AW +   T
Sbjct: 416 GVGIMPEGINNNPVVYDMVLHAPLTDRKINVEAWLKSYITYRYGSYNADIYAAWLIFLQT 475

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS----KEAVLKSETSSYDH 354
           +Y                         SV E     YG P S    +  V  ++TSS+  
Sbjct: 476 IY------------------------ASVPE----KYGLPESVFCARPGVKVTQTSSWGV 507

Query: 355 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
              +Y        + LF+ +      S TY YD+ DL RQ  +   N ++ ++I A    
Sbjct: 508 RARYYDMDFFKEGVRLFLKAKTSFEDSETYAYDMFDLLRQVQSDKGNRVYDDMIAAIDAK 567

Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
           + +   Q S RFL  +   D LLA   GF L  WL  A +  + 
Sbjct: 568 NPNRFEQTSDRFLHELLRQDTLLAQSKGFTLERWLGQASRFGKT 611


>gi|268533054|ref|XP_002631655.1| Hypothetical protein CBG20846 [Caenorhabditis briggsae]
          Length = 712

 Score =  208 bits (530), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 128/470 (27%), Positives = 228/470 (48%), Gaps = 45/470 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL G+GG L  + +     L K+I+ R+ ELG+ P+LP FSG VP  L+ +FP++K  
Sbjct: 207 MGNLKGYGGGLSDAQMLNDFNLAKRIINRLLELGITPILPTFSGFVPDRLEKLFPTSKFN 266

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTP 119
           +L  W +  S+    C   +   DPLF +IG +F+  Q K  G   +++Y+ D F+E  P
Sbjct: 267 RLPCWNNFTSET--SCLLSVSPFDPLFQKIGSSFLRHQKKMLGGDITNLYSADPFNEVLP 324

Query: 120 PVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
              +    +++     AI +  +  D + +W++Q W F+YD  W    +K+ L++VP+G+
Sbjct: 325 SDSAKFDAKFVKQTAQAIMNSCRKVDKNCIWVLQSWSFTYDQ-WPNWAIKSFLSAVPIGQ 383

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
           +++LDL++EV P W  +  F+G  ++WCMLHNF G+ E+ G +  +  G   A     + 
Sbjct: 384 MLILDLYSEVVPAWQMTSSFHGHNFVWCMLHNFGGSRELRGNVQKVDKGYQLALMKAGSN 443

Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
           +VG G+SME I+QN ++Y  M +  +  E + + +W+  YS  RY          W +L 
Sbjct: 444 LVGAGLSMEAIDQNYMMYQFMIDRMWTQEPIPLNSWLKSYSESRYSADFKVAHKFWTILA 503

Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
            + YN  +   +    V +                   Y +P   + +            
Sbjct: 504 GSFYNQPEKWGNPRFSVFL-------------------YHRPAFGKKI-----------E 533

Query: 357 LWYSTSEVIRALE-LFIASGNELSASNTYRYDLIDLTRQALAKY--ANELFLNIIEAYQL 413
            W+   E    LE L ++  + L     ++ DL D+ R A+ ++   NE  L++ EA+ +
Sbjct: 534 YWFPVEETFTHLESLVLSLLHILGDHPLFKEDLNDVMR-AITQFEIGNEAALSLTEAFLM 592

Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
            D   +       + + + ++     +    +  W+E AK +A   E+ +
Sbjct: 593 EDKQQIGTTCENLMGMFQKLEP----YSNRDVRDWIEDAKSIAPTTEERE 638


>gi|421734750|ref|ZP_16173809.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum LMG 13195]
 gi|407077324|gb|EKE50171.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum LMG 13195]
          Length = 1919

 Score =  208 bits (529), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 137/462 (29%), Positives = 227/462 (49%), Gaps = 41/462 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+  GGPLP +W +Q++ L ++I  R+   G+ PV+  F G VPA  Q   P++   
Sbjct: 362 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 421

Query: 61  QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
             G W      P    TYL DA      +  F ++G  F + Q   +G+ S+ Y  D F 
Sbjct: 422 SSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVGDTFYKAQENVFGKVSNYYAVDPFH 480

Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
           E     D  + I  +   +   M   D  AVW+MQ W +  D      ++  L +    G
Sbjct: 481 EGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 532

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
           + +VLDL ++++   +++ +  GVP++W MLHNF G + + G+ + I+    +A  S   
Sbjct: 533 QTLVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 591

Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
            M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W  +Y+ RRYG +   I+ AW++L
Sbjct: 592 -MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDIL 650

Query: 296 YHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
             T Y  TDG      +  +I A P  D +I S                       S++ 
Sbjct: 651 LDTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWG 687

Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
           H  + Y   +  +A  LF  + +    S  +RYD +D+ RQ LA    E      +AY+ 
Sbjct: 688 HSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKS 747

Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
            D      LS R L++++  D LL+  D FL+G W++ A+ +
Sbjct: 748 GDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTM 789


>gi|294674521|ref|YP_003575137.1| alpha-N-acetylglucosaminidase [Prevotella ruminicola 23]
 gi|294472030|gb|ADE81419.1| putative alpha-N-acetylglucosaminidase [Prevotella ruminicola 23]
          Length = 754

 Score =  208 bits (529), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 139/475 (29%), Positives = 228/475 (48%), Gaps = 54/475 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGPLP SW  +Q  LQK+IL R+ +LGM+PVLP + G VP   +       + 
Sbjct: 186 MNNLEGWGGPLPTSWYARQEKLQKQILARMKQLGMHPVLPGYCGMVPHDAKEKL-GLNVA 244

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT-- 118
             G W   +          L  TD  F EI   +  +  K +G+  + Y+ D F E+   
Sbjct: 245 DAGLWNGFQRPAN------LLPTDARFSEIATLYYNELTKLFGKADY-YSMDPFHESNDD 297

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
           P +D     +  G A+   M+  +  AVW++QGW  + +P       +A+++ +  G L+
Sbjct: 298 PNID----YAKAGQAMMQAMKRVNPKAVWVIQGW--TENP------REAMVDDMKTGDLL 345

Query: 179 VLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYG----ILDSIAFGPVE 228
           VLDLF+E +P      IW   + +    +++C+L NF  N+ ++G    +LD+       
Sbjct: 346 VLDLFSECRPMFGIPSIWKREQGYKQHQWLFCLLENFGANVGLHGRMDQLLDNFYMLQSS 405

Query: 229 ARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAI 288
              ++++ + G+G +MEG E NPV+++LMSE+ ++ EK   + W+  Y   RYG    AI
Sbjct: 406 KFQAQSSKLKGIGFTMEGSENNPVMFELMSELPWRPEKFTKEQWVKNYVKARYGVEDEAI 465

Query: 289 QDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
           + AW  L  ++YNC  G   +             SI           G+P     +   +
Sbjct: 466 EKAWLTLAKSIYNCPAGNNQQG---------PHESIFC---------GRPT----LNNFQ 503

Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
            SS+     +Y  +   +A +L  +   +   +N + YDL+D+TRQALA  A   +   I
Sbjct: 504 ASSWSKMKNYYDPAMTKKAAKLMNSVAEKYRGNNNFEYDLVDITRQALADQARLQYQKTI 563

Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
             Y+        + + RFL+++   D LL     F +G W + A       E++K
Sbjct: 564 ADYKAFSRKQFDRDAERFLKMLLLQDKLLGTRTEFRVGHWTQDAVNAGNTAEEKK 618


>gi|313140918|ref|ZP_07803111.1| alpha-N-acetylglucosaminidase family protein [Bifidobacterium
           bifidum NCIMB 41171]
 gi|313133428|gb|EFR51045.1| alpha-N-acetylglucosaminidase family protein [Bifidobacterium
           bifidum NCIMB 41171]
          Length = 2005

 Score =  207 bits (528), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 137/462 (29%), Positives = 227/462 (49%), Gaps = 41/462 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+  GGPLP +W +Q++ L ++I  R+   G+ PV+  F G VPA  Q   P++   
Sbjct: 448 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 507

Query: 61  QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
             G W      P    TYL DA      +  F ++G  F + Q   +G+ S+ Y  D F 
Sbjct: 508 SSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGDTFYKAQESVFGKVSNYYAVDPFH 566

Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
           E     D  + I  +   +   M   D  AVW+MQ W +  D      ++  L +    G
Sbjct: 567 EGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 618

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
           + +VLDL ++++   +++ +  GVP++W MLHNF G + + G+ + I+    +A  S   
Sbjct: 619 QALVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 677

Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
            M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W  +Y+ RRYG +   I+ AW++L
Sbjct: 678 -MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDIL 736

Query: 296 YHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
             T Y  TDG      +  +I A P  D +I S                       S++ 
Sbjct: 737 LDTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWG 773

Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
           H  + Y   +  +A  LF  + +    S  +RYD +D+ RQ LA    E      +AY+ 
Sbjct: 774 HSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKS 833

Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
            D      LS R L++++  D LL+  D FL+G W++ A+ +
Sbjct: 834 GDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTM 875


>gi|311064845|ref|YP_003971571.1| beta-N-hexosaminidase [Bifidobacterium bifidum PRL2010]
 gi|310867165|gb|ADP36534.1| Beta-N-hexosaminidase [Bifidobacterium bifidum PRL2010]
          Length = 1923

 Score =  207 bits (528), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 137/462 (29%), Positives = 227/462 (49%), Gaps = 41/462 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+  GGPLP +W +Q++ L ++I  R+   G+ PV+  F G VPA  Q   P++   
Sbjct: 366 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 425

Query: 61  QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
             G W      P    TYL DA      +  F ++G  F + Q   +G+ S+ Y  D F 
Sbjct: 426 SSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGDTFYKAQENVFGKVSNYYAVDPFH 484

Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
           E     D  + I  +   +   M   D  AVW+MQ W +  D      ++  L +    G
Sbjct: 485 EGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 536

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
           + +VLDL ++++   +++ +  GVP++W MLHNF G + + G+ + I+    +A  S   
Sbjct: 537 QALVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 595

Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
            M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W  +Y+ RRYG +   I+ AW++L
Sbjct: 596 -MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDIL 654

Query: 296 YHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
             T Y  TDG      +  +I A P  D +I S                       S++ 
Sbjct: 655 LDTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWG 691

Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
           H  + Y   +  +A  LF  + +    S  +RYD +D+ RQ LA    E      +AY+ 
Sbjct: 692 HSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKS 751

Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
            D      LS R L++++  D LL+  D FL+G W++ A+ +
Sbjct: 752 GDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTM 793


>gi|187734575|ref|YP_001876687.1| alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
           BAA-835]
 gi|187424627|gb|ACD03906.1| Alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
           BAA-835]
          Length = 848

 Score =  207 bits (527), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 139/482 (28%), Positives = 231/482 (47%), Gaps = 63/482 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+    GPLP SW  +Q+ LQ +IL R+  LGM P+ PAFSG VP  +  ++P AK+ 
Sbjct: 202 MGNIVNHDGPLPASWHKEQIALQHRILHRMKSLGMTPICPAFSGFVPRGILRLYPEAKLH 261

Query: 61  QL--GNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           +L  G W      P+    + L   +PLF++IGR ++++  KE+G+ ++ +  D+F+E  
Sbjct: 262 RLGWGGW------PQKNHAHFLSPEEPLFLKIGRLYMQEWQKEFGKNTY-FLADSFNEME 314

Query: 119 PPVDSP------EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNS 171
            P +          +SSLG  IY  + S + DAVW+MQGW+F Y    W    +KALL+ 
Sbjct: 315 LPENKGGVEARNNMLSSLGEQIYRSISSTNPDAVWVMQGWMFGYQRNIWNADTLKALLSK 374

Query: 172 VPLGKLVVLDLFAEVKPI-------WSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
           VP  K+++LDL A+           W   K F+  P+++ ++ N  G   M G++D  A 
Sbjct: 375 VPDDKMLLLDLAADYNKTFWRNGMNWDVFKGFFNKPWVYSVVPNMGGKCAMTGVMDFYAN 434

Query: 225 GPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
           G +EA  +S    + G+GM+ EGIE N V+Y+L+++ A+++ + +V+ ++  Y   RYG 
Sbjct: 435 GHLEALNSSSRGRLSGMGMAPEGIENNDVIYELITDAAWRNRQENVEQYLENYCRARYGN 494

Query: 284 SVPAIQDAWNVLYHTVY-NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKE 342
              ++++AWN+   T Y N  D             P  +  +   T G   N        
Sbjct: 495 YPDSMKEAWNLFRRTAYSNLKD------------HPRFNWQMKPGTRGCSVN-------- 534

Query: 343 AVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANE 402
                            ++ + ++ L LF+ +   L  S  +R D +++    L    NE
Sbjct: 535 -----------------TSEDFLKGLSLFVNT-RGLEQSPLFRQDAVEMAVHYLGIRMNE 576

Query: 403 LFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
                 EA    D     +    F +     D LL  H  + L  W+  A+    + E++
Sbjct: 577 AIRAAQEALDEQDQENAEKCMAYFRKYALLADSLLEGHPTWRLSRWISFARSHGTSPEEK 636

Query: 463 KQ 464
            +
Sbjct: 637 NK 638


>gi|374384144|ref|ZP_09641670.1| hypothetical protein HMPREF9449_00056 [Odoribacter laneus YIT
           12061]
 gi|373228751|gb|EHP51054.1| hypothetical protein HMPREF9449_00056 [Odoribacter laneus YIT
           12061]
          Length = 835

 Score =  207 bits (527), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 138/475 (29%), Positives = 232/475 (48%), Gaps = 59/475 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+    GP+P  W   Q+ LQ KIL R+  LGM P+ PAF+G VP AL+ ++P  KI 
Sbjct: 192 MGNISQIDGPMPVEWHSDQVELQHKILKRMKLLGMKPICPAFAGFVPLALKRLYPDVKII 251

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT-- 118
           +   W    +       ++L   + LF  IG+ FIE+  KE+G+ +  Y  D+F+E    
Sbjct: 252 ET-TWAGFHN-------WMLSPEEELFTRIGQLFIEEWEKEFGK-NDFYLADSFNEMDVP 302

Query: 119 -PPVDSPEYISSL---GAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
            PP+ + E    L   G  +Y G+++G+ DAVW+MQGW+F Y    W    ++AL++ VP
Sbjct: 303 FPPIGTKERYDMLAFYGEQVYKGIKAGNPDAVWVMQGWMFGYQRDIWDYETLQALVSKVP 362

Query: 174 LGKLVVLDLFAEV-KPIWSTS------KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K+++LDL A+  K +W         K F+   +++ ++ N  G     GIL   A G 
Sbjct: 363 DDKMMLLDLAADYNKNVWGNGMNWEFYKGFFNKLWVYSVIPNMGGKTGATGILSFYANGH 422

Query: 227 VEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
           +EA  S N   + G GM+ EG E N VVY+++ +  +   ++DVK W+  YS+ RYG++ 
Sbjct: 423 LEALNSPNRGRLFGFGMAPEGTENNEVVYEMICDAGWSSSEIDVKQWLKDYSLCRYGKTC 482

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
           P + + W  L  +VY                F D    +  +  G+        S +  +
Sbjct: 483 PEMDEVWEGLCKSVYG--------------TFTDHPRFLWQLRPGR--------SGKGTV 520

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
            ++++ Y             RA+E       +++ S  ++ D +++T   L      L  
Sbjct: 521 NTDSNFY-------------RAVEKMAECAPKMTESPLFKADFLEMTAFYLGGKMEALAS 567

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
            I ++Y   +     ++ ++F EL E +D LL  H  + L  W++ A++    E+
Sbjct: 568 AIGKSYLYGNTADALKMQQQFEELGEGLDSLLESHPVYRLQRWIDFARKHGDTEK 622


>gi|198277542|ref|ZP_03210073.1| hypothetical protein BACPLE_03764 [Bacteroides plebeius DSM 17135]
 gi|198270040|gb|EDY94310.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides plebeius DSM
           17135]
          Length = 722

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 140/474 (29%), Positives = 229/474 (48%), Gaps = 57/474 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
           M+NL GWGGP P SW  QQ  LQKKIL R+ E G+ PV P +SG VP  A + +  +   
Sbjct: 191 MNNLEGWGGPNPDSWYTQQEALQKKILKRMREYGIEPVFPGYSGMVPHDANKKLGLNVTE 250

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD--EN 117
             L N F+  +         L  TD  F EI   + ++  K +G+ ++ Y+ D F   E+
Sbjct: 251 PALWNGFTRPA--------FLLPTDSRFNEIASLYYKELEKLFGKANY-YSMDPFHELED 301

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
              VD      + G A+   M++ +  A W++QGW  + +P  RP  +K L N    G +
Sbjct: 302 AGSVD----FDAAGKAVLKAMKNVNPKATWVIQGW--TENP--RPEMIKNLNN----GDI 349

Query: 178 VVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
           ++LDLF+E +P      IW   K +    +++CM+ NF GN+ ++G +D +       + 
Sbjct: 350 LILDLFSECRPMWGIPSIWKREKGYEQHDWLFCMIENFGGNVGLHGRMDQLLNNFYLTKN 409

Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
           +     + G+G++MEG E NPV+++LM E+ ++ EK   + W+  Y   RYG     I  
Sbjct: 410 NPLAAHLKGIGLTMEGSENNPVMFELMCELPWRPEKFTKEEWLKDYLFARYGVRDEKITQ 469

Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
           AW++L   +YNC  G   +  +  +    P ++ +  + +  K QNY  P S EA     
Sbjct: 470 AWSILADGIYNCPFGNNQQGPHESIFCGRPGLN-NFQASSWSKMQNYYDPTSTEA----- 523

Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
                             A  L +   ++   +N + YDL+D+ RQ+L+     ++   I
Sbjct: 524 ------------------AARLMLEVADKYKGNNNFEYDLVDIVRQSLSDRGRIVYNQTI 565

Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
             ++  D       S+ FL ++   D LL     F +G W+E A+ L    E++
Sbjct: 566 ADFKSFDKKSFATHSQEFLNILLAQDRLLGTRSEFRVGRWIEQARNLGTTPEEK 619


>gi|380512475|ref|ZP_09855882.1| N-acetylglucosaminidase [Xanthomonas sacchari NCPPB 4393]
          Length = 785

 Score =  207 bits (526), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 147/502 (29%), Positives = 225/502 (44%), Gaps = 71/502 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W++ +  LQ +IL R+  LGM PVLPAF+G VP A     P A+I 
Sbjct: 221 MGNIEGYDAPLPQQWIEDKHALQLRILQRMRALGMKPVLPAFAGYVPKAFAQAHPQARIY 280

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE---- 116
           ++  W           TY LD  DPLF +I + FI+   + YG+ ++ Y  D F+E    
Sbjct: 281 RMRAWEGFHE------TYWLDPADPLFAQIAQRFIQLYDRTYGKGTY-YLADAFNEMLPP 333

Query: 117 ------------------NT--------PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
                             NT        PPV   + +++ G A+Y+ +   + DAVW+MQ
Sbjct: 334 IAADGSDARLASYGDSTANTAKTKPPEVPPVQRDKRLAAYGRALYASIHRANPDAVWVMQ 393

Query: 151 GWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW P  + A L  VP  KL+VLD+  +  P  W  S  F G  +I+  +HN
Sbjct: 394 GWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDRYPGTWKLSDAFDGKQWIYGYVHN 453

Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
           + G+  +YG    +AF   + R      +   +VG G   EG+    VVY+ M  +A+  
Sbjct: 454 YGGSNPVYG---DLAFYREDLRALLADKDKQQLVGFGAFPEGLHTTSVVYEYMYALAWGA 510

Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
           ++  ++ W++ Y+  RYG + PA++ AW+ L  +V +                    P  
Sbjct: 511 QQRPLQDWLDDYTRARYGHTSPALRAAWDDLQASVLSTRYWT---------------PRW 555

Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
                G Y  + +P     + + E +  D P L        RAL+  +A   E + +  Y
Sbjct: 556 WRSRAGAYLLFKRPTLD--IGEFEGAPGDPPRL-------RRALQQLLALAPEYADAPLY 606

Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 444
           RYDL+D  R       +      + AY+  D       + R  E V  +D L+       
Sbjct: 607 RYDLVDFARHYATGRVDVQLQQAVAAYRRGDVAAGDAATARVREAVTQLDSLVGGQQD-T 665

Query: 445 LGPWLESAKQLAQNEEQEKQVR 466
           L  WL++A   A   +     R
Sbjct: 666 LSSWLDAAAGYATTPQDAAYYR 687


>gi|390937398|ref|YP_006394957.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum BGN4]
 gi|389891011|gb|AFL05078.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum BGN4]
          Length = 1957

 Score =  207 bits (526), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 137/462 (29%), Positives = 226/462 (48%), Gaps = 41/462 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+  GGPLP +W +Q++ L ++I  R+   G+ PV+  F G VPA  Q   P++   
Sbjct: 400 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 459

Query: 61  QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
             G W      P    TYL DA      +  F ++G  F + Q   +G+ S+ Y  D F 
Sbjct: 460 SSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGDTFYKAQENVFGKVSNYYAVDPFH 518

Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
           E     D  + I  +   +   M   D  AVW+MQ W +  D      ++  L +    G
Sbjct: 519 EGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 570

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
           + +VLDL ++++   ++  +  GVP++W MLHNF G + + G+ + I+    +A  S   
Sbjct: 571 QALVLDLQSDLRS-QASPMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 629

Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
            M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W  +Y+ RRYG +   I+ AW++L
Sbjct: 630 -MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDIL 688

Query: 296 YHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
             T Y  TDG      +  +I A P  D +I S                       S++ 
Sbjct: 689 LDTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWG 725

Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
           H  + Y   +  +A  LF  + +    S  +RYD +D+ RQ LA    E      +AY+ 
Sbjct: 726 HSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKS 785

Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
            D      LS R L++++  D LL+  D FL+G W++ A+ +
Sbjct: 786 GDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTM 827


>gi|440731409|ref|ZP_20911430.1| N-acetylglucosaminidase, partial [Xanthomonas translucens DAR61454]
 gi|440373101|gb|ELQ09870.1| N-acetylglucosaminidase, partial [Xanthomonas translucens DAR61454]
          Length = 732

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 147/502 (29%), Positives = 223/502 (44%), Gaps = 71/502 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  +  PLPQ W++ +  LQ++IL R+  LGM PVLPAFSG VP A     P A+I 
Sbjct: 168 MGNIEAYDAPLPQQWIEDKYALQQRILQRMRTLGMKPVLPAFSGYVPKAFAQAHPQARIY 227

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF +I + FI+   + YG+ ++ Y  D F+E  PP
Sbjct: 228 RMRAWEGFHE------TYWLDPADPLFTKIAQRFIQLYDRTYGKGTY-YLADAFNEMLPP 280

Query: 121 VDS----------------------PE--------YISSLGAAIYSGMQSGDSDAVWLMQ 150
           + +                      PE         ++  G A+Y  +   + DAVW+MQ
Sbjct: 281 IAADGSDARLASYGDSTANTAKTAPPEVSPAQRDKRLADYGRALYESIHRANPDAVWVMQ 340

Query: 151 GWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW P  + A L  VP  KL+VLD+  +  P  W  S  F G  +I+  +HN
Sbjct: 341 GWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDRYPGTWKLSDAFDGKQWIYGYVHN 400

Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
           + G+  +YG    +AF   + R      +   +VG G   EG+  N VVY+ M  +A+  
Sbjct: 401 YGGSNPVYG---DLAFYRDDLRALLADKDKQQLVGFGAFPEGLHDNSVVYEYMYTLAWGG 457

Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
           ++  ++ W+  Y+  RYG + PA++ AW+ L   V +                    P  
Sbjct: 458 QQRSLQDWLGDYTRARYGHTSPALRAAWDDLQAAVLSTRYWT---------------PRW 502

Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
                G Y  + +P     + + E +  D P L        RAL+  +A   E + +  Y
Sbjct: 503 WRSRAGAYLLFKRPTLD--IGEFEGAPGDPPRL-------RRALDQLLALAPEYADAPLY 553

Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 444
           RYDL+D  R       +      + AY+  D         R    V+ +DGL+      +
Sbjct: 554 RYDLVDFARHYATGRVDAQLQQAVAAYRRGDVAAGDAAFARVQAAVQQLDGLVGGQQE-I 612

Query: 445 LGPWLESAKQLAQNEEQEKQVR 466
           L  WL  A+  A+  +     R
Sbjct: 613 LSSWLGDAEGDAKTPQDAAYYR 634


>gi|424795356|ref|ZP_18221218.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
 gi|422795515|gb|EKU24196.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
          Length = 1105

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 148/502 (29%), Positives = 225/502 (44%), Gaps = 71/502 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W++ +  LQ++IL R+  LGM PVLPAF+G VP A     P A+I 
Sbjct: 164 MGNIEGYDAPLPQQWIEDKHALQQRILQRMRALGMKPVLPAFAGYVPKAFAQAHPQARIY 223

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF +I + FI+   + YG+ ++ Y  D F+E  PP
Sbjct: 224 RMRAWEGFHE------TYWLDPADPLFAKIAQRFIQLYDRTYGKGTY-YLADAFNEMLPP 276

Query: 121 VDS----------------------PEY--------ISSLGAAIYSGMQSGDSDAVWLMQ 150
           + +                      PE         ++  G A+Y  +   + DAVW+MQ
Sbjct: 277 IAADGSDARLASYGDSTANTANTAPPEVSPAQRDKRLADYGRALYESIHRANPDAVWVMQ 336

Query: 151 GWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW P  + A L  VP  KL+VLD+  +  P  W  S  F G  +I+  +HN
Sbjct: 337 GWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDRYPGTWKLSDAFDGKQWIYGYVHN 396

Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
           + G+  +YG    +AF   + R      +   +VG G   EG+  N VVY+ M  +A+  
Sbjct: 397 YGGSNPVYG---DLAFYRDDLRALLADKDKQQLVGFGAFPEGLHTNSVVYEYMYALAWGG 453

Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
           ++  ++ W+  Y+  RYG S PA++ AW+ L  +V +                    P  
Sbjct: 454 QQRSLQDWLGDYTRARYGHSSPALRAAWDDLQASVLSTR---------------YWTPRW 498

Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
                G Y  + +P     + + E +  D P L        RAL+  +A   E + +  Y
Sbjct: 499 WRSRAGAYLLFKRPTLD--IGEFEGAPGDPPRL-------RRALDQLLALAPEYADAPLY 549

Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 444
           RYDL+D  R       +      + AY+  D         R    V  +DGL+       
Sbjct: 550 RYDLVDFARHYATGRVDTQLQQALAAYKRGDVAAGDAAFARVQAAVRQLDGLVGGQQE-T 608

Query: 445 LGPWLESAKQLAQNEEQEKQVR 466
           L  WL++A+  A+  +     R
Sbjct: 609 LSSWLDAAEGDAKTPQDAAYYR 630


>gi|365104185|ref|ZP_09333846.1| hypothetical protein HMPREF9428_02927 [Citrobacter freundii
           4_7_47CFAA]
 gi|363644798|gb|EHL84079.1| hypothetical protein HMPREF9428_02927 [Citrobacter freundii
           4_7_47CFAA]
          Length = 1049

 Score =  206 bits (523), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 130/455 (28%), Positives = 218/455 (47%), Gaps = 37/455 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  +GGPLPQSW  Q+  L +KI  R+   G+ PV P F+G VP       P A++ 
Sbjct: 361 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 420

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W      P    TY+    D  F ++   + +     +G  SH Y  D F E    
Sbjct: 421 DQGEWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQTLKTTFGDISHYYAVDPFHEGGNR 478

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D    +  +   + + M   D DAVW++Q W        +     A LN +     ++L
Sbjct: 479 ADLD--MVKVAQTVQNKMLEHDKDAVWIIQNW--------QENPTDAFLNGLKKDHALIL 528

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL+A+ KP  +   +F   P+IW MLH F G +   G+ + +A   +    +E+  M GV
Sbjct: 529 DLYADNKPNHAMRHEFSNTPWIWNMLHAFGGRMGFSGMPEVLA-QEIPQSLAESKKMKGV 587

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ E +  NP++Y+++ +MA++   +   A+I+ +   RYG   P I+ AW+++  T Y
Sbjct: 588 GVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWLTSRYGAQSPEIEQAWDIMVKTAY 647

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           +      D+ R         + SII    G    +G  V++     +    YD       
Sbjct: 648 HRR---KDRQR--------AEDSIIDAKPG----FG--VTRACTYYTALIDYDK------ 684

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
            +E  + L L+++  +   A+  Y++DL+D+TRQ LA  + E +    +A+   D     
Sbjct: 685 -AEFEKILPLYLSVYDHFKANPAYQHDLVDITRQVLANASYEYYRAFEDAWIAKDYSAFN 743

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
           QLS +FL L++  D +L+    F+LG W+ SA+ +
Sbjct: 744 QLSGKFLRLIKLQDQVLSTRPEFMLGTWINSARTM 778


>gi|282881077|ref|ZP_06289764.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella timonensis CRIS
           5C-B1]
 gi|281304881|gb|EFA96954.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella timonensis CRIS
           5C-B1]
          Length = 688

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 138/468 (29%), Positives = 222/468 (47%), Gaps = 44/468 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GWGGP+ Q  +D +  LQ KIL R+ +LG+ PV+  F G VP+ L + +P A + 
Sbjct: 164 MGNLEGWGGPMSQQMIDDRYKLQIKILRRMRQLGIEPVVQGFPGIVPSFLHDKYPKACVV 223

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W   +         +L     LF  + +A+ +   + YG     +  D F E  N 
Sbjct: 224 SQGKWNGFQRPS------ILLPQSQLFYCMAKAYYDNMKRYYGTDLRYFGGDLFHEGGNA 277

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             VD    +SS  + +   M S   DA W++QGW  +  P        ALL  +    ++
Sbjct: 278 KGVD----LSSTASKVQKCMLSHFPDAKWVLQGWNGNPSP--------ALLAGLDKKHVL 325

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTM 237
           +++L  E+   W  S +F   P+IW  +++F G  +M G L  +   P  A   S++  +
Sbjct: 326 LINLAGEIDASWKQSDEFGQTPWIWGSVNHFGGKTDMGGQLPVLVEQPHRALAASQHGRL 385

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
            G+G+  EGI  NPVVYDL  + A+      V   + QY   RYG     +  AW +L  
Sbjct: 386 KGLGILPEGIHTNPVVYDLALQTAWSDTVPSVDHLLRQYIWYRYGTWNDDLYRAWQLLAS 445

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           +VY                            EG Y++     ++ ++  S  S++    +
Sbjct: 446 SVYG---------------------EFEVKGEGTYESVF--CARPSLHVSSVSTWGPKKM 482

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            Y   ++++AL LF  +      S TY YDL+DL RQ +A  A  ++  ++ AY   D+ 
Sbjct: 483 QYQPEKLLQALVLFRKAAVHFKGSETYEYDLVDLARQVMANNARNVYNQVVHAYNEKDSL 542

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
            + + S  FL L++  D LL+ +  FLLG WL++A+Q  +NE+ ++Q 
Sbjct: 543 ALNRYSSTFLHLIDLQDSLLSTNKFFLLGKWLQAARQYGENEQDQRQA 590


>gi|291086028|ref|ZP_06354661.2| alpha-N-acetylglucosaminidase family protein [Citrobacter youngae
           ATCC 29220]
 gi|291069185|gb|EFE07294.1| alpha-N-acetylglucosaminidase family protein [Citrobacter youngae
           ATCC 29220]
          Length = 1014

 Score =  205 bits (522), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 130/455 (28%), Positives = 217/455 (47%), Gaps = 37/455 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  +GGPLPQSW  Q+  L +KI  R+   G+ PV P F+G VP       P A++ 
Sbjct: 326 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 385

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G+W      P    TY+    D  F ++   + +     +G  SH Y  D F E    
Sbjct: 386 DQGDWVGFVRPPM-LRTYVKQGAD-YFSKVADVYYQTLKTTFGDISHYYAVDPFHEGGNR 443

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D    +  +   + + M   D DAVW++Q W        +     A LN +     ++L
Sbjct: 444 ADLD--MVKVAQTVQNKMLEHDKDAVWIIQNW--------QENPTDAFLNGLKKDHALIL 493

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL+A+ KP  +   +F   P+IW MLH F G +   G+ + +A   +    +E+  M GV
Sbjct: 494 DLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSGMPEVLA-QEIPQSLAESKYMKGV 552

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ E +  NP++Y+++ +MA++   +   A+I+ +   RYG   P I+ AW+++  T Y
Sbjct: 553 GVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWLTSRYGAQSPEIEQAWDIMVKTAY 612

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           +      D+ R         + SII    G    +G  V++     +    YD       
Sbjct: 613 HRR---KDRQR--------AEDSIIDAKPG----FG--VTRACTYYTALIDYDK------ 649

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
            +E  + L L+++  +    +  Y++DL+D+TRQ LA  + E +    +A+   D     
Sbjct: 650 -AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLANASYEYYRAFEDAWMAKDYSAFN 708

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
           QLS +FL L++  D +L     F+LG WL SA+ +
Sbjct: 709 QLSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTM 743


>gi|410634789|ref|ZP_11345419.1| alpha-N-acetylglucosaminidase [Glaciecola arctica BSs20135]
 gi|410145665|dbj|GAC22286.1| alpha-N-acetylglucosaminidase [Glaciecola arctica BSs20135]
          Length = 750

 Score =  205 bits (522), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 141/474 (29%), Positives = 239/474 (50%), Gaps = 58/474 (12%)

Query: 1   MSNLHGWGG--PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
           M NL  W G   LP+S+ D+Q+ L  KIL R+  LGM P++ AF+G VP A   +FP A+
Sbjct: 211 MGNLITWDGGDKLPESYFDEQIALNHKILKRLRSLGMTPIVHAFAGFVPPATSELFPEAQ 270

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE-N 117
           I +L +W      P     YLL   +PLF++IG+ +IE+  KE+G+  + Y  D+F+E +
Sbjct: 271 IRRL-SWGG--GLPESTYGYLLSPENPLFVKIGKMYIEEWQKEFGKNEY-YLADSFNEMD 326

Query: 118 TPPVDS-PEYISSL---GAAIYSGMQSGDSDAVWLMQGWLFSYDP------FWRPPQMKA 167
            PP D+  E ++ L   G  +Y  +++ + DA W+MQGW F Y        FW P ++ A
Sbjct: 327 VPPADTEAELLTELAGYGDRVYQSIKAANPDATWVMQGWTFPYHKDENRQLFWTPERLHA 386

Query: 168 LLNSVPLGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 220
           L++ VP  KL++LDL  E       + P W     F+   +I+  + N  G   + G  D
Sbjct: 387 LVSKVPDDKLLILDLANEYNKLWWKIDPSWKMYSGFFNKKWIYSFIPNMGGKTPLNGRFD 446

Query: 221 SIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVR 279
             A  P++A    +   ++G G + EGIE N ++Y+L+++MA+Q + +DV  W  +Y+++
Sbjct: 447 IYAELPIDALNYKDKGNLIGFGFAPEGIENNEMIYELLTDMAWQRKAIDVDQWQAKYAMQ 506

Query: 280 RYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV 339
           RYG    +++ A++ L              N+  + +F  VD  I       Y+N   P 
Sbjct: 507 RYGAYPGSLEKAFSYL--------------NKSALGSF--VDHPIHRFQLRPYRN---PE 547

Query: 340 SKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 399
             E          DH  + + + + I+A  LF+ +  +L  +  Y++D +++T   L+  
Sbjct: 548 GVE----------DHATV-HESEDFIKATGLFLQASEQLKDNKLYQHDAMEITTLFLSLV 596

Query: 400 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
            + L    +        + V   +   + ++  MD LLA H    L  W++ A+
Sbjct: 597 TDNLLTKFLAKDVEQRDYSVLDEA---ISVMHTMDKLLAEHPNHQLVTWVDYAR 647


>gi|310287970|ref|YP_003939229.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum S17]
 gi|309251907|gb|ADO53655.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum S17]
          Length = 1923

 Score =  204 bits (519), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 136/462 (29%), Positives = 225/462 (48%), Gaps = 41/462 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+  GGPLP +W +Q++ L ++I  R+   G+ PV+  F G VPA  Q   P++   
Sbjct: 366 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 425

Query: 61  QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
             G W      P    TYL DA      +  F ++G  F + Q   +G+ S+ Y  D F 
Sbjct: 426 SSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVGDTFYKAQESVFGKVSNYYAVDPFH 484

Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
           E     D  + I  +   +   M   D  AVW+MQ W +  D      ++  L +    G
Sbjct: 485 EGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 536

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
           + +VLDL ++++   ++  +  GVP++W MLHNF G + + G+ + I+    +A  S   
Sbjct: 537 QALVLDLQSDLRS-QASPMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 595

Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
            M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W  +Y+ RRYG +   I+ AW++L
Sbjct: 596 -MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDIL 654

Query: 296 YHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
             T Y   DG      +  +I A P  D +I S                       S++ 
Sbjct: 655 LDTAYKHMDGEYYQGASESIINARPS-DNTIGSA----------------------STWG 691

Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
           H  + Y   +  +A  LF  + +    S  +RYD +D+ RQ LA    E      +AY+ 
Sbjct: 692 HSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKS 751

Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
            D      LS R L++++  D LL+  D FL+G W++ A+ +
Sbjct: 752 GDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTM 793


>gi|289667570|ref|ZP_06488645.1| N-acetylglucosaminidase [Xanthomonas campestris pv. musacearum
           NCPPB 4381]
          Length = 798

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 144/477 (30%), Positives = 219/477 (45%), Gaps = 72/477 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W+D + VLQ++IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQQQILTRMRELGMQPVLPAFAGYVPKAFAQAHPHARIY 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG T   Y  D F+E  PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFMELYTQAYG-TGEFYLADAFNEMLPP 313

Query: 121 -------VDSPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
                  V + +Y  S+                       G A+Y  +   +  A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+P  + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 374 GWLFGADREFWQPQAIAAFLGKVPDARLLVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433

Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
           +  +  +YG     AF   + +     SE   + G G+  EG+  N VVY+ +  +A++ 
Sbjct: 434 YGASNPLYG---DFAFYRQDLQALLADSEKRNLRGFGIFPEGLHSNSVVYEYLYALAWEG 490

Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
            +     W+ QY   RYGRS  A+  AW+ L   +Y     +               P  
Sbjct: 491 PQQPWSQWLTQYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 535

Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
            +   G Y  + +P +   ++K +    D   L        RA++  +      + +  Y
Sbjct: 536 WNKRAGAYLLFKRPTAD--IVKFDDRPGDPQRL-------RRAIDALLQQAERYADAPLY 586

Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACH 440
           RYDLI+  R  L+  A+     +++AY   D A G  QL+ R  +LV+ +D L+   
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDVQLA-RITQLVQGLDALVGGQ 642


>gi|333031147|ref|ZP_08459208.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
 gi|332741744|gb|EGJ72226.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
          Length = 721

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 135/469 (28%), Positives = 226/469 (48%), Gaps = 54/469 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL   W   Q+ LQ KIL R+ EL M P+ PAF+G VP A     P     
Sbjct: 199 MGNLNKWDGPLSDEWHTSQIELQHKILDRMRELEMKPIAPAFAGFVPMAFAEKHPDINFK 258

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W     DP +   Y+L    P F EIG+ FIE+   E+G  ++ Y  D+F+E   P
Sbjct: 259 HM-RWGGF--DPEYNA-YVLPPDSPFFEEIGKLFIEEWENEFGSNTY-YLSDSFNEMELP 313

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           +D  +       +   G +IY  + +G+ +A+W+ QGW F Y   FW    ++ALL++VP
Sbjct: 314 IDKDDTEGKYRLLRQYGESIYKSISAGNPEAIWVTQGWTFGYQHSFWDTTSLQALLSNVP 373

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W+T +       FYG  +I+  + NF G   M G +   A   
Sbjct: 374 NEKMIIIDLGNDYPKWVWNTEQTWKVQNGFYGKGWIFSYVPNFGGKTTMTGDMQMYATSS 433

Query: 227 VEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
            EA  S N   ++G G + EG+E N V+Y+L+++M +  E +++  W+  Y + RYG   
Sbjct: 434 AEALASPNKGNLIGFGSAPEGLENNEVIYELLADMGWTSESINLDEWMQSYCLSRYGGYP 493

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
             +Q AW +   TVY+            + ++P                Y      E  L
Sbjct: 494 ENVQKAWELFRKTVYSN-----------LYSYP---------------RYTWQTVVEDTL 527

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
           +    +        ++ E +  +ELF+++ NEL  S  Y  DLI+ +    A  A++++ 
Sbjct: 528 RINKIN--------TSDEFLIGVELFVSAVNELKDSELYVNDLIEFSSFYAAAKADKIYK 579

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 454
             +  ++  +      L  + ++++  +D LLA H  + L  W++ A+ 
Sbjct: 580 EALILFERGNKKEARSLLNQSIQILLKVDKLLASHPIYRLEEWVKYARN 628


>gi|289663931|ref|ZP_06485512.1| N-acetylglucosaminidase [Xanthomonas campestris pv. vasculorum
           NCPPB 702]
          Length = 798

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 144/477 (30%), Positives = 219/477 (45%), Gaps = 72/477 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W+D + VLQ++IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQQQILTRMRELGMQPVLPAFAGYVPKAFAQAHPHARIY 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG T   Y  D F+E  PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFMELYTQAYG-TGEFYLADAFNEMLPP 313

Query: 121 -------VDSPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
                  V + +Y  S+                       G A+Y  +   +  A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+P  + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 374 GWLFGADREFWQPQAIAAFLGKVPDARLLVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433

Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
           +  +  +YG     AF   + +     SE   + G G+  EG+  N VVY+ +  +A++ 
Sbjct: 434 YGASNPLYG---DFAFYRQDLQALLADSEKRNLRGFGIFPEGLHSNSVVYEYLYALAWEG 490

Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
            +     W+ QY   RYGRS  A+  AW+ L   +Y     +               P  
Sbjct: 491 PQQPWSQWLMQYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 535

Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
            +   G Y  + +P +   ++K +    D   L        RA++  +      + +  Y
Sbjct: 536 WNKRAGAYLLFKRPTAD--IVKFDDRPGDPQRL-------RRAIDALLQQAERYADAPLY 586

Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACH 440
           RYDLI+  R  L+  A+     +++AY   D A G  QL+ R  +LV+ +D L+   
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDVQLA-RITQLVQGLDALVGGQ 642


>gi|372221472|ref|ZP_09499893.1| alpha-N-acetylglucosaminidase [Mesoflavibacter zeaxanthinifaciens
           S86]
          Length = 712

 Score =  202 bits (514), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 118/311 (37%), Positives = 170/311 (54%), Gaps = 18/311 (5%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++G  GPLPQ W+ ++  LQKKIL ++ +LGM PV+PAFSG +PAAL   FP+AKI+
Sbjct: 200 MGNINGHAGPLPQEWITKKAKLQKKILSKMRDLGMKPVVPAFSGYIPAALAEKFPNAKIS 259

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L  W     D     TYLLD  DPLF EIG+ FIE   +EYG+  + Y  D+F+E TPP
Sbjct: 260 ELNGWSGGGFD----STYLLDPKDPLFKEIGKRFIELYNQEYGKAEY-YLADSFNEVTPP 314

Query: 121 VDSPEYISSL---GAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK 176
           V +   +  L   G  IY  +      A W+MQGWLF +D  FW    + A L+ VP  K
Sbjct: 315 VSTENKLDELAAYGQVIYETLNEAAPGATWVMQGWLFGHDAYFWEKDAVIAFLSKVPNDK 374

Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA----RTS 232
           L++ D   +   +W     FYG  + +  +HN+ G+  +YG  D   F   E        
Sbjct: 375 LIIQDFGNDRYKVWEKQDAFYGKQWTYGYVHNYGGSNPIYGDFD---FYKEEINYLLEHD 431

Query: 233 ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS-VPAIQDA 291
           ++T ++G G+  EG+ QN +VY+ + ++ +   K+ VK W+      RYG+        A
Sbjct: 432 KSTKVLGYGVMPEGLHQNSMVYEYLYDLPWD-SKIPVKDWLKTNIKARYGKDFTKETLTA 490

Query: 292 WNVLYHTVYNC 302
           W  L   VY+ 
Sbjct: 491 WIKLDSAVYST 501


>gi|288927801|ref|ZP_06421648.1| putative alpha-N-acetylglucosaminidase
           (N-acetyl-alpha-glucosaminidase) (NAG) [Prevotella sp.
           oral taxon 317 str. F0108]
 gi|288330635|gb|EFC69219.1| putative alpha-N-acetylglucosaminidase
           (N-acetyl-alpha-glucosaminidase) (NAG) [Prevotella sp.
           oral taxon 317 str. F0108]
          Length = 723

 Score =  202 bits (514), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 140/468 (29%), Positives = 225/468 (48%), Gaps = 58/468 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++G  GPL   W   Q+ LQ KIL R+ +L M+P+ P F+G VP AL+ ++P+A I 
Sbjct: 198 MGNIYGIDGPLSNQWHQDQIALQHKILDRMRKLDMHPICPGFAGFVPEALKELYPTADI- 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT-- 118
           Q   W     +      Y+L   DPLF +IG  FI++  KE+GR    Y  D+F+E    
Sbjct: 257 QYTTWEKAFHN------YILSPADPLFHKIGVMFIQEWEKEFGRCD-FYLIDSFNEMDIP 309

Query: 119 -PPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVP 173
            PP D P   E+++  G  +Y  ++  +  A W+MQGW+F Y P  W    + AL++ VP
Sbjct: 310 FPPKDDPKRYEFMADFGKKVYQCIKEANPSATWVMQGWMFGYQPEIWDYKTLNALVSQVP 369

Query: 174 LGKLVVLDLFAEV-KPIWSTS------KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K+++LDL A+  K +W T       K F G  +I+ ++ N  G   + G LD  A G 
Sbjct: 370 DNKMIMLDLAADYNKFLWKTPFNWDFYKGFCGKQWIYSVIPNMGGKSALTGALDFYAKGH 429

Query: 227 VEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
           +EA  S+N   ++G G + EGIE N VVY+L+ +  +  + V+++ W+  Y+  RYG   
Sbjct: 430 LEALNSQNRGKLIGFGFAPEGIENNEVVYELLCDAGWAKQGVELRPWLRNYTYSRYGCYP 489

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
             ++  WN +  +VY                             G ++++ +   +    
Sbjct: 490 IGMEQYWNEMIQSVY-----------------------------GSFKSHPRFNWQFRPG 520

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
           K +  S D  + +Y   E++  + L    GN+L     +  D  ++    L      L  
Sbjct: 521 KEKYGSVDLDNHFYHAVEIMAGM-LSQMKGNKL-----FEADFKEMAANYLGGKVEILVR 574

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
            I +AY+  D     QL  RF  L+  MD +L  H    +  W++ A+
Sbjct: 575 QIDKAYESQDTINANQLETRFYRLMTGMDLVLQGHPTKDMQKWIDYAR 622


>gi|393783265|ref|ZP_10371440.1| hypothetical protein HMPREF1071_02308 [Bacteroides salyersiae
           CL02T12C01]
 gi|392669544|gb|EIY63032.1| hypothetical protein HMPREF1071_02308 [Bacteroides salyersiae
           CL02T12C01]
          Length = 723

 Score =  202 bits (514), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 132/468 (28%), Positives = 224/468 (47%), Gaps = 54/468 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL   W   Q+ LQ +I+ R+ ELGM P+ PAF+G VP A     P  K  
Sbjct: 199 MGNLNTWDGPLSDEWQKSQIELQHQIINRMRELGMQPIAPAFAGFVPMAFAEKHPDIKFK 258

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            L  W            Y+L    P F EIG+ F+++  KE+G+ ++ Y  D+F+E   P
Sbjct: 259 HL-KWGGFDDK---FNAYVLPPDSPFFEEIGKRFVKEWEKEFGKNTY-YLSDSFNEMELP 313

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           V   +       ++  G +IY  + +G+ DA+W+ QGW F Y   FW    ++ALL+ VP
Sbjct: 314 VAKDDVEGKHKLLAQYGESIYRSITAGNPDAIWVTQGWTFGYQHDFWDKASLQALLSHVP 373

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W T +       FYG  +I+  + NF G   + G L   A   
Sbjct: 374 DDKMIIIDLGNDYPKWVWGTEQTWKVHDGFYGKKWIFSYVPNFGGKTPLTGDLQMYATSS 433

Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
            EA +   +  ++G G + EG+E N VVY+L+++M +  + +D + W+  Y   RYG   
Sbjct: 434 AEALKAPSHGNLIGFGSAPEGLENNEVVYELLADMGWTDQAIDPEQWMPSYCTARYGAYP 493

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
            ++++AW +   T Y+            + ++P      +   + +       +SK  V 
Sbjct: 494 ESMKNAWELFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKIDV- 534

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
                          + + +  +ELF+AS + L+ S  Y  D I+     +A  A++L+ 
Sbjct: 535 ---------------SDDFLHGIELFLASADSLNRSKLYVNDAIEFASYYIAAQADKLYK 579

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
             +           +Q   + ++L+ ++D LLA H  + L  W+E A+
Sbjct: 580 QALTEDTAGKPVAAYQHLNQAIDLLLNVDKLLASHPLYRLEEWVELAR 627


>gi|331660873|ref|ZP_08361805.1| alpha-N-acetylglucosaminidase family protein [Escherichia coli
           TA206]
 gi|422369309|ref|ZP_16449711.1| f5/8 type C domain protein [Escherichia coli MS 16-3]
 gi|315298924|gb|EFU58178.1| f5/8 type C domain protein [Escherichia coli MS 16-3]
 gi|331051915|gb|EGI23954.1| alpha-N-acetylglucosaminidase family protein [Escherichia coli
           TA206]
          Length = 1052

 Score =  201 bits (512), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 128/455 (28%), Positives = 216/455 (47%), Gaps = 37/455 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  +GGPLPQSW  Q+  L +KI  R+   G+ PV P F+G VP       P A++ 
Sbjct: 364 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 423

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G+W      P    TY+    D  F ++   + +     +G  SH Y  D F E    
Sbjct: 424 DQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQTLKTTFGDISHYYAVDPFHEGGNR 481

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D    +  +   + + M   D +AVW++Q W        +       LN +     ++L
Sbjct: 482 ADLD--MVKVAQTVQNKMLEHDKNAVWIIQNW--------QENPTDDFLNGLKKDHALIL 531

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL+A+ KP  +   +F   P+IW MLH F G +   G+ + +A   +    +E+  M GV
Sbjct: 532 DLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSGMQEVLA-QEIPQSLAESKYMKGV 590

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ E +  NP++Y+++ +MA++   +   A+I+ +   RYG   P I+ AW+++  T Y
Sbjct: 591 GVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWLTSRYGAQSPEIEQAWDIMVKTAY 650

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           +      D+ R         + SII    G    +G  V++     +    YD       
Sbjct: 651 HRR---KDRQR--------AEDSIIDAKPG----FG--VTRACTYYTALIDYDK------ 687

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
            +E  + L L+++  +    +  Y++DL+D+TRQ LA  + E +    +A+   D     
Sbjct: 688 -AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLANASYEYYRAFEDAWMAKDYSAFN 746

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
           QLS +FL L++  D +L     F+LG WL SA+ +
Sbjct: 747 QLSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTM 781


>gi|401885538|gb|EJT49648.1| alpha-N-acetylglucosaminidase, putative [Trichosporon asahii var.
           asahii CBS 2479]
          Length = 781

 Score =  201 bits (512), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 135/472 (28%), Positives = 220/472 (46%), Gaps = 33/472 (6%)

Query: 2   SNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            N+HG W G     WL+ Q  LQK+IL R  E GM PVLP F G VP  L N        
Sbjct: 237 GNIHGNWHGTTTWQWLEGQHNLQKQILARQREFGMTPVLPGFCGFVPPELHNYIGGPDFK 296

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
               W S  ++  +     +D     +  +  AF+ +Q + YG TS  Y  D F E+ P 
Sbjct: 297 TYPTWMSFPAE--YTKVRAIDPEWDTWNVVQSAFLRKQKELYGFTSDYYMVDLFTESKPT 354

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+  +  A+   + +   +A W+MQGW+F  DP  W     KA L+      L+V
Sbjct: 355 STDPTYLKGIATAVRESIHAVAPNATWIMQGWIFVNDPKSWTETASKAFLDGAG-ESLLV 413

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL AE  P W   K F+G  ++WC L N+  N  +YG LD      ++A+ +    + G
Sbjct: 414 LDLAAESYPQWKRLKNFFGRRWLWCTLINYGQNDGLYGALDKWNHDIMDAK-ANGGRLSG 472

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVPAIQDAWNVLYHT 298
           +G+  EGI  N  +++L ++  +  + +D+K W   +  RRY G+++   Q AW +L ++
Sbjct: 473 MGIVPEGINNNEHLFELATDQGWSSQAIDLKQWTQNWVKRRYRGQNLDLAQKAWELLDNS 532

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VY   + A       ++   D+ P++                  + L   T +Y    + 
Sbjct: 533 VYKSNNTALKCTTRSLI---DLRPAV------------------SGLIGTTGNYLATAIT 571

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
           Y   +V+ AL+  + S +  +    + YDL+D+ RQ     A  ++  +I A+  ++   
Sbjct: 572 YEPRDVVAALDNLLQSWSG-AGGQQFDYDLVDVARQVFVNAAIPIYQAMINAWNGSNKAD 630

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN----EEQEKQVR 466
             +  R  + L+ D+D L+A    F L  W+  A+  AQ+    ++ E Q R
Sbjct: 631 TEKYGRELVGLINDIDRLMATSRHFRLESWVGDARNWAQDAGAKDDMEFQAR 682


>gi|406693970|gb|EKC97309.1| alpha-N-acetylglucosaminidase, putative [Trichosporon asahii var.
           asahii CBS 8904]
          Length = 781

 Score =  201 bits (512), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 135/472 (28%), Positives = 220/472 (46%), Gaps = 33/472 (6%)

Query: 2   SNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            N+HG W G     WL+ Q  LQK+IL R  E GM PVLP F G VP  L N        
Sbjct: 237 GNIHGNWHGTTTWQWLEGQHNLQKQILARQREFGMTPVLPGFCGFVPPELHNYIGGPDFK 296

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
               W S  ++  +     +D     +  +  AF+ +Q + YG TS  Y  D F E+ P 
Sbjct: 297 TYPTWMSFPAE--YTKVRAIDPEWDTWNVVQSAFLRKQKELYGFTSDYYMVDLFTESKPT 354

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+  +  A+   + +   +A W+MQGW+F  DP  W     KA L+      L+V
Sbjct: 355 STDPTYLKGIATAVRESIHAVAPNATWIMQGWIFVNDPKSWTETASKAFLDGAG-ESLLV 413

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDL AE  P W   K F+G  ++WC L N+  N  +YG LD      ++A+ +    + G
Sbjct: 414 LDLAAESYPQWKRLKNFFGRRWLWCTLINYGQNDGLYGALDKWNHDIMDAK-ANGGRLSG 472

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVPAIQDAWNVLYHT 298
           +G+  EGI  N  +++L ++  +  + +D+K W   +  RRY G+++   Q AW +L ++
Sbjct: 473 MGIVPEGINNNEHLFELATDQGWSSQAIDLKQWTQNWVKRRYRGQNLDLAQKAWELLDNS 532

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           VY   + A       ++   D+ P++                  + L   T +Y    + 
Sbjct: 533 VYKSNNTALKCTTRSLI---DLRPAV------------------SGLIGTTGNYLATAIT 571

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
           Y   +V+ AL+  + S +  +    + YDL+D+ RQ     A  ++  +I A+  ++   
Sbjct: 572 YEPRDVVAALDNLLQSWSG-AGGQQFDYDLVDVARQVFVNAAIPIYQAMINAWNGSNKAD 630

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN----EEQEKQVR 466
             +  R  + L+ D+D L+A    F L  W+  A+  AQ+    ++ E Q R
Sbjct: 631 TEKYGRELVGLINDIDRLMATSRHFRLESWVGDARNWAQDAGAKDDMEFQAR 682


>gi|293402122|ref|ZP_06646261.1| alpha-N-acetylglucosaminidase family protein [Erysipelotrichaceae
           bacterium 5_2_54FAA]
 gi|291304514|gb|EFE45764.1| alpha-N-acetylglucosaminidase family protein [Erysipelotrichaceae
           bacterium 5_2_54FAA]
          Length = 2295

 Score =  201 bits (512), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 132/465 (28%), Positives = 232/465 (49%), Gaps = 50/465 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ +GGPLP +W +Q+  L +K+  R+   G++PV+  FSG VP       P+A IT
Sbjct: 398 MQNLYSYGGPLPDNWFEQRTELARKMHDRMQTYGISPVVQGFSGQVPDNFDKKQPTALIT 457

Query: 61  QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
           ++ +W    + P     Y+ ++      + L+ ++ + F + Q   +G  ++ Y  D F 
Sbjct: 458 EMKDWVGY-TRPSIIQPYITESDAAKGKENLYPQVAKDFYDAQKNVFGNVTNYYATDPFH 516

Query: 116 ENTPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP 173
           E   P  +D  E        + + M   +  AVW+MQ W  + D      ++  L   V 
Sbjct: 517 EGGNPSGLDFAETFKQ----VQTEMLKANEKAVWVMQQWQGNLDA----TKLSGL---VK 565

Query: 174 LGKLVVLDLFAEVKP---IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
             + + LDL  ++ P   +   S+     P++WCMLHNF G + M G L ++A  P  A 
Sbjct: 566 PSQALALDLQTDLNPQNGVMENSE----TPWLWCMLHNFGGRMGMDGNLPNVAKNPAIA- 620

Query: 231 TSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
            +E+  M G+G++ E +E +PV Y+L+ +M +  + +D  AWI +Y+ RR G +   +Q+
Sbjct: 621 MNESKYMKGIGITPEALENSPVAYELLFDMTWTKDPIDEDAWIAKYAQRRAGGTSEKLQE 680

Query: 291 AWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS 350
           AW +L  T Y    GA  +      ++     +II+ T                 +S  S
Sbjct: 681 AWKILNETAY----GAKQE------SYQGAAETIINAT------------PRDSFRS-AS 717

Query: 351 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEA 410
           ++ H ++ Y   E  +AL+L I + ++  AS  YRYDL D+  Q L   A E    +++A
Sbjct: 718 TWGHSNITYDKKEFEKALQLLIDNYDDFKASPAYRYDLADVANQVLCNVAIEYHSLMVKA 777

Query: 411 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
              ++A    + S++FLE+++  D +L   + F++G W+  A+ +
Sbjct: 778 KNESNADDFRKYSKKFLEIIDLSDEILGSSEEFMVGNWINDARNM 822


>gi|393785795|ref|ZP_10373941.1| hypothetical protein HMPREF1068_00221 [Bacteroides nordii
           CL02T12C05]
 gi|392661414|gb|EIY55000.1| hypothetical protein HMPREF1068_00221 [Bacteroides nordii
           CL02T12C05]
          Length = 724

 Score =  201 bits (512), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 134/468 (28%), Positives = 223/468 (47%), Gaps = 54/468 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL   W + Q+ LQ +I+ R+ ELGM+P+ PAF+G VP A     P  K  
Sbjct: 199 MGNLNTWDGPLSDEWQEGQIQLQHQIINRMRELGMSPIAPAFAGFVPMAFAEKHPDIKFK 258

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            L  W            Y+L    P F EIG+ F+++  KE+G+ ++ Y  D+F+E   P
Sbjct: 259 HL-KWGGFDDK---FNAYVLPPDSPFFEEIGKRFVKEWEKEFGKNTY-YLSDSFNEMELP 313

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           V   +       ++  G +IY  + +G+ DA+W+ QGW F Y   FW    ++ALL+ VP
Sbjct: 314 VAKDDVEGKHKLLAQYGESIYRSITAGNPDAIWVTQGWTFGYQHSFWDKASLQALLSHVP 373

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K +W T +       FYG  +I+  + NF G   M G L   A   
Sbjct: 374 DDKMIIIDLGNDYPKWVWGTEQTWKVHDGFYGKKWIFSYVPNFGGKTPMTGDLQMYASSS 433

Query: 227 VEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
            EA  SE +  ++G G + EG+E N VVY+L+++M +  + +D+  W+  Y + RYG   
Sbjct: 434 AEALQSESHGNLIGFGSAPEGLENNEVVYELLADMGWTDQAIDLDKWMPSYCMARYGAYP 493

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
             ++DAW++   T Y+            ++  PD                     K  + 
Sbjct: 494 ETMKDAWDLFRKTAYSSLYSYPRFTWQTVI--PD---------------------KRRIS 530

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
           K + S            + +  +ELF+ S + L  S  Y  D I+     +A  A++L+ 
Sbjct: 531 KIDVS-----------DDFLHGVELFLNSADSLKNSKLYVNDAIEFASYYIAAKADKLYG 579

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
             +    +  +    Q   + ++++ ++D LLA H  + L  W+  A+
Sbjct: 580 KALAEDTVGRSAVAQQYLNQTIDMLLNVDKLLASHPLYRLEEWVNFAR 627


>gi|432896403|ref|ZP_20107613.1| hypothetical protein A13U_00343 [Escherichia coli KTE192]
 gi|433031274|ref|ZP_20219108.1| hypothetical protein WIA_04388 [Escherichia coli KTE109]
 gi|431432398|gb|ELH14169.1| hypothetical protein A13U_00343 [Escherichia coli KTE192]
 gi|431538475|gb|ELI14460.1| hypothetical protein WIA_04388 [Escherichia coli KTE109]
          Length = 1049

 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 128/455 (28%), Positives = 216/455 (47%), Gaps = 37/455 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  +GGPLPQSW  Q+  L +KI  R+   G+ PV P F+G VP       P A++ 
Sbjct: 361 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 420

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G+W      P    TY+    D  F ++   + +     +G  SH Y  D F E    
Sbjct: 421 DQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQTLKTTFGDISHYYAVDPFHEGGNR 478

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D    +  +   + + M   D +AVW++Q W        +       LN +     ++L
Sbjct: 479 ADLD--MVKVAQTVQNKMLEHDKNAVWIIQNW--------QENPTDDFLNDLKKDHALIL 528

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL+A+ KP  +   +F   P+IW MLH F G +   G+ + +A   +    +E+  M GV
Sbjct: 529 DLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSGMQEVLA-QEIPQSLAESKYMKGV 587

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ E +  NP++Y+++ +MA++   +   A+I+ +   RYG   P I+ AW+++  T Y
Sbjct: 588 GVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWLTSRYGAQSPEIEQAWDIMVKTAY 647

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           +      D+ R         + SII    G    +G  V++     +    YD       
Sbjct: 648 HRR---KDRQR--------AEDSIIDAKPG----FG--VTRACTYYTALIDYDK------ 684

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
            +E  + L L+++  +    +  Y++DL+D+TRQ LA  + E +    +A+   D     
Sbjct: 685 -AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLANASYEYYRAFEDAWMAKDYSAFN 743

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
           QLS +FL L++  D +L     F+LG WL SA+ +
Sbjct: 744 QLSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTM 778


>gi|161505009|ref|YP_001572121.1| hypothetical protein SARI_03139 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:- str. RSK2980]
 gi|160866356|gb|ABX22979.1| hypothetical protein SARI_03139 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:-]
          Length = 1014

 Score =  201 bits (510), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 126/455 (27%), Positives = 218/455 (47%), Gaps = 37/455 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  +GGPLPQSW  Q+  L +KI  R+   G+ PV P F+G VP       P A++ 
Sbjct: 326 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 385

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G+W      P    TY+    D  F ++   + +     +G  SH Y  D F E    
Sbjct: 386 DQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQTLKTTFGDISHYYAVDPFYEGGNR 443

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D    +  +   + + M   D DAVW++Q         W+     A LN +     ++L
Sbjct: 444 ADLN--MVKVAQTVQNKMLEHDKDAVWIIQN--------WQENPTDAFLNGLKKDHALIL 493

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DL+A+ KP  +   +F   P+IW MLH F G +   G+ + +A   +    +E+  M GV
Sbjct: 494 DLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSGMPEVLA-QEIPQSLAESKYMKGV 552

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ E +  NP++Y+++ +MA++   +   A+I+ +   RYG   P I+ AW+++  T Y
Sbjct: 553 GVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHNWLTSRYGAQSPEIEQAWDIMVKTAY 612

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           +      D+ R         + SII    G    +G          +   +Y +  + Y 
Sbjct: 613 HRR---KDRQR--------AEDSIIDAKPG----FG---------VTRACTYYNALIDYD 648

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
            +E  + L L+++  +    +  Y++DL+D+TRQ LA  + E +    +A+   D     
Sbjct: 649 KAEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLANASYEYYRAFEDAWMAQDYSAFN 708

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
           QLS +FL L++  D +L+    F+LG W+ +++ +
Sbjct: 709 QLSGKFLRLIKLQDKVLSTRPEFMLGNWINNSRTM 743


>gi|373451393|ref|ZP_09543318.1| hypothetical protein HMPREF0984_00360, partial [Eubacterium sp.
           3_1_31]
 gi|371968665|gb|EHO86120.1| hypothetical protein HMPREF0984_00360, partial [Eubacterium sp.
           3_1_31]
          Length = 2190

 Score =  201 bits (510), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 131/465 (28%), Positives = 233/465 (50%), Gaps = 50/465 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ +GGPLP +W +Q+  L +K+  R+   G++PV+  FSG VP       P+A IT
Sbjct: 390 MQNLYSYGGPLPDNWFEQRTELARKMHDRMQTYGISPVVQGFSGQVPDNFDKKQPTALIT 449

Query: 61  QLGNWFSVKSDPRWCCTYLLD-----ATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
           ++ +W    + P     Y+ +       + L+ ++ + F + Q   +G  ++ Y  D F 
Sbjct: 450 EMKDWVGY-TRPSIIQPYITENDAAKGKENLYPQVAKDFYDAQKNVFGNVTNYYATDPFH 508

Query: 116 ENTPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP 173
           E   P  +D  E        + + M   +  AVW+MQ W  + D      ++  LL    
Sbjct: 509 EGGNPSGLDFAETFKQ----VQTEMLKANEKAVWVMQQWQGNLDA----TKLSGLLKP-- 558

Query: 174 LGKLVVLDLFAEVKP---IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
             + + LDL  ++ P   +   S+     P++WCMLHNF G + M G L ++A  P  A 
Sbjct: 559 -SQALALDLQTDLNPQNGVMENSE----TPWLWCMLHNFGGRMGMDGNLPNVAKNPAIA- 612

Query: 231 TSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
            +E+  M G+G++ E +E +PV Y+L+ +M +  + +D  AWI +Y+ RR G +   +Q+
Sbjct: 613 MNESKYMKGIGITPEALENSPVAYELLFDMTWTKDPIDEDAWIAKYAQRRAGGTSEKLQE 672

Query: 291 AWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS 350
           AW +L  T Y    GA  +      ++     +II+ T            +++   +  S
Sbjct: 673 AWKILNETAY----GAKQE------SYQGAAETIINAT-----------PRDSFRSA--S 709

Query: 351 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEA 410
           ++ H ++ Y   E  +AL+L I + ++  AS  YRYDL D+  Q L   A E    +++A
Sbjct: 710 TWGHSNITYDKKEFEKALQLLIDNYDDFKASPAYRYDLADVADQVLCNVAIEYHSLMVKA 769

Query: 411 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
              ++A    + S++FLE+++  D +L   + F++G W+  A+ +
Sbjct: 770 KNESNADDFRKYSKKFLEIIDLSDEILGSSEEFMVGNWINDARNM 814


>gi|281424178|ref|ZP_06255091.1| N-acetylglucosaminidase [Prevotella oris F0302]
 gi|281401447|gb|EFB32278.1| N-acetylglucosaminidase [Prevotella oris F0302]
          Length = 723

 Score =  201 bits (510), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 139/468 (29%), Positives = 224/468 (47%), Gaps = 58/468 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N++G  GPL   W   Q+ LQ KIL R+ +L M+P+ P F+G VP AL+ ++P+A I 
Sbjct: 198 MGNIYGIDGPLSNQWHQDQIALQHKILDRMRKLDMHPICPGFAGFVPEALKELYPTADI- 256

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT-- 118
           Q   W     +      Y+L   DPLF +IG  FI++  KE+GR    Y  D+F+E    
Sbjct: 257 QYTTWEKAFHN------YILSPADPLFHKIGVMFIQEWEKEFGRCD-FYLIDSFNEMDIP 309

Query: 119 -PPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVP 173
            PP D P   E+++  G  +Y  ++  +  A W+MQGW+F Y P  W    + AL++ VP
Sbjct: 310 FPPKDDPKRYEFMADFGKKVYQCIKEANPSATWVMQGWMFGYQPEIWDYKTLNALVSQVP 369

Query: 174 LGKLVVLDLFAEV-KPIWSTS------KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K+++LDL  +  K +W T       K F G  +I+ ++ N  G   + G LD  A G 
Sbjct: 370 DNKMIMLDLAVDYNKFLWKTPFNWDFYKGFCGKQWIYSVIPNMGGKSALTGALDFYAKGH 429

Query: 227 VEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
           +EA  S+N   ++G G + EGIE N VVY+L+ +  +  + V+++ W+  Y+  RYG   
Sbjct: 430 LEALNSQNRGKLIGFGFAPEGIENNEVVYELLCDAGWAKQGVELRPWLRNYTYSRYGCYP 489

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
             ++  WN +  +VY                             G ++++ +   +    
Sbjct: 490 IGMEQYWNEMLQSVY-----------------------------GSFKSHPRFNWQFRPG 520

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
           K +  S D  + +Y   E++  + L    GN+L     +  D  ++    L      L  
Sbjct: 521 KEKYGSVDLDNHFYHAVEIMAGM-LSQMKGNKL-----FEADFKEMAANYLGGKVEILVR 574

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
            I +AY+  D     QL  RF  L+  MD +L  H    +  W++ A+
Sbjct: 575 QIDKAYESQDTINANQLETRFYRLMTGMDLVLQGHPTKDMQKWIDYAR 622


>gi|421736727|ref|ZP_16175487.1| alpha-N-acetylglucosaminidase, partial [Bifidobacterium bifidum
           IPLA 20015]
 gi|407295984|gb|EKF15606.1| alpha-N-acetylglucosaminidase, partial [Bifidobacterium bifidum
           IPLA 20015]
          Length = 1044

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 135/462 (29%), Positives = 223/462 (48%), Gaps = 41/462 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+  GGPLP +W +Q + L ++I  R+   G+ PV+  F G VPA  Q   P++   
Sbjct: 338 MQNLYSVGGPLPAAWFEQCVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 397

Query: 61  QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
             G W      P    TYL DA      +  F ++   F + Q   +G+ S+ Y  D F 
Sbjct: 398 SSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVCDTFYKAQENVFGKVSNYYAVDPFH 456

Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
           E     D  + I  +   +   M   D  AVW+MQ W +  D      ++  L +    G
Sbjct: 457 EGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 508

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
           + +VLDL ++++   ++  +  GVP++W MLHNF G + + G+ + I+    +A  S   
Sbjct: 509 QTLVLDLQSDLRSQ-ASPMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 567

Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
            M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W  +Y+ RRYG +   I+  W++L
Sbjct: 568 -MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKVWDIL 626

Query: 296 YHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
             T Y  TDG      +  +I A P  D +I S                       S++ 
Sbjct: 627 LDTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWG 663

Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
           H  + Y   +  +A  LF  + +    S  +RYD +D+ RQ LA    E      +AY+ 
Sbjct: 664 HSDIDYDKRQFEKAAALFEQAYDSYKNSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKS 723

Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
            D      LS R L++++  D LL+  D FL+G W++ A+ +
Sbjct: 724 GDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTM 765


>gi|260910505|ref|ZP_05917173.1| periplasmic beta-glucosidase [Prevotella sp. oral taxon 472 str.
           F0295]
 gi|260635347|gb|EEX53369.1| periplasmic beta-glucosidase [Prevotella sp. oral taxon 472 str.
           F0295]
          Length = 1566

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 140/466 (30%), Positives = 231/466 (49%), Gaps = 43/466 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GWGGP+ +S +  +L  Q+++L R+ +LG+ PV+  F G VP   +  FP A+I 
Sbjct: 199 MGNLEGWGGPMSESLIALRLQQQRQMLQRMRQLGIQPVVQGFPGIVPTFFKERFPQARII 258

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN--T 118
           + G W S +         LL   D +F ++  A+ +   K +G        D F E   T
Sbjct: 259 EQGKWGSFQRP-----AVLLPNNDGVFEKVAEAYYQSLTKLFGTDFEFLGGDLFHEGGIT 313

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             VD    + S+ A +   M      A W++QGW  + +P    PQ+  +L+       +
Sbjct: 314 TGVD----VGSVAAQVQRQMLRFFPRAKWVLQGW--NKNP---SPQLLRVLDKR---HTL 361

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART-SENTTM 237
           +++L  E+   W +S +F G P++W  +++F G  +M G L  I   P  A   + ++ M
Sbjct: 362 LVNLSGEIAASWESSDEFGGTPWLWGSVNHFGGKTDMGGQLPVIVTEPHRALALTVDSVM 421

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
            G+G+  EGI  NPVVYDL  + A+     DV + + QY   RYG   P +  AW ++  
Sbjct: 422 QGIGILPEGIGTNPVVYDLALKTAWHTATPDVDSMLVQYLGYRYGEVHPDLLAAWRIMLK 481

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           +VY                            EG +++     ++ ++  +  S++    +
Sbjct: 482 SVYG---------------------EFAIKGEGTFESVF--CARPSLRVTSVSTWGPKQM 518

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            Y  +++ RAL LF+ +  +L  S TY+YDL+DL RQ+LA YA   + ++++AY+  +A 
Sbjct: 519 QYQPADLYRALGLFLKAAPKLRDSETYQYDLVDLARQSLANYARTAYADVVKAYEAKNAE 578

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
            + Q ++RF  L+   D LL  +  FLLG WL+ A Q A NE   +
Sbjct: 579 QLQQATQRFERLIVLQDSLLLTNRHFLLGNWLQQATQYAPNEADRQ 624


>gi|433678127|ref|ZP_20510026.1| alpha-N-acetylglucosaminidase [Xanthomonas translucens pv.
           translucens DSM 18974]
 gi|430816763|emb|CCP40478.1| alpha-N-acetylglucosaminidase [Xanthomonas translucens pv.
           translucens DSM 18974]
          Length = 691

 Score =  198 bits (504), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 146/502 (29%), Positives = 219/502 (43%), Gaps = 71/502 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PL Q W++ +  LQ++IL R+  LGM PVLPAF G VP A     P A+I 
Sbjct: 127 MGNIEGYDAPLQQQWIEDKHALQQRILQRMRTLGMKPVLPAFVGYVPKAFAQAHPQARIY 186

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF +I   FI+   + YG+ ++ Y  D F+E  PP
Sbjct: 187 RMRAWEGFHE------TYWLDPADPLFAKIALRFIQLYDRTYGKGTY-YLADAFNEMLPP 239

Query: 121 VDS----------------------PE--------YISSLGAAIYSGMQSGDSDAVWLMQ 150
           + +                      PE         ++  G A+Y  +   + DAVW+MQ
Sbjct: 240 IAADGSDARLASYGDSTANTAKTAPPEVSPAQRDKRLADYGRALYESIHRANPDAVWVMQ 299

Query: 151 GWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW P  + A L  VP  KL+VLD+  +  P  W  S  F G  +I+  +HN
Sbjct: 300 GWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDRYPGTWKLSDAFDGKQWIYGYVHN 359

Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
           + G+  +YG    +AF   + R      +   +VG G   EG+  N VVY+ M  +A+  
Sbjct: 360 YGGSNPVYG---DLAFYRDDLRALLADKDKQQLVGFGAFPEGLHDNSVVYEYMYALAWGG 416

Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
           ++  ++ W+  Y   RYG + PA++ AW+ L   V +                    P  
Sbjct: 417 QQRSLQDWLGDYIRARYGHTSPALRAAWDDLQAAVLSTRYWT---------------PRW 461

Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
                G Y  + +P     + + E +  D P L        RAL+  +A   E + +  Y
Sbjct: 462 WRSRAGAYLLFKRPTLD--IGEFEGAPGDPPRL-------RRALDQLLALAPEYADAPLY 512

Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 444
           RYDL+D  R       +      + AY+  D         R    V+ +DGL+       
Sbjct: 513 RYDLVDFARHYATGRVDAQLQQAVAAYRRGDVAAGDAAFARVQAAVQQLDGLVGGQQE-T 571

Query: 445 LGPWLESAKQLAQNEEQEKQVR 466
           L  WL  A+  A+  +     R
Sbjct: 572 LSSWLGDAEGDAKTPQDAAYYR 593


>gi|308480701|ref|XP_003102557.1| hypothetical protein CRE_04113 [Caenorhabditis remanei]
 gi|308261289|gb|EFP05242.1| hypothetical protein CRE_04113 [Caenorhabditis remanei]
          Length = 718

 Score =  198 bits (503), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 130/475 (27%), Positives = 225/475 (47%), Gaps = 49/475 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQN---VFPSA 57
           M NL  +GG L  + +     L K+I+ R+ ELG+ P+LP F+G VP  L+    +FP++
Sbjct: 210 MGNLKAYGGGLSDAQMLNDFNLAKRIINRLLELGIVPILPTFAGFVPDQLEKDFRLFPTS 269

Query: 58  KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDE 116
           K  +L  W +  S+    C   +   DPLF +IG  F+  Q K  G   +++Y+ D F+E
Sbjct: 270 KFNRLPCWNNFTSET--SCLLSVSPFDPLFQKIGSTFLRHQKKMLGGDITNLYSADPFNE 327

Query: 117 NTPPVDSPEYISSL----GAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSV 172
              P DS ++ +S       +I +  +  D + +W++Q W F+YD  W    +K+ L++V
Sbjct: 328 -ILPSDSSKFDASFMKQTAQSIMNSCRKVDKNCIWVLQSWSFTYDQ-WPNWAIKSFLSAV 385

Query: 173 PLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS 232
           P+G L++LDL++EV P W  +  F+G  ++WC+LHNF G+ E+ G L  +  G   A   
Sbjct: 386 PIGNLLILDLYSEVVPAWQMTSSFHGHNFVWCLLHNFGGSRELRGNLQKVDKGYQLALMK 445

Query: 233 ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAW 292
             + +VG G+SME I+QN VVY  M +  +  E + +  W+  YS  RY          W
Sbjct: 446 AGSNLVGAGLSMEAIDQNYVVYQFMIDRMWSQEPIPLNNWLKSYSESRYSADFKVSHKFW 505

Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 352
            +L  + Y+  +   +    V +                   Y +P   + +        
Sbjct: 506 TILAGSFYSQPEKWGNPRFSVFL-------------------YHRPAFAKKI-------- 538

Query: 353 DHPHLWYSTSEVIRALELFIAS-GNELSASNTYRYDLIDLTRQALA-KYANELFLNIIEA 410
                W+   E    L+  + S  + L     ++ DL D+ R  +  +  NE  L++ EA
Sbjct: 539 ---EYWFPVEETFNHLQSLMPSLMHVLGDHPLFKEDLNDVMRAVIQFEIGNEAALSLTEA 595

Query: 411 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
           + + D   +       +++ + ++     +       W+E +K +A   E E+QV
Sbjct: 596 FLMEDKQQIGASCENLMDMFQKLES----YSNRDFKEWIEDSKSIAPTSE-ERQV 645


>gi|423219557|ref|ZP_17206053.1| hypothetical protein HMPREF1061_02826 [Bacteroides caccae
           CL03T12C61]
 gi|392624762|gb|EIY18840.1| hypothetical protein HMPREF1061_02826 [Bacteroides caccae
           CL03T12C61]
          Length = 715

 Score =  197 bits (502), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 135/465 (29%), Positives = 208/465 (44%), Gaps = 43/465 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGP+PQS +D +  L +K+L R+  LG+ P++P F G VP+ L+N    A I 
Sbjct: 195 MGNIEGWGGPMPQSQIDSRKKLVQKMLKRMKSLGIEPLMPGFYGMVPSNLKNK-SKAHII 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W +           +LD  DP F  +   F ++  + YG     ++ D F E    
Sbjct: 254 PQGTWGAFTRPD------ILDPMDPEFDRVAAIFYDETRRLYGSDIRFFSGDPFHEGGAT 307

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
                 +   G AI   MQ     ++W++QGW  +  P         LL  +    ++V 
Sbjct: 308 DGVA--LGDAGRAIQKTMQKHFPGSIWVLQGWQDNPKP--------GLLEKLDKRYVLVQ 357

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT-MVG 239
           +LF E    W T K + G P+IW  + NF     + G L   A     A  SE    M G
Sbjct: 358 ELFGENTNNWETRKGYEGTPFIWATVTNFGERPGINGKLQRFADEVYRASNSEYAKYMKG 417

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VG+  EGI  NPV Y+L+ E+ +  ++VDV  WI  Y   RYGR    I+ AW ++  ++
Sbjct: 418 VGILPEGINNNPVTYELLLELVWHKDRVDVDQWIESYVTARYGRITDEIRTAWKMMLKSI 477

Query: 300 YNCTDGATDK-NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           Y+   G  +    +++ A P ++   +S      + Y + + K+A               
Sbjct: 478 YSSEVGYQEGPPENILCARPALELKSVSSWGRLAKKYDRDLYKKAAF------------- 524

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
                      LF  +  E +   TYR DLI   RQ +A  A+ +F ++I AYQ      
Sbjct: 525 -----------LFAKAMPEFNEVRTYRIDLIHFLRQVIANEADSVFYDMITAYQEKKVEK 573

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
             Q   +FL +++  + LLA    F L  W + AK       ++K
Sbjct: 574 FEQEVSKFLMMIDTENELLAQDPFFRLSTWQQQAKDAGNTAAEKK 618


>gi|153806010|ref|ZP_01958678.1| hypothetical protein BACCAC_00255 [Bacteroides caccae ATCC 43185]
 gi|149130687|gb|EDM21893.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
           43185]
          Length = 715

 Score =  197 bits (502), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 135/465 (29%), Positives = 208/465 (44%), Gaps = 43/465 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGP+PQS +D +  L +K+L R+  LG+ P++P F G VP+ L+N    A I 
Sbjct: 195 MGNIEGWGGPMPQSQIDSRKKLVQKMLKRMKSLGIEPLMPGFYGMVPSNLKNK-SKAHII 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W +           +LD  DP F  +   F ++  + YG     ++ D F E    
Sbjct: 254 PQGTWGAFTRPD------ILDPMDPEFDRVAAIFYDETRRLYGSDIRFFSGDPFHEGGAT 307

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
                 +   G AI   MQ     ++W++QGW  +  P         LL  +    ++V 
Sbjct: 308 DGVA--LGDAGRAIQKTMQKHFPGSIWVLQGWQDNPKP--------GLLEKLDKRYVLVQ 357

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT-MVG 239
           +LF E    W T K + G P+IW  + NF     + G L   A     A  SE    M G
Sbjct: 358 ELFGENTNNWETRKGYEGTPFIWATVTNFGERPGINGKLQRFADEVYRASNSEYAKYMKG 417

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VG+  EGI  NPV Y+L+ E+ +  ++VDV  WI  Y   RYGR    I+ AW ++  ++
Sbjct: 418 VGILPEGINNNPVTYELLLELVWHKDRVDVDQWIESYVTARYGRITDEIRTAWKMMLKSI 477

Query: 300 YNCTDGATDK-NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           Y+   G  +    +++ A P ++   +S      + Y + + K+A               
Sbjct: 478 YSSEVGYQEGPPENILCARPALELKSVSSWGRLAKKYDRDLYKKAAF------------- 524

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
                      LF  +  E +   TYR DLI   RQ +A  A+ +F ++I AYQ      
Sbjct: 525 -----------LFAKAMPEFNEVRTYRIDLIHFLRQVIANEADSVFYDMITAYQEKKVEK 573

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
             Q   +FL +++  + LLA    F L  W + AK       ++K
Sbjct: 574 FEQEVSKFLMMIDTENELLAQDPFFRLSTWQQQAKDAGNTAAEKK 618


>gi|169351448|ref|ZP_02868386.1| hypothetical protein CLOSPI_02228 [Clostridium spiroforme DSM 1552]
 gi|169291670|gb|EDS73803.1| LPXTG-motif cell wall anchor domain protein [Clostridium spiroforme
           DSM 1552]
          Length = 1990

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 130/463 (28%), Positives = 226/463 (48%), Gaps = 40/463 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  +GG LP +W ++++ L +K+  R+   G+ PVL  FSG VP   ++ +   +  
Sbjct: 360 MQNMTSYGGKLPNNWFEERVELARKMHDRMQTYGITPVLSGFSGQVPTNFKDKYQDVQYV 419

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G+W   +  P    TY+ +     F ++   F + Q   +G  ++IY  D F E    
Sbjct: 420 AQGSWCGYER-PDMLRTYVDNGGTDYFSQMADVFYKAQRDIFGDVTNIYAVDPFHEGGKI 478

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK--LV 178
            D     + +   +   M   D DA+WL+Q W  S      P ++      + L K  ++
Sbjct: 479 GDMN--YTKVYETVQKKMMENDEDAIWLIQEWSGSIAS--NPSKL------INLDKEHVI 528

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
           VLDLF+EV P  +++ +    P+IW MLHNF G + +    + ++   P   + SE+  M
Sbjct: 529 VLDLFSEVSP-RNSALEAADTPWIWNMLHNFGGRMGLDANPEKVSQNIPNTYQNSEH--M 585

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
           VG+GM+ E IE +P+ Y+L+ +M +  + +D + W   Y+ R YG +   I++ WN+L  
Sbjct: 586 VGIGMTPEAIENSPMAYELLWDMTWTKDPIDFRQWCQDYAKRIYGGTNEDIEEVWNILLD 645

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           T YN  D            +     S+I+          +P +      +  SS+ H  +
Sbjct: 646 TGYNRKDN----------YYQGAPESVIN---------ARPTTN----FTSASSWGHSTI 682

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            Y   E+ RA+ L   + +E   S  + YDL D+TRQ ++  A E    ++ AYQ  +  
Sbjct: 683 NYDKEELERAVYLMAKNYDEFKDSPAFIYDLSDITRQLISNSAQEYHKAMVNAYQAGNLS 742

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
               LS +FLE++   D +L+ +  FL+G W+E A+ + ++ +
Sbjct: 743 EFEVLSDKFLEMILLQDQILSTNSDFLVGKWIEQARTMIEDSD 785


>gi|384417770|ref|YP_005627130.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzicola BLS256]
 gi|353460684|gb|AEQ94963.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzicola BLS256]
          Length = 798

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 140/474 (29%), Positives = 218/474 (45%), Gaps = 72/474 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 201 MGNIEGYRAPLPQQWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPKAFAQAHPHARIY 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D F+E  PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQAYG-AGEFYLADAFNEMLPP 313

Query: 121 -------VDSPEY-----------------------ISSLGAAIYSGMQSGDSDAVWLMQ 150
                  V + +Y                       +++ G A+Y  +   +  A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAAYGQALYRSIAQVNPKATWVMQ 373

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+P  + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 374 GWLFGADRAFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433

Query: 209 FAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
           +  +  +YG    +AF    ++A  ++     + G G+  EG+  N VVY+ +  +A++ 
Sbjct: 434 YGASNPLYG---DVAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVVYEYLYALAWEG 490

Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
            +     W+ +Y   RYGRS  A+  AW  L   +Y     +               P  
Sbjct: 491 PQHPWSQWLARYLRARYGRSDAALLSAWTDLEAGIYQTRYWS---------------PRW 535

Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
            +   G Y  + +P +            D P        + RA++  +   +  + +  Y
Sbjct: 536 WNTHAGAYLLFKRPTADIVNFD------DRPG---DPQRLRRAIDALLQQADRYADAPLY 586

Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLL 437
           RYDLI+  R  L+  A+     +++AY   D A G  QL+R   +LV+ +D L+
Sbjct: 587 RYDLIEDARHYLSLQADRQLQTVVQAYNAGDFARGDAQLART-TQLVQGLDALV 639


>gi|375146756|ref|YP_005009197.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Niastella
           koreensis GR20-10]
 gi|361060802|gb|AEV99793.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Niastella
           koreensis GR20-10]
          Length = 1147

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 132/453 (29%), Positives = 204/453 (45%), Gaps = 42/453 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGP+PQS +D + +L +K++ R+  LG+ PV+P F G VP    N    A++ 
Sbjct: 191 MGNIEGWGGPMPQSQIDSRKILVQKMIARMQALGIEPVMPGFYGMVPHNF-NTKSKARVI 249

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             GNW +      +    +LD TD  F  +   F E+  K YGR    ++ D F E    
Sbjct: 250 TQGNWGA------FIRPAILDPTDTAFDRVAGIFYEETKKLYGRNIRFFSGDPFHEGG-- 301

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           + +   +   GA I   MQ     A+W++QGW        +    K LL       L++ 
Sbjct: 302 ITNGVNLGKAGANIQKAMQQYFPGAIWVLQGW--------QDNPKKELLAETDKSALLIQ 353

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVG 239
           +LF E    W T   + G P+IWC ++NF     + G L+  A     A T      M G
Sbjct: 354 ELFGENTNNWETRNGYEGTPFIWCCVNNFGERPGLNGKLERYAGEVYRAATGPFREYMKG 413

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           VG+  EGI  NP  YDL+ E+ + ++ V+   WIN Y   RYG++   I  AW +   T+
Sbjct: 414 VGIMPEGINNNPASYDLVLELGWHNQPVETGKWINDYVKARYGKANDQIATAWTLFLQTI 473

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y+          +++ A P +                K VS    LK            Y
Sbjct: 474 YSNPGYQEGPPENILCARPALQV--------------KSVSSWGKLKKG----------Y 509

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
            T+   + ++ F A+      S TY+ DLI+ TRQ L+  A+ +F +++ AY+  +    
Sbjct: 510 DTALFEKGVQAFAAAAPLFGNSETYKIDLINFTRQVLSNRADTVFASLVTAYKEENTVAF 569

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 452
              +  FL L    + LL  H  + L  + + A
Sbjct: 570 NAAAEAFLSLHALTNELLNSHSYYRLTSYQQQA 602


>gi|418515337|ref|ZP_13081518.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
 gi|410708056|gb|EKQ66505.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
          Length = 782

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 138/474 (29%), Positives = 210/474 (44%), Gaps = 72/474 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 185 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 244

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D F+E  PP
Sbjct: 245 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 297

Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
           V                                   ++  G A+Y  +   +  A W+MQ
Sbjct: 298 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 357

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+   + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 358 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 417

Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
           +  +  +YG     AF   + +      +   + G G+  EG+  N V+Y+ +  +A++ 
Sbjct: 418 YGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWEG 474

Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
            +     W+  Y   RYGRS  A+  AW+ L   +Y     +               P  
Sbjct: 475 PQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 519

Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
            +   G Y  + +P +  A         D P        + RA++  +   N  + +  Y
Sbjct: 520 WNKRAGAYLLFKRPTADIADFD------DRPG---DPQRLRRAIDALLQQANRYADAPLY 570

Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLL 437
           RYDLI+  R  L+  A+     +++AY   D A G  QL+R   +LV  +D L+
Sbjct: 571 RYDLIEDARHYLSLQADRQLQAVVQAYDAGDFARGDAQLART-TQLVRGLDALV 623


>gi|403512485|ref|YP_006644123.1| alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain protein
           [Nocardiopsis alba ATCC BAA-2165]
 gi|402798758|gb|AFR06168.1| alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain protein
           [Nocardiopsis alba ATCC BAA-2165]
          Length = 718

 Score =  194 bits (493), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 131/471 (27%), Positives = 215/471 (45%), Gaps = 45/471 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  + GP+P+SW++    L +++L R   LGM PVLP F+G+VP +L      ++  
Sbjct: 160 MGNLDHFAGPMPRSWIEGHRELGRRVLERQRALGMTPVLPGFTGHVPPSLAPGRTGSRTW 219

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q               T++L  TDPL+  +    +E Q KE   T H Y  D F E  P 
Sbjct: 220 QG------------LVTHVLVPTDPLYTTLCAEIVETQ-KELFDTDHQYAIDPFIEMIPV 266

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P +   +  A   G+   D  AVW +Q W FSY   FW P +++A L+++P   L +
Sbjct: 267 DSDPGFPGLVARATIEGLTRADPRAVWFLQTWPFSYQSDFWSPERVEAFLDAIPDDHLHL 326

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI----EMYGILDSIAFGPVEARTSENT 235
           LDL+AE  P WS    F G P+ WC L NF G      ++ G  D I      A   E  
Sbjct: 327 LDLWAEYDPQWSRFHAFGGTPWTWCALLNFGGRTDPMADLQGAADRIGAAKDSAHPPE-- 384

Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-PAIQDAWNV 294
              G+G+SME    NP  ++L+ + A+       + W+  +  +RYG    PA+ + W  
Sbjct: 385 ---GIGLSMEATRNNPAFFELVVDQAWTRTGRVEEEWLPDFVAQRYGPGHDPALLEGWRG 441

Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
           L  TV   +          +  FP+    ++++     + + + +   + L++E ++   
Sbjct: 442 LLRTVLGASG---------VRIFPEQFNGVLTL-----RPHYRHLEDSSALRAEVTAL-- 485

Query: 355 PHLWYSTSEVIRALELFIASG--NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
             +WY   +++ A E  +A    + L+      +DL+D+    L++ A+  +L ++E   
Sbjct: 486 --VWYPWPDLLAAWERLVAGAETDPLAVEGPLGHDLVDVAMAVLSRVADHRYLEMVEHLD 543

Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
            +       L  RFLE+ +D+D LL     +    W   A   A   E  +
Sbjct: 544 HHPELPEGDLE-RFLEVFDDLDALLETRPEYRYRTWEAKATSWATGTEDHR 593


>gi|418520969|ref|ZP_13087015.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410702945|gb|EKQ61442.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
          Length = 798

 Score =  194 bits (493), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 138/474 (29%), Positives = 210/474 (44%), Gaps = 72/474 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D F+E  PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 313

Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
           V                                   ++  G A+Y  +   +  A W+MQ
Sbjct: 314 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+   + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 374 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433

Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
           +  +  +YG     AF   + +      +   + G G+  EG+  N V+Y+ +  +A++ 
Sbjct: 434 YGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWEG 490

Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
            +     W+  Y   RYGRS  A+  AW+ L   +Y     +               P  
Sbjct: 491 PQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 535

Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
            +   G Y  + +P +  A         D P        + RA++  +   N  + +  Y
Sbjct: 536 WNKRAGAYLLFKRPTADIADFD------DRPG---DPQRLRRAIDALLQQANRYADAPLY 586

Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLL 437
           RYDLI+  R  L+  A+     +++AY   D A G  QL+R   +LV  +D L+
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYDAGDFARGDAQLART-TQLVRGLDALV 639


>gi|21241480|ref|NP_641062.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. citri str. 306]
 gi|21106823|gb|AAM35598.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. citri str. 306]
          Length = 798

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 137/474 (28%), Positives = 209/474 (44%), Gaps = 72/474 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D F+E  PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 313

Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
           V                                   ++  G A+Y  +   +  A W+MQ
Sbjct: 314 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+   + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 374 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASRAFDNKQWIYGYVHN 433

Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
           +  +  +YG     AF   + +      +   + G G+  EG+  N V+Y+ +  +A++ 
Sbjct: 434 YGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWEG 490

Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
            +     W+  Y   RYGRS  A+  AW+ L   +Y     +               P  
Sbjct: 491 PQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 535

Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
            +   G Y  + +P +            D P        + RA++  +   N  + +  Y
Sbjct: 536 WNKRAGAYLLFKRPTADIVDFD------DRPG---DPQRLRRAIDALLRQANRYADAPLY 586

Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLL 437
           RYDLI+  R  L+  A+     +++AY   D A G  QL+R   +LV  +D L+
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYDAGDFARGDAQLART-TQLVRGLDALV 639


>gi|381169859|ref|ZP_09879021.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
 gi|380689629|emb|CCG35508.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
          Length = 798

 Score =  193 bits (491), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 137/474 (28%), Positives = 209/474 (44%), Gaps = 72/474 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D F+E  PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 313

Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
           V                                   ++  G A+Y  +   +  A W+MQ
Sbjct: 314 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+   + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 374 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433

Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
           +  +  +YG     AF   + +      +   + G G+  EG+  N V+Y+ +  +A++ 
Sbjct: 434 YGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWES 490

Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
            +     W+  Y   RYGRS  A+  AW+ L   +Y     +               P  
Sbjct: 491 PQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 535

Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
            +   G Y  + +P +            D P        + RA++  +   N  + +  Y
Sbjct: 536 WNKRAGAYLLFKRPTADIVDFD------DRPG---DPQRLRRAIDALLRQANRYADAPLY 586

Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLL 437
           RYDLI+  R  L+  A+     +++AY   D A G  QL+R   +LV  +D L+
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDAQLART-TQLVRGLDALI 639


>gi|154321596|ref|XP_001560113.1| hypothetical protein BC1G_00945 [Botryotinia fuckeliana B05.10]
          Length = 701

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 104/257 (40%), Positives = 153/257 (59%), Gaps = 7/257 (2%)

Query: 1   MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
             N+ G WGG +P +W++ Q +LQKKI+ R+ ELG+ PVLPAF+G VP  L+ V P+A I
Sbjct: 185 FGNIQGSWGGTIPLAWIEDQHLLQKKIVQRMVELGITPVLPAFTGFVPRDLRRVAPNANI 244

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
               +W ++        T+L    DPLF  +   F+  Q + YG  SHIY  D F+EN P
Sbjct: 245 INGSDWGNLFPFEYSNDTFLY-PIDPLFKTLQHTFLSLQSEYYGNVSHIYTLDQFNENLP 303

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-L 177
               P Y+ ++    Y  +QS DS+A W++QGWLF +   FW   +++A L  VP  + +
Sbjct: 304 ASGDPLYLGNISRGTYDSLQSFDSNATWMLQGWLFYAASSFWTQDRVEAYLGGVPKNESM 363

Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTT 236
           ++LDLF+E  P W  + Q+YG P+IWC LH + G   +YG + +I    +EA R SE   
Sbjct: 364 LILDLFSESFPEWENTHQYYGKPWIWCQLHGYGGTPGIYGQIYNITNSSIEAFRNSEK-- 421

Query: 237 MVGVGMSMEGIEQNPVV 253
           MVG+G +MEG + N ++
Sbjct: 422 MVGMGNTMEGQDGNGLI 438


>gi|282877909|ref|ZP_06286718.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
           35310]
 gi|281299910|gb|EFA92270.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
           35310]
          Length = 717

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 133/480 (27%), Positives = 223/480 (46%), Gaps = 56/480 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W  QQ+ LQ KI+ R+ ELGM+P+ PAF+G VP A     P     
Sbjct: 195 MGNLNQWDGPLSDAWHKQQITLQHKIISRMRELGMHPIAPAFAGFVPKAFAKKHPEINFK 254

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            L  W            Y+L      F ++G+ FIE+  +E+G  ++ Y  D+F+E   P
Sbjct: 255 HL-RWGGFADS---LNAYVLPPESSYFKQLGKLFIEEWEREFGENTY-YLSDSFNEMKLP 309

Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           V+  +       ++  G AIY  + +G+  A+W+ QGW F Y   FW    + ALL+ VP
Sbjct: 310 VNPNDEEEKCRLLAEYGKAIYQSINAGNPHAIWVTQGWTFGYQHDFWNRKSLSALLSQVP 369

Query: 174 LGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             +++++DL  +        +  W     FYG  +I+  + NF G   + G L+  A   
Sbjct: 370 NDRMIIIDLGNDYPKWVWHTEQTWKRHNGFYGKQWIFSYVPNFGGKTLLTGDLEMYATDA 429

Query: 227 VEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
             A ++ N   +VG+G + EG+E N VVY+L+S+ A+  + +++  WI  Y + RYG+  
Sbjct: 430 SLALSAANKGNLVGIGSAPEGLENNEVVYELLSDAAWTDKGINLDEWIANYCMARYGKYP 489

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVD-PSIISVTEGKYQNYGKPVSKEAV 344
             ++ AWN    +VY+            + ++P     ++I  T  K             
Sbjct: 490 DKMKAAWNGFRKSVYSS-----------LYSYPRFTWQTVIPDTRRK------------- 525

Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
                S +D    ++      +A+E F++  +EL  +  Y+ D I    Q L   A+  +
Sbjct: 526 -----SRHDLNETYF------KAVEDFLSCADELGGAKFYQDDAILFAAQYLGAKADIYY 574

Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
            N +    LN      +   + +EL+   D +LA H    L  W+  A+      +++ Q
Sbjct: 575 ENALRYGSLNKHVEANKQLSKAIELLLFADKILASHPTDRLDVWIAKARSQGHTPQEKNQ 634


>gi|390989490|ref|ZP_10259787.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
 gi|372555759|emb|CCF66762.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
          Length = 798

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 137/474 (28%), Positives = 209/474 (44%), Gaps = 72/474 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D F+E  PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 313

Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
           V                                   ++  G A+Y  +   +  A W+MQ
Sbjct: 314 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+   + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 374 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433

Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
           +  +  +YG     AF   + +      +   + G G+  EG+  N V+Y+ +  +A++ 
Sbjct: 434 YGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWEG 490

Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
            +     W+  Y   RYGRS  A+  AW+ L   +Y     +               P  
Sbjct: 491 PQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 535

Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
            +   G Y  + +P +            D P        + RA++  +   N  + +  Y
Sbjct: 536 WNKRAGAYLLFKRPTADIVDFD------DRPG---DPQRLRRAIDALLRQANRYADAPLY 586

Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLL 437
           RYDLI+  R  L+  A+     +++AY   D A G  QL+R   +LV  +D L+
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDAQLART-TQLVRGLDALV 639


>gi|325922205|ref|ZP_08183992.1| Alpha-N-acetylglucosaminidase (NAGLU) [Xanthomonas gardneri ATCC
           19865]
 gi|325547324|gb|EGD18391.1| Alpha-N-acetylglucosaminidase (NAGLU) [Xanthomonas gardneri ATCC
           19865]
          Length = 807

 Score =  192 bits (487), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 136/477 (28%), Positives = 210/477 (44%), Gaps = 78/477 (16%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W+D + VLQ +IL R+ ELGM PVLPAF+G VP A     P+A+I 
Sbjct: 202 MGNIEGYRAPLPQQWIDSKRVLQTQILTRMRELGMQPVLPAFAGYVPKAFAQAHPNARIY 261

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D F+E  PP
Sbjct: 262 RMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQTYG-AGEFYLADAFNEMLPP 314

Query: 121 VD-------SPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
           V        + +Y  S+                       G A+Y  +   +  A W+MQ
Sbjct: 315 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPQATWVMQ 374

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+P  + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 375 GWLFGADREFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 434

Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
           +  +  +YG     AF   + +      +   + G G+  EG+  N VVY+ +  +A++ 
Sbjct: 435 YGASNPLYG---DFAFYRQDLQALLADPDKRNLRGFGVFPEGLHSNSVVYEYLYALAWEG 491

Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
            +     W+ QY+  RYG S  A+  AW+                         D+D  I
Sbjct: 492 PQQSWSQWLTQYTRARYGHSDAALLQAWS-------------------------DLDAGI 526

Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSY----DHPHLWYSTSEVIRALELFIASGNELSA 380
                   + + K      + K  T+      D P        + RA++  +   +  + 
Sbjct: 527 YQTRYWSLRWWNKRAGAYLLFKRPTADIVGFDDRPG---DPQRLRRAIDALLQQADRYAD 583

Query: 381 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLL 437
           +  YRYDLI+  R  L+ +A+     +++AY   D      L  R   LV+ +D L+
Sbjct: 584 APLYRYDLIEDARHYLSLHADRQLQAVVQAYGTGDFARGDALLARTTRLVQGLDALV 640


>gi|295690503|ref|YP_003594196.1| alpha-N-acetylglucosaminidase [Caulobacter segnis ATCC 21756]
 gi|295432406|gb|ADG11578.1| Alpha-N-acetylglucosaminidase [Caulobacter segnis ATCC 21756]
          Length = 770

 Score =  191 bits (484), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 140/488 (28%), Positives = 211/488 (43%), Gaps = 68/488 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLP +W+D++  LQ +IL R+  LGM P+LPAF G VP A     P A+I 
Sbjct: 202 MGNIEGYRAPLPTNWIDKKKDLQVQILGRMRSLGMTPILPAFGGYVPKAFAQKNPKARIY 261

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF +I   F+    + YG T   Y  D+F+E  PP
Sbjct: 262 RMRPWEGFHE------TYWLDPADPLFAKIAGRFLALYTQTYG-TGTYYLADSFNEMLPP 314

Query: 121 VD-------------------------------SPEYISSLGAAIYSGMQSGDSDAVWLM 149
           ++                                 + +++ G AIY  ++    DAVW+M
Sbjct: 315 INADGADARDAAYGDGAANTAATKTKVEVDPALKAQRLAAYGKAIYDSIRQARPDAVWVM 374

Query: 150 QGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLH 207
           QGWLF  D  FW P  + A L+ VP  KL++LD+  +  P +W  +K F G P+I+  +H
Sbjct: 375 QGWLFGADSHFWDPTAISAYLSLVPDDKLMILDIGNDRYPAVWKNAKAFGGKPWIYGYVH 434

Query: 208 NFAGNIEMYGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEK 266
           N+ G+  +YG LD      P  A   E   + G GM  EG+  N +VYD + ++A+   +
Sbjct: 435 NYGGSNPVYGDLDYYRRDIPAIAANPEAGKLAGFGMFPEGLHNNSIVYDAVYDLAWGAGR 494

Query: 267 VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 326
             + AW++ Y+  RYG++ P +  A   L    Y+    +               P    
Sbjct: 495 ESLSAWLSTYARARYGKTSPELDAALGQLVEAAYSTRYWS---------------PRWWK 539

Query: 327 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRAL-ELFIASGNELSASNTYR 385
              G Y  + +P +             HP         ++AL  L  A  NE      + 
Sbjct: 540 SKAGAYLFFKRPTATIGEFPP------HPGDRAKLEAAVKALTALAPAYANE----PLFV 589

Query: 386 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 445
            DL D TR       ++L    + AY+  D     Q       L   +D LL       L
Sbjct: 590 LDLTDATRHLATMKIDDLLQAAVAAYRRGDVASGDQARVEIAALALSIDKLLGVQPE-TL 648

Query: 446 GPWLESAK 453
             W++ A+
Sbjct: 649 ATWIDDAR 656


>gi|315500594|ref|YP_004089396.1| Alpha-N-acetylglucosaminidase [Asticcacaulis excentricus CB 48]
 gi|315418606|gb|ADU15245.1| Alpha-N-acetylglucosaminidase [Asticcacaulis excentricus CB 48]
          Length = 765

 Score =  191 bits (484), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 137/497 (27%), Positives = 216/497 (43%), Gaps = 71/497 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  P+PQ+W+ ++  LQ +IL R+ ELGM P+LPAF G VP A     P A+I 
Sbjct: 202 MGNIEGYLAPVPQAWIQKKHKLQSRILGRMKELGMTPILPAFGGYVPKAFAQKHPQARIY 261

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            +  W           TY LD  DPLF +I   FI    + YG   + Y  D+F+E  PP
Sbjct: 262 PMRPWEGFHE------TYWLDPADPLFAKIAARFIALYTETYGEGRY-YLADSFNEMLPP 314

Query: 121 VD-----------------------------SPEYISSLGAAIYSGMQSGDSDAVWLMQG 151
           +                                E +++ G AIY  ++    DAVW MQG
Sbjct: 315 ISHDGSDVKNAKYGDSTANTKETETVVDPAVKAERLAAYGKAIYDSIRQARPDAVWTMQG 374

Query: 152 WLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHNF 209
           WLF  D  FW P  + A L  VP  KL++LD+  +  P +W +S  F G P+I+  +HN+
Sbjct: 375 WLFGADKHFWTPDAIGAFLRDVPQDKLMILDIGNDRYPGVWQSSNAFQGKPWIYGYVHNY 434

Query: 210 AGNIEMYGIL----DSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHE 265
             +  +YG L    D I    + AR  +   + G G+  EG+  N +VY+   ++A+   
Sbjct: 435 GASNPVYGDLGFYRDDIR--GLLAR-KDTGDLKGFGLFPEGLHNNSIVYEYAYDLAWGQA 491

Query: 266 KVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSII 325
              V  W+  Y   RYG+  PA+  AW+      ++    +               P   
Sbjct: 492 NQTVTEWLTTYLKSRYGQVTPALILAWSTYVEAAFSTRYWS---------------PRWW 536

Query: 326 SVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 385
               G Y    +P +     +       HP       ++ RA++  + S      S  YR
Sbjct: 537 RSKAGAYLLCKRPTADMVEFEG------HPG---DRKKLRRAIDALL-SLKGFGGSALYR 586

Query: 386 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 445
           +D+ID  R  +++  ++  +  ++AY+  D      L    + LV  +D L+       L
Sbjct: 587 HDVIDAVRHLVSEEIDDRLIAAMKAYKSGDVKTGDGLREEVIALVTQVDTLMGAQPD-TL 645

Query: 446 GPWLESAKQLAQNEEQE 462
             W++ A       E++
Sbjct: 646 ASWIDEASAYGDTSEEK 662


>gi|147860882|emb|CAN83148.1| hypothetical protein VITISV_031934 [Vitis vinifera]
          Length = 562

 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 86/100 (86%), Positives = 94/100 (94%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPLPQSWLDQQL+LQKKIL R+YELGM PVLPAFSGNVPAAL+ +FPSAKIT
Sbjct: 235 MGNLHGWGGPLPQSWLDQQLLLQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKIT 294

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK 100
           +LGNWF+V  +PRWCCTYLLDATDPLFIEIGRAFI+QQLK
Sbjct: 295 RLGNWFTVGGNPRWCCTYLLDATDPLFIEIGRAFIQQQLK 334



 Score =  181 bits (459), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 82/92 (89%), Positives = 89/92 (96%)

Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
           DTFDENTPPVD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYDPFWRPPQMKALL+S
Sbjct: 429 DTFDENTPPVDDPEYISSLGAAIFKGMQSGDSNAIWLMQGWLFSYDPFWRPPQMKALLHS 488

Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW 203
           VP+G+LVVLDLFAEVKPIW TS+QFYGVPYIW
Sbjct: 489 VPMGRLVVLDLFAEVKPIWITSEQFYGVPYIW 520


>gi|261880009|ref|ZP_06006436.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
 gi|270333325|gb|EFA44111.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
          Length = 722

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 135/468 (28%), Positives = 208/468 (44%), Gaps = 54/468 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL+ W GPL  +W   Q+ LQ KI+ R+  LGM+P+ PAF+G VP       P  ++ 
Sbjct: 194 MGNLNSWNGPLTDAWQQGQITLQHKIIDRMRALGMHPIAPAFAGFVPEQFVEAHPGLQVK 253

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +L  W     D R    Y+L    P F +IGR F+E+  KE+G+ +  Y  D+F+E   P
Sbjct: 254 KL-TWGGF--DDR-LNAYVLSPESPYFKQIGRLFVEEWEKEFGKNT-FYQSDSFNEMEIP 308

Query: 121 VDSPEYISS------LGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
           V+  + I         G  IY  +   + DAVW+ QGW F Y    W    ++ALL  VP
Sbjct: 309 VEPGDSIGKWKLLEQYGDVIYRSIAEANPDAVWVTQGWTFGYQHKMWDSKSLQALLRHVP 368

Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
             K++++DL  +  K IW T +       +YG  +++  + NF G     G +   A   
Sbjct: 369 DDKMLIIDLANDYPKWIWKTQQTWKVQHGYYGKQWVFSYVPNFGGKTLPTGDMQMYASAS 428

Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
            EA   SE   MVG G + EGIE N V+Y+L+++M +  + VD+  WI  Y   RYG   
Sbjct: 429 AEALHHSERGNMVGFGSAPEGIENNDVIYELLADMGWTDKAVDLDLWIKDYCEARYGGYP 488

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
             +Q AW  +  +VY             + ++P      ++          + VS  A+ 
Sbjct: 489 SDMQKAWQCMLRSVYGS-----------LYSYPRFTWQTVTPDS-------RRVSTHALN 530

Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
            +  S   H                F+    +L +S  YR D I L    L   A+  + 
Sbjct: 531 DTFLSGVAH----------------FLRCARQLGSSPLYRSDAISLASLYLGTKADRHYT 574

Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
             ++               + ++L+   D LLA H    L  W++ A+
Sbjct: 575 KALDLKASGKQQAASAELHQTIDLLTKADRLLASHPTHRLDRWIQFAR 622


>gi|404403947|ref|ZP_10995531.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
          Length = 828

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 148/485 (30%), Positives = 231/485 (47%), Gaps = 71/485 (14%)

Query: 1   MSNLHGW-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
           M N+ G  G P PQ W + Q+ LQ +I+ R+  LGM PV   F+G VP A++ + P   +
Sbjct: 184 MGNMSGLDGAPTPQ-WHEAQIALQHRIIDRMEALGMTPVYQGFAGFVPPAMKRIHPETTL 242

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE-NT 118
           T+   W   K+       ++L   DPLF EIG AF+    +E+G+  + Y  D+F+E + 
Sbjct: 243 TET-KWSGFKN-------WMLSPLDPLFSEIGTAFVRAWEEEFGKGKY-YLIDSFNEMDV 293

Query: 119 P--PVDSPEYISSL---GAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSV 172
           P  P  SPE  ++L   G  IY  +   + DAVW+MQGW+F Y    W P  ++ALL   
Sbjct: 294 PFGPKGSPERAATLRHYGETIYRSLAEANPDAVWVMQGWMFGYQRNSWDPESVRALLEGA 353

Query: 173 PLGKLVVLDLFAEVKP-IWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
           P G++++LDL  +    IW + K       F+G  +I+  + NF G   + G L+  A G
Sbjct: 354 PDGRMMILDLAVDFNNFIWRSEKSWNHLQGFFGREWIYSTVPNFGGRTALIGNLEFYANG 413

Query: 226 PVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS 284
            +EA +S N   + G G S EG+E N +VY++++   +  +++D+K +++ YS  RYG  
Sbjct: 414 HLEALSSPNRGRLTGYGTSPEGVESNEIVYEIIAAAGWSDDRIDLKKFLHDYSAARYGGC 473

Query: 285 VPAIQDAWNVLYHTVYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
              I   W+ +  + YN CT+ A                        +Y+   +P S   
Sbjct: 474 PEGIDRFWSGMLQSSYNECTNNA------------------------RYRWQLRPYSHRM 509

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
                  +Y              A+E F+A   EL  +  YR D I      LA  A+ L
Sbjct: 510 PTMGINENY------------YTAIEQFLACAGELGGNELYRTDAIQYAALYLASKADML 557

Query: 404 FLNIIEAYQLNDAHG----VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
               +EA    D +G     +  + R  EL+ D D LLA H    L  W   A++    E
Sbjct: 558 ----LEAANWADLYGAREEAYDCAMRIEELLLDADRLLASHPLLRLDRWSGMARKAGCTE 613

Query: 460 EQEKQ 464
           E++++
Sbjct: 614 EEKER 618


>gi|210611122|ref|ZP_03288736.1| hypothetical protein CLONEX_00926, partial [Clostridium nexile DSM
            1787]
 gi|210152109|gb|EEA83116.1| hypothetical protein CLONEX_00926 [Clostridium nexile DSM 1787]
          Length = 1662

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 134/480 (27%), Positives = 224/480 (46%), Gaps = 65/480 (13%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M+NL G+GGP+  SW  ++  L +K  + + +LGM PVL  +SG VP  + +  PSA++ 
Sbjct: 685  MANLSGFGGPVHDSWFTERTELARKNQLIMRKLGMQPVLQGYSGMVPVDITDKDPSAQVI 744

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
            + G W S +         +L      F +  + F + Q + YG  S  Y  D F E  NT
Sbjct: 745  KQGTWCSFQRPS------MLKTDSETFDKYAQLFYKVQKEVYGDVSDYYATDPFHEGGNT 798

Query: 119  PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK-- 176
              + SP  I+     + + M   D + +W++Q         W+     ALL  +   +  
Sbjct: 799  GGM-SPTVIAE---KVLANMMEADENGIWIIQS--------WQGNPSTALLQGLDAARDH 846

Query: 177  LVVLDLFAEVKPIWSTSK-----------QFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
             +VLDL+AE  P W+ +            +F   P+++CML+NF G + ++G +++   G
Sbjct: 847  ALVLDLYAEKTPHWNETDPGSYGGAEGGGEFLNTPWVYCMLNNFGGRLGLHGHIENFVNG 906

Query: 226  PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHE-----KVDVKAWINQYSVRR 280
              +A    N  M G+G++ E    NPV+YDL  E  +  +      +++  W   Y+ RR
Sbjct: 907  VAQAAAQAN-HMAGIGITPEASVNNPVLYDLFFETIWSDDGENLSAINLDEWFKDYTTRR 965

Query: 281  YGRSVPAIQDAWNVLYHTVYNCTDGATDKN--RDVIVAFPDVDPSIISVTEGKYQNYGKP 338
            YG    +  +A  +L  TVYN       +     V+ A P +D        G    +G  
Sbjct: 966  YGAESQSAYEAMQILNDTVYNPEMNMKGQGAPESVVNARPGLDI-------GAASTWGNA 1018

Query: 339  VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAK 398
            V                 + Y  +E+ +A  L +   ++L  S  Y+YDL ++  Q L+ 
Sbjct: 1019 V-----------------IDYDKAELEKAAALLLKDYDKLKDSAGYQYDLANVLEQVLSN 1061

Query: 399  YANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
             A E    + +A++  DA    ++S  FLE++  ++ +    + F+LG WLESAK LA+N
Sbjct: 1062 TAQEYQKKMADAFREGDAEKFEKMSNSFLEIITKVEEVTGTQEEFMLGTWLESAKALAKN 1121


>gi|294667089|ref|ZP_06732314.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
           str. ICPB 10535]
 gi|292603099|gb|EFF46525.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
           str. ICPB 10535]
          Length = 798

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 142/479 (29%), Positives = 213/479 (44%), Gaps = 73/479 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W D + VLQK+IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 201 MGNIEGYRAPLPQHWTDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D F+E  PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQTYG-AGEFYLADAFNEMLPP 313

Query: 121 -------VDSPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
                  V + +Y  S+                       G A+Y  +   +  A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+   + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 374 GWLFGADREFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433

Query: 209 FAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
           +  +  +YG     AF    ++A  ++     + G G+  EG+  N V+Y  +  +A++ 
Sbjct: 434 YGASNPLYG---DFAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVIYAYLYALAWEG 490

Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
            +     W+  Y   RYGRS  A+  AW  L   +Y     +               P  
Sbjct: 491 PQQSWSQWLTHYLRARYGRSDAALLGAWADLEAGIYQTRYWS---------------PRW 535

Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
            +   G Y  + +P +            D P        + RA++  +   N  + +  Y
Sbjct: 536 WNKRAGAYLLFKRPTADIVDFD------DRPG---DPQRLRRAIDALLQQANRYADAPLY 586

Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLA-CHD 441
           RYDLI+  R  L+  A+     +++AY   D A G  QL+R   +LV  +D L+   HD
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDAQLART-TQLVRGLDALVGDQHD 644


>gi|345881765|ref|ZP_08833275.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
 gi|343918424|gb|EGV29187.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
          Length = 1552

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 130/468 (27%), Positives = 225/468 (48%), Gaps = 43/468 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GWGGP+ ++ ++ +  LQ+K+L R+  LG+ PV+  F G VP+  +  FP+A++ 
Sbjct: 196 MGNLEGWGGPMSEALIEARYQLQRKMLQRMQALGIQPVVQGFPGLVPSFFKERFPAAQLV 255

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W      P      LL +   LF ++ +A+ E  ++ YGR       D F E  NT
Sbjct: 256 LQGRWGHFNRPP-----MLLPSDKDLFQQVAKAYYESLIRCYGRDFKFLGGDLFHEGGNT 310

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             VD     +++   +     S    A W++QGW  +  P         LL+ +    ++
Sbjct: 311 KGVDVAATAAAVQQTMLRYFPS----AKWVLQGWNNNPSP--------TLLSKLDKQHVL 358

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART-SENTTM 237
           +++L  E+   W +S +F G P++W  +++F G  +M G L  +   P  A + ++N  M
Sbjct: 359 LINLSGEIAASWESSNEFGGTPWLWGSVNHFGGKTDMGGQLPVLVAEPHRAFSQTKNGVM 418

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
            G+G+  EGI  NPVVYDL  + A+     D+   +  Y   RYG    ++  AW++L H
Sbjct: 419 QGIGILPEGINSNPVVYDLALKTAWYTTTPDLDRLLRDYIAYRYGHVDESLVQAWHILSH 478

Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           +VY                            EG +++     ++  +  +  S++    +
Sbjct: 479 SVYG---------------------EFKIKGEGTFESIF--CARPGLHVTSVSTWGPKQM 515

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            Y+  ++ +AL LF    ++   S TY+YDL+DL RQ +A +A +++   ++AY+  DA 
Sbjct: 516 QYNPKDLEKALGLFRRVADQYKGSATYQYDLVDLARQVMANHARDVYAAAMQAYRNKDAA 575

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
            + +  + F+ L++  D LL     FLLG WL  A         ++Q 
Sbjct: 576 LLHEKGQEFMHLLQLQDRLLQTDTHFLLGNWLAQAANYGVTAADKQQA 623


>gi|296115989|ref|ZP_06834611.1| alpha-N-acetylglucosaminidase [Gluconacetobacter hansenii ATCC
           23769]
 gi|295977458|gb|EFG84214.1| alpha-N-acetylglucosaminidase [Gluconacetobacter hansenii ATCC
           23769]
          Length = 758

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 128/475 (26%), Positives = 220/475 (46%), Gaps = 58/475 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  +GGP+P+  ++++ V  ++I+ R+ ELGM PVLP F G VP      FP A + 
Sbjct: 222 MANMCCYGGPVPRELIEKRAVSAQQIIGRMRELGMRPVLPGFYGMVPDDFGKRFPQAHVI 281

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W   +  P W     LD  DP+F ++   + ++Q K +G  + +Y+   F E   P
Sbjct: 282 GQGEWNRFRR-PAW-----LDPRDPMFAKVAAIYYDEQKKLFG-DAPVYDIQPFQEGGTP 334

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  ++  G  I   + +    A+W++  W    D          +L  V   +L ++
Sbjct: 335 GDVP--LADAGQGIQKALDTAHPGAMWMLMAWYEEPD--------ARMLAGVDRKRLFIV 384

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG---PVEARTSENTTM 237
           DL    +   +    F G P+++  L +F G   + G   S  +G   P   RT +N  M
Sbjct: 385 DLEQNTRVRENRDADFQGAPFLYGGLWDFGGRTSLGG--SSYDYGVRLPGLWRTQKN--M 440

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA-IQDAWNVLY 296
           +G  +  EG++ NP ++DL +E A++ + VD   W   Y+ RRYG+      + AW++L 
Sbjct: 441 IGTAVFPEGMDNNPYIFDLFTEAAWRRDGVDTTQWTRDYADRRYGQPGDVHARKAWDLLL 500

Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKP-VSKEAVLKSETSSYDHP 355
           H+ +                         S      Q++G+   + +++  ++ S   H 
Sbjct: 501 HSAF-------------------------SYRATGIQDFGEASAAPDSLFNAQPSLDTHS 535

Query: 356 HLW-------YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
             W       Y    V  A+   + + +   A+  YRYDL+D+TRQA+A  A  +   I 
Sbjct: 536 AAWNGMKVLPYDPHLVEAAMAELLQASDATRATEAYRYDLVDVTRQAVANQARAMLPQIG 595

Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
           +A+   D   +  L+ R+LEL++  D LLA +  F +G WL   +  + +  Q K
Sbjct: 596 DAFAARDRAKLHALTTRWLELMDRQDSLLATNTFFRVGTWLSWPQAWSDDPAQRK 650


>gi|294627661|ref|ZP_06706243.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
           str. ICPB 11122]
 gi|292598013|gb|EFF42168.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
           str. ICPB 11122]
          Length = 798

 Score =  188 bits (478), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 139/479 (29%), Positives = 213/479 (44%), Gaps = 73/479 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+   LPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 201 MGNIEGYRASLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D F+E  PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQTYG-AGEFYLADAFNEMLPP 313

Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
           V                                   ++  G A+Y  +   +  A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSVANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW+   + A L  VP  +L+VLD+  +  P  W  S+ F    +I+  +HN
Sbjct: 374 GWLFGADREFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433

Query: 209 FAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
           +  +  +YG     AF    ++A  ++     + G G+  EG+  N V+Y+ +  +A++ 
Sbjct: 434 YGASNPLYG---DFAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVIYEYLYALAWEG 490

Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
            +     W+  Y   RYGRS  A+  AW  L   +Y     +               P  
Sbjct: 491 PQQSWSQWLTHYLRARYGRSDAALLGAWADLEAGIYQTRYWS---------------PRW 535

Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
            +   G Y  + +P +   ++  +    D   L        RA++  +   N  + +  Y
Sbjct: 536 WNKRAGAYLLFKRPTAD--IVDFDDCPGDPQRL-------RRAIDALLQQANRYADAPLY 586

Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLAC-HD 441
           RYDLI+  R  L+  A+     +++AY   D A G  QL+R   +LV  +D L+   HD
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDAQLART-TQLVRGLDALVGGQHD 644


>gi|197302378|ref|ZP_03167435.1| hypothetical protein RUMLAC_01107 [Ruminococcus lactaris ATCC 29176]
 gi|197298557|gb|EDY33100.1| F5/8 type C domain protein [Ruminococcus lactaris ATCC 29176]
          Length = 1655

 Score =  188 bits (478), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 133/482 (27%), Positives = 224/482 (46%), Gaps = 65/482 (13%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M+NL G+GGP+  +W  ++  L +K  + + +LGM PVL  +SG VP  + +  PSA++ 
Sbjct: 680  MANLSGYGGPVHDTWFTERTELARKNQLIMRKLGMQPVLQGYSGMVPVDITSKDPSAEVI 739

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
            + G W S +         +L      F +    F + Q + YG ++H Y  D F E  NT
Sbjct: 740  KQGTWCSFQRPS------MLRTDSESFTKYAALFYKVQKEVYGDSAHYYATDPFHEGGNT 793

Query: 119  PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK-- 176
              +DS    + +   + + M + D  A W++Q W        +     ALL  +   +  
Sbjct: 794  GGMDS----AVISQKVLASMMTADPHATWVIQSW--------QGNPTTALLQGLGDNRDH 841

Query: 177  LVVLDLFAEVKPIWSTSK-----------QFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
             +VLDL+AE  P W+ +            +F   P+++CML+NF G + ++G +D+   G
Sbjct: 842  ALVLDLYAEKTPHWNETNPGYYGGAEGGGEFLNTPWVYCMLNNFGGRLGLHGHIDNYVEG 901

Query: 226  PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH-----EKVDVKAWINQYSVRR 280
             V A + +   M G+G++ E    NPV+YDL  E  +       +K+++  W   Y  RR
Sbjct: 902  IVNA-SKQAEHMAGIGITPEASVNNPVLYDLFFETIWADDGNNLQKINLDEWFKNYVTRR 960

Query: 281  YGRSVPAIQDAWNVLYHTVYNCTDGATDKN--RDVIVAFPDVDPSIISVTEGKYQNYGKP 338
            YG    +   A  +L+ TVYN       +     V+ A P +D        G    +G  
Sbjct: 961  YGADSDSAYQAMEILHDTVYNPAYNMKGQGAPESVVNARPGLDI-------GAASTWGNA 1013

Query: 339  VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAK 398
            V                   Y   ++ +A EL +A  ++L  S  Y+YDL ++  Q L+ 
Sbjct: 1014 VVD-----------------YDKKKLEKAAELLLADYDKLKNSAGYQYDLANVLEQVLSN 1056

Query: 399  YANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
             A E    +  A++  DA     LS +FL +++ ++ +      FL+G W+  AK+LA+N
Sbjct: 1057 TAQEYQKKMAAAFRSGDAEEFSTLSDKFLSIIDMVEKVTGTQKEFLVGTWINGAKKLAKN 1116

Query: 459  EE 460
             +
Sbjct: 1117 SD 1118


>gi|225875033|ref|YP_002756492.1| alpha-N-acetylglucosaminidase [Acidobacterium capsulatum ATCC
           51196]
 gi|225793771|gb|ACO33861.1| alpha-N-acetylglucosaminidase [Acidobacterium capsulatum ATCC
           51196]
          Length = 800

 Score =  187 bits (476), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 130/468 (27%), Positives = 215/468 (45%), Gaps = 40/468 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  +  P+ +S LD+++   ++I+ R+ ELG+ PV P + G VP       P A + 
Sbjct: 241 MGNLCCFDEPISRSLLDRRIRSAQQIIRRLRELGITPVFPGYFGMVPEDFARRHPGAHVI 300

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             GNW   +  P W     LD  DPLF  +  +F + Q + +G +S IY+ + F E    
Sbjct: 301 PQGNWNGFRR-PAW-----LDPRDPLFAAVAASFYKHQQELFGDSS-IYDIELFQEGGSA 353

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  +SS   AI   +      A+W+   W    +P       +ALL++V    L+V+
Sbjct: 354 ADVP--VSSAAKAIQKALLRAHPQAMWMTLAW--QNNP------SRALLSAVDRSHLLVV 403

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           D+     P  +  + F G  Y++  L +F G   +   L   A           +TM G 
Sbjct: 404 DIDQGRTPHENRERDFMGAAYLFGGLWDFGGRTTLGANLYDYAVRLPRMGLRAGSTMKGT 463

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
            +  EG++ NP  +DL +EMA++   VD++ W  +Y+ RRYG   P  + AW +L  T Y
Sbjct: 464 ALFSEGLDNNPAAFDLFTEMAWRTSPVDLRTWSREYARRRYGMDDPHTRRAWRILMETAY 523

Query: 301 NC-TDGATDKN-RDVIV-AFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
               DG ++   RD    +  D  PS+ +V+                     SS+    L
Sbjct: 524 GTRADGVSNHGERDAPPESLFDAQPSLDAVS--------------------ASSWSPDRL 563

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            Y   +   AL   + +   +    TY+YDL+D+ RQ LA ++ +    I +AY      
Sbjct: 564 RYDPKKFEAALTELLQAPPGMREMPTYQYDLVDVARQTLANWSRKTLPEIKDAYDHRHEA 623

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
               L +++L ++   D LLA +  F++GPWL +    A    +++++
Sbjct: 624 RFETLEKQWLCMMMLQDKLLATNTSFMVGPWLNAVSPWAATATEQRRL 671


>gi|294648124|ref|ZP_06725667.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
 gi|292636508|gb|EFF54983.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
          Length = 499

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 125/425 (29%), Positives = 199/425 (46%), Gaps = 42/425 (9%)

Query: 35  MNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAF 94
           M PVLPAF+G+VPA L+ ++P A I  LG W       R  C +L +  D LF +I + F
Sbjct: 1   MKPVLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLF 57

Query: 95  IEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 154
           +++Q K +G T HIY  D F+E  PP   PEY+  + + +Y+ + + D  A W+   W+F
Sbjct: 58  LDEQKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMF 116

Query: 155 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 213
            +D   W   +MKALL  VP  K+++LD   E   +W  ++ F+  PYIWC L NF GN 
Sbjct: 117 YFDKDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNT 176

Query: 214 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 273
            + G +         A  +    + G+G ++EG++     Y+ + E A+ +  VD   WI
Sbjct: 177 TLTGNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWI 235

Query: 274 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 333
              + R  G     ++DAW  L++ +Y              V  P         T G   
Sbjct: 236 ECLADRHVGCVSQPVRDAWKRLFNDIY--------------VQVP--------RTLGTLP 273

Query: 334 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDL 391
            Y +P   +   K  ++ Y +  L     EV R L       NE  +   + +R DLI +
Sbjct: 274 GY-RPALNKNSEKRTSNVYSNVEL----LEVWRKL-------NEAPSDRRDAFRLDLITV 321

Query: 392 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 451
            RQ L  Y  ++ +      +  D   +     +  E++ D+D L A H    L  W++ 
Sbjct: 322 GRQVLGNYFLDVKMEFDRMVEAKDHQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDD 381

Query: 452 AKQLA 456
           A+++ 
Sbjct: 382 ARKMG 386


>gi|126347839|emb|CAJ89559.1| putative alpha-N-acetylglucosaminidase [Streptomyces ambofaciens
           ATCC 23877]
          Length = 740

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 122/462 (26%), Positives = 204/462 (44%), Gaps = 40/462 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGP+ +  L+ +  L ++I  R+ +LGM PVLP + G VP       P   + 
Sbjct: 213 MQNMSGFGGPVSERLLEDRADLGRRIADRLRQLGMTPVLPGYYGTVPPGFTERNPVGPVV 272

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G+W   +  P W     LD    +F  +  AF   Q + +G TS +Y  D   E   P
Sbjct: 273 PQGDWVGFER-PDW-----LDPRSAVFPRVAAAFYRHQRELFG-TSTMYKMDLLHEGGRP 325

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            + P  +     A+   +Q+    AVW + GW  +       P  + +++++   +L+++
Sbjct: 326 GNVP--VRDAAQAVMKALQTARPGAVWTLIGWQNN-------PSTQ-IIDAIDKRRLLIV 375

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           D  ++          ++G PY +  + NF G+  M       A    + RT   + + G+
Sbjct: 376 DGLSDRYDGLDREATWHGAPYAFGTIPNFGGHTTMGANTAVWAERFDQWRTKAGSALAGI 435

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
               EG   NPV Y+L +E+A++ E VD + W  +Y+ RRYG + P    AW +L    Y
Sbjct: 436 AYMPEGTGGNPVAYELFTELAWRTEPVDQRKWFAEYAQRRYGGADPHAASAWELLRSGPY 495

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           +   G   +++D +                         ++  +  +  +S+    + Y 
Sbjct: 496 STPSGTWSESQDSLF-----------------------TARPRLTATNAASWSPGAMRYD 532

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
              V RAL   +     L A++ YR+DL+D+ RQ LA  +  L   I  AY   D     
Sbjct: 533 PGTVRRALTELVRVAPALRATDAYRFDLVDVARQVLANRSRTLLPQIKAAYDAEDLPRFR 592

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
             +  +   +  +D LLA    FLLGPWLE AK   + E + 
Sbjct: 593 ARAAEWKNCLSLLDRLLATDARFLLGPWLEDAKSWGRTEAER 634


>gi|294812279|ref|ZP_06770922.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
           27064]
 gi|294324878|gb|EFG06521.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
           27064]
          Length = 1086

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 125/465 (26%), Positives = 210/465 (45%), Gaps = 48/465 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + N+  +GGP+  + LD+++ L ++I+ R+  LGM PV+P + G VP       P A++ 
Sbjct: 257 LQNMSEYGGPVSPALLDRRIELGQRIVTRMRRLGMRPVVPGYFGTVPDGFVARNPGARVI 316

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W  +   P W     LD   P+F EI  A+   Q + +G   H +  D   E   P
Sbjct: 317 PQGVWNGLPR-PDW-----LDPRTPVFAEIAAAYYRHQEELFGEIDH-FKMDLLHEGGTP 369

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  +     A+ + +++    A W++ GW  +  P        ALL+++   K++++
Sbjct: 370 GDVP--VPDAARAVETALRAARPAATWVILGWQSNPRP--------ALLDAIDTSKVLIV 419

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           D  +++  +     ++ G PY +  + NF G   +    D         R   N+ +VG 
Sbjct: 420 DGLSDLDTVRDREAEWGGAPYAFGTIPNFGGRTTIGANTDRWTEKFTAWRDKPNSALVGT 479

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
               E  +++P   +L +E+A++ EK+D  AW   Y+  RYG   PA ++A+  L  T Y
Sbjct: 480 AYMPEAADRDPAALELFTELAWRREKIDRSAWFAGYAQFRYGAKDPAAEEAFAALAGTAY 539

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
             T   T   R +   F                     + + ++  S  +++D       
Sbjct: 540 QLT---TTDGRPIDSLF---------------------LRRPSMSSSVATAFDQ------ 569

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
            +   R     +    EL  S+ YRYDL DL RQALA  +  L L +  AY   D     
Sbjct: 570 -AAFDRGFAALLRVNEELRGSDAYRYDLTDLARQALALRSRTLQLALRAAYATKDVTAFR 628

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
            ++  +L L+   D +  CH  FLLGPWLE AK+ A + E+  ++
Sbjct: 629 GVAALWLRLMRLADTVAGCHKAFLLGPWLEEAKRFATSTEEAVEL 673


>gi|326440885|ref|ZP_08215619.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
           27064]
          Length = 1038

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 125/465 (26%), Positives = 210/465 (45%), Gaps = 48/465 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + N+  +GGP+  + LD+++ L ++I+ R+  LGM PV+P + G VP       P A++ 
Sbjct: 209 LQNMSEYGGPVSPALLDRRIELGQRIVTRMRRLGMRPVVPGYFGTVPDGFVARNPGARVI 268

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W  +   P W     LD   P+F EI  A+   Q + +G   H +  D   E   P
Sbjct: 269 PQGVWNGLPR-PDW-----LDPRTPVFAEIAAAYYRHQEELFGEIDH-FKMDLLHEGGTP 321

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  +     A+ + +++    A W++ GW  +  P        ALL+++   K++++
Sbjct: 322 GDVP--VPDAARAVETALRAARPAATWVILGWQSNPRP--------ALLDAIDTSKVLIV 371

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           D  +++  +     ++ G PY +  + NF G   +    D         R   N+ +VG 
Sbjct: 372 DGLSDLDTVRDREAEWGGAPYAFGTIPNFGGRTTIGANTDRWTEKFTAWRDKPNSALVGT 431

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
               E  +++P   +L +E+A++ EK+D  AW   Y+  RYG   PA ++A+  L  T Y
Sbjct: 432 AYMPEAADRDPAALELFTELAWRREKIDRSAWFAGYAQFRYGAKDPAAEEAFAALAGTAY 491

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
             T   T   R +   F                     + + ++  S  +++D       
Sbjct: 492 QLT---TTDGRPIDSLF---------------------LRRPSMSSSVATAFDQ------ 521

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
            +   R     +    EL  S+ YRYDL DL RQALA  +  L L +  AY   D     
Sbjct: 522 -AAFDRGFAALLRVNEELRGSDAYRYDLTDLARQALALRSRTLQLALRAAYATKDVTAFR 580

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
            ++  +L L+   D +  CH  FLLGPWLE AK+ A + E+  ++
Sbjct: 581 GVAALWLRLMRLADTVAGCHKAFLLGPWLEEAKRFATSTEEAVEL 625


>gi|317501265|ref|ZP_07959469.1| hypothetical protein HMPREF1026_01412 [Lachnospiraceae bacterium
            8_1_57FAA]
 gi|316897332|gb|EFV19399.1| hypothetical protein HMPREF1026_01412 [Lachnospiraceae bacterium
            8_1_57FAA]
          Length = 1847

 Score =  185 bits (470), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 134/484 (27%), Positives = 226/484 (46%), Gaps = 70/484 (14%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M+NL G+GGP+  SW +++  L +K  + + +LGM PVL  +SG VP  + +   +A++ 
Sbjct: 658  MANLSGFGGPVHDSWFEERTELARKNQLIMRKLGMQPVLQGYSGMVPTNIHDYDKNAEVI 717

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            + G W S +         +L  T   F +  + F + Q + YG  S+ Y  D F E    
Sbjct: 718  EQGEWCSFQRPT------MLKTTSSTFEKYAKKFYQCQKEVYGDVSNYYATDPFHEG--G 769

Query: 121  VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP-----LG 175
            +      S +   + + M + D DAVW++Q W        +     ALLN +        
Sbjct: 770  ITGGMNASDISEKVLTEMITADKDAVWIIQSW--------QGNPTTALLNGLDRVEKGTD 821

Query: 176  KLVVLDLFAEVKP----------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
              ++LDL+AE  P           +   ++F   P+++CML+NF G + ++G LD++A  
Sbjct: 822  HALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWLFCMLNNFGGRLGLHGHLDNLA-N 880

Query: 226  PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH------EKVDVKAWINQYSVR 279
             +    +E   + G+G++ E    NPV+YD + E  +Q       E +D+  W++ Y+ R
Sbjct: 881  NIPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIWQDDASQKMEVIDLDTWLDDYATR 940

Query: 280  RYGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 334
            RYG    +   AW++L  TVY  +      GA +    V+ A P++       T G    
Sbjct: 941  RYGAESESANQAWDILKETVYKASLNGLGQGAPES---VVNARPNL-------TIGAAST 990

Query: 335  YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 394
            +G  V                 + Y   ++  A  L +A  ++L  S  Y+YDL ++ +Q
Sbjct: 991  WGNAV-----------------ISYEKGDLEEAAALLLADYDKLKDSAGYQYDLANVLQQ 1033

Query: 395  ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 454
             L+  A E    +  A+   D       S +F+ ++EDM+ +    + FLLG W+E AK 
Sbjct: 1034 VLSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIEDMEKVTGTSEYFLLGRWVEQAKA 1093

Query: 455  LAQN 458
            LA N
Sbjct: 1094 LANN 1097


>gi|118370728|ref|XP_001018564.1| alpha-N-acetylglucosaminidase precursor [Tetrahymena thermophila]
 gi|89300331|gb|EAR98319.1| alpha-N-acetylglucosaminidase precursor [Tetrahymena thermophila
           SB210]
          Length = 879

 Score =  185 bits (470), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 152/495 (30%), Positives = 236/495 (47%), Gaps = 53/495 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL G+GGP+ Q+++D Q  LQKKIL R+  LGM P+L  F G VP +L+  FP +KI 
Sbjct: 218 MGNLEGYGGPVTQAYIDGQYNLQKKILKRMRNLGMQPILQGFYGMVPNSLKAKFPLSKIY 277

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN--T 118
              +W   +          LDA D LF  I   F  +  K YGR +  Y  D F E    
Sbjct: 278 GDQSWLGFRRPA------FLDANDELFSNIANIFYSESEKLYGR-AKFYGGDPFHEGAIV 330

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDS----DAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
           P ++    ++S   +IY  MQ  D+       W++Q W  +      P Q   LL  +  
Sbjct: 331 PGLN----LTSQAQSIYRAMQYTDNPKDEKVKWILQSWQEN------PSQQ--LLQGLQN 378

Query: 175 GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN 234
            + ++LDL AE +  W T+  F G  ++W  L NF   I  YG+++     P  A + +N
Sbjct: 379 DECIILDLMAEARSKWQTND-FSGHDFLWTSLPNFGLRIGQYGMIEQYVSQPPLAYSIKN 437

Query: 235 TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVD--------VKAWINQYSVRRYG-RSV 285
           +TM G+G   EGI  N + Y+++ + A+     D        V  ++  +   RYG ++ 
Sbjct: 438 STMKGIGSIPEGILTNVLDYEILFDKAWIQPNQDTNLTPRQQVLQYLGDFIRYRYGEQNN 497

Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA-- 343
             +  AW++L +++YN T+     +  V++A P      +S     Y  +      EA  
Sbjct: 498 KNLFSAWSLLTNSIYNSTNPWDGPSESVMLARPASYIDKVSSWGTSYIYWNTTNVLEAWK 557

Query: 344 ----VLKSETSSYDHPHLWYSTSEVIRAL-------ELFIA-SGNELSA--SNTYRYDLI 389
                +K +       HL     E+ + L       E F+  S NE      +T+ YDL+
Sbjct: 558 LFTNYVKEKKQKNRSQHL-QKLEEINKKLGRSDDDMEAFVEISQNEERNIFKDTFLYDLV 616

Query: 390 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 449
           D+ RQ LA Y+  L+  ++ A+   D       S++FLEL++D D LL+    F+LG +L
Sbjct: 617 DVARQNLASYSYLLYNKVMLAFNQTDTIKFALYSQQFLELIKDQDQLLSSRKEFMLGYYL 676

Query: 450 ESAKQLAQNEEQEKQ 464
           ES  +L    +QEKQ
Sbjct: 677 ESVSKLG-TTDQEKQ 690


>gi|153814573|ref|ZP_01967241.1| hypothetical protein RUMTOR_00787 [Ruminococcus torques ATCC 27756]
 gi|331089988|ref|ZP_08338878.1| hypothetical protein HMPREF1025_02461 [Lachnospiraceae bacterium
            3_1_46FAA]
 gi|145848067|gb|EDK24985.1| F5/8 type C domain protein [Ruminococcus torques ATCC 27756]
 gi|330402902|gb|EGG82468.1| hypothetical protein HMPREF1025_02461 [Lachnospiraceae bacterium
            3_1_46FAA]
          Length = 1863

 Score =  185 bits (470), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 134/484 (27%), Positives = 226/484 (46%), Gaps = 70/484 (14%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M+NL G+GGP+  SW +++  L +K  + + +LGM PVL  +SG VP  + +   +A++ 
Sbjct: 674  MANLSGFGGPVHDSWFEERTELARKNQLIMRKLGMQPVLQGYSGMVPTNIHDYDKNAEVI 733

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            + G W S +         +L  T   F +  + F + Q + YG  S+ Y  D F E    
Sbjct: 734  EQGEWCSFQRPT------MLKTTSSTFEKYAKKFYQCQKEVYGDVSNYYATDPFHEG--G 785

Query: 121  VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP-----LG 175
            +      S +   + + M + D DAVW++Q W        +     ALLN +        
Sbjct: 786  ITGGMNASDISEKVLTEMITADKDAVWIIQSW--------QGNPTTALLNGLDRVEKGTD 837

Query: 176  KLVVLDLFAEVKP----------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
              ++LDL+AE  P           +   ++F   P+++CML+NF G + ++G LD++A  
Sbjct: 838  HALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWLFCMLNNFGGRLGLHGHLDNLA-N 896

Query: 226  PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH------EKVDVKAWINQYSVR 279
             +    +E   + G+G++ E    NPV+YD + E  +Q       E +D+  W++ Y+ R
Sbjct: 897  NIPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIWQDDASQKMEVIDLDTWLDDYATR 956

Query: 280  RYGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 334
            RYG    +   AW++L  TVY  +      GA +    V+ A P++       T G    
Sbjct: 957  RYGAESESANQAWDILKETVYKASLNGLGQGAPES---VVNARPNL-------TIGAAST 1006

Query: 335  YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 394
            +G  V                 + Y   ++  A  L +A  ++L  S  Y+YDL ++ +Q
Sbjct: 1007 WGNAV-----------------ISYEKGDLEEAAALLLADYDKLKDSAGYQYDLANVLQQ 1049

Query: 395  ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 454
             L+  A E    +  A+   D       S +F+ ++EDM+ +    + FLLG W+E AK 
Sbjct: 1050 VLSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIEDMEKVTGTSEYFLLGRWVEQAKA 1109

Query: 455  LAQN 458
            LA N
Sbjct: 1110 LANN 1113


>gi|336439030|ref|ZP_08618649.1| hypothetical protein HMPREF0990_01043 [Lachnospiraceae bacterium
            1_1_57FAA]
 gi|336017072|gb|EGN46842.1| hypothetical protein HMPREF0990_01043 [Lachnospiraceae bacterium
            1_1_57FAA]
          Length = 1863

 Score =  185 bits (470), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 134/484 (27%), Positives = 226/484 (46%), Gaps = 70/484 (14%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M+NL G+GGP+  SW +++  L +K  + + +LGM PVL  +SG VP  + +   +A++ 
Sbjct: 674  MANLSGFGGPVHDSWFEERTELARKNQLIMRKLGMQPVLQGYSGMVPTNIHDYDKNAEVI 733

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            + G W S +         +L  T   F +  + F + Q + YG  S+ Y  D F E    
Sbjct: 734  EQGEWCSFQRPT------MLKTTSSTFEKYAKKFYQCQKEVYGDVSNYYATDPFHEG--G 785

Query: 121  VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP-----LG 175
            +      S +   + + M + D DAVW++Q W        +     ALLN +        
Sbjct: 786  ITGGMNASDISEKVLTEMITADKDAVWIIQSW--------QGNPTTALLNGLDRVEKGTD 837

Query: 176  KLVVLDLFAEVKP----------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
              ++LDL+AE  P           +   ++F   P+++CML+NF G + ++G LD++A  
Sbjct: 838  HALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWLFCMLNNFGGRLGLHGHLDNLA-N 896

Query: 226  PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH------EKVDVKAWINQYSVR 279
             +    +E   + G+G++ E    NPV+YD + E  +Q       E +D+  W++ Y+ R
Sbjct: 897  NIPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIWQDDASQKMEVIDLDTWLDDYATR 956

Query: 280  RYGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 334
            RYG    +   AW++L  TVY  +      GA +    V+ A P++       T G    
Sbjct: 957  RYGAESESANQAWDILKETVYKASLNGLGQGAPES---VVNARPNL-------TIGAAST 1006

Query: 335  YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 394
            +G  V                 + Y   ++  A  L +A  ++L  S  Y+YDL ++ +Q
Sbjct: 1007 WGNAV-----------------ISYEKGDLEEAAALLLADYDKLKDSAGYQYDLANVLQQ 1049

Query: 395  ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 454
             L+  A E    +  A+   D       S +F+ ++EDM+ +    + FLLG W+E AK 
Sbjct: 1050 VLSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIEDMEKVTGTSEYFLLGRWVEQAKA 1109

Query: 455  LAQN 458
            LA N
Sbjct: 1110 LANN 1113


>gi|331092442|ref|ZP_08341267.1| hypothetical protein HMPREF9477_01910 [Lachnospiraceae bacterium
            2_1_46FAA]
 gi|330401285|gb|EGG80874.1| hypothetical protein HMPREF9477_01910 [Lachnospiraceae bacterium
            2_1_46FAA]
          Length = 1598

 Score =  182 bits (463), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 130/479 (27%), Positives = 224/479 (46%), Gaps = 66/479 (13%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M+NL G+GGP+  SW +++  L +K  + +  LGM PVL  +SG VP  ++    SA++ 
Sbjct: 680  MANLSGFGGPIHDSWFEERTELARKNQLSMRRLGMQPVLQGYSGMVPTNIREKDSSAEVI 739

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
            + G W S +         +L      F +  + F + Q + YG ++H Y  D F E    
Sbjct: 740  EQGTWCSFRRPD------MLKTDSASFDKYAKLFYQAQKEVYGESAHYYATDPFHEGGDT 793

Query: 121  VD-SPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK--L 177
               +P  I+     +   M   D D +W++Q         W+     ALL  +   K   
Sbjct: 794  GGLNPTVIA---GKVLDAMLEADKDGIWIIQS--------WQGNPTTALLKGLEGRKEHA 842

Query: 178  VVLDLFAEVKPIWSTSK-------QFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
            +VLDL+AE  P W+ +        +F   P+++CML+NF G + ++G LD++A   + A 
Sbjct: 843  LVLDLYAEKTPHWNETNPNEYGGGEFNDTPWVFCMLNNFGGRLGLHGHLDNLAKN-IPAA 901

Query: 231  TSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQ---HEK---VDVKAWINQYSVRRYGRS 284
             +    M G+G++ E    NP++YD + E  +     EK   +D+  W+  Y+ RRYG+ 
Sbjct: 902  LNSAKHMEGIGITPEASVNNPLLYDFLFETVWTDNAKEKLPVIDLDKWLKDYAKRRYGKE 961

Query: 285  VPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV 339
              +  +A  ++  TVY         GA +    V+ A P +D    S        +G  V
Sbjct: 962  SQSAYEALLIMKDTVYKAELNMKGQGAPES---VVNARPALDIGAAST-------WGNAV 1011

Query: 340  SKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 399
                             + Y  +++ +A EL +   ++L  S+ Y YDL  + +Q L+  
Sbjct: 1012 -----------------ISYDKAKLEKAAELLLKDYDKLKDSDGYMYDLATMLQQVLSNS 1054

Query: 400  ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
            A E    +  A++ N+       + +FL +++ M+ + +    +LLG W+E AK LA+N
Sbjct: 1055 AQEYQRKMANAFKENNKEEFNTYADKFLSIIDSMEKVTSTSKYYLLGTWVEQAKALAKN 1113


>gi|187735714|ref|YP_001877826.1| alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
           BAA-835]
 gi|187425766|gb|ACD05045.1| Alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
           BAA-835]
          Length = 852

 Score =  182 bits (463), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 124/467 (26%), Positives = 212/467 (45%), Gaps = 45/467 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ-NV-FPSAK 58
           M NL G GGPL Q  +++   + ++I+ R+ +LGM PVL  + G VP+  Q NV     K
Sbjct: 198 MGNLEGHGGPLSQQQINKMAQMGRRIVSRMEQLGMTPVLQGYVGFVPSDFQENVRIDGLK 257

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           +   G W + +        +++D T   F ++   + +   K YG    ++  D F E  
Sbjct: 258 LIPQGEWVNFRR------PWVVDPTCEAFPKLAADWYKALRKVYGIPGKMFGGDLFHEGG 311

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
              D    ++     +   MQ     A W++Q W    +P       + LL+ +   + +
Sbjct: 312 RKGDID--VTQAAQEVQKAMQKASPGAFWVIQAW--GGNP------TRELLSGLDPERAL 361

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
           VL L  ++       + F G+P++WC L NF GN  MYG +  ++    E    ++  +V
Sbjct: 362 VLQLTKDMANGGKNLRTFNGIPWVWCELANFGGNTGMYGGVPLLSRLGSELSGYKDKGLV 421

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           G+G   EG+E NP+ Y L S+  +  E + V+ W+ +Y+ +RYG +  A+  A  VL  +
Sbjct: 422 GMGTLSEGLETNPLHYALFSDRLWTREDISVREWLGKYARQRYGFAPKAVVKALEVLSFS 481

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
           +YN           +I A P  +                          + S++     +
Sbjct: 482 IYNPVRSQEGCTESIICARPSWNV------------------------RKASTWSSGERY 517

Query: 359 YSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
           Y   ++++A   ++ + N+   L    T+RYDL+D+ RQALA  A      +  A+   D
Sbjct: 518 YHLGDIVKAARGYLKAANDQPNLVKKETFRYDLVDVVRQALADAAFYQLQQVRSAFDSGD 577

Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
                +  +RFL L+ DMD LLA    FLLG W + A     + +++
Sbjct: 578 LAAYRKQVKRFLSLISDMDALLATDSQFLLGTWQKRALDWGDSRQEK 624


>gi|16124795|ref|NP_419359.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus CB15]
 gi|221233511|ref|YP_002515947.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus NA1000]
 gi|13421729|gb|AAK22527.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus CB15]
 gi|220962683|gb|ACL94039.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus NA1000]
          Length = 770

 Score =  181 bits (460), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 130/487 (26%), Positives = 208/487 (42%), Gaps = 66/487 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLP +W+D++  LQ KIL R+  LGM P+LPAF G VP A     P A+I 
Sbjct: 202 MGNIEGYKAPLPTAWIDKKKDLQVKILGRMRSLGMTPILPAFGGYVPKAFAEKNPKARIY 261

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF +I   F+    + +G  ++ Y  D+F+E  PP
Sbjct: 262 RMRPWEGFHE------TYWLDPADPLFAKIAARFLALYTETFGAGTY-YLADSFNEMLPP 314

Query: 121 VD-------------------------------SPEYISSLGAAIYSGMQSGDSDAVWLM 149
           ++                                 + +++ G AIY  ++    DAVW+M
Sbjct: 315 INADGADARDAAYGDGTANTAVTKTKVEVDPALKAQRLAAYGKAIYDSIRQTRPDAVWVM 374

Query: 150 QGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLH 207
           QGWLF  D  FW P  + A L+ VP  KL++LD+  +  P +W  +K F G P+I+  +H
Sbjct: 375 QGWLFGADSHFWDPAAISAYLSLVPDDKLMILDIGNDRYPNVWKNAKAFGGKPWIYGYVH 434

Query: 208 NFAGNIEMYGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEK 266
           N+ G+  +YG L       P  A   +   + G GM  EG+  N +VY+ + ++A+   +
Sbjct: 435 NYGGSNPVYGDLGFYRQDIPAIAANPDAGKLAGFGMFPEGLHNNSIVYEAVYDLAWSEGQ 494

Query: 267 VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 326
                W+ +Y+  RYG++ PA+  A   L    ++    +               P    
Sbjct: 495 ASPATWLTRYARARYGKTSPALDAALGQLVEAAFSTRYWS---------------PRWWK 539

Query: 327 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRY 386
              G Y  + +P +            D P      +++  A++   A          +  
Sbjct: 540 SKAGAYLFFKRPTATVG---------DFPQHPGDRAKLEAAVKALTALAPTYGQEPLFVL 590

Query: 387 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 446
           DL D TR       ++L    + AY+  D             L   +D LL       L 
Sbjct: 591 DLTDATRHLATMKIDDLLQVAVAAYRRGDTAAGDAARVEIEALALSIDKLLGVQPD-TLA 649

Query: 447 PWLESAK 453
            W++ A+
Sbjct: 650 TWIDEAR 656


>gi|345014586|ref|YP_004816940.1| alpha-N-acetylglucosaminidase [Streptomyces violaceusniger Tu 4113]
 gi|344040935|gb|AEM86660.1| alpha-N-acetylglucosaminidase [Streptomyces violaceusniger Tu 4113]
          Length = 1044

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 124/465 (26%), Positives = 212/465 (45%), Gaps = 41/465 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + N+  +GGP+  + LD++  L ++I  R+ ELGM PV P + G VP    +  P A+  
Sbjct: 216 LQNMSEYGGPVSTALLDKRTELGRRIADRLRELGMRPVFPGYFGTVPDGFADRNPEARTV 275

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G+W  ++  P W     LD     F ++  AF   Q + +G  + ++  D   E   P
Sbjct: 276 PQGDWNGLRR-PDW-----LDPRTESFRKVAAAFYRHQRELFGE-AGLFKMDLLHEGGDP 328

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  +     A+ + +++    A+W++ GW    +P       + LL++V   +++V+
Sbjct: 329 GDVP--VPDAARAVETALRTARPGAIWVILGW--QENP------RRDLLDAVDHDRMLVV 378

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           D  +++  +    K +  VPY +  + NF G   +              R    + +VG 
Sbjct: 379 DGLSDLDTVTDREKDWGAVPYAFGTIPNFGGRTTIGAKTHMWTKRFTVWRDKPGSKLVGT 438

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
               E  E++P  ++L SE+A++ E VD   W   Y+  RYG      ++A+  L  T Y
Sbjct: 439 AYMPEAAERDPAAFELFSELAWREEAVDRAEWFRSYAEMRYGGRDAKAREAFAALRDTAY 498

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
             +      +  V  A P       S+T     NY           + T ++D       
Sbjct: 499 EISSKDGRPHDSVFAARP-------SLTARSGTNYA----------THTPAFDPAGF--- 538

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
             +V  A  L + +G  L  S+ YR+DL D+ RQALA  + +L   + +AY   D     
Sbjct: 539 --DVAFAALLGVRAG--LRDSDAYRHDLTDIARQALANRSWQLIPQLQDAYDRKDRTAFR 594

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
            L+R +L+L+   D +   H  FLLGPWLE AK++A  +E+  ++
Sbjct: 595 TLARLWLKLMRLSDDMTGAHRRFLLGPWLEDAKRMASGDEESARL 639


>gi|365876979|ref|ZP_09416485.1| alpha-N-acetylglucosaminidase [Elizabethkingia anophelis Ag1]
 gi|442587289|ref|ZP_21006107.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Elizabethkingia
           anophelis R26]
 gi|365755253|gb|EHM97186.1| alpha-N-acetylglucosaminidase [Elizabethkingia anophelis Ag1]
 gi|442562959|gb|ELR80176.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Elizabethkingia
           anophelis R26]
          Length = 712

 Score =  179 bits (453), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 126/461 (27%), Positives = 202/461 (43%), Gaps = 45/461 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GWGGP+    + QQ  LQKKIL R+ ELG+ PVL  F G VP  L+N    AK+ 
Sbjct: 193 MGNLEGWGGPVSMDMMKQQAELQKKILKRMKELGIEPVLQGFYGMVPHDLKNKISEAKVI 252

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
           + G W      P      +LD T  LF +I   +  +    YG   H +  + F E   T
Sbjct: 253 EQGKWAGEFQRPG-----ILDPTTKLFSKIADTYYTEMKNLYGEDIHYFGGEPFHEGGKT 307

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             +D    + S    I + MQ    ++ W++QG        W+      LL  +     +
Sbjct: 308 NGLDLKNVVES----IQTSMQKSYPNSTWVLQG--------WQQNPSDGLLAGLKKENTL 355

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS-ENTTM 237
           +++LF E    W   K + G  +IW  + NF     +YG L         A+ S     +
Sbjct: 356 IIELFGENTANWEKRKGYGGTSFIWSNVSNFGEKNGLYGKLQRFIDEVFRAKESIYGANL 415

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
            G+G+  EGI  NPV YDLM ++A+  EK  +  W+ +Y+  RYG+    +  AW     
Sbjct: 416 KGIGIIPEGIFNNPVAYDLMLDIAWYSEKPILDQWLTEYTKYRYGKENQDVIQAWKEFAQ 475

Query: 298 TVYNCTDGATDK-NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
           T+Y+  D   +  +  +  A P ++ + +S    + +NY +   KEAV            
Sbjct: 476 TIYSSPDVYQEGPSESIYCARPSLNVNPVSSWGTRKRNYDQSRFKEAV------------ 523

Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
                       ++F+ +  +   S TY+ D  D  RQ  A   + ++  +I+A      
Sbjct: 524 ------------KVFVKADTDFKDSETYQTDKTDFLRQVWANKGDVVYDELIKAIHEKKT 571

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 457
             + +   +FLE++   + LL  +  F L   L+ A+   +
Sbjct: 572 TKIQKSGHQFLEMISIQNMLLGNNRYFTLNRLLKEAEHFGE 612


>gi|374990497|ref|YP_004965992.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
 gi|297161149|gb|ADI10861.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
          Length = 1001

 Score =  179 bits (453), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 123/465 (26%), Positives = 204/465 (43%), Gaps = 41/465 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + N+ G+GGP+    L +++ L +KI  R+ ELGM PV P + G VP    +  P A+  
Sbjct: 174 LQNMSGYGGPVSPELLAKRIALGQKIAERLRELGMRPVYPGYFGTVPDGFVDRNPGARTV 233

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W  + + P W     LD     F ++  AF   Q + +G    ++  D   E    
Sbjct: 234 PQGTWNGL-ARPDW-----LDPRTESFGQVAAAFYRHQQELFGECD-LFKMDLLHEGGAA 286

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  ++    A+ + +Q+    A W++ GW  +          + LL++V    ++V+
Sbjct: 287 GDVP--VADAARAVETALQTARPGATWVILGWQAN--------PRRELLDAVNHDHMLVV 336

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           D  +++  I    + +  VPY +  + NF G   +       A    + R    + +VG 
Sbjct: 337 DGLSDLDSIGDREQDWGSVPYAFGTIPNFGGRTTIGAKTHIWARRFTQWRDKPGSKLVGT 396

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
               E + ++P  ++L SE+A+++  VD   W   Y+  R G      +DA+  L  T Y
Sbjct: 397 AYMAEAVGRDPAAFELFSELAWRNTAVDRDEWFRTYADVRLGGRDERARDAYAALRDTAY 456

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
             T      +  V  A PDV       T     NY   +                   + 
Sbjct: 457 QITSSDGRPHDSVFSARPDV-------TARSGTNYATRIPA-----------------FD 492

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
            ++   AL   +     L  S+ YR+DL D+ RQALA  +  L  ++ +AY+  D     
Sbjct: 493 LADFDPALAALLDVRPSLRDSDAYRHDLTDIARQALADRSWTLIPHLHDAYERKDLEAFR 552

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
            L+R +L+L+   D +   H GFLLGPWLE AK+LA +E +   +
Sbjct: 553 TLARLWLKLMRLSDDMTGAHRGFLLGPWLEDAKRLASDEAEAAHL 597


>gi|386386798|ref|ZP_10071901.1| alpha-N-acetylglucosaminidase [Streptomyces tsukubaensis NRRL18488]
 gi|385665738|gb|EIF89378.1| alpha-N-acetylglucosaminidase [Streptomyces tsukubaensis NRRL18488]
          Length = 1033

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 127/471 (26%), Positives = 211/471 (44%), Gaps = 52/471 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + N+  +GGPL ++ LD +  L +KI  R+ ELGM PVLP + G VP    +  P A++ 
Sbjct: 213 LQNMSEYGGPLSKTLLDARAELGRKITARLRELGMRPVLPGYFGTVPDGFADRNPGARVV 272

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W  ++  P W     LD    +F ++  AF   Q K +G    ++  D   E    
Sbjct: 273 AQGLWNGLRR-PDW-----LDPRTTVFPKVAAAFYRHQTKLFG-ACDLFKMDLLHEGGNA 325

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  +     A+   +++   +AVW++ GW  +          +ALL++V   +++++
Sbjct: 326 GDVP--VPDAARAVEKALRTARPNAVWVILGWQSN--------PRRALLDAVDKRRMLIV 375

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           D  +++        ++ G PY +  + NF G   +    D         R    + +VG 
Sbjct: 376 DGLSDLDTTGDRESEWGGTPYAFGTIPNFGGRTTLGANTDRWTDRFTVWRDRPGSALVGT 435

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
               E  E++P  ++L SE+A++ E++D +AW  +Y+  RYG    +   A+  L  T Y
Sbjct: 436 AYMPEAAERDPAAFELFSELAWRRERIDREAWFTEYAQIRYGSDDASAAAAFGALAATAY 495

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
                                   ++ T+G+   Y     +   L S   +   P  + +
Sbjct: 496 R-----------------------LASTDGR--PYDSHFLRRPSLTSSIGTAFDPAGFDT 530

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
                 A    +A+G EL  S+TYR+DL +L RQALA  +  L   +  A    D     
Sbjct: 531 ------AFAALLAAGPELRDSDTYRHDLTELARQALANRSRTLQFALRAARASKDVAAFR 584

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE----EQEKQVRC 467
            +S  +L+L+   D +  CH  FLLGPWLE AK+LA +     E E+  R 
Sbjct: 585 GVSALWLKLMRLADTMAGCHRSFLLGPWLEDAKRLATSPAEAVELERTARA 635


>gi|302546018|ref|ZP_07298360.1| LOW QUALITY PROTEIN: putative alpha-N-acetylglucosaminidase
           [Streptomyces hygroscopicus ATCC 53653]
 gi|302463636|gb|EFL26729.1| LOW QUALITY PROTEIN: putative alpha-N-acetylglucosaminidase
           [Streptomyces himastatinicus ATCC 53653]
          Length = 679

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 129/467 (27%), Positives = 202/467 (43%), Gaps = 45/467 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  WGGP+  + L+++  L ++I  R+ ELGM PV P + G VP    +  P A   
Sbjct: 177 MQNMSEWGGPVSTALLEKRTDLGRRIADRLRELGMRPVFPGYFGTVPDGFADRNPGAHTV 236

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G+W  ++  P W     LD     F E+  AF   Q   +G    ++  D   E  N 
Sbjct: 237 PQGDWNGLRR-PDW-----LDPRTDAFHEVAAAFYRHQHDLFG-ACDLFKMDLLHEGGNA 289

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             V  P+       A+   +Q+    A+W++ GW    +P       + LL++V    ++
Sbjct: 290 GDVSVPDAAR----AVEKALQTSRPGAIWVILGW--QSNP------RRDLLDAVDHDHML 337

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
           V+D  +++  I    K +  VPY +  + NF G   +              R    + +V
Sbjct: 338 VVDGLSDLDTITDREKDWGSVPYAFGTIPNFGGRTTIGAKTHMWTERFTVWRDKPGSKLV 397

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           G     E +E++P  Y+L SE+A++   VD  AW   Y+  RYG      ++A+  L  T
Sbjct: 398 GTAYMPEAVERDPAAYELFSELAWRDTAVDRDAWFRDYADVRYGARDAKAREAFAALRDT 457

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
            Y  +      +  V  A P       S+T     NY           + T ++D     
Sbjct: 458 AYQISSKDGRPHDSVFAARP-------SLTARSGTNYA----------THTPAFD----- 495

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
              +    AL   +     L  S+ YRYDL D  RQALA  + +L   + +AY   D   
Sbjct: 496 --PARFDAALAALLGVRAGLRDSDAYRYDLADTARQALANRSWQLIGQLADAYARKDLDT 553

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
              LSR +L+L+   D +   H   LLGPWLE AK++A   E+  Q+
Sbjct: 554 FRALSRLWLKLMRLSDDITGTHRLLLLGPWLEDAKRMASGAEESAQL 600


>gi|329851961|ref|ZP_08266642.1| alpha-N-acetylglucosaminidase NAGLU family protein [Asticcacaulis
           biprosthecum C19]
 gi|328839810|gb|EGF89383.1| alpha-N-acetylglucosaminidase NAGLU family protein [Asticcacaulis
           biprosthecum C19]
          Length = 731

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 135/504 (26%), Positives = 217/504 (43%), Gaps = 91/504 (18%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLP SW+ ++  LQK+IL  + ELGM P+LPAF+G VP A     P A+I 
Sbjct: 182 MGNIEGYQAPLPLSWIVKKRELQKRILGAMRELGMEPILPAFAGYVPKAFAESHPQARIY 241

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD  DPLF ++   F++   + YG+    Y  D F+E  PP
Sbjct: 242 RMRAWEGFHE------TYWLDPADPLFAKLAGRFLDLYDQTYGK-GRFYLADAFNEMLPP 294

Query: 121 V-DSP------------------------EYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 155
           V D P                        E +++ G  ++  ++S   DAVW+MQGWLF 
Sbjct: 295 VGDGPVEGGYGDSTANKEAVAEVDPAVKAERLAAYGQRLHDSIRSARPDAVWVMQGWLFG 354

Query: 156 YDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNI 213
            D  FW    + A L +VP   L+VLD+  +  P +  T++ F+G  +I+  +HN+  + 
Sbjct: 355 ADQGFWTGDAIAAFLRNVPDDGLMVLDIGNDRYPKVRQTAQAFHGKGWIYGYVHNYGASN 414

Query: 214 EMYGILD-------SIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEK 266
            +YG L        +I   P   R      + G G+  EG++ N +VY  + ++A+    
Sbjct: 415 PIYGDLGFYRRDMAAITSDPARGR------LQGFGVFPEGLDSNSIVYAYLYDLAWNGGT 468

Query: 267 VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAF-----PDVD 321
             +  W+  Y+  RYG S P +  AW  +   VY          R    A+     PD+ 
Sbjct: 469 KSLSDWLAGYTRARYGISSPEVVTAWLDIVKGVYGTRYWTPRWWRSTAGAYLLCKRPDIA 528

Query: 322 PSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS 381
            +      G        +++ A ++     +D P L                        
Sbjct: 529 MADFEGAPGDRAALRAGLARLAAIR-----HDSPLL------------------------ 559

Query: 382 NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLS---RRFLELVEDMDGLLA 438
              RYD+I+ TR   + + + L    + AY+  D     + +   RR    ++D+ G   
Sbjct: 560 ---RYDVIEFTRHLASLHLDNLIRTALVAYRDGDVAAGDRSATEVRRVTIAIDDLMGAQP 616

Query: 439 CHDGFLLGPWLESAKQLAQNEEQE 462
           CH    L  W+E A+       ++
Sbjct: 617 CH----LAGWIEQARAYGDTATEK 636


>gi|229818803|ref|YP_002880329.1| alpha-N-acetylglucosaminidase [Beutenbergia cavernae DSM 12333]
 gi|229564716|gb|ACQ78567.1| Alpha-N-acetylglucosaminidase [Beutenbergia cavernae DSM 12333]
          Length = 751

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 133/408 (32%), Positives = 202/408 (49%), Gaps = 33/408 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M +   +GGPLP SW +++  L ++IL R  ELGM  VLPAF G+VP  L      A+  
Sbjct: 194 MGSTSSFGGPLPDSWFERRAELGRRILERQRELGMRAVLPAFGGHVPDGLGA---GARTH 250

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G  FS         T LL   D  F  +   F  QQ + +G T H+Y  D F E+ PP
Sbjct: 251 WQG--FS---------TALLGPDDDAFAVVAAEFARQQRELFG-TDHLYAADPFIESVPP 298

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
              PE +++  AA Y+GM + D +A W+MQ W F Y   FW   ++ A+ ++VP  +L++
Sbjct: 299 SGEPEDLAAFAAATYAGMSAADPEATWVMQAWPFHYHRRFWTAERIAAVTDAVPRDRLLL 358

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA--FGPVEARTSENTTM 237
           LDL+AE  P+W   +      ++WC +HNF G   ++G L  +A   G V    +     
Sbjct: 359 LDLWAEHAPVWDDGRGIAEHQWLWCAVHNFGGRFSVHGDLHGLARDLGGVLDDGARTGGF 418

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-----RSVPAIQDAW 292
            GVGM+ME +E NPV Y+L++++ +  E+ DV AW+ ++  +RYG      +  A+  AW
Sbjct: 419 TGVGMAMEALENNPVFYELLTDLVW--ERPDVDAWVGRFVDQRYGFADGTAARDAVHGAW 476

Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPD--VDPSIISVTEGKYQNYGKPVSKEAVLKSETS 350
            +L  T+Y    G T      ++A P   V P       G++ +   PV   A + +E  
Sbjct: 477 AILLRTLYGP--GMTRSIPSPVIARPADVVAPFHTQRLAGEFLDPDAPVIVSANIDAEAD 534

Query: 351 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAK 398
               P +     E+ RA  L   +     A     +DL DL    +A+
Sbjct: 535 ----PRVEGDLPEIARAAALLREAAGSSDAGGPLAHDLADLLTHVVAQ 578


>gi|383643231|ref|ZP_09955637.1| N-acetylglucosaminidase [Sphingomonas elodea ATCC 31461]
          Length = 778

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 130/490 (26%), Positives = 205/490 (41%), Gaps = 70/490 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PL  +W++++ VLQ++IL R+  LGM P+LPAFSG VP A     P AKI 
Sbjct: 209 MGNIAGYRAPLSANWIEKKRVLQRQILARMRSLGMKPILPAFSGYVPEAFAKAHPEAKIY 268

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           Q+  W           TY LD +DPLF  +   F++     YG     Y  D F+E  PP
Sbjct: 269 QMRQWEGFPG------TYWLDPSDPLFARLAARFLQLYTATYG-PGEYYLADAFNEMVPP 321

Query: 121 VDS-------------------------PEYI-----SSLGAAIYSGMQSGDSDAVWLMQ 150
           +                           P+ +     ++ G  +Y  + +   +A W+MQ
Sbjct: 322 IAEDGSDARAATYGDAIANTAATRAAALPKEVRDARLAAYGERLYRSITAAAPNATWVMQ 381

Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  FW P  + A L+ VP  ++++LD+  +  P IW+ ++ FYG  + +  +HN
Sbjct: 382 GWLFGADKAFWTPDAIAAFLSKVPDERMLILDIGNDRYPGIWNATRAFYGKGWAYGYVHN 441

Query: 209 FAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 267
           + G+  +YG L         A  +  +  M G G+  EG+  N + Y    ++A+     
Sbjct: 442 YGGSNPVYGDLAFYRSDITAALANPGHGRMRGFGLFPEGLHSNGIAYAYAYDLAWGEIDA 501

Query: 268 DVK-----AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 322
             K     AWI  Y+  RYG++ PA+  AW+      Y                     P
Sbjct: 502 TGKARPLDAWIGDYTRARYGKTSPALVAAWDKAIAGAYTTR---------------YWTP 546

Query: 323 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 382
                  G Y  +  P    A         D+P      + +   +E  +A   +     
Sbjct: 547 RWWHEQAGGYLFFKFPSLDGA---------DYPAAPGDPAALRAGIEALLAQAPQHGGEP 597

Query: 383 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 442
            Y YD++DL R   +   ++     + AY+  D     + +     L   +D  LA +  
Sbjct: 598 LYTYDVVDLVRHYASVQLDDRLKTAVAAYKAGDLAAGDRATAAAERLARHIDA-LAGNQQ 656

Query: 443 FLLGPWLESA 452
             LG WL  A
Sbjct: 657 ETLGSWLADA 666


>gi|257067709|ref|YP_003153964.1| Alpha-N-acetylglucosaminidase (NAGLU) [Brachybacterium faecium DSM
           4810]
 gi|256558527|gb|ACU84374.1| Alpha-N-acetylglucosaminidase (NAGLU) [Brachybacterium faecium DSM
           4810]
          Length = 768

 Score =  172 bits (435), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 99/323 (30%), Positives = 156/323 (48%), Gaps = 22/323 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M   H  G  L    L+ +  L ++I  R  ELGM  VLP F G +PA L     + ++ 
Sbjct: 201 MGITHDLGAALTDEALEARAELGRRIAERERELGMTVVLPGFGGQLPAELVG---TERMI 257

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
               W +  + P           DPLF E   +    Q +  G T H Y  D + E+ PP
Sbjct: 258 DWQGWHNALAAP----------GDPLFAEAAASLHRHQRQLLG-TDHHYAVDPYIESLPP 306

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
             SP+ ++    AI++ M+  D  AVW++QGW F Y   +W   ++ +LL+ VP  +L++
Sbjct: 307 TTSPQQLAEHAEAIFTAMRDADPQAVWILQGWPFHYRAAYWTEERVHSLLSRVPEDRLIL 366

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGIL----DSIAFGPVEARTSENT 235
           LDL+ E  P+W  +   YG  ++WC+ H F G   ++G L    D +      A      
Sbjct: 367 LDLWGEHAPMWHRTAAMYGRRWLWCLAHTFGGRFGLFGDLAALDDDLRGLRTAAEAGTRG 426

Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
            + G G++ E ++ N VVY+L +  A        + W+ ++ +RRYG + P +Q AW V+
Sbjct: 427 RLEGFGITSEALDDNAVVYELATR-ALWSPMPPRERWLEEHIIRRYGTAAPEVQQAWQVI 485

Query: 296 YHTVYNCTDGATDKNRDVIVAFP 318
            HT+Y    G T      ++A P
Sbjct: 486 AHTLYGP--GRTRSTPSPLIARP 506


>gi|429198382|ref|ZP_19190217.1| alpha-N-acetylglucosaminidase (NAGLU) [Streptomyces ipomoeae 91-03]
 gi|428665917|gb|EKX65105.1| alpha-N-acetylglucosaminidase (NAGLU) [Streptomyces ipomoeae 91-03]
          Length = 747

 Score =  171 bits (432), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 119/466 (25%), Positives = 214/466 (45%), Gaps = 39/466 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + NL  +  P+ +  LD +  L ++I+ R+ ELGM PV P + G VP       P A+  
Sbjct: 220 LQNLSSFPSPVSRQLLDARAALGRRIVGRLRELGMTPVFPGYFGTVPPGFAERNPGARTV 279

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G+W    + P W     LD     F  +  AF   Q + +G  S +Y  D   E   P
Sbjct: 280 PQGDWMGF-ARPDW-----LDPRTNEFKRVAAAFYRAQDELFGGPSTLYKMDLLHEGGDP 333

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  ++     +   +++   DA W++ GW  +      PP  +A++++V   +++V+
Sbjct: 334 GDVP--VADAAKGVERALRAAHPDATWVILGWQHN------PP--RAIVDAVDKKRMLVV 383

Query: 181 DLFAEVKPIWSTSKQFYG-VPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           D  ++  P     +  +G  PY +  + NF G+  +       A    + RT + + + G
Sbjct: 384 DGLSDRFPTVIDREADWGDTPYAFGSIWNFGGHTALGANTPVWAELYEKWRTKDGSKLRG 443

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           + +  E  + NP  + L SE+A++ +++D+K W ++++  RYG   P  + AW++L  T 
Sbjct: 444 IALMPEAADNNPAAFALFSELAWRKDELDLKTWFSEWAHARYGARDPHAEAAWDILRRTA 503

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y    G T  +R                +EG    +G   S+ A+     + +    L Y
Sbjct: 504 Y----GTTRADR---------------WSEGADGLFG---SRPALNTVRAARWSPKQLRY 541

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
             +E   AL   ++    L +S+ YR DL+D+ RQ L+  +  L   I  AY   D    
Sbjct: 542 DAAEFEPALGELLSVRPGLRSSSAYRRDLLDVARQTLSNRSRVLLPRIRGAYDARDTARF 601

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
            +L+  +L L++ +D LLA     LLG W+  A+    ++ + +++
Sbjct: 602 DELTGTWLSLMDLLDRLLATDSAHLLGRWVADARAWGASDAERERL 647


>gi|333023613|ref|ZP_08451677.1| putative alpha-N-acetylglucosaminidase [Streptomyces sp. Tu6071]
 gi|332743465|gb|EGJ73906.1| putative alpha-N-acetylglucosaminidase [Streptomyces sp. Tu6071]
          Length = 741

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 128/476 (26%), Positives = 219/476 (46%), Gaps = 48/476 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + NL  +  P+    ++Q+  L  +I+ R+ ELGM+PVLP + G VPA   +  P AK  
Sbjct: 215 LQNLSSFPEPVTARLIEQRAALGARIVGRLRELGMSPVLPGYFGTVPAGFADRNPGAKTV 274

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W    + P W     LD    LF E+  AF E Q + YGR + +Y  D   E    
Sbjct: 275 PQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEIQEELYGRGT-LYKMDLLHEGGSA 327

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            + P   ++ G  +   +++   DAVW++ GW  +      PP  K ++ +     ++V+
Sbjct: 328 GNVPVGDATRG--VQRALRAARPDAVWVILGWQKN------PP--KEVVAAADREAMLVV 377

Query: 181 D----LFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM-YGILDSIAFGPVEARTSENT 235
           D     F+EV         + G PY +  + NF G+  +     D +   P   R    +
Sbjct: 378 DGLSDRFSEVN---DRESDWQGTPYAFGSIWNFGGHTALGANARDWVDLYP-RWRDRSGS 433

Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
            + G+ +  E  + NP  ++L +E+ +    VD+  W  +Y+  RYG S    + AW++L
Sbjct: 434 RLSGIALMPEAADNNPAAFELFAELPWTEGPVDLTDWFREYARVRYGGSDAHAEAAWDIL 493

Query: 296 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
             T Y       D+  +         P++ +V+ GK+            L+   +S++  
Sbjct: 494 RTTAYGTRR--DDRWSEPADGLFGARPALDAVSAGKW--------SPKALRYPAASFEP- 542

Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
                      AL+  +A   EL  S TYR DL+D+ RQALA  +  L   +  AYQ  +
Sbjct: 543 -----------ALDELLAVRAELRDSATYRRDLLDVARQALANRSRTLLPRLAAAYQAKN 591

Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVRCPYVS 471
                +L RR++ L++ ++ L+A  +  LLG W+ESA+    +  ++ Q++   +S
Sbjct: 592 QAEFARLGRRWIALMDLLEQLVATDENHLLGRWVESARAWGGSAREKSQLQYDALS 647


>gi|456388164|gb|EMF53654.1| alpha-N-acetylglucosaminidase [Streptomyces bottropensis ATCC
           25435]
          Length = 732

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 121/466 (25%), Positives = 207/466 (44%), Gaps = 40/466 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + NL  +  P+ +  LD + VL ++I  R+ ELGM PV P + G VPA      P A+  
Sbjct: 206 LQNLSAFPSPVSRQLLDARAVLGRRIADRVRELGMTPVFPGYFGTVPAGFAERVPGARTV 265

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W    + P W     LD     F  +  AF   Q + +G +S +Y  D   E   P
Sbjct: 266 PQGEWMGF-ARPDW-----LDPRTDDFARVAAAFYRVQEEMFGPSS-LYKMDLLHEGGDP 318

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  ++     +   ++     A W++ GW  +      PP  +A++++V    ++V+
Sbjct: 319 GDVP--VADAAKGVERALRRSRPGATWVILGWQHN------PP--RAIVDAVDKQHMLVV 368

Query: 181 DLFAEVKPIWSTSKQFYG-VPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           D  ++  P  +  +  +G  PY +  + NF G+  +       A    + RT + + + G
Sbjct: 369 DGLSDRFPTVTDREADWGDTPYAFGSIWNFGGHTALGANTPDWAALYEKWRTKDGSRLHG 428

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           + +  E  + NP  + L SE+A++  ++D+K W  +++  RYG   P  + AW++L  T 
Sbjct: 429 IALMPEAADNNPAAFALFSELAWREGELDLKTWFAEWAHARYGGRDPHAEAAWDILRRTA 488

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y  T                      S +EG    +G   S+ A+       +    L Y
Sbjct: 489 YGTT-------------------RADSWSEGADGLFG---SRPALNAVRAGRWSPKQLRY 526

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
             ++   AL   +    EL AS+ YR DL+D+ RQAL+  +  +   I  AY   DA  +
Sbjct: 527 DAADFEPALGEMLRVRPELRASSAYRRDLLDVARQALSNRSRVMLPQIKAAYDAKDATRL 586

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
              SR +L L++ +D L+A     LLG W+  A+       +  ++
Sbjct: 587 AAASRDWLSLMDLLDELVATDSRHLLGRWVADARSWGAGAAERTEL 632


>gi|453051703|gb|EME99203.1| alpha-N-acetylglucosaminidase [Streptomyces mobaraensis NBRC 13819
           = DSM 40847]
          Length = 763

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 120/474 (25%), Positives = 204/474 (43%), Gaps = 44/474 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+  +GGP+ ++ LD++  L ++I  R+ ELG+ PVLP ++G VP         A+  
Sbjct: 239 MQNMSAFGGPVSRALLDRRTALAQRITRRLRELGITPVLPGYAGTVPPDFTRRNKGARTV 298

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G+W      P W     LD     F  + R +   Q + YG  S +Y  D   E   P
Sbjct: 299 PQGDWAGFPR-PDW-----LDPRTAHFARVARTYYRVQRELYG-ASSMYKIDLLHEGGTP 351

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
              P  + +   A+   +++   DA W + GW  +          + +L++V   K++VL
Sbjct: 352 --GPVPVGAAAKAVEKALRAAHPDATWAILGWQTN--------PRREILDAVDRSKMLVL 401

Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           D   +  P +    K + G PY +  + NF G+  M              RT + + + G
Sbjct: 402 DGIPDHYPRVTDREKDWGGTPYAFGTIWNFGGHTAMGANTQDWVSLFHRWRTKKGSALRG 461

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           + +  E  + NP    L S++A+   ++D+K W  ++ V+RYG + P  + AW+VL  T 
Sbjct: 462 IALMPEAADNNPAALALFSDLAWTEGRLDLKDWFARWPVQRYGAADPNARRAWDVLRRTA 521

Query: 300 YNCT--DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           Y  T  DG ++    +  A PD+                  V++ A        YD    
Sbjct: 522 YGTTRADGWSEAADGLFGARPDL-----------------AVNRAAAWSPRQLRYD---- 560

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
               +    AL   +A    L  S+ YR DL D+ RQ ++  +  L   I  AY   D  
Sbjct: 561 ---AAAFDEALPALLAVAPALRGSSAYRCDLTDVARQCVSNRSRLLLPRIKAAYDAGDRT 617

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVRCPYVS 471
               L+R++L+ +  ++  +A  +  LLG W+  A+       +  ++    VS
Sbjct: 618 RFRTLTRQWLDWMTLLEETVATSERHLLGRWIAEARAWGGTAAERDRLEHDAVS 671


>gi|318057780|ref|ZP_07976503.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SA3_actG]
          Length = 741

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 124/473 (26%), Positives = 219/473 (46%), Gaps = 42/473 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + NL  +  P+    ++Q+  L  +I+ R+ ELGM+PVLP + G VPA   +  P AK  
Sbjct: 215 LQNLSSFPEPVTARLIEQRAALGARIVGRLRELGMSPVLPGYFGTVPAGFADRNPGAKTV 274

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W    + P W     LD    LF E+  AF E Q + YGR + +Y  D   E    
Sbjct: 275 PQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEIQEELYGRGT-LYKMDLLHEGGSA 327

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            + P   ++ G  +   +++   DAVW++ GW  +      PP  K ++ +     ++V+
Sbjct: 328 GNVPVGDATRG--VQRALRAARPDAVWVILGWQKN------PP--KEVVAAADREAMLVV 377

Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEM-YGILDSIAFGPVEARTSENTTMV 238
           D  ++  P +      + G PY +  + NF G+  +     D +   P   R    + + 
Sbjct: 378 DGLSDRFPEVNDRESDWQGTPYAFGSIWNFGGHTALGANTRDWVDLYP-RWRDRSGSRLS 436

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           G+ +  E  + NP  ++L +E+ +    VD+  W  +Y+  RYG S    + AW++L  T
Sbjct: 437 GIALMPEAADNNPAAFELFAELPWTEGPVDLTDWFREYARVRYGGSDAHAEAAWDILRTT 496

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
            Y       D+  +         P++ +V+ GK+            L+   +S++     
Sbjct: 497 AYGTRR--DDRWSEPADGLFGARPALDAVSAGKW--------SPKALRYPAASFEP---- 542

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
                   AL+  ++   EL  S TYR DL+D+ RQALA  +  L   +  AY+  +   
Sbjct: 543 --------ALDELLSVRAELRDSATYRRDLLDVARQALANRSRTLLPRLAAAYKAKNQAE 594

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVRCPYVS 471
             +L RR++ L++ ++ L+A  +  LLG W+ESA+    +  ++ Q++   +S
Sbjct: 595 FARLGRRWIALIDLLEQLVATDENHLLGRWVESARAWGGSAREKNQLQYDALS 647


>gi|318078904|ref|ZP_07986236.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SA3_actF]
          Length = 719

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 124/473 (26%), Positives = 219/473 (46%), Gaps = 42/473 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + NL  +  P+    ++Q+  L  +I+ R+ ELGM+PVLP + G VPA   +  P AK  
Sbjct: 193 LQNLSSFPEPVTARLIEQRAALGARIVGRLRELGMSPVLPGYFGTVPAGFADRNPGAKTV 252

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W    + P W     LD    LF E+  AF E Q + YGR + +Y  D   E    
Sbjct: 253 PQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEIQEELYGRGT-LYKMDLLHEGGSA 305

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            + P   ++ G  +   +++   DAVW++ GW  +      PP  K ++ +     ++V+
Sbjct: 306 GNVPVGDATRG--VQRALRAARPDAVWVILGWQKN------PP--KEVVAAADREAMLVV 355

Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEM-YGILDSIAFGPVEARTSENTTMV 238
           D  ++  P +      + G PY +  + NF G+  +     D +   P   R    + + 
Sbjct: 356 DGLSDRFPEVNDRESDWQGTPYAFGSIWNFGGHTALGANTRDWVDLYP-RWRDRSGSRLS 414

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           G+ +  E  + NP  ++L +E+ +    VD+  W  +Y+  RYG S    + AW++L  T
Sbjct: 415 GIALMPEAADNNPAAFELFAELPWTEGPVDLTDWFREYARVRYGGSDAHAEAAWDILRTT 474

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
            Y       D+  +         P++ +V+ GK+            L+   +S++     
Sbjct: 475 AYGTRR--DDRWSEPADGLFGARPALDAVSAGKW--------SPKALRYPAASFEP---- 520

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
                   AL+  ++   EL  S TYR DL+D+ RQALA  +  L   +  AY+  +   
Sbjct: 521 --------ALDELLSVRAELRDSATYRRDLLDVARQALANRSRTLLPRLAAAYKAKNQAE 572

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVRCPYVS 471
             +L RR++ L++ ++ L+A  +  LLG W+ESA+    +  ++ Q++   +S
Sbjct: 573 FARLGRRWIALIDLLEQLVATDENHLLGRWVESARAWGGSAREKNQLQYDALS 625


>gi|302526099|ref|ZP_07278441.1| alpha-N-acetylglucosaminidase [Streptomyces sp. AA4]
 gi|302434994|gb|EFL06810.1| alpha-N-acetylglucosaminidase [Streptomyces sp. AA4]
          Length = 860

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 117/472 (24%), Positives = 206/472 (43%), Gaps = 48/472 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQN------VF 54
           + N+  + GP+    LD ++ + KK++ R+ +LGM PVLP + G VP    +        
Sbjct: 200 LQNMASFTGPVSPQLLDARVAMAKKVITRLKDLGMTPVLPGYFGTVPRGFADKSKKADAS 259

Query: 55  PSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTF 114
             A++   G W      P W     LD     + ++  AF + Q   +G TS +Y  D  
Sbjct: 260 SDARVIGQGTWVGFDR-PDW-----LDPRTSSYRKVAAAFYQAQHDLFGDTS-MYKMDLL 312

Query: 115 DENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
            E     D P  +      + + +Q+    A W++ GW  +      PP  +A++++V  
Sbjct: 313 HEGGKSGDVP--VGDAARGVMTALQTARPGATWVLLGWQNN------PP--RAIVDAVDK 362

Query: 175 GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN 234
            KL V+D  ++         Q+   PY +  ++NF G+  +            + RT + 
Sbjct: 363 SKLFVVDGLSDRYGQRDPDSQWNNTPYAFGTIYNFGGHTTIGANTGVWTQRFPQWRTKQG 422

Query: 235 TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNV 294
           + + G+    EG   NP  ++L +E+A++   +   AW   Y+ RRYG        AW++
Sbjct: 423 SALTGIAYLPEGTGTNPAAFELFTELAWRQTPIHQAAWFADYASRRYGGPDTRAATAWDL 482

Query: 295 LYHTVYNC-TDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
           L  T Y+    G ++    +  A P++D                         +  +++ 
Sbjct: 483 LRQTAYSMPASGWSEAQDSLYAARPNLD------------------------AATAATWS 518

Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
              L Y  +   +AL+  +     L  ++ YR+DL+D+ RQAL   +  L   I  AY  
Sbjct: 519 PASLRYQQATFGKALDELLNVDPALRGTDAYRFDLVDVARQALTNTSRTLLPQIKTAYTN 578

Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
            D      L+ R++  +  +D LLA    FLLGPWLE+AK  A  + ++ ++
Sbjct: 579 RDRTQFTTLTSRWMSNMTLLDKLLATDSRFLLGPWLEAAKSWAGTDTEQARL 630


>gi|29828556|ref|NP_823190.1| alpha-N-acetylglucosaminidase [Streptomyces avermitilis MA-4680]
 gi|29605660|dbj|BAC69725.1| putative alpha-N-acetylglucosaminidase [Streptomyces avermitilis
           MA-4680]
          Length = 728

 Score =  167 bits (424), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 119/466 (25%), Positives = 204/466 (43%), Gaps = 39/466 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + NL  +  P+ Q  LD +  L ++I  R+ ELGM PV P + G VP    +    A   
Sbjct: 201 LQNLSAFPDPVSQQLLDARAALGRRIANRLRELGMTPVFPGYFGTVPPGFADRNAGAHTV 260

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W    + P W     LD     F  +  AF   Q + +G  S  Y  D   E   P
Sbjct: 261 PQGTWMGF-ARPDW-----LDPRTEHFTRVAAAFYRIQDEMFGGASTRYKMDLLHEGGSP 314

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  +      +   +++    AVW++ GW  +      PP  +A++++V   +++V+
Sbjct: 315 GDVP--VGDAAKGVERALRAAHPGAVWVILGWQHN------PP--RAIVDAVDKDRMLVV 364

Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           D   +  P +      ++G PY +  + NF G+  +       A      RT   +T+ G
Sbjct: 365 DGLCDRFPKVTDREADWHGTPYAFGSIWNFGGHTTLGANTPDWASLYERWRTRPGSTLRG 424

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           V +  E  + NP  + L SE+A++   +D++AW  +++  RYG   P  + AW++L  T 
Sbjct: 425 VALLPEAADNNPAAFALFSELAWREGDLDLRAWFARWARSRYGGRDPHAEAAWDILRRTA 484

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y  T                      S +EG    +G   ++ ++  ++ +S+    L Y
Sbjct: 485 YGTT-------------------RADSWSEGADGLFG---ARPSLAATKAASWSPKRLRY 522

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              E   AL   +     L  S+ YR DL+D+ RQAL+  +  L   I  AY+  D    
Sbjct: 523 RPEEFEPALGELLKVRPGLRGSSAYRRDLLDVARQALSNRSRVLLPQIRTAYEAKDTARF 582

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
            +L+  +L L++ ++ LLA     LLG W+  A+    +  +  ++
Sbjct: 583 DRLTGVWLALMDLLEALLATDSRHLLGRWVADARAWGASAAERDRL 628


>gi|374985456|ref|YP_004960951.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
 gi|297156108|gb|ADI05820.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
          Length = 1039

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 126/470 (26%), Positives = 197/470 (41%), Gaps = 48/470 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + N+ G+GGP+ +  +DQ+  L KKI+ R+ ELGM PVLP + G VP       P A + 
Sbjct: 214 LQNMSGFGGPVSKHLIDQRAALAKKIINRVRELGMTPVLPGYYGTVPDDFLAKNPGASLV 273

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W + K  P W     LD    LF E+  AF   Q + YG +S +Y  D   E   P
Sbjct: 274 AQGTWGAFKR-PDW-----LDPRTDLFAEVAAAFYRHQRERYGDSS-MYKMDLLHEGGNP 326

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  +     A+ + +Q   + AVW + G        W+    + +L +V    ++V+
Sbjct: 327 GDVP--VGEAAKAVEAALQKAHAGAVWAILG--------WQTNPSREILGAVDKSMMLVV 376

Query: 181 DLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           D  ++    +      + G PY +  + NF G+  +            + R    + + G
Sbjct: 377 DGLSDRYTTVIDRESDWDGTPYAFGSIWNFGGHTPIGANAPDWVEQYPKWRDKTGSALTG 436

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           + M  EG + NP    L +++A+    + +  W   Y+V RYG   P    AW  +  T 
Sbjct: 437 IAMMPEGADNNPAAMALFTDLAWTPGAIGLDDWFASYAVSRYGGEDPHAVAAWKAIRDTA 496

Query: 300 YNCTDGATDKNRDVIVAFPD----VDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
           YN +        D     PD      PS+               +K A    E   YD  
Sbjct: 497 YNMS------RADAWSEAPDGLFGARPSL-------------GANKAAAWGPEADRYD-- 535

Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
                T+    AL   +     L  S+ Y YDL D+ RQ L+  +  L   I  AY+  D
Sbjct: 536 -----TTAFDAALTELLQVAPGLRDSSAYAYDLADVARQVLSNRSRVLLPQIKTAYEAGD 590

Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
                +L++ +L  ++ MD +LA     LLG WL  A+       ++ Q+
Sbjct: 591 RGRFDRLTKTWLSWMKLMDKVLATSGQHLLGRWLADARSWGATRAEKDQL 640


>gi|290956360|ref|YP_003487542.1| alpha-N-acetylglucosaminidase [Streptomyces scabiei 87.22]
 gi|260645886|emb|CBG68977.1| putative alpha-N-acetylglucosaminidase [Streptomyces scabiei 87.22]
          Length = 732

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 124/466 (26%), Positives = 209/466 (44%), Gaps = 40/466 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + NL G+  P+ +  LD + VL ++I  R  ELGM PV P + G VPA      P A+  
Sbjct: 206 LQNLSGFPSPVSRQLLDARAVLGRRIADRARELGMIPVFPGYFGTVPAGFAERVPGARTV 265

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W    + P W     LD     F  +  AF   Q + +G  S +Y  D   E   P
Sbjct: 266 PQGRWMGF-ARPDW-----LDPRTDEFARVAAAFYRTQDEMFG-PSALYKMDLLHEGGDP 318

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  ++     +   +Q     A W+M GW  +      PP  +A++++V    ++V+
Sbjct: 319 GDVP--VADAAKGVERALQRAHPGATWVMLGWQHN------PP--RAIVDAVDKQHMLVV 368

Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           D  ++  P +      + G PY +  + NF G+  +       A    + RT + +T+ G
Sbjct: 369 DGLSDRFPTVTDREADWGGTPYAFGSIWNFGGHTALGANTPDWAALYEKWRTKDGSTLHG 428

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           + +  E  + NP  + L SE+A++  ++D++ W  +++  RYG   P  + AW++L  T 
Sbjct: 429 IALMPEAADNNPAAFALFSELAWREGELDLETWFAEWAHARYGARDPHAEAAWDILRRTA 488

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y  T                      S +EG    +G   S+ A+       +    L Y
Sbjct: 489 YGTT-------------------RADSWSEGADGLFG---SRPALTAVRAGRWSPKQLRY 526

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
           + ++   AL   +    EL AS+ YR DL+D+ RQAL+  +  +   +  AY   DA  +
Sbjct: 527 NAADFEPALGEMLKVRPELRASSAYRRDLLDVARQALSNRSRVMLPQLKAAYDAKDAARL 586

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
            + SR +L L++ +D L+A     LLG W+  A+  A    +  ++
Sbjct: 587 AKGSRDWLSLMDLLDELVATDSRHLLGRWVADARSWAVGSTERTEL 632


>gi|409097333|ref|ZP_11217357.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Pedobacter agri
           PB92]
          Length = 724

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 127/471 (26%), Positives = 202/471 (42%), Gaps = 39/471 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQN---VFPSA 57
           M NL GWGG      +  Q  +QKK+L R+ EL ++P+L  F G VP  L         A
Sbjct: 195 MGNLEGWGGTNSLQLMQLQSNIQKKVLSRMKELEIDPILQGFYGMVPHDLNKKVAALKDA 254

Query: 58  KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
           +I   GNW   +    +    +L  T+  F  +   +  +  K YG     +  + F E 
Sbjct: 255 QIIDQGNWVFTE----FIRPAILAPTNDKFNTVADVYYSELKKLYGSDIKFFGGEPFHEG 310

Query: 118 TPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
                VD    I+++  ++   MQ    ++ W++QGW        +     ALL  +   
Sbjct: 311 GKKGGVD----ITAVAKSVQDVMQKNFPNSTWVLQGW--------QNNPADALLAGLKKE 358

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE-N 234
             ++++LF E    W   K + G  +IW  + NF     +YG L          + S   
Sbjct: 359 NTLIIELFGENTSNWEQRKGYGGTNFIWSNVSNFGEKNGLYGRLQRFLDEVYRIKQSPYK 418

Query: 235 TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNV 294
             + GVG+  EGI  NPV YDLM ++A+++EK  +  WI  Y+  RYG     + DAW V
Sbjct: 419 DYLKGVGIIPEGINNNPVAYDLMLDIAWRNEKPPLDKWITDYTTYRYGSYNKDVADAWKV 478

Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
              TVY+      +K + V    P         +E  Y       ++ ++  +  SS+  
Sbjct: 479 FTETVYSSP--VNEKGKIVYQEGP---------SESIY------CARPSLKVNPVSSWGT 521

Query: 355 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
               Y T    +A+ LFI +  +   S TY+ D  D  RQ +A   ++ +  +I A Q  
Sbjct: 522 RKRNYDTKLFKQAVALFIKAETQFKNSETYQTDKTDFLRQVMADKGDQAYDELINAIQAK 581

Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
           D + + +    FL ++   D LL  +  F L  WL  A  L +     K +
Sbjct: 582 DKNAIKEKGNHFLTMILQQDSLLNNNHFFTLNRWLNQAVALGKGLPDAKNI 632


>gi|255079272|ref|XP_002503216.1| GH family 89 protein [Micromonas sp. RCC299]
 gi|226518482|gb|ACO64474.1| GH family 89 protein [Micromonas sp. RCC299]
          Length = 1260

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 144/510 (28%), Positives = 210/510 (41%), Gaps = 133/510 (26%)

Query: 58  KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
           K  +LG +   K D      + LD +D LF  +G AF +Q ++++G T H+Y  DTF E 
Sbjct: 396 KDAELGKY--AKKDDSVRSVHFLDPSDALFQSLGAAFTKQLVEDFG-TDHLYLADTFREI 452

Query: 118 TPPVD--SPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPL 174
             P D  S  ++  +GAA  + M+S D  A W+ Q   F  +P FW   +  ALL SV +
Sbjct: 453 RDPNDDFSETHVVRVGAATLAAMRSADPRATWVFQSDAFRRNPRFWNEGRRGALLRSVDI 512

Query: 175 GKLVVLDLFAEVKPIW-STSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA---- 229
           G ++VLD  AE  P +      F G P++WC+ HN  GN+ M G L +IA GP  A    
Sbjct: 513 GDMLVLDSAAETDPYYLREPVHFAGQPFVWCVKHNHGGNLGMRGRLSAIATGPAAAMDSL 572

Query: 230 -----------------------------RTSENTT----------MVGVGMSMEGIEQN 250
                                        R S   T          +VG G++ EG+EQN
Sbjct: 573 ASRRDGERGTTHGRGTRVGSSRRMLADNKRVSREATHGSRKVGKSQLVGFGITAEGVEQN 632

Query: 251 PVVYDLMSEMAFQHEKVDVKAWINQ--------YSVRRYGRSV----------------- 285
           PVVY+L +  +   + VDV  +++         YSVR+   +                  
Sbjct: 633 PVVYELAALTSQSEKGVDVDWFLSDYSRRRYGGYSVRQPAPTTLPVGTGQGAFLAGFIVG 692

Query: 286 ------------------PA-------------IQDAWNVLYHTVYNCTDGATDKN--RD 312
                             PA              ++AW +L  TVY       D++  RD
Sbjct: 693 NNPIAGSPGYLGPGEWYDPAKHGEMGKEEAYDRAREAWEILGKTVYGARAKGEDEDHVRD 752

Query: 313 VIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI 372
                P +    +S     Y +  K V                   Y+   +I A     
Sbjct: 753 ACSWQPSLRADELSP---DYFDAAKVVD------------------YAFKPLIDAAPTLR 791

Query: 373 ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVED 432
           A+G    A     YD++D+ RQ LA+ +N L   I ++   N+A        + LEL++D
Sbjct: 792 ANG----AGTRVDYDIVDVGRQLLARQSNVLATQIRDSLNSNNASEAKMYGTQMLELLDD 847

Query: 433 MDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
           MD LL  H GFLLG ++ESAK  A    +E
Sbjct: 848 MDALLRSHKGFLLGNYIESAKSWAGKRNKE 877



 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 38/61 (62%)

Query: 7   WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWF 66
           W G  P+ WL +Q  LQ+  +  + + GM PVLP F+G+VP A+   FP AK+ ++ NW 
Sbjct: 217 WTGGRPKKWLKRQWDLQRDAVKLMRDFGMTPVLPGFNGHVPPAIARRFPEAKLRRVENWL 276

Query: 67  S 67
           +
Sbjct: 277 T 277


>gi|398786493|ref|ZP_10549210.1| alpha-N-acetylglucosaminidase [Streptomyces auratus AGR0001]
 gi|396993639|gb|EJJ04702.1| alpha-N-acetylglucosaminidase [Streptomyces auratus AGR0001]
          Length = 1048

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 115/465 (24%), Positives = 195/465 (41%), Gaps = 41/465 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + N+ G+GGP     + ++  L ++I  R+ ELGM+PVLP + G VP       P A+  
Sbjct: 206 LQNMSGYGGPTSSELIAKRAELGQRITGRLRELGMHPVLPGYFGTVPGGFAARNPGARTV 265

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W  + + P W     LD    +F +   AF   Q    G   H +  D   E   P
Sbjct: 266 PQGTWSGL-ARPDW-----LDPRTEVFAKTAAAFYRHQEHLLGPADH-FKMDLLHEGGDP 318

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D P  +     A+   +++    A W++ GW  +          + LL++V   +++++
Sbjct: 319 GDVP--VPDAARAVEKALRTARPGATWVILGWQNN--------PRRDLLDAVDHDRMLIV 368

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           D  ++++ +    + + GVPY +  + NF G   +       A      R    + + G 
Sbjct: 369 DGLSDLETVTDRERDWGGVPYAFGSIPNFGGRTTIGAKTHVWAERFPAWRDKPGSRLAGT 428

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
               E  E++P  ++L SE+A++   VD  AW + Y+  RYG      + A+  L  + Y
Sbjct: 429 AYMPEAAERDPAAFELFSELAWRERPVDRAAWFDGYADLRYGARDKGARAAFAALGTSAY 488

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
             +      +  V  A PD+                       V  + T ++D       
Sbjct: 489 EISSKDGRPHDSVFAARPDLA-----------------ARSGTVYATHTPAFD------- 524

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
            +    A    +     L  S+ YR DL D  RQALA  + +L   + +AY+  D     
Sbjct: 525 PAAFDTAFAALLTVRPALRGSDAYRRDLTDTARQALANRSWQLIGQLQDAYRRKDRATFR 584

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
            LS  +L L+   + +   H  FLLGPWL  A+ +A   E+E ++
Sbjct: 585 ALSGLWLHLMRLSEDVTGAHRQFLLGPWLTDARAMASGPEEEARL 629


>gi|418473272|ref|ZP_13042874.1| putative alpha-N-acetylglucosaminidase, partial [Streptomyces
           coelicoflavus ZG0656]
 gi|371546106|gb|EHN74664.1| putative alpha-N-acetylglucosaminidase, partial [Streptomyces
           coelicoflavus ZG0656]
          Length = 716

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 117/462 (25%), Positives = 199/462 (43%), Gaps = 40/462 (8%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+ GP+ +  ++Q+  L ++I  R+ ELGM PVLP + G VP       P   + 
Sbjct: 213 MQNMSGFAGPVSERLIEQRAALGRRIANRLRELGMTPVLPGYYGTVPPDFTARNPGGTVV 272

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W   +  P W     LD    +F  +  +F   Q + +G  S +Y  D   E   P
Sbjct: 273 PQGQWVGFER-PDW-----LDPRTGVFSRVAASFYRHQRELFG-DSTMYKMDLLHEGGRP 325

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            + P  +     A+ + +Q+    AVW + GW  +       P  + ++++V   +L+++
Sbjct: 326 GNVP--VGDAARAVMNALQTARPGAVWTLIGWQNN-------PSTQ-IIDAVDKSRLLIV 375

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           D  ++          ++G PY +  + NF G+  +       A      RT   + + G+
Sbjct: 376 DGLSDRYDGLDRETAWHGAPYAFGTIPNFGGHTTVGANTAVWAERFDRWRTEPGSALAGI 435

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
               EG   NPV Y+L +E+A++ E VD   W   Y+ RRYGR  P    AW +L    Y
Sbjct: 436 AYLPEGTGGNPVAYELFTELAWRTEPVDHSGWFAAYAERRYGRPDPHAARAWELLRTGPY 495

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
           +   G   + +D +                         ++  +  +  +S+    + Y 
Sbjct: 496 SMPSGTWSEAQDSLF-----------------------TARPRLTATSAASWSPGAMRYD 532

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
              V  AL   +     L  ++ YR+DL+D+ RQALA  +  L   I  AY   D     
Sbjct: 533 PDTVRAALAELLKVAPALRTTDAYRFDLVDVARQALANRSRSLLPEIKAAYDAGDLSRFR 592

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
             +  + + ++ +D LLA    FLLGPWL  A+   +   ++
Sbjct: 593 AGAAEWKDDLDLLDRLLATDSRFLLGPWLADARSWGRTAAEK 634


>gi|408676293|ref|YP_006876120.1| Alpha-N-acetylglucosaminidase [Streptomyces venezuelae ATCC 10712]
 gi|328880622|emb|CCA53861.1| Alpha-N-acetylglucosaminidase [Streptomyces venezuelae ATCC 10712]
          Length = 855

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 116/467 (24%), Positives = 196/467 (41%), Gaps = 44/467 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           + NL G+ GP+ +  ++ +  L  +I   +  LGM PVLP + G VP       P A   
Sbjct: 327 LQNLSGFAGPVSEQLIEARAALGARIARHLRSLGMTPVLPGYFGTVPPDFTARNPGAHTV 386

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W      P W     LD T P+F  +   +   Q + +G  S ++  D   E   P
Sbjct: 387 PQGRWVGF-GRPDW-----LDPTGPVFARLAAVYYRHQRQRFG-DSDMFKMDLLHEGGAP 439

Query: 121 --VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             VD    +S+   A+   +++    A W+M GW  +  P        ALL+ V   +L+
Sbjct: 440 GTVD----VSAAAGAVQRALEAARPGATWVMLGWQLNPTP--------ALLHGVDRRRLL 487

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
           ++D  ++         ++ G PY +  + NF G+  +     +            ++ + 
Sbjct: 488 IVDGLSDRYDELDRETRWGGTPYAFGTIPNFGGHTSIGANTGAWVSRFHAWLAKPDSALR 547

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           G+    E    NPV + L +E+A+Q   +D + W   Y+ RRYG +      AW  L   
Sbjct: 548 GIAYLPEATGTNPVAFGLFTELAWQPGPIDQQRWFAGYAARRYGGADRHAAAAWEALRLG 607

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
            Y+   G+  + +D + A     PS+ + T  ++       S +A+              
Sbjct: 608 PYSMRTGSWSEPQDSLFA---ARPSLTASTAARW-------SPKAMR------------- 644

Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
           Y  + V RAL   +     L  S+ YR+D++D+ RQAL   A  L   I  AY+  D   
Sbjct: 645 YDAATVERALAELLRVAPRLRTSDAYRFDVVDVARQALTNRARVLLPRIRAAYEARDLDA 704

Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
              L R +    E +  L+     FL+GPWL +A+    +  +  ++
Sbjct: 705 FRALVREWGAAEELLGRLVGSDRRFLVGPWLAAARSWGADPAERDRL 751


>gi|297194750|ref|ZP_06912148.1| alpha-N-acetylglucosaminidase [Streptomyces pristinaespiralis ATCC
           25486]
 gi|297152431|gb|EFH31740.1| alpha-N-acetylglucosaminidase [Streptomyces pristinaespiralis ATCC
           25486]
          Length = 816

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 119/452 (26%), Positives = 198/452 (43%), Gaps = 49/452 (10%)

Query: 16  LDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPS--AKITQLGNWFSVKSDPR 73
           ++++  L ++I  R+ ELGM+PVLP + G VP       P   A++   G W      P 
Sbjct: 6   IERRTELGRRITDRLRELGMHPVLPGYFGTVPDDFPGHNPGSDARVIPQGTWGGGMRRPD 65

Query: 74  WCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAA 133
           W     LD     F ++  AF   Q + +G  SH +  D   E     D P  +     A
Sbjct: 66  W-----LDPRTQAFSDVAAAFYRHQGELFGDVSH-FKMDLLHEGGTAGDVP--VPDAARA 117

Query: 134 IYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS 193
           + + +Q+    A W++ GW  +  P         +L+S+   +++++D  +++  +    
Sbjct: 118 VETSLQTARPGATWVILGWQSNPRPV--------MLDSIDTSRVLIVDGLSDLDTVTDRE 169

Query: 194 KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVV 253
             + G PY +  + NF G   +    D         R    + +VG     E  E++P  
Sbjct: 170 ADWGGAPYAFGTIPNFGGRTTIGANTDRWTEKFTAWRDKPGSALVGTAYMPEAAERDPAA 229

Query: 254 YDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDV 313
            +L SE+A++ EK+D +AW  +Y+  RYG    + ++A+  L  T Y  T          
Sbjct: 230 LELFSELAWREEKIDREAWFAEYAQIRYGGVDHSAREAFAALAATAYKLTS--------- 280

Query: 314 IVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIA 373
                         T+G+   Y    S+   L +   +   P      +   RA    +A
Sbjct: 281 --------------TDGR--PYDSLFSRRPSLTTAIGTAFDP------AGFDRAFAALLA 318

Query: 374 SGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDM 433
               L  S+ YR+DL D+ RQALA  +  L L +  AY+  D      +S  +L+++   
Sbjct: 319 VRAPLRDSDAYRHDLTDVARQALANRSRTLQLALRAAYRNKDVATFRAVSALWLKVMRLS 378

Query: 434 DGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
           D +  CH  FLLGPWLE AK+LA + E+  Q+
Sbjct: 379 DTMAGCHRQFLLGPWLEDAKRLATSPEEAVQL 410


>gi|429201402|ref|ZP_19192867.1| Tat pathway signal sequence domain protein [Streptomyces ipomoeae
           91-03]
 gi|428663010|gb|EKX62401.1| Tat pathway signal sequence domain protein [Streptomyces ipomoeae
           91-03]
          Length = 1042

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 114/467 (24%), Positives = 195/467 (41%), Gaps = 44/467 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAA-LQNVFPSAKI 59
           + NL G+GGPL    +D++  L ++I  R+ ELGM+PVLP + G+VP   ++     A +
Sbjct: 212 LQNLSGYGGPLSPELIDRRAALGRRIADRLRELGMSPVLPGYYGHVPKEFVERNGGDAHV 271

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
              G W   +  P W     LD     F ++  +F   Q   +G  +H +  D   E   
Sbjct: 272 VPQGVWHGFER-PDW-----LDPRTDSFAKVAASFYGHQEDVFGEAAH-FKMDLLHEGGT 324

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
             D P  +      +   +Q     A W++ GW    +P      +  LL+++   ++++
Sbjct: 325 AGDVP--VPGAAQGVERALQKARPGATWVILGW--QENP------LPELLDAIDKSRMLI 374

Query: 180 LDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
           +D  ++    +    + + G PY +  + NF G   +              R   N+ + 
Sbjct: 375 VDGVSDRYTSVTDRERDWGGTPYCFGTIPNFGGRTTIGARAHIWNEKFFAWRDKANSALA 434

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           G     E  +++P  ++L SE+A+   K+D  AW + Y+  RYG    + + AW  L+ T
Sbjct: 435 GTAFMPEATDRDPAAFELFSELAWTPTKIDRAAWFSAYADYRYGARDDSARRAWRALHDT 494

Query: 299 VYNCTD-GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
            Y       +D +  +  A PD+                             + Y    L
Sbjct: 495 AYQQRAVERSDPHDSLFCARPDL------------------------AADRAAEYAPRAL 530

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            Y       AL   +     L  S  Y+YD++D+ RQALA  + +    +  AYQ  D  
Sbjct: 531 TYDPGRFDAALAGLLGVAGGLRGSAAYKYDVVDVARQALAHRSRQYLPQLRAAYQRKDLA 590

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
               LS  +L L+   D +   +  FLLGPW+  A+ LA N+ +  +
Sbjct: 591 TFRALSTLWLRLMRLSDEVTGANSAFLLGPWVNDARLLATNDAERAE 637


>gi|329940646|ref|ZP_08289927.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
 gi|329300707|gb|EGG44604.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
          Length = 798

 Score =  155 bits (391), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 119/466 (25%), Positives = 196/466 (42%), Gaps = 49/466 (10%)

Query: 10  PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVK 69
           P+    LD + VL +++  R+ ELGM PVLP + G VP         A+    G W    
Sbjct: 227 PVSTQLLDARAVLGRRLADRLRELGMVPVLPGYFGTVPPGFAARNRGARTVPQGTWMGFD 286

Query: 70  SDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISS 129
             P W     LD    LF  +  AF   Q + +G ++H Y  D   E     D P  +  
Sbjct: 287 R-PDW-----LDPRTDLFARVAAAFYRVQGELFGASTH-YKMDLLHEGGTAGDVP--VGE 337

Query: 130 LGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG---------KLVVL 180
               +   ++    DAVW++ GW  +      PP  +A+L++V  G         +L+V+
Sbjct: 338 AAKGVERALRRARPDAVWVLLGWRHN------PP--RAILDAVASGGPDGAAGRERLLVV 389

Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           D  ++  P +      + GVPY +  + NF G+  +       A      RT E + + G
Sbjct: 390 DGLSDRFPTVTDREADWGGVPYAFGSIWNFGGHTTLGANTPDWARLYEAWRTKEGSALRG 449

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           + +  E  + NP  + L SE+ +   ++D+KAW  +++  RYG      + AW+VL  T 
Sbjct: 450 IALLPEAADNNPAAFALFSELPWHEGELDLKAWFARWARSRYGAYDAHAEAAWDVLRRTA 509

Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
           Y  T                      S +EG    +G   ++ ++     +S+    L Y
Sbjct: 510 YGTT-------------------RADSWSEGADGLFG---ARPSLTARRAASWSPKELRY 547

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
              E  RAL+  +     L  S+ YR DL+D+ RQ L+  +  L   I  A    D    
Sbjct: 548 DAHEFERALDELLKVRPGLRESSAYRRDLLDVARQCLSNRSRALLPRIARACAARDVKAF 607

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
              S  +L L++ ++ L+      LLG W   A+    +E +  ++
Sbjct: 608 DAASGDWLSLMDLLERLVGTDARHLLGRWTAQARAWGADEAERDRL 653


>gi|291302495|ref|YP_003513773.1| alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
           44728]
 gi|290571715|gb|ADD44680.1| Alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
           44728]
          Length = 696

 Score =  154 bits (390), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 125/464 (26%), Positives = 209/464 (45%), Gaps = 55/464 (11%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M  + G+GG + +  ++++  L ++I  R+ ELG+ PVLP F+G VP    ++  +A I 
Sbjct: 178 MGCMCGFGG-VSRRLVEERAELGRRITDRMRELGIEPVLPGFAGLVPG---DIGDTAAIP 233

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
           Q G WF     P W  T     T   + E+   F  +Q +  G T      D   E   +
Sbjct: 234 Q-GQWFGFDR-PAWLPT-----TTRAYAEVAEVFYAKQTERLGAT-RAQAVDLLHEGGTS 285

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
             VD  +    + AA    M+    D +W++Q W       W  P  + L  +       
Sbjct: 286 GGVDLADATRGIAAA----MERAHDDYLWVLQAW-------WDNPLPEVLAAT----DSD 330

Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
            L L       W  +K ++G P+    L NF G   ++G L  IA  P      + +++V
Sbjct: 331 HLLLLDLTGEGWRKTKGWHGKPWARGSLTNFGGRTVLFGGLPEIAELPSLKDDPKASSLV 390

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           G  +  E  + NPVV+ L ++ ++    +D+ AW+ +Y   RYG++ P    AW+ L  T
Sbjct: 391 GTALVEEAWQVNPVVWSLFTQTSWADGDIDLNAWVPEYVAARYGKAHPRAVRAWHGLLAT 450

Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH-L 357
            Y   DG       ++ A P +D                         ++ +S + PH L
Sbjct: 451 AYRSMDGRPGGAESLLCAMPSLD-------------------------ADRASMNGPHSL 485

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            Y    +  A    +A+   L  ++T+R+DL+D+TRQ ++  A  L   +  AY + +  
Sbjct: 486 PYPAEALEVAWRDLLAAREALGGADTFRFDLVDVTRQVISNRARPLLPLLRTAYAMKELD 545

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
               LS  F++L E +D +LA  + FL+G WL  A+ LA +E++
Sbjct: 546 RFIALSHSFIDLFELLDPVLATREEFLVGRWLADARALAADEDE 589


>gi|291301158|ref|YP_003512436.1| alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
           44728]
 gi|290570378|gb|ADD43343.1| Alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
           44728]
          Length = 734

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 116/466 (24%), Positives = 203/466 (43%), Gaps = 44/466 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL  + GP+ Q  LD +  L ++I  R+ ELG+ PVLP + G +P         A+  
Sbjct: 209 MQNLSAFPGPISQHLLDSRAELARRIRTRMAELGIRPVLPGYFGTIPGGFAKRNQQARTV 268

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W+   S P W     LD T   F ++  +F   Q +  G  + +Y  D   E   P
Sbjct: 269 PQGVWYGF-SRPDW-----LDPTGNEFAKVAASFYRHQAQLLGE-ADMYKMDLMHEGGDP 321

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
              P   ++ G A+   +Q     A W+M GW        R      +L  +   +++++
Sbjct: 322 GGIPIPDAAKGVAL--ALQRARPGATWVMLGW--------RKNPRTDILTDIDTSRVLIV 371

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           D  ++          + G PY +  + NF G+  +       A    + RT+ ++ + G+
Sbjct: 372 DGISDRFDDLDREHTWPGTPYAFGTIPNFGGHTTIGANAKVWAKRFGQWRTAPDSAVSGI 431

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
               EG  ++P  ++L +E+A++ + +D+  W   Y+ RRYG +    + AW+ L  + Y
Sbjct: 432 AWMPEGAGRDPAAFELFAELAWR-DSIDLGEWFADYADRRYGGADDNARTAWDALRRSAY 490

Query: 301 NCTDGATDKNRDVIV-AFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
               G   +  D +  A P +D     VT   Y                   +    L Y
Sbjct: 491 AMPSGRWAEAADGLFGARPGLD-----VTHADY-------------------FSPEFLRY 526

Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
             +   +AL   +     L  ++ YR+DL+D+ RQ+L     EL   +  A+   +    
Sbjct: 527 DAAVFAQALPALLDVDKSLH-NDAYRFDLVDVARQSLVNAGRELLPRVKSAFVNQNKKQF 585

Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
            + +R +L+ +  +D LL     FLLGPWLE+A++ A+  ++ K +
Sbjct: 586 DKHTRTWLDWMRLLDRLLETDRRFLLGPWLEAARRSARTADEAKDL 631


>gi|329934959|ref|ZP_08285000.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
 gi|329305781|gb|EGG49637.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
          Length = 1017

 Score =  150 bits (379), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 113/468 (24%), Positives = 194/468 (41%), Gaps = 46/468 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPS-AKI 59
           + NL+G+GGPL    + ++  L ++I  R+  LGM PVLP + G+VP    +     A +
Sbjct: 191 LQNLYGYGGPLSAELIARRAALGRRIADRLRALGMRPVLPGYYGHVPKDFADRRGGDAHV 250

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
              G W      P W     LD     F E+  +F   Q   +G  +  +  D   E   
Sbjct: 251 VPQGTWHGFDR-PSW-----LDPRTDAFAEVAASFYRHQEDVFG-PAGDFKMDLLHEGGT 303

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
             D P  +      +   +++    A W++ GW    +P      +  LL++V   ++++
Sbjct: 304 AGDVP--VPDAARGVEKALRAARPGATWVILGW--EANP------LPELLDAVDKKRMLI 353

Query: 180 LDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTM 237
           +D  ++    +    + + G PY +  + NF G   + G    I      A R    + +
Sbjct: 354 VDGVSDRYTSVTDREEDWGGTPYAFGTIPNFGGRTTI-GARTHIWREKFFAWRDKPGSAL 412

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
            G     E  +++P  ++L SE+A+  E VD   W   Y+  RYG      + AW  L+ 
Sbjct: 413 AGTAYLPEAADRDPAAFELFSELAWTDEPVDRARWFTGYADFRYGGRDAGARRAWRALHD 472

Query: 298 TVYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
           T Y    +  +D +  +  A PD+                          +  + Y    
Sbjct: 473 TAYQQHANERSDPHDSLFCARPDL------------------------AATRAARYAPAA 508

Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
           L Y  +    AL   +A          YRYDL+D+ RQALA  + +    +  A+   DA
Sbjct: 509 LTYDPARFDAALSGLLAVAAHRRGGAAYRYDLVDVARQALAHRSRQYLPQLKAAFDREDA 568

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
                L+ ++L L+   + +   H  FLLGPW+E A+++A N  +  +
Sbjct: 569 ATFKALATQWLTLMRLSEDITGTHPAFLLGPWIEDARRMATNPRERAE 616


>gi|169351438|ref|ZP_02868376.1| hypothetical protein CLOSPI_02218 [Clostridium spiroforme DSM 1552]
 gi|169291660|gb|EDS73793.1| F5/8 type C domain protein [Clostridium spiroforme DSM 1552]
          Length = 1762

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 122/473 (25%), Positives = 209/473 (44%), Gaps = 67/473 (14%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M N+   GGP+   W+  +L + ++       LGM  VL  ++G VP    + +   +I 
Sbjct: 816  MDNMEVIGGPVSDEWVKGRLEMARENQRWKNSLGMQTVLQGYAGMVPNNFTD-YQDVEIL 874

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP- 119
            + GNW  V   PR     ++     L+ +  + F E Q   +G+TS+ Y  D F E    
Sbjct: 875  EQGNWCGV---PR---PDMIRTDGELYDQYAKLFYEAQEWAFGKTSNYYAVDPFHEGGKR 928

Query: 120  PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKAL--LNSVPLGKL 177
            P D  + +  +   + + +   D +AVW++Q W       W  P    L  +       +
Sbjct: 929  PSDLTDDV--ISREVLNSLLEYDQEAVWMVQAW-------WSNPTNDLLKGMGDDREDHV 979

Query: 178  VVLDLFA------------EVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
            ++LDL              E       S +F    ++WCML N+ GN  M G    I   
Sbjct: 980  IILDLNGLNDAYDSYWDKTEYNGTVLESDEFNSTSWVWCMLENYGGNPSMDGRPKEI-IN 1038

Query: 226  PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
             +   +++   M G+G   E    NP++Y+L+ +MA+Q + +D+  W+++Y +RRYG   
Sbjct: 1039 RINKASTQAEHMKGIGFISEATYDNPMIYELLLDMAWQQDTIDLDDWLDEYVLRRYGDYS 1098

Query: 286  PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
             +  +AW++L  TVY+ +   TD     ++A    DPS++         YG P       
Sbjct: 1099 ESAGEAWDILLKTVYSRSGKTTD-----VIARS--DPSLVQ--------YGLP------- 1136

Query: 346  KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
                         Y+ SE+  ALEL     ++LSAS  YRYDL ++ RQ +  YA     
Sbjct: 1137 -------------YTASELEEALELLYKDYDKLSASEAYRYDLTEIMRQVVNNYAVVRLG 1183

Query: 406  NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
            ++  AY   +      L  ++L  ++ ++ +       L+G W+  A   A++
Sbjct: 1184 DLKTAYDAKEIDNFKSLKEQYLNAIDLLNEVCGTQQDLLIGEWVGRAVDWAKD 1236


>gi|29832531|ref|NP_827165.1| alpha-N-acetylglucosaminidase [Streptomyces avermitilis MA-4680]
 gi|29609651|dbj|BAC73700.1| putative alpha-N-acetylglucosaminidase, secreted [Streptomyces
           avermitilis MA-4680]
          Length = 1038

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 114/468 (24%), Positives = 194/468 (41%), Gaps = 46/468 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAA-LQNVFPSAKI 59
           + NL G+GGPL    + ++  L ++I  R+  LGM PVLP + G+VP   ++     A +
Sbjct: 212 LQNLSGYGGPLSPELIAERAGLGRRICDRLRALGMAPVLPGYYGHVPKGFVERNGGDAHV 271

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
              G W   +  P W     LD     F  + ++F   Q   +G+ +H +  D   E   
Sbjct: 272 VPQGIWHGFER-PDW-----LDPRTASFAAVAKSFYRHQKDVFGKAAH-FKMDLLHEGGT 324

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
             D P  +      +   +Q+    A W++ GW    +P      + ALL+++   K+++
Sbjct: 325 AGDVP--VPGAARGVEKALQAAHPGATWVILGW--EANP------LPALLDAIDKKKMLI 374

Query: 180 LDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
           +D  ++    +    K + G PY +  + NF G   +              R    + + 
Sbjct: 375 VDGVSDRYTSVTDREKDWGGTPYAFGTIPNFGGRTTIGARAHLWNEKFFAWRDKAGSALA 434

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           G     E  +++P  ++L SE+A+   K+D  AW + Y+  RYG    + Q AW  L+ T
Sbjct: 435 GTAYLPEAADRDPAAFELFSELAWSAGKIDRAAWFSSYADFRYGGRDASAQKAWRALHDT 494

Query: 299 VYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
            Y       +D +  +  A PD+                          +  + Y    L
Sbjct: 495 AYQQHAVERSDAHDSLFCARPDL------------------------AANRAAEYAPRAL 530

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
            Y       AL   +     L  S  Y YDL+D+ RQALA  + + +L ++ A       
Sbjct: 531 TYDPGRFDAALSGLLGVAGGLRGSAAYTYDLVDVARQALAHRSRQ-YLPLLRAAYARKDA 589

Query: 418 GVF-QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
             F  L+  +L L+   D +   H  FLLGPW+  A+ LA +  +  +
Sbjct: 590 AAFTSLATLWLRLMGLSDEVTGTHPAFLLGPWINDARLLATDAGERAE 637


>gi|281423203|ref|ZP_06254116.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
 gi|281402539|gb|EFB33370.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
          Length = 450

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 93/267 (34%), Positives = 136/267 (50%), Gaps = 26/267 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+NL GWGGPLP SW +QQ  LQKKIL R++E GM PVLP F G +P   +       +T
Sbjct: 186 MNNLEGWGGPLPDSWYNQQEALQKKILKRMHEYGMQPVLPGFCGMMPHDAKAKL-GLNVT 244

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W              L  TD    +I   +  +    YG+ ++ Y+ D F E    
Sbjct: 245 DGGIWNGYTRPAN------LSPTDAHSDKIADLYYAELTNLYGKANY-YSMDPFHETND- 296

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
            D     S  G  +   M+  + +A W++QGW  + +P  RP  +K + N    G L+VL
Sbjct: 297 -DEAIDYSKAGRKVMEAMKRVNPNATWVIQGW--TENP--RPQMIKNMKN----GDLLVL 347

Query: 181 DLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN 234
           DLF+E +P      IW   K +    +++CML NF  N+ ++G +D +       + S  
Sbjct: 348 DLFSECRPMFGIPSIWKREKGYEQHDWLFCMLENFGANVGLHGRMDLLLHNFYSTKQSSP 407

Query: 235 TT--MVGVGMSMEGIEQNPVVYDLMSE 259
            T  + G+G +MEG E NPV+++LMSE
Sbjct: 408 NTQHLKGIGFTMEGSENNPVMFELMSE 434


>gi|293402299|ref|ZP_06646437.1| putative alpha-N-acetylglucosaminidase [Erysipelotrichaceae bacterium
            5_2_54FAA]
 gi|291304406|gb|EFE45657.1| putative alpha-N-acetylglucosaminidase [Erysipelotrichaceae bacterium
            5_2_54FAA]
          Length = 2330

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 111/463 (23%), Positives = 197/463 (42%), Gaps = 56/463 (12%)

Query: 1    MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
            M N+  +GGP+P  ++  ++ L +        LGMN VL  ++G VP       P+  +T
Sbjct: 670  MQNIETFGGPIPDQYVVDRVELARTTQRWKNSLGMNTVLQGYAGMVPTNFNEFQPNVPLT 729

Query: 61   QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
               +W  + + P    T       P + E  + F E Q   YG TS  Y  D + E    
Sbjct: 730  AQKSWGGL-ARPSMIPT-----DSPYYDEYAKLFYEAQEYIYGATSDYYAVDPYHEGGT- 782

Query: 121  VDSPEYIS--SLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSV---PLG 175
               PE +S  ++   + + +   D DAVW++Q W        +      LLN +      
Sbjct: 783  --RPEGLSDETVAREVLNSLLDYDKDAVWVVQAW--------QSNPTDGLLNGMGEYREN 832

Query: 176  KLVVLDLFAEVKPIWS--TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE 233
             ++++DL       W+     +F G  + W +L  F GN  M G + ++    ++    E
Sbjct: 833  HVLIVDLIKYPIKSWTKYNKSEFKGTSWAWGLLGGFGGNPTMNGEMQTM-VNDIQTAKKE 891

Query: 234  NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWN 293
             T M G+G+  E    NPV+YDL+ ++A+  +   +  W+N+Y  RRYG +    ++AW 
Sbjct: 892  RTHMAGLGIISEAQYDNPVLYDLIFDLAWVDDDFSLDQWLNKYIERRYGGTSDNAKEAWK 951

Query: 294  VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
            ++ +  YN             V F      +  +     Q+YGK                
Sbjct: 952  IMKNANYNHG-----------VRFT---AQVYGMKGKSPQDYGK---------------- 981

Query: 354  HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
              ++ Y   ++  A  L I   ++   S  YRYDL ++ RQ ++ Y+   + N+I+A + 
Sbjct: 982  -QNISYGADKLETAFRLLIEDYDKFKDSECYRYDLTEIMRQMVSNYSTLTYNNVIDARED 1040

Query: 414  NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
             +     +   +FL+  + ++ +       L G W+  A+  A
Sbjct: 1041 KNIEKFKEEKAKFLKSFDVLNDIQETQVDQLAGEWIGKAQDRA 1083


>gi|380804373|gb|AFE74062.1| alpha-N-acetylglucosaminidase precursor, partial [Macaca mulatta]
          Length = 265

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 64/145 (44%), Positives = 92/145 (63%), Gaps = 3/145 (2%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L R+   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 124 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 183

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 184 KMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRELVKEFG-TDHIYGADTFNEMQPP 240

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDA 145
             +P Y+++   A+Y  M + D++A
Sbjct: 241 SSAPSYLAAATTAVYEAMIAVDTEA 265


>gi|440695019|ref|ZP_20877582.1| Tat pathway signal sequence domain protein [Streptomyces
           turgidiscabies Car8]
 gi|440282912|gb|ELP70302.1| Tat pathway signal sequence domain protein [Streptomyces
           turgidiscabies Car8]
          Length = 1050

 Score =  137 bits (344), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 120/470 (25%), Positives = 201/470 (42%), Gaps = 47/470 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAAL--QNVFPSAK 58
           + N+ G+GGP+ +  ++++  L  KI  R+ ELGM PVLP + G VP     +N   +A 
Sbjct: 216 LQNMSGFGGPVSRRLIEKRADLAAKITERVRELGMTPVLPGYFGTVPDEFVARNGGDAAV 275

Query: 59  ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
           + Q G+W + K  P W     LD     F E+  AF + Q + +G  S +Y  D   E  
Sbjct: 276 VPQ-GDWGAFKR-PDW-----LDPRTTAFGEVAAAFYQAQSERFG-DSTMYKMDLLHEGG 327

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
            P D P  +     A+ + ++     AVW + GW  +       P  + +L++V   ++ 
Sbjct: 328 NPGDVP--VGRAAQAVEAALRKAHPGAVWAILGWQNN-------PSGE-ILDAVDKSRMF 377

Query: 179 VLDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
           V+D  ++    +      + G PY +  + NF G+  M            + R  E++ +
Sbjct: 378 VVDGLSDRYTTVTDRESDWGGTPYAFGSIWNFGGHTPMGANAPDWVEQYPKWRDKEDSAL 437

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
            G+    E  + N     L++++A+    +D+  W   Y+V RYG   P    AW ++  
Sbjct: 438 AGIAAMPEAADNNHAALALLTDLAWTPGTIDLDDWFASYAVSRYGAEDPHALAAWKIIGD 497

Query: 298 TVYNCT--DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
           T Y  +  DG ++    +  A P +                   +K A    E   YD  
Sbjct: 498 TAYGMSRADGWSEAPDGLFGARPSLG-----------------ANKAAAWGPEADRYD-- 538

Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
                T+    AL   +     L  ++ YRYDL D+ RQ L+  +  L   I  AY   D
Sbjct: 539 -----TTAFDLALTELLQVAPALRGNSAYRYDLADVARQVLSNRSRMLLPQIRAAYDTAD 593

Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
                +L+  +L+ +  MD +LA     LLG WL  A+       ++ Q+
Sbjct: 594 RVRFDELTGVWLDWMRLMDKVLATSGQHLLGRWLADARSWGATRGEKDQL 643


>gi|456390168|gb|EMF55563.1| alpha-N-acetylglucosaminidase [Streptomyces bottropensis ATCC
           25435]
          Length = 1042

 Score =  134 bits (338), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 113/467 (24%), Positives = 198/467 (42%), Gaps = 44/467 (9%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAA-LQNVFPSAKI 59
           + NL G+GGPL    + ++  L ++I  R+ ELGM+PVLP + G+VP   ++     A +
Sbjct: 212 LQNLSGYGGPLSPQLIARRAGLGRRITDRLRELGMSPVLPGYYGHVPKQFVERNGGDAHV 271

Query: 60  TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
              G W   +  P W     LD     F  +  +F       +G  +H +  D   E   
Sbjct: 272 VPQGLWHGFER-PDW-----LDPRTDSFARVAASFYGHVRDVFGAAAH-FKMDLLHEGGT 324

Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
             D P  +      +   +     DA+W++ GW    +P      +  LL+++   ++++
Sbjct: 325 AGDVP--VPDAARGVERALHKAHPDAIWVILGW--QENP------LPELLDAIDRSRMLI 374

Query: 180 LDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
           +D  ++    +    + + G PY +  + NF G   +              R   ++ +V
Sbjct: 375 VDGVSDRYASVTDRERDWGGTPYCFGTIPNFGGRTTIGARAHLWTDKFFAWRDKPDSALV 434

Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
           G     E  +++P  ++L SE+A+   K+D  AW + Y+  RYG    A + AW  L+ T
Sbjct: 435 GTAYMPEATDRDPAAFELFSELAWTPGKIDRAAWFSAYADFRYGGRDDAARAAWRALHET 494

Query: 299 VYNCTD-GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
            Y       +D +  +  A PD+                    + A     T +YD    
Sbjct: 495 AYQQRAVERSDPHDSLFCARPDL-----------------AADRAAEYAPRTLTYDPGRF 537

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
                +   A  L +A G   + +  YRYD++DL RQALA  + +    +  A++  D  
Sbjct: 538 -----DAAFAGLLDVAGGRRRNPA--YRYDVVDLARQALAHRSRQYLPQLRAAHRRKDLT 590

Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
               LS  +L L+   D +      FLLGPW+  A+ LA ++ +  +
Sbjct: 591 TFRALSTLWLRLMRLSDEVTGTDGAFLLGPWVNDARLLATDDAERAE 637


>gi|84625358|ref|YP_452730.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
 gi|84369298|dbj|BAE70456.1| putative N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae
           MAFF 311018]
          Length = 590

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 109/421 (25%), Positives = 177/421 (42%), Gaps = 72/421 (17%)

Query: 54  FPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDT 113
            P A+I ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D 
Sbjct: 46  LPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQAYG-AGEFYLADA 98

Query: 114 FDENTPPV------------------------------DSPEYISSLGAAIYSGMQSGDS 143
           F+E  PPV                                   +++ G A+Y  +   + 
Sbjct: 99  FNEMLPPVADDGSDVAAAKYGDSIANFDAARAKAVPPAQRDARLAAYGQALYRSIAQVNP 158

Query: 144 DAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPY 201
            A W+MQGWLF  D  FW+P  + A L  VP  +L+VLD+  +  P  W  S+ F    +
Sbjct: 159 KATWVMQGWLFGADCAFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQW 218

Query: 202 IWCMLHNFAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVGMSMEGIEQNPVVYDLM 257
           I+  +HN+  +  +YG    +AF    ++A  ++     + G G+  EG+  N VVY+ +
Sbjct: 219 IYGYVHNYGASNPLYG---DVAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVVYEYL 275

Query: 258 SEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAF 317
             +A++  +     W+ QY   RYGRS  A+  AW  L   +Y     +           
Sbjct: 276 YALAWEGPQHPWSQWLAQYLRARYGRSDAALLSAWTDLGAGIYQTRYWS----------- 324

Query: 318 PDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE 377
               P   +   G Y  + +P +            D P        +  A++  +   + 
Sbjct: 325 ----PRWWNTHAGAYLLFKRPTADIVNFD------DRPG---DPQRLRSAIDALLQQADR 371

Query: 378 LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGL 436
            + +  YRYDLI+  R  L+  A+     +++AY   D A G  QL+R   +LV+ +D L
Sbjct: 372 YADAPLYRYDLIEDARHYLSLQADRQLQTVVQAYNAGDFARGDAQLART-TQLVQGLDAL 430

Query: 437 L 437
           +
Sbjct: 431 V 431


>gi|260821254|ref|XP_002605948.1| hypothetical protein BRAFLDRAFT_132235 [Branchiostoma floridae]
 gi|229291285|gb|EEN61958.1| hypothetical protein BRAFLDRAFT_132235 [Branchiostoma floridae]
          Length = 673

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 93/321 (28%), Positives = 131/321 (40%), Gaps = 97/321 (30%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ GWGGPLPQSW   QL LQ KIL R+              N  + L +++    + 
Sbjct: 368 MGNIRGWGGPLPQSWHQNQLELQHKILARMR-------------NFDSTLMHLY----LD 410

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G     ++    C T  +                             +C    E    
Sbjct: 411 YSGGDLKTRTVAHTCWTLRI-----------------------------HCFLTLEECLL 441

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           +  P Y+S  GAA+Y+GM +GD  A+WLMQGWLF    FW+P Q KALL SVP G     
Sbjct: 442 LSEPNYLSKAGAAVYAGMLAGDPQAIWLMQGWLFQARDFWQPAQTKALLQSVPEG----- 496

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
                                                        P  AR    +TMVG 
Sbjct: 497 ---------------------------------------------PFLARKYLGSTMVGT 511

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDV-KAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
           G++ EGI+QN ++Y+LM+E+A+  +   +   W + Y+  RYG         W +L  +V
Sbjct: 512 GLTPEGIDQNYIMYELMNEVAWMPQPFQILDNWASDYAWSRYGVKNSNASLGWQILLKSV 571

Query: 300 YNCTDGATDKNRDVIVAFPDV 320
           Y+C +G  D    V+V  PD+
Sbjct: 572 YDCENGFKDHCDSVVVHRPDL 592


>gi|58583545|ref|YP_202561.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae KACC 10331]
 gi|58428139|gb|AAW77176.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae KACC 10331]
          Length = 753

 Score =  131 bits (329), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 108/421 (25%), Positives = 178/421 (42%), Gaps = 72/421 (17%)

Query: 54  FPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDT 113
            P A+I ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D 
Sbjct: 209 LPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQAYG-AGEFYLADA 261

Query: 114 FDENTPPV------------------------------DSPEYISSLGAAIYSGMQSGDS 143
           F+E  PPV                                   +++ G A+Y  +   + 
Sbjct: 262 FNEMLPPVADDGSDVAAAKYGDSIANFDAARAKAVPPAQRDARLAAYGQALYRSIAQVNP 321

Query: 144 DAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPY 201
            A W+MQGWLF  D  FW+P  + A L  VP  +L+VLD+  +  P  W  S+ F    +
Sbjct: 322 KATWVMQGWLFGADCAFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQW 381

Query: 202 IWCMLHNFAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVGMSMEGIEQNPVVYDLM 257
           I+  +HN+  +  +YG    +AF    ++A  ++     + G G+  EG+  N VVY+ +
Sbjct: 382 IYGYVHNYGASNPLYG---DVAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVVYEYL 438

Query: 258 SEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAF 317
             +A++  +     W+ QY   RYGRS  A+  AW  L   +Y                 
Sbjct: 439 YALAWEGPQHPWSQWLAQYLRARYGRSDAALLSAWTDLGAGIYQTR-------------- 484

Query: 318 PDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE 377
               P   +   G Y  + +P +   ++  +    D          +  A++  +   + 
Sbjct: 485 -YWSPRWWNTHAGAYLLFKRPTAD--IVNFDDRPGD-------PQRLRSAIDALLQQADR 534

Query: 378 LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGL 436
            + +  YRYDLI+  R  L+  A+     +++AY   D A G  QL+ R  +LV+ +D L
Sbjct: 535 YADAPLYRYDLIEDARHYLSLQADRQLQTVVQAYNAGDFARGDAQLA-RTTQLVQGLDAL 593

Query: 437 L 437
           +
Sbjct: 594 V 594


>gi|339238239|ref|XP_003380674.1| GDP-L-fucose synthetase [Trichinella spiralis]
 gi|316976398|gb|EFV59699.1| GDP-L-fucose synthetase [Trichinella spiralis]
          Length = 1203

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 75/220 (34%), Positives = 113/220 (51%), Gaps = 7/220 (3%)

Query: 89   EIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWL 148
             +G   + + L+ Y    H Y+ D F+E  P      ++ ++  AIY+ M S D  +VW+
Sbjct: 801  HVGNEVVWKSLENYFGLFHAYSADPFNEMVPNTFDVMFLRNVSFAIYNVMLSVDPKSVWV 860

Query: 149  MQGWLFSYDPFWRPPQ-MKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLH 207
            +Q W+F     W   +  K  L +VP G ++V+DL+AE  P++     FY  P+IWCMLH
Sbjct: 861  LQSWMFLSSERWLENENAKHFLTAVPTGSILVVDLYAEEYPLYEKFSGFYNQPFIWCMLH 920

Query: 208  NFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF--QHE 265
            NF G   +YG L  I     +  T  N  MVG G+SMEGI+QN VVY +  +  +   ++
Sbjct: 921  NFGGVQGLYGNLARINQKLADVSTVSNINMVGTGLSMEGIDQNYVVYQMALDRFWSPNNQ 980

Query: 266  KVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTD 304
            KVD+ AW   Y     G  +  +I  AW     +   C +
Sbjct: 981  KVDLAAW---YIYIHLGVGITKSIYTAWGAFLQSSRTCQE 1017


>gi|402824586|ref|ZP_10873940.1| N-acetylglucosaminidase, partial [Sphingomonas sp. LH128]
 gi|402261896|gb|EJU11905.1| N-acetylglucosaminidase, partial [Sphingomonas sp. LH128]
          Length = 486

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 78/252 (30%), Positives = 123/252 (48%), Gaps = 39/252 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL G+  PL   W++++  LQ +IL R+  LGM PVLPAF+G VP A     P A+I 
Sbjct: 223 MGNLAGYRAPLSSGWIEKKHQLQLRILARMRALGMKPVLPAFAGYVPEAFAKAHPKARIY 282

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++  W           TY LD +DPLF ++   F+    + YG   + Y  D F+E  PP
Sbjct: 283 KMRAWEGFPP------TYWLDPSDPLFTQLAARFVTLYNRTYGEGEY-YLADAFNEMIPP 335

Query: 121 V-------DSPEY-----------------------ISSLGAAIYSGMQSGDSDAVWLMQ 150
           +        + EY                       +++ G  +Y  + +    A W+MQ
Sbjct: 336 IAEDGSDAAAAEYGDSIANTAATRAAALPPAVRDARLAAYGERLYGSITAAAPKATWVMQ 395

Query: 151 GWLFSYDPFWRPPQ-MKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
           GWLF  D  +R P+ + A L+ VP  ++++LD+  +  P IW  +  F G  + +  +HN
Sbjct: 396 GWLFGADKAFRTPEAIAAFLSRVPDDRMLILDIGNDRYPGIWQKTDAFDGKAWTYGYVHN 455

Query: 209 FAGNIEMYGILD 220
           + G+  +YG L+
Sbjct: 456 YGGSNPVYGDLE 467


>gi|47212645|emb|CAF95026.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 121

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 56/121 (46%), Positives = 83/121 (68%), Gaps = 3/121 (2%)

Query: 20  LVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYL 79
           L LQ KIL ++   GM PVLPAFSGNVP  +  ++P A++T+LG W   K +  + C+Y+
Sbjct: 4   LSLQFKILEQMRSFGMTPVLPAFSGNVPKGILRLYPEARVTRLGPW--SKFNCSFSCSYI 61

Query: 80  LDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQ 139
           LD  DPLF+ IG  ++ Q +K++G T+HIYN DTF+E TPP   P Y+S++  A+++ M 
Sbjct: 62  LDPRDPLFLRIGSLYLAQVVKQFG-TNHIYNTDTFNEMTPPSSEPNYLSAVSRAVFAAMT 120

Query: 140 S 140
           +
Sbjct: 121 A 121


>gi|293371910|ref|ZP_06618314.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
 gi|292633156|gb|EFF51733.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
          Length = 411

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 84/331 (25%), Positives = 144/331 (43%), Gaps = 38/331 (11%)

Query: 129 SLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVK 187
            + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++LD   E  
Sbjct: 3   KIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMILLDYHCENV 62

Query: 188 PIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGI 247
            +W  ++ F+  PYIWC L NF GN  + G +         A  +    + G+G ++EG+
Sbjct: 63  ELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKGIGSTLEGL 122

Query: 248 EQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGAT 307
           +     Y+ + E A+ +  VD   WI   + R  G    +++DAW  L++ +Y       
Sbjct: 123 DVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDIY------- 174

Query: 308 DKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRA 367
                  V  P         T G    Y +P   +   K  ++ Y +  L     EV R 
Sbjct: 175 -------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL----LEVWRK 214

Query: 368 LELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRR 425
           L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D   +     +
Sbjct: 215 L-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQALKACGEK 267

Query: 426 FLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
             E++ D+D L A H    L  W++ A+++ 
Sbjct: 268 MKEILNDLDKLNAFHPYCSLDKWIDDARKMG 298


>gi|297723521|ref|NP_001174124.1| Os04g0650900 [Oryza sativa Japonica Group]
 gi|255675839|dbj|BAH92852.1| Os04g0650900, partial [Oryza sativa Japonica Group]
          Length = 128

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 49/64 (76%), Positives = 56/64 (87%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+HGWGGPLPQSWLD QL LQKKIL R+Y  GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 64  MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 123

Query: 61  QLGN 64
            LGN
Sbjct: 124 HLGN 127


>gi|358381741|gb|EHK19415.1| hypothetical protein TRIVIDRAFT_224650 [Trichoderma virens Gv29-8]
          Length = 217

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 58/214 (27%), Positives = 110/214 (51%), Gaps = 11/214 (5%)

Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPL-G 175
           TPP     Y+ +  +  +  ++S D +A+W+ Q WLF+ +  FW   +++     + +  
Sbjct: 2   TPPSGELNYLRNASSNTWKALKSADPEAIWVFQAWLFAQNTTFWTNDRIEVYPGGITIDS 61

Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
            +++LD++ E    W  ++ +Y  P+IWC L N+   I MYG + ++   P+ A   E+ 
Sbjct: 62  DMLILDIWLESMSQWQCAQSYYSKPWIWCELQNYGATINMYGQIQNLTKSPILA-LQESQ 120

Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY--GRSVPAIQDAWN 293
           ++VG+G+SME  + N +V+DL+   A+    +D   +   ++  RY   +   +I  AW 
Sbjct: 121 SLVGLGLSMEAQQSNEIVFDLLLSQAWNCTPIDTNIYFKSWAAARYLSSKRPASIYTAWE 180

Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 327
            +  TVY+ T      N +++ + P    S I V
Sbjct: 181 AVRATVYDNT------NLNMMSSVPKSRSSEIKV 208


>gi|242077446|ref|XP_002448659.1| hypothetical protein SORBIDRAFT_06g030930 [Sorghum bicolor]
 gi|241939842|gb|EES12987.1| hypothetical protein SORBIDRAFT_06g030930 [Sorghum bicolor]
          Length = 252

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 51/79 (64%), Positives = 62/79 (78%)

Query: 386 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 445
           YDL+DLTRQ LAKYAN++FL IIE+Y+ N  + V  L + FL LV D+D LL+ H+GFLL
Sbjct: 51  YDLVDLTRQVLAKYANDVFLKIIESYKSNKMNQVTILCKHFLNLVNDLDTLLSSHEGFLL 110

Query: 446 GPWLESAKQLAQNEEQEKQ 464
           GPWLESAK LA+N EQE Q
Sbjct: 111 GPWLESAKGLARNSEQEIQ 129


>gi|302522684|ref|ZP_07275026.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SPB78]
 gi|302431579|gb|EFL03395.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SPB78]
          Length = 355

 Score =  103 bits (258), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 70/277 (25%), Positives = 129/277 (46%), Gaps = 24/277 (8%)

Query: 196 FYGVPYIWCMLHNFAGNIEM-YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVY 254
           + G PY +  + NF G+  +     D +   P   R    + + G+ +  E  + NP  +
Sbjct: 8   WQGTPYAFGSIWNFGGHTALGANTRDWVDLYP-RWRDRSGSRLSGIALMPEAADNNPAAF 66

Query: 255 DLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVI 314
           +L +E+ +    VD+  W  +Y+  RYG S    + AW++L  TVY       D+  +  
Sbjct: 67  ELFAELPWTEGPVDLTDWFREYARVRYGGSDAHAEAAWDILRTTVYGTRR--DDRWSEPA 124

Query: 315 VAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIAS 374
                  P++ +V+ GK+            L+   +S++             AL+  ++ 
Sbjct: 125 DGLFGARPALDAVSAGKWS--------PKALRYPAASFEP------------ALDELLSV 164

Query: 375 GNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMD 434
             EL  S TYR DL+D+ RQALA  +  L   +  AY+  +     +L RR++ L++ ++
Sbjct: 165 RAELRDSATYRRDLLDVARQALANRSRTLLPRLAAAYKAKNQAEFARLGRRWIALIDLLE 224

Query: 435 GLLACHDGFLLGPWLESAKQLAQNEEQEKQVRCPYVS 471
            L+A  +  LLG W+ESA+    +  ++ Q++   +S
Sbjct: 225 QLVATDENHLLGRWVESARAWGGSAREKSQLQYDALS 261


>gi|328867426|gb|EGG15808.1| alpha-N-acetylglucosaminidase [Dictyostelium fasciculatum]
          Length = 992

 Score =  101 bits (252), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 69/223 (30%), Positives = 109/223 (48%), Gaps = 32/223 (14%)

Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
           M G G++ E IEQN ++YDLM+EMA++    ++  WINQY+ RRYG  VP +  AWN+L 
Sbjct: 219 MKGTGLTPEAIEQNYMMYDLMNEMAWRTTAPNMTEWINQYTQRRYGVFVPELAQAWNLLI 278

Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
            TV+N T G                             YG P S   +      + D   
Sbjct: 279 PTVFNATLGY----------------------------YGPPSSFVGMRPQLNMTND--- 307

Query: 357 LWYSTSEVIRALELFIASGNELSASN-TYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
           L+Y  S V +A +L++   +E   S  T+ +D+ ++T QAL+    +  + + +AY  N 
Sbjct: 308 LYYDPSVVQQAWQLYLGVTDEYVLSTATFSFDVSEITLQALSNLFMDTQMAMYDAYLTNQ 367

Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
           +    + +   L ++ DMD + A     L+G W  +A+Q A N
Sbjct: 368 STVFEERATSCLNIITDMDTIAATQQMLLVGTWTANARQWALN 410


>gi|347541919|ref|YP_004856555.1| alpha-N-acetylglucosaminidase family protein [Candidatus
           Arthromitus sp. SFB-rat-Yit]
 gi|346984954|dbj|BAK80629.1| alpha-N-acetylglucosaminidase family protein [Candidatus
           Arthromitus sp. SFB-rat-Yit]
          Length = 912

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 105/448 (23%), Positives = 193/448 (43%), Gaps = 69/448 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+   GG L   W + +  L   I  R+ E G+ P+   F G  P           + 
Sbjct: 364 MGNISSIGGELTPKWFEDRAKLSIDIQTRMIEFGIEPIHQMFIGYFPYKEN---SGVNVI 420

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
           +   W  +K   R      LD  +     I   + ++Q + +G + + +  D F E  N 
Sbjct: 421 RGSYWSKIKGPDR------LDFNNNDVEFISSVYYKKQKELFGESKY-FAGDLFHEGNNL 473

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
              D  E  + +   +   + +   +++W++Q W  S       P  + + N +     +
Sbjct: 474 YGYDPVELSNKVLKLL---IDNNGENSIWIIQSWSHS-------PSSETIEN-LNRNNTL 522

Query: 179 VLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
           +LDL +++   W             ++F    +I+ +L+NF G   +YG    +     +
Sbjct: 523 ILDLHSQLNTRWKGISKFNNMSWKDREFDRSNWIFGVLNNFGGRSGLYGHTRHLLNQFYD 582

Query: 229 ARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAI 288
           A+ + N  + GV  + EGI  N  + +L++E+ F  +K+D+  ++++Y   RYG+S   +
Sbjct: 583 AKYNSNY-LKGVAHTSEGIGFNNFIDELVTEIIFS-DKLDIDEFVSRYLRNRYGKSDNDL 640

Query: 289 QDAWNVLYHTVYN-----CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
             A+N+L  TVYN       +GA++    VI A P +D               K  SK  
Sbjct: 641 LKAFNILLDTVYNPVINIYHEGASES---VINARPSLDV--------------KSASKWG 683

Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
            +          H  Y++ ++  AL ++ +  NE   S  Y  DLID+  + +   +NE 
Sbjct: 684 SI----------HKNYNSEKLEEALRIYFSKYNEFKDSKGYMTDLIDIASEVIINLSNEY 733

Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELV 430
           + N+ + Y  N     F+L S+RFL ++
Sbjct: 734 YKNLQDYYN-NGEIEFFKLNSQRFLNMI 760


>gi|342731751|ref|YP_004770590.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
           SFB-mouse-Japan]
 gi|342329206|dbj|BAK55848.1| alpha-N-acetylglucosaminidase family protein [Candidatus
           Arthromitus sp. SFB-mouse-Japan]
          Length = 898

 Score = 99.0 bits (245), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 106/459 (23%), Positives = 200/459 (43%), Gaps = 69/459 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+   GG L   W + +  L   I  R+ E+G+ P+   F G  P           + 
Sbjct: 352 MGNISAVGGELTPKWFEDRAKLSIDIQKRMLEVGIEPIHQMFIGYFPYKEN---SGVNVI 408

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W  +K   R      LD  +     I   + E+Q +  G++ + +  D F E  N 
Sbjct: 409 NGGYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANL 461

Query: 119 PPVDSPEYISSLGAAIYSGMQSGD-SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
              D+ E    L   + S +++    D+VW++Q W          P  +++ N +    +
Sbjct: 462 YGYDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNI 509

Query: 178 VVLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           ++LDL +++   W            +K+F    +I+ +L+NF G   +YG  + +     
Sbjct: 510 LILDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFY 569

Query: 228 EARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA 287
           +A+ + +  + G+  + EG+  N  + +L +E+ F  E V++  ++ +Y   RYG+S   
Sbjct: 570 DAKYNSDY-LSGIANTSEGVGFNNFIDELSTELIFSDE-VNMDEFVKRYLKNRYGKSDRD 627

Query: 288 IQDAWNVLYHTVYN-CTD----GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKE 342
           +  A+N+L  TVYN  TD    GA++    VI A P ++ +                   
Sbjct: 628 LLVAFNILLDTVYNPVTDIYHEGASES---VINARPSLEIN------------------- 665

Query: 343 AVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANE 402
                  S +   H  Y + ++ R +E++I+  +E   +  Y  DLID+  + +   A+E
Sbjct: 666 -----SASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASE 720

Query: 403 LFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHD 441
            +  I E Y   +   +  +S++FL L+     +L+ +D
Sbjct: 721 YYQIIQEYYNNGNIKYLQLISKKFLNLILLQANILSYND 759


>gi|417967717|ref|ZP_12608785.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
           SFB-co]
 gi|380340884|gb|EIA29424.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
           SFB-co]
          Length = 741

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 102/455 (22%), Positives = 199/455 (43%), Gaps = 61/455 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+   GG L   W + +  L   I  R+ E+G+ P+   F G  P           + 
Sbjct: 195 MGNISAVGGELTPKWFEDRAKLSIDIQKRMLEVGIEPIHQMFIGYFPYKEN---SGVNVI 251

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W  +K   R      LD  +     I   + E+Q +  G++ + +  D F E  N 
Sbjct: 252 NGGYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANL 304

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDS-DAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
              D+ E    L   + S +++    D+VW++Q W          P  +++ N +    +
Sbjct: 305 YGYDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNI 352

Query: 178 VVLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           ++LDL +++   W            +K+F    +I+ +L+NF G   +YG  + +     
Sbjct: 353 LILDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFY 412

Query: 228 EARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA 287
           +A+ + +  + G+  + EG+  N  + +L +E+ F  E V++  ++ +Y   RYG+S   
Sbjct: 413 DAKYNSDY-LSGIANTSEGVGFNNFIDELSTELIFSDE-VNMDEFVKRYLKNRYGKSDRD 470

Query: 288 IQDAWNVLYHTVYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
           +  A+N+L  TVYN  TD          +       S+I+             ++ ++  
Sbjct: 471 LLVAFNILLDTVYNPVTD----------IYHEGASESVIN-------------ARPSLGI 507

Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
           +  S +   H  Y + ++ R +E++I+  +E   +  Y  DLID+  + +   A+E +  
Sbjct: 508 NSASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASEYYQI 567

Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHD 441
           I E Y   +   +  +S++FL L+     +L+ +D
Sbjct: 568 IQEYYNNGNIKYLQLISKKFLNLILLQANILSYND 602


>gi|417965571|ref|ZP_12607078.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
           SFB-4]
 gi|380336329|gb|EIA26351.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
           SFB-4]
          Length = 685

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 106/459 (23%), Positives = 199/459 (43%), Gaps = 69/459 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+   GG L   W + +  L   I  R+ E+G+ P+   F G  P           + 
Sbjct: 146 MGNISAVGGELTPKWFEDRAKLSIDIQKRMLEVGIEPIHQMFIGYFPYKEN---SGVNVI 202

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W  +K   R      LD  +     I   + E+Q +  G++ + +  D F E  N 
Sbjct: 203 NGGYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANL 255

Query: 119 PPVDSPEYISSLGAAIYSGMQSGDS-DAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
              D+ E    L   + S +++    D+VW++Q W          P  +++ N +    +
Sbjct: 256 YGYDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNI 303

Query: 178 VVLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           ++LDL +++   W            +K+F    +I+ +L+NF G   +YG  + +     
Sbjct: 304 LILDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFY 363

Query: 228 EARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA 287
           +A+ + +  + G+  + EG+  N  + +L +E+ F  E V++  ++ +Y   RYG+S   
Sbjct: 364 DAKYNSDY-LSGIANTSEGVGFNNFIDELSTELIFSDE-VNMDEFVKRYLKNRYGKSDRD 421

Query: 288 IQDAWNVLYHTVYN-CTD----GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKE 342
           +  A+N+L  TVYN  TD    GA++    VI A P +  +                   
Sbjct: 422 LLVAFNILLDTVYNPVTDIYHEGASES---VINARPSLGIN------------------- 459

Query: 343 AVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANE 402
                  S +   H  Y + ++ R +E++I+  +E   +  Y  DLID+  + +   A+E
Sbjct: 460 -----SASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASE 514

Query: 403 LFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHD 441
            +  I E Y   +   +  +S++FL L+     +L+ +D
Sbjct: 515 YYQIIQEYYNNGNIKYLQLISKKFLNLILLQANILSYND 553


>gi|384455191|ref|YP_005667784.1| alpha-N-acetylglucosaminidase family protein [Candidatus
           Arthromitus sp. SFB-mouse-Yit]
 gi|418016862|ref|ZP_12656425.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
           SFB-mouse-NYU]
 gi|418371995|ref|ZP_12964091.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
           SFB-mouse-SU]
 gi|345505596|gb|EGX27892.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
           SFB-mouse-NYU]
 gi|346983532|dbj|BAK79208.1| alpha-N-acetylglucosaminidase family protein [Candidatus
           Arthromitus sp. SFB-mouse-Yit]
 gi|380342872|gb|EIA31299.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
           SFB-mouse-SU]
          Length = 898

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 106/459 (23%), Positives = 199/459 (43%), Gaps = 69/459 (15%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+   GG L   W + +  L   I  R+ E+G+ P+   F G  P           + 
Sbjct: 352 MGNISAVGGELTPKWFEDRAKLSIDIQKRMLEVGIEPIHQMFIGYFPYKEN---SGVNVI 408

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
             G W  +K   R      LD  +     I   + E+Q +  G++ + +  D F E  N 
Sbjct: 409 NGGYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANL 461

Query: 119 PPVDSPEYISSLGAAIYSGMQSGD-SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
              D+ E    L   + S +++    D+VW++Q W          P  +++ N +    +
Sbjct: 462 YGYDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNI 509

Query: 178 VVLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
           ++LDL +++   W            +K+F    +I+ +L+NF G   +YG  + +     
Sbjct: 510 LILDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFY 569

Query: 228 EARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA 287
           +A+ + +  + G+  + EG+  N  + +L +E+ F  E V++  ++ +Y   RYG+S   
Sbjct: 570 DAKYNSDY-LSGIANTSEGVGFNNFIDELSTELIFSDE-VNMDEFVKRYLKNRYGKSDRD 627

Query: 288 IQDAWNVLYHTVYN-CTD----GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKE 342
           +  A+N+L  TVYN  TD    GA++    VI A P +  +                   
Sbjct: 628 LLVAFNILLDTVYNPVTDIYHEGASES---VINARPSLGIN------------------- 665

Query: 343 AVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANE 402
                  S +   H  Y + ++ R +E++I+  +E   +  Y  DLID+  + +   A+E
Sbjct: 666 -----SASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASE 720

Query: 403 LFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHD 441
            +  I E Y   +   +  +S++FL L+     +L+ +D
Sbjct: 721 YYQIIQEYYNNGNIKYLQLISKKFLNLILLQANILSYND 759


>gi|293371911|ref|ZP_06618315.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
 gi|292633157|gb|EFF51734.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
          Length = 289

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 46/98 (46%), Positives = 62/98 (63%), Gaps = 3/98 (3%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M+N+  W GPLP  WL+ Q+ LQKKIL R  EL M PVLPAF+G+VPA L+ ++P A I 
Sbjct: 184 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 243

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQ 98
            LG W       R  C + L+  D LF +I + F+++Q
Sbjct: 244 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQ 278


>gi|326435733|gb|EGD81303.1| alpha-N-acetylglucosaminidase [Salpingoeca sp. ATCC 50818]
          Length = 696

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 45/126 (35%), Positives = 72/126 (57%), Gaps = 2/126 (1%)

Query: 134 IYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST 192
           +Y+ M   D  A+W+ QGW++   D       M    ++VP G+LV+LD+ AE   IW+ 
Sbjct: 500 VYTTMTKRDPHAIWVYQGWIWLDLDNAQGFSFMSGFTSAVPRGRLVILDMEAEFDEIWAW 559

Query: 193 SKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPV 252
           S+ F+   +IW  + NF GN  MYG +  + F       +++  +VGVG++MEGI+QNP 
Sbjct: 560 SQSFFNTTFIWAAMDNFGGNNGMYGDIQ-LVFDRTRRVFAQSDAVVGVGITMEGIDQNPA 618

Query: 253 VYDLMS 258
            Y  ++
Sbjct: 619 YYQAIA 624



 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 47/116 (40%), Positives = 61/116 (52%), Gaps = 18/116 (15%)

Query: 6   GWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNW 65
           G GGPLP  W  QQ  LQ+ I+ R  ELG+  +LPAF GNVPAAL  ++P A I+    W
Sbjct: 246 GVGGPLPSQWYKQQWELQRAIVQRQTELGIGSLLPAFQGNVPAALAQLYPHANISN--GW 303

Query: 66  FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDT-FDENTPP 120
                         LD  DPLF  I    +++ + ++G T H Y  D  FD +T P
Sbjct: 304 --------------LDGLDPLFATIADLTMQELIADFGAT-HFYQADGFFDHSTGP 344


>gi|440799252|gb|ELR20307.1| AlphaN-acetylglucosaminidase, putative, partial [Acanthamoeba
           castellanii str. Neff]
          Length = 389

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 50/132 (37%), Positives = 68/132 (51%), Gaps = 19/132 (14%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAA----------- 49
           M N+ GWGGPL  +W   Q  LQKKI+ R    GM PVLPAF+G VP A           
Sbjct: 261 MGNIQGWGGPLDLAWRLAQAELQKKIVERQRMFGMLPVLPAFAGFVPEASVKFTLGRGGG 320

Query: 50  -----LQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGR 104
                ++ ++P+A +T+  +W       ++   Y L   D L+  IG   I    +E+G 
Sbjct: 321 CGEQGIKRIYPTANLTKSADWAGFPH--QYTNVYFLSPLDSLYKTIGSKVIRLVEEEFG- 377

Query: 105 TSHIYNCDTFDE 116
           T HIYN DTF+E
Sbjct: 378 TDHIYNADTFNE 389


>gi|62318937|dbj|BAD94027.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
          Length = 182

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 39/77 (50%), Positives = 55/77 (71%)

Query: 388 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 447
           ++DLTRQ L+K AN+++   + A+   D   + QLS +FLEL++DMD LLA  D  LLG 
Sbjct: 1   MVDLTRQVLSKLANQVYTEAVTAFVKKDIGSLGQLSEKFLELIKDMDVLLASDDNCLLGT 60

Query: 448 WLESAKQLAQNEEQEKQ 464
           WLESAK+LA+N ++ KQ
Sbjct: 61  WLESAKKLAKNGDERKQ 77


>gi|281423204|ref|ZP_06254117.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
 gi|281402540|gb|EFB33371.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
          Length = 291

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 47/185 (25%), Positives = 80/185 (43%), Gaps = 26/185 (14%)

Query: 280 RYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGK 337
           RYG++ P I+ AW +L  T+YNC  G   +  +  +    P ++   +  +  K +NY  
Sbjct: 4   RYGKTSPEIERAWQLLSETIYNCPAGNNQQGPHESIFCGRPSLNNFQVK-SWSKMRNY-- 60

Query: 338 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 397
                                Y     + A +L     ++   +N + YDL+D+ RQALA
Sbjct: 61  ---------------------YDLQATLEAAQLMTGIADQYKGNNNFEYDLVDICRQALA 99

Query: 398 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 457
                 +L  I  Y         + + RFLE++   D LL     F LG W E+A++L  
Sbjct: 100 DQGRLQYLKTIADYNGFSRKAFAKDAHRFLEMILLQDKLLGTRTEFRLGHWTEAARKLGT 159

Query: 458 NEEQE 462
            ++++
Sbjct: 160 TQQEK 164


>gi|84625359|ref|YP_452731.1| hypothetical protein XOO_3702 [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|84369299|dbj|BAE70457.1| truncated N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae
           MAFF 311018]
          Length = 369

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 33/62 (53%), Positives = 43/62 (69%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+  PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A     P A+I 
Sbjct: 201 MGNIEGYRAPLPQQWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPKAFAQAHPHARIY 260

Query: 61  QL 62
           ++
Sbjct: 261 RM 262


>gi|323456608|gb|EGB12475.1| hypothetical protein AURANDRAFT_20306 [Aureococcus anophagefferens]
          Length = 243

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 36/64 (56%), Positives = 47/64 (73%), Gaps = 1/64 (1%)

Query: 5   HGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT-QLG 63
           HG GGPLP+++ D QL L K+IL R+  LG+ PVLP+F GNVP AL+++FP A IT Q  
Sbjct: 120 HGVGGPLPRTFADAQLALAKRILARMRGLGIVPVLPSFQGNVPPALKDLFPEANITVQAP 179

Query: 64  NWFS 67
           +W S
Sbjct: 180 HWTS 183


>gi|390353486|ref|XP_003728120.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Strongylocentrotus
           purpuratus]
          Length = 385

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 32/50 (64%), Positives = 38/50 (76%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAAL 50
           M N+ GWGGP+PQSW   QL LQ KIL R+ ELGM PVLPAF+G+VP + 
Sbjct: 218 MGNIDGWGGPIPQSWHTNQLALQHKILKRMRELGMIPVLPAFAGHVPKSF 267


>gi|194695302|gb|ACF81735.1| unknown [Zea mays]
          Length = 173

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 29/43 (67%), Positives = 34/43 (79%)

Query: 422 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
           L + FL LV D+D LL+ H+GFLLGPWLESAK LA+N EQE Q
Sbjct: 7   LCQHFLSLVNDLDTLLSSHEGFLLGPWLESAKGLARNSEQEIQ 49


>gi|224135741|ref|XP_002322149.1| predicted protein [Populus trichocarpa]
 gi|222869145|gb|EEF06276.1| predicted protein [Populus trichocarpa]
          Length = 173

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 26/34 (76%), Positives = 29/34 (85%)

Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 271
           VGVGM M+GI+QNPVV DLMS+MAF H KVDVK 
Sbjct: 30  VGVGMPMDGIKQNPVVSDLMSKMAFHHNKVDVKG 63


>gi|147798252|emb|CAN69797.1| hypothetical protein VITISV_036335 [Vitis vinifera]
          Length = 273

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/47 (55%), Positives = 31/47 (65%), Gaps = 3/47 (6%)

Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVD---VKAWINQYSVRR 280
           MVGVG+ MEGIEQNPVVY+ M EMAF  E V    + +  N  + RR
Sbjct: 112 MVGVGVCMEGIEQNPVVYESMFEMAFHSENVQLVVISSTCNTMARRR 158


>gi|293369245|ref|ZP_06615835.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
 gi|292635670|gb|EFF54172.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
          Length = 221

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/104 (24%), Positives = 51/104 (49%), Gaps = 2/104 (1%)

Query: 355 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
           P + Y   +++ A  L ++  +     ++Y +DL+++ RQ L  Y N +      AY+  
Sbjct: 11  PTIEYQPKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAG 68

Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
           D   +     +  E++ D+D L++CH  F L  W+  A+ +  +
Sbjct: 69  DIPMMKNRGNKMREILADLDKLVSCHPTFSLHKWITDARDMGHD 112


>gi|321458423|gb|EFX69492.1| hypothetical protein DAPPUDRAFT_35389 [Daphnia pulex]
          Length = 132

 Score = 46.2 bits (108), Expect = 0.037,   Method: Composition-based stats.
 Identities = 21/77 (27%), Positives = 44/77 (57%)

Query: 388 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 447
           ++DLTRQ++ +  + L+  ++E Y   ++  +  ++ + + L++D+D L+     FLLG 
Sbjct: 1   MVDLTRQSMQEIFHLLYSKLLEVYLEKNSTAIEGIAYKMINLLQDLDELIQTGKTFLLGK 60

Query: 448 WLESAKQLAQNEEQEKQ 464
           W+  AK     E ++ Q
Sbjct: 61  WIADAKSWGTTEGEKLQ 77


>gi|315131339|emb|CBM69278.1| venom protein Ci-120 [Chelonus inanitus]
          Length = 165

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 45/75 (60%), Gaps = 2/75 (2%)

Query: 390 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPW 448
           D+TRQ+L   A  ++L + +++   D   VF+     L +L  D++ +L+ +  FL+G W
Sbjct: 1   DVTRQSLQLIAEHVYLKLQQSFHQKDL-AVFKAHANLLMQLFSDLESILSTNKHFLVGKW 59

Query: 449 LESAKQLAQNEEQEK 463
           +++A+ L  N +++K
Sbjct: 60  IKNARSLGTNVQEQK 74


>gi|322792283|gb|EFZ16267.1| hypothetical protein SINV_02225 [Solenopsis invicta]
          Length = 87

 Score = 42.7 bits (99), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 26/38 (68%)

Query: 426 FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
            LEL +D++ +LA    FLLG WL  AK++A NEE+ +
Sbjct: 1   LLELFDDLESILASGSNFLLGTWLTQAKEMADNEEERR 38


>gi|417963477|ref|ZP_12605420.1| Alpha-N-acetylglucosaminidase family protein, partial [Candidatus
           Arthromitus sp. SFB-3]
 gi|380332800|gb|EIA23523.1| Alpha-N-acetylglucosaminidase family protein, partial [Candidatus
           Arthromitus sp. SFB-3]
          Length = 146

 Score = 38.5 bits (88), Expect = 7.3,   Method: Composition-based stats.
 Identities = 23/95 (24%), Positives = 48/95 (50%)

Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
           +  S +   H  Y + ++ R +E++I+  +E   +  Y  DLID+  + +   A+E +  
Sbjct: 30  NSASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASEYYQI 89

Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHD 441
           I E Y   +   +  +S++FL L+     +L+ +D
Sbjct: 90  IQEYYNNGNIKYLQLISKKFLNLILLQANILSYND 124


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.135    0.422 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,736,492,678
Number of Sequences: 23463169
Number of extensions: 325306966
Number of successful extensions: 708383
Number of sequences better than 100.0: 504
Number of HSP's better than 100.0 without gapping: 502
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 705589
Number of HSP's gapped (non-prelim): 774
length of query: 472
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 326
effective length of database: 8,933,572,693
effective search space: 2912344697918
effective search space used: 2912344697918
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)