BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 009062
         (545 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255540793|ref|XP_002511461.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
 gi|223550576|gb|EEF52063.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
          Length = 809

 Score =  935 bits (2417), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 438/539 (81%), Positives = 491/539 (91%), Gaps = 1/539 (0%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSGNVPAAL+N+FPSAKI +LGNWFSVKSD RWCCTYLLDATDPLFIEIGRAFIEQ
Sbjct: 271 VLPAFSGNVPAALRNIFPSAKIARLGNWFSVKSDLRWCCTYLLDATDPLFIEIGRAFIEQ 330

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           QL+EYG TSHIYNCDTFDENTPPVD P+YIS+LGAA++ GMQSGD+DAVWLMQGWLFSYD
Sbjct: 331 QLEEYGSTSHIYNCDTFDENTPPVDDPKYISALGAAVFKGMQSGDNDAVWLMQGWLFSYD 390

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
           PFWRPPQMKALL+SVP+G+LVVLDLFAEVKPIW++S QFYGVPYIWCMLHNFAGN+EMYG
Sbjct: 391 PFWRPPQMKALLHSVPVGRLVVLDLFAEVKPIWTSSYQFYGVPYIWCMLHNFAGNVEMYG 450

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
           ILDSIA GPVEARTSEN+TMVGVGMSMEGIEQNPVVYDLMSEMAFQH+KVDVKAWIN YS
Sbjct: 451 ILDSIASGPVEARTSENSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHKKVDVKAWINLYS 510

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
            RRYGRSVP+IQDAW++LYHTVYNCTDGA DKNRDVIVAFPDV+P   SV++ ++   GK
Sbjct: 511 TRRYGRSVPSIQDAWDILYHTVYNCTDGAYDKNRDVIVAFPDVNPFYFSVSQKRHHLNGK 570

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
           PVS+ AVLK  + SYDHPHLWYSTSEV+ ALELFI SG ELS S+TY YDL+DLTRQALA
Sbjct: 571 PVSRRAVLKENSDSYDHPHLWYSTSEVLHALELFITSGEELSGSSTYSYDLVDLTRQALA 630

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
           KY NELFL IIE+YQ ND +GV   S++FL+LVEDMD LL CH+GFLLGPWLESAKQLAQ
Sbjct: 631 KYGNELFLKIIESYQANDGNGVASRSQKFLDLVEDMDTLLGCHEGFLLGPWLESAKQLAQ 690

Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
           ++EQEKQ+EWNARTQITMWFDNT++EASLL DYGNKYWSGLL+DYYGPRAAIYFKY+I+S
Sbjct: 691 DQEQEKQFEWNARTQITMWFDNTEDEASLLHDYGNKYWSGLLQDYYGPRAAIYFKYLIKS 750

Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTG-VFDH 545
           LE+G  F LKDWRREWIKLTN+WQ  RN +PV+SNG+ALI S+WLY+KYL+     +DH
Sbjct: 751 LENGKVFPLKDWRREWIKLTNEWQRSRNKFPVKSNGNALIISKWLYDKYLRNPDTTYDH 809


>gi|225457148|ref|XP_002280399.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Vitis vinifera]
          Length = 813

 Score =  927 bits (2395), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 437/541 (80%), Positives = 484/541 (89%), Gaps = 4/541 (0%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSGNVPAAL+ +FPSAKIT+LGNWF+V  +PRWCCTYLLDATDPLFIEIG+AFI+Q
Sbjct: 272 VLPAFSGNVPAALKYIFPSAKITRLGNWFTVGGNPRWCCTYLLDATDPLFIEIGKAFIQQ 331

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           QLKEYGRT HIYNCDTFDENTPPVD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYD
Sbjct: 332 QLKEYGRTGHIYNCDTFDENTPPVDDPEYISSLGAAIFRGMQSGDSNAIWLMQGWLFSYD 391

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
           PFWRPPQMKALL+SVP+G+LVVLDLFAEVKPIW TS+QFYGVPYIWCMLHNFAGNIEMYG
Sbjct: 392 PFWRPPQMKALLHSVPMGRLVVLDLFAEVKPIWITSEQFYGVPYIWCMLHNFAGNIEMYG 451

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
           ILD++A GPVEARTSEN+TMVGVGMSMEGIEQNPVVYDLMSEMAFQH KVDVK WI  YS
Sbjct: 452 ILDAVASGPVEARTSENSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHSKVDVKVWIALYS 511

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE----GKYQ 303
            RRYG+SVP IQDAWN+LYHTVYNCTDG+ DKNRDVIVAFPD+DPS I   +    G Y 
Sbjct: 512 TRRYGKSVPEIQDAWNILYHTVYNCTDGSYDKNRDVIVAFPDIDPSFIPTPKLSMPGGYH 571

Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 363
            YGK VS+  VLK  T+S++ PHLWYSTSEV  AL LFIASG +L  SNTYRYDL+DLTR
Sbjct: 572 RYGKSVSRRTVLKEITNSFEQPHLWYSTSEVKDALGLFIASGGQLLGSNTYRYDLVDLTR 631

Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
           QALAKYAN+LFL +IEAYQLND  G    S++FLELVEDMD LLACHDGFLLGPWLESAK
Sbjct: 632 QALAKYANQLFLEVIEAYQLNDVRGAACHSQKFLELVEDMDTLLACHDGFLLGPWLESAK 691

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
           QLAQ+E+QE Q+EWNARTQITMWFDNT++EASLLRDYGNKYWSGLLRDYYGPRAAIYFKY
Sbjct: 692 QLAQDEQQEIQFEWNARTQITMWFDNTEDEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 751

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 543
           ++ESLE+G+ F LKDWRREWIKLTNDWQN RN YPV S+G+A+ TS+ LYNKYLQ   ++
Sbjct: 752 LLESLETGNEFALKDWRREWIKLTNDWQNSRNAYPVRSSGNAIDTSRRLYNKYLQDPEIY 811

Query: 544 D 544
           D
Sbjct: 812 D 812


>gi|224121634|ref|XP_002318632.1| predicted protein [Populus trichocarpa]
 gi|222859305|gb|EEE96852.1| predicted protein [Populus trichocarpa]
          Length = 812

 Score =  916 bits (2368), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 425/538 (78%), Positives = 482/538 (89%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSGNVPAAL+N+FPSAKIT+LGNWFSV+SD RWCCTYLLDATDPLFIEIGRAFIEQ
Sbjct: 275 VLPAFSGNVPAALRNIFPSAKITRLGNWFSVRSDVRWCCTYLLDATDPLFIEIGRAFIEQ 334

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           QL EYG TSHIYNCDTFDENTPPVD PEYISSLG +I+ GMQSGDS+AVWLMQGWLFSYD
Sbjct: 335 QLTEYGSTSHIYNCDTFDENTPPVDDPEYISSLGGSIFEGMQSGDSNAVWLMQGWLFSYD 394

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
           PFWRPPQ KALL+SVP+G+LVVLDLFAEVKPIW+TS+QFYGVPYIWCMLHNFAGN+EMYG
Sbjct: 395 PFWRPPQTKALLHSVPIGRLVVLDLFAEVKPIWNTSEQFYGVPYIWCMLHNFAGNLEMYG 454

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
            LDS+A GPVEARTSEN+TMVGVGMSMEGIEQNPVVYDLMSEMAFQ  KVDVK WI+ YS
Sbjct: 455 YLDSVASGPVEARTSENSTMVGVGMSMEGIEQNPVVYDLMSEMAFQKNKVDVKEWIDLYS 514

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
            RRYGRSVP IQ+AWN+LYHTVYNCTDGA DKNRDVIVAFPDV+P+++S+ +G++    K
Sbjct: 515 ARRYGRSVPTIQNAWNILYHTVYNCTDGAYDKNRDVIVAFPDVNPNLVSMLQGRHHTDVK 574

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
            VS+ A L   T SY+HPHLWYST+EV+RALELFIA G+ELS S+TY YDL+DLTRQ LA
Sbjct: 575 LVSRRAALIKNTDSYEHPHLWYSTTEVVRALELFIAGGDELSGSSTYSYDLVDLTRQVLA 634

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
           KYANELFL +IEAY+L D+HGV   S+ FL+LVED+D LLACH+GFLLGPWLESAKQLAQ
Sbjct: 635 KYANELFLKVIEAYRLKDSHGVAHQSQMFLDLVEDIDTLLACHEGFLLGPWLESAKQLAQ 694

Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
           +EEQ+ Q+EWNARTQITMW+DNT+ EASLLRDYGNKYWSGLL+DYYGPRAAIYF ++ +S
Sbjct: 695 DEEQQIQFEWNARTQITMWYDNTEVEASLLRDYGNKYWSGLLKDYYGPRAAIYFNFLTQS 754

Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFDH 545
           LE+G GF+LK WRREWIKLTN WQ  R ++PVESNG+AL  S+WLY+KYL     +DH
Sbjct: 755 LENGHGFQLKAWRREWIKLTNKWQKSRKIFPVESNGNALNISRWLYHKYLGNPDTYDH 812


>gi|356519003|ref|XP_003528164.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Glycine max]
          Length = 812

 Score =  908 bits (2347), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 424/530 (80%), Positives = 479/530 (90%), Gaps = 1/530 (0%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSGNVPAAL+++FPSAKIT+LGNWFSVK+D +WCCTYLLDATD LF+EIG+AFIE+
Sbjct: 270 VLPAFSGNVPAALKHIFPSAKITRLGNWFSVKNDLKWCCTYLLDATDSLFVEIGKAFIEK 329

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           QL+EYGRTSHIYNCDTFDENTPPVD PEYISSLGAA + GMQSGD DAVWLMQGWLFSYD
Sbjct: 330 QLQEYGRTSHIYNCDTFDENTPPVDDPEYISSLGAATFKGMQSGDDDAVWLMQGWLFSYD 389

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
           PFWRPPQMKALL+SVP+GKLVVLDLFAEVKPIW TS+QFYGVPYIWCMLHNFAGNIEMYG
Sbjct: 390 PFWRPPQMKALLHSVPVGKLVVLDLFAEVKPIWVTSEQFYGVPYIWCMLHNFAGNIEMYG 449

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
           ILD+IA GP++ARTS N+TMVGVGMSMEGIEQNP+VYDLMSEMAFQH+KVDVKAW++ YS
Sbjct: 450 ILDAIASGPIDARTSNNSTMVGVGMSMEGIEQNPIVYDLMSEMAFQHKKVDVKAWVDMYS 509

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
            RRYG+++P IQ+ WNVLYHT+YNCTDGA DKNRDVIVAFPDVDPS+ISV   +  +  K
Sbjct: 510 TRRYGQTLPLIQEGWNVLYHTIYNCTDGAYDKNRDVIVAFPDVDPSLISVQHEQSHHNDK 569

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
           P S   ++K  T S+D PHLWY TSEVI ALELFI SG+ELS  NTYRYDL+DLTRQ LA
Sbjct: 570 PYSG-TIIKEITDSFDRPHLWYPTSEVIYALELFITSGDELSRCNTYRYDLVDLTRQVLA 628

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
           KYANELF  +IEAYQ +D HG+  LS+RFL+LVED+D LLACHDGFLLGPWLESAKQLA 
Sbjct: 629 KYANELFFKVIEAYQSHDIHGMTLLSQRFLDLVEDLDTLLACHDGFLLGPWLESAKQLAL 688

Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
           NEEQE+Q+EWNARTQITMWFDN+ EEASLLRDYGNKYW+GLL DYYGPRAAIYFKY+ ES
Sbjct: 689 NEEQERQFEWNARTQITMWFDNSDEEASLLRDYGNKYWNGLLHDYYGPRAAIYFKYLRES 748

Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
           LESG+ F+L+ WRREWIKLTN+WQ  RN++PVES+GDAL TS+WL+NKYL
Sbjct: 749 LESGEDFKLRGWRREWIKLTNEWQKRRNIFPVESSGDALNTSRWLFNKYL 798


>gi|297733843|emb|CBI15090.3| unnamed protein product [Vitis vinifera]
          Length = 846

 Score =  890 bits (2299), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 428/574 (74%), Positives = 477/574 (83%), Gaps = 37/574 (6%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSGNVPAAL+ +FPSAKIT+LGNWF+V  +PRWCCTYLLDATDPLFIEIG+AFI+Q
Sbjct: 272 VLPAFSGNVPAALKYIFPSAKITRLGNWFTVGGNPRWCCTYLLDATDPLFIEIGKAFIQQ 331

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           QLKEYGRT HIYNCDTFDENTPPVD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYD
Sbjct: 332 QLKEYGRTGHIYNCDTFDENTPPVDDPEYISSLGAAIFRGMQSGDSNAIWLMQGWLFSYD 391

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------- 173
           PFWRPPQMKALL+SVP+G+LVVLDLFAEVKPIW TS+QFYGVPYIW              
Sbjct: 392 PFWRPPQMKALLHSVPMGRLVVLDLFAEVKPIWITSEQFYGVPYIWKVTKSGRQQSLKFT 451

Query: 174 ------------------CMLHNFAGNIEMYGILDSIAFGPVEARTS-ENTTMVGVGMSM 214
                             CMLHNFAGNIEMYGILD++A GP+  R     + +VGVGMSM
Sbjct: 452 NEKCCSFFRSHSPDSEVLCMLHNFAGNIEMYGILDAVASGPILLRAKYAESAVVGVGMSM 511

Query: 215 EGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTD 274
           EGIEQNPVVYDLMSEMAFQH KVDVK WI  YS RRYG+SVP IQDAWN+LYHTVYNCTD
Sbjct: 512 EGIEQNPVVYDLMSEMAFQHSKVDVKVWIALYSTRRYGKSVPEIQDAWNILYHTVYNCTD 571

Query: 275 GATDKNRDVIVAFPDVDPSIISVTE----GKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 330
           G+ DKNRDVIVAFPD+DPS I   +    G Y  YGK VS+  VLK  T+S++ PHLWYS
Sbjct: 572 GSYDKNRDVIVAFPDIDPSFIPTPKLSMPGGYHRYGKSVSRRTVLKEITNSFEQPHLWYS 631

Query: 331 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 390
           TSEV  AL LFIASG +L  SNTYRYDL+DLTRQALAKYAN+LFL +IEAYQLND  G  
Sbjct: 632 TSEVKDALGLFIASGGQLLGSNTYRYDLVDLTRQALAKYANQLFLEVIEAYQLNDVRGAA 691

Query: 391 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 450
             S++FLELVEDMD LLACHDGFLLGPWLESAKQLAQ+E+QE Q+EWNARTQITMWFDNT
Sbjct: 692 CHSQKFLELVEDMDTLLACHDGFLLGPWLESAKQLAQDEQQEIQFEWNARTQITMWFDNT 751

Query: 451 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 510
           ++EASLLRDYGNKYWSGLLRDYYGPRAAIYFKY++ESLE+G+ F LKDWRREWIKLTNDW
Sbjct: 752 EDEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYLLESLETGNEFALKDWRREWIKLTNDW 811

Query: 511 QNGRNVYPVESNGDALITSQWLYNKYLQGTGVFD 544
           QN RN YPV S+G+A+ TS+ LYNKYLQ   ++D
Sbjct: 812 QNSRNAYPVRSSGNAIDTSRRLYNKYLQDPEIYD 845


>gi|449441031|ref|XP_004138287.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis sativus]
          Length = 808

 Score =  873 bits (2255), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 409/531 (77%), Positives = 468/531 (88%), Gaps = 2/531 (0%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSGN+PAA + ++P+AKIT+LGNWF+V SDPRWCCTYLLDA DPLF+EIG+AFIEQ
Sbjct: 273 VLPAFSGNIPAAFKQIYPAAKITRLGNWFTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQ 332

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q KEYGRTSH+YNCDTFDENTPPVD  EYISSLG+AI+ GMQ+GDS+AVWLMQGW+FSYD
Sbjct: 333 QQKEYGRTSHVYNCDTFDENTPPVDDVEYISSLGSAIFGGMQAGDSNAVWLMQGWMFSYD 392

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
           PFWRP QMKALL+SVPLG+LVVLDL+AEVKPIW +S+QFYG+PYIWCMLHNFAGN+EMYG
Sbjct: 393 PFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWISSEQFYGIPYIWCMLHNFAGNVEMYG 452

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
           ILDSIA GP+EAR+S  +TMVGVGMSMEGIEQNPVVYDLMSEMAFQH KVDVK W+ QYS
Sbjct: 453 ILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYS 512

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
           VRRYG  VP+IQDAW+VLYHTVYNCTDGA DKNRDVIVAFPDVDPS I V       +G 
Sbjct: 513 VRRYGHLVPSIQDAWDVLYHTVYNCTDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGN 572

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
             S  +V + + +++D PHLWY TSEVI AL+LFIA G++LS+SNTYRYDL+DLTRQALA
Sbjct: 573 LDS--SVDRLQDATFDRPHLWYPTSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALA 630

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
           KY+NELF  I++AYQL+D   +  LS+ FLELV D+D LLACH+GFLLGPWL+SAKQLA+
Sbjct: 631 KYSNELFFRIVKAYQLHDVQTMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAR 690

Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
           +EE+EKQYEWNARTQITMWFDNT+EEASLLRDYGNKYWSGLL DYY PRAAIY K++ ES
Sbjct: 691 SEEEEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKES 750

Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
            E+G  F L +WRREWIKLTNDWQ+ R +YPVESNGDAL TS WLYNKYLQ
Sbjct: 751 SENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALDTSHWLYNKYLQ 801


>gi|222629680|gb|EEE61812.1| hypothetical protein OsJ_16433 [Oryza sativa Japonica Group]
          Length = 1129

 Score =  795 bits (2054), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/538 (67%), Positives = 443/538 (82%), Gaps = 6/538 (1%)

Query: 6    FAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
            F VLPAFSGN+PAAL++ FPSAK+T LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FI
Sbjct: 572  FPVLPAFSGNIPAALRSKFPSAKVTHLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFI 631

Query: 66   EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
            E+Q++EYG TSH+Y+CDTFDENTPP+  P YISSLGAA + GMQSGD DA+WLMQGWLFS
Sbjct: 632  EEQIREYGGTSHVYSCDTFDENTPPLSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFS 691

Query: 126  YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
            YDPFW PPQMKALL+SVP+G+++VLDL+AEVKPIW  S QFYGVPYIWCMLHNFA + EM
Sbjct: 692  YDPFWEPPQMKALLHSVPVGRMIVLDLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEM 751

Query: 186  YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
            YG+LD +A GP++AR S N+TMVGVGMSMEGIEQNP+VYDLMSEMAF H +VD++ W+  
Sbjct: 752  YGVLDMVASGPIDARLSANSTMVGVGMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVET 811

Query: 246  YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY--- 302
            Y  RRYG+S+  +QDAW +LY T+YNCTDG  DKNRDVIVAFPDV+P +I  T G Y   
Sbjct: 812  YPTRRYGKSMVGLQDAWKILYQTLYNCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSS 870

Query: 303  -QNYGKPVSKEAV-LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
             + Y   +SK  + + +    Y+HPHLWY T  VIRALELF+  G+E+S SNT+RYDL+D
Sbjct: 871  SKTYSTKLSKNYIAVDASNDEYEHPHLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVD 930

Query: 361  LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
            LTRQ LAKYAN++F+ IIE+Y+ N+ + V  L + F++LV D+D LLA H+GFLLGPWLE
Sbjct: 931  LTRQTLAKYANQVFVKIIESYKSNNVNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLE 990

Query: 421  SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
            SAK LA+++EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIY
Sbjct: 991  SAKGLARDKEQEMQYEWNARTQITMWFDNTKTKASLLRDYANKYWSGLLRDYYGPRAAIY 1050

Query: 481  FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
            FKY+I S+E  + F L++WRREWI LTN+WQ+   V+P  + GDAL  S+ LY KYL 
Sbjct: 1051 FKYLILSMEKKEPFALEEWRREWISLTNNWQSDWKVFPTTATGDALNISRTLYKKYLH 1108


>gi|326515664|dbj|BAK07078.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 829

 Score =  794 bits (2050), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/536 (67%), Positives = 447/536 (83%), Gaps = 6/536 (1%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSGN+PAAL+  FPSAK+T LGNWF+V S+PRWCCTYLLDA+DPL++EIG+ FIE+
Sbjct: 276 VLPAFSGNIPAALKLKFPSAKVTHLGNWFTVDSNPRWCCTYLLDASDPLYVEIGKLFIEE 335

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q++EYGRTSH+YNCDTFDENTPP+  P YISSLGAA +  MQSGD+DA+WLMQGWLF+YD
Sbjct: 336 QIREYGRTSHVYNCDTFDENTPPLSDPNYISSLGAATFRAMQSGDNDAIWLMQGWLFTYD 395

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
           PFW PPQMKALL+SVP+G+++VLDL+AEVKP+W  S QFYGVPYIWCMLHNFA + EMYG
Sbjct: 396 PFWEPPQMKALLHSVPVGRMIVLDLYAEVKPVWINSDQFYGVPYIWCMLHNFAADFEMYG 455

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
           +LD++A GP++AR SEN+TMVGVGMSMEGIEQNP+VYDLMSEM F H +VD+K W+  Y 
Sbjct: 456 VLDAVASGPIDARLSENSTMVGVGMSMEGIEQNPIVYDLMSEMVFHHRQVDLKVWVETYP 515

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----Q 303
            RRYG+SV  +QDAW +L+ T+YNCTDG  DKNRDVIVAFPDV+PS+I  T G Y    +
Sbjct: 516 TRRYGKSVVGLQDAWRILHQTLYNCTDGKNDKNRDVIVAFPDVEPSVIQ-TPGLYARTSK 574

Query: 304 NYGKPVSKEAVLK-SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 362
           NY   +S+  V+K +   +Y+ PH+WY T  VI ALELF+ SG+E+S S+T+RYDL+DLT
Sbjct: 575 NYSTMLSENYVMKDAPNDAYEQPHIWYDTIAVIHALELFLESGDEVSDSSTFRYDLVDLT 634

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
           RQALAKYAN++FL II+ Y+ N+ + V  L  RFL LV+D+D LLA H+GFLLGPWLESA
Sbjct: 635 RQALAKYANQIFLKIIQGYKSNNVNQVTTLCERFLNLVKDLDMLLASHEGFLLGPWLESA 694

Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           K LA+++EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIYFK
Sbjct: 695 KGLARSQEQEIQYEWNARTQITMWFDNTETKASLLRDYANKYWSGLLRDYYGPRAAIYFK 754

Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
           ++I SL+  + F L++WRREWI LTN+WQ+ R V+   + GDAL  S+ L+ KYL+
Sbjct: 755 HLISSLKKKEPFALEEWRREWISLTNNWQSDRKVFATTATGDALNISRALFTKYLR 810


>gi|326519955|dbj|BAK03902.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 829

 Score =  792 bits (2046), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/536 (67%), Positives = 446/536 (83%), Gaps = 6/536 (1%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSGN+PAAL+  FPSAK+T LGNWF+V S+PRWCCTYLLDA+DPL++EIG+ FIE+
Sbjct: 276 VLPAFSGNIPAALKLKFPSAKVTHLGNWFTVDSNPRWCCTYLLDASDPLYVEIGKLFIEE 335

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q++EYGRTSH+YNCDTFDENTPP+  P YISSLGAA +  MQSGD+DA+WLMQGWLF+YD
Sbjct: 336 QIREYGRTSHVYNCDTFDENTPPLSDPNYISSLGAATFRAMQSGDNDAIWLMQGWLFTYD 395

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
           PFW PPQMKALL+SVP+G+++VLDL+AEVKP W  S QFYGVPYIWCMLHNFA + EMYG
Sbjct: 396 PFWEPPQMKALLHSVPVGRMIVLDLYAEVKPAWINSDQFYGVPYIWCMLHNFAADFEMYG 455

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
           +LD++A GP++AR SEN+TMVGVGMSMEGIEQNP+VYDLMSEM F H +VD+K W+  Y 
Sbjct: 456 VLDAVASGPIDARLSENSTMVGVGMSMEGIEQNPIVYDLMSEMVFHHRQVDLKVWVETYP 515

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----Q 303
            RRYG+SV  +QDAW +L+ T+YNCTDG  DKNRDVIVAFPDV+PS+I  T G Y    +
Sbjct: 516 TRRYGKSVVGLQDAWRILHQTLYNCTDGKNDKNRDVIVAFPDVEPSVIQ-TPGLYARTSK 574

Query: 304 NYGKPVSKEAVLK-SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 362
           NY   +S+  V+K +   +Y+ PH+WY T  VI ALELF+ SG+E+S S+T+RYDL+DLT
Sbjct: 575 NYSTMLSENYVMKDAPNDAYEQPHIWYDTIAVIHALELFLESGDEVSDSSTFRYDLVDLT 634

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
           RQALAKYAN++FL II+ Y+ N+ + V  L  RFL LV+D+D LLA H+GFLLGPWLESA
Sbjct: 635 RQALAKYANQIFLKIIQGYKSNNVNQVTTLCERFLNLVKDLDMLLASHEGFLLGPWLESA 694

Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           K LA+++EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIYFK
Sbjct: 695 KGLARSQEQEIQYEWNARTQITMWFDNTETKASLLRDYANKYWSGLLRDYYGPRAAIYFK 754

Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
           ++I SL+  + F L++WRREWI LTN+WQ+ R V+   + GDAL  S+ L+ KYL+
Sbjct: 755 HLISSLKKKEPFALEEWRREWISLTNNWQSDRKVFATTATGDALNISRALFTKYLR 810


>gi|414585092|tpg|DAA35663.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
          Length = 831

 Score =  791 bits (2043), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 366/537 (68%), Positives = 444/537 (82%), Gaps = 6/537 (1%)

Query: 6   FAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
           F VLPAFSGN+PAAL++ FPSAK+T LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FI
Sbjct: 275 FPVLPAFSGNIPAALKSKFPSAKVTHLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFI 334

Query: 66  EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           E+Q++EYGRTSHIYNCDTFDENTPP+  P YISSLGAA + GMQSGD+DA+WLMQGWLF+
Sbjct: 335 EEQIREYGRTSHIYNCDTFDENTPPLSDPNYISSLGAATFRGMQSGDNDAIWLMQGWLFT 394

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
           YDPFW PPQMKALL+SVP+GK++VLDL+AEVKP+W  S Q YGVPYIWCMLHNFA + EM
Sbjct: 395 YDPFWEPPQMKALLHSVPVGKMIVLDLYAEVKPVWINSDQLYGVPYIWCMLHNFAADFEM 454

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG+LD++A GP++AR S+N+TMVGVGMSMEGIEQNP+VYDLMSEMAF H +VD++ W+  
Sbjct: 455 YGVLDALASGPIDARLSDNSTMVGVGMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVKT 514

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN- 304
           Y  RRYG+ V  +QDAW +LY T+YNCTDG  DKNRDVIVAFPDV+P +I+ T G + N 
Sbjct: 515 YPTRRYGKPVKGLQDAWWILYRTLYNCTDGKNDKNRDVIVAFPDVEPFVIA-TPGLHVNT 573

Query: 305 ---YGKPVSKEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
              Y    SK  + K  +S +Y+HPHLWY T+ VI ALELF+  G+E+S SNT+RYDL+D
Sbjct: 574 RQMYSTVPSKNYIRKDVSSDAYEHPHLWYDTNAVIHALELFLQHGDEVSDSNTFRYDLVD 633

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           LTRQ LAKYAN++FL IIE+Y+ N+ + V  L + FL LV D+D LL+ H+GFLLGPWLE
Sbjct: 634 LTRQVLAKYANDVFLKIIESYKSNNMNQVTILCQHFLSLVNDLDTLLSSHEGFLLGPWLE 693

Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
           SAK LA+N EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWSGLL+DYYGPRAAIY
Sbjct: 694 SAKGLARNSEQEIQYEWNARTQITMWFDNTETKASLLRDYANKYWSGLLQDYYGPRAAIY 753

Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
           FK+++ S+E+   F LK+WRREWI LTN+WQ+ R V+   + GD L  SQ LY KYL
Sbjct: 754 FKHLLLSMENNAPFALKEWRREWISLTNNWQSDRKVFSTTATGDPLNISQSLYTKYL 810


>gi|218195716|gb|EEC78143.1| hypothetical protein OsI_17702 [Oryza sativa Indica Group]
          Length = 829

 Score =  791 bits (2043), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/538 (67%), Positives = 443/538 (82%), Gaps = 6/538 (1%)

Query: 6   FAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
           F VLPAFSGN+PAAL++ FPSAK+T LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FI
Sbjct: 272 FPVLPAFSGNIPAALRSKFPSAKVTHLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFI 331

Query: 66  EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           E+Q++EYG TSH+Y+CDTFDENTPP+  P YISSLGAA + GMQSGD DA+WLMQGWLFS
Sbjct: 332 EEQIREYGGTSHVYSCDTFDENTPPLSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFS 391

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
           YDPFW PPQMKALL+SVP+G+++VLDL+AEVKPIW  S QFYGVPYIWCMLHNFA + EM
Sbjct: 392 YDPFWEPPQMKALLHSVPVGRMIVLDLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEM 451

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG+LD +A GP++AR S N+TM+GVGMSMEGIEQNP+VYDLMSEMAF H +VD++ W+  
Sbjct: 452 YGVLDMVASGPIDARLSANSTMIGVGMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVET 511

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY--- 302
           Y  RRYG+S+  +QDAW +LY T+YNCTDG  DKNRDVIVAFPDV+P +I  T G Y   
Sbjct: 512 YPTRRYGKSIVGLQDAWKILYQTLYNCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSS 570

Query: 303 -QNYGKPVSKEAV-LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
            + Y   +SK  + + +    Y+HPHLWY T  VIRALELF+  G+E+S SNT+RYDL+D
Sbjct: 571 SKTYSTKLSKNYIAVDASNDEYEHPHLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVD 630

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           LTRQ LAKYAN++F+ IIE+Y+ N+ + V  L + F++LV D+D LLA H+GFLLGPWLE
Sbjct: 631 LTRQTLAKYANQVFVKIIESYKSNNVNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLE 690

Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
           SAK LA+++EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIY
Sbjct: 691 SAKGLARDKEQEMQYEWNARTQITMWFDNTKTKASLLRDYANKYWSGLLRDYYGPRAAIY 750

Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
           FKY+I S+E  + F L++WRREWI LTN+WQ+   V+P  + GDAL  S+ LY KYL 
Sbjct: 751 FKYLILSMEKKEPFALEEWRREWISLTNNWQSDWKVFPTTATGDALNISRTLYKKYLH 808


>gi|38345908|emb|CAE04506.2| OSJNBb0059K02.16 [Oryza sativa Japonica Group]
          Length = 829

 Score =  790 bits (2041), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/538 (67%), Positives = 443/538 (82%), Gaps = 6/538 (1%)

Query: 6   FAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
           F VLPAFSGN+PAAL++ FPSAK+T LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FI
Sbjct: 272 FPVLPAFSGNIPAALRSKFPSAKVTHLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFI 331

Query: 66  EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           E+Q++EYG TSH+Y+CDTFDENTPP+  P YISSLGAA + GMQSGD DA+WLMQGWLFS
Sbjct: 332 EEQIREYGGTSHVYSCDTFDENTPPLSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFS 391

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
           YDPFW PPQMKALL+SVP+G+++VLDL+AEVKPIW  S QFYGVPYIWCMLHNFA + EM
Sbjct: 392 YDPFWEPPQMKALLHSVPVGRMIVLDLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEM 451

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG+LD +A GP++AR S N+TMVGVGMSMEGIEQNP+VYDLMSEMAF H +VD++ W+  
Sbjct: 452 YGVLDMVASGPIDARLSANSTMVGVGMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVET 511

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY--- 302
           Y  RRYG+S+  +QDAW +LY T+YNCTDG  DKNRDVIVAFPDV+P +I  T G Y   
Sbjct: 512 YPTRRYGKSMVGLQDAWKILYQTLYNCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSS 570

Query: 303 -QNYGKPVSKEAV-LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
            + Y   +SK  + + +    Y+HPHLWY T  VIRALELF+  G+E+S SNT+RYDL+D
Sbjct: 571 SKTYSTKLSKNYIAVDASNDEYEHPHLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVD 630

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           LTRQ LAKYAN++F+ IIE+Y+ N+ + V  L + F++LV D+D LLA H+GFLLGPWLE
Sbjct: 631 LTRQTLAKYANQVFVKIIESYKSNNVNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLE 690

Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
           SAK LA+++EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIY
Sbjct: 691 SAKGLARDKEQEMQYEWNARTQITMWFDNTKTKASLLRDYANKYWSGLLRDYYGPRAAIY 750

Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
           FKY+I S+E  + F L++WRREWI LTN+WQ+   V+P  + GDAL  S+ LY KYL 
Sbjct: 751 FKYLILSMEKKEPFALEEWRREWISLTNNWQSDWKVFPTTATGDALNISRTLYKKYLH 808


>gi|357166414|ref|XP_003580702.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Brachypodium
           distachyon]
          Length = 829

 Score =  784 bits (2025), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/536 (67%), Positives = 447/536 (83%), Gaps = 6/536 (1%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSG++PAAL++ FPSAK+T LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+
Sbjct: 275 VLPAFSGSIPAALKSKFPSAKVTHLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEE 334

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q++EYGRTSH+YNCDTFDENTPP+  P YISSLGAA + GMQSGD DA+WLMQGWLF+YD
Sbjct: 335 QIREYGRTSHVYNCDTFDENTPPLSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFTYD 394

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
           PFW PPQMKALL+SVP+G+++VLDL+AEVKP+W  S QFYGVPYIWCMLHNFA + EMYG
Sbjct: 395 PFWEPPQMKALLHSVPVGRMIVLDLYAEVKPVWINSDQFYGVPYIWCMLHNFAADFEMYG 454

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
           +LD++A GP++AR SEN+TMVGVGMSMEGIEQNP+VYDLMSEM F H +VD++ W+  Y 
Sbjct: 455 VLDAVASGPIDARLSENSTMVGVGMSMEGIEQNPIVYDLMSEMVFHHRQVDLQVWVETYP 514

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
            RRYG+S+  +QDAW +L+ T+YNCTDG  DKNRDVIVAFPDV+P +I  T G + +  K
Sbjct: 515 TRRYGKSIVELQDAWRILHQTLYNCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLHTSASK 573

Query: 308 PVS----KEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 362
             S    K  ++K E++ +Y+ PHLWY T+ VIRAL+LF+  G+E+S S+T+RYDL+DLT
Sbjct: 574 MFSTMSAKSYLVKDESNDAYEQPHLWYDTNVVIRALQLFLQYGDEVSDSSTFRYDLVDLT 633

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
           RQALAKYAN++F  II++Y+ N+ + V  LS  FL+LV D+D LLA H+GFLLGPWLESA
Sbjct: 634 RQALAKYANQIFAKIIQSYKSNNMNQVTTLSECFLDLVNDLDMLLASHEGFLLGPWLESA 693

Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           K LA+++EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWSGLL DYYGPRAAIYFK
Sbjct: 694 KGLARDQEQEIQYEWNARTQITMWFDNTETKASLLRDYANKYWSGLLGDYYGPRAAIYFK 753

Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
           Y+I SLE  + F L++WRREWI LTN+WQ+ R V+   + GDAL  ++ LY KYL+
Sbjct: 754 YLILSLEKKEPFALEEWRREWISLTNNWQSDRKVFATAATGDALNIARSLYMKYLR 809


>gi|218192858|gb|EEC75285.1| hypothetical protein OsI_11626 [Oryza sativa Indica Group]
          Length = 812

 Score =  701 bits (1810), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/538 (60%), Positives = 412/538 (76%), Gaps = 2/538 (0%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    VLP+FSGNVP+  + +FPSA IT+LG+W +V  DPRWCCTYLLD +D LFI++G+
Sbjct: 275 LGMVPVLPSFSGNVPSVFKKLFPSANITKLGDWNTVDGDPRWCCTYLLDPSDALFIDVGQ 334

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
           AFI QQ+KEYG  ++IYNCDTF+ENTPP + P YISSLG+AIY  M  G+ DAVWLMQGW
Sbjct: 335 AFIRQQMKEYGDITNIYNCDTFNENTPPTNEPAYISSLGSAIYEAMSRGNKDAVWLMQGW 394

Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           LF  D  FW+ PQMKALL+SVP GK++VLDLFA+VKPIW  S QFYGVPYIWCMLHNF G
Sbjct: 395 LFYSDAAFWKEPQMKALLHSVPTGKMIVLDLFADVKPIWQMSSQFYGVPYIWCMLHNFGG 454

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
           NIEMYGILDSIA GP++ARTS N+TMVGVGM MEGIE NPVVY+LMSEMAF+ +KV+V+ 
Sbjct: 455 NIEMYGILDSIASGPIDARTSHNSTMVGVGMCMEGIEHNPVVYELMSEMAFRSQKVEVED 514

Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           W+  YS RRYG+S   ++ AW +LYHT+YNCTDG  D N+D IV FPD+ P+  S    K
Sbjct: 515 WLKIYSYRRYGQSNVEVEKAWGILYHTIYNCTDGIADHNKDYIVQFPDISPNSFSSDVSK 574

Query: 302 YQNYGKPVSKEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
            +   +       + SE S S  HPHLWYST E I+ALELF+ +GN+LS S TYRYDL+D
Sbjct: 575 RKAISEVKKHRRFVLSEVSASLPHPHLWYSTKEAIKALELFLNAGNDLSKSLTYRYDLVD 634

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           LTRQ+L+K ANE++L+ + AY+  D++G+   +++FLEL+ D+D LLA  D FLLGPWLE
Sbjct: 635 LTRQSLSKLANEVYLDAMNAYRKKDSNGLNFYTKKFLELIVDIDTLLASDDNFLLGPWLE 694

Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
            AK LA+ E + KQYEWNARTQ+TMW+DNT+ E S L DY NK+WSGLL+ YY PRA+ Y
Sbjct: 695 DAKSLARTENERKQYEWNARTQVTMWYDNTKTEQSKLHDYANKFWSGLLKSYYLPRASKY 754

Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
           F  + + L+    F+L++WR++WI  +N+WQ+G+ +Y V++ GDAL  S  L+ KY +
Sbjct: 755 FSRLTKGLQENQSFQLEEWRKDWIAYSNEWQSGKELYAVKATGDALAISSSLFKKYFR 812


>gi|413955691|gb|AFW88340.1| hypothetical protein ZEAMMB73_315381 [Zea mays]
          Length = 814

 Score =  698 bits (1801), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 322/538 (59%), Positives = 410/538 (76%), Gaps = 2/538 (0%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    VLP+FSGNVPA    +FPSA IT+LG+W +V ++P+WCCTYLLD +D LFI++G+
Sbjct: 276 LGMVPVLPSFSGNVPAIFAKLFPSANITRLGDWNTVDANPKWCCTYLLDPSDSLFIDVGQ 335

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
           AFI QQ+KEYG  ++IYNCDTF+ENTPP D P YISSLG+AIY  M  G+ +AVWLMQGW
Sbjct: 336 AFIRQQIKEYGDVTNIYNCDTFNENTPPTDEPAYISSLGSAIYEAMSRGNKNAVWLMQGW 395

Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           LF  D  FW+ PQMKALL+SVP+GK++VLDLFA+VKPIW  S QFYGVPYIWCMLHNF G
Sbjct: 396 LFYSDAAFWKEPQMKALLHSVPIGKMIVLDLFADVKPIWKVSSQFYGVPYIWCMLHNFGG 455

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
           NIEMYGILDSI+ GP++ARTS N+TM+GVGM MEGIE NPVVY+LMSEMAF ++KV+V+ 
Sbjct: 456 NIEMYGILDSISSGPIDARTSYNSTMIGVGMCMEGIEHNPVVYELMSEMAFHNKKVEVED 515

Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           W+  YS RRYG++   I+ AW  LYHT+YNCTDG  D N+D IV FPD+ PS ++    K
Sbjct: 516 WLKTYSCRRYGQANADIEKAWRYLYHTIYNCTDGIADHNKDYIVEFPDISPSSVTYQVSK 575

Query: 302 YQNYGKPVSKEAVLKSETSS-YDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
            +      +      SE S     PHLWYST E ++ALELF+ +G+  S S TYRYDL+D
Sbjct: 576 RRGMSITRNHRRFFLSEVSGILPQPHLWYSTKEAVKALELFLDAGSTFSESLTYRYDLVD 635

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           LTRQ L+K ANE++L+ I  YQ  D+HG+   +R+FLE++ D+D LLA  D FLLGPWLE
Sbjct: 636 LTRQCLSKLANEVYLDAISLYQKKDSHGLNAHARKFLEIIVDIDTLLAADDNFLLGPWLE 695

Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
           SAK LA  E++ +QYEWNARTQ+TMW+DNT+ E S L DY NK+WSGLL+ YY PRA+ Y
Sbjct: 696 SAKSLAITEKERQQYEWNARTQVTMWYDNTETEQSKLHDYANKFWSGLLKSYYLPRASKY 755

Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
           F Y+  SL+    F+L++WR++WI  +N+WQ+G+ VY V++ GDAL  ++ LY KYL+
Sbjct: 756 FAYLTRSLQENRSFQLEEWRKDWISYSNEWQSGKEVYAVKATGDALAIARSLYRKYLR 813


>gi|222624949|gb|EEE59081.1| hypothetical protein OsJ_10898 [Oryza sativa Japonica Group]
          Length = 812

 Score =  696 bits (1796), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 323/537 (60%), Positives = 409/537 (76%), Gaps = 2/537 (0%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    VLP+FSGNVP+  + +FPSA IT+LG+W +V  DPRWCCTYLLD +D LFI++G+
Sbjct: 275 LGMVPVLPSFSGNVPSVFKKLFPSANITKLGDWNTVDGDPRWCCTYLLDPSDALFIDVGQ 334

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
           AFI QQ+KEYG  ++IYNCDTF+ENTPP + P YISSLG+AIY  M  G+ DAVWLMQGW
Sbjct: 335 AFIRQQMKEYGDITNIYNCDTFNENTPPTNEPAYISSLGSAIYEAMSRGNKDAVWLMQGW 394

Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           LF  D  FW+ PQMKALL+SVP GK++VLDLFA+VKPIW  S QFYGVPYIWCMLHNF G
Sbjct: 395 LFYSDAAFWKEPQMKALLHSVPTGKMIVLDLFADVKPIWQMSSQFYGVPYIWCMLHNFGG 454

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
           NIEMYGILDSIA GP++ARTS N+TMVGVGM MEGIE NPVVY+LMSEMAF+ +KV+V+ 
Sbjct: 455 NIEMYGILDSIASGPIDARTSHNSTMVGVGMCMEGIEHNPVVYELMSEMAFRSQKVEVED 514

Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           W+  YS RRYG+S   ++ AW +LYHT+YNCTDG  D N D IV FPD+ P+  S    K
Sbjct: 515 WLKIYSYRRYGQSNVEVEKAWGILYHTIYNCTDGIADHNNDYIVEFPDISPNSFSSDVSK 574

Query: 302 YQNYGKPVSKEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
            +   +       + SE S S  HPHLWYST E I+ALELF+ +GN+LS S TYRYDL+D
Sbjct: 575 RKAISEVKKHRRFVLSEVSASLPHPHLWYSTKEAIKALELFLNAGNDLSKSLTYRYDLVD 634

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           LTRQ+L+K ANE++L+ + AY+  D++G+   +++FLEL+ D+D LLA  D FLLGPWLE
Sbjct: 635 LTRQSLSKLANEVYLDAMNAYRKKDSNGLNFYTKKFLELIVDIDTLLASDDNFLLGPWLE 694

Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
            AK LA+ E + KQYEWNARTQ+TMW+DNT+ E S L DY NK+WSGLL+ YY PRA+ Y
Sbjct: 695 DAKSLARTENERKQYEWNARTQVTMWYDNTKTEQSKLHDYANKFWSGLLKSYYLPRASKY 754

Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
           F  + + L+    F+L++W ++WI  +N+WQ+G+ +Y V++ GDAL  S  L+ KY 
Sbjct: 755 FSRLTKGLQENQSFQLEEWTKDWIAYSNEWQSGKELYAVKATGDALAISSSLFKKYF 811


>gi|357112065|ref|XP_003557830.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Brachypodium
           distachyon]
          Length = 809

 Score =  692 bits (1786), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/539 (60%), Positives = 419/539 (77%), Gaps = 6/539 (1%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    VLP+FSGNVP A + +FPSA IT+LG W +V  DPRWCCTY+LD +D LFI++G 
Sbjct: 272 LGMVPVLPSFSGNVPVAFKKLFPSANITRLGEWNTVDGDPRWCCTYILDPSDALFIDVGH 331

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
           AFI QQ+KEYG  + IYNCDTF+ENTPP + P YISSLG+AIY  M SG+ DAVWLMQGW
Sbjct: 332 AFIRQQIKEYGDITSIYNCDTFNENTPPTNEPAYISSLGSAIYEAMSSGNKDAVWLMQGW 391

Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           LF  D  FW+ PQMKALL+SVP+GK++VLDLFA+VKP+W  S QFYGVPYIWCMLHNF G
Sbjct: 392 LFYSDAAFWKEPQMKALLHSVPIGKMIVLDLFADVKPVWKMSSQFYGVPYIWCMLHNFGG 451

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
           NIEMYGILDSI+ GP++ARTS  +TMVGVGM+MEGIE NPVV++LMSEM+F+ +KV+V+ 
Sbjct: 452 NIEMYGILDSISSGPIDARTSYGSTMVGVGMTMEGIEHNPVVFELMSEMSFRSQKVEVED 511

Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           W+  YS RRYG+S   I+ AW VLYHT+YNCTDG  D NRD IV FPD+ PS  S    K
Sbjct: 512 WLKSYSYRRYGQSNVKIEKAWGVLYHTIYNCTDGIADHNRDYIVEFPDMSPSSFSSHFSK 571

Query: 302 YQNYGKPVSKEA---VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 358
            +  G P+ ++     L   +++  HPHLWYST+E ++ALELF+ +GN+LS S T+RYDL
Sbjct: 572 QR--GMPIVRKHPRFFLSEVSANLPHPHLWYSTNEAVKALELFLNAGNDLSKSLTFRYDL 629

Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
           +DLTRQ+L+K AN+++L+ +++Y+  ++ G+   +++FLEL+ D+D LLA  D FLLGPW
Sbjct: 630 VDLTRQSLSKLANKVYLDAMDSYKNKNSSGLNFHTKKFLELIVDIDILLASDDNFLLGPW 689

Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
           LESAK LA +EE+ KQYEWNARTQ+TMW+DNT+ E S L DY NK+WSGLL++YY PRA+
Sbjct: 690 LESAKSLAMSEEERKQYEWNARTQVTMWYDNTKTEQSHLHDYANKFWSGLLKNYYLPRAS 749

Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
            YF  +  SL+    F+L++WRR+WI  +N+WQ+G  +YPV++ GDAL  S+ L+ KYL
Sbjct: 750 KYFTGLSRSLQENRSFQLEEWRRDWISYSNEWQSGEELYPVKAKGDALAISKSLFRKYL 808


>gi|356534602|ref|XP_003535842.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Glycine max]
          Length = 807

 Score =  687 bits (1774), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 322/536 (60%), Positives = 397/536 (74%), Gaps = 6/536 (1%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP+FSGNVPAAL  +FPSAKIT+LG+W +V  DPRWCCTYLLD +DPLF+EIG AFI +
Sbjct: 267 VLPSFSGNVPAALTKIFPSAKITRLGDWNTVDGDPRWCCTYLLDPSDPLFVEIGEAFIRK 326

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q+KEYG  + IYNCDTF+EN+PP + PEYIS+LGAA+Y G+  GD DAVWLMQGWLF  D
Sbjct: 327 QIKEYGDVTDIYNCDTFNENSPPTNDPEYISNLGAAVYKGISKGDKDAVWLMQGWLFYSD 386

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW+PPQMKALL+SVP GK++VLDLFA+VKPIW  S QFYG PYIWCMLHNF GNIEMY
Sbjct: 387 SSFWKPPQMKALLHSVPFGKMIVLDLFADVKPIWKNSFQFYGTPYIWCMLHNFGGNIEMY 446

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G LDSI+ GPV+AR S N+TMVGVGM MEGIEQNP+VY+LMSEMAF+ +KV V  WI  Y
Sbjct: 447 GTLDSISSGPVDARVSANSTMVGVGMCMEGIEQNPIVYELMSEMAFRDKKVKVSEWIKSY 506

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
             RRYG+ +  ++ AW +LYHT+YNCTDG  D N D IV FPD +PS  SVT G   N  
Sbjct: 507 CHRRYGKVIHQVESAWEILYHTIYNCTDGIADHNHDFIVMFPDWNPSTNSVT-GTSNNQK 565

Query: 307 ----KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 362
                P ++  + +   S     HLWY + +VI+AL+LF+A G  L+ S TYRYDL+DLT
Sbjct: 566 IYLLPPGNRRYLFQETLSDMPQAHLWYPSDDVIKALQLFLAGGKNLAGSLTYRYDLVDLT 625

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
           RQ L+K AN+++   + +YQ  +   +   S +FL+L++D+D LLA  D FLLG WLESA
Sbjct: 626 RQVLSKLANQVYHKAVTSYQKKNIEALQFHSNKFLQLIKDIDVLLASDDNFLLGTWLESA 685

Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           K+LA N  + KQYEWNARTQ+TMWFD  +   S L DY NK+WSGLL  YY PRA+ YF 
Sbjct: 686 KKLAVNPSEIKQYEWNARTQVTMWFDTNETTQSKLHDYANKFWSGLLESYYLPRASTYFS 745

Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
           ++ ESL   D F+L +WR++WI  +N WQ G  +YPV++ GDAL  SQ LY KY Q
Sbjct: 746 HLTESLRQNDKFKLIEWRKQWISQSNKWQEGNELYPVKAKGDALTISQALYEKYFQ 801


>gi|297736304|emb|CBI24942.3| unnamed protein product [Vitis vinifera]
          Length = 868

 Score =  686 bits (1770), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/538 (60%), Positives = 404/538 (75%), Gaps = 9/538 (1%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP+FSGNVP AL+ +FPSA IT+LG W +V ++ RWCCTYLLDA+DPLFI+IG+AFI Q
Sbjct: 331 VLPSFSGNVPEALKKIFPSANITRLGEWNTVDNNTRWCCTYLLDASDPLFIQIGKAFIRQ 390

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q+KEYG  + IYNCDTF+EN+PP + P YISSLGAAIY  M  GD D+VWLMQGWLF  D
Sbjct: 391 QIKEYGDVTDIYNCDTFNENSPPTNDPAYISSLGAAIYKAMSQGDKDSVWLMQGWLFYSD 450

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW+PPQMKALL+SVP GK+VVLDLFA+ KPIW TS QFYG PYIWCMLHNF GNIEMY
Sbjct: 451 SGFWKPPQMKALLHSVPFGKMVVLDLFADAKPIWRTSSQFYGTPYIWCMLHNFGGNIEMY 510

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           GILD+++ GPV+AR S+N+TMVGVGM MEGIEQNPV Y+LMSEMAF+ EKV +  W+  Y
Sbjct: 511 GILDAVSSGPVDARISKNSTMVGVGMCMEGIEQNPVAYELMSEMAFRSEKVQLVEWLKTY 570

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI-----ISVTEGK 301
           S RRYG++V  ++ AW +LY T+YNCTDG  D N D +V FPD DPS+     IS  +  
Sbjct: 571 SYRRYGKAVHHVEAAWEILYRTIYNCTDGIADHNTDFMVNFPDWDPSLNPSSDISKEQHI 630

Query: 302 YQNYGKPVSKEAVLKSETSSYDHP--HLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
            Q       +  +L  ETSS D P  HLWYST EV+ AL LF+ +GNELS S+TYRYDL+
Sbjct: 631 IQKILTQTGRRKILFQETSS-DLPQSHLWYSTHEVVNALRLFLDAGNELSKSSTYRYDLV 689

Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
           DLTRQ L+K  N+++L+ + A++  DA      S++F++LV+D+D LLA  D FLLG WL
Sbjct: 690 DLTRQVLSKLGNQVYLDAVIAFRQKDAKNFHLHSQKFVQLVKDIDTLLASDDNFLLGTWL 749

Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
           ESAK+LA N  + +QYEWNARTQ+TMWF  T+   S L DY NK+WSGLL +YY PRA++
Sbjct: 750 ESAKKLAVNPREMEQYEWNARTQLTMWFYVTKTNQSKLHDYANKFWSGLLENYYLPRASM 809

Query: 480 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
           YF Y+ ++L     F+L++WRREWI  +N WQ G+ +YPV + GD L  S+ LY KY 
Sbjct: 810 YFSYLAKALTENKNFKLEEWRREWISYSNKWQAGKELYPVRAKGDTLAISRALYEKYF 867


>gi|225450036|ref|XP_002273084.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Vitis vinifera]
          Length = 803

 Score =  685 bits (1767), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/538 (60%), Positives = 404/538 (75%), Gaps = 9/538 (1%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP+FSGNVP AL+ +FPSA IT+LG W +V ++ RWCCTYLLDA+DPLFI+IG+AFI Q
Sbjct: 266 VLPSFSGNVPEALKKIFPSANITRLGEWNTVDNNTRWCCTYLLDASDPLFIQIGKAFIRQ 325

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q+KEYG  + IYNCDTF+EN+PP + P YISSLGAAIY  M  GD D+VWLMQGWLF  D
Sbjct: 326 QIKEYGDVTDIYNCDTFNENSPPTNDPAYISSLGAAIYKAMSQGDKDSVWLMQGWLFYSD 385

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW+PPQMKALL+SVP GK+VVLDLFA+ KPIW TS QFYG PYIWCMLHNF GNIEMY
Sbjct: 386 SGFWKPPQMKALLHSVPFGKMVVLDLFADAKPIWRTSSQFYGTPYIWCMLHNFGGNIEMY 445

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           GILD+++ GPV+AR S+N+TMVGVGM MEGIEQNPV Y+LMSEMAF+ EKV +  W+  Y
Sbjct: 446 GILDAVSSGPVDARISKNSTMVGVGMCMEGIEQNPVAYELMSEMAFRSEKVQLVEWLKTY 505

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI-----ISVTEGK 301
           S RRYG++V  ++ AW +LY T+YNCTDG  D N D +V FPD DPS+     IS  +  
Sbjct: 506 SYRRYGKAVHHVEAAWEILYRTIYNCTDGIADHNTDFMVNFPDWDPSLNPSSDISKEQHI 565

Query: 302 YQNYGKPVSKEAVLKSETSSYDHP--HLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
            Q       +  +L  ETSS D P  HLWYST EV+ AL LF+ +GNELS S+TYRYDL+
Sbjct: 566 IQKILTQTGRRKILFQETSS-DLPQSHLWYSTHEVVNALRLFLDAGNELSKSSTYRYDLV 624

Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
           DLTRQ L+K  N+++L+ + A++  DA      S++F++LV+D+D LLA  D FLLG WL
Sbjct: 625 DLTRQVLSKLGNQVYLDAVIAFRQKDAKNFHLHSQKFVQLVKDIDTLLASDDNFLLGTWL 684

Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
           ESAK+LA N  + +QYEWNARTQ+TMWF  T+   S L DY NK+WSGLL +YY PRA++
Sbjct: 685 ESAKKLAVNPREMEQYEWNARTQLTMWFYVTKTNQSKLHDYANKFWSGLLENYYLPRASM 744

Query: 480 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
           YF Y+ ++L     F+L++WRREWI  +N WQ G+ +YPV + GD L  S+ LY KY 
Sbjct: 745 YFSYLAKALTENKNFKLEEWRREWISYSNKWQAGKELYPVRAKGDTLAISRALYEKYF 802


>gi|224106113|ref|XP_002314048.1| predicted protein [Populus trichocarpa]
 gi|222850456|gb|EEE88003.1| predicted protein [Populus trichocarpa]
          Length = 806

 Score =  671 bits (1732), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 311/536 (58%), Positives = 399/536 (74%), Gaps = 6/536 (1%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP+FSGNVPAAL+ +FPSA IT+LG+W +V  +PRWCCTYLL+ +DPLF+EIG AFI Q
Sbjct: 270 VLPSFSGNVPAALKKIFPSANITRLGDWNTVDKNPRWCCTYLLNPSDPLFVEIGEAFIRQ 329

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q+KEYG  + IYNCDTF+EN+PP   P YISSLGAA+Y  M  GD DAVWLMQGWLF  D
Sbjct: 330 QVKEYGDVTDIYNCDTFNENSPPTSDPAYISSLGAAVYKAMSRGDKDAVWLMQGWLFYSD 389

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW+PPQM+ALL+SVP GK++VLDLFAE KPIW  S QFYG PY+WC+LHNF GNIEMY
Sbjct: 390 SAFWKPPQMQALLHSVPFGKMIVLDLFAEAKPIWKNSSQFYGTPYVWCLLHNFGGNIEMY 449

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           GILD+I+ GPV+AR  EN+TMVGVGM MEGIE NPVVY+LMSEMAF+  K  V  W+  Y
Sbjct: 450 GILDAISSGPVDARIIENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSGKPQVLEWLKTY 509

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           S RRYG++V  +  AW++LYHT+YNCTDG  D N D IV FPD DPS+ S +    Q+  
Sbjct: 510 SRRRYGKAVRQVVAAWDILYHTIYNCTDGIADHNTDFIVKFPDWDPSLHSGSNISEQDNM 569

Query: 307 KPV-----SKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
           + +     ++  + +  +S +   HLWYST EVI+AL LF+ +GN+L+ S TYRYDL+DL
Sbjct: 570 RILLTSSGTRRFLFQETSSDFPEAHLWYSTQEVIQALWLFLDAGNDLAGSPTYRYDLVDL 629

Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
           TRQ L+K AN+++ + + A++  DA  +    ++FL++++D+D LLA  D FLLG WLES
Sbjct: 630 TRQVLSKLANQVYRDAMIAFRRKDARALNLHGQKFLQIIKDIDVLLASDDNFLLGTWLES 689

Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
           AK+LA +    K YEWNARTQ+TMW+D T+   S L DY NK+WSGLL DYY PRA+ YF
Sbjct: 690 AKKLAVDPNDMKLYEWNARTQVTMWYDTTKTNQSQLHDYANKFWSGLLEDYYLPRASTYF 749

Query: 482 KYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
            ++++SLE    F+L +WR+EWI  +N WQ    +YPV++ GDAL  ++ LY KY 
Sbjct: 750 GHLMKSLEENKNFKLTEWRKEWIAFSNKWQADTKIYPVKAKGDALAIAKALYRKYF 805


>gi|297807393|ref|XP_002871580.1| alpha-N-acetylglucosaminidase family [Arabidopsis lyrata subsp.
           lyrata]
 gi|297317417|gb|EFH47839.1| alpha-N-acetylglucosaminidase family [Arabidopsis lyrata subsp.
           lyrata]
          Length = 806

 Score =  670 bits (1728), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 323/539 (59%), Positives = 404/539 (74%), Gaps = 10/539 (1%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP+FSGNVP+AL+ ++P A IT+L NW +V  D RWCCTYLL+ +DPLFI+IG AFI+Q
Sbjct: 268 VLPSFSGNVPSALRKIYPGANITRLDNWNTVDGDSRWCCTYLLNPSDPLFIDIGEAFIKQ 327

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q +EYG  ++IYNCDTF+ENTPP   PEYISSLGAA+Y  M  G+ +AVWLMQGWLFS D
Sbjct: 328 QPEEYGEITNIYNCDTFNENTPPTSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSSD 387

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW+PPQMK LL+SVP GK++VLDL+AEVKPIW+TS QFYG PYIWCMLHNF GNIEMY
Sbjct: 388 SKFWKPPQMKVLLHSVPFGKMIVLDLYAEVKPIWNTSAQFYGTPYIWCMLHNFGGNIEMY 447

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G LDSI+ GPV+AR S+N+TMVGVGM MEGIEQNPVVY+L+SEMAF+ EKVDV+ W+  Y
Sbjct: 448 GALDSISSGPVDARVSKNSTMVGVGMCMEGIEQNPVVYELISEMAFRDEKVDVQKWLKSY 507

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN-- 304
           + RRY +    I+ AW +LYHTVYNCTDG  D N D IV  PD DPS     E K+ +  
Sbjct: 508 ARRRYMKENHQIEAAWEILYHTVYNCTDGIADHNTDFIVKLPDWDPSSSVQDESKHTDSY 567

Query: 305 ---YGKPVSKEAVLKSETSSYDHP--HLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
               G   +K  VL  + SS D P  HLWYST EVI+AL+LF+ +G+ELS S TYRYD++
Sbjct: 568 MISTGPYETKRRVLFQDKSS-DLPKAHLWYSTKEVIQALKLFLEAGDELSRSLTYRYDMV 626

Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
           DLTRQ L+K AN++++  + A+   D   + QLS +FLEL++D+D LLA  D FLLG WL
Sbjct: 627 DLTRQVLSKLANQVYIEAVTAFVKKDIGSLGQLSEKFLELIKDIDVLLASDDNFLLGTWL 686

Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
           ESAK+LA+N ++ KQYEWNARTQ+TMW+D+     S L DY NK WSGLL DYY PRA +
Sbjct: 687 ESAKKLARNGDERKQYEWNARTQVTMWYDSKDVNQSKLHDYANKLWSGLLEDYYLPRARL 746

Query: 480 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRN-VYPVESNGDALITSQWLYNKYL 537
           YF  M++SL     F+++ W+REWI +++ WQ   + VYPV++ GDAL  S+ L  KY 
Sbjct: 747 YFNEMLKSLRDKKKFKVEKWQREWIMMSHKWQQSSSEVYPVKAKGDALAISKHLLLKYF 805


>gi|15240689|ref|NP_196873.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
 gi|9758035|dbj|BAB08696.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
 gi|19423948|gb|AAL87291.1| putative alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
 gi|21436231|gb|AAM51254.1| putative alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
 gi|332004545|gb|AED91928.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
          Length = 806

 Score =  667 bits (1721), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 318/539 (58%), Positives = 401/539 (74%), Gaps = 10/539 (1%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP+FSGNVP+AL+ ++P A IT+L NW +V  D RWCCTYLL+ +DPLFIEIG AFI+Q
Sbjct: 268 VLPSFSGNVPSALRKIYPEANITRLDNWNTVDGDSRWCCTYLLNPSDPLFIEIGEAFIKQ 327

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q +EYG  ++IYNCDTF+ENTPP   PEYISSLGAA+Y  M  G+ +AVWLMQGWLFS D
Sbjct: 328 QTEEYGEITNIYNCDTFNENTPPTSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSSD 387

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW+PPQ+KALL+SVP GK++VLDL+AEVKPIW+ S QFYG PYIWCMLHNF GNIEMY
Sbjct: 388 SKFWKPPQLKALLHSVPFGKMIVLDLYAEVKPIWNKSAQFYGTPYIWCMLHNFGGNIEMY 447

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G LDSI+ GPV+AR S+N+TMVGVGM MEGIEQNPVVY+L SEMAF+ EKVDV+ W+  Y
Sbjct: 448 GALDSISSGPVDARVSKNSTMVGVGMCMEGIEQNPVVYELTSEMAFRDEKVDVQKWLKSY 507

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + RRY +    I+ AW +LYHTVYNCTDG  D N D IV  PD DPS  SV +   Q   
Sbjct: 508 ARRRYMKENHQIEAAWEILYHTVYNCTDGIADHNTDFIVKLPDWDPS-SSVQDDLKQKDS 566

Query: 307 KPVS-------KEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
             +S       +  + + +T+     HLWYST EVI+AL+LF+ +G++LS S TYRYD++
Sbjct: 567 YMISTGPYETKRRVLFQDKTADLPKAHLWYSTKEVIQALKLFLEAGDDLSRSLTYRYDMV 626

Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
           DLTRQ L+K AN+++   + A+   D   + QLS +FLEL++DMD LLA  D  LLG WL
Sbjct: 627 DLTRQVLSKLANQVYTEAVTAFVKKDIGSLGQLSEKFLELIKDMDVLLASDDNCLLGTWL 686

Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
           ESAK+LA+N ++ KQYEWNARTQ+TMW+D+     S L DY NK+WSGLL DYY PRA +
Sbjct: 687 ESAKKLAKNGDERKQYEWNARTQVTMWYDSNDVNQSKLHDYANKFWSGLLEDYYLPRARL 746

Query: 480 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRN-VYPVESNGDALITSQWLYNKYL 537
           YF  M++SL     F+++ WRREWI +++ WQ   + VYPV++ GDAL  S+ L +KY 
Sbjct: 747 YFNEMLKSLRDKKIFKVEKWRREWIMMSHKWQQSSSEVYPVKAKGDALAISRHLLSKYF 805


>gi|357458267|ref|XP_003599414.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
 gi|355488462|gb|AES69665.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
          Length = 832

 Score =  665 bits (1715), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 316/562 (56%), Positives = 396/562 (70%), Gaps = 33/562 (5%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP+FSGNVPAAL  +FPSAKIT+LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +
Sbjct: 271 VLPSFSGNVPAALTKIFPSAKITRLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRK 330

Query: 68  QLK--------------------------EYGRTSHIYNCDTFDENTPPVDSPEYISSLG 101
           Q+K                          EYG  + IYNCDTF+EN+PP   P YIS+LG
Sbjct: 331 QIKATETIHQESEDLGSLIIMDRAVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLG 390

Query: 102 AAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIW 160
           AA+Y G+  GD DAVWLMQGWLF  D  FW+PPQMKALL SVP GK++VLDLFA+VKPIW
Sbjct: 391 AAVYQGISKGDKDAVWLMQGWLFYSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIW 450

Query: 161 STSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQN 220
            TS QFYG PYIWCMLHNF GNIEMYG+LD+IA GPV+AR SEN+TMVGVGM MEGIE N
Sbjct: 451 KTSFQFYGTPYIWCMLHNFGGNIEMYGVLDAIASGPVDARVSENSTMVGVGMCMEGIEHN 510

Query: 221 PVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKN 280
           P+VY+LMSEMAF+ EKV +  W+  YS RRYG+++  +  AW +LYHT+YN TDG  D N
Sbjct: 511 PIVYELMSEMAFRDEKVKINEWLKSYSHRRYGKAIHEVDAAWEILYHTIYNSTDGIADHN 570

Query: 281 RDVIVAFPDVDPSIISVTEGKYQNYGK-----PVSKEAVLKSETSSYDHPHLWYSTSEVI 335
            D IV  PD DPS  +V  G   +  K     P ++  + +   +     HLWY   +VI
Sbjct: 571 HDYIVMLPDWDPS-AAVKSGMSNHQKKIYFLPPGNRRYLFQQTPAGMPQAHLWYPPEDVI 629

Query: 336 RALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRR 395
           +AL+LF+A G  L  S TYRYDL+DLTRQ L+K+AN++++  I ++Q  +   +   S  
Sbjct: 630 KALQLFLAGGKNLKGSLTYRYDLVDLTRQVLSKFANQVYIKAITSFQKKNIDALQLNSHM 689

Query: 396 FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEAS 455
           FLEL++D+D LLA  D FLLG WL+SAK+LA N  + KQYEWNARTQ+TMWFD  +   S
Sbjct: 690 FLELIKDIDLLLASDDNFLLGTWLQSAKKLAVNPSELKQYEWNARTQVTMWFDTNETTQS 749

Query: 456 LLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRN 515
            L DY NK+WSG+L +YY PRA+ YF ++ ESL+  + F L +WR+EWI ++N WQ G  
Sbjct: 750 KLHDYANKFWSGILENYYLPRASTYFSHLSESLKQNEKFNLTEWRKEWIPMSNKWQEGSE 809

Query: 516 VYPVESNGDALITSQWLYNKYL 537
           +YPV++ GDAL  SQ LY KY 
Sbjct: 810 LYPVKAKGDALTISQALYKKYF 831


>gi|357458271|ref|XP_003599416.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
 gi|355488464|gb|AES69667.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
          Length = 807

 Score =  660 bits (1702), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/557 (56%), Positives = 388/557 (69%), Gaps = 48/557 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP+FSGNVPAAL  +FPSAKIT+LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +
Sbjct: 271 VLPSFSGNVPAALTKIFPSAKITRLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRK 330

Query: 68  QLK--------------------------EYGRTSHIYNCDTFDENTPPVDSPEYISSLG 101
           Q+K                          EYG  + IYNCDTF+EN+PP   P YIS+LG
Sbjct: 331 QIKATETIHQESEDLGSLIIMDRAVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLG 390

Query: 102 AAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIW 160
           AA+Y G+  GD DAVWLMQGWLF  D  FW+PPQMKALL SVP GK++VLDLFA+VKPIW
Sbjct: 391 AAVYQGISKGDKDAVWLMQGWLFYSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIW 450

Query: 161 STSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQN 220
            TS QFYG PYIWCMLHNF GNIEMYG+LD+IA GPV+AR SEN+TMVGVGM MEGIE N
Sbjct: 451 KTSFQFYGTPYIWCMLHNFGGNIEMYGVLDAIASGPVDARVSENSTMVGVGMCMEGIEHN 510

Query: 221 PVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKN 280
           P+VY+LMSEMAF+ EKV +  W+  YS RRYG+++  +  AW +LYHT+YN TDG  D N
Sbjct: 511 PIVYELMSEMAFRDEKVKINEWLKSYSHRRYGKAIHEVDAAWEILYHTIYNSTDGIADHN 570

Query: 281 RDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALEL 340
            D IV  PD DPS    + G  Q                      HLWY   +VI+AL+L
Sbjct: 571 HDYIVMLPDWDPSAAVKSAGMPQ---------------------AHLWYPPEDVIKALQL 609

Query: 341 FIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELV 400
           F+A G  L  S TYRYDL+DLTRQ L+K+AN++++  I ++Q  +   +   S  FLEL+
Sbjct: 610 FLAGGKNLKGSLTYRYDLVDLTRQVLSKFANQVYIKAITSFQKKNIDALQLNSHMFLELI 669

Query: 401 EDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDY 460
           +D+D LLA  D FLLG WL+SAK+LA N  + KQYEWNARTQ+TMWFD  +   S L DY
Sbjct: 670 KDIDLLLASDDNFLLGTWLQSAKKLAVNPSELKQYEWNARTQVTMWFDTNETTQSKLHDY 729

Query: 461 GNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE 520
            NK+WSG+L +YY PRA+ YF ++ ESL+  + F L +WR+EWI ++N WQ G  +YPV+
Sbjct: 730 ANKFWSGILENYYLPRASTYFSHLSESLKQNEKFNLTEWRKEWIPMSNKWQEGSELYPVK 789

Query: 521 SNGDALITSQWLYNKYL 537
           + GDAL  SQ LY KY 
Sbjct: 790 AKGDALTISQALYKKYF 806


>gi|449489156|ref|XP_004158231.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
           [Cucumis sativus]
          Length = 567

 Score =  645 bits (1663), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 309/531 (58%), Positives = 380/531 (71%), Gaps = 24/531 (4%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP+FSGNVPA L  +FPSA IT+LGNW S+ +DP  CCTYLL+ +DPLF++IG AFI Q
Sbjct: 59  VLPSFSGNVPAGLVEIFPSANITKLGNWNSIDADPSTCCTYLLNPSDPLFVKIGEAFIRQ 118

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q+KEYG  ++IY+CDTF+ENTPP +   YISSLGA++Y  M   D DAVWLMQGWLF  D
Sbjct: 119 QIKEYGDVTNIYSCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSD 178

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW+P QMKALL+SVP GK++VLDLFA+VKPIW +S QFYG PY+WCMLHNF GNIEMY
Sbjct: 179 SDFWKPDQMKALLHSVPFGKMIVLDLFADVKPIWKSSSQFYGTPYVWCMLHNFGGNIEMY 238

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           GILD+I+ GPV+A  SEN+TMVGVGM MEGIE NPVVY+LMSEMAF+ +KV V+ W+  Y
Sbjct: 239 GILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRXQKVQVQEWLKTY 298

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           S  RYG++   +  AWN+LYHT+YNCTDG  + N D IV  PD DPS  S  + K     
Sbjct: 299 SRCRYGKADHYVDAAWNILYHTIYNCTDGIANHNTDFIVKLPDWDPS--STFDLK----- 351

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
           KP                PHLWYST EVI AL+L +   + L  S TYRYDL+DLTRQ L
Sbjct: 352 KP----------------PHLWYSTQEVINALQLLVNVDDNLVHSATYRYDLVDLTRQVL 395

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
            K ANE +L  + A++  +       S+RF++L+ D+D LLA +  FLLG WLESAK+LA
Sbjct: 396 GKLANEEYLKAVTAFRRQNVKAQNLHSKRFIQLIRDIDKLLASNSNFLLGTWLESAKKLA 455

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            N  + KQYEWNARTQ+TMW+DNT+   S L DY NKYWSGLL  YY PRA  YF Y+ +
Sbjct: 456 TNPAEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANKYWSGLLEGYYLPRALTYFYYLSK 515

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
           SL   + F L+DWRREWI  +N WQ    +YPV++ G+A+  S+ LY KY 
Sbjct: 516 SLRKNESFHLEDWRREWILFSNKWQAASELYPVKAEGNAVAISKALYEKYF 566


>gi|449436325|ref|XP_004135943.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis sativus]
          Length = 774

 Score =  643 bits (1658), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 309/531 (58%), Positives = 380/531 (71%), Gaps = 24/531 (4%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP+FSGNVPA L  +FPSA IT+LGNW S+ +DP  CCTYLL+ +DPLF++IG AFI Q
Sbjct: 266 VLPSFSGNVPAGLVEIFPSANITKLGNWNSIDADPSTCCTYLLNPSDPLFVKIGEAFIRQ 325

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q+KEYG  ++IY+CDTF+ENTPP +   YISSLGA++Y  M   D DAVWLMQGWLF  D
Sbjct: 326 QIKEYGDVTNIYSCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSD 385

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW+P QMKALL+SVP GK++VLDLFA+VKPIW +S QFYG PY+WCMLHNF GNIEMY
Sbjct: 386 SDFWKPDQMKALLHSVPFGKMIVLDLFADVKPIWKSSSQFYGTPYVWCMLHNFGGNIEMY 445

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           GILD+I+ GPV+A  SEN+TMVGVGM MEGIE NPVVY+LMSEMAF+ +KV V+ W+  Y
Sbjct: 446 GILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVQVQEWLKTY 505

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           S  RYG++   +  AWN+LYHT+YNCTDG  + N D IV  PD DPS  S  + K     
Sbjct: 506 SRCRYGKADHYVDAAWNILYHTIYNCTDGIANHNTDFIVKLPDWDPS--STFDLK----- 558

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
           KP                PHLWYST EVI AL+L +   + L  S TYRYDL+DLTRQ L
Sbjct: 559 KP----------------PHLWYSTQEVINALQLLVNVDDNLVHSATYRYDLVDLTRQVL 602

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
            K ANE +L  + A++  +       S+RF++L+ D+D LLA +  FLLG WLESAK+LA
Sbjct: 603 GKLANEEYLKAVTAFRRQNVKAQNLHSKRFIQLIRDIDKLLASNSNFLLGTWLESAKKLA 662

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            N  + KQYEWNARTQ+TMW+DNT+   S L DY NKYWSGLL  YY PRA  YF Y+ +
Sbjct: 663 TNPAEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANKYWSGLLEGYYLPRALTYFYYLSK 722

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
           SL   + F L+DWRREWI  +N WQ    +YPV++ G+A+  S+ LY KY 
Sbjct: 723 SLRKNESFHLEDWRREWILFSNKWQAASELYPVKAEGNAVAISKALYEKYF 773


>gi|414585093|tpg|DAA35664.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
          Length = 721

 Score =  638 bits (1646), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 295/434 (67%), Positives = 360/434 (82%), Gaps = 6/434 (1%)

Query: 6   FAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
           F VLPAFSGN+PAAL++ FPSAK+T LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FI
Sbjct: 275 FPVLPAFSGNIPAALKSKFPSAKVTHLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFI 334

Query: 66  EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           E+Q++EYGRTSHIYNCDTFDENTPP+  P YISSLGAA + GMQSGD+DA+WLMQGWLF+
Sbjct: 335 EEQIREYGRTSHIYNCDTFDENTPPLSDPNYISSLGAATFRGMQSGDNDAIWLMQGWLFT 394

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
           YDPFW PPQMKALL+SVP+GK++VLDL+AEVKP+W  S Q YGVPYIWCMLHNFA + EM
Sbjct: 395 YDPFWEPPQMKALLHSVPVGKMIVLDLYAEVKPVWINSDQLYGVPYIWCMLHNFAADFEM 454

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG+LD++A GP++AR S+N+TMVGVGMSMEGIEQNP+VYDLMSEMAF H +VD++ W+  
Sbjct: 455 YGVLDALASGPIDARLSDNSTMVGVGMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVKT 514

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN- 304
           Y  RRYG+ V  +QDAW +LY T+YNCTDG  DKNRDVIVAFPDV+P +I+ T G + N 
Sbjct: 515 YPTRRYGKPVKGLQDAWWILYRTLYNCTDGKNDKNRDVIVAFPDVEPFVIA-TPGLHVNT 573

Query: 305 ---YGKPVSKEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
              Y    SK  + K  +S +Y+HPHLWY T+ VI ALELF+  G+E+S SNT+RYDL+D
Sbjct: 574 RQMYSTVPSKNYIRKDVSSDAYEHPHLWYDTNAVIHALELFLQHGDEVSDSNTFRYDLVD 633

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           LTRQ LAKYAN++FL IIE+Y+ N+ + V  L + FL LV D+D LL+ H+GFLLGPWLE
Sbjct: 634 LTRQVLAKYANDVFLKIIESYKSNNMNQVTILCQHFLSLVNDLDTLLSSHEGFLLGPWLE 693

Query: 421 SAKQLAQNEEQEKQ 434
           SAK LA+N EQE Q
Sbjct: 694 SAKGLARNSEQEIQ 707


>gi|242035709|ref|XP_002465249.1| hypothetical protein SORBIDRAFT_01g034960 [Sorghum bicolor]
 gi|241919103|gb|EER92247.1| hypothetical protein SORBIDRAFT_01g034960 [Sorghum bicolor]
          Length = 777

 Score =  627 bits (1618), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 304/537 (56%), Positives = 386/537 (71%), Gaps = 39/537 (7%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    VLP+FSGNVPA    +FPSA IT LG+W +V ++P+WCCTYLLD +D LFI++G+
Sbjct: 276 LGMVPVLPSFSGNVPAVFAKLFPSANITLLGDWNTVDANPKWCCTYLLDPSDSLFIDVGQ 335

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
           AFI QQ+KEYG  ++IYNCDTF+ENTPP D P YISSLG+AIY  M  G+ +AVWLMQGW
Sbjct: 336 AFIRQQIKEYGDVTNIYNCDTFNENTPPTDEPAYISSLGSAIYEAMSRGNKNAVWLMQGW 395

Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           LF  D  FW+ PQMKALL+SVP+GK++VLDLFA+VKPIW  S QFYGVPYIWCMLHNF G
Sbjct: 396 LFYSDAAFWKEPQMKALLHSVPIGKMIVLDLFADVKPIWKMSSQFYGVPYIWCMLHNFGG 455

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
           NIEMYG+LDSI+ GP++ARTS N+TM+GVGM MEGIE NPVVY+LMSEMAF ++KV+V+ 
Sbjct: 456 NIEMYGVLDSISSGPIDARTSYNSTMIGVGMCMEGIEHNPVVYELMSEMAFHNKKVEVE- 514

Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
                                               D N+D IV FPD+ PS IS    K
Sbjct: 515 ------------------------------------DHNKDYIVEFPDISPSSISSQLSK 538

Query: 302 YQNYGKPVSKEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
            +      +      SE S S  HPHLWYST E I+ALELF+ +G+  S S TYRYDL+D
Sbjct: 539 RRGMSIMRNHRRFFLSEVSGSLPHPHLWYSTKEAIKALELFLDAGSTFSKSLTYRYDLVD 598

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           LTRQ L+K ANE++L+ + +YQ  D++G+   +R+FLE++ D+D LLA  D FLLGPWLE
Sbjct: 599 LTRQCLSKLANEVYLDAMSSYQKKDSNGLNSHTRKFLEIIMDIDTLLAADDNFLLGPWLE 658

Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
           SAK LA  E++ +QYEWNARTQ+TMW+DNT+ E S L DY NK+WSGLL+ YY PRA+ Y
Sbjct: 659 SAKSLAITEKERQQYEWNARTQVTMWYDNTETEQSKLHDYANKFWSGLLKSYYLPRASKY 718

Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
           F Y+  SL+    F+L++WR++WI  +N+WQ+G+ VY V++ GDAL  ++ LY KYL
Sbjct: 719 FAYLTRSLQENQSFQLEEWRKDWISYSNEWQSGKEVYAVKATGDALAIARSLYRKYL 775


>gi|4160292|emb|CAA77084.1| alpha-N-acetylglucosaminidase [Nicotiana tabacum]
          Length = 811

 Score =  626 bits (1615), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 303/543 (55%), Positives = 392/543 (72%), Gaps = 13/543 (2%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP+FSGNVPAAL+ +FPSA IT+LG+W +V  DPRWCCT+LL  +DPLFIEIG AFI +
Sbjct: 268 VLPSFSGNVPAALKKIFPSANITRLGDWNTVNGDPRWCCTFLLAPSDPLFIEIGEAFIRK 327

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWL-MQGWLFSY 126
           Q++EYG  + IYNCDTF+ENTPP D P YI           Q   +   WL  + WLF  
Sbjct: 328 QIEEYGDITDIYNCDTFNENTPPTDDPTYIHLSALLCTKQCQKQITMRCWLNARVWLFYS 387

Query: 127 D-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
           D  +W+ PQM+ALL+SVP GK++VLDLFA+VKPIW +S QFYG PYIWCMLHNF GNIEM
Sbjct: 388 DSKYWKSPQMEALLHSVPRGKMIVLDLFADVKPIWKSSSQFYGTPYIWCMLHNFGGNIEM 447

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG+LD++A GP++ARTSEN+TMVGVGM MEGIE NPVVY+LMSEMAF+ +   ++ W+  
Sbjct: 448 YGVLDAVASGPIDARTSENSTMVGVGMCMEGIEHNPVVYELMSEMAFREDNFQLQGWLKS 507

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPS-----IISVTEG 300
           YS RRYG+    IQ AW++LYHT+YNCTDG  D N+D IV FPD DPS      IS T+ 
Sbjct: 508 YSHRRYGKVNDQIQAAWDILYHTIYNCTDGIADHNKDYIVEFPDWDPSGKTGTDISGTDS 567

Query: 301 KYQNYGKPVS-----KEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 355
             QN  + ++     +  +   ++SS   P LWYST +V +AL+LFI +  +LS S TYR
Sbjct: 568 SSQNRMQKLAGFQWNRRFLFFEKSSSLPKPRLWYSTEDVFQALQLFIDALKKLSGSLTYR 627

Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
           YDL+DL+RQ+L+K AN+++L+ I A++  DA  + Q S +FL L++D+D LLA  D FLL
Sbjct: 628 YDLVDLSRQSLSKLANQVYLDAISAFRREDAKPLNQHSPKFLPLLQDIDRLLAADDNFLL 687

Query: 416 GPWLESAKQ-LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 474
           G WLE+  Q LA N +++KQYEWNARTQITMWFDNT+   S L DY NK+WSGLL  YY 
Sbjct: 688 GTWLENCPQNLAMNSDEKKQYEWNARTQITMWFDNTKYNQSQLHDYANKFWSGLLEAYYL 747

Query: 475 PRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYN 534
           PRA+IYF+ + +SL+    F+L++WR+EWI  +N WQ    +YPV++ GDAL  +  L+ 
Sbjct: 748 PRASIYFELLSKSLKEKVDFKLEEWRKEWIAYSNKWQESTELYPVKAQGDALAIATALFE 807

Query: 535 KYL 537
           KY 
Sbjct: 808 KYF 810


>gi|168060822|ref|XP_001782392.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666123|gb|EDQ52786.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 801

 Score =  626 bits (1614), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 294/530 (55%), Positives = 376/530 (70%), Gaps = 17/530 (3%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+GNVPAA+   +PSA++T+LG W +V  D R+CCT+LLD  DPLF++IG+AFI Q
Sbjct: 273 VLPAFAGNVPAAITKKYPSARVTRLGEWNTVNGDTRYCCTFLLDPKDPLFVDIGKAFILQ 332

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q+KEYG T HIYNCDTF+EN PP D P YIS+LG+ +Y  M + D DA+WLMQ    +YD
Sbjct: 333 QIKEYGGTQHIYNCDTFNENQPPTDDPSYISALGSIVYEAMSAADQDAIWLMQ----AYD 388

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
            FW+PPQMKALL+SVP+G++VVLDLFA+VKP+WS S  FYGVPYIWCMLHNF GN+EMYG
Sbjct: 389 KFWKPPQMKALLHSVPVGRMVVLDLFADVKPMWSRSDHFYGVPYIWCMLHNFGGNVEMYG 448

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
            LD +A  P++A TS N+TMVGVGM MEGIEQNPVVYDLM+EMAF +  V V+ WI +Y+
Sbjct: 449 RLDVVATAPIQAVTSSNSTMVGVGMCMEGIEQNPVVYDLMAEMAFHNATVVVEDWIEEYA 508

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
            RRYG      + AW +L+ ++YNC+DG  D N DVIV FPD+DP          Q+ G+
Sbjct: 509 RRRYGELTAGARIAWKMLHESIYNCSDGIADHNGDVIVEFPDIDPKRSLFQIRPRQSLGQ 568

Query: 308 PVSKEAVLKSETSSYDHP-HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
            +              HP H+WYS  +   AL+  ++S + L  S  YRYD++DLTRQ L
Sbjct: 569 QI------------LGHPQHIWYSPQDAAVALQYLLSSADALGLSKPYRYDVVDLTRQVL 616

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
           +K AN+L   +++ +++ +   +  +S R LEL+ DMD LL   + FLLG WLESAK LA
Sbjct: 617 SKLANQLHSQVLDQFRMFNVEKMDNISSRLLELLSDMDDLLGASEEFLLGTWLESAKDLA 676

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            ++E+ K YEWNARTQITMWFDNT ++ S L DY NK WSGL RDYY PRA+IY KY+ +
Sbjct: 677 TSDEERKLYEWNARTQITMWFDNTLDKPSPLHDYANKMWSGLTRDYYLPRASIYIKYLKQ 736

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           SL     F  ++WRREWI LTN+WQ   N+YP  + GDAL  +  LY KY
Sbjct: 737 SLHENTSFAFQEWRREWIALTNEWQVASNLYPTVAKGDALEIATTLYEKY 786


>gi|302786446|ref|XP_002974994.1| hypothetical protein SELMODRAFT_102402 [Selaginella moellendorffii]
 gi|300157153|gb|EFJ23779.1| hypothetical protein SELMODRAFT_102402 [Selaginella moellendorffii]
          Length = 761

 Score =  615 bits (1587), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 290/535 (54%), Positives = 380/535 (71%), Gaps = 4/535 (0%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L   AVLPAFSGNVP AL+ ++PSA IT+L +W +V  +P+WCCTYLL   DPLFI+IG+
Sbjct: 224 LGMIAVLPAFSGNVPRALKILYPSANITRLPDWNTVDGNPQWCCTYLLQPMDPLFIQIGK 283

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
           AFIEQQ+KEYG T H+YNCDTF+EN PP D P YIS+L A++Y  M   D  A+WLMQGW
Sbjct: 284 AFIEQQVKEYGSTQHVYNCDTFNENLPPTDDPSYISALAASVYGAMIVADKQAIWLMQGW 343

Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           LFS D  FW+PPQMKALL++VP GK++VLDLFAEV+PIWS S  FYGVPYIWCMLHNF G
Sbjct: 344 LFSSDAQFWKPPQMKALLHAVPFGKMIVLDLFAEVRPIWSKSSHFYGVPYIWCMLHNFGG 403

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
           N EMYG LD ++ GPV+A+TS N+TM+GVGM MEGIEQNPVVY+LM+EMAF+  +  +K 
Sbjct: 404 NHEMYGRLDVVSSGPVDAKTSANSTMIGVGMCMEGIEQNPVVYELMAEMAFRSTRNALKD 463

Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           W+N YS RRYG++VP   +AW +L HT+YNC+DG  D N DVIV FPD++ S ++ T  +
Sbjct: 464 WVNDYSTRRYGKAVPEALEAWQILSHTLYNCSDGLQDHNTDVIVKFPDLNASSLT-TLSR 522

Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
           Y           +L    +S+   HLWY  +E   AL   + + + LS   TYRYDL+DL
Sbjct: 523 YLAEEAGTQTRRLLTEGLTSFG--HLWYRPTEAKVALSYMLNASSSLSNVATYRYDLVDL 580

Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
           TRQ L K AN++ L  + ++   D   + +     + +++D + LL  ++GFLLGPWLES
Sbjct: 581 TRQVLMKLANQIHLQALVSFVKGDLEELTKNCDILIGIIKDSELLLRSNNGFLLGPWLES 640

Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
           AK+L  N +++  YEWNARTQ+TMWFDNT+   S L DY NK WSGL  DYY PRA++Y 
Sbjct: 641 AKKLGTNSDEKHLYEWNARTQVTMWFDNTRSLPSALHDYANKMWSGLFEDYYLPRASLYT 700

Query: 482 KYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           K ++++L   + F    WR  WI LTN +QNG   YP+E+ GD++  ++ L++KY
Sbjct: 701 KLLVKALHDKEPFPYGSWRSSWILLTNTFQNGTKNYPLEAAGDSIEIAKSLFSKY 755


>gi|302791289|ref|XP_002977411.1| hypothetical protein SELMODRAFT_107285 [Selaginella moellendorffii]
 gi|300154781|gb|EFJ21415.1| hypothetical protein SELMODRAFT_107285 [Selaginella moellendorffii]
          Length = 761

 Score =  613 bits (1581), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 288/535 (53%), Positives = 378/535 (70%), Gaps = 4/535 (0%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L   AVLPAFSGNVP AL+ ++PSA IT+L +W +V  +P+WCCTYLL   DPLFI+IG+
Sbjct: 224 LGMIAVLPAFSGNVPRALKILYPSANITRLPDWNTVDGNPQWCCTYLLQPMDPLFIQIGK 283

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
           AFIEQQ+KEYG T H+YNCDTF+EN PP D P YIS+L A++Y  M   D  A+WLMQGW
Sbjct: 284 AFIEQQVKEYGSTQHVYNCDTFNENLPPTDDPSYISALAASVYGAMIVADKQAIWLMQGW 343

Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           LFS D  FW+PPQMKALL++VP GK++VLDLFAEV+PIWS S  FYGVPYIWCMLHNF G
Sbjct: 344 LFSSDAQFWKPPQMKALLHAVPFGKMIVLDLFAEVRPIWSKSSHFYGVPYIWCMLHNFGG 403

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
           N EMYG LD ++ GPV+A+TS N+TM+GVGM MEGIEQNPVVY+LM+EMAF+  +  +K 
Sbjct: 404 NHEMYGRLDVVSSGPVDAKTSANSTMIGVGMCMEGIEQNPVVYELMAEMAFRSTRNALKD 463

Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           W++ YS RRYG++VP   +AW +L HT+YNC+DG  D N DVIV FPD++ S ++     
Sbjct: 464 WVDDYSTRRYGKAVPEALEAWQILSHTLYNCSDGLQDHNTDVIVKFPDLNASSLTTLSRY 523

Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
               G   ++  + +  TS     HLWY  +E   AL   + + + LS   TYRYDL+DL
Sbjct: 524 LAEEGGTQTRRLLTEGLTS---FGHLWYRPTEAKVALSYMLNASSSLSNVATYRYDLVDL 580

Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
           TRQ L K AN++ L  + ++   D   + +     + +++D + LL  ++GFLLGPWLES
Sbjct: 581 TRQVLMKLANQIHLQALVSFVKGDLEELTKNCDILIGIIKDSELLLRSNNGFLLGPWLES 640

Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
           AK+L  N ++   YEWNARTQ+TMWFDNT+   S L DY NK WSGL  DYY PRA++Y 
Sbjct: 641 AKKLGTNSDETNLYEWNARTQVTMWFDNTRTLPSALHDYANKMWSGLFEDYYLPRASLYT 700

Query: 482 KYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           K ++++L   + F    WR  WI LTN +QNG   YP+E+ GD++  ++ L++KY
Sbjct: 701 KLLVKALHDKEPFPYDSWRSSWILLTNTFQNGTKNYPLEAAGDSIEIAKSLFSKY 755


>gi|326521470|dbj|BAK00311.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 428

 Score =  520 bits (1340), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 248/421 (58%), Positives = 316/421 (75%), Gaps = 1/421 (0%)

Query: 119 MQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLH 177
           +QGWLF  D  FW+  QMKALL+SVP+GK++VLDLFA+VKPIW TS QFYGVPYIWCMLH
Sbjct: 8   VQGWLFYSDAVFWKESQMKALLHSVPIGKMMVLDLFADVKPIWQTSSQFYGVPYIWCMLH 67

Query: 178 NFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 237
           NF GNIEMYG+LDSI+ GPV+ARTS N+TMVGVGM MEGIE NPVVY+LMSEMAF+ +KV
Sbjct: 68  NFGGNIEMYGVLDSISSGPVDARTSYNSTMVGVGMCMEGIEHNPVVYELMSEMAFRSQKV 127

Query: 238 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 297
            V+ W+  YS RRYG+S   IQ AW +LYHT+YNCTDG  D N+D IV FPD+ PS  S 
Sbjct: 128 KVEDWLKTYSHRRYGQSNVEIQKAWGILYHTIYNCTDGIADHNKDYIVEFPDMSPSSFSS 187

Query: 298 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 357
              K             L   ++S   PHLWYST E I++LELF+ +GN+LS S TYRYD
Sbjct: 188 QYSKRSISLARKHPRFFLSEVSASLPQPHLWYSTEEAIKSLELFLNAGNDLSKSLTYRYD 247

Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
           L+DLTRQ+L+K AN+++ + I +YQ  D+ G+   ++ FLEL+ D+D LLA  D FLLGP
Sbjct: 248 LVDLTRQSLSKLANKVYHDAISSYQKRDSSGLNFHTKEFLELIVDIDTLLASDDNFLLGP 307

Query: 418 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 477
           WLESAK LA  E++ KQYEWNARTQ+TMW+D+T+ E S L DY NK+WSGLL+ YY PRA
Sbjct: 308 WLESAKSLAMTEDERKQYEWNARTQVTMWYDDTKTEQSKLHDYANKFWSGLLKSYYLPRA 367

Query: 478 AIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
           + YF  +  SL+    F+L++WRR+WI  +N+WQ+G+ +YPV++ GD+L  S+ L+ KY 
Sbjct: 368 SKYFSRLSRSLQENRSFQLEEWRRDWISYSNEWQSGKELYPVKAIGDSLAISRSLFTKYF 427

Query: 538 Q 538
           +
Sbjct: 428 R 428


>gi|449518399|ref|XP_004166229.1| PREDICTED: alpha-N-acetylglucosaminidase-like, partial [Cucumis
           sativus]
          Length = 336

 Score =  511 bits (1315), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 247/331 (74%), Positives = 281/331 (84%), Gaps = 2/331 (0%)

Query: 208 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 267
           VGVGMSMEGIEQNPVVYDLMSEMAFQH KVDVK W+ QYSVRRYG  VP+IQDAW+VLYH
Sbjct: 1   VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH 60

Query: 268 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 327
           TVYNCTDGA DKNRDVIVAFPDVDPS I V       +G   S  +V + + +++D PHL
Sbjct: 61  TVYNCTDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDS--SVDRLQDATFDRPHL 118

Query: 328 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 387
           WY TSEVI AL+LFIA G++LS+SNTYRYDL+DLTRQALAKY+NELF  I++AYQL+D  
Sbjct: 119 WYPTSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQ 178

Query: 388 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 447
            +  LS+ FLELV D+D LLACH+GFLLGPWL+SAKQLA++EE+EKQYEWNARTQITMWF
Sbjct: 179 TMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWF 238

Query: 448 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 507
           DNT+EEASLLRDYGNKYWSGLL DYY PRAAIY K++ ES E+G  F L +WRREWIKLT
Sbjct: 239 DNTEEEASLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLT 298

Query: 508 NDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
           NDWQ+ R +YPVESNGDAL TS WLYNKYLQ
Sbjct: 299 NDWQSSRKIYPVESNGDALDTSHWLYNKYLQ 329


>gi|414585094|tpg|DAA35665.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
          Length = 1202

 Score =  501 bits (1291), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 248/431 (57%), Positives = 312/431 (72%), Gaps = 25/431 (5%)

Query: 112 DSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPY 171
           DS+  W     L + DP +           V +GK+ + +   E    +  +   Y    
Sbjct: 308 DSNPRWCCTYLLDASDPLF-----------VEIGKMFIEEQIRE----YGRTSHIYN--- 349

Query: 172 IWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMA 231
            WCMLHNFA + EMYG+LD++A GP++AR S+N+TMVGVGMSMEGIEQNP+VYDLMSEMA
Sbjct: 350 -WCMLHNFAADFEMYGVLDALASGPIDARLSDNSTMVGVGMSMEGIEQNPIVYDLMSEMA 408

Query: 232 FQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVD 291
           F H +VD++ W+  Y  RRYG+ V  +QDAW +LY T+YNCTDG  DKNRDVIVAFPDV+
Sbjct: 409 FHHRQVDLQVWVKTYPTRRYGKPVKGLQDAWWILYRTLYNCTDGKNDKNRDVIVAFPDVE 468

Query: 292 PSIISVTEGKYQN----YGKPVSKEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASGN 346
           P +I+ T G + N    Y    SK  + K  +S +Y+HPHLWY T+ VI ALELF+  G+
Sbjct: 469 PFVIA-TPGLHVNTRQMYSTVPSKNYIRKDVSSDAYEHPHLWYDTNAVIHALELFLQHGD 527

Query: 347 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGL 406
           E+S SNT+RYDL+DLTRQ LAKYAN++FL IIE+Y+ N+ + V  L + FL LV D+D L
Sbjct: 528 EVSDSNTFRYDLVDLTRQVLAKYANDVFLKIIESYKSNNMNQVTILCQHFLSLVNDLDTL 587

Query: 407 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 466
           L+ H+GFLLGPWLESAK LA+N EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWS
Sbjct: 588 LSSHEGFLLGPWLESAKGLARNSEQEIQYEWNARTQITMWFDNTETKASLLRDYANKYWS 647

Query: 467 GLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL 526
           GLL+DYYGPRAAIYFK+++ S+E+   F LK+WRREWI LTN+WQ+ R V+   + GD L
Sbjct: 648 GLLQDYYGPRAAIYFKHLLLSMENNAPFALKEWRREWISLTNNWQSDRKVFSTTATGDPL 707

Query: 527 ITSQWLYNKYL 537
             SQ LY KYL
Sbjct: 708 NISQSLYTKYL 718



 Score =  139 bits (351), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 58/75 (77%), Positives = 70/75 (93%)

Query: 6   FAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
           F VLPAFSGN+PAAL++ FPSAK+T LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FI
Sbjct: 275 FPVLPAFSGNIPAALKSKFPSAKVTHLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFI 334

Query: 66  EQQLKEYGRTSHIYN 80
           E+Q++EYGRTSHIYN
Sbjct: 335 EEQIREYGRTSHIYN 349


>gi|156399499|ref|XP_001638539.1| predicted protein [Nematostella vectensis]
 gi|156225660|gb|EDO46476.1| predicted protein [Nematostella vectensis]
          Length = 675

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 217/532 (40%), Positives = 320/532 (60%), Gaps = 42/532 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP F+G+VPA L  ++P A +++LG+W +  S   +CCTYLL+ +DPLF +IG AFI++
Sbjct: 179 VLPGFAGHVPAGLLRLYPKANVSKLGDWGNFNST--YCCTYLLEPSDPLFQKIGTAFIKE 236

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q  EYG T+HIYN DTF+E  P    P Y+ +  +A+Y GM  GD DAVWLMQGWLF  +
Sbjct: 237 QTAEYG-TNHIYNADTFNEMRPRSSDPTYLGAASSAVYRGMAGGDPDAVWLMQGWLFVDE 295

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
            FW+P Q+KALL+ VP G ++VLDL+AE  PIWS ++ FYG P+IWCML NF GNI ++G
Sbjct: 296 GFWKPDQIKALLHGVPQGFMIVLDLWAENSPIWSRTQSFYGTPFIWCMLLNFGGNIGLFG 355

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHE---KVDVKAWIN 244
            + S++ GP +A  S N+TM+G G++MEGIEQN ++++LM+EM ++ E    VD+  WI 
Sbjct: 356 NIKSVSTGPPKAFQSFNSTMIGTGLTMEGIEQNDMMFELMNEMGYRLEPLNPVDLDNWIK 415

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
            Y++RRYG + PAI  AW +L  +VY C     D    + V  P +D             
Sbjct: 416 DYALRRYGGTNPAIIQAWRLLIRSVYQCNGYCADHIHSIFVWKPSLD------------- 462

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                             + P+LWY   +V  A +   ++  E     T+RYDL+D+TRQ
Sbjct: 463 ------------------NKPNLWYDPEDVFNAWDELRSTAAEFMHVETFRYDLVDVTRQ 504

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
           AL      ++ ++I AY+   A  V     R LE+ +D+D LL  +  FLLG WL SAK 
Sbjct: 505 ALHLRVIPIYNDLISAYKNRSALNVIHFGSRLLEMFDDLDSLLQTNRNFLLGRWLNSAKA 564

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
           L     +   YE+NAR QIT+W    + E     DY NK WSGL++ YY PR  ++   M
Sbjct: 565 LGTTPAEVALYEFNARNQITLWGPRGEIE-----DYANKMWSGLVKAYYKPRWELFIDEM 619

Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           + ++  G+    + ++++ ++    W +G+  YP + +GD+L  +++L+NK+
Sbjct: 620 VSAIAQGEELDYEAFKKKLLEQETAWTHGKEEYPDQPSGDSLAAAEFLHNKW 671


>gi|384247107|gb|EIE20595.1| hypothetical protein COCSUDRAFT_37819 [Coccomyxa subellipsoidea
           C-169]
          Length = 762

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 223/533 (41%), Positives = 317/533 (59%), Gaps = 31/533 (5%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V PAF+G VP AL    P+A+I++  NW S  +  R+CC +LLD  +PLF EIG AF++ 
Sbjct: 236 VFPAFAGFVPGALARERPAARISRSDNWCSFPA--RYCCVHLLDPLEPLFQEIGSAFVKV 293

Query: 68  QLKEYGRTS-HIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
             +EYG      Y+ DTF+E TPP   P Y++S+ +AIY+ M + D  A WLMQ WLF Y
Sbjct: 294 LREEYGSDEVGFYSADTFNEMTPPSSDPAYLTSVTSAIYNAMAAADPSARWLMQAWLF-Y 352

Query: 127 D--PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
           D   FW+PPQ++AL++ VP   L++LDL+AEV P+W ++K F+G P+I+CMLHNF GNIE
Sbjct: 353 DNQKFWQPPQIQALVSGVPRDALIMLDLYAEVFPLWKSTKSFFGAPFIYCMLHNFGGNIE 412

Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
           MYG L+++A GP E +      ++G+GM  EGIEQNPVVY+LMSE AF+ + V+V+ WI 
Sbjct: 413 MYGALEAVARGPAEGQIDGVAGLIGIGMCPEGIEQNPVVYELMSEWAFRRQPVEVEGWIE 472

Query: 245 QYSVRRYGRSVPAIQ-DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
            Y+ RRYG S P     AW++L  +VYN TDG TD +RD+  + P + P+ + +   K  
Sbjct: 473 AYARRRYGNSTPPTALVAWDLLLRSVYNATDGHTDHSRDIPTSRPGLSPAEVGLWGLK-- 530

Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 363
                                PHLWY+  +V+ A  L + S  EL     YRYDL+D+ R
Sbjct: 531 ---------------------PHLWYNEQQVVDAWGLLLRSAGELQQVEGYRYDLVDVGR 569

Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
           Q ++K A +++  + EAY    +  V +   R L+L++D++ LLA + GFLLGP LE A 
Sbjct: 570 QVISKRATDIWKAVAEAYVDGRSIVVRREGARLLQLLDDLEELLATNRGFLLGPKLEEAS 629

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
                E + + YEWN R Q+T+W   +    S + DY N+ W+GL+  YY PR A++   
Sbjct: 630 SAGHTEAEARLYEWNLRKQLTVW-GTSDTGGSEIEDYANREWAGLISSYYKPRWALWLLR 688

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           +   L  G  +  + WR E +  T  W   R+  P+   GD    SQ LY  Y
Sbjct: 689 LETDLAQGRRYDPEAWRMECLNFTLGWAYLRDQLPLHPQGDTGGVSQRLYEVY 741


>gi|255553488|ref|XP_002517785.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
 gi|223543057|gb|EEF44592.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
          Length = 360

 Score =  404 bits (1037), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 190/359 (52%), Positives = 260/359 (72%), Gaps = 6/359 (1%)

Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
           MYGILDSI+ GP+EAR SEN+TMVGVGM MEGIE NPVVY+LMSEMAF+ EKV V  W+ 
Sbjct: 1   MYGILDSISTGPIEARVSENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSEKVQVLEWLK 60

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
            YS RRYG++V  ++ AW +LYHT+YNCTDG  D N D IV FPD DPS+ S ++   Q+
Sbjct: 61  TYSRRRYGKAVHQVEAAWEILYHTIYNCTDGIADHNTDFIVKFPDWDPSVQSGSDTSQQD 120

Query: 305 -----YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
                  +  S+  + +   S+    H+WYS  +VI AL+LFI  G+ L+ S TYRYDL+
Sbjct: 121 NKHIFLHRSGSRRFLFEGPNSTLPQAHIWYSIQKVINALQLFIDGGSHLTGSLTYRYDLV 180

Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
           DLTRQ L+K AN+++++ I A++ NDA  +   S++F++L++D+D LLA  D FL+G WL
Sbjct: 181 DLTRQVLSKLANQVYVDAIIAFRSNDARALNLHSQKFIQLIKDIDVLLASDDNFLIGTWL 240

Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
           ESAK+LA N  + +QYEWNARTQ+TMW+D T+   S L DY NK+WSGLL DYY PRA+ 
Sbjct: 241 ESAKELALNPSEMRQYEWNARTQVTMWYDTTKTNQSKLHDYANKFWSGLLEDYYLPRAST 300

Query: 480 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNG-DALITSQWLYNKYL 537
           YF ++++SL+  + F+L++WR +WI  +N+WQ G  +YP++ +G DAL  S+ LY+KY 
Sbjct: 301 YFDHLVKSLKQNEKFKLQEWREKWIAFSNEWQAGTKLYPMKGSGDDALAISKALYDKYF 359


>gi|14861378|gb|AAK73654.1| lysosomal alpha-N-acetyl glucosaminidase [Dromaius novaehollandiae]
          Length = 753

 Score =  400 bits (1029), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 215/541 (39%), Positives = 316/541 (58%), Gaps = 51/541 (9%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    VLPAF+G+VP  +   FP    T+LG W     D  + CTYLLD  DP+F  IG 
Sbjct: 244 LGMITVLPAFAGHVPQGVLRAFPRVNATRLGGWSHF--DCTYSCTYLLDPEDPMFQVIGT 301

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F+++ +KE+G T HIY+ DTF+E  P    P Y+S + +A++  M   D  AVWLMQGW
Sbjct: 302 LFLKELIKEFG-TDHIYSADTFNEMNPLSSDPAYLSRVSSAVFRSMTGADPKAVWLMQGW 360

Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           LF + P FW+P Q++ALL+ VPLG+++VLDLFAE +P++  ++ FYG P+IWCMLHNF G
Sbjct: 361 LFQHQPDFWQPAQVRALLHGVPLGRMIVLDLFAESRPVYQWTESFYGQPFIWCMLHNFGG 420

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
           N  ++G +++I  GP  AR   N+TMVG G+  EGIEQN +VY+LM+E+ ++ E +D+ +
Sbjct: 421 NHGLFGTVEAINHGPFAARRFPNSTMVGTGLVPEGIEQNDMVYELMNELGWRQEPLDLPS 480

Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           W+ +Y+ RRYG    A   AW +L  +VYNCT    + NR  +V  P             
Sbjct: 481 WVARYAERRYGAPNAAAASAWQLLLRSVYNCTGVCVNHNRSPLVRRPS------------ 528

Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
                        L+ +T       +WY+ S+V  A  L +++G EL +S T+ YDL D+
Sbjct: 529 -------------LRMDT------EVWYNKSDVYEAWRLLLSAGAELGSSPTFGYDLADV 569

Query: 362 TRQALAKYANELFLNIIEAYQ------LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
           TRQA  +  +E +L+I +A+Q      L  A GV        +L+ ++DGLL+ H  FLL
Sbjct: 570 TRQAAQQLVSEYYLSIRQAFQSRSLPELLTAGGVL-----VYDLLPELDGLLSSHRLFLL 624

Query: 416 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
           G WLESA+ +A ++ + +QYE NAR Q+T+W  N       + DY NK   GL+ DYYG 
Sbjct: 625 GRWLESARAVATSDREAEQYELNARNQVTLWGPNGN-----ILDYANKQLGGLVLDYYGV 679

Query: 476 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 535
           R +++   ++ESL SG  F    + +   ++   +   +  YP    GD L  S+ ++ K
Sbjct: 680 RWSLFVSALVESLNSGSPFHQDQFNQAVFQVERGFIYNKKRYPTAPVGDTLEISKKIFLK 739

Query: 536 Y 536
           Y
Sbjct: 740 Y 740


>gi|14861380|gb|AAK73655.1| lysosomal alpha-N-acetyl glucosaminidase [Dromaius novaehollandiae]
          Length = 753

 Score =  393 bits (1010), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 215/541 (39%), Positives = 316/541 (58%), Gaps = 51/541 (9%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    VLPAF+G+VP  +   FP    T+LG W     D  + CTYLLD  DP+F  IG 
Sbjct: 244 LGMITVLPAFAGHVPQGVLRAFPRVNATRLGGWSHF--DCTYSCTYLLDPEDPMFQVIGT 301

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F+++ +KE+G T HIY+ DTF+E  P    P Y+S + +A++  M   D  AVWLMQGW
Sbjct: 302 LFLKELIKEFG-TDHIYSADTFNEMNPLSSDPAYLSRVSSAVFRSMTGADPKAVWLMQGW 360

Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           LF + P FW+P Q++ALL+ VPLG+++VLDLFAE +P++  ++ FYG P+IWCMLHNF G
Sbjct: 361 LFQHQPDFWQPAQVRALLHGVPLGRMIVLDLFAESRPVYQWTESFYGQPFIWCMLHNFGG 420

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
           N  ++G +++I  GP  AR   N+TMVG G+  EGIEQN +VY+LM+E+ ++ E +D+ +
Sbjct: 421 NHGLFGTVEAINHGPFAARRFPNSTMVGTGLVPEGIEQNDMVYELMNELGWRQEPLDLPS 480

Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           W+ +Y+ RRYG    A   AW +L  +VYNCT    + NR  +V  P             
Sbjct: 481 WVARYAERRYGAPNAAAASAWXLLLRSVYNCTGVCVNHNRSPLVRRPS------------ 528

Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
                        L+ +T       +WY+ S+V  A  L +++G EL +S T+ YDL D+
Sbjct: 529 -------------LRMDT------EVWYNKSDVYEAWRLLLSAGAELGSSPTFGYDLADV 569

Query: 362 TRQALAKYANELFLNIIEAYQ------LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
           TRQA  +  +E +L+I +A+Q      L  A GV        +L+ ++DGLL+ H  FLL
Sbjct: 570 TRQAAQQLVSEYYLSIRQAFQSRSLPELLTAGGVL-----VYDLLPELDGLLSSHRLFLL 624

Query: 416 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
           G WLESA+ +A ++ + +QYE NAR Q+T+W  N       + DY NK   GL+ DYYG 
Sbjct: 625 GRWLESARAVATSDREAEQYELNARNQVTLWGPNGN-----ILDYANKQLGGLVLDYYGV 679

Query: 476 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 535
           R +++   ++ESL SG  F    + +   ++   +   +  YP    GD L  S+ ++ K
Sbjct: 680 RWSLFVSALVESLNSGSPFHQDQFNQAVFQVERGFIYNKKRYPTAPVGDTLEISKKIFLK 739

Query: 536 Y 536
           Y
Sbjct: 740 Y 740


>gi|390348210|ref|XP_785272.3| PREDICTED: alpha-N-acetylglucosaminidase [Strongylocentrotus
           purpuratus]
          Length = 793

 Score =  389 bits (998), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 202/537 (37%), Positives = 300/537 (55%), Gaps = 41/537 (7%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    VLPAF+G+VP +   VFP+A I+ LG+W   +  P +CCT LLD  DP+F ++G+
Sbjct: 252 LGMIPVLPAFAGHVPXSFSKVFPNASISNLGDW--GRFGPEYCCTSLLDPQDPMFKQVGK 309

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
           AFI+   +E+  T HIY+ DTF+EN P      Y+S+    +Y G+  GD   VWLM GW
Sbjct: 310 AFIDAMSEEFNGTDHIYSADTFNENKPKSRDSAYLSAASKGVYQGIIEGDPKGVWLMMGW 369

Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
           LF    FW P Q+KALL  VP+G+++VLDL+AE +P + T+  FYG P+IWCMLHNF GN
Sbjct: 370 LFQDTGFWGPTQIKALLQGVPIGRMIVLDLYAEARPFYKTTYSFYGQPFIWCMLHNFGGN 429

Query: 183 IEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
             +YG LD++  GP EAR  +N+TM+G+G + EGI QN V+Y+ +++M ++    +V  W
Sbjct: 430 TGLYGKLDAVNQGPFEARNYDNSTMIGMGTTPEGIFQNYVMYNFLTDMTWRSGSTNVSKW 489

Query: 243 INQYSVRRYG---RSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
           I QY+ RRY           +AW +L  TVYN T    D                     
Sbjct: 490 IEQYAGRRYSNDPNKSEEATEAWVILKETVYNNTGTLQD--------------------- 528

Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
                Y  PV + + + +         +WY  ++V +A E  + +  +L  S  +RYDL+
Sbjct: 529 ---HQYAVPVRRPSNIMTSP-------VWYDYTKVAKAWEFLLEASTKLGTSPVFRYDLV 578

Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
           D+TR  L   A +    ++ ++++ +A  V         L+ DMD + + H+ +LLG WL
Sbjct: 579 DVTRNVLQDLAFDFQQKLMVSFRIRNAGAVGGNGTLLCNLILDMDNITSSHEDWLLGTWL 638

Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
           E AK LA N ++E  YE+NA+ QIT+W    +EE   + DY NK W GLLR YY  R  +
Sbjct: 639 EDAKSLATNNDEESLYEYNAKNQITIW--GPKEE---ILDYANKQWGGLLRTYYHRRWQL 693

Query: 480 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           Y +Y+ E ++S   +    +        ++W + +  +P E  GD +  S+ LY KY
Sbjct: 694 YVQYLEECIQSHQPYDQNTFNVRSFVAESEWTHSKEKFPTEPVGDTMAISKALYVKY 750


>gi|348533253|ref|XP_003454120.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Oreochromis
           niloticus]
          Length = 845

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 197/533 (36%), Positives = 319/533 (59%), Gaps = 44/533 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSGN+P  +  ++P A++T+LG W        + C+ +LD  DPLF  IG  ++ Q
Sbjct: 345 VLPAFSGNIPKGILRLYPEARVTRLGPWSHFNCS--YSCSLVLDPQDPLFHHIGSLYLSQ 402

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            LK++G T HIY+ DTF+E TPP   P Y+S++  ++++ M + D  AVWLMQGWLF  D
Sbjct: 403 VLKQFG-TDHIYSTDTFNEMTPPSSDPAYLSAVSRSVFASMTAVDPQAVWLMQGWLFFSD 461

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW+P Q++ALL+ VPLG+++VLDLFAE +PI+S ++ FYG P+IWCML NF GN  ++
Sbjct: 462 AAFWKPAQIQALLHGVPLGRMIVLDLFAETEPIFSYTESFYGQPFIWCMLQNFGGNSGLF 521

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G ++SI  GP +A    N+T+VG+GM+ EGIEQNPV Y+LMSE+A++ E V++  W++ Y
Sbjct: 522 GTVESINSGPFKALHFPNSTLVGIGMTPEGIEQNPVTYELMSELAWRKEPVNLAKWVSLY 581

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           ++RRYG +  ++  AW +L+ ++YNCTD                           Y+N+ 
Sbjct: 582 AIRRYGNTQESLTTAWRLLFASIYNCTDP-------------------------HYRNHN 616

Query: 307 -KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
             P+ +    +  T       LWY  +++ +A +L + +   L +  T+RYDL+D+TR+ 
Sbjct: 617 HSPLVRRPSFQMNTG------LWYDPADLYKAWKLIMDAAPSLMSKETFRYDLVDVTREV 670

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQ 424
           L       + +I +A++  +   +       + +L+ +++ LL+ +  FLLG WLE A+ 
Sbjct: 671 LQVLTTSFYRDIADAFKKQNLSELLTAGGVLVYDLLPELNRLLSSNRNFLLGAWLERARS 730

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
           LA ++++ + Y+ NAR QIT+W  + +     + DY +K W GL+ DYY  R  ++ + +
Sbjct: 731 LAVDDKEAQLYDMNARNQITLWGPSGE-----ILDYASKEWGGLMEDYYAQRWGLFVQTL 785

Query: 485 IESLESGDGFRLKDWRREWIKLTNDW-QNGRNVYPVESNGDALITSQWLYNKY 536
           +E L SG  F+   + +   ++   +  NGR  YP +  GD    +  ++ KY
Sbjct: 786 VECLNSGQPFKQAAFNQAVFQIEKGFIYNGRK-YPTKPQGDTYEIAYRIFLKY 837


>gi|410930376|ref|XP_003978574.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Takifugu rubripes]
          Length = 751

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 202/531 (38%), Positives = 319/531 (60%), Gaps = 40/531 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSGN+P  +  +FP A++T+L  W   K +  + C+Y+LD  DPLF  IG  ++ Q
Sbjct: 251 VLPAFSGNIPKGILRLFPEARVTRLEPW--SKFNCSFSCSYILDPRDPLFSRIGSLYLSQ 308

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            +K++G T+HIYN DTF+E TPP   P Y+S++  A+++ M + D  AVWLMQGWLF  D
Sbjct: 309 VVKQFG-TNHIYNTDTFNEMTPPSSEPTYLSAVSRAVFASMTAVDPQAVWLMQGWLFLSD 367

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW+P Q++ALLN VP+G+++VLDLFAE +P++S ++ FYG P+IWCMLHNF GN   +
Sbjct: 368 ALFWKPAQIQALLNGVPVGRMIVLDLFAETEPVFSYTESFYGQPFIWCMLHNFGGNGGFF 427

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G ++SI  GP +A    N+++VG+GM+ EGIEQNPVVY+LMSE+A++ E V++  W++ Y
Sbjct: 428 GTVESINTGPFKALHFPNSSLVGIGMTPEGIEQNPVVYELMSELAWRKEPVNLLKWVSLY 487

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
             RRYG    ++  AW +L+ +VYNCT              P             Y+N+ 
Sbjct: 488 VTRRYGSMHESVSAAWKILFASVYNCT-------------LP------------HYRNH- 521

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
                 + L    S + +  LWY  +++ RA +L + +     +  T++YDL+D+TRQ +
Sbjct: 522 ----NHSPLVRRPSFHMNSELWYDPADLYRAWKLILEAAPSFMSKETFQYDLVDVTRQVM 577

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQL 425
                  + +I++A+Q +    +       L +L+ +++ LL+ +  FLLG WLE A+ L
Sbjct: 578 QVLTTSYYQDIVDAFQKHKMQELLTAGGVLLYDLLPELNRLLSSNHNFLLGTWLEQARSL 637

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
           A +E + K Y+ NAR Q+T+W  + +     + DY NK W GL++DYY  R  ++   ++
Sbjct: 638 ALDEREAKLYDINARNQLTLWGPSGE-----ILDYANKQWGGLMQDYYAQRWGLFIHTLV 692

Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           E L+SG  F+  ++ +   ++   +   R  YP +  GD    +  ++ KY
Sbjct: 693 ECLDSGQPFKQDNFNKVVFQVEKGFIYNRRQYPTKPQGDTFEIAHRIFLKY 743


>gi|375144105|ref|YP_005006546.1| alpha-N-acetylglucosaminidase [Niastella koreensis GR20-10]
 gi|361058151|gb|AEV97142.1| Alpha-N-acetylglucosaminidase [Niastella koreensis GR20-10]
          Length = 735

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 209/531 (39%), Positives = 310/531 (58%), Gaps = 40/531 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A +  +P+AK+ +  NW +  +D     TY+LD+ DPLF E+G+ F+++
Sbjct: 238 VLPAFTGHVPPAFKKKYPNAKL-KATNWTNGFAD-----TYILDSQDPLFAEMGKRFLQK 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q   +G T H+Y+ DTF+EN PP D P ++S+L A IY GM+  D+ A W+MQGWLF  D
Sbjct: 292 QTSLFG-TDHLYSADTFNENEPPSDDPAFLSALSARIYEGMKQADTAATWVMQGWLFYSD 350

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW+ PQ++ALL +VP  K+++LDL AE++P+W  +  FYG P+IW MLHNF GN+ ++
Sbjct: 351 RKFWKAPQIEALLKAVPDNKMILLDLAAEIEPVWKRTDAFYGKPWIWNMLHNFGGNVNLF 410

Query: 187 GILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           G +D +A  P E    + +  + G+G++ME IEQNPV+Y+LM+   +Q   VD+ AWI Q
Sbjct: 411 GRMDGVATQPAETLNDKASGKLWGIGLTMEAIEQNPVMYELMTRHTWQTTPVDLDAWIPQ 470

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           Y + RY  +   + DAW +L  TVYN   GA  +         D   SII+         
Sbjct: 471 YVLNRYRTNNTNLVDAWQILRKTVYN---GAVIR---------DGAESIIT--------- 509

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
           G+P      + + T     PH      E++ A +LF+ +  +   S+ ++YDL+D+TRQ 
Sbjct: 510 GRPTFDSTTVWTRTKLNYAPH------ELLPAWDLFVQAAGKGVNSDGFQYDLVDVTRQV 563

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
           LA YA  L    + A+   D+    + S+ FL+L+ DMD LLA    F+LGPWL +A+  
Sbjct: 564 LANYAAPLQKKWVTAFNAKDSAAFNKYSKAFLQLISDMDLLLASRKDFMLGPWLSAARSN 623

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
                ++  YE NAR  IT+W D      S L +Y N+ WSGLL D+Y PR   +F  + 
Sbjct: 624 GTTPAEKALYEQNARDLITLWGDAN----SPLHEYSNRQWSGLLNDFYKPRWQQFFTLLQ 679

Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           +SL +G    LK +          W N +  YPV  +G+++  +Q LY KY
Sbjct: 680 QSLRTGSTPDLKQFEENIRSWEWKWVNTQKAYPVVPSGNSVQVAQMLYKKY 730


>gi|373953359|ref|ZP_09613319.1| alpha-N-acetylglucosaminidase [Mucilaginibacter paludis DSM 18603]
 gi|373889959|gb|EHQ25856.1| alpha-N-acetylglucosaminidase [Mucilaginibacter paludis DSM 18603]
          Length = 733

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 205/533 (38%), Positives = 304/533 (57%), Gaps = 41/533 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPAA +N +P+AK+ +  NW +  +D     TY+LD+ DP+F  IG+ F+++
Sbjct: 232 VLPAFTGHVPAAFKNKYPTAKL-KTTNWKNGFAD-----TYILDSADPMFARIGQLFLQK 285

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q    G T H+Y+ DTF+EN PP D PEY+  L   +Y GM   D+ AVW+MQGWLF  D
Sbjct: 286 QTALLG-TDHLYSADTFNENEPPSDEPEYLGKLSERVYQGMHQADTAAVWVMQGWLFYSD 344

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW+P Q +ALL +VP  K+++LDL  E++P+W  ++ FYG P+IW ML+NF  N  ++
Sbjct: 345 RKFWKPEQTRALLKAVPDDKMIILDLATEIEPVWKRTEAFYGKPWIWNMLNNFGANTNLF 404

Query: 187 GILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           G +DS A GP EA    ++  M G+G++MEGIEQNPV+YDL+++  ++++ ++V  W+ +
Sbjct: 405 GRMDSAAKGPAEAYHDPKSGQMKGIGLTMEGIEQNPVLYDLLTDNTWRNQPINVDEWLPK 464

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQ 303
           Y + RYG+     Q AWN+L  TVY+        D    +I A P  D S          
Sbjct: 465 YVLNRYGKPNAQAQKAWNILRKTVYSVLADRYIRDGAESIIQARPTTDSS---------- 514

Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 363
                           S +    L Y    ++ A +  I +  +LS S+ +R+DL+DL+R
Sbjct: 515 ----------------SRWARTTLNYEPKALLPAWQAMIKASEDLSTSDGFRFDLVDLSR 558

Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
           Q LA YA  L    + A+Q  DA    + S  F+EL++DMD LLA    FLLGPW+  A+
Sbjct: 559 QVLANYAFTLQRRFVLAHQQKDAAAFKKHSAEFIELIQDMDQLLATRKDFLLGPWVADAR 618

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
           +      ++  YE NA+  IT+W D    +   L +Y  + WSGLL D+Y PR   YF+ 
Sbjct: 619 RCGATVSEKALYEMNAKDLITLWGD----KDCPLNEYACRQWSGLLNDFYKPRWQQYFEQ 674

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           +   L     F  + + R+       W N R  YPV+  GD ++ ++ LY KY
Sbjct: 675 INLDLTGKKPFDKEAFERKIKSWEWQWVNARKDYPVKPQGDPVLEARKLYKKY 727


>gi|255533666|ref|YP_003094038.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
 gi|255346650|gb|ACU05976.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
          Length = 735

 Score =  377 bits (968), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 205/532 (38%), Positives = 311/532 (58%), Gaps = 42/532 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF+G+VP A ++ FP AK+ +  NW +  S       Y+LD  D LF  IG+ FIE+
Sbjct: 236 ILPAFTGHVPPAFKDKFPKAKLKKT-NWTTFPS------VYILDPEDELFTTIGKRFIEE 288

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY- 126
           ++K +G T H+Y  DTF+ENTPP     Y+S++   +Y  M   D +A W+MQGWLF + 
Sbjct: 289 EVKTFG-TDHLYTADTFNENTPPTSDSLYLSNVSKKVYQSMALADPEATWIMQGWLFYHG 347

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           + FW+P Q+KALLN++P  K++VLDL++E  P+W  +  +YG P+IW MLHNF GNI +Y
Sbjct: 348 EKFWKPTQIKALLNAIPNDKMIVLDLWSENHPVWQRTAAYYGKPWIWNMLHNFGGNISLY 407

Query: 187 GILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           G +D +A G ++A+ + N+  MVG+G++ E IEQNPV+Y LM +  +  E ++V AW+  
Sbjct: 408 GRMDEVASGAIKAKQAANSGNMVGIGLTPEAIEQNPVMYQLMLDNIWTDEPINVTAWLKN 467

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           YS +RYG      + AW +LY TVY  T G            P    SI++         
Sbjct: 468 YSRQRYGAQNALAEQAWQILYKTVY--TGG----------ILPGGPESILT--------- 506

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
           G+P        +E++    P   Y  +E+I A E  + +  +LS ++ ++YDL+D+TRQ 
Sbjct: 507 GRPTM------AESTRSTRPKKNYKPAELIPAWEALLKASQQLS-TDGFKYDLVDVTRQV 559

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
           L  YA+ L     +AYQ  D     +LS  FL +++D+D LLA    FLLG WL  AK++
Sbjct: 560 LVNYADTLQRQFAQAYQGKDGKKFDRLSGDFLAVMDDVDYLLATRKDFLLGKWLNEAKRM 619

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
               E++K+YE NAR  IT+W D    + S L +Y  + WSGL+  +Y PR   +F Y  
Sbjct: 620 GTTAEEKKRYERNARNLITLWAD----QNSSLNEYSCRQWSGLISSFYKPRWQQFFSYAK 675

Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
           + L+SG     K +  +  +   DW N  +V+  + +G+ + T++ LY KY+
Sbjct: 676 QQLKSGAKLDQKVFEEKMKRWEWDWVNKNDVFTEQPSGNEIKTAESLYKKYI 727


>gi|326679829|ref|XP_688608.3| PREDICTED: alpha-N-acetylglucosaminidase-like [Danio rerio]
          Length = 757

 Score =  377 bits (967), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 201/532 (37%), Positives = 307/532 (57%), Gaps = 42/532 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSG VP  +  +FP A +T+L  W        + C Y+LD  DPLF  IG  F+ Q
Sbjct: 251 VLPAFSGIVPEGITRLFPKANVTKLSPWSHFNCT--YSCAYVLDPRDPLFHRIGALFLTQ 308

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            ++E+G T HIYN DTF+E  P    P Y++S+  AI++ M S D  A+WLMQGWLF  D
Sbjct: 309 VIEEFG-TDHIYNTDTFNEMPPASSDPTYLASISRAIFNTMTSVDPQAIWLMQGWLFISD 367

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW+  Q+KALL+ VPLG+++VLDLFAE  P++S++  FYG P+IWCMLHNF GN  ++
Sbjct: 368 PSFWKADQVKALLHGVPLGRMIVLDLFAESMPVYSSTNSFYGQPFIWCMLHNFGGNSGLF 427

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +DSI  GP  A    N+T+VG+GM+ EGIEQNPV+Y+LMSE+A++ + V++  W++ Y
Sbjct: 428 GTVDSINSGPFNAVRFPNSTLVGLGMTPEGIEQNPVIYELMSELAWRKDPVNLYKWVSLY 487

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           ++RRYG     +  AW +L+ +VYNCT              P            KY+N+ 
Sbjct: 488 ALRRYGSMDENLALAWQLLFRSVYNCT-------------LP------------KYKNHN 522

Query: 307 K-PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
           + P+     L  +T       +WY  ++  RA +L   +   L    T+RYDL+D+TRQA
Sbjct: 523 RSPLVHRPSLHMQTD------IWYDPADFYRAWKLLFEAAPGLVTLETFRYDLVDVTRQA 576

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQ 424
           L     E + +I  A+Q      +       + +L+ ++D +L+ ++ FLLG WL+ A+ 
Sbjct: 577 LQLLTTEFYKDIKSAFQTQKLSDLLTAGGVLVYDLLPELDRILSSNEHFLLGAWLQQAQS 636

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
              +E +   Y+ NAR QIT+W  + +     + DY +K W+GL+ DYY  R  ++   +
Sbjct: 637 QGVDEHEAHLYDINARNQITLWGPDGE-----ILDYASKEWAGLVEDYYLQRWGLFVNTL 691

Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           +E L+ G  F+   + +   ++   +   +  YP +  GD    ++ ++ KY
Sbjct: 692 VECLDRGRPFKQDVFNQAVFQVEKGFVFNQRKYPTKPLGDTYDIARRIFLKY 743


>gi|350407422|ref|XP_003488083.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bombus impatiens]
          Length = 770

 Score =  377 bits (967), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 207/544 (38%), Positives = 302/544 (55%), Gaps = 42/544 (7%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    VLPAF+G+VP A   +FP A +T+   W S     ++CC YLL+ TDPLF +IG 
Sbjct: 250 LGIIPVLPAFTGHVPRAFPRLFPEANVTKSATWNSFSD--KYCCPYLLEPTDPLFHKIGD 307

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F+   +KE+G T HIYNCDTF+EN PP    +++ ++G +I+  M S D  A+WLMQGW
Sbjct: 308 QFLRTYIKEFG-TDHIYNCDTFNENEPPTSELKFLRNVGHSIFQTMLSVDPQAIWLMQGW 366

Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           LF +D  FW  P++KA L SVPLG+L+VLDL +E  P++   K +YG P+IWCMLHNF G
Sbjct: 367 LFVHDAVFWTEPRIKAFLTSVPLGRLIVLDLQSEQFPLYGKLKSYYGQPFIWCMLHNFGG 426

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
            + M+G    I     E R  E +TM+G G++ EGI QN V+Y+LM+EMA++ E V++  
Sbjct: 427 TLGMFGSAQIINRRVFEGRNMEGSTMIGTGLTPEGINQNYVIYELMNEMAYRQEPVNLDN 486

Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           W   Y+ RRYG        AW  L  TVYN                       IS   GK
Sbjct: 487 WFEDYASRRYGAWNEYAVAAWKNLGSTVYNFRG--------------------ISKIRGK 526

Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
           Y    +P    A L            WY   +      +F+ + +    S  YR+D++D+
Sbjct: 527 YVITRRPSLNLARLT-----------WYDPEKFYSTWYIFLQARHGRKNSTLYRHDVVDI 575

Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
           TRQAL   A++++  ++E++   D       + R LEL +D++ +LA  + FLLG WLE 
Sbjct: 576 TRQALQLKADKIYSVLVESFNQKDVTTFKLQAGRLLELFDDLEAILASSEDFLLGTWLEM 635

Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
           AK LA ++ + K YE+NAR QIT+W    +     +RDY NK WSG++ DY+ PR AI+ 
Sbjct: 636 AKNLATDDAESKLYEYNARNQITLWGPRGE-----IRDYANKQWSGIVSDYFKPRWAIFL 690

Query: 482 KYMIESLESGDGFRLKDW-RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGT 540
             +  SL  G    +     R + ++   +   R +YP  + GD +  +  + +K+ Q +
Sbjct: 691 DGLTTSLTKGTSLNITRINERIFKEVEKPFTLSRKIYPTNATGDCIDIAMRILSKWYQPS 750

Query: 541 GVFD 544
            +FD
Sbjct: 751 -IFD 753


>gi|73965663|ref|XP_548088.2| PREDICTED: alpha-N-acetylglucosaminidase [Canis lupus familiaris]
          Length = 747

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 207/533 (38%), Positives = 312/533 (58%), Gaps = 43/533 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSG+VP AL  VFP   ITQLG+W        + C++LL   DPLF  IG  F+ +
Sbjct: 242 VLPAFSGHVPKALTRVFPQINITQLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRE 299

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            ++E+G T+HIY  DTF+E  PP   P Y++S  A++Y  M + DSDAVWL+QGWLF + 
Sbjct: 300 LIQEFG-TNHIYGADTFNEMQPPSSEPSYLASATASVYQAMITVDSDAVWLLQGWLFQHQ 358

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+KA+L +VP G+L+VLDLFAE +P++  +  F G P+IWCMLHNF GN  ++
Sbjct: 359 PQFWGPAQVKAVLEAVPRGRLLVLDLFAESQPVYIQTASFQGQPFIWCMLHNFGGNHGLF 418

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TM+G GM+ EGI QN VVY LM+E+ ++ + V D++AW++ 
Sbjct: 419 GALEAVNRGPAAARLFPNSTMLGTGMAPEGIGQNEVVYALMAELGWRKDPVADLEAWVSS 478

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ RRYG +    + AW +L  +VYNC+ +  +  NR  +V      PS+  VT      
Sbjct: 479 FAARRYGVAHRDTEAAWRLLLRSVYNCSGEACSGHNRSPLVR----RPSLQMVTT----- 529

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                                 +WY+ S+V  A  L + +   L++S T+RYDL+D+TRQ
Sbjct: 530 ----------------------VWYNRSDVFEAWRLLLTAAPTLASSPTFRYDLLDVTRQ 567

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
           A  +  +  ++    AY   +   + + +   + EL+  +D +LA    FLLG WLE A+
Sbjct: 568 AAQELVSLYYVEARSAYLRKELVPLLRAAGVLVYELLPALDKVLASDSRFLLGRWLEQAR 627

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
             A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ DYY PR  ++ + 
Sbjct: 628 AAAVSEAEAHLYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYTPRWRLFMEM 682

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++ESL  G  F+   + +   +L   +  G   YP + +GD +  ++ L+ KY
Sbjct: 683 LVESLVQGIPFQQHQFDKNAFQLEQTFIFGTQRYPSQPDGDTVDLAKKLFIKY 735


>gi|443691318|gb|ELT93213.1| hypothetical protein CAPTEDRAFT_144379, partial [Capitella teleta]
          Length = 718

 Score =  374 bits (961), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 203/512 (39%), Positives = 302/512 (58%), Gaps = 42/512 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
            LPAF+G+VPA +  +FP  K+++LG+W    S   +CCT LLD  DPLF EIG+AFI++
Sbjct: 225 ALPAFAGHVPANITRLFPRVKVSKLGDWGRFNST--YCCTTLLDVEDPLFKEIGKAFIDE 282

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             +E+G T H+YN DTF+E TP    P Y++  G A+YSGM S DS A+WLMQGWLF  D
Sbjct: 283 YTREFG-TDHVYNTDTFNEMTPASSDPSYLTKAGQAVYSGMVSSDSKAIWLMQGWLFLSD 341

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
            FW+PPQ KALL SVP GK++VLDL++EV P +   + +YG P+IWCMLHNF G + MYG
Sbjct: 342 -FWKPPQAKALLTSVPQGKMLVLDLYSEVNPQYPRLQSYYGQPFIWCMLHNFGGTLPMYG 400

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
            ++S+  GP E R+  N+TMVG+G++ EGI QN V+Y+ M E +F+ + V++  W ++Y+
Sbjct: 401 AIESVNQGPFEGRSFVNSTMVGIGLTPEGINQNEVMYEFMMENSFRSQPVELTEWFDKYA 460

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
            RRY       + AW +   TVYNC+DG    N+++ V  P                   
Sbjct: 461 TRRYASRNANARAAWQIFKRTVYNCSDGVKHHNKNIPVCRP------------------- 501

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
                       S  +   +WY   +  +  +L IA+  E+  S  +RYDL+D++RQAL 
Sbjct: 502 ------------SRKNKIDVWYDVEDFFKGWDLMIAASKEVD-SPLFRYDLVDVSRQALQ 548

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
             +   +  I+ +Y+  +   +       L L++DMD +LA    FLLG W+  A +   
Sbjct: 549 VISITYYNQILTSYKQKNLTSLASSGNDLLHLLDDMDTVLATDSHFLLGAWIAGAHRNGV 608

Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
             E++  YE+NAR Q+T+W      +A++L DY NK W+GL+ DYY  R  ++   + +S
Sbjct: 609 TPEEKALYEFNARNQVTLW----GPDANIL-DYANKQWAGLVADYYHERWELFIDELKKS 663

Query: 488 LESGDGFRLKDWRRE-WIKLTNDWQNGRNVYP 518
           LE+   F  K ++++ + K  + +    NVYP
Sbjct: 664 LENKTSFDEKKFQKDVFEKAESPFTYRTNVYP 695


>gi|340717403|ref|XP_003397173.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bombus terrestris]
          Length = 770

 Score =  374 bits (959), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 204/538 (37%), Positives = 296/538 (55%), Gaps = 41/538 (7%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    VLPAF+G+VP A   +FP A +T+   W S     ++CC YLL+ TDPLF +IG 
Sbjct: 250 LGIIPVLPAFTGHVPRAFPRLFPEANVTKSATWNSFSD--KYCCPYLLEPTDPLFHKIGD 307

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F+   +KE+G T HIYNCDTF+EN PP    +++ ++G +I+  M S D  A+WLMQGW
Sbjct: 308 QFLRTYIKEFG-TDHIYNCDTFNENEPPTSELKFLRNVGHSIFQTMLSVDPQAIWLMQGW 366

Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           LF +D  FW  P++K  L SVPLG+L+VLDL +E  P++   K +YG P+IWCMLHNF G
Sbjct: 367 LFVHDALFWTEPRIKTFLTSVPLGRLIVLDLQSEQFPLYGKLKSYYGQPFIWCMLHNFGG 426

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
            + M+G    I     E R  E +TM+G G++ EGI QN V+Y+LM+EMA++ E V++  
Sbjct: 427 TLGMFGSAQIINRRVFEGRNMEGSTMIGTGLTPEGINQNYVIYELMNEMAYRQEPVNLDN 486

Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           W   Y+ RRYG        AW  L  TVYN                       IS   GK
Sbjct: 487 WFEDYASRRYGAWNEYAVAAWKNLGSTVYNFRG--------------------ISKIRGK 526

Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
           Y    +P    A L            WY   +      +F+ + +    S  YR+D++D+
Sbjct: 527 YVITRRPSLNLARLT-----------WYDPEKFYSTWYIFLQARHGRQNSTLYRHDVVDI 575

Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
           TRQAL   A++++  ++E++   D       + R LEL +D++ +LA  + FLLG WLE 
Sbjct: 576 TRQALQLKADKIYSALVESFNQKDVTTFKLQADRLLELFDDLEAILASSEDFLLGTWLEM 635

Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
           AK LA ++ + K YE+NAR QIT+W    +     +RDY NK WSG++ DY+ PR AI+ 
Sbjct: 636 AKNLATDDAESKLYEYNARNQITLWGPRGE-----IRDYANKQWSGIVSDYFKPRWAIFL 690

Query: 482 KYMIESLESGDGFRLKDW-RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
             +  SL  G    +     R + ++   +   R +YP    GD +  +  + +K+ Q
Sbjct: 691 DALTTSLTKGTSLNITRINERIFKEVEKPFTLSRKIYPTNVTGDCIDIAMRILSKWHQ 748


>gi|449491231|ref|XP_004174728.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase,
           partial [Taeniopygia guttata]
          Length = 752

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 207/536 (38%), Positives = 314/536 (58%), Gaps = 51/536 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP  +  VFP    T+LG+W     D  + C YLLD  DP+F  IG  F+++
Sbjct: 248 VLPAFAGHVPQGILRVFPRVNATRLGHWSHF--DCTYSCIYLLDPEDPMFQVIGTLFLKE 305

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            +KE+G T H+Y+ DTF+E TP    P Y+S +  A++  M   D  A+WLMQGWLF + 
Sbjct: 306 LIKEFG-TDHVYSADTFNEMTPLSSDPAYLSRVSNAVFRSMTGADPKALWLMQGWLFQHQ 364

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW+P Q++ALL+ VPLG+++VLDLFAE KP++  ++ FYG P+IWCMLHNF GN  ++
Sbjct: 365 PDFWQPAQVRALLHGVPLGRMIVLDLFAESKPVYQWTESFYGQPFIWCMLHNFGGNHGLF 424

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +++I  GP  AR   N+TMVG G+  EGIEQN +VY+LM+E+ ++ E +D+ +W+ +Y
Sbjct: 425 GTVEAINHGPFAARRFPNSTMVGTGLVPEGIEQNDMVYELMNELGWRQEPLDLPSWVTRY 484

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + RRYG    A   AW +L  +VYNCT    + NR                         
Sbjct: 485 AERRYGAPNAAAASAWRLLLRSVYNCTGVCVNHNRS------------------------ 520

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
            P+ +   L+ +T       LWY+ S+V  A  L +++G EL +S  + YDL+D+TRQA 
Sbjct: 521 -PLVRRPSLRMDT------ELWYNASDVFEAWRLLLSAGAELGSSPAFLYDLVDVTRQAA 573

Query: 367 AKYANELFLNIIEAYQ------LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
            +  +  +L+I +A+Q      L  A GV        +L+ ++D LL+ H  FLLG WL+
Sbjct: 574 QQLVSHYYLSIRQAFQSHALPELLTAGGVL-----VYDLLPELDSLLSSHSLFLLGRWLQ 628

Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
           SA+ +A ++++ +QYE NAR Q+T+W  +       + DY N    GL+ DYY  R +++
Sbjct: 629 SARAVATSDQEAEQYELNARNQVTLWGPSGN-----ILDYANXQLGGLVLDYYAVRWSLF 683

Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
              ++ESL SG  F    + + + ++   +   +  YP    GD +  S+ L+ KY
Sbjct: 684 VSVLVESLNSGRPFHQNQFNQVFFQVERGFIYNKKRYPAVPFGDTMEISRKLFLKY 739


>gi|432926094|ref|XP_004080826.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Oryzias latipes]
          Length = 882

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 202/537 (37%), Positives = 314/537 (58%), Gaps = 52/537 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSGNVP  +  + P A +T+LG W        + C+Y+LD  DPLF++IG  ++ Q
Sbjct: 382 VLPAFSGNVPKGILKLHPEANVTRLGPWAHFNCS--FSCSYVLDPRDPLFLQIGSLYLSQ 439

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            +K++G T HIYN DTF+E TPP   P Y+S++  ++++ M + D  A+WLMQGWLF  D
Sbjct: 440 VVKQFG-TDHIYNTDTFNEMTPPSSDPAYLSAISRSVFASMTAVDPKAIWLMQGWLFFSD 498

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW+PPQ++ALL+ VPLG+++VLDLFAE +P++S ++ FYG P+IWCMLHNF GN   +
Sbjct: 499 AAFWKPPQIRALLHGVPLGRMIVLDLFAETEPVFSYTESFYGQPFIWCMLHNFGGNNGFF 558

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G ++SI  GP +A   +N+TMVG+GM+ EGI QNPV+Y+LMSE+A++ E V++  W + Y
Sbjct: 559 GTVESINSGPFKALNFKNSTMVGIGMTPEGIHQNPVIYELMSELAWRKESVNLTKWASLY 618

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + RRYG    ++  AW +L+ +VYNCT                       V   +  N+ 
Sbjct: 619 AARRYGSMHESLSAAWKLLFSSVYNCT-----------------------VPHYRNHNH- 654

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
            P+ +       T       LWY  ++++   +LF+ +   L +  T+RYDL+D+TRQ L
Sbjct: 655 SPLVRRPSFNMNTG------LWYDPADLLETWKLFMEAAPSLMSKETFRYDLVDVTRQVL 708

Query: 367 AKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
                  + +I +A+      +L  + GV        +L  +++ LL     FLLG WLE
Sbjct: 709 QDLTTYFYQDIKDAFHSKKMPELLTSGGVL-----IYDLFPELNRLLNSDRNFLLGTWLE 763

Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
            A+  A +E + + Y+ NAR Q+T+W  + +     + DY NK W GL+ DYY  R +++
Sbjct: 764 QAQSFALDEPEARLYDLNARNQLTLWGPSGE-----ILDYANKEWGGLVEDYYAQRWSLF 818

Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDW-QNGRNVYPVESNGDALITSQWLYNKY 536
            + +++ L SG  F+   + +   ++   +  NGR  YP +  GD    +  ++ KY
Sbjct: 819 VQTLVDCLNSGLPFKQDAFNQAVFRVEKGFISNGRK-YPTKPQGDTYEIAHRIFLKY 874


>gi|149054264|gb|EDM06081.1| rCG33377, isoform CRA_d [Rattus norvegicus]
          Length = 580

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 204/533 (38%), Positives = 306/533 (57%), Gaps = 43/533 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   + QLGNW        + C++LL   DPLF  IG  F+ +
Sbjct: 80  VLPAFAGHVPKAITRVFPQVNVIQLGNWGHFNCS--YSCSFLLAPGDPLFPLIGTLFLRE 137

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY  DTF+E  PP   P Y+++  AA+Y  M + D DAVWL+QGWLF + 
Sbjct: 138 LTKEFG-TDHIYGADTFNEMQPPFSDPSYLAAATAAVYEAMVTVDPDAVWLLQGWLFQHQ 196

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+KA+L +VP G+L+VLDLFAE +P++S +  F+G P+IWCMLHNF GN  ++
Sbjct: 197 PQFWGPSQIKAVLEAVPRGRLLVLDLFAETQPVYSRTASFHGQPFIWCMLHNFGGNHGLF 256

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+ +  GP  AR   N+TMVG G++ EGI QN VVY LM+E+ ++ + V D+ AW++ 
Sbjct: 257 GALEDVNQGPQAARLFPNSTMVGTGIAPEGIGQNEVVYALMAELGWRKDPVPDLVAWVSS 316

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ RRYG S P    AW +L  +VYNC+ +  +  NR                       
Sbjct: 317 FASRRYGVSQPDAVAAWRLLLRSVYNCSGEACSGHNRS---------------------- 354

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
              P+ K   L+  T+      +WY+ S+V  A  L + +   L+AS  +RYDL+D+TRQ
Sbjct: 355 ---PLVKRPSLQMSTA------VWYNRSDVFEAWRLLLRAAPNLTASPAFRYDLLDVTRQ 405

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
           A+ +  +  +     A+   D   + +       +L+  +D LLA +  FLLG WL+ A+
Sbjct: 406 AVQELVSSCYEEARTAFLNQDLDLLLRAGGLLTYKLLPSLDELLASNSHFLLGTWLDQAR 465

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
           ++A +E + + YE N+R QIT+W      E ++L DY NK  +GL+ DYY PR  ++   
Sbjct: 466 EVAVSESEAQFYEQNSRYQITLW----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGT 520

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           +  SL  G  F+   + +    L   + N +  YP++  GD +  S+ ++ K+
Sbjct: 521 LAHSLARGIPFQQHQFEKSVFPLEQAFINNKKRYPIQPQGDTVDLSKKIFLKF 573


>gi|440903235|gb|ELR53922.1| Alpha-N-acetylglucosaminidase, partial [Bos grunniens mutus]
          Length = 614

 Score =  370 bits (950), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 209/537 (38%), Positives = 305/537 (56%), Gaps = 51/537 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP AL  VFP   +TQ+GNW        + C++LL   DPLF  +G  F+ +
Sbjct: 114 VLPAFAGHVPKALTRVFPQVNVTQMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRE 171

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY  DTF+E  PP   P Y+++   A+Y  M + D DAVWL+QGWLF + 
Sbjct: 172 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYQAMTAVDPDAVWLLQGWLFQHQ 230

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+ A+L +VP G+L+VLDLFAE +P++  +  F G P+IWCMLHNF GN  ++
Sbjct: 231 PEFWGPAQVAAVLGAVPRGRLLVLDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLF 290

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+S+  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  
Sbjct: 291 GALESVNQGPTTARHFPNSTMVGTGMAPEGIGQNEVVYALMAELGWKKDPVADLGAWVTS 350

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           ++ RRYG S    + AW +L  +VYNC                       S  E +  N+
Sbjct: 351 FAARRYGVSHGDAEAAWRLLLRSVYNC-----------------------SGEECRGHNH 387

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
             P+ +   L+  T+      +WY+ S+V  A  L +A+ + L++S  +RYDL+D+TRQA
Sbjct: 388 -SPLVRRPSLQMVTT------VWYNRSDVFEAWRLLLAATSTLASSPAFRYDLVDVTRQA 440

Query: 366 LAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
           + +  +  +  +  AY       L  A G+        EL+  +D +LA    FLLG WL
Sbjct: 441 VQELVSLYYEEMRTAYLKKELVPLTRAGGILA-----YELLPALDQVLASDCHFLLGSWL 495

Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
           E A+Q A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ DYY PR  +
Sbjct: 496 EQARQAAVSETEAHFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLMADYYAPRWRL 550

Query: 480 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           + + ++ESL  G  F+   + R   +L   +  G   YP +  GD +   + L+ KY
Sbjct: 551 FTETLVESLVQGVPFQQHQFDRNAFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKY 607


>gi|109491871|ref|XP_001081442.1| PREDICTED: alpha-N-acetylglucosaminidase [Rattus norvegicus]
 gi|392351622|ref|XP_002727861.2| PREDICTED: alpha-N-acetylglucosaminidase [Rattus norvegicus]
 gi|149054262|gb|EDM06079.1| rCG33377, isoform CRA_b [Rattus norvegicus]
          Length = 739

 Score =  370 bits (949), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 204/533 (38%), Positives = 306/533 (57%), Gaps = 43/533 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   + QLGNW        + C++LL   DPLF  IG  F+ +
Sbjct: 239 VLPAFAGHVPKAITRVFPQVNVIQLGNWGHFNCS--YSCSFLLAPGDPLFPLIGTLFLRE 296

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY  DTF+E  PP   P Y+++  AA+Y  M + D DAVWL+QGWLF + 
Sbjct: 297 LTKEFG-TDHIYGADTFNEMQPPFSDPSYLAAATAAVYEAMVTVDPDAVWLLQGWLFQHQ 355

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+KA+L +VP G+L+VLDLFAE +P++S +  F+G P+IWCMLHNF GN  ++
Sbjct: 356 PQFWGPSQIKAVLEAVPRGRLLVLDLFAETQPVYSRTASFHGQPFIWCMLHNFGGNHGLF 415

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+ +  GP  AR   N+TMVG G++ EGI QN VVY LM+E+ ++ + V D+ AW++ 
Sbjct: 416 GALEDVNQGPQAARLFPNSTMVGTGIAPEGIGQNEVVYALMAELGWRKDPVPDLVAWVSS 475

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ RRYG S P    AW +L  +VYNC+ +  +  NR                       
Sbjct: 476 FASRRYGVSQPDAVAAWRLLLRSVYNCSGEACSGHNR----------------------- 512

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
              P+ K   L+  T+      +WY+ S+V  A  L + +   L+AS  +RYDL+D+TRQ
Sbjct: 513 --SPLVKRPSLQMSTA------VWYNRSDVFEAWRLLLRAAPNLTASPAFRYDLLDVTRQ 564

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
           A+ +  +  +     A+   D   + +       +L+  +D LLA +  FLLG WL+ A+
Sbjct: 565 AVQELVSSCYEEARTAFLNQDLDLLLRAGGLLTYKLLPSLDELLASNSHFLLGTWLDQAR 624

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
           ++A +E + + YE N+R QIT+W      E ++L DY NK  +GL+ DYY PR  ++   
Sbjct: 625 EVAVSESEAQFYEQNSRYQITLW----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGT 679

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           +  SL  G  F+   + +    L   + N +  YP++  GD +  S+ ++ K+
Sbjct: 680 LAHSLARGIPFQQHQFEKSVFPLEQAFINNKKRYPIQPQGDTVDLSKKIFLKF 732


>gi|307192254|gb|EFN75548.1| Alpha-N-acetylglucosaminidase [Harpegnathos saltator]
          Length = 741

 Score =  368 bits (945), Expect = 4e-99,   Method: Compositional matrix adjust.
 Identities = 197/537 (36%), Positives = 299/537 (55%), Gaps = 46/537 (8%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    VLPAF+G+VP A   +FP+A +T++  W   K + ++CC YLL+ TDPLF  IG 
Sbjct: 226 LGIVPVLPAFAGHVPRAFARLFPNANMTKIEPW--NKFEDKYCCPYLLEPTDPLFQTIGE 283

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F+   + E+G T HIYNCDTF+EN P      Y+S++G +++  M + D  A+WLMQGW
Sbjct: 284 KFLRMYINEFG-TDHIYNCDTFNENEPGNSELAYLSNVGRSVFQAMSTVDPQAIWLMQGW 342

Query: 123 LFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           LF +D  FW  P++++ L SVP G+++VLDL +E  P +   K +YG P+IWCMLHNF G
Sbjct: 343 LFVHDFIFWTEPRVRSFLTSVPTGRMLVLDLQSEQFPQYGRLKSYYGQPFIWCMLHNFGG 402

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
            + M+G    I     E R    +TMVG G++ EGI QN V+Y+LM+EMA++HE VD+ A
Sbjct: 403 TLGMFGSAQIINQRTFEGRHMNGSTMVGTGLTPEGINQNYVIYELMNEMAYRHEPVDLDA 462

Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           W   Y+ RRYG        AW  L  T+YN       +   VI   P ++ S        
Sbjct: 463 WFESYATRRYGAWNEYAVAAWKHLGRTIYNFVGIERIRGHYVITRRPSLNIS-------- 514

Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
                                  P +WY+  +      +F+ +      +  YR+D++D+
Sbjct: 515 -----------------------PWVWYNREDFYHTWNVFLKARYGRGNNTLYRHDVVDI 551

Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
           TRQAL   A+ +++N+++ Y+  +  G    +   L+L +D++ +LA    FLLG WL  
Sbjct: 552 TRQALQLMADNIYMNVVDCYKRKNITGFQSHAAALLDLFDDIEAILASGSNFLLGTWLAQ 611

Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
           AK +A +E++ + YE+NAR QIT+W  N +     +RDY NK WSG++ DY+ PR A + 
Sbjct: 612 AKDMAVDEKERQSYEYNARNQITLWGPNGE-----IRDYANKQWSGVVADYFKPRWAFFL 666

Query: 482 KYMIESLESGDGFRLKDWR-REWIKLTNDWQNGRNVYPVESNGDAL-----ITSQWL 532
           K + +SL       + +   R ++++   +     +YPV + GD L     I S+WL
Sbjct: 667 KALEKSLVERTRLNMTEINDRMFLEVEQAFTFSTKLYPVGTKGDTLDIAVKIISKWL 723


>gi|358419179|ref|XP_003584151.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bos taurus]
          Length = 741

 Score =  367 bits (942), Expect = 8e-99,   Method: Compositional matrix adjust.
 Identities = 210/537 (39%), Positives = 305/537 (56%), Gaps = 51/537 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP AL  VFP   +TQ+GNW        + C++LL   DPLF  +G  F+ +
Sbjct: 241 VLPAFAGHVPKALTRVFPQVNVTQMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRE 298

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY  DTF+E  PP   P Y+++  AA+Y  M + D DAVWL+QGWLF + 
Sbjct: 299 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATAAVYQAMTAVDPDAVWLLQGWLFQHQ 357

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+ A+L +VP G+L+VLDLFAE +P++  +  F G P+IWCMLHNF GN  ++
Sbjct: 358 PEFWGPAQVAAVLGAVPRGRLLVLDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLF 417

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+S+  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+ +Q + V D+ AW+  
Sbjct: 418 GALESVNQGPTTARHFPNSTMVGTGMAPEGIGQNEVVYALMAELGWQKDPVADLGAWVTS 477

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           ++ RRYG S    + AW +L  +VYNC                       S  E +  N+
Sbjct: 478 FAARRYGVSHGDAEAAWRLLLRSVYNC-----------------------SGEECRGHNH 514

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
             P+ +   L+  T+      +WY+ S+V  A  L + + + L++S  +RYDL+D+TRQA
Sbjct: 515 -SPLVRRPSLQMVTT------VWYNRSDVFEAWRLLLTATSTLASSPAFRYDLVDVTRQA 567

Query: 366 LAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
           + +  +  +  +  AY       L  A G+        EL+  +D +LA    FLLG WL
Sbjct: 568 VQELVSLYYEEMRTAYLKKELVPLTRAGGILA-----YELLPALDQVLASDCHFLLGSWL 622

Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
           E A+Q A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ DYY PR  +
Sbjct: 623 EQARQAAVSETEAHFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRL 677

Query: 480 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           + + ++ESL  G  F+   + R   +L   +  G   YP +  GD +   + L+ KY
Sbjct: 678 FTETLVESLVQGVPFQQHQFDRNAFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKY 734


>gi|270005801|gb|EFA02249.1| hypothetical protein TcasGA2_TC007912 [Tribolium castaneum]
          Length = 747

 Score =  367 bits (942), Expect = 8e-99,   Method: Compositional matrix adjust.
 Identities = 192/535 (35%), Positives = 314/535 (58%), Gaps = 46/535 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G++P A + ++P A ++++  W     +  +CC Y LD T+PLF EIG+AF+ +
Sbjct: 238 VLPAFAGHLPRAFKTLYPDANMSKMAPWNGF--NDTYCCPYFLDPTEPLFNEIGKAFLSE 295

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q+ E+G T H+YNCD+F+EN P      Y++++G +IY  M   D DAVWL+QGW+F  D
Sbjct: 296 QISEFG-TDHMYNCDSFNENVPTSGDLTYLANVGKSIYKAMTDTDPDAVWLLQGWMFYND 354

Query: 128 PFWRPPQ-MKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
            FW+  + ++++L SVPLGK++VLDL +E  P +    Q++G PYIWCMLH+F G + M+
Sbjct: 355 NFWQDTERVRSILTSVPLGKMIVLDLQSEQFPQYERLNQYFGQPYIWCMLHDFGGTLGMF 414

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G    I   P++AR  EN+TM+G G++ EGI QN V+Y+LM+E A++   V++  W  +Y
Sbjct: 415 GSSTVINEVPIKARHLENSTMIGTGLTPEGINQNYVIYELMTETAWRQAPVNLTEWFEKY 474

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           S RRYG      ++AW +L  TVY                    D   ++   GKY    
Sbjct: 475 STRRYGFPDSDAENAWRILQRTVY--------------------DYQGLNRMRGKY---- 510

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
             ++K   LK +  +      WYST++++ A    + + + L A++ Y +DL+D+TRQ L
Sbjct: 511 -AITKSPSLKIKIWT------WYSTNDLLEAWTSLLEASDNLGANSGYLHDLVDVTRQVL 563

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
             Y +  +  +++ YQ +D+      S++FLE+++D+D +L+ +  FLLGPWLE+AK+ A
Sbjct: 564 QVYGDLYYKEMVKNYQSHDSANFQANSKKFLEILDDLDEILSTNSAFLLGPWLEAAKKAA 623

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            +  +E Q+E+NAR QIT+W    +     + DY NK W+G++  ++ PR  ++  Y+  
Sbjct: 624 NDSAEEAQFEYNARNQITLWGPRGE-----IMDYANKQWAGVVSHFFAPRWYLFINYLNS 678

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL-----ITSQWLYNKY 536
           + +        D  + + ++   +   R  +PVE  GDA+     I  +W   +Y
Sbjct: 679 TFDGAFNQTYID-AKMFKEVEEPFTFDRTEFPVEPIGDAVEIAWKIHKKWTSEEY 732


>gi|126307960|ref|XP_001366343.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Monodelphis
           domestica]
          Length = 741

 Score =  367 bits (942), Expect = 9e-99,   Method: Compositional matrix adjust.
 Identities = 193/533 (36%), Positives = 296/533 (55%), Gaps = 43/533 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G++P A   VFP A +T LG W     +  + C+YLL   DPLF  +G  F+ +
Sbjct: 239 VLPAFAGHIPKAFTRVFPQANVTNLGMWGHFSCN--YSCSYLLAPEDPLFPVVGSLFLRE 296

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY+ D F+E  PP  +P Y+++  AA+Y  M + D DAVWL QGWLF   
Sbjct: 297 LTKEFG-TDHIYSADIFNEMDPPSSNPAYLAATTAAVYEAMVAVDVDAVWLFQGWLFQNH 355

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW+PPQMKA+L +VP G+ ++LDLFAE +P++S +  FYG P+IWCMLHNF GN  ++
Sbjct: 356 PDFWKPPQMKAVLEAVPRGRFLILDLFAESQPVYSRTNSFYGQPFIWCMLHNFGGNHGLF 415

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G+LD++  GP  AR   N+T+VG G+  EGI QN VVY LM+E+ ++ +   D+ AW+  
Sbjct: 416 GVLDAVNRGPSTARLFPNSTIVGTGIVPEGINQNEVVYALMAELGWRKDPFPDLGAWVAG 475

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ +RYG   P  + AW +L  +VYNC+ +  T  N   +V  P +              
Sbjct: 476 FAAQRYGTPHPQAEAAWRLLLRSVYNCSWENCTGHNHSPLVKRPSLHLDF---------- 525

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                                 +WY+ S+V  A  L + +  +L+ S+ +RYDL+D+TRQ
Sbjct: 526 ---------------------SVWYNRSDVFEAWRLLLEAAPQLATSSAFRYDLLDVTRQ 564

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
              +  +  +  +  A++      +       + +L+  +D LL   + FLLG WLE A+
Sbjct: 565 VAQELVSLYYGELKTAFEAGSMPALLSAGGLLVFDLLPSLDELLGTDERFLLGGWLEQAR 624

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
           ++A +E +   YE NAR Q+T+W          + DY NK  +GL+  YY PR  ++ + 
Sbjct: 625 EMAVSEAEAWHYEQNARYQLTLWGPTGN-----ILDYANKQLAGLVAGYYAPRWKLFVEM 679

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           +++SL  G  F    +  E   L   + +GR  +P +  GD +  ++  + KY
Sbjct: 680 LVKSLAEGTPFHQNQFENEAFLLGQAFVSGREKFPTQPQGDTVDLARKFFLKY 732


>gi|281344539|gb|EFB20123.1| hypothetical protein PANDA_011160 [Ailuropoda melanoleuca]
          Length = 619

 Score =  367 bits (941), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 201/533 (37%), Positives = 306/533 (57%), Gaps = 43/533 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP AL  VFP   +TQLG+W        + C++LL   DPLF  IG  F+ +
Sbjct: 118 VLPAFAGHVPKALTRVFPQVNVTQLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRE 175

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY  DTF+E  PP   P Y+++  A++Y  M + D DAVWL+QGWLF + 
Sbjct: 176 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATASVYQAMITVDPDAVWLLQGWLFQHQ 234

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+ A+L +VP G+L+VLDLFAE +P++  +  F+G P+IWCMLHNF GN  ++
Sbjct: 235 PEFWGPAQVTAVLGAVPRGRLLVLDLFAESQPVYIRTASFHGQPFIWCMLHNFGGNHGLF 294

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TM G GM+ EGI QN +VY LM+E+ ++ + V D++AW++ 
Sbjct: 295 GALEAVNRGPAAARLFPNSTMAGTGMAPEGIGQNEMVYALMAELGWRKDPVADLEAWVSS 354

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
            + RRYG +    + AW +L  +VYNC+ +  +  NR  +V  P                
Sbjct: 355 SAARRYGVTHKDTEAAWRLLLRSVYNCSGEACSGHNRSPLVRRPS--------------- 399

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                     L+  T+      +WY+ S+V  A  L + +   L+AS ++RYDL+D+TRQ
Sbjct: 400 ----------LQMATA------VWYNRSDVFEAWRLLLTAAPTLAASPSFRYDLLDVTRQ 443

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
           A  +  +  +     AY   +   + + + R + EL+  +D +LA    FLLG WLE A+
Sbjct: 444 AAQELVSLYYEEARAAYLNKELVPLLRAAGRLVYELLPALDKVLASDRRFLLGSWLEQAR 503

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
             A +E + + YE N+R Q+T+W      E ++L DY NK  +GL+ DYY PR  ++ + 
Sbjct: 504 AAAVSEAEARFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYAPRWGLFMEM 558

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++ESL  G  F+   + +   +L   +      YP +  GD +  ++ L+ KY
Sbjct: 559 LVESLAQGIPFQQHQFDKNAFQLEQAFVFSTQRYPSQPQGDTVDLAKKLFLKY 611


>gi|91080563|ref|XP_973259.1| PREDICTED: similar to alpha-N-acetyl glucosaminidase [Tribolium
           castaneum]
          Length = 747

 Score =  366 bits (940), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 193/535 (36%), Positives = 313/535 (58%), Gaps = 46/535 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G++P A + ++P A ++++  W     +  +CC Y LD T+PLF EIG+AF+ +
Sbjct: 238 VLPAFAGHLPRAFKTLYPDANMSKMAPWNGF--NDTYCCPYFLDPTEPLFNEIGKAFLSE 295

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q+ E+G T H+YNCD+F+EN P      Y++++G +IY  M   D DAVW+MQGWLF++D
Sbjct: 296 QISEFG-TDHMYNCDSFNENVPTSGDLTYLANVGKSIYKAMTDTDPDAVWVMQGWLFAHD 354

Query: 128 PF-WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
            F W   + KA+L +VP GK++VLDL +E  P +    Q++G PYIWCMLH+F G + M+
Sbjct: 355 FFYWTRNRAKAILTAVPKGKMIVLDLQSEQFPQYERLNQYFGQPYIWCMLHDFGGTLGMF 414

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G    I   P++AR  EN+TM+G G++ EGI QN V+Y+LM+E A++   V++  W  +Y
Sbjct: 415 GSSTVINEVPIKARHLENSTMIGTGLTPEGINQNYVIYELMTETAWRQAPVNLTEWFEKY 474

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           S RRYG      ++AW +L  TVY                    D   ++   GKY    
Sbjct: 475 STRRYGFPDSDAENAWRILQRTVY--------------------DYQGLNRMRGKY---- 510

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
             ++K   LK +  +      WYST++++ A    + + + L A++ Y +DL+D+TRQ L
Sbjct: 511 -AITKSPSLKIKIWT------WYSTNDLLEAWTSLLEASDNLGANSGYLHDLVDVTRQVL 563

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
             Y +  +  +++ YQ +D+      S++FLE+++D+D +L+ +  FLLGPWLE+AK+ A
Sbjct: 564 QVYGDLYYKEMVKNYQSHDSANFQANSKKFLEILDDLDEILSTNSAFLLGPWLEAAKKAA 623

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            +  +E Q+E+NAR QIT+W    +     + DY NK W+G++  ++ PR  ++  Y+  
Sbjct: 624 NDSAEEAQFEYNARNQITLWGPRGE-----IMDYANKQWAGVVSHFFAPRWYLFINYLNS 678

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL-----ITSQWLYNKY 536
           + +        D  + + ++   +   R  +PVE  GDA+     I  +W   +Y
Sbjct: 679 TFDGAFNQTYID-AKMFKEVEEPFTFDRTEFPVEPIGDAVEIAWKIHKKWTSEEY 732


>gi|301773566|ref|XP_002922216.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Ailuropoda
           melanoleuca]
          Length = 634

 Score =  366 bits (940), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 201/533 (37%), Positives = 306/533 (57%), Gaps = 43/533 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP AL  VFP   +TQLG+W        + C++LL   DPLF  IG  F+ +
Sbjct: 132 VLPAFAGHVPKALTRVFPQVNVTQLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRE 189

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY  DTF+E  PP   P Y+++  A++Y  M + D DAVWL+QGWLF + 
Sbjct: 190 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATASVYQAMITVDPDAVWLLQGWLFQHQ 248

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+ A+L +VP G+L+VLDLFAE +P++  +  F+G P+IWCMLHNF GN  ++
Sbjct: 249 PEFWGPAQVTAVLGAVPRGRLLVLDLFAESQPVYIRTASFHGQPFIWCMLHNFGGNHGLF 308

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TM G GM+ EGI QN +VY LM+E+ ++ + V D++AW++ 
Sbjct: 309 GALEAVNRGPAAARLFPNSTMAGTGMAPEGIGQNEMVYALMAELGWRKDPVADLEAWVSS 368

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
            + RRYG +    + AW +L  +VYNC+ +  +  NR  +V  P                
Sbjct: 369 SAARRYGVTHKDTEAAWRLLLRSVYNCSGEACSGHNRSPLVRRPS--------------- 413

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                     L+  T+      +WY+ S+V  A  L + +   L+AS ++RYDL+D+TRQ
Sbjct: 414 ----------LQMATA------VWYNRSDVFEAWRLLLTAAPTLAASPSFRYDLLDVTRQ 457

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
           A  +  +  +     AY   +   + + + R + EL+  +D +LA    FLLG WLE A+
Sbjct: 458 AAQELVSLYYEEARAAYLNKELVPLLRAAGRLVYELLPALDKVLASDRRFLLGSWLEQAR 517

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
             A +E + + YE N+R Q+T+W      E ++L DY NK  +GL+ DYY PR  ++ + 
Sbjct: 518 AAAVSEAEARFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYAPRWGLFMEM 572

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++ESL  G  F+   + +   +L   +      YP +  GD +  ++ L+ KY
Sbjct: 573 LVESLAQGIPFQQHQFDKNAFQLEQAFVFSTQRYPSQPQGDTVDLAKKLFLKY 625


>gi|410981277|ref|XP_003996997.1| PREDICTED: alpha-N-acetylglucosaminidase [Felis catus]
          Length = 857

 Score =  366 bits (939), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 203/533 (38%), Positives = 304/533 (57%), Gaps = 43/533 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   +TQLG+W        + C++LL   DPLF  IG  F+ +
Sbjct: 355 VLPAFAGHVPKAITRVFPQVNVTQLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRE 412

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY  DTF+E  PP   P Y++S  A++Y  M + D DAVWL+QGWLF + 
Sbjct: 413 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLASATASVYQAMVTVDPDAVWLLQGWLFQHQ 471

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+ A+L +VP G+L+VLDLFAE +P++  +  F G P+IWCMLHNF GN  ++
Sbjct: 472 PQFWGPAQVSAVLGAVPRGRLLVLDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLF 531

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D++AW+  
Sbjct: 532 GALEAVNRGPAAARLFPNSTMVGTGMAPEGIGQNEVVYALMAELGWRKDPVADLEAWVTG 591

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ RRYG S    + AW +L  +VYNC+ +  +  NR  +V  P                
Sbjct: 592 FAARRYGVSHGNTEAAWRLLLRSVYNCSGEACSGHNRSPLVRRPS--------------- 636

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                     LK  T+      +WY+ S+V  A  L + +   L+ S T+RYDL+D+TRQ
Sbjct: 637 ----------LKMTTT------VWYNRSDVFEAWRLLLTTTPSLATSPTFRYDLLDVTRQ 680

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
           A  +  +  +     AY   +   + + +   + EL+  +D +LA    FLLG WLE A+
Sbjct: 681 AAQELVSLYYGEARTAYLNKELVPLLRAAGILVYELLPSLDKVLASDSRFLLGSWLEQAR 740

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
             A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ DYY PR  ++ + 
Sbjct: 741 AAAVSEAEAHFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYTPRWRLFMEM 795

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++ESL  G  F+   + +   +L   +      YP + +GD +  ++ L+ +Y
Sbjct: 796 LVESLVRGVPFQQHQFDQNAFQLEQTFVLSTQRYPSQPHGDTVDLAKKLFLRY 848


>gi|301626955|ref|XP_002942650.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Xenopus (Silurana)
           tropicalis]
          Length = 759

 Score =  364 bits (935), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 195/541 (36%), Positives = 299/541 (55%), Gaps = 51/541 (9%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    VLPAF+G++P  +  VFP   +++LG W +      + C+YLLD  DPLF  IG 
Sbjct: 244 LGMITVLPAFAGHIPEGILRVFPKVTVSRLGGWSNFNCT--YSCSYLLDPEDPLFQWIGE 301

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F+ Q ++ +G T HIY+ DTF+E +P    P Y+S++  AI+  M   D DA+WLMQGW
Sbjct: 302 LFLSQMVQSFG-TDHIYSADTFNEMSPTSSDPGYLSAVSGAIFKSMAKVDPDAIWLMQGW 360

Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           LF  +P FWRP Q KALL+  P+G+++VLDLFAE  P++ T++ FYG P+IWCML+NF G
Sbjct: 361 LFINNPSFWRPAQTKALLHGAPIGRIIVLDLFAETVPVYLTTESFYGQPFIWCMLNNFGG 420

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
           N  ++G ++ +  GP +A    N+TMVG G++ EGIEQN ++Y+ M+E+ +  + +++  
Sbjct: 421 NHGLFGNIEGVNRGPFDAAKFPNSTMVGTGLTPEGIEQNDMIYEFMNEIGWSSQPINLTK 480

Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           WI+ YS RRYG+S    + AW +L  +VYNCT    + N   +V  P ++ +        
Sbjct: 481 WISNYSDRRYGQSNTDARMAWQILLRSVYNCTQILHNHNHSPLVRRPSLNMNT------- 533

Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
                                    + Y+ +++  A      +   L  S T+ YDL+D+
Sbjct: 534 ------------------------DICYNKADIYEAWRFMHNASFALGKSATFLYDLVDI 569

Query: 362 TRQALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
           TR+A+ +  +E +L I EAY      QL  A GV        +L+ ++D LL+   GFLL
Sbjct: 570 TREAVQQLVSEYYLEIKEAYGKKSLQQLMTAGGVL-----VYDLLPELDSLLSSQPGFLL 624

Query: 416 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
           G WL++AK +A    +   Y+ NAR QIT+W          + DY NK + GL++DYY  
Sbjct: 625 GSWLKAAKSMASTPAEAALYDMNARNQITLWGPTGN-----ILDYANKQYGGLVQDYYTE 679

Query: 476 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 535
           R  ++  ++++SL  G+ F    + +    L  D+      Y     GD L  +  +Y K
Sbjct: 680 RWGLFVWFLVQSLNKGEHFNQDKFNKAVFVLEEDFVYNGKEYMASPTGDTLEIANKIYLK 739

Query: 536 Y 536
           Y
Sbjct: 740 Y 740


>gi|148671928|gb|EDL03875.1| alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB), isoform
           CRA_a [Mus musculus]
          Length = 538

 Score =  364 bits (934), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 198/533 (37%), Positives = 302/533 (56%), Gaps = 43/533 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   + +LG+W        + C++LL   DP+F  IG  F+ +
Sbjct: 38  VLPAFAGHVPKAITRVFPQVNVIKLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRE 95

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY  DTF+E  PP   P Y+++  AA+Y  M + D DAVWL+QGWLF + 
Sbjct: 96  LTKEFG-TDHIYGADTFNEMQPPFSDPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQ 154

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q++A+L +VP G+L+VLDLFAE  P++  +  F+G P+IWCMLHNF GN  ++
Sbjct: 155 PQFWGPSQIRAVLEAVPRGRLLVLDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLF 214

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+ +  GP  AR   N+TMVG G++ EGI QN VVY LM+E+ ++ + V D+ AW++ 
Sbjct: 215 GALEDVNRGPQAARLFPNSTMVGTGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSS 274

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           +++RRYG S P    AW +L  +VYNC+ +  +  NR                       
Sbjct: 275 FAIRRYGVSQPDAVAAWKLLLRSVYNCSGEACSGHNRS---------------------- 312

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
              P+ K   L+  T+      +WY+ S+V  A  L + +   L+ S  +RYDL+D+TRQ
Sbjct: 313 ---PLVKRPSLQMSTA------VWYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQ 363

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
           A+ +  +  +     AY   +   + +     + +L+  +D LLA    FLLG WL+ A+
Sbjct: 364 AVQELVSLCYEEARTAYLKQELDLLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQAR 423

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
           + A +E + + YE N+R QIT+W      E ++L DY NK  +GL+ DYY PR  ++   
Sbjct: 424 KAAVSEAEAQFYEQNSRYQITLW----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGT 478

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           +  SL  G  F+  ++ +    L   +   +  YP +  GD +  S+ ++ KY
Sbjct: 479 LAHSLARGVPFQQHEFEKNVFPLEQAFVYNKKRYPSQPRGDTVDLSKKIFLKY 531


>gi|1171229|gb|AAC50512.1| alpha-N-acetylglucosaminidase [Homo sapiens]
 gi|1171231|gb|AAC50513.1| alpha-N-acetylglucosaminidase [Homo sapiens]
 gi|1197840|gb|AAB06188.1| alpha-N-acetylglucosaminidase [Homo sapiens]
 gi|1479981|gb|AAB36604.1| alpha-N-acetylglucosaminidase [Homo sapiens]
 gi|32450702|gb|AAH53991.1| N-acetylglucosaminidase, alpha- [Homo sapiens]
 gi|119581237|gb|EAW60833.1| N-acetylglucosaminidase, alpha- (Sanfilippo disease IIIB), isoform
           CRA_b [Homo sapiens]
          Length = 743

 Score =  363 bits (933), Expect = 8e-98,   Method: Compositional matrix adjust.
 Identities = 198/536 (36%), Positives = 304/536 (56%), Gaps = 43/536 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   +T++G+W        + C++LL   DP+F  IG  F+ +
Sbjct: 241 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRE 298

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            +KE+G T HIY  DTF+E  PP   P Y+++   A+Y  M + D++AVWL+QGWLF + 
Sbjct: 299 LIKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQ 357

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q++A+L +VP G+L+VLDLFAE +P+++ +  F G P+IWCMLHNF GN  ++
Sbjct: 358 PQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 417

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  
Sbjct: 418 GALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTS 477

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ RRYG S P    AW +L  +VYNC+ +     NR  +V  P                
Sbjct: 478 FAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 522

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                     L+  TS      +WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQ
Sbjct: 523 ----------LQMNTS------IWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQ 566

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
           A+ +  +  +     AY   +   + +       EL+  +D +LA    FLLG WLE A+
Sbjct: 567 AVQELVSLYYEEARSAYLSKELASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQAR 626

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
             A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ +YY PR  ++ + 
Sbjct: 627 AAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEA 681

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 539
           +++S+  G  F+   + +   +L   +   +  YP +  GD +  ++ ++ KY  G
Sbjct: 682 LVDSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKYYPG 737


>gi|357458269|ref|XP_003599415.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
 gi|355488463|gb|AES69666.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
          Length = 539

 Score =  363 bits (933), Expect = 8e-98,   Method: Compositional matrix adjust.
 Identities = 171/259 (66%), Positives = 199/259 (76%), Gaps = 27/259 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP+FSGNVPAAL  +FPSAKIT+LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +
Sbjct: 271 VLPSFSGNVPAALTKIFPSAKITRLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRK 330

Query: 68  QLK--------------------------EYGRTSHIYNCDTFDENTPPVDSPEYISSLG 101
           Q+K                          EYG  + IYNCDTF+EN+PP   P YIS+LG
Sbjct: 331 QIKATETIHQESEDLGSLIIMDRAVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLG 390

Query: 102 AAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIW 160
           AA+Y G+  GD DAVWLMQGWLF  D  FW+PPQMKALL SVP GK++VLDLFA+VKPIW
Sbjct: 391 AAVYQGISKGDKDAVWLMQGWLFYSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIW 450

Query: 161 STSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQN 220
            TS QFYG PYIWCMLHNF GNIEMYG+LD+IA GPV+AR SEN+TMVGVGM MEGIE N
Sbjct: 451 KTSFQFYGTPYIWCMLHNFGGNIEMYGVLDAIASGPVDARVSENSTMVGVGMCMEGIEHN 510

Query: 221 PVVYDLMSEMAFQHEKVDV 239
           P+VY+LMSEMAF+ EKV +
Sbjct: 511 PIVYELMSEMAFRDEKVKI 529


>gi|1479983|gb|AAB36605.1| alpha-N-acetylglucosaminidase [Homo sapiens]
 gi|119581236|gb|EAW60832.1| N-acetylglucosaminidase, alpha- (Sanfilippo disease IIIB), isoform
           CRA_a [Homo sapiens]
          Length = 639

 Score =  363 bits (931), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 198/536 (36%), Positives = 304/536 (56%), Gaps = 43/536 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   +T++G+W        + C++LL   DP+F  IG  F+ +
Sbjct: 137 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRE 194

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            +KE+G T HIY  DTF+E  PP   P Y+++   A+Y  M + D++AVWL+QGWLF + 
Sbjct: 195 LIKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQ 253

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q++A+L +VP G+L+VLDLFAE +P+++ +  F G P+IWCMLHNF GN  ++
Sbjct: 254 PQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 313

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  
Sbjct: 314 GALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTS 373

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ RRYG S P    AW +L  +VYNC+ +     NR  +V  P                
Sbjct: 374 FAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 418

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                     L+  TS      +WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQ
Sbjct: 419 ----------LQMNTS------IWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQ 462

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
           A+ +  +  +     AY   +   + +       EL+  +D +LA    FLLG WLE A+
Sbjct: 463 AVQELVSLYYEEARSAYLSKELASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQAR 522

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
             A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ +YY PR  ++ + 
Sbjct: 523 AAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEA 577

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 539
           +++S+  G  F+   + +   +L   +   +  YP +  GD +  ++ ++ KY  G
Sbjct: 578 LVDSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKYYPG 633


>gi|66346698|ref|NP_000254.2| alpha-N-acetylglucosaminidase precursor [Homo sapiens]
 gi|317373322|sp|P54802.2|ANAG_HUMAN RecName: Full=Alpha-N-acetylglucosaminidase; AltName:
           Full=N-acetyl-alpha-glucosaminidase; Short=NAG;
           Contains: RecName: Full=Alpha-N-acetylglucosaminidase 82
           kDa form; Contains: RecName:
           Full=Alpha-N-acetylglucosaminidase 77 kDa form; Flags:
           Precursor
          Length = 743

 Score =  362 bits (930), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 197/533 (36%), Positives = 303/533 (56%), Gaps = 43/533 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   +T++G+W        + C++LL   DP+F  IG  F+ +
Sbjct: 241 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRE 298

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            +KE+G T HIY  DTF+E  PP   P Y+++   A+Y  M + D++AVWL+QGWLF + 
Sbjct: 299 LIKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQ 357

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q++A+L +VP G+L+VLDLFAE +P+++ +  F G P+IWCMLHNF GN  ++
Sbjct: 358 PQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 417

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  
Sbjct: 418 GALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTS 477

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ RRYG S P    AW +L  +VYNC+ +     NR  +V  P                
Sbjct: 478 FAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 522

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                     L+  TS      +WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQ
Sbjct: 523 ----------LQMNTS------IWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQ 566

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
           A+ +  +  +     AY   +   + +       EL+  +D +LA    FLLG WLE A+
Sbjct: 567 AVQELVSLYYEEARSAYLSKELASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQAR 626

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
             A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ +YY PR  ++ + 
Sbjct: 627 AAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEA 681

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           +++S+  G  F+   + +   +L   +   +  YP +  GD +  ++ ++ KY
Sbjct: 682 LVDSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 734


>gi|2660688|gb|AAB88084.1| Naglu [Mus musculus]
          Length = 739

 Score =  362 bits (930), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 198/533 (37%), Positives = 302/533 (56%), Gaps = 43/533 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   + +LG+W        + C++LL   DP+F  IG  F+ +
Sbjct: 239 VLPAFAGHVPKAITRVFPQVNVIKLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRE 296

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY  DTF+E  PP   P Y+++  AA+Y  M + D DAVWL+QGWLF + 
Sbjct: 297 LTKEFG-TDHIYGADTFNEMQPPFSEPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQ 355

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q++A+L +VP G+L+VLDLFAE  P++  +  F+G P+IWCMLHNF GN  ++
Sbjct: 356 PQFWGPSQIRAVLEAVPRGRLLVLDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLF 415

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+ +  GP  AR   N+TMVG G++ EGI QN VVY LM+E+ ++ + V D+ AW++ 
Sbjct: 416 GALEDVNRGPQAARLFPNSTMVGTGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSS 475

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           +++RRYG S P    AW +L  +VYNC+ +  +  NR                       
Sbjct: 476 FAIRRYGVSQPDAVAAWKLLLRSVYNCSGEACSGHNRS---------------------- 513

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
              P+ K   L+  T+      +WY+ S+V  A  L + +   L+ S  +RYDL+D+TRQ
Sbjct: 514 ---PLVKRPSLQMSTA------VWYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQ 564

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
           A+ +  +  +     AY   +   + +     + +L+  +D LLA    FLLG WL+ A+
Sbjct: 565 AVQELVSLCYEEARTAYLKQELDLLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQAR 624

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
           + A +E + + YE N+R QIT+W      E ++L DY NK  +GL+ DYY PR  ++   
Sbjct: 625 KAAVSEAEAQFYEQNSRYQITLW----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGT 679

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           +  SL  G  F+  ++ +    L   +   +  YP +  GD +  S+ ++ KY
Sbjct: 680 LAHSLARGVPFQQHEFEKNVFPLEQAFVYNKKRYPSQPRGDTVDLSKKIFLKY 732


>gi|254910995|ref|NP_038820.2| alpha-N-acetylglucosaminidase precursor [Mus musculus]
 gi|20385160|gb|AAM21194.1|AF363242_1 N-acetyl-glucosaminidase [Mus musculus]
 gi|3329361|gb|AAC26842.1| alpha-N-acetylglucosaminidase [Mus musculus]
 gi|33585908|gb|AAH55733.1| Alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB) [Mus
           musculus]
 gi|74211094|dbj|BAE37639.1| unnamed protein product [Mus musculus]
 gi|74218052|dbj|BAE42009.1| unnamed protein product [Mus musculus]
 gi|148671929|gb|EDL03876.1| alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB), isoform
           CRA_b [Mus musculus]
          Length = 739

 Score =  362 bits (930), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 198/533 (37%), Positives = 302/533 (56%), Gaps = 43/533 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   + +LG+W        + C++LL   DP+F  IG  F+ +
Sbjct: 239 VLPAFAGHVPKAITRVFPQVNVIKLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRE 296

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY  DTF+E  PP   P Y+++  AA+Y  M + D DAVWL+QGWLF + 
Sbjct: 297 LTKEFG-TDHIYGADTFNEMQPPFSDPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQ 355

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q++A+L +VP G+L+VLDLFAE  P++  +  F+G P+IWCMLHNF GN  ++
Sbjct: 356 PQFWGPSQIRAVLEAVPRGRLLVLDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLF 415

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+ +  GP  AR   N+TMVG G++ EGI QN VVY LM+E+ ++ + V D+ AW++ 
Sbjct: 416 GALEDVNRGPQAARLFPNSTMVGTGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSS 475

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           +++RRYG S P    AW +L  +VYNC+ +  +  NR                       
Sbjct: 476 FAIRRYGVSQPDAVAAWKLLLRSVYNCSGEACSGHNRS---------------------- 513

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
              P+ K   L+  T+      +WY+ S+V  A  L + +   L+ S  +RYDL+D+TRQ
Sbjct: 514 ---PLVKRPSLQMSTA------VWYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQ 564

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
           A+ +  +  +     AY   +   + +     + +L+  +D LLA    FLLG WL+ A+
Sbjct: 565 AVQELVSLCYEEARTAYLKQELDLLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQAR 624

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
           + A +E + + YE N+R QIT+W      E ++L DY NK  +GL+ DYY PR  ++   
Sbjct: 625 KAAVSEAEAQFYEQNSRYQITLW----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGT 679

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           +  SL  G  F+  ++ +    L   +   +  YP +  GD +  S+ ++ KY
Sbjct: 680 LAHSLARGVPFQQHEFEKNVFPLEQAFVYNKKRYPSQPRGDTVDLSKKIFLKY 732


>gi|114667172|ref|XP_523654.2| PREDICTED: alpha-N-acetylglucosaminidase isoform 2 [Pan
           troglodytes]
 gi|410216584|gb|JAA05511.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
 gi|410258938|gb|JAA17435.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
 gi|410304442|gb|JAA30821.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
 gi|410337929|gb|JAA37911.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
          Length = 743

 Score =  361 bits (927), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 197/533 (36%), Positives = 302/533 (56%), Gaps = 43/533 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   +T++G+W        + C++LL   DP+F  IG  F+ +
Sbjct: 241 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRE 298

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            +KE+G T HIY  DTF+E  PP   P Y+++   A+Y  M + D++AVWL+QGWLF + 
Sbjct: 299 LIKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQ 357

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q++A+L +VP G+L+VLDLFAE +P+++ +  F G P+IWCMLHNF GN  ++
Sbjct: 358 PQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 417

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  
Sbjct: 418 GALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTS 477

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ RRYG S P    AW +L  +VYNC+ +     NR  +V  P                
Sbjct: 478 FAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 522

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                     L+  TS      +WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQ
Sbjct: 523 ----------LQMNTS------IWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQ 566

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
           A+ +  +  +     AY   +   + +       EL+  +D +LA    FLLG WLE A+
Sbjct: 567 AVQELVSLYYEEARSAYLSKELASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQAR 626

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
             A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ +YY PR  ++ + 
Sbjct: 627 AAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEA 681

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           + +S+  G  F+   + +   +L   +   +  YP +  GD +  ++ ++ KY
Sbjct: 682 LADSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 734


>gi|311267179|ref|XP_003131436.1| PREDICTED: alpha-N-acetylglucosaminidase [Sus scrofa]
          Length = 744

 Score =  361 bits (926), Expect = 7e-97,   Method: Compositional matrix adjust.
 Identities = 204/538 (37%), Positives = 301/538 (55%), Gaps = 53/538 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP AL  VFP   +TQ+G+W        + C++LL   DPLF  +G  F+ +
Sbjct: 242 VLPAFAGHVPKALTRVFPQISVTQMGSWGHFNCS--YSCSFLLAPEDPLFPIVGSLFLRE 299

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY  DTF+E  PP   P Y+++  AA+Y  M + D DAVWL+QGWLF + 
Sbjct: 300 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATAAVYQAMITVDPDAVWLLQGWLFQHQ 358

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+ A+L +VP G+L+VLDLFAE +P++  +  F G P+IWCMLHNF GN  ++
Sbjct: 359 PQFWGPAQVGAVLGAVPRGRLLVLDLFAESQPVYVRTASFLGQPFIWCMLHNFGGNHGLF 418

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+S+  GP  AR   N+TM G GM+ EGI QN VVY LM+E+ ++ + V D+  W+  
Sbjct: 419 GALESVNQGPAAARLFPNSTMAGTGMAPEGIGQNEVVYALMAELGWRKDPVADLGTWVTS 478

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ RRYG S    + AW +L  +VYNC+ +G T  NR  +V  P +  +           
Sbjct: 479 FAARRYGVSQGDAEAAWRLLLRSVYNCSGEGCTGHNRSPLVRRPSLQMATT--------- 529

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                                 +WY+ S+V  A  L + +   L++S  +RYDL+D+TRQ
Sbjct: 530 ----------------------VWYNQSDVFEAWRLLLKATPTLASSPAFRYDLVDITRQ 567

Query: 365 ALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
           A+ +  +  +     AY       L  A G+        EL+  +D +LA    FLLG W
Sbjct: 568 AVQELVSLYYEEARTAYLNKELVSLMRAGGILA-----YELLPALDKVLASDSHFLLGSW 622

Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
           LE A+ +A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ DYY PR  
Sbjct: 623 LEQARGVAVSEAEALFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYTPRWR 677

Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++ + ++ESL  G  F+   + +   +L   +  G   YP +  GD +  ++ L+ KY
Sbjct: 678 LFMEMLVESLVQGIPFQQHQFDQNVFQLEQTFVLGTRRYPSQPQGDTVDLAKKLFLKY 735


>gi|380030624|ref|XP_003698943.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
           [Apis florea]
          Length = 769

 Score =  360 bits (925), Expect = 7e-97,   Method: Compositional matrix adjust.
 Identities = 197/552 (35%), Positives = 304/552 (55%), Gaps = 49/552 (8%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    VLPAF+G+VP A   +FP A +T+   W +     ++CC YLL+  DPLF +IG+
Sbjct: 248 LGIIPVLPAFAGHVPRAFLRLFPKANVTKSAVWNNFSD--KYCCPYLLEPMDPLFKQIGQ 305

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F++  ++E+G T H+YNCDTF+EN P     +++ ++G +I+  M + DS A+WLMQGW
Sbjct: 306 QFLKTYIEEFG-TDHVYNCDTFNENEPYTSELKFLRNIGHSIFEAMSNVDSKAIWLMQGW 364

Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           LF +D  FW  P+ +  L S+PLG+++VLDL +E  P +     +YG P+IWCMLHNF G
Sbjct: 365 LFYHDSVFWTEPRTRTFLTSIPLGRMIVLDLQSEQFPQYKRLNSYYGQPFIWCMLHNFGG 424

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
            + M+G  + I     EAR    +TMVG G++ EGI QN V+Y+LM+EMA++   V++  
Sbjct: 425 TLGMFGSAEIINHRIFEARNMNGSTMVGTGLTPEGINQNYVIYELMNEMAYRKRPVNLDK 484

Query: 242 WINQYSVRRYG--RSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
           W   Y+ RRYG  +       AW    +TVYN +D    + +  I   P+++ S      
Sbjct: 485 WFENYANRRYGDTKGNEHTVTAWKGFKNTVYNFSDTRRIRGKYAITIRPNLNFS------ 538

Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
                                    P  WY+    I    + + + +    S  YR+D++
Sbjct: 539 -------------------------PWRWYNKDAFIHYWYMLLQARDLKRNSTLYRHDVV 573

Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
           D+TRQAL   A+E++ ++IE++   +     Q ++  L L +D++ +LA  + FLLG WL
Sbjct: 574 DVTRQALQLIADEIYTDLIESFNKKNIDLFKQNAKLLLALFDDLEEILASSEDFLLGKWL 633

Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
           + AK LA N+E+E  YE+NAR QIT+W          +RDY NK WSG++ DY+ PR AI
Sbjct: 634 KMAKDLATNDEEEILYEYNARNQITLW-----GPLGEIRDYANKQWSGIVADYFKPRWAI 688

Query: 480 YFKYMIESLESGDGFRLKDWRREWIK-LTNDWQNGRNVYPVESNGDAL-----ITSQWLY 533
           +   +  SL +G          +  + +   +   R +YP ++ GD++     I S+W Y
Sbjct: 689 FLNELETSLTTGTRVNTTKMNEQIFENVEEAFTFSRKIYPTKATGDSIDIAERILSEW-Y 747

Query: 534 NKYLQGTGVFDH 545
           + YL     F H
Sbjct: 748 DPYLPFHKTFRH 759


>gi|426348060|ref|XP_004041658.1| PREDICTED: alpha-N-acetylglucosaminidase [Gorilla gorilla gorilla]
          Length = 743

 Score =  360 bits (924), Expect = 9e-97,   Method: Compositional matrix adjust.
 Identities = 197/533 (36%), Positives = 302/533 (56%), Gaps = 43/533 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   +T++G+W        + C++LL   DP+F  IG  F+ +
Sbjct: 241 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRE 298

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            +KE+G T HIY  DTF+E  PP   P Y+++   A+Y  M + D++AVWL+QGWLF + 
Sbjct: 299 LIKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQ 357

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q++A+L +VP G+L+VLDLFAE +P+++ +  F G P+IWCMLHNF GN  ++
Sbjct: 358 PQFWGPAQIEAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 417

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  
Sbjct: 418 GALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTS 477

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ RRYG S P    AW +L  +VYNC+ +     NR  +V  P                
Sbjct: 478 FAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 522

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                     L+  TS      +WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQ
Sbjct: 523 ----------LQMNTS------IWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQ 566

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
           A+ +  +  +     AY   +   + +       EL+  +D +LA    FLLG WLE A+
Sbjct: 567 AVQELVSLYYEEARSAYLSKELASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQAR 626

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
             A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ +YY PR  ++ + 
Sbjct: 627 AAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEA 681

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           + +S+  G  F+   + +   +L   +   +  YP +  GD +  ++ ++ KY
Sbjct: 682 LADSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPQGDTVDLAKKIFLKY 734


>gi|397485721|ref|XP_003813989.1| PREDICTED: alpha-N-acetylglucosaminidase [Pan paniscus]
          Length = 682

 Score =  360 bits (924), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 197/533 (36%), Positives = 302/533 (56%), Gaps = 43/533 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   +T++G+W        + C++LL   DP+F  IG  F+ +
Sbjct: 180 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRE 237

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            +KE+G T HIY  DTF+E  PP   P Y+++   A+Y  M + D++AVWL+QGWLF + 
Sbjct: 238 LIKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQ 296

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q++A+L +VP G+L+VLDLFAE +P+++ +  F G P+IWCMLHNF GN  ++
Sbjct: 297 PQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 356

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  
Sbjct: 357 GALEAVNEGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTS 416

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ RRYG S P    AW +L  +VYNC+ +     NR  +V  P                
Sbjct: 417 FAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 461

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                     L+  TS      +WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQ
Sbjct: 462 ----------LQMNTS------IWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQ 505

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
           A+ +  +  +     AY   +   + +       EL+  +D +LA    FLLG WLE A+
Sbjct: 506 AVQELVSLYYEEARSAYLSKELASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQAR 565

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
             A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ +YY PR  ++ + 
Sbjct: 566 AAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEA 620

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           + +S+  G  F+   + +   +L   +   +  YP +  GD +  ++ ++ KY
Sbjct: 621 LADSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 673


>gi|355568706|gb|EHH24987.1| Alpha-N-acetylglucosaminidase, partial [Macaca mulatta]
          Length = 711

 Score =  360 bits (923), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 202/538 (37%), Positives = 304/538 (56%), Gaps = 53/538 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   +T++G+W        + C++LL   DP+F  IG  F+ +
Sbjct: 209 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRE 266

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            +KE+G T HIY  DTF+E  PP  +P Y+++   A+Y  M + D++AVWL+QGWLF + 
Sbjct: 267 LVKEFG-TDHIYGADTFNEMQPPSSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQ 325

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q++A+L +VP G+L+VLDLFAE +P+++ +  F G P+IWCMLHNF GN  ++
Sbjct: 326 PQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 385

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  
Sbjct: 386 GALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTN 445

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ +RYG S P    AW +L  +VYNC+ +     NR  +V  P                
Sbjct: 446 FAAQRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 490

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                     L+  TS      +WY+ S V  A  L + S   L+AS  +RYDL+DLTRQ
Sbjct: 491 ----------LQMNTS------VWYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQ 534

Query: 365 ALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
           A+ +  +  +     AY       L  A GV        EL+  +D LLA    FLLG W
Sbjct: 535 AVQELVSLYYEEARSAYLSKELTSLLRAGGVLAY-----ELLPALDELLASDSRFLLGSW 589

Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
           LE A+  A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ +YY PR  
Sbjct: 590 LEQARAAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWR 644

Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++ + + +S+  G  F+   + +   +L   +   +  YP +  GD +  ++ ++ KY
Sbjct: 645 LFLEALADSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 702


>gi|402900329|ref|XP_003913130.1| PREDICTED: alpha-N-acetylglucosaminidase [Papio anubis]
          Length = 743

 Score =  359 bits (922), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 202/538 (37%), Positives = 303/538 (56%), Gaps = 53/538 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   +T++G+W        + C++LL   DP+F  IG  F+ +
Sbjct: 241 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRE 298

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            +KE+G T HIY  DTF+E  PP  +P Y+++   A+Y  M + D++AVWL+QGWLF + 
Sbjct: 299 LVKEFG-TDHIYGADTFNEMQPPSSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQ 357

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+ A+L +VP G+L+VLDLFAE +P+++ +  F G P+IWCMLHNF GN  ++
Sbjct: 358 PQFWGPAQIGAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 417

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  
Sbjct: 418 GALEAVNRGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTN 477

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ +RYG S P    AW +L  +VYNC+ +     NR  +V  P                
Sbjct: 478 FAAQRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 522

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                     L+  TS      +WY+ S V  A  L + S   L+AS  +RYDL+DLTRQ
Sbjct: 523 ----------LQMNTS------VWYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQ 566

Query: 365 ALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
           A+ +  +  +     AY       L  A GV        EL+  +D LLA    FLLG W
Sbjct: 567 AVQELVSLYYEEARSAYLSKELTSLLRAGGVLAY-----ELLPALDELLASDSRFLLGSW 621

Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
           LE A+  A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ +YY PR  
Sbjct: 622 LEQARAAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWR 676

Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++ + + +S+  G  F+   + +   +L   +   +  YP +  GD +  ++ ++ KY
Sbjct: 677 LFLEALADSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 734


>gi|332260899|ref|XP_003279518.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
           [Nomascus leucogenys]
          Length = 736

 Score =  359 bits (921), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 197/533 (36%), Positives = 302/533 (56%), Gaps = 43/533 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   +T++G+W        + C++LL   DP+F  IG  F+ +
Sbjct: 234 VLPAFAGHVPEAVTRVFPRVNVTKMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRE 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            +KE+G T HIY  DTF+E  PP   P Y+++   A+Y  M + D++AVWL+QGWLF + 
Sbjct: 292 LIKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQ 350

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+ A+L +VP G+L+VLDLFAE +P+++ +  F G P+IWCMLHNF GN  ++
Sbjct: 351 PQFWGPAQIGAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 410

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  
Sbjct: 411 GALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTS 470

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ +RYG S P    AW +L  +VYNC+ +     NR  +V  P                
Sbjct: 471 FAAQRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 515

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                     L+  TS      +WY+ S+V  A  L + S   L+AS  +RYDL+DLTRQ
Sbjct: 516 ----------LQMNTS------IWYNRSDVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQ 559

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
           A+ +  +  +     AY   +   + +       EL+  +D +LA    FLLG WLE A+
Sbjct: 560 AVQELVSLYYEEARSAYLRKELASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLELAR 619

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
             A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ +YY PR  ++ + 
Sbjct: 620 AAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEA 674

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           + +S+  G  F+   + +   +L   +   +  YP +  GD +  ++ ++ KY
Sbjct: 675 LADSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 727


>gi|328778968|ref|XP_623833.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Apis mellifera]
          Length = 752

 Score =  358 bits (920), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 196/540 (36%), Positives = 305/540 (56%), Gaps = 52/540 (9%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    VLPAF+G+VP AL  +FP A +T+   W +     ++CC YLL+ TDPLF +IG+
Sbjct: 230 LGIIPVLPAFAGHVPRALLKLFPKANVTKSAVWNNFSD--KYCCPYLLEPTDPLFKQIGQ 287

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F++  ++E+G T H+YNCDTF+EN P     +++ ++G +I+  M S DS A+WLMQGW
Sbjct: 288 QFLKTYIEEFG-TDHVYNCDTFNENEPYTSELKFLRNIGHSIFEAMNSVDSKAIWLMQGW 346

Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           LF +D  FW  P+ +  L SVPLG+++VLDL +E  P +     +YG P+IWCMLHNF G
Sbjct: 347 LFYHDSVFWTEPRTRTFLTSVPLGRMIVLDLQSEQFPQYKRLNSYYGQPFIWCMLHNFGG 406

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
            + M+G  + I     EAR    +TMVG G++ EGI QN V+Y+LM+EMA++ + V++  
Sbjct: 407 TLGMFGSAEIINHRVFEARNMNGSTMVGTGLTPEGINQNYVIYELMNEMAYRKKPVNLDK 466

Query: 242 WINQYSVRRYG--RSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
           W   ++ RRYG  +       AW    +TVYN +D    + + VI   P+++        
Sbjct: 467 WFENFANRRYGDIKGNEHTVTAWKGFKNTVYNFSDTRRIRGKYVITIRPNLN-------- 518

Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
                                    P  WY+    I    + + + +    S  YR+D++
Sbjct: 519 -----------------------FFPWRWYNKDAFIYYWYVLLQARDLKRNSTLYRHDVV 555

Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
           D+TRQAL   A+E++ ++IE++   +     Q ++  L L +D++ +LA  + FLLG WL
Sbjct: 556 DVTRQALQLIADEIYTDLIESFNKKNIDLFKQNAKLLLALFDDLEEILASSEDFLLGKWL 615

Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
           + AK LA ++E+E  YE+NAR QIT+W          +RDY NK WSG++ DY+ PR AI
Sbjct: 616 KMAKDLATDDEEEILYEYNARNQITLW-----GPLGEIRDYANKQWSGIVADYFKPRWAI 670

Query: 480 YFKYMIESLESG---DGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL-----ITSQW 531
           +   +  SL +G   +  R+   +R +  +   +   R +YP ++ GD++     I S+W
Sbjct: 671 FLNELETSLTTGTRVNTTRIN--KRIFENVEKAFTFSRKIYPTKATGDSIDIAERILSEW 728


>gi|355754184|gb|EHH58149.1| Alpha-N-acetylglucosaminidase, partial [Macaca fascicularis]
          Length = 650

 Score =  358 bits (919), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 202/538 (37%), Positives = 303/538 (56%), Gaps = 53/538 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   +T++G+W        + C++LL   DP+F  IG  F+ +
Sbjct: 148 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRE 205

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            +KE+G T HIY  DTF+E  PP  +P Y+++   A+Y  M + D++AVWL+QGWLF + 
Sbjct: 206 LVKEFG-TDHIYGADTFNEMQPPSSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQ 264

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+ A+L +VP G+L+VLDLFAE +P+++ +  F G P+IWCMLHNF GN  ++
Sbjct: 265 PQFWGPAQIGAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 324

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  
Sbjct: 325 GALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTN 384

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ +RYG S P    AW +L  +VYNC+ +     NR  +V  P                
Sbjct: 385 FAAQRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 429

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                     L+  TS      +WY+ S V  A  L + S   L+AS  +RYDL+DLTRQ
Sbjct: 430 ----------LQMNTS------VWYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQ 473

Query: 365 ALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
           A+ +  +  +     AY       L  A GV        EL+  +D LLA    FLLG W
Sbjct: 474 AVQELVSLYYEEARSAYLSKELTSLLRAGGVLAY-----ELLPALDELLASDSRFLLGSW 528

Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
           LE A+  A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ +YY PR  
Sbjct: 529 LEQARAAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWR 583

Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++ + + +S+  G  F+   + +   +L   +   +  YP +  GD +  ++ ++ KY
Sbjct: 584 LFLEALADSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 641


>gi|297701096|ref|XP_002827555.1| PREDICTED: alpha-N-acetylglucosaminidase [Pongo abelii]
          Length = 836

 Score =  357 bits (917), Expect = 7e-96,   Method: Compositional matrix adjust.
 Identities = 200/538 (37%), Positives = 302/538 (56%), Gaps = 53/538 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   +T++G+W        + C++LL   DP+F  IG  F+ +
Sbjct: 334 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRE 391

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            +KE+G T HI+  DTF+E  PP   P Y+++   A+Y  M + D++AVWL+QGWLF + 
Sbjct: 392 LIKEFG-TDHIFGADTFNEMQPPSSEPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQ 450

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+ A+L +VP G+L+VLDLFAE +P+++ +  F G P+IWCMLHNF GN  ++
Sbjct: 451 PQFWGPAQIGAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 510

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  
Sbjct: 511 GALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTS 570

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ RRYG S P    AW +L  +VYNC+ +     NR  +V  P                
Sbjct: 571 FAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 615

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                     L+  TS      +WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQ
Sbjct: 616 ----------LQMNTS------VWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQ 659

Query: 365 ALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
           A+ +  +  +     AY       L  A GV        EL+  +D +LA    FLLG W
Sbjct: 660 AVQELVSLYYEEARSAYLSKELASLLRAGGVLAY-----ELLPALDEVLASDSRFLLGSW 714

Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
           LE A+  A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ +YY PR  
Sbjct: 715 LEQARAAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWR 769

Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++ + + +S+  G  F+   + +   +L   +   +  YP +  GD +  ++ ++ KY
Sbjct: 770 LFLEALADSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPQGDTVDLAKKIFLKY 827


>gi|307168312|gb|EFN61518.1| Alpha-N-acetylglucosaminidase [Camponotus floridanus]
          Length = 737

 Score =  357 bits (916), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 190/523 (36%), Positives = 286/523 (54%), Gaps = 41/523 (7%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    VLPAF+G+VP A   +FP+A +T++  W + +   ++CC YLL+ TDPLF  IG 
Sbjct: 253 LGIVPVLPAFAGHVPRAFARLFPNANMTKINPWNNFED--KYCCPYLLEPTDPLFQIIGE 310

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F+   + E+G T HIYNCDTF+EN P      Y+ ++  A+++ + + DS A+WLMQ W
Sbjct: 311 KFLRMYINEFG-TDHIYNCDTFNENEPGSTELIYLRNVSHAVFAAINAVDSKAIWLMQAW 369

Query: 123 LFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           LF +D  FW  P++K+ L SVP+G++++LDL +E  P +   K +YG P+IWCMLHNF G
Sbjct: 370 LFVHDFMFWTEPRVKSFLTSVPMGRMLILDLQSEQFPQYGRLKSYYGQPFIWCMLHNFGG 429

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
            + M+G    I     E R    +TMVG G++ EGI QN V+Y+LM+EMA++HE VD+ A
Sbjct: 430 TLGMFGSAQIINQRTFEGRNMNGSTMVGTGLTPEGINQNYVIYELMNEMAYRHEPVDLDA 489

Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           W   Y+ RRYG         W  L  TVYN       +   V+   P ++ S+       
Sbjct: 490 WFQNYATRRYGAWNEYAVTTWQYLGRTVYNFIGSQRIRGHYVVTRRPSLNISL------- 542

Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
                                    +WY+          F+ + +    S  YR+D++D+
Sbjct: 543 ------------------------WIWYNRKNFYSMWNTFLKARHGRRNSTLYRHDVVDI 578

Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
           TRQAL    ++L+  I+++Y+  +       +   LEL +D++ +LA    FLLG WL  
Sbjct: 579 TRQALQLMGDDLYTIILDSYKKRNITAFRSSANALLELFDDLESILASGSNFLLGTWLSQ 638

Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
           AK +A NEE+ K YE+NA+ QIT+W  N +     +RDY NK WSG++ DY+ PR  ++ 
Sbjct: 639 AKDVATNEEERKSYEYNAKNQITLWGPNGE-----IRDYANKQWSGVMADYFKPRWELFL 693

Query: 482 KYMIESLESGDGFRLKDWRREWI-KLTNDWQNGRNVYPVESNG 523
           K + +SL     F + +   +   K+   +      YPVE  G
Sbjct: 694 KALEKSLVENTKFNVTEINNKIFDKVERPFTFSTKFYPVEPKG 736


>gi|256422141|ref|YP_003122794.1| alpha-N-acetylglucosaminidase [Chitinophaga pinensis DSM 2588]
 gi|256037049|gb|ACU60593.1| Alpha-N-acetylglucosaminidase [Chitinophaga pinensis DSM 2588]
          Length = 728

 Score =  355 bits (911), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 191/533 (35%), Positives = 301/533 (56%), Gaps = 44/533 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF+G+VP A ++ +P+ +I +  NW     D  +   Y+LD   P+F +IG+ F+E 
Sbjct: 232 ILPAFTGHVPPAFKDKYPN-EIVKPTNW-----DAGFPDVYILDPNSPMFDKIGKKFLEA 285

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G T H Y+ DTF+EN PP     ++ ++   +Y+ M + D  AVW+MQGW+F Y+
Sbjct: 286 QTKAFG-TDHFYSADTFNENVPPSSDSSFLDAMSRKVYASMAAADPKAVWVMQGWMFHYN 344

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             +W  PQ++ALLN+VP   ++VLDL++E  P W  ++ +YG P+IW MLHNF GN  M+
Sbjct: 345 ASYWHQPQIRALLNAVPDDHMIVLDLYSESHPEWRNTQAYYGKPWIWNMLHNFGGNTGMW 404

Query: 187 GILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           G +D+ A  P  A     +  M G+G++ EGIEQNP +Y LM +  ++ + ++V  W+  
Sbjct: 405 GGMDAAAHDPATALHDPASGKMSGIGLTPEGIEQNPALYQLMIDNVWRDQPINVDTWLQS 464

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVY--NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
           Y+ +RYG    A+  AW +LYHTVY    T+GA +    +IVA P +D            
Sbjct: 465 YAKQRYGAENEAVNKAWQILYHTVYIGGPTEGAPES---IIVARPTLD------------ 509

Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 363
                ++ E V            L Y  ++V+ A +LFI +  +L  +  ++YDL+DLTR
Sbjct: 510 -----IAAERVKTK---------LEYDPAKVVPAWDLFINAAAQLKPTEGFKYDLVDLTR 555

Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
           Q L  YA+ L   +  AY+  D     Q S +F+ L++DMD LL   +GFLLG W+  A+
Sbjct: 556 QVLGNYASPLQQRVATAYRNKDLAAFKQYSTQFIGLLDDMDMLLGTQEGFLLGKWVSDAR 615

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
                  ++  YE+NA+  +T+W D    + S + +Y N+ W+GL++ +Y PR   +F  
Sbjct: 616 SNGITPAEQDLYEFNAKDLVTLWGD----KDSPVHEYSNRQWNGLIKGFYKPRWQQFFTL 671

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           +  SL+ G+   LK +  +       W NG + Y V+  GDA+  +  L+ KY
Sbjct: 672 LESSLKKGETADLKAFEEQVKAFEWKWANGHDKYAVKPQGDAVKAAVQLHKKY 724


>gi|395532374|ref|XP_003768245.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Sarcophilus
           harrisii]
          Length = 726

 Score =  355 bits (911), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 197/533 (36%), Positives = 298/533 (55%), Gaps = 43/533 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A   VFP A +T LG W        + C+YLL   DPLF  +G  F+ +
Sbjct: 224 VLPAFAGHVPKAFTRVFPQAYVTHLGMWGHFNCT--YSCSYLLAPEDPLFPVVGSLFLRE 281

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             +E+G T HIY+ DTF+E  PP   P Y+++  AA+Y  M + D DAVWL+QGWLF + 
Sbjct: 282 LTQEFG-TDHIYSADTFNEMEPPSSEPAYLAAATAAVYEAMIAVDVDAVWLLQGWLFQHQ 340

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW+PPQ+KA+L +VPLG+L+VLDL+AE KP++S +  FYG P+IWCMLHNF GN  ++
Sbjct: 341 PDFWKPPQVKAVLKAVPLGRLLVLDLYAESKPVYSRTDSFYGQPFIWCMLHNFGGNHGLF 400

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G LD++  GP +A    N+T VG G+  EGI QN VVY LM+E+ +Q   + D+ AW+  
Sbjct: 401 GALDAVNRGPSDAWLFPNSTFVGTGIVPEGINQNEVVYALMAELGWQKGPLPDLGAWVAG 460

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ +RYG      + AW +L  +VYNC+ D  T  NR  +V  P +   I          
Sbjct: 461 FAAQRYGTPHSHAEAAWKLLLQSVYNCSGDLCTGHNRSPLVKRPSLHLDI---------- 510

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                                 +WY+ S+V  A  L + +   L++S  +RYDL+D+TRQ
Sbjct: 511 ---------------------SVWYNRSDVFEAWRLLLEAAPVLASSPAFRYDLLDVTRQ 549

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
              +  +  +  +  A++      +       + +L+  +D LLA  + FLLG WLE A+
Sbjct: 550 VAQELVSLYYEELRTAFEAGAMPALLTAGGLLVFDLLPSLDELLASDERFLLGAWLEQAR 609

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
           ++A +E +  QY+ NA  Q+T+W          + DY NK  +GL+  YY PR  ++ + 
Sbjct: 610 EMAVSEAEAWQYKQNALYQLTLWGPTGN-----ILDYANKQLAGLVAGYYAPRWKLFVEM 664

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           +++SL  G  F    +  E + L  ++  GR  +P +  GD +   +  + +Y
Sbjct: 665 LVKSLAEGTPFHQNQFESEALLLGQNFVLGREKFPTQPQGDTVDLVKKFFLRY 717


>gi|395827009|ref|XP_003786703.1| PREDICTED: alpha-N-acetylglucosaminidase [Otolemur garnettii]
          Length = 756

 Score =  355 bits (910), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 196/532 (36%), Positives = 295/532 (55%), Gaps = 41/532 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   +TQL +W        + C++LL   DP+F  IG  F+ +
Sbjct: 241 VLPAFAGHVPKAITRVFPQVNVTQLSSWGHFNCS--YSCSFLLAPGDPIFSLIGSLFLRE 298

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY  DTF+E  PP   P Y+++   A+Y  M + D DAVWL+QGWLF + 
Sbjct: 299 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMIAVDPDAVWLLQGWLFQHQ 357

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+KA+L +VPLG+L+VLDLFAE +P++S +  F G P+IWCMLHNF GN  ++
Sbjct: 358 PQFWGPTQIKAVLRAVPLGRLLVLDLFAESQPVYSRTASFQGQPFIWCMLHNFGGNHGLF 417

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  
Sbjct: 418 GALEAVNQGPKAARLFPNSTMVGTGMAPEGINQNEVVYALMAELGWRKDPVPDLVAWVTS 477

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           ++ RRYG S    + AW +L  +VYNC+  A   +                         
Sbjct: 478 FADRRYGISHGDAEAAWRLLLRSVYNCSGEACSGHNH----------------------- 514

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
             P+ K   L+  T+      +WY+ S+V  A  L + S   L+AS  +RYDL+D+TRQA
Sbjct: 515 -SPLVKRPSLQMNTT------VWYNRSDVFEAWRLLLTSAPTLAASPIFRYDLLDITRQA 567

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQ 424
           + +  +  +     AY   +   + +       EL+  +D +LA  + FLLG WL  A+ 
Sbjct: 568 IQELVSLYYEKARTAYLNKELVPLLRAGGLLAYELLPALDEVLASDNHFLLGSWLAQARA 627

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
           +A +E +   YE N+R Q+T+W          + DY NK  +GL+ DYY PR  ++ + +
Sbjct: 628 VAISEAEANFYEQNSRYQLTLW-----GPVGNILDYANKQLAGLVADYYAPRWQLFMQAL 682

Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
              L  G  F+ + + +    L   +      YP +  G+ +  ++ ++ KY
Sbjct: 683 GNCLAQGIPFQQRQFDKNVFPLEQAFVLNSKRYPSQPQGNTMDLAKKIFLKY 734


>gi|426238067|ref|XP_004012979.1| PREDICTED: alpha-N-acetylglucosaminidase isoform 2 [Ovis aries]
          Length = 739

 Score =  353 bits (907), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 207/544 (38%), Positives = 301/544 (55%), Gaps = 51/544 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP AL  VFP   +TQ+G+W        + C++LL   DPLF  +G  F+ +
Sbjct: 239 VLPAFAGHVPKALTRVFPQVNVTQMGSWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRE 296

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY  DTF+E  PP   P Y+++  AA+Y  M + D  AVWL+QGWLF   
Sbjct: 297 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATAAVYQAMTAVDPGAVWLLQGWLFQNQ 355

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+ A+L +VP G+L+VLDLFAE +P++  +  F G P+IWCMLHNF GN  ++
Sbjct: 356 PEFWGPAQVAAVLGAVPRGRLLVLDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLF 415

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+S+  GP  AR   N+T+VG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  
Sbjct: 416 GALESVNQGPATARRFPNSTLVGTGMAPEGIGQNEVVYALMAELGWRKDPVADLGAWVTS 475

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           ++ RRYG S    + AW +L  +VYNC                       S  E +  N+
Sbjct: 476 FAARRYGVSHGDAEAAWRLLLRSVYNC-----------------------SGEECRGHNH 512

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
             P+ K   L   T+      +WY+ S+V  A  L + +   L++S  +RYDL+D+TRQA
Sbjct: 513 -SPLVKRPSLHMVTT------VWYNRSDVFEAWRLLLTATPTLASSPAFRYDLVDVTRQA 565

Query: 366 LAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
           + +  +  +  +  AY       L  A G+        EL+  +D +LA    FLLG WL
Sbjct: 566 VQELVSLYYEEMRTAYLKKELVPLMRAGGILA-----YELLPALDQVLASDCHFLLGSWL 620

Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
           E A+  A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ DYY PR  +
Sbjct: 621 EQARLAAVSETEAHFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRL 675

Query: 480 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 539
           + + + ESL  G  F+   + +   +L   +  G   YP +  GD +   + L+ KY   
Sbjct: 676 FAETLAESLVQGVPFQQHQFDKNAFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKYYPR 735

Query: 540 TGVF 543
            G F
Sbjct: 736 AGSF 739


>gi|426238065|ref|XP_004012978.1| PREDICTED: alpha-N-acetylglucosaminidase isoform 1 [Ovis aries]
          Length = 748

 Score =  353 bits (905), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 207/544 (38%), Positives = 301/544 (55%), Gaps = 51/544 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP AL  VFP   +TQ+G+W        + C++LL   DPLF  +G  F+ +
Sbjct: 248 VLPAFAGHVPKALTRVFPQVNVTQMGSWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRE 305

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY  DTF+E  PP   P Y+++  AA+Y  M + D  AVWL+QGWLF   
Sbjct: 306 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATAAVYQAMTAVDPGAVWLLQGWLFQNQ 364

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+ A+L +VP G+L+VLDLFAE +P++  +  F G P+IWCMLHNF GN  ++
Sbjct: 365 PEFWGPAQVAAVLGAVPRGRLLVLDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLF 424

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+S+  GP  AR   N+T+VG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  
Sbjct: 425 GALESVNQGPATARRFPNSTLVGTGMAPEGIGQNEVVYALMAELGWRKDPVADLGAWVTS 484

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           ++ RRYG S    + AW +L  +VYNC                       S  E +  N+
Sbjct: 485 FAARRYGVSHGDAEAAWRLLLRSVYNC-----------------------SGEECRGHNH 521

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
             P+ K   L   T+      +WY+ S+V  A  L + +   L++S  +RYDL+D+TRQA
Sbjct: 522 -SPLVKRPSLHMVTT------VWYNRSDVFEAWRLLLTATPTLASSPAFRYDLVDVTRQA 574

Query: 366 LAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
           + +  +  +  +  AY       L  A G+        EL+  +D +LA    FLLG WL
Sbjct: 575 VQELVSLYYEEMRTAYLKKELVPLMRAGGILA-----YELLPALDQVLASDCHFLLGSWL 629

Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
           E A+  A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ DYY PR  +
Sbjct: 630 EQARLAAVSETEAHFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRL 684

Query: 480 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 539
           + + + ESL  G  F+   + +   +L   +  G   YP +  GD +   + L+ KY   
Sbjct: 685 FAETLAESLVQGVPFQQHQFDKNAFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKYYPR 744

Query: 540 TGVF 543
            G F
Sbjct: 745 AGSF 748


>gi|332018247|gb|EGI58852.1| Alpha-N-acetylglucosaminidase [Acromyrmex echinatior]
          Length = 686

 Score =  353 bits (905), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 192/539 (35%), Positives = 299/539 (55%), Gaps = 41/539 (7%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    VLPAF+G+VP A   +FP+A +T++  W   K + ++CC YLL+ TDPLF  IG 
Sbjct: 184 LGIVPVLPAFAGHVPRAFARLFPNANMTKINPW--NKFEDKYCCPYLLEPTDPLFRTIGE 241

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F++  + E+G T HIYNCDTF+EN P      Y+ ++G +I+S M + DS A+WLMQ W
Sbjct: 242 KFLQMYIDEFG-TDHIYNCDTFNENEPGNTELIYLRNVGHSIFSAMNAVDSKAIWLMQAW 300

Query: 123 LFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           LF +D  FW   +++A L SVP+G+++VLDL +E  P +   K +YG P+IWCMLHNF G
Sbjct: 301 LFVHDIMFWTKSRVRAFLTSVPIGRMLVLDLQSEQFPQYDRLKSYYGQPFIWCMLHNFGG 360

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
            + M+G    I     E R   ++TMVG G++ EGI QN V+Y+LM+EMA++H  V++  
Sbjct: 361 TLGMFGSAQIINQRTFEGRNMNDSTMVGTGLTPEGINQNYVIYELMNEMAYRHVPVNLDN 420

Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           W   Y+ RRYG        AW  L  TVYN       +   VI   P ++ S+ +     
Sbjct: 421 WFESYATRRYGAWNEYAVAAWQHLGRTVYNFIGTQKIRGHYVITRRPSLNISLWT----- 475

Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
                                     WY   +      +F+ +      +  YR+D++D+
Sbjct: 476 --------------------------WYDRKDFYAMWNMFLKARYGRGNNTLYRHDVVDI 509

Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
           TRQAL   A+++++ I++ Y+  +       +   LEL +D++ +LA  + FLLG WL  
Sbjct: 510 TRQALQLIADDIYMTILDCYKKKNITAFQSSANALLELFDDLESILASGNNFLLGTWLAQ 569

Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
           AK +A NEE+ + YE+NAR QIT+W  N +     +RDY NK WSG++ DY+  R  ++ 
Sbjct: 570 AKDIAVNEEERRSYEYNARNQITLWGPNGE-----IRDYANKQWSGVVADYFKLRWELFL 624

Query: 482 KYMIESLESGDGFRLKDWR-REWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 539
           K + +SL       + +   R + ++   +     +YP+E+ GD +  +  + +K+ +G
Sbjct: 625 KALEKSLIQRIEPNITEINDRIFHEVERSFTFSTKLYPIETKGDTIDIAMKIISKWYKG 683


>gi|156545487|ref|XP_001606979.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Nasonia vitripennis]
          Length = 755

 Score =  352 bits (903), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 196/533 (36%), Positives = 297/533 (55%), Gaps = 43/533 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP     VFP A +T++ +W   +   ++CC Y LD TDPLF  +GR F++ 
Sbjct: 258 VLPAFAGHVPRDFIRVFPEANVTKVVSWNGFED--QYCCPYSLDPTDPLFKTVGREFLKA 315

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
              E+G T+HIYNCD+F+EN P     +Y+S+ G AIYSGM   D DA+WLMQGWLF + 
Sbjct: 316 YTDEFG-TNHIYNCDSFNENDPHTGDLDYLSNTGKAIYSGMTGADPDAIWLMQGWLFVHS 374

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW  P++KA + SVP+GK+++LDL +E  P +     ++G P+IWCMLHNF G + M+
Sbjct: 375 EYFWTFPRVKAFVTSVPIGKMIILDLQSEQFPQYKRFHSYFGQPFIWCMLHNFGGTLGMF 434

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G    I  G  EART+  +TM+G G++ EGI QN V+Y+ M+EM+++ + V +  W   Y
Sbjct: 435 GSAGVINKGVFEARTTNGSTMIGTGLTPEGINQNYVIYEFMNEMSYRKKPVVLDNWFENY 494

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           +VRRYG++  +I+ +W  L   +YN  DG T                      G Y    
Sbjct: 495 AVRRYGQADESIRTSWQELGRELYN-YDGKTK-------------------IRGHYV--- 531

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
             ++K   L  E      P  WY     +     F+ +GN    +  +++DL+D+TRQAL
Sbjct: 532 --ITKRPSLNIE------PWYWYDLKTFLAVWNSFVHAGNGTMKNELFKHDLVDITRQAL 583

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
              A+ ++ +I  AY   +   +   S   L+L +D++  LA    FLLG WLE AK +A
Sbjct: 584 QITADFIYADIKAAYTQKNLTQLQIASSHLLDLFDDLEKNLASSKDFLLGSWLEDAKAIA 643

Query: 427 QN--EEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
                   + YE+NAR QIT+W    +     + DY NK WSG++ DY+ PR  IY K +
Sbjct: 644 PEGATRDRENYEFNARNQITLWGPRGE-----IVDYANKQWSGVVADYFKPRWEIYLKEL 698

Query: 485 IESLESGDGFRLKDWRRE-WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
            ES+           +R  + ++   +   + +YP +  GD+++ ++ LY K+
Sbjct: 699 QESIRKQTAVPTAKLKRMIFNQVELPFSYSKKLYPTQPKGDSILIAKELYAKW 751


>gi|354485058|ref|XP_003504701.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cricetulus griseus]
 gi|344251941|gb|EGW08045.1| Alpha-N-acetylglucosaminidase [Cricetulus griseus]
          Length = 740

 Score =  352 bits (902), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 198/543 (36%), Positives = 303/543 (55%), Gaps = 53/543 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   + QLG+W        + C++LL   DP+F  IG  F+ +
Sbjct: 240 VLPAFAGHVPKAITRVFPQVNVFQLGSWGHFNCS--YSCSFLLAPGDPVFPLIGSLFLRE 297

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            +KE+G T HIY  DTF+E  P    P ++++  AA+Y  M S D DA+WL+QGWLF + 
Sbjct: 298 LIKEFG-TDHIYGADTFNEMQPISSDPSFLTAATAAVYEAMISVDPDAIWLLQGWLFQHQ 356

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+KA+L +VP G+L+VLDLFAE  P++  +  FYG P+IWCMLHNF GN  ++
Sbjct: 357 PQFWGPAQVKAVLQAVPRGRLLVLDLFAESHPVYMQTASFYGQPFIWCMLHNFGGNHGLF 416

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG G++ EGI QN +VY LM+E+ ++ + V D++ W+++
Sbjct: 417 GALEAVNQGPRAARIFPNSTMVGTGIAPEGIGQNEMVYALMAELGWRKDPVPDLEVWVSR 476

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++  RYG S P  + AW +L  +VYNC  +     NR  +V  P +  + I         
Sbjct: 477 FASHRYGMSHPDAEAAWRLLLRSVYNCPGETYNGHNRSPLVKRPSLQINTI--------- 527

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                                 +WY+ S+V  A  L + +   L+ S  +RYDL+D+TRQ
Sbjct: 528 ----------------------VWYNRSDVFEAWRLLLTAAPNLTTSKAFRYDLLDVTRQ 565

Query: 365 ALAKYANELFLNIIEAYQ------LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
           +L +  +  +     A+       L  A G+  ++R+   L+  +D LLA    FLLG W
Sbjct: 566 SLQELVSLFYEEARIAFMKEELDLLLRAGGI--ITRK---LLPALDELLASDSRFLLGTW 620

Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
           L  A+ +A +E++ + YE N+  Q+T+W      E +++ DY NK  +GL+ DYY PR  
Sbjct: 621 LNQARAMAVSEDEAQFYELNSLYQLTLW----GPEGNIM-DYANKQLAGLVADYYQPRWG 675

Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
           ++ + +  SL  G  FR  ++ +    L   +   +  YP    GD +  S+ L+ KY  
Sbjct: 676 LFMEALAHSLARGVPFRQHEFEKNVFPLELAFIINKKRYPSHPQGDTVDLSKKLFLKYHP 735

Query: 539 GTG 541
             G
Sbjct: 736 QAG 738


>gi|444714090|gb|ELW54978.1| Alpha-N-acetylglucosaminidase [Tupaia chinensis]
          Length = 724

 Score =  351 bits (901), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 201/534 (37%), Positives = 302/534 (56%), Gaps = 43/534 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF G+VP A+  VFP   +TQLG+W        + C++LL   DP+F  IG  F+ +
Sbjct: 211 VLPAFPGHVPKAITRVFPQVNVTQLGSWGHFNCS--YSCSFLLAPGDPMFPIIGSLFLRE 268

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY  DTF+E  PP   P Y+++  AAIY+ M + D  AVWL+QGW+F + 
Sbjct: 269 LTKEFG-TDHIYGADTFNELQPPSSEPSYLAAATAAIYAAMTAVDPGAVWLLQGWIFQHQ 327

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+KA+L +VP G+L+VLDLFAE +P++  +  F G P+IWCMLHNF GN  +Y
Sbjct: 328 PDFWGPAQVKAVLEAVPRGRLLVLDLFAETRPVYLYTASFLGQPFIWCMLHNFGGNHGLY 387

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++ +GP  AR   N++MVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  
Sbjct: 388 GTLEAVNWGPKAARLFPNSSMVGTGMAPEGINQNEVVYALMAELGWRKDPVPDLAAWVTS 447

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDG-ATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           Y+ RRYG S+   + AW +L  +VYNC+    +  NR                       
Sbjct: 448 YADRRYGVSLGDAEAAWRLLLRSVYNCSGQMCSGHNRS---------------------- 485

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
              P+ K   L+  T+      +WY+ S+V  A  L + +   L+AS T+RYDL+D+TRQ
Sbjct: 486 ---PLVKRPSLQMNTT------VWYNRSDVFEAWRLLLTAAPTLAASPTFRYDLLDVTRQ 536

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
           A+ +  +  +     AY   +   + +     + EL+ D+D LLA    F+LG WLE A+
Sbjct: 537 AVQELVSLYYEEARTAYLNKELVSLLRAGGILVYELLPDLDNLLATDGRFMLGSWLEQAR 596

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
            +A +E + + YE N+R Q+T+W          + DY NK  +GL+ DYY PR  ++ + 
Sbjct: 597 AVAVSETEAQFYEQNSRYQLTLWGPTGN-----ILDYANKQLAGLVADYYAPRWQLFMEM 651

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
           +  SL  G  F+   + +   +L   +      YP +  GD +  ++ ++ KY 
Sbjct: 652 LANSLTQGIPFQQHQFDQNAFQLEQAFVLSVERYPSQPQGDTVELAKKIFLKYF 705


>gi|348562747|ref|XP_003467170.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
           [Cavia porcellus]
          Length = 750

 Score =  349 bits (896), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 192/532 (36%), Positives = 304/532 (57%), Gaps = 41/532 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   ITQLG+W        + C++LL   DPLF  IG  F+ +
Sbjct: 248 VLPAFAGHVPKAIGRVFPQVNITQLGSWGHFNCS--YSCSFLLAPEDPLFPLIGGIFLRE 305

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            ++E+G T+HIY  DTF+E  PP   P Y+++   A++  M + DSDAVWL+QGWLF + 
Sbjct: 306 LIREFG-TNHIYGADTFNEMQPPSSDPAYLAAATEAVFKAMVAVDSDAVWLLQGWLFQHQ 364

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+ A+L +VP G+L+VLDLFAE +P+++ +  F G P+IWCMLHNF GN  ++
Sbjct: 365 PEFWGPAQVGAVLGAVPQGRLLVLDLFAESQPVYTRTASFRGQPFIWCMLHNFGGNHGLF 424

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG G++ EGI QN VVY LM+E+ ++ + V D+ AW+++
Sbjct: 425 GALEAVNRGPTAARLFPNSTMVGTGITPEGIGQNEVVYALMAELGWRKDPVPDLLAWVSR 484

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           ++ RRYG + P  + AW +L  +VYNC+  A   +                         
Sbjct: 485 FAERRYGVAQPDAEAAWRLLLRSVYNCSGEACRGHNH----------------------- 521

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
             P+ +   L+  T+      +WY+ S+V  A  L + +  +L+ S T+RYDL+D+TRQA
Sbjct: 522 -SPLVRRPSLQMNTA------VWYNRSDVFEAWRLLLKASPKLTTSPTFRYDLLDVTRQA 574

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQ 424
           L +  +  +  +  AY   +  G+ +       +L+  +D +LA    FLLG WL  A+ 
Sbjct: 575 LQELVSLYYEEVRAAYLHQELAGLLRAGGVLAYQLLPALDEVLASDHHFLLGSWLAQARA 634

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
            A +E + + YE N+R Q+T+W      E ++L DY NK  +GL+  YY PR  ++ + +
Sbjct: 635 AAASETEARLYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVAHYYAPRWQLFIESL 689

Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
            +SL     F+   + ++   L   +      Y  +  GD +  ++ ++ ++
Sbjct: 690 ADSLARAAPFQQHQFDKDVFLLEQAFVLSSRRYRSQPQGDTVDLARKVFLRF 741


>gi|431890602|gb|ELK01481.1| Alpha-N-acetylglucosaminidase [Pteropus alecto]
          Length = 740

 Score =  349 bits (895), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 196/533 (36%), Positives = 300/533 (56%), Gaps = 43/533 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   +TQ+ +W        + C++LL   DPLF  +G  F+ +
Sbjct: 238 VLPAFAGHVPKAITRVFPQVNVTQMDSWGHFNCS--YSCSFLLAPEDPLFPIVGSLFLRE 295

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY  DTF+E  PP   P Y+++  AA+Y  M + D DAVWL+QGWLF + 
Sbjct: 296 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATAAVYQAMTTVDPDAVWLLQGWLFQHQ 354

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+ A+L +VP G+L+VLDLFAE +P++  +  F G P+IWCMLHNF GN  ++
Sbjct: 355 PQFWGPAQVGAVLGAVPRGRLLVLDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLF 414

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG GM+ EGI+QN VVY LM+E+ ++ + V D+ AW+  
Sbjct: 415 GALEAVNQGPAAARLFPNSTMVGTGMAPEGIDQNEVVYALMAELGWRKDPVTDLGAWVTS 474

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ RRYG S    + AW +L  +VYNC+ +     N   +V      PS+  VT      
Sbjct: 475 FAARRYGVSHGDAEAAWRLLLRSVYNCSGEDCRGHNHSPLVR----RPSLQMVTT----- 525

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                                 +WY+ S+V  A  + + +   L+ S  + Y+L+D+TRQ
Sbjct: 526 ----------------------VWYNQSDVFEAWRMLLTATPTLATSPLFSYELVDITRQ 563

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
           A+ +  +  +  +  AY   D   +F+ +     EL+  +D +LA    FLLG WLE A+
Sbjct: 564 AIQELVSLYYEEVRTAYLNKDLVTLFRAAGILAYELLPSLDNILATDSHFLLGSWLEQAR 623

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
             A ++ +   YE N+R Q+T+W      E ++L DY NK  +GL+ +YY PR  ++ + 
Sbjct: 624 AAAVSKAEASFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLIANYYTPRWRLFMEM 678

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++ESL  G  F+   + +   +L   +      YP +  GD +  ++ L+ KY
Sbjct: 679 LVESLVQGIPFQQHQFDKNAFQLEQTFVFSTQRYPNQPQGDTVDLAKKLFLKY 731


>gi|295132875|ref|YP_003583551.1| hypothetical protein ZPR_1010 [Zunongwangia profunda SM-A87]
 gi|294980890|gb|ADF51355.1| predicted protein [Zunongwangia profunda SM-A87]
          Length = 750

 Score =  348 bits (894), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 196/541 (36%), Positives = 304/541 (56%), Gaps = 63/541 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPA+ +  FP A + +  NW +   D     TY+LDA DPLF EIG+ F+E+
Sbjct: 243 VLPAFTGHVPASFKKFFPDADLKKT-NWGNDFGD-----TYILDAEDPLFAEIGKRFLEK 296

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
           Q + +G T H Y  DTF+EN PP D P+Y+  L   I+ GM++ D +A W+MQGWLF S+
Sbjct: 297 QEEVFG-TDHFYTADTFNENEPPSDDPKYLGELSEKIFEGMKAADPEATWVMQGWLFYSH 355

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW+ PQ+K LL++VP  ++++LDL  E++P+W  ++ FYG  +IW MLHNF GNI M+
Sbjct: 356 KDFWKTPQIKGLLSTVPDDRMIILDLATEIEPVWKQTEAFYGKQWIWNMLHNFGGNISMF 415

Query: 187 GILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           G ++++A  P  A   S +  + G+G++ME IEQNPV+Y+LM++  ++   +++K+W+  
Sbjct: 416 GRIETVAEQPALALNDSTSGNLKGIGLTMEAIEQNPVLYELMTDNTWRDTPIELKSWLKN 475

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           Y+  RYG    +I +AW++L  T YN T    D    +I A P             ++ Y
Sbjct: 476 YTRNRYGAVNDSILEAWDILVATAYNGT-TIRDGAESIIAARP------------TFEGY 522

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
            +              +    + Y   +++ A +LFI + +    S+ + YDL+DL+RQ 
Sbjct: 523 RR--------------WARTKMNYDPLDLLPAWDLFIGARDRFKDSDGFAYDLVDLSRQV 568

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
           LA YA  +   +  AY+ ND     + S   L L+ D+D LLA    FLLGPW+  A+  
Sbjct: 569 LANYALPVQQQMRIAYENNDKEAFKKHSEELLTLISDLDRLLATRKDFLLGPWIADARSW 628

Query: 426 AQNEEQEKQYEWNARTQITMWF--DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
               E++  YE NAR  IT+W   DN       L +Y  + WSG+L D+Y PR    ++ 
Sbjct: 629 GTTPEEKALYERNARDLITLWGGPDNP------LHEYSCRQWSGVLDDFYKPR----WQQ 678

Query: 484 MIESLESGDG--------FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 535
            I  +E+  G         ++K+W  +W+       N    YP + +GD+   ++ LY+K
Sbjct: 679 FIADVEANWGDFDQEVFDEKIKEWEWKWV-------NKEEAYPTQPSGDSYKVAKALYDK 731

Query: 536 Y 536
           Y
Sbjct: 732 Y 732


>gi|344285558|ref|XP_003414528.1| PREDICTED: alpha-N-acetylglucosaminidase [Loxodonta africana]
          Length = 744

 Score =  348 bits (892), Expect = 6e-93,   Method: Compositional matrix adjust.
 Identities = 200/538 (37%), Positives = 295/538 (54%), Gaps = 53/538 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   +TQ+G+W        + C++LL   DP+F  IG  F+ +
Sbjct: 242 VLPAFAGHVPKAVTRVFPQVNVTQMGSWGHFNCS--YSCSFLLAPGDPMFPIIGSLFLRE 299

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
              E+G T HIY  DTF+E  PP   P Y+++  AA+Y  M + D DAVWL+QGWLF + 
Sbjct: 300 LTTEFG-TDHIYGADTFNEMQPPSSEPSYLAAATAAVYEAMITVDPDAVWLLQGWLFQHQ 358

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+ A+L +VP G L+VLDLFAE +P++  +  F G P+IWCMLHNF GN  ++
Sbjct: 359 PQFWGPAQVGAVLGAVPRGHLLVLDLFAETQPVYIRTASFQGQPFIWCMLHNFGGNHGLF 418

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  
Sbjct: 419 GTLETVNQGPAAARLFPNSTMVGTGMAPEGIGQNEVVYALMAELGWRKDPVPDLGAWVAS 478

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ RRYG      + AW +L  +VYNC+ +  +  NR                       
Sbjct: 479 FAARRYGGIHQDAETAWRLLLRSVYNCSGESCSGHNRS---------------------- 516

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
              P+ K   L+  T+      +WY+ S+V  A  L +A+   L+AS  +RYDL+D+TRQ
Sbjct: 517 ---PLVKRPSLQMNTT------VWYNRSDVFEAWRLLLATTPALAASPAFRYDLLDVTRQ 567

Query: 365 ALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
           A  +  +  +  +  AY       L  A GV        EL+  +D +LA    FLLG W
Sbjct: 568 AAQELVSFYYGEVRTAYLNKELVHLLRAGGVLA-----YELLPALDEVLASDSRFLLGSW 622

Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
           LE A+  A +E +   +E N+R Q+T+W          + DY NK  +GL+ DYY PR  
Sbjct: 623 LEQARVAAVSEAEAHFFEQNSRYQLTLW-----GPVGNILDYANKQLAGLVSDYYTPRWQ 677

Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++   ++ESL     F+ + +     +L   +      YP +  GD +  ++ L+ KY
Sbjct: 678 LFVGALVESLVQDVPFQQRQFDENVFQLEQAFVLNTRRYPTQPKGDTVDLAKRLFLKY 735


>gi|291406137|ref|XP_002719212.1| PREDICTED: alpha-N-acetylglucosaminidase [Oryctolagus cuniculus]
          Length = 743

 Score =  347 bits (891), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 203/539 (37%), Positives = 305/539 (56%), Gaps = 53/539 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   +TQLG+W        + C++LL   DP+F  IG  F+ +
Sbjct: 241 VLPAFAGHVPKAVTRVFPHINVTQLGSWGHFNCS--YSCSFLLAPEDPMFPLIGSLFLRE 298

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             +E+G T H+Y  DTF+E  PP   P Y+++  AA++  M + D DAVWL+QGWLF + 
Sbjct: 299 LTREFG-TDHVYGADTFNEMQPPSSEPSYLAAATAAVFEAMIAVDPDAVWLLQGWLFQHQ 357

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+KA+LN+VP G+L+VLDLFAE +P+++ +  F G P+IWCMLHNF GN  ++
Sbjct: 358 PQFWGPSQVKAVLNAVPRGRLLVLDLFAENQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 417

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG G++ EGI QN VVY LM+E+ ++ E V D++AW+  
Sbjct: 418 GALEAVNRGPAAARLFPNSTMVGTGIAPEGISQNEVVYALMAELGWRKEPVPDLEAWVTS 477

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           ++ RRYG + P    AW +L  +VYNC+ D     NR  +V  P                
Sbjct: 478 FAGRRYGVAHPDAGAAWRLLLRSVYNCSGDACRGHNRSPLVRRPS--------------- 522

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                     L+  T+      +WY+ S+V  A  L + +   L++S  +RYDL+D+TRQ
Sbjct: 523 ----------LQLNTT------VWYNRSDVFEAWRLLLKATPTLASSPAFRYDLLDVTRQ 566

Query: 365 ALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
           A+ +  +  +     AY       L  A GV        EL+  +D +LA    FLLG W
Sbjct: 567 AVQELVSLYYEEARTAYLHKELATLLRAGGVLA-----YELLPALDRVLATDSRFLLGSW 621

Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
           LE A+  A +E + + YE N+R Q+T+W      E ++L DY NK  +GL+  YY PR  
Sbjct: 622 LEQARAAAASEAEAQLYEQNSRFQLTLW----GPEGNIL-DYANKQLAGLVAQYYSPRWQ 676

Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
           ++ + + +SL  G  F+ + + +   +L   +      YP +  GD +  +Q ++ KY 
Sbjct: 677 LFLEALADSLARGVPFQQRLFDKLVFRLEQAFVLSSRRYPTQPQGDTVDLAQKIFLKYF 735


>gi|90399367|emb|CAJ86183.1| H0212B02.15 [Oryza sativa Indica Group]
 gi|116311963|emb|CAJ86322.1| OSIGBa0113E10.5 [Oryza sativa Indica Group]
          Length = 692

 Score =  345 bits (884), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 163/262 (62%), Positives = 205/262 (78%), Gaps = 6/262 (2%)

Query: 208 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 267
           +GVGMSMEGIEQNP+VYDLMSEMAF H +VD++ W+  Y  RRYG+S+  +QDAW +LY 
Sbjct: 427 IGVGMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVETYPTRRYGKSIVGLQDAWKILYQ 486

Query: 268 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAV-LKSETSSY 322
           T+YNCTDG  DKNRDVIVAFPDV+P +I  T G Y    + Y   +SK  + + +    Y
Sbjct: 487 TLYNCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSSSKTYSTKLSKNYIAVDASNDEY 545

Query: 323 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 382
           +HPHLWY T  VIRALELF+  G+E+S SNT+RYDL+DLTRQ LAKYAN++F+ IIE+Y+
Sbjct: 546 EHPHLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVDLTRQTLAKYANQVFVKIIESYK 605

Query: 383 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 442
            N+ + V  L + F++LV D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQ
Sbjct: 606 ANNVNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLESAKGLARDKEQEMQYEWNARTQ 665

Query: 443 ITMWFDNTQEEASLLRDYGNKY 464
           ITMWFDNT+ +ASLLRDYG  +
Sbjct: 666 ITMWFDNTKTKASLLRDYGEAH 687



 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 106/154 (68%), Positives = 127/154 (82%), Gaps = 3/154 (1%)

Query: 6   FAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
           F VLPAFSGN+PAAL++ FPSAK+T LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FI
Sbjct: 296 FPVLPAFSGNIPAALRSKFPSAKVTHLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFI 355

Query: 66  EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           E+Q++EYG TSH+Y+CDTFDENTPP+  P YISSLGAA + GMQSGD DA+WLMQGWLFS
Sbjct: 356 EEQIREYGGTSHVYSCDTFDENTPPLSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFS 415

Query: 126 YDPFWRPPQMKALLNSVPLG---KLVVLDLFAEV 156
           YDPFW PPQMK  +     G     +V DL +E+
Sbjct: 416 YDPFWEPPQMKIGVGMSMEGIEQNPIVYDLMSEM 449


>gi|403304646|ref|XP_003942904.1| PREDICTED: alpha-N-acetylglucosaminidase [Saimiri boliviensis
           boliviensis]
          Length = 754

 Score =  344 bits (883), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 196/532 (36%), Positives = 303/532 (56%), Gaps = 41/532 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSG+VP A+  VFP   +TQ+G+W        + C++LL   DP+F  +G  F+ +
Sbjct: 251 VLPAFSGHVPRAINRVFPRVNVTQMGSWGHFNCS--YSCSFLLAPEDPIFPILGSLFLRE 308

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY  DTF+E  PP   P Y+++  AA+Y  M + D+DAVWL+QGWLF + 
Sbjct: 309 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATAAVYEAMIAVDTDAVWLLQGWLFQHQ 367

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q++A+L +VP G+L+VLDLFAE +P+++ +  F G P+IWCMLHNF GN  ++
Sbjct: 368 PQFWGPAQVRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 427

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E++++ + V D+ AW+  
Sbjct: 428 GALEAVNRGPEAARLFPNSTMVGTGMAPEGINQNEVVYSLMAELSWRKDPVPDLAAWVTS 487

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           ++ +RYG S P    AW +L  +VYNC+  A   +                         
Sbjct: 488 FATQRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNH----------------------- 524

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
             P+ +   L+  T+      +WY+ S+V  A  L +++   L+AS T+RYDL+D+TRQA
Sbjct: 525 -SPLVRRPSLQMNTT------VWYNRSDVFEAWRLLLSAAATLAASPTFRYDLLDVTRQA 577

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQ 424
           + +     +     AY   + H + +       EL+  +D +LA    FLLG WLE A+ 
Sbjct: 578 VQELVGLYYEEARSAYLSKELHSLLRAGGILAYELLPALDEVLASDSHFLLGSWLEQARA 637

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
           +A +E +   YE ++R Q+T+W      E ++L DY NK  +GL+  YY PR  ++ + +
Sbjct: 638 VAVSEAEADFYEQSSRYQLTLW----GPEGNIL-DYANKQLAGLVASYYTPRWRLFLEVL 692

Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
             S+  G  F    + +   +L   +   +  YP +  GD +  ++ ++ KY
Sbjct: 693 AASVAQGIPFPQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 744


>gi|320162905|gb|EFW39804.1| lysosomal alpha-N-acetyl glucosaminidase [Capsaspora owczarzaki
           ATCC 30864]
          Length = 786

 Score =  342 bits (877), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 189/534 (35%), Positives = 297/534 (55%), Gaps = 38/534 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP+F+G+VP+AL   FP+A ITQ  +W +     ++CC   LDA+DPLF +IG  F+  
Sbjct: 275 VLPSFAGHVPSALAQHFPNANITQSSDWNNFPD--QYCCVGFLDASDPLFTQIGAEFLRL 332

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + YG T+H+YNCD F+E TP      Y+   G A+Y  M + D  AVW+MQGWLF  +
Sbjct: 333 QNETYG-TNHLYNCDQFNEMTPASTDLGYLKQAGMAVYQSMTAYDPAAVWVMQGWLFFNE 391

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             +W   +++ALL+ VP   +++LDLF++V P+W+  + +YG P+IW MLH+F GNI +Y
Sbjct: 392 AAWWSNDRVQALLSGVPDDHMIILDLFSDVTPVWNRLESYYGKPFIWNMLHDFGGNIGLY 451

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           GIL SI  GP  A  +   TMVG+G++ EGI QN ++Y+ M E  ++   V++  W++ +
Sbjct: 452 GILPSINEGPFAALATPGNTMVGIGLTPEGINQNYILYEFMMENMWRSAPVNLPTWVDAF 511

Query: 247 SVRRYGRSVPAIQD-AWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
             RRYG S PA+   A+  L  +VYNCT+G     + ++   P V+              
Sbjct: 512 VGRRYGPSTPAVAKLAYQQLLQSVYNCTNGQYSVTKSLLEIRPAVN-------------- 557

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIA---SGNELSASNTYRYDLIDLT 362
              +S+   + +        +L+Y    VI A++  +A   S  +L++   +RYD++D T
Sbjct: 558 ---MSRNGFMPT--------NLYYDPGHVILAVDHILAAAKSAPQLASVVPFRYDVVDFT 606

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
           RQ L+  A +   N+  A     AH V    +  + L+ D+D LL     FLLGPWL +A
Sbjct: 607 RQMLSNLAIDFHSNLTLALTSKQAHLVHLYGQGIVGLIADLDELLVSDAHFLLGPWLAAA 666

Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           +  ++N   +   E+NAR QIT+W  N +     + DY +K W+GL+  YY PR  ++  
Sbjct: 667 RSWSENTAAQDLLEFNARNQITLWGPNGE-----ITDYASKQWAGLMSSYYRPRWELFVS 721

Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           +   + ES   F    +    +++   WQ+  + + V   GD++  +  L  KY
Sbjct: 722 FASAAAESDLPFNDAAFNAAVLEVEKAWQHSHHNFTVTPLGDSIAIATRLRAKY 775


>gi|351699889|gb|EHB02808.1| Alpha-N-acetylglucosaminidase, partial [Heterocephalus glaber]
          Length = 652

 Score =  342 bits (877), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 193/522 (36%), Positives = 294/522 (56%), Gaps = 41/522 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   +TQLG+W        + C++LL   DPLF  IG  F+ +
Sbjct: 151 VLPAFAGHVPKAVTRVFPQVNVTQLGSWGHFNCS--YSCSFLLAPGDPLFPLIGSLFLRE 208

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             +E+G T H Y  DTF+E  PP   P Y+++  AA+Y  M + D DAVWL+QGWLF + 
Sbjct: 209 LNREFG-TDHFYGADTFNEMQPPSSEPAYLAAATAAVYEAMTAVDPDAVWLLQGWLFQHQ 267

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+ A+L +VP G+L+VLDLFAE +P+++ +  F G P+IWCMLHNF GN  ++
Sbjct: 268 PEFWGPAQVGAVLGAVPQGRLLVLDLFAENQPVYTRTASFGGQPFIWCMLHNFGGNHGLF 327

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+T+VG G++ EGI QN VVY LM+E+ ++ + V D+ AW+ +
Sbjct: 328 GALEAVNRGPAAARLFPNSTVVGTGIAPEGIGQNEVVYALMAELGWRKDPVPDLSAWVAR 387

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           ++ +RYG + P    AW +L H+VYNC+  A   +                         
Sbjct: 388 FAEQRYGVAQPDAVLAWRLLLHSVYNCSGEACRGHNH----------------------- 424

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
             P+ +   L+  T+      +WY+ S+V  A  L + +   L+AS  +RYDL+D+TRQ 
Sbjct: 425 -SPLVRRPSLQMNTT------VWYNRSDVFEAWRLLLKATPNLTASPAFRYDLLDVTRQG 477

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQ 424
           L +  +  +     AY   +  G+ +       +L+  +D +LA    FLLG WLE A+ 
Sbjct: 478 LQELVSLYYEEARAAYMRQELEGLLRAGGVLAYKLLPALDEVLASDHRFLLGSWLEQARA 537

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
           +A +  +   YE N+R Q+T+W      E ++L DY NK  +GL+ DYY PR  ++ + +
Sbjct: 538 VAVSSAEADLYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYVPRWRLFVETL 592

Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL 526
             SL  G  F+ + +  +   L   +   R  YP +  GD +
Sbjct: 593 ASSLARGVPFQQQQFNSDVFLLEQAFVLSRKRYPSQPQGDTV 634


>gi|405964692|gb|EKC30145.1| Alpha-N-acetylglucosaminidase [Crassostrea gigas]
          Length = 859

 Score =  341 bits (875), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 187/561 (33%), Positives = 302/561 (53%), Gaps = 37/561 (6%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRW--------------------CCT 47
           VLP F+G+VP A    +P A +++L +W        W                    CC 
Sbjct: 254 VLPGFAGHVPEATILRYPQANVSRLTDWAGFNQSFCWHYPTANVSRLRDWGHFNKTYCCN 313

Query: 48  YLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSG 107
           YLLD  DPLF++I   FI++   E+G   H+Y+ DTF+E  P  +S EY++  G  +Y  
Sbjct: 314 YLLDFNDPLFMKIAVRFIKEMENEFG-VDHVYSVDTFNEMRPRSNSTEYLALSGRTVYKS 372

Query: 108 MQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFY 167
           ++  DS A+WLMQGWLF    FW+ PQ+KALL +VP G++++LDL++E+ PI++ ++ +Y
Sbjct: 373 LKEADSKAIWLMQGWLFIDGGFWKQPQIKALLTAVPQGEMIILDLYSEIIPIYTQTESYY 432

Query: 168 GVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLM 227
           G P+IWCMLH+F G +E+YG L  I  GP   R   N++MVG+GM+ EGI QN VVY+  
Sbjct: 433 GQPFIWCMLHDFGGTMELYGALKLINEGPFNGRAFPNSSMVGLGMTPEGIFQNEVVYEFF 492

Query: 228 SEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAF 287
           +E  ++    D+  WI++Y + RYG++   I  AW  L ++VYN +D   D + + I   
Sbjct: 493 TENVWRKAPRDISTWISKYVLNRYGKTNKFIDLAWQYLKNSVYNNSDNLKDHDSNAI--- 549

Query: 288 PDVDPSIISVTE---GKYQN----YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALEL 340
           PD  PS+        G Y N    +   ++         +      +WY+  ++  A ++
Sbjct: 550 PDHRPSLSPALHPDLGIYNNTDYLHDNSINIIVTTLPRMTPLIQQDVWYNPEDLYVAWDI 609

Query: 341 FIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELV 400
              + +E S S+ + YD++D+TR +L   + + + +++ A+   D H V     + L L+
Sbjct: 610 MTLNLDEFSNSSLFMYDIVDVTRNSLQILSIKYYTDLVYAFGRGDIHAVESHGNQLLGLL 669

Query: 401 EDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDY 460
            DMD +L     FLLG W+++A   A + +     ++NAR QIT+W    +     +RDY
Sbjct: 670 SDMDTVLGSDSHFLLGRWIKAATDNAMDMQDNWFLQFNARNQITLWGPRGE-----IRDY 724

Query: 461 GNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR-REWIKLTNDWQNGRNVYPV 519
             K WSGL++DYY PR  I+  Y ++ +     +   +     + K+   +    + YP 
Sbjct: 725 ACKQWSGLIKDYYLPRWEIFVNYTLDIMAHNKTYNATELDIMIYEKVEFPFSYRLDQYPT 784

Query: 520 ESNGDALITSQWLYNKYLQGT 540
           E  GD++   + L+ KY   T
Sbjct: 785 EPQGDSVAIVKSLHKKYRPDT 805


>gi|325103828|ref|YP_004273482.1| alpha-N-acetylglucosaminidase [Pedobacter saltans DSM 12145]
 gi|324972676|gb|ADY51660.1| Alpha-N-acetylglucosaminidase [Pedobacter saltans DSM 12145]
          Length = 738

 Score =  340 bits (871), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 194/546 (35%), Positives = 295/546 (54%), Gaps = 55/546 (10%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    VLPAF+G+VP +    FP AK+       SV     +   Y+L+  DP+F +IG 
Sbjct: 227 LGMIPVLPAFTGHVPKSFVKKFPEAKVD------SVNWQGNFPNIYMLNPNDPMFSKIGE 280

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F+++Q +EYG T H Y+ D F+E  PP   P+Y+  +   +YS M+  D  +VW+MQ W
Sbjct: 281 QFLKEQTREYG-TDHYYSSDIFNELNPPSSDPKYLYDISEKVYSSMKKVDPKSVWVMQAW 339

Query: 123 LFSY---DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNF 179
           LF       FW P +M+A L  VP  KL++LDL+ E +P W  ++ +YG  ++W MLHNF
Sbjct: 340 LFVSAHGRKFWTPERMQAFLKPVPDDKLIILDLYTENRPRWKNTEGYYGKKWVWNMLHNF 399

Query: 180 AGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVD 238
            GNI ++G   +IA  P    +        G+G++MEGIEQNP +Y LM +  + +E ++
Sbjct: 400 GGNIGLFGKAQTIASEPARVLSDPMKGNYSGIGLTMEGIEQNPFIYQLMLDHVWNNEPIE 459

Query: 239 VKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVT 298
           ++ W N+Y  RRYG        AW +L +TVY       D N+D          SI+S  
Sbjct: 460 LEKWTNKYITRRYGVLDNNAVKAWEILLNTVY------KDNNKD-----QGAPESILS-- 506

Query: 299 EGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 358
                  G+P        ++ S +    L+Y   E +RA +  I S ++L  S+ ++YD+
Sbjct: 507 -------GRPTF------AQNSYWTWTDLYYDNREFVRAWDYLIKSADKLRNSDGFQYDI 553

Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
           +D+TRQA+A YA  L   +   Y   D +   + SRRFLEL+ D+D LLA    FLLG W
Sbjct: 554 VDITRQAMANYATALQRQLAYTYYAGDVNTYEKESRRFLELLSDLDRLLATRKDFLLGIW 613

Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
           ++ AK+ A N+ + K YE+NA+  ++MW      +   + DY  + WSGL+ +YY  R  
Sbjct: 614 IDDAKKWATNDAERKLYEFNAKDLVSMW----GHKDITINDYSARQWSGLVENYYKQRWK 669

Query: 479 IYFKYMIESLESGDGFR-------LKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQW 531
           I+F   ++ L++ + +        +KDW  EW     +W N R  YP  + GD +  S+ 
Sbjct: 670 IFFDQSLQKLKNNEIWDQAEFEKYIKDW--EW-----NWVNRRETYPTNTKGDPVNVSKE 722

Query: 532 LYNKYL 537
           +YNKY 
Sbjct: 723 MYNKYF 728


>gi|196001339|ref|XP_002110537.1| hypothetical protein TRIADDRAFT_54660 [Trichoplax adhaerens]
 gi|190586488|gb|EDV26541.1| hypothetical protein TRIADDRAFT_54660 [Trichoplax adhaerens]
          Length = 757

 Score =  340 bits (871), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 191/530 (36%), Positives = 290/530 (54%), Gaps = 39/530 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF+GN+P AL  ++P AKI +   WF      R+  T LLD  D LFI I + FIE+
Sbjct: 254 ILPAFNGNIPNALTKIYPKAKIVKSSPWFGFSK--RYGETALLDPRDKLFIVISKLFIEE 311

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           ++K YG T H+Y+ D F+E  P     EY++++  + Y  + S D+ AVW+MQGW+F  D
Sbjct: 312 EIKAYG-TDHLYSLDLFNEIDPQSKELEYLTAVSKSAYLALNSADTKAVWIMQGWMFYND 370

Query: 128 PF-WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
            + W   +++A L+ +P G++V+LDLFAEV+P +  S  F+G P+IWCML+NF GN  MY
Sbjct: 371 NYYWENKRIQAFLSPIPKGRIVILDLFAEVEPQYHRSNSFFGHPFIWCMLNNFGGNAGMY 430

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G  ++I  G + A   +N+TM+G GM+ EGI  N ++YDLM+EM ++   VDV+ W+  Y
Sbjct: 431 GTFETITEGAISAYDMKNSTMIGTGMAPEGIGNNYIMYDLMAEMGWRKIAVDVRDWVVVY 490

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + RRYG     I  AW  L  TVYNC D      R    A P V PS+    +       
Sbjct: 491 TERRYGGLDENIIKAWLRLSETVYNCNDMRQYHCR----ALPAVRPSLKIAND------- 539

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
                               +WYS  ++  A E  + + NE  +  T++YD++D+TRQAL
Sbjct: 540 --------------------VWYSADDIFFAWEHMLRANNEFISEETFQYDIVDVTRQAL 579

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
            + A  ++  + + Y  N+   +       +EL  DMD LL  +  FLLG W+  A Q +
Sbjct: 580 QELAFIMYKKVTQCYHDNNQETLKTAGGELIELFTDMDTLLGTNSHFLLGRWVADALQHS 639

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            N   ++Q  +NA  QIT+W        S+L DY NK W+GL+  +Y  R  ++ K + +
Sbjct: 640 NNISIKQQLRFNALNQITLW----GPSKSILHDYANKMWNGLVDKFYKKRWLMFIKALSD 695

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           S+ +   F  + +     K    W +  N Y   S+G ++  S+ L++KY
Sbjct: 696 SISNNILFDQQKFNLAVQKFEAAWASENNTYATTSSGSSVTVSKQLFSKY 745


>gi|390463730|ref|XP_003733088.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
           [Callithrix jacchus]
          Length = 830

 Score =  336 bits (861), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 196/533 (36%), Positives = 293/533 (54%), Gaps = 50/533 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP F G+VP A+  VFP   +TQ+G+W        + C++LL   DP+F  +G  F+ +
Sbjct: 334 VLPMFLGHVPKAITRVFPRVSVTQMGSWGHFNCS--YSCSFLLAPEDPIFPILGSLFLRE 391

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY  DTF+E  PP   P Y+++  AA+Y  M + D+DAVWL+QGWLF Y 
Sbjct: 392 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATAAVYEAMIAVDTDAVWLLQGWLFQYQ 450

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q++A+L S P G L+VLDLFAE +P++  +  F G P+IWCMLHNF GN  ++
Sbjct: 451 PQFWGPAQVRAVLGSAPHGCLLVLDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLF 510

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+++  + V D+ AW   
Sbjct: 511 GALEAMNRGPEAARLFPNSTMVGTGMAPEGISQNXVVYSLMAELSWXKDPVPDLVAWX-- 568

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
                YG S P    AW +L  +VYNC+  A   +                         
Sbjct: 569 -----YGVSHPDTGAAWRLLLRSVYNCSGEACRGHNH----------------------- 600

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
             P+ +   L+  T+      +WY+ S+V  A  L  ++   L+AS T+RYDL+D+TRQ 
Sbjct: 601 -SPLVRRPSLQMNTT------IWYNQSDVFEAWRLLFSAAATLAASPTFRYDLLDVTRQV 653

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFL--ELVEDMDGLLACHDGFLLGPWLESAK 423
           + +  +  +     AY L+   G    +   L  EL+  +D +LA    FLLG WLE A+
Sbjct: 654 VQELVSLYYEEARSAY-LSKELGSLLRAGGILAYELLPALDEVLASDSHFLLGSWLEQAR 712

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
            +A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+  YY PR  ++ + 
Sbjct: 713 AVAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVAHYYAPRRRLFLEA 767

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           +  S+  G  F+   + +   +L   +   +  YP +  GD +  ++ ++ KY
Sbjct: 768 LAASVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 820


>gi|301106961|ref|XP_002902563.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
           infestans T30-4]
 gi|262098437|gb|EEY56489.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
           infestans T30-4]
          Length = 684

 Score =  334 bits (857), Expect = 7e-89,   Method: Compositional matrix adjust.
 Identities = 185/538 (34%), Positives = 294/538 (54%), Gaps = 39/538 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
            LPAF+G+VP  ++ +FP+AK T+  NW     +  +CC Y+LD +DPL+ +IG+ F+E+
Sbjct: 160 ALPAFAGHVPEEMKALFPNAKFTRSPNWGDFSDE--FCCVYMLDFSDPLYYDIGKTFLEE 217

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q   Y  TS +Y CDT++E  P    P  + +   A+   M + D++AVWL+QGWLF   
Sbjct: 218 QRALYDYTSSLYQCDTYNEMDPDFTDPAKLQAASRAVIDSMTAADANAVWLIQGWLFENS 277

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P +W   ++KA L+ V   K+++LDL++EV+P+WS    ++G  +++C+LHNF GN  M 
Sbjct: 278 PDYWTKNRVKAYLDGVSNEKMIILDLYSEVRPVWSKMDNYFGKSWVYCVLHNFGGNTGMR 337

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G L ++   PV+A    N TM+GVG++MEGI QN VVYDL  +MA+    +D+  W+  +
Sbjct: 338 GDLATLGTAPVQASRDSNGTMIGVGLTMEGIYQNYVVYDLTLQMAWVDTPLDMDEWVPSF 397

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + +RY       + AW  L  +VYN T G     + +I   P                + 
Sbjct: 398 AAQRYHSQDVHTERAWGFLLQSVYNRTLGFGGVTKSLICLIP----------------HW 441

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
           K V    +  S T         Y   ++ RA +  + +G+EL A +TYR+DL+D+TRQ L
Sbjct: 442 KLVRDGFMPTSIT---------YDPMDITRAWKELLLAGSELHAVDTYRHDLVDVTRQFL 492

Query: 367 AKYANELFLNIIEAYQ--LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
           + +    +L++ E Y+     AH +   + R L  +E MD +LA ++  LLG W+  A+ 
Sbjct: 493 SDHFMAQYLHLKEMYEGKTQPAHQLCAWTERMLLTIERMDEILATNEDSLLGNWIADARA 552

Query: 425 LAQNEEQ------EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
           LA+  E       +  YE+ AR Q+T W DN  E    + DY  K W+GL++ YY PR  
Sbjct: 553 LAEESESIESSNLQDYYEYEARNQVTRWGDNNSET---IHDYAGKEWAGLVKGYYLPRWR 609

Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++   + ++   G     +  ++  I     WQ     YP  + GDAL+ SQ +Y+++
Sbjct: 610 MWLGEVCQAYTQGRTINKEVVKKARIAFELKWQLSHEHYPTTTVGDALVVSQRIYDEF 667


>gi|194216885|ref|XP_001917396.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
           [Equus caballus]
          Length = 744

 Score =  332 bits (852), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 200/538 (37%), Positives = 301/538 (55%), Gaps = 53/538 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   +TQLG+W        + C++LL   DPLF  +G  F+ +
Sbjct: 242 VLPAFAGHVPKAITRVFPQVNVTQLGSWGHFNCS--YSCSFLLAPEDPLFPVVGSLFLRE 299

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY  DTF+E  PP   P Y+++  AA+Y  M + D DAVWL+QGWLF + 
Sbjct: 300 LTKEFG-TDHIYGADTFNEMQPPSSEPAYLAAATAAVYQAMTAVDPDAVWLLQGWLFHHQ 358

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW P Q+ A+L +VP G+L+VLDLFAE +P++  +  F G P+IWCMLHNF GN  ++
Sbjct: 359 RTFWGPAQVGAVLGAVPRGRLLVLDLFAESQPMYIRTASFQGQPFIWCMLHNFGGNQGLF 418

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D++AW+  
Sbjct: 419 GALEAVNRGPAAARLFPNSTMVGTGMTPEGIGQNEVVYALMAELGWRKDPVADLEAWVTS 478

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           ++ RRYG S    + AW +L  +VYNC+  A   +                         
Sbjct: 479 FAARRYGVSHKDAETAWKLLLRSVYNCSAEAYSGHNQ----------------------- 515

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
             P+ K   L+  T+      +WY+ S+V  A  L + +   L++S  + YDL+D+TRQA
Sbjct: 516 -SPLVKRPSLQMGTT------VWYNRSDVFEAWWLLLTAAPALASSPAFLYDLVDVTRQA 568

Query: 366 LAKY-------ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
             +        A   +LN  E   L  A G+        EL+  +D +LA    FLLG W
Sbjct: 569 AQELISLYYEEARTAYLNK-ELVPLLRAGGILAY-----ELLPALDKVLASDSRFLLGSW 622

Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
           L+ A+++A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ DYY PR  
Sbjct: 623 LKQAREMAVSEAEAHFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYTPRWQ 677

Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++ + +++SL  G  F+ + + +   +L   +      YP +  GD +  ++  + KY
Sbjct: 678 LFVEMLVQSLAQGVPFQQQQFDKNAFELEEAFVLSTRRYPSQPQGDTVDLAKKFFLKY 735


>gi|321472423|gb|EFX83393.1| hypothetical protein DAPPUDRAFT_301977 [Daphnia pulex]
          Length = 799

 Score =  332 bits (850), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 179/494 (36%), Positives = 288/494 (58%), Gaps = 43/494 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A++ V+P+A  T L +W + +   ++CC   L  T+PLF EIG  FI++
Sbjct: 249 VLPAFAGHVPRAMERVYPNASYTHLTSWLNFQD--QYCCPLFLQPTEPLFTEIGSRFIKE 306

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
              E+G + H+YNCD F+E  P    P ++SS+G A+++ M + D DA+WLMQGWLF  D
Sbjct: 307 MALEFG-SDHVYNCDVFNEVRPTQADPVFVSSVGTAVFNAMTTADPDAIWLMQGWLFKSD 365

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             +W     KALL SVP G++++LDL AE+ P +     FYG P+++C+LHNF G + + 
Sbjct: 366 ADYWTADLSKALLTSVPQGRMLILDLQAELDPQYIRLNSFYGQPFVFCLLHNFGGTLGLN 425

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +  I+   ++AR   N+TMVG G++MEGI+QN VVYD M EM ++ +  ++  W ++Y
Sbjct: 426 GAIQIISQRVIDARNFPNSTMVGTGLTMEGIDQNYVVYDKMLEMGWRDKVPNLNQWFDEY 485

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           +VRRYG +  A+  AW  L ++VYN +   + + + V+V  P +                
Sbjct: 486 TVRRYGVNNTAVMSAWRFLQNSVYNDSSRRSFRGQYVLVTRPAL---------------- 529

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIA---SGNELSASNTYRYDLIDLTR 363
                          +  P +WY+  +VI A +  I+   +   LS ++ +R+D++DLTR
Sbjct: 530 ---------------WQLPFVWYNPHDVILAWDHLISGLMTEPLLSNASNFRHDMVDLTR 574

Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
           Q++ +  + L+  ++E Y   ++  +  ++ + ++L++D+D LL     FLLG W+  AK
Sbjct: 575 QSMQEIFHLLYSQLLEVYLEKNSTAIEGIAYKMIDLLQDLDELLQTGKKFLLGKWIADAK 634

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
                E ++ QYEWNAR QIT+W    +     +RDY  K W+G++ DYY PR  ++ + 
Sbjct: 635 SWGTTEGEKLQYEWNARNQITLWGPRGE-----IRDYAAKQWAGVVADYYKPRWEVFIRE 689

Query: 484 MIESLESGDGFRLK 497
           M  SL+    F  K
Sbjct: 690 MQMSLDENRAFNKK 703


>gi|374385255|ref|ZP_09642763.1| hypothetical protein HMPREF9449_01149 [Odoribacter laneus YIT
           12061]
 gi|373226460|gb|EHP48786.1| hypothetical protein HMPREF9449_01149 [Odoribacter laneus YIT
           12061]
          Length = 736

 Score =  332 bits (850), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 187/540 (34%), Positives = 291/540 (53%), Gaps = 57/540 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF+G+VP   +  FP A++ Q+ NW     + R+  TYLL+A DPLF  IG  F+E+
Sbjct: 235 ILPAFTGHVPPTFKEHFPEARLRQV-NW-----EGRFDDTYLLEADDPLFQTIGNRFMEE 288

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
           Q++ +G T H+Y  DTF+E  PP +   Y+  +  A+Y  M + D +AVW+MQGWLF   
Sbjct: 289 QIRTFG-TDHLYGADTFNEMFPPSEDSTYLDGISKAVYQSMAAVDPEAVWVMQGWLFHDK 347

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW+P QMKA L +VP   L+VLDL+ E  PIW  ++ FYG P+IWCMLHNF G   ++
Sbjct: 348 RDFWKPAQMKAYLGAVPDEHLIVLDLWGEEFPIWDRTEAFYGKPWIWCMLHNFGGRNMLF 407

Query: 187 GILDSIAFGPVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           G    +A  P           ++G+G   EGIEQNPV+Y L+    +++  V++  W   
Sbjct: 408 GNALKLAEEPSRVLADPAKGQLLGLGAVPEGIEQNPVIYSLLFSHIWRNTAVELDEWFET 467

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           Y   RYG    A++ AW++L  TVY                            EG   NY
Sbjct: 468 YLESRYGCRDEAVEKAWDILRKTVY--------------------------ANEG---NY 498

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
              ++     +   +++ +  + Y+  EVI+A +  + + + L  +  YRYDLI + +Q 
Sbjct: 499 ESAITARPTFEKH-NNWAYTDIPYNPVEVIKAWKYLLQAADRLGENPCYRYDLILVGKQV 557

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
           LA YA  +     E Y+  D     + SR F+EL++DMD L+  H+ FLLG WLE A+  
Sbjct: 558 LANYATIIQQKFGEDYRTKDLPAFTRNSREFMELIDDMDELMGTHEAFLLGKWLEDARSW 617

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
            +   +++ YE NAR QIT+W      + ++L DY +K WSGL + +Y  R  ++   + 
Sbjct: 618 GKTASEKQLYEKNARDQITLW----GGKDAVLHDYASKQWSGLFKGFYKGRWQLFIDEVY 673

Query: 486 ESLESGDGF-------RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
           + +++G  +       R++ W  EW+       NG+  YP    GD ++ S+ ++ KY++
Sbjct: 674 DCIKTGRKYDHTASDDRVRSWEWEWV-------NGQEKYPAVPQGDPVVVSERMFGKYIK 726


>gi|440800773|gb|ELR21808.1| AlphaN-acetylglucosaminidase (NAGLU) [Acanthamoeba castellanii str.
           Neff]
          Length = 800

 Score =  332 bits (850), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 195/545 (35%), Positives = 291/545 (53%), Gaps = 69/545 (12%)

Query: 18  AALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSH 77
           A L+ V+P A IT    W       R    +LLD  DP+F +IG AFI+ Q + YG T H
Sbjct: 243 AGLKRVYPHANITLSPTWAHFTDPYR---VWLLDPFDPIFQKIGTAFIDAQTRVYG-TDH 298

Query: 78  IYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKA 137
           IYN DTF+E  PP   P Y+++   A+Y GM + D  A+WLMQGWLF    +W   ++KA
Sbjct: 299 IYNADTFNELDPPSADPTYLAAASNAVYQGMAAADPKALWLMQGWLFR-SVWWSNDRIKA 357

Query: 138 LLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 197
            L+ V    +++LDL+AEV PIWS ++ ++G P++WCMLH+F GN ++YG L  IA  PV
Sbjct: 358 YLSGVKNDNMLILDLYAEVDPIWSKTESYFGKPFVWCMLHDFGGNRDLYGNLTHIATAPV 417

Query: 198 EARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA 257
           +ART+  +TMVG G++ME IEQNPV+Y+LMSEM ++   VDV  W++ Y   RYG   P+
Sbjct: 418 DARTAPGSTMVGTGLTMEAIEQNPVIYELMSEMGWRSAHVDVDDWLDHYVSFRYGADSPS 477

Query: 258 IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKS 317
            + AW +L+ + Y                 P +  SI +    ++               
Sbjct: 478 AKKAWRLLHQSAYQN---------------PVIMRSIYTFVPNRH--------------- 507

Query: 318 ETSSYDHPHLWYSTSEVIRALELFIASGNEL----SASNTYRYDLIDLTRQALAKYANEL 373
             S   H    YS   ++ A  L + S  EL      +  + YDL+D+TRQ L    + L
Sbjct: 508 -VSRNHH----YSPDVLVEAWGLLLQSRLELPNPAQPNGPWEYDLVDVTRQVL----DNL 558

Query: 374 FLNIIEAYQLND-AHGVFQLSRR------------FLELVEDMDGLLACHDGFLLGPWLE 420
           F    +AY L D A+  +  +RR             ++++ D+D +LA +  +LLG W E
Sbjct: 559 FH---DAYGLLDGAYDAYVATRRDPFNQVKTIGAALIQILSDIDTVLATNQNYLLGVWTE 615

Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
            A+  A NEE+++ YE+NAR QIT+W  N +     + DY +K W+GL+  YY PR  I+
Sbjct: 616 RARSWATNEEEKRLYEFNARNQITLWGPNGE-----INDYASKEWAGLVGTYYRPRWQIF 670

Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGT 540
             Y+ +S+  G       +  + +     W N  N +P ++ G+    SQ LY +Y+   
Sbjct: 671 VAYLFDSIAKGTVIDPNKYAADLLLWEQRWNNQTNAFPSQATGNVAEVSQALYARYVSAA 730

Query: 541 GVFDH 545
            +  H
Sbjct: 731 ELKQH 735


>gi|255533286|ref|YP_003093658.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
 gi|255346270|gb|ACU05596.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
          Length = 734

 Score =  331 bits (849), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 181/538 (33%), Positives = 288/538 (53%), Gaps = 55/538 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LP+F+G+VP + ++ FP  K+      + +   P     Y+L+   P+F EIGR F+  
Sbjct: 227 ILPSFTGHVPPSFKDKFPDIKVNTQQ--WGINVSP----AYVLNPETPMFKEIGRKFLTA 280

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            +  +G T H+Y+ DTF+E TP  +   Y++ +   IY  M + D+ AVW+MQGW+F   
Sbjct: 281 LINTFG-TDHLYSADTFNEMTPVSNDSTYLNGMAKKIYESMAAVDTQAVWIMQGWMFLDR 339

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW+P QMKAL ++VP  KL+VLDL +E+ P+WS +  FYG  +IWCMLHNF G + M+
Sbjct: 340 PNFWQPTQMKALFSAVPQDKLIVLDLNSELNPVWSRTDAFYGEKWIWCMLHNFGGRLSMF 399

Query: 187 GILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           G +  I   P  A +  +   M G+G++MEGIEQNP +Y LM E  +  + +D+  W+  
Sbjct: 400 GDMSRIGNDPAAALKNDQRGKMSGIGLTMEGIEQNPAIYSLMLEHIWNDKPIDLDNWLKG 459

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           Y+ RRYG+     + AW VL +TVY+           +I   P  D + +          
Sbjct: 460 YAQRRYGKRNSNAEKAWEVLKNTVYSHQPWWG--TNTIITGRPTFDAATV---------- 507

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
                           + +  + YS+ E+++A    + + +EL +S+ ++YDL+D+TRQ 
Sbjct: 508 ----------------WTYTAIPYSSKELMKAWSYLLTASDELKSSDGFQYDLVDVTRQV 551

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
           LA YAN L  +   +Y+  D     + S +FLEL++D+D LL     FLLG W+ +AK L
Sbjct: 552 LANYANVLQQDFASSYKQKDMATFNKKSAQFLELIDDIDQLLGTRSDFLLGKWINNAKAL 611

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
             N  ++K +E NAR  IT+W D    +   + +Y  K W+G+++ +Y PR   +F  + 
Sbjct: 612 GDNPAEKKLFERNARDLITLWLD----KDCNIHEYACKEWAGMMKGFYKPRWQQFFDEVR 667

Query: 486 ESLESGDGF-------RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
               +G           +KDW  +W+       N    Y  +  G+ +  ++ LY KY
Sbjct: 668 LQASAGKEIDQIKFENTIKDWEWKWV-------NANEAYTDKPTGNPVTVAKALYAKY 718


>gi|242011515|ref|XP_002426494.1| alpha-N-acetylglucosaminidase, putative [Pediculus humanus corporis]
 gi|212510620|gb|EEB13756.1| alpha-N-acetylglucosaminidase, putative [Pediculus humanus corporis]
          Length = 1345

 Score =  328 bits (841), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 177/484 (36%), Positives = 282/484 (58%), Gaps = 46/484 (9%)

Query: 8    VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
            VLP+F G VP + ++ +P AK+ ++  W     D  +CC YLLD+ DPLF  + R F+++
Sbjct: 873  VLPSFCGIVPRSFKDSYPFAKLLEMPKWNKFSRD--YCCPYLLDSNDPLFSVVSRVFLKE 930

Query: 68   QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             + E+G T+HIYNCD F+EN P  +S +Y+S++ + IY  M S D  A WL+QGW+F  D
Sbjct: 931  YINEFG-TNHIYNCDVFNENKPASESLDYLSTISSTIYKAMSSVDPRATWLVQGWMF-ID 988

Query: 128  PFWRP-PQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
            PFW    ++KA +N+VP G++++LDL +++ P +   + ++G P+IWC LHNF G + MY
Sbjct: 989  PFWASLKRVKAFINAVPKGRMLILDLQSDLTPQYKRLQSYFGQPFIWCTLHNFGGQLGMY 1048

Query: 187  GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
            G L+ +  G  + R  +N+TMVG+G++ EGI+QN ++YD   ++A + + VD+  WI +Y
Sbjct: 1049 GHLNRVNLGVFKGRKFKNSTMVGIGIAPEGIDQNYIMYDFTLDLALRTKPVDLDDWITKY 1108

Query: 247  SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
            ++RRYG     I DAW +L +T+YN                PD +  + S     Y    
Sbjct: 1109 ALRRYGLIEKNILDAWLILKNTLYNYN--------------PDSNFRLTSSNVKMYTLVK 1154

Query: 307  KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELF-IASGNELSASNT-YRYDLIDLTRQ 364
                 + +L    S   +   WY+ S ++   E F IAS N + ++++ +++DLID+TRQ
Sbjct: 1155 GEHIAKNILTKFPSLRMNEFTWYNRSIILDIFEKFQIASSNSILSTSSLFQHDLIDVTRQ 1214

Query: 365  ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
             +                        + S  FLEL+ ++D +L     FLLG WLESAK 
Sbjct: 1215 TI--------------------QIAIENSNMFLELLNELDMILNTGKKFLLGNWLESAKN 1254

Query: 425  LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
            +A N+ ++  YE+NAR QIT+W  N +     +RDY  K W+G++ D+Y PR  ++F+ +
Sbjct: 1255 MATNKLEKDNYEFNARNQITLWGSNGE-----IRDYAAKQWAGMIHDFYKPRWKLFFQAL 1309

Query: 485  IESL 488
             ES+
Sbjct: 1310 NESI 1313


>gi|383856382|ref|XP_003703688.1| PREDICTED: alpha-N-acetylglucosaminidase [Megachile rotundata]
          Length = 744

 Score =  327 bits (839), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 181/489 (37%), Positives = 281/489 (57%), Gaps = 44/489 (8%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    VLP+F+G+VP A   +FP+A +T+L  W +      +CC YLL  TDPLF +IG+
Sbjct: 226 LGIIPVLPSFAGHVPRAFPRLFPNANVTKLAPWNNFPD--VYCCLYLLAPTDPLFQQIGQ 283

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F++  ++E+G T HIYNCDTF+EN P     +++ ++G + +  M + D DA+WLMQGW
Sbjct: 284 LFLKTYIEEFG-TDHIYNCDTFNENEPHTSELKFLRNVGHSTFQAMNAVDPDAIWLMQGW 342

Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           LF++D  FW  P+++A L SVP G+++VLDL +E  P +   K ++G P+IWCMLHNF G
Sbjct: 343 LFTHDKLFWTEPRVEAFLTSVPRGRMIVLDLQSEQFPQYGRLKSYFGQPFIWCMLHNFGG 402

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
            + M+G    I     E R  +N+TMVG G++ EGI QN V+Y+LM+EMA++ E V++  
Sbjct: 403 TLGMFGSAQIINQRVFEGRNMKNSTMVGTGLTPEGINQNYVIYELMNEMAYRKEPVNLNK 462

Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           W   Y+ RRYG        AW  L  TVYN +     + + VI   P ++ S  +     
Sbjct: 463 WFENYASRRYGVWNEYAVSAWQSLGRTVYNFSGTRKIRGKYVISRRPSLNLSTWT----- 517

Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
                                     WY    +     +F+ + +    S  YR+D++DL
Sbjct: 518 --------------------------WYDRDTLYNTWSVFLQARHGRRNSTLYRHDVVDL 551

Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
           TRQ L   A E++  +I+++   +       S + L+L +D++ +LA    FLLG WL++
Sbjct: 552 TRQVLQAKAEEIYPVLIDSFNKKNLTAFKYHSDKLLDLFDDLELILASGKDFLLGKWLDA 611

Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
           AK+LA N+E+ + Y+ NA+ QI++W    +     +RDY NK W+G++ DY+ PR +I+ 
Sbjct: 612 AKKLASNDEELRLYQVNAKYQISLWGPRGE-----IRDYANKQWAGVVADYFKPRWSIF- 665

Query: 482 KYMIESLES 490
              +ESLE+
Sbjct: 666 ---LESLEN 671


>gi|157134500|ref|XP_001656341.1| alpha-n-acetylglucosaminidase [Aedes aegypti]
 gi|108881379|gb|EAT45604.1| AAEL003150-PA [Aedes aegypti]
          Length = 763

 Score =  327 bits (838), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 185/529 (34%), Positives = 293/529 (55%), Gaps = 47/529 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
            LPAF+G++P     ++P AK+T + NW    +  ++     LD  DPLF EIG+ F+ +
Sbjct: 248 ALPAFAGHLPVQFAQLYPEAKLTPVENWNGFPA--QYASPLFLDPIDPLFQEIGKRFLTK 305

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            ++ YG ++HIY CD F+E  P   S +Y+SS  A IY  M   D  AVWL+QGW+F  +
Sbjct: 306 VIERYG-SNHIYFCDPFNEIQPRSFSAKYLSSASAGIYKAMNDVDPFAVWLLQGWMFVKN 364

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
           P+W    ++A L +VPLG+++VLDL +E  P +  ++ ++G P+IWCML NF G + M G
Sbjct: 365 PYWSDVAIRAFLQAVPLGRMLVLDLQSEQFPQYDRTESYHGQPFIWCMLSNFGGTLGMLG 424

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
            +D +     + RT+++ TM+G G++ EGI QN  +Y+   EM +     +V+ W   Y+
Sbjct: 425 SVDLVFQRIRDVRTNDSMTMIGTGITPEGINQNYGLYEFALEMGWNPNIDNVEEWFRTYA 484

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
             RYG     ++DAW++  +TVY+  +                      +  GKY    +
Sbjct: 485 SVRYGTQDKRLKDAWSMFRYTVYSFKEQ--------------------EMMRGKYTFNRR 524

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
           P  K            HP LWY+ +     ++L + S    S +  +R D++DLTRQ L 
Sbjct: 525 PSLKL-----------HPWLWYNETLFNAGVQLLLESN---STNTLFRNDVVDLTRQFLQ 570

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
             A+ L+LNI+EAY   + + V  LS  F +L+EDMD LL     FLLG WLESAK +A+
Sbjct: 571 NTADRLYLNIMEAYNTKNPNSVKYLSILFQKLLEDMDRLLRTDQHFLLGRWLESAKAVAE 630

Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
              + ++YE+NAR QIT+W    Q     + DY NK W+G+++D++ PR  ++   M + 
Sbjct: 631 TSLERQKYEYNARNQITLWGPQGQ-----IVDYANKQWAGMVQDFFLPRWKLFLTEMTKD 685

Query: 488 LESGDGF---RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLY 533
           +E        +++D   + ++L     N R  YP+  +GDAL+ ++ L+
Sbjct: 686 VEQNRTLNEGKVRDKIFKMVELPFCTSNKR--YPIRPDGDALLVARELF 732


>gi|255533285|ref|YP_003093657.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
 gi|255346269|gb|ACU05595.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
          Length = 749

 Score =  327 bits (837), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 191/541 (35%), Positives = 283/541 (52%), Gaps = 60/541 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAFSG+VP   +  FP+A++ +L NW     + R+  TY+L   DPLF +I   F+ +
Sbjct: 243 ILPAFSGHVPPTFKARFPNARVDRL-NW-----EGRFADTYVLHPDDPLFQQIADKFMAE 296

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
           Q K +G T H+Y  DTF+E   P     Y+  +G A+Y GM   D +A+W+MQGW+F   
Sbjct: 297 QDKAFGNTDHLYGADTFNEMYLPYTDTAYVRKIGTAVYKGMAKADPEAIWVMQGWMFWDK 356

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW+P  +K  L+ VP   L++LDLFA+ +PIW+ ++ F+G  +IWCMLHNF G   +Y
Sbjct: 357 RDFWKPEVVKNYLSGVPDDNLIMLDLFADEQPIWTKTEAFWGKKWIWCMLHNFGGRNPLY 416

Query: 187 GILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           G L+ I   P E     N   + G+G+  EGIEQNPVVY LM E  +  + +DVK+W+  
Sbjct: 417 GDLNYIGREPAEMVHDPNRGRLSGIGLVPEGIEQNPVVYSLMLEHVWNDQVIDVKSWLVN 476

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           Y+ RRYG+  P  + AW +L+ TVY                            EG Y+  
Sbjct: 477 YAQRRYGQRDPQTEKAWQILHQTVY--------------------------AKEGSYETI 510

Query: 306 --GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 363
              +P  ++    ++ +  D P   Y   +++ A    + + N    ++ Y++DL+ + R
Sbjct: 511 ISARPTHEK---HADWTGTDLP---YDGDKLVPAWTYLLNASNRFKNNDCYQFDLVTVGR 564

Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
           Q LA YA  L       ++  +       +  FL L+ DMD L+     FLLG WL  AK
Sbjct: 565 QVLANYATVLQRLFARDFRNKNLTAYRAHTAEFLTLIADMDKLMGTRKDFLLGKWLNDAK 624

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
           + A NE + + YE NAR  IT+W     ++AS L +Y NK W+GL   +YG R   +   
Sbjct: 625 KWATNESESRLYEKNARDLITLW---GGKDAS-LHEYANKQWAGLFNGFYGKRWQTFIAE 680

Query: 484 MIESLESGDGF-------RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
              +LE G  F       R+KDW  EW     +W NGR  Y  +  G+ +  S  L+ KY
Sbjct: 681 TSTALEQGKSFDQEAFETRMKDW--EW-----NWVNGREQYTDKPQGNPVTVSIQLHKKY 733

Query: 537 L 537
           +
Sbjct: 734 I 734


>gi|194759443|ref|XP_001961958.1| GF14678 [Drosophila ananassae]
 gi|190615655|gb|EDV31179.1| GF14678 [Drosophila ananassae]
          Length = 783

 Score =  325 bits (832), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 184/541 (34%), Positives = 295/541 (54%), Gaps = 54/541 (9%)

Query: 7   AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
             LPAF+G+VP AL  + P+   T +  W       R+CC   ++ T+PLF +I   F++
Sbjct: 264 VALPAFAGHVPRALSRLHPNTSFTDVQRWNQFPD--RYCCGLFVEPTEPLFHQIATTFLQ 321

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
             +  YG ++HI+ CD F+E  PPV  PEY+ S  AAI++ M + D +A+WL+QGW+F  
Sbjct: 322 SVVTIYG-SNHIFFCDPFNELEPPVAKPEYMRSTAAAIHNSMTAVDPEAIWLLQGWMFVK 380

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           +PFW P   +A L +VP G+++VLDL +E  P +  +  ++G P+IWCMLHNF G + M+
Sbjct: 381 NPFWTPDMAEAFLTAVPRGRILVLDLQSEQFPQYELTHSYFGQPFIWCMLHNFGGTLGMF 440

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G    I  G   AR+  N+++VG G++ EGI QN VVY L  E  +    +D+ +W   +
Sbjct: 441 GSAKLINSGIEAARSMPNSSIVGTGITPEGIGQNYVVYSLTLERGWSRNSIDLDSWFRHF 500

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           +V RYG    ++  AW +L ++VY+                       +    G+Y    
Sbjct: 501 TVTRYGVKDESLAKAWLLLKNSVYSFHG--------------------LQKMRGQYVVTR 540

Query: 307 KPVSKEAVLKSETSSYDH-PHLWYSTSEVIRALELFIASGN----ELSASNTYRYDLIDL 361
           +P            S++H P  WY+ S+V+ A  L +++      E    + Y +DL+D+
Sbjct: 541 RP------------SFNHDPFTWYNASDVLEAWHLLLSARVIIPLEDDRYDVYEHDLVDI 588

Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
           TRQ L   A++L++N+  +++         LS R L+L +D++ +L+    FLLG WLE 
Sbjct: 589 TRQFLQITADQLYVNLKSSFRKRQLPRFEFLSTRLLQLFDDLELILSSGRNFLLGNWLEQ 648

Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
           AKQ+A + E  K +E+NAR QIT W  N Q     + DY  K WSGL++DYY PR +++F
Sbjct: 649 AKQVAPHPEDRKSFEFNARNQITAWGPNGQ-----ILDYACKQWSGLVKDYYKPRWSLFF 703

Query: 482 KYMIESLES-----GDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
             +  +L S     G  F+ K  +R  I+L   + N  ++YP +   +    S  ++ ++
Sbjct: 704 DDVNVALHSQRPFNGSAFKQKVSQR--IELP--FSNKTDIYPTDPVENVWFISHTIFERW 759

Query: 537 L 537
           +
Sbjct: 760 M 760


>gi|390334740|ref|XP_003724005.1| PREDICTED: uncharacterized protein LOC100893810 [Strongylocentrotus
            purpuratus]
          Length = 1043

 Score =  324 bits (830), Expect = 9e-86,   Method: Compositional matrix adjust.
 Identities = 176/471 (37%), Positives = 257/471 (54%), Gaps = 41/471 (8%)

Query: 70   KEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF 129
            +E+  T HIYN DTF+EN P  +   Y+S+    +Y G+  GD   VWLMQGWLF    F
Sbjct: 573  EEFNGTDHIYNADTFNENQPRSNDSAYLSAASRGVYQGIVEGDPQGVWLMQGWLFQKTDF 632

Query: 130  WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGIL 189
            W P Q+KALL+ VP+G+++VLDLFAE +PI++ ++ FYG P+IWCMLHNF GN  +YG L
Sbjct: 633  WGPSQIKALLHGVPIGRMIVLDLFAEARPIYNATQSFYGQPFIWCMLHNFGGNTGLYGKL 692

Query: 190  DSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVR 249
            D++   P EAR   ++TM+G+G++ EGI QN V+Y+ +++M ++ E ++V  WI +YS R
Sbjct: 693  DAVNKFPFEARQFNSSTMIGMGLTPEGILQNYVMYNFLTDMTWRSESMNVSKWIEEYSGR 752

Query: 250  RY----GRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
            RY    G S  A + AW +L  TVYN T    D       A P V PS          N 
Sbjct: 753  RYSPESGHSEEAAK-AWAILQATVYNNTGIDKDHQH----AVPVVRPS----------NK 797

Query: 306  GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
             K V                 +WY  +EV +A    + +   L  S+ +RYDL+D+TR  
Sbjct: 798  TKSV-----------------IWYDYTEVAKAWGFLLQASETLGTSSLFRYDLVDVTRNV 840

Query: 366  LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
            L   A + +  I+ ++   +   +         L+ DMD + + H  +LLG WLE AK L
Sbjct: 841  LQDLAFDFYEQIMVSFHAKNITAIRGNGTLLCNLILDMDNITSSHQDWLLGTWLEDAKSL 900

Query: 426  AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
            A N ++E  YE+NAR QIT+W    +       DY NK W GLLR YY  R  ++ +++ 
Sbjct: 901  ATNHKEESLYEYNARNQITVWGPRGEH-----LDYANKQWGGLLRSYYYNRWQLFVQFLD 955

Query: 486  ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
              +E    +    +      +  +W N    +P +  GD +  S+ LY+KY
Sbjct: 956  GCIELHVPYDQSKFDMRSFIMETEWTNSTEKFPTKPVGDTVSISRALYSKY 1006


>gi|449675146|ref|XP_002156234.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Hydra
           magnipapillata]
          Length = 646

 Score =  323 bits (827), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 175/530 (33%), Positives = 280/530 (52%), Gaps = 36/530 (6%)

Query: 8   VLPAFSGNVPAAL-QNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
           VLP F G++P AL   +FP++K  +L  W   K   ++  T+LLD  DPLF ++G AF+E
Sbjct: 126 VLPGFGGHIPKALVSRLFPTSKYYKLKPW--NKFTGKYGGTFLLDPQDPLFKKVGAAFVE 183

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
            Q + Y  T H+YN D F+E  PP  +  +I++    +Y+ M + DSDAVWLMQGW+F  
Sbjct: 184 MQKQLYNGTDHVYNADIFNEMDPPQLTSAFITNTSIGVYNAMLASDSDAVWLMQGWMF-L 242

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W+P  ++A L ++P GKL++LDL +++ P++  +  FYG P+IWCM+ NF G   +Y
Sbjct: 243 SSVWKPELVEAWLQAIPYGKLIILDLASDIYPLYDQTNAFYGHPFIWCMIENFGGTTRLY 302

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G L  +  G + AR +  + M+G GM+ EGI QN + ++LM+EM +++E+ ++  W   Y
Sbjct: 303 GQLTGVMKGVISARKTYKSFMIGTGMTPEGINQNDINFELMNEMGWRNEEFNISDWTLSY 362

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
             RRYG     + DAW +L  T+YNC DG  +   D  +  P + P +         N  
Sbjct: 363 IKRRYGDYPKMVSDAWLILIDTIYNCNDGRENGGYDGRI--PVMRPQL---------NAK 411

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
            PV                H+WYS  ++  A +L +   + +   +T+R DL+ L  Q L
Sbjct: 412 LPV----------------HMWYSIKDLYNAWKLMVKGSDYMPLIDTFRNDLVRLGTQVL 455

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
              +   +  ++  Y       V +   +   L+ DMD LLA     LLG W++SA+ + 
Sbjct: 456 EDLSIVFYTQMVSGYFNKSTLNVEKYGSKITVLLTDMDRLLATDQYSLLGRWIQSARSMG 515

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
               + K  E+NA+ QIT+W  N +     +RDY NK W+GL+  +Y  R  ++  ++ +
Sbjct: 516 DTLNETKLLEYNAKNQITLWGPNGE-----IRDYANKNWAGLVGSFYFERWNMFINFLSD 570

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           SL+ G  +    +  + ++    W N    +  +  GDA   S  L   Y
Sbjct: 571 SLKRGVPYDDSAFVSKLLQFEKKWNNEIKEFSADPTGDAFGISHQLLRAY 620


>gi|328867411|gb|EGG15793.1| alpha-N-acetylglucosaminidase [Dictyostelium fasciculatum]
          Length = 1501

 Score =  321 bits (823), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 188/551 (34%), Positives = 299/551 (54%), Gaps = 66/551 (11%)

Query: 8    VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
            VLP F+G+VP A  ++FP+A ITQLG+W +         TY LD +DPLF  + + F++ 
Sbjct: 987  VLPGFAGHVPRAFMSLFPTANITQLGDWRAFNG------TYYLDPSDPLFANVSQTFVKV 1040

Query: 68   QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            Q   YG T H Y+ D F+E TPP     Y+ +  +++Y+ +   D  AVW++Q W F  D
Sbjct: 1041 QTAIYG-TDHYYSFDPFNEITPPSSDAGYLQNSSSSMYNALAYADPQAVWVLQAWFFISD 1099

Query: 128  P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              FW+PPQ+KA L  VP+G L+VLD +AE  P W+ + QF G  +IWCMLHNF G   MY
Sbjct: 1100 AWFWQPPQVKAFLGGVPIGHLLVLDTWAEESPAWTVTDQFNGHDWIWCMLHNFGGRTGMY 1159

Query: 187  GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
            G +  I  GP++AR  ++  M G G++ E IEQN ++YDLMSEM+++    ++  WINQY
Sbjct: 1160 GKIPRITAGPIDAR-KQSPGMKGTGLTPEAIEQNYIMYDLMSEMSWRTTAPNMTEWINQY 1218

Query: 247  SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
            + RRYG  VP +  AWN L  TVYN  D + DKN     +F  + P +            
Sbjct: 1219 TQRRYGVFVPELAQAWNSLASTVYNAPD-SIDKNPS---SFVGIRPELNMTN-------- 1266

Query: 307  KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE-LSASNTYRYDLIDLTRQA 365
                               +++Y +S + +A +L+++  +E + +++TY +D+ ++T QA
Sbjct: 1267 -------------------NIYYDSSIIQKAWQLYLSVTDEYVLSTSTYSFDIAEITIQA 1307

Query: 366  LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ- 424
            L+    E  + + +AY+        + +   L ++ DMD + +     L+G W  +A+Q 
Sbjct: 1308 LSNLFIETEIAMYDAYKTGKGTEFDEHAMNCLNIITDMDMIASTQQLLLVGTWTANARQW 1367

Query: 425  ----LAQNEEQEK----------QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
                L++N+++++          QYE+NAR QIT+W        S L DY    WSGLL 
Sbjct: 1368 ANYNLSRNKDEDRNTDKEQMTIEQYEFNARNQITLW----GPSNSTLHDYAYHLWSGLLN 1423

Query: 471  DYYGPRAAIYFKYM------IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD 524
            D+Y  R +++ KY+        + ++G GF+ +++  +   L   W      YP    G+
Sbjct: 1424 DFYLARWSLFIKYLDSSLSSSSTNDAGTGFKNQEYINDIESLEESWNLQTYQYPTRPTGN 1483

Query: 525  ALITSQWLYNK 535
            A   S+++ N+
Sbjct: 1484 AYQLSKFINNQ 1494


>gi|198476648|ref|XP_001357424.2| GA12255 [Drosophila pseudoobscura pseudoobscura]
 gi|198137793|gb|EAL34493.2| GA12255 [Drosophila pseudoobscura pseudoobscura]
          Length = 767

 Score =  320 bits (820), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 182/533 (34%), Positives = 291/533 (54%), Gaps = 44/533 (8%)

Query: 9   LPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQ 68
           LPAF+G++P A++ ++P+   T++  W S   DP +CC   +D  DP+F  +   F+ + 
Sbjct: 250 LPAFAGHLPTAMRRIYPNGNYTEVERWNSFP-DP-YCCGLFVDPLDPIFDLVAALFLRRV 307

Query: 69  LKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP 128
           ++ YG ++HI+ CD F+E  PPV  P+Y+ S  AAI++ M+S D +AVWL+QGW+F  + 
Sbjct: 308 VQRYG-SNHIFFCDPFNELQPPVAEPDYMRSTAAAIHNSMRSVDPEAVWLLQGWMFVKNI 366

Query: 129 FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI 188
           FW    M+A L +VP+G+L+VLDL +E  P +  +  +YG P++WCMLHNF G + M+G 
Sbjct: 367 FWTDAMMEAFLTAVPIGRLIVLDLQSEQFPQYQRTDSYYGQPFVWCMLHNFGGTLGMFGS 426

Query: 189 LDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSV 248
            D +  G   AR   N+++VGVG++ EGI QN V+Y L+ E  +    +D+ +W   ++ 
Sbjct: 427 ADLVNNGIEAARRMPNSSIVGVGITPEGIGQNYVMYSLVLERGWSELPLDLDSWFKHFAR 486

Query: 249 RRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKP 308
            RYG     +Q AW +L  +VY+       K R                        G  
Sbjct: 487 TRYGVDDEGLQQAWQLLRRSVYSFR--GLQKMRG-----------------------GYT 521

Query: 309 VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN----TYRYDLIDLTRQ 364
           V++   L  +      P  WY+ S+V+ A +L ++S   +   +     Y +DL+D+TRQ
Sbjct: 522 VTRRPALNLD------PFTWYNASDVLEAWKLLLSSRAIIPLEDDNYAIYEHDLVDITRQ 575

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            L   A++L++N+  AY+         L  + L+L  D++ +LA    FLLG WL  A++
Sbjct: 576 YLQISADQLYVNLKSAYRKRQVARFEYLGSKLLQLFGDLERILASGSNFLLGTWLADAQR 635

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
            A N   +  +E+NAR QIT W  + Q     + DY  K WSGL+ DYY PR A++   +
Sbjct: 636 AAPNAADKPNFEFNARNQITAWGPDGQ-----ILDYACKQWSGLVLDYYRPRWALFLDDV 690

Query: 485 IESLESGDGFRLKDWR-REWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
             +L S   F    ++ R   ++   + N  +VYPVE  G+    SQ +Y ++
Sbjct: 691 TLALHSNRTFNSTAFKLRVSQEVELPFSNKSDVYPVEPMGNTWFISQNIYERW 743


>gi|348681836|gb|EGZ21652.1| hypothetical protein PHYSODRAFT_247428 [Phytophthora sojae]
          Length = 991

 Score =  319 bits (818), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 181/541 (33%), Positives = 291/541 (53%), Gaps = 43/541 (7%)

Query: 9   LPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQ 68
           LPAF+G+VP  L+   P+A  T+  NW +     ++CC Y+++ TDPL+ EIG+AF+E+Q
Sbjct: 462 LPAFAGHVPEDLKLTLPNANFTRSPNWGNFTD--QYCCVYMIEPTDPLYREIGKAFLEEQ 519

Query: 69  LKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP 128
              Y  TS +Y CDT+ E  P       +     A+  GM + D +AVWLMQGW F  DP
Sbjct: 520 RALYNYTSSLYQCDTYMEMAPEFTDLSELKGAARAVIDGMTAADPNAVWLMQGWPFVDDP 579

Query: 129 -FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
            +W  P++KA L  VP  KL++LD ++E  PIW+    ++G  +I+ +LHNF GN  M G
Sbjct: 580 HYWTRPRVKAYLEGVPTDKLIILDFYSEAVPIWNKMDNYFGKNWIYSVLHNFGGNTGMRG 639

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
            L ++A  PV+A+   N TMVGVG++MEGI QN VVYDL  +MA++   +DV  W+++Y+
Sbjct: 640 DLPTLATAPVQAQRDGNGTMVGVGLTMEGIFQNYVVYDLTLQMAWEDSPLDVDEWVSKYA 699

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
            RRY      ++ AW+ L  +VYN T           +A+  V  S++ +       Y +
Sbjct: 700 SRRYHTQNEHVERAWSYLSRSVYNRT-----------LAYGGVTKSLVCLIPHWRLLYDR 748

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
                         +    + Y   +++ A +  + +G+EL   +TYR+DL+D+T+Q L+
Sbjct: 749 --------------FQPTLIKYDPKDIVLAWKELLLAGDELRNVDTYRHDLVDVTKQFLS 794

Query: 368 KYANELF--LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
               E +  L +I + +   A+ V +L++  L  +  ++ +LA ++ FLLG W+  A  L
Sbjct: 795 NKLLEQYQHLKVIYSAKSAPANEVCELTKTMLTTINRLEEILATNEDFLLGNWVADALNL 854

Query: 426 AQN----------EEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
           A +           + ++ YE+ AR Q+T W DN  E    + DY  K W+GL++ YY P
Sbjct: 855 AGDLNIGGDSVTRTKLQEYYEYEARNQVTRWGDNNNEA---IHDYAGKEWAGLVKSYYLP 911

Query: 476 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 535
           R  ++   +  +         K  ++  I     WQ     YP  + GD+   S+ +Y++
Sbjct: 912 RWTMWLTEVCSAYTDRREMDEKGLKKRRIAFELKWQLSHEKYPTTTVGDSFSISKRIYSE 971

Query: 536 Y 536
           Y
Sbjct: 972 Y 972


>gi|195155652|ref|XP_002018715.1| GL25802 [Drosophila persimilis]
 gi|194114868|gb|EDW36911.1| GL25802 [Drosophila persimilis]
          Length = 767

 Score =  319 bits (817), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 181/533 (33%), Positives = 291/533 (54%), Gaps = 44/533 (8%)

Query: 9   LPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQ 68
           LPAF+G++P A++ ++P+   T++  W S   DP +CC   +D  DP+F  +   F+ + 
Sbjct: 250 LPAFAGHLPTAMRRIYPNGNYTEVERWNSF-PDP-YCCGLFVDPLDPIFDLVAALFLRRV 307

Query: 69  LKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP 128
           ++ YG ++HI+ CD F+E  PPV  P+Y+ S  AAI++ M+S D +AVWL+QGW+F  + 
Sbjct: 308 VQRYG-SNHIFFCDPFNELQPPVAEPDYMRSTAAAIHNSMRSVDPEAVWLLQGWMFVKNI 366

Query: 129 FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI 188
           +W    M+A L +VP+G+L+VLDL +E  P +  +  +YG P++WCMLHNF G + M+G 
Sbjct: 367 YWTDAMMEAFLTAVPIGRLIVLDLQSEQFPQYQRTDSYYGQPFVWCMLHNFGGTLGMFGS 426

Query: 189 LDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSV 248
            D +  G   AR   N+++VGVG++ EGI QN V+Y L+ E  +    +D+ +W   ++ 
Sbjct: 427 ADLVNNGIEAARRMPNSSIVGVGITPEGIGQNYVMYSLVLERGWSELPLDLDSWFKHFAR 486

Query: 249 RRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKP 308
            RYG     +Q AW +L  +VY+       K R                        G  
Sbjct: 487 TRYGVDDEGLQQAWQLLRRSVYSFR--GLQKMRG-----------------------GYT 521

Query: 309 VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN----TYRYDLIDLTRQ 364
           V++   L  +      P  WY+ S+V+ A +L ++S   +   +     Y +DL+D+TRQ
Sbjct: 522 VTRRPALNLD------PFTWYNASDVLEAWKLLLSSRAIIPLEDDKYAIYEHDLVDITRQ 575

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            L   A++L++N+  AY+         L  + L+L  D++ +LA    FLLG WL  A++
Sbjct: 576 YLQISADQLYVNLKSAYRKRQVARFEYLGSKLLQLFGDLEHILASGSNFLLGTWLADAQR 635

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
            A N   +  +E+NAR QIT W  + Q     + DY  K WSGL+ DYY PR A++   +
Sbjct: 636 AAPNAADKPNFEFNARNQITAWGPDGQ-----ILDYACKQWSGLVLDYYRPRWALFLDDV 690

Query: 485 IESLESGDGFRLKDWR-REWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
             +L S   F    ++ R   ++   + N  +VYPVE  G+    SQ +Y ++
Sbjct: 691 TLALHSNRTFNSTAFKLRVSQEVELPFSNKSDVYPVEPMGNTWFISQNIYERW 743


>gi|348681870|gb|EGZ21686.1| hypothetical protein PHYSODRAFT_495971 [Phytophthora sojae]
          Length = 692

 Score =  317 bits (812), Expect = 9e-84,   Method: Compositional matrix adjust.
 Identities = 173/538 (32%), Positives = 285/538 (52%), Gaps = 39/538 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
            LPAF+G+VP  L+ ++P+AK T+  NW     +  +CC Y+LD  DPL+ EIG+ F+E+
Sbjct: 160 ALPAFAGHVPEELKTLYPNAKFTRSPNWGGFSDE--FCCVYMLDPQDPLYYEIGKTFLEE 217

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q   Y  TS +Y CDT++E  P    P  + +   A+   M + D +AVWL+QGWLF   
Sbjct: 218 QRALYDYTSSLYQCDTYNEMDPDFTDPAKLQAASRAVIDSMTAADPNAVWLIQGWLFVNS 277

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P +W   +++  L+ VP  K+++LDL++EV+P+W+    ++G  +I+C+LHNF GN  M 
Sbjct: 278 PNYWTKERVQTYLDGVPNDKMIILDLYSEVRPVWNKMDNYFGKSWIYCVLHNFGGNTGMR 337

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G L ++   PV A  + + TM+G+G++MEGI QN VVYDL  +MA+    +D+  W+  +
Sbjct: 338 GDLPTLGTAPVLANRASSGTMIGMGLTMEGIFQNYVVYDLTLQMAWVDAPLDMDEWVPSF 397

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + +RY       + AW  L  +VYN T G     + ++   P                  
Sbjct: 398 AAQRYHSQDAHTERAWGFLLQSVYNRTLGYGGVTKSLVCLIPHWK--------------- 442

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
             + ++  + +  +        Y   ++ RA +  + +G+EL A +TYR+DL+D+TRQ L
Sbjct: 443 --LVRDGFMPTLIT--------YDPMDITRAWKELLLAGSELHAVDTYRHDLVDVTRQFL 492

Query: 367 AKYANELFLNIIEAYQLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
           + +    +L++ + Y   +  A  +   + R L  +E +D +LA +D FLLG W+  A+ 
Sbjct: 493 SDHFMAQYLHLEDMYAGKETPADQLCAWTDRMLVTIEWLDEILATNDDFLLGNWVADARA 552

Query: 425 LAQN------EEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
           LA           +  YE+ AR Q+T W DN  E    + DY  K W+GL+  YY PR  
Sbjct: 553 LADEVGAAEVTSLQDYYEYEARNQVTRWGDNNSES---IHDYAGKEWAGLVSGYYLPRWR 609

Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++   + +S            ++  +    +WQ     YP  + GD L  S+ +Y ++
Sbjct: 610 MWLTEVCQSYTQKRDVNEAALKKARVDFELNWQLSHERYPTTTTGDTLAVSKRIYEEF 667


>gi|428176410|gb|EKX45295.1| hypothetical protein GUITHDRAFT_51145, partial [Guillardia theta
           CCMP2712]
          Length = 680

 Score =  317 bits (812), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 179/552 (32%), Positives = 304/552 (55%), Gaps = 41/552 (7%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    VLPAF+G VP  ++++FP AK T+ GNW         CC  ++D TDPLF++IG+
Sbjct: 153 LGMLPVLPAFAGGVPEGMKSLFPEAKFTRHGNWGGFAEQH--CCVMMVDPTDPLFLKIGK 210

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSP----EYISSLGAAIYSGMQSGDSDAVWL 118
            F+E+    YG ++HIY+CDTF+EN P  +      +++S    A++  M++ D DAVWL
Sbjct: 211 MFVEEVRAVYG-SNHIYSCDTFNENRPRSEHGSVGLDFLSHSSRAVFESMRAADPDAVWL 269

Query: 119 MQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVP-----YI 172
           MQGWLF  D  FW+  ++ A L+ VP  ++++LDLF +V P+W         P     ++
Sbjct: 270 MQGWLFMNDARFWQKRELDAYLSGVPEDRMIILDLFTDVFPVWKRRDLQRPTPIEKRRWV 329

Query: 173 WCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
           W MLH+F GN  MYG L  I+  PV A+  E+ TMVGVG++ EGIEQNPVVY++M+EM +
Sbjct: 330 WNMLHSFGGNSGMYGRLQVISKDPVVAK-KESQTMVGVGITTEGIEQNPVVYEMMAEMRW 388

Query: 233 QHEKVDVKAWINQYSVRRYG----RSVPAI-QDAWNVLYHTVYNCTDGATDKNRDVIVAF 287
           + ++VDV +W+ +++ RR G    R   A+ ++AW  L  TVY+C      + + ++ + 
Sbjct: 389 REQEVDVMSWVEKWADRRLGPEASRERKALGEEAWRELASTVYSCPGTQMGQVKSMVESR 448

Query: 288 PDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE 347
           P +D +   +    +    +   +EA++++          W      ++ L       + 
Sbjct: 449 PRLDLASGWIPNSDFMPIKRHYPEEALVRA----------W------LKLLRATRGGADG 492

Query: 348 LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLL 407
            + S++  +D+ D+TRQ L+     LF  +    Q   A       +  L ++ DMD ++
Sbjct: 493 YTCSSSASFDIADVTRQVLSDLFARLFQPLSSFCQTRLAGSAAVRMQTLLGIISDMDKMV 552

Query: 408 ACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSG 467
                 LLG W+E A+   +++E+E+  E+NAR  +T+W    +     + DY +K W G
Sbjct: 553 GTQPRMLLGKWIEDARAWGKSKEEEEVLEFNARNLVTLWGPRGE-----IADYASKQWQG 607

Query: 468 LLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN-GRNVYPVESNGDAL 526
           LL DYY  R  ++F+++ +++     F  + +++E +     WQ    + +P    G+A+
Sbjct: 608 LLSDYYMSRWKLFFEHLQQAIRGTRIFSQQRFQQELLVFEQQWQTRTSSSFPSSPEGNAV 667

Query: 527 ITSQWLYNKYLQ 538
             +  L++KY++
Sbjct: 668 ELAWQLHDKYIK 679


>gi|170060634|ref|XP_001865888.1| alpha-N-acetyl glucosaminidase [Culex quinquefasciatus]
 gi|167879069|gb|EDS42452.1| alpha-N-acetyl glucosaminidase [Culex quinquefasciatus]
          Length = 761

 Score =  316 bits (809), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 189/541 (34%), Positives = 290/541 (53%), Gaps = 60/541 (11%)

Query: 1   MLLADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEI 60
           M+LA    LPAF+G++P   + +FP AK+  +  W    +  ++     LD  DPLF +I
Sbjct: 244 MILA----LPAFAGHLPVQFKTLFPQAKLNPVEVWNGFPA--QYASPLFLDPVDPLFQKI 297

Query: 61  GRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQ 120
           G  F+ + +  YG T HIY  D F+E  P  +S  Y++S  A IY  M   D  AVWL+Q
Sbjct: 298 GSKFVAKAIARYG-TDHIYFSDPFNEIQPRSESARYLASAAAGIYQAMVDVDPLAVWLLQ 356

Query: 121 GWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFA 180
           GW+   +PFW    +KA   +VP G+++VLDL +E  P +  ++ +YG P+IWCML NF 
Sbjct: 357 GWMLVKNPFWSDRAIKAFFTAVPNGRMLVLDLQSEQFPQYVRTQSYYGQPFIWCMLSNFG 416

Query: 181 GNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVK 240
           G + M G +D +     E R++E+ TM+G G++ EGI QN  +Y+   EM +  +  DV 
Sbjct: 417 GTLGMLGSVDLVFERIRETRSNESMTMIGTGITPEGINQNYGLYEFALEMGWNPDISDVD 476

Query: 241 AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
            W  +Y++ RYG     +QDAW++   TVY+                       + +  G
Sbjct: 477 NWFTRYAMVRYGNDDKRLQDAWSIFRSTVYSFKG--------------------MEMMRG 516

Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIA--SGNELSASNTYRYDL 358
           KY    +P  K             P +WY+ +     +EL +A    NEL     +++D+
Sbjct: 517 KYTFNRRPSLKL-----------QPWVWYNETRFDEGVELILAVNGSNEL-----FKHDV 560

Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
           +DLTRQ L   A++L+L I++ Y L +A      S  F EL++++D LLA +  FLLG W
Sbjct: 561 VDLTRQFLQNTADKLYLTIMDTYTLKNAAAFKHYSNLFKELLQNIDRLLATNTHFLLGRW 620

Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
           LESAK LA    + ++YE+NAR QIT+W    Q     + DY NK WSG+++D++ PR +
Sbjct: 621 LESAKSLATTSLERQKYEYNARNQITLWGPQGQ-----IVDYANKQWSGVVQDFFLPRWS 675

Query: 479 IYFKYMIESLESGDGF-----RLKDWRREWIKLTNDWQNGRNVYPVESNG-DALITSQWL 532
           ++ + M  +L +         R K +R+  +    D    R  YP E++G DAL  ++ L
Sbjct: 676 LFLQEMELALATNGTINETKVRDKIFRKVELPFNTD----RKKYPAEASGEDALELAREL 731

Query: 533 Y 533
           Y
Sbjct: 732 Y 732


>gi|195577611|ref|XP_002078662.1| GD22403 [Drosophila simulans]
 gi|194190671|gb|EDX04247.1| GD22403 [Drosophila simulans]
          Length = 778

 Score =  315 bits (806), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 183/533 (34%), Positives = 283/533 (53%), Gaps = 52/533 (9%)

Query: 7   AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
             LPAF+G+VP AL+ + P +   ++  W       R+CC   ++ TD LF EI   F++
Sbjct: 259 VALPAFAGHVPRALKRLHPESTFMEVQRWNQFPD--RYCCGLFVEPTDNLFKEIASRFLQ 316

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
             + +YG ++HI+ CD F+E  PPV  PEY+ S  AAIY  M+  D  A+WL+QGW+F  
Sbjct: 317 NIITKYG-SNHIFFCDPFNELEPPVAKPEYMRSTAAAIYESMRGIDPQAIWLLQGWMFVK 375

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           +PFW     +A L + P G+++VLDL +E  P +  ++ ++G P+IWCMLHNF G + M+
Sbjct: 376 NPFWTTDMAEAFLTAAPRGRILVLDLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMF 435

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G    I  G  EAR   N+++VG G++ EGI QN V+Y    E  + +  +D+ +W   +
Sbjct: 436 GSAKLINSGIDEARRLPNSSLVGTGITPEGIGQNYVMYSFTLERGWSNTSLDLDSWFTNF 495

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           S  RYG     ++ AW +L ++VY+                       +    G+Y    
Sbjct: 496 SHTRYGVKDERLEQAWLLLKNSVYSFRG--------------------LQKMRGQY---- 531

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN----TYRYDLIDLT 362
                  V+    S    P  WY+ S V+ A  L + S   +   +     Y +DL+D+T
Sbjct: 532 -------VVTRRPSFNQEPFTWYNASAVLDAWHLLLTSRAIIPLEDDRYEIYEHDLVDIT 584

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
           RQ L   A++L++N+  AY+         LS + L+L +DM+ +LA    FLLG WL+ A
Sbjct: 585 RQFLQISADQLYVNLRSAYRKRQVARFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQA 644

Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           KQ A N  +++ +E+NAR QIT W  + Q     + DY  K WSGL+ DYY PR  ++ +
Sbjct: 645 KQAAPNTGEQRNFEFNARNQITAWGPDGQ-----ILDYACKQWSGLVSDYYRPRWRLFLE 699

Query: 483 YMIESLES-----GDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
            +  +L +     G  F+LK    + I+L   + N  +VYPV   G+  + SQ
Sbjct: 700 DVTVALHAGRPYNGTAFKLK--VSQEIELP--FSNKADVYPVTPVGNTWLISQ 748


>gi|194863164|ref|XP_001970307.1| GG23441 [Drosophila erecta]
 gi|190662174|gb|EDV59366.1| GG23441 [Drosophila erecta]
          Length = 778

 Score =  313 bits (803), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 184/533 (34%), Positives = 283/533 (53%), Gaps = 52/533 (9%)

Query: 7   AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
             LPAF+G+VP AL+ + P +   ++  W       R+CC   L+ TD LF EI   F++
Sbjct: 259 VALPAFAGHVPRALKRLHPGSTFMEVQRWNQFPD--RYCCGLFLEPTDNLFNEIALIFLQ 316

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
           + +  YG ++HI+ CD F+E  PPV  PEY+ S  AAIY  ++  D  A+WL+QGW+F  
Sbjct: 317 KIITAYG-SNHIFFCDPFNELEPPVAKPEYMRSTAAAIYESIRRLDPQAIWLLQGWMFVK 375

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           +PFW     +A L + P G+++VLDL +E  P +  ++ ++G P+IWCMLHNF G + M+
Sbjct: 376 NPFWTTDMAEAFLTAAPRGRILVLDLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMF 435

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G    I  G  EAR   N+++VG G++ EGI QN V+Y    E  + +  +D+ +W   +
Sbjct: 436 GSAKLINSGIEEARRLPNSSLVGTGITPEGIGQNYVMYSFTLERGWSNRPLDLDSWFTSF 495

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           S  RYG     ++ AW  L ++VY+                       +    G+Y    
Sbjct: 496 SHARYGVKDERLEQAWLQLKNSVYSFHG--------------------LQKMRGQYVVTR 535

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN----TYRYDLIDLT 362
           +P  K+            P  WY+ S V+ A  L ++S   +   +     Y +DL+D+T
Sbjct: 536 RPSFKQ-----------EPFTWYNASAVLDAWHLLLSSRAIIPLEDDRYEMYEHDLVDIT 584

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
           RQ L   A++L++N+  AY+         LS + L+L +DM+ +LA    FLLG WL+ A
Sbjct: 585 RQFLQISADQLYVNLRSAYKKRQVSRFEFLSSKLLKLFDDMELILASSRNFLLGNWLQQA 644

Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           KQ A +  +++ YE+NAR QIT W  + Q     + DY  K WSGL+ DYY PR  ++ +
Sbjct: 645 KQAAPHPGEQRNYEFNARNQITAWGPDGQ-----ILDYACKQWSGLVSDYYRPRWRLFLE 699

Query: 483 YMIESLES-----GDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
            +  +L S     G  F+LK    + I+L   + N  +VYPV   G+    SQ
Sbjct: 700 DVTVALHSLRPFNGTAFKLK--VSQEIELP--FSNKVDVYPVTPVGNTWFISQ 748


>gi|21356587|ref|NP_652045.1| CG13397, isoform A [Drosophila melanogaster]
 gi|442626853|ref|NP_001260251.1| CG13397, isoform B [Drosophila melanogaster]
 gi|16185856|gb|AAL13967.1| LP03571p [Drosophila melanogaster]
 gi|22945953|gb|AAF52672.2| CG13397, isoform A [Drosophila melanogaster]
 gi|440213562|gb|AGB92787.1| CG13397, isoform B [Drosophila melanogaster]
          Length = 778

 Score =  313 bits (802), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 179/532 (33%), Positives = 282/532 (53%), Gaps = 50/532 (9%)

Query: 7   AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
             LPAF+G+VP AL+ + P +   ++  W       R+CC   ++ T+ LF EI   F+ 
Sbjct: 259 VALPAFAGHVPRALKRLNPESTFMEVQRWNQFPD--RYCCGLFVEPTENLFKEIASRFLH 316

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
             + +YG ++HI+ CD F+E  PPV  PEY+ S  AAIY  M+  D  A+WL+QGW+F  
Sbjct: 317 NIITKYG-SNHIFFCDPFNELEPPVAKPEYMRSTAAAIYESMRGIDPQAIWLLQGWMFVK 375

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           +PFW     +A L + P G+++VLDL +E  P +  ++ ++G P+IWCMLHNF G + M+
Sbjct: 376 NPFWTTDMAEAFLTAAPRGRILVLDLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMF 435

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G    I  G  EAR   N+++VG G++ EGI QN V+Y    E  + +  +D+ +W   +
Sbjct: 436 GSAKLINSGIEEARRLPNSSLVGTGITPEGIGQNYVMYSFTLERGWSNTSLDLDSWFTNF 495

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           S  RYG     ++ AW +L ++VY+                       +    G+Y    
Sbjct: 496 SHSRYGVKDERLEQAWLLLKNSVYSFRG--------------------LQKMRGQY---- 531

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT----YRYDLIDLT 362
                  V+    S    P  WY+ S V+ A  L +     +   +     Y +DL+D+T
Sbjct: 532 -------VVTRRPSFNQEPFTWYNASAVLDAWHLLLTFRAIIPLEDNRYEIYEHDLVDIT 584

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
           RQ L   A++L++N+  AY+         LS + L+L +DM+ +LA    FLLG WL+ A
Sbjct: 585 RQFLQISADQLYINLRSAYRKRQVSRFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQA 644

Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           KQ A N  Q++ +E+NAR QIT W  + Q     + DY  K WSGL+ DYY PR  ++ +
Sbjct: 645 KQAAPNTGQQRNFEFNARNQITAWGPDGQ-----ILDYACKQWSGLVSDYYRPRWRLFLE 699

Query: 483 YMIESLESGDGFRLKDWRREWIKLTND----WQNGRNVYPVESNGDALITSQ 530
            +  +L +G  F    ++   +K++++    + N  +VYPV   G+  + SQ
Sbjct: 700 DVTVALHAGRPFNGTAFK---LKVSHEIELPFSNKDDVYPVTPVGNTWLISQ 748


>gi|158300970|ref|XP_320760.4| AGAP011750-PA [Anopheles gambiae str. PEST]
 gi|157013415|gb|EAA00039.4| AGAP011750-PA [Anopheles gambiae str. PEST]
          Length = 770

 Score =  312 bits (799), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 178/535 (33%), Positives = 290/535 (54%), Gaps = 49/535 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
            LPAF+G++P   + ++P+     +  W +    P++     LD T+PLF  IG  F++ 
Sbjct: 250 ALPAFAGHLPVQFRTLYPNVSFANVSVWNNFP--PQYASPLFLDPTEPLFAAIGSRFLQL 307

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            +K YG T H+Y  D F+E  P + S +Y+SS+  AIYS M   D DA+WL+QGW+F  +
Sbjct: 308 AIKTYG-TDHVYFSDPFNEIDPTLPSGKYLSSVSEAIYSTMVQVDPDAIWLLQGWMFVKN 366

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
           PFW    +++ L++VPLG+++VLDL +E  P +  +  + G P+IWCML NF G + M G
Sbjct: 367 PFWSDRAIRSFLSAVPLGRMLVLDLQSEQYPQYGRTASYAGQPFIWCMLSNFGGTLGMLG 426

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
            + ++  G  E R +   T++G G++ EGI QN  +Y+   EM +  E    + W ++Y+
Sbjct: 427 SVGNVFRGIRETRDNSTYTLLGTGITPEGINQNYALYEFALEMGWNAELDSAEQWFSEYA 486

Query: 248 VRRYGR-SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           V RYG  S    Q AWN+   TVY               AF  ++     +  GKY    
Sbjct: 487 VARYGNDSDERAQQAWNIFLRTVY---------------AFEGLE-----LMRGKYTFNR 526

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
           +P SK             P  WY      + LEL ++   E S +   +YDL+D TRQ L
Sbjct: 527 RPSSK-----------IRPWTWYDVHTFNQGLELLLSFAEEASCNQLCQYDLVDATRQCL 575

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
              A+ L+L ++++++  D       S  FL+L+ D+D LL  ++ FLLGPWLESAK  A
Sbjct: 576 QHTADALYLTLMDSFKKRDLTSFRLHSSLFLQLLSDLDVLLRTNEHFLLGPWLESAKAHA 635

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
           +   +  +YE+NAR QIT+W    Q     + DY NK W+G+++D++ PR  ++   + +
Sbjct: 636 ETTLERHKYEYNARIQITLWGPQGQ-----IVDYANKQWAGMVQDFFLPRWRVFLGELDQ 690

Query: 487 SLESG---DGFRLKD--WRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           +L +    +  +++D  +R   +   +D ++    Y  + +GD + T++ LY ++
Sbjct: 691 ALATNGTINDLKIRDKIFRTVELPFVSDSKH----YATQPSGDTVRTARTLYERW 741


>gi|195339231|ref|XP_002036223.1| GM12949 [Drosophila sechellia]
 gi|194130103|gb|EDW52146.1| GM12949 [Drosophila sechellia]
          Length = 778

 Score =  311 bits (798), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 181/533 (33%), Positives = 283/533 (53%), Gaps = 52/533 (9%)

Query: 7   AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
             LPAF+G+VP AL+ + P +   ++  W       R+CC   ++ T+ LF EI   F++
Sbjct: 259 VALPAFAGHVPRALKRLHPESTFMEVQRWNQFPD--RYCCGLFVEPTENLFKEIASRFLQ 316

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
             + +YG ++HI+ CD F+E  PPV  PEY+ S  AAIY  M+  D +A+WL+QGW+F  
Sbjct: 317 NIITKYG-SNHIFFCDPFNELEPPVAKPEYMRSTAAAIYESMRGIDPEAIWLLQGWMFVK 375

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           +PFW     +A L + P G+++VLDL +E  P +  ++ ++G P+IWCMLHNF G + M+
Sbjct: 376 NPFWTTDMAEAFLTAAPRGRILVLDLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMF 435

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G    I  G  EAR   N+++VG G++ EGI QN V+Y    E  + +  +D+  W   +
Sbjct: 436 GSAKLINSGIEEARRLPNSSLVGTGITPEGIGQNYVMYSFTLERGWSNTSLDLDGWFTNF 495

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           S  RYG     ++ AW +L ++VY+                       +    G+Y    
Sbjct: 496 SHTRYGVKDERLEQAWLLLKNSVYSFRG--------------------LQKMRGQY---- 531

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN----TYRYDLIDLT 362
                  V+    S    P  WY+ S V+ A  L + S   +   +     Y +DL+D+T
Sbjct: 532 -------VVTRRPSFNQEPFTWYNASAVLDAWHLLLTSRAIIPLEDDRYEMYEHDLVDIT 584

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
           RQ L   A++L++N+  AY+         LS + L+L +DM+ +LA    FLLG WL+ A
Sbjct: 585 RQFLQISADQLYVNLRSAYRKRQVARFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQA 644

Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           KQ A N  +++ +E+NAR QIT W  + Q     + DY  K WSGL+ +YY PR  ++ +
Sbjct: 645 KQAAPNTGEQRNFEFNARNQITAWGPDGQ-----ILDYACKQWSGLVSNYYRPRWRLFLE 699

Query: 483 YMIESLES-----GDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
            +  +L +     G  F+LK    + I+L   + N  +VYPV   G+  + SQ
Sbjct: 700 DVTVALHAGRPYNGTAFKLK--VSQEIELP--FSNKIDVYPVTPVGNTWLISQ 748


>gi|195115262|ref|XP_002002183.1| GI17241 [Drosophila mojavensis]
 gi|193912758|gb|EDW11625.1| GI17241 [Drosophila mojavensis]
          Length = 773

 Score =  307 bits (787), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 178/538 (33%), Positives = 290/538 (53%), Gaps = 50/538 (9%)

Query: 7   AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
             LPAFSG VP A++ VFP+A  TQ   W     DP +CC   ++  DPLF ++G  F+ 
Sbjct: 248 VALPAFSGYVPTAMRRVFPNASFTQSDRWNHF-PDP-YCCVLFVEPQDPLFQQVGAMFLR 305

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
           + ++ YG ++HIY  D F+E  P V  P Y+     AIY+ MQ  D+DAVWL+QGW+F  
Sbjct: 306 RVIQVYG-SNHIYFSDPFNEMMPRVREPNYVRYTAKAIYNSMQVVDADAVWLIQGWMFLK 364

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             +W    ++A L +VP G+++ LDL +E  P +  +  +YG P++WCML+NF GN+ ++
Sbjct: 365 SVYWTNDLIEAYLTAVPRGRILALDLQSEQFPQYERTHSYYGQPFVWCMLNNFGGNLGLF 424

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G    I  G + AR+  N +MVGVG++ EGI QN  ++ L  E A+  +++ ++ W   +
Sbjct: 425 GSAQLIPSGIIAARSMPNGSMVGVGITPEGIGQNYALFALTLEQAWSPDELQLEDWFEYF 484

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           ++ RYG +   +   W +L  +VY+       + R+ +               GKY    
Sbjct: 485 ALTRYGVNDTRLSQVWQLLRESVYSF------QGRERM--------------RGKY---- 520

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT----YRYDLIDLT 362
                   L    S + +P +WY+ + V  A  L + +   +  ++     Y +DL+D+T
Sbjct: 521 -------TLNKRPSLHHYPWVWYNVTMVYEAWRLMLEAKETVPLNDNRRAIYEHDLVDIT 573

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
           RQ L    +  ++N+  A +    + V  L+ + LEL  DM+ +LA  + +LLG WLE+A
Sbjct: 574 RQCLQLSFDRFYVNLKSACRHKQLNRVEYLAGKLLELFADMERILASGEHYLLGNWLEAA 633

Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           K+LA +EEQ   YE+NAR Q+T W  N Q     + DY  K WSGL+ DY+ PR  ++ +
Sbjct: 634 KRLAPSEEQRPIYEFNARNQLTSWGPNYQ-----IPDYATKQWSGLMSDYFQPRWNMFLE 688

Query: 483 YMIESLESGDGFRLKDWRREWIKLTND----WQNGRNVYPVESNGDALITSQWLYNKY 536
            +I++L++   F   ++++   ++ N+    + N    YP    G     S  +Y K+
Sbjct: 689 AVIQALKTQTPFNYSEFKQ---RVENEIELPFSNHTKAYPTSPVGSTWNISHDIYEKW 743


>gi|301107007|ref|XP_002902586.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
           infestans T30-4]
 gi|262098460|gb|EEY56512.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
           infestans T30-4]
          Length = 736

 Score =  305 bits (781), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 180/533 (33%), Positives = 279/533 (52%), Gaps = 52/533 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
            LPAF+G+VP  L+   P+A  TQ  NW +   +   CC ++++ TD L+ EIG+ F+++
Sbjct: 235 ALPAFAGHVPEELKLRLPNAHFTQSPNWGNFSEEH--CCVFMIEPTDALYREIGKNFLKE 292

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + Y  TS +Y CDT+ E  P       +     A+  GM + D +AVWLMQGW F  D
Sbjct: 293 QRELYNYTSSLYQCDTYMEMAPEFTDLTELEGAARAVIDGMTAADPNAVWLMQGWPFVDD 352

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW  P++KA L+ VP  KL++LD ++E  PIWS    ++G  +I+ +LHNF GN  M 
Sbjct: 353 PHFWTKPRVKAYLDGVPTDKLIILDFYSESVPIWSKMDNYFGKSWIYSVLHNFGGNTGMR 412

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G L ++A  PV A  + N TMVGVG++MEGI QN +VYDL  +MA+    +DV  WI QY
Sbjct: 413 GDLLTLATAPVLANWAGNGTMVGVGLTMEGIFQNYIVYDLTLQMAWVDNPLDVNTWIPQY 472

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + +RY      ++ AW+ L  +VYN T           +A+  V  S++ +       Y 
Sbjct: 473 AAQRYHTHNEHVEQAWSYLLRSVYNRT-----------LAYGGVTKSLVCLIPHWRLLYD 521

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
           +   +  ++K            Y  ++V+ A +  + + NEL   +TYR+DL+D+T+Q L
Sbjct: 522 R--FQPTLIK------------YDPNDVVLAWKELLLAENELRDVDTYRHDLVDVTKQFL 567

Query: 367 AKYANELFLNIIEAYQLNDA--HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
           +    E ++++   Y    A  + V  L++  L  +E ++ +LA ++ FLLG W+     
Sbjct: 568 SNKLLEQYIHLKGIYNAKKASPNEVCGLTKTMLTTMERLEEILATNEDFLLGNWI----- 622

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
                         AR Q+T W DN  E    + DY  K W+GL++ YY PR  ++   +
Sbjct: 623 --------------ARNQVTRWGDNNNEA---IHDYAGKEWAGLVKGYYIPRWTMWLSEV 665

Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
             +         K  + + I     WQ G   YP  + GDA   S+  YN+Y+
Sbjct: 666 CNAYTDKREMNEKALKEKRIAFELKWQLGHESYPTTTVGDAFTISKRFYNEYI 718


>gi|195398029|ref|XP_002057627.1| GJ18000 [Drosophila virilis]
 gi|194141281|gb|EDW57700.1| GJ18000 [Drosophila virilis]
          Length = 766

 Score =  305 bits (780), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 173/538 (32%), Positives = 291/538 (54%), Gaps = 50/538 (9%)

Query: 7   AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
             LPAF+G+VP A++ VFP+A  T    W +     ++CC   ++  DPLF ++G  F+ 
Sbjct: 250 VALPAFAGHVPTAMRRVFPNANYTPAERWNNFPD--QYCCDLFVEPHDPLFQQLGAMFLR 307

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
           + ++ YG ++HIY  D F+E  PP+  P Y+ S   AIY+ M+  D +AVWL+QGW+F  
Sbjct: 308 RVIQVYG-SNHIYFSDPFNEMQPPLAEPGYMRSTAKAIYNSMREVDGNAVWLLQGWMFLK 366

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           D FW    ++A L +VP G+++VLDL +E  P +  +  +YG P++WCML+NF G + ++
Sbjct: 367 DIFWTDELIEAFLTAVPRGRILVLDLQSEQFPQYQRTHSYYGQPFVWCMLNNFGGTLGLF 426

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G    I  G   AR   N+++VGVG++ EGI QN  ++ L  E  +   ++ +  W + +
Sbjct: 427 GSAQFIGSGIASARIMPNSSLVGVGITPEGIGQNYAIFALTLEQGWSASELQLGDWFDHF 486

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           ++ RYG +   +  AW +L   VY+                       +    GKY    
Sbjct: 487 ALTRYGVNDTRLAQAWQLLRGGVYSFHG--------------------LQRMRGKY---- 522

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN----TYRYDLIDLT 362
                   L        +P  WY+ S V  A +L +AS   +  ++     Y +DL+D+T
Sbjct: 523 -------ALNRRPGLNLNPWTWYNGSSVTDAWQLLLASREMVPLTDDRYAIYEHDLVDIT 575

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
           RQ L +  +++++N+  AY+    + +  L+ + LEL++DM+ +LA    +LLG WLE+A
Sbjct: 576 RQFLQQSFDQIYVNLRSAYRKEQLNRLEYLAGKLLELLDDMERILASGVHYLLGTWLEAA 635

Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           K+LA +++    YE+NAR Q+T W  N Q     + DY  K WSGL+ DYY PR A++  
Sbjct: 636 KKLAPSDKLRPLYEFNARNQLTSWGPNGQ-----ILDYATKQWSGLMCDYYQPRWAMFLD 690

Query: 483 YMIESLESGDGFRLKDWRREWIKLTND----WQNGRNVYPVESNGDALITSQWLYNKY 536
            +  ++++   F   D+++   ++ N+    + N   +YP +  G+  + S  +Y K+
Sbjct: 691 AVTRAMQTHRPFNATDFKQ---RVANEIELPFSNLTKMYPTKPMGNTWLISNDIYIKW 745


>gi|195050088|ref|XP_001992825.1| GH13491 [Drosophila grimshawi]
 gi|193899884|gb|EDV98750.1| GH13491 [Drosophila grimshawi]
          Length = 771

 Score =  303 bits (777), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 179/536 (33%), Positives = 295/536 (55%), Gaps = 50/536 (9%)

Query: 9   LPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQ 68
           LPAF+G+VP  L  +FP+A  T +  W     DP +CC   ++ +DPLF  +G  F+ + 
Sbjct: 251 LPAFAGHVPTGLPRIFPTANFTSVERWNQFP-DP-YCCALFIEPSDPLFQLVGAQFLRRV 308

Query: 69  LKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP 128
           ++ YG ++HIY  D F+E  P +  P YISS   AIY+ M+  D D VWL+QGW+F  + 
Sbjct: 309 IQIYG-SNHIYFSDPFNEMQPRIAEPGYISSTARAIYNSMRMVDKDPVWLLQGWMFLDNA 367

Query: 129 FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI 188
           +W    ++A L +VP G+++VLDL +E  P +  +  +YG P++WCML+NF G + M+G 
Sbjct: 368 YWSDELIEAFLTAVPRGRMLVLDLQSEQFPQYQRTFSYYGQPFVWCMLNNFGGTLGMFGS 427

Query: 189 LDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSV 248
              I  G + AR+  N++MVGVG++ EGI QN  ++ L  E  +   K+++  W +Q+++
Sbjct: 428 AHLINAGIMAARSMPNSSMVGVGITPEGIGQNYALFALTLEQGWSGSKLELSDWFDQFTL 487

Query: 249 RRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKP 308
            RYG +   +  AW +L  +VY+                       +    GKY      
Sbjct: 488 TRYGVNDTDLILAWQLLRGSVYHFHG--------------------LQRMRGKY------ 521

Query: 309 VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN----TYRYDLIDLTRQ 364
                 L    S    P +WY+ S V+ A +L +A+   +   +     Y++DL+D+TRQ
Sbjct: 522 -----ALNKRPSFNLKPWIWYNASSVVEAWQLLLAANQTIPVEDDRYALYKHDLVDITRQ 576

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            L +  +++++N+  AY+ +       L+ + LEL+ DM+ +LA  + +LLG WLE+AK+
Sbjct: 577 FLQQSFDQVYVNLKSAYRKSQLARFEYLAAKLLELLADMERILASGEHYLLGNWLEAAKE 636

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
           LA + +Q   YE+NAR Q+T W  + Q     + DY  K WSGL++DYY PR +++   +
Sbjct: 637 LAPSADQRHIYEFNARNQLTAWGPSNQ-----ILDYATKQWSGLMQDYYTPRWSMFLDAV 691

Query: 485 IESLESGDGFRLKDWRREWIKLTND----WQNGRNVYPVESNGDALITSQWLYNKY 536
             ++ S   F    +R+   ++ N+    + N   VYP E  G   + SQ +++K+
Sbjct: 692 TLAMHSKRPFNATAFRQ---RVANEIELPFSNLTKVYPTEPVGSTWLISQEIHDKW 744


>gi|195473052|ref|XP_002088810.1| GE10991 [Drosophila yakuba]
 gi|194174911|gb|EDW88522.1| GE10991 [Drosophila yakuba]
          Length = 778

 Score =  299 bits (766), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 177/533 (33%), Positives = 281/533 (52%), Gaps = 52/533 (9%)

Query: 7   AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
             LPAF+G+VP AL+ + P +   ++  W       ++CC   ++  + LF EI   F++
Sbjct: 259 VALPAFAGHVPRALKRLNPDSTFMEVQRWNQFPD--QYCCGLFVEPKENLFNEIALNFLQ 316

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
           + +  YG ++HI+ CD F+E  PPV  PEY+ S  AAIY  M+  D  A+WL+QGW+F  
Sbjct: 317 KIITIYG-SNHIFFCDPFNELEPPVAKPEYMRSTSAAIYESMRRIDPQAIWLLQGWMFVK 375

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           +PFW     +A L + P G+++VLDL +E  P +  ++ ++G P+IWCMLHNF G + M+
Sbjct: 376 NPFWTTDMAEAFLTAAPRGRILVLDLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMF 435

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G    I  G  EAR   N+++VG G++ EGI QN V+Y    E  + ++ +D+ +W   +
Sbjct: 436 GSAKLINSGIEEARRLPNSSLVGTGITPEGIGQNYVMYSFTLERGWSNKPLDLDSWFTNF 495

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           S  RYG     ++ AW  L ++VY+                       +    G+Y    
Sbjct: 496 SHTRYGVKDERLEQAWLQLKNSVYSFRG--------------------LQKMRGQY---- 531

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN----TYRYDLIDLT 362
                  V+    S    P  WY  S V+ A  L ++S   +   +     Y +DL+D+T
Sbjct: 532 -------VVTRRPSFNQEPFTWYDASAVLDAWHLLLSSRAIIPLEDDRYEMYEHDLVDIT 584

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
           RQ L   A++L++N+  A++         LS + L+L +DM+ +LA    FLLG WL+ A
Sbjct: 585 RQFLQISADQLYVNLRSAFRKRQVTRFEYLSTKLLKLFDDMELILASSRNFLLGNWLQQA 644

Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           K+ A +  ++  +E+NAR QIT W  + Q     + DY  K WSGL+ DYY PR  ++ +
Sbjct: 645 KRAAPSPGEQTNFEFNARNQITAWGPDGQ-----ILDYACKQWSGLVSDYYRPRWRLFLE 699

Query: 483 YMIESLES-----GDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
            +  +L S     G  F+LK    + I+L   + +  +VYPV   G+  + SQ
Sbjct: 700 DVTVALHSRRPFNGTAFKLK--VSQEIELP--FSHKVDVYPVTPVGNTWLISQ 748


>gi|330791218|ref|XP_003283691.1| hypothetical protein DICPUDRAFT_26247 [Dictyostelium purpureum]
 gi|325086434|gb|EGC39824.1| hypothetical protein DICPUDRAFT_26247 [Dictyostelium purpureum]
          Length = 712

 Score =  298 bits (764), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 176/538 (32%), Positives = 285/538 (52%), Gaps = 52/538 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP F+G++P A+Q +FP+A ++ L  W           T+ LD +DPLF +I + FI +
Sbjct: 207 VLPGFAGHIPGAIQTLFPTANVSILSTWCEFNG------TFYLDPSDPLFGKITQLFITE 260

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW-LFSY 126
            +  +G T H YN D F+E  PP     ++      +Y+ M + D  AVW++QGW +  Y
Sbjct: 261 LIGVFG-TDHYYNFDPFNELAPPSSDLGFLKQTSQQMYNNMLAADPKAVWVLQGWFIVDY 319

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW+  Q +A  + VP+G  +VLDL+++V P W+ ++ FYG  ++WCMLHNF G   MY
Sbjct: 320 PEFWQANQTQAWFSGVPIGGFIVLDLWSDVAPAWNITEYFYGHYWLWCMLHNFGGRSGMY 379

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +  IA  P+ AR S +  M+G G++ E IEQN VVYDLMSEMA++    D++ WI QY
Sbjct: 380 GRIPFIATNPIIAR-SLSDNMMGTGLTPEAIEQNVVVYDLMSEMAWRSTAPDLEEWITQY 438

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + RRYG+ +P + + W  +  TV+N T     +N     +F  + PSI         N+G
Sbjct: 439 TNRRYGKIMPEVVEVWMSMVDTVFNATAYWARRNMGAPESFIALRPSI---------NFG 489

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELF-IASGNELSASNTYRYDLIDLTRQA 365
                              +++Y  S +  A  +F + + + + ++ T+++D+ ++T QA
Sbjct: 490 D------------------NVFYDPSVMFNAWHVFSLVNDSYVISTETFQFDISEITMQA 531

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
           L+ +  + + N+I++Y ++D     + S   +E +  MD + +      LG W   A+  
Sbjct: 532 LSNFFMDTYFNLIKSYNVSDIESFQRESITMMETISFMDLIASTQPELQLGVWTYRARLW 591

Query: 426 A--QNEEQEKQ---------YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 474
           A   NE    Q         YE+NAR Q+T+W        S+L DY  K W GL+ D+YG
Sbjct: 592 AYPDNETPSLQNSSNSATLPYEFNARNQLTLW----GPSDSVLHDYAFKLWGGLISDFYG 647

Query: 475 PRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWL 532
           PR  ++ K +++SLE+   F   ++      L   W     +YP+   G    TS+++
Sbjct: 648 PRWNLFLKTLLQSLENRIPFDANNFISNVQALEQQWVLESTIYPILPFGQGYNTSRYI 705


>gi|440799253|gb|ELR20308.1| alpha-N-acetylglucosaminidase family protein [Acanthamoeba
           castellanii str. Neff]
          Length = 854

 Score =  295 bits (754), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 175/549 (31%), Positives = 279/549 (50%), Gaps = 67/549 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LP F+G VP  ++ ++P+A +T+  +W       ++   Y L   D L+  IGR  I +
Sbjct: 277 ILPGFAGFVPDGIKRIYPTANLTKSADWAGFPH--QYTNVYFLSPLDSLYKTIGRMVIRR 334

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
              E+G T HIYN DTF+E +PP   P Y+++   A+Y GM + D  A+W+MQGW F +D
Sbjct: 335 VTAEFG-TDHIYNADTFNEMSPPSADPTYLAAASRAVYEGMAAEDPQALWVMQGWSFVFD 393

Query: 128 PFWR-PPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
            FW    ++++ L+ V    +++LDL ++  P WS +  ++G  ++WCMLHN  G   +Y
Sbjct: 394 KFWEDKSRVRSYLSGVSDKDMLILDLASDNNPEWSKTDSYFGKEFVWCMLHNGGGVRGLY 453

Query: 187 GILDSIAFGPVEARTSENTTM--------VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVD 238
           G L   +  P+ A  +   TM        VGVGM+ME IEQNPVVY+LMSEM ++ E  D
Sbjct: 454 GNLTQYSSDPLLALATPGNTMLICGTCEQVGVGMTMEAIEQNPVVYELMSEMGWRSEAFD 513

Query: 239 VKAWINQYSVRRYGRS--VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 296
           +  W+ +Y+ RRYG +  + ++ +AW +L    Y         N+ VI      D     
Sbjct: 514 IVEWVQRYAERRYGLAAGLSSVGEAWELLREATY---------NQSVI------DYGWFG 558

Query: 297 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASG--NELSASNTY 354
            T G    YG                       + ++ + AL LF+ S      + +  +
Sbjct: 559 FTPGLGMGYGGVA--------------------NAAKEVEALRLFLQSALTKGYAPNGPW 598

Query: 355 RYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF------QLSRRFLELVEDMDGLLA 408
           +YD +DLTRQ LA    +++     AY    AH  +       L    L L+ D+D +LA
Sbjct: 599 QYDCVDLTRQVLANTFRDIYAQFDAAYSAYAAHKTYTVDQLKSLGSALLTLIGDIDEILA 658

Query: 409 CHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGL 468
            +  +LLG W++SA   A   +Q   Y++NAR QIT+W  + Q     + DY  K+W+ L
Sbjct: 659 TNPNYLLGTWIQSALSWADTPDQALHYQFNARNQITLWGPDGQ-----ITDYATKHWADL 713

Query: 469 LRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALIT 528
           +R YY PR  ++   +++++ +G     +++R E ++L   W      Y     G+ L  
Sbjct: 714 VRSYYQPRWTLFITSVLQAVYAG-----REYRGELLQLEQKWNRENTTYATTPTGNTLQV 768

Query: 529 SQWLYNKYL 537
           +  L  KYL
Sbjct: 769 AYKLAAKYL 777


>gi|195454475|ref|XP_002074254.1| GK18384 [Drosophila willistoni]
 gi|194170339|gb|EDW85240.1| GK18384 [Drosophila willistoni]
          Length = 743

 Score =  291 bits (744), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 168/503 (33%), Positives = 268/503 (53%), Gaps = 43/503 (8%)

Query: 7   AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
             LPAF+G+VP AL+ +FP A  T+   W    +   +CC   ++  +PLF ++   F+ 
Sbjct: 267 VALPAFAGHVPRALRRIFPQANFTETERWNRFPNA--YCCDLFVEPQEPLFRQLATTFLR 324

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
           +  + YG ++HI+ CD F+E  PPV   +++ S  AAIY+ M+  D  A+WL+QGW+F  
Sbjct: 325 RVTQRYG-SNHIFFCDPFNELEPPVSQADFMRSTAAAIYASMREVDPKAIWLLQGWMFVK 383

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           + FW    ++A L +VP G L+VLDL +E  P +  +K +YG P++WCMLHNF G + M 
Sbjct: 384 NIFWTDELIEAFLTAVPQGNLLVLDLQSEQFPQYQRTKSYYGQPFVWCMLHNFGGTLGML 443

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G ++ +  G   AR   N++MVG G++ EGI QN V+Y    E  +   K+D   W   +
Sbjct: 444 GSVELVNSGMDLARQMPNSSMVGAGITPEGIGQNYVMYSFALERGWSDRKLDSAGWFTHF 503

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           ++ RYG     +  AW +L  +VY  T     K R                  GKY    
Sbjct: 504 ALTRYGVQDERLNQAWQLLRTSVY--TFHGLQKMR------------------GKYTITR 543

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT----YRYDLIDLT 362
           +P    +           P  WY+ + V+ A +L +++ + +   +     Y++DL+D+T
Sbjct: 544 RPAINLS-----------PFTWYNVTHVLEAWQLMLSARSIIPLDDNRYDIYQHDLVDIT 592

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
           RQ L   A++L++N+  +Y+         L  + LEL++D++ +L     FLLG WLE+A
Sbjct: 593 RQYLQITADQLYVNLNSSYRKRQLARFVYLGNKLLELLDDLERILGSGSNFLLGTWLEAA 652

Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           K LA   E +  +E+NAR QIT W  N +     + DY  K WSG++ DYY PR A +  
Sbjct: 653 KLLAPTVEDQSNFEFNARNQITTWGPNGE-----ILDYACKQWSGMISDYYRPRWARFLD 707

Query: 483 YMIESLESGDGFRLKDWRREWIK 505
            +  +L+S   F    +++  +K
Sbjct: 708 DVTLALQSNQPFNASAYKQHVLK 730


>gi|66801665|ref|XP_629757.1| hypothetical protein DDB_G0291998 [Dictyostelium discoideum AX4]
 gi|60463162|gb|EAL61355.1| hypothetical protein DDB_G0291998 [Dictyostelium discoideum AX4]
          Length = 798

 Score =  289 bits (740), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 173/537 (32%), Positives = 282/537 (52%), Gaps = 50/537 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP F+G++P A+Q +FP A I+ L  W +         T+ L++TDPLF +I   FI +
Sbjct: 292 VLPGFAGHIPGAIQQLFPQANISVLSTWCNFNG------TFYLESTDPLFAKITTMFIGE 345

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            +  +G T H YN D F+E  PP +  +Y+     ++Y  +   D  AVW++QGW     
Sbjct: 346 LIDVFG-TDHFYNFDPFNELEPPSNDTDYLRQTSQSMYENVLLADPKAVWVLQGWFIVDA 404

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW+  Q +A  + VP+G ++VLDL+++V P W+T+  +YG  ++WCMLHNF G   MY
Sbjct: 405 PEFWQAKQTEAWFSGVPIGGVLVLDLWSDVIPGWTTTNYYYGHYWVWCMLHNFGGRSGMY 464

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G L  I+  P+ AR   +  MVG+G++ E IEQN VVYD+MSEM+++  + ++  W+ QY
Sbjct: 465 GRLPWISSNPITAR-GLSPNMVGIGLTPEAIEQNVVVYDMMSEMSWRSVQPNLTEWVTQY 523

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + RRYG+ VP I D W  L +TV+N T      N     +F  + P +          +G
Sbjct: 524 THRRYGKLVPEIVDVWISLVNTVFNATAATARANMGAPESFIALRPQL---------TFG 574

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
                        +S+ +P++ Y+   V   ++        + ++ T+ +D+ + T Q+L
Sbjct: 575 N------------NSFYNPNILYNAWNVFSMVD-----DEYVISTETFEFDISEFTMQSL 617

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
           + Y  + +  +IEA+  +D   +  +S   L+++  MD + +      LG W   A+  A
Sbjct: 618 SNYFMDQYFLLIEAFNASDVQTLSTISIELLDIINYMDEIASTQSSLQLGLWTYRARLWA 677

Query: 427 ---------QNEEQEKQ--YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
                    QN        YE+NAR  +T+W        S+L DY  K WSGL+ D+Y P
Sbjct: 678 YPTNDIPTLQNSSNSNTAPYEFNARNVLTLW----GPSNSVLHDYAFKLWSGLVSDFYSP 733

Query: 476 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWL 532
           R  ++ K +++S+E+   F  + + R    L   W   + +YP    G A  TS+++
Sbjct: 734 RWQLFLKSLVQSVENRKPFNKESFNRMVENLEEQWVVQQTIYPTVPVGQAYNTSKYI 790


>gi|198433857|ref|XP_002122480.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 880

 Score =  287 bits (734), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 167/423 (39%), Positives = 236/423 (55%), Gaps = 37/423 (8%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    VLP F+G++P+A+ N++P A + QL +W        + CTYLL   DPLF  IG 
Sbjct: 243 LGMIPVLPGFAGHIPSAILNLYPKADVIQLSHWSHFNCT--YSCTYLLQPHDPLFNTIGS 300

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            FI++Q+ EY  T+HIYN DTF+E TPP   P Y+S+   A+Y  M   D DAVWLMQGW
Sbjct: 301 MFIKEQMLEYNGTNHIYNADTFNEMTPPSSDPGYLSNASRAVYDAMAVADPDAVWLMQGW 360

Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           LF ++P FW+  Q KALL  VP GK++VLDLF+E  P +     ++G P++WCMLH+F G
Sbjct: 361 LFHHEPTFWKTAQKKALLTGVPKGKMLVLDLFSESYPQY-LPDWYFGQPFLWCMLHDFGG 419

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
           N+  YG ++++   P  A TS N+TMVG G++ EGI QN ++YD M E  F    V+V  
Sbjct: 420 NMGFYGKINTVNTQPGIALTSVNSTMVGTGVTPEGINQNYMIYDFMLETGFTVHSVNVTN 479

Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           W+ +Y++RRY  S P     WN+L +T+YN          D    FP    S+I      
Sbjct: 480 WLKEYTMRRYNTSSPEAIKTWNILGNTIYN----------DTKPGFP--SKSLIR----- 522

Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHL--WYSTSEVIRALELFIASGNELSASNTYRYDLI 359
               G PV +  +        D+P L  WY  S +  A + F  S N L    T RYD +
Sbjct: 523 ----GSPVKRPTL--------DNPGLPYWYQYSSLALAWDNFSQSLNTLKDLETVRYDAV 570

Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
           D+TRQ L      L+  ++E +      G  +L  + L+L++D D +L     F +G W+
Sbjct: 571 DITRQMLQAVHRLLYYAMVEEFLWKRDPG--KLGEQLLDLLDDFDKMLCSDAHFSMGKWI 628

Query: 420 ESA 422
           + A
Sbjct: 629 QDA 631



 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 58/197 (29%), Positives = 103/197 (52%), Gaps = 8/197 (4%)

Query: 341 FIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELV 400
           F  S N L    T RYD +D+TRQ L      L+  ++E +      G  +L  + L+L+
Sbjct: 684 FSQSLNTLKDLETVRYDAVDITRQMLQAVHRLLYYAMVEEFLWKRDPG--KLGEQLLDLL 741

Query: 401 EDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDY 460
           +D D +L     F +G W++ AK L    E++  YE+NAR Q+T+W  N +     + DY
Sbjct: 742 DDFDKMLCSDAHFSMGKWIQDAKILGTTAEEKDLYEYNARIQVTLWGPNGE-----ILDY 796

Query: 461 GNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE-WIKLTNDWQNGRNVYPV 519
            +K+W  L++ YY PR A++  Y+  +  +   F  K +  + +  +   +   R+V+P 
Sbjct: 797 ASKHWCSLVKHYYRPRWALFVSYLNHAYATKSKFDHKAFASDVFTNVEEPFTKDRSVFPS 856

Query: 520 ESNGDALITSQWLYNKY 536
            + G+A+  ++ +Y K+
Sbjct: 857 TATGNAIELAKDMYIKW 873


>gi|281200618|gb|EFA74836.1| alpha-N-acetylglucosaminidase [Polysphondylium pallidum PN500]
          Length = 469

 Score =  278 bits (712), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 169/484 (34%), Positives = 259/484 (53%), Gaps = 57/484 (11%)

Query: 7   AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
           AVLP F+G+VP AL++ +P+A ITQL +W            Y+  + +         F+ 
Sbjct: 37  AVLPGFAGHVPEALKSHYPNANITQLSSWN--------MTVYIHQSPN--------TFMS 80

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
            Q   YG T H YN D F+E  PP   P Y+ +   ++++ + + D   +W++QGWLF Y
Sbjct: 81  IQQDLYG-TDHFYNFDPFNELEPPSSDPAYLKNCSQSMFNNLIAVDPQGIWVLQGWLFVY 139

Query: 127 DP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
           D  FW+PPQ++A L+ VP+GK++VLDL+A+V   W  +  FYG  +IWCMLHNF G   M
Sbjct: 140 DTEFWQPPQIEAFLSGVPIGKMIVLDLWADVDAGWKITNYFYGHNWIWCMLHNFGGRSGM 199

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG +  I+  P+EAR S +  MVG G++ E IEQN +VYDLMSEMA++    D+K W++Q
Sbjct: 200 YGKIPFISTNPIEAR-SLSPNMVGTGLTPEAIEQNVIVYDLMSEMAWRSTPPDLKEWVDQ 258

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           Y  RRYG+ +  + D W  L  TV+NC+                       VT+G     
Sbjct: 259 YVTRRYGKYIEVLADTWYELVGTVFNCS----------------------IVTKGPVTIL 296

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGN-ELSASNTYRYDLIDLTRQ 364
              VS    L   TS Y  P +      + +A   F++  +  +  ++T+ +DL ++T Q
Sbjct: 297 ---VSVRPQLNFTTSLYYDPIV------ISKAWSAFLSIDDLHVVNTSTFSFDLTEITTQ 347

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
           AL+       L +  A+ LND++  F  LS   L +++D++ +++  +  L+G W   A+
Sbjct: 348 ALSNLFMTTELQMNAAF-LNDSYEEFSLLSDALLSIIQDINTIVSTQEMLLVGNWTARAR 406

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
            L    E  + YE NAR QIT+W        S   DY  K W GL  D+Y  R  ++ + 
Sbjct: 407 ALTPANETTELYEMNARNQITLW----GPPDSFDHDYAYKLWGGLTEDFYLARWTLFSQS 462

Query: 484 MIES 487
           + ++
Sbjct: 463 IFKT 466


>gi|288927792|ref|ZP_06421639.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
           (NAG) [Prevotella sp. oral taxon 317 str. F0108]
 gi|288330626|gb|EFC69210.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
           (NAG) [Prevotella sp. oral taxon 317 str. F0108]
          Length = 734

 Score =  278 bits (711), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 174/541 (32%), Positives = 275/541 (50%), Gaps = 61/541 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP  L+  +P AKIT L  W   +   R  C + LD  D LF +I + +I++
Sbjct: 235 VLPAFAGHVPEILKEKYPKAKITPLSIWGDFEDQYR--C-HFLDPFDSLFTDIQKTYIDE 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K YG T HIY  D F+E  PP   PEY+++  A IY  +++ DS AVWL   W+FSY 
Sbjct: 292 QTKLYG-TDHIYGVDPFNELAPPSWEPEYLANASAKIYDVLKNADSKAVWLQMTWMFSYQ 350

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W   ++K+ + +VP  K ++LD +AE   +W  S+ +Y  P+IWC L NF GN  + 
Sbjct: 351 RKDWTDERIKSYITAVPDKKQILLDYYAERTEVWKFSESYYKQPFIWCYLGNFGGNTMIA 410

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +  +     EA  +   +MVGVG ++EG + NP++YD + E  +  + + +  W  Q+
Sbjct: 411 GNIAEVDRRLNEAFANAE-SMVGVGSTLEGFDVNPIMYDFVFEKVWHKDGISLHDWTVQW 469

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYN----CTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
           + RR G +    + AW +L   +Y     CT+G     R          PS+        
Sbjct: 470 AQRRVGTTDENAEKAWKLLIDKIYVQYSLCTEGTLTNAR----------PSLTGHGNWTT 519

Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 362
           +N+ K                     Y+  +++ A  L + S  +      Y+YD++++ 
Sbjct: 520 KNWTK---------------------YNNRDLLEAWGLLLRS--KAITKIAYKYDIVNIG 556

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
           RQ L  Y   L     +AY+  D   +       L L+ D++ LL     FLLGPWL +A
Sbjct: 557 RQVLGNYFTVLRDEFTQAYERKDISALTIKGNEMLSLLNDLEALLYTSPSFLLGPWLTNA 616

Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           + + +N E+ + YE NAR  IT W  +TQ  A  L DYGN+ W+GLL+ YY PR  ++ +
Sbjct: 617 QNMGRNMEESRYYEKNARNIITNW--STQGVA--LNDYGNRTWAGLLQGYYTPRWKMFIE 672

Query: 483 YMIESLESGDGF-------RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 535
            +I +++    F       ++ D   +WI  T +       YP+++ GD+ + +   Y+K
Sbjct: 673 EVISAVKQNKEFNNETFFKKVTDEEWQWISKTEN-------YPIQATGDSYLLANKFYHK 725

Query: 536 Y 536
           Y
Sbjct: 726 Y 726


>gi|340617022|ref|YP_004735475.1| alpha-N-acetylglucosaminidase [Zobellia galactanivorans]
 gi|339731819|emb|CAZ95084.1| Alpha-N-acetylglucosaminidase, family GH89 [Zobellia
           galactanivorans]
          Length = 747

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 167/514 (32%), Positives = 271/514 (52%), Gaps = 27/514 (5%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VL AF+G+VP  L+ ++P A I Q+ +W  V+       TY LD TD LF +IG AFI++
Sbjct: 234 VLQAFTGHVPQVLKKLYPEANIFQIEDWAGVEG------TYFLDPTDELFQKIGTAFIKK 287

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + YG T H+Y+ D F E  PP   P ++  +  ++Y  M+  DS A W++QGW F + 
Sbjct: 288 QTELYG-TDHLYDADCFIEVDPPSKDPAFLKQVSESVYKSMELADSKATWVLQGWFFFFK 346

Query: 128 PFWRPPQM-KALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             +   +  +A L+ +P  + +VLDL+ E  P W  +  FYG P+IW ++ N    + M 
Sbjct: 347 KDFWTKERGRAFLDGIPKNRAIVLDLYGEKNPTWDKTDAFYGQPWIWNVICNEDQKVNMS 406

Query: 187 GILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           G L+ +     EA TSE    + G+G+  EG+  NP+V D + E A+  +KV+V+ WI  
Sbjct: 407 GDLEEMQRQFQEAYTSEIGNNLKGIGVIPEGLGYNPIVQDFIFEKAWDPQKVNVQEWIED 466

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           Y+  RYG   P+++ AW +L  +VY    G T      ++      P ++   EG  ++ 
Sbjct: 467 YATIRYGTKSPSVKKAWQLLGESVY----GRTRTMWSPLI----TTPRLMIFEEGSKEDI 518

Query: 306 GKPVSKEAVLKSETSSYDHPHLW-YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
            + V K+  + +ET     P  W +   ++ +A  L +   NEL    TY +DL ++ R+
Sbjct: 519 -RHVRKDFKI-TETD----PFAWDFDVYKLAKAAGLLLGEANELQDVETYNFDLTNVYRE 572

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            L    ++   ++  AYQ  D   + + ++   +L++D++ +   ++ FLLG WLE AK 
Sbjct: 573 LLFSLTHKSINDVSVAYQEKDRQALDRSAKSLFKLMDDLEAITGANENFLLGKWLEDAKS 632

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
                E+++ YEWNART +T+W    +     LRDY  K W+GL   YY PR  ++  ++
Sbjct: 633 WGSTPEEKEYYEWNARTIVTIWQPYPE---GGLRDYAGKQWNGLFSGYYKPRWQLFVDHL 689

Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYP 518
             SL  G  F  K +  E  ++   W     +YP
Sbjct: 690 RRSLTEGVDFDPKAYDAEVREMDYKWTRSHQIYP 723


>gi|440792549|gb|ELR13759.1| peptidase, S8/S53 subfamily protein [Acanthamoeba castellanii str.
           Neff]
          Length = 981

 Score =  275 bits (704), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 168/531 (31%), Positives = 263/531 (49%), Gaps = 71/531 (13%)

Query: 20  LQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIY 79
           ++ ++P+A +T+  +W       ++   Y L   D L+  IG   I    +E+G T HIY
Sbjct: 434 IKRIYPTANLTKSADWAGFPH--QYTNVYFLSPLDSLYKTIGSKVIRLVEEEFG-TDHIY 490

Query: 80  NCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALL 139
           N DTF+E +PP   P Y+++   A+Y GM + D  A+W+MQGW F +DPFW   ++KA L
Sbjct: 491 NADTFNEMSPPSADPTYLAAASRAVYEGMATQDPQALWVMQGWSFVFDPFWTKDRIKAYL 550

Query: 140 NSVPLGKLVVLDLFAEVKPIWSTSKQF----YGVPYIWCMLHNFAGNIEMYGILDSIAFG 195
           + V    +++LDL ++  P W+ + QF    +G  ++WCMLHN  G   +YG L   +  
Sbjct: 551 SGVDNSDMLILDLASDNSPEWNKTGQFRDSYFGKEFVWCMLHNGGGVRGLYGNLTQYSSD 610

Query: 196 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 255
           P+ A  +   TMVGVGM+ME IEQNPVVY+LMSEM ++ E  D+  W+ +Y+ RRYG + 
Sbjct: 611 PLIALATPGNTMVGVGMTMEAIEQNPVVYELMSEMGWRSEAFDIVEWVQRYAERRYGLAT 670

Query: 256 PA--IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 313
            +  + +AW +L    YN +                +D  +          +G   +   
Sbjct: 671 GSSPVGEAWELLREATYNQS---------------GLDAGLFGFAPALGMGHGGTSN--- 712

Query: 314 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNE--LSASNTYRYDLIDLTRQALAKYAN 371
                           +T EV  AL LF+ S      + +  ++YD +DLTRQ LA   N
Sbjct: 713 ----------------ATKEV-EALRLFLQSAQTEGYAPNGPWQYDCVDLTRQVLANTFN 755

Query: 372 ELFLNIIEAY-----QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
           +++  +  AY       +D      L+   L ++ D+D LLA +  +LLG W++ A   A
Sbjct: 756 DVYSQLDAAYTSYATNKSDTLPFLPLAAELLGIISDLDRLLATNPNYLLGTWIKDAVSWA 815

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
              EQ   Y++NAR QIT+W  + Q     + DY  K+W+GLL               ++
Sbjct: 816 SIPEQALHYQFNARNQITLWGPDGQ-----ISDYATKHWAGLL---------------MK 855

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
           ++ +G  F    +  E ++L   W      YP    GD L  +  +  KYL
Sbjct: 856 AVGAGVMFNSTAYGTELLQLEQKWNQENTTYPTTPTGDTLQVALRISQKYL 906


>gi|297273081|ref|XP_001095618.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Macaca mulatta]
          Length = 691

 Score =  275 bits (702), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 157/402 (39%), Positives = 226/402 (56%), Gaps = 50/402 (12%)

Query: 89  PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 147
           PP  +P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L
Sbjct: 286 PPSSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRL 345

Query: 148 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 207
           +VLDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM
Sbjct: 346 LVLDLFAESQPVYTLTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTM 405

Query: 208 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLY 266
           VG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  ++ +RYG S P    AW +L 
Sbjct: 406 VGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLL 465

Query: 267 HTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 325
            +VYNC+ +     NR  +V  P                          L+  TS     
Sbjct: 466 RSVYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS----- 495

Query: 326 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY---- 381
            +WY+ S V  A  L + S   L+AS  +RYDL+DLTRQA+ +  +  +     AY    
Sbjct: 496 -VWYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKE 554

Query: 382 --QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 439
              L  A GV        EL+  +D LLA    FLLG WLE A+  A +E +   YE N+
Sbjct: 555 LTSLLRAGGVLAY-----ELLPALDELLASDSRFLLGSWLEQARAAAVSEAEADFYEQNS 609

Query: 440 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
           R Q+T+W      E ++L DY NK  +GL+ +YY PR  ++ 
Sbjct: 610 RYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFL 646


>gi|156121099|ref|NP_001095696.1| alpha-N-acetylglucosaminidase precursor [Bos taurus]
 gi|151554244|gb|AAI48148.1| NAGLU protein [Bos taurus]
 gi|296476361|tpg|DAA18476.1| TPA: alpha-N-acetylglucosaminidase [Bos taurus]
          Length = 667

 Score =  273 bits (697), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 145/346 (41%), Positives = 205/346 (59%), Gaps = 35/346 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP AL  VFP   +TQ+GNW        + C++LL   DPLF  +G  F+ +
Sbjct: 241 VLPAFAGHVPKALTRVFPQVNVTQMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRE 298

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY  DTF+E  PP   P Y+++  AA+Y  M + D DAVWL+QGWLF + 
Sbjct: 299 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATAAVYQAMTAVDPDAVWLLQGWLFQHQ 357

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW P Q+ A+L +VP G+L+VLDLFAE +P++  +  F G P+IWCMLHNF GN  ++
Sbjct: 358 PEFWGPAQVAAVLGAVPRGRLLVLDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLF 417

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
           G L+S+  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+ +Q + V D+ AW+  
Sbjct: 418 GALESVNQGPTTARHFPNSTMVGTGMAPEGIGQNEVVYALMAELGWQKDPVADLGAWVTS 477

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           ++ RRYG S    + AW +L  +VYNC                       S  E +  N+
Sbjct: 478 FAARRYGVSHGDAEAAWRLLLRSVYNC-----------------------SGEECRGHNH 514

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS 351
             P+ +   L+  T+      +WY+ S+V  A  L + + + L++S
Sbjct: 515 -SPLVRRPSLQMVTT------VWYNRSDVFEAWRLLLTATSTLASS 553



 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 60/115 (52%), Gaps = 5/115 (4%)

Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
           A   A +E +   YE N+R Q+T+W      E ++L DY NK  +GL+ DYY PR  ++ 
Sbjct: 551 ASSPAVSETEAHFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRLFT 605

Query: 482 KYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           + ++ESL  G  F+   + R   +L   +  G   YP +  GD +   + L+ KY
Sbjct: 606 ETLVESLVQGVPFQQHQFDRNAFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKY 660


>gi|392588150|gb|EIW77482.1| glycoside hydrolase family 89 protein [Coniophora puteana
           RWD-64-598 SS2]
          Length = 761

 Score =  272 bits (696), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 162/499 (32%), Positives = 262/499 (52%), Gaps = 45/499 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP A+ N++P+A I     W      P++     L+  DPLF ++ ++FI +
Sbjct: 244 VLPAFTGFVPPAMHNLYPNASIVNGSAWNDFA--PQFTNDSFLEPFDPLFAQVQQSFISK 301

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
           Q   +G  SHIY  D ++EN P    P Y++++ AA +S +++ D DA WLMQGWLF S 
Sbjct: 302 QQAAFGNVSHIYTLDQYNENDPYSGDPSYLTNISAATFSSLRAADPDATWLMQGWLFFSS 361

Query: 127 DPFWRPPQMKALLNSVPL----GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
             FW P +++A L  VP       +++LDL++E +P W     ++G  +IWC LH++ GN
Sbjct: 362 ADFWTPERVEAYLAGVPGDDDGSGMLILDLYSEAQPQWQRLSSYFGKRWIWCELHDYGGN 421

Query: 183 IEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
           +   G   ++   P+ A  S N +MVGVG++ EG+E N ++YD++ + A+    ++   +
Sbjct: 422 MGFEGNFANVTEAPLAALASPNVSMVGVGLTPEGMEGNEIIYDVLLDQAWSSSPINKTEY 481

Query: 243 INQYSVRRY-GRSVPAIQ-DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
              ++ RRY    +P    +AW  L  TVY+ TD  +      I+        +++VT  
Sbjct: 482 AQAWATRRYPADELPECAIEAWQTLAATVYSNTDPGSQATVKSILELEPALSGLVNVTG- 540

Query: 301 KYQNYGKPVSKEAVLKSETSSYDHP-HLWYST-SEVIRALELFIASGN---ELSASNTYR 355
                                  HP H++Y T + ++ AL+  + +G+    L A   YR
Sbjct: 541 ----------------------HHPTHVFYDTNTTIVPALQQLVQAGHSTPSLLAIPEYR 578

Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQLN--DAHGVFQLSRRFLELVEDMDGLLACHDGF 413
           YDL+DLTRQ L     +L+ +++  Y      +  V    +  LELV D+D +L  ++ F
Sbjct: 579 YDLVDLTRQLLVNRFIDLYADLLAVYNTTSASSASVSAAGQPMLELVADLDKVLMTNENF 638

Query: 414 LLGPWLESAKQLAQNEEQEKQY-EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
            L  W ++A+  A        Y E+NAR QIT+W    +     + DY +K W GL+ DY
Sbjct: 639 QLSRWTDAARSWANGNASYAAYLEYNARNQITLWGPKGE-----INDYASKQWGGLVGDY 693

Query: 473 YGPRAAIYFKYMIESLESG 491
           YG R A++ +Y+  S  +G
Sbjct: 694 YGKRWAMFIQYLEGSKSNG 712


>gi|160883168|ref|ZP_02064171.1| hypothetical protein BACOVA_01137 [Bacteroides ovatus ATCC 8483]
 gi|156111393|gb|EDO13138.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
           8483]
          Length = 737

 Score =  270 bits (691), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 160/530 (30%), Positives = 254/530 (47%), Gaps = 39/530 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF+G+VP+ L+ ++P AKI+++ +W   +   R   ++ LD  DPLF  I + F+E+
Sbjct: 233 ILPAFAGHVPSELKRIYPEAKISRMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G T HIY  D F+E  PP   PE++++    IY  M   D DA WL   WLF  D
Sbjct: 290 QTKLFG-TDHIYGADPFNEVAPPSWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYID 348

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W   +++A L +VP  KL++LD + E   +W  + +++G PY+WC L NF GN  + 
Sbjct: 349 RHLWTNERVEAFLKAVPQNKLLLLDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLA 408

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G    +        T+      G+G ++EG + NP +Y+ +   A+     D   WI Q 
Sbjct: 409 GNTKEVGKRIENVYTNGGENFSGLGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQL 467

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + RR G     ++ AW +LY ++Y                     P+ +          G
Sbjct: 468 ADRRIGLRNQQMRRAWKLLYDSIYTA-------------------PAALG--------QG 500

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
             ++    LK   +    P + YS   +    E+ + +G    +  TY YD++++ RQ L
Sbjct: 501 TLMNARPCLKGNGNWTTTPTVAYSNETLFEVWEMLLKAGEHRHS--TYEYDVVNIGRQVL 558

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
             Y  +L     E Y       + Q      +L+ D++ LL+    FLLG W+E A+ L 
Sbjct: 559 GNYFGKLRDEFAETYSRKQLPLLKQKGAEMKQLLRDVNTLLSTQSSFLLGKWIEDARSLG 618

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            +E  +  YE NART ++ W D  Q     L DY N+ W GL+  YY PR  ++   +I 
Sbjct: 619 IDEASKNYYEENARTIVSTWGDKDQS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIR 674

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           S+ +   F    + +   +   DW      YP E  G+A+  +  L NKY
Sbjct: 675 SVSNKQPFNADAFHQRVTQFEIDWVKSHERYPSEPVGNAVEIATLLMNKY 724


>gi|423292430|ref|ZP_17271008.1| hypothetical protein HMPREF1069_06051 [Bacteroides ovatus
           CL02T12C04]
 gi|423294620|ref|ZP_17272747.1| hypothetical protein HMPREF1070_01412 [Bacteroides ovatus
           CL03T12C18]
 gi|392661665|gb|EIY55241.1| hypothetical protein HMPREF1069_06051 [Bacteroides ovatus
           CL02T12C04]
 gi|392675811|gb|EIY69252.1| hypothetical protein HMPREF1070_01412 [Bacteroides ovatus
           CL03T12C18]
          Length = 727

 Score =  270 bits (691), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 160/530 (30%), Positives = 254/530 (47%), Gaps = 39/530 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF+G+VP+ L+ ++P AKI+++ +W   +   R   ++ LD  DPLF  I + F+E+
Sbjct: 223 ILPAFAGHVPSELKRIYPEAKISRMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEE 279

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G T HIY  D F+E  PP   PE++++    IY  M   D DA WL   WLF  D
Sbjct: 280 QTKLFG-TDHIYGADPFNEVAPPSWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYID 338

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W   +++A L +VP  KL++LD + E   +W  + +++G PY+WC L NF GN  + 
Sbjct: 339 RHLWTNERVEAFLKAVPQNKLLLLDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLA 398

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G    +        T+      G+G ++EG + NP +Y+ +   A+     D   WI Q 
Sbjct: 399 GNTKEVGKRIENVYTNGGENFSGLGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQL 457

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + RR G     ++ AW +LY ++Y                     P+ +          G
Sbjct: 458 ADRRIGLRNQQMRRAWKLLYDSIYTA-------------------PAALG--------QG 490

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
             ++    LK   +    P + YS   +    E+ + +G    +  TY YD++++ RQ L
Sbjct: 491 TLMNARPCLKGNGNWTTTPTVAYSNETLFEVWEMLLKAGEHRHS--TYEYDVVNIGRQVL 548

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
             Y  +L     E Y       + Q      +L+ D++ LL+    FLLG W+E A+ L 
Sbjct: 549 GNYFGKLRDEFAETYSRKQLPLLKQKGAEMKQLLRDVNTLLSTQSSFLLGKWIEDARSLG 608

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            +E  +  YE NART ++ W D  Q     L DY N+ W GL+  YY PR  ++   +I 
Sbjct: 609 IDEASKNYYEENARTIVSTWGDKDQS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIR 664

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           S+ +   F    + +   +   DW      YP E  G+A+  +  L NKY
Sbjct: 665 SVSNKQPFNADAFHQRVTQFEIDWVKSHERYPSEPVGNAVEIATLLMNKY 714


>gi|295086519|emb|CBK68042.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
           XB1A]
          Length = 727

 Score =  270 bits (690), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 160/530 (30%), Positives = 254/530 (47%), Gaps = 39/530 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF+G+VP+ L+ ++P AKI+++ +W   +   R   ++ LD  DPLF  I + F+E+
Sbjct: 223 ILPAFAGHVPSELKRIYPEAKISRMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEE 279

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G T HIY  D F+E  PP   PE++++    IY  M   D DA WL   WLF  D
Sbjct: 280 QTKLFG-TDHIYGADPFNEVAPPSWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYID 338

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W   +++A L +VP  KL++LD + E   +W  + +++G PY+WC L NF GN  + 
Sbjct: 339 RHLWTNERVEAFLKAVPQDKLLLLDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLA 398

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G    +        T+      G+G ++EG + NP +Y+ +   A+     D   WI Q 
Sbjct: 399 GNTKEVGKRIENVYTNGGENFSGLGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQL 457

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + RR G     ++ AW +LY ++Y                     P+ +          G
Sbjct: 458 ADRRIGLRNQQMRRAWKLLYDSIYTV-------------------PAALG--------QG 490

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
             ++    LK   +    P + YS   +    E+ + +G    ++  Y YD++++ RQ L
Sbjct: 491 ALMNARPCLKGNGNWTTTPTVAYSNETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVL 548

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
             Y  +L     EAY       + Q      +L+ D+D LL+    FLLG W+E A+ L 
Sbjct: 549 GNYFGKLRDEFAEAYSRKQLPLLKQKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLG 608

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            +E  +  YE NART ++ W D  Q     L DY N+ W GL+  YY PR  ++   +I 
Sbjct: 609 TDEVSKNYYEENARTIVSTWGDKDQS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIR 664

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           S+ +   F    + +   +   DW      YP E  G+ +  +  L NKY
Sbjct: 665 SVSNKQPFNADAFHQRVTQFEIDWVKSHERYPSEPVGNVVEIATLLMNKY 714


>gi|62088640|dbj|BAD92767.1| huntingtin interacting protein-1-related [Homo sapiens]
          Length = 449

 Score =  270 bits (689), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 155/418 (37%), Positives = 232/418 (55%), Gaps = 50/418 (11%)

Query: 112 DSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVP 170
           D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+VLDLFAE +P+++ +  F G P
Sbjct: 18  DTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQP 77

Query: 171 YIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEM 230
           +IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+
Sbjct: 78  FIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAEL 137

Query: 231 AFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFP 288
            ++ + V D+ AW+  ++ RRYG S P    AW +L  +VYNC+ +     NR  +V  P
Sbjct: 138 GWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRP 197

Query: 289 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 348
                                     L+  TS      +WY+ S+V  A  L + S   L
Sbjct: 198 S-------------------------LQMNTS------IWYNRSDVFEAWRLLLTSAPSL 226

Query: 349 SASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVED 402
           + S  +RYDL+DLTRQA+ +  +  +     AY       L  A GV        EL+  
Sbjct: 227 ATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELASLLRAGGVLAY-----ELLPA 281

Query: 403 MDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGN 462
           +D +LA    FLLG WLE A+  A +E +   YE N+R Q+T+W      E ++L DY N
Sbjct: 282 LDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYAN 336

Query: 463 KYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE 520
           K  +GL+ +YY PR  ++ + +++S+  G  F+   + +   +L   +   +  YP +
Sbjct: 337 KQLAGLVANYYTPRWRLFLEALVDSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQ 394


>gi|298385999|ref|ZP_06995556.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
           1_1_14]
 gi|298261227|gb|EFI04094.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
           1_1_14]
          Length = 715

 Score =  269 bits (688), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 165/533 (30%), Positives = 271/533 (50%), Gaps = 54/533 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V   F G VP+ L+  +P A++ + G W S++  P      +LD  DPLF  + + +  +
Sbjct: 231 VFQGFFGMVPSLLKEKYPEARLVEQGLWNSLQRPP------VLDPADPLFERMAKVWYAE 284

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
             K YG+ + ++  D F E   T  +D    ++     + + M+  + DA W++Q WL +
Sbjct: 285 YEKLYGK-ADLFGGDLFHEGGKTGGID----VTDAARRVQTAMKRYNPDATWVIQAWLGN 339

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
                  P+ K LL  +     +++DL AE    W   K F G P++W  + N+ GNI +
Sbjct: 340 -------PK-KELLAGLDRKNTLIVDLAAEFWDNWRKRKGFDGFPWLWSHISNYGGNIGL 391

Query: 186 YGILDSIAFGPVEAR--TSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 243
           +G LD+IA GPV+ +  ++ + +M G   + EGIE NPVV+DL++EM ++ E +D+  W+
Sbjct: 392 HGRLDAIATGPVDGQKDSAASPSMKGTSSTPEGIEVNPVVFDLLNEMRWRSEHLDLDVWL 451

Query: 244 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
            +YSVRRYG     +++AW + + T Y    G    +  V  A P +    I+       
Sbjct: 452 KEYSVRRYGVEDENLKEAWTIFHRTAYGTYTGHRRPSESVFCAPPSLKRDKITA------ 505

Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 363
                           S++    ++Y      + + LF+ S + L  ++TY+YD +D  R
Sbjct: 506 ----------------SAWSQCRIFYDPELFAQGVGLFLQSADRLKQTSTYQYDAVDFVR 549

Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
           Q LA    E + N+++AY+  D       S RFL+L++D + LL+ H+ F +G WL+ A+
Sbjct: 550 QYLADLGRETYYNLVDAYRAKDTKQFDYWSERFLQLIKDQNELLSTHERFFVGRWLDMAR 609

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
             ++  E +  YE NAR  I  W     E  S +RDY +K W GLL+DYY PR   Y  Y
Sbjct: 610 LKSKQPELQDLYEHNARMLIGTW----TETLSPVRDYAHKEWGGLLKDYYLPRWTNYIAY 665

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           +  +LE G    + D      +    W N  N Y +E+  D + T++ +Y+KY
Sbjct: 666 LKGTLE-GRSLTVPD----SFQAEKAWVNAHNKYVLEAGVDPVQTAKRMYSKY 713


>gi|383114162|ref|ZP_09934927.1| hypothetical protein BSGG_1664 [Bacteroides sp. D2]
 gi|382948607|gb|EFS30964.2| hypothetical protein BSGG_1664 [Bacteroides sp. D2]
          Length = 727

 Score =  267 bits (682), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 161/530 (30%), Positives = 256/530 (48%), Gaps = 39/530 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF+G+VP+ L+ ++P AKI+++ +W   +   R   ++ LD  DPLF  I + F+E+
Sbjct: 223 ILPAFAGHVPSELKRIYPEAKISRMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEE 279

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G T HIY  D F+E  PP   PE++++    IY  M   D DA WL   WLF  D
Sbjct: 280 QTKLFG-TDHIYGADPFNEVAPPSWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYID 338

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W   +++A L +VP  KL++LD + E   +W  + +++G PY+WC L NF GN  + 
Sbjct: 339 RHLWTNERVEAFLKAVPQDKLLLLDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLA 398

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G    +        T+      G+G ++EG + NP +Y+ +   A+     D   WI Q 
Sbjct: 399 GNTKEVGKRIENVYTNGGENFSGLGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQL 457

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + RR G     ++ AW +LY ++Y                         ++ +G   N  
Sbjct: 458 ADRRIGLRNQQMRRAWKLLYDSIYTAP---------------------AALGQGTLMN-A 495

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
           +P  K     + TS+     + YS   +    E+ + +G    ++  Y YD++++ RQ L
Sbjct: 496 RPCLKGNGNWTTTST-----VAYSNETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVL 548

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
             Y  +L     EAY       + Q      +L+ D+D LL+    FLLG W+E A+ L 
Sbjct: 549 GNYFGKLRDEFAEAYSRKQLPLLKQKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLG 608

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            +   +  YE NART ++ W D  Q     L DY N+ W GL+  YY PR  ++   +I 
Sbjct: 609 TDGASKNYYEENARTIVSTWGDKDQS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIR 664

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           S+ +   F    + +   +   DW      YP E  G+A+  +  L NKY
Sbjct: 665 SVSNKQPFNADAFHQRVTQFEIDWVKSHERYPSEPVGNAVEIATLLMNKY 714


>gi|237719130|ref|ZP_04549611.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
 gi|229451509|gb|EEO57300.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
          Length = 737

 Score =  267 bits (682), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 161/530 (30%), Positives = 256/530 (48%), Gaps = 39/530 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF+G+VP+ L+ ++P AKI+++ +W   +   R   ++ LD  DPLF  I + F+E+
Sbjct: 233 ILPAFAGHVPSELKRIYPEAKISRMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G T HIY  D F+E  PP   PE++++    IY  M   D DA WL   WLF  D
Sbjct: 290 QTKLFG-TDHIYGADPFNEVAPPSWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYID 348

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W   +++A L +VP  KL++LD + E   +W  + +++G PY+WC L NF GN  + 
Sbjct: 349 RHLWTNERVEAFLKAVPQDKLLLLDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLA 408

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G    +        T+      G+G ++EG + NP +Y+ +   A+     D   WI Q 
Sbjct: 409 GNTKEVGKRIENVYTNGGENFSGLGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQL 467

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + RR G     ++ AW +LY ++Y                         ++ +G   N  
Sbjct: 468 ADRRIGLRNQQMRRAWKLLYDSIYTAP---------------------AALGQGTLMN-A 505

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
           +P  K     + TS+     + YS   +    E+ + +G    ++  Y YD++++ RQ L
Sbjct: 506 RPCLKGNGNWTTTST-----VAYSNETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVL 558

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
             Y  +L     EAY       + Q      +L+ D+D LL+    FLLG W+E A+ L 
Sbjct: 559 GNYFGKLRDEFAEAYSRKQLPLLKQKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLG 618

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            +E  +  YE NART ++ W D  Q     L DY N+ W GL+  YY PR  ++   +I 
Sbjct: 619 IDEASKNYYEENARTIVSTWGDKDQS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIR 674

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           S+ +   F    + +   +   DW      YP E   +A+  +  L NKY
Sbjct: 675 SVSNKQPFNADAFHQRVTQFEIDWVKSHERYPSEPVSNAVEIATLLMNKY 724


>gi|281210062|gb|EFA84230.1| hypothetical protein PPL_03307 [Polysphondylium pallidum PN500]
          Length = 744

 Score =  266 bits (679), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 167/529 (31%), Positives = 271/529 (51%), Gaps = 53/529 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V P F+G+VP A +  +PSA I +L +W    +      T  L  TDP++  +   F + 
Sbjct: 251 VFPGFAGHVPVAFKQFYPSANIVELPSWHGFNA------TNHLLTTDPMYDIVADRFYQV 304

Query: 68  QLKEYGRTSHI--YNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q + YG  + I  ++ D F+E  PP +S ++++   + I++ +   + D+ W++Q W  +
Sbjct: 305 QNEIYGAYAKIDYFSIDPFNELIPPSNSSQFLNECSSRIFNAINRFNPDSTWVLQNWFLN 364

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
              FW   Q+ + L  VP+G+L+VLDL++E+KP+W+ +  + G  +IW MLHNF G   +
Sbjct: 365 -SAFWGDGQVASFLGGVPIGRLIVLDLWSELKPLWNRTANYQGHKWIWNMLHNFGGRPTI 423

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
            G +  IA  P+EA+ S + TMVG+G++ E IEQN +VYDLMSEM ++    D+  W++ 
Sbjct: 424 SGRMPIIANEPLEAKAS-SPTMVGIGLTPEAIEQNVIVYDLMSEMGWRSRSFDLNLWVDA 482

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           Y  RRYG ++P ++  W +L +TVY                 P+  P+          NY
Sbjct: 483 YVTRRYGVNLPNLKPVWKMLAYTVYFS---------------PNRSPA----------NY 517

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRA-LELFIASGNELSASNTYRYDLIDLTRQ 364
              ++K+  L  +   Y +P +      ++ A  EL       + +S TYRYDL ++T Q
Sbjct: 518 ---IAKKPSLDFQLGLYYNPVV------IVDAWRELLAVDSTIVRSSETYRYDLAEITLQ 568

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLEL-VEDMDGLLACHDGFLLGPWLESAK 423
           AL+ Y N     + ++Y  +D    FQ +R+     +  MD +        LG W   A+
Sbjct: 569 ALSNYFNGNLKQLYQSYYASDFQ-TFQSARQNCSFALRAMDAVADTVQLLKLGKWTADAR 627

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
           + A +  + + YE+NAR QIT+W            DY NK+WSGL+ DYY PR  I+F++
Sbjct: 628 KWATDNNERELYEYNARNQITLWGWKDMGNP----DYANKWWSGLIADYYFPRWQIFFEH 683

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYP--VESNGDALITSQ 530
           +  ++     F         +     W    N+YP  V SN D    S+
Sbjct: 684 LEHAIFDKSKFNEHSLAVNTMLHEERWNKQTNIYPSDVNSNDDVHTVSK 732


>gi|380692804|ref|ZP_09857663.1| putative alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
          Length = 709

 Score =  265 bits (677), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 163/533 (30%), Positives = 267/533 (50%), Gaps = 54/533 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V   F G VP++L+  +P A + + G W S++  P      +LD  DPLF ++ + +  +
Sbjct: 225 VFQGFFGMVPSSLKEKYPEAHLVEQGLWNSLQRPP------VLDPADPLFEQMAKVWYTE 278

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
             K YG+ + ++  D F E   T  +D    ++     + + M+  + DA W++Q WL +
Sbjct: 279 YEKLYGK-ADLFGGDLFHEGGKTGGID----VTDAARRVQTAMKQYNPDATWVIQAWLGN 333

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
                  P+ K LL  +     +++DL AE    W   K F G P++W  + N+  NI +
Sbjct: 334 -------PK-KELLAGLDRKHTLIVDLAAEFWDNWRKRKGFDGFPWLWSHISNYGANIGL 385

Query: 186 YGILDSIAFGPVEART--SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 243
           +G LD+IA GP++ R     + +M G   + EGIE NPVV+DL++EM ++ E +D+  W+
Sbjct: 386 HGRLDAIATGPIDGRKDPEASPSMKGTSSTPEGIEVNPVVFDLLNEMRWRSEYLDIDTWL 445

Query: 244 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
            +YSVRRYG     ++ AW + + T Y    G    +  V  A P +    I+       
Sbjct: 446 KEYSVRRYGAEDENLKKAWIIFHRTAYGTYSGHRRPSESVFCAPPSLKRDKITA------ 499

Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 363
                           S++    ++Y      + + LF+ S + L  ++TY+YD +D  R
Sbjct: 500 ----------------SAWSQCRIFYDPDLFAQGVGLFLQSADHLKQTSTYQYDAVDFVR 543

Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
           Q LA    E + N+++AY+  D       S RFL+L++D + LL+ H  F +G WL+ A+
Sbjct: 544 QYLADLGREAYYNLVDAYRAKDTKQFDYWSERFLQLIKDQNELLSTHKCFFVGRWLDMAR 603

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
             ++  E +  YE NAR  I  W     E  S +RDY +K W GLL+DYY PR   Y  Y
Sbjct: 604 SKSKQPELQDLYEHNARMLIGTW----TETLSPVRDYAHKEWGGLLKDYYLPRWTNYIAY 659

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           +  +LE G    + +      ++   W N  N Y +E+  D + T++ +Y KY
Sbjct: 660 LKGTLE-GRSLTVPN----SFQVEKAWVNAHNKYVLETGVDPVETAKRMYRKY 707


>gi|313203962|ref|YP_004042619.1| alpha-N-acetylglucosaminidase [Paludibacter propionicigenes WB4]
 gi|312443278|gb|ADQ79634.1| Alpha-N-acetylglucosaminidase [Paludibacter propionicigenes WB4]
          Length = 738

 Score =  263 bits (672), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 177/536 (33%), Positives = 265/536 (49%), Gaps = 40/536 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQL-GNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
           +L AF+G+VP  L+ + PSAKIT++   W  + ++     TY LD TD LF EI + F+ 
Sbjct: 233 ILSAFAGHVPEQLKTLRPSAKITRIEPGWGGMAAE---YTTYFLDPTDNLFGEIQKRFLT 289

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
            Q K YG T H+Y+ D F+E TPP   P+Y++++G  IY  M   D +A+W    W F  
Sbjct: 290 VQQKLYG-TDHLYSADPFNEITPPSWEPDYLANVGKTIYETMSQVDKEAIWYQMSWTFYN 348

Query: 127 DP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
           DP  W  P++ A++++VP GKL  LD   E +  +  S  FYG P+IWC L NF  N  +
Sbjct: 349 DPTHWTRPRLSAMIHAVPQGKLFFLDYNCEEEEFFRKSDNFYGAPFIWCYLGNFGANTHL 408

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQ-HEKVDVKAWIN 244
              L+ +       + +  +  VGVG ++EGI  NP +Y+ + EM ++  E V     I 
Sbjct: 409 VAPLNKVV--NRLGKLTYGSACVGVGSTLEGINVNPEIYETVLEMPWRADETVTADTLIR 466

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
            Y+ RR G    A+ +AW +L   V    D A       +V    V P +  +T   +  
Sbjct: 467 HYAERRAGARDKAVIEAWQLLRQHV--LVDTAVGIWNHCVVF--QVSP-VTDLTRAFWAT 521

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRAL-ELFIASGNELSASNTYRYDLIDLTR 363
                              +P + Y   ++  AL  +F AS N    ++ YR+D+++LTR
Sbjct: 522 -------------------NPKIPYRNVDLAIALNRMFQASANS-KKTDAYRFDVVNLTR 561

Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
           QAL  Y   L+  ++EAY   +     + S  FL+L +++DGLLA    FLLG WL  A+
Sbjct: 562 QALGNYGTVLYHKMMEAYSRKNLIDFRKYSGEFLQLGQEIDGLLATRHEFLLGKWLADAR 621

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
                  ++  YE NAR  IT W     +    L DY N+ W+GLLR YY PR   +   
Sbjct: 622 SWGTTPAEKAYYERNAREIITTW----HKAGGGLTDYSNRQWNGLLRSYYLPRWKEFINR 677

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNG-RNVYPVESNGDALITSQWLYNKYLQ 538
           +  SL +G  +  K +          W +   + Y     GDA+  +  L+ KY Q
Sbjct: 678 LDTSLSTGKDYDDKAFAAWCSAFEQHWVDSPSSAYSDTETGDAVKMAFELFGKYKQ 733


>gi|449541595|gb|EMD32578.1| glycoside hydrolase family 89 protein [Ceriporiopsis subvermispora
           B]
          Length = 752

 Score =  263 bits (671), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 156/514 (30%), Positives = 272/514 (52%), Gaps = 38/514 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP A+  V  +A I     W +    P       L+  DPLF  + ++FI +
Sbjct: 234 VLPAFTGFVPRAMSTVHSNASIVNGSQW-APGFPPSLTNVSFLEPFDPLFATLQKSFIAK 292

Query: 68  QLKEYG-RTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-S 125
           Q + YG   SHIY  D ++EN P   +  Y+SS+    ++ +++ D DAVW++QGWLF S
Sbjct: 293 QQEAYGANISHIYTLDQYNENNPFSGNLSYLSSISEGTFTSLRAADPDAVWMLQGWLFFS 352

Query: 126 YDPFWRPPQMKALLNSVPLG-KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
            + FW   +++A L  VP    ++VLDL++E +P W+ +  ++G  ++WC LH++ G I 
Sbjct: 353 SEAFWTNERIEAYLGGVPTNDSMIVLDLYSEAQPQWNRTSSYFGKQWVWCELHDYGGTIG 412

Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
           + G LD+I  GP+ A  S  ++M G+G++MEG E N +VYDL+ + A+    +++ +++ 
Sbjct: 413 LEGNLDAITTGPIAALNSPGSSMKGMGLTMEGQEGNEIVYDLLLDQAWSSSPINIASYVK 472

Query: 245 QYSVRRY-GRSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
            +  RRY    +P A Q+AW +L  TVYN  D     ++  I    +++P +  +     
Sbjct: 473 GWVSRRYLVEPLPSAAQEAWRILSTTVYNNQD---PNSQSTIKNIYELEPVLTGL----- 524

Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI---ASGNELSASNTYRYDLI 359
                 V++  +L +    YD      + S ++ AL+L +   A    LS    + +D++
Sbjct: 525 ------VNRTGILPT-VIPYD------TNSTIVPALQLLVKAKAQNAALSTVPEFVHDVV 571

Query: 360 DLTRQALAKYANELFLNIIEAYQLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
           D++RQ L+    + +  +I+ Y   +  +  V +  +  + ++  +D LLA ++ FLL  
Sbjct: 572 DVSRQLLSNRFIDAYTALIDTYNNTNVTSDAVIRAGQPLMTILSQLDALLATNENFLLSS 631

Query: 418 WLESAKQLAQNEEQEKQY-EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
           W+  A+ L+  +E    Y E+NAR QIT+W  + +     + DY +K W+GL+  YY  R
Sbjct: 632 WIAQARNLSHGDESYAAYLEYNARNQITLWGPDGE-----INDYASKAWAGLISTYYAAR 686

Query: 477 AAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 510
              +  Y+  +      F    +  + I L  +W
Sbjct: 687 WQTFIDYLASTKRLARPFDTSAFSNQMILLGQEW 720


>gi|119480815|ref|XP_001260436.1| alpha-N-acetylglucosaminidase, putative [Neosartorya fischeri NRRL
           181]
 gi|119408590|gb|EAW18539.1| alpha-N-acetylglucosaminidase, putative [Neosartorya fischeri NRRL
           181]
          Length = 748

 Score =  262 bits (669), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 153/489 (31%), Positives = 261/489 (53%), Gaps = 45/489 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP A+  V P+A +     W     D R+     L+  DP F  + R+FI++
Sbjct: 233 VLPAFTGFVPRAISRVLPNATVVNGSRWGGF--DERYTNDTFLEPFDPSFTRLQRSFIQK 290

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
           Q + YG  +H+Y  D ++EN P     +Y+ ++    +  ++S D +AVWLMQGWLF S 
Sbjct: 291 QQQAYGNITHVYTLDQYNENDPYSGDLDYLRNVTRNTWLSLKSADPNAVWLMQGWLFYSN 350

Query: 127 DPFWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             FW   ++KA L+ V + + ++VLDLF+E +P W  ++ +YG P+IWC LH++ GN+ +
Sbjct: 351 SDFWTDERVKAYLSGVEVDQDMLVLDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGL 410

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG + ++     +A  + + ++VG G++MEG E N ++YDL+ + A+  + +D   + + 
Sbjct: 411 YGQVMNVTVNATQALAASD-SLVGFGLTMEGQEGNEIMYDLLLDQAWSRQPIDTDHYFHN 469

Query: 246 YSVRRY-----GRSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
           +   RY     G +VP  +  AW++L  TVYN T+  +      I    ++ PSI     
Sbjct: 470 WVKTRYSSGVRGSAVPEELHQAWDILRTTVYNNTNLTSTAVSKSIF---ELQPSI----S 522

Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHP-HLWYSTSEVIRALELFIASGNE---LSASNTYR 355
           G     G                DHP  + Y  + +++A +L  ++ ++   L +   + 
Sbjct: 523 GLLNRTG----------------DHPTTVNYDPAALVQAWQLMDSAASKDRSLWSQPAFL 566

Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
           YD++D+TRQ +A     +++N++  YQ      V       ++L+ D+D +L+ +D F L
Sbjct: 567 YDMVDITRQVMANAFIPMYINLVSTYQA--GASVSTDGSNLIQLLRDVDSVLSTNDNFRL 624

Query: 416 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
             W+ SA+  A+N+ +   YE+NAR QI +W          + DY +K W GL+  YY P
Sbjct: 625 STWIRSARSWARNDTEADFYEYNARNQIALW-----GPMGEINDYASKQWGGLVSAYYIP 679

Query: 476 RAAIYFKYM 484
           R   +  Y+
Sbjct: 680 RWQTFLHYL 688


>gi|291515668|emb|CBK64878.1| Alpha-N-acetylglucosaminidase (NAGLU) [Alistipes shahii WAL 8301]
          Length = 713

 Score =  261 bits (668), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 168/530 (31%), Positives = 260/530 (49%), Gaps = 41/530 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP  L+ + P A+IT++  W     D R+ C++L D  DPLF  I R F+ +
Sbjct: 216 VLPAFAGHVPQELKRLHPDARITRVSYWGGF--DDRYRCSFL-DPMDPLFAVIQREFLTE 272

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + +G T HIY  D F+E   P   PE ++ +   IY  M   D +AVWL  GWLF  D
Sbjct: 273 QTRLFG-TGHIYGADPFNEIDAPTWDPETLAGMSRHIYESMAEVDPEAVWLQMGWLFYAD 331

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P  W    ++A L +VP  +L++LD F E   IW  +++F+G PY+WC L NF GN  + 
Sbjct: 332 PTHWTAENIRAFLGAVPQDRLLMLDYFCEFTEIWKQTEKFHGQPYLWCYLGNFGGNTMLS 391

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G   +++    +A       + GVG ++EG   N  +Y+ + + A+     D   WI + 
Sbjct: 392 GNFHTVSARMEDAFAHGGDNLRGVGSTLEGFGVNQFMYEFVLDKAWNTGIAD-DEWIARL 450

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + RR G   PA +  W  L  +VY  T  A      +  A P +        EG +    
Sbjct: 451 ADRRTGFRDPAARTGWRTLCDSVY--TLPAQTGQSPLTNAHPAL--------EGNWHWTT 500

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
           KP          T+ Y  P LW       R  E  +A  +E    +TYR+D++++ RQ L
Sbjct: 501 KP----------TTGYRFPTLW-------RVWEELLAVDSE---RDTYRFDVVNIGRQVL 540

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
             Y          AY  +D   +   +RR   L+ D++ L ACH  F L  W+ +A+   
Sbjct: 541 GDYFLIERDRFAAAYAQHDRKAMDAAARRMTGLLADINLLTACHPEFSLERWIAAARGFG 600

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            +   +  YE NAR  I++W D+       L DY ++ WSG++  YY PR  ++ + ++E
Sbjct: 601 SDNASKDYYETNARMLISVWGDSYH-----LTDYASRTWSGMISTYYAPRWRLFIERVME 655

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           +  +G  F  + + RE       W +  +       GDA+ T++ L +KY
Sbjct: 656 AARTGRMFDHEAFDREIRDFECRWADASHPLTFPEAGDAVRTARELASKY 705


>gi|423287380|ref|ZP_17266231.1| hypothetical protein HMPREF1069_01274 [Bacteroides ovatus
           CL02T12C04]
 gi|392672495|gb|EIY65962.1| hypothetical protein HMPREF1069_01274 [Bacteroides ovatus
           CL02T12C04]
          Length = 726

 Score =  261 bits (667), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 160/536 (29%), Positives = 263/536 (49%), Gaps = 39/536 (7%)

Query: 2   LLADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIG 61
           LL    VLPAFSG+VPA L+ ++P A ITQ+  W       R   ++ +D  DPLF +I 
Sbjct: 223 LLGMTPVLPAFSGHVPAELKRLYPDAAITQMSQWGGYDKKYR---SHFIDPMDPLFGKIQ 279

Query: 62  RAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           + ++E+Q K YG T HIY  D F+E   P    +++ ++   I+  ++  DS A W+   
Sbjct: 280 KRYLEKQTKLYG-TDHIYGIDPFNEVDSPNWDEDFLRTVSDKIFHSIEQVDSLAHWIQMT 338

Query: 122 WLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFA 180
           W+F +    W  P++KA LNSVP  KL++LD + +   IW  ++Q+YG PYIWC L NF 
Sbjct: 339 WMFYHSKDKWSQPRIKAFLNSVPDDKLILLDYYCDSVEIWRETQQYYGKPYIWCYLGNFG 398

Query: 181 GNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVK 240
           GN  + G +D ++             + GVG ++EG++ NP +Y+ + E A+ H   +  
Sbjct: 399 GNSMLAGHVDDVSAKLNRLFVEGGKNISGVGATLEGLDVNPFMYEFVLEKAWSHTITNAD 458

Query: 241 AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
            W+  +++ R G     I DAW  LY  +Y                       I   T G
Sbjct: 459 -WMKNWALCRGGSKSSHIIDAWQQLYKKIY-----------------------IHHATAG 494

Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
           +       ++   +L+   S   HP ++Y   E+      F+ + N    S+ Y++D+I+
Sbjct: 495 Q----AVLMNARPMLEGTDSWNTHPDIYYDNKELWHIWGKFLEAKN--VDSSGYKFDVIN 548

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           + RQ L    ++   +    Y+  +  G+ + + +   L  D+D LL+C   F +G W++
Sbjct: 549 IGRQVLGNLFSDFRDSFTACYRQKNIEGMKEWAEKMNTLFTDVDRLLSCESSFSIGKWIK 608

Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
            A+   +N ++++ YE NAR  +T W     ++A+ L DY N+ W GL   YY  R  ++
Sbjct: 609 DARDWGKNLKEKEYYEQNARCILTTW----GQKATQLNDYANRGWGGLTDSYYRKRWELF 664

Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
            +Y I+ +  G     K +     +    W    NVY   S  D +  +  LY KY
Sbjct: 665 TQYAIDEMSHGKEIDEKSFYNLITEFEYQWTLQTNVYSESSGEDPIRIANLLYIKY 720


>gi|357622373|gb|EHJ73879.1| putative alpha-N-acetyl glucosaminidase [Danaus plexippus]
          Length = 780

 Score =  261 bits (667), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 161/502 (32%), Positives = 255/502 (50%), Gaps = 46/502 (9%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    V PAF+G+VP A + +FP+     +  W   K D  +CC   +D  +P F  I +
Sbjct: 238 LGMIPVFPAFNGHVPKAFEKIFPNTTFHPVETW--NKFDEDYCCNLFVDPREPDFKMISK 295

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F+ +     G +SHIY  D F+E      S   +     AI+S +   D DAVWL+Q W
Sbjct: 296 MFMREITAGLG-SSHIYTADPFNEIKIQPWSTSLVVETAKAIFSSISEYDKDAVWLVQNW 354

Query: 123 LFSYDPF-WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           +F ++P  W   ++ + L SVP G+++VLDL +E  P +   + +YG P+IW MLHNF G
Sbjct: 355 MFVHNPLLWPLKRVNSFLTSVPNGRMLVLDLQSEQWPQYDLYQMYYGQPFIWSMLHNFGG 414

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVK 240
            + M+G   +I     E R  EN+TMVG+G++ EGI QN V+YDLM E A++   V D++
Sbjct: 415 TLGMFGNTKTINKDVYEVRKRENSTMVGIGLTPEGINQNYVIYDLMLESAWRKGPVPDLE 474

Query: 241 AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
            W++ Y+ RRYG +  +I   W  L  +VYN T                     ++   G
Sbjct: 475 EWVSDYAERRYGCNATSI--GWKYLLRSVYNFTG--------------------LNRIRG 512

Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
           KY           V+    S    P  WY   ++  AL+ F+   N   +++ + +DL+D
Sbjct: 513 KY-----------VMTRRPSFNIRPWAWYKGHDLFEALKNFVYVQNPACSTSGFLHDLVD 561

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLS-RRFLELVEDMDGLLACHDGFLLGPWL 419
           +TRQAL     ++++N+      N  + VF  +   F++ + DM  +LA    F +  WL
Sbjct: 562 VTRQALQYKIEQIYMNLQNDRYSN--YMVFNYTISSFIDAMTDMQNILATSSDFKITSWL 619

Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
            SA+ ++    +   Y++NAR QIT+W  N +     + DY  K W+ L + YY PR +I
Sbjct: 620 SSARAISNLPLESSLYDFNARNQITLWGPNGE-----ISDYACKQWAELFKYYYIPRWSI 674

Query: 480 YFKYMIESLESGDGFRLKDWRR 501
           +    +++    + F  K  +R
Sbjct: 675 FLSMALDAKTRNEPFDEKGAQR 696


>gi|423213214|ref|ZP_17199743.1| hypothetical protein HMPREF1074_01275 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392693674|gb|EIY86904.1| hypothetical protein HMPREF1074_01275 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 726

 Score =  261 bits (667), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 160/536 (29%), Positives = 263/536 (49%), Gaps = 39/536 (7%)

Query: 2   LLADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIG 61
           LL    VLPAFSG+VPA L+ ++P A ITQ+  W       R   ++ +D  DPLF +I 
Sbjct: 223 LLGMTPVLPAFSGHVPAELKRLYPDAAITQMSQWGGYDEKYR---SHFIDPMDPLFGKIQ 279

Query: 62  RAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           + ++E+Q K YG T HIY  D F+E   P    +++ ++   I+  ++  DS A W+   
Sbjct: 280 KRYLEKQTKLYG-TDHIYGIDPFNEVDSPNWDEDFLRTVSDKIFHSIEQVDSLAHWIQMT 338

Query: 122 WLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFA 180
           W+F +    W  P++KA LNSVP  KL++LD + +   IW  ++Q+YG PYIWC L NF 
Sbjct: 339 WMFYHSKDKWSQPRIKAFLNSVPDDKLILLDYYCDSVEIWRETQQYYGKPYIWCYLGNFG 398

Query: 181 GNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVK 240
           GN  + G +D ++             + GVG ++EG++ NP +Y+ + E A+ H   +  
Sbjct: 399 GNSMLAGHVDDVSAKLNRLFVEGGKNISGVGATLEGLDVNPFMYEFVLEKAWSHTITNAD 458

Query: 241 AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
            W+  +++ R G     I DAW  LY  +Y                       I   T G
Sbjct: 459 -WMKNWALCRGGSKSSHIIDAWQQLYKKIY-----------------------IHHATAG 494

Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
           +       ++   +L+   S   HP ++Y   E+      F+ + N    S+ Y++D+I+
Sbjct: 495 Q----AVLMNARPMLEGTDSWNTHPDIYYDNKELWHIWGKFLEAKN--VDSSGYKFDVIN 548

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           + RQ L    ++   +    Y+  +  G+ + + +   L  D+D LL+C   F +G W++
Sbjct: 549 IGRQVLGNLFSDFRDSFTACYRQKNIEGMKEWAEKMNTLFTDVDRLLSCESSFSIGKWIK 608

Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
            A+   +N ++++ YE NAR  +T W     ++A+ L DY N+ W GL   YY  R  ++
Sbjct: 609 DARDWGKNLKEKEYYEQNARCILTTW----GQKATQLNDYANRGWGGLTDSYYRKRWELF 664

Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
            +Y I+ +  G     K +     +    W    NVY   S  D +  +  LY KY
Sbjct: 665 TQYAIDEMSHGKEIDEKSFYNLITEFEYQWTLQTNVYSESSGEDPIRIANLLYIKY 720


>gi|295087651|emb|CBK69174.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
           XB1A]
          Length = 703

 Score =  261 bits (667), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 160/536 (29%), Positives = 263/536 (49%), Gaps = 39/536 (7%)

Query: 2   LLADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIG 61
           LL    VLPAFSG+VPA L+ ++P A ITQ+  W       R   ++ +D  DPLF +I 
Sbjct: 200 LLGMTPVLPAFSGHVPAELKRLYPDAAITQMSQWGGYDEKYR---SHFIDPMDPLFGKIQ 256

Query: 62  RAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           + ++E+Q K YG T HIY  D F+E   P    +++ ++   I+  ++  DS A W+   
Sbjct: 257 KRYLEKQTKLYG-TDHIYGIDPFNEVDSPNWDEDFLRTVSDKIFHSIEQVDSLAHWIQMT 315

Query: 122 WLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFA 180
           W+F +    W  P++KA LNSVP  KL++LD + +   IW  ++Q+YG PYIWC L NF 
Sbjct: 316 WMFYHSKDKWSQPRIKAFLNSVPDDKLILLDYYCDSVEIWRETQQYYGKPYIWCYLGNFG 375

Query: 181 GNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVK 240
           GN  + G +D ++             + GVG ++EG++ NP +Y+ + E A+ H   +  
Sbjct: 376 GNSMLAGHVDDVSAKLNRLFVEGGKNISGVGATLEGLDVNPFMYEFVLEKAWSHTITNAD 435

Query: 241 AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
            W+  +++ R G     I DAW  LY  +Y                       I   T G
Sbjct: 436 -WMKNWALCRGGSKSSHIIDAWQQLYKKIY-----------------------IHHATAG 471

Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
           +       ++   +L+   S   HP ++Y   E+      F+ + N    S+ Y++D+I+
Sbjct: 472 Q----AVLMNARPMLEGTDSWNTHPDIYYDNKELWHIWGKFLEAKN--VDSSGYKFDVIN 525

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           + RQ L    ++   +    Y+  +  G+ + + +   L  D+D LL+C   F +G W++
Sbjct: 526 IGRQVLGNLFSDFRDSFTACYRQKNIEGMKEWAEKMNTLFTDVDRLLSCESSFSIGKWIK 585

Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
            A+   +N ++++ YE NAR  +T W     ++A+ L DY N+ W GL   YY  R  ++
Sbjct: 586 DARDWGKNLKEKEYYEQNARCILTTW----GQKATQLNDYANRGWGGLTDSYYRKRWELF 641

Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
            +Y I+ +  G     K +     +    W    NVY   S  D +  +  LY KY
Sbjct: 642 TQYAIDEMSHGKEIDEKSFYNLITEFEYQWTLQTNVYSESSGEDPIRIANLLYIKY 697


>gi|449541596|gb|EMD32579.1| glycoside hydrolase family 89 protein [Ceriporiopsis subvermispora
           B]
          Length = 754

 Score =  261 bits (666), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 149/516 (28%), Positives = 273/516 (52%), Gaps = 43/516 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
            LPAF+G VP A+  ++P+A I     W    +         L+  DPLF  + ++FI +
Sbjct: 237 ALPAFTGFVPRAMSTLYPNASIVNGSAWSGFPAS--LTNVSFLEPFDPLFSTLQKSFITK 294

Query: 68  QLKEYG-RTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-S 125
           Q + YG   +HIY  D ++EN P   +  Y+SS+ A  ++ +++ D DA+W++QGWLF S
Sbjct: 295 QQQAYGTNVTHIYTLDQYNENNPFSGNISYLSSVSAGTFASLRAADPDAIWMLQGWLFFS 354

Query: 126 YDPFWRPPQMKALLNSVPLG-KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
            + FW   +++A L  VP    ++VLDL++E +P W+ +  ++G  ++WC LH + GN+ 
Sbjct: 355 SETFWTDERIQAYLGGVPTNDSMIVLDLYSEAQPQWNRTSSYFGKQWVWCELHGYGGNMG 414

Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
           + G L++I  GP+ A +S+ ++M G+G++MEG E N +VYD++ + A+    +D+ +++ 
Sbjct: 415 LEGNLNAITAGPIAALSSQGSSMKGMGLTMEGQEGNEIVYDVLLDQAWSSAPIDIASYVK 474

Query: 245 QYSVRRYG-RSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
            +  RRY    +P A Q+AW +L  TVYN  D     ++  I +  +++P++  +     
Sbjct: 475 SWVARRYTVEPLPSAAQEAWQILSTTVYNNQD---PNSQATIKSIYELEPTLTGLVN--- 528

Query: 303 QNYGKPVSKEAVLKSETSSYDHPHL--WYSTSEVIRALELFIASGNE---LSASNTYRYD 357
                             +  HP L  + + + V+ AL+L + +  +   L+A   + YD
Sbjct: 529 -----------------RTGHHPTLIPYDTNTTVVPALQLLVKAKEQNAALAAIPEFVYD 571

Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDA--HGVFQLSRRFLELVEDMDGLLACHDGFLL 415
            +D++RQ L+    + +  +++ Y   +A    V +  +  + ++  +D LLA ++ FLL
Sbjct: 572 AVDVSRQLLSNRFIDAYTGLVDTYNNANATSDAVVRAGQPLMVILSQLDALLATNENFLL 631

Query: 416 GPWLESAKQLAQNEEQEKQY-EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 474
             W+  A+  +  +E    Y E+NAR Q+T+W  + +     + DY +K W+GL+  YY 
Sbjct: 632 SSWIAQARNWSHGDESYAAYLEYNARNQVTLWGPDGE-----INDYASKAWAGLISTYYS 686

Query: 475 PRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 510
            R   +  Y+  +      F    +  + I L   W
Sbjct: 687 SRWQTFVDYLASTKRLSRPFDSSAFSSQMILLGQQW 722


>gi|71001188|ref|XP_755275.1| alpha-N-acetylglucosaminidase [Aspergillus fumigatus Af293]
 gi|66852913|gb|EAL93237.1| alpha-N-acetylglucosaminidase, putative [Aspergillus fumigatus
           Af293]
 gi|159129357|gb|EDP54471.1| alpha-N-acetylglucosaminidase, putative [Aspergillus fumigatus
           A1163]
          Length = 756

 Score =  259 bits (662), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 149/485 (30%), Positives = 259/485 (53%), Gaps = 37/485 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP A+  V P+A +     W   + D R+     L+  DP F+ + R+FI++
Sbjct: 241 VLPAFTGFVPRAVSRVLPNATVVNGSRW--EEFDERYTSDTFLEPFDPSFMRLQRSFIKK 298

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
           Q + YG  +HIY  D ++EN P     +Y+ ++    +  ++S D +AVWLMQGWLF S 
Sbjct: 299 QQQAYGNITHIYTLDQYNENAPYSGDLDYLHNVTHNTWLSLKSADPNAVWLMQGWLFYSS 358

Query: 127 DPFWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             FW   ++KA L+ V + + ++VLDLF+E +P W  ++ +YG P+IWC LH++ GN+ +
Sbjct: 359 SGFWTDERVKAYLSGVEVDQDMLVLDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGL 418

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG + ++     +A  + + ++VG G++MEG E N ++YDL+ + A+  + +D   + + 
Sbjct: 419 YGQVMNVTVNATQALAASD-SLVGFGLTMEGQEGNEIMYDLLLDQAWSRQPIDTDHYFHN 477

Query: 246 YSVRRY-----GRSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
           ++  RY     G +VP  +  AW++L  T YN T+  +      I    ++ PSI  +  
Sbjct: 478 WAKTRYSSGVRGSAVPEELYQAWDILRITAYNNTNLTSTAVSKSIF---ELQPSISGLLN 534

Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
               +              T SYD   L     +  R ++   +    L +   + YD++
Sbjct: 535 RTSHH------------PTTVSYDPAAL----VQAWRLMDSAASKAPSLWSQPAFLYDMV 578

Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
           D+TRQ ++     ++ N++  YQ   +  V       ++L+ D+D +L+ +D F L  W+
Sbjct: 579 DITRQVMSNAFIPVYTNLVSTYQAGGS--VSTDGSNLIQLLRDLDSVLSTNDNFRLSTWI 636

Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
           +SA+   +N+ +   YE+NAR Q+T+W    +     + DY +K W GL+  YY PR   
Sbjct: 637 QSARSWVRNDTEADFYEYNARNQVTLWGPKGE-----INDYASKQWGGLVSSYYIPRWQK 691

Query: 480 YFKYM 484
           +  Y+
Sbjct: 692 FLNYL 696


>gi|326437768|gb|EGD83338.1| lysosomal alpha-N-acetyl glucosaminidase [Salpingoeca sp. ATCC
           50818]
          Length = 820

 Score =  258 bits (660), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 165/549 (30%), Positives = 259/549 (47%), Gaps = 60/549 (10%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L   + LP F+G+VP A++ +FP A +TQ   W +  S   +    LL  TDPLF+++G 
Sbjct: 247 LGMVSALPGFAGHVPTAIKRIFPHANLTQTAGWANFNS--TYSDVSLLQPTDPLFLQLGT 304

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F +  +K +G T H++  DT++E  P   +   ++     +Y  M + D +AV+LMQGW
Sbjct: 305 KFYKMLIKAFG-TDHVFQMDTYNEMQPSFTNMTLLAESNRVVYQAMANADPEAVYLMQGW 363

Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
           LF ++ +W P  +K  L+ VP  K+++LDL  E  P++S +  ++G  +IW ML N+ G 
Sbjct: 364 LF-HESYWTPEHVKVYLSGVPDDKMIILDLNTEANPVFSLTSDYFGKLWIWNMLLNYGGR 422

Query: 183 IEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
             +YG    I+  P+        TM G+G++ E IE NPV+++LM EM +     D+  W
Sbjct: 423 RGLYGNATDISTRPLLDLHRAQGTMDGIGITPEAIENNPVMFELMLEMGWHATPPDMHDW 482

Query: 243 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
           I  Y+  RYG+     Q AW +L   VY+      D +R  +   PD     +S +E + 
Sbjct: 483 IAAYASSRYGKRESLTQSAWQLLLEHVYD----QPDIDRFHMEMVPD-----LSSSESRN 533

Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI--ASGNELSASNTYRYDLID 360
            N                          T+ +++A  L +  A    L  +  + YDL+D
Sbjct: 534 SN--------------------------TTALVQAWRLLVTAAVNGSLPITGPFSYDLVD 567

Query: 361 LTRQA-----------LAKYANELFLNIIEAYQLNDAH--GVFQLSRRFLELVEDMDGLL 407
           + RQA           L  +  E   NI  +     +H   +  L    L++  D+D LL
Sbjct: 568 VGRQALLNLWSDVRGMLVAHVKEYNANIDSSPSTAASHVPAIKSLFTLLLDITSDLDRLL 627

Query: 408 ACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSG 467
                +LLG WLESAK  A N ++    E+NAR QIT+W  + +     + DY  K W G
Sbjct: 628 GTDVNYLLGVWLESAKATAANADERATREFNARNQITLWGPDGE-----ITDYAAKQWQG 682

Query: 468 LLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALI 527
           L+ DYY  R  +     + +L S          ++ +K    W N    YP     D + 
Sbjct: 683 LVSDYYVKRWEMMHDATLSALNSSTKIDTSA-PKDTLKFEQAWGNENKTYPTAPQADVVK 741

Query: 528 TSQWLYNKY 536
            S  +  KY
Sbjct: 742 VSAAMLQKY 750


>gi|242809019|ref|XP_002485282.1| alpha-N-acetylglucosaminidase, putative [Talaromyces stipitatus
           ATCC 10500]
 gi|218715907|gb|EED15329.1| alpha-N-acetylglucosaminidase, putative [Talaromyces stipitatus
           ATCC 10500]
          Length = 755

 Score =  255 bits (652), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 153/492 (31%), Positives = 263/492 (53%), Gaps = 43/492 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF G VP A+  V P A +     W +  +   +     ++ TDP F EI ++FI +
Sbjct: 239 ILPAFPGFVPRAITRVLPDADVINGSAWEAFPT--MYTNDTFMEPTDPHFTEIQKSFIAK 296

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
           Q++ YG  +  Y  D F+EN P      Y+ ++    +  +++ DS+AVW+MQGWLF S 
Sbjct: 297 QIEAYGNVTTFYTLDQFNENNPSSGDLSYLRNVSQGTWKTLKAADSNAVWVMQGWLFTSN 356

Query: 127 DPFWRPPQMKALLNSVPL-GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             FW   +++A L  V +   L++LDL +E  P W  +  +YG P+IWC +H++ GN+  
Sbjct: 357 SAFWTNDRIEAYLGGVAVDSDLLILDLASESSPQWQRTNSYYGKPWIWCEIHDYGGNMGF 416

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG + +I   P+ A    ++++VG G+SMEG E N +VYDL+ + A+    +D +++ + 
Sbjct: 417 YGQVMNITNNPIAA-LHNSSSLVGFGLSMEGQEGNEIVYDLLLDQAWNAAPIDTESYFHD 475

Query: 246 YSVRRYG--RSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
           +   RY   RS+P ++  AW++L  TVYN T+ A +       A P     +I  T G  
Sbjct: 476 WVTARYAGSRSIPSSVYSAWDILRTTVYNNTNLAAN-------AVPKAIFELIPSTTGLL 528

Query: 303 QNYGKPVSKEAVLKSETSSYDHP-HLWYSTSEVIRALELFIASGNE---LSASNTYRYDL 358
              G                 HP  L Y+T+++++A  LF  S  +   L  +  + +DL
Sbjct: 529 NRTGH----------------HPTKLNYNTADMVQAWNLFYTSAFKEPSLWLNPAFEFDL 572

Query: 359 IDLTRQALAKYANELFLNIIEAYQLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
           +D++RQ LA     ++ N+I  Y  ++  +  +  +    + +++ +D +LA +  F L 
Sbjct: 573 VDMSRQVLANAFIPVYENLISTYNTSNPSSTKLQTIGAELIGILQALDTVLATNKNFKLS 632

Query: 417 PWLESAKQLAQNEEQEKQY-EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
            WL +A+  A ++   + + E+NAR QIT+W    Q     + DY +K W+GL+  YY P
Sbjct: 633 TWLSAARASAGSQHNIEDFLEYNARNQITLWGPTGQ-----ISDYASKSWAGLVSSYYIP 687

Query: 476 RAAIYFKYMIES 487
           R  ++ +Y+I++
Sbjct: 688 RWKMFVEYLIDT 699


>gi|121698957|ref|XP_001267859.1| alpha-N-acetylglucosaminidase, putative [Aspergillus clavatus NRRL
           1]
 gi|119396001|gb|EAW06433.1| alpha-N-acetylglucosaminidase, putative [Aspergillus clavatus NRRL
           1]
          Length = 671

 Score =  255 bits (651), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 150/450 (33%), Positives = 245/450 (54%), Gaps = 38/450 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP A+  V P A +     W     D ++     L+  DP F  + R+FI +
Sbjct: 233 VLPAFTGFVPRAITRVLPDATVVNGSRWSGF--DEKYTNDTFLEPFDPNFARLQRSFIHK 290

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
           Q + YG  +HIY  D ++EN P    PEY+ ++    +  ++S D DA+W+MQGWLF S 
Sbjct: 291 QQQAYGNITHIYTLDQYNENDPYSGDPEYLRNVTHNTWQSLKSADPDAIWMMQGWLFYSN 350

Query: 127 DPFWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             FW   ++ A L+ V   + ++VLDLF+E +P W  ++ +YG P+IWC LH++ GN+ +
Sbjct: 351 SDFWTDERVHAYLSGVETDEDMLVLDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGL 410

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG + +I     +A    ++ +VG G++MEG E N +VYDL+ + A+    +D  ++ + 
Sbjct: 411 YGQVMNITVNATDALAVSDS-LVGYGLTMEGQEGNEIVYDLLLDQAWSSRPIDTDSYFHD 469

Query: 246 YSVRRYG----RSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
           +   RY      +VP  +  AW++L  T YN T+ AT            V  SI  +   
Sbjct: 470 WVKARYSTARRHNVPHELYQAWDILRTTAYNNTNLATATA---------VSKSIFEL--- 517

Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLW-YSTSEVIRALELFIASGNELSA---SNTYRY 356
                 +P  K   L ++T    HP +  Y  S ++R+ +L +++ +E +A      +RY
Sbjct: 518 ------QP--KLTGLVNQTGH--HPTVVNYEASSLVRSWKLMVSAASESTALWSHPAFRY 567

Query: 357 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
           D++D+TRQ +A     ++LN+   YQ      + Q     + L+ D+D +L+ +D F L 
Sbjct: 568 DMVDVTRQVMANAFIPMYLNVTSTYQ--KGGPISQQGDSLIRLLRDLDAVLSTNDNFRLA 625

Query: 417 PWLESAKQLAQNEEQEKQYEWNARTQITMW 446
            W+ESA+  A+N+ +   YE+NAR QIT+W
Sbjct: 626 TWIESARTWARNDTEADFYEYNARNQITLW 655


>gi|299149196|ref|ZP_07042257.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
 gi|298512863|gb|EFI36751.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
          Length = 738

 Score =  255 bits (651), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 158/532 (29%), Positives = 263/532 (49%), Gaps = 40/532 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPAAL+ V+P+ K T++  W       R  CT+L +  D L+  I + ++ +
Sbjct: 242 VLPAFAGHVPAALKRVYPNIKTTRVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTE 298

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + YG T+HIY  D F+E  PP    + +  +   IY  + + D +AVWL   WLF  D
Sbjct: 299 QTRLYG-TNHIYGIDPFNEIDPPSWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYAD 357

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W  P++K+ L SVP  +L++LD F E   IW  +  ++G PY+WC L NF GN  + 
Sbjct: 358 IKHWTTPRIKSYLRSVPQDRLILLDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLS 417

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G ++ ++    +A  +  + + GVG ++EGI+ N  +Y+ + + A+   + D K W  + 
Sbjct: 418 GPVNLVSERLADALKNGGSNLKGVGSTLEGIDLNQFMYEFVLDKAWNGGQTD-KEWFFKL 476

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + RR G+  P  + AW +L + VY                   V P+   V +G   N  
Sbjct: 477 ADRRIGKISPEARKAWEILANKVY-------------------VQPA--QVGQGTLTN-A 514

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
           +P      LK        P + Y   +++ A  L ++  +     ++Y +DL+++ RQ L
Sbjct: 515 RP-----CLKGNGHWTTKPTIEYQPKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVL 567

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
             Y N +      AY+  D   +     +  E++ D+D L++CH  F L  W+  A+ + 
Sbjct: 568 GNYFNVVRDEFTLAYEAGDIPMMKNRGNKMREILADLDKLVSCHPTFSLHKWITDARDMG 627

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            +   +  YE NAR+ IT+W D+       L DY N+ W+GL   YY  R   +   +IE
Sbjct: 628 HDAASKNYYEMNARSLITIWGDSYH-----LTDYANRSWAGLTNQYYSVRWDHFINEVIE 682

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
           + E    F  +++  +     N+W N  N       GD +  ++ +Y KY +
Sbjct: 683 AAEKKKNFDEEEFFNQSRMYENEWVNPSNRISYNEGGDGIKLARQIYKKYAK 734


>gi|237717696|ref|ZP_04548177.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
 gi|229453015|gb|EEO58806.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
          Length = 729

 Score =  255 bits (651), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 158/532 (29%), Positives = 263/532 (49%), Gaps = 40/532 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPAAL+ V+P+ K T++  W       R  CT+L +  D L+  I + ++ +
Sbjct: 233 VLPAFAGHVPAALKRVYPNIKTTRVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + YG T+HIY  D F+E  PP    + +  +   IY  + + D +AVWL   WLF  D
Sbjct: 290 QTRLYG-TNHIYGIDPFNEIDPPSWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYAD 348

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W  P++K+ L SVP  +L++LD F E   IW  +  ++G PY+WC L NF GN  + 
Sbjct: 349 IKHWTTPRIKSYLRSVPQDRLILLDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLS 408

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G ++ ++    +A  +  + + GVG ++EGI+ N  +Y+ + + A+   + D K W  + 
Sbjct: 409 GPVNLVSERLADALKNGGSNLKGVGSTLEGIDLNQFMYEFVLDKAWNGGQTD-KEWFFKL 467

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + RR G+  P  + AW +L + VY                   V P+   V +G   N  
Sbjct: 468 ADRRIGKISPEARKAWEILANKVY-------------------VQPA--QVGQGTLTN-A 505

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
           +P      LK        P + Y   +++ A  L ++  +     ++Y +DL+++ RQ L
Sbjct: 506 RP-----CLKGNGHWTTKPTIEYQPKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVL 558

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
             Y N +      AY+  D   +     +  E++ D+D L++CH  F L  W+  A+ + 
Sbjct: 559 GNYFNVVRDEFTLAYEAGDIPMMKNRGNKMREILADLDKLVSCHPTFSLHKWITDARDMG 618

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            +   +  YE NAR+ IT+W D+       L DY N+ W+GL   YY  R   +   +IE
Sbjct: 619 HDAASKNYYEMNARSLITIWGDSYH-----LTDYANRSWAGLTNQYYSVRWDHFINEVIE 673

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
           + E    F  +++  +     N+W N  N       GD +  ++ +Y KY +
Sbjct: 674 AAEKKKNFDEEEFFNQSRMYENEWVNPSNRISYNEGGDGIKLARQIYKKYAK 725


>gi|242077446|ref|XP_002448659.1| hypothetical protein SORBIDRAFT_06g030930 [Sorghum bicolor]
 gi|241939842|gb|EES12987.1| hypothetical protein SORBIDRAFT_06g030930 [Sorghum bicolor]
          Length = 252

 Score =  254 bits (650), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 122/185 (65%), Positives = 145/185 (78%)

Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
           YDL+DLTRQ LAKYAN++FL IIE+Y+ N  + V  L + FL LV D+D LL+ H+GFLL
Sbjct: 51  YDLVDLTRQVLAKYANDVFLKIIESYKSNKMNQVTILCKHFLNLVNDLDTLLSSHEGFLL 110

Query: 416 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
           GPWLESAK LA+N EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWSGLLRDYYGP
Sbjct: 111 GPWLESAKGLARNSEQEIQYEWNARTQITMWFDNTETKASLLRDYANKYWSGLLRDYYGP 170

Query: 476 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 535
           RAAIYFK+++ S+E    F L++WRREWI LTN+WQ+ R V+     GD+L  S  LY K
Sbjct: 171 RAAIYFKHLLLSMEKNAPFALEEWRREWISLTNNWQSDRKVFSTTPTGDSLNISWSLYIK 230

Query: 536 YLQGT 540
           YL  T
Sbjct: 231 YLSNT 235


>gi|346324333|gb|EGX93930.1| alpha-N-acetylglucosaminidase, putative [Cordyceps militaris CM01]
          Length = 751

 Score =  253 bits (647), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 151/511 (29%), Positives = 266/511 (52%), Gaps = 48/511 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF G VP  +  V+P+  + +   W       R+     +   DP F E+ +AF+ +
Sbjct: 239 ILPAFPGFVPENITRVWPNVSLAESPIWSGFSG--RFTADKYITPYDPHFAELQKAFLTK 296

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + YG  +  +  D F+EN P     +Y+ ++    +  +++ D  AVW+MQGWLF+ D
Sbjct: 297 QNEAYGNVTSFWTLDQFNENKPASGELDYLKNVSHNTWQTLKAADPSAVWVMQGWLFASD 356

Query: 128 P-FWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             +W   ++K+ L+ VP+ + +++LDLFAE  P W  ++ FYG P+IWC LH++ GN+ +
Sbjct: 357 KTYWIDDRVKSFLDGVPVNEDMLLLDLFAESTPQWQRTESFYGKPWIWCQLHDYGGNMGL 416

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG ++++    VEA    + ++VG G+SMEG E N ++YDL+ + A++ E ++   + + 
Sbjct: 417 YGQIENVTKNAVEA-VQTSKSIVGFGLSMEGQEGNEIMYDLLLDQAWRKEAIETDKYFSD 475

Query: 246 YSVRRYG---RSVPA-IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           +   RYG   + +P  +  AW+ +  TVYN TD +      V  +  ++ PSI  +    
Sbjct: 476 WVTVRYGADHKEIPENLYTAWDKVRSTVYNNTDSSVTA---VTKSIFELAPSISGLV--- 529

Query: 302 YQNYGKPVSKEAVLKSETSSYDHP-HLWYSTSEVIRALELFIASGNE---LSASNTYRYD 357
                              +  HP  + Y T  +I A     ++G++   L  +  YRYD
Sbjct: 530 -----------------NRTGHHPTKITYDTKTLISAWNDMFSAGDQARWLFDNEAYRYD 572

Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
           L D TRQ LA      +  ++E Y+ N+  GV     R   +++ MD +L  +  F L  
Sbjct: 573 LTDWTRQVLANAFEATYNKLVEKYKSNNTKGVKCAGDRLQAILQTMDQVLDTNPSFKLST 632

Query: 418 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 477
           W+++A++     E    +E+NAR Q+T+W  N +     + DY +K W+GL+ +YY  R 
Sbjct: 633 WIQAARK--SGGEAADFFEYNARNQVTLWGPNGE-----IEDYASKQWAGLVGNYYAHRW 685

Query: 478 AIYFKYMI-----ESLESGDGFRLKDWRREW 503
            ++  Y++     E  ++    +L++W  +W
Sbjct: 686 QMFVDYLVATDPKEYDQNVFKKKLREWETQW 716


>gi|83775903|dbj|BAE66022.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 633

 Score =  253 bits (645), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 154/494 (31%), Positives = 256/494 (51%), Gaps = 47/494 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF G VP A++ V P A +     W   +   ++     L+  D  F ++ ++ I +
Sbjct: 112 VLPAFPGFVPPAIKRVRPHATVVNGSQWSGFQK--KFTEVSFLNPLDETFAQLQKSVISR 169

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + +G  +H+Y  D F+E  P      Y+ +L    +  +++ +  AVW+MQGWLF YD
Sbjct: 170 QTRAFGNVTHVYALDQFNEINPASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YD 228

Query: 128 P--FWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
              FW P ++ A L+ V     +++LDL++E KP W  ++ ++G P+IWC LH+F GN+ 
Sbjct: 229 KKDFWDPNRISAYLSGVERNDDMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMG 288

Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
           MYG + +I   P+EA  +++ ++VG G++MEG E N +VYDL+ + A+  + +D +A+  
Sbjct: 289 MYGQIMNITSDPIEA-LNKSDSLVGFGLTMEGQEGNEIVYDLLLDQAWSAKPIDTRAYFQ 347

Query: 245 QYSVRRYGR--SVP-AIQDAWNVLYHTVYNCTDGAT-DKNRDVIVAFPDVDPSIISVTEG 300
            +   RY    SVP  +  AW++L  TVYN T+  T    + +    PD+   +  V  G
Sbjct: 348 SWVRSRYSGNFSVPNELYTAWDLLRKTVYNNTNLTTYSLTKSIFEISPDIAGLVGRV--G 405

Query: 301 KYQ-----NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 355
            Y      NY   V  E       ++   P LW+S +                     Y 
Sbjct: 406 HYPTPTSINYDPMVLNEVWSLFMNATRKEPSLWHSPA---------------------YE 444

Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDG 412
           YD++D+TRQ +      ++ ++I +++    N    V   S R L L+  +D +L+C++ 
Sbjct: 445 YDMVDITRQLMGNAFVNVYSDLISSWKSETENRTTNVTSQSERLLNLLSAIDKVLSCNEN 504

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           F L  W+ SA+      E +  +E+NAR QIT+W    +     + DY +K W+GL+  Y
Sbjct: 505 FSLTTWISSARDWGNTTETKDFFEYNARNQITLWGPTGE-----ISDYASKAWAGLISSY 559

Query: 473 YGPRAAIYFKYMIE 486
           Y PR +I+  Y+ E
Sbjct: 560 YKPRWSIFVDYLGE 573


>gi|395331391|gb|EJF63772.1| alpha-N-acetylglucosaminidase [Dichomitus squalens LYAD-421 SS1]
          Length = 750

 Score =  252 bits (644), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 161/518 (31%), Positives = 276/518 (53%), Gaps = 40/518 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP+F+G VP AL +++P+A I     W    +         L+  DPLF  I  +FI +
Sbjct: 233 VLPSFTGFVPRALSSLYPNASIVNGSQWEGFPT--ALTNDSFLEPFDPLFTTIQTSFISK 290

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
           Q + YG  SHIY  D ++EN P    P Y++++ A  ++ +++ D DAVWLMQGWLF S 
Sbjct: 291 QREAYGNVSHIYALDQYNENDPFSGDPAYLANVTAGTFASLRAADPDAVWLMQGWLFFSS 350

Query: 127 DPFWRPPQMKALLNSVPLG-KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             FW   +++A L  VP    +++LDL++E +P W+ +  +YG  ++WC LH + GNI M
Sbjct: 351 AAFWTNERIEAYLGGVPGNDSMIILDLYSEAQPQWNRTSSYYGKQWVWCELHGYGGNIGM 410

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
            G LD++   P+ A  +  ++M GVG++MEG E N +VYD++ + A+    +++ ++++Q
Sbjct: 411 EGDLDALTQNPIAALHAPGSSMKGVGLTMEGQEGNELVYDILLDQAWSSAPLNLSSYVDQ 470

Query: 246 YSVRRYG-RSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
           +  RRY  R +P +  DAW  L  TVY+  D  T   +  I +  ++ P++  +T     
Sbjct: 471 WVARRYNVRRLPKSALDAWRTLATTVYSNKDSGT---QAAIKSIYELAPALTGMT----- 522

Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE---LSASNTYRYDLID 360
                        + T  +     + + S V+ A +  + + +E   L+    + YD++D
Sbjct: 523 -------------NRTGHHPTAIPYDTNSTVLVAAKALLEARSENPLLATIPEFAYDVVD 569

Query: 361 LTRQALAKYANELFLNIIEAYQLNDA--HGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
           +TRQ L+    + +  ++  Y  N      V   +   L L++D+D LLA ++ FLL  W
Sbjct: 570 VTRQLLSNRFIDHYNVLVATYNSNATAPRNVAAAAGPLLALLDDLDELLATNEHFLLSNW 629

Query: 419 LESAKQLAQNEEQ---EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
           +  AK+     ++    +  E+NAR QIT+W  + +     + DY +K W+GL+R YY P
Sbjct: 630 IADAKRWTHGADRAAYARLLEYNARNQITLWGPDGE-----INDYASKAWAGLVRTYYKP 684

Query: 476 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNG 513
           R   + +Y+ ++ E+G  +       + I +   W NG
Sbjct: 685 RWEAFVEYLAQTKEAGAAYDAHVVSAKMIAIGQQWSNG 722


>gi|317158657|ref|XP_001827155.2| alpha-N-acetylglucosaminidase [Aspergillus oryzae RIB40]
          Length = 849

 Score =  252 bits (643), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 154/494 (31%), Positives = 256/494 (51%), Gaps = 47/494 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF G VP A++ V P A +     W   +   ++     L+  D  F ++ ++ I +
Sbjct: 196 VLPAFPGFVPPAIKRVRPHATVVNGSQWSGFQK--KFTEVSFLNPLDETFAQLQKSVISR 253

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + +G  +H+Y  D F+E  P      Y+ +L    +  +++ +  AVW+MQGWLF YD
Sbjct: 254 QTRAFGNVTHVYALDQFNEINPASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YD 312

Query: 128 P--FWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
              FW P ++ A L+ V     +++LDL++E KP W  ++ ++G P+IWC LH+F GN+ 
Sbjct: 313 KKDFWDPNRISAYLSGVERNDDMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMG 372

Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
           MYG + +I   P+EA  +++ ++VG G++MEG E N +VYDL+ + A+  + +D +A+  
Sbjct: 373 MYGQIMNITSDPIEA-LNKSDSLVGFGLTMEGQEGNEIVYDLLLDQAWSAKPIDTRAYFQ 431

Query: 245 QYSVRRYGR--SVP-AIQDAWNVLYHTVYNCTDGAT-DKNRDVIVAFPDVDPSIISVTEG 300
            +   RY    SVP  +  AW++L  TVYN T+  T    + +    PD+   +  V  G
Sbjct: 432 SWVRSRYSGNFSVPNELYTAWDLLRKTVYNNTNLTTYSLTKSIFEISPDIAGLVGRV--G 489

Query: 301 KYQ-----NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 355
            Y      NY   V  E       ++   P LW+S +                     Y 
Sbjct: 490 HYPTPTSINYDPMVLNEVWSLFMNATRKEPSLWHSPA---------------------YE 528

Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDG 412
           YD++D+TRQ +      ++ ++I +++    N    V   S R L L+  +D +L+C++ 
Sbjct: 529 YDMVDITRQLMGNAFVNVYSDLISSWKSETENRTTNVTSQSERLLNLLSAIDKVLSCNEN 588

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           F L  W+ SA+      E +  +E+NAR QIT+W    +     + DY +K W+GL+  Y
Sbjct: 589 FSLTTWISSARDWGNTTETKDFFEYNARNQITLWGPTGE-----ISDYASKAWAGLISSY 643

Query: 473 YGPRAAIYFKYMIE 486
           Y PR +I+  Y+ E
Sbjct: 644 YKPRWSIFVDYLGE 657


>gi|391873368|gb|EIT82411.1| alpha-N-acetylglucosaminidase [Aspergillus oryzae 3.042]
          Length = 633

 Score =  252 bits (643), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 153/497 (30%), Positives = 258/497 (51%), Gaps = 51/497 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF G VP A++ V P A +     W   +   ++     L   D  F ++ ++ I +
Sbjct: 112 VLPAFPGFVPPAIKRVRPHATVVNGSQWSGFQK--KFTEVSFLSPLDRTFADLQKSVISR 169

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q++ +G  +H+Y  D F+E  P      Y+ +L    +  +++ +  AVW+MQGWLF YD
Sbjct: 170 QMRAFGNITHVYALDQFNEINPASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YD 228

Query: 128 P--FWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
              FW   ++ A L+ V     +++LDL++E KP W  ++ ++G P+IWC LH+F GN+ 
Sbjct: 229 KKDFWDSNRISAYLSGVERNDDMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMG 288

Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
           MYG + +I   P+EA    N ++VG G++MEG E N +VYDL+ + A+    +D +A+  
Sbjct: 289 MYGQIMNITSDPIEALNKSN-SLVGFGLTMEGQEGNEIVYDLLLDQAWSATPIDTRAYFQ 347

Query: 245 QYSVRRYGR--SVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
            +   RY R  SVP  +  AW++L  TVYN T+  T     V  +  ++ P I  +  G+
Sbjct: 348 SWVRSRYSRNFSVPNELYTAWDLLRKTVYNNTNLTT---YSVTKSIFEISPDIAGLV-GR 403

Query: 302 YQNYGKPVS--KEAVLKSET------SSYDHPHLWYSTSEVIRALELFIASGNELSASNT 353
             +Y  P S   + ++ +E       ++   P LW++ +                     
Sbjct: 404 VGHYPTPTSINYDPMVLNEVWSLFMNATRKEPSLWHNPA--------------------- 442

Query: 354 YRYDLIDLTRQALAKYANELFLNIIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACH 410
           Y YD++D+TRQ +      ++  +I +++    N    V   S R L L+  +D +L+C+
Sbjct: 443 YEYDMVDITRQLMGNAFVNVYSVLITSWKSETENRTTKVTSQSERLLNLLSAIDKVLSCN 502

Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
           + F L  W+ SA+      E +  +E+NAR QIT+W    +     + DY +K W+GL+ 
Sbjct: 503 ENFSLATWISSARDWGNTTETKDFFEYNARNQITLWGPTGE-----ISDYASKAWAGLIS 557

Query: 471 DYYGPRAAIYFKYMIES 487
            YY PR +I+  Y+ E+
Sbjct: 558 SYYKPRWSIFVDYLGEN 574


>gi|319900259|ref|YP_004159987.1| alpha-N-acetylglucosaminidase [Bacteroides helcogenes P 36-108]
 gi|319415290|gb|ADV42401.1| Alpha-N-acetylglucosaminidase [Bacteroides helcogenes P 36-108]
          Length = 718

 Score =  251 bits (642), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 169/544 (31%), Positives = 266/544 (48%), Gaps = 69/544 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAF 64
           VLP +SG +PA  +         +LG    V    +WC       L  +D  F  I R +
Sbjct: 224 VLPGYSGMIPANAKE--------KLG--LDVADPGKWCGYRRPAFLQPSDKNFRRIARLY 273

Query: 65  IEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            ++  + YG+ ++ Y+ D F E  NT  VD    + + G +I   M+  +  AVW+ Q W
Sbjct: 274 YKEMTRLYGKANY-YSMDPFHEGGNTKGVD----LDAAGKSIRDAMKEANPQAVWVAQAW 328

Query: 123 -LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST------SKQFYGV-PYIWC 174
               YD          ++ ++P G ++VLDL++E +P W         KQ +G   +I+C
Sbjct: 329 GACPYD---------NMIKNLPEGDMIVLDLYSESRPQWGDPASAWYRKQGFGRHGWIYC 379

Query: 175 MLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQ 233
           ML NF GN+ +YG ++ +     +AR S    T+ GVG++MEG E NPV+Y+L+ E+ + 
Sbjct: 380 MLLNFGGNVGLYGKMEHVIDEFYKARESAFGGTLQGVGLTMEGSENNPVMYELLCELPWH 439

Query: 234 HEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVD 291
             ++    W+  Y   RYG++ P   +AW  L +T+YN  + +T +  +  V  A P ++
Sbjct: 440 GRRISKDQWLKSYLKARYGKTTPQTVEAWLKLSNTIYNSPNASTQQGTHESVFCARPSLE 499

Query: 292 PSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS 351
                     YQ                SS+     +Y+ +++IRA    I +  E   +
Sbjct: 500 A---------YQ---------------VSSWSEMKDYYAPADIIRAAGKMIEAAEEFRGN 535

Query: 352 NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHD 411
           N + YDLID+ RQA+A+    ++  ++ AY+  D       S RFLEL+E  D LL    
Sbjct: 536 NNFEYDLIDVVRQAVAEKGRLVYPIVVSAYKAADKQLFEAASARFLELIELQDKLLGTRR 595

Query: 412 GFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRD 471
            F LG W   A+ + + + Q+  YEWNAR QIT W + T      L DY +K W+GLLRD
Sbjct: 596 EFRLGTWTNYARNMGETDAQKDLYEWNARVQITTWGNRTAANEGGLHDYAHKEWNGLLRD 655

Query: 472 YYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQW 531
           +Y  R   YF  +  +L +G+  +  D    +  L  +W    N Y  E  GDA   ++ 
Sbjct: 656 FYYMRWKAYFDELRSTL-NGNAPKETD----FYTLEENWAGQHNPYSAEPEGDATDIAKE 710

Query: 532 LYNK 535
           +Y K
Sbjct: 711 VYGK 714


>gi|336417192|ref|ZP_08597519.1| hypothetical protein HMPREF1017_04627 [Bacteroides ovatus
           3_8_47FAA]
 gi|423297818|ref|ZP_17275878.1| hypothetical protein HMPREF1070_04543 [Bacteroides ovatus
           CL03T12C18]
 gi|335936512|gb|EGM98438.1| hypothetical protein HMPREF1017_04627 [Bacteroides ovatus
           3_8_47FAA]
 gi|392664455|gb|EIY57993.1| hypothetical protein HMPREF1070_04543 [Bacteroides ovatus
           CL03T12C18]
          Length = 727

 Score =  250 bits (638), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 156/532 (29%), Positives = 259/532 (48%), Gaps = 40/532 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPAAL+ V+P+ K +++  W       R  CT+L +  D L+  I + ++ +
Sbjct: 231 VLPAFAGHVPAALKRVYPNIKTSRVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTE 287

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + YG T+HIY  D F+E  PP    + +  +   IY  + + D  A+WL   WLF  D
Sbjct: 288 QTRLYG-TNHIYGIDPFNEIDPPSWDTDSLGMMAKHIYESVAAVDPKAIWLQMTWLFYAD 346

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W  P++K+ L SVP  KL++LD F E   IW  +  ++G PY+WC L NF GN  + 
Sbjct: 347 IKHWTTPRIKSYLRSVPQDKLILLDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLS 406

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +  ++    +A  +  + + GVG ++EGI+ N  +Y+ + + A+   + D K W  + 
Sbjct: 407 GPVKLVSERLADALKNGGSNLKGVGSTLEGIDLNQFMYEFVLDKAWNSGQTD-KEWFLKL 465

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + RR G+  P  + AW +L   VY                   + P+   V +G   N  
Sbjct: 466 ADRRTGKVSPEARKAWEILADKVY-------------------IQPA--QVGQGTLTN-A 503

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
           +P      LK        P + Y   +++ A  L +   +     ++Y +DL+++ RQ L
Sbjct: 504 RP-----CLKGNGHWTTKPTIEYQPKDLVEAWRLLLLVKD--CQRDSYEFDLVNIGRQVL 556

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
             Y N +      AY+  D   +     +  E++ D+D L++CH  F L  W+  A+ + 
Sbjct: 557 GNYFNVVRDEFTLAYEAGDIMMMKNRGDKMREILADLDKLVSCHPTFSLNKWITDARDMG 616

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            +   +  YE NAR+ IT+W D+       L DY N+ W+GL   YY  R   +   +I+
Sbjct: 617 HDATSKNYYEMNARSLITIWGDSYH-----LTDYANRSWAGLTNQYYSVRWDRFINEVIK 671

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
           ++E    F  + +  E     N+W N  N       GD +  ++ +Y KY +
Sbjct: 672 AVEKKKAFDEEVFFNESRMYENEWVNPSNRINYNEGGDGIKLARQIYKKYAK 723


>gi|212537509|ref|XP_002148910.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
           18224]
 gi|210068652|gb|EEA22743.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
           18224]
          Length = 768

 Score =  250 bits (638), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 152/516 (29%), Positives = 264/516 (51%), Gaps = 43/516 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF G VP A+  V P A +     W +  +   +     ++ TDP F EI ++FI +
Sbjct: 248 ILPAFPGFVPRAITRVLPDADVINGSAWEAFPA--MFTSDTFMEPTDPHFTEIQKSFISK 305

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
           Q   YG  +  Y  D F+EN P      Y+ S+    +  +++ D  AVW+MQGWLF S 
Sbjct: 306 QTAAYGNVTTFYTLDQFNENNPSSGDLNYLRSVSHGTWQALKAADPSAVWVMQGWLFFSN 365

Query: 127 DPFWRPPQMKALLNSVPL-GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             FW   +++A L  V +   L+VLDL +E +P W  +  ++G P+IWC +H++ GN+  
Sbjct: 366 SAFWTNDRVEAYLGGVTVDSDLLVLDLASESQPQWQRTNSYFGKPWIWCQIHDYGGNMGF 425

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG + +I   P+ A  +   ++VG G+SMEG E N VVYDL+ + A+  + +D   + + 
Sbjct: 426 YGQVMNITVNPIAALNNATASLVGFGLSMEGQEGNEVVYDLLLDQAWSAKPIDTATYFHD 485

Query: 246 YSVRRYG--RSVPA-IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
           +   RY   +S+P  +  AW++L  +VYN T+ A++       A P     +I  T G  
Sbjct: 486 WVTARYAGSKSIPTDVYSAWDMLRTSVYNNTNLASN-------AVPKAIFELIPSTTGLV 538

Query: 303 QNYGKPVSKEAVLKSETSSYDHP-HLWYSTSEVIRALELFIASGNE---LSASNTYRYDL 358
              G                 HP  L Y+ +++++A  LF ++  +   L  +  Y +DL
Sbjct: 539 NRTGH----------------HPTTLNYNPADMVKAWSLFYSAAFKEPSLWLNPAYEFDL 582

Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVF--QLSRRFLELVEDMDGLLACHDGFLLG 416
           +D++RQ LA     ++ ++I A+   +   +    +    + +++ +D +L  ++ F L 
Sbjct: 583 VDMSRQVLANAFIPVYHDLIAAWNTTNPSTIRIQIIGAELIGILQAIDTILDTNEHFKLS 642

Query: 417 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
            W+ +A+  A  +  E   E+NA  QIT+W    Q     + DY +K W+GL+  YY PR
Sbjct: 643 TWISAARTSAGEQSLEDFLEYNALNQITLWGPTGQ-----ISDYASKSWAGLVSSYYIPR 697

Query: 477 AAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 512
             ++ +Y++++  +   +    ++ E +K    WQN
Sbjct: 698 WKMFIEYLVDTKPA--QYNQTAFKAELLKWELQWQN 731


>gi|410100551|ref|ZP_11295511.1| hypothetical protein HMPREF1076_04689 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409215586|gb|EKN08585.1| hypothetical protein HMPREF1076_04689 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 739

 Score =  250 bits (638), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 174/555 (31%), Positives = 266/555 (47%), Gaps = 79/555 (14%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWC----CTYLLDATDPLFIEIGRA 63
           +L +F+G VP  L+  +P A+I         K   RWC     T  LD  DPLF E+G+A
Sbjct: 231 ILQSFTGFVPIKLKEKYPDARI---------KDKNRWCNAFTATVQLDPLDPLFKEMGQA 281

Query: 64  FIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL 123
           F+E+Q K YG T+HIY  D F E   P +   Y+ ++G  I+      D +AV  MQ W 
Sbjct: 282 FLEEQQKLYG-TNHIYAADPFHEGAAPSNEKSYLEAVGKVIWEVASGFDPEAVIAMQTWS 340

Query: 124 FSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSK--QFYGVPYIWCMLHNFAG 181
                       +A+  + P  +L++LDL       W+  K   F+  PY+  +LHN+ G
Sbjct: 341 LR----------EAITRTFPQDRLLLLDLGG-----WNVEKFNSFWNYPYVAGVLHNYGG 385

Query: 182 NIEMYGILDSIAFGPVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVK 240
            + M G L   A    E + S +   + G+G+  E IE NPVVY+L +E+ +  +  D++
Sbjct: 386 RVYMGGNLALYAKNAHELKQSPKGGNIQGIGLFPEAIEHNPVVYELSTEITWMQDAPDLQ 445

Query: 241 AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
            WI  Y+  RYG+     +  W VL  TVY    G       V+ A P +    ++    
Sbjct: 446 KWITDYARARYGKLPAGAEQGWKVLLETVYGSKAGRLPSTESVMCARPALTIQKVAAN-- 503

Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
              +  +P                    YST  +  A++ F+ + N+L  S+TYRYDL+D
Sbjct: 504 --GDLSRP--------------------YSTVRLWDAVDHFLQASNDLKKSDTYRYDLVD 541

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           + RQ L+  +  L   I EAY   D   + Q   +FL L++D D LL     FLLG W++
Sbjct: 542 VMRQCLSDLSLPLQKQITEAYLAEDNEKLQQAGEQFLALIDDFDRLLGTRSTFLLGKWIK 601

Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
            A+Q    EE++  YEWNART +T+W  N    ++ L +Y N+ W+GL++ YY PR   +
Sbjct: 602 EARQWGTTEEEKALYEWNARTLVTVWGPN--HPSAHLFEYSNRQWAGLMKGYYKPRWEKF 659

Query: 481 F---------------KYMIESLESGDGFRLKDWRREWIKLTN---DWQNGRNVYPVESN 522
                           +Y+ +SL         D+   + +LTN   DW   ++VY     
Sbjct: 660 ISYLKAQPKGEWRYDEQYIRKSLAGRPALDASDF---YTRLTNWEYDWAFNKDVYTDTPQ 716

Query: 523 GDALITSQWLYNKYL 537
           G+ +   + LY K+L
Sbjct: 717 GNEIEIVKELYAKWL 731


>gi|393236266|gb|EJD43816.1| putative alpha-N-acetylglucosaminidase [Auricularia delicata
           TFB-10046 SS5]
          Length = 778

 Score =  249 bits (637), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 155/531 (29%), Positives = 265/531 (49%), Gaps = 56/531 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
            LP+F+G VP A+  V P A +     W S   D     T+L +  DP F  + ++FIE+
Sbjct: 249 ALPSFTGFVPRAISRVLPGASVVNGSRW-SGFPDALTRVTFL-EPFDPAFARLQKSFIEK 306

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
           Q+  YG  SH+Y  D ++EN P  +   Y+  +  + +  +++ D DA+WLMQGWLF S 
Sbjct: 307 QIAAYGPVSHVYTLDQYNENDPLKNDVGYLRDVSRSTWQSLKAADPDAIWLMQGWLFYSN 366

Query: 127 DPFWRPPQMKALLNSVPLG-KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             FW   +++A L  V     +++LDLF+E +P W  +  +YG P+IWC LH++ GN+ +
Sbjct: 367 RGFWTNARVEAFLGGVEKNDDMLILDLFSESEPQWQRTNSYYGKPWIWCQLHDYGGNLGL 426

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG + +I    VEA   ++ ++VG G++MEG E N ++YDL+   A+  + +D  ++   
Sbjct: 427 YGQVMNITLNAVEA-LEKSPSLVGFGLTMEGQEGNEIMYDLLLSQAWSRKPIDTASYFRS 485

Query: 246 YSVRRY------GRSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVT 298
           ++ RRY      G  +P AI +AW++L  TVYN T  A++    V  +  ++ P++  + 
Sbjct: 486 WATRRYNAGGIIGSLLPSAIYNAWDILRTTVYNNTKLASNA---VTKSVFELRPALSGI- 541

Query: 299 EGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA----SNTY 354
                             +  + +    + Y T  +++A +LF  +     A    +  Y
Sbjct: 542 ------------------ANRTGHHATTITYDTQALVKAYDLFDKAAIYTPALWFNNPAY 583

Query: 355 RYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG------------VFQLSRRFLELVED 402
            +D +D  RQ L+   +  + +++  Y      G            +     R + ++  
Sbjct: 584 EFDNVDFARQVLSNAFSTQYDDLVATYNEISKPGGSGATLAEAAKIIHDKGERMMGVLAS 643

Query: 403 MDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGN 462
           +D +L     F L  WL+ A+  A+   +E  +E+NAR QIT+W    Q     + DYG+
Sbjct: 644 LDKVLRTSKHFTLKKWLQDARAWARGGHEE-LFEYNARNQITLWGPTGQ-----INDYGS 697

Query: 463 KYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNG 513
           K W GL+ +YY  R  I+F Y+   + +G  F L     +++    DWQ  
Sbjct: 698 KAWGGLVSEYYAQRWRIFFTYLESVVAAGQPFNLTAVGNQFLAFQLDWQTA 748


>gi|404406438|ref|ZP_10998022.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
          Length = 726

 Score =  249 bits (637), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 165/540 (30%), Positives = 262/540 (48%), Gaps = 51/540 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V  +F+G+VP AL+ +FP A I +L  W S +   R   +Y L+  +PLF  I +A++++
Sbjct: 227 VFTSFTGHVPKALKTLFPDADIERLNPWTSFE---RPYNSYYLNPAEPLFNRIQQAYMQE 283

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + +G +S +Y  D F+E  PP   PEY++      Y  +   D DAVWL   W+F + 
Sbjct: 284 QRRLFGESS-VYGVDPFNELDPPNWDPEYLARAARLTYESITQFDKDAVWLQMAWVFYHK 342

Query: 128 PF-WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W P ++KA L +VP GKL++LD + +   +W +++ FYG P+IW  L NF GN  + 
Sbjct: 343 RRDWTPERLKAYLCAVPDGKLLMLDYYCDKVELWRSTESFYGQPFIWSYLGNFGGNTMLA 402

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +  ++     A        VG+G ++EG++ NP +Y+ + + A+  +  D   WI++ 
Sbjct: 403 GDVKDVSRKLDRAYAEAGRNFVGIGCTLEGLDVNPFMYEYVLDRAWT-QLYDDAGWIDRL 461

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R  GR     + AW +LY  +Y    G    NR   V                     
Sbjct: 462 ADRHSGRIDVHYRQAWRILYDKIYCAPSG----NRSAAVC-------------------A 498

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
           +P  K        S +  PHL Y   +++R  E    +  E +AS+  R+D +++ RQ L
Sbjct: 499 RPNMK------GRSKWSGPHLDYDNRDLLRVWEQLTLARPERTASS--RFDCVNIPRQCL 550

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
             Y   L    I A +  D   V +LS R LEL++D+D L+A    FLLG W+  A+++ 
Sbjct: 551 ENYFGNLNERCIAACRGGDRETVARLSARLLELLDDIDRLVAADAYFLLGKWIADARRMG 610

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
               ++  +E +AR  +T W          L DY N+ WSGL+ DYY  R   ++  +  
Sbjct: 611 ATPAEKDYFERDARNILTTW----GGRGYSLNDYANRTWSGLVSDYYKERWRRFYDRLQS 666

Query: 487 SLESGDGF---RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 543
             E  +      L+D+  EW+      + GR  +     GDA    + LY KY    G F
Sbjct: 667 DGEPDEDALLQELQDFEWEWVG-----RKGR--FAERPRGDAFRLCRSLYTKYAAEIGRF 719


>gi|423346424|ref|ZP_17324112.1| hypothetical protein HMPREF1060_01784 [Parabacteroides merdae
           CL03T12C32]
 gi|409220242|gb|EKN13198.1| hypothetical protein HMPREF1060_01784 [Parabacteroides merdae
           CL03T12C32]
          Length = 718

 Score =  249 bits (636), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 166/541 (30%), Positives = 263/541 (48%), Gaps = 67/541 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V P +SG VP   +       ++  G W   +          L  TDP F EI   + ++
Sbjct: 224 VFPGYSGMVPHNAKEKL-GLNVSDPGLWNGYRRPA------FLQPTDPRFEEIASLYYKE 276

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             K YG+  + Y+ D F E    V     + + G AI   M+  +  AVW+ Q W  +  
Sbjct: 277 MNKLYGKADY-YSMDPFHEGGSVVGVD--LDAAGKAIMQAMKKNNPKAVWVAQAWQANPR 333

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFA 180
           P         ++ ++  G L+VLDLFAE +P W         K  +G   +I+CML N+ 
Sbjct: 334 P--------QMIGNLEAGDLIVLDLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLNYG 385

Query: 181 GNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDV 239
           GN+ ++G +  +     +A+ S    T+ GVGM+MEG E NPV+++L++E+ ++ ++ D 
Sbjct: 386 GNVGLHGKMKHVIDEFYKAKESPFGKTLKGVGMTMEGSENNPVMFELLTELPWRPQRFDK 445

Query: 240 KAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISV 297
             W+ +Y+V RYG+S P +QDAW +L +++YNC D  T +  +  V  A P         
Sbjct: 446 DQWLREYTVARYGKSNPTVQDAWILLSNSIYNCPDANTQQGTHESVFCARP--------- 496

Query: 298 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 357
           TE  YQ                SS+     +Y  + VIRA  + ++  +E   +N + YD
Sbjct: 497 TEHPYQ---------------VSSWSEMKDYYDPNNVIRAAAMMVSVADEFKGNNNFEYD 541

Query: 358 LIDLTRQALAKYANELFLNIIEA-YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
           L+D+ RQA+A+    L   ++EA +   D       S RFL L+   D LLA    F +G
Sbjct: 542 LVDIVRQAIAE-KGRLTEKVVEAAFAAGDKKLYKDASDRFLRLILLQDELLATRPEFKVG 600

Query: 417 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
            W+  A+ L    E+++ YEWNAR QIT W +    +   LRDY ++ W+G+L+D+Y  R
Sbjct: 601 TWIARARSLGSTPEEKELYEWNARVQITTWGNRLAADEGGLRDYAHREWNGILKDFYYMR 660

Query: 477 AAIYFKYMIESLESGDGFRLKDWRR----EWIKLTNDWQNGRNVYPVESNGDALITSQWL 532
              +F Y           RL D R+    ++  +   W    NVY  E  GD + T + +
Sbjct: 661 WKTWFDYQT---------RLLDGRKTAAIDFYAIEERWTKATNVYSSEPEGDCISTVKRI 711

Query: 533 Y 533
           +
Sbjct: 712 F 712


>gi|404487028|ref|ZP_11022215.1| hypothetical protein HMPREF9448_02671 [Barnesiella intestinihominis
           YIT 11860]
 gi|404335524|gb|EJZ61993.1| hypothetical protein HMPREF9448_02671 [Barnesiella intestinihominis
           YIT 11860]
          Length = 726

 Score =  249 bits (635), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 161/530 (30%), Positives = 251/530 (47%), Gaps = 45/530 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPAAL+ + P A I  LG W       R    + L+  +PLF EI ++F+E+
Sbjct: 233 VLPAFAGHVPAALKRIHPDANIQYLGKWAGFGDSYR---CHFLNPEEPLFAEIQKSFLEE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G T HIY  D F+E  PP   PEY++ + + +Y  + + D DAVWL   W+F +D
Sbjct: 290 QEKMFG-TDHIYGVDPFNEVDPPSWEPEYLAQVSSDMYKSLAAADPDAVWLQMTWMFYHD 348

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W  P++KALL  VP  KLV+LD   E   +W ++++F+G PYIWC L NF GN  + 
Sbjct: 349 RKLWTAPRVKALLTGVPSDKLVLLDYHCENVELWKSTEKFHGQPYIWCYLGNFGGNTTLT 408

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +         A  +    + G+G ++EG++ N   Y+ + E A+  + V+ + W+ + 
Sbjct: 409 GNVKESGDRLDNALINGGDNLKGIGSTLEGLDINQFPYEYIFEKAWTID-VNGQDWVERL 467

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R  G    + ++AW +L+               DV V  P     +         NY 
Sbjct: 468 ADRHVGAVSESAREAWQILFD--------------DVFVQVPRTLGILPGYRPKLGDNYN 513

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
           K  S E                Y  + ++R  EL +   +     + +  D+I   RQ L
Sbjct: 514 KRTSNE----------------YDNATLLRVWELLLEVPS--CDRDAFEIDVIMTGRQLL 555

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
             Y  ++       Y+  +  G+ + +    E++ D++ L + H+   L  W+E A+ L 
Sbjct: 556 GNYFLDVKKEFDGFYKKRNVPGLKEKASEMREILSDLELLNSFHNRASLDKWIEDARSLG 615

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
             +E +  YE NAR  IT W          L DY ++ W+GLL DYY  R  IYF  +I 
Sbjct: 616 DTDELKNYYEKNARNLITTW-------GGSLNDYASRTWAGLLNDYYARRWEIYFDAVIG 668

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           + E G      + +        +W        +E NG  L TS+ L  KY
Sbjct: 669 AAEKGIELDKDELKSRLATFEQEWVESTTPVCIERNGTLLDTSRRLLEKY 718


>gi|400599317|gb|EJP67021.1| alpha-N-acetylglucosaminidase, putative [Beauveria bassiana ARSEF
           2860]
          Length = 753

 Score =  248 bits (634), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 151/511 (29%), Positives = 260/511 (50%), Gaps = 48/511 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF G VP  +  V+P+  + +   W       R+     +   DP F E+ +AF+ +
Sbjct: 239 ILPAFPGFVPENITRVWPNVSLAESPTWSGFSG--RFTADKYITPYDPRFAELQKAFLTK 296

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + YG  +  +  D F+EN P      Y+ ++    +  ++  D  AVW+MQGWLF+ D
Sbjct: 297 QNEAYGNVTSFWTLDQFNENKPASGELGYLRNVSHNTWQTLKDADPSAVWVMQGWLFASD 356

Query: 128 -PFWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             +W   ++K+ L+ VP+ + +++LDLFAE  P W  +  FYG P+IWC LH + GN+ +
Sbjct: 357 KAYWTDDRVKSFLDGVPVNEDMLLLDLFAESTPQWQRTDSFYGKPWIWCQLHGYGGNMGL 416

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG ++++    VEA   ++ ++VG+G+SMEG E N ++Y+L+ + A+  E ++   + + 
Sbjct: 417 YGQIENVTRNAVEA-VQKSPSIVGLGLSMEGQEGNEIMYNLLLDQAWSKEALETDKYFSD 475

Query: 246 YSVRRYG---RSVPA-IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           +   RYG   + +P  +  AW+ +  TVYN TD +       + A       ++  T G 
Sbjct: 476 WVTVRYGADQKEIPKDLYTAWDKVRSTVYNNTDSS-------VTAVAKSIFELVPSTSGL 528

Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE---LSASNTYRYDL 358
               G   +K               + Y T  +I A      +G++   L  +  Y YDL
Sbjct: 529 VNRTGHHATK---------------ITYDTETLISAWNDMFNAGSQARWLFDNEAYSYDL 573

Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
            D TRQ LA      +  ++E Y+ N+  GV     R   ++  MD +L  +  F L  W
Sbjct: 574 TDWTRQVLANAFEATYNKLVEKYKSNNIKGVKCAGSRLQAILRTMDQVLETNVHFRLSTW 633

Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
           +++A++     +    +E+NAR Q+T+W  N +     + DY +K W+GL+ DYY  R  
Sbjct: 634 IQAARK--SGGDAADFFEYNARNQVTLWGPNGE-----IEDYASKQWAGLIGDYYAHRWQ 686

Query: 479 IYFKYMIESLESGD----GFR--LKDWRREW 503
           ++  Y++ + + GD     F+  L +W +EW
Sbjct: 687 MFVDYLVAT-DPGDYDQMAFQKTLIEWEKEW 716


>gi|238506383|ref|XP_002384393.1| alpha-N-acetylglucosaminidase, putative [Aspergillus flavus
           NRRL3357]
 gi|220689106|gb|EED45457.1| alpha-N-acetylglucosaminidase, putative [Aspergillus flavus
           NRRL3357]
          Length = 669

 Score =  248 bits (634), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 151/496 (30%), Positives = 255/496 (51%), Gaps = 51/496 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF G VP A++ V P A +     W   +   ++     L   D  F ++ ++ I +
Sbjct: 148 VLPAFPGFVPPAIKRVRPHATVVNGSQWSGFQK--KFTEVSFLSPLDRTFADLQKSVISR 205

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q++ +G  +H+Y  D F+E  P      Y+ +L    +  +++ +  AVW+MQGWLF YD
Sbjct: 206 QMRAFGNITHVYALDQFNEINPASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YD 264

Query: 128 P--FWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
              FW   ++ A L+ V     +++LDL++E KP W  ++ ++G P+IWC LH+F GN+ 
Sbjct: 265 KKDFWDSNRISAYLSGVERNDDMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMG 324

Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
           MYG + +I   P+EA  +++ ++VG G++MEG E N +VYDL+ + A+    +D +A+  
Sbjct: 325 MYGQIMNITSDPIEA-LNKSDSLVGFGLTMEGQEGNEIVYDLLLDQAWSATPIDTRAYFQ 383

Query: 245 QYSVRRYG--RSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
            +   RY    SVP  +  AW++L  TVYN T+  T     V  +  ++ P I  +  G+
Sbjct: 384 SWVRSRYSGNLSVPNELYTAWDLLRKTVYNNTNLTT---YSVTKSIFEISPDIAGLV-GR 439

Query: 302 YQNYGKPVS--------KEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 353
             +Y  P S         E +     ++   P LW++ +                     
Sbjct: 440 VGHYPTPTSINYDPMVLNEVLSLFMNATRKEPSLWHNPA--------------------- 478

Query: 354 YRYDLIDLTRQALAKYANELFLNIIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACH 410
           Y YD++D+TRQ +      ++  +I +++    N    V   S R L L+  +D +L+C+
Sbjct: 479 YEYDMVDITRQLMGNAFVNVYSVLITSWKSETENRTTKVTSHSERLLNLLSAIDKVLSCN 538

Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
           + F L  W+ SA+      E +  +E+NAR QIT+W    +     + DY +K W+GL+ 
Sbjct: 539 ENFSLATWISSARDWGNTTETKDFFEYNARNQITLWGPTGE-----ISDYASKAWAGLIS 593

Query: 471 DYYGPRAAIYFKYMIE 486
            YY PR +I+  Y+ E
Sbjct: 594 SYYKPRWSIFVDYLGE 609


>gi|393782608|ref|ZP_10370791.1| hypothetical protein HMPREF1071_01659 [Bacteroides salyersiae
           CL02T12C01]
 gi|392672835|gb|EIY66301.1| hypothetical protein HMPREF1071_01659 [Bacteroides salyersiae
           CL02T12C01]
          Length = 761

 Score =  248 bits (634), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 156/485 (32%), Positives = 235/485 (48%), Gaps = 51/485 (10%)

Query: 11  AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK 70
            FSG VP  L++ +P+A I Q  +W   K   +      LD TD LF  +GR F+E+Q +
Sbjct: 239 GFSGYVPRELKDKYPTANINQQRSWCGFKGAAQ------LDPTDSLFTRMGRVFLEEQAR 292

Query: 71  EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW 130
            +G    +Y  D F E+ PPVD+PEY+ ++G  I+   +  D  + W MQ W        
Sbjct: 293 LFG-AHGVYAADPFHESVPPVDTPEYLKAVGETIHRLFREFDPQSTWAMQSWSLR----- 346

Query: 131 RPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSK-QFYGVPYIWCMLHNFAGNIEMYGIL 189
                +A++ +VP   L++LDL        STSK +F+G P +   LHNF G I M+G L
Sbjct: 347 -----EAIVKAVPKEALLILDLRGS-----STSKAEFWGYPTVVGNLHNFGGRINMHGDL 396

Query: 190 DSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVR 249
             +A          N  + G G+ ME IEQNPV Y+L  EM    + +D++AW+ QY+ R
Sbjct: 397 ALLASNQYSKAKRLNPAVCGSGLFMEAIEQNPVYYELAFEMPCHPDSIDLRAWLKQYATR 456

Query: 250 RYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV 309
           RYG   PA Q AW +L    Y      T+K+  ++ A P +D                 V
Sbjct: 457 RYGAFSPATQKAWMLLLEGPYRQGTNGTEKS-SIVAARPALD-----------------V 498

Query: 310 SKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 369
            K          YD P L      +IRA  L +   ++LSAS  YR+DL+D+ RQ +   
Sbjct: 499 KKSGPNAGLEIPYD-PAL------IIRAQSLLLEDADKLSASRPYRFDLVDVQRQMMTNL 551

Query: 370 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 429
              +     EA++  D       S RFL ++ DMD LL     +    WL  A+   + E
Sbjct: 552 GQLIHRKAAEAFRSKDREAFTLHSGRFLGMLADMDTLLRTRSEYSFDRWLTEARSWGETE 611

Query: 430 EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 489
           E++ Q E +A + +T+W  +       + DY  + W+GL+  YY PR   ++  + + L+
Sbjct: 612 EEKNQMERDATSLVTIWGADGDPR---IFDYSWREWAGLINGYYLPRWQKFYTMLQQHLD 668

Query: 490 SGDGF 494
            G  +
Sbjct: 669 EGTSY 673


>gi|154489986|ref|ZP_02030247.1| hypothetical protein PARMER_00215 [Parabacteroides merdae ATCC
           43184]
 gi|423722990|ref|ZP_17697143.1| hypothetical protein HMPREF1078_01203 [Parabacteroides merdae
           CL09T00C40]
 gi|154089428|gb|EDN88472.1| Alpha-N-acetylglucosaminidase (NAGLU) [Parabacteroides merdae ATCC
           43184]
 gi|409241820|gb|EKN34587.1| hypothetical protein HMPREF1078_01203 [Parabacteroides merdae
           CL09T00C40]
          Length = 718

 Score =  248 bits (632), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 168/543 (30%), Positives = 265/543 (48%), Gaps = 71/543 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V P +SG VP   +       ++  G W   +          L  TDP F EI   + ++
Sbjct: 224 VFPGYSGMVPHNAKEKL-GLNVSDPGLWNGYRRPA------FLQPTDPRFEEIASLYYKE 276

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
             K YG+  + Y+ D F E  +   VD    + + G AI   M+  +  AVW+ Q W  +
Sbjct: 277 MNKLYGKADY-YSMDPFHEGGSVAGVD----LDAAGKAIMQAMKKNNPKAVWVAQAWQAN 331

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST------SKQFYGV-PYIWCMLHN 178
             P         ++ ++  G L+VLDLFAE +P W         K  +G   +I+CML N
Sbjct: 332 PRP--------QMIGNLEAGDLIVLDLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLN 383

Query: 179 FAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 237
           + GN+ ++G L  +     +A+ S    T+ GVGM+MEG E NPV+++L++E+ +  ++ 
Sbjct: 384 YGGNVGLHGKLKHVIDEFYKAKESPFGKTLKGVGMTMEGSENNPVMFELLTELPWCPQRF 443

Query: 238 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSII 295
           D   W+ +Y+V RYG+S P +QDAW +L +++YNC D  T +  +  V  A P       
Sbjct: 444 DKDQWLREYTVARYGKSNPTVQDAWILLSNSIYNCPDANTQQGTHESVFCARP------- 496

Query: 296 SVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 355
             TE  YQ                SS+     +Y  ++VIRA  + ++  +E   +N + 
Sbjct: 497 --TEHPYQ---------------VSSWSEMKDYYDPNDVIRAAAMMVSVADEFKGNNNFE 539

Query: 356 YDLIDLTRQALAKYANELFLNIIEA-YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 414
           YDL+D+ RQA+A+    L   ++EA +   D       S RFL L+   D LLA    F 
Sbjct: 540 YDLVDIVRQAIAE-KGRLTEKVVEAAFAAGDKKLYKDASDRFLRLILLQDELLATRPEFK 598

Query: 415 LGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 474
           +G W+  A+ L    E+++ YEWNAR QIT W +    +   LRDY ++ W+G+L+D+Y 
Sbjct: 599 VGTWIARARSLGGTPEEKELYEWNARVQITTWGNRLAADEGGLRDYAHREWNGILKDFYY 658

Query: 475 PRAAIYFKYMIESLESGDGFRLKDWRR----EWIKLTNDWQNGRNVYPVESNGDALITSQ 530
            R   +F Y           RL D R+    ++  +   W    NVY  E  GD + T +
Sbjct: 659 MRWKTWFDYQT---------RLLDGRKTAAIDFYAIEERWTKATNVYSSEPEGDCISTVK 709

Query: 531 WLY 533
            ++
Sbjct: 710 RIF 712


>gi|403416059|emb|CCM02759.1| predicted protein [Fibroporia radiculosa]
          Length = 705

 Score =  247 bits (630), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 156/542 (28%), Positives = 276/542 (50%), Gaps = 42/542 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP A+  ++P+A I     W    S   +  T  L+  DPLF  + ++FI +
Sbjct: 194 VLPAFTGFVPRAMSTLYPNASIVNGSQWEGFPSTLTY--TTFLEPFDPLFTTMQKSFISK 251

Query: 68  QLKEYG-RTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-S 125
           Q   YG   SH+Y  D ++EN P      Y++++ A  ++ +Q+ D +AVW+MQGWLF +
Sbjct: 252 QQAAYGANVSHVYTLDQYNENDPYSGDVGYLANISAGTFASLQAADPEAVWMMQGWLFFA 311

Query: 126 YDPFWRPPQMKALLNSVPLG-KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
            + FW   ++ A L +VP    +++LDL++E  P W  +  +YG  +IWC LH+F GN+ 
Sbjct: 312 SEAFWTTERIAAFLGAVPSNDSMIILDLYSEAAPQWQRTDSYYGKQWIWCELHDFGGNMG 371

Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
             G L  +  GP++A  S  ++M G+G++ EG E N +VYD++ + A+    +D+ +++ 
Sbjct: 372 FEGNLPELVTGPIQAL-SNASSMRGMGLTPEGQEGNEIVYDILLDQAWSSTSIDIASYVE 430

Query: 245 QYSVRRYG-RSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
            +  RRY  + +P A Q+AW +L  TVY+ +D             P+   +I S+ E   
Sbjct: 431 AWVARRYTVQDLPSAAQEAWTILSTTVYSNSD-------------PNTQATIKSIFE--- 474

Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTS-EVIRALELFIASGNE---LSASNTYRYDL 358
                 ++ +    ++ + +    + Y T+  ++ AL+  + +  E   L +   + YD+
Sbjct: 475 ------LAPDLSGLTDRTGHHCTEIPYDTNITIVPALQNLVQAATENPLLLSVPEFMYDV 528

Query: 359 IDLTRQALAKYANELFLNIIEAYQLN--DAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
           +D+TRQ LA    +++  ++  +      A  V    +  L ++ D+D LL  +D FLL 
Sbjct: 529 VDVTRQLLANRFIDVYNELVSTFYSTGVTAASVKNAGQPLLTILSDVDTLLWTNDNFLLS 588

Query: 417 PWLESAKQLAQNEEQEKQY-EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
            W+  A  L+ N      Y E+NAR QIT+W  + +     + DY +K W+G +  YY  
Sbjct: 589 NWILGAINLSDNNGTYADYLEYNARNQITLWGPDGE-----INDYASKQWAGFVGTYYYD 643

Query: 476 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 535
           R  ++  Y+ +  ++G  +     +   +    +W           +GD +     L  K
Sbjct: 644 RWNMFITYLEDITQNGTAYNDTAIQTVMLNFGKEWDTQTYSLSATVSGDTMSIVDSLIQK 703

Query: 536 YL 537
           +L
Sbjct: 704 WL 705


>gi|224025137|ref|ZP_03643503.1| hypothetical protein BACCOPRO_01871 [Bacteroides coprophilus DSM
           18228]
 gi|224018373|gb|EEF76371.1| hypothetical protein BACCOPRO_01871 [Bacteroides coprophilus DSM
           18228]
          Length = 718

 Score =  246 bits (628), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 167/532 (31%), Positives = 265/532 (49%), Gaps = 63/532 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP +SG VP   ++      +   G W      PR      L  TDP F  I   +  +
Sbjct: 224 VLPGYSGMVPHNAKDRL-GLNVADPGRW---NGYPR---PAFLQPTDPQFERIAALYYRE 276

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
             + YG+ S+ Y+ D F E  NT  VD    + + G AI+  M+  +  A W++Q W   
Sbjct: 277 MTRLYGKVSY-YSMDPFHEGGNTSGVD----LEAAGKAIWKAMKQANPRAAWVVQAW--G 329

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST------SKQFYGV-PYIWCMLHN 178
            +P    PQM   + ++P G +VVLDLF+E +P W         K+ +G   +++CML N
Sbjct: 330 ANP---RPQM---IRNLPAGDMVVLDLFSESRPQWGDPASSWYRKEGFGQHDWLFCMLLN 383

Query: 179 FAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 237
           + GN+ ++G +  +     +A+ S    T+ GVGM+MEGIE NPV+Y+L+ E+ ++ ++ 
Sbjct: 384 YGGNVGLHGKMAHLIEEFYKAKDSSFGKTLKGVGMTMEGIENNPVMYELLCELPWREQRF 443

Query: 238 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSII 295
               W+  Y   RYG+S   +  AW +L +T+YNC   +T +  +  ++ A P      +
Sbjct: 444 SKDEWLEGYLKARYGKSDSQVSQAWMLLSNTIYNCPAASTQQGTHESILCARPSWKAYQV 503

Query: 296 SVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 355
           S                    SE S Y      Y  ++VIRA  + + +      +N + 
Sbjct: 504 SSW------------------SEMSDY------YDPADVIRAAGMMVDAAERFRGNNNFE 539

Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSR-RFLELVEDMDGLLACHDGFL 414
           YDL+D+ RQA+A+    ++  +++AY+  D   +F+LS  RFL L+   D LLA    F 
Sbjct: 540 YDLVDIVRQAVAEKGRLMYRVLVDAYKAGDRE-LFKLSSDRFLRLILMQDRLLATRSEFK 598

Query: 415 LGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 474
           +G WLESA+ L   EE++  YEWNAR QIT W +    +   L DY ++ W+GLLRD+Y 
Sbjct: 599 VGRWLESARNLGSTEEEKDWYEWNARVQITTWGNRVAADDGGLHDYAHREWNGLLRDFYY 658

Query: 475 PRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL 526
            R   +    ++S E G    +     ++  L   W    N Y  E+ G+ +
Sbjct: 659 LRWKTWLDEQLKSFEGGQPKAI-----DFYALEEPWTLKHNSYASEAEGNPV 705


>gi|355706271|gb|AES02588.1| N-acetylglucosaminidase [Mustela putorius furo]
          Length = 333

 Score =  246 bits (628), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 146/371 (39%), Positives = 210/371 (56%), Gaps = 50/371 (13%)

Query: 112 DSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVP 170
           D DAVWL+QGWLF + P FW P Q++A+L +VP G+L++LDLFAE +P++  +  F+G P
Sbjct: 3   DPDAVWLLQGWLFQHQPQFWGPAQVRAVLGAVPRGRLLILDLFAESQPVYLRTASFHGQP 62

Query: 171 YIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEM 230
           +IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG GM+ EGI QN VVY LM+E+
Sbjct: 63  FIWCMLHNFGGNHGLFGALEAVNQGPAAARLFPNSTMVGTGMAPEGIGQNEVVYALMAEL 122

Query: 231 AFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFP 288
            ++ + V D++AW+  ++ RRYG      + AW +L  +VYNC+ +  T  NR  +V   
Sbjct: 123 GWRKDPVADLEAWVTSFAARRYGVDSKETEVAWRLLLGSVYNCSGEACTGHNRSPLVR-- 180

Query: 289 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 348
              PS+  VT                            +WY+ S V  A  L +A+   L
Sbjct: 181 --RPSLQMVTT---------------------------VWYNRSAVFEAWRLLLAAAPTL 211

Query: 349 SASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVED 402
           + S T+RYDL+D+TRQA  +  +  +     AY       L  A G+        EL+  
Sbjct: 212 AKSPTFRYDLLDVTRQAAQELVSLYYTEARTAYLNKELVPLMRAAGIL-----VYELLPA 266

Query: 403 MDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGN 462
           +DG+LA    FLLG WLE A+ +A +E   + YE N R Q+T+W      E ++L DY N
Sbjct: 267 LDGVLASDSRFLLGTWLEQARAVAVSETDARFYEQNGRYQLTLW----GPEGNIL-DYAN 321

Query: 463 KYWSGLLRDYY 473
           K  +GL+  YY
Sbjct: 322 KQLAGLVAGYY 332


>gi|329963073|ref|ZP_08300853.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
 gi|328529114|gb|EGF56044.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
          Length = 717

 Score =  245 bits (625), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 167/546 (30%), Positives = 270/546 (49%), Gaps = 73/546 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           VLP +SG VP   +         +LG   +V SDP   C Y     L  TDP F EI   
Sbjct: 224 VLPGYSGMVPHNAKE--------RLG--LNV-SDPGLWCGYPRPAFLQPTDPRFGEIADL 272

Query: 64  FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           + ++  + YG+ +  Y+ D F E  +   VD    +++ G AI+  M+  +  AVW+ Q 
Sbjct: 273 YYKEMTRLYGK-ADFYSMDPFHEGGSIAGVD----LNAAGQAIWGAMKKVNPKAVWVAQA 327

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST------SKQFYGV-PYIWC 174
           W  +       P+ K ++ ++P G L+VLDLF+E +P W         K+ +G   +++C
Sbjct: 328 WQAN-------PRQK-MIENIPQGDLIVLDLFSESRPQWGDPASTWYRKEGFGKHDWLYC 379

Query: 175 MLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQ 233
           ML N+ GN+ ++G +  +     +A+TS    T+ GVGM+MEG E N V+++L+ E+ ++
Sbjct: 380 MLLNYGGNVGLHGKMRHVIDEFYKAKTSPFGKTLKGVGMTMEGSENNSVMFELLCELPWR 439

Query: 234 HEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVD 291
             + +   W+  Y+  RYG++   +Q AW +L +++YNC D  T +  +  V  A P +D
Sbjct: 440 PAQFEKDEWLKNYTAARYGKADATVQQAWLLLSNSIYNCPDANTQQGTHESVFCARPGMD 499

Query: 292 PSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS 351
                     YQ                SS+     +Y   EVIRA  + +++ +    +
Sbjct: 500 ---------VYQ---------------VSSWSEMVKYYEPEEVIRAAGILLSAADRFKGN 535

Query: 352 NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHD 411
           N + YDL+D+ RQA+A+    ++  +I+A +  +       S+RFL L+   D LLA   
Sbjct: 536 NNFEYDLVDIVRQAVAEKGRLVYPIMIDALKAGEKELFAAASQRFLNLILLQDRLLATRP 595

Query: 412 GFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRD 471
            F +G W+E A+ L   +E++K YEWNAR QI  W + T  +   LRDY +K W+G+LRD
Sbjct: 596 EFKVGTWIEKARNLGTTQEEKKLYEWNARVQIATWGNRTAADEGGLRDYAHKEWNGMLRD 655

Query: 472 YYGPRAAIYFKYMIESLES--GDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITS 529
           +Y  R  ++       L      GF        W   TND       YP    GD +  +
Sbjct: 656 FYYHRWKLWIDAQTAQLNGAPAQGFDFYAIEEPWTLQTND-------YPSHPEGDVIEVA 708

Query: 530 QWLYNK 535
           +  Y +
Sbjct: 709 RTAYKE 714


>gi|218258436|ref|ZP_03474815.1| hypothetical protein PRABACTJOHN_00470 [Parabacteroides johnsonii
           DSM 18315]
 gi|423342591|ref|ZP_17320305.1| hypothetical protein HMPREF1077_01735 [Parabacteroides johnsonii
           CL02T12C29]
 gi|218225494|gb|EEC98144.1| hypothetical protein PRABACTJOHN_00470 [Parabacteroides johnsonii
           DSM 18315]
 gi|409217508|gb|EKN10484.1| hypothetical protein HMPREF1077_01735 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 718

 Score =  244 bits (624), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 164/539 (30%), Positives = 262/539 (48%), Gaps = 63/539 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V P +SG VP   +       ++  G W   +          L  TDP F EI   + ++
Sbjct: 224 VFPGYSGMVPHNAKEKL-GLNVSDPGLWNGYRRPA------FLQPTDPRFEEIASLYYKE 276

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
             K YG+ ++ Y+ D F E  +   VD    + + G AI   M+  +  AVW+ Q W  +
Sbjct: 277 MNKLYGKANY-YSMDPFHEGGSVAGVD----LDAAGKAIMQAMKKNNPKAVWVAQAWQAN 331

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST------SKQFYGV-PYIWCMLHN 178
             P         ++ ++  G L+ LDLFAE +P W         K  +G   +I+CML N
Sbjct: 332 PRP--------QMIGNLEAGDLIALDLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLN 383

Query: 179 FAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 237
           + GNI ++G +  +     +A+ S   TT+ GVGM+MEG E NPV+++L++E+ ++ ++ 
Sbjct: 384 YGGNIGLHGKMKHVIDEFYKAKESPFGTTLKGVGMTMEGSENNPVMFELLTELPWRPQRF 443

Query: 238 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSII 295
           D   W+  Y+V RYG+S P +QDAW +L +++YNC D  T +  +  V  A P       
Sbjct: 444 DKDQWLKAYTVARYGKSNPVVQDAWILLSNSIYNCPDANTQQGTHESVFCARP------- 496

Query: 296 SVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 355
             TE  YQ                SS+     +Y  ++VIRA  + ++  ++   +N + 
Sbjct: 497 --TEHPYQ---------------VSSWSEMKDYYDPNDVIRAAAMMVSVSDQFKGNNNFE 539

Query: 356 YDLIDLTRQALAKYANELFLNIIEA-YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 414
           YDL+D+ RQA+A+    L   ++EA +   D       S RFL L+   D LLA    F 
Sbjct: 540 YDLVDIVRQAIAE-KGRLTEKVVEAAFAAGDKKLYKDASDRFLRLILLQDELLATRPEFK 598

Query: 415 LGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 474
           +G W+  A+ L    E++  YEWNAR QIT W +    +   LRDY ++ W+G+L+D+Y 
Sbjct: 599 VGTWIARARSLGNTSEEKDLYEWNARVQITTWGNRLAADEGGLRDYAHREWNGILKDFYY 658

Query: 475 PRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLY 533
            R   +F Y    L+      +     ++  +   W    N Y  E  GD + T Q ++
Sbjct: 659 MRWKTWFDYQTRLLDGKKTAAI-----DFYAIEEPWTKQTNPYSNEPEGDCIPTVQRIF 712


>gi|345511813|ref|ZP_08791352.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
 gi|229443748|gb|EEO49539.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
          Length = 720

 Score =  244 bits (622), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 156/530 (29%), Positives = 263/530 (49%), Gaps = 39/530 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF+G+VPA L+ ++P AKI  +  W       R   ++ +D  D L+  I R F+E+
Sbjct: 220 ILPAFAGHVPAELKELYPEAKIYTMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEE 276

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY- 126
           Q K YG T+HIY  D F+E   P  + E++S++   IY  +Q  DS A WL   W+F + 
Sbjct: 277 QTKVYG-TNHIYGIDPFNEVDSPNWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHA 335

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W  P++K+ LN+VP  KL++LD + +   IW  ++Q+YG PYIWC L NF GN  + 
Sbjct: 336 KEKWTQPRIKSFLNAVPQDKLILLDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLA 395

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G L+ + F            + G+G+++EG++ NP++Y+ + E A+Q+  + V  WI  +
Sbjct: 396 GDLNDVDFKIDRLFKEGGDNVYGLGVTLEGLDVNPLMYEFVFERAWQN-SMPVHQWIANW 454

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           +  R G     I  AW  LY  +Y  T  A      ++ A P ++      T   Y    
Sbjct: 455 AQCRGGNVDNHIVKAWKQLYEKIY--TSAALCGQAVLMNARPQLEGVEGWNTLPGY---- 508

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
                          Y +  LW    E+++A  ++          + Y +D+I++ RQ L
Sbjct: 509 --------------DYKNIDLWEIWKELLKAEGVY---------HSEYHFDVINVGRQVL 545

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
                +      + Y+  D  G     +R  +L+ D+D LL C   F +G W++ A+  A
Sbjct: 546 GNLFADYRDKFTDCYRKKDLEGTKVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFA 605

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            NE+++K YE NAR  +T+W     ++ + L DY N+ W GL R +Y  R   + + +I 
Sbjct: 606 VNEQEQKYYEENARCILTVW----GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIA 661

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++     F  + + ++  +   +W      +P+ S  + +  ++ L  KY
Sbjct: 662 AMTRHKNFDEEKFHQDITQFEYEWTLKNEDFPIISEENPISLAKELILKY 711


>gi|336374066|gb|EGO02404.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 761

 Score =  244 bits (622), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 152/499 (30%), Positives = 264/499 (52%), Gaps = 47/499 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNW--FSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
           VLP+F+G VP A+  ++P+A I     W  F+++    +     L+  DPLF  +  +FI
Sbjct: 245 VLPSFTGFVPRAMHTLYPNASIVNGSQWNGFTIQ----YTNDSFLEPFDPLFSTLQTSFI 300

Query: 66  EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
            +Q+  YG  SH+Y  D ++EN+P      Y++++ AA ++ +++ D  AVWLMQGWLF 
Sbjct: 301 SKQVAAYGNVSHVYTLDQYNENSPYSGDTSYLANVTAATFASLRAADPQAVWLMQGWLFY 360

Query: 126 YDP-FWRPPQMKALLNSVPLG-KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
            D  FW   +++A L  VP    +++LDL++E +P W     ++G  +IWC LH++ GN+
Sbjct: 361 SDSTFWTTERVEAYLGGVPGNDSMIILDLYSEAQPQWQRLNSYFGKQWIWCELHDYGGNM 420

Query: 184 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 243
              G  +++   P++A  +   +MVG+G++MEG E N ++YD++ + A+    ++  A+I
Sbjct: 421 GFEGNFENVTTQPIKALATPGNSMVGMGLTMEGQEGNEIIYDVLLDQAWSSTPLNRTAYI 480

Query: 244 NQYSVRRYGRSVP----AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
           + ++ RRY  +VP    A  +AW +L  TVYN  D  T      I+   ++ PSI     
Sbjct: 481 SAWASRRY--NVPDLPTAALEAWEILGATVYNNQDVTTQSTVKSIL---ELSPSITG--- 532

Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA-SNT--YRY 356
                          L + T ++     + + + ++ AL+L + +  E SA SN   ++Y
Sbjct: 533 ---------------LVNRTGTHSTKLFYDTNTTIVPALKLLLQARQEASALSNIPEFQY 577

Query: 357 DLIDLTRQALAKYANELFLNIIEAY--QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 414
           D++D+TRQ LA    +L+ ++I+ +    + +  V       L L++D+D +L     FL
Sbjct: 578 DVVDVTRQLLANRFIDLYTSLIDTFSSTSSSSSAVSAAGAPLLALLQDLDSVLLTDTHFL 637

Query: 415 LGPWLESAKQLAQ--NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           L  W+ +A+      N       E+NAR Q+T+W    +     + DY +K W GL+  Y
Sbjct: 638 LARWISAARNWTHGDNATYAAYLEYNARNQVTLWGPRGE-----VNDYASKQWGGLVGTY 692

Query: 473 YGPRAAIYFKYMIESLESG 491
           Y  R   +  Y+  S E+ 
Sbjct: 693 YVQRWETFVGYLAGSKENA 711


>gi|374385779|ref|ZP_09643282.1| hypothetical protein HMPREF9449_01668 [Odoribacter laneus YIT
           12061]
 gi|373225481|gb|EHP47815.1| hypothetical protein HMPREF9449_01668 [Odoribacter laneus YIT
           12061]
          Length = 715

 Score =  244 bits (622), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 159/510 (31%), Positives = 242/510 (47%), Gaps = 58/510 (11%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIE 59
           + ++ + P F G        + P     +LG    VK    WC       L   +P F  
Sbjct: 215 MREYGIEPVFPGYA-----GMLPHNASEKLG--IEVKDPGLWCGYQRPAFLYPENPAFKR 267

Query: 60  IGRAFIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVW 117
           I   +  +  K +G+ +  Y  D F E  N   +D    +++   ++   M++ + +AVW
Sbjct: 268 IAGLYYMEMEKRFGK-AKFYGMDPFHEGGNVQGID----LAAAAQSVLQAMKTANPEAVW 322

Query: 118 LMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS-------TSKQFYGVP 170
           +MQ W  +       P+ + ++ ++  G +++LDL +E +P+W          K F G  
Sbjct: 323 VMQAWQAN-------PRHE-MITALQPGNVLILDLSSENRPMWGDKESVWYREKGFEGQD 374

Query: 171 YIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSE 229
           +++CML NF GN+ MYG +D +  G   A    N  ++ GVG +MEGIE NPV+Y+L+ E
Sbjct: 375 WLYCMLLNFGGNVGMYGRMDRVINGFYAAVQHPNGASLRGVGKTMEGIENNPVMYELLLE 434

Query: 230 MAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 289
           + ++      + W+  Y   RYG+  P +Q AW +L    YNC          V  A P 
Sbjct: 435 LPWRKIPFTKEEWLKGYVKARYGKDDPRLQQAWQILGKAAYNCPVVQEGTTESVFCARP- 493

Query: 290 VDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELS 349
                                  A   S  SS+    L+Y+  E  +   LF+    +  
Sbjct: 494 -----------------------AEEISGASSWGTSELYYAPEESKKVAALFLEVSEQYK 530

Query: 350 ASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLAC 409
            +N + YDL D+ RQALA   N L   I EAY+L D      LSR FL+L+   D LLA 
Sbjct: 531 GNNNFEYDLTDIMRQALADKGNVLQKKITEAYRLKDETAFRNLSREFLQLILWQDTLLAT 590

Query: 410 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 469
              F LG WLE AK   + EE+++ YEWNAR QIT W +    +   LRDY ++ W+GLL
Sbjct: 591 RPEFRLGTWLERAKAKGETEEEKRLYEWNARVQITTWGNRQAADKGGLRDYSHREWAGLL 650

Query: 470 RDYYGPRAAIYFKYMIESLESGDGFRLKDW 499
           +D+Y PR   YF  ++E   +G+     DW
Sbjct: 651 KDFYYPRWKAYFD-LLEKRLAGEETEDIDW 679


>gi|336386984|gb|EGO28130.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 738

 Score =  244 bits (622), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 152/499 (30%), Positives = 264/499 (52%), Gaps = 47/499 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNW--FSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
           VLP+F+G VP A+  ++P+A I     W  F+++    +     L+  DPLF  +  +FI
Sbjct: 222 VLPSFTGFVPRAMHTLYPNASIVNGSQWNGFTIQ----YTNDSFLEPFDPLFSTLQTSFI 277

Query: 66  EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
            +Q+  YG  SH+Y  D ++EN+P      Y++++ AA ++ +++ D  AVWLMQGWLF 
Sbjct: 278 SKQVAAYGNVSHVYTLDQYNENSPYSGDTSYLANVTAATFASLRAADPQAVWLMQGWLFY 337

Query: 126 YDP-FWRPPQMKALLNSVPLG-KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
            D  FW   +++A L  VP    +++LDL++E +P W     ++G  +IWC LH++ GN+
Sbjct: 338 SDSTFWTTERVEAYLGGVPGNDSMIILDLYSEAQPQWQRLNSYFGKQWIWCELHDYGGNM 397

Query: 184 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 243
              G  +++   P++A  +   +MVG+G++MEG E N ++YD++ + A+    ++  A+I
Sbjct: 398 GFEGNFENVTTQPIKALATPGNSMVGMGLTMEGQEGNEIIYDVLLDQAWSSTPLNRTAYI 457

Query: 244 NQYSVRRYGRSVP----AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
           + ++ RRY  +VP    A  +AW +L  TVYN  D  T      I+   ++ PSI     
Sbjct: 458 SAWASRRY--NVPDLPTAALEAWEILGATVYNNQDVTTQSTVKSIL---ELSPSITG--- 509

Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA-SNT--YRY 356
                          L + T ++     + + + ++ AL+L + +  E SA SN   ++Y
Sbjct: 510 ---------------LVNRTGTHSTKLFYDTNTTIVPALKLLLQARQEASALSNIPEFQY 554

Query: 357 DLIDLTRQALAKYANELFLNIIEAY--QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 414
           D++D+TRQ LA    +L+ ++I+ +    + +  V       L L++D+D +L     FL
Sbjct: 555 DVVDVTRQLLANRFIDLYTSLIDTFSSTSSSSSAVSAAGAPLLALLQDLDSVLLTDTHFL 614

Query: 415 LGPWLESAKQLAQ--NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           L  W+ +A+      N       E+NAR Q+T+W    +     + DY +K W GL+  Y
Sbjct: 615 LARWISAARNWTHGDNATYAAYLEYNARNQVTLWGPRGE-----VNDYASKQWGGLVGTY 669

Query: 473 YGPRAAIYFKYMIESLESG 491
           Y  R   +  Y+  S E+ 
Sbjct: 670 YVQRWETFVGYLAGSKENA 688


>gi|294807833|ref|ZP_06766618.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
           CC 1b]
 gi|294444952|gb|EFG13634.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
           CC 1b]
          Length = 703

 Score =  244 bits (622), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 155/530 (29%), Positives = 262/530 (49%), Gaps = 39/530 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF+G+VPA L+ ++P AKI  +  W       R   ++ +D  D L+  I R F+E+
Sbjct: 203 ILPAFAGHVPAELKELYPEAKIYTMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEE 259

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY- 126
           Q K YG T+HIY  D F+E   P  + E++S++   IY  +Q  DS A WL   W+F + 
Sbjct: 260 QTKVYG-TNHIYGIDPFNEVDSPNWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHA 318

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W  P++K+ LN+VP  KL++LD + +   IW  ++Q+YG PYIWC L NF GN  + 
Sbjct: 319 KEKWTQPRIKSFLNAVPQDKLILLDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLA 378

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G L+ + F            + G+G+++EG++ NP++Y+ + E A+Q+  + V  WI  +
Sbjct: 379 GDLNDVDFKIDRLFKEGGDNVYGLGVTLEGLDVNPLMYEFVFERAWQN-SMPVHQWIANW 437

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           +  R G     I  AW  LY  +Y  T  A      ++ A P ++               
Sbjct: 438 AQCRGGNVDNHIVKAWKQLYEKIY--TSAALCGQAVLMNARPQLE--------------- 480

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
                E         Y +  LW    E+++A  ++          + Y +D+I++ RQ L
Sbjct: 481 ---GVEGWNTLPGYDYKNIDLWEIWKELLKAEGVY---------HSEYHFDVINVGRQVL 528

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
                +      + Y+  D  G     +R  +L+ D+D LL C   F +G W++ A+  A
Sbjct: 529 GNLFADYRDKFTDCYRKKDLEGTKVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFA 588

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            NE+++K YE NAR  +T+W     ++ + L DY N+ W GL R +Y  R   + + +I 
Sbjct: 589 VNEQEQKYYEENARCILTVW----GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIA 644

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++     F  + + ++  +   +W      +P+ S  + +  ++ L  KY
Sbjct: 645 AMTRHKNFDEEKFHQDITQFEYEWTLKNEDFPIISEENPISLAKELILKY 694


>gi|262407713|ref|ZP_06084261.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
 gi|262354521|gb|EEZ03613.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
          Length = 735

 Score =  244 bits (622), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 155/530 (29%), Positives = 262/530 (49%), Gaps = 39/530 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF+G+VPA L+ ++P AKI  +  W       R   ++ +D  D L+  I R F+E+
Sbjct: 235 ILPAFAGHVPAELKELYPEAKIYTMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEE 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY- 126
           Q K YG T+HIY  D F+E   P  + E++S++   IY  +Q  DS A WL   W+F + 
Sbjct: 292 QTKVYG-TNHIYGIDPFNEVDSPNWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHA 350

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W  P++K+ LN+VP  KL++LD + +   IW  ++Q+YG PYIWC L NF GN  + 
Sbjct: 351 KEKWTQPRIKSFLNAVPQDKLILLDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLA 410

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G L+ + F            + G+G+++EG++ NP++Y+ + E A+Q+  + V  WI  +
Sbjct: 411 GDLNDVDFKIDRLFKEGGDNVYGLGVTLEGLDVNPLMYEFVFERAWQN-SMPVHQWIANW 469

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           +  R G     I  AW  LY  +Y  T  A      ++ A P ++               
Sbjct: 470 AQCRGGNVDNHIVKAWKQLYEKIY--TSAALCGQAVLMNARPQLE--------------- 512

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
                E         Y +  LW    E+++A  ++          + Y +D+I++ RQ L
Sbjct: 513 ---GVEGWNTLPGYDYKNIDLWEIWKELLKAEGVY---------HSEYHFDVINVGRQVL 560

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
                +      + Y+  D  G     +R  +L+ D+D LL C   F +G W++ A+  A
Sbjct: 561 GNLFADYRDKFTDCYRKKDLEGTKVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFA 620

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            NE+++K YE NAR  +T+W     ++ + L DY N+ W GL R +Y  R   + + +I 
Sbjct: 621 VNEQEQKYYEENARCILTVW----GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIA 676

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++     F  + + ++  +   +W      +P+ S  + +  ++ L  KY
Sbjct: 677 AMTRHKNFDEEKFHQDITQFEYEWTLKNEDFPIISEENPISLAKELILKY 726


>gi|294647264|ref|ZP_06724861.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
 gi|292637401|gb|EFF55822.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
          Length = 733

 Score =  243 bits (621), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 155/530 (29%), Positives = 262/530 (49%), Gaps = 39/530 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF+G+VPA L+ ++P AKI  +  W       R   ++ +D  D L+  I R F+E+
Sbjct: 233 ILPAFAGHVPAELKELYPEAKIYTMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY- 126
           Q K YG T+HIY  D F+E   P  + E++S++   IY  +Q  DS A WL   W+F + 
Sbjct: 290 QTKVYG-TNHIYGIDPFNEVDSPNWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHA 348

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W  P++K+ LN+VP  KL++LD + +   IW  ++Q+YG PYIWC L NF GN  + 
Sbjct: 349 KEKWTQPRIKSFLNAVPQDKLILLDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLA 408

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G L+ + F            + G+G+++EG++ NP++Y+ + E A+Q+  + V  WI  +
Sbjct: 409 GDLNDVDFKIDRLFKEGGDNVYGLGVTLEGLDVNPLMYEFVFERAWQN-SMPVHQWIANW 467

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           +  R G     I  AW  LY  +Y  T  A      ++ A P ++               
Sbjct: 468 AQCRGGNVDNHIVKAWKQLYEKIY--TSAALCGQAVLMNARPQLE--------------- 510

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
                E         Y +  LW    E+++A  ++          + Y +D+I++ RQ L
Sbjct: 511 ---GVEGWNTLPGYDYKNIDLWEIWKELLKAEGVY---------HSEYHFDVINVGRQVL 558

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
                +      + Y+  D  G     +R  +L+ D+D LL C   F +G W++ A+  A
Sbjct: 559 GNLFADYRDKFTDCYRKKDLEGTKVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFA 618

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            NE+++K YE NAR  +T+W     ++ + L DY N+ W GL R +Y  R   + + +I 
Sbjct: 619 VNEQEQKYYEENARCILTVW----GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIA 674

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++     F  + + ++  +   +W      +P+ S  + +  ++ L  KY
Sbjct: 675 AMTRHKNFDEEKFHQDITQFEYEWTLKNEDFPIISEENPISLAKELILKY 724


>gi|29348998|ref|NP_812501.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29340905|gb|AAO78695.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 732

 Score =  243 bits (620), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 155/530 (29%), Positives = 259/530 (48%), Gaps = 39/530 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPA L+ ++P+AKI Q+  W       R   ++ +D  D L+  I R F+E+
Sbjct: 233 VLPAFAGHVPAELKTIYPNAKIYQMSQWGGFDEKYR---SHFIDPMDSLYSIIQRRFLEE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K YG T HIY  D F+E   P  S ++++++ + IY  +   DS A WL   W+F YD
Sbjct: 290 QTKVYG-TDHIYGIDPFNEVDSPDWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYD 348

Query: 128 PF-WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W  P++++ L +VP  KL++LD + +   IW  ++++YG PYIWC L NF GN  + 
Sbjct: 349 KKKWTQPRIRSFLKAVPDNKLILLDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIA 408

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G L+ I F            + G+G ++EG + NP++Y+ + + A+ +  V    WI  +
Sbjct: 409 GNLNDIDFKIKRLFKEGGDNVYGLGATLEGFDVNPLMYEFVFDQAWDY-PVTTDQWITNW 467

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           S+ R G     I  AW  L+  +Y  T+ AT     ++ A P        +T  K  N  
Sbjct: 468 SMCRGGNQDANIIKAWRALHQKIY--TEHATCGQSVLMNARP-------RLTGTKSWNTN 518

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
             +            Y +  LW    E+++A  +          ++ +R+D+I++ RQ L
Sbjct: 519 PGI-----------HYANNDLWQIWKELLKARNI---------NNSDFRFDVINIGRQVL 558

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
               +E        Y   D  G+ + S R   L+ D+D LL+C     +G WL+ A+   
Sbjct: 559 GNLFSEYRDQFTACYNRKDTTGMREWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARNCG 618

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
               ++  YE NAR  +T+W     ++ + L DY N+ W GL R +Y  R   +   +I 
Sbjct: 619 ATVSEKDYYEENARCILTVW----GQQDTQLNDYANRGWGGLTRSFYRERWKRFTDGVIA 674

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++     F    + ++  +   +W   ++ +P+ S  D +  +  L  KY
Sbjct: 675 AVSEDKPFDEDKFHQDITQFEYNWTLQKDSFPIVSEEDPIQIADSLILKY 724


>gi|340520426|gb|EGR50662.1| glycoside hydrolase family 89 [Trichoderma reesei QM6a]
          Length = 747

 Score =  243 bits (620), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 147/484 (30%), Positives = 259/484 (53%), Gaps = 39/484 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF G VP  +  VFP   ++    W +  +         ++  DP F ++ + FI +
Sbjct: 239 ILPAFPGFVPRNISRVFPDISLSTSPIWSNFGTTL--SADIYINPFDPRFAQLQKLFINK 296

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + YG  ++ +  D F+EN P     +Y+ ++    ++ +++ D +AVW+MQ WLFS D
Sbjct: 297 QQELYGNVTNFWTLDQFNENRPLSGDLDYLRNVSHNTWAALKAADPEAVWVMQAWLFSSD 356

Query: 128 P-FWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             FW   +++ALL  VP+ + +++LDLFAE  P W  +  FYG P+IWC LHN+ GN+ +
Sbjct: 357 SSFWTNDRVEALLGGVPVNQDMLLLDLFAESAPQWQRTDSFYGKPWIWCELHNYGGNMGL 416

Query: 186 YGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
           YG ++++    ++A R S+  ++VG G++MEG E N ++YDL+ + A+  + +D   + +
Sbjct: 417 YGQIENVTINSMDAVRNSD--SIVGFGLTMEGQEGNEIMYDLLLDQAWSPKPIDTDTYFH 474

Query: 245 QYSVRRYG-RSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
            +   RYG ++V  +   W +L  TV+N T+   +  +  I+   ++ PSI  +  G+  
Sbjct: 475 DWVSARYGAKNVKGLYKGWEMLRPTVFNNTNLTVNAVQKSIL---ELTPSISGLL-GRTG 530

Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN--TYRYDLIDL 361
            +G  +  +  +  E  S                 ELF A   +L+  N  +Y+YDL+D 
Sbjct: 531 RHGTTIMYDPAVMVEAWS-----------------ELFKAGLQDLTLFNNPSYQYDLVDW 573

Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLE 420
           TRQ L     + + ++++AY  + +  V +    + + L++ +D +LA +  F L PW++
Sbjct: 574 TRQVLVNSFEDHYKDLVDAYNKSSSPTVIRTRGAKLVTLLKTLDAVLATNKNFQLTPWID 633

Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
            A+  A +      +E+NAR QIT+W    Q E     DY +K W+GL+  YY  R   +
Sbjct: 634 RAR--ASSPSSANFFEFNARNQITLWGPQGQIE-----DYASKQWAGLVGTYYAERWQQF 686

Query: 481 FKYM 484
             Y+
Sbjct: 687 VDYL 690


>gi|404487206|ref|ZP_11022393.1| hypothetical protein HMPREF9448_02854 [Barnesiella intestinihominis
           YIT 11860]
 gi|404335702|gb|EJZ62171.1| hypothetical protein HMPREF9448_02854 [Barnesiella intestinihominis
           YIT 11860]
          Length = 731

 Score =  242 bits (618), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 165/541 (30%), Positives = 257/541 (47%), Gaps = 63/541 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V   F G VP +L+  FP A I + G W + +          LD  DPLF +I   + E+
Sbjct: 233 VFQGFYGMVPNSLKEKFPEANIKEQGEWQTYQRPA------FLDPNDPLFDKIADIYYEE 286

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q K +G+  + +  D F E   +  +D    + +    I   M+    +AVW++QGW   
Sbjct: 287 QEKLFGKAVY-FAGDPFHEGGQSEGID----VKAAAKKILKAMRRKTPEAVWIIQGWQ-- 339

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQ--FYGVP------YIWCMLH 177
                R P M+ LL  +  G+ ++LDL A  +P W   K   FY         +IWC L 
Sbjct: 340 -----RNP-MRDLLEGLEHGEAIILDLMACERPQWGGIKNSLFYKAEGHMHHDWIWCALP 393

Query: 178 NFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEK 236
           NF G   ++G + S A G V A+       + G+G + EGI   PVVYD++ +MA++ + 
Sbjct: 394 NFGGKTGLHGKMSSYASGVVFAKNHPLGKNLCGIGTAPEGIGTIPVVYDMVYDMAWREDS 453

Query: 237 VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 296
           +D+K W+NQY+  RYG++ P    AW +L  T+Y C +         I A P        
Sbjct: 454 IDIKDWVNQYTQYRYGKADPNCNRAWEILSKTIYECHNEIGGPVESYICARP-------- 505

Query: 297 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRY 356
                           +      SS+    ++Y  +E++ A E      +E + S TY+Y
Sbjct: 506 ----------------SDTIKHASSWGTAEIFYDPAEIVTAWECMYNVRHEFAQSETYQY 549

Query: 357 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
           DL+DLTRQ L  YA  L    + A+  ND  G    S +FL L+ D D LL+    F +G
Sbjct: 550 DLVDLTRQVLGDYAKYLHKQAVNAFYRNDLKGFQTYSSKFLVLIRDEDKLLSTRKEFNVG 609

Query: 417 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
            W+  A+  A   ++++++  NA+ QIT W ++     S L DY  K WSGL+RD Y PR
Sbjct: 610 TWINQARNAACTPQEQERFVANAKRQITTWTNHD----SKLHDYALKEWSGLMRDMYLPR 665

Query: 477 AAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
              +  Y + +L  G+  +  D    + ++  +W +    Y   S G+A+   + +Y KY
Sbjct: 666 WKAWVDYKL-ALLRGETAQEPD----YFQIEKNWVDSDTRYDSTSTGNAISAVEEIYKKY 720

Query: 537 L 537
            
Sbjct: 721 F 721


>gi|298386708|ref|ZP_06996263.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 1_1_14]
 gi|298260382|gb|EFI03251.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 1_1_14]
          Length = 732

 Score =  242 bits (618), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 154/530 (29%), Positives = 259/530 (48%), Gaps = 39/530 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPA L+ ++P+AKI Q+  W       R   ++ +D  D L+  I R F+E+
Sbjct: 233 VLPAFAGHVPAELKTIYPNAKIYQMSQWGGFDEKYR---SHFIDPMDSLYSIIQRRFLEE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K YG T HIY  D F+E   P  S ++++++ + IY  +   DS A WL   W+F YD
Sbjct: 290 QTKVYG-TDHIYGIDPFNEVDSPDWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYD 348

Query: 128 PF-WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W  P++++ L +VP  KL++LD + +   IW  ++++YG PYIWC L NF GN  + 
Sbjct: 349 KKKWTQPRIRSFLKAVPDNKLILLDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIA 408

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G L+ I F            + G+G ++EG + NP++Y+ + + A+ +  V    WI  +
Sbjct: 409 GNLNDIDFKIKRLFKEGGDNVYGLGATLEGFDVNPLMYEFVFDQAWDYS-VTTDQWITNW 467

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           S+ R G     I  AW  L+  +Y  T+ AT     ++ A P        +T  K  N  
Sbjct: 468 SMCRGGNQDANIIKAWRALHQKIY--TEHATCGQSVLMNARP-------RLTGTKSWNTN 518

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
             +            Y +  LW    E+++A  +          ++ +R+D+I++ RQ L
Sbjct: 519 PGI-----------HYANNDLWQIWKELLKARNI---------NNSDFRFDVINIGRQVL 558

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
               ++        Y   D  G+ + S R   L+ D+D LL+C     +G WL+ A+   
Sbjct: 559 GNLFSKYRDQFTACYNRKDTTGMREWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARNCG 618

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
               ++  YE NAR  +T+W     ++ + L DY N+ W GL R +Y  R   +   +I 
Sbjct: 619 ATVSEKDYYEENARCILTVW----GQQDTQLNDYANRGWGGLTRSFYRERWKRFTDGVIA 674

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++     F    + ++  +   +W   ++ +P+ S  D +  +  L  KY
Sbjct: 675 AVSEDKPFDEDKFHQDITQFEYNWTLQKDSFPIVSEEDPIQIADSLILKY 724


>gi|423259033|ref|ZP_17239956.1| hypothetical protein HMPREF1055_02233 [Bacteroides fragilis
           CL07T00C01]
 gi|423263996|ref|ZP_17242999.1| hypothetical protein HMPREF1056_00686 [Bacteroides fragilis
           CL07T12C05]
 gi|387776613|gb|EIK38713.1| hypothetical protein HMPREF1055_02233 [Bacteroides fragilis
           CL07T00C01]
 gi|392706262|gb|EIY99385.1| hypothetical protein HMPREF1056_00686 [Bacteroides fragilis
           CL07T12C05]
          Length = 718

 Score =  241 bits (615), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 163/540 (30%), Positives = 262/540 (48%), Gaps = 70/540 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           VLP + G VP   +         +LG   +V SDP   C Y     L  +DP F EI   
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273

Query: 64  FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           + ++  K YG+ ++ Y+ D F E  NT  VD    + + G A+   M+  +  AVW+ Q 
Sbjct: 274 YYKELEKLYGK-ANFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
           W  +  P         ++ ++  G L++LDL +E +P W  S      K  YG   +I+C
Sbjct: 329 WQANPRP--------KMIENLKAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYC 380

Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
           ML N+ GN+ ++G +D++   F   +A    + T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHASATLKGVGMTPEGIENNPVMYELVMELPW 440

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
           + ++   + W+ +Y   RYG   P +Q AW  L +++YN     T +  +  V  A P  
Sbjct: 441 RPDRFTKEEWLKEYVKARYGVDDPVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP-- 498

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
                   E  YQ                SS+     +Y   EVI A  L ++  +    
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQEVIEAARLMVSVADRFKG 536

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
           +N + YDL+D+ RQALA+    +   +  AY+  D       S +FL+L+   D LL   
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTR 596

Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
             F +G W+E A+ L    E+++ YEWNAR QIT W +    +   LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGDTPEEKELYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656

Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
           D+Y  R  +YF ++ + +E     ++     ++  +   W    N Y  E+ GD +  ++
Sbjct: 657 DFYYMRWKLYFDFLSQRMEGKPPAKI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711


>gi|391338146|ref|XP_003743422.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Metaseiulus
           occidentalis]
          Length = 665

 Score =  241 bits (615), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 139/465 (29%), Positives = 233/465 (50%), Gaps = 44/465 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V+P F+G VP A + + P+   ++   W +   +  +     L  T+  F+ +   +I  
Sbjct: 225 VVPGFNGFVPRAFERLHPAVSWSRASRWNNFPDE--YAMLTFLAPTESFFLNVSSLYITM 282

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
               YG + H+Y+ D F+E TP  + P  ++ + + +Y  +   D   +W+MQGWLF + 
Sbjct: 283 YRSIYG-SDHLYSVDLFNEETPDTNDPAALAEMSSNVYESIAKADPKGIWVMQGWLFVHG 341

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             +W   ++KA L   PLGK++VLDLF+E  P +     ++G P+IWCMLHN+ G   ++
Sbjct: 342 GDYWNHDRVKAFLGGPPLGKMIVLDLFSEQSPQFPRFSNYFGQPFIWCMLHNYGGVSGLF 401

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G L+ I   P+  R S    M+G+G++ EG  QN V+Y+ M+E +++    +V  W+  Y
Sbjct: 402 GNLEWINSEPLNVRRSV-PNMIGIGIAPEGTGQNEVIYEFMAENSYRDSSENVSLWLQNY 460

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
              RYG S P +++AW +L  +VY                         S+T    +N+G
Sbjct: 461 VGARYGLSDPHLENAWELLRKSVY-------------------------SLTSKSIENHG 495

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGN---ELSASNTYRYDLIDLTR 363
                  +L         P +WY+ S+VI A    I       EL     +  DL+D+ R
Sbjct: 496 N-----YILTHRPKLNSTPLIWYNGSDVIGAATELIRGATLHRELCHERLFHQDLVDVVR 550

Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
           QAL    ++ +L ++  ++ N      + SRR L  +  +D +L+    FLLG WL  ++
Sbjct: 551 QALQVRVSDEYLQMMSHFKANSLIDFEEHSRRLLHCIRVLDKVLSTDPNFLLGSWLRDSR 610

Query: 424 QLAQ-NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSG 467
           + A  + + + Q+E+NAR QIT W  N +     + DY +K W+G
Sbjct: 611 ESAGLDRDLQDQFEFNARNQITRWGPNGE-----IVDYASKMWNG 650


>gi|260642393|ref|ZP_05415712.2| alpha-N-acetylglucosaminidase [Bacteroides finegoldii DSM 17565]
 gi|260622285|gb|EEX45156.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides finegoldii DSM
           17565]
          Length = 735

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 153/530 (28%), Positives = 261/530 (49%), Gaps = 39/530 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF+G+VPA L+ ++P AKI  +  W       R   ++ +D  D L+  I   F+E+
Sbjct: 235 ILPAFAGHVPAELKELYPEAKIYTMSQWGGYDEKYR---SHFIDPMDSLYSVIQHRFLEE 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY- 126
           Q K YG T+HIY  D F+E   P  + E++S++   IY  +QS DS A WL   W+F + 
Sbjct: 292 QTKVYG-TNHIYGIDPFNEVDSPNWNEEFLSNVSDKIYKSIQSVDSAAQWLQMTWMFYHA 350

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W  P++K+ LN+VP  KL++LD + +   IW  ++Q+YG PYIWC L NF GN  + 
Sbjct: 351 KEKWTQPRIKSFLNAVPQDKLILLDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLA 410

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G L+ + F            + G+G+++EG++ NP++Y+ + E A+++  + V  WI  +
Sbjct: 411 GDLNDVDFKIDRLFKEGGDNVYGLGVTLEGLDVNPLMYEFVFERAWEN-SIPVHQWIANW 469

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           +  R G     I  AW  LY  +Y  T  A      ++ A P ++               
Sbjct: 470 AQCRGGNVDNHIIKAWKQLYEKIY--TSAALCGQAVLMNARPQLE--------------- 512

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
                E         Y +  LW    E+++A  ++          + Y +D+I++ RQ L
Sbjct: 513 ---GVEGWNTLPGYDYKNIDLWEIWKELLKAEGVY---------HSEYHFDVINVGRQVL 560

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
                +      + Y+  D  G     +R  +L+ D+D LL C     +G W++ A+  A
Sbjct: 561 GNLFADYRDKFADCYRKKDLEGTKVWGQRMDQLLLDVDRLLCCSPVLSIGKWIKDARDFA 620

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            NE+++K YE NAR  +T+W     ++ + L DY N+ W GL R +Y  R   + + +I 
Sbjct: 621 VNEQEQKYYEENARCILTVW----GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIA 676

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++     F  + + ++  +   +W      +P+ S  + +  ++ L  KY
Sbjct: 677 AMTRHKNFDEEKFHQDITQFEYEWTLKNEDFPITSGENPISLAKELILKY 726


>gi|383124408|ref|ZP_09945072.1| hypothetical protein BSIG_3565 [Bacteroides sp. 1_1_6]
 gi|251839096|gb|EES67180.1| hypothetical protein BSIG_3565 [Bacteroides sp. 1_1_6]
          Length = 732

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 154/530 (29%), Positives = 258/530 (48%), Gaps = 39/530 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPA L+ ++P+AKI Q+  W       R   ++ +D  D L+  I R F+E+
Sbjct: 233 VLPAFAGHVPAELKTIYPNAKIYQMSQWGGFDEKYR---SHFIDPMDSLYQVIQRRFLEE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K YG T HIY  D F+E   P  S ++++++ + IY  +   DS A WL   W+F YD
Sbjct: 290 QTKVYG-TDHIYGIDPFNEVDSPDWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYD 348

Query: 128 PF-WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W  P++++ L +VP  KL++LD + +   IW  ++++YG PYIWC L NF GN  + 
Sbjct: 349 KKKWTQPRIRSFLKAVPDDKLILLDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIA 408

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G L+ I F            + G+G ++EG + NP++Y+ + + A+ +  V    WI  +
Sbjct: 409 GNLNDIDFKIKRLFKEGGDNVYGLGATLEGFDVNPLMYEFVFDQAWDY-PVTTDQWITNW 467

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           S+ R G     I  AW  L+  +Y  T+ A      ++ A P        +T  K  N  
Sbjct: 468 SMCRGGDQDANIIKAWRALHQNIY--TEYAICGQSVLMNARP-------RLTGTKSWNTN 518

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
             +            Y +  LW    E+++A  +          ++ +R+D+I++ RQ L
Sbjct: 519 PGIH-----------YANNDLWQIWKELLKARNI---------NNSDFRFDVINIGRQVL 558

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
               +E        Y   D  G+ + S R   L+ D+D LL+C     +G WL+ A+   
Sbjct: 559 GNLFSEYRDQFTACYNRKDTTGMREWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARDCG 618

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
               ++  YE NAR  +T+W     ++ + L DY N+ W GL R +Y  R   +   +I 
Sbjct: 619 TTVSEKDYYEENARCILTVW----GQQDTQLNDYANRGWGGLTRSFYRERWKRFTDGVIG 674

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++     F    + ++  +   +W   ++ +P+ S  D +  +  L  KY
Sbjct: 675 AVSKNKPFDEDKFHQDITQFEYNWTLQKDSFPIVSEEDPIQIADSLILKY 724


>gi|53711968|ref|YP_097960.1| alpha-N-acetylglucosaminidase [Bacteroides fragilis YCH46]
 gi|52214833|dbj|BAD47426.1| alpha-N-acetylglucosaminidase precursor [Bacteroides fragilis
           YCH46]
          Length = 718

 Score =  240 bits (612), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 163/540 (30%), Positives = 261/540 (48%), Gaps = 70/540 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           VLP + G VP   +         +LG   +V SDP   C Y     L  +DP F EI   
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273

Query: 64  FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           + ++  K YG+ ++ Y+ D F E  NT  VD    + + G A+   M+  +  AVW+ Q 
Sbjct: 274 YYKELEKLYGK-ANFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
           W  +  P         ++ ++  G L++LDL +E +P W  S      K  YG   +I+C
Sbjct: 329 WQANPRP--------KMIENLKAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYC 380

Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
           ML N+ GN+ ++G +D++   F   +A    + T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHASATLKGVGMTPEGIENNPVMYELVMELPW 440

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
           + ++   + W+ +Y   RYG   P +Q AW  L +++YN     T +  +  V  A P  
Sbjct: 441 RPDRFTKEEWLKEYVKARYGVDDPVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP-- 498

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
                   E  YQ                SS+     +Y   EVI A  L ++  +    
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQEVIEAARLMVSVADRFKG 536

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
           +N + YDL+D+ RQALA+    +   +  AY+  D       S +FL+L+   D LL   
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTR 596

Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
             F +G W+E A+ L    E+++ YEWNAR QIT W +    +   LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGDTPEEKELYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656

Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
           D+Y  R  +YF ++ + +E      +     ++  +   W    N Y  E+ GD +  ++
Sbjct: 657 DFYYMRWKLYFDFLSQRIEGKTPAEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711


>gi|452988463|gb|EME88218.1| glycoside hydrolase family 89 protein [Pseudocercospora fijiensis
           CIRAD86]
          Length = 772

 Score =  240 bits (612), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 144/509 (28%), Positives = 259/509 (50%), Gaps = 51/509 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP F+G VP  +  ++P+A       W   +++  +     L+  DPLF  + ++FI +
Sbjct: 238 VLPCFTGFVPTQISRLYPNASFVNGSRWNGFQAE--YTNVTFLEPFDPLFTTLQKSFISK 295

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
           Q++ YG  S IY  D ++EN P      Y+ ++ +     +++ D +A+W +QGWLF S 
Sbjct: 296 QIEAYGNVSSIYTLDQYNENDPFSGELAYLKNVTSNTIKSLKAADPEAIWFIQGWLFYSS 355

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW   +++A L  V    +++LDLF+E +P W  +  ++G P+IWC LH++ GN  ++
Sbjct: 356 ADFWTDERVEAYLGGVANEDMLILDLFSESQPQWQRTNSYFGKPWIWCQLHDYGGNQGLH 415

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G ++++   PV+A  ++ +TMVG+G +MEG E N ++YD++ + A+  E +D  ++ + +
Sbjct: 416 GQVENVTINPVQALANKTSTMVGMGSTMEGQEGNEIIYDILLDQAWSKEPIDSDSYFHDW 475

Query: 247 SVRRY-GRSVPA-IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
              RY G  +P+ +  AW+V+  TVYN TD   +    V  +  +++P+   +   +  +
Sbjct: 476 VTSRYAGSKLPSGLYTAWDVMRQTVYNSTD--IEAAEAVTKSIFELEPNTTGLLNRRGHH 533

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA--SNTYRYDLIDLT 362
               +    VL S  +                  +L+ AS +++       Y++DL+D T
Sbjct: 534 STLILYDPNVLVSAWN------------------DLYNASNDDIQLWDVKAYQFDLVDTT 575

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVF------QLSRRFLELVEDMDGLLACHDG--FL 414
           RQ LA     L+ + + +       G +      +  +  + L++D+D +L       F 
Sbjct: 576 RQVLANAFYPLYTDFVHSAN-KSVQGTYSPTKAEEKGKEMIMLLKDLDSVLEASGNAHFK 634

Query: 415 LGPWLESAKQLAQNEEQEKQ----------YEWNARTQITMWFDNTQEEASLLRDYGNKY 464
           L  W+ESA+  A  E+              YE+ AR QIT+W  N +     + DY +K 
Sbjct: 635 LSSWIESARLWAPAEDYADDKNTTAKIADFYEYTARNQITLWGPNGE-----ISDYASKQ 689

Query: 465 WSGLLRDYYGPRAAIYFKYMIESLESGDG 493
           W+GL+R YY PR   +  + + S  S +G
Sbjct: 690 WAGLIRSYYVPRWQRFVDFTLNSTTSMNG 718


>gi|423248659|ref|ZP_17229675.1| hypothetical protein HMPREF1066_00685 [Bacteroides fragilis
           CL03T00C08]
 gi|423253608|ref|ZP_17234539.1| hypothetical protein HMPREF1067_01183 [Bacteroides fragilis
           CL03T12C07]
 gi|392655237|gb|EIY48880.1| hypothetical protein HMPREF1067_01183 [Bacteroides fragilis
           CL03T12C07]
 gi|392657600|gb|EIY51231.1| hypothetical protein HMPREF1066_00685 [Bacteroides fragilis
           CL03T00C08]
          Length = 718

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 163/540 (30%), Positives = 261/540 (48%), Gaps = 70/540 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           VLP + G VP   +         +LG   +V SDP   C Y     L  +DP F EI   
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273

Query: 64  FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           + ++  K YG+ ++ Y+ D F E  NT  VD    + + G A+   M+  +  AVW+ Q 
Sbjct: 274 YYKELEKLYGK-ANFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
           W  +  P         ++ ++  G L++LDL +E +P W  S      K  YG   +I+C
Sbjct: 329 WQANPRP--------KMIENLKAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYC 380

Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
           ML N+ GN+ ++G +D++   F   +A    + T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHASATLKGVGMTPEGIENNPVMYELVMELPW 440

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
           + ++   + W+ +Y   RYG   P +Q AW  L +++YN     T +  +  V  A P  
Sbjct: 441 RPDRFTKEEWLKEYVKARYGVDDPVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP-- 498

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
                   E  YQ                SS+     +Y   EVI A  L ++  +    
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQEVIEAARLMVSVADRFKG 536

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
           +N + YDL+D+ RQALA+    +   +  AY+  D       S +FL+L+   D LL   
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTR 596

Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
             F +G W+E A+ L    E+++ YEWNAR QIT W +    +   LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGDTPEEKELYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656

Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
           D+Y  R  +YF ++ + +E      +     ++  +   W    N Y  E+ GD +  ++
Sbjct: 657 DFYYMRWKLYFDFLSQRIEGKTPAEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711


>gi|423269418|ref|ZP_17248390.1| hypothetical protein HMPREF1079_01472 [Bacteroides fragilis
           CL05T00C42]
 gi|423273021|ref|ZP_17251968.1| hypothetical protein HMPREF1080_00621 [Bacteroides fragilis
           CL05T12C13]
 gi|392701212|gb|EIY94372.1| hypothetical protein HMPREF1079_01472 [Bacteroides fragilis
           CL05T00C42]
 gi|392708585|gb|EIZ01692.1| hypothetical protein HMPREF1080_00621 [Bacteroides fragilis
           CL05T12C13]
          Length = 718

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 163/540 (30%), Positives = 261/540 (48%), Gaps = 70/540 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           VLP + G VP   +         +LG   +V SDP   C Y     L  +DP F EI   
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273

Query: 64  FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           + ++  K YG+ ++ Y+ D F E  NT  VD    + + G A+   M+  +  AVW+ Q 
Sbjct: 274 YYKELEKLYGK-ANFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
           W  +  P         ++ ++  G L++LDL +E +P W  S      K  YG   +I+C
Sbjct: 329 WQANPRP--------KMIENLKAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYC 380

Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
           ML N+ GN+ ++G +D++   F   +A    + T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHASATLKGVGMTPEGIENNPVMYELVMELPW 440

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
           + ++   + W+ +Y   RYG   P +Q AW  L +++YN     T +  +  V  A P  
Sbjct: 441 RPDRFTKEEWLKEYVKARYGVDDPVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP-- 498

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
                   E  YQ                SS+     +Y   EVI A  L ++  +    
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQEVIEAARLMVSVADRFKG 536

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
           +N + YDL+D+ RQALA+    +   +  AY+  D       S +FL+L+   D LL   
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTR 596

Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
             F +G W+E A+ L    E+++ YEWNAR QIT W +    +   LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGDTPEEKELYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656

Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
           D+Y  R  +YF ++ + +E      +     ++  +   W    N Y  E+ GD +  ++
Sbjct: 657 DFYYMRWKLYFDFLSQRMEGKAPAEI-----DFYAIEEPWTKAANPYSTEAEGDCIEVAK 711


>gi|60680169|ref|YP_210313.1| alpha-N-acetylglucosaminidase [Bacteroides fragilis NCTC 9343]
 gi|375357012|ref|YP_005109784.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis 638R]
 gi|383116930|ref|ZP_09937677.1| hypothetical protein BSHG_0978 [Bacteroides sp. 3_2_5]
 gi|60491603|emb|CAH06355.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis NCTC
           9343]
 gi|251947777|gb|EES88059.1| hypothetical protein BSHG_0978 [Bacteroides sp. 3_2_5]
 gi|301161693|emb|CBW21233.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis 638R]
          Length = 718

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 163/540 (30%), Positives = 261/540 (48%), Gaps = 70/540 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           VLP + G VP   +         +LG   +V SDP   C Y     L  +DP F EI   
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273

Query: 64  FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           + ++  K YG+ ++ Y+ D F E  NT  VD    + + G A+   M+  +  AVW+ Q 
Sbjct: 274 YYKELEKLYGK-ANFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
           W  +  P         ++ ++  G L++LDL +E +P W  S      K  YG   +I+C
Sbjct: 329 WQANPRP--------KMIENLKAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYC 380

Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
           ML N+ GN+ ++G +D++   F   +A    + T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHASATLKGVGMTPEGIENNPVMYELVMELPW 440

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
           + ++   + W+ +Y   RYG   P +Q AW  L +++YN     T +  +  V  A P  
Sbjct: 441 RPDRFTKEEWLKEYVKARYGVDDPVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP-- 498

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
                   E  YQ                SS+     +Y   EVI A  L ++  +    
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQEVIEAARLMVSVADRFKG 536

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
           +N + YDL+D+ RQALA+    +   +  AY+  D       S +FL+L+   D LL   
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTR 596

Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
             F +G W+E A+ L    E+++ YEWNAR QIT W +    +   LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGDTPEEKELYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656

Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
           D+Y  R  +YF ++ + +E      +     ++  +   W    N Y  E+ GD +  ++
Sbjct: 657 DFYYMRWKLYFDFLSQRIEGKTPAEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711


>gi|423282107|ref|ZP_17260992.1| hypothetical protein HMPREF1204_00530 [Bacteroides fragilis HMW
           615]
 gi|404582594|gb|EKA87288.1| hypothetical protein HMPREF1204_00530 [Bacteroides fragilis HMW
           615]
          Length = 718

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 163/540 (30%), Positives = 261/540 (48%), Gaps = 70/540 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           VLP + G VP   +         +LG   +V SDP   C Y     L  +DP F EI   
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273

Query: 64  FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           + ++  K YG+ ++ Y+ D F E  NT  VD    + + G A+   M+  +  AVW+ Q 
Sbjct: 274 YYKELEKLYGK-ANFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
           W  +  P         ++ ++  G L++LDL +E +P W  S      K  YG   +I+C
Sbjct: 329 WQANPRP--------KMIENLKAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYC 380

Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
           ML N+ GN+ ++G +D++   F   +A    + T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHASATLKGVGMTPEGIENNPVMYELVMELPW 440

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
           + ++   + W+ +Y   RYG   P +Q AW  L +++YN     T +  +  V  A P  
Sbjct: 441 RPDRFTKEEWLKEYVKARYGVDDPVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP-- 498

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
                   E  YQ                SS+     +Y   EVI A  L ++  +    
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQEVIEAARLMVSVADRFKG 536

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
           +N + YDL+D+ RQALA+    +   +  AY+  D       S +FL+L+   D LL   
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTR 596

Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
             F +G W+E A+ L    E+++ YEWNAR QIT W +    +   LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGDTPEEKELYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656

Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
           D+Y  R  +YF ++ + +E      +     ++  +   W    N Y  E+ GD +  ++
Sbjct: 657 DFYYMRWKLYFDFLSQRMEGKAPAEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711


>gi|313145188|ref|ZP_07807381.1| glycoside hydrolase family 89 [Bacteroides fragilis 3_1_12]
 gi|313133955|gb|EFR51315.1| glycoside hydrolase family 89 [Bacteroides fragilis 3_1_12]
          Length = 718

 Score =  239 bits (611), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 164/540 (30%), Positives = 260/540 (48%), Gaps = 70/540 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           VLP + G VP   +         +LG   +V SDP   C Y     L  +DP F EI   
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273

Query: 64  FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           + ++  K YG+ +  Y+ D F E  NT  VD    + + G A+   M+  +  AVW+ Q 
Sbjct: 274 YYKELEKLYGK-ADFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
           W  +  P         ++ ++  G L++LDL +E +P W  S      K  YG   +++C
Sbjct: 329 WQANPRP--------KMIENLGAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYC 380

Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
           ML N+ GN+ ++G +D++   F   +A     +T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHAGSTLKGVGMAPEGIENNPVMYELVMELPW 440

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
           + E+   + W+ +Y   RYG   P +Q AW  L +++YN     T +  +  V  A P  
Sbjct: 441 RAERFTKEEWLKEYVKARYGADDPVVQAAWTKLANSIYNSPKNLTQQGTHESVFCARP-- 498

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
                   E  YQ                SS+     +Y   +VI A  L ++  +    
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQDVIEAARLMVSVADRYKG 536

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
           +N + YDL+D+ RQALA+    +   +  AY+  D       S++FL L+   D LL   
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTR 596

Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
             F +G W+E A+ L    E++  YEWNAR QIT W +    +   LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGGTSEEKALYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656

Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
           D+Y  R  +YF  + + LE     ++     ++  +   W    N Y  E+ GD + T++
Sbjct: 657 DFYYMRWKLYFDSLSQKLEGKTPEKI-----DFYAVEEPWAKATNPYSAEAEGDCIETAK 711


>gi|393788286|ref|ZP_10376416.1| hypothetical protein HMPREF1068_02696 [Bacteroides nordii
           CL02T12C05]
 gi|392655959|gb|EIY49600.1| hypothetical protein HMPREF1068_02696 [Bacteroides nordii
           CL02T12C05]
          Length = 757

 Score =  239 bits (611), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 166/546 (30%), Positives = 259/546 (47%), Gaps = 70/546 (12%)

Query: 11  AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK 70
            FSG VP  L+  +P+A I Q  +W   K   +      LD TD LF  +GRAF+E+Q +
Sbjct: 236 GFSGYVPRELKEKYPTANINQQRSWCGFKGAAQ------LDPTDSLFTRMGRAFLEEQAR 289

Query: 71  EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW 130
            +G    +Y  D F E+ PP+D+PEY+ ++G  I+   +  D  + W MQ W    D   
Sbjct: 290 LFG-AHGVYAADPFHESAPPIDTPEYLKAVGERIHHLFRDFDPHSTWAMQSWSLRED--- 345

Query: 131 RPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQ-FYGVPYIWCMLHNFAGNIEMYGIL 189
                  ++ +VP   L++LDL  +     STSK  F+G   +   LHNF G I M+G L
Sbjct: 346 -------IVKAVPKDALLILDLNGK-----STSKALFWGYSTVVGNLHNFGGRINMHGDL 393

Query: 190 DSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVR 249
             +A          N  + G G+ ME +EQNPV Y+L  EM    + ++++AW+ QY+ R
Sbjct: 394 KLLASNQYSKAKRLNPAVCGSGLFMEAVEQNPVYYELAFEMPCHADSINLQAWLKQYATR 453

Query: 250 RYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV 309
           RYG   PA Q+AW +L +  Y      T+K+  ++ A P +D                 V
Sbjct: 454 RYGAFSPAAQEAWLLLLNGPYRRGTNGTEKS-SIVAARPALD-----------------V 495

Query: 310 SKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 369
            K     +    YD P L      VIRA  L +   ++LS S  YR+D++D+ RQ +   
Sbjct: 496 KKSGPNAALEIPYD-PTL------VIRAQSLLLKDIDKLSVSRPYRFDIVDVQRQLMTNL 548

Query: 370 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 429
              +     EA++  D       S RFLE++ DMD LL     +    WL  A+     +
Sbjct: 549 GQLIHRQAAEAFRKKDQCAFTLHSGRFLEMLADMDKLLRTRSEYSFDRWLTEARSWGDTD 608

Query: 430 EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 489
           E++   E +A + +T+W  +       + DY  + WSGL+  YY PR   ++  + + L+
Sbjct: 609 EEKNLMERDATSLVTIWGADGDPR---IFDYSWREWSGLISGYYLPRWQKFYAMLQQHLD 665

Query: 490 SGDGFRLKD----WRREWIKLTNDWQN-------------GRNVYPVESNGDALITSQWL 532
            G  +        + RE  +  ND+ N             G+   P+ + GD +I  + L
Sbjct: 666 VGTSYEEAGLPLIYGREAFR-ANDFYNGLAEWELAYVDTYGKARTPI-TEGDEIIMVKQL 723

Query: 533 YNKYLQ 538
           ++KYL+
Sbjct: 724 FDKYLK 729


>gi|424666301|ref|ZP_18103337.1| hypothetical protein HMPREF1205_02176 [Bacteroides fragilis HMW
           616]
 gi|404573840|gb|EKA78592.1| hypothetical protein HMPREF1205_02176 [Bacteroides fragilis HMW
           616]
          Length = 718

 Score =  239 bits (611), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 164/540 (30%), Positives = 260/540 (48%), Gaps = 70/540 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           VLP + G VP   +         +LG   +V SDP   C Y     L  +DP F EI   
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273

Query: 64  FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           + ++  K YG+ +  Y+ D F E  NT  VD    + + G A+   M+  +  AVW+ Q 
Sbjct: 274 YYKELEKLYGK-ADFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
           W  +  P         ++ ++  G L++LDL +E +P W  S      K  YG   +++C
Sbjct: 329 WQANPRP--------KMIENLKAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYC 380

Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
           ML N+ GN+ ++G +D++   F   +A     +T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHAGSTLKGVGMTPEGIENNPVMYELVMELPW 440

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
           + E+   + W+ +Y   RYG   P +Q AW  L +++YN     T +  +  V  A P  
Sbjct: 441 RAERFTKEEWLKEYVKARYGADDPVVQAAWTKLANSIYNSPKNLTQQGTHEAVFCARP-- 498

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
                   E  YQ                SS+     +Y   +VI A  L ++  +    
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQDVIEAARLMVSVADRYKG 536

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
           +N + YDL+D+ RQALA+    +   +  AY+  D       S++FL L+   D LL   
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTR 596

Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
             F +G W+E A+ L    E++  YEWNAR QIT W +    +   LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGGTSEEKALYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656

Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
           D+Y  R  +YF  + + LE     ++     ++  +   W    N Y  E+ GD + T++
Sbjct: 657 DFYYMRWKLYFDSLSQKLEGKTPEKI-----DFYAVEEPWAKATNPYSAEAEGDCIETAK 711


>gi|423280158|ref|ZP_17259071.1| hypothetical protein HMPREF1203_03288 [Bacteroides fragilis HMW
           610]
 gi|404584494|gb|EKA89159.1| hypothetical protein HMPREF1203_03288 [Bacteroides fragilis HMW
           610]
          Length = 718

 Score =  239 bits (611), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 164/540 (30%), Positives = 260/540 (48%), Gaps = 70/540 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           VLP + G VP   +         +LG   +V SDP   C Y     L  +DP F EI   
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273

Query: 64  FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           + ++  K YG+ +  Y+ D F E  NT  VD    + + G A+   M+  +  AVW+ Q 
Sbjct: 274 YYKELEKLYGK-ADFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
           W  +  P         ++ ++  G L++LDL +E +P W  S      K  YG   +++C
Sbjct: 329 WQANPRP--------KMIENLGAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYC 380

Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
           ML N+ GN+ ++G +D++   F   +A     +T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHAGSTLKGVGMTPEGIENNPVMYELVMELPW 440

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
           + E+   + W+ +Y   RYG   P +Q AW  L +++YN     T +  +  V  A P  
Sbjct: 441 RAERFTKEEWLKEYVKARYGADDPVVQAAWTKLANSIYNSPKNLTQQGTHESVFSARP-- 498

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
                   E  YQ                SS+     +Y   +VI A  L ++  +    
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQDVIEAARLMVSVADRYKG 536

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
           +N + YDL+D+ RQALA+    +   +  AY+  D       S++FL L+   D LL   
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTR 596

Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
             F +G W+E A+ L    E++  YEWNAR QIT W +    +   LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGGTSEEKALYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656

Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
           D+Y  R  +YF  + + LE     ++     ++  +   W    N Y  E+ GD + T++
Sbjct: 657 DFYYMRWKLYFDSLSQKLEGKTPEKI-----DFYAVEEPWTKATNPYSAEAEGDCIETAK 711


>gi|265765312|ref|ZP_06093587.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_16]
 gi|263254696|gb|EEZ26130.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_16]
          Length = 718

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 163/540 (30%), Positives = 261/540 (48%), Gaps = 70/540 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           VLP + G VP   +         +LG   +V SDP   C Y     L  +DP F EI   
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273

Query: 64  FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           + ++  K YG+ ++ Y+ D F E  NT  VD    + + G A+   M+  +  AVW+ Q 
Sbjct: 274 YYKELEKLYGK-ANFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
           W  +  P         ++ ++  G L++LDL +E +P W  S      K  YG   +I+C
Sbjct: 329 WQANPRP--------KMIENLKAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYC 380

Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
           ML N+ GN+ ++G +D++   F   +A    + T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHASATLKGVGMTPEGIENNPVMYELVMELPW 440

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
           + ++   + W+ +Y   RYG   P +Q AW  L +++YN     T +  +  V  A P  
Sbjct: 441 RPDRFTKEEWLKEYVKARYGVDDPVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP-- 498

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
                   E  YQ                SS+     +Y   EVI A  L ++  +    
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQEVIEAARLMVSVADRFKG 536

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
           +N + YDL+D+ RQALA+    +   +  AY+  D       S +FL+L+   D LL   
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTR 596

Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
             F +G W+E A+ L    E+++ YEWNAR QIT W +    +   LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGDTPEEKEFYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656

Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
           D+Y  R  +YF ++ + +E      +     ++  +   W    N Y  E+ GD +  ++
Sbjct: 657 DFYYMRWKLYFDFLSQRMEGKAPAEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711


>gi|379334158|gb|AFD03088.1| putative alpha-N-acetylglucosaminidase [uncultured bacterium 8]
          Length = 726

 Score =  238 bits (608), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 164/524 (31%), Positives = 259/524 (49%), Gaps = 40/524 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNW-FSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
           VL  FSG+VP  L      A+ T L  W F V          +LD  DPLF E G   + 
Sbjct: 218 VLQGFSGHVPQELIAER-GARSTTLPWWDFEVG---------MLDPRDPLFEEFGTTLLT 267

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
           +Q + +G T H+Y  D F E TPPV  P  ++ +  A++  M + D  A W++Q W FSY
Sbjct: 268 EQTRLFG-TDHLYAADPFIETTPPVSDPADLAQVARAVHGVMTAVDDRATWVLQAWPFSY 326

Query: 127 -DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
              +W P +  A L+++P   +++LDL+AE +P+W  +  +   P++WCMLH+  G   +
Sbjct: 327 RSRYWTPERTGAFLDAIPDDGMLILDLWAEHRPVWQRTDGYRKKPWVWCMLHSLGGRPGL 386

Query: 186 YGILDSIAFGPVEART-SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
           YG LD IA G   A+  +   ++ G+G SME    +PV+Y+L++++A+Q    DV+AW+ 
Sbjct: 387 YGKLDEIATGAARAQADARGGSLSGIGASMEAFGGDPVLYELLADVAWQGSVDDVRAWLE 446

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
            ++  RYGR+ P +  AW++L+ +VY  ++G       VIV  P +        EG  ++
Sbjct: 447 TWTRARYGRATPGLLRAWDLLHDSVY-ASEGPGPPG-SVIVGRPTL--------EGDLRH 496

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                 +  V  ++  S D P      +       L   +  E SA    R DL D+T Q
Sbjct: 497 ------ELPVHLADPPSPDVP-----PALAEAWALLADEATQEDSAGPLGR-DLCDVTAQ 544

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            L   A E      +A    DA G  + +R  L+ +ED+D LLA      L  WL  A+ 
Sbjct: 545 VLTHVACERQWRAADAALARDADGFQRAARALLDTIEDLDTLLATRPEHRLDGWLADARG 604

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
            A    +   YE +AR  +T+W        S L DY  ++W+GL+  +Y PR   +++++
Sbjct: 605 WATTPAEADLYETDARRLLTLWGHTR----SKLHDYSGRHWAGLVGTFYLPRWRSWYEHI 660

Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALIT 528
             +LE+G  +R +++    +     W   RN       G A  T
Sbjct: 661 ARALETGSPYRAEEFEASLLAQEERWVADRNGPTTPEAGTAGAT 704


>gi|336408181|ref|ZP_08588675.1| hypothetical protein HMPREF1018_00690 [Bacteroides sp. 2_1_56FAA]
 gi|335939481|gb|EGN01355.1| hypothetical protein HMPREF1018_00690 [Bacteroides sp. 2_1_56FAA]
          Length = 718

 Score =  238 bits (608), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 162/540 (30%), Positives = 260/540 (48%), Gaps = 70/540 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           +LP + G VP   +         +LG   +V SDP   C Y     L  +DP F EI   
Sbjct: 225 MLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273

Query: 64  FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           + ++  K YG+ ++ Y+ D F E  NT  VD    + + G A+   M+  +  AVW+ Q 
Sbjct: 274 YYKELEKLYGK-ANFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
           W  +  P         ++  +  G L++LDL +E +P W  S      K  YG   +I+C
Sbjct: 329 WQANPRP--------KMIEDLKAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYC 380

Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
           ML N+ GN+ ++G +D++   F   +A    + T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHASATLKGVGMTPEGIENNPVMYELVMELPW 440

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
           + ++   + W+ +Y   RYG   P +Q AW  L +++YN     T +  +  V  A P  
Sbjct: 441 RPDRFTKEEWLKEYVKARYGVDDPVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP-- 498

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
                   E  YQ                SS+     +Y   EVI A  L ++  +    
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQEVIEAARLMVSVADRFKG 536

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
           +N + YDL+D+ RQALA+    +   +  AY+  D       S +FL+L+   D LL   
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTR 596

Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
             F +G W+E A+ L    E+++ YEWNAR QIT W +    +   LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGDTPEEKELYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656

Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
           D+Y  R  +YF ++ + +E      +     ++  +   W    N Y  E+ GD +  ++
Sbjct: 657 DFYYMRWKLYFDFLSQRIEGKTPAEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711


>gi|325299497|ref|YP_004259414.1| alpha-N-acetylglucosaminidase [Bacteroides salanitronis DSM 18170]
 gi|324319050|gb|ADY36941.1| Alpha-N-acetylglucosaminidase [Bacteroides salanitronis DSM 18170]
          Length = 723

 Score =  238 bits (608), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 162/535 (30%), Positives = 263/535 (49%), Gaps = 62/535 (11%)

Query: 8   VLPAFSGNVPA-ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
           VLP +SG VP  A Q +  +    +L N F+  +         L  TD  F EI   + E
Sbjct: 227 VLPGYSGMVPHDAHQKLGLNVTEPELWNGFTRPA--------FLMPTDKRFAEIAALYYE 278

Query: 67  QQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 124
           +Q K +G+ ++ Y+ D F E  N   VD      + G A+   M+  +  AVW++QGW  
Sbjct: 279 EQEKLFGKANY-YSMDPFHELENAGEVD----FDAAGKAVMDAMKQVNPKAVWVVQGW-- 331

Query: 125 SYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHN 178
           + +P  RP  MK L N    G L++LDLF+E +P      IW   K +    +++CML N
Sbjct: 332 TENP--RPEMMKNLKN----GDLLILDLFSECRPMWGIPSIWKREKGYEQHDWLFCMLEN 385

Query: 179 FAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 237
           F  N+ ++G +D +       + +     + G+G++MEG E NPV+++LM E+ ++ EK+
Sbjct: 386 FGANVGLHGRMDQLLNNFYLTKNNPLAAHLKGIGLTMEGSENNPVMFELMCELPWRPEKI 445

Query: 238 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSII 295
             ++W+ +Y   RYG     I+ AW +L   +YNC  G   +  +  +    P ++   +
Sbjct: 446 TKESWLKEYLAARYGAKDEKIEQAWMILADGIYNCPFGNNQQGPHESIFCGRPSMNNFQV 505

Query: 296 SVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 355
           S +  K +NY  P S EA                       A  L + + ++   +N + 
Sbjct: 506 S-SWSKMENYYDPTSTEA-----------------------AARLMLEAADKFRGNNNFE 541

Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
           YDL+D+ RQALA     ++   I  ++  D     + S+ FL L+   D LLA    F +
Sbjct: 542 YDLVDIVRQALADRGRIVYNRAIADFKSFDKRSYARHSKEFLNLLLAQDRLLATRSEFRV 601

Query: 416 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
           G W+  A+ L    E++  YEWNAR QIT W +    +   LRDY +K W+G+L+D+Y  
Sbjct: 602 GRWINQARSLGNTPEEKDLYEWNARVQITTWGNRECADKGGLRDYAHKEWNGILKDFYYK 661

Query: 476 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
           R A +++ +   L   DG  ++D   +W  +   W    N Y  E+ GD + T++
Sbjct: 662 RWAAWWEMLQGVL---DGGEMQD--IDWYAMEEPWTLQHNPYKAEAEGDCIETAR 711


>gi|282877910|ref|ZP_06286719.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
           35310]
 gi|281299911|gb|EFA92271.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
           35310]
          Length = 723

 Score =  238 bits (606), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 157/506 (31%), Positives = 240/506 (47%), Gaps = 45/506 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSG+VPA L+ ++P A I  LG W     + R    + L+  DPLF +I + ++E+
Sbjct: 236 VLPAFSGHVPAKLKELYPQANIKYLGRWAGFSDNYR---CHFLNPEDPLFAKIQKMYLEE 292

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q   +G T HIY  D F+E  PP   PEY+  +   IY  + S D  A W+   W+F ++
Sbjct: 293 QKALFG-TDHIYGIDPFNEVDPPSWKPEYLKEISHNIYRTVTSVDPGAEWMQMSWMFYHN 351

Query: 128 PF-WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W P ++KALL  V  GK+ +LD   E   +W T+  FYG PYIWC L NF GN  + 
Sbjct: 352 KKQWTPKRIKALLTGVSRGKMSLLDYHCENVELWKTTNNFYGQPYIWCYLGNFGGNTTIT 411

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +        EA   +N  ++G+G ++EG++     Y+ +   A+     D K WI+  
Sbjct: 412 GNVKESGQRLNEALNKKNKNLIGIGSTLEGLDVIQFPYEYILTQAWTATPAD-KEWIDNL 470

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R  G S P ++ AW +L++ +Y      T   R + +  P + P +     GKYQ   
Sbjct: 471 ADRHVGFSSPKLRQAWQILFNDIY------TQIPRSLGI-LPALRPIL-----GKYQERR 518

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
             ++       E        +W   S+V                 N Y+ DLI + RQ L
Sbjct: 519 TEITYPTKRLEE--------VWKLMSDVSEC------------DRNEYQLDLIAVGRQVL 558

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
                +L L +   Y   D  G+ +      E++ D+D L A +    +G W++ A+   
Sbjct: 559 GNKFLKLKLELDSCYVNKDLVGLQRTGNTMKEVLVDLDYLTAGNSRCSIGKWIDDARAYG 618

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            N+ ++  YE NAR  IT W          L DY N+ WSGL+R YY  R ++Y   +  
Sbjct: 619 NNDLEKAYYEKNARNLITTW-------GGSLNDYANRTWSGLIRTYYVRRWSMYIDELTA 671

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQN 512
           S+ SG  F  +   +   +   +W N
Sbjct: 672 SVMSGKPFDQQQLDKAIGEFEQNWVN 697


>gi|424665881|ref|ZP_18102917.1| hypothetical protein HMPREF1205_01756 [Bacteroides fragilis HMW
           616]
 gi|404574134|gb|EKA78885.1| hypothetical protein HMPREF1205_01756 [Bacteroides fragilis HMW
           616]
          Length = 732

 Score =  237 bits (605), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 164/540 (30%), Positives = 254/540 (47%), Gaps = 63/540 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VL  F G VP ++   FP+A I   G W + +          L  +DPLF ++ + F E+
Sbjct: 235 VLQGFYGMVPNSMITKFPNADIRDAGKWITYQRPA------FLVPSDPLFAKVAQIFYEE 288

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q K +G+ S  Y  D F E  N+  ++    I+   + IY  M++ + DA+W++QGW   
Sbjct: 289 QEKLFGK-SRYYGGDPFHEGGNSEGIN----ITEAASDIYKAMKANNPDAIWVLQGW--G 341

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIW-----STSKQFYGV---PYIWCMLH 177
            +P +      ALL  +  G+ ++LDL +  +P W     S S +  G     +IWC L 
Sbjct: 342 ANPSY------ALLKGLKQGEALILDLMSCARPQWGGDPSSQSHREDGYLDHNWIWCALP 395

Query: 178 NFAGNIEMYGILDSIAFGPVEARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEK 236
           NF G I MYG L S A G + A        V GVG + EGI  NP+ YD++ +MA++ + 
Sbjct: 396 NFGGRIGMYGKLQSYATGVIRAEHHPKGKYVCGVGTTPEGIGTNPIDYDMVYDMAWRTDS 455

Query: 237 VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 296
           +DVK+WI  Y+  RYG      + A   L  +VYNC   A         A P +      
Sbjct: 456 IDVKSWIANYTTYRYGSPNNNAKAAMQQLSTSVYNCPWAADGPQESYFCARPSLKID--- 512

Query: 297 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRY 356
                                 TSS+   HL+Y    V++ALE  + + NEL   +TYRY
Sbjct: 513 ---------------------RTSSWGTAHLYYQPINVLQALEHLLKAENELKEIDTYRY 551

Query: 357 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
           D++D+TRQ LA Y   +   I +AY   D       + +FL+++ D D LL+    FLLG
Sbjct: 552 DVVDVTRQMLADYGKYIHKCIADAYYGKDTEKFDFYTSKFLQMISDQDLLLSTRKEFLLG 611

Query: 417 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
            ++  A     N  +++ +  NA+ QIT W        S L +Y +K W+G+L   Y PR
Sbjct: 612 KFIRQADACGSNPMEKRMFINNAKRQITTW----ASVNSSLHEYAHKEWNGILGTLYAPR 667

Query: 477 AAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
              YF Y+   LE  +   +     ++  +  DW   +  +        +  ++ +Y+ Y
Sbjct: 668 WKAYFDYLRTKLEGKNPKEI-----DFFTMETDWVESKKEFSAVPIKKEIEIAKTIYHNY 722


>gi|380697007|ref|ZP_09861866.1| alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
          Length = 703

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 152/530 (28%), Positives = 259/530 (48%), Gaps = 39/530 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF+G+VPA L+ ++P AKI  +  W       R   ++ +D  D L+  I R F+E+
Sbjct: 203 ILPAFAGHVPAELKELYPEAKIYTMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEE 259

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY- 126
           Q K YG T+HIY  D F+E   P  + E++S++   IY  +Q  DS A WL   W+F + 
Sbjct: 260 QTKVYG-TNHIYGIDPFNEVDSPNWNEEFLSNVSDKIYKSIQDVDSAAQWLQMTWMFYHA 318

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W  P++K+ LN+VP  KL++LD + +   IW  ++Q+YG PYIWC L NF GN  + 
Sbjct: 319 KEKWTQPRIKSFLNAVPQDKLILLDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLA 378

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G L+ + F            + G+G+++EG++ NP++Y+ + E A+++  +    WI  +
Sbjct: 379 GDLNDVDFKIDRLFKEGGDNVYGLGVTLEGLDVNPLMYEFVFERAWEN-SMPAHQWIANW 437

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           +  R G     I  AW  LY  +Y  T  A      ++ A P ++               
Sbjct: 438 AQCRGGNVDNHIVKAWKQLYEKIY--TSAALCGQAVLMNARPQLE--------------- 480

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
                E         Y +  LW    E+++A  ++          + Y +D+I++ RQ L
Sbjct: 481 ---GVEGWNTLPGYDYKNIDLWEIWKELLKAEGVY---------HSEYHFDVINVGRQVL 528

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
                +      + Y+           +R  +L+ D+D LL C   F +G W++ AK  A
Sbjct: 529 GNLFADYRDKFTDCYRKKKLEETKVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDAKDFA 588

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            NE+++K YE NAR  +T+W     ++ + L DY N+ W GL R +Y  R   + + +I 
Sbjct: 589 VNEQEQKYYEENARCILTVW----GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIA 644

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++     F  + + ++  +   +W      +P+ S  + +  ++ L  KY
Sbjct: 645 AMTRHKNFDEEKFHQDITQFEYEWTLKNEDFPITSEENPISLAKELILKY 694


>gi|153808241|ref|ZP_01960909.1| hypothetical protein BACCAC_02529 [Bacteroides caccae ATCC 43185]
 gi|423219048|ref|ZP_17205544.1| hypothetical protein HMPREF1061_02317 [Bacteroides caccae
           CL03T12C61]
 gi|149129144|gb|EDM20360.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
           43185]
 gi|392625814|gb|EIY19870.1| hypothetical protein HMPREF1061_02317 [Bacteroides caccae
           CL03T12C61]
          Length = 752

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 162/494 (32%), Positives = 235/494 (47%), Gaps = 54/494 (10%)

Query: 11  AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQ 67
            FSG VP  L++ +P AKI         +  P WC       LD TD LF  +GR F+E+
Sbjct: 233 GFSGYVPRELKDKYPEAKI---------RLQPGWCGFKGAGQLDPTDALFATLGRDFLEE 283

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           + K YG T  IY  D F E+ PPV++PEY+S++G AIY  ++  D  A W MQ W     
Sbjct: 284 EKKLYG-TYGIYAADPFHESAPPVNTPEYLSAVGHAIYKLIKDFDPKAKWAMQAWSL--- 339

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
              R P +KA    VP   L++LDL  E        K F+G P +   LHNF G I M+G
Sbjct: 340 ---REPIVKA----VPQNDLIILDLNGEK---IKGRKGFWGYPAVEGNLHNFGGRINMHG 389

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
            L  +A         +   + G G+ ME IEQNPV YDL  EM     +V ++ W+ QY+
Sbjct: 390 DLRLLASNQYMTALKQYPNVCGSGLFMEAIEQNPVYYDLAFEMPLHKGEVAIEEWLKQYA 449

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
            RRYG   P+ Q A   L    Y      T+  R  I+A     P++     G     G 
Sbjct: 450 NRRYGAVSPSAQQAMICLLEGPYRPGTNGTE--RSSIIA---ARPALNVKKSGPNAGLGI 504

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
           P                    YS   VI+A  L +   ++L  S  YR+D+ID+ RQ + 
Sbjct: 505 P--------------------YSPLLVIQAEGLLLKDADKLKNSEPYRFDVIDVQRQMMT 544

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
                +     EA+   D       S+RFL+++ED+D LL     F    WL SA+    
Sbjct: 545 NMGQVIHKRAAEAFLNRDKEAFALHSKRFLQMLEDVDELLRTRPEFNFDRWLTSARSWGD 604

Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
            EE++   E++A + +T+W      + S+  DY  + W+GL++ YY PR   ++  + E 
Sbjct: 605 TEEEKNLLEYDATSLVTIW--GADGDPSIF-DYSWREWTGLIKGYYLPRWTKFYAMLQEH 661

Query: 488 LESGDGFRLKDWRR 501
           L++G  +  +  R+
Sbjct: 662 LDNGTTYSEEGLRQ 675


>gi|404406328|ref|ZP_10997912.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
          Length = 738

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 146/488 (29%), Positives = 226/488 (46%), Gaps = 39/488 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPA L  ++P AKI+++  W   +   R   ++ LD  DPLF  I R F+ +
Sbjct: 243 VLPAFAGHVPAELGEIYPEAKISRMSKWGGFEDRYR---SHFLDPLDPLFARIQREFLAE 299

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q   +G T HIY  D F+E  PP   PE+++ +   IY  M   D +A WL   WLF  D
Sbjct: 300 QTALFG-TDHIYGADPFNEVDPPSWEPEFLARVSRTIYDTMTEADPEAEWLQMTWLFYLD 358

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W   +++A + +VP  K+++LD + E   +W  +  ++G PY WC L NF GN  + 
Sbjct: 359 RDKWHDDRIEAFVTAVPQDKMLLLDYYCENTEVWRQTHSYHGQPYFWCYLGNFGGNTMLV 418

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G  D ++             + G+G ++EG++ NP +YD + E A+    VD   W +  
Sbjct: 419 GNFDEVSKRIDGVLAEGGNNLRGLGSTLEGLDSNPFMYDYVFERAWDF-PVDDDRWFDAL 477

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R  G      + AW+ L   VY                       I S   G      
Sbjct: 478 ADRYLGYEDTGYRRAWDALRKNVY-----------------------ITSSKYGHCPLLN 514

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
              + E +L   T +     + Y   E+       I +G+  S  +TYRY L+++ RQ L
Sbjct: 515 ARPTLEGILTGTTDA----EIKYDNDELFEVWAKMIDAGD--SGRDTYRYWLVNVGRQTL 568

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
                 L      A +  D   + +L    LEL  D++ L A H  F +  W++ ++   
Sbjct: 569 GNLFLPLRDGFTAACRAKDLARMKELRSEMLELAADLETLTAQHGAFSMQKWIDDSRSFG 628

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
              E+   YE N RT +T W D  Q     + DY N+ WSGL+ DYY  R  ++    + 
Sbjct: 629 TTPEERDYYEVNGRTLLTTWGDRAQS----INDYANRTWSGLVADYYAERWRMFLDAAVG 684

Query: 487 SLESGDGF 494
           ++E+G  F
Sbjct: 685 AVEAGRKF 692


>gi|392584963|gb|EIW74305.1| glycoside hydrolase family 89 protein [Coniophora puteana
           RWD-64-598 SS2]
          Length = 772

 Score =  236 bits (603), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 148/504 (29%), Positives = 254/504 (50%), Gaps = 60/504 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF G VP A+  +FP+A I     +  + +  ++     L   DPL+ ++  +F+ +
Sbjct: 251 VLPAFPGFVPPAMHTLFPNASIVNGSEYPGIPA--QYSNDSFLAPFDPLYAQLQSSFLAK 308

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q +  G  +H++  D ++EN+P      Y++++  + ++ +++ D DA+WLMQGWLF  D
Sbjct: 309 QTEALGNVTHVWTIDQYNENSPYSGDLTYLANIANSTFASLRAHDPDAIWLMQGWLFFAD 368

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
            PFW   ++ A L+ +P   +++LDLF++V P W     + G  ++WC +H+F GN+ + 
Sbjct: 369 EPFWTSDRVDAYLDQIPNDGMIILDLFSDVYPQWQRLDSYRGKSWVWCEVHDFGGNMGLE 428

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G    +  GPV+A  S N++M GVG++MEG+E N ++YD++ + A+    +D  A+   +
Sbjct: 429 GNFSVVTNGPVDALNSPNSSMKGVGLAMEGLEGNEIIYDVLLDQAWSAAPLDRDAYAKAW 488

Query: 247 SVRRY-------------GRSVPAIQ-DAWNVLYHTVYNCTD----GATDKNRDVIVAFP 288
           + RR+               S+PA   +AW  L  TVY+ T+    GAT           
Sbjct: 489 ATRRFHLPTANSSTTTATNTSIPASAIEAWQTLASTVYSSTNPNVWGATK---------- 538

Query: 289 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE- 347
               S+I +       Y  P S      + TS             ++ AL   +A+G   
Sbjct: 539 ----SLIELAPSLGGMYSAPSSTIIFYDTNTS-------------LVPALRGLVAAGTSA 581

Query: 348 --LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN--DAHGVFQLSRRFLELVEDM 403
             L A + +R D ID+ RQ LA    + +     AY  +   +  +   + R +++++D+
Sbjct: 582 PALWALDEFRTDSIDVARQLLANRFADAYTATTGAYNASGPGSAALNATAARMMQIIDDL 641

Query: 404 DGLLACHDGFLLGPWLESAKQLA---QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDY 460
           D LL  H+ +LL   + SA+  A    +E      E+ AR+Q+T+W        S+L DY
Sbjct: 642 DRLLMTHEPYLLSSRIASARAWAGDGGDEAYADYLEYEARSQVTLW----GPVPSVLNDY 697

Query: 461 GNKYWSGLLRDYYGPRAAIYFKYM 484
            +K W GL+  YY  R   + +YM
Sbjct: 698 ASKVWGGLVGTYYRQRWTAFVEYM 721


>gi|400595379|gb|EJP63180.1| alpha-N-acetylglucosaminidase [Beauveria bassiana ARSEF 2860]
          Length = 761

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 147/490 (30%), Positives = 248/490 (50%), Gaps = 30/490 (6%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP F G VPAAL+ + P   I +   W  V  +     T  L+ TD  + E+   FI+ 
Sbjct: 245 VLPGFPGFVPAALKKLRPDVNIAEAPVWVDVPRNN--TATAFLNPTDKTYAELQSLFIKN 302

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
           Q+KE+G  +++Y  D F+E  P     +YI+ + ++ Y G+ + +  A+WLMQGWLF S 
Sbjct: 303 QIKEFGNVTNVYTVDQFNEINPSSGDTKYITDVSSSTYKGITAANPAAIWLMQGWLFYSS 362

Query: 127 DPFWRPPQMKALLNSVP-LGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             FW   ++ A L   P    +++LDLF+E +P W  ++ ++G P+IWC LH+F GN  +
Sbjct: 363 QSFWTQQRVDAYLAGPPGQDDMIILDLFSESQPQWQRTRSYFGRPWIWCELHDFGGNQAL 422

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           +G + ++    V+A   E+ ++VG G++ EG E N VVYD++ + A++   +D   +   
Sbjct: 423 HGKITNVTQNSVQA-LKESGSIVGYGLTPEGYEGNEVVYDILLDQAWEGSPIDTANYFRA 481

Query: 246 YSVRRYGRSVPAIQD---AWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
           ++  RY  +    +D   AW  L    Y+  D A      V V+   + PS+  +   + 
Sbjct: 482 WARNRYSAAGIIPEDVFTAWEQLRQHAYDVQDNAIPS---VGVSVYQLFPSLKGLVN-RT 537

Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 362
            +Y  P + +   K   + +   HL+Y+++         I S   L     +  D +D+T
Sbjct: 538 GHYPPPTALQYDPKVMKNIW---HLFYNST---------IDSPGLLQIP-AFHLDFVDVT 584

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLES 421
           RQ L     +++ +++  +Q      V Q L    L  +ED+D  L  ++ F    WL S
Sbjct: 585 RQVLGNAFIDIYTDLVNQFQATANATVIQDLGNSMLSFIEDLDMALNTNEHFTFKKWLNS 644

Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
           A+   Q+        +NAR+Q+T+W      E+  L DY  K WSG+++ YYG R  I+ 
Sbjct: 645 AESWGQSIGAPDAVAFNARSQVTVW----STESRALDDYAAKAWSGIVKSYYGERWRIFI 700

Query: 482 KYMIESLESG 491
             ++ + E G
Sbjct: 701 NSLVSAREQG 710


>gi|393783261|ref|ZP_10371436.1| hypothetical protein HMPREF1071_02304 [Bacteroides salyersiae
           CL02T12C01]
 gi|392669540|gb|EIY63028.1| hypothetical protein HMPREF1071_02304 [Bacteroides salyersiae
           CL02T12C01]
          Length = 724

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 147/530 (27%), Positives = 253/530 (47%), Gaps = 46/530 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP AL+ +FP A I  LG W     + R    + L+  + LF  I + +I++
Sbjct: 235 VLPAFAGHVPGALKRIFPEANIQNLGKWAGFAEEYR---CHFLNPEEALFATIQKQYIKE 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + +G T HIY  D F+E  PP   PEY+S + A +Y  + + D  A W+   W+F +D
Sbjct: 292 QTRLFG-TDHIYGVDPFNEVDPPSWEPEYLSKVSADMYHTLTAADPKAEWMQMTWMFYFD 350

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W  P++KA+L  VP GK+V+LD   E   +W T++ F+G PYIWC L NF GN  + 
Sbjct: 351 RKDWTAPRVKAMLTGVPQGKMVLLDYHCENVELWKTTEHFHGQPYIWCYLGNFGGNTTLT 410

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +            +  +   G+G ++EG++     Y+ + E A+     D ++W+N  
Sbjct: 411 GNVKESGARLDNTLINGGSNFKGIGSTLEGLDVMQFPYEYIFEKAWTL-NTDDRSWLNAL 469

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R  G +   +++AW++L++ VY             +   P++ P +            
Sbjct: 470 ADRHTGVTSEPVREAWDILFNQVYVQVP-------RTLAVLPNLRPVM-----------N 511

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
           KP ++ ++              Y  + +++A +  + + +     +  R D+I + RQ L
Sbjct: 512 KPNNRTSIN-------------YPNTALLQAWQKLLQAPD--CNRDALRLDIITVGRQLL 556

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
             Y   +  +    Y+  D   +   +    E++ D++ L A H    L  W+  A++  
Sbjct: 557 GNYFLTVKDDFDRMYEAKDLPALKARAAEMREILNDLERLNAFHSRCSLDKWISDARKYG 616

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
              E +  YE NAR  IT W          L DY ++ W+GL++DYY  R  +Y   ++ 
Sbjct: 617 NTPELKNYYEKNARNLITTW-------GGRLNDYASRTWAGLIKDYYSKRWDMYLDAVVA 669

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++E+   F  +    E+    + W +      V   GD LI +++L NKY
Sbjct: 670 AVENNREFDQEKLDGEFRLFEDSWVSSTRPVEVTPEGDLLIYARFLLNKY 719


>gi|392566857|gb|EIW60032.1| alpha-N-acetylglucosaminidase [Trametes versicolor FP-101664 SS1]
          Length = 747

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 160/516 (31%), Positives = 267/516 (51%), Gaps = 39/516 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V+P+F+G VP AL  + P+A I     W    +         L+  DPLF  + ++FI +
Sbjct: 231 VMPSFTGFVPRALAALHPNASIVTGSQWSGFPTS--LTNDSFLEPFDPLFATLQQSFIAK 288

Query: 68  QLKEYGR-TSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
           Q   YG   SH+Y  D ++EN P     +Y+ ++ A  ++ +++ D  AVWLMQGWLF  
Sbjct: 289 QQAAYGADISHVYTLDQYNENDPFSGDLDYLRNVSAGTFASLRAADPAAVWLMQGWLFFS 348

Query: 127 DP-FWRPPQMKALLNSVPLG-KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
           D  FW   ++ A L  VP    ++VLDL++E +P W+ +  + G  ++WC LH++ GNI 
Sbjct: 349 DAVFWTDDRVAAYLGGVPGNDSMIVLDLYSEAQPQWNRTASYSGKQWVWCELHDYGGNIG 408

Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
           M G LD +   P+ A +S  ++M GVG++MEG E N +VY ++ + A+    ++  ++++
Sbjct: 409 MEGNLDVLTHAPLTALSSPGSSMKGVGLTMEGQEGNEIVYGVLLDQAWSATSLNTSSYVS 468

Query: 245 QYSVRRYG-RSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
            +  RRY  + +P A QDAW +L  TVYN  D  T   +  I    ++ P++  +T    
Sbjct: 469 SWVSRRYPVKPLPKAAQDAWRILSTTVYNNQDPNT---QATIKGIYELAPALTGMTNRIG 525

Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI---ASGNELSASNTYRYDLI 359
            +   P S           YD      + + ++ AL+L +   A    LSA   + YD++
Sbjct: 526 HH---PTSIP---------YD------TDATMLSALKLLLEARAQHPTLSAVPEFVYDVV 567

Query: 360 DLTRQALAKYANELFLNIIEAYQLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
           D+ RQ L+     L+  +I+ Y      A  V    +  L L+ D+D LL+ ++ FLL  
Sbjct: 568 DVARQLLSNRFIGLYDTLIQTYNSTSSTAQSVSAAGQPLLALLTDLDALLSTNEHFLLSS 627

Query: 418 WLESAKQLAQNEEQEKQY-EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
           W+  A++ A        Y E+NAR Q+T+W  + +     + DY +K W+GL+  YY PR
Sbjct: 628 WIADARKWADGSASYGAYLEYNARNQVTLWGPDGE-----INDYASKAWAGLVGTYYKPR 682

Query: 477 AAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 512
            A +  Y+ E+  +G  +     +   + +  +W N
Sbjct: 683 WAAFVDYLAETKGTGQAYNATAVKSTMLAIGQEWGN 718


>gi|429740222|ref|ZP_19273924.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
 gi|429153947|gb|EKX96708.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
          Length = 730

 Score =  235 bits (599), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 152/530 (28%), Positives = 244/530 (46%), Gaps = 44/530 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPA L+ +FP A I  LG W     D ++ C + L+  +PLF +I + F+E+
Sbjct: 235 VLPAFAGHVPAELKRIFPDANIKSLGKWGGF--DEQYLC-HFLNPGEPLFAKIQKLFLEE 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q   +G T HIY  D F+E  PP   P Y+  +   +Y  + + D  A W+  GW+F YD
Sbjct: 292 QTALFG-TDHIYGVDPFNEGEPPSWEPAYLKEISKNMYGTLTAVDPKAEWMQMGWMFYYD 350

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W P ++KA L  VP GK+ +LD   E   +W T+  FYG PYIWC L NF GN  + 
Sbjct: 351 KKVWTPKRVKAFLTGVPQGKMSLLDYHCENVELWKTNDGFYGQPYIWCYLGNFGGNTTLT 410

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +         A  +    M+GVG ++EG++     Y+ + +  + H     + WI++ 
Sbjct: 411 GNVKETGKRLDAALKAARRNMLGVGSTLEGLDVIQFPYEYVFDKVWTHSDKGNQQWIDEL 470

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R  G + P+++ AW +L+  ++    G              + PS   V    +    
Sbjct: 471 ADRHAGFTSPSVRKAWQILFDEIFVQVPGTY-----------SILPSRSPVLNDNH---- 515

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
                     SE +   +P       E + +L L +         N  + DLI + RQ L
Sbjct: 516 ----------SERTEIKYPA---QRLEEVWSLLLDVPQ----CERNELQVDLIAVGRQVL 558

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
                 +      AY   D   + Q +    EL+ D+D L + +    +  W++ A+ L 
Sbjct: 559 GNKFLAVKSEFDAAYAAKDITLLRQKAYEMEELLSDLDCLTSFNTRCTVNKWIDDARALG 618

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
           +N E +  YE NAR  IT+W          L DY ++ W GL+  YYG R  +Y   ++ 
Sbjct: 619 RNAEMKNYYERNARYLITLW-------GGHLSDYASRAWGGLIGSYYGGRWRLYIHDILA 671

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           S ++G  F  K +  +  +    W +      +    D L   + +++KY
Sbjct: 672 SAQTGKPFDQKAFDEKRSQFEQTWVHSTTPITLPQRNDLLTFCKMMFSKY 721


>gi|333031143|ref|ZP_08459204.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
 gi|332741740|gb|EGJ72222.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
          Length = 723

 Score =  234 bits (598), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 157/530 (29%), Positives = 243/530 (45%), Gaps = 46/530 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP+ L+++FP A I  LG W       R  C +L +  DPLF +I R F+E+
Sbjct: 234 VLPAFAGHVPSELKHLFPEADIQHLGKWAGFADKYR--CNFL-NPNDPLFAKIQRLFLEE 290

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY- 126
           Q + +G T HIY  D F+E  PP   PEY+  + A +Y  +   D  A WL   WLF + 
Sbjct: 291 QTRLFG-TDHIYGVDPFNEVDPPSWEPEYLKKVAADMYRTLTDVDPKAKWLQMTWLFYHG 349

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W  P+++ALL  VP  +L +LD   E   +W T+  F+G PYIWC L NF GN  + 
Sbjct: 350 KKKWTAPRIEALLTGVPQDELYLLDYHCENVELWKTTDYFHGQPYIWCYLGNFGGNTTIT 409

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +            +      G+G ++EG++     Y+ + + A+    +D  +W+   
Sbjct: 410 GNVKESGQRLENTLINGGNNFKGIGSTLEGLDVMQFPYEYIFDKAWTFN-MDDNSWVENL 468

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R  G+   A ++AW +L++ VY     +       +   P+  P +            
Sbjct: 469 ADRHLGKKSEAYREAWKILFNDVYVQVPKS-------LGVLPNFRPEM-----------S 510

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
           KP +K  V  ++  + D   +W    EV                 + Y  DLI + RQ L
Sbjct: 511 KP-NKRTV--NDYKNKDLVKVWAKLLEVKEC------------TRDAYIIDLITVGRQVL 555

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
             Y   +     + YQ  D  G+     +  E++ D++ L A H+   L  W+  A+ L 
Sbjct: 556 GNYFLVVKNEFDQMYQFKDLPGLESRGAKLREILNDLENLTAFHNHCTLEKWISDARALG 615

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
              E +  YE NAR  IT W          L DY ++ WSGL++DYY  R  +Y   + E
Sbjct: 616 NTIELKDYYEKNARNLITTW-------GGSLNDYASRTWSGLIKDYYAKRWNLYIDSVTE 668

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           +L+    F   +   +   L   W N          GD L  S++L++KY
Sbjct: 669 ALKENKKFNQSELNEKLNILEEAWVNKVETVTSYEQGDILELSKYLFDKY 718


>gi|393785791|ref|ZP_10373937.1| hypothetical protein HMPREF1068_00217 [Bacteroides nordii
           CL02T12C05]
 gi|392661410|gb|EIY54996.1| hypothetical protein HMPREF1068_00217 [Bacteroides nordii
           CL02T12C05]
          Length = 727

 Score =  233 bits (595), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 153/530 (28%), Positives = 250/530 (47%), Gaps = 46/530 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPAAL+ ++P A I  LG W       R    Y L+  +PLF  I + F+++
Sbjct: 235 VLPAFAGHVPAALKRIYPEANIQHLGKWAGFADTYR---CYFLNPEEPLFATIQKHFLQE 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + +G T HIY  D F+E  PP   PEY+S + + +Y  + + D  A W+   W+F +D
Sbjct: 292 QTRLFG-TDHIYGVDPFNEVDPPSWEPEYLSQVSSDMYRTLTAADPKAEWMQMTWMFYHD 350

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W  P++KALL  VP  K+ +LD   E   +W  ++ F+G PYIWC L NF GN  + 
Sbjct: 351 RKDWTAPRIKALLTGVPQDKMFLLDYHCENVELWKNTEHFHGQPYIWCYLGNFGGNTTLT 410

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +         A  +  + + G+G ++EG++     Y+ + E A+    +D +AW+   
Sbjct: 411 GNVKESGDRLDNALINGGSNLRGIGSTLEGLDVMQFPYEYIFEKAWDL-NLDNEAWLQNL 469

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R  G     +++AW++L++ +Y              V  P         T G   NY 
Sbjct: 470 ADRHAGTVSQPVREAWDILFNQIY--------------VQVPK--------TLGVLPNY- 506

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
           +PV  +   ++           YS + +++A E  + + +     +  R D+I + RQ L
Sbjct: 507 RPVMNKPNRRTVID--------YSNATLLQAWEKLLQATD--CNRDALRLDIITVGRQLL 556

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
             Y   +  +    Y + D  G+   +    E++ D+D L A H    L  WL  A+ L 
Sbjct: 557 GNYFLIVKDDFDRMYTVKDLPGLKARAAEMKEILNDLDRLNAFHSRCALDKWLADARALG 616

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
              E +  YE NAR  IT W          L DY ++ W+GL++DYY  R  +Y   +I 
Sbjct: 617 TTPEVKDYYEKNARNLITTW-------GGSLNDYASRTWAGLIKDYYSKRWDMYMDAVIS 669

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
           ++E    F  K          + W +  +   V   G+ +  +++L  KY
Sbjct: 670 AVEGNREFDQKKLDESIKNFEDAWVDSTDPILVAPQGELMQYARFLLQKY 719


>gi|261199246|ref|XP_002626024.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis SLH14081]
 gi|239594232|gb|EEQ76813.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis SLH14081]
          Length = 752

 Score =  233 bits (594), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 140/491 (28%), Positives = 255/491 (51%), Gaps = 40/491 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF G VP A+  V P A++     W  +  +P++  T  L   DP  + + ++FI +
Sbjct: 240 ILPAFPGYVPRAVTRVLPDAQVVNASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISK 297

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
            ++ YG  +H Y  D F+E  P    PE++  +       ++S D +A W+MQGWLF  +
Sbjct: 298 SIEAYGNVTHFYTLDQFNEMIPSSGDPEFLRKVSETTMEAIKSVDPEATWVMQGWLFYIF 357

Query: 127 DPFWRPPQMKALLNS-VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             +W   +++A L++      +++LDLFAE  P+W  +K F+G  ++WC +  F GN  +
Sbjct: 358 ADYWTTERIEAYLSAGKKFRDMLILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGL 417

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG + +I  GP +A  +++  MVGVG + EG   N +V+ L+ +  +    +D + + + 
Sbjct: 418 YGHVANITEGPAQA-MAQHPNMVGVGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHD 476

Query: 246 YSVRRY---GRSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           +  RRY   GR+VP  + +AW +L  + YN T+         +V  P +  ++ + +   
Sbjct: 477 WVTRRYSSHGRTVPNELYEAWQLLRLSAYNNTN---------LVDAPLLPHALFAASPSI 527

Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
                   +K  +L  E   YD        +++++A  L I     L   ++Y+YD++D+
Sbjct: 528 N-------AKMPMLFIEGLLYD-------PADMLKAWGLMIKGA--LFGDSSYQYDIVDV 571

Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           TRQ L+     +  ++   Y+      VF  +  + L +++ +D +L+ ++ F L  W+ 
Sbjct: 572 TRQVLSDAFTLVLQDLKVKYKGGAPASVFMPIGDKLLIILKALDAVLSMNENFWLSSWIS 631

Query: 421 SAKQLAQNEEQEKQ-YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
           +A+  A ++ +    +E NAR QIT+W      E  +L DY  K W+GL+  YY PR  +
Sbjct: 632 AARASAGDDSEAADFFEHNARNQITIW----GSEVGVLDDYAQKQWAGLVSGYYTPRWRM 687

Query: 480 YFKYMIESLES 490
           + +Y+ ++  S
Sbjct: 688 FLEYLKDTPAS 698


>gi|423248233|ref|ZP_17229249.1| hypothetical protein HMPREF1066_00259 [Bacteroides fragilis
           CL03T00C08]
 gi|423253182|ref|ZP_17234113.1| hypothetical protein HMPREF1067_00757 [Bacteroides fragilis
           CL03T12C07]
 gi|392657082|gb|EIY50719.1| hypothetical protein HMPREF1067_00757 [Bacteroides fragilis
           CL03T12C07]
 gi|392660340|gb|EIY53954.1| hypothetical protein HMPREF1066_00259 [Bacteroides fragilis
           CL03T00C08]
          Length = 732

 Score =  233 bits (594), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 155/496 (31%), Positives = 236/496 (47%), Gaps = 58/496 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VL  F G VP ++   FP+A I   G W + +          L  +DPLF ++   F E+
Sbjct: 235 VLQGFYGMVPNSMITKFPNADIRNAGKWITYQRPA------FLVPSDPLFAKVAEIFYEE 288

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q K +G  S  Y  D F E  N+  ++    I+   + IY  M++ + +A+W++QGW  S
Sbjct: 289 QKKLFGE-SRYYGGDPFHEGGNSKGIN----ITEAASNIYKAMKTNNPNAIWVLQGW--S 341

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQ--------FYGVPYIWCMLH 177
            +P        ALL  +  G+ +VLDL A  +P W             F    +IWC L 
Sbjct: 342 GNP------SVALLKGLKHGEALVLDLMACARPQWGGEPSSSFHREDGFLDHNWIWCALP 395

Query: 178 NFAGNIEMYGILDSIAFGPVEARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEK 236
           NF G I MYG L S A G ++A        V G+G + EGI  NP+ YD++ +MA++ + 
Sbjct: 396 NFGGRIGMYGKLQSYATGVIKAEHHPKGKYVCGIGTTPEGIGTNPINYDMVYDMAWRTDS 455

Query: 237 VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 296
           +D+K+WI  Y+  RYG      + A   L  +VYNC   A         A P +    +S
Sbjct: 456 IDIKSWIANYTTYRYGSENSNAKAAMLQLSTSVYNCPWAADGPQESYFCARPSLKIDYVS 515

Query: 297 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRY 356
                                   S+   HL+Y    V++ALE  + +  EL   +TYRY
Sbjct: 516 ------------------------SWGTAHLYYQPINVLQALEHLLKAEKELGYIDTYRY 551

Query: 357 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
           D++D+TRQ LA Y   +   I +AY+  +       + +FL+++ D D LL+    FLLG
Sbjct: 552 DVVDITRQMLADYGKYIHKCISDAYKEKNIKKFDLYTSKFLQMILDQDLLLSTRKEFLLG 611

Query: 417 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
            ++  A     N  +++ +  NA+ QIT W        S L +Y +K W+G+L   Y PR
Sbjct: 612 EYIRQADTCGSNPTEKRMFINNAKRQITSW----TSVNSSLHEYAHKEWNGILSTLYAPR 667

Query: 477 AAIYFKYMIESLESGD 492
             +YF Y+   LE  +
Sbjct: 668 WKVYFDYLHAKLEGKN 683


>gi|423269877|ref|ZP_17248849.1| hypothetical protein HMPREF1079_01931 [Bacteroides fragilis
           CL05T00C42]
 gi|423272668|ref|ZP_17251615.1| hypothetical protein HMPREF1080_00268 [Bacteroides fragilis
           CL05T12C13]
 gi|392700723|gb|EIY93885.1| hypothetical protein HMPREF1079_01931 [Bacteroides fragilis
           CL05T00C42]
 gi|392708745|gb|EIZ01850.1| hypothetical protein HMPREF1080_00268 [Bacteroides fragilis
           CL05T12C13]
          Length = 732

 Score =  233 bits (593), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 155/496 (31%), Positives = 236/496 (47%), Gaps = 58/496 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VL  F G VP ++   FP+A I   G W + +          L  +DPLF ++   F E+
Sbjct: 235 VLQGFYGMVPNSMITKFPNADIRDAGKWITYQRPA------FLVPSDPLFAKVAEIFYEE 288

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q K +G  S  Y  D F E  N+  ++    I+   + IY  M++ + +A+W++QGW  S
Sbjct: 289 QKKLFGE-SRYYGGDPFHEGGNSKGIN----ITEAASNIYKAMKTNNPNAIWVLQGW--S 341

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQ--------FYGVPYIWCMLH 177
            +P        ALL  +  G+ +VLDL A  +P W             F    +IWC L 
Sbjct: 342 GNP------SVALLKGLKHGEALVLDLMACARPQWGGEPSSSFHREDGFLDHNWIWCALP 395

Query: 178 NFAGNIEMYGILDSIAFGPVEARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEK 236
           NF G I MYG L S A G ++A        V G+G + EGI  NP+ YD++ +MA++ + 
Sbjct: 396 NFGGRIGMYGKLQSYATGVIKAEHHPKGKYVCGIGTTPEGIGTNPINYDMVYDMAWRTDS 455

Query: 237 VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 296
           +D+K+WI  Y+  RYG      + A   L  +VYNC   A         A P +    +S
Sbjct: 456 IDIKSWIANYTTYRYGSENSNAKAAMLQLSTSVYNCPWAADGPQESYFCARPSLKIDYVS 515

Query: 297 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRY 356
                                   S+   HL+Y    V++ALE  + +  EL   +TYRY
Sbjct: 516 ------------------------SWGTAHLYYQPINVLQALEHLLKAEKELGYIDTYRY 551

Query: 357 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
           D++D+TRQ LA Y   +   I +AY+  +       + +FL+++ D D LL+    FLLG
Sbjct: 552 DVVDITRQMLADYGKYIHKCISDAYKEKNIKKFDLYTSKFLQMILDQDLLLSTRKEFLLG 611

Query: 417 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
            ++  A     N  +++ +  NA+ QIT W        S L +Y +K W+G+L   Y PR
Sbjct: 612 EYIRQADTCGSNPTEKRMFINNAKRQITSW----TSVNSSLHEYAHKEWNGILSTLYAPR 667

Query: 477 AAIYFKYMIESLESGD 492
             +YF Y+   LE  +
Sbjct: 668 WKVYFDYLHAKLEGKN 683


>gi|383122982|ref|ZP_09943669.1| hypothetical protein BSIG_0276 [Bacteroides sp. 1_1_6]
 gi|251841923|gb|EES70003.1| hypothetical protein BSIG_0276 [Bacteroides sp. 1_1_6]
          Length = 730

 Score =  232 bits (591), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 148/533 (27%), Positives = 262/533 (49%), Gaps = 41/533 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSG+VP  L+ ++P AKI ++  W    S  R   ++ ++  D LF  I + ++E+
Sbjct: 234 VLPAFSGHVPKELKAIYPDAKIHEMSQWGGYDSKYR---SHFIEPMDSLFNIIQKMYLEE 290

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q   YG T HIY  D F+E   P  + ++++ +   IY  +   D++A WL   W+F +D
Sbjct: 291 QTAIYG-TDHIYGIDPFNEVDSPNWNEDFLAKVSKKIYESIYQVDAEAKWLQMTWMFYHD 349

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W  P++++ L +VP  KL++LD + +   IW  ++ +YG PY+WC L NF GN  M 
Sbjct: 350 QKKWTQPRIRSFLEAVPDDKLILLDYYCDSTEIWRNTEMYYGKPYMWCYLGNFGGNSMMV 409

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G LD +     +        + G+G ++EG + NP +Y+ + + A+ +  +    WI  +
Sbjct: 410 GNLDDVDVKIEKLFVEGGENVYGLGATLEGFDVNPFMYEFVFDQAWDY-PLTTDQWIQNW 468

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           +  R G     I  AW+ L+  +Y     AT     ++ A P +                
Sbjct: 469 AKCRGGNQDRHILKAWDSLHKKIYK--KYATAGQAVLMNARPML---------------- 510

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT-YRYDLIDLTRQA 365
             V  ++       +Y++  LW   +E+++A  +          +NT YR+D+I++ RQ 
Sbjct: 511 --VGTDSWNTYPDITYNNRDLWDIWTEMLKASHI----------NNTGYRFDVINVGRQV 558

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
           L    +    +  + Y   D  G+ + + +   L+ D D LL+C   F +G W++ A+  
Sbjct: 559 LGNLFSSFRDHFTQCYSEKDIDGMKKWADQMDSLLIDTDRLLSCETNFSIGKWIDDARSF 618

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
            + E +++ YE NAR  +T+W     ++A+ L DY N+ W GL   YY  R   +   +I
Sbjct: 619 GKTEAEKEYYEENARCILTVW----GQKATQLNDYANRGWGGLTYSYYRERWKRFTTEVI 674

Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
            +  SG  F  K + +       +W   +  +P+ S  + ++ ++ L  KY+Q
Sbjct: 675 TASLSGQKFDEKQFYQSITDFEYEWTLSKEHHPIISGENPILLAKTLSEKYMQ 727


>gi|239615395|gb|EEQ92382.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis ER-3]
          Length = 829

 Score =  232 bits (591), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 140/494 (28%), Positives = 252/494 (51%), Gaps = 46/494 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF G VP A+  V P A++     W  +  +P++  T  L   DP  + + ++FI +
Sbjct: 260 ILPAFPGYVPRAVTRVLPDAQVVNASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISK 317

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
            ++ YG  +H Y  D F+E  P    P+++  +       ++S D +A W+MQGWLF  +
Sbjct: 318 SIEAYGNVTHFYTLDQFNEMIPSSGDPKFLRKVSETTMEAIKSVDPEATWVMQGWLFYIF 377

Query: 127 DPFWRPPQMKALLNS-VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             +W   +++A L++      +++LDLFAE  P+W  +K F+G  ++WC +  F GN  +
Sbjct: 378 ADYWTTERIEAYLSAGKKFRDMLILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGL 437

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG + +I  GP EA  +++  MVGVG + EG   N +V+ L+ +  +    +D + + + 
Sbjct: 438 YGHVANITEGPAEA-MAQHPNMVGVGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHD 496

Query: 246 YSVRRYG---RSVPA-IQDAWNVLYHTVYNCTD--GATDKNRDVIVAFPDVDPSI-ISVT 298
           +  RRY    R+VP+ + +AW +L  + YN T+   A      +  A P ++  + +   
Sbjct: 497 WVTRRYSSHERTVPSELYEAWQLLRLSAYNNTNLVDAPLLPHALFAASPSINAKMPMLFI 556

Query: 299 EGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 358
           EG                          L Y  +++++A  L I     L   ++Y+YD+
Sbjct: 557 EG--------------------------LLYDPADMLKAWGLMIKGA--LFGDSSYQYDI 588

Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGP 417
           +D+TRQ L+     +  ++   Y+      VF  +  + L +++ +D +L+ ++ F L  
Sbjct: 589 VDVTRQVLSDAFTLVLQDLKVKYKGGAPASVFMPIGDKLLIILKALDAVLSMNENFWLSS 648

Query: 418 WLESAKQLAQNEEQEKQ-YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
           W+ +A+  A +E +    +E NAR QIT+W      E  +L DY  K W+GL+  YY PR
Sbjct: 649 WISAARASAGDESEAADFFEHNARNQITIW----GSEVGVLDDYAQKQWAGLVSGYYTPR 704

Query: 477 AAIYFKYMIESLES 490
             ++ +Y+ ++  S
Sbjct: 705 WRMFLEYLKDTPAS 718


>gi|29345848|ref|NP_809351.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29337741|gb|AAO75545.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 730

 Score =  232 bits (591), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 149/533 (27%), Positives = 262/533 (49%), Gaps = 41/533 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSG+VP  L+ ++P AKI ++  W    S  R   ++ ++  D LF  I + ++E+
Sbjct: 234 VLPAFSGHVPKELKAIYPDAKIHEMSQWGGYDSKYR---SHFIEPMDSLFNIIQKMYLEE 290

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q   YG T HIY  D F+E   P  + ++++ +   IY  +   D++A WL   W+F +D
Sbjct: 291 QTAIYG-TDHIYGIDPFNEVDSPNWNEDFLAKVSKKIYESIYQVDAEAKWLQMTWMFYHD 349

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W  P++++ L +VP  KL++LD + +   IW  ++ +YG PY+WC L NF GN  M 
Sbjct: 350 QKKWTQPRIRSFLEAVPDDKLILLDYYCDSTEIWRNTEMYYGKPYMWCYLGNFGGNSMMV 409

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G LD +     +        + G+G ++EG + NP +Y+ + + A+ +  +    WI  +
Sbjct: 410 GNLDDVDVKIEKLFVEGGENVYGLGATLEGFDVNPFMYEFVFDQAWDY-PLTTDQWIQNW 468

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           +  R G     I  AW+ L+  +Y     AT     ++ A P      + V    +  Y 
Sbjct: 469 AKCRGGNQDRHILKAWDSLHKKIYK--KYATAGQAVLMNARP------MLVGTDSWNTY- 519

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT-YRYDLIDLTRQA 365
                         +Y++  LW   +E+++A  +          +NT YR+D+I++ RQ 
Sbjct: 520 -----------PDITYNNRDLWDIWTEMLKASHI----------NNTGYRFDVINVGRQV 558

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
           L    +    +  + Y   D  G+ + + +   L+ D D LL+C   F +G W++ A+  
Sbjct: 559 LGNLFSSFRDHFTQCYSEKDIDGMKKWADQMDALLIDTDRLLSCETNFSIGKWIDDARSF 618

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
            + E +++ YE NAR  +T+W     ++A+ L DY N+ W GL   YY  R   +   +I
Sbjct: 619 GKTEAEKEYYEENARCILTVW----GQKATQLNDYANRGWGGLTYSYYRERWKRFTTEVI 674

Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
            +  SG  F  K + +       +W   +  +P+ S  + ++ ++ L  KY+Q
Sbjct: 675 TASLSGQKFDEKQFYQSITDFEYEWTLSKEHHPIISGENPILLAKTLSEKYMQ 727


>gi|281200617|gb|EFA74835.1| alpha-N-acetylglucosaminidase [Polysphondylium pallidum PN500]
          Length = 688

 Score =  231 bits (589), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 113/248 (45%), Positives = 161/248 (64%), Gaps = 9/248 (3%)

Query: 7   AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
           AVLP F+G+VP AL+  +P A ITQLG W +         TY L+  DPLF +I +AF+ 
Sbjct: 249 AVLPGFAGHVPEALETHYPKANITQLGGWGTFSG------TYYLNPDDPLFSKIAQAFVI 302

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
            Q + YG T H YN D F+E  PP     Y+ +   ++++ + + D   +W++QGW    
Sbjct: 303 TQNQLYG-TDHFYNFDPFNELEPPSSDLTYLKNCSQSMFNNLIAADPQGIWVLQGWFLVD 361

Query: 127 DP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
           DP FW PPQ +A L+ VP+GK++VLDL+++V P W+++  +YG  +IWCMLHNF G   M
Sbjct: 362 DPEFWLPPQTEAFLSGVPIGKMIVLDLWSDVIPAWNSTNYYYGHNWIWCMLHNFGGRSGM 421

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG +  I+  P+EAR S +  MVG G++ E IEQN +VYDLMSEMA++    D+K W++Q
Sbjct: 422 YGKIPFISTNPIEAR-SLSPNMVGTGLTPEAIEQNVIVYDLMSEMAWRSTPPDLKEWVDQ 480

Query: 246 YSVRRYGR 253
           Y  RRYG+
Sbjct: 481 YVTRRYGK 488



 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 52/192 (27%), Positives = 83/192 (43%), Gaps = 8/192 (4%)

Query: 336 RALELFIASGNELSASNTYRYDLIDLTRQALAKY--ANELFLNIIEAYQLNDAHGVFQLS 393
             L     +   ++ ++T+ +DL ++T QAL      NEL LN   A+  N      + S
Sbjct: 489 HGLPFLSINDTSITNTSTFSFDLTEITTQALINLFMTNELQLN--SAFLNNSLEEFNKYS 546

Query: 394 RRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEE 453
              L +++D+  + +  +  L+G W   A+ L    E    YE NAR QIT+W       
Sbjct: 547 EALLSIIQDVYTIASTQEMLLVGHWTARARALTPANESTNLYEMNARNQITLW----GPT 602

Query: 454 ASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNG 513
            S + DY  K W GL  D+Y  R  ++ K +  SL S   F    ++     +   W   
Sbjct: 603 YSDVHDYAYKLWGGLTEDFYLARWTLFVKELQYSLTSSQPFNSTLFQTNCEAVEEVWNLQ 662

Query: 514 RNVYPVESNGDA 525
              YP    G++
Sbjct: 663 TYPYPTIPTGNS 674


>gi|327356744|gb|EGE85601.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis ATCC 18188]
          Length = 752

 Score =  231 bits (589), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 139/494 (28%), Positives = 252/494 (51%), Gaps = 46/494 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF G VP A+  V P A++     W  +  +P++  T  L   DP  + + ++FI +
Sbjct: 240 ILPAFPGYVPRAVTRVLPDAQVVNASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISK 297

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
            ++ YG  +H Y  D F+E  P    P+++  +       ++S D +A W+MQGWLF  +
Sbjct: 298 SIEAYGNVTHFYTLDQFNEMIPSSGDPKFLRKVSETTMEAIKSVDPEATWVMQGWLFYIF 357

Query: 127 DPFWRPPQMKALLNS-VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             +W   +++A L++      +++LDLFAE  P+W  +K F+G  ++WC +  F GN  +
Sbjct: 358 ADYWTTERIEAYLSAGKKFRDMLILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGL 417

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG + +I  GP EA  +++  MVGVG + EG   N +V+ L+ +  +    +D + + + 
Sbjct: 418 YGHVANITEGPAEA-MAQHPNMVGVGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHD 476

Query: 246 YSVRRYG---RSVPA-IQDAWNVLYHTVYNCTD--GATDKNRDVIVAFPDVDPSI-ISVT 298
           +  RRY    R+VP+ + +AW +L  + YN T+   A      +  A P ++  + +   
Sbjct: 477 WVTRRYSSHERTVPSELYEAWQLLRLSAYNNTNLVDAPLLPHALFAASPSINAKMPMLFI 536

Query: 299 EGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 358
           EG                          L Y  +++++A  L I     L   ++Y+YD+
Sbjct: 537 EG--------------------------LLYDPADMLKAWGLMIKGA--LFGDSSYQYDI 568

Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGP 417
           +D+TRQ L+     +  ++   Y+      VF  +  + L +++ +D +L+ ++ F L  
Sbjct: 569 VDVTRQVLSDAFTLVLQDLKVKYKGGAPASVFMPIGDKLLIILKALDAVLSMNENFWLSS 628

Query: 418 WLESAKQLAQNEEQEKQ-YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
           W+ +A+  A ++ +    +E NAR QIT+W      E  +L DY  K W+GL+  YY PR
Sbjct: 629 WISAARASAGDDSEAADFFEHNARNQITIW----GSEVGVLDDYAQKQWAGLVSGYYTPR 684

Query: 477 AAIYFKYMIESLES 490
             ++ +Y+ ++  S
Sbjct: 685 WRMFLEYLKDTPAS 698


>gi|336371253|gb|EGN99592.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336384013|gb|EGO25161.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 761

 Score =  231 bits (589), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 147/503 (29%), Positives = 255/503 (50%), Gaps = 46/503 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNW--FSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
           VLP+F+G VP A+  ++P+A I     W  F+++          L+  DPLF  +  +F+
Sbjct: 244 VLPSFTGFVPRAMHTLYPNASIVNGSQWSTFTIQH----TNDSFLEPFDPLFSTLQTSFM 299

Query: 66  EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
            +    YG  SHIY  D ++E  P   +  Y+SS+ +A ++ +++ D +AVW+MQGWLF 
Sbjct: 300 TKYAAAYGNVSHIYTLDQYNEMMPYSGNTSYLSSISSATFASLRATDPEAVWMMQGWLFY 359

Query: 126 -YDPFWRPPQMKALLNSVPLG-KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
            Y  FW   +++A L  VP    +++LDLF+E  P W     ++G  +IWC LH+F GN+
Sbjct: 360 IYASFWTDERVEAYLGGVPGNDSMIILDLFSEAYPQWQRLNSYFGKQWIWCELHDFGGNM 419

Query: 184 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 243
              G  +++   PV+A  +   TMVG+G++MEG E N ++YD++ + A+    ++  +++
Sbjct: 420 GFEGNFENVTTQPVKALATPGNTMVGMGLTMEGQEGNEIMYDVLFDQAWSPTPINRTSYV 479

Query: 244 NQYSVRRYGRSVP----AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
           + ++ RRY  +VP    A  +AW +L  TVYN  D         I         ++++T 
Sbjct: 480 SAWTSRRY--NVPNLPTAATEAWEILASTVYNNQDPLLQATIKSIFELEPAINGLVNLTV 537

Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYST-SEVIRALELFIASGNELSASNT---YR 355
            +    G P                  L+Y T + ++ AL+  + +  E SA +    ++
Sbjct: 538 LQ----GIPTG----------------LFYDTNTTIVPALQSLLQARQESSALDEVPEFQ 577

Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL--SRRFLELVEDMDGLLACHDGF 413
           YD++ + RQ LA    +L+ ++++ Y    +            + L++D+D +L     F
Sbjct: 578 YDVVYIIRQLLANRFIDLYTSLVDTYNSTTSSSSDVSTAGAPLITLLKDVDSVLLTDTHF 637

Query: 414 LLGPWLESAKQLAQNEEQEKQY-EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           LL  W+ +A+  A +      Y E+NAR QIT+W    +     + DY +K W GL+  Y
Sbjct: 638 LLSNWISAARNWAHDNSTYAAYLEYNARNQITLWGPRGE-----VHDYASKQWGGLVGTY 692

Query: 473 YGPRAAIYFKYMIESLESGDGFR 495
           Y  R   +  Y+  S  +G  + 
Sbjct: 693 YVQRWEEFVSYLSGSKANGTAYN 715


>gi|423241433|ref|ZP_17222546.1| hypothetical protein HMPREF1065_03169 [Bacteroides dorei
           CL03T12C01]
 gi|392641326|gb|EIY35103.1| hypothetical protein HMPREF1065_03169 [Bacteroides dorei
           CL03T12C01]
          Length = 754

 Score =  230 bits (587), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 162/558 (29%), Positives = 262/558 (46%), Gaps = 83/558 (14%)

Query: 8   VLPAFSGNVPAALQ--NVFPSAKITQLGNWFSVKSDPRWCCTY-------LLDATDPLFI 58
           VLP +SG +P+ L       S K  +  +  S +S       +       +L   DP F 
Sbjct: 229 VLPGYSGMIPSKLDLGKRIDSGKEKKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFT 288

Query: 59  EIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWL 118
           +I   F E+  K YG TS  Y+ D F E        ++    G AI   M+  +  AVW+
Sbjct: 289 QIANLFYEETEKLYG-TSDYYSIDPFHEAKSLPAGLDF-GKAGRAIMDAMKKANPKAVWV 346

Query: 119 MQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW----- 173
           +QGW  + +P  RP  MKAL      G L++LDLF+E +P+W       G+P IW     
Sbjct: 347 VQGW--TENP--RPEMMKAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKG 391

Query: 174 --------CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVY 224
                   C+L NF GN+ ++G +D +       + +     + G+G++MEGIE NPV++
Sbjct: 392 YEEHNWLFCLLENFGGNVGLHGRMDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMF 451

Query: 225 DLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVI 284
           +LM E+ ++ EK   + WI QY   RYG    +I+ AW +L + +YNC  G         
Sbjct: 452 ELMCELPWRAEKFTKEEWIKQYIRARYGTDDESIRQAWQILANGIYNCPAGNNQ------ 505

Query: 285 VAFPDVDPSIISVTEGKYQNY--GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI 342
                         +G +++   G+P    ++   + SS+     +Y  +    A  L +
Sbjct: 506 --------------QGPHESIFCGRP----SLNNFQASSWSKMCNYYDPTTTTEAARLMV 547

Query: 343 ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVED 402
           +  ++   +N + YDL+D+TRQA+A  A  ++   +  ++  D       +R+FLEL+  
Sbjct: 548 SVADKYRGNNNFEYDLVDITRQAIADRARIVYNYAVADFKSFDKKNYATHTRQFLELLMM 607

Query: 403 MDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGN 462
            D LL     F +G W++ A+ L    E++  YEWNAR QIT W +    +   LRDY +
Sbjct: 608 QDKLLGTRKEFKVGNWIQQARNLGITSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAH 667

Query: 463 KYWSGLLRDYYGPRAAIYFKYMIESLE--------------SGDGFRLKDWRREWIKLTN 508
           K W+GLLRD+Y  R   Y++ + + L+              + D   +     +W  L  
Sbjct: 668 KEWNGLLRDFYYKRWEKYWQVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEE 724

Query: 509 DWQNGRNVYPVESNGDAL 526
            W   +N+Y   + GD +
Sbjct: 725 PWTLAKNIYAASAEGDCI 742


>gi|126307952|ref|XP_001365931.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Monodelphis
           domestica]
          Length = 481

 Score =  230 bits (587), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 107/216 (49%), Positives = 146/216 (67%), Gaps = 4/216 (1%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G++P A   VFP A +T+L NW        + C+YLL   DPLF  +G  F+ +
Sbjct: 242 VLPAFAGHIPKAFTRVFPQANVTKLDNWIDFNCT--YSCSYLLAPEDPLFPVVGSLFLRE 299

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             KE+G T HIY+ D F+E  PP  +P Y+++  AA+Y  M + D DAVWL QGWLF   
Sbjct: 300 LAKEFG-TDHIYSADIFNEMDPPSSNPAYLAATTAAVYEAMVAVDVDAVWLFQGWLFQNH 358

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW+PPQMKA+L +VP G+ ++LDLFAE +P++S +  FYG P+IWCMLHNF GN  ++
Sbjct: 359 PDFWKPPQMKAVLEAVPRGRFLILDLFAESQPVYSRTNSFYGQPFIWCMLHNFGGNHGLF 418

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPV 222
           G+LD++  GP  AR   N+T+VG G+  EGI QN +
Sbjct: 419 GVLDAVNRGPSTARLFPNSTIVGTGIVPEGINQNEI 454


>gi|322702923|gb|EFY94542.1| alpha-N-acetylglucosaminidase, putative [Metarhizium anisopliae
           ARSEF 23]
          Length = 589

 Score =  229 bits (585), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 154/507 (30%), Positives = 250/507 (49%), Gaps = 54/507 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF G VP A   V P A  T+   W  +  D     T+L    D  +  + +AFI +
Sbjct: 61  VLPAFPGFVPPAFSRVQPDANTTKAPRWTGLP-DTNTRDTFL-SPLDTSYARLQQAFISK 118

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q++ +G  ++IY  D F+E  P  + P Y+S +    Y  + + +  AVWL+QGWLF   
Sbjct: 119 QIEAFGNVTNIYTLDQFNEMPPTSNEPSYLSQVSTYTYKALTAANPAAVWLLQGWLFLNS 178

Query: 128 PFWRPPQMKALLNSVPLG--KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             W   ++ A L   P G   ++VLDL++E +P W  +K ++G P+IWC LH+F GN+ M
Sbjct: 179 GLWTEERVTAYLGG-PEGHNSMLVLDLYSESRPQWQRTKGYFGRPWIWCQLHDFGGNMGM 237

Query: 186 YGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
           YG +  I    ++A RTS   ++ G GM+ EG E N VVY ++ + A+    +D   +  
Sbjct: 238 YGQISDITVQSMDALRTSP--SLSGFGMTPEGYEGNEVVYQMLFDQAWTTTPIDTSGYFY 295

Query: 245 QYSVRRYG--RSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
            Y VRRY       ++  AW++L   +Y+      +K+R V             V  G Y
Sbjct: 296 GYVVRRYAGVSQTNSLFQAWDILRQNIYD------NKDRQVPC-----------VGVGIY 338

Query: 303 QNYGKPVSKEAVLKSETSSYDHP-HLWYSTSEVIRALELFIASGNELSA---SNTYRYDL 358
           QN        + L + T ++  P  ++Y  + + +A  L I + NE+       T++ D+
Sbjct: 339 QN----APSLSGLVNRTGNWPPPTKVYYDPATLKKAHSLLIQAANEIPQLWDIPTFQLDV 394

Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRR---------------FLELVEDM 403
           +D+TRQ ++   N ++ + ++ +    +     +S R                L+ + D+
Sbjct: 395 VDVTRQVMSNAFNTMYTDYVQTFNSQLSRQKSHISNRGGLQRRDDFATKGKQLLDFLTDL 454

Query: 404 DGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNK 463
           D +LA +  F L  WL++A+  A+         +NAR+QIT W      E+  L DY  K
Sbjct: 455 DRVLATNQHFRLDSWLDAAQYWAKQTGANDLIAFNARSQITTWI----WESEALNDYAVK 510

Query: 464 YWSGLLRDYYGPRAAIYFKYMIESLES 490
            WSGL R YY  R +I+   + ++L S
Sbjct: 511 EWSGLTRSYYRGRWSIFVDGLNKALAS 537


>gi|374312699|ref|YP_005059129.1| alpha-N-acetylglucosaminidase [Granulicella mallensis MP5ACTX8]
 gi|358754709|gb|AEU38099.1| Alpha-N-acetylglucosaminidase [Granulicella mallensis MP5ACTX8]
          Length = 754

 Score =  229 bits (584), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 160/514 (31%), Positives = 238/514 (46%), Gaps = 63/514 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWF--SVKSDPRWCCTYLLD-ATDPLFIEIGRAF 64
           V PAF+G VP   + + P  +   L  W     K+ PR   T++L      L+ +IG+ F
Sbjct: 258 VAPAFAGFVPQGFKRLHPEVETFTL-LWLRKEFKTIPRSTRTFILHPGQQELYRQIGKKF 316

Query: 65  IEQQLKEYGRTSHIYNCDTFDENTPPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQG 121
           IE+   EYG   + Y  DTF+E   PV      E +   G  ++  +Q+GD    W+MQG
Sbjct: 317 IEEYKAEYGEVEY-YLADTFNELEVPVREDHRYEDLERFGRTVFESIQAGDPKGTWVMQG 375

Query: 122 WLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPI---------WSTSKQFYGVPY 171
           WLF YD  FW    ++ALL  +P  +++++D   ++ P          W   K F+G P+
Sbjct: 376 WLFVYDSDFWNKESVEALLRGIPNDRMLIIDYANDLAPSVQGKYLPGQWKLQKAFFGKPW 435

Query: 172 IWCMLHNFAGNIEMYGILDSIAFGPVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEM 230
           I  M H F GN  + G L  +A  P     S E   +VG GM  EGIE N VVY+LM++ 
Sbjct: 436 INGMAHTFGGNNNIKGNLKLMATEPSTVLASPERGNLVGWGMCPEGIENNEVVYELMTDA 495

Query: 231 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDV 290
            +Q E +D+  WI  Y   RYG   PA+Q AW +L  + Y+     T   +    A P V
Sbjct: 496 GWQSEAIDLATWIPAYCRSRYGDCPPAMQQAWELLLKSAYSSHIWMT---KQAWQAEPSV 552

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
            P   SV  G       P  +                        RA+ELF++   +L+ 
Sbjct: 553 HPIAASVDAG-------PTFQ------------------------RAVELFLSCAPQLAK 581

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
           S  YR DLI+   QA+    +E     ++A            + R +E +  +DGL+   
Sbjct: 582 SELYRNDLIEFVSQAVGGRVDEALALAVQAGDAKQDEDAVAHAARAVEWMRRIDGLMNLR 641

Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
               L  W+++ +  A+ +++   Y+ NAR  IT W          L DY ++ WSGL+R
Sbjct: 642 PDRRLETWMQATRAYAKTDDEATFYDENARLLITTW------GWPELSDYASRVWSGLIR 695

Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 504
           DYY  R   +F    ES  +G  F L  W++ W+
Sbjct: 696 DYYAARWEAWF----ESRHTGRSFSLDLWQQTWL 725


>gi|265753065|ref|ZP_06088634.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|263236251|gb|EEZ21746.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
          Length = 750

 Score =  229 bits (584), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 163/558 (29%), Positives = 260/558 (46%), Gaps = 83/558 (14%)

Query: 8   VLPAFSGNVPAALQ--NVFPSAKITQLGNWFSVKSDPRWCCTY-------LLDATDPLFI 58
           VLP +SG +P+ L       S K  +  +  S +S       +       +L   DP F 
Sbjct: 225 VLPGYSGMIPSKLDLGKRIDSGKEEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFT 284

Query: 59  EIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWL 118
            I   F E+  K YG TS  Y+ D F E    + +       G AI   M+  +  AVW+
Sbjct: 285 RIANLFYEETEKLYG-TSDYYSIDPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWV 342

Query: 119 MQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW----- 173
           +QGW  + +P  RP  MKAL      G L++LDLF+E +P+W       G+P IW     
Sbjct: 343 VQGW--TENP--RPEMMKAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKG 387

Query: 174 --------CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVY 224
                   C+L NF GN+ ++G +D +       + +     + G+G++MEGIE NPV++
Sbjct: 388 YEEHNWLFCLLENFGGNVGLHGRMDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMF 447

Query: 225 DLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVI 284
           +LM E+ ++ EK   + WI QY   RYG    +IQ AW +L + +YNC  G         
Sbjct: 448 ELMCELPWRAEKFTKEEWIKQYIRARYGTDDESIQQAWQILTNGIYNCPAGNNQ------ 501

Query: 285 VAFPDVDPSIISVTEGKYQNY--GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI 342
                         +G +++   G+P    ++   + SS+     +Y  +    A  L +
Sbjct: 502 --------------QGPHESIFCGRP----SLNNFQASSWSKMCNYYDPTTTTEAARLMV 543

Query: 343 ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVED 402
           +  ++   +N + YDL+D+TRQA+A  A  ++   +  ++  D       +R+FLEL+  
Sbjct: 544 SVADKYRGNNNFEYDLVDITRQAIADRARIVYNYAVADFKSFDKKNYNTHTRQFLELLMM 603

Query: 403 MDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGN 462
            D LL     F +G W++ A+ L    E++  YEWNAR QIT W +    +   LRDY +
Sbjct: 604 QDKLLGTRKEFKVGNWIQQARNLGITPEEKDLYEWNARVQITTWGNRYCADIGKLRDYAH 663

Query: 463 KYWSGLLRDYYGPRAAIYFKYMIESLE--------------SGDGFRLKDWRREWIKLTN 508
           K W+GLLRD+Y  R   Y++ + + L+              + D   +     +W  L  
Sbjct: 664 KEWNGLLRDFYYKRWEKYWQVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEE 720

Query: 509 DWQNGRNVYPVESNGDAL 526
            W   +N Y   + GD +
Sbjct: 721 PWTLAKNTYAASAEGDCI 738


>gi|449299394|gb|EMC95408.1| glycoside hydrolase family 89 protein [Baudoinia compniacensis UAMH
           10762]
          Length = 801

 Score =  229 bits (584), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 143/517 (27%), Positives = 251/517 (48%), Gaps = 62/517 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY-LLDATDPLFIEIGRAFIE 66
           VLP F G VP  +   +P+A       W      PR       L+  DPL+  + ++FI 
Sbjct: 262 VLPCFPGFVPMQIGRYYPNAMYINGSQW---NGFPRQNTNVSFLEPFDPLYTTLQKSFIS 318

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-S 125
           +Q   YG  S IY  D ++EN P      Y+ ++ A   + +++ D +AVW++QGWLF S
Sbjct: 319 KQTAAYGNVSSIYTLDQYNENNPYSADTTYLRNISAGTIAALKAADPNAVWMLQGWLFFS 378

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
              FW    ++A L  V    +++LDLF+E +P W  +  +YG P+IWC LH++ GN+ +
Sbjct: 379 SATFWTDAAIRAYLGGVNNTDMIILDLFSETQPQWQRTNSYYGKPWIWCELHDYGGNMGL 438

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG ++++   P++A  + ++TMVG+G++MEG E N ++YD++ + A+    ++   + + 
Sbjct: 439 YGQVENVTINPIQALNNASSTMVGMGLTMEGQEGNEIMYDILLDQAWSSTPLNNSLYFHD 498

Query: 246 YSVRRYGRSV---PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
           +   RY  +    P +  AW+ +  TVYN T  +T +   V  +  ++ P++  +     
Sbjct: 499 WVTSRYHGAASLPPGLYTAWDTMRQTVYNNTQISTIQ--SVTKSIWELTPNVTGLLN--- 553

Query: 303 QNYGKPVSKEAVLKSETSSYDHP-HLWYSTSEVIRALELFIASGNE---LSASNTYRYDL 358
                             +  HP  + Y+TS ++ A + F  +  +   L  S  Y +DL
Sbjct: 554 -----------------RTGHHPTTIQYNTSTLVGAWKQFYGAAAQEPTLWDSPGYLFDL 596

Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ------LSRRFLELVEDMDGLLACH-- 410
            D+TRQ +A     L+ + + A   + A+  +         ++ + L+  +D +LA    
Sbjct: 597 TDVTRQVMANAFYPLYTSFVSASN-HSANATYSPGNATIYGQQMVSLLSALDSMLAASPI 655

Query: 411 DGFLLGPWLESAK--------------QLAQNEEQEKQYEWNARTQITMWFDNTQEEASL 456
             F L  W+  A+               L  + +    YE+NAR QIT+W    Q     
Sbjct: 656 PYFHLSTWIAEARSWSAPTATLPNNATNLTSSSQTASFYEYNARNQITLWGPTGQ----- 710

Query: 457 LRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 493
           + DY +K W+GL+  YY PR  ++  Y +    + +G
Sbjct: 711 ISDYASKQWAGLISSYYVPRWQLFVNYTLNGTTASNG 747


>gi|212695333|ref|ZP_03303461.1| hypothetical protein BACDOR_04880 [Bacteroides dorei DSM 17855]
 gi|212662112|gb|EEB22686.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides dorei DSM 17855]
          Length = 754

 Score =  229 bits (583), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 164/556 (29%), Positives = 257/556 (46%), Gaps = 79/556 (14%)

Query: 8   VLPAFSGNVPAALQ--NVFPSAKITQLGNWFSVKSDPRWCCTY-------LLDATDPLFI 58
           VLP +SG +P+ L       S K  +     S +S       +       +L   DP F 
Sbjct: 229 VLPGYSGMIPSKLDLGKRIDSGKEEKTAGDTSSESAQSTLNKWNGFDRPGILLPDDPKFT 288

Query: 59  EIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWL 118
            I   F E+  K YG TS  Y+ D F E    + +       G AI   M+  +  AVW+
Sbjct: 289 RIANLFYEETEKLYG-TSDYYSIDPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWV 346

Query: 119 MQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW----- 173
           +QGW  + +P  RP  MKAL      G L++LDLF+E +P+W       G+P IW     
Sbjct: 347 VQGW--TENP--RPEMMKAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKG 391

Query: 174 --------CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVY 224
                   C+L NF GN+ ++G +D +       + +     + G+G++MEGIE NPV++
Sbjct: 392 YEEHNWLFCLLENFGGNVGLHGRMDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMF 451

Query: 225 DLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVI 284
           +LM E+ ++ EK   + WI QY   RYG    +IQ AW +L + +YNC  G   +     
Sbjct: 452 ELMCELPWRAEKFTKEEWIKQYIRARYGTDDESIQQAWQILTNGIYNCPAGNNQQG---- 507

Query: 285 VAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIAS 344
                   SI           G+P    ++   + SS+     +Y  +    A  L ++ 
Sbjct: 508 -----PHESIFC---------GRP----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSV 549

Query: 345 GNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMD 404
            ++   +N + YDL+D+TRQA+A  A  ++   +  ++  D       +R+FLEL+   D
Sbjct: 550 ADKYRGNNNFEYDLVDITRQAIADRARIVYNYAVADFKSFDKKNYNTHTRQFLELLMMQD 609

Query: 405 GLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKY 464
            LL     F +G W++ A+ L    E++  YEWNAR QIT W +    +   LRDY +K 
Sbjct: 610 KLLGTRKEFKVGNWIQQARNLGITPEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKE 669

Query: 465 WSGLLRDYYGPRAAIYFKYMIESLE--------------SGDGFRLKDWRREWIKLTNDW 510
           W+GLLRD+Y  R   Y++ + + L+              + D   +     +W  L   W
Sbjct: 670 WNGLLRDFYYKRWEKYWQVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPW 726

Query: 511 QNGRNVYPVESNGDAL 526
              +N Y   + GD +
Sbjct: 727 TLAKNTYAASAEGDCI 742


>gi|409042145|gb|EKM51629.1| glycoside hydrolase family 89 protein [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 749

 Score =  229 bits (583), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 143/504 (28%), Positives = 257/504 (50%), Gaps = 43/504 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF+G VP+ +   +P+A I     W    S         L+  DPL+ ++ ++FI +
Sbjct: 233 ILPAFTGFVPSNMSAHYPNASIIDGSAWSGFPS--TLTNVSFLEPFDPLYPQMQQSFITK 290

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
           Q + YG  +H Y  D ++EN P   +  Y+SS+  +  + +++ D +A W+MQGWLF S 
Sbjct: 291 QQEAYGNITHFYTLDQYNENNPFSGNDSYLSSVSTSTIASLRAADPEATWVMQGWLFFSS 350

Query: 127 DPFWRPPQMKALLNSVPLG-KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
           + FW   +++A L        +++LDL++E +P W+ +  ++G  ++WC LH++ GN+ +
Sbjct: 351 ETFWTNDRIEAYLGGAQGNDSMLILDLYSEAQPQWNRTDSYFGKQWVWCELHDYGGNMGL 410

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIE-QNPVVYDLMSEMAFQHEKVDVKAWIN 244
            G L +I  GP+ A  S  ++MVG+G++MEG+E  N +VYD++ + A+    ++V  W+ 
Sbjct: 411 EGNLAAITEGPIAALNSNGSSMVGMGLTMEGMEIGNEIVYDILLDQAWSSTPLNVSDWVA 470

Query: 245 QYSVRRY-GRSVPA-IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
           +++ RRY  +++P  +Q AW +L  T+YN  D  +      I+        +++VT    
Sbjct: 471 KWAARRYLVKTLPTELQQAWTILSTTIYNNQDPNSQATIKSILELEPATTGLVNVTG--- 527

Query: 303 QNYGKPVSKEAVLKSETSSYDHP-HLWYST-SEVIRALELFI-ASGNELSASNT--YRYD 357
                                HP  + Y T + ++ AL+LF+ AS ++ S      +  D
Sbjct: 528 --------------------HHPTEIPYDTNTTILHALQLFVNASKSQPSLKQVPEFAVD 567

Query: 358 LIDLTRQALAKYANELFLNIIEAYQLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
           +++L+RQ +     +L+ ++I  +  +   A  V       L L+ D+D LL  ++ +L 
Sbjct: 568 ILELSRQLMVNRFIDLYTDLINTWNSSSSTAQNVTTAGVPLLSLISDLDVLLYTNENYLF 627

Query: 416 GPWLESAKQLAQNEEQEKQY-EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 474
             W+  AKQ A        Y E+ AR Q T+W          + DY +K  +GL+ +YY 
Sbjct: 628 STWIADAKQWAHGNVSYAAYLEYQARNQQTLWGPQGN-----INDYASKQTAGLVGEYYA 682

Query: 475 PRAAIYFKYMIESLESGDGFRLKD 498
            R   +   + E   SG  +   +
Sbjct: 683 TRWQTFVVMLAEQKTSGQPYNATE 706


>gi|261880010|ref|ZP_06006437.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
 gi|270333326|gb|EFA44112.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
          Length = 719

 Score =  229 bits (583), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 155/533 (29%), Positives = 244/533 (45%), Gaps = 51/533 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSG+VP  ++ ++P + I  LG W +  SD   C  Y +   DPLF +I R ++E+
Sbjct: 232 VLPAFSGHVPGQIKQLYPESNIQHLGRWAAF-SDQYRC--YFMSPQDPLFAKIQRMYLEE 288

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q   YG T HIY  D F+E  PP   P+Y+  +   IY  +   D  A WL   WLF + 
Sbjct: 289 QRAIYG-TDHIYGIDPFNEVDPPSWDPDYLFQISKGIYQTLAHVDPKAEWLQMSWLFYHK 347

Query: 128 PF-WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W P ++KAL+  V  GK+V+LD F +   IW  + +FYG PYIWC L NF GN  + 
Sbjct: 348 KKKWTPERVKALITGVETGKMVLLDYFCDRNEIWKMTDKFYGQPYIWCYLGNFGGNTTVA 407

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G + +         T     + GVG+++EG +     Y+ + +  +     +   WI+  
Sbjct: 408 GNVKACGAKLDSTLTLGGKNLQGVGLTLEGFDVCQFPYEYILDKVWSGNSSE-NQWIDAL 466

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           +    G + P+ + AW +LYH V+  + G+           P   P + S+         
Sbjct: 467 ADSHVGYASPSFRKAWQLLYHDVFVQSAGSNG-------ILPCYRPELNSL--------- 510

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
                         ++ + H+ Y   ++I A +L     +  S     + DLI   RQ L
Sbjct: 511 --------------NWHYTHVDYDRQKLIEAWKLMQHDAD--SKRTAAQLDLIHYGRQVL 554

Query: 367 AKYANELFLN---IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
               NE   +      AY   D  G+   +     ++ D+D L A H    L  W++ A+
Sbjct: 555 ---GNEFLTHKQLFDSAYAHCDLAGMMAQAASMRHIMLDIDTLTAYHPRCTLAGWIDGAR 611

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
           Q+A +      YE NAR+ IT W          L DY  K W+GL+ DYY  R   YF +
Sbjct: 612 QMAPDSVCADYYEDNARSLITTW-------GGKLNDYACKGWAGLMSDYYLTRWERYFAH 664

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
            I ++ +   F  + + +E  +    W + R++  VE++    +  + +  KY
Sbjct: 665 AINAVRAHRKFDQQAYDKEIARFELSWASHRDIPRVETHESLALYCKKIIQKY 717


>gi|345513909|ref|ZP_08793424.1| alpha-N-acetylglucosaminidase [Bacteroides dorei 5_1_36/D4]
 gi|345456132|gb|EEO45798.2| alpha-N-acetylglucosaminidase [Bacteroides dorei 5_1_36/D4]
          Length = 754

 Score =  228 bits (582), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 163/556 (29%), Positives = 258/556 (46%), Gaps = 79/556 (14%)

Query: 8   VLPAFSGNVPAALQ--NVFPSAKITQLGNWFSVKSDPRWCCTY-------LLDATDPLFI 58
           VLP +SG +P+ L       S K  +  +  S +S       +       +L   DP F 
Sbjct: 229 VLPGYSGMIPSKLDLGKRIDSGKEEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFT 288

Query: 59  EIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWL 118
            I   F E+  K YG TS  Y+ D F E    + +       G AI   M+  +  AVW+
Sbjct: 289 RIANLFYEETEKLYG-TSDYYSIDPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWV 346

Query: 119 MQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW----- 173
           +QGW  + +P  RP  MKAL      G L++LDLF+E +P+W       G+P IW     
Sbjct: 347 VQGW--TENP--RPEMMKAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKG 391

Query: 174 --------CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVY 224
                   C+L NF GN+ ++G +D +       + +     + G+G++MEGIE NPV++
Sbjct: 392 YEEHNWLFCLLENFGGNVGLHGRMDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMF 451

Query: 225 DLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVI 284
           +LM E+ ++ EK   + WI QY   RYG    +I+ AW +L + +YNC  G   +     
Sbjct: 452 ELMCELPWRAEKFTKEEWIKQYIRARYGTDDESIRQAWQILANGIYNCPAGNNQQG---- 507

Query: 285 VAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIAS 344
                   SI           G+P    ++   + SS+     +Y  +    A  L ++ 
Sbjct: 508 -----PHESIFC---------GRP----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSV 549

Query: 345 GNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMD 404
            ++   +N + YDL+D+TRQA+A  A  ++   +  ++  D       +R+FLEL+   D
Sbjct: 550 ADKYRGNNNFEYDLVDITRQAIADRARIVYNYAVADFKSFDKKNYATHTRQFLELLMMQD 609

Query: 405 GLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKY 464
            LL     F +G W++ A+ L    E++  YEWNAR QIT W +    +   LRDY +K 
Sbjct: 610 KLLGTRKEFKVGNWIQQARNLGITSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKE 669

Query: 465 WSGLLRDYYGPRAAIYFKYMIESLE--------------SGDGFRLKDWRREWIKLTNDW 510
           W+GLLRD+Y  R   Y++ + + L+              + D   +     +W  L   W
Sbjct: 670 WNGLLRDFYYKRWEKYWQVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPW 726

Query: 511 QNGRNVYPVESNGDAL 526
              +N Y   + GD +
Sbjct: 727 TLAKNTYAASAEGDCI 742


>gi|423299508|ref|ZP_17277533.1| hypothetical protein HMPREF1057_00674 [Bacteroides finegoldii
           CL09T03C10]
 gi|408473317|gb|EKJ91839.1| hypothetical protein HMPREF1057_00674 [Bacteroides finegoldii
           CL09T03C10]
          Length = 727

 Score =  228 bits (582), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 149/535 (27%), Positives = 261/535 (48%), Gaps = 49/535 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPA L+ ++P+AKI  +  W       R   ++ +D  D L+  I + F+E+
Sbjct: 231 VLPAFAGHVPAELKKIYPNAKIYTMSQWGGFDKQYR---SHFIDPMDSLYSVIQKRFLEE 287

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY- 126
           Q K YG T HIY  D F+E   P  + E++S++   IY  + S D +A WL   W+F Y 
Sbjct: 288 QTKIYG-TDHIYGIDPFNEVDSPDWNEEFLSNVSRKIYESLHSVDPEAQWLQMTWMFYYA 346

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W P ++K+ L +VP  KL++LD + +   IW  ++ +YG PYIWC L NF GN  + 
Sbjct: 347 KDKWTPSRIKSFLRAVPQDKLILLDYYCDHTEIWKKTEGYYGQPYIWCYLGNFGGNTMLA 406

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G L+       +        + G+G+++E  + NP++Y+ + E A++  +     WI  +
Sbjct: 407 GNLNDTYEKIHQVLAEGGQNIHGLGVTLEAFDVNPMMYEFVFEQAWEGAQ-PTDEWIATW 465

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           +  R G++ PA+  AW  L+  +Y                   + PS+            
Sbjct: 466 AKCRGGQTCPAVLKAWKELHEKIY-------------------IAPSLCG---------- 496

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEV-IRALELFIASGNELSASNT----YRYDLIDL 361
                +AVL +     +    W +  E      +L++  G+ L   +     + +D++++
Sbjct: 497 -----QAVLMNARPQLEGVQGWNTFPEYKYDNKDLWVIWGSLLQVGSIDKPGHAFDVVNV 551

Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
            RQ L    ++        Y+  D  G  + ++R   L+ D+D LLAC   F +G W++ 
Sbjct: 552 GRQVLGNLFSDYRAQFTACYKRKDVKGAQEWAQRMDALLLDVDRLLACSPLFSMGKWIQD 611

Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
           A+     EE++K YE NAR  +T+W     ++ + L DY N+ W+GL + +Y  R   + 
Sbjct: 612 ARDCGTTEEEKKYYEENARCILTIW----GQKDTQLNDYANRSWAGLTKGFYRERWKRFT 667

Query: 482 KYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
             ++ ++++   F  K + ++      +W      + V S  DA+  +  L+NKY
Sbjct: 668 DSVLTAMQANRSFDAKKFHKDITDFEYEWTLQHETFSVSSGEDAVKVANELWNKY 722


>gi|358391826|gb|EHK41230.1| glycoside hydrolase family 89 protein [Trichoderma atroviride IMI
           206040]
          Length = 751

 Score =  228 bits (582), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 148/506 (29%), Positives = 260/506 (51%), Gaps = 40/506 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF G VP  +  VFP   ++    W +   D     TY+ +  DP F ++ + FI +
Sbjct: 240 ILPAFPGFVPRNISRVFPGISLSTSPLWENFAEDLS-ADTYV-NPFDPHFTQLQKLFIGK 297

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + YG  +  +  D F+EN P      Y+ ++    ++ ++S   DA+W+MQ WLFS D
Sbjct: 298 QQELYGNVTKFWTLDQFNENQPLSSDLGYLRNVSQNTWTALKSASPDAIWVMQAWLFSAD 357

Query: 128 P-FWRPPQMKALLNSVPL-GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             FW    ++A L  +     +++LDLFAE  P W  +  FYG P+IWC LH++ GN+ +
Sbjct: 358 SSFWTNDAIEAFLGGITEDSDMLLLDLFAESAPQWLRTNSFYGKPWIWCELHDYGGNMGL 417

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG ++++    ++A    ++++VG G++MEG E N ++YDL+ + A+  + +D + + + 
Sbjct: 418 YGQIENVTINAMQA-VRNSSSLVGFGLTMEGQEGNEIMYDLLLDQAWSPKPIDTETYFHD 476

Query: 246 YSVRRYG-RSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           +   RYG  +V ++   W +L  TV+N T+   +     I+   ++ P+I  +  G+   
Sbjct: 477 WVSARYGTENVKSLYTGWELLRPTVFNNTNLTVNAVPKSIL---ELTPNINGLL-GRVGR 532

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRA-LELFIASGNELSASNTYRYDLIDLTR 363
           +G  +           +YD   +  + +E+ +A LE     GN       Y+YDL+D TR
Sbjct: 533 HGTTI-----------NYDPAVMVDAWTELFKAGLEDVKLFGNP-----AYQYDLVDWTR 576

Query: 364 QALAKYANELFLNIIEAYQLN-DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
           Q L    + L+ +++ AY  + +A  +     +   L++ +D +LA ++ F L  W+ +A
Sbjct: 577 QVLVNSFDGLYKDLVTAYNSSANAAEIRSRGSKLTALLKTLDAVLATNENFQLATWIAAA 636

Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           +  A N       E+NAR Q+T+W    Q E     DY +K W+GL+ DYY  R   +  
Sbjct: 637 R--ASNPSNTSFLEYNARNQVTLWGPTGQIE-----DYASKQWAGLVGDYYLGRWQQFID 689

Query: 483 YMIESLESGDG-----FRLKDWRREW 503
           Y+  +  S         +L+ W  +W
Sbjct: 690 YLATTKHSSYNQTAFYHKLQAWEIQW 715


>gi|237711645|ref|ZP_04542126.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 9_1_42FAA]
 gi|229454340|gb|EEO60061.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 9_1_42FAA]
          Length = 732

 Score =  228 bits (582), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 163/560 (29%), Positives = 260/560 (46%), Gaps = 79/560 (14%)

Query: 8   VLPAFSGNVPAALQ--NVFPSAKITQLGNWFSVKSDPRWCCTY-------LLDATDPLFI 58
           VLP +SG +P+ L       S K  +  +  S +S       +       +L   DP F 
Sbjct: 207 VLPGYSGMIPSKLDLGKRIDSGKEEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFT 266

Query: 59  EIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWL 118
            I   F E+  K YG TS  Y+ D F E    + +       G AI   M+  +  AVW+
Sbjct: 267 RIANLFYEETEKLYG-TSDYYSIDPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWV 324

Query: 119 MQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW----- 173
           +QGW  + +P  RP  MKAL      G L++LDLF+E +P+W       G+P IW     
Sbjct: 325 VQGW--TENP--RPEMMKAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKG 369

Query: 174 --------CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVY 224
                   C+L NF GN+ ++G +D +       + +     + G+G++MEGIE NPV++
Sbjct: 370 YEEHNWLFCLLENFGGNVGLHGRMDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMF 429

Query: 225 DLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVI 284
           +LM E+ ++ EK   + WI QY   RYG    +I+ AW +L + +YNC  G   +     
Sbjct: 430 ELMCELPWRAEKFTKEEWIKQYIRARYGTDDESIRQAWQILANGIYNCPAGNNQQG---- 485

Query: 285 VAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIAS 344
                   SI           G+P    ++   + SS+     +Y  +    A  L ++ 
Sbjct: 486 -----PHESIFC---------GRP----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSV 527

Query: 345 GNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMD 404
            ++   +N + YDL+D+TRQA+A  A  ++   +  ++  D       +R+FLEL+   D
Sbjct: 528 ADKYRGNNNFEYDLVDITRQAIADRARIVYNYAVADFKSFDKKNYATHTRQFLELLMMQD 587

Query: 405 GLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKY 464
            LL     F +G W++ A+ L    E++  YEWNAR QIT W +    +   LRDY +K 
Sbjct: 588 KLLGTRKEFKVGNWIQQARNLGITSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKE 647

Query: 465 WSGLLRDYYGPRAAIYFKYMIESLE--------------SGDGFRLKDWRREWIKLTNDW 510
           W+GLLRD+Y  R   Y++ + + L+              + D   +     +W  L   W
Sbjct: 648 WNGLLRDFYYKRWEKYWQVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPW 704

Query: 511 QNGRNVYPVESNGDALITSQ 530
              +N Y   + GD +  ++
Sbjct: 705 TLAKNTYAASAEGDCIEVAK 724


>gi|423230938|ref|ZP_17217342.1| hypothetical protein HMPREF1063_03162 [Bacteroides dorei
           CL02T00C15]
 gi|423244649|ref|ZP_17225724.1| hypothetical protein HMPREF1064_01930 [Bacteroides dorei
           CL02T12C06]
 gi|392630058|gb|EIY24060.1| hypothetical protein HMPREF1063_03162 [Bacteroides dorei
           CL02T00C15]
 gi|392641498|gb|EIY35274.1| hypothetical protein HMPREF1064_01930 [Bacteroides dorei
           CL02T12C06]
          Length = 754

 Score =  228 bits (581), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 165/558 (29%), Positives = 259/558 (46%), Gaps = 83/558 (14%)

Query: 8   VLPAFSGNVPAALQ--NVFPSAKITQLGNWFSVKSDPRWCCTY-------LLDATDPLFI 58
           VLP +SG +P+ L       S K  +  +  S +S       +       +L   DP F 
Sbjct: 229 VLPGYSGMIPSKLDLGKRIDSGKEEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFT 288

Query: 59  EIGRAFIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAV 116
            I   F E+  K YG TS  Y+ D F E  N P   +       G AI   M+  +  AV
Sbjct: 289 RIANLFYEETEKLYG-TSDYYSIDPFHEAKNLP---AELDFGKAGRAIMDAMKKANPKAV 344

Query: 117 WLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW--- 173
           W++QGW  + +P  RP  MKAL      G L++LDLF+E +P+W       G+P IW   
Sbjct: 345 WVVQGW--TENP--RPEMMKAL----NPGDLLILDLFSECRPMW-------GIPSIWKRD 389

Query: 174 ----------CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPV 222
                     C+L NF GN+ ++G +D +       + +     + G+G++MEGIE NPV
Sbjct: 390 KGYEEHNWLFCLLENFGGNVGLHGRMDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPV 449

Query: 223 VYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRD 282
           +++LM E+ ++ EK   + WI QY   RYG    +I+ AW +L + +YNC  G   +   
Sbjct: 450 MFELMCELPWRAEKFTKEEWIKQYIRARYGTDDESIRQAWQILANGIYNCPAGNNQQG-- 507

Query: 283 VIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI 342
                     SI           G+P    ++   + SS+     +Y  +    A  L +
Sbjct: 508 -------PHESIFC---------GRP----SLNNFQASSWSKMCNYYDPTTTTEAARLMV 547

Query: 343 ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVED 402
           +  ++   +N + YDL+D+TRQA+A  A  ++   +  ++  D       +R+FLEL+  
Sbjct: 548 SVADKYRGNNNFEYDLVDITRQAIADRARIVYNYAVADFKSFDKKNYATHTRQFLELLMM 607

Query: 403 MDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGN 462
            D LL     F +G W++ A+ L    E++  YEWNAR QIT W +    +   LRDY +
Sbjct: 608 QDKLLGTRKEFKVGNWIQQARNLGITSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAH 667

Query: 463 KYWSGLLRDYYGPRAAIYFKYMIESLE--------------SGDGFRLKDWRREWIKLTN 508
           K W+GLLRD+Y  R   Y++ + + L+              + D   +     +W  L  
Sbjct: 668 KEWNGLLRDFYYKRWEKYWQVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEE 724

Query: 509 DWQNGRNVYPVESNGDAL 526
            W   +N Y   + GD +
Sbjct: 725 PWTLAKNTYAASAEGDCI 742


>gi|319640296|ref|ZP_07995021.1| hypothetical protein HMPREF9011_00618 [Bacteroides sp. 3_1_40A]
 gi|317388071|gb|EFV68925.1| hypothetical protein HMPREF9011_00618 [Bacteroides sp. 3_1_40A]
          Length = 752

 Score =  228 bits (581), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 162/564 (28%), Positives = 257/564 (45%), Gaps = 95/564 (16%)

Query: 8   VLPAFSGNVPA---------------ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA 52
           VLP +SG +P+                L N    +  + L  W             +L  
Sbjct: 227 VLPGYSGMIPSKLDLGKRIDGGKEEKTLSNTSSESAQSTLNKWNGFDRPG------ILLP 280

Query: 53  TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGD 112
            DP F +I   F E+  K YG TS  Y+ D F E    + +       G AI   M+  +
Sbjct: 281 DDPKFTQIASLFYEETEKLYG-TSDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKAN 338

Query: 113 SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYI 172
             AVW++QGW  + +P  RP  MKAL      G L++LDLF+E +P+W       G+P I
Sbjct: 339 PKAVWVVQGW--TENP--RPEMMKAL----NPGDLLILDLFSECRPMW-------GIPSI 383

Query: 173 W-------------CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIE 218
           W             C+L NF GN+ ++G +D +       + +     + G+G++MEGIE
Sbjct: 384 WKRDKGYEEHNWLFCLLENFGGNVGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIE 443

Query: 219 QNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATD 278
            NPV+++LM E+ ++ EK   + WI QY   RYG    +I  AW +L + +YNC  G   
Sbjct: 444 NNPVMFELMCELPWRAEKFTKEEWIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQ 503

Query: 279 K--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIR 336
           +  +  +    P ++ +  + +  K  NY  P +                    T+E  R
Sbjct: 504 QGPHESIFCGRPSLN-NFQASSWSKMCNYYDPTT--------------------TAEAAR 542

Query: 337 ALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRF 396
              L ++  ++   +N + YDL+D+TRQA+A  A  ++   +  ++  D       +R+F
Sbjct: 543 ---LMVSVAHKYRGNNNFEYDLVDITRQAIADRARIVYNYAVADFKSFDKKSYATHTRQF 599

Query: 397 LELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASL 456
           LEL+   D LL     F +G W++ A+ L    E++  YEWNAR QIT W +    +   
Sbjct: 600 LELLIMQDKLLGTRKEFKVGNWIQQARNLGSTSEEKDLYEWNARVQITTWGNRYCADIGK 659

Query: 457 LRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE--------------SGDGFRLKDWRRE 502
           LRDY +K W+GLLRD+Y  R   Y++ + + L+              + D   +     +
Sbjct: 660 LRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQLDGKLPVLPVGNSSTPTADNPAMT---ID 716

Query: 503 WIKLTNDWQNGRNVYPVESNGDAL 526
           W  L   W   +N Y   + GD +
Sbjct: 717 WYALEEPWTLAKNTYAASAEGDCI 740


>gi|393784337|ref|ZP_10372502.1| hypothetical protein HMPREF1071_03370 [Bacteroides salyersiae
           CL02T12C01]
 gi|392666113|gb|EIY59630.1| hypothetical protein HMPREF1071_03370 [Bacteroides salyersiae
           CL02T12C01]
          Length = 728

 Score =  228 bits (580), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 143/485 (29%), Positives = 228/485 (47%), Gaps = 54/485 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V   F G VP  L+  +P A+I + G W + +          LD TDPLF  +   + E+
Sbjct: 230 VFQGFYGMVPNVLKKKYPDARIKEQGTWQTYQRPA------FLDPTDPLFDRVAAIYYEE 283

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G  +  +  D F E         ++      I   M+  +  AVW++QGW   ++
Sbjct: 284 QKKLFG-DAEFFGGDPFHEGG--TSEGIHVKLAAQKILQAMRKVNPKAVWVLQGW--QHN 338

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS--TSKQFYGVP------YIWCMLHNF 179
           P      +K L++ +  G+ ++LDL A  +P W   T+  F+         +IWC L NF
Sbjct: 339 P------VKDLMDGLNPGETIILDLMACERPQWGGVTTSMFHKPEGHQDHRWIWCALPNF 392

Query: 180 AGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVD 238
            G   ++G + S A G V A+       + G+G + EGI   PVVYD++ +MA++ + + 
Sbjct: 393 GGKTGLHGKMSSYASGAVFAKEHPMGRNICGIGTAPEGIGTVPVVYDMVYDMAWRTDSIQ 452

Query: 239 VKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVT 298
           +  W+  Y+  RYG        AW +L  TVY C +         I A P          
Sbjct: 453 IPQWLTNYTYYRYGMEDTNCDKAWKILSETVYECHNELGGPVESYICARP---------- 502

Query: 299 EGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 358
                         A      S++ +  ++Y   +++ A E    S N  +  +TY YDL
Sbjct: 503 --------------ADTIDHVSTWGNARIFYEPVKMVEAWEFLYQSRNRFNHCDTYEYDL 548

Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
           +D+TRQ L+ YA  L   ++EA+   + +G  + S  FL++++D D LL+    F+LG W
Sbjct: 549 VDVTRQVLSDYAKYLHKEMVEAFHQKNENGFMKYSTEFLDVIKDEDRLLSTRKEFMLGTW 608

Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
           L  A+      E+++++  NA+  +T W D   +    L DY NK WSGLL D+Y PR  
Sbjct: 609 LTEAENAGCTPEEKRRFVTNAKRLVTTWTDRDSD----LHDYANKEWSGLLSDFYLPRWE 664

Query: 479 IYFKY 483
            Y  Y
Sbjct: 665 AYVTY 669


>gi|345517325|ref|ZP_08796802.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 4_3_47FAA]
 gi|345457718|gb|EET14396.2| alpha-N-acetylglucosaminidase [Bacteroides sp. 4_3_47FAA]
          Length = 754

 Score =  228 bits (580), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 162/562 (28%), Positives = 255/562 (45%), Gaps = 91/562 (16%)

Query: 8   VLPAFSGNVPA---------------ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA 52
           VLP +SG +P+                L N    +  + L  W             +L  
Sbjct: 229 VLPGYSGMIPSKLDLGKRIDGGKEEKTLSNTSSESAQSTLNKWNGFDRPG------ILLP 282

Query: 53  TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGD 112
            DP F +I   F E+  K YG TS  Y+ D F E    + +       G AI   M+  +
Sbjct: 283 DDPKFTQIASLFYEETEKLYG-TSDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKAN 340

Query: 113 SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYI 172
             AVW++QGW  + +P  RP  MKAL      G L++LDLF+E +P+W       G+P I
Sbjct: 341 PKAVWVVQGW--TENP--RPEMMKAL----NPGDLLILDLFSECRPMW-------GIPSI 385

Query: 173 W-------------CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIE 218
           W             C+L NF GN+ ++G +D +       + +     + G+G++MEGIE
Sbjct: 386 WKRDKGYEEHNWLFCLLENFGGNVGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIE 445

Query: 219 QNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATD 278
            NPV+++LM E+ ++ EK   + WI QY   RYG    +I  AW +L + +YNC  G   
Sbjct: 446 NNPVMFELMCELPWRAEKFTKEEWIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQ 505

Query: 279 KNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRAL 338
           +             SI           G+P    ++   + SS+     +Y  +    A 
Sbjct: 506 QG---------PHESIFC---------GRP----SLNNFQASSWSKMCNYYDPTTTAEAA 543

Query: 339 ELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLE 398
            L ++  ++   +N + YDL+D+TRQA+A  A  ++   +  ++  D       +R+FLE
Sbjct: 544 RLMVSVAHKYRGNNNFEYDLVDITRQAIADRARIVYNYAVADFKSFDKKSYATHTRQFLE 603

Query: 399 LVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLR 458
           L+   D LL     F +G W++ A+ L    E++  YEWNAR QIT W +    +   LR
Sbjct: 604 LLIMQDKLLGTRKEFKVGNWIQQARNLGSTSEEKDLYEWNARVQITTWGNRYCADIGKLR 663

Query: 459 DYGNKYWSGLLRDYYGPRAAIYFKYMIESLE--------------SGDGFRLKDWRREWI 504
           DY +K W+GLLRD+Y  R   Y++ + + L+              + D   +     +W 
Sbjct: 664 DYAHKEWNGLLRDFYYKRWEKYWQVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWY 720

Query: 505 KLTNDWQNGRNVYPVESNGDAL 526
            L   W   +N Y   + GD +
Sbjct: 721 ALEEPWTLAKNTYAASAEGDCI 742


>gi|320106778|ref|YP_004182368.1| alpha-N-acetylglucosaminidase [Terriglobus saanensis SP1PR4]
 gi|319925299|gb|ADV82374.1| Alpha-N-acetylglucosaminidase [Terriglobus saanensis SP1PR4]
          Length = 754

 Score =  228 bits (580), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 163/514 (31%), Positives = 239/514 (46%), Gaps = 63/514 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWF--SVKSDPRWCCTYLLDATDP-LFIEIGRAF 64
           V PAFSG VP   + + P A+   L  W     K+ PR   T++L   +  L+  IG+ F
Sbjct: 257 VAPAFSGFVPQGFKRLHPKAETFTL-LWLPEEFKTIPRSTRTFILHPGEQDLYRLIGKKF 315

Query: 65  IEQQLKEYGRTSHIYNCDTFDENTPPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQG 121
           IE+   EYG   + Y  DTF+E   PV      E +   G  +Y G+ +GD +  W+MQG
Sbjct: 316 IEEYKAEYGEVQY-YLADTFNELAVPVREEHRFEDLERFGRTVYEGILAGDPNGTWVMQG 374

Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPI---------WSTSKQFYGVPY 171
           WLF YD  FW    + ALL  +P  +++++D   ++ P          W T K F+G  +
Sbjct: 375 WLFVYDVAFWNSESVAALLRGIPNDRMLIIDYANDLAPAVKGKYAPGQWKTQKAFFGKQW 434

Query: 172 IWCMLHNFAGNIEMYGILDSIAFGPVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEM 230
           I  M H F GN  + G L  +A  P    TS E   +VG GM  EGIE N VVY+LM++ 
Sbjct: 435 INGMAHTFGGNNNVKGNLKLMASEPASVLTSPERGNLVGWGMCPEGIETNEVVYELMTDA 494

Query: 231 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDV 290
            +Q E +D+K WI  Y   RYG   P + +AW +L  + Y+     T +        P +
Sbjct: 495 GWQREAIDLKQWIPAYCRSRYGACPPVMLEAWTLLMQSAYSAHIWMTHQAWQTE---PSL 551

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
            P+  SV  G       P  +                        RA+ LF++   EL  
Sbjct: 552 APAAASVDAG-------PTFR------------------------RAVALFLSCAPELGQ 580

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
              YR DLI+L  QA     ++ F   ++A Q +      + +   L  +  MD LL   
Sbjct: 581 KELYRNDLIELVVQAAGGSVDQTFSLAVQAGQSHQNEVATEYAAHALGWMGRMDALLNLR 640

Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
               L  W+++A+  A+++++   Y+ NAR  IT W          L DY ++ WSGL R
Sbjct: 641 PDRRLETWMQAARSYAKSDDEAAYYDENARRLITTW------GWPELSDYASRAWSGLTR 694

Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 504
           DYY  R   +F     SL +G  F L  W++ W+
Sbjct: 695 DYYASRWEAWFA----SLHAGRPFSLDIWQQTWL 724


>gi|294777713|ref|ZP_06743164.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
 gi|294448781|gb|EFG17330.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
          Length = 752

 Score =  228 bits (580), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 162/568 (28%), Positives = 259/568 (45%), Gaps = 95/568 (16%)

Query: 8   VLPAFSGNVPA---------------ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA 52
           VLP +SG +P+                L N    +  + L  W             +L  
Sbjct: 227 VLPGYSGMIPSKLDLGKRIDGGKEEKTLSNTSSESAQSTLNKWNGFDRPG------ILLP 280

Query: 53  TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGD 112
            DP F +I   F E+  K YG TS  Y+ D F E    + +       G AI   M+  +
Sbjct: 281 DDPKFTQIASLFYEETEKLYG-TSDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKAN 338

Query: 113 SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYI 172
             AVW++QGW  + +P  RP  MKAL      G L++LDLF+E +P+W       G+P I
Sbjct: 339 PKAVWVVQGW--TENP--RPEMMKAL----NPGDLLILDLFSECRPMW-------GIPSI 383

Query: 173 W-------------CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIE 218
           W             C+L NF GN+ ++G +D +       + +     + G+G++MEGIE
Sbjct: 384 WKRDKGYEEHNWLFCLLENFGGNVGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIE 443

Query: 219 QNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATD 278
            NPV+++LM E+ ++ EK   + WI QY   RYG    +I  AW +L + +YNC  G   
Sbjct: 444 NNPVMFELMCELPWRAEKFTKEEWIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQ 503

Query: 279 K--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIR 336
           +  +  +    P ++ +  + +  K  NY  P +                    T+E  R
Sbjct: 504 QGPHESIFCGRPSLN-NFQASSWSKMCNYYDPTT--------------------TAEAAR 542

Query: 337 ALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRF 396
              L ++  ++   +N + YDL+D+TRQA+A  A  ++   +  ++  D       +R+F
Sbjct: 543 ---LMVSVAHKYRGNNNFEYDLVDITRQAIADRARIVYNYAVADFKSFDKKSYATHTRQF 599

Query: 397 LELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASL 456
           LEL+   D LL     F +G W++ A+ L    E++  YEWNAR QIT W +    +   
Sbjct: 600 LELLIMQDKLLGTRKEFKVGNWIQQARNLGSTSEEKDLYEWNARVQITTWGNRYCADIGK 659

Query: 457 LRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE--------------SGDGFRLKDWRRE 502
           LRDY +K W+GLLRD+Y  R   Y++ + + L+              + D   +     +
Sbjct: 660 LRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQLDGKLPVLPVGNSSTPTADNPAMT---ID 716

Query: 503 WIKLTNDWQNGRNVYPVESNGDALITSQ 530
           W  L   W   +N Y   + GD +  ++
Sbjct: 717 WYALEEPWTLAKNTYAASAEGDCIEVAK 744


>gi|237719043|ref|ZP_04549524.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
 gi|229451821|gb|EEO57612.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
          Length = 713

 Score =  228 bits (580), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 160/540 (29%), Positives = 258/540 (47%), Gaps = 60/540 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPA L+ ++P A I  LG W       R  C +L +  D LF +I + F+++
Sbjct: 218 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 274

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G T HIY  D F+E  PP   PEY+  + + +Y+ + + D  A W+   W+F +D
Sbjct: 275 QKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFD 333

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W   +MKALL  VP  K+++LD   E   +W  ++ F+  PYIWC L NF GN  + 
Sbjct: 334 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLT 393

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +         A  +    + G+G ++EG++     Y+ + E A+ +  VD   WI   
Sbjct: 394 GNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECL 452

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R  G    +++DAW  L++ +Y              V  P         T G    Y 
Sbjct: 453 ADRHVGCVSQSVRDAWKRLFNDIY--------------VQVPR--------TLGTLPGY- 489

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
           +P   +   K  ++ Y +  L     EV R L       NE S+   + +R DLI + RQ
Sbjct: 490 RPALNKNSEKRTSNVYSNVEL----LEVWRKL-------NEASSDRRDAFRLDLITVGRQ 538

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            L  Y  ++ +      +  D   +     +  E++ D+D L A H    L  W++ A++
Sbjct: 539 VLGNYFLDVKMEFDRMVETKDHQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARK 598

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
           +  + + +  YE NAR  IT W          L DY ++ W+GL+ DYY  R  +Y    
Sbjct: 599 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYINTF 651

Query: 485 IESLESG---DGFRLKDWRRE----WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 536
           I++ E G   D  +L+D  +E    W+  T+     ++V+   S  D L++ S +L++KY
Sbjct: 652 IKAAEKGVEVDQKQLEDELKEIEEGWVNATDREDTRKDVH---STTDGLLSFSTFLFSKY 708


>gi|212541222|ref|XP_002150766.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
           18224]
 gi|210068065|gb|EEA22157.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
           18224]
          Length = 787

 Score =  228 bits (580), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 156/489 (31%), Positives = 245/489 (50%), Gaps = 44/489 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP+F+G VP A+  V P+AK+     W    S+  + C   L+  D  F  + ++ I +
Sbjct: 241 VLPSFTGFVPRAITRVLPNAKVVPGSRWNVFSSN--YTCDTFLEPFDDNFALLQKSTISK 298

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q   YG  SHIY  D ++EN P   +P+Y+ ++       +++ D DAVWLMQ WLF   
Sbjct: 299 QQAYYGNISHIYALDQYNENNPFSSNPDYLRNISRTTSQSLKAADPDAVWLMQSWLFLDA 358

Query: 128 PFWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
            FW    + A L+ V     +++LDLFAE +P+W  +  +YG P+IWC +H++ GN+ +Y
Sbjct: 359 TFWNNVTICAYLSGVENNSDMLILDLFAESQPVWQLTDSYYGKPWIWCQVHDYGGNMGLY 418

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G + +I      A  S   +MVG G +ME  E N +VYDL+ + A+    ++   +   +
Sbjct: 419 GQIMNITENATAALASSG-SMVGFGHTMESQEGNEIVYDLLLDQAWSETPINTSQYFEDW 477

Query: 247 SVRRYG--RSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV--TEGK 301
              RY   + VP  + DAW +L  + YN T+ A+      I+   +++PSI  +   EG 
Sbjct: 478 VTVRYAGTQHVPQQLFDAWEILRWSAYNNTNLASSSVPKSIL---ELEPSISGLLNREGH 534

Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALEL-FIASGNELSA--SNTYRYDL 358
           +                T +YD P L      V+ A  L + A+  ELS   +  + YDL
Sbjct: 535 HPT--------------TINYD-PEL------VVEAWALTYEAALLELSLWDNPAFNYDL 573

Query: 359 IDLTRQALAKYANELFLNIIEAYQLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
           I LTRQ L       +  +I  Y   +     +    R+ ++L++ +D +L  ++ F L 
Sbjct: 574 IFLTRQVLVNAFIPRYELLISFYNNENYSVPAIVSAGRQLIDLLQSLDTVLGTNECFQLA 633

Query: 417 PWLESAKQLAQ-NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
            W+  A   A  N      YE+NAR QIT+W  N +     + DY +K W+GL+  YY P
Sbjct: 634 QWINKAVSRAHGNTTLAAYYEYNARNQITLWGPNGE-----ISDYASKQWAGLISSYYVP 688

Query: 476 RAAIYFKYM 484
           R  I   Y+
Sbjct: 689 RWQILVDYL 697


>gi|393788556|ref|ZP_10376683.1| hypothetical protein HMPREF1068_02963 [Bacteroides nordii
           CL02T12C05]
 gi|392654236|gb|EIY47884.1| hypothetical protein HMPREF1068_02963 [Bacteroides nordii
           CL02T12C05]
          Length = 732

 Score =  227 bits (579), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 162/515 (31%), Positives = 244/515 (47%), Gaps = 63/515 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V   F G VP AL+  FP A+I   G W + +          LD TDPLF ++   + E+
Sbjct: 233 VFQGFYGMVPNALKEKFPDARIKDQGIWGTYQRPA------FLDPTDPLFDKLAAIYYEE 286

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAA--IYSGMQSGDSDAVWLMQGWLFS 125
           Q   +G  +  +  D F E      + E I+   AA  I   M+  +  AVW++QGW   
Sbjct: 287 QKNLFGE-AQFFGGDPFHEG----GTSEGINVKLAAQKILQAMRKVNPQAVWVLQGW--Q 339

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSK-QFYGVP-------YIWCMLH 177
           ++P      +K L+  V  G+ ++LDL A  +P W   K   +  P       +IWC L 
Sbjct: 340 HNP------VKELMEGVKPGETIILDLMACERPQWGGVKTSMFHKPEGHWNHQWIWCALP 393

Query: 178 NFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEK 236
           NF G   ++G + S A GPV A+       + G+G + EGI   PVVYD++ +MA++ + 
Sbjct: 394 NFGGKTGLHGKMSSYASGPVFAKHHPMGKNICGIGTAPEGIGTIPVVYDMVYDMAWRTDS 453

Query: 237 VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 296
           + +  W++ Y+  RYG        AW +L  T+Y C +         I A P       S
Sbjct: 454 IHIPQWLDNYTYYRYGTEDNNCNRAWKLLSETIYECHNELGGPVESYICARP-------S 506

Query: 297 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRY 356
            T      +G  V                 ++Y   +V++A +L   S    + S+TY Y
Sbjct: 507 DTIQHVSTWGNAV-----------------MFYDPMKVVKAWDLLYQSRKRFNHSDTYEY 549

Query: 357 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
           DL D+TRQ L+ YA  L   ++ A+Q  D     + S +FL +++D D LL+    F+LG
Sbjct: 550 DLTDVTRQVLSDYAKYLHERMVLAFQKKDKERFMEYSGKFLNIIKDEDRLLSTRKEFMLG 609

Query: 417 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
            WL  A++     E+++++  NA+  IT W D   +    L DY NK WSGLL D+Y PR
Sbjct: 610 TWLAEAEKAGGTPEEKRRFVTNAKRLITTWTDTDSD----LHDYANKEWSGLLIDFYLPR 665

Query: 477 AAIYFKYMIESLESGDGFRLKDWRR---EWIKLTN 508
              Y  Y   SL  G      D+ +   EW+ LTN
Sbjct: 666 WEAYVTYKT-SLLYGKKLPYPDYSKMEQEWV-LTN 698


>gi|410095990|ref|ZP_11290981.1| hypothetical protein HMPREF1076_00159 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409227396|gb|EKN20294.1| hypothetical protein HMPREF1076_00159 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 753

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 167/550 (30%), Positives = 246/550 (44%), Gaps = 81/550 (14%)

Query: 11  AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQ 67
            FSG VP  L+N +P AKI         +  P WC       LD TD LF   GR F+E+
Sbjct: 233 GFSGYVPRELKNKYPDAKI---------QLQPSWCGFTGAAQLDPTDSLFSAFGRDFLEE 283

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           + K +G    +Y  D F E+ PP+D+PEY+S++G +IY   Q  D  A+W MQ W     
Sbjct: 284 EKKLFG-AHGVYAADPFHESRPPIDTPEYLSAVGNSIYKLFQDFDPSAIWAMQAWSL--- 339

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
              R P +KA    VP   L++LDL        +T    +G P +   LHNF G I ++G
Sbjct: 340 ---REPIVKA----VPKEHLLILDLNGGRSRQENTC---WGYPVVAGNLHNFGGRINLHG 389

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
            L  +A         ++  + G G+ ME IEQNPV YDL  EM    ++VD++ W+  Y+
Sbjct: 390 DLRLLASNQYAVAKQKSPNVCGSGLFMESIEQNPVYYDLAFEMPLHADEVDIEEWLGDYA 449

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
            RRYG +      AW  L    Y    G     R  I+A     P++     G     G 
Sbjct: 450 ERRYGAASENAHKAWLHLLEGPYR--PGTNGTERSSIIA---ARPALNVKKSGPNAGLGI 504

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
           P                    YS   VI+A  L +   ++L+AS  YR+D++D+ RQ ++
Sbjct: 505 P--------------------YSPLLVIQAQGLLLKDADKLNASTPYRFDVVDIQRQLMS 544

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
                +     EA+   D       S RFLE++ D+D LL     F    WL  A+    
Sbjct: 545 NLGQAIHKKAAEAFVKKDKAAFTLHSNRFLEMLRDVDVLLRTRPEFNFDKWLTDARSWGT 604

Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
             E++   E +A   +T+W     +   L+ DY  + W+GL+  YY  R   ++  + E 
Sbjct: 605 TNEEKDLLEKDATALVTVW---GADGDPLIFDYSWREWTGLIDSYYLKRWEKFYAMLQEH 661

Query: 488 LESG--------------DGFR-------LKDWRREWIKLTNDWQNGRNVYPVESNGDAL 526
           L+ G              + FR       L DW  E++  TN  +      P+ + GD +
Sbjct: 662 LDEGNEYSEKGLPMTHGREAFRANDFYSELGDWELEFVSRTNKART-----PI-TQGDEI 715

Query: 527 ITSQWLYNKY 536
            T+  +Y KY
Sbjct: 716 ETALKMYKKY 725


>gi|423223006|ref|ZP_17209475.1| hypothetical protein HMPREF1062_01661 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392640582|gb|EIY34381.1| hypothetical protein HMPREF1062_01661 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 755

 Score =  226 bits (575), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 162/549 (29%), Positives = 249/549 (45%), Gaps = 76/549 (13%)

Query: 11  AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK 70
            FSG VP  LQ  +P AKI+    W           T  LD TDPLF E+G AF+E+Q K
Sbjct: 235 GFSGYVPRELQAKYPQAKISMKRKWCGFDG------TAQLDPTDPLFHEMGLAFLEEQDK 288

Query: 71  EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW 130
            +G +  +Y  D F E+ PP+D+PEY++ +G  I+   Q+ D+ A+W+MQ W    D   
Sbjct: 289 LFG-SYGVYAADPFHESAPPIDTPEYLTGVGQTIHKLFQTFDAGALWVMQAWSMRED--- 344

Query: 131 RPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 190
                  ++ +VP   L++LDL        + +   +G P I   LHNF G I M+G L 
Sbjct: 345 -------IVKAVPKESLLILDLNGSK----TAANGGWGYPVIAGNLHNFGGRINMHGDLA 393

Query: 191 SIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRR 250
            +A    +   +    + G G+ ME IEQNPV Y+L  EM    + + ++AW+  Y+ RR
Sbjct: 394 LLASNQYQKAKARYPNVCGSGLFMEAIEQNPVYYELAFEMPNHADSIPLQAWLAAYAERR 453

Query: 251 YGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS 310
           YG    A   AW  L    Y    G     R  IVA     P++     G     G P  
Sbjct: 454 YGAKSAAAGKAWMYLLEGPYR--QGTNGTERSSIVA---ARPALNVKKSGPNAGLGIP-- 506

Query: 311 KEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYA 370
                             Y    VIRA    +   ++L+ S  YR+D++D+ RQ +    
Sbjct: 507 ------------------YEPMLVIRAQSQLLKDADKLAFSKPYRFDIVDVQRQMMTNLG 548

Query: 371 NELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 430
             +     EA+   D       S RFLEL+ DMD LL     +    WL  A+   + +E
Sbjct: 549 QLVHKKAAEAFASKDKAAFVLHSGRFLELLRDMDELLYTRSEYSFDRWLTEARSWGETKE 608

Query: 431 QEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 490
           ++   E +A + +T+W  +       + DY  + W+GL+  YY PR   ++  +   L++
Sbjct: 609 EKDLMERDATSLVTIWGADGDPR---IFDYSWREWAGLINGYYLPRWQKFYTMLQGHLDA 665

Query: 491 GDGF---------------------RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITS 529
           G  +                     RL +W   ++      Q G+   PV ++GD L+ +
Sbjct: 666 GTDYQEEGLSLAYGREDFRANDFYNRLAEWELAYVD-----QTGKARTPV-THGDELVVT 719

Query: 530 QWLYNKYLQ 538
           + L++KYL+
Sbjct: 720 RRLFDKYLK 728


>gi|336412606|ref|ZP_08592959.1| hypothetical protein HMPREF1017_00067 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942652|gb|EGN04494.1| hypothetical protein HMPREF1017_00067 [Bacteroides ovatus
           3_8_47FAA]
          Length = 727

 Score =  226 bits (575), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 159/540 (29%), Positives = 258/540 (47%), Gaps = 60/540 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPA L+ ++P A I  LG W       R  C +L +  D LF +I + F+++
Sbjct: 232 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 288

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G T HIY  D F+E  PP   PEY+  + + +Y+ + + D  A W+   W+F +D
Sbjct: 289 QKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFD 347

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W   +MKALL  VP  K+++LD   E   +W  ++ F+  PYIWC L NF GN  + 
Sbjct: 348 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLT 407

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +         A  +    + G+G ++EG++     Y+ + E A+ +  VD   WI   
Sbjct: 408 GNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECL 466

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R  G    +++DAW  L++ +Y              V  P         T G    Y 
Sbjct: 467 ADRHVGCVSQSVRDAWKRLFNDIY--------------VQVPR--------TLGTLPGY- 503

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
           +P   +   K  ++ Y +  L     EV R L       NE  +   + +R DLI + RQ
Sbjct: 504 RPALNKNSEKRTSNVYSNVEL----LEVWRKL-------NEAPSDRRDAFRLDLITVGRQ 552

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            L  Y  ++ +      +  D   +   + +  E++ D+D L A H    L  W++ A++
Sbjct: 553 VLGNYFLDVKMEFDRMVEAKDHQALKACAEKMKEILNDLDKLNAFHPYCSLDKWIDDARK 612

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
           +  + + +  YE NAR  IT W          L DY ++ W+GL+ DYY  R  +Y    
Sbjct: 613 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYINTF 665

Query: 485 IESLESG---DGFRLKDWRRE----WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 536
           I++ E G   D  +L+D  +E    W+  T+     ++V+   S  D L++ S +L++KY
Sbjct: 666 IKAAEKGVEVDQKQLEDELKEIEEGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722


>gi|423722278|ref|ZP_17696454.1| hypothetical protein HMPREF1078_00517 [Parabacteroides merdae
           CL09T00C40]
 gi|409242419|gb|EKN35181.1| hypothetical protein HMPREF1078_00517 [Parabacteroides merdae
           CL09T00C40]
          Length = 752

 Score =  225 bits (574), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 162/544 (29%), Positives = 241/544 (44%), Gaps = 69/544 (12%)

Query: 11  AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQ 67
            FSG VP  L+  +P AKI         +  P WC       LD TD LF  IGR F+E+
Sbjct: 232 GFSGYVPRELKEKYPDAKI---------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEE 282

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           + K YG    +Y  D F E+ PPVD+PEY+ ++G AI+      D +++W MQ W     
Sbjct: 283 EKKLYG-AHGVYAADPFHESQPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSL--- 338

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
              R P +KA    VP   L++LDL              +G P +   LHNF G I ++G
Sbjct: 339 ---REPIVKA----VPKENLLILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHG 388

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
            L  +A         +N  + G G+ ME IEQNPV YDL  EM    ++V+++ W+ +Y+
Sbjct: 389 DLRLLASNQYVNAVKKNPNVCGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYA 448

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
            RRYG+       AW  L    Y    G     R  I+A     P++     G     G 
Sbjct: 449 DRRYGKPSENAHQAWLHLLEGPYR--PGTNGTERSSIIA---ARPAVNVKKSGPNAGLGI 503

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
           P                    YS   V++A  L +     L  S+ YR+D++D+ RQ ++
Sbjct: 504 P--------------------YSPLSVVQAEGLLLKDAGRLKGSDPYRFDIVDIQRQLMS 543

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
                +     EA++  D       S RFLE++ D D LL     F    WL  A+    
Sbjct: 544 NLGQAIHKQAAEAFRKKDKEAFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGD 603

Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
           N E++  +E +A   +T+W     +   L+ DY  + W+GL+  YY  R   ++  + + 
Sbjct: 604 NSEEKDLFEKDATALVTVW---GADGDPLIFDYSWREWTGLIDGYYLKRWEKFYAMLQDH 660

Query: 488 LESGDGFRLKDWR----REWIKLTN------DWQ-----NGRNVYPVESNGDALITSQWL 532
           L++G  +  KD      RE  +  +      DW+         V    + GD + T+  L
Sbjct: 661 LDAGTNYSEKDLPQTHGRESFRANDFYSTLGDWELQFVSTPDKVRTPITQGDEVETATRL 720

Query: 533 YNKY 536
           Y KY
Sbjct: 721 YKKY 724


>gi|294648124|ref|ZP_06725667.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
 gi|292636508|gb|EFF54983.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
          Length = 499

 Score =  225 bits (573), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 159/540 (29%), Positives = 256/540 (47%), Gaps = 60/540 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPA L+ ++P A I  LG W       R  C +L +  D LF +I + F+++
Sbjct: 4   VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 60

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G T HIY  D F+E  PP   PEY+  + + +Y+ + + D  A W+   W+F +D
Sbjct: 61  QKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFD 119

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W   +MKALL  VP  K+++LD   E   +W  ++ F+  PYIWC L NF GN  + 
Sbjct: 120 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLT 179

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +         A  +    + G+G ++EG++     Y+ + E A+ +  VD   WI   
Sbjct: 180 GNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECL 238

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R  G     ++DAW  L++ +Y              V  P         T G    Y 
Sbjct: 239 ADRHVGCVSQPVRDAWKRLFNDIY--------------VQVPR--------TLGTLPGY- 275

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
           +P   +   K  ++ Y +  L     EV R L       NE  +   + +R DLI + RQ
Sbjct: 276 RPALNKNSEKRTSNVYSNVELL----EVWRKL-------NEAPSDRRDAFRLDLITVGRQ 324

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            L  Y  ++ +      +  D   +     +  E++ D+D L A H    L  W++ A++
Sbjct: 325 VLGNYFLDVKMEFDRMVEAKDHQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARK 384

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
           +  + + +  YE NAR  IT W          L DY ++ W+GL+ DYY  R  +Y    
Sbjct: 385 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYVNTF 437

Query: 485 IESLESG---DGFRLKDWRRE----WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 536
           I++ E G   D  +L+D  +E    W+  T+     ++V+   S  D L++ S +L++KY
Sbjct: 438 IKAAEEGVEVDQKQLEDELKEIEEGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 494


>gi|224537227|ref|ZP_03677766.1| hypothetical protein BACCELL_02104 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521150|gb|EEF90255.1| hypothetical protein BACCELL_02104 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 755

 Score =  225 bits (573), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 162/549 (29%), Positives = 249/549 (45%), Gaps = 76/549 (13%)

Query: 11  AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK 70
            FSG VP  LQ  +P AKI+    W           T  LD TDPLF E+G AF+E+Q K
Sbjct: 235 GFSGYVPRELQAKYPQAKISMKRKWCGFDG------TAQLDPTDPLFHEMGLAFLEEQDK 288

Query: 71  EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW 130
            +G +  +Y  D F E+ PP+D+PEY++ +G  I+   Q+ D+ A+W+MQ W    D   
Sbjct: 289 LFG-SYGVYAADPFHESAPPIDTPEYLTGVGQTIHKLFQTFDAGALWVMQAWSMRED--- 344

Query: 131 RPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 190
                  ++ +VP   L++LDL        + +   +G P I   LHNF G I M+G L 
Sbjct: 345 -------IVKAVPKESLLILDLNGSK----TAANGGWGYPVIAGNLHNFGGRINMHGDLA 393

Query: 191 SIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRR 250
            +A    +   +    + G G+ ME IEQNPV Y+L  EM    + + ++AW+  Y+ RR
Sbjct: 394 LLASNQYQKAKARYPNVCGSGLFMEAIEQNPVYYELAFEMPNHADSIPLQAWLAAYAERR 453

Query: 251 YGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS 310
           YG    A   AW  L    Y    G     R  IVA     P++     G     G P  
Sbjct: 454 YGAKSAAAGKAWMYLLEGPYR--RGTNGTERSSIVA---ARPALNVKKSGPNAGLGIP-- 506

Query: 311 KEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYA 370
                             Y    VIRA    +   ++L+ S  YR+D++D+ RQ +    
Sbjct: 507 ------------------YEPMLVIRAQSQLLKDADKLAFSKPYRFDIVDVQRQMMTNLG 548

Query: 371 NELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 430
             +     EA+   D       S RFLEL+ DMD LL     +    WL  A+   + +E
Sbjct: 549 QLVHKKAAEAFASKDKAAFALHSGRFLELLRDMDELLYTRSEYSFDRWLTEARSWGETKE 608

Query: 431 QEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 490
           ++   E +A + +T+W  +       + DY  + W+GL+  YY PR   ++  +   L++
Sbjct: 609 EKDLMERDATSLVTIWGADGDPR---IFDYSWREWAGLINGYYLPRWQKFYTMLQGHLDA 665

Query: 491 GDGF---------------------RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITS 529
           G  +                     RL +W   ++      Q G+   PV ++GD L+ +
Sbjct: 666 GTDYQEEGLSLAYGREDFRANDFYNRLAEWELAYVD-----QTGKARTPV-THGDELVVT 719

Query: 530 QWLYNKYLQ 538
           + L++KYL+
Sbjct: 720 RRLFDKYLK 728


>gi|237708859|ref|ZP_04539340.1| glycoside hydrolase family 89 protein [Bacteroides sp. 9_1_42FAA]
 gi|345513372|ref|ZP_08792893.1| glycoside hydrolase family 89 protein [Bacteroides dorei 5_1_36/D4]
 gi|423228941|ref|ZP_17215347.1| hypothetical protein HMPREF1063_01167 [Bacteroides dorei
           CL02T00C15]
 gi|423242228|ref|ZP_17223337.1| hypothetical protein HMPREF1065_03960 [Bacteroides dorei
           CL03T12C01]
 gi|423247755|ref|ZP_17228803.1| hypothetical protein HMPREF1064_05009 [Bacteroides dorei
           CL02T12C06]
 gi|229457285|gb|EEO63006.1| glycoside hydrolase family 89 protein [Bacteroides sp. 9_1_42FAA]
 gi|345456211|gb|EEO47557.2| glycoside hydrolase family 89 protein [Bacteroides dorei 5_1_36/D4]
 gi|392631297|gb|EIY25272.1| hypothetical protein HMPREF1064_05009 [Bacteroides dorei
           CL02T12C06]
 gi|392635177|gb|EIY29082.1| hypothetical protein HMPREF1063_01167 [Bacteroides dorei
           CL02T00C15]
 gi|392639514|gb|EIY33330.1| hypothetical protein HMPREF1065_03960 [Bacteroides dorei
           CL03T12C01]
          Length = 717

 Score =  225 bits (573), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 156/501 (31%), Positives = 248/501 (49%), Gaps = 67/501 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           VLP + G VP   +         +LG   +V +DP + C+Y     L   D  F EI   
Sbjct: 224 VLPGYCGMVPHNAKE--------KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISAL 272

Query: 64  FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           + ++  K YG+T   Y  D F E  +T  V+    + + G AI   M+  + DAVW+ Q 
Sbjct: 273 YYKELTKLYGKTG-FYAIDPFHEGGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQA 327

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWC 174
           W        +     +++  +  G L+VLDL +E +P W         K  YG   +++C
Sbjct: 328 W--------QDNPRTSMIEHLEAGDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYC 379

Query: 175 MLHNFAGNIEMYGILDSIAFGPVEARTSENT--TMVGVGMSMEGIEQNPVVYDLMSEMAF 232
           ML NF GNI ++G +D++  G  +A+T  +   T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 380 MLLNFGGNIGLHGKMDALINGFYDAKTDNHAGKTLCGVGMTPEGIENNPVMYELVMELPW 439

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
           +  +     W+  Y   RYG    A+Q AW++L + +YN       +  +  V  A P +
Sbjct: 440 REHRFTRDEWLKGYVYARYGVEDEALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGL 499

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
           D          YQ                SS+     +Y+  +VI A  L ++  ++   
Sbjct: 500 DV---------YQ---------------VSSWSEMKEYYNPQDVIEAARLMVSVADKYQG 535

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLAC 409
           +N + +DL+D+ RQALA+    +   +  A++  D   VF+L S+ FL L+   D LL  
Sbjct: 536 NNNFEFDLVDVLRQALAEKGRLMQKVVTAAFRAGDKQ-VFELASQHFLHLILLQDHLLGT 594

Query: 410 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 469
              F +G W+E+A+   Q +E++  YEWNAR QIT W +    +   LRDY +K W+G+L
Sbjct: 595 RKEFKVGTWIEAARSAGQTQEEKALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGIL 654

Query: 470 RDYYGPRAAIYFKYMIESLES 490
           +D+Y  R   YF Y+   L+ 
Sbjct: 655 KDFYFMRWKAYFDYLACVLDG 675


>gi|212693694|ref|ZP_03301822.1| hypothetical protein BACDOR_03214 [Bacteroides dorei DSM 17855]
 gi|265755881|ref|ZP_06090348.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_33FAA]
 gi|212663753|gb|EEB24327.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides dorei DSM 17855]
 gi|263233959|gb|EEZ19560.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_33FAA]
          Length = 718

 Score =  225 bits (573), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 156/501 (31%), Positives = 248/501 (49%), Gaps = 67/501 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           VLP + G VP   +         +LG   +V +DP + C+Y     L   D  F EI   
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISAL 273

Query: 64  FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           + ++  K YG+T   Y  D F E  +T  V+    + + G AI   M+  + DAVW+ Q 
Sbjct: 274 YYKELTKLYGKTG-FYAIDPFHEGGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQA 328

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWC 174
           W        +     +++  +  G L+VLDL +E +P W         K  YG   +++C
Sbjct: 329 W--------QDNPRTSMIEHLEAGDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYC 380

Query: 175 MLHNFAGNIEMYGILDSIAFGPVEARTSENT--TMVGVGMSMEGIEQNPVVYDLMSEMAF 232
           ML NF GNI ++G +D++  G  +A+T  +   T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNFGGNIGLHGKMDALINGFYDAKTDNHAGKTLCGVGMTPEGIENNPVMYELVMELPW 440

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
           +  +     W+  Y   RYG    A+Q AW++L + +YN       +  +  V  A P +
Sbjct: 441 REHRFTRDEWLKGYVYARYGVEDEALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGL 500

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
           D          YQ                SS+     +Y+  +VI A  L ++  ++   
Sbjct: 501 DV---------YQ---------------VSSWSEMKEYYNPQDVIEAARLMVSVADKYQG 536

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLAC 409
           +N + +DL+D+ RQALA+    +   +  A++  D   VF+L S+ FL L+   D LL  
Sbjct: 537 NNNFEFDLVDVLRQALAEKGRLMQKVVTAAFRAGDKQ-VFELASQHFLHLILLQDHLLGT 595

Query: 410 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 469
              F +G W+E+A+   Q +E++  YEWNAR QIT W +    +   LRDY +K W+G+L
Sbjct: 596 RKEFKVGTWIEAARSAGQTQEEKALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGIL 655

Query: 470 RDYYGPRAAIYFKYMIESLES 490
           +D+Y  R   YF Y+   L+ 
Sbjct: 656 KDFYFMRWKAYFDYLACVLDG 676


>gi|154492110|ref|ZP_02031736.1| hypothetical protein PARMER_01741 [Parabacteroides merdae ATCC
           43184]
 gi|154087335|gb|EDN86380.1| Alpha-N-acetylglucosaminidase (NAGLU) [Parabacteroides merdae ATCC
           43184]
          Length = 752

 Score =  224 bits (572), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 158/544 (29%), Positives = 241/544 (44%), Gaps = 69/544 (12%)

Query: 11  AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQ 67
            FSG VP  L+  +P AKI         +  P WC       LD TD LF  IGR F+E+
Sbjct: 232 GFSGYVPRELKEKYPDAKI---------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEE 282

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           + K YG    +Y  D F E+ PPVD+PEY+ ++G AI+      D +++W MQ W     
Sbjct: 283 EKKLYG-AHGVYAADPFHESQPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSLR-- 339

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
                   ++++ +VP   L++LDL              +G P +   LHNF G I ++G
Sbjct: 340 --------ESIVKAVPKENLLILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHG 388

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
            L  +A         +N  + G G+ ME IEQNPV YDL  EM    ++V+++ W+ +Y+
Sbjct: 389 DLRLLASNQYVNAVKKNPNVCGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYA 448

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
            RRYG+       AW  L    Y    G     R  I+A     P++     G     G 
Sbjct: 449 DRRYGKPSENAHQAWLHLLEGPYR--PGTNGTERSSIIA---ARPAVNVKKSGPNAGLGI 503

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
           P                    YS   V++A  L +     L  S+ YR+D++D+ RQ ++
Sbjct: 504 P--------------------YSPLSVVQAEGLLLKDAGRLKGSDPYRFDIVDIQRQLMS 543

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
                +     EA++  D       S RFLE++ D D LL     F    WL  A+    
Sbjct: 544 NLGQAIHKQAAEAFRKKDKEAFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGD 603

Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
           N E++  +E +A   +T+W     +   L+ DY  + W+GL+  YY  R   ++  + + 
Sbjct: 604 NSEEKDLFEKDATALVTVW---GADGDPLIFDYSWREWTGLIDGYYLKRWEKFYAMLQDH 660

Query: 488 LESGDGFRLKDWR----REWIKLTN------DWQ-----NGRNVYPVESNGDALITSQWL 532
           L++G  +  KD      RE  +  +      DW+         V    + GD + T+  L
Sbjct: 661 LDAGTNYSEKDLPQTHGRESFRANDFYSTLGDWELQFVSTPDKVRTPITQGDEVETATRL 720

Query: 533 YNKY 536
           Y KY
Sbjct: 721 YKKY 724


>gi|29349767|ref|NP_813270.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29341678|gb|AAO79464.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 744

 Score =  224 bits (571), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 146/489 (29%), Positives = 238/489 (48%), Gaps = 59/489 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAF 64
           V P ++G VP  +          +LG  + +    +WC       L   D  F      +
Sbjct: 242 VFPGYAGMVPRNIGE--------KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMY 291

Query: 65  IEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            E+  K YG+  + Y+ D F E  NT  VD    ++  G +I S M+  + +AVW+MQ W
Sbjct: 292 YEELEKLYGKAKY-YSMDPFHEGGNTEGVD----LAKAGTSIMSAMKKANPEAVWVMQAW 346

Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCM 175
                   +    +A+++++  G L+VLDL++E  P       +W   K F    +++CM
Sbjct: 347 --------QANPREAMVSTLDSGDLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCM 398

Query: 176 LHNFAGNIEMYGILDSIAFGPVEARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 234
           L NF GN+ ++G ++ +  G   A    N  T+ GVG + EGIE NPV+++L+ E+ ++ 
Sbjct: 399 LLNFGGNVGLHGRMEQLVNGYYNACAHVNGKTLRGVGATPEGIENNPVMFELLYELPWRE 458

Query: 235 EKVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPS 293
           E+    AW+  Y   RYG  + P + +AW  L HTVYN       +             S
Sbjct: 459 ERFAPDAWLQAYLKARYGNDLSPEVAEAWRALEHTVYNAPKNYQGEG---------TVES 509

Query: 294 IISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 353
           ++    G +Q+              TS++ +  L+YS     +A  L ++  ++   +N 
Sbjct: 510 LLCARPGFHQD-------------RTSTWGYAKLFYSPDSTAKAARLLLSVADQYKGNNN 556

Query: 354 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 413
           + YDL+D+ RQ+LA   N L   I ++Y   D     + S++FLEL+   D LL+    F
Sbjct: 557 FEYDLVDVVRQSLADKGNVLLEEISQSYDRKDKDSFGKQSQQFLELILAQDSLLSTRKEF 616

Query: 414 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 473
            +  WL +A+ L   EE++K YEWNA   IT+W D+       L DY ++ WSG+L+D Y
Sbjct: 617 SVSSWLNAARSLGTTEEEKKLYEWNASALITVWGDSIAANRGGLHDYSHREWSGILKDLY 676

Query: 474 GPRAAIYFK 482
             R   +F+
Sbjct: 677 YQRWKTFFE 685


>gi|383120707|ref|ZP_09941431.1| hypothetical protein BSIG_2292 [Bacteroides sp. 1_1_6]
 gi|382984934|gb|EES68331.2| hypothetical protein BSIG_2292 [Bacteroides sp. 1_1_6]
          Length = 736

 Score =  224 bits (571), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 146/489 (29%), Positives = 238/489 (48%), Gaps = 59/489 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAF 64
           V P ++G VP  +          +LG  + +    +WC       L   D  F      +
Sbjct: 234 VFPGYAGMVPRNIGE--------KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMY 283

Query: 65  IEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            E+  K YG+  + Y+ D F E  NT  VD    ++  G +I S M+  + +AVW+MQ W
Sbjct: 284 YEELEKLYGKAKY-YSMDPFHEGGNTEGVD----LAKAGTSIMSAMKKANPEAVWVMQAW 338

Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCM 175
                   +    +A+++++  G L+VLDL++E  P       +W   K F    +++CM
Sbjct: 339 --------QANPREAMVSTLDSGDLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCM 390

Query: 176 LHNFAGNIEMYGILDSIAFGPVEARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 234
           L NF GN+ ++G ++ +  G   A    N  T+ GVG + EGIE NPV+++L+ E+ ++ 
Sbjct: 391 LLNFGGNVGLHGRMEQLVNGYYNACAHVNGKTLRGVGATPEGIENNPVMFELLYELPWRE 450

Query: 235 EKVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPS 293
           E+    AW+  Y   RYG  + P + +AW  L HTVYN       +             S
Sbjct: 451 ERFAPDAWLQAYLKARYGNDLSPEVAEAWRALEHTVYNAPKNYQGEG---------TVES 501

Query: 294 IISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 353
           ++    G +Q+              TS++ +  L+YS     +A  L ++  ++   +N 
Sbjct: 502 LLCARPGFHQD-------------RTSTWGYAKLFYSPDSTAKAARLLLSVADQYKGNNN 548

Query: 354 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 413
           + YDL+D+ RQ+LA   N L   I ++Y   D     + S++FLEL+   D LL+    F
Sbjct: 549 FEYDLVDVVRQSLADKGNVLLEEISQSYDRKDKDSFGKQSQQFLELILAQDSLLSTRKEF 608

Query: 414 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 473
            +  WL +A+ L   EE++K YEWNA   IT+W D+       L DY ++ WSG+L+D Y
Sbjct: 609 SVSSWLNAARSLGTTEEEKKLYEWNASALITVWGDSIAANRGGLHDYSHREWSGILKDLY 668

Query: 474 GPRAAIYFK 482
             R   +F+
Sbjct: 669 YQRWKTFFE 677


>gi|262406058|ref|ZP_06082608.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
 gi|294806855|ref|ZP_06765680.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
           CC 1b]
 gi|345510563|ref|ZP_08790130.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
 gi|262356933|gb|EEZ06023.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
 gi|294445884|gb|EFG14526.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
           CC 1b]
 gi|345454460|gb|EEO49066.2| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
          Length = 727

 Score =  224 bits (571), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 159/540 (29%), Positives = 256/540 (47%), Gaps = 60/540 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPA L+ ++P A I  LG W       R  C +L +  D LF +I + F+++
Sbjct: 232 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 288

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G T HIY  D F+E  PP   PEY+  + + +Y+ + + D  A W+   W+F +D
Sbjct: 289 QKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFD 347

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W   +MKALL  VP  K+++LD   E   +W  ++ F+  PYIWC L NF GN  + 
Sbjct: 348 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLT 407

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +         A  +    + G+G ++EG++     Y+ + E A+ +  VD   WI   
Sbjct: 408 GNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECL 466

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R  G     ++DAW  L++ +Y              V  P         T G    Y 
Sbjct: 467 ADRHVGCVSQPVRDAWKRLFNDIY--------------VQVPR--------TLGTLPGY- 503

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
           +P   +   K  ++ Y +  L     EV R L       NE  +   + +R DLI + RQ
Sbjct: 504 RPALNKNSEKRTSNVYSNVEL----LEVWRKL-------NEAPSDRRDAFRLDLITVGRQ 552

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            L  Y  ++ +      +  D   +     +  E++ D+D L A H    L  W++ A++
Sbjct: 553 VLGNYFLDVKMEFDRMVEAKDHQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARK 612

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
           +  + + +  YE NAR  IT W          L DY ++ W+GL+ DYY  R  +Y    
Sbjct: 613 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYVNTF 665

Query: 485 IESLESG---DGFRLKDWRRE----WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 536
           I++ E G   D  +L+D  +E    W+  T+     ++V+   S  D L++ S +L++KY
Sbjct: 666 IKAAEEGVEVDQKQLEDELKEIEEGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722


>gi|358378969|gb|EHK16650.1| glycoside hydrolase family 89 protein [Trichoderma virens Gv29-8]
          Length = 748

 Score =  223 bits (569), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 141/485 (29%), Positives = 255/485 (52%), Gaps = 41/485 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF G VP  +  VFP   ++    W +  ++        ++  DP F ++ + FI +
Sbjct: 239 ILPAFPGFVPRNISRVFPDISLSTSPIWSNFGTEL--SADIYINPFDPRFAQLQKLFISK 296

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + YG  ++ +  D F+EN P      Y+ ++    +S +++ D +AVW+MQ WLFS D
Sbjct: 297 QQELYGNVTNFWTLDQFNENQPLSGDLGYLQNVSHNTWSALKAADPEAVWVMQAWLFSSD 356

Query: 128 -PFWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             FW   ++++ L  +P+   +++LDLFAE  P W  +  FYG P+IWC LH++ GN+ +
Sbjct: 357 SAFWTNDRIESFLGGIPVNSDMLLLDLFAESAPQWLRTNSFYGKPWIWCELHDYGGNMGL 416

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG ++++    ++A    + ++VG G++MEG E N ++YDL+ + A+  + +D + + + 
Sbjct: 417 YGQIENVTINSMDA-VRNSGSLVGFGLTMEGQEGNEIMYDLLLDQAWSPKPIDTETYFHD 475

Query: 246 YSVRRYG-RSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           +   RYG ++V ++   W +L  TV+N T+   +  +  I+   ++ PS   +  G+  +
Sbjct: 476 WVSTRYGTKNVKSLYTGWELLRPTVFNNTNLTMNAVQKSIL---ELVPSTTGLL-GRVGH 531

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRA-LELFIASGNELS--ASNTYRYDLIDL 361
           +G  ++                  Y+ + ++ A  ELF A   ++    +  Y+YDL+D 
Sbjct: 532 HGTTIT------------------YNPAVMVEAWTELFKAGLQDIKLFTNPAYQYDLVDW 573

Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSR--RFLELVEDMDGLLACHDGFLLGPWL 419
           TRQ L      L+ +++ AY    +  V + SR  + + L+  +D +LA ++ F L PW+
Sbjct: 574 TRQVLVNSFEGLYKDLVAAYNSAASSSVIK-SRGAKLIALLRTLDAVLATNEHFQLTPWI 632

Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
             A+  A +       E+NAR QIT+W      E     DY +K W+GL+  YY  R   
Sbjct: 633 NEAR--ASSPSTADFLEYNARNQITLWGPQGNIE-----DYASKQWAGLVGTYYVERWQQ 685

Query: 480 YFKYM 484
           +  Y+
Sbjct: 686 FIDYL 690


>gi|346323119|gb|EGX92717.1| alpha-N-acetylglucosaminidase, putative [Cordyceps militaris CM01]
          Length = 742

 Score =  223 bits (568), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 147/496 (29%), Positives = 248/496 (50%), Gaps = 38/496 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF G VP A   + P A + +   W  + +D        L   D  + E+ R F+E 
Sbjct: 223 ILPAFPGFVPDAFARLRPGADLVRAPAWGGLPADSPNTRALFLSPLDDAYAELQRLFVEA 282

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF--S 125
           Q++ YG  +++Y  D F+E  P   + +Y+S++    Y+ + + +  AVWLMQGWLF  S
Sbjct: 283 QIEAYGNVTNVYAMDQFNEINPVSGATDYLSAVSRRSYAALAAANPAAVWLMQGWLFYLS 342

Query: 126 YDPFWRPPQMKALLNSVP-LGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
              FW   +++A L        +V+LDLF+E  P W  +  + G P+IWC +H+F GN  
Sbjct: 343 EGNFWTQERIEAYLRGPEDRAGMVILDLFSETAPQWQRTGSYAGRPWIWCQVHDFGGNQN 402

Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
           ++G + +    P+EA   E+ +MVG+G++ E  E N V+YDL  +  +    +D  ++ +
Sbjct: 403 LFGKITNTTVNPMEA-LRESDSMVGLGIATEAYEGNEVLYDLFFDQGWSATPIDTVSYFH 461

Query: 245 QYSVRRYG--RSVPA-IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
            ++ RRY   R +PA +  AW +L  TVY+    A+D    V V+   ++P++  +    
Sbjct: 462 DWTTRRYSGVRQLPASLYQAWELLRVTVYDYR--ASDLI-GVPVSVYQLEPNLTGL---- 514

Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGN---ELSASNTYRYDL 358
              Y     K   L  + ++   P +W           LF+A+      L A   +R DL
Sbjct: 515 ---YNTTTGKPTALHYDPAAL--PPIW----------RLFVAAAAAQPRLWAEPGFRLDL 559

Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHG-VFQLSRRFLELVEDMDGLLACHDGFLLGP 417
           +D+ RQ L+     L+ +++ A+        + Q  +R   ++ D+D LLA    F L  
Sbjct: 560 VDVMRQVLSNAFGRLYADLVAAFTGGAPPSEIAQRGQRMRAVLGDVDALLATQPHFSLRR 619

Query: 418 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 477
           WL +A+   ++  +     + AR+Q+T+W   T     LL DY  K WSGL+  YY  R 
Sbjct: 620 WLNAARAWGESTGENAAIAYEARSQVTIWAPGT-----LLNDYAAKAWSGLIATYYDERW 674

Query: 478 AIYFKYMIESLESGDG 493
            I+   ++++ E+  G
Sbjct: 675 RIFVDRLVDAAENHGG 690


>gi|410096483|ref|ZP_11291470.1| hypothetical protein HMPREF1076_00648 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409226447|gb|EKN19356.1| hypothetical protein HMPREF1076_00648 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 718

 Score =  223 bits (567), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 162/545 (29%), Positives = 265/545 (48%), Gaps = 75/545 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           V P +SG VP   +         +LG   +V SDP   C Y     L  TDP F EI   
Sbjct: 224 VFPGYSGMVPHNAKE--------KLG--LNV-SDPGLWCGYHRPAFLQPTDPRFQEIASL 272

Query: 64  FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           + ++  K YG+ ++ Y+ D F E  +   VD    + + G AI   M+  +  AVW+ Q 
Sbjct: 273 YYKELNKLYGK-ANFYSMDPFHEGGSVAGVD----LDAAGKAIMQAMKKNNPKAVWVAQA 327

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQ-------FYGVPYIWC 174
           W        +      ++ ++  G ++VLDLF+E +P W   +        F    +I+C
Sbjct: 328 W--------QANPRSQMIENLKAGDMIVLDLFSESRPQWGDPESTWHRKDGFGQHDWIYC 379

Query: 175 MLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQ 233
           ML N+ GN+ ++G +  +     +A+ S    T+ GVGM+MEG E NPV+++L++E+ ++
Sbjct: 380 MLLNYGGNVGLHGKMAHVIDEYYKAKESSFGKTLCGVGMTMEGSENNPVMFELLTELPWR 439

Query: 234 HEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVD 291
               D   W+  Y+V RYG++ P +Q+AW +L +++YNC    T +  +  +  A P   
Sbjct: 440 PVHFDKNEWLKNYTVARYGKANPTVQEAWILLSNSIYNCPPENTQQGTHESIFCARPSDH 499

Query: 292 PSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS 351
           P ++S                    SE S Y      Y+  +VIRA  + ++  ++ + +
Sbjct: 500 PYLVSSW------------------SEMSDY------YNPDDVIRAAAMMVSVADQFTGN 535

Query: 352 NTYRYDLIDLTRQALAKYANELFLNIIEA-YQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
           N + YDL+D+ RQA+A+    L   ++EA +   D       + RFL+L+   D LL   
Sbjct: 536 NNFEYDLVDIVRQAIAE-KGRLVEKVVEASFASGDKQLYNTAANRFLQLLLLQDELLGTR 594

Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
             F +G W+   + L    E++  YEWNAR QIT W +    +   LRDY +K W+G+L+
Sbjct: 595 PEFKVGNWIARTRSLGNTPEEKDLYEWNARVQITTWGNRNAADKGGLRDYAHKEWNGILK 654

Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRRE--WIKLTNDWQNGRNVYPVESNGDALIT 528
           D+Y  R   +F Y  E L+      +  +  E  W KLT+        Y  E  GD + T
Sbjct: 655 DFYYMRWKTWFDYQNELLDGKKPTAIDFYALEEPWTKLTDS-------YSSEPEGDCIST 707

Query: 529 SQWLY 533
            + ++
Sbjct: 708 VKRIF 712


>gi|298480128|ref|ZP_06998327.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
 gi|336404356|ref|ZP_08585054.1| hypothetical protein HMPREF0127_02367 [Bacteroides sp. 1_1_30]
 gi|298273937|gb|EFI15499.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
 gi|335943684|gb|EGN05523.1| hypothetical protein HMPREF0127_02367 [Bacteroides sp. 1_1_30]
          Length = 727

 Score =  222 bits (566), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 156/537 (29%), Positives = 250/537 (46%), Gaps = 54/537 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPA L+ ++P A I  LG W       R  C +L +  D LF +I + F+++
Sbjct: 232 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 288

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G T HIY  D F+E  PP   PEY+  + + +Y+ + + D  A W+   W+F +D
Sbjct: 289 QKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFD 347

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W   +MKALL  VP  K+++LD   E   +W  ++ F+  PYIWC L NF GN  + 
Sbjct: 348 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLT 407

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +         A  +    + G+G ++EG++     Y+ + E A+ +  VD   WI   
Sbjct: 408 GNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECL 466

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R  G    +++DAW  L++ +Y              V  P         T G    Y 
Sbjct: 467 ADRHVGCVSQSVRDAWKRLFNDIY--------------VQVPR--------TLGTLPGY- 503

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
           +P   +   K  ++ Y +  L     EV R L       NE  +   + +R DLI + RQ
Sbjct: 504 RPALNKNSEKRTSNVYSNVEL----LEVWRKL-------NEAPSDRRDAFRLDLITVGRQ 552

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            L  Y  ++ +      +  D   +     +  E++ D+D L A H    L  W++ A++
Sbjct: 553 VLGNYFLDVKMEFDRMVEAKDHQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARK 612

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
           +  + + +  YE NAR  IT W          L DY ++ W+GL+ DYY  R  +Y    
Sbjct: 613 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYIDTF 665

Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE----SNGDALIT-SQWLYNKY 536
           I+++  G     K    E  ++   W N  +   V     S  D L++ S +L++KY
Sbjct: 666 IKAVGEGVEVDQKQLEDELKEIEEGWVNATDRKDVRKDVHSTTDGLLSFSTFLFSKY 722


>gi|423293377|ref|ZP_17271504.1| hypothetical protein HMPREF1070_00169 [Bacteroides ovatus
           CL03T12C18]
 gi|392678320|gb|EIY71728.1| hypothetical protein HMPREF1070_00169 [Bacteroides ovatus
           CL03T12C18]
          Length = 727

 Score =  222 bits (566), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 156/540 (28%), Positives = 256/540 (47%), Gaps = 60/540 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPA L+ ++P A I  LG W       R  C +L +  D LF +I + F+++
Sbjct: 232 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 288

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G T HIY  D F+E  PP   PEY+  + + +Y+ + + D  A W+   W+F +D
Sbjct: 289 QKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFD 347

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W   +MKALL  VP  K+++LD   E   +W  ++ F+  PYIWC L NF GN  + 
Sbjct: 348 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLT 407

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +         A  +    + G+G ++EG++     Y+ + E A+ +  VD   WI   
Sbjct: 408 GNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECL 466

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R  G     ++DAW  L++ +Y                      + +  T G    Y 
Sbjct: 467 ADRHVGCVSQPVRDAWKRLFNDIY----------------------AQVPRTLGTLPGY- 503

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
           +P   +   K  ++ Y +  L     EV R L       NE  +   + +R DLI + RQ
Sbjct: 504 RPALNKNSEKRTSNVYSNVEL----LEVWRKL-------NEAPSDRRDAFRLDLITVGRQ 552

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            L  Y  ++ +      +  D   +     +  E++ D+D L A H    L  W++ A++
Sbjct: 553 VLGNYFLDVKMEFDRMVEAKDYQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARK 612

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
           +  + + +  YE NAR  IT W          L DY ++ W+GL+ DYY  R  +Y    
Sbjct: 613 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYIDTF 665

Query: 485 IESLESG---DGFRLKDWRRE----WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 536
           I+++  G   D  +L+D  +E    W+  T+     ++V+   S  D L++ S +L++KY
Sbjct: 666 IKAVGEGVEVDQKQLEDELKEIEEGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722


>gi|383115203|ref|ZP_09935961.1| hypothetical protein BSGG_2915 [Bacteroides sp. D2]
 gi|313695380|gb|EFS32215.1| hypothetical protein BSGG_2915 [Bacteroides sp. D2]
          Length = 727

 Score =  222 bits (565), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 157/537 (29%), Positives = 249/537 (46%), Gaps = 54/537 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPA L+ ++P A I  LG W       R  C +L +  D LF +I + F+++
Sbjct: 232 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 288

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G T HIY  D F+E  PP   PEY+  + + +Y+ + + D  A W+   W+F +D
Sbjct: 289 QKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFD 347

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W   +MKALL  VP  K+++LD   E   +W  ++ F+  PYIWC L NF GN  + 
Sbjct: 348 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLT 407

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +         A  +    + G+G ++EG++     Y+ + E A+     D K WI   
Sbjct: 408 GNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAWNLNADDNK-WIECL 466

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R  G    +++DAW  L++ +Y              V  P         T G    Y 
Sbjct: 467 ADRHVGCVSQSVRDAWKRLFNDIY--------------VQVPR--------TLGTLPGY- 503

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
           +P   +   K  ++ Y +  L     EV R L       NE  +   + +R DLI + RQ
Sbjct: 504 RPALNKNSEKRTSNVYSNVEL----LEVWRKL-------NEAPSDRRDAFRLDLITVGRQ 552

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            L  Y  ++ +      +  D   +     +  E++ D+D L A H    L  W++ A++
Sbjct: 553 VLGNYFFDVKVEFDRMVEAKDYQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARK 612

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
           +  + + +  YE NAR  IT W          L DY ++ W+GL+ DYY  R  +Y    
Sbjct: 613 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYINTF 665

Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE----SNGDALIT-SQWLYNKY 536
           I++ E G     K    E  ++   W N  +   V     S  D L++ S +L++KY
Sbjct: 666 IKAAEKGVEVDQKQLEDELKEIEEGWVNATDRKDVRKDIHSATDGLLSFSTFLFSKY 722


>gi|423345423|ref|ZP_17323112.1| hypothetical protein HMPREF1060_00784 [Parabacteroides merdae
           CL03T12C32]
 gi|409223209|gb|EKN16146.1| hypothetical protein HMPREF1060_00784 [Parabacteroides merdae
           CL03T12C32]
          Length = 752

 Score =  222 bits (565), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 161/544 (29%), Positives = 241/544 (44%), Gaps = 69/544 (12%)

Query: 11  AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQ 67
            FSG VP  L+  +P AKI         +  P WC       LD TD LF  IGR F+E+
Sbjct: 232 GFSGYVPRELKEKYPDAKI---------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEE 282

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           + K YG    +Y  D F E+ PPVD+PEY+ ++G AI+      D +++W MQ W     
Sbjct: 283 EKKLYG-AHGVYAADPFHESQPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSL--- 338

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
              R P +KA    VP   L++LDL              +G P +   LHNF G I ++G
Sbjct: 339 ---REPIVKA----VPKENLLILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHG 388

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
            L  +A         +N  + G G+ ME IEQNPV YDL  EM    ++V+++ W+ +Y+
Sbjct: 389 DLRLLASNQYVNAVKKNPNVCGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYA 448

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
            RRYG+       AW  L    Y    G     R  I+A     P++     G     G 
Sbjct: 449 DRRYGKPSENAHQAWLHLLEGPYR--PGTNGTERSSIIA---ARPAVNVKKSGPNAGLGI 503

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
           P                    YS   V++A  L +     L  S+ YR+D++D+ RQ ++
Sbjct: 504 P--------------------YSPLSVVQAEGLLLKDAARLEDSDPYRFDIVDIQRQLMS 543

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
                +     +A++  D       S RFLE++ D D LL     F    WL  A+    
Sbjct: 544 NLGQVIHKQAAKAFRKKDKEAFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGD 603

Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
           N E++  +E +A   +T+W     +   L+ DY  + W+GL+  YY  R   ++  + + 
Sbjct: 604 NSEEKDLFEKDATALVTVW---GADGDPLIFDYSWREWTGLIDGYYLKRWEKFYAMLQDH 660

Query: 488 LESGDGFRLKDWR----REWIKLTN------DWQ-----NGRNVYPVESNGDALITSQWL 532
           L++G  +  KD      RE  +  +      DW+         V    + GD + T+  L
Sbjct: 661 LDAGTNYSEKDLPQTHGRESFRANDFYSTLGDWELQFVSTPDKVRTPITQGDEVETATRL 720

Query: 533 YNKY 536
           Y KY
Sbjct: 721 YKKY 724


>gi|299144715|ref|ZP_07037783.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
 gi|298515206|gb|EFI39087.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
          Length = 727

 Score =  222 bits (565), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 156/540 (28%), Positives = 257/540 (47%), Gaps = 60/540 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPA L+ ++P A I  LG W       R  C +L +  D LF +I + F+++
Sbjct: 232 VLPAFAGHVPADLKRLYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 288

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G   HIY  D F+E  PP   PEY+  + + +Y+ + + D  A W+   W+F +D
Sbjct: 289 QKKLFG-IDHIYGLDPFNEVDPPSFEPEYLRKIVSDMYATLTAADPKAQWMQMTWMFYFD 347

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W   +MKALL  VP  K+++LD   E   +W  ++ F+  PYIWC L NF GN  + 
Sbjct: 348 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLT 407

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +         A  +    + G+G ++EG++     Y+ + E A+ +  VD   WI   
Sbjct: 408 GNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECL 466

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R  G    +++DAW  L++ +Y                      + +  T G    Y 
Sbjct: 467 ADRHVGCVSQSVRDAWKRLFNDIY----------------------AQVPRTLGTLPGY- 503

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
           +P   +   K  ++ Y +  L     EV R L       NE  +   + +R DLI + RQ
Sbjct: 504 RPALNKNSEKRTSNVYSNVEL----LEVWRKL-------NEAPSDRRDAFRLDLITVGRQ 552

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            L  Y  ++ +      +  D   +     +  E++ D+D L A H    L  W++ A++
Sbjct: 553 VLGNYFLDVKMEFDRMVEAKDYQALKACGEKMKEILHDLDKLNAFHPYCSLDKWIDDARK 612

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
           +  + + +  YE NAR  IT W          L DY ++ W+GL+RDYY  R  +Y    
Sbjct: 613 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLIRDYYAKRWEVYINTF 665

Query: 485 IESLESG---DGFRLKDWRRE----WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 536
           I+++  G   D  +L+D  +E    W+  T+     ++V+   S  D L++ S +L++KY
Sbjct: 666 IKAVGEGVEVDQKQLEDELKEIEEGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722


>gi|399028591|ref|ZP_10729778.1| Alpha-N-acetylglucosaminidase (NAGLU) [Flavobacterium sp. CF136]
 gi|398073682|gb|EJL64846.1| Alpha-N-acetylglucosaminidase (NAGLU) [Flavobacterium sp. CF136]
          Length = 727

 Score =  221 bits (564), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 152/512 (29%), Positives = 243/512 (47%), Gaps = 49/512 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V+PAFSG VP A     P +KI++L +W    S   +  TYLLDA DPLF EIG+ FIE 
Sbjct: 246 VVPAFSGYVPKAFAEKHPGSKISELKSW----SGGGFESTYLLDANDPLFKEIGKRFIEI 301

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLF 124
             K YG+ +  Y  D F+E TPPV      E +S  G  I+  +     DA W+MQGWLF
Sbjct: 302 YTKLYGQ-ADFYLADAFNEITPPVSKEHKYEELSDYGKTIFETINEASPDATWVMQGWLF 360

Query: 125 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
             +  FW     KA L+ VP  ++++ D   +   +W   + FYG  + +  +HN+ G+ 
Sbjct: 361 GDNKEFWTKEATKAFLSKVPNDRMMIQDYANDRHKVWEKQEAFYGKQWTYGYVHNYGGSN 420

Query: 184 EMYGILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
            +YG L+           + N   +VG G+  EG+  N +VY+ + ++ +   K  V  W
Sbjct: 421 PVYGDLNFYKNELTHLLGNSNKGNVVGYGVMPEGLNNNSIVYEYIYDLPWSQGKESVNDW 480

Query: 243 INQYSVRRYGR--SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
           +N+Y   RYG+  S P  Q AW +L  +VY+     T    D   A+       + +TE 
Sbjct: 481 LNKYLSARYGKNISTPVFQ-AWKLLIESVYSTKYWETRWWDDRAGAYLFFKRPTLKITEF 539

Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
           K    G P  K+                    ++ +AL++        + ++ Y YDL+D
Sbjct: 540 K----GNPGDKQ--------------------KLKQALDILKRESKSFNKNSLYFYDLLD 575

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           ++R   +   ++L +  + AY+L D     +L ++  +   D+D +L+      L  WL+
Sbjct: 576 MSRHYYSLCIDDLLIECVTAYELKDIKKADELFKKIEKQALDIDNMLSGQPLNSLNNWLK 635

Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
           SA     + E  K Y  NA+T IT+W          L DY ++ W G+ + +Y PR  ++
Sbjct: 636 SASDYGSSPEVSKLYVKNAKTLITLWGGEGH-----LNDYASRSWRGMYKGFYWPRWKMF 690

Query: 481 FKYMIESLESGDGF-------RLKDWRREWIK 505
            +   ES+ +   F        +K W  +W +
Sbjct: 691 LQAQRESVVNNTSFDELKVRESIKQWEIKWCQ 722


>gi|423214204|ref|ZP_17200732.1| hypothetical protein HMPREF1074_02264 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392693149|gb|EIY86384.1| hypothetical protein HMPREF1074_02264 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 727

 Score =  221 bits (564), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 154/537 (28%), Positives = 249/537 (46%), Gaps = 54/537 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPA L+ ++P A I  LG W       R  C +L +  D LF +I + F+++
Sbjct: 232 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 288

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G T H+Y  D F+E  PP   PEY+  + + +Y+ + + D  A W+   W+F +D
Sbjct: 289 QKKLFG-TDHVYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFD 347

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W   +MKALL  VP  K+++LD   E   +W  ++ F+  PYIWC L NF GN  + 
Sbjct: 348 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLT 407

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +         A  +    + G+G ++EG++     Y+ + E A+ +  VD   WI   
Sbjct: 408 GNVKESGERLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDDKWIECL 466

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R  G     ++DAW  L++ +Y                      + +  T G    Y 
Sbjct: 467 ADRHVGCVSQPVRDAWKRLFNDIY----------------------AQVPRTLGTLPGY- 503

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
           +P   +   K  ++ Y +  L     EV R L       NE  +   + +R DLI + RQ
Sbjct: 504 RPALNKNSEKRTSNVYSNIEL----LEVWRKL-------NEAPSDRRDAFRLDLITVGRQ 552

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            L  Y  ++ +      +  D   +     +  E++ D+D L A H    L  W++ A++
Sbjct: 553 VLGNYFLDVKMEFDRMVEAKDYQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARK 612

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
           +  + + +  YE NAR  IT W          L DY ++ W+GL+ DYY  R  +Y    
Sbjct: 613 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYIDTF 665

Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE----SNGDALIT-SQWLYNKY 536
           I++ E G     K    E  ++   W N  +   V     S  D L++ S +L++KY
Sbjct: 666 IKAAEKGVEVDQKQLEDELKEIEEGWVNATDRKDVRKDIHSATDGLLSFSTFLFSKY 722


>gi|429766730|ref|ZP_19298977.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
           celatum DSM 1785]
 gi|429183354|gb|EKY24416.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
           celatum DSM 1785]
          Length = 2284

 Score =  221 bits (564), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 156/528 (29%), Positives = 255/528 (48%), Gaps = 54/528 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VL  +SG VP   Q   P A+I   G W      P    TY+ D     F E+   F E+
Sbjct: 415 VLQGYSGMVPLDFQKKNPDAQILSQGGWCGFDR-PNMLKTYVNDGERDYFQEVADVFYEK 473

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q + YG  +  Y  D F E  NT  +DS    + +   I   M   D DA+W++Q W  +
Sbjct: 474 QKEVYGDITDYYAVDPFHEGGNTGGMDS----ARIYGTIQDKMIEHDEDAIWVIQHWQGN 529

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
            D      ++  L N     + ++LDL +++ P + T      +P++W MLHNF G + +
Sbjct: 530 PDN----TKLSGLTNK---EQALILDLNSDLNPDY-TRFDNQDIPWVWNMLHNFGGRMGL 581

Query: 186 YGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
            G ++++A    EA  T+EN  M G+G++ E +  +P+VY+LM +M +  + ++ + W+N
Sbjct: 582 DGQVETVATSITEALATTEN--MKGIGITPEALANSPIVYELMGDMIWTRDPINYREWVN 639

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTD----GATDKNRDVIVAFPDVDPSIISVTEG 300
            Y  RRYG       +AW +L  T Y  +D    GA +              SII+    
Sbjct: 640 NYIERRYGAVNEDAIEAWEILLETAYKTSDYYYQGAAE--------------SIIN---- 681

Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
                    ++ A   +  S++ H  + Y   E+ RA+ELFI+  +EL  S+ + YD +D
Sbjct: 682 ---------ARPATSINSASTWGHSKISYDKKELERAMELFISCYDELKDSDAFVYDFLD 732

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           +T+Q LA  A E    ++ AY   DA    ++S  FL+L+   + +L+    FL+G W+E
Sbjct: 733 VTKQVLANSAQEYHKEMVAAYNSGDAEKFERISEHFLDLIRLQERVLSTSPEFLVGTWIE 792

Query: 421 SAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
            ++ +  + +   +  +E+NAR  IT W D        L+DY N+ W+GL  D Y  R  
Sbjct: 793 QSRTMLADADDWTKDLFEFNARALITTWGDYKN---GSLKDYSNRQWAGLTEDLYLKRWE 849

Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL 526
           ++   +   LE+G      DW +   +   +  +  N YP E +G+ L
Sbjct: 850 MWIDGIRTELETGVTAPSIDWHKVEYEWATEKTDESNAYPTEGSGEDL 897


>gi|423312588|ref|ZP_17290525.1| hypothetical protein HMPREF1058_01137 [Bacteroides vulgatus
           CL09T03C04]
 gi|392688276|gb|EIY81565.1| hypothetical protein HMPREF1058_01137 [Bacteroides vulgatus
           CL09T03C04]
          Length = 717

 Score =  221 bits (564), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 157/495 (31%), Positives = 244/495 (49%), Gaps = 67/495 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           VLP + G VP   +         +LG   +V +DP + C+Y     L   D  F EI   
Sbjct: 224 VLPGYCGMVPHNAKE--------KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISAL 272

Query: 64  FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           +  +  K YG+T   Y  D F E  +T  V+    + + G AI   M+  + DAVW+ Q 
Sbjct: 273 YYRELTKLYGKTG-FYAIDPFHEGGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQA 327

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWC 174
           W    +P  R P    ++  +  G L+VLDL +E +P W         K  YG   +++C
Sbjct: 328 W--QDNP--RTP----MIEHLEAGDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYC 379

Query: 175 MLHNFAGNIEMYGILDSI--AFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
           ML NF GNI ++G +D++   F   +A      T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 380 MLLNFGGNIGLHGKMDALIDGFYDAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPW 439

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
           +  +     W+  Y   RYG    A+Q AW++L + +YN       +  +  V  A P +
Sbjct: 440 REHRFTRDEWLKGYVYARYGVEDEALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGL 499

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
           D          YQ                SS+     +Y+  +VI A  L ++  ++   
Sbjct: 500 DV---------YQ---------------VSSWSEMKEYYNPQDVIEAARLMVSVADKYQG 535

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLAC 409
           +N + +DL+D+ RQALA+    +   +  A++  D   VF+L S+ FL L+   D LL  
Sbjct: 536 NNNFEFDLVDVLRQALAEKGRLMQKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGT 594

Query: 410 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 469
              F +G W+E+A+   Q +E++  YEWNAR QIT W +    +   LRDY +K W+G+L
Sbjct: 595 RKEFKVGTWIEAARSAGQTQEEKALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGIL 654

Query: 470 RDYYGPRAAIYFKYM 484
           +D+Y  R   YF Y+
Sbjct: 655 KDFYFMRWKAYFDYL 669


>gi|373461342|ref|ZP_09553084.1| hypothetical protein HMPREF9944_01348 [Prevotella maculosa OT 289]
 gi|371952896|gb|EHO70729.1| hypothetical protein HMPREF9944_01348 [Prevotella maculosa OT 289]
          Length = 731

 Score =  221 bits (563), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 142/485 (29%), Positives = 236/485 (48%), Gaps = 53/485 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP   + + P A+IT++G W       R   TY L  TDP F ++ ++F+++
Sbjct: 231 VLPAFNGSVPLDYKRLHPEARITEVGQWGGFGQAYR---TYFLSPTDPRFGKLQKSFLDE 287

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + +G T H+Y  D+F+E  PP  SP+ +  L   I++ +   D  +VW+  GWLF  D
Sbjct: 288 QRRMFG-TDHLYCLDSFNEVQPPSWSPDTLCMLARHIHASLDKADPQSVWVQMGWLFYND 346

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W P  ++A L+ +P  + ++LD + +   +W  ++ FYG PYI C+L NF GN  + 
Sbjct: 347 RKHWTPDVIRAYLSGIPKDRALLLDYYIDHTELWRLTESFYGRPYIACVLGNFGGNTMLQ 406

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +  ++   ++A  +++  M GVG +MEG   NP  Y  + + A+     D + W+ + 
Sbjct: 407 GDVGKVS-SRLDAAIAQDGNMAGVGATMEGFGVNPDFYAFVFDKAWDCGTTD-RDWLCRM 464

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R  G +  A + AW VL+  +        +++  V+ A P          E +Y N  
Sbjct: 465 ADRHVGFASAAGRTAWQVLFDRI---MPSYVNESGTVVCARPSF--------EARYLNTT 513

Query: 307 KPVSKEAVLKS--ETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
            P     V K   +  S    HL                            YD++++ RQ
Sbjct: 514 YPAELLGVWKMLLDIDSDKREHL----------------------------YDVVNVGRQ 545

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            L  +       +  AY    +  V   +RR  ++++D+D LLAC + F L  W+E A+ 
Sbjct: 546 VLGDFFAFERDGLHRAYLSQRSDSVDYYARRMDKMLDDLDRLLACSEEFSLRKWIEDARG 605

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
                 ++  YE NART IT+W D+ Q     L DY N+ W+GL+  YY  R  I+  ++
Sbjct: 606 FGATAAEKDYYERNARTLITVWGDSRQ-----LTDYANRTWAGLVSSYYKQRWHIFTAHV 660

Query: 485 IESLE 489
             ++ 
Sbjct: 661 RRAVR 665


>gi|160884062|ref|ZP_02065065.1| hypothetical protein BACOVA_02038 [Bacteroides ovatus ATCC 8483]
 gi|423291477|ref|ZP_17270325.1| hypothetical protein HMPREF1069_05368 [Bacteroides ovatus
           CL02T12C04]
 gi|156110404|gb|EDO12149.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
           8483]
 gi|392663477|gb|EIY57027.1| hypothetical protein HMPREF1069_05368 [Bacteroides ovatus
           CL02T12C04]
          Length = 727

 Score =  221 bits (563), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 158/540 (29%), Positives = 255/540 (47%), Gaps = 60/540 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPA L+ ++P A I  LG W       R  C +L +  D LF +I + F+++
Sbjct: 232 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 288

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G T HIY  D F+E  PP   PEY+  + + +Y+ + + D  A W+   W+F +D
Sbjct: 289 QKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFD 347

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W   +MKALL  VP  K+++LD   E   +W  ++ F+  PYIWC L NF GN  + 
Sbjct: 348 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLT 407

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +         A  +    + G+G ++EG++     Y+ + E A+ +  VD   WI   
Sbjct: 408 GNVKESGERLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDDKWIECL 466

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R  G     ++DAW  L++ +Y              V  P         T G    Y 
Sbjct: 467 ADRHVGCVSQPVRDAWKRLFNDIY--------------VQVPR--------TLGTLPGY- 503

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
           +P       K  ++ Y +  L     EV R L       NE  +   + +R DLI + RQ
Sbjct: 504 RPALNRNSEKRTSNVYSNVEL----LEVWRKL-------NEAPSDRRDAFRLDLITVGRQ 552

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            L  Y  ++ +      +  D   +     +  E++ D+D L A H    L  W++ A++
Sbjct: 553 VLGNYFLDVKVEFDRMVEAKDHQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARK 612

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
           +  + + +  YE NAR  IT W          L DY ++ W+GL+ DYY  R  +Y    
Sbjct: 613 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYINTF 665

Query: 485 IESLESG---DGFRLKDWRRE----WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 536
           I+++  G   D  +L+D  +E    W+  T+     ++V+   S  D L++ S +L++KY
Sbjct: 666 IKAVGEGVEVDQKQLEDELKEIEEGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722


>gi|189465172|ref|ZP_03013957.1| hypothetical protein BACINT_01517 [Bacteroides intestinalis DSM
           17393]
 gi|189437446|gb|EDV06431.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides intestinalis DSM
           17393]
          Length = 723

 Score =  221 bits (562), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 153/550 (27%), Positives = 261/550 (47%), Gaps = 76/550 (13%)

Query: 8   VLPAFSGNVPA-ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
           V P +SG VP  A + +  +   + L N F+  +         L  TD  F EI   +  
Sbjct: 229 VFPGYSGMVPHDADEKLGLNLTKSDLWNGFTRPA--------FLQPTDARFAEIADLYYR 280

Query: 67  QQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 124
           +Q K +G+  + Y+ D F E  N   VD      + G AI + M+  +  A W++QGW  
Sbjct: 281 EQEKLFGKADY-YSMDPFHEAENAASVD----FDAAGKAIMTAMKKVNPKATWVVQGW-- 333

Query: 125 SYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW----------- 173
           + +P  RP  +K + N    G L++LDLF+E +P+W       G+P IW           
Sbjct: 334 TENP--RPEMIKNMQN----GDLLILDLFSECRPMW-------GIPSIWKRDKGYEQHDW 380

Query: 174 --CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEM 230
             CML NF GN+ ++G +D +       + +   T + G+G++MEG E N ++++LM E+
Sbjct: 381 LFCMLLNFGGNVGLHGRMDQLLNNFYLTKNNPLATHLKGIGLTMEGSENNAMMFELMCEL 440

Query: 231 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFP 288
            ++ EK   + W+  Y   RYG     I+ AW +L +T+YNC  G   +  +  +    P
Sbjct: 441 PWRPEKFTKEEWLKDYLFARYGVRDEKIEQAWTLLANTIYNCPFGNNQQGPHESIFCGRP 500

Query: 289 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 348
            ++ +  + +  K +NY  P   E                        A  L +   ++ 
Sbjct: 501 SLN-NFQASSWSKMKNYYDPTVTE-----------------------EAARLMLEVADKY 536

Query: 349 SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLA 408
             +N + YDL+D+ RQ+L+     ++   I  ++  D     + SR+FL+++   D LL 
Sbjct: 537 RGNNNFEYDLVDIVRQSLSDKGRIVYNRTIADFKSFDKRSFARDSRKFLDILLLQDKLLG 596

Query: 409 CHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGL 468
               F +G W+E A++L    E++  YEWNAR QIT W +    +   LRDY +K W+G+
Sbjct: 597 TRSEFRVGRWIEQARKLGTTPEEKDLYEWNARVQITTWGNRVCADDGGLRDYAHKEWNGI 656

Query: 469 LRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALIT 528
           LRD+Y  R A Y++ + + L+     +L     ++  +   W   +N Y   S G+ +  
Sbjct: 657 LRDFYYKRWAAYWQTLQDQLDGKPEVKL-----DYYAMEEPWTLAKNPYGSTSEGNCVDV 711

Query: 529 SQWLYNKYLQ 538
           ++  + K  +
Sbjct: 712 AKEAFEKVFE 721


>gi|423226735|ref|ZP_17213200.1| hypothetical protein HMPREF1062_05386 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392627008|gb|EIY21049.1| hypothetical protein HMPREF1062_05386 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 718

 Score =  221 bits (562), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 152/550 (27%), Positives = 261/550 (47%), Gaps = 76/550 (13%)

Query: 8   VLPAFSGNVPA-ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
           V P +SG VP  A + +  +   + L N F+  +         L  TD  F EI   + +
Sbjct: 226 VFPGYSGMVPHDADEKLGLNLTKSDLWNGFTRPA--------FLQPTDVRFAEIADLYYQ 277

Query: 67  QQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 124
           +Q K +G+  + Y+ D F E  N   VD      + G AI + M+  +  A W++QGW  
Sbjct: 278 EQEKLFGKVDY-YSMDPFHEAENAASVD----FDAAGKAIMAAMKKVNPKATWVVQGW-- 330

Query: 125 SYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW----------- 173
           + +P  RP  +K + N    G L++LDLF+E +P+W       G+P IW           
Sbjct: 331 TENP--RPEMIKNMQN----GDLLILDLFSECRPMW-------GIPSIWKRDKGYEQHNW 377

Query: 174 --CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEM 230
             CML NF GN+ ++G +D +       + +     + G+G++MEG E NP++++LM E+
Sbjct: 378 LFCMLLNFGGNVGLHGRMDQLLDNFYLTKNNPLAVHLKGIGLTMEGAENNPMMFELMCEL 437

Query: 231 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFP 288
            ++ EK   + W+  Y   RYG     I+ AW +L +T+YNC  G   +  +  +    P
Sbjct: 438 PWRPEKFTKEEWLKDYLFARYGVRDEKIEKAWTLLANTIYNCPFGNNQQGPHESIFCGRP 497

Query: 289 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 348
            ++ +  + +  K +NY  P   E                        A  L +   ++ 
Sbjct: 498 SLN-NFQASSWSKMKNYYDPTVTE-----------------------EAARLMVEVADKY 533

Query: 349 SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLA 408
             +N + YDL+D+ RQ+L+     ++   I  ++  D     + SR+FL+++   D LL 
Sbjct: 534 RGNNNFEYDLVDIVRQSLSDKGRIVYNRTIADFKSFDKRSFARDSRKFLDILLLQDKLLG 593

Query: 409 CHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGL 468
               F +G W+E A+ L    E++  YEWNAR QIT W +    +   LRDY +K W+G+
Sbjct: 594 TRSEFRVGRWIEQARNLGTTPEEKDLYEWNARVQITTWGNRVCADDGGLRDYAHKEWNGI 653

Query: 469 LRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALIT 528
           LRD+Y  R A Y++ + + L+     +L     ++  +   W   +N Y     G  +  
Sbjct: 654 LRDFYYKRWAAYWQTLQDQLDGKPEVKL-----DYYAMEEPWTLAKNPYSSVPEGSCVDV 708

Query: 529 SQWLYNKYLQ 538
           ++ ++ K ++
Sbjct: 709 AKEVFEKAMR 718


>gi|224537466|ref|ZP_03678005.1| hypothetical protein BACCELL_02345 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224520904|gb|EEF90009.1| hypothetical protein BACCELL_02345 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 721

 Score =  221 bits (562), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 152/550 (27%), Positives = 261/550 (47%), Gaps = 76/550 (13%)

Query: 8   VLPAFSGNVPA-ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
           V P +SG VP  A + +  +   + L N F+  +         L  TD  F EI   + +
Sbjct: 229 VFPGYSGMVPHDADEKLGLNLTKSDLWNGFTRPA--------FLQPTDVRFAEIADLYYQ 280

Query: 67  QQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 124
           +Q K +G+  + Y+ D F E  N   VD      + G AI + M+  +  A W++QGW  
Sbjct: 281 EQEKLFGKVDY-YSMDPFHEAENAASVD----FDAAGKAIMAAMKKVNPKATWVVQGW-- 333

Query: 125 SYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW----------- 173
           + +P  RP  +K + N    G L++LDLF+E +P+W       G+P IW           
Sbjct: 334 TENP--RPEMIKNMQN----GDLLILDLFSECRPMW-------GIPSIWKRDKGYEQHNW 380

Query: 174 --CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEM 230
             CML NF GN+ ++G +D +       + +     + G+G++MEG E NP++++LM E+
Sbjct: 381 LFCMLLNFGGNVGLHGRMDQLLDNFYLTKNNPLAVHLKGIGLTMEGAENNPMMFELMCEL 440

Query: 231 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFP 288
            ++ EK   + W+  Y   RYG     I+ AW +L +T+YNC  G   +  +  +    P
Sbjct: 441 PWRPEKFTKEEWLKDYLFARYGVRDEKIEKAWTLLANTIYNCPFGNNQQGPHESIFCGRP 500

Query: 289 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 348
            ++ +  + +  K +NY  P   E                        A  L +   ++ 
Sbjct: 501 SLN-NFQASSWSKMKNYYDPTVTE-----------------------EAARLMVEVADKY 536

Query: 349 SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLA 408
             +N + YDL+D+ RQ+L+     ++   I  ++  D     + SR+FL+++   D LL 
Sbjct: 537 RGNNNFEYDLVDIVRQSLSDKGRIVYNRTIADFKSFDKRSFARDSRKFLDILLLQDKLLG 596

Query: 409 CHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGL 468
               F +G W+E A+ L    E++  YEWNAR QIT W +    +   LRDY +K W+G+
Sbjct: 597 TRSEFRVGRWIEQARNLGTTPEEKDLYEWNARVQITTWGNRVCADDGGLRDYAHKEWNGI 656

Query: 469 LRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALIT 528
           LRD+Y  R A Y++ + + L+     +L     ++  +   W   +N Y     G  +  
Sbjct: 657 LRDFYYKRWAAYWQTLQDQLDGKPEVKL-----DYYAMEEPWTLAKNPYSSVPEGSCVDV 711

Query: 529 SQWLYNKYLQ 538
           ++ ++ K ++
Sbjct: 712 AKEVFEKAMR 721


>gi|393786624|ref|ZP_10374756.1| hypothetical protein HMPREF1068_01036 [Bacteroides nordii
           CL02T12C05]
 gi|392657859|gb|EIY51489.1| hypothetical protein HMPREF1068_01036 [Bacteroides nordii
           CL02T12C05]
          Length = 717

 Score =  220 bits (561), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 159/544 (29%), Positives = 257/544 (47%), Gaps = 72/544 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           VLP + G VP   +         +LG   +V SDP   C Y     L   D  F EI   
Sbjct: 224 VLPGYCGMVPHNAK--------AKLG--LNV-SDPGVWCGYRRPAFLQPDDSRFEEISSL 272

Query: 64  FIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL 123
           + ++  K YG+ ++ Y+ D F E    +D    + ++G A+   M+  +  AVW++Q W 
Sbjct: 273 YYKELEKLYGKANY-YSMDPFHEGGS-IDGVN-LDAVGKAVMKAMKKANPKAVWVIQAWQ 329

Query: 124 FSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST------SKQFYGVP-YIWCML 176
            +  P         L+ ++  G L++LDL +E +P W         K  YG   +++CML
Sbjct: 330 ANPRP--------ELIRNLETGDLLILDLTSECRPQWGDPESEWYRKDGYGKHNWVYCML 381

Query: 177 HNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 234
            N+  N+ ++G +D++   +   +       T+ GVGM+ EGIE NPV+Y+L+ E+ ++ 
Sbjct: 382 LNYGANVGLHGKMDNVIDNYYLAKENLRARATLKGVGMTPEGIENNPVMYELLMELPWRP 441

Query: 235 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDP 292
           E+   + W+  Y   RYG+  P +Q AW  L +++YN     T +  +  V  A P +D 
Sbjct: 442 ERFTKEDWLKGYVKARYGKDEPVLQLAWGKLANSIYNAPKELTQQGTHESVFCARPGLDV 501

Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
                    YQ                SS+     +Y   EVI A  L ++  +    + 
Sbjct: 502 ---------YQ---------------VSSWSEMKDYYDPQEVIEAARLMVSVADRYRGNT 537

Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
            + YDL+D+ RQA+A+    +   +  AY+  D       S++FL L+   D LL     
Sbjct: 538 NFEYDLVDVVRQAIAEKGRLMQKAVTTAYRAGDKELFAMASQKFLNLILLQDQLLGTRTE 597

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           F LG W+ SA+ L    E++  YEWN R Q+T W +    E   LRDY +K W+GLL+D+
Sbjct: 598 FRLGRWINSARALGVTPEEKALYEWNTRVQVTTWGNRNAAERGGLRDYAHKEWNGLLKDF 657

Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDW---RREWIKLTNDWQNGRNVYPVESNGDALITS 529
           Y  R  +YF  +   +E G+     D+      W+K TN +Q        E  GD + T+
Sbjct: 658 YYMRWKLYFDNLACKME-GETIPEIDFYAVEEAWVKRTNPYQ-------AEPEGDCVDTA 709

Query: 530 QWLY 533
           + ++
Sbjct: 710 KLIF 713


>gi|160887167|ref|ZP_02068170.1| hypothetical protein BACOVA_05183 [Bacteroides ovatus ATCC 8483]
 gi|423295093|ref|ZP_17273220.1| hypothetical protein HMPREF1070_01885 [Bacteroides ovatus
           CL03T12C18]
 gi|156107578|gb|EDO09323.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
           8483]
 gi|392673999|gb|EIY67450.1| hypothetical protein HMPREF1070_01885 [Bacteroides ovatus
           CL03T12C18]
          Length = 711

 Score =  220 bits (561), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 146/489 (29%), Positives = 241/489 (49%), Gaps = 59/489 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAF 64
           V P ++G VP  +          +LG  + +    +WC       L   D  F      +
Sbjct: 233 VFPGYAGMVPRNIGE--------KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMY 282

Query: 65  IEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            E+  K YG+ ++ Y+ D F E  NT  VD    ++  GA+I + M+  +  AVW++Q W
Sbjct: 283 YEELEKLYGKANY-YSMDPFHEGGNTEGVD----LAKTGASIMAAMKKANPKAVWIIQAW 337

Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCM 175
             S       P+ + ++ S+  G L+VLDL++E +P       +W   K F    +++CM
Sbjct: 338 QAS-------PR-EEMIASLNQGDLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCM 389

Query: 176 LHNFAGNIEMYGILDSIAFGPVEARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQH 234
           L NF GN+ ++G ++ +  G  +A    N  M+ GVG + EGIE NPV+++L+ E+ ++ 
Sbjct: 390 LLNFGGNVGLHGRMNQLVNGYYDACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWRE 449

Query: 235 EKVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPS 293
           E+     W+  Y   RYGR V P I +AW  L HTVYN      D   +  +       S
Sbjct: 450 ERFSSDEWLQTYLKARYGREVSPEIMEAWRALEHTVYNA---PKDYQGEGTIE------S 500

Query: 294 IISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 353
           ++    G + +              TS++ +  L+Y+     +A  LF +  ++   +N 
Sbjct: 501 LLCARPGFHLD-------------RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNN 547

Query: 354 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 413
           + YDL+D+ RQ+ A   N L   I ++Y   D     + +++FL+L+   D LL+    F
Sbjct: 548 FEYDLVDIVRQSNADKGNVLLEEISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEF 607

Query: 414 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 473
            +  WL +A+ L   EE+++ YEWNA   IT+W D+       L DY ++ WSGLL+D Y
Sbjct: 608 SVSSWLNAARSLGTTEEEKRLYEWNASALITVWGDSIAANQGGLHDYSHREWSGLLKDLY 667

Query: 474 GPRAAIYFK 482
             R   +F+
Sbjct: 668 YQRWKAFFE 676


>gi|422873453|ref|ZP_16919938.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens F262]
 gi|380305838|gb|EIA18115.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens F262]
          Length = 2104

 Score =  220 bits (560), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 157/528 (29%), Positives = 256/528 (48%), Gaps = 51/528 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VL  +SG VP   +   P A+    G W      P    TY+ +     F ++   F E+
Sbjct: 415 VLQGYSGMVPRDFKEKNPEAQTISQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEK 473

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q + +G  ++ Y  D F E  NT  +D+      +   I + M   D+DAVW++Q W  +
Sbjct: 474 QKEVFGDVTNFYGVDPFHEGGNTGNLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 529

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
                  P    L       + +VLDLF+EV P W+  ++   +P+IW MLHNF G + M
Sbjct: 530 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 581

Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
               + +A   P     SE+  MVG+G++ E I  NP+ Y+L+ +MA+  ++++ + W  
Sbjct: 582 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTE 639

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
            Y  RRYG++   I DAWN++  T Y        K  D    +     SII+   G    
Sbjct: 640 DYIERRYGKTNKEILDAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 685

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
           +G        +KS  S++ H  + Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q
Sbjct: 686 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQ 736

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            LA  A E +  +  AY   DA     +S +FLEL++  + +L+    FL+G W+E A+ 
Sbjct: 737 LLANSAQEYYEVMCNAYNNRDAEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 796

Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           + ++ +   +  +E+NAR  +T W      +   L+DY N+ WSGL  DYY  R   +  
Sbjct: 797 MLKDSDDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWIN 856

Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
            +   L+ G     K    +W K+  DW N ++    +YP E++ + L
Sbjct: 857 GLQTELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 900


>gi|345519733|ref|ZP_08799147.1| glycoside hydrolase family 89 [Bacteroides sp. 4_3_47FAA]
 gi|345457107|gb|EET15964.2| glycoside hydrolase family 89 [Bacteroides sp. 4_3_47FAA]
          Length = 717

 Score =  220 bits (560), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 156/495 (31%), Positives = 243/495 (49%), Gaps = 67/495 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           VLP + G VP   +         +LG   +V +DP + C+Y     L   D  F EI   
Sbjct: 224 VLPGYCGMVPHNAKE--------KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISAL 272

Query: 64  FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           +  +  K YG+T   Y  D F E  +T  V+    + + G AI   M+  + DAVW+ Q 
Sbjct: 273 YYRELTKLYGKTG-FYAIDPFHEGGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQA 327

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWC 174
           W    +P  R P    ++  +  G L+VLDL +E +P W         K  YG   +++C
Sbjct: 328 W--QDNP--RTP----MIEHLEAGDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYC 379

Query: 175 MLHNFAGNIEMYGILDSI--AFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
           ML NF GNI ++G +D++   F   +A      T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 380 MLLNFGGNIGLHGKMDALIDGFYDAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPW 439

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
           +  +     W+  Y   RYG    A+Q  W++L + +YN       +  +  V  A P +
Sbjct: 440 REHRFTRDEWLKGYVYARYGVEDEALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGL 499

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
           D          YQ                SS+     +Y+  +VI A  L ++  ++   
Sbjct: 500 DV---------YQ---------------VSSWSEMKEYYNPQDVIEAARLMVSVADKYQG 535

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLAC 409
           +N + +DL+D+ RQALA+    +   +  A++  D   VF+L S+ FL L+   D LL  
Sbjct: 536 NNNFEFDLVDVLRQALAEKGRLMQKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGT 594

Query: 410 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 469
              F +G W+E+A+   Q +E++  YEWNAR QIT W +    +   LRDY +K W+G+L
Sbjct: 595 RKEFKVGTWIEAARSAGQTQEEKALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGIL 654

Query: 470 RDYYGPRAAIYFKYM 484
           +D+Y  R   YF Y+
Sbjct: 655 KDFYFMRWKAYFDYL 669


>gi|319643377|ref|ZP_07998003.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_40A]
 gi|317385006|gb|EFV65959.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_40A]
          Length = 718

 Score =  220 bits (560), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 156/495 (31%), Positives = 243/495 (49%), Gaps = 67/495 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           VLP + G VP   +         +LG   +V +DP + C+Y     L   D  F EI   
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISAL 273

Query: 64  FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           +  +  K YG+T   Y  D F E  +T  V+    + + G AI   M+  + DAVW+ Q 
Sbjct: 274 YYRELTKLYGKTG-FYAIDPFHEGGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQA 328

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWC 174
           W    +P  R P    ++  +  G L+VLDL +E +P W         K  YG   +++C
Sbjct: 329 W--QDNP--RTP----MIEHLEAGDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYC 380

Query: 175 MLHNFAGNIEMYGILDSI--AFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
           ML NF GNI ++G +D++   F   +A      T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNFGGNIGLHGKMDALIDGFYDAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPW 440

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
           +  +     W+  Y   RYG    A+Q  W++L + +YN       +  +  V  A P +
Sbjct: 441 REHRFTRDEWLKGYVYARYGVEDEALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGL 500

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
           D          YQ                SS+     +Y+  +VI A  L ++  ++   
Sbjct: 501 DV---------YQ---------------VSSWSEMKEYYNPQDVIEAARLMVSVADKYQG 536

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLAC 409
           +N + +DL+D+ RQALA+    +   +  A++  D   VF+L S+ FL L+   D LL  
Sbjct: 537 NNNFEFDLVDVLRQALAEKGRLMQKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGT 595

Query: 410 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 469
              F +G W+E+A+   Q +E++  YEWNAR QIT W +    +   LRDY +K W+G+L
Sbjct: 596 RKEFKVGTWIEAARSAGQTQEEKALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGIL 655

Query: 470 RDYYGPRAAIYFKYM 484
           +D+Y  R   YF Y+
Sbjct: 656 KDFYFMRWKAYFDYL 670


>gi|150004413|ref|YP_001299157.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
 gi|149932837|gb|ABR39535.1| glycoside hydrolase family 89 [Bacteroides vulgatus ATCC 8482]
          Length = 717

 Score =  220 bits (560), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 156/495 (31%), Positives = 243/495 (49%), Gaps = 67/495 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           VLP + G VP   +         +LG   +V +DP + C+Y     L   D  F EI   
Sbjct: 224 VLPGYCGMVPHNAKE--------KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISAL 272

Query: 64  FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           +  +  K YG+T   Y  D F E  +T  V+    + + G AI   M+  + DAVW+ Q 
Sbjct: 273 YYRELTKLYGKTG-FYAIDPFHEGGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQA 327

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWC 174
           W    +P  R P    ++  +  G L+VLDL +E +P W         K  YG   +++C
Sbjct: 328 W--QDNP--RTP----MIEHLEAGDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYC 379

Query: 175 MLHNFAGNIEMYGILDSI--AFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
           ML NF GNI ++G +D++   F   +A      T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 380 MLLNFGGNIGLHGKMDALIDGFYDAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPW 439

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
           +  +     W+  Y   RYG    A+Q  W++L + +YN       +  +  V  A P +
Sbjct: 440 REHRFTRDEWLKGYVYARYGVEDEALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGL 499

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
           D          YQ                SS+     +Y+  +VI A  L ++  ++   
Sbjct: 500 DV---------YQ---------------VSSWSEMKEYYNPQDVIEAARLMVSVADKYQG 535

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLAC 409
           +N + +DL+D+ RQALA+    +   +  A++  D   VF+L S+ FL L+   D LL  
Sbjct: 536 NNNFEFDLVDVLRQALAEKGRLMQKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGT 594

Query: 410 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 469
              F +G W+E+A+   Q +E++  YEWNAR QIT W +    +   LRDY +K W+G+L
Sbjct: 595 RKEFKVGTWIEAARSAGQTQEEKALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGIL 654

Query: 470 RDYYGPRAAIYFKYM 484
           +D+Y  R   YF Y+
Sbjct: 655 KDFYFMRWKAYFDYL 669


>gi|299148671|ref|ZP_07041733.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
           3_1_23]
 gi|383114572|ref|ZP_09935334.1| hypothetical protein BSGG_1257 [Bacteroides sp. D2]
 gi|298513432|gb|EFI37319.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
           3_1_23]
 gi|313693722|gb|EFS30557.1| hypothetical protein BSGG_1257 [Bacteroides sp. D2]
          Length = 711

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 144/489 (29%), Positives = 239/489 (48%), Gaps = 59/489 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAF 64
           V P ++G VP  +          +LG  + +    +WC       L   D  F      +
Sbjct: 233 VFPGYAGMVPRNIGE--------KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMY 282

Query: 65  IEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            E+  K YG+ ++ Y+ D F E  NT  VD    ++  GA+I + M+  +  AVW++Q W
Sbjct: 283 YEELEKLYGKANY-YSMDPFHEGGNTEGVD----LAKTGASIMAAMKKANPKAVWIIQAW 337

Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCM 175
                   +    + ++ S+  G L+VLDL++E +P       +W   K F    +++CM
Sbjct: 338 --------QANPREEMIASLNQGDLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCM 389

Query: 176 LHNFAGNIEMYGILDSIAFGPVEARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQH 234
           L NF GN+ ++G ++ +  G  +A    N  M+ GVG + EGIE NPV+++L+ E+ ++ 
Sbjct: 390 LLNFGGNVGLHGRMNQLVNGYYDACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWRE 449

Query: 235 EKVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPS 293
           E+     W+  Y   RYGR V P I +AW  L HTVYN      D   +  +       S
Sbjct: 450 ERFSSDEWLQTYLKARYGREVSPEIMEAWRALEHTVYNA---PKDYQGEGTIE------S 500

Query: 294 IISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 353
           ++    G + +              TS++ +  L+Y+     +A  LF +  ++   +N 
Sbjct: 501 LLCARPGFHLD-------------RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNN 547

Query: 354 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 413
           + YDL+D+ RQ+ A   N L   I ++Y   D     + +++FL+L+   D LL+    F
Sbjct: 548 FEYDLVDIVRQSNADKGNVLLEEISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEF 607

Query: 414 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 473
            +  WL +A+ L   EE+++ YEWNA   IT+W D+       L DY ++ WSGLL+D Y
Sbjct: 608 SVSSWLNAARSLGTTEEEKRLYEWNASALITVWGDSIAANQGGLHDYSHREWSGLLKDLY 667

Query: 474 GPRAAIYFK 482
             R   +F+
Sbjct: 668 YQRWKAFFE 676


>gi|294775488|ref|ZP_06741000.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
 gi|294450633|gb|EFG19121.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
          Length = 712

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 156/495 (31%), Positives = 243/495 (49%), Gaps = 67/495 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           VLP + G VP   +         +LG   +V +DP + C+Y     L   D  F EI   
Sbjct: 219 VLPGYCGMVPHNAKE--------KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISAL 267

Query: 64  FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
           +  +  K YG+T   Y  D F E  +T  V+    + + G AI   M+  + DAVW+ Q 
Sbjct: 268 YYRELTKLYGKTG-FYAIDPFHEGGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQA 322

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWC 174
           W    +P  R P    ++  +  G L+VLDL +E +P W         K  YG   +++C
Sbjct: 323 W--QDNP--RTP----MIEHLEAGDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYC 374

Query: 175 MLHNFAGNIEMYGILDSI--AFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
           ML NF GNI ++G +D++   F   +A      T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 375 MLLNFGGNIGLHGKMDALIDGFYDAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPW 434

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
           +  +     W+  Y   RYG    A+Q  W++L + +YN       +  +  V  A P +
Sbjct: 435 REHRFTRDEWLKGYVYARYGVEDEALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGL 494

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
           D          YQ                SS+     +Y+  +VI A  L ++  ++   
Sbjct: 495 DV---------YQ---------------VSSWSEMKEYYNPQDVIEAARLMVSVADKYQG 530

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLAC 409
           +N + +DL+D+ RQALA+    +   +  A++  D   VF+L S+ FL L+   D LL  
Sbjct: 531 NNNFEFDLVDVLRQALAEKGRLMQKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGT 589

Query: 410 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 469
              F +G W+E+A+   Q +E++  YEWNAR QIT W +    +   LRDY +K W+G+L
Sbjct: 590 RKEFKVGTWIEAARSAGQTQEEKALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGIL 649

Query: 470 RDYYGPRAAIYFKYM 484
           +D+Y  R   YF Y+
Sbjct: 650 KDFYFMRWKAYFDYL 664


>gi|295085509|emb|CBK67032.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
           XB1A]
          Length = 716

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 155/537 (28%), Positives = 249/537 (46%), Gaps = 54/537 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPA L+ ++P A I  LG W       R  C +L +  D LF +I + F+++
Sbjct: 221 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 277

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G T HIY  D F+E  PP   PEY+  + + +Y+ + + D  A W+   W+F +D
Sbjct: 278 QKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFD 336

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W   +MKALL  VP  K+++LD   E   +W  ++ F+  PYIWC L NF GN  + 
Sbjct: 337 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLT 396

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +         A  +    + G+G ++EG++     Y+ + E A+ +  VD   WI   
Sbjct: 397 GNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECL 455

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R  G    +++DAW  L++ +Y              V  P         T G    Y 
Sbjct: 456 ADRHVGCVSQSVRDAWKRLFNDIY--------------VQVPR--------TLGTLPGY- 492

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
           +P   +   K  ++ Y +  L     EV R L       NE  +   + +R DLI + RQ
Sbjct: 493 RPALNKNSEKRTSNVYSNVEL----LEVWRKL-------NEAPSDRRDAFRLDLITVGRQ 541

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            L  Y  ++ +      +  D   +     +  E++ D+D L A H    L  W++ A++
Sbjct: 542 VLGNYFLDVKMEFDRMVEAKDHQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARK 601

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
           +  + + +  YE NAR  IT W          L DY ++ W+GL+ DYY  R  +Y    
Sbjct: 602 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYIDTF 654

Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE----SNGDALIT-SQWLYNKY 536
           I+++        K    E  ++   W N  +   V     S  D L++ S +L++KY
Sbjct: 655 IKAVGEDVEVDQKQLEDELKEIEEGWVNATDRKDVRKDVHSTTDGLLSFSTFLFSKY 711


>gi|329962235|ref|ZP_08300241.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
 gi|328530343|gb|EGF57220.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
          Length = 726

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 146/480 (30%), Positives = 239/480 (49%), Gaps = 60/480 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
           VLP +SG +P   +       ++  G W          C Y     L  TD  F EI   
Sbjct: 225 VLPGYSGMLPHNAKEKL-GVNVSDPGTW----------CGYNRPAFLQPTDTRFGEIAAL 273

Query: 64  FIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL 123
           + E+  + YG+ +  Y+ D F E          + + G AI+  M+    ++VW++Q W 
Sbjct: 274 YYEEMNRLYGK-ADFYSMDPFHEGGKVAGVN--LDAAGQAIWQAMKKNSRNSVWVVQAWG 330

Query: 124 FSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQ-------FYGVPYIWCML 176
            +       P+ + ++ +VP G ++VLDL++E +P W   +        F G  +++CML
Sbjct: 331 AN-------PRAQ-MIKNVPRGDMLVLDLYSESRPQWGEPESSWYRENGFDGHQWLYCML 382

Query: 177 HNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHE 235
            N+ GN+ ++G +  +     +A R+S   T+ GVGM+MEG E NPV+Y+L+ E+ ++  
Sbjct: 383 LNYGGNVGLHGKMQHVIDAYYKASRSSFGNTLKGVGMTMEGSENNPVMYELLCELPWRPS 442

Query: 236 KVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPS 293
                 W+  Y   RYG+  P +++AW +L +++YNC   +T +  +  +  A P +   
Sbjct: 443 TFSKDEWLEGYIAARYGKCTPRLREAWVLLGNSIYNCPPRSTQQGTHESIFCARPSLK-- 500

Query: 294 IISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 353
                           + +A   SE S Y  P       +VIRA  LF+    +   ++ 
Sbjct: 501 ----------------AYQASSWSEMSDYYRPQ------DVIRAAGLFLEEAGQFKGNDN 538

Query: 354 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 413
           + YDL+D+TRQA+A+    ++  I  +Y+  D   + Q S RFLEL+   D LLA    F
Sbjct: 539 FEYDLVDITRQAVAEKGRLIYKVIQASYEAGDKPLLRQASDRFLELLLLQDRLLATRPEF 598

Query: 414 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 473
            +G W+E A+ L     ++   EWNAR QIT W + T  +   LRDY +K W+GLL+D+Y
Sbjct: 599 KVGRWIEQARNLGHTPAEKDWLEWNARVQITTWGNRTASDRGGLRDYAHKEWNGLLKDFY 658


>gi|374384144|ref|ZP_09641670.1| hypothetical protein HMPREF9449_00056 [Odoribacter laneus YIT
           12061]
 gi|373228751|gb|EHP51054.1| hypothetical protein HMPREF9449_00056 [Odoribacter laneus YIT
           12061]
          Length = 835

 Score =  219 bits (558), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 150/518 (28%), Positives = 250/518 (48%), Gaps = 70/518 (13%)

Query: 2   LLADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIG 61
           LL    + PAF+G VP AL+ ++P  KI +   W    +       ++L   + LF  IG
Sbjct: 223 LLGMKPICPAFAGFVPLALKRLYPDVKIIET-TWAGFHN-------WMLSPEEELFTRIG 274

Query: 62  RAFIEQQLKEYGRTSHIYNCDTFDENT---PPVDSPEYISSL---GAAIYSGMQSGDSDA 115
           + FIE+  KE+G+ +  Y  D+F+E     PP+ + E    L   G  +Y G+++G+ DA
Sbjct: 275 QLFIEEWEKEFGK-NDFYLADSFNEMDVPFPPIGTKERYDMLAFYGEQVYKGIKAGNPDA 333

Query: 116 VWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTS------KQFY 167
           VW+MQGW+F Y    W    ++AL++ VP  K+++LDL A+  K +W         K F+
Sbjct: 334 VWVMQGWMFGYQRDIWDYETLQALVSKVPDDKMMLLDLAADYNKNVWGNGMNWEFYKGFF 393

Query: 168 GVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDL 226
              +++ ++ N  G     GIL   A G +EA  S N   + G GM+ EG E N VVY++
Sbjct: 394 NKLWVYSVIPNMGGKTGATGILSFYANGHLEALNSPNRGRLFGFGMAPEGTENNEVVYEM 453

Query: 227 MSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVA 286
           + +  +   ++DVK W+  YS+ RYG++ P + + W  L  +VY                
Sbjct: 454 ICDAGWSSSEIDVKQWLKDYSLCRYGKTCPEMDEVWEGLCKSVYG--------------T 499

Query: 287 FPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGN 346
           F D    +  +  G+        S +  + ++++ Y             RA+E       
Sbjct: 500 FTDHPRFLWQLRPGR--------SGKGTVNTDSNFY-------------RAVEKMAECAP 538

Query: 347 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGL 406
           +++ S  ++ D +++T   L      L   I ++Y   +     ++ ++F EL E +D L
Sbjct: 539 KMTESPLFKADFLEMTAFYLGGKMEALASAIGKSYLYGNTADALKMQQQFEELGEGLDSL 598

Query: 407 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 466
           L  H  + L  W++ A++    E+ +  YE NAR  +T+W          + DY  K WS
Sbjct: 599 LESHPVYRLQRWIDFARKHGDTEKLKDYYEMNARRIVTIW-------GPPVSDYACKLWS 651

Query: 467 GLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 504
           GL+RDYY PR   YF+      E+G  + L  W  +W+
Sbjct: 652 GLIRDYYLPRWREYFR----CKETGSKYDLASWESDWV 685


>gi|453081268|gb|EMF09317.1| glycoside hydrolase family 89 protein [Mycosphaerella populorum
           SO2202]
          Length = 784

 Score =  219 bits (557), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 147/512 (28%), Positives = 253/512 (49%), Gaps = 51/512 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP F+G VP  +  ++P+A       W   ++  ++     L+  DPLF  + ++FI +
Sbjct: 248 VLPCFTGFVPTQISRLYPNASFVNGSQWNGFQA--QYTNVTFLEPFDPLFTTLQKSFISK 305

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
               YG  S +Y  D ++EN P   +  Y+  + +     +++ D +A+W +QGWLF S 
Sbjct: 306 LDAAYGNVSSVYTLDQYNENDPFSGNVTYLEDVASNTIKSLKAADPEAIWFIQGWLFYSA 365

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             FW   ++KA L  V    +++LDLF+E +P W  +  ++G P+IWC LH++ GN  ++
Sbjct: 366 ADFWDEERIKAYLGGVEDKDMLILDLFSESQPQWQRTNSYFGKPWIWCQLHDYGGNQGLH 425

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G ++++   P+ A  +E +TMVG+G++MEG E N ++YD++ + A+  E ++   + + +
Sbjct: 426 GQVENVTMNPILALANETSTMVGIGLTMEGQEGNEIIYDILLDQAWTPEPIESAGYFDDW 485

Query: 247 SVRRY--GRSVPAI-QD---AWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
              RY    +V  + QD   AW+++  T+YN TD   D    V  +  ++ P+   + + 
Sbjct: 486 VTSRYHCDDAVAGLPQDLYIAWDMMRQTIYNNTD--IDTAEAVTKSIFELQPNTTGLLDR 543

Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
              +  + +    +L S        H + ++ E  +  EL            +YR+DL+D
Sbjct: 544 TGHHSTRILYDPEILVSAWK-----HFYSASQETPQLWEL-----------ESYRFDLVD 587

Query: 361 LTRQALAKYANEL---FLNIIEAYQLNDAHGVF--QLSRRFLELVEDMDGLLACHDG--F 413
           +TRQ LA     L   F+N+     L  +      Q   R L L+ D+D +L       F
Sbjct: 588 ITRQVLANAFYPLYGEFVNMTANSSLPSSSTASAEQTGARMLSLLLDLDSVLEASGNAHF 647

Query: 414 LLGPWLESAKQLAQNEEQEKQ------------YEWNARTQITMWFDNTQEEASLLRDYG 461
            L  W+ SA+  A  E                 YE+NAR QIT+W    +     + DY 
Sbjct: 648 SLESWIHSARLWAPTETNAADGDNMTAAAIADFYEYNARNQITLWGPGGE-----ISDYA 702

Query: 462 NKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 493
           +K W+GL++ YY PR   +  + + S  S DG
Sbjct: 703 SKQWAGLIKTYYVPRWERFVHFTLNSSTSADG 734


>gi|340347658|ref|ZP_08670763.1| alpha-N-acetylglucosaminidase [Prevotella dentalis DSM 3688]
 gi|433652542|ref|YP_007296396.1| Alpha-N-acetylglucosaminidase (NAGLU) [Prevotella dentalis DSM
           3688]
 gi|339608852|gb|EGQ13735.1| alpha-N-acetylglucosaminidase [Prevotella dentalis DSM 3688]
 gi|433303075|gb|AGB28890.1| Alpha-N-acetylglucosaminidase (NAGLU) [Prevotella dentalis DSM
           3688]
          Length = 781

 Score =  218 bits (556), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 153/503 (30%), Positives = 232/503 (46%), Gaps = 64/503 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP + G +P   +       +T  G W        +     L ATDP F EI   +  +
Sbjct: 244 VLPGYCGMMPHDAKQKL-GLDVTPGGTWNG------YVRPANLSATDPRFDEIADLYYRE 296

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYI--SSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q + YG+ SH Y+ D F E +  V    YI  +  G  + + M+  +  A W++QGW  +
Sbjct: 297 QTRLYGK-SHYYSMDPFHETSDDV----YIDYAQAGRKLMAAMKRENPKANWVIQGWTEN 351

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNF 179
             P        A+ + +P G L VLDLF+E +P      IW  ++ +    +++CML NF
Sbjct: 352 PRP--------AMTDGLPAGSLTVLDLFSECRPMFGAPSIWKRAEGYGQHDWLFCMLENF 403

Query: 180 AGNIEMYGILDSI------AFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQ 233
            GN+ ++G +D +      A  P          + G+G +MEG E NP++++LMSE+ ++
Sbjct: 404 GGNVGLHGRMDQLIGNFRLATSPQSPLQQARRHLRGIGFTMEGSENNPIMFELMSELPWR 463

Query: 234 HEKVDVKA--------WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIV 285
            ++V   A        W+  Y   RYG   P  Q AW +L  T+YNC  G   +      
Sbjct: 464 TDEVAQAADARTFRTEWVRGYVKARYGTDDPHAQQAWQLLAETIYNCPAGNNQQG----- 518

Query: 286 AFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASG 345
                  SI           G+P      +KS +   ++    Y  S  + A  L  A+ 
Sbjct: 519 ----PHESIFD---------GRPSLNNFQVKSWSKMRNY----YEPSATLEAARLMAAAA 561

Query: 346 NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDG 405
           + L  +N Y YDL+D+ RQA+   A +++L+ I  Y   D     + S RFL L+   D 
Sbjct: 562 DRLKGNNNYEYDLVDIVRQAIDDQARQVYLHAIADYNGFDRRAFSRDSARFLGLLLMQDR 621

Query: 406 LLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYW 465
           LL     F LG W E+A+ L     ++  YEWNAR QIT W +    +   LRDY +K W
Sbjct: 622 LLGTRREFRLGRWTEAARSLGTTPAEKDLYEWNARVQITTWGNRACADQGGLRDYAHKEW 681

Query: 466 SGLLRDYYGPRAAIYFKYMIESL 488
            GLL D+Y  R   Y   +   +
Sbjct: 682 QGLLADFYYMRWHTYLDALSRQM 704


>gi|380694112|ref|ZP_09858971.1| alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
          Length = 736

 Score =  218 bits (555), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 146/503 (29%), Positives = 240/503 (47%), Gaps = 73/503 (14%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAF 64
           V P ++G VP  +          +LG  + +    +WC       L   D  F      +
Sbjct: 236 VFPGYAGMVPRNIGE--------KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMY 285

Query: 65  IEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            E+  K YG+  + Y+ D F E  NT  VD    ++  G +I   M+  + +AVW+MQ W
Sbjct: 286 YEELEKLYGKAKY-YSMDPFHEGGNTEGVD----LAKAGTSIMGAMKKANPEAVWVMQAW 340

Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCM 175
                   +    +A++N++  G L+VLDL++E  P       +W   K F    +++CM
Sbjct: 341 --------QANPREAMVNTLDSGDLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCM 392

Query: 176 LHNFAGNIEMYGILDSIAFGPVEARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 234
           L NF GN+ ++G ++ +  G   A    N  T+ GVG + EGIE NP++++L+ E+ ++ 
Sbjct: 393 LLNFGGNVGLHGRMEQLVNGYYNACAHINGKTLRGVGATPEGIENNPMMFELLYELPWRE 452

Query: 235 EKVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPS 293
           E+     W+  Y   RYG  + P + +AW  L HTVYN                      
Sbjct: 453 ERFSPDIWLQGYLKARYGDDLSPEVTEAWRALEHTVYNAP-------------------- 492

Query: 294 IISVTEGKYQNYGKPVSKEAVLKS-------ETSSYDHPHLWYSTSEVIRALELFIASGN 346
                    +NY    + E++L +        TS++ +  L+YS     +A +L ++  +
Sbjct: 493 ---------KNYQGEGTVESLLCARPGFHLDRTSTWGYAKLFYSPDSTAKAAQLLLSVAD 543

Query: 347 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGL 406
               +N + YDL+D+ RQ+LA  AN L   I ++Y   D     + +++FL L+   D L
Sbjct: 544 RYKGNNNFEYDLVDIVRQSLADKANVLLEEISQSYDRKDKDSFRKQTQQFLGLILSQDSL 603

Query: 407 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 466
           L+    F +  WL +A+ L   EE++K YEWNA   IT+W D+       L DY ++ WS
Sbjct: 604 LSTRKEFSVSSWLSAARSLGTTEEEKKLYEWNASALITVWGDSIAANQGGLHDYSHREWS 663

Query: 467 GLLRDYYGPRAAIYFKYMIESLE 489
           GLL+D Y  R   +F+   + L+
Sbjct: 664 GLLKDLYYQRWNTFFEQKQQELD 686


>gi|410097657|ref|ZP_11292638.1| hypothetical protein HMPREF1076_01816 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409223747|gb|EKN16682.1| hypothetical protein HMPREF1076_01816 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 740

 Score =  218 bits (555), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 153/544 (28%), Positives = 250/544 (45%), Gaps = 66/544 (12%)

Query: 11  AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK 70
            FSG VP  +   FP AKI +  +W+  +      C   LD  DPLF E+G+ F+E++ K
Sbjct: 235 GFSGAVPRKMMEKFPEAKIQKQPDWYGFEG----ICQ--LDPLDPLFTELGKTFLEEEQK 288

Query: 71  EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW 130
            YG T  +Y  D F E+ PPVD+PEY++++G++I+  M++ D DA+W+MQ W F  D   
Sbjct: 289 LYG-TYGLYAADPFHESKPPVDTPEYLNAVGSSIHKLMKTFDPDALWVMQAWSFRKD--- 344

Query: 131 RPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 190
                  + + VP   L+VL L   +         F    ++   LHNF G + ++G L 
Sbjct: 345 -------IASVVPKHDLLVLSLNGAL----GGEDHFCNHDFVVGNLHNFGGRVNLHGDLP 393

Query: 191 SIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRR 250
            ++         +   +VG G+ ME I QNPV Y+L  EM    + V ++ W+N+Y+ RR
Sbjct: 394 LVSSNQFMKAKQKTPNVVGSGLFMESIGQNPVFYELAFEMPVHQDSVKLEEWLNKYAERR 453

Query: 251 YGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS 310
           YG    A   AW +L    Y       + +  +I A P VD        G    +  P  
Sbjct: 454 YGAFSDAANKAWELLLAGPYRAGTNGVESS-SIICARPAVDVK----KSGPNAGFNIPYD 508

Query: 311 KEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYA 370
            ++++++E                       +    +L  S  YR+D++D+ RQ ++   
Sbjct: 509 PQSLIEAEVC--------------------LLQDAEQLKGSGPYRFDIVDVQRQIMSNLG 548

Query: 371 NELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 430
            E+     EA++  D       S RFLEL++D+D LL     F    WL  A+     +E
Sbjct: 549 QEIHKKAAEAFKKKDKEAFALHSGRFLELLKDVDILLRTRTEFNFDQWLTDARAWGTTDE 608

Query: 431 QEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 490
           +   +E NA + +T+W            DY  + W+GL+  YY  R   ++  +   L++
Sbjct: 609 ERNLFEKNASSLVTIWGGQVDVRQF---DYSWREWTGLIEGYYLQRWKQFYDMLQGHLDN 665

Query: 491 GDGFRLKDWR----------REWIKLTNDWQ------NGRNVYPVESNGDALITSQWLYN 534
           G  +R +D +           E+     DW+       G+   PV + GD +  ++ + +
Sbjct: 666 GTIYREEDAKMDLGRQAFRANEFYDSLADWELAFVDRPGKARTPV-TEGDEVAVARRMLD 724

Query: 535 KYLQ 538
           KY Q
Sbjct: 725 KYKQ 728


>gi|323344412|ref|ZP_08084637.1| alpha-N-acetylglucosaminidase [Prevotella oralis ATCC 33269]
 gi|323094539|gb|EFZ37115.1| alpha-N-acetylglucosaminidase [Prevotella oralis ATCC 33269]
          Length = 730

 Score =  218 bits (554), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 150/531 (28%), Positives = 239/531 (45%), Gaps = 46/531 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPA L+ +FP A I  LG W   +   ++ C + L   +PLF +I + ++E+
Sbjct: 235 VLPAFAGHVPAELKRIFPDANIKSLGKWGGFEE--KYLC-HFLSPEEPLFSKIQKLYLEE 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q   +G T HIY  D F+E  PP   P Y+  +   +Y  + + D  A W+  GW+FSYD
Sbjct: 292 QTALFG-TDHIYGVDPFNEVEPPSWEPAYLRKVSKNMYGTLTAVDPKAEWMQMGWMFSYD 350

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W P +++A L  VP GK+ +LD + E   +W T+  FYG PYIWC L NF GN  + 
Sbjct: 351 NKHWTPDRVQAFLTGVPKGKMSLLDYYCENVELWKTTDGFYGQPYIWCYLGNFGGNTTLM 410

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G +         A  +    M+G G ++EG++     Y+ +    + H   D + WI+  
Sbjct: 411 GNVKESGRRLDNALANGQRNMLGAGSTLEGLDVIQFPYEYLYNKLWSHAVADSR-WIDDL 469

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           + R YG   P+++ AW++L++ +Y                       + +  +G   N+ 
Sbjct: 470 ADRHYGGVSPSVRKAWHILFNDIY---------------------VQVSASMQGVLTNF- 507

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIR-ALELFIASGNELSASNTYRYDLIDLTRQA 365
           +P             Y    L     EV R  L++     NEL      + D+I + RQ 
Sbjct: 508 RPALNNNYPHRTAIEYPAERL----EEVWRLLLDVPRCDRNEL------QLDIIAVGRQV 557

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
           L      +      AY   D   +   +    EL+ D+D L + +    +  W++ A++L
Sbjct: 558 LGNRFAVVKTQFDSAYANKDIPRLKAKACEMEELLGDLDRLTSFNSRCSINRWIDDARKL 617

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
              +E +  YE NAR  IT W  N       + DY ++ W GL+  YY  R  +Y   ++
Sbjct: 618 GSTKELKDYYEKNARNLITTWGGN-------INDYASRTWGGLIGSYYAHRWRLYIDDIL 670

Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
            + E+   F    +  +  K    W        V    D L   + L  KY
Sbjct: 671 AAAEANKEFDQNAFNEKVSKFEQAWIISTEPITVPKRTDLLTFCRILIQKY 721


>gi|156046298|ref|XP_001589681.1| hypothetical protein SS1G_09403 [Sclerotinia sclerotiorum 1980]
 gi|154693798|gb|EDN93536.1| hypothetical protein SS1G_09403 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 795

 Score =  218 bits (554), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 156/520 (30%), Positives = 260/520 (50%), Gaps = 72/520 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP+AL+ + P+A I   G+W ++        T+L   TDPLF  +   F+  
Sbjct: 245 VLPAFTGFVPSALRRIAPNANIINGGDWGNIFPVEYSNDTFLY-PTDPLFTTLQHKFLSF 303

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
           Q + YG  +HIY  D ++EN P      Y+ ++    Y  +QS D  AVW++QGWLF S 
Sbjct: 304 QSEYYGNVTHIYTLDQYNENNPASGDLSYLRNVSRGTYESLQSFDPCAVWMLQGWLFYSL 363

Query: 127 DPFWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             FW   +++A +  VP  + +++LDLF+E  P W  +  +YG P+IWC L ++ G + +
Sbjct: 364 SSFWTQDRIEAYIGGVPKNESMLILDLFSESFPQWERTHYYYGKPWIWCQLRDYGGTLGL 423

Query: 186 YGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDV----K 240
           YG + +I    +EA R SEN  MVGVG +MEG   N ++Y+L+ + A+  + +D     K
Sbjct: 424 YGQIYNITNSLIEAFRESEN--MVGVGNTMEGQGGNGLMYELLLDQAWNIDPIDTEDYFK 481

Query: 241 AWINQ-YSVRRYGRSVPA-IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVT 298
           +W+ + Y ++   + +P  I +AW++L  T YN T+     +  V  +  ++ P+I    
Sbjct: 482 SWVRKRYHIKGAKKRLPGEIYEAWDILRRTAYNNTNLTLADS--VPKSLHELQPNIT--- 536

Query: 299 EGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI---ASGNELSASNTYR 355
               +N+G+          ++S+ D     Y   ++ RA EL      S  EL     ++
Sbjct: 537 ----ENHGR--------LGQSSTIDL----YDPDDLFRAWELLYNASVSVPELWEDKGWK 580

Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
           +D++D+TRQ LA+     ++ +IE Y+      +       + ++E +D +L+    F L
Sbjct: 581 FDMVDITRQVLAERFKLEYVELIEKYK--KGADISCDGDILIGILESLDDVLSASPHFRL 638

Query: 416 GPWLESAKQ----------------------------LAQNEEQEKQ-YEWNARTQITMW 446
             W+ +A                              L  N    +Q + +NA  QIT+W
Sbjct: 639 DTWVNAAVSSAPLPASTNCSSTSINNSSLLFNSSTSILTSNLTPTQQFFAYNAINQITIW 698

Query: 447 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
               Q     + DY +K W GL+R YY PR  ++ +Y+ E
Sbjct: 699 GPTGQ-----IDDYASKSWGGLVRGYYLPRWKMFLEYIDE 733


>gi|427385205|ref|ZP_18881710.1| hypothetical protein HMPREF9447_02743 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727373|gb|EKU90233.1| hypothetical protein HMPREF9447_02743 [Bacteroides oleiciplenus YIT
           12058]
          Length = 719

 Score =  218 bits (554), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 150/549 (27%), Positives = 261/549 (47%), Gaps = 76/549 (13%)

Query: 8   VLPAFSGNVPA-ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
           V P +SG VP  A + +  +   + L N F+  +         L  TD  F EI   +  
Sbjct: 227 VFPGYSGMVPHDADEKLGLNLTKSDLWNGFTRPA--------FLQPTDTRFAEIANLYYR 278

Query: 67  QQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 124
           +Q K +G+  + Y+ D F E  N   VD      + G AI   M+  +  A W++QGW  
Sbjct: 279 EQEKLFGKADY-YSMDPFHEAENAASVD----FDAAGKAIMQAMKKVNPKATWVVQGW-- 331

Query: 125 SYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW----------- 173
           + +P  RP  ++ + N    G L++LDLF+E +P+W       G+P IW           
Sbjct: 332 TENP--RPEMIENMKN----GDLLILDLFSECRPMW-------GIPSIWKRDKGYEQHDW 378

Query: 174 --CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEM 230
             CML NF GN+ ++G +D +     + + +   T + G+G++MEG E NPV+++LM E+
Sbjct: 379 LFCMLLNFGGNVGLHGRMDQLLDNFYQTKDNPLATHLKGIGLTMEGSENNPVMFELMCEL 438

Query: 231 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFP 288
            ++ EK   + W+  Y   RYG     I+ AW +L +++YNC  G   +  +  +    P
Sbjct: 439 PWRPEKFTKEEWLKDYLFARYGVKDEKIEKAWTLLANSIYNCPFGNNQQGPHESIFCGRP 498

Query: 289 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 348
            ++ +  + +  K +NY  P   E                        A  L +   ++ 
Sbjct: 499 SMN-NFQASSWSKMKNYYDPTVTE-----------------------EAARLMLEVADKY 534

Query: 349 SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLA 408
             +N + YDL+D+ RQ+L+     ++   I  ++  D     + S++FL+++   D LL 
Sbjct: 535 RGNNNFEYDLVDIVRQSLSDKGRIVYNQTIADFKSFDKRSFARDSQKFLDILLLQDRLLG 594

Query: 409 CHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGL 468
               F +G W+E A+ L    E++  YEWNAR QIT W +    +   LRDY +K W+G+
Sbjct: 595 TRSEFRVGRWIEQARNLGTTPEEKDLYEWNARVQITTWGNRVCADDGGLRDYAHKEWNGI 654

Query: 469 LRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALIT 528
           LRD+Y  R A Y++ + + L+     +L     ++  +   W   +  Y     G+++  
Sbjct: 655 LRDFYYKRWAAYWQTLQDQLDGKPEVKL-----DYYAMEEPWTLAKTPYDSTPEGNSVDV 709

Query: 529 SQWLYNKYL 537
           ++ ++ K +
Sbjct: 710 AKEVFEKAM 718


>gi|262406054|ref|ZP_06082604.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
 gi|294648118|ref|ZP_06725661.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
 gi|294806859|ref|ZP_06765684.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
           CC 1b]
 gi|345510559|ref|ZP_08790126.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
 gi|229443271|gb|EEO49062.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
 gi|262356929|gb|EEZ06019.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
 gi|292636502|gb|EFF54977.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
 gi|294445888|gb|EFG14530.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
           CC 1b]
          Length = 718

 Score =  218 bits (554), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 148/531 (27%), Positives = 255/531 (48%), Gaps = 73/531 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           + PAF+G VP A     P  +   +  W     +      Y+L    P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEAFAQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
             KE+G  ++ Y  D+F+E   P+D  +       ++  G  IY  + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLTEYGETIYKSITAGNPDAVWVTQG 348

Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
           W F Y   FW    +KALL++VP  K++++DL  +  K +W+T +       FYG  +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408

Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
             + NF G   M G LD  A   V+A R +    ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
             + +D+  W+  Y   RYG    A+++AW +   T Y+            + ++P    
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSS-----------LYSYPRFTW 517

Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
             +   + +       +SK  +                + + ++A+ L+ +  +EL +S 
Sbjct: 518 QTVVPDQRR-------ISKIDL----------------SDDYLQAIRLYASCADELKSSE 554

Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
            YR DLI+     LA  A   +   ++    N      +  ++ ++L+ D+D LLA H  
Sbjct: 555 LYRNDLIEFVSYYLAAKAENFYKQALKDDSENRVFAAQRNLQQTVDLLMDVDRLLASHPL 614

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           + L  W+E A+      +++  YE NA+  IT W         +  DY  ++WSGL++DY
Sbjct: 615 YRLEEWVEFARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667

Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
           Y PR  +YF        + D  ++++W  +WI  T+ W N    +  PV++
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSNSTTPFDDPVKA 708


>gi|168216263|ref|ZP_02641888.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens NCTC 8239]
 gi|182381741|gb|EDT79220.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens NCTC 8239]
          Length = 2104

 Score =  218 bits (554), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 155/528 (29%), Positives = 256/528 (48%), Gaps = 51/528 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VL  +SG VP   +   P A+    G W      P    TY+ +     F ++   F E+
Sbjct: 415 VLQGYSGMVPRDFKEKNPEAQTISQGGWCGFDR-PDMLKTYVNEEEVDYFQKVADVFYEK 473

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q + +G  ++ Y  D F E  NT  +D+      +   I + M   D+DAVW++Q W  +
Sbjct: 474 QKEVFGDVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 529

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
                  P    L      G+ +VLDLF+EV P W+  ++   +P+IW MLHNF G + M
Sbjct: 530 -------PSNNKLEGLTKKGQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 581

Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
               + +A   P     SE+  MVG+G++ E I  NP+ Y+L+ +MA+  ++++ + W  
Sbjct: 582 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTE 639

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
            Y  RRYG++   I +AWN++  T Y        K  D    +     SII+   G    
Sbjct: 640 DYIERRYGKTNKEILEAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 685

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
           +G        +KS  S++ H  + Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q
Sbjct: 686 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQ 736

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            LA  A E +  +  AY   +      +S +FLEL++  + +L+    FL+G W+E A+ 
Sbjct: 737 LLANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 796

Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           + ++ +   +  +E+NAR  +T W      +   L+DY N+ WSGL  DYY  R   +  
Sbjct: 797 MLKDSDDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWIN 856

Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
            +   L+ G     K    +W K+  DW N ++    +YP E++ + L
Sbjct: 857 GLQTELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 900


>gi|422345314|ref|ZP_16426228.1| hypothetical protein HMPREF9476_00301 [Clostridium perfringens
           WAL-14572]
 gi|373228039|gb|EHP50349.1| hypothetical protein HMPREF9476_00301 [Clostridium perfringens
           WAL-14572]
          Length = 1842

 Score =  218 bits (554), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 155/528 (29%), Positives = 255/528 (48%), Gaps = 51/528 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VL  +SG VP   +   P A+    G W      P    TY+ +     F ++   F E+
Sbjct: 415 VLQGYSGMVPRDFKEKNPEAQTISQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEK 473

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q + +G  ++ Y  D F E  NT  +D+      +   I + M   D+DAVW++Q W  +
Sbjct: 474 QKEVFGDVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 529

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
                  P    L       + +VLDLF+EV P W+  ++   +P+IW MLHNF G + M
Sbjct: 530 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 581

Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
               + +A   P     SE+  MVG+G++ E I  NP+ Y+L+ +MA+  ++++ + W  
Sbjct: 582 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTE 639

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
            Y  RRYG++   I DAWN++  T Y        K  D    +     SII+   G    
Sbjct: 640 DYIERRYGKTNKEILDAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 685

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
           +G        +KS  S++ H  + Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q
Sbjct: 686 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQ 736

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            LA  A E +  +  AY   +      +S +FLEL++  + +L+    FL+G W+E A+ 
Sbjct: 737 LLANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 796

Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           + ++ +   +  +E+NAR  +T W      +   L+DY N+ WSGL  DYY  R   +  
Sbjct: 797 MLKDSDDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWIN 856

Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
            +   L+ G     K    +W K+  DW N ++    +YP E++ + L
Sbjct: 857 GLQTELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 900


>gi|168207628|ref|ZP_02633633.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens E str. JGS1987]
 gi|170661027|gb|EDT13710.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens E str. JGS1987]
          Length = 2104

 Score =  217 bits (553), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 155/528 (29%), Positives = 255/528 (48%), Gaps = 51/528 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VL  +SG VP   +   P A+    G W      P    TY+ +     F ++   F E+
Sbjct: 415 VLQGYSGMVPRDFKEKNPEAQTISQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEK 473

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q + +G  ++ Y  D F E  NT  +D+      +   I + M   D+DAVW++Q W  +
Sbjct: 474 QEEVFGEVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 529

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
                  P    L       + +VLDLF+EV P W+  ++   +P+IW MLHNF G + M
Sbjct: 530 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 581

Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
               + +A   P     SE+  MVG+G++ E I  NP+ Y+L+ +MA+  ++++ + W  
Sbjct: 582 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTE 639

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
            Y  RRYG++   I DAWN++  T Y        K  D    +     SII+   G    
Sbjct: 640 DYIERRYGKTNKEILDAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 685

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
           +G        +KS  S++ H  + Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q
Sbjct: 686 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQ 736

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            LA  A E +  +  AY   +      +S +FLEL++  + +L+    FL+G W+E A+ 
Sbjct: 737 LLANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 796

Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           + ++ +   +  +E+NAR  +T W      +   L+DY N+ WSGL  DYY  R   +  
Sbjct: 797 MLKDSDDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWIN 856

Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
            +   L+ G     K    +W K+  DW N ++    +YP E++ + L
Sbjct: 857 GLQTELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 900


>gi|340514474|gb|EGR44736.1| glycoside hydrolase family 89 [Trichoderma reesei QM6a]
          Length = 762

 Score =  217 bits (553), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 142/490 (28%), Positives = 250/490 (51%), Gaps = 42/490 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF G VP A   V P A++     W        +     LD  DPLF ++ R+FI +
Sbjct: 245 ILPAFPGFVPRAAPRVLPDARLLHSIQWAGFPE--IFTEDTFLDPVDPLFAQMQRSFITK 302

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + YG  ++ Y  D F+E  PP     Y+ ++ +  +  ++S D +A+W+ Q WLF+ +
Sbjct: 303 QKQAYGNVTNFYTLDQFNEMIPPSGDVAYLRNVSSNTWKALKSADPNAIWVFQAWLFAQN 362

Query: 128 -PFWRPPQMKALLNSVPL-GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             FW   +++A L  V     +++LD+++E  P W  ++ +YG P+IWC L N+   I +
Sbjct: 363 TTFWTNERIEAYLGGVTADSDMLILDIWSESMPQWQRAQSYYGKPWIWCELQNYGATINL 422

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG + ++   P+ A   E+T++ G G+SMEG + N +VYDL+   A+  E +D +A+ + 
Sbjct: 423 YGQIQNVTNSPILA-LQESTSLSGFGLSMEGQQNNEIVYDLLLAQAWSSEPLDTEAYFHN 481

Query: 246 YSVRRYG--RSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
           ++  RY   +    I DAW  +  TVY+ T+         +   P V  SII +      
Sbjct: 482 WASARYSSDQRPGFIHDAWETVRTTVYDNTN---------LTLMPSVPKSIIEL------ 526

Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASG---NELSASNTYRYDLID 360
                V + + + ++ +      L Y  + ++ A +    +G     L  ++ Y+YDL+D
Sbjct: 527 -----VPRTSNM-ADITGILGTKLPYDPAVMVSAWKQLYHAGLQDTSLFNNSAYQYDLVD 580

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSR------RFLELVEDMDGLLACHDGFL 414
            TRQ LA     ++ NI++ Y  ++     ++ R      +  +L+  +D +L+ +  F 
Sbjct: 581 WTRQVLANAFIPIYKNIVDIYYNSNQTAGSRIQRLKAQGQQVTKLLLSLDLVLSSNRNFR 640

Query: 415 LGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 474
           L  WL +A+  A +      +E+ AR QIT+W  + Q     L DY +K WSGL++ Y+ 
Sbjct: 641 LSTWLSAARSSAPSPAYVDSFEYEARNQITLWGPSGQ-----LIDYASKAWSGLMKTYHL 695

Query: 475 PRAAIYFKYM 484
            R  ++ +Y+
Sbjct: 696 KRWQMFVEYL 705


>gi|169346867|ref|ZP_02865815.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens C str. JGS1495]
 gi|169296926|gb|EDS79050.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens C str. JGS1495]
          Length = 2104

 Score =  217 bits (552), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 155/528 (29%), Positives = 255/528 (48%), Gaps = 51/528 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VL  +SG VP   +   P A+    G W      P    TY+ +     F ++   F E+
Sbjct: 415 VLQGYSGMVPRDFKEKNPEAQTISQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEK 473

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q + +G  ++ Y  D F E  NT  +D+      +   I + M   D+DAVW++Q W  +
Sbjct: 474 QKEVFGDVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 529

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
                  P    L       + +VLDLF+EV P W+  ++   +P+IW MLHNF G + M
Sbjct: 530 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 581

Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
               + +A   P     SE+  MVG+G++ E I  NP+ Y+L+ +MA+  ++++ + W  
Sbjct: 582 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTE 639

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
            Y  RRYG++   I DAWN++  T Y        K  D    +     SII+   G    
Sbjct: 640 DYIERRYGKTNKEILDAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 685

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
           +G        +KS  S++ H  + Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q
Sbjct: 686 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQ 736

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            LA  A E +  +  AY   +      +S +FLEL++  + +L+    FL+G W+E A+ 
Sbjct: 737 LLANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 796

Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           + ++ +   +  +E+NAR  +T W      +   L+DY N+ WSGL  DYY  R   +  
Sbjct: 797 MLKDSDDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWIN 856

Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
            +   L+ G     K    +W K+  DW N ++    +YP E++ + L
Sbjct: 857 GLQAELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 900


>gi|225875033|ref|YP_002756492.1| alpha-N-acetylglucosaminidase [Acidobacterium capsulatum ATCC
           51196]
 gi|225793771|gb|ACO33861.1| alpha-N-acetylglucosaminidase [Acidobacterium capsulatum ATCC
           51196]
          Length = 800

 Score =  217 bits (552), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 151/521 (28%), Positives = 238/521 (45%), Gaps = 45/521 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V P + G VP       P A +   GNW   +  P W     LD  DPLF  +  +F + 
Sbjct: 278 VFPGYFGMVPEDFARRHPGAHVIPQGNWNGFRR-PAW-----LDPRDPLFAAVAASFYKH 331

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + +G +S IY+ + F E     D P  +SS   AI   +      A+W+   W    +
Sbjct: 332 QQELFGDSS-IYDIELFQEGGSAADVP--VSSAAKAIQKALLRAHPQAMWMTLAW--QNN 386

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
           P       +ALL++V    L+V+D+     P  +  + F G  Y++  L +F G   +  
Sbjct: 387 P------SRALLSAVDRSHLLVVDIDQGRTPHENRERDFMGAAYLFGGLWDFGGRTTLGA 440

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
            L   A           +TM G  +  EG++ NP  +DL +EMA++   VD++ W  +Y+
Sbjct: 441 NLYDYAVRLPRMGLRAGSTMKGTALFSEGLDNNPAAFDLFTEMAWRTSPVDLRTWSREYA 500

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNC-TDGATDKN-RDVIV-AFPDVDPSIISVTEGKYQN 304
            RRYG   P  + AW +L  T Y    DG ++   RD    +  D  PS+ +V+      
Sbjct: 501 RRRYGMDDPHTRRAWRILMETAYGTRADGVSNHGERDAPPESLFDAQPSLDAVS------ 554

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                          SS+    L Y   +   AL   + +   +    TY+YDL+D+ RQ
Sbjct: 555 --------------ASSWSPDRLRYDPKKFEAALTELLQAPPGMREMPTYQYDLVDVARQ 600

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            LA ++ +    I +AY          L +++L ++   D LLA +  F++GPWL +   
Sbjct: 601 TLANWSRKTLPEIKDAYDHRHEARFETLEKQWLCMMMLQDKLLATNTSFMVGPWLNAVSP 660

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
            A    ++++ +++AR+ +T W + T  EA  LRDYGNK W+GL RDYY  R  IYF  +
Sbjct: 661 WAATATEQRRLDYDARSILTTWGNRTASEAG-LRDYGNKDWAGLTRDYYYRRWQIYFNDL 719

Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDA 525
             SL++G      D    W      W   +  Y  ++ GD+
Sbjct: 720 DRSLKTGTPPHPID----WFAFGEKWNRAQTHYATQARGDS 756


>gi|182624959|ref|ZP_02952737.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens D str. JGS1721]
 gi|177909756|gb|EDT72174.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens D str. JGS1721]
          Length = 2104

 Score =  217 bits (552), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 155/528 (29%), Positives = 255/528 (48%), Gaps = 51/528 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VL  +SG VP   +   P A+    G W      P    TY+ +     F ++   F E+
Sbjct: 415 VLQGYSGMVPRDFKEKNPEAQTISQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEK 473

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q + +G  ++ Y  D F E  NT  +D+      +   I + M   D+DAVW++Q W  +
Sbjct: 474 QKEVFGDVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 529

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
                  P    L       + +VLDLF+EV P W+  ++   +P+IW MLHNF G + M
Sbjct: 530 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 581

Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
               + +A   P     SE+  MVG+G++ E I  NP+ Y+L+ +MA+  ++++ + W  
Sbjct: 582 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTE 639

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
            Y  RRYG++   I DAWN++  T Y        K  D    +     SII+   G    
Sbjct: 640 DYIERRYGKTNKEILDAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 685

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
           +G        +KS  S++ H  + Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q
Sbjct: 686 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQ 736

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            LA  A E +  +  AY   +      +S +FLEL++  + +L+    FL+G W+E A+ 
Sbjct: 737 LLANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 796

Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           + ++ +   +  +E+NAR  +T W      +   L+DY N+ WSGL  DYY  R   +  
Sbjct: 797 MLKDSDDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWIN 856

Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
            +   L+ G     K    +W K+  DW N ++    +YP E++ + L
Sbjct: 857 GLQTELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 900


>gi|168212494|ref|ZP_02638119.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens CPE str. F4969]
 gi|170716100|gb|EDT28282.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens CPE str. F4969]
          Length = 2104

 Score =  217 bits (552), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 155/528 (29%), Positives = 255/528 (48%), Gaps = 51/528 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VL  +SG VP   +   P A+    G W      P    TY+ +     F ++   F E+
Sbjct: 415 VLQGYSGMVPRDFKEKNPEAQTISQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEK 473

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q + +G  ++ Y  D F E  NT  +D+      +   I + M   D+DAVW++Q W  +
Sbjct: 474 QKEVFGDVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 529

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
                  P    L       + +VLDLF+EV P W+  ++   +P+IW MLHNF G + M
Sbjct: 530 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 581

Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
               + +A   P     SE+  MVG+G++ E I  NP+ Y+L+ +MA+  ++++ + W  
Sbjct: 582 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTE 639

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
            Y  RRYG++   I DAWN++  T Y        K  D    +     SII+   G    
Sbjct: 640 DYIERRYGKTNKEILDAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 685

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
           +G        +KS  S++ H  + Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q
Sbjct: 686 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQ 736

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            LA  A E +  +  AY   +      +S +FLEL++  + +L+    FL+G W+E A+ 
Sbjct: 737 LLANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 796

Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           + ++ +   +  +E+NAR  +T W      +   L+DY N+ WSGL  DYY  R   +  
Sbjct: 797 MLKDADDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWIN 856

Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
            +   L+ G     K    +W K+  DW N ++    +YP E++ + L
Sbjct: 857 GLQTELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 900


>gi|423214208|ref|ZP_17200736.1| hypothetical protein HMPREF1074_02268 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392693153|gb|EIY86388.1| hypothetical protein HMPREF1074_02268 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 718

 Score =  217 bits (552), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 148/531 (27%), Positives = 255/531 (48%), Gaps = 73/531 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           + PAF+G VP A     P  +   +  W     +      Y+L    P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEAFAQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
             KE+G  ++ Y  D+F+E   P+D  +       ++  G  IY  + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLAGYGETIYKSIAAGNPDAVWVTQG 348

Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
           W F Y   FW    +KALL++VP  K++++DL  +  K +W+T +       FYG  +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408

Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
             + NF G   M G LD  A   V+A R +    ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNAMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
             + +D+  W+  Y   RYG    A+++AW +   T Y+            + ++P    
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSS-----------LYSYPRFTW 517

Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
             +   + +       +SK  +                + + ++A+ L+ +  +EL +S 
Sbjct: 518 QTVVPDQRR-------ISKIDL----------------SDDYLQAIRLYASCADELKSSE 554

Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
            YR DLI+     +A  A   +   ++    N      +  ++ ++L+ D+D LLA H  
Sbjct: 555 LYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPL 614

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           + L  W+E A+      +++  YE NA+  IT W         +  DY  ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667

Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
           Y PR  +YF        + D  ++++W  +WI  T+ W N    +  PVE+
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSNSTTPFDDPVEA 708


>gi|224027030|ref|ZP_03645396.1| hypothetical protein BACCOPRO_03789 [Bacteroides coprophilus DSM
           18228]
 gi|224020266|gb|EEF78264.1| hypothetical protein BACCOPRO_03789 [Bacteroides coprophilus DSM
           18228]
          Length = 837

 Score =  216 bits (551), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 153/515 (29%), Positives = 236/515 (45%), Gaps = 73/515 (14%)

Query: 8   VLPAFSGNVPAALQNVFPSAKI--TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
           + P F G +P A + ++P   I  T  G  F           +++  T+PLF +I  AFI
Sbjct: 230 ICPGFPGFIPEAFKRIYPDLHIVETHWGGAFH---------NWMISPTEPLFAKISEAFI 280

Query: 66  EQQLKEYGRTSHIYNCDTFDENTPPVDSP------EYISSLGAAIYSGMQSGDSDAVWLM 119
           ++  KE+G+  + Y  D+F+E   P          E  +S G  +YS ++  + DAVW+M
Sbjct: 281 KEWEKEFGKCDY-YLVDSFNEMDIPFPEKGNPARYEMAASYGEKVYSSIKRANKDAVWVM 339

Query: 120 QGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDL-------FAEVKPIWSTSKQFYGVPY 171
           QGW+F Y    W    + AL++ VP  K+++LDL       F   +  W   K FY   +
Sbjct: 340 QGWMFGYQRHIWDYETLGALVSRVPDDKMLLLDLAVDYNRHFWHSEVNWEYYKGFYNKQW 399

Query: 172 IWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEM 230
           ++ ++ N  G   M G+LD  A G +EA +S N   +V  G++ EGIE N V+Y+L+++ 
Sbjct: 400 VYSVIPNMGGKTGMTGVLDFYANGHLEALSSSNRGNLVAHGLAPEGIENNEVLYELVTDA 459

Query: 231 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDV 290
            +   ++DV+ W+ QYS+ RYG++   +  AW+ L  +VY                F D 
Sbjct: 460 GWSDHRMDVRDWLKQYSINRYGKAPAQLMKAWDYLLKSVYG--------------TFTDH 505

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
                    G  +N    +S                      +  + LE F+A+  EL  
Sbjct: 506 PRFNWQFRPGLVKNGSINIS---------------------DDYFKGLESFVAASEELKD 544

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
           S  Y  DL ++T   L   A  L   I + Y L D      L  RF   +  MD +L+ H
Sbjct: 545 SPYYLTDLCEMTAHYLGSKAEILTRQIDQEYLLGDTLQAHFLQSRFETFMLGMDRILSQH 604

Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
               L  W+  A + A+ E Q KQYE NAR  +T+W          + DY  + WSGL+ 
Sbjct: 605 PTLRLDRWVSFASKAARTEAQRKQYEMNARRIVTVW-------GPPVDDYSARMWSGLVG 657

Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 505
            YY  R   Y+K      +SG    L  W R+W++
Sbjct: 658 SYYLGRWKEYYK----GRDSGKSADLSSWERKWVE 688


>gi|299140550|ref|ZP_07033688.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
           (NAG) [Prevotella oris C735]
 gi|298577516|gb|EFI49384.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
           (NAG) [Prevotella oris C735]
          Length = 741

 Score =  216 bits (551), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 146/490 (29%), Positives = 232/490 (47%), Gaps = 49/490 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP F G +P   +       +T  G W              L  TD  F +I   +  +
Sbjct: 223 VLPGFCGMMPHDAKAKL-GLNVTDGGIWNGYTRPAN------LSPTDAHFDKIADLYYAE 275

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             K YG+ ++ Y+ D F E T   ++ +Y S  G  +   M+  +  A W++QGW  + +
Sbjct: 276 LTKLYGKANY-YSMDPFHE-TNDDETIDY-SKAGCKVMEAMKRVNPKATWVIQGW--TEN 330

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAG 181
           P  RP  +K + N    G L+VLDLF+E +P      IW   K +    +++CML NF  
Sbjct: 331 P--RPQMIKNMKN----GDLLVLDLFSECRPMFGIPSIWKREKGYEQHDWLFCMLENFGA 384

Query: 182 NIEMYGILDSIAFGPVEARTSENTT--MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDV 239
           N+ ++G +D +       + S   T  + G+G +MEG E NPV+++LMSE+ ++ E    
Sbjct: 385 NVGLHGRMDQLLHNFYSTKQSSPNTQHLKGIGFTMEGSENNPVMFELMSELPWRTE-CKK 443

Query: 240 KAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
           + WI  Y   RYG++ P I+ AW +L  T+YNC  G   +             SI     
Sbjct: 444 EDWIKGYVKARYGKTSPEIERAWQLLSETIYNCPAGNNQQG---------PHESIFC--- 491

Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
                 G+P      +KS +   ++    Y     + A +L     ++   +N + YDL+
Sbjct: 492 ------GRPSLNNFQVKSWSKMRNY----YDPQATLEAAQLMTGIADQYKGNNNFEYDLV 541

Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
           D+ RQALA      +L  I  Y         + + RFLE++   D LL     F LG W 
Sbjct: 542 DICRQALADQGRLQYLKTIADYNGFSRKAFAKDAHRFLEMILLQDKLLGTRTEFRLGHWT 601

Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
           E+A++L   ++++  YEWNAR QIT W +    +   L DY +K W G+L+D+Y  R  I
Sbjct: 602 EAARKLGTTQQEKDLYEWNARVQITTWGNRICADKGGLHDYAHKEWQGILKDFYYKRWKI 661

Query: 480 YFKYMIESLE 489
           +   + + +E
Sbjct: 662 FMDALAKQME 671


>gi|322703040|gb|EFY94656.1| alpha-N-acetylglucosaminidase, putative [Metarhizium anisopliae
           ARSEF 23]
          Length = 774

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 146/507 (28%), Positives = 254/507 (50%), Gaps = 51/507 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF G VP +++ V P+A +T   NWF+   D ++     LD  D  + E+ + F+ +
Sbjct: 244 VLPAFPGFVPESIKKVRPNANLTVSPNWFAPAPD-KYTRDLFLDPLDDTYAELQKLFVTK 302

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSY 126
           Q+  +G  +++Y  D F+E +P      Y+  +    Y+G+ + +  AVWL+QGWL FS 
Sbjct: 303 QIDAFGNVTNVYTLDQFNELSPASGDTAYLRGIARNTYAGLTAANPAAVWLLQGWLFFSS 362

Query: 127 DPFWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             FW  P++ A L  V   + ++VLDL++EV P W  +  + G P+IWC LH+F GN+ +
Sbjct: 363 RNFWTQPRIDAYLGGVEDHQGMLVLDLYSEVNPQWQRTNSYSGKPWIWCQLHDFGGNMAL 422

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
            G + ++   P++A  +++ ++VG G++ E  E N VVYD++ + A+    +D +A+   
Sbjct: 423 EGRVQTLTSAPIDA-LAQSKSLVGFGLTPEAYEGNEVVYDILLDQAWSATPLDTQAYFAS 481

Query: 246 YSVRRYG--RSVPA-IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
           +  +RY    S+P+ +  AW +L   VY+ T   TD  + V VA   + P++  +     
Sbjct: 482 WVTKRYAGISSIPSELYRAWEILRTDVYSNTR--TDIPQ-VPVATYQLRPALSGIA---- 534

Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRA-----LELFIASGNELSASNTYRYD 357
                         + T  + HP   +    V++      LE     G+ L     ++ D
Sbjct: 535 --------------NRTGHFPHPTALHYDPLVLQGVWKLMLEALTRQGS-LWKVPAFQLD 579

Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDAHG-------------VFQLSRRFLELVEDMD 404
            +D++RQ L+   + L+ +++ AY+ +   G             V     R L L+  +D
Sbjct: 580 FVDVSRQMLSNQFDVLYADLVNAYKCSTGAGGSRELRSNTPNCDVKAAGARLLFLLSTLD 639

Query: 405 GLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKY 464
             L     F L  W+++A    +    E  + +NAR+Q+T+W    Q  A+ L DY  K 
Sbjct: 640 LTLLTSRHFALQSWVDAASAWGKAAGNEDLFTFNARSQVTVW----QVNATNLNDYAAKA 695

Query: 465 WSGLLRDYYGPRAAIYFKYMIESLESG 491
           W GL+  YY  R +I+   ++ +  SG
Sbjct: 696 WGGLVGSYYKGRWSIFVDALVAASSSG 722


>gi|295085513|emb|CBK67036.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
           XB1A]
          Length = 718

 Score =  216 bits (549), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 148/531 (27%), Positives = 254/531 (47%), Gaps = 73/531 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           + PAF+G VP       P  +   +  W     +      Y+L    P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEGFVQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
             KE+G  ++ Y  D+F+E   P+D  +       ++  G  IY  + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQG 348

Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
           W F Y   FW    +KALL++VP  K++++DL  +  K +WST +       FYG  +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWSTEQTWKVHDGFYGKKWIF 408

Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
             + NF G   M G LD  A   V+A R +    ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
             + +D+  W+  Y   RYG    A+++AW +   T Y+            + ++P    
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSS-----------LYSYPRFTW 517

Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
             +   + +       +SK  +                + + ++A+ L+ +  +EL +S 
Sbjct: 518 QTVIPDQRR-------ISKIDL----------------SDDYLQAIRLYASCADELKSSE 554

Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
            YR DLI+     +A  A   +   ++    N      +  ++ ++L+ D+D LLA H  
Sbjct: 555 LYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPL 614

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           + L  W+E A+      +++  YE NA+  IT W         +  DY  ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667

Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
           Y PR  +YF        + D  ++++W  +WI  T+ W N    +  PVE+
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSNSTTPFDDPVEA 708


>gi|423217398|ref|ZP_17203894.1| hypothetical protein HMPREF1061_00667 [Bacteroides caccae
           CL03T12C61]
 gi|392628557|gb|EIY22583.1| hypothetical protein HMPREF1061_00667 [Bacteroides caccae
           CL03T12C61]
          Length = 707

 Score =  216 bits (549), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 144/496 (29%), Positives = 237/496 (47%), Gaps = 59/496 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAF 64
           V P ++G VP  +          +LG  + +    +WC       L + D  F      +
Sbjct: 233 VFPGYAGMVPRNIGE--------KLG--YQIADPGKWCGFPRPAFLSSEDEHFDSFAAMY 282

Query: 65  IEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            E+  K YG+  + Y+ D F E  NT  VD    ++  G +I   M+  + +AVW++Q W
Sbjct: 283 YEELEKLYGKAKY-YSMDPFHEGGNTEGVD----LAKAGTSIMKAMKKANPEAVWVIQAW 337

Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS-------KQFYGVPYIWCM 175
             +  P        A+++ +  G ++VLDL++E +P W  S       K F    +++CM
Sbjct: 338 QANPRP--------AMIDVLNAGDMLVLDLYSEKRPQWGDSDSMWYREKGFGKHDWLYCM 389

Query: 176 LHNFAGNIEMYGILDSIAFGPVEARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 234
           L NF GN+ ++G ++ +  G  +A    N   M GVG + EGIE NPV+++L+ E+ ++ 
Sbjct: 390 LLNFGGNVGLHGRMNQLVNGYYDACAHVNGKRMRGVGATPEGIENNPVMFELLYELPWRA 449

Query: 235 EKVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPS 293
           E+     W+  Y   RYG  + P + +AW  L HTVYN                P   P 
Sbjct: 450 ERFSPDVWLQGYLKARYGGELSPEVMEAWRALEHTVYNA---------------PKNSPG 494

Query: 294 IISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 353
                EG  ++     ++       TS++ +  L+YS     +A +L ++   +    N 
Sbjct: 495 -----EGTLESL--LCARPGFHLDRTSTWGYSKLFYSPDSTSKAADLMLSVAEQYKGDNN 547

Query: 354 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 413
           + YDL+D+ RQ+ A   N L   I ++Y   D     + +++FLEL+   D LL+    F
Sbjct: 548 FEYDLVDIVRQSNADKGNALLDEISQSYDRKDKENFRKQTQQFLELILSQDSLLSTRKEF 607

Query: 414 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 473
            +  WL +A+ L   + ++K YEWNA   IT+W D+       L DY ++ WSGLL+D Y
Sbjct: 608 SVSSWLAAARSLGNTDAEKKLYEWNASALITVWGDSIASNQGGLHDYSHREWSGLLKDLY 667

Query: 474 GPRAAIYFKYMIESLE 489
             R   +F+   + LE
Sbjct: 668 YLRWKTFFEQKQQELE 683


>gi|160884066|ref|ZP_02065069.1| hypothetical protein BACOVA_02042 [Bacteroides ovatus ATCC 8483]
 gi|423291473|ref|ZP_17270321.1| hypothetical protein HMPREF1069_05364 [Bacteroides ovatus
           CL02T12C04]
 gi|156110408|gb|EDO12153.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
           8483]
 gi|392663473|gb|EIY57023.1| hypothetical protein HMPREF1069_05364 [Bacteroides ovatus
           CL02T12C04]
          Length = 718

 Score =  216 bits (549), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 148/531 (27%), Positives = 255/531 (48%), Gaps = 73/531 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           + PAF+G VP A     P  +   +  W     +      Y+L    P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEAFAQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
             KE+G  ++ Y  D+F+E   P+D  +       ++  G  IY  + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQG 348

Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
           W F Y   FW    +KALL++VP  K++++DL  +  K +W+T +       FYG  +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408

Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
             + NF G   M G LD  A   V+A R +    ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
             + +D+  W+  Y   RYG    A+++AW +   T Y+            + ++P    
Sbjct: 469 SSDSIDLDDWMTIYCEARYGGYPDAMEEAWKLFRKTAYSS-----------LYSYPRFTW 517

Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
             +   + +       +SK  +                + + ++A+ L+ +  +EL +S 
Sbjct: 518 QTVIPDQRR-------ISKIDL----------------SDDYLQAIRLYASCADELKSSE 554

Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
            YR DLI+     +A  A   +   ++    N      +  ++ ++L+ D+D LLA H  
Sbjct: 555 LYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPL 614

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           + L  W+E A+      +++  YE NA+  IT W         +  DY  ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667

Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
           Y PR  +YF        + D  ++++W  +WI  T+ W N    +  PVE+
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWINSTTPFDDPVEA 708


>gi|110801838|ref|YP_698175.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens SM101]
 gi|110682339|gb|ABG85709.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens SM101]
          Length = 2095

 Score =  215 bits (548), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 156/528 (29%), Positives = 255/528 (48%), Gaps = 51/528 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VL  +SG VP   +   P A+    G W      P    TY+ +     F  +   F E+
Sbjct: 406 VLQGYSGMVPRDFKEKNPEAQTISQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEK 464

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q + +G  ++ Y  D F E  NT  +D+      +   I + M   D+DAVW++Q W  +
Sbjct: 465 QKEVFGDVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 520

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
                  P    L       + +VLDLF+EV P W+  ++   +P+IW MLHNF G + M
Sbjct: 521 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 572

Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
               + +A   P     SE+  MVG+G++ E I  NP+ Y+L+ +MA+  ++++ + W  
Sbjct: 573 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTE 630

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
            Y  RRYG++   I +AWN++  T Y        K  D    +     SII+   G    
Sbjct: 631 DYIERRYGKTNEEILEAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 676

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
           +G        +KS  S++ H  + Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q
Sbjct: 677 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFSKNYDEFKDSDAFLYDFADILKQ 727

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            LA  A E +  +  AY   DA     +S +FLEL++  + +L+    FL+G W+E A+ 
Sbjct: 728 LLANSAQEYYEVMCNAYNNRDAEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 787

Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           + ++ +   +  +E+NAR  +T W      +   L+DY N+ WSGL  DYY  R   +  
Sbjct: 788 MLKDADDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTGDYYYARWEKWIN 847

Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
            +   L+ G     K    +W K+  DW N ++    +YP E++ + L
Sbjct: 848 GLQIELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 891


>gi|293371915|ref|ZP_06618319.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
           3f]
 gi|292633161|gb|EFF51738.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
           3f]
          Length = 718

 Score =  215 bits (548), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 150/531 (28%), Positives = 253/531 (47%), Gaps = 73/531 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           + PAF+G VP       P  +   +  W     +      Y+L    P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEGFVQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
             KE+G  ++ Y  D+F+E   P+D  +       ++  G  IY  + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQG 348

Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
           W F Y   FW    +KALL++VP  K++++DL  +  K +W+T +       FYG  +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408

Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
             + NF G   M G LD  A   V+A R +    ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
             + +D+  W+  Y   RYG    A+++AW +   T Y+                     
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYS--------------------- 507

Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
           S+ S     +Q     +S +  +     S D+          ++A+ L+ +  +EL +S 
Sbjct: 508 SLYSYPRFTWQTV---ISDQRRISKIDLSDDY----------LQAIRLYASCADELKSSE 554

Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
            YR DLI+     +A  A   +   ++    N      +  ++ ++L+ D+D LLA H  
Sbjct: 555 LYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPL 614

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           + L  W+E A+      +++  YE NA+  IT W         +  DY  ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667

Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
           Y PR  +YF        + D  ++++W  +WI  T+ W N    +  PVE+
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSNSTTPFDDPVEA 708


>gi|168209163|ref|ZP_02634788.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens B str. ATCC 3626]
 gi|170712640|gb|EDT24822.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens B str. ATCC 3626]
          Length = 2104

 Score =  215 bits (548), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 155/528 (29%), Positives = 253/528 (47%), Gaps = 51/528 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VL  +SG VP   +   P A+    G W      P    TY+ +     F  +   F E+
Sbjct: 415 VLQGYSGMVPRDFKEKNPEAQTISQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEK 473

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q + +G  ++ Y  D F E  NT  +D+      +   I + M   D+DAVW++Q W  +
Sbjct: 474 QKEVFGDVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 529

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
                  P    L       + +VLDLF+EV P W+  ++   +P+IW MLHNF G + M
Sbjct: 530 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 581

Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
               + +A   P     SE+  MVG+G++ E I  NP+ Y+L+ +MA+  ++++ + W  
Sbjct: 582 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTE 639

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
            Y  RRYG++   I DAWN++  T Y        K  D    +     SII+   G    
Sbjct: 640 DYIERRYGKTNKEILDAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 685

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
           +G        +KS  S++ H  + Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q
Sbjct: 686 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQ 736

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            LA  A E +  +  AY   +      +S +FLEL++  + +L+    FL+G W+E A+ 
Sbjct: 737 LLANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 796

Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           + ++ +   +  +E+NAR  +T W          L+DY N+ WSGL  DYY  R   +  
Sbjct: 797 MLKDADDWTKDLFEFNARALVTTWGSRNNANGGGLKDYSNRQWSGLTEDYYYARWEKWIN 856

Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
            +   L+ G     K    +W K+  DW N ++    +YP E++ + L
Sbjct: 857 GLQTELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 900


>gi|336404352|ref|ZP_08585050.1| hypothetical protein HMPREF0127_02363 [Bacteroides sp. 1_1_30]
 gi|335943680|gb|EGN05519.1| hypothetical protein HMPREF0127_02363 [Bacteroides sp. 1_1_30]
          Length = 718

 Score =  215 bits (548), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 150/531 (28%), Positives = 253/531 (47%), Gaps = 73/531 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           + PAF+G VP       P  +   +  W     +      Y+L    P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEGFVQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
             KE+G  ++ Y  D+F+E   P+D  +       ++  G  IY  + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQG 348

Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
           W F Y   FW    +KALL++VP  K++++DL  +  K +W+T +       FYG  +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408

Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
             + NF G   M G LD  A   V+A R +    ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
             + +D+  W+  Y   RYG    A+++AW +   T Y+                     
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYS--------------------- 507

Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
           S+ S     +Q     +S +  +     S D+          ++A+ L+ +  +EL +S 
Sbjct: 508 SLYSYPRFTWQTV---ISDQRRISKIDLSDDY----------LQAIRLYASCADELKSSE 554

Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
            YR DLI+     +A  A   +   ++    N      +  ++ ++L+ D+D LLA H  
Sbjct: 555 LYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPL 614

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           + L  W+E A+      +++  YE NA+  IT W         +  DY  ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667

Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
           Y PR  +YF        + D  ++++W  +WI  T+ W N    +  PVE+
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSNSTTPFDDPVEA 708


>gi|170292392|pdb|2VC9|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
           In Complex With 2-Acetamido-1,2-Dideoxynojirmycin
 gi|170292393|pdb|2VCA|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
           In Complex With Beta-N-Acetyl-D-Glucosamine
 gi|170292394|pdb|2VCB|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
           In Complex With Pugnac
 gi|170292395|pdb|2VCC|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
          Length = 891

 Score =  215 bits (548), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 150/527 (28%), Positives = 253/527 (48%), Gaps = 49/527 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VL  +SG VP   +     A+    G W      P    TY+ +     F ++   F E+
Sbjct: 381 VLQGYSGMVPRDFKEKNQEAQTISQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEK 439

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q + +G  ++ Y  D F E  NT  +D+      +   I + M   D+DAVW++Q W  +
Sbjct: 440 QKEVFGDVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 495

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
                  P    L       + +VLDLF+EV P W+  ++   +P+IW MLHNF G + M
Sbjct: 496 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 547

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
               + +A   +    + +  MVG+G++ E I  NP+ Y+L+ +MA+  ++++ + W   
Sbjct: 548 DAAPEKLA-TEIPKALANSEHMVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTED 606

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           Y  RRYG++   I +AWN++  T Y        K  D    +     SII+   G    +
Sbjct: 607 YIERRYGKTNKEILEAWNIILDTAYK-------KRNDY---YQGAAESIINARPG----F 652

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
           G        +KS  S++ H  + Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q 
Sbjct: 653 G--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQL 703

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
           LA  A E +  +  AY   +      +S +FLEL++  + +L+    FL+G W+E A+ +
Sbjct: 704 LANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTM 763

Query: 426 AQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
            ++ +   +  +E+NAR  +T W      +   L+DY N+ WSGL  DYY  R   +   
Sbjct: 764 LKDSDDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWING 823

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
           +   L+ G     K    +W K+  DW N ++    +YP E++ + L
Sbjct: 824 LQAELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 866


>gi|18309848|ref|NP_561782.1| alpha-N-acetylglucosaminidase [Clostridium perfringens str. 13]
 gi|18144526|dbj|BAB80572.1| probable alpha-N-acetylglucosaminidase [Clostridium perfringens
           str. 13]
 gi|288872041|dbj|BAI70446.1| alpha-N-acetylglucosaminidase [Clostridium perfringens]
          Length = 2104

 Score =  215 bits (547), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 154/528 (29%), Positives = 254/528 (48%), Gaps = 51/528 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VL  +SG VP   +   P A+    G W      P    TY+ +     F  +   F E+
Sbjct: 415 VLQGYSGMVPRDFKEKNPEAQTISQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEK 473

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q + +G  ++ Y  D F E  NT  +D+      +   I + M   D+DAVW++Q W  +
Sbjct: 474 QKEVFGDVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 529

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
                  P    L       + +VLDLF+EV P W+  ++   +P+IW MLHNF G + M
Sbjct: 530 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 581

Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
               + +A   P     SE+  MVG+G++ E I  NP+ ++L+ +MA+  ++++ + W  
Sbjct: 582 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAHELLFDMAWTRDQINFRTWTE 639

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
            Y  RRYG++   I DAWN++  T Y        K  D    +     SII+   G    
Sbjct: 640 DYIERRYGKTNKEILDAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 685

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
           +G        +KS  S++ H  + Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q
Sbjct: 686 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQ 736

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            LA  A E +  +  AY   +      +S +FLEL++  + +L+    FL+G W+E A+ 
Sbjct: 737 LLANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 796

Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           + ++ +   +  +E+NAR  +T W      +   L+DY N+ WSGL  DYY  R   +  
Sbjct: 797 MLKDSDDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWIN 856

Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
            +   L+ G     K    +W K+  DW N ++    +YP E++ + L
Sbjct: 857 GLQAELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 900


>gi|237721435|ref|ZP_04551916.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
 gi|293370838|ref|ZP_06617383.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
           3f]
 gi|229449231|gb|EEO55022.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
 gi|292634054|gb|EFF52598.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
           3f]
          Length = 711

 Score =  215 bits (547), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 141/480 (29%), Positives = 236/480 (49%), Gaps = 59/480 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAF 64
           V P ++G VP  +          +LG  + +    +WC       L   D  F      +
Sbjct: 233 VFPGYAGMVPRNIGE--------KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMY 282

Query: 65  IEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            E+  K YG+ ++ Y+ D F E  NT  VD    ++  GA+I + M+  + +AVW++Q W
Sbjct: 283 YEELEKLYGKANY-YSMDPFHEGGNTEGVD----LAKTGASIMAAMKKANPEAVWIIQAW 337

Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCM 175
                   +    + ++ S+  G L+VLDL++E +P       +W   K F    +++CM
Sbjct: 338 --------QANPREEMIASLNQGDLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCM 389

Query: 176 LHNFAGNIEMYGILDSIAFGPVEARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQH 234
           L NF GN+ ++G ++ +  G  +A    N  M+ GVG + EGIE NPV+++L+ E+ ++ 
Sbjct: 390 LLNFGGNVGLHGRMNQLVNGYYDACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWRE 449

Query: 235 EKVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPS 293
           E+     W+  Y   RYGR V P I +AW  L +TVYN      D   +  +       S
Sbjct: 450 ERFSSDEWLQTYLKARYGREVSPEIMEAWRALEYTVYNA---PKDYQGEGTIE------S 500

Query: 294 IISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 353
           ++    G + +              TS++ +  L+Y+     +A  LF +  ++   +N 
Sbjct: 501 LLCARPGFHLD-------------RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNN 547

Query: 354 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 413
           + YDL+D+ RQ+ A   N L   I ++Y   D     + +++FL+L+   D LL+    F
Sbjct: 548 FEYDLVDIVRQSNADKGNVLLEEISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEF 607

Query: 414 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 473
            +  WL +A+ L   EE+++ YEWNA   IT+W D+       L DY ++ WSGLL+D Y
Sbjct: 608 SVSSWLNAARSLGTTEEEKRLYEWNASALITVWGDSIAANQGGLHDYSHREWSGLLKDLY 667


>gi|237719039|ref|ZP_04549520.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
 gi|229451817|gb|EEO57608.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
          Length = 718

 Score =  215 bits (547), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 146/531 (27%), Positives = 254/531 (47%), Gaps = 73/531 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           + PAF+G VP       P  +   +  W     +      Y+L    P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEGFVQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
             KE+G  ++ Y  D+F+E   P+D  +       ++  G  IY  + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQG 348

Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
           W F Y   FW    +KALL++VP  K++++DL  +  K +W+T +       FYG  +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408

Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
             + NF G   M G LD  A   V+A R +    ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
             + +D+  W+  Y   RYG    A+++AW +   T Y+            + ++P    
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSS-----------LYSYPRFTW 517

Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
             +   + +       +SK  +                + + ++A+ L+++  +EL  S 
Sbjct: 518 QTVIPDQRR-------ISKIDL----------------SDDYLQAIRLYVSCADELKGSE 554

Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
            YR DLI+     +A  A   +   ++    N      +  ++ ++L+ D+D LLA H  
Sbjct: 555 LYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDKLLASHPL 614

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           + L  W+E A+      +++  YE NA+  IT W         +  DY  ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667

Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
           Y PR  +YF        + D  ++++W  +WI  T+ W N    +  PV++
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSNSTTPFDDPVKA 708


>gi|373460171|ref|ZP_09551927.1| hypothetical protein HMPREF9944_00191 [Prevotella maculosa OT 289]
 gi|371956556|gb|EHO74342.1| hypothetical protein HMPREF9944_00191 [Prevotella maculosa OT 289]
          Length = 742

 Score =  214 bits (546), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 142/492 (28%), Positives = 230/492 (46%), Gaps = 52/492 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP F G +P   +       +T  G W              L  TD  F  I   +  +
Sbjct: 223 VLPGFCGMMPHDAKEKL-GLNVTDGGKWNGYTRPAN------LSPTDSQFNRIADLYYAE 275

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             + YG+ ++ Y+ D F E+    D+ +Y    G+ +   M+  +  A W++QGW  + +
Sbjct: 276 LTRLYGKANY-YSMDPFHESNDD-DALDY-GKAGSVMLEAMKRINPKATWVIQGW--TEN 330

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAG 181
           P  RP  ++ + N    G L++LDLF+E +P      +W   K +    +++CML NF  
Sbjct: 331 P--RPRMIQDMKN----GDLLILDLFSECRPMFGIPSVWKREKGYEQHDWLFCMLENFGA 384

Query: 182 NIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDV 239
           N+ ++G +D +   F   + R+     + G+G +MEG E NPV+++LMSE+ ++ E    
Sbjct: 385 NVGLHGRMDQLIHNFYSTKKRSPNTQHLKGIGFTMEGSENNPVMFELMSELPWRPEIFKK 444

Query: 240 KAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISV 297
           + W+  Y   RYGR    I+ AW +L  T+YNC  G   +  +  V    P ++   +  
Sbjct: 445 EDWVRGYVKARYGRKDETIERAWLLLAETIYNCPAGNNQQGPHESVFCGRPGLNNFQVK- 503

Query: 298 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 357
           +  K +NY                       Y     + A  L  +  +    +N + YD
Sbjct: 504 SWSKMRNY-----------------------YDPQATLEAARLMASVSSRYKGNNNFEYD 540

Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
           LID+ RQALA      +L  I  Y         + ++RFL+++   D LL     F LG 
Sbjct: 541 LIDICRQALADQGRLQYLKTIADYNGFSRAAFAKDAKRFLDMILLQDRLLGTRKEFRLGH 600

Query: 418 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 477
           W E+A+ L   + ++  YEWNAR QIT W + T  +   LRDY +K W G+L+D+Y  R 
Sbjct: 601 WTEAARSLGTTQAEKDLYEWNARVQITTWGNRTCADNGGLRDYAHKEWQGILKDFYYKRW 660

Query: 478 AIYFKYMIESLE 489
            IY   + + +E
Sbjct: 661 KIYMDALAKQME 672


>gi|153807690|ref|ZP_01960358.1| hypothetical protein BACCAC_01972 [Bacteroides caccae ATCC 43185]
 gi|149130052|gb|EDM21264.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
           43185]
          Length = 707

 Score =  214 bits (546), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 144/503 (28%), Positives = 240/503 (47%), Gaps = 73/503 (14%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAF 64
           V P ++G VP  +          +LG  + +    +WC       L + D  F      +
Sbjct: 233 VFPGYAGMVPRNIGE--------KLG--YQIADPGKWCGFPRPAFLSSEDEHFDSFAAMY 282

Query: 65  IEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            E+  K YG+  + Y+ D F E  NT  VD    ++  G +I   M+  + +AVW++Q W
Sbjct: 283 YEELEKLYGKAKY-YSMDPFHEGGNTEGVD----LAKAGTSIMKAMKKANPEAVWVIQAW 337

Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCM 175
             +  P        A+++ +  G ++VLDL++E  P       +W   K F    +++CM
Sbjct: 338 QANPRP--------AMVDVLNAGDMLVLDLYSERLPQWGDPDSMWYREKGFGKHDWLYCM 389

Query: 176 LHNFAGNIEMYGILDSIAFGPVEARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 234
           L NF GN+ ++G ++ +  G  +A T  N  T+ GVG + EGIE NPV+++L+ E+ ++ 
Sbjct: 390 LLNFGGNVGLHGRMNQLVNGYYDACTHANGKTLRGVGTTPEGIENNPVMFELLYELPWRA 449

Query: 235 EKVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPS 293
           E+     W+  Y   RYG  + P + +AW  L HTVYN                      
Sbjct: 450 ERFSPDTWLQGYLKARYGGELSPEVMEAWRALEHTVYNAP-------------------- 489

Query: 294 IISVTEGKYQNYGKPVSKEAVLKS-------ETSSYDHPHLWYSTSEVIRALELFIASGN 346
                    +NY    + E++L +        TS++ +  L+YS     +A +L ++   
Sbjct: 490 ---------KNYQGEGTVESLLCARPGFHLDRTSTWGYSKLFYSPDSTSKAADLMLSVAE 540

Query: 347 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGL 406
           +   +N + YDL+D+ RQ+ A   N L   I ++Y   D     + +++FLEL+   D L
Sbjct: 541 QYKGNNNFEYDLVDIVRQSNADKGNALLDEISQSYDRKDKENFRKQTQQFLELILSQDSL 600

Query: 407 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 466
           L+    F +  WL +A+ L   + ++K YEWNA   IT+W D+       L DY ++ WS
Sbjct: 601 LSTRKEFSVSSWLTAARSLGNTDAEKKLYEWNASALITVWGDSIASNQGGLHDYSHREWS 660

Query: 467 GLLRDYYGPRAAIYFKYMIESLE 489
           GLL+D Y  R   +F+   + LE
Sbjct: 661 GLLKDLYYLRWKTFFEQKQQELE 683


>gi|299144719|ref|ZP_07037787.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
 gi|298515210|gb|EFI39091.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
          Length = 718

 Score =  214 bits (546), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 147/531 (27%), Positives = 254/531 (47%), Gaps = 73/531 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           + PAF+G VP       P  +   +  W     +      Y+L    P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEGFVQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
             KE+G  ++ Y  D+F+E   P+D  +       ++  G  IY  + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKADKEAKYKLLAEYGETIYKSITAGNPDAVWVTQG 348

Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
           W F Y   FW    +KALL++VP  K++++DL  +  K +W+T +       FYG  +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408

Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
             + NF G   M G LD  A   V+A R +    ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
             + +D+  W+  Y   RYG    A+++AW +   T Y+            + ++P    
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSS-----------LYSYPRFTW 517

Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
             +   + +       +SK  +                + + ++A+ L+ +  +EL +S 
Sbjct: 518 QTVVPDQRR-------ISKIDL----------------SDDYLQAIRLYASCADELKSSE 554

Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
            YR DLI+     +A  A   +   ++    N      +  ++ ++L+ D+D LLA H  
Sbjct: 555 LYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPL 614

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           + L  W+E A+      +++  YE NA+  IT W         +  DY  ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667

Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
           Y PR  +YF        + D  ++++W  +WI  T+ W N    +  PVE+
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSNSTTPFDDPVEA 708


>gi|298480124|ref|ZP_06998323.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
 gi|298273933|gb|EFI15495.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
          Length = 718

 Score =  214 bits (546), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 147/531 (27%), Positives = 254/531 (47%), Gaps = 73/531 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           + PAF+G VP       P  +   +  W     +      Y+L    P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEGFVQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
             KE+G  ++ Y  D+F+E   P+D  +       ++  G  IY  + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLAGYGETIYKSIAAGNPDAVWVTQG 348

Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
           W F Y   FW    +KALL++VP  K++++DL  +  K +W+T +       FYG  +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408

Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
             + NF G   M G LD  A   V+A R +    ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
             + +D+  W+  Y   RYG    A+++AW +   T Y+            + ++P    
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSS-----------LYSYPRFTW 517

Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
             +   + +       +SK  +                + + ++A+ L+ +  +EL +S 
Sbjct: 518 QTVIPDQRR-------ISKIDL----------------SDDYLQAIRLYASCADELKSSE 554

Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
            YR DLI+     +A  A   +   ++    N      +  ++ ++L+ D+D LLA H  
Sbjct: 555 LYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPL 614

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           + L  W+E A+      +++  YE NA+  IT W         +  DY  ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667

Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
           Y PR  +YF        + D  ++++W  +WI  T+ W N    +  PVE+
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSNSTTPFDDPVEA 708


>gi|198277542|ref|ZP_03210073.1| hypothetical protein BACPLE_03764 [Bacteroides plebeius DSM 17135]
 gi|198270040|gb|EDY94310.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides plebeius DSM
           17135]
          Length = 722

 Score =  214 bits (545), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 152/547 (27%), Positives = 255/547 (46%), Gaps = 76/547 (13%)

Query: 8   VLPAFSGNVPA-ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
           V P +SG VP  A + +  +     L N F+  +         L  TD  F EI   + +
Sbjct: 228 VFPGYSGMVPHDANKKLGLNVTEPALWNGFTRPA--------FLLPTDSRFNEIASLYYK 279

Query: 67  QQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 124
           +  K +G+ ++ Y+ D F E  +   VD      + G A+   M++ +  A W++QGW  
Sbjct: 280 ELEKLFGKANY-YSMDPFHELEDAGSVD----FDAAGKAVLKAMKNVNPKATWVIQGW-- 332

Query: 125 SYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW----------- 173
           + +P  RP  +K L N    G +++LDLF+E +P+W       G+P IW           
Sbjct: 333 TENP--RPEMIKNLNN----GDILILDLFSECRPMW-------GIPSIWKREKGYEQHDW 379

Query: 174 --CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEM 230
             CM+ NF GN+ ++G +D +       + +     + G+G++MEG E NPV+++LM E+
Sbjct: 380 LFCMIENFGGNVGLHGRMDQLLNNFYLTKNNPLAAHLKGIGLTMEGSENNPVMFELMCEL 439

Query: 231 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFP 288
            ++ EK   + W+  Y   RYG     I  AW++L   +YNC  G   +  +  +    P
Sbjct: 440 PWRPEKFTKEEWLKDYLFARYGVRDEKITQAWSILADGIYNCPFGNNQQGPHESIFCGRP 499

Query: 289 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 348
            ++ +  + +  K QNY  P S EA                       A  L +   ++ 
Sbjct: 500 GLN-NFQASSWSKMQNYYDPTSTEA-----------------------AARLMLEVADKY 535

Query: 349 SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLA 408
             +N + YDL+D+ RQ+L+     ++   I  ++  D       S+ FL ++   D LL 
Sbjct: 536 KGNNNFEYDLVDIVRQSLSDRGRIVYNQTIADFKSFDKKSFATHSQEFLNILLAQDRLLG 595

Query: 409 CHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGL 468
               F +G W+E A+ L    E++  YEWNAR QIT W +        LRDY +K W+GL
Sbjct: 596 TRSEFRVGRWIEQARNLGTTPEEKDLYEWNARVQITTWGNRVCANDGGLRDYAHKEWNGL 655

Query: 469 LRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALIT 528
           L+D+Y  R A Y++ + + L+      L     ++  +   W    N Y  +  GD +  
Sbjct: 656 LKDFYYKRWAAYWQTLQDVLDGKPMVEL-----DYYAMEEPWTLAHNPYASQPEGDCVSV 710

Query: 529 SQWLYNK 535
           ++ ++NK
Sbjct: 711 AKEVFNK 717


>gi|322699924|gb|EFY91682.1| alpha-N-acetylglucosaminidase, putative [Metarhizium acridum CQMa
           102]
          Length = 775

 Score =  214 bits (545), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 151/531 (28%), Positives = 260/531 (48%), Gaps = 51/531 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF G VP +++ V P   +T   NWF+   D ++     LD  D  + E+ R F+ +
Sbjct: 246 VLPAFPGFVPESIKKVRPDVNLTVSPNWFAPAPD-KYTRDLFLDPLDDTYAELQRLFVSK 304

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSY 126
           Q+  +G  ++IY  D F+E +P      Y+  +    Y+G+ + +  AVWL+QGWL FS 
Sbjct: 305 QMDAFGNVTNIYTLDQFNELSPASGDTAYLRGIARNTYAGLTAANPAAVWLLQGWLFFSS 364

Query: 127 DPFWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             FW  P++ A L  V   + ++VLDL++E  P W  +  + G P+IWC LH+F GN+ +
Sbjct: 365 RRFWTQPRIDAYLGGVEDDQGMLVLDLYSEANPQWQRTNSYSGKPWIWCQLHDFGGNMAL 424

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
            G + ++   P++A  +++ ++VG G++ E  E N VVYD++ + A+    +D + +   
Sbjct: 425 EGRVQTLTSAPIDA-LAQSESLVGFGLTPEAYEGNEVVYDILLDQAWSATPLDTQTYFAS 483

Query: 246 YSVRRYG--RSVPA-IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
           +  +RY    S+P+ +  AW +L   VY+ T   TD  + V VA   + P++  +     
Sbjct: 484 WVTKRYAGVSSIPSELYRAWEMLRTDVYSNTR--TDIPQ-VPVATYQLRPALSGIA---- 536

Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIR-----ALELFIASGNELSASNTYRYD 357
                         + T  + HP   +    V++      LE     G+ L     ++ D
Sbjct: 537 --------------NRTGHFPHPTALHYDPLVLQEAWKLMLEAMTRQGS-LWKVPAFQLD 581

Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL------------SRRFLELVEDMDG 405
            +D++RQ L+   + L+ +++ AY+ + A G  +L              R L L+  +D 
Sbjct: 582 FVDVSRQMLSNQFDVLYADLVNAYKCSAAGGSRELRSSAPSCDVEAAGARLLSLLSTLDL 641

Query: 406 LLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYW 465
            L     F L  W+++A    +    E  + +NAR+Q+T+W    Q +A+ L DY  K W
Sbjct: 642 TLLTSRHFTLQSWVDAAGSWGKAAGNEDLFTFNARSQVTVW----QVDATNLNDYAAKAW 697

Query: 466 SGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNV 516
            GL+  YY  R +I+   ++ +  SG         RE      +WQ G + 
Sbjct: 698 GGLVGSYYKGRWSIFVDALVAASNSGS-LDEGALTRELEAFEAEWQAGEHA 747


>gi|423293381|ref|ZP_17271508.1| hypothetical protein HMPREF1070_00173 [Bacteroides ovatus
           CL03T12C18]
 gi|392678324|gb|EIY71732.1| hypothetical protein HMPREF1070_00173 [Bacteroides ovatus
           CL03T12C18]
          Length = 718

 Score =  214 bits (544), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 147/531 (27%), Positives = 254/531 (47%), Gaps = 73/531 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           + PAF+G VP       P  +   +  W     +      Y+L    P F EIG+ F+E+
Sbjct: 234 ITPAFAGFVPEGFVQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
             KE+G  ++ Y  D+F+E   P+D  +       ++  G  IY  + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQG 348

Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
           W F Y   FW    +KALL++VP  K++++DL  +  K +W+T +       FYG  +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408

Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
             + NF G   M G LD  A   V+A R +    ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
             + +D+  W+  Y   RYG    A+++AW +   T Y+            + ++P    
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSS-----------LYSYPRFTW 517

Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
             +   + +       +SK  +                + + ++A+ L+ +  +EL +S 
Sbjct: 518 QTVIPDQRR-------ISKIDL----------------SDDYLQAIRLYASCADELKSSE 554

Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
            YR DLI+     +A  A   +   ++    N      +  ++ ++L+ D+D LLA H  
Sbjct: 555 LYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPL 614

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           + L  W+E A+      +++  YE NA+  IT W         +  DY  ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667

Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
           Y PR  +YF        + D  ++++W  +WI  T+ W N    +  PVE+
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWINSTTPFDDPVEA 708


>gi|383115207|ref|ZP_09935965.1| hypothetical protein BSGG_2911 [Bacteroides sp. D2]
 gi|313695376|gb|EFS32211.1| hypothetical protein BSGG_2911 [Bacteroides sp. D2]
          Length = 718

 Score =  214 bits (544), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 146/531 (27%), Positives = 253/531 (47%), Gaps = 73/531 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           + PAF+G VP       P  +   +  W     +      Y+L    P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEGFVQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
             KE+G  ++ Y  D+F+E   P+D  +       ++  G  IY  + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQG 348

Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
           W F Y   FW    +KALL++VP  K++++DL  +  K +W+T +       FYG  +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408

Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
             + NF G   M G LD  A   V+A R +    ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
             + +D+  W+  Y   RYG    A+++AW +   T Y+            + ++P    
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSS-----------LYSYPRFTW 517

Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
             +   + +       +SK  +                + + ++A+ L+ +  +EL  S 
Sbjct: 518 QTVVPDQRR-------ISKIDL----------------SDDYLQAIRLYASCADELKGSE 554

Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
            YR DLI+     +A  A   +   ++    N      +  ++ ++L+ D+D LLA H  
Sbjct: 555 LYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPL 614

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           + L  W+E A+      +++  YE NA+  IT W         +  DY  ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667

Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
           Y PR  +YF        + D  ++++W  +WI  T+ W N    +  PV++
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSNSTTPFDDPVKA 708


>gi|194695302|gb|ACF81735.1| unknown [Zea mays]
          Length = 173

 Score =  214 bits (544), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 100/152 (65%), Positives = 119/152 (78%)

Query: 392 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 451
           L + FL LV D+D LL+ H+GFLLGPWLESAK LA+N EQE QYEWNARTQITMWFDNT+
Sbjct: 7   LCQHFLSLVNDLDTLLSSHEGFLLGPWLESAKGLARNSEQEIQYEWNARTQITMWFDNTE 66

Query: 452 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQ 511
            +ASLLRDY NKYWSGLL+DYYGPRAAIYFK+++ S+E+   F LK+WRREWI LTN+WQ
Sbjct: 67  TKASLLRDYANKYWSGLLQDYYGPRAAIYFKHLLLSMENNAPFALKEWRREWISLTNNWQ 126

Query: 512 NGRNVYPVESNGDALITSQWLYNKYLQGTGVF 543
           + R V+   + GD L  SQ LY KYL    + 
Sbjct: 127 SDRKVFSTTATGDPLNISQSLYTKYLSNADLL 158


>gi|423212382|ref|ZP_17198911.1| hypothetical protein HMPREF1074_00443 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392694828|gb|EIY88054.1| hypothetical protein HMPREF1074_00443 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 705

 Score =  213 bits (542), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 145/490 (29%), Positives = 233/490 (47%), Gaps = 61/490 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V P ++G VP  +       +I   G W S    PR      L   D  F      + E+
Sbjct: 233 VFPGYAGMVPRNIGEKL-GYQIADPGKWCSF---PR---PAFLSTEDEHFESFAAMYYEE 285

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
             K YG+ ++ Y+ D F E  NT  VD    ++  GA+I + M+  +  AVW++Q W   
Sbjct: 286 LEKLYGKANY-YSMDPFHEGGNTEGVD----LAKTGASIMAAMKKANPKAVWVIQAW--- 337

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS-------TSKQFYGVPYIWCMLHN 178
                +    + +++S+  G ++VLDL++E  P W          K F    +++CML N
Sbjct: 338 -----QANPREEMISSLNQGDMLVLDLYSERLPQWGDPDSKWYREKGFGKHDWLYCMLLN 392

Query: 179 FAGNIEMYGILDSIAFGPVEARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 237
           F  N+ ++G +D +  G  +A    N  T+ GVG + EGIE NPV+++L+ E+ ++ E+ 
Sbjct: 393 FGANVGLHGRMDLLVNGYYDACAHANGKTLRGVGATPEGIENNPVMFELLYELPWREERF 452

Query: 238 DVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 296
               W+  Y   RYG+ V P + +AW  L HTVYN         RD              
Sbjct: 453 SPDEWLQGYLKARYGKDVSPEVMEAWRALEHTVYNAP-------RD-------------- 491

Query: 297 VTEGKYQNYGKPVS----KEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
                YQ  G   S    +       TS++ +  L+YS     +A  L  +   +   SN
Sbjct: 492 -----YQGEGTVESLLCARPGFHLDRTSTWGYAKLFYSPDSTAKAARLLTSVAKQYEGSN 546

Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
            + YDL+D+ RQ+ A   N L  +I ++Y   D     + +++FL+L+   D LL+    
Sbjct: 547 NFEYDLVDIVRQSNADKGNVLLEDISQSYDRKDKENFRKQTQQFLDLIVSQDSLLSTRKE 606

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           F +  WL++A+ L   + ++K YEWNA   IT+W D+       L DY ++ WSG+L+D 
Sbjct: 607 FSVSTWLDAARSLGTTDAEKKLYEWNASALITVWGDSIASNQGGLHDYSHREWSGILKDL 666

Query: 473 YGPRAAIYFK 482
           Y  R   +F+
Sbjct: 667 YYQRWKAFFE 676


>gi|383280354|pdb|4A4A|A Chain A, Cpgh89 (E483q, E601q), From Clostridium Perfringens, In
           Complex With Its Substrate Glcnac-Alpha-1,4-Galactose
          Length = 914

 Score =  213 bits (541), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 148/527 (28%), Positives = 253/527 (48%), Gaps = 49/527 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VL  +SG VP   +     A+    G W      P    TY+ +     F ++   F E+
Sbjct: 404 VLQGYSGMVPRDFKEKNQEAQTISQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEK 462

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q + +G  ++ Y  D F +  NT  +D+      +   I + M   D+DAVW++Q W  +
Sbjct: 463 QKEVFGDVTNFYGVDPFHQGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 518

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
                  P    L       + +VLDLF+EV P W+  ++   +P+IW MLHNF G + M
Sbjct: 519 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 570

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
               + +A   +    + +  MVG+G++ + I  NP+ Y+L+ +MA+  ++++ + W   
Sbjct: 571 DAAPEKLA-TEIPKALANSEHMVGIGITPQAINTNPLAYELLFDMAWTRDQINFRTWTED 629

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           Y  RRYG++   I +AWN++  T Y        K  D    +     SII+   G    +
Sbjct: 630 YIERRYGKTNKEILEAWNIILDTAYK-------KRNDY---YQGAAESIINARPG----F 675

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
           G        +KS  S++ H  + Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q 
Sbjct: 676 G--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQL 726

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
           LA  A E +  +  AY   +      +S +FLEL++  + +L+    FL+G W+E A+ +
Sbjct: 727 LANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTM 786

Query: 426 AQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
            ++ +   +  +E+NAR  +T W      +   L+DY N+ WSGL  DYY  R   +   
Sbjct: 787 LKDSDDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWING 846

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
           +   L+ G     K    +W K+  DW N ++    +YP E++ + L
Sbjct: 847 LQAELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 889


>gi|260910505|ref|ZP_05917173.1| periplasmic beta-glucosidase [Prevotella sp. oral taxon 472 str.
           F0295]
 gi|260635347|gb|EEX53369.1| periplasmic beta-glucosidase [Prevotella sp. oral taxon 472 str.
           F0295]
          Length = 1566

 Score =  212 bits (540), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 149/522 (28%), Positives = 249/522 (47%), Gaps = 50/522 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V+  F G VP   +  FP A+I + G W S +         LL   D +F ++  A+ + 
Sbjct: 236 VVQGFPGIVPTFFKERFPQARIIEQGKWGSFQRP-----AVLLPNNDGVFEKVAEAYYQS 290

Query: 68  QLKEYGRTSHIYNCDTFDEN--TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
             K +G        D F E   T  VD    + S+ A +   M      A W++QGW  +
Sbjct: 291 LTKLFGTDFEFLGGDLFHEGGITTGVD----VGSVAAQVQRQMLRFFPRAKWVLQGW--N 344

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
            +P    PQ+  +L+       ++++L  E+   W +S +F G P++W  +++F G  +M
Sbjct: 345 KNP---SPQLLRVLDK---RHTLLVNLSGEIAASWESSDEFGGTPWLWGSVNHFGGKTDM 398

Query: 186 YGILDSIAFGPVEART-SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
            G L  I   P  A   + ++ M G+G+  EGI  NPVVYDL  + A+     DV + + 
Sbjct: 399 GGQLPVIVTEPHRALALTVDSVMQGIGILPEGIGTNPVVYDLALKTAWHTATPDVDSMLV 458

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           QY   RYG   P +  AW ++  +VY                            EG +++
Sbjct: 459 QYLGYRYGEVHPDLLAAWRIMLKSVYG---------------------EFAIKGEGTFES 497

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                ++ ++  +  S++    + Y  +++ RAL LF+ +  +L  S TY+YDL+DL RQ
Sbjct: 498 VF--CARPSLRVTSVSTWGPKQMQYQPADLYRALGLFLKAAPKLRDSETYQYDLVDLARQ 555

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
           +LA YA   + ++++AY+  +A  + Q ++RF  L+   D LL  +  FLLG WL+ A Q
Sbjct: 556 SLANYARTAYADVVKAYEAKNAEQLQQATQRFERLIVLQDSLLLTNRHFLLGNWLQQATQ 615

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
            A NE   +    NA+T I+ W     E  + + DY NK W+G+L  YY PR   +F+ +
Sbjct: 616 YAPNEADRQLCLHNAQTLISYW--GPDEPTTKVHDYANKEWAGMLSTYYLPRWQAFFRVL 673

Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL 526
             S+ +G+   +     ++ ++   W N       +  GDA+
Sbjct: 674 QASINTGNPPAI-----DFFEMEKRWANTPQPINTKPQGDAV 710


>gi|110800516|ref|YP_695309.1| alpha-N-acetylglucosaminidase [Clostridium perfringens ATCC 13124]
 gi|110675163|gb|ABG84150.1| alpha-N-acetylglucosaminidase family protein [Clostridium
           perfringens ATCC 13124]
          Length = 2095

 Score =  212 bits (540), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 153/528 (28%), Positives = 254/528 (48%), Gaps = 51/528 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VL  +SG VP   +     A+    G W      P    TY+ +     F ++   F E+
Sbjct: 406 VLQGYSGMVPRDFKEKNQEAQTISQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEK 464

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q + +G  ++ Y  D F E  NT  +D+      +   I + M   D+DAVW++Q W  +
Sbjct: 465 QKEVFGDVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 520

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
                  P    L       + +VLDLF+EV P W+  ++   +P+IW MLHNF G + M
Sbjct: 521 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 572

Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
               + +A   P     SE+  MVG+G++ E I  NP+ Y+L+ +MA+  ++++ + W  
Sbjct: 573 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTE 630

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
            Y  RRYG++   I +AWN++  T Y        K  D    +     SII+   G    
Sbjct: 631 DYIERRYGKTNKEILEAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 676

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
           +G        +KS  S++ H  + Y  SE  +A+E+F  + +E   S+ + YD  D+ +Q
Sbjct: 677 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQ 727

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            LA  A E +  +  AY   +      +S +FLEL++  + +L+    FL+G W+E A+ 
Sbjct: 728 LLANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 787

Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           + ++ +   +  +E+NAR  +T W      +   L+DY N+ WSGL  DYY  R   +  
Sbjct: 788 MLKDSDDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWIN 847

Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
            +   L+ G     K    +W K+  DW N ++    +YP E++ + L
Sbjct: 848 GLQAELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 891


>gi|429740221|ref|ZP_19273923.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
 gi|429153946|gb|EKX96707.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
          Length = 721

 Score =  212 bits (540), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 151/520 (29%), Positives = 243/520 (46%), Gaps = 70/520 (13%)

Query: 2   LLADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIG 61
           LL    + PAF+G VP     + P  ++      F      +    Y+L    P F++IG
Sbjct: 228 LLGMHPITPAFAGFVPEGFVKLHPEVRVKH----FEWGGFDKSLNAYMLPPDSPYFLQIG 283

Query: 62  RAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPE-------YISSLGAAIYSGMQSGDSD 114
           + FIE+  KE+ + ++ Y  D+F+E   PV SP+        +S  G AIY  + +G+ +
Sbjct: 284 KLFIEEWEKEFSKNTY-YLSDSFNEMELPV-SPDDTDGKHRLLSKYGEAIYQSIVAGNPN 341

Query: 115 AVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAE-------VKPIWSTSKQF 166
           AVW+ QGW F Y   FW    ++ALL  VP  KL+++DL  +        +  W T K F
Sbjct: 342 AVWITQGWTFGYQHRFWDKESLQALLERVPNDKLIIVDLANDYPKWVWKTEQTWKTHKGF 401

Query: 167 YGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS-ENTTMVGVGMSMEGIEQNPVVYD 225
           YG  +I   + NF G   + G L+  A    EA    +   ++G G + EG+E N VVY+
Sbjct: 402 YGKRWILSYVPNFGGKTLLTGDLNLYASCSAEALAHPDKGRLIGFGSAPEGLENNEVVYE 461

Query: 226 LMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIV 285
           L+++M +Q++ +D+  W+ +Y   RYG    A+Q AW  L  +VY+              
Sbjct: 462 LLADMGWQNQPIDLDHWLIEYCRSRYGSCPNAMQKAWKGLCRSVYS-------------- 507

Query: 286 AFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASG 345
                  S+ S     +Q     V  + + KS+   YD    ++      RA+E F+   
Sbjct: 508 -------SLYSYPRFTWQT----VIPDTLRKSK---YDFNDTYF------RAVEDFLLCA 547

Query: 346 NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDG 405
            +L  S  YR D +    Q +   A+ L+   ++A  + +     QL  + ++L+   D 
Sbjct: 548 PQLKDSPLYRSDALLFAAQYIGAKADNLYRKALQAKAVGNRARAKQLVDKVIQLLLQADK 607

Query: 406 LLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYW 465
           LLA H    L  W+++A+  A   ++  QYE +A+  IT W         + +DY  +YW
Sbjct: 608 LLASHPTDRLSRWVDAARTAAATPQERMQYEMDAKRLITSW-------GGIQQDYAARYW 660

Query: 466 SGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 505
           SGL++ YY PR  +YF              L +W   W+K
Sbjct: 661 SGLIKTYYVPRIKLYF-------AGSKKKELNNWEENWLK 693


>gi|326934230|ref|XP_003213195.1| PREDICTED: hypothetical protein LOC100549752 [Meleagris gallopavo]
          Length = 650

 Score =  211 bits (538), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 97/202 (48%), Positives = 137/202 (67%), Gaps = 4/202 (1%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP  +  VFP    T+LGNW     D    C YLL   +P+F  IG  F+++
Sbjct: 9   VLPAFAGHVPPGVLRVFPRINATRLGNWSHF--DCTLSCAYLLSPEEPMFQVIGTLFLKE 66

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            +KE+G T HIY+ DTF+E +P    P Y++ +  A++  M   D +A WLMQGWLF + 
Sbjct: 67  LIKEFG-TDHIYSADTFNEMSPLSSDPAYLAGITNAVFRAMTGADPEAQWLMQGWLFQHQ 125

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P FW+PPQ++A+L +VPLG+++VLDLFAE KP++  ++ FYG P+IWCMLHNF GN  ++
Sbjct: 126 PAFWQPPQVQAVLRAVPLGRMIVLDLFAESKPVYEWTESFYGQPFIWCMLHNFGGNHGLF 185

Query: 187 GILDSIAFGPVEARTSENTTMV 208
           G +++I  GP  AR   N+TMV
Sbjct: 186 GAVEAINRGPFVARRFPNSTMV 207


>gi|336412611|ref|ZP_08592964.1| hypothetical protein HMPREF1017_00072 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942657|gb|EGN04499.1| hypothetical protein HMPREF1017_00072 [Bacteroides ovatus
           3_8_47FAA]
          Length = 718

 Score =  211 bits (536), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 146/531 (27%), Positives = 253/531 (47%), Gaps = 73/531 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           + PAF+G VP       P  +   +  W     +      Y+L    P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEGFVQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
             KE+G  ++ Y  D+F+E   P+D  +       ++  G  IY  + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQG 348

Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
           W F Y   FW    +KALL++VP  K++++DL  +  K +W+T +       FYG  +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408

Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
             + NF G   M G LD  A   V+A R +    ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
             + +D+  W+  Y   RYG    A+++AW +   T Y+            + ++P    
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSS-----------LYSYPRFTW 517

Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
             +   + +       +SK  +                + + ++A+ L+ +  +EL  S 
Sbjct: 518 QTVIPDQRR-------ISKIDL----------------SDDYLQAIRLYASCADELKNSE 554

Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
            YR DLI+     +A  A   +   ++    N      +  ++ ++L+ D+D LLA H  
Sbjct: 555 LYRNDLIEFVSYYVAAKAEIFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPL 614

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           + L  W+E A+      +++  YE NA+  IT W         +  DY  ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667

Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
           Y PR  +YF        + D  ++++W  +WI  T+ W N    +  PV++
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSNSTTPFDDPVKA 708


>gi|224026593|ref|ZP_03644959.1| hypothetical protein BACCOPRO_03350 [Bacteroides coprophilus DSM
           18228]
 gi|224019829|gb|EEF77827.1| hypothetical protein BACCOPRO_03350 [Bacteroides coprophilus DSM
           18228]
          Length = 635

 Score =  211 bits (536), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 141/436 (32%), Positives = 204/436 (46%), Gaps = 45/436 (10%)

Query: 11  AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK 70
            FSG VP  L+  +P AKI    +W   K   +      LD TD LF  IGR F+E++ K
Sbjct: 236 GFSGYVPRELKEKYPEAKIQPQPSWCGFKGAAQ------LDPTDSLFQVIGRDFLEEEKK 289

Query: 71  EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW 130
            +G    +Y  D F E+ PPVD+PEY+S++G +I++  Q  D  ++W MQ W        
Sbjct: 290 LFG-AHGVYAADPFHESRPPVDTPEYLSAVGRSIHTLFQEFDPYSLWAMQAWSL------ 342

Query: 131 RPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 190
           R P +KA    VP   L++LDL        +     +G P +   LHNF G I M+G L 
Sbjct: 343 REPIVKA----VPEEHLLILDLNGSK---CTQRNACWGYPVVAGNLHNFGGRINMHGDLP 395

Query: 191 SIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRR 250
            +A    EA  S +  + G G+ MEGIEQNP+ Y+L  EM  Q  KV++  W+ +Y++RR
Sbjct: 396 LLAGNQYEAAVSLSPNVCGSGLFMEGIEQNPLYYELAFEMPLQKGKVELDGWLKEYALRR 455

Query: 251 YGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS 310
           YG        A  +L    Y      T+ +  +I A     P++     G     G P  
Sbjct: 456 YGSKWENTHKALLLLLEGPYRPGTNGTELS-SIIAA----RPALHVKKSGPNAGLGIP-- 508

Query: 311 KEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYA 370
                             YS   +I A    +     L  S  YR+D++DL RQ +    
Sbjct: 509 ------------------YSPWLLIEAQAFMLKDAGILKTSEAYRFDIMDLQRQIMTNLG 550

Query: 371 NELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 430
             +     +A++  D  G    SRR+LEL+ D+D LL     F    WL  A+     EE
Sbjct: 551 QAIHKEAAKAFEAGDEKGFELHSRRYLELLTDVDTLLRTRPEFNFDRWLADARSWGDTEE 610

Query: 431 QEKQYEWNARTQITMW 446
           ++ Q+E NA   +T+W
Sbjct: 611 EKNQFERNATALVTIW 626


>gi|126347839|emb|CAJ89559.1| putative alpha-N-acetylglucosaminidase [Streptomyces ambofaciens
           ATCC 23877]
          Length = 740

 Score =  210 bits (535), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 140/519 (26%), Positives = 228/519 (43%), Gaps = 44/519 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP + G VP       P   +   G+W   +  P W     LD    +F  +  AF   
Sbjct: 250 VLPGYYGTVPPGFTERNPVGPVVPQGDWVGFER-PDW-----LDPRSAVFPRVAAAFYRH 303

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + +G TS +Y  D   E   P + P  +     A+   +Q+    AVW + GW  +  
Sbjct: 304 QRELFG-TSTMYKMDLLHEGGRPGNVP--VRDAAQAVMKALQTARPGAVWTLIGWQNN-- 358

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
                P  + +++++   +L+++D  ++          ++G PY +  + NF G+  M  
Sbjct: 359 -----PSTQ-IIDAIDKRRLLIVDGLSDRYDGLDREATWHGAPYAFGTIPNFGGHTTMGA 412

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
                A    + RT   + + G+    EG   NPV Y+L +E+A++ E VD + W  +Y+
Sbjct: 413 NTAVWAERFDQWRTKAGSALAGIAYMPEGTGGNPVAYELFTELAWRTEPVDQRKWFAEYA 472

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
            RRYG + P    AW +L    Y+   G   +++D +                       
Sbjct: 473 QRRYGGADPHAASAWELLRSGPYSTPSGTWSESQDSLF---------------------- 510

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
             ++  +  +  +S+    + Y    V RAL   +     L A++ YR+DL+D+ RQ LA
Sbjct: 511 -TARPRLTATNAASWSPGAMRYDPGTVRRALTELVRVAPALRATDAYRFDLVDVARQVLA 569

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
             +  L   I  AY   D       +  +   +  +D LLA    FLLGPWLE AK   +
Sbjct: 570 NRSRTLLPQIKAAYDAEDLPRFRARAAEWKNCLSLLDRLLATDARFLLGPWLEDAKSWGR 629

Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
            E +    E++AR+ +T W   +  +A  LRDY N+ WSGL+ D+Y  R   Y   +  +
Sbjct: 630 TEAERAAAEFDARSILTTWGHRSGSDAGGLRDYANREWSGLVSDFYAMRWTKYLDSLDTA 689

Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL 526
           L +G      D    W  L +DW   R+ YPV  +GD +
Sbjct: 690 LVTGRPPVAID----WFALEDDWNRQRDGYPVRPSGDPV 724


>gi|261880159|ref|ZP_06006586.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
 gi|270333130|gb|EFA43916.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
          Length = 772

 Score =  209 bits (533), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 140/506 (27%), Positives = 231/506 (45%), Gaps = 65/506 (12%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP + G +P   +       +T  G W              L ATDP F  I   +  +
Sbjct: 230 VLPGYCGMMPHDAKAKL-GLDVTDGGTWNGYTRPAN------LSATDPKFDHIADLYYRE 282

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             + YG+  + Y+ D F E+  P D+    +  G  + + M+  +  + W++QGW+ +  
Sbjct: 283 LTRLYGKADY-YSMDPFHES--PDDASVDYAEAGRKLLAAMKRANGKSNWVIQGWMENPR 339

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAG 181
           P         ++ ++P G +++LDLF+E +P      IW   + +    +++CML NF  
Sbjct: 340 P--------QMIEALPEGDIIILDLFSECRPMFGAPSIWQRKEGYGRHNWLFCMLENFGA 391

Query: 182 NIEMYGILD------SIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF--- 232
           N+ ++G +D       +A  P     +    + G+G +MEG E NP++++LMSE+ +   
Sbjct: 392 NVGLHGRMDQLVHNFKLAASPSTPYQNARKHLKGIGFTMEGSENNPIMFELMSELVWRAN 451

Query: 233 -----QHEKVDVK-AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVI 284
                + ++ D K  W   Y   RYG   P IQ+AW +L  ++YNC  G   +  +  + 
Sbjct: 452 DLVSAERDRRDFKEGWTRNYVKARYGIDNPKIQEAWQLLIGSIYNCPVGNNQQGPHESIF 511

Query: 285 VAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIAS 344
              P +D   +  +  K +NY                       Y  +  +RA +L  + 
Sbjct: 512 NGRPSLDNFQVK-SWSKMRNY-----------------------YDPNVTLRAAQLMTSV 547

Query: 345 GNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMD 404
            +    +N + YDL+D+ RQA+   A   +L  I  Y+  D       S RFL ++   D
Sbjct: 548 ADRYRGNNNFEYDLVDIVRQAMDDQARLQYLRTIADYKGFDRTAFSADSARFLNMLLLQD 607

Query: 405 GLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKY 464
            LL     F LG  +E A+ L+   E++  YEWNAR QIT W + T      LRDY +K 
Sbjct: 608 KLLGTRQEFRLGTRIEQARSLSTTLEEKNLYEWNARVQITTWGNRTCANEGGLRDYAHKE 667

Query: 465 WSGLLRDYYGPRAAIYFKYMIESLES 490
           W GLLRD+Y  R   Y   + + + +
Sbjct: 668 WQGLLRDFYFMRWHTYLDALSKQMTA 693


>gi|187734575|ref|YP_001876687.1| alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
           BAA-835]
 gi|187424627|gb|ACD03906.1| Alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
           BAA-835]
          Length = 848

 Score =  209 bits (532), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 147/518 (28%), Positives = 245/518 (47%), Gaps = 74/518 (14%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLG--NWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
           + PAFSG VP  +  ++P AK+ +LG   W      P+    + L   +PLF++IGR ++
Sbjct: 239 ICPAFSGFVPRGILRLYPEAKLHRLGWGGW------PQKNHAHFLSPEEPLFLKIGRLYM 292

Query: 66  EQQLKEYGRTSHIYNCDTFDENTPPVDSP------EYISSLGAAIYSGMQSGDSDAVWLM 119
           ++  KE+G+ ++ +  D+F+E   P +          +SSLG  IY  + S + DAVW+M
Sbjct: 293 QEWQKEFGKNTY-FLADSFNEMELPENKGGVEARNNMLSSLGEQIYRSISSTNPDAVWVM 351

Query: 120 QGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPI-------WSTSKQFYGVPY 171
           QGW+F Y    W    +KALL+ VP  K+++LDL A+           W   K F+  P+
Sbjct: 352 QGWMFGYQRNIWNADTLKALLSKVPDDKMLLLDLAADYNKTFWRNGMNWDVFKGFFNKPW 411

Query: 172 IWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEM 230
           ++ ++ N  G   M G++D  A G +EA  +S    + G+GM+ EGIE N V+Y+L+++ 
Sbjct: 412 VYSVVPNMGGKCAMTGVMDFYANGHLEALNSSSRGRLSGMGMAPEGIENNDVIYELITDA 471

Query: 231 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY-NCTDGATDKNRDVIVAFPD 289
           A+++ + +V+ ++  Y   RYG    ++++AWN+   T Y N  D             P 
Sbjct: 472 AWRNRQENVEQYLENYCRARYGNYPDSMKEAWNLFRRTAYSNLKD------------HPR 519

Query: 290 VDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELS 349
            +  +   T G   N                         ++ + ++ L LF+ +   L 
Sbjct: 520 FNWQMKPGTRGCSVN-------------------------TSEDFLKGLSLFVNT-RGLE 553

Query: 350 ASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLAC 409
            S  +R D +++    L    NE      EA    D     +    F +     D LL  
Sbjct: 554 QSPLFRQDAVEMAVHYLGIRMNEAIRAAQEALDEQDQENAEKCMAYFRKYALLADSLLEG 613

Query: 410 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 469
           H  + L  W+  A+    + E++ +YE NAR  +T W          + DY  K WSGL+
Sbjct: 614 HPTWRLSRWISFARSHGTSPEEKNKYEQNARRLVTRW-------GPPVDDYAAKIWSGLI 666

Query: 470 RDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 507
           RDYY PR    +++ I+S  S     +  W  +W++ T
Sbjct: 667 RDYYLPR----WEHFIQSRLSEKNPDMGAWEEKWVRST 700


>gi|395804724|ref|ZP_10483959.1| alpha-N-acetylglucosaminidase [Flavobacterium sp. F52]
 gi|395433112|gb|EJF99070.1| alpha-N-acetylglucosaminidase [Flavobacterium sp. F52]
          Length = 722

 Score =  209 bits (531), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 147/513 (28%), Positives = 244/513 (47%), Gaps = 52/513 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V+PAFSG VP A     P AKIT+L +W    S   +  T+LLD+ DPLF +IG+ FIE 
Sbjct: 245 VVPAFSGYVPKAFAEKHPEAKITELKSW----SGGGFASTFLLDSKDPLFKQIGKRFIEI 300

Query: 68  QLKEYGRTSHIYNCDTFDENTPPV---DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 124
             K YG+ S+ Y  D+F+E  PPV   +  E +S+ G+A+Y  +      AVW+MQGWLF
Sbjct: 301 YTKMYGK-SNFYLADSFNEIEPPVSEHNKYEELSNYGSAVYETIDEAAPGAVWVMQGWLF 359

Query: 125 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
             +  FW     KA L+ VP  K++V D   +   +W   + FYG  + +  +HN+ G+ 
Sbjct: 360 GDNKEFWTKEATKAFLSKVPNEKVMVQDYANDRYKVWENQEAFYGKQWTYGYVHNYGGSN 419

Query: 184 EMYGIL----DSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDV 239
            +YG L    D +A      +      +VG G   EG+  N +VY+ + ++ +   +  +
Sbjct: 420 PVYGDLNFYKDELA---SLLKNPNRGNIVGYGAMPEGLNNNSIVYEYIYDLPWTKAEQPL 476

Query: 240 KAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
             W+ +Y   RYG++  ++  AW +L  +VYN     T    D   A+       + +TE
Sbjct: 477 NDWMAKYLNARYGQTSESVFHAWELLLKSVYNVKYWETRWWNDWAGAYLLFKRPTVKITE 536

Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
            K    G P  K                     ++  AL++      + + +N  +YDLI
Sbjct: 537 FK----GNPGDK--------------------IKLKEALDILKKEAKKYNKNNLIQYDLI 572

Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
           D++R   +   +E  +  I+AYQ  +     QL ++  + V + D +++      L  W+
Sbjct: 573 DVSRHYNSLSIDEELIECIKAYQEKNIAKGDQLFKQIEKQVLETDKMMSGQPLNNLNQWV 632

Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
           +SA     + E    Y  NA+T IT+W          L DY ++ W G+ + +Y PR  +
Sbjct: 633 KSASDYGSSPEVSSLYAKNAKTLITLWGGEGH-----LNDYASRSWKGMYKGFYWPRWKM 687

Query: 480 YFKYMIESLESGDGF-------RLKDWRREWIK 505
           + + + ++  +   F        +K+W   W +
Sbjct: 688 FLEALKKAAVTNTSFDENKERESIKNWEINWTE 720


>gi|404487024|ref|ZP_11022211.1| hypothetical protein HMPREF9448_02667 [Barnesiella intestinihominis
           YIT 11860]
 gi|404335520|gb|EJZ61989.1| hypothetical protein HMPREF9448_02667 [Barnesiella intestinihominis
           YIT 11860]
          Length = 722

 Score =  208 bits (529), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 157/533 (29%), Positives = 245/533 (45%), Gaps = 77/533 (14%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNW--FSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
           + PAF+G VP A     P  +   L  W  F  K +      Y+L    P F EIG+ FI
Sbjct: 237 IAPAFAGFVPTAFAERHPEIQFKHL-EWGGFDEKYN-----AYVLPPETPYFKEIGKLFI 290

Query: 66  EQQLKEYGRTSHIYNCDTFDENTPPV------DSPEYISSLGAAIYSGMQSGDSDAVWLM 119
           E+  KE+G+ ++ Y  D+F+E   PV         + ++  G +IY  + +G+ DAVW+ 
Sbjct: 291 EEWEKEFGKNTY-YLSDSFNEMKLPVAEGDDDGKHKLLAQYGESIYHSIAAGNPDAVWVT 349

Query: 120 QGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPY 171
           QGW F Y   FW    ++ALL+ VP  K++++DL  +  K +W T +       FYG  +
Sbjct: 350 QGWTFGYQHDFWDKASLQALLSRVPDDKMIIIDLGNDYPKWVWGTEQTWKNHDGFYGKKW 409

Query: 172 IWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEM 230
           I+  + NF G   M G L   A    EA  S N   +VG G + EG+E N VVY+L+++M
Sbjct: 410 IFSYVPNFGGKTPMTGDLQMYATSSAEALHSANAGNLVGFGSAPEGLENNEVVYELLADM 469

Query: 231 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDV 290
            +  + +D+ +W+  Y   RYG    A+  AW     T Y+            +V  PD 
Sbjct: 470 GWTADSIDLDSWLPVYCKARYGGCPAAMDSAWQRFKETAYSSLYSYPRFTWQTVV--PDT 527

Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
                           + +SK  V  S                 ++ +ELF++  + L +
Sbjct: 528 ----------------RRISKLDVSDS----------------FLQGVELFLSCADSLES 555

Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
           S  Y  D I+     LA  A++ +   ++   L +     Q   R +E++ D+D LLA H
Sbjct: 556 SPLYVNDAIEYASYYLAAKADDCYKRALKEDSLGNRVAAMQQLDRSVEILLDVDKLLASH 615

Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
             + L  W++ A+   + + ++  YE NA+  IT W            DY  ++WSGL++
Sbjct: 616 PLYRLEEWVDMARDWGKTDLEKDAYEANAKRLITTW-------GGFQEDYAARFWSGLIK 668

Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
           DYY PR  +YF     S +  D   L  W   WIK    W N    +  P++S
Sbjct: 669 DYYIPRMKLYF-----SEQRAD---LDRWEENWIKAP--WHNTSTSFEDPLQS 711


>gi|32564213|ref|NP_496948.2| Protein K09E4.4 [Caenorhabditis elegans]
 gi|25814792|emb|CAB70170.2| Protein K09E4.4 [Caenorhabditis elegans]
          Length = 715

 Score =  208 bits (529), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 136/485 (28%), Positives = 226/485 (46%), Gaps = 48/485 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LP F+G VP  L+ +FP++K  +L  W +  S+    C   +   DPLF +IG  F+  
Sbjct: 247 ILPTFAGFVPDHLETLFPASKFNRLPRWNNFTSET--SCMLSVSPFDPLFQKIGSTFLRH 304

Query: 68  QLKEYG-RTSHIYNCDTFDENTPPVDS---PEYISSLGAAIYSGMQSGDSDAVWLMQGWL 123
           Q K +G   +++Y+ D F+E  P   +    +++     AI +  +  D + VW++Q W 
Sbjct: 305 QKKMFGGDVTNMYSADPFNEILPSESAKFDAKFVKQTAQAIMNSCKKVDKNCVWVLQSWS 364

Query: 124 FSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
           F+YD  W    +K+ L+++P+G L++LDL+AEV P W  +  F G  ++WC+LHNF G+ 
Sbjct: 365 FTYDQ-WPAWAIKSFLSAIPVGNLLILDLYAEVVPAWQMTSSFQGHHFVWCLLHNFGGSR 423

Query: 184 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 243
           E+ G L  I  G   A     + +VG G+SME I+QN VVY  M +  +  E + +  W+
Sbjct: 424 ELRGNLQKIDKGYQLALMKAGSNLVGAGLSMEAIDQNYVVYQFMIDRMWSPEPLPLNNWL 483

Query: 244 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
             YS  RY       Q  W +L  T YN  +        V +                  
Sbjct: 484 KAYSESRYSADFKVAQKFWTLLAGTFYNQPEKWGTPRFSVFL------------------ 525

Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVI-RALELFIASGNELSASNTYRYDLIDLT 362
            Y +P     +             W+   E   R  EL  A  + L     +R DL D+ 
Sbjct: 526 -YHRPGFGRKI-----------EYWFPVEETFSRFRELLPALVHVLGEHPLFREDLNDVM 573

Query: 363 RQALA-KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
           R+    +  NE  L++ EA+ + D   V       +E+ + ++     +    +  W+E+
Sbjct: 574 REMTQFEMGNEAALSMSEAFLMEDKQQVGASCEMLMEMFQKLES----YSNRDVRQWIEN 629

Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
           AK +A   E+ + +   A   +T+W    Q       DY ++ W+GL+  YYG R   + 
Sbjct: 630 AKSIAPTSEERQVFPVTAGDILTVWGPTGQN-----LDYAHREWAGLMSGYYGRRWQYFC 684

Query: 482 KYMIE 486
            +++E
Sbjct: 685 DWILE 689


>gi|296115989|ref|ZP_06834611.1| alpha-N-acetylglucosaminidase [Gluconacetobacter hansenii ATCC
           23769]
 gi|295977458|gb|EFG84214.1| alpha-N-acetylglucosaminidase [Gluconacetobacter hansenii ATCC
           23769]
          Length = 758

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 147/544 (27%), Positives = 246/544 (45%), Gaps = 62/544 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP F G VP      FP A +   G W   +  P W     LD  DP+F ++   + ++
Sbjct: 259 VLPGFYGMVPDDFGKRFPQAHVIGQGEWNRFRR-PAW-----LDPRDPMFAKVAAIYYDE 312

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G  + +Y+   F E   P D P  ++  G  I   + +    A+W++  W    D
Sbjct: 313 QKKLFG-DAPVYDIQPFQEGGTPGDVP--LADAGQGIQKALDTAHPGAMWMLMAWYEEPD 369

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
                     +L  V   +L ++DL    +   +    F G P+++  L +F G   + G
Sbjct: 370 --------ARMLAGVDRKRLFIVDLEQNTRVRENRDADFQGAPFLYGGLWDFGGRTSLGG 421

Query: 188 ILDSIAFG---PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
              S  +G   P   RT +N  M+G  +  EG++ NP ++DL +E A++ + VD   W  
Sbjct: 422 --SSYDYGVRLPGLWRTQKN--MIGTAVFPEGMDNNPYIFDLFTEAAWRRDGVDTTQWTR 477

Query: 245 QYSVRRYGRSVPA-IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
            Y+ RRYG+      + AW++L H+ +                         S      Q
Sbjct: 478 DYADRRYGQPGDVHARKAWDLLLHSAF-------------------------SYRATGIQ 512

Query: 304 NYGKP-VSKEAVLKSETSSYDHPHLW-------YSTSEVIRALELFIASGNELSASNTYR 355
           ++G+   + +++  ++ S   H   W       Y    V  A+   + + +   A+  YR
Sbjct: 513 DFGEASAAPDSLFNAQPSLDTHSAAWNGMKVLPYDPHLVEAAMAELLQASDATRATEAYR 572

Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
           YDL+D+TRQA+A  A  +   I +A+   D   +  L+ R+LEL++  D LLA +  F +
Sbjct: 573 YDLVDVTRQAVANQARAMLPQIGDAFAARDRAKLHALTTRWLELMDRQDSLLATNTFFRV 632

Query: 416 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
           G WL   +  + +  Q K  +++AR  +T W   T  +   LRDY NK W+GL +DYY  
Sbjct: 633 GTWLSWPQAWSDDPAQRKLMDYDARVILTNWGGRTASQVGHLRDYANKDWAGLTKDYYRV 692

Query: 476 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 535
           R  ++F  +  SL +G   R  D    W K+  +W +   VY     GD+   ++ +++ 
Sbjct: 693 RWQLFFDSLETSLATGRPPREID----WYKVGEEWCHNGRVYSPTPEGDSYTVARDIHDY 748

Query: 536 YLQG 539
             QG
Sbjct: 749 LTQG 752


>gi|393783265|ref|ZP_10371440.1| hypothetical protein HMPREF1071_02308 [Bacteroides salyersiae
           CL02T12C01]
 gi|392669544|gb|EIY63032.1| hypothetical protein HMPREF1071_02308 [Bacteroides salyersiae
           CL02T12C01]
          Length = 723

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 148/535 (27%), Positives = 246/535 (45%), Gaps = 73/535 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           + PAF+G VP A     P  K   L  W            Y+L    P F EIG+ F+++
Sbjct: 236 IAPAFAGFVPMAFAEKHPDIKFKHL-KWGGFDDK---FNAYVLPPDSPFFEEIGKRFVKE 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
             KE+G+ ++ Y  D+F+E   PV   +       ++  G +IY  + +G+ DA+W+ QG
Sbjct: 292 WEKEFGKNTY-YLSDSFNEMELPVAKDDVEGKHKLLAQYGESIYRSITAGNPDAIWVTQG 350

Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
           W F Y   FW    ++ALL+ VP  K++++DL  +  K +W T +       FYG  +I+
Sbjct: 351 WTFGYQHDFWDKASLQALLSHVPDDKMIIIDLGNDYPKWVWGTEQTWKVHDGFYGKKWIF 410

Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
             + NF G   + G L   A    EA +   +  ++G G + EG+E N VVY+L+++M +
Sbjct: 411 SYVPNFGGKTPLTGDLQMYATSSAEALKAPSHGNLIGFGSAPEGLENNEVVYELLADMGW 470

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
             + +D + W+  Y   RYG    ++++AW +   T Y+            + ++P    
Sbjct: 471 TDQAIDPEQWMPSYCTARYGAYPESMKNAWELFRKTAYSS-----------LYSYPRFTW 519

Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
             +   + +       +SK  V                + + +  +ELF+AS + L+ S 
Sbjct: 520 QTVIPDQRR-------ISKIDV----------------SDDFLHGIELFLASADSLNRSK 556

Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
            Y  D I+     +A  A++L+   +           +Q   + ++L+ ++D LLA H  
Sbjct: 557 LYVNDAIEFASYYIAAQADKLYKQALTEDTAGKPVAAYQHLNQAIDLLLNVDKLLASHPL 616

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           + L  W+E A+       ++  YE NA+  IT W            DY  ++WSGL++DY
Sbjct: 617 YRLEEWVELARNSGTTPAEKDAYEANAKRLITTW-------GGFQEDYAARFWSGLIKDY 669

Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALI 527
           Y PR  +YF     S + GD   L +W  EWI+    W N     P E   D  I
Sbjct: 670 YIPRLKLYF-----SKQRGD---LDNWEEEWIR--TPWHN--TTTPFEKPLDMAI 712


>gi|210631701|ref|ZP_03296968.1| hypothetical protein COLSTE_00853, partial [Collinsella stercoris
           DSM 13279]
 gi|210159960|gb|EEA90931.1| F5/8 type C domain protein, partial [Collinsella stercoris DSM
           13279]
          Length = 1906

 Score =  207 bits (526), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 150/499 (30%), Positives = 238/499 (47%), Gaps = 42/499 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGR 62
           V+  F G VP   Q   P++     G+W S  + P    TYL DA      +  F ++G 
Sbjct: 391 VIQGFGGQVPTDFQQKNPNSVAASSGSW-SGFARPYMIKTYLTDADRAAGKEDYFQKVGT 449

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F E Q + +G+ SH Y  D F E    V     I  +   +   M   D  AVW+MQ W
Sbjct: 450 TFYEAQERIFGKVSHFYAVDPFHEGGT-VPQGFNIVDIYRTVQQKMLDYDPQAVWVMQQW 508

Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
            +  D          L       + +VLDL ++++   ++  +   VP++W MLHNF G 
Sbjct: 509 QWGIDE-------NKLSGLAKKEQSLVLDLQSDLRS-QASPMENQQVPWVWNMLHNFGGR 560

Query: 183 IEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
           + M G+ + +A    +A  S N  M G+G++ E I+ +P+VY+L+ +M ++ + VD +AW
Sbjct: 561 MGMDGVPEVLAIKIPQAYNS-NRYMRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRAW 619

Query: 243 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
              Y  RRYG +   IQ+AW++L  T Y   DG           +     SI++      
Sbjct: 620 TRSYIERRYGGTDAKIQEAWDILLDTAYKHVDGEY---------YQGASESIMNA----- 665

Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 362
               +P   +       S++ H  + Y   E  RA +LFI S +    S  +RYD +D+ 
Sbjct: 666 ----RPSDNKI---GSASTWGHSDIDYDKKEFERAAQLFIESYDTYKDSEAFRYDFVDVM 718

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
           RQ LA    E      +AY+  DA     L+ + LE+++  D +L+    F+LG W+E+A
Sbjct: 719 RQVLANAFQEYQPLAGDAYKQRDAERFELLANQMLEMLDAQDRMLSTSSDFMLGTWIENA 778

Query: 423 KQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
           + L ++ +      +E NAR+ IT W    ++  SL+ DY N+ WSGL   YY PR   +
Sbjct: 779 RTLLEDADDWTADLFELNARSLITTW--GLEKNGSLI-DYSNRQWSGLTGSYYKPRWESW 835

Query: 481 FKYMIESLESGDGFRLKDW 499
                ++LE G   +  +W
Sbjct: 836 ANARKKALEDGGSAQDLNW 854


>gi|146300873|ref|YP_001195464.1| alpha-N-acetylglucosaminidase [Flavobacterium johnsoniae UW101]
 gi|146155291|gb|ABQ06145.1| Candidate alpha-glycosidase; Glycoside hydrolase family 89
           [Flavobacterium johnsoniae UW101]
          Length = 723

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 148/511 (28%), Positives = 242/511 (47%), Gaps = 47/511 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V+PAFSG VP A     P AKIT+L +W    S   +  T+LLD+ DPLF +IG+ FIE 
Sbjct: 245 VVPAFSGYVPKAFAEKHPEAKITELNSW----SGGGFESTFLLDSKDPLFKKIGKRFIEI 300

Query: 68  QLKEYGRTSHIYNCDTFDENTPPV---DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 124
             K YG+ S+ Y  D+F+E  PPV   +  E +++ G+AIY  ++     AVW+MQGWLF
Sbjct: 301 YTKMYGK-SNFYLADSFNEIEPPVTEHNKYEELANYGSAIYETIEEAAPGAVWVMQGWLF 359

Query: 125 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
             +  FW      A L+ VP  +L+V D   +   +W   + FYG  + +  +HN+ G+ 
Sbjct: 360 GDNKNFWTKEATSAFLSKVPNDRLMVQDYANDRYKVWENQEAFYGKQWTYGYVHNYGGSN 419

Query: 184 EMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
            +YG L+      V   +      +VG G   EG+  N +VY+ + ++ +   +  VK W
Sbjct: 420 PVYGDLNFYKNELVSLLKNPHRGNVVGYGAMPEGLNNNAIVYEFIYDLPWSKGEQSVKDW 479

Query: 243 INQYSVRRY-GRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           +  Y   RY  ++  ++  AW +L  +VY+     T    D   A+        ++TE K
Sbjct: 480 LTNYLNARYEQKTSDSVFKAWELLLESVYSTKYWETRWWNDRAGAYLLFKRPTATITEFK 539

Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
               G P  K+                    ++  AL++  A   +    N  +YDLID 
Sbjct: 540 ----GNPGDKD--------------------KLKEALDILKAEAKKYDKKNFIQYDLIDA 575

Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
           +R   +   +E  +  ++AYQ  D     QL ++  + V ++D  ++      L  W++S
Sbjct: 576 SRHYYSLSIDEDLVECVKAYQQKDITKGDQLFKKIEKQVLEIDKSMSGQPLNSLNYWVKS 635

Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
           A +     E  K Y  NA+T IT+W          L DY ++ W G+ + +Y PR  ++ 
Sbjct: 636 ASEYGSTPEVSKLYVKNAKTLITLWGGEGH-----LNDYASRSWQGMYKGFYWPRWKMFL 690

Query: 482 KYMIESL-------ESGDGFRLKDWRREWIK 505
               ++        E+ +   +K+W  +W K
Sbjct: 691 TAFKKTAVNNTPFDETKEREEIKNWEIKWTK 721


>gi|160914140|ref|ZP_02076362.1| hypothetical protein EUBDOL_00149 [Eubacterium dolichum DSM 3991]
 gi|158433951|gb|EDP12240.1| hypothetical protein EUBDOL_00149 [Eubacterium dolichum DSM 3991]
          Length = 2150

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 136/478 (28%), Positives = 237/478 (49%), Gaps = 50/478 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLD-----ATDPLFIEIGR 62
           V+  F G VP +       A +T +  W S  + P    TYL            F ++ +
Sbjct: 424 VIQGFCGQVPMSFVEKNEGAVLTPIDEWPSF-TRPAMIKTYLSQEEIAAGKKDYFKDVAK 482

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQ 120
            F E+Q   +G  S  Y  D F E  NT  +D    ++++   +   M   ++DA+W+MQ
Sbjct: 483 TFYEKQKNVFGDVSDYYASDPFHEGGNTQGLD----VTNIFKTVQEEMLKSNADAIWVMQ 538

Query: 121 GWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFA 180
            W  + D      ++  L   V   + + LDL +++ P  S+  +  G+ +IWCMLHNF 
Sbjct: 539 QWQGNLDH----AKLSGL---VKPEQALALDLQSDMNP--SSVMENEGISWIWCMLHNFG 589

Query: 181 GNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVK 240
           G + + G ++ IA  P  A  S N  M G+G++ E +E +P+VY+++ +M +  + +D +
Sbjct: 590 GRMGLDGEVEVIAKEPAIA-ASNNQYMKGIGITPEALENSPIVYEMLFDMTWSKDPIDYQ 648

Query: 241 AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
           AW+++Y+ RR G S  ++Q+AW++L  T Y          +D  + +     ++I+   G
Sbjct: 649 AWVDKYATRRAGGSSDSLQEAWDMLLETAY----------KDKGIYYQGAGETVINARPG 698

Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
              N+           S  S++ H ++ Y   E+ + L L I + +  +AS  YRYDL D
Sbjct: 699 --TNF-----------SSASTWGHSNILYDKEELDKVLSLLIENYDAFAASEAYRYDLAD 745

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           +  Q L   A E    +++A    D+    ++S  FLEL++  D +L   + F+LG W+ 
Sbjct: 746 VAEQVLCNAAIEYHALMVQALNNKDSAEFKRISTHFLELIDLSDRILGSSEEFMLGTWIH 805

Query: 421 SAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
            A+++  N +   +  +E+NAR  +T W     E +  L+DY N+ W+GL   +Y  R
Sbjct: 806 DAREMLDNADDWTKDLFEFNARAVVTTW---GGERSGSLKDYSNRKWAGLTSSFYKER 860


>gi|169351448|ref|ZP_02868386.1| hypothetical protein CLOSPI_02228 [Clostridium spiroforme DSM 1552]
 gi|169291670|gb|EDS73803.1| LPXTG-motif cell wall anchor domain protein [Clostridium spiroforme
           DSM 1552]
          Length = 1990

 Score =  205 bits (521), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 150/527 (28%), Positives = 251/527 (47%), Gaps = 48/527 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VL  FSG VP   ++ +   +    G+W   +  P    TY+ +     F ++   F + 
Sbjct: 397 VLSGFSGQVPTNFKDKYQDVQYVAQGSWCGYER-PDMLRTYVDNGGTDYFSQMADVFYKA 455

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q   +G  ++IY  D F E     D     + +   +   M   D DA+WL+Q W  S  
Sbjct: 456 QRDIFGDVTNIYAVDPFHEGGKIGDMN--YTKVYETVQKKMMENDEDAIWLIQEWSGSIA 513

Query: 128 PFWRPPQMKALLNSVPLGK--LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
               P ++      + L K  ++VLDLF+EV P  +++ +    P+IW MLHNF G + +
Sbjct: 514 S--NPSKL------INLDKEHVIVLDLFSEVSP-RNSALEAADTPWIWNMLHNFGGRMGL 564

Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
               + ++   P   + SE+  MVG+GM+ E IE +P+ Y+L+ +M +  + +D + W  
Sbjct: 565 DANPEKVSQNIPNTYQNSEH--MVGIGMTPEAIENSPMAYELLWDMTWTKDPIDFRQWCQ 622

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
            Y+ R YG +   I++ WN+L  T YN  D            +     S+I+        
Sbjct: 623 DYAKRIYGGTNEDIEEVWNILLDTGYNRKDN----------YYQGAPESVIN-------- 664

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
             +P +      +  SS+ H  + Y   E+ RA+ L   + +E   S  + YDL D+TRQ
Sbjct: 665 -ARPTTN----FTSASSWGHSTINYDKEELERAVYLMAKNYDEFKDSPAFIYDLSDITRQ 719

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            ++  A E    ++ AYQ  +      LS +FLE++   D +L+ +  FL+G W+E A+ 
Sbjct: 720 LISNSAQEYHKAMVNAYQAGNLSEFEVLSDKFLEMILLQDQILSTNSDFLVGKWIEQART 779

Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           + ++ +   +  +E+NAR  IT W          LRDY N+ W+GL +DYY PR   +  
Sbjct: 780 MIEDSDDWTKDLFEFNARDLITTWGGLKNANGGGLRDYSNRQWAGLTKDYYYPRWQKWIN 839

Query: 483 YMIESLESGDGFRLKDW---RREWIKLTNDWQNGRNVYPVESNGDAL 526
            + E++++       DW     EW  L +D  N    Y +E++  AL
Sbjct: 840 DVKEAMKNNTAVPSTDWFLMEWEWANLKSDEGNE---YSIEASNLAL 883


>gi|294674521|ref|YP_003575137.1| alpha-N-acetylglucosaminidase [Prevotella ruminicola 23]
 gi|294472030|gb|ADE81419.1| putative alpha-N-acetylglucosaminidase [Prevotella ruminicola 23]
          Length = 754

 Score =  204 bits (520), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 140/495 (28%), Positives = 232/495 (46%), Gaps = 54/495 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP + G VP   +       +   G W   +          L  TD  F EI   +  +
Sbjct: 223 VLPGYCGMVPHDAKEKL-GLNVADAGLWNGFQRPAN------LLPTDARFSEIATLYYNE 275

Query: 68  QLKEYGRTSHIYNCDTFDENT--PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
             K +G+  + Y+ D F E+   P +D     +  G A+   M+  +  AVW++QGW  +
Sbjct: 276 LTKLFGKADY-YSMDPFHESNDDPNID----YAKAGQAMMQAMKRVNPKAVWVIQGW--T 328

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNF 179
            +P       +A+++ +  G L+VLDLF+E +P      IW   + +    +++C+L NF
Sbjct: 329 ENP------REAMVDDMKTGDLLVLDLFSECRPMFGIPSIWKREQGYKQHQWLFCLLENF 382

Query: 180 AGNIEMYG----ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHE 235
             N+ ++G    +LD+          ++++ + G+G +MEG E NPV+++LMSE+ ++ E
Sbjct: 383 GANVGLHGRMDQLLDNFYMLQSSKFQAQSSKLKGIGFTMEGSENNPVMFELMSELPWRPE 442

Query: 236 KVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSII 295
           K   + W+  Y   RYG    AI+ AW  L  ++YNC  G   +             SI 
Sbjct: 443 KFTKEQWVKNYVKARYGVEDEAIEKAWLTLAKSIYNCPAGNNQQG---------PHESIF 493

Query: 296 SVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 355
                     G+P     +   + SS+     +Y  +   +A +L  +   +   +N + 
Sbjct: 494 C---------GRPT----LNNFQASSWSKMKNYYDPAMTKKAAKLMNSVAEKYRGNNNFE 540

Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
           YDL+D+TRQALA  A   +   I  Y+        + + RFL+++   D LL     F +
Sbjct: 541 YDLVDITRQALADQARLQYQKTIADYKAFSRKQFDRDAERFLKMLLLQDKLLGTRTEFRV 600

Query: 416 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
           G W + A       E++K YEWNAR QIT W +    +   LRDY +K W GLL+D+Y  
Sbjct: 601 GHWTQDAVNAGNTAEEKKLYEWNARVQITTWGNRYCADTGGLRDYAHKEWQGLLKDFYYV 660

Query: 476 RAAIYFKYMIESLES 490
           R   YF  +   +++
Sbjct: 661 RWKSYFDALAAQMKA 675


>gi|341892319|gb|EGT48254.1| hypothetical protein CAEBREN_28412 [Caenorhabditis brenneri]
          Length = 713

 Score =  204 bits (519), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 133/486 (27%), Positives = 227/486 (46%), Gaps = 50/486 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LP F+G VP  L+ +FPS+K T+L  W +  S+    C   +   DPLF +IG  F+  
Sbjct: 245 ILPTFAGFVPDQLEKLFPSSKFTRLPCWNNFTSET--SCLLSVSPFDPLFQKIGSLFLRH 302

Query: 68  QLKEYG-RTSHIYNCDTFDENTPPVDS---PEYISSLGAAIYSGMQSGDSDAVWLMQGWL 123
           Q K +G   +++Y+ D F+E  P   +    +++     AI +  +  D + +W++Q W 
Sbjct: 303 QKKMFGGDITNLYSADPFNEILPSDSAKFDAKFVKQTAQAIMNSCRKVDKNCIWVLQSWS 362

Query: 124 FSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
           F+YD  W    +K+ L++VP+G L++LDL++EV P W ++  F+G  YIWCMLH+F G+ 
Sbjct: 363 FTYDE-WPSWAIKSFLSAVPIGNLLILDLYSEVVPAWQSTSSFHGHNYIWCMLHSFGGSR 421

Query: 184 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 243
           E+ G L  +  G   A     + ++G G++ME I+QN V+Y  M +  +  E + +  WI
Sbjct: 422 ELRGNLQKVDKGYQLALMKGGSNLIGAGLTMEAIDQNYVIYQFMVDRMWSSEPLPLNTWI 481

Query: 244 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
             YS  RY          W +L  + YN  +   +    V +                  
Sbjct: 482 KSYSESRYSADFKVSHKFWTLLAFSFYNQPEKWGNPRFSVFL------------------ 523

Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIAS-GNELSASNTYRYDLIDLT 362
            Y +P   + +             W+   E    L+  I S  + L     ++ DL D+ 
Sbjct: 524 -YHRPAFGKKI-----------EYWFPVEETFGHLQSLIPSLIHVLGDHPLFKEDLNDVM 571

Query: 363 RQALAKY--ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           R A+ ++   N+  L + EA+ + D   +       +    DM   L  +    +  W+E
Sbjct: 572 R-AITQFEVGNDAALTLTEAFLMEDKQQIGSTCENLM----DMFLKLESYSNRDMKHWIE 626

Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
            +K +A   E+ + +   A   +T+W    Q       DY ++ W GLL  YYG R   +
Sbjct: 627 DSKSIAATSEERQVFPATAADILTVWGPEGQN-----LDYAHREWEGLLSGYYGRRWQYF 681

Query: 481 FKYMIE 486
             +++E
Sbjct: 682 CDWILE 687


>gi|410634789|ref|ZP_11345419.1| alpha-N-acetylglucosaminidase [Glaciecola arctica BSs20135]
 gi|410145665|dbj|GAC22286.1| alpha-N-acetylglucosaminidase [Glaciecola arctica BSs20135]
          Length = 750

 Score =  204 bits (519), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 149/548 (27%), Positives = 264/548 (48%), Gaps = 73/548 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           ++ AF+G VP A   +FP A+I +L +W      P     YLL   +PLF++IG+ +IE+
Sbjct: 250 IVHAFAGFVPPATSELFPEAQIRRL-SWGG--GLPESTYGYLLSPENPLFVKIGKMYIEE 306

Query: 68  QLKEYGRTSHIYNCDTFDE-NTPPVDS-PEYISSL---GAAIYSGMQSGDSDAVWLMQGW 122
             KE+G+  + Y  D+F+E + PP D+  E ++ L   G  +Y  +++ + DA W+MQGW
Sbjct: 307 WQKEFGKNEY-YLADSFNEMDVPPADTEAELLTELAGYGDRVYQSIKAANPDATWVMQGW 365

Query: 123 LFSYDP------FWRPPQMKALLNSVPLGKLVVLDLFAE-------VKPIWSTSKQFYGV 169
            F Y        FW P ++ AL++ VP  KL++LDL  E       + P W     F+  
Sbjct: 366 TFPYHKDENRQLFWTPERLHALVSKVPDDKLLILDLANEYNKLWWKIDPSWKMYSGFFNK 425

Query: 170 PYIWCMLHNFAGNIEMYGILDSIAFGPVEART-SENTTMVGVGMSMEGIEQNPVVYDLMS 228
            +I+  + N  G   + G  D  A  P++A    +   ++G G + EGIE N ++Y+L++
Sbjct: 426 KWIYSFIPNMGGKTPLNGRFDIYAELPIDALNYKDKGNLIGFGFAPEGIENNEMIYELLT 485

Query: 229 EMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFP 288
           +MA+Q + +DV  W  +Y+++RYG    +++ A++ L              N+  + +F 
Sbjct: 486 DMAWQRKAIDVDQWQAKYAMQRYGAYPGSLEKAFSYL--------------NKSALGSF- 530

Query: 289 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 348
            VD  I       Y+N   P   E          DH  + + + + I+A  LF+ +  +L
Sbjct: 531 -VDHPIHRFQLRPYRN---PEGVE----------DHATV-HESEDFIKATGLFLQASEQL 575

Query: 349 SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLA 408
             +  Y++D +++T   L+   + L    +        + V   +   + ++  MD LLA
Sbjct: 576 KDNKLYQHDAMEITTLFLSLVTDNLLTKFLAKDVEQRDYSVLDEA---ISVMHTMDKLLA 632

Query: 409 CHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGL 468
            H    L  W++ A+       ++  YE NA+  +T W  +       + DY  + WSGL
Sbjct: 633 EHPNHQLVTWVDYARTWGSTTAEKDYYESNAKRLLTTWGGDP------VNDYAGRVWSGL 686

Query: 469 LRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALIT 528
           + +YY PR   Y     +++++   F ++ W   W+     ++N    Y      D +  
Sbjct: 687 IGNYYAPRWQSYH----DAVKNNQTFDVRQWEENWV--MTPYKNTSTAY-----QDPVRV 735

Query: 529 SQWLYNKY 536
           +Q +Y KY
Sbjct: 736 AQAMYFKY 743


>gi|365104185|ref|ZP_09333846.1| hypothetical protein HMPREF9428_02927 [Citrobacter freundii
           4_7_47CFAA]
 gi|363644798|gb|EHL84079.1| hypothetical protein HMPREF9428_02927 [Citrobacter freundii
           4_7_47CFAA]
          Length = 1049

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 142/519 (27%), Positives = 240/519 (46%), Gaps = 43/519 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V P F+G VP       P A++   G W      P    TY+    D  F ++   + + 
Sbjct: 398 VFPGFAGQVPDTFAAKNPQAQVIDQGEWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQT 455

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
               +G  SH Y  D F E     D    +  +   + + M   D DAVW++Q W     
Sbjct: 456 LKTTFGDISHYYAVDPFHEGGNRADLD--MVKVAQTVQNKMLEHDKDAVWIIQNW----- 508

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
              +     A LN +     ++LDL+A+ KP  +   +F   P+IW MLH F G +   G
Sbjct: 509 ---QENPTDAFLNGLKKDHALILDLYADNKPNHAMRHEFSNTPWIWNMLHAFGGRMGFSG 565

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
           + + +A   +    +E+  M GVG++ E +  NP++Y+++ +MA++   +   A+I+ + 
Sbjct: 566 MPEVLA-QEIPQSLAESKKMKGVGVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWL 624

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
             RYG   P I+ AW+++  T Y+      D+ R         + SII    G    +G 
Sbjct: 625 TSRYGAQSPEIEQAWDIMVKTAYHRR---KDRQR--------AEDSIIDAKPG----FG- 668

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
            V++     +    YD        +E  + L L+++  +   A+  Y++DL+D+TRQ LA
Sbjct: 669 -VTRACTYYTALIDYDK-------AEFEKILPLYLSVYDHFKANPAYQHDLVDITRQVLA 720

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL-- 425
             + E +    +A+   D     QLS +FL L++  D +L+    F+LG W+ SA+ +  
Sbjct: 721 NASYEYYRAFEDAWIAKDYSAFNQLSGKFLRLIKLQDQVLSTRPEFMLGTWINSARTMLD 780

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
             ++    Q+E+NAR  +T W      +A  LRDY N+ W GL  D+Y  R A + + + 
Sbjct: 781 GMDDWTRDQFEFNARAMVTTWGTEQAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQALK 839

Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQN-GRNVYPVESNG 523
            +  +G   +    +  W  L   W N   N YP + +G
Sbjct: 840 SAAATGQ--KQDAIKVNWFPLEYRWVNQSGNGYPTQPSG 876


>gi|421734750|ref|ZP_16173809.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum LMG 13195]
 gi|407077324|gb|EKE50171.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum LMG 13195]
          Length = 1919

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 148/501 (29%), Positives = 237/501 (47%), Gaps = 46/501 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGR 62
           V+  F G VPA  Q   P++     G W      P    TYL DA      +  F ++G 
Sbjct: 399 VIQGFGGQVPADFQEKNPTSVAASSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVGD 457

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F + Q   +G+ S+ Y  D F E     D  + I  +   +   M   D  AVW+MQ W
Sbjct: 458 TFYKAQENVFGKVSNYYAVDPFHEGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQW 516

Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
            +  D      ++  L +    G+ +VLDL ++++   +++ +  GVP++W MLHNF G 
Sbjct: 517 QWGIDE----TKLSGLADK---GQTLVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGR 568

Query: 183 IEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
           + + G+ + I+    +A  S    M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W
Sbjct: 569 MGLDGVPEVISQDITKAYNSSGY-MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSW 627

Query: 243 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEG 300
             +Y+ RRYG +   I+ AW++L  T Y  TDG      +  +I A P  D +I S    
Sbjct: 628 TQEYAERRYGGTDGTIEKAWDILLDTAYKHTDGEYYQGASESIINARPS-DNTIGSA--- 683

Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
                              S++ H  + Y   +  +A  LF  + +    S  +RYD +D
Sbjct: 684 -------------------STWGHSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVD 724

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           + RQ LA    E      +AY+  D      LS R L++++  D LL+  D FL+G W++
Sbjct: 725 VMRQVLANSFQEYQPLAGQAYKSGDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWID 784

Query: 421 SAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
            A+ +    +      +E NAR  +T W  N  +  SL+ DY N+ W+GL  DYY  R  
Sbjct: 785 DARTMLDGADDWTADLFELNARALVTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWK 841

Query: 479 IYFKYMIESLESGDGFRLKDW 499
            Y    +  LE G  F   DW
Sbjct: 842 TYVDNRLNKLEHGTDFTDPDW 862


>gi|393785795|ref|ZP_10373941.1| hypothetical protein HMPREF1068_00221 [Bacteroides nordii
           CL02T12C05]
 gi|392661414|gb|EIY55000.1| hypothetical protein HMPREF1068_00221 [Bacteroides nordii
           CL02T12C05]
          Length = 724

 Score =  203 bits (517), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 143/512 (27%), Positives = 234/512 (45%), Gaps = 69/512 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           + PAF+G VP A     P  K   L  W            Y+L    P F EIG+ F+++
Sbjct: 236 IAPAFAGFVPMAFAEKHPDIKFKHL-KWGGFDDK---FNAYVLPPDSPFFEEIGKRFVKE 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
             KE+G+ ++ Y  D+F+E   PV   +       ++  G +IY  + +G+ DA+W+ QG
Sbjct: 292 WEKEFGKNTY-YLSDSFNEMELPVAKDDVEGKHKLLAQYGESIYRSITAGNPDAIWVTQG 350

Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
           W F Y   FW    ++ALL+ VP  K++++DL  +  K +W T +       FYG  +I+
Sbjct: 351 WTFGYQHSFWDKASLQALLSHVPDDKMIIIDLGNDYPKWVWGTEQTWKVHDGFYGKKWIF 410

Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
             + NF G   M G L   A    EA  SE +  ++G G + EG+E N VVY+L+++M +
Sbjct: 411 SYVPNFGGKTPMTGDLQMYASSSAEALQSESHGNLIGFGSAPEGLENNEVVYELLADMGW 470

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
             + +D+  W+  Y + RYG     ++DAW++   T Y+            ++  PD   
Sbjct: 471 TDQAIDLDKWMPSYCMARYGAYPETMKDAWDLFRKTAYSSLYSYPRFTWQTVI--PD--- 525

Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
                             K  + K + S            + +  +ELF+ S + L  S 
Sbjct: 526 ------------------KRRISKIDVS-----------DDFLHGVELFLNSADSLKNSK 556

Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
            Y  D I+     +A  A++L+   +    +  +    Q   + ++++ ++D LLA H  
Sbjct: 557 LYVNDAIEFASYYIAAKADKLYGKALAEDTVGRSAVAQQYLNQTIDMLLNVDKLLASHPL 616

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           + L  W+  A+       ++  YE NA+  IT W            DY  ++WSGL++DY
Sbjct: 617 YRLEEWVNFARNSGTTPAEKDAYEINAKRLITTW-------GGFQEDYAARFWSGLIKDY 669

Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWI 504
           Y PR  IYF     SL+        +W  EWI
Sbjct: 670 YIPRLKIYFSKQRGSLD--------NWEEEWI 693


>gi|311064845|ref|YP_003971571.1| beta-N-hexosaminidase [Bifidobacterium bifidum PRL2010]
 gi|310867165|gb|ADP36534.1| Beta-N-hexosaminidase [Bifidobacterium bifidum PRL2010]
          Length = 1923

 Score =  203 bits (517), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 148/501 (29%), Positives = 237/501 (47%), Gaps = 46/501 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGR 62
           V+  F G VPA  Q   P++     G W      P    TYL DA      +  F ++G 
Sbjct: 403 VIQGFGGQVPADFQEKNPTSVAASSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGD 461

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F + Q   +G+ S+ Y  D F E     D  + I  +   +   M   D  AVW+MQ W
Sbjct: 462 TFYKAQENVFGKVSNYYAVDPFHEGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQW 520

Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
            +  D      ++  L +    G+ +VLDL ++++   +++ +  GVP++W MLHNF G 
Sbjct: 521 QWGIDE----TKLSGLADK---GQALVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGR 572

Query: 183 IEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
           + + G+ + I+    +A  S    M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W
Sbjct: 573 MGLDGVPEVISQDITKAYNSSGY-MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSW 631

Query: 243 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEG 300
             +Y+ RRYG +   I+ AW++L  T Y  TDG      +  +I A P  D +I S    
Sbjct: 632 TQEYAERRYGGTDGTIEKAWDILLDTAYKHTDGEYYQGASESIINARPS-DNTIGSA--- 687

Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
                              S++ H  + Y   +  +A  LF  + +    S  +RYD +D
Sbjct: 688 -------------------STWGHSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVD 728

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           + RQ LA    E      +AY+  D      LS R L++++  D LL+  D FL+G W++
Sbjct: 729 VMRQVLANSFQEYQPLAGQAYKSGDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWID 788

Query: 421 SAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
            A+ +    +      +E NAR  +T W  N  +  SL+ DY N+ W+GL  DYY  R  
Sbjct: 789 DARTMLDGADDWTADLFELNARALVTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWK 845

Query: 479 IYFKYMIESLESGDGFRLKDW 499
            Y    +  LE G  F   DW
Sbjct: 846 TYVDNRLNKLEHGTDFTDPDW 866


>gi|313140918|ref|ZP_07803111.1| alpha-N-acetylglucosaminidase family protein [Bifidobacterium
           bifidum NCIMB 41171]
 gi|313133428|gb|EFR51045.1| alpha-N-acetylglucosaminidase family protein [Bifidobacterium
           bifidum NCIMB 41171]
          Length = 2005

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 148/501 (29%), Positives = 237/501 (47%), Gaps = 46/501 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGR 62
           V+  F G VPA  Q   P++     G W      P    TYL DA      +  F ++G 
Sbjct: 485 VIQGFGGQVPADFQEKNPTSVAASSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGD 543

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F + Q   +G+ S+ Y  D F E     D  + I  +   +   M   D  AVW+MQ W
Sbjct: 544 TFYKAQESVFGKVSNYYAVDPFHEGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQW 602

Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
            +  D      ++  L +    G+ +VLDL ++++   +++ +  GVP++W MLHNF G 
Sbjct: 603 QWGIDE----TKLSGLADK---GQALVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGR 654

Query: 183 IEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
           + + G+ + I+    +A  S    M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W
Sbjct: 655 MGLDGVPEVISQDITKAYNSSGY-MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSW 713

Query: 243 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEG 300
             +Y+ RRYG +   I+ AW++L  T Y  TDG      +  +I A P  D +I S    
Sbjct: 714 TQEYAERRYGGTDGTIEKAWDILLDTAYKHTDGEYYQGASESIINARPS-DNTIGSA--- 769

Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
                              S++ H  + Y   +  +A  LF  + +    S  +RYD +D
Sbjct: 770 -------------------STWGHSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVD 810

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           + RQ LA    E      +AY+  D      LS R L++++  D LL+  D FL+G W++
Sbjct: 811 VMRQVLANSFQEYQPLAGQAYKSGDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWID 870

Query: 421 SAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
            A+ +    +      +E NAR  +T W  N  +  SL+ DY N+ W+GL  DYY  R  
Sbjct: 871 DARTMLDGADDWTADLFELNARALVTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWK 927

Query: 479 IYFKYMIESLESGDGFRLKDW 499
            Y    +  LE G  F   DW
Sbjct: 928 TYVDNRLNKLEHGTDFTDPDW 948


>gi|390937398|ref|YP_006394957.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum BGN4]
 gi|389891011|gb|AFL05078.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum BGN4]
          Length = 1957

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 148/501 (29%), Positives = 236/501 (47%), Gaps = 46/501 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGR 62
           V+  F G VPA  Q   P++     G W      P    TYL DA      +  F ++G 
Sbjct: 437 VIQGFGGQVPADFQEKNPTSVAASSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGD 495

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F + Q   +G+ S+ Y  D F E     D  + I  +   +   M   D  AVW+MQ W
Sbjct: 496 TFYKAQENVFGKVSNYYAVDPFHEGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQW 554

Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
            +  D      ++  L +    G+ +VLDL ++++   ++  +  GVP++W MLHNF G 
Sbjct: 555 QWGIDE----TKLSGLADK---GQALVLDLQSDLRS-QASPMENQGVPWVWNMLHNFGGR 606

Query: 183 IEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
           + + G+ + I+    +A  S    M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W
Sbjct: 607 MGLDGVPEVISQDITKAYNSSGY-MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSW 665

Query: 243 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEG 300
             +Y+ RRYG +   I+ AW++L  T Y  TDG      +  +I A P  D +I S    
Sbjct: 666 TQEYAERRYGGTDGTIEKAWDILLDTAYKHTDGEYYQGASESIINARPS-DNTIGSA--- 721

Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
                              S++ H  + Y   +  +A  LF  + +    S  +RYD +D
Sbjct: 722 -------------------STWGHSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVD 762

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           + RQ LA    E      +AY+  D      LS R L++++  D LL+  D FL+G W++
Sbjct: 763 VMRQVLANSFQEYQPLAGQAYKSGDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWID 822

Query: 421 SAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
            A+ +    +      +E NAR  +T W  N  +  SL+ DY N+ W+GL  DYY  R  
Sbjct: 823 DARTMLDGADDWTADLFELNARALVTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWK 879

Query: 479 IYFKYMIESLESGDGFRLKDW 499
            Y    +  LE G  F   DW
Sbjct: 880 TYVDNRLNKLEHGTDFTDPDW 900


>gi|291086028|ref|ZP_06354661.2| alpha-N-acetylglucosaminidase family protein [Citrobacter youngae
           ATCC 29220]
 gi|291069185|gb|EFE07294.1| alpha-N-acetylglucosaminidase family protein [Citrobacter youngae
           ATCC 29220]
          Length = 1014

 Score =  202 bits (515), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 142/519 (27%), Positives = 239/519 (46%), Gaps = 43/519 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V P F+G VP       P A++   G+W      P    TY+    D  F ++   + + 
Sbjct: 363 VFPGFAGQVPDTFAAKNPQAQVIDQGDWVGFVRPPM-LRTYVKQGAD-YFSKVADVYYQT 420

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
               +G  SH Y  D F E     D    +  +   + + M   D DAVW++Q W     
Sbjct: 421 LKTTFGDISHYYAVDPFHEGGNRADLD--MVKVAQTVQNKMLEHDKDAVWIIQNW----- 473

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
              +     A LN +     ++LDL+A+ KP  +   +F   P+IW MLH F G +   G
Sbjct: 474 ---QENPTDAFLNGLKKDHALILDLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSG 530

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
           + + +A   +    +E+  M GVG++ E +  NP++Y+++ +MA++   +   A+I+ + 
Sbjct: 531 MPEVLA-QEIPQSLAESKYMKGVGVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWL 589

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
             RYG   P I+ AW+++  T Y+      D+ R         + SII    G    +G 
Sbjct: 590 TSRYGAQSPEIEQAWDIMVKTAYHRR---KDRQR--------AEDSIIDAKPG----FG- 633

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
            V++     +    YD        +E  + L L+++  +    +  Y++DL+D+TRQ LA
Sbjct: 634 -VTRACTYYTALIDYDK-------AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLA 685

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL-- 425
             + E +    +A+   D     QLS +FL L++  D +L     F+LG WL SA+ +  
Sbjct: 686 NASYEYYRAFEDAWMAKDYSAFNQLSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTMLD 745

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
             ++    Q+E+NAR  +T W      +A  LRDY N+ W GL  D+Y  R A + + + 
Sbjct: 746 GMDDWTRDQFEFNARAMVTTWGIEQAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQALK 804

Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGR-NVYPVESNG 523
            +  +G   +    +  W  L   W N   N YP + +G
Sbjct: 805 NAAATGQ--KQDAIKVNWFPLEYRWVNQTGNGYPTQPSG 841


>gi|268533054|ref|XP_002631655.1| Hypothetical protein CBG20846 [Caenorhabditis briggsae]
          Length = 712

 Score =  202 bits (514), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 128/486 (26%), Positives = 232/486 (47%), Gaps = 50/486 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LP FSG VP  L+ +FP++K  +L  W +  S+    C   +   DPLF +IG +F+  
Sbjct: 244 ILPTFSGFVPDRLEKLFPTSKFNRLPCWNNFTSET--SCLLSVSPFDPLFQKIGSSFLRH 301

Query: 68  QLKEYG-RTSHIYNCDTFDENTPPVDS---PEYISSLGAAIYSGMQSGDSDAVWLMQGWL 123
           Q K  G   +++Y+ D F+E  P   +    +++     AI +  +  D + +W++Q W 
Sbjct: 302 QKKMLGGDITNLYSADPFNEVLPSDSAKFDAKFVKQTAQAIMNSCRKVDKNCIWVLQSWS 361

Query: 124 FSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
           F+YD  W    +K+ L++VP+G++++LDL++EV P W  +  F+G  ++WCMLHNF G+ 
Sbjct: 362 FTYDQ-WPNWAIKSFLSAVPIGQMLILDLYSEVVPAWQMTSSFHGHNFVWCMLHNFGGSR 420

Query: 184 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 243
           E+ G +  +  G   A     + +VG G+SME I+QN ++Y  M +  +  E + + +W+
Sbjct: 421 ELRGNVQKVDKGYQLALMKAGSNLVGAGLSMEAIDQNYMMYQFMIDRMWTQEPIPLNSWL 480

Query: 244 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
             YS  RY          W +L  + YN  +   +    V +                  
Sbjct: 481 KSYSESRYSADFKVAHKFWTILAGSFYNQPEKWGNPRFSVFL------------------ 522

Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALE-LFIASGNELSASNTYRYDLIDLT 362
            Y +P   + +             W+   E    LE L ++  + L     ++ DL D+ 
Sbjct: 523 -YHRPAFGKKI-----------EYWFPVEETFTHLESLVLSLLHILGDHPLFKEDLNDVM 570

Query: 363 RQALAKY--ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           R A+ ++   NE  L++ EA+ + D   +       + + + ++     +    +  W+E
Sbjct: 571 R-AITQFEIGNEAALSLTEAFLMEDKQQIGTTCENLMGMFQKLEP----YSNRDVRDWIE 625

Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
            AK +A   E+ + +  +A   +T+W    Q       DY ++ W+GLL  YYG R   +
Sbjct: 626 DAKSIAPTTEEREVFPISASDILTVWGPTGQN-----LDYAHREWAGLLSGYYGRRWQYF 680

Query: 481 FKYMIE 486
             +++E
Sbjct: 681 CDWILE 686


>gi|440731409|ref|ZP_20911430.1| N-acetylglucosaminidase, partial [Xanthomonas translucens DAR61454]
 gi|440373101|gb|ELQ09870.1| N-acetylglucosaminidase, partial [Xanthomonas translucens DAR61454]
          Length = 732

 Score =  202 bits (513), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 156/557 (28%), Positives = 239/557 (42%), Gaps = 83/557 (14%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSG VP A     P A+I ++  W           TY LD  DPLF +I + FI+ 
Sbjct: 205 VLPAFSGYVPKAFAQAHPQARIYRMRAWEGFHE------TYWLDPADPLFTKIAQRFIQL 258

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDS----------------------PEY--------I 97
             + YG+ ++ Y  D F+E  PP+ +                      PE         +
Sbjct: 259 YDRTYGKGTY-YLADAFNEMLPPIAADGSDARLASYGDSTANTAKTAPPEVSPAQRDKRL 317

Query: 98  SSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEV 156
           +  G A+Y  +   + DAVW+MQGWLF  D  FW P  + A L  VP  KL+VLD+  + 
Sbjct: 318 ADYGRALYESIHRANPDAVWVMQGWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDR 377

Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
            P  W  S  F G  +I+  +HN+ G+  +YG    +AF   + R      +   +VG G
Sbjct: 378 YPGTWKLSDAFDGKQWIYGYVHNYGGSNPVYG---DLAFYRDDLRALLADKDKQQLVGFG 434

Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
              EG+  N VVY+ M  +A+  ++  ++ W+  Y+  RYG + PA++ AW+ L   V +
Sbjct: 435 AFPEGLHDNSVVYEYMYTLAWGGQQRSLQDWLGDYTRARYGHTSPALRAAWDDLQAAVLS 494

Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
                               P       G Y  + +P     + + E +  D P L    
Sbjct: 495 TRYWT---------------PRWWRSRAGAYLLFKRPTLD--IGEFEGAPGDPPRL---- 533

Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 391
               RAL+  +A   E + +  YRYDL+D  R       +      + AY+  D      
Sbjct: 534 ---RRALDQLLALAPEYADAPLYRYDLVDFARHYATGRVDAQLQQAVAAYRRGDVAAGDA 590

Query: 392 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 451
              R    V+ +DGL+      +L  WL  A+  A+  +    Y  +A+ QI++W     
Sbjct: 591 AFARVQAAVQQLDGLVGGQQE-ILSSWLGDAEGDAKTPQDAAYYRRDAKAQISVWGGEGN 649

Query: 452 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREWI 504
                L DY +K W G+  DYY PR A+  + +  +  SG          RL+ W R+W+
Sbjct: 650 -----LGDYASKAWQGMYADYYLPRWALAMQALRAAAVSGGSVDEAALQQRLRVWERDWV 704

Query: 505 KLTNDWQNGRNVYPVES 521
                +       PV +
Sbjct: 705 ACETPYTRRAPADPVAA 721


>gi|210611122|ref|ZP_03288736.1| hypothetical protein CLONEX_00926, partial [Clostridium nexile DSM
            1787]
 gi|210152109|gb|EEA83116.1| hypothetical protein CLONEX_00926 [Clostridium nexile DSM 1787]
          Length = 1662

 Score =  201 bits (512), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 152/541 (28%), Positives = 249/541 (46%), Gaps = 71/541 (13%)

Query: 8    VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
            VL  +SG VP  + +  PSA++ + G W S +         +L      F +  + F + 
Sbjct: 722  VLQGYSGMVPVDITDKDPSAQVIKQGTWCSFQRPS------MLKTDSETFDKYAQLFYKV 775

Query: 68   QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
            Q + YG  S  Y  D F E  NT  + SP  I+     + + M   D + +W++Q W   
Sbjct: 776  QKEVYGDVSDYYATDPFHEGGNTGGM-SPTVIAE---KVLANMMEADENGIWIIQSW--- 828

Query: 126  YDPFWRPPQMKALLNSVPLGK--LVVLDLFAEVKPIWSTS-----------KQFYGVPYI 172
                 +     ALL  +   +   +VLDL+AE  P W+ +            +F   P++
Sbjct: 829  -----QGNPSTALLQGLDAARDHALVLDLYAEKTPHWNETDPGSYGGAEGGGEFLNTPWV 883

Query: 173  WCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
            +CML+NF G + ++G +++   G  +A    N  M G+G++ E    NPV+YDL  E  +
Sbjct: 884  YCMLNNFGGRLGLHGHIENFVNGVAQAAAQAN-HMAGIGITPEASVNNPVLYDLFFETIW 942

Query: 233  QHE-----KVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKN--RDVIV 285
              +      +++  W   Y+ RRYG    +  +A  +L  TVYN       +     V+ 
Sbjct: 943  SDDGENLSAINLDEWFKDYTTRRYGAESQSAYEAMQILNDTVYNPEMNMKGQGAPESVVN 1002

Query: 286  AFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASG 345
            A P +D        G    +G  V                 + Y  +E+ +A  L +   
Sbjct: 1003 ARPGLDI-------GAASTWGNAV-----------------IDYDKAELEKAAALLLKDY 1038

Query: 346  NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDG 405
            ++L  S  Y+YDL ++  Q L+  A E    + +A++  DA    ++S  FLE++  ++ 
Sbjct: 1039 DKLKDSAGYQYDLANVLEQVLSNTAQEYQKKMADAFREGDAEKFEKMSNSFLEIITKVEE 1098

Query: 406  LLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNK 463
            +    + F+LG WLESAK LA+N +   ++ YE NAR  IT W    Q  +  L DY N+
Sbjct: 1099 VTGTQEEFMLGTWLESAKALAKNADDFTKELYELNARGLITTWGSIEQANSGGLIDYSNR 1158

Query: 464  YWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR-REWIKLTNDWQNGRNVYPVESN 522
             WSGL  DYY PR   + K++ E  +   G   K++   +W ++   W    N YP ++N
Sbjct: 1159 QWSGLTSDYYKPR---WEKWIAERKKELAGEESKNYSAADWFEMEWAWARSNNEYPTKAN 1215

Query: 523  G 523
            G
Sbjct: 1216 G 1216


>gi|288927801|ref|ZP_06421648.1| putative alpha-N-acetylglucosaminidase
           (N-acetyl-alpha-glucosaminidase) (NAG) [Prevotella sp.
           oral taxon 317 str. F0108]
 gi|288330635|gb|EFC69219.1| putative alpha-N-acetylglucosaminidase
           (N-acetyl-alpha-glucosaminidase) (NAG) [Prevotella sp.
           oral taxon 317 str. F0108]
          Length = 723

 Score =  201 bits (511), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 147/513 (28%), Positives = 238/513 (46%), Gaps = 69/513 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           + P F+G VP AL+ ++P+A I Q   W     +      Y+L   DPLF +IG  FI++
Sbjct: 235 ICPGFAGFVPEALKELYPTADI-QYTTWEKAFHN------YILSPADPLFHKIGVMFIQE 287

Query: 68  QLKEYGRTSHIYNCDTFDENT---PPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQG 121
             KE+GR    Y  D+F+E     PP D P   E+++  G  +Y  ++  +  A W+MQG
Sbjct: 288 WEKEFGRCD-FYLIDSFNEMDIPFPPKDDPKRYEFMADFGKKVYQCIKEANPSATWVMQG 346

Query: 122 WLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTS------KQFYGVPYIW 173
           W+F Y P  W    + AL++ VP  K+++LDL A+  K +W T       K F G  +I+
Sbjct: 347 WMFGYQPEIWDYKTLNALVSQVPDNKMIMLDLAADYNKFLWKTPFNWDFYKGFCGKQWIY 406

Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAF 232
            ++ N  G   + G LD  A G +EA  S+N   ++G G + EGIE N VVY+L+ +  +
Sbjct: 407 SVIPNMGGKSALTGALDFYAKGHLEALNSQNRGKLIGFGFAPEGIENNEVVYELLCDAGW 466

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
             + V+++ W+  Y+  RYG     ++  WN +  +VY                      
Sbjct: 467 AKQGVELRPWLRNYTYSRYGCYPIGMEQYWNEMIQSVY---------------------- 504

Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
                  G ++++ +   +    K +  S D  + +Y   E++  + L    GN+L    
Sbjct: 505 -------GSFKSHPRFNWQFRPGKEKYGSVDLDNHFYHAVEIMAGM-LSQMKGNKL---- 552

Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
            +  D  ++    L      L   I +AY+  D     QL  RF  L+  MD +L  H  
Sbjct: 553 -FEADFKEMAANYLGGKVEILVRQIDKAYESQDTINANQLETRFYRLMTGMDLVLQGHPT 611

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
             +  W++ A+    +  +   YE NAR  +T+W          + DY  + W+GL+RDY
Sbjct: 612 KDMQKWIDYARARGVSYNKADCYESNARRIVTVW-------GPPIDDYSARIWAGLIRDY 664

Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 505
           Y PR   YF        SG  F    W  ++++
Sbjct: 665 YLPRWKHYF----NQKRSGKPFDFSTWELDFVE 693


>gi|380512475|ref|ZP_09855882.1| N-acetylglucosaminidase [Xanthomonas sacchari NCPPB 4393]
          Length = 785

 Score =  201 bits (511), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 149/540 (27%), Positives = 235/540 (43%), Gaps = 83/540 (15%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP A     P A+I ++  W           TY LD  DPLF +I + FI+ 
Sbjct: 258 VLPAFAGYVPKAFAQAHPQARIYRMRAWEGFHE------TYWLDPADPLFAQIAQRFIQL 311

Query: 68  QLKEYGRTSHIYNCDTFDE------------------------------NTPPVDSPEYI 97
             + YG+ ++ Y  D F+E                                PPV   + +
Sbjct: 312 YDRTYGKGTY-YLADAFNEMLPPIAADGSDARLASYGDSTANTAKTKPPEVPPVQRDKRL 370

Query: 98  SSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEV 156
           ++ G A+Y+ +   + DAVW+MQGWLF  D  FW P  + A L  VP  KL+VLD+  + 
Sbjct: 371 AAYGRALYASIHRANPDAVWVMQGWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDR 430

Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
            P  W  S  F G  +I+  +HN+ G+  +YG    +AF   + R      +   +VG G
Sbjct: 431 YPGTWKLSDAFDGKQWIYGYVHNYGGSNPVYG---DLAFYREDLRALLADKDKQQLVGFG 487

Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
              EG+    VVY+ M  +A+  ++  ++ W++ Y+  RYG + PA++ AW+ L  +V +
Sbjct: 488 AFPEGLHTTSVVYEYMYALAWGAQQRPLQDWLDDYTRARYGHTSPALRAAWDDLQASVLS 547

Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
                               P       G Y  + +P     + + E +  D P L    
Sbjct: 548 TRYWT---------------PRWWRSRAGAYLLFKRPTLD--IGEFEGAPGDPPRL---- 586

Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 391
               RAL+  +A   E + +  YRYDL+D  R       +      + AY+  D      
Sbjct: 587 ---RRALQQLLALAPEYADAPLYRYDLVDFARHYATGRVDVQLQQAVAAYRRGDVAAGDA 643

Query: 392 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 451
            + R  E V  +D L+       L  WL++A   A   +    Y  +A+ Q+++W     
Sbjct: 644 ATARVREAVTQLDSLVGGQQD-TLSSWLDAAAGYATTPQDAAYYRRDAKAQVSVWGGEGN 702

Query: 452 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREWI 504
                L DY +K W G+  DYY PR  +  + + E+  +G          RL+ W R+W+
Sbjct: 703 -----LGDYASKAWQGMYADYYLPRWTLALQMLSEAAVAGGSVDEAQLQQRLRAWERDWV 757


>gi|282881077|ref|ZP_06289764.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella timonensis CRIS
           5C-B1]
 gi|281304881|gb|EFA96954.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella timonensis CRIS
           5C-B1]
          Length = 688

 Score =  201 bits (511), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 140/485 (28%), Positives = 225/485 (46%), Gaps = 46/485 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V+  F G VP+ L + +P A +   G W   +         +L     LF  + +A+ + 
Sbjct: 201 VVQGFPGIVPSFLHDKYPKACVVSQGKWNGFQRPS------ILLPQSQLFYCMAKAYYDN 254

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
             + YG     +  D F E  N   VD    +SS  + +   M S   DA W++QGW  +
Sbjct: 255 MKRYYGTDLRYFGGDLFHEGGNAKGVD----LSSTASKVQKCMLSHFPDAKWVLQGWNGN 310

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             P        ALL  +    +++++L  E+   W  S +F   P+IW  +++F G  +M
Sbjct: 311 PSP--------ALLAGLDKKHVLLINLAGEIDASWKQSDEFGQTPWIWGSVNHFGGKTDM 362

Query: 186 YGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
            G L  +   P  A   S++  + G+G+  EGI  NPVVYDL  + A+      V   + 
Sbjct: 363 GGQLPVLVEQPHRALAASQHGRLKGLGILPEGIHTNPVVYDLALQTAWSDTVPSVDHLLR 422

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           QY   RYG     +  AW +L  +VY                            EG Y++
Sbjct: 423 QYIWYRYGTWNDDLYRAWQLLASSVYG---------------------EFEVKGEGTYES 461

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                ++ ++  S  S++    + Y   ++++AL LF  +      S TY YDL+DL RQ
Sbjct: 462 VF--CARPSLHVSSVSTWGPKKMQYQPEKLLQALVLFRKAAVHFKGSETYEYDLVDLARQ 519

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            +A  A  ++  ++ AY   D+  + + S  FL L++  D LL+ +  FLLG WL++A+Q
Sbjct: 520 VMANNARNVYNQVVHAYNEKDSLALNRYSSTFLHLIDLQDSLLSTNKFFLLGKWLQAARQ 579

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
             +NE+ ++Q   NART I+ W     +  + L DY NK W+GLL+ YY PR   +F  +
Sbjct: 580 YGENEQDQRQALVNARTLISYW--GPDDATTRLHDYANKEWAGLLKQYYAPRWRAFFAML 637

Query: 485 IESLE 489
              L 
Sbjct: 638 AGQLR 642


>gi|333031147|ref|ZP_08459208.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
 gi|332741744|gb|EGJ72226.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
          Length = 721

 Score =  200 bits (509), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 140/531 (26%), Positives = 243/531 (45%), Gaps = 73/531 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           + PAF+G VP A     P      +  W     DP +   Y+L    P F EIG+ FIE+
Sbjct: 236 IAPAFAGFVPMAFAEKHPDINFKHM-RWGGF--DPEYNA-YVLPPDSPFFEEIGKLFIEE 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
              E+G  ++ Y  D+F+E   P+D  +       +   G +IY  + +G+ +A+W+ QG
Sbjct: 292 WENEFGSNTY-YLSDSFNEMELPIDKDDTEGKYRLLRQYGESIYKSISAGNPEAIWVTQG 350

Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIW 173
           W F Y   FW    ++ALL++VP  K++++DL  +        +  W     FYG  +I+
Sbjct: 351 WTFGYQHSFWDTTSLQALLSNVPNEKMIIIDLGNDYPKWVWNTEQTWKVQNGFYGKGWIF 410

Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAF 232
             + NF G   M G +   A    EA  S N   ++G G + EG+E N V+Y+L+++M +
Sbjct: 411 SYVPNFGGKTTMTGDMQMYATSSAEALASPNKGNLIGFGSAPEGLENNEVIYELLADMGW 470

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
             E +++  W+  Y + RYG     +Q AW +   TVY+            + ++P    
Sbjct: 471 TSESINLDEWMQSYCLSRYGGYPENVQKAWELFRKTVYSN-----------LYSYP---- 515

Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
                       Y      E  L+    +        ++ E +  +ELF+++ NEL  S 
Sbjct: 516 -----------RYTWQTVVEDTLRINKIN--------TSDEFLIGVELFVSAVNELKDSE 556

Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
            Y  DLI+ +    A  A++++   +  ++  +      L  + ++++  +D LLA H  
Sbjct: 557 LYVNDLIEFSSFYAAAKADKIYKEALILFERGNKKEARSLLNQSIQILLKVDKLLASHPI 616

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
           + L  W++ A+       ++  +E NA+  IT W         +  DY  ++WSGL++DY
Sbjct: 617 YRLEEWVKYARNSGSTVAEKDAFEANAKRLITTW-------GGIQDDYAARFWSGLIKDY 669

Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
           Y PR  + F        S +   L+ W   W+  +  W N    +  P+E+
Sbjct: 670 YIPRMELNF--------SSERNSLRQWEENWV--STPWNNPTQPFDNPIEA 710


>gi|310287970|ref|YP_003939229.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum S17]
 gi|309251907|gb|ADO53655.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum S17]
          Length = 1923

 Score =  200 bits (508), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 147/501 (29%), Positives = 235/501 (46%), Gaps = 46/501 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGR 62
           V+  F G VPA  Q   P++     G W      P    TYL DA      +  F ++G 
Sbjct: 403 VIQGFGGQVPADFQEKNPTSVAASSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVGD 461

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F + Q   +G+ S+ Y  D F E     D  + I  +   +   M   D  AVW+MQ W
Sbjct: 462 TFYKAQESVFGKVSNYYAVDPFHEGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQW 520

Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
            +  D      ++  L +    G+ +VLDL ++++   ++  +  GVP++W MLHNF G 
Sbjct: 521 QWGIDE----TKLSGLADK---GQALVLDLQSDLRS-QASPMENQGVPWVWNMLHNFGGR 572

Query: 183 IEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
           + + G+ + I+    +A  S    M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W
Sbjct: 573 MGLDGVPEVISQDITKAYNSSGY-MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSW 631

Query: 243 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEG 300
             +Y+ RRYG +   I+ AW++L  T Y   DG      +  +I A P  D +I S    
Sbjct: 632 TQEYAERRYGGTDGTIEKAWDILLDTAYKHMDGEYYQGASESIINARPS-DNTIGSA--- 687

Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
                              S++ H  + Y   +  +A  LF  + +    S  +RYD +D
Sbjct: 688 -------------------STWGHSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVD 728

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           + RQ LA    E      +AY+  D      LS R L++++  D LL+  D FL+G W++
Sbjct: 729 VMRQVLANSFQEYQPLAGQAYKSGDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWID 788

Query: 421 SAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
            A+ +    +      +E NAR  +T W  N  +  SL+ DY N+ W+GL  DYY  R  
Sbjct: 789 DARTMLDGADDWTADLFELNARALVTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWK 845

Query: 479 IYFKYMIESLESGDGFRLKDW 499
            Y    +  LE G  F   DW
Sbjct: 846 TYVDNRLNKLEHGTDFTDPDW 866


>gi|421736727|ref|ZP_16175487.1| alpha-N-acetylglucosaminidase, partial [Bifidobacterium bifidum
           IPLA 20015]
 gi|407295984|gb|EKF15606.1| alpha-N-acetylglucosaminidase, partial [Bifidobacterium bifidum
           IPLA 20015]
          Length = 1044

 Score =  200 bits (508), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 146/501 (29%), Positives = 234/501 (46%), Gaps = 46/501 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGR 62
           V+  F G VPA  Q   P++     G W      P    TYL DA      +  F ++  
Sbjct: 375 VIQGFGGQVPADFQEKNPTSVAASSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVCD 433

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
            F + Q   +G+ S+ Y  D F E     D  + I  +   +   M   D  AVW+MQ W
Sbjct: 434 TFYKAQENVFGKVSNYYAVDPFHEGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQW 492

Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
            +  D      ++  L +    G+ +VLDL ++++   ++  +  GVP++W MLHNF G 
Sbjct: 493 QWGIDE----TKLSGLADK---GQTLVLDLQSDLRSQ-ASPMENQGVPWVWNMLHNFGGR 544

Query: 183 IEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
           + + G+ + I+    +A  S    M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W
Sbjct: 545 MGLDGVPEVISQDITKAYNSSGY-MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSW 603

Query: 243 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEG 300
             +Y+ RRYG +   I+  W++L  T Y  TDG      +  +I A P  D +I S    
Sbjct: 604 TQEYAERRYGGTDGTIEKVWDILLDTAYKHTDGEYYQGASESIINARPS-DNTIGSA--- 659

Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
                              S++ H  + Y   +  +A  LF  + +    S  +RYD +D
Sbjct: 660 -------------------STWGHSDIDYDKRQFEKAAALFEQAYDSYKNSAGFRYDYVD 700

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           + RQ LA    E      +AY+  D      LS R L++++  D LL+  D FL+G W++
Sbjct: 701 VMRQVLANSFQEYQPLAGQAYKSGDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWID 760

Query: 421 SAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
            A+ +    +      +E NAR  +T W  N  +  SL+ DY N+ W+GL  DYY  R  
Sbjct: 761 DARTMLDGADDWTADLFELNARALVTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWK 817

Query: 479 IYFKYMIESLESGDGFRLKDW 499
            Y    +  LE G  F   DW
Sbjct: 818 TYVDNRLNKLEHGTDFTDPDW 838


>gi|281424178|ref|ZP_06255091.1| N-acetylglucosaminidase [Prevotella oris F0302]
 gi|281401447|gb|EFB32278.1| N-acetylglucosaminidase [Prevotella oris F0302]
          Length = 723

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 146/513 (28%), Positives = 237/513 (46%), Gaps = 69/513 (13%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           + P F+G VP AL+ ++P+A I Q   W     +      Y+L   DPLF +IG  FI++
Sbjct: 235 ICPGFAGFVPEALKELYPTADI-QYTTWEKAFHN------YILSPADPLFHKIGVMFIQE 287

Query: 68  QLKEYGRTSHIYNCDTFDENT---PPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQG 121
             KE+GR    Y  D+F+E     PP D P   E+++  G  +Y  ++  +  A W+MQG
Sbjct: 288 WEKEFGRCD-FYLIDSFNEMDIPFPPKDDPKRYEFMADFGKKVYQCIKEANPSATWVMQG 346

Query: 122 WLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTS------KQFYGVPYIW 173
           W+F Y P  W    + AL++ VP  K+++LDL  +  K +W T       K F G  +I+
Sbjct: 347 WMFGYQPEIWDYKTLNALVSQVPDNKMIMLDLAVDYNKFLWKTPFNWDFYKGFCGKQWIY 406

Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAF 232
            ++ N  G   + G LD  A G +EA  S+N   ++G G + EGIE N VVY+L+ +  +
Sbjct: 407 SVIPNMGGKSALTGALDFYAKGHLEALNSQNRGKLIGFGFAPEGIENNEVVYELLCDAGW 466

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
             + V+++ W+  Y+  RYG     ++  WN +  +VY                      
Sbjct: 467 AKQGVELRPWLRNYTYSRYGCYPIGMEQYWNEMLQSVY---------------------- 504

Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
                  G ++++ +   +    K +  S D  + +Y   E++  + L    GN+L    
Sbjct: 505 -------GSFKSHPRFNWQFRPGKEKYGSVDLDNHFYHAVEIMAGM-LSQMKGNKL---- 552

Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
            +  D  ++    L      L   I +AY+  D     QL  RF  L+  MD +L  H  
Sbjct: 553 -FEADFKEMAANYLGGKVEILVRQIDKAYESQDTINANQLETRFYRLMTGMDLVLQGHPT 611

Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
             +  W++ A+    +  +   YE NAR  +T+W          + DY  + W+GL+RDY
Sbjct: 612 KDMQKWIDYARARGVSYNKADCYESNARRIVTVW-------GPPIDDYSARIWAGLIRDY 664

Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 505
           Y PR   YF        SG  F    W  ++++
Sbjct: 665 YLPRWKHYF----NQKRSGKPFDFSTWELDFVE 693


>gi|432896403|ref|ZP_20107613.1| hypothetical protein A13U_00343 [Escherichia coli KTE192]
 gi|433031274|ref|ZP_20219108.1| hypothetical protein WIA_04388 [Escherichia coli KTE109]
 gi|431432398|gb|ELH14169.1| hypothetical protein A13U_00343 [Escherichia coli KTE192]
 gi|431538475|gb|ELI14460.1| hypothetical protein WIA_04388 [Escherichia coli KTE109]
          Length = 1049

 Score =  199 bits (506), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 140/519 (26%), Positives = 238/519 (45%), Gaps = 43/519 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V P F+G VP       P A++   G+W      P    TY+    D  F ++   + + 
Sbjct: 398 VFPGFAGQVPDTFAAKNPQAQVIDQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQT 455

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
               +G  SH Y  D F E     D    +  +   + + M   D +AVW++Q W     
Sbjct: 456 LKTTFGDISHYYAVDPFHEGGNRADLD--MVKVAQTVQNKMLEHDKNAVWIIQNW----- 508

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
              +       LN +     ++LDL+A+ KP  +   +F   P+IW MLH F G +   G
Sbjct: 509 ---QENPTDDFLNDLKKDHALILDLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSG 565

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
           + + +A   +    +E+  M GVG++ E +  NP++Y+++ +MA++   +   A+I+ + 
Sbjct: 566 MQEVLA-QEIPQSLAESKYMKGVGVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWL 624

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
             RYG   P I+ AW+++  T Y+      D+ R         + SII    G    +G 
Sbjct: 625 TSRYGAQSPEIEQAWDIMVKTAYHRR---KDRQR--------AEDSIIDAKPG----FG- 668

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
            V++     +    YD        +E  + L L+++  +    +  Y++DL+D+TRQ LA
Sbjct: 669 -VTRACTYYTALIDYDK-------AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLA 720

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL-- 425
             + E +    +A+   D     QLS +FL L++  D +L     F+LG WL SA+ +  
Sbjct: 721 NASYEYYRAFEDAWMAKDYSAFNQLSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTMLD 780

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
             ++    Q+E+NAR  +T W      +A  LRDY N+ W GL  D+Y  R A + + + 
Sbjct: 781 GMDDWTRDQFEFNARAMVTTWGTEQAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQTLK 839

Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGR-NVYPVESNG 523
            +  +G   +    +  W  L   W N   N YP + +G
Sbjct: 840 SAAATGQ--KQDAIKVHWFPLEYRWVNQTGNGYPTQPSG 876


>gi|331660873|ref|ZP_08361805.1| alpha-N-acetylglucosaminidase family protein [Escherichia coli
           TA206]
 gi|422369309|ref|ZP_16449711.1| f5/8 type C domain protein [Escherichia coli MS 16-3]
 gi|315298924|gb|EFU58178.1| f5/8 type C domain protein [Escherichia coli MS 16-3]
 gi|331051915|gb|EGI23954.1| alpha-N-acetylglucosaminidase family protein [Escherichia coli
           TA206]
          Length = 1052

 Score =  199 bits (506), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 140/519 (26%), Positives = 238/519 (45%), Gaps = 43/519 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V P F+G VP       P A++   G+W      P    TY+    D  F ++   + + 
Sbjct: 401 VFPGFAGQVPDTFAAKNPQAQVIDQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQT 458

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
               +G  SH Y  D F E     D    +  +   + + M   D +AVW++Q W     
Sbjct: 459 LKTTFGDISHYYAVDPFHEGGNRADLD--MVKVAQTVQNKMLEHDKNAVWIIQNW----- 511

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
              +       LN +     ++LDL+A+ KP  +   +F   P+IW MLH F G +   G
Sbjct: 512 ---QENPTDDFLNGLKKDHALILDLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSG 568

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
           + + +A   +    +E+  M GVG++ E +  NP++Y+++ +MA++   +   A+I+ + 
Sbjct: 569 MQEVLA-QEIPQSLAESKYMKGVGVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWL 627

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
             RYG   P I+ AW+++  T Y+      D+ R         + SII    G    +G 
Sbjct: 628 TSRYGAQSPEIEQAWDIMVKTAYHRR---KDRQR--------AEDSIIDAKPG----FG- 671

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
            V++     +    YD        +E  + L L+++  +    +  Y++DL+D+TRQ LA
Sbjct: 672 -VTRACTYYTALIDYDK-------AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLA 723

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL-- 425
             + E +    +A+   D     QLS +FL L++  D +L     F+LG WL SA+ +  
Sbjct: 724 NASYEYYRAFEDAWMAKDYSAFNQLSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTMLD 783

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
             ++    Q+E+NAR  +T W      +A  LRDY N+ W GL  D+Y  R A + + + 
Sbjct: 784 GMDDWTRDQFEFNARAMVTTWGTEQAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQTLK 842

Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGR-NVYPVESNG 523
            +  +G   +    +  W  L   W N   N YP + +G
Sbjct: 843 SAAATGQ--KQDAIKVHWFPLEYRWVNQTGNGYPTQPSG 879


>gi|161505009|ref|YP_001572121.1| hypothetical protein SARI_03139 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:- str. RSK2980]
 gi|160866356|gb|ABX22979.1| hypothetical protein SARI_03139 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:-]
          Length = 1014

 Score =  198 bits (504), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 138/519 (26%), Positives = 240/519 (46%), Gaps = 43/519 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V P F+G VP       P A++   G+W      P    TY+    D  F ++   + + 
Sbjct: 363 VFPGFAGQVPDTFAAKNPQAQVIDQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQT 420

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
               +G  SH Y  D F E     D    +  +   + + M   D DAVW++Q W     
Sbjct: 421 LKTTFGDISHYYAVDPFYEGGNRADL--NMVKVAQTVQNKMLEHDKDAVWIIQNW----- 473

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
              +     A LN +     ++LDL+A+ KP  +   +F   P+IW MLH F G +   G
Sbjct: 474 ---QENPTDAFLNGLKKDHALILDLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSG 530

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
           + + +A   +    +E+  M GVG++ E +  NP++Y+++ +MA++   +   A+I+ + 
Sbjct: 531 MPEVLA-QEIPQSLAESKYMKGVGVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHNWL 589

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
             RYG   P I+ AW+++  T Y+      D+ R         + SII    G    +G 
Sbjct: 590 TSRYGAQSPEIEQAWDIMVKTAYHRR---KDRQR--------AEDSIIDAKPG----FGV 634

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
                    +   +Y +  + Y  +E  + L L+++  +    +  Y++DL+D+TRQ LA
Sbjct: 635 ---------TRACTYYNALIDYDKAEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLA 685

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL-- 425
             + E +    +A+   D     QLS +FL L++  D +L+    F+LG W+ +++ +  
Sbjct: 686 NASYEYYRAFEDAWMAQDYSAFNQLSGKFLRLIKLQDKVLSTRPEFMLGNWINNSRTMLD 745

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
             ++    Q+E+NAR  +T W      +A  LRDY N+ W GL  D+Y  R A + + + 
Sbjct: 746 GMDDWTRDQFEFNARAMVTTWGTEQAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQALK 804

Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGR-NVYPVESNG 523
            +  +G   +    +  W  L   W N   N YP + +G
Sbjct: 805 TAAATGQ--KQDAIKVSWFPLEYRWVNQTGNGYPTQPSG 841


>gi|293402122|ref|ZP_06646261.1| alpha-N-acetylglucosaminidase family protein [Erysipelotrichaceae
           bacterium 5_2_54FAA]
 gi|291304514|gb|EFE45764.1| alpha-N-acetylglucosaminidase family protein [Erysipelotrichaceae
           bacterium 5_2_54FAA]
          Length = 2295

 Score =  198 bits (503), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 138/499 (27%), Positives = 245/499 (49%), Gaps = 57/499 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGR 62
           V+  FSG VP       P+A IT++ +W    + P     Y+ ++      + L+ ++ +
Sbjct: 435 VVQGFSGQVPDNFDKKQPTALITEMKDWVGY-TRPSIIQPYITESDAAKGKENLYPQVAK 493

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQ 120
            F + Q   +G  ++ Y  D F E   P  +D  E        + + M   +  AVW+MQ
Sbjct: 494 DFYDAQKNVFGNVTNYYATDPFHEGGNPSGLDFAETFKQ----VQTEMLKANEKAVWVMQ 549

Query: 121 GWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP---IWSTSKQFYGVPYIWCMLH 177
            W  + D          L   V   + + LDL  ++ P   +   S+     P++WCMLH
Sbjct: 550 QWQGNLDA-------TKLSGLVKPSQALALDLQTDLNPQNGVMENSE----TPWLWCMLH 598

Query: 178 NFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 237
           NF G + M G L ++A  P  A  +E+  M G+G++ E +E +PV Y+L+ +M +  + +
Sbjct: 599 NFGGRMGMDGNLPNVAKNPAIA-MNESKYMKGIGITPEALENSPVAYELLFDMTWTKDPI 657

Query: 238 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 297
           D  AWI +Y+ RR G +   +Q+AW +L  T Y    GA  +      ++     +II+ 
Sbjct: 658 DEDAWIAKYAQRRAGGTSEKLQEAWKILNETAY----GAKQE------SYQGAAETIINA 707

Query: 298 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 357
           T            +++   +  S++ H ++ Y   E  +AL+L I + ++  AS  YRYD
Sbjct: 708 T-----------PRDSFRSA--STWGHSNITYDKKEFEKALQLLIDNYDDFKASPAYRYD 754

Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
           L D+  Q L   A E    +++A   ++A    + S++FLE+++  D +L   + F++G 
Sbjct: 755 LADVANQVLCNVAIEYHSLMVKAKNESNADDFRKYSKKFLEIIDLSDEILGSSEEFMVGN 814

Query: 418 WLESAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASL--LRDYGNKYWSGLLRDYY 473
           W+  A+ +  + +   +  +E+NAR  +T W   + E +SL  L DY N+ W+GL +D+Y
Sbjct: 815 WINDARNMMSDGDDWTKDLFEFNARAMVTTW---SGERSSLNNLNDYSNRKWNGLTKDFY 871

Query: 474 GPRAAIYFKYMIESLESGD 492
           G R  ++ +     L+  D
Sbjct: 872 GKRWKVWIENRQAELDGKD 890


>gi|62318937|dbj|BAD94027.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
          Length = 182

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 94/181 (51%), Positives = 128/181 (70%), Gaps = 1/181 (0%)

Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
           ++DLTRQ L+K AN+++   + A+   D   + QLS +FLEL++DMD LLA  D  LLG 
Sbjct: 1   MVDLTRQVLSKLANQVYTEAVTAFVKKDIGSLGQLSEKFLELIKDMDVLLASDDNCLLGT 60

Query: 418 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 477
           WLESAK+LA+N ++ KQYEWNARTQ+TMW+D+     S L DY NK+WSGLL DYY PRA
Sbjct: 61  WLESAKKLAKNGDERKQYEWNARTQVTMWYDSNDVNQSKLHDYANKFWSGLLEDYYLPRA 120

Query: 478 AIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRN-VYPVESNGDALITSQWLYNKY 536
            +YF  M++SL     F+++ WRREWI +++ WQ   + VYPV++ GDAL  S+ L +KY
Sbjct: 121 RLYFNEMLKSLRDKKIFKVEKWRREWIMMSHKWQQSSSEVYPVKAKGDALAISRHLLSKY 180

Query: 537 L 537
            
Sbjct: 181 F 181


>gi|373451393|ref|ZP_09543318.1| hypothetical protein HMPREF0984_00360, partial [Eubacterium sp.
           3_1_31]
 gi|371968665|gb|EHO86120.1| hypothetical protein HMPREF0984_00360, partial [Eubacterium sp.
           3_1_31]
          Length = 2190

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 139/499 (27%), Positives = 246/499 (49%), Gaps = 57/499 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLD-----ATDPLFIEIGR 62
           V+  FSG VP       P+A IT++ +W    + P     Y+ +       + L+ ++ +
Sbjct: 427 VVQGFSGQVPDNFDKKQPTALITEMKDWVGY-TRPSIIQPYITENDAAKGKENLYPQVAK 485

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQ 120
            F + Q   +G  ++ Y  D F E   P  +D  E        + + M   +  AVW+MQ
Sbjct: 486 DFYDAQKNVFGNVTNYYATDPFHEGGNPSGLDFAETFKQ----VQTEMLKANEKAVWVMQ 541

Query: 121 GWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP---IWSTSKQFYGVPYIWCMLH 177
            W  + D      ++  LL      + + LDL  ++ P   +   S+     P++WCMLH
Sbjct: 542 QWQGNLDA----TKLSGLLKP---SQALALDLQTDLNPQNGVMENSE----TPWLWCMLH 590

Query: 178 NFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 237
           NF G + M G L ++A  P  A  +E+  M G+G++ E +E +PV Y+L+ +M +  + +
Sbjct: 591 NFGGRMGMDGNLPNVAKNPAIA-MNESKYMKGIGITPEALENSPVAYELLFDMTWTKDPI 649

Query: 238 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 297
           D  AWI +Y+ RR G +   +Q+AW +L  T Y    GA  +      ++     +II+ 
Sbjct: 650 DEDAWIAKYAQRRAGGTSEKLQEAWKILNETAY----GAKQE------SYQGAAETIINA 699

Query: 298 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 357
           T            +++   +  S++ H ++ Y   E  +AL+L I + ++  AS  YRYD
Sbjct: 700 T-----------PRDSFRSA--STWGHSNITYDKKEFEKALQLLIDNYDDFKASPAYRYD 746

Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
           L D+  Q L   A E    +++A   ++A    + S++FLE+++  D +L   + F++G 
Sbjct: 747 LADVADQVLCNVAIEYHSLMVKAKNESNADDFRKYSKKFLEIIDLSDEILGSSEEFMVGN 806

Query: 418 WLESAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASL--LRDYGNKYWSGLLRDYY 473
           W+  A+ +  + +   +  +E+NAR  +T W   + E +SL  L DY N+ W+GL +D+Y
Sbjct: 807 WINDARNMMSDGDDWTKDLFEFNARAMVTTW---SGERSSLNNLNDYSNRKWNGLTKDFY 863

Query: 474 GPRAAIYFKYMIESLESGD 492
           G R  I+ +     L+  D
Sbjct: 864 GKRWKIWIENRQAELDGKD 882


>gi|345014586|ref|YP_004816940.1| alpha-N-acetylglucosaminidase [Streptomyces violaceusniger Tu 4113]
 gi|344040935|gb|AEM86660.1| alpha-N-acetylglucosaminidase [Streptomyces violaceusniger Tu 4113]
          Length = 1044

 Score =  196 bits (498), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 142/526 (26%), Positives = 232/526 (44%), Gaps = 45/526 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V P + G VP    +  P A+    G+W  ++  P W     LD     F ++  AF   
Sbjct: 253 VFPGYFGTVPDGFADRNPEARTVPQGDWNGLRR-PDW-----LDPRTESFRKVAAAFYRH 306

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + +G  + ++  D   E   P D P  +     A+ + +++    A+W++ GW    +
Sbjct: 307 QRELFGE-AGLFKMDLLHEGGDPGDVP--VPDAARAVETALRTARPGAIWVILGW--QEN 361

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
           P       + LL++V   +++V+D  +++  +    K +  VPY +  + NF G   +  
Sbjct: 362 P------RRDLLDAVDHDRMLVVDGLSDLDTVTDREKDWGAVPYAFGTIPNFGGRTTIGA 415

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
                       R    + +VG     E  E++P  ++L SE+A++ E VD   W   Y+
Sbjct: 416 KTHMWTKRFTVWRDKPGSKLVGTAYMPEAAERDPAAFELFSELAWREEAVDRAEWFRSYA 475

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
             RYG      ++A+  L  T Y  +      +  V  A P       S+T     NY  
Sbjct: 476 EMRYGGRDAKAREAFAALRDTAYEISSKDGRPHDSVFAARP-------SLTARSGTNYA- 527

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
                    + T ++D         +V  A  L + +G  L  S+ YR+DL D+ RQALA
Sbjct: 528 ---------THTPAFDPAGF-----DVAFAALLGVRAG--LRDSDAYRHDLTDIARQALA 571

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
             + +L   + +AY   D      L+R +L+L+   D +   H  FLLGPWLE AK++A 
Sbjct: 572 NRSWQLIPQLQDAYDRKDRTAFRTLARLWLKLMRLSDDMTGAHRRFLLGPWLEDAKRMAS 631

Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
            +E+  + E  ART IT W D    +   L +Y N+ WSGL+ D++ P+   Y   + ++
Sbjct: 632 GDEESARLERAARTLITTWADRATADGGKLANYANRDWSGLIADFHLPQWQSYLDELEDA 691

Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLY 533
           L      R  D    W  +   W   R  YPV    DA  T+Q +Y
Sbjct: 692 LAENRPPRAFD----WFAVEEPWTRERTSYPVRPTTDAHRTAQRVY 733


>gi|308480701|ref|XP_003102557.1| hypothetical protein CRE_04113 [Caenorhabditis remanei]
 gi|308261289|gb|EFP05242.1| hypothetical protein CRE_04113 [Caenorhabditis remanei]
          Length = 718

 Score =  195 bits (495), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 128/489 (26%), Positives = 227/489 (46%), Gaps = 53/489 (10%)

Query: 8   VLPAFSGNVPAALQN---VFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAF 64
           +LP F+G VP  L+    +FP++K  +L  W +  S+    C   +   DPLF +IG  F
Sbjct: 247 ILPTFAGFVPDQLEKDFRLFPTSKFNRLPCWNNFTSET--SCLLSVSPFDPLFQKIGSTF 304

Query: 65  IEQQLKEYG-RTSHIYNCDTFDENTPPVDSPEYISSL----GAAIYSGMQSGDSDAVWLM 119
           +  Q K  G   +++Y+ D F+E   P DS ++ +S       +I +  +  D + +W++
Sbjct: 305 LRHQKKMLGGDITNLYSADPFNE-ILPSDSSKFDASFMKQTAQSIMNSCRKVDKNCIWVL 363

Query: 120 QGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNF 179
           Q W F+YD  W    +K+ L++VP+G L++LDL++EV P W  +  F+G  ++WC+LHNF
Sbjct: 364 QSWSFTYDQ-WPNWAIKSFLSAVPIGNLLILDLYSEVVPAWQMTSSFHGHNFVWCLLHNF 422

Query: 180 AGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDV 239
            G+ E+ G L  +  G   A     + +VG G+SME I+QN VVY  M +  +  E + +
Sbjct: 423 GGSRELRGNLQKVDKGYQLALMKAGSNLVGAGLSMEAIDQNYVVYQFMIDRMWSQEPIPL 482

Query: 240 KAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
             W+  YS  RY          W +L  + Y+  +   +    V +              
Sbjct: 483 NNWLKSYSESRYSADFKVSHKFWTILAGSFYSQPEKWGNPRFSVFL-------------- 528

Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIAS-GNELSASNTYRYDL 358
                Y +P   + +             W+   E    L+  + S  + L     ++ DL
Sbjct: 529 -----YHRPAFAKKI-----------EYWFPVEETFNHLQSLMPSLMHVLGDHPLFKEDL 572

Query: 359 IDLTRQALA-KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
            D+ R  +  +  NE  L++ EA+ + D   +       +++ + ++     +       
Sbjct: 573 NDVMRAVIQFEIGNEAALSLTEAFLMEDKQQIGASCENLMDMFQKLES----YSNRDFKE 628

Query: 418 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 477
           W+E +K +A   E+ + +   A   +T+W    Q       DY ++ W+GLL  YYG R 
Sbjct: 629 WIEDSKSIAPTSEERQVFPVTASDILTVWGPTGQN-----LDYAHREWAGLLSGYYGRRW 683

Query: 478 AIYFKYMIE 486
             +  +++E
Sbjct: 684 QYFCDWILE 692


>gi|326440885|ref|ZP_08215619.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
           27064]
          Length = 1038

 Score =  194 bits (494), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 143/527 (27%), Positives = 229/527 (43%), Gaps = 54/527 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V+P + G VP       P A++   G W  +   P W     LD   P+F EI  A+   
Sbjct: 246 VVPGYFGTVPDGFVARNPGARVIPQGVWNGLPR-PDW-----LDPRTPVFAEIAAAYYRH 299

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + +G   H +  D   E   P D P  +     A+ + +++    A W++ GW  +  
Sbjct: 300 QEELFGEIDH-FKMDLLHEGGTPGDVP--VPDAARAVETALRAARPAATWVILGWQSNPR 356

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
           P        ALL+++   K++++D  +++  +     ++ G PY +  + NF G   +  
Sbjct: 357 P--------ALLDAIDTSKVLIVDGLSDLDTVRDREAEWGGAPYAFGTIPNFGGRTTIGA 408

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
             D         R   N+ +VG     E  +++P   +L +E+A++ EK+D  AW   Y+
Sbjct: 409 NTDRWTEKFTAWRDKPNSALVGTAYMPEAADRDPAALELFTELAWRREKIDRSAWFAGYA 468

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
             RYG   PA ++A+  L  T Y  T   T   R +   F                    
Sbjct: 469 QFRYGAKDPAAEEAFAALAGTAYQLT---TTDGRPIDSLF-------------------- 505

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
            + + ++  S  +++D        + ++R  E       EL  S+ YRYDL DL RQALA
Sbjct: 506 -LRRPSMSSSVATAFDQAAFDRGFAALLRVNE-------ELRGSDAYRYDLTDLARQALA 557

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
             +  L L +  AY   D      ++  +L L+   D +  CH  FLLGPWLE AK+ A 
Sbjct: 558 LRSRTLQLALRAAYATKDVTAFRGVAALWLRLMRLADTVAGCHKAFLLGPWLEEAKRFAT 617

Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
           + E+  + E  AR  IT W D  +  A  L +Y N+ W GL+ D + P+   YF  +  +
Sbjct: 618 STEEAVELERTARVLITTWGD--RAAAVELSNYANRDWQGLIGDVHVPQWEQYFTEVATA 675

Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYN 534
           L  G   +  D    W      W   R  YPV   GD    +Q +++
Sbjct: 676 LAEGRAPKAID----WYPGEETWTKDRRPYPVRPTGDVHKVAQRVHD 718


>gi|294812279|ref|ZP_06770922.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
           27064]
 gi|294324878|gb|EFG06521.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
           27064]
          Length = 1086

 Score =  194 bits (494), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 143/527 (27%), Positives = 229/527 (43%), Gaps = 54/527 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V+P + G VP       P A++   G W  +   P W     LD   P+F EI  A+   
Sbjct: 294 VVPGYFGTVPDGFVARNPGARVIPQGVWNGLPR-PDW-----LDPRTPVFAEIAAAYYRH 347

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + +G   H +  D   E   P D P  +     A+ + +++    A W++ GW  +  
Sbjct: 348 QEELFGEIDH-FKMDLLHEGGTPGDVP--VPDAARAVETALRAARPAATWVILGWQSNPR 404

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
           P        ALL+++   K++++D  +++  +     ++ G PY +  + NF G   +  
Sbjct: 405 P--------ALLDAIDTSKVLIVDGLSDLDTVRDREAEWGGAPYAFGTIPNFGGRTTIGA 456

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
             D         R   N+ +VG     E  +++P   +L +E+A++ EK+D  AW   Y+
Sbjct: 457 NTDRWTEKFTAWRDKPNSALVGTAYMPEAADRDPAALELFTELAWRREKIDRSAWFAGYA 516

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
             RYG   PA ++A+  L  T Y  T   T   R +   F                    
Sbjct: 517 QFRYGAKDPAAEEAFAALAGTAYQLT---TTDGRPIDSLF-------------------- 553

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
            + + ++  S  +++D        + ++R  E       EL  S+ YRYDL DL RQALA
Sbjct: 554 -LRRPSMSSSVATAFDQAAFDRGFAALLRVNE-------ELRGSDAYRYDLTDLARQALA 605

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
             +  L L +  AY   D      ++  +L L+   D +  CH  FLLGPWLE AK+ A 
Sbjct: 606 LRSRTLQLALRAAYATKDVTAFRGVAALWLRLMRLADTVAGCHKAFLLGPWLEEAKRFAT 665

Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
           + E+  + E  AR  IT W D  +  A  L +Y N+ W GL+ D + P+   YF  +  +
Sbjct: 666 STEEAVELERTARVLITTWGD--RAAAVELSNYANRDWQGLIGDVHVPQWEQYFTEVATA 723

Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYN 534
           L  G   +  D    W      W   R  YPV   GD    +Q +++
Sbjct: 724 LAEGRAPKAID----WYPGEETWTKDRRPYPVRPTGDVHKVAQRVHD 766


>gi|408676293|ref|YP_006876120.1| Alpha-N-acetylglucosaminidase [Streptomyces venezuelae ATCC 10712]
 gi|328880622|emb|CCA53861.1| Alpha-N-acetylglucosaminidase [Streptomyces venezuelae ATCC 10712]
          Length = 855

 Score =  192 bits (487), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 138/519 (26%), Positives = 220/519 (42%), Gaps = 48/519 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP + G VP       P A     G W      P W     LD T P+F  +   +   
Sbjct: 364 VLPGYFGTVPPDFTARNPGAHTVPQGRWVGF-GRPDW-----LDPTGPVFARLAAVYYRH 417

Query: 68  QLKEYGRTSHIYNCDTFDENTPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q + +G  S ++  D   E   P  VD    +S+   A+   +++    A W+M GW  +
Sbjct: 418 QRQRFG-DSDMFKMDLLHEGGAPGTVD----VSAAAGAVQRALEAARPGATWVMLGWQLN 472

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             P        ALL+ V   +L+++D  ++         ++ G PY +  + NF G+  +
Sbjct: 473 PTP--------ALLHGVDRRRLLIVDGLSDRYDELDRETRWGGTPYAFGTIPNFGGHTSI 524

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
                +            ++ + G+    E    NPV + L +E+A+Q   +D + W   
Sbjct: 525 GANTGAWVSRFHAWLAKPDSALRGIAYLPEATGTNPVAFGLFTELAWQPGPIDQQRWFAG 584

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           Y+ RRYG +      AW  L    Y+   G+  + +D + A     PS+ + T  ++   
Sbjct: 585 YAARRYGGADRHAAAAWEALRLGPYSMRTGSWSEPQDSLFA---ARPSLTASTAARW--- 638

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
               S +A+              Y  + V RAL   +     L  S+ YR+D++D+ RQA
Sbjct: 639 ----SPKAMR-------------YDAATVERALAELLRVAPRLRTSDAYRFDVVDVARQA 681

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
           L   A  L   I  AY+  D      L R +    E +  L+     FL+GPWL +A+  
Sbjct: 682 LTNRARVLLPRIRAAYEARDLDAFRALVREWGAAEELLGRLVGSDRRFLVGPWLAAARSW 741

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
             +  +  + E++AR+ +T W D    E+  L DY N+ WSGL+RD Y PR A YF  + 
Sbjct: 742 GADPAERDRLEYDARSILTTWADRVPSESGGLHDYANREWSGLVRDVYAPRWAAYFASLD 801

Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD 524
            +L +G      D    W    + W  G   YP   +GD
Sbjct: 802 RALVNGTEPVAID----WFARDDAWARGHRSYPTLPSGD 836


>gi|197302378|ref|ZP_03167435.1| hypothetical protein RUMLAC_01107 [Ruminococcus lactaris ATCC 29176]
 gi|197298557|gb|EDY33100.1| F5/8 type C domain protein [Ruminococcus lactaris ATCC 29176]
          Length = 1655

 Score =  192 bits (487), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 152/543 (27%), Positives = 244/543 (44%), Gaps = 75/543 (13%)

Query: 8    VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
            VL  +SG VP  + +  PSA++ + G W S +         +L      F +    F + 
Sbjct: 717  VLQGYSGMVPVDITSKDPSAEVIKQGTWCSFQRPS------MLRTDSESFTKYAALFYKV 770

Query: 68   QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
            Q + YG ++H Y  D F E  NT  +DS    + +   + + M + D  A W++Q W   
Sbjct: 771  QKEVYGDSAHYYATDPFHEGGNTGGMDS----AVISQKVLASMMTADPHATWVIQSW--- 823

Query: 126  YDPFWRPPQMKALLNSVPLGK--LVVLDLFAEVKPIWSTSK-----------QFYGVPYI 172
                 +     ALL  +   +   +VLDL+AE  P W+ +            +F   P++
Sbjct: 824  -----QGNPTTALLQGLGDNRDHALVLDLYAEKTPHWNETNPGYYGGAEGGGEFLNTPWV 878

Query: 173  WCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
            +CML+NF G + ++G +D+   G V A + +   M G+G++ E    NPV+YDL  E  +
Sbjct: 879  YCMLNNFGGRLGLHGHIDNYVEGIVNA-SKQAEHMAGIGITPEASVNNPVLYDLFFETIW 937

Query: 233  QH-----EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKN--RDVIV 285
                   +K+++  W   Y  RRYG    +   A  +L+ TVYN       +     V+ 
Sbjct: 938  ADDGNNLQKINLDEWFKNYVTRRYGADSDSAYQAMEILHDTVYNPAYNMKGQGAPESVVN 997

Query: 286  AFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASG 345
            A P +D        G    +G  V                 + Y   ++ +A EL +A  
Sbjct: 998  ARPGLDI-------GAASTWGNAV-----------------VDYDKKKLEKAAELLLADY 1033

Query: 346  NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDG 405
            ++L  S  Y+YDL ++  Q L+  A E    +  A++  DA     LS +FL +++ ++ 
Sbjct: 1034 DKLKNSAGYQYDLANVLEQVLSNTAQEYQKKMAAAFRSGDAEEFSTLSDKFLSIIDMVEK 1093

Query: 406  LLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNK 463
            +      FL+G W+  AK+LA+N +   ++ YE NAR+ IT W    Q  +  L DY N+
Sbjct: 1094 VTGTQKEFLVGTWINGAKKLAKNSDDFTKELYELNARSLITTWGSYDQAISGGLIDYSNR 1153

Query: 464  YWSGLLRDYYGPRAAIYFKYMIESL--ESGDGFRLKDW-RREWIKLTNDWQNGRNVYPVE 520
             W+GL  DYY  R   +     + L  ES   +  +DW   EW      W  G N Y   
Sbjct: 1154 QWAGLTNDYYKMRWEKWITERKKELAGESYTNYSAQDWFEMEWA-----WARGTNKYSGT 1208

Query: 521  SNG 523
             NG
Sbjct: 1209 PNG 1211


>gi|433678127|ref|ZP_20510026.1| alpha-N-acetylglucosaminidase [Xanthomonas translucens pv.
           translucens DSM 18974]
 gi|430816763|emb|CCP40478.1| alpha-N-acetylglucosaminidase [Xanthomonas translucens pv.
           translucens DSM 18974]
          Length = 691

 Score =  192 bits (487), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 154/558 (27%), Positives = 235/558 (42%), Gaps = 85/558 (15%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF G VP A     P A+I ++  W           TY LD  DPLF +I   FI+ 
Sbjct: 164 VLPAFVGYVPKAFAQAHPQARIYRMRAWEGFHE------TYWLDPADPLFAKIALRFIQL 217

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDS----------------------PEY--------I 97
             + YG+ ++ Y  D F+E  PP+ +                      PE         +
Sbjct: 218 YDRTYGKGTY-YLADAFNEMLPPIAADGSDARLASYGDSTANTAKTAPPEVSPAQRDKRL 276

Query: 98  SSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEV 156
           +  G A+Y  +   + DAVW+MQGWLF  D  FW P  + A L  VP  KL+VLD+  + 
Sbjct: 277 ADYGRALYESIHRANPDAVWVMQGWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDR 336

Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
            P  W  S  F G  +I+  +HN+ G+  +YG    +AF   + R      +   +VG G
Sbjct: 337 YPGTWKLSDAFDGKQWIYGYVHNYGGSNPVYG---DLAFYRDDLRALLADKDKQQLVGFG 393

Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
              EG+  N VVY+ M  +A+  ++  ++ W+  Y   RYG + PA++ AW+ L   V +
Sbjct: 394 AFPEGLHDNSVVYEYMYALAWGGQQRSLQDWLGDYIRARYGHTSPALRAAWDDLQAAVLS 453

Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
                               P       G Y  + +P     + + E +  D P L    
Sbjct: 454 TRYWT---------------PRWWRSRAGAYLLFKRPTLD--IGEFEGAPGDPPRL---- 492

Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 391
               RAL+  +A   E + +  YRYDL+D  R       +      + AY+  D      
Sbjct: 493 ---RRALDQLLALAPEYADAPLYRYDLVDFARHYATGRVDAQLQQAVAAYRRGDVAAGDA 549

Query: 392 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 451
              R    V+ +DGL+       L  WL  A+  A+  +    Y  +A+ Q+++W     
Sbjct: 550 AFARVQAAVQQLDGLVGGQQE-TLSSWLGDAEGDAKTPQDAAYYRRDAKAQVSVWGGEGN 608

Query: 452 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF--------RLKDWRREW 503
                L DY +K W G+  DYY PR A+  +  + +   G G         RL+ W  +W
Sbjct: 609 -----LGDYASKAWQGMYADYYLPRWALAMQ-ALRAAAVGSGSVDEAALQQRLRAWELDW 662

Query: 504 IKLTNDWQNGRNVYPVES 521
           +K    +       PV +
Sbjct: 663 VKRETPYTRQAPADPVAA 680


>gi|424795356|ref|ZP_18221218.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
 gi|422795515|gb|EKU24196.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
          Length = 1105

 Score =  191 bits (485), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 146/508 (28%), Positives = 223/508 (43%), Gaps = 76/508 (14%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP A     P A+I ++  W           TY LD  DPLF +I + FI+ 
Sbjct: 201 VLPAFAGYVPKAFAQAHPQARIYRMRAWEGFHE------TYWLDPADPLFAKIAQRFIQL 254

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDS----------------------PEY--------I 97
             + YG+ ++ Y  D F+E  PP+ +                      PE         +
Sbjct: 255 YDRTYGKGTY-YLADAFNEMLPPIAADGSDARLASYGDSTANTANTAPPEVSPAQRDKRL 313

Query: 98  SSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEV 156
           +  G A+Y  +   + DAVW+MQGWLF  D  FW P  + A L  VP  KL+VLD+  + 
Sbjct: 314 ADYGRALYESIHRANPDAVWVMQGWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDR 373

Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
            P  W  S  F G  +I+  +HN+ G+  +YG    +AF   + R      +   +VG G
Sbjct: 374 YPGTWKLSDAFDGKQWIYGYVHNYGGSNPVYG---DLAFYRDDLRALLADKDKQQLVGFG 430

Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
              EG+  N VVY+ M  +A+  ++  ++ W+  Y+  RYG S PA++ AW+ L  +V +
Sbjct: 431 AFPEGLHTNSVVYEYMYALAWGGQQRSLQDWLGDYTRARYGHSSPALRAAWDDLQASVLS 490

Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
                               P       G Y  + +P     + + E +  D P L    
Sbjct: 491 TR---------------YWTPRWWRSRAGAYLLFKRPTLD--IGEFEGAPGDPPRL---- 529

Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 391
               RAL+  +A   E + +  YRYDL+D  R       +      + AY+  D      
Sbjct: 530 ---RRALDQLLALAPEYADAPLYRYDLVDFARHYATGRVDTQLQQALAAYKRGDVAAGDA 586

Query: 392 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 451
              R    V  +DGL+       L  WL++A+  A+  +    Y  +A+ Q+++W     
Sbjct: 587 AFARVQAAVRQLDGLVGGQQE-TLSSWLDAAEGDAKTPQDAAYYRRDAKAQVSVWGGEGN 645

Query: 452 EEASLLRDYGNKYWSGLLRDYYGPRAAI 479
                L DY +K W G+  DYY PR A+
Sbjct: 646 -----LGDYASKAWQGMYADYYLPRWAL 668


>gi|423219557|ref|ZP_17206053.1| hypothetical protein HMPREF1061_02826 [Bacteroides caccae
           CL03T12C61]
 gi|392624762|gb|EIY18840.1| hypothetical protein HMPREF1061_02826 [Bacteroides caccae
           CL03T12C61]
          Length = 715

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 141/503 (28%), Positives = 219/503 (43%), Gaps = 48/503 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           ++P F G VP+ L+N    A I   G W +           +LD  DP F  +   F ++
Sbjct: 232 LMPGFYGMVPSNLKNK-SKAHIIPQGTWGAFTRPD------ILDPMDPEFDRVAAIFYDE 284

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             + YG     ++ D F E          +   G AI   MQ     ++W++QGW  +  
Sbjct: 285 TRRLYGSDIRFFSGDPFHEGG--ATDGVALGDAGRAIQKTMQKHFPGSIWVLQGWQDNPK 342

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
           P         LL  +    ++V +LF E    W T K + G P+IW  + NF     + G
Sbjct: 343 P--------GLLEKLDKRYVLVQELFGENTNNWETRKGYEGTPFIWATVTNFGERPGING 394

Query: 188 ILDSIAFGPVEARTSENTT-MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
            L   A     A  SE    M GVG+  EGI  NPV Y+L+ E+ +  ++VDV  WI  Y
Sbjct: 395 KLQRFADEVYRASNSEYAKYMKGVGILPEGINNNPVTYELLLELVWHKDRVDVDQWIESY 454

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATD-KNRDVIVAFPDVDPSIISVTEGKYQNY 305
              RYGR    I+ AW ++  ++Y+   G  +    +++ A P ++   +S      + Y
Sbjct: 455 VTARYGRITDEIRTAWKMMLKSIYSSEVGYQEGPPENILCARPALELKSVSSWGRLAKKY 514

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
            + + K+A                          LF  +  E +   TYR DLI   RQ 
Sbjct: 515 DRDLYKKAAF------------------------LFAKAMPEFNEVRTYRIDLIHFLRQV 550

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
           +A  A+ +F ++I AYQ        Q   +FL +++  + LLA    F L  W + AK  
Sbjct: 551 IANEADSVFYDMITAYQEKKVEKFEQEVSKFLMMIDTENELLAQDPFFRLSTWQQQAKDA 610

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
                ++K    N    IT W ++   E + L DY  K W+G++  YY  R  +YF Y+ 
Sbjct: 611 GNTAAEKKNNFHNLMMLITYWGEHVTSEDN-LHDYAYKEWAGMMNTYYKERWLVYFDYL- 668

Query: 486 ESLESGDGFRLKD---WRREWIK 505
            +L  G+  +  D   W REW++
Sbjct: 669 RALLRGEEAKAPDYFHWEREWVE 691


>gi|153806010|ref|ZP_01958678.1| hypothetical protein BACCAC_00255 [Bacteroides caccae ATCC 43185]
 gi|149130687|gb|EDM21893.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
           43185]
          Length = 715

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 141/503 (28%), Positives = 219/503 (43%), Gaps = 48/503 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           ++P F G VP+ L+N    A I   G W +           +LD  DP F  +   F ++
Sbjct: 232 LMPGFYGMVPSNLKNK-SKAHIIPQGTWGAFTRPD------ILDPMDPEFDRVAAIFYDE 284

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             + YG     ++ D F E          +   G AI   MQ     ++W++QGW  +  
Sbjct: 285 TRRLYGSDIRFFSGDPFHEGG--ATDGVALGDAGRAIQKTMQKHFPGSIWVLQGWQDNPK 342

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
           P         LL  +    ++V +LF E    W T K + G P+IW  + NF     + G
Sbjct: 343 P--------GLLEKLDKRYVLVQELFGENTNNWETRKGYEGTPFIWATVTNFGERPGING 394

Query: 188 ILDSIAFGPVEARTSENTT-MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
            L   A     A  SE    M GVG+  EGI  NPV Y+L+ E+ +  ++VDV  WI  Y
Sbjct: 395 KLQRFADEVYRASNSEYAKYMKGVGILPEGINNNPVTYELLLELVWHKDRVDVDQWIESY 454

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATD-KNRDVIVAFPDVDPSIISVTEGKYQNY 305
              RYGR    I+ AW ++  ++Y+   G  +    +++ A P ++   +S      + Y
Sbjct: 455 VTARYGRITDEIRTAWKMMLKSIYSSEVGYQEGPPENILCARPALELKSVSSWGRLAKKY 514

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
            + + K+A                          LF  +  E +   TYR DLI   RQ 
Sbjct: 515 DRDLYKKAAF------------------------LFAKAMPEFNEVRTYRIDLIHFLRQV 550

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
           +A  A+ +F ++I AYQ        Q   +FL +++  + LLA    F L  W + AK  
Sbjct: 551 IANEADSVFYDMITAYQEKKVEKFEQEVSKFLMMIDTENELLAQDPFFRLSTWQQQAKDA 610

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
                ++K    N    IT W ++   E + L DY  K W+G++  YY  R  +YF Y+ 
Sbjct: 611 GNTAAEKKNNFHNLMMLITYWGEHVTSEDN-LHDYAYKEWAGMMNTYYKERWLVYFDYL- 668

Query: 486 ESLESGDGFRLKD---WRREWIK 505
            +L  G+  +  D   W REW++
Sbjct: 669 RALLRGEEAKAPDYFHWEREWVE 691


>gi|374990497|ref|YP_004965992.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
 gi|297161149|gb|ADI10861.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
          Length = 1001

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 139/526 (26%), Positives = 224/526 (42%), Gaps = 45/526 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V P + G VP    +  P A+    G W  + + P W     LD     F ++  AF   
Sbjct: 211 VYPGYFGTVPDGFVDRNPGARTVPQGTWNGL-ARPDW-----LDPRTESFGQVAAAFYRH 264

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + +G    ++  D   E     D P  ++    A+ + +Q+    A W++ GW  +  
Sbjct: 265 QQELFGECD-LFKMDLLHEGGAAGDVP--VADAARAVETALQTARPGATWVILGWQAN-- 319

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
                   + LL++V    ++V+D  +++  I    + +  VPY +  + NF G   +  
Sbjct: 320 ------PRRELLDAVNHDHMLVVDGLSDLDSIGDREQDWGSVPYAFGTIPNFGGRTTIGA 373

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
                A    + R    + +VG     E + ++P  ++L SE+A+++  VD   W   Y+
Sbjct: 374 KTHIWARRFTQWRDKPGSKLVGTAYMAEAVGRDPAAFELFSELAWRNTAVDRDEWFRTYA 433

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
             R G      +DA+  L  T Y  T      +  V  A PDV       T     NY  
Sbjct: 434 DVRLGGRDERARDAYAALRDTAYQITSSDGRPHDSVFSARPDV-------TARSGTNYAT 486

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
            +                   +  ++   AL   +     L  S+ YR+DL D+ RQALA
Sbjct: 487 RIPA-----------------FDLADFDPALAALLDVRPSLRDSDAYRHDLTDIARQALA 529

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
             +  L  ++ +AY+  D      L+R +L+L+   D +   H GFLLGPWLE AK+LA 
Sbjct: 530 DRSWTLIPHLHDAYERKDLEAFRTLARLWLKLMRLSDDMTGAHRGFLLGPWLEDAKRLAS 589

Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
           +E +    E  ART IT W D    +   L +Y N+ W+GL+ D++ P+   Y   + ++
Sbjct: 590 DEAEAAHLEHLARTLITTWADRVTADTGKLANYANRDWNGLIGDFHLPQWQSYLDELEDA 649

Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLY 533
           L  G   R  D    W  +   W   R  YPV    DA  T + +Y
Sbjct: 650 LAEGREPRDFD----WFAVEEPWTRERKSYPVRPTTDAHRTGRRVY 691


>gi|404403947|ref|ZP_10995531.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
          Length = 828

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 152/516 (29%), Positives = 241/516 (46%), Gaps = 79/516 (15%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V   F+G VP A++ + P   +T+   W   K+       ++L   DPLF EIG AF+  
Sbjct: 221 VYQGFAGFVPPAMKRIHPETTLTET-KWSGFKN-------WMLSPLDPLFSEIGTAFVRA 272

Query: 68  QLKEYGRTSHIYNCDTFDE-NTP--PVDSPEYISSL---GAAIYSGMQSGDSDAVWLMQG 121
             +E+G+  + Y  D+F+E + P  P  SPE  ++L   G  IY  +   + DAVW+MQG
Sbjct: 273 WEEEFGKGKY-YLIDSFNEMDVPFGPKGSPERAATLRHYGETIYRSLAEANPDAVWVMQG 331

Query: 122 WLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQ------FYGVPYIW 173
           W+F Y    W P  ++ALL   P G++++LDL  +    IW + K       F+G  +I+
Sbjct: 332 WMFGYQRNSWDPESVRALLEGAPDGRMMILDLAVDFNNFIWRSEKSWNHLQGFFGREWIY 391

Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAF 232
             + NF G   + G L+  A G +EA +S N   + G G S EG+E N +VY++++   +
Sbjct: 392 STVPNFGGRTALIGNLEFYANGHLEALSSPNRGRLTGYGTSPEGVESNEIVYEIIAAAGW 451

Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN-CTDGATDKNRDVIVAFPDVD 291
             +++D+K +++ YS  RYG     I   W+ +  + YN CT+ A               
Sbjct: 452 SDDRIDLKKFLHDYSAARYGGCPEGIDRFWSGMLQSSYNECTNNA--------------- 496

Query: 292 PSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS 351
                    +Y+   +P S          +Y              A+E F+A   EL  +
Sbjct: 497 ---------RYRWQLRPYSHRMPTMGINENY------------YTAIEQFLACAGELGGN 535

Query: 352 NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG----VFQLSRRFLELVEDMDGLL 407
             YR D I      LA  A+ L    +EA    D +G     +  + R  EL+ D D LL
Sbjct: 536 ELYRTDAIQYAALYLASKADML----LEAANWADLYGAREEAYDCAMRIEELLLDADRLL 591

Query: 408 ACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSG 467
           A H    L  W   A++    EE+++++   +R  I++W   +      L DY  + WSG
Sbjct: 592 ASHPLLRLDRWSGMARKAGCTEEEKERFVGESRRLISVWGGPS------LSDYSARVWSG 645

Query: 468 LLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 503
           ++RDYY PR   Y    +E+   G  F  + W  +W
Sbjct: 646 VIRDYYVPRLNKY----LEAKTDGTVFDFRTWDEQW 677


>gi|291302495|ref|YP_003513773.1| alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
           44728]
 gi|290571715|gb|ADD44680.1| Alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
           44728]
          Length = 696

 Score =  189 bits (479), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 149/537 (27%), Positives = 241/537 (44%), Gaps = 61/537 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP F+G VP  + +   +A I Q G WF     P W  T     T   + E+   F  +
Sbjct: 214 VLPGFAGLVPGDIGD---TAAIPQ-GQWFGFDR-PAWLPT-----TTRAYAEVAEVFYAK 263

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q +  G T      D   E   +  VD  +    + AA    M+    D +W++Q W   
Sbjct: 264 QTERLGAT-RAQAVDLLHEGGTSGGVDLADATRGIAAA----MERAHDDYLWVLQAW--- 315

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
               W  P  + L  +        L L       W  +K ++G P+    L NF G   +
Sbjct: 316 ----WDNPLPEVLAAT----DSDHLLLLDLTGEGWRKTKGWHGKPWARGSLTNFGGRTVL 367

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           +G L  IA  P      + +++VG  +  E  + NPVV+ L ++ ++    +D+ AW+ +
Sbjct: 368 FGGLPEIAELPSLKDDPKASSLVGTALVEEAWQVNPVVWSLFTQTSWADGDIDLNAWVPE 427

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           Y   RYG++ P    AW+ L  T Y   DG       ++ A P +D              
Sbjct: 428 YVAARYGKAHPRAVRAWHGLLATAYRSMDGRPGGAESLLCAMPSLD-------------- 473

Query: 306 GKPVSKEAVLKSETSSYDHPH-LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                      ++ +S + PH L Y    +  A    +A+   L  ++T+R+DL+D+TRQ
Sbjct: 474 -----------ADRASMNGPHSLPYPAEALEVAWRDLLAAREALGGADTFRFDLVDVTRQ 522

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            ++  A  L   +  AY + +      LS  F++L E +D +LA  + FL+G WL  A+ 
Sbjct: 523 VISNRARPLLPLLRTAYAMKELDRFIALSHSFIDLFELLDPVLATREEFLVGRWLADARA 582

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
           LA +E++    E++ART IT W D+ +  A+L+ DY N  W+GL+ DYY PR   Y K +
Sbjct: 583 LAADEDEADALEFDARTIITTWGDSPESSATLI-DYANHEWAGLIADYYRPRWEKYLKSL 641

Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK--YLQG 539
              L  G      D+  +       W    + YP E +GDA+ + + +++   Y +G
Sbjct: 642 ETELREGKPAEPIDFYAD----AAAWARSHDTYPTEPSGDAVSSCRAVHHALPYFEG 694


>gi|375146756|ref|YP_005009197.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Niastella
           koreensis GR20-10]
 gi|361060802|gb|AEV99793.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Niastella
           koreensis GR20-10]
          Length = 1147

 Score =  188 bits (477), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 140/515 (27%), Positives = 219/515 (42%), Gaps = 47/515 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V+P F G VP    N    A++   GNW +      +    +LD TD  F  +   F E+
Sbjct: 228 VMPGFYGMVPHNF-NTKSKARVITQGNWGA------FIRPAILDPTDTAFDRVAGIFYEE 280

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
             K YGR    ++ D F E    + +   +   GA I   MQ     A+W++QGW     
Sbjct: 281 TKKLYGRNIRFFSGDPFHEGG--ITNGVNLGKAGANIQKAMQQYFPGAIWVLQGW----- 333

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
              +    K LL       L++ +LF E    W T   + G P+IWC ++NF     + G
Sbjct: 334 ---QDNPKKELLAETDKSALLIQELFGENTNNWETRNGYEGTPFIWCCVNNFGERPGLNG 390

Query: 188 ILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
            L+  A     A T      M GVG+  EGI  NP  YDL+ E+ + ++ V+   WIN Y
Sbjct: 391 KLERYAGEVYRAATGPFREYMKGVGIMPEGINNNPASYDLVLELGWHNQPVETGKWINDY 450

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
              RYG++   I  AW +   T+Y+          +++ A P                  
Sbjct: 451 VKARYGKANDQIATAWTLFLQTIYSNPGYQEGPPENILCARP------------------ 492

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
                 A+     SS+      Y T+   + ++ F A+      S TY+ DLI+ TRQ L
Sbjct: 493 ------ALQVKSVSSWGKLKKGYDTALFEKGVQAFAAAAPLFGNSETYKIDLINFTRQVL 546

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
           +  A+ +F +++ AY+  +       +  FL L    + LL  H  + L  + + A +  
Sbjct: 547 SNRADTVFASLVTAYKEENTVAFNAAAEAFLSLHALTNELLNSHSYYRLTSYQQQALRSG 606

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
               + K    NA   IT W +N ++E   L +Y  K W G++  +Y  R  +YF Y+  
Sbjct: 607 NTPIERKNNLHNAMMLITYWGENNRQE-DYLHEYAYKEWGGMMTTFYQQRWKLYFDYLRN 665

Query: 487 SLESGDGFRLKD---WRREWIKLTNDWQNGRNVYP 518
           +L +G      D   W REW+      ++    YP
Sbjct: 666 NL-AGKSVTPPDFFAWEREWVTQNEQVKSEVQPYP 699


>gi|229818803|ref|YP_002880329.1| alpha-N-acetylglucosaminidase [Beutenbergia cavernae DSM 12333]
 gi|229564716|gb|ACQ78567.1| Alpha-N-acetylglucosaminidase [Beutenbergia cavernae DSM 12333]
          Length = 751

 Score =  188 bits (477), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 162/528 (30%), Positives = 252/528 (47%), Gaps = 37/528 (7%)

Query: 7   AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
           AVLPAF G+VP  L      A+    G  FS         T LL   D  F  +   F  
Sbjct: 230 AVLPAFGGHVPDGLGA---GARTHWQG--FS---------TALLGPDDDAFAVVAAEFAR 275

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
           QQ + +G T H+Y  D F E+ PP   PE +++  AA Y+GM + D +A W+MQ W F Y
Sbjct: 276 QQRELFG-TDHLYAADPFIESVPPSGEPEDLAAFAAATYAGMSAADPEATWVMQAWPFHY 334

Query: 127 D-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
              FW   ++ A+ ++VP  +L++LDL+AE  P+W   +      ++WC +HNF G   +
Sbjct: 335 HRRFWTAERIAAVTDAVPRDRLLLLDLWAEHAPVWDDGRGIAEHQWLWCAVHNFGGRFSV 394

Query: 186 YGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 243
           +G L  +A   G V    +      GVGM+ME +E NPV Y+L++++ +  E+ DV AW+
Sbjct: 395 HGDLHGLARDLGGVLDDGARTGGFTGVGMAMEALENNPVFYELLTDLVW--ERPDVDAWV 452

Query: 244 NQYSVRRYG-----RSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD--VDPSIIS 296
            ++  +RYG      +  A+  AW +L  T+Y    G T      ++A P   V P    
Sbjct: 453 GRFVDQRYGFADGTAARDAVHGAWAILLRTLYGP--GMTRSIPSPVIARPADVVAPFHTQ 510

Query: 297 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRY 356
              G++ +   PV   A + +E      P +     E+ RA  L   +     A     +
Sbjct: 511 RLAGEFLDPDAPVIVSANIDAEAD----PRVEGDLPEIARAAALLREAAGSSDAGGPLAH 566

Query: 357 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
           DL DL    +A+        I+ A +  DA  V          + D+D + A     LLG
Sbjct: 567 DLADLLTHVVAQRTRAPIRAIVAAARAGDADAVRANGALLAAAIADLDAVAATQPDRLLG 626

Query: 417 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
            WL +A++ A ++ + +    +AR Q+T+W     E+ S L DY  ++WSGLL  +Y PR
Sbjct: 627 TWLAAAQRWADDDGERRVLLRDARRQLTVW----GEQTSGLHDYSGRHWSGLLGGFYAPR 682

Query: 477 AAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD 524
             ++  ++ E+ ESG     ++ RR  + L   W       P +  GD
Sbjct: 683 WQLWVDWLAEAAESGSEPDPQELRRAVVALEESWVARDETGPTDPAGD 730


>gi|401885538|gb|EJT49648.1| alpha-N-acetylglucosaminidase, putative [Trichosporon asahii var.
           asahii CBS 2479]
          Length = 781

 Score =  188 bits (477), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 138/539 (25%), Positives = 241/539 (44%), Gaps = 42/539 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP F G VP  L N            W S  ++  +     +D     +  +  AF+ +
Sbjct: 274 VLPGFCGFVPPELHNYIGGPDFKTYPTWMSFPAE--YTKVRAIDPEWDTWNVVQSAFLRK 331

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + YG TS  Y  D F E+ P    P Y+  +  A+   + +   +A W+MQGW+F  D
Sbjct: 332 QKELYGFTSDYYMVDLFTESKPTSTDPTYLKGIATAVRESIHAVAPNATWIMQGWIFVND 391

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P  W     KA L+      L+VLDL AE  P W   K F+G  ++WC L N+  N  +Y
Sbjct: 392 PKSWTETASKAFLDGAG-ESLLVLDLAAESYPQWKRLKNFFGRRWLWCTLINYGQNDGLY 450

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G LD      ++A+ +    + G+G+  EGI  N  +++L ++  +  + +D+K W   +
Sbjct: 451 GALDKWNHDIMDAK-ANGGRLSGMGIVPEGINNNEHLFELATDQGWSSQAIDLKQWTQNW 509

Query: 247 SVRRY-GRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
             RRY G+++   Q AW +L ++VY   + A       ++   D+ P++           
Sbjct: 510 VKRRYRGQNLDLAQKAWELLDNSVYKSNNTALKCTTRSLI---DLRPAV----------- 555

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
                  + L   T +Y    + Y   +V+ AL+  + S +  +    + YDL+D+ RQ 
Sbjct: 556 -------SGLIGTTGNYLATAITYEPRDVVAALDNLLQSWSG-AGGQQFDYDLVDVARQV 607

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
               A  ++  +I A+  ++     +  R  + L+ D+D L+A    F L  W+  A+  
Sbjct: 608 FVNAAIPIYQAMINAWNGSNKADTEKYGRELVGLINDIDRLMATSRHFRLESWVGDARNW 667

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
           AQ+   +   E+ AR Q+ +W   T     L R Y  K+W G++ + Y     + ++ ++
Sbjct: 668 AQDAGAKDDMEFQARNQLILWGPATFAPWPLDR-YAAKHWHGIMSEVYAKGWELLYQNLL 726

Query: 486 ESLESGDGFRLKDWRREWI------KLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
           ++         K W +         K+   W+N ++       GD++   + L  KY Q
Sbjct: 727 KT-------EPKAWNKTAFASELMEKVEKPWENVKSGGVQGPQGDSVAVIRELREKYKQ 778


>gi|406693970|gb|EKC97309.1| alpha-N-acetylglucosaminidase, putative [Trichosporon asahii var.
           asahii CBS 8904]
          Length = 781

 Score =  187 bits (476), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 138/539 (25%), Positives = 241/539 (44%), Gaps = 42/539 (7%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP F G VP  L N            W S  ++  +     +D     +  +  AF+ +
Sbjct: 274 VLPGFCGFVPPELHNYIGGPDFKTYPTWMSFPAE--YTKVRAIDPEWDTWNVVQSAFLRK 331

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + YG TS  Y  D F E+ P    P Y+  +  A+   + +   +A W+MQGW+F  D
Sbjct: 332 QKELYGFTSDYYMVDLFTESKPTSTDPTYLKGIATAVRESIHAVAPNATWIMQGWIFVND 391

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P  W     KA L+      L+VLDL AE  P W   K F+G  ++WC L N+  N  +Y
Sbjct: 392 PKSWTETASKAFLDGAG-ESLLVLDLAAESYPQWKRLKNFFGRRWLWCTLINYGQNDGLY 450

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G LD      ++A+ +    + G+G+  EGI  N  +++L ++  +  + +D+K W   +
Sbjct: 451 GALDKWNHDIMDAK-ANGGRLSGMGIVPEGINNNEHLFELATDQGWSSQAIDLKQWTQNW 509

Query: 247 SVRRY-GRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
             RRY G+++   Q AW +L ++VY   + A       ++   D+ P++           
Sbjct: 510 VKRRYRGQNLDLAQKAWELLDNSVYKSNNTALKCTTRSLI---DLRPAV----------- 555

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
                  + L   T +Y    + Y   +V+ AL+  + S +  +    + YDL+D+ RQ 
Sbjct: 556 -------SGLIGTTGNYLATAITYEPRDVVAALDNLLQSWSG-AGGQQFDYDLVDVARQV 607

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
               A  ++  +I A+  ++     +  R  + L+ D+D L+A    F L  W+  A+  
Sbjct: 608 FVNAAIPIYQAMINAWNGSNKADTEKYGRELVGLINDIDRLMATSRHFRLESWVGDARNW 667

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
           AQ+   +   E+ AR Q+ +W   T     L R Y  K+W G++ + Y     + ++ ++
Sbjct: 668 AQDAGAKDDMEFQARNQLILWGAATFAPWPLDR-YAAKHWHGIMSEVYAKGWELLYQNLL 726

Query: 486 ESLESGDGFRLKDWRREWI------KLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
           ++         K W +         K+   W+N ++       GD++   + L  KY Q
Sbjct: 727 KT-------EPKAWNKTAFASELMEKVEKPWENVKSGGVQGPQGDSVAVIRELREKYKQ 778


>gi|345881765|ref|ZP_08833275.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
 gi|343918424|gb|EGV29187.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
          Length = 1552

 Score =  187 bits (475), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 135/488 (27%), Positives = 233/488 (47%), Gaps = 45/488 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V+  F G VP+  +  FP+A++   G W      P      LL +   LF ++ +A+ E 
Sbjct: 233 VVQGFPGLVPSFFKERFPAAQLVLQGRWGHFNRPP-----MLLPSDKDLFQQVAKAYYES 287

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
            ++ YGR       D F E  NT  VD     +++   +     S    A W++QGW  +
Sbjct: 288 LIRCYGRDFKFLGGDLFHEGGNTKGVDVAATAAAVQQTMLRYFPS----AKWVLQGWNNN 343

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             P         LL+ +    +++++L  E+   W +S +F G P++W  +++F G  +M
Sbjct: 344 PSP--------TLLSKLDKQHVLLINLSGEIAASWESSNEFGGTPWLWGSVNHFGGKTDM 395

Query: 186 YGILDSIAFGPVEART-SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
            G L  +   P  A + ++N  M G+G+  EGI  NPVVYDL  + A+     D+   + 
Sbjct: 396 GGQLPVLVAEPHRAFSQTKNGVMQGIGILPEGINSNPVVYDLALKTAWYTTTPDLDRLLR 455

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
            Y   RYG    ++  AW++L H+VY                            EG +++
Sbjct: 456 DYIAYRYGHVDESLVQAWHILSHSVYG---------------------EFKIKGEGTFES 494

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                ++  +  +  S++    + Y+  ++ +AL LF    ++   S TY+YDL+DL RQ
Sbjct: 495 IF--CARPGLHVTSVSTWGPKQMQYNPKDLEKALGLFRRVADQYKGSATYQYDLVDLARQ 552

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            +A +A +++   ++AY+  DA  + +  + F+ L++  D LL     FLLG WL  A  
Sbjct: 553 VMANHARDVYAAAMQAYRNKDAALLHEKGQEFMHLLQLQDRLLQTDTHFLLGNWLAQAAN 612

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
                  ++Q   NA+  IT W  ++   A+ + DY NK W+GLL+ YY PR   +F  +
Sbjct: 613 YGVTAADKQQALHNAKMLITYWGPDS--AATRVHDYANKEWAGLLKSYYEPRWQKFFYAL 670

Query: 485 IESLESGD 492
            +S+ +G+
Sbjct: 671 YQSVNTGE 678


>gi|317501265|ref|ZP_07959469.1| hypothetical protein HMPREF1026_01412 [Lachnospiraceae bacterium
            8_1_57FAA]
 gi|316897332|gb|EFV19399.1| hypothetical protein HMPREF1026_01412 [Lachnospiraceae bacterium
            8_1_57FAA]
          Length = 1847

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 138/497 (27%), Positives = 230/497 (46%), Gaps = 72/497 (14%)

Query: 8    VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
            VL  +SG VP  + +   +A++ + G W S +         +L  T   F +  + F + 
Sbjct: 695  VLQGYSGMVPTNIHDYDKNAEVIEQGEWCSFQR------PTMLKTTSSTFEKYAKKFYQC 748

Query: 68   QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            Q + YG  S+ Y  D F E    +      S +   + + M + D DAVW++Q W     
Sbjct: 749  QKEVYGDVSNYYATDPFHEG--GITGGMNASDISEKVLTEMITADKDAVWIIQSW----- 801

Query: 128  PFWRPPQMKALLNSVP-----LGKLVVLDLFAEVKP----------IWSTSKQFYGVPYI 172
               +     ALLN +          ++LDL+AE  P           +   ++F   P++
Sbjct: 802  ---QGNPTTALLNGLDRVEKGTDHALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWL 858

Query: 173  WCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
            +CML+NF G + ++G LD++A   +    +E   + G+G++ E    NPV+YD + E  +
Sbjct: 859  FCMLNNFGGRLGLHGHLDNLA-NNIPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIW 917

Query: 233  QH------EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNR 281
            Q       E +D+  W++ Y+ RRYG    +   AW++L  TVY  +      GA +   
Sbjct: 918  QDDASQKMEVIDLDTWLDDYATRRYGAESESANQAWDILKETVYKASLNGLGQGAPES-- 975

Query: 282  DVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELF 341
             V+ A P++       T G    +G  V                 + Y   ++  A  L 
Sbjct: 976  -VVNARPNL-------TIGAASTWGNAV-----------------ISYEKGDLEEAAALL 1010

Query: 342  IASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVE 401
            +A  ++L  S  Y+YDL ++ +Q L+  A E    +  A+   D       S +F+ ++E
Sbjct: 1011 LADYDKLKDSAGYQYDLANVLQQVLSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIE 1070

Query: 402  DMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRD 459
            DM+ +    + FLLG W+E AK LA N +   ++ YE+NA+  +T W    Q E   L+D
Sbjct: 1071 DMEKVTGTSEYFLLGRWVEQAKALANNADDFTKELYEFNAKALVTTWGSKNQAEKGGLKD 1130

Query: 460  YGNKYWSGLLRDYYGPR 476
            Y N+ WSGL+ D+Y  R
Sbjct: 1131 YSNRQWSGLIGDFYKAR 1147


>gi|373461651|ref|ZP_09553390.1| hypothetical protein HMPREF9944_01654 [Prevotella maculosa OT 289]
 gi|371951955|gb|EHO69797.1| hypothetical protein HMPREF9944_01654 [Prevotella maculosa OT 289]
          Length = 713

 Score =  187 bits (474), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 137/472 (29%), Positives = 207/472 (43%), Gaps = 53/472 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNW-FSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
           VL  F G V  ++++ +P+A + Q G W F  + D       +L  T+ LF EI   +  
Sbjct: 231 VLQGFYGMVSRSIRDRYPNAVMPQ-GMWGFFERPD-------ILKPTEKLFDEIADTYYR 282

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
           +  K YG   H +  D F E          ++  G A+   MQ     + W++QGW  + 
Sbjct: 283 EIKKHYGTGFHYFGGDLFHEGGQ--TGTLNVADCGLAVQQAMQRNFPGSTWVLQGWSGNP 340

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           +P         LL  +   K++V+DLF E    W+ +K + G P++WC++ NF     MY
Sbjct: 341 NPL--------LLTKLDREKVLVVDLFGENDEAWNRTKAYQGTPFLWCIVSNFGEQCGMY 392

Query: 187 GILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           G L  IA    + R S+    + GVG+  EGI  NPVVYD++        K++V+AW+  
Sbjct: 393 GKLQRIALQIDKVRKSDYKAYLKGVGIMPEGINNNPVVYDMVLHAPLTDRKINVEAWLKS 452

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           Y   RYG     I  AW +   T+Y                         SV E     Y
Sbjct: 453 YITYRYGSYNADIYAAWLIFLQTIY------------------------ASVPE----KY 484

Query: 306 GKPVS----KEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
           G P S    +  V  ++TSS+     +Y        + LF+ +      S TY YD+ DL
Sbjct: 485 GLPESVFCARPGVKVTQTSSWGVRARYYDMDFFKEGVRLFLKAKTSFEDSETYAYDMFDL 544

Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
            RQ  +   N ++ ++I A    + +   Q S RFL  +   D LLA   GF L  WL  
Sbjct: 545 LRQVQSDKGNRVYDDMIAAIDAKNPNRFEQTSDRFLHELLRQDTLLAQSKGFTLERWLGQ 604

Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 473
           A +  +          NA+ Q+T W  +     + + DY  K W+G+LR  Y
Sbjct: 605 ASRFGKTVYDRDLALKNAKMQLTFWGPDWNPTTT-VHDYAAKEWAGMLRTLY 655


>gi|153814573|ref|ZP_01967241.1| hypothetical protein RUMTOR_00787 [Ruminococcus torques ATCC 27756]
 gi|331089988|ref|ZP_08338878.1| hypothetical protein HMPREF1025_02461 [Lachnospiraceae bacterium
            3_1_46FAA]
 gi|145848067|gb|EDK24985.1| F5/8 type C domain protein [Ruminococcus torques ATCC 27756]
 gi|330402902|gb|EGG82468.1| hypothetical protein HMPREF1025_02461 [Lachnospiraceae bacterium
            3_1_46FAA]
          Length = 1863

 Score =  187 bits (474), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 138/497 (27%), Positives = 230/497 (46%), Gaps = 72/497 (14%)

Query: 8    VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
            VL  +SG VP  + +   +A++ + G W S +         +L  T   F +  + F + 
Sbjct: 711  VLQGYSGMVPTNIHDYDKNAEVIEQGEWCSFQR------PTMLKTTSSTFEKYAKKFYQC 764

Query: 68   QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            Q + YG  S+ Y  D F E    +      S +   + + M + D DAVW++Q W     
Sbjct: 765  QKEVYGDVSNYYATDPFHEG--GITGGMNASDISEKVLTEMITADKDAVWIIQSW----- 817

Query: 128  PFWRPPQMKALLNSVP-----LGKLVVLDLFAEVKP----------IWSTSKQFYGVPYI 172
               +     ALLN +          ++LDL+AE  P           +   ++F   P++
Sbjct: 818  ---QGNPTTALLNGLDRVEKGTDHALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWL 874

Query: 173  WCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
            +CML+NF G + ++G LD++A   +    +E   + G+G++ E    NPV+YD + E  +
Sbjct: 875  FCMLNNFGGRLGLHGHLDNLA-NNIPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIW 933

Query: 233  QH------EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNR 281
            Q       E +D+  W++ Y+ RRYG    +   AW++L  TVY  +      GA +   
Sbjct: 934  QDDASQKMEVIDLDTWLDDYATRRYGAESESANQAWDILKETVYKASLNGLGQGAPES-- 991

Query: 282  DVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELF 341
             V+ A P++       T G    +G  V                 + Y   ++  A  L 
Sbjct: 992  -VVNARPNL-------TIGAASTWGNAV-----------------ISYEKGDLEEAAALL 1026

Query: 342  IASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVE 401
            +A  ++L  S  Y+YDL ++ +Q L+  A E    +  A+   D       S +F+ ++E
Sbjct: 1027 LADYDKLKDSAGYQYDLANVLQQVLSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIE 1086

Query: 402  DMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRD 459
            DM+ +    + FLLG W+E AK LA N +   ++ YE+NA+  +T W    Q E   L+D
Sbjct: 1087 DMEKVTGTSEYFLLGRWVEQAKALANNADDFTKELYEFNAKALVTTWGSKNQAEKGGLKD 1146

Query: 460  YGNKYWSGLLRDYYGPR 476
            Y N+ WSGL+ D+Y  R
Sbjct: 1147 YSNRQWSGLIGDFYKAR 1163


>gi|336439030|ref|ZP_08618649.1| hypothetical protein HMPREF0990_01043 [Lachnospiraceae bacterium
            1_1_57FAA]
 gi|336017072|gb|EGN46842.1| hypothetical protein HMPREF0990_01043 [Lachnospiraceae bacterium
            1_1_57FAA]
          Length = 1863

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 138/497 (27%), Positives = 230/497 (46%), Gaps = 72/497 (14%)

Query: 8    VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
            VL  +SG VP  + +   +A++ + G W S +         +L  T   F +  + F + 
Sbjct: 711  VLQGYSGMVPTNIHDYDKNAEVIEQGEWCSFQR------PTMLKTTSSTFEKYAKKFYQC 764

Query: 68   QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
            Q + YG  S+ Y  D F E    +      S +   + + M + D DAVW++Q W     
Sbjct: 765  QKEVYGDVSNYYATDPFHEG--GITGGMNASDISEKVLTEMITADKDAVWIIQSW----- 817

Query: 128  PFWRPPQMKALLNSVP-----LGKLVVLDLFAEVKP----------IWSTSKQFYGVPYI 172
               +     ALLN +          ++LDL+AE  P           +   ++F   P++
Sbjct: 818  ---QGNPTTALLNGLDRVEKGTDHALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWL 874

Query: 173  WCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
            +CML+NF G + ++G LD++A   +    +E   + G+G++ E    NPV+YD + E  +
Sbjct: 875  FCMLNNFGGRLGLHGHLDNLA-NNIPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIW 933

Query: 233  QH------EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNR 281
            Q       E +D+  W++ Y+ RRYG    +   AW++L  TVY  +      GA +   
Sbjct: 934  QDDASQKMEVIDLDTWLDDYATRRYGAESESANQAWDILKETVYKASLNGLGQGAPES-- 991

Query: 282  DVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELF 341
             V+ A P++       T G    +G  V                 + Y   ++  A  L 
Sbjct: 992  -VVNARPNL-------TIGAASTWGNAV-----------------ISYEKGDLEEAAALL 1026

Query: 342  IASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVE 401
            +A  ++L  S  Y+YDL ++ +Q L+  A E    +  A+   D       S +F+ ++E
Sbjct: 1027 LADYDKLKDSAGYQYDLANVLQQVLSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIE 1086

Query: 402  DMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRD 459
            DM+ +    + FLLG W+E AK LA N +   ++ YE+NA+  +T W    Q E   L+D
Sbjct: 1087 DMEKVTGTSEYFLLGRWVEQAKALANNADDFTKELYEFNAKALVTTWGSKNQAEKGGLKD 1146

Query: 460  YGNKYWSGLLRDYYGPR 476
            Y N+ WSGL+ D+Y  R
Sbjct: 1147 YSNRQWSGLIGDFYKAR 1163


>gi|403512485|ref|YP_006644123.1| alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain protein
           [Nocardiopsis alba ATCC BAA-2165]
 gi|402798758|gb|AFR06168.1| alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain protein
           [Nocardiopsis alba ATCC BAA-2165]
          Length = 718

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 133/477 (27%), Positives = 214/477 (44%), Gaps = 48/477 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP F+G+VP +L      ++  Q               T++L  TDPL+  +    +E 
Sbjct: 197 VLPGFTGHVPPSLAPGRTGSRTWQG------------LVTHVLVPTDPLYTTLCAEIVET 244

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q KE   T H Y  D F E  P    P +   +  A   G+   D  AVW +Q W FSY 
Sbjct: 245 Q-KELFDTDHQYAIDPFIEMIPVDSDPGFPGLVARATIEGLTRADPRAVWFLQTWPFSYQ 303

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI--- 183
             FW P +++A L+++P   L +LDL+AE  P WS    F G P+ WC L NF G     
Sbjct: 304 SDFWSPERVEAFLDAIPDDHLHLLDLWAEYDPQWSRFHAFGGTPWTWCALLNFGGRTDPM 363

Query: 184 -EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
            ++ G  D I      A   E     G+G+SME    NP  ++L+ + A+       + W
Sbjct: 364 ADLQGAADRIGAAKDSAHPPE-----GIGLSMEATRNNPAFFELVVDQAWTRTGRVEEEW 418

Query: 243 INQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           +  +  +RYG    PA+ + W  L  TV   +          +  FP+    ++++    
Sbjct: 419 LPDFVAQRYGPGHDPALLEGWRGLLRTVLGASG---------VRIFPEQFNGVLTL---- 465

Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASG--NELSASNTYRYDLI 359
            + + + +   + L++E ++     +WY   +++ A E  +A    + L+      +DL+
Sbjct: 466 -RPHYRHLEDSSALRAEVTAL----VWYPWPDLLAAWERLVAGAETDPLAVEGPLGHDLV 520

Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
           D+    L++ A+  +L ++E    +       L  RFLE+ +D+D LL     +    W 
Sbjct: 521 DVAMAVLSRVADHRYLEMVEHLDHHPELPEGDLE-RFLEVFDDLDALLETRPEYRYRTWE 579

Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
             A   A   E  +    NAR  +T+W   T  +   L DY  + WSGL+  YY PR
Sbjct: 580 AKATSWATGTEDHRVLTDNARRILTVW---TTLDDPRLDDYAGRLWSGLVGGYYRPR 633


>gi|315500594|ref|YP_004089396.1| Alpha-N-acetylglucosaminidase [Asticcacaulis excentricus CB 48]
 gi|315418606|gb|ADU15245.1| Alpha-N-acetylglucosaminidase [Asticcacaulis excentricus CB 48]
          Length = 765

 Score =  186 bits (471), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 144/538 (26%), Positives = 221/538 (41%), Gaps = 76/538 (14%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF G VP A     P A+I  +  W           TY LD  DPLF +I   FI  
Sbjct: 239 ILPAFGGYVPKAFAQKHPQARIYPMRPWEGFHE------TYWLDPADPLFAKIAARFIAL 292

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVD-----------------------------SPEYIS 98
             + YG   + Y  D+F+E  PP+                                E ++
Sbjct: 293 YTETYGEGRY-YLADSFNEMLPPISHDGSDVKNAKYGDSTANTKETETVVDPAVKAERLA 351

Query: 99  SLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVK 157
           + G AIY  ++    DAVW MQGWLF  D  FW P  + A L  VP  KL++LD+  +  
Sbjct: 352 AYGKAIYDSIRQARPDAVWTMQGWLFGADKHFWTPDAIGAFLRDVPQDKLMILDIGNDRY 411

Query: 158 P-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGIL----DSIAFGPVEARTSENTTMVGVGM 212
           P +W +S  F G P+I+  +HN+  +  +YG L    D I        T +   + G G+
Sbjct: 412 PGVWQSSNAFQGKPWIYGYVHNYGASNPVYGDLGFYRDDIRGLLARKDTGD---LKGFGL 468

Query: 213 SMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNC 272
             EG+  N +VY+   ++A+      V  W+  Y   RYG+  PA+  AW+      ++ 
Sbjct: 469 FPEGLHNNSIVYEYAYDLAWGQANQTVTEWLTTYLKSRYGQVTPALILAWSTYVEAAFST 528

Query: 273 TDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTS 332
              +               P       G Y    +P +     +       HP       
Sbjct: 529 RYWS---------------PRWWRSKAGAYLLCKRPTADMVEFEG------HPG---DRK 564

Query: 333 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 392
           ++ RA++  + S      S  YR+D+ID  R  +++  ++  +  ++AY+  D      L
Sbjct: 565 KLRRAIDALL-SLKGFGGSALYRHDVIDAVRHLVSEEIDDRLIAAMKAYKSGDVKTGDGL 623

Query: 393 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 452
               + LV  +D L+       L  W++ A       E++  Y  NA+ Q+T+W      
Sbjct: 624 REEVIALVTQVDTLMGAQPD-TLASWIDEASAYGDTSEEKAYYVMNAKAQVTVWGGKGN- 681

Query: 453 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 510
               L DY +K W GL +D+Y PR       +  S   G  F  K + RE I     W
Sbjct: 682 ----LNDYASKAWQGLYKDFYLPRWMKLLAALRASASGGAPFDQKTFTRELIDWEQAW 735


>gi|302526099|ref|ZP_07278441.1| alpha-N-acetylglucosaminidase [Streptomyces sp. AA4]
 gi|302434994|gb|EFL06810.1| alpha-N-acetylglucosaminidase [Streptomyces sp. AA4]
          Length = 860

 Score =  186 bits (471), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 134/524 (25%), Positives = 223/524 (42%), Gaps = 52/524 (9%)

Query: 8   VLPAFSGNVPAALQN------VFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIG 61
           VLP + G VP    +          A++   G W      P W     LD     + ++ 
Sbjct: 237 VLPGYFGTVPRGFADKSKKADASSDARVIGQGTWVGFDR-PDW-----LDPRTSSYRKVA 290

Query: 62  RAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
            AF + Q   +G TS +Y  D   E     D P  +      + + +Q+    A W++ G
Sbjct: 291 AAFYQAQHDLFGDTS-MYKMDLLHEGGKSGDVP--VGDAARGVMTALQTARPGATWVLLG 347

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           W  +      PP  +A++++V   KL V+D  ++         Q+   PY +  ++NF G
Sbjct: 348 WQNN------PP--RAIVDAVDKSKLFVVDGLSDRYGQRDPDSQWNNTPYAFGTIYNFGG 399

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
           +  +            + RT + + + G+    EG   NP  ++L +E+A++   +   A
Sbjct: 400 HTTIGANTGVWTQRFPQWRTKQGSALTGIAYLPEGTGTNPAAFELFTELAWRQTPIHQAA 459

Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNC-TDGATDKNRDVIVAFPDVDPSIISVTEG 300
           W   Y+ RRYG        AW++L  T Y+    G ++    +  A P++D         
Sbjct: 460 WFADYASRRYGGPDTRAATAWDLLRQTAYSMPASGWSEAQDSLYAARPNLD--------- 510

Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
                           +  +++    L Y  +   +AL+  +     L  ++ YR+DL+D
Sbjct: 511 ---------------AATAATWSPASLRYQQATFGKALDELLNVDPALRGTDAYRFDLVD 555

Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
           + RQAL   +  L   I  AY   D      L+ R++  +  +D LLA    FLLGPWLE
Sbjct: 556 VARQALTNTSRTLLPQIKTAYTNRDRTQFTTLTSRWMSNMTLLDKLLATDSRFLLGPWLE 615

Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
           +AK  A  + ++ + E++AR+ IT W      +   L DY N+ WSGL+ D+Y  R   Y
Sbjct: 616 AAKSWAGTDTEQARLEYDARSLITTWGPRAGSDDGRLHDYANREWSGLVSDFYAKRWKQY 675

Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD 524
           F  +  ++ +G      D    W    + W   RN YP    GD
Sbjct: 676 FDSLNTAMNTGGQPASID----WFAAEDGWAKQRNPYPTTPAGD 715


>gi|282877909|ref|ZP_06286718.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
           35310]
 gi|281299910|gb|EFA92270.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
           35310]
          Length = 717

 Score =  186 bits (471), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 140/518 (27%), Positives = 230/518 (44%), Gaps = 71/518 (13%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    + PAF+G VP A     P      L  W            Y+L      F ++G+
Sbjct: 227 LGMHPIAPAFAGFVPKAFAKKHPEINFKHL-RWGGFADS---LNAYVLPPESSYFKQLGK 282

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAV 116
            FIE+  +E+G  ++ Y  D+F+E   PV+  +       ++  G AIY  + +G+  A+
Sbjct: 283 LFIEEWEREFGENTY-YLSDSFNEMKLPVNPNDEEEKCRLLAEYGKAIYQSINAGNPHAI 341

Query: 117 WLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAE-------VKPIWSTSKQFYG 168
           W+ QGW F Y   FW    + ALL+ VP  +++++DL  +        +  W     FYG
Sbjct: 342 WVTQGWTFGYQHDFWNRKSLSALLSQVPNDRMIIIDLGNDYPKWVWHTEQTWKRHNGFYG 401

Query: 169 VPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLM 227
             +I+  + NF G   + G L+  A     A ++ N   +VG+G + EG+E N VVY+L+
Sbjct: 402 KQWIFSYVPNFGGKTLLTGDLEMYATDASLALSAANKGNLVGIGSAPEGLENNEVVYELL 461

Query: 228 SEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAF 287
           S+ A+  + +++  WI  Y + RYG+    ++ AWN    +VY+            + ++
Sbjct: 462 SDAAWTDKGINLDEWIANYCMARYGKYPDKMKAAWNGFRKSVYSS-----------LYSY 510

Query: 288 PDVD-PSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGN 346
           P     ++I  T  K                  S +D    ++      +A+E F++  +
Sbjct: 511 PRFTWQTVIPDTRRK------------------SRHDLNETYF------KAVEDFLSCAD 546

Query: 347 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGL 406
           EL  +  Y+ D I    Q L   A+  + N +    LN      +   + +EL+   D +
Sbjct: 547 ELGGAKFYQDDAILFAAQYLGAKADIYYENALRYGSLNKHVEANKQLSKAIELLLFADKI 606

Query: 407 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 466
           LA H    L  W+  A+      +++ QYE NA+  IT W  + +       DY  + WS
Sbjct: 607 LASHPTDRLDVWIAKARSQGHTPQEKNQYEANAKRLITTWGGHQE-------DYAARCWS 659

Query: 467 GLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 504
           GL++DYY PR  IYF        S     L  W   WI
Sbjct: 660 GLIKDYYIPRIQIYF--------SNQRKMLDQWEENWI 689


>gi|331092442|ref|ZP_08341267.1| hypothetical protein HMPREF9477_01910 [Lachnospiraceae bacterium
            2_1_46FAA]
 gi|330401285|gb|EGG80874.1| hypothetical protein HMPREF9477_01910 [Lachnospiraceae bacterium
            2_1_46FAA]
          Length = 1598

 Score =  185 bits (469), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 140/533 (26%), Positives = 245/533 (45%), Gaps = 70/533 (13%)

Query: 8    VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
            VL  +SG VP  ++    SA++ + G W S +         +L      F +  + F + 
Sbjct: 717  VLQGYSGMVPTNIREKDSSAEVIEQGTWCSFRRPD------MLKTDSASFDKYAKLFYQA 770

Query: 68   QLKEYGRTSHIYNCDTFDENTPPVD-SPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
            Q + YG ++H Y  D F E       +P  I+     +   M   D D +W++Q W    
Sbjct: 771  QKEVYGESAHYYATDPFHEGGDTGGLNPTVIA---GKVLDAMLEADKDGIWIIQSW---- 823

Query: 127  DPFWRPPQMKALLNSVPLGK--LVVLDLFAEVKPIWSTSK-------QFYGVPYIWCMLH 177
                +     ALL  +   K   +VLDL+AE  P W+ +        +F   P+++CML+
Sbjct: 824  ----QGNPTTALLKGLEGRKEHALVLDLYAEKTPHWNETNPNEYGGGEFNDTPWVFCMLN 879

Query: 178  NFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQ---H 234
            NF G + ++G LD++A   + A  +    M G+G++ E    NP++YD + E  +     
Sbjct: 880  NFGGRLGLHGHLDNLAKN-IPAALNSAKHMEGIGITPEASVNNPLLYDFLFETVWTDNAK 938

Query: 235  EK---VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVA 286
            EK   +D+  W+  Y+ RRYG+   +  +A  ++  TVY         GA +    V+ A
Sbjct: 939  EKLPVIDLDKWLKDYAKRRYGKESQSAYEALLIMKDTVYKAELNMKGQGAPES---VVNA 995

Query: 287  FPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGN 346
             P +D    S        +G  V                 + Y  +++ +A EL +   +
Sbjct: 996  RPALDIGAAST-------WGNAV-----------------ISYDKAKLEKAAELLLKDYD 1031

Query: 347  ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGL 406
            +L  S+ Y YDL  + +Q L+  A E    +  A++ N+       + +FL +++ M+ +
Sbjct: 1032 KLKDSDGYMYDLATMLQQVLSNSAQEYQRKMANAFKENNKEEFNTYADKFLSIIDSMEKV 1091

Query: 407  LACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKY 464
             +    +LLG W+E AK LA+N +   +  YE+NA+  +T W    Q E   L+DY N+ 
Sbjct: 1092 TSTSKYYLLGTWVEQAKALAKNADDFTKDLYEFNAKALVTTWGSINQAEGGGLKDYSNRQ 1151

Query: 465  WSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK--DWRREWIKLTNDWQNGRN 515
            WSGLL+D+Y  R   + +   + L+      +   +W  +W++   ++ N  N
Sbjct: 1152 WSGLLKDFYKVRWQKWIQARNDELDGKQPENINWFEWEWKWVRENTEYTNTPN 1204


>gi|187735714|ref|YP_001877826.1| alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
           BAA-835]
 gi|187425766|gb|ACD05045.1| Alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
           BAA-835]
          Length = 852

 Score =  183 bits (465), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 137/537 (25%), Positives = 236/537 (43%), Gaps = 52/537 (9%)

Query: 8   VLPAFSGNVPAALQ-NV-FPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
           VL  + G VP+  Q NV     K+   G W + +        +++D T   F ++   + 
Sbjct: 235 VLQGYVGFVPSDFQENVRIDGLKLIPQGEWVNFRR------PWVVDPTCEAFPKLAADWY 288

Query: 66  EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           +   K YG    ++  D F E     D    ++     +   MQ     A W++Q W   
Sbjct: 289 KALRKVYGIPGKMFGGDLFHEGGRKGDID--VTQAAQEVQKAMQKASPGAFWVIQAW--G 344

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
            +P       + LL+ +   + +VL L  ++       + F G+P++WC L NF GN  M
Sbjct: 345 GNP------TRELLSGLDPERALVLQLTKDMANGGKNLRTFNGIPWVWCELANFGGNTGM 398

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
           YG +  ++    E    ++  +VG+G   EG+E NP+ Y L S+  +  E + V+ W+ +
Sbjct: 399 YGGVPLLSRLGSELSGYKDKGLVGMGTLSEGLETNPLHYALFSDRLWTREDISVREWLGK 458

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           Y+ +RYG +  A+  A  VL  ++YN           +I A P  +              
Sbjct: 459 YARQRYGFAPKAVVKALEVLSFSIYNPVRSQEGCTESIICARPSWN-------------- 504

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLT 362
                       + S++     +Y   ++++A   ++ + N+   L    T+RYDL+D+ 
Sbjct: 505 ----------VRKASTWSSGERYYHLGDIVKAARGYLKAANDQPNLVKKETFRYDLVDVV 554

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
           RQALA  A      +  A+   D     +  +RFL L+ DMD LLA    FLLG W + A
Sbjct: 555 RQALADAAFYQLQQVRSAFDSGDLAAYRKQVKRFLSLISDMDALLATDSQFLLGTWQKRA 614

Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
                + +++   + +A+  IT W D        L DY N+ W+GL+ D+Y PR   +F+
Sbjct: 615 LDWGDSRQEKALMDKSAKMLITTWIDQVPRS---LNDYSNRQWAGLVSDFYLPRWKNFFE 671

Query: 483 YMIESLESGDGFRLKDWRREWIKLTND---WQNGRNVYPVESNGDALITSQWLYNKY 536
           + ++ L +G   R         K+  D   +     +Y V+  GD L  +  + N +
Sbjct: 672 FQMDVL-TGKKTRDAAHAAFMDKMVRDELAFAGNGKIYSVKPAGDTLAVANRVMNTH 727


>gi|302546018|ref|ZP_07298360.1| LOW QUALITY PROTEIN: putative alpha-N-acetylglucosaminidase
           [Streptomyces hygroscopicus ATCC 53653]
 gi|302463636|gb|EFL26729.1| LOW QUALITY PROTEIN: putative alpha-N-acetylglucosaminidase
           [Streptomyces himastatinicus ATCC 53653]
          Length = 679

 Score =  182 bits (462), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 138/513 (26%), Positives = 214/513 (41%), Gaps = 49/513 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V P + G VP    +  P A     G+W  ++  P W     LD     F E+  AF   
Sbjct: 214 VFPGYFGTVPDGFADRNPGAHTVPQGDWNGLRR-PDW-----LDPRTDAFHEVAAAFYRH 267

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           Q   +G    ++  D   E  N   V  P+       A+   +Q+    A+W++ GW  +
Sbjct: 268 QHDLFG-ACDLFKMDLLHEGGNAGDVSVPDAAR----AVEKALQTSRPGAIWVILGWQSN 322

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
                     + LL++V    ++V+D  +++  I    K +  VPY +  + NF G   +
Sbjct: 323 --------PRRDLLDAVDHDHMLVVDGLSDLDTITDREKDWGSVPYAFGTIPNFGGRTTI 374

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
                         R    + +VG     E +E++P  Y+L SE+A++   VD  AW   
Sbjct: 375 GAKTHMWTERFTVWRDKPGSKLVGTAYMPEAVERDPAAYELFSELAWRDTAVDRDAWFRD 434

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           Y+  RYG      ++A+  L  T Y  +      +  V  A P       S+T     NY
Sbjct: 435 YADVRYGARDAKAREAFAALRDTAYQISSKDGRPHDSVFAARP-------SLTARSGTNY 487

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
                      + T ++D        +    AL   +     L  S+ YRYDL D  RQA
Sbjct: 488 A----------THTPAFD-------PARFDAALAALLGVRAGLRDSDAYRYDLADTARQA 530

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
           LA  + +L   + +AY   D      LSR +L+L+   D +   H   LLGPWLE AK++
Sbjct: 531 LANRSWQLIGQLADAYARKDLDTFRALSRLWLKLMRLSDDITGTHRLLLLGPWLEDAKRM 590

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
           A   E+  Q E+ AR  IT W D    +   L +Y N+ W+GL+ D++ P+   Y   + 
Sbjct: 591 ASGAEESAQLEFAARALITTWADRGAADPGKLANYANRDWNGLIGDFHVPQWQTYLDELE 650

Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYP 518
           ++L  G   R  D    W  +   W   R  YP
Sbjct: 651 DALAEGRAPRTFD----WYTVEEPWTRERKSYP 679


>gi|386386798|ref|ZP_10071901.1| alpha-N-acetylglucosaminidase [Streptomyces tsukubaensis NRRL18488]
 gi|385665738|gb|EIF89378.1| alpha-N-acetylglucosaminidase [Streptomyces tsukubaensis NRRL18488]
          Length = 1033

 Score =  182 bits (461), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 137/522 (26%), Positives = 230/522 (44%), Gaps = 54/522 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP + G VP    +  P A++   G W  ++  P W     LD    +F ++  AF   
Sbjct: 250 VLPGYFGTVPDGFADRNPGARVVAQGLWNGLRR-PDW-----LDPRTTVFPKVAAAFYRH 303

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q K +G    ++  D   E     D P  +     A+   +++   +AVW++ GW  +  
Sbjct: 304 QTKLFG-ACDLFKMDLLHEGGNAGDVP--VPDAARAVEKALRTARPNAVWVILGWQSN-- 358

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
                   +ALL++V   +++++D  +++        ++ G PY +  + NF G   +  
Sbjct: 359 ------PRRALLDAVDKRRMLIVDGLSDLDTTGDRESEWGGTPYAFGTIPNFGGRTTLGA 412

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
             D         R    + +VG     E  E++P  ++L SE+A++ E++D +AW  +Y+
Sbjct: 413 NTDRWTDRFTVWRDRPGSALVGTAYMPEAAERDPAAFELFSELAWRRERIDREAWFTEYA 472

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
             RYG    +   A+  L  T Y                        ++ T+G+   Y  
Sbjct: 473 QIRYGSDDASAAAAFGALAATAYR-----------------------LASTDGR--PYDS 507

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
              +   L S   +   P  + +      A    +A+G EL  S+TYR+DL +L RQALA
Sbjct: 508 HFLRRPSLTSSIGTAFDPAGFDT------AFAALLAAGPELRDSDTYRHDLTELARQALA 561

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
             +  L   +  A    D      +S  +L+L+   D +  CH  FLLGPWLE AK+LA 
Sbjct: 562 NRSRTLQFALRAARASKDVAAFRGVSALWLKLMRLADTMAGCHRSFLLGPWLEDAKRLAT 621

Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
           +  +  + E  AR  IT W D  +  A+ L +Y N+ W+GL+ D + P+   +   + ++
Sbjct: 622 SPAEAVELERTARALITTWAD--RPAANALSNYANRDWNGLIADVHVPQWDAFLTEVADA 679

Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITS 529
           LE+G   +  DW  +       W   R VYP    GD   T+
Sbjct: 680 LEAGRAPKSFDWYPQ----EEAWTKDRRVYPSAPTGDPYATA 717


>gi|147860882|emb|CAN83148.1| hypothetical protein VITISV_031934 [Vitis vinifera]
          Length = 562

 Score =  181 bits (459), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 82/92 (89%), Positives = 89/92 (96%)

Query: 82  DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 141
           DTFDENTPPVD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYDPFWRPPQMKALL+S
Sbjct: 429 DTFDENTPPVDDPEYISSLGAAIFKGMQSGDSNAIWLMQGWLFSYDPFWRPPQMKALLHS 488

Query: 142 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW 173
           VP+G+LVVLDLFAEVKPIW TS+QFYGVPYIW
Sbjct: 489 VPMGRLVVLDLFAEVKPIWITSEQFYGVPYIW 520



 Score =  121 bits (303), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 54/63 (85%), Positives = 60/63 (95%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSGNVPAAL+ +FPSAKIT+LGNWF+V  +PRWCCTYLLDATDPLFIEIGRAFI+Q
Sbjct: 272 VLPAFSGNVPAALKYIFPSAKITRLGNWFTVGGNPRWCCTYLLDATDPLFIEIGRAFIQQ 331

Query: 68  QLK 70
           QLK
Sbjct: 332 QLK 334


>gi|261880009|ref|ZP_06006436.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
 gi|270333325|gb|EFA44111.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
          Length = 722

 Score =  181 bits (459), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 136/518 (26%), Positives = 217/518 (41%), Gaps = 69/518 (13%)

Query: 3   LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
           L    + PAF+G VP       P  ++ +L  W     D R    Y+L    P F +IGR
Sbjct: 226 LGMHPIAPAFAGFVPEQFVEAHPGLQVKKL-TWGGF--DDR-LNAYVLSPESPYFKQIGR 281

Query: 63  AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISS------LGAAIYSGMQSGDSDAV 116
            F+E+  KE+G+ +  Y  D+F+E   PV+  + I         G  IY  +   + DAV
Sbjct: 282 LFVEEWEKEFGKNT-FYQSDSFNEMEIPVEPGDSIGKWKLLEQYGDVIYRSIAEANPDAV 340

Query: 117 WLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDL-------FAEVKPIWSTSKQFYG 168
           W+ QGW F Y    W    ++ALL  VP  K++++DL         + +  W     +YG
Sbjct: 341 WVTQGWTFGYQHKMWDSKSLQALLRHVPDDKMLIIDLANDYPKWIWKTQQTWKVQHGYYG 400

Query: 169 VPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLM 227
             +++  + NF G     G +   A    EA   SE   MVG G + EGIE N V+Y+L+
Sbjct: 401 KQWVFSYVPNFGGKTLPTGDMQMYASASAEALHHSERGNMVGFGSAPEGIENNDVIYELL 460

Query: 228 SEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAF 287
           ++M +  + VD+  WI  Y   RYG     +Q AW  +  +VY             + ++
Sbjct: 461 ADMGWTDKAVDLDLWIKDYCEARYGGYPSDMQKAWQCMLRSVYGS-----------LYSY 509

Query: 288 PDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE 347
           P      ++    +       VS  A+  +  S   H                F+    +
Sbjct: 510 PRFTWQTVTPDSRR-------VSTHALNDTFLSGVAH----------------FLRCARQ 546

Query: 348 LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLL 407
           L +S  YR D I L    L   A+  +   ++               + ++L+   D LL
Sbjct: 547 LGSSPLYRSDAISLASLYLGTKADRHYTKALDLKASGKQQAASAELHQTIDLLTKADRLL 606

Query: 408 ACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSG 467
           A H    L  W++ A+       ++ +YE +A+  IT+W            DY  ++W+G
Sbjct: 607 ASHPTHRLDRWIQFARNHGITTAEKNRYESDAKRLITIW-------GGFQEDYAARFWNG 659

Query: 468 LLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 505
           L+  YY PR   YF +   +        L  W  +W+K
Sbjct: 660 LIAHYYIPRIRYYFDHGRPA--------LMQWEEQWVK 689


>gi|297194750|ref|ZP_06912148.1| alpha-N-acetylglucosaminidase [Streptomyces pristinaespiralis ATCC
           25486]
 gi|297152431|gb|EFH31740.1| alpha-N-acetylglucosaminidase [Streptomyces pristinaespiralis ATCC
           25486]
          Length = 816

 Score =  179 bits (453), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 140/529 (26%), Positives = 229/529 (43%), Gaps = 55/529 (10%)

Query: 8   VLPAFSGNVPAALQNVFPS--AKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
           VLP + G VP       P   A++   G W      P W     LD     F ++  AF 
Sbjct: 28  VLPGYFGTVPDDFPGHNPGSDARVIPQGTWGGGMRRPDW-----LDPRTQAFSDVAAAFY 82

Query: 66  EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
             Q + +G  SH +  D   E     D P  +     A+ + +Q+    A W++ GW  +
Sbjct: 83  RHQGELFGDVSH-FKMDLLHEGGTAGDVP--VPDAARAVETSLQTARPGATWVILGWQSN 139

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             P         +L+S+   +++++D  +++  +      + G PY +  + NF G   +
Sbjct: 140 PRPV--------MLDSIDTSRVLIVDGLSDLDTVTDREADWGGAPYAFGTIPNFGGRTTI 191

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
               D         R    + +VG     E  E++P   +L SE+A++ EK+D +AW  +
Sbjct: 192 GANTDRWTEKFTAWRDKPGSALVGTAYMPEAAERDPAALELFSELAWREEKIDREAWFAE 251

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           Y+  RYG    + ++A+  L  T Y  T                        T+G+   Y
Sbjct: 252 YAQIRYGGVDHSAREAFAALAATAYKLTS-----------------------TDGR--PY 286

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
               S+   L +   +   P      +   RA    +A    L  S+ YR+DL D+ RQA
Sbjct: 287 DSLFSRRPSLTTAIGTAFDP------AGFDRAFAALLAVRAPLRDSDAYRHDLTDVARQA 340

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
           LA  +  L L +  AY+  D      +S  +L+++   D +  CH  FLLGPWLE AK+L
Sbjct: 341 LANRSRTLQLALRAAYRNKDVATFRAVSALWLKVMRLSDTMAGCHRQFLLGPWLEDAKRL 400

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
           A + E+  Q E  ART IT W D  +  A+ L +Y N+ W GL+ D + P+   +     
Sbjct: 401 ATSPEEAVQLERTARTLITTWAD--RPTANSLSNYANRDWQGLMADVHVPQWEAFLTEQA 458

Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYN 534
           +++ +G   +  DW  +       W   R+ YPV   GDA  T+  +++
Sbjct: 459 DAMAAGRAPKSFDWYPQ----EEAWTQERHTYPVRPTGDAYSTALRVFD 503


>gi|289667570|ref|ZP_06488645.1| N-acetylglucosaminidase [Xanthomonas campestris pv. musacearum
           NCPPB 4381]
          Length = 798

 Score =  177 bits (450), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 150/541 (27%), Positives = 232/541 (42%), Gaps = 86/541 (15%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP A     P A+I ++  W           TY LD  DPLF ++ R F+E 
Sbjct: 238 VLPAFAGYVPKAFAQAHPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFMEL 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVD-------SPEYISSL-------------------- 100
             + YG T   Y  D F+E  PPV        + +Y  S+                    
Sbjct: 292 YTQAYG-TGEFYLADAFNEMLPPVADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARL 350

Query: 101 ---GAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
              G A+Y  +   +  A W+MQGWLF  D  FW+P  + A L  VP  +L+VLD+  + 
Sbjct: 351 AEYGQALYRSIAQVNPKATWVMQGWLFGADREFWQPQAIAAFLGKVPDARLLVLDIGNDR 410

Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
            P  W  S+ F    +I+  +HN+  +  +YG     AF   + +     SE   + G G
Sbjct: 411 YPGTWKASQAFDNKQWIYGYVHNYGASNPLYG---DFAFYRQDLQALLADSEKRNLRGFG 467

Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
           +  EG+  N VVY+ +  +A++  +     W+ QY   RYGRS  A+  AW+ L   +Y 
Sbjct: 468 IFPEGLHSNSVVYEYLYALAWEGPQQPWSQWLTQYLRARYGRSDAALLSAWSDLEAGIYQ 527

Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
               +               P   +   G Y  + +P +   ++K +    D   L    
Sbjct: 528 TRYWS---------------PRWWNKRAGAYLLFKRPTAD--IVKFDDRPGDPQRL---- 566

Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVF 390
               RA++  +      + +  YRYDLI+  R  L+  A+     +++AY   D A G  
Sbjct: 567 ---RRAIDALLQQAERYADAPLYRYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDV 623

Query: 391 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ-EKQYEWNARTQITMWFDN 449
           QL+ R  +LV+ +D L+       L  W   A   A N+    + Y  NAR Q+++W  +
Sbjct: 624 QLA-RITQLVQGLDALVGGQHE-TLADWTGQAAAAAGNDAGLRRAYVGNARAQVSVWGGD 681

Query: 450 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRRE 502
                  L DY +K W G+  D+Y  R   +      + ++G  F       +L  W R 
Sbjct: 682 GN-----LADYASKAWQGMYADFYLQRWTRFLSAYRAARKAGTPFEAAAVDQQLATWERH 736

Query: 503 W 503
           W
Sbjct: 737 W 737


>gi|255079272|ref|XP_002503216.1| GH family 89 protein [Micromonas sp. RCC299]
 gi|226518482|gb|ACO64474.1| GH family 89 protein [Micromonas sp. RCC299]
          Length = 1260

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 157/554 (28%), Positives = 231/554 (41%), Gaps = 140/554 (25%)

Query: 28  KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 87
           K  +LG +   K D      + LD +D LF  +G AF +Q ++++G T H+Y  DTF E 
Sbjct: 396 KDAELGKY--AKKDDSVRSVHFLDPSDALFQSLGAAFTKQLVEDFG-TDHLYLADTFREI 452

Query: 88  TPPVD--SPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPL 144
             P D  S  ++  +GAA  + M+S D  A W+ Q   F  +P FW   +  ALL SV +
Sbjct: 453 RDPNDDFSETHVVRVGAATLAAMRSADPRATWVFQSDAFRRNPRFWNEGRRGALLRSVDI 512

Query: 145 GKLVVLDLFAEVKPIW-STSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA---- 199
           G ++VLD  AE  P +      F G P++WC+ HN  GN+ M G L +IA GP  A    
Sbjct: 513 GDMLVLDSAAETDPYYLREPVHFAGQPFVWCVKHNHGGNLGMRGRLSAIATGPAAAMDSL 572

Query: 200 -----------------------------RTSENTT----------MVGVGMSMEGIEQN 220
                                        R S   T          +VG G++ EG+EQN
Sbjct: 573 ASRRDGERGTTHGRGTRVGSSRRMLADNKRVSREATHGSRKVGKSQLVGFGITAEGVEQN 632

Query: 221 PVVYDLMSEMAFQHEKVDVKAWINQ--------YSVRRYGRSV----------------- 255
           PVVY+L +  +   + VDV  +++         YSVR+   +                  
Sbjct: 633 PVVYELAALTSQSEKGVDVDWFLSDYSRRRYGGYSVRQPAPTTLPVGTGQGAFLAGFIVG 692

Query: 256 ------------------PA-------------IQDAWNVLYHTVYNCTDGATDKN--RD 282
                             PA              ++AW +L  TVY       D++  RD
Sbjct: 693 NNPIAGSPGYLGPGEWYDPAKHGEMGKEEAYDRAREAWEILGKTVYGARAKGEDEDHVRD 752

Query: 283 VIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI 342
                P +    +S     Y +  K V                   Y+   +I A     
Sbjct: 753 ACSWQPSLRADELSP---DYFDAAKVVD------------------YAFKPLIDAAPTLR 791

Query: 343 ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVED 402
           A+G    A     YD++D+ RQ LA+ +N L   I ++   N+A        + LEL++D
Sbjct: 792 ANG----AGTRVDYDIVDVGRQLLARQSNVLATQIRDSLNSNNASEAKMYGTQMLELLDD 847

Query: 403 MDGLLACHDGFLLGPWLESAKQLA---QNEEQEKQYEWNARTQITMWFDNTQEEASLL-- 457
           MD LL  H GFLLG ++ESAK  A     E  E   E +AR+ I+ +  +  +  + L  
Sbjct: 848 MDALLRSHKGFLLGNYIESAKSWAGKRNKESDEANLERSARSLISGFGPSGSKLGAPLGH 907

Query: 458 --RDYGNKYWSGLL 469
              DY N+ WSG+L
Sbjct: 908 PMHDYSNRQWSGML 921


>gi|118370728|ref|XP_001018564.1| alpha-N-acetylglucosaminidase precursor [Tetrahymena thermophila]
 gi|89300331|gb|EAR98319.1| alpha-N-acetylglucosaminidase precursor [Tetrahymena thermophila
           SB210]
          Length = 879

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 150/514 (29%), Positives = 241/514 (46%), Gaps = 60/514 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +L  F G VP +L+  FP +KI    +W   +          LDA D LF  I   F  +
Sbjct: 255 ILQGFYGMVPNSLKAKFPLSKIYGDQSWLGFRRPA------FLDANDELFSNIANIFYSE 308

Query: 68  QLKEYGRTSHIYNCDTFDEN--TPPVDSPEYISSLGAAIYSGMQ----SGDSDAVWLMQG 121
             K YGR +  Y  D F E    P ++    ++S   +IY  MQ      D    W++Q 
Sbjct: 309 SEKLYGR-AKFYGGDPFHEGAIVPGLN----LTSQAQSIYRAMQYTDNPKDEKVKWILQS 363

Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
           W  +      P Q   LL  +   + ++LDL AE +  W T+  F G  ++W  L NF  
Sbjct: 364 WQEN------PSQQ--LLQGLQNDECIILDLMAEARSKWQTND-FSGHDFLWTSLPNFGL 414

Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVD--- 238
            I  YG+++     P  A + +N+TM G+G   EGI  N + Y+++ + A+     D   
Sbjct: 415 RIGQYGMIEQYVSQPPLAYSIKNSTMKGIGSIPEGILTNVLDYEILFDKAWIQPNQDTNL 474

Query: 239 -----VKAWINQYSVRRYG-RSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
                V  ++  +   RYG ++   +  AW++L +++YN T+     +  V++A P    
Sbjct: 475 TPRQQVLQYLGDFIRYRYGEQNNKNLFSAWSLLTNSIYNSTNPWDGPSESVMLARPASYI 534

Query: 293 SIISVTEGKYQNYGKPVSKEA------VLKSETSSYDHPHLWYSTSEVIRAL-------E 339
             +S     Y  +      EA       +K +       HL     E+ + L       E
Sbjct: 535 DKVSSWGTSYIYWNTTNVLEAWKLFTNYVKEKKQKNRSQHL-QKLEEINKKLGRSDDDME 593

Query: 340 LFIA-SGNELSA--SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRF 396
            F+  S NE      +T+ YDL+D+ RQ LA Y+  L+  ++ A+   D       S++F
Sbjct: 594 AFVEISQNEERNIFKDTFLYDLVDVARQNLASYSYLLYNKVMLAFNQTDTIKFALYSQQF 653

Query: 397 LELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASL 456
           LEL++D D LL+    F+LG +LES  +L   +++++ +    + QIT+W D      S 
Sbjct: 654 LELIKDQDQLLSSRKEFMLGYYLESVSKLGTTDQEKQNFIEQIKRQITVWSD----FPSD 709

Query: 457 LRDYGNKYWSGLLRDYYGPRAAIYFK----YMIE 486
           L DY NK W+G+L+D+Y PR  +YFK    Y++E
Sbjct: 710 LHDYANKEWNGILKDFYLPRWELYFKSLQSYIVE 743


>gi|289663931|ref|ZP_06485512.1| N-acetylglucosaminidase [Xanthomonas campestris pv. vasculorum
           NCPPB 702]
          Length = 798

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 150/541 (27%), Positives = 232/541 (42%), Gaps = 86/541 (15%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP A     P A+I ++  W           TY LD  DPLF ++ R F+E 
Sbjct: 238 VLPAFAGYVPKAFAQAHPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFMEL 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVD-------SPEYISSL-------------------- 100
             + YG T   Y  D F+E  PPV        + +Y  S+                    
Sbjct: 292 YTQAYG-TGEFYLADAFNEMLPPVADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARL 350

Query: 101 ---GAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
              G A+Y  +   +  A W+MQGWLF  D  FW+P  + A L  VP  +L+VLD+  + 
Sbjct: 351 AEYGQALYRSIAQVNPKATWVMQGWLFGADREFWQPQAIAAFLGKVPDARLLVLDIGNDR 410

Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
            P  W  S+ F    +I+  +HN+  +  +YG     AF   + +     SE   + G G
Sbjct: 411 YPGTWKASQAFDNKQWIYGYVHNYGASNPLYG---DFAFYRQDLQALLADSEKRNLRGFG 467

Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
           +  EG+  N VVY+ +  +A++  +     W+ QY   RYGRS  A+  AW+ L   +Y 
Sbjct: 468 IFPEGLHSNSVVYEYLYALAWEGPQQPWSQWLMQYLRARYGRSDAALLSAWSDLEAGIYQ 527

Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
               +               P   +   G Y  + +P +   ++K +    D   L    
Sbjct: 528 TRYWS---------------PRWWNKRAGAYLLFKRPTAD--IVKFDDRPGDPQRL---- 566

Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVF 390
               RA++  +      + +  YRYDLI+  R  L+  A+     +++AY   D A G  
Sbjct: 567 ---RRAIDALLQQAERYADAPLYRYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDV 623

Query: 391 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ-EKQYEWNARTQITMWFDN 449
           QL+ R  +LV+ +D L+       L  W   A   A N+    + Y  NAR Q+++W  +
Sbjct: 624 QLA-RITQLVQGLDALVGGQHE-TLADWTGQAAAAAGNDAGLRRAYVGNARAQVSVWGGD 681

Query: 450 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRRE 502
                  L DY +K W G+  D+Y  R   +      + ++G  F       +L  W R 
Sbjct: 682 GN-----LADYASKAWQGMYADFYLQRWTRFLSAYRAARKAGTPFDAAAVDQQLATWERH 736

Query: 503 W 503
           W
Sbjct: 737 W 737


>gi|418473272|ref|ZP_13042874.1| putative alpha-N-acetylglucosaminidase, partial [Streptomyces
           coelicoflavus ZG0656]
 gi|371546106|gb|EHN74664.1| putative alpha-N-acetylglucosaminidase, partial [Streptomyces
           coelicoflavus ZG0656]
          Length = 716

 Score =  176 bits (445), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 131/511 (25%), Positives = 216/511 (42%), Gaps = 44/511 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP + G VP       P   +   G W   +  P W     LD    +F  +  +F   
Sbjct: 250 VLPGYYGTVPPDFTARNPGGTVVPQGQWVGFER-PDW-----LDPRTGVFSRVAASFYRH 303

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + +G  S +Y  D   E   P + P  +     A+ + +Q+    AVW + GW  +  
Sbjct: 304 QRELFG-DSTMYKMDLLHEGGRPGNVP--VGDAARAVMNALQTARPGAVWTLIGWQNN-- 358

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
                P  + ++++V   +L+++D  ++          ++G PY +  + NF G+  +  
Sbjct: 359 -----PSTQ-IIDAVDKSRLLIVDGLSDRYDGLDRETAWHGAPYAFGTIPNFGGHTTVGA 412

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
                A      RT   + + G+    EG   NPV Y+L +E+A++ E VD   W   Y+
Sbjct: 413 NTAVWAERFDRWRTEPGSALAGIAYLPEGTGGNPVAYELFTELAWRTEPVDHSGWFAAYA 472

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
            RRYGR  P    AW +L    Y+   G   + +D +                       
Sbjct: 473 ERRYGRPDPHAARAWELLRTGPYSMPSGTWSEAQDSLF---------------------- 510

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
             ++  +  +  +S+    + Y    V  AL   +     L  ++ YR+DL+D+ RQALA
Sbjct: 511 -TARPRLTATSAASWSPGAMRYDPDTVRAALAELLKVAPALRTTDAYRFDLVDVARQALA 569

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
             +  L   I  AY   D       +  + + ++ +D LLA    FLLGPWL  A+   +
Sbjct: 570 NRSRSLLPEIKAAYDAGDLSRFRAGAAEWKDDLDLLDRLLATDSRFLLGPWLADARSWGR 629

Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
              ++   E++AR+ +T W   +  +A  LRDY N+ WSGL+ D+Y  R   Y   +  +
Sbjct: 630 TAAEKDAAEFDARSLLTTWGHRSGSDAGGLRDYANREWSGLVSDFYAMRWTTYLDSLDTA 689

Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYP 518
           L +G      D    W  L N W    + YP
Sbjct: 690 LVTGRPPAAID----WFSLENAWNQRHDDYP 716


>gi|257067709|ref|YP_003153964.1| Alpha-N-acetylglucosaminidase (NAGLU) [Brachybacterium faecium DSM
           4810]
 gi|256558527|gb|ACU84374.1| Alpha-N-acetylglucosaminidase (NAGLU) [Brachybacterium faecium DSM
           4810]
          Length = 768

 Score =  175 bits (444), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 141/547 (25%), Positives = 232/547 (42%), Gaps = 60/547 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP F G +PA L     + ++     W +  + P           DPLF E   +    
Sbjct: 238 VLPGFGGQLPAELVG---TERMIDWQGWHNALAAP----------GDPLFAEAAASLHRH 284

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q +  G T H Y  D + E+ PP  SP+ ++    AI++ M+  D  AVW++QGW F Y 
Sbjct: 285 QRQLLG-TDHHYAVDPYIESLPPTTSPQQLAEHAEAIFTAMRDADPQAVWILQGWPFHYR 343

Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
             +W   ++ +LL+ VP  +L++LDL+ E  P+W  +   YG  ++WC+ H F G   ++
Sbjct: 344 AAYWTEERVHSLLSRVPEDRLILLDLWGEHAPMWHRTAAMYGRRWLWCLAHTFGGRFGLF 403

Query: 187 GIL----DSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
           G L    D +      A       + G G++ E ++ N VVY+L +  A        + W
Sbjct: 404 GDLAALDDDLRGLRTAAEAGTRGRLEGFGITSEALDDNAVVYELATR-ALWSPMPPRERW 462

Query: 243 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
           + ++ +RRYG + P +Q AW V+ HT+Y    G T      ++A P              
Sbjct: 463 LEEHIIRRYGTAAPEVQQAWQVIAHTLYGP--GRTRSTPSPLIARP-------------- 506

Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSE--------------VIRALELFIASGNEL 348
              G P + + +        D P      +E               +R+L   + SG   
Sbjct: 507 WTRGLPFASQRLAGEALPDADGPPSANIDAENDAEMLGALAPLAHAVRSLLPVLRSGEH- 565

Query: 349 SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLA 408
              +    DL  L     A+ A      I+ A    D   +   +     L+  +D + A
Sbjct: 566 --RDALARDLAQLAIHVGAQSARAPLRAIVAAAAEADGERLRAEASTLEALLRAVDAVAA 623

Query: 409 CHDGFLLGPWLESAKQLAQNEEQ-EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSG 467
                L+G W+  A+  A  +E+     E +AR+ I++W   TQ+  S L DY  ++WSG
Sbjct: 624 TRPDMLVGRWIADARAGAGTDERLADALERDARSLISVW--GTQD--SGLHDYSARHWSG 679

Query: 468 LLRDYYGPRAAIYFKYMIESLES-GDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD-A 525
            L D +  R   +  ++  + E       L+    +   +  DW++    YP    G+ A
Sbjct: 680 SLTDLHLARWRAWTDWLARTAEEPSTPPDLEQLHAQIRGIEEDWRDSTAPYPTTPRGEPA 739

Query: 526 LITSQWL 532
              SQ L
Sbjct: 740 AAISQLL 746


>gi|429198382|ref|ZP_19190217.1| alpha-N-acetylglucosaminidase (NAGLU) [Streptomyces ipomoeae 91-03]
 gi|428665917|gb|EKX65105.1| alpha-N-acetylglucosaminidase (NAGLU) [Streptomyces ipomoeae 91-03]
          Length = 747

 Score =  175 bits (443), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 130/518 (25%), Positives = 230/518 (44%), Gaps = 44/518 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V P + G VP       P A+    G+W    + P W     LD     F  +  AF   
Sbjct: 257 VFPGYFGTVPPGFAERNPGARTVPQGDWMGF-ARPDW-----LDPRTNEFKRVAAAFYRA 310

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + +G  S +Y  D   E   P D P  ++     +   +++   DA W++ GW  +  
Sbjct: 311 QDELFGGPSTLYKMDLLHEGGDPGDVP--VADAAKGVERALRAAHPDATWVILGWQHN-- 366

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYG-VPYIWCMLHNFAGNIEMY 186
               PP  +A++++V   +++V+D  ++  P     +  +G  PY +  + NF G+  + 
Sbjct: 367 ----PP--RAIVDAVDKKRMLVVDGLSDRFPTVIDREADWGDTPYAFGSIWNFGGHTALG 420

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
                 A    + RT + + + G+ +  E  + NP  + L SE+A++ +++D+K W +++
Sbjct: 421 ANTPVWAELYEKWRTKDGSKLRGIALMPEAADNNPAAFALFSELAWRKDELDLKTWFSEW 480

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           +  RYG   P  + AW++L  T Y    G T  +R                +EG    +G
Sbjct: 481 AHARYGARDPHAEAAWDILRRTAY----GTTRADR---------------WSEGADGLFG 521

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
              S+ A+     + +    L Y  +E   AL   ++    L +S+ YR DL+D+ RQ L
Sbjct: 522 ---SRPALNTVRAARWSPKQLRYDAAEFEPALGELLSVRPGLRSSSAYRRDLLDVARQTL 578

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
           +  +  L   I  AY   D     +L+  +L L++ +D LLA     LLG W+  A+   
Sbjct: 579 SNRSRVLLPRIRGAYDARDTARFDELTGTWLSLMDLLDRLLATDSAHLLGRWVADARAWG 638

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            ++ + ++  ++  + +T+W      +A  LRDY N+ W+GL+   Y  R + YF+ +  
Sbjct: 639 ASDAERERLAYDNLSLLTVWGTRKGADAG-LRDYANREWAGLVGGLYRLRWSTYFEELRA 697

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD 524
           +L  G   +  D    W  L + W         E  GD
Sbjct: 698 ALREGRTPKKID----WFALEDRWTRAPGRLATEPTGD 731


>gi|453051703|gb|EME99203.1| alpha-N-acetylglucosaminidase [Streptomyces mobaraensis NBRC 13819
           = DSM 40847]
          Length = 763

 Score =  174 bits (441), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 130/526 (24%), Positives = 222/526 (42%), Gaps = 48/526 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP ++G VP         A+    G+W      P W     LD     F  + R +   
Sbjct: 276 VLPGYAGTVPPDFTRRNKGARTVPQGDWAGFPR-PDW-----LDPRTAHFARVARTYYRV 329

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + YG  S +Y  D   E   P   P  + +   A+   +++   DA W + GW  +  
Sbjct: 330 QRELYG-ASSMYKIDLLHEGGTP--GPVPVGAAAKAVEKALRAAHPDATWAILGWQTN-- 384

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
                   + +L++V   K++VLD   +  P +    K + G PY +  + NF G+  M 
Sbjct: 385 ------PRREILDAVDRSKMLVLDGIPDHYPRVTDREKDWGGTPYAFGTIWNFGGHTAMG 438

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
                        RT + + + G+ +  E  + NP    L S++A+   ++D+K W  ++
Sbjct: 439 ANTQDWVSLFHRWRTKKGSALRGIALMPEAADNNPAALALFSDLAWTEGRLDLKDWFARW 498

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCT--DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
            V+RYG + P  + AW+VL  T Y  T  DG ++    +  A PD+              
Sbjct: 499 PVQRYGAADPNARRAWDVLRRTAYGTTRADGWSEAADGLFGARPDL-------------- 544

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                   AV  +  +++    L Y  +    AL   +A    L  S+ YR DL D+ RQ
Sbjct: 545 --------AV--NRAAAWSPRQLRYDAAAFDEALPALLAVAPALRGSSAYRCDLTDVARQ 594

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
            ++  +  L   I  AY   D      L+R++L+ +  ++  +A  +  LLG W+  A+ 
Sbjct: 595 CVSNRSRLLLPRIKAAYDAGDRTRFRTLTRQWLDWMTLLEETVATSERHLLGRWIAEARA 654

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
                 +  + E +A + +T+W      +   L DY N+ W+GL+   Y  R   YF  +
Sbjct: 655 WGGTAAERDRLEHDAVSLLTVWGPRASADGGKLHDYANREWAGLVGGLYRLRWKTYFTEL 714

Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
             +L +    R K    +W  L + W   R  YP + +GD +  ++
Sbjct: 715 EAALTA----RRKPKPIDWYALEDRWTRKRPAYPAKPSGDIVAVAR 756


>gi|290956360|ref|YP_003487542.1| alpha-N-acetylglucosaminidase [Streptomyces scabiei 87.22]
 gi|260645886|emb|CBG68977.1| putative alpha-N-acetylglucosaminidase [Streptomyces scabiei 87.22]
          Length = 732

 Score =  174 bits (440), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 137/519 (26%), Positives = 228/519 (43%), Gaps = 47/519 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V P + G VPA      P A+    G W    + P W     LD     F  +  AF   
Sbjct: 243 VFPGYFGTVPAGFAERVPGARTVPQGRWMGF-ARPDW-----LDPRTDEFARVAAAFYRT 296

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + +G  S +Y  D   E   P D P  ++     +   +Q     A W+M GW  +  
Sbjct: 297 QDEMFG-PSALYKMDLLHEGGDPGDVP--VADAAKGVERALQRAHPGATWVMLGWQHN-- 351

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
               PP  +A++++V    ++V+D  ++  P +      + G PY +  + NF G+  + 
Sbjct: 352 ----PP--RAIVDAVDKQHMLVVDGLSDRFPTVTDREADWGGTPYAFGSIWNFGGHTALG 405

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
                 A    + RT + +T+ G+ +  E  + NP  + L SE+A++  ++D++ W  ++
Sbjct: 406 ANTPDWAALYEKWRTKDGSTLHGIALMPEAADNNPAAFALFSELAWREGELDLETWFAEW 465

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           +  RYG   P  + AW++L  T Y  T                      S +EG    +G
Sbjct: 466 AHARYGARDPHAEAAWDILRRTAYGTT-------------------RADSWSEGADGLFG 506

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
              S+ A+       +    L Y+ ++   AL   +    EL AS+ YR DL+D+ RQAL
Sbjct: 507 ---SRPALTAVRAGRWSPKQLRYNAADFEPALGEMLKVRPELRASSAYRRDLLDVARQAL 563

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
           +  +  +   +  AY   DA  + + SR +L L++ +D L+A     LLG W+  A+  A
Sbjct: 564 SNRSRVMLPQLKAAYDAKDAARLAKGSRDWLSLMDLLDELVATDSRHLLGRWVADARSWA 623

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASL-LRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
               +  +  ++A + +T+W   T+E A   LRDY N+ W+GL+   Y  R A YF+ + 
Sbjct: 624 VGSTERTELAYDALSLLTVW--GTREGADAGLRDYANREWAGLVGGLYRLRWATYFEELR 681

Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD 524
            +L  G   +  D    W  L + W         E  GD
Sbjct: 682 AALAEGRAPKKID----WFALEDRWARNPGTLATEPAGD 716


>gi|333023613|ref|ZP_08451677.1| putative alpha-N-acetylglucosaminidase [Streptomyces sp. Tu6071]
 gi|332743465|gb|EGJ73906.1| putative alpha-N-acetylglucosaminidase [Streptomyces sp. Tu6071]
          Length = 741

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 138/521 (26%), Positives = 231/521 (44%), Gaps = 53/521 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP + G VPA   +  P AK    G W    + P W     LD    LF E+  AF E 
Sbjct: 252 VLPGYFGTVPAGFADRNPGAKTVPQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEI 305

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + YGR + +Y  D   E     + P   ++ G  +   +++   DAVW++ GW  +  
Sbjct: 306 QEELYGRGT-LYKMDLLHEGGSAGNVPVGDATRG--VQRALRAARPDAVWVILGWQKN-- 360

Query: 128 PFWRPPQMKALLNSVPLGKLVVLD----LFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
               PP  K ++ +     ++V+D     F+EV         + G PY +  + NF G+ 
Sbjct: 361 ----PP--KEVVAAADREAMLVVDGLSDRFSEVN---DRESDWQGTPYAFGSIWNFGGHT 411

Query: 184 EMYG-ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
            +     D +   P   R    + + G+ +  E  + NP  ++L +E+ +    VD+  W
Sbjct: 412 ALGANARDWVDLYP-RWRDRSGSRLSGIALMPEAADNNPAAFELFAELPWTEGPVDLTDW 470

Query: 243 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
             +Y+  RYG S    + AW++L  T Y       D+  +         P++ +V+ GK+
Sbjct: 471 FREYARVRYGGSDAHAEAAWDILRTTAYGTRR--DDRWSEPADGLFGARPALDAVSAGKW 528

Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 362
                       L+   +S++             AL+  +A   EL  S TYR DL+D+ 
Sbjct: 529 --------SPKALRYPAASFEP------------ALDELLAVRAELRDSATYRRDLLDVA 568

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
           RQALA  +  L   +  AYQ  +     +L RR++ L++ ++ L+A  +  LLG W+ESA
Sbjct: 569 RQALANRSRTLLPRLAAAYQAKNQAEFARLGRRWIALMDLLEQLVATDENHLLGRWVESA 628

Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           +    +  ++ Q +++A + +T W      +A  LRDY N+ WSGL+   Y  R + Y  
Sbjct: 629 RAWGGSAREKSQLQYDALSLLTTWGTRQGADAG-LRDYANREWSGLVGGLYRLRWSTYID 687

Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNG 523
            +  +L+ G     K    +W  L + W         +  G
Sbjct: 688 ELSAALKEGR----KPVAVDWFALEDRWTRNPGALATQPRG 724


>gi|295690503|ref|YP_003594196.1| alpha-N-acetylglucosaminidase [Caulobacter segnis ATCC 21756]
 gi|295432406|gb|ADG11578.1| Alpha-N-acetylglucosaminidase [Caulobacter segnis ATCC 21756]
          Length = 770

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 140/509 (27%), Positives = 211/509 (41%), Gaps = 73/509 (14%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF G VP A     P A+I ++  W           TY LD  DPLF +I   F+  
Sbjct: 239 ILPAFGGYVPKAFAQKNPKARIYRMRPWEGFHE------TYWLDPADPLFAKIAGRFLAL 292

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVD-------------------------------SPEY 96
             + YG T   Y  D+F+E  PP++                                 + 
Sbjct: 293 YTQTYG-TGTYYLADSFNEMLPPINADGADARDAAYGDGAANTAATKTKVEVDPALKAQR 351

Query: 97  ISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAE 155
           +++ G AIY  ++    DAVW+MQGWLF  D  FW P  + A L+ VP  KL++LD+  +
Sbjct: 352 LAAYGKAIYDSIRQARPDAVWVMQGWLFGADSHFWDPTAISAYLSLVPDDKLMILDIGND 411

Query: 156 VKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTMVGVGMS 213
             P +W  +K F G P+I+  +HN+ G+  +YG LD      P  A   E   + G GM 
Sbjct: 412 RYPAVWKNAKAFGGKPWIYGYVHNYGGSNPVYGDLDYYRRDIPAIAANPEAGKLAGFGMF 471

Query: 214 MEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT 273
            EG+  N +VYD + ++A+   +  + AW++ Y+  RYG++ P +  A   L    Y+  
Sbjct: 472 PEGLHNNSIVYDAVYDLAWGAGRESLSAWLSTYARARYGKTSPELDAALGQLVEAAYSTR 531

Query: 274 DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSE 333
             +               P       G Y  + +P +             HP        
Sbjct: 532 YWS---------------PRWWKSKAGAYLFFKRPTATIGEFPP------HPGDRAKLEA 570

Query: 334 VIRAL-ELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 392
            ++AL  L  A  NE      +  DL D TR       ++L    + AY+  D     Q 
Sbjct: 571 AVKALTALAPAYANE----PLFVLDLTDATRHLATMKIDDLLQAAVAAYRRGDVASGDQA 626

Query: 393 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 452
                 L   +D LL       L  W++ A+           Y  NA+ Q+T+W      
Sbjct: 627 RVEIAALALSIDKLLGVQPE-TLATWIDDARAYGDTPADAAAYVANAKAQVTVWGGEGN- 684

Query: 453 EASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
               L DY +K W GL R +Y PR +++ 
Sbjct: 685 ----LNDYASKAWQGLYRGFYLPRWSMFL 709


>gi|293369246|ref|ZP_06615836.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
           3f]
 gi|292635671|gb|EFF54173.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
           3f]
          Length = 521

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 93/264 (35%), Positives = 150/264 (56%), Gaps = 6/264 (2%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPAAL+ V+P+ K T++  W       R  CT+L +  D L+  I + ++ +
Sbjct: 233 VLPAFAGHVPAALKRVYPNIKTTRVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTE 289

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + YG T+HIY  D F+E  PP    + +  +   IY  + + D +AVWL   WLF  D
Sbjct: 290 QTRLYG-TNHIYGIDPFNEIDPPSWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYAD 348

Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
              W  P++K+ L SVP  +L++LD F E   IW  +  ++G PY+WC L NF GN  + 
Sbjct: 349 IKHWTTPRIKSYLRSVPQDRLILLDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLS 408

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
           G ++ ++    +A  +  + + GVG ++EGI+ N  +Y+ + + A+   + D K W  + 
Sbjct: 409 GPVNLVSERLADALKNGGSNLKGVGSTLEGIDLNQFMYEFVLDKAWNGGQTD-KEWFFKL 467

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVY 270
           + RR G+  P  + AW +L + VY
Sbjct: 468 ADRRIGKISPEARKAWEILANKVY 491


>gi|318057780|ref|ZP_07976503.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SA3_actG]
          Length = 741

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 134/518 (25%), Positives = 231/518 (44%), Gaps = 47/518 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP + G VPA   +  P AK    G W    + P W     LD    LF E+  AF E 
Sbjct: 252 VLPGYFGTVPAGFADRNPGAKTVPQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEI 305

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + YGR + +Y  D   E     + P   ++ G  +   +++   DAVW++ GW  +  
Sbjct: 306 QEELYGRGT-LYKMDLLHEGGSAGNVPVGDATRG--VQRALRAARPDAVWVILGWQKN-- 360

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
               PP  K ++ +     ++V+D  ++  P +      + G PY +  + NF G+  + 
Sbjct: 361 ----PP--KEVVAAADREAMLVVDGLSDRFPEVNDRESDWQGTPYAFGSIWNFGGHTALG 414

Query: 187 G-ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
               D +   P   R    + + G+ +  E  + NP  ++L +E+ +    VD+  W  +
Sbjct: 415 ANTRDWVDLYP-RWRDRSGSRLSGIALMPEAADNNPAAFELFAELPWTEGPVDLTDWFRE 473

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           Y+  RYG S    + AW++L  T Y       D+  +         P++ +V+ GK+   
Sbjct: 474 YARVRYGGSDAHAEAAWDILRTTAYGTRR--DDRWSEPADGLFGARPALDAVSAGKW--- 528

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
                    L+   +S++             AL+  ++   EL  S TYR DL+D+ RQA
Sbjct: 529 -----SPKALRYPAASFEP------------ALDELLSVRAELRDSATYRRDLLDVARQA 571

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
           LA  +  L   +  AY+  +     +L RR++ L++ ++ L+A  +  LLG W+ESA+  
Sbjct: 572 LANRSRTLLPRLAAAYKAKNQAEFARLGRRWIALIDLLEQLVATDENHLLGRWVESARAW 631

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
             +  ++ Q +++A + +T W      +A  LRDY N+ WSGL+   Y  R + Y   + 
Sbjct: 632 GGSAREKNQLQYDALSLLTTWGTRQGADAG-LRDYANREWSGLVGGLYRLRWSTYIDELS 690

Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNG 523
            +L+ G     K    +W  L + W         +  G
Sbjct: 691 AALKEGR----KPVAVDWFALEDRWTRNPGTLATQPRG 724


>gi|318078904|ref|ZP_07986236.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SA3_actF]
          Length = 719

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 131/494 (26%), Positives = 225/494 (45%), Gaps = 43/494 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP + G VPA   +  P AK    G W    + P W     LD    LF E+  AF E 
Sbjct: 230 VLPGYFGTVPAGFADRNPGAKTVPQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEI 283

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + YGR + +Y  D   E     + P   ++ G  +   +++   DAVW++ GW  +  
Sbjct: 284 QEELYGRGT-LYKMDLLHEGGSAGNVPVGDATRG--VQRALRAARPDAVWVILGWQKN-- 338

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEM- 185
               PP  K ++ +     ++V+D  ++  P +      + G PY +  + NF G+  + 
Sbjct: 339 ----PP--KEVVAAADREAMLVVDGLSDRFPEVNDRESDWQGTPYAFGSIWNFGGHTALG 392

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
               D +   P   R    + + G+ +  E  + NP  ++L +E+ +    VD+  W  +
Sbjct: 393 ANTRDWVDLYP-RWRDRSGSRLSGIALMPEAADNNPAAFELFAELPWTEGPVDLTDWFRE 451

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
           Y+  RYG S    + AW++L  T Y       D+  +         P++ +V+ GK+   
Sbjct: 452 YARVRYGGSDAHAEAAWDILRTTAYGTRR--DDRWSEPADGLFGARPALDAVSAGKW--- 506

Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
                    L+   +S++             AL+  ++   EL  S TYR DL+D+ RQA
Sbjct: 507 -----SPKALRYPAASFEP------------ALDELLSVRAELRDSATYRRDLLDVARQA 549

Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
           LA  +  L   +  AY+  +     +L RR++ L++ ++ L+A  +  LLG W+ESA+  
Sbjct: 550 LANRSRTLLPRLAAAYKAKNQAEFARLGRRWIALIDLLEQLVATDENHLLGRWVESARAW 609

Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
             +  ++ Q +++A + +T W      +A  LRDY N+ WSGL+   Y  R + Y   + 
Sbjct: 610 GGSAREKNQLQYDALSLLTTWGTRQGADAG-LRDYANREWSGLVGGLYRLRWSTYIDELS 668

Query: 486 ESLESGDGFRLKDW 499
            +L+ G      DW
Sbjct: 669 AALKEGRKPVAVDW 682


>gi|329851961|ref|ZP_08266642.1| alpha-N-acetylglucosaminidase NAGLU family protein [Asticcacaulis
           biprosthecum C19]
 gi|328839810|gb|EGF89383.1| alpha-N-acetylglucosaminidase NAGLU family protein [Asticcacaulis
           biprosthecum C19]
          Length = 731

 Score =  172 bits (435), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 142/542 (26%), Positives = 226/542 (41%), Gaps = 98/542 (18%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF+G VP A     P A+I ++  W           TY LD  DPLF ++   F++ 
Sbjct: 219 ILPAFAGYVPKAFAESHPQARIYRMRAWEGFHE------TYWLDPADPLFAKLAGRFLDL 272

Query: 68  QLKEYGRTSHIYNCDTFDENTPPV-DSP------------------------EYISSLGA 102
             + YG+    Y  D F+E  PPV D P                        E +++ G 
Sbjct: 273 YDQTYGK-GRFYLADAFNEMLPPVGDGPVEGGYGDSTANKEAVAEVDPAVKAERLAAYGQ 331

Query: 103 AIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IW 160
            ++  ++S   DAVW+MQGWLF  D  FW    + A L +VP   L+VLD+  +  P + 
Sbjct: 332 RLHDSIRSARPDAVWVMQGWLFGADQGFWTGDAIAAFLRNVPDDGLMVLDIGNDRYPKVR 391

Query: 161 STSKQFYGVPYIWCMLHNFAGNIEMYGILD-------SIAFGPVEARTSENTTMVGVGMS 213
            T++ F+G  +I+  +HN+  +  +YG L        +I   P   R      + G G+ 
Sbjct: 392 QTAQAFHGKGWIYGYVHNYGASNPIYGDLGFYRRDMAAITSDPARGR------LQGFGVF 445

Query: 214 MEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT 273
            EG++ N +VY  + ++A+      +  W+  Y+  RYG S P +  AW  +   VY   
Sbjct: 446 PEGLDSNSIVYAYLYDLAWNGGTKSLSDWLAGYTRARYGISSPEVVTAWLDIVKGVYGTR 505

Query: 274 DGATDKNRDVIVAF-----PDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 328
                  R    A+     PD+  +      G        +++ A ++     +D P L 
Sbjct: 506 YWTPRWWRSTAGAYLLCKRPDIAMADFEGAPGDRAALRAGLARLAAIR-----HDSPLL- 559

Query: 329 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 388
                                     RYD+I+ TR   + + + L    + AY+  D   
Sbjct: 560 --------------------------RYDVIEFTRHLASLHLDNLIRTALVAYRDGDVAA 593

Query: 389 VFQLS---RRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 445
             + +   RR    ++D+ G   CH    L  W+E A+       ++  YE NAR Q+T+
Sbjct: 594 GDRSATEVRRVTIAIDDLMGAQPCH----LAGWIEQARAYGDTATEKPYYERNARAQVTV 649

Query: 446 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESL--ESGDGFRLKDWRREW 503
           W          L DY +K W GL RD+Y PR  + F  +       +    RL  W   W
Sbjct: 650 WGGKGN-----LHDYASKAWQGLYRDFYLPRWEMLFAALRTGTYNPAATTERLIAWENAW 704

Query: 504 IK 505
           ++
Sbjct: 705 VE 706


>gi|374985456|ref|YP_004960951.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
 gi|297156108|gb|ADI05820.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
          Length = 1039

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 140/522 (26%), Positives = 219/522 (41%), Gaps = 53/522 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP + G VP       P A +   G W + K  P W     LD    LF E+  AF   
Sbjct: 251 VLPGYYGTVPDDFLAKNPGASLVAQGTWGAFKR-PDW-----LDPRTDLFAEVAAAFYRH 304

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + YG +S +Y  D   E   P D P  +     A+ + +Q   + AVW + GW    +
Sbjct: 305 QRERYGDSS-MYKMDLLHEGGNPGDVP--VGEAAKAVEAALQKAHAGAVWAILGW--QTN 359

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
           P       + +L +V    ++V+D  ++    +      + G PY +  + NF G+  + 
Sbjct: 360 P------SREILGAVDKSMMLVVDGLSDRYTTVIDRESDWDGTPYAFGSIWNFGGHTPIG 413

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
                      + R    + + G+ M  EG + NP    L +++A+    + +  W   Y
Sbjct: 414 ANAPDWVEQYPKWRDKTGSALTGIAMMPEGADNNPAAMALFTDLAWTPGAIGLDDWFASY 473

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD----VDPSIISVTEGKY 302
           +V RYG   P    AW  +  T YN +        D     PD      PS+ +      
Sbjct: 474 AVSRYGGEDPHAVAAWKAIRDTAYNMS------RADAWSEAPDGLFGARPSLGA------ 521

Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 362
                  +K A    E   YD      + +E+++     +A G  L  S+ Y YDL D+ 
Sbjct: 522 -------NKAAAWGPEADRYDTTAFDAALTELLQ-----VAPG--LRDSSAYAYDLADVA 567

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
           RQ L+  +  L   I  AY+  D     +L++ +L  ++ MD +LA     LLG WL  A
Sbjct: 568 RQVLSNRSRVLLPQIKTAYEAGDRGRFDRLTKTWLSWMKLMDKVLATSGQHLLGRWLADA 627

Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           +       ++ Q E++AR+ IT W      E   L DY N+ WSGLL   Y  R   YF 
Sbjct: 628 RSWGATRAEKDQLEYDARSIITTWGGRASSEEG-LHDYANREWSGLLGGLYHLRWKTYFD 686

Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD 524
            +  +L +G     +    +W  L + W    + YPV ++GD
Sbjct: 687 ELSTALAAG----RQPAGIDWFALEDHWARRHDSYPVRTSGD 724


>gi|29828556|ref|NP_823190.1| alpha-N-acetylglucosaminidase [Streptomyces avermitilis MA-4680]
 gi|29605660|dbj|BAC69725.1| putative alpha-N-acetylglucosaminidase [Streptomyces avermitilis
           MA-4680]
          Length = 728

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 128/518 (24%), Positives = 219/518 (42%), Gaps = 44/518 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V P + G VP    +    A     G W    + P W     LD     F  +  AF   
Sbjct: 238 VFPGYFGTVPPGFADRNAGAHTVPQGTWMGF-ARPDW-----LDPRTEHFTRVAAAFYRI 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + +G  S  Y  D   E   P D P  +      +   +++    AVW++ GW  +  
Sbjct: 292 QDEMFGGASTRYKMDLLHEGGSPGDVP--VGDAAKGVERALRAAHPGAVWVILGWQHN-- 347

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
               PP  +A++++V   +++V+D   +  P +      ++G PY +  + NF G+  + 
Sbjct: 348 ----PP--RAIVDAVDKDRMLVVDGLCDRFPKVTDREADWHGTPYAFGSIWNFGGHTTLG 401

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
                 A      RT   +T+ GV +  E  + NP  + L SE+A++   +D++AW  ++
Sbjct: 402 ANTPDWASLYERWRTRPGSTLRGVALLPEAADNNPAAFALFSELAWREGDLDLRAWFARW 461

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           +  RYG   P  + AW++L  T Y  T                      S +EG    +G
Sbjct: 462 ARSRYGGRDPHAEAAWDILRRTAYGTT-------------------RADSWSEGADGLFG 502

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
              ++ ++  ++ +S+    L Y   E   AL   +     L  S+ YR DL+D+ RQAL
Sbjct: 503 ---ARPSLAATKAASWSPKRLRYRPEEFEPALGELLKVRPGLRGSSAYRRDLLDVARQAL 559

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
           +  +  L   I  AY+  D     +L+  +L L++ ++ LLA     LLG W+  A+   
Sbjct: 560 SNRSRVLLPQIRTAYEAKDTARFDRLTGVWLALMDLLEALLATDSRHLLGRWVADARAWG 619

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
            +  +  +  ++A + +T+W      +A  LRDY N+ W+GL+   Y  R + YF  +  
Sbjct: 620 ASAAERDRLAYDALSLLTVWGTRAGADAG-LRDYANREWAGLVGGLYRLRWSTYFAELRS 678

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD 524
           +   G   +  D    W  L + W            GD
Sbjct: 679 ASREGRTPKKTD----WFALEDRWTRNPGGLATRPTGD 712


>gi|384417770|ref|YP_005627130.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzicola BLS256]
 gi|353460684|gb|AEQ94963.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzicola BLS256]
          Length = 798

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 142/540 (26%), Positives = 231/540 (42%), Gaps = 84/540 (15%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP A     P A+I ++  W           TY LD  DPLF ++ R F+E 
Sbjct: 238 VLPAFAGYVPKAFAQAHPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLEL 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVD-------SPEY-----------------------I 97
             + YG     Y  D F+E  PPV        + +Y                       +
Sbjct: 292 YTQAYG-AGEFYLADAFNEMLPPVADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARL 350

Query: 98  SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
           ++ G A+Y  +   +  A W+MQGWLF  D  FW+P  + A L  VP  +L+VLD+  + 
Sbjct: 351 AAYGQALYRSIAQVNPKATWVMQGWLFGADRAFWQPQAIAAFLGKVPDARLMVLDIGNDR 410

Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVG 211
            P  W  S+ F    +I+  +HN+  +  +YG    +AF    ++A  ++     + G G
Sbjct: 411 YPGTWKASQAFDNKQWIYGYVHNYGASNPLYG---DVAFYRQDLQALLADPGKRNLRGFG 467

Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
           +  EG+  N VVY+ +  +A++  +     W+ +Y   RYGRS  A+  AW  L   +Y 
Sbjct: 468 VFPEGLHSNSVVYEYLYALAWEGPQHPWSQWLARYLRARYGRSDAALLSAWTDLEAGIYQ 527

Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
               +               P   +   G Y  + +P +            D P      
Sbjct: 528 TRYWS---------------PRWWNTHAGAYLLFKRPTADIVNFD------DRPG---DP 563

Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVF 390
             + RA++  +   +  + +  YRYDLI+  R  L+  A+     +++AY   D A G  
Sbjct: 564 QRLRRAIDALLQQADRYADAPLYRYDLIEDARHYLSLQADRQLQTVVQAYNAGDFARGDA 623

Query: 391 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 450
           QL+R   +LV+ +D L+      L     ++A     +    + Y  NAR Q+++W  + 
Sbjct: 624 QLART-TQLVQGLDALVGGQHETLAAWTGQAAAAAGNDARLRRAYVGNARAQVSVWGGDG 682

Query: 451 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 503
                 L DY +K W G+  D+Y  R   +      + ++G  F       +L  W R+W
Sbjct: 683 N-----LADYASKAWQGMYADFYLQRWTRFLSAYRAARKAGTPFDAQTVDQQLATWERQW 737


>gi|390989490|ref|ZP_10259787.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
 gi|372555759|emb|CCF66762.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
          Length = 798

 Score =  168 bits (426), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 139/540 (25%), Positives = 222/540 (41%), Gaps = 84/540 (15%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP A     P A+I ++  W           TY LD  DPLF ++ R F+E 
Sbjct: 238 VLPAFAGYVPRAFAQAHPHARIYRMRAWEGFHE------TYWLDPRDPLFAKLARRFLEL 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPV------------------------------DSPEYI 97
             + YG     Y  D F+E  PPV                                   +
Sbjct: 292 YAQTYG-AGEFYLADAFNEMLPPVADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARL 350

Query: 98  SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
           +  G A+Y  +   +  A W+MQGWLF  D  FW+   + A L  VP  +L+VLD+  + 
Sbjct: 351 AEYGQALYRSIAQVNPKATWVMQGWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDR 410

Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
            P  W  S+ F    +I+  +HN+  +  +YG     AF   + +      +   + G G
Sbjct: 411 YPGTWKASQAFDNKQWIYGYVHNYGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFG 467

Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
           +  EG+  N V+Y+ +  +A++  +     W+  Y   RYGRS  A+  AW+ L   +Y 
Sbjct: 468 VFPEGLHSNSVIYEYLYALAWEGPQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQ 527

Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
               +               P   +   G Y  + +P +            D P      
Sbjct: 528 TRYWS---------------PRWWNKRAGAYLLFKRPTADIVDFD------DRPG---DP 563

Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVF 390
             + RA++  +   N  + +  YRYDLI+  R  L+  A+     +++AY   D A G  
Sbjct: 564 QRLRRAIDALLRQANRYADAPLYRYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDA 623

Query: 391 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 450
           QL+R   +LV  +D L+      L     ++A     +    + Y  NAR Q+++W  + 
Sbjct: 624 QLART-TQLVRGLDALVGGQHETLADWTGQAAAATGHDAGLRRAYVGNARAQVSVWGGDG 682

Query: 451 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 503
                 L DY +K W G+  D+Y  R   +      + ++G  F       +L  W R+W
Sbjct: 683 N-----LADYASKAWQGMYADFYLQRWTRFLSAYRAARKAGTPFDAVAVDHQLATWERQW 737


>gi|329934959|ref|ZP_08285000.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
 gi|329305781|gb|EGG49637.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
          Length = 1017

 Score =  167 bits (424), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 129/526 (24%), Positives = 215/526 (40%), Gaps = 50/526 (9%)

Query: 8   VLPAFSGNVPAALQNVFPS-AKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
           VLP + G+VP    +     A +   G W      P W     LD     F E+  +F  
Sbjct: 228 VLPGYYGHVPKDFADRRGGDAHVVPQGTWHGFDR-PSW-----LDPRTDAFAEVAASFYR 281

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
            Q   +G     +  D   E     D P   ++ G  +   +++    A W++ GW  + 
Sbjct: 282 HQEDVFGPAGD-FKMDLLHEGGTAGDVPVPDAARG--VEKALRAARPGATWVILGWEAN- 337

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
                   +  LL++V   +++++D  ++    +    + + G PY +  + NF G   +
Sbjct: 338 -------PLPELLDAVDKKRMLIVDGVSDRYTSVTDREEDWGGTPYAFGTIPNFGGRTTI 390

Query: 186 YGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
            G    I      A R    + + G     E  +++P  ++L SE+A+  E VD   W  
Sbjct: 391 -GARTHIWREKFFAWRDKPGSALAGTAYLPEAADRDPAAFELFSELAWTDEPVDRARWFT 449

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
            Y+  RYG      + AW  L+ T Y    +  +D +  +  A PD+             
Sbjct: 450 GYADFRYGGRDAGARRAWRALHDTAYQQHANERSDPHDSLFCARPDL------------- 496

Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 363
                        +  + Y    L Y  +    AL   +A          YRYDL+D+ R
Sbjct: 497 -----------AATRAARYAPAALTYDPARFDAALSGLLAVAAHRRGGAAYRYDLVDVAR 545

Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
           QALA  + +    +  A+   DA     L+ ++L L+   + +   H  FLLGPW+E A+
Sbjct: 546 QALAHRSRQYLPQLKAAFDREDAATFKALATQWLTLMRLSEDITGTHPAFLLGPWIEDAR 605

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
           ++A N  +  ++E  A+  +T+W D    +A  L +YGN+ W GLL D+Y PR   +   
Sbjct: 606 RMATNPRERAEFERTAKALVTVWGDRATSDAGNLHEYGNREWHGLLSDFYLPRWQKWLDA 665

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITS 529
             ++L +G      D    W      W   R  YP+   GDA  T+
Sbjct: 666 CEDALATGTAPAAVD----WFAFEEPWTRERKDYPLRPVGDAYRTA 707


>gi|291301158|ref|YP_003512436.1| alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
           44728]
 gi|290570378|gb|ADD43343.1| Alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
           44728]
          Length = 734

 Score =  167 bits (424), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 129/518 (24%), Positives = 226/518 (43%), Gaps = 48/518 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP + G +P         A+    G W+   S P W     LD T   F ++  +F   
Sbjct: 246 VLPGYFGTIPGGFAKRNQQARTVPQGVWYGF-SRPDW-----LDPTGNEFAKVAASFYRH 299

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q +  G  + +Y  D   E   P   P   ++ G A+   +Q     A W+M GW     
Sbjct: 300 QAQLLGE-ADMYKMDLMHEGGDPGGIPIPDAAKGVAL--ALQRARPGATWVMLGW----- 351

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
              R      +L  +   +++++D  ++          + G PY +  + NF G+  +  
Sbjct: 352 ---RKNPRTDILTDIDTSRVLIVDGISDRFDDLDREHTWPGTPYAFGTIPNFGGHTTIGA 408

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
                A    + RT+ ++ + G+    EG  ++P  ++L +E+A++ + +D+  W   Y+
Sbjct: 409 NAKVWAKRFGQWRTAPDSAVSGIAWMPEGAGRDPAAFELFAELAWR-DSIDLGEWFADYA 467

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIV-AFPDVDPSIISVTEGKYQNYG 306
            RRYG +    + AW+ L  + Y    G   +  D +  A P +D     VT   Y    
Sbjct: 468 DRRYGGADDNARTAWDALRRSAYAMPSGRWAEAADGLFGARPGLD-----VTHADY---- 518

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
                          +    L Y  +   +AL   +     L  ++ YR+DL+D+ RQ+L
Sbjct: 519 ---------------FSPEFLRYDAAVFAQALPALLDVDKSLH-NDAYRFDLVDVARQSL 562

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
                EL   +  A+   +     + +R +L+ +  +D LL     FLLGPWLE+A++ A
Sbjct: 563 VNAGRELLPRVKSAFVNQNKKQFDKHTRTWLDWMRLLDRLLETDRRFLLGPWLEAARRSA 622

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
           +  ++ K  E++ART +++W   +  +   L DY N+  +GL+ D Y  R   YF  + E
Sbjct: 623 RTADEAKDLEYDARTIVSVWGHRSGSDEGRLHDYANRELAGLVSDLYAMRWRRYFDSLAE 682

Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD 524
           SL+SG   +  D    W  L ++W +  + +  E  GD
Sbjct: 683 SLDSGQAPQHID----WFALEHEWASKTDDHATEPKGD 716


>gi|21241480|ref|NP_641062.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. citri str. 306]
 gi|21106823|gb|AAM35598.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. citri str. 306]
          Length = 798

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 139/540 (25%), Positives = 222/540 (41%), Gaps = 84/540 (15%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP A     P A+I ++  W           TY LD  DPLF ++ R F+E 
Sbjct: 238 VLPAFAGYVPRAFAQAHPHARIYRMRAWEGFHE------TYWLDPRDPLFAKLARRFLEL 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPV------------------------------DSPEYI 97
             + YG     Y  D F+E  PPV                                   +
Sbjct: 292 YAQTYG-AGEFYLADAFNEMLPPVADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARL 350

Query: 98  SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
           +  G A+Y  +   +  A W+MQGWLF  D  FW+   + A L  VP  +L+VLD+  + 
Sbjct: 351 AEYGQALYRSIAQVNPKATWVMQGWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDR 410

Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
            P  W  S+ F    +I+  +HN+  +  +YG     AF   + +      +   + G G
Sbjct: 411 YPGTWKASRAFDNKQWIYGYVHNYGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFG 467

Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
           +  EG+  N V+Y+ +  +A++  +     W+  Y   RYGRS  A+  AW+ L   +Y 
Sbjct: 468 VFPEGLHSNSVIYEYLYALAWEGPQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQ 527

Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
               +               P   +   G Y  + +P +            D P      
Sbjct: 528 TRYWS---------------PRWWNKRAGAYLLFKRPTADIVDFD------DRPG---DP 563

Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVF 390
             + RA++  +   N  + +  YRYDLI+  R  L+  A+     +++AY   D A G  
Sbjct: 564 QRLRRAIDALLRQANRYADAPLYRYDLIEDARHYLSLQADRQLQAVVQAYDAGDFARGDA 623

Query: 391 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 450
           QL+R   +LV  +D L+      L     ++A     +    + Y  NAR Q+++W  + 
Sbjct: 624 QLART-TQLVRGLDALVGGQHETLADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDG 682

Query: 451 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 503
                 L DY +K W G+  D+Y  R   +      + ++G  F       +L  W R+W
Sbjct: 683 N-----LADYASKAWQGMYADFYLQRWTRFLSAYRAARKAGTPFDAVAVDHQLATWERQW 737


>gi|372221472|ref|ZP_09499893.1| alpha-N-acetylglucosaminidase [Mesoflavibacter zeaxanthinifaciens
           S86]
          Length = 712

 Score =  165 bits (418), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 144/513 (28%), Positives = 217/513 (42%), Gaps = 57/513 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V+PAFSG +PAAL   FP+AKI++L  W     D     TYLLD  DPLF EIG+ FIE 
Sbjct: 237 VVPAFSGYIPAALAEKFPNAKISELNGWSGGGFD----STYLLDPKDPLFKEIGKRFIEL 292

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSL---GAAIYSGMQSGDSDAVWLMQGWLF 124
             +EYG+  + Y  D+F+E TPPV +   +  L   G  IY  +      A W+MQGWLF
Sbjct: 293 YNQEYGKAEY-YLADSFNEVTPPVSTENKLDELAAYGQVIYETLNEAAPGATWVMQGWLF 351

Query: 125 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
            +D  FW    + A L+ VP  KL++ D   +   +W     FYG  + +  +HN+ G+ 
Sbjct: 352 GHDAYFWEKDAVIAFLSKVPNDKLIIQDFGNDRYKVWEKQDAFYGKQWTYGYVHNYGGSN 411

Query: 184 EMYGILDSIAFGPVEA----RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDV 239
            +YG  D   F   E        ++T ++G G+  EG+ QN +VY+ + ++ +   K+ V
Sbjct: 412 PIYGDFD---FYKEEINYLLEHDKSTKVLGYGVMPEGLHQNSMVYEYLYDLPWD-SKIPV 467

Query: 240 KAWINQYSVRRYGRS-VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVT 298
           K W+      RYG+        AW  L   VY+                    P   +  
Sbjct: 468 KDWLKTNIKARYGKDFTKETLTAWIKLDSAVYSTKYWT---------------PRWWNDQ 512

Query: 299 EGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 358
            G Y  + +P  +    K       HP     T+  +      +   N+   +   + D 
Sbjct: 513 AGAYLLFKQPSKEITAFKG------HP-----TNLKLLEEANLLLEKNK-ENNPLIQEDF 560

Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
           I   R  L+   + L      AY  ND      L  +F  L++  + LL       L  W
Sbjct: 561 IAHKRHELSLKIDTLLQQATYAYINNDFEKGDSLQLQFHTLIDSTEQLLENSKLDRLDYW 620

Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
           ++ A       E +  Y+ NAR  I  W          L +Y ++ W    +  Y  R  
Sbjct: 621 VQEATNYGDTPETKAFYKKNARLLINQWGG-----VGNLNNYASRAWKDQYQLLYKTRWD 675

Query: 479 IYFKYMIESLESGDGF-------RLKDWRREWI 504
           IY   +  + E G           +K+W   W+
Sbjct: 676 IYLGSLRVNSELGGELNQERIEQNIKEWDELWL 708


>gi|381169859|ref|ZP_09879021.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
 gi|380689629|emb|CCG35508.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
          Length = 798

 Score =  165 bits (418), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 139/540 (25%), Positives = 222/540 (41%), Gaps = 84/540 (15%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP A     P A+I ++  W           TY LD  DPLF ++ R F+E 
Sbjct: 238 VLPAFAGYVPRAFAQAHPHARIYRMRAWEGFHE------TYWLDPRDPLFAKLARRFLEL 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPV------------------------------DSPEYI 97
             + YG     Y  D F+E  PPV                                   +
Sbjct: 292 YAQTYG-AGEFYLADAFNEMLPPVADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARL 350

Query: 98  SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
           +  G A+Y  +   +  A W+MQGWLF  D  FW+   + A L  VP  +L+VLD+  + 
Sbjct: 351 AEYGQALYRSIAQVNPKATWVMQGWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDR 410

Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
            P  W  S+ F    +I+  +HN+  +  +YG     AF   + +      +   + G G
Sbjct: 411 YPGTWKASQAFDNKQWIYGYVHNYGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFG 467

Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
           +  EG+  N V+Y+ +  +A++  +     W+  Y   RYGRS  A+  AW+ L   +Y 
Sbjct: 468 VFPEGLHSNSVIYEYLYALAWESPQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQ 527

Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
               +               P   +   G Y  + +P +            D P      
Sbjct: 528 TRYWS---------------PRWWNKRAGAYLLFKRPTADIVDFD------DRPG---DP 563

Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVF 390
             + RA++  +   N  + +  YRYDLI+  R  L+  A+     +++AY   D A G  
Sbjct: 564 QRLRRAIDALLRQANRYADAPLYRYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDA 623

Query: 391 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 450
           QL+R   +LV  +D L+      L     ++A     +    + Y  NAR Q+++W  + 
Sbjct: 624 QLART-TQLVRGLDALIGGQYETLADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDG 682

Query: 451 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 503
                 L DY +K W G+  D+Y  R   +      + ++G  F       +L  W R+W
Sbjct: 683 N-----LADYASKAWQGMYADFYLQRWTRFLSAYRAARKAGTPFDAVAVDHQLATWERQW 737


>gi|418520969|ref|ZP_13087015.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410702945|gb|EKQ61442.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
          Length = 798

 Score =  165 bits (417), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 140/540 (25%), Positives = 222/540 (41%), Gaps = 84/540 (15%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP A     P A+I ++  W           TY LD  DPLF ++ R F+E 
Sbjct: 238 VLPAFAGYVPRAFAQAHPHARIYRMRAWEGFHE------TYWLDPRDPLFAKLARRFLEL 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPV------------------------------DSPEYI 97
             + YG     Y  D F+E  PPV                                   +
Sbjct: 292 YAQTYG-AGEFYLADAFNEMLPPVADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARL 350

Query: 98  SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
           +  G A+Y  +   +  A W+MQGWLF  D  FW+   + A L  VP  +L+VLD+  + 
Sbjct: 351 AEYGQALYRSIAQVNPKATWVMQGWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDR 410

Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
            P  W  S+ F    +I+  +HN+  +  +YG     AF   + +      +   + G G
Sbjct: 411 YPGTWKASQAFDNKQWIYGYVHNYGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFG 467

Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
           +  EG+  N V+Y+ +  +A++  +     W+  Y   RYGRS  A+  AW+ L   +Y 
Sbjct: 468 VFPEGLHSNSVIYEYLYALAWEGPQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQ 527

Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
               +               P   +   G Y  + +P +  A         D P      
Sbjct: 528 TRYWS---------------PRWWNKRAGAYLLFKRPTADIADFD------DRPG---DP 563

Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVF 390
             + RA++  +   N  + +  YRYDLI+  R  L+  A+     +++AY   D A G  
Sbjct: 564 QRLRRAIDALLQQANRYADAPLYRYDLIEDARHYLSLQADRQLQAVVQAYDAGDFARGDA 623

Query: 391 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 450
           QL+R   +LV  +D L+      L     ++A     +    + Y  NAR Q+++W  + 
Sbjct: 624 QLART-TQLVRGLDALVGGQYETLADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDG 682

Query: 451 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 503
                 L DY +K W G+  D+Y  R   +      +  +G  F       +L  W R+W
Sbjct: 683 N-----LADYASKAWQGMYADFYLQRWTRFLSAYRAARMAGTPFDAVAMDHQLATWERQW 737


>gi|456388164|gb|EMF53654.1| alpha-N-acetylglucosaminidase [Streptomyces bottropensis ATCC
           25435]
          Length = 732

 Score =  165 bits (417), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 133/519 (25%), Positives = 226/519 (43%), Gaps = 47/519 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           V P + G VPA      P A+    G W    + P W     LD     F  +  AF   
Sbjct: 243 VFPGYFGTVPAGFAERVPGARTVPQGEWMGF-ARPDW-----LDPRTDDFARVAAAFYRV 296

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + +G +S +Y  D   E   P D P  ++     +   ++     A W++ GW  +  
Sbjct: 297 QEEMFGPSS-LYKMDLLHEGGDPGDVP--VADAAKGVERALRRSRPGATWVILGWQHN-- 351

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYG-VPYIWCMLHNFAGNIEMY 186
               PP  +A++++V    ++V+D  ++  P  +  +  +G  PY +  + NF G+  + 
Sbjct: 352 ----PP--RAIVDAVDKQHMLVVDGLSDRFPTVTDREADWGDTPYAFGSIWNFGGHTALG 405

Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
                 A    + RT + + + G+ +  E  + NP  + L SE+A++  ++D+K W  ++
Sbjct: 406 ANTPDWAALYEKWRTKDGSRLHGIALMPEAADNNPAAFALFSELAWREGELDLKTWFAEW 465

Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
           +  RYG   P  + AW++L  T Y  T                      S +EG    +G
Sbjct: 466 AHARYGGRDPHAEAAWDILRRTAYGTT-------------------RADSWSEGADGLFG 506

Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
              S+ A+       +    L Y  ++   AL   +    EL AS+ YR DL+D+ RQAL
Sbjct: 507 ---SRPALNAVRAGRWSPKQLRYDAADFEPALGEMLRVRPELRASSAYRRDLLDVARQAL 563

Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
           +  +  +   I  AY   DA  +   SR +L L++ +D L+A     LLG W+  A+   
Sbjct: 564 SNRSRVMLPQIKAAYDAKDATRLAAASRDWLSLMDLLDELVATDSRHLLGRWVADARSWG 623

Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASL-LRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
               +  +  ++  + +T+W   T+E A   LRDY N+ W+GL+   Y  R + YF+ + 
Sbjct: 624 AGAAERTELGYDNLSLLTVW--GTREGADAGLRDYANREWAGLVGGLYRLRWSTYFEELR 681

Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD 524
            +L +G   +  D    W  L + W         E  GD
Sbjct: 682 AALAAGRAPKKID----WFALEDRWARNPGPLATEPTGD 716


>gi|418515337|ref|ZP_13081518.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
 gi|410708056|gb|EKQ66505.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
          Length = 782

 Score =  164 bits (416), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 140/540 (25%), Positives = 222/540 (41%), Gaps = 84/540 (15%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP A     P A+I ++  W           TY LD  DPLF ++ R F+E 
Sbjct: 222 VLPAFAGYVPRAFAQAHPHARIYRMRAWEGFHE------TYWLDPRDPLFAKLARRFLEL 275

Query: 68  QLKEYGRTSHIYNCDTFDENTPPV------------------------------DSPEYI 97
             + YG     Y  D F+E  PPV                                   +
Sbjct: 276 YAQTYG-AGEFYLADAFNEMLPPVADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARL 334

Query: 98  SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
           +  G A+Y  +   +  A W+MQGWLF  D  FW+   + A L  VP  +L+VLD+  + 
Sbjct: 335 AEYGQALYRSIAQVNPKATWVMQGWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDR 394

Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
            P  W  S+ F    +I+  +HN+  +  +YG     AF   + +      +   + G G
Sbjct: 395 YPGTWKASQAFDNKQWIYGYVHNYGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFG 451

Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
           +  EG+  N V+Y+ +  +A++  +     W+  Y   RYGRS  A+  AW+ L   +Y 
Sbjct: 452 VFPEGLHSNSVIYEYLYALAWEGPQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQ 511

Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
               +               P   +   G Y  + +P +  A         D P      
Sbjct: 512 TRYWS---------------PRWWNKRAGAYLLFKRPTADIADFD------DRPG---DP 547

Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVF 390
             + RA++  +   N  + +  YRYDLI+  R  L+  A+     +++AY   D A G  
Sbjct: 548 QRLRRAIDALLQQANRYADAPLYRYDLIEDARHYLSLQADRQLQAVVQAYDAGDFARGDA 607

Query: 391 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 450
           QL+R   +LV  +D L+      L     ++A     +    + Y  NAR Q+++W  + 
Sbjct: 608 QLART-TQLVRGLDALVGGQYETLADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDG 666

Query: 451 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 503
                 L DY +K W G+  D+Y  R   +      +  +G  F       +L  W R+W
Sbjct: 667 N-----LADYASKAWQGMYADFYLQRWTRFLSAYRAARMAGTPFDAVAMDHQLATWERQW 721


>gi|294627661|ref|ZP_06706243.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
           str. ICPB 11122]
 gi|292598013|gb|EFF42168.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
           str. ICPB 11122]
          Length = 798

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 143/542 (26%), Positives = 228/542 (42%), Gaps = 88/542 (16%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP A     P A+I ++  W           TY LD  DPLF ++ R F+E 
Sbjct: 238 VLPAFAGYVPRAFAQAHPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLEL 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPV------------------------------DSPEYI 97
             + YG     Y  D F+E  PPV                                   +
Sbjct: 292 YTQTYG-AGEFYLADAFNEMLPPVADDGSDVAAAKYGDSVANSDAARAKAVPPAQRDARL 350

Query: 98  SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
           +  G A+Y  +   +  A W+MQGWLF  D  FW+   + A L  VP  +L+VLD+  + 
Sbjct: 351 AEYGQALYRSIAQVNPKATWVMQGWLFGADREFWQAQAIAAFLGKVPDARLMVLDIGNDR 410

Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVG 211
            P  W  S+ F    +I+  +HN+  +  +YG     AF    ++A  ++     + G G
Sbjct: 411 YPGTWKASQAFDNKQWIYGYVHNYGASNPLYG---DFAFYRQDLQALLADPGKRNLRGFG 467

Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
           +  EG+  N V+Y+ +  +A++  +     W+  Y   RYGRS  A+  AW  L   +Y 
Sbjct: 468 VFPEGLHSNSVIYEYLYALAWEGPQQSWSQWLTHYLRARYGRSDAALLGAWADLEAGIYQ 527

Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
               +               P   +   G Y  + +P +   ++  +    D   L    
Sbjct: 528 TRYWS---------------PRWWNKRAGAYLLFKRPTAD--IVDFDDCPGDPQRL---- 566

Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVF 390
               RA++  +   N  + +  YRYDLI+  R  L+  A+     +++AY   D A G  
Sbjct: 567 ---RRAIDALLQQANRYADAPLYRYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDA 623

Query: 391 QLSRRFLELVEDMDGLLAC-HDGFLLGPWL-ESAKQLAQNEEQEKQYEWNARTQITMWFD 448
           QL+R   +LV  +D L+   HD   L  W  ++A     +    + Y  NAR Q+++W  
Sbjct: 624 QLART-TQLVRGLDALVGGQHD--TLADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGG 680

Query: 449 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRR 501
           +       L DY +K W G+  D+Y  R   +      + ++G  F       +L  W R
Sbjct: 681 DGN-----LADYASKAWQGMYADFYLQRWTRFLSAYRAARKAGTPFDAVAVDHQLAAWER 735

Query: 502 EW 503
           +W
Sbjct: 736 QW 737


>gi|398786493|ref|ZP_10549210.1| alpha-N-acetylglucosaminidase [Streptomyces auratus AGR0001]
 gi|396993639|gb|EJJ04702.1| alpha-N-acetylglucosaminidase [Streptomyces auratus AGR0001]
          Length = 1048

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 126/528 (23%), Positives = 217/528 (41%), Gaps = 47/528 (8%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP + G VP       P A+    G W  + + P W     LD    +F +   AF   
Sbjct: 243 VLPGYFGTVPGGFAARNPGARTVPQGTWSGL-ARPDW-----LDPRTEVFAKTAAAFYRH 296

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q    G   H +  D   E   P D P  +     A+   +++    A W++ GW  +  
Sbjct: 297 QEHLLGPADH-FKMDLLHEGGDPGDVP--VPDAARAVEKALRTARPGATWVILGWQNN-- 351

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
                   + LL++V   +++++D  ++++ +    + + GVPY +  + NF G   +  
Sbjct: 352 ------PRRDLLDAVDHDRMLIVDGLSDLETVTDRERDWGGVPYAFGSIPNFGGRTTIGA 405

Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
                A      R    + + G     E  E++P  ++L SE+A++   VD  AW + Y+
Sbjct: 406 KTHVWAERFPAWRDKPGSRLAGTAYMPEAAERDPAAFELFSELAWRERPVDRAAWFDGYA 465

Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
             RYG      + A+  L  + Y  +      +  V  A PD+                 
Sbjct: 466 DLRYGARDKGARAAFAALGTSAYEISSKDGRPHDSVFAARPDLA---------------- 509

Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
                 V  + T ++D        +    A    +     L  S+ YR DL D  RQALA
Sbjct: 510 -ARSGTVYATHTPAFD-------PAAFDTAFAALLTVRPALRGSDAYRRDLTDTARQALA 561

Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
             + +L   + +AY+  D      LS  +L L+   + +   H  FLLGPWL  A+ +A 
Sbjct: 562 NRSWQLIGQLQDAYRRKDRATFRALSGLWLHLMRLSEDVTGAHRQFLLGPWLTDARAMAS 621

Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
             E+E + E +AR  +T W D    +   L +Y N+ W GL+ + + P+   Y   + ++
Sbjct: 622 GPEEEARLEHSARALLTTWADRPTADGGSLANYANRDWHGLIGEVHLPQWQAYLGELADA 681

Query: 488 LESGDGFRLKDW-RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYN 534
           L +    +  DW  RE       W + R   P+    +A  T++ +++
Sbjct: 682 LAADRPPKPFDWYARE-----EPWTHERTTPPLHPTTEAYRTARRVHD 724


>gi|329940646|ref|ZP_08289927.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
 gi|329300707|gb|EGG44604.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
          Length = 798

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 129/502 (25%), Positives = 211/502 (42%), Gaps = 50/502 (9%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP + G VP         A+    G W      P W     LD    LF  +  AF   
Sbjct: 255 VLPGYFGTVPPGFAARNRGARTVPQGTWMGFDR-PDW-----LDPRTDLFARVAAAFYRV 308

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
           Q + +G ++H Y  D   E     D P  +      +   ++    DAVW++ GW  +  
Sbjct: 309 QGELFGASTH-YKMDLLHEGGTAGDVP--VGEAAKGVERALRRARPDAVWVLLGWRHN-- 363

Query: 128 PFWRPPQMKALLNSVPLG---------KLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLH 177
               PP  +A+L++V  G         +L+V+D  ++  P +      + GVPY +  + 
Sbjct: 364 ----PP--RAILDAVASGGPDGAAGRERLLVVDGLSDRFPTVTDREADWGGVPYAFGSIW 417

Query: 178 NFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 237
           NF G+  +       A      RT E + + G+ +  E  + NP  + L SE+ +   ++
Sbjct: 418 NFGGHTTLGANTPDWARLYEAWRTKEGSALRGIALLPEAADNNPAAFALFSELPWHEGEL 477

Query: 238 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 297
           D+KAW  +++  RYG      + AW+VL  T Y  T                      S 
Sbjct: 478 DLKAWFARWARSRYGAYDAHAEAAWDVLRRTAYGTT-------------------RADSW 518

Query: 298 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 357
           +EG    +G   ++ ++     +S+    L Y   E  RAL+  +     L  S+ YR D
Sbjct: 519 SEGADGLFG---ARPSLTARRAASWSPKELRYDAHEFERALDELLKVRPGLRESSAYRRD 575

Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
           L+D+ RQ L+  +  L   I  A    D       S  +L L++ ++ L+      LLG 
Sbjct: 576 LLDVARQCLSNRSRALLPRIARACAARDVKAFDAASGDWLSLMDLLERLVGTDARHLLGR 635

Query: 418 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 477
           W   A+    +E +  + +++A + +T+W      EA  LRDY N+ W+GL+   Y  R 
Sbjct: 636 WTAQARAWGADEAERDRLQYDALSLLTVWGTRQGAEAG-LRDYANREWAGLVGGLYRLRW 694

Query: 478 AIYFKYMIESLESGDGFRLKDW 499
           + YF  +  +L  G      DW
Sbjct: 695 STYFTELRAALTEGRAPAAVDW 716


>gi|429201402|ref|ZP_19192867.1| Tat pathway signal sequence domain protein [Streptomyces ipomoeae
           91-03]
 gi|428663010|gb|EKX62401.1| Tat pathway signal sequence domain protein [Streptomyces ipomoeae
           91-03]
          Length = 1042

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 128/522 (24%), Positives = 214/522 (40%), Gaps = 50/522 (9%)

Query: 8   VLPAFSGNVPAA-LQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
           VLP + G+VP   ++     A +   G W   +  P W     LD     F ++  +F  
Sbjct: 249 VLPGYYGHVPKEFVERNGGDAHVVPQGVWHGFER-PDW-----LDPRTDSFAKVAASFYG 302

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
            Q   +G  +H +  D   E     D P  +      +   +Q     A W++ GW    
Sbjct: 303 HQEDVFGEAAH-FKMDLLHEGGTAGDVP--VPGAAQGVERALQKARPGATWVILGW--QE 357

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
           +P      +  LL+++   +++++D  ++    +    + + G PY +  + NF G   +
Sbjct: 358 NP------LPELLDAIDKSRMLIVDGVSDRYTSVTDRERDWGGTPYCFGTIPNFGGRTTI 411

Query: 186 YGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
            G    I      A R   N+ + G     E  +++P  ++L SE+A+   K+D  AW +
Sbjct: 412 -GARAHIWNEKFFAWRDKANSALAGTAFMPEATDRDPAAFELFSELAWTPTKIDRAAWFS 470

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTD-GATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
            Y+  RYG    + + AW  L+ T Y       +D +  +  A PD+             
Sbjct: 471 AYADYRYGARDDSARRAWRALHDTAYQQRAVERSDPHDSLFCARPDL------------- 517

Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 363
                           + Y    L Y       AL   +     L  S  Y+YD++D+ R
Sbjct: 518 -----------AADRAAEYAPRALTYDPGRFDAALAGLLGVAGGLRGSAAYKYDVVDVAR 566

Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
           QALA  + +    +  AYQ  D      LS  +L L+   D +   +  FLLGPW+  A+
Sbjct: 567 QALAHRSRQYLPQLRAAYQRKDLATFRALSTLWLRLMRLSDEVTGANSAFLLGPWVNDAR 626

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
            LA N+ +  ++E  A+  IT+W      +A  L +YGN+ W GL+ D+Y PR   +   
Sbjct: 627 LLATNDAERAEFERTAKVLITVWGGRATSDAGDLHEYGNREWHGLMADFYVPRWEKWLDT 686

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDA 525
           + ++L +G      D    W      W   R  Y +   GDA
Sbjct: 687 LEDALATGTAPAAVD----WFAFEEPWTRERKDYALRPVGDA 724


>gi|294667089|ref|ZP_06732314.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
           str. ICPB 10535]
 gi|292603099|gb|EFF46525.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
           str. ICPB 10535]
          Length = 798

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 145/542 (26%), Positives = 228/542 (42%), Gaps = 88/542 (16%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP A     P A+I ++  W           TY LD  DPLF ++ R F+E 
Sbjct: 238 VLPAFAGYVPRAFAQAHPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLEL 291

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVD-------SPEYISSL-------------------- 100
             + YG     Y  D F+E  PPV        + +Y  S+                    
Sbjct: 292 YTQTYG-AGEFYLADAFNEMLPPVADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARL 350

Query: 101 ---GAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
              G A+Y  +   +  A W+MQGWLF  D  FW+   + A L  VP  +L+VLD+  + 
Sbjct: 351 AEYGQALYRSIAQVNPKATWVMQGWLFGADREFWQAQAIAAFLGKVPDARLMVLDIGNDR 410

Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVG 211
            P  W  S+ F    +I+  +HN+  +  +YG     AF    ++A  ++     + G G
Sbjct: 411 YPGTWKASQAFDNKQWIYGYVHNYGASNPLYG---DFAFYRQDLQALLADPGKRNLRGFG 467

Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
           +  EG+  N V+Y  +  +A++  +     W+  Y   RYGRS  A+  AW  L   +Y 
Sbjct: 468 VFPEGLHSNSVIYAYLYALAWEGPQQSWSQWLTHYLRARYGRSDAALLGAWADLEAGIYQ 527

Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
               +               P   +   G Y  + +P +            D P      
Sbjct: 528 TRYWS---------------PRWWNKRAGAYLLFKRPTADIVDFD------DRPG---DP 563

Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVF 390
             + RA++  +   N  + +  YRYDLI+  R  L+  A+     +++AY   D A G  
Sbjct: 564 QRLRRAIDALLQQANRYADAPLYRYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDA 623

Query: 391 QLSRRFLELVEDMDGLLA-CHDGFLLGPWL-ESAKQLAQNEEQEKQYEWNARTQITMWFD 448
           QL+R   +LV  +D L+   HD   L  W  ++A     +    + Y  NAR Q+++W  
Sbjct: 624 QLART-TQLVRGLDALVGDQHD--TLADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGG 680

Query: 449 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRR 501
           +       L DY +K W G+  D+Y  R   +      + ++G  F       +L  W R
Sbjct: 681 DGN-----LADYASKAWQGMYADFYLQRWTRFLSAYRAARKAGTPFDAVTVDHQLAAWER 735

Query: 502 EW 503
           +W
Sbjct: 736 QW 737


>gi|325922205|ref|ZP_08183992.1| Alpha-N-acetylglucosaminidase (NAGLU) [Xanthomonas gardneri ATCC
           19865]
 gi|325547324|gb|EGD18391.1| Alpha-N-acetylglucosaminidase (NAGLU) [Xanthomonas gardneri ATCC
           19865]
          Length = 807

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 138/545 (25%), Positives = 225/545 (41%), Gaps = 90/545 (16%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP A     P+A+I ++  W           TY LD  DPLF ++ R F+E 
Sbjct: 239 VLPAFAGYVPKAFAQAHPNARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLEL 292

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVD-------SPEYISSL-------------------- 100
             + YG     Y  D F+E  PPV        + +Y  S+                    
Sbjct: 293 YTQTYG-AGEFYLADAFNEMLPPVADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARL 351

Query: 101 ---GAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
              G A+Y  +   +  A W+MQGWLF  D  FW+P  + A L  VP  +L+VLD+  + 
Sbjct: 352 AEYGQALYRSIAQVNPQATWVMQGWLFGADREFWQPQAIAAFLGKVPDARLMVLDIGNDR 411

Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
            P  W  S+ F    +I+  +HN+  +  +YG     AF   + +      +   + G G
Sbjct: 412 YPGTWKASQAFDNKQWIYGYVHNYGASNPLYG---DFAFYRQDLQALLADPDKRNLRGFG 468

Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
           +  EG+  N VVY+ +  +A++  +     W+ QY+  RYG S  A+  AW+        
Sbjct: 469 VFPEGLHSNSVVYEYLYALAWEGPQQSWSQWLTQYTRARYGHSDAALLQAWS-------- 520

Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY----DHPHL 327
                            D+D  I        + + K      + K  T+      D P  
Sbjct: 521 -----------------DLDAGIYQTRYWSLRWWNKRAGAYLLFKRPTADIVGFDDRPG- 562

Query: 328 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 387
                 + RA++  +   +  + +  YRYDLI+  R  L+ +A+     +++AY   D  
Sbjct: 563 --DPQRLRRAIDALLQQADRYADAPLYRYDLIEDARHYLSLHADRQLQAVVQAYGTGDFA 620

Query: 388 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 447
               L  R   LV+ +D L+      L     ++A     +    + Y  NAR Q+++W 
Sbjct: 621 RGDALLARTTRLVQGLDALVGGQHETLADWTDQAAAAAGDDAALRRVYVGNARAQVSVWG 680

Query: 448 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWR 500
            +       L DY +K W G+  ++Y  R   +      + ++G  F       +L  W 
Sbjct: 681 GDGN-----LADYASKAWQGMYAEFYLQRWTRFLSAYRAARKAGTPFDEAAFNKQLAAWE 735

Query: 501 REWIK 505
           R+W +
Sbjct: 736 RQWAE 740


>gi|409097333|ref|ZP_11217357.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Pedobacter agri
           PB92]
          Length = 724

 Score =  162 bits (411), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 134/520 (25%), Positives = 217/520 (41%), Gaps = 50/520 (9%)

Query: 8   VLPAFSGNVPAALQN---VFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAF 64
           +L  F G VP  L         A+I   GNW   +    +    +L  T+  F  +   +
Sbjct: 232 ILQGFYGMVPHDLNKKVAALKDAQIIDQGNWVFTE----FIRPAILAPTNDKFNTVADVY 287

Query: 65  IEQQLKEYGRTSHIYNCDTFDENTPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
             +  K YG     +  + F E      VD    I+++  ++   MQ    ++ W++QGW
Sbjct: 288 YSELKKLYGSDIKFFGGEPFHEGGKKGGVD----ITAVAKSVQDVMQKNFPNSTWVLQGW 343

Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
                   +     ALL  +     ++++LF E    W   K + G  +IW  + NF   
Sbjct: 344 --------QNNPADALLAGLKKENTLIIELFGENTSNWEQRKGYGGTNFIWSNVSNFGEK 395

Query: 183 IEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
             +YG L          + S     + GVG+  EGI  NPV YDLM ++A+++EK  +  
Sbjct: 396 NGLYGRLQRFLDEVYRIKQSPYKDYLKGVGIIPEGINNNPVAYDLMLDIAWRNEKPPLDK 455

Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
           WI  Y+  RYG     + DAW V   TVY+      +K + V    P         +E  
Sbjct: 456 WITDYTTYRYGSYNKDVADAWKVFTETVYSSP--VNEKGKIVYQEGP---------SESI 504

Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
           Y       ++ ++  +  SS+      Y T    +A+ LFI +  +   S TY+ D  D 
Sbjct: 505 Y------CARPSLKVNPVSSWGTRKRNYDTKLFKQAVALFIKAETQFKNSETYQTDKTDF 558

Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
            RQ +A   ++ +  +I A Q  D + + +    FL ++   D LL  +  F L  WL  
Sbjct: 559 LRQVMADKGDQAYDELINAIQAKDKNAIKEKGNHFLTMILQQDSLLNNNHFFTLNRWLNQ 618

Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
           A  L +     K   +NA+ QIT W  +   + + LRDY +K W GLL   Y  R  ++ 
Sbjct: 619 AVALGKGLPDAKNILFNAKAQITFWGPDNNPKTT-LRDYAHKEWGGLLSSLYYNRWKLFI 677

Query: 482 KYMI-ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE 520
              + + + S   F           +   W    N+YP++
Sbjct: 678 DDALNDKITSASTF---------YDMEVKWSKDSNLYPIK 708


>gi|365876979|ref|ZP_09416485.1| alpha-N-acetylglucosaminidase [Elizabethkingia anophelis Ag1]
 gi|442587289|ref|ZP_21006107.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Elizabethkingia
           anophelis R26]
 gi|365755253|gb|EHM97186.1| alpha-N-acetylglucosaminidase [Elizabethkingia anophelis Ag1]
 gi|442562959|gb|ELR80176.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Elizabethkingia
           anophelis R26]
          Length = 712

 Score =  162 bits (410), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 131/514 (25%), Positives = 220/514 (42%), Gaps = 54/514 (10%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VL  F G VP  L+N    AK+ + G W      P      +LD T  LF +I   +  +
Sbjct: 230 VLQGFYGMVPHDLKNKISEAKVIEQGKWAGEFQRPG-----ILDPTTKLFSKIADTYYTE 284

Query: 68  QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
               YG   H +  + F E   T  +D    + S    I + MQ    ++ W++QGW   
Sbjct: 285 MKNLYGEDIHYFGGEPFHEGGKTNGLDLKNVVES----IQTSMQKSYPNSTWVLQGW--Q 338

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
            +P         LL  +     ++++LF E    W   K + G  +IW  + NF     +
Sbjct: 339 QNP------SDGLLAGLKKENTLIIELFGENTANWEKRKGYGGTSFIWSNVSNFGEKNGL 392

Query: 186 YGILDSIAFGPVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
           YG L         A+ S     + G+G+  EGI  NPV YDLM ++A+  EK  +  W+ 
Sbjct: 393 YGKLQRFIDEVFRAKESIYGANLKGIGIIPEGIFNNPVAYDLMLDIAWYSEKPILDQWLT 452

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATD-KNRDVIVAFPDVDPSIISVTEGKYQ 303
           +Y+  RYG+    +  AW     T+Y+  D   +  +  +  A P ++ + +S    + +
Sbjct: 453 EYTKYRYGKENQDVIQAWKEFAQTIYSSPDVYQEGPSESIYCARPSLNVNPVSSWGTRKR 512

Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 363
           NY +   KEAV                        ++F+ +  +   S TY+ D  D  R
Sbjct: 513 NYDQSRFKEAV------------------------KVFVKADTDFKDSETYQTDKTDFLR 548

Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
           Q  A   + ++  +I+A        + +   +FLE++   + LL  +  F L   L+ A+
Sbjct: 549 QVWANKGDVVYDELIKAIHEKKTTKIQKSGHQFLEMISIQNMLLGNNRYFTLNRLLKEAE 608

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
              +     +   +NA++Q+T W  +   +   LRDY +K W+GLL   Y  R    +K 
Sbjct: 609 HFGEKLPDAQNVMFNAKSQLTYWGPDNNPKTD-LRDYAHKEWNGLLSSLYYNR----WKV 663

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY 517
            IE  +SG    +      +  +  +W  G+N+Y
Sbjct: 664 FIEQAQSG----IITAPEVFYNMEVEWSKGKNMY 693


>gi|16124795|ref|NP_419359.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus CB15]
 gi|221233511|ref|YP_002515947.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus NA1000]
 gi|13421729|gb|AAK22527.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus CB15]
 gi|220962683|gb|ACL94039.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus NA1000]
          Length = 770

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 128/503 (25%), Positives = 205/503 (40%), Gaps = 71/503 (14%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAF G VP A     P A+I ++  W           TY LD  DPLF +I   F+  
Sbjct: 239 ILPAFGGYVPKAFAEKNPKARIYRMRPWEGFHE------TYWLDPADPLFAKIAARFLAL 292

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVD-------------------------------SPEY 96
             + +G  ++ Y  D+F+E  PP++                                 + 
Sbjct: 293 YTETFGAGTY-YLADSFNEMLPPINADGADARDAAYGDGTANTAVTKTKVEVDPALKAQR 351

Query: 97  ISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAE 155
           +++ G AIY  ++    DAVW+MQGWLF  D  FW P  + A L+ VP  KL++LD+  +
Sbjct: 352 LAAYGKAIYDSIRQTRPDAVWVMQGWLFGADSHFWDPAAISAYLSLVPDDKLMILDIGND 411

Query: 156 VKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTMVGVGMS 213
             P +W  +K F G P+I+  +HN+ G+  +YG L       P  A   +   + G GM 
Sbjct: 412 RYPNVWKNAKAFGGKPWIYGYVHNYGGSNPVYGDLGFYRQDIPAIAANPDAGKLAGFGMF 471

Query: 214 MEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT 273
            EG+  N +VY+ + ++A+   +     W+ +Y+  RYG++ PA+  A   L    ++  
Sbjct: 472 PEGLHNNSIVYEAVYDLAWSEGQASPATWLTRYARARYGKTSPALDAALGQLVEAAFSTR 531

Query: 274 DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSE 333
             +               P       G Y  + +P +            D P      ++
Sbjct: 532 YWS---------------PRWWKSKAGAYLFFKRPTATVG---------DFPQHPGDRAK 567

Query: 334 VIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLS 393
           +  A++   A          +  DL D TR       ++L    + AY+  D        
Sbjct: 568 LEAAVKALTALAPTYGQEPLFVLDLTDATRHLATMKIDDLLQVAVAAYRRGDTAAGDAAR 627

Query: 394 RRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEE 453
                L   +D LL       L  W++ A+           Y  NA+ Q+T+W       
Sbjct: 628 VEIEALALSIDKLLGVQPD-TLATWIDEARAYGDTPADAAAYVANAKAQVTIWGGEGN-- 684

Query: 454 ASLLRDYGNKYWSGLLRDYYGPR 476
              L DY +K W GL + +Y PR
Sbjct: 685 ---LNDYASKAWQGLYKSFYLPR 704


>gi|29832531|ref|NP_827165.1| alpha-N-acetylglucosaminidase [Streptomyces avermitilis MA-4680]
 gi|29609651|dbj|BAC73700.1| putative alpha-N-acetylglucosaminidase, secreted [Streptomyces
           avermitilis MA-4680]
          Length = 1038

 Score =  156 bits (395), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 125/526 (23%), Positives = 215/526 (40%), Gaps = 50/526 (9%)

Query: 8   VLPAFSGNVPAA-LQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
           VLP + G+VP   ++     A +   G W   +  P W     LD     F  + ++F  
Sbjct: 249 VLPGYYGHVPKGFVERNGGDAHVVPQGIWHGFER-PDW-----LDPRTASFAAVAKSFYR 302

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
            Q   +G+ +H +  D   E     D P  +      +   +Q+    A W++ GW    
Sbjct: 303 HQKDVFGKAAH-FKMDLLHEGGTAGDVP--VPGAARGVEKALQAAHPGATWVILGW--EA 357

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
           +P      + ALL+++   K++++D  ++    +    K + G PY +  + NF G   +
Sbjct: 358 NP------LPALLDAIDKKKMLIVDGVSDRYTSVTDREKDWGGTPYAFGTIPNFGGRTTI 411

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
                         R    + + G     E  +++P  ++L SE+A+   K+D  AW + 
Sbjct: 412 GARAHLWNEKFFAWRDKAGSALAGTAYLPEAADRDPAAFELFSELAWSAGKIDRAAWFSS 471

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           Y+  RYG    + Q AW  L+ T Y       +D +  +  A PD+              
Sbjct: 472 YADFRYGGRDASAQKAWRALHDTAYQQHAVERSDAHDSLFCARPDL-------------- 517

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                       +  + Y    L Y       AL   +     L  S  Y YDL+D+ RQ
Sbjct: 518 ----------AANRAAEYAPRALTYDPGRFDAALSGLLGVAGGLRGSAAYTYDLVDVARQ 567

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVF-QLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
           ALA  + + +L ++ A         F  L+  +L L+   D +   H  FLLGPW+  A+
Sbjct: 568 ALAHRSRQ-YLPLLRAAYARKDAAAFTSLATLWLRLMGLSDEVTGTHPAFLLGPWINDAR 626

Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
            LA +  +  ++E  A+  +T+W      +A  L +Y  + W+GL+ D+Y PR   +   
Sbjct: 627 LLATDAGERAEFERTAKVLLTVWGGRATSDAGDLHEYAGREWNGLMADFYLPRWKKWLDA 686

Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITS 529
           + ++L +G      D    W  +   W   R  YP+   GD   T+
Sbjct: 687 LADALATGTPPAAVD----WFAVEEPWTRERKDYPLRPVGDPYRTA 728


>gi|383643231|ref|ZP_09955637.1| N-acetylglucosaminidase [Sphingomonas elodea ATCC 31461]
          Length = 778

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 134/557 (24%), Positives = 215/557 (38%), Gaps = 82/557 (14%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           +LPAFSG VP A     P AKI Q+  W           TY LD +DPLF  +   F++ 
Sbjct: 246 ILPAFSGYVPEAFAKAHPEAKIYQMRQWEGFPG------TYWLDPSDPLFARLAARFLQL 299

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDS-------------------------PEYI----- 97
               YG     Y  D F+E  PP+                           P+ +     
Sbjct: 300 YTATYG-PGEYYLADAFNEMVPPIAEDGSDARAATYGDAIANTAATRAAALPKEVRDARL 358

Query: 98  SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
           ++ G  +Y  + +   +A W+MQGWLF  D  FW P  + A L+ VP  ++++LD+  + 
Sbjct: 359 AAYGERLYRSITAAAPNATWVMQGWLFGADKAFWTPDAIAAFLSKVPDERMLILDIGNDR 418

Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSM 214
            P IW+ ++ FYG  + +  +HN+ G+  +YG L         A  +  +  M G G+  
Sbjct: 419 YPGIWNATRAFYGKGWAYGYVHNYGGSNPVYGDLAFYRSDITAALANPGHGRMRGFGLFP 478

Query: 215 EGIEQNPVVYDLMSEMAFQHEKVDVK-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 269
           EG+  N + Y    ++A+       K     AWI  Y+  RYG++ PA+  AW+      
Sbjct: 479 EGLHSNGIAYAYAYDLAWGEIDATGKARPLDAWIGDYTRARYGKTSPALVAAWDKAIAGA 538

Query: 270 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 329
           Y                     P       G Y  +  P    A         D+P    
Sbjct: 539 YTTR---------------YWTPRWWHEQAGGYLFFKFPSLDGA---------DYPAAPG 574

Query: 330 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 389
             + +   +E  +A   +      Y YD++DL R   +   ++     + AY+  D    
Sbjct: 575 DPAALRAGIEALLAQAPQHGGEPLYTYDVVDLVRHYASVQLDDRLKTAVAAYKAGDLAAG 634

Query: 390 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 449
            + +     L   +D  LA +    LG WL  A        ++  +   A+  +T+W   
Sbjct: 635 DRATAAAERLARHIDA-LAGNQQETLGSWLADAAAYGDTPAEKAAFVEQAKAVVTVWGGT 693

Query: 450 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRRE 502
                  L DY ++ W GL   YY PR   +      +  +   F        ++ W+  
Sbjct: 694 GH-----LSDYASRAWQGLYAGYYWPRWQRFLAAQRAAAAAHTPFDAKATSDAIRTWQAA 748

Query: 503 WIKLTNDWQNGRNVYPV 519
           W+K    W   R   P+
Sbjct: 749 WLKDGRMWPRQRPAAPL 765


>gi|328867426|gb|EGG15808.1| alpha-N-acetylglucosaminidase [Dictyostelium fasciculatum]
          Length = 992

 Score =  155 bits (392), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 105/329 (31%), Positives = 160/329 (48%), Gaps = 38/329 (11%)

Query: 207 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 266
           M G G++ E IEQN ++YDLM+EMA++    ++  WINQY+ RRYG  VP +  AWN+L 
Sbjct: 219 MKGTGLTPEAIEQNYMMYDLMNEMAWRTTAPNMTEWINQYTQRRYGVFVPELAQAWNLLI 278

Query: 267 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 326
            TV+N T G                             YG P S   +      + D   
Sbjct: 279 PTVFNATLGY----------------------------YGPPSSFVGMRPQLNMTND--- 307

Query: 327 LWYSTSEVIRALELFIASGNELSASN-TYRYDLIDLTRQALAKYANELFLNIIEAYQLND 385
           L+Y  S V +A +L++   +E   S  T+ +D+ ++T QAL+    +  + + +AY  N 
Sbjct: 308 LYYDPSVVQQAWQLYLGVTDEYVLSTATFSFDVSEITLQALSNLFMDTQMAMYDAYLTNQ 367

Query: 386 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE--QEKQYEWNARTQI 443
           +    + +   L ++ DMD + A     L+G W  +A+Q A N    +   +E+NAR QI
Sbjct: 368 STVFEERATSCLNIITDMDTIAATQQMLLVGTWTANARQWALNTSSGETAPFEFNARNQI 427

Query: 444 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 503
           T+W        S L DY    WSGLL D+Y  R A++ KYM  SL +   F   D+  + 
Sbjct: 428 TLW----GPPNSSLHDYAYHLWSGLLNDFYFARWALFIKYMDTSLSTNTTFNNTDYTNDI 483

Query: 504 IKLTNDWQNGRNVYPVESNGDALITSQWL 532
             L   W N    YP    G+A + S+++
Sbjct: 484 ESLEESWNNQNYQYPTLPTGNAYLLSKFI 512


>gi|154321596|ref|XP_001560113.1| hypothetical protein BC1G_00945 [Botryotinia fuckeliana B05.10]
          Length = 701

 Score =  155 bits (391), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 86/219 (39%), Positives = 126/219 (57%), Gaps = 6/219 (2%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP  L+ V P+A I    +W ++        T+L    DPLF  +   F+  
Sbjct: 223 VLPAFTGFVPRDLRRVAPNANIINGSDWGNLFPFEYSNDTFLY-PIDPLFKTLQHTFLSL 281

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
           Q + YG  SHIY  D F+EN P    P Y+ ++    Y  +QS DS+A W++QGWLF + 
Sbjct: 282 QSEYYGNVSHIYTLDQFNENLPASGDPLYLGNISRGTYDSLQSFDSNATWMLQGWLFYAA 341

Query: 127 DPFWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
             FW   +++A L  VP  + +++LDLF+E  P W  + Q+YG P+IWC LH + G   +
Sbjct: 342 SSFWTQDRVEAYLGGVPKNESMLILDLFSESFPEWENTHQYYGKPWIWCQLHGYGGTPGI 401

Query: 186 YGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVV 223
           YG + +I    +EA R SE   MVG+G +MEG + N ++
Sbjct: 402 YGQIYNITNSSIEAFRNSEK--MVGMGNTMEGQDGNGLI 438



 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 60/250 (24%), Positives = 111/250 (44%), Gaps = 46/250 (18%)

Query: 265 LYHTVYNCTDGATD--KNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 322
           +Y  +YN T+ + +  +N + +V   +         EG+  N G  + K  +      + 
Sbjct: 401 IYGQIYNITNSSIEAFRNSEKMVGMGNT-------MEGQDGN-GLILPKSLLEMQPNITE 452

Query: 323 DHPHLWYS-TSEVIRALELFIASGNELSAS---------NTYRYDLIDLTRQALAKYANE 372
           +H  L  S T ++    +LF A G   +AS         N +++D++D+TRQ L++    
Sbjct: 453 NHGRLGQSLTIDLFNPADLFRAWGLLYNASISVPELWYDNGWKFDMVDVTRQVLSERFKL 512

Query: 373 LFLNIIEAYQLNDAHGVFQLSRRFLELV-EDMDGLLACHDGFLLGPWLESAKQLAQNE-- 429
            ++++IE Y    A   F+ +   L ++  ++D +L+    F L  W+ +A   + N   
Sbjct: 513 EYVDLIEKYT---AEIDFEATSENLSMILRELDDILSTSPHFRLDTWINAAIASSPNSST 569

Query: 430 ---------------EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 474
                          + +  + +NA  QIT+W    Q     + DY +K W GL+R YY 
Sbjct: 570 YPIPSSDGSSELNITQTQHLFAYNAINQITIWGPTGQ-----INDYASKSWGGLVRGYYL 624

Query: 475 PRAAIYFKYM 484
            R  I+  Y+
Sbjct: 625 KRWEIFLDYI 634


>gi|84625358|ref|YP_452730.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
 gi|84369298|dbj|BAE70456.1| putative N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae
           MAFF 311018]
          Length = 590

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 132/524 (25%), Positives = 218/524 (41%), Gaps = 84/524 (16%)

Query: 24  FPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDT 83
            P A+I ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D 
Sbjct: 46  LPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQAYG-AGEFYLADA 98

Query: 84  FDENTPPV------------------------------DSPEYISSLGAAIYSGMQSGDS 113
           F+E  PPV                                   +++ G A+Y  +   + 
Sbjct: 99  FNEMLPPVADDGSDVAAAKYGDSIANFDAARAKAVPPAQRDARLAAYGQALYRSIAQVNP 158

Query: 114 DAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPY 171
            A W+MQGWLF  D  FW+P  + A L  VP  +L+VLD+  +  P  W  S+ F    +
Sbjct: 159 KATWVMQGWLFGADCAFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQW 218

Query: 172 IWCMLHNFAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVGMSMEGIEQNPVVYDLM 227
           I+  +HN+  +  +YG    +AF    ++A  ++     + G G+  EG+  N VVY+ +
Sbjct: 219 IYGYVHNYGASNPLYG---DVAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVVYEYL 275

Query: 228 SEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAF 287
             +A++  +     W+ QY   RYGRS  A+  AW  L   +Y     +           
Sbjct: 276 YALAWEGPQHPWSQWLAQYLRARYGRSDAALLSAWTDLGAGIYQTRYWS----------- 324

Query: 288 PDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE 347
               P   +   G Y  + +P +            D P        +  A++  +   + 
Sbjct: 325 ----PRWWNTHAGAYLLFKRPTADIVNFD------DRPG---DPQRLRSAIDALLQQADR 371

Query: 348 LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGL 406
            + +  YRYDLI+  R  L+  A+     +++AY   D A G  QL+R   +LV+ +D L
Sbjct: 372 YADAPLYRYDLIEDARHYLSLQADRQLQTVVQAYNAGDFARGDAQLART-TQLVQGLDAL 430

Query: 407 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 466
           +      L     ++A  +  +    + Y  NAR Q+++W  +       L DY +K W 
Sbjct: 431 VGGQHETLAAWTGQAAAAVGNDARLLRAYVGNARAQVSVWGGDGN-----LADYASKAWQ 485

Query: 467 GLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 503
           G+  D+Y  R   +      + ++G  F       +L  W R+W
Sbjct: 486 GMYADFYLQRWTRFLSAYRAARKAGTPFDAQTVDQQLATWERQW 529


>gi|293371910|ref|ZP_06618314.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
 gi|292633156|gb|EFF51733.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
          Length = 411

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 119/449 (26%), Positives = 202/449 (44%), Gaps = 56/449 (12%)

Query: 99  SLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVK 157
            + + +Y+ + + D  A W+   W+F +D   W   +MKALL  VP  K+++LD   E  
Sbjct: 3   KIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMILLDYHCENV 62

Query: 158 PIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGI 217
            +W  ++ F+  PYIWC L NF GN  + G +         A  +    + G+G ++EG+
Sbjct: 63  ELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKGIGSTLEGL 122

Query: 218 EQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGAT 277
           +     Y+ + E A+ +  VD   WI   + R  G    +++DAW  L++ +Y       
Sbjct: 123 DVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDIY------- 174

Query: 278 DKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRA 337
                  V  P         T G    Y +P   +   K  ++ Y +  L     EV R 
Sbjct: 175 -------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL----LEVWRK 214

Query: 338 LELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRR 395
           L       NE  +   + +R DLI + RQ L  Y  ++ +      +  D   +     +
Sbjct: 215 L-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQALKACGEK 267

Query: 396 FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEAS 455
             E++ D+D L A H    L  W++ A+++  + + +  YE NAR  IT W         
Sbjct: 268 MKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW-------GG 320

Query: 456 LLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE----WIKLTN 508
            L DY ++ W+GL+ DYY  R  +Y    I+ +  G   D  +L+D  +E    W+  T+
Sbjct: 321 SLNDYASRSWAGLISDYYAKRWEVYINTFIKVVGEGVEVDQKQLEDELKEIEEGWVNATD 380

Query: 509 DWQNGRNVYPVESNGDALIT-SQWLYNKY 536
                ++V+   S  D L++ S +L++KY
Sbjct: 381 RKDTRKDVH---STTDGLLSFSTFLFSKY 406


>gi|58583545|ref|YP_202561.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae KACC 10331]
 gi|58428139|gb|AAW77176.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae KACC 10331]
          Length = 753

 Score =  152 bits (383), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 131/524 (25%), Positives = 219/524 (41%), Gaps = 84/524 (16%)

Query: 24  FPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDT 83
            P A+I ++  W           TY LD  DPLF ++ R F+E   + YG     Y  D 
Sbjct: 209 LPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQAYG-AGEFYLADA 261

Query: 84  FDENTPPV------------------------------DSPEYISSLGAAIYSGMQSGDS 113
           F+E  PPV                                   +++ G A+Y  +   + 
Sbjct: 262 FNEMLPPVADDGSDVAAAKYGDSIANFDAARAKAVPPAQRDARLAAYGQALYRSIAQVNP 321

Query: 114 DAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPY 171
            A W+MQGWLF  D  FW+P  + A L  VP  +L+VLD+  +  P  W  S+ F    +
Sbjct: 322 KATWVMQGWLFGADCAFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQW 381

Query: 172 IWCMLHNFAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVGMSMEGIEQNPVVYDLM 227
           I+  +HN+  +  +YG    +AF    ++A  ++     + G G+  EG+  N VVY+ +
Sbjct: 382 IYGYVHNYGASNPLYG---DVAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVVYEYL 438

Query: 228 SEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAF 287
             +A++  +     W+ QY   RYGRS  A+  AW  L   +Y                 
Sbjct: 439 YALAWEGPQHPWSQWLAQYLRARYGRSDAALLSAWTDLGAGIYQTR-------------- 484

Query: 288 PDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE 347
               P   +   G Y  + +P +   ++  +    D          +  A++  +   + 
Sbjct: 485 -YWSPRWWNTHAGAYLLFKRPTAD--IVNFDDRPGD-------PQRLRSAIDALLQQADR 534

Query: 348 LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGL 406
            + +  YRYDLI+  R  L+  A+     +++AY   D A G  QL+ R  +LV+ +D L
Sbjct: 535 YADAPLYRYDLIEDARHYLSLQADRQLQTVVQAYNAGDFARGDAQLA-RTTQLVQGLDAL 593

Query: 407 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 466
           +      L     ++A  +  +    + Y  NAR Q+++W  +       L DY +K W 
Sbjct: 594 VGGQHETLAAWTGQAAAAVGNDARLLRAYVGNARAQVSVWGGDGN-----LADYASKAWQ 648

Query: 467 GLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 503
           G+  D+Y  R   +      + ++G  F       +L  W R+W
Sbjct: 649 GMYADFYLQRWTRFLSAYRAARKAGTPFDAQTVDQQLATWERQW 692


>gi|440695019|ref|ZP_20877582.1| Tat pathway signal sequence domain protein [Streptomyces
           turgidiscabies Car8]
 gi|440282912|gb|ELP70302.1| Tat pathway signal sequence domain protein [Streptomyces
           turgidiscabies Car8]
          Length = 1050

 Score =  149 bits (377), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 135/532 (25%), Positives = 226/532 (42%), Gaps = 52/532 (9%)

Query: 8   VLPAFSGNVPAAL--QNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
           VLP + G VP     +N   +A + Q G+W + K  P W     LD     F E+  AF 
Sbjct: 253 VLPGYFGTVPDEFVARNGGDAAVVPQ-GDWGAFKR-PDW-----LDPRTTAFGEVAAAFY 305

Query: 66  EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
           + Q + +G  S +Y  D   E   P D P  +     A+ + ++     AVW + GW  +
Sbjct: 306 QAQSERFG-DSTMYKMDLLHEGGNPGDVP--VGRAAQAVEAALRKAHPGAVWAILGWQNN 362

Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
                  P  + +L++V   ++ V+D  ++    +      + G PY +  + NF G+  
Sbjct: 363 -------PSGE-ILDAVDKSRMFVVDGLSDRYTTVTDRESDWGGTPYAFGSIWNFGGHTP 414

Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
           M            + R  E++ + G+    E  + N     L++++A+    +D+  W  
Sbjct: 415 MGANAPDWVEQYPKWRDKEDSALAGIAAMPEAADNNHAALALLTDLAWTPGTIDLDDWFA 474

Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCT--DGATDKNRDVIVAFPDVDPSIISVTEGKY 302
            Y+V RYG   P    AW ++  T Y  +  DG ++    +  A P +            
Sbjct: 475 SYAVSRYGAEDPHALAAWKIIGDTAYGMSRADGWSEAPDGLFGARPSLG----------- 523

Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 362
                  +K A    E   YD       T+    AL   +     L  ++ YRYDL D+ 
Sbjct: 524 ------ANKAAAWGPEADRYD-------TTAFDLALTELLQVAPALRGNSAYRYDLADVA 570

Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
           RQ L+  +  L   I  AY   D     +L+  +L+ +  MD +LA     LLG WL  A
Sbjct: 571 RQVLSNRSRMLLPQIRAAYDTADRVRFDELTGVWLDWMRLMDKVLATSGQHLLGRWLADA 630

Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
           +       ++ Q E++AR+ IT W      E   L DY N+ WSGL+   Y  R  +YF+
Sbjct: 631 RSWGATRGEKDQLEYDARSIITTWGGRASSEEG-LHDYANREWSGLVGGLYLTRWTLYFR 689

Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYN 534
            +  +L      +  D    W  L +DW +  + +P +++GD    ++ ++N
Sbjct: 690 ELSRALRQNRPPKTVD----WFTLEDDWAHRHDSHPTKTSGDVHKLARRVHN 737


>gi|456390168|gb|EMF55563.1| alpha-N-acetylglucosaminidase [Streptomyces bottropensis ATCC
           25435]
          Length = 1042

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 126/525 (24%), Positives = 218/525 (41%), Gaps = 48/525 (9%)

Query: 8   VLPAFSGNVPAA-LQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
           VLP + G+VP   ++     A +   G W   +  P W     LD     F  +  +F  
Sbjct: 249 VLPGYYGHVPKQFVERNGGDAHVVPQGLWHGFER-PDW-----LDPRTDSFARVAASFYG 302

Query: 67  QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
                +G  +H +  D   E     D P  +      +   +     DA+W++ GW    
Sbjct: 303 HVRDVFGAAAH-FKMDLLHEGGTAGDVP--VPDAARGVERALHKAHPDAIWVILGW--QE 357

Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
           +P      +  LL+++   +++++D  ++    +    + + G PY +  + NF G   +
Sbjct: 358 NP------LPELLDAIDRSRMLIVDGVSDRYASVTDRERDWGGTPYCFGTIPNFGGRTTI 411

Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
                         R   ++ +VG     E  +++P  ++L SE+A+   K+D  AW + 
Sbjct: 412 GARAHLWTDKFFAWRDKPDSALVGTAYMPEATDRDPAAFELFSELAWTPGKIDRAAWFSA 471

Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTD-GATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
           Y+  RYG    A + AW  L+ T Y       +D +  +  A PD+              
Sbjct: 472 YADFRYGGRDDAARAAWRALHETAYQQRAVERSDPHDSLFCARPDL-------------- 517

Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
                 + A     T +YD         +   A  L +A G   + +  YRYD++DL RQ
Sbjct: 518 ---AADRAAEYAPRTLTYDPGRF-----DAAFAGLLDVAGGRRRNPA--YRYDVVDLARQ 567

Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
           ALA  + +    +  A++  D      LS  +L L+   D +      FLLGPW+  A+ 
Sbjct: 568 ALAHRSRQYLPQLRAAHRRKDLTTFRALSTLWLRLMRLSDEVTGTDGAFLLGPWVNDARL 627

Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
           LA ++ +  ++E  A+  IT+W      +   L +YGN+ W+GL+ D+Y PR   +   +
Sbjct: 628 LATDDAERAEFERTAKVLITVWGGRATSDTGDLHEYGNREWNGLMADFYVPRWQKWLDAL 687

Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITS 529
            ++L +G      D    W      W   R  YP+   GDA  T+
Sbjct: 688 EDALATGTAPAAVD----WFAFEEPWTRERKDYPLRPVGDAYRTA 728


>gi|169351438|ref|ZP_02868376.1| hypothetical protein CLOSPI_02218 [Clostridium spiroforme DSM 1552]
 gi|169291660|gb|EDS73793.1| F5/8 type C domain protein [Clostridium spiroforme DSM 1552]
          Length = 1762

 Score =  140 bits (353), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 136/529 (25%), Positives = 226/529 (42%), Gaps = 80/529 (15%)

Query: 7    AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
             VL  ++G VP    + +   +I + GNW  V   PR     ++     L+ +  + F E
Sbjct: 852  TVLQGYAGMVPNNFTD-YQDVEILEQGNWCGV---PR---PDMIRTDGELYDQYAKLFYE 904

Query: 67   QQLKEYGRTSHIYNCDTFDENTP-PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
             Q   +G+TS+ Y  D F E    P D  + +  +   + + +   D +AVW++Q W   
Sbjct: 905  AQEWAFGKTSNYYAVDPFHEGGKRPSDLTDDV--ISREVLNSLLEYDQEAVWMVQAW--- 959

Query: 126  YDPFWRPPQMKAL--LNSVPLGKLVVLDLFA------------EVKPIWSTSKQFYGVPY 171
                W  P    L  +       +++LDL              E       S +F    +
Sbjct: 960  ----WSNPTNDLLKGMGDDREDHVIILDLNGLNDAYDSYWDKTEYNGTVLESDEFNSTSW 1015

Query: 172  IWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMA 231
            +WCML N+ GN  M G    I    +   +++   M G+G   E    NP++Y+L+ +MA
Sbjct: 1016 VWCMLENYGGNPSMDGRPKEI-INRINKASTQAEHMKGIGFISEATYDNPMIYELLLDMA 1074

Query: 232  FQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVD 291
            +Q + +D+  W+++Y +RRYG    +  +AW++L  TVY+ +   TD     ++A    D
Sbjct: 1075 WQQDTIDLDDWLDEYVLRRYGDYSESAGEAWDILLKTVYSRSGKTTD-----VIARS--D 1127

Query: 292  PSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS 351
            PS++         YG P                    Y+ SE+  ALEL     ++LSAS
Sbjct: 1128 PSLVQ--------YGLP--------------------YTASELEEALELLYKDYDKLSAS 1159

Query: 352  NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHD 411
              YRYDL ++ RQ +  YA     ++  AY   +      L  ++L  ++ ++ +     
Sbjct: 1160 EAYRYDLTEIMRQVVNNYAVVRLGDLKTAYDAKEIDNFKSLKEQYLNAIDLLNEVCGTQQ 1219

Query: 412  GFLLGPWLESAKQLAQNEEQEK-QYE---WNARTQITMWFDNTQEEASLLRDYGNKYWSG 467
              L+G W+  A   A++   +   Y+    NA+T IT+W  +T      L  Y  + + G
Sbjct: 1220 DLLIGEWVGRAVDWAKDTNSDDFAYDSMIINAKTLITVWAPSTT-----LGTYAYRNYEG 1274

Query: 468  LLRDYYGPRAAIYFKYMIESLESGDG----FRLKDWRREWIKLTNDWQN 512
            ++ D Y      Y     E LE G          D   +WI    D QN
Sbjct: 1275 MINDIYKVIWQAYLDQSEEILEFGSAKTNLVNYHDLCMDWIYADWDLQN 1323


>gi|212722968|ref|NP_001131519.1| uncharacterized protein LOC100192858 [Zea mays]
 gi|194691748|gb|ACF79958.1| unknown [Zea mays]
          Length = 114

 Score =  133 bits (334), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 61/99 (61%), Positives = 75/99 (75%)

Query: 445 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 504
           MWFDNT+ +ASLLRDY NKYWSGLL+DYYGPRAAIYFK+++ S+E+   F LK+WRREWI
Sbjct: 1   MWFDNTETKASLLRDYANKYWSGLLQDYYGPRAAIYFKHLLLSMENNAPFALKEWRREWI 60

Query: 505 KLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 543
            LTN+WQ+ R V+   + GD L  SQ LY KYL    + 
Sbjct: 61  SLTNNWQSDRKVFSTTATGDPLNISQSLYTKYLSNADLL 99


>gi|302522684|ref|ZP_07275026.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SPB78]
 gi|302431579|gb|EFL03395.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SPB78]
          Length = 355

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 89/333 (26%), Positives = 155/333 (46%), Gaps = 25/333 (7%)

Query: 168 GVPYIWCMLHNFAGNIEM-YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDL 226
           G PY +  + NF G+  +     D +   P   R    + + G+ +  E  + NP  ++L
Sbjct: 10  GTPYAFGSIWNFGGHTALGANTRDWVDLYP-RWRDRSGSRLSGIALMPEAADNNPAAFEL 68

Query: 227 MSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVA 286
            +E+ +    VD+  W  +Y+  RYG S    + AW++L  TVY       D+  +    
Sbjct: 69  FAELPWTEGPVDLTDWFREYARVRYGGSDAHAEAAWDILRTTVYGTRR--DDRWSEPADG 126

Query: 287 FPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGN 346
                P++ +V+ GK+            L+   +S++             AL+  ++   
Sbjct: 127 LFGARPALDAVSAGKWS--------PKALRYPAASFEP------------ALDELLSVRA 166

Query: 347 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGL 406
           EL  S TYR DL+D+ RQALA  +  L   +  AY+  +     +L RR++ L++ ++ L
Sbjct: 167 ELRDSATYRRDLLDVARQALANRSRTLLPRLAAAYKAKNQAEFARLGRRWIALIDLLEQL 226

Query: 407 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 466
           +A  +  LLG W+ESA+    +  ++ Q +++A + +T W      +A  LRDY N+ WS
Sbjct: 227 VATDENHLLGRWVESARAWGGSAREKSQLQYDALSLLTTWGTRQGADAG-LRDYANREWS 285

Query: 467 GLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 499
           GL+   Y  R   Y   +  +L+ G      DW
Sbjct: 286 GLVGGLYRLRWGTYIDELSAALKEGRKPVAVDW 318


>gi|339238239|ref|XP_003380674.1| GDP-L-fucose synthetase [Trichinella spiralis]
 gi|316976398|gb|EFV59699.1| GDP-L-fucose synthetase [Trichinella spiralis]
          Length = 1203

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 75/220 (34%), Positives = 113/220 (51%), Gaps = 7/220 (3%)

Query: 59   EIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWL 118
             +G   + + L+ Y    H Y+ D F+E  P      ++ ++  AIY+ M S D  +VW+
Sbjct: 801  HVGNEVVWKSLENYFGLFHAYSADPFNEMVPNTFDVMFLRNVSFAIYNVMLSVDPKSVWV 860

Query: 119  MQGWLFSYDPFWRPPQ-MKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLH 177
            +Q W+F     W   +  K  L +VP G ++V+DL+AE  P++     FY  P+IWCMLH
Sbjct: 861  LQSWMFLSSERWLENENAKHFLTAVPTGSILVVDLYAEEYPLYEKFSGFYNQPFIWCMLH 920

Query: 178  NFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF--QHE 235
            NF G   +YG L  I     +  T  N  MVG G+SMEGI+QN VVY +  +  +   ++
Sbjct: 921  NFGGVQGLYGNLARINQKLADVSTVSNINMVGTGLSMEGIDQNYVVYQMALDRFWSPNNQ 980

Query: 236  KVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTD 274
            KVD+ AW   Y     G  +  +I  AW     +   C +
Sbjct: 981  KVDLAAW---YIYIHLGVGITKSIYTAWGAFLQSSRTCQE 1017



 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 52/209 (24%), Positives = 94/209 (44%), Gaps = 10/209 (4%)

Query: 331  TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 390
            T  +  A   F+ S      +  Y  DL++LT+ AL     +L+  +  +Y         
Sbjct: 998  TKSIYTAWGAFLQSSRTCQENEIYINDLVELTKHALMLTGAKLYEKLQASYIRKCGQEFL 1057

Query: 391  QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 450
            + +    +++ D++ +   H   +L  W+E A+   +   Q  Q E N R Q+T+W    
Sbjct: 1058 ENAAAVEQVLSDLEWISKTHSRSMLSKWIEIARANGKTAAQSDQLEENLRMQVTIW--GP 1115

Query: 451  QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM---IESLESGDGFRLKDWRREWIKLT 507
            Q E   + DY  K W+ L  +YY PR  ++F ++   I  LE+ +   L       I+L 
Sbjct: 1116 QGE---IVDYARKQWAALFSEYYLPRWRLFFAHLYADILQLETFNQTLLNSRLFHEIELP 1172

Query: 508  NDWQNGRNVYPVESNGDALITSQWLYNKY 536
               Q   N+   +  G+ ++ S+ LY++Y
Sbjct: 1173 FALQKIPNI--DQPTGNTVVVSKILYSRY 1199


>gi|293402299|ref|ZP_06646437.1| putative alpha-N-acetylglucosaminidase [Erysipelotrichaceae bacterium
            5_2_54FAA]
 gi|291304406|gb|EFE45657.1| putative alpha-N-acetylglucosaminidase [Erysipelotrichaceae bacterium
            5_2_54FAA]
          Length = 2330

 Score =  128 bits (322), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 117/493 (23%), Positives = 209/493 (42%), Gaps = 60/493 (12%)

Query: 8    VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
            VL  ++G VP       P+  +T   +W  + + P    T       P + E  + F E 
Sbjct: 707  VLQGYAGMVPTNFNEFQPNVPLTAQKSWGGL-ARPSMIPT-----DSPYYDEYAKLFYEA 760

Query: 68   QLKEYGRTSHIYNCDTFDENTPPVDSPEYIS--SLGAAIYSGMQSGDSDAVWLMQGWLFS 125
            Q   YG TS  Y  D + E       PE +S  ++   + + +   D DAVW++Q W   
Sbjct: 761  QEYIYGATSDYYAVDPYHEGGT---RPEGLSDETVAREVLNSLLDYDKDAVWVVQAW--- 814

Query: 126  YDPFWRPPQMKALLNSV---PLGKLVVLDLFAEVKPIWS--TSKQFYGVPYIWCMLHNFA 180
                 +      LLN +       ++++DL       W+     +F G  + W +L  F 
Sbjct: 815  -----QSNPTDGLLNGMGEYRENHVLIVDLIKYPIKSWTKYNKSEFKGTSWAWGLLGGFG 869

Query: 181  GNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVK 240
            GN  M G + ++    ++    E T M G+G+  E    NPV+YDL+ ++A+  +   + 
Sbjct: 870  GNPTMNGEMQTM-VNDIQTAKKERTHMAGLGIISEAQYDNPVLYDLIFDLAWVDDDFSLD 928

Query: 241  AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
             W+N+Y  RRYG +    ++AW ++ +  YN             V F      +  +   
Sbjct: 929  QWLNKYIERRYGGTSDNAKEAWKIMKNANYNHG-----------VRFT---AQVYGMKGK 974

Query: 301  KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
              Q+YGK                  ++ Y   ++  A  L I   ++   S  YRYDL +
Sbjct: 975  SPQDYGK-----------------QNISYGADKLETAFRLLIEDYDKFKDSECYRYDLTE 1017

Query: 361  LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
            + RQ ++ Y+   + N+I+A +  +     +   +FL+  + ++ +       L G W+ 
Sbjct: 1018 IMRQMVSNYSTLTYNNVIDAREDKNIEKFKEEKAKFLKSFDVLNDIQETQVDQLAGEWIG 1077

Query: 421  SAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
             A+  A + +   +  +E NA+  IT W   ++  A  L+DY  + + G+  D Y     
Sbjct: 1078 KAQDRAADYDDFAKDAFEMNAKALITSWA--SRSSAGGLKDYAWRNYQGMFIDLYKQNWI 1135

Query: 479  IYFKYMIESLESG 491
             Y   +  +LE+G
Sbjct: 1136 DYLDQVEANLENG 1148


>gi|281423204|ref|ZP_06254117.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
 gi|281402540|gb|EFB33371.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
          Length = 291

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 70/240 (29%), Positives = 112/240 (46%), Gaps = 22/240 (9%)

Query: 250 RYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV 309
           RYG++ P I+ AW +L  T+YNC  G   +             SI           G+P 
Sbjct: 4   RYGKTSPEIERAWQLLSETIYNCPAGNNQQG---------PHESIFC---------GRP- 44

Query: 310 SKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 369
              ++   +  S+     +Y     + A +L     ++   +N + YDL+D+ RQALA  
Sbjct: 45  ---SLNNFQVKSWSKMRNYYDLQATLEAAQLMTGIADQYKGNNNFEYDLVDICRQALADQ 101

Query: 370 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 429
               +L  I  Y         + + RFLE++   D LL     F LG W E+A++L   +
Sbjct: 102 GRLQYLKTIADYNGFSRKAFAKDAHRFLEMILLQDKLLGTRTEFRLGHWTEAARKLGTTQ 161

Query: 430 EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 489
           +++  YEWNAR QIT W +    +   L DY +K W G+L+D+Y  R  I+   + + +E
Sbjct: 162 QEKDLYEWNARVQITTWGNRMCADKGGLHDYAHKEWQGILKDFYYKRWKIFMDALAKQME 221


>gi|260821254|ref|XP_002605948.1| hypothetical protein BRAFLDRAFT_132235 [Branchiostoma floridae]
 gi|229291285|gb|EEN61958.1| hypothetical protein BRAFLDRAFT_132235 [Branchiostoma floridae]
          Length = 673

 Score =  112 bits (280), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 66/201 (32%), Positives = 93/201 (46%), Gaps = 51/201 (25%)

Query: 91  VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 150
           +  P Y+S  GAA+Y+GM +GD  A+WLMQGWLF    FW+P Q KALL SVP G     
Sbjct: 442 LSEPNYLSKAGAAVYAGMLAGDPQAIWLMQGWLFQARDFWQPAQTKALLQSVPEG----- 496

Query: 151 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 210
                                                        P  AR    +TMVG 
Sbjct: 497 ---------------------------------------------PFLARKYLGSTMVGT 511

Query: 211 GMSMEGIEQNPVVYDLMSEMAFQHEKVDV-KAWINQYSVRRYGRSVPAIQDAWNVLYHTV 269
           G++ EGI+QN ++Y+LM+E+A+  +   +   W + Y+  RYG         W +L  +V
Sbjct: 512 GLTPEGIDQNYIMYELMNEVAWMPQPFQILDNWASDYAWSRYGVKNSNASLGWQILLKSV 571

Query: 270 YNCTDGATDKNRDVIVAFPDV 290
           Y+C +G  D    V+V  PD+
Sbjct: 572 YDCENGFKDHCDSVVVHRPDL 592


>gi|358381741|gb|EHK19415.1| hypothetical protein TRIVIDRAFT_224650 [Trichoderma virens Gv29-8]
          Length = 217

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 58/214 (27%), Positives = 110/214 (51%), Gaps = 11/214 (5%)

Query: 88  TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPL-G 145
           TPP     Y+ +  +  +  ++S D +A+W+ Q WLF+ +  FW   +++     + +  
Sbjct: 2   TPPSGELNYLRNASSNTWKALKSADPEAIWVFQAWLFAQNTTFWTNDRIEVYPGGITIDS 61

Query: 146 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 205
            +++LD++ E    W  ++ +Y  P+IWC L N+   I MYG + ++   P+ A   E+ 
Sbjct: 62  DMLILDIWLESMSQWQCAQSYYSKPWIWCELQNYGATINMYGQIQNLTKSPILA-LQESQ 120

Query: 206 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY--GRSVPAIQDAWN 263
           ++VG+G+SME  + N +V+DL+   A+    +D   +   ++  RY   +   +I  AW 
Sbjct: 121 SLVGLGLSMEAQQSNEIVFDLLLSQAWNCTPIDTNIYFKSWAAARYLSSKRPASIYTAWE 180

Query: 264 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 297
            +  TVY+ T      N +++ + P    S I V
Sbjct: 181 AVRATVYDNT------NLNMMSSVPKSRSSEIKV 208


>gi|47212645|emb|CAF95026.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 121

 Score =  106 bits (264), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 47/103 (45%), Positives = 72/103 (69%), Gaps = 3/103 (2%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAFSGNVP  +  ++P A++T+LG W   K +  + C+Y+LD  DPLF+ IG  ++ Q
Sbjct: 22  VLPAFSGNVPKGILRLYPEARVTRLGPW--SKFNCSFSCSYILDPRDPLFLRIGSLYLAQ 79

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQS 110
            +K++G T+HIYN DTF+E TPP   P Y+S++  A+++ M +
Sbjct: 80  VVKQFG-TNHIYNTDTFNEMTPPSSEPNYLSAVSRAVFAAMTA 121


>gi|347541919|ref|YP_004856555.1| alpha-N-acetylglucosaminidase family protein [Candidatus
           Arthromitus sp. SFB-rat-Yit]
 gi|346984954|dbj|BAK80629.1| alpha-N-acetylglucosaminidase family protein [Candidatus
           Arthromitus sp. SFB-rat-Yit]
          Length = 912

 Score = 99.8 bits (247), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 98/407 (24%), Positives = 184/407 (45%), Gaps = 56/407 (13%)

Query: 108 MQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFY 167
           + +   +++W++Q W  S       P  + + N +     ++LDL +++   W    +F 
Sbjct: 490 IDNNGENSIWIIQSWSHS-------PSSETIEN-LNRNNTLILDLHSQLNTRWKGISKFN 541

Query: 168 GVPY----------IWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGI 217
            + +          I+ +L+NF G   +YG    +     +A+ + N  + GV  + EGI
Sbjct: 542 NMSWKDREFDRSNWIFGVLNNFGGRSGLYGHTRHLLNQFYDAKYNSNY-LKGVAHTSEGI 600

Query: 218 EQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT---- 273
             N  + +L++E+ F  +K+D+  ++++Y   RYG+S   +  A+N+L  TVYN      
Sbjct: 601 GFNNFIDELVTEIIFS-DKLDIDEFVSRYLRNRYGKSDNDLLKAFNILLDTVYNPVINIY 659

Query: 274 -DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTS 332
            +GA++    VI A P +D                       +KS  S +   H  Y++ 
Sbjct: 660 HEGASES---VINARPSLD-----------------------VKS-ASKWGSIHKNYNSE 692

Query: 333 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 392
           ++  AL ++ +  NE   S  Y  DLID+  + +   +NE + N+ + Y  N     F+L
Sbjct: 693 KLEEALRIYFSKYNEFKDSKGYMTDLIDIASEVIINLSNEYYKNLQDYYN-NGEIEFFKL 751

Query: 393 -SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 451
            S+RFL ++     +L  ++   L   ++    L  ++  E     N +T +T W+D   
Sbjct: 752 NSQRFLNMILLQANILYYNERKSLQKLIDKLDDLNYDDYFEDTLIINKKTILTTWYDKQV 811

Query: 452 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG--DGFRL 496
            E   LRDY N  +  ++   Y  R   +F  + E+  +G  D +R 
Sbjct: 812 SEDDGLRDYANTDFYDIVGTLYYNRWKRFFDNIQENAVNGFYDDYRF 858


>gi|293369245|ref|ZP_06615835.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
 gi|292635670|gb|EFF54172.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
          Length = 221

 Score = 97.4 bits (241), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 57/214 (26%), Positives = 102/214 (47%), Gaps = 7/214 (3%)

Query: 325 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 384
           P + Y   +++ A  L ++  +     ++Y +DL+++ RQ L  Y N +      AY+  
Sbjct: 11  PTIEYQPKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAG 68

Query: 385 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 444
           D   +     +  E++ D+D L++CH  F L  W+  A+ +  +   +  YE NAR+ IT
Sbjct: 69  DIPMMKNRGNKMREILADLDKLVSCHPTFSLHKWITDARDMGHDAASKNYYEMNARSLIT 128

Query: 445 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 504
           +W D+       L DY N+ W+GL   YY  R   +   +IE+ E    F  +++  +  
Sbjct: 129 IWGDSYH-----LTDYANRSWAGLTNQYYSVRWDHFINEVIEAAEKKKNFDEEEFFNQSR 183

Query: 505 KLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
              N+W N  N       GD +  ++ +Y KY +
Sbjct: 184 MYENEWVNPSNRISYNEGGDGIKLARQIYKKYAK 217


>gi|380804373|gb|AFE74062.1| alpha-N-acetylglucosaminidase precursor, partial [Macaca mulatta]
          Length = 265

 Score = 95.5 bits (236), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 44/108 (40%), Positives = 68/108 (62%), Gaps = 3/108 (2%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VP A+  VFP   +T++G+W        + C++LL   DP+F  IG  F+ +
Sbjct: 161 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRE 218

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDA 115
            +KE+G T HIY  DTF+E  PP  +P Y+++   A+Y  M + D++A
Sbjct: 219 LVKEFG-TDHIYGADTFNEMQPPSSAPSYLAAATTAVYEAMIAVDTEA 265


>gi|342731751|ref|YP_004770590.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
           SFB-mouse-Japan]
 gi|342329206|dbj|BAK55848.1| alpha-N-acetylglucosaminidase family protein [Candidatus
           Arthromitus sp. SFB-mouse-Japan]
          Length = 898

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 100/440 (22%), Positives = 196/440 (44%), Gaps = 67/440 (15%)

Query: 60  IGRAFIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGD-SDAV 116
           I   + E+Q +  G++ + +  D F E  N    D+ E    L   + S +++    D+V
Sbjct: 432 ISSVYYEKQRELLGKSKY-FAGDLFHEGANLYGYDAGE----LSNRVLSLLKNNTGEDSV 486

Query: 117 WLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS----------TSKQF 166
           W++Q W          P  +++ N +    +++LDL +++   W            +K+F
Sbjct: 487 WIIQSWA-------HNPSSESIEN-LNKDNILILDLHSQLNTRWKGISKFNYMSWDNKEF 538

Query: 167 YGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDL 226
               +I+ +L+NF G   +YG  + +     +A+ + +  + G+  + EG+  N  + +L
Sbjct: 539 DNSNWIFGILNNFGGRNGLYGHSNHLLRQFYDAKYNSDY-LSGIANTSEGVGFNNFIDEL 597

Query: 227 MSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN-CTD----GATDKNR 281
            +E+ F  E V++  ++ +Y   RYG+S   +  A+N+L  TVYN  TD    GA++   
Sbjct: 598 STELIFSDE-VNMDEFVKRYLKNRYGKSDRDLLVAFNILLDTVYNPVTDIYHEGASES-- 654

Query: 282 DVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELF 341
            VI A P ++ +                          S +   H  Y + ++ R +E++
Sbjct: 655 -VINARPSLEIN------------------------SASKWGTIHKNYDSRKLERVIEIY 689

Query: 342 IASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVE 401
           I+  +E   +  Y  DLID+  + +   A+E +  I E Y   +   +  +S++FL L+ 
Sbjct: 690 ISKYDEFKDNEGYIIDLIDIASEVIINLASEYYQIIQEYYNNGNIKYLQLISKKFLNLIL 749

Query: 402 DMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYG 461
               +L+ +D   L   +     L  ++  +   ++N +  +T W+D    E   LRDY 
Sbjct: 750 LQANILSYNDKKSLQKIINKLDALDYDDYFKDTLKYNKKMILTTWYDKLVSEDGGLRDYA 809

Query: 462 NKYWSGLLRDYYGPRAAIYF 481
           N        D+Y     +Y+
Sbjct: 810 NT-------DFYDIVGTLYY 822


>gi|402824586|ref|ZP_10873940.1| N-acetylglucosaminidase, partial [Sphingomonas sp. LH128]
 gi|402261896|gb|EJU11905.1| N-acetylglucosaminidase, partial [Sphingomonas sp. LH128]
          Length = 486

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 62/215 (28%), Positives = 100/215 (46%), Gaps = 39/215 (18%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G VP A     P A+I ++  W           TY LD +DPLF ++   F+  
Sbjct: 260 VLPAFAGYVPEAFAKAHPKARIYKMRAWEGFPP------TYWLDPSDPLFTQLAARFVTL 313

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVD-------SPEY-----------------------I 97
             + YG   + Y  D F+E  PP+        + EY                       +
Sbjct: 314 YNRTYGEGEY-YLADAFNEMIPPIAEDGSDAAAAEYGDSIANTAATRAAALPPAVRDARL 372

Query: 98  SSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQ-MKALLNSVPLGKLVVLDLFAEV 156
           ++ G  +Y  + +    A W+MQGWLF  D  +R P+ + A L+ VP  ++++LD+  + 
Sbjct: 373 AAYGERLYGSITAAAPKATWVMQGWLFGADKAFRTPEAIAAFLSRVPDDRMLILDIGNDR 432

Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 190
            P IW  +  F G  + +  +HN+ G+  +YG L+
Sbjct: 433 YPGIWQKTDAFDGKAWTYGYVHNYGGSNPVYGDLE 467


>gi|417965571|ref|ZP_12607078.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
           SFB-4]
 gi|380336329|gb|EIA26351.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
           SFB-4]
          Length = 685

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 102/463 (22%), Positives = 203/463 (43%), Gaps = 65/463 (14%)

Query: 33  GNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NTPP 90
           G W  +K   R      LD  +     I   + E+Q +  G++ + +  D F E  N   
Sbjct: 205 GYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANLYG 257

Query: 91  VDSPEYISSLGAAIYSGMQSGDS-DAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 149
            D+ E    L   + S +++    D+VW++Q W          P  +++ N +    +++
Sbjct: 258 YDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNILI 305

Query: 150 LDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA 199
           LDL +++   W            +K+F    +I+ +L+NF G   +YG  + +     +A
Sbjct: 306 LDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFYDA 365

Query: 200 RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQ 259
           + + +  + G+  + EG+  N  + +L +E+ F  E V++  ++ +Y   RYG+S   + 
Sbjct: 366 KYNSDY-LSGIANTSEGVGFNNFIDELSTELIFSDE-VNMDEFVKRYLKNRYGKSDRDLL 423

Query: 260 DAWNVLYHTVYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 318
            A+N+L  TVYN  TD          +       S+I+             ++ ++  + 
Sbjct: 424 VAFNILLDTVYNPVTD----------IYHEGASESVIN-------------ARPSLGINS 460

Query: 319 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 378
            S +   H  Y + ++ R +E++I+  +E   +  Y  DLID+  + +   A+E +  I 
Sbjct: 461 ASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASEYYQIIQ 520

Query: 379 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWN 438
           E Y   +   +  +S++FL L+     +L+ +D   L   +     L  ++  +   ++N
Sbjct: 521 EYYNNGNIKYLQLISKKFLNLILLQANILSYNDKKSLQKIINKLDALDYDDYFKDTLKYN 580

Query: 439 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
            +  +T W+D    E   LRDY N        D+Y     +Y+
Sbjct: 581 KKMILTTWYDKLVSEDGGLRDYANT-------DFYDIVGTLYY 616


>gi|417967717|ref|ZP_12608785.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
           SFB-co]
 gi|380340884|gb|EIA29424.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
           SFB-co]
          Length = 741

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 96/436 (22%), Positives = 195/436 (44%), Gaps = 59/436 (13%)

Query: 60  IGRAFIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDS-DAV 116
           I   + E+Q +  G++ + +  D F E  N    D+ E    L   + S +++    D+V
Sbjct: 275 ISSVYYEKQRELLGKSKY-FAGDLFHEGANLYGYDAGE----LSNRVLSLLKNNTGEDSV 329

Query: 117 WLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS----------TSKQF 166
           W++Q W          P  +++ N +    +++LDL +++   W            +K+F
Sbjct: 330 WIIQSWA-------HNPSSESIEN-LNKDNILILDLHSQLNTRWKGISKFNYMSWDNKEF 381

Query: 167 YGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDL 226
               +I+ +L+NF G   +YG  + +     +A+ + +  + G+  + EG+  N  + +L
Sbjct: 382 DNSNWIFGILNNFGGRNGLYGHSNHLLRQFYDAKYNSDY-LSGIANTSEGVGFNNFIDEL 440

Query: 227 MSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN-CTDGATDKNRDVIV 285
            +E+ F  E V++  ++ +Y   RYG+S   +  A+N+L  TVYN  TD          +
Sbjct: 441 STELIFSDE-VNMDEFVKRYLKNRYGKSDRDLLVAFNILLDTVYNPVTD----------I 489

Query: 286 AFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASG 345
                  S+I+             ++ ++  +  S +   H  Y + ++ R +E++I+  
Sbjct: 490 YHEGASESVIN-------------ARPSLGINSASKWGTIHKNYDSRKLERVIEIYISKY 536

Query: 346 NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDG 405
           +E   +  Y  DLID+  + +   A+E +  I E Y   +   +  +S++FL L+     
Sbjct: 537 DEFKDNEGYIIDLIDIASEVIINLASEYYQIIQEYYNNGNIKYLQLISKKFLNLILLQAN 596

Query: 406 LLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYW 465
           +L+ +D   L   +     L  ++  +   ++N +  +T W+D    E   LRDY N   
Sbjct: 597 ILSYNDKKSLQKIINKLDALDYDDYFKDTLKYNKKMILTTWYDKLVSEDGGLRDYANT-- 654

Query: 466 SGLLRDYYGPRAAIYF 481
                D+Y     +Y+
Sbjct: 655 -----DFYDIVGTLYY 665


>gi|384455191|ref|YP_005667784.1| alpha-N-acetylglucosaminidase family protein [Candidatus
           Arthromitus sp. SFB-mouse-Yit]
 gi|418016862|ref|ZP_12656425.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
           SFB-mouse-NYU]
 gi|418371995|ref|ZP_12964091.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
           SFB-mouse-SU]
 gi|345505596|gb|EGX27892.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
           SFB-mouse-NYU]
 gi|346983532|dbj|BAK79208.1| alpha-N-acetylglucosaminidase family protein [Candidatus
           Arthromitus sp. SFB-mouse-Yit]
 gi|380342872|gb|EIA31299.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
           SFB-mouse-SU]
          Length = 898

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 100/440 (22%), Positives = 195/440 (44%), Gaps = 67/440 (15%)

Query: 60  IGRAFIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGD-SDAV 116
           I   + E+Q +  G++ + +  D F E  N    D+ E    L   + S +++    D+V
Sbjct: 432 ISSVYYEKQRELLGKSKY-FAGDLFHEGANLYGYDAGE----LSNRVLSLLKNNTGEDSV 486

Query: 117 WLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS----------TSKQF 166
           W++Q W          P  +++ N +    +++LDL +++   W            +K+F
Sbjct: 487 WIIQSWA-------HNPSSESIEN-LNKDNILILDLHSQLNTRWKGISKFNYMSWDNKEF 538

Query: 167 YGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDL 226
               +I+ +L+NF G   +YG  + +     +A+ + +  + G+  + EG+  N  + +L
Sbjct: 539 DNSNWIFGILNNFGGRNGLYGHSNHLLRQFYDAKYNSDY-LSGIANTSEGVGFNNFIDEL 597

Query: 227 MSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN-CTD----GATDKNR 281
            +E+ F  E V++  ++ +Y   RYG+S   +  A+N+L  TVYN  TD    GA++   
Sbjct: 598 STELIFSDE-VNMDEFVKRYLKNRYGKSDRDLLVAFNILLDTVYNPVTDIYHEGASES-- 654

Query: 282 DVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELF 341
            VI A P +  +                          S +   H  Y + ++ R +E++
Sbjct: 655 -VINARPSLGIN------------------------SASKWGTIHKNYDSRKLERVIEIY 689

Query: 342 IASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVE 401
           I+  +E   +  Y  DLID+  + +   A+E +  I E Y   +   +  +S++FL L+ 
Sbjct: 690 ISKYDEFKDNEGYIIDLIDIASEVIINLASEYYQIIQEYYNNGNIKYLQLISKKFLNLIL 749

Query: 402 DMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYG 461
               +L+ +D   L   +     L  ++  +   ++N +  +T W+D    E   LRDY 
Sbjct: 750 LQANILSYNDKKSLQKIINKLDALDYDDYFKDTLKYNKKMILTTWYDKLVSEDGGLRDYA 809

Query: 462 NKYWSGLLRDYYGPRAAIYF 481
           N        D+Y     +Y+
Sbjct: 810 NT-------DFYDIVGTLYY 822


>gi|321458423|gb|EFX69492.1| hypothetical protein DAPPUDRAFT_35389 [Daphnia pulex]
          Length = 132

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 45/137 (32%), Positives = 78/137 (56%), Gaps = 5/137 (3%)

Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
           ++DLTRQ++ +  + L+  ++E Y   ++  +  ++ + + L++D+D L+     FLLG 
Sbjct: 1   MVDLTRQSMQEIFHLLYSKLLEVYLEKNSTAIEGIAYKMINLLQDLDELIQTGKTFLLGK 60

Query: 418 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 477
           W+  AK     E ++ QYEWNAR QIT+W    +     +RDY  K W+G++ DYY P  
Sbjct: 61  WIADAKSWGTTEGEKLQYEWNARNQITLWGPRGE-----IRDYAAKKWAGVVADYYKPHW 115

Query: 478 AIYFKYMIESLESGDGF 494
            ++ + M  SL+    F
Sbjct: 116 EVFIREMQMSLDENRAF 132


>gi|281423203|ref|ZP_06254116.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
 gi|281402539|gb|EFB33370.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
          Length = 450

 Score = 89.7 bits (221), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 68/230 (29%), Positives = 107/230 (46%), Gaps = 26/230 (11%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLP F G +P   +       +T  G W              L  TD    +I   +  +
Sbjct: 223 VLPGFCGMMPHDAKAKL-GLNVTDGGIWNGYTRPAN------LSPTDAHSDKIADLYYAE 275

Query: 68  QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
               YG+ ++ Y+ D F E     D     S  G  +   M+  + +A W++QGW  + +
Sbjct: 276 LTNLYGKANY-YSMDPFHETND--DEAIDYSKAGRKVMEAMKRVNPNATWVIQGW--TEN 330

Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAG 181
           P  RP  +K + N    G L+VLDLF+E +P      IW   K +    +++CML NF  
Sbjct: 331 P--RPQMIKNMKN----GDLLVLDLFSECRPMFGIPSIWKREKGYEQHDWLFCMLENFGA 384

Query: 182 NIEMYGILDSIAFGPVEARTSENTT--MVGVGMSMEGIEQNPVVYDLMSE 229
           N+ ++G +D +       + S   T  + G+G +MEG E NPV+++LMSE
Sbjct: 385 NVGLHGRMDLLLHNFYSTKQSSPNTQHLKGIGFTMEGSENNPVMFELMSE 434


>gi|322792283|gb|EFZ16267.1| hypothetical protein SINV_02225 [Solenopsis invicta]
          Length = 87

 Score = 89.4 bits (220), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 42/92 (45%), Positives = 61/92 (66%), Gaps = 5/92 (5%)

Query: 396 FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEAS 455
            LEL +D++ +LA    FLLG WL  AK++A NEE+ + YE+NAR QIT+W  N +    
Sbjct: 1   LLELFDDLESILASGSNFLLGTWLTQAKEMADNEEERRSYEYNARNQITLWGPNGE---- 56

Query: 456 LLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
            +RDY NK WSG++ DY+ PR  ++ K + +S
Sbjct: 57  -IRDYANKQWSGVVADYFKPRWELFLKALEKS 87


>gi|326435733|gb|EGD81303.1| alpha-N-acetylglucosaminidase [Salpingoeca sp. ATCC 50818]
          Length = 696

 Score = 88.6 bits (218), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 45/126 (35%), Positives = 71/126 (56%), Gaps = 2/126 (1%)

Query: 104 IYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST 162
           +Y+ M   D  A+W+ QGW+    D       M    ++VP G+LV+LD+ AE   IW+ 
Sbjct: 500 VYTTMTKRDPHAIWVYQGWIWLDLDNAQGFSFMSGFTSAVPRGRLVILDMEAEFDEIWAW 559

Query: 163 SKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPV 222
           S+ F+   +IW  + NF GN  MYG +  + F       +++  +VGVG++MEGI+QNP 
Sbjct: 560 SQSFFNTTFIWAAMDNFGGNNGMYGDIQ-LVFDRTRRVFAQSDAVVGVGITMEGIDQNPA 618

Query: 223 VYDLMS 228
            Y  ++
Sbjct: 619 YYQAIA 624



 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 43/85 (50%), Gaps = 18/85 (21%)

Query: 7   AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
           ++LPAF GNVPAAL  ++P A I+    W              LD  DPLF  I    ++
Sbjct: 277 SLLPAFQGNVPAALAQLYPHANISN--GW--------------LDGLDPLFATIADLTMQ 320

Query: 67  QQLKEYGRTSHIYNCDT-FDENTPP 90
           + + ++G T H Y  D  FD +T P
Sbjct: 321 ELIADFGAT-HFYQADGFFDHSTGP 344


>gi|315131339|emb|CBM69278.1| venom protein Ci-120 [Chelonus inanitus]
          Length = 165

 Score = 88.6 bits (218), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 79/131 (60%), Gaps = 7/131 (5%)

Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPW 418
           D+TRQ+L   A  ++L + +++   D   VF+     L +L  D++ +L+ +  FL+G W
Sbjct: 1   DVTRQSLQLIAEHVYLKLQQSFHQKDL-AVFKAHANLLMQLFSDLESILSTNKHFLVGKW 59

Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
           +++A+ L  N +++K YE NAR QIT+W  N +     +RDY NK W+G++  Y+G R +
Sbjct: 60  IKNARSLGTNVQEQKLYELNARNQITLWGPNGE-----IRDYANKQWAGVMSQYFGARWS 114

Query: 479 IYFKYMIESLE 489
           +Y   +  +LE
Sbjct: 115 LYLSVLEFALE 125


>gi|224135741|ref|XP_002322149.1| predicted protein [Populus trichocarpa]
 gi|222869145|gb|EEF06276.1| predicted protein [Populus trichocarpa]
          Length = 173

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 26/34 (76%), Positives = 29/34 (85%)

Query: 208 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
           VGVGM M+GI+QNPVV DLMS+MAF H KVDVK 
Sbjct: 30  VGVGMPMDGIKQNPVVSDLMSKMAFHHNKVDVKG 63



 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 21/60 (35%), Positives = 33/60 (55%), Gaps = 2/60 (3%)

Query: 480 YFKYMIESLESGD--GFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
           ++ + I++ E G    F +K   +   K +      R + PVESNG+AL  S+WL+ KYL
Sbjct: 97  HYTFQIQNTEFGKWPRFPVKGLEKRMDKASKQLAESRKIIPVESNGNALNISRWLFYKYL 156


>gi|147798252|emb|CAN69797.1| hypothetical protein VITISV_036335 [Vitis vinifera]
          Length = 273

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/47 (55%), Positives = 31/47 (65%), Gaps = 3/47 (6%)

Query: 207 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVD---VKAWINQYSVRR 250
           MVGVG+ MEGIEQNPVVY+ M EMAF  E V    + +  N  + RR
Sbjct: 112 MVGVGVCMEGIEQNPVVYESMFEMAFHSENVQLVVISSTCNTMARRR 158


>gi|297723521|ref|NP_001174124.1| Os04g0650900 [Oryza sativa Japonica Group]
 gi|255675839|dbj|BAH92852.1| Os04g0650900, partial [Oryza sativa Japonica Group]
          Length = 128

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/29 (75%), Positives = 26/29 (89%)

Query: 6   FAVLPAFSGNVPAALQNVFPSAKITQLGN 34
           F VLPAFSGN+PAAL++ FPSAK+T LGN
Sbjct: 99  FPVLPAFSGNIPAALRSKFPSAKVTHLGN 127


>gi|293371911|ref|ZP_06618315.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
 gi|292633157|gb|EFF51734.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
          Length = 289

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 37/61 (60%), Gaps = 3/61 (4%)

Query: 8   VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
           VLPAF+G+VPA L+ ++P A I  LG W       R  C + L+  D LF +I + F+++
Sbjct: 221 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDE 277

Query: 68  Q 68
           Q
Sbjct: 278 Q 278


>gi|440799252|gb|ELR20307.1| AlphaN-acetylglucosaminidase, putative, partial [Acanthamoeba
           castellanii str. Neff]
          Length = 389

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 47/101 (46%), Gaps = 19/101 (18%)

Query: 2   LLADFAVLPAFSGNVPAA----------------LQNVFPSAKITQLGNWFSVKSDPRWC 45
           +     VLPAF+G VP A                ++ ++P+A +T+  +W       ++ 
Sbjct: 292 MFGMLPVLPAFAGFVPEASVKFTLGRGGGCGEQGIKRIYPTANLTKSADWAGFPH--QYT 349

Query: 46  CTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 86
             Y L   D L+  IG   I    +E+G T HIYN DTF+E
Sbjct: 350 NVYFLSPLDSLYKTIGSKVIRLVEEEFG-TDHIYNADTFNE 389


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.135    0.425 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,162,292,582
Number of Sequences: 23463169
Number of extensions: 394314948
Number of successful extensions: 856777
Number of sequences better than 100.0: 512
Number of HSP's better than 100.0 without gapping: 496
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 853567
Number of HSP's gapped (non-prelim): 810
length of query: 545
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 397
effective length of database: 8,886,646,355
effective search space: 3527998602935
effective search space used: 3527998602935
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)