BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 012032
(472 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255540793|ref|XP_002511461.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
gi|223550576|gb|EEF52063.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
Length = 809
Score = 819 bits (2115), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/464 (82%), Positives = 423/464 (91%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGG LPQSW QQL+LQKKIL R+YELGMNPVLPAFSGNVPAAL+N+FPSAKI
Sbjct: 234 MGNLHRWGGSLPQSWFFQQLILQKKILARMYELGMNPVLPAFSGNVPAALRNIFPSAKIA 293
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNWFSVKSD RWCCTYLLDATDPLFIEIGRAFIEQQL+EYG TSHIYNCDTFDENTPP
Sbjct: 294 RLGNWFSVKSDLRWCCTYLLDATDPLFIEIGRAFIEQQLEEYGSTSHIYNCDTFDENTPP 353
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD P+YIS+LGAA++ GMQSGD+DAVWLMQGWLFSYDPFWRPPQMKALL+SVP+G+LVVL
Sbjct: 354 VDDPKYISALGAAVFKGMQSGDNDAVWLMQGWLFSYDPFWRPPQMKALLHSVPVGRLVVL 413
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DLFAEVKPIW++S QFYGVPYIWCMLHNFAGN+EMYGILDSIA GPVEARTSEN+TMVGV
Sbjct: 414 DLFAEVKPIWTSSYQFYGVPYIWCMLHNFAGNVEMYGILDSIASGPVEARTSENSTMVGV 473
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNPVVYDLMSEMAFQH+KVDVKAWIN YS RRYGRSVP+IQDAW++LYHTVY
Sbjct: 474 GMSMEGIEQNPVVYDLMSEMAFQHKKVDVKAWINLYSTRRYGRSVPSIQDAWDILYHTVY 533
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
NCTDGA DKNRDVIVAFPDV+P SV++ ++ GKPVS+ AVLK + SYDHPHLWYS
Sbjct: 534 NCTDGAYDKNRDVIVAFPDVNPFYFSVSQKRHHLNGKPVSRRAVLKENSDSYDHPHLWYS 593
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
TSEV+ ALELFI SG ELS S+TY YDL+DLTRQALAKY NELFL IIE+YQ ND +GV
Sbjct: 594 TSEVLHALELFITSGEELSGSSTYSYDLVDLTRQALAKYGNELFLKIIESYQANDGNGVA 653
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
S++FL+LVEDMD LL CH+GFLLGPWLESAKQLAQ++EQEKQ
Sbjct: 654 SRSQKFLDLVEDMDTLLGCHEGFLLGPWLESAKQLAQDQEQEKQ 697
>gi|225457148|ref|XP_002280399.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Vitis vinifera]
Length = 813
Score = 810 bits (2092), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/468 (81%), Positives = 418/468 (89%), Gaps = 4/468 (0%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLPQSWLDQQL+LQKKIL R+YELGM PVLPAFSGNVPAAL+ +FPSAKIT
Sbjct: 235 MGNLHGWGGPLPQSWLDQQLLLQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKIT 294
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNWF+V +PRWCCTYLLDATDPLFIEIG+AFI+QQLKEYGRT HIYNCDTFDENTPP
Sbjct: 295 RLGNWFTVGGNPRWCCTYLLDATDPLFIEIGKAFIQQQLKEYGRTGHIYNCDTFDENTPP 354
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYDPFWRPPQMKALL+SVP+G+LVVL
Sbjct: 355 VDDPEYISSLGAAIFRGMQSGDSNAIWLMQGWLFSYDPFWRPPQMKALLHSVPMGRLVVL 414
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DLFAEVKPIW TS+QFYGVPYIWCMLHNFAGNIEMYGILD++A GPVEARTSEN+TMVGV
Sbjct: 415 DLFAEVKPIWITSEQFYGVPYIWCMLHNFAGNIEMYGILDAVASGPVEARTSENSTMVGV 474
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNPVVYDLMSEMAFQH KVDVK WI YS RRYG+SVP IQDAWN+LYHTVY
Sbjct: 475 GMSMEGIEQNPVVYDLMSEMAFQHSKVDVKVWIALYSTRRYGKSVPEIQDAWNILYHTVY 534
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTE----GKYQNYGKPVSKEAVLKSETSSYDHPH 356
NCTDG+ DKNRDVIVAFPD+DPS I + G Y YGK VS+ VLK T+S++ PH
Sbjct: 535 NCTDGSYDKNRDVIVAFPDIDPSFIPTPKLSMPGGYHRYGKSVSRRTVLKEITNSFEQPH 594
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
LWYSTSEV AL LFIASG +L SNTYRYDL+DLTRQALAKYAN+LFL +IEAYQLND
Sbjct: 595 LWYSTSEVKDALGLFIASGGQLLGSNTYRYDLVDLTRQALAKYANQLFLEVIEAYQLNDV 654
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
G S++FLELVEDMD LLACHDGFLLGPWLESAKQLAQ+E+QE Q
Sbjct: 655 RGAACHSQKFLELVEDMDTLLACHDGFLLGPWLESAKQLAQDEQQEIQ 702
>gi|224121634|ref|XP_002318632.1| predicted protein [Populus trichocarpa]
gi|222859305|gb|EEE96852.1| predicted protein [Populus trichocarpa]
Length = 812
Score = 808 bits (2086), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/464 (81%), Positives = 420/464 (90%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NLH WGGPLPQSW DQQLVLQKKIL R+YELGM PVLPAFSGNVPAAL+N+FPSAKIT
Sbjct: 238 MANLHRWGGPLPQSWFDQQLVLQKKILARMYELGMTPVLPAFSGNVPAALRNIFPSAKIT 297
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNWFSV+SD RWCCTYLLDATDPLFIEIGRAFIEQQL EYG TSHIYNCDTFDENTPP
Sbjct: 298 RLGNWFSVRSDVRWCCTYLLDATDPLFIEIGRAFIEQQLTEYGSTSHIYNCDTFDENTPP 357
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD PEYISSLG +I+ GMQSGDS+AVWLMQGWLFSYDPFWRPPQ KALL+SVP+G+LVVL
Sbjct: 358 VDDPEYISSLGGSIFEGMQSGDSNAVWLMQGWLFSYDPFWRPPQTKALLHSVPIGRLVVL 417
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DLFAEVKPIW+TS+QFYGVPYIWCMLHNFAGN+EMYG LDS+A GPVEARTSEN+TMVGV
Sbjct: 418 DLFAEVKPIWNTSEQFYGVPYIWCMLHNFAGNLEMYGYLDSVASGPVEARTSENSTMVGV 477
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNPVVYDLMSEMAFQ KVDVK WI+ YS RRYGRSVP IQ+AWN+LYHTVY
Sbjct: 478 GMSMEGIEQNPVVYDLMSEMAFQKNKVDVKEWIDLYSARRYGRSVPTIQNAWNILYHTVY 537
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
NCTDGA DKNRDVIVAFPDV+P+++S+ +G++ K VS+ A L T SY+HPHLWYS
Sbjct: 538 NCTDGAYDKNRDVIVAFPDVNPNLVSMLQGRHHTDVKLVSRRAALIKNTDSYEHPHLWYS 597
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
T+EV+RALELFIA G+ELS S+TY YDL+DLTRQ LAKYANELFL +IEAY+L D+HGV
Sbjct: 598 TTEVVRALELFIAGGDELSGSSTYSYDLVDLTRQVLAKYANELFLKVIEAYRLKDSHGVA 657
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
S+ FL+LVED+D LLACH+GFLLGPWLESAKQLAQ+EEQ+ Q
Sbjct: 658 HQSQMFLDLVEDIDTLLACHEGFLLGPWLESAKQLAQDEEQQIQ 701
>gi|356519003|ref|XP_003528164.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Glycine max]
Length = 812
Score = 797 bits (2058), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/464 (80%), Positives = 417/464 (89%), Gaps = 1/464 (0%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLPQSW DQQL+LQKKIL R++ELGM PVLPAFSGNVPAAL+++FPSAKIT
Sbjct: 233 MGNLHGWGGPLPQSWFDQQLILQKKILARMFELGMTPVLPAFSGNVPAALKHIFPSAKIT 292
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNWFSVK+D +WCCTYLLDATD LF+EIG+AFIE+QL+EYGRTSHIYNCDTFDENTPP
Sbjct: 293 RLGNWFSVKNDLKWCCTYLLDATDSLFVEIGKAFIEKQLQEYGRTSHIYNCDTFDENTPP 352
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD PEYISSLGAA + GMQSGD DAVWLMQGWLFSYDPFWRPPQMKALL+SVP+GKLVVL
Sbjct: 353 VDDPEYISSLGAATFKGMQSGDDDAVWLMQGWLFSYDPFWRPPQMKALLHSVPVGKLVVL 412
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DLFAEVKPIW TS+QFYGVPYIWCMLHNFAGNIEMYGILD+IA GP++ARTS N+TMVGV
Sbjct: 413 DLFAEVKPIWVTSEQFYGVPYIWCMLHNFAGNIEMYGILDAIASGPIDARTSNNSTMVGV 472
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNP+VYDLMSEMAFQH+KVDVKAW++ YS RRYG+++P IQ+ WNVLYHT+Y
Sbjct: 473 GMSMEGIEQNPIVYDLMSEMAFQHKKVDVKAWVDMYSTRRYGQTLPLIQEGWNVLYHTIY 532
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
NCTDGA DKNRDVIVAFPDVDPS+ISV + + KP S ++K T S+D PHLWY
Sbjct: 533 NCTDGAYDKNRDVIVAFPDVDPSLISVQHEQSHHNDKPYSG-TIIKEITDSFDRPHLWYP 591
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
TSEVI ALELFI SG+ELS NTYRYDL+DLTRQ LAKYANELF +IEAYQ +D HG+
Sbjct: 592 TSEVIYALELFITSGDELSRCNTYRYDLVDLTRQVLAKYANELFFKVIEAYQSHDIHGMT 651
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
LS+RFL+LVED+D LLACHDGFLLGPWLESAKQLA NEEQE+Q
Sbjct: 652 LLSQRFLDLVEDLDTLLACHDGFLLGPWLESAKQLALNEEQERQ 695
>gi|297733843|emb|CBI15090.3| unnamed protein product [Vitis vinifera]
Length = 846
Score = 772 bits (1994), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/501 (74%), Positives = 411/501 (82%), Gaps = 37/501 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLPQSWLDQQL+LQKKIL R+YELGM PVLPAFSGNVPAAL+ +FPSAKIT
Sbjct: 235 MGNLHGWGGPLPQSWLDQQLLLQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKIT 294
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNWF+V +PRWCCTYLLDATDPLFIEIG+AFI+QQLKEYGRT HIYNCDTFDENTPP
Sbjct: 295 RLGNWFTVGGNPRWCCTYLLDATDPLFIEIGKAFIQQQLKEYGRTGHIYNCDTFDENTPP 354
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYDPFWRPPQMKALL+SVP+G+LVVL
Sbjct: 355 VDDPEYISSLGAAIFRGMQSGDSNAIWLMQGWLFSYDPFWRPPQMKALLHSVPMGRLVVL 414
Query: 181 DLFAEVKPIWSTSKQFYGVPYIW--------------------------------CMLHN 208
DLFAEVKPIW TS+QFYGVPYIW CMLHN
Sbjct: 415 DLFAEVKPIWITSEQFYGVPYIWKVTKSGRQQSLKFTNEKCCSFFRSHSPDSEVLCMLHN 474
Query: 209 FAGNIEMYGILDSIAFGPVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 267
FAGNIEMYGILD++A GP+ R + +VGVGMSMEGIEQNPVVYDLMSEMAFQH KV
Sbjct: 475 FAGNIEMYGILDAVASGPILLRAKYAESAVVGVGMSMEGIEQNPVVYDLMSEMAFQHSKV 534
Query: 268 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 327
DVK WI YS RRYG+SVP IQDAWN+LYHTVYNCTDG+ DKNRDVIVAFPD+DPS I
Sbjct: 535 DVKVWIALYSTRRYGKSVPEIQDAWNILYHTVYNCTDGSYDKNRDVIVAFPDIDPSFIPT 594
Query: 328 TE----GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 383
+ G Y YGK VS+ VLK T+S++ PHLWYSTSEV AL LFIASG +L SNT
Sbjct: 595 PKLSMPGGYHRYGKSVSRRTVLKEITNSFEQPHLWYSTSEVKDALGLFIASGGQLLGSNT 654
Query: 384 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 443
YRYDL+DLTRQALAKYAN+LFL +IEAYQLND G S++FLELVEDMD LLACHDGF
Sbjct: 655 YRYDLVDLTRQALAKYANQLFLEVIEAYQLNDVRGAACHSQKFLELVEDMDTLLACHDGF 714
Query: 444 LLGPWLESAKQLAQNEEQEKQ 464
LLGPWLESAKQLAQ+E+QE Q
Sbjct: 715 LLGPWLESAKQLAQDEQQEIQ 735
>gi|449441031|ref|XP_004138287.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis sativus]
Length = 808
Score = 759 bits (1960), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/464 (75%), Positives = 409/464 (88%), Gaps = 2/464 (0%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLPQSW DQQL+LQKK++ R++ELGM PVLPAFSGN+PAA + ++P+AKIT
Sbjct: 236 MGNLHKWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKIT 295
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNWF+V SDPRWCCTYLLDA DPLF+EIG+AFIEQQ KEYGRTSH+YNCDTFDENTPP
Sbjct: 296 RLGNWFTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPP 355
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD EYISSLG+AI+ GMQ+GDS+AVWLMQGW+FSYDPFWRP QMKALL+SVPLG+LVVL
Sbjct: 356 VDDVEYISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVL 415
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AEVKPIW +S+QFYG+PYIWCMLHNFAGN+EMYGILDSIA GP+EAR+S +TMVGV
Sbjct: 416 DLYAEVKPIWISSEQFYGIPYIWCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGV 475
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNPVVYDLMSEMAFQH KVDVK W+ QYSVRRYG VP+IQDAW+VLYHTVY
Sbjct: 476 GMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTVY 535
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
NCTDGA DKNRDVIVAFPDVDPS I V +G S +V + + +++D PHLWY
Sbjct: 536 NCTDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDS--SVDRLQDATFDRPHLWYP 593
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
TSEVI AL+LFIA G++LS+SNTYRYDL+DLTRQALAKY+NELF I++AYQL+D +
Sbjct: 594 TSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMA 653
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
LS+ FLELV D+D LLACH+GFLLGPWL+SAKQLA++EE+EKQ
Sbjct: 654 SLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQ 697
>gi|326515664|dbj|BAK07078.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 829
Score = 704 bits (1816), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 319/469 (68%), Positives = 393/469 (83%), Gaps = 6/469 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQ+WLD QL LQKKIL R+Y GM+PVLPAFSGN+PAAL+ FPSAK+T
Sbjct: 239 MANMHGWGGPLPQTWLDDQLTLQKKILSRMYAFGMSPVLPAFSGNIPAALKLKFPSAKVT 298
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPL++EIG+ FIE+Q++EYGRTSH+YNCDTFDENTPP
Sbjct: 299 HLGNWFTVDSNPRWCCTYLLDASDPLYVEIGKLFIEEQIREYGRTSHVYNCDTFDENTPP 358
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + MQSGD+DA+WLMQGWLF+YDPFW PPQMKALL+SVP+G+++VL
Sbjct: 359 LSDPNYISSLGAATFRAMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGRMIVL 418
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AEVKP+W S QFYGVPYIWCMLHNFA + EMYG+LD++A GP++AR SEN+TMVGV
Sbjct: 419 DLYAEVKPVWINSDQFYGVPYIWCMLHNFAADFEMYGVLDAVASGPIDARLSENSTMVGV 478
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNP+VYDLMSEM F H +VD+K W+ Y RRYG+SV +QDAW +L+ T+Y
Sbjct: 479 GMSMEGIEQNPIVYDLMSEMVFHHRQVDLKVWVETYPTRRYGKSVVGLQDAWRILHQTLY 538
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAVLK-SETSSYDHP 355
NCTDG DKNRDVIVAFPDV+PS+I T G Y +NY +S+ V+K + +Y+ P
Sbjct: 539 NCTDGKNDKNRDVIVAFPDVEPSVIQ-TPGLYARTSKNYSTMLSENYVMKDAPNDAYEQP 597
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
H+WY T VI ALELF+ SG+E+S S+T+RYDL+DLTRQALAKYAN++FL II+ Y+ N+
Sbjct: 598 HIWYDTIAVIHALELFLESGDEVSDSSTFRYDLVDLTRQALAKYANQIFLKIIQGYKSNN 657
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
+ V L RFL LV+D+D LLA H+GFLLGPWLESAK LA+++EQE Q
Sbjct: 658 VNQVTTLCERFLNLVKDLDMLLASHEGFLLGPWLESAKGLARSQEQEIQ 706
>gi|326519955|dbj|BAK03902.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 829
Score = 702 bits (1811), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 319/469 (68%), Positives = 392/469 (83%), Gaps = 6/469 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQ+WLD QL LQKKIL R+Y GM+PVLPAFSGN+PAAL+ FPSAK+T
Sbjct: 239 MANMHGWGGPLPQTWLDDQLTLQKKILSRMYAFGMSPVLPAFSGNIPAALKLKFPSAKVT 298
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPL++EIG+ FIE+Q++EYGRTSH+YNCDTFDENTPP
Sbjct: 299 HLGNWFTVDSNPRWCCTYLLDASDPLYVEIGKLFIEEQIREYGRTSHVYNCDTFDENTPP 358
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + MQSGD+DA+WLMQGWLF+YDPFW PPQMKALL+SVP+G+++VL
Sbjct: 359 LSDPNYISSLGAATFRAMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGRMIVL 418
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AEVKP W S QFYGVPYIWCMLHNFA + EMYG+LD++A GP++AR SEN+TMVGV
Sbjct: 419 DLYAEVKPAWINSDQFYGVPYIWCMLHNFAADFEMYGVLDAVASGPIDARLSENSTMVGV 478
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNP+VYDLMSEM F H +VD+K W+ Y RRYG+SV +QDAW +L+ T+Y
Sbjct: 479 GMSMEGIEQNPIVYDLMSEMVFHHRQVDLKVWVETYPTRRYGKSVVGLQDAWRILHQTLY 538
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAVLK-SETSSYDHP 355
NCTDG DKNRDVIVAFPDV+PS+I T G Y +NY +S+ V+K + +Y+ P
Sbjct: 539 NCTDGKNDKNRDVIVAFPDVEPSVIQ-TPGLYARTSKNYSTMLSENYVMKDAPNDAYEQP 597
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
H+WY T VI ALELF+ SG+E+S S+T+RYDL+DLTRQALAKYAN++FL II+ Y+ N+
Sbjct: 598 HIWYDTIAVIHALELFLESGDEVSDSSTFRYDLVDLTRQALAKYANQIFLKIIQGYKSNN 657
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
+ V L RFL LV+D+D LLA H+GFLLGPWLESAK LA+++EQE Q
Sbjct: 658 VNQVTTLCERFLNLVKDLDMLLASHEGFLLGPWLESAKGLARSQEQEIQ 706
>gi|414585093|tpg|DAA35664.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
Length = 721
Score = 702 bits (1811), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/470 (68%), Positives = 392/470 (83%), Gaps = 6/470 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQ+WLD QLVLQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 240 MANMHGWGGPLPQTWLDDQLVLQKKILSRMYSFGMFPVLPAFSGNIPAALKSKFPSAKVT 299
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYGRTSHIYNCDTFDENTPP
Sbjct: 300 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFIEEQIREYGRTSHIYNCDTFDENTPP 359
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + GMQSGD+DA+WLMQGWLF+YDPFW PPQMKALL+SVP+GK++VL
Sbjct: 360 LSDPNYISSLGAATFRGMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGKMIVL 419
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AEVKP+W S Q YGVPYIWCMLHNFA + EMYG+LD++A GP++AR S+N+TMVGV
Sbjct: 420 DLYAEVKPVWINSDQLYGVPYIWCMLHNFAADFEMYGVLDALASGPIDARLSDNSTMVGV 479
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNP+VYDLMSEMAF H +VD++ W+ Y RRYG+ V +QDAW +LY T+Y
Sbjct: 480 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVKTYPTRRYGKPVKGLQDAWWILYRTLY 539
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN----YGKPVSKEAVLKSETS-SYDHP 355
NCTDG DKNRDVIVAFPDV+P +I+ T G + N Y SK + K +S +Y+HP
Sbjct: 540 NCTDGKNDKNRDVIVAFPDVEPFVIA-TPGLHVNTRQMYSTVPSKNYIRKDVSSDAYEHP 598
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
HLWY T+ VI ALELF+ G+E+S SNT+RYDL+DLTRQ LAKYAN++FL IIE+Y+ N+
Sbjct: 599 HLWYDTNAVIHALELFLQHGDEVSDSNTFRYDLVDLTRQVLAKYANDVFLKIIESYKSNN 658
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
+ V L + FL LV D+D LL+ H+GFLLGPWLESAK LA+N EQE QV
Sbjct: 659 MNQVTILCQHFLSLVNDLDTLLSSHEGFLLGPWLESAKGLARNSEQEIQV 708
>gi|222629680|gb|EEE61812.1| hypothetical protein OsJ_16433 [Oryza sativa Japonica Group]
Length = 1129
Score = 701 bits (1810), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 318/469 (67%), Positives = 388/469 (82%), Gaps = 6/469 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQSWLD QL LQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 537 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 596
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYG TSH+Y+CDTFDENTPP
Sbjct: 597 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGGTSHVYSCDTFDENTPP 656
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + GMQSGD DA+WLMQGWLFSYDPFW PPQMKALL+SVP+G+++VL
Sbjct: 657 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFSYDPFWEPPQMKALLHSVPVGRMIVL 716
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AEVKPIW S QFYGVPYIWCMLHNFA + EMYG+LD +A GP++AR S N+TMVGV
Sbjct: 717 DLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEMYGVLDMVASGPIDARLSANSTMVGV 776
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNP+VYDLMSEMAF H +VD++ W+ Y RRYG+S+ +QDAW +LY T+Y
Sbjct: 777 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVETYPTRRYGKSMVGLQDAWKILYQTLY 836
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAV-LKSETSSYDHP 355
NCTDG DKNRDVIVAFPDV+P +I T G Y + Y +SK + + + Y+HP
Sbjct: 837 NCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSSSKTYSTKLSKNYIAVDASNDEYEHP 895
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
HLWY T VIRALELF+ G+E+S SNT+RYDL+DLTRQ LAKYAN++F+ IIE+Y+ N+
Sbjct: 896 HLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVDLTRQTLAKYANQVFVKIIESYKSNN 955
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
+ V L + F++LV D+D LLA H+GFLLGPWLESAK LA+++EQE Q
Sbjct: 956 VNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLESAKGLARDKEQEMQ 1004
>gi|414585092|tpg|DAA35663.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
Length = 831
Score = 701 bits (1808), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/469 (68%), Positives = 391/469 (83%), Gaps = 6/469 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQ+WLD QLVLQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 240 MANMHGWGGPLPQTWLDDQLVLQKKILSRMYSFGMFPVLPAFSGNIPAALKSKFPSAKVT 299
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYGRTSHIYNCDTFDENTPP
Sbjct: 300 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFIEEQIREYGRTSHIYNCDTFDENTPP 359
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + GMQSGD+DA+WLMQGWLF+YDPFW PPQMKALL+SVP+GK++VL
Sbjct: 360 LSDPNYISSLGAATFRGMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGKMIVL 419
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AEVKP+W S Q YGVPYIWCMLHNFA + EMYG+LD++A GP++AR S+N+TMVGV
Sbjct: 420 DLYAEVKPVWINSDQLYGVPYIWCMLHNFAADFEMYGVLDALASGPIDARLSDNSTMVGV 479
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNP+VYDLMSEMAF H +VD++ W+ Y RRYG+ V +QDAW +LY T+Y
Sbjct: 480 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVKTYPTRRYGKPVKGLQDAWWILYRTLY 539
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN----YGKPVSKEAVLKSETS-SYDHP 355
NCTDG DKNRDVIVAFPDV+P +I+ T G + N Y SK + K +S +Y+HP
Sbjct: 540 NCTDGKNDKNRDVIVAFPDVEPFVIA-TPGLHVNTRQMYSTVPSKNYIRKDVSSDAYEHP 598
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
HLWY T+ VI ALELF+ G+E+S SNT+RYDL+DLTRQ LAKYAN++FL IIE+Y+ N+
Sbjct: 599 HLWYDTNAVIHALELFLQHGDEVSDSNTFRYDLVDLTRQVLAKYANDVFLKIIESYKSNN 658
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
+ V L + FL LV D+D LL+ H+GFLLGPWLESAK LA+N EQE Q
Sbjct: 659 MNQVTILCQHFLSLVNDLDTLLSSHEGFLLGPWLESAKGLARNSEQEIQ 707
>gi|218195716|gb|EEC78143.1| hypothetical protein OsI_17702 [Oryza sativa Indica Group]
Length = 829
Score = 697 bits (1800), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 317/469 (67%), Positives = 388/469 (82%), Gaps = 6/469 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQSWLD QL LQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 237 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 296
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYG TSH+Y+CDTFDENTPP
Sbjct: 297 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGGTSHVYSCDTFDENTPP 356
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + GMQSGD DA+WLMQGWLFSYDPFW PPQMKALL+SVP+G+++VL
Sbjct: 357 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFSYDPFWEPPQMKALLHSVPVGRMIVL 416
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AEVKPIW S QFYGVPYIWCMLHNFA + EMYG+LD +A GP++AR S N+TM+GV
Sbjct: 417 DLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEMYGVLDMVASGPIDARLSANSTMIGV 476
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNP+VYDLMSEMAF H +VD++ W+ Y RRYG+S+ +QDAW +LY T+Y
Sbjct: 477 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVETYPTRRYGKSIVGLQDAWKILYQTLY 536
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAV-LKSETSSYDHP 355
NCTDG DKNRDVIVAFPDV+P +I T G Y + Y +SK + + + Y+HP
Sbjct: 537 NCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSSSKTYSTKLSKNYIAVDASNDEYEHP 595
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
HLWY T VIRALELF+ G+E+S SNT+RYDL+DLTRQ LAKYAN++F+ IIE+Y+ N+
Sbjct: 596 HLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVDLTRQTLAKYANQVFVKIIESYKSNN 655
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
+ V L + F++LV D+D LLA H+GFLLGPWLESAK LA+++EQE Q
Sbjct: 656 VNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLESAKGLARDKEQEMQ 704
>gi|38345908|emb|CAE04506.2| OSJNBb0059K02.16 [Oryza sativa Japonica Group]
Length = 829
Score = 697 bits (1798), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 318/469 (67%), Positives = 388/469 (82%), Gaps = 6/469 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQSWLD QL LQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 237 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 296
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYG TSH+Y+CDTFDENTPP
Sbjct: 297 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGGTSHVYSCDTFDENTPP 356
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + GMQSGD DA+WLMQGWLFSYDPFW PPQMKALL+SVP+G+++VL
Sbjct: 357 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFSYDPFWEPPQMKALLHSVPVGRMIVL 416
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AEVKPIW S QFYGVPYIWCMLHNFA + EMYG+LD +A GP++AR S N+TMVGV
Sbjct: 417 DLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEMYGVLDMVASGPIDARLSANSTMVGV 476
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNP+VYDLMSEMAF H +VD++ W+ Y RRYG+S+ +QDAW +LY T+Y
Sbjct: 477 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVETYPTRRYGKSMVGLQDAWKILYQTLY 536
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAV-LKSETSSYDHP 355
NCTDG DKNRDVIVAFPDV+P +I T G Y + Y +SK + + + Y+HP
Sbjct: 537 NCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSSSKTYSTKLSKNYIAVDASNDEYEHP 595
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
HLWY T VIRALELF+ G+E+S SNT+RYDL+DLTRQ LAKYAN++F+ IIE+Y+ N+
Sbjct: 596 HLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVDLTRQTLAKYANQVFVKIIESYKSNN 655
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
+ V L + F++LV D+D LLA H+GFLLGPWLESAK LA+++EQE Q
Sbjct: 656 VNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLESAKGLARDKEQEMQ 704
>gi|357166414|ref|XP_003580702.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Brachypodium
distachyon]
Length = 829
Score = 695 bits (1793), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/469 (67%), Positives = 394/469 (84%), Gaps = 6/469 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQ+WLD QL LQKKIL R+Y GM+PVLPAFSG++PAAL++ FPSAK+T
Sbjct: 238 MANMHGWGGPLPQTWLDDQLTLQKKILSRMYAFGMSPVLPAFSGSIPAALKSKFPSAKVT 297
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYGRTSH+YNCDTFDENTPP
Sbjct: 298 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGRTSHVYNCDTFDENTPP 357
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + GMQSGD DA+WLMQGWLF+YDPFW PPQMKALL+SVP+G+++VL
Sbjct: 358 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGRMIVL 417
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AEVKP+W S QFYGVPYIWCMLHNFA + EMYG+LD++A GP++AR SEN+TMVGV
Sbjct: 418 DLYAEVKPVWINSDQFYGVPYIWCMLHNFAADFEMYGVLDAVASGPIDARLSENSTMVGV 477
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNP+VYDLMSEM F H +VD++ W+ Y RRYG+S+ +QDAW +L+ T+Y
Sbjct: 478 GMSMEGIEQNPIVYDLMSEMVFHHRQVDLQVWVETYPTRRYGKSIVELQDAWRILHQTLY 537
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS----KEAVLKSETS-SYDHP 355
NCTDG DKNRDVIVAFPDV+P +I T G + + K S K ++K E++ +Y+ P
Sbjct: 538 NCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLHTSASKMFSTMSAKSYLVKDESNDAYEQP 596
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
HLWY T+ VIRAL+LF+ G+E+S S+T+RYDL+DLTRQALAKYAN++F II++Y+ N+
Sbjct: 597 HLWYDTNVVIRALQLFLQYGDEVSDSSTFRYDLVDLTRQALAKYANQIFAKIIQSYKSNN 656
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
+ V LS FL+LV D+D LLA H+GFLLGPWLESAK LA+++EQE Q
Sbjct: 657 MNQVTTLSECFLDLVNDLDMLLASHEGFLLGPWLESAKGLARDQEQEIQ 705
>gi|218192858|gb|EEC75285.1| hypothetical protein OsI_11626 [Oryza sativa Indica Group]
Length = 812
Score = 639 bits (1649), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 298/466 (63%), Positives = 365/466 (78%), Gaps = 2/466 (0%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQL LQKKIL R+ ELGM PVLP+FSGNVP+ + +FPSA IT
Sbjct: 243 MGNLHGWGGPLSQNWLDQQLTLQKKILSRMIELGMVPVLPSFSGNVPSVFKKLFPSANIT 302
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W +V DPRWCCTYLLD +D LFI++G+AFI QQ+KEYG ++IYNCDTF+ENTPP
Sbjct: 303 KLGDWNTVDGDPRWCCTYLLDPSDALFIDVGQAFIRQQMKEYGDITNIYNCDTFNENTPP 362
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ P YISSLG+AIY M G+ DAVWLMQGWLF D FW+ PQMKALL+SVP GK++V
Sbjct: 363 TNEPAYISSLGSAIYEAMSRGNKDAVWLMQGWLFYSDAAFWKEPQMKALLHSVPTGKMIV 422
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFA+VKPIW S QFYGVPYIWCMLHNF GNIEMYGILDSIA GP++ARTS N+TMVG
Sbjct: 423 LDLFADVKPIWQMSSQFYGVPYIWCMLHNFGGNIEMYGILDSIASGPIDARTSHNSTMVG 482
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIE NPVVY+LMSEMAF+ +KV+V+ W+ YS RRYG+S ++ AW +LYHT+
Sbjct: 483 VGMCMEGIEHNPVVYELMSEMAFRSQKVEVEDWLKIYSYRRYGQSNVEVEKAWGILYHTI 542
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS-SYDHPHLW 358
YNCTDG D N+D IV FPD+ P+ S K + + + SE S S HPHLW
Sbjct: 543 YNCTDGIADHNKDYIVQFPDISPNSFSSDVSKRKAISEVKKHRRFVLSEVSASLPHPHLW 602
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
YST E I+ALELF+ +GN+LS S TYRYDL+DLTRQ+L+K ANE++L+ + AY+ D++G
Sbjct: 603 YSTKEAIKALELFLNAGNDLSKSLTYRYDLVDLTRQSLSKLANEVYLDAMNAYRKKDSNG 662
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
+ +++FLEL+ D+D LLA D FLLGPWLE AK LA+ E + KQ
Sbjct: 663 LNFYTKKFLELIVDIDTLLASDDNFLLGPWLEDAKSLARTENERKQ 708
>gi|222624949|gb|EEE59081.1| hypothetical protein OsJ_10898 [Oryza sativa Japonica Group]
Length = 812
Score = 638 bits (1646), Expect = e-180, Method: Compositional matrix adjust.
Identities = 298/466 (63%), Positives = 364/466 (78%), Gaps = 2/466 (0%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQL LQKKIL R+ ELGM PVLP+FSGNVP+ + +FPSA IT
Sbjct: 243 MGNLHGWGGPLSQNWLDQQLTLQKKILSRMIELGMVPVLPSFSGNVPSVFKKLFPSANIT 302
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W +V DPRWCCTYLLD +D LFI++G+AFI QQ+KEYG ++IYNCDTF+ENTPP
Sbjct: 303 KLGDWNTVDGDPRWCCTYLLDPSDALFIDVGQAFIRQQMKEYGDITNIYNCDTFNENTPP 362
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ P YISSLG+AIY M G+ DAVWLMQGWLF D FW+ PQMKALL+SVP GK++V
Sbjct: 363 TNEPAYISSLGSAIYEAMSRGNKDAVWLMQGWLFYSDAAFWKEPQMKALLHSVPTGKMIV 422
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFA+VKPIW S QFYGVPYIWCMLHNF GNIEMYGILDSIA GP++ARTS N+TMVG
Sbjct: 423 LDLFADVKPIWQMSSQFYGVPYIWCMLHNFGGNIEMYGILDSIASGPIDARTSHNSTMVG 482
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIE NPVVY+LMSEMAF+ +KV+V+ W+ YS RRYG+S ++ AW +LYHT+
Sbjct: 483 VGMCMEGIEHNPVVYELMSEMAFRSQKVEVEDWLKIYSYRRYGQSNVEVEKAWGILYHTI 542
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS-SYDHPHLW 358
YNCTDG D N D IV FPD+ P+ S K + + + SE S S HPHLW
Sbjct: 543 YNCTDGIADHNNDYIVEFPDISPNSFSSDVSKRKAISEVKKHRRFVLSEVSASLPHPHLW 602
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
YST E I+ALELF+ +GN+LS S TYRYDL+DLTRQ+L+K ANE++L+ + AY+ D++G
Sbjct: 603 YSTKEAIKALELFLNAGNDLSKSLTYRYDLVDLTRQSLSKLANEVYLDAMNAYRKKDSNG 662
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
+ +++FLEL+ D+D LLA D FLLGPWLE AK LA+ E + KQ
Sbjct: 663 LNFYTKKFLELIVDIDTLLASDDNFLLGPWLEDAKSLARTENERKQ 708
>gi|413955691|gb|AFW88340.1| hypothetical protein ZEAMMB73_315381 [Zea mays]
Length = 814
Score = 628 bits (1620), Expect = e-177, Method: Compositional matrix adjust.
Identities = 292/466 (62%), Positives = 360/466 (77%), Gaps = 2/466 (0%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQL LQKKIL R+ ELGM PVLP+FSGNVPA +FPSA IT
Sbjct: 244 MGNLHGWGGPLSQNWLDQQLALQKKILSRMIELGMVPVLPSFSGNVPAIFAKLFPSANIT 303
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W +V ++P+WCCTYLLD +D LFI++G+AFI QQ+KEYG ++IYNCDTF+ENTPP
Sbjct: 304 RLGDWNTVDANPKWCCTYLLDPSDSLFIDVGQAFIRQQIKEYGDVTNIYNCDTFNENTPP 363
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
D P YISSLG+AIY M G+ +AVWLMQGWLF D FW+ PQMKALL+SVP+GK++V
Sbjct: 364 TDEPAYISSLGSAIYEAMSRGNKNAVWLMQGWLFYSDAAFWKEPQMKALLHSVPIGKMIV 423
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFA+VKPIW S QFYGVPYIWCMLHNF GNIEMYGILDSI+ GP++ARTS N+TM+G
Sbjct: 424 LDLFADVKPIWKVSSQFYGVPYIWCMLHNFGGNIEMYGILDSISSGPIDARTSYNSTMIG 483
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIE NPVVY+LMSEMAF ++KV+V+ W+ YS RRYG++ I+ AW LYHT+
Sbjct: 484 VGMCMEGIEHNPVVYELMSEMAFHNKKVEVEDWLKTYSCRRYGQANADIEKAWRYLYHTI 543
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSS-YDHPHLW 358
YNCTDG D N+D IV FPD+ PS ++ K + + SE S PHLW
Sbjct: 544 YNCTDGIADHNKDYIVEFPDISPSSVTYQVSKRRGMSITRNHRRFFLSEVSGILPQPHLW 603
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
YST E ++ALELF+ +G+ S S TYRYDL+DLTRQ L+K ANE++L+ I YQ D+HG
Sbjct: 604 YSTKEAVKALELFLDAGSTFSESLTYRYDLVDLTRQCLSKLANEVYLDAISLYQKKDSHG 663
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
+ +R+FLE++ D+D LLA D FLLGPWLESAK LA E++ +Q
Sbjct: 664 LNAHARKFLEIIVDIDTLLAADDNFLLGPWLESAKSLAITEKERQQ 709
>gi|297736304|emb|CBI24942.3| unnamed protein product [Vitis vinifera]
Length = 868
Score = 624 bits (1610), Expect = e-176, Method: Compositional matrix adjust.
Identities = 296/472 (62%), Positives = 363/472 (76%), Gaps = 9/472 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLD+QLVLQK+IL R+ ELGM PVLP+FSGNVP AL+ +FPSA IT
Sbjct: 294 MGNLHGWGGPLSQNWLDEQLVLQKQILCRMLELGMTPVLPSFSGNVPEALKKIFPSANIT 353
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W +V ++ RWCCTYLLDA+DPLFI+IG+AFI QQ+KEYG + IYNCDTF+EN+PP
Sbjct: 354 RLGEWNTVDNNTRWCCTYLLDASDPLFIQIGKAFIRQQIKEYGDVTDIYNCDTFNENSPP 413
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ P YISSLGAAIY M GD D+VWLMQGWLF D FW+PPQMKALL+SVP GK+VV
Sbjct: 414 TNDPAYISSLGAAIYKAMSQGDKDSVWLMQGWLFYSDSGFWKPPQMKALLHSVPFGKMVV 473
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFA+ KPIW TS QFYG PYIWCMLHNF GNIEMYGILD+++ GPV+AR S+N+TMVG
Sbjct: 474 LDLFADAKPIWRTSSQFYGTPYIWCMLHNFGGNIEMYGILDAVSSGPVDARISKNSTMVG 533
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIEQNPV Y+LMSEMAF+ EKV + W+ YS RRYG++V ++ AW +LY T+
Sbjct: 534 VGMCMEGIEQNPVAYELMSEMAFRSEKVQLVEWLKTYSYRRYGKAVHHVEAAWEILYRTI 593
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSI-----ISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
YNCTDG D N D +V FPD DPS+ IS + Q + +L ETSS D
Sbjct: 594 YNCTDGIADHNTDFMVNFPDWDPSLNPSSDISKEQHIIQKILTQTGRRKILFQETSS-DL 652
Query: 355 P--HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
P HLWYST EV+ AL LF+ +GNELS S+TYRYDL+DLTRQ L+K N+++L+ + A++
Sbjct: 653 PQSHLWYSTHEVVNALRLFLDAGNELSKSSTYRYDLVDLTRQVLSKLGNQVYLDAVIAFR 712
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
DA S++F++LV+D+D LLA D FLLG WLESAK+LA N + +Q
Sbjct: 713 QKDAKNFHLHSQKFVQLVKDIDTLLASDDNFLLGTWLESAKKLAVNPREMEQ 764
>gi|225450036|ref|XP_002273084.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Vitis vinifera]
Length = 803
Score = 624 bits (1609), Expect = e-176, Method: Compositional matrix adjust.
Identities = 296/472 (62%), Positives = 363/472 (76%), Gaps = 9/472 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLD+QLVLQK+IL R+ ELGM PVLP+FSGNVP AL+ +FPSA IT
Sbjct: 229 MGNLHGWGGPLSQNWLDEQLVLQKQILCRMLELGMTPVLPSFSGNVPEALKKIFPSANIT 288
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W +V ++ RWCCTYLLDA+DPLFI+IG+AFI QQ+KEYG + IYNCDTF+EN+PP
Sbjct: 289 RLGEWNTVDNNTRWCCTYLLDASDPLFIQIGKAFIRQQIKEYGDVTDIYNCDTFNENSPP 348
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ P YISSLGAAIY M GD D+VWLMQGWLF D FW+PPQMKALL+SVP GK+VV
Sbjct: 349 TNDPAYISSLGAAIYKAMSQGDKDSVWLMQGWLFYSDSGFWKPPQMKALLHSVPFGKMVV 408
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFA+ KPIW TS QFYG PYIWCMLHNF GNIEMYGILD+++ GPV+AR S+N+TMVG
Sbjct: 409 LDLFADAKPIWRTSSQFYGTPYIWCMLHNFGGNIEMYGILDAVSSGPVDARISKNSTMVG 468
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIEQNPV Y+LMSEMAF+ EKV + W+ YS RRYG++V ++ AW +LY T+
Sbjct: 469 VGMCMEGIEQNPVAYELMSEMAFRSEKVQLVEWLKTYSYRRYGKAVHHVEAAWEILYRTI 528
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSI-----ISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
YNCTDG D N D +V FPD DPS+ IS + Q + +L ETSS D
Sbjct: 529 YNCTDGIADHNTDFMVNFPDWDPSLNPSSDISKEQHIIQKILTQTGRRKILFQETSS-DL 587
Query: 355 P--HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
P HLWYST EV+ AL LF+ +GNELS S+TYRYDL+DLTRQ L+K N+++L+ + A++
Sbjct: 588 PQSHLWYSTHEVVNALRLFLDAGNELSKSSTYRYDLVDLTRQVLSKLGNQVYLDAVIAFR 647
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
DA S++F++LV+D+D LLA D FLLG WLESAK+LA N + +Q
Sbjct: 648 QKDAKNFHLHSQKFVQLVKDIDTLLASDDNFLLGTWLESAKKLAVNPREMEQ 699
>gi|357112065|ref|XP_003557830.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Brachypodium
distachyon]
Length = 809
Score = 620 bits (1598), Expect = e-175, Method: Compositional matrix adjust.
Identities = 292/468 (62%), Positives = 368/468 (78%), Gaps = 6/468 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPL Q+WLD QL LQKKIL R+ ELGM PVLP+FSGNVP A + +FPSA IT
Sbjct: 240 MGNLHAWGGPLSQNWLDGQLALQKKILSRMTELGMVPVLPSFSGNVPVAFKKLFPSANIT 299
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W +V DPRWCCTY+LD +D LFI++G AFI QQ+KEYG + IYNCDTF+ENTPP
Sbjct: 300 RLGEWNTVDGDPRWCCTYILDPSDALFIDVGHAFIRQQIKEYGDITSIYNCDTFNENTPP 359
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ P YISSLG+AIY M SG+ DAVWLMQGWLF D FW+ PQMKALL+SVP+GK++V
Sbjct: 360 TNEPAYISSLGSAIYEAMSSGNKDAVWLMQGWLFYSDAAFWKEPQMKALLHSVPIGKMIV 419
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFA+VKP+W S QFYGVPYIWCMLHNF GNIEMYGILDSI+ GP++ARTS +TMVG
Sbjct: 420 LDLFADVKPVWKMSSQFYGVPYIWCMLHNFGGNIEMYGILDSISSGPIDARTSYGSTMVG 479
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM+MEGIE NPVV++LMSEM+F+ +KV+V+ W+ YS RRYG+S I+ AW VLYHT+
Sbjct: 480 VGMTMEGIEHNPVVFELMSEMSFRSQKVEVEDWLKSYSYRRYGQSNVKIEKAWGVLYHTI 539
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA---VLKSETSSYDHPH 356
YNCTDG D NRD IV FPD+ PS S K + G P+ ++ L +++ HPH
Sbjct: 540 YNCTDGIADHNRDYIVEFPDMSPSSFSSHFSKQR--GMPIVRKHPRFFLSEVSANLPHPH 597
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
LWYST+E ++ALELF+ +GN+LS S T+RYDL+DLTRQ+L+K AN+++L+ +++Y+ ++
Sbjct: 598 LWYSTNEAVKALELFLNAGNDLSKSLTFRYDLVDLTRQSLSKLANKVYLDAMDSYKNKNS 657
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
G+ +++FLEL+ D+D LLA D FLLGPWLESAK LA +EE+ KQ
Sbjct: 658 SGLNFHTKKFLELIVDIDILLASDDNFLLGPWLESAKSLAMSEEERKQ 705
>gi|356534602|ref|XP_003535842.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Glycine max]
Length = 807
Score = 619 bits (1597), Expect = e-175, Method: Compositional matrix adjust.
Identities = 297/470 (63%), Positives = 358/470 (76%), Gaps = 8/470 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQLVLQK+I+ R+ ELGM PVLP+FSGNVPAAL +FPSAKIT
Sbjct: 230 MGNLHGWGGPLSQNWLDQQLVLQKQIISRMLELGMTPVLPSFSGNVPAALTKIFPSAKIT 289
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W +V DPRWCCTYLLD +DPLF+EIG AFI +Q+KEYG + IYNCDTF+EN+PP
Sbjct: 290 RLGDWNTVDGDPRWCCTYLLDPSDPLFVEIGEAFIRKQIKEYGDVTDIYNCDTFNENSPP 349
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ PEYIS+LGAA+Y G+ GD DAVWLMQGWLF D FW+PPQMKALL+SVP GK++V
Sbjct: 350 TNDPEYISNLGAAVYKGISKGDKDAVWLMQGWLFYSDSSFWKPPQMKALLHSVPFGKMIV 409
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFA+VKPIW S QFYG PYIWCMLHNF GNIEMYG LDSI+ GPV+AR S N+TMVG
Sbjct: 410 LDLFADVKPIWKNSFQFYGTPYIWCMLHNFGGNIEMYGTLDSISSGPVDARVSANSTMVG 469
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIEQNP+VY+LMSEMAF+ +KV V WI Y RRYG+ + ++ AW +LYHT+
Sbjct: 470 VGMCMEGIEQNPIVYELMSEMAFRDKKVKVSEWIKSYCHRRYGKVIHQVESAWEILYHTI 529
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN---YGKPVSKEAVLKSETSSYDHP- 355
YNCTDG D N D IV FPD +PS SVT G N Y P L ET S D P
Sbjct: 530 YNCTDGIADHNHDFIVMFPDWNPSTNSVT-GTSNNQKIYLLPPGNRRYLFQETLS-DMPQ 587
Query: 356 -HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
HLWY + +VI+AL+LF+A G L+ S TYRYDL+DLTRQ L+K AN+++ + +YQ
Sbjct: 588 AHLWYPSDDVIKALQLFLAGGKNLAGSLTYRYDLVDLTRQVLSKLANQVYHKAVTSYQKK 647
Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
+ + S +FL+L++D+D LLA D FLLG WLESAK+LA N + KQ
Sbjct: 648 NIEALQFHSNKFLQLIKDIDVLLASDDNFLLGTWLESAKKLAVNPSEIKQ 697
>gi|297807393|ref|XP_002871580.1| alpha-N-acetylglucosaminidase family [Arabidopsis lyrata subsp.
lyrata]
gi|297317417|gb|EFH47839.1| alpha-N-acetylglucosaminidase family [Arabidopsis lyrata subsp.
lyrata]
Length = 806
Score = 609 bits (1571), Expect = e-172, Method: Compositional matrix adjust.
Identities = 291/472 (61%), Positives = 363/472 (76%), Gaps = 9/472 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPL ++WL+ QL+LQK+IL ++ +LGM PVLP+FSGNVP+AL+ ++P A IT
Sbjct: 231 MGNLHTWGGPLSKNWLNDQLILQKQILSQMLKLGMTPVLPSFSGNVPSALRKIYPGANIT 290
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L NW +V D RWCCTYLL+ +DPLFI+IG AFI+QQ +EYG ++IYNCDTF+ENTPP
Sbjct: 291 RLDNWNTVDGDSRWCCTYLLNPSDPLFIDIGEAFIKQQPEEYGEITNIYNCDTFNENTPP 350
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PEYISSLGAA+Y M G+ +AVWLMQGWLFS D FW+PPQMK LL+SVP GK++V
Sbjct: 351 TSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSSDSKFWKPPQMKVLLHSVPFGKMIV 410
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL+AEVKPIW+TS QFYG PYIWCMLHNF GNIEMYG LDSI+ GPV+AR S+N+TMVG
Sbjct: 411 LDLYAEVKPIWNTSAQFYGTPYIWCMLHNFGGNIEMYGALDSISSGPVDARVSKNSTMVG 470
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIEQNPVVY+L+SEMAF+ EKVDV+ W+ Y+ RRY + I+ AW +LYHTV
Sbjct: 471 VGMCMEGIEQNPVVYELISEMAFRDEKVDVQKWLKSYARRRYMKENHQIEAAWEILYHTV 530
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN-----YGKPVSKEAVLKSETSSYDH 354
YNCTDG D N D IV PD DPS E K+ + G +K VL + SS D
Sbjct: 531 YNCTDGIADHNTDFIVKLPDWDPSSSVQDESKHTDSYMISTGPYETKRRVLFQDKSS-DL 589
Query: 355 P--HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
P HLWYST EVI+AL+LF+ +G+ELS S TYRYD++DLTRQ L+K AN++++ + A+
Sbjct: 590 PKAHLWYSTKEVIQALKLFLEAGDELSRSLTYRYDMVDLTRQVLSKLANQVYIEAVTAFV 649
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
D + QLS +FLEL++D+D LLA D FLLG WLESAK+LA+N ++ KQ
Sbjct: 650 KKDIGSLGQLSEKFLELIKDIDVLLASDDNFLLGTWLESAKKLARNGDERKQ 701
>gi|224106113|ref|XP_002314048.1| predicted protein [Populus trichocarpa]
gi|222850456|gb|EEE88003.1| predicted protein [Populus trichocarpa]
Length = 806
Score = 606 bits (1563), Expect = e-171, Method: Compositional matrix adjust.
Identities = 282/462 (61%), Positives = 354/462 (76%), Gaps = 6/462 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQL LQK+IL R+ ELGM PVLP+FSGNVPAAL+ +FPSA IT
Sbjct: 233 MGNLHGWGGPLSQNWLDQQLCLQKQILSRMLELGMTPVLPSFSGNVPAALKKIFPSANIT 292
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W +V +PRWCCTYLL+ +DPLF+EIG AFI QQ+KEYG + IYNCDTF+EN+PP
Sbjct: 293 RLGDWNTVDKNPRWCCTYLLNPSDPLFVEIGEAFIRQQVKEYGDVTDIYNCDTFNENSPP 352
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
P YISSLGAA+Y M GD DAVWLMQGWLF D FW+PPQM+ALL+SVP GK++V
Sbjct: 353 TSDPAYISSLGAAVYKAMSRGDKDAVWLMQGWLFYSDSAFWKPPQMQALLHSVPFGKMIV 412
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE KPIW S QFYG PY+WC+LHNF GNIEMYGILD+I+ GPV+AR EN+TMVG
Sbjct: 413 LDLFAEAKPIWKNSSQFYGTPYVWCLLHNFGGNIEMYGILDAISSGPVDARIIENSTMVG 472
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIE NPVVY+LMSEMAF+ K V W+ YS RRYG++V + AW++LYHT+
Sbjct: 473 VGMCMEGIEHNPVVYELMSEMAFRSGKPQVLEWLKTYSRRRYGKAVRQVVAAWDILYHTI 532
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV-----SKEAVLKSETSSYDH 354
YNCTDG D N D IV FPD DPS+ S + Q+ + + ++ + + +S +
Sbjct: 533 YNCTDGIADHNTDFIVKFPDWDPSLHSGSNISEQDNMRILLTSSGTRRFLFQETSSDFPE 592
Query: 355 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
HLWYST EVI+AL LF+ +GN+L+ S TYRYDL+DLTRQ L+K AN+++ + + A++
Sbjct: 593 AHLWYSTQEVIQALWLFLDAGNDLAGSPTYRYDLVDLTRQVLSKLANQVYRDAMIAFRRK 652
Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
DA + ++FL++++D+D LLA D FLLG WLESAK+LA
Sbjct: 653 DARALNLHGQKFLQIIKDIDVLLASDDNFLLGTWLESAKKLA 694
>gi|15240689|ref|NP_196873.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
gi|9758035|dbj|BAB08696.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
gi|19423948|gb|AAL87291.1| putative alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
gi|21436231|gb|AAM51254.1| putative alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
gi|332004545|gb|AED91928.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
Length = 806
Score = 605 bits (1560), Expect = e-170, Method: Compositional matrix adjust.
Identities = 286/472 (60%), Positives = 357/472 (75%), Gaps = 9/472 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPL ++WLD QL+LQK+IL R+ + GM PVLP+FSGNVP+AL+ ++P A IT
Sbjct: 231 MGNLHAWGGPLSKNWLDDQLLLQKQILSRMLKFGMTPVLPSFSGNVPSALRKIYPEANIT 290
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L NW +V D RWCCTYLL+ +DPLFIEIG AFI+QQ +EYG ++IYNCDTF+ENTPP
Sbjct: 291 RLDNWNTVDGDSRWCCTYLLNPSDPLFIEIGEAFIKQQTEEYGEITNIYNCDTFNENTPP 350
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PEYISSLGAA+Y M G+ +AVWLMQGWLFS D FW+PPQ+KALL+SVP GK++V
Sbjct: 351 TSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSSDSKFWKPPQLKALLHSVPFGKMIV 410
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL+AEVKPIW+ S QFYG PYIWCMLHNF GNIEMYG LDSI+ GPV+AR S+N+TMVG
Sbjct: 411 LDLYAEVKPIWNKSAQFYGTPYIWCMLHNFGGNIEMYGALDSISSGPVDARVSKNSTMVG 470
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIEQNPVVY+L SEMAF+ EKVDV+ W+ Y+ RRY + I+ AW +LYHTV
Sbjct: 471 VGMCMEGIEQNPVVYELTSEMAFRDEKVDVQKWLKSYARRRYMKENHQIEAAWEILYHTV 530
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS-------KEAVLKSETSSY 352
YNCTDG D N D IV PD DPS SV + Q +S + + + +T+
Sbjct: 531 YNCTDGIADHNTDFIVKLPDWDPS-SSVQDDLKQKDSYMISTGPYETKRRVLFQDKTADL 589
Query: 353 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
HLWYST EVI+AL+LF+ +G++LS S TYRYD++DLTRQ L+K AN+++ + A+
Sbjct: 590 PKAHLWYSTKEVIQALKLFLEAGDDLSRSLTYRYDMVDLTRQVLSKLANQVYTEAVTAFV 649
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
D + QLS +FLEL++DMD LLA D LLG WLESAK+LA+N ++ KQ
Sbjct: 650 KKDIGSLGQLSEKFLELIKDMDVLLASDDNCLLGTWLESAKKLAKNGDERKQ 701
>gi|357458267|ref|XP_003599414.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
gi|355488462|gb|AES69665.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
Length = 832
Score = 598 bits (1541), Expect = e-168, Method: Compositional matrix adjust.
Identities = 287/496 (57%), Positives = 354/496 (71%), Gaps = 33/496 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQLVLQK+I+ R+ ELGM PVLP+FSGNVPAAL +FPSAKIT
Sbjct: 234 MGNLHGWGGPLSQNWLDQQLVLQKQIISRMLELGMTPVLPSFSGNVPAALTKIFPSAKIT 293
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK-------------------- 100
+LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +Q+K
Sbjct: 294 RLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRKQIKATETIHQESEDLGSLIIMDR 353
Query: 101 ------EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 154
EYG + IYNCDTF+EN+PP P YIS+LGAA+Y G+ GD DAVWLMQGWLF
Sbjct: 354 AVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLGAAVYQGISKGDKDAVWLMQGWLF 413
Query: 155 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 213
D FW+PPQMKALL SVP GK++VLDLFA+VKPIW TS QFYG PYIWCMLHNF GNI
Sbjct: 414 YSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIWKTSFQFYGTPYIWCMLHNFGGNI 473
Query: 214 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 273
EMYG+LD+IA GPV+AR SEN+TMVGVGM MEGIE NP+VY+LMSEMAF+ EKV + W+
Sbjct: 474 EMYGVLDAIASGPVDARVSENSTMVGVGMCMEGIEHNPIVYELMSEMAFRDEKVKINEWL 533
Query: 274 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 333
YS RRYG+++ + AW +LYHT+YN TDG D N D IV PD DPS +V G
Sbjct: 534 KSYSHRRYGKAIHEVDAAWEILYHTIYNSTDGIADHNHDYIVMLPDWDPS-AAVKSGMSN 592
Query: 334 NYGK-----PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 388
+ K P ++ + + + HLWY +VI+AL+LF+A G L S TYRYDL
Sbjct: 593 HQKKIYFLPPGNRRYLFQQTPAGMPQAHLWYPPEDVIKALQLFLAGGKNLKGSLTYRYDL 652
Query: 389 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 448
+DLTRQ L+K+AN++++ I ++Q + + S FLEL++D+D LLA D FLLG W
Sbjct: 653 VDLTRQVLSKFANQVYIKAITSFQKKNIDALQLNSHMFLELIKDIDLLLASDDNFLLGTW 712
Query: 449 LESAKQLAQNEEQEKQ 464
L+SAK+LA N + KQ
Sbjct: 713 LQSAKKLAVNPSELKQ 728
>gi|357458271|ref|XP_003599416.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
gi|355488464|gb|AES69667.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
Length = 807
Score = 592 bits (1525), Expect = e-166, Method: Compositional matrix adjust.
Identities = 285/491 (58%), Positives = 346/491 (70%), Gaps = 48/491 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQLVLQK+I+ R+ ELGM PVLP+FSGNVPAAL +FPSAKIT
Sbjct: 234 MGNLHGWGGPLSQNWLDQQLVLQKQIISRMLELGMTPVLPSFSGNVPAALTKIFPSAKIT 293
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK-------------------- 100
+LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +Q+K
Sbjct: 294 RLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRKQIKATETIHQESEDLGSLIIMDR 353
Query: 101 ------EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 154
EYG + IYNCDTF+EN+PP P YIS+LGAA+Y G+ GD DAVWLMQGWLF
Sbjct: 354 AVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLGAAVYQGISKGDKDAVWLMQGWLF 413
Query: 155 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 213
D FW+PPQMKALL SVP GK++VLDLFA+VKPIW TS QFYG PYIWCMLHNF GNI
Sbjct: 414 YSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIWKTSFQFYGTPYIWCMLHNFGGNI 473
Query: 214 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 273
EMYG+LD+IA GPV+AR SEN+TMVGVGM MEGIE NP+VY+LMSEMAF+ EKV + W+
Sbjct: 474 EMYGVLDAIASGPVDARVSENSTMVGVGMCMEGIEHNPIVYELMSEMAFRDEKVKINEWL 533
Query: 274 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 333
YS RRYG+++ + AW +LYHT+YN TDG D N D IV PD DPS + G Q
Sbjct: 534 KSYSHRRYGKAIHEVDAAWEILYHTIYNSTDGIADHNHDYIVMLPDWDPSAAVKSAGMPQ 593
Query: 334 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 393
HLWY +VI+AL+LF+A G L S TYRYDL+DLTR
Sbjct: 594 ---------------------AHLWYPPEDVIKALQLFLAGGKNLKGSLTYRYDLVDLTR 632
Query: 394 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
Q L+K+AN++++ I ++Q + + S FLEL++D+D LLA D FLLG WL+SAK
Sbjct: 633 QVLSKFANQVYIKAITSFQKKNIDALQLNSHMFLELIKDIDLLLASDDNFLLGTWLQSAK 692
Query: 454 QLAQNEEQEKQ 464
+LA N + KQ
Sbjct: 693 KLAVNPSELKQ 703
>gi|449436325|ref|XP_004135943.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis sativus]
Length = 774
Score = 577 bits (1486), Expect = e-162, Method: Compositional matrix adjust.
Identities = 278/465 (59%), Positives = 339/465 (72%), Gaps = 24/465 (5%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL ++WLDQQL LQK+IL R+ ELGM PVLP+FSGNVPA L +FPSA IT
Sbjct: 229 MGNLHGWGGPLSKNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAGLVEIFPSANIT 288
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNW S+ +DP CCTYLL+ +DPLF++IG AFI QQ+KEYG ++IY+CDTF+ENTPP
Sbjct: 289 KLGNWNSIDADPSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTNIYSCDTFNENTPP 348
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ YISSLGA++Y M D DAVWLMQGWLF D FW+P QMKALL+SVP GK++V
Sbjct: 349 TNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSDFWKPDQMKALLHSVPFGKMIV 408
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFA+VKPIW +S QFYG PY+WCMLHNF GNIEMYGILD+I+ GPV+A SEN+TMVG
Sbjct: 409 LDLFADVKPIWKSSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVG 468
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIE NPVVY+LMSEMAF+ +KV V+ W+ YS RYG++ + AWN+LYHT+
Sbjct: 469 VGMCMEGIEHNPVVYELMSEMAFRSKKVQVQEWLKTYSRCRYGKADHYVDAAWNILYHTI 528
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNCTDG + N D IV PD DPS S + K KP PHLWY
Sbjct: 529 YNCTDGIANHNTDFIVKLPDWDPS--STFDLK-----KP----------------PHLWY 565
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
ST EVI AL+L + + L S TYRYDL+DLTRQ L K ANE +L + A++ +
Sbjct: 566 STQEVINALQLLVNVDDNLVHSATYRYDLVDLTRQVLGKLANEEYLKAVTAFRRQNVKAQ 625
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
S+RF++L+ D+D LLA + FLLG WLESAK+LA N + KQ
Sbjct: 626 NLHSKRFIQLIRDIDKLLASNSNFLLGTWLESAKKLATNPAEMKQ 670
>gi|449489156|ref|XP_004158231.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
[Cucumis sativus]
Length = 567
Score = 567 bits (1461), Expect = e-159, Method: Compositional matrix adjust.
Identities = 274/462 (59%), Positives = 335/462 (72%), Gaps = 24/462 (5%)
Query: 4 LHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLG 63
L WGGPL ++WLDQQL LQK+IL R+ ELGM PVLP+FSGNVPA L +FPSA IT+LG
Sbjct: 25 LKEWGGPLSKNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAGLVEIFPSANITKLG 84
Query: 64 NWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDS 123
NW S+ +DP CCTYLL+ +DPLF++IG AFI QQ+KEYG ++IY+CDTF+ENTPP +
Sbjct: 85 NWNSIDADPSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTNIYSCDTFNENTPPTND 144
Query: 124 PEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDL 182
YISSLGA++Y M D DAVWLMQGWLF D FW+P QMKALL+SVP GK++VLDL
Sbjct: 145 TSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSDFWKPDQMKALLHSVPFGKMIVLDL 204
Query: 183 FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGM 242
FA+VKPIW +S QFYG PY+WCMLHNF GNIEMYGILD+I+ GPV+A SEN+TMVGVGM
Sbjct: 205 FADVKPIWKSSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGM 264
Query: 243 SMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNC 302
MEGIE NPVVY+LMSEMAF+ +KV V+ W+ YS RYG++ + AWN+LYHT+YNC
Sbjct: 265 CMEGIEHNPVVYELMSEMAFRXQKVQVQEWLKTYSRCRYGKADHYVDAAWNILYHTIYNC 324
Query: 303 TDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTS 362
TDG + N D IV PD DPS S + K KP PHLWYST
Sbjct: 325 TDGIANHNTDFIVKLPDWDPS--STFDLK-----KP----------------PHLWYSTQ 361
Query: 363 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 422
EVI AL+L + + L S TYRYDL+DLTRQ L K ANE +L + A++ +
Sbjct: 362 EVINALQLLVNVDDNLVHSATYRYDLVDLTRQVLGKLANEEYLKAVTAFRRQNVKAQNLH 421
Query: 423 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
S+RF++L+ D+D LLA + FLLG WLESAK+LA N + KQ
Sbjct: 422 SKRFIQLIRDIDKLLASNSNFLLGTWLESAKKLATNPAEMKQ 463
>gi|242035709|ref|XP_002465249.1| hypothetical protein SORBIDRAFT_01g034960 [Sorghum bicolor]
gi|241919103|gb|EER92247.1| hypothetical protein SORBIDRAFT_01g034960 [Sorghum bicolor]
Length = 777
Score = 557 bits (1436), Expect = e-156, Method: Compositional matrix adjust.
Identities = 273/466 (58%), Positives = 337/466 (72%), Gaps = 39/466 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQL LQKK+L R+ ELGM PVLP+FSGNVPA +FPSA IT
Sbjct: 244 MGNLHGWGGPLSQNWLDQQLALQKKVLSRMIELGMVPVLPSFSGNVPAVFAKLFPSANIT 303
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG+W +V ++P+WCCTYLLD +D LFI++G+AFI QQ+KEYG ++IYNCDTF+ENTPP
Sbjct: 304 LLGDWNTVDANPKWCCTYLLDPSDSLFIDVGQAFIRQQIKEYGDVTNIYNCDTFNENTPP 363
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
D P YISSLG+AIY M G+ +AVWLMQGWLF D FW+ PQMKALL+SVP+GK++V
Sbjct: 364 TDEPAYISSLGSAIYEAMSRGNKNAVWLMQGWLFYSDAAFWKEPQMKALLHSVPIGKMIV 423
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFA+VKPIW S QFYGVPYIWCMLHNF GNIEMYG+LDSI+ GP++ARTS N+TM+G
Sbjct: 424 LDLFADVKPIWKMSSQFYGVPYIWCMLHNFGGNIEMYGVLDSISSGPIDARTSYNSTMIG 483
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIE NPVVY+LMSEMAF ++KV+V+
Sbjct: 484 VGMCMEGIEHNPVVYELMSEMAFHNKKVEVE----------------------------- 514
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS-SYDHPHLW 358
D N+D IV FPD+ PS IS K + + SE S S HPHLW
Sbjct: 515 --------DHNKDYIVEFPDISPSSISSQLSKRRGMSIMRNHRRFFLSEVSGSLPHPHLW 566
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
YST E I+ALELF+ +G+ S S TYRYDL+DLTRQ L+K ANE++L+ + +YQ D++G
Sbjct: 567 YSTKEAIKALELFLDAGSTFSKSLTYRYDLVDLTRQCLSKLANEVYLDAMSSYQKKDSNG 626
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
+ +R+FLE++ D+D LLA D FLLGPWLESAK LA E++ +Q
Sbjct: 627 LNSHTRKFLEIIMDIDTLLAADDNFLLGPWLESAKSLAITEKERQQ 672
>gi|4160292|emb|CAA77084.1| alpha-N-acetylglucosaminidase [Nicotiana tabacum]
Length = 811
Score = 557 bits (1435), Expect = e-156, Method: Compositional matrix adjust.
Identities = 271/477 (56%), Positives = 347/477 (72%), Gaps = 13/477 (2%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPL Q+WL+ QL LQK+IL R+ ELGM PVLP+FSGNVPAAL+ +FPSA IT
Sbjct: 231 MGNLHAWGGPLSQNWLNIQLALQKQILSRMRELGMTPVLPSFSGNVPAALKKIFPSANIT 290
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W +V DPRWCCT+LL +DPLFIEIG AFI +Q++EYG + IYNCDTF+ENTPP
Sbjct: 291 RLGDWNTVNGDPRWCCTFLLAPSDPLFIEIGEAFIRKQIEEYGDITDIYNCDTFNENTPP 350
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWL-MQGWLFSYD-PFWRPPQMKALLNSVPLGKLV 178
D P YI Q + WL + WLF D +W+ PQM+ALL+SVP GK++
Sbjct: 351 TDDPTYIHLSALLCTKQCQKQITMRCWLNARVWLFYSDSKYWKSPQMEALLHSVPRGKMI 410
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
VLDLFA+VKPIW +S QFYG PYIWCMLHNF GNIEMYG+LD++A GP++ARTSEN+TMV
Sbjct: 411 VLDLFADVKPIWKSSSQFYGTPYIWCMLHNFGGNIEMYGVLDAVASGPIDARTSENSTMV 470
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GVGM MEGIE NPVVY+LMSEMAF+ + ++ W+ YS RRYG+ IQ AW++LYHT
Sbjct: 471 GVGMCMEGIEHNPVVYELMSEMAFREDNFQLQGWLKSYSHRRYGKVNDQIQAAWDILYHT 530
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPS-----IISVTEGKYQNYGKPVS-----KEAVLKSE 348
+YNCTDG D N+D IV FPD DPS IS T+ QN + ++ + + +
Sbjct: 531 IYNCTDGIADHNKDYIVEFPDWDPSGKTGTDISGTDSSSQNRMQKLAGFQWNRRFLFFEK 590
Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
+SS P LWYST +V +AL+LFI + +LS S TYRYDL+DL+RQ+L+K AN+++L+ I
Sbjct: 591 SSSLPKPRLWYSTEDVFQALQLFIDALKKLSGSLTYRYDLVDLSRQSLSKLANQVYLDAI 650
Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ-LAQNEEQEKQ 464
A++ DA + Q S +FL L++D+D LLA D FLLG WLE+ Q LA N +++KQ
Sbjct: 651 SAFRREDAKPLNQHSPKFLPLLQDIDRLLAADDNFLLGTWLENCPQNLAMNSDEKKQ 707
>gi|168060822|ref|XP_001782392.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666123|gb|EDQ52786.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 801
Score = 551 bits (1419), Expect = e-154, Method: Compositional matrix adjust.
Identities = 259/464 (55%), Positives = 331/464 (71%), Gaps = 17/464 (3%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL WGGPLPQ WLDQQL LQ KIL R+ ELGM PVLPAF+GNVPAA+ +PSA++T
Sbjct: 236 MGNLKRWGGPLPQKWLDQQLQLQIKILARMRELGMTPVLPAFAGNVPAAITKKYPSARVT 295
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W +V D R+CCT+LLD DPLF++IG+AFI QQ+KEYG T HIYNCDTF+EN PP
Sbjct: 296 RLGEWNTVNGDTRYCCTFLLDPKDPLFVDIGKAFILQQIKEYGGTQHIYNCDTFNENQPP 355
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P YIS+LG+ +Y M + D DA+WLMQ +YD FW+PPQMKALL+SVP+G++VVL
Sbjct: 356 TDDPSYISALGSIVYEAMSAADQDAIWLMQ----AYDKFWKPPQMKALLHSVPVGRMVVL 411
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DLFA+VKP+WS S FYGVPYIWCMLHNF GN+EMYG LD +A P++A TS N+TMVGV
Sbjct: 412 DLFADVKPMWSRSDHFYGVPYIWCMLHNFGGNVEMYGRLDVVATAPIQAVTSSNSTMVGV 471
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GM MEGIEQNPVVYDLM+EMAF + V V+ WI +Y+ RRYG + AW +L+ ++Y
Sbjct: 472 GMCMEGIEQNPVVYDLMAEMAFHNATVVVEDWIEEYARRRYGELTAGARIAWKMLHESIY 531
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP-HLWY 359
NC+DG D N DVIV FPD+DP Q+ G+ + HP H+WY
Sbjct: 532 NCSDGIADHNGDVIVEFPDIDPKRSLFQIRPRQSLGQQI------------LGHPQHIWY 579
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
S + AL+ ++S + L S YRYD++DLTRQ L+K AN+L +++ +++ + +
Sbjct: 580 SPQDAAVALQYLLSSADALGLSKPYRYDVVDLTRQVLSKLANQLHSQVLDQFRMFNVEKM 639
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+S R LEL+ DMD LL + FLLG WLESAK LA ++E+ K
Sbjct: 640 DNISSRLLELLSDMDDLLGASEEFLLGTWLESAKDLATSDEERK 683
>gi|302786446|ref|XP_002974994.1| hypothetical protein SELMODRAFT_102402 [Selaginella moellendorffii]
gi|300157153|gb|EFJ23779.1| hypothetical protein SELMODRAFT_102402 [Selaginella moellendorffii]
Length = 761
Score = 550 bits (1418), Expect = e-154, Method: Compositional matrix adjust.
Identities = 259/463 (55%), Positives = 335/463 (72%), Gaps = 4/463 (0%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLP+ WL+ QL+LQKKIL + LGM VLPAFSGNVP AL+ ++PSA IT
Sbjct: 192 MGNLHGWGGPLPEKWLELQLILQKKILHHMRSLGMIAVLPAFSGNVPRALKILYPSANIT 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L +W +V +P+WCCTYLL DPLFI+IG+AFIEQQ+KEYG T H+YNCDTF+EN PP
Sbjct: 252 RLPDWNTVDGNPQWCCTYLLQPMDPLFIQIGKAFIEQQVKEYGSTQHVYNCDTFNENLPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
D P YIS+L A++Y M D A+WLMQGWLFS D FW+PPQMKALL++VP GK++V
Sbjct: 312 TDDPSYISALAASVYGAMIVADKQAIWLMQGWLFSSDAQFWKPPQMKALLHAVPFGKMIV 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAEV+PIWS S FYGVPYIWCMLHNF GN EMYG LD ++ GPV+A+TS N+TM+G
Sbjct: 372 LDLFAEVRPIWSKSSHFYGVPYIWCMLHNFGGNHEMYGRLDVVSSGPVDAKTSANSTMIG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIEQNPVVY+LM+EMAF+ + +K W+N YS RRYG++VP +AW +L HT+
Sbjct: 432 VGMCMEGIEQNPVVYELMAEMAFRSTRNALKDWVNDYSTRRYGKAVPEALEAWQILSHTL 491
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNC+DG D N DVIV FPD++ S ++ T +Y +L +S+ HLWY
Sbjct: 492 YNCSDGLQDHNTDVIVKFPDLNASSLT-TLSRYLAEEAGTQTRRLLTEGLTSFG--HLWY 548
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+E AL + + + LS TYRYDL+DLTRQ L K AN++ L + ++ D +
Sbjct: 549 RPTEAKVALSYMLNASSSLSNVATYRYDLVDLTRQVLMKLANQIHLQALVSFVKGDLEEL 608
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ + +++D + LL ++GFLLGPWLESAK+L N +++
Sbjct: 609 TKNCDILIGIIKDSELLLRSNNGFLLGPWLESAKKLGTNSDEK 651
>gi|302791289|ref|XP_002977411.1| hypothetical protein SELMODRAFT_107285 [Selaginella moellendorffii]
gi|300154781|gb|EFJ21415.1| hypothetical protein SELMODRAFT_107285 [Selaginella moellendorffii]
Length = 761
Score = 549 bits (1414), Expect = e-153, Method: Compositional matrix adjust.
Identities = 257/462 (55%), Positives = 333/462 (72%), Gaps = 4/462 (0%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLP+ WL+ QL+LQKKIL + LGM VLPAFSGNVP AL+ ++PSA IT
Sbjct: 192 MGNLHGWGGPLPEKWLELQLILQKKILHHMRSLGMIAVLPAFSGNVPRALKILYPSANIT 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L +W +V +P+WCCTYLL DPLFI+IG+AFIEQQ+KEYG T H+YNCDTF+EN PP
Sbjct: 252 RLPDWNTVDGNPQWCCTYLLQPMDPLFIQIGKAFIEQQVKEYGSTQHVYNCDTFNENLPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
D P YIS+L A++Y M D A+WLMQGWLFS D FW+PPQMKALL++VP GK++V
Sbjct: 312 TDDPSYISALAASVYGAMIVADKQAIWLMQGWLFSSDAQFWKPPQMKALLHAVPFGKMIV 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAEV+PIWS S FYGVPYIWCMLHNF GN EMYG LD ++ GPV+A+TS N+TM+G
Sbjct: 372 LDLFAEVRPIWSKSSHFYGVPYIWCMLHNFGGNHEMYGRLDVVSSGPVDAKTSANSTMIG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIEQNPVVY+LM+EMAF+ + +K W++ YS RRYG++VP +AW +L HT+
Sbjct: 432 VGMCMEGIEQNPVVYELMAEMAFRSTRNALKDWVDDYSTRRYGKAVPEALEAWQILSHTL 491
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNC+DG D N DVIV FPD++ S ++ G ++ + + TS HLWY
Sbjct: 492 YNCSDGLQDHNTDVIVKFPDLNASSLTTLSRYLAEEGGTQTRRLLTEGLTS---FGHLWY 548
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+E AL + + + LS TYRYDL+DLTRQ L K AN++ L + ++ D +
Sbjct: 549 RPTEAKVALSYMLNASSSLSNVATYRYDLVDLTRQVLMKLANQIHLQALVSFVKGDLEEL 608
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
+ + +++D + LL ++GFLLGPWLESAK+L N ++
Sbjct: 609 TKNCDILIGIIKDSELLLRSNNGFLLGPWLESAKKLGTNSDE 650
>gi|357458269|ref|XP_003599415.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
gi|355488463|gb|AES69666.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
Length = 539
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 199/296 (67%), Positives = 231/296 (78%), Gaps = 27/296 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQLVLQK+I+ R+ ELGM PVLP+FSGNVPAAL +FPSAKIT
Sbjct: 234 MGNLHGWGGPLSQNWLDQQLVLQKQIISRMLELGMTPVLPSFSGNVPAALTKIFPSAKIT 293
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK-------------------- 100
+LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +Q+K
Sbjct: 294 RLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRKQIKATETIHQESEDLGSLIIMDR 353
Query: 101 ------EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 154
EYG + IYNCDTF+EN+PP P YIS+LGAA+Y G+ GD DAVWLMQGWLF
Sbjct: 354 AVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLGAAVYQGISKGDKDAVWLMQGWLF 413
Query: 155 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 213
D FW+PPQMKALL SVP GK++VLDLFA+VKPIW TS QFYG PYIWCMLHNF GNI
Sbjct: 414 YSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIWKTSFQFYGTPYIWCMLHNFGGNI 473
Query: 214 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDV 269
EMYG+LD+IA GPV+AR SEN+TMVGVGM MEGIE NP+VY+LMSEMAF+ EKV +
Sbjct: 474 EMYGVLDAIASGPVDARVSENSTMVGVGMCMEGIEHNPIVYELMSEMAFRDEKVKI 529
>gi|156399499|ref|XP_001638539.1| predicted protein [Nematostella vectensis]
gi|156225660|gb|EDO46476.1| predicted protein [Nematostella vectensis]
Length = 675
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 203/459 (44%), Positives = 281/459 (61%), Gaps = 37/459 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+HGWGGPLP +W +L LQ KIL + GM PVLP F+G+VPA L ++P A ++
Sbjct: 142 MGNMHGWGGPLPSTWYGMKLNLQHKILAAMRNFGMTPVLPGFAGHVPAGLLRLYPKANVS 201
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W + S +CCTYLL+ +DPLF +IG AFI++Q EYG T+HIYN DTF+E P
Sbjct: 202 KLGDWGNFNST--YCCTYLLEPSDPLFQKIGTAFIKEQTAEYG-TNHIYNADTFNEMRPR 258
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
P Y+ + +A+Y GM GD DAVWLMQGWLF + FW+P Q+KALL+ VP G ++VL
Sbjct: 259 SSDPTYLGAASSAVYRGMAGGDPDAVWLMQGWLFVDEGFWKPDQIKALLHGVPQGFMIVL 318
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AE PIWS ++ FYG P+IWCML NF GNI ++G + S++ GP +A S N+TM+G
Sbjct: 319 DLWAENSPIWSRTQSFYGTPFIWCMLLNFGGNIGLFGNIKSVSTGPPKAFQSFNSTMIGT 378
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHE---KVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G++MEGIEQN ++++LM+EM ++ E VD+ WI Y++RRYG + PAI AW +L
Sbjct: 379 GLTMEGIEQNDMMFELMNEMGYRLEPLNPVDLDNWIKDYALRRYGGTNPAIIQAWRLLIR 438
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
+VY C D + V P +D + P+L
Sbjct: 439 SVYQCNGYCADHIHSIFVWKPSLD-------------------------------NKPNL 467
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY +V A + ++ E T+RYDL+D+TRQAL ++ ++I AY+ A
Sbjct: 468 WYDPEDVFNAWDELRSTAAEFMHVETFRYDLVDVTRQALHLRVIPIYNDLISAYKNRSAL 527
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
V R LE+ +D+D LL + FLLG WL SAK L
Sbjct: 528 NVIHFGSRLLEMFDDLDSLLQTNRNFLLGRWLNSAKALG 566
>gi|384247107|gb|EIE20595.1| hypothetical protein COCSUDRAFT_37819 [Coccomyxa subellipsoidea
C-169]
Length = 762
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 210/467 (44%), Positives = 296/467 (63%), Gaps = 30/467 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GGPLPQS++D Q LQ+KI+ R+ ELGM+PV PAF+G VP AL P+A+I+
Sbjct: 199 MGNLRGYGGPLPQSYIDDQAELQRKIVRRMRELGMSPVFPAFAGFVPGALARERPAARIS 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTS-HIYNCDTFDENTP 119
+ NW S + R+CC +LLD +PLF EIG AF++ +EYG Y+ DTF+E TP
Sbjct: 259 RSDNWCSFPA--RYCCVHLLDPLEPLFQEIGSAFVKVLREEYGSDEVGFYSADTFNEMTP 316
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD--PFWRPPQMKALLNSVPLGKL 177
P P Y++S+ +AIY+ M + D A WLMQ WLF YD FW+PPQ++AL++ VP L
Sbjct: 317 PSSDPAYLTSVTSAIYNAMAAADPSARWLMQAWLF-YDNQKFWQPPQIQALVSGVPRDAL 375
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL+AEV P+W ++K F+G P+I+CMLHNF GNIEMYG L+++A GP E + +
Sbjct: 376 IMLDLYAEVFPLWKSTKSFFGAPFIYCMLHNFGGNIEMYGALEAVARGPAEGQIDGVAGL 435
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQ-DAWNVLY 296
+G+GM EGIEQNPVVY+LMSE AF+ + V+V+ WI Y+ RRYG S P AW++L
Sbjct: 436 IGIGMCPEGIEQNPVVYELMSEWAFRRQPVEVEGWIEAYARRRYGNSTPPTALVAWDLLL 495
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
+VYN TDG TD +RD+ + P + P+ + + K PH
Sbjct: 496 RSVYNATDGHTDHSRDIPTSRPGLSPAEVGLWGLK-----------------------PH 532
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
LWY+ +V+ A L + S EL YRYDL+D+ RQ ++K A +++ + EAY +
Sbjct: 533 LWYNEQQVVDAWGLLLRSAGELQQVEGYRYDLVDVGRQVISKRATDIWKAVAEAYVDGRS 592
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
V + R L+L++D++ LLA + GFLLGP LE A E + +
Sbjct: 593 IVVRREGARLLQLLDDLEELLATNRGFLLGPKLEEASSAGHTEAEAR 639
>gi|326521470|dbj|BAK00311.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 428
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 193/317 (60%), Positives = 237/317 (74%), Gaps = 1/317 (0%)
Query: 149 MQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLH 207
+QGWLF D FW+ QMKALL+SVP+GK++VLDLFA+VKPIW TS QFYGVPYIWCMLH
Sbjct: 8 VQGWLFYSDAVFWKESQMKALLHSVPIGKMMVLDLFADVKPIWQTSSQFYGVPYIWCMLH 67
Query: 208 NFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 267
NF GNIEMYG+LDSI+ GPV+ARTS N+TMVGVGM MEGIE NPVVY+LMSEMAF+ +KV
Sbjct: 68 NFGGNIEMYGVLDSISSGPVDARTSYNSTMVGVGMCMEGIEHNPVVYELMSEMAFRSQKV 127
Query: 268 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 327
V+ W+ YS RRYG+S IQ AW +LYHT+YNCTDG D N+D IV FPD+ PS S
Sbjct: 128 KVEDWLKTYSHRRYGQSNVEIQKAWGILYHTIYNCTDGIADHNKDYIVEFPDMSPSSFSS 187
Query: 328 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 387
K L ++S PHLWYST E I++LELF+ +GN+LS S TYRYD
Sbjct: 188 QYSKRSISLARKHPRFFLSEVSASLPQPHLWYSTEEAIKSLELFLNAGNDLSKSLTYRYD 247
Query: 388 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 447
L+DLTRQ+L+K AN+++ + I +YQ D+ G+ ++ FLEL+ D+D LLA D FLLGP
Sbjct: 248 LVDLTRQSLSKLANKVYHDAISSYQKRDSSGLNFHTKEFLELIVDIDTLLASDDNFLLGP 307
Query: 448 WLESAKQLAQNEEQEKQ 464
WLESAK LA E++ KQ
Sbjct: 308 WLESAKSLAMTEDERKQ 324
>gi|14861378|gb|AAK73654.1| lysosomal alpha-N-acetyl glucosaminidase [Dromaius novaehollandiae]
Length = 753
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 199/471 (42%), Positives = 289/471 (61%), Gaps = 46/471 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGW GPLP++W +QL +Q ++L R+ LGM VLPAF+G+VP + FP T
Sbjct: 212 MGNLHGWAGPLPRAWHLKQLYVQYRVLERMRSLGMITVLPAFAGHVPQGVLRAFPRVNAT 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W D + CTYLLD DP+F IG F+++ +KE+G T HIY+ DTF+E P
Sbjct: 272 RLGGWSHF--DCTYSCTYLLDPEDPMFQVIGTLFLKELIKEFG-TDHIYSADTFNEMNPL 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S + +A++ M D AVWLMQGWLF + P FW+P Q++ALL+ VPLG+++V
Sbjct: 329 SSDPAYLSRVSSAVFRSMTGADPKAVWLMQGWLFQHQPDFWQPAQVRALLHGVPLGRMIV 388
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ ++ FYG P+IWCMLHNF GN ++G +++I GP AR N+TMVG
Sbjct: 389 LDLFAESRPVYQWTESFYGQPFIWCMLHNFGGNHGLFGTVEAINHGPFAARRFPNSTMVG 448
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G+ EGIEQN +VY+LM+E+ ++ E +D+ +W+ +Y+ RRYG A AW +L +V
Sbjct: 449 TGLVPEGIEQNDMVYELMNELGWRQEPLDLPSWVARYAERRYGAPNAAAASAWQLLLRSV 508
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNCT + NR +V P L+ +T +WY
Sbjct: 509 YNCTGVCVNHNRSPLVRRPS-------------------------LRMDT------EVWY 537
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ------L 413
+ S+V A L +++G EL +S T+ YDL D+TRQA + +E +L+I +A+Q L
Sbjct: 538 NKSDVYEAWRLLLSAGAELGSSPTFGYDLADVTRQAAQQLVSEYYLSIRQAFQSRSLPEL 597
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
A GV +L+ ++DGLL+ H FLLG WLESA+ +A ++ + +Q
Sbjct: 598 LTAGGVL-----VYDLLPELDGLLSSHRLFLLGRWLESARAVATSDREAEQ 643
>gi|390348210|ref|XP_785272.3| PREDICTED: alpha-N-acetylglucosaminidase [Strongylocentrotus
purpuratus]
Length = 793
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 188/465 (40%), Positives = 271/465 (58%), Gaps = 36/465 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPLPQSW QL LQ +IL R+ +LGM PVLPAF+G+VP + VFP+A I+
Sbjct: 220 MGNIDGWGGPLPQSWHTNQLALQHQILKRMRDLGMIPVLPAFAGHVPXSFSKVFPNASIS 279
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG+W + P +CCT LLD DP+F ++G+AFI+ +E+ T HIY+ DTF+EN P
Sbjct: 280 NLGDW--GRFGPEYCCTSLLDPQDPMFKQVGKAFIDAMSEEFNGTDHIYSADTFNENKPK 337
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
Y+S+ +Y G+ GD VWLM GWLF FW P Q+KALL VP+G+++VL
Sbjct: 338 SRDSAYLSAASKGVYQGIIEGDPKGVWLMMGWLFQDTGFWGPTQIKALLQGVPIGRMIVL 397
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AE +P + T+ FYG P+IWCMLHNF GN +YG LD++ GP EAR +N+TM+G+
Sbjct: 398 DLYAEARPFYKTTYSFYGQPFIWCMLHNFGGNTGLYGKLDAVNQGPFEARNYDNSTMIGM 457
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG---RSVPAIQDAWNVLYH 297
G + EGI QN V+Y+ +++M ++ +V WI QY+ RRY +AW +L
Sbjct: 458 GTTPEGIFQNYVMYNFLTDMTWRSGSTNVSKWIEQYAGRRYSNDPNKSEEATEAWVILKE 517
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
TVYN T D Y PV + + + + +
Sbjct: 518 TVYNNTGTLQD------------------------HQYAVPVRRPSNIMTSP-------V 546
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY ++V +A E + + +L S +RYDL+D+TR L A + ++ ++++ +A
Sbjct: 547 WYDYTKVAKAWEFLLEASTKLGTSPVFRYDLVDVTRNVLQDLAFDFQQKLMVSFRIRNAG 606
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
V L+ DMD + + H+ +LLG WLE AK LA N ++E
Sbjct: 607 AVGGNGTLLCNLILDMDNITSSHEDWLLGTWLEDAKSLATNNDEE 651
>gi|14861380|gb|AAK73655.1| lysosomal alpha-N-acetyl glucosaminidase [Dromaius novaehollandiae]
Length = 753
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 199/471 (42%), Positives = 289/471 (61%), Gaps = 46/471 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGW GPLP++W +QL +Q ++L R+ LGM VLPAF+G+VP + FP T
Sbjct: 212 MGNLHGWAGPLPRAWHLKQLYVQYRVLERMRSLGMITVLPAFAGHVPQGVLRAFPRVNAT 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W D + CTYLLD DP+F IG F+++ +KE+G T HIY+ DTF+E P
Sbjct: 272 RLGGWSHF--DCTYSCTYLLDPEDPMFQVIGTLFLKELIKEFG-TDHIYSADTFNEMNPL 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S + +A++ M D AVWLMQGWLF + P FW+P Q++ALL+ VPLG+++V
Sbjct: 329 SSDPAYLSRVSSAVFRSMTGADPKAVWLMQGWLFQHQPDFWQPAQVRALLHGVPLGRMIV 388
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ ++ FYG P+IWCMLHNF GN ++G +++I GP AR N+TMVG
Sbjct: 389 LDLFAESRPVYQWTESFYGQPFIWCMLHNFGGNHGLFGTVEAINHGPFAARRFPNSTMVG 448
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G+ EGIEQN +VY+LM+E+ ++ E +D+ +W+ +Y+ RRYG A AW +L +V
Sbjct: 449 TGLVPEGIEQNDMVYELMNELGWRQEPLDLPSWVARYAERRYGAPNAAAASAWXLLLRSV 508
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNCT + NR +V P L+ +T +WY
Sbjct: 509 YNCTGVCVNHNRSPLVRRPS-------------------------LRMDT------EVWY 537
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ------L 413
+ S+V A L +++G EL +S T+ YDL D+TRQA + +E +L+I +A+Q L
Sbjct: 538 NKSDVYEAWRLLLSAGAELGSSPTFGYDLADVTRQAAQQLVSEYYLSIRQAFQSRSLPEL 597
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
A GV +L+ ++DGLL+ H FLLG WLESA+ +A ++ + +Q
Sbjct: 598 LTAGGVL-----VYDLLPELDGLLSSHRLFLLGRWLESARAVATSDREAEQ 643
>gi|443691318|gb|ELT93213.1| hypothetical protein CAPTEDRAFT_144379, partial [Capitella teleta]
Length = 718
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 192/462 (41%), Positives = 277/462 (59%), Gaps = 36/462 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL +W QQ++LQ +IL R+ +LGM P LPAF+G+VPA + +FP K++
Sbjct: 188 MGNMRGWGGPLSTNWHHQQILLQHRILKRMRDLGMTPALPAFAGHVPANITRLFPRVKVS 247
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W S +CCT LLD DPLF EIG+AFI++ +E+G T H+YN DTF+E TP
Sbjct: 248 KLGDWGRFNST--YCCTTLLDVEDPLFKEIGKAFIDEYTREFG-TDHVYNTDTFNEMTPA 304
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
P Y++ G A+YSGM S DS A+WLMQGWLF D FW+PPQ KALL SVP GK++VL
Sbjct: 305 SSDPSYLTKAGQAVYSGMVSSDSKAIWLMQGWLFLSD-FWKPPQAKALLTSVPQGKMLVL 363
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL++EV P + + +YG P+IWCMLHNF G + MYG ++S+ GP E R+ N+TMVG+
Sbjct: 364 DLYSEVNPQYPRLQSYYGQPFIWCMLHNFGGTLPMYGAIESVNQGPFEGRSFVNSTMVGI 423
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN V+Y+ M E +F+ + V++ W ++Y+ RRY + AW + TVY
Sbjct: 424 GLTPEGINQNEVMYEFMMENSFRSQPVELTEWFDKYATRRYASRNANARAAWQIFKRTVY 483
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
NC+DG N+++ V P I +WY
Sbjct: 484 NCSDGVKHHNKNIPVCRPSRKNKI-------------------------------DVWYD 512
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+ + +L IA+ E+ S +RYDL+D++RQAL + + I+ +Y+ + +
Sbjct: 513 VEDFFKGWDLMIAASKEVD-SPLFRYDLVDVSRQALQVISITYYNQILTSYKQKNLTSLA 571
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
L L++DMD +LA FLLG W+ A + E++
Sbjct: 572 SSGNDLLHLLDDMDTVLATDSHFLLGAWIAGAHRNGVTPEEK 613
>gi|326679829|ref|XP_688608.3| PREDICTED: alpha-N-acetylglucosaminidase-like [Danio rerio]
Length = 757
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 196/464 (42%), Positives = 279/464 (60%), Gaps = 37/464 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL WGGPLPQSW +QL LQ KIL R+ GM PVLPAFSG VP + +FP A +T
Sbjct: 214 MGNLFQWGGPLPQSWHVKQLYLQFKILDRMRSFGMIPVLPAFSGIVPEGITRLFPKANVT 273
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L W + C Y+LD DPLF IG F+ Q ++E+G T HIYN DTF+E P
Sbjct: 274 KLSPWSHFNCT--YSCAYVLDPRDPLFHRIGALFLTQVIEEFG-TDHIYNTDTFNEMPPA 330
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y++S+ AI++ M S D A+WLMQGWLF DP FW+ Q+KALL+ VPLG+++V
Sbjct: 331 SSDPTYLASISRAIFNTMTSVDPQAIWLMQGWLFISDPSFWKADQVKALLHGVPLGRMIV 390
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE P++S++ FYG P+IWCMLHNF GN ++G +DSI GP A N+T+VG
Sbjct: 391 LDLFAESMPVYSSTNSFYGQPFIWCMLHNFGGNSGLFGTVDSINSGPFNAVRFPNSTLVG 450
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+GM+ EGIEQNPV+Y+LMSE+A++ + V++ W++ Y++RRYG + AW +L+ +V
Sbjct: 451 LGMTPEGIEQNPVIYELMSELAWRKDPVNLYKWVSLYALRRYGSMDENLALAWQLLFRSV 510
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK-PVSKEAVLKSETSSYDHPHLW 358
YNCT P KY+N+ + P+ L +T +W
Sbjct: 511 YNCT-------------LP------------KYKNHNRSPLVHRPSLHMQTD------IW 539
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y ++ RA +L + L T+RYDL+D+TRQAL E + +I A+Q
Sbjct: 540 YDPADFYRAWKLLFEAAPGLVTLETFRYDLVDVTRQALQLLTTEFYKDIKSAFQTQKLSD 599
Query: 419 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
+ + +L+ ++D +L+ ++ FLLG WL+ A+ +E +
Sbjct: 600 LLTAGGVLVYDLLPELDRILSSNEHFLLGAWLQQAQSQGVDEHE 643
>gi|348533253|ref|XP_003454120.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Oreochromis
niloticus]
Length = 845
Score = 373 bits (958), Expect = e-101, Method: Compositional matrix adjust.
Identities = 190/471 (40%), Positives = 290/471 (61%), Gaps = 47/471 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL + GPLPQSW QL LQ KIL R+ GM PVLPAFSGN+P + ++P A++T
Sbjct: 308 MANLFKFAGPLPQSWHVNQLYLQFKILERMRSFGMIPVLPAFSGNIPKGILRLYPEARVT 367
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W + C+ +LD DPLF IG ++ Q LK++G T HIY+ DTF+E TPP
Sbjct: 368 RLGPWSHFNCS--YSCSLVLDPQDPLFHHIGSLYLSQVLKQFG-TDHIYSTDTFNEMTPP 424
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S++ ++++ M + D AVWLMQGWLF D FW+P Q++ALL+ VPLG+++V
Sbjct: 425 SSDPAYLSAVSRSVFASMTAVDPQAVWLMQGWLFFSDAAFWKPAQIQALLHGVPLGRMIV 484
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +PI+S ++ FYG P+IWCML NF GN ++G ++SI GP +A N+T+VG
Sbjct: 485 LDLFAETEPIFSYTESFYGQPFIWCMLQNFGGNSGLFGTVESINSGPFKALHFPNSTLVG 544
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+GM+ EGIEQNPV Y+LMSE+A++ E V++ W++ Y++RRYG + ++ AW +L+ ++
Sbjct: 545 IGMTPEGIEQNPVTYELMSELAWRKEPVNLAKWVSLYAIRRYGNTQESLTTAWRLLFASI 604
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG-KPVSKEAVLKSETSSYDHPHLW 358
YNCTD Y+N+ P+ + + T LW
Sbjct: 605 YNCTDP-------------------------HYRNHNHSPLVRRPSFQMNTG------LW 633
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 412
Y +++ +A +L + + L + T+RYDL+D+TR+ L + +I +A+ +
Sbjct: 634 YDPADLYKAWKLIMDAAPSLMSKETFRYDLVDVTREVLQVLTTSFYRDIADAFKKQNLSE 693
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
L A GV +L+ +++ LL+ + FLLG WLE A+ LA ++++ +
Sbjct: 694 LLTAGGVL-----VYDLLPELNRLLSSNRNFLLGAWLERARSLAVDDKEAQ 739
>gi|375144105|ref|YP_005006546.1| alpha-N-acetylglucosaminidase [Niastella koreensis GR20-10]
gi|361058151|gb|AEV97142.1| Alpha-N-acetylglucosaminidase [Niastella koreensis GR20-10]
Length = 735
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 192/455 (42%), Positives = 278/455 (61%), Gaps = 36/455 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL WGGPLP SW+ LQ+KIL R ELGM PVLPAF+G+VP A + +P+AK+
Sbjct: 201 MGNLDAWGGPLPLSWMKSHKALQEKILQRERELGMKPVLPAFTGHVPPAFKKKYPNAKL- 259
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ NW + +D TY+LD+ DPLF E+G+ F+++Q +G T H+Y+ DTF+EN PP
Sbjct: 260 KATNWTNGFAD-----TYILDSQDPLFAEMGKRFLQKQTSLFG-TDHLYSADTFNENEPP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
D P ++S+L A IY GM+ D+ A W+MQGWLF D FW+ PQ++ALL +VP K+++
Sbjct: 314 SDDPAFLSALSARIYEGMKQADTAATWVMQGWLFYSDRKFWKAPQIEALLKAVPDNKMIL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMV 238
LDL AE++P+W + FYG P+IW MLHNF GN+ ++G +D +A P E + + +
Sbjct: 374 LDLAAEIEPVWKRTDAFYGKPWIWNMLHNFGGNVNLFGRMDGVATQPAETLNDKASGKLW 433
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G++ME IEQNPV+Y+LM+ +Q VD+ AWI QY + RY + + DAW +L T
Sbjct: 434 GIGLTMEAIEQNPVMYELMTRHTWQTTPVDLDAWIPQYVLNRYRTNNTNLVDAWQILRKT 493
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYN GA + D SII+ G+P + + T PH
Sbjct: 494 VYN---GAVIR---------DGAESIIT---------GRPTFDSTTVWTRTKLNYAPH-- 530
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
E++ A +LF+ + + S+ ++YDL+D+TRQ LA YA L + A+ D+
Sbjct: 531 ----ELLPAWDLFVQAAGKGVNSDGFQYDLVDVTRQVLANYAAPLQKKWVTAFNAKDSAA 586
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
+ S+ FL+L+ DMD LLA F+LGPWL +A+
Sbjct: 587 FNKYSKAFLQLISDMDLLLASRKDFMLGPWLSAAR 621
>gi|373953359|ref|ZP_09613319.1| alpha-N-acetylglucosaminidase [Mucilaginibacter paludis DSM 18603]
gi|373889959|gb|EHQ25856.1| alpha-N-acetylglucosaminidase [Mucilaginibacter paludis DSM 18603]
Length = 733
Score = 367 bits (942), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 188/458 (41%), Positives = 272/458 (59%), Gaps = 37/458 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ WGGPLP W+ LQKKI+ R LGM PVLPAF+G+VPAA +N +P+AK+
Sbjct: 195 MGNMDSWGGPLPLRWMQTHFDLQKKIIARERALGMKPVLPAFTGHVPAAFKNKYPTAKL- 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ NW + +D TY+LD+ DP+F IG+ F+++Q G T H+Y+ DTF+EN PP
Sbjct: 254 KTTNWKNGFAD-----TYILDSADPMFARIGQLFLQKQTALLG-TDHLYSADTFNENEPP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
D PEY+ L +Y GM D+ AVW+MQGWLF D FW+P Q +ALL +VP K+++
Sbjct: 308 SDEPEYLGKLSERVYQGMHQADTAAVWVMQGWLFYSDRKFWKPEQTRALLKAVPDDKMII 367
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMV 238
LDL E++P+W ++ FYG P+IW ML+NF N ++G +DS A GP EA ++ M
Sbjct: 368 LDLATEIEPVWKRTEAFYGKPWIWNMLNNFGANTNLFGRMDSAAKGPAEAYHDPKSGQMK 427
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G++MEGIEQNPV+YDL+++ ++++ ++V W+ +Y + RYG+ Q AWN+L T
Sbjct: 428 GIGLTMEGIEQNPVLYDLLTDNTWRNQPINVDEWLPKYVLNRYGKPNAQAQKAWNILRKT 487
Query: 299 VYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
VY+ D +I A P D S S +
Sbjct: 488 VYSVLADRYIRDGAESIIQARPTTDSS--------------------------SRWARTT 521
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
L Y ++ A + I + +LS S+ +R+DL+DL+RQ LA YA L + A+Q DA
Sbjct: 522 LNYEPKALLPAWQAMIKASEDLSTSDGFRFDLVDLSRQVLANYAFTLQRRFVLAHQQKDA 581
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 454
+ S F+EL++DMD LLA FLLGPW+ A++
Sbjct: 582 AAFKKHSAEFIELIQDMDQLLATRKDFLLGPWVADARR 619
>gi|73965663|ref|XP_548088.2| PREDICTED: alpha-N-acetylglucosaminidase [Canis lupus familiaris]
Length = 747
Score = 366 bits (939), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 195/465 (41%), Positives = 280/465 (60%), Gaps = 38/465 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +QL LQ +IL R+ GM PVLPAFSG+VP AL VFP IT
Sbjct: 205 MGNLHTWGGPLPHSWHLKQLYLQHRILDRMRSFGMIPVLPAFSGHVPKALTRVFPQINIT 264
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF IG F+ + ++E+G T+HIY DTF+E PP
Sbjct: 265 QLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRELIQEFG-TNHIYGADTFNEMQPP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y++S A++Y M + DSDAVWL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 322 SSEPSYLASATASVYQAMITVDSDAVWLLQGWLFQHQPQFWGPAQVKAVLEAVPRGRLLV 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM+G
Sbjct: 382 LDLFAESQPVYIQTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMLG 441
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D++AW++ ++ RRYG + + AW +L +
Sbjct: 442 TGMAPEGIGQNEVVYALMAELGWRKDPVADLEAWVSSFAARRYGVAHRDTEAAWRLLLRS 501
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + + NR +V PS+ VT +
Sbjct: 502 VYNCSGEACSGHNRSPLVR----RPSLQMVTT---------------------------V 530
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L++S T+RYDL+D+TRQA + + ++ AY +
Sbjct: 531 WYNRSDVFEAWRLLLTAAPTLASSPTFRYDLLDVTRQAAQELVSLYYVEARSAYLRKELV 590
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
+ + + + EL+ +D +LA FLLG WLE A+ A +E +
Sbjct: 591 PLLRAAGVLVYELLPALDKVLASDSRFLLGRWLEQARAAAVSEAE 635
>gi|410930376|ref|XP_003978574.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Takifugu rubripes]
Length = 751
Score = 365 bits (938), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 192/465 (41%), Positives = 289/465 (62%), Gaps = 35/465 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGPLPQSW QL LQ KIL ++ GM PVLPAFSGN+P + +FP A++T
Sbjct: 214 MGNMFKFGGPLPQSWHVNQLYLQFKILAQMRSFGMIPVLPAFSGNIPKGILRLFPEARVT 273
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L W K + + C+Y+LD DPLF IG ++ Q +K++G T+HIYN DTF+E TPP
Sbjct: 274 RLEPW--SKFNCSFSCSYILDPRDPLFSRIGSLYLSQVVKQFG-TNHIYNTDTFNEMTPP 330
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S++ A+++ M + D AVWLMQGWLF D FW+P Q++ALLN VP+G+++V
Sbjct: 331 SSEPTYLSAVSRAVFASMTAVDPQAVWLMQGWLFLSDALFWKPAQIQALLNGVPVGRMIV 390
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++S ++ FYG P+IWCMLHNF GN +G ++SI GP +A N+++VG
Sbjct: 391 LDLFAETEPVFSYTESFYGQPFIWCMLHNFGGNGGFFGTVESINTGPFKALHFPNSSLVG 450
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+GM+ EGIEQNPVVY+LMSE+A++ E V++ W++ Y RRYG ++ AW +L+ +V
Sbjct: 451 IGMTPEGIEQNPVVYELMSELAWRKEPVNLLKWVSLYVTRRYGSMHESVSAAWKILFASV 510
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNCT P Y+N+ + L S + + LWY
Sbjct: 511 YNCT-------------LP------------HYRNH-----NHSPLVRRPSFHMNSELWY 540
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+++ RA +L + + + T++YDL+D+TRQ + + +I++A+Q + +
Sbjct: 541 DPADLYRAWKLILEAAPSFMSKETFQYDLVDVTRQVMQVLTTSYYQDIVDAFQKHKMQEL 600
Query: 420 FQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
L +L+ +++ LL+ + FLLG WLE A+ LA +E + K
Sbjct: 601 LTAGGVLLYDLLPELNRLLSSNHNFLLGTWLEQARSLALDEREAK 645
>gi|270005801|gb|EFA02249.1| hypothetical protein TcasGA2_TC007912 [Tribolium castaneum]
Length = 747
Score = 365 bits (936), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 182/465 (39%), Positives = 287/465 (61%), Gaps = 35/465 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W + LVLQK+IL R+ G+ PVLPAF+G++P A + ++P A ++
Sbjct: 201 MGNMRGFGGPLSPAWHSRSLVLQKQILQRMRAFGIIPVLPAFAGHLPRAFKTLYPDANMS 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W + +CC Y LD T+PLF EIG+AF+ +Q+ E+G T H+YNCD+F+EN P
Sbjct: 261 KMAPWNGF--NDTYCCPYFLDPTEPLFNEIGKAFLSEQISEFG-TDHMYNCDSFNENVPT 317
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQ-MKALLNSVPLGKLVV 179
Y++++G +IY M D DAVWL+QGW+F D FW+ + ++++L SVPLGK++V
Sbjct: 318 SGDLTYLANVGKSIYKAMTDTDPDAVWLLQGWMFYNDNFWQDTERVRSILTSVPLGKMIV 377
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P + Q++G PYIWCMLH+F G + M+G I P++AR EN+TM+G
Sbjct: 378 LDLQSEQFPQYERLNQYFGQPYIWCMLHDFGGTLGMFGSSTVINEVPIKARHLENSTMIG 437
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI QN V+Y+LM+E A++ V++ W +YS RRYG ++AW +L TV
Sbjct: 438 TGLTPEGINQNYVIYELMTETAWRQAPVNLTEWFEKYSTRRYGFPDSDAENAWRILQRTV 497
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y D ++ GKY ++K LK + + WY
Sbjct: 498 Y--------------------DYQGLNRMRGKY-----AITKSPSLKIKIWT------WY 526
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
ST++++ A + + + L A++ Y +DL+D+TRQ L Y + + +++ YQ +D+
Sbjct: 527 STNDLLEAWTSLLEASDNLGANSGYLHDLVDVTRQVLQVYGDLYYKEMVKNYQSHDSANF 586
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
S++FLE+++D+D +L+ + FLLGPWLE+AK+ A + +E Q
Sbjct: 587 QANSKKFLEILDDLDEILSTNSAFLLGPWLEAAKKAANDSAEEAQ 631
>gi|91080563|ref|XP_973259.1| PREDICTED: similar to alpha-N-acetyl glucosaminidase [Tribolium
castaneum]
Length = 747
Score = 365 bits (936), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 183/465 (39%), Positives = 286/465 (61%), Gaps = 35/465 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W + LVLQK+IL R+ G+ PVLPAF+G++P A + ++P A ++
Sbjct: 201 MGNMRGFGGPLSPAWHSRSLVLQKQILQRMRAFGIIPVLPAFAGHLPRAFKTLYPDANMS 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W + +CC Y LD T+PLF EIG+AF+ +Q+ E+G T H+YNCD+F+EN P
Sbjct: 261 KMAPWNGF--NDTYCCPYFLDPTEPLFNEIGKAFLSEQISEFG-TDHMYNCDSFNENVPT 317
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
Y++++G +IY M D DAVW+MQGWLF++D F W + KA+L +VP GK++V
Sbjct: 318 SGDLTYLANVGKSIYKAMTDTDPDAVWVMQGWLFAHDFFYWTRNRAKAILTAVPKGKMIV 377
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P + Q++G PYIWCMLH+F G + M+G I P++AR EN+TM+G
Sbjct: 378 LDLQSEQFPQYERLNQYFGQPYIWCMLHDFGGTLGMFGSSTVINEVPIKARHLENSTMIG 437
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI QN V+Y+LM+E A++ V++ W +YS RRYG ++AW +L TV
Sbjct: 438 TGLTPEGINQNYVIYELMTETAWRQAPVNLTEWFEKYSTRRYGFPDSDAENAWRILQRTV 497
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y D ++ GKY ++K LK + + WY
Sbjct: 498 Y--------------------DYQGLNRMRGKY-----AITKSPSLKIKIWT------WY 526
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
ST++++ A + + + L A++ Y +DL+D+TRQ L Y + + +++ YQ +D+
Sbjct: 527 STNDLLEAWTSLLEASDNLGANSGYLHDLVDVTRQVLQVYGDLYYKEMVKNYQSHDSANF 586
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
S++FLE+++D+D +L+ + FLLGPWLE+AK+ A + +E Q
Sbjct: 587 QANSKKFLEILDDLDEILSTNSAFLLGPWLEAAKKAANDSAEEAQ 631
>gi|348562747|ref|XP_003467170.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
[Cavia porcellus]
Length = 750
Score = 362 bits (929), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 189/466 (40%), Positives = 281/466 (60%), Gaps = 36/466 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLP++W +QL LQ +IL R+ LGM PVLPAF+G+VP A+ VFP IT
Sbjct: 211 MGNLHGWGGPLPRTWHLKQLSLQHQILDRMRALGMTPVLPAFAGHVPKAIGRVFPQVNIT 270
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF IG F+ + ++E+G T+HIY DTF+E PP
Sbjct: 271 QLGSWGHFNCS--YSCSFLLAPEDPLFPLIGGIFLRELIREFG-TNHIYGADTFNEMQPP 327
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A++ M + DSDAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 328 SSDPAYLAAATEAVFKAMVAVDSDAVWLLQGWLFQHQPEFWGPAQVGAVLGAVPQGRLLV 387
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 388 LDLFAESQPVYTRTASFRGQPFIWCMLHNFGGNHGLFGALEAVNRGPTAARLFPNSTMVG 447
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G++ EGI QN VVY LM+E+ ++ + V D+ AW+++++ RRYG + P + AW +L +
Sbjct: 448 TGITPEGIGQNEVVYALMAELGWRKDPVPDLLAWVSRFAERRYGVAQPDAEAAWRLLLRS 507
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC+ A + P+ + L+ T+ +W
Sbjct: 508 VYNCSGEACRGHNH------------------------SPLVRRPSLQMNTA------VW 537
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y+ S+V A L + + +L+ S T+RYDL+D+TRQAL + + + + AY + G
Sbjct: 538 YNRSDVFEAWRLLLKASPKLTTSPTFRYDLLDVTRQALQELVSLYYEEVRAAYLHQELAG 597
Query: 419 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ + +L+ +D +LA FLLG WL A+ A +E + +
Sbjct: 598 LLRAGGVLAYQLLPALDEVLASDHHFLLGSWLAQARAAAASETEAR 643
>gi|410981277|ref|XP_003996997.1| PREDICTED: alpha-N-acetylglucosaminidase [Felis catus]
Length = 857
Score = 362 bits (928), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 193/465 (41%), Positives = 273/465 (58%), Gaps = 38/465 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 318 MGNLHTWGGPLPPSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAITRVFPQVNVT 377
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF IG F+ + KE+G T HIY DTF+E PP
Sbjct: 378 QLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRELTKEFG-TDHIYGADTFNEMQPP 434
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y++S A++Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 435 SSEPSYLASATASVYQAMVTVDPDAVWLLQGWLFQHQPQFWGPAQVSAVLGAVPRGRLLV 494
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 495 LDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMVG 554
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D++AW+ ++ RRYG S + AW +L +
Sbjct: 555 TGMAPEGIGQNEVVYALMAELGWRKDPVADLEAWVTGFAARRYGVSHGNTEAAWRLLLRS 614
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + + NR +V P LK T+ +
Sbjct: 615 VYNCSGEACSGHNRSPLVRRPS-------------------------LKMTTT------V 643
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L+ S T+RYDL+D+TRQA + + + AY +
Sbjct: 644 WYNRSDVFEAWRLLLTTTPSLATSPTFRYDLLDVTRQAAQELVSLYYGEARTAYLNKELV 703
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
+ + + + EL+ +D +LA FLLG WLE A+ A +E +
Sbjct: 704 PLLRAAGILVYELLPSLDKVLASDSRFLLGSWLEQARAAAVSEAE 748
>gi|432926094|ref|XP_004080826.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Oryzias latipes]
Length = 882
Score = 360 bits (925), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 191/470 (40%), Positives = 285/470 (60%), Gaps = 45/470 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N++ +GGPLPQSW QL LQ +IL R+ GM PVLPAFSGNVP + + P A +T
Sbjct: 345 MANMYKFGGPLPQSWHVNQLRLQFRILERMRAFGMIPVLPAFSGNVPKGILKLHPEANVT 404
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W + C+Y+LD DPLF++IG ++ Q +K++G T HIYN DTF+E TPP
Sbjct: 405 RLGPWAHFNCS--FSCSYVLDPRDPLFLQIGSLYLSQVVKQFG-TDHIYNTDTFNEMTPP 461
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S++ ++++ M + D A+WLMQGWLF D FW+PPQ++ALL+ VPLG+++V
Sbjct: 462 SSDPAYLSAISRSVFASMTAVDPKAIWLMQGWLFFSDAAFWKPPQIRALLHGVPLGRMIV 521
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++S ++ FYG P+IWCMLHNF GN +G ++SI GP +A +N+TMVG
Sbjct: 522 LDLFAETEPVFSYTESFYGQPFIWCMLHNFGGNNGFFGTVESINSGPFKALNFKNSTMVG 581
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+GM+ EGI QNPV+Y+LMSE+A++ E V++ W + Y+ RRYG ++ AW +L+ +V
Sbjct: 582 IGMTPEGIHQNPVIYELMSELAWRKESVNLTKWASLYAARRYGSMHESLSAAWKLLFSSV 641
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNCT V + N+ P+ + T LWY
Sbjct: 642 YNCT-----------------------VPHYRNHNH-SPLVRRPSFNMNTG------LWY 671
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QL 413
++++ +LF+ + L + T+RYDL+D+TRQ L + +I +A+ +L
Sbjct: 672 DPADLLETWKLFMEAAPSLMSKETFRYDLVDVTRQVLQDLTTYFYQDIKDAFHSKKMPEL 731
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ GV +L +++ LL FLLG WLE A+ A +E + +
Sbjct: 732 LTSGGVL-----IYDLFPELNRLLNSDRNFLLGTWLEQAQSFALDEPEAR 776
>gi|149054264|gb|EDM06081.1| rCG33377, isoform CRA_d [Rattus norvegicus]
Length = 580
Score = 360 bits (923), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 192/467 (41%), Positives = 277/467 (59%), Gaps = 38/467 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +
Sbjct: 43 MGNLHTWDGPLPRSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAITRVFPQVNVI 102
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLGNW + C++LL DPLF IG F+ + KE+G T HIY DTF+E PP
Sbjct: 103 QLGNWGHFNCS--YSCSFLLAPGDPLFPLIGTLFLRELTKEFG-TDHIYGADTFNEMQPP 159
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 160 FSDPSYLAAATAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIKAVLEAVPRGRLLV 219
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++S + F+G P+IWCMLHNF GN ++G L+ + GP AR N+TMVG
Sbjct: 220 LDLFAETQPVYSRTASFHGQPFIWCMLHNFGGNHGLFGALEDVNQGPQAARLFPNSTMVG 279
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G++ EGI QN VVY LM+E+ ++ + V D+ AW++ ++ RRYG S P AW +L +
Sbjct: 280 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLVAWVSSFASRRYGVSQPDAVAAWRLLLRS 339
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + + NR P+ K L+ T+ +
Sbjct: 340 VYNCSGEACSGHNR-------------------------SPLVKRPSLQMSTA------V 368
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L+AS +RYDL+D+TRQA+ + + + A+ D
Sbjct: 369 WYNRSDVFEAWRLLLRAAPNLTASPAFRYDLLDVTRQAVQELVSSCYEEARTAFLNQDLD 428
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ + +L+ +D LLA + FLLG WL+ A+++A +E + +
Sbjct: 429 LLLRAGGLLTYKLLPSLDELLASNSHFLLGTWLDQAREVAVSESEAQ 475
>gi|255533666|ref|YP_003094038.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
gi|255346650|gb|ACU05976.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
Length = 735
Score = 360 bits (923), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 190/466 (40%), Positives = 278/466 (59%), Gaps = 38/466 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPLP+S + LQKKIL R GM P+LPAF+G+VP A ++ FP AK+
Sbjct: 199 MGNIDGWGGPLPKSQMLAHEALQKKILERERSFGMTPILPAFTGHVPPAFKDKFPKAKLK 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ NW + S Y+LD D LF IG+ FIE+++K +G T H+Y DTF+ENTPP
Sbjct: 259 KT-NWTTFPS------VYILDPEDELFTTIGKRFIEEEVKTFG-TDHLYTADTFNENTPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
Y+S++ +Y M D +A W+MQGWLF + + FW+P Q+KALLN++P K++V
Sbjct: 311 TSDSLYLSNVSKKVYQSMALADPEATWIMQGWLFYHGEKFWKPTQIKALLNAIPNDKMIV 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMV 238
LDL++E P+W + +YG P+IW MLHNF GNI +YG +D +A G ++A+ + N+ MV
Sbjct: 371 LDLWSENHPVWQRTAAYYGKPWIWNMLHNFGGNISLYGRMDEVASGAIKAKQAANSGNMV 430
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G++ E IEQNPV+Y LM + + E ++V AW+ YS +RYG + AW +LY T
Sbjct: 431 GIGLTPEAIEQNPVMYQLMLDNIWTDEPINVTAWLKNYSRQRYGAQNALAEQAWQILYKT 490
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VY T G P SI++ G+P +E++ P
Sbjct: 491 VY--TGG----------ILPGGPESILT---------GRPTM------AESTRSTRPKKN 523
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y +E+I A E + + +LS ++ ++YDL+D+TRQ L YA+ L +AYQ D
Sbjct: 524 YKPAELIPAWEALLKASQQLS-TDGFKYDLVDVTRQVLVNYADTLQRQFAQAYQGKDGKK 582
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
+LS FL +++D+D LLA FLLG WL AK++ E++K+
Sbjct: 583 FDRLSGDFLAVMDDVDYLLATRKDFLLGKWLNEAKRMGTTAEEKKR 628
>gi|281344539|gb|EFB20123.1| hypothetical protein PANDA_011160 [Ailuropoda melanoleuca]
Length = 619
Score = 359 bits (922), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 188/467 (40%), Positives = 275/467 (58%), Gaps = 38/467 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 81 MGNLHTWGGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 140
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF IG F+ + KE+G T HIY DTF+E PP
Sbjct: 141 QLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRELTKEFG-TDHIYGADTFNEMQPP 197
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A++Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 198 SSEPSYLAAATASVYQAMITVDPDAVWLLQGWLFQHQPEFWGPAQVTAVLGAVPRGRLLV 257
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F+G P+IWCMLHNF GN ++G L+++ GP AR N+TM G
Sbjct: 258 LDLFAESQPVYIRTASFHGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMAG 317
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN +VY LM+E+ ++ + V D++AW++ + RRYG + + AW +L +
Sbjct: 318 TGMAPEGIGQNEMVYALMAELGWRKDPVADLEAWVSSSAARRYGVTHKDTEAAWRLLLRS 377
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + + NR +V P + + +
Sbjct: 378 VYNCSGEACSGHNRSPLVRRPSLQMATA-------------------------------V 406
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L+AS ++RYDL+D+TRQA + + + AY +
Sbjct: 407 WYNRSDVFEAWRLLLTAAPTLAASPSFRYDLLDVTRQAAQELVSLYYEEARAAYLNKELV 466
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ + + R + EL+ +D +LA FLLG WLE A+ A +E + +
Sbjct: 467 PLLRAAGRLVYELLPALDKVLASDRRFLLGSWLEQARAAAVSEAEAR 513
>gi|109491871|ref|XP_001081442.1| PREDICTED: alpha-N-acetylglucosaminidase [Rattus norvegicus]
gi|392351622|ref|XP_002727861.2| PREDICTED: alpha-N-acetylglucosaminidase [Rattus norvegicus]
gi|149054262|gb|EDM06079.1| rCG33377, isoform CRA_b [Rattus norvegicus]
Length = 739
Score = 359 bits (922), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 192/467 (41%), Positives = 277/467 (59%), Gaps = 38/467 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +
Sbjct: 202 MGNLHTWDGPLPRSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAITRVFPQVNVI 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLGNW + C++LL DPLF IG F+ + KE+G T HIY DTF+E PP
Sbjct: 262 QLGNWGHFNCS--YSCSFLLAPGDPLFPLIGTLFLRELTKEFG-TDHIYGADTFNEMQPP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 319 FSDPSYLAAATAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIKAVLEAVPRGRLLV 378
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++S + F+G P+IWCMLHNF GN ++G L+ + GP AR N+TMVG
Sbjct: 379 LDLFAETQPVYSRTASFHGQPFIWCMLHNFGGNHGLFGALEDVNQGPQAARLFPNSTMVG 438
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G++ EGI QN VVY LM+E+ ++ + V D+ AW++ ++ RRYG S P AW +L +
Sbjct: 439 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLVAWVSSFASRRYGVSQPDAVAAWRLLLRS 498
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + + NR P+ K L+ T+ +
Sbjct: 499 VYNCSGEACSGHNR-------------------------SPLVKRPSLQMSTA------V 527
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L+AS +RYDL+D+TRQA+ + + + A+ D
Sbjct: 528 WYNRSDVFEAWRLLLRAAPNLTASPAFRYDLLDVTRQAVQELVSSCYEEARTAFLNQDLD 587
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ + +L+ +D LLA + FLLG WL+ A+++A +E + +
Sbjct: 588 LLLRAGGLLTYKLLPSLDELLASNSHFLLGTWLDQAREVAVSESEAQ 634
>gi|301773566|ref|XP_002922216.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Ailuropoda
melanoleuca]
Length = 634
Score = 359 bits (921), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 188/467 (40%), Positives = 275/467 (58%), Gaps = 38/467 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 95 MGNLHTWGGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 154
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF IG F+ + KE+G T HIY DTF+E PP
Sbjct: 155 QLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRELTKEFG-TDHIYGADTFNEMQPP 211
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A++Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 212 SSEPSYLAAATASVYQAMITVDPDAVWLLQGWLFQHQPEFWGPAQVTAVLGAVPRGRLLV 271
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F+G P+IWCMLHNF GN ++G L+++ GP AR N+TM G
Sbjct: 272 LDLFAESQPVYIRTASFHGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMAG 331
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN +VY LM+E+ ++ + V D++AW++ + RRYG + + AW +L +
Sbjct: 332 TGMAPEGIGQNEMVYALMAELGWRKDPVADLEAWVSSSAARRYGVTHKDTEAAWRLLLRS 391
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + + NR +V P + + +
Sbjct: 392 VYNCSGEACSGHNRSPLVRRPSLQMATA-------------------------------V 420
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L+AS ++RYDL+D+TRQA + + + AY +
Sbjct: 421 WYNRSDVFEAWRLLLTAAPTLAASPSFRYDLLDVTRQAAQELVSLYYEEARAAYLNKELV 480
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ + + R + EL+ +D +LA FLLG WLE A+ A +E + +
Sbjct: 481 PLLRAAGRLVYELLPALDKVLASDRRFLLGSWLEQARAAAVSEAEAR 527
>gi|295132875|ref|YP_003583551.1| hypothetical protein ZPR_1010 [Zunongwangia profunda SM-A87]
gi|294980890|gb|ADF51355.1| predicted protein [Zunongwangia profunda SM-A87]
Length = 750
Score = 359 bits (921), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 188/464 (40%), Positives = 275/464 (59%), Gaps = 36/464 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGGPLPQSW + LQKKIL R ELGM PVLPAF+G+VPA+ + FP A +
Sbjct: 206 MGNLDGWGGPLPQSWKESHRDLQKKILKRSRELGMKPVLPAFTGHVPASFKKFFPDADLK 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ NW + D TY+LDA DPLF EIG+ F+E+Q + +G T H Y DTF+EN PP
Sbjct: 266 KT-NWGNDFGD-----TYILDAEDPLFAEIGKRFLEKQEEVFG-TDHFYTADTFNENEPP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVV 179
D P+Y+ L I+ GM++ D +A W+MQGWLF S+ FW+ PQ+K LL++VP ++++
Sbjct: 319 SDDPKYLGELSEKIFEGMKAADPEATWVMQGWLFYSHKDFWKTPQIKGLLSTVPDDRMII 378
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMV 238
LDL E++P+W ++ FYG +IW MLHNF GNI M+G ++++A P A S + +
Sbjct: 379 LDLATEIEPVWKQTEAFYGKQWIWNMLHNFGGNISMFGRIETVAEQPALALNDSTSGNLK 438
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G++ME IEQNPV+Y+LM++ ++ +++K+W+ Y+ RYG +I +AW++L T
Sbjct: 439 GIGLTMEAIEQNPVLYELMTDNTWRDTPIELKSWLKNYTRNRYGAVNDSILEAWDILVAT 498
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
YN T D +I A P EG Y+ + +
Sbjct: 499 AYNGT-TIRDGAESIIAARP--------TFEG-YRRWARTKMN----------------- 531
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y +++ A +LFI + + S+ + YDL+DL+RQ LA YA + + AY+ ND
Sbjct: 532 YDPLDLLPAWDLFIGARDRFKDSDGFAYDLVDLSRQVLANYALPVQQQMRIAYENNDKEA 591
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ S L L+ D+D LLA FLLGPW+ A+ E++
Sbjct: 592 FKKHSEELLTLISDLDRLLATRKDFLLGPWIADARSWGTTPEEK 635
>gi|449491231|ref|XP_004174728.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase,
partial [Taeniopygia guttata]
Length = 752
Score = 358 bits (919), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 194/471 (41%), Positives = 286/471 (60%), Gaps = 46/471 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL W GPLP +W +QL LQ +I+ R+ LGM VLPAF+G+VP + VFP T
Sbjct: 211 MGNLRRWAGPLPPAWHFKQLYLQYRIVERMRSLGMTTVLPAFAGHVPQGILRVFPRVNAT 270
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W D + C YLLD DP+F IG F+++ +KE+G T H+Y+ DTF+E TP
Sbjct: 271 RLGHWSHF--DCTYSCIYLLDPEDPMFQVIGTLFLKELIKEFG-TDHVYSADTFNEMTPL 327
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S + A++ M D A+WLMQGWLF + P FW+P Q++ALL+ VPLG+++V
Sbjct: 328 SSDPAYLSRVSNAVFRSMTGADPKALWLMQGWLFQHQPDFWQPAQVRALLHGVPLGRMIV 387
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE KP++ ++ FYG P+IWCMLHNF GN ++G +++I GP AR N+TMVG
Sbjct: 388 LDLFAESKPVYQWTESFYGQPFIWCMLHNFGGNHGLFGTVEAINHGPFAARRFPNSTMVG 447
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G+ EGIEQN +VY+LM+E+ ++ E +D+ +W+ +Y+ RRYG A AW +L +V
Sbjct: 448 TGLVPEGIEQNDMVYELMNELGWRQEPLDLPSWVTRYAERRYGAPNAAAASAWRLLLRSV 507
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNCT + NR P+ + L+ +T LWY
Sbjct: 508 YNCTGVCVNHNRS-------------------------PLVRRPSLRMDT------ELWY 536
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ------L 413
+ S+V A L +++G EL +S + YDL+D+TRQA + + +L+I +A+Q L
Sbjct: 537 NASDVFEAWRLLLSAGAELGSSPAFLYDLVDVTRQAAQQLVSHYYLSIRQAFQSHALPEL 596
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
A GV +L+ ++D LL+ H FLLG WL+SA+ +A ++++ +Q
Sbjct: 597 LTAGGVL-----VYDLLPELDSLLSSHSLFLLGRWLQSARAVATSDQEAEQ 642
>gi|340717403|ref|XP_003397173.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bombus terrestris]
Length = 770
Score = 358 bits (919), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 189/464 (40%), Positives = 267/464 (57%), Gaps = 35/464 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL SW ++ L LQ KIL R+ ELG+ PVLPAF+G+VP A +FP A +T
Sbjct: 218 MGNIRGFGGPLTSSWHERSLQLQHKILQRMRELGIIPVLPAFTGHVPRAFPRLFPEANVT 277
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W S ++CC YLL+ TDPLF +IG F+ +KE+G T HIYNCDTF+EN PP
Sbjct: 278 KSATWNSFSD--KYCCPYLLEPTDPLFHKIGDQFLRTYIKEFG-TDHIYNCDTFNENEPP 334
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++G +I+ M S D A+WLMQGWLF +D FW P++K L SVPLG+L+V
Sbjct: 335 TSELKFLRNVGHSIFQTMLSVDPQAIWLMQGWLFVHDALFWTEPRIKTFLTSVPLGRLIV 394
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P++ K +YG P+IWCMLHNF G + M+G I E R E +TM+G
Sbjct: 395 LDLQSEQFPLYGKLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINRRVFEGRNMEGSTMIG 454
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI QN V+Y+LM+EMA++ E V++ W Y+ RRYG AW L TV
Sbjct: 455 TGLTPEGINQNYVIYELMNEMAYRQEPVNLDNWFEDYASRRYGAWNEYAVAAWKNLGSTV 514
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YN IS GKY +P A L WY
Sbjct: 515 YNFRG--------------------ISKIRGKYVITRRPSLNLARLT-----------WY 543
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ +F+ + + S YR+D++D+TRQAL A++++ ++E++ D
Sbjct: 544 DPEKFYSTWYIFLQARHGRQNSTLYRHDVVDITRQALQLKADKIYSALVESFNQKDVTTF 603
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ R LEL +D++ +LA + FLLG WLE AK LA ++ + K
Sbjct: 604 KLQADRLLELFDDLEAILASSEDFLLGTWLEMAKNLATDDAESK 647
>gi|440903235|gb|ELR53922.1| Alpha-N-acetylglucosaminidase, partial [Bos grunniens mutus]
Length = 614
Score = 358 bits (918), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 195/469 (41%), Positives = 274/469 (58%), Gaps = 46/469 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 77 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 136
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+GNW + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 137 QMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 193
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 194 SSEPSYLAAATTAVYQAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVAAVLGAVPRGRLLV 253
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+TMVG
Sbjct: 254 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPTTARHFPNSTMVG 313
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S + AW +L +
Sbjct: 314 TGMAPEGIGQNEVVYALMAELGWKKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 373
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC S E + N+ P+ + L+ T+ +W
Sbjct: 374 VYNC-----------------------SGEECRGHNH-SPLVRRPSLQMVTT------VW 403
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 412
Y+ S+V A L +A+ + L++S +RYDL+D+TRQA+ + + + + AY
Sbjct: 404 YNRSDVFEAWRLLLAATSTLASSPAFRYDLVDVTRQAVQELVSLYYEEMRTAYLKKELVP 463
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
L A G+ EL+ +D +LA FLLG WLE A+Q A +E +
Sbjct: 464 LTRAGGILA-----YELLPALDQVLASDCHFLLGSWLEQARQAAVSETE 507
>gi|350407422|ref|XP_003488083.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bombus impatiens]
Length = 770
Score = 358 bits (918), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 189/464 (40%), Positives = 268/464 (57%), Gaps = 35/464 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL SW ++ L LQ +IL R+ ELG+ PVLPAF+G+VP A +FP A +T
Sbjct: 218 MGNIRGFGGPLTSSWHERSLQLQHRILQRMRELGIIPVLPAFTGHVPRAFPRLFPEANVT 277
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W S ++CC YLL+ TDPLF +IG F+ +KE+G T HIYNCDTF+EN PP
Sbjct: 278 KSATWNSFSD--KYCCPYLLEPTDPLFHKIGDQFLRTYIKEFG-TDHIYNCDTFNENEPP 334
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++G +I+ M S D A+WLMQGWLF +D FW P++KA L SVPLG+L+V
Sbjct: 335 TSELKFLRNVGHSIFQTMLSVDPQAIWLMQGWLFVHDAVFWTEPRIKAFLTSVPLGRLIV 394
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P++ K +YG P+IWCMLHNF G + M+G I E R E +TM+G
Sbjct: 395 LDLQSEQFPLYGKLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINRRVFEGRNMEGSTMIG 454
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI QN V+Y+LM+EMA++ E V++ W Y+ RRYG AW L TV
Sbjct: 455 TGLTPEGINQNYVIYELMNEMAYRQEPVNLDNWFEDYASRRYGAWNEYAVAAWKNLGSTV 514
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YN IS GKY +P A L WY
Sbjct: 515 YNFRG--------------------ISKIRGKYVITRRPSLNLARLT-----------WY 543
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ +F+ + + S YR+D++D+TRQAL A++++ ++E++ D
Sbjct: 544 DPEKFYSTWYIFLQARHGRKNSTLYRHDVVDITRQALQLKADKIYSVLVESFNQKDVTTF 603
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ R LEL +D++ +LA + FLLG WLE AK LA ++ + K
Sbjct: 604 KLQAGRLLELFDDLEAILASSEDFLLGTWLEMAKNLATDDAESK 647
>gi|355568706|gb|EHH24987.1| Alpha-N-acetylglucosaminidase, partial [Macaca mulatta]
Length = 711
Score = 358 bits (918), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 193/470 (41%), Positives = 273/470 (58%), Gaps = 48/470 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 172 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 231
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 232 KMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRELVKEFG-TDHIYGADTFNEMQPP 288
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 289 SSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 348
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 349 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 408
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ +RYG S P AW +L +
Sbjct: 409 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLLRS 468
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR +V P L+ TS +
Sbjct: 469 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------V 497
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
WY+ S V A L + S L+AS +RYDL+DLTRQA+ + + + AY
Sbjct: 498 WYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELT 557
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
L A GV EL+ +D LLA FLLG WLE A+ A +E +
Sbjct: 558 SLLRAGGVLA-----YELLPALDELLASDSRFLLGSWLEQARAAAVSEAE 602
>gi|402900329|ref|XP_003913130.1| PREDICTED: alpha-N-acetylglucosaminidase [Papio anubis]
Length = 743
Score = 358 bits (918), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 193/470 (41%), Positives = 272/470 (57%), Gaps = 48/470 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRELVKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 321 SSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPEAARLFPNSTMVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ +RYG S P AW +L +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLLRS 500
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR +V P L+ TS +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------V 529
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
WY+ S V A L + S L+AS +RYDL+DLTRQA+ + + + AY
Sbjct: 530 WYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELT 589
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
L A GV EL+ +D LLA FLLG WLE A+ A +E +
Sbjct: 590 SLLRAGGVLA-----YELLPALDELLASDSRFLLGSWLEQARAAAVSEAE 634
>gi|126307960|ref|XP_001366343.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Monodelphis
domestica]
Length = 741
Score = 357 bits (917), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 183/465 (39%), Positives = 269/465 (57%), Gaps = 38/465 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +Q LQ +IL R+ GM PVLPAF+G++P A VFP A +T
Sbjct: 202 MGNLHTWGGPLPSSWDLKQSYLQYRILERMRSFGMKPVLPAFAGHIPKAFTRVFPQANVT 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W + + C+YLL DPLF +G F+ + KE+G T HIY+ D F+E PP
Sbjct: 262 NLGMWGHFSCN--YSCSYLLAPEDPLFPVVGSLFLRELTKEFG-TDHIYSADIFNEMDPP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+P Y+++ AA+Y M + D DAVWL QGWLF P FW+PPQMKA+L +VP G+ ++
Sbjct: 319 SSNPAYLAATTAAVYEAMVAVDVDAVWLFQGWLFQNHPDFWKPPQMKAVLEAVPRGRFLI 378
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++S + FYG P+IWCMLHNF GN ++G+LD++ GP AR N+T+VG
Sbjct: 379 LDLFAESQPVYSRTNSFYGQPFIWCMLHNFGGNHGLFGVLDAVNRGPSTARLFPNSTIVG 438
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+ EGI QN VVY LM+E+ ++ + D+ AW+ ++ +RYG P + AW +L +
Sbjct: 439 TGIVPEGINQNEVVYALMAELGWRKDPFPDLGAWVAGFAAQRYGTPHPQAEAAWRLLLRS 498
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + T N +V P + +
Sbjct: 499 VYNCSWENCTGHNHSPLVKRPSLHLDF-------------------------------SV 527
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + +L+ S+ +RYDL+D+TRQ + + + + A++
Sbjct: 528 WYNRSDVFEAWRLLLEAAPQLATSSAFRYDLLDVTRQVAQELVSLYYGELKTAFEAGSMP 587
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
+ + +L+ +D LL + FLLG WLE A+++A +E +
Sbjct: 588 ALLSAGGLLVFDLLPSLDELLGTDERFLLGGWLEQAREMAVSEAE 632
>gi|114667172|ref|XP_523654.2| PREDICTED: alpha-N-acetylglucosaminidase isoform 2 [Pan
troglodytes]
gi|410216584|gb|JAA05511.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
gi|410258938|gb|JAA17435.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
gi|410304442|gb|JAA30821.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
gi|410337929|gb|JAA37911.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
Length = 743
Score = 357 bits (915), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 191/470 (40%), Positives = 272/470 (57%), Gaps = 48/470 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L ++ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S P AW +L +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR +V P L+ TS +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
L A GV EL+ +D +LA FLLG WLE A+ A +E +
Sbjct: 590 SLLRAGGVLA-----YELLPALDEVLASDSRFLLGSWLEQARAAAVSEAE 634
>gi|66346698|ref|NP_000254.2| alpha-N-acetylglucosaminidase precursor [Homo sapiens]
gi|317373322|sp|P54802.2|ANAG_HUMAN RecName: Full=Alpha-N-acetylglucosaminidase; AltName:
Full=N-acetyl-alpha-glucosaminidase; Short=NAG;
Contains: RecName: Full=Alpha-N-acetylglucosaminidase 82
kDa form; Contains: RecName:
Full=Alpha-N-acetylglucosaminidase 77 kDa form; Flags:
Precursor
Length = 743
Score = 357 bits (915), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 191/470 (40%), Positives = 272/470 (57%), Gaps = 48/470 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L ++ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S P AW +L +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR +V P L+ TS +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
L A GV EL+ +D +LA FLLG WLE A+ A +E +
Sbjct: 590 SLLRAGGVLA-----YELLPALDEVLASDSRFLLGSWLEQARAAAVSEAE 634
>gi|1171229|gb|AAC50512.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|1171231|gb|AAC50513.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|1197840|gb|AAB06188.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|1479981|gb|AAB36604.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|32450702|gb|AAH53991.1| N-acetylglucosaminidase, alpha- [Homo sapiens]
gi|119581237|gb|EAW60833.1| N-acetylglucosaminidase, alpha- (Sanfilippo disease IIIB), isoform
CRA_b [Homo sapiens]
Length = 743
Score = 357 bits (915), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 191/470 (40%), Positives = 272/470 (57%), Gaps = 48/470 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L ++ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S P AW +L +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR +V P L+ TS +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
L A GV EL+ +D +LA FLLG WLE A+ A +E +
Sbjct: 590 SLLRAGGVLA-----YELLPALDEVLASDSRFLLGSWLEQARAAAVSEAE 634
>gi|355754184|gb|EHH58149.1| Alpha-N-acetylglucosaminidase, partial [Macaca fascicularis]
Length = 650
Score = 356 bits (914), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 193/470 (41%), Positives = 272/470 (57%), Gaps = 48/470 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 111 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 170
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 171 KMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRELVKEFG-TDHIYGADTFNEMQPP 227
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 228 SSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRLLV 287
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 288 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 347
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ +RYG S P AW +L +
Sbjct: 348 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLLRS 407
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR +V P L+ TS +
Sbjct: 408 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------V 436
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
WY+ S V A L + S L+AS +RYDL+DLTRQA+ + + + AY
Sbjct: 437 WYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELT 496
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
L A GV EL+ +D LLA FLLG WLE A+ A +E +
Sbjct: 497 SLLRAGGVLA-----YELLPALDELLASDSRFLLGSWLEQARAAAVSEAE 541
>gi|1479983|gb|AAB36605.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|119581236|gb|EAW60832.1| N-acetylglucosaminidase, alpha- (Sanfilippo disease IIIB), isoform
CRA_a [Homo sapiens]
Length = 639
Score = 356 bits (914), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 191/470 (40%), Positives = 272/470 (57%), Gaps = 48/470 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L ++ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 100 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 159
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 160 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 216
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 217 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 276
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 277 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 336
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S P AW +L +
Sbjct: 337 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 396
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR +V P L+ TS +
Sbjct: 397 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 425
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY
Sbjct: 426 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 485
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
L A GV EL+ +D +LA FLLG WLE A+ A +E +
Sbjct: 486 SLLRAGGVLA-----YELLPALDEVLASDSRFLLGSWLEQARAAAVSEAE 530
>gi|297701096|ref|XP_002827555.1| PREDICTED: alpha-N-acetylglucosaminidase [Pongo abelii]
Length = 836
Score = 356 bits (913), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 191/470 (40%), Positives = 271/470 (57%), Gaps = 48/470 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 297 MGNLHSWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 356
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HI+ DTF+E PP
Sbjct: 357 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIFGADTFNEMQPP 413
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 414 SSEPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRLLV 473
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 474 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 533
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S P AW +L +
Sbjct: 534 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 593
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR +V P L+ TS +
Sbjct: 594 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------V 622
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY
Sbjct: 623 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 682
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
L A GV EL+ +D +LA FLLG WLE A+ A +E +
Sbjct: 683 SLLRAGGVLA-----YELLPALDEVLASDSRFLLGSWLEQARAAAVSEAE 727
>gi|395827009|ref|XP_003786703.1| PREDICTED: alpha-N-acetylglucosaminidase [Otolemur garnettii]
Length = 756
Score = 356 bits (913), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 191/464 (41%), Positives = 271/464 (58%), Gaps = 36/464 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWGGPLPFSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QL +W + C++LL DP+F IG F+ + KE+G T HIY DTF+E PP
Sbjct: 264 QLSSWGHFNCS--YSCSFLLAPGDPIFSLIGSLFLRELTKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D DAVWL+QGWLF + P FW P Q+KA+L +VPLG+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMIAVDPDAVWLLQGWLFQHQPQFWGPTQIKAVLRAVPLGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++S + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 381 LDLFAESQPVYSRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNQGPKAARLFPNSTMVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S + AW +L +
Sbjct: 441 TGMAPEGINQNEVVYALMAELGWRKDPVPDLVAWVTSFADRRYGISHGDAEAAWRLLLRS 500
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC+ A + P+ K L+ T+ +W
Sbjct: 501 VYNCSGEACSGHNH------------------------SPLVKRPSLQMNTT------VW 530
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y+ S+V A L + S L+AS +RYDL+D+TRQA+ + + + AY +
Sbjct: 531 YNRSDVFEAWRLLLTSAPTLAASPIFRYDLLDITRQAIQELVSLYYEKARTAYLNKELVP 590
Query: 419 VFQLSRRF-LELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
+ + EL+ +D +LA + FLLG WL A+ +A +E +
Sbjct: 591 LLRAGGLLAYELLPALDEVLASDNHFLLGSWLAQARAVAISEAE 634
>gi|358419179|ref|XP_003584151.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bos taurus]
Length = 741
Score = 356 bits (913), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 196/469 (41%), Positives = 274/469 (58%), Gaps = 46/469 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 204 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+GNW + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 264 QMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATAAVYQAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVAAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+TMVG
Sbjct: 381 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPTTARHFPNSTMVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ +Q + V D+ AW+ ++ RRYG S + AW +L +
Sbjct: 441 TGMAPEGIGQNEVVYALMAELGWQKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 500
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC S E + N+ P+ + L+ T+ +W
Sbjct: 501 VYNC-----------------------SGEECRGHNH-SPLVRRPSLQMVTT------VW 530
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 412
Y+ S+V A L + + + L++S +RYDL+D+TRQA+ + + + + AY
Sbjct: 531 YNRSDVFEAWRLLLTATSTLASSPAFRYDLVDVTRQAVQELVSLYYEEMRTAYLKKELVP 590
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
L A G+ EL+ +D +LA FLLG WLE A+Q A +E +
Sbjct: 591 LTRAGGILA-----YELLPALDQVLASDCHFLLGSWLEQARQAAVSETE 634
>gi|426348060|ref|XP_004041658.1| PREDICTED: alpha-N-acetylglucosaminidase [Gorilla gorilla gorilla]
Length = 743
Score = 356 bits (913), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 191/470 (40%), Positives = 272/470 (57%), Gaps = 48/470 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L ++ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIEAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S P AW +L +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR +V P L+ TS +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
L A GV EL+ +D +LA FLLG WLE A+ A +E +
Sbjct: 590 SLLRAGGVLA-----YELLPALDEVLASDSRFLLGSWLEQARAAAVSEAE 634
>gi|397485721|ref|XP_003813989.1| PREDICTED: alpha-N-acetylglucosaminidase [Pan paniscus]
Length = 682
Score = 355 bits (912), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 191/470 (40%), Positives = 272/470 (57%), Gaps = 48/470 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L ++ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 143 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 202
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 203 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 259
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 260 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 319
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 320 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNEGPEAARLFPNSTMVG 379
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S P AW +L +
Sbjct: 380 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 439
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR +V P L+ TS +
Sbjct: 440 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 468
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY
Sbjct: 469 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 528
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
L A GV EL+ +D +LA FLLG WLE A+ A +E +
Sbjct: 529 SLLRAGGVLA-----YELLPALDEVLASDSRFLLGSWLEQARAAAVSEAE 573
>gi|301626955|ref|XP_002942650.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Xenopus (Silurana)
tropicalis]
Length = 759
Score = 353 bits (907), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 181/463 (39%), Positives = 273/463 (58%), Gaps = 46/463 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+H WGGPL SW++++L LQ +I R+ LGM VLPAF+G++P + VFP ++
Sbjct: 212 MGNIHTWGGPLSISWMEKRLSLQLQITERMRSLGMITVLPAFAGHIPEGILRVFPKVTVS 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W + + C+YLLD DPLF IG F+ Q ++ +G T HIY+ DTF+E +P
Sbjct: 272 RLGGWSNFNCT--YSCSYLLDPEDPLFQWIGELFLSQMVQSFG-TDHIYSADTFNEMSPT 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S++ AI+ M D DA+WLMQGWLF +P FWRP Q KALL+ P+G+++V
Sbjct: 329 SSDPGYLSAVSGAIFKSMAKVDPDAIWLMQGWLFINNPSFWRPAQTKALLHGAPIGRIIV 388
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE P++ T++ FYG P+IWCML+NF GN ++G ++ + GP +A N+TMVG
Sbjct: 389 LDLFAETVPVYLTTESFYGQPFIWCMLNNFGGNHGLFGNIEGVNRGPFDAAKFPNSTMVG 448
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGIEQN ++Y+ M+E+ + + +++ WI+ YS RRYG+S + AW +L +V
Sbjct: 449 TGLTPEGIEQNDMIYEFMNEIGWSSQPINLTKWISNYSDRRYGQSNTDARMAWQILLRSV 508
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNCT + N +V P ++ + + Y
Sbjct: 509 YNCTQILHNHNHSPLVRRPSLNMNT-------------------------------DICY 537
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QL 413
+ +++ A + L S T+ YDL+D+TR+A+ + +E +L I EAY QL
Sbjct: 538 NKADIYEAWRFMHNASFALGKSATFLYDLVDITREAVQQLVSEYYLEIKEAYGKKSLQQL 597
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
A GV +L+ ++D LL+ GFLLG WL++AK +A
Sbjct: 598 MTAGGVL-----VYDLLPELDSLLSSQPGFLLGSWLKAAKSMA 635
>gi|148671928|gb|EDL03875.1| alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB), isoform
CRA_a [Mus musculus]
Length = 538
Score = 353 bits (905), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 185/467 (39%), Positives = 273/467 (58%), Gaps = 38/467 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW Q+ LQ +IL R+ GM PVLPAF+G+VP A+ VFP +
Sbjct: 1 MGNLHTWDGPLPRSWHLSQVYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVI 60
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W + C++LL DP+F IG F+ + KE+G T HIY DTF+E PP
Sbjct: 61 KLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRELTKEFG-TDHIYGADTFNEMQPP 117
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 118 FSDPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIRAVLEAVPRGRLLV 177
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE P++ + F+G P+IWCMLHNF GN ++G L+ + GP AR N+TMVG
Sbjct: 178 LDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLFGALEDVNRGPQAARLFPNSTMVG 237
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G++ EGI QN VVY LM+E+ ++ + V D+ AW++ +++RRYG S P AW +L +
Sbjct: 238 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSSFAIRRYGVSQPDAVAAWKLLLRS 297
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + + NR P+ K L+ T+ +
Sbjct: 298 VYNCSGEACSGHNRS-------------------------PLVKRPSLQMSTA------V 326
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L+ S +RYDL+D+TRQA+ + + + AY +
Sbjct: 327 WYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQAVQELVSLCYEEARTAYLKQELD 386
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ + + +L+ +D LLA FLLG WL+ A++ A +E + +
Sbjct: 387 LLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQARKAAVSEAEAQ 433
>gi|254910995|ref|NP_038820.2| alpha-N-acetylglucosaminidase precursor [Mus musculus]
gi|20385160|gb|AAM21194.1|AF363242_1 N-acetyl-glucosaminidase [Mus musculus]
gi|3329361|gb|AAC26842.1| alpha-N-acetylglucosaminidase [Mus musculus]
gi|33585908|gb|AAH55733.1| Alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB) [Mus
musculus]
gi|74211094|dbj|BAE37639.1| unnamed protein product [Mus musculus]
gi|74218052|dbj|BAE42009.1| unnamed protein product [Mus musculus]
gi|148671929|gb|EDL03876.1| alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB), isoform
CRA_b [Mus musculus]
Length = 739
Score = 352 bits (903), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 185/467 (39%), Positives = 273/467 (58%), Gaps = 38/467 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW Q+ LQ +IL R+ GM PVLPAF+G+VP A+ VFP +
Sbjct: 202 MGNLHTWDGPLPRSWHLSQVYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVI 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W + C++LL DP+F IG F+ + KE+G T HIY DTF+E PP
Sbjct: 262 KLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRELTKEFG-TDHIYGADTFNEMQPP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 319 FSDPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIRAVLEAVPRGRLLV 378
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE P++ + F+G P+IWCMLHNF GN ++G L+ + GP AR N+TMVG
Sbjct: 379 LDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLFGALEDVNRGPQAARLFPNSTMVG 438
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G++ EGI QN VVY LM+E+ ++ + V D+ AW++ +++RRYG S P AW +L +
Sbjct: 439 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSSFAIRRYGVSQPDAVAAWKLLLRS 498
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + + NR P+ K L+ T+ +
Sbjct: 499 VYNCSGEACSGHNR-------------------------SPLVKRPSLQMSTA------V 527
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L+ S +RYDL+D+TRQA+ + + + AY +
Sbjct: 528 WYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQAVQELVSLCYEEARTAYLKQELD 587
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ + + +L+ +D LLA FLLG WL+ A++ A +E + +
Sbjct: 588 LLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQARKAAVSEAEAQ 634
>gi|2660688|gb|AAB88084.1| Naglu [Mus musculus]
Length = 739
Score = 352 bits (903), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 185/467 (39%), Positives = 273/467 (58%), Gaps = 38/467 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW Q+ LQ +IL R+ GM PVLPAF+G+VP A+ VFP +
Sbjct: 202 MGNLHTWDGPLPRSWHLSQVYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVI 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W + C++LL DP+F IG F+ + KE+G T HIY DTF+E PP
Sbjct: 262 KLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRELTKEFG-TDHIYGADTFNEMQPP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 319 FSEPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIRAVLEAVPRGRLLV 378
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE P++ + F+G P+IWCMLHNF GN ++G L+ + GP AR N+TMVG
Sbjct: 379 LDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLFGALEDVNRGPQAARLFPNSTMVG 438
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G++ EGI QN VVY LM+E+ ++ + V D+ AW++ +++RRYG S P AW +L +
Sbjct: 439 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSSFAIRRYGVSQPDAVAAWKLLLRS 498
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + + NR P+ K L+ T+ +
Sbjct: 499 VYNCSGEACSGHNRS-------------------------PLVKRPSLQMSTA------V 527
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L+ S +RYDL+D+TRQA+ + + + AY +
Sbjct: 528 WYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQAVQELVSLCYEEARTAYLKQELD 587
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ + + +L+ +D LLA FLLG WL+ A++ A +E + +
Sbjct: 588 LLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQARKAAVSEAEAQ 634
>gi|311267179|ref|XP_003131436.1| PREDICTED: alpha-N-acetylglucosaminidase [Sus scrofa]
Length = 744
Score = 352 bits (903), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 191/470 (40%), Positives = 270/470 (57%), Gaps = 48/470 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 205 MGNLHTWSGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQISVT 264
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+G+W + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 265 QMGSWGHFNCS--YSCSFLLAPEDPLFPIVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 322 SSEPSYLAAATAAVYQAMITVDPDAVWLLQGWLFQHQPQFWGPAQVGAVLGAVPRGRLLV 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+TM G
Sbjct: 382 LDLFAESQPVYVRTASFLGQPFIWCMLHNFGGNHGLFGALESVNQGPAAARLFPNSTMAG 441
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ W+ ++ RRYG S + AW +L +
Sbjct: 442 TGMAPEGIGQNEVVYALMAELGWRKDPVADLGTWVTSFAARRYGVSQGDAEAAWRLLLRS 501
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ +G T NR +V P + + +
Sbjct: 502 VYNCSGEGCTGHNRSPLVRRPSLQMATT-------------------------------V 530
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
WY+ S+V A L + + L++S +RYDL+D+TRQA+ + + + AY
Sbjct: 531 WYNQSDVFEAWRLLLKATPTLASSPAFRYDLVDITRQAVQELVSLYYEEARTAYLNKELV 590
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
L A G+ EL+ +D +LA FLLG WLE A+ +A +E +
Sbjct: 591 SLMRAGGILA-----YELLPALDKVLASDSHFLLGSWLEQARGVAVSEAE 635
>gi|444714090|gb|ELW54978.1| Alpha-N-acetylglucosaminidase [Tupaia chinensis]
Length = 724
Score = 351 bits (900), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 193/467 (41%), Positives = 277/467 (59%), Gaps = 38/467 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +QL LQ ++L R+ GM PVLPAF G+VP A+ VFP +T
Sbjct: 174 MGNLHTWGGPLPHSWHLKQLYLQHRVLDRMRSFGMIPVLPAFPGHVPKAITRVFPQVNVT 233
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DP+F IG F+ + KE+G T HIY DTF+E PP
Sbjct: 234 QLGSWGHFNCS--YSCSFLLAPGDPMFPIIGSLFLRELTKEFG-TDHIYGADTFNELQPP 290
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AAIY+ M + D AVWL+QGW+F + P FW P Q+KA+L +VP G+L+V
Sbjct: 291 SSEPSYLAAATAAIYAAMTAVDPGAVWLLQGWIFQHQPDFWGPAQVKAVLEAVPRGRLLV 350
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN +YG L+++ +GP AR N++MVG
Sbjct: 351 LDLFAETRPVYLYTASFLGQPFIWCMLHNFGGNHGLYGTLEAVNWGPKAARLFPNSSMVG 410
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ Y+ RRYG S+ + AW +L +
Sbjct: 411 TGMAPEGINQNEVVYALMAELGWRKDPVPDLAAWVTSYADRRYGVSLGDAEAAWRLLLRS 470
Query: 299 VYNCTDG-ATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR P+ K L+ T+ +
Sbjct: 471 VYNCSGQMCSGHNR-------------------------SPLVKRPSLQMNTT------V 499
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L+AS T+RYDL+D+TRQA+ + + + AY +
Sbjct: 500 WYNRSDVFEAWRLLLTAAPTLAASPTFRYDLLDVTRQAVQELVSLYYEEARTAYLNKELV 559
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ + + EL+ D+D LLA F+LG WLE A+ +A +E + +
Sbjct: 560 SLLRAGGILVYELLPDLDNLLATDGRFMLGSWLEQARAVAVSETEAQ 606
>gi|405964692|gb|EKC30145.1| Alpha-N-acetylglucosaminidase [Crassostrea gigas]
Length = 859
Score = 350 bits (898), Expect = 9e-94, Method: Compositional matrix adjust.
Identities = 177/485 (36%), Positives = 276/485 (56%), Gaps = 31/485 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+HGWGGP+ Q+W+D QL+LQ KIL R+ GM PVLP F+G+VP A +P A ++
Sbjct: 217 MGNMHGWGGPITQNWIDDQLILQHKILERMRSFGMIPVLPGFAGHVPEATILRYPQANVS 276
Query: 61 QLGNWFSVKSDPRW--------------------CCTYLLDATDPLFIEIGRAFIEQQLK 100
+L +W W CC YLLD DPLF++I FI++
Sbjct: 277 RLTDWAGFNQSFCWHYPTANVSRLRDWGHFNKTYCCNYLLDFNDPLFMKIAVRFIKEMEN 336
Query: 101 EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW 160
E+G H+Y+ DTF+E P +S EY++ G +Y ++ DS A+WLMQGWLF FW
Sbjct: 337 EFG-VDHVYSVDTFNEMRPRSNSTEYLALSGRTVYKSLKEADSKAIWLMQGWLFIDGGFW 395
Query: 161 RPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 220
+ PQ+KALL +VP G++++LDL++E+ PI++ ++ +YG P+IWCMLH+F G +E+YG L
Sbjct: 396 KQPQIKALLTAVPQGEMIILDLYSEIIPIYTQTESYYGQPFIWCMLHDFGGTMELYGALK 455
Query: 221 SIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRR 280
I GP R N++MVG+GM+ EGI QN VVY+ +E ++ D+ WI++Y + R
Sbjct: 456 LINEGPFNGRAFPNSSMVGLGMTPEGIFQNEVVYEFFTENVWRKAPRDISTWISKYVLNR 515
Query: 281 YGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE---GKYQN--- 334
YG++ I AW L ++VYN +D D + + I PD PS+ G Y N
Sbjct: 516 YGKTNKFIDLAWQYLKNSVYNNSDNLKDHDSNAI---PDHRPSLSPALHPDLGIYNNTDY 572
Query: 335 -YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 393
+ ++ + +WY+ ++ A ++ + +E S S+ + YD++D+TR
Sbjct: 573 LHDNSINIIVTTLPRMTPLIQQDVWYNPEDLYVAWDIMTLNLDEFSNSSLFMYDIVDVTR 632
Query: 394 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
+L + + + +++ A+ D H V + L L+ DMD +L FLLG W+++A
Sbjct: 633 NSLQILSIKYYTDLVYAFGRGDIHAVESHGNQLLGLLSDMDTVLGSDSHFLLGRWIKAAT 692
Query: 454 QLAQN 458
A +
Sbjct: 693 DNAMD 697
>gi|354485058|ref|XP_003504701.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cricetulus griseus]
gi|344251941|gb|EGW08045.1| Alpha-N-acetylglucosaminidase [Cricetulus griseus]
Length = 740
Score = 350 bits (897), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 187/472 (39%), Positives = 276/472 (58%), Gaps = 48/472 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +
Sbjct: 203 MGNLHTWGGPLPRSWHLKQLYLQHRILDRMRAFGMIPVLPAFAGHVPKAITRVFPQVNVF 262
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E P
Sbjct: 263 QLGSWGHFNCS--YSCSFLLAPGDPVFPLIGSLFLRELIKEFG-TDHIYGADTFNEMQPI 319
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P ++++ AA+Y M S D DA+WL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 320 SSDPSFLTAATAAVYEAMISVDPDAIWLLQGWLFQHQPQFWGPAQVKAVLQAVPRGRLLV 379
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE P++ + FYG P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 380 LDLFAESHPVYMQTASFYGQPFIWCMLHNFGGNHGLFGALEAVNQGPRAARIFPNSTMVG 439
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G++ EGI QN +VY LM+E+ ++ + V D++ W+++++ RYG S P + AW +L +
Sbjct: 440 TGIAPEGIGQNEMVYALMAELGWRKDPVPDLEVWVSRFASHRYGMSHPDAEAAWRLLLRS 499
Query: 299 VYNC-TDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC + NR +V P + + I +
Sbjct: 500 VYNCPGETYNGHNRSPLVKRPSLQINTI-------------------------------V 528
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ----- 412
WY+ S+V A L + + L+ S +RYDL+D+TRQ+L + + + A+
Sbjct: 529 WYNRSDVFEAWRLLLTAAPNLTTSKAFRYDLLDVTRQSLQELVSLFYEEARIAFMKEELD 588
Query: 413 -LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
L A G+ ++R+ L+ +D LLA FLLG WL A+ +A +E++ +
Sbjct: 589 LLLRAGGI--ITRK---LLPALDELLASDSRFLLGTWLNQARAMAVSEDEAQ 635
>gi|344285558|ref|XP_003414528.1| PREDICTED: alpha-N-acetylglucosaminidase [Loxodonta africana]
Length = 744
Score = 349 bits (896), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 193/470 (41%), Positives = 271/470 (57%), Gaps = 48/470 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 205 MGNLHSWGGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKAVTRVFPQVNVT 264
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+G+W + C++LL DP+F IG F+ + E+G T HIY DTF+E PP
Sbjct: 265 QMGSWGHFNCS--YSCSFLLAPGDPMFPIIGSLFLRELTTEFG-TDHIYGADTFNEMQPP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G L+V
Sbjct: 322 SSEPSYLAAATAAVYEAMITVDPDAVWLLQGWLFQHQPQFWGPAQVGAVLGAVPRGHLLV 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 382 LDLFAETQPVYIRTASFQGQPFIWCMLHNFGGNHGLFGTLETVNQGPAAARLFPNSTMVG 441
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG + AW +L +
Sbjct: 442 TGMAPEGIGQNEVVYALMAELGWRKDPVPDLGAWVASFAARRYGGIHQDAETAWRLLLRS 501
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + + NR P+ K L+ T+ +
Sbjct: 502 VYNCSGESCSGHNRS-------------------------PLVKRPSLQMNTT------V 530
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
WY+ S+V A L +A+ L+AS +RYDL+D+TRQA + + + + AY
Sbjct: 531 WYNRSDVFEAWRLLLATTPALAASPAFRYDLLDVTRQAAQELVSFYYGEVRTAYLNKELV 590
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
L A GV EL+ +D +LA FLLG WLE A+ A +E +
Sbjct: 591 HLLRAGGVLA-----YELLPALDEVLASDSRFLLGSWLEQARVAAVSEAE 635
>gi|414585094|tpg|DAA35665.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
Length = 1202
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 167/269 (62%), Positives = 209/269 (77%), Gaps = 6/269 (2%)
Query: 201 YIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEM 260
Y WCMLHNFA + EMYG+LD++A GP++AR S+N+TMVGVGMSMEGIEQNP+VYDLMSEM
Sbjct: 348 YNWCMLHNFAADFEMYGVLDALASGPIDARLSDNSTMVGVGMSMEGIEQNPIVYDLMSEM 407
Query: 261 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDV 320
AF H +VD++ W+ Y RRYG+ V +QDAW +LY T+YNCTDG DKNRDVIVAFPDV
Sbjct: 408 AFHHRQVDLQVWVKTYPTRRYGKPVKGLQDAWWILYRTLYNCTDGKNDKNRDVIVAFPDV 467
Query: 321 DPSIISVTEGKYQN----YGKPVSKEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASG 375
+P +I+ T G + N Y SK + K +S +Y+HPHLWY T+ VI ALELF+ G
Sbjct: 468 EPFVIA-TPGLHVNTRQMYSTVPSKNYIRKDVSSDAYEHPHLWYDTNAVIHALELFLQHG 526
Query: 376 NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDG 435
+E+S SNT+RYDL+DLTRQ LAKYAN++FL IIE+Y+ N+ + V L + FL LV D+D
Sbjct: 527 DEVSDSNTFRYDLVDLTRQVLAKYANDVFLKIIESYKSNNMNQVTILCQHFLSLVNDLDT 586
Query: 436 LLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
LL+ H+GFLLGPWLESAK LA+N EQE Q
Sbjct: 587 LLSSHEGFLLGPWLESAKGLARNSEQEIQ 615
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 85/110 (77%), Positives = 101/110 (91%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQ+WLD QLVLQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 240 MANMHGWGGPLPQTWLDDQLVLQKKILSRMYSFGMFPVLPAFSGNIPAALKSKFPSAKVT 299
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYN 110
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYGRTSHIYN
Sbjct: 300 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFIEEQIREYGRTSHIYN 349
>gi|395532374|ref|XP_003768245.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Sarcophilus
harrisii]
Length = 726
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 189/470 (40%), Positives = 272/470 (57%), Gaps = 38/470 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPL SW +Q LQ +IL R+ GM PVLPAF+G+VP A VFP A +T
Sbjct: 187 MGNLHSWGGPLSSSWHRKQSSLQYQILERMRSFGMKPVLPAFAGHVPKAFTRVFPQAYVT 246
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W + C+YLL DPLF +G F+ + +E+G T HIY+ DTF+E PP
Sbjct: 247 HLGMWGHFNCT--YSCSYLLAPEDPLFPVVGSLFLRELTQEFG-TDHIYSADTFNEMEPP 303
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW+PPQ+KA+L +VPLG+L+V
Sbjct: 304 SSEPAYLAAATAAVYEAMIAVDVDAVWLLQGWLFQHQPDFWKPPQVKAVLKAVPLGRLLV 363
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL+AE KP++S + FYG P+IWCMLHNF GN ++G LD++ GP +A N+T VG
Sbjct: 364 LDLYAESKPVYSRTDSFYGQPFIWCMLHNFGGNHGLFGALDAVNRGPSDAWLFPNSTFVG 423
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+ EGI QN VVY LM+E+ +Q + D+ AW+ ++ +RYG + AW +L +
Sbjct: 424 TGIVPEGINQNEVVYALMAELGWQKGPLPDLGAWVAGFAAQRYGTPHSHAEAAWKLLLQS 483
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ D T NR +V P + I +
Sbjct: 484 VYNCSGDLCTGHNRSPLVKRPSLHLDI-------------------------------SV 512
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L++S +RYDL+D+TRQ + + + + A++
Sbjct: 513 WYNRSDVFEAWRLLLEAAPVLASSPAFRYDLLDVTRQVAQELVSLYYEELRTAFEAGAMP 572
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVR 466
+ + +L+ +D LLA + FLLG WLE A+++A +E + Q +
Sbjct: 573 ALLTAGGLLVFDLLPSLDELLASDERFLLGAWLEQAREMAVSEAEAWQYK 622
>gi|291406137|ref|XP_002719212.1| PREDICTED: alpha-N-acetylglucosaminidase [Oryctolagus cuniculus]
Length = 743
Score = 348 bits (892), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 189/458 (41%), Positives = 268/458 (58%), Gaps = 48/458 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWAGPLPRSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAVTRVFPHINVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DP+F IG F+ + +E+G T H+Y DTF+E PP
Sbjct: 264 QLGSWGHFNCS--YSCSFLLAPEDPMFPLIGSLFLRELTREFG-TDHVYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA++ M + D DAVWL+QGWLF + P FW P Q+KA+LN+VP G+L+V
Sbjct: 321 SSEPSYLAAATAAVFEAMIAVDPDAVWLLQGWLFQHQPQFWGPSQVKAVLNAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 381 LDLFAENQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G++ EGI QN VVY LM+E+ ++ E V D++AW+ ++ RRYG + P AW +L +
Sbjct: 441 TGIAPEGISQNEVVYALMAELGWRKEPVPDLEAWVTSFAGRRYGVAHPDAGAAWRLLLRS 500
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ D NR +V P L+ T+ +
Sbjct: 501 VYNCSGDACRGHNRSPLVRRPS-------------------------LQLNTT------V 529
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
WY+ S+V A L + + L++S +RYDL+D+TRQA+ + + + AY
Sbjct: 530 WYNRSDVFEAWRLLLKATPTLASSPAFRYDLLDVTRQAVQELVSLYYEEARTAYLHKELA 589
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 449
L A GV EL+ +D +LA FLLG WL
Sbjct: 590 TLLRAGGVLA-----YELLPALDRVLATDSRFLLGSWL 622
>gi|196001339|ref|XP_002110537.1| hypothetical protein TRIADDRAFT_54660 [Trichoplax adhaerens]
gi|190586488|gb|EDV26541.1| hypothetical protein TRIADDRAFT_54660 [Trichoplax adhaerens]
Length = 757
Score = 347 bits (890), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 181/473 (38%), Positives = 269/473 (56%), Gaps = 35/473 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ W GPLP W+++Q+ LQ KIL R+ + GM P+LPAF+GN+P AL ++P AKI
Sbjct: 217 MGNIQRWAGPLPHDWINKQITLQVKILDRMRKYGMLPILPAFNGNIPNALTKIYPKAKIV 276
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ WF R+ T LLD D LFI I + FIE+++K YG T H+Y+ D F+E P
Sbjct: 277 KSSPWFGFSK--RYGETALLDPRDKLFIVISKLFIEEEIKAYG-TDHLYSLDLFNEIDPQ 333
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
EY++++ + Y + S D+ AVW+MQGW+F D + W +++A L+ +P G++V+
Sbjct: 334 SKELEYLTAVSKSAYLALNSADTKAVWIMQGWMFYNDNYYWENKRIQAFLSPIPKGRIVI 393
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAEV+P + S F+G P+IWCML+NF GN MYG ++I G + A +N+TM+G
Sbjct: 394 LDLFAEVEPQYHRSNSFFGHPFIWCMLNNFGGNAGMYGTFETITEGAISAYDMKNSTMIG 453
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
GM+ EGI N ++YDLM+EM ++ VDV+ W+ Y+ RRYG I AW L TV
Sbjct: 454 TGMAPEGIGNNYIMYDLMAEMGWRKIAVDVRDWVVVYTERRYGGLDENIIKAWLRLSETV 513
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNC D R A P V PS+ + +WY
Sbjct: 514 YNCNDMRQYHCR----ALPAVRPSLKIAND---------------------------VWY 542
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
S ++ A E + + NE + T++YD++D+TRQAL + A ++ + + Y N+ +
Sbjct: 543 SADDIFFAWEHMLRANNEFISEETFQYDIVDVTRQALQELAFIMYKKVTQCYHDNNQETL 602
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVRCPYVSQ 472
+EL DMD LL + FLLG W+ A Q + N ++Q+R ++Q
Sbjct: 603 KTAGGELIELFTDMDTLLGTNSHFLLGRWVADALQHSNNISIKQQLRFNALNQ 655
>gi|351699889|gb|EHB02808.1| Alpha-N-acetylglucosaminidase, partial [Heterocephalus glaber]
Length = 652
Score = 347 bits (889), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 185/459 (40%), Positives = 271/459 (59%), Gaps = 36/459 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLP +W +QL LQ ++L R+ LGM PVLPAF+G+VP A+ VFP +T
Sbjct: 114 MGNLHGWGGPLPHAWHLKQLYLQHRVLDRMRALGMTPVLPAFAGHVPKAVTRVFPQVNVT 173
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF IG F+ + +E+G T H Y DTF+E PP
Sbjct: 174 QLGSWGHFNCS--YSCSFLLAPGDPLFPLIGSLFLRELNREFG-TDHFYGADTFNEMQPP 230
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 231 SSEPAYLAAATAAVYEAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVGAVLGAVPQGRLLV 290
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+T+VG
Sbjct: 291 LDLFAENQPVYTRTASFGGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTVVG 350
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G++ EGI QN VVY LM+E+ ++ + V D+ AW+ +++ +RYG + P AW +L H+
Sbjct: 351 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLSAWVARFAEQRYGVAQPDAVLAWRLLLHS 410
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC+ A + P+ + L+ T+ +W
Sbjct: 411 VYNCSGEACRGHNH------------------------SPLVRRPSLQMNTT------VW 440
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y+ S+V A L + + L+AS +RYDL+D+TRQ L + + + AY + G
Sbjct: 441 YNRSDVFEAWRLLLKATPNLTASPAFRYDLLDVTRQGLQELVSLYYEEARAAYMRQELEG 500
Query: 419 VFQLSRRF-LELVEDMDGLLACHDGFLLGPWLESAKQLA 456
+ + +L+ +D +LA FLLG WLE A+ +A
Sbjct: 501 LLRAGGVLAYKLLPALDEVLASDHRFLLGSWLEQARAVA 539
>gi|156545487|ref|XP_001606979.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Nasonia vitripennis]
Length = 755
Score = 346 bits (888), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 181/457 (39%), Positives = 264/457 (57%), Gaps = 35/457 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N GWGGPL Q+W + + LQ I+ R+ ELG+ PVLPAF+G+VP VFP A +T
Sbjct: 221 MGNFRGWGGPLSQAWHNHTIQLQHSIVRRMRELGITPVLPAFAGHVPRDFIRVFPEANVT 280
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ +W + ++CC Y LD TDPLF +GR F++ E+G T+HIYNCD+F+EN P
Sbjct: 281 KVVSWNGFED--QYCCPYSLDPTDPLFKTVGREFLKAYTDEFG-TNHIYNCDSFNENDPH 337
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+Y+S+ G AIYSGM D DA+WLMQGWLF + FW P++KA + SVP+GK+++
Sbjct: 338 TGDLDYLSNTGKAIYSGMTGADPDAIWLMQGWLFVHSEYFWTFPRVKAFVTSVPIGKMII 397
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P + ++G P+IWCMLHNF G + M+G I G EART+ +TM+G
Sbjct: 398 LDLQSEQFPQYKRFHSYFGQPFIWCMLHNFGGTLGMFGSAGVINKGVFEARTTNGSTMIG 457
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI QN V+Y+ M+EM+++ + V + W Y+VRRYG++ +I+ +W L +
Sbjct: 458 TGLTPEGINQNYVIYEFMNEMSYRKKPVVLDNWFENYAVRRYGQADESIRTSWQELGREL 517
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YN DG T G Y ++K L E P WY
Sbjct: 518 YNY-DGKTK-------------------IRGHY-----VITKRPSLNIE------PWYWY 546
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ F+ +GN + +++DL+D+TRQAL A+ ++ +I AY + +
Sbjct: 547 DLKTFLAVWNSFVHAGNGTMKNELFKHDLVDITRQALQITADFIYADIKAAYTQKNLTQL 606
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
S L+L +D++ LA FLLG WLE AK +A
Sbjct: 607 QIASSHLLDLFDDLEKNLASSKDFLLGSWLEDAKAIA 643
>gi|307192254|gb|EFN75548.1| Alpha-N-acetylglucosaminidase [Harpegnathos saltator]
Length = 741
Score = 346 bits (888), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 175/464 (37%), Positives = 267/464 (57%), Gaps = 35/464 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W ++ + LQ +IL R+ +LG+ PVLPAF+G+VP A +FP+A +T
Sbjct: 194 MGNIRGFGGPLSINWHERTVRLQHRILRRMRDLGIVPVLPAFAGHVPRAFARLFPNANMT 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W K + ++CC YLL+ TDPLF IG F+ + E+G T HIYNCDTF+EN P
Sbjct: 254 KIEPW--NKFEDKYCCPYLLEPTDPLFQTIGEKFLRMYINEFG-TDHIYNCDTFNENEPG 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
Y+S++G +++ M + D A+WLMQGWLF +D FW P++++ L SVP G+++V
Sbjct: 311 NSELAYLSNVGRSVFQAMSTVDPQAIWLMQGWLFVHDFIFWTEPRVRSFLTSVPTGRMLV 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P + K +YG P+IWCMLHNF G + M+G I E R +TMVG
Sbjct: 371 LDLQSEQFPQYGRLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINQRTFEGRHMNGSTMVG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI QN V+Y+LM+EMA++HE VD+ AW Y+ RRYG AW L T+
Sbjct: 431 TGLTPEGINQNYVIYELMNEMAYRHEPVDLDAWFESYATRRYGAWNEYAVAAWKHLGRTI 490
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YN + VI P ++ S P +WY
Sbjct: 491 YNFVGIERIRGHYVITRRPSLNIS-------------------------------PWVWY 519
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ + +F+ + + YR+D++D+TRQAL A+ +++N+++ Y+ + G
Sbjct: 520 NREDFYHTWNVFLKARYGRGNNTLYRHDVVDITRQALQLMADNIYMNVVDCYKRKNITGF 579
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ L+L +D++ +LA FLLG WL AK +A +E++ +
Sbjct: 580 QSHAAALLDLFDDIEAILASGSNFLLGTWLAQAKDMAVDEKERQ 623
>gi|403304646|ref|XP_003942904.1| PREDICTED: alpha-N-acetylglucosaminidase [Saimiri boliviensis
boliviensis]
Length = 754
Score = 346 bits (888), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 188/464 (40%), Positives = 276/464 (59%), Gaps = 36/464 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP++W +QL LQ +IL R+ GM PVLPAFSG+VP A+ VFP +T
Sbjct: 214 MGNLHTWDGPLPRAWHIKQLYLQHRILDRMRSFGMIPVLPAFSGHVPRAINRVFPRVNVT 273
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+G+W + C++LL DP+F +G F+ + KE+G T HIY DTF+E PP
Sbjct: 274 QMGSWGHFNCS--YSCSFLLAPEDPIFPILGSLFLRELTKEFG-TDHIYGADTFNEMQPP 330
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D+DAVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 331 SSEPSYLAAATAAVYEAMIAVDTDAVWLLQGWLFQHQPQFWGPAQVRAVLGAVPRGRLLV 390
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 391 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPEAARLFPNSTMVG 450
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E++++ + V D+ AW+ ++ +RYG S P AW +L +
Sbjct: 451 TGMAPEGINQNEVVYSLMAELSWRKDPVPDLAAWVTSFATQRYGVSHPDAGAAWRLLLRS 510
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC+ A + P+ + L+ T+ +W
Sbjct: 511 VYNCSGEACRGHNH------------------------SPLVRRPSLQMNTT------VW 540
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y+ S+V A L +++ L+AS T+RYDL+D+TRQA+ + + AY + H
Sbjct: 541 YNRSDVFEAWRLLLSAAATLAASPTFRYDLLDVTRQAVQELVGLYYEEARSAYLSKELHS 600
Query: 419 VFQLSRRF-LELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
+ + EL+ +D +LA FLLG WLE A+ +A +E +
Sbjct: 601 LLRAGGILAYELLPALDEVLASDSHFLLGSWLEQARAVAVSEAE 644
>gi|431890602|gb|ELK01481.1| Alpha-N-acetylglucosaminidase [Pteropus alecto]
Length = 740
Score = 344 bits (883), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 186/465 (40%), Positives = 270/465 (58%), Gaps = 38/465 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 201 MGNLHTWGGPLPFSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVT 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ +W + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 261 QMDSWGHFNCS--YSCSFLLAPEDPLFPIVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 317
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 318 SSEPSYLAAATAAVYQAMTTVDPDAVWLLQGWLFQHQPQFWGPAQVGAVLGAVPRGRLLV 377
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 378 LDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNQGPAAARLFPNSTMVG 437
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI+QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S + AW +L +
Sbjct: 438 TGMAPEGIDQNEVVYALMAELGWRKDPVTDLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 497
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + N +V PS+ VT +
Sbjct: 498 VYNCSGEDCRGHNHSPLVR----RPSLQMVTT---------------------------V 526
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A + + + L+ S + Y+L+D+TRQA+ + + + + AY D
Sbjct: 527 WYNQSDVFEAWRMLLTATPTLATSPLFSYELVDITRQAIQELVSLYYEEVRTAYLNKDLV 586
Query: 418 GVFQLSRRF-LELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
+F+ + EL+ +D +LA FLLG WLE A+ A ++ +
Sbjct: 587 TLFRAAGILAYELLPSLDNILATDSHFLLGSWLEQARAAAVSKAE 631
>gi|328778968|ref|XP_623833.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Apis mellifera]
Length = 752
Score = 343 bits (881), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 176/465 (37%), Positives = 272/465 (58%), Gaps = 37/465 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W D+ + LQ +IL R+ LG+ PVLPAF+G+VP AL +FP A +T
Sbjct: 198 MGNMRGFGGPLNSNWHDKSIRLQHRILERMRALGIIPVLPAFAGHVPRALLKLFPKANVT 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + ++CC YLL+ TDPLF +IG+ F++ ++E+G T H+YNCDTF+EN P
Sbjct: 258 KSAVWNNFSD--KYCCPYLLEPTDPLFKQIGQQFLKTYIEEFG-TDHVYNCDTFNENEPY 314
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++G +I+ M S DS A+WLMQGWLF +D FW P+ + L SVPLG+++V
Sbjct: 315 TSELKFLRNIGHSIFEAMNSVDSKAIWLMQGWLFYHDSVFWTEPRTRTFLTSVPLGRMIV 374
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P + +YG P+IWCMLHNF G + M+G + I EAR +TMVG
Sbjct: 375 LDLQSEQFPQYKRLNSYYGQPFIWCMLHNFGGTLGMFGSAEIINHRVFEARNMNGSTMVG 434
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPAIQDAWNVLYH 297
G++ EGI QN V+Y+LM+EMA++ + V++ W ++ RRYG + AW +
Sbjct: 435 TGLTPEGINQNYVIYELMNEMAYRKKPVNLDKWFENFANRRYGDIKGNEHTVTAWKGFKN 494
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
TVYN +D + + VI P+++ P
Sbjct: 495 TVYNFSDTRRIRGKYVITIRPNLNF-------------------------------FPWR 523
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ I + + + + S YR+D++D+TRQAL A+E++ ++IE++ +
Sbjct: 524 WYNKDAFIYYWYVLLQARDLKRNSTLYRHDVVDVTRQALQLIADEIYTDLIESFNKKNID 583
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
Q ++ L L +D++ +LA + FLLG WL+ AK LA ++E+E
Sbjct: 584 LFKQNAKLLLALFDDLEEILASSEDFLLGKWLKMAKDLATDDEEE 628
>gi|426238065|ref|XP_004012978.1| PREDICTED: alpha-N-acetylglucosaminidase isoform 1 [Ovis aries]
Length = 748
Score = 343 bits (880), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 192/469 (40%), Positives = 269/469 (57%), Gaps = 46/469 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 211 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 270
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+G+W + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 271 QMGSWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 327
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D AVWL+QGWLF P FW P Q+ A+L +VP G+L+V
Sbjct: 328 SSEPSYLAAATAAVYQAMTAVDPGAVWLLQGWLFQNQPEFWGPAQVAAVLGAVPRGRLLV 387
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+T+VG
Sbjct: 388 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPATARRFPNSTLVG 447
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S + AW +L +
Sbjct: 448 TGMAPEGIGQNEVVYALMAELGWRKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 507
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC S E + N+ P+ K L T+ +W
Sbjct: 508 VYNC-----------------------SGEECRGHNH-SPLVKRPSLHMVTT------VW 537
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 412
Y+ S+V A L + + L++S +RYDL+D+TRQA+ + + + + AY
Sbjct: 538 YNRSDVFEAWRLLLTATPTLASSPAFRYDLVDVTRQAVQELVSLYYEEMRTAYLKKELVP 597
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
L A G+ EL+ +D +LA FLLG WLE A+ A +E +
Sbjct: 598 LMRAGGILA-----YELLPALDQVLASDCHFLLGSWLEQARLAAVSETE 641
>gi|426238067|ref|XP_004012979.1| PREDICTED: alpha-N-acetylglucosaminidase isoform 2 [Ovis aries]
Length = 739
Score = 343 bits (879), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 192/469 (40%), Positives = 269/469 (57%), Gaps = 46/469 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 202 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+G+W + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 262 QMGSWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D AVWL+QGWLF P FW P Q+ A+L +VP G+L+V
Sbjct: 319 SSEPSYLAAATAAVYQAMTAVDPGAVWLLQGWLFQNQPEFWGPAQVAAVLGAVPRGRLLV 378
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+T+VG
Sbjct: 379 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPATARRFPNSTLVG 438
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S + AW +L +
Sbjct: 439 TGMAPEGIGQNEVVYALMAELGWRKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 498
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC S E + N+ P+ K L T+ +W
Sbjct: 499 VYNC-----------------------SGEECRGHNH-SPLVKRPSLHMVTT------VW 528
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 412
Y+ S+V A L + + L++S +RYDL+D+TRQA+ + + + + AY
Sbjct: 529 YNRSDVFEAWRLLLTATPTLASSPAFRYDLVDVTRQAVQELVSLYYEEMRTAYLKKELVP 588
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
L A G+ EL+ +D +LA FLLG WLE A+ A +E +
Sbjct: 589 LMRAGGILA-----YELLPALDQVLASDCHFLLGSWLEQARLAAVSETE 632
>gi|380030624|ref|XP_003698943.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
[Apis florea]
Length = 769
Score = 342 bits (876), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 173/465 (37%), Positives = 269/465 (57%), Gaps = 37/465 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W ++ + LQ +IL R+ LG+ PVLPAF+G+VP A +FP A +T
Sbjct: 216 MGNMRGFGGPLSSNWHEKSIRLQHRILERMRALGIIPVLPAFAGHVPRAFLRLFPKANVT 275
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + ++CC YLL+ DPLF +IG+ F++ ++E+G T H+YNCDTF+EN P
Sbjct: 276 KSAVWNNFSD--KYCCPYLLEPMDPLFKQIGQQFLKTYIEEFG-TDHVYNCDTFNENEPY 332
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++G +I+ M + DS A+WLMQGWLF +D FW P+ + L S+PLG+++V
Sbjct: 333 TSELKFLRNIGHSIFEAMSNVDSKAIWLMQGWLFYHDSVFWTEPRTRTFLTSIPLGRMIV 392
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P + +YG P+IWCMLHNF G + M+G + I EAR +TMVG
Sbjct: 393 LDLQSEQFPQYKRLNSYYGQPFIWCMLHNFGGTLGMFGSAEIINHRIFEARNMNGSTMVG 452
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPAIQDAWNVLYH 297
G++ EGI QN V+Y+LM+EMA++ V++ W Y+ RRYG + AW +
Sbjct: 453 TGLTPEGINQNYVIYELMNEMAYRKRPVNLDKWFENYANRRYGDTKGNEHTVTAWKGFKN 512
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
TVYN +D + + I P+++ S P
Sbjct: 513 TVYNFSDTRRIRGKYAITIRPNLNFS-------------------------------PWR 541
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ I + + + + S YR+D++D+TRQAL A+E++ ++IE++ +
Sbjct: 542 WYNKDAFIHYWYMLLQARDLKRNSTLYRHDVVDVTRQALQLIADEIYTDLIESFNKKNID 601
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
Q ++ L L +D++ +LA + FLLG WL+ AK LA N+E+E
Sbjct: 602 LFKQNAKLLLALFDDLEEILASSEDFLLGKWLKMAKDLATNDEEE 646
>gi|307168312|gb|EFN61518.1| Alpha-N-acetylglucosaminidase [Camponotus floridanus]
Length = 737
Score = 341 bits (875), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 174/464 (37%), Positives = 262/464 (56%), Gaps = 35/464 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W ++ + LQ +IL R+ LG+ PVLPAF+G+VP A +FP+A +T
Sbjct: 221 MGNIRGFGGPLSTNWHNRTIHLQHQILRRMRNLGIVPVLPAFAGHVPRAFARLFPNANMT 280
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W + + ++CC YLL+ TDPLF IG F+ + E+G T HIYNCDTF+EN P
Sbjct: 281 KINPWNNFED--KYCCPYLLEPTDPLFQIIGEKFLRMYINEFG-TDHIYNCDTFNENEPG 337
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
Y+ ++ A+++ + + DS A+WLMQ WLF +D FW P++K+ L SVP+G++++
Sbjct: 338 STELIYLRNVSHAVFAAINAVDSKAIWLMQAWLFVHDFMFWTEPRVKSFLTSVPMGRMLI 397
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P + K +YG P+IWCMLHNF G + M+G I E R +TMVG
Sbjct: 398 LDLQSEQFPQYGRLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINQRTFEGRNMNGSTMVG 457
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI QN V+Y+LM+EMA++HE VD+ AW Y+ RRYG W L TV
Sbjct: 458 TGLTPEGINQNYVIYELMNEMAYRHEPVDLDAWFQNYATRRYGAWNEYAVTTWQYLGRTV 517
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YN + V+ P ++ S+ +WY
Sbjct: 518 YNFIGSQRIRGHYVVTRRPSLNISL-------------------------------WIWY 546
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ F+ + + S YR+D++D+TRQAL ++L+ I+++Y+ +
Sbjct: 547 NRKNFYSMWNTFLKARHGRRNSTLYRHDVVDITRQALQLMGDDLYTIILDSYKKRNITAF 606
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ LEL +D++ +LA FLLG WL AK +A NEE+ K
Sbjct: 607 RSSANALLELFDDLESILASGSNFLLGTWLSQAKDVATNEEERK 650
>gi|256422141|ref|YP_003122794.1| alpha-N-acetylglucosaminidase [Chitinophaga pinensis DSM 2588]
gi|256037049|gb|ACU60593.1| Alpha-N-acetylglucosaminidase [Chitinophaga pinensis DSM 2588]
Length = 728
Score = 341 bits (874), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 177/457 (38%), Positives = 266/457 (58%), Gaps = 40/457 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ WGGPLPQ W D VLQ++IL +GM P+LPAF+G+VP A ++ +P+ +I
Sbjct: 195 MGNIDAWGGPLPQHWKDSHKVLQQQILAAERSMGMLPILPAFTGHVPPAFKDKYPN-EIV 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ NW D + Y+LD P+F +IG+ F+E Q K +G T H Y+ DTF+EN PP
Sbjct: 254 KPTNW-----DAGFPDVYILDPNSPMFDKIGKKFLEAQTKAFG-TDHFYSADTFNENVPP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
++ ++ +Y+ M + D AVW+MQGW+F Y+ +W PQ++ALLN+VP ++V
Sbjct: 308 SSDSSFLDAMSRKVYASMAAADPKAVWVMQGWMFHYNASYWHQPQIRALLNAVPDDHMIV 367
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMV 238
LDL++E P W ++ +YG P+IW MLHNF GN M+G +D+ A P A + M
Sbjct: 368 LDLYSESHPEWRNTQAYYGKPWIWNMLHNFGGNTGMWGGMDAAAHDPATALHDPASGKMS 427
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G++ EGIEQNP +Y LM + ++ + ++V W+ Y+ +RYG A+ AW +LYHT
Sbjct: 428 GIGLTPEGIEQNPALYQLMIDNVWRDQPINVDTWLQSYAKQRYGAENEAVNKAWQILYHT 487
Query: 299 VY--NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
VY T+GA + +IVA P +D ++ E V
Sbjct: 488 VYIGGPTEGAPES---IIVARPTLD-----------------IAAERV---------KTK 518
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
L Y ++V+ A +LFI + +L + ++YDL+DLTRQ L YA+ L + AY+ D
Sbjct: 519 LEYDPAKVVPAWDLFINAAAQLKPTEGFKYDLVDLTRQVLGNYASPLQQRVATAYRNKDL 578
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
Q S +F+ L++DMD LL +GFLLG W+ A+
Sbjct: 579 AAFKQYSTQFIGLLDDMDMLLGTQEGFLLGKWVSDAR 615
>gi|198433857|ref|XP_002122480.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 880
Score = 340 bits (871), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 189/455 (41%), Positives = 263/455 (57%), Gaps = 37/455 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLP W+ QL+LQ +IL+R+ LGM PVLP F+G++P+A+ N++P A +
Sbjct: 211 MGNLHGWGGPLPSFWIKSQLILQHQILIRMRSLGMIPVLPGFAGHIPSAILNLYPKADVI 270
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QL +W + CTYLL DPLF IG FI++Q+ EY T+HIYN DTF+E TPP
Sbjct: 271 QLSHWSHFNCT--YSCTYLLQPHDPLFNTIGSMFIKEQMLEYNGTNHIYNADTFNEMTPP 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S+ A+Y M D DAVWLMQGWLF ++P FW+ Q KALL VP GK++V
Sbjct: 329 SSDPGYLSNASRAVYDAMAVADPDAVWLMQGWLFHHEPTFWKTAQKKALLTGVPKGKMLV 388
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLF+E P + ++G P++WCMLH+F GN+ YG ++++ P A TS N+TMVG
Sbjct: 389 LDLFSESYPQY-LPDWYFGQPFLWCMLHDFGGNMGFYGKINTVNTQPGIALTSVNSTMVG 447
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI QN ++YD M E F V+V W+ +Y++RRY S P WN+L +T+
Sbjct: 448 TGVTPEGINQNYMIYDFMLETGFTVHSVNVTNWLKEYTMRRYNTSSPEAIKTWNILGNTI 507
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL-- 357
YN D FP S+I G PV + + D+P L
Sbjct: 508 YN----------DTKPGFP--SKSLIR---------GSPVKRPTL--------DNPGLPY 538
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY S + A + F S N L T RYD +D+TRQ L L+ ++E +
Sbjct: 539 WYQYSSLALAWDNFSQSLNTLKDLETVRYDAVDITRQMLQAVHRLLYYAMVEEFLWKRDP 598
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 452
G +L + L+L++D D +L F +G W++ A
Sbjct: 599 G--KLGEQLLDLLDDFDKMLCSDAHFSMGKWIQDA 631
Score = 38.9 bits (89), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 48/96 (50%), Gaps = 2/96 (2%)
Query: 367 ALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRF 426
A + F S N L T RYD +D+TRQ L L+ ++E + G +L +
Sbjct: 680 AWDNFSQSLNTLKDLETVRYDAVDITRQMLQAVHRLLYYAMVEEFLWKRDPG--KLGEQL 737
Query: 427 LELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
L+L++D D +L F +G W++ AK L E++
Sbjct: 738 LDLLDDFDKMLCSDAHFSMGKWIQDAKILGTTAEEK 773
>gi|325103828|ref|YP_004273482.1| alpha-N-acetylglucosaminidase [Pedobacter saltans DSM 12145]
gi|324972676|gb|ADY51660.1| Alpha-N-acetylglucosaminidase [Pedobacter saltans DSM 12145]
Length = 738
Score = 340 bits (871), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 182/467 (38%), Positives = 264/467 (56%), Gaps = 37/467 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ WGGPLPQSW+D LQKKIL R ELGM PVLPAF+G+VP + FP AK+
Sbjct: 195 MNNMDAWGGPLPQSWIDSHKDLQKKILARQRELGMIPVLPAFTGHVPKSFVKKFPEAKVD 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
SV + Y+L+ DP+F +IG F+++Q +EYG T H Y+ D F+E PP
Sbjct: 255 ------SVNWQGNFPNIYMLNPNDPMFSKIGEQFLKEQTREYG-TDHYYSSDIFNELNPP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY---DPFWRPPQMKALLNSVPLGKL 177
P+Y+ + +YS M+ D +VW+MQ WLF FW P +M+A L VP KL
Sbjct: 308 SSDPKYLYDISEKVYSSMKKVDPKSVWVMQAWLFVSAHGRKFWTPERMQAFLKPVPDDKL 367
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTT 236
++LDL+ E +P W ++ +YG ++W MLHNF GNI ++G +IA P +
Sbjct: 368 IILDLYTENRPRWKNTEGYYGKKWVWNMLHNFGGNIGLFGKAQTIASEPARVLSDPMKGN 427
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
G+G++MEGIEQNP +Y LM + + +E ++++ W N+Y RRYG AW +L
Sbjct: 428 YSGIGLTMEGIEQNPFIYQLMLDHVWNNEPIELEKWTNKYITRRYGVLDNNAVKAWEILL 487
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
+TVY D N+D SI+S G+P ++ S +
Sbjct: 488 NTVY------KDNNKD-----QGAPESILS---------GRPTF------AQNSYWTWTD 521
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
L+Y E +RA + I S ++L S+ ++YD++D+TRQA+A YA L + Y D
Sbjct: 522 LYYDNREFVRAWDYLIKSADKLRNSDGFQYDIVDITRQAMANYATALQRQLAYTYYAGDV 581
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ + SRRFLEL+ D+D LLA FLLG W++ AK+ A N+ + K
Sbjct: 582 NTYEKESRRFLELLSDLDRLLATRKDFLLGIWIDDAKKWATNDAERK 628
>gi|332018247|gb|EGI58852.1| Alpha-N-acetylglucosaminidase [Acromyrmex echinatior]
Length = 686
Score = 340 bits (871), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 175/464 (37%), Positives = 265/464 (57%), Gaps = 35/464 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W + + LQ +IL R+ +LG+ PVLPAF+G+VP A +FP+A +T
Sbjct: 152 MGNIRGFGGPLSSNWHNYTIRLQHQILQRMRDLGIVPVLPAFAGHVPRAFARLFPNANMT 211
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W K + ++CC YLL+ TDPLF IG F++ + E+G T HIYNCDTF+EN P
Sbjct: 212 KINPW--NKFEDKYCCPYLLEPTDPLFRTIGEKFLQMYIDEFG-TDHIYNCDTFNENEPG 268
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
Y+ ++G +I+S M + DS A+WLMQ WLF +D FW +++A L SVP+G+++V
Sbjct: 269 NTELIYLRNVGHSIFSAMNAVDSKAIWLMQAWLFVHDIMFWTKSRVRAFLTSVPIGRMLV 328
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P + K +YG P+IWCMLHNF G + M+G I E R ++TMVG
Sbjct: 329 LDLQSEQFPQYDRLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINQRTFEGRNMNDSTMVG 388
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI QN V+Y+LM+EMA++H V++ W Y+ RRYG AW L TV
Sbjct: 389 TGLTPEGINQNYVIYELMNEMAYRHVPVNLDNWFESYATRRYGAWNEYAVAAWQHLGRTV 448
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YN + VI P ++ S+ + WY
Sbjct: 449 YNFIGTQKIRGHYVITRRPSLNISLWT-------------------------------WY 477
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ +F+ + + YR+D++D+TRQAL A+++++ I++ Y+ +
Sbjct: 478 DRKDFYAMWNMFLKARYGRGNNTLYRHDVVDITRQALQLIADDIYMTILDCYKKKNITAF 537
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ LEL +D++ +LA + FLLG WL AK +A NEE+ +
Sbjct: 538 QSSANALLELFDDLESILASGNNFLLGTWLAQAKDIAVNEEERR 581
>gi|449518399|ref|XP_004166229.1| PREDICTED: alpha-N-acetylglucosaminidase-like, partial [Cucumis
sativus]
Length = 336
Score = 334 bits (857), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 163/227 (71%), Positives = 190/227 (83%), Gaps = 2/227 (0%)
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VGVGMSMEGIEQNPVVYDLMSEMAFQH KVDVK W+ QYSVRRYG VP+IQDAW+VLYH
Sbjct: 1 VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH 60
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
TVYNCTDGA DKNRDVIVAFPDVDPS I V +G S +V + + +++D PHL
Sbjct: 61 TVYNCTDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDS--SVDRLQDATFDRPHL 118
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY TSEVI AL+LFIA G++LS+SNTYRYDL+DLTRQALAKY+NELF I++AYQL+D
Sbjct: 119 WYPTSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQ 178
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
+ LS+ FLELV D+D LLACH+GFLLGPWL+SAKQLA++EE+EKQ
Sbjct: 179 TMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQ 225
>gi|255533286|ref|YP_003093658.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
gi|255346270|gb|ACU05596.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
Length = 734
Score = 333 bits (855), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 172/465 (36%), Positives = 264/465 (56%), Gaps = 37/465 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL WGGP+ ++++ +Q LQKKIL R LGM P+LP+F+G+VP + ++ FP K+
Sbjct: 190 MGNLDAWGGPMSKNFMAKQEALQKKILARERALGMTPILPSFTGHVPPSFKDKFPDIKVN 249
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ + P Y+L+ P+F EIGR F+ + +G T H+Y+ DTF+E TP
Sbjct: 250 TQQ--WGINVSP----AYVLNPETPMFKEIGRKFLTALINTFG-TDHLYSADTFNEMTPV 302
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ Y++ + IY M + D+ AVW+MQGW+F P FW+P QMKAL ++VP KL+V
Sbjct: 303 SNDSTYLNGMAKKIYESMAAVDTQAVWIMQGWMFLDRPNFWQPTQMKALFSAVPQDKLIV 362
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMV 238
LDL +E+ P+WS + FYG +IWCMLHNF G + M+G + I P A + + M
Sbjct: 363 LDLNSELNPVWSRTDAFYGEKWIWCMLHNFGGRLSMFGDMSRIGNDPAAALKNDQRGKMS 422
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G++MEGIEQNP +Y LM E + + +D+ W+ Y+ RRYG+ + AW VL +T
Sbjct: 423 GIGLTMEGIEQNPAIYSLMLEHIWNDKPIDLDNWLKGYAQRRYGKRNSNAEKAWEVLKNT 482
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VY+ +I P D + + + + +
Sbjct: 483 VYSHQPWWG--TNTIITGRPTFDAATV--------------------------WTYTAIP 514
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
YS+ E+++A + + +EL +S+ ++YDL+D+TRQ LA YAN L + +Y+ D
Sbjct: 515 YSSKELMKAWSYLLTASDELKSSDGFQYDLVDVTRQVLANYANVLQQDFASSYKQKDMAT 574
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ S +FLEL++D+D LL FLLG W+ +AK L N ++K
Sbjct: 575 FNKKSAQFLELIDDIDQLLGTRSDFLLGKWINNAKALGDNPAEKK 619
>gi|320162905|gb|EFW39804.1| lysosomal alpha-N-acetyl glucosaminidase [Capsaspora owczarzaki
ATCC 30864]
Length = 786
Score = 328 bits (841), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 176/467 (37%), Positives = 268/467 (57%), Gaps = 33/467 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGP+ W+ +Q LQ IL R+ GM PVLP+F+G+VP+AL FP+A IT
Sbjct: 238 MGNIKGWGGPISLEWIYKQRNLQVLILQRMRTFGMTPVLPSFAGHVPSALAQHFPNANIT 297
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q +W + ++CC LDA+DPLF +IG F+ Q + YG T+H+YNCD F+E TP
Sbjct: 298 QSSDWNNFPD--QYCCVGFLDASDPLFTQIGAEFLRLQNETYG-TNHLYNCDQFNEMTPA 354
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
Y+ G A+Y M + D AVW+MQGWLF + +W +++ALL+ VP +++
Sbjct: 355 STDLGYLKQAGMAVYQSMTAYDPAAVWVMQGWLFFNEAAWWSNDRVQALLSGVPDDHMII 414
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLF++V P+W+ + +YG P+IW MLH+F GNI +YGIL SI GP A + TMVG
Sbjct: 415 LDLFSDVTPVWNRLESYYGKPFIWNMLHDFGGNIGLYGILPSINEGPFAALATPGNTMVG 474
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD-AWNVLYHT 298
+G++ EGI QN ++Y+ M E ++ V++ W++ + RRYG S PA+ A+ L +
Sbjct: 475 IGLTPEGINQNYILYEFMMENMWRSAPVNLPTWVDAFVGRRYGPSTPAVAKLAYQQLLQS 534
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNCT+G + ++ P V+ +S+ + + +L+
Sbjct: 535 VYNCTNGQYSVTKSLLEIRPAVN-----------------MSRNGFMPT--------NLY 569
Query: 359 YSTSEVIRALELFIA---SGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
Y VI A++ +A S +L++ +RYD++D TRQ L+ A + N+ A
Sbjct: 570 YDPGHVILAVDHILAAAKSAPQLASVVPFRYDVVDFTRQMLSNLAIDFHSNLTLALTSKQ 629
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
AH V + + L+ D+D LL FLLGPWL +A+ ++N +
Sbjct: 630 AHLVHLYGQGIVGLIADLDELLVSDAHFLLGPWLAAARSWSENTAAQ 676
>gi|390463730|ref|XP_003733088.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
[Callithrix jacchus]
Length = 830
Score = 327 bits (839), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 186/465 (40%), Positives = 262/465 (56%), Gaps = 45/465 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W PLP +W +QL LQ IL R+ GM PVLP F G+VP A+ VFP +T
Sbjct: 297 MGNLHTWDAPLPHAWHIKQLYLQHWILDRMRSFGMVPVLPMFLGHVPKAITRVFPRVSVT 356
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+G+W + C++LL DP+F +G F+ + KE+G T HIY DTF+E PP
Sbjct: 357 QMGSWGHFNCS--YSCSFLLAPEDPIFPILGSLFLRELTKEFG-TDHIYGADTFNEMQPP 413
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D+DAVWL+QGWLF Y P FW P Q++A+L S P G L+V
Sbjct: 414 SSEPSYLAAATAAVYEAMIAVDTDAVWLLQGWLFQYQPQFWGPAQVRAVLGSAPHGCLLV 473
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 474 LDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLFGALEAMNRGPEAARLFPNSTMVG 533
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+++ + V D+ AW YG S P AW +L +
Sbjct: 534 TGMAPEGISQNXVVYSLMAELSWXKDPVPDLVAWX-------YGVSHPDTGAAWRLLLRS 586
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC+ A + P+ + L+ T+ +W
Sbjct: 587 VYNCSGEACRGHNH------------------------SPLVRRPSLQMNTT------IW 616
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y+ S+V A L ++ L+AS T+RYDL+D+TRQ + + + + AY L+ G
Sbjct: 617 YNQSDVFEAWRLLFSAAATLAASPTFRYDLLDVTRQVVQELVSLYYEEARSAY-LSKELG 675
Query: 419 VFQLSRRFL--ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
+ L EL+ +D +LA FLLG WLE A+ +A +E +
Sbjct: 676 SLLRAGGILAYELLPALDEVLASDSHFLLGSWLEQARAVAVSEAE 720
>gi|332260899|ref|XP_003279518.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
[Nomascus leucogenys]
Length = 736
Score = 325 bits (833), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 174/446 (39%), Positives = 257/446 (57%), Gaps = 38/446 (8%)
Query: 20 LVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYL 79
L LQ ++L R+ +PVLPAF+G+VP A+ VFP +T++G+W + C++L
Sbjct: 216 LFLQHRVLDRMRSSAXDPVLPAFAGHVPEAVTRVFPRVNVTKMGSWGHFNCS--YSCSFL 273
Query: 80 LDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQ 139
L DP+F IG F+ + +KE+G T HIY DTF+E PP P Y+++ A+Y M
Sbjct: 274 LAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMI 332
Query: 140 SGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYG 198
+ D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L+VLDLFAE +P+++ + F G
Sbjct: 333 AVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRLLVLDLFAESQPVYTRTASFQG 392
Query: 199 VPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMS 258
P+IWCMLHNF GN ++G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+
Sbjct: 393 QPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMA 452
Query: 259 EMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVA 316
E+ ++ + V D+ AW+ ++ +RYG S P AW +L +VYNC+ + NR +V
Sbjct: 453 ELGWRKDPVPDLAAWVTSFAAQRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVR 512
Query: 317 FPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGN 376
P L+ TS +WY+ S+V A L + S
Sbjct: 513 RPS-------------------------LQMNTS------IWYNRSDVFEAWRLLLTSAP 541
Query: 377 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRF-LELVEDMDG 435
L+AS +RYDL+DLTRQA+ + + + AY + + + EL+ +D
Sbjct: 542 SLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLRKELASLLRAGGVLAYELLPALDE 601
Query: 436 LLACHDGFLLGPWLESAKQLAQNEEQ 461
+LA FLLG WLE A+ A +E +
Sbjct: 602 VLASDSRFLLGSWLELARAAAVSEAE 627
>gi|383856382|ref|XP_003703688.1| PREDICTED: alpha-N-acetylglucosaminidase [Megachile rotundata]
Length = 744
Score = 325 bits (833), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 179/472 (37%), Positives = 271/472 (57%), Gaps = 37/472 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGPL SW +Q + LQ KIL R+ LG+ PVLP+F+G+VP A +FP+A +T
Sbjct: 194 MGNIRAFGGPLYPSWHEQSINLQHKILERMRSLGIIPVLPSFAGHVPRAFPRLFPNANVT 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L W + +CC YLL TDPLF +IG+ F++ ++E+G T HIYNCDTF+EN P
Sbjct: 254 KLAPWNNFPD--VYCCLYLLAPTDPLFQQIGQLFLKTYIEEFG-TDHIYNCDTFNENEPH 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++G + + M + D DA+WLMQGWLF++D FW P+++A L SVP G+++V
Sbjct: 311 TSELKFLRNVGHSTFQAMNAVDPDAIWLMQGWLFTHDKLFWTEPRVEAFLTSVPRGRMIV 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P + K ++G P+IWCMLHNF G + M+G I E R +N+TMVG
Sbjct: 371 LDLQSEQFPQYGRLKSYFGQPFIWCMLHNFGGTLGMFGSAQIINQRVFEGRNMKNSTMVG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI QN V+Y+LM+EMA++ E V++ W Y+ RRYG AW L TV
Sbjct: 431 TGLTPEGINQNYVIYELMNEMAYRKEPVNLNKWFENYASRRYGVWNEYAVSAWQSLGRTV 490
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YN + + + VI P ++ S + WY
Sbjct: 491 YNFSGTRKIRGKYVISRRPSLNLSTWT-------------------------------WY 519
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ +F+ + + S YR+D++DLTRQ L A E++ +I+++ +
Sbjct: 520 DRDTLYNTWSVFLQARHGRRNSTLYRHDVVDLTRQVLQAKAEEIYPVLIDSFNKKNLTAF 579
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK--QVRCPY 469
S + L+L +D++ +LA FLLG WL++AK+LA N+E+ + QV Y
Sbjct: 580 KYHSDKLLDLFDDLELILASGKDFLLGKWLDAAKKLASNDEELRLYQVNAKY 631
>gi|194216885|ref|XP_001917396.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
[Equus caballus]
Length = 744
Score = 324 bits (831), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 189/470 (40%), Positives = 270/470 (57%), Gaps = 48/470 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPL +SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 205 MGNLHTWDGPLTRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVT 264
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 265 QLGSWGHFNCS--YSCSFLLAPEDPLFPVVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + FW P Q+ A+L +VP G+L+V
Sbjct: 322 SSEPAYLAAATAAVYQAMTAVDPDAVWLLQGWLFHHQRTFWGPAQVGAVLGAVPRGRLLV 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 382 LDLFAESQPMYIRTASFQGQPFIWCMLHNFGGNQGLFGALEAVNRGPAAARLFPNSTMVG 441
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D++AW+ ++ RRYG S + AW +L +
Sbjct: 442 TGMTPEGIGQNEVVYALMAELGWRKDPVADLEAWVTSFAARRYGVSHKDAETAWKLLLRS 501
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC+ A + P+ K L+ T+ +W
Sbjct: 502 VYNCSAEAYSGHNQ------------------------SPLVKRPSLQMGTT------VW 531
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY-------ANELFLNIIEAY 411
Y+ S+V A L + + L++S + YDL+D+TRQA + A +LN E
Sbjct: 532 YNRSDVFEAWWLLLTAAPALASSPAFLYDLVDVTRQAAQELISLYYEEARTAYLN-KELV 590
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
L A G+ EL+ +D +LA FLLG WL+ A+++A +E +
Sbjct: 591 PLLRAGGILA-----YELLPALDKVLASDSRFLLGSWLKQAREMAVSEAE 635
>gi|374385255|ref|ZP_09642763.1| hypothetical protein HMPREF9449_01149 [Odoribacter laneus YIT
12061]
gi|373226460|gb|EHP48786.1| hypothetical protein HMPREF9449_01149 [Odoribacter laneus YIT
12061]
Length = 736
Score = 322 bits (825), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 178/466 (38%), Positives = 258/466 (55%), Gaps = 40/466 (8%)
Query: 2 SNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ GW GPLP+SW++ LQKKIL R ELGM P+LPAF+G+VP + FP A++ Q
Sbjct: 199 GNIDGWCGPLPKSWMESHEELQKKILARERELGMTPILPAFTGHVPPTFKEHFPEARLRQ 258
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
+ NW + R+ TYLL+A DPLF IG F+E+Q++ +G T H+Y DTF+E PP
Sbjct: 259 V-NW-----EGRFDDTYLLEADDPLFQTIGNRFMEEQIRTFG-TDHLYGADTFNEMFPPS 311
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVVL 180
+ Y+ + A+Y M + D +AVW+MQGWLF FW+P QMKA L +VP L+VL
Sbjct: 312 EDSTYLDGISKAVYQSMAAVDPEAVWVMQGWLFHDKRDFWKPAQMKAYLGAVPDEHLIVL 371
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS-ENTTMVG 239
DL+ E PIW ++ FYG P+IWCMLHNF G ++G +A P ++G
Sbjct: 372 DLWGEEFPIWDRTEAFYGKPWIWCMLHNFGGRNMLFGNALKLAEEPSRVLADPAKGQLLG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G EGIEQNPV+Y L+ +++ V++ W Y RYG A++ AW++L TV
Sbjct: 432 LGAVPEGIEQNPVIYSLLFSHIWRNTAVELDEWFETYLESRYGCRDEAVEKAWDILRKTV 491
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y EG NY ++ + +++ + + Y
Sbjct: 492 Y--------------------------ANEG---NYESAITARPTFEKH-NNWAYTDIPY 521
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ EVI+A + + + + L + YRYDLI + +Q LA YA + E Y+ D
Sbjct: 522 NPVEVIKAWKYLLQAADRLGENPCYRYDLILVGKQVLANYATIIQQKFGEDYRTKDLPAF 581
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
+ SR F+EL++DMD L+ H+ FLLG WLE A+ + EKQ+
Sbjct: 582 TRNSREFMELIDDMDELMGTHEAFLLGKWLEDARSWGKT-ASEKQL 626
>gi|321472423|gb|EFX83393.1| hypothetical protein DAPPUDRAFT_301977 [Daphnia pulex]
Length = 799
Score = 322 bits (824), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 171/468 (36%), Positives = 273/468 (58%), Gaps = 38/468 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N WGGPL +W L+LQ KIL R+ GM PVLPAF+G+VP A++ V+P+A T
Sbjct: 212 MGNFRAWGGPLSDNWQQATLILQHKILERMRSFGMTPVLPAFAGHVPRAMERVYPNASYT 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L +W + + ++CC L T+PLF EIG FI++ E+G + H+YNCD F+E P
Sbjct: 272 HLTSWLNFQD--QYCCPLFLQPTEPLFTEIGSRFIKEMALEFG-SDHVYNCDVFNEVRPT 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P ++SS+G A+++ M + D DA+WLMQGWLF D +W KALL SVP G++++
Sbjct: 329 QADPVFVSSVGTAVFNAMTTADPDAIWLMQGWLFKSDADYWTADLSKALLTSVPQGRMLI 388
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL AE+ P + FYG P+++C+LHNF G + + G + I+ ++AR N+TMVG
Sbjct: 389 LDLQAELDPQYIRLNSFYGQPFVFCLLHNFGGTLGLNGAIQIISQRVIDARNFPNSTMVG 448
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++MEGI+QN VVYD M EM ++ + ++ W ++Y+VRRYG + A+ AW L ++V
Sbjct: 449 TGLTMEGIDQNYVVYDKMLEMGWRDKVPNLNQWFDEYTVRRYGVNNTAVMSAWRFLQNSV 508
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YN + + + + V+V P + + P +WY
Sbjct: 509 YNDSSRRSFRGQYVLVTRPAL-------------------------------WQLPFVWY 537
Query: 360 STSEVIRALELFIA---SGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
+ +VI A + I+ + LS ++ +R+D++DLTRQ++ + + L+ ++E Y ++
Sbjct: 538 NPHDVILAWDHLISGLMTEPLLSNASNFRHDMVDLTRQSMQEIFHLLYSQLLEVYLEKNS 597
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
+ ++ + ++L++D+D LL FLLG W+ AK E ++ Q
Sbjct: 598 TAIEGIAYKMIDLLQDLDELLQTGKKFLLGKWIADAKSWGTTEGEKLQ 645
>gi|301106961|ref|XP_002902563.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
infestans T30-4]
gi|262098437|gb|EEY56489.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
infestans T30-4]
Length = 684
Score = 320 bits (820), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 173/461 (37%), Positives = 264/461 (57%), Gaps = 32/461 (6%)
Query: 1 MSNLHG-W-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
M NL G W GPLPQ+++D Q LQ KIL R+ E GM P LPAF+G+VP ++ +FP+AK
Sbjct: 121 MGNLRGSWVKGPLPQAFIDSQYALQLKILNRMREFGMIPALPAFAGHVPEEMKALFPNAK 180
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
T+ NW + +CC Y+LD +DPL+ +IG+ F+E+Q Y TS +Y CDT++E
Sbjct: 181 FTRSPNWGDFSDE--FCCVYMLDFSDPLYYDIGKTFLEEQRALYDYTSSLYQCDTYNEMD 238
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
P P + + A+ M + D++AVWL+QGWLF P +W ++KA L+ V K+
Sbjct: 239 PDFTDPAKLQAASRAVIDSMTAADANAVWLIQGWLFENSPDYWTKNRVKAYLDGVSNEKM 298
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL++EV+P+WS ++G +++C+LHNF GN M G L ++ PV+A N TM
Sbjct: 299 IILDLYSEVRPVWSKMDNYFGKSWVYCVLHNFGGNTGMRGDLATLGTAPVQASRDSNGTM 358
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
+GVG++MEGI QN VVYDL +MA+ +D+ W+ ++ +RY + AW L
Sbjct: 359 IGVGLTMEGIYQNYVVYDLTLQMAWVDTPLDMDEWVPSFAAQRYHSQDVHTERAWGFLLQ 418
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
+VYN T G + +I P + K V + S T
Sbjct: 419 SVYNRTLGFGGVTKSLICLIP----------------HWKLVRDGFMPTSIT-------- 454
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ--LND 415
Y ++ RA + + +G+EL A +TYR+DL+D+TRQ L+ + +L++ E Y+
Sbjct: 455 -YDPMDITRAWKELLLAGSELHAVDTYRHDLVDVTRQFLSDHFMAQYLHLKEMYEGKTQP 513
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
AH + + R L +E MD +LA ++ LLG W+ A+ LA
Sbjct: 514 AHQLCAWTERMLLTIERMDEILATNEDSLLGNWIADARALA 554
>gi|449675146|ref|XP_002156234.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Hydra
magnipapillata]
Length = 646
Score = 320 bits (819), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 166/464 (35%), Positives = 254/464 (54%), Gaps = 31/464 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAAL-QNVFPSAKI 59
M NL GWGGPL SW +QL LQ+ I+ R+ GM PVLP F G++P AL +FP++K
Sbjct: 89 MGNLEGWGGPLSSSWYSKQLQLQQNIISRMRSFGMIPVLPGFGGHIPKALVSRLFPTSKY 148
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
+L W K ++ T+LLD DPLF ++G AF+E Q + Y T H+YN D F+E P
Sbjct: 149 YKLKPW--NKFTGKYGGTFLLDPQDPLFKKVGAAFVEMQKQLYNGTDHVYNADIFNEMDP 206
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
P + +I++ +Y+ M + DSDAVWLMQGW+F W+P ++A L ++P GKL++
Sbjct: 207 PQLTSAFITNTSIGVYNAMLASDSDAVWLMQGWMF-LSSVWKPELVEAWLQAIPYGKLII 265
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +++ P++ + FYG P+IWCM+ NF G +YG L + G + AR + + M+G
Sbjct: 266 LDLASDIYPLYDQTNAFYGHPFIWCMIENFGGTTRLYGQLTGVMKGVISARKTYKSFMIG 325
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
GM+ EGI QN + ++LM+EM +++E+ ++ W Y RRYG + DAW +L T+
Sbjct: 326 TGMTPEGINQNDINFELMNEMGWRNEEFNISDWTLSYIKRRYGDYPKMVSDAWLILIDTI 385
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNC DG + D + P + P + N PV H+WY
Sbjct: 386 YNCNDGRENGGYDGRI--PVMRPQL---------NAKLPV----------------HMWY 418
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
S ++ A +L + + + +T+R DL+ L Q L + + ++ Y V
Sbjct: 419 SIKDLYNAWKLMVKGSDYMPLIDTFRNDLVRLGTQVLEDLSIVFYTQMVSGYFNKSTLNV 478
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ + L+ DMD LLA LLG W++SA+ + + K
Sbjct: 479 EKYGSKITVLLTDMDRLLATDQYSLLGRWIQSARSMGDTLNETK 522
>gi|255533285|ref|YP_003093657.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
gi|255346269|gb|ACU05595.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
Length = 749
Score = 317 bits (811), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 172/464 (37%), Positives = 256/464 (55%), Gaps = 38/464 (8%)
Query: 2 SNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G GPLP+SW++ LQKKIL R ELGM P+LPAFSG+VP + FP+A++ +
Sbjct: 207 GNIDGLNGPLPKSWMESHEQLQKKILARERELGMKPILPAFSGHVPPTFKARFPNARVDR 266
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
L NW + R+ TY+L DPLF +I F+ +Q K +G T H+Y DTF+E P
Sbjct: 267 L-NW-----EGRFADTYVLHPDDPLFQQIADKFMAEQDKAFGNTDHLYGADTFNEMYLPY 320
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVVL 180
Y+ +G A+Y GM D +A+W+MQGW+F FW+P +K L+ VP L++L
Sbjct: 321 TDTAYVRKIGTAVYKGMAKADPEAIWVMQGWMFWDKRDFWKPEVVKNYLSGVPDDNLIML 380
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMVG 239
DLFA+ +PIW+ ++ F+G +IWCMLHNF G +YG L+ I P E N + G
Sbjct: 381 DLFADEQPIWTKTEAFWGKKWIWCMLHNFGGRNPLYGDLNYIGREPAEMVHDPNRGRLSG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G+ EGIEQNPVVY LM E + + +DVK+W+ Y+ RRYG+ P + AW +L+ TV
Sbjct: 441 IGLVPEGIEQNPVVYSLMLEHVWNDQVIDVKSWLVNYAQRRYGQRDPQTEKAWQILHQTV 500
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y +G+ + +IIS +P ++ ++ + D P Y
Sbjct: 501 Y-AKEGSYE--------------TIISA---------RPTHEK---HADWTGTDLP---Y 530
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+++ A + + N ++ Y++DL+ + RQ LA YA L ++ +
Sbjct: 531 DGDKLVPAWTYLLNASNRFKNNDCYQFDLVTVGRQVLANYATVLQRLFARDFRNKNLTAY 590
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ FL L+ DMD L+ FLLG WL AK+ A NE + +
Sbjct: 591 RAHTAEFLTLIADMDKLMGTRKDFLLGKWLNDAKKWATNESESR 634
>gi|428176410|gb|EKX45295.1| hypothetical protein GUITHDRAFT_51145, partial [Guillardia theta
CCMP2712]
Length = 680
Score = 315 bits (806), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 173/478 (36%), Positives = 272/478 (56%), Gaps = 35/478 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ WGG L QSW+DQQ LQ KIL R ELGM PVLPAF+G VP ++++FP AK T
Sbjct: 121 MINIKAWGGGLTQSWIDQQRDLQLKILARERELGMLPVLPAFAGGVPEGMKSLFPEAKFT 180
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ GNW CC ++D TDPLF++IG+ F+E+ YG ++HIY+CDTF+EN P
Sbjct: 181 RHGNWGGFAEQH--CCVMMVDPTDPLFLKIGKMFVEEVRAVYG-SNHIYSCDTFNENRPR 237
Query: 121 VDSP----EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLG 175
+ +++S A++ M++ D DAVWLMQGWLF D FW+ ++ A L+ VP
Sbjct: 238 SEHGSVGLDFLSHSSRAVFESMRAADPDAVWLMQGWLFMNDARFWQKRELDAYLSGVPED 297
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVP-----YIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
++++LDLF +V P+W P ++W MLH+F GN MYG L I+ PV A+
Sbjct: 298 RMIILDLFTDVFPVWKRRDLQRPTPIEKRRWVWNMLHSFGGNSGMYGRLQVISKDPVVAK 357
Query: 231 TSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG----RSVP 286
E+ TMVGVG++ EGIEQNPVVY++M+EM ++ ++VDV +W+ +++ RR G R
Sbjct: 358 -KESQTMVGVGITTEGIEQNPVVYEMMAEMRWREQEVDVMSWVEKWADRRLGPEASRERK 416
Query: 287 AI-QDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+ ++AW L TVY+C + + ++ + P +D + + + + +EA++
Sbjct: 417 ALGEEAWRELASTVYSCPGTQMGQVKSMVESRPRLDLASGWIPNSDFMPIKRHYPEEALV 476
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
++ W ++ L + + S++ +D+ D+TRQ L+ LF
Sbjct: 477 RA----------W------LKLLRATRGGADGYTCSSSASFDIADVTRQVLSDLFARLFQ 520
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ Q A + L ++ DMD ++ LLG W+E A+ +++E+E+
Sbjct: 521 PLSSFCQTRLAGSAAVRMQTLLGIISDMDKMVGTQPRMLLGKWIEDARAWGKSKEEEE 578
>gi|156121099|ref|NP_001095696.1| alpha-N-acetylglucosaminidase precursor [Bos taurus]
gi|151554244|gb|AAI48148.1| NAGLU protein [Bos taurus]
gi|296476361|tpg|DAA18476.1| TPA: alpha-N-acetylglucosaminidase [Bos taurus]
Length = 667
Score = 314 bits (804), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 166/383 (43%), Positives = 229/383 (59%), Gaps = 35/383 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 204 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+GNW + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 264 QMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATAAVYQAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVAAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+TMVG
Sbjct: 381 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPTTARHFPNSTMVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ +Q + V D+ AW+ ++ RRYG S + AW +L +
Sbjct: 441 TGMAPEGIGQNEVVYALMAELGWQKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 500
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC S E + N+ P+ + L+ T+ +W
Sbjct: 501 VYNC-----------------------SGEECRGHNH-SPLVRRPSLQMVTT------VW 530
Query: 359 YSTSEVIRALELFIASGNELSAS 381
Y+ S+V A L + + + L++S
Sbjct: 531 YNRSDVFEAWRLLLTATSTLASS 553
>gi|328867411|gb|EGG15793.1| alpha-N-acetylglucosaminidase [Dictyostelium fasciculatum]
Length = 1501
Score = 311 bits (797), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 170/472 (36%), Positives = 269/472 (56%), Gaps = 46/472 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++GWGGPL ++ Q LQ++IL R+ + GM PVLP F+G+VP A ++FP+A IT
Sbjct: 950 MGNVNGWGGPLDYDFIAGQHDLQQQILERMRQYGMKPVLPGFAGHVPRAFMSLFPTANIT 1009
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + TY LD +DPLF + + F++ Q YG T H Y+ D F+E TPP
Sbjct: 1010 QLGDWRAFNG------TYYLDPSDPLFANVSQTFVKVQTAIYG-TDHYYSFDPFNEITPP 1062
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
Y+ + +++Y+ + D AVW++Q W F D FW+PPQ+KA L VP+G L+V
Sbjct: 1063 SSDAGYLQNSSSSMYNALAYADPQAVWVLQAWFFISDAWFWQPPQVKAFLGGVPIGHLLV 1122
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD +AE P W+ + QF G +IWCMLHNF G MYG + I GP++AR ++ M G
Sbjct: 1123 LDTWAEESPAWTVTDQFNGHDWIWCMLHNFGGRTGMYGKIPRITAGPIDAR-KQSPGMKG 1181
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ E IEQN ++YDLMSEM+++ ++ WINQY+ RRYG VP + AWN L TV
Sbjct: 1182 TGLTPEAIEQNYIMYDLMSEMSWRTTAPNMTEWINQYTQRRYGVFVPELAQAWNSLASTV 1241
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YN D + DKN +F + P + +++Y
Sbjct: 1242 YNAPD-SIDKNPS---SFVGIRPELNMTN---------------------------NIYY 1270
Query: 360 STSEVIRALELFIASGNE-LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
+S + +A +L+++ +E + +++TY +D+ ++T QAL+ E + + +AY+
Sbjct: 1271 DSSIIQKAWQLYLSVTDEYVLSTSTYSFDIAEITIQALSNLFIETEIAMYDAYKTGKGTE 1330
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ-----LAQNEEQEKQV 465
+ + L ++ DMD + + L+G W +A+Q L++N+++++
Sbjct: 1331 FDEHAMNCLNIITDMDMIASTQQLLLVGTWTANARQWANYNLSRNKDEDRNT 1382
>gi|348681836|gb|EGZ21652.1| hypothetical protein PHYSODRAFT_247428 [Phytophthora sojae]
Length = 991
Score = 308 bits (790), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 171/461 (37%), Positives = 264/461 (57%), Gaps = 32/461 (6%)
Query: 1 MSNLHG-W-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
M NL G W GPLPQ+++D Q LQ +IL R+ E GM P LPAF+G+VP L+ P+A
Sbjct: 422 MGNLRGSWVKGPLPQAFIDNQHELQLRILQRMREFGMIPALPAFAGHVPEDLKLTLPNAN 481
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
T+ NW + ++CC Y+++ TDPL+ EIG+AF+E+Q Y TS +Y CDT+ E
Sbjct: 482 FTRSPNWGNFTD--QYCCVYMIEPTDPLYREIGKAFLEEQRALYNYTSSLYQCDTYMEMA 539
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
P + A+ GM + D +AVWLMQGW F DP +W P++KA L VP KL
Sbjct: 540 PEFTDLSELKGAARAVIDGMTAADPNAVWLMQGWPFVDDPHYWTRPRVKAYLEGVPTDKL 599
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LD ++E PIW+ ++G +I+ +LHNF GN M G L ++A PV+A+ N TM
Sbjct: 600 IILDFYSEAVPIWNKMDNYFGKNWIYSVLHNFGGNTGMRGDLPTLATAPVQAQRDGNGTM 659
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VGVG++MEGI QN VVYDL +MA++ +DV W+++Y+ RRY ++ AW+ L
Sbjct: 660 VGVGLTMEGIFQNYVVYDLTLQMAWEDSPLDVDEWVSKYASRRYHTQNEHVERAWSYLSR 719
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
+VYN T +A+ V S++ + Y + + +
Sbjct: 720 SVYNRT-----------LAYGGVTKSLVCLIPHWRLLYDR--------------FQPTLI 754
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF--LNIIEAYQLND 415
Y +++ A + + +G+EL +TYR+DL+D+T+Q L+ E + L +I + +
Sbjct: 755 KYDPKDIVLAWKELLLAGDELRNVDTYRHDLVDVTKQFLSNKLLEQYQHLKVIYSAKSAP 814
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
A+ V +L++ L + ++ +LA ++ FLLG W+ A LA
Sbjct: 815 ANEVCELTKTMLTTINRLEEILATNEDFLLGNWVADALNLA 855
>gi|194759443|ref|XP_001961958.1| GF14678 [Drosophila ananassae]
gi|190615655|gb|EDV31179.1| GF14678 [Drosophila ananassae]
Length = 783
Score = 307 bits (787), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 166/468 (35%), Positives = 259/468 (55%), Gaps = 40/468 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPL W QL+LQ++IL LGM+ LPAF+G+VP AL + P+ T
Sbjct: 228 MGNIRGWAGPLKPEWRQFQLLLQQEILSAQRNLGMSVALPAFAGHVPRALSRLHPNTSFT 287
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R+CC ++ T+PLF +I F++ + YG ++HI+ CD F+E PP
Sbjct: 288 DVQRWNQFPD--RYCCGLFVEPTEPLFHQIATTFLQSVVTIYG-SNHIFFCDPFNELEPP 344
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V PEY+ S AAI++ M + D +A+WL+QGW+F +PFW P +A L +VP G+++VL
Sbjct: 345 VAKPEYMRSTAAAIHNSMTAVDPEAIWLLQGWMFVKNPFWTPDMAEAFLTAVPRGRILVL 404
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + + ++G P+IWCMLHNF G + M+G I G AR+ N+++VG
Sbjct: 405 DLQSEQFPQYELTHSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEAARSMPNSSIVGT 464
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN VVY L E + +D+ +W ++V RYG ++ AW +L ++VY
Sbjct: 465 GITPEGIGQNYVVYSLTLERGWSRNSIDLDSWFRHFTVTRYGVKDESLAKAWLLLKNSVY 524
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH-PHLWY 359
+ + G+Y +P S++H P WY
Sbjct: 525 SFHG--------------------LQKMRGQYVVTRRP------------SFNHDPFTWY 552
Query: 360 STSEVIRALELFIASGN----ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
+ S+V+ A L +++ E + Y +DL+D+TRQ L A++L++N+ +++
Sbjct: 553 NASDVLEAWHLLLSARVIIPLEDDRYDVYEHDLVDITRQFLQITADQLYVNLKSSFRKRQ 612
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
LS R L+L +D++ +L+ FLLG WLE AKQ+A + E K
Sbjct: 613 LPRFEFLSTRLLQLFDDLELILSSGRNFLLGNWLEQAKQVAPHPEDRK 660
>gi|348681870|gb|EGZ21686.1| hypothetical protein PHYSODRAFT_495971 [Phytophthora sojae]
Length = 692
Score = 304 bits (779), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 164/461 (35%), Positives = 261/461 (56%), Gaps = 32/461 (6%)
Query: 1 MSNLHG-W-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
M NL G W GPLPQ+++D Q LQ KIL R+ GM P LPAF+G+VP L+ ++P+AK
Sbjct: 121 MGNLRGSWVEGPLPQAFIDGQYELQLKILERMRGFGMVPALPAFAGHVPEELKTLYPNAK 180
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
T+ NW + +CC Y+LD DPL+ EIG+ F+E+Q Y TS +Y CDT++E
Sbjct: 181 FTRSPNWGGFSDE--FCCVYMLDPQDPLYYEIGKTFLEEQRALYDYTSSLYQCDTYNEMD 238
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
P P + + A+ M + D +AVWL+QGWLF P +W +++ L+ VP K+
Sbjct: 239 PDFTDPAKLQAASRAVIDSMTAADPNAVWLIQGWLFVNSPNYWTKERVQTYLDGVPNDKM 298
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL++EV+P+W+ ++G +I+C+LHNF GN M G L ++ PV A + + TM
Sbjct: 299 IILDLYSEVRPVWNKMDNYFGKSWIYCVLHNFGGNTGMRGDLPTLGTAPVLANRASSGTM 358
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
+G+G++MEGI QN VVYDL +MA+ +D+ W+ ++ +RY + AW L
Sbjct: 359 IGMGLTMEGIFQNYVVYDLTLQMAWVDAPLDMDEWVPSFAAQRYHSQDAHTERAWGFLLQ 418
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
+VYN T G + ++ P + ++ + + +
Sbjct: 419 SVYNRTLGYGGVTKSLVCLIPHW-----------------KLVRDGFMPTLIT------- 454
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-- 415
Y ++ RA + + +G+EL A +TYR+DL+D+TRQ L+ + +L++ + Y +
Sbjct: 455 -YDPMDITRAWKELLLAGSELHAVDTYRHDLVDVTRQFLSDHFMAQYLHLEDMYAGKETP 513
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
A + + R L +E +D +LA +D FLLG W+ A+ LA
Sbjct: 514 ADQLCAWTDRMLVTIEWLDEILATNDDFLLGNWVADARALA 554
>gi|157134500|ref|XP_001656341.1| alpha-n-acetylglucosaminidase [Aedes aegypti]
gi|108881379|gb|EAT45604.1| AAEL003150-PA [Aedes aegypti]
Length = 763
Score = 303 bits (776), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 166/464 (35%), Positives = 258/464 (55%), Gaps = 37/464 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL ++++ LQ +++ + LGM LPAF+G++P ++P AK+T
Sbjct: 211 MGNIRGWGGPLTTNFINFSKKLQNQVIDEMRRLGMVLALPAFAGHLPVQFAQLYPEAKLT 270
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ NW + ++ LD DPLF EIG+ F+ + ++ YG ++HIY CD F+E P
Sbjct: 271 PVENWNGFPA--QYASPLFLDPIDPLFQEIGKRFLTKVIERYG-SNHIYFCDPFNEIQPR 327
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
S +Y+SS A IY M D AVWL+QGW+F +P+W ++A L +VPLG+++VL
Sbjct: 328 SFSAKYLSSASAGIYKAMNDVDPFAVWLLQGWMFVKNPYWSDVAIRAFLQAVPLGRMLVL 387
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + ++ ++G P+IWCML NF G + M G +D + + RT+++ TM+G
Sbjct: 388 DLQSEQFPQYDRTESYHGQPFIWCMLSNFGGTLGMLGSVDLVFQRIRDVRTNDSMTMIGT 447
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN +Y+ EM + +V+ W Y+ RYG ++DAW++ +TVY
Sbjct: 448 GITPEGINQNYGLYEFALEMGWNPNIDNVEEWFRTYASVRYGTQDKRLKDAWSMFRYTVY 507
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + + GKY +P K HP LWY+
Sbjct: 508 SFKEQ--------------------EMMRGKYTFNRRPSLKL-----------HPWLWYN 536
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+ ++L + S S + +R D++DLTRQ L A+ L+LNI+EAY + + V
Sbjct: 537 ETLFNAGVQLLLESN---STNTLFRNDVVDLTRQFLQNTADRLYLNIMEAYNTKNPNSVK 593
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
LS F +L+EDMD LL FLLG WLESAK +A+ + ++
Sbjct: 594 YLSILFQKLLEDMDRLLRTDQHFLLGRWLESAKAVAETSLERQK 637
>gi|242011515|ref|XP_002426494.1| alpha-N-acetylglucosaminidase, putative [Pediculus humanus corporis]
gi|212510620|gb|EEB13756.1| alpha-N-acetylglucosaminidase, putative [Pediculus humanus corporis]
Length = 1345
Score = 302 bits (774), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 172/465 (36%), Positives = 270/465 (58%), Gaps = 41/465 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ + L +WL QQL+LQ KIL R+ ELG+ PVLP+F G VP + ++ +P AK+
Sbjct: 836 MGNVRNFSYGLTNNWLQQQLLLQHKILNRLRELGITPVLPSFCGIVPRSFKDSYPFAKLL 895
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W D +CC YLLD+ DPLF + R F+++ + E+G T+HIYNCD F+EN P
Sbjct: 896 EMPKWNKFSRD--YCCPYLLDSNDPLFSVVSRVFLKEYINEFG-TNHIYNCDVFNENKPA 952
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRP-PQMKALLNSVPLGKLVV 179
+S +Y+S++ + IY M S D A WL+QGW+F DPFW ++KA +N+VP G++++
Sbjct: 953 SESLDYLSTISSTIYKAMSSVDPRATWLVQGWMF-IDPFWASLKRVKAFINAVPKGRMLI 1011
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +++ P + + ++G P+IWC LHNF G + MYG L+ + G + R +N+TMVG
Sbjct: 1012 LDLQSDLTPQYKRLQSYFGQPFIWCTLHNFGGQLGMYGHLNRVNLGVFKGRKFKNSTMVG 1071
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G++ EGI+QN ++YD ++A + + VD+ WI +Y++RRYG I DAW +L +T+
Sbjct: 1072 IGIAPEGIDQNYIMYDFTLDLALRTKPVDLDDWITKYALRRYGLIEKNILDAWLILKNTL 1131
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YN PD + + S Y + +L S + WY
Sbjct: 1132 YNYN--------------PDSNFRLTSSNVKMYTLVKGEHIAKNILTKFPSLRMNEFTWY 1177
Query: 360 STSEVIRALELF-IASGNELSASNT-YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
+ S ++ E F IAS N + ++++ +++DLID+TRQ +
Sbjct: 1178 NRSIILDIFEKFQIASSNSILSTSSLFQHDLIDVTRQTI--------------------Q 1217
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ S FLEL+ ++D +L FLLG WLESAK +A N+ ++
Sbjct: 1218 IAIENSNMFLELLNELDMILNTGKKFLLGNWLESAKNMATNKLEK 1262
>gi|195577611|ref|XP_002078662.1| GD22403 [Drosophila simulans]
gi|194190671|gb|EDX04247.1| GD22403 [Drosophila simulans]
Length = 778
Score = 302 bits (774), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 164/467 (35%), Positives = 251/467 (53%), Gaps = 38/467 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPL W QL+LQ++I+ + LGM+ LPAF+G+VP AL+ + P +
Sbjct: 223 MGNIRGWAGPLTPGWRRYQLLLQQEIITAQHNLGMSVALPAFAGHVPRALKRLHPESTFM 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R+CC ++ TD LF EI F++ + +YG ++HI+ CD F+E PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFVEPTDNLFKEIASRFLQNIITKYG-SNHIFFCDPFNELEPP 339
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V PEY+ S AAIY M+ D A+WL+QGW+F +PFW +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESMRGIDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + ++ ++G P+IWCMLHNF G + M+G I G EAR N+++VG
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIDEARRLPNSSLVGT 459
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN V+Y E + + +D+ +W +S RYG ++ AW +L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNTSLDLDSWFTNFSHTRYGVKDERLEQAWLLLKNSVY 519
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + G+Y V+ S P WY+
Sbjct: 520 SFRG--------------------LQKMRGQY-----------VVTRRPSFNQEPFTWYN 548
Query: 361 TSEVIRALELFIASGN----ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
S V+ A L + S E Y +DL+D+TRQ L A++L++N+ AY+
Sbjct: 549 ASAVLDAWHLLLTSRAIIPLEDDRYEIYEHDLVDITRQFLQISADQLYVNLRSAYRKRQV 608
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
LS + L+L +DM+ +LA FLLG WL+ AKQ A N +++
Sbjct: 609 ARFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQAKQAAPNTGEQR 655
>gi|195339231|ref|XP_002036223.1| GM12949 [Drosophila sechellia]
gi|194130103|gb|EDW52146.1| GM12949 [Drosophila sechellia]
Length = 778
Score = 300 bits (768), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 163/467 (34%), Positives = 250/467 (53%), Gaps = 38/467 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPL W QL+LQ++I+ LGM+ LPAF+G+VP AL+ + P +
Sbjct: 223 MGNIRGWAGPLTAGWRRYQLLLQQEIITAQRNLGMSVALPAFAGHVPRALKRLHPESTFM 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R+CC ++ T+ LF EI F++ + +YG ++HI+ CD F+E PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFVEPTENLFKEIASRFLQNIITKYG-SNHIFFCDPFNELEPP 339
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V PEY+ S AAIY M+ D +A+WL+QGW+F +PFW +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESMRGIDPEAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + ++ ++G P+IWCMLHNF G + M+G I G EAR N+++VG
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN V+Y E + + +D+ W +S RYG ++ AW +L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNTSLDLDGWFTNFSHTRYGVKDERLEQAWLLLKNSVY 519
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + G+Y V+ S P WY+
Sbjct: 520 SFRG--------------------LQKMRGQY-----------VVTRRPSFNQEPFTWYN 548
Query: 361 TSEVIRALELFIASGN----ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
S V+ A L + S E Y +DL+D+TRQ L A++L++N+ AY+
Sbjct: 549 ASAVLDAWHLLLTSRAIIPLEDDRYEMYEHDLVDITRQFLQISADQLYVNLRSAYRKRQV 608
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
LS + L+L +DM+ +LA FLLG WL+ AKQ A N +++
Sbjct: 609 ARFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQAKQAAPNTGEQR 655
>gi|21356587|ref|NP_652045.1| CG13397, isoform A [Drosophila melanogaster]
gi|442626853|ref|NP_001260251.1| CG13397, isoform B [Drosophila melanogaster]
gi|16185856|gb|AAL13967.1| LP03571p [Drosophila melanogaster]
gi|22945953|gb|AAF52672.2| CG13397, isoform A [Drosophila melanogaster]
gi|440213562|gb|AGB92787.1| CG13397, isoform B [Drosophila melanogaster]
Length = 778
Score = 299 bits (765), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 163/467 (34%), Positives = 250/467 (53%), Gaps = 38/467 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPL +W QL+LQ++I+ LGM+ LPAF+G+VP AL+ + P +
Sbjct: 223 MGNIRGWAGPLTPAWRRYQLLLQQEIITAQRNLGMSVALPAFAGHVPRALKRLNPESTFM 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R+CC ++ T+ LF EI F+ + +YG ++HI+ CD F+E PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFVEPTENLFKEIASRFLHNIITKYG-SNHIFFCDPFNELEPP 339
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V PEY+ S AAIY M+ D A+WL+QGW+F +PFW +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESMRGIDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + ++ ++G P+IWCMLHNF G + M+G I G EAR N+++VG
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN V+Y E + + +D+ +W +S RYG ++ AW +L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNTSLDLDSWFTNFSHSRYGVKDERLEQAWLLLKNSVY 519
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + G+Y V+ S P WY+
Sbjct: 520 SFRG--------------------LQKMRGQY-----------VVTRRPSFNQEPFTWYN 548
Query: 361 TSEVIRALELFIASGN----ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
S V+ A L + E + Y +DL+D+TRQ L A++L++N+ AY+
Sbjct: 549 ASAVLDAWHLLLTFRAIIPLEDNRYEIYEHDLVDITRQFLQISADQLYINLRSAYRKRQV 608
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
LS + L+L +DM+ +LA FLLG WL+ AKQ A N Q++
Sbjct: 609 SRFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQAKQAAPNTGQQR 655
>gi|194863164|ref|XP_001970307.1| GG23441 [Drosophila erecta]
gi|190662174|gb|EDV59366.1| GG23441 [Drosophila erecta]
Length = 778
Score = 298 bits (762), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 163/467 (34%), Positives = 251/467 (53%), Gaps = 38/467 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPL W QL+LQ++I+ LGM+ LPAF+G+VP AL+ + P +
Sbjct: 223 MGNIRGWAGPLTPEWRRYQLLLQQEIIAAQRNLGMSVALPAFAGHVPRALKRLHPGSTFM 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R+CC L+ TD LF EI F+++ + YG ++HI+ CD F+E PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFLEPTDNLFNEIALIFLQKIITAYG-SNHIFFCDPFNELEPP 339
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V PEY+ S AAIY ++ D A+WL+QGW+F +PFW +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESIRRLDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + ++ ++G P+IWCMLHNF G + M+G I G EAR N+++VG
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN V+Y E + + +D+ +W +S RYG ++ AW L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNRPLDLDSWFTSFSHARYGVKDERLEQAWLQLKNSVY 519
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + G+Y +P K+ P WY+
Sbjct: 520 SFHG--------------------LQKMRGQYVVTRRPSFKQ-----------EPFTWYN 548
Query: 361 TSEVIRALELFIASGN----ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
S V+ A L ++S E Y +DL+D+TRQ L A++L++N+ AY+
Sbjct: 549 ASAVLDAWHLLLSSRAIIPLEDDRYEMYEHDLVDITRQFLQISADQLYVNLRSAYKKRQV 608
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
LS + L+L +DM+ +LA FLLG WL+ AKQ A + +++
Sbjct: 609 SRFEFLSSKLLKLFDDMELILASSRNFLLGNWLQQAKQAAPHPGEQR 655
>gi|90399367|emb|CAJ86183.1| H0212B02.15 [Oryza sativa Indica Group]
gi|116311963|emb|CAJ86322.1| OSIGBa0113E10.5 [Oryza sativa Indica Group]
Length = 692
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 133/189 (70%), Positives = 157/189 (83%), Gaps = 3/189 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQSWLD QL LQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 261 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 320
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYG TSH+Y+CDTFDENTPP
Sbjct: 321 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGGTSHVYSCDTFDENTPP 380
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG---KL 177
+ P YISSLGAA + GMQSGD DA+WLMQGWLFSYDPFW PPQMK + G
Sbjct: 381 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFSYDPFWEPPQMKIGVGMSMEGIEQNP 440
Query: 178 VVLDLFAEV 186
+V DL +E+
Sbjct: 441 IVYDLMSEM 449
Score = 293 bits (750), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 139/232 (59%), Positives = 178/232 (76%), Gaps = 6/232 (2%)
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
+GVGMSMEGIEQNP+VYDLMSEMAF H +VD++ W+ Y RRYG+S+ +QDAW +LY
Sbjct: 427 IGVGMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVETYPTRRYGKSIVGLQDAWKILYQ 486
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAV-LKSETSSY 352
T+YNCTDG DKNRDVIVAFPDV+P +I T G Y + Y +SK + + + Y
Sbjct: 487 TLYNCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSSSKTYSTKLSKNYIAVDASNDEY 545
Query: 353 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
+HPHLWY T VIRALELF+ G+E+S SNT+RYDL+DLTRQ LAKYAN++F+ IIE+Y+
Sbjct: 546 EHPHLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVDLTRQTLAKYANQVFVKIIESYK 605
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
N+ + V L + F++LV D+D LLA H+GFLLGPWLESAK LA+++EQE Q
Sbjct: 606 ANNVNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLESAKGLARDKEQEMQ 657
>gi|198476648|ref|XP_001357424.2| GA12255 [Drosophila pseudoobscura pseudoobscura]
gi|198137793|gb|EAL34493.2| GA12255 [Drosophila pseudoobscura pseudoobscura]
Length = 767
Score = 296 bits (757), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 161/462 (34%), Positives = 257/462 (55%), Gaps = 38/462 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL + Q +LQ+ IL +LG++ LPAF+G++P A++ ++P+ T
Sbjct: 212 MGNIRGWGGPLKPEYQRLQELLQQHILRAQRDLGISVALPAFAGHLPTAMRRIYPNGNYT 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W S DP +CC +D DP+F + F+ + ++ YG ++HI+ CD F+E PP
Sbjct: 272 EVERWNSF-PDP-YCCGLFVDPLDPIFDLVAALFLRRVVQRYG-SNHIFFCDPFNELQPP 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V P+Y+ S AAI++ M+S D +AVWL+QGW+F + FW M+A L +VP+G+L+VL
Sbjct: 329 VAEPDYMRSTAAAIHNSMRSVDPEAVWLLQGWMFVKNIFWTDAMMEAFLTAVPIGRLIVL 388
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + + +YG P++WCMLHNF G + M+G D + G AR N+++VGV
Sbjct: 389 DLQSEQFPQYQRTDSYYGQPFVWCMLHNFGGTLGMFGSADLVNNGIEAARRMPNSSIVGV 448
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN V+Y L+ E + +D+ +W ++ RYG +Q AW +L +VY
Sbjct: 449 GITPEGIGQNYVMYSLVLERGWSELPLDLDSWFKHFARTRYGVDDEGLQQAWQLLRRSVY 508
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ K R G V++ L + P WY+
Sbjct: 509 SFR--GLQKMRG-----------------------GYTVTRRPALNLD------PFTWYN 537
Query: 361 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
S+V+ A +L ++S + + Y +DL+D+TRQ L A++L++N+ AY+
Sbjct: 538 ASDVLEAWKLLLSSRAIIPLEDDNYAIYEHDLVDITRQYLQISADQLYVNLKSAYRKRQV 597
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
L + L+L D++ +LA FLLG WL A++ A N
Sbjct: 598 ARFEYLGSKLLQLFGDLERILASGSNFLLGTWLADAQRAAPN 639
>gi|301107007|ref|XP_002902586.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
infestans T30-4]
gi|262098460|gb|EEY56512.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
infestans T30-4]
Length = 736
Score = 295 bits (755), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 169/454 (37%), Positives = 258/454 (56%), Gaps = 32/454 (7%)
Query: 1 MSNLHG-W-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
M NL G W GPLPQ+++D Q LQ +IL R+ E GM P LPAF+G+VP L+ P+A
Sbjct: 196 MGNLRGSWVKGPLPQAFIDNQHELQLRILERMREFGMIPALPAFAGHVPEELKLRLPNAH 255
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
TQ NW + + CC ++++ TD L+ EIG+ F+++Q + Y TS +Y CDT+ E
Sbjct: 256 FTQSPNWGNFSEEH--CCVFMIEPTDALYREIGKNFLKEQRELYNYTSSLYQCDTYMEMA 313
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
P + A+ GM + D +AVWLMQGW F DP FW P++KA L+ VP KL
Sbjct: 314 PEFTDLTELEGAARAVIDGMTAADPNAVWLMQGWPFVDDPHFWTKPRVKAYLDGVPTDKL 373
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LD ++E PIWS ++G +I+ +LHNF GN M G L ++A PV A + N TM
Sbjct: 374 IILDFYSESVPIWSKMDNYFGKSWIYSVLHNFGGNTGMRGDLLTLATAPVLANWAGNGTM 433
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VGVG++MEGI QN +VYDL +MA+ +DV WI QY+ +RY ++ AW+ L
Sbjct: 434 VGVGLTMEGIFQNYIVYDLTLQMAWVDNPLDVNTWIPQYAAQRYHTHNEHVEQAWSYLLR 493
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
+VYN T +A+ V S++ + Y + + ++K
Sbjct: 494 SVYNRT-----------LAYGGVTKSLVCLIPHWRLLYDR--FQPTLIK----------- 529
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA- 416
Y ++V+ A + + + NEL +TYR+DL+D+T+Q L+ E ++++ Y A
Sbjct: 530 -YDPNDVVLAWKELLLAENELRDVDTYRHDLVDVTKQFLSNKLLEQYIHLKGIYNAKKAS 588
Query: 417 -HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 449
+ V L++ L +E ++ +LA ++ FLLG W+
Sbjct: 589 PNEVCGLTKTMLTTMERLEEILATNEDFLLGNWI 622
>gi|158300970|ref|XP_320760.4| AGAP011750-PA [Anopheles gambiae str. PEST]
gi|157013415|gb|EAA00039.4| AGAP011750-PA [Anopheles gambiae str. PEST]
Length = 770
Score = 295 bits (754), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 163/459 (35%), Positives = 248/459 (54%), Gaps = 35/459 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL S+ LQ +++ + LGM LPAF+G++P + ++P+
Sbjct: 213 MGNIRGWGGPLTPSFTQFAHTLQVRVVGEMRRLGMAVALPAFAGHLPVQFRTLYPNVSFA 272
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + P++ LD T+PLF IG F++ +K YG T H+Y D F+E P
Sbjct: 273 NVSVWNNFP--PQYASPLFLDPTEPLFAAIGSRFLQLAIKTYG-TDHVYFSDPFNEIDPT 329
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ S +Y+SS+ AIYS M D DA+WL+QGW+F +PFW +++ L++VPLG+++VL
Sbjct: 330 LPSGKYLSSVSEAIYSTMVQVDPDAIWLLQGWMFVKNPFWSDRAIRSFLSAVPLGRMLVL 389
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + + + G P+IWCML NF G + M G + ++ G E R + T++G
Sbjct: 390 DLQSEQYPQYGRTASYAGQPFIWCMLSNFGGTLGMLGSVGNVFRGIRETRDNSTYTLLGT 449
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR-SVPAIQDAWNVLYHTV 299
G++ EGI QN +Y+ EM + E + W ++Y+V RYG S Q AWN+ TV
Sbjct: 450 GITPEGINQNYALYEFALEMGWNAELDSAEQWFSEYAVARYGNDSDERAQQAWNIFLRTV 509
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y AF ++ + GKY +P SK P WY
Sbjct: 510 Y---------------AFEGLE-----LMRGKYTFNRRPSSK-----------IRPWTWY 538
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ LEL ++ E S + +YDL+D TRQ L A+ L+L ++++++ D
Sbjct: 539 DVHTFNQGLELLLSFAEEASCNQLCQYDLVDATRQCLQHTADALYLTLMDSFKKRDLTSF 598
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
S FL+L+ D+D LL ++ FLLGPWLESAK A+
Sbjct: 599 RLHSSLFLQLLSDLDVLLRTNEHFLLGPWLESAKAHAET 637
>gi|195155652|ref|XP_002018715.1| GL25802 [Drosophila persimilis]
gi|194114868|gb|EDW36911.1| GL25802 [Drosophila persimilis]
Length = 767
Score = 294 bits (753), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 160/462 (34%), Positives = 257/462 (55%), Gaps = 38/462 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL + Q +LQ+ IL +LG++ LPAF+G++P A++ ++P+ T
Sbjct: 212 MGNIRGWGGPLKPEYQRLQELLQQHILRAQRDLGISVALPAFAGHLPTAMRRIYPNGNYT 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W S DP +CC +D DP+F + F+ + ++ YG ++HI+ CD F+E PP
Sbjct: 272 EVERWNSF-PDP-YCCGLFVDPLDPIFDLVAALFLRRVVQRYG-SNHIFFCDPFNELQPP 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V P+Y+ S AAI++ M+S D +AVWL+QGW+F + +W M+A L +VP+G+L+VL
Sbjct: 329 VAEPDYMRSTAAAIHNSMRSVDPEAVWLLQGWMFVKNIYWTDAMMEAFLTAVPIGRLIVL 388
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + + +YG P++WCMLHNF G + M+G D + G AR N+++VGV
Sbjct: 389 DLQSEQFPQYQRTDSYYGQPFVWCMLHNFGGTLGMFGSADLVNNGIEAARRMPNSSIVGV 448
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN V+Y L+ E + +D+ +W ++ RYG +Q AW +L +VY
Sbjct: 449 GITPEGIGQNYVMYSLVLERGWSELPLDLDSWFKHFARTRYGVDDEGLQQAWQLLRRSVY 508
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ K R G V++ L + P WY+
Sbjct: 509 SFR--GLQKMRG-----------------------GYTVTRRPALNLD------PFTWYN 537
Query: 361 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
S+V+ A +L ++S + + Y +DL+D+TRQ L A++L++N+ AY+
Sbjct: 538 ASDVLEAWKLLLSSRAIIPLEDDKYAIYEHDLVDITRQYLQISADQLYVNLKSAYRKRQV 597
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
L + L+L D++ +LA FLLG WL A++ A N
Sbjct: 598 ARFEYLGSKLLQLFGDLEHILASGSNFLLGTWLADAQRAAPN 639
>gi|170060634|ref|XP_001865888.1| alpha-N-acetyl glucosaminidase [Culex quinquefasciatus]
gi|167879069|gb|EDS42452.1| alpha-N-acetyl glucosaminidase [Culex quinquefasciatus]
Length = 761
Score = 293 bits (750), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 164/466 (35%), Positives = 249/466 (53%), Gaps = 41/466 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL +S+ LQ K++ + GM LPAF+G++P + +FP AK+
Sbjct: 210 MGNIRGWGGPLKESFKTFASDLQAKVVQEMRRFGMILALPAFAGHLPVQFKTLFPQAKLN 269
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + ++ LD DPLF +IG F+ + + YG T HIY D F+E P
Sbjct: 270 PVEVWNGFPA--QYASPLFLDPVDPLFQKIGSKFVAKAIARYG-TDHIYFSDPFNEIQPR 326
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+S Y++S A IY M D AVWL+QGW+ +PFW +KA +VP G+++VL
Sbjct: 327 SESARYLASAAAGIYQAMVDVDPLAVWLLQGWMLVKNPFWSDRAIKAFFTAVPNGRMLVL 386
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + ++ +YG P+IWCML NF G + M G +D + E R++E+ TM+G
Sbjct: 387 DLQSEQFPQYVRTQSYYGQPFIWCMLSNFGGTLGMLGSVDLVFERIRETRSNESMTMIGT 446
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN +Y+ EM + + DV W +Y++ RYG +QDAW++ TVY
Sbjct: 447 GITPEGINQNYGLYEFALEMGWNPDISDVDNWFTRYAMVRYGNDDKRLQDAWSIFRSTVY 506
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + + GKY +P K P +WY+
Sbjct: 507 SFKG--------------------MEMMRGKYTFNRRPSLKL-----------QPWVWYN 535
Query: 361 TSEVIRALELFIA--SGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
+ +EL +A NEL +++D++DLTRQ L A++L+L I++ Y L +A
Sbjct: 536 ETRFDEGVELILAVNGSNEL-----FKHDVVDLTRQFLQNTADKLYLTIMDTYTLKNAAA 590
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
S F EL++++D LLA + FLLG WLESAK LA + ++
Sbjct: 591 FKHYSNLFKELLQNIDRLLATNTHFLLGRWLESAKSLATTSLERQK 636
>gi|440800773|gb|ELR21808.1| AlphaN-acetylglucosaminidase (NAGLU) [Acanthamoeba castellanii str.
Neff]
Length = 800
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 175/480 (36%), Positives = 257/480 (53%), Gaps = 67/480 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ WGGPL +SW + Q LQKKI+ ++ + A L+ V+P A IT
Sbjct: 199 MGNVQSWGGPLTKSWREGQAELQKKIVQGVWN---EERAVSVRWARAAGLKRVYPHANIT 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W R +LLD DP+F +IG AFI+ Q + YG T HIYN DTF+E PP
Sbjct: 256 LSPTWAHFTDPYR---VWLLDPFDPIFQKIGTAFIDAQTRVYG-TDHIYNADTFNELDPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
P Y+++ A+Y GM + D A+WLMQGWLF +W ++KA L+ V +++L
Sbjct: 312 SADPTYLAAASNAVYQGMAAADPKALWLMQGWLFR-SVWWSNDRIKAYLSGVKNDNMLIL 370
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AEV PIWS ++ ++G P++WCMLH+F GN ++YG L IA PV+ART+ +TMVG
Sbjct: 371 DLYAEVDPIWSKTESYFGKPFVWCMLHDFGGNRDLYGNLTHIATAPVDARTAPGSTMVGT 430
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ME IEQNPV+Y+LMSEM ++ VDV W++ Y RYG P+ + AW +L+ + Y
Sbjct: 431 GLTMEAIEQNPVIYELMSEMGWRSAHVDVDDWLDHYVSFRYGADSPSAKKAWRLLHQSAY 490
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
P + SI + ++ S H YS
Sbjct: 491 QN---------------PVIMRSIYTFVPNRH----------------VSRNHH----YS 515
Query: 361 TSEVIRALELFIASGNEL----SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND- 415
++ A L + S EL + + YDL+D+TRQ L + LF +AY L D
Sbjct: 516 PDVLVEAWGLLLQSRLELPNPAQPNGPWEYDLVDVTRQVL----DNLFH---DAYGLLDG 568
Query: 416 AHGVFQLSRR------------FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
A+ + +RR ++++ D+D +LA + +LLG W E A+ A NEE+++
Sbjct: 569 AYDAYVATRRDPFNQVKTIGAALIQILSDIDTVLATNQNYLLGVWTERARSWATNEEEKR 628
>gi|330791218|ref|XP_003283691.1| hypothetical protein DICPUDRAFT_26247 [Dictyostelium purpureum]
gi|325086434|gb|EGC39824.1| hypothetical protein DICPUDRAFT_26247 [Dictyostelium purpureum]
Length = 712
Score = 288 bits (737), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 158/458 (34%), Positives = 253/458 (55%), Gaps = 37/458 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ WGGP+ WL++Q LQ +IL R+ GM PVLP F+G++P A+Q +FP+A ++
Sbjct: 170 MGNVNNWGGPITMDWLEKQRDLQIQILTRMRAYGMKPVLPGFAGHIPGAIQTLFPTANVS 229
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L W T+ LD +DPLF +I + FI + + +G T H YN D F+E PP
Sbjct: 230 ILSTWCEFNG------TFYLDPSDPLFGKITQLFITELIGVFG-TDHYYNFDPFNELAPP 282
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW-LFSYDPFWRPPQMKALLNSVPLGKLVV 179
++ +Y+ M + D AVW++QGW + Y FW+ Q +A + VP+G +V
Sbjct: 283 SSDLGFLKQTSQQMYNNMLAADPKAVWVLQGWFIVDYPEFWQANQTQAWFSGVPIGGFIV 342
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL+++V P W+ ++ FYG ++WCMLHNF G MYG + IA P+ AR S + M+G
Sbjct: 343 LDLWSDVAPAWNITEYFYGHYWLWCMLHNFGGRSGMYGRIPFIATNPIIAR-SLSDNMMG 401
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ E IEQN VVYDLMSEMA++ D++ WI QY+ RRYG+ +P + + W + TV
Sbjct: 402 TGLTPEAIEQNVVVYDLMSEMAWRSTAPDLEEWITQYTNRRYGKIMPEVVEVWMSMVDTV 461
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
+N T +N +F + PSI N+G V +Y
Sbjct: 462 FNATAYWARRNMGAPESFIALRPSI---------NFGDNV------------------FY 494
Query: 360 STSEVIRALELF-IASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
S + A +F + + + + ++ T+++D+ ++T QAL+ + + + N+I++Y ++D
Sbjct: 495 DPSVMFNAWHVFSLVNDSYVISTETFQFDISEITMQALSNFFMDTYFNLIKSYNVSDIES 554
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
+ S +E + MD + + LG W A+ A
Sbjct: 555 FQRESITMMETISFMDLIASTQPELQLGVWTYRARLWA 592
>gi|195473052|ref|XP_002088810.1| GE10991 [Drosophila yakuba]
gi|194174911|gb|EDW88522.1| GE10991 [Drosophila yakuba]
Length = 778
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 158/460 (34%), Positives = 245/460 (53%), Gaps = 38/460 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPL W QL+LQ++I+ LGM+ LPAF+G+VP AL+ + P +
Sbjct: 223 MGNIRGWAGPLTPQWRRYQLLLQQEIIAAQRNLGMSVALPAFAGHVPRALKRLNPDSTFM 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W ++CC ++ + LF EI F+++ + YG ++HI+ CD F+E PP
Sbjct: 283 EVQRWNQFPD--QYCCGLFVEPKENLFNEIALNFLQKIITIYG-SNHIFFCDPFNELEPP 339
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V PEY+ S AAIY M+ D A+WL+QGW+F +PFW +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTSAAIYESMRRIDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + ++ ++G P+IWCMLHNF G + M+G I G EAR N+++VG
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN V+Y E + ++ +D+ +W +S RYG ++ AW L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNKPLDLDSWFTNFSHTRYGVKDERLEQAWLQLKNSVY 519
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + G+Y V+ S P WY
Sbjct: 520 SFRG--------------------LQKMRGQY-----------VVTRRPSFNQEPFTWYD 548
Query: 361 TSEVIRALELFIASGN----ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
S V+ A L ++S E Y +DL+D+TRQ L A++L++N+ A++
Sbjct: 549 ASAVLDAWHLLLSSRAIIPLEDDRYEMYEHDLVDITRQFLQISADQLYVNLRSAFRKRQV 608
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
LS + L+L +DM+ +LA FLLG WL+ AK+ A
Sbjct: 609 TRFEYLSTKLLKLFDDMELILASSRNFLLGNWLQQAKRAA 648
>gi|297273081|ref|XP_001095618.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Macaca mulatta]
Length = 691
Score = 286 bits (733), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 174/472 (36%), Positives = 245/472 (51%), Gaps = 85/472 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M NLH W GPLP SW +QL LQ ++L R+ GM PVLPAF+G+VP A+ + P A
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRTSCMPVAS 263
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ S+ P GR I + +
Sbjct: 264 LPA-----SLPPSPG-----------------GRKLIH----------------SINLMQ 285
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
PP +P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L
Sbjct: 286 PPSSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRL 345
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
+VLDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 346 LVLDLFAESQPVYTLTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTM 405
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
VG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ +RYG S P AW +L
Sbjct: 406 VGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLL 465
Query: 297 HTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
+VYNC+ + NR +V P L+ TS
Sbjct: 466 RSVYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS----- 495
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY---- 411
+WY+ S V A L + S L+AS +RYDL+DLTRQA+ + + + AY
Sbjct: 496 -VWYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKE 554
Query: 412 --QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
L A GV EL+ +D LLA FLLG WLE A+ A +E +
Sbjct: 555 LTSLLRAGGVLA-----YELLPALDELLASDSRFLLGSWLEQARAAAVSEAE 601
>gi|195115262|ref|XP_002002183.1| GI17241 [Drosophila mojavensis]
gi|193912758|gb|EDW11625.1| GI17241 [Drosophila mojavensis]
Length = 773
Score = 286 bits (732), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 164/466 (35%), Positives = 258/466 (55%), Gaps = 38/466 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL WGGPLP + Q +LQ++IL ELGM+ LPAFSG VP A++ VFP+A T
Sbjct: 212 MGNLRSWGGPLPPAHRQLQQLLQQRILAAQRELGMSVALPAFSGYVPTAMRRVFPNASFT 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q W DP +CC ++ DPLF ++G F+ + ++ YG ++HIY D F+E P
Sbjct: 272 QSDRWNHF-PDP-YCCVLFVEPQDPLFQQVGAMFLRRVIQVYG-SNHIYFSDPFNEMMPR 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V P Y+ AIY+ MQ D+DAVWL+QGW+F +W ++A L +VP G+++ L
Sbjct: 329 VREPNYVRYTAKAIYNSMQVVDADAVWLIQGWMFLKSVYWTNDLIEAYLTAVPRGRILAL 388
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + + +YG P++WCML+NF GN+ ++G I G + AR+ N +MVGV
Sbjct: 389 DLQSEQFPQYERTHSYYGQPFVWCMLNNFGGNLGLFGSAQLIPSGIIAARSMPNGSMVGV 448
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN ++ L E A+ +++ ++ W +++ RYG + + W +L +VY
Sbjct: 449 GITPEGIGQNYALFALTLEQAWSPDELQLEDWFEYFALTRYGVNDTRLSQVWQLLRESVY 508
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + R+ + GKY L S + +P +WY+
Sbjct: 509 SF------QGRERM--------------RGKY-----------TLNKRPSLHHYPWVWYN 537
Query: 361 TSEVIRALELFIASGNELSASNT----YRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
+ V A L + + + ++ Y +DL+D+TRQ L + ++N+ A +
Sbjct: 538 VTMVYEAWRLMLEAKETVPLNDNRRAIYEHDLVDITRQCLQLSFDRFYVNLKSACRHKQL 597
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ V L+ + LEL DM+ +LA + +LLG WLE+AK+LA +EEQ
Sbjct: 598 NRVEYLAGKLLELFADMERILASGEHYLLGNWLEAAKRLAPSEEQR 643
>gi|66801665|ref|XP_629757.1| hypothetical protein DDB_G0291998 [Dictyostelium discoideum AX4]
gi|60463162|gb|EAL61355.1| hypothetical protein DDB_G0291998 [Dictyostelium discoideum AX4]
Length = 798
Score = 282 bits (721), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 157/457 (34%), Positives = 253/457 (55%), Gaps = 35/457 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++GWGGP+ WL++Q LQ KIL R+ + GM PVLP F+G++P A+Q +FP A I+
Sbjct: 255 MGNVNGWGGPITLDWLEKQRDLQIKILERMRQYGMKPVLPGFAGHIPGAIQQLFPQANIS 314
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L W + T+ L++TDPLF +I FI + + +G T H YN D F+E PP
Sbjct: 315 VLSTWCNFNG------TFYLESTDPLFAKITTMFIGELIDVFG-TDHFYNFDPFNELEPP 367
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ +Y+ ++Y + D AVW++QGW P FW+ Q +A + VP+G ++V
Sbjct: 368 SNDTDYLRQTSQSMYENVLLADPKAVWVLQGWFIVDAPEFWQAKQTEAWFSGVPIGGVLV 427
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL+++V P W+T+ +YG ++WCMLHNF G MYG L I+ P+ AR + MVG
Sbjct: 428 LDLWSDVIPGWTTTNYYYGHYWVWCMLHNFGGRSGMYGRLPWISSNPITAR-GLSPNMVG 486
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G++ E IEQN VVYD+MSEM+++ + ++ W+ QY+ RRYG+ VP I D W L +TV
Sbjct: 487 IGLTPEAIEQNVVVYDMMSEMSWRSVQPNLTEWVTQYTHRRYGKLVPEIVDVWISLVNTV 546
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
+N T N +F + P + +G +S+ +P++ Y
Sbjct: 547 FNATAATARANMGAPESFIALRPQL---------TFGN------------NSFYNPNILY 585
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ V ++ + ++ T+ +D+ + T Q+L+ Y + + +IEA+ +D +
Sbjct: 586 NAWNVFSMVD-----DEYVISTETFEFDISEFTMQSLSNYFMDQYFLLIEAFNASDVQTL 640
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
+S L+++ MD + + LG W A+ A
Sbjct: 641 STISIELLDIINYMDEIASTQSSLQLGLWTYRARLWA 677
>gi|440799253|gb|ELR20308.1| alpha-N-acetylglucosaminidase family protein [Acanthamoeba
castellanii str. Neff]
Length = 854
Score = 281 bits (718), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 163/480 (33%), Positives = 247/480 (51%), Gaps = 57/480 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL +W Q LQKKI+ R GM P+LP F+G VP ++ ++P+A +T
Sbjct: 240 MGNIQGWGGPLDPAWRKAQAELQKKIVERQRMFGMLPILPGFAGFVPDGIKRIYPTANLT 299
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ +W ++ Y L D L+ IGR I + E+G T HIYN DTF+E +PP
Sbjct: 300 KSADWAGFPH--QYTNVYFLSPLDSLYKTIGRMVIRRVTAEFG-TDHIYNADTFNEMSPP 356
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWR-PPQMKALLNSVPLGKLVV 179
P Y+++ A+Y GM + D A+W+MQGW F +D FW ++++ L+ V +++
Sbjct: 357 SADPTYLAAASRAVYEGMAAEDPQALWVMQGWSFVFDKFWEDKSRVRSYLSGVSDKDMLI 416
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDL ++ P WS + ++G ++WCMLHN G +YG L + P+ A + TM
Sbjct: 417 LDLASDNNPEWSKTDSYFGKEFVWCMLHNGGGVRGLYGNLTQYSSDPLLALATPGNTMLI 476
Query: 238 ------VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS--VPAIQ 289
VGVGM+ME IEQNPVVY+LMSEM ++ E D+ W+ +Y+ RRYG + + ++
Sbjct: 477 CGTCEQVGVGMTMEAIEQNPVVYELMSEMGWRSEAFDIVEWVQRYAERRYGLAAGLSSVG 536
Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 349
+AW +L Y N+ VI D T G YG
Sbjct: 537 EAWELLREATY---------NQSVI------DYGWFGFTPGLGMGYGGVA---------- 571
Query: 350 SSYDHPHLWYSTSEVIRALELFIASG--NELSASNTYRYDLIDLTRQALAKYANELFLNI 407
+ ++ + AL LF+ S + + ++YD +DLTRQ LA +++
Sbjct: 572 ----------NAAKEVEALRLFLQSALTKGYAPNGPWQYDCVDLTRQVLANTFRDIYAQF 621
Query: 408 IEAYQLNDAHGVF------QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
AY AH + L L L+ D+D +LA + +LLG W++SA A +Q
Sbjct: 622 DAAYSAYAAHKTYTVDQLKSLGSALLTLIGDIDEILATNPNYLLGTWIQSALSWADTPDQ 681
>gi|255553488|ref|XP_002517785.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
gi|223543057|gb|EEF44592.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
Length = 360
Score = 279 bits (713), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 135/255 (52%), Positives = 182/255 (71%), Gaps = 5/255 (1%)
Query: 215 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 274
MYGILDSI+ GP+EAR SEN+TMVGVGM MEGIE NPVVY+LMSEMAF+ EKV V W+
Sbjct: 1 MYGILDSISTGPIEARVSENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSEKVQVLEWLK 60
Query: 275 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 334
YS RRYG++V ++ AW +LYHT+YNCTDG D N D IV FPD DPS+ S ++ Q+
Sbjct: 61 TYSRRRYGKAVHQVEAAWEILYHTIYNCTDGIADHNTDFIVKFPDWDPSVQSGSDTSQQD 120
Query: 335 -----YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 389
+ S+ + + S+ H+WYS +VI AL+LFI G+ L+ S TYRYDL+
Sbjct: 121 NKHIFLHRSGSRRFLFEGPNSTLPQAHIWYSIQKVINALQLFIDGGSHLTGSLTYRYDLV 180
Query: 390 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 449
DLTRQ L+K AN+++++ I A++ NDA + S++F++L++D+D LLA D FL+G WL
Sbjct: 181 DLTRQVLSKLANQVYVDAIIAFRSNDARALNLHSQKFIQLIKDIDVLLASDDNFLIGTWL 240
Query: 450 ESAKQLAQNEEQEKQ 464
ESAK+LA N + +Q
Sbjct: 241 ESAKELALNPSEMRQ 255
>gi|121698957|ref|XP_001267859.1| alpha-N-acetylglucosaminidase, putative [Aspergillus clavatus NRRL
1]
gi|119396001|gb|EAW06433.1| alpha-N-acetylglucosaminidase, putative [Aspergillus clavatus NRRL
1]
Length = 671
Score = 278 bits (712), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 162/471 (34%), Positives = 259/471 (54%), Gaps = 39/471 (8%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G W G LP SW+D Q LQKKI+ R+ ELGM PVLPAF+G VP A+ V P A +
Sbjct: 197 NIQGSWHGELPYSWIDAQFELQKKIVRRMVELGMTPVLPAFTGFVPRAITRVLPDATVVN 256
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W D ++ L+ DP F + R+FI +Q + YG +HIY D ++EN P
Sbjct: 257 GSRWSGF--DEKYTNDTFLEPFDPNFARLQRSFIHKQQQAYGNITHIYTLDQYNENDPYS 314
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-LVV 179
PEY+ ++ + ++S D DA+W+MQGWLF S FW ++ A L+ V + ++V
Sbjct: 315 GDPEYLRNVTHNTWQSLKSADPDAIWMMQGWLFYSNSDFWTDERVHAYLSGVETDEDMLV 374
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLF+E +P W ++ +YG P+IWC LH++ GN+ +YG + +I +A ++ +VG
Sbjct: 375 LDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGLYGQVMNITVNATDALAVSDS-LVG 433
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG----RSVP-AIQDAWNV 294
G++MEG E N +VYDL+ + A+ +D ++ + + RY +VP + AW++
Sbjct: 434 YGLTMEGQEGNEIVYDLLLDQAWSSRPIDTDSYFHDWVKARYSTARRHNVPHELYQAWDI 493
Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
L T YN T+ AT V SI + +P K L ++T H
Sbjct: 494 LRTTAYNNTNLATATA---------VSKSIFEL---------QP--KLTGLVNQTGH--H 531
Query: 355 PHLW-YSTSEVIRALELFIASGNELSA---SNTYRYDLIDLTRQALAKYANELFLNIIEA 410
P + Y S ++R+ +L +++ +E +A +RYD++D+TRQ +A ++LN+
Sbjct: 532 PTVVNYEASSLVRSWKLMVSAASESTALWSHPAFRYDMVDVTRQVMANAFIPMYLNVTST 591
Query: 411 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
YQ + Q + L+ D+D +L+ +D F L W+ESA+ A+N+ +
Sbjct: 592 YQ--KGGPISQQGDSLIRLLRDLDAVLSTNDNFRLATWIESARTWARNDTE 640
>gi|195398029|ref|XP_002057627.1| GJ18000 [Drosophila virilis]
gi|194141281|gb|EDW57700.1| GJ18000 [Drosophila virilis]
Length = 766
Score = 277 bits (709), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 156/464 (33%), Positives = 256/464 (55%), Gaps = 38/464 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPLP + Q +LQ+ I+ ELGM+ LPAF+G+VP A++ VFP+A T
Sbjct: 214 MGNIRGWAGPLPPAHRRLQQLLQQLIVRAQRELGMSVALPAFAGHVPTAMRRVFPNANYT 273
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W + ++CC ++ DPLF ++G F+ + ++ YG ++HIY D F+E PP
Sbjct: 274 PAERWNNFPD--QYCCDLFVEPHDPLFQQLGAMFLRRVIQVYG-SNHIYFSDPFNEMQPP 330
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P Y+ S AIY+ M+ D +AVWL+QGW+F D FW ++A L +VP G+++VL
Sbjct: 331 LAEPGYMRSTAKAIYNSMREVDGNAVWLLQGWMFLKDIFWTDELIEAFLTAVPRGRILVL 390
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + + +YG P++WCML+NF G + ++G I G AR N+++VGV
Sbjct: 391 DLQSEQFPQYQRTHSYYGQPFVWCMLNNFGGTLGLFGSAQFIGSGIASARIMPNSSLVGV 450
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN ++ L E + ++ + W + +++ RYG + + AW +L VY
Sbjct: 451 GITPEGIGQNYAIFALTLEQGWSASELQLGDWFDHFALTRYGVNDTRLAQAWQLLRGGVY 510
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + GKY L +P WY+
Sbjct: 511 SFHG--------------------LQRMRGKY-----------ALNRRPGLNLNPWTWYN 539
Query: 361 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
S V A +L +AS + ++ Y +DL+D+TRQ L + +++++N+ AY+
Sbjct: 540 GSSVTDAWQLLLASREMVPLTDDRYAIYEHDLVDITRQFLQQSFDQIYVNLRSAYRKEQL 599
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
+ + L+ + LEL++DM+ +LA +LLG WLE+AK+LA +++
Sbjct: 600 NRLEYLAGKLLELLDDMERILASGVHYLLGTWLEAAKKLAPSDK 643
>gi|195050088|ref|XP_001992825.1| GH13491 [Drosophila grimshawi]
gi|193899884|gb|EDV98750.1| GH13491 [Drosophila grimshawi]
Length = 771
Score = 275 bits (702), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 161/466 (34%), Positives = 262/466 (56%), Gaps = 38/466 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPLP + Q +LQ++I+ +LGM+ LPAF+G+VP L +FP+A T
Sbjct: 213 MGNIRGWGGPLPPAHRRLQQLLQQRIVQAQRDLGMSVALPAFAGHVPTGLPRIFPTANFT 272
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W DP +CC ++ +DPLF +G F+ + ++ YG ++HIY D F+E P
Sbjct: 273 SVERWNQFP-DP-YCCALFIEPSDPLFQLVGAQFLRRVIQIYG-SNHIYFSDPFNEMQPR 329
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISS AIY+ M+ D D VWL+QGW+F + +W ++A L +VP G+++VL
Sbjct: 330 IAEPGYISSTARAIYNSMRMVDKDPVWLLQGWMFLDNAYWSDELIEAFLTAVPRGRMLVL 389
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + + +YG P++WCML+NF G + M+G I G + AR+ N++MVGV
Sbjct: 390 DLQSEQFPQYQRTFSYYGQPFVWCMLNNFGGTLGMFGSAHLINAGIMAARSMPNSSMVGV 449
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN ++ L E + K+++ W +Q+++ RYG + + AW +L +VY
Sbjct: 450 GITPEGIGQNYALFALTLEQGWSGSKLELSDWFDQFTLTRYGVNDTDLILAWQLLRGSVY 509
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + GKY L S P +WY+
Sbjct: 510 HFHG--------------------LQRMRGKY-----------ALNKRPSFNLKPWIWYN 538
Query: 361 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
S V+ A +L +A+ + + Y++DL+D+TRQ L + +++++N+ AY+ +
Sbjct: 539 ASSVVEAWQLLLAANQTIPVEDDRYALYKHDLVDITRQFLQQSFDQVYVNLKSAYRKSQL 598
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
L+ + LEL+ DM+ +LA + +LLG WLE+AK+LA + +Q
Sbjct: 599 ARFEYLAAKLLELLADMERILASGEHYLLGNWLEAAKELAPSADQR 644
>gi|357622373|gb|EHJ73879.1| putative alpha-N-acetyl glucosaminidase [Danaus plexippus]
Length = 780
Score = 274 bits (700), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 158/459 (34%), Positives = 245/459 (53%), Gaps = 41/459 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+HGWGGPLPQSW D+Q +Q+ + +++LGM PV PAF+G+VP A + +FP+
Sbjct: 206 MGNVHGWGGPLPQSWHDRQKQIQEVVTDLMFKLGMIPVFPAFNGHVPKAFEKIFPNTTFH 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W K D +CC +D +P F I + F+ + G +SHIY D F+E
Sbjct: 266 PVETW--NKFDEDYCCNLFVDPREPDFKMISKMFMREITAGLG-SSHIYTADPFNEIKIQ 322
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
S + AI+S + D DAVWL+Q W+F ++P W ++ + L SVP G+++V
Sbjct: 323 PWSTSLVVETAKAIFSSISEYDKDAVWLVQNWMFVHNPLLWPLKRVNSFLTSVPNGRMLV 382
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P + + +YG P+IW MLHNF G + M+G +I E R EN+TMVG
Sbjct: 383 LDLQSEQWPQYDLYQMYYGQPFIWSMLHNFGGTLGMFGNTKTINKDVYEVRKRENSTMVG 442
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
+G++ EGI QN V+YDLM E A++ V D++ W++ Y+ RRYG + +I W L +
Sbjct: 443 IGLTPEGINQNYVIYDLMLESAWRKGPVPDLEEWVSDYAERRYGCNATSI--GWKYLLRS 500
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYN T ++ GKY V+ S P W
Sbjct: 501 VYNFTG--------------------LNRIRGKY-----------VMTRRPSFNIRPWAW 529
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y ++ AL+ F+ N +++ + +DL+D+TRQAL ++++N+ N +
Sbjct: 530 YKGHDLFEALKNFVYVQNPACSTSGFLHDLVDVTRQALQYKIEQIYMNLQNDRYSN--YM 587
Query: 419 VFQLS-RRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
VF + F++ + DM +LA F + WL SA+ ++
Sbjct: 588 VFNYTISSFIDAMTDMQNILATSSDFKITSWLSSARAIS 626
>gi|119480815|ref|XP_001260436.1| alpha-N-acetylglucosaminidase, putative [Neosartorya fischeri NRRL
181]
gi|119408590|gb|EAW18539.1| alpha-N-acetylglucosaminidase, putative [Neosartorya fischeri NRRL
181]
Length = 748
Score = 274 bits (700), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 156/472 (33%), Positives = 262/472 (55%), Gaps = 41/472 (8%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LP SW+D Q LQKKI+ R+ ELGM PVLPAF+G VP A+ V P+A +
Sbjct: 197 NIQGSWGGDLPYSWIDSQFELQKKIVRRMVELGMTPVLPAFTGFVPRAISRVLPNATVVN 256
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W D R+ L+ DP F + R+FI++Q + YG +H+Y D ++EN P
Sbjct: 257 GSRWGGF--DERYTNDTFLEPFDPSFTRLQRSFIQKQQQAYGNITHVYTLDQYNENDPYS 314
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-LVV 179
+Y+ ++ + ++S D +AVWLMQGWLF S FW ++KA L+ V + + ++V
Sbjct: 315 GDLDYLRNVTRNTWLSLKSADPNAVWLMQGWLFYSNSDFWTDERVKAYLSGVEVDQDMLV 374
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLF+E +P W ++ +YG P+IWC LH++ GN+ +YG + ++ +A + + ++VG
Sbjct: 375 LDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGLYGQVMNVTVNATQALAASD-SLVG 433
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-----GRSVP-AIQDAWN 293
G++MEG E N ++YDL+ + A+ + +D + + + RY G +VP + AW+
Sbjct: 434 FGLTMEGQEGNEIMYDLLLDQAWSRQPIDTDHYFHNWVKTRYSSGVRGSAVPEELHQAWD 493
Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
+L TVYN T+ + V + ++ PSI G G D
Sbjct: 494 ILRTTVYNNTNLTSTA---VSKSIFELQPSI----SGLLNRTG----------------D 530
Query: 354 HP-HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
HP + Y + +++A +L ++ ++ L + + YD++D+TRQ +A +++N++
Sbjct: 531 HPTTVNYDPAALVQAWQLMDSAASKDRSLWSQPAFLYDMVDITRQVMANAFIPMYINLVS 590
Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
YQ V ++L+ D+D +L+ +D F L W+ SA+ A+N+ +
Sbjct: 591 TYQA--GASVSTDGSNLIQLLRDVDSVLSTNDNFRLSTWIRSARSWARNDTE 640
>gi|390334740|ref|XP_003724005.1| PREDICTED: uncharacterized protein LOC100893810 [Strongylocentrotus
purpuratus]
Length = 1043
Score = 274 bits (700), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 144/367 (39%), Positives = 207/367 (56%), Gaps = 36/367 (9%)
Query: 100 KEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF 159
+E+ T HIYN DTF+EN P + Y+S+ +Y G+ GD VWLMQGWLF F
Sbjct: 573 EEFNGTDHIYNADTFNENQPRSNDSAYLSAASRGVYQGIVEGDPQGVWLMQGWLFQKTDF 632
Query: 160 WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGIL 219
W P Q+KALL+ VP+G+++VLDLFAE +PI++ ++ FYG P+IWCMLHNF GN +YG L
Sbjct: 633 WGPSQIKALLHGVPIGRMIVLDLFAEARPIYNATQSFYGQPFIWCMLHNFGGNTGLYGKL 692
Query: 220 DSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVR 279
D++ P EAR ++TM+G+G++ EGI QN V+Y+ +++M ++ E ++V WI +YS R
Sbjct: 693 DAVNKFPFEARQFNSSTMIGMGLTPEGILQNYVMYNFLTDMTWRSESMNVSKWIEEYSGR 752
Query: 280 RY----GRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 335
RY G S A + AW +L TVYN T D A P V PS N
Sbjct: 753 RYSPESGHSEEAAK-AWAILQATVYNNTGIDKDHQH----AVPVVRPS----------NK 797
Query: 336 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 395
K V +WY +EV +A + + L S+ +RYDL+D+TR
Sbjct: 798 TKSV-----------------IWYDYTEVAKAWGFLLQASETLGTSSLFRYDLVDVTRNV 840
Query: 396 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
L A + + I+ ++ + + L+ DMD + + H +LLG WLE AK L
Sbjct: 841 LQDLAFDFYEQIMVSFHAKNITAIRGNGTLLCNLILDMDNITSSHQDWLLGTWLEDAKSL 900
Query: 456 AQNEEQE 462
A N ++E
Sbjct: 901 ATNHKEE 907
>gi|392588150|gb|EIW77482.1| glycoside hydrolase family 89 protein [Coniophora puteana
RWD-64-598 SS2]
Length = 761
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 157/471 (33%), Positives = 252/471 (53%), Gaps = 40/471 (8%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG LP W+D Q VL K+I+ R+ +LGM PVLPAF+G VP A+ N++P+A I
Sbjct: 206 FGNIQGSWGGDLPTQWIDDQFVLGKQIVQRMVDLGMTPVLPAFTGFVPPAMHNLYPNASI 265
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
W P++ L+ DPLF ++ ++FI +Q +G SHIY D ++EN P
Sbjct: 266 VNGSAWNDFA--PQFTNDSFLEPFDPLFAQVQQSFISKQQAAFGNVSHIYTLDQYNENDP 323
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPL---- 174
P Y++++ AA +S +++ D DA WLMQGWLF S FW P +++A L VP
Sbjct: 324 YSGDPSYLTNISAATFSSLRAADPDATWLMQGWLFFSSADFWTPERVEAYLAGVPGDDDG 383
Query: 175 GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN 234
+++LDL++E +P W ++G +IWC LH++ GN+ G ++ P+ A S N
Sbjct: 384 SGMLILDLYSEAQPQWQRLSSYFGKRWIWCELHDYGGNMGFEGNFANVTEAPLAALASPN 443
Query: 235 TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVPAIQ-DAW 292
+MVGVG++ EG+E N ++YD++ + A+ ++ + ++ RRY +P +AW
Sbjct: 444 VSMVGVGLTPEGMEGNEIIYDVLLDQAWSSSPINKTEYAQAWATRRYPADELPECAIEAW 503
Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 352
L TVY+ TD + I+ +++VT
Sbjct: 504 QTLAATVYSNTDPGSQATVKSILELEPALSGLVNVTG----------------------- 540
Query: 353 DHP-HLWYST-SEVIRALELFIASGN---ELSASNTYRYDLIDLTRQALAKYANELFLNI 407
HP H++Y T + ++ AL+ + +G+ L A YRYDL+DLTRQ L +L+ ++
Sbjct: 541 HHPTHVFYDTNTTIVPALQQLVQAGHSTPSLLAIPEYRYDLVDLTRQLLVNRFIDLYADL 600
Query: 408 IEAYQLN--DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
+ Y + V + LELV D+D +L ++ F L W ++A+ A
Sbjct: 601 LAVYNTTSASSASVSAAGQPMLELVADLDKVLMTNENFQLSRWTDAARSWA 651
>gi|126307952|ref|XP_001365931.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Monodelphis
domestica]
Length = 481
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 128/253 (50%), Positives = 170/253 (67%), Gaps = 4/253 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +Q LQ +IL R+ GM PVLPAF+G++P A VFP A +T
Sbjct: 205 MGNLHTWGGPLPSSWDLKQSYLQYQILERMRSFGMKPVLPAFAGHIPKAFTRVFPQANVT 264
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L NW + C+YLL DPLF +G F+ + KE+G T HIY+ D F+E PP
Sbjct: 265 KLDNWIDFNCT--YSCSYLLAPEDPLFPVVGSLFLRELAKEFG-TDHIYSADIFNEMDPP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+P Y+++ AA+Y M + D DAVWL QGWLF P FW+PPQMKA+L +VP G+ ++
Sbjct: 322 SSNPAYLAATTAAVYEAMVAVDVDAVWLFQGWLFQNHPDFWKPPQMKAVLEAVPRGRFLI 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++S + FYG P+IWCMLHNF GN ++G+LD++ GP AR N+T+VG
Sbjct: 382 LDLFAESQPVYSRTNSFYGQPFIWCMLHNFGGNHGLFGVLDAVNRGPSTARLFPNSTIVG 441
Query: 240 VGMSMEGIEQNPV 252
G+ EGI QN +
Sbjct: 442 TGIVPEGINQNEI 454
>gi|281200618|gb|EFA74836.1| alpha-N-acetylglucosaminidase [Polysphondylium pallidum PN500]
Length = 469
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 166/466 (35%), Positives = 252/466 (54%), Gaps = 54/466 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ W G L W+ Q LQ +IL R+ + GM VLP F+G+VP AL++ +P+A IT
Sbjct: 1 MGNVNEWAGNLTLGWMVDQRDLQIQILTRMRQFGMQAVLPGFAGHVPEALKSHYPNANIT 60
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QL +W T + + F+ I Q YG T H YN D F+E PP
Sbjct: 61 QLSSW---------NMTVYIHQSPNTFMSI-------QQDLYG-TDHFYNFDPFNELEPP 103
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+ + ++++ + + D +W++QGWLF YD FW+PPQ++A L+ VP+GK++V
Sbjct: 104 SSDPAYLKNCSQSMFNNLIAVDPQGIWVLQGWLFVYDTEFWQPPQIEAFLSGVPIGKMIV 163
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL+A+V W + FYG +IWCMLHNF G MYG + I+ P+EAR S + MVG
Sbjct: 164 LDLWADVDAGWKITNYFYGHNWIWCMLHNFGGRSGMYGKIPFISTNPIEAR-SLSPNMVG 222
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ E IEQN +VYDLMSEMA++ D+K W++QY RRYG+ + + D W L TV
Sbjct: 223 TGLTPEAIEQNVIVYDLMSEMAWRSTPPDLKEWVDQYVTRRYGKYIEVLADTWYELVGTV 282
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
+NC+ VT+G VS L TS Y P +
Sbjct: 283 FNCS----------------------IVTKGPVTIL---VSVRPQLNFTTSLYYDPIV-- 315
Query: 360 STSEVIRALELFIASGN-ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
+ +A F++ + + ++T+ +DL ++T QAL+ L + A+ LND++
Sbjct: 316 ----ISKAWSAFLSIDDLHVVNTSTFSFDLTEITTQALSNLFMTTELQMNAAF-LNDSYE 370
Query: 419 VFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA-QNEEQE 462
F LS L +++D++ +++ + L+G W A+ L NE E
Sbjct: 371 EFSLLSDALLSIIQDINTIVSTQEMLLVGNWTARARALTPANETTE 416
>gi|288927792|ref|ZP_06421639.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
(NAG) [Prevotella sp. oral taxon 317 str. F0108]
gi|288330626|gb|EFC69210.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
(NAG) [Prevotella sp. oral taxon 317 str. F0108]
Length = 734
Score = 272 bits (695), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 159/468 (33%), Positives = 241/468 (51%), Gaps = 43/468 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W GPLP SW+ Q LQKKIL R +LGM PVLPAF+G+VP L+ +P AKIT
Sbjct: 198 MSNIDHWMGPLPMSWIKNQEKLQKKILRRTRDLGMKPVLPAFAGHVPEILKEKYPKAKIT 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L W + R C + LD D LF +I + +I++Q K YG T HIY D F+E PP
Sbjct: 258 PLSIWGDFEDQYR-C--HFLDPFDSLFTDIQKTYIDEQTKLYG-TDHIYGVDPFNELAPP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PEY+++ A IY +++ DS AVWL W+FSY W ++K+ + +VP K ++
Sbjct: 314 SWEPEYLANASAKIYDVLKNADSKAVWLQMTWMFSYQRKDWTDERIKSYITAVPDKKQIL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD +AE +W S+ +Y P+IWC L NF GN + G + + EA + +MVG
Sbjct: 374 LDYYAERTEVWKFSESYYKQPFIWCYLGNFGGNTMIAGNIAEVDRRLNEAFANAE-SMVG 432
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG ++EG + NP++YD + E + + + + W Q++ RR G + + AW +L +
Sbjct: 433 VGSTLEGFDVNPIMYDFVFEKVWHKDGISLHDWTVQWAQRRVGTTDENAEKAWKLLIDKI 492
Query: 300 YN----CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
Y CT+G R PS+ +N+ K
Sbjct: 493 YVQYSLCTEGTLTNAR----------PSLTGHGNWTTKNWTK------------------ 524
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
Y+ +++ A L + S + Y+YD++++ RQ L Y L +AY+ D
Sbjct: 525 ---YNNRDLLEAWGLLLRS--KAITKIAYKYDIVNIGRQVLGNYFTVLRDEFTQAYERKD 579
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ L L+ D++ LL FLLGPWL +A+ + +N E+ +
Sbjct: 580 ISALTIKGNEMLSLLNDLEALLYTSPSFLLGPWLTNAQNMGRNMEESR 627
>gi|449541596|gb|EMD32579.1| glycoside hydrolase family 89 protein [Ceriporiopsis subvermispora
B]
Length = 754
Score = 271 bits (692), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 147/473 (31%), Positives = 262/473 (55%), Gaps = 38/473 (8%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG LP W++ Q LQK+IL R+ ELGM P LPAF+G VP A+ ++P+A I
Sbjct: 199 FGNIQGSWGGALPMQWVNDQFALQKQILTRMTELGMTPALPAFTGFVPRAMSTLYPNASI 258
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENT 118
W + L+ DPLF + ++FI +Q + YG +HIY D ++EN
Sbjct: 259 VNGSAWSGFPASL--TNVSFLEPFDPLFSTLQKSFITKQQQAYGTNVTHIYTLDQYNENN 316
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLG-K 176
P + Y+SS+ A ++ +++ D DA+W++QGWLF S + FW +++A L VP
Sbjct: 317 PFSGNISYLSSVSAGTFASLRAADPDAIWMLQGWLFFSSETFWTDERIQAYLGGVPTNDS 376
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
++VLDL++E +P W+ + ++G ++WC LH + GN+ + G L++I GP+ A +S+ ++
Sbjct: 377 MIVLDLYSEAQPQWNRTSSYFGKQWVWCELHGYGGNMGLEGNLNAITAGPIAALSSQGSS 436
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVP-AIQDAWNV 294
M G+G++MEG E N +VYD++ + A+ +D+ +++ + RRY +P A Q+AW +
Sbjct: 437 MKGMGLTMEGQEGNEIVYDVLLDQAWSSAPIDIASYVKSWVARRYTVEPLPSAAQEAWQI 496
Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
L TVYN D ++ I + +++P++ + + H
Sbjct: 497 LSTTVYNNQD---PNSQATIKSIYELEPTLTGLVN--------------------RTGHH 533
Query: 355 PHL--WYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
P L + + + V+ AL+L + + + L+A + YD +D++RQ L+ + + +++
Sbjct: 534 PTLIPYDTNTTVVPALQLLVKAKEQNAALAAIPEFVYDAVDVSRQLLSNRFIDAYTGLVD 593
Query: 410 AYQLNDA--HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
Y +A V + + + ++ +D LLA ++ FLL W+ A+ + +E
Sbjct: 594 TYNNANATSDAVVRAGQPLMVILSQLDALLATNENFLLSSWIAQARNWSHGDE 646
>gi|340617022|ref|YP_004735475.1| alpha-N-acetylglucosaminidase [Zobellia galactanivorans]
gi|339731819|emb|CAZ95084.1| Alpha-N-acetylglucosaminidase, family GH89 [Zobellia
galactanivorans]
Length = 747
Score = 270 bits (690), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 158/466 (33%), Positives = 255/466 (54%), Gaps = 24/466 (5%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G GGPLPQ+W+ Q+ LQ KIL R+ LGM PVL AF+G+VP L+ ++P A I
Sbjct: 197 MGNIDGMGGPLPQNWITQRKELQVKILNRMRSLGMKPVLQAFTGHVPQVLKKLYPEANIF 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ +W V+ TY LD TD LF +IG AFI++Q + YG T H+Y+ D F E PP
Sbjct: 257 QIEDWAGVEG------TYFLDPTDELFQKIGTAFIKKQTELYG-TDHLYDADCFIEVDPP 309
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQM-KALLNSVPLGKLVV 179
P ++ + ++Y M+ DS A W++QGW F + + + +A L+ +P + +V
Sbjct: 310 SKDPAFLKQVSESVYKSMELADSKATWVLQGWFFFFKKDFWTKERGRAFLDGIPKNRAIV 369
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMV 238
LDL+ E P W + FYG P+IW ++ N + M G L+ + EA TSE +
Sbjct: 370 LDLYGEKNPTWDKTDAFYGQPWIWNVICNEDQKVNMSGDLEEMQRQFQEAYTSEIGNNLK 429
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G+ EG+ NP+V D + E A+ +KV+V+ WI Y+ RYG P+++ AW +L +
Sbjct: 430 GIGVIPEGLGYNPIVQDFIFEKAWDPQKVNVQEWIEDYATIRYGTKSPSVKKAWQLLGES 489
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VY G T ++ P ++ EG ++ + V K+ + +ET P W
Sbjct: 490 VY----GRTRTMWSPLI----TTPRLMIFEEGSKEDI-RHVRKDFKI-TETD----PFAW 535
Query: 359 -YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
+ ++ +A L + NEL TY +DL ++ R+ L ++ ++ AYQ D
Sbjct: 536 DFDVYKLAKAAGLLLGEANELQDVETYNFDLTNVYRELLFSLTHKSINDVSVAYQEKDRQ 595
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ + ++ +L++D++ + ++ FLLG WLE AK E+++
Sbjct: 596 ALDRSAKSLFKLMDDLEAITGANENFLLGKWLEDAKSWGSTPEEKE 641
>gi|71001188|ref|XP_755275.1| alpha-N-acetylglucosaminidase [Aspergillus fumigatus Af293]
gi|66852913|gb|EAL93237.1| alpha-N-acetylglucosaminidase, putative [Aspergillus fumigatus
Af293]
gi|159129357|gb|EDP54471.1| alpha-N-acetylglucosaminidase, putative [Aspergillus fumigatus
A1163]
Length = 756
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 152/468 (32%), Positives = 257/468 (54%), Gaps = 33/468 (7%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LP SW+D Q LQKKI+ R+ ELGM PVLPAF+G VP A+ V P+A +
Sbjct: 205 NIQGSWGGELPYSWIDSQFELQKKIVRRMVELGMTPVLPAFTGFVPRAVSRVLPNATVVN 264
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W + D R+ L+ DP F+ + R+FI++Q + YG +HIY D ++EN P
Sbjct: 265 GSRW--EEFDERYTSDTFLEPFDPSFMRLQRSFIKKQQQAYGNITHIYTLDQYNENAPYS 322
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-LVV 179
+Y+ ++ + ++S D +AVWLMQGWLF S FW ++KA L+ V + + ++V
Sbjct: 323 GDLDYLHNVTHNTWLSLKSADPNAVWLMQGWLFYSSSGFWTDERVKAYLSGVEVDQDMLV 382
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLF+E +P W ++ +YG P+IWC LH++ GN+ +YG + ++ +A + + ++VG
Sbjct: 383 LDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGLYGQVMNVTVNATQALAASD-SLVG 441
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-----GRSVP-AIQDAWN 293
G++MEG E N ++YDL+ + A+ + +D + + ++ RY G +VP + AW+
Sbjct: 442 FGLTMEGQEGNEIMYDLLLDQAWSRQPIDTDHYFHNWAKTRYSSGVRGSAVPEELYQAWD 501
Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
+L T YN T+ + I ++ PSI + + T SYD
Sbjct: 502 ILRITAYNNTNLTSTAVSKSIF---ELQPSISGLLNRTSHH------------PTTVSYD 546
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
L + R ++ + L + + YD++D+TRQ ++ ++ N++ YQ
Sbjct: 547 PAAL----VQAWRLMDSAASKAPSLWSQPAFLYDMVDITRQVMSNAFIPVYTNLVSTYQA 602
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
+ V ++L+ D+D +L+ +D F L W++SA+ +N+ +
Sbjct: 603 GGS--VSTDGSNLIQLLRDLDSVLSTNDNFRLSTWIQSARSWVRNDTE 648
>gi|449541595|gb|EMD32578.1| glycoside hydrolase family 89 protein [Ceriporiopsis subvermispora
B]
Length = 752
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 152/471 (32%), Positives = 263/471 (55%), Gaps = 33/471 (7%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG LP W++ Q LQK+IL R+ ELGM PVLPAF+G VP A+ V +A I
Sbjct: 196 FGNIQGSWGGELPMQWVNDQFALQKQILARMTELGMTPVLPAFTGFVPRAMSTVHSNASI 255
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENT 118
W + P L+ DPLF + ++FI +Q + YG SHIY D ++EN
Sbjct: 256 VNGSQW-APGFPPSLTNVSFLEPFDPLFATLQKSFIAKQQEAYGANISHIYTLDQYNENN 314
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLG-K 176
P + Y+SS+ ++ +++ D DAVW++QGWLF S + FW +++A L VP
Sbjct: 315 PFSGNLSYLSSISEGTFTSLRAADPDAVWMLQGWLFFSSEAFWTNERIEAYLGGVPTNDS 374
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
++VLDL++E +P W+ + ++G ++WC LH++ G I + G LD+I GP+ A S ++
Sbjct: 375 MIVLDLYSEAQPQWNRTSSYFGKQWVWCELHDYGGTIGLEGNLDAITTGPIAALNSPGSS 434
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVP-AIQDAWNV 294
M G+G++MEG E N +VYDL+ + A+ +++ +++ + RRY +P A Q+AW +
Sbjct: 435 MKGMGLTMEGQEGNEIVYDLLLDQAWSSSPINIASYVKGWVSRRYLVEPLPSAAQEAWRI 494
Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
L TVYN D ++ I +++P + + V++ +L + YD
Sbjct: 495 LSTTVYNNQD---PNSQSTIKNIYELEPVLTGL-----------VNRTGILPT-VIPYD- 538
Query: 355 PHLWYSTSEVIRALELFIASGNELSASNT---YRYDLIDLTRQALAKYANELFLNIIEAY 411
+ S ++ AL+L + + + +A +T + +D++D++RQ L+ + + +I+ Y
Sbjct: 539 -----TNSTIVPALQLLVKAKAQNAALSTVPEFVHDVVDVSRQLLSNRFIDAYTALIDTY 593
Query: 412 QLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
+ + V + + + ++ +D LLA ++ FLL W+ A+ L+ +E
Sbjct: 594 NNTNVTSDAVIRAGQPLMTILSQLDALLATNENFLLSSWIAQARNLSHGDE 644
>gi|393782608|ref|ZP_10370791.1| hypothetical protein HMPREF1071_01659 [Bacteroides salyersiae
CL02T12C01]
gi|392672835|gb|EIY66301.1| hypothetical protein HMPREF1071_01659 [Bacteroides salyersiae
CL02T12C01]
Length = 761
Score = 265 bits (678), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 161/466 (34%), Positives = 232/466 (49%), Gaps = 48/466 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+S +D+ L KKI+ R ELGM P+ FSG VP L++ +P+A I
Sbjct: 199 MQNLQSYGGPLPKSVIDRHAALGKKIIARQLELGMQPIQQGFSGYVPRELKDKYPTANIN 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q +W K + LD TD LF +GR F+E+Q + +G +Y D F E+ PP
Sbjct: 259 QQRSWCGFKGAAQ------LDPTDSLFTRMGRVFLEEQARLFG-AHGVYAADPFHESVPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD+PEY+ ++G I+ + D + W MQ W +A++ +VP L++L
Sbjct: 312 VDTPEYLKAVGETIHRLFREFDPQSTWAMQSWSLR----------EAIVKAVPKEALLIL 361
Query: 181 DLFAEVKPIWSTSK-QFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
DL STSK +F+G P + LHNF G I M+G L +A N + G
Sbjct: 362 DLRGS-----STSKAEFWGYPTVVGNLHNFGGRINMHGDLALLASNQYSKAKRLNPAVCG 416
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G+ ME IEQNPV Y+L EM + +D++AW+ QY+ RRYG PA Q AW +L
Sbjct: 417 SGLFMEAIEQNPVYYELAFEMPCHPDSIDLRAWLKQYATRRYGAFSPATQKAWMLLLEGP 476
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T+K+ ++ A P +D V K YD P L
Sbjct: 477 YRQGTNGTEKS-SIVAARPALD-----------------VKKSGPNAGLEIPYD-PAL-- 515
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+IRA L + ++LSAS YR+DL+D+ RQ + + EA++ D
Sbjct: 516 ----IIRAQSLLLEDADKLSASRPYRFDLVDVQRQMMTNLGQLIHRKAAEAFRSKDREAF 571
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
S RFL ++ DMD LL + WL A+ + EE++ Q+
Sbjct: 572 TLHSGRFLGMLADMDTLLRTRSEYSFDRWLTEARSWGETEEEKNQM 617
>gi|298385999|ref|ZP_06995556.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
1_1_14]
gi|298261227|gb|EFI04094.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
1_1_14]
Length = 715
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 149/464 (32%), Positives = 243/464 (52%), Gaps = 45/464 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GGPLP W +Q+VLQKKIL R+ E GM PV F G VP+ L+ +P A++
Sbjct: 194 MGNLENIGGPLPDEWFKEQIVLQKKILARMREYGMKPVFQGFFGMVPSLLKEKYPEARLV 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ G W S++ P +LD DPLF + + + + K YG+ + ++ D F E T
Sbjct: 254 EQGLWNSLQRPP------VLDPADPLFERMAKVWYAEYEKLYGK-ADLFGGDLFHEGGKT 306
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D ++ + + M+ + DA W++Q WL + P+ K LL + +
Sbjct: 307 GGID----VTDAARRVQTAMKRYNPDATWVIQAWLGN-------PK-KELLAGLDRKNTL 354
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR--TSENTT 236
++DL AE W K F G P++W + N+ GNI ++G LD+IA GPV+ + ++ + +
Sbjct: 355 IVDLAAEFWDNWRKRKGFDGFPWLWSHISNYGGNIGLHGRLDAIATGPVDGQKDSAASPS 414
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
M G + EGIE NPVV+DL++EM ++ E +D+ W+ +YSVRRYG +++AW + +
Sbjct: 415 MKGTSSTPEGIEVNPVVFDLLNEMRWRSEHLDLDVWLKEYSVRRYGVEDENLKEAWTIFH 474
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
T Y G + V A P + I+ S++
Sbjct: 475 RTAYGTYTGHRRPSESVFCAPPSLKRDKITA----------------------SAWSQCR 512
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
++Y + + LF+ S + L ++TY+YD +D RQ LA E + N+++AY+ D
Sbjct: 513 IFYDPELFAQGVGLFLQSADRLKQTSTYQYDAVDFVRQYLADLGRETYYNLVDAYRAKDT 572
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
S RFL+L++D + LL+ H+ F +G WL+ A+ ++ E
Sbjct: 573 KQFDYWSERFLQLIKDQNELLSTHERFFVGRWLDMARLKSKQPE 616
>gi|346324333|gb|EGX93930.1| alpha-N-acetylglucosaminidase, putative [Cordyceps militaris CM01]
Length = 751
Score = 265 bits (677), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 149/465 (32%), Positives = 250/465 (53%), Gaps = 37/465 (7%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG LPQ+W++ Q LQKKI+ R+ ELGM P+LPAF G VP + V+P+ +
Sbjct: 201 FGNIQGSWGGDLPQAWIEDQFELQKKIVKRMIELGMTPILPAFPGFVPENITRVWPNVSL 260
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
+ W R+ + DP F E+ +AF+ +Q + YG + + D F+EN P
Sbjct: 261 AESPIWSGFSG--RFTADKYITPYDPHFAELQKAFLTKQNEAYGNVTSFWTLDQFNENKP 318
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK-L 177
+Y+ ++ + +++ D AVW+MQGWLF+ D +W ++K+ L+ VP+ + +
Sbjct: 319 ASGELDYLKNVSHNTWQTLKAADPSAVWVMQGWLFASDKTYWIDDRVKSFLDGVPVNEDM 378
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDLFAE P W ++ FYG P+IWC LH++ GN+ +YG ++++ VEA + ++
Sbjct: 379 LLLDLFAESTPQWQRTESFYGKPWIWCQLHDYGGNMGLYGQIENVTKNAVEA-VQTSKSI 437
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG---RSVPA-IQDAWN 293
VG G+SMEG E N ++YDL+ + A++ E ++ + + + RYG + +P + AW+
Sbjct: 438 VGFGLSMEGQEGNEIMYDLLLDQAWRKEAIETDKYFSDWVTVRYGADHKEIPENLYTAWD 497
Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
+ TVYN TD + V + ++ PSI + +
Sbjct: 498 KVRSTVYNNTDSSVTA---VTKSIFELAPSISGLV--------------------NRTGH 534
Query: 354 HP-HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
HP + Y T +I A ++G++ L + YRYDL D TRQ LA + ++E
Sbjct: 535 HPTKITYDTKTLISAWNDMFSAGDQARWLFDNEAYRYDLTDWTRQVLANAFEATYNKLVE 594
Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 454
Y+ N+ GV R +++ MD +L + F L W+++A++
Sbjct: 595 KYKSNNTKGVKCAGDRLQAILQTMDQVLDTNPSFKLSTWIQAARK 639
>gi|404406438|ref|ZP_10998022.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
Length = 726
Score = 263 bits (673), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 153/457 (33%), Positives = 240/457 (52%), Gaps = 37/457 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLPQSW+D QL LQ++I+ R ELG+ PV +F+G+VP AL+ +FP A I
Sbjct: 190 MANIDAWHGPLPQSWIDGQLELQRRIIARERELGIQPVFTSFTGHVPKALKTLFPDADIE 249
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L W S + R +Y L+ +PLF I +A++++Q + +G +S +Y D F+E PP
Sbjct: 250 RLNPWTSFE---RPYNSYYLNPAEPLFNRIQQAYMQEQRRLFGESS-VYGVDPFNELDPP 305
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
PEY++ Y + D DAVWL W+F + W P ++KA L +VP GKL++
Sbjct: 306 NWDPEYLARAARLTYESITQFDKDAVWLQMAWVFYHKRRDWTPERLKAYLCAVPDGKLLM 365
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + +W +++ FYG P+IW L NF GN + G + ++ A VG
Sbjct: 366 LDYYCDKVELWRSTESFYGQPFIWSYLGNFGGNTMLAGDVKDVSRKLDRAYAEAGRNFVG 425
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ NP +Y+ + + A+ + D WI++ + R GR + AW +LY +
Sbjct: 426 IGCTLEGLDVNPFMYEYVLDRAWT-QLYDDAGWIDRLADRHSGRIDVHYRQAWRILYDKI 484
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y G NR V +P K S + PHL Y
Sbjct: 485 YCAPSG----NRSAAVC-------------------ARPNMK------GRSKWSGPHLDY 515
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+++R E + E +AS+ R+D +++ RQ L Y L I A + D V
Sbjct: 516 DNRDLLRVWEQLTLARPERTASS--RFDCVNIPRQCLENYFGNLNERCIAACRGGDRETV 573
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
+LS R LEL++D+D L+A FLLG W+ A+++
Sbjct: 574 ARLSARLLELLDDIDRLVAADAYFLLGKWIADARRMG 610
>gi|291515668|emb|CBK64878.1| Alpha-N-acetylglucosaminidase (NAGLU) [Alistipes shahii WAL 8301]
Length = 713
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 159/459 (34%), Positives = 233/459 (50%), Gaps = 36/459 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W GPLP+ W+D QL LQ++IL R ELGM PVLPAF+G+VP L+ + P A+IT
Sbjct: 179 MSNIDRWQGPLPEEWIDGQLALQQRILARERELGMKPVLPAFAGHVPQELKRLHPDARIT 238
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W D R+ C++ LD DPLF I R F+ +Q + +G T HIY D F+E P
Sbjct: 239 RVSYWGGF--DDRYRCSF-LDPMDPLFAVIQREFLTEQTRLFG-TGHIYGADPFNEIDAP 294
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PE ++ + IY M D +AVWL GWLF DP W ++A L +VP +L++
Sbjct: 295 TWDPETLAGMSRHIYESMAEVDPEAVWLQMGWLFYADPTHWTAENIRAFLGAVPQDRLLM 354
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD F E IW +++F+G PY+WC L NF GN + G +++ +A + G
Sbjct: 355 LDYFCEFTEIWKQTEKFHGQPYLWCYLGNFGGNTMLSGNFHTVSARMEDAFAHGGDNLRG 414
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG ++EG N +Y+ + + A+ D WI + + RR G PA + W L +V
Sbjct: 415 VGSTLEGFGVNQFMYEFVLDKAWNTGIAD-DEWIARLADRRTGFRDPAARTGWRTLCDSV 473
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T A + A P + EG + KP T+ Y P LW
Sbjct: 474 Y--TLPAQTGQSPLTNAHPAL--------EGNWHWTTKP----------TTGYRFPTLW- 512
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
R E +A +E +TYR+D++++ RQ L Y AY +D +
Sbjct: 513 ------RVWEELLAVDSE---RDTYRFDVVNIGRQVLGDYFLIERDRFAAAYAQHDRKAM 563
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
+RR L+ D++ L ACH F L W+ +A+ +
Sbjct: 564 DAAARRMTGLLADINLLTACHPEFSLERWIAAARGFGSD 602
>gi|281210062|gb|EFA84230.1| hypothetical protein PPL_03307 [Polysphondylium pallidum PN500]
Length = 744
Score = 262 bits (669), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 159/467 (34%), Positives = 251/467 (53%), Gaps = 48/467 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGG LPQSW+ Q LQ KIL R+ E GM+PV P F+G+VP A + +PSA I
Sbjct: 214 MGNLDGWGGVLPQSWIKGQHELQIKILKRMSEYGMSPVFPGFAGHVPVAFKQFYPSANIV 273
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHI--YNCDTFDENT 118
+L +W + T L TDP++ + F + Q + YG + I ++ D F+E
Sbjct: 274 ELPSWHGFNA------TNHLLTTDPMYDIVADRFYQVQNEIYGAYAKIDYFSIDPFNELI 327
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
PP +S ++++ + I++ + + D+ W++Q W + FW Q+ + L VP+G+L+
Sbjct: 328 PPSNSSQFLNECSSRIFNAINRFNPDSTWVLQNWFLN-SAFWGDGQVASFLGGVPIGRLI 386
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
VLDL++E+KP+W+ + + G +IW MLHNF G + G + IA P+EA+ S + TMV
Sbjct: 387 VLDLWSELKPLWNRTANYQGHKWIWNMLHNFGGRPTISGRMPIIANEPLEAKAS-SPTMV 445
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G++ E IEQN +VYDLMSEM ++ D+ W++ Y RRYG ++P ++ W +L +T
Sbjct: 446 GIGLTPEAIEQNVIVYDLMSEMGWRSRSFDLNLWVDAYVTRRYGVNLPNLKPVWKMLAYT 505
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VY P+ P+ NY ++K+ L + Y +P +
Sbjct: 506 VYFS---------------PNRSPA----------NY---IAKKPSLDFQLGLYYNPVV- 536
Query: 359 YSTSEVIRAL-ELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
++ A EL + +S TYRYDL ++T QAL+ Y N + ++Y +D
Sbjct: 537 -----IVDAWRELLAVDSTIVRSSETYRYDLAEITLQALSNYFNGNLKQLYQSYYASDFQ 591
Query: 418 GVFQLSRRFLEL-VEDMDGLLACHDGFLLGPWLESAKQLA-QNEEQE 462
FQ +R+ + MD + LG W A++ A N E+E
Sbjct: 592 -TFQSARQNCSFALRAMDAVADTVQLLKLGKWTADARKWATDNNERE 637
>gi|380692804|ref|ZP_09857663.1| putative alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
Length = 709
Score = 261 bits (668), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 148/464 (31%), Positives = 238/464 (51%), Gaps = 45/464 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GGPLP W +Q VLQKKIL R+ E GM PV F G VP++L+ +P A +
Sbjct: 188 MGNLENIGGPLPDEWFKEQTVLQKKILARMREYGMKPVFQGFFGMVPSSLKEKYPEAHLV 247
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ G W S++ P +LD DPLF ++ + + + K YG+ + ++ D F E T
Sbjct: 248 EQGLWNSLQRPP------VLDPADPLFEQMAKVWYTEYEKLYGK-ADLFGGDLFHEGGKT 300
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D ++ + + M+ + DA W++Q WL + P+ K LL + +
Sbjct: 301 GGID----VTDAARRVQTAMKQYNPDATWVIQAWLGN-------PK-KELLAGLDRKHTL 348
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART--SENTT 236
++DL AE W K F G P++W + N+ NI ++G LD+IA GP++ R + +
Sbjct: 349 IVDLAAEFWDNWRKRKGFDGFPWLWSHISNYGANIGLHGRLDAIATGPIDGRKDPEASPS 408
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
M G + EGIE NPVV+DL++EM ++ E +D+ W+ +YSVRRYG ++ AW + +
Sbjct: 409 MKGTSSTPEGIEVNPVVFDLLNEMRWRSEYLDIDTWLKEYSVRRYGAEDENLKKAWIIFH 468
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
T Y G + V A P + I+ S++
Sbjct: 469 RTAYGTYSGHRRPSESVFCAPPSLKRDKITA----------------------SAWSQCR 506
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
++Y + + LF+ S + L ++TY+YD +D RQ LA E + N+++AY+ D
Sbjct: 507 IFYDPDLFAQGVGLFLQSADHLKQTSTYQYDAVDFVRQYLADLGREAYYNLVDAYRAKDT 566
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
S RFL+L++D + LL+ H F +G WL+ A+ ++ E
Sbjct: 567 KQFDYWSERFLQLIKDQNELLSTHKCFFVGRWLDMARSKSKQPE 610
>gi|313203962|ref|YP_004042619.1| alpha-N-acetylglucosaminidase [Paludibacter propionicigenes WB4]
gi|312443278|gb|ADQ79634.1| Alpha-N-acetylglucosaminidase [Paludibacter propionicigenes WB4]
Length = 738
Score = 261 bits (666), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 162/457 (35%), Positives = 241/457 (52%), Gaps = 35/457 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ WGGPLP S+++ Q LQ+ IL R LGM P+L AF+G+VP L+ + PSAKIT
Sbjct: 196 MANMDKWGGPLPISYIEGQKKLQQHILQRSRALGMKPILSAFAGHVPEQLKTLRPSAKIT 255
Query: 61 QLG-NWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
++ W + ++ TY LD TD LF EI + F+ Q K YG T H+Y+ D F+E TP
Sbjct: 256 RIEPGWGGMAAE---YTTYFLDPTDNLFGEIQKRFLTVQQKLYG-TDHLYSADPFNEITP 311
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLV 178
P P+Y++++G IY M D +A+W W F DP W P++ A++++VP GKL
Sbjct: 312 PSWEPDYLANVGKTIYETMSQVDKEAIWYQMSWTFYNDPTHWTRPRLSAMIHAVPQGKLF 371
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
LD E + + S FYG P+IWC L NF N + L+ + + + + V
Sbjct: 372 FLDYNCEEEEFFRKSDNFYGAPFIWCYLGNFGANTHLVAPLNKVV--NRLGKLTYGSACV 429
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQ-HEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
GVG ++EGI NP +Y+ + EM ++ E V I Y+ RR G A+ +AW +L
Sbjct: 430 GVGSTLEGINVNPEIYETVLEMPWRADETVTADTLIRHYAERRAGARDKAVIEAWQLLRQ 489
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
V D A +V V P + +T + +P +
Sbjct: 490 HV--LVDTAVGIWNHCVVF--QVSP-VTDLTRAFWAT-------------------NPKI 525
Query: 358 WYSTSEVIRAL-ELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
Y ++ AL +F AS N ++ YR+D+++LTRQAL Y L+ ++EAY +
Sbjct: 526 PYRNVDLAIALNRMFQASANS-KKTDAYRFDVVNLTRQALGNYGTVLYHKMMEAYSRKNL 584
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
+ S FL+L +++DGLLA FLLG WL A+
Sbjct: 585 IDFRKYSGEFLQLGQEIDGLLATRHEFLLGKWLADAR 621
>gi|295086519|emb|CBK68042.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
XB1A]
Length = 727
Score = 261 bits (666), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 147/460 (31%), Positives = 229/460 (49%), Gaps = 35/460 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSNL W GPLP+ WLD Q LQK+I+ R + M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 186 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ +W + R ++ LD DPLF I + F+E+Q K +G T HIY D F+E PP
Sbjct: 246 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 301
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PE++++ IY M D DA WL WLF D W +++A L +VP KL++
Sbjct: 302 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQDKLLL 361
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + E +W + +++G PY+WC L NF GN + G + T+ G
Sbjct: 362 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 421
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG + NP +Y+ + A+ D WI Q + RR G ++ AW +LY ++
Sbjct: 422 LGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQLADRRIGLRNQQMRRAWKLLYDSI 480
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y P+ + G ++ LK + P + Y
Sbjct: 481 YTV-------------------PAALGQ--------GALMNARPCLKGNGNWTTTPTVAY 513
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
S + E+ + +G ++ Y YD++++ RQ L Y +L EAY +
Sbjct: 514 SNETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVLGNYFGKLRDEFAEAYSRKQLPLL 571
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
Q +L+ D+D LL+ FLLG W+E A+ L +E
Sbjct: 572 KQKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLGTDE 611
>gi|336374066|gb|EGO02404.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
lacrymans S7.3]
Length = 761
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 150/476 (31%), Positives = 259/476 (54%), Gaps = 40/476 (8%)
Query: 7 WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNW- 65
WGG LP+ W++ Q LQK+I+ R+ ELGM PVLP+F+G VP A+ ++P+A I W
Sbjct: 214 WGGDLPEQWINDQFALQKQIISRMVELGMTPVLPSFTGFVPRAMHTLYPNASIVNGSQWN 273
Query: 66 -FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSP 124
F+++ + L+ DPLF + +FI +Q+ YG SH+Y D ++EN+P
Sbjct: 274 GFTIQ----YTNDSFLEPFDPLFSTLQTSFISKQVAAYGNVSHVYTLDQYNENSPYSGDT 329
Query: 125 EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLG-KLVVLDL 182
Y++++ AA ++ +++ D AVWLMQGWLF D FW +++A L VP +++LDL
Sbjct: 330 SYLANVTAATFASLRAADPQAVWLMQGWLFYSDSTFWTTERVEAYLGGVPGNDSMIILDL 389
Query: 183 FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGM 242
++E +P W ++G +IWC LH++ GN+ G +++ P++A + +MVG+G+
Sbjct: 390 YSEAQPQWQRLNSYFGKQWIWCELHDYGGNMGFEGNFENVTTQPIKALATPGNSMVGMGL 449
Query: 243 SMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVP----AIQDAWNVLYHT 298
+MEG E N ++YD++ + A+ ++ A+I+ ++ RRY +VP A +AW +L T
Sbjct: 450 TMEGQEGNEIIYDVLLDQAWSSTPLNRTAYISAWASRRY--NVPDLPTAALEAWEILGAT 507
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYN D T I+ ++ PSI L + T ++ +
Sbjct: 508 VYNNQDVTTQSTVKSIL---ELSPSITG------------------LVNRTGTHSTKLFY 546
Query: 359 YSTSEVIRALELFIASGNELSA-SNT--YRYDLIDLTRQALAKYANELFLNIIEAY--QL 413
+ + ++ AL+L + + E SA SN ++YD++D+TRQ LA +L+ ++I+ +
Sbjct: 547 DTNTTIVPALKLLLQARQEASALSNIPEFQYDVVDVTRQLLANRFIDLYTSLIDTFSSTS 606
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVRCPY 469
+ + V L L++D+D +L FLL W+ +A+ + Y
Sbjct: 607 SSSSAVSAAGAPLLALLQDLDSVLLTDTHFLLARWISAARNWTHGDNATYAAYLEY 662
>gi|336386984|gb|EGO28130.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 738
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 150/476 (31%), Positives = 259/476 (54%), Gaps = 40/476 (8%)
Query: 7 WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNW- 65
WGG LP+ W++ Q LQK+I+ R+ ELGM PVLP+F+G VP A+ ++P+A I W
Sbjct: 191 WGGDLPEQWINDQFALQKQIISRMVELGMTPVLPSFTGFVPRAMHTLYPNASIVNGSQWN 250
Query: 66 -FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSP 124
F+++ + L+ DPLF + +FI +Q+ YG SH+Y D ++EN+P
Sbjct: 251 GFTIQ----YTNDSFLEPFDPLFSTLQTSFISKQVAAYGNVSHVYTLDQYNENSPYSGDT 306
Query: 125 EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLG-KLVVLDL 182
Y++++ AA ++ +++ D AVWLMQGWLF D FW +++A L VP +++LDL
Sbjct: 307 SYLANVTAATFASLRAADPQAVWLMQGWLFYSDSTFWTTERVEAYLGGVPGNDSMIILDL 366
Query: 183 FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGM 242
++E +P W ++G +IWC LH++ GN+ G +++ P++A + +MVG+G+
Sbjct: 367 YSEAQPQWQRLNSYFGKQWIWCELHDYGGNMGFEGNFENVTTQPIKALATPGNSMVGMGL 426
Query: 243 SMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVP----AIQDAWNVLYHT 298
+MEG E N ++YD++ + A+ ++ A+I+ ++ RRY +VP A +AW +L T
Sbjct: 427 TMEGQEGNEIIYDVLLDQAWSSTPLNRTAYISAWASRRY--NVPDLPTAALEAWEILGAT 484
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYN D T I+ ++ PSI L + T ++ +
Sbjct: 485 VYNNQDVTTQSTVKSIL---ELSPSITG------------------LVNRTGTHSTKLFY 523
Query: 359 YSTSEVIRALELFIASGNELSA-SNT--YRYDLIDLTRQALAKYANELFLNIIEAY--QL 413
+ + ++ AL+L + + E SA SN ++YD++D+TRQ LA +L+ ++I+ +
Sbjct: 524 DTNTTIVPALKLLLQARQEASALSNIPEFQYDVVDVTRQLLANRFIDLYTSLIDTFSSTS 583
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVRCPY 469
+ + V L L++D+D +L FLL W+ +A+ + Y
Sbjct: 584 SSSSAVSAAGAPLLALLQDLDSVLLTDTHFLLARWISAARNWTHGDNATYAAYLEY 639
>gi|400599317|gb|EJP67021.1| alpha-N-acetylglucosaminidase, putative [Beauveria bassiana ARSEF
2860]
Length = 753
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 146/464 (31%), Positives = 242/464 (52%), Gaps = 35/464 (7%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG LPQ+W+D Q LQ+KI+ R+ ELGM P+LPAF G VP + V+P+ +
Sbjct: 201 FGNIQGSWGGDLPQAWIDDQFALQRKIIKRMVELGMTPILPAFPGFVPENITRVWPNVSL 260
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
+ W R+ + DP F E+ +AF+ +Q + YG + + D F+EN P
Sbjct: 261 AESPTWSGFSG--RFTADKYITPYDPRFAELQKAFLTKQNEAYGNVTSFWTLDQFNENKP 318
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGK-L 177
Y+ ++ + ++ D AVW+MQGWLF+ D +W ++K+ L+ VP+ + +
Sbjct: 319 ASGELGYLRNVSHNTWQTLKDADPSAVWVMQGWLFASDKAYWTDDRVKSFLDGVPVNEDM 378
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDLFAE P W + FYG P+IWC LH + GN+ +YG ++++ VEA ++ ++
Sbjct: 379 LLLDLFAESTPQWQRTDSFYGKPWIWCQLHGYGGNMGLYGQIENVTRNAVEA-VQKSPSI 437
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG---RSVPA-IQDAWN 293
VG+G+SMEG E N ++Y+L+ + A+ E ++ + + + RYG + +P + AW+
Sbjct: 438 VGLGLSMEGQEGNEIMYNLLLDQAWSKEALETDKYFSDWVTVRYGADQKEIPKDLYTAWD 497
Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
+ TVYN TD + + A ++ T G G +K
Sbjct: 498 KVRSTVYNNTDSS-------VTAVAKSIFELVPSTSGLVNRTGHHATK------------ 538
Query: 354 HPHLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEA 410
+ Y T +I A +G++ L + Y YDL D TRQ LA + ++E
Sbjct: 539 ---ITYDTETLISAWNDMFNAGSQARWLFDNEAYSYDLTDWTRQVLANAFEATYNKLVEK 595
Query: 411 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 454
Y+ N+ GV R ++ MD +L + F L W+++A++
Sbjct: 596 YKSNNIKGVKCAGSRLQAILRTMDQVLETNVHFRLSTWIQAARK 639
>gi|242809019|ref|XP_002485282.1| alpha-N-acetylglucosaminidase, putative [Talaromyces stipitatus
ATCC 10500]
gi|218715907|gb|EED15329.1| alpha-N-acetylglucosaminidase, putative [Talaromyces stipitatus
ATCC 10500]
Length = 755
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 152/473 (32%), Positives = 252/473 (53%), Gaps = 38/473 (8%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G W G LP W+D Q LQKKI+ R+ ELGM P+LPAF G VP A+ V P A +
Sbjct: 201 FGNIQGSWSGSLPYDWVDSQFDLQKKIVKRMTELGMTPILPAFPGFVPRAITRVLPDADV 260
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
W + + + ++ TDP F EI ++FI +Q++ YG + Y D F+EN P
Sbjct: 261 INGSAWEAFPT--MYTNDTFMEPTDPHFTEIQKSFIAKQIEAYGNVTTFYTLDQFNENNP 318
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPL-GKL 177
Y+ ++ + +++ DS+AVW+MQGWLF S FW +++A L V + L
Sbjct: 319 SSGDLSYLRNVSQGTWKTLKAADSNAVWVMQGWLFTSNSAFWTNDRIEAYLGGVAVDSDL 378
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL +E P W + +YG P+IWC +H++ GN+ YG + +I P+ A ++++
Sbjct: 379 LILDLASESSPQWQRTNSYYGKPWIWCEIHDYGGNMGFYGQVMNITNNPIAA-LHNSSSL 437
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVP-AIQDAWNV 294
VG G+SMEG E N +VYDL+ + A+ +D +++ + + RY RS+P ++ AW++
Sbjct: 438 VGFGLSMEGQEGNEIVYDLLLDQAWNAAPIDTESYFHDWVTARYAGSRSIPSSVYSAWDI 497
Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
L TVYN T+ A + A P +I T G G H
Sbjct: 498 LRTTVYNNTNLAAN-------AVPKAIFELIPSTTGLLNRTGH----------------H 534
Query: 355 P-HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEA 410
P L Y+T+++++A LF S + L + + +DL+D++RQ LA ++ N+I
Sbjct: 535 PTKLNYNTADMVQAWNLFYTSAFKEPSLWLNPAFEFDLVDMSRQVLANAFIPVYENLIST 594
Query: 411 YQLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
Y ++ + + + + +++ +D +LA + F L WL +A+ A ++
Sbjct: 595 YNTSNPSSTKLQTIGAELIGILQALDTVLATNKNFKLSTWLSAARASAGSQHN 647
>gi|423292430|ref|ZP_17271008.1| hypothetical protein HMPREF1069_06051 [Bacteroides ovatus
CL02T12C04]
gi|423294620|ref|ZP_17272747.1| hypothetical protein HMPREF1070_01412 [Bacteroides ovatus
CL03T12C18]
gi|392661665|gb|EIY55241.1| hypothetical protein HMPREF1069_06051 [Bacteroides ovatus
CL02T12C04]
gi|392675811|gb|EIY69252.1| hypothetical protein HMPREF1070_01412 [Bacteroides ovatus
CL03T12C18]
Length = 727
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 146/460 (31%), Positives = 228/460 (49%), Gaps = 35/460 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSNL W GPLP+ WLD Q LQK+I+ R + M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 186 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ +W + R ++ LD DPLF I + F+E+Q K +G T HIY D F+E PP
Sbjct: 246 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 301
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PE++++ IY M D DA WL WLF D W +++A L +VP KL++
Sbjct: 302 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQNKLLL 361
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + E +W + +++G PY+WC L NF GN + G + T+ G
Sbjct: 362 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 421
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG + NP +Y+ + A+ D WI Q + RR G ++ AW +LY ++
Sbjct: 422 LGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQLADRRIGLRNQQMRRAWKLLYDSI 480
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y P+ + G ++ LK + P + Y
Sbjct: 481 YTA-------------------PAALGQ--------GTLMNARPCLKGNGNWTTTPTVAY 513
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
S + E+ + +G + TY YD++++ RQ L Y +L E Y +
Sbjct: 514 SNETLFEVWEMLLKAGEHRHS--TYEYDVVNIGRQVLGNYFGKLRDEFAETYSRKQLPLL 571
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
Q +L+ D++ LL+ FLLG W+E A+ L +E
Sbjct: 572 KQKGAEMKQLLRDVNTLLSTQSSFLLGKWIEDARSLGIDE 611
>gi|160883168|ref|ZP_02064171.1| hypothetical protein BACOVA_01137 [Bacteroides ovatus ATCC 8483]
gi|156111393|gb|EDO13138.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
8483]
Length = 737
Score = 258 bits (659), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 146/460 (31%), Positives = 228/460 (49%), Gaps = 35/460 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSNL W GPLP+ WLD Q LQK+I+ R + M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 196 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ +W + R ++ LD DPLF I + F+E+Q K +G T HIY D F+E PP
Sbjct: 256 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PE++++ IY M D DA WL WLF D W +++A L +VP KL++
Sbjct: 312 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQNKLLL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + E +W + +++G PY+WC L NF GN + G + T+ G
Sbjct: 372 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG + NP +Y+ + A+ D WI Q + RR G ++ AW +LY ++
Sbjct: 432 LGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQLADRRIGLRNQQMRRAWKLLYDSI 490
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y P+ + G ++ LK + P + Y
Sbjct: 491 YTA-------------------PAALGQ--------GTLMNARPCLKGNGNWTTTPTVAY 523
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
S + E+ + +G + TY YD++++ RQ L Y +L E Y +
Sbjct: 524 SNETLFEVWEMLLKAGEHRHS--TYEYDVVNIGRQVLGNYFGKLRDEFAETYSRKQLPLL 581
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
Q +L+ D++ LL+ FLLG W+E A+ L +E
Sbjct: 582 KQKGAEMKQLLRDVNTLLSTQSSFLLGKWIEDARSLGIDE 621
>gi|237719130|ref|ZP_04549611.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|229451509|gb|EEO57300.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
Length = 737
Score = 257 bits (657), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 149/460 (32%), Positives = 232/460 (50%), Gaps = 35/460 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSNL W GPLP+ WLD Q LQK+I+ R + M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 196 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ +W + R ++ LD DPLF I + F+E+Q K +G T HIY D F+E PP
Sbjct: 256 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PE++++ IY M D DA WL WLF D W +++A L +VP KL++
Sbjct: 312 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQDKLLL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + E +W + +++G PY+WC L NF GN + G + T+ G
Sbjct: 372 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG + NP +Y+ + A+ D WI Q + RR G ++ AW +LY ++
Sbjct: 432 LGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQLADRRIGLRNQQMRRAWKLLYDSI 490
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y P+ + +G N +P K + TS+ + Y
Sbjct: 491 YTA-------------------PAALG--QGTLMN-ARPCLKGNGNWTTTST-----VAY 523
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
S + E+ + +G ++ Y YD++++ RQ L Y +L EAY +
Sbjct: 524 SNETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVLGNYFGKLRDEFAEAYSRKQLPLL 581
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
Q +L+ D+D LL+ FLLG W+E A+ L +E
Sbjct: 582 KQKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLGIDE 621
>gi|340520426|gb|EGR50662.1| glycoside hydrolase family 89 [Trichoderma reesei QM6a]
Length = 747
Score = 257 bits (657), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 147/461 (31%), Positives = 256/461 (55%), Gaps = 33/461 (7%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG LP+SW+D+Q LQ KIL R+ ELG+ P+LPAF G VP + VFP +
Sbjct: 201 FGNIQGSWGGTLPRSWVDEQFSLQLKILKRMEELGITPILPAFPGFVPRNISRVFPDISL 260
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
+ W + + ++ DP F ++ + FI +Q + YG ++ + D F+EN P
Sbjct: 261 STSPIWSNFGTTL--SADIYINPFDPRFAQLQKLFINKQQELYGNVTNFWTLDQFNENRP 318
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGK-L 177
+Y+ ++ ++ +++ D +AVW+MQ WLFS D FW +++ALL VP+ + +
Sbjct: 319 LSGDLDYLRNVSHNTWAALKAADPEAVWVMQAWLFSSDSSFWTNDRVEALLGGVPVNQDM 378
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTT 236
++LDLFAE P W + FYG P+IWC LHN+ GN+ +YG ++++ ++A R S+ +
Sbjct: 379 LLLDLFAESAPQWQRTDSFYGKPWIWCELHNYGGNMGLYGQIENVTINSMDAVRNSD--S 436
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVPAIQDAWNVL 295
+VG G++MEG E N ++YDL+ + A+ + +D + + + RYG ++V + W +L
Sbjct: 437 IVGFGLTMEGQEGNEIMYDLLLDQAWSPKPIDTDTYFHDWVSARYGAKNVKGLYKGWEML 496
Query: 296 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
TV+N T+ + + I+ ++ PSI + G+ +G + + + E S
Sbjct: 497 RPTVFNNTNLTVNAVQKSIL---ELTPSISGLL-GRTGRHGTTIMYDPAVMVEAWS---- 548
Query: 356 HLWYSTSEVIRALELFIASGNELSASN--TYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
ELF A +L+ N +Y+YDL+D TRQ L + + ++++AY
Sbjct: 549 -------------ELFKAGLQDLTLFNNPSYQYDLVDWTRQVLVNSFEDHYKDLVDAYNK 595
Query: 414 NDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
+ + V + + + L++ +D +LA + F L PW++ A+
Sbjct: 596 SSSPTVIRTRGAKLVTLLKTLDAVLATNKNFQLTPWIDRAR 636
>gi|295087651|emb|CBK69174.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
XB1A]
Length = 703
Score = 257 bits (657), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 151/472 (31%), Positives = 239/472 (50%), Gaps = 39/472 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLP SWL Q LQK+I+ R LGM PVLPAFSG+VPA L+ ++P A IT
Sbjct: 169 MSNVDYWQSPLPLSWLKNQRKLQKQIVDRERLLGMTPVLPAFSGHVPAELKRLYPDAAIT 228
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W R ++ +D DPLF +I + ++E+Q K YG T HIY D F+E P
Sbjct: 229 QMSQWGGYDEKYR---SHFIDPMDPLFGKIQKRYLEKQTKLYG-TDHIYGIDPFNEVDSP 284
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++ I+ ++ DS A W+ W+F + W P++KA LNSVP KL++
Sbjct: 285 NWDEDFLRTVSDKIFHSIEQVDSLAHWIQMTWMFYHSKDKWSQPRIKAFLNSVPDDKLIL 344
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++Q+YG PYIWC L NF GN + G +D ++ + G
Sbjct: 345 LDYYCDSVEIWRETQQYYGKPYIWCYLGNFGGNSMLAGHVDDVSAKLNRLFVEGGKNISG 404
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG ++EG++ NP +Y+ + E A+ H + W+ +++ R G I DAW LY +
Sbjct: 405 VGATLEGLDVNPFMYEFVLEKAWSHTITNAD-WMKNWALCRGGSKSSHIIDAWQQLYKKI 463
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y I T G+ ++ +L+ S HP ++Y
Sbjct: 464 Y-----------------------IHHATAGQ----AVLMNARPMLEGTDSWNTHPDIYY 496
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+ F+ + N S+ Y++D+I++ RQ L ++ + Y+ + G+
Sbjct: 497 DNKELWHIWGKFLEAKN--VDSSGYKFDVINIGRQVLGNLFSDFRDSFTACYRQKNIEGM 554
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESA----KQLAQNEEQEKQVRC 467
+ + + L D+D LL+C F +G W++ A K L + E E+ RC
Sbjct: 555 KEWAEKMNTLFTDVDRLLSCESSFSIGKWIKDARDWGKNLKEKEYYEQNARC 606
>gi|423287380|ref|ZP_17266231.1| hypothetical protein HMPREF1069_01274 [Bacteroides ovatus
CL02T12C04]
gi|392672495|gb|EIY65962.1| hypothetical protein HMPREF1069_01274 [Bacteroides ovatus
CL02T12C04]
Length = 726
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 151/472 (31%), Positives = 239/472 (50%), Gaps = 39/472 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLP SWL Q LQK+I+ R LGM PVLPAFSG+VPA L+ ++P A IT
Sbjct: 192 MSNVDYWQSPLPLSWLKNQRKLQKQIVDRERLLGMTPVLPAFSGHVPAELKRLYPDAAIT 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W R ++ +D DPLF +I + ++E+Q K YG T HIY D F+E P
Sbjct: 252 QMSQWGGYDKKYR---SHFIDPMDPLFGKIQKRYLEKQTKLYG-TDHIYGIDPFNEVDSP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++ I+ ++ DS A W+ W+F + W P++KA LNSVP KL++
Sbjct: 308 NWDEDFLRTVSDKIFHSIEQVDSLAHWIQMTWMFYHSKDKWSQPRIKAFLNSVPDDKLIL 367
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++Q+YG PYIWC L NF GN + G +D ++ + G
Sbjct: 368 LDYYCDSVEIWRETQQYYGKPYIWCYLGNFGGNSMLAGHVDDVSAKLNRLFVEGGKNISG 427
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG ++EG++ NP +Y+ + E A+ H + W+ +++ R G I DAW LY +
Sbjct: 428 VGATLEGLDVNPFMYEFVLEKAWSHTITNAD-WMKNWALCRGGSKSSHIIDAWQQLYKKI 486
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y I T G+ ++ +L+ S HP ++Y
Sbjct: 487 Y-----------------------IHHATAGQ----AVLMNARPMLEGTDSWNTHPDIYY 519
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+ F+ + N S+ Y++D+I++ RQ L ++ + Y+ + G+
Sbjct: 520 DNKELWHIWGKFLEAKN--VDSSGYKFDVINIGRQVLGNLFSDFRDSFTACYRQKNIEGM 577
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESA----KQLAQNEEQEKQVRC 467
+ + + L D+D LL+C F +G W++ A K L + E E+ RC
Sbjct: 578 KEWAEKMNTLFTDVDRLLSCESSFSIGKWIKDARDWGKNLKEKEYYEQNARC 629
>gi|423213214|ref|ZP_17199743.1| hypothetical protein HMPREF1074_01275 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693674|gb|EIY86904.1| hypothetical protein HMPREF1074_01275 [Bacteroides xylanisolvens
CL03T12C04]
Length = 726
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 151/472 (31%), Positives = 239/472 (50%), Gaps = 39/472 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLP SWL Q LQK+I+ R LGM PVLPAFSG+VPA L+ ++P A IT
Sbjct: 192 MSNVDYWQSPLPLSWLKNQRKLQKQIVDRERLLGMTPVLPAFSGHVPAELKRLYPDAAIT 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W R ++ +D DPLF +I + ++E+Q K YG T HIY D F+E P
Sbjct: 252 QMSQWGGYDEKYR---SHFIDPMDPLFGKIQKRYLEKQTKLYG-TDHIYGIDPFNEVDSP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++ I+ ++ DS A W+ W+F + W P++KA LNSVP KL++
Sbjct: 308 NWDEDFLRTVSDKIFHSIEQVDSLAHWIQMTWMFYHSKDKWSQPRIKAFLNSVPDDKLIL 367
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++Q+YG PYIWC L NF GN + G +D ++ + G
Sbjct: 368 LDYYCDSVEIWRETQQYYGKPYIWCYLGNFGGNSMLAGHVDDVSAKLNRLFVEGGKNISG 427
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG ++EG++ NP +Y+ + E A+ H + W+ +++ R G I DAW LY +
Sbjct: 428 VGATLEGLDVNPFMYEFVLEKAWSHTITNAD-WMKNWALCRGGSKSSHIIDAWQQLYKKI 486
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y I T G+ ++ +L+ S HP ++Y
Sbjct: 487 Y-----------------------IHHATAGQ----AVLMNARPMLEGTDSWNTHPDIYY 519
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+ F+ + N S+ Y++D+I++ RQ L ++ + Y+ + G+
Sbjct: 520 DNKELWHIWGKFLEAKN--VDSSGYKFDVINIGRQVLGNLFSDFRDSFTACYRQKNIEGM 577
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESA----KQLAQNEEQEKQVRC 467
+ + + L D+D LL+C F +G W++ A K L + E E+ RC
Sbjct: 578 KEWAEKMNTLFTDVDRLLSCESSFSIGKWIKDARDWGKNLKEKEYYEQNARC 629
>gi|153808241|ref|ZP_01960909.1| hypothetical protein BACCAC_02529 [Bacteroides caccae ATCC 43185]
gi|423219048|ref|ZP_17205544.1| hypothetical protein HMPREF1061_02317 [Bacteroides caccae
CL03T12C61]
gi|149129144|gb|EDM20360.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
43185]
gi|392625814|gb|EIY19870.1| hypothetical protein HMPREF1061_02317 [Bacteroides caccae
CL03T12C61]
Length = 752
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 165/465 (35%), Positives = 227/465 (48%), Gaps = 51/465 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+SW+D+ ++L KKI+ R ELGM P+ FSG VP L++ +P AKI
Sbjct: 193 MQNLQSYGGPLPKSWIDKHIILAKKIIDRERELGMTPIQQGFSGYVPRELKDKYPEAKI- 251
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
+ P WC LD TD LF +GR F+E++ K YG T IY D F E+
Sbjct: 252 --------RLQPGWCGFKGAGQLDPTDALFATLGRDFLEEEKKLYG-TYGIYAADPFHES 302
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
PPV++PEY+S++G AIY ++ D A W MQ W R P +KA VP L
Sbjct: 303 APPVNTPEYLSAVGHAIYKLIKDFDPKAKWAMQAWSL------REPIVKA----VPQNDL 352
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL E K F+G P + LHNF G I M+G L +A + +
Sbjct: 353 IILDLNGEKI---KGRKGFWGYPAVEGNLHNFGGRINMHGDLRLLASNQYMTALKQYPNV 409
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G G+ ME IEQNPV YDL EM +V ++ W+ QY+ RRYG P+ Q A L
Sbjct: 410 CGSGLFMEAIEQNPVYYDLAFEMPLHKGEVAIEEWLKQYANRRYGAVSPSAQQAMICLLE 469
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
Y T+ R I+A P++ G G P
Sbjct: 470 GPYRPGTNGTE--RSSIIA---ARPALNVKKSGPNAGLGIP------------------- 505
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
YS VI+A L + ++L S YR+D+ID+ RQ + + EA+ D
Sbjct: 506 -YSPLLVIQAEGLLLKDADKLKNSEPYRFDVIDVQRQMMTNMGQVIHKRAAEAFLNRDKE 564
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
S+RFL+++ED+D LL F WL SA+ EE++
Sbjct: 565 AFALHSKRFLQMLEDVDELLRTRPEFNFDRWLTSARSWGDTEEEK 609
>gi|281200617|gb|EFA74835.1| alpha-N-acetylglucosaminidase [Polysphondylium pallidum PN500]
Length = 688
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 126/284 (44%), Positives = 180/284 (63%), Gaps = 9/284 (3%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ W G L W+ Q LQ +IL R+ + GM VLP F+G+VP AL+ +P A IT
Sbjct: 213 MGNVNEWAGNLTLGWMADQRDLQIQILTRMRQFGMQAVLPGFAGHVPEALETHYPKANIT 272
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG W + TY L+ DPLF +I +AF+ Q + YG T H YN D F+E PP
Sbjct: 273 QLGGWGTFSG------TYYLNPDDPLFSKIAQAFVITQNQLYG-TDHFYNFDPFNELEPP 325
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
Y+ + ++++ + + D +W++QGW DP FW PPQ +A L+ VP+GK++V
Sbjct: 326 SSDLTYLKNCSQSMFNNLIAADPQGIWVLQGWFLVDDPEFWLPPQTEAFLSGVPIGKMIV 385
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL+++V P W+++ +YG +IWCMLHNF G MYG + I+ P+EAR S + MVG
Sbjct: 386 LDLWSDVIPAWNSTNYYYGHNWIWCMLHNFGGRSGMYGKIPFISTNPIEAR-SLSPNMVG 444
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
G++ E IEQN +VYDLMSEMA++ D+K W++QY RRYG+
Sbjct: 445 TGLTPEAIEQNVIVYDLMSEMAWRSTPPDLKEWVDQYVTRRYGK 488
>gi|383114162|ref|ZP_09934927.1| hypothetical protein BSGG_1664 [Bacteroides sp. D2]
gi|382948607|gb|EFS30964.2| hypothetical protein BSGG_1664 [Bacteroides sp. D2]
Length = 727
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 148/459 (32%), Positives = 231/459 (50%), Gaps = 35/459 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSNL W GPLP+ WLD Q LQK+I+ R + M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 186 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ +W + R ++ LD DPLF I + F+E+Q K +G T HIY D F+E PP
Sbjct: 246 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 301
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PE++++ IY M D DA WL WLF D W +++A L +VP KL++
Sbjct: 302 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQDKLLL 361
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + E +W + +++G PY+WC L NF GN + G + T+ G
Sbjct: 362 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 421
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG + NP +Y+ + A+ D WI Q + RR G ++ AW +LY ++
Sbjct: 422 LGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQLADRRIGLRNQQMRRAWKLLYDSI 480
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y P+ + +G N +P K + TS+ + Y
Sbjct: 481 YTA-------------------PAALG--QGTLMN-ARPCLKGNGNWTTTST-----VAY 513
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
S + E+ + +G ++ Y YD++++ RQ L Y +L EAY +
Sbjct: 514 SNETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVLGNYFGKLRDEFAEAYSRKQLPLL 571
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
Q +L+ D+D LL+ FLLG W+E A+ L +
Sbjct: 572 KQKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLGTD 610
>gi|391338146|ref|XP_003743422.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Metaseiulus
occidentalis]
Length = 665
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 146/460 (31%), Positives = 235/460 (51%), Gaps = 38/460 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GGPLP SW QQ +LQK IL R+ + GM PV+P F+G VP A + + P+ +
Sbjct: 188 MGNLRGFGGPLPSSWQLQQQLLQKMILRRMRDFGMTPVVPGFNGFVPRAFERLHPAVSWS 247
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + + + L T+ F+ + +I YG + H+Y+ D F+E TP
Sbjct: 248 RASRWNNFPDE--YAMLTFLAPTESFFLNVSSLYITMYRSIYG-SDHLYSVDLFNEETPD 304
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ P ++ + + +Y + D +W+MQGWLF + +W ++KA L PLGK++V
Sbjct: 305 TNDPAALAEMSSNVYESIAKADPKGIWVMQGWLFVHGGDYWNHDRVKAFLGGPPLGKMIV 364
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLF+E P + ++G P+IWCMLHN+ G ++G L+ I P+ R S M+G
Sbjct: 365 LDLFSEQSPQFPRFSNYFGQPFIWCMLHNYGGVSGLFGNLEWINSEPLNVRRSV-PNMIG 423
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G++ EG QN V+Y+ M+E +++ +V W+ Y RYG S P +++AW +L +V
Sbjct: 424 IGIAPEGTGQNEVIYEFMAENSYRDSSENVSLWLQNYVGARYGLSDPHLENAWELLRKSV 483
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y S+T +N+G +L P +WY
Sbjct: 484 Y-------------------------SLTSKSIENHGN-----YILTHRPKLNSTPLIWY 513
Query: 360 STSEVIRALELFIASGN---ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
+ S+VI A I EL + DL+D+ RQAL ++ +L ++ ++ N
Sbjct: 514 NGSDVIGAATELIRGATLHRELCHERLFHQDLVDVVRQALQVRVSDEYLQMMSHFKANSL 573
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
+ SRR L + +D +L+ FLLG WL +++ A
Sbjct: 574 IDFEEHSRRLLHCIRVLDKVLSTDPNFLLGSWLRDSRESA 613
>gi|212537509|ref|XP_002148910.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
18224]
gi|210068652|gb|EEA22743.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
18224]
Length = 768
Score = 255 bits (652), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 149/474 (31%), Positives = 248/474 (52%), Gaps = 37/474 (7%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WG PLP +W+D Q LQKKI+ R+ ELGM P+LPAF G VP A+ V P A +
Sbjct: 210 FGNIQGSWGSPLPYAWVDSQFDLQKKIVKRMVELGMTPILPAFPGFVPRAITRVLPDADV 269
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
W + + + ++ TDP F EI ++FI +Q YG + Y D F+EN P
Sbjct: 270 INGSAWEAFPA--MFTSDTFMEPTDPHFTEIQKSFISKQTAAYGNVTTFYTLDQFNENNP 327
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPL-GKL 177
Y+ S+ + +++ D AVW+MQGWLF S FW +++A L V + L
Sbjct: 328 SSGDLNYLRSVSHGTWQALKAADPSAVWVMQGWLFFSNSAFWTNDRVEAYLGGVTVDSDL 387
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
+VLDL +E +P W + ++G P+IWC +H++ GN+ YG + +I P+ A + ++
Sbjct: 388 LVLDLASESQPQWQRTNSYFGKPWIWCQIHDYGGNMGFYGQVMNITVNPIAALNNATASL 447
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPA-IQDAWNV 294
VG G+SMEG E N VVYDL+ + A+ + +D + + + RY +S+P + AW++
Sbjct: 448 VGFGLSMEGQEGNEVVYDLLLDQAWSAKPIDTATYFHDWVTARYAGSKSIPTDVYSAWDM 507
Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
L +VYN T+ A++ A P +I T G G H
Sbjct: 508 LRTSVYNNTNLASN-------AVPKAIFELIPSTTGLVNRTGH----------------H 544
Query: 355 P-HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEA 410
P L Y+ +++++A LF ++ + L + Y +DL+D++RQ LA ++ ++I A
Sbjct: 545 PTTLNYNPADMVKAWSLFYSAAFKEPSLWLNPAYEFDLVDMSRQVLANAFIPVYHDLIAA 604
Query: 411 YQLNDAHGVF--QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ + + + + +++ +D +L ++ F L W+ +A+ A + E
Sbjct: 605 WNTTNPSTIRIQIIGAELIGILQAIDTILDTNEHFKLSTWISAARTSAGEQSLE 658
>gi|195454475|ref|XP_002074254.1| GK18384 [Drosophila willistoni]
gi|194170339|gb|EDW85240.1| GK18384 [Drosophila willistoni]
Length = 743
Score = 254 bits (650), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 142/435 (32%), Positives = 233/435 (53%), Gaps = 38/435 (8%)
Query: 32 ELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIG 91
ELG++ LPAF+G+VP AL+ +FP A T+ W + +CC ++ +PLF ++
Sbjct: 262 ELGISVALPAFAGHVPRALRRIFPQANFTETERWNRFPNA--YCCDLFVEPQEPLFRQLA 319
Query: 92 RAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 151
F+ + + YG ++HI+ CD F+E PPV +++ S AAIY+ M+ D A+WL+QG
Sbjct: 320 TTFLRRVTQRYG-SNHIFFCDPFNELEPPVSQADFMRSTAAAIYASMREVDPKAIWLLQG 378
Query: 152 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 211
W+F + FW ++A L +VP G L+VLDL +E P + +K +YG P++WCMLHNF G
Sbjct: 379 WMFVKNIFWTDELIEAFLTAVPQGNLLVLDLQSEQFPQYQRTKSYYGQPFVWCMLHNFGG 438
Query: 212 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 271
+ M G ++ + G AR N++MVG G++ EGI QN V+Y E + K+D
Sbjct: 439 TLGMLGSVELVNSGMDLARQMPNSSMVGAGITPEGIGQNYVMYSFALERGWSDRKLDSAG 498
Query: 272 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 331
W +++ RYG + AW +L +VY + + I P ++ S
Sbjct: 499 WFTHFALTRYGVQDERLNQAWQLLRTSVYTFHGLQKMRGKYTITRRPAINLS-------- 550
Query: 332 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL----SASNTYRYD 387
P WY+ + V+ A +L +++ + + + + Y++D
Sbjct: 551 -----------------------PFTWYNVTHVLEAWQLMLSARSIIPLDDNRYDIYQHD 587
Query: 388 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 447
L+D+TRQ L A++L++N+ +Y+ L + LEL++D++ +L FLLG
Sbjct: 588 LVDITRQYLQITADQLYVNLNSSYRKRQLARFVYLGNKLLELLDDLERILGSGSNFLLGT 647
Query: 448 WLESAKQLAQNEEQE 462
WLE+AK LA E +
Sbjct: 648 WLEAAKLLAPTVEDQ 662
>gi|452988463|gb|EME88218.1| glycoside hydrolase family 89 protein [Pseudocercospora fijiensis
CIRAD86]
Length = 772
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 143/474 (30%), Positives = 251/474 (52%), Gaps = 37/474 (7%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG LPQSW+D Q L KKI+ R+ ELGM PVLP F+G VP + ++P+A
Sbjct: 200 FGNIQGSWGGDLPQSWIDHQFELNKKIVARMVELGMTPVLPCFTGFVPTQISRLYPNASF 259
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
W +++ + L+ DPLF + ++FI +Q++ YG S IY D ++EN P
Sbjct: 260 VNGSRWNGFQAE--YTNVTFLEPFDPLFTTLQKSFISKQIEAYGNVSSIYTLDQYNENDP 317
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLV 178
Y+ ++ + +++ D +A+W +QGWLF S FW +++A L V ++
Sbjct: 318 FSGELAYLKNVTSNTIKSLKAADPEAIWFIQGWLFYSSADFWTDERVEAYLGGVANEDML 377
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
+LDLF+E +P W + ++G P+IWC LH++ GN ++G ++++ PV+A ++ +TMV
Sbjct: 378 ILDLFSESQPQWQRTNSYFGKPWIWCQLHDYGGNQGLHGQVENVTINPVQALANKTSTMV 437
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVPA-IQDAWNVLY 296
G+G +MEG E N ++YD++ + A+ E +D ++ + + RY G +P+ + AW+V+
Sbjct: 438 GMGSTMEGQEGNEIIYDILLDQAWSKEPIDSDSYFHDWVTSRYAGSKLPSGLYTAWDVMR 497
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
TVYN TD + V + +++P+ + + + + VL S +
Sbjct: 498 QTVYNSTD--IEAAEAVTKSIFELEPNTTGLLNRRGHHSTLILYDPNVLVSAWN------ 549
Query: 357 LWYSTSEVIRALELFIASGNELSA--SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
+L+ AS +++ Y++DL+D TRQ LA L+ + + +
Sbjct: 550 ------------DLYNASNDDIQLWDVKAYQFDLVDTTRQVLANAFYPLYTDFVHSAN-K 596
Query: 415 DAHGVF------QLSRRFLELVEDMDGLLACHDG--FLLGPWLESAKQLAQNEE 460
G + + + + L++D+D +L F L W+ESA+ A E+
Sbjct: 597 SVQGTYSPTKAEEKGKEMIMLLKDLDSVLEASGNAHFKLSSWIESARLWAPAED 650
>gi|29348998|ref|NP_812501.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340905|gb|AAO78695.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
thetaiotaomicron VPI-5482]
Length = 732
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 153/472 (32%), Positives = 240/472 (50%), Gaps = 39/472 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK+IL R E M PVLPAF+G+VPA L+ ++P+AKI
Sbjct: 196 MSNVDFWQSPLPQSWLKDQEELQKRILEREREFDMTPVLPAFAGHVPAELKTIYPNAKIY 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W R ++ +D D L+ I R F+E+Q K YG T HIY D F+E P
Sbjct: 256 QMSQWGGFDEKYR---SHFIDPMDSLYSIIQRRFLEEQTKVYG-TDHIYGIDPFNEVDSP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
S ++++++ + IY + DS A WL W+F YD W P++++ L +VP KL++
Sbjct: 312 DWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYDKKKWTQPRIRSFLKAVPDNKLIL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++++YG PYIWC L NF GN + G L+ I F + G
Sbjct: 372 LDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIAGNLNDIDFKIKRLFKEGGDNVYG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG + NP++Y+ + + A+ + V WI +S+ R G I AW L+ +
Sbjct: 432 LGATLEGFDVNPLMYEFVFDQAWDY-PVTTDQWITNWSMCRGGNQDANIIKAWRALHQKI 490
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T+ AT ++ A P +T K N + Y + LW
Sbjct: 491 Y--TEHATCGQSVLMNARP-------RLTGTKSWNTNPGI-----------HYANNDLWQ 530
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+++A + ++ +R+D+I++ RQ L +E Y D G+
Sbjct: 531 IWKELLKARNI---------NNSDFRFDVINIGRQVLGNLFSEYRDQFTACYNRKDTTGM 581
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ----LAQNEEQEKQVRC 467
+ S R L+ D+D LL+C +G WL+ A+ +++ + E+ RC
Sbjct: 582 REWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARNCGATVSEKDYYEENARC 633
>gi|395331391|gb|EJF63772.1| alpha-N-acetylglucosaminidase [Dichomitus squalens LYAD-421 SS1]
Length = 750
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 157/476 (32%), Positives = 259/476 (54%), Gaps = 33/476 (6%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG LP +W+D Q LQK+IL R+ ELGM PVLP+F+G VP AL +++P+A I
Sbjct: 195 FGNIQGSWGGDLPVTWVDDQFQLQKQILQRMVELGMTPVLPSFTGFVPRALSSLYPNASI 254
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
W + L+ DPLF I +FI +Q + YG SHIY D ++EN P
Sbjct: 255 VNGSQWEGFPT--ALTNDSFLEPFDPLFTTIQTSFISKQREAYGNVSHIYALDQYNENDP 312
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLG-KL 177
P Y++++ A ++ +++ D DAVWLMQGWLF S FW +++A L VP +
Sbjct: 313 FSGDPAYLANVTAGTFASLRAADPDAVWLMQGWLFFSSAAFWTNERIEAYLGGVPGNDSM 372
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL++E +P W+ + +YG ++WC LH + GNI M G LD++ P+ A + ++M
Sbjct: 373 IILDLYSEAQPQWNRTSSYYGKQWVWCELHGYGGNIGMEGDLDALTQNPIAALHAPGSSM 432
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVP-AIQDAWNVL 295
GVG++MEG E N +VYD++ + A+ +++ ++++Q+ RRY R +P + DAW L
Sbjct: 433 KGVGLTMEGQEGNELVYDILLDQAWSSAPLNLSSYVDQWVARRYNVRRLPKSALDAWRTL 492
Query: 296 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
TVY+ D T + I + ++ P++ +T + T +
Sbjct: 493 ATTVYSNKDSGT---QAAIKSIYELAPALTGMT------------------NRTGHHPTA 531
Query: 356 HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
+ + S V+ A + + + +E L+ + YD++D+TRQ L+ + + ++ Y
Sbjct: 532 IPYDTNSTVLVAAKALLEARSENPLLATIPEFAYDVVDVTRQLLSNRFIDHYNVLVATYN 591
Query: 413 LNDA--HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVR 466
N V + L L++D+D LLA ++ FLL W+ AK+ ++ R
Sbjct: 592 SNATAPRNVAAAAGPLLALLDDLDELLATNEHFLLSNWIADAKRWTHGADRAAYAR 647
>gi|393788286|ref|ZP_10376416.1| hypothetical protein HMPREF1068_02696 [Bacteroides nordii
CL02T12C05]
gi|392655959|gb|EIY49600.1| hypothetical protein HMPREF1068_02696 [Bacteroides nordii
CL02T12C05]
Length = 757
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 155/463 (33%), Positives = 230/463 (49%), Gaps = 48/463 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP++ +D+ L KKI+ R ELGM P+ FSG VP L+ +P+A I
Sbjct: 196 MQNLQSYGGPLPKTVIDKHAALGKKIISRQLELGMQPIQQGFSGYVPRELKEKYPTANIN 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q +W K + LD TD LF +GRAF+E+Q + +G +Y D F E+ PP
Sbjct: 256 QQRSWCGFKGAAQ------LDPTDSLFTRMGRAFLEEQARLFG-AHGVYAADPFHESAPP 308
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+D+PEY+ ++G I+ + D + W MQ W D ++ +VP L++L
Sbjct: 309 IDTPEYLKAVGERIHHLFRDFDPHSTWAMQSWSLRED----------IVKAVPKDALLIL 358
Query: 181 DLFAEVKPIWSTSKQ-FYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
DL + STSK F+G + LHNF G I M+G L +A N + G
Sbjct: 359 DLNGK-----STSKALFWGYSTVVGNLHNFGGRINMHGDLKLLASNQYSKAKRLNPAVCG 413
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G+ ME +EQNPV Y+L EM + ++++AW+ QY+ RRYG PA Q+AW +L +
Sbjct: 414 SGLFMEAVEQNPVYYELAFEMPCHADSINLQAWLKQYATRRYGAFSPAAQEAWLLLLNGP 473
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T+K+ ++ A P +D V K + YD P L
Sbjct: 474 YRRGTNGTEKS-SIVAARPALD-----------------VKKSGPNAALEIPYD-PTL-- 512
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
VIRA L + ++LS S YR+D++D+ RQ + + EA++ D
Sbjct: 513 ----VIRAQSLLLKDIDKLSVSRPYRFDIVDVQRQLMTNLGQLIHRQAAEAFRKKDQCAF 568
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
S RFLE++ DMD LL + WL A+ +E++
Sbjct: 569 TLHSGRFLEMLADMDKLLRTRSEYSFDRWLTEARSWGDTDEEK 611
>gi|298386708|ref|ZP_06996263.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 1_1_14]
gi|298260382|gb|EFI03251.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 1_1_14]
Length = 732
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 152/472 (32%), Positives = 240/472 (50%), Gaps = 39/472 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK+IL R E M PVLPAF+G+VPA L+ ++P+AKI
Sbjct: 196 MSNVDYWQSPLPQSWLKDQEELQKRILEREREFDMTPVLPAFAGHVPAELKTIYPNAKIY 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W R ++ +D D L+ I R F+E+Q K YG T HIY D F+E P
Sbjct: 256 QMSQWGGFDEKYR---SHFIDPMDSLYSIIQRRFLEEQTKVYG-TDHIYGIDPFNEVDSP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
S ++++++ + IY + DS A WL W+F YD W P++++ L +VP KL++
Sbjct: 312 DWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYDKKKWTQPRIRSFLKAVPDNKLIL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++++YG PYIWC L NF GN + G L+ I F + G
Sbjct: 372 LDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIAGNLNDIDFKIKRLFKEGGDNVYG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG + NP++Y+ + + A+ + V WI +S+ R G I AW L+ +
Sbjct: 432 LGATLEGFDVNPLMYEFVFDQAWDYS-VTTDQWITNWSMCRGGNQDANIIKAWRALHQKI 490
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T+ AT ++ A P +T K N + Y + LW
Sbjct: 491 Y--TEHATCGQSVLMNARP-------RLTGTKSWNTNPGI-----------HYANNDLWQ 530
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+++A + ++ +R+D+I++ RQ L ++ Y D G+
Sbjct: 531 IWKELLKARNI---------NNSDFRFDVINIGRQVLGNLFSKYRDQFTACYNRKDTTGM 581
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ----LAQNEEQEKQVRC 467
+ S R L+ D+D LL+C +G WL+ A+ +++ + E+ RC
Sbjct: 582 REWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARNCGATVSEKDYYEENARC 633
>gi|392584963|gb|EIW74305.1| glycoside hydrolase family 89 protein [Coniophora puteana
RWD-64-598 SS2]
Length = 772
Score = 251 bits (642), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 149/479 (31%), Positives = 249/479 (51%), Gaps = 53/479 (11%)
Query: 8 GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFS 67
GG LPQ W+D QL LQK+I+ RI ELGM PVLPAF G VP A+ +FP+A I +
Sbjct: 221 GGKLPQEWMDAQLALQKQIVPRIVELGMTPVLPAFPGFVPPAMHTLFPNASIVNGSEYPG 280
Query: 68 VKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYI 127
+ + ++ L DPL+ ++ +F+ +Q + G +H++ D ++EN+P Y+
Sbjct: 281 IPA--QYSNDSFLAPFDPLYAQLQSSFLAKQTEALGNVTHVWTIDQYNENSPYSGDLTYL 338
Query: 128 SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 186
+++ + ++ +++ D DA+WLMQGWLF D PFW ++ A L+ +P +++LDLF++V
Sbjct: 339 ANIANSTFASLRAHDPDAIWLMQGWLFFADEPFWTSDRVDAYLDQIPNDGMIILDLFSDV 398
Query: 187 KPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEG 246
P W + G ++WC +H+F GN+ + G + GPV+A S N++M GVG++MEG
Sbjct: 399 YPQWQRLDSYRGKSWVWCEVHDFGGNMGLEGNFSVVTNGPVDALNSPNSSMKGVGLAMEG 458
Query: 247 IEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-------------GRSVPAIQ-DAW 292
+E N ++YD++ + A+ +D A+ ++ RR+ S+PA +AW
Sbjct: 459 LEGNEIIYDVLLDQAWSAAPLDRDAYAKAWATRRFHLPTANSSTTTATNTSIPASAIEAW 518
Query: 293 NVLYHTVYNCTD----GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
L TVY+ T+ GAT S+I + Y P S +
Sbjct: 519 QTLASTVYSSTNPNVWGATK--------------SLIELAPSLGGMYSAPSSTIIFYDTN 564
Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFL 405
TS ++ AL +A+G L A + +R D ID+ RQ LA + +
Sbjct: 565 TS-------------LVPALRGLVAAGTSAPALWALDEFRTDSIDVARQLLANRFADAYT 611
Query: 406 NIIEAYQLN--DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
AY + + + + R +++++D+D LL H+ +LL + SA+ A + E
Sbjct: 612 ATTGAYNASGPGSAALNATAARMMQIIDDLDRLLMTHEPYLLSSRIASARAWAGDGGDE 670
>gi|403416059|emb|CCM02759.1| predicted protein [Fibroporia radiculosa]
Length = 705
Score = 251 bits (641), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 148/470 (31%), Positives = 254/470 (54%), Gaps = 37/470 (7%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G W G LP W++ Q LQ++I+ R+ ELGM PVLPAF+G VP A+ ++P+A I
Sbjct: 156 FGNIQGSWSGALPTQWINDQWALQQQIVQRMVELGMTPVLPAFTGFVPRAMSTLYPNASI 215
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENT 118
W S + T L+ DPLF + ++FI +Q YG SH+Y D ++EN
Sbjct: 216 VNGSQWEGFPSTLTY--TTFLEPFDPLFTTMQKSFISKQQAAYGANVSHVYTLDQYNEND 273
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLG-K 176
P Y++++ A ++ +Q+ D +AVW+MQGWL F+ + FW ++ A L +VP
Sbjct: 274 PYSGDVGYLANISAGTFASLQAADPEAVWMMQGWLFFASEAFWTTERIAAFLGAVPSNDS 333
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
+++LDL++E P W + +YG +IWC LH+F GN+ G L + GP++A S ++
Sbjct: 334 MIILDLYSEAAPQWQRTDSYYGKQWIWCELHDFGGNMGFEGNLPELVTGPIQAL-SNASS 392
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVP-AIQDAWNV 294
M G+G++ EG E N +VYD++ + A+ +D+ +++ + RRY + +P A Q+AW +
Sbjct: 393 MRGMGLTPEGQEGNEIVYDILLDQAWSSTSIDIASYVEAWVARRYTVQDLPSAAQEAWTI 452
Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
L TVY+ +D P+ +I S+ E ++ + ++ + +
Sbjct: 453 LSTTVYSNSD-------------PNTQATIKSIFE---------LAPDLSGLTDRTGHHC 490
Query: 355 PHLWYSTS-EVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEA 410
+ Y T+ ++ AL+ + + E L + + YD++D+TRQ LA +++ ++
Sbjct: 491 TEIPYDTNITIVPALQNLVQAATENPLLLSVPEFMYDVVDVTRQLLANRFIDVYNELVST 550
Query: 411 YQLN--DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
+ A V + L ++ D+D LL +D FLL W+ A L+ N
Sbjct: 551 FYSTGVTAASVKNAGQPLLTILSDVDTLLWTNDNFLLSNWILGAINLSDN 600
>gi|383124408|ref|ZP_09945072.1| hypothetical protein BSIG_3565 [Bacteroides sp. 1_1_6]
gi|251839096|gb|EES67180.1| hypothetical protein BSIG_3565 [Bacteroides sp. 1_1_6]
Length = 732
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 152/472 (32%), Positives = 239/472 (50%), Gaps = 39/472 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK+IL R E M PVLPAF+G+VPA L+ ++P+AKI
Sbjct: 196 MSNVDYWQSPLPQSWLKDQEELQKRILEREREFDMTPVLPAFAGHVPAELKTIYPNAKIY 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W R ++ +D D L+ I R F+E+Q K YG T HIY D F+E P
Sbjct: 256 QMSQWGGFDEKYR---SHFIDPMDSLYQVIQRRFLEEQTKVYG-TDHIYGIDPFNEVDSP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
S ++++++ + IY + DS A WL W+F YD W P++++ L +VP KL++
Sbjct: 312 DWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYDKKKWTQPRIRSFLKAVPDDKLIL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++++YG PYIWC L NF GN + G L+ I F + G
Sbjct: 372 LDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIAGNLNDIDFKIKRLFKEGGDNVYG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG + NP++Y+ + + A+ + V WI +S+ R G I AW L+ +
Sbjct: 432 LGATLEGFDVNPLMYEFVFDQAWDY-PVTTDQWITNWSMCRGGDQDANIIKAWRALHQNI 490
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T+ A ++ A P +T K N + Y + LW
Sbjct: 491 Y--TEYAICGQSVLMNARP-------RLTGTKSWNTNPGIH-----------YANNDLWQ 530
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+++A + ++ +R+D+I++ RQ L +E Y D G+
Sbjct: 531 IWKELLKARNI---------NNSDFRFDVINIGRQVLGNLFSEYRDQFTACYNRKDTTGM 581
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ----LAQNEEQEKQVRC 467
+ S R L+ D+D LL+C +G WL+ A+ +++ + E+ RC
Sbjct: 582 REWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARDCGTTVSEKDYYEENARC 633
>gi|410100551|ref|ZP_11295511.1| hypothetical protein HMPREF1076_04689 [Parabacteroides goldsteinii
CL02T12C30]
gi|409215586|gb|EKN08585.1| hypothetical protein HMPREF1076_04689 [Parabacteroides goldsteinii
CL02T12C30]
Length = 739
Score = 248 bits (633), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 158/469 (33%), Positives = 234/469 (49%), Gaps = 56/469 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGPLPQSW+D L ++IL R ELGM P+L +F+G VP L+ +P A+I
Sbjct: 194 MTNIETYGGPLPQSWIDSHQALGQQILERQRELGMTPILQSFTGFVPIKLKEKYPDARI- 252
Query: 61 QLGNWFSVKSDPRWC----CTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
K RWC T LD DPLF E+G+AF+E+Q K YG T+HIY D F E
Sbjct: 253 --------KDKNRWCNAFTATVQLDPLDPLFKEMGQAFLEEQQKLYG-TNHIYAADPFHE 303
Query: 117 NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
P + Y+ ++G I+ D +AV MQ W +A+ + P +
Sbjct: 304 GAAPSNEKSYLEAVGKVIWEVASGFDPEAVIAMQTWSLR----------EAITRTFPQDR 353
Query: 177 LVVLDLFAEVKPIWSTSK--QFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS-E 233
L++LDL W+ K F+ PY+ +LHN+ G + M G L A E + S +
Sbjct: 354 LLLLDLGG-----WNVEKFNSFWNYPYVAGVLHNYGGRVYMGGNLALYAKNAHELKQSPK 408
Query: 234 NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWN 293
+ G+G+ E IE NPVVY+L +E+ + + D++ WI Y+ RYG+ + W
Sbjct: 409 GGNIQGIGLFPEAIEHNPVVYELSTEITWMQDAPDLQKWITDYARARYGKLPAGAEQGWK 468
Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
VL TVY G V+ A P + ++ + +P
Sbjct: 469 VLLETVYGSKAGRLPSTESVMCARPALTIQKVAAN----GDLSRP--------------- 509
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
YST + A++ F+ + N+L S+TYRYDL+D+ RQ L+ + L I EAY
Sbjct: 510 -----YSTVRLWDAVDHFLQASNDLKKSDTYRYDLVDVMRQCLSDLSLPLQKQITEAYLA 564
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
D + Q +FL L++D D LL FLLG W++ A+Q EE++
Sbjct: 565 EDNEKLQQAGEQFLALIDDFDRLLGTRSTFLLGKWIKEARQWGTTEEEK 613
>gi|336371253|gb|EGN99592.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
lacrymans S7.3]
gi|336384013|gb|EGO25161.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 761
Score = 248 bits (632), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 147/473 (31%), Positives = 252/473 (53%), Gaps = 41/473 (8%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG LP+ W++ Q VLQK+IL R+ ELGM PVLP+F+G VP A+ ++P+A I
Sbjct: 206 FGNIQGSWGGDLPEQWINDQFVLQKQILARMVELGMTPVLPSFTGFVPRAMHTLYPNASI 265
Query: 60 TQLGNW--FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
W F+++ L+ DPLF + +F+ + YG SHIY D ++E
Sbjct: 266 VNGSQWSTFTIQH----TNDSFLEPFDPLFSTLQTSFMTKYAAAYGNVSHIYTLDQYNEM 321
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS-YDPFWRPPQMKALLNSVPLG- 175
P + Y+SS+ +A ++ +++ D +AVW+MQGWLF Y FW +++A L VP
Sbjct: 322 MPYSGNTSYLSSISSATFASLRATDPEAVWMMQGWLFYIYASFWTDERVEAYLGGVPGND 381
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+++LDLF+E P W ++G +IWC LH+F GN+ G +++ PV+A +
Sbjct: 382 SMIILDLFSEAYPQWQRLNSYFGKQWIWCELHDFGGNMGFEGNFENVTTQPVKALATPGN 441
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVP----AIQDA 291
TMVG+G++MEG E N ++YD++ + A+ ++ ++++ ++ RRY +VP A +A
Sbjct: 442 TMVGMGLTMEGQEGNEIMYDVLFDQAWSPTPINRTSYVSAWTSRRY--NVPNLPTAATEA 499
Query: 292 WNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSS 351
W +L TVYN D I ++++T + G P
Sbjct: 500 WEILASTVYNNQDPLLQATIKSIFELEPAINGLVNLTVLQ----GIPTG----------- 544
Query: 352 YDHPHLWYST-SEVIRALELFIASGNELSASNT---YRYDLIDLTRQALAKYANELFLNI 407
L+Y T + ++ AL+ + + E SA + ++YD++ + RQ LA +L+ ++
Sbjct: 545 -----LFYDTNTTIVPALQSLLQARQESSALDEVPEFQYDVVYIIRQLLANRFIDLYTSL 599
Query: 408 IEAYQLNDAHGVFQLSRR--FLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
++ Y + + + L++D+D +L FLL W+ +A+ A +
Sbjct: 600 VDTYNSTTSSSSDVSTAGAPLITLLKDVDSVLLTDTHFLLSNWISAARNWAHD 652
>gi|326437768|gb|EGD83338.1| lysosomal alpha-N-acetyl glucosaminidase [Salpingoeca sp. ATCC
50818]
Length = 820
Score = 248 bits (632), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 153/477 (32%), Positives = 233/477 (48%), Gaps = 54/477 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL W PL + W Q LQ KIL R ELGM LP F+G+VP A++ +FP A +T
Sbjct: 215 MGNLKYWAAPLDKDWRTSQYNLQLKILSRARELGMVSALPGFAGHVPTAIKRIFPHANLT 274
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q W + S + LL TDPLF+++G F + +K +G T H++ DT++E P
Sbjct: 275 QTAGWANFNS--TYSDVSLLQPTDPLFLQLGTKFYKMLIKAFG-TDHVFQMDTYNEMQPS 331
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ ++ +Y M + D +AV+LMQGWLF ++ +W P +K L+ VP K+++L
Sbjct: 332 FTNMTLLAESNRVVYQAMANADPEAVYLMQGWLF-HESYWTPEHVKVYLSGVPDDKMIIL 390
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL E P++S + ++G +IW ML N+ G +YG I+ P+ TM G+
Sbjct: 391 DLNTEANPVFSLTSDYFGKLWIWNMLLNYGGRRGLYGNATDISTRPLLDLHRAQGTMDGI 450
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ E IE NPV+++LM EM + D+ WI Y+ RYG+ Q AW +L VY
Sbjct: 451 GITPEAIENNPVMFELMLEMGWHATPPDMHDWIAAYASSRYGKRESLTQSAWQLLLEHVY 510
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ D +R + PD +S +E + N
Sbjct: 511 D----QPDIDRFHMEMVPD-----LSSSESRNSN-------------------------- 535
Query: 361 TSEVIRALELFI--ASGNELSASNTYRYDLIDLTRQA-----------LAKYANELFLNI 407
T+ +++A L + A L + + YDL+D+ RQA L + E NI
Sbjct: 536 TTALVQAWRLLVTAAVNGSLPITGPFSYDLVDVGRQALLNLWSDVRGMLVAHVKEYNANI 595
Query: 408 IEAYQLNDAH--GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ +H + L L++ D+D LL +LLG WLESAK A N ++
Sbjct: 596 DSSPSTAASHVPAIKSLFTLLLDITSDLDRLLGTDVNYLLGVWLESAKATAANADER 652
>gi|409042145|gb|EKM51629.1| glycoside hydrolase family 89 protein [Phanerochaete carnosa
HHB-10118-sp]
Length = 749
Score = 247 bits (631), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 143/464 (30%), Positives = 249/464 (53%), Gaps = 37/464 (7%)
Query: 6 GWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNW 65
WGG LP W+ Q LQK+IL R+ ELGM P+LPAF+G VP+ + +P+A I W
Sbjct: 201 AWGGLLPMQWISDQQALQKQILPRMLELGMTPILPAFTGFVPSNMSAHYPNASIIDGSAW 260
Query: 66 FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPE 125
S L+ DPL+ ++ ++FI +Q + YG +H Y D ++EN P +
Sbjct: 261 SGFPS--TLTNVSFLEPFDPLYPQMQQSFITKQQEAYGNITHFYTLDQYNENNPFSGNDS 318
Query: 126 YISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLG-KLVVLDLF 183
Y+SS+ + + +++ D +A W+MQGWLF S + FW +++A L +++LDL+
Sbjct: 319 YLSSVSTSTIASLRAADPEATWVMQGWLFFSSETFWTNDRIEAYLGGAQGNDSMLILDLY 378
Query: 184 AEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMS 243
+E +P W+ + ++G ++WC LH++ GN+ + G L +I GP+ A S ++MVG+G++
Sbjct: 379 SEAQPQWNRTDSYFGKQWVWCELHDYGGNMGLEGNLAAITEGPIAALNSNGSSMVGMGLT 438
Query: 244 MEGIE-QNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVPA-IQDAWNVLYHTVY 300
MEG+E N +VYD++ + A+ ++V W+ +++ RRY +++P +Q AW +L T+Y
Sbjct: 439 MEGMEIGNEIVYDILLDQAWSSTPLNVSDWVAKWAARRYLVKTLPTELQQAWTILSTTIY 498
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP-HLWY 359
N D + I+ +++VT HP + Y
Sbjct: 499 NNQDPNSQATIKSILELEPATTGLVNVTG-----------------------HHPTEIPY 535
Query: 360 ST-SEVIRALELFI-ASGNELSASNT--YRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
T + ++ AL+LF+ AS ++ S + D+++L+RQ + +L+ ++I + +
Sbjct: 536 DTNTTILHALQLFVNASKSQPSLKQVPEFAVDILELSRQLMVNRFIDLYTDLINTWNSSS 595
Query: 416 --AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 457
A V L L+ D+D LL ++ +L W+ AKQ A
Sbjct: 596 STAQNVTTAGVPLLSLISDLDVLLYTNENYLFSTWIADAKQWAH 639
>gi|404487028|ref|ZP_11022215.1| hypothetical protein HMPREF9448_02671 [Barnesiella intestinihominis
YIT 11860]
gi|404335524|gb|EJZ61993.1| hypothetical protein HMPREF9448_02671 [Barnesiella intestinihominis
YIT 11860]
Length = 726
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 148/461 (32%), Positives = 230/461 (49%), Gaps = 38/461 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ GW GPLP WL+ Q LQKKIL R EL M PVLPAF+G+VPAAL+ + P A I
Sbjct: 196 MANIDGWNGPLPMQWLESQAELQKKILARERELNMTPVLPAFAGHVPAALKRIHPDANIQ 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R + L+ +PLF EI ++F+E+Q K +G T HIY D F+E PP
Sbjct: 256 YLGKWAGFGDSYR---CHFLNPEEPLFAEIQKSFLEEQEKMFG-TDHIYGVDPFNEVDPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PEY++ + + +Y + + D DAVWL W+F +D W P++KALL VP KLV+
Sbjct: 312 SWEPEYLAQVSSDMYKSLAAADPDAVWLQMTWMFYHDRKLWTAPRVKALLTGVPSDKLVL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++++F+G PYIWC L NF GN + G + A + + G
Sbjct: 372 LDYHCENVELWKSTEKFHGQPYIWCYLGNFGGNTTLTGNVKESGDRLDNALINGGDNLKG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ N Y+ + E A+ + V+ + W+ + + R G + ++AW +L+
Sbjct: 432 IGSTLEGLDINQFPYEYIFEKAWTID-VNGQDWVERLADRHVGAVSESAREAWQILFD-- 488
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
DV V P + NY K S E Y
Sbjct: 489 ------------DVFVQVPRTLGILPGYRPKLGDNYNKRTSNE----------------Y 520
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ ++R EL + + + + D+I RQ L Y ++ Y+ + G+
Sbjct: 521 DNATLLRVWELLLEVPS--CDRDAFEIDVIMTGRQLLGNYFLDVKKEFDGFYKKRNVPGL 578
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
+ + E++ D++ L + H+ L W+E A+ L +E
Sbjct: 579 KEKASEMREILSDLELLNSFHNRASLDKWIEDARSLGDTDE 619
>gi|317158657|ref|XP_001827155.2| alpha-N-acetylglucosaminidase [Aspergillus oryzae RIB40]
Length = 849
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 152/479 (31%), Positives = 250/479 (52%), Gaps = 44/479 (9%)
Query: 1 MSNLHG-WGG-PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
+ N+ G WGG + +W++ Q LQKKI+ RI ELGM PVLPAF G VP A++ V P A
Sbjct: 157 LGNIQGSWGGHGVSIAWIEAQFELQKKIVSRIVELGMTPVLPAFPGFVPPAIKRVRPHAT 216
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ W + ++ L+ D F ++ ++ I +Q + +G +H+Y D F+E
Sbjct: 217 VVNGSQWSGFQK--KFTEVSFLNPLDETFAQLQKSVISRQTRAFGNVTHVYALDQFNEIN 274
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP--FWRPPQMKALLNSVPLGK 176
P Y+ +L + +++ + AVW+MQGWLF YD FW P ++ A L+ V
Sbjct: 275 PASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YDKKDFWDPNRISAYLSGVERND 333
Query: 177 -LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+++LDL++E KP W ++ ++G P+IWC LH+F GN+ MYG + +I P+EA +++
Sbjct: 334 DMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMGMYGQIMNITSDPIEA-LNKSD 392
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR--SVP-AIQDAW 292
++VG G++MEG E N +VYDL+ + A+ + +D +A+ + RY SVP + AW
Sbjct: 393 SLVGFGLTMEGQEGNEIVYDLLLDQAWSAKPIDTRAYFQSWVRSRYSGNFSVPNELYTAW 452
Query: 293 NVLYHTVYNCTDGAT-DKNRDVIVAFPDVDPSIISVTEGKYQ-----NYGKPVSKEAVLK 346
++L TVYN T+ T + + PD+ + V G Y NY V E
Sbjct: 453 DLLRKTVYNNTNLTTYSLTKSIFEISPDIAGLVGRV--GHYPTPTSINYDPMVLNEVWSL 510
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
++ P LW+S + Y YD++D+TRQ + ++ +
Sbjct: 511 FMNATRKEPSLWHSPA---------------------YEYDMVDITRQLMGNAFVNVYSD 549
Query: 407 IIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+I +++ N V S R L L+ +D +L+C++ F L W+ SA+ E +
Sbjct: 550 LISSWKSETENRTTNVTSQSERLLNLLSAIDKVLSCNENFSLTTWISSARDWGNTTETK 608
>gi|237717696|ref|ZP_04548177.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|229453015|gb|EEO58806.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
Length = 729
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 144/459 (31%), Positives = 234/459 (50%), Gaps = 35/459 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GW PLP+ WL Q LQ++I+ R E M PVLPAF+G+VPAAL+ V+P+ K T
Sbjct: 196 MCNLDGWQSPLPKEWLSSQAALQEQIVAREREFNMRPVLPAFAGHVPAALKRVYPNIKTT 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R CT+L + D L+ I + ++ +Q + YG T+HIY D F+E PP
Sbjct: 256 RVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTEQTRLYG-TNHIYGIDPFNEIDPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ + + IY + + D +AVWL WLF D W P++K+ L SVP +L++
Sbjct: 312 SWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYADIKHWTTPRIKSYLRSVPQDRLIL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD F E IW + ++G PY+WC L NF GN + G ++ ++ +A + + + G
Sbjct: 372 LDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLSGPVNLVSERLADALKNGGSNLKG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG ++EGI+ N +Y+ + + A+ + D K W + + RR G+ P + AW +L + V
Sbjct: 432 VGSTLEGIDLNQFMYEFVLDKAWNGGQTD-KEWFFKLADRRIGKISPEARKAWEILANKV 490
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y V P+ V +G N +P LK P + Y
Sbjct: 491 Y-------------------VQPA--QVGQGTLTN-ARP-----CLKGNGHWTTKPTIEY 523
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+++ A L ++ + ++Y +DL+++ RQ L Y N + AY+ D +
Sbjct: 524 QPKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAGDIPMM 581
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
+ E++ D+D L++CH F L W+ A+ + +
Sbjct: 582 KNRGNKMREILADLDKLVSCHPTFSLHKWITDARDMGHD 620
>gi|83775903|dbj|BAE66022.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 633
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 152/479 (31%), Positives = 250/479 (52%), Gaps = 44/479 (9%)
Query: 1 MSNLHG-WGG-PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
+ N+ G WGG + +W++ Q LQKKI+ RI ELGM PVLPAF G VP A++ V P A
Sbjct: 73 LGNIQGSWGGHGVSIAWIEAQFELQKKIVSRIVELGMTPVLPAFPGFVPPAIKRVRPHAT 132
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ W + ++ L+ D F ++ ++ I +Q + +G +H+Y D F+E
Sbjct: 133 VVNGSQWSGFQK--KFTEVSFLNPLDETFAQLQKSVISRQTRAFGNVTHVYALDQFNEIN 190
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP--FWRPPQMKALLNSVPLGK 176
P Y+ +L + +++ + AVW+MQGWLF YD FW P ++ A L+ V
Sbjct: 191 PASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YDKKDFWDPNRISAYLSGVERND 249
Query: 177 -LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+++LDL++E KP W ++ ++G P+IWC LH+F GN+ MYG + +I P+EA +++
Sbjct: 250 DMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMGMYGQIMNITSDPIEA-LNKSD 308
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR--SVP-AIQDAW 292
++VG G++MEG E N +VYDL+ + A+ + +D +A+ + RY SVP + AW
Sbjct: 309 SLVGFGLTMEGQEGNEIVYDLLLDQAWSAKPIDTRAYFQSWVRSRYSGNFSVPNELYTAW 368
Query: 293 NVLYHTVYNCTDGAT-DKNRDVIVAFPDVDPSIISVTEGKYQ-----NYGKPVSKEAVLK 346
++L TVYN T+ T + + PD+ + V G Y NY V E
Sbjct: 369 DLLRKTVYNNTNLTTYSLTKSIFEISPDIAGLVGRV--GHYPTPTSINYDPMVLNEVWSL 426
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
++ P LW+S + Y YD++D+TRQ + ++ +
Sbjct: 427 FMNATRKEPSLWHSPA---------------------YEYDMVDITRQLMGNAFVNVYSD 465
Query: 407 IIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+I +++ N V S R L L+ +D +L+C++ F L W+ SA+ E +
Sbjct: 466 LISSWKSETENRTTNVTSQSERLLNLLSAIDKVLSCNENFSLTTWISSARDWGNTTETK 524
>gi|299149196|ref|ZP_07042257.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
gi|298512863|gb|EFI36751.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
Length = 738
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 144/459 (31%), Positives = 234/459 (50%), Gaps = 35/459 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GW PLP+ WL Q LQ++I+ R E M PVLPAF+G+VPAAL+ V+P+ K T
Sbjct: 205 MCNLDGWQSPLPKEWLSSQAALQEQIVAREREFNMRPVLPAFAGHVPAALKRVYPNIKTT 264
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R CT+L + D L+ I + ++ +Q + YG T+HIY D F+E PP
Sbjct: 265 RVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTEQTRLYG-TNHIYGIDPFNEIDPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ + + IY + + D +AVWL WLF D W P++K+ L SVP +L++
Sbjct: 321 SWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYADIKHWTTPRIKSYLRSVPQDRLIL 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD F E IW + ++G PY+WC L NF GN + G ++ ++ +A + + + G
Sbjct: 381 LDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLSGPVNLVSERLADALKNGGSNLKG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG ++EGI+ N +Y+ + + A+ + D K W + + RR G+ P + AW +L + V
Sbjct: 441 VGSTLEGIDLNQFMYEFVLDKAWNGGQTD-KEWFFKLADRRIGKISPEARKAWEILANKV 499
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y V P+ V +G N +P LK P + Y
Sbjct: 500 Y-------------------VQPA--QVGQGTLTN-ARP-----CLKGNGHWTTKPTIEY 532
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+++ A L ++ + ++Y +DL+++ RQ L Y N + AY+ D +
Sbjct: 533 QPKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAGDIPMM 590
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
+ E++ D+D L++CH F L W+ A+ + +
Sbjct: 591 KNRGNKMREILADLDKLVSCHPTFSLHKWITDARDMGHD 629
>gi|294647264|ref|ZP_06724861.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
gi|292637401|gb|EFF55822.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
Length = 733
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 153/472 (32%), Positives = 237/472 (50%), Gaps = 39/472 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK IL R M P+LPAF+G+VPA L+ ++P AKI
Sbjct: 196 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I R F+E+Q K YG T+HIY D F+E P
Sbjct: 256 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY +Q DS A WL W+F + W P++K+ LN+VP KL++
Sbjct: 312 NWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++Q+YG PYIWC L NF GN + G L+ + F + G
Sbjct: 372 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G+++EG++ NP++Y+ + E A+Q+ + V WI ++ R G I AW LY +
Sbjct: 432 LGVTLEGLDVNPLMYEFVFERAWQN-SMPVHQWIANWAQCRGGNVDNHIVKAWKQLYEKI 490
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T A ++ A P ++ T Y Y + LW
Sbjct: 491 Y--TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWE 530
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+++A ++ + Y +D+I++ RQ L + + Y+ D G
Sbjct: 531 IWKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKDLEGT 581
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK----QVRC 467
+R +L+ D+D LL C F +G W++ A+ A NE+++K RC
Sbjct: 582 KVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFAVNEQEQKYYEENARC 633
>gi|345511813|ref|ZP_08791352.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
gi|229443748|gb|EEO49539.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
Length = 720
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 153/472 (32%), Positives = 237/472 (50%), Gaps = 39/472 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK IL R M P+LPAF+G+VPA L+ ++P AKI
Sbjct: 183 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 242
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I R F+E+Q K YG T+HIY D F+E P
Sbjct: 243 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 298
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY +Q DS A WL W+F + W P++K+ LN+VP KL++
Sbjct: 299 NWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 358
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++Q+YG PYIWC L NF GN + G L+ + F + G
Sbjct: 359 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 418
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G+++EG++ NP++Y+ + E A+Q+ + V WI ++ R G I AW LY +
Sbjct: 419 LGVTLEGLDVNPLMYEFVFERAWQN-SMPVHQWIANWAQCRGGNVDNHIVKAWKQLYEKI 477
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T A ++ A P ++ T Y Y + LW
Sbjct: 478 Y--TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWE 517
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+++A ++ + Y +D+I++ RQ L + + Y+ D G
Sbjct: 518 IWKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKDLEGT 568
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK----QVRC 467
+R +L+ D+D LL C F +G W++ A+ A NE+++K RC
Sbjct: 569 KVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFAVNEQEQKYYEENARC 620
>gi|262407713|ref|ZP_06084261.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|262354521|gb|EEZ03613.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
Length = 735
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 153/472 (32%), Positives = 237/472 (50%), Gaps = 39/472 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK IL R M P+LPAF+G+VPA L+ ++P AKI
Sbjct: 198 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I R F+E+Q K YG T+HIY D F+E P
Sbjct: 258 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY +Q DS A WL W+F + W P++K+ LN+VP KL++
Sbjct: 314 NWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++Q+YG PYIWC L NF GN + G L+ + F + G
Sbjct: 374 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 433
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G+++EG++ NP++Y+ + E A+Q+ + V WI ++ R G I AW LY +
Sbjct: 434 LGVTLEGLDVNPLMYEFVFERAWQN-SMPVHQWIANWAQCRGGNVDNHIVKAWKQLYEKI 492
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T A ++ A P ++ T Y Y + LW
Sbjct: 493 Y--TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWE 532
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+++A ++ + Y +D+I++ RQ L + + Y+ D G
Sbjct: 533 IWKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKDLEGT 583
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK----QVRC 467
+R +L+ D+D LL C F +G W++ A+ A NE+++K RC
Sbjct: 584 KVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFAVNEQEQKYYEENARC 635
>gi|294807833|ref|ZP_06766618.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
gi|294444952|gb|EFG13634.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
Length = 703
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 153/472 (32%), Positives = 237/472 (50%), Gaps = 39/472 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK IL R M P+LPAF+G+VPA L+ ++P AKI
Sbjct: 166 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 225
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I R F+E+Q K YG T+HIY D F+E P
Sbjct: 226 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 281
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY +Q DS A WL W+F + W P++K+ LN+VP KL++
Sbjct: 282 NWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 341
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++Q+YG PYIWC L NF GN + G L+ + F + G
Sbjct: 342 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 401
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G+++EG++ NP++Y+ + E A+Q+ + V WI ++ R G I AW LY +
Sbjct: 402 LGVTLEGLDVNPLMYEFVFERAWQN-SMPVHQWIANWAQCRGGNVDNHIVKAWKQLYEKI 460
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T A ++ A P ++ T Y Y + LW
Sbjct: 461 Y--TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWE 500
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+++A ++ + Y +D+I++ RQ L + + Y+ D G
Sbjct: 501 IWKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKDLEGT 551
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK----QVRC 467
+R +L+ D+D LL C F +G W++ A+ A NE+++K RC
Sbjct: 552 KVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFAVNEQEQKYYEENARC 603
>gi|391873368|gb|EIT82411.1| alpha-N-acetylglucosaminidase [Aspergillus oryzae 3.042]
Length = 633
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 151/481 (31%), Positives = 247/481 (51%), Gaps = 48/481 (9%)
Query: 1 MSNLHG-WGG-PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
+ N+ G WGG + +W++ Q LQKKI+ RI ELGM PVLPAF G VP A++ V P A
Sbjct: 73 LGNIQGSWGGHGVSIAWIEAQFELQKKIVSRIVELGMTPVLPAFPGFVPPAIKRVRPHAT 132
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ W + ++ L D F ++ ++ I +Q++ +G +H+Y D F+E
Sbjct: 133 VVNGSQWSGFQK--KFTEVSFLSPLDRTFADLQKSVISRQMRAFGNITHVYALDQFNEIN 190
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP--FWRPPQMKALLNSVPLGK 176
P Y+ +L + +++ + AVW+MQGWLF YD FW ++ A L+ V
Sbjct: 191 PASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YDKKDFWDSNRISAYLSGVERND 249
Query: 177 -LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+++LDL++E KP W ++ ++G P+IWC LH+F GN+ MYG + +I P+EA N
Sbjct: 250 DMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMGMYGQIMNITSDPIEALNKSN- 308
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR--SVP-AIQDAW 292
++VG G++MEG E N +VYDL+ + A+ +D +A+ + RY R SVP + AW
Sbjct: 309 SLVGFGLTMEGQEGNEIVYDLLLDQAWSATPIDTRAYFQSWVRSRYSRNFSVPNELYTAW 368
Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS--------KEAV 344
++L TVYN T+ T V + ++ P I + G+ +Y P S E
Sbjct: 369 DLLRKTVYNNTNLTT---YSVTKSIFEISPDIAGLV-GRVGHYPTPTSINYDPMVLNEVW 424
Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
++ P LW++ + Y YD++D+TRQ + ++
Sbjct: 425 SLFMNATRKEPSLWHNPA---------------------YEYDMVDITRQLMGNAFVNVY 463
Query: 405 LNIIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
+I +++ N V S R L L+ +D +L+C++ F L W+ SA+ E
Sbjct: 464 SVLITSWKSETENRTTKVTSQSERLLNLLSAIDKVLSCNENFSLATWISSARDWGNTTET 523
Query: 462 E 462
+
Sbjct: 524 K 524
>gi|423722278|ref|ZP_17696454.1| hypothetical protein HMPREF1078_00517 [Parabacteroides merdae
CL09T00C40]
gi|409242419|gb|EKN35181.1| hypothetical protein HMPREF1078_00517 [Parabacteroides merdae
CL09T00C40]
Length = 752
Score = 245 bits (626), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 154/465 (33%), Positives = 220/465 (47%), Gaps = 51/465 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+SW+D+ +VL K+I+ R ELGM P+ FSG VP L+ +P AKI
Sbjct: 192 MQNLQSYGGPLPKSWIDKHIVLGKQIIDRELELGMQPIQQGFSGYVPRELKEKYPDAKI- 250
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
+ P WC LD TD LF IGR F+E++ K YG +Y D F E+
Sbjct: 251 --------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEEEKKLYG-AHGVYAADPFHES 301
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
PPVD+PEY+ ++G AI+ D +++W MQ W R P +KA VP L
Sbjct: 302 QPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSL------REPIVKA----VPKENL 351
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL +G P + LHNF G I ++G L +A +N +
Sbjct: 352 LILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHGDLRLLASNQYVNAVKKNPNV 408
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G G+ ME IEQNPV YDL EM ++V+++ W+ +Y+ RRYG+ AW L
Sbjct: 409 CGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYADRRYGKPSENAHQAWLHLLE 468
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
Y T+ R I+A P++ G G P
Sbjct: 469 GPYRPGTNGTE--RSSIIA---ARPAVNVKKSGPNAGLGIP------------------- 504
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
YS V++A L + L S+ YR+D++D+ RQ ++ + EA++ D
Sbjct: 505 -YSPLSVVQAEGLLLKDAGRLKGSDPYRFDIVDIQRQLMSNLGQAIHKQAAEAFRKKDKE 563
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
S RFLE++ D D LL F WL A+ N E++
Sbjct: 564 AFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGDNSEEK 608
>gi|154492110|ref|ZP_02031736.1| hypothetical protein PARMER_01741 [Parabacteroides merdae ATCC
43184]
gi|154087335|gb|EDN86380.1| Alpha-N-acetylglucosaminidase (NAGLU) [Parabacteroides merdae ATCC
43184]
Length = 752
Score = 244 bits (624), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 150/465 (32%), Positives = 220/465 (47%), Gaps = 51/465 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+SW+D+ +VL K+I+ R ELGM P+ FSG VP L+ +P AKI
Sbjct: 192 MQNLQSYGGPLPKSWIDKHIVLGKQIIDRELELGMQPIQQGFSGYVPRELKEKYPDAKI- 250
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
+ P WC LD TD LF IGR F+E++ K YG +Y D F E+
Sbjct: 251 --------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEEEKKLYG-AHGVYAADPFHES 301
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
PPVD+PEY+ ++G AI+ D +++W MQ W ++++ +VP L
Sbjct: 302 QPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSLR----------ESIVKAVPKENL 351
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL +G P + LHNF G I ++G L +A +N +
Sbjct: 352 LILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHGDLRLLASNQYVNAVKKNPNV 408
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G G+ ME IEQNPV YDL EM ++V+++ W+ +Y+ RRYG+ AW L
Sbjct: 409 CGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYADRRYGKPSENAHQAWLHLLE 468
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
Y T+ R I+A P++ G G P
Sbjct: 469 GPYRPGTNGTE--RSSIIA---ARPAVNVKKSGPNAGLGIP------------------- 504
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
YS V++A L + L S+ YR+D++D+ RQ ++ + EA++ D
Sbjct: 505 -YSPLSVVQAEGLLLKDAGRLKGSDPYRFDIVDIQRQLMSNLGQAIHKQAAEAFRKKDKE 563
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
S RFLE++ D D LL F WL A+ N E++
Sbjct: 564 AFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGDNSEEK 608
>gi|424665881|ref|ZP_18102917.1| hypothetical protein HMPREF1205_01756 [Bacteroides fragilis HMW
616]
gi|404574134|gb|EKA78885.1| hypothetical protein HMPREF1205_01756 [Bacteroides fragilis HMW
616]
Length = 732
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 157/474 (33%), Positives = 236/474 (49%), Gaps = 54/474 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGP+ Q ++D+Q LQKK++ R+ E GM PVL F G VP ++ FP+A I
Sbjct: 198 MGNLEKFGGPVSQQFIDRQTQLQKKMIDRMREYGMEPVLQGFYGMVPNSMITKFPNADIR 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W + + L +DPLF ++ + F E+Q K +G+ S Y D F E N+
Sbjct: 258 DAGKWITYQRPA------FLVPSDPLFAKVAQIFYEEQEKLFGK-SRYYGGDPFHEGGNS 310
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
++ I+ + IY M++ + DA+W++QGW +P + ALL + G+ +
Sbjct: 311 EGIN----ITEAASDIYKAMKANNPDAIWVLQGW--GANPSY------ALLKGLKQGEAL 358
Query: 179 VLDLFAEVKPIW-----STSKQFYGV---PYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
+LDL + +P W S S + G +IWC L NF G I MYG L S A G + A
Sbjct: 359 ILDLMSCARPQWGGDPSSQSHREDGYLDHNWIWCALPNFGGRIGMYGKLQSYATGVIRAE 418
Query: 231 TSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQ 289
V GVG + EGI NP+ YD++ +MA++ + +DVK+WI Y+ RYG +
Sbjct: 419 HHPKGKYVCGVGTTPEGIGTNPIDYDMVYDMAWRTDSIDVKSWIANYTTYRYGSPNNNAK 478
Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 349
A L +VYNC A A P + T
Sbjct: 479 AAMQQLSTSVYNCPWAADGPQESYFCARPSLKID------------------------RT 514
Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
SS+ HL+Y V++ALE + + NEL +TYRYD++D+TRQ LA Y + I +
Sbjct: 515 SSWGTAHLYYQPINVLQALEHLLKAENELKEIDTYRYDVVDVTRQMLADYGKYIHKCIAD 574
Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
AY D + +FL+++ D D LL+ FLLG ++ A N +++
Sbjct: 575 AYYGKDTEKFDFYTSKFLQMISDQDLLLSTRKEFLLGKFIRQADACGSNPMEKR 628
>gi|224025137|ref|ZP_03643503.1| hypothetical protein BACCOPRO_01871 [Bacteroides coprophilus DSM
18228]
gi|224018373|gb|EEF76371.1| hypothetical protein BACCOPRO_01871 [Bacteroides coprophilus DSM
18228]
Length = 718
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 158/475 (33%), Positives = 247/475 (52%), Gaps = 58/475 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW ++Q LQK+IL R+ E G+ PVLP +SG VP ++ +
Sbjct: 187 MNNLEGWGGPNPDSWYERQEELQKRILKRMREYGIEPVLPGYSGMVPHNAKDRL-GLNVA 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W PR L TDP F I + + + YG+ S+ Y+ D F E NT
Sbjct: 246 DPGRW---NGYPR---PAFLQPTDPQFERIAALYYREMTRLYGKVSY-YSMDPFHEGGNT 298
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD + + G AI+ M+ + A W++Q W +P PQM + ++P G +V
Sbjct: 299 SGVD----LEAAGKAIWKAMKQANPRAAWVVQAW--GANP---RPQM---IRNLPAGDMV 346
Query: 179 VLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
VLDLF+E +P W K+ +G +++CML N+ GN+ ++G + + +A+
Sbjct: 347 VLDLFSESRPQWGDPASSWYRKEGFGQHDWLFCMLLNYGGNVGLHGKMAHLIEEFYKAKD 406
Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
S T+ GVGM+MEGIE NPV+Y+L+ E+ ++ ++ W+ Y RYG+S +
Sbjct: 407 SSFGKTLKGVGMTMEGIENNPVMYELLCELPWREQRFSKDEWLEGYLKARYGKSDSQVSQ 466
Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
AW +L +T+YNC +T + + ++ A P +S SE
Sbjct: 467 AWMLLSNTIYNCPAASTQQGTHESILCARPSWKAYQVSSW------------------SE 508
Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
S Y Y ++VIRA + + + +N + YDL+D+ RQA+A+ ++ ++
Sbjct: 509 MSDY------YDPADVIRAAGMMVDAAERFRGNNNFEYDLVDIVRQAVAEKGRLMYRVLV 562
Query: 409 EAYQLNDAHGVFQLSR-RFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+AY+ D +F+LS RFL L+ D LLA F +G WLESA+ L EE++
Sbjct: 563 DAYKAGDRE-LFKLSSDRFLRLILMQDRLLATRSEFKVGRWLESARNLGSTEEEK 616
>gi|410095990|ref|ZP_11290981.1| hypothetical protein HMPREF1076_00159 [Parabacteroides goldsteinii
CL02T12C30]
gi|409227396|gb|EKN20294.1| hypothetical protein HMPREF1076_00159 [Parabacteroides goldsteinii
CL02T12C30]
Length = 753
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 156/465 (33%), Positives = 221/465 (47%), Gaps = 51/465 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+SW+D L KK++ R ELGM P+ FSG VP L+N +P AKI
Sbjct: 193 MQNLQSYGGPLPKSWIDSHAELGKKVINRQLELGMQPIQQGFSGYVPRELKNKYPDAKI- 251
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
+ P WC LD TD LF GR F+E++ K +G +Y D F E+
Sbjct: 252 --------QLQPSWCGFTGAAQLDPTDSLFSAFGRDFLEEEKKLFG-AHGVYAADPFHES 302
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
PP+D+PEY+S++G +IY Q D A+W MQ W R P +KA VP L
Sbjct: 303 RPPIDTPEYLSAVGNSIYKLFQDFDPSAIWAMQAWSL------REPIVKA----VPKEHL 352
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL +T +G P + LHNF G I ++G L +A ++ +
Sbjct: 353 LILDLNGGRSRQENTC---WGYPVVAGNLHNFGGRINLHGDLRLLASNQYAVAKQKSPNV 409
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G G+ ME IEQNPV YDL EM ++VD++ W+ Y+ RRYG + AW L
Sbjct: 410 CGSGLFMESIEQNPVYYDLAFEMPLHADEVDIEEWLGDYAERRYGAASENAHKAWLHLLE 469
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
Y T+ R I+A P++ G G P
Sbjct: 470 GPYRPGTNGTE--RSSIIA---ARPALNVKKSGPNAGLGIP------------------- 505
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
YS VI+A L + ++L+AS YR+D++D+ RQ ++ + EA+ D
Sbjct: 506 -YSPLLVIQAQGLLLKDADKLNASTPYRFDVVDIQRQLMSNLGQAIHKKAAEAFVKKDKA 564
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
S RFLE++ D+D LL F WL A+ E++
Sbjct: 565 AFTLHSNRFLEMLRDVDVLLRTRPEFNFDKWLTDARSWGTTNEEK 609
>gi|404406328|ref|ZP_10997912.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
Length = 738
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 146/463 (31%), Positives = 219/463 (47%), Gaps = 35/463 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSNL W PLPQSWLD Q+ LQK+I+ R EL M PVLPAF+G+VPA L ++P AKI+
Sbjct: 206 MSNLDYWQSPLPQSWLDAQVELQKRIVARERELNMKPVLPAFAGHVPAELGEIYPEAKIS 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W + R ++ LD DPLF I R F+ +Q +G T HIY D F+E PP
Sbjct: 266 RMSKWGGFEDRYR---SHFLDPLDPLFARIQREFLAEQTALFG-TDHIYGADPFNEVDPP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PE+++ + IY M D +A WL WLF D W +++A + +VP K+++
Sbjct: 322 SWEPEFLARVSRTIYDTMTEADPEAEWLQMTWLFYLDRDKWHDDRIEAFVTAVPQDKMLL 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + E +W + ++G PY WC L NF GN + G D ++ + G
Sbjct: 382 LDYYCENTEVWRQTHSYHGQPYFWCYLGNFGGNTMLVGNFDEVSKRIDGVLAEGGNNLRG 441
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ NP +YD + E A+ VD W + + R G + AW+ L V
Sbjct: 442 LGSTLEGLDSNPFMYDYVFERAWDF-PVDDDRWFDALADRYLGYEDTGYRRAWDALRKNV 500
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y I S G + E +L T + + Y
Sbjct: 501 Y-----------------------ITSSKYGHCPLLNARPTLEGILTGTTDA----EIKY 533
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+ I +G+ S +TYRY L+++ RQ L L A + D +
Sbjct: 534 DNDELFEVWAKMIDAGD--SGRDTYRYWLVNVGRQTLGNLFLPLRDGFTAACRAKDLARM 591
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+L LEL D++ L A H F + W++ ++ E+
Sbjct: 592 KELRSEMLELAADLETLTAQHGAFSMQKWIDDSRSFGTTPEER 634
>gi|224026593|ref|ZP_03644959.1| hypothetical protein BACCOPRO_03350 [Bacteroides coprophilus DSM
18228]
gi|224019829|gb|EEF77827.1| hypothetical protein BACCOPRO_03350 [Bacteroides coprophilus DSM
18228]
Length = 635
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 156/464 (33%), Positives = 225/464 (48%), Gaps = 45/464 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+SW+D+ +VL ++I+ R ELGM P+ FSG VP L+ +P AKI
Sbjct: 196 MQNLQSYGGPLPKSWIDKHVVLGQQIIRRELELGMKPIQQGFSGYVPRELKEKYPEAKIQ 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+W K + LD TD LF IGR F+E++ K +G +Y D F E+ PP
Sbjct: 256 PQPSWCGFKGAAQ------LDPTDSLFQVIGRDFLEEEKKLFG-AHGVYAADPFHESRPP 308
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD+PEY+S++G +I++ Q D ++W MQ W R P +KA VP L++L
Sbjct: 309 VDTPEYLSAVGRSIHTLFQEFDPYSLWAMQAWSL------REPIVKA----VPEEHLLIL 358
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL + +G P + LHNF G I M+G L +A EA S + + G
Sbjct: 359 DLNGSKC---TQRNACWGYPVVAGNLHNFGGRINMHGDLPLLAGNQYEAAVSLSPNVCGS 415
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G+ MEGIEQNP+ Y+L EM Q KV++ W+ +Y++RRYG A +L Y
Sbjct: 416 GLFMEGIEQNPLYYELAFEMPLQKGKVELDGWLKEYALRRYGSKWENTHKALLLLLEGPY 475
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
T+ + +I A P++ G G P YS
Sbjct: 476 RPGTNGTELS-SIIAA----RPALHVKKSGPNAGLGIP--------------------YS 510
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+I A + L S YR+D++DL RQ + + +A++ D G
Sbjct: 511 PWLLIEAQAFMLKDAGILKTSEAYRFDIMDLQRQIMTNLGQAIHKEAAKAFEAGDEKGFE 570
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
SRR+LEL+ D+D LL F WL A+ EE++ Q
Sbjct: 571 LHSRRYLELLTDVDTLLRTRPEFNFDRWLADARSWGDTEEEKNQ 614
>gi|423345423|ref|ZP_17323112.1| hypothetical protein HMPREF1060_00784 [Parabacteroides merdae
CL03T12C32]
gi|409223209|gb|EKN16146.1| hypothetical protein HMPREF1060_00784 [Parabacteroides merdae
CL03T12C32]
Length = 752
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 153/465 (32%), Positives = 220/465 (47%), Gaps = 51/465 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+SW+D+ +VL K+I+ R ELGM P+ FSG VP L+ +P AKI
Sbjct: 192 MQNLQSYGGPLPKSWIDKHIVLGKQIIDRELELGMQPIQQGFSGYVPRELKEKYPDAKI- 250
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
+ P WC LD TD LF IGR F+E++ K YG +Y D F E+
Sbjct: 251 --------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEEEKKLYG-AHGVYAADPFHES 301
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
PPVD+PEY+ ++G AI+ D +++W MQ W R P +KA VP L
Sbjct: 302 QPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSL------REPIVKA----VPKENL 351
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL +G P + LHNF G I ++G L +A +N +
Sbjct: 352 LILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHGDLRLLASNQYVNAVKKNPNV 408
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G G+ ME IEQNPV YDL EM ++V+++ W+ +Y+ RRYG+ AW L
Sbjct: 409 CGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYADRRYGKPSENAHQAWLHLLE 468
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
Y T+ R I+A P++ G G P
Sbjct: 469 GPYRPGTNGTE--RSSIIA---ARPAVNVKKSGPNAGLGIP------------------- 504
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
YS V++A L + L S+ YR+D++D+ RQ ++ + +A++ D
Sbjct: 505 -YSPLSVVQAEGLLLKDAARLEDSDPYRFDIVDIQRQLMSNLGQVIHKQAAKAFRKKDKE 563
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
S RFLE++ D D LL F WL A+ N E++
Sbjct: 564 AFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGDNSEEK 608
>gi|336417192|ref|ZP_08597519.1| hypothetical protein HMPREF1017_04627 [Bacteroides ovatus
3_8_47FAA]
gi|423297818|ref|ZP_17275878.1| hypothetical protein HMPREF1070_04543 [Bacteroides ovatus
CL03T12C18]
gi|335936512|gb|EGM98438.1| hypothetical protein HMPREF1017_04627 [Bacteroides ovatus
3_8_47FAA]
gi|392664455|gb|EIY57993.1| hypothetical protein HMPREF1070_04543 [Bacteroides ovatus
CL03T12C18]
Length = 727
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 142/459 (30%), Positives = 230/459 (50%), Gaps = 35/459 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GW PLP+ WL Q LQ++I+ R E M PVLPAF+G+VPAAL+ V+P+ K +
Sbjct: 194 MCNLDGWQSPLPKEWLSSQAELQEQIVAREREFNMQPVLPAFAGHVPAALKRVYPNIKTS 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R CT+L + D L+ I + ++ +Q + YG T+HIY D F+E PP
Sbjct: 254 RVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTEQTRLYG-TNHIYGIDPFNEIDPP 309
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ + + IY + + D A+WL WLF D W P++K+ L SVP KL++
Sbjct: 310 SWDTDSLGMMAKHIYESVAAVDPKAIWLQMTWLFYADIKHWTTPRIKSYLRSVPQDKLIL 369
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD F E IW + ++G PY+WC L NF GN + G + ++ +A + + + G
Sbjct: 370 LDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLSGPVKLVSERLADALKNGGSNLKG 429
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG ++EGI+ N +Y+ + + A+ + D K W + + RR G+ P + AW +L V
Sbjct: 430 VGSTLEGIDLNQFMYEFVLDKAWNSGQTD-KEWFLKLADRRTGKVSPEARKAWEILADKV 488
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y + P+ V +G N +P LK P + Y
Sbjct: 489 Y-------------------IQPA--QVGQGTLTN-ARP-----CLKGNGHWTTKPTIEY 521
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+++ A L + + ++Y +DL+++ RQ L Y N + AY+ D +
Sbjct: 522 QPKDLVEAWRLLLLVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAGDIMMM 579
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
+ E++ D+D L++CH F L W+ A+ + +
Sbjct: 580 KNRGDKMREILADLDKLVSCHPTFSLNKWITDARDMGHD 618
>gi|282877910|ref|ZP_06286719.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
35310]
gi|281299911|gb|EFA92271.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
35310]
Length = 723
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 150/460 (32%), Positives = 227/460 (49%), Gaps = 38/460 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP SWL+QQ LQ++IL+R L M PVLPAFSG+VPA L+ ++P A I
Sbjct: 199 MANIDKWNGPLPMSWLEQQKELQQRILLRERSLNMKPVLPAFSGHVPAKLKELYPQANIK 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W + R + L+ DPLF +I + ++E+Q +G T HIY D F+E PP
Sbjct: 259 YLGRWAGFSDNYR---CHFLNPEDPLFAKIQKMYLEEQKALFG-TDHIYGIDPFNEVDPP 314
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
PEY+ + IY + S D A W+ W+F ++ W P ++KALL V GK+ +
Sbjct: 315 SWKPEYLKEISHNIYRTVTSVDPGAEWMQMSWMFYHNKKQWTPKRIKALLTGVSRGKMSL 374
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W T+ FYG PYIWC L NF GN + G + EA +N ++G
Sbjct: 375 LDYHCENVELWKTTNNFYGQPYIWCYLGNFGGNTTITGNVKESGQRLNEALNKKNKNLIG 434
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + A+ D K WI+ + R G S P ++ AW +L++ +
Sbjct: 435 IGSTLEGLDVIQFPYEYILTQAWTATPAD-KEWIDNLADRHVGFSSPKLRQAWQILFNDI 493
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T R + + P + P + GKYQ ++ Y
Sbjct: 494 Y------TQIPRSLGI-LPALRPIL-----GKYQERRTEIT------------------Y 523
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
T + +L S N Y+ DLI + RQ L +L L + Y D G+
Sbjct: 524 PTKRLEEVWKLM--SDVSECDRNEYQLDLIAVGRQVLGNKFLKLKLELDSCYVNKDLVGL 581
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
+ E++ D+D L A + +G W++ A+ N+
Sbjct: 582 QRTGNTMKEVLVDLDYLTAGNSRCSIGKWIDDARAYGNND 621
>gi|379334158|gb|AFD03088.1| putative alpha-N-acetylglucosaminidase [uncultured bacterium 8]
Length = 726
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 157/461 (34%), Positives = 238/461 (51%), Gaps = 36/461 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+++L GW GPLPQSW+D+ L ++IL R LGM PVL FSG+VP L A+ T
Sbjct: 181 LASLDGWSGPLPQSWIDRHADLGRRILARERALGMRPVLQGFSGHVPQELIAER-GARST 239
Query: 61 QLGNW-FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
L W F V +LD DPLF E G + +Q + +G T H+Y D F E TP
Sbjct: 240 TLPWWDFEVG---------MLDPRDPLFEEFGTTLLTEQTRLFG-TDHLYAADPFIETTP 289
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLV 178
PV P ++ + A++ M + D A W++Q W FSY +W P + A L+++P ++
Sbjct: 290 PVSDPADLAQVARAVHGVMTAVDDRATWVLQAWPFSYRSRYWTPERTGAFLDAIPDDGML 349
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART-SENTTM 237
+LDL+AE +P+W + + P++WCMLH+ G +YG LD IA G A+ + ++
Sbjct: 350 ILDLWAEHRPVWQRTDGYRKKPWVWCMLHSLGGRPGLYGKLDEIATGAARAQADARGGSL 409
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G+G SME +PV+Y+L++++A+Q DV+AW+ ++ RYGR+ P + AW++L+
Sbjct: 410 SGIGASMEAFGGDPVLYELLADVAWQGSVDDVRAWLETWTRARYGRATPGLLRAWDLLHD 469
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
+VY ++G VIV P + EG ++ + V ++ S D P
Sbjct: 470 SVY-ASEGPGPPG-SVIVGRPTL--------EGDLRH------ELPVHLADPPSPDVP-- 511
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
+ L + E SA R DL D+T Q L A E +A DA
Sbjct: 512 ---PALAEAWALLADEATQEDSAGPLGR-DLCDVTAQVLTHVACERQWRAADAALARDAD 567
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
G + +R L+ +ED+D LLA L WL A+ A
Sbjct: 568 GFQRAARALLDTIEDLDTLLATRPEHRLDGWLADARGWATT 608
>gi|238506383|ref|XP_002384393.1| alpha-N-acetylglucosaminidase, putative [Aspergillus flavus
NRRL3357]
gi|220689106|gb|EED45457.1| alpha-N-acetylglucosaminidase, putative [Aspergillus flavus
NRRL3357]
Length = 669
Score = 241 bits (616), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 149/481 (30%), Positives = 249/481 (51%), Gaps = 48/481 (9%)
Query: 1 MSNLHG-WGG-PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
+ N+ G WGG + +W++ Q LQKKI+ RI ELGM PVLPAF G VP A++ V P A
Sbjct: 109 LGNIQGSWGGHGVSIAWIEAQFELQKKIVSRIVELGMRPVLPAFPGFVPPAIKRVRPHAT 168
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ W + ++ L D F ++ ++ I +Q++ +G +H+Y D F+E
Sbjct: 169 VVNGSQWSGFQK--KFTEVSFLSPLDRTFADLQKSVISRQMRAFGNITHVYALDQFNEIN 226
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP--FWRPPQMKALLNSVPLGK 176
P Y+ +L + +++ + AVW+MQGWLF YD FW ++ A L+ V
Sbjct: 227 PASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YDKKDFWDSNRISAYLSGVERND 285
Query: 177 -LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+++LDL++E KP W ++ ++G P+IWC LH+F GN+ MYG + +I P+EA +++
Sbjct: 286 DMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMGMYGQIMNITSDPIEA-LNKSD 344
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR--SVP-AIQDAW 292
++VG G++MEG E N +VYDL+ + A+ +D +A+ + RY SVP + AW
Sbjct: 345 SLVGFGLTMEGQEGNEIVYDLLLDQAWSATPIDTRAYFQSWVRSRYSGNLSVPNELYTAW 404
Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS--------KEAV 344
++L TVYN T+ T V + ++ P I + G+ +Y P S E +
Sbjct: 405 DLLRKTVYNNTNLTT---YSVTKSIFEISPDIAGLV-GRVGHYPTPTSINYDPMVLNEVL 460
Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
++ P LW++ + Y YD++D+TRQ + ++
Sbjct: 461 SLFMNATRKEPSLWHNPA---------------------YEYDMVDITRQLMGNAFVNVY 499
Query: 405 LNIIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
+I +++ N V S R L L+ +D +L+C++ F L W+ SA+ E
Sbjct: 500 SVLITSWKSETENRTTKVTSHSERLLNLLSAIDKVLSCNENFSLATWISSARDWGNTTET 559
Query: 462 E 462
+
Sbjct: 560 K 560
>gi|260642393|ref|ZP_05415712.2| alpha-N-acetylglucosaminidase [Bacteroides finegoldii DSM 17565]
gi|260622285|gb|EEX45156.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides finegoldii DSM
17565]
Length = 735
Score = 241 bits (616), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 151/472 (31%), Positives = 236/472 (50%), Gaps = 39/472 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK IL R M P+LPAF+G+VPA L+ ++P AKI
Sbjct: 198 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I F+E+Q K YG T+HIY D F+E P
Sbjct: 258 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQHRFLEEQTKVYG-TNHIYGIDPFNEVDSP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY +QS DS A WL W+F + W P++K+ LN+VP KL++
Sbjct: 314 NWNEEFLSNVSDKIYKSIQSVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++Q+YG PYIWC L NF GN + G L+ + F + G
Sbjct: 374 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 433
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G+++EG++ NP++Y+ + E A+++ + V WI ++ R G I AW LY +
Sbjct: 434 LGVTLEGLDVNPLMYEFVFERAWEN-SIPVHQWIANWAQCRGGNVDNHIIKAWKQLYEKI 492
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T A ++ A P ++ T Y Y + LW
Sbjct: 493 Y--TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWE 532
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+++A ++ + Y +D+I++ RQ L + + Y+ D G
Sbjct: 533 IWKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFADCYRKKDLEGT 583
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK----QVRC 467
+R +L+ D+D LL C +G W++ A+ A NE+++K RC
Sbjct: 584 KVWGQRMDQLLLDVDRLLCCSPVLSIGKWIKDARDFAVNEQEQKYYEENARC 635
>gi|393785791|ref|ZP_10373937.1| hypothetical protein HMPREF1068_00217 [Bacteroides nordii
CL02T12C05]
gi|392661410|gb|EIY54996.1| hypothetical protein HMPREF1068_00217 [Bacteroides nordii
CL02T12C05]
Length = 727
Score = 241 bits (616), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 147/461 (31%), Positives = 229/461 (49%), Gaps = 39/461 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ GW GPLP WLD Q+ LQKKIL R EL M PVLPAF+G+VPAAL+ ++P A I
Sbjct: 198 MANIDGWNGPLPMEWLDNQVELQKKILARERELNMKPVLPAFAGHVPAALKRIYPEANIQ 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R Y L+ +PLF I + F+++Q + +G T HIY D F+E PP
Sbjct: 258 HLGKWAGFADTYR---CYFLNPEEPLFATIQKHFLQEQTRLFG-TDHIYGVDPFNEVDPP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PEY+S + + +Y + + D A W+ W+F +D W P++KALL VP K+ +
Sbjct: 314 SWEPEYLSQVSSDMYRTLTAADPKAEWMQMTWMFYHDRKDWTAPRIKALLTGVPQDKMFL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+G PYIWC L NF GN + G + A + + + G
Sbjct: 374 LDYHCENVELWKNTEHFHGQPYIWCYLGNFGGNTTLTGNVKESGDRLDNALINGGSNLRG 433
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ +D +AW+ + R G +++AW++L++ +
Sbjct: 434 IGSTLEGLDVMQFPYEYIFEKAWDL-NLDNEAWLQNLADRHAGTVSQPVREAWDILFNQI 492
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y V P T G NY +PV + ++ Y
Sbjct: 493 Y--------------VQVPK--------TLGVLPNY-RPVMNKPNRRTVID--------Y 521
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
S + +++A E + + + + R D+I + RQ L Y + + Y + D G+
Sbjct: 522 SNATLLQAWEKLLQATD--CNRDALRLDIITVGRQLLGNYFLIVKDDFDRMYTVKDLPGL 579
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
+ E++ D+D L A H L WL A+ L E
Sbjct: 580 KARAAEMKEILNDLDRLNAFHSRCALDKWLADARALGTTPE 620
>gi|423346424|ref|ZP_17324112.1| hypothetical protein HMPREF1060_01784 [Parabacteroides merdae
CL03T12C32]
gi|409220242|gb|EKN13198.1| hypothetical protein HMPREF1060_01784 [Parabacteroides merdae
CL03T12C32]
Length = 718
Score = 241 bits (615), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 156/474 (32%), Positives = 241/474 (50%), Gaps = 54/474 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQK+I+ R+ E G+ PV P +SG VP + ++
Sbjct: 187 MNNLEGWGGPNPDSWYKQQIALQKRIVKRMREYGIEPVFPGYSGMVPHNAKEKL-GLNVS 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + L TDP F EI + ++ K YG+ + Y+ D F E
Sbjct: 246 DPGLWNGYRRPA------FLQPTDPRFEEIASLYYKEMNKLYGKADY-YSMDPFHEGGSV 298
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V + + G AI M+ + AVW+ Q W +P PQM + ++ G L+VL
Sbjct: 299 VGVD--LDAAGKAIMQAMKKNNPKAVWVAQAW--QANP---RPQM---IGNLEAGDLIVL 348
Query: 181 DLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE 233
DLFAE +P W K +G +I+CML N+ GN+ ++G + + +A+ S
Sbjct: 349 DLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLNYGGNVGLHGKMKHVIDEFYKAKESP 408
Query: 234 -NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAW 292
T+ GVGM+MEG E NPV+++L++E+ ++ ++ D W+ +Y+V RYG+S P +QDAW
Sbjct: 409 FGKTLKGVGMTMEGSENNPVMFELLTELPWRPQRFDKDQWLREYTVARYGKSNPTVQDAW 468
Query: 293 NVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS 350
+L +++YNC D T + + V A P TE YQ S
Sbjct: 469 ILLSNSIYNCPDANTQQGTHESVFCARP---------TEHPYQ---------------VS 504
Query: 351 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE- 409
S+ +Y + VIRA + ++ +E +N + YDL+D+ RQA+A+ L ++E
Sbjct: 505 SWSEMKDYYDPNNVIRAAAMMVSVADEFKGNNNFEYDLVDIVRQAIAE-KGRLTEKVVEA 563
Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
A+ D S RFL L+ D LLA F +G W+ A+ L E+++
Sbjct: 564 AFAAGDKKLYKDASDRFLRLILLQDELLATRPEFKVGTWIARARSLGSTPEEKE 617
>gi|393236266|gb|EJD43816.1| putative alpha-N-acetylglucosaminidase [Auricularia delicata
TFB-10046 SS5]
Length = 778
Score = 241 bits (615), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 144/480 (30%), Positives = 243/480 (50%), Gaps = 50/480 (10%)
Query: 8 GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFS 67
G LP W+D Q LQKKI+ R+ ELGM P LP+F+G VP A+ V P A + W S
Sbjct: 219 GSSLPMEWIDDQFELQKKIVRRMVELGMTPALPSFTGFVPRAISRVLPGASVVNGSRW-S 277
Query: 68 VKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYI 127
D T+L + DP F + ++FIE+Q+ YG SH+Y D ++EN P + Y+
Sbjct: 278 GFPDALTRVTFL-EPFDPAFARLQKSFIEKQIAAYGPVSHVYTLDQYNENDPLKNDVGYL 336
Query: 128 SSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLG-KLVVLDLFAE 185
+ + + +++ D DA+WLMQGWLF S FW +++A L V +++LDLF+E
Sbjct: 337 RDVSRSTWQSLKAADPDAIWLMQGWLFYSNRGFWTNARVEAFLGGVEKNDDMLILDLFSE 396
Query: 186 VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSME 245
+P W + +YG P+IWC LH++ GN+ +YG + +I VEA ++ ++VG G++ME
Sbjct: 397 SEPQWQRTNSYYGKPWIWCQLHDYGGNLGLYGQVMNITLNAVEA-LEKSPSLVGFGLTME 455
Query: 246 GIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY------GRSVP-AIQDAWNVLYHT 298
G E N ++YDL+ A+ + +D ++ ++ RRY G +P AI +AW++L T
Sbjct: 456 GQEGNEIMYDLLLSQAWSRKPIDTASYFRSWATRRYNAGGIIGSLLPSAIYNAWDILRTT 515
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYN T A++ V + ++ P++ + + + + +
Sbjct: 516 VYNNTKLASNA---VTKSVFELRPALSGI-------------------ANRTGHHATTIT 553
Query: 359 YSTSEVIRALELFIASGNELSA----SNTYRYDLIDLTRQALAKYANELFLNIIEAYQ-- 412
Y T +++A +LF + A + Y +D +D RQ L+ + + +++ Y
Sbjct: 554 YDTQALVKAYDLFDKAAIYTPALWFNNPAYEFDNVDFARQVLSNAFSTQYDDLVATYNEI 613
Query: 413 ----------LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
A + R + ++ +D +L F L WL+ A+ A+ +E
Sbjct: 614 SKPGGSGATLAEAAKIIHDKGERMMGVLASLDKVLRTSKHFTLKKWLQDARAWARGGHEE 673
>gi|404487206|ref|ZP_11022393.1| hypothetical protein HMPREF9448_02854 [Barnesiella intestinihominis
YIT 11860]
gi|404335702|gb|EJZ62171.1| hypothetical protein HMPREF9448_02854 [Barnesiella intestinihominis
YIT 11860]
Length = 731
Score = 240 bits (613), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 156/477 (32%), Positives = 234/477 (49%), Gaps = 55/477 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GGP+ Q ++D+Q LQ+K+L R+ EL M PV F G VP +L+ FP A I
Sbjct: 196 MGNLEGFGGPVSQKFIDRQTDLQQKMLRRMRELDMAPVFQGFYGMVPNSLKEKFPEANIK 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ G W + + LD DPLF +I + E+Q K +G+ + + D F E +
Sbjct: 256 EQGEWQTYQRPA------FLDPNDPLFDKIADIYYEEQEKLFGKAVY-FAGDPFHEGGQS 308
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D + + I M+ +AVW++QGW R P M+ LL + G+ +
Sbjct: 309 EGID----VKAAAKKILKAMRRKTPEAVWIIQGWQ-------RNP-MRDLLEGLEHGEAI 356
Query: 179 VLDLFAEVKPIWSTSKQ--FYGVP------YIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
+LDL A +P W K FY +IWC L NF G ++G + S A G V A+
Sbjct: 357 ILDLMACERPQWGGIKNSLFYKAEGHMHHDWIWCALPNFGGKTGLHGKMSSYASGVVFAK 416
Query: 231 TSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQ 289
+ G+G + EGI PVVYD++ +MA++ + +D+K W+NQY+ RYG++ P
Sbjct: 417 NHPLGKNLCGIGTAPEGIGTIPVVYDMVYDMAWREDSIDIKDWVNQYTQYRYGKADPNCN 476
Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 349
AW +L T+Y C + I A P +
Sbjct: 477 RAWEILSKTIYECHNEIGGPVESYICARP------------------------SDTIKHA 512
Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
SS+ ++Y +E++ A E +E + S TY+YDL+DLTRQ L YA L +
Sbjct: 513 SSWGTAEIFYDPAEIVTAWECMYNVRHEFAQSETYQYDLVDLTRQVLGDYAKYLHKQAVN 572
Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN-EEQEKQV 465
A+ ND G S +FL L+ D D LL+ F +G W+ A+ A +EQE+ V
Sbjct: 573 AFYRNDLKGFQTYSSKFLVLIRDEDKLLSTRKEFNVGTWINQARNAACTPQEQERFV 629
>gi|156046298|ref|XP_001589681.1| hypothetical protein SS1G_09403 [Sclerotinia sclerotiorum 1980]
gi|154693798|gb|EDN93536.1| hypothetical protein SS1G_09403 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 795
Score = 240 bits (612), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 153/465 (32%), Positives = 254/465 (54%), Gaps = 39/465 (8%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG LP SW+++Q +LQKKI+ R+ ELG+ PVLPAF+G VP+AL+ + P+A I
Sbjct: 207 FGNIQGSWGGTLPLSWIEEQHLLQKKIVKRMVELGITPVLPAFTGFVPSALRRIAPNANI 266
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
G+W ++ T+L TDPLF + F+ Q + YG +HIY D ++EN P
Sbjct: 267 INGGDWGNIFPVEYSNDTFLY-PTDPLFTTLQHKFLSFQSEYYGNVTHIYTLDQYNENNP 325
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-L 177
Y+ ++ Y +QS D AVW++QGWLF S FW +++A + VP + +
Sbjct: 326 ASGDLSYLRNVSRGTYESLQSFDPCAVWMLQGWLFYSLSSFWTQDRIEAYIGGVPKNESM 385
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTT 236
++LDLF+E P W + +YG P+IWC L ++ G + +YG + +I +EA R SEN
Sbjct: 386 LILDLFSESFPQWERTHYYYGKPWIWCQLRDYGGTLGLYGQIYNITNSLIEAFRESEN-- 443
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDV----KAWINQ-YSVRRYGRSVPA-IQD 290
MVGVG +MEG N ++Y+L+ + A+ + +D K+W+ + Y ++ + +P I +
Sbjct: 444 MVGVGNTMEGQGGNGLMYELLLDQAWNIDPIDTEDYFKSWVRKRYHIKGAKKRLPGEIYE 503
Query: 291 AWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS 350
AW++L T YN T+ + V + ++ P+I +N+G+ ++S
Sbjct: 504 AWDILRRTAYNNTNLTLADS--VPKSLHELQPNIT-------ENHGR--------LGQSS 546
Query: 351 SYDHPHLWYSTSEVIRALELFI---ASGNELSASNTYRYDLIDLTRQALAKYANELFLNI 407
+ D Y ++ RA EL S EL +++D++D+TRQ LA+ ++ +
Sbjct: 547 TIDL----YDPDDLFRAWELLYNASVSVPELWEDKGWKFDMVDITRQVLAERFKLEYVEL 602
Query: 408 IEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 452
IE Y+ + + ++E +D +L+ F L W+ +A
Sbjct: 603 IEKYK--KGADISCDGDILIGILESLDDVLSASPHFRLDTWVNAA 645
>gi|154489986|ref|ZP_02030247.1| hypothetical protein PARMER_00215 [Parabacteroides merdae ATCC
43184]
gi|423722990|ref|ZP_17697143.1| hypothetical protein HMPREF1078_01203 [Parabacteroides merdae
CL09T00C40]
gi|154089428|gb|EDN88472.1| Alpha-N-acetylglucosaminidase (NAGLU) [Parabacteroides merdae ATCC
43184]
gi|409241820|gb|EKN34587.1| hypothetical protein HMPREF1078_01203 [Parabacteroides merdae
CL09T00C40]
Length = 718
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 158/476 (33%), Positives = 243/476 (51%), Gaps = 58/476 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQK+I+ R+ E G+ PV P +SG VP + ++
Sbjct: 187 MNNLEGWGGPNPDSWYKQQIALQKRIVKRMREYGIEPVFPGYSGMVPHNAKEKL-GLNVS 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W + L TDP F EI + ++ K YG+ + Y+ D F E +
Sbjct: 246 DPGLWNGYRRPA------FLQPTDPRFEEIASLYYKEMNKLYGKADY-YSMDPFHEGGSV 298
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD + + G AI M+ + AVW+ Q W +P PQM + ++ G L+
Sbjct: 299 AGVD----LDAAGKAIMQAMKKNNPKAVWVAQAW--QANP---RPQM---IGNLEAGDLI 346
Query: 179 VLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
VLDLFAE +P W K +G +I+CML N+ GN+ ++G L + +A+
Sbjct: 347 VLDLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLNYGGNVGLHGKLKHVIDEFYKAKE 406
Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
S T+ GVGM+MEG E NPV+++L++E+ + ++ D W+ +Y+V RYG+S P +QD
Sbjct: 407 SPFGKTLKGVGMTMEGSENNPVMFELLTELPWCPQRFDKDQWLREYTVARYGKSNPTVQD 466
Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
AW +L +++YNC D T + + V A P TE YQ
Sbjct: 467 AWILLSNSIYNCPDANTQQGTHESVFCARP---------TEHPYQ--------------- 502
Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
SS+ +Y ++VIRA + ++ +E +N + YDL+D+ RQA+A+ L ++
Sbjct: 503 VSSWSEMKDYYDPNDVIRAAAMMVSVADEFKGNNNFEYDLVDIVRQAIAE-KGRLTEKVV 561
Query: 409 E-AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
E A+ D S RFL L+ D LLA F +G W+ A+ L E+++
Sbjct: 562 EAAFAAGDKKLYKDASDRFLRLILLQDELLATRPEFKVGTWIARARSLGGTPEEKE 617
>gi|358378969|gb|EHK16650.1| glycoside hydrolase family 89 protein [Trichoderma virens Gv29-8]
Length = 748
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 142/462 (30%), Positives = 253/462 (54%), Gaps = 35/462 (7%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG +P+SW+D Q LQ KIL R+ ELG+ P+LPAF G VP + VFP +
Sbjct: 201 FGNIQGSWGGSMPRSWVDSQFDLQLKILDRMEELGITPILPAFPGFVPRNISRVFPDISL 260
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
+ W + ++ ++ DP F ++ + FI +Q + YG ++ + D F+EN P
Sbjct: 261 STSPIWSNFGTEL--SADIYINPFDPRFAQLQKLFISKQQELYGNVTNFWTLDQFNENQP 318
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGK-L 177
Y+ ++ +S +++ D +AVW+MQ WLFS D FW ++++ L +P+ +
Sbjct: 319 LSGDLGYLQNVSHNTWSALKAADPEAVWVMQAWLFSSDSAFWTNDRIESFLGGIPVNSDM 378
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDLFAE P W + FYG P+IWC LH++ GN+ +YG ++++ ++A + ++
Sbjct: 379 LLLDLFAESAPQWLRTNSFYGKPWIWCELHDYGGNMGLYGQIENVTINSMDA-VRNSGSL 437
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVPAIQDAWNVLY 296
VG G++MEG E N ++YDL+ + A+ + +D + + + + RYG ++V ++ W +L
Sbjct: 438 VGFGLTMEGQEGNEIMYDLLLDQAWSPKPIDTETYFHDWVSTRYGTKNVKSLYTGWELLR 497
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
TV+N T+ + + I+ ++ PS + G+ ++G ++
Sbjct: 498 PTVFNNTNLTMNAVQKSIL---ELVPSTTGLL-GRVGHHGTTIT---------------- 537
Query: 357 LWYSTSEVIRA-LELFIASGNE--LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
Y+ + ++ A ELF A + L + Y+YDL+D TRQ L L+ +++ AY
Sbjct: 538 --YNPAVMVEAWTELFKAGLQDIKLFTNPAYQYDLVDWTRQVLVNSFEGLYKDLVAAYNS 595
Query: 414 NDAHGVFQLSR--RFLELVEDMDGLLACHDGFLLGPWLESAK 453
+ V + SR + + L+ +D +LA ++ F L PW+ A+
Sbjct: 596 AASSSVIK-SRGAKLIALLRTLDAVLATNEHFQLTPWINEAR 636
>gi|423248233|ref|ZP_17229249.1| hypothetical protein HMPREF1066_00259 [Bacteroides fragilis
CL03T00C08]
gi|423253182|ref|ZP_17234113.1| hypothetical protein HMPREF1067_00757 [Bacteroides fragilis
CL03T12C07]
gi|392657082|gb|EIY50719.1| hypothetical protein HMPREF1067_00757 [Bacteroides fragilis
CL03T12C07]
gi|392660340|gb|EIY53954.1| hypothetical protein HMPREF1066_00259 [Bacteroides fragilis
CL03T00C08]
Length = 732
Score = 238 bits (608), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 153/474 (32%), Positives = 232/474 (48%), Gaps = 54/474 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGP+ Q ++D+Q LQKK+L R+ E GM PVL F G VP ++ FP+A I
Sbjct: 198 MGNLEKFGGPVSQQFIDRQTKLQKKMLDRMREYGMEPVLQGFYGMVPNSMITKFPNADIR 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W + + L +DPLF ++ F E+Q K +G S Y D F E N+
Sbjct: 258 NAGKWITYQRPA------FLVPSDPLFAKVAEIFYEEQKKLFGE-SRYYGGDPFHEGGNS 310
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
++ I+ + IY M++ + +A+W++QGW S +P ALL + G+ +
Sbjct: 311 KGIN----ITEAASNIYKAMKTNNPNAIWVLQGW--SGNP------SVALLKGLKHGEAL 358
Query: 179 VLDLFAEVKPIWS--------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
VLDL A +P W F +IWC L NF G I MYG L S A G ++A
Sbjct: 359 VLDLMACARPQWGGEPSSSFHREDGFLDHNWIWCALPNFGGRIGMYGKLQSYATGVIKAE 418
Query: 231 TSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQ 289
V G+G + EGI NP+ YD++ +MA++ + +D+K+WI Y+ RYG +
Sbjct: 419 HHPKGKYVCGIGTTPEGIGTNPINYDMVYDMAWRTDSIDIKSWIANYTTYRYGSENSNAK 478
Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 349
A L +VYNC A A P + +S
Sbjct: 479 AAMLQLSTSVYNCPWAADGPQESYFCARPSLKIDYVS----------------------- 515
Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
S+ HL+Y V++ALE + + EL +TYRYD++D+TRQ LA Y + I +
Sbjct: 516 -SWGTAHLYYQPINVLQALEHLLKAEKELGYIDTYRYDVVDITRQMLADYGKYIHKCISD 574
Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
AY+ + + +FL+++ D D LL+ FLLG ++ A N +++
Sbjct: 575 AYKEKNIKKFDLYTSKFLQMILDQDLLLSTRKEFLLGEYIRQADTCGSNPTEKR 628
>gi|218258436|ref|ZP_03474815.1| hypothetical protein PRABACTJOHN_00470 [Parabacteroides johnsonii
DSM 18315]
gi|423342591|ref|ZP_17320305.1| hypothetical protein HMPREF1077_01735 [Parabacteroides johnsonii
CL02T12C29]
gi|218225494|gb|EEC98144.1| hypothetical protein PRABACTJOHN_00470 [Parabacteroides johnsonii
DSM 18315]
gi|409217508|gb|EKN10484.1| hypothetical protein HMPREF1077_01735 [Parabacteroides johnsonii
CL02T12C29]
Length = 718
Score = 238 bits (607), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 156/475 (32%), Positives = 243/475 (51%), Gaps = 58/475 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQ++I+ R+ E G+ PV P +SG VP + ++
Sbjct: 187 MNNLEGWGGPNPDSWYKQQIALQQQIVKRMREYGIEPVFPGYSGMVPHNAKEKL-GLNVS 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W + L TDP F EI + ++ K YG+ ++ Y+ D F E +
Sbjct: 246 DPGLWNGYRRPA------FLQPTDPRFEEIASLYYKEMNKLYGKANY-YSMDPFHEGGSV 298
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD + + G AI M+ + AVW+ Q W +P PQM + ++ G L+
Sbjct: 299 AGVD----LDAAGKAIMQAMKKNNPKAVWVAQAW--QANP---RPQM---IGNLEAGDLI 346
Query: 179 VLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
LDLFAE +P W K +G +I+CML N+ GNI ++G + + +A+
Sbjct: 347 ALDLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLNYGGNIGLHGKMKHVIDEFYKAKE 406
Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
S TT+ GVGM+MEG E NPV+++L++E+ ++ ++ D W+ Y+V RYG+S P +QD
Sbjct: 407 SPFGTTLKGVGMTMEGSENNPVMFELLTELPWRPQRFDKDQWLKAYTVARYGKSNPVVQD 466
Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
AW +L +++YNC D T + + V A P TE YQ
Sbjct: 467 AWILLSNSIYNCPDANTQQGTHESVFCARP---------TEHPYQ--------------- 502
Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
SS+ +Y ++VIRA + ++ ++ +N + YDL+D+ RQA+A+ L ++
Sbjct: 503 VSSWSEMKDYYDPNDVIRAAAMMVSVSDQFKGNNNFEYDLVDIVRQAIAE-KGRLTEKVV 561
Query: 409 E-AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
E A+ D S RFL L+ D LLA F +G W+ A+ L E++
Sbjct: 562 EAAFAAGDKKLYKDASDRFLRLILLQDELLATRPEFKVGTWIARARSLGNTSEEK 616
>gi|423269877|ref|ZP_17248849.1| hypothetical protein HMPREF1079_01931 [Bacteroides fragilis
CL05T00C42]
gi|423272668|ref|ZP_17251615.1| hypothetical protein HMPREF1080_00268 [Bacteroides fragilis
CL05T12C13]
gi|392700723|gb|EIY93885.1| hypothetical protein HMPREF1079_01931 [Bacteroides fragilis
CL05T00C42]
gi|392708745|gb|EIZ01850.1| hypothetical protein HMPREF1080_00268 [Bacteroides fragilis
CL05T12C13]
Length = 732
Score = 238 bits (607), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 153/474 (32%), Positives = 232/474 (48%), Gaps = 54/474 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGP+ Q ++D+Q LQKK+L R+ E GM PVL F G VP ++ FP+A I
Sbjct: 198 MGNLEKFGGPVSQQFIDRQTKLQKKMLDRMREYGMEPVLQGFYGMVPNSMITKFPNADIR 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W + + L +DPLF ++ F E+Q K +G S Y D F E N+
Sbjct: 258 DAGKWITYQRPA------FLVPSDPLFAKVAEIFYEEQKKLFGE-SRYYGGDPFHEGGNS 310
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
++ I+ + IY M++ + +A+W++QGW S +P ALL + G+ +
Sbjct: 311 KGIN----ITEAASNIYKAMKTNNPNAIWVLQGW--SGNP------SVALLKGLKHGEAL 358
Query: 179 VLDLFAEVKPIWS--------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
VLDL A +P W F +IWC L NF G I MYG L S A G ++A
Sbjct: 359 VLDLMACARPQWGGEPSSSFHREDGFLDHNWIWCALPNFGGRIGMYGKLQSYATGVIKAE 418
Query: 231 TSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQ 289
V G+G + EGI NP+ YD++ +MA++ + +D+K+WI Y+ RYG +
Sbjct: 419 HHPKGKYVCGIGTTPEGIGTNPINYDMVYDMAWRTDSIDIKSWIANYTTYRYGSENSNAK 478
Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 349
A L +VYNC A A P + +S
Sbjct: 479 AAMLQLSTSVYNCPWAADGPQESYFCARPSLKIDYVS----------------------- 515
Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
S+ HL+Y V++ALE + + EL +TYRYD++D+TRQ LA Y + I +
Sbjct: 516 -SWGTAHLYYQPINVLQALEHLLKAEKELGYIDTYRYDVVDITRQMLADYGKYIHKCISD 574
Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
AY+ + + +FL+++ D D LL+ FLLG ++ A N +++
Sbjct: 575 AYKEKNIKKFDLYTSKFLQMILDQDLLLSTRKEFLLGEYIRQADTCGSNPTEKR 628
>gi|380697007|ref|ZP_09861866.1| alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
Length = 703
Score = 238 bits (607), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 150/472 (31%), Positives = 234/472 (49%), Gaps = 39/472 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK IL R M P+LPAF+G+VPA L+ ++P AKI
Sbjct: 166 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 225
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I R F+E+Q K YG T+HIY D F+E P
Sbjct: 226 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 281
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY +Q DS A WL W+F + W P++K+ LN+VP KL++
Sbjct: 282 NWNEEFLSNVSDKIYKSIQDVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 341
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++Q+YG PYIWC L NF GN + G L+ + F + G
Sbjct: 342 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 401
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G+++EG++ NP++Y+ + E A+++ + WI ++ R G I AW LY +
Sbjct: 402 LGVTLEGLDVNPLMYEFVFERAWEN-SMPAHQWIANWAQCRGGNVDNHIVKAWKQLYEKI 460
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T A ++ A P ++ T Y Y + LW
Sbjct: 461 Y--TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWE 500
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+++A ++ + Y +D+I++ RQ L + + Y+
Sbjct: 501 IWKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKKLEET 551
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK----QVRC 467
+R +L+ D+D LL C F +G W++ AK A NE+++K RC
Sbjct: 552 KVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDAKDFAVNEQEQKYYEENARC 603
>gi|392566857|gb|EIW60032.1| alpha-N-acetylglucosaminidase [Trametes versicolor FP-101664 SS1]
Length = 747
Score = 238 bits (606), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 156/467 (33%), Positives = 251/467 (53%), Gaps = 34/467 (7%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG LP +W+D Q LQK++L R+ ELGM PV+P+F+G VP AL + P+A I
Sbjct: 193 FGNIQGSWGGELPTAWVDDQFALQKRLLPRMVELGMTPVMPSFTGFVPRALAALHPNASI 252
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGR-TSHIYNCDTFDENT 118
W + L+ DPLF + ++FI +Q YG SH+Y D ++EN
Sbjct: 253 VTGSQWSGFPTSL--TNDSFLEPFDPLFATLQQSFIAKQQAAYGADISHVYTLDQYNEND 310
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLG-K 176
P +Y+ ++ A ++ +++ D AVWLMQGWLF D FW ++ A L VP
Sbjct: 311 PFSGDLDYLRNVSAGTFASLRAADPAAVWLMQGWLFFSDAVFWTDDRVAAYLGGVPGNDS 370
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
++VLDL++E +P W+ + + G ++WC LH++ GNI M G LD + P+ A +S ++
Sbjct: 371 MIVLDLYSEAQPQWNRTASYSGKQWVWCELHDYGGNIGMEGNLDVLTHAPLTALSSPGSS 430
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVP-AIQDAWNV 294
M GVG++MEG E N +VY ++ + A+ ++ ++++ + RRY + +P A QDAW +
Sbjct: 431 MKGVGLTMEGQEGNEIVYGVLLDQAWSATSLNTSSYVSSWVSRRYPVKPLPKAAQDAWRI 490
Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
L TVYN D T + I ++ P++ +T + P S YD
Sbjct: 491 LSTTVYNNQDPNT---QATIKGIYELAPALTGMTNRIGHH---PTSIP---------YD- 534
Query: 355 PHLWYSTSEVIRALELFI---ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 411
+ + ++ AL+L + A LSA + YD++D+ RQ L+ L+ +I+ Y
Sbjct: 535 -----TDATMLSALKLLLEARAQHPTLSAVPEFVYDVVDVARQLLSNRFIGLYDTLIQTY 589
Query: 412 QLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
A V + L L+ D+D LL+ ++ FLL W+ A++ A
Sbjct: 590 NSTSSTAQSVSAAGQPLLALLTDLDALLSTNEHFLLSSWIADARKWA 636
>gi|423223006|ref|ZP_17209475.1| hypothetical protein HMPREF1062_01661 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392640582|gb|EIY34381.1| hypothetical protein HMPREF1062_01661 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 755
Score = 237 bits (605), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 153/462 (33%), Positives = 221/462 (47%), Gaps = 46/462 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGPLP+S +D+ ++L KKIL R ELGM P+ FSG VP LQ +P AKI+
Sbjct: 195 MQNIQSYGGPLPKSVIDKHVILGKKILARQLELGMQPIQQGFSGYVPRELQAKYPQAKIS 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W T LD TDPLF E+G AF+E+Q K +G + +Y D F E+ PP
Sbjct: 255 MKRKWCGFDG------TAQLDPTDPLFHEMGLAFLEEQDKLFG-SYGVYAADPFHESAPP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+D+PEY++ +G I+ Q+ D+ A+W+MQ W D ++ +VP L++L
Sbjct: 308 IDTPEYLTGVGQTIHKLFQTFDAGALWVMQAWSMRED----------IVKAVPKESLLIL 357
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL + + +G P I LHNF G I M+G L +A + + + G
Sbjct: 358 DLNGSK----TAANGGWGYPVIAGNLHNFGGRINMHGDLALLASNQYQKAKARYPNVCGS 413
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G+ ME IEQNPV Y+L EM + + ++AW+ Y+ RRYG A AW L Y
Sbjct: 414 GLFMEAIEQNPVYYELAFEMPNHADSIPLQAWLAAYAERRYGAKSAAAGKAWMYLLEGPY 473
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
G R IVA P++ G G P Y
Sbjct: 474 R--QGTNGTERSSIVA---ARPALNVKKSGPNAGLGIP--------------------YE 508
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
VIRA + ++L+ S YR+D++D+ RQ + + EA+ D
Sbjct: 509 PMLVIRAQSQLLKDADKLAFSKPYRFDIVDVQRQMMTNLGQLVHKKAAEAFASKDKAAFV 568
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
S RFLEL+ DMD LL + WL A+ + +E++
Sbjct: 569 LHSGRFLELLRDMDELLYTRSEYSFDRWLTEARSWGETKEEK 610
>gi|329963073|ref|ZP_08300853.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
gi|328529114|gb|EGF56044.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
Length = 717
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 154/479 (32%), Positives = 250/479 (52%), Gaps = 64/479 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW Q+ LQK+IL R+ E G+ PVLP +SG VP +
Sbjct: 187 MNNLEGWGGPNPDSWYTQREALQKQILKRMREYGIQPVLPGYSGMVPHNAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L TDP F EI + ++ + YG+ + Y+ D F E
Sbjct: 239 RLG--LNV-SDPGLWCGYPRPAFLQPTDPRFGEIADLYYKEMTRLYGK-ADFYSMDPFHE 294
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+ VD +++ G AI+ M+ + AVW+ Q W + P+ K ++ ++P
Sbjct: 295 GGSIAGVD----LNAAGQAIWGAMKKVNPKAVWVAQAWQAN-------PRQK-MIENIPQ 342
Query: 175 GKLVVLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
G L+VLDLF+E +P W K+ +G +++CML N+ GN+ ++G + +
Sbjct: 343 GDLIVLDLFSESRPQWGDPASTWYRKEGFGKHDWLYCMLLNYGGNVGLHGKMRHVIDEFY 402
Query: 228 EARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVP 286
+A+TS T+ GVGM+MEG E N V+++L+ E+ ++ + + W+ Y+ RYG++
Sbjct: 403 KAKTSPFGKTLKGVGMTMEGSENNSVMFELLCELPWRPAQFEKDEWLKNYTAARYGKADA 462
Query: 287 AIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
+Q AW +L +++YNC D T + + V A P +D YQ
Sbjct: 463 TVQQAWLLLSNSIYNCPDANTQQGTHESVFCARPGMDV---------YQ----------- 502
Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
SS+ +Y EVIRA + +++ + +N + YDL+D+ RQA+A+ ++
Sbjct: 503 ----VSSWSEMVKYYEPEEVIRAAGILLSAADRFKGNNNFEYDLVDIVRQAVAEKGRLVY 558
Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+I+A + + S+RFL L+ D LLA F +G W+E A+ L +E++K
Sbjct: 559 PIMIDALKAGEKELFAAASQRFLNLILLQDRLLATRPEFKVGTWIEKARNLGTTQEEKK 617
>gi|393783261|ref|ZP_10371436.1| hypothetical protein HMPREF1071_02304 [Bacteroides salyersiae
CL02T12C01]
gi|392669540|gb|EIY63028.1| hypothetical protein HMPREF1071_02304 [Bacteroides salyersiae
CL02T12C01]
Length = 724
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 138/461 (29%), Positives = 228/461 (49%), Gaps = 39/461 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ GW GPLP WLD Q+ LQKKIL R EL M PVLPAF+G+VP AL+ +FP A I
Sbjct: 198 MANIDGWNGPLPMHWLDSQVELQKKILTRERELNMKPVLPAFAGHVPGALKRIFPEANIQ 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W + R + L+ + LF I + +I++Q + +G T HIY D F+E PP
Sbjct: 258 NLGKWAGFAEEYR---CHFLNPEEALFATIQKQYIKEQTRLFG-TDHIYGVDPFNEVDPP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PEY+S + A +Y + + D A W+ W+F +D W P++KA+L VP GK+V+
Sbjct: 314 SWEPEYLSKVSADMYHTLTAADPKAEWMQMTWMFYFDRKDWTAPRVKAMLTGVPQGKMVL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W T++ F+G PYIWC L NF GN + G + + + G
Sbjct: 374 LDYHCENVELWKTTEHFHGQPYIWCYLGNFGGNTTLTGNVKESGARLDNTLINGGSNFKG 433
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ D ++W+N + R G + +++AW++L++ V
Sbjct: 434 IGSTLEGLDVMQFPYEYIFEKAWTL-NTDDRSWLNALADRHTGVTSEPVREAWDILFNQV 492
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y + P++ P + KP ++ ++ Y
Sbjct: 493 YVQVP-------RTLAVLPNLRPVM-----------NKPNNRTSIN-------------Y 521
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ +++A + + + + + R D+I + RQ L Y + + Y+ D +
Sbjct: 522 PNTALLQAWQKLLQAPD--CNRDALRLDIITVGRQLLGNYFLTVKDDFDRMYEAKDLPAL 579
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
+ E++ D++ L A H L W+ A++ E
Sbjct: 580 KARAAEMREILNDLERLNAFHSRCSLDKWISDARKYGNTPE 620
>gi|224537227|ref|ZP_03677766.1| hypothetical protein BACCELL_02104 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521150|gb|EEF90255.1| hypothetical protein BACCELL_02104 [Bacteroides cellulosilyticus
DSM 14838]
Length = 755
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 153/462 (33%), Positives = 221/462 (47%), Gaps = 46/462 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGPLP+S +D+ ++L KKIL R ELGM P+ FSG VP LQ +P AKI+
Sbjct: 195 MQNIQSYGGPLPKSVIDKHVILGKKILARQLELGMQPIQQGFSGYVPRELQAKYPQAKIS 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W T LD TDPLF E+G AF+E+Q K +G + +Y D F E+ PP
Sbjct: 255 MKRKWCGFDG------TAQLDPTDPLFHEMGLAFLEEQDKLFG-SYGVYAADPFHESAPP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+D+PEY++ +G I+ Q+ D+ A+W+MQ W D ++ +VP L++L
Sbjct: 308 IDTPEYLTGVGQTIHKLFQTFDAGALWVMQAWSMRED----------IVKAVPKESLLIL 357
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL + + +G P I LHNF G I M+G L +A + + + G
Sbjct: 358 DLNGSK----TAANGGWGYPVIAGNLHNFGGRINMHGDLALLASNQYQKAKARYPNVCGS 413
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G+ ME IEQNPV Y+L EM + + ++AW+ Y+ RRYG A AW L Y
Sbjct: 414 GLFMEAIEQNPVYYELAFEMPNHADSIPLQAWLAAYAERRYGAKSAAAGKAWMYLLEGPY 473
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
G R IVA P++ G G P Y
Sbjct: 474 R--RGTNGTERSSIVA---ARPALNVKKSGPNAGLGIP--------------------YE 508
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
VIRA + ++L+ S YR+D++D+ RQ + + EA+ D
Sbjct: 509 PMLVIRAQSQLLKDADKLAFSKPYRFDIVDVQRQMMTNLGQLVHKKAAEAFASKDKAAFA 568
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
S RFLEL+ DMD LL + WL A+ + +E++
Sbjct: 569 LHSGRFLELLRDMDELLYTRSEYSFDRWLTEARSWGETKEEK 610
>gi|319900259|ref|YP_004159987.1| alpha-N-acetylglucosaminidase [Bacteroides helcogenes P 36-108]
gi|319415290|gb|ADV42401.1| Alpha-N-acetylglucosaminidase [Bacteroides helcogenes P 36-108]
Length = 718
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 151/478 (31%), Positives = 238/478 (49%), Gaps = 64/478 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P W Q LQK+IL R+ E G+ PVLP +SG +PA +
Sbjct: 187 MNNLEGWGGPNPDQWYSHQEQLQKRILKRMREYGIEPVLPGYSGMIPANAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCCTY---LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG V +WC L +D F I R + ++ + YG+ ++ Y+ D F E
Sbjct: 239 KLG--LDVADPGKWCGYRRPAFLQPSDKNFRRIARLYYKEMTRLYGKANY-YSMDPFHEG 295
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW-LFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G +I M+ + AVW+ Q W YD ++ ++P
Sbjct: 296 GNTKGVD----LDAAGKSIRDAMKEANPQAVWVAQAWGACPYD---------NMIKNLPE 342
Query: 175 GKLVVLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
G ++VLDL++E +P W KQ +G +I+CML NF GN+ +YG ++ +
Sbjct: 343 GDMIVLDLYSESRPQWGDPASAWYRKQGFGRHGWIYCMLLNFGGNVGLYGKMEHVIDEFY 402
Query: 228 EARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVP 286
+AR S T+ GVG++MEG E NPV+Y+L+ E+ + ++ W+ Y RYG++ P
Sbjct: 403 KARESAFGGTLQGVGLTMEGSENNPVMYELLCELPWHGRRISKDQWLKSYLKARYGKTTP 462
Query: 287 AIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
+AW L +T+YN + +T + + V A P ++ YQ
Sbjct: 463 QTVEAWLKLSNTIYNSPNASTQQGTHESVFCARPSLEA---------YQ----------- 502
Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
SS+ +Y+ +++IRA I + E +N + YDLID+ RQA+A+ ++
Sbjct: 503 ----VSSWSEMKDYYAPADIIRAAGKMIEAAEEFRGNNNFEYDLIDVVRQAVAEKGRLVY 558
Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
++ AY+ D S RFLEL+E D LL F LG W A+ + + + Q+
Sbjct: 559 PIVVSAYKAADKQLFEAASARFLELIELQDKLLGTRREFRLGTWTNYARNMGETDAQK 616
>gi|400595379|gb|EJP63180.1| alpha-N-acetylglucosaminidase [Beauveria bassiana ARSEF 2860]
Length = 761
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 146/467 (31%), Positives = 241/467 (51%), Gaps = 29/467 (6%)
Query: 1 MSNLHG-WGGP--LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSA 57
N+HG WGG L W++QQ LQKKI+ R+ ELG+ PVLP F G VPAAL+ + P
Sbjct: 205 FGNIHGTWGGEGRLSAEWINQQFALQKKIVARMVELGITPVLPGFPGFVPAALKKLRPDV 264
Query: 58 KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
I + W V + T L+ TD + E+ FI+ Q+KE+G +++Y D F+E
Sbjct: 265 NIAEAPVWVDVPRNN--TATAFLNPTDKTYAELQSLFIKNQIKEFGNVTNVYTVDQFNEI 322
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVP-LG 175
P +YI+ + ++ Y G+ + + A+WLMQGWLF S FW ++ A L P
Sbjct: 323 NPSSGDTKYITDVSSSTYKGITAANPAAIWLMQGWLFYSSQSFWTQQRVDAYLAGPPGQD 382
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+++LDLF+E +P W ++ ++G P+IWC LH+F GN ++G + ++ V+A E+
Sbjct: 383 DMIILDLFSESQPQWQRTRSYFGRPWIWCELHDFGGNQALHGKITNVTQNSVQA-LKESG 441
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD---AW 292
++VG G++ EG E N VVYD++ + A++ +D + ++ RY + +D AW
Sbjct: 442 SIVGYGLTPEGYEGNEVVYDILLDQAWEGSPIDTANYFRAWARNRYSAAGIIPEDVFTAW 501
Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 352
L Y+ D A V V+ + PS+ + + +Y P + + K + +
Sbjct: 502 EQLRQHAYDVQDNAIPS---VGVSVYQLFPSLKGLVN-RTGHYPPPTALQYDPKVMKNIW 557
Query: 353 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
HL+Y+++ I S L + D +D+TRQ L +++ +++ +Q
Sbjct: 558 ---HLFYNST---------IDSPGLLQIP-AFHLDFVDVTRQVLGNAFIDIYTDLVNQFQ 604
Query: 413 LNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
V Q L L +ED+D L ++ F WL SA+ Q+
Sbjct: 605 ATANATVIQDLGNSMLSFIEDLDMALNTNEHFTFKKWLNSAESWGQS 651
>gi|383122982|ref|ZP_09943669.1| hypothetical protein BSIG_0276 [Bacteroides sp. 1_1_6]
gi|251841923|gb|EES70003.1| hypothetical protein BSIG_0276 [Bacteroides sp. 1_1_6]
Length = 730
Score = 235 bits (600), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 142/473 (30%), Positives = 239/473 (50%), Gaps = 41/473 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLP+SWL+QQ VLQK+IL R + M PVLPAFSG+VP L+ ++P AKI
Sbjct: 197 MSNVDYWQSPLPKSWLEQQEVLQKQILKRERDFNMTPVLPAFSGHVPKELKAIYPDAKIH 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W S R ++ ++ D LF I + ++E+Q YG T HIY D F+E P
Sbjct: 257 EMSQWGGYDSKYR---SHFIEPMDSLFNIIQKMYLEEQTAIYG-TDHIYGIDPFNEVDSP 312
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ ++++ + IY + D++A WL W+F +D W P++++ L +VP KL++
Sbjct: 313 NWNEDFLAKVSKKIYESIYQVDAEAKWLQMTWMFYHDQKKWTQPRIRSFLEAVPDDKLIL 372
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++ +YG PY+WC L NF GN M G LD + + + G
Sbjct: 373 LDYYCDSTEIWRNTEMYYGKPYMWCYLGNFGGNSMMVGNLDDVDVKIEKLFVEGGENVYG 432
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG + NP +Y+ + + A+ + + WI ++ R G I AW+ L+ +
Sbjct: 433 LGATLEGFDVNPFMYEFVFDQAWDY-PLTTDQWIQNWAKCRGGNQDRHILKAWDSLHKKI 491
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y AT ++ A P + V ++ +Y++ LW
Sbjct: 492 YK--KYATAGQAVLMNARPML------------------VGTDSWNTYPDITYNNRDLWD 531
Query: 360 STSEVIRALELFIASGNELSASNT-YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
+E+++A + +NT YR+D+I++ RQ L + + + Y D G
Sbjct: 532 IWTEMLKASHI----------NNTGYRFDVINVGRQVLGNLFSSFRDHFTQCYSEKDIDG 581
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ----EKQVRC 467
+ + + + L+ D D LL+C F +G W++ A+ + E + E+ RC
Sbjct: 582 MKKWADQMDSLLIDTDRLLSCETNFSIGKWIDDARSFGKTEAEKEYYEENARC 634
>gi|383120707|ref|ZP_09941431.1| hypothetical protein BSIG_2292 [Bacteroides sp. 1_1_6]
gi|382984934|gb|EES68331.2| hypothetical protein BSIG_2292 [Bacteroides sp. 1_1_6]
Length = 736
Score = 235 bits (600), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 149/477 (31%), Positives = 237/477 (49%), Gaps = 59/477 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 197 MNNLEGWGGPNPDSWYRQQEALQKKIIARMRELGIEPVFPGYAGMVPRNIGE-------- 248
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L D F + E+ K YG+ + Y+ D F E
Sbjct: 249 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 305
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ G +I S M+ + +AVW+MQ W + +A+++++ G
Sbjct: 306 GNTEGVD----LAKAGTSIMSAMKKANPEAVWVMQAW--------QANPREAMVSTLDSG 353
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
L+VLDL++E P +W K F +++CML NF GN+ ++G ++ + G
Sbjct: 354 DLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMEQLVNGYYN 413
Query: 229 ARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
A N T+ GVG + EGIE NPV+++L+ E+ ++ E+ AW+ Y RYG + P
Sbjct: 414 ACAHVNGKTLRGVGATPEGIENNPVMFELLYELPWREERFAPDAWLQAYLKARYGNDLSP 473
Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
+ +AW L HTVYN + S++ G +Q+
Sbjct: 474 EVAEAWRALEHTVYNAPKNYQGEG---------TVESLLCARPGFHQD------------ 512
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
TS++ + L+YS +A L ++ ++ +N + YDL+D+ RQ+LA N L
Sbjct: 513 -RTSTWGYAKLFYSPDSTAKAARLLLSVADQYKGNNNFEYDLVDVVRQSLADKGNVLLEE 571
Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
I ++Y D + S++FLEL+ D LL+ F + WL +A+ L EE++K
Sbjct: 572 ISQSYDRKDKDSFGKQSQQFLELILAQDSLLSTRKEFSVSSWLNAARSLGTTEEEKK 628
>gi|29349767|ref|NP_813270.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
VPI-5482]
gi|29341678|gb|AAO79464.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
thetaiotaomicron VPI-5482]
Length = 744
Score = 235 bits (600), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 149/477 (31%), Positives = 237/477 (49%), Gaps = 59/477 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 205 MNNLEGWGGPNPDSWYRQQEALQKKIIARMRELGIEPVFPGYAGMVPRNIGE-------- 256
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L D F + E+ K YG+ + Y+ D F E
Sbjct: 257 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 313
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ G +I S M+ + +AVW+MQ W + +A+++++ G
Sbjct: 314 GNTEGVD----LAKAGTSIMSAMKKANPEAVWVMQAW--------QANPREAMVSTLDSG 361
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
L+VLDL++E P +W K F +++CML NF GN+ ++G ++ + G
Sbjct: 362 DLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMEQLVNGYYN 421
Query: 229 ARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
A N T+ GVG + EGIE NPV+++L+ E+ ++ E+ AW+ Y RYG + P
Sbjct: 422 ACAHVNGKTLRGVGATPEGIENNPVMFELLYELPWREERFAPDAWLQAYLKARYGNDLSP 481
Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
+ +AW L HTVYN + S++ G +Q+
Sbjct: 482 EVAEAWRALEHTVYNAPKNYQGEG---------TVESLLCARPGFHQD------------ 520
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
TS++ + L+YS +A L ++ ++ +N + YDL+D+ RQ+LA N L
Sbjct: 521 -RTSTWGYAKLFYSPDSTAKAARLLLSVADQYKGNNNFEYDLVDVVRQSLADKGNVLLEE 579
Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
I ++Y D + S++FLEL+ D LL+ F + WL +A+ L EE++K
Sbjct: 580 ISQSYDRKDKDSFGKQSQQFLELILAQDSLLSTRKEFSVSSWLNAARSLGTTEEEKK 636
>gi|29345848|ref|NP_809351.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
VPI-5482]
gi|29337741|gb|AAO75545.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
thetaiotaomicron VPI-5482]
Length = 730
Score = 235 bits (600), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 142/473 (30%), Positives = 239/473 (50%), Gaps = 41/473 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLP+SWL+QQ VLQK+IL R + M PVLPAFSG+VP L+ ++P AKI
Sbjct: 197 MSNVDYWQSPLPKSWLEQQEVLQKQILKRERDFNMTPVLPAFSGHVPKELKAIYPDAKIH 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W S R ++ ++ D LF I + ++E+Q YG T HIY D F+E P
Sbjct: 257 EMSQWGGYDSKYR---SHFIEPMDSLFNIIQKMYLEEQTAIYG-TDHIYGIDPFNEVDSP 312
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ ++++ + IY + D++A WL W+F +D W P++++ L +VP KL++
Sbjct: 313 NWNEDFLAKVSKKIYESIYQVDAEAKWLQMTWMFYHDQKKWTQPRIRSFLEAVPDDKLIL 372
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++ +YG PY+WC L NF GN M G LD + + + G
Sbjct: 373 LDYYCDSTEIWRNTEMYYGKPYMWCYLGNFGGNSMMVGNLDDVDVKIEKLFVEGGENVYG 432
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG + NP +Y+ + + A+ + + WI ++ R G I AW+ L+ +
Sbjct: 433 LGATLEGFDVNPFMYEFVFDQAWDY-PLTTDQWIQNWAKCRGGNQDRHILKAWDSLHKKI 491
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y AT ++ A P + V ++ +Y++ LW
Sbjct: 492 YK--KYATAGQAVLMNARPML------------------VGTDSWNTYPDITYNNRDLWD 531
Query: 360 STSEVIRALELFIASGNELSASNT-YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
+E+++A + +NT YR+D+I++ RQ L + + + Y D G
Sbjct: 532 IWTEMLKASHI----------NNTGYRFDVINVGRQVLGNLFSSFRDHFTQCYSEKDIDG 581
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ----EKQVRC 467
+ + + + L+ D D LL+C F +G W++ A+ + E + E+ RC
Sbjct: 582 MKKWADQMDALLIDTDRLLSCETNFSIGKWIDDARSFGKTEAEKEYYEENARC 634
>gi|374385779|ref|ZP_09643282.1| hypothetical protein HMPREF9449_01668 [Odoribacter laneus YIT
12061]
gi|373225481|gb|EHP47815.1| hypothetical protein HMPREF9449_01668 [Odoribacter laneus YIT
12061]
Length = 715
Score = 235 bits (599), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 153/477 (32%), Positives = 231/477 (48%), Gaps = 62/477 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW ++Q+ LQ +IL R+ E G+ PV P ++G + P
Sbjct: 186 MNNLEGWGGPNPESWYERQMQLQHRILNRMREYGIEPVFPGYAG--------MLPHNASE 237
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG VK DP C Y L +P F I + + K +G+ + Y D F E
Sbjct: 238 KLG--IEVK-DPGLWCGYQRPAFLYPENPAFKRIAGLYYMEMEKRFGK-AKFYGMDPFHE 293
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
N +D +++ ++ M++ + +AVW+MQ W + P+ + ++ ++
Sbjct: 294 GGNVQGID----LAAAAQSVLQAMKTANPEAVWVMQAWQAN-------PRHE-MITALQP 341
Query: 175 GKLVVLDLFAEVKPIWS-------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
G +++LDL +E +P+W K F G +++CML NF GN+ MYG +D + G
Sbjct: 342 GNVLILDLSSENRPMWGDKESVWYREKGFEGQDWLYCMLLNFGGNVGMYGRMDRVINGFY 401
Query: 228 EARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVP 286
A N ++ GVG +MEGIE NPV+Y+L+ E+ ++ + W+ Y RYG+ P
Sbjct: 402 AAVQHPNGASLRGVGKTMEGIENNPVMYELLLELPWRKIPFTKEEWLKGYVKARYGKDDP 461
Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
+Q AW +L YNC V A P A
Sbjct: 462 RLQQAWQILGKAAYNCPVVQEGTTESVFCARP------------------------AEEI 497
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
S SS+ L+Y+ E + LF+ + +N + YDL D+ RQALA N L
Sbjct: 498 SGASSWGTSELYYAPEESKKVAALFLEVSEQYKGNNNFEYDLTDIMRQALADKGNVLQKK 557
Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
I EAY+L D LSR FL+L+ D LLA F LG WLE AK + EE+++
Sbjct: 558 ITEAYRLKDETAFRNLSREFLQLILWQDTLLATRPEFRLGTWLERAKAKGETEEEKR 614
>gi|449299394|gb|EMC95408.1| glycoside hydrolase family 89 protein [Baudoinia compniacensis UAMH
10762]
Length = 801
Score = 235 bits (599), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 139/469 (29%), Positives = 239/469 (50%), Gaps = 44/469 (9%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LP SW+ Q L K+I+ R+ ELGM PVLP F G VP + +P+A
Sbjct: 226 NIQGSWGGDLPMSWISSQFTLGKQIVARMVELGMTPVLPCFPGFVPMQIGRYYPNAMYIN 285
Query: 62 LGNWFSVKSDPRWCCTY-LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W PR L+ DPL+ + ++FI +Q YG S IY D ++EN P
Sbjct: 286 GSQW---NGFPRQNTNVSFLEPFDPLYTTLQKSFISKQTAAYGNVSSIYTLDQYNENNPY 342
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVV 179
Y+ ++ A + +++ D +AVW++QGWLF S FW ++A L V +++
Sbjct: 343 SADTTYLRNISAGTIAALKAADPNAVWMLQGWLFFSSATFWTDAAIRAYLGGVNNTDMII 402
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLF+E +P W + +YG P+IWC LH++ GN+ +YG ++++ P++A + ++TMVG
Sbjct: 403 LDLFSETQPQWQRTNSYYGKPWIWCELHDYGGNMGLYGQVENVTINPIQALNNASSTMVG 462
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV---PAIQDAWNVLY 296
+G++MEG E N ++YD++ + A+ ++ + + + RY + P + AW+ +
Sbjct: 463 MGLTMEGQEGNEIMYDILLDQAWSSTPLNNSLYFHDWVTSRYHGAASLPPGLYTAWDTMR 522
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP- 355
TVYN T +T + V + ++ P++ + + HP
Sbjct: 523 QTVYNNTQISTIQ--SVTKSIWELTPNVTGLLN--------------------RTGHHPT 560
Query: 356 HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
+ Y+TS ++ A + F + + L S Y +DL D+TRQ +A L+ + + A
Sbjct: 561 TIQYNTSTLVGAWKQFYGAAAQEPTLWDSPGYLFDLTDVTRQVMANAFYPLYTSFVSASN 620
Query: 413 LNDAHGVFQ------LSRRFLELVEDMDGLLACH--DGFLLGPWLESAK 453
+ A+ + ++ + L+ +D +LA F L W+ A+
Sbjct: 621 -HSANATYSPGNATIYGQQMVSLLSALDSMLAASPIPYFHLSTWIAEAR 668
>gi|440792549|gb|ELR13759.1| peptidase, S8/S53 subfamily protein [Acanthamoeba castellanii str.
Neff]
Length = 981
Score = 234 bits (597), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 139/425 (32%), Positives = 217/425 (51%), Gaps = 51/425 (12%)
Query: 50 LQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIY 109
++ ++P+A +T+ +W ++ Y L D L+ IG I +E+G T HIY
Sbjct: 434 IKRIYPTANLTKSADWAGFPH--QYTNVYFLSPLDSLYKTIGSKVIRLVEEEFG-TDHIY 490
Query: 110 NCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALL 169
N DTF+E +PP P Y+++ A+Y GM + D A+W+MQGW F +DPFW ++KA L
Sbjct: 491 NADTFNEMSPPSADPTYLAAASRAVYEGMATQDPQALWVMQGWSFVFDPFWTKDRIKAYL 550
Query: 170 NSVPLGKLVVLDLFAEVKPIWSTSKQF----YGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
+ V +++LDL ++ P W+ + QF +G ++WCMLHN G +YG L +
Sbjct: 551 SGVDNSDMLILDLASDNSPEWNKTGQFRDSYFGKEFVWCMLHNGGGVRGLYGNLTQYSSD 610
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
P+ A + TMVGVGM+ME IEQNPVVY+LMSEM ++ E D+ W+ +Y+ RRYG +
Sbjct: 611 PLIALATPGNTMVGVGMTMEAIEQNPVVYELMSEMGWRSEAFDIVEWVQRYAERRYGLAT 670
Query: 286 PA--IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
+ + +AW +L YN + +D + +G +
Sbjct: 671 GSSPVGEAWELLREATYNQS---------------GLDAGLFGFAPALGMGHGGTSN--- 712
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNE--LSASNTYRYDLIDLTRQALAKYAN 401
+T EV AL LF+ S + + ++YD +DLTRQ LA N
Sbjct: 713 ----------------ATKEV-EALRLFLQSAQTEGYAPNGPWQYDCVDLTRQVLANTFN 755
Query: 402 ELFLNIIEAY-----QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
+++ + AY +D L+ L ++ D+D LLA + +LLG W++ A A
Sbjct: 756 DVYSQLDAAYTSYATNKSDTLPFLPLAAELLGIISDLDRLLATNPNYLLGTWIKDAVSWA 815
Query: 457 QNEEQ 461
EQ
Sbjct: 816 SIPEQ 820
>gi|212541222|ref|XP_002150766.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
18224]
gi|210068065|gb|EEA22157.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
18224]
Length = 787
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 155/470 (32%), Positives = 242/470 (51%), Gaps = 41/470 (8%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
+ N+ G WG PLP W++ Q LQKKIL R+ ELG+ PVLP+F+G VP A+ V P+AK+
Sbjct: 203 LGNIQGFWGDPLPNEWIESQFELQKKILARMVELGITPVLPSFTGFVPRAITRVLPNAKV 262
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
W S+ + C L+ D F + ++ I +Q YG SHIY D ++EN P
Sbjct: 263 VPGSRWNVFSSN--YTCDTFLEPFDDNFALLQKSTISKQQAYYGNISHIYALDQYNENNP 320
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK-LV 178
+P+Y+ ++ +++ D DAVWLMQ WLF FW + A L+ V ++
Sbjct: 321 FSSNPDYLRNISRTTSQSLKAADPDAVWLMQSWLFLDATFWNNVTICAYLSGVENNSDML 380
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
+LDLFAE +P+W + +YG P+IWC +H++ GN+ +YG + +I A S +MV
Sbjct: 381 ILDLFAESQPVWQLTDSYYGKPWIWCQVHDYGGNMGLYGQIMNITENATAALASSG-SMV 439
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVP-AIQDAWNVL 295
G G +ME E N +VYDL+ + A+ ++ + + RY + VP + DAW +L
Sbjct: 440 GFGHTMESQEGNEIVYDLLLDQAWSETPINTSQYFEDWVTVRYAGTQHVPQQLFDAWEIL 499
Query: 296 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISV--TEGKYQNYGKPVSKEAVLKSETSSYD 353
+ YN T+ A+ I+ +++PSI + EG + T +YD
Sbjct: 500 RWSAYNNTNLASSSVPKSIL---ELEPSISGLLNREGHHPT--------------TINYD 542
Query: 354 HPHLWYSTSEVIRALEL-FIASGNELSASNT--YRYDLIDLTRQALAKYANELFLNIIEA 410
P L V+ A L + A+ ELS + + YDLI LTRQ L + +I
Sbjct: 543 -PEL------VVEAWALTYEAALLELSLWDNPAFNYDLIFLTRQVLVNAFIPRYELLISF 595
Query: 411 YQLNDAHGV---FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 457
Y N+ + V R+ ++L++ +D +L ++ F L W+ A A
Sbjct: 596 YN-NENYSVPAIVSAGRQLIDLLQSLDTVLGTNECFQLAQWINKAVSRAH 644
>gi|373461342|ref|ZP_09553084.1| hypothetical protein HMPREF9944_01348 [Prevotella maculosa OT 289]
gi|371952896|gb|EHO70729.1| hypothetical protein HMPREF9944_01348 [Prevotella maculosa OT 289]
Length = 731
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 145/473 (30%), Positives = 235/473 (49%), Gaps = 52/473 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++GW GPLPQSW+D Q LQ++IL R E GM PVLPAF+G+VP + + P A+IT
Sbjct: 194 MLNINGWQGPLPQSWIDGQADLQRRILQREREFGMRPVLPAFNGSVPLDYKRLHPEARIT 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G W R TY L TDP F ++ ++F+++Q + +G T H+Y D+F+E PP
Sbjct: 254 EVGQWGGFGQAYR---TYFLSPTDPRFGKLQKSFLDEQRRMFG-TDHLYCLDSFNEVQPP 309
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
SP+ + L I++ + D +VW+ GWLF D W P ++A L+ +P + ++
Sbjct: 310 SWSPDTLCMLARHIHASLDKADPQSVWVQMGWLFYNDRKHWTPDVIRAYLSGIPKDRALL 369
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + +W ++ FYG PYI C+L NF GN + G + ++ ++A +++ M G
Sbjct: 370 LDYYIDHTELWRLTESFYGRPYIACVLGNFGGNTMLQGDVGKVS-SRLDAAIAQDGNMAG 428
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG +MEG NP Y + + A+ D + W+ + + R G + A + AW VL+ +
Sbjct: 429 VGATMEGFGVNPDFYAFVFDKAWDCGTTD-RDWLCRMADRHVGFASAAGRTAWQVLFDRI 487
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKS--ETSSYDHPHL 357
+++ V+ A P E +Y N P V K + S HL
Sbjct: 488 ---MPSYVNESGTVVCARPSF--------EARYLNTTYPAELLGVWKMLLDIDSDKREHL 536
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
YD++++ RQ L + + AY +
Sbjct: 537 ----------------------------YDVVNVGRQVLGDFFAFERDGLHRAYLSQRSD 568
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL----AQNEEQEKQVR 466
V +RR ++++D+D LLAC + F L W+E A+ A+ + E+ R
Sbjct: 569 SVDYYARRMDKMLDDLDRLLACSEEFSLRKWIEDARGFGATAAEKDYYERNAR 621
>gi|410097657|ref|ZP_11292638.1| hypothetical protein HMPREF1076_01816 [Parabacteroides goldsteinii
CL02T12C30]
gi|409223747|gb|EKN16682.1| hypothetical protein HMPREF1076_01816 [Parabacteroides goldsteinii
CL02T12C30]
Length = 740
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 143/462 (30%), Positives = 225/462 (48%), Gaps = 46/462 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGPLP+SW+D + L K+++ R ELGM P+ FSG VP + FP AKI
Sbjct: 195 MPNIESFGGPLPKSWIDSHIALGKQVVNRQLELGMTPIQQGFSGAVPRKMMEKFPEAKIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ +W+ + C LD DPLF E+G+ F+E++ K YG T +Y D F E+ PP
Sbjct: 255 KQPDWYGFEG----ICQ--LDPLDPLFTELGKTFLEEEQKLYG-TYGLYAADPFHESKPP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD+PEY++++G++I+ M++ D DA+W+MQ W F D + + VP L+VL
Sbjct: 308 VDTPEYLNAVGSSIHKLMKTFDPDALWVMQAWSFRKD----------IASVVPKHDLLVL 357
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
L + F ++ LHNF G + ++G L ++ + +VG
Sbjct: 358 SLNGAL----GGEDHFCNHDFVVGNLHNFGGRVNLHGDLPLVSSNQFMKAKQKTPNVVGS 413
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G+ ME I QNPV Y+L EM + V ++ W+N+Y+ RRYG A AW +L Y
Sbjct: 414 GLFMESIGQNPVFYELAFEMPVHQDSVKLEEWLNKYAERRYGAFSDAANKAWELLLAGPY 473
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + +I A P VD G + P ++++++E
Sbjct: 474 RAGTNGVESS-SIICARPAVDVK----KSGPNAGFNIPYDPQSLIEAEVC---------- 518
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+ +L S YR+D++D+ RQ ++ E+ EA++ D
Sbjct: 519 ----------LLQDAEQLKGSGPYRFDIVDVQRQIMSNLGQEIHKKAAEAFKKKDKEAFA 568
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
S RFLEL++D+D LL F WL A+ +E+
Sbjct: 569 LHSGRFLELLKDVDILLRTRTEFNFDQWLTDARAWGTTDEER 610
>gi|261199246|ref|XP_002626024.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis SLH14081]
gi|239594232|gb|EEQ76813.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis SLH14081]
Length = 752
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 140/473 (29%), Positives = 242/473 (51%), Gaps = 43/473 (9%)
Query: 1 MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
NL G WGG P W D Q LQKKIL R+ ELGM P+LPAF G VP A+ V P A+
Sbjct: 201 FGNLQGSWGGGNTPFKWYDAQFELQKKILARMSELGMTPILPAFPGYVPRAVTRVLPDAQ 260
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ W + +P++ T L DP + + ++FI + ++ YG +H Y D F+E
Sbjct: 261 VVNASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISKSIEAYGNVTHFYTLDQFNEMI 318
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS-YDPFWRPPQMKALLNS-VPLGK 176
P PE++ + ++S D +A W+MQGWLF + +W +++A L++
Sbjct: 319 PSSGDPEFLRKVSETTMEAIKSVDPEATWVMQGWLFYIFADYWTTERIEAYLSAGKKFRD 378
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
+++LDLFAE P+W +K F+G ++WC + F GN +YG + +I GP +A +++
Sbjct: 379 MLILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGLYGHVANITEGPAQA-MAQHPN 437
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY---GRSVP-AIQDAW 292
MVGVG + EG N +V+ L+ + + +D + + + + RRY GR+VP + +AW
Sbjct: 438 MVGVGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHDWVTRRYSSHGRTVPNELYEAW 497
Query: 293 NVLYHTVYNCTD--GATDKNRDVIVAFPDVDPSI-ISVTEGKYQNYGKPVSKEAVLKSET 349
+L + YN T+ A + A P ++ + + EG
Sbjct: 498 QLLRLSAYNNTNLVDAPLLPHALFAASPSINAKMPMLFIEG------------------- 538
Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
L Y +++++A L I L ++Y+YD++D+TRQ L+ + ++
Sbjct: 539 -------LLYDPADMLKAWGLMIKGA--LFGDSSYQYDIVDVTRQVLSDAFTLVLQDLKV 589
Query: 410 AYQLNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
Y+ VF + + L +++ +D +L+ ++ F L W+ +A+ A ++ +
Sbjct: 590 KYKGGAPASVFMPIGDKLLIILKALDAVLSMNENFWLSSWISAARASAGDDSE 642
>gi|239615395|gb|EEQ92382.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis ER-3]
Length = 829
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 140/473 (29%), Positives = 242/473 (51%), Gaps = 43/473 (9%)
Query: 1 MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
NL G WGG P W D Q LQKKIL R+ ELGM P+LPAF G VP A+ V P A+
Sbjct: 221 FGNLQGSWGGGNTPFKWYDAQFELQKKILARMSELGMTPILPAFPGYVPRAVTRVLPDAQ 280
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ W + +P++ T L DP + + ++FI + ++ YG +H Y D F+E
Sbjct: 281 VVNASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISKSIEAYGNVTHFYTLDQFNEMI 338
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS-YDPFWRPPQMKALLNS-VPLGK 176
P P+++ + ++S D +A W+MQGWLF + +W +++A L++
Sbjct: 339 PSSGDPKFLRKVSETTMEAIKSVDPEATWVMQGWLFYIFADYWTTERIEAYLSAGKKFRD 398
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
+++LDLFAE P+W +K F+G ++WC + F GN +YG + +I GP EA +++
Sbjct: 399 MLILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGLYGHVANITEGPAEA-MAQHPN 457
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG---RSVPA-IQDAW 292
MVGVG + EG N +V+ L+ + + +D + + + + RRY R+VP+ + +AW
Sbjct: 458 MVGVGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHDWVTRRYSSHERTVPSELYEAW 517
Query: 293 NVLYHTVYNCTD--GATDKNRDVIVAFPDVDPSI-ISVTEGKYQNYGKPVSKEAVLKSET 349
+L + YN T+ A + A P ++ + + EG
Sbjct: 518 QLLRLSAYNNTNLVDAPLLPHALFAASPSINAKMPMLFIEG------------------- 558
Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
L Y +++++A L I L ++Y+YD++D+TRQ L+ + ++
Sbjct: 559 -------LLYDPADMLKAWGLMIKGA--LFGDSSYQYDIVDVTRQVLSDAFTLVLQDLKV 609
Query: 410 AYQLNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
Y+ VF + + L +++ +D +L+ ++ F L W+ +A+ A +E +
Sbjct: 610 KYKGGAPASVFMPIGDKLLIILKALDAVLSMNENFWLSSWISAARASAGDESE 662
>gi|429740222|ref|ZP_19273924.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
gi|429153947|gb|EKX96708.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
Length = 730
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 142/463 (30%), Positives = 222/463 (47%), Gaps = 37/463 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP+ WL++Q LQK+IL R M PVLPAF+G+VPA L+ +FP A I
Sbjct: 198 MANIDRWNGPLPKEWLEEQRDLQKQILARERAFNMKPVLPAFAGHVPAELKRIFPDANIK 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W D ++ C + L+ +PLF +I + F+E+Q +G T HIY D F+E PP
Sbjct: 258 SLGKWGGF--DEQYLC-HFLNPGEPLFAKIQKLFLEEQTALFG-TDHIYGVDPFNEGEPP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
P Y+ + +Y + + D A W+ GW+F YD W P ++KA L VP GK+ +
Sbjct: 314 SWEPAYLKEISKNMYGTLTAVDPKAEWMQMGWMFYYDKKVWTPKRVKAFLTGVPQGKMSL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W T+ FYG PYIWC L NF GN + G + A + M+G
Sbjct: 374 LDYHCENVELWKTNDGFYGQPYIWCYLGNFGGNTTLTGNVKETGKRLDAALKAARRNMLG 433
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG ++EG++ Y+ + + + H + WI++ + R G + P+++ AW +L+ +
Sbjct: 434 VGSTLEGLDVIQFPYEYVFDKVWTHSDKGNQQWIDELADRHAGFTSPSVRKAWQILFDEI 493
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
+ G + PS V + SE + +P
Sbjct: 494 FVQVPGTY-----------SILPSRSPVLNDNH--------------SERTEIKYPA--- 525
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E + +L L + N + DLI + RQ L + AY D +
Sbjct: 526 QRLEEVWSLLLDVPQ----CERNELQVDLIAVGRQVLGNKFLAVKSEFDAAYAAKDITLL 581
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
Q + EL+ D+D L + + + W++ A+ L +N E +
Sbjct: 582 RQKAYEMEELLSDLDCLTSFNTRCTVNKWIDDARALGRNAEMK 624
>gi|453081268|gb|EMF09317.1| glycoside hydrolase family 89 protein [Mycosphaerella populorum
SO2202]
Length = 784
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 124/398 (31%), Positives = 215/398 (54%), Gaps = 27/398 (6%)
Query: 8 GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFS 67
GG LPQSW+DQQ L + I+ R+ ELGM PVLP F+G VP + ++P+A W
Sbjct: 218 GGDLPQSWIDQQFELNQLIIARMIELGMTPVLPCFTGFVPTQISRLYPNASFVNGSQWNG 277
Query: 68 VKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYI 127
++ ++ L+ DPLF + ++FI + YG S +Y D ++EN P + Y+
Sbjct: 278 FQA--QYTNVTFLEPFDPLFTTLQKSFISKLDAAYGNVSSVYTLDQYNENDPFSGNVTYL 335
Query: 128 SSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVVLDLFAEV 186
+ + +++ D +A+W +QGWLF S FW ++KA L V +++LDLF+E
Sbjct: 336 EDVASNTIKSLKAADPEAIWFIQGWLFYSAADFWDEERIKAYLGGVEDKDMLILDLFSES 395
Query: 187 KPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEG 246
+P W + ++G P+IWC LH++ GN ++G ++++ P+ A +E +TMVG+G++MEG
Sbjct: 396 QPQWQRTNSYFGKPWIWCQLHDYGGNQGLHGQVENVTMNPILALANETSTMVGIGLTMEG 455
Query: 247 IEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY--GRSVPAI-QD---AWNVLYHTVY 300
E N ++YD++ + A+ E ++ + + + RY +V + QD AW+++ T+Y
Sbjct: 456 QEGNEIIYDILLDQAWTPEPIESAGYFDDWVTSRYHCDDAVAGLPQDLYIAWDMMRQTIY 515
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
N TD D V + ++ P+ + + + + + +L S H + +
Sbjct: 516 NNTD--IDTAEAVTKSIFELQPNTTGLLDRTGHHSTRILYDPEILVSA-----WKHFYSA 568
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAK 398
+ E + EL +YR+DL+D+TRQ LA
Sbjct: 569 SQETPQLWEL-----------ESYRFDLVDITRQVLAN 595
>gi|399028591|ref|ZP_10729778.1| Alpha-N-acetylglucosaminidase (NAGLU) [Flavobacterium sp. CF136]
gi|398073682|gb|EJL64846.1| Alpha-N-acetylglucosaminidase (NAGLU) [Flavobacterium sp. CF136]
Length = 727
Score = 231 bits (590), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 149/470 (31%), Positives = 234/470 (49%), Gaps = 37/470 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ GPLPQ W++++ +QKKIL R+ LGM+PV+PAFSG VP A P +KI+
Sbjct: 209 MGNINSLEGPLPQEWINKKENVQKKILQRMRALGMHPVVPAFSGYVPKAFAEKHPGSKIS 268
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L +W S + TYLLDA DPLF EIG+ FIE K YG+ + Y D F+E TPP
Sbjct: 269 ELKSW----SGGGFESTYLLDANDPLFKEIGKRFIEIYTKLYGQ-ADFYLADAFNEITPP 323
Query: 121 VDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK 176
V E +S G I+ + DA W+MQGWLF + FW KA L+ VP +
Sbjct: 324 VSKEHKYEELSDYGKTIFETINEASPDATWVMQGWLFGDNKEFWTKEATKAFLSKVPNDR 383
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT- 235
+++ D + +W + FYG + + +HN+ G+ +YG L+ + N
Sbjct: 384 MMIQDYANDRHKVWEKQEAFYGKQWTYGYVHNYGGSNPVYGDLNFYKNELTHLLGNSNKG 443
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR--SVPAIQDAWN 293
+VG G+ EG+ N +VY+ + ++ + K V W+N+Y RYG+ S P Q AW
Sbjct: 444 NVVGYGVMPEGLNNNSIVYEYIYDLPWSQGKESVNDWLNKYLSARYGKNISTPVFQ-AWK 502
Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
+L +VY+ T D A+ + +TE K G P K+
Sbjct: 503 LLIESVYSTKYWETRWWDDRAGAYLFFKRPTLKITEFK----GNPGDKQ----------- 547
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
++ +AL++ + ++ Y YDL+D++R + ++L + + AY+L
Sbjct: 548 ---------KLKQALDILKRESKSFNKNSLYFYDLLDMSRHYYSLCIDDLLIECVTAYEL 598
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
D +L ++ + D+D +L+ L WL+SA + E K
Sbjct: 599 KDIKKADELFKKIEKQALDIDNMLSGQPLNSLNNWLKSASDYGSSPEVSK 648
>gi|358391826|gb|EHK41230.1| glycoside hydrolase family 89 protein [Trichoderma atroviride IMI
206040]
Length = 751
Score = 231 bits (590), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 139/459 (30%), Positives = 246/459 (53%), Gaps = 29/459 (6%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G W G +P +W+D Q LQ +IL R+ ELG+ P+LPAF G VP + VFP +
Sbjct: 202 FGNIQGSWNGNMPGNWVDDQFALQLQILDRMKELGITPILPAFPGFVPRNISRVFPGISL 261
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
+ W + D TY ++ DP F ++ + FI +Q + YG + + D F+EN P
Sbjct: 262 STSPLWENFAEDLS-ADTY-VNPFDPHFTQLQKLFIGKQQELYGNVTKFWTLDQFNENQP 319
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPL-GKL 177
Y+ ++ ++ ++S DA+W+MQ WLFS D FW ++A L + +
Sbjct: 320 LSSDLGYLRNVSQNTWTALKSASPDAIWVMQAWLFSADSSFWTNDAIEAFLGGITEDSDM 379
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDLFAE P W + FYG P+IWC LH++ GN+ +YG ++++ ++A ++++
Sbjct: 380 LLLDLFAESAPQWLRTNSFYGKPWIWCELHDYGGNMGLYGQIENVTINAMQA-VRNSSSL 438
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVPAIQDAWNVLY 296
VG G++MEG E N ++YDL+ + A+ + +D + + + + RYG +V ++ W +L
Sbjct: 439 VGFGLTMEGQEGNEIMYDLLLDQAWSPKPIDTETYFHDWVSARYGTENVKSLYTGWELLR 498
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
TV+N T+ + I+ ++ P+I + G+ +G + +YD
Sbjct: 499 PTVFNNTNLTVNAVPKSIL---ELTPNINGLL-GRVGRHGTTI-----------NYDPAV 543
Query: 357 LWYSTSEVIRA-LELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN- 414
+ + +E+ +A LE GN Y+YDL+D TRQ L + L+ +++ AY +
Sbjct: 544 MVDAWTELFKAGLEDVKLFGNP-----AYQYDLVDWTRQVLVNSFDGLYKDLVTAYNSSA 598
Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
+A + + L++ +D +LA ++ F L W+ +A+
Sbjct: 599 NAAEIRSRGSKLTALLKTLDAVLATNENFQLATWIAAAR 637
>gi|423259033|ref|ZP_17239956.1| hypothetical protein HMPREF1055_02233 [Bacteroides fragilis
CL07T00C01]
gi|423263996|ref|ZP_17242999.1| hypothetical protein HMPREF1056_00686 [Bacteroides fragilis
CL07T12C05]
gi|387776613|gb|EIK38713.1| hypothetical protein HMPREF1055_02233 [Bacteroides fragilis
CL07T00C01]
gi|392706262|gb|EIY99385.1| hypothetical protein HMPREF1056_00686 [Bacteroides fragilis
CL07T12C05]
Length = 718
Score = 231 bits (589), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 154/480 (32%), Positives = 239/480 (49%), Gaps = 65/480 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R++E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMHEYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++ + W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
>gi|62088640|dbj|BAD92767.1| huntingtin interacting protein-1-related [Homo sapiens]
Length = 449
Score = 231 bits (589), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 130/329 (39%), Positives = 185/329 (56%), Gaps = 45/329 (13%)
Query: 142 DSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVP 200
D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+VLDLFAE +P+++ + F G P
Sbjct: 18 DTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQP 77
Query: 201 YIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEM 260
+IWCMLHNF GN ++G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+
Sbjct: 78 FIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAEL 137
Query: 261 AFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFP 318
++ + V D+ AW+ ++ RRYG S P AW +L +VYNC+ + NR +V P
Sbjct: 138 GWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRP 197
Query: 319 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 378
L+ TS +WY+ S+V A L + S L
Sbjct: 198 S-------------------------LQMNTS------IWYNRSDVFEAWRLLLTSAPSL 226
Query: 379 SASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVED 432
+ S +RYDL+DLTRQA+ + + + AY L A GV EL+
Sbjct: 227 ATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELASLLRAGGVLA-----YELLPA 281
Query: 433 MDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
+D +LA FLLG WLE A+ A +E +
Sbjct: 282 LDEVLASDSRFLLGSWLEQARAAAVSEAE 310
>gi|423248659|ref|ZP_17229675.1| hypothetical protein HMPREF1066_00685 [Bacteroides fragilis
CL03T00C08]
gi|423253608|ref|ZP_17234539.1| hypothetical protein HMPREF1067_01183 [Bacteroides fragilis
CL03T12C07]
gi|392655237|gb|EIY48880.1| hypothetical protein HMPREF1067_01183 [Bacteroides fragilis
CL03T12C07]
gi|392657600|gb|EIY51231.1| hypothetical protein HMPREF1066_00685 [Bacteroides fragilis
CL03T00C08]
Length = 718
Score = 231 bits (589), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 154/480 (32%), Positives = 239/480 (49%), Gaps = 65/480 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R++E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMHEYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++ + W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
>gi|53711968|ref|YP_097960.1| alpha-N-acetylglucosaminidase [Bacteroides fragilis YCH46]
gi|52214833|dbj|BAD47426.1| alpha-N-acetylglucosaminidase precursor [Bacteroides fragilis
YCH46]
Length = 718
Score = 231 bits (589), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 154/480 (32%), Positives = 239/480 (49%), Gaps = 65/480 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R++E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMHEYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++ + W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
>gi|374312699|ref|YP_005059129.1| alpha-N-acetylglucosaminidase [Granulicella mallensis MP5ACTX8]
gi|358754709|gb|AEU38099.1| Alpha-N-acetylglucosaminidase [Granulicella mallensis MP5ACTX8]
Length = 754
Score = 231 bits (589), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 149/478 (31%), Positives = 231/478 (48%), Gaps = 53/478 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ + GPLPQ +++++ +LQ+++L R+ ELGM PV PAF+G VP + + P +
Sbjct: 221 MGNINHFAGPLPQHFMEEKRILQRQVLNRMRELGMKPVAPAFAGFVPQGFKRLHPEVETF 280
Query: 61 QLGNWF--SVKSDPRWCCTYLLD-ATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
L W K+ PR T++L L+ +IG+ FIE+ EYG + Y DTF+E
Sbjct: 281 TL-LWLRKEFKTIPRSTRTFILHPGQQELYRQIGKKFIEEYKAEYGEVEY-YLADTFNEL 338
Query: 118 TPPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVP 173
PV E + G ++ +Q+GD W+MQGWLF YD FW ++ALL +P
Sbjct: 339 EVPVREDHRYEDLERFGRTVFESIQAGDPKGTWVMQGWLFVYDSDFWNKESVEALLRGIP 398
Query: 174 LGKLVVLDLFAEVKPI---------WSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
+++++D ++ P W K F+G P+I M H F GN + G L +A
Sbjct: 399 NDRMLIIDYANDLAPSVQGKYLPGQWKLQKAFFGKPWINGMAHTFGGNNNIKGNLKLMAT 458
Query: 225 GPVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
P S E +VG GM EGIE N VVY+LM++ +Q E +D+ WI Y RYG
Sbjct: 459 EPSTVLASPERGNLVGWGMCPEGIENNEVVYELMTDAGWQSEAIDLATWIPAYCRSRYGD 518
Query: 284 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
PA+Q AW +L + Y+ T + A P V P SV G P +
Sbjct: 519 CPPAMQQAWELLLKSAYSSHIWMT---KQAWQAEPSVHPIAASVDAG-------PTFQ-- 566
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
RA+ELF++ +L+ S YR DLI+ QA+ +E
Sbjct: 567 ----------------------RAVELFLSCAPQLAKSELYRNDLIEFVSQAVGGRVDEA 604
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
++A + R +E + +DGL+ L W+++ + A+ +++
Sbjct: 605 LALAVQAGDAKQDEDAVAHAARAVEWMRRIDGLMNLRPDRRLETWMQATRAYAKTDDE 662
>gi|333031143|ref|ZP_08459204.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
gi|332741740|gb|EGJ72222.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
Length = 723
Score = 231 bits (589), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 145/461 (31%), Positives = 220/461 (47%), Gaps = 39/461 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP+ WLD Q LQK+IL R EL M PVLPAF+G+VP+ L+++FP A I
Sbjct: 197 MANIDSWNGPLPKEWLDHQSDLQKQILKRERELNMKPVLPAFAGHVPSELKHLFPEADIQ 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C +L + DPLF +I R F+E+Q + +G T HIY D F+E PP
Sbjct: 257 HLGKWAGFADKYR--CNFL-NPNDPLFAKIQRLFLEEQTRLFG-TDHIYGVDPFNEVDPP 312
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
PEY+ + A +Y + D A WL WLF + W P+++ALL VP +L +
Sbjct: 313 SWEPEYLKKVAADMYRTLTDVDPKAKWLQMTWLFYHGKKKWTAPRIEALLTGVPQDELYL 372
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W T+ F+G PYIWC L NF GN + G + + G
Sbjct: 373 LDYHCENVELWKTTDYFHGQPYIWCYLGNFGGNTTITGNVKESGQRLENTLINGGNNFKG 432
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + + A+ +D +W+ + R G+ A ++AW +L++ V
Sbjct: 433 IGSTLEGLDVMQFPYEYIFDKAWTFN-MDDNSWVENLADRHLGKKSEAYREAWKILFNDV 491
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y + + P+ P + KP +K V ++ + D +W
Sbjct: 492 YVQVPKS-------LGVLPNFRPEM-----------SKP-NKRTV--NDYKNKDLVKVWA 530
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
EV + Y DLI + RQ L Y + + YQ D G+
Sbjct: 531 KLLEVKEC------------TRDAYIIDLITVGRQVLGNYFLVVKNEFDQMYQFKDLPGL 578
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
+ E++ D++ L A H+ L W+ A+ L E
Sbjct: 579 ESRGAKLREILNDLENLTAFHNHCTLEKWISDARALGNTIE 619
>gi|160887167|ref|ZP_02068170.1| hypothetical protein BACOVA_05183 [Bacteroides ovatus ATCC 8483]
gi|423295093|ref|ZP_17273220.1| hypothetical protein HMPREF1070_01885 [Bacteroides ovatus
CL03T12C18]
gi|156107578|gb|EDO09323.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
8483]
gi|392673999|gb|EIY67450.1| hypothetical protein HMPREF1070_01885 [Bacteroides ovatus
CL03T12C18]
Length = 711
Score = 231 bits (588), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 148/477 (31%), Positives = 240/477 (50%), Gaps = 59/477 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVARMRELGIEPVFPGYAGMVPRNIGE-------- 247
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L D F + E+ K YG+ ++ Y+ D F E
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKANY-YSMDPFHEG 304
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ GA+I + M+ + AVW++Q W S P+ + ++ S+ G
Sbjct: 305 GNTEGVD----LAKTGASIMAAMKKANPKAVWIIQAWQAS-------PR-EEMIASLNQG 352
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
L+VLDL++E +P +W K F +++CML NF GN+ ++G ++ + G +
Sbjct: 353 DLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412
Query: 229 ARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
A N M+ GVG + EGIE NPV+++L+ E+ ++ E+ W+ Y RYGR V P
Sbjct: 413 ACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWREERFSSDEWLQTYLKARYGREVSP 472
Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
I +AW L HTVYN D + + S++ G + +
Sbjct: 473 EIMEAWRALEHTVYNA---PKDYQGEGTIE------SLLCARPGFHLD------------ 511
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
TS++ + L+Y+ +A LF + ++ +N + YDL+D+ RQ+ A N L
Sbjct: 512 -RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNNFEYDLVDIVRQSNADKGNVLLEE 570
Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
I ++Y D + +++FL+L+ D LL+ F + WL +A+ L EE+++
Sbjct: 571 ISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEFSVSSWLNAARSLGTTEEEKR 627
>gi|327356744|gb|EGE85601.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis ATCC 18188]
Length = 752
Score = 231 bits (588), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 139/473 (29%), Positives = 242/473 (51%), Gaps = 43/473 (9%)
Query: 1 MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
NL G WGG P W D Q LQKKIL R+ ELGM P+LPAF G VP A+ V P A+
Sbjct: 201 FGNLQGSWGGGNTPFKWYDAQFELQKKILARMSELGMTPILPAFPGYVPRAVTRVLPDAQ 260
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ W + +P++ T L DP + + ++FI + ++ YG +H Y D F+E
Sbjct: 261 VVNASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISKSIEAYGNVTHFYTLDQFNEMI 318
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS-YDPFWRPPQMKALLNS-VPLGK 176
P P+++ + ++S D +A W+MQGWLF + +W +++A L++
Sbjct: 319 PSSGDPKFLRKVSETTMEAIKSVDPEATWVMQGWLFYIFADYWTTERIEAYLSAGKKFRD 378
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
+++LDLFAE P+W +K F+G ++WC + F GN +YG + +I GP EA +++
Sbjct: 379 MLILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGLYGHVANITEGPAEA-MAQHPN 437
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG---RSVPA-IQDAW 292
MVGVG + EG N +V+ L+ + + +D + + + + RRY R+VP+ + +AW
Sbjct: 438 MVGVGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHDWVTRRYSSHERTVPSELYEAW 497
Query: 293 NVLYHTVYNCTD--GATDKNRDVIVAFPDVDPSI-ISVTEGKYQNYGKPVSKEAVLKSET 349
+L + YN T+ A + A P ++ + + EG
Sbjct: 498 QLLRLSAYNNTNLVDAPLLPHALFAASPSINAKMPMLFIEG------------------- 538
Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
L Y +++++A L I L ++Y+YD++D+TRQ L+ + ++
Sbjct: 539 -------LLYDPADMLKAWGLMIKGA--LFGDSSYQYDIVDVTRQVLSDAFTLVLQDLKV 589
Query: 410 AYQLNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
Y+ VF + + L +++ +D +L+ ++ F L W+ +A+ A ++ +
Sbjct: 590 KYKGGAPASVFMPIGDKLLIILKALDAVLSMNENFWLSSWISAARASAGDDSE 642
>gi|313145188|ref|ZP_07807381.1| glycoside hydrolase family 89 [Bacteroides fragilis 3_1_12]
gi|313133955|gb|EFR51315.1| glycoside hydrolase family 89 [Bacteroides fragilis 3_1_12]
Length = 718
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 154/479 (32%), Positives = 236/479 (49%), Gaps = 65/479 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTQQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ + Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ADFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLGA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +++CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A +T+ GVGM+ EGIE NPV+Y+L+ E+ ++ E+ + W+ +Y RYG
Sbjct: 404 LAKADPHAGSTLKGVGMAPEGIENNPVMYELVMELPWRAERFTKEEWLKEYVKARYGADD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTKLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y +VI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQDVIEAARLMVSVADRYKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ AY+ D S++FL L+ D LL F +G W+E A+ L E++
Sbjct: 560 QKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTRPEFRVGKWIEEARALGGTSEEK 618
>gi|424666301|ref|ZP_18103337.1| hypothetical protein HMPREF1205_02176 [Bacteroides fragilis HMW
616]
gi|404573840|gb|EKA78592.1| hypothetical protein HMPREF1205_02176 [Bacteroides fragilis HMW
616]
Length = 718
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 154/479 (32%), Positives = 236/479 (49%), Gaps = 65/479 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTQQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ + Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ADFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +++CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A +T+ GVGM+ EGIE NPV+Y+L+ E+ ++ E+ + W+ +Y RYG
Sbjct: 404 LAKADPHAGSTLKGVGMTPEGIENNPVMYELVMELPWRAERFTKEEWLKEYVKARYGADD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTKLANSIYNSPKNLTQQGTHEAVFCARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y +VI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQDVIEAARLMVSVADRYKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ AY+ D S++FL L+ D LL F +G W+E A+ L E++
Sbjct: 560 QKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTRPEFRVGKWIEEARALGGTSEEK 618
>gi|237719043|ref|ZP_04549524.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|229451821|gb|EEO57612.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
Length = 713
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 145/459 (31%), Positives = 224/459 (48%), Gaps = 42/459 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 181 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 240
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C +L + D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 241 HLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 296
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 297 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 356
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+ PYIWC L NF GN + G + A + + G
Sbjct: 357 LDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 416
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ + VD WI + R G +++DAW L++ +
Sbjct: 417 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 475
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y V P T G Y +P + K ++ Y + L
Sbjct: 476 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 510
Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
EV R L NE S+ + +R DLI + RQ L Y ++ + + D
Sbjct: 511 --LEVWRKL-------NEASSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVETKDHQ 561
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
+ + E++ D+D L A H L W++ A+++
Sbjct: 562 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMG 600
>gi|423280158|ref|ZP_17259071.1| hypothetical protein HMPREF1203_03288 [Bacteroides fragilis HMW
610]
gi|404584494|gb|EKA89159.1| hypothetical protein HMPREF1203_03288 [Bacteroides fragilis HMW
610]
Length = 718
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 154/479 (32%), Positives = 236/479 (49%), Gaps = 65/479 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTQQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ + Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ADFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLGA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +++CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A +T+ GVGM+ EGIE NPV+Y+L+ E+ ++ E+ + W+ +Y RYG
Sbjct: 404 LAKADPHAGSTLKGVGMTPEGIENNPVMYELVMELPWRAERFTKEEWLKEYVKARYGADD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTKLANSIYNSPKNLTQQGTHESVFSARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y +VI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQDVIEAARLMVSVADRYKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ AY+ D S++FL L+ D LL F +G W+E A+ L E++
Sbjct: 560 QKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTRPEFRVGKWIEEARALGGTSEEK 618
>gi|423299508|ref|ZP_17277533.1| hypothetical protein HMPREF1057_00674 [Bacteroides finegoldii
CL09T03C10]
gi|408473317|gb|EKJ91839.1| hypothetical protein HMPREF1057_00674 [Bacteroides finegoldii
CL09T03C10]
Length = 727
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 139/459 (30%), Positives = 228/459 (49%), Gaps = 45/459 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLP+ WL QQ LQK+IL R E M PVLPAF+G+VPA L+ ++P+AKI
Sbjct: 194 MSNVDYWQSPLPKDWLVQQEELQKRILAREREFNMTPVLPAFAGHVPAELKKIYPNAKIY 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I + F+E+Q K YG T HIY D F+E P
Sbjct: 254 TMSQWGGFDKQYR---SHFIDPMDSLYSVIQKRFLEEQTKIYG-TDHIYGIDPFNEVDSP 309
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY + S D +A WL W+F Y W P ++K+ L +VP KL++
Sbjct: 310 DWNEEFLSNVSRKIYESLHSVDPEAQWLQMTWMFYYAKDKWTPSRIKSFLRAVPQDKLIL 369
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++ +YG PYIWC L NF GN + G L+ + + G
Sbjct: 370 LDYYCDHTEIWKKTEGYYGQPYIWCYLGNFGGNTMLAGNLNDTYEKIHQVLAEGGQNIHG 429
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G+++E + NP++Y+ + E A++ + WI ++ R G++ PA+ AW L+ +
Sbjct: 430 LGVTLEAFDVNPMMYEFVFEQAWEGAQ-PTDEWIATWAKCRGGQTCPAVLKAWKELHEKI 488
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y + PS+ +AVL + + W
Sbjct: 489 Y-------------------IAPSLCG---------------QAVLMNARPQLEGVQGWN 514
Query: 360 STSEV-IRALELFIASGNELSASNT----YRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
+ E +L++ G+ L + + +D++++ RQ L ++ Y+
Sbjct: 515 TFPEYKYDNKDLWVIWGSLLQVGSIDKPGHAFDVVNVGRQVLGNLFSDYRAQFTACYKRK 574
Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
D G + ++R L+ D+D LLAC F +G W++ A+
Sbjct: 575 DVKGAQEWAQRMDALLLDVDRLLACSPLFSMGKWIQDAR 613
>gi|423282107|ref|ZP_17260992.1| hypothetical protein HMPREF1204_00530 [Bacteroides fragilis HMW
615]
gi|404582594|gb|EKA87288.1| hypothetical protein HMPREF1204_00530 [Bacteroides fragilis HMW
615]
Length = 718
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 154/480 (32%), Positives = 238/480 (49%), Gaps = 65/480 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++ + W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
>gi|299148671|ref|ZP_07041733.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
3_1_23]
gi|383114572|ref|ZP_09935334.1| hypothetical protein BSGG_1257 [Bacteroides sp. D2]
gi|298513432|gb|EFI37319.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
3_1_23]
gi|313693722|gb|EFS30557.1| hypothetical protein BSGG_1257 [Bacteroides sp. D2]
Length = 711
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 146/477 (30%), Positives = 238/477 (49%), Gaps = 59/477 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVARMRELGIEPVFPGYAGMVPRNIGE-------- 247
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L D F + E+ K YG+ ++ Y+ D F E
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKANY-YSMDPFHEG 304
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ GA+I + M+ + AVW++Q W+ + ++ S+ G
Sbjct: 305 GNTEGVD----LAKTGASIMAAMKKANPKAVWIIQA--------WQANPREEMIASLNQG 352
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
L+VLDL++E +P +W K F +++CML NF GN+ ++G ++ + G +
Sbjct: 353 DLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412
Query: 229 ARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
A N M+ GVG + EGIE NPV+++L+ E+ ++ E+ W+ Y RYGR V P
Sbjct: 413 ACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWREERFSSDEWLQTYLKARYGREVSP 472
Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
I +AW L HTVYN D + + S++ G + +
Sbjct: 473 EIMEAWRALEHTVYNA---PKDYQGEGTIE------SLLCARPGFHLD------------ 511
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
TS++ + L+Y+ +A LF + ++ +N + YDL+D+ RQ+ A N L
Sbjct: 512 -RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNNFEYDLVDIVRQSNADKGNVLLEE 570
Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
I ++Y D + +++FL+L+ D LL+ F + WL +A+ L EE+++
Sbjct: 571 ISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEFSVSSWLNAARSLGTTEEEKR 627
>gi|60680169|ref|YP_210313.1| alpha-N-acetylglucosaminidase [Bacteroides fragilis NCTC 9343]
gi|375357012|ref|YP_005109784.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis 638R]
gi|383116930|ref|ZP_09937677.1| hypothetical protein BSHG_0978 [Bacteroides sp. 3_2_5]
gi|60491603|emb|CAH06355.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis NCTC
9343]
gi|251947777|gb|EES88059.1| hypothetical protein BSHG_0978 [Bacteroides sp. 3_2_5]
gi|301161693|emb|CBW21233.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis 638R]
Length = 718
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 154/480 (32%), Positives = 238/480 (49%), Gaps = 65/480 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++ + W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
>gi|423269418|ref|ZP_17248390.1| hypothetical protein HMPREF1079_01472 [Bacteroides fragilis
CL05T00C42]
gi|423273021|ref|ZP_17251968.1| hypothetical protein HMPREF1080_00621 [Bacteroides fragilis
CL05T12C13]
gi|392701212|gb|EIY94372.1| hypothetical protein HMPREF1079_01472 [Bacteroides fragilis
CL05T00C42]
gi|392708585|gb|EIZ01692.1| hypothetical protein HMPREF1080_00621 [Bacteroides fragilis
CL05T12C13]
Length = 718
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 154/480 (32%), Positives = 238/480 (49%), Gaps = 65/480 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++ + W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
>gi|265765312|ref|ZP_06093587.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_16]
gi|263254696|gb|EEZ26130.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_16]
Length = 718
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 154/480 (32%), Positives = 238/480 (49%), Gaps = 65/480 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++ + W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
>gi|423241433|ref|ZP_17222546.1| hypothetical protein HMPREF1065_03169 [Bacteroides dorei
CL03T12C01]
gi|392641326|gb|EIY35103.1| hypothetical protein HMPREF1065_03169 [Bacteroides dorei
CL03T12C01]
Length = 754
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 152/478 (31%), Positives = 238/478 (49%), Gaps = 48/478 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M+NL GWGGP P SW +QQ LQKKIL R+ E GM+PVLP +SG +P+ L S K
Sbjct: 192 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 251
Query: 59 ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
+ + S +S + +L DP F +I F E+ K YG TS Y+
Sbjct: 252 EKKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTQIANLFYEETEKLYG-TSDYYSI 310
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
D F E ++ G AI M+ + AVW++QGW + +P RP MKAL
Sbjct: 311 DPFHEAKSLPAGLDF-GKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 362
Query: 172 VPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
G L++LDLF+E +P IW K + +++C+L NF GN+ ++G +D +
Sbjct: 363 -NPGDLLILDLFSECRPMWGIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGRMDQLLHN 421
Query: 226 PVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS 284
+ + + G+G++MEGIE NPV+++LM E+ ++ EK + WI QY RYG
Sbjct: 422 FYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYIRARYGTD 481
Query: 285 VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
+I+ AW +L + +YNC G + SI G+P ++
Sbjct: 482 DESIRQAWQILANGIYNCPAGNNQQG---------PHESIFC---------GRP----SL 519
Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
+ SS+ +Y + A L ++ ++ +N + YDL+D+TRQA+A A ++
Sbjct: 520 NNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIADRARIVY 579
Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ ++ D +R+FLEL+ D LL F +G W++ A+ L E++
Sbjct: 580 NYAVADFKSFDKKNYATHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGITSEEK 637
>gi|265753065|ref|ZP_06088634.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|263236251|gb|EEZ21746.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length = 750
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 153/478 (32%), Positives = 237/478 (49%), Gaps = 48/478 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M+NL GWGGP P SW +QQ LQKKIL R+ E GM+PVLP +SG +P+ L S K
Sbjct: 188 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 247
Query: 59 ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
+ + S +S + +L DP F I F E+ K YG TS Y+
Sbjct: 248 EEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 306
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
D F E + + G AI M+ + AVW++QGW + +P RP MKAL
Sbjct: 307 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 358
Query: 172 VPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
G L++LDLF+E +P IW K + +++C+L NF GN+ ++G +D +
Sbjct: 359 -NPGDLLILDLFSECRPMWGIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGRMDQLLHN 417
Query: 226 PVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS 284
+ + + G+G++MEGIE NPV+++LM E+ ++ EK + WI QY RYG
Sbjct: 418 FYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYIRARYGTD 477
Query: 285 VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
+IQ AW +L + +YNC G + SI G+P ++
Sbjct: 478 DESIQQAWQILTNGIYNCPAGNNQQG---------PHESIFC---------GRP----SL 515
Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
+ SS+ +Y + A L ++ ++ +N + YDL+D+TRQA+A A ++
Sbjct: 516 NNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIADRARIVY 575
Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ ++ D +R+FLEL+ D LL F +G W++ A+ L E++
Sbjct: 576 NYAVADFKSFDKKNYNTHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGITPEEK 633
>gi|212695333|ref|ZP_03303461.1| hypothetical protein BACDOR_04880 [Bacteroides dorei DSM 17855]
gi|212662112|gb|EEB22686.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides dorei DSM 17855]
Length = 754
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 153/478 (32%), Positives = 236/478 (49%), Gaps = 48/478 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M+NL GWGGP P SW +QQ LQKKIL R+ E GM+PVLP +SG +P+ L S K
Sbjct: 192 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 251
Query: 59 ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
+ S +S + +L DP F I F E+ K YG TS Y+
Sbjct: 252 EEKTAGDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 310
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
D F E + + G AI M+ + AVW++QGW + +P RP MKAL
Sbjct: 311 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 362
Query: 172 VPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
G L++LDLF+E +P IW K + +++C+L NF GN+ ++G +D +
Sbjct: 363 -NPGDLLILDLFSECRPMWGIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGRMDQLLHN 421
Query: 226 PVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS 284
+ + + G+G++MEGIE NPV+++LM E+ ++ EK + WI QY RYG
Sbjct: 422 FYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYIRARYGTD 481
Query: 285 VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
+IQ AW +L + +YNC G + SI G+P ++
Sbjct: 482 DESIQQAWQILTNGIYNCPAGNNQQG---------PHESIFC---------GRP----SL 519
Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
+ SS+ +Y + A L ++ ++ +N + YDL+D+TRQA+A A ++
Sbjct: 520 NNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIADRARIVY 579
Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ ++ D +R+FLEL+ D LL F +G W++ A+ L E++
Sbjct: 580 NYAVADFKSFDKKNYNTHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGITPEEK 637
>gi|325299497|ref|YP_004259414.1| alpha-N-acetylglucosaminidase [Bacteroides salanitronis DSM 18170]
gi|324319050|gb|ADY36941.1| Alpha-N-acetylglucosaminidase [Bacteroides salanitronis DSM 18170]
Length = 723
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 151/474 (31%), Positives = 236/474 (49%), Gaps = 57/474 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
M+NL GWGGP P SW QQ LQKKIL R+ E G+ PVLP +SG VP A Q + +
Sbjct: 190 MNNLEGWGGPNPDSWYTQQEALQKKILKRMREYGIEPVLPGYSGMVPHDAHQKLGLNVTE 249
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
+L N F+ + L TD F EI + E+Q K +G+ ++ Y+ D F E N
Sbjct: 250 PELWNGFTRPA--------FLMPTDKRFAEIAALYYEEQEKLFGKANY-YSMDPFHELEN 300
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
VD + G A+ M+ + AVW++QGW + +P RP MK L N G L
Sbjct: 301 AGEVD----FDAAGKAVMDAMKQVNPKAVWVVQGW--TENP--RPEMMKNLKN----GDL 348
Query: 178 VVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
++LDLF+E +P IW K + +++CML NF N+ ++G +D + +
Sbjct: 349 LILDLFSECRPMWGIPSIWKREKGYEQHDWLFCMLENFGANVGLHGRMDQLLNNFYLTKN 408
Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
+ + G+G++MEG E NPV+++LM E+ ++ EK+ ++W+ +Y RYG I+
Sbjct: 409 NPLAAHLKGIGLTMEGSENNPVMFELMCELPWRPEKITKESWLKEYLAARYGAKDEKIEQ 468
Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
AW +L +YNC G + + + P ++ +S + K +NY P S EA
Sbjct: 469 AWMILADGIYNCPFGNNQQGPHESIFCGRPSMNNFQVS-SWSKMENYYDPTSTEA----- 522
Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
A L + + ++ +N + YDL+D+ RQALA ++ I
Sbjct: 523 ------------------AARLMLEAADKFRGNNNFEYDLVDIVRQALADRGRIVYNRAI 564
Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
++ D + S+ FL L+ D LLA F +G W+ A+ L E++
Sbjct: 565 ADFKSFDKRSYARHSKEFLNLLLAQDRLLATRSEFRVGRWINQARSLGNTPEEK 618
>gi|393784337|ref|ZP_10372502.1| hypothetical protein HMPREF1071_03370 [Bacteroides salyersiae
CL02T12C01]
gi|392666113|gb|EIY59630.1| hypothetical protein HMPREF1071_03370 [Bacteroides salyersiae
CL02T12C01]
Length = 728
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 140/473 (29%), Positives = 228/473 (48%), Gaps = 50/473 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GGP+ ++ +Q LQ+K+L R+ ELGM PV F G VP L+ +P A+I
Sbjct: 193 MGNLEGFGGPVSPEFIARQTDLQQKMLKRMRELGMKPVFQGFYGMVPNVLKKKYPDARIK 252
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ G W + + LD TDPLF + + E+Q K +G + + D F E
Sbjct: 253 EQGTWQTYQRPA------FLDPTDPLFDRVAAIYYEEQKKLFG-DAEFFGGDPFHEGG-- 303
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
++ I M+ + AVW++QGW ++P +K L++ + G+ ++L
Sbjct: 304 TSEGIHVKLAAQKILQAMRKVNPKAVWVLQGW--QHNP------VKDLMDGLNPGETIIL 355
Query: 181 DLFAEVKPIWS--TSKQFYGVP------YIWCMLHNFAGNIEMYGILDSIAFGPVEARTS 232
DL A +P W T+ F+ +IWC L NF G ++G + S A G V A+
Sbjct: 356 DLMACERPQWGGVTTSMFHKPEGHQDHRWIWCALPNFGGKTGLHGKMSSYASGAVFAKEH 415
Query: 233 E-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDA 291
+ G+G + EGI PVVYD++ +MA++ + + + W+ Y+ RYG A
Sbjct: 416 PMGRNICGIGTAPEGIGTVPVVYDMVYDMAWRTDSIQIPQWLTNYTYYRYGMEDTNCDKA 475
Query: 292 WNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSS 351
W +L TVY C + I A P A S+
Sbjct: 476 WKILSETVYECHNELGGPVESYICARP------------------------ADTIDHVST 511
Query: 352 YDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 411
+ + ++Y +++ A E S N + +TY YDL+D+TRQ L+ YA L ++EA+
Sbjct: 512 WGNARIFYEPVKMVEAWEFLYQSRNRFNHCDTYEYDLVDVTRQVLSDYAKYLHKEMVEAF 571
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
+ +G + S FL++++D D LL+ F+LG WL A+ E++++
Sbjct: 572 HQKNENGFMKYSTEFLDVIKDEDRLLSTRKEFMLGTWLTEAENAGCTPEEKRR 624
>gi|345513909|ref|ZP_08793424.1| alpha-N-acetylglucosaminidase [Bacteroides dorei 5_1_36/D4]
gi|345456132|gb|EEO45798.2| alpha-N-acetylglucosaminidase [Bacteroides dorei 5_1_36/D4]
Length = 754
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 152/478 (31%), Positives = 237/478 (49%), Gaps = 48/478 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M+NL GWGGP P SW +QQ LQKKIL R+ E GM+PVLP +SG +P+ L S K
Sbjct: 192 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 251
Query: 59 ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
+ + S +S + +L DP F I F E+ K YG TS Y+
Sbjct: 252 EEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 310
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
D F E + + G AI M+ + AVW++QGW + +P RP MKAL
Sbjct: 311 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 362
Query: 172 VPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
G L++LDLF+E +P IW K + +++C+L NF GN+ ++G +D +
Sbjct: 363 -NPGDLLILDLFSECRPMWGIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGRMDQLLHN 421
Query: 226 PVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS 284
+ + + G+G++MEGIE NPV+++LM E+ ++ EK + WI QY RYG
Sbjct: 422 FYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYIRARYGTD 481
Query: 285 VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
+I+ AW +L + +YNC G + SI G+P ++
Sbjct: 482 DESIRQAWQILANGIYNCPAGNNQQG---------PHESIFC---------GRP----SL 519
Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
+ SS+ +Y + A L ++ ++ +N + YDL+D+TRQA+A A ++
Sbjct: 520 NNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIADRARIVY 579
Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ ++ D +R+FLEL+ D LL F +G W++ A+ L E++
Sbjct: 580 NYAVADFKSFDKKNYATHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGITSEEK 637
>gi|423230938|ref|ZP_17217342.1| hypothetical protein HMPREF1063_03162 [Bacteroides dorei
CL02T00C15]
gi|423244649|ref|ZP_17225724.1| hypothetical protein HMPREF1064_01930 [Bacteroides dorei
CL02T12C06]
gi|392630058|gb|EIY24060.1| hypothetical protein HMPREF1063_03162 [Bacteroides dorei
CL02T00C15]
gi|392641498|gb|EIY35274.1| hypothetical protein HMPREF1064_01930 [Bacteroides dorei
CL02T12C06]
Length = 754
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 152/478 (31%), Positives = 237/478 (49%), Gaps = 48/478 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M+NL GWGGP P SW +QQ LQKKIL R+ E GM+PVLP +SG +P+ L S K
Sbjct: 192 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 251
Query: 59 ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
+ + S +S + +L DP F I F E+ K YG TS Y+
Sbjct: 252 EEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 310
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
D F E + + G AI M+ + AVW++QGW + +P RP MKAL
Sbjct: 311 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 362
Query: 172 VPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
G L++LDLF+E +P IW K + +++C+L NF GN+ ++G +D +
Sbjct: 363 -NPGDLLILDLFSECRPMWGIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGRMDQLLHN 421
Query: 226 PVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS 284
+ + + G+G++MEGIE NPV+++LM E+ ++ EK + WI QY RYG
Sbjct: 422 FYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYIRARYGTD 481
Query: 285 VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
+I+ AW +L + +YNC G + SI G+P ++
Sbjct: 482 DESIRQAWQILANGIYNCPAGNNQQG---------PHESIFC---------GRP----SL 519
Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
+ SS+ +Y + A L ++ ++ +N + YDL+D+TRQA+A A ++
Sbjct: 520 NNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIADRARIVY 579
Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ ++ D +R+FLEL+ D LL F +G W++ A+ L E++
Sbjct: 580 NYAVADFKSFDKKNYATHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGITSEEK 637
>gi|237711645|ref|ZP_04542126.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 9_1_42FAA]
gi|229454340|gb|EEO60061.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 9_1_42FAA]
Length = 732
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 152/478 (31%), Positives = 237/478 (49%), Gaps = 48/478 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M+NL GWGGP P SW +QQ LQKKIL R+ E GM+PVLP +SG +P+ L S K
Sbjct: 170 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 229
Query: 59 ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
+ + S +S + +L DP F I F E+ K YG TS Y+
Sbjct: 230 EEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 288
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
D F E + + G AI M+ + AVW++QGW + +P RP MKAL
Sbjct: 289 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 340
Query: 172 VPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
G L++LDLF+E +P IW K + +++C+L NF GN+ ++G +D +
Sbjct: 341 -NPGDLLILDLFSECRPMWGIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGRMDQLLHN 399
Query: 226 PVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS 284
+ + + G+G++MEGIE NPV+++LM E+ ++ EK + WI QY RYG
Sbjct: 400 FYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYIRARYGTD 459
Query: 285 VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
+I+ AW +L + +YNC G + SI G+P ++
Sbjct: 460 DESIRQAWQILANGIYNCPAGNNQQG---------PHESIFC---------GRP----SL 497
Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
+ SS+ +Y + A L ++ ++ +N + YDL+D+TRQA+A A ++
Sbjct: 498 NNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIADRARIVY 557
Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ ++ D +R+FLEL+ D LL F +G W++ A+ L E++
Sbjct: 558 NYAVADFKSFDKKNYATHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGITSEEK 615
>gi|336412606|ref|ZP_08592959.1| hypothetical protein HMPREF1017_00067 [Bacteroides ovatus
3_8_47FAA]
gi|335942652|gb|EGN04494.1| hypothetical protein HMPREF1017_00067 [Bacteroides ovatus
3_8_47FAA]
Length = 727
Score = 228 bits (582), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 144/459 (31%), Positives = 224/459 (48%), Gaps = 42/459 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C +L + D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+ PYIWC L NF GN + G + A + + G
Sbjct: 371 LDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ + VD WI + R G +++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 489
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y V P T G Y +P + K ++ Y + L
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524
Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDHQ 575
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
+ + + E++ D+D L A H L W++ A+++
Sbjct: 576 ALKACAEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMG 614
>gi|322702923|gb|EFY94542.1| alpha-N-acetylglucosaminidase, putative [Metarhizium anisopliae
ARSEF 23]
Length = 589
Score = 228 bits (582), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 150/484 (30%), Positives = 242/484 (50%), Gaps = 53/484 (10%)
Query: 1 MSNLHG-WGG--PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSA 57
N G WGG L W+D Q LQKKI+ R+ ELG+ PVLPAF G VP A V P A
Sbjct: 21 FGNTQGSWGGVGNLSSGWIDAQFELQKKIVARMVELGITPVLPAFPGFVPPAFSRVQPDA 80
Query: 58 KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
T+ W + D T+ L D + + +AFI +Q++ +G ++IY D F+E
Sbjct: 81 NTTKAPRWTGLP-DTNTRDTF-LSPLDTSYARLQQAFISKQIEAFGNVTNIYTLDQFNEM 138
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG-- 175
P + P Y+S + Y + + + AVWL+QGWLF W ++ A L P G
Sbjct: 139 PPTSNEPSYLSQVSTYTYKALTAANPAAVWLLQGWLFLNSGLWTEERVTAYLGG-PEGHN 197
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSEN 234
++VLDL++E +P W +K ++G P+IWC LH+F GN+ MYG + I ++A RTS
Sbjct: 198 SMLVLDLYSESRPQWQRTKGYFGRPWIWCQLHDFGGNMGMYGQISDITVQSMDALRTSP- 256
Query: 235 TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPAIQDAW 292
++ G GM+ EG E N VVY ++ + A+ +D + Y VRRY ++ AW
Sbjct: 257 -SLSGFGMTPEGYEGNEVVYQMLFDQAWTTTPIDTSGYFYGYVVRRYAGVSQTNSLFQAW 315
Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 352
++L +Y+ +K+R V V G YQN + L + T ++
Sbjct: 316 DILRQNIYD------NKDRQVPC-----------VGVGIYQN----APSLSGLVNRTGNW 354
Query: 353 DHP-HLWYSTSEVIRALELFIASGNELSA---SNTYRYDLIDLTRQALAKYANELFLNII 408
P ++Y + + +A L I + NE+ T++ D++D+TRQ ++ N ++ + +
Sbjct: 355 PPPTKVYYDPATLKKAHSLLIQAANEIPQLWDIPTFQLDVVDVTRQVMSNAFNTMYTDYV 414
Query: 409 EAYQLNDAHGVFQLSRR---------------FLELVEDMDGLLACHDGFLLGPWLESAK 453
+ + + +S R L+ + D+D +LA + F L WL++A+
Sbjct: 415 QTFNSQLSRQKSHISNRGGLQRRDDFATKGKQLLDFLTDLDRVLATNQHFRLDSWLDAAQ 474
Query: 454 QLAQ 457
A+
Sbjct: 475 YWAK 478
>gi|320106778|ref|YP_004182368.1| alpha-N-acetylglucosaminidase [Terriglobus saanensis SP1PR4]
gi|319925299|gb|ADV82374.1| Alpha-N-acetylglucosaminidase [Terriglobus saanensis SP1PR4]
Length = 754
Score = 228 bits (582), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 154/478 (32%), Positives = 231/478 (48%), Gaps = 53/478 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ GPLP+ +++Q+ VLQ+KIL R+ LGM PV PAFSG VP + + P A+
Sbjct: 220 MGNVNNIDGPLPEHFIEQKRVLQRKILDRMRSLGMRPVAPAFSGFVPQGFKRLHPKAETF 279
Query: 61 QLGNWF--SVKSDPRWCCTYLLDATDP-LFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
L W K+ PR T++L + L+ IG+ FIE+ EYG + Y DTF+E
Sbjct: 280 TL-LWLPEEFKTIPRSTRTFILHPGEQDLYRLIGKKFIEEYKAEYGEVQY-YLADTFNEL 337
Query: 118 TPPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
PV E + G +Y G+ +GD + W+MQGWLF YD FW + ALL +P
Sbjct: 338 AVPVREEHRFEDLERFGRTVYEGILAGDPNGTWVMQGWLFVYDVAFWNSESVAALLRGIP 397
Query: 174 LGKLVVLDLFAEVKPI---------WSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
+++++D ++ P W T K F+G +I M H F GN + G L +A
Sbjct: 398 NDRMLIIDYANDLAPAVKGKYAPGQWKTQKAFFGKQWINGMAHTFGGNNNVKGNLKLMAS 457
Query: 225 GPVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
P TS E +VG GM EGIE N VVY+LM++ +Q E +D+K WI Y RYG
Sbjct: 458 EPASVLTSPERGNLVGWGMCPEGIETNEVVYELMTDAGWQREAIDLKQWIPAYCRSRYGA 517
Query: 284 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P + +AW +L + Y+ T + P + P+ SV G P +
Sbjct: 518 CPPVMLEAWTLLMQSAYSAHIWMTHQAWQTE---PSLAPAAASVDAG-------PTFR-- 565
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
RA+ LF++ EL YR DLI+L QA ++
Sbjct: 566 ----------------------RAVALFLSCAPELGQKELYRNDLIELVVQAAGGSVDQT 603
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
F ++A Q + + + L + MD LL L W+++A+ A+++++
Sbjct: 604 FSLAVQAGQSHQNEVATEYAAHALGWMGRMDALLNLRPDRRLETWMQAARSYAKSDDE 661
>gi|380694112|ref|ZP_09858971.1| alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
Length = 736
Score = 228 bits (581), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 147/484 (30%), Positives = 236/484 (48%), Gaps = 73/484 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 199 MNNLEGWGGPNPDSWYRQQEALQKKIIARMRELGIEPVFPGYAGMVPRNIGE-------- 250
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L D F + E+ K YG+ + Y+ D F E
Sbjct: 251 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 307
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ G +I M+ + +AVW+MQ W+ +A++N++ G
Sbjct: 308 GNTEGVD----LAKAGTSIMGAMKKANPEAVWVMQA--------WQANPREAMVNTLDSG 355
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
L+VLDL++E P +W K F +++CML NF GN+ ++G ++ + G
Sbjct: 356 DLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMEQLVNGYYN 415
Query: 229 ARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
A N T+ GVG + EGIE NP++++L+ E+ ++ E+ W+ Y RYG + P
Sbjct: 416 ACAHINGKTLRGVGATPEGIENNPMMFELLYELPWREERFSPDIWLQGYLKARYGDDLSP 475
Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
+ +AW L HTVYN +NY + E++L
Sbjct: 476 EVTEAWRALEHTVYNAP-----------------------------KNYQGEGTVESLLC 506
Query: 347 S-------ETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 399
+ TS++ + L+YS +A +L ++ + +N + YDL+D+ RQ+LA
Sbjct: 507 ARPGFHLDRTSTWGYAKLFYSPDSTAKAAQLLLSVADRYKGNNNFEYDLVDIVRQSLADK 566
Query: 400 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
AN L I ++Y D + +++FL L+ D LL+ F + WL +A+ L E
Sbjct: 567 ANVLLEEISQSYDRKDKDSFRKQTQQFLGLILSQDSLLSTRKEFSVSSWLSAARSLGTTE 626
Query: 460 EQEK 463
E++K
Sbjct: 627 EEKK 630
>gi|336408181|ref|ZP_08588675.1| hypothetical protein HMPREF1018_00690 [Bacteroides sp. 2_1_56FAA]
gi|335939481|gb|EGN01355.1| hypothetical protein HMPREF1018_00690 [Bacteroides sp. 2_1_56FAA]
Length = 718
Score = 228 bits (581), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 153/480 (31%), Positives = 237/480 (49%), Gaps = 65/480 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R+ E G+ P+LP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPMLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ +
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIEDLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++ + W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
>gi|295085509|emb|CBK67032.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
XB1A]
Length = 716
Score = 228 bits (581), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 144/459 (31%), Positives = 223/459 (48%), Gaps = 42/459 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 184 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 243
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C +L + D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 244 HLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 299
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 300 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 359
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+ PYIWC L NF GN + G + A + + G
Sbjct: 360 LDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 419
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ + VD WI + R G +++DAW L++ +
Sbjct: 420 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 478
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y V P T G Y +P + K ++ Y + L
Sbjct: 479 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 513
Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 514 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDHQ 564
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
+ + E++ D+D L A H L W++ A+++
Sbjct: 565 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMG 603
>gi|237721435|ref|ZP_04551916.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|293370838|ref|ZP_06617383.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
gi|229449231|gb|EEO55022.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|292634054|gb|EFF52598.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
Length = 711
Score = 228 bits (581), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 145/477 (30%), Positives = 239/477 (50%), Gaps = 59/477 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVARMRELGIEPVFPGYAGMVPRNIGE-------- 247
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L D F + E+ K YG+ ++ Y+ D F E
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKANY-YSMDPFHEG 304
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ GA+I + M+ + +AVW++Q W+ + ++ S+ G
Sbjct: 305 GNTEGVD----LAKTGASIMAAMKKANPEAVWIIQA--------WQANPREEMIASLNQG 352
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
L+VLDL++E +P +W K F +++CML NF GN+ ++G ++ + G +
Sbjct: 353 DLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412
Query: 229 ARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
A N M+ GVG + EGIE NPV+++L+ E+ ++ E+ W+ Y RYGR V P
Sbjct: 413 ACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWREERFSSDEWLQTYLKARYGREVSP 472
Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
I +AW L +TVYN D + + S++ G + +
Sbjct: 473 EIMEAWRALEYTVYNA---PKDYQGEGTI------ESLLCARPGFHLD------------ 511
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
TS++ + L+Y+ +A LF + ++ +N + YDL+D+ RQ+ A N L
Sbjct: 512 -RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNNFEYDLVDIVRQSNADKGNVLLEE 570
Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
I ++Y D + +++FL+L+ D LL+ F + WL +A+ L EE+++
Sbjct: 571 ISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEFSVSSWLNAARSLGTTEEEKR 627
>gi|298480128|ref|ZP_06998327.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
gi|336404356|ref|ZP_08585054.1| hypothetical protein HMPREF0127_02367 [Bacteroides sp. 1_1_30]
gi|298273937|gb|EFI15499.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
gi|335943684|gb|EGN05523.1| hypothetical protein HMPREF0127_02367 [Bacteroides sp. 1_1_30]
Length = 727
Score = 228 bits (581), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 144/459 (31%), Positives = 223/459 (48%), Gaps = 42/459 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C +L + D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+ PYIWC L NF GN + G + A + + G
Sbjct: 371 LDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ + VD WI + R G +++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 489
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y V P T G Y +P + K ++ Y + L
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524
Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDHQ 575
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
+ + E++ D+D L A H L W++ A+++
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMG 614
>gi|340347658|ref|ZP_08670763.1| alpha-N-acetylglucosaminidase [Prevotella dentalis DSM 3688]
gi|433652542|ref|YP_007296396.1| Alpha-N-acetylglucosaminidase (NAGLU) [Prevotella dentalis DSM
3688]
gi|339608852|gb|EGQ13735.1| alpha-N-acetylglucosaminidase [Prevotella dentalis DSM 3688]
gi|433303075|gb|AGB28890.1| Alpha-N-acetylglucosaminidase (NAGLU) [Prevotella dentalis DSM
3688]
Length = 781
Score = 228 bits (581), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 155/478 (32%), Positives = 227/478 (47%), Gaps = 64/478 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGPLP SW QQ LQK+IL R ELGM PVLP + G +P + +T
Sbjct: 207 MNNLEGWGGPLPDSWYRQQEALQKRILQRERELGMEPVLPGYCGMMPHDAKQKL-GLDVT 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W L ATDP F EI + +Q + YG+ SH Y+ D F E +
Sbjct: 266 PGGTWNGYVRPAN------LSATDPRFDEIADLYYREQTRLYGK-SHYYSMDPFHETSDD 318
Query: 121 VDSPEYI--SSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
V YI + G + + M+ + A W++QGW + P A+ + +P G L
Sbjct: 319 V----YIDYAQAGRKLMAAMKRENPKANWVIQGWTENPRP--------AMTDGLPAGSLT 366
Query: 179 VLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDS------IAFGP 226
VLDLF+E +P IW ++ + +++CML NF GN+ ++G +D +A P
Sbjct: 367 VLDLFSECRPMFGAPSIWKRAEGYGQHDWLFCMLENFGGNVGLHGRMDQLIGNFRLATSP 426
Query: 227 VEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA--------WINQYSV 278
+ G+G +MEG E NP++++LMSE+ ++ ++V A W+ Y
Sbjct: 427 QSPLQQARRHLRGIGFTMEGSENNPIMFELMSELPWRTDEVAQAADARTFRTEWVRGYVK 486
Query: 279 RRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKP 338
RYG P Q AW +L T+YNC G + SI G+P
Sbjct: 487 ARYGTDDPHAQQAWQLLAETIYNCPAGNNQQG---------PHESIFD---------GRP 528
Query: 339 VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAK 398
+KS + ++ Y S + A L A+ + L +N Y YDL+D+ RQA+
Sbjct: 529 SLNNFQVKSWSKMRNY----YEPSATLEAARLMAAAADRLKGNNNYEYDLVDIVRQAIDD 584
Query: 399 YANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
A +++L+ I Y D + S RFL L+ D LL F LG W E+A+ L
Sbjct: 585 QARQVYLHAIADYNGFDRRAFSRDSARFLGLLLMQDRLLGTRREFRLGRWTEAARSLG 642
>gi|393788556|ref|ZP_10376683.1| hypothetical protein HMPREF1068_02963 [Bacteroides nordii
CL02T12C05]
gi|392654236|gb|EIY47884.1| hypothetical protein HMPREF1068_02963 [Bacteroides nordii
CL02T12C05]
Length = 732
Score = 227 bits (579), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 148/475 (31%), Positives = 231/475 (48%), Gaps = 54/475 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GGP+ ++ +Q LQ+K+L R+ ELGM PV F G VP AL+ FP A+I
Sbjct: 196 MGNLEGFGGPVTPEFIARQTDLQQKMLKRMRELGMKPVFQGFYGMVPNALKEKFPDARIK 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + + LD TDPLF ++ + E+Q +G + + D F E
Sbjct: 256 DQGIWGTYQRPA------FLDPTDPLFDKLAAIYYEEQKNLFGE-AQFFGGDPFHEG--- 305
Query: 121 VDSPEYISSLGAA--IYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+ E I+ AA I M+ + AVW++QGW ++P +K L+ V G+ +
Sbjct: 306 -GTSEGINVKLAAQKILQAMRKVNPQAVWVLQGW--QHNP------VKELMEGVKPGETI 356
Query: 179 VLDLFAEVKPIWSTSKQ--------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
+LDL A +P W K + +IWC L NF G ++G + S A GPV A+
Sbjct: 357 ILDLMACERPQWGGVKTSMFHKPEGHWNHQWIWCALPNFGGKTGLHGKMSSYASGPVFAK 416
Query: 231 TSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQ 289
+ G+G + EGI PVVYD++ +MA++ + + + W++ Y+ RYG
Sbjct: 417 HHPMGKNICGIGTAPEGIGTIPVVYDMVYDMAWRTDSIHIPQWLDNYTYYRYGTEDNNCN 476
Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 349
AW +L T+Y C + I A P S T +G V
Sbjct: 477 RAWKLLSETIYECHNELGGPVESYICARP-------SDTIQHVSTWGNAV---------- 519
Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
++Y +V++A +L S + S+TY YDL D+TRQ L+ YA L ++
Sbjct: 520 -------MFYDPMKVVKAWDLLYQSRKRFNHSDTYEYDLTDVTRQVLSDYAKYLHERMVL 572
Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
A+Q D + S +FL +++D D LL+ F+LG WL A++ E++++
Sbjct: 573 AFQKKDKERFMEYSGKFLNIIKDEDRLLSTRKEFMLGTWLAEAEKAGGTPEEKRR 627
>gi|423293377|ref|ZP_17271504.1| hypothetical protein HMPREF1070_00169 [Bacteroides ovatus
CL03T12C18]
gi|392678320|gb|EIY71728.1| hypothetical protein HMPREF1070_00169 [Bacteroides ovatus
CL03T12C18]
Length = 727
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 142/461 (30%), Positives = 223/461 (48%), Gaps = 42/461 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C + L+ D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+ PYIWC L NF GN + G + A + + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ + VD WI + R G ++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQPVRDAWKRLFNDI 489
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y + + T G Y +P + K ++ Y + L
Sbjct: 490 Y----------------------AQVPRTLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524
Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQ 575
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
+ + E++ D+D L A H L W++ A+++ +
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDS 616
>gi|262406058|ref|ZP_06082608.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|294806855|ref|ZP_06765680.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
gi|345510563|ref|ZP_08790130.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
gi|262356933|gb|EEZ06023.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|294445884|gb|EFG14526.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
gi|345454460|gb|EEO49066.2| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
Length = 727
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 144/459 (31%), Positives = 222/459 (48%), Gaps = 42/459 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C +L + D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+ PYIWC L NF GN + G + A + + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ + VD WI + R G ++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQPVRDAWKRLFNDI 489
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y V P T G Y +P + K ++ Y + L
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524
Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDHQ 575
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
+ + E++ D+D L A H L W++ A+++
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMG 614
>gi|294777713|ref|ZP_06743164.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
gi|294448781|gb|EFG17330.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
Length = 752
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 153/493 (31%), Positives = 235/493 (47%), Gaps = 78/493 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA------------ 48
M+NL GWGGP P SW QQ LQKKIL R+ E GM+PVLP +SG +P+
Sbjct: 190 MNNLEGWGGPNPDSWYKQQEDLQKKILKRMKEWGMHPVLPGYSGMIPSKLDLGKRIDGGK 249
Query: 49 ---ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRT 105
L N + + L W +L DP F +I F E+ K YG T
Sbjct: 250 EEKTLSNTSSESAQSTLNKWNGFDRPG------ILLPDDPKFTQIASLFYEETEKLYG-T 302
Query: 106 SHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQM 165
S Y+ D F E + + G AI M+ + AVW++QGW + +P RP M
Sbjct: 303 SDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKANPKAVWVVQGW--TENP--RPEMM 357
Query: 166 KALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGN 212
KAL G L++LDLF+E +P+W G+P IW C+L NF GN
Sbjct: 358 KAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGN 406
Query: 213 IEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 271
+ ++G +D + + + + G+G++MEGIE NPV+++LM E+ ++ EK +
Sbjct: 407 VGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEE 466
Query: 272 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTE 329
WI QY RYG +I AW +L + +YNC G + + + P ++ + + +
Sbjct: 467 WIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQQGPHESIFCGRPSLN-NFQASSW 525
Query: 330 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 389
K NY P +T+E R L ++ ++ +N + YDL+
Sbjct: 526 SKMCNYYDPT--------------------TTAEAAR---LMVSVAHKYRGNNNFEYDLV 562
Query: 390 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 449
D+TRQA+A A ++ + ++ D +R+FLEL+ D LL F +G W+
Sbjct: 563 DITRQAIADRARIVYNYAVADFKSFDKKSYATHTRQFLELLIMQDKLLGTRKEFKVGNWI 622
Query: 450 ESAKQLAQNEEQE 462
+ A+ L E++
Sbjct: 623 QQARNLGSTSEEK 635
>gi|345517325|ref|ZP_08796802.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 4_3_47FAA]
gi|345457718|gb|EET14396.2| alpha-N-acetylglucosaminidase [Bacteroides sp. 4_3_47FAA]
Length = 754
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 153/493 (31%), Positives = 235/493 (47%), Gaps = 78/493 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA------------ 48
M+NL GWGGP P SW QQ LQKKIL R+ E GM+PVLP +SG +P+
Sbjct: 192 MNNLEGWGGPNPDSWYKQQEDLQKKILKRMKEWGMHPVLPGYSGMIPSKLDLGKRIDGGK 251
Query: 49 ---ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRT 105
L N + + L W +L DP F +I F E+ K YG T
Sbjct: 252 EEKTLSNTSSESAQSTLNKWNGFDRPG------ILLPDDPKFTQIASLFYEETEKLYG-T 304
Query: 106 SHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQM 165
S Y+ D F E + + G AI M+ + AVW++QGW + +P RP M
Sbjct: 305 SDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKANPKAVWVVQGW--TENP--RPEMM 359
Query: 166 KALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGN 212
KAL G L++LDLF+E +P+W G+P IW C+L NF GN
Sbjct: 360 KAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGN 408
Query: 213 IEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 271
+ ++G +D + + + + G+G++MEGIE NPV+++LM E+ ++ EK +
Sbjct: 409 VGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEE 468
Query: 272 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTE 329
WI QY RYG +I AW +L + +YNC G + + + P ++ + + +
Sbjct: 469 WIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQQGPHESIFCGRPSLN-NFQASSW 527
Query: 330 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 389
K NY P +T+E R L ++ ++ +N + YDL+
Sbjct: 528 SKMCNYYDPT--------------------TTAEAAR---LMVSVAHKYRGNNNFEYDLV 564
Query: 390 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 449
D+TRQA+A A ++ + ++ D +R+FLEL+ D LL F +G W+
Sbjct: 565 DITRQAIADRARIVYNYAVADFKSFDKKSYATHTRQFLELLIMQDKLLGTRKEFKVGNWI 624
Query: 450 ESAKQLAQNEEQE 462
+ A+ L E++
Sbjct: 625 QQARNLGSTSEEK 637
>gi|319640296|ref|ZP_07995021.1| hypothetical protein HMPREF9011_00618 [Bacteroides sp. 3_1_40A]
gi|317388071|gb|EFV68925.1| hypothetical protein HMPREF9011_00618 [Bacteroides sp. 3_1_40A]
Length = 752
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 153/493 (31%), Positives = 235/493 (47%), Gaps = 78/493 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA------------ 48
M+NL GWGGP P SW QQ LQKKIL R+ E GM+PVLP +SG +P+
Sbjct: 190 MNNLEGWGGPNPDSWYKQQEDLQKKILKRMKEWGMHPVLPGYSGMIPSKLDLGKRIDGGK 249
Query: 49 ---ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRT 105
L N + + L W +L DP F +I F E+ K YG T
Sbjct: 250 EEKTLSNTSSESAQSTLNKWNGFDRPG------ILLPDDPKFTQIASLFYEETEKLYG-T 302
Query: 106 SHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQM 165
S Y+ D F E + + G AI M+ + AVW++QGW + +P RP M
Sbjct: 303 SDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKANPKAVWVVQGW--TENP--RPEMM 357
Query: 166 KALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGN 212
KAL G L++LDLF+E +P+W G+P IW C+L NF GN
Sbjct: 358 KAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGN 406
Query: 213 IEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 271
+ ++G +D + + + + G+G++MEGIE NPV+++LM E+ ++ EK +
Sbjct: 407 VGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEE 466
Query: 272 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTE 329
WI QY RYG +I AW +L + +YNC G + + + P ++ + + +
Sbjct: 467 WIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQQGPHESIFCGRPSLN-NFQASSW 525
Query: 330 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 389
K NY P +T+E R L ++ ++ +N + YDL+
Sbjct: 526 SKMCNYYDPT--------------------TTAEAAR---LMVSVAHKYRGNNNFEYDLV 562
Query: 390 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 449
D+TRQA+A A ++ + ++ D +R+FLEL+ D LL F +G W+
Sbjct: 563 DITRQAIADRARIVYNYAVADFKSFDKKSYATHTRQFLELLIMQDKLLGTRKEFKVGNWI 622
Query: 450 ESAKQLAQNEEQE 462
+ A+ L E++
Sbjct: 623 QQARNLGSTSEEK 635
>gi|383115203|ref|ZP_09935961.1| hypothetical protein BSGG_2915 [Bacteroides sp. D2]
gi|313695380|gb|EFS32215.1| hypothetical protein BSGG_2915 [Bacteroides sp. D2]
Length = 727
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 144/459 (31%), Positives = 222/459 (48%), Gaps = 42/459 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C +L + D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+ PYIWC L NF GN + G + A + + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ D K WI + R G +++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAWNLNADDNK-WIECLADRHVGCVSQSVRDAWKRLFNDI 489
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y V P T G Y +P + K ++ Y + L
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524
Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFFDVKVEFDRMVEAKDYQ 575
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
+ + E++ D+D L A H L W++ A+++
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMG 614
>gi|299140550|ref|ZP_07033688.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
(NAG) [Prevotella oris C735]
gi|298577516|gb|EFI49384.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
(NAG) [Prevotella oris C735]
Length = 741
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 149/470 (31%), Positives = 229/470 (48%), Gaps = 49/470 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGPLP SW +QQ LQKKIL R++E GM PVLP F G +P + +T
Sbjct: 186 MNNLEGWGGPLPDSWYNQQEALQKKILKRMHEYGMQPVLPGFCGMMPHDAKAKL-GLNVT 244
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W L TD F +I + + K YG+ ++ Y+ D F E T
Sbjct: 245 DGGIWNGYTRPAN------LSPTDAHFDKIADLYYAELTKLYGKANY-YSMDPFHE-TND 296
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
++ +Y S G + M+ + A W++QGW + +P RP +K + N G L+VL
Sbjct: 297 DETIDY-SKAGCKVMEAMKRVNPKATWVIQGW--TENP--RPQMIKNMKN----GDLLVL 347
Query: 181 DLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN 234
DLF+E +P IW K + +++CML NF N+ ++G +D + + S
Sbjct: 348 DLFSECRPMFGIPSIWKREKGYEQHDWLFCMLENFGANVGLHGRMDQLLHNFYSTKQSSP 407
Query: 235 TT--MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAW 292
T + G+G +MEG E NPV+++LMSE+ ++ E + WI Y RYG++ P I+ AW
Sbjct: 408 NTQHLKGIGFTMEGSENNPVMFELMSELPWRTE-CKKEDWIKGYVKARYGKTSPEIERAW 466
Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 352
+L T+YNC G + SI G+P +KS +
Sbjct: 467 QLLSETIYNCPAGNNQQG---------PHESIFC---------GRPSLNNFQVKSWSKMR 508
Query: 353 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
++ Y + A +L ++ +N + YDL+D+ RQALA +L I Y
Sbjct: 509 NY----YDPQATLEAAQLMTGIADQYKGNNNFEYDLVDICRQALADQGRLQYLKTIADYN 564
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ + RFLE++ D LL F LG W E+A++L ++++
Sbjct: 565 GFSRKAFAKDAHRFLEMILLQDKLLGTRTEFRLGHWTEAARKLGTTQQEK 614
>gi|423214204|ref|ZP_17200732.1| hypothetical protein HMPREF1074_02264 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693149|gb|EIY86384.1| hypothetical protein HMPREF1074_02264 [Bacteroides xylanisolvens
CL03T12C04]
Length = 727
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 141/461 (30%), Positives = 223/461 (48%), Gaps = 42/461 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C + L+ D LF +I + F+++Q K +G T H+Y D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHVYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+ PYIWC L NF GN + G + A + + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGERLENALINGGGNLKG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ + VD WI + R G ++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDDKWIECLADRHVGCVSQPVRDAWKRLFNDI 489
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y + + T G Y +P + K ++ Y + L
Sbjct: 490 Y----------------------AQVPRTLGTLPGY-RPALNKNSEKRTSNVYSNIEL-- 524
Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQ 575
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
+ + E++ D+D L A H L W++ A+++ +
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDS 616
>gi|160884062|ref|ZP_02065065.1| hypothetical protein BACOVA_02038 [Bacteroides ovatus ATCC 8483]
gi|423291477|ref|ZP_17270325.1| hypothetical protein HMPREF1069_05368 [Bacteroides ovatus
CL02T12C04]
gi|156110404|gb|EDO12149.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
8483]
gi|392663477|gb|EIY57027.1| hypothetical protein HMPREF1069_05368 [Bacteroides ovatus
CL02T12C04]
Length = 727
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 144/461 (31%), Positives = 222/461 (48%), Gaps = 42/461 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C + L+ D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+ PYIWC L NF GN + G + A + + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGERLENALINGGGNLKG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ + VD WI + R G ++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDDKWIECLADRHVGCVSQPVRDAWKRLFNDI 489
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y V P T G Y +P K ++ Y + L
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNRNSEKRTSNVYSNVEL-- 524
Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKVEFDRMVEAKDHQ 575
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
+ + E++ D+D L A H L W++ A+++ +
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDS 616
>gi|423212382|ref|ZP_17198911.1| hypothetical protein HMPREF1074_00443 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694828|gb|EIY88054.1| hypothetical protein HMPREF1074_00443 [Bacteroides xylanisolvens
CL03T12C04]
Length = 705
Score = 225 bits (573), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 149/478 (31%), Positives = 233/478 (48%), Gaps = 61/478 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ VLQKKI+ R+ ELG+ PV P ++G VP + +I
Sbjct: 196 MNNLEGWGGPNPDSWYRQQEVLQKKIVARMRELGIEPVFPGYAGMVPRNIGEKL-GYQIA 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W S PR L D F + E+ K YG+ ++ Y+ D F E NT
Sbjct: 255 DPGKWCSF---PR---PAFLSTEDEHFESFAAMYYEELEKLYGKANY-YSMDPFHEGGNT 307
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD ++ GA+I + M+ + AVW++Q W+ + +++S+ G ++
Sbjct: 308 EGVD----LAKTGASIMAAMKKANPKAVWVIQA--------WQANPREEMISSLNQGDML 355
Query: 179 VLDLFAEVKPIWS-------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
VLDL++E P W K F +++CML NF N+ ++G +D + G +A
Sbjct: 356 VLDLYSERLPQWGDPDSKWYREKGFGKHDWLYCMLLNFGANVGLHGRMDLLVNGYYDACA 415
Query: 232 SEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-PAIQ 289
N T+ GVG + EGIE NPV+++L+ E+ ++ E+ W+ Y RYG+ V P +
Sbjct: 416 HANGKTLRGVGATPEGIENNPVMFELLYELPWREERFSPDEWLQGYLKARYGKDVSPEVM 475
Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS----KEAVL 345
+AW L HTVYN RD YQ G S +
Sbjct: 476 EAWRALEHTVYNAP-------RD-------------------YQGEGTVESLLCARPGFH 509
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
TS++ + L+YS +A L + + SN + YDL+D+ RQ+ A N L
Sbjct: 510 LDRTSTWGYAKLFYSPDSTAKAARLLTSVAKQYEGSNNFEYDLVDIVRQSNADKGNVLLE 569
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+I ++Y D + +++FL+L+ D LL+ F + WL++A+ L + ++K
Sbjct: 570 DISQSYDRKDKENFRKQTQQFLDLIVSQDSLLSTRKEFSVSTWLDAARSLGTTDAEKK 627
>gi|323344412|ref|ZP_08084637.1| alpha-N-acetylglucosaminidase [Prevotella oralis ATCC 33269]
gi|323094539|gb|EFZ37115.1| alpha-N-acetylglucosaminidase [Prevotella oralis ATCC 33269]
Length = 730
Score = 224 bits (572), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 141/462 (30%), Positives = 221/462 (47%), Gaps = 39/462 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP+ WL+ Q LQKKIL R M PVLPAF+G+VPA L+ +FP A I
Sbjct: 198 MANIDRWNGPLPKEWLNGQKELQKKILARERAFNMKPVLPAFAGHVPAELKRIFPDANIK 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W + ++ C + L +PLF +I + ++E+Q +G T HIY D F+E PP
Sbjct: 258 SLGKWGGFEE--KYLC-HFLSPEEPLFSKIQKLYLEEQTALFG-TDHIYGVDPFNEVEPP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
P Y+ + +Y + + D A W+ GW+FSYD W P +++A L VP GK+ +
Sbjct: 314 SWEPAYLRKVSKNMYGTLTAVDPKAEWMQMGWMFSYDNKHWTPDRVQAFLTGVPKGKMSL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + E +W T+ FYG PYIWC L NF GN + G + A + M+G
Sbjct: 374 LDYYCENVELWKTTDGFYGQPYIWCYLGNFGGNTTLMGNVKESGRRLDNALANGQRNMLG 433
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G ++EG++ Y+ + + H D + WI+ + R YG P+++ AW++L++ +
Sbjct: 434 AGSTLEGLDVIQFPYEYLYNKLWSHAVADSR-WIDDLADRHYGGVSPSVRKAWHILFNDI 492
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y + + +G N+ +P Y L
Sbjct: 493 Y---------------------VQVSASMQGVLTNF-RPALNNNYPHRTAIEYPAERL-- 528
Query: 360 STSEVIR-ALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
EV R L++ NEL + D+I + RQ L + AY D
Sbjct: 529 --EEVWRLLLDVPRCDRNEL------QLDIIAVGRQVLGNRFAVVKTQFDSAYANKDIPR 580
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
+ + EL+ D+D L + + + W++ A++L +E
Sbjct: 581 LKAKACEMEELLGDLDRLTSFNSRCSINRWIDDARKLGSTKE 622
>gi|346323119|gb|EGX92717.1| alpha-N-acetylglucosaminidase, putative [Cordyceps militaris CM01]
Length = 742
Score = 224 bits (570), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 142/476 (29%), Positives = 240/476 (50%), Gaps = 39/476 (8%)
Query: 10 PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVK 69
PL SW+DQQ LQK+I+ R+ +LG+ P+LPAF G VP A + P A + + W +
Sbjct: 195 PLSLSWIDQQFALQKRIVARMVQLGITPILPAFPGFVPDAFARLRPGADLVRAPAWGGLP 254
Query: 70 SDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISS 129
+D L D + E+ R F+E Q++ YG +++Y D F+E P + +Y+S+
Sbjct: 255 ADSPNTRALFLSPLDDAYAELQRLFVEAQIEAYGNVTNVYAMDQFNEINPVSGATDYLSA 314
Query: 130 LGAAIYSGMQSGDSDAVWLMQGWLF--SYDPFWRPPQMKALLNSVP-LGKLVVLDLFAEV 186
+ Y+ + + + AVWLMQGWLF S FW +++A L +V+LDLF+E
Sbjct: 315 VSRRSYAALAAANPAAVWLMQGWLFYLSEGNFWTQERIEAYLRGPEDRAGMVILDLFSET 374
Query: 187 KPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEG 246
P W + + G P+IWC +H+F GN ++G + + P+EA E+ +MVG+G++ E
Sbjct: 375 APQWQRTGSYAGRPWIWCQVHDFGGNQNLFGKITNTTVNPMEA-LRESDSMVGLGIATEA 433
Query: 247 IEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPA-IQDAWNVLYHTVYNCT 303
E N V+YDL + + +D ++ + ++ RRY R +PA + AW +L TVY+
Sbjct: 434 YEGNEVLYDLFFDQGWSATPIDTVSYFHDWTTRRYSGVRQLPASLYQAWELLRVTVYDY- 492
Query: 304 DGATDKNRDVI---VAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ D+I V+ ++P++ + Y K L + ++ P +W
Sbjct: 493 -----RASDLIGVPVSVYQLEPNLTGL-------YNTTTGKPTALHYDPAAL--PPIW-- 536
Query: 361 TSEVIRALELFIASGN---ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
LF+A+ L A +R DL+D+ RQ L+ L+ +++ A+
Sbjct: 537 --------RLFVAAAAAQPRLWAEPGFRLDLVDVMRQVLSNAFGRLYADLVAAFTGGAPP 588
Query: 418 G-VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVRCPYVSQ 472
+ Q +R ++ D+D LLA F L WL +A+ ++ + + SQ
Sbjct: 589 SEIAQRGQRMRAVLGDVDALLATQPHFSLRRWLNAARAWGESTGENAAIAYEARSQ 644
>gi|299144715|ref|ZP_07037783.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
gi|298515206|gb|EFI39087.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
Length = 727
Score = 224 bits (570), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 141/461 (30%), Positives = 223/461 (48%), Gaps = 42/461 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRLYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C + L+ D LF +I + F+++Q K +G HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-IDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIVSDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+ PYIWC L NF GN + G + A + + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ + VD WI + R G +++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 489
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y + + T G Y +P + K ++ Y + L
Sbjct: 490 Y----------------------AQVPRTLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524
Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQ 575
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
+ + E++ D+D L A H L W++ A+++ +
Sbjct: 576 ALKACGEKMKEILHDLDKLNAFHPYCSLDKWIDDARKMGDS 616
>gi|423217398|ref|ZP_17203894.1| hypothetical protein HMPREF1061_00667 [Bacteroides caccae
CL03T12C61]
gi|392628557|gb|EIY22583.1| hypothetical protein HMPREF1061_00667 [Bacteroides caccae
CL03T12C61]
Length = 707
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 144/477 (30%), Positives = 233/477 (48%), Gaps = 59/477 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVSRMRELGIEPVFPGYAGMVPRNIGE-------- 247
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L + D F + E+ K YG+ + Y+ D F E
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSSEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 304
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ G +I M+ + +AVW++Q W + P A+++ + G
Sbjct: 305 GNTEGVD----LAKAGTSIMKAMKKANPEAVWVIQAWQANPRP--------AMIDVLNAG 352
Query: 176 KLVVLDLFAEVKPIWSTS-------KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
++VLDL++E +P W S K F +++CML NF GN+ ++G ++ + G +
Sbjct: 353 DMLVLDLYSEKRPQWGDSDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412
Query: 229 ARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
A N M GVG + EGIE NPV+++L+ E+ ++ E+ W+ Y RYG + P
Sbjct: 413 ACAHVNGKRMRGVGATPEGIENNPVMFELLYELPWRAERFSPDVWLQGYLKARYGGELSP 472
Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
+ +AW L HTVYN P P EG ++ ++
Sbjct: 473 EVMEAWRALEHTVYNA---------------PKNSPG-----EGTLESL--LCARPGFHL 510
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
TS++ + L+YS +A +L ++ + N + YDL+D+ RQ+ A N L
Sbjct: 511 DRTSTWGYSKLFYSPDSTSKAADLMLSVAEQYKGDNNFEYDLVDIVRQSNADKGNALLDE 570
Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
I ++Y D + +++FLEL+ D LL+ F + WL +A+ L + ++K
Sbjct: 571 ISQSYDRKDKENFRKQTQQFLELILSQDSLLSTRKEFSVSSWLAAARSLGNTDAEKK 627
>gi|340514474|gb|EGR44736.1| glycoside hydrolase family 89 [Trichoderma reesei QM6a]
Length = 762
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 142/468 (30%), Positives = 241/468 (51%), Gaps = 38/468 (8%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
NL G W LP W+D Q LQKKI+ R+ ELG+ P+LPAF G VP A V P A++
Sbjct: 209 NLQGSWSSSLPFEWVDDQFALQKKIVKRMVELGITPILPAFPGFVPRAAPRVLPDARLLH 268
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W + LD DPLF ++ R+FI +Q + YG ++ Y D F+E PP
Sbjct: 269 SIQWAGFPE--IFTEDTFLDPVDPLFAQMQRSFITKQKQAYGNVTNFYTLDQFNEMIPPS 326
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPL-GKLVV 179
Y+ ++ + + ++S D +A+W+ Q WLF+ + FW +++A L V +++
Sbjct: 327 GDVAYLRNVSSNTWKALKSADPNAIWVFQAWLFAQNTTFWTNERIEAYLGGVTADSDMLI 386
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD+++E P W ++ +YG P+IWC L N+ I +YG + ++ P+ A E+T++ G
Sbjct: 387 LDIWSESMPQWQRAQSYYGKPWIWCELQNYGATINLYGQIQNVTNSPILA-LQESTSLSG 445
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPAIQDAWNVLYH 297
G+SMEG + N +VYDL+ A+ E +D +A+ + ++ RY + I DAW +
Sbjct: 446 FGLSMEGQQNNEIVYDLLLAQAWSSEPLDTEAYFHNWASARYSSDQRPGFIHDAWETVRT 505
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
TVY+ T+ + P V SII + V + + + ++ + L
Sbjct: 506 TVYDNTN---------LTLMPSVPKSIIEL-----------VPRTSNM-ADITGILGTKL 544
Query: 358 WYSTSEVIRALELFIASG---NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
Y + ++ A + +G L ++ Y+YDL+D TRQ LA ++ NI++ Y +
Sbjct: 545 PYDPAVMVSAWKQLYHAGLQDTSLFNNSAYQYDLVDWTRQVLANAFIPIYKNIVDIYYNS 604
Query: 415 DAHGVFQLSR------RFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
+ ++ R + +L+ +D +L+ + F L WL +A+ A
Sbjct: 605 NQTAGSRIQRLKAQGQQVTKLLLSLDLVLSSNRNFRLSTWLSAARSSA 652
>gi|153807690|ref|ZP_01960358.1| hypothetical protein BACCAC_01972 [Bacteroides caccae ATCC 43185]
gi|149130052|gb|EDM21264.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
43185]
Length = 707
Score = 222 bits (566), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 144/484 (29%), Positives = 236/484 (48%), Gaps = 73/484 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVSRMRELGIEPVFPGYAGMVPRNIGE-------- 247
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L + D F + E+ K YG+ + Y+ D F E
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSSEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 304
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ G +I M+ + +AVW++Q W + P A+++ + G
Sbjct: 305 GNTEGVD----LAKAGTSIMKAMKKANPEAVWVIQAWQANPRP--------AMVDVLNAG 352
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
++VLDL++E P +W K F +++CML NF GN+ ++G ++ + G +
Sbjct: 353 DMLVLDLYSERLPQWGDPDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412
Query: 229 ARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
A T N T+ GVG + EGIE NPV+++L+ E+ ++ E+ W+ Y RYG + P
Sbjct: 413 ACTHANGKTLRGVGTTPEGIENNPVMFELLYELPWRAERFSPDTWLQGYLKARYGGELSP 472
Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
+ +AW L HTVYN +NY + E++L
Sbjct: 473 EVMEAWRALEHTVYNAP-----------------------------KNYQGEGTVESLLC 503
Query: 347 S-------ETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 399
+ TS++ + L+YS +A +L ++ + +N + YDL+D+ RQ+ A
Sbjct: 504 ARPGFHLDRTSTWGYSKLFYSPDSTSKAADLMLSVAEQYKGNNNFEYDLVDIVRQSNADK 563
Query: 400 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
N L I ++Y D + +++FLEL+ D LL+ F + WL +A+ L +
Sbjct: 564 GNALLDEISQSYDRKDKENFRKQTQQFLELILSQDSLLSTRKEFSVSSWLTAARSLGNTD 623
Query: 460 EQEK 463
++K
Sbjct: 624 AEKK 627
>gi|355706271|gb|AES02588.1| N-acetylglucosaminidase [Mustela putorius furo]
Length = 333
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 129/331 (38%), Positives = 187/331 (56%), Gaps = 45/331 (13%)
Query: 142 DSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVP 200
D DAVWL+QGWLF + P FW P Q++A+L +VP G+L++LDLFAE +P++ + F+G P
Sbjct: 3 DPDAVWLLQGWLFQHQPQFWGPAQVRAVLGAVPRGRLLILDLFAESQPVYLRTASFHGQP 62
Query: 201 YIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEM 260
+IWCMLHNF GN ++G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+
Sbjct: 63 FIWCMLHNFGGNHGLFGALEAVNQGPAAARLFPNSTMVGTGMAPEGIGQNEVVYALMAEL 122
Query: 261 AFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFP 318
++ + V D++AW+ ++ RRYG + AW +L +VYNC+ + T NR +V
Sbjct: 123 GWRKDPVADLEAWVTSFAARRYGVDSKETEVAWRLLLGSVYNCSGEACTGHNRSPLVR-- 180
Query: 319 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 378
PS+ VT +WY+ S V A L +A+ L
Sbjct: 181 --RPSLQMVTT---------------------------VWYNRSAVFEAWRLLLAAAPTL 211
Query: 379 SASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVED 432
+ S T+RYDL+D+TRQA + + + AY L A G+ EL+
Sbjct: 212 AKSPTFRYDLLDVTRQAAQELVSLYYTEARTAYLNKELVPLMRAAGIL-----VYELLPA 266
Query: 433 MDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+DG+LA FLLG WLE A+ +A +E +
Sbjct: 267 LDGVLASDSRFLLGTWLEQARAVAVSETDAR 297
>gi|422873453|ref|ZP_16919938.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens F262]
gi|380305838|gb|EIA18115.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens F262]
Length = 2104
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 143/463 (30%), Positives = 232/463 (50%), Gaps = 41/463 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GNLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I DAWN++
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY DA
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNRDAE 759
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
+S +FLEL++ + +L+ FL+G W+E A+ + ++ +
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSD 802
>gi|429766730|ref|ZP_19298977.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
celatum DSM 1785]
gi|429183354|gb|EKY24416.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
celatum DSM 1785]
Length = 2284
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 138/458 (30%), Positives = 230/458 (50%), Gaps = 41/458 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGPLP +W + ++ L +++ R+ LG+ PVL +SG VP Q P A+I
Sbjct: 378 MQNMTSFGGPLPDNWFEDRVELGRQLHERMQTLGIKPVLQGYSGMVPLDFQKKNPDAQIL 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ D F E+ F E+Q + YG + Y D F E NT
Sbjct: 438 SQGGWCGFDR-PNMLKTYVNDGERDYFQEVADVFYEKQKEVYGDITDYYAVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+DS + + I M D DA+W++Q W + D ++ L N + +
Sbjct: 497 GGMDS----ARIYGTIQDKMIEHDEDAIWVIQHWQGNPDN----TKLSGLTNK---EQAL 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTM 237
+LDL +++ P + T +P++W MLHNF G + + G ++++A EA T+EN M
Sbjct: 546 ILDLNSDLNPDY-TRFDNQDIPWVWNMLHNFGGRMGLDGQVETVATSITEALATTEN--M 602
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G+G++ E + +P+VY+LM +M + + ++ + W+N Y RRYG +AW +L
Sbjct: 603 KGIGITPEALANSPIVYELMGDMIWTRDPINYREWVNNYIERRYGAVNEDAIEAWEILLE 662
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y +D + SII+ ++ A + S++ H +
Sbjct: 663 TAYKTSD----------YYYQGAAESIIN-------------ARPATSINSASTWGHSKI 699
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y E+ RA+ELFI+ +EL S+ + YD +D+T+Q LA A E ++ AY DA
Sbjct: 700 SYDKKELERAMELFISCYDELKDSDAFVYDFLDVTKQVLANSAQEYHKEMVAAYNSGDAE 759
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
++S FL+L+ + +L+ FL+G W+E ++ +
Sbjct: 760 KFERISEHFLDLIRLQERVLSTSPEFLVGTWIEQSRTM 797
>gi|261880010|ref|ZP_06006437.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
gi|270333326|gb|EFA44112.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
Length = 719
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 140/460 (30%), Positives = 214/460 (46%), Gaps = 44/460 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP + +Q LQ+KIL R L M PVLPAFSG+VP ++ ++P + I
Sbjct: 195 MANIDKWKGPLPYHTVVEQRDLQQKILARERSLNMTPVLPAFSGHVPGQIKQLYPESNIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W + R Y + DPLF +I R ++E+Q YG T HIY D F+E PP
Sbjct: 255 HLGRWAAFSDQYR---CYFMSPQDPLFAKIQRMYLEEQRAIYG-TDHIYGIDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
P+Y+ + IY + D A WL WLF + W P ++KAL+ V GK+V+
Sbjct: 311 SWDPDYLFQISKGIYQTLAHVDPKAEWLQMSWLFYHKKKKWTPERVKALITGVETGKMVL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD F + IW + +FYG PYIWC L NF GN + G + + T + G
Sbjct: 371 LDYFCDRNEIWKMTDKFYGQPYIWCYLGNFGGNTTVAGNVKACGAKLDSTLTLGGKNLQG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG+++EG + Y+ + + + + WI+ + G + P+ + AW +LYH V
Sbjct: 431 VGLTLEGFDVCQFPYEYILDKVWSGNSSE-NQWIDALADSHVGYASPSFRKAWQLLYHDV 489
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
+ + G+ P P + S+ ++ + H+ Y
Sbjct: 490 FVQSAGSNG-------ILPCYRPELNSL-----------------------NWHYTHVDY 519
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN---IIEAYQLNDA 416
++I A +L + S + DLI RQ L NE + AY D
Sbjct: 520 DRQKLIEAWKLMQHDAD--SKRTAAQLDLIHYGRQVL---GNEFLTHKQLFDSAYAHCDL 574
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
G+ + ++ D+D L A H L W++ A+Q+A
Sbjct: 575 AGMMAQAASMRHIMLDIDTLTAYHPRCTLAGWIDGARQMA 614
>gi|212693694|ref|ZP_03301822.1| hypothetical protein BACDOR_03214 [Bacteroides dorei DSM 17855]
gi|265755881|ref|ZP_06090348.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_33FAA]
gi|212663753|gb|EEB24327.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides dorei DSM 17855]
gi|263233959|gb|EEZ19560.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_33FAA]
Length = 718
Score = 218 bits (556), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 152/480 (31%), Positives = 241/480 (50%), Gaps = 67/480 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPESWYIRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + ++ K YG+T Y D F E
Sbjct: 240 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYKELTKLYGKTG-FYAIDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W + +++ +
Sbjct: 296 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--------QDNPRTSMIEHLEA 343
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ G
Sbjct: 344 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYCMLLNFGGNIGLHGKMDALINGFY 403
Query: 228 EARTSENT--TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A+T + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ + W+ Y RYG
Sbjct: 404 DAKTDNHAGKTLCGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
A+Q AW++L + +YN + + V A P +D YQ
Sbjct: 464 EALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 560 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDHLLGTRKEFKVGTWIEAARSAGQTQEEK 618
>gi|373460171|ref|ZP_09551927.1| hypothetical protein HMPREF9944_00191 [Prevotella maculosa OT 289]
gi|371956556|gb|EHO74342.1| hypothetical protein HMPREF9944_00191 [Prevotella maculosa OT 289]
Length = 742
Score = 218 bits (556), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 142/472 (30%), Positives = 224/472 (47%), Gaps = 52/472 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGPLP SW QQ LQKKIL R++E GM PVLP F G +P + +T
Sbjct: 186 MNNLEGWGGPLPDSWYKQQETLQKKILQRMHEYGMEPVLPGFCGMMPHDAKEKL-GLNVT 244
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W L TD F I + + + YG+ ++ Y+ D F E+
Sbjct: 245 DGGKWNGYTRPAN------LSPTDSQFNRIADLYYAELTRLYGKANY-YSMDPFHESNDD 297
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D+ +Y G+ + M+ + A W++QGW + +P RP ++ + N G L++L
Sbjct: 298 -DALDY-GKAGSVMLEAMKRINPKATWVIQGW--TENP--RPRMIQDMKN----GDLLIL 347
Query: 181 DLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA--FGPVEARTS 232
DLF+E +P +W K + +++CML NF N+ ++G +D + F + R+
Sbjct: 348 DLFSECRPMFGIPSVWKREKGYEQHDWLFCMLENFGANVGLHGRMDQLIHNFYSTKKRSP 407
Query: 233 ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAW 292
+ G+G +MEG E NPV+++LMSE+ ++ E + W+ Y RYGR I+ AW
Sbjct: 408 NTQHLKGIGFTMEGSENNPVMFELMSELPWRPEIFKKEDWVRGYVKARYGRKDETIERAW 467
Query: 293 NVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS 350
+L T+YNC G + + V P ++ + + K +NY
Sbjct: 468 LLLAETIYNCPAGNNQQGPHESVFCGRPGLNNFQVK-SWSKMRNY--------------- 511
Query: 351 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEA 410
Y + A L + + +N + YDLID+ RQALA +L I
Sbjct: 512 --------YDPQATLEAARLMASVSSRYKGNNNFEYDLIDICRQALADQGRLQYLKTIAD 563
Query: 411 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
Y + ++RFL+++ D LL F LG W E+A+ L + ++
Sbjct: 564 YNGFSRAAFAKDAKRFLDMILLQDRLLGTRKEFRLGHWTEAARSLGTTQAEK 615
>gi|237708859|ref|ZP_04539340.1| glycoside hydrolase family 89 protein [Bacteroides sp. 9_1_42FAA]
gi|345513372|ref|ZP_08792893.1| glycoside hydrolase family 89 protein [Bacteroides dorei 5_1_36/D4]
gi|423228941|ref|ZP_17215347.1| hypothetical protein HMPREF1063_01167 [Bacteroides dorei
CL02T00C15]
gi|423242228|ref|ZP_17223337.1| hypothetical protein HMPREF1065_03960 [Bacteroides dorei
CL03T12C01]
gi|423247755|ref|ZP_17228803.1| hypothetical protein HMPREF1064_05009 [Bacteroides dorei
CL02T12C06]
gi|229457285|gb|EEO63006.1| glycoside hydrolase family 89 protein [Bacteroides sp. 9_1_42FAA]
gi|345456211|gb|EEO47557.2| glycoside hydrolase family 89 protein [Bacteroides dorei 5_1_36/D4]
gi|392631297|gb|EIY25272.1| hypothetical protein HMPREF1064_05009 [Bacteroides dorei
CL02T12C06]
gi|392635177|gb|EIY29082.1| hypothetical protein HMPREF1063_01167 [Bacteroides dorei
CL02T00C15]
gi|392639514|gb|EIY33330.1| hypothetical protein HMPREF1065_03960 [Bacteroides dorei
CL03T12C01]
Length = 717
Score = 218 bits (556), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 152/480 (31%), Positives = 241/480 (50%), Gaps = 67/480 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 187 MNNLEGWGGPNPESWYIRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + ++ K YG+T Y D F E
Sbjct: 239 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYKELTKLYGKTG-FYAIDPFHE 294
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W + +++ +
Sbjct: 295 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--------QDNPRTSMIEHLEA 342
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ G
Sbjct: 343 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYCMLLNFGGNIGLHGKMDALINGFY 402
Query: 228 EARTSENT--TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A+T + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ + W+ Y RYG
Sbjct: 403 DAKTDNHAGKTLCGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 462
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
A+Q AW++L + +YN + + V A P +D YQ
Sbjct: 463 EALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 503
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 504 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 558
Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 559 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDHLLGTRKEFKVGTWIEAARSAGQTQEEK 617
>gi|329962235|ref|ZP_08300241.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
gi|328530343|gb|EGF57220.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
Length = 726
Score = 218 bits (556), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 144/471 (30%), Positives = 237/471 (50%), Gaps = 60/471 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP SW + ++ LQK+IL R+ E G++PVLP +SG +P + ++
Sbjct: 188 MNNLEGWGGPNTDSWYEDRIALQKRILKRMREYGIHPVLPGYSGMLPHNAKEKL-GVNVS 246
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
G W C Y L TD F EI + E+ + YG+ + Y+ D F E
Sbjct: 247 DPGTW----------CGYNRPAFLQPTDTRFGEIAALYYEEMNRLYGK-ADFYSMDPFHE 295
Query: 117 NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
+ + G AI+ M+ ++VW++Q W + P+ + ++ +VP G
Sbjct: 296 GGKVAGVN--LDAAGQAIWQAMKKNSRNSVWVVQAWGAN-------PRAQ-MIKNVPRGD 345
Query: 177 LVVLDLFAEVKPIWSTSKQ-------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA 229
++VLDL++E +P W + F G +++CML N+ GN+ ++G + + +A
Sbjct: 346 MLVLDLYSESRPQWGEPESSWYRENGFDGHQWLYCMLLNYGGNVGLHGKMQHVIDAYYKA 405
Query: 230 -RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAI 288
R+S T+ GVGM+MEG E NPV+Y+L+ E+ ++ W+ Y RYG+ P +
Sbjct: 406 SRSSFGNTLKGVGMTMEGSENNPVMYELLCELPWRPSTFSKDEWLEGYIAARYGKCTPRL 465
Query: 289 QDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
++AW +L +++YNC +T + + + A P + + +A
Sbjct: 466 REAWVLLGNSIYNCPPRSTQQGTHESIFCARPSLK------------------AYQASSW 507
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
SE S Y P +VIRA LF+ + ++ + YDL+D+TRQA+A+ ++
Sbjct: 508 SEMSDYYRPQ------DVIRAAGLFLEEAGQFKGNDNFEYDLVDITRQAVAEKGRLIYKV 561
Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 457
I +Y+ D + Q S RFLEL+ D LLA F +G W+E A+ L
Sbjct: 562 IQASYEAGDKPLLRQASDRFLELLLLQDRLLATRPEFKVGRWIEQARNLGH 612
>gi|395804724|ref|ZP_10483959.1| alpha-N-acetylglucosaminidase [Flavobacterium sp. F52]
gi|395433112|gb|EJF99070.1| alpha-N-acetylglucosaminidase [Flavobacterium sp. F52]
Length = 722
Score = 218 bits (556), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 145/465 (31%), Positives = 231/465 (49%), Gaps = 34/465 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ GPLPQ W ++ LQKKIL R+ L M+PV+PAFSG VP A P AKIT
Sbjct: 208 MGNINSLEGPLPQEWFVKKEALQKKILERMKALDMHPVVPAFSGYVPKAFAEKHPEAKIT 267
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L +W S + T+LLD+ DPLF +IG+ FIE K YG+ S+ Y D+F+E PP
Sbjct: 268 ELKSW----SGGGFASTFLLDSKDPLFKQIGKRFIEIYTKMYGK-SNFYLADSFNEIEPP 322
Query: 121 V---DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK 176
V + E +S+ G+A+Y + AVW+MQGWLF + FW KA L+ VP K
Sbjct: 323 VSEHNKYEELSNYGSAVYETIDEAAPGAVWVMQGWLFGDNKEFWTKEATKAFLSKVPNEK 382
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT- 235
++V D + +W + FYG + + +HN+ G+ +YG L+ + N
Sbjct: 383 VMVQDYANDRYKVWENQEAFYGKQWTYGYVHNYGGSNPVYGDLNFYKDELASLLKNPNRG 442
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
+VG G EG+ N +VY+ + ++ + + + W+ +Y RYG++ ++ AW +L
Sbjct: 443 NIVGYGAMPEGLNNNSIVYEYIYDLPWTKAEQPLNDWMAKYLNARYGQTSESVFHAWELL 502
Query: 296 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
+VYN T D A+ + +TE K G P K
Sbjct: 503 LKSVYNVKYWETRWWNDWAGAYLLFKRPTVKITEFK----GNPGDK-------------- 544
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
++ AL++ + + +N +YDLID++R + +E + I+AYQ +
Sbjct: 545 ------IKLKEALDILKKEAKKYNKNNLIQYDLIDVSRHYNSLSIDEELIECIKAYQEKN 598
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
QL ++ + V + D +++ L W++SA + E
Sbjct: 599 IAKGDQLFKQIEKQVLETDKMMSGQPLNNLNQWVKSASDYGSSPE 643
>gi|168216263|ref|ZP_02641888.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens NCTC 8239]
gi|182381741|gb|EDT79220.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens NCTC 8239]
Length = 2104
Score = 218 bits (555), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 141/461 (30%), Positives = 231/461 (50%), Gaps = 41/461 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEEEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L G+ +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKGQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I +AWN++
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILD 662
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
+S +FLEL++ + +L+ FL+G W+E A+ + ++
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKD 800
>gi|110801838|ref|YP_698175.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens SM101]
gi|110682339|gb|ABG85709.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens SM101]
Length = 2095
Score = 218 bits (554), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 142/461 (30%), Positives = 230/461 (49%), Gaps = 41/461 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 369 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 428
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F + F E+Q + +G ++ Y D F E NT
Sbjct: 429 SQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 487
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 488 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 536
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 537 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 593
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I +AWN++
Sbjct: 594 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNEEILEAWNIILD 653
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 654 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 690
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY DA
Sbjct: 691 VYDKSEFEKAIEIFSKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNRDAE 750
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
+S +FLEL++ + +L+ FL+G W+E A+ + ++
Sbjct: 751 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKD 791
>gi|168207628|ref|ZP_02633633.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens E str. JGS1987]
gi|170661027|gb|EDT13710.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens E str. JGS1987]
Length = 2104
Score = 218 bits (554), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 141/463 (30%), Positives = 231/463 (49%), Gaps = 41/463 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQEEVFGEVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I DAWN++
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
+S +FLEL++ + +L+ FL+G W+E A+ + ++ +
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSD 802
>gi|422345314|ref|ZP_16426228.1| hypothetical protein HMPREF9476_00301 [Clostridium perfringens
WAL-14572]
gi|373228039|gb|EHP50349.1| hypothetical protein HMPREF9476_00301 [Clostridium perfringens
WAL-14572]
Length = 1842
Score = 218 bits (554), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 141/463 (30%), Positives = 231/463 (49%), Gaps = 41/463 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I DAWN++
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
+S +FLEL++ + +L+ FL+G W+E A+ + ++ +
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSD 802
>gi|182624959|ref|ZP_02952737.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens D str. JGS1721]
gi|177909756|gb|EDT72174.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens D str. JGS1721]
Length = 2104
Score = 217 bits (553), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 141/463 (30%), Positives = 231/463 (49%), Gaps = 41/463 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I DAWN++
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
+S +FLEL++ + +L+ FL+G W+E A+ + ++ +
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSD 802
>gi|169346867|ref|ZP_02865815.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens C str. JGS1495]
gi|169296926|gb|EDS79050.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens C str. JGS1495]
Length = 2104
Score = 217 bits (553), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 141/461 (30%), Positives = 230/461 (49%), Gaps = 41/461 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I DAWN++
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
+S +FLEL++ + +L+ FL+G W+E A+ + ++
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKD 800
>gi|262406054|ref|ZP_06082604.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|294648118|ref|ZP_06725661.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
gi|294806859|ref|ZP_06765684.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
gi|345510559|ref|ZP_08790126.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
gi|229443271|gb|EEO49062.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
gi|262356929|gb|EEZ06019.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|292636502|gb|EFF54977.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
gi|294445888|gb|EFG14530.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
Length = 718
Score = 217 bits (553), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 138/468 (29%), Positives = 230/468 (49%), Gaps = 54/468 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP A P +
Sbjct: 197 MGNLNKWDGPLSDTWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEAFAQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLTEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + + + +SK +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVVPDQRR-------ISKIDL- 532
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++A+ L+ + +EL +S YR DLI+ LA A +
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYLAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
++ N + ++ ++L+ D+D LLA H + L W+E A+
Sbjct: 578 QALKDDSENRVFAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVEFAR 625
>gi|160914140|ref|ZP_02076362.1| hypothetical protein EUBDOL_00149 [Eubacterium dolichum DSM 3991]
gi|158433951|gb|EDP12240.1| hypothetical protein EUBDOL_00149 [Eubacterium dolichum DSM 3991]
Length = 2150
Score = 217 bits (553), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 135/465 (29%), Positives = 236/465 (50%), Gaps = 45/465 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ +GGPLP W +Q++ L +K+ R+ G++PV+ F G VP + A +T
Sbjct: 387 MQNLYSFGGPLPDDWFEQRVELGRKMHDRMQAFGIDPVIQGFCGQVPMSFVEKNEGAVLT 446
Query: 61 QLGNWFSVKSDPRWCCTYLLD-----ATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
+ W S + P TYL F ++ + F E+Q +G S Y D F
Sbjct: 447 PIDEWPSF-TRPAMIKTYLSQEEIAAGKKDYFKDVAKTFYEKQKNVFGDVSDYYASDPFH 505
Query: 116 E--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP 173
E NT +D ++++ + M ++DA+W+MQ W + D ++ L V
Sbjct: 506 EGGNTQGLD----VTNIFKTVQEEMLKSNADAIWVMQQWQGNLDH----AKLSGL---VK 554
Query: 174 LGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE 233
+ + LDL +++ P S+ + G+ +IWCMLHNF G + + G ++ IA P A S
Sbjct: 555 PEQALALDLQSDMNP--SSVMENEGISWIWCMLHNFGGRMGLDGEVEVIAKEPAIA-ASN 611
Query: 234 NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWN 293
N M G+G++ E +E +P+VY+++ +M + + +D +AW+++Y+ RR G S ++Q+AW+
Sbjct: 612 NQYMKGIGITPEALENSPIVYEMLFDMTWSKDPIDYQAWVDKYATRRAGGSSDSLQEAWD 671
Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
+L T Y +D + + ++I+ G N+ S S++
Sbjct: 672 MLLETAY----------KDKGIYYQGAGETVINARPG--TNF-----------SSASTWG 708
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
H ++ Y E+ + L L I + + +AS YRYDL D+ Q L A E +++A
Sbjct: 709 HSNILYDKEELDKVLSLLIENYDAFAASEAYRYDLADVAEQVLCNAAIEYHALMVQALNN 768
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
D+ ++S FLEL++ D +L + F+LG W+ A+++ N
Sbjct: 769 KDSAEFKRISTHFLELIDLSDRILGSSEEFMLGTWIHDAREMLDN 813
>gi|168212494|ref|ZP_02638119.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens CPE str. F4969]
gi|170716100|gb|EDT28282.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens CPE str. F4969]
Length = 2104
Score = 217 bits (553), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 141/461 (30%), Positives = 230/461 (49%), Gaps = 41/461 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I DAWN++
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
+S +FLEL++ + +L+ FL+G W+E A+ + ++
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKD 800
>gi|326934230|ref|XP_003213195.1| PREDICTED: hypothetical protein LOC100549752 [Meleagris gallopavo]
Length = 650
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 100/207 (48%), Positives = 140/207 (67%), Gaps = 4/207 (1%)
Query: 33 LGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 92
LGM VLPAF+G+VP + VFP T+LGNW D C YLL +P+F IG
Sbjct: 4 LGMTTVLPAFAGHVPPGVLRVFPRINATRLGNWSHF--DCTLSCAYLLSPEEPMFQVIGT 61
Query: 93 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 152
F+++ +KE+G T HIY+ DTF+E +P P Y++ + A++ M D +A WLMQGW
Sbjct: 62 LFLKELIKEFG-TDHIYSADTFNEMSPLSSDPAYLAGITNAVFRAMTGADPEAQWLMQGW 120
Query: 153 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 211
LF + P FW+PPQ++A+L +VPLG+++VLDLFAE KP++ ++ FYG P+IWCMLHNF G
Sbjct: 121 LFQHQPAFWQPPQVQAVLRAVPLGRMIVLDLFAESKPVYEWTESFYGQPFIWCMLHNFGG 180
Query: 212 NIEMYGILDSIAFGPVEARTSENTTMV 238
N ++G +++I GP AR N+TMV
Sbjct: 181 NHGLFGAVEAINRGPFVARRFPNSTMV 207
>gi|168209163|ref|ZP_02634788.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens B str. ATCC 3626]
gi|170712640|gb|EDT24822.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens B str. ATCC 3626]
Length = 2104
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 141/461 (30%), Positives = 229/461 (49%), Gaps = 41/461 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F + F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I DAWN++
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
+S +FLEL++ + +L+ FL+G W+E A+ + ++
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKD 800
>gi|224027030|ref|ZP_03645396.1| hypothetical protein BACCOPRO_03789 [Bacteroides coprophilus DSM
18228]
gi|224020266|gb|EEF78264.1| hypothetical protein BACCOPRO_03789 [Bacteroides coprophilus DSM
18228]
Length = 837
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 148/481 (30%), Positives = 223/481 (46%), Gaps = 62/481 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI- 59
M N+ G GPL W QL LQ KIL R+ LGM P+ P F G +P A + ++P I
Sbjct: 193 MGNVSGIDGPLNPDWHAGQLALQHKILDRMRALGMKPICPGFPGFIPEAFKRIYPDLHIV 252
Query: 60 -TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
T G F +++ T+PLF +I AFI++ KE+G+ + Y D+F+E
Sbjct: 253 ETHWGGAFH---------NWMISPTEPLFAKISEAFIKEWEKEFGKCDY-YLVDSFNEMD 302
Query: 119 PPVDSP------EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNS 171
P E +S G +YS ++ + DAVW+MQGW+F Y W + AL++
Sbjct: 303 IPFPEKGNPARYEMAASYGEKVYSSIKRANKDAVWVMQGWMFGYQRHIWDYETLGALVSR 362
Query: 172 VPLGKLVVLDL-------FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
VP K+++LDL F + W K FY +++ ++ N G M G+LD A
Sbjct: 363 VPDDKMLLLDLAVDYNRHFWHSEVNWEYYKGFYNKQWVYSVIPNMGGKTGMTGVLDFYAN 422
Query: 225 GPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
G +EA +S N +V G++ EGIE N V+Y+L+++ + ++DV+ W+ QYS+ RYG+
Sbjct: 423 GHLEALSSSNRGNLVAHGLAPEGIENNEVLYELVTDAGWSDHRMDVRDWLKQYSINRYGK 482
Query: 284 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
+ + AW+ L +VY F D G +N +S
Sbjct: 483 APAQLMKAWDYLLKSVYG--------------TFTDHPRFNWQFRPGLVKNGSINIS--- 525
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
+ + LE F+A+ EL S Y DL ++T L A L
Sbjct: 526 ------------------DDYFKGLESFVAASEELKDSPYYLTDLCEMTAHYLGSKAEIL 567
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
I + Y L D L RF + MD +L+ H L W+ A + A+ E Q K
Sbjct: 568 TRQIDQEYLLGDTLQAHFLQSRFETFMLGMDRILSQHPTLRLDRWVSFASKAARTEAQRK 627
Query: 464 Q 464
Q
Sbjct: 628 Q 628
>gi|427385205|ref|ZP_18881710.1| hypothetical protein HMPREF9447_02743 [Bacteroides oleiciplenus YIT
12058]
gi|425727373|gb|EKU90233.1| hypothetical protein HMPREF9447_02743 [Bacteroides oleiciplenus YIT
12058]
Length = 719
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 142/474 (29%), Positives = 234/474 (49%), Gaps = 57/474 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
M+NL GWGGP P SW QQ VLQKKIL R+ E G+ PV P +SG VP A + + +
Sbjct: 190 MNNLEGWGGPNPDSWYAQQEVLQKKILKRMREYGIKPVFPGYSGMVPHDADEKLGLNLTK 249
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
+ L N F+ + L TD F EI + +Q K +G+ + Y+ D F E N
Sbjct: 250 SDLWNGFTRPA--------FLQPTDTRFAEIANLYYREQEKLFGKADY-YSMDPFHEAEN 300
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
VD + G AI M+ + A W++QGW + +P RP ++ + N G L
Sbjct: 301 AASVD----FDAAGKAIMQAMKKVNPKATWVVQGW--TENP--RPEMIENMKN----GDL 348
Query: 178 VVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
++LDLF+E +P IW K + +++CML NF GN+ ++G +D + + +
Sbjct: 349 LILDLFSECRPMWGIPSIWKRDKGYEQHDWLFCMLLNFGGNVGLHGRMDQLLDNFYQTKD 408
Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
+ T + G+G++MEG E NPV+++LM E+ ++ EK + W+ Y RYG I+
Sbjct: 409 NPLATHLKGIGLTMEGSENNPVMFELMCELPWRPEKFTKEEWLKDYLFARYGVKDEKIEK 468
Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
AW +L +++YNC G + + + P ++ + + + K +NY P E
Sbjct: 469 AWTLLANSIYNCPFGNNQQGPHESIFCGRPSMN-NFQASSWSKMKNYYDPTVTE------ 521
Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
A L + ++ +N + YDL+D+ RQ+L+ ++ I
Sbjct: 522 -----------------EAARLMLEVADKYRGNNNFEYDLVDIVRQSLSDKGRIVYNQTI 564
Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
++ D + S++FL+++ D LL F +G W+E A+ L E++
Sbjct: 565 ADFKSFDKRSFARDSQKFLDILLLQDRLLGTRSEFRVGRWIEQARNLGTTPEEK 618
>gi|322703040|gb|EFY94656.1| alpha-N-acetylglucosaminidase, putative [Metarhizium anisopliae
ARSEF 23]
Length = 774
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 130/426 (30%), Positives = 228/426 (53%), Gaps = 36/426 (8%)
Query: 5 HGWGGP--LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQL 62
WGG LP ++++QQ LQK+I+ R+ ELG+ PVLPAF G VP +++ V P+A +T
Sbjct: 209 RSWGGKGDLPLAFIEQQFELQKQIVTRMVELGITPVLPAFPGFVPESIKKVRPNANLTVS 268
Query: 63 GNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVD 122
NWF+ D ++ LD D + E+ + F+ +Q+ +G +++Y D F+E +P
Sbjct: 269 PNWFAPAPD-KYTRDLFLDPLDDTYAELQKLFVTKQIDAFGNVTNVYTLDQFNELSPASG 327
Query: 123 SPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLGK-LVVL 180
Y+ + Y+G+ + + AVWL+QGWL FS FW P++ A L V + ++VL
Sbjct: 328 DTAYLRGIARNTYAGLTAANPAAVWLLQGWLFFSSRNFWTQPRIDAYLGGVEDHQGMLVL 387
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL++EV P W + + G P+IWC LH+F GN+ + G + ++ P++A +++ ++VG
Sbjct: 388 DLYSEVNPQWQRTNSYSGKPWIWCQLHDFGGNMALEGRVQTLTSAPIDA-LAQSKSLVGF 446
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPA-IQDAWNVLYH 297
G++ E E N VVYD++ + A+ +D +A+ + +RY S+P+ + AW +L
Sbjct: 447 GLTPEAYEGNEVVYDILLDQAWSATPLDTQAYFASWVTKRYAGISSIPSELYRAWEILRT 506
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VY+ T TD + V VA + P++ + + T + HP
Sbjct: 507 DVYSNTR--TDIPQ-VPVATYQLRPALSGIA------------------NRTGHFPHPTA 545
Query: 358 WYSTSEVIRA-----LELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
+ V++ LE G+ L ++ D +D++RQ L+ + L+ +++ AY+
Sbjct: 546 LHYDPLVLQGVWKLMLEALTRQGS-LWKVPAFQLDFVDVSRQMLSNQFDVLYADLVNAYK 604
Query: 413 LNDAHG 418
+ G
Sbjct: 605 CSTGAG 610
>gi|423312588|ref|ZP_17290525.1| hypothetical protein HMPREF1058_01137 [Bacteroides vulgatus
CL09T03C04]
gi|392688276|gb|EIY81565.1| hypothetical protein HMPREF1058_01137 [Bacteroides vulgatus
CL09T03C04]
Length = 717
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 154/480 (32%), Positives = 239/480 (49%), Gaps = 67/480 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 187 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + + K YG+T Y D F E
Sbjct: 239 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 294
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W +P R P ++ +
Sbjct: 295 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 342
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSI--AFG 225
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ F
Sbjct: 343 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYCMLLNFGGNIGLHGKMDALIDGFY 402
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A T+ GVGM+ EGIE NPV+Y+L+ E+ ++ + W+ Y RYG
Sbjct: 403 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 462
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
A+Q AW++L + +YN + + V A P +D YQ
Sbjct: 463 EALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 503
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 504 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 558
Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 559 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 617
>gi|146300873|ref|YP_001195464.1| alpha-N-acetylglucosaminidase [Flavobacterium johnsoniae UW101]
gi|146155291|gb|ABQ06145.1| Candidate alpha-glycosidase; Glycoside hydrolase family 89
[Flavobacterium johnsoniae UW101]
Length = 723
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 146/469 (31%), Positives = 230/469 (49%), Gaps = 35/469 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ GPLPQ W ++ LQKKIL R+ L M+PV+PAFSG VP A P AKIT
Sbjct: 208 MGNINSLEGPLPQEWFSKKEELQKKILERMRTLDMHPVVPAFSGYVPKAFAEKHPEAKIT 267
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L +W S + T+LLD+ DPLF +IG+ FIE K YG+ S+ Y D+F+E PP
Sbjct: 268 ELNSW----SGGGFESTFLLDSKDPLFKKIGKRFIEIYTKMYGK-SNFYLADSFNEIEPP 322
Query: 121 V---DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK 176
V + E +++ G+AIY ++ AVW+MQGWLF + FW A L+ VP +
Sbjct: 323 VTEHNKYEELANYGSAIYETIEEAAPGAVWVMQGWLFGDNKNFWTKEATSAFLSKVPNDR 382
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE-ARTSENT 235
L+V D + +W + FYG + + +HN+ G+ +YG L+ V +
Sbjct: 383 LMVQDYANDRYKVWENQEAFYGKQWTYGYVHNYGGSNPVYGDLNFYKNELVSLLKNPHRG 442
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVPAIQDAWNV 294
+VG G EG+ N +VY+ + ++ + + VK W+ Y RY ++ ++ AW +
Sbjct: 443 NVVGYGAMPEGLNNNAIVYEFIYDLPWSKGEQSVKDWLTNYLNARYEQKTSDSVFKAWEL 502
Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
L +VY+ T D A+ ++TE K G P K+
Sbjct: 503 LLESVYSTKYWETRWWNDRAGAYLLFKRPTATITEFK----GNPGDKD------------ 546
Query: 355 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
++ AL++ A + N +YDLID +R + +E + ++AYQ
Sbjct: 547 --------KLKEALDILKAEAKKYDKKNFIQYDLIDASRHYYSLSIDEDLVECVKAYQQK 598
Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
D QL ++ + V ++D ++ L W++SA + E K
Sbjct: 599 DITKGDQLFKKIEKQVLEIDKSMSGQPLNSLNYWVKSASEYGSTPEVSK 647
>gi|410096483|ref|ZP_11291470.1| hypothetical protein HMPREF1076_00648 [Parabacteroides goldsteinii
CL02T12C30]
gi|409226447|gb|EKN19356.1| hypothetical protein HMPREF1076_00648 [Parabacteroides goldsteinii
CL02T12C30]
Length = 718
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 147/479 (30%), Positives = 240/479 (50%), Gaps = 66/479 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQ++I+ R+ E G+ PV P +SG VP +
Sbjct: 187 MNNLEGWGGPNPDSWYKQQITLQQRIVKRMREYGIEPVFPGYSGMVPHNAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L TDP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 239 KLG--LNV-SDPGLWCGYHRPAFLQPTDPRFQEIASLYYKELNKLYGK-ANFYSMDPFHE 294
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+ VD + + G AI M+ + AVW+ Q W + ++ ++
Sbjct: 295 GGSVAGVD----LDAAGKAIMQAMKKNNPKAVWVAQAW--------QANPRSQMIENLKA 342
Query: 175 GKLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
G ++VLDLF+E +P W F +I+CML N+ GN+ ++G + +
Sbjct: 343 GDMIVLDLFSESRPQWGDPESTWHRKDGFGQHDWIYCMLLNYGGNVGLHGKMAHVIDEYY 402
Query: 228 EARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVP 286
+A+ S T+ GVGM+MEG E NPV+++L++E+ ++ D W+ Y+V RYG++ P
Sbjct: 403 KAKESSFGKTLCGVGMTMEGSENNPVMFELLTELPWRPVHFDKNEWLKNYTVARYGKANP 462
Query: 287 AIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
+Q+AW +L +++YNC T + + + A P P ++S
Sbjct: 463 TVQEAWILLSNSIYNCPPENTQQGTHESIFCARPSDHPYLVSSW---------------- 506
Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
SE S Y Y+ +VIRA + ++ ++ + +N + YDL+D+ RQA+A+ L
Sbjct: 507 --SEMSDY------YNPDDVIRAAAMMVSVADQFTGNNNFEYDLVDIVRQAIAE-KGRLV 557
Query: 405 LNIIEA-YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
++EA + D + RFL+L+ D LL F +G W+ + L E++
Sbjct: 558 EKVVEASFASGDKQLYNTAANRFLQLLLLQDELLGTRPEFKVGNWIARTRSLGNTPEEK 616
>gi|210631701|ref|ZP_03296968.1| hypothetical protein COLSTE_00853, partial [Collinsella stercoris
DSM 13279]
gi|210159960|gb|EEA90931.1| F5/8 type C domain protein, partial [Collinsella stercoris DSM
13279]
Length = 1906
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 142/465 (30%), Positives = 228/465 (49%), Gaps = 37/465 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP SW +Q++ L ++I R+ G++PV+ F G VP Q P++
Sbjct: 354 MQNLYSVGGPLPDSWFEQRVELARRIHDRMQTYGIDPVIQGFGGQVPTDFQQKNPNSVAA 413
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G+W S + P TYL DA + F ++G F E Q + +G+ SH Y D F
Sbjct: 414 SSGSW-SGFARPYMIKTYLTDADRAAGKEDYFQKVGTTFYEAQERIFGKVSHFYAVDPFH 472
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E V I + + M D AVW+MQ W + D L
Sbjct: 473 EG-GTVPQGFNIVDIYRTVQQKMLDYDPQAVWVMQQWQWGIDE-------NKLSGLAKKE 524
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+ +VLDL ++++ ++ + VP++W MLHNF G + M G+ + +A +A S N
Sbjct: 525 QSLVLDLQSDLRS-QASPMENQQVPWVWNMLHNFGGRMGMDGVPEVLAIKIPQAYNS-NR 582
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
M G+G++ E I+ +P+VY+L+ +M ++ + VD +AW Y RRYG + IQ+AW++L
Sbjct: 583 YMRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRAWTRSYIERRYGGTDAKIQEAWDIL 642
Query: 296 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
T Y DG + SI++ +P + S++ H
Sbjct: 643 LDTAYKHVDGEY---------YQGASESIMNA---------RPSDNKI---GSASTWGHS 681
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
+ Y E RA +LFI S + S +RYD +D+ RQ LA E +AY+ D
Sbjct: 682 DIDYDKKEFERAAQLFIESYDTYKDSEAFRYDFVDVMRQVLANAFQEYQPLAGDAYKQRD 741
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
A L+ + LE+++ D +L+ F+LG W+E+A+ L ++ +
Sbjct: 742 AERFELLANQMLEMLDAQDRMLSTSSDFMLGTWIENARTLLEDAD 786
>gi|423226735|ref|ZP_17213200.1| hypothetical protein HMPREF1062_05386 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392627008|gb|EIY21049.1| hypothetical protein HMPREF1062_05386 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 718
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 142/474 (29%), Positives = 233/474 (49%), Gaps = 57/474 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
M+NL GWGGP P SW QQ LQKKIL R+ E G+ PV P +SG VP A + + +
Sbjct: 189 MNNLEGWGGPNPDSWYVQQEALQKKILKRMREYGIKPVFPGYSGMVPHDADEKLGLNLTK 248
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
+ L N F+ + L TD F EI + ++Q K +G+ + Y+ D F E N
Sbjct: 249 SDLWNGFTRPA--------FLQPTDVRFAEIADLYYQEQEKLFGKVDY-YSMDPFHEAEN 299
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
VD + G AI + M+ + A W++QGW + +P RP +K + N G L
Sbjct: 300 AASVD----FDAAGKAIMAAMKKVNPKATWVVQGW--TENP--RPEMIKNMQN----GDL 347
Query: 178 VVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
++LDLF+E +P IW K + +++CML NF GN+ ++G +D + +
Sbjct: 348 LILDLFSECRPMWGIPSIWKRDKGYEQHNWLFCMLLNFGGNVGLHGRMDQLLDNFYLTKN 407
Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
+ + G+G++MEG E NP++++LM E+ ++ EK + W+ Y RYG I+
Sbjct: 408 NPLAVHLKGIGLTMEGAENNPMMFELMCELPWRPEKFTKEEWLKDYLFARYGVRDEKIEK 467
Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
AW +L +T+YNC G + + + P ++ + + + K +NY P E
Sbjct: 468 AWTLLANTIYNCPFGNNQQGPHESIFCGRPSLN-NFQASSWSKMKNYYDPTVTE------ 520
Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
A L + ++ +N + YDL+D+ RQ+L+ ++ I
Sbjct: 521 -----------------EAARLMVEVADKYRGNNNFEYDLVDIVRQSLSDKGRIVYNRTI 563
Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
++ D + SR+FL+++ D LL F +G W+E A+ L E++
Sbjct: 564 ADFKSFDKRSFARDSRKFLDILLLQDKLLGTRSEFRVGRWIEQARNLGTTPEEK 617
>gi|224537466|ref|ZP_03678005.1| hypothetical protein BACCELL_02345 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520904|gb|EEF90009.1| hypothetical protein BACCELL_02345 [Bacteroides cellulosilyticus
DSM 14838]
Length = 721
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 142/474 (29%), Positives = 233/474 (49%), Gaps = 57/474 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
M+NL GWGGP P SW QQ LQKKIL R+ E G+ PV P +SG VP A + + +
Sbjct: 192 MNNLEGWGGPNPDSWYVQQEALQKKILKRMREYGIKPVFPGYSGMVPHDADEKLGLNLTK 251
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
+ L N F+ + L TD F EI + ++Q K +G+ + Y+ D F E N
Sbjct: 252 SDLWNGFTRPA--------FLQPTDVRFAEIADLYYQEQEKLFGKVDY-YSMDPFHEAEN 302
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
VD + G AI + M+ + A W++QGW + +P RP +K + N G L
Sbjct: 303 AASVD----FDAAGKAIMAAMKKVNPKATWVVQGW--TENP--RPEMIKNMQN----GDL 350
Query: 178 VVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
++LDLF+E +P IW K + +++CML NF GN+ ++G +D + +
Sbjct: 351 LILDLFSECRPMWGIPSIWKRDKGYEQHNWLFCMLLNFGGNVGLHGRMDQLLDNFYLTKN 410
Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
+ + G+G++MEG E NP++++LM E+ ++ EK + W+ Y RYG I+
Sbjct: 411 NPLAVHLKGIGLTMEGAENNPMMFELMCELPWRPEKFTKEEWLKDYLFARYGVRDEKIEK 470
Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
AW +L +T+YNC G + + + P ++ + + + K +NY P E
Sbjct: 471 AWTLLANTIYNCPFGNNQQGPHESIFCGRPSLN-NFQASSWSKMKNYYDPTVTE------ 523
Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
A L + ++ +N + YDL+D+ RQ+L+ ++ I
Sbjct: 524 -----------------EAARLMVEVADKYRGNNNFEYDLVDIVRQSLSDKGRIVYNRTI 566
Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
++ D + SR+FL+++ D LL F +G W+E A+ L E++
Sbjct: 567 ADFKSFDKRSFARDSRKFLDILLLQDKLLGTRSEFRVGRWIEQARNLGTTPEEK 620
>gi|18309848|ref|NP_561782.1| alpha-N-acetylglucosaminidase [Clostridium perfringens str. 13]
gi|18144526|dbj|BAB80572.1| probable alpha-N-acetylglucosaminidase [Clostridium perfringens
str. 13]
gi|288872041|dbj|BAI70446.1| alpha-N-acetylglucosaminidase [Clostridium perfringens]
Length = 2104
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 140/463 (30%), Positives = 230/463 (49%), Gaps = 41/463 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F + F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ ++L+ +MA+ ++++ + W Y RRYG++ I DAWN++
Sbjct: 603 VGIGITPEAINTNPLAHELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
+S +FLEL++ + +L+ FL+G W+E A+ + ++ +
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSD 802
>gi|189465172|ref|ZP_03013957.1| hypothetical protein BACINT_01517 [Bacteroides intestinalis DSM
17393]
gi|189437446|gb|EDV06431.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides intestinalis DSM
17393]
Length = 723
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 142/474 (29%), Positives = 233/474 (49%), Gaps = 57/474 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
M+NL GWGGP P SW QQ LQKKIL R+ E G+ PV P +SG VP A + + +
Sbjct: 192 MNNLEGWGGPNPDSWYAQQEALQKKILKRMREYGIKPVFPGYSGMVPHDADEKLGLNLTK 251
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD--EN 117
+ L N F+ + L TD F EI + +Q K +G+ + Y+ D F EN
Sbjct: 252 SDLWNGFTRPA--------FLQPTDARFAEIADLYYREQEKLFGKADY-YSMDPFHEAEN 302
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
VD + G AI + M+ + A W++QGW + +P RP +K + N G L
Sbjct: 303 AASVD----FDAAGKAIMTAMKKVNPKATWVVQGW--TENP--RPEMIKNMQN----GDL 350
Query: 178 VVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
++LDLF+E +P IW K + +++CML NF GN+ ++G +D + +
Sbjct: 351 LILDLFSECRPMWGIPSIWKRDKGYEQHDWLFCMLLNFGGNVGLHGRMDQLLNNFYLTKN 410
Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
+ T + G+G++MEG E N ++++LM E+ ++ EK + W+ Y RYG I+
Sbjct: 411 NPLATHLKGIGLTMEGSENNAMMFELMCELPWRPEKFTKEEWLKDYLFARYGVRDEKIEQ 470
Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
AW +L +T+YNC G + + + P ++ + + + K +NY P E
Sbjct: 471 AWTLLANTIYNCPFGNNQQGPHESIFCGRPSLN-NFQASSWSKMKNYYDPTVTE------ 523
Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
A L + ++ +N + YDL+D+ RQ+L+ ++ I
Sbjct: 524 -----------------EAARLMLEVADKYRGNNNFEYDLVDIVRQSLSDKGRIVYNRTI 566
Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
++ D + SR+FL+++ D LL F +G W+E A++L E++
Sbjct: 567 ADFKSFDKRSFARDSRKFLDILLLQDKLLGTRSEFRVGRWIEQARKLGTTPEEK 620
>gi|423214208|ref|ZP_17200736.1| hypothetical protein HMPREF1074_02268 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693153|gb|EIY86388.1| hypothetical protein HMPREF1074_02268 [Bacteroides xylanisolvens
CL03T12C04]
Length = 718
Score = 215 bits (547), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 137/468 (29%), Positives = 230/468 (49%), Gaps = 54/468 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP A P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEAFAQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAGYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNAMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + + + +SK +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVVPDQRR-------ISKIDL- 532
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++A+ L+ + +EL +S YR DLI+ +A A +
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
++ N + ++ ++L+ D+D LLA H + L W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELAR 625
>gi|160884066|ref|ZP_02065069.1| hypothetical protein BACOVA_02042 [Bacteroides ovatus ATCC 8483]
gi|423291473|ref|ZP_17270321.1| hypothetical protein HMPREF1069_05364 [Bacteroides ovatus
CL02T12C04]
gi|156110408|gb|EDO12153.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
8483]
gi|392663473|gb|EIY57023.1| hypothetical protein HMPREF1069_05364 [Bacteroides ovatus
CL02T12C04]
Length = 718
Score = 215 bits (547), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 137/468 (29%), Positives = 230/468 (49%), Gaps = 54/468 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP A P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEAFAQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMTIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + + + +SK +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKIDL- 532
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++A+ L+ + +EL +S YR DLI+ +A A +
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
++ N + ++ ++L+ D+D LLA H + L W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELAR 625
>gi|237719039|ref|ZP_04549520.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|229451817|gb|EEO57608.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
Length = 718
Score = 214 bits (545), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 136/468 (29%), Positives = 229/468 (48%), Gaps = 54/468 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + + + +SK +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKIDL- 532
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++A+ L+++ +EL S YR DLI+ +A A +
Sbjct: 533 ---------------SDDYLQAIRLYVSCADELKGSELYRNDLIEFVSYYVAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
++ N + ++ ++L+ D+D LLA H + L W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDKLLASHPLYRLEEWVELAR 625
>gi|150004413|ref|YP_001299157.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
gi|149932837|gb|ABR39535.1| glycoside hydrolase family 89 [Bacteroides vulgatus ATCC 8482]
Length = 717
Score = 214 bits (545), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 153/480 (31%), Positives = 238/480 (49%), Gaps = 67/480 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 187 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + + K YG+T Y D F E
Sbjct: 239 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 294
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W +P R P ++ +
Sbjct: 295 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 342
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSI--AFG 225
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ F
Sbjct: 343 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYCMLLNFGGNIGLHGKMDALIDGFY 402
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A T+ GVGM+ EGIE NPV+Y+L+ E+ ++ + W+ Y RYG
Sbjct: 403 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 462
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
A+Q W++L + +YN + + V A P +D YQ
Sbjct: 463 EALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 503
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 504 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 558
Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 559 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 617
>gi|294775488|ref|ZP_06741000.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
gi|294450633|gb|EFG19121.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
Length = 712
Score = 214 bits (545), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 153/480 (31%), Positives = 238/480 (49%), Gaps = 67/480 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 182 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 233
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + + K YG+T Y D F E
Sbjct: 234 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 289
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W +P R P ++ +
Sbjct: 290 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 337
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSI--AFG 225
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ F
Sbjct: 338 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYCMLLNFGGNIGLHGKMDALIDGFY 397
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A T+ GVGM+ EGIE NPV+Y+L+ E+ ++ + W+ Y RYG
Sbjct: 398 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 457
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
A+Q W++L + +YN + + V A P +D YQ
Sbjct: 458 EALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 498
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 499 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 553
Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 554 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 612
>gi|393786624|ref|ZP_10374756.1| hypothetical protein HMPREF1068_01036 [Bacteroides nordii
CL02T12C05]
gi|392657859|gb|EIY51489.1| hypothetical protein HMPREF1068_01036 [Bacteroides nordii
CL02T12C05]
Length = 717
Score = 214 bits (545), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 147/477 (30%), Positives = 232/477 (48%), Gaps = 61/477 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P W QQ+ LQKKIL R++E G+ PVLP + G VP +
Sbjct: 187 MNNLEGWGGPNPDHWYTQQVSLQKKILKRMHEYGIEPVLPGYCGMVPHNAK--------A 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L D F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 239 KLG--LNV-SDPGVWCGYRRPAFLQPDDSRFEEISSLYYKELEKLYGKANY-YSMDPFHE 294
Query: 117 NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
+D + ++G A+ M+ + AVW++Q W + P L+ ++ G
Sbjct: 295 GGS-IDGVN-LDAVGKAVMKAMKKANPKAVWVIQAWQANPRP--------ELIRNLETGD 344
Query: 177 LVVLDLFAEVKPIWST------SKQFYGVP-YIWCMLHNFAGNIEMYGILDSIAFGPVEA 229
L++LDL +E +P W K YG +++CML N+ N+ ++G +D++ A
Sbjct: 345 LLILDLTSECRPQWGDPESEWYRKDGYGKHNWVYCMLLNYGANVGLHGKMDNVIDNYYLA 404
Query: 230 RTS--ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA 287
+ + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ E+ + W+ Y RYG+ P
Sbjct: 405 KENLRARATLKGVGMTPEGIENNPVMYELLMELPWRPERFTKEDWLKGYVKARYGKDEPV 464
Query: 288 IQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
+Q AW L +++YN T + + V A P +D YQ
Sbjct: 465 LQLAWGKLANSIYNAPKELTQQGTHESVFCARPGLDV---------YQ------------ 503
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
SS+ +Y EVI A L ++ + + + YDL+D+ RQA+A+ +
Sbjct: 504 ---VSSWSEMKDYYDPQEVIEAARLMVSVADRYRGNTNFEYDLVDVVRQAIAEKGRLMQK 560
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ AY+ D S++FL L+ D LL F LG W+ SA+ L E++
Sbjct: 561 AVTTAYRAGDKELFAMASQKFLNLILLQDQLLGTRTEFRLGRWINSARALGVTPEEK 617
>gi|345519733|ref|ZP_08799147.1| glycoside hydrolase family 89 [Bacteroides sp. 4_3_47FAA]
gi|345457107|gb|EET15964.2| glycoside hydrolase family 89 [Bacteroides sp. 4_3_47FAA]
Length = 717
Score = 214 bits (545), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 153/480 (31%), Positives = 238/480 (49%), Gaps = 67/480 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 187 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + + K YG+T Y D F E
Sbjct: 239 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 294
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W +P R P ++ +
Sbjct: 295 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 342
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSI--AFG 225
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ F
Sbjct: 343 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYCMLLNFGGNIGLHGKMDALIDGFY 402
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A T+ GVGM+ EGIE NPV+Y+L+ E+ ++ + W+ Y RYG
Sbjct: 403 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 462
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
A+Q W++L + +YN + + V A P +D YQ
Sbjct: 463 EALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 503
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 504 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 558
Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 559 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 617
>gi|295085513|emb|CBK67036.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
XB1A]
Length = 718
Score = 214 bits (545), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 137/468 (29%), Positives = 229/468 (48%), Gaps = 54/468 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +WST + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWSTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + + + +SK +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKIDL- 532
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++A+ L+ + +EL +S YR DLI+ +A A +
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
++ N + ++ ++L+ D+D LLA H + L W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELAR 625
>gi|319643377|ref|ZP_07998003.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_40A]
gi|317385006|gb|EFV65959.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_40A]
Length = 718
Score = 214 bits (545), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 153/480 (31%), Positives = 238/480 (49%), Gaps = 67/480 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + + K YG+T Y D F E
Sbjct: 240 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W +P R P ++ +
Sbjct: 296 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 343
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSI--AFG 225
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ F
Sbjct: 344 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYCMLLNFGGNIGLHGKMDALIDGFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A T+ GVGM+ EGIE NPV+Y+L+ E+ ++ + W+ Y RYG
Sbjct: 404 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
A+Q W++L + +YN + + V A P +D YQ
Sbjct: 464 EALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 560 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 618
>gi|404487024|ref|ZP_11022211.1| hypothetical protein HMPREF9448_02667 [Barnesiella intestinihominis
YIT 11860]
gi|404335520|gb|EJZ61989.1| hypothetical protein HMPREF9448_02667 [Barnesiella intestinihominis
YIT 11860]
Length = 722
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 147/476 (30%), Positives = 228/476 (47%), Gaps = 58/476 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+GW GPL W +Q+ LQ KIL R+ ELGM+P+ PAF+G VP A P +
Sbjct: 200 MGNLNGWDGPLTNGWQKEQIKLQHKILNRMRELGMDPIAPAFAGFVPTAFAERHPEIQFK 259
Query: 61 QLGNW--FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
L W F K + Y+L P F EIG+ FIE+ KE+G+ ++ Y D+F+E
Sbjct: 260 HL-EWGGFDEKYN-----AYVLPPETPYFKEIGKLFIEEWEKEFGKNTY-YLSDSFNEMK 312
Query: 119 PPV------DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNS 171
PV + ++ G +IY + +G+ DAVW+ QGW F Y FW ++ALL+
Sbjct: 313 LPVAEGDDDGKHKLLAQYGESIYHSIAAGNPDAVWVTQGWTFGYQHDFWDKASLQALLSR 372
Query: 172 VPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
VP K++++DL + K +W T + FYG +I+ + NF G M G L A
Sbjct: 373 VPDDKMIIIDLGNDYPKWVWGTEQTWKNHDGFYGKKWIFSYVPNFGGKTPMTGDLQMYAT 432
Query: 225 GPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
EA S N +VG G + EG+E N VVY+L+++M + + +D+ +W+ Y RYG
Sbjct: 433 SSAEALHSANAGNLVGFGSAPEGLENNEVVYELLADMGWTADSIDLDSWLPVYCKARYGG 492
Query: 284 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
A+ AW T Y+ +V PD + +SK
Sbjct: 493 CPAAMDSAWQRFKETAYSSLYSYPRFTWQTVV--PDT----------------RRISKLD 534
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
V S ++ +ELF++ + L +S Y D I+ LA A++
Sbjct: 535 VSDS----------------FLQGVELFLSCADSLESSPLYVNDAIEYASYYLAAKADDC 578
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
+ ++ L + Q R +E++ D+D LLA H + L W++ A+ + +
Sbjct: 579 YKRALKEDSLGNRVAAMQQLDRSVEILLDVDKLLASHPLYRLEEWVDMARDWGKTD 634
>gi|293371915|ref|ZP_06618319.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
gi|292633161|gb|EFF51738.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
Length = 718
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 137/468 (29%), Positives = 228/468 (48%), Gaps = 54/468 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + +S + +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTV-------------ISDQRRI 527
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
S D+ ++A+ L+ + +EL +S YR DLI+ +A A +
Sbjct: 528 SKIDLSDDY----------LQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
++ N + ++ ++L+ D+D LLA H + L W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELAR 625
>gi|429740221|ref|ZP_19273923.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
gi|429153946|gb|EKX96707.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
Length = 721
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 146/480 (30%), Positives = 233/480 (48%), Gaps = 56/480 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W QQ+ LQ KIL R+ LGM+P+ PAF+G VP + P ++
Sbjct: 197 MGNLNTWNGPLSANWHSQQIALQHKILERMRLLGMHPITPAFAGFVPEGFVKLHPEVRVK 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
F + Y+L P F++IG+ FIE+ KE+ + ++ Y D+F+E P
Sbjct: 257 H----FEWGGFDKSLNAYMLPPDSPYFLQIGKLFIEEWEKEFSKNTY-YLSDSFNEMELP 311
Query: 121 VDSPE-------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSV 172
V SP+ +S G AIY + +G+ +AVW+ QGW F Y FW ++ALL V
Sbjct: 312 V-SPDDTDGKHRLLSKYGEAIYQSIVAGNPNAVWITQGWTFGYQHRFWDKESLQALLERV 370
Query: 173 PLGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
P KL+++DL + + W T K FYG +I + NF G + G L+ A
Sbjct: 371 PNDKLIIVDLANDYPKWVWKTEQTWKTHKGFYGKRWILSYVPNFGGKTLLTGDLNLYASC 430
Query: 226 PVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS 284
EA + ++G G + EG+E N VVY+L+++M +Q++ +D+ W+ +Y RYG
Sbjct: 431 SAEALAHPDKGRLIGFGSAPEGLENNEVVYELLADMGWQNQPIDLDHWLIEYCRSRYGSC 490
Query: 285 VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
A+Q AW L +VY+ S+ S +Q V + +
Sbjct: 491 PNAMQKAWKGLCRSVYS---------------------SLYSYPRFTWQT----VIPDTL 525
Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
KS+ YD ++ RA+E F+ +L S YR D + Q + A+ L+
Sbjct: 526 RKSK---YDFNDTYF------RAVEDFLLCAPQLKDSPLYRSDALLFAAQYIGAKADNLY 576
Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
++A + + QL + ++L+ D LLA H L W+++A+ A ++ Q
Sbjct: 577 RKALQAKAVGNRARAKQLVDKVIQLLLQADKLLASHPTDRLSRWVDAARTAAATPQERMQ 636
>gi|336404352|ref|ZP_08585050.1| hypothetical protein HMPREF0127_02363 [Bacteroides sp. 1_1_30]
gi|335943680|gb|EGN05519.1| hypothetical protein HMPREF0127_02363 [Bacteroides sp. 1_1_30]
Length = 718
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 137/468 (29%), Positives = 228/468 (48%), Gaps = 54/468 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + +S + +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTV-------------ISDQRRI 527
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
S D+ ++A+ L+ + +EL +S YR DLI+ +A A +
Sbjct: 528 SKIDLSDDY----------LQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
++ N + ++ ++L+ D+D LLA H + L W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELAR 625
>gi|299144719|ref|ZP_07037787.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
gi|298515210|gb|EFI39091.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
Length = 718
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 136/468 (29%), Positives = 229/468 (48%), Gaps = 54/468 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKADKEAKYKLLAEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + + + +SK +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVVPDQRR-------ISKIDL- 532
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++A+ L+ + +EL +S YR DLI+ +A A +
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
++ N + ++ ++L+ D+D LLA H + L W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELAR 625
>gi|170292392|pdb|2VC9|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
In Complex With 2-Acetamido-1,2-Dideoxynojirmycin
gi|170292393|pdb|2VCA|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
In Complex With Beta-N-Acetyl-D-Glucosamine
gi|170292394|pdb|2VCB|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
In Complex With Pugnac
gi|170292395|pdb|2VCC|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
Length = 891
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 136/462 (29%), Positives = 229/462 (49%), Gaps = 39/462 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + A+
Sbjct: 344 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTI 403
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 404 SQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 462
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 463 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 511
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A + + + MV
Sbjct: 512 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLA-TEIPKALANSEHMV 569
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I +AWN++ T
Sbjct: 570 GIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILDT 629
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
Y K D + SII+ G +G +KS S++ H +
Sbjct: 630 AYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIV 666
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 667 YDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEK 726
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
+S +FLEL++ + +L+ FL+G W+E A+ + ++ +
Sbjct: 727 FKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSD 768
>gi|261880159|ref|ZP_06006586.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
gi|270333130|gb|EFA43916.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
Length = 772
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 143/485 (29%), Positives = 229/485 (47%), Gaps = 65/485 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGPLP +W QQ LQK+IL R ELGM+PVLP + G +P + +T
Sbjct: 193 MNNLEGWGGPLPDAWYAQQEALQKRILKREKELGMSPVLPGYCGMMPHDAKAKL-GLDVT 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W L ATDP F I + + + YG+ + Y+ D F E+ P
Sbjct: 252 DGGTWNGYTRPAN------LSATDPKFDHIADLYYRELTRLYGKADY-YSMDPFHES--P 302
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D+ + G + + M+ + + W++QGW+ +P RP ++AL P G +++L
Sbjct: 303 DDASVDYAEAGRKLLAAMKRANGKSNWVIQGWM--ENP--RPQMIEAL----PEGDIIIL 354
Query: 181 DLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD------SIAFGPVE 228
DLF+E +P IW + + +++CML NF N+ ++G +D +A P
Sbjct: 355 DLFSECRPMFGAPSIWQRKEGYGRHNWLFCMLENFGANVGLHGRMDQLVHNFKLAASPST 414
Query: 229 ARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF--------QHEKVDVK-AWINQYSVR 279
+ + G+G +MEG E NP++++LMSE+ + + ++ D K W Y
Sbjct: 415 PYQNARKHLKGIGFTMEGSENNPIMFELMSELVWRANDLVSAERDRRDFKEGWTRNYVKA 474
Query: 280 RYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGK 337
RYG P IQ+AW +L ++YNC G + + + P +D + + K +NY
Sbjct: 475 RYGIDNPKIQEAWQLLIGSIYNCPVGNNQQGPHESIFNGRPSLDNFQVK-SWSKMRNY-- 531
Query: 338 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 397
Y + +RA +L + + +N + YDL+D+ RQA+
Sbjct: 532 ---------------------YDPNVTLRAAQLMTSVADRYRGNNNFEYDLVDIVRQAMD 570
Query: 398 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 457
A +L I Y+ D S RFL ++ D LL F LG +E A+ L+
Sbjct: 571 DQARLQYLRTIADYKGFDRTAFSADSARFLNMLLLQDKLLGTRQEFRLGTRIEQARSLST 630
Query: 458 NEEQE 462
E++
Sbjct: 631 TLEEK 635
>gi|110800516|ref|YP_695309.1| alpha-N-acetylglucosaminidase [Clostridium perfringens ATCC 13124]
gi|110675163|gb|ABG84150.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens ATCC 13124]
Length = 2095
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 139/463 (30%), Positives = 230/463 (49%), Gaps = 41/463 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + A+
Sbjct: 369 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTI 428
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 429 SQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 487
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 488 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 536
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 537 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 593
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I +AWN++
Sbjct: 594 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILD 653
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 654 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 690
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 691 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 750
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
+S +FLEL++ + +L+ FL+G W+E A+ + ++ +
Sbjct: 751 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSD 793
>gi|423293381|ref|ZP_17271508.1| hypothetical protein HMPREF1070_00173 [Bacteroides ovatus
CL03T12C18]
gi|392678324|gb|EIY71732.1| hypothetical protein HMPREF1070_00173 [Bacteroides ovatus
CL03T12C18]
Length = 718
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 136/468 (29%), Positives = 229/468 (48%), Gaps = 54/468 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPITPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + + + +SK +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKIDL- 532
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++A+ L+ + +EL +S YR DLI+ +A A +
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
++ N + ++ ++L+ D+D LLA H + L W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELAR 625
>gi|298480124|ref|ZP_06998323.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
gi|298273933|gb|EFI15495.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
Length = 718
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 136/468 (29%), Positives = 229/468 (48%), Gaps = 54/468 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAGYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + + + +SK +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKIDL- 532
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++A+ L+ + +EL +S YR DLI+ +A A +
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
++ N + ++ ++L+ D+D LLA H + L W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELAR 625
>gi|383115207|ref|ZP_09935965.1| hypothetical protein BSGG_2911 [Bacteroides sp. D2]
gi|313695376|gb|EFS32211.1| hypothetical protein BSGG_2911 [Bacteroides sp. D2]
Length = 718
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 136/468 (29%), Positives = 228/468 (48%), Gaps = 54/468 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + + + +SK +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVVPDQRR-------ISKIDL- 532
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++A+ L+ + +EL S YR DLI+ +A A +
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKGSELYRNDLIEFVSYYVAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
++ N + ++ ++L+ D+D LLA H + L W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELAR 625
>gi|322699924|gb|EFY91682.1| alpha-N-acetylglucosaminidase, putative [Metarhizium acridum CQMa
102]
Length = 775
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 131/430 (30%), Positives = 226/430 (52%), Gaps = 36/430 (8%)
Query: 5 HGWGGP--LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQL 62
WGG LP ++++ Q LQKKI+ R+ ELG+ PVLPAF G VP +++ V P +T
Sbjct: 211 RSWGGKGDLPLAFIELQFELQKKIVARMVELGITPVLPAFPGFVPESIKKVRPDVNLTVS 270
Query: 63 GNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVD 122
NWF+ D ++ LD D + E+ R F+ +Q+ +G ++IY D F+E +P
Sbjct: 271 PNWFAPAPD-KYTRDLFLDPLDDTYAELQRLFVSKQMDAFGNVTNIYTLDQFNELSPASG 329
Query: 123 SPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLGK-LVVL 180
Y+ + Y+G+ + + AVWL+QGWL FS FW P++ A L V + ++VL
Sbjct: 330 DTAYLRGIARNTYAGLTAANPAAVWLLQGWLFFSSRRFWTQPRIDAYLGGVEDDQGMLVL 389
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL++E P W + + G P+IWC LH+F GN+ + G + ++ P++A +++ ++VG
Sbjct: 390 DLYSEANPQWQRTNSYSGKPWIWCQLHDFGGNMALEGRVQTLTSAPIDA-LAQSESLVGF 448
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPA-IQDAWNVLYH 297
G++ E E N VVYD++ + A+ +D + + + +RY S+P+ + AW +L
Sbjct: 449 GLTPEAYEGNEVVYDILLDQAWSATPLDTQTYFASWVTKRYAGVSSIPSELYRAWEMLRT 508
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VY+ T TD + V VA + P++ + + T + HP
Sbjct: 509 DVYSNT--RTDIPQ-VPVATYQLRPALSGIA------------------NRTGHFPHPTA 547
Query: 358 WYSTSEVIRA-----LELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
+ V++ LE G+ L ++ D +D++RQ L+ + L+ +++ AY+
Sbjct: 548 LHYDPLVLQEAWKLMLEAMTRQGS-LWKVPAFQLDFVDVSRQMLSNQFDVLYADLVNAYK 606
Query: 413 LNDAHGVFQL 422
+ A G +L
Sbjct: 607 CSAAGGSREL 616
>gi|32564213|ref|NP_496948.2| Protein K09E4.4 [Caenorhabditis elegans]
gi|25814792|emb|CAB70170.2| Protein K09E4.4 [Caenorhabditis elegans]
Length = 715
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 139/471 (29%), Positives = 224/471 (47%), Gaps = 44/471 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GG L + + L K+I+ R+ ELG+ P+LP F+G VP L+ +FP++K
Sbjct: 210 MGNLKAYGGGLSDAQMLNDHNLAKRIIDRLLELGITPILPTFAGFVPDHLETLFPASKFN 269
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTP 119
+L W + S+ C + DPLF +IG F+ Q K +G +++Y+ D F+E P
Sbjct: 270 RLPRWNNFTSET--SCMLSVSPFDPLFQKIGSTFLRHQKKMFGGDVTNMYSADPFNEILP 327
Query: 120 PVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
+ +++ AI + + D + VW++Q W F+YD W +K+ L+++P+G
Sbjct: 328 SESAKFDAKFVKQTAQAIMNSCKKVDKNCVWVLQSWSFTYDQ-WPAWAIKSFLSAIPVGN 386
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
L++LDL+AEV P W + F G ++WC+LHNF G+ E+ G L I G A +
Sbjct: 387 LLILDLYAEVVPAWQMTSSFQGHHFVWCLLHNFGGSRELRGNLQKIDKGYQLALMKAGSN 446
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
+VG G+SME I+QN VVY M + + E + + W+ YS RY Q W +L
Sbjct: 447 LVGAGLSMEAIDQNYVVYQFMIDRMWSPEPLPLNNWLKAYSESRYSADFKVAQKFWTLLA 506
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
T YN + V + Y +P +
Sbjct: 507 GTFYNQPEKWGTPRFSVFL-------------------YHRPGFGRKI-----------E 536
Query: 357 LWYSTSEVI-RALELFIASGNELSASNTYRYDLIDLTRQALA-KYANELFLNIIEAYQLN 414
W+ E R EL A + L +R DL D+ R+ + NE L++ EA+ +
Sbjct: 537 YWFPVEETFSRFRELLPALVHVLGEHPLFREDLNDVMREMTQFEMGNEAALSMSEAFLME 596
Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
D V +E+ + ++ + + W+E+AK +A E E+QV
Sbjct: 597 DKQQVGASCEMLMEMFQKLES----YSNRDVRQWIENAKSIAPTSE-ERQV 642
>gi|293369246|ref|ZP_06615836.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
gi|292635671|gb|EFF54173.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
Length = 521
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 111/301 (36%), Positives = 172/301 (57%), Gaps = 6/301 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GW PLP+ WL Q LQ++I+ R E M PVLPAF+G+VPAAL+ V+P+ K T
Sbjct: 196 MCNLDGWQSPLPKEWLSSQAALQEQIVAREREFNMRPVLPAFAGHVPAALKRVYPNIKTT 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R CT+L + D L+ I + ++ +Q + YG T+HIY D F+E PP
Sbjct: 256 RVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTEQTRLYG-TNHIYGIDPFNEIDPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ + + IY + + D +AVWL WLF D W P++K+ L SVP +L++
Sbjct: 312 SWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYADIKHWTTPRIKSYLRSVPQDRLIL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD F E IW + ++G PY+WC L NF GN + G ++ ++ +A + + + G
Sbjct: 372 LDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLSGPVNLVSERLADALKNGGSNLKG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG ++EGI+ N +Y+ + + A+ + D K W + + RR G+ P + AW +L + V
Sbjct: 432 VGSTLEGIDLNQFMYEFVLDKAWNGGQTD-KEWFFKLADRRIGKISPEARKAWEILANKV 490
Query: 300 Y 300
Y
Sbjct: 491 Y 491
>gi|341892319|gb|EGT48254.1| hypothetical protein CAEBREN_28412 [Caenorhabditis brenneri]
Length = 713
Score = 211 bits (537), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 136/472 (28%), Positives = 227/472 (48%), Gaps = 46/472 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GG L + + L L K+I+ R+ ELG+ P+LP F+G VP L+ +FPS+K T
Sbjct: 208 MGNLKAYGGGLSDAQMLNDLNLAKRIINRLLELGITPILPTFAGFVPDQLEKLFPSSKFT 267
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTP 119
+L W + S+ C + DPLF +IG F+ Q K +G +++Y+ D F+E P
Sbjct: 268 RLPCWNNFTSET--SCLLSVSPFDPLFQKIGSLFLRHQKKMFGGDITNLYSADPFNEILP 325
Query: 120 PVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
+ +++ AI + + D + +W++Q W F+YD W +K+ L++VP+G
Sbjct: 326 SDSAKFDAKFVKQTAQAIMNSCRKVDKNCIWVLQSWSFTYDE-WPSWAIKSFLSAVPIGN 384
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
L++LDL++EV P W ++ F+G YIWCMLH+F G+ E+ G L + G A +
Sbjct: 385 LLILDLYSEVVPAWQSTSSFHGHNYIWCMLHSFGGSRELRGNLQKVDKGYQLALMKGGSN 444
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
++G G++ME I+QN V+Y M + + E + + WI YS RY W +L
Sbjct: 445 LIGAGLTMEAIDQNYVIYQFMVDRMWSSEPLPLNTWIKSYSESRYSADFKVSHKFWTLLA 504
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
+ YN + + V + Y +P + +
Sbjct: 505 FSFYNQPEKWGNPRFSVFL-------------------YHRPAFGKKI-----------E 534
Query: 357 LWYSTSEVIRALELFIASG-NELSASNTYRYDLIDLTRQALAKY--ANELFLNIIEAYQL 413
W+ E L+ I S + L ++ DL D+ R A+ ++ N+ L + EA+ +
Sbjct: 535 YWFPVEETFGHLQSLIPSLIHVLGDHPLFKEDLNDVMR-AITQFEVGNDAALTLTEAFLM 593
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
D + + DM L + + W+E +K +A E E+QV
Sbjct: 594 EDKQQIGSTCENLM----DMFLKLESYSNRDMKHWIEDSKSIAATSE-ERQV 640
>gi|383280354|pdb|4A4A|A Chain A, Cpgh89 (E483q, E601q), From Clostridium Perfringens, In
Complex With Its Substrate Glcnac-Alpha-1,4-Galactose
Length = 914
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 134/462 (29%), Positives = 229/462 (49%), Gaps = 39/462 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + A+
Sbjct: 367 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTI 426
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F + NT
Sbjct: 427 SQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHQGGNT 485
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 486 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 534
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A + + + MV
Sbjct: 535 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLA-TEIPKALANSEHMV 592
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G++ + I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I +AWN++ T
Sbjct: 593 GIGITPQAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILDT 652
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
Y K D + SII+ G +G +KS S++ H +
Sbjct: 653 AYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIV 689
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 690 YDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEK 749
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
+S +FLEL++ + +L+ FL+G W+E A+ + ++ +
Sbjct: 750 FKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSD 791
>gi|336412611|ref|ZP_08592964.1| hypothetical protein HMPREF1017_00072 [Bacteroides ovatus
3_8_47FAA]
gi|335942657|gb|EGN04499.1| hypothetical protein HMPREF1017_00072 [Bacteroides ovatus
3_8_47FAA]
Length = 718
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 136/468 (29%), Positives = 228/468 (48%), Gaps = 54/468 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + + + +SK +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKIDL- 532
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++A+ L+ + +EL S YR DLI+ +A A +
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKNSELYRNDLIEFVSYYVAAKAEIFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
++ N + ++ ++L+ D+D LLA H + L W+E A+
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELAR 625
>gi|373461651|ref|ZP_09553390.1| hypothetical protein HMPREF9944_01654 [Prevotella maculosa OT 289]
gi|371951955|gb|EHO69797.1| hypothetical protein HMPREF9944_01654 [Prevotella maculosa OT 289]
Length = 713
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 142/464 (30%), Positives = 212/464 (45%), Gaps = 52/464 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGGP+ Q ++D Q L ++IL R+ LG+ PVL F G V ++++ +P+A +
Sbjct: 194 MGNLEGWGGPVSQDFIDAQSRLGRRILDRMATLGIQPVLQGFYGMVSRSIRDRYPNAVMP 253
Query: 61 QLGNW-FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
Q G W F + D +L T+ LF EI + + K YG H + D F E
Sbjct: 254 Q-GMWGFFERPD-------ILKPTEKLFDEIADTYYREIKKHYGTGFHYFGGDLFHEGGQ 305
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
++ G A+ MQ + W++QGW + +P LL + K++V
Sbjct: 306 T--GTLNVADCGLAVQQAMQRNFPGSTWVLQGWSGNPNPL--------LLTKLDREKVLV 355
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMV 238
+DLF E W+ +K + G P++WC++ NF MYG L IA + R S+ +
Sbjct: 356 VDLFGENDEAWNRTKAYQGTPFLWCIVSNFGEQCGMYGKLQRIALQIDKVRKSDYKAYLK 415
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GVG+ EGI NPVVYD++ K++V+AW+ Y RYG I AW + T
Sbjct: 416 GVGIMPEGINNNPVVYDMVLHAPLTDRKINVEAWLKSYITYRYGSYNADIYAAWLIFLQT 475
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS----KEAVLKSETSSYDH 354
+Y SV E YG P S + V ++TSS+
Sbjct: 476 IY------------------------ASVPE----KYGLPESVFCARPGVKVTQTSSWGV 507
Query: 355 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
+Y + LF+ + S TY YD+ DL RQ + N ++ ++I A
Sbjct: 508 RARYYDMDFFKEGVRLFLKAKTSFEDSETYAYDMFDLLRQVQSDKGNRVYDDMIAAIDAK 567
Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
+ + Q S RFL + D LLA GF L WL A + +
Sbjct: 568 NPNRFEQTSDRFLHELLRQDTLLAQSKGFTLERWLGQASRFGKT 611
>gi|268533054|ref|XP_002631655.1| Hypothetical protein CBG20846 [Caenorhabditis briggsae]
Length = 712
Score = 208 bits (530), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 128/470 (27%), Positives = 228/470 (48%), Gaps = 45/470 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GG L + + L K+I+ R+ ELG+ P+LP FSG VP L+ +FP++K
Sbjct: 207 MGNLKGYGGGLSDAQMLNDFNLAKRIINRLLELGITPILPTFSGFVPDRLEKLFPTSKFN 266
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTP 119
+L W + S+ C + DPLF +IG +F+ Q K G +++Y+ D F+E P
Sbjct: 267 RLPCWNNFTSET--SCLLSVSPFDPLFQKIGSSFLRHQKKMLGGDITNLYSADPFNEVLP 324
Query: 120 PVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
+ +++ AI + + D + +W++Q W F+YD W +K+ L++VP+G+
Sbjct: 325 SDSAKFDAKFVKQTAQAIMNSCRKVDKNCIWVLQSWSFTYDQ-WPNWAIKSFLSAVPIGQ 383
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
+++LDL++EV P W + F+G ++WCMLHNF G+ E+ G + + G A +
Sbjct: 384 MLILDLYSEVVPAWQMTSSFHGHNFVWCMLHNFGGSRELRGNVQKVDKGYQLALMKAGSN 443
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
+VG G+SME I+QN ++Y M + + E + + +W+ YS RY W +L
Sbjct: 444 LVGAGLSMEAIDQNYMMYQFMIDRMWTQEPIPLNSWLKSYSESRYSADFKVAHKFWTILA 503
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
+ YN + + V + Y +P + +
Sbjct: 504 GSFYNQPEKWGNPRFSVFL-------------------YHRPAFGKKI-----------E 533
Query: 357 LWYSTSEVIRALE-LFIASGNELSASNTYRYDLIDLTRQALAKY--ANELFLNIIEAYQL 413
W+ E LE L ++ + L ++ DL D+ R A+ ++ NE L++ EA+ +
Sbjct: 534 YWFPVEETFTHLESLVLSLLHILGDHPLFKEDLNDVMR-AITQFEIGNEAALSLTEAFLM 592
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
D + + + + ++ + + W+E AK +A E+ +
Sbjct: 593 EDKQQIGTTCENLMGMFQKLEP----YSNRDVRDWIEDAKSIAPTTEERE 638
>gi|421734750|ref|ZP_16173809.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum LMG 13195]
gi|407077324|gb|EKE50171.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum LMG 13195]
Length = 1919
Score = 208 bits (529), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 137/462 (29%), Positives = 227/462 (49%), Gaps = 41/462 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP +W +Q++ L ++I R+ G+ PV+ F G VPA Q P++
Sbjct: 362 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 421
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G W P TYL DA + F ++G F + Q +G+ S+ Y D F
Sbjct: 422 SSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVGDTFYKAQENVFGKVSNYYAVDPFH 480
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E D + I + + M D AVW+MQ W + D ++ L + G
Sbjct: 481 EGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 532
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+ +VLDL ++++ +++ + GVP++W MLHNF G + + G+ + I+ +A S
Sbjct: 533 QTLVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 591
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W +Y+ RRYG + I+ AW++L
Sbjct: 592 -MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDIL 650
Query: 296 YHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
T Y TDG + +I A P D +I S S++
Sbjct: 651 LDTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWG 687
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
H + Y + +A LF + + S +RYD +D+ RQ LA E +AY+
Sbjct: 688 HSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKS 747
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
D LS R L++++ D LL+ D FL+G W++ A+ +
Sbjct: 748 GDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTM 789
>gi|294674521|ref|YP_003575137.1| alpha-N-acetylglucosaminidase [Prevotella ruminicola 23]
gi|294472030|gb|ADE81419.1| putative alpha-N-acetylglucosaminidase [Prevotella ruminicola 23]
Length = 754
Score = 208 bits (529), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 139/475 (29%), Positives = 228/475 (48%), Gaps = 54/475 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGPLP SW +Q LQK+IL R+ +LGM+PVLP + G VP + +
Sbjct: 186 MNNLEGWGGPLPTSWYARQEKLQKQILARMKQLGMHPVLPGYCGMVPHDAKEKL-GLNVA 244
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT-- 118
G W + L TD F EI + + K +G+ + Y+ D F E+
Sbjct: 245 DAGLWNGFQRPAN------LLPTDARFSEIATLYYNELTKLFGKADY-YSMDPFHESNDD 297
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
P +D + G A+ M+ + AVW++QGW + +P +A+++ + G L+
Sbjct: 298 PNID----YAKAGQAMMQAMKRVNPKAVWVIQGW--TENP------REAMVDDMKTGDLL 345
Query: 179 VLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYG----ILDSIAFGPVE 228
VLDLF+E +P IW + + +++C+L NF N+ ++G +LD+
Sbjct: 346 VLDLFSECRPMFGIPSIWKREQGYKQHQWLFCLLENFGANVGLHGRMDQLLDNFYMLQSS 405
Query: 229 ARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAI 288
++++ + G+G +MEG E NPV+++LMSE+ ++ EK + W+ Y RYG AI
Sbjct: 406 KFQAQSSKLKGIGFTMEGSENNPVMFELMSELPWRPEKFTKEQWVKNYVKARYGVEDEAI 465
Query: 289 QDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
+ AW L ++YNC G + SI G+P + +
Sbjct: 466 EKAWLTLAKSIYNCPAGNNQQG---------PHESIFC---------GRPT----LNNFQ 503
Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
SS+ +Y + +A +L + + +N + YDL+D+TRQALA A + I
Sbjct: 504 ASSWSKMKNYYDPAMTKKAAKLMNSVAEKYRGNNNFEYDLVDITRQALADQARLQYQKTI 563
Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
Y+ + + RFL+++ D LL F +G W + A E++K
Sbjct: 564 ADYKAFSRKQFDRDAERFLKMLLLQDKLLGTRTEFRVGHWTQDAVNAGNTAEEKK 618
>gi|313140918|ref|ZP_07803111.1| alpha-N-acetylglucosaminidase family protein [Bifidobacterium
bifidum NCIMB 41171]
gi|313133428|gb|EFR51045.1| alpha-N-acetylglucosaminidase family protein [Bifidobacterium
bifidum NCIMB 41171]
Length = 2005
Score = 207 bits (528), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 137/462 (29%), Positives = 227/462 (49%), Gaps = 41/462 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP +W +Q++ L ++I R+ G+ PV+ F G VPA Q P++
Sbjct: 448 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 507
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G W P TYL DA + F ++G F + Q +G+ S+ Y D F
Sbjct: 508 SSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGDTFYKAQESVFGKVSNYYAVDPFH 566
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E D + I + + M D AVW+MQ W + D ++ L + G
Sbjct: 567 EGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 618
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+ +VLDL ++++ +++ + GVP++W MLHNF G + + G+ + I+ +A S
Sbjct: 619 QALVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 677
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W +Y+ RRYG + I+ AW++L
Sbjct: 678 -MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDIL 736
Query: 296 YHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
T Y TDG + +I A P D +I S S++
Sbjct: 737 LDTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWG 773
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
H + Y + +A LF + + S +RYD +D+ RQ LA E +AY+
Sbjct: 774 HSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKS 833
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
D LS R L++++ D LL+ D FL+G W++ A+ +
Sbjct: 834 GDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTM 875
>gi|311064845|ref|YP_003971571.1| beta-N-hexosaminidase [Bifidobacterium bifidum PRL2010]
gi|310867165|gb|ADP36534.1| Beta-N-hexosaminidase [Bifidobacterium bifidum PRL2010]
Length = 1923
Score = 207 bits (528), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 137/462 (29%), Positives = 227/462 (49%), Gaps = 41/462 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP +W +Q++ L ++I R+ G+ PV+ F G VPA Q P++
Sbjct: 366 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 425
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G W P TYL DA + F ++G F + Q +G+ S+ Y D F
Sbjct: 426 SSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGDTFYKAQENVFGKVSNYYAVDPFH 484
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E D + I + + M D AVW+MQ W + D ++ L + G
Sbjct: 485 EGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 536
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+ +VLDL ++++ +++ + GVP++W MLHNF G + + G+ + I+ +A S
Sbjct: 537 QALVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 595
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W +Y+ RRYG + I+ AW++L
Sbjct: 596 -MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDIL 654
Query: 296 YHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
T Y TDG + +I A P D +I S S++
Sbjct: 655 LDTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWG 691
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
H + Y + +A LF + + S +RYD +D+ RQ LA E +AY+
Sbjct: 692 HSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKS 751
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
D LS R L++++ D LL+ D FL+G W++ A+ +
Sbjct: 752 GDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTM 793
>gi|187734575|ref|YP_001876687.1| alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
BAA-835]
gi|187424627|gb|ACD03906.1| Alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
BAA-835]
Length = 848
Score = 207 bits (527), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 139/482 (28%), Positives = 231/482 (47%), Gaps = 63/482 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GPLP SW +Q+ LQ +IL R+ LGM P+ PAFSG VP + ++P AK+
Sbjct: 202 MGNIVNHDGPLPASWHKEQIALQHRILHRMKSLGMTPICPAFSGFVPRGILRLYPEAKLH 261
Query: 61 QL--GNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+L G W P+ + L +PLF++IGR ++++ KE+G+ ++ + D+F+E
Sbjct: 262 RLGWGGW------PQKNHAHFLSPEEPLFLKIGRLYMQEWQKEFGKNTY-FLADSFNEME 314
Query: 119 PPVDSP------EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNS 171
P + +SSLG IY + S + DAVW+MQGW+F Y W +KALL+
Sbjct: 315 LPENKGGVEARNNMLSSLGEQIYRSISSTNPDAVWVMQGWMFGYQRNIWNADTLKALLSK 374
Query: 172 VPLGKLVVLDLFAEVKPI-------WSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
VP K+++LDL A+ W K F+ P+++ ++ N G M G++D A
Sbjct: 375 VPDDKMLLLDLAADYNKTFWRNGMNWDVFKGFFNKPWVYSVVPNMGGKCAMTGVMDFYAN 434
Query: 225 GPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
G +EA +S + G+GM+ EGIE N V+Y+L+++ A+++ + +V+ ++ Y RYG
Sbjct: 435 GHLEALNSSSRGRLSGMGMAPEGIENNDVIYELITDAAWRNRQENVEQYLENYCRARYGN 494
Query: 284 SVPAIQDAWNVLYHTVY-NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKE 342
++++AWN+ T Y N D P + + T G N
Sbjct: 495 YPDSMKEAWNLFRRTAYSNLKD------------HPRFNWQMKPGTRGCSVN-------- 534
Query: 343 AVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANE 402
++ + ++ L LF+ + L S +R D +++ L NE
Sbjct: 535 -----------------TSEDFLKGLSLFVNT-RGLEQSPLFRQDAVEMAVHYLGIRMNE 576
Query: 403 LFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
EA D + F + D LL H + L W+ A+ + E++
Sbjct: 577 AIRAAQEALDEQDQENAEKCMAYFRKYALLADSLLEGHPTWRLSRWISFARSHGTSPEEK 636
Query: 463 KQ 464
+
Sbjct: 637 NK 638
>gi|374384144|ref|ZP_09641670.1| hypothetical protein HMPREF9449_00056 [Odoribacter laneus YIT
12061]
gi|373228751|gb|EHP51054.1| hypothetical protein HMPREF9449_00056 [Odoribacter laneus YIT
12061]
Length = 835
Score = 207 bits (527), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 138/475 (29%), Positives = 232/475 (48%), Gaps = 59/475 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GP+P W Q+ LQ KIL R+ LGM P+ PAF+G VP AL+ ++P KI
Sbjct: 192 MGNISQIDGPMPVEWHSDQVELQHKILKRMKLLGMKPICPAFAGFVPLALKRLYPDVKII 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT-- 118
+ W + ++L + LF IG+ FIE+ KE+G+ + Y D+F+E
Sbjct: 252 ET-TWAGFHN-------WMLSPEEELFTRIGQLFIEEWEKEFGK-NDFYLADSFNEMDVP 302
Query: 119 -PPVDSPEYISSL---GAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
PP+ + E L G +Y G+++G+ DAVW+MQGW+F Y W ++AL++ VP
Sbjct: 303 FPPIGTKERYDMLAFYGEQVYKGIKAGNPDAVWVMQGWMFGYQRDIWDYETLQALVSKVP 362
Query: 174 LGKLVVLDLFAEV-KPIWSTS------KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K+++LDL A+ K +W K F+ +++ ++ N G GIL A G
Sbjct: 363 DDKMMLLDLAADYNKNVWGNGMNWEFYKGFFNKLWVYSVIPNMGGKTGATGILSFYANGH 422
Query: 227 VEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+EA S N + G GM+ EG E N VVY+++ + + ++DVK W+ YS+ RYG++
Sbjct: 423 LEALNSPNRGRLFGFGMAPEGTENNEVVYEMICDAGWSSSEIDVKQWLKDYSLCRYGKTC 482
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
P + + W L +VY F D + + G+ S + +
Sbjct: 483 PEMDEVWEGLCKSVYG--------------TFTDHPRFLWQLRPGR--------SGKGTV 520
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
++++ Y RA+E +++ S ++ D +++T L L
Sbjct: 521 NTDSNFY-------------RAVEKMAECAPKMTESPLFKADFLEMTAFYLGGKMEALAS 567
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
I ++Y + ++ ++F EL E +D LL H + L W++ A++ E+
Sbjct: 568 AIGKSYLYGNTADALKMQQQFEELGEGLDSLLESHPVYRLQRWIDFARKHGDTEK 622
>gi|198277542|ref|ZP_03210073.1| hypothetical protein BACPLE_03764 [Bacteroides plebeius DSM 17135]
gi|198270040|gb|EDY94310.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides plebeius DSM
17135]
Length = 722
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 140/474 (29%), Positives = 229/474 (48%), Gaps = 57/474 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
M+NL GWGGP P SW QQ LQKKIL R+ E G+ PV P +SG VP A + + +
Sbjct: 191 MNNLEGWGGPNPDSWYTQQEALQKKILKRMREYGIEPVFPGYSGMVPHDANKKLGLNVTE 250
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD--EN 117
L N F+ + L TD F EI + ++ K +G+ ++ Y+ D F E+
Sbjct: 251 PALWNGFTRPA--------FLLPTDSRFNEIASLYYKELEKLFGKANY-YSMDPFHELED 301
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
VD + G A+ M++ + A W++QGW + +P RP +K L N G +
Sbjct: 302 AGSVD----FDAAGKAVLKAMKNVNPKATWVIQGW--TENP--RPEMIKNLNN----GDI 349
Query: 178 VVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
++LDLF+E +P IW K + +++CM+ NF GN+ ++G +D + +
Sbjct: 350 LILDLFSECRPMWGIPSIWKREKGYEQHDWLFCMIENFGGNVGLHGRMDQLLNNFYLTKN 409
Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
+ + G+G++MEG E NPV+++LM E+ ++ EK + W+ Y RYG I
Sbjct: 410 NPLAAHLKGIGLTMEGSENNPVMFELMCELPWRPEKFTKEEWLKDYLFARYGVRDEKITQ 469
Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
AW++L +YNC G + + + P ++ + + + K QNY P S EA
Sbjct: 470 AWSILADGIYNCPFGNNQQGPHESIFCGRPGLN-NFQASSWSKMQNYYDPTSTEA----- 523
Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
A L + ++ +N + YDL+D+ RQ+L+ ++ I
Sbjct: 524 ------------------AARLMLEVADKYKGNNNFEYDLVDIVRQSLSDRGRIVYNQTI 565
Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
++ D S+ FL ++ D LL F +G W+E A+ L E++
Sbjct: 566 ADFKSFDKKSFATHSQEFLNILLAQDRLLGTRSEFRVGRWIEQARNLGTTPEEK 619
>gi|380512475|ref|ZP_09855882.1| N-acetylglucosaminidase [Xanthomonas sacchari NCPPB 4393]
Length = 785
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 147/502 (29%), Positives = 225/502 (44%), Gaps = 71/502 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W++ + LQ +IL R+ LGM PVLPAF+G VP A P A+I
Sbjct: 221 MGNIEGYDAPLPQQWIEDKHALQLRILQRMRALGMKPVLPAFAGYVPKAFAQAHPQARIY 280
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE---- 116
++ W TY LD DPLF +I + FI+ + YG+ ++ Y D F+E
Sbjct: 281 RMRAWEGFHE------TYWLDPADPLFAQIAQRFIQLYDRTYGKGTY-YLADAFNEMLPP 333
Query: 117 ------------------NT--------PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
NT PPV + +++ G A+Y+ + + DAVW+MQ
Sbjct: 334 IAADGSDARLASYGDSTANTAKTKPPEVPPVQRDKRLAAYGRALYASIHRANPDAVWVMQ 393
Query: 151 GWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW P + A L VP KL+VLD+ + P W S F G +I+ +HN
Sbjct: 394 GWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDRYPGTWKLSDAFDGKQWIYGYVHN 453
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ G+ +YG +AF + R + +VG G EG+ VVY+ M +A+
Sbjct: 454 YGGSNPVYG---DLAFYREDLRALLADKDKQQLVGFGAFPEGLHTTSVVYEYMYALAWGA 510
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
++ ++ W++ Y+ RYG + PA++ AW+ L +V + P
Sbjct: 511 QQRPLQDWLDDYTRARYGHTSPALRAAWDDLQASVLSTRYWT---------------PRW 555
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
G Y + +P + + E + D P L RAL+ +A E + + Y
Sbjct: 556 WRSRAGAYLLFKRPTLD--IGEFEGAPGDPPRL-------RRALQQLLALAPEYADAPLY 606
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 444
RYDL+D R + + AY+ D + R E V +D L+
Sbjct: 607 RYDLVDFARHYATGRVDVQLQQAVAAYRRGDVAAGDAATARVREAVTQLDSLVGGQQD-T 665
Query: 445 LGPWLESAKQLAQNEEQEKQVR 466
L WL++A A + R
Sbjct: 666 LSSWLDAAAGYATTPQDAAYYR 687
>gi|390937398|ref|YP_006394957.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum BGN4]
gi|389891011|gb|AFL05078.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum BGN4]
Length = 1957
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 137/462 (29%), Positives = 226/462 (48%), Gaps = 41/462 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP +W +Q++ L ++I R+ G+ PV+ F G VPA Q P++
Sbjct: 400 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 459
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G W P TYL DA + F ++G F + Q +G+ S+ Y D F
Sbjct: 460 SSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGDTFYKAQENVFGKVSNYYAVDPFH 518
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E D + I + + M D AVW+MQ W + D ++ L + G
Sbjct: 519 EGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 570
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+ +VLDL ++++ ++ + GVP++W MLHNF G + + G+ + I+ +A S
Sbjct: 571 QALVLDLQSDLRS-QASPMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 629
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W +Y+ RRYG + I+ AW++L
Sbjct: 630 -MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDIL 688
Query: 296 YHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
T Y TDG + +I A P D +I S S++
Sbjct: 689 LDTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWG 725
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
H + Y + +A LF + + S +RYD +D+ RQ LA E +AY+
Sbjct: 726 HSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKS 785
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
D LS R L++++ D LL+ D FL+G W++ A+ +
Sbjct: 786 GDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTM 827
>gi|440731409|ref|ZP_20911430.1| N-acetylglucosaminidase, partial [Xanthomonas translucens DAR61454]
gi|440373101|gb|ELQ09870.1| N-acetylglucosaminidase, partial [Xanthomonas translucens DAR61454]
Length = 732
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 147/502 (29%), Positives = 223/502 (44%), Gaps = 71/502 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ + PLPQ W++ + LQ++IL R+ LGM PVLPAFSG VP A P A+I
Sbjct: 168 MGNIEAYDAPLPQQWIEDKYALQQRILQRMRTLGMKPVLPAFSGYVPKAFAQAHPQARIY 227
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF +I + FI+ + YG+ ++ Y D F+E PP
Sbjct: 228 RMRAWEGFHE------TYWLDPADPLFTKIAQRFIQLYDRTYGKGTY-YLADAFNEMLPP 280
Query: 121 VDS----------------------PE--------YISSLGAAIYSGMQSGDSDAVWLMQ 150
+ + PE ++ G A+Y + + DAVW+MQ
Sbjct: 281 IAADGSDARLASYGDSTANTAKTAPPEVSPAQRDKRLADYGRALYESIHRANPDAVWVMQ 340
Query: 151 GWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW P + A L VP KL+VLD+ + P W S F G +I+ +HN
Sbjct: 341 GWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDRYPGTWKLSDAFDGKQWIYGYVHN 400
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ G+ +YG +AF + R + +VG G EG+ N VVY+ M +A+
Sbjct: 401 YGGSNPVYG---DLAFYRDDLRALLADKDKQQLVGFGAFPEGLHDNSVVYEYMYTLAWGG 457
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
++ ++ W+ Y+ RYG + PA++ AW+ L V + P
Sbjct: 458 QQRSLQDWLGDYTRARYGHTSPALRAAWDDLQAAVLSTRYWT---------------PRW 502
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
G Y + +P + + E + D P L RAL+ +A E + + Y
Sbjct: 503 WRSRAGAYLLFKRPTLD--IGEFEGAPGDPPRL-------RRALDQLLALAPEYADAPLY 553
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 444
RYDL+D R + + AY+ D R V+ +DGL+ +
Sbjct: 554 RYDLVDFARHYATGRVDAQLQQAVAAYRRGDVAAGDAAFARVQAAVQQLDGLVGGQQE-I 612
Query: 445 LGPWLESAKQLAQNEEQEKQVR 466
L WL A+ A+ + R
Sbjct: 613 LSSWLGDAEGDAKTPQDAAYYR 634
>gi|424795356|ref|ZP_18221218.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
gi|422795515|gb|EKU24196.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
Length = 1105
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 148/502 (29%), Positives = 225/502 (44%), Gaps = 71/502 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W++ + LQ++IL R+ LGM PVLPAF+G VP A P A+I
Sbjct: 164 MGNIEGYDAPLPQQWIEDKHALQQRILQRMRALGMKPVLPAFAGYVPKAFAQAHPQARIY 223
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF +I + FI+ + YG+ ++ Y D F+E PP
Sbjct: 224 RMRAWEGFHE------TYWLDPADPLFAKIAQRFIQLYDRTYGKGTY-YLADAFNEMLPP 276
Query: 121 VDS----------------------PEY--------ISSLGAAIYSGMQSGDSDAVWLMQ 150
+ + PE ++ G A+Y + + DAVW+MQ
Sbjct: 277 IAADGSDARLASYGDSTANTANTAPPEVSPAQRDKRLADYGRALYESIHRANPDAVWVMQ 336
Query: 151 GWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW P + A L VP KL+VLD+ + P W S F G +I+ +HN
Sbjct: 337 GWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDRYPGTWKLSDAFDGKQWIYGYVHN 396
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ G+ +YG +AF + R + +VG G EG+ N VVY+ M +A+
Sbjct: 397 YGGSNPVYG---DLAFYRDDLRALLADKDKQQLVGFGAFPEGLHTNSVVYEYMYALAWGG 453
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
++ ++ W+ Y+ RYG S PA++ AW+ L +V + P
Sbjct: 454 QQRSLQDWLGDYTRARYGHSSPALRAAWDDLQASVLSTR---------------YWTPRW 498
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
G Y + +P + + E + D P L RAL+ +A E + + Y
Sbjct: 499 WRSRAGAYLLFKRPTLD--IGEFEGAPGDPPRL-------RRALDQLLALAPEYADAPLY 549
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 444
RYDL+D R + + AY+ D R V +DGL+
Sbjct: 550 RYDLVDFARHYATGRVDTQLQQALAAYKRGDVAAGDAAFARVQAAVRQLDGLVGGQQE-T 608
Query: 445 LGPWLESAKQLAQNEEQEKQVR 466
L WL++A+ A+ + R
Sbjct: 609 LSSWLDAAEGDAKTPQDAAYYR 630
>gi|365104185|ref|ZP_09333846.1| hypothetical protein HMPREF9428_02927 [Citrobacter freundii
4_7_47CFAA]
gi|363644798|gb|EHL84079.1| hypothetical protein HMPREF9428_02927 [Citrobacter freundii
4_7_47CFAA]
Length = 1049
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 130/455 (28%), Positives = 218/455 (47%), Gaps = 37/455 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGPLPQSW Q+ L +KI R+ G+ PV P F+G VP P A++
Sbjct: 361 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 420
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W P TY+ D F ++ + + +G SH Y D F E
Sbjct: 421 DQGEWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQTLKTTFGDISHYYAVDPFHEGGNR 478
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D + + + + M D DAVW++Q W + A LN + ++L
Sbjct: 479 ADLD--MVKVAQTVQNKMLEHDKDAVWIIQNW--------QENPTDAFLNGLKKDHALIL 528
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+A+ KP + +F P+IW MLH F G + G+ + +A + +E+ M GV
Sbjct: 529 DLYADNKPNHAMRHEFSNTPWIWNMLHAFGGRMGFSGMPEVLA-QEIPQSLAESKKMKGV 587
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ E + NP++Y+++ +MA++ + A+I+ + RYG P I+ AW+++ T Y
Sbjct: 588 GVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWLTSRYGAQSPEIEQAWDIMVKTAY 647
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ D+ R + SII G +G V++ + YD
Sbjct: 648 HRR---KDRQR--------AEDSIIDAKPG----FG--VTRACTYYTALIDYDK------ 684
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+E + L L+++ + A+ Y++DL+D+TRQ LA + E + +A+ D
Sbjct: 685 -AEFEKILPLYLSVYDHFKANPAYQHDLVDITRQVLANASYEYYRAFEDAWIAKDYSAFN 743
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
QLS +FL L++ D +L+ F+LG W+ SA+ +
Sbjct: 744 QLSGKFLRLIKLQDQVLSTRPEFMLGTWINSARTM 778
>gi|282881077|ref|ZP_06289764.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella timonensis CRIS
5C-B1]
gi|281304881|gb|EFA96954.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella timonensis CRIS
5C-B1]
Length = 688
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 138/468 (29%), Positives = 222/468 (47%), Gaps = 44/468 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGGP+ Q +D + LQ KIL R+ +LG+ PV+ F G VP+ L + +P A +
Sbjct: 164 MGNLEGWGGPMSQQMIDDRYKLQIKILRRMRQLGIEPVVQGFPGIVPSFLHDKYPKACVV 223
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W + +L LF + +A+ + + YG + D F E N
Sbjct: 224 SQGKWNGFQRPS------ILLPQSQLFYCMAKAYYDNMKRYYGTDLRYFGGDLFHEGGNA 277
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD +SS + + M S DA W++QGW + P ALL + ++
Sbjct: 278 KGVD----LSSTASKVQKCMLSHFPDAKWVLQGWNGNPSP--------ALLAGLDKKHVL 325
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTM 237
+++L E+ W S +F P+IW +++F G +M G L + P A S++ +
Sbjct: 326 LINLAGEIDASWKQSDEFGQTPWIWGSVNHFGGKTDMGGQLPVLVEQPHRALAASQHGRL 385
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G+G+ EGI NPVVYDL + A+ V + QY RYG + AW +L
Sbjct: 386 KGLGILPEGIHTNPVVYDLALQTAWSDTVPSVDHLLRQYIWYRYGTWNDDLYRAWQLLAS 445
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
+VY EG Y++ ++ ++ S S++ +
Sbjct: 446 SVYG---------------------EFEVKGEGTYESVF--CARPSLHVSSVSTWGPKKM 482
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y ++++AL LF + S TY YDL+DL RQ +A A ++ ++ AY D+
Sbjct: 483 QYQPEKLLQALVLFRKAAVHFKGSETYEYDLVDLARQVMANNARNVYNQVVHAYNEKDSL 542
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
+ + S FL L++ D LL+ + FLLG WL++A+Q +NE+ ++Q
Sbjct: 543 ALNRYSSTFLHLIDLQDSLLSTNKFFLLGKWLQAARQYGENEQDQRQA 590
>gi|291086028|ref|ZP_06354661.2| alpha-N-acetylglucosaminidase family protein [Citrobacter youngae
ATCC 29220]
gi|291069185|gb|EFE07294.1| alpha-N-acetylglucosaminidase family protein [Citrobacter youngae
ATCC 29220]
Length = 1014
Score = 205 bits (522), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 130/455 (28%), Positives = 217/455 (47%), Gaps = 37/455 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGPLPQSW Q+ L +KI R+ G+ PV P F+G VP P A++
Sbjct: 326 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 385
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W P TY+ D F ++ + + +G SH Y D F E
Sbjct: 386 DQGDWVGFVRPPM-LRTYVKQGAD-YFSKVADVYYQTLKTTFGDISHYYAVDPFHEGGNR 443
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D + + + + M D DAVW++Q W + A LN + ++L
Sbjct: 444 ADLD--MVKVAQTVQNKMLEHDKDAVWIIQNW--------QENPTDAFLNGLKKDHALIL 493
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+A+ KP + +F P+IW MLH F G + G+ + +A + +E+ M GV
Sbjct: 494 DLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSGMPEVLA-QEIPQSLAESKYMKGV 552
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ E + NP++Y+++ +MA++ + A+I+ + RYG P I+ AW+++ T Y
Sbjct: 553 GVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWLTSRYGAQSPEIEQAWDIMVKTAY 612
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ D+ R + SII G +G V++ + YD
Sbjct: 613 HRR---KDRQR--------AEDSIIDAKPG----FG--VTRACTYYTALIDYDK------ 649
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+E + L L+++ + + Y++DL+D+TRQ LA + E + +A+ D
Sbjct: 650 -AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLANASYEYYRAFEDAWMAKDYSAFN 708
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
QLS +FL L++ D +L F+LG WL SA+ +
Sbjct: 709 QLSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTM 743
>gi|410634789|ref|ZP_11345419.1| alpha-N-acetylglucosaminidase [Glaciecola arctica BSs20135]
gi|410145665|dbj|GAC22286.1| alpha-N-acetylglucosaminidase [Glaciecola arctica BSs20135]
Length = 750
Score = 205 bits (522), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 141/474 (29%), Positives = 239/474 (50%), Gaps = 58/474 (12%)
Query: 1 MSNLHGWGG--PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
M NL W G LP+S+ D+Q+ L KIL R+ LGM P++ AF+G VP A +FP A+
Sbjct: 211 MGNLITWDGGDKLPESYFDEQIALNHKILKRLRSLGMTPIVHAFAGFVPPATSELFPEAQ 270
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE-N 117
I +L +W P YLL +PLF++IG+ +IE+ KE+G+ + Y D+F+E +
Sbjct: 271 IRRL-SWGG--GLPESTYGYLLSPENPLFVKIGKMYIEEWQKEFGKNEY-YLADSFNEMD 326
Query: 118 TPPVDS-PEYISSL---GAAIYSGMQSGDSDAVWLMQGWLFSYDP------FWRPPQMKA 167
PP D+ E ++ L G +Y +++ + DA W+MQGW F Y FW P ++ A
Sbjct: 327 VPPADTEAELLTELAGYGDRVYQSIKAANPDATWVMQGWTFPYHKDENRQLFWTPERLHA 386
Query: 168 LLNSVPLGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 220
L++ VP KL++LDL E + P W F+ +I+ + N G + G D
Sbjct: 387 LVSKVPDDKLLILDLANEYNKLWWKIDPSWKMYSGFFNKKWIYSFIPNMGGKTPLNGRFD 446
Query: 221 SIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVR 279
A P++A + ++G G + EGIE N ++Y+L+++MA+Q + +DV W +Y+++
Sbjct: 447 IYAELPIDALNYKDKGNLIGFGFAPEGIENNEMIYELLTDMAWQRKAIDVDQWQAKYAMQ 506
Query: 280 RYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV 339
RYG +++ A++ L N+ + +F VD I Y+N P
Sbjct: 507 RYGAYPGSLEKAFSYL--------------NKSALGSF--VDHPIHRFQLRPYRN---PE 547
Query: 340 SKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 399
E DH + + + + I+A LF+ + +L + Y++D +++T L+
Sbjct: 548 GVE----------DHATV-HESEDFIKATGLFLQASEQLKDNKLYQHDAMEITTLFLSLV 596
Query: 400 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
+ L + + V + + ++ MD LLA H L W++ A+
Sbjct: 597 TDNLLTKFLAKDVEQRDYSVLDEA---ISVMHTMDKLLAEHPNHQLVTWVDYAR 647
>gi|310287970|ref|YP_003939229.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum S17]
gi|309251907|gb|ADO53655.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum S17]
Length = 1923
Score = 204 bits (519), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 136/462 (29%), Positives = 225/462 (48%), Gaps = 41/462 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP +W +Q++ L ++I R+ G+ PV+ F G VPA Q P++
Sbjct: 366 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 425
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G W P TYL DA + F ++G F + Q +G+ S+ Y D F
Sbjct: 426 SSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVGDTFYKAQESVFGKVSNYYAVDPFH 484
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E D + I + + M D AVW+MQ W + D ++ L + G
Sbjct: 485 EGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 536
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+ +VLDL ++++ ++ + GVP++W MLHNF G + + G+ + I+ +A S
Sbjct: 537 QALVLDLQSDLRS-QASPMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 595
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W +Y+ RRYG + I+ AW++L
Sbjct: 596 -MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDIL 654
Query: 296 YHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
T Y DG + +I A P D +I S S++
Sbjct: 655 LDTAYKHMDGEYYQGASESIINARPS-DNTIGSA----------------------STWG 691
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
H + Y + +A LF + + S +RYD +D+ RQ LA E +AY+
Sbjct: 692 HSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKS 751
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
D LS R L++++ D LL+ D FL+G W++ A+ +
Sbjct: 752 GDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTM 793
>gi|289667570|ref|ZP_06488645.1| N-acetylglucosaminidase [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 798
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 144/477 (30%), Positives = 219/477 (45%), Gaps = 72/477 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQ++IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQQQILTRMRELGMQPVLPAFAGYVPKAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG T Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFMELYTQAYG-TGEFYLADAFNEMLPP 313
Query: 121 -------VDSPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
V + +Y S+ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+P + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADREFWQPQAIAAFLGKVPDARLLVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG AF + + SE + G G+ EG+ N VVY+ + +A++
Sbjct: 434 YGASNPLYG---DFAFYRQDLQALLADSEKRNLRGFGIFPEGLHSNSVVYEYLYALAWEG 490
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ QY RYGRS A+ AW+ L +Y + P
Sbjct: 491 PQQPWSQWLTQYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 535
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
+ G Y + +P + ++K + D L RA++ + + + Y
Sbjct: 536 WNKRAGAYLLFKRPTAD--IVKFDDRPGDPQRL-------RRAIDALLQQAERYADAPLY 586
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACH 440
RYDLI+ R L+ A+ +++AY D A G QL+ R +LV+ +D L+
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDVQLA-RITQLVQGLDALVGGQ 642
>gi|333031147|ref|ZP_08459208.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
gi|332741744|gb|EGJ72226.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
Length = 721
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 135/469 (28%), Positives = 226/469 (48%), Gaps = 54/469 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL W Q+ LQ KIL R+ EL M P+ PAF+G VP A P
Sbjct: 199 MGNLNKWDGPLSDEWHTSQIELQHKILDRMRELEMKPIAPAFAGFVPMAFAEKHPDINFK 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W DP + Y+L P F EIG+ FIE+ E+G ++ Y D+F+E P
Sbjct: 259 HM-RWGGF--DPEYNA-YVLPPDSPFFEEIGKLFIEEWENEFGSNTY-YLSDSFNEMELP 313
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + + G +IY + +G+ +A+W+ QGW F Y FW ++ALL++VP
Sbjct: 314 IDKDDTEGKYRLLRQYGESIYKSISAGNPEAIWVTQGWTFGYQHSFWDTTSLQALLSNVP 373
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G + A
Sbjct: 374 NEKMIIIDLGNDYPKWVWNTEQTWKVQNGFYGKGWIFSYVPNFGGKTTMTGDMQMYATSS 433
Query: 227 VEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
EA S N ++G G + EG+E N V+Y+L+++M + E +++ W+ Y + RYG
Sbjct: 434 AEALASPNKGNLIGFGSAPEGLENNEVIYELLADMGWTSESINLDEWMQSYCLSRYGGYP 493
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
+Q AW + TVY+ + ++P Y E L
Sbjct: 494 ENVQKAWELFRKTVYSN-----------LYSYP---------------RYTWQTVVEDTL 527
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++ E + +ELF+++ NEL S Y DLI+ + A A++++
Sbjct: 528 RINKIN--------TSDEFLIGVELFVSAVNELKDSELYVNDLIEFSSFYAAAKADKIYK 579
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 454
+ ++ + L + ++++ +D LLA H + L W++ A+
Sbjct: 580 EALILFERGNKKEARSLLNQSIQILLKVDKLLASHPIYRLEEWVKYARN 628
>gi|289663931|ref|ZP_06485512.1| N-acetylglucosaminidase [Xanthomonas campestris pv. vasculorum
NCPPB 702]
Length = 798
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 144/477 (30%), Positives = 219/477 (45%), Gaps = 72/477 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQ++IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQQQILTRMRELGMQPVLPAFAGYVPKAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG T Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFMELYTQAYG-TGEFYLADAFNEMLPP 313
Query: 121 -------VDSPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
V + +Y S+ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+P + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADREFWQPQAIAAFLGKVPDARLLVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG AF + + SE + G G+ EG+ N VVY+ + +A++
Sbjct: 434 YGASNPLYG---DFAFYRQDLQALLADSEKRNLRGFGIFPEGLHSNSVVYEYLYALAWEG 490
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ QY RYGRS A+ AW+ L +Y + P
Sbjct: 491 PQQPWSQWLMQYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 535
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
+ G Y + +P + ++K + D L RA++ + + + Y
Sbjct: 536 WNKRAGAYLLFKRPTAD--IVKFDDRPGDPQRL-------RRAIDALLQQAERYADAPLY 586
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACH 440
RYDLI+ R L+ A+ +++AY D A G QL+ R +LV+ +D L+
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDVQLA-RITQLVQGLDALVGGQ 642
>gi|372221472|ref|ZP_09499893.1| alpha-N-acetylglucosaminidase [Mesoflavibacter zeaxanthinifaciens
S86]
Length = 712
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 118/311 (37%), Positives = 170/311 (54%), Gaps = 18/311 (5%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++G GPLPQ W+ ++ LQKKIL ++ +LGM PV+PAFSG +PAAL FP+AKI+
Sbjct: 200 MGNINGHAGPLPQEWITKKAKLQKKILSKMRDLGMKPVVPAFSGYIPAALAEKFPNAKIS 259
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L W D TYLLD DPLF EIG+ FIE +EYG+ + Y D+F+E TPP
Sbjct: 260 ELNGWSGGGFD----STYLLDPKDPLFKEIGKRFIELYNQEYGKAEY-YLADSFNEVTPP 314
Query: 121 VDSPEYISSL---GAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK 176
V + + L G IY + A W+MQGWLF +D FW + A L+ VP K
Sbjct: 315 VSTENKLDELAAYGQVIYETLNEAAPGATWVMQGWLFGHDAYFWEKDAVIAFLSKVPNDK 374
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA----RTS 232
L++ D + +W FYG + + +HN+ G+ +YG D F E
Sbjct: 375 LIIQDFGNDRYKVWEKQDAFYGKQWTYGYVHNYGGSNPIYGDFD---FYKEEINYLLEHD 431
Query: 233 ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS-VPAIQDA 291
++T ++G G+ EG+ QN +VY+ + ++ + K+ VK W+ RYG+ A
Sbjct: 432 KSTKVLGYGVMPEGLHQNSMVYEYLYDLPWD-SKIPVKDWLKTNIKARYGKDFTKETLTA 490
Query: 292 WNVLYHTVYNC 302
W L VY+
Sbjct: 491 WIKLDSAVYST 501
>gi|288927801|ref|ZP_06421648.1| putative alpha-N-acetylglucosaminidase
(N-acetyl-alpha-glucosaminidase) (NAG) [Prevotella sp.
oral taxon 317 str. F0108]
gi|288330635|gb|EFC69219.1| putative alpha-N-acetylglucosaminidase
(N-acetyl-alpha-glucosaminidase) (NAG) [Prevotella sp.
oral taxon 317 str. F0108]
Length = 723
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 140/468 (29%), Positives = 225/468 (48%), Gaps = 58/468 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++G GPL W Q+ LQ KIL R+ +L M+P+ P F+G VP AL+ ++P+A I
Sbjct: 198 MGNIYGIDGPLSNQWHQDQIALQHKILDRMRKLDMHPICPGFAGFVPEALKELYPTADI- 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT-- 118
Q W + Y+L DPLF +IG FI++ KE+GR Y D+F+E
Sbjct: 257 QYTTWEKAFHN------YILSPADPLFHKIGVMFIQEWEKEFGRCD-FYLIDSFNEMDIP 309
Query: 119 -PPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVP 173
PP D P E+++ G +Y ++ + A W+MQGW+F Y P W + AL++ VP
Sbjct: 310 FPPKDDPKRYEFMADFGKKVYQCIKEANPSATWVMQGWMFGYQPEIWDYKTLNALVSQVP 369
Query: 174 LGKLVVLDLFAEV-KPIWSTS------KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K+++LDL A+ K +W T K F G +I+ ++ N G + G LD A G
Sbjct: 370 DNKMIMLDLAADYNKFLWKTPFNWDFYKGFCGKQWIYSVIPNMGGKSALTGALDFYAKGH 429
Query: 227 VEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+EA S+N ++G G + EGIE N VVY+L+ + + + V+++ W+ Y+ RYG
Sbjct: 430 LEALNSQNRGKLIGFGFAPEGIENNEVVYELLCDAGWAKQGVELRPWLRNYTYSRYGCYP 489
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
++ WN + +VY G ++++ + +
Sbjct: 490 IGMEQYWNEMIQSVY-----------------------------GSFKSHPRFNWQFRPG 520
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
K + S D + +Y E++ + L GN+L + D ++ L L
Sbjct: 521 KEKYGSVDLDNHFYHAVEIMAGM-LSQMKGNKL-----FEADFKEMAANYLGGKVEILVR 574
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
I +AY+ D QL RF L+ MD +L H + W++ A+
Sbjct: 575 QIDKAYESQDTINANQLETRFYRLMTGMDLVLQGHPTKDMQKWIDYAR 622
>gi|393783265|ref|ZP_10371440.1| hypothetical protein HMPREF1071_02308 [Bacteroides salyersiae
CL02T12C01]
gi|392669544|gb|EIY63032.1| hypothetical protein HMPREF1071_02308 [Bacteroides salyersiae
CL02T12C01]
Length = 723
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 132/468 (28%), Positives = 224/468 (47%), Gaps = 54/468 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL W Q+ LQ +I+ R+ ELGM P+ PAF+G VP A P K
Sbjct: 199 MGNLNTWDGPLSDEWQKSQIELQHQIINRMRELGMQPIAPAFAGFVPMAFAEKHPDIKFK 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L W Y+L P F EIG+ F+++ KE+G+ ++ Y D+F+E P
Sbjct: 259 HL-KWGGFDDK---FNAYVLPPDSPFFEEIGKRFVKEWEKEFGKNTY-YLSDSFNEMELP 313
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
V + ++ G +IY + +G+ DA+W+ QGW F Y FW ++ALL+ VP
Sbjct: 314 VAKDDVEGKHKLLAQYGESIYRSITAGNPDAIWVTQGWTFGYQHDFWDKASLQALLSHVP 373
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W T + FYG +I+ + NF G + G L A
Sbjct: 374 DDKMIIIDLGNDYPKWVWGTEQTWKVHDGFYGKKWIFSYVPNFGGKTPLTGDLQMYATSS 433
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
EA + + ++G G + EG+E N VVY+L+++M + + +D + W+ Y RYG
Sbjct: 434 AEALKAPSHGNLIGFGSAPEGLENNEVVYELLADMGWTDQAIDPEQWMPSYCTARYGAYP 493
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
++++AW + T Y+ + ++P + + + +SK V
Sbjct: 494 ESMKNAWELFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKIDV- 534
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + + +ELF+AS + L+ S Y D I+ +A A++L+
Sbjct: 535 ---------------SDDFLHGIELFLASADSLNRSKLYVNDAIEFASYYIAAQADKLYK 579
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
+ +Q + ++L+ ++D LLA H + L W+E A+
Sbjct: 580 QALTEDTAGKPVAAYQHLNQAIDLLLNVDKLLASHPLYRLEEWVELAR 627
>gi|331660873|ref|ZP_08361805.1| alpha-N-acetylglucosaminidase family protein [Escherichia coli
TA206]
gi|422369309|ref|ZP_16449711.1| f5/8 type C domain protein [Escherichia coli MS 16-3]
gi|315298924|gb|EFU58178.1| f5/8 type C domain protein [Escherichia coli MS 16-3]
gi|331051915|gb|EGI23954.1| alpha-N-acetylglucosaminidase family protein [Escherichia coli
TA206]
Length = 1052
Score = 201 bits (512), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 128/455 (28%), Positives = 216/455 (47%), Gaps = 37/455 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGPLPQSW Q+ L +KI R+ G+ PV P F+G VP P A++
Sbjct: 364 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 423
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W P TY+ D F ++ + + +G SH Y D F E
Sbjct: 424 DQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQTLKTTFGDISHYYAVDPFHEGGNR 481
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D + + + + M D +AVW++Q W + LN + ++L
Sbjct: 482 ADLD--MVKVAQTVQNKMLEHDKNAVWIIQNW--------QENPTDDFLNGLKKDHALIL 531
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+A+ KP + +F P+IW MLH F G + G+ + +A + +E+ M GV
Sbjct: 532 DLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSGMQEVLA-QEIPQSLAESKYMKGV 590
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ E + NP++Y+++ +MA++ + A+I+ + RYG P I+ AW+++ T Y
Sbjct: 591 GVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWLTSRYGAQSPEIEQAWDIMVKTAY 650
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ D+ R + SII G +G V++ + YD
Sbjct: 651 HRR---KDRQR--------AEDSIIDAKPG----FG--VTRACTYYTALIDYDK------ 687
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+E + L L+++ + + Y++DL+D+TRQ LA + E + +A+ D
Sbjct: 688 -AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLANASYEYYRAFEDAWMAKDYSAFN 746
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
QLS +FL L++ D +L F+LG WL SA+ +
Sbjct: 747 QLSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTM 781
>gi|401885538|gb|EJT49648.1| alpha-N-acetylglucosaminidase, putative [Trichosporon asahii var.
asahii CBS 2479]
Length = 781
Score = 201 bits (512), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 135/472 (28%), Positives = 220/472 (46%), Gaps = 33/472 (6%)
Query: 2 SNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
N+HG W G WL+ Q LQK+IL R E GM PVLP F G VP L N
Sbjct: 237 GNIHGNWHGTTTWQWLEGQHNLQKQILARQREFGMTPVLPGFCGFVPPELHNYIGGPDFK 296
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W S ++ + +D + + AF+ +Q + YG TS Y D F E+ P
Sbjct: 297 TYPTWMSFPAE--YTKVRAIDPEWDTWNVVQSAFLRKQKELYGFTSDYYMVDLFTESKPT 354
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+ + A+ + + +A W+MQGW+F DP W KA L+ L+V
Sbjct: 355 STDPTYLKGIATAVRESIHAVAPNATWIMQGWIFVNDPKSWTETASKAFLDGAG-ESLLV 413
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL AE P W K F+G ++WC L N+ N +YG LD ++A+ + + G
Sbjct: 414 LDLAAESYPQWKRLKNFFGRRWLWCTLINYGQNDGLYGALDKWNHDIMDAK-ANGGRLSG 472
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVPAIQDAWNVLYHT 298
+G+ EGI N +++L ++ + + +D+K W + RRY G+++ Q AW +L ++
Sbjct: 473 MGIVPEGINNNEHLFELATDQGWSSQAIDLKQWTQNWVKRRYRGQNLDLAQKAWELLDNS 532
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VY + A ++ D+ P++ + L T +Y +
Sbjct: 533 VYKSNNTALKCTTRSLI---DLRPAV------------------SGLIGTTGNYLATAIT 571
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y +V+ AL+ + S + + + YDL+D+ RQ A ++ +I A+ ++
Sbjct: 572 YEPRDVVAALDNLLQSWSG-AGGQQFDYDLVDVARQVFVNAAIPIYQAMINAWNGSNKAD 630
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN----EEQEKQVR 466
+ R + L+ D+D L+A F L W+ A+ AQ+ ++ E Q R
Sbjct: 631 TEKYGRELVGLINDIDRLMATSRHFRLESWVGDARNWAQDAGAKDDMEFQAR 682
>gi|406693970|gb|EKC97309.1| alpha-N-acetylglucosaminidase, putative [Trichosporon asahii var.
asahii CBS 8904]
Length = 781
Score = 201 bits (512), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 135/472 (28%), Positives = 220/472 (46%), Gaps = 33/472 (6%)
Query: 2 SNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
N+HG W G WL+ Q LQK+IL R E GM PVLP F G VP L N
Sbjct: 237 GNIHGNWHGTTTWQWLEGQHNLQKQILARQREFGMTPVLPGFCGFVPPELHNYIGGPDFK 296
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W S ++ + +D + + AF+ +Q + YG TS Y D F E+ P
Sbjct: 297 TYPTWMSFPAE--YTKVRAIDPEWDTWNVVQSAFLRKQKELYGFTSDYYMVDLFTESKPT 354
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+ + A+ + + +A W+MQGW+F DP W KA L+ L+V
Sbjct: 355 STDPTYLKGIATAVRESIHAVAPNATWIMQGWIFVNDPKSWTETASKAFLDGAG-ESLLV 413
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL AE P W K F+G ++WC L N+ N +YG LD ++A+ + + G
Sbjct: 414 LDLAAESYPQWKRLKNFFGRRWLWCTLINYGQNDGLYGALDKWNHDIMDAK-ANGGRLSG 472
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVPAIQDAWNVLYHT 298
+G+ EGI N +++L ++ + + +D+K W + RRY G+++ Q AW +L ++
Sbjct: 473 MGIVPEGINNNEHLFELATDQGWSSQAIDLKQWTQNWVKRRYRGQNLDLAQKAWELLDNS 532
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VY + A ++ D+ P++ + L T +Y +
Sbjct: 533 VYKSNNTALKCTTRSLI---DLRPAV------------------SGLIGTTGNYLATAIT 571
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y +V+ AL+ + S + + + YDL+D+ RQ A ++ +I A+ ++
Sbjct: 572 YEPRDVVAALDNLLQSWSG-AGGQQFDYDLVDVARQVFVNAAIPIYQAMINAWNGSNKAD 630
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN----EEQEKQVR 466
+ R + L+ D+D L+A F L W+ A+ AQ+ ++ E Q R
Sbjct: 631 TEKYGRELVGLINDIDRLMATSRHFRLESWVGDARNWAQDAGAKDDMEFQAR 682
>gi|293402122|ref|ZP_06646261.1| alpha-N-acetylglucosaminidase family protein [Erysipelotrichaceae
bacterium 5_2_54FAA]
gi|291304514|gb|EFE45764.1| alpha-N-acetylglucosaminidase family protein [Erysipelotrichaceae
bacterium 5_2_54FAA]
Length = 2295
Score = 201 bits (512), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 132/465 (28%), Positives = 232/465 (49%), Gaps = 50/465 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ +GGPLP +W +Q+ L +K+ R+ G++PV+ FSG VP P+A IT
Sbjct: 398 MQNLYSYGGPLPDNWFEQRTELARKMHDRMQTYGISPVVQGFSGQVPDNFDKKQPTALIT 457
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
++ +W + P Y+ ++ + L+ ++ + F + Q +G ++ Y D F
Sbjct: 458 EMKDWVGY-TRPSIIQPYITESDAAKGKENLYPQVAKDFYDAQKNVFGNVTNYYATDPFH 516
Query: 116 ENTPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP 173
E P +D E + + M + AVW+MQ W + D ++ L V
Sbjct: 517 EGGNPSGLDFAETFKQ----VQTEMLKANEKAVWVMQQWQGNLDA----TKLSGL---VK 565
Query: 174 LGKLVVLDLFAEVKP---IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
+ + LDL ++ P + S+ P++WCMLHNF G + M G L ++A P A
Sbjct: 566 PSQALALDLQTDLNPQNGVMENSE----TPWLWCMLHNFGGRMGMDGNLPNVAKNPAIA- 620
Query: 231 TSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
+E+ M G+G++ E +E +PV Y+L+ +M + + +D AWI +Y+ RR G + +Q+
Sbjct: 621 MNESKYMKGIGITPEALENSPVAYELLFDMTWTKDPIDEDAWIAKYAQRRAGGTSEKLQE 680
Query: 291 AWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS 350
AW +L T Y GA + ++ +II+ T +S S
Sbjct: 681 AWKILNETAY----GAKQE------SYQGAAETIINAT------------PRDSFRS-AS 717
Query: 351 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEA 410
++ H ++ Y E +AL+L I + ++ AS YRYDL D+ Q L A E +++A
Sbjct: 718 TWGHSNITYDKKEFEKALQLLIDNYDDFKASPAYRYDLADVANQVLCNVAIEYHSLMVKA 777
Query: 411 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
++A + S++FLE+++ D +L + F++G W+ A+ +
Sbjct: 778 KNESNADDFRKYSKKFLEIIDLSDEILGSSEEFMVGNWINDARNM 822
>gi|393785795|ref|ZP_10373941.1| hypothetical protein HMPREF1068_00221 [Bacteroides nordii
CL02T12C05]
gi|392661414|gb|EIY55000.1| hypothetical protein HMPREF1068_00221 [Bacteroides nordii
CL02T12C05]
Length = 724
Score = 201 bits (512), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 134/468 (28%), Positives = 223/468 (47%), Gaps = 54/468 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL W + Q+ LQ +I+ R+ ELGM+P+ PAF+G VP A P K
Sbjct: 199 MGNLNTWDGPLSDEWQEGQIQLQHQIINRMRELGMSPIAPAFAGFVPMAFAEKHPDIKFK 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L W Y+L P F EIG+ F+++ KE+G+ ++ Y D+F+E P
Sbjct: 259 HL-KWGGFDDK---FNAYVLPPDSPFFEEIGKRFVKEWEKEFGKNTY-YLSDSFNEMELP 313
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
V + ++ G +IY + +G+ DA+W+ QGW F Y FW ++ALL+ VP
Sbjct: 314 VAKDDVEGKHKLLAQYGESIYRSITAGNPDAIWVTQGWTFGYQHSFWDKASLQALLSHVP 373
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W T + FYG +I+ + NF G M G L A
Sbjct: 374 DDKMIIIDLGNDYPKWVWGTEQTWKVHDGFYGKKWIFSYVPNFGGKTPMTGDLQMYASSS 433
Query: 227 VEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
EA SE + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y + RYG
Sbjct: 434 AEALQSESHGNLIGFGSAPEGLENNEVVYELLADMGWTDQAIDLDKWMPSYCMARYGAYP 493
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
++DAW++ T Y+ ++ PD K +
Sbjct: 494 ETMKDAWDLFRKTAYSSLYSYPRFTWQTVI--PD---------------------KRRIS 530
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
K + S + + +ELF+ S + L S Y D I+ +A A++L+
Sbjct: 531 KIDVS-----------DDFLHGVELFLNSADSLKNSKLYVNDAIEFASYYIAAKADKLYG 579
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
+ + + Q + ++++ ++D LLA H + L W+ A+
Sbjct: 580 KALAEDTVGRSAVAQQYLNQTIDMLLNVDKLLASHPLYRLEEWVNFAR 627
>gi|432896403|ref|ZP_20107613.1| hypothetical protein A13U_00343 [Escherichia coli KTE192]
gi|433031274|ref|ZP_20219108.1| hypothetical protein WIA_04388 [Escherichia coli KTE109]
gi|431432398|gb|ELH14169.1| hypothetical protein A13U_00343 [Escherichia coli KTE192]
gi|431538475|gb|ELI14460.1| hypothetical protein WIA_04388 [Escherichia coli KTE109]
Length = 1049
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 128/455 (28%), Positives = 216/455 (47%), Gaps = 37/455 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGPLPQSW Q+ L +KI R+ G+ PV P F+G VP P A++
Sbjct: 361 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 420
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W P TY+ D F ++ + + +G SH Y D F E
Sbjct: 421 DQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQTLKTTFGDISHYYAVDPFHEGGNR 478
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D + + + + M D +AVW++Q W + LN + ++L
Sbjct: 479 ADLD--MVKVAQTVQNKMLEHDKNAVWIIQNW--------QENPTDDFLNDLKKDHALIL 528
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+A+ KP + +F P+IW MLH F G + G+ + +A + +E+ M GV
Sbjct: 529 DLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSGMQEVLA-QEIPQSLAESKYMKGV 587
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ E + NP++Y+++ +MA++ + A+I+ + RYG P I+ AW+++ T Y
Sbjct: 588 GVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWLTSRYGAQSPEIEQAWDIMVKTAY 647
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ D+ R + SII G +G V++ + YD
Sbjct: 648 HRR---KDRQR--------AEDSIIDAKPG----FG--VTRACTYYTALIDYDK------ 684
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+E + L L+++ + + Y++DL+D+TRQ LA + E + +A+ D
Sbjct: 685 -AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLANASYEYYRAFEDAWMAKDYSAFN 743
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
QLS +FL L++ D +L F+LG WL SA+ +
Sbjct: 744 QLSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTM 778
>gi|161505009|ref|YP_001572121.1| hypothetical protein SARI_03139 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:- str. RSK2980]
gi|160866356|gb|ABX22979.1| hypothetical protein SARI_03139 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:-]
Length = 1014
Score = 201 bits (510), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 126/455 (27%), Positives = 218/455 (47%), Gaps = 37/455 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGPLPQSW Q+ L +KI R+ G+ PV P F+G VP P A++
Sbjct: 326 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 385
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W P TY+ D F ++ + + +G SH Y D F E
Sbjct: 386 DQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQTLKTTFGDISHYYAVDPFYEGGNR 443
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D + + + + M D DAVW++Q W+ A LN + ++L
Sbjct: 444 ADLN--MVKVAQTVQNKMLEHDKDAVWIIQN--------WQENPTDAFLNGLKKDHALIL 493
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+A+ KP + +F P+IW MLH F G + G+ + +A + +E+ M GV
Sbjct: 494 DLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSGMPEVLA-QEIPQSLAESKYMKGV 552
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ E + NP++Y+++ +MA++ + A+I+ + RYG P I+ AW+++ T Y
Sbjct: 553 GVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHNWLTSRYGAQSPEIEQAWDIMVKTAY 612
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ D+ R + SII G +G + +Y + + Y
Sbjct: 613 HRR---KDRQR--------AEDSIIDAKPG----FG---------VTRACTYYNALIDYD 648
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+E + L L+++ + + Y++DL+D+TRQ LA + E + +A+ D
Sbjct: 649 KAEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLANASYEYYRAFEDAWMAQDYSAFN 708
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
QLS +FL L++ D +L+ F+LG W+ +++ +
Sbjct: 709 QLSGKFLRLIKLQDKVLSTRPEFMLGNWINNSRTM 743
>gi|373451393|ref|ZP_09543318.1| hypothetical protein HMPREF0984_00360, partial [Eubacterium sp.
3_1_31]
gi|371968665|gb|EHO86120.1| hypothetical protein HMPREF0984_00360, partial [Eubacterium sp.
3_1_31]
Length = 2190
Score = 201 bits (510), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 131/465 (28%), Positives = 233/465 (50%), Gaps = 50/465 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ +GGPLP +W +Q+ L +K+ R+ G++PV+ FSG VP P+A IT
Sbjct: 390 MQNLYSYGGPLPDNWFEQRTELARKMHDRMQTYGISPVVQGFSGQVPDNFDKKQPTALIT 449
Query: 61 QLGNWFSVKSDPRWCCTYLLD-----ATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
++ +W + P Y+ + + L+ ++ + F + Q +G ++ Y D F
Sbjct: 450 EMKDWVGY-TRPSIIQPYITENDAAKGKENLYPQVAKDFYDAQKNVFGNVTNYYATDPFH 508
Query: 116 ENTPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP 173
E P +D E + + M + AVW+MQ W + D ++ LL
Sbjct: 509 EGGNPSGLDFAETFKQ----VQTEMLKANEKAVWVMQQWQGNLDA----TKLSGLLKP-- 558
Query: 174 LGKLVVLDLFAEVKP---IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
+ + LDL ++ P + S+ P++WCMLHNF G + M G L ++A P A
Sbjct: 559 -SQALALDLQTDLNPQNGVMENSE----TPWLWCMLHNFGGRMGMDGNLPNVAKNPAIA- 612
Query: 231 TSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
+E+ M G+G++ E +E +PV Y+L+ +M + + +D AWI +Y+ RR G + +Q+
Sbjct: 613 MNESKYMKGIGITPEALENSPVAYELLFDMTWTKDPIDEDAWIAKYAQRRAGGTSEKLQE 672
Query: 291 AWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS 350
AW +L T Y GA + ++ +II+ T +++ + S
Sbjct: 673 AWKILNETAY----GAKQE------SYQGAAETIINAT-----------PRDSFRSA--S 709
Query: 351 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEA 410
++ H ++ Y E +AL+L I + ++ AS YRYDL D+ Q L A E +++A
Sbjct: 710 TWGHSNITYDKKEFEKALQLLIDNYDDFKASPAYRYDLADVADQVLCNVAIEYHSLMVKA 769
Query: 411 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
++A + S++FLE+++ D +L + F++G W+ A+ +
Sbjct: 770 KNESNADDFRKYSKKFLEIIDLSDEILGSSEEFMVGNWINDARNM 814
>gi|281424178|ref|ZP_06255091.1| N-acetylglucosaminidase [Prevotella oris F0302]
gi|281401447|gb|EFB32278.1| N-acetylglucosaminidase [Prevotella oris F0302]
Length = 723
Score = 201 bits (510), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 139/468 (29%), Positives = 224/468 (47%), Gaps = 58/468 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++G GPL W Q+ LQ KIL R+ +L M+P+ P F+G VP AL+ ++P+A I
Sbjct: 198 MGNIYGIDGPLSNQWHQDQIALQHKILDRMRKLDMHPICPGFAGFVPEALKELYPTADI- 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT-- 118
Q W + Y+L DPLF +IG FI++ KE+GR Y D+F+E
Sbjct: 257 QYTTWEKAFHN------YILSPADPLFHKIGVMFIQEWEKEFGRCD-FYLIDSFNEMDIP 309
Query: 119 -PPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVP 173
PP D P E+++ G +Y ++ + A W+MQGW+F Y P W + AL++ VP
Sbjct: 310 FPPKDDPKRYEFMADFGKKVYQCIKEANPSATWVMQGWMFGYQPEIWDYKTLNALVSQVP 369
Query: 174 LGKLVVLDLFAEV-KPIWSTS------KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K+++LDL + K +W T K F G +I+ ++ N G + G LD A G
Sbjct: 370 DNKMIMLDLAVDYNKFLWKTPFNWDFYKGFCGKQWIYSVIPNMGGKSALTGALDFYAKGH 429
Query: 227 VEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+EA S+N ++G G + EGIE N VVY+L+ + + + V+++ W+ Y+ RYG
Sbjct: 430 LEALNSQNRGKLIGFGFAPEGIENNEVVYELLCDAGWAKQGVELRPWLRNYTYSRYGCYP 489
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
++ WN + +VY G ++++ + +
Sbjct: 490 IGMEQYWNEMLQSVY-----------------------------GSFKSHPRFNWQFRPG 520
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
K + S D + +Y E++ + L GN+L + D ++ L L
Sbjct: 521 KEKYGSVDLDNHFYHAVEIMAGM-LSQMKGNKL-----FEADFKEMAANYLGGKVEILVR 574
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
I +AY+ D QL RF L+ MD +L H + W++ A+
Sbjct: 575 QIDKAYESQDTINANQLETRFYRLMTGMDLVLQGHPTKDMQKWIDYAR 622
>gi|421736727|ref|ZP_16175487.1| alpha-N-acetylglucosaminidase, partial [Bifidobacterium bifidum
IPLA 20015]
gi|407295984|gb|EKF15606.1| alpha-N-acetylglucosaminidase, partial [Bifidobacterium bifidum
IPLA 20015]
Length = 1044
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 135/462 (29%), Positives = 223/462 (48%), Gaps = 41/462 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP +W +Q + L ++I R+ G+ PV+ F G VPA Q P++
Sbjct: 338 MQNLYSVGGPLPAAWFEQCVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 397
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G W P TYL DA + F ++ F + Q +G+ S+ Y D F
Sbjct: 398 SSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVCDTFYKAQENVFGKVSNYYAVDPFH 456
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E D + I + + M D AVW+MQ W + D ++ L + G
Sbjct: 457 EGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 508
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+ +VLDL ++++ ++ + GVP++W MLHNF G + + G+ + I+ +A S
Sbjct: 509 QTLVLDLQSDLRSQ-ASPMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 567
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W +Y+ RRYG + I+ W++L
Sbjct: 568 -MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKVWDIL 626
Query: 296 YHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
T Y TDG + +I A P D +I S S++
Sbjct: 627 LDTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWG 663
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
H + Y + +A LF + + S +RYD +D+ RQ LA E +AY+
Sbjct: 664 HSDIDYDKRQFEKAAALFEQAYDSYKNSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKS 723
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
D LS R L++++ D LL+ D FL+G W++ A+ +
Sbjct: 724 GDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTM 765
>gi|260910505|ref|ZP_05917173.1| periplasmic beta-glucosidase [Prevotella sp. oral taxon 472 str.
F0295]
gi|260635347|gb|EEX53369.1| periplasmic beta-glucosidase [Prevotella sp. oral taxon 472 str.
F0295]
Length = 1566
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 140/466 (30%), Positives = 231/466 (49%), Gaps = 43/466 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGGP+ +S + +L Q+++L R+ +LG+ PV+ F G VP + FP A+I
Sbjct: 199 MGNLEGWGGPMSESLIALRLQQQRQMLQRMRQLGIQPVVQGFPGIVPTFFKERFPQARII 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN--T 118
+ G W S + LL D +F ++ A+ + K +G D F E T
Sbjct: 259 EQGKWGSFQRP-----AVLLPNNDGVFEKVAEAYYQSLTKLFGTDFEFLGGDLFHEGGIT 313
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD + S+ A + M A W++QGW + +P PQ+ +L+ +
Sbjct: 314 TGVD----VGSVAAQVQRQMLRFFPRAKWVLQGW--NKNP---SPQLLRVLDKR---HTL 361
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART-SENTTM 237
+++L E+ W +S +F G P++W +++F G +M G L I P A + ++ M
Sbjct: 362 LVNLSGEIAASWESSDEFGGTPWLWGSVNHFGGKTDMGGQLPVIVTEPHRALALTVDSVM 421
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G+G+ EGI NPVVYDL + A+ DV + + QY RYG P + AW ++
Sbjct: 422 QGIGILPEGIGTNPVVYDLALKTAWHTATPDVDSMLVQYLGYRYGEVHPDLLAAWRIMLK 481
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
+VY EG +++ ++ ++ + S++ +
Sbjct: 482 SVYG---------------------EFAIKGEGTFESVF--CARPSLRVTSVSTWGPKQM 518
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y +++ RAL LF+ + +L S TY+YDL+DL RQ+LA YA + ++++AY+ +A
Sbjct: 519 QYQPADLYRALGLFLKAAPKLRDSETYQYDLVDLARQSLANYARTAYADVVKAYEAKNAE 578
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ Q ++RF L+ D LL + FLLG WL+ A Q A NE +
Sbjct: 579 QLQQATQRFERLIVLQDSLLLTNRHFLLGNWLQQATQYAPNEADRQ 624
>gi|433678127|ref|ZP_20510026.1| alpha-N-acetylglucosaminidase [Xanthomonas translucens pv.
translucens DSM 18974]
gi|430816763|emb|CCP40478.1| alpha-N-acetylglucosaminidase [Xanthomonas translucens pv.
translucens DSM 18974]
Length = 691
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 146/502 (29%), Positives = 219/502 (43%), Gaps = 71/502 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PL Q W++ + LQ++IL R+ LGM PVLPAF G VP A P A+I
Sbjct: 127 MGNIEGYDAPLQQQWIEDKHALQQRILQRMRTLGMKPVLPAFVGYVPKAFAQAHPQARIY 186
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF +I FI+ + YG+ ++ Y D F+E PP
Sbjct: 187 RMRAWEGFHE------TYWLDPADPLFAKIALRFIQLYDRTYGKGTY-YLADAFNEMLPP 239
Query: 121 VDS----------------------PE--------YISSLGAAIYSGMQSGDSDAVWLMQ 150
+ + PE ++ G A+Y + + DAVW+MQ
Sbjct: 240 IAADGSDARLASYGDSTANTAKTAPPEVSPAQRDKRLADYGRALYESIHRANPDAVWVMQ 299
Query: 151 GWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW P + A L VP KL+VLD+ + P W S F G +I+ +HN
Sbjct: 300 GWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDRYPGTWKLSDAFDGKQWIYGYVHN 359
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ G+ +YG +AF + R + +VG G EG+ N VVY+ M +A+
Sbjct: 360 YGGSNPVYG---DLAFYRDDLRALLADKDKQQLVGFGAFPEGLHDNSVVYEYMYALAWGG 416
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
++ ++ W+ Y RYG + PA++ AW+ L V + P
Sbjct: 417 QQRSLQDWLGDYIRARYGHTSPALRAAWDDLQAAVLSTRYWT---------------PRW 461
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
G Y + +P + + E + D P L RAL+ +A E + + Y
Sbjct: 462 WRSRAGAYLLFKRPTLD--IGEFEGAPGDPPRL-------RRALDQLLALAPEYADAPLY 512
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 444
RYDL+D R + + AY+ D R V+ +DGL+
Sbjct: 513 RYDLVDFARHYATGRVDAQLQQAVAAYRRGDVAAGDAAFARVQAAVQQLDGLVGGQQE-T 571
Query: 445 LGPWLESAKQLAQNEEQEKQVR 466
L WL A+ A+ + R
Sbjct: 572 LSSWLGDAEGDAKTPQDAAYYR 593
>gi|308480701|ref|XP_003102557.1| hypothetical protein CRE_04113 [Caenorhabditis remanei]
gi|308261289|gb|EFP05242.1| hypothetical protein CRE_04113 [Caenorhabditis remanei]
Length = 718
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 130/475 (27%), Positives = 225/475 (47%), Gaps = 49/475 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQN---VFPSA 57
M NL +GG L + + L K+I+ R+ ELG+ P+LP F+G VP L+ +FP++
Sbjct: 210 MGNLKAYGGGLSDAQMLNDFNLAKRIINRLLELGIVPILPTFAGFVPDQLEKDFRLFPTS 269
Query: 58 KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDE 116
K +L W + S+ C + DPLF +IG F+ Q K G +++Y+ D F+E
Sbjct: 270 KFNRLPCWNNFTSET--SCLLSVSPFDPLFQKIGSTFLRHQKKMLGGDITNLYSADPFNE 327
Query: 117 NTPPVDSPEYISSL----GAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSV 172
P DS ++ +S +I + + D + +W++Q W F+YD W +K+ L++V
Sbjct: 328 -ILPSDSSKFDASFMKQTAQSIMNSCRKVDKNCIWVLQSWSFTYDQ-WPNWAIKSFLSAV 385
Query: 173 PLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS 232
P+G L++LDL++EV P W + F+G ++WC+LHNF G+ E+ G L + G A
Sbjct: 386 PIGNLLILDLYSEVVPAWQMTSSFHGHNFVWCLLHNFGGSRELRGNLQKVDKGYQLALMK 445
Query: 233 ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAW 292
+ +VG G+SME I+QN VVY M + + E + + W+ YS RY W
Sbjct: 446 AGSNLVGAGLSMEAIDQNYVVYQFMIDRMWSQEPIPLNNWLKSYSESRYSADFKVSHKFW 505
Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 352
+L + Y+ + + V + Y +P + +
Sbjct: 506 TILAGSFYSQPEKWGNPRFSVFL-------------------YHRPAFAKKI-------- 538
Query: 353 DHPHLWYSTSEVIRALELFIAS-GNELSASNTYRYDLIDLTRQALA-KYANELFLNIIEA 410
W+ E L+ + S + L ++ DL D+ R + + NE L++ EA
Sbjct: 539 ---EYWFPVEETFNHLQSLMPSLMHVLGDHPLFKEDLNDVMRAVIQFEIGNEAALSLTEA 595
Query: 411 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
+ + D + +++ + ++ + W+E +K +A E E+QV
Sbjct: 596 FLMEDKQQIGASCENLMDMFQKLES----YSNRDFKEWIEDSKSIAPTSE-ERQV 645
>gi|423219557|ref|ZP_17206053.1| hypothetical protein HMPREF1061_02826 [Bacteroides caccae
CL03T12C61]
gi|392624762|gb|EIY18840.1| hypothetical protein HMPREF1061_02826 [Bacteroides caccae
CL03T12C61]
Length = 715
Score = 197 bits (502), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 135/465 (29%), Positives = 208/465 (44%), Gaps = 43/465 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGP+PQS +D + L +K+L R+ LG+ P++P F G VP+ L+N A I
Sbjct: 195 MGNIEGWGGPMPQSQIDSRKKLVQKMLKRMKSLGIEPLMPGFYGMVPSNLKNK-SKAHII 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + +LD DP F + F ++ + YG ++ D F E
Sbjct: 254 PQGTWGAFTRPD------ILDPMDPEFDRVAAIFYDETRRLYGSDIRFFSGDPFHEGGAT 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ G AI MQ ++W++QGW + P LL + ++V
Sbjct: 308 DGVA--LGDAGRAIQKTMQKHFPGSIWVLQGWQDNPKP--------GLLEKLDKRYVLVQ 357
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT-MVG 239
+LF E W T K + G P+IW + NF + G L A A SE M G
Sbjct: 358 ELFGENTNNWETRKGYEGTPFIWATVTNFGERPGINGKLQRFADEVYRASNSEYAKYMKG 417
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG+ EGI NPV Y+L+ E+ + ++VDV WI Y RYGR I+ AW ++ ++
Sbjct: 418 VGILPEGINNNPVTYELLLELVWHKDRVDVDQWIESYVTARYGRITDEIRTAWKMMLKSI 477
Query: 300 YNCTDGATDK-NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
Y+ G + +++ A P ++ +S + Y + + K+A
Sbjct: 478 YSSEVGYQEGPPENILCARPALELKSVSSWGRLAKKYDRDLYKKAAF------------- 524
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
LF + E + TYR DLI RQ +A A+ +F ++I AYQ
Sbjct: 525 -----------LFAKAMPEFNEVRTYRIDLIHFLRQVIANEADSVFYDMITAYQEKKVEK 573
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
Q +FL +++ + LLA F L W + AK ++K
Sbjct: 574 FEQEVSKFLMMIDTENELLAQDPFFRLSTWQQQAKDAGNTAAEKK 618
>gi|153806010|ref|ZP_01958678.1| hypothetical protein BACCAC_00255 [Bacteroides caccae ATCC 43185]
gi|149130687|gb|EDM21893.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
43185]
Length = 715
Score = 197 bits (502), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 135/465 (29%), Positives = 208/465 (44%), Gaps = 43/465 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGP+PQS +D + L +K+L R+ LG+ P++P F G VP+ L+N A I
Sbjct: 195 MGNIEGWGGPMPQSQIDSRKKLVQKMLKRMKSLGIEPLMPGFYGMVPSNLKNK-SKAHII 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + +LD DP F + F ++ + YG ++ D F E
Sbjct: 254 PQGTWGAFTRPD------ILDPMDPEFDRVAAIFYDETRRLYGSDIRFFSGDPFHEGGAT 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ G AI MQ ++W++QGW + P LL + ++V
Sbjct: 308 DGVA--LGDAGRAIQKTMQKHFPGSIWVLQGWQDNPKP--------GLLEKLDKRYVLVQ 357
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT-MVG 239
+LF E W T K + G P+IW + NF + G L A A SE M G
Sbjct: 358 ELFGENTNNWETRKGYEGTPFIWATVTNFGERPGINGKLQRFADEVYRASNSEYAKYMKG 417
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG+ EGI NPV Y+L+ E+ + ++VDV WI Y RYGR I+ AW ++ ++
Sbjct: 418 VGILPEGINNNPVTYELLLELVWHKDRVDVDQWIESYVTARYGRITDEIRTAWKMMLKSI 477
Query: 300 YNCTDGATDK-NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
Y+ G + +++ A P ++ +S + Y + + K+A
Sbjct: 478 YSSEVGYQEGPPENILCARPALELKSVSSWGRLAKKYDRDLYKKAAF------------- 524
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
LF + E + TYR DLI RQ +A A+ +F ++I AYQ
Sbjct: 525 -----------LFAKAMPEFNEVRTYRIDLIHFLRQVIANEADSVFYDMITAYQEKKVEK 573
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
Q +FL +++ + LLA F L W + AK ++K
Sbjct: 574 FEQEVSKFLMMIDTENELLAQDPFFRLSTWQQQAKDAGNTAAEKK 618
>gi|169351448|ref|ZP_02868386.1| hypothetical protein CLOSPI_02228 [Clostridium spiroforme DSM 1552]
gi|169291670|gb|EDS73803.1| LPXTG-motif cell wall anchor domain protein [Clostridium spiroforme
DSM 1552]
Length = 1990
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 130/463 (28%), Positives = 226/463 (48%), Gaps = 40/463 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GG LP +W ++++ L +K+ R+ G+ PVL FSG VP ++ + +
Sbjct: 360 MQNMTSYGGKLPNNWFEERVELARKMHDRMQTYGITPVLSGFSGQVPTNFKDKYQDVQYV 419
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W + P TY+ + F ++ F + Q +G ++IY D F E
Sbjct: 420 AQGSWCGYER-PDMLRTYVDNGGTDYFSQMADVFYKAQRDIFGDVTNIYAVDPFHEGGKI 478
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK--LV 178
D + + + M D DA+WL+Q W S P ++ + L K ++
Sbjct: 479 GDMN--YTKVYETVQKKMMENDEDAIWLIQEWSGSIAS--NPSKL------INLDKEHVI 528
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P +++ + P+IW MLHNF G + + + ++ P + SE+ M
Sbjct: 529 VLDLFSEVSP-RNSALEAADTPWIWNMLHNFGGRMGLDANPEKVSQNIPNTYQNSEH--M 585
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+GM+ E IE +P+ Y+L+ +M + + +D + W Y+ R YG + I++ WN+L
Sbjct: 586 VGIGMTPEAIENSPMAYELLWDMTWTKDPIDFRQWCQDYAKRIYGGTNEDIEEVWNILLD 645
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T YN D + S+I+ +P + + SS+ H +
Sbjct: 646 TGYNRKDN----------YYQGAPESVIN---------ARPTTN----FTSASSWGHSTI 682
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y E+ RA+ L + +E S + YDL D+TRQ ++ A E ++ AYQ +
Sbjct: 683 NYDKEELERAVYLMAKNYDEFKDSPAFIYDLSDITRQLISNSAQEYHKAMVNAYQAGNLS 742
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
LS +FLE++ D +L+ + FL+G W+E A+ + ++ +
Sbjct: 743 EFEVLSDKFLEMILLQDQILSTNSDFLVGKWIEQARTMIEDSD 785
>gi|384417770|ref|YP_005627130.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzicola BLS256]
gi|353460684|gb|AEQ94963.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzicola BLS256]
Length = 798
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 140/474 (29%), Positives = 218/474 (45%), Gaps = 72/474 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQQWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPKAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQAYG-AGEFYLADAFNEMLPP 313
Query: 121 -------VDSPEY-----------------------ISSLGAAIYSGMQSGDSDAVWLMQ 150
V + +Y +++ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAAYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+P + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADRAFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG +AF ++A ++ + G G+ EG+ N VVY+ + +A++
Sbjct: 434 YGASNPLYG---DVAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVVYEYLYALAWEG 490
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ +Y RYGRS A+ AW L +Y + P
Sbjct: 491 PQHPWSQWLARYLRARYGRSDAALLSAWTDLEAGIYQTRYWS---------------PRW 535
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
+ G Y + +P + D P + RA++ + + + + Y
Sbjct: 536 WNTHAGAYLLFKRPTADIVNFD------DRPG---DPQRLRRAIDALLQQADRYADAPLY 586
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLL 437
RYDLI+ R L+ A+ +++AY D A G QL+R +LV+ +D L+
Sbjct: 587 RYDLIEDARHYLSLQADRQLQTVVQAYNAGDFARGDAQLART-TQLVQGLDALV 639
>gi|375146756|ref|YP_005009197.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Niastella
koreensis GR20-10]
gi|361060802|gb|AEV99793.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Niastella
koreensis GR20-10]
Length = 1147
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 132/453 (29%), Positives = 204/453 (45%), Gaps = 42/453 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGP+PQS +D + +L +K++ R+ LG+ PV+P F G VP N A++
Sbjct: 191 MGNIEGWGGPMPQSQIDSRKILVQKMIARMQALGIEPVMPGFYGMVPHNF-NTKSKARVI 249
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
GNW + + +LD TD F + F E+ K YGR ++ D F E
Sbjct: 250 TQGNWGA------FIRPAILDPTDTAFDRVAGIFYEETKKLYGRNIRFFSGDPFHEGG-- 301
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ + + GA I MQ A+W++QGW + K LL L++
Sbjct: 302 ITNGVNLGKAGANIQKAMQQYFPGAIWVLQGW--------QDNPKKELLAETDKSALLIQ 353
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVG 239
+LF E W T + G P+IWC ++NF + G L+ A A T M G
Sbjct: 354 ELFGENTNNWETRNGYEGTPFIWCCVNNFGERPGLNGKLERYAGEVYRAATGPFREYMKG 413
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG+ EGI NP YDL+ E+ + ++ V+ WIN Y RYG++ I AW + T+
Sbjct: 414 VGIMPEGINNNPASYDLVLELGWHNQPVETGKWINDYVKARYGKANDQIATAWTLFLQTI 473
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y+ +++ A P + K VS LK Y
Sbjct: 474 YSNPGYQEGPPENILCARPALQV--------------KSVSSWGKLKKG----------Y 509
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
T+ + ++ F A+ S TY+ DLI+ TRQ L+ A+ +F +++ AY+ +
Sbjct: 510 DTALFEKGVQAFAAAAPLFGNSETYKIDLINFTRQVLSNRADTVFASLVTAYKEENTVAF 569
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 452
+ FL L + LL H + L + + A
Sbjct: 570 NAAAEAFLSLHALTNELLNSHSYYRLTSYQQQA 602
>gi|418515337|ref|ZP_13081518.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
gi|410708056|gb|EKQ66505.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
Length = 782
Score = 194 bits (494), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 138/474 (29%), Positives = 210/474 (44%), Gaps = 72/474 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 185 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 244
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 245 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 297
Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
V ++ G A+Y + + A W+MQ
Sbjct: 298 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 357
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 358 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 417
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG AF + + + + G G+ EG+ N V+Y+ + +A++
Sbjct: 418 YGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWEG 474
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ Y RYGRS A+ AW+ L +Y + P
Sbjct: 475 PQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 519
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
+ G Y + +P + A D P + RA++ + N + + Y
Sbjct: 520 WNKRAGAYLLFKRPTADIADFD------DRPG---DPQRLRRAIDALLQQANRYADAPLY 570
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLL 437
RYDLI+ R L+ A+ +++AY D A G QL+R +LV +D L+
Sbjct: 571 RYDLIEDARHYLSLQADRQLQAVVQAYDAGDFARGDAQLART-TQLVRGLDALV 623
>gi|403512485|ref|YP_006644123.1| alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain protein
[Nocardiopsis alba ATCC BAA-2165]
gi|402798758|gb|AFR06168.1| alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain protein
[Nocardiopsis alba ATCC BAA-2165]
Length = 718
Score = 194 bits (493), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 131/471 (27%), Positives = 215/471 (45%), Gaps = 45/471 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL + GP+P+SW++ L +++L R LGM PVLP F+G+VP +L ++
Sbjct: 160 MGNLDHFAGPMPRSWIEGHRELGRRVLERQRALGMTPVLPGFTGHVPPSLAPGRTGSRTW 219
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q T++L TDPL+ + +E Q KE T H Y D F E P
Sbjct: 220 QG------------LVTHVLVPTDPLYTTLCAEIVETQ-KELFDTDHQYAIDPFIEMIPV 266
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P + + A G+ D AVW +Q W FSY FW P +++A L+++P L +
Sbjct: 267 DSDPGFPGLVARATIEGLTRADPRAVWFLQTWPFSYQSDFWSPERVEAFLDAIPDDHLHL 326
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI----EMYGILDSIAFGPVEARTSENT 235
LDL+AE P WS F G P+ WC L NF G ++ G D I A E
Sbjct: 327 LDLWAEYDPQWSRFHAFGGTPWTWCALLNFGGRTDPMADLQGAADRIGAAKDSAHPPE-- 384
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-PAIQDAWNV 294
G+G+SME NP ++L+ + A+ + W+ + +RYG PA+ + W
Sbjct: 385 ---GIGLSMEATRNNPAFFELVVDQAWTRTGRVEEEWLPDFVAQRYGPGHDPALLEGWRG 441
Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
L TV + + FP+ ++++ + + + + + L++E ++
Sbjct: 442 LLRTVLGASG---------VRIFPEQFNGVLTL-----RPHYRHLEDSSALRAEVTAL-- 485
Query: 355 PHLWYSTSEVIRALELFIASG--NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
+WY +++ A E +A + L+ +DL+D+ L++ A+ +L ++E
Sbjct: 486 --VWYPWPDLLAAWERLVAGAETDPLAVEGPLGHDLVDVAMAVLSRVADHRYLEMVEHLD 543
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ L RFLE+ +D+D LL + W A A E +
Sbjct: 544 HHPELPEGDLE-RFLEVFDDLDALLETRPEYRYRTWEAKATSWATGTEDHR 593
>gi|418520969|ref|ZP_13087015.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410702945|gb|EKQ61442.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
Length = 798
Score = 194 bits (493), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 138/474 (29%), Positives = 210/474 (44%), Gaps = 72/474 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 313
Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
V ++ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG AF + + + + G G+ EG+ N V+Y+ + +A++
Sbjct: 434 YGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWEG 490
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ Y RYGRS A+ AW+ L +Y + P
Sbjct: 491 PQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 535
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
+ G Y + +P + A D P + RA++ + N + + Y
Sbjct: 536 WNKRAGAYLLFKRPTADIADFD------DRPG---DPQRLRRAIDALLQQANRYADAPLY 586
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLL 437
RYDLI+ R L+ A+ +++AY D A G QL+R +LV +D L+
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYDAGDFARGDAQLART-TQLVRGLDALV 639
>gi|21241480|ref|NP_641062.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. citri str. 306]
gi|21106823|gb|AAM35598.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. citri str. 306]
Length = 798
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 137/474 (28%), Positives = 209/474 (44%), Gaps = 72/474 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 313
Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
V ++ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASRAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG AF + + + + G G+ EG+ N V+Y+ + +A++
Sbjct: 434 YGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWEG 490
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ Y RYGRS A+ AW+ L +Y + P
Sbjct: 491 PQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 535
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
+ G Y + +P + D P + RA++ + N + + Y
Sbjct: 536 WNKRAGAYLLFKRPTADIVDFD------DRPG---DPQRLRRAIDALLRQANRYADAPLY 586
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLL 437
RYDLI+ R L+ A+ +++AY D A G QL+R +LV +D L+
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYDAGDFARGDAQLART-TQLVRGLDALV 639
>gi|381169859|ref|ZP_09879021.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380689629|emb|CCG35508.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 798
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 137/474 (28%), Positives = 209/474 (44%), Gaps = 72/474 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 313
Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
V ++ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG AF + + + + G G+ EG+ N V+Y+ + +A++
Sbjct: 434 YGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWES 490
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ Y RYGRS A+ AW+ L +Y + P
Sbjct: 491 PQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 535
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
+ G Y + +P + D P + RA++ + N + + Y
Sbjct: 536 WNKRAGAYLLFKRPTADIVDFD------DRPG---DPQRLRRAIDALLRQANRYADAPLY 586
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLL 437
RYDLI+ R L+ A+ +++AY D A G QL+R +LV +D L+
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDAQLART-TQLVRGLDALI 639
>gi|154321596|ref|XP_001560113.1| hypothetical protein BC1G_00945 [Botryotinia fuckeliana B05.10]
Length = 701
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 104/257 (40%), Positives = 153/257 (59%), Gaps = 7/257 (2%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG +P +W++ Q +LQKKI+ R+ ELG+ PVLPAF+G VP L+ V P+A I
Sbjct: 185 FGNIQGSWGGTIPLAWIEDQHLLQKKIVQRMVELGITPVLPAFTGFVPRDLRRVAPNANI 244
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
+W ++ T+L DPLF + F+ Q + YG SHIY D F+EN P
Sbjct: 245 INGSDWGNLFPFEYSNDTFLY-PIDPLFKTLQHTFLSLQSEYYGNVSHIYTLDQFNENLP 303
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-L 177
P Y+ ++ Y +QS DS+A W++QGWLF + FW +++A L VP + +
Sbjct: 304 ASGDPLYLGNISRGTYDSLQSFDSNATWMLQGWLFYAASSFWTQDRVEAYLGGVPKNESM 363
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTT 236
++LDLF+E P W + Q+YG P+IWC LH + G +YG + +I +EA R SE
Sbjct: 364 LILDLFSESFPEWENTHQYYGKPWIWCQLHGYGGTPGIYGQIYNITNSSIEAFRNSEK-- 421
Query: 237 MVGVGMSMEGIEQNPVV 253
MVG+G +MEG + N ++
Sbjct: 422 MVGMGNTMEGQDGNGLI 438
>gi|282877909|ref|ZP_06286718.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
35310]
gi|281299910|gb|EFA92270.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
35310]
Length = 717
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 133/480 (27%), Positives = 223/480 (46%), Gaps = 56/480 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W QQ+ LQ KI+ R+ ELGM+P+ PAF+G VP A P
Sbjct: 195 MGNLNQWDGPLSDAWHKQQITLQHKIISRMRELGMHPIAPAFAGFVPKAFAKKHPEINFK 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L W Y+L F ++G+ FIE+ +E+G ++ Y D+F+E P
Sbjct: 255 HL-RWGGFADS---LNAYVLPPESSYFKQLGKLFIEEWEREFGENTY-YLSDSFNEMKLP 309
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
V+ + ++ G AIY + +G+ A+W+ QGW F Y FW + ALL+ VP
Sbjct: 310 VNPNDEEEKCRLLAEYGKAIYQSINAGNPHAIWVTQGWTFGYQHDFWNRKSLSALLSQVP 369
Query: 174 LGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
+++++DL + + W FYG +I+ + NF G + G L+ A
Sbjct: 370 NDRMIIIDLGNDYPKWVWHTEQTWKRHNGFYGKQWIFSYVPNFGGKTLLTGDLEMYATDA 429
Query: 227 VEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
A ++ N +VG+G + EG+E N VVY+L+S+ A+ + +++ WI Y + RYG+
Sbjct: 430 SLALSAANKGNLVGIGSAPEGLENNEVVYELLSDAAWTDKGINLDEWIANYCMARYGKYP 489
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVD-PSIISVTEGKYQNYGKPVSKEAV 344
++ AWN +VY+ + ++P ++I T K
Sbjct: 490 DKMKAAWNGFRKSVYSS-----------LYSYPRFTWQTVIPDTRRK------------- 525
Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
S +D ++ +A+E F++ +EL + Y+ D I Q L A+ +
Sbjct: 526 -----SRHDLNETYF------KAVEDFLSCADELGGAKFYQDDAILFAAQYLGAKADIYY 574
Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
N + LN + + +EL+ D +LA H L W+ A+ +++ Q
Sbjct: 575 ENALRYGSLNKHVEANKQLSKAIELLLFADKILASHPTDRLDVWIAKARSQGHTPQEKNQ 634
>gi|390989490|ref|ZP_10259787.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
gi|372555759|emb|CCF66762.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
Length = 798
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 137/474 (28%), Positives = 209/474 (44%), Gaps = 72/474 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 313
Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
V ++ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG AF + + + + G G+ EG+ N V+Y+ + +A++
Sbjct: 434 YGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWEG 490
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ Y RYGRS A+ AW+ L +Y + P
Sbjct: 491 PQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 535
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
+ G Y + +P + D P + RA++ + N + + Y
Sbjct: 536 WNKRAGAYLLFKRPTADIVDFD------DRPG---DPQRLRRAIDALLRQANRYADAPLY 586
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLL 437
RYDLI+ R L+ A+ +++AY D A G QL+R +LV +D L+
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDAQLART-TQLVRGLDALV 639
>gi|325922205|ref|ZP_08183992.1| Alpha-N-acetylglucosaminidase (NAGLU) [Xanthomonas gardneri ATCC
19865]
gi|325547324|gb|EGD18391.1| Alpha-N-acetylglucosaminidase (NAGLU) [Xanthomonas gardneri ATCC
19865]
Length = 807
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 136/477 (28%), Positives = 210/477 (44%), Gaps = 78/477 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQ +IL R+ ELGM PVLPAF+G VP A P+A+I
Sbjct: 202 MGNIEGYRAPLPQQWIDSKRVLQTQILTRMRELGMQPVLPAFAGYVPKAFAQAHPNARIY 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 262 RMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQTYG-AGEFYLADAFNEMLPP 314
Query: 121 VD-------SPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
V + +Y S+ G A+Y + + A W+MQ
Sbjct: 315 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPQATWVMQ 374
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+P + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 375 GWLFGADREFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 434
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG AF + + + + G G+ EG+ N VVY+ + +A++
Sbjct: 435 YGASNPLYG---DFAFYRQDLQALLADPDKRNLRGFGVFPEGLHSNSVVYEYLYALAWEG 491
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ QY+ RYG S A+ AW+ D+D I
Sbjct: 492 PQQSWSQWLTQYTRARYGHSDAALLQAWS-------------------------DLDAGI 526
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSY----DHPHLWYSTSEVIRALELFIASGNELSA 380
+ + K + K T+ D P + RA++ + + +
Sbjct: 527 YQTRYWSLRWWNKRAGAYLLFKRPTADIVGFDDRPG---DPQRLRRAIDALLQQADRYAD 583
Query: 381 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLL 437
+ YRYDLI+ R L+ +A+ +++AY D L R LV+ +D L+
Sbjct: 584 APLYRYDLIEDARHYLSLHADRQLQAVVQAYGTGDFARGDALLARTTRLVQGLDALV 640
>gi|295690503|ref|YP_003594196.1| alpha-N-acetylglucosaminidase [Caulobacter segnis ATCC 21756]
gi|295432406|gb|ADG11578.1| Alpha-N-acetylglucosaminidase [Caulobacter segnis ATCC 21756]
Length = 770
Score = 191 bits (484), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 140/488 (28%), Positives = 211/488 (43%), Gaps = 68/488 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLP +W+D++ LQ +IL R+ LGM P+LPAF G VP A P A+I
Sbjct: 202 MGNIEGYRAPLPTNWIDKKKDLQVQILGRMRSLGMTPILPAFGGYVPKAFAQKNPKARIY 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF +I F+ + YG T Y D+F+E PP
Sbjct: 262 RMRPWEGFHE------TYWLDPADPLFAKIAGRFLALYTQTYG-TGTYYLADSFNEMLPP 314
Query: 121 VD-------------------------------SPEYISSLGAAIYSGMQSGDSDAVWLM 149
++ + +++ G AIY ++ DAVW+M
Sbjct: 315 INADGADARDAAYGDGAANTAATKTKVEVDPALKAQRLAAYGKAIYDSIRQARPDAVWVM 374
Query: 150 QGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLH 207
QGWLF D FW P + A L+ VP KL++LD+ + P +W +K F G P+I+ +H
Sbjct: 375 QGWLFGADSHFWDPTAISAYLSLVPDDKLMILDIGNDRYPAVWKNAKAFGGKPWIYGYVH 434
Query: 208 NFAGNIEMYGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEK 266
N+ G+ +YG LD P A E + G GM EG+ N +VYD + ++A+ +
Sbjct: 435 NYGGSNPVYGDLDYYRRDIPAIAANPEAGKLAGFGMFPEGLHNNSIVYDAVYDLAWGAGR 494
Query: 267 VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 326
+ AW++ Y+ RYG++ P + A L Y+ + P
Sbjct: 495 ESLSAWLSTYARARYGKTSPELDAALGQLVEAAYSTRYWS---------------PRWWK 539
Query: 327 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRAL-ELFIASGNELSASNTYR 385
G Y + +P + HP ++AL L A NE +
Sbjct: 540 SKAGAYLFFKRPTATIGEFPP------HPGDRAKLEAAVKALTALAPAYANE----PLFV 589
Query: 386 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 445
DL D TR ++L + AY+ D Q L +D LL L
Sbjct: 590 LDLTDATRHLATMKIDDLLQAAVAAYRRGDVASGDQARVEIAALALSIDKLLGVQPE-TL 648
Query: 446 GPWLESAK 453
W++ A+
Sbjct: 649 ATWIDDAR 656
>gi|315500594|ref|YP_004089396.1| Alpha-N-acetylglucosaminidase [Asticcacaulis excentricus CB 48]
gi|315418606|gb|ADU15245.1| Alpha-N-acetylglucosaminidase [Asticcacaulis excentricus CB 48]
Length = 765
Score = 191 bits (484), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 137/497 (27%), Positives = 216/497 (43%), Gaps = 71/497 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ P+PQ+W+ ++ LQ +IL R+ ELGM P+LPAF G VP A P A+I
Sbjct: 202 MGNIEGYLAPVPQAWIQKKHKLQSRILGRMKELGMTPILPAFGGYVPKAFAQKHPQARIY 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W TY LD DPLF +I FI + YG + Y D+F+E PP
Sbjct: 262 PMRPWEGFHE------TYWLDPADPLFAKIAARFIALYTETYGEGRY-YLADSFNEMLPP 314
Query: 121 VD-----------------------------SPEYISSLGAAIYSGMQSGDSDAVWLMQG 151
+ E +++ G AIY ++ DAVW MQG
Sbjct: 315 ISHDGSDVKNAKYGDSTANTKETETVVDPAVKAERLAAYGKAIYDSIRQARPDAVWTMQG 374
Query: 152 WLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHNF 209
WLF D FW P + A L VP KL++LD+ + P +W +S F G P+I+ +HN+
Sbjct: 375 WLFGADKHFWTPDAIGAFLRDVPQDKLMILDIGNDRYPGVWQSSNAFQGKPWIYGYVHNY 434
Query: 210 AGNIEMYGIL----DSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHE 265
+ +YG L D I + AR + + G G+ EG+ N +VY+ ++A+
Sbjct: 435 GASNPVYGDLGFYRDDIR--GLLAR-KDTGDLKGFGLFPEGLHNNSIVYEYAYDLAWGQA 491
Query: 266 KVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSII 325
V W+ Y RYG+ PA+ AW+ ++ + P
Sbjct: 492 NQTVTEWLTTYLKSRYGQVTPALILAWSTYVEAAFSTRYWS---------------PRWW 536
Query: 326 SVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 385
G Y +P + + HP ++ RA++ + S S YR
Sbjct: 537 RSKAGAYLLCKRPTADMVEFEG------HPG---DRKKLRRAIDALL-SLKGFGGSALYR 586
Query: 386 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 445
+D+ID R +++ ++ + ++AY+ D L + LV +D L+ L
Sbjct: 587 HDVIDAVRHLVSEEIDDRLIAAMKAYKSGDVKTGDGLREEVIALVTQVDTLMGAQPD-TL 645
Query: 446 GPWLESAKQLAQNEEQE 462
W++ A E++
Sbjct: 646 ASWIDEASAYGDTSEEK 662
>gi|147860882|emb|CAN83148.1| hypothetical protein VITISV_031934 [Vitis vinifera]
Length = 562
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 86/100 (86%), Positives = 94/100 (94%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLPQSWLDQQL+LQKKIL R+YELGM PVLPAFSGNVPAAL+ +FPSAKIT
Sbjct: 235 MGNLHGWGGPLPQSWLDQQLLLQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKIT 294
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK 100
+LGNWF+V +PRWCCTYLLDATDPLFIEIGRAFI+QQLK
Sbjct: 295 RLGNWFTVGGNPRWCCTYLLDATDPLFIEIGRAFIQQQLK 334
Score = 181 bits (459), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 82/92 (89%), Positives = 89/92 (96%)
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
DTFDENTPPVD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYDPFWRPPQMKALL+S
Sbjct: 429 DTFDENTPPVDDPEYISSLGAAIFKGMQSGDSNAIWLMQGWLFSYDPFWRPPQMKALLHS 488
Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW 203
VP+G+LVVLDLFAEVKPIW TS+QFYGVPYIW
Sbjct: 489 VPMGRLVVLDLFAEVKPIWITSEQFYGVPYIW 520
>gi|261880009|ref|ZP_06006436.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
gi|270333325|gb|EFA44111.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
Length = 722
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 135/468 (28%), Positives = 208/468 (44%), Gaps = 54/468 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ KI+ R+ LGM+P+ PAF+G VP P ++
Sbjct: 194 MGNLNSWNGPLTDAWQQGQITLQHKIIDRMRALGMHPIAPAFAGFVPEQFVEAHPGLQVK 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L W D R Y+L P F +IGR F+E+ KE+G+ + Y D+F+E P
Sbjct: 254 KL-TWGGF--DDR-LNAYVLSPESPYFKQIGRLFVEEWEKEFGKNT-FYQSDSFNEMEIP 308
Query: 121 VDSPEYISS------LGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
V+ + I G IY + + DAVW+ QGW F Y W ++ALL VP
Sbjct: 309 VEPGDSIGKWKLLEQYGDVIYRSIAEANPDAVWVTQGWTFGYQHKMWDSKSLQALLRHVP 368
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K IW T + +YG +++ + NF G G + A
Sbjct: 369 DDKMLIIDLANDYPKWIWKTQQTWKVQHGYYGKQWVFSYVPNFGGKTLPTGDMQMYASAS 428
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
EA SE MVG G + EGIE N V+Y+L+++M + + VD+ WI Y RYG
Sbjct: 429 AEALHHSERGNMVGFGSAPEGIENNDVIYELLADMGWTDKAVDLDLWIKDYCEARYGGYP 488
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
+Q AW + +VY + ++P ++ + VS A+
Sbjct: 489 SDMQKAWQCMLRSVYGS-----------LYSYPRFTWQTVTPDS-------RRVSTHALN 530
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ S H F+ +L +S YR D I L L A+ +
Sbjct: 531 DTFLSGVAH----------------FLRCARQLGSSPLYRSDAISLASLYLGTKADRHYT 574
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
++ + ++L+ D LLA H L W++ A+
Sbjct: 575 KALDLKASGKQQAASAELHQTIDLLTKADRLLASHPTHRLDRWIQFAR 622
>gi|404403947|ref|ZP_10995531.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
Length = 828
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 148/485 (30%), Positives = 231/485 (47%), Gaps = 71/485 (14%)
Query: 1 MSNLHGW-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
M N+ G G P PQ W + Q+ LQ +I+ R+ LGM PV F+G VP A++ + P +
Sbjct: 184 MGNMSGLDGAPTPQ-WHEAQIALQHRIIDRMEALGMTPVYQGFAGFVPPAMKRIHPETTL 242
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE-NT 118
T+ W K+ ++L DPLF EIG AF+ +E+G+ + Y D+F+E +
Sbjct: 243 TET-KWSGFKN-------WMLSPLDPLFSEIGTAFVRAWEEEFGKGKY-YLIDSFNEMDV 293
Query: 119 P--PVDSPEYISSL---GAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSV 172
P P SPE ++L G IY + + DAVW+MQGW+F Y W P ++ALL
Sbjct: 294 PFGPKGSPERAATLRHYGETIYRSLAEANPDAVWVMQGWMFGYQRNSWDPESVRALLEGA 353
Query: 173 PLGKLVVLDLFAEVKP-IWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
P G++++LDL + IW + K F+G +I+ + NF G + G L+ A G
Sbjct: 354 PDGRMMILDLAVDFNNFIWRSEKSWNHLQGFFGREWIYSTVPNFGGRTALIGNLEFYANG 413
Query: 226 PVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS 284
+EA +S N + G G S EG+E N +VY++++ + +++D+K +++ YS RYG
Sbjct: 414 HLEALSSPNRGRLTGYGTSPEGVESNEIVYEIIAAAGWSDDRIDLKKFLHDYSAARYGGC 473
Query: 285 VPAIQDAWNVLYHTVYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
I W+ + + YN CT+ A +Y+ +P S
Sbjct: 474 PEGIDRFWSGMLQSSYNECTNNA------------------------RYRWQLRPYSHRM 509
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
+Y A+E F+A EL + YR D I LA A+ L
Sbjct: 510 PTMGINENY------------YTAIEQFLACAGELGGNELYRTDAIQYAALYLASKADML 557
Query: 404 FLNIIEAYQLNDAHG----VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
+EA D +G + + R EL+ D D LLA H L W A++ E
Sbjct: 558 ----LEAANWADLYGAREEAYDCAMRIEELLLDADRLLASHPLLRLDRWSGMARKAGCTE 613
Query: 460 EQEKQ 464
E++++
Sbjct: 614 EEKER 618
>gi|210611122|ref|ZP_03288736.1| hypothetical protein CLONEX_00926, partial [Clostridium nexile DSM
1787]
gi|210152109|gb|EEA83116.1| hypothetical protein CLONEX_00926 [Clostridium nexile DSM 1787]
Length = 1662
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 134/480 (27%), Positives = 224/480 (46%), Gaps = 65/480 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL G+GGP+ SW ++ L +K + + +LGM PVL +SG VP + + PSA++
Sbjct: 685 MANLSGFGGPVHDSWFTERTELARKNQLIMRKLGMQPVLQGYSGMVPVDITDKDPSAQVI 744
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ G W S + +L F + + F + Q + YG S Y D F E NT
Sbjct: 745 KQGTWCSFQRPS------MLKTDSETFDKYAQLFYKVQKEVYGDVSDYYATDPFHEGGNT 798
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK-- 176
+ SP I+ + + M D + +W++Q W+ ALL + +
Sbjct: 799 GGM-SPTVIAE---KVLANMMEADENGIWIIQS--------WQGNPSTALLQGLDAARDH 846
Query: 177 LVVLDLFAEVKPIWSTSK-----------QFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
+VLDL+AE P W+ + +F P+++CML+NF G + ++G +++ G
Sbjct: 847 ALVLDLYAEKTPHWNETDPGSYGGAEGGGEFLNTPWVYCMLNNFGGRLGLHGHIENFVNG 906
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHE-----KVDVKAWINQYSVRR 280
+A N M G+G++ E NPV+YDL E + + +++ W Y+ RR
Sbjct: 907 VAQAAAQAN-HMAGIGITPEASVNNPVLYDLFFETIWSDDGENLSAINLDEWFKDYTTRR 965
Query: 281 YGRSVPAIQDAWNVLYHTVYNCTDGATDKN--RDVIVAFPDVDPSIISVTEGKYQNYGKP 338
YG + +A +L TVYN + V+ A P +D G +G
Sbjct: 966 YGAESQSAYEAMQILNDTVYNPEMNMKGQGAPESVVNARPGLDI-------GAASTWGNA 1018
Query: 339 VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAK 398
V + Y +E+ +A L + ++L S Y+YDL ++ Q L+
Sbjct: 1019 V-----------------IDYDKAELEKAAALLLKDYDKLKDSAGYQYDLANVLEQVLSN 1061
Query: 399 YANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
A E + +A++ DA ++S FLE++ ++ + + F+LG WLESAK LA+N
Sbjct: 1062 TAQEYQKKMADAFREGDAEKFEKMSNSFLEIITKVEEVTGTQEEFMLGTWLESAKALAKN 1121
>gi|294667089|ref|ZP_06732314.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
str. ICPB 10535]
gi|292603099|gb|EFF46525.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
str. ICPB 10535]
Length = 798
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 142/479 (29%), Positives = 213/479 (44%), Gaps = 73/479 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWTDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQTYG-AGEFYLADAFNEMLPP 313
Query: 121 -------VDSPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
V + +Y S+ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADREFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG AF ++A ++ + G G+ EG+ N V+Y + +A++
Sbjct: 434 YGASNPLYG---DFAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVIYAYLYALAWEG 490
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ Y RYGRS A+ AW L +Y + P
Sbjct: 491 PQQSWSQWLTHYLRARYGRSDAALLGAWADLEAGIYQTRYWS---------------PRW 535
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
+ G Y + +P + D P + RA++ + N + + Y
Sbjct: 536 WNKRAGAYLLFKRPTADIVDFD------DRPG---DPQRLRRAIDALLQQANRYADAPLY 586
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLA-CHD 441
RYDLI+ R L+ A+ +++AY D A G QL+R +LV +D L+ HD
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDAQLART-TQLVRGLDALVGDQHD 644
>gi|345881765|ref|ZP_08833275.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
gi|343918424|gb|EGV29187.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
Length = 1552
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 130/468 (27%), Positives = 225/468 (48%), Gaps = 43/468 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGGP+ ++ ++ + LQ+K+L R+ LG+ PV+ F G VP+ + FP+A++
Sbjct: 196 MGNLEGWGGPMSEALIEARYQLQRKMLQRMQALGIQPVVQGFPGLVPSFFKERFPAAQLV 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P LL + LF ++ +A+ E ++ YGR D F E NT
Sbjct: 256 LQGRWGHFNRPP-----MLLPSDKDLFQQVAKAYYESLIRCYGRDFKFLGGDLFHEGGNT 310
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD +++ + S A W++QGW + P LL+ + ++
Sbjct: 311 KGVDVAATAAAVQQTMLRYFPS----AKWVLQGWNNNPSP--------TLLSKLDKQHVL 358
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART-SENTTM 237
+++L E+ W +S +F G P++W +++F G +M G L + P A + ++N M
Sbjct: 359 LINLSGEIAASWESSNEFGGTPWLWGSVNHFGGKTDMGGQLPVLVAEPHRAFSQTKNGVM 418
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G+G+ EGI NPVVYDL + A+ D+ + Y RYG ++ AW++L H
Sbjct: 419 QGIGILPEGINSNPVVYDLALKTAWYTTTPDLDRLLRDYIAYRYGHVDESLVQAWHILSH 478
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
+VY EG +++ ++ + + S++ +
Sbjct: 479 SVYG---------------------EFKIKGEGTFESIF--CARPGLHVTSVSTWGPKQM 515
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y+ ++ +AL LF ++ S TY+YDL+DL RQ +A +A +++ ++AY+ DA
Sbjct: 516 QYNPKDLEKALGLFRRVADQYKGSATYQYDLVDLARQVMANHARDVYAAAMQAYRNKDAA 575
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
+ + + F+ L++ D LL FLLG WL A ++Q
Sbjct: 576 LLHEKGQEFMHLLQLQDRLLQTDTHFLLGNWLAQAANYGVTAADKQQA 623
>gi|296115989|ref|ZP_06834611.1| alpha-N-acetylglucosaminidase [Gluconacetobacter hansenii ATCC
23769]
gi|295977458|gb|EFG84214.1| alpha-N-acetylglucosaminidase [Gluconacetobacter hansenii ATCC
23769]
Length = 758
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 128/475 (26%), Positives = 220/475 (46%), Gaps = 58/475 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGP+P+ ++++ V ++I+ R+ ELGM PVLP F G VP FP A +
Sbjct: 222 MANMCCYGGPVPRELIEKRAVSAQQIIGRMRELGMRPVLPGFYGMVPDDFGKRFPQAHVI 281
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD DP+F ++ + ++Q K +G + +Y+ F E P
Sbjct: 282 GQGEWNRFRR-PAW-----LDPRDPMFAKVAAIYYDEQKKLFG-DAPVYDIQPFQEGGTP 334
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P ++ G I + + A+W++ W D +L V +L ++
Sbjct: 335 GDVP--LADAGQGIQKALDTAHPGAMWMLMAWYEEPD--------ARMLAGVDRKRLFIV 384
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG---PVEARTSENTTM 237
DL + + F G P+++ L +F G + G S +G P RT +N M
Sbjct: 385 DLEQNTRVRENRDADFQGAPFLYGGLWDFGGRTSLGG--SSYDYGVRLPGLWRTQKN--M 440
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA-IQDAWNVLY 296
+G + EG++ NP ++DL +E A++ + VD W Y+ RRYG+ + AW++L
Sbjct: 441 IGTAVFPEGMDNNPYIFDLFTEAAWRRDGVDTTQWTRDYADRRYGQPGDVHARKAWDLLL 500
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKP-VSKEAVLKSETSSYDHP 355
H+ + S Q++G+ + +++ ++ S H
Sbjct: 501 HSAF-------------------------SYRATGIQDFGEASAAPDSLFNAQPSLDTHS 535
Query: 356 HLW-------YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
W Y V A+ + + + A+ YRYDL+D+TRQA+A A + I
Sbjct: 536 AAWNGMKVLPYDPHLVEAAMAELLQASDATRATEAYRYDLVDVTRQAVANQARAMLPQIG 595
Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+A+ D + L+ R+LEL++ D LLA + F +G WL + + + Q K
Sbjct: 596 DAFAARDRAKLHALTTRWLELMDRQDSLLATNTFFRVGTWLSWPQAWSDDPAQRK 650
>gi|294627661|ref|ZP_06706243.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
str. ICPB 11122]
gi|292598013|gb|EFF42168.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
str. ICPB 11122]
Length = 798
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 139/479 (29%), Positives = 213/479 (44%), Gaps = 73/479 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ LPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRASLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQTYG-AGEFYLADAFNEMLPP 313
Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
V ++ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSVANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADREFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG AF ++A ++ + G G+ EG+ N V+Y+ + +A++
Sbjct: 434 YGASNPLYG---DFAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVIYEYLYALAWEG 490
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ Y RYGRS A+ AW L +Y + P
Sbjct: 491 PQQSWSQWLTHYLRARYGRSDAALLGAWADLEAGIYQTRYWS---------------PRW 535
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
+ G Y + +P + ++ + D L RA++ + N + + Y
Sbjct: 536 WNKRAGAYLLFKRPTAD--IVDFDDCPGDPQRL-------RRAIDALLQQANRYADAPLY 586
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLAC-HD 441
RYDLI+ R L+ A+ +++AY D A G QL+R +LV +D L+ HD
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDAQLART-TQLVRGLDALVGGQHD 644
>gi|197302378|ref|ZP_03167435.1| hypothetical protein RUMLAC_01107 [Ruminococcus lactaris ATCC 29176]
gi|197298557|gb|EDY33100.1| F5/8 type C domain protein [Ruminococcus lactaris ATCC 29176]
Length = 1655
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 133/482 (27%), Positives = 224/482 (46%), Gaps = 65/482 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL G+GGP+ +W ++ L +K + + +LGM PVL +SG VP + + PSA++
Sbjct: 680 MANLSGYGGPVHDTWFTERTELARKNQLIMRKLGMQPVLQGYSGMVPVDITSKDPSAEVI 739
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ G W S + +L F + F + Q + YG ++H Y D F E NT
Sbjct: 740 KQGTWCSFQRPS------MLRTDSESFTKYAALFYKVQKEVYGDSAHYYATDPFHEGGNT 793
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK-- 176
+DS + + + + M + D A W++Q W + ALL + +
Sbjct: 794 GGMDS----AVISQKVLASMMTADPHATWVIQSW--------QGNPTTALLQGLGDNRDH 841
Query: 177 LVVLDLFAEVKPIWSTSK-----------QFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
+VLDL+AE P W+ + +F P+++CML+NF G + ++G +D+ G
Sbjct: 842 ALVLDLYAEKTPHWNETNPGYYGGAEGGGEFLNTPWVYCMLNNFGGRLGLHGHIDNYVEG 901
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH-----EKVDVKAWINQYSVRR 280
V A + + M G+G++ E NPV+YDL E + +K+++ W Y RR
Sbjct: 902 IVNA-SKQAEHMAGIGITPEASVNNPVLYDLFFETIWADDGNNLQKINLDEWFKNYVTRR 960
Query: 281 YGRSVPAIQDAWNVLYHTVYNCTDGATDKN--RDVIVAFPDVDPSIISVTEGKYQNYGKP 338
YG + A +L+ TVYN + V+ A P +D G +G
Sbjct: 961 YGADSDSAYQAMEILHDTVYNPAYNMKGQGAPESVVNARPGLDI-------GAASTWGNA 1013
Query: 339 VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAK 398
V Y ++ +A EL +A ++L S Y+YDL ++ Q L+
Sbjct: 1014 VVD-----------------YDKKKLEKAAELLLADYDKLKNSAGYQYDLANVLEQVLSN 1056
Query: 399 YANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
A E + A++ DA LS +FL +++ ++ + FL+G W+ AK+LA+N
Sbjct: 1057 TAQEYQKKMAAAFRSGDAEEFSTLSDKFLSIIDMVEKVTGTQKEFLVGTWINGAKKLAKN 1116
Query: 459 EE 460
+
Sbjct: 1117 SD 1118
>gi|225875033|ref|YP_002756492.1| alpha-N-acetylglucosaminidase [Acidobacterium capsulatum ATCC
51196]
gi|225793771|gb|ACO33861.1| alpha-N-acetylglucosaminidase [Acidobacterium capsulatum ATCC
51196]
Length = 800
Score = 187 bits (476), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 130/468 (27%), Positives = 215/468 (45%), Gaps = 40/468 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL + P+ +S LD+++ ++I+ R+ ELG+ PV P + G VP P A +
Sbjct: 241 MGNLCCFDEPISRSLLDRRIRSAQQIIRRLRELGITPVFPGYFGMVPEDFARRHPGAHVI 300
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
GNW + P W LD DPLF + +F + Q + +G +S IY+ + F E
Sbjct: 301 PQGNWNGFRR-PAW-----LDPRDPLFAAVAASFYKHQQELFGDSS-IYDIELFQEGGSA 353
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P +SS AI + A+W+ W +P +ALL++V L+V+
Sbjct: 354 ADVP--VSSAAKAIQKALLRAHPQAMWMTLAW--QNNP------SRALLSAVDRSHLLVV 403
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
D+ P + + F G Y++ L +F G + L A +TM G
Sbjct: 404 DIDQGRTPHENRERDFMGAAYLFGGLWDFGGRTTLGANLYDYAVRLPRMGLRAGSTMKGT 463
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
+ EG++ NP +DL +EMA++ VD++ W +Y+ RRYG P + AW +L T Y
Sbjct: 464 ALFSEGLDNNPAAFDLFTEMAWRTSPVDLRTWSREYARRRYGMDDPHTRRAWRILMETAY 523
Query: 301 NC-TDGATDKN-RDVIV-AFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
DG ++ RD + D PS+ +V+ SS+ L
Sbjct: 524 GTRADGVSNHGERDAPPESLFDAQPSLDAVS--------------------ASSWSPDRL 563
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y + AL + + + TY+YDL+D+ RQ LA ++ + I +AY
Sbjct: 564 RYDPKKFEAALTELLQAPPGMREMPTYQYDLVDVARQTLANWSRKTLPEIKDAYDHRHEA 623
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
L +++L ++ D LLA + F++GPWL + A +++++
Sbjct: 624 RFETLEKQWLCMMMLQDKLLATNTSFMVGPWLNAVSPWAATATEQRRL 671
>gi|294648124|ref|ZP_06725667.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
gi|292636508|gb|EFF54983.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
Length = 499
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 125/425 (29%), Positives = 199/425 (46%), Gaps = 42/425 (9%)
Query: 35 MNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAF 94
M PVLPAF+G+VPA L+ ++P A I LG W R C +L + D LF +I + F
Sbjct: 1 MKPVLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLF 57
Query: 95 IEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 154
+++Q K +G T HIY D F+E PP PEY+ + + +Y+ + + D A W+ W+F
Sbjct: 58 LDEQKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMF 116
Query: 155 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 213
+D W +MKALL VP K+++LD E +W ++ F+ PYIWC L NF GN
Sbjct: 117 YFDKDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNT 176
Query: 214 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 273
+ G + A + + G+G ++EG++ Y+ + E A+ + VD WI
Sbjct: 177 TLTGNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWI 235
Query: 274 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 333
+ R G ++DAW L++ +Y V P T G
Sbjct: 236 ECLADRHVGCVSQPVRDAWKRLFNDIY--------------VQVP--------RTLGTLP 273
Query: 334 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDL 391
Y +P + K ++ Y + L EV R L NE + + +R DLI +
Sbjct: 274 GY-RPALNKNSEKRTSNVYSNVEL----LEVWRKL-------NEAPSDRRDAFRLDLITV 321
Query: 392 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 451
RQ L Y ++ + + D + + E++ D+D L A H L W++
Sbjct: 322 GRQVLGNYFLDVKMEFDRMVEAKDHQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDD 381
Query: 452 AKQLA 456
A+++
Sbjct: 382 ARKMG 386
>gi|126347839|emb|CAJ89559.1| putative alpha-N-acetylglucosaminidase [Streptomyces ambofaciens
ATCC 23877]
Length = 740
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 122/462 (26%), Positives = 204/462 (44%), Gaps = 40/462 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGP+ + L+ + L ++I R+ +LGM PVLP + G VP P +
Sbjct: 213 MQNMSGFGGPVSERLLEDRADLGRRIADRLRQLGMTPVLPGYYGTVPPGFTERNPVGPVV 272
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W + P W LD +F + AF Q + +G TS +Y D E P
Sbjct: 273 PQGDWVGFER-PDW-----LDPRSAVFPRVAAAFYRHQRELFG-TSTMYKMDLLHEGGRP 325
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P + A+ +Q+ AVW + GW + P + +++++ +L+++
Sbjct: 326 GNVP--VRDAAQAVMKALQTARPGAVWTLIGWQNN-------PSTQ-IIDAIDKRRLLIV 375
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
D ++ ++G PY + + NF G+ M A + RT + + G+
Sbjct: 376 DGLSDRYDGLDREATWHGAPYAFGTIPNFGGHTTMGANTAVWAERFDQWRTKAGSALAGI 435
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
EG NPV Y+L +E+A++ E VD + W +Y+ RRYG + P AW +L Y
Sbjct: 436 AYMPEGTGGNPVAYELFTELAWRTEPVDQRKWFAEYAQRRYGGADPHAASAWELLRSGPY 495
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ G +++D + ++ + + +S+ + Y
Sbjct: 496 STPSGTWSESQDSLF-----------------------TARPRLTATNAASWSPGAMRYD 532
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
V RAL + L A++ YR+DL+D+ RQ LA + L I AY D
Sbjct: 533 PGTVRRALTELVRVAPALRATDAYRFDLVDVARQVLANRSRTLLPQIKAAYDAEDLPRFR 592
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ + + +D LLA FLLGPWLE AK + E +
Sbjct: 593 ARAAEWKNCLSLLDRLLATDARFLLGPWLEDAKSWGRTEAER 634
>gi|294812279|ref|ZP_06770922.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
27064]
gi|294324878|gb|EFG06521.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
27064]
Length = 1086
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 125/465 (26%), Positives = 210/465 (45%), Gaps = 48/465 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ +GGP+ + LD+++ L ++I+ R+ LGM PV+P + G VP P A++
Sbjct: 257 LQNMSEYGGPVSPALLDRRIELGQRIVTRMRRLGMRPVVPGYFGTVPDGFVARNPGARVI 316
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD P+F EI A+ Q + +G H + D E P
Sbjct: 317 PQGVWNGLPR-PDW-----LDPRTPVFAEIAAAYYRHQEELFGEIDH-FKMDLLHEGGTP 369
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + A+ + +++ A W++ GW + P ALL+++ K++++
Sbjct: 370 GDVP--VPDAARAVETALRAARPAATWVILGWQSNPRP--------ALLDAIDTSKVLIV 419
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
D +++ + ++ G PY + + NF G + D R N+ +VG
Sbjct: 420 DGLSDLDTVRDREAEWGGAPYAFGTIPNFGGRTTIGANTDRWTEKFTAWRDKPNSALVGT 479
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
E +++P +L +E+A++ EK+D AW Y+ RYG PA ++A+ L T Y
Sbjct: 480 AYMPEAADRDPAALELFTELAWRREKIDRSAWFAGYAQFRYGAKDPAAEEAFAALAGTAY 539
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
T T R + F + + ++ S +++D
Sbjct: 540 QLT---TTDGRPIDSLF---------------------LRRPSMSSSVATAFDQ------ 569
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+ R + EL S+ YRYDL DL RQALA + L L + AY D
Sbjct: 570 -AAFDRGFAALLRVNEELRGSDAYRYDLTDLARQALALRSRTLQLALRAAYATKDVTAFR 628
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
++ +L L+ D + CH FLLGPWLE AK+ A + E+ ++
Sbjct: 629 GVAALWLRLMRLADTVAGCHKAFLLGPWLEEAKRFATSTEEAVEL 673
>gi|326440885|ref|ZP_08215619.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
27064]
Length = 1038
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 125/465 (26%), Positives = 210/465 (45%), Gaps = 48/465 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ +GGP+ + LD+++ L ++I+ R+ LGM PV+P + G VP P A++
Sbjct: 209 LQNMSEYGGPVSPALLDRRIELGQRIVTRMRRLGMRPVVPGYFGTVPDGFVARNPGARVI 268
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD P+F EI A+ Q + +G H + D E P
Sbjct: 269 PQGVWNGLPR-PDW-----LDPRTPVFAEIAAAYYRHQEELFGEIDH-FKMDLLHEGGTP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + A+ + +++ A W++ GW + P ALL+++ K++++
Sbjct: 322 GDVP--VPDAARAVETALRAARPAATWVILGWQSNPRP--------ALLDAIDTSKVLIV 371
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
D +++ + ++ G PY + + NF G + D R N+ +VG
Sbjct: 372 DGLSDLDTVRDREAEWGGAPYAFGTIPNFGGRTTIGANTDRWTEKFTAWRDKPNSALVGT 431
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
E +++P +L +E+A++ EK+D AW Y+ RYG PA ++A+ L T Y
Sbjct: 432 AYMPEAADRDPAALELFTELAWRREKIDRSAWFAGYAQFRYGAKDPAAEEAFAALAGTAY 491
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
T T R + F + + ++ S +++D
Sbjct: 492 QLT---TTDGRPIDSLF---------------------LRRPSMSSSVATAFDQ------ 521
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+ R + EL S+ YRYDL DL RQALA + L L + AY D
Sbjct: 522 -AAFDRGFAALLRVNEELRGSDAYRYDLTDLARQALALRSRTLQLALRAAYATKDVTAFR 580
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
++ +L L+ D + CH FLLGPWLE AK+ A + E+ ++
Sbjct: 581 GVAALWLRLMRLADTVAGCHKAFLLGPWLEEAKRFATSTEEAVEL 625
>gi|317501265|ref|ZP_07959469.1| hypothetical protein HMPREF1026_01412 [Lachnospiraceae bacterium
8_1_57FAA]
gi|316897332|gb|EFV19399.1| hypothetical protein HMPREF1026_01412 [Lachnospiraceae bacterium
8_1_57FAA]
Length = 1847
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 134/484 (27%), Positives = 226/484 (46%), Gaps = 70/484 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL G+GGP+ SW +++ L +K + + +LGM PVL +SG VP + + +A++
Sbjct: 658 MANLSGFGGPVHDSWFEERTELARKNQLIMRKLGMQPVLQGYSGMVPTNIHDYDKNAEVI 717
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ G W S + +L T F + + F + Q + YG S+ Y D F E
Sbjct: 718 EQGEWCSFQRPT------MLKTTSSTFEKYAKKFYQCQKEVYGDVSNYYATDPFHEG--G 769
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP-----LG 175
+ S + + + M + D DAVW++Q W + ALLN +
Sbjct: 770 ITGGMNASDISEKVLTEMITADKDAVWIIQSW--------QGNPTTALLNGLDRVEKGTD 821
Query: 176 KLVVLDLFAEVKP----------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
++LDL+AE P + ++F P+++CML+NF G + ++G LD++A
Sbjct: 822 HALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWLFCMLNNFGGRLGLHGHLDNLA-N 880
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH------EKVDVKAWINQYSVR 279
+ +E + G+G++ E NPV+YD + E +Q E +D+ W++ Y+ R
Sbjct: 881 NIPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIWQDDASQKMEVIDLDTWLDDYATR 940
Query: 280 RYGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 334
RYG + AW++L TVY + GA + V+ A P++ T G
Sbjct: 941 RYGAESESANQAWDILKETVYKASLNGLGQGAPES---VVNARPNL-------TIGAAST 990
Query: 335 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 394
+G V + Y ++ A L +A ++L S Y+YDL ++ +Q
Sbjct: 991 WGNAV-----------------ISYEKGDLEEAAALLLADYDKLKDSAGYQYDLANVLQQ 1033
Query: 395 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 454
L+ A E + A+ D S +F+ ++EDM+ + + FLLG W+E AK
Sbjct: 1034 VLSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIEDMEKVTGTSEYFLLGRWVEQAKA 1093
Query: 455 LAQN 458
LA N
Sbjct: 1094 LANN 1097
>gi|118370728|ref|XP_001018564.1| alpha-N-acetylglucosaminidase precursor [Tetrahymena thermophila]
gi|89300331|gb|EAR98319.1| alpha-N-acetylglucosaminidase precursor [Tetrahymena thermophila
SB210]
Length = 879
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 152/495 (30%), Positives = 236/495 (47%), Gaps = 53/495 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GGP+ Q+++D Q LQKKIL R+ LGM P+L F G VP +L+ FP +KI
Sbjct: 218 MGNLEGYGGPVTQAYIDGQYNLQKKILKRMRNLGMQPILQGFYGMVPNSLKAKFPLSKIY 277
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN--T 118
+W + LDA D LF I F + K YGR + Y D F E
Sbjct: 278 GDQSWLGFRRPA------FLDANDELFSNIANIFYSESEKLYGR-AKFYGGDPFHEGAIV 330
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDS----DAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
P ++ ++S +IY MQ D+ W++Q W + P Q LL +
Sbjct: 331 PGLN----LTSQAQSIYRAMQYTDNPKDEKVKWILQSWQEN------PSQQ--LLQGLQN 378
Query: 175 GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN 234
+ ++LDL AE + W T+ F G ++W L NF I YG+++ P A + +N
Sbjct: 379 DECIILDLMAEARSKWQTND-FSGHDFLWTSLPNFGLRIGQYGMIEQYVSQPPLAYSIKN 437
Query: 235 TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVD--------VKAWINQYSVRRYG-RSV 285
+TM G+G EGI N + Y+++ + A+ D V ++ + RYG ++
Sbjct: 438 STMKGIGSIPEGILTNVLDYEILFDKAWIQPNQDTNLTPRQQVLQYLGDFIRYRYGEQNN 497
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA-- 343
+ AW++L +++YN T+ + V++A P +S Y + EA
Sbjct: 498 KNLFSAWSLLTNSIYNSTNPWDGPSESVMLARPASYIDKVSSWGTSYIYWNTTNVLEAWK 557
Query: 344 ----VLKSETSSYDHPHLWYSTSEVIRAL-------ELFIA-SGNELSA--SNTYRYDLI 389
+K + HL E+ + L E F+ S NE +T+ YDL+
Sbjct: 558 LFTNYVKEKKQKNRSQHL-QKLEEINKKLGRSDDDMEAFVEISQNEERNIFKDTFLYDLV 616
Query: 390 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 449
D+ RQ LA Y+ L+ ++ A+ D S++FLEL++D D LL+ F+LG +L
Sbjct: 617 DVARQNLASYSYLLYNKVMLAFNQTDTIKFALYSQQFLELIKDQDQLLSSRKEFMLGYYL 676
Query: 450 ESAKQLAQNEEQEKQ 464
ES +L +QEKQ
Sbjct: 677 ESVSKLG-TTDQEKQ 690
>gi|153814573|ref|ZP_01967241.1| hypothetical protein RUMTOR_00787 [Ruminococcus torques ATCC 27756]
gi|331089988|ref|ZP_08338878.1| hypothetical protein HMPREF1025_02461 [Lachnospiraceae bacterium
3_1_46FAA]
gi|145848067|gb|EDK24985.1| F5/8 type C domain protein [Ruminococcus torques ATCC 27756]
gi|330402902|gb|EGG82468.1| hypothetical protein HMPREF1025_02461 [Lachnospiraceae bacterium
3_1_46FAA]
Length = 1863
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 134/484 (27%), Positives = 226/484 (46%), Gaps = 70/484 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL G+GGP+ SW +++ L +K + + +LGM PVL +SG VP + + +A++
Sbjct: 674 MANLSGFGGPVHDSWFEERTELARKNQLIMRKLGMQPVLQGYSGMVPTNIHDYDKNAEVI 733
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ G W S + +L T F + + F + Q + YG S+ Y D F E
Sbjct: 734 EQGEWCSFQRPT------MLKTTSSTFEKYAKKFYQCQKEVYGDVSNYYATDPFHEG--G 785
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP-----LG 175
+ S + + + M + D DAVW++Q W + ALLN +
Sbjct: 786 ITGGMNASDISEKVLTEMITADKDAVWIIQSW--------QGNPTTALLNGLDRVEKGTD 837
Query: 176 KLVVLDLFAEVKP----------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
++LDL+AE P + ++F P+++CML+NF G + ++G LD++A
Sbjct: 838 HALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWLFCMLNNFGGRLGLHGHLDNLA-N 896
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH------EKVDVKAWINQYSVR 279
+ +E + G+G++ E NPV+YD + E +Q E +D+ W++ Y+ R
Sbjct: 897 NIPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIWQDDASQKMEVIDLDTWLDDYATR 956
Query: 280 RYGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 334
RYG + AW++L TVY + GA + V+ A P++ T G
Sbjct: 957 RYGAESESANQAWDILKETVYKASLNGLGQGAPES---VVNARPNL-------TIGAAST 1006
Query: 335 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 394
+G V + Y ++ A L +A ++L S Y+YDL ++ +Q
Sbjct: 1007 WGNAV-----------------ISYEKGDLEEAAALLLADYDKLKDSAGYQYDLANVLQQ 1049
Query: 395 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 454
L+ A E + A+ D S +F+ ++EDM+ + + FLLG W+E AK
Sbjct: 1050 VLSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIEDMEKVTGTSEYFLLGRWVEQAKA 1109
Query: 455 LAQN 458
LA N
Sbjct: 1110 LANN 1113
>gi|336439030|ref|ZP_08618649.1| hypothetical protein HMPREF0990_01043 [Lachnospiraceae bacterium
1_1_57FAA]
gi|336017072|gb|EGN46842.1| hypothetical protein HMPREF0990_01043 [Lachnospiraceae bacterium
1_1_57FAA]
Length = 1863
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 134/484 (27%), Positives = 226/484 (46%), Gaps = 70/484 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL G+GGP+ SW +++ L +K + + +LGM PVL +SG VP + + +A++
Sbjct: 674 MANLSGFGGPVHDSWFEERTELARKNQLIMRKLGMQPVLQGYSGMVPTNIHDYDKNAEVI 733
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ G W S + +L T F + + F + Q + YG S+ Y D F E
Sbjct: 734 EQGEWCSFQRPT------MLKTTSSTFEKYAKKFYQCQKEVYGDVSNYYATDPFHEG--G 785
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP-----LG 175
+ S + + + M + D DAVW++Q W + ALLN +
Sbjct: 786 ITGGMNASDISEKVLTEMITADKDAVWIIQSW--------QGNPTTALLNGLDRVEKGTD 837
Query: 176 KLVVLDLFAEVKP----------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
++LDL+AE P + ++F P+++CML+NF G + ++G LD++A
Sbjct: 838 HALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWLFCMLNNFGGRLGLHGHLDNLA-N 896
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH------EKVDVKAWINQYSVR 279
+ +E + G+G++ E NPV+YD + E +Q E +D+ W++ Y+ R
Sbjct: 897 NIPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIWQDDASQKMEVIDLDTWLDDYATR 956
Query: 280 RYGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 334
RYG + AW++L TVY + GA + V+ A P++ T G
Sbjct: 957 RYGAESESANQAWDILKETVYKASLNGLGQGAPES---VVNARPNL-------TIGAAST 1006
Query: 335 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 394
+G V + Y ++ A L +A ++L S Y+YDL ++ +Q
Sbjct: 1007 WGNAV-----------------ISYEKGDLEEAAALLLADYDKLKDSAGYQYDLANVLQQ 1049
Query: 395 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 454
L+ A E + A+ D S +F+ ++EDM+ + + FLLG W+E AK
Sbjct: 1050 VLSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIEDMEKVTGTSEYFLLGRWVEQAKA 1109
Query: 455 LAQN 458
LA N
Sbjct: 1110 LANN 1113
>gi|331092442|ref|ZP_08341267.1| hypothetical protein HMPREF9477_01910 [Lachnospiraceae bacterium
2_1_46FAA]
gi|330401285|gb|EGG80874.1| hypothetical protein HMPREF9477_01910 [Lachnospiraceae bacterium
2_1_46FAA]
Length = 1598
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 130/479 (27%), Positives = 224/479 (46%), Gaps = 66/479 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL G+GGP+ SW +++ L +K + + LGM PVL +SG VP ++ SA++
Sbjct: 680 MANLSGFGGPIHDSWFEERTELARKNQLSMRRLGMQPVLQGYSGMVPTNIREKDSSAEVI 739
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ G W S + +L F + + F + Q + YG ++H Y D F E
Sbjct: 740 EQGTWCSFRRPD------MLKTDSASFDKYAKLFYQAQKEVYGESAHYYATDPFHEGGDT 793
Query: 121 VD-SPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK--L 177
+P I+ + M D D +W++Q W+ ALL + K
Sbjct: 794 GGLNPTVIA---GKVLDAMLEADKDGIWIIQS--------WQGNPTTALLKGLEGRKEHA 842
Query: 178 VVLDLFAEVKPIWSTSK-------QFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
+VLDL+AE P W+ + +F P+++CML+NF G + ++G LD++A + A
Sbjct: 843 LVLDLYAEKTPHWNETNPNEYGGGEFNDTPWVFCMLNNFGGRLGLHGHLDNLAKN-IPAA 901
Query: 231 TSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQ---HEK---VDVKAWINQYSVRRYGRS 284
+ M G+G++ E NP++YD + E + EK +D+ W+ Y+ RRYG+
Sbjct: 902 LNSAKHMEGIGITPEASVNNPLLYDFLFETVWTDNAKEKLPVIDLDKWLKDYAKRRYGKE 961
Query: 285 VPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV 339
+ +A ++ TVY GA + V+ A P +D S +G V
Sbjct: 962 SQSAYEALLIMKDTVYKAELNMKGQGAPES---VVNARPALDIGAAST-------WGNAV 1011
Query: 340 SKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 399
+ Y +++ +A EL + ++L S+ Y YDL + +Q L+
Sbjct: 1012 -----------------ISYDKAKLEKAAELLLKDYDKLKDSDGYMYDLATMLQQVLSNS 1054
Query: 400 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
A E + A++ N+ + +FL +++ M+ + + +LLG W+E AK LA+N
Sbjct: 1055 AQEYQRKMANAFKENNKEEFNTYADKFLSIIDSMEKVTSTSKYYLLGTWVEQAKALAKN 1113
>gi|187735714|ref|YP_001877826.1| alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
BAA-835]
gi|187425766|gb|ACD05045.1| Alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
BAA-835]
Length = 852
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 124/467 (26%), Positives = 212/467 (45%), Gaps = 45/467 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ-NV-FPSAK 58
M NL G GGPL Q +++ + ++I+ R+ +LGM PVL + G VP+ Q NV K
Sbjct: 198 MGNLEGHGGPLSQQQINKMAQMGRRIVSRMEQLGMTPVLQGYVGFVPSDFQENVRIDGLK 257
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ G W + + +++D T F ++ + + K YG ++ D F E
Sbjct: 258 LIPQGEWVNFRR------PWVVDPTCEAFPKLAADWYKALRKVYGIPGKMFGGDLFHEGG 311
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
D ++ + MQ A W++Q W +P + LL+ + + +
Sbjct: 312 RKGDID--VTQAAQEVQKAMQKASPGAFWVIQAW--GGNP------TRELLSGLDPERAL 361
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
VL L ++ + F G+P++WC L NF GN MYG + ++ E ++ +V
Sbjct: 362 VLQLTKDMANGGKNLRTFNGIPWVWCELANFGGNTGMYGGVPLLSRLGSELSGYKDKGLV 421
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G EG+E NP+ Y L S+ + E + V+ W+ +Y+ +RYG + A+ A VL +
Sbjct: 422 GMGTLSEGLETNPLHYALFSDRLWTREDISVREWLGKYARQRYGFAPKAVVKALEVLSFS 481
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
+YN +I A P + + S++ +
Sbjct: 482 IYNPVRSQEGCTESIICARPSWNV------------------------RKASTWSSGERY 517
Query: 359 YSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
Y ++++A ++ + N+ L T+RYDL+D+ RQALA A + A+ D
Sbjct: 518 YHLGDIVKAARGYLKAANDQPNLVKKETFRYDLVDVVRQALADAAFYQLQQVRSAFDSGD 577
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ +RFL L+ DMD LLA FLLG W + A + +++
Sbjct: 578 LAAYRKQVKRFLSLISDMDALLATDSQFLLGTWQKRALDWGDSRQEK 624
>gi|16124795|ref|NP_419359.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus CB15]
gi|221233511|ref|YP_002515947.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus NA1000]
gi|13421729|gb|AAK22527.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus CB15]
gi|220962683|gb|ACL94039.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus NA1000]
Length = 770
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 130/487 (26%), Positives = 208/487 (42%), Gaps = 66/487 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLP +W+D++ LQ KIL R+ LGM P+LPAF G VP A P A+I
Sbjct: 202 MGNIEGYKAPLPTAWIDKKKDLQVKILGRMRSLGMTPILPAFGGYVPKAFAEKNPKARIY 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF +I F+ + +G ++ Y D+F+E PP
Sbjct: 262 RMRPWEGFHE------TYWLDPADPLFAKIAARFLALYTETFGAGTY-YLADSFNEMLPP 314
Query: 121 VD-------------------------------SPEYISSLGAAIYSGMQSGDSDAVWLM 149
++ + +++ G AIY ++ DAVW+M
Sbjct: 315 INADGADARDAAYGDGTANTAVTKTKVEVDPALKAQRLAAYGKAIYDSIRQTRPDAVWVM 374
Query: 150 QGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLH 207
QGWLF D FW P + A L+ VP KL++LD+ + P +W +K F G P+I+ +H
Sbjct: 375 QGWLFGADSHFWDPAAISAYLSLVPDDKLMILDIGNDRYPNVWKNAKAFGGKPWIYGYVH 434
Query: 208 NFAGNIEMYGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEK 266
N+ G+ +YG L P A + + G GM EG+ N +VY+ + ++A+ +
Sbjct: 435 NYGGSNPVYGDLGFYRQDIPAIAANPDAGKLAGFGMFPEGLHNNSIVYEAVYDLAWSEGQ 494
Query: 267 VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 326
W+ +Y+ RYG++ PA+ A L ++ + P
Sbjct: 495 ASPATWLTRYARARYGKTSPALDAALGQLVEAAFSTRYWS---------------PRWWK 539
Query: 327 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRY 386
G Y + +P + D P +++ A++ A +
Sbjct: 540 SKAGAYLFFKRPTATVG---------DFPQHPGDRAKLEAAVKALTALAPTYGQEPLFVL 590
Query: 387 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 446
DL D TR ++L + AY+ D L +D LL L
Sbjct: 591 DLTDATRHLATMKIDDLLQVAVAAYRRGDTAAGDAARVEIEALALSIDKLLGVQPD-TLA 649
Query: 447 PWLESAK 453
W++ A+
Sbjct: 650 TWIDEAR 656
>gi|345014586|ref|YP_004816940.1| alpha-N-acetylglucosaminidase [Streptomyces violaceusniger Tu 4113]
gi|344040935|gb|AEM86660.1| alpha-N-acetylglucosaminidase [Streptomyces violaceusniger Tu 4113]
Length = 1044
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 124/465 (26%), Positives = 212/465 (45%), Gaps = 41/465 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ +GGP+ + LD++ L ++I R+ ELGM PV P + G VP + P A+
Sbjct: 216 LQNMSEYGGPVSTALLDKRTELGRRIADRLRELGMRPVFPGYFGTVPDGFADRNPEARTV 275
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W ++ P W LD F ++ AF Q + +G + ++ D E P
Sbjct: 276 PQGDWNGLRR-PDW-----LDPRTESFRKVAAAFYRHQRELFGE-AGLFKMDLLHEGGDP 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + A+ + +++ A+W++ GW +P + LL++V +++V+
Sbjct: 329 GDVP--VPDAARAVETALRTARPGAIWVILGW--QENP------RRDLLDAVDHDRMLVV 378
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
D +++ + K + VPY + + NF G + R + +VG
Sbjct: 379 DGLSDLDTVTDREKDWGAVPYAFGTIPNFGGRTTIGAKTHMWTKRFTVWRDKPGSKLVGT 438
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
E E++P ++L SE+A++ E VD W Y+ RYG ++A+ L T Y
Sbjct: 439 AYMPEAAERDPAAFELFSELAWREEAVDRAEWFRSYAEMRYGGRDAKAREAFAALRDTAY 498
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + V A P S+T NY + T ++D
Sbjct: 499 EISSKDGRPHDSVFAARP-------SLTARSGTNYA----------THTPAFDPAGF--- 538
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+V A L + +G L S+ YR+DL D+ RQALA + +L + +AY D
Sbjct: 539 --DVAFAALLGVRAG--LRDSDAYRHDLTDIARQALANRSWQLIPQLQDAYDRKDRTAFR 594
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
L+R +L+L+ D + H FLLGPWLE AK++A +E+ ++
Sbjct: 595 TLARLWLKLMRLSDDMTGAHRRFLLGPWLEDAKRMASGDEESARL 639
>gi|365876979|ref|ZP_09416485.1| alpha-N-acetylglucosaminidase [Elizabethkingia anophelis Ag1]
gi|442587289|ref|ZP_21006107.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Elizabethkingia
anophelis R26]
gi|365755253|gb|EHM97186.1| alpha-N-acetylglucosaminidase [Elizabethkingia anophelis Ag1]
gi|442562959|gb|ELR80176.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Elizabethkingia
anophelis R26]
Length = 712
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 126/461 (27%), Positives = 202/461 (43%), Gaps = 45/461 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGGP+ + QQ LQKKIL R+ ELG+ PVL F G VP L+N AK+
Sbjct: 193 MGNLEGWGGPVSMDMMKQQAELQKKILKRMKELGIEPVLQGFYGMVPHDLKNKISEAKVI 252
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ G W P +LD T LF +I + + YG H + + F E T
Sbjct: 253 EQGKWAGEFQRPG-----ILDPTTKLFSKIADTYYTEMKNLYGEDIHYFGGEPFHEGGKT 307
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D + S I + MQ ++ W++QG W+ LL + +
Sbjct: 308 NGLDLKNVVES----IQTSMQKSYPNSTWVLQG--------WQQNPSDGLLAGLKKENTL 355
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS-ENTTM 237
+++LF E W K + G +IW + NF +YG L A+ S +
Sbjct: 356 IIELFGENTANWEKRKGYGGTSFIWSNVSNFGEKNGLYGKLQRFIDEVFRAKESIYGANL 415
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G+G+ EGI NPV YDLM ++A+ EK + W+ +Y+ RYG+ + AW
Sbjct: 416 KGIGIIPEGIFNNPVAYDLMLDIAWYSEKPILDQWLTEYTKYRYGKENQDVIQAWKEFAQ 475
Query: 298 TVYNCTDGATDK-NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
T+Y+ D + + + A P ++ + +S + +NY + KEAV
Sbjct: 476 TIYSSPDVYQEGPSESIYCARPSLNVNPVSSWGTRKRNYDQSRFKEAV------------ 523
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
++F+ + + S TY+ D D RQ A + ++ +I+A
Sbjct: 524 ------------KVFVKADTDFKDSETYQTDKTDFLRQVWANKGDVVYDELIKAIHEKKT 571
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 457
+ + +FLE++ + LL + F L L+ A+ +
Sbjct: 572 TKIQKSGHQFLEMISIQNMLLGNNRYFTLNRLLKEAEHFGE 612
>gi|374990497|ref|YP_004965992.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
gi|297161149|gb|ADI10861.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
Length = 1001
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 123/465 (26%), Positives = 204/465 (43%), Gaps = 41/465 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ G+GGP+ L +++ L +KI R+ ELGM PV P + G VP + P A+
Sbjct: 174 LQNMSGYGGPVSPELLAKRIALGQKIAERLRELGMRPVYPGYFGTVPDGFVDRNPGARTV 233
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + + P W LD F ++ AF Q + +G ++ D E
Sbjct: 234 PQGTWNGL-ARPDW-----LDPRTESFGQVAAAFYRHQQELFGECD-LFKMDLLHEGGAA 286
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P ++ A+ + +Q+ A W++ GW + + LL++V ++V+
Sbjct: 287 GDVP--VADAARAVETALQTARPGATWVILGWQAN--------PRRELLDAVNHDHMLVV 336
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
D +++ I + + VPY + + NF G + A + R + +VG
Sbjct: 337 DGLSDLDSIGDREQDWGSVPYAFGTIPNFGGRTTIGAKTHIWARRFTQWRDKPGSKLVGT 396
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
E + ++P ++L SE+A+++ VD W Y+ R G +DA+ L T Y
Sbjct: 397 AYMAEAVGRDPAAFELFSELAWRNTAVDRDEWFRTYADVRLGGRDERARDAYAALRDTAY 456
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
T + V A PDV T NY + +
Sbjct: 457 QITSSDGRPHDSVFSARPDV-------TARSGTNYATRIPA-----------------FD 492
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
++ AL + L S+ YR+DL D+ RQALA + L ++ +AY+ D
Sbjct: 493 LADFDPALAALLDVRPSLRDSDAYRHDLTDIARQALADRSWTLIPHLHDAYERKDLEAFR 552
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
L+R +L+L+ D + H GFLLGPWLE AK+LA +E + +
Sbjct: 553 TLARLWLKLMRLSDDMTGAHRGFLLGPWLEDAKRLASDEAEAAHL 597
>gi|386386798|ref|ZP_10071901.1| alpha-N-acetylglucosaminidase [Streptomyces tsukubaensis NRRL18488]
gi|385665738|gb|EIF89378.1| alpha-N-acetylglucosaminidase [Streptomyces tsukubaensis NRRL18488]
Length = 1033
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 127/471 (26%), Positives = 211/471 (44%), Gaps = 52/471 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ +GGPL ++ LD + L +KI R+ ELGM PVLP + G VP + P A++
Sbjct: 213 LQNMSEYGGPLSKTLLDARAELGRKITARLRELGMRPVLPGYFGTVPDGFADRNPGARVV 272
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W ++ P W LD +F ++ AF Q K +G ++ D E
Sbjct: 273 AQGLWNGLRR-PDW-----LDPRTTVFPKVAAAFYRHQTKLFG-ACDLFKMDLLHEGGNA 325
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + A+ +++ +AVW++ GW + +ALL++V +++++
Sbjct: 326 GDVP--VPDAARAVEKALRTARPNAVWVILGWQSN--------PRRALLDAVDKRRMLIV 375
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
D +++ ++ G PY + + NF G + D R + +VG
Sbjct: 376 DGLSDLDTTGDRESEWGGTPYAFGTIPNFGGRTTLGANTDRWTDRFTVWRDRPGSALVGT 435
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
E E++P ++L SE+A++ E++D +AW +Y+ RYG + A+ L T Y
Sbjct: 436 AYMPEAAERDPAAFELFSELAWRRERIDREAWFTEYAQIRYGSDDASAAAAFGALAATAY 495
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
++ T+G+ Y + L S + P + +
Sbjct: 496 R-----------------------LASTDGR--PYDSHFLRRPSLTSSIGTAFDPAGFDT 530
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
A +A+G EL S+TYR+DL +L RQALA + L + A D
Sbjct: 531 ------AFAALLAAGPELRDSDTYRHDLTELARQALANRSRTLQFALRAARASKDVAAFR 584
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE----EQEKQVRC 467
+S +L+L+ D + CH FLLGPWLE AK+LA + E E+ R
Sbjct: 585 GVSALWLKLMRLADTMAGCHRSFLLGPWLEDAKRLATSPAEAVELERTARA 635
>gi|302546018|ref|ZP_07298360.1| LOW QUALITY PROTEIN: putative alpha-N-acetylglucosaminidase
[Streptomyces hygroscopicus ATCC 53653]
gi|302463636|gb|EFL26729.1| LOW QUALITY PROTEIN: putative alpha-N-acetylglucosaminidase
[Streptomyces himastatinicus ATCC 53653]
Length = 679
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 129/467 (27%), Positives = 202/467 (43%), Gaps = 45/467 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ WGGP+ + L+++ L ++I R+ ELGM PV P + G VP + P A
Sbjct: 177 MQNMSEWGGPVSTALLEKRTDLGRRIADRLRELGMRPVFPGYFGTVPDGFADRNPGAHTV 236
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G+W ++ P W LD F E+ AF Q +G ++ D E N
Sbjct: 237 PQGDWNGLRR-PDW-----LDPRTDAFHEVAAAFYRHQHDLFG-ACDLFKMDLLHEGGNA 289
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
V P+ A+ +Q+ A+W++ GW +P + LL++V ++
Sbjct: 290 GDVSVPDAAR----AVEKALQTSRPGAIWVILGW--QSNP------RRDLLDAVDHDHML 337
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
V+D +++ I K + VPY + + NF G + R + +V
Sbjct: 338 VVDGLSDLDTITDREKDWGSVPYAFGTIPNFGGRTTIGAKTHMWTERFTVWRDKPGSKLV 397
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G E +E++P Y+L SE+A++ VD AW Y+ RYG ++A+ L T
Sbjct: 398 GTAYMPEAVERDPAAYELFSELAWRDTAVDRDAWFRDYADVRYGARDAKAREAFAALRDT 457
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
Y + + V A P S+T NY + T ++D
Sbjct: 458 AYQISSKDGRPHDSVFAARP-------SLTARSGTNYA----------THTPAFD----- 495
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
+ AL + L S+ YRYDL D RQALA + +L + +AY D
Sbjct: 496 --PARFDAALAALLGVRAGLRDSDAYRYDLADTARQALANRSWQLIGQLADAYARKDLDT 553
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
LSR +L+L+ D + H LLGPWLE AK++A E+ Q+
Sbjct: 554 FRALSRLWLKLMRLSDDITGTHRLLLLGPWLEDAKRMASGAEESAQL 600
>gi|329851961|ref|ZP_08266642.1| alpha-N-acetylglucosaminidase NAGLU family protein [Asticcacaulis
biprosthecum C19]
gi|328839810|gb|EGF89383.1| alpha-N-acetylglucosaminidase NAGLU family protein [Asticcacaulis
biprosthecum C19]
Length = 731
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 135/504 (26%), Positives = 217/504 (43%), Gaps = 91/504 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLP SW+ ++ LQK+IL + ELGM P+LPAF+G VP A P A+I
Sbjct: 182 MGNIEGYQAPLPLSWIVKKRELQKRILGAMRELGMEPILPAFAGYVPKAFAESHPQARIY 241
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ F++ + YG+ Y D F+E PP
Sbjct: 242 RMRAWEGFHE------TYWLDPADPLFAKLAGRFLDLYDQTYGK-GRFYLADAFNEMLPP 294
Query: 121 V-DSP------------------------EYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 155
V D P E +++ G ++ ++S DAVW+MQGWLF
Sbjct: 295 VGDGPVEGGYGDSTANKEAVAEVDPAVKAERLAAYGQRLHDSIRSARPDAVWVMQGWLFG 354
Query: 156 YDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNI 213
D FW + A L +VP L+VLD+ + P + T++ F+G +I+ +HN+ +
Sbjct: 355 ADQGFWTGDAIAAFLRNVPDDGLMVLDIGNDRYPKVRQTAQAFHGKGWIYGYVHNYGASN 414
Query: 214 EMYGILD-------SIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEK 266
+YG L +I P R + G G+ EG++ N +VY + ++A+
Sbjct: 415 PIYGDLGFYRRDMAAITSDPARGR------LQGFGVFPEGLDSNSIVYAYLYDLAWNGGT 468
Query: 267 VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAF-----PDVD 321
+ W+ Y+ RYG S P + AW + VY R A+ PD+
Sbjct: 469 KSLSDWLAGYTRARYGISSPEVVTAWLDIVKGVYGTRYWTPRWWRSTAGAYLLCKRPDIA 528
Query: 322 PSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS 381
+ G +++ A ++ +D P L
Sbjct: 529 MADFEGAPGDRAALRAGLARLAAIR-----HDSPLL------------------------ 559
Query: 382 NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLS---RRFLELVEDMDGLLA 438
RYD+I+ TR + + + L + AY+ D + + RR ++D+ G
Sbjct: 560 ---RYDVIEFTRHLASLHLDNLIRTALVAYRDGDVAAGDRSATEVRRVTIAIDDLMGAQP 616
Query: 439 CHDGFLLGPWLESAKQLAQNEEQE 462
CH L W+E A+ ++
Sbjct: 617 CH----LAGWIEQARAYGDTATEK 636
>gi|229818803|ref|YP_002880329.1| alpha-N-acetylglucosaminidase [Beutenbergia cavernae DSM 12333]
gi|229564716|gb|ACQ78567.1| Alpha-N-acetylglucosaminidase [Beutenbergia cavernae DSM 12333]
Length = 751
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 133/408 (32%), Positives = 202/408 (49%), Gaps = 33/408 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M + +GGPLP SW +++ L ++IL R ELGM VLPAF G+VP L A+
Sbjct: 194 MGSTSSFGGPLPDSWFERRAELGRRILERQRELGMRAVLPAFGGHVPDGLGA---GARTH 250
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G FS T LL D F + F QQ + +G T H+Y D F E+ PP
Sbjct: 251 WQG--FS---------TALLGPDDDAFAVVAAEFARQQRELFG-TDHLYAADPFIESVPP 298
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PE +++ AA Y+GM + D +A W+MQ W F Y FW ++ A+ ++VP +L++
Sbjct: 299 SGEPEDLAAFAAATYAGMSAADPEATWVMQAWPFHYHRRFWTAERIAAVTDAVPRDRLLL 358
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA--FGPVEARTSENTTM 237
LDL+AE P+W + ++WC +HNF G ++G L +A G V +
Sbjct: 359 LDLWAEHAPVWDDGRGIAEHQWLWCAVHNFGGRFSVHGDLHGLARDLGGVLDDGARTGGF 418
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-----RSVPAIQDAW 292
GVGM+ME +E NPV Y+L++++ + E+ DV AW+ ++ +RYG + A+ AW
Sbjct: 419 TGVGMAMEALENNPVFYELLTDLVW--ERPDVDAWVGRFVDQRYGFADGTAARDAVHGAW 476
Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPD--VDPSIISVTEGKYQNYGKPVSKEAVLKSETS 350
+L T+Y G T ++A P V P G++ + PV A + +E
Sbjct: 477 AILLRTLYGP--GMTRSIPSPVIARPADVVAPFHTQRLAGEFLDPDAPVIVSANIDAEAD 534
Query: 351 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAK 398
P + E+ RA L + A +DL DL +A+
Sbjct: 535 ----PRVEGDLPEIARAAALLREAAGSSDAGGPLAHDLADLLTHVVAQ 578
>gi|383643231|ref|ZP_09955637.1| N-acetylglucosaminidase [Sphingomonas elodea ATCC 31461]
Length = 778
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 130/490 (26%), Positives = 205/490 (41%), Gaps = 70/490 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PL +W++++ VLQ++IL R+ LGM P+LPAFSG VP A P AKI
Sbjct: 209 MGNIAGYRAPLSANWIEKKRVLQRQILARMRSLGMKPILPAFSGYVPEAFAKAHPEAKIY 268
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W TY LD +DPLF + F++ YG Y D F+E PP
Sbjct: 269 QMRQWEGFPG------TYWLDPSDPLFARLAARFLQLYTATYG-PGEYYLADAFNEMVPP 321
Query: 121 VDS-------------------------PEYI-----SSLGAAIYSGMQSGDSDAVWLMQ 150
+ P+ + ++ G +Y + + +A W+MQ
Sbjct: 322 IAEDGSDARAATYGDAIANTAATRAAALPKEVRDARLAAYGERLYRSITAAAPNATWVMQ 381
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW P + A L+ VP ++++LD+ + P IW+ ++ FYG + + +HN
Sbjct: 382 GWLFGADKAFWTPDAIAAFLSKVPDERMLILDIGNDRYPGIWNATRAFYGKGWAYGYVHN 441
Query: 209 FAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 267
+ G+ +YG L A + + M G G+ EG+ N + Y ++A+
Sbjct: 442 YGGSNPVYGDLAFYRSDITAALANPGHGRMRGFGLFPEGLHSNGIAYAYAYDLAWGEIDA 501
Query: 268 DVK-----AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 322
K AWI Y+ RYG++ PA+ AW+ Y P
Sbjct: 502 TGKARPLDAWIGDYTRARYGKTSPALVAAWDKAIAGAYTTR---------------YWTP 546
Query: 323 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 382
G Y + P A D+P + + +E +A +
Sbjct: 547 RWWHEQAGGYLFFKFPSLDGA---------DYPAAPGDPAALRAGIEALLAQAPQHGGEP 597
Query: 383 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 442
Y YD++DL R + ++ + AY+ D + + L +D LA +
Sbjct: 598 LYTYDVVDLVRHYASVQLDDRLKTAVAAYKAGDLAAGDRATAAAERLARHIDA-LAGNQQ 656
Query: 443 FLLGPWLESA 452
LG WL A
Sbjct: 657 ETLGSWLADA 666
>gi|257067709|ref|YP_003153964.1| Alpha-N-acetylglucosaminidase (NAGLU) [Brachybacterium faecium DSM
4810]
gi|256558527|gb|ACU84374.1| Alpha-N-acetylglucosaminidase (NAGLU) [Brachybacterium faecium DSM
4810]
Length = 768
Score = 172 bits (435), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 99/323 (30%), Positives = 156/323 (48%), Gaps = 22/323 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M H G L L+ + L ++I R ELGM VLP F G +PA L + ++
Sbjct: 201 MGITHDLGAALTDEALEARAELGRRIAERERELGMTVVLPGFGGQLPAELVG---TERMI 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W + + P DPLF E + Q + G T H Y D + E+ PP
Sbjct: 258 DWQGWHNALAAP----------GDPLFAEAAASLHRHQRQLLG-TDHHYAVDPYIESLPP 306
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
SP+ ++ AI++ M+ D AVW++QGW F Y +W ++ +LL+ VP +L++
Sbjct: 307 TTSPQQLAEHAEAIFTAMRDADPQAVWILQGWPFHYRAAYWTEERVHSLLSRVPEDRLIL 366
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGIL----DSIAFGPVEARTSENT 235
LDL+ E P+W + YG ++WC+ H F G ++G L D + A
Sbjct: 367 LDLWGEHAPMWHRTAAMYGRRWLWCLAHTFGGRFGLFGDLAALDDDLRGLRTAAEAGTRG 426
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
+ G G++ E ++ N VVY+L + A + W+ ++ +RRYG + P +Q AW V+
Sbjct: 427 RLEGFGITSEALDDNAVVYELATR-ALWSPMPPRERWLEEHIIRRYGTAAPEVQQAWQVI 485
Query: 296 YHTVYNCTDGATDKNRDVIVAFP 318
HT+Y G T ++A P
Sbjct: 486 AHTLYGP--GRTRSTPSPLIARP 506
>gi|429198382|ref|ZP_19190217.1| alpha-N-acetylglucosaminidase (NAGLU) [Streptomyces ipomoeae 91-03]
gi|428665917|gb|EKX65105.1| alpha-N-acetylglucosaminidase (NAGLU) [Streptomyces ipomoeae 91-03]
Length = 747
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 119/466 (25%), Positives = 214/466 (45%), Gaps = 39/466 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL + P+ + LD + L ++I+ R+ ELGM PV P + G VP P A+
Sbjct: 220 LQNLSSFPSPVSRQLLDARAALGRRIVGRLRELGMTPVFPGYFGTVPPGFAERNPGARTV 279
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W + P W LD F + AF Q + +G S +Y D E P
Sbjct: 280 PQGDWMGF-ARPDW-----LDPRTNEFKRVAAAFYRAQDELFGGPSTLYKMDLLHEGGDP 333
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P ++ + +++ DA W++ GW + PP +A++++V +++V+
Sbjct: 334 GDVP--VADAAKGVERALRAAHPDATWVILGWQHN------PP--RAIVDAVDKKRMLVV 383
Query: 181 DLFAEVKPIWSTSKQFYG-VPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
D ++ P + +G PY + + NF G+ + A + RT + + + G
Sbjct: 384 DGLSDRFPTVIDREADWGDTPYAFGSIWNFGGHTALGANTPVWAELYEKWRTKDGSKLRG 443
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+ + E + NP + L SE+A++ +++D+K W ++++ RYG P + AW++L T
Sbjct: 444 IALMPEAADNNPAAFALFSELAWRKDELDLKTWFSEWAHARYGARDPHAEAAWDILRRTA 503
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y G T +R +EG +G S+ A+ + + L Y
Sbjct: 504 Y----GTTRADR---------------WSEGADGLFG---SRPALNTVRAARWSPKQLRY 541
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+E AL ++ L +S+ YR DL+D+ RQ L+ + L I AY D
Sbjct: 542 DAAEFEPALGELLSVRPGLRSSSAYRRDLLDVARQTLSNRSRVLLPRIRGAYDARDTARF 601
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
+L+ +L L++ +D LLA LLG W+ A+ ++ + +++
Sbjct: 602 DELTGTWLSLMDLLDRLLATDSAHLLGRWVADARAWGASDAERERL 647
>gi|333023613|ref|ZP_08451677.1| putative alpha-N-acetylglucosaminidase [Streptomyces sp. Tu6071]
gi|332743465|gb|EGJ73906.1| putative alpha-N-acetylglucosaminidase [Streptomyces sp. Tu6071]
Length = 741
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 128/476 (26%), Positives = 219/476 (46%), Gaps = 48/476 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL + P+ ++Q+ L +I+ R+ ELGM+PVLP + G VPA + P AK
Sbjct: 215 LQNLSSFPEPVTARLIEQRAALGARIVGRLRELGMSPVLPGYFGTVPAGFADRNPGAKTV 274
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD LF E+ AF E Q + YGR + +Y D E
Sbjct: 275 PQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEIQEELYGRGT-LYKMDLLHEGGSA 327
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P ++ G + +++ DAVW++ GW + PP K ++ + ++V+
Sbjct: 328 GNVPVGDATRG--VQRALRAARPDAVWVILGWQKN------PP--KEVVAAADREAMLVV 377
Query: 181 D----LFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM-YGILDSIAFGPVEARTSENT 235
D F+EV + G PY + + NF G+ + D + P R +
Sbjct: 378 DGLSDRFSEVN---DRESDWQGTPYAFGSIWNFGGHTALGANARDWVDLYP-RWRDRSGS 433
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
+ G+ + E + NP ++L +E+ + VD+ W +Y+ RYG S + AW++L
Sbjct: 434 RLSGIALMPEAADNNPAAFELFAELPWTEGPVDLTDWFREYARVRYGGSDAHAEAAWDIL 493
Query: 296 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
T Y D+ + P++ +V+ GK+ L+ +S++
Sbjct: 494 RTTAYGTRR--DDRWSEPADGLFGARPALDAVSAGKW--------SPKALRYPAASFEP- 542
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
AL+ +A EL S TYR DL+D+ RQALA + L + AYQ +
Sbjct: 543 -----------ALDELLAVRAELRDSATYRRDLLDVARQALANRSRTLLPRLAAAYQAKN 591
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVRCPYVS 471
+L RR++ L++ ++ L+A + LLG W+ESA+ + ++ Q++ +S
Sbjct: 592 QAEFARLGRRWIALMDLLEQLVATDENHLLGRWVESARAWGGSAREKSQLQYDALS 647
>gi|456388164|gb|EMF53654.1| alpha-N-acetylglucosaminidase [Streptomyces bottropensis ATCC
25435]
Length = 732
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 121/466 (25%), Positives = 207/466 (44%), Gaps = 40/466 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL + P+ + LD + VL ++I R+ ELGM PV P + G VPA P A+
Sbjct: 206 LQNLSAFPSPVSRQLLDARAVLGRRIADRVRELGMTPVFPGYFGTVPAGFAERVPGARTV 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD F + AF Q + +G +S +Y D E P
Sbjct: 266 PQGEWMGF-ARPDW-----LDPRTDDFARVAAAFYRVQEEMFGPSS-LYKMDLLHEGGDP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P ++ + ++ A W++ GW + PP +A++++V ++V+
Sbjct: 319 GDVP--VADAAKGVERALRRSRPGATWVILGWQHN------PP--RAIVDAVDKQHMLVV 368
Query: 181 DLFAEVKPIWSTSKQFYG-VPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
D ++ P + + +G PY + + NF G+ + A + RT + + + G
Sbjct: 369 DGLSDRFPTVTDREADWGDTPYAFGSIWNFGGHTALGANTPDWAALYEKWRTKDGSRLHG 428
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+ + E + NP + L SE+A++ ++D+K W +++ RYG P + AW++L T
Sbjct: 429 IALMPEAADNNPAAFALFSELAWREGELDLKTWFAEWAHARYGGRDPHAEAAWDILRRTA 488
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T S +EG +G S+ A+ + L Y
Sbjct: 489 YGTT-------------------RADSWSEGADGLFG---SRPALNAVRAGRWSPKQLRY 526
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
++ AL + EL AS+ YR DL+D+ RQAL+ + + I AY DA +
Sbjct: 527 DAADFEPALGEMLRVRPELRASSAYRRDLLDVARQALSNRSRVMLPQIKAAYDAKDATRL 586
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
SR +L L++ +D L+A LLG W+ A+ + ++
Sbjct: 587 AAASRDWLSLMDLLDELVATDSRHLLGRWVADARSWGAGAAERTEL 632
>gi|453051703|gb|EME99203.1| alpha-N-acetylglucosaminidase [Streptomyces mobaraensis NBRC 13819
= DSM 40847]
Length = 763
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 120/474 (25%), Positives = 204/474 (43%), Gaps = 44/474 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGP+ ++ LD++ L ++I R+ ELG+ PVLP ++G VP A+
Sbjct: 239 MQNMSAFGGPVSRALLDRRTALAQRITRRLRELGITPVLPGYAGTVPPDFTRRNKGARTV 298
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W P W LD F + R + Q + YG S +Y D E P
Sbjct: 299 PQGDWAGFPR-PDW-----LDPRTAHFARVARTYYRVQRELYG-ASSMYKIDLLHEGGTP 351
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
P + + A+ +++ DA W + GW + + +L++V K++VL
Sbjct: 352 --GPVPVGAAAKAVEKALRAAHPDATWAILGWQTN--------PRREILDAVDRSKMLVL 401
Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
D + P + K + G PY + + NF G+ M RT + + + G
Sbjct: 402 DGIPDHYPRVTDREKDWGGTPYAFGTIWNFGGHTAMGANTQDWVSLFHRWRTKKGSALRG 461
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+ + E + NP L S++A+ ++D+K W ++ V+RYG + P + AW+VL T
Sbjct: 462 IALMPEAADNNPAALALFSDLAWTEGRLDLKDWFARWPVQRYGAADPNARRAWDVLRRTA 521
Query: 300 YNCT--DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
Y T DG ++ + A PD+ V++ A YD
Sbjct: 522 YGTTRADGWSEAADGLFGARPDL-----------------AVNRAAAWSPRQLRYD---- 560
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
+ AL +A L S+ YR DL D+ RQ ++ + L I AY D
Sbjct: 561 ---AAAFDEALPALLAVAPALRGSSAYRCDLTDVARQCVSNRSRLLLPRIKAAYDAGDRT 617
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVRCPYVS 471
L+R++L+ + ++ +A + LLG W+ A+ + ++ VS
Sbjct: 618 RFRTLTRQWLDWMTLLEETVATSERHLLGRWIAEARAWGGTAAERDRLEHDAVS 671
>gi|318057780|ref|ZP_07976503.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SA3_actG]
Length = 741
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 124/473 (26%), Positives = 219/473 (46%), Gaps = 42/473 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL + P+ ++Q+ L +I+ R+ ELGM+PVLP + G VPA + P AK
Sbjct: 215 LQNLSSFPEPVTARLIEQRAALGARIVGRLRELGMSPVLPGYFGTVPAGFADRNPGAKTV 274
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD LF E+ AF E Q + YGR + +Y D E
Sbjct: 275 PQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEIQEELYGRGT-LYKMDLLHEGGSA 327
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P ++ G + +++ DAVW++ GW + PP K ++ + ++V+
Sbjct: 328 GNVPVGDATRG--VQRALRAARPDAVWVILGWQKN------PP--KEVVAAADREAMLVV 377
Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEM-YGILDSIAFGPVEARTSENTTMV 238
D ++ P + + G PY + + NF G+ + D + P R + +
Sbjct: 378 DGLSDRFPEVNDRESDWQGTPYAFGSIWNFGGHTALGANTRDWVDLYP-RWRDRSGSRLS 436
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+ + E + NP ++L +E+ + VD+ W +Y+ RYG S + AW++L T
Sbjct: 437 GIALMPEAADNNPAAFELFAELPWTEGPVDLTDWFREYARVRYGGSDAHAEAAWDILRTT 496
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
Y D+ + P++ +V+ GK+ L+ +S++
Sbjct: 497 AYGTRR--DDRWSEPADGLFGARPALDAVSAGKW--------SPKALRYPAASFEP---- 542
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
AL+ ++ EL S TYR DL+D+ RQALA + L + AY+ +
Sbjct: 543 --------ALDELLSVRAELRDSATYRRDLLDVARQALANRSRTLLPRLAAAYKAKNQAE 594
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVRCPYVS 471
+L RR++ L++ ++ L+A + LLG W+ESA+ + ++ Q++ +S
Sbjct: 595 FARLGRRWIALIDLLEQLVATDENHLLGRWVESARAWGGSAREKNQLQYDALS 647
>gi|318078904|ref|ZP_07986236.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SA3_actF]
Length = 719
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 124/473 (26%), Positives = 219/473 (46%), Gaps = 42/473 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL + P+ ++Q+ L +I+ R+ ELGM+PVLP + G VPA + P AK
Sbjct: 193 LQNLSSFPEPVTARLIEQRAALGARIVGRLRELGMSPVLPGYFGTVPAGFADRNPGAKTV 252
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD LF E+ AF E Q + YGR + +Y D E
Sbjct: 253 PQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEIQEELYGRGT-LYKMDLLHEGGSA 305
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P ++ G + +++ DAVW++ GW + PP K ++ + ++V+
Sbjct: 306 GNVPVGDATRG--VQRALRAARPDAVWVILGWQKN------PP--KEVVAAADREAMLVV 355
Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEM-YGILDSIAFGPVEARTSENTTMV 238
D ++ P + + G PY + + NF G+ + D + P R + +
Sbjct: 356 DGLSDRFPEVNDRESDWQGTPYAFGSIWNFGGHTALGANTRDWVDLYP-RWRDRSGSRLS 414
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+ + E + NP ++L +E+ + VD+ W +Y+ RYG S + AW++L T
Sbjct: 415 GIALMPEAADNNPAAFELFAELPWTEGPVDLTDWFREYARVRYGGSDAHAEAAWDILRTT 474
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
Y D+ + P++ +V+ GK+ L+ +S++
Sbjct: 475 AYGTRR--DDRWSEPADGLFGARPALDAVSAGKW--------SPKALRYPAASFEP---- 520
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
AL+ ++ EL S TYR DL+D+ RQALA + L + AY+ +
Sbjct: 521 --------ALDELLSVRAELRDSATYRRDLLDVARQALANRSRTLLPRLAAAYKAKNQAE 572
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVRCPYVS 471
+L RR++ L++ ++ L+A + LLG W+ESA+ + ++ Q++ +S
Sbjct: 573 FARLGRRWIALIDLLEQLVATDENHLLGRWVESARAWGGSAREKNQLQYDALS 625
>gi|302526099|ref|ZP_07278441.1| alpha-N-acetylglucosaminidase [Streptomyces sp. AA4]
gi|302434994|gb|EFL06810.1| alpha-N-acetylglucosaminidase [Streptomyces sp. AA4]
Length = 860
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 117/472 (24%), Positives = 206/472 (43%), Gaps = 48/472 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQN------VF 54
+ N+ + GP+ LD ++ + KK++ R+ +LGM PVLP + G VP +
Sbjct: 200 LQNMASFTGPVSPQLLDARVAMAKKVITRLKDLGMTPVLPGYFGTVPRGFADKSKKADAS 259
Query: 55 PSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTF 114
A++ G W P W LD + ++ AF + Q +G TS +Y D
Sbjct: 260 SDARVIGQGTWVGFDR-PDW-----LDPRTSSYRKVAAAFYQAQHDLFGDTS-MYKMDLL 312
Query: 115 DENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
E D P + + + +Q+ A W++ GW + PP +A++++V
Sbjct: 313 HEGGKSGDVP--VGDAARGVMTALQTARPGATWVLLGWQNN------PP--RAIVDAVDK 362
Query: 175 GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN 234
KL V+D ++ Q+ PY + ++NF G+ + + RT +
Sbjct: 363 SKLFVVDGLSDRYGQRDPDSQWNNTPYAFGTIYNFGGHTTIGANTGVWTQRFPQWRTKQG 422
Query: 235 TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNV 294
+ + G+ EG NP ++L +E+A++ + AW Y+ RRYG AW++
Sbjct: 423 SALTGIAYLPEGTGTNPAAFELFTELAWRQTPIHQAAWFADYASRRYGGPDTRAATAWDL 482
Query: 295 LYHTVYNC-TDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
L T Y+ G ++ + A P++D + +++
Sbjct: 483 LRQTAYSMPASGWSEAQDSLYAARPNLD------------------------AATAATWS 518
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
L Y + +AL+ + L ++ YR+DL+D+ RQAL + L I AY
Sbjct: 519 PASLRYQQATFGKALDELLNVDPALRGTDAYRFDLVDVARQALTNTSRTLLPQIKTAYTN 578
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
D L+ R++ + +D LLA FLLGPWLE+AK A + ++ ++
Sbjct: 579 RDRTQFTTLTSRWMSNMTLLDKLLATDSRFLLGPWLEAAKSWAGTDTEQARL 630
>gi|29828556|ref|NP_823190.1| alpha-N-acetylglucosaminidase [Streptomyces avermitilis MA-4680]
gi|29605660|dbj|BAC69725.1| putative alpha-N-acetylglucosaminidase [Streptomyces avermitilis
MA-4680]
Length = 728
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 119/466 (25%), Positives = 204/466 (43%), Gaps = 39/466 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL + P+ Q LD + L ++I R+ ELGM PV P + G VP + A
Sbjct: 201 LQNLSAFPDPVSQQLLDARAALGRRIANRLRELGMTPVFPGYFGTVPPGFADRNAGAHTV 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD F + AF Q + +G S Y D E P
Sbjct: 261 PQGTWMGF-ARPDW-----LDPRTEHFTRVAAAFYRIQDEMFGGASTRYKMDLLHEGGSP 314
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + + +++ AVW++ GW + PP +A++++V +++V+
Sbjct: 315 GDVP--VGDAAKGVERALRAAHPGAVWVILGWQHN------PP--RAIVDAVDKDRMLVV 364
Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
D + P + ++G PY + + NF G+ + A RT +T+ G
Sbjct: 365 DGLCDRFPKVTDREADWHGTPYAFGSIWNFGGHTTLGANTPDWASLYERWRTRPGSTLRG 424
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
V + E + NP + L SE+A++ +D++AW +++ RYG P + AW++L T
Sbjct: 425 VALLPEAADNNPAAFALFSELAWREGDLDLRAWFARWARSRYGGRDPHAEAAWDILRRTA 484
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T S +EG +G ++ ++ ++ +S+ L Y
Sbjct: 485 YGTT-------------------RADSWSEGADGLFG---ARPSLAATKAASWSPKRLRY 522
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E AL + L S+ YR DL+D+ RQAL+ + L I AY+ D
Sbjct: 523 RPEEFEPALGELLKVRPGLRGSSAYRRDLLDVARQALSNRSRVLLPQIRTAYEAKDTARF 582
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
+L+ +L L++ ++ LLA LLG W+ A+ + + ++
Sbjct: 583 DRLTGVWLALMDLLEALLATDSRHLLGRWVADARAWGASAAERDRL 628
>gi|374985456|ref|YP_004960951.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
gi|297156108|gb|ADI05820.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
Length = 1039
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 126/470 (26%), Positives = 197/470 (41%), Gaps = 48/470 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ G+GGP+ + +DQ+ L KKI+ R+ ELGM PVLP + G VP P A +
Sbjct: 214 LQNMSGFGGPVSKHLIDQRAALAKKIINRVRELGMTPVLPGYYGTVPDDFLAKNPGASLV 273
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + K P W LD LF E+ AF Q + YG +S +Y D E P
Sbjct: 274 AQGTWGAFKR-PDW-----LDPRTDLFAEVAAAFYRHQRERYGDSS-MYKMDLLHEGGNP 326
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + A+ + +Q + AVW + G W+ + +L +V ++V+
Sbjct: 327 GDVP--VGEAAKAVEAALQKAHAGAVWAILG--------WQTNPSREILGAVDKSMMLVV 376
Query: 181 DLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
D ++ + + G PY + + NF G+ + + R + + G
Sbjct: 377 DGLSDRYTTVIDRESDWDGTPYAFGSIWNFGGHTPIGANAPDWVEQYPKWRDKTGSALTG 436
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+ M EG + NP L +++A+ + + W Y+V RYG P AW + T
Sbjct: 437 IAMMPEGADNNPAAMALFTDLAWTPGAIGLDDWFASYAVSRYGGEDPHAVAAWKAIRDTA 496
Query: 300 YNCTDGATDKNRDVIVAFPD----VDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
YN + D PD PS+ +K A E YD
Sbjct: 497 YNMS------RADAWSEAPDGLFGARPSL-------------GANKAAAWGPEADRYD-- 535
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
T+ AL + L S+ Y YDL D+ RQ L+ + L I AY+ D
Sbjct: 536 -----TTAFDAALTELLQVAPGLRDSSAYAYDLADVARQVLSNRSRVLLPQIKTAYEAGD 590
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
+L++ +L ++ MD +LA LLG WL A+ ++ Q+
Sbjct: 591 RGRFDRLTKTWLSWMKLMDKVLATSGQHLLGRWLADARSWGATRAEKDQL 640
>gi|290956360|ref|YP_003487542.1| alpha-N-acetylglucosaminidase [Streptomyces scabiei 87.22]
gi|260645886|emb|CBG68977.1| putative alpha-N-acetylglucosaminidase [Streptomyces scabiei 87.22]
Length = 732
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 124/466 (26%), Positives = 209/466 (44%), Gaps = 40/466 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL G+ P+ + LD + VL ++I R ELGM PV P + G VPA P A+
Sbjct: 206 LQNLSGFPSPVSRQLLDARAVLGRRIADRARELGMIPVFPGYFGTVPAGFAERVPGARTV 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD F + AF Q + +G S +Y D E P
Sbjct: 266 PQGRWMGF-ARPDW-----LDPRTDEFARVAAAFYRTQDEMFG-PSALYKMDLLHEGGDP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P ++ + +Q A W+M GW + PP +A++++V ++V+
Sbjct: 319 GDVP--VADAAKGVERALQRAHPGATWVMLGWQHN------PP--RAIVDAVDKQHMLVV 368
Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
D ++ P + + G PY + + NF G+ + A + RT + +T+ G
Sbjct: 369 DGLSDRFPTVTDREADWGGTPYAFGSIWNFGGHTALGANTPDWAALYEKWRTKDGSTLHG 428
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+ + E + NP + L SE+A++ ++D++ W +++ RYG P + AW++L T
Sbjct: 429 IALMPEAADNNPAAFALFSELAWREGELDLETWFAEWAHARYGARDPHAEAAWDILRRTA 488
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T S +EG +G S+ A+ + L Y
Sbjct: 489 YGTT-------------------RADSWSEGADGLFG---SRPALTAVRAGRWSPKQLRY 526
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ ++ AL + EL AS+ YR DL+D+ RQAL+ + + + AY DA +
Sbjct: 527 NAADFEPALGEMLKVRPELRASSAYRRDLLDVARQALSNRSRVMLPQLKAAYDAKDAARL 586
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
+ SR +L L++ +D L+A LLG W+ A+ A + ++
Sbjct: 587 AKGSRDWLSLMDLLDELVATDSRHLLGRWVADARSWAVGSTERTEL 632
>gi|409097333|ref|ZP_11217357.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Pedobacter agri
PB92]
Length = 724
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 127/471 (26%), Positives = 202/471 (42%), Gaps = 39/471 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQN---VFPSA 57
M NL GWGG + Q +QKK+L R+ EL ++P+L F G VP L A
Sbjct: 195 MGNLEGWGGTNSLQLMQLQSNIQKKVLSRMKELEIDPILQGFYGMVPHDLNKKVAALKDA 254
Query: 58 KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
+I GNW + + +L T+ F + + + K YG + + F E
Sbjct: 255 QIIDQGNWVFTE----FIRPAILAPTNDKFNTVADVYYSELKKLYGSDIKFFGGEPFHEG 310
Query: 118 TPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
VD I+++ ++ MQ ++ W++QGW + ALL +
Sbjct: 311 GKKGGVD----ITAVAKSVQDVMQKNFPNSTWVLQGW--------QNNPADALLAGLKKE 358
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE-N 234
++++LF E W K + G +IW + NF +YG L + S
Sbjct: 359 NTLIIELFGENTSNWEQRKGYGGTNFIWSNVSNFGEKNGLYGRLQRFLDEVYRIKQSPYK 418
Query: 235 TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNV 294
+ GVG+ EGI NPV YDLM ++A+++EK + WI Y+ RYG + DAW V
Sbjct: 419 DYLKGVGIIPEGINNNPVAYDLMLDIAWRNEKPPLDKWITDYTTYRYGSYNKDVADAWKV 478
Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
TVY+ +K + V P +E Y ++ ++ + SS+
Sbjct: 479 FTETVYSSP--VNEKGKIVYQEGP---------SESIY------CARPSLKVNPVSSWGT 521
Query: 355 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
Y T +A+ LFI + + S TY+ D D RQ +A ++ + +I A Q
Sbjct: 522 RKRNYDTKLFKQAVALFIKAETQFKNSETYQTDKTDFLRQVMADKGDQAYDELINAIQAK 581
Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
D + + + FL ++ D LL + F L WL A L + K +
Sbjct: 582 DKNAIKEKGNHFLTMILQQDSLLNNNHFFTLNRWLNQAVALGKGLPDAKNI 632
>gi|255079272|ref|XP_002503216.1| GH family 89 protein [Micromonas sp. RCC299]
gi|226518482|gb|ACO64474.1| GH family 89 protein [Micromonas sp. RCC299]
Length = 1260
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 144/510 (28%), Positives = 210/510 (41%), Gaps = 133/510 (26%)
Query: 58 KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
K +LG + K D + LD +D LF +G AF +Q ++++G T H+Y DTF E
Sbjct: 396 KDAELGKY--AKKDDSVRSVHFLDPSDALFQSLGAAFTKQLVEDFG-TDHLYLADTFREI 452
Query: 118 TPPVD--SPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPL 174
P D S ++ +GAA + M+S D A W+ Q F +P FW + ALL SV +
Sbjct: 453 RDPNDDFSETHVVRVGAATLAAMRSADPRATWVFQSDAFRRNPRFWNEGRRGALLRSVDI 512
Query: 175 GKLVVLDLFAEVKPIW-STSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA---- 229
G ++VLD AE P + F G P++WC+ HN GN+ M G L +IA GP A
Sbjct: 513 GDMLVLDSAAETDPYYLREPVHFAGQPFVWCVKHNHGGNLGMRGRLSAIATGPAAAMDSL 572
Query: 230 -----------------------------RTSENTT----------MVGVGMSMEGIEQN 250
R S T +VG G++ EG+EQN
Sbjct: 573 ASRRDGERGTTHGRGTRVGSSRRMLADNKRVSREATHGSRKVGKSQLVGFGITAEGVEQN 632
Query: 251 PVVYDLMSEMAFQHEKVDVKAWINQ--------YSVRRYGRSV----------------- 285
PVVY+L + + + VDV +++ YSVR+ +
Sbjct: 633 PVVYELAALTSQSEKGVDVDWFLSDYSRRRYGGYSVRQPAPTTLPVGTGQGAFLAGFIVG 692
Query: 286 ------------------PA-------------IQDAWNVLYHTVYNCTDGATDKN--RD 312
PA ++AW +L TVY D++ RD
Sbjct: 693 NNPIAGSPGYLGPGEWYDPAKHGEMGKEEAYDRAREAWEILGKTVYGARAKGEDEDHVRD 752
Query: 313 VIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI 372
P + +S Y + K V Y+ +I A
Sbjct: 753 ACSWQPSLRADELSP---DYFDAAKVVD------------------YAFKPLIDAAPTLR 791
Query: 373 ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVED 432
A+G A YD++D+ RQ LA+ +N L I ++ N+A + LEL++D
Sbjct: 792 ANG----AGTRVDYDIVDVGRQLLARQSNVLATQIRDSLNSNNASEAKMYGTQMLELLDD 847
Query: 433 MDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
MD LL H GFLLG ++ESAK A +E
Sbjct: 848 MDALLRSHKGFLLGNYIESAKSWAGKRNKE 877
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 38/61 (62%)
Query: 7 WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWF 66
W G P+ WL +Q LQ+ + + + GM PVLP F+G+VP A+ FP AK+ ++ NW
Sbjct: 217 WTGGRPKKWLKRQWDLQRDAVKLMRDFGMTPVLPGFNGHVPPAIARRFPEAKLRRVENWL 276
Query: 67 S 67
+
Sbjct: 277 T 277
>gi|398786493|ref|ZP_10549210.1| alpha-N-acetylglucosaminidase [Streptomyces auratus AGR0001]
gi|396993639|gb|EJJ04702.1| alpha-N-acetylglucosaminidase [Streptomyces auratus AGR0001]
Length = 1048
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 115/465 (24%), Positives = 195/465 (41%), Gaps = 41/465 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ G+GGP + ++ L ++I R+ ELGM+PVLP + G VP P A+
Sbjct: 206 LQNMSGYGGPTSSELIAKRAELGQRITGRLRELGMHPVLPGYFGTVPGGFAARNPGARTV 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + + P W LD +F + AF Q G H + D E P
Sbjct: 266 PQGTWSGL-ARPDW-----LDPRTEVFAKTAAAFYRHQEHLLGPADH-FKMDLLHEGGDP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + A+ +++ A W++ GW + + LL++V +++++
Sbjct: 319 GDVP--VPDAARAVEKALRTARPGATWVILGWQNN--------PRRDLLDAVDHDRMLIV 368
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
D ++++ + + + GVPY + + NF G + A R + + G
Sbjct: 369 DGLSDLETVTDRERDWGGVPYAFGSIPNFGGRTTIGAKTHVWAERFPAWRDKPGSRLAGT 428
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
E E++P ++L SE+A++ VD AW + Y+ RYG + A+ L + Y
Sbjct: 429 AYMPEAAERDPAAFELFSELAWRERPVDRAAWFDGYADLRYGARDKGARAAFAALGTSAY 488
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + V A PD+ V + T ++D
Sbjct: 489 EISSKDGRPHDSVFAARPDLA-----------------ARSGTVYATHTPAFD------- 524
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+ A + L S+ YR DL D RQALA + +L + +AY+ D
Sbjct: 525 PAAFDTAFAALLTVRPALRGSDAYRRDLTDTARQALANRSWQLIGQLQDAYRRKDRATFR 584
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
LS +L L+ + + H FLLGPWL A+ +A E+E ++
Sbjct: 585 ALSGLWLHLMRLSEDVTGAHRQFLLGPWLTDARAMASGPEEEARL 629
>gi|418473272|ref|ZP_13042874.1| putative alpha-N-acetylglucosaminidase, partial [Streptomyces
coelicoflavus ZG0656]
gi|371546106|gb|EHN74664.1| putative alpha-N-acetylglucosaminidase, partial [Streptomyces
coelicoflavus ZG0656]
Length = 716
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 117/462 (25%), Positives = 199/462 (43%), Gaps = 40/462 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ GP+ + ++Q+ L ++I R+ ELGM PVLP + G VP P +
Sbjct: 213 MQNMSGFAGPVSERLIEQRAALGRRIANRLRELGMTPVLPGYYGTVPPDFTARNPGGTVV 272
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD +F + +F Q + +G S +Y D E P
Sbjct: 273 PQGQWVGFER-PDW-----LDPRTGVFSRVAASFYRHQRELFG-DSTMYKMDLLHEGGRP 325
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P + A+ + +Q+ AVW + GW + P + ++++V +L+++
Sbjct: 326 GNVP--VGDAARAVMNALQTARPGAVWTLIGWQNN-------PSTQ-IIDAVDKSRLLIV 375
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
D ++ ++G PY + + NF G+ + A RT + + G+
Sbjct: 376 DGLSDRYDGLDRETAWHGAPYAFGTIPNFGGHTTVGANTAVWAERFDRWRTEPGSALAGI 435
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
EG NPV Y+L +E+A++ E VD W Y+ RRYGR P AW +L Y
Sbjct: 436 AYLPEGTGGNPVAYELFTELAWRTEPVDHSGWFAAYAERRYGRPDPHAARAWELLRTGPY 495
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ G + +D + ++ + + +S+ + Y
Sbjct: 496 SMPSGTWSEAQDSLF-----------------------TARPRLTATSAASWSPGAMRYD 532
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
V AL + L ++ YR+DL+D+ RQALA + L I AY D
Sbjct: 533 PDTVRAALAELLKVAPALRTTDAYRFDLVDVARQALANRSRSLLPEIKAAYDAGDLSRFR 592
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ + + ++ +D LLA FLLGPWL A+ + ++
Sbjct: 593 AGAAEWKDDLDLLDRLLATDSRFLLGPWLADARSWGRTAAEK 634
>gi|408676293|ref|YP_006876120.1| Alpha-N-acetylglucosaminidase [Streptomyces venezuelae ATCC 10712]
gi|328880622|emb|CCA53861.1| Alpha-N-acetylglucosaminidase [Streptomyces venezuelae ATCC 10712]
Length = 855
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 116/467 (24%), Positives = 196/467 (41%), Gaps = 44/467 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL G+ GP+ + ++ + L +I + LGM PVLP + G VP P A
Sbjct: 327 LQNLSGFAGPVSEQLIEARAALGARIARHLRSLGMTPVLPGYFGTVPPDFTARNPGAHTV 386
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W P W LD T P+F + + Q + +G S ++ D E P
Sbjct: 387 PQGRWVGF-GRPDW-----LDPTGPVFARLAAVYYRHQRQRFG-DSDMFKMDLLHEGGAP 439
Query: 121 --VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD +S+ A+ +++ A W+M GW + P ALL+ V +L+
Sbjct: 440 GTVD----VSAAAGAVQRALEAARPGATWVMLGWQLNPTP--------ALLHGVDRRRLL 487
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
++D ++ ++ G PY + + NF G+ + + ++ +
Sbjct: 488 IVDGLSDRYDELDRETRWGGTPYAFGTIPNFGGHTSIGANTGAWVSRFHAWLAKPDSALR 547
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+ E NPV + L +E+A+Q +D + W Y+ RRYG + AW L
Sbjct: 548 GIAYLPEATGTNPVAFGLFTELAWQPGPIDQQRWFAGYAARRYGGADRHAAAAWEALRLG 607
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
Y+ G+ + +D + A PS+ + T ++ S +A+
Sbjct: 608 PYSMRTGSWSEPQDSLFA---ARPSLTASTAARW-------SPKAMR------------- 644
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y + V RAL + L S+ YR+D++D+ RQAL A L I AY+ D
Sbjct: 645 YDAATVERALAELLRVAPRLRTSDAYRFDVVDVARQALTNRARVLLPRIRAAYEARDLDA 704
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
L R + E + L+ FL+GPWL +A+ + + ++
Sbjct: 705 FRALVREWGAAEELLGRLVGSDRRFLVGPWLAAARSWGADPAERDRL 751
>gi|297194750|ref|ZP_06912148.1| alpha-N-acetylglucosaminidase [Streptomyces pristinaespiralis ATCC
25486]
gi|297152431|gb|EFH31740.1| alpha-N-acetylglucosaminidase [Streptomyces pristinaespiralis ATCC
25486]
Length = 816
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 119/452 (26%), Positives = 198/452 (43%), Gaps = 49/452 (10%)
Query: 16 LDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPS--AKITQLGNWFSVKSDPR 73
++++ L ++I R+ ELGM+PVLP + G VP P A++ G W P
Sbjct: 6 IERRTELGRRITDRLRELGMHPVLPGYFGTVPDDFPGHNPGSDARVIPQGTWGGGMRRPD 65
Query: 74 WCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAA 133
W LD F ++ AF Q + +G SH + D E D P + A
Sbjct: 66 W-----LDPRTQAFSDVAAAFYRHQGELFGDVSH-FKMDLLHEGGTAGDVP--VPDAARA 117
Query: 134 IYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS 193
+ + +Q+ A W++ GW + P +L+S+ +++++D +++ +
Sbjct: 118 VETSLQTARPGATWVILGWQSNPRPV--------MLDSIDTSRVLIVDGLSDLDTVTDRE 169
Query: 194 KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVV 253
+ G PY + + NF G + D R + +VG E E++P
Sbjct: 170 ADWGGAPYAFGTIPNFGGRTTIGANTDRWTEKFTAWRDKPGSALVGTAYMPEAAERDPAA 229
Query: 254 YDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDV 313
+L SE+A++ EK+D +AW +Y+ RYG + ++A+ L T Y T
Sbjct: 230 LELFSELAWREEKIDREAWFAEYAQIRYGGVDHSAREAFAALAATAYKLTS--------- 280
Query: 314 IVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIA 373
T+G+ Y S+ L + + P + RA +A
Sbjct: 281 --------------TDGR--PYDSLFSRRPSLTTAIGTAFDP------AGFDRAFAALLA 318
Query: 374 SGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDM 433
L S+ YR+DL D+ RQALA + L L + AY+ D +S +L+++
Sbjct: 319 VRAPLRDSDAYRHDLTDVARQALANRSRTLQLALRAAYRNKDVATFRAVSALWLKVMRLS 378
Query: 434 DGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
D + CH FLLGPWLE AK+LA + E+ Q+
Sbjct: 379 DTMAGCHRQFLLGPWLEDAKRLATSPEEAVQL 410
>gi|429201402|ref|ZP_19192867.1| Tat pathway signal sequence domain protein [Streptomyces ipomoeae
91-03]
gi|428663010|gb|EKX62401.1| Tat pathway signal sequence domain protein [Streptomyces ipomoeae
91-03]
Length = 1042
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 114/467 (24%), Positives = 195/467 (41%), Gaps = 44/467 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAA-LQNVFPSAKI 59
+ NL G+GGPL +D++ L ++I R+ ELGM+PVLP + G+VP ++ A +
Sbjct: 212 LQNLSGYGGPLSPELIDRRAALGRRIADRLRELGMSPVLPGYYGHVPKEFVERNGGDAHV 271
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
G W + P W LD F ++ +F Q +G +H + D E
Sbjct: 272 VPQGVWHGFER-PDW-----LDPRTDSFAKVAASFYGHQEDVFGEAAH-FKMDLLHEGGT 324
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
D P + + +Q A W++ GW +P + LL+++ ++++
Sbjct: 325 AGDVP--VPGAAQGVERALQKARPGATWVILGW--QENP------LPELLDAIDKSRMLI 374
Query: 180 LDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
+D ++ + + + G PY + + NF G + R N+ +
Sbjct: 375 VDGVSDRYTSVTDRERDWGGTPYCFGTIPNFGGRTTIGARAHIWNEKFFAWRDKANSALA 434
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G E +++P ++L SE+A+ K+D AW + Y+ RYG + + AW L+ T
Sbjct: 435 GTAFMPEATDRDPAAFELFSELAWTPTKIDRAAWFSAYADYRYGARDDSARRAWRALHDT 494
Query: 299 VYNCTD-GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
Y +D + + A PD+ + Y L
Sbjct: 495 AYQQRAVERSDPHDSLFCARPDL------------------------AADRAAEYAPRAL 530
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y AL + L S Y+YD++D+ RQALA + + + AYQ D
Sbjct: 531 TYDPGRFDAALAGLLGVAGGLRGSAAYKYDVVDVARQALAHRSRQYLPQLRAAYQRKDLA 590
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
LS +L L+ D + + FLLGPW+ A+ LA N+ + +
Sbjct: 591 TFRALSTLWLRLMRLSDEVTGANSAFLLGPWVNDARLLATNDAERAE 637
>gi|329940646|ref|ZP_08289927.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
gi|329300707|gb|EGG44604.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
Length = 798
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 119/466 (25%), Positives = 196/466 (42%), Gaps = 49/466 (10%)
Query: 10 PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVK 69
P+ LD + VL +++ R+ ELGM PVLP + G VP A+ G W
Sbjct: 227 PVSTQLLDARAVLGRRLADRLRELGMVPVLPGYFGTVPPGFAARNRGARTVPQGTWMGFD 286
Query: 70 SDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISS 129
P W LD LF + AF Q + +G ++H Y D E D P +
Sbjct: 287 R-PDW-----LDPRTDLFARVAAAFYRVQGELFGASTH-YKMDLLHEGGTAGDVP--VGE 337
Query: 130 LGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG---------KLVVL 180
+ ++ DAVW++ GW + PP +A+L++V G +L+V+
Sbjct: 338 AAKGVERALRRARPDAVWVLLGWRHN------PP--RAILDAVASGGPDGAAGRERLLVV 389
Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
D ++ P + + GVPY + + NF G+ + A RT E + + G
Sbjct: 390 DGLSDRFPTVTDREADWGGVPYAFGSIWNFGGHTTLGANTPDWARLYEAWRTKEGSALRG 449
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+ + E + NP + L SE+ + ++D+KAW +++ RYG + AW+VL T
Sbjct: 450 IALLPEAADNNPAAFALFSELPWHEGELDLKAWFARWARSRYGAYDAHAEAAWDVLRRTA 509
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T S +EG +G ++ ++ +S+ L Y
Sbjct: 510 YGTT-------------------RADSWSEGADGLFG---ARPSLTARRAASWSPKELRY 547
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E RAL+ + L S+ YR DL+D+ RQ L+ + L I A D
Sbjct: 548 DAHEFERALDELLKVRPGLRESSAYRRDLLDVARQCLSNRSRALLPRIARACAARDVKAF 607
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
S +L L++ ++ L+ LLG W A+ +E + ++
Sbjct: 608 DAASGDWLSLMDLLERLVGTDARHLLGRWTAQARAWGADEAERDRL 653
>gi|291302495|ref|YP_003513773.1| alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
44728]
gi|290571715|gb|ADD44680.1| Alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
44728]
Length = 696
Score = 154 bits (390), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 125/464 (26%), Positives = 209/464 (45%), Gaps = 55/464 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M + G+GG + + ++++ L ++I R+ ELG+ PVLP F+G VP ++ +A I
Sbjct: 178 MGCMCGFGG-VSRRLVEERAELGRRITDRMRELGIEPVLPGFAGLVPG---DIGDTAAIP 233
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
Q G WF P W T T + E+ F +Q + G T D E +
Sbjct: 234 Q-GQWFGFDR-PAWLPT-----TTRAYAEVAEVFYAKQTERLGAT-RAQAVDLLHEGGTS 285
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD + + AA M+ D +W++Q W W P + L +
Sbjct: 286 GGVDLADATRGIAAA----MERAHDDYLWVLQAW-------WDNPLPEVLAAT----DSD 330
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
L L W +K ++G P+ L NF G ++G L IA P + +++V
Sbjct: 331 HLLLLDLTGEGWRKTKGWHGKPWARGSLTNFGGRTVLFGGLPEIAELPSLKDDPKASSLV 390
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G + E + NPVV+ L ++ ++ +D+ AW+ +Y RYG++ P AW+ L T
Sbjct: 391 GTALVEEAWQVNPVVWSLFTQTSWADGDIDLNAWVPEYVAARYGKAHPRAVRAWHGLLAT 450
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH-L 357
Y DG ++ A P +D ++ +S + PH L
Sbjct: 451 AYRSMDGRPGGAESLLCAMPSLD-------------------------ADRASMNGPHSL 485
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y + A +A+ L ++T+R+DL+D+TRQ ++ A L + AY + +
Sbjct: 486 PYPAEALEVAWRDLLAAREALGGADTFRFDLVDVTRQVISNRARPLLPLLRTAYAMKELD 545
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
LS F++L E +D +LA + FL+G WL A+ LA +E++
Sbjct: 546 RFIALSHSFIDLFELLDPVLATREEFLVGRWLADARALAADEDE 589
>gi|291301158|ref|YP_003512436.1| alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
44728]
gi|290570378|gb|ADD43343.1| Alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
44728]
Length = 734
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 116/466 (24%), Positives = 203/466 (43%), Gaps = 44/466 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL + GP+ Q LD + L ++I R+ ELG+ PVLP + G +P A+
Sbjct: 209 MQNLSAFPGPISQHLLDSRAELARRIRTRMAELGIRPVLPGYFGTIPGGFAKRNQQARTV 268
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W+ S P W LD T F ++ +F Q + G + +Y D E P
Sbjct: 269 PQGVWYGF-SRPDW-----LDPTGNEFAKVAASFYRHQAQLLGE-ADMYKMDLMHEGGDP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
P ++ G A+ +Q A W+M GW R +L + +++++
Sbjct: 322 GGIPIPDAAKGVAL--ALQRARPGATWVMLGW--------RKNPRTDILTDIDTSRVLIV 371
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
D ++ + G PY + + NF G+ + A + RT+ ++ + G+
Sbjct: 372 DGISDRFDDLDREHTWPGTPYAFGTIPNFGGHTTIGANAKVWAKRFGQWRTAPDSAVSGI 431
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
EG ++P ++L +E+A++ + +D+ W Y+ RRYG + + AW+ L + Y
Sbjct: 432 AWMPEGAGRDPAAFELFAELAWR-DSIDLGEWFADYADRRYGGADDNARTAWDALRRSAY 490
Query: 301 NCTDGATDKNRDVIV-AFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
G + D + A P +D VT Y + L Y
Sbjct: 491 AMPSGRWAEAADGLFGARPGLD-----VTHADY-------------------FSPEFLRY 526
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ +AL + L ++ YR+DL+D+ RQ+L EL + A+ +
Sbjct: 527 DAAVFAQALPALLDVDKSLH-NDAYRFDLVDVARQSLVNAGRELLPRVKSAFVNQNKKQF 585
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
+ +R +L+ + +D LL FLLGPWLE+A++ A+ ++ K +
Sbjct: 586 DKHTRTWLDWMRLLDRLLETDRRFLLGPWLEAARRSARTADEAKDL 631
>gi|329934959|ref|ZP_08285000.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
gi|329305781|gb|EGG49637.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
Length = 1017
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 113/468 (24%), Positives = 194/468 (41%), Gaps = 46/468 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPS-AKI 59
+ NL+G+GGPL + ++ L ++I R+ LGM PVLP + G+VP + A +
Sbjct: 191 LQNLYGYGGPLSAELIARRAALGRRIADRLRALGMRPVLPGYYGHVPKDFADRRGGDAHV 250
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
G W P W LD F E+ +F Q +G + + D E
Sbjct: 251 VPQGTWHGFDR-PSW-----LDPRTDAFAEVAASFYRHQEDVFG-PAGDFKMDLLHEGGT 303
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
D P + + +++ A W++ GW +P + LL++V ++++
Sbjct: 304 AGDVP--VPDAARGVEKALRAARPGATWVILGW--EANP------LPELLDAVDKKRMLI 353
Query: 180 LDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTM 237
+D ++ + + + G PY + + NF G + G I A R + +
Sbjct: 354 VDGVSDRYTSVTDREEDWGGTPYAFGTIPNFGGRTTI-GARTHIWREKFFAWRDKPGSAL 412
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G E +++P ++L SE+A+ E VD W Y+ RYG + AW L+
Sbjct: 413 AGTAYLPEAADRDPAAFELFSELAWTDEPVDRARWFTGYADFRYGGRDAGARRAWRALHD 472
Query: 298 TVYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
T Y + +D + + A PD+ + + Y
Sbjct: 473 TAYQQHANERSDPHDSLFCARPDL------------------------AATRAARYAPAA 508
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
L Y + AL +A YRYDL+D+ RQALA + + + A+ DA
Sbjct: 509 LTYDPARFDAALSGLLAVAAHRRGGAAYRYDLVDVARQALAHRSRQYLPQLKAAFDREDA 568
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
L+ ++L L+ + + H FLLGPW+E A+++A N + +
Sbjct: 569 ATFKALATQWLTLMRLSEDITGTHPAFLLGPWIEDARRMATNPRERAE 616
>gi|169351438|ref|ZP_02868376.1| hypothetical protein CLOSPI_02218 [Clostridium spiroforme DSM 1552]
gi|169291660|gb|EDS73793.1| F5/8 type C domain protein [Clostridium spiroforme DSM 1552]
Length = 1762
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 122/473 (25%), Positives = 209/473 (44%), Gaps = 67/473 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GGP+ W+ +L + ++ LGM VL ++G VP + + +I
Sbjct: 816 MDNMEVIGGPVSDEWVKGRLEMARENQRWKNSLGMQTVLQGYAGMVPNNFTD-YQDVEIL 874
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP- 119
+ GNW V PR ++ L+ + + F E Q +G+TS+ Y D F E
Sbjct: 875 EQGNWCGV---PR---PDMIRTDGELYDQYAKLFYEAQEWAFGKTSNYYAVDPFHEGGKR 928
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKAL--LNSVPLGKL 177
P D + + + + + + D +AVW++Q W W P L + +
Sbjct: 929 PSDLTDDV--ISREVLNSLLEYDQEAVWMVQAW-------WSNPTNDLLKGMGDDREDHV 979
Query: 178 VVLDLFA------------EVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
++LDL E S +F ++WCML N+ GN M G I
Sbjct: 980 IILDLNGLNDAYDSYWDKTEYNGTVLESDEFNSTSWVWCMLENYGGNPSMDGRPKEI-IN 1038
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+ +++ M G+G E NP++Y+L+ +MA+Q + +D+ W+++Y +RRYG
Sbjct: 1039 RINKASTQAEHMKGIGFISEATYDNPMIYELLLDMAWQQDTIDLDDWLDEYVLRRYGDYS 1098
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
+ +AW++L TVY+ + TD ++A DPS++ YG P
Sbjct: 1099 ESAGEAWDILLKTVYSRSGKTTD-----VIARS--DPSLVQ--------YGLP------- 1136
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
Y+ SE+ ALEL ++LSAS YRYDL ++ RQ + YA
Sbjct: 1137 -------------YTASELEEALELLYKDYDKLSASEAYRYDLTEIMRQVVNNYAVVRLG 1183
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
++ AY + L ++L ++ ++ + L+G W+ A A++
Sbjct: 1184 DLKTAYDAKEIDNFKSLKEQYLNAIDLLNEVCGTQQDLLIGEWVGRAVDWAKD 1236
>gi|29832531|ref|NP_827165.1| alpha-N-acetylglucosaminidase [Streptomyces avermitilis MA-4680]
gi|29609651|dbj|BAC73700.1| putative alpha-N-acetylglucosaminidase, secreted [Streptomyces
avermitilis MA-4680]
Length = 1038
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 114/468 (24%), Positives = 194/468 (41%), Gaps = 46/468 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAA-LQNVFPSAKI 59
+ NL G+GGPL + ++ L ++I R+ LGM PVLP + G+VP ++ A +
Sbjct: 212 LQNLSGYGGPLSPELIAERAGLGRRICDRLRALGMAPVLPGYYGHVPKGFVERNGGDAHV 271
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
G W + P W LD F + ++F Q +G+ +H + D E
Sbjct: 272 VPQGIWHGFER-PDW-----LDPRTASFAAVAKSFYRHQKDVFGKAAH-FKMDLLHEGGT 324
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
D P + + +Q+ A W++ GW +P + ALL+++ K+++
Sbjct: 325 AGDVP--VPGAARGVEKALQAAHPGATWVILGW--EANP------LPALLDAIDKKKMLI 374
Query: 180 LDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
+D ++ + K + G PY + + NF G + R + +
Sbjct: 375 VDGVSDRYTSVTDREKDWGGTPYAFGTIPNFGGRTTIGARAHLWNEKFFAWRDKAGSALA 434
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G E +++P ++L SE+A+ K+D AW + Y+ RYG + Q AW L+ T
Sbjct: 435 GTAYLPEAADRDPAAFELFSELAWSAGKIDRAAWFSSYADFRYGGRDASAQKAWRALHDT 494
Query: 299 VYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
Y +D + + A PD+ + + Y L
Sbjct: 495 AYQQHAVERSDAHDSLFCARPDL------------------------AANRAAEYAPRAL 530
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y AL + L S Y YDL+D+ RQALA + + +L ++ A
Sbjct: 531 TYDPGRFDAALSGLLGVAGGLRGSAAYTYDLVDVARQALAHRSRQ-YLPLLRAAYARKDA 589
Query: 418 GVF-QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
F L+ +L L+ D + H FLLGPW+ A+ LA + + +
Sbjct: 590 AAFTSLATLWLRLMGLSDEVTGTHPAFLLGPWINDARLLATDAGERAE 637
>gi|281423203|ref|ZP_06254116.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
gi|281402539|gb|EFB33370.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
Length = 450
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 93/267 (34%), Positives = 136/267 (50%), Gaps = 26/267 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGPLP SW +QQ LQKKIL R++E GM PVLP F G +P + +T
Sbjct: 186 MNNLEGWGGPLPDSWYNQQEALQKKILKRMHEYGMQPVLPGFCGMMPHDAKAKL-GLNVT 244
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W L TD +I + + YG+ ++ Y+ D F E
Sbjct: 245 DGGIWNGYTRPAN------LSPTDAHSDKIADLYYAELTNLYGKANY-YSMDPFHETND- 296
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D S G + M+ + +A W++QGW + +P RP +K + N G L+VL
Sbjct: 297 -DEAIDYSKAGRKVMEAMKRVNPNATWVIQGW--TENP--RPQMIKNMKN----GDLLVL 347
Query: 181 DLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN 234
DLF+E +P IW K + +++CML NF N+ ++G +D + + S
Sbjct: 348 DLFSECRPMFGIPSIWKREKGYEQHDWLFCMLENFGANVGLHGRMDLLLHNFYSTKQSSP 407
Query: 235 TT--MVGVGMSMEGIEQNPVVYDLMSE 259
T + G+G +MEG E NPV+++LMSE
Sbjct: 408 NTQHLKGIGFTMEGSENNPVMFELMSE 434
>gi|293402299|ref|ZP_06646437.1| putative alpha-N-acetylglucosaminidase [Erysipelotrichaceae bacterium
5_2_54FAA]
gi|291304406|gb|EFE45657.1| putative alpha-N-acetylglucosaminidase [Erysipelotrichaceae bacterium
5_2_54FAA]
Length = 2330
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 111/463 (23%), Positives = 197/463 (42%), Gaps = 56/463 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGP+P ++ ++ L + LGMN VL ++G VP P+ +T
Sbjct: 670 MQNIETFGGPIPDQYVVDRVELARTTQRWKNSLGMNTVLQGYAGMVPTNFNEFQPNVPLT 729
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+W + + P T P + E + F E Q YG TS Y D + E
Sbjct: 730 AQKSWGGL-ARPSMIPT-----DSPYYDEYAKLFYEAQEYIYGATSDYYAVDPYHEGGT- 782
Query: 121 VDSPEYIS--SLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSV---PLG 175
PE +S ++ + + + D DAVW++Q W + LLN +
Sbjct: 783 --RPEGLSDETVAREVLNSLLDYDKDAVWVVQAW--------QSNPTDGLLNGMGEYREN 832
Query: 176 KLVVLDLFAEVKPIWS--TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE 233
++++DL W+ +F G + W +L F GN M G + ++ ++ E
Sbjct: 833 HVLIVDLIKYPIKSWTKYNKSEFKGTSWAWGLLGGFGGNPTMNGEMQTM-VNDIQTAKKE 891
Query: 234 NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWN 293
T M G+G+ E NPV+YDL+ ++A+ + + W+N+Y RRYG + ++AW
Sbjct: 892 RTHMAGLGIISEAQYDNPVLYDLIFDLAWVDDDFSLDQWLNKYIERRYGGTSDNAKEAWK 951
Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
++ + YN V F + + Q+YGK
Sbjct: 952 IMKNANYNHG-----------VRFT---AQVYGMKGKSPQDYGK---------------- 981
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
++ Y ++ A L I ++ S YRYDL ++ RQ ++ Y+ + N+I+A +
Sbjct: 982 -QNISYGADKLETAFRLLIEDYDKFKDSECYRYDLTEIMRQMVSNYSTLTYNNVIDARED 1040
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
+ + +FL+ + ++ + L G W+ A+ A
Sbjct: 1041 KNIEKFKEEKAKFLKSFDVLNDIQETQVDQLAGEWIGKAQDRA 1083
>gi|380804373|gb|AFE74062.1| alpha-N-acetylglucosaminidase precursor, partial [Macaca mulatta]
Length = 265
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 92/145 (63%), Gaps = 3/145 (2%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 124 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 183
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 184 KMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRELVKEFG-TDHIYGADTFNEMQPP 240
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDA 145
+P Y+++ A+Y M + D++A
Sbjct: 241 SSAPSYLAAATTAVYEAMIAVDTEA 265
>gi|440695019|ref|ZP_20877582.1| Tat pathway signal sequence domain protein [Streptomyces
turgidiscabies Car8]
gi|440282912|gb|ELP70302.1| Tat pathway signal sequence domain protein [Streptomyces
turgidiscabies Car8]
Length = 1050
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 120/470 (25%), Positives = 201/470 (42%), Gaps = 47/470 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAAL--QNVFPSAK 58
+ N+ G+GGP+ + ++++ L KI R+ ELGM PVLP + G VP +N +A
Sbjct: 216 LQNMSGFGGPVSRRLIEKRADLAAKITERVRELGMTPVLPGYFGTVPDEFVARNGGDAAV 275
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ Q G+W + K P W LD F E+ AF + Q + +G S +Y D E
Sbjct: 276 VPQ-GDWGAFKR-PDW-----LDPRTTAFGEVAAAFYQAQSERFG-DSTMYKMDLLHEGG 327
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
P D P + A+ + ++ AVW + GW + P + +L++V ++
Sbjct: 328 NPGDVP--VGRAAQAVEAALRKAHPGAVWAILGWQNN-------PSGE-ILDAVDKSRMF 377
Query: 179 VLDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
V+D ++ + + G PY + + NF G+ M + R E++ +
Sbjct: 378 VVDGLSDRYTTVTDRESDWGGTPYAFGSIWNFGGHTPMGANAPDWVEQYPKWRDKEDSAL 437
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G+ E + N L++++A+ +D+ W Y+V RYG P AW ++
Sbjct: 438 AGIAAMPEAADNNHAALALLTDLAWTPGTIDLDDWFASYAVSRYGAEDPHALAAWKIIGD 497
Query: 298 TVYNCT--DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
T Y + DG ++ + A P + +K A E YD
Sbjct: 498 TAYGMSRADGWSEAPDGLFGARPSLG-----------------ANKAAAWGPEADRYD-- 538
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
T+ AL + L ++ YRYDL D+ RQ L+ + L I AY D
Sbjct: 539 -----TTAFDLALTELLQVAPALRGNSAYRYDLADVARQVLSNRSRMLLPQIRAAYDTAD 593
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQV 465
+L+ +L+ + MD +LA LLG WL A+ ++ Q+
Sbjct: 594 RVRFDELTGVWLDWMRLMDKVLATSGQHLLGRWLADARSWGATRGEKDQL 643
>gi|456390168|gb|EMF55563.1| alpha-N-acetylglucosaminidase [Streptomyces bottropensis ATCC
25435]
Length = 1042
Score = 134 bits (338), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 113/467 (24%), Positives = 198/467 (42%), Gaps = 44/467 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAA-LQNVFPSAKI 59
+ NL G+GGPL + ++ L ++I R+ ELGM+PVLP + G+VP ++ A +
Sbjct: 212 LQNLSGYGGPLSPQLIARRAGLGRRITDRLRELGMSPVLPGYYGHVPKQFVERNGGDAHV 271
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
G W + P W LD F + +F +G +H + D E
Sbjct: 272 VPQGLWHGFER-PDW-----LDPRTDSFARVAASFYGHVRDVFGAAAH-FKMDLLHEGGT 324
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
D P + + + DA+W++ GW +P + LL+++ ++++
Sbjct: 325 AGDVP--VPDAARGVERALHKAHPDAIWVILGW--QENP------LPELLDAIDRSRMLI 374
Query: 180 LDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
+D ++ + + + G PY + + NF G + R ++ +V
Sbjct: 375 VDGVSDRYASVTDRERDWGGTPYCFGTIPNFGGRTTIGARAHLWTDKFFAWRDKPDSALV 434
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G E +++P ++L SE+A+ K+D AW + Y+ RYG A + AW L+ T
Sbjct: 435 GTAYMPEATDRDPAAFELFSELAWTPGKIDRAAWFSAYADFRYGGRDDAARAAWRALHET 494
Query: 299 VYNCTD-GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
Y +D + + A PD+ + A T +YD
Sbjct: 495 AYQQRAVERSDPHDSLFCARPDL-----------------AADRAAEYAPRTLTYDPGRF 537
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
+ A L +A G + + YRYD++DL RQALA + + + A++ D
Sbjct: 538 -----DAAFAGLLDVAGGRRRNPA--YRYDVVDLARQALAHRSRQYLPQLRAAHRRKDLT 590
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
LS +L L+ D + FLLGPW+ A+ LA ++ + +
Sbjct: 591 TFRALSTLWLRLMRLSDEVTGTDGAFLLGPWVNDARLLATDDAERAE 637
>gi|84625358|ref|YP_452730.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
gi|84369298|dbj|BAE70456.1| putative N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae
MAFF 311018]
Length = 590
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 109/421 (25%), Positives = 177/421 (42%), Gaps = 72/421 (17%)
Query: 54 FPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDT 113
P A+I ++ W TY LD DPLF ++ R F+E + YG Y D
Sbjct: 46 LPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQAYG-AGEFYLADA 98
Query: 114 FDENTPPV------------------------------DSPEYISSLGAAIYSGMQSGDS 143
F+E PPV +++ G A+Y + +
Sbjct: 99 FNEMLPPVADDGSDVAAAKYGDSIANFDAARAKAVPPAQRDARLAAYGQALYRSIAQVNP 158
Query: 144 DAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPY 201
A W+MQGWLF D FW+P + A L VP +L+VLD+ + P W S+ F +
Sbjct: 159 KATWVMQGWLFGADCAFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQW 218
Query: 202 IWCMLHNFAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVGMSMEGIEQNPVVYDLM 257
I+ +HN+ + +YG +AF ++A ++ + G G+ EG+ N VVY+ +
Sbjct: 219 IYGYVHNYGASNPLYG---DVAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVVYEYL 275
Query: 258 SEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAF 317
+A++ + W+ QY RYGRS A+ AW L +Y +
Sbjct: 276 YALAWEGPQHPWSQWLAQYLRARYGRSDAALLSAWTDLGAGIYQTRYWS----------- 324
Query: 318 PDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE 377
P + G Y + +P + D P + A++ + +
Sbjct: 325 ----PRWWNTHAGAYLLFKRPTADIVNFD------DRPG---DPQRLRSAIDALLQQADR 371
Query: 378 LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGL 436
+ + YRYDLI+ R L+ A+ +++AY D A G QL+R +LV+ +D L
Sbjct: 372 YADAPLYRYDLIEDARHYLSLQADRQLQTVVQAYNAGDFARGDAQLART-TQLVQGLDAL 430
Query: 437 L 437
+
Sbjct: 431 V 431
>gi|260821254|ref|XP_002605948.1| hypothetical protein BRAFLDRAFT_132235 [Branchiostoma floridae]
gi|229291285|gb|EEN61958.1| hypothetical protein BRAFLDRAFT_132235 [Branchiostoma floridae]
Length = 673
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 93/321 (28%), Positives = 131/321 (40%), Gaps = 97/321 (30%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPLPQSW QL LQ KIL R+ N + L +++ +
Sbjct: 368 MGNIRGWGGPLPQSWHQNQLELQHKILARMR-------------NFDSTLMHLY----LD 410
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G ++ C T + +C E
Sbjct: 411 YSGGDLKTRTVAHTCWTLRI-----------------------------HCFLTLEECLL 441
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P Y+S GAA+Y+GM +GD A+WLMQGWLF FW+P Q KALL SVP G
Sbjct: 442 LSEPNYLSKAGAAVYAGMLAGDPQAIWLMQGWLFQARDFWQPAQTKALLQSVPEG----- 496
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
P AR +TMVG
Sbjct: 497 ---------------------------------------------PFLARKYLGSTMVGT 511
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDV-KAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI+QN ++Y+LM+E+A+ + + W + Y+ RYG W +L +V
Sbjct: 512 GLTPEGIDQNYIMYELMNEVAWMPQPFQILDNWASDYAWSRYGVKNSNASLGWQILLKSV 571
Query: 300 YNCTDGATDKNRDVIVAFPDV 320
Y+C +G D V+V PD+
Sbjct: 572 YDCENGFKDHCDSVVVHRPDL 592
>gi|58583545|ref|YP_202561.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae KACC 10331]
gi|58428139|gb|AAW77176.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae KACC 10331]
Length = 753
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 108/421 (25%), Positives = 178/421 (42%), Gaps = 72/421 (17%)
Query: 54 FPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDT 113
P A+I ++ W TY LD DPLF ++ R F+E + YG Y D
Sbjct: 209 LPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQAYG-AGEFYLADA 261
Query: 114 FDENTPPV------------------------------DSPEYISSLGAAIYSGMQSGDS 143
F+E PPV +++ G A+Y + +
Sbjct: 262 FNEMLPPVADDGSDVAAAKYGDSIANFDAARAKAVPPAQRDARLAAYGQALYRSIAQVNP 321
Query: 144 DAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPY 201
A W+MQGWLF D FW+P + A L VP +L+VLD+ + P W S+ F +
Sbjct: 322 KATWVMQGWLFGADCAFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQW 381
Query: 202 IWCMLHNFAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVGMSMEGIEQNPVVYDLM 257
I+ +HN+ + +YG +AF ++A ++ + G G+ EG+ N VVY+ +
Sbjct: 382 IYGYVHNYGASNPLYG---DVAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVVYEYL 438
Query: 258 SEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAF 317
+A++ + W+ QY RYGRS A+ AW L +Y
Sbjct: 439 YALAWEGPQHPWSQWLAQYLRARYGRSDAALLSAWTDLGAGIYQTR-------------- 484
Query: 318 PDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE 377
P + G Y + +P + ++ + D + A++ + +
Sbjct: 485 -YWSPRWWNTHAGAYLLFKRPTAD--IVNFDDRPGD-------PQRLRSAIDALLQQADR 534
Query: 378 LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGL 436
+ + YRYDLI+ R L+ A+ +++AY D A G QL+ R +LV+ +D L
Sbjct: 535 YADAPLYRYDLIEDARHYLSLQADRQLQTVVQAYNAGDFARGDAQLA-RTTQLVQGLDAL 593
Query: 437 L 437
+
Sbjct: 594 V 594
>gi|339238239|ref|XP_003380674.1| GDP-L-fucose synthetase [Trichinella spiralis]
gi|316976398|gb|EFV59699.1| GDP-L-fucose synthetase [Trichinella spiralis]
Length = 1203
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 75/220 (34%), Positives = 113/220 (51%), Gaps = 7/220 (3%)
Query: 89 EIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWL 148
+G + + L+ Y H Y+ D F+E P ++ ++ AIY+ M S D +VW+
Sbjct: 801 HVGNEVVWKSLENYFGLFHAYSADPFNEMVPNTFDVMFLRNVSFAIYNVMLSVDPKSVWV 860
Query: 149 MQGWLFSYDPFWRPPQ-MKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLH 207
+Q W+F W + K L +VP G ++V+DL+AE P++ FY P+IWCMLH
Sbjct: 861 LQSWMFLSSERWLENENAKHFLTAVPTGSILVVDLYAEEYPLYEKFSGFYNQPFIWCMLH 920
Query: 208 NFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF--QHE 265
NF G +YG L I + T N MVG G+SMEGI+QN VVY + + + ++
Sbjct: 921 NFGGVQGLYGNLARINQKLADVSTVSNINMVGTGLSMEGIDQNYVVYQMALDRFWSPNNQ 980
Query: 266 KVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTD 304
KVD+ AW Y G + +I AW + C +
Sbjct: 981 KVDLAAW---YIYIHLGVGITKSIYTAWGAFLQSSRTCQE 1017
>gi|402824586|ref|ZP_10873940.1| N-acetylglucosaminidase, partial [Sphingomonas sp. LH128]
gi|402261896|gb|EJU11905.1| N-acetylglucosaminidase, partial [Sphingomonas sp. LH128]
Length = 486
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 78/252 (30%), Positives = 123/252 (48%), Gaps = 39/252 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+ PL W++++ LQ +IL R+ LGM PVLPAF+G VP A P A+I
Sbjct: 223 MGNLAGYRAPLSSGWIEKKHQLQLRILARMRALGMKPVLPAFAGYVPEAFAKAHPKARIY 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD +DPLF ++ F+ + YG + Y D F+E PP
Sbjct: 283 KMRAWEGFPP------TYWLDPSDPLFTQLAARFVTLYNRTYGEGEY-YLADAFNEMIPP 335
Query: 121 V-------DSPEY-----------------------ISSLGAAIYSGMQSGDSDAVWLMQ 150
+ + EY +++ G +Y + + A W+MQ
Sbjct: 336 IAEDGSDAAAAEYGDSIANTAATRAAALPPAVRDARLAAYGERLYGSITAAAPKATWVMQ 395
Query: 151 GWLFSYDPFWRPPQ-MKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D +R P+ + A L+ VP ++++LD+ + P IW + F G + + +HN
Sbjct: 396 GWLFGADKAFRTPEAIAAFLSRVPDDRMLILDIGNDRYPGIWQKTDAFDGKAWTYGYVHN 455
Query: 209 FAGNIEMYGILD 220
+ G+ +YG L+
Sbjct: 456 YGGSNPVYGDLE 467
>gi|47212645|emb|CAF95026.1| unnamed protein product [Tetraodon nigroviridis]
Length = 121
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 56/121 (46%), Positives = 83/121 (68%), Gaps = 3/121 (2%)
Query: 20 LVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYL 79
L LQ KIL ++ GM PVLPAFSGNVP + ++P A++T+LG W K + + C+Y+
Sbjct: 4 LSLQFKILEQMRSFGMTPVLPAFSGNVPKGILRLYPEARVTRLGPW--SKFNCSFSCSYI 61
Query: 80 LDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQ 139
LD DPLF+ IG ++ Q +K++G T+HIYN DTF+E TPP P Y+S++ A+++ M
Sbjct: 62 LDPRDPLFLRIGSLYLAQVVKQFG-TNHIYNTDTFNEMTPPSSEPNYLSAVSRAVFAAMT 120
Query: 140 S 140
+
Sbjct: 121 A 121
>gi|293371910|ref|ZP_06618314.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292633156|gb|EFF51733.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
Length = 411
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 84/331 (25%), Positives = 144/331 (43%), Gaps = 38/331 (11%)
Query: 129 SLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVK 187
+ + +Y+ + + D A W+ W+F +D W +MKALL VP K+++LD E
Sbjct: 3 KIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMILLDYHCENV 62
Query: 188 PIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGI 247
+W ++ F+ PYIWC L NF GN + G + A + + G+G ++EG+
Sbjct: 63 ELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKGIGSTLEGL 122
Query: 248 EQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGAT 307
+ Y+ + E A+ + VD WI + R G +++DAW L++ +Y
Sbjct: 123 DVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDIY------- 174
Query: 308 DKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRA 367
V P T G Y +P + K ++ Y + L EV R
Sbjct: 175 -------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL----LEVWRK 214
Query: 368 LELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRR 425
L NE + + +R DLI + RQ L Y ++ + + D + +
Sbjct: 215 L-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQALKACGEK 267
Query: 426 FLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
E++ D+D L A H L W++ A+++
Sbjct: 268 MKEILNDLDKLNAFHPYCSLDKWIDDARKMG 298
>gi|297723521|ref|NP_001174124.1| Os04g0650900 [Oryza sativa Japonica Group]
gi|255675839|dbj|BAH92852.1| Os04g0650900, partial [Oryza sativa Japonica Group]
Length = 128
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 49/64 (76%), Positives = 56/64 (87%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQSWLD QL LQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 64 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 123
Query: 61 QLGN 64
LGN
Sbjct: 124 HLGN 127
>gi|358381741|gb|EHK19415.1| hypothetical protein TRIVIDRAFT_224650 [Trichoderma virens Gv29-8]
Length = 217
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 58/214 (27%), Positives = 110/214 (51%), Gaps = 11/214 (5%)
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPL-G 175
TPP Y+ + + + ++S D +A+W+ Q WLF+ + FW +++ + +
Sbjct: 2 TPPSGELNYLRNASSNTWKALKSADPEAIWVFQAWLFAQNTTFWTNDRIEVYPGGITIDS 61
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+++LD++ E W ++ +Y P+IWC L N+ I MYG + ++ P+ A E+
Sbjct: 62 DMLILDIWLESMSQWQCAQSYYSKPWIWCELQNYGATINMYGQIQNLTKSPILA-LQESQ 120
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY--GRSVPAIQDAWN 293
++VG+G+SME + N +V+DL+ A+ +D + ++ RY + +I AW
Sbjct: 121 SLVGLGLSMEAQQSNEIVFDLLLSQAWNCTPIDTNIYFKSWAAARYLSSKRPASIYTAWE 180
Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 327
+ TVY+ T N +++ + P S I V
Sbjct: 181 AVRATVYDNT------NLNMMSSVPKSRSSEIKV 208
>gi|242077446|ref|XP_002448659.1| hypothetical protein SORBIDRAFT_06g030930 [Sorghum bicolor]
gi|241939842|gb|EES12987.1| hypothetical protein SORBIDRAFT_06g030930 [Sorghum bicolor]
Length = 252
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 51/79 (64%), Positives = 62/79 (78%)
Query: 386 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 445
YDL+DLTRQ LAKYAN++FL IIE+Y+ N + V L + FL LV D+D LL+ H+GFLL
Sbjct: 51 YDLVDLTRQVLAKYANDVFLKIIESYKSNKMNQVTILCKHFLNLVNDLDTLLSSHEGFLL 110
Query: 446 GPWLESAKQLAQNEEQEKQ 464
GPWLESAK LA+N EQE Q
Sbjct: 111 GPWLESAKGLARNSEQEIQ 129
>gi|302522684|ref|ZP_07275026.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SPB78]
gi|302431579|gb|EFL03395.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SPB78]
Length = 355
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/277 (25%), Positives = 129/277 (46%), Gaps = 24/277 (8%)
Query: 196 FYGVPYIWCMLHNFAGNIEM-YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVY 254
+ G PY + + NF G+ + D + P R + + G+ + E + NP +
Sbjct: 8 WQGTPYAFGSIWNFGGHTALGANTRDWVDLYP-RWRDRSGSRLSGIALMPEAADNNPAAF 66
Query: 255 DLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVI 314
+L +E+ + VD+ W +Y+ RYG S + AW++L TVY D+ +
Sbjct: 67 ELFAELPWTEGPVDLTDWFREYARVRYGGSDAHAEAAWDILRTTVYGTRR--DDRWSEPA 124
Query: 315 VAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIAS 374
P++ +V+ GK+ L+ +S++ AL+ ++
Sbjct: 125 DGLFGARPALDAVSAGKWS--------PKALRYPAASFEP------------ALDELLSV 164
Query: 375 GNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMD 434
EL S TYR DL+D+ RQALA + L + AY+ + +L RR++ L++ ++
Sbjct: 165 RAELRDSATYRRDLLDVARQALANRSRTLLPRLAAAYKAKNQAEFARLGRRWIALIDLLE 224
Query: 435 GLLACHDGFLLGPWLESAKQLAQNEEQEKQVRCPYVS 471
L+A + LLG W+ESA+ + ++ Q++ +S
Sbjct: 225 QLVATDENHLLGRWVESARAWGGSAREKSQLQYDALS 261
>gi|328867426|gb|EGG15808.1| alpha-N-acetylglucosaminidase [Dictyostelium fasciculatum]
Length = 992
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 69/223 (30%), Positives = 109/223 (48%), Gaps = 32/223 (14%)
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
M G G++ E IEQN ++YDLM+EMA++ ++ WINQY+ RRYG VP + AWN+L
Sbjct: 219 MKGTGLTPEAIEQNYMMYDLMNEMAWRTTAPNMTEWINQYTQRRYGVFVPELAQAWNLLI 278
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
TV+N T G YG P S + + D
Sbjct: 279 PTVFNATLGY----------------------------YGPPSSFVGMRPQLNMTND--- 307
Query: 357 LWYSTSEVIRALELFIASGNELSASN-TYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
L+Y S V +A +L++ +E S T+ +D+ ++T QAL+ + + + +AY N
Sbjct: 308 LYYDPSVVQQAWQLYLGVTDEYVLSTATFSFDVSEITLQALSNLFMDTQMAMYDAYLTNQ 367
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
+ + + L ++ DMD + A L+G W +A+Q A N
Sbjct: 368 STVFEERATSCLNIITDMDTIAATQQMLLVGTWTANARQWALN 410
>gi|347541919|ref|YP_004856555.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-rat-Yit]
gi|346984954|dbj|BAK80629.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-rat-Yit]
Length = 912
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 105/448 (23%), Positives = 193/448 (43%), Gaps = 69/448 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GG L W + + L I R+ E G+ P+ F G P +
Sbjct: 364 MGNISSIGGELTPKWFEDRAKLSIDIQTRMIEFGIEPIHQMFIGYFPYKEN---SGVNVI 420
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ W +K R LD + I + ++Q + +G + + + D F E N
Sbjct: 421 RGSYWSKIKGPDR------LDFNNNDVEFISSVYYKKQKELFGESKY-FAGDLFHEGNNL 473
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
D E + + + + + +++W++Q W S P + + N + +
Sbjct: 474 YGYDPVELSNKVLKLL---IDNNGENSIWIIQSWSHS-------PSSETIEN-LNRNNTL 522
Query: 179 VLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
+LDL +++ W ++F +I+ +L+NF G +YG + +
Sbjct: 523 ILDLHSQLNTRWKGISKFNNMSWKDREFDRSNWIFGVLNNFGGRSGLYGHTRHLLNQFYD 582
Query: 229 ARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAI 288
A+ + N + GV + EGI N + +L++E+ F +K+D+ ++++Y RYG+S +
Sbjct: 583 AKYNSNY-LKGVAHTSEGIGFNNFIDELVTEIIFS-DKLDIDEFVSRYLRNRYGKSDNDL 640
Query: 289 QDAWNVLYHTVYN-----CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
A+N+L TVYN +GA++ VI A P +D K SK
Sbjct: 641 LKAFNILLDTVYNPVINIYHEGASES---VINARPSLDV--------------KSASKWG 683
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
+ H Y++ ++ AL ++ + NE S Y DLID+ + + +NE
Sbjct: 684 SI----------HKNYNSEKLEEALRIYFSKYNEFKDSKGYMTDLIDIASEVIINLSNEY 733
Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELV 430
+ N+ + Y N F+L S+RFL ++
Sbjct: 734 YKNLQDYYN-NGEIEFFKLNSQRFLNMI 760
>gi|342731751|ref|YP_004770590.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-Japan]
gi|342329206|dbj|BAK55848.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-mouse-Japan]
Length = 898
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 106/459 (23%), Positives = 200/459 (43%), Gaps = 69/459 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GG L W + + L I R+ E+G+ P+ F G P +
Sbjct: 352 MGNISAVGGELTPKWFEDRAKLSIDIQKRMLEVGIEPIHQMFIGYFPYKEN---SGVNVI 408
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W +K R LD + I + E+Q + G++ + + D F E N
Sbjct: 409 NGGYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANL 461
Query: 119 PPVDSPEYISSLGAAIYSGMQSGD-SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
D+ E L + S +++ D+VW++Q W P +++ N + +
Sbjct: 462 YGYDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNI 509
Query: 178 VVLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
++LDL +++ W +K+F +I+ +L+NF G +YG + +
Sbjct: 510 LILDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFY 569
Query: 228 EARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA 287
+A+ + + + G+ + EG+ N + +L +E+ F E V++ ++ +Y RYG+S
Sbjct: 570 DAKYNSDY-LSGIANTSEGVGFNNFIDELSTELIFSDE-VNMDEFVKRYLKNRYGKSDRD 627
Query: 288 IQDAWNVLYHTVYN-CTD----GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKE 342
+ A+N+L TVYN TD GA++ VI A P ++ +
Sbjct: 628 LLVAFNILLDTVYNPVTDIYHEGASES---VINARPSLEIN------------------- 665
Query: 343 AVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANE 402
S + H Y + ++ R +E++I+ +E + Y DLID+ + + A+E
Sbjct: 666 -----SASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASE 720
Query: 403 LFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHD 441
+ I E Y + + +S++FL L+ +L+ +D
Sbjct: 721 YYQIIQEYYNNGNIKYLQLISKKFLNLILLQANILSYND 759
>gi|417967717|ref|ZP_12608785.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-co]
gi|380340884|gb|EIA29424.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-co]
Length = 741
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 102/455 (22%), Positives = 199/455 (43%), Gaps = 61/455 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GG L W + + L I R+ E+G+ P+ F G P +
Sbjct: 195 MGNISAVGGELTPKWFEDRAKLSIDIQKRMLEVGIEPIHQMFIGYFPYKEN---SGVNVI 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W +K R LD + I + E+Q + G++ + + D F E N
Sbjct: 252 NGGYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANL 304
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDS-DAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
D+ E L + S +++ D+VW++Q W P +++ N + +
Sbjct: 305 YGYDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNI 352
Query: 178 VVLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
++LDL +++ W +K+F +I+ +L+NF G +YG + +
Sbjct: 353 LILDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFY 412
Query: 228 EARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA 287
+A+ + + + G+ + EG+ N + +L +E+ F E V++ ++ +Y RYG+S
Sbjct: 413 DAKYNSDY-LSGIANTSEGVGFNNFIDELSTELIFSDE-VNMDEFVKRYLKNRYGKSDRD 470
Query: 288 IQDAWNVLYHTVYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
+ A+N+L TVYN TD + S+I+ ++ ++
Sbjct: 471 LLVAFNILLDTVYNPVTD----------IYHEGASESVIN-------------ARPSLGI 507
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
+ S + H Y + ++ R +E++I+ +E + Y DLID+ + + A+E +
Sbjct: 508 NSASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASEYYQI 567
Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHD 441
I E Y + + +S++FL L+ +L+ +D
Sbjct: 568 IQEYYNNGNIKYLQLISKKFLNLILLQANILSYND 602
>gi|417965571|ref|ZP_12607078.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-4]
gi|380336329|gb|EIA26351.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-4]
Length = 685
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 106/459 (23%), Positives = 199/459 (43%), Gaps = 69/459 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GG L W + + L I R+ E+G+ P+ F G P +
Sbjct: 146 MGNISAVGGELTPKWFEDRAKLSIDIQKRMLEVGIEPIHQMFIGYFPYKEN---SGVNVI 202
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W +K R LD + I + E+Q + G++ + + D F E N
Sbjct: 203 NGGYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANL 255
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDS-DAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
D+ E L + S +++ D+VW++Q W P +++ N + +
Sbjct: 256 YGYDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNI 303
Query: 178 VVLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
++LDL +++ W +K+F +I+ +L+NF G +YG + +
Sbjct: 304 LILDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFY 363
Query: 228 EARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA 287
+A+ + + + G+ + EG+ N + +L +E+ F E V++ ++ +Y RYG+S
Sbjct: 364 DAKYNSDY-LSGIANTSEGVGFNNFIDELSTELIFSDE-VNMDEFVKRYLKNRYGKSDRD 421
Query: 288 IQDAWNVLYHTVYN-CTD----GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKE 342
+ A+N+L TVYN TD GA++ VI A P + +
Sbjct: 422 LLVAFNILLDTVYNPVTDIYHEGASES---VINARPSLGIN------------------- 459
Query: 343 AVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANE 402
S + H Y + ++ R +E++I+ +E + Y DLID+ + + A+E
Sbjct: 460 -----SASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASE 514
Query: 403 LFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHD 441
+ I E Y + + +S++FL L+ +L+ +D
Sbjct: 515 YYQIIQEYYNNGNIKYLQLISKKFLNLILLQANILSYND 553
>gi|384455191|ref|YP_005667784.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-mouse-Yit]
gi|418016862|ref|ZP_12656425.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-NYU]
gi|418371995|ref|ZP_12964091.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-SU]
gi|345505596|gb|EGX27892.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-NYU]
gi|346983532|dbj|BAK79208.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-mouse-Yit]
gi|380342872|gb|EIA31299.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-SU]
Length = 898
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 106/459 (23%), Positives = 199/459 (43%), Gaps = 69/459 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GG L W + + L I R+ E+G+ P+ F G P +
Sbjct: 352 MGNISAVGGELTPKWFEDRAKLSIDIQKRMLEVGIEPIHQMFIGYFPYKEN---SGVNVI 408
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W +K R LD + I + E+Q + G++ + + D F E N
Sbjct: 409 NGGYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANL 461
Query: 119 PPVDSPEYISSLGAAIYSGMQSGD-SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
D+ E L + S +++ D+VW++Q W P +++ N + +
Sbjct: 462 YGYDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNI 509
Query: 178 VVLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
++LDL +++ W +K+F +I+ +L+NF G +YG + +
Sbjct: 510 LILDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFY 569
Query: 228 EARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA 287
+A+ + + + G+ + EG+ N + +L +E+ F E V++ ++ +Y RYG+S
Sbjct: 570 DAKYNSDY-LSGIANTSEGVGFNNFIDELSTELIFSDE-VNMDEFVKRYLKNRYGKSDRD 627
Query: 288 IQDAWNVLYHTVYN-CTD----GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKE 342
+ A+N+L TVYN TD GA++ VI A P + +
Sbjct: 628 LLVAFNILLDTVYNPVTDIYHEGASES---VINARPSLGIN------------------- 665
Query: 343 AVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANE 402
S + H Y + ++ R +E++I+ +E + Y DLID+ + + A+E
Sbjct: 666 -----SASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASE 720
Query: 403 LFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHD 441
+ I E Y + + +S++FL L+ +L+ +D
Sbjct: 721 YYQIIQEYYNNGNIKYLQLISKKFLNLILLQANILSYND 759
>gi|293371911|ref|ZP_06618315.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
gi|292633157|gb|EFF51734.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
Length = 289
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 62/98 (63%), Gaps = 3/98 (3%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 184 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 243
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQ 98
LG W R C + L+ D LF +I + F+++Q
Sbjct: 244 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQ 278
>gi|326435733|gb|EGD81303.1| alpha-N-acetylglucosaminidase [Salpingoeca sp. ATCC 50818]
Length = 696
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/126 (35%), Positives = 72/126 (57%), Gaps = 2/126 (1%)
Query: 134 IYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST 192
+Y+ M D A+W+ QGW++ D M ++VP G+LV+LD+ AE IW+
Sbjct: 500 VYTTMTKRDPHAIWVYQGWIWLDLDNAQGFSFMSGFTSAVPRGRLVILDMEAEFDEIWAW 559
Query: 193 SKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPV 252
S+ F+ +IW + NF GN MYG + + F +++ +VGVG++MEGI+QNP
Sbjct: 560 SQSFFNTTFIWAAMDNFGGNNGMYGDIQ-LVFDRTRRVFAQSDAVVGVGITMEGIDQNPA 618
Query: 253 VYDLMS 258
Y ++
Sbjct: 619 YYQAIA 624
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 61/116 (52%), Gaps = 18/116 (15%)
Query: 6 GWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNW 65
G GGPLP W QQ LQ+ I+ R ELG+ +LPAF GNVPAAL ++P A I+ W
Sbjct: 246 GVGGPLPSQWYKQQWELQRAIVQRQTELGIGSLLPAFQGNVPAALAQLYPHANISN--GW 303
Query: 66 FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDT-FDENTPP 120
LD DPLF I +++ + ++G T H Y D FD +T P
Sbjct: 304 --------------LDGLDPLFATIADLTMQELIADFGAT-HFYQADGFFDHSTGP 344
>gi|440799252|gb|ELR20307.1| AlphaN-acetylglucosaminidase, putative, partial [Acanthamoeba
castellanii str. Neff]
Length = 389
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 50/132 (37%), Positives = 68/132 (51%), Gaps = 19/132 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAA----------- 49
M N+ GWGGPL +W Q LQKKI+ R GM PVLPAF+G VP A
Sbjct: 261 MGNIQGWGGPLDLAWRLAQAELQKKIVERQRMFGMLPVLPAFAGFVPEASVKFTLGRGGG 320
Query: 50 -----LQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGR 104
++ ++P+A +T+ +W ++ Y L D L+ IG I +E+G
Sbjct: 321 CGEQGIKRIYPTANLTKSADWAGFPH--QYTNVYFLSPLDSLYKTIGSKVIRLVEEEFG- 377
Query: 105 TSHIYNCDTFDE 116
T HIYN DTF+E
Sbjct: 378 TDHIYNADTFNE 389
>gi|62318937|dbj|BAD94027.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
Length = 182
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 39/77 (50%), Positives = 55/77 (71%)
Query: 388 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 447
++DLTRQ L+K AN+++ + A+ D + QLS +FLEL++DMD LLA D LLG
Sbjct: 1 MVDLTRQVLSKLANQVYTEAVTAFVKKDIGSLGQLSEKFLELIKDMDVLLASDDNCLLGT 60
Query: 448 WLESAKQLAQNEEQEKQ 464
WLESAK+LA+N ++ KQ
Sbjct: 61 WLESAKKLAKNGDERKQ 77
>gi|281423204|ref|ZP_06254117.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
gi|281402540|gb|EFB33371.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
Length = 291
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 47/185 (25%), Positives = 80/185 (43%), Gaps = 26/185 (14%)
Query: 280 RYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGK 337
RYG++ P I+ AW +L T+YNC G + + + P ++ + + K +NY
Sbjct: 4 RYGKTSPEIERAWQLLSETIYNCPAGNNQQGPHESIFCGRPSLNNFQVK-SWSKMRNY-- 60
Query: 338 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 397
Y + A +L ++ +N + YDL+D+ RQALA
Sbjct: 61 ---------------------YDLQATLEAAQLMTGIADQYKGNNNFEYDLVDICRQALA 99
Query: 398 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 457
+L I Y + + RFLE++ D LL F LG W E+A++L
Sbjct: 100 DQGRLQYLKTIADYNGFSRKAFAKDAHRFLEMILLQDKLLGTRTEFRLGHWTEAARKLGT 159
Query: 458 NEEQE 462
++++
Sbjct: 160 TQQEK 164
>gi|84625359|ref|YP_452731.1| hypothetical protein XOO_3702 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|84369299|dbj|BAE70457.1| truncated N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae
MAFF 311018]
Length = 369
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 33/62 (53%), Positives = 43/62 (69%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQQWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPKAFAQAHPHARIY 260
Query: 61 QL 62
++
Sbjct: 261 RM 262
>gi|323456608|gb|EGB12475.1| hypothetical protein AURANDRAFT_20306 [Aureococcus anophagefferens]
Length = 243
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/64 (56%), Positives = 47/64 (73%), Gaps = 1/64 (1%)
Query: 5 HGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT-QLG 63
HG GGPLP+++ D QL L K+IL R+ LG+ PVLP+F GNVP AL+++FP A IT Q
Sbjct: 120 HGVGGPLPRTFADAQLALAKRILARMRGLGIVPVLPSFQGNVPPALKDLFPEANITVQAP 179
Query: 64 NWFS 67
+W S
Sbjct: 180 HWTS 183
>gi|390353486|ref|XP_003728120.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Strongylocentrotus
purpuratus]
Length = 385
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 32/50 (64%), Positives = 38/50 (76%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAAL 50
M N+ GWGGP+PQSW QL LQ KIL R+ ELGM PVLPAF+G+VP +
Sbjct: 218 MGNIDGWGGPIPQSWHTNQLALQHKILKRMRELGMIPVLPAFAGHVPKSF 267
>gi|194695302|gb|ACF81735.1| unknown [Zea mays]
Length = 173
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 29/43 (67%), Positives = 34/43 (79%)
Query: 422 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
L + FL LV D+D LL+ H+GFLLGPWLESAK LA+N EQE Q
Sbjct: 7 LCQHFLSLVNDLDTLLSSHEGFLLGPWLESAKGLARNSEQEIQ 49
>gi|224135741|ref|XP_002322149.1| predicted protein [Populus trichocarpa]
gi|222869145|gb|EEF06276.1| predicted protein [Populus trichocarpa]
Length = 173
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/34 (76%), Positives = 29/34 (85%)
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 271
VGVGM M+GI+QNPVV DLMS+MAF H KVDVK
Sbjct: 30 VGVGMPMDGIKQNPVVSDLMSKMAFHHNKVDVKG 63
>gi|147798252|emb|CAN69797.1| hypothetical protein VITISV_036335 [Vitis vinifera]
Length = 273
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/47 (55%), Positives = 31/47 (65%), Gaps = 3/47 (6%)
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVD---VKAWINQYSVRR 280
MVGVG+ MEGIEQNPVVY+ M EMAF E V + + N + RR
Sbjct: 112 MVGVGVCMEGIEQNPVVYESMFEMAFHSENVQLVVISSTCNTMARRR 158
>gi|293369245|ref|ZP_06615835.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
gi|292635670|gb|EFF54172.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
Length = 221
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/104 (24%), Positives = 51/104 (49%), Gaps = 2/104 (1%)
Query: 355 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
P + Y +++ A L ++ + ++Y +DL+++ RQ L Y N + AY+
Sbjct: 11 PTIEYQPKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAG 68
Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
D + + E++ D+D L++CH F L W+ A+ + +
Sbjct: 69 DIPMMKNRGNKMREILADLDKLVSCHPTFSLHKWITDARDMGHD 112
>gi|321458423|gb|EFX69492.1| hypothetical protein DAPPUDRAFT_35389 [Daphnia pulex]
Length = 132
Score = 46.2 bits (108), Expect = 0.037, Method: Composition-based stats.
Identities = 21/77 (27%), Positives = 44/77 (57%)
Query: 388 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 447
++DLTRQ++ + + L+ ++E Y ++ + ++ + + L++D+D L+ FLLG
Sbjct: 1 MVDLTRQSMQEIFHLLYSKLLEVYLEKNSTAIEGIAYKMINLLQDLDELIQTGKTFLLGK 60
Query: 448 WLESAKQLAQNEEQEKQ 464
W+ AK E ++ Q
Sbjct: 61 WIADAKSWGTTEGEKLQ 77
>gi|315131339|emb|CBM69278.1| venom protein Ci-120 [Chelonus inanitus]
Length = 165
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 45/75 (60%), Gaps = 2/75 (2%)
Query: 390 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPW 448
D+TRQ+L A ++L + +++ D VF+ L +L D++ +L+ + FL+G W
Sbjct: 1 DVTRQSLQLIAEHVYLKLQQSFHQKDL-AVFKAHANLLMQLFSDLESILSTNKHFLVGKW 59
Query: 449 LESAKQLAQNEEQEK 463
+++A+ L N +++K
Sbjct: 60 IKNARSLGTNVQEQK 74
>gi|322792283|gb|EFZ16267.1| hypothetical protein SINV_02225 [Solenopsis invicta]
Length = 87
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 26/38 (68%)
Query: 426 FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
LEL +D++ +LA FLLG WL AK++A NEE+ +
Sbjct: 1 LLELFDDLESILASGSNFLLGTWLTQAKEMADNEEERR 38
>gi|417963477|ref|ZP_12605420.1| Alpha-N-acetylglucosaminidase family protein, partial [Candidatus
Arthromitus sp. SFB-3]
gi|380332800|gb|EIA23523.1| Alpha-N-acetylglucosaminidase family protein, partial [Candidatus
Arthromitus sp. SFB-3]
Length = 146
Score = 38.5 bits (88), Expect = 7.3, Method: Composition-based stats.
Identities = 23/95 (24%), Positives = 48/95 (50%)
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
+ S + H Y + ++ R +E++I+ +E + Y DLID+ + + A+E +
Sbjct: 30 NSASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASEYYQI 89
Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHD 441
I E Y + + +S++FL L+ +L+ +D
Sbjct: 90 IQEYYNNGNIKYLQLISKKFLNLILLQANILSYND 124
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.135 0.422
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,736,492,678
Number of Sequences: 23463169
Number of extensions: 325306966
Number of successful extensions: 708383
Number of sequences better than 100.0: 504
Number of HSP's better than 100.0 without gapping: 502
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 705589
Number of HSP's gapped (non-prelim): 774
length of query: 472
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 326
effective length of database: 8,933,572,693
effective search space: 2912344697918
effective search space used: 2912344697918
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)