BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 009062
(545 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255540793|ref|XP_002511461.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
gi|223550576|gb|EEF52063.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
Length = 809
Score = 935 bits (2417), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/539 (81%), Positives = 491/539 (91%), Gaps = 1/539 (0%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSGNVPAAL+N+FPSAKI +LGNWFSVKSD RWCCTYLLDATDPLFIEIGRAFIEQ
Sbjct: 271 VLPAFSGNVPAALRNIFPSAKIARLGNWFSVKSDLRWCCTYLLDATDPLFIEIGRAFIEQ 330
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
QL+EYG TSHIYNCDTFDENTPPVD P+YIS+LGAA++ GMQSGD+DAVWLMQGWLFSYD
Sbjct: 331 QLEEYGSTSHIYNCDTFDENTPPVDDPKYISALGAAVFKGMQSGDNDAVWLMQGWLFSYD 390
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
PFWRPPQMKALL+SVP+G+LVVLDLFAEVKPIW++S QFYGVPYIWCMLHNFAGN+EMYG
Sbjct: 391 PFWRPPQMKALLHSVPVGRLVVLDLFAEVKPIWTSSYQFYGVPYIWCMLHNFAGNVEMYG 450
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
ILDSIA GPVEARTSEN+TMVGVGMSMEGIEQNPVVYDLMSEMAFQH+KVDVKAWIN YS
Sbjct: 451 ILDSIASGPVEARTSENSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHKKVDVKAWINLYS 510
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RRYGRSVP+IQDAW++LYHTVYNCTDGA DKNRDVIVAFPDV+P SV++ ++ GK
Sbjct: 511 TRRYGRSVPSIQDAWDILYHTVYNCTDGAYDKNRDVIVAFPDVNPFYFSVSQKRHHLNGK 570
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
PVS+ AVLK + SYDHPHLWYSTSEV+ ALELFI SG ELS S+TY YDL+DLTRQALA
Sbjct: 571 PVSRRAVLKENSDSYDHPHLWYSTSEVLHALELFITSGEELSGSSTYSYDLVDLTRQALA 630
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
KY NELFL IIE+YQ ND +GV S++FL+LVEDMD LL CH+GFLLGPWLESAKQLAQ
Sbjct: 631 KYGNELFLKIIESYQANDGNGVASRSQKFLDLVEDMDTLLGCHEGFLLGPWLESAKQLAQ 690
Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
++EQEKQ+EWNARTQITMWFDNT++EASLL DYGNKYWSGLL+DYYGPRAAIYFKY+I+S
Sbjct: 691 DQEQEKQFEWNARTQITMWFDNTEDEASLLHDYGNKYWSGLLQDYYGPRAAIYFKYLIKS 750
Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTG-VFDH 545
LE+G F LKDWRREWIKLTN+WQ RN +PV+SNG+ALI S+WLY+KYL+ +DH
Sbjct: 751 LENGKVFPLKDWRREWIKLTNEWQRSRNKFPVKSNGNALIISKWLYDKYLRNPDTTYDH 809
>gi|225457148|ref|XP_002280399.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Vitis vinifera]
Length = 813
Score = 927 bits (2395), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 437/541 (80%), Positives = 484/541 (89%), Gaps = 4/541 (0%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSGNVPAAL+ +FPSAKIT+LGNWF+V +PRWCCTYLLDATDPLFIEIG+AFI+Q
Sbjct: 272 VLPAFSGNVPAALKYIFPSAKITRLGNWFTVGGNPRWCCTYLLDATDPLFIEIGKAFIQQ 331
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
QLKEYGRT HIYNCDTFDENTPPVD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYD
Sbjct: 332 QLKEYGRTGHIYNCDTFDENTPPVDDPEYISSLGAAIFRGMQSGDSNAIWLMQGWLFSYD 391
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
PFWRPPQMKALL+SVP+G+LVVLDLFAEVKPIW TS+QFYGVPYIWCMLHNFAGNIEMYG
Sbjct: 392 PFWRPPQMKALLHSVPMGRLVVLDLFAEVKPIWITSEQFYGVPYIWCMLHNFAGNIEMYG 451
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
ILD++A GPVEARTSEN+TMVGVGMSMEGIEQNPVVYDLMSEMAFQH KVDVK WI YS
Sbjct: 452 ILDAVASGPVEARTSENSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHSKVDVKVWIALYS 511
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE----GKYQ 303
RRYG+SVP IQDAWN+LYHTVYNCTDG+ DKNRDVIVAFPD+DPS I + G Y
Sbjct: 512 TRRYGKSVPEIQDAWNILYHTVYNCTDGSYDKNRDVIVAFPDIDPSFIPTPKLSMPGGYH 571
Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 363
YGK VS+ VLK T+S++ PHLWYSTSEV AL LFIASG +L SNTYRYDL+DLTR
Sbjct: 572 RYGKSVSRRTVLKEITNSFEQPHLWYSTSEVKDALGLFIASGGQLLGSNTYRYDLVDLTR 631
Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
QALAKYAN+LFL +IEAYQLND G S++FLELVEDMD LLACHDGFLLGPWLESAK
Sbjct: 632 QALAKYANQLFLEVIEAYQLNDVRGAACHSQKFLELVEDMDTLLACHDGFLLGPWLESAK 691
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
QLAQ+E+QE Q+EWNARTQITMWFDNT++EASLLRDYGNKYWSGLLRDYYGPRAAIYFKY
Sbjct: 692 QLAQDEQQEIQFEWNARTQITMWFDNTEDEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 751
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 543
++ESLE+G+ F LKDWRREWIKLTNDWQN RN YPV S+G+A+ TS+ LYNKYLQ ++
Sbjct: 752 LLESLETGNEFALKDWRREWIKLTNDWQNSRNAYPVRSSGNAIDTSRRLYNKYLQDPEIY 811
Query: 544 D 544
D
Sbjct: 812 D 812
>gi|224121634|ref|XP_002318632.1| predicted protein [Populus trichocarpa]
gi|222859305|gb|EEE96852.1| predicted protein [Populus trichocarpa]
Length = 812
Score = 916 bits (2368), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/538 (78%), Positives = 482/538 (89%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSGNVPAAL+N+FPSAKIT+LGNWFSV+SD RWCCTYLLDATDPLFIEIGRAFIEQ
Sbjct: 275 VLPAFSGNVPAALRNIFPSAKITRLGNWFSVRSDVRWCCTYLLDATDPLFIEIGRAFIEQ 334
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
QL EYG TSHIYNCDTFDENTPPVD PEYISSLG +I+ GMQSGDS+AVWLMQGWLFSYD
Sbjct: 335 QLTEYGSTSHIYNCDTFDENTPPVDDPEYISSLGGSIFEGMQSGDSNAVWLMQGWLFSYD 394
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
PFWRPPQ KALL+SVP+G+LVVLDLFAEVKPIW+TS+QFYGVPYIWCMLHNFAGN+EMYG
Sbjct: 395 PFWRPPQTKALLHSVPIGRLVVLDLFAEVKPIWNTSEQFYGVPYIWCMLHNFAGNLEMYG 454
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
LDS+A GPVEARTSEN+TMVGVGMSMEGIEQNPVVYDLMSEMAFQ KVDVK WI+ YS
Sbjct: 455 YLDSVASGPVEARTSENSTMVGVGMSMEGIEQNPVVYDLMSEMAFQKNKVDVKEWIDLYS 514
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RRYGRSVP IQ+AWN+LYHTVYNCTDGA DKNRDVIVAFPDV+P+++S+ +G++ K
Sbjct: 515 ARRYGRSVPTIQNAWNILYHTVYNCTDGAYDKNRDVIVAFPDVNPNLVSMLQGRHHTDVK 574
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
VS+ A L T SY+HPHLWYST+EV+RALELFIA G+ELS S+TY YDL+DLTRQ LA
Sbjct: 575 LVSRRAALIKNTDSYEHPHLWYSTTEVVRALELFIAGGDELSGSSTYSYDLVDLTRQVLA 634
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
KYANELFL +IEAY+L D+HGV S+ FL+LVED+D LLACH+GFLLGPWLESAKQLAQ
Sbjct: 635 KYANELFLKVIEAYRLKDSHGVAHQSQMFLDLVEDIDTLLACHEGFLLGPWLESAKQLAQ 694
Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
+EEQ+ Q+EWNARTQITMW+DNT+ EASLLRDYGNKYWSGLL+DYYGPRAAIYF ++ +S
Sbjct: 695 DEEQQIQFEWNARTQITMWYDNTEVEASLLRDYGNKYWSGLLKDYYGPRAAIYFNFLTQS 754
Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFDH 545
LE+G GF+LK WRREWIKLTN WQ R ++PVESNG+AL S+WLY+KYL +DH
Sbjct: 755 LENGHGFQLKAWRREWIKLTNKWQKSRKIFPVESNGNALNISRWLYHKYLGNPDTYDH 812
>gi|356519003|ref|XP_003528164.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Glycine max]
Length = 812
Score = 908 bits (2347), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 424/530 (80%), Positives = 479/530 (90%), Gaps = 1/530 (0%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSGNVPAAL+++FPSAKIT+LGNWFSVK+D +WCCTYLLDATD LF+EIG+AFIE+
Sbjct: 270 VLPAFSGNVPAALKHIFPSAKITRLGNWFSVKNDLKWCCTYLLDATDSLFVEIGKAFIEK 329
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
QL+EYGRTSHIYNCDTFDENTPPVD PEYISSLGAA + GMQSGD DAVWLMQGWLFSYD
Sbjct: 330 QLQEYGRTSHIYNCDTFDENTPPVDDPEYISSLGAATFKGMQSGDDDAVWLMQGWLFSYD 389
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
PFWRPPQMKALL+SVP+GKLVVLDLFAEVKPIW TS+QFYGVPYIWCMLHNFAGNIEMYG
Sbjct: 390 PFWRPPQMKALLHSVPVGKLVVLDLFAEVKPIWVTSEQFYGVPYIWCMLHNFAGNIEMYG 449
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
ILD+IA GP++ARTS N+TMVGVGMSMEGIEQNP+VYDLMSEMAFQH+KVDVKAW++ YS
Sbjct: 450 ILDAIASGPIDARTSNNSTMVGVGMSMEGIEQNPIVYDLMSEMAFQHKKVDVKAWVDMYS 509
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RRYG+++P IQ+ WNVLYHT+YNCTDGA DKNRDVIVAFPDVDPS+ISV + + K
Sbjct: 510 TRRYGQTLPLIQEGWNVLYHTIYNCTDGAYDKNRDVIVAFPDVDPSLISVQHEQSHHNDK 569
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
P S ++K T S+D PHLWY TSEVI ALELFI SG+ELS NTYRYDL+DLTRQ LA
Sbjct: 570 PYSG-TIIKEITDSFDRPHLWYPTSEVIYALELFITSGDELSRCNTYRYDLVDLTRQVLA 628
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
KYANELF +IEAYQ +D HG+ LS+RFL+LVED+D LLACHDGFLLGPWLESAKQLA
Sbjct: 629 KYANELFFKVIEAYQSHDIHGMTLLSQRFLDLVEDLDTLLACHDGFLLGPWLESAKQLAL 688
Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
NEEQE+Q+EWNARTQITMWFDN+ EEASLLRDYGNKYW+GLL DYYGPRAAIYFKY+ ES
Sbjct: 689 NEEQERQFEWNARTQITMWFDNSDEEASLLRDYGNKYWNGLLHDYYGPRAAIYFKYLRES 748
Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
LESG+ F+L+ WRREWIKLTN+WQ RN++PVES+GDAL TS+WL+NKYL
Sbjct: 749 LESGEDFKLRGWRREWIKLTNEWQKRRNIFPVESSGDALNTSRWLFNKYL 798
>gi|297733843|emb|CBI15090.3| unnamed protein product [Vitis vinifera]
Length = 846
Score = 890 bits (2299), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/574 (74%), Positives = 477/574 (83%), Gaps = 37/574 (6%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSGNVPAAL+ +FPSAKIT+LGNWF+V +PRWCCTYLLDATDPLFIEIG+AFI+Q
Sbjct: 272 VLPAFSGNVPAALKYIFPSAKITRLGNWFTVGGNPRWCCTYLLDATDPLFIEIGKAFIQQ 331
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
QLKEYGRT HIYNCDTFDENTPPVD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYD
Sbjct: 332 QLKEYGRTGHIYNCDTFDENTPPVDDPEYISSLGAAIFRGMQSGDSNAIWLMQGWLFSYD 391
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------- 173
PFWRPPQMKALL+SVP+G+LVVLDLFAEVKPIW TS+QFYGVPYIW
Sbjct: 392 PFWRPPQMKALLHSVPMGRLVVLDLFAEVKPIWITSEQFYGVPYIWKVTKSGRQQSLKFT 451
Query: 174 ------------------CMLHNFAGNIEMYGILDSIAFGPVEARTS-ENTTMVGVGMSM 214
CMLHNFAGNIEMYGILD++A GP+ R + +VGVGMSM
Sbjct: 452 NEKCCSFFRSHSPDSEVLCMLHNFAGNIEMYGILDAVASGPILLRAKYAESAVVGVGMSM 511
Query: 215 EGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTD 274
EGIEQNPVVYDLMSEMAFQH KVDVK WI YS RRYG+SVP IQDAWN+LYHTVYNCTD
Sbjct: 512 EGIEQNPVVYDLMSEMAFQHSKVDVKVWIALYSTRRYGKSVPEIQDAWNILYHTVYNCTD 571
Query: 275 GATDKNRDVIVAFPDVDPSIISVTE----GKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 330
G+ DKNRDVIVAFPD+DPS I + G Y YGK VS+ VLK T+S++ PHLWYS
Sbjct: 572 GSYDKNRDVIVAFPDIDPSFIPTPKLSMPGGYHRYGKSVSRRTVLKEITNSFEQPHLWYS 631
Query: 331 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 390
TSEV AL LFIASG +L SNTYRYDL+DLTRQALAKYAN+LFL +IEAYQLND G
Sbjct: 632 TSEVKDALGLFIASGGQLLGSNTYRYDLVDLTRQALAKYANQLFLEVIEAYQLNDVRGAA 691
Query: 391 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 450
S++FLELVEDMD LLACHDGFLLGPWLESAKQLAQ+E+QE Q+EWNARTQITMWFDNT
Sbjct: 692 CHSQKFLELVEDMDTLLACHDGFLLGPWLESAKQLAQDEQQEIQFEWNARTQITMWFDNT 751
Query: 451 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 510
++EASLLRDYGNKYWSGLLRDYYGPRAAIYFKY++ESLE+G+ F LKDWRREWIKLTNDW
Sbjct: 752 EDEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYLLESLETGNEFALKDWRREWIKLTNDW 811
Query: 511 QNGRNVYPVESNGDALITSQWLYNKYLQGTGVFD 544
QN RN YPV S+G+A+ TS+ LYNKYLQ ++D
Sbjct: 812 QNSRNAYPVRSSGNAIDTSRRLYNKYLQDPEIYD 845
>gi|449441031|ref|XP_004138287.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis sativus]
Length = 808
Score = 873 bits (2255), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/531 (77%), Positives = 468/531 (88%), Gaps = 2/531 (0%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSGN+PAA + ++P+AKIT+LGNWF+V SDPRWCCTYLLDA DPLF+EIG+AFIEQ
Sbjct: 273 VLPAFSGNIPAAFKQIYPAAKITRLGNWFTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQ 332
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q KEYGRTSH+YNCDTFDENTPPVD EYISSLG+AI+ GMQ+GDS+AVWLMQGW+FSYD
Sbjct: 333 QQKEYGRTSHVYNCDTFDENTPPVDDVEYISSLGSAIFGGMQAGDSNAVWLMQGWMFSYD 392
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
PFWRP QMKALL+SVPLG+LVVLDL+AEVKPIW +S+QFYG+PYIWCMLHNFAGN+EMYG
Sbjct: 393 PFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWISSEQFYGIPYIWCMLHNFAGNVEMYG 452
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
ILDSIA GP+EAR+S +TMVGVGMSMEGIEQNPVVYDLMSEMAFQH KVDVK W+ QYS
Sbjct: 453 ILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYS 512
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
VRRYG VP+IQDAW+VLYHTVYNCTDGA DKNRDVIVAFPDVDPS I V +G
Sbjct: 513 VRRYGHLVPSIQDAWDVLYHTVYNCTDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGN 572
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
S +V + + +++D PHLWY TSEVI AL+LFIA G++LS+SNTYRYDL+DLTRQALA
Sbjct: 573 LDS--SVDRLQDATFDRPHLWYPTSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALA 630
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
KY+NELF I++AYQL+D + LS+ FLELV D+D LLACH+GFLLGPWL+SAKQLA+
Sbjct: 631 KYSNELFFRIVKAYQLHDVQTMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAR 690
Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
+EE+EKQYEWNARTQITMWFDNT+EEASLLRDYGNKYWSGLL DYY PRAAIY K++ ES
Sbjct: 691 SEEEEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKES 750
Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
E+G F L +WRREWIKLTNDWQ+ R +YPVESNGDAL TS WLYNKYLQ
Sbjct: 751 SENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALDTSHWLYNKYLQ 801
>gi|222629680|gb|EEE61812.1| hypothetical protein OsJ_16433 [Oryza sativa Japonica Group]
Length = 1129
Score = 795 bits (2054), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/538 (67%), Positives = 443/538 (82%), Gaps = 6/538 (1%)
Query: 6 FAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
F VLPAFSGN+PAAL++ FPSAK+T LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FI
Sbjct: 572 FPVLPAFSGNIPAALRSKFPSAKVTHLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFI 631
Query: 66 EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
E+Q++EYG TSH+Y+CDTFDENTPP+ P YISSLGAA + GMQSGD DA+WLMQGWLFS
Sbjct: 632 EEQIREYGGTSHVYSCDTFDENTPPLSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFS 691
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
YDPFW PPQMKALL+SVP+G+++VLDL+AEVKPIW S QFYGVPYIWCMLHNFA + EM
Sbjct: 692 YDPFWEPPQMKALLHSVPVGRMIVLDLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEM 751
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG+LD +A GP++AR S N+TMVGVGMSMEGIEQNP+VYDLMSEMAF H +VD++ W+
Sbjct: 752 YGVLDMVASGPIDARLSANSTMVGVGMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVET 811
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY--- 302
Y RRYG+S+ +QDAW +LY T+YNCTDG DKNRDVIVAFPDV+P +I T G Y
Sbjct: 812 YPTRRYGKSMVGLQDAWKILYQTLYNCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSS 870
Query: 303 -QNYGKPVSKEAV-LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
+ Y +SK + + + Y+HPHLWY T VIRALELF+ G+E+S SNT+RYDL+D
Sbjct: 871 SKTYSTKLSKNYIAVDASNDEYEHPHLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVD 930
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
LTRQ LAKYAN++F+ IIE+Y+ N+ + V L + F++LV D+D LLA H+GFLLGPWLE
Sbjct: 931 LTRQTLAKYANQVFVKIIESYKSNNVNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLE 990
Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
SAK LA+++EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIY
Sbjct: 991 SAKGLARDKEQEMQYEWNARTQITMWFDNTKTKASLLRDYANKYWSGLLRDYYGPRAAIY 1050
Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
FKY+I S+E + F L++WRREWI LTN+WQ+ V+P + GDAL S+ LY KYL
Sbjct: 1051 FKYLILSMEKKEPFALEEWRREWISLTNNWQSDWKVFPTTATGDALNISRTLYKKYLH 1108
>gi|326515664|dbj|BAK07078.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 829
Score = 794 bits (2050), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/536 (67%), Positives = 447/536 (83%), Gaps = 6/536 (1%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSGN+PAAL+ FPSAK+T LGNWF+V S+PRWCCTYLLDA+DPL++EIG+ FIE+
Sbjct: 276 VLPAFSGNIPAALKLKFPSAKVTHLGNWFTVDSNPRWCCTYLLDASDPLYVEIGKLFIEE 335
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q++EYGRTSH+YNCDTFDENTPP+ P YISSLGAA + MQSGD+DA+WLMQGWLF+YD
Sbjct: 336 QIREYGRTSHVYNCDTFDENTPPLSDPNYISSLGAATFRAMQSGDNDAIWLMQGWLFTYD 395
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
PFW PPQMKALL+SVP+G+++VLDL+AEVKP+W S QFYGVPYIWCMLHNFA + EMYG
Sbjct: 396 PFWEPPQMKALLHSVPVGRMIVLDLYAEVKPVWINSDQFYGVPYIWCMLHNFAADFEMYG 455
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
+LD++A GP++AR SEN+TMVGVGMSMEGIEQNP+VYDLMSEM F H +VD+K W+ Y
Sbjct: 456 VLDAVASGPIDARLSENSTMVGVGMSMEGIEQNPIVYDLMSEMVFHHRQVDLKVWVETYP 515
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----Q 303
RRYG+SV +QDAW +L+ T+YNCTDG DKNRDVIVAFPDV+PS+I T G Y +
Sbjct: 516 TRRYGKSVVGLQDAWRILHQTLYNCTDGKNDKNRDVIVAFPDVEPSVIQ-TPGLYARTSK 574
Query: 304 NYGKPVSKEAVLK-SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 362
NY +S+ V+K + +Y+ PH+WY T VI ALELF+ SG+E+S S+T+RYDL+DLT
Sbjct: 575 NYSTMLSENYVMKDAPNDAYEQPHIWYDTIAVIHALELFLESGDEVSDSSTFRYDLVDLT 634
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
RQALAKYAN++FL II+ Y+ N+ + V L RFL LV+D+D LLA H+GFLLGPWLESA
Sbjct: 635 RQALAKYANQIFLKIIQGYKSNNVNQVTTLCERFLNLVKDLDMLLASHEGFLLGPWLESA 694
Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
K LA+++EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIYFK
Sbjct: 695 KGLARSQEQEIQYEWNARTQITMWFDNTETKASLLRDYANKYWSGLLRDYYGPRAAIYFK 754
Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
++I SL+ + F L++WRREWI LTN+WQ+ R V+ + GDAL S+ L+ KYL+
Sbjct: 755 HLISSLKKKEPFALEEWRREWISLTNNWQSDRKVFATTATGDALNISRALFTKYLR 810
>gi|326519955|dbj|BAK03902.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 829
Score = 792 bits (2046), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/536 (67%), Positives = 446/536 (83%), Gaps = 6/536 (1%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSGN+PAAL+ FPSAK+T LGNWF+V S+PRWCCTYLLDA+DPL++EIG+ FIE+
Sbjct: 276 VLPAFSGNIPAALKLKFPSAKVTHLGNWFTVDSNPRWCCTYLLDASDPLYVEIGKLFIEE 335
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q++EYGRTSH+YNCDTFDENTPP+ P YISSLGAA + MQSGD+DA+WLMQGWLF+YD
Sbjct: 336 QIREYGRTSHVYNCDTFDENTPPLSDPNYISSLGAATFRAMQSGDNDAIWLMQGWLFTYD 395
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
PFW PPQMKALL+SVP+G+++VLDL+AEVKP W S QFYGVPYIWCMLHNFA + EMYG
Sbjct: 396 PFWEPPQMKALLHSVPVGRMIVLDLYAEVKPAWINSDQFYGVPYIWCMLHNFAADFEMYG 455
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
+LD++A GP++AR SEN+TMVGVGMSMEGIEQNP+VYDLMSEM F H +VD+K W+ Y
Sbjct: 456 VLDAVASGPIDARLSENSTMVGVGMSMEGIEQNPIVYDLMSEMVFHHRQVDLKVWVETYP 515
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----Q 303
RRYG+SV +QDAW +L+ T+YNCTDG DKNRDVIVAFPDV+PS+I T G Y +
Sbjct: 516 TRRYGKSVVGLQDAWRILHQTLYNCTDGKNDKNRDVIVAFPDVEPSVIQ-TPGLYARTSK 574
Query: 304 NYGKPVSKEAVLK-SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 362
NY +S+ V+K + +Y+ PH+WY T VI ALELF+ SG+E+S S+T+RYDL+DLT
Sbjct: 575 NYSTMLSENYVMKDAPNDAYEQPHIWYDTIAVIHALELFLESGDEVSDSSTFRYDLVDLT 634
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
RQALAKYAN++FL II+ Y+ N+ + V L RFL LV+D+D LLA H+GFLLGPWLESA
Sbjct: 635 RQALAKYANQIFLKIIQGYKSNNVNQVTTLCERFLNLVKDLDMLLASHEGFLLGPWLESA 694
Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
K LA+++EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIYFK
Sbjct: 695 KGLARSQEQEIQYEWNARTQITMWFDNTETKASLLRDYANKYWSGLLRDYYGPRAAIYFK 754
Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
++I SL+ + F L++WRREWI LTN+WQ+ R V+ + GDAL S+ L+ KYL+
Sbjct: 755 HLISSLKKKEPFALEEWRREWISLTNNWQSDRKVFATTATGDALNISRALFTKYLR 810
>gi|414585092|tpg|DAA35663.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
Length = 831
Score = 791 bits (2043), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/537 (68%), Positives = 444/537 (82%), Gaps = 6/537 (1%)
Query: 6 FAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
F VLPAFSGN+PAAL++ FPSAK+T LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FI
Sbjct: 275 FPVLPAFSGNIPAALKSKFPSAKVTHLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFI 334
Query: 66 EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
E+Q++EYGRTSHIYNCDTFDENTPP+ P YISSLGAA + GMQSGD+DA+WLMQGWLF+
Sbjct: 335 EEQIREYGRTSHIYNCDTFDENTPPLSDPNYISSLGAATFRGMQSGDNDAIWLMQGWLFT 394
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
YDPFW PPQMKALL+SVP+GK++VLDL+AEVKP+W S Q YGVPYIWCMLHNFA + EM
Sbjct: 395 YDPFWEPPQMKALLHSVPVGKMIVLDLYAEVKPVWINSDQLYGVPYIWCMLHNFAADFEM 454
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG+LD++A GP++AR S+N+TMVGVGMSMEGIEQNP+VYDLMSEMAF H +VD++ W+
Sbjct: 455 YGVLDALASGPIDARLSDNSTMVGVGMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVKT 514
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN- 304
Y RRYG+ V +QDAW +LY T+YNCTDG DKNRDVIVAFPDV+P +I+ T G + N
Sbjct: 515 YPTRRYGKPVKGLQDAWWILYRTLYNCTDGKNDKNRDVIVAFPDVEPFVIA-TPGLHVNT 573
Query: 305 ---YGKPVSKEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
Y SK + K +S +Y+HPHLWY T+ VI ALELF+ G+E+S SNT+RYDL+D
Sbjct: 574 RQMYSTVPSKNYIRKDVSSDAYEHPHLWYDTNAVIHALELFLQHGDEVSDSNTFRYDLVD 633
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
LTRQ LAKYAN++FL IIE+Y+ N+ + V L + FL LV D+D LL+ H+GFLLGPWLE
Sbjct: 634 LTRQVLAKYANDVFLKIIESYKSNNMNQVTILCQHFLSLVNDLDTLLSSHEGFLLGPWLE 693
Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
SAK LA+N EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWSGLL+DYYGPRAAIY
Sbjct: 694 SAKGLARNSEQEIQYEWNARTQITMWFDNTETKASLLRDYANKYWSGLLQDYYGPRAAIY 753
Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
FK+++ S+E+ F LK+WRREWI LTN+WQ+ R V+ + GD L SQ LY KYL
Sbjct: 754 FKHLLLSMENNAPFALKEWRREWISLTNNWQSDRKVFSTTATGDPLNISQSLYTKYL 810
>gi|218195716|gb|EEC78143.1| hypothetical protein OsI_17702 [Oryza sativa Indica Group]
Length = 829
Score = 791 bits (2043), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/538 (67%), Positives = 443/538 (82%), Gaps = 6/538 (1%)
Query: 6 FAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
F VLPAFSGN+PAAL++ FPSAK+T LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FI
Sbjct: 272 FPVLPAFSGNIPAALRSKFPSAKVTHLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFI 331
Query: 66 EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
E+Q++EYG TSH+Y+CDTFDENTPP+ P YISSLGAA + GMQSGD DA+WLMQGWLFS
Sbjct: 332 EEQIREYGGTSHVYSCDTFDENTPPLSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFS 391
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
YDPFW PPQMKALL+SVP+G+++VLDL+AEVKPIW S QFYGVPYIWCMLHNFA + EM
Sbjct: 392 YDPFWEPPQMKALLHSVPVGRMIVLDLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEM 451
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG+LD +A GP++AR S N+TM+GVGMSMEGIEQNP+VYDLMSEMAF H +VD++ W+
Sbjct: 452 YGVLDMVASGPIDARLSANSTMIGVGMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVET 511
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY--- 302
Y RRYG+S+ +QDAW +LY T+YNCTDG DKNRDVIVAFPDV+P +I T G Y
Sbjct: 512 YPTRRYGKSIVGLQDAWKILYQTLYNCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSS 570
Query: 303 -QNYGKPVSKEAV-LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
+ Y +SK + + + Y+HPHLWY T VIRALELF+ G+E+S SNT+RYDL+D
Sbjct: 571 SKTYSTKLSKNYIAVDASNDEYEHPHLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVD 630
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
LTRQ LAKYAN++F+ IIE+Y+ N+ + V L + F++LV D+D LLA H+GFLLGPWLE
Sbjct: 631 LTRQTLAKYANQVFVKIIESYKSNNVNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLE 690
Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
SAK LA+++EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIY
Sbjct: 691 SAKGLARDKEQEMQYEWNARTQITMWFDNTKTKASLLRDYANKYWSGLLRDYYGPRAAIY 750
Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
FKY+I S+E + F L++WRREWI LTN+WQ+ V+P + GDAL S+ LY KYL
Sbjct: 751 FKYLILSMEKKEPFALEEWRREWISLTNNWQSDWKVFPTTATGDALNISRTLYKKYLH 808
>gi|38345908|emb|CAE04506.2| OSJNBb0059K02.16 [Oryza sativa Japonica Group]
Length = 829
Score = 790 bits (2041), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/538 (67%), Positives = 443/538 (82%), Gaps = 6/538 (1%)
Query: 6 FAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
F VLPAFSGN+PAAL++ FPSAK+T LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FI
Sbjct: 272 FPVLPAFSGNIPAALRSKFPSAKVTHLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFI 331
Query: 66 EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
E+Q++EYG TSH+Y+CDTFDENTPP+ P YISSLGAA + GMQSGD DA+WLMQGWLFS
Sbjct: 332 EEQIREYGGTSHVYSCDTFDENTPPLSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFS 391
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
YDPFW PPQMKALL+SVP+G+++VLDL+AEVKPIW S QFYGVPYIWCMLHNFA + EM
Sbjct: 392 YDPFWEPPQMKALLHSVPVGRMIVLDLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEM 451
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG+LD +A GP++AR S N+TMVGVGMSMEGIEQNP+VYDLMSEMAF H +VD++ W+
Sbjct: 452 YGVLDMVASGPIDARLSANSTMVGVGMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVET 511
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY--- 302
Y RRYG+S+ +QDAW +LY T+YNCTDG DKNRDVIVAFPDV+P +I T G Y
Sbjct: 512 YPTRRYGKSMVGLQDAWKILYQTLYNCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSS 570
Query: 303 -QNYGKPVSKEAV-LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
+ Y +SK + + + Y+HPHLWY T VIRALELF+ G+E+S SNT+RYDL+D
Sbjct: 571 SKTYSTKLSKNYIAVDASNDEYEHPHLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVD 630
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
LTRQ LAKYAN++F+ IIE+Y+ N+ + V L + F++LV D+D LLA H+GFLLGPWLE
Sbjct: 631 LTRQTLAKYANQVFVKIIESYKSNNVNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLE 690
Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
SAK LA+++EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIY
Sbjct: 691 SAKGLARDKEQEMQYEWNARTQITMWFDNTKTKASLLRDYANKYWSGLLRDYYGPRAAIY 750
Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
FKY+I S+E + F L++WRREWI LTN+WQ+ V+P + GDAL S+ LY KYL
Sbjct: 751 FKYLILSMEKKEPFALEEWRREWISLTNNWQSDWKVFPTTATGDALNISRTLYKKYLH 808
>gi|357166414|ref|XP_003580702.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Brachypodium
distachyon]
Length = 829
Score = 784 bits (2025), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/536 (67%), Positives = 447/536 (83%), Gaps = 6/536 (1%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSG++PAAL++ FPSAK+T LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+
Sbjct: 275 VLPAFSGSIPAALKSKFPSAKVTHLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEE 334
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q++EYGRTSH+YNCDTFDENTPP+ P YISSLGAA + GMQSGD DA+WLMQGWLF+YD
Sbjct: 335 QIREYGRTSHVYNCDTFDENTPPLSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFTYD 394
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
PFW PPQMKALL+SVP+G+++VLDL+AEVKP+W S QFYGVPYIWCMLHNFA + EMYG
Sbjct: 395 PFWEPPQMKALLHSVPVGRMIVLDLYAEVKPVWINSDQFYGVPYIWCMLHNFAADFEMYG 454
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
+LD++A GP++AR SEN+TMVGVGMSMEGIEQNP+VYDLMSEM F H +VD++ W+ Y
Sbjct: 455 VLDAVASGPIDARLSENSTMVGVGMSMEGIEQNPIVYDLMSEMVFHHRQVDLQVWVETYP 514
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RRYG+S+ +QDAW +L+ T+YNCTDG DKNRDVIVAFPDV+P +I T G + + K
Sbjct: 515 TRRYGKSIVELQDAWRILHQTLYNCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLHTSASK 573
Query: 308 PVS----KEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 362
S K ++K E++ +Y+ PHLWY T+ VIRAL+LF+ G+E+S S+T+RYDL+DLT
Sbjct: 574 MFSTMSAKSYLVKDESNDAYEQPHLWYDTNVVIRALQLFLQYGDEVSDSSTFRYDLVDLT 633
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
RQALAKYAN++F II++Y+ N+ + V LS FL+LV D+D LLA H+GFLLGPWLESA
Sbjct: 634 RQALAKYANQIFAKIIQSYKSNNMNQVTTLSECFLDLVNDLDMLLASHEGFLLGPWLESA 693
Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
K LA+++EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWSGLL DYYGPRAAIYFK
Sbjct: 694 KGLARDQEQEIQYEWNARTQITMWFDNTETKASLLRDYANKYWSGLLGDYYGPRAAIYFK 753
Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
Y+I SLE + F L++WRREWI LTN+WQ+ R V+ + GDAL ++ LY KYL+
Sbjct: 754 YLILSLEKKEPFALEEWRREWISLTNNWQSDRKVFATAATGDALNIARSLYMKYLR 809
>gi|218192858|gb|EEC75285.1| hypothetical protein OsI_11626 [Oryza sativa Indica Group]
Length = 812
Score = 701 bits (1810), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/538 (60%), Positives = 412/538 (76%), Gaps = 2/538 (0%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L VLP+FSGNVP+ + +FPSA IT+LG+W +V DPRWCCTYLLD +D LFI++G+
Sbjct: 275 LGMVPVLPSFSGNVPSVFKKLFPSANITKLGDWNTVDGDPRWCCTYLLDPSDALFIDVGQ 334
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
AFI QQ+KEYG ++IYNCDTF+ENTPP + P YISSLG+AIY M G+ DAVWLMQGW
Sbjct: 335 AFIRQQMKEYGDITNIYNCDTFNENTPPTNEPAYISSLGSAIYEAMSRGNKDAVWLMQGW 394
Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
LF D FW+ PQMKALL+SVP GK++VLDLFA+VKPIW S QFYGVPYIWCMLHNF G
Sbjct: 395 LFYSDAAFWKEPQMKALLHSVPTGKMIVLDLFADVKPIWQMSSQFYGVPYIWCMLHNFGG 454
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
NIEMYGILDSIA GP++ARTS N+TMVGVGM MEGIE NPVVY+LMSEMAF+ +KV+V+
Sbjct: 455 NIEMYGILDSIASGPIDARTSHNSTMVGVGMCMEGIEHNPVVYELMSEMAFRSQKVEVED 514
Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
W+ YS RRYG+S ++ AW +LYHT+YNCTDG D N+D IV FPD+ P+ S K
Sbjct: 515 WLKIYSYRRYGQSNVEVEKAWGILYHTIYNCTDGIADHNKDYIVQFPDISPNSFSSDVSK 574
Query: 302 YQNYGKPVSKEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
+ + + SE S S HPHLWYST E I+ALELF+ +GN+LS S TYRYDL+D
Sbjct: 575 RKAISEVKKHRRFVLSEVSASLPHPHLWYSTKEAIKALELFLNAGNDLSKSLTYRYDLVD 634
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
LTRQ+L+K ANE++L+ + AY+ D++G+ +++FLEL+ D+D LLA D FLLGPWLE
Sbjct: 635 LTRQSLSKLANEVYLDAMNAYRKKDSNGLNFYTKKFLELIVDIDTLLASDDNFLLGPWLE 694
Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
AK LA+ E + KQYEWNARTQ+TMW+DNT+ E S L DY NK+WSGLL+ YY PRA+ Y
Sbjct: 695 DAKSLARTENERKQYEWNARTQVTMWYDNTKTEQSKLHDYANKFWSGLLKSYYLPRASKY 754
Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
F + + L+ F+L++WR++WI +N+WQ+G+ +Y V++ GDAL S L+ KY +
Sbjct: 755 FSRLTKGLQENQSFQLEEWRKDWIAYSNEWQSGKELYAVKATGDALAISSSLFKKYFR 812
>gi|413955691|gb|AFW88340.1| hypothetical protein ZEAMMB73_315381 [Zea mays]
Length = 814
Score = 698 bits (1801), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/538 (59%), Positives = 410/538 (76%), Gaps = 2/538 (0%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L VLP+FSGNVPA +FPSA IT+LG+W +V ++P+WCCTYLLD +D LFI++G+
Sbjct: 276 LGMVPVLPSFSGNVPAIFAKLFPSANITRLGDWNTVDANPKWCCTYLLDPSDSLFIDVGQ 335
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
AFI QQ+KEYG ++IYNCDTF+ENTPP D P YISSLG+AIY M G+ +AVWLMQGW
Sbjct: 336 AFIRQQIKEYGDVTNIYNCDTFNENTPPTDEPAYISSLGSAIYEAMSRGNKNAVWLMQGW 395
Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
LF D FW+ PQMKALL+SVP+GK++VLDLFA+VKPIW S QFYGVPYIWCMLHNF G
Sbjct: 396 LFYSDAAFWKEPQMKALLHSVPIGKMIVLDLFADVKPIWKVSSQFYGVPYIWCMLHNFGG 455
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
NIEMYGILDSI+ GP++ARTS N+TM+GVGM MEGIE NPVVY+LMSEMAF ++KV+V+
Sbjct: 456 NIEMYGILDSISSGPIDARTSYNSTMIGVGMCMEGIEHNPVVYELMSEMAFHNKKVEVED 515
Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
W+ YS RRYG++ I+ AW LYHT+YNCTDG D N+D IV FPD+ PS ++ K
Sbjct: 516 WLKTYSCRRYGQANADIEKAWRYLYHTIYNCTDGIADHNKDYIVEFPDISPSSVTYQVSK 575
Query: 302 YQNYGKPVSKEAVLKSETSS-YDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
+ + SE S PHLWYST E ++ALELF+ +G+ S S TYRYDL+D
Sbjct: 576 RRGMSITRNHRRFFLSEVSGILPQPHLWYSTKEAVKALELFLDAGSTFSESLTYRYDLVD 635
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
LTRQ L+K ANE++L+ I YQ D+HG+ +R+FLE++ D+D LLA D FLLGPWLE
Sbjct: 636 LTRQCLSKLANEVYLDAISLYQKKDSHGLNAHARKFLEIIVDIDTLLAADDNFLLGPWLE 695
Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
SAK LA E++ +QYEWNARTQ+TMW+DNT+ E S L DY NK+WSGLL+ YY PRA+ Y
Sbjct: 696 SAKSLAITEKERQQYEWNARTQVTMWYDNTETEQSKLHDYANKFWSGLLKSYYLPRASKY 755
Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
F Y+ SL+ F+L++WR++WI +N+WQ+G+ VY V++ GDAL ++ LY KYL+
Sbjct: 756 FAYLTRSLQENRSFQLEEWRKDWISYSNEWQSGKEVYAVKATGDALAIARSLYRKYLR 813
>gi|222624949|gb|EEE59081.1| hypothetical protein OsJ_10898 [Oryza sativa Japonica Group]
Length = 812
Score = 696 bits (1796), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/537 (60%), Positives = 409/537 (76%), Gaps = 2/537 (0%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L VLP+FSGNVP+ + +FPSA IT+LG+W +V DPRWCCTYLLD +D LFI++G+
Sbjct: 275 LGMVPVLPSFSGNVPSVFKKLFPSANITKLGDWNTVDGDPRWCCTYLLDPSDALFIDVGQ 334
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
AFI QQ+KEYG ++IYNCDTF+ENTPP + P YISSLG+AIY M G+ DAVWLMQGW
Sbjct: 335 AFIRQQMKEYGDITNIYNCDTFNENTPPTNEPAYISSLGSAIYEAMSRGNKDAVWLMQGW 394
Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
LF D FW+ PQMKALL+SVP GK++VLDLFA+VKPIW S QFYGVPYIWCMLHNF G
Sbjct: 395 LFYSDAAFWKEPQMKALLHSVPTGKMIVLDLFADVKPIWQMSSQFYGVPYIWCMLHNFGG 454
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
NIEMYGILDSIA GP++ARTS N+TMVGVGM MEGIE NPVVY+LMSEMAF+ +KV+V+
Sbjct: 455 NIEMYGILDSIASGPIDARTSHNSTMVGVGMCMEGIEHNPVVYELMSEMAFRSQKVEVED 514
Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
W+ YS RRYG+S ++ AW +LYHT+YNCTDG D N D IV FPD+ P+ S K
Sbjct: 515 WLKIYSYRRYGQSNVEVEKAWGILYHTIYNCTDGIADHNNDYIVEFPDISPNSFSSDVSK 574
Query: 302 YQNYGKPVSKEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
+ + + SE S S HPHLWYST E I+ALELF+ +GN+LS S TYRYDL+D
Sbjct: 575 RKAISEVKKHRRFVLSEVSASLPHPHLWYSTKEAIKALELFLNAGNDLSKSLTYRYDLVD 634
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
LTRQ+L+K ANE++L+ + AY+ D++G+ +++FLEL+ D+D LLA D FLLGPWLE
Sbjct: 635 LTRQSLSKLANEVYLDAMNAYRKKDSNGLNFYTKKFLELIVDIDTLLASDDNFLLGPWLE 694
Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
AK LA+ E + KQYEWNARTQ+TMW+DNT+ E S L DY NK+WSGLL+ YY PRA+ Y
Sbjct: 695 DAKSLARTENERKQYEWNARTQVTMWYDNTKTEQSKLHDYANKFWSGLLKSYYLPRASKY 754
Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
F + + L+ F+L++W ++WI +N+WQ+G+ +Y V++ GDAL S L+ KY
Sbjct: 755 FSRLTKGLQENQSFQLEEWTKDWIAYSNEWQSGKELYAVKATGDALAISSSLFKKYF 811
>gi|357112065|ref|XP_003557830.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Brachypodium
distachyon]
Length = 809
Score = 692 bits (1786), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/539 (60%), Positives = 419/539 (77%), Gaps = 6/539 (1%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L VLP+FSGNVP A + +FPSA IT+LG W +V DPRWCCTY+LD +D LFI++G
Sbjct: 272 LGMVPVLPSFSGNVPVAFKKLFPSANITRLGEWNTVDGDPRWCCTYILDPSDALFIDVGH 331
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
AFI QQ+KEYG + IYNCDTF+ENTPP + P YISSLG+AIY M SG+ DAVWLMQGW
Sbjct: 332 AFIRQQIKEYGDITSIYNCDTFNENTPPTNEPAYISSLGSAIYEAMSSGNKDAVWLMQGW 391
Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
LF D FW+ PQMKALL+SVP+GK++VLDLFA+VKP+W S QFYGVPYIWCMLHNF G
Sbjct: 392 LFYSDAAFWKEPQMKALLHSVPIGKMIVLDLFADVKPVWKMSSQFYGVPYIWCMLHNFGG 451
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
NIEMYGILDSI+ GP++ARTS +TMVGVGM+MEGIE NPVV++LMSEM+F+ +KV+V+
Sbjct: 452 NIEMYGILDSISSGPIDARTSYGSTMVGVGMTMEGIEHNPVVFELMSEMSFRSQKVEVED 511
Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
W+ YS RRYG+S I+ AW VLYHT+YNCTDG D NRD IV FPD+ PS S K
Sbjct: 512 WLKSYSYRRYGQSNVKIEKAWGVLYHTIYNCTDGIADHNRDYIVEFPDMSPSSFSSHFSK 571
Query: 302 YQNYGKPVSKEA---VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 358
+ G P+ ++ L +++ HPHLWYST+E ++ALELF+ +GN+LS S T+RYDL
Sbjct: 572 QR--GMPIVRKHPRFFLSEVSANLPHPHLWYSTNEAVKALELFLNAGNDLSKSLTFRYDL 629
Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
+DLTRQ+L+K AN+++L+ +++Y+ ++ G+ +++FLEL+ D+D LLA D FLLGPW
Sbjct: 630 VDLTRQSLSKLANKVYLDAMDSYKNKNSSGLNFHTKKFLELIVDIDILLASDDNFLLGPW 689
Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
LESAK LA +EE+ KQYEWNARTQ+TMW+DNT+ E S L DY NK+WSGLL++YY PRA+
Sbjct: 690 LESAKSLAMSEEERKQYEWNARTQVTMWYDNTKTEQSHLHDYANKFWSGLLKNYYLPRAS 749
Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
YF + SL+ F+L++WRR+WI +N+WQ+G +YPV++ GDAL S+ L+ KYL
Sbjct: 750 KYFTGLSRSLQENRSFQLEEWRRDWISYSNEWQSGEELYPVKAKGDALAISKSLFRKYL 808
>gi|356534602|ref|XP_003535842.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Glycine max]
Length = 807
Score = 687 bits (1774), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/536 (60%), Positives = 397/536 (74%), Gaps = 6/536 (1%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP+FSGNVPAAL +FPSAKIT+LG+W +V DPRWCCTYLLD +DPLF+EIG AFI +
Sbjct: 267 VLPSFSGNVPAALTKIFPSAKITRLGDWNTVDGDPRWCCTYLLDPSDPLFVEIGEAFIRK 326
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q+KEYG + IYNCDTF+EN+PP + PEYIS+LGAA+Y G+ GD DAVWLMQGWLF D
Sbjct: 327 QIKEYGDVTDIYNCDTFNENSPPTNDPEYISNLGAAVYKGISKGDKDAVWLMQGWLFYSD 386
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW+PPQMKALL+SVP GK++VLDLFA+VKPIW S QFYG PYIWCMLHNF GNIEMY
Sbjct: 387 SSFWKPPQMKALLHSVPFGKMIVLDLFADVKPIWKNSFQFYGTPYIWCMLHNFGGNIEMY 446
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G LDSI+ GPV+AR S N+TMVGVGM MEGIEQNP+VY+LMSEMAF+ +KV V WI Y
Sbjct: 447 GTLDSISSGPVDARVSANSTMVGVGMCMEGIEQNPIVYELMSEMAFRDKKVKVSEWIKSY 506
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
RRYG+ + ++ AW +LYHT+YNCTDG D N D IV FPD +PS SVT G N
Sbjct: 507 CHRRYGKVIHQVESAWEILYHTIYNCTDGIADHNHDFIVMFPDWNPSTNSVT-GTSNNQK 565
Query: 307 ----KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 362
P ++ + + S HLWY + +VI+AL+LF+A G L+ S TYRYDL+DLT
Sbjct: 566 IYLLPPGNRRYLFQETLSDMPQAHLWYPSDDVIKALQLFLAGGKNLAGSLTYRYDLVDLT 625
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
RQ L+K AN+++ + +YQ + + S +FL+L++D+D LLA D FLLG WLESA
Sbjct: 626 RQVLSKLANQVYHKAVTSYQKKNIEALQFHSNKFLQLIKDIDVLLASDDNFLLGTWLESA 685
Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
K+LA N + KQYEWNARTQ+TMWFD + S L DY NK+WSGLL YY PRA+ YF
Sbjct: 686 KKLAVNPSEIKQYEWNARTQVTMWFDTNETTQSKLHDYANKFWSGLLESYYLPRASTYFS 745
Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
++ ESL D F+L +WR++WI +N WQ G +YPV++ GDAL SQ LY KY Q
Sbjct: 746 HLTESLRQNDKFKLIEWRKQWISQSNKWQEGNELYPVKAKGDALTISQALYEKYFQ 801
>gi|297736304|emb|CBI24942.3| unnamed protein product [Vitis vinifera]
Length = 868
Score = 686 bits (1770), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/538 (60%), Positives = 404/538 (75%), Gaps = 9/538 (1%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP+FSGNVP AL+ +FPSA IT+LG W +V ++ RWCCTYLLDA+DPLFI+IG+AFI Q
Sbjct: 331 VLPSFSGNVPEALKKIFPSANITRLGEWNTVDNNTRWCCTYLLDASDPLFIQIGKAFIRQ 390
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q+KEYG + IYNCDTF+EN+PP + P YISSLGAAIY M GD D+VWLMQGWLF D
Sbjct: 391 QIKEYGDVTDIYNCDTFNENSPPTNDPAYISSLGAAIYKAMSQGDKDSVWLMQGWLFYSD 450
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW+PPQMKALL+SVP GK+VVLDLFA+ KPIW TS QFYG PYIWCMLHNF GNIEMY
Sbjct: 451 SGFWKPPQMKALLHSVPFGKMVVLDLFADAKPIWRTSSQFYGTPYIWCMLHNFGGNIEMY 510
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
GILD+++ GPV+AR S+N+TMVGVGM MEGIEQNPV Y+LMSEMAF+ EKV + W+ Y
Sbjct: 511 GILDAVSSGPVDARISKNSTMVGVGMCMEGIEQNPVAYELMSEMAFRSEKVQLVEWLKTY 570
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI-----ISVTEGK 301
S RRYG++V ++ AW +LY T+YNCTDG D N D +V FPD DPS+ IS +
Sbjct: 571 SYRRYGKAVHHVEAAWEILYRTIYNCTDGIADHNTDFMVNFPDWDPSLNPSSDISKEQHI 630
Query: 302 YQNYGKPVSKEAVLKSETSSYDHP--HLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
Q + +L ETSS D P HLWYST EV+ AL LF+ +GNELS S+TYRYDL+
Sbjct: 631 IQKILTQTGRRKILFQETSS-DLPQSHLWYSTHEVVNALRLFLDAGNELSKSSTYRYDLV 689
Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
DLTRQ L+K N+++L+ + A++ DA S++F++LV+D+D LLA D FLLG WL
Sbjct: 690 DLTRQVLSKLGNQVYLDAVIAFRQKDAKNFHLHSQKFVQLVKDIDTLLASDDNFLLGTWL 749
Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
ESAK+LA N + +QYEWNARTQ+TMWF T+ S L DY NK+WSGLL +YY PRA++
Sbjct: 750 ESAKKLAVNPREMEQYEWNARTQLTMWFYVTKTNQSKLHDYANKFWSGLLENYYLPRASM 809
Query: 480 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
YF Y+ ++L F+L++WRREWI +N WQ G+ +YPV + GD L S+ LY KY
Sbjct: 810 YFSYLAKALTENKNFKLEEWRREWISYSNKWQAGKELYPVRAKGDTLAISRALYEKYF 867
>gi|225450036|ref|XP_002273084.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Vitis vinifera]
Length = 803
Score = 685 bits (1767), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/538 (60%), Positives = 404/538 (75%), Gaps = 9/538 (1%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP+FSGNVP AL+ +FPSA IT+LG W +V ++ RWCCTYLLDA+DPLFI+IG+AFI Q
Sbjct: 266 VLPSFSGNVPEALKKIFPSANITRLGEWNTVDNNTRWCCTYLLDASDPLFIQIGKAFIRQ 325
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q+KEYG + IYNCDTF+EN+PP + P YISSLGAAIY M GD D+VWLMQGWLF D
Sbjct: 326 QIKEYGDVTDIYNCDTFNENSPPTNDPAYISSLGAAIYKAMSQGDKDSVWLMQGWLFYSD 385
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW+PPQMKALL+SVP GK+VVLDLFA+ KPIW TS QFYG PYIWCMLHNF GNIEMY
Sbjct: 386 SGFWKPPQMKALLHSVPFGKMVVLDLFADAKPIWRTSSQFYGTPYIWCMLHNFGGNIEMY 445
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
GILD+++ GPV+AR S+N+TMVGVGM MEGIEQNPV Y+LMSEMAF+ EKV + W+ Y
Sbjct: 446 GILDAVSSGPVDARISKNSTMVGVGMCMEGIEQNPVAYELMSEMAFRSEKVQLVEWLKTY 505
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI-----ISVTEGK 301
S RRYG++V ++ AW +LY T+YNCTDG D N D +V FPD DPS+ IS +
Sbjct: 506 SYRRYGKAVHHVEAAWEILYRTIYNCTDGIADHNTDFMVNFPDWDPSLNPSSDISKEQHI 565
Query: 302 YQNYGKPVSKEAVLKSETSSYDHP--HLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
Q + +L ETSS D P HLWYST EV+ AL LF+ +GNELS S+TYRYDL+
Sbjct: 566 IQKILTQTGRRKILFQETSS-DLPQSHLWYSTHEVVNALRLFLDAGNELSKSSTYRYDLV 624
Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
DLTRQ L+K N+++L+ + A++ DA S++F++LV+D+D LLA D FLLG WL
Sbjct: 625 DLTRQVLSKLGNQVYLDAVIAFRQKDAKNFHLHSQKFVQLVKDIDTLLASDDNFLLGTWL 684
Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
ESAK+LA N + +QYEWNARTQ+TMWF T+ S L DY NK+WSGLL +YY PRA++
Sbjct: 685 ESAKKLAVNPREMEQYEWNARTQLTMWFYVTKTNQSKLHDYANKFWSGLLENYYLPRASM 744
Query: 480 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
YF Y+ ++L F+L++WRREWI +N WQ G+ +YPV + GD L S+ LY KY
Sbjct: 745 YFSYLAKALTENKNFKLEEWRREWISYSNKWQAGKELYPVRAKGDTLAISRALYEKYF 802
>gi|224106113|ref|XP_002314048.1| predicted protein [Populus trichocarpa]
gi|222850456|gb|EEE88003.1| predicted protein [Populus trichocarpa]
Length = 806
Score = 671 bits (1732), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 311/536 (58%), Positives = 399/536 (74%), Gaps = 6/536 (1%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP+FSGNVPAAL+ +FPSA IT+LG+W +V +PRWCCTYLL+ +DPLF+EIG AFI Q
Sbjct: 270 VLPSFSGNVPAALKKIFPSANITRLGDWNTVDKNPRWCCTYLLNPSDPLFVEIGEAFIRQ 329
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q+KEYG + IYNCDTF+EN+PP P YISSLGAA+Y M GD DAVWLMQGWLF D
Sbjct: 330 QVKEYGDVTDIYNCDTFNENSPPTSDPAYISSLGAAVYKAMSRGDKDAVWLMQGWLFYSD 389
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW+PPQM+ALL+SVP GK++VLDLFAE KPIW S QFYG PY+WC+LHNF GNIEMY
Sbjct: 390 SAFWKPPQMQALLHSVPFGKMIVLDLFAEAKPIWKNSSQFYGTPYVWCLLHNFGGNIEMY 449
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
GILD+I+ GPV+AR EN+TMVGVGM MEGIE NPVVY+LMSEMAF+ K V W+ Y
Sbjct: 450 GILDAISSGPVDARIIENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSGKPQVLEWLKTY 509
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
S RRYG++V + AW++LYHT+YNCTDG D N D IV FPD DPS+ S + Q+
Sbjct: 510 SRRRYGKAVRQVVAAWDILYHTIYNCTDGIADHNTDFIVKFPDWDPSLHSGSNISEQDNM 569
Query: 307 KPV-----SKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
+ + ++ + + +S + HLWYST EVI+AL LF+ +GN+L+ S TYRYDL+DL
Sbjct: 570 RILLTSSGTRRFLFQETSSDFPEAHLWYSTQEVIQALWLFLDAGNDLAGSPTYRYDLVDL 629
Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
TRQ L+K AN+++ + + A++ DA + ++FL++++D+D LLA D FLLG WLES
Sbjct: 630 TRQVLSKLANQVYRDAMIAFRRKDARALNLHGQKFLQIIKDIDVLLASDDNFLLGTWLES 689
Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
AK+LA + K YEWNARTQ+TMW+D T+ S L DY NK+WSGLL DYY PRA+ YF
Sbjct: 690 AKKLAVDPNDMKLYEWNARTQVTMWYDTTKTNQSQLHDYANKFWSGLLEDYYLPRASTYF 749
Query: 482 KYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
++++SLE F+L +WR+EWI +N WQ +YPV++ GDAL ++ LY KY
Sbjct: 750 GHLMKSLEENKNFKLTEWRKEWIAFSNKWQADTKIYPVKAKGDALAIAKALYRKYF 805
>gi|297807393|ref|XP_002871580.1| alpha-N-acetylglucosaminidase family [Arabidopsis lyrata subsp.
lyrata]
gi|297317417|gb|EFH47839.1| alpha-N-acetylglucosaminidase family [Arabidopsis lyrata subsp.
lyrata]
Length = 806
Score = 670 bits (1728), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/539 (59%), Positives = 404/539 (74%), Gaps = 10/539 (1%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP+FSGNVP+AL+ ++P A IT+L NW +V D RWCCTYLL+ +DPLFI+IG AFI+Q
Sbjct: 268 VLPSFSGNVPSALRKIYPGANITRLDNWNTVDGDSRWCCTYLLNPSDPLFIDIGEAFIKQ 327
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q +EYG ++IYNCDTF+ENTPP PEYISSLGAA+Y M G+ +AVWLMQGWLFS D
Sbjct: 328 QPEEYGEITNIYNCDTFNENTPPTSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSSD 387
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW+PPQMK LL+SVP GK++VLDL+AEVKPIW+TS QFYG PYIWCMLHNF GNIEMY
Sbjct: 388 SKFWKPPQMKVLLHSVPFGKMIVLDLYAEVKPIWNTSAQFYGTPYIWCMLHNFGGNIEMY 447
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G LDSI+ GPV+AR S+N+TMVGVGM MEGIEQNPVVY+L+SEMAF+ EKVDV+ W+ Y
Sbjct: 448 GALDSISSGPVDARVSKNSTMVGVGMCMEGIEQNPVVYELISEMAFRDEKVDVQKWLKSY 507
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN-- 304
+ RRY + I+ AW +LYHTVYNCTDG D N D IV PD DPS E K+ +
Sbjct: 508 ARRRYMKENHQIEAAWEILYHTVYNCTDGIADHNTDFIVKLPDWDPSSSVQDESKHTDSY 567
Query: 305 ---YGKPVSKEAVLKSETSSYDHP--HLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
G +K VL + SS D P HLWYST EVI+AL+LF+ +G+ELS S TYRYD++
Sbjct: 568 MISTGPYETKRRVLFQDKSS-DLPKAHLWYSTKEVIQALKLFLEAGDELSRSLTYRYDMV 626
Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
DLTRQ L+K AN++++ + A+ D + QLS +FLEL++D+D LLA D FLLG WL
Sbjct: 627 DLTRQVLSKLANQVYIEAVTAFVKKDIGSLGQLSEKFLELIKDIDVLLASDDNFLLGTWL 686
Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
ESAK+LA+N ++ KQYEWNARTQ+TMW+D+ S L DY NK WSGLL DYY PRA +
Sbjct: 687 ESAKKLARNGDERKQYEWNARTQVTMWYDSKDVNQSKLHDYANKLWSGLLEDYYLPRARL 746
Query: 480 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRN-VYPVESNGDALITSQWLYNKYL 537
YF M++SL F+++ W+REWI +++ WQ + VYPV++ GDAL S+ L KY
Sbjct: 747 YFNEMLKSLRDKKKFKVEKWQREWIMMSHKWQQSSSEVYPVKAKGDALAISKHLLLKYF 805
>gi|15240689|ref|NP_196873.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
gi|9758035|dbj|BAB08696.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
gi|19423948|gb|AAL87291.1| putative alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
gi|21436231|gb|AAM51254.1| putative alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
gi|332004545|gb|AED91928.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
Length = 806
Score = 667 bits (1721), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 318/539 (58%), Positives = 401/539 (74%), Gaps = 10/539 (1%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP+FSGNVP+AL+ ++P A IT+L NW +V D RWCCTYLL+ +DPLFIEIG AFI+Q
Sbjct: 268 VLPSFSGNVPSALRKIYPEANITRLDNWNTVDGDSRWCCTYLLNPSDPLFIEIGEAFIKQ 327
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q +EYG ++IYNCDTF+ENTPP PEYISSLGAA+Y M G+ +AVWLMQGWLFS D
Sbjct: 328 QTEEYGEITNIYNCDTFNENTPPTSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSSD 387
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW+PPQ+KALL+SVP GK++VLDL+AEVKPIW+ S QFYG PYIWCMLHNF GNIEMY
Sbjct: 388 SKFWKPPQLKALLHSVPFGKMIVLDLYAEVKPIWNKSAQFYGTPYIWCMLHNFGGNIEMY 447
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G LDSI+ GPV+AR S+N+TMVGVGM MEGIEQNPVVY+L SEMAF+ EKVDV+ W+ Y
Sbjct: 448 GALDSISSGPVDARVSKNSTMVGVGMCMEGIEQNPVVYELTSEMAFRDEKVDVQKWLKSY 507
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ RRY + I+ AW +LYHTVYNCTDG D N D IV PD DPS SV + Q
Sbjct: 508 ARRRYMKENHQIEAAWEILYHTVYNCTDGIADHNTDFIVKLPDWDPS-SSVQDDLKQKDS 566
Query: 307 KPVS-------KEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
+S + + + +T+ HLWYST EVI+AL+LF+ +G++LS S TYRYD++
Sbjct: 567 YMISTGPYETKRRVLFQDKTADLPKAHLWYSTKEVIQALKLFLEAGDDLSRSLTYRYDMV 626
Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
DLTRQ L+K AN+++ + A+ D + QLS +FLEL++DMD LLA D LLG WL
Sbjct: 627 DLTRQVLSKLANQVYTEAVTAFVKKDIGSLGQLSEKFLELIKDMDVLLASDDNCLLGTWL 686
Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
ESAK+LA+N ++ KQYEWNARTQ+TMW+D+ S L DY NK+WSGLL DYY PRA +
Sbjct: 687 ESAKKLAKNGDERKQYEWNARTQVTMWYDSNDVNQSKLHDYANKFWSGLLEDYYLPRARL 746
Query: 480 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRN-VYPVESNGDALITSQWLYNKYL 537
YF M++SL F+++ WRREWI +++ WQ + VYPV++ GDAL S+ L +KY
Sbjct: 747 YFNEMLKSLRDKKIFKVEKWRREWIMMSHKWQQSSSEVYPVKAKGDALAISRHLLSKYF 805
>gi|357458267|ref|XP_003599414.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
gi|355488462|gb|AES69665.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
Length = 832
Score = 665 bits (1715), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 316/562 (56%), Positives = 396/562 (70%), Gaps = 33/562 (5%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP+FSGNVPAAL +FPSAKIT+LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +
Sbjct: 271 VLPSFSGNVPAALTKIFPSAKITRLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRK 330
Query: 68 QLK--------------------------EYGRTSHIYNCDTFDENTPPVDSPEYISSLG 101
Q+K EYG + IYNCDTF+EN+PP P YIS+LG
Sbjct: 331 QIKATETIHQESEDLGSLIIMDRAVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLG 390
Query: 102 AAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIW 160
AA+Y G+ GD DAVWLMQGWLF D FW+PPQMKALL SVP GK++VLDLFA+VKPIW
Sbjct: 391 AAVYQGISKGDKDAVWLMQGWLFYSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIW 450
Query: 161 STSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQN 220
TS QFYG PYIWCMLHNF GNIEMYG+LD+IA GPV+AR SEN+TMVGVGM MEGIE N
Sbjct: 451 KTSFQFYGTPYIWCMLHNFGGNIEMYGVLDAIASGPVDARVSENSTMVGVGMCMEGIEHN 510
Query: 221 PVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKN 280
P+VY+LMSEMAF+ EKV + W+ YS RRYG+++ + AW +LYHT+YN TDG D N
Sbjct: 511 PIVYELMSEMAFRDEKVKINEWLKSYSHRRYGKAIHEVDAAWEILYHTIYNSTDGIADHN 570
Query: 281 RDVIVAFPDVDPSIISVTEGKYQNYGK-----PVSKEAVLKSETSSYDHPHLWYSTSEVI 335
D IV PD DPS +V G + K P ++ + + + HLWY +VI
Sbjct: 571 HDYIVMLPDWDPS-AAVKSGMSNHQKKIYFLPPGNRRYLFQQTPAGMPQAHLWYPPEDVI 629
Query: 336 RALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRR 395
+AL+LF+A G L S TYRYDL+DLTRQ L+K+AN++++ I ++Q + + S
Sbjct: 630 KALQLFLAGGKNLKGSLTYRYDLVDLTRQVLSKFANQVYIKAITSFQKKNIDALQLNSHM 689
Query: 396 FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEAS 455
FLEL++D+D LLA D FLLG WL+SAK+LA N + KQYEWNARTQ+TMWFD + S
Sbjct: 690 FLELIKDIDLLLASDDNFLLGTWLQSAKKLAVNPSELKQYEWNARTQVTMWFDTNETTQS 749
Query: 456 LLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRN 515
L DY NK+WSG+L +YY PRA+ YF ++ ESL+ + F L +WR+EWI ++N WQ G
Sbjct: 750 KLHDYANKFWSGILENYYLPRASTYFSHLSESLKQNEKFNLTEWRKEWIPMSNKWQEGSE 809
Query: 516 VYPVESNGDALITSQWLYNKYL 537
+YPV++ GDAL SQ LY KY
Sbjct: 810 LYPVKAKGDALTISQALYKKYF 831
>gi|357458271|ref|XP_003599416.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
gi|355488464|gb|AES69667.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
Length = 807
Score = 660 bits (1702), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/557 (56%), Positives = 388/557 (69%), Gaps = 48/557 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP+FSGNVPAAL +FPSAKIT+LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +
Sbjct: 271 VLPSFSGNVPAALTKIFPSAKITRLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRK 330
Query: 68 QLK--------------------------EYGRTSHIYNCDTFDENTPPVDSPEYISSLG 101
Q+K EYG + IYNCDTF+EN+PP P YIS+LG
Sbjct: 331 QIKATETIHQESEDLGSLIIMDRAVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLG 390
Query: 102 AAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIW 160
AA+Y G+ GD DAVWLMQGWLF D FW+PPQMKALL SVP GK++VLDLFA+VKPIW
Sbjct: 391 AAVYQGISKGDKDAVWLMQGWLFYSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIW 450
Query: 161 STSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQN 220
TS QFYG PYIWCMLHNF GNIEMYG+LD+IA GPV+AR SEN+TMVGVGM MEGIE N
Sbjct: 451 KTSFQFYGTPYIWCMLHNFGGNIEMYGVLDAIASGPVDARVSENSTMVGVGMCMEGIEHN 510
Query: 221 PVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKN 280
P+VY+LMSEMAF+ EKV + W+ YS RRYG+++ + AW +LYHT+YN TDG D N
Sbjct: 511 PIVYELMSEMAFRDEKVKINEWLKSYSHRRYGKAIHEVDAAWEILYHTIYNSTDGIADHN 570
Query: 281 RDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALEL 340
D IV PD DPS + G Q HLWY +VI+AL+L
Sbjct: 571 HDYIVMLPDWDPSAAVKSAGMPQ---------------------AHLWYPPEDVIKALQL 609
Query: 341 FIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELV 400
F+A G L S TYRYDL+DLTRQ L+K+AN++++ I ++Q + + S FLEL+
Sbjct: 610 FLAGGKNLKGSLTYRYDLVDLTRQVLSKFANQVYIKAITSFQKKNIDALQLNSHMFLELI 669
Query: 401 EDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDY 460
+D+D LLA D FLLG WL+SAK+LA N + KQYEWNARTQ+TMWFD + S L DY
Sbjct: 670 KDIDLLLASDDNFLLGTWLQSAKKLAVNPSELKQYEWNARTQVTMWFDTNETTQSKLHDY 729
Query: 461 GNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE 520
NK+WSG+L +YY PRA+ YF ++ ESL+ + F L +WR+EWI ++N WQ G +YPV+
Sbjct: 730 ANKFWSGILENYYLPRASTYFSHLSESLKQNEKFNLTEWRKEWIPMSNKWQEGSELYPVK 789
Query: 521 SNGDALITSQWLYNKYL 537
+ GDAL SQ LY KY
Sbjct: 790 AKGDALTISQALYKKYF 806
>gi|449489156|ref|XP_004158231.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
[Cucumis sativus]
Length = 567
Score = 645 bits (1663), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 309/531 (58%), Positives = 380/531 (71%), Gaps = 24/531 (4%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP+FSGNVPA L +FPSA IT+LGNW S+ +DP CCTYLL+ +DPLF++IG AFI Q
Sbjct: 59 VLPSFSGNVPAGLVEIFPSANITKLGNWNSIDADPSTCCTYLLNPSDPLFVKIGEAFIRQ 118
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q+KEYG ++IY+CDTF+ENTPP + YISSLGA++Y M D DAVWLMQGWLF D
Sbjct: 119 QIKEYGDVTNIYSCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSD 178
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW+P QMKALL+SVP GK++VLDLFA+VKPIW +S QFYG PY+WCMLHNF GNIEMY
Sbjct: 179 SDFWKPDQMKALLHSVPFGKMIVLDLFADVKPIWKSSSQFYGTPYVWCMLHNFGGNIEMY 238
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
GILD+I+ GPV+A SEN+TMVGVGM MEGIE NPVVY+LMSEMAF+ +KV V+ W+ Y
Sbjct: 239 GILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRXQKVQVQEWLKTY 298
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
S RYG++ + AWN+LYHT+YNCTDG + N D IV PD DPS S + K
Sbjct: 299 SRCRYGKADHYVDAAWNILYHTIYNCTDGIANHNTDFIVKLPDWDPS--STFDLK----- 351
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
KP PHLWYST EVI AL+L + + L S TYRYDL+DLTRQ L
Sbjct: 352 KP----------------PHLWYSTQEVINALQLLVNVDDNLVHSATYRYDLVDLTRQVL 395
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
K ANE +L + A++ + S+RF++L+ D+D LLA + FLLG WLESAK+LA
Sbjct: 396 GKLANEEYLKAVTAFRRQNVKAQNLHSKRFIQLIRDIDKLLASNSNFLLGTWLESAKKLA 455
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
N + KQYEWNARTQ+TMW+DNT+ S L DY NKYWSGLL YY PRA YF Y+ +
Sbjct: 456 TNPAEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANKYWSGLLEGYYLPRALTYFYYLSK 515
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
SL + F L+DWRREWI +N WQ +YPV++ G+A+ S+ LY KY
Sbjct: 516 SLRKNESFHLEDWRREWILFSNKWQAASELYPVKAEGNAVAISKALYEKYF 566
>gi|449436325|ref|XP_004135943.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis sativus]
Length = 774
Score = 643 bits (1658), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 309/531 (58%), Positives = 380/531 (71%), Gaps = 24/531 (4%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP+FSGNVPA L +FPSA IT+LGNW S+ +DP CCTYLL+ +DPLF++IG AFI Q
Sbjct: 266 VLPSFSGNVPAGLVEIFPSANITKLGNWNSIDADPSTCCTYLLNPSDPLFVKIGEAFIRQ 325
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q+KEYG ++IY+CDTF+ENTPP + YISSLGA++Y M D DAVWLMQGWLF D
Sbjct: 326 QIKEYGDVTNIYSCDTFNENTPPTNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSD 385
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW+P QMKALL+SVP GK++VLDLFA+VKPIW +S QFYG PY+WCMLHNF GNIEMY
Sbjct: 386 SDFWKPDQMKALLHSVPFGKMIVLDLFADVKPIWKSSSQFYGTPYVWCMLHNFGGNIEMY 445
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
GILD+I+ GPV+A SEN+TMVGVGM MEGIE NPVVY+LMSEMAF+ +KV V+ W+ Y
Sbjct: 446 GILDAISSGPVDALASENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSKKVQVQEWLKTY 505
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
S RYG++ + AWN+LYHT+YNCTDG + N D IV PD DPS S + K
Sbjct: 506 SRCRYGKADHYVDAAWNILYHTIYNCTDGIANHNTDFIVKLPDWDPS--STFDLK----- 558
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
KP PHLWYST EVI AL+L + + L S TYRYDL+DLTRQ L
Sbjct: 559 KP----------------PHLWYSTQEVINALQLLVNVDDNLVHSATYRYDLVDLTRQVL 602
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
K ANE +L + A++ + S+RF++L+ D+D LLA + FLLG WLESAK+LA
Sbjct: 603 GKLANEEYLKAVTAFRRQNVKAQNLHSKRFIQLIRDIDKLLASNSNFLLGTWLESAKKLA 662
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
N + KQYEWNARTQ+TMW+DNT+ S L DY NKYWSGLL YY PRA YF Y+ +
Sbjct: 663 TNPAEMKQYEWNARTQVTMWYDNTKVNQSKLHDYANKYWSGLLEGYYLPRALTYFYYLSK 722
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
SL + F L+DWRREWI +N WQ +YPV++ G+A+ S+ LY KY
Sbjct: 723 SLRKNESFHLEDWRREWILFSNKWQAASELYPVKAEGNAVAISKALYEKYF 773
>gi|414585093|tpg|DAA35664.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
Length = 721
Score = 638 bits (1646), Expect = e-180, Method: Compositional matrix adjust.
Identities = 295/434 (67%), Positives = 360/434 (82%), Gaps = 6/434 (1%)
Query: 6 FAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
F VLPAFSGN+PAAL++ FPSAK+T LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FI
Sbjct: 275 FPVLPAFSGNIPAALKSKFPSAKVTHLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFI 334
Query: 66 EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
E+Q++EYGRTSHIYNCDTFDENTPP+ P YISSLGAA + GMQSGD+DA+WLMQGWLF+
Sbjct: 335 EEQIREYGRTSHIYNCDTFDENTPPLSDPNYISSLGAATFRGMQSGDNDAIWLMQGWLFT 394
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
YDPFW PPQMKALL+SVP+GK++VLDL+AEVKP+W S Q YGVPYIWCMLHNFA + EM
Sbjct: 395 YDPFWEPPQMKALLHSVPVGKMIVLDLYAEVKPVWINSDQLYGVPYIWCMLHNFAADFEM 454
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG+LD++A GP++AR S+N+TMVGVGMSMEGIEQNP+VYDLMSEMAF H +VD++ W+
Sbjct: 455 YGVLDALASGPIDARLSDNSTMVGVGMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVKT 514
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN- 304
Y RRYG+ V +QDAW +LY T+YNCTDG DKNRDVIVAFPDV+P +I+ T G + N
Sbjct: 515 YPTRRYGKPVKGLQDAWWILYRTLYNCTDGKNDKNRDVIVAFPDVEPFVIA-TPGLHVNT 573
Query: 305 ---YGKPVSKEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
Y SK + K +S +Y+HPHLWY T+ VI ALELF+ G+E+S SNT+RYDL+D
Sbjct: 574 RQMYSTVPSKNYIRKDVSSDAYEHPHLWYDTNAVIHALELFLQHGDEVSDSNTFRYDLVD 633
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
LTRQ LAKYAN++FL IIE+Y+ N+ + V L + FL LV D+D LL+ H+GFLLGPWLE
Sbjct: 634 LTRQVLAKYANDVFLKIIESYKSNNMNQVTILCQHFLSLVNDLDTLLSSHEGFLLGPWLE 693
Query: 421 SAKQLAQNEEQEKQ 434
SAK LA+N EQE Q
Sbjct: 694 SAKGLARNSEQEIQ 707
>gi|242035709|ref|XP_002465249.1| hypothetical protein SORBIDRAFT_01g034960 [Sorghum bicolor]
gi|241919103|gb|EER92247.1| hypothetical protein SORBIDRAFT_01g034960 [Sorghum bicolor]
Length = 777
Score = 627 bits (1618), Expect = e-177, Method: Compositional matrix adjust.
Identities = 304/537 (56%), Positives = 386/537 (71%), Gaps = 39/537 (7%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L VLP+FSGNVPA +FPSA IT LG+W +V ++P+WCCTYLLD +D LFI++G+
Sbjct: 276 LGMVPVLPSFSGNVPAVFAKLFPSANITLLGDWNTVDANPKWCCTYLLDPSDSLFIDVGQ 335
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
AFI QQ+KEYG ++IYNCDTF+ENTPP D P YISSLG+AIY M G+ +AVWLMQGW
Sbjct: 336 AFIRQQIKEYGDVTNIYNCDTFNENTPPTDEPAYISSLGSAIYEAMSRGNKNAVWLMQGW 395
Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
LF D FW+ PQMKALL+SVP+GK++VLDLFA+VKPIW S QFYGVPYIWCMLHNF G
Sbjct: 396 LFYSDAAFWKEPQMKALLHSVPIGKMIVLDLFADVKPIWKMSSQFYGVPYIWCMLHNFGG 455
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
NIEMYG+LDSI+ GP++ARTS N+TM+GVGM MEGIE NPVVY+LMSEMAF ++KV+V+
Sbjct: 456 NIEMYGVLDSISSGPIDARTSYNSTMIGVGMCMEGIEHNPVVYELMSEMAFHNKKVEVE- 514
Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
D N+D IV FPD+ PS IS K
Sbjct: 515 ------------------------------------DHNKDYIVEFPDISPSSISSQLSK 538
Query: 302 YQNYGKPVSKEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
+ + SE S S HPHLWYST E I+ALELF+ +G+ S S TYRYDL+D
Sbjct: 539 RRGMSIMRNHRRFFLSEVSGSLPHPHLWYSTKEAIKALELFLDAGSTFSKSLTYRYDLVD 598
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
LTRQ L+K ANE++L+ + +YQ D++G+ +R+FLE++ D+D LLA D FLLGPWLE
Sbjct: 599 LTRQCLSKLANEVYLDAMSSYQKKDSNGLNSHTRKFLEIIMDIDTLLAADDNFLLGPWLE 658
Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
SAK LA E++ +QYEWNARTQ+TMW+DNT+ E S L DY NK+WSGLL+ YY PRA+ Y
Sbjct: 659 SAKSLAITEKERQQYEWNARTQVTMWYDNTETEQSKLHDYANKFWSGLLKSYYLPRASKY 718
Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
F Y+ SL+ F+L++WR++WI +N+WQ+G+ VY V++ GDAL ++ LY KYL
Sbjct: 719 FAYLTRSLQENQSFQLEEWRKDWISYSNEWQSGKEVYAVKATGDALAIARSLYRKYL 775
>gi|4160292|emb|CAA77084.1| alpha-N-acetylglucosaminidase [Nicotiana tabacum]
Length = 811
Score = 626 bits (1615), Expect = e-177, Method: Compositional matrix adjust.
Identities = 303/543 (55%), Positives = 392/543 (72%), Gaps = 13/543 (2%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP+FSGNVPAAL+ +FPSA IT+LG+W +V DPRWCCT+LL +DPLFIEIG AFI +
Sbjct: 268 VLPSFSGNVPAALKKIFPSANITRLGDWNTVNGDPRWCCTFLLAPSDPLFIEIGEAFIRK 327
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWL-MQGWLFSY 126
Q++EYG + IYNCDTF+ENTPP D P YI Q + WL + WLF
Sbjct: 328 QIEEYGDITDIYNCDTFNENTPPTDDPTYIHLSALLCTKQCQKQITMRCWLNARVWLFYS 387
Query: 127 D-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
D +W+ PQM+ALL+SVP GK++VLDLFA+VKPIW +S QFYG PYIWCMLHNF GNIEM
Sbjct: 388 DSKYWKSPQMEALLHSVPRGKMIVLDLFADVKPIWKSSSQFYGTPYIWCMLHNFGGNIEM 447
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG+LD++A GP++ARTSEN+TMVGVGM MEGIE NPVVY+LMSEMAF+ + ++ W+
Sbjct: 448 YGVLDAVASGPIDARTSENSTMVGVGMCMEGIEHNPVVYELMSEMAFREDNFQLQGWLKS 507
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPS-----IISVTEG 300
YS RRYG+ IQ AW++LYHT+YNCTDG D N+D IV FPD DPS IS T+
Sbjct: 508 YSHRRYGKVNDQIQAAWDILYHTIYNCTDGIADHNKDYIVEFPDWDPSGKTGTDISGTDS 567
Query: 301 KYQNYGKPVS-----KEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 355
QN + ++ + + ++SS P LWYST +V +AL+LFI + +LS S TYR
Sbjct: 568 SSQNRMQKLAGFQWNRRFLFFEKSSSLPKPRLWYSTEDVFQALQLFIDALKKLSGSLTYR 627
Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
YDL+DL+RQ+L+K AN+++L+ I A++ DA + Q S +FL L++D+D LLA D FLL
Sbjct: 628 YDLVDLSRQSLSKLANQVYLDAISAFRREDAKPLNQHSPKFLPLLQDIDRLLAADDNFLL 687
Query: 416 GPWLESAKQ-LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 474
G WLE+ Q LA N +++KQYEWNARTQITMWFDNT+ S L DY NK+WSGLL YY
Sbjct: 688 GTWLENCPQNLAMNSDEKKQYEWNARTQITMWFDNTKYNQSQLHDYANKFWSGLLEAYYL 747
Query: 475 PRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYN 534
PRA+IYF+ + +SL+ F+L++WR+EWI +N WQ +YPV++ GDAL + L+
Sbjct: 748 PRASIYFELLSKSLKEKVDFKLEEWRKEWIAYSNKWQESTELYPVKAQGDALAIATALFE 807
Query: 535 KYL 537
KY
Sbjct: 808 KYF 810
>gi|168060822|ref|XP_001782392.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666123|gb|EDQ52786.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 801
Score = 626 bits (1614), Expect = e-176, Method: Compositional matrix adjust.
Identities = 294/530 (55%), Positives = 376/530 (70%), Gaps = 17/530 (3%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+GNVPAA+ +PSA++T+LG W +V D R+CCT+LLD DPLF++IG+AFI Q
Sbjct: 273 VLPAFAGNVPAAITKKYPSARVTRLGEWNTVNGDTRYCCTFLLDPKDPLFVDIGKAFILQ 332
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q+KEYG T HIYNCDTF+EN PP D P YIS+LG+ +Y M + D DA+WLMQ +YD
Sbjct: 333 QIKEYGGTQHIYNCDTFNENQPPTDDPSYISALGSIVYEAMSAADQDAIWLMQ----AYD 388
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
FW+PPQMKALL+SVP+G++VVLDLFA+VKP+WS S FYGVPYIWCMLHNF GN+EMYG
Sbjct: 389 KFWKPPQMKALLHSVPVGRMVVLDLFADVKPMWSRSDHFYGVPYIWCMLHNFGGNVEMYG 448
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
LD +A P++A TS N+TMVGVGM MEGIEQNPVVYDLM+EMAF + V V+ WI +Y+
Sbjct: 449 RLDVVATAPIQAVTSSNSTMVGVGMCMEGIEQNPVVYDLMAEMAFHNATVVVEDWIEEYA 508
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RRYG + AW +L+ ++YNC+DG D N DVIV FPD+DP Q+ G+
Sbjct: 509 RRRYGELTAGARIAWKMLHESIYNCSDGIADHNGDVIVEFPDIDPKRSLFQIRPRQSLGQ 568
Query: 308 PVSKEAVLKSETSSYDHP-HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
+ HP H+WYS + AL+ ++S + L S YRYD++DLTRQ L
Sbjct: 569 QI------------LGHPQHIWYSPQDAAVALQYLLSSADALGLSKPYRYDVVDLTRQVL 616
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
+K AN+L +++ +++ + + +S R LEL+ DMD LL + FLLG WLESAK LA
Sbjct: 617 SKLANQLHSQVLDQFRMFNVEKMDNISSRLLELLSDMDDLLGASEEFLLGTWLESAKDLA 676
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
++E+ K YEWNARTQITMWFDNT ++ S L DY NK WSGL RDYY PRA+IY KY+ +
Sbjct: 677 TSDEERKLYEWNARTQITMWFDNTLDKPSPLHDYANKMWSGLTRDYYLPRASIYIKYLKQ 736
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
SL F ++WRREWI LTN+WQ N+YP + GDAL + LY KY
Sbjct: 737 SLHENTSFAFQEWRREWIALTNEWQVASNLYPTVAKGDALEIATTLYEKY 786
>gi|302786446|ref|XP_002974994.1| hypothetical protein SELMODRAFT_102402 [Selaginella moellendorffii]
gi|300157153|gb|EFJ23779.1| hypothetical protein SELMODRAFT_102402 [Selaginella moellendorffii]
Length = 761
Score = 615 bits (1587), Expect = e-173, Method: Compositional matrix adjust.
Identities = 290/535 (54%), Positives = 380/535 (71%), Gaps = 4/535 (0%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L AVLPAFSGNVP AL+ ++PSA IT+L +W +V +P+WCCTYLL DPLFI+IG+
Sbjct: 224 LGMIAVLPAFSGNVPRALKILYPSANITRLPDWNTVDGNPQWCCTYLLQPMDPLFIQIGK 283
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
AFIEQQ+KEYG T H+YNCDTF+EN PP D P YIS+L A++Y M D A+WLMQGW
Sbjct: 284 AFIEQQVKEYGSTQHVYNCDTFNENLPPTDDPSYISALAASVYGAMIVADKQAIWLMQGW 343
Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
LFS D FW+PPQMKALL++VP GK++VLDLFAEV+PIWS S FYGVPYIWCMLHNF G
Sbjct: 344 LFSSDAQFWKPPQMKALLHAVPFGKMIVLDLFAEVRPIWSKSSHFYGVPYIWCMLHNFGG 403
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
N EMYG LD ++ GPV+A+TS N+TM+GVGM MEGIEQNPVVY+LM+EMAF+ + +K
Sbjct: 404 NHEMYGRLDVVSSGPVDAKTSANSTMIGVGMCMEGIEQNPVVYELMAEMAFRSTRNALKD 463
Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
W+N YS RRYG++VP +AW +L HT+YNC+DG D N DVIV FPD++ S ++ T +
Sbjct: 464 WVNDYSTRRYGKAVPEALEAWQILSHTLYNCSDGLQDHNTDVIVKFPDLNASSLT-TLSR 522
Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
Y +L +S+ HLWY +E AL + + + LS TYRYDL+DL
Sbjct: 523 YLAEEAGTQTRRLLTEGLTSFG--HLWYRPTEAKVALSYMLNASSSLSNVATYRYDLVDL 580
Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
TRQ L K AN++ L + ++ D + + + +++D + LL ++GFLLGPWLES
Sbjct: 581 TRQVLMKLANQIHLQALVSFVKGDLEELTKNCDILIGIIKDSELLLRSNNGFLLGPWLES 640
Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
AK+L N +++ YEWNARTQ+TMWFDNT+ S L DY NK WSGL DYY PRA++Y
Sbjct: 641 AKKLGTNSDEKHLYEWNARTQVTMWFDNTRSLPSALHDYANKMWSGLFEDYYLPRASLYT 700
Query: 482 KYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
K ++++L + F WR WI LTN +QNG YP+E+ GD++ ++ L++KY
Sbjct: 701 KLLVKALHDKEPFPYGSWRSSWILLTNTFQNGTKNYPLEAAGDSIEIAKSLFSKY 755
>gi|302791289|ref|XP_002977411.1| hypothetical protein SELMODRAFT_107285 [Selaginella moellendorffii]
gi|300154781|gb|EFJ21415.1| hypothetical protein SELMODRAFT_107285 [Selaginella moellendorffii]
Length = 761
Score = 613 bits (1581), Expect = e-173, Method: Compositional matrix adjust.
Identities = 288/535 (53%), Positives = 378/535 (70%), Gaps = 4/535 (0%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L AVLPAFSGNVP AL+ ++PSA IT+L +W +V +P+WCCTYLL DPLFI+IG+
Sbjct: 224 LGMIAVLPAFSGNVPRALKILYPSANITRLPDWNTVDGNPQWCCTYLLQPMDPLFIQIGK 283
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
AFIEQQ+KEYG T H+YNCDTF+EN PP D P YIS+L A++Y M D A+WLMQGW
Sbjct: 284 AFIEQQVKEYGSTQHVYNCDTFNENLPPTDDPSYISALAASVYGAMIVADKQAIWLMQGW 343
Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
LFS D FW+PPQMKALL++VP GK++VLDLFAEV+PIWS S FYGVPYIWCMLHNF G
Sbjct: 344 LFSSDAQFWKPPQMKALLHAVPFGKMIVLDLFAEVRPIWSKSSHFYGVPYIWCMLHNFGG 403
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
N EMYG LD ++ GPV+A+TS N+TM+GVGM MEGIEQNPVVY+LM+EMAF+ + +K
Sbjct: 404 NHEMYGRLDVVSSGPVDAKTSANSTMIGVGMCMEGIEQNPVVYELMAEMAFRSTRNALKD 463
Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
W++ YS RRYG++VP +AW +L HT+YNC+DG D N DVIV FPD++ S ++
Sbjct: 464 WVDDYSTRRYGKAVPEALEAWQILSHTLYNCSDGLQDHNTDVIVKFPDLNASSLTTLSRY 523
Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
G ++ + + TS HLWY +E AL + + + LS TYRYDL+DL
Sbjct: 524 LAEEGGTQTRRLLTEGLTS---FGHLWYRPTEAKVALSYMLNASSSLSNVATYRYDLVDL 580
Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
TRQ L K AN++ L + ++ D + + + +++D + LL ++GFLLGPWLES
Sbjct: 581 TRQVLMKLANQIHLQALVSFVKGDLEELTKNCDILIGIIKDSELLLRSNNGFLLGPWLES 640
Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
AK+L N ++ YEWNARTQ+TMWFDNT+ S L DY NK WSGL DYY PRA++Y
Sbjct: 641 AKKLGTNSDETNLYEWNARTQVTMWFDNTRTLPSALHDYANKMWSGLFEDYYLPRASLYT 700
Query: 482 KYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
K ++++L + F WR WI LTN +QNG YP+E+ GD++ ++ L++KY
Sbjct: 701 KLLVKALHDKEPFPYDSWRSSWILLTNTFQNGTKNYPLEAAGDSIEIAKSLFSKY 755
>gi|326521470|dbj|BAK00311.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 428
Score = 520 bits (1340), Expect = e-145, Method: Compositional matrix adjust.
Identities = 248/421 (58%), Positives = 316/421 (75%), Gaps = 1/421 (0%)
Query: 119 MQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLH 177
+QGWLF D FW+ QMKALL+SVP+GK++VLDLFA+VKPIW TS QFYGVPYIWCMLH
Sbjct: 8 VQGWLFYSDAVFWKESQMKALLHSVPIGKMMVLDLFADVKPIWQTSSQFYGVPYIWCMLH 67
Query: 178 NFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 237
NF GNIEMYG+LDSI+ GPV+ARTS N+TMVGVGM MEGIE NPVVY+LMSEMAF+ +KV
Sbjct: 68 NFGGNIEMYGVLDSISSGPVDARTSYNSTMVGVGMCMEGIEHNPVVYELMSEMAFRSQKV 127
Query: 238 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 297
V+ W+ YS RRYG+S IQ AW +LYHT+YNCTDG D N+D IV FPD+ PS S
Sbjct: 128 KVEDWLKTYSHRRYGQSNVEIQKAWGILYHTIYNCTDGIADHNKDYIVEFPDMSPSSFSS 187
Query: 298 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 357
K L ++S PHLWYST E I++LELF+ +GN+LS S TYRYD
Sbjct: 188 QYSKRSISLARKHPRFFLSEVSASLPQPHLWYSTEEAIKSLELFLNAGNDLSKSLTYRYD 247
Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
L+DLTRQ+L+K AN+++ + I +YQ D+ G+ ++ FLEL+ D+D LLA D FLLGP
Sbjct: 248 LVDLTRQSLSKLANKVYHDAISSYQKRDSSGLNFHTKEFLELIVDIDTLLASDDNFLLGP 307
Query: 418 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 477
WLESAK LA E++ KQYEWNARTQ+TMW+D+T+ E S L DY NK+WSGLL+ YY PRA
Sbjct: 308 WLESAKSLAMTEDERKQYEWNARTQVTMWYDDTKTEQSKLHDYANKFWSGLLKSYYLPRA 367
Query: 478 AIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
+ YF + SL+ F+L++WRR+WI +N+WQ+G+ +YPV++ GD+L S+ L+ KY
Sbjct: 368 SKYFSRLSRSLQENRSFQLEEWRRDWISYSNEWQSGKELYPVKAIGDSLAISRSLFTKYF 427
Query: 538 Q 538
+
Sbjct: 428 R 428
>gi|449518399|ref|XP_004166229.1| PREDICTED: alpha-N-acetylglucosaminidase-like, partial [Cucumis
sativus]
Length = 336
Score = 511 bits (1315), Expect = e-142, Method: Compositional matrix adjust.
Identities = 247/331 (74%), Positives = 281/331 (84%), Gaps = 2/331 (0%)
Query: 208 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 267
VGVGMSMEGIEQNPVVYDLMSEMAFQH KVDVK W+ QYSVRRYG VP+IQDAW+VLYH
Sbjct: 1 VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH 60
Query: 268 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 327
TVYNCTDGA DKNRDVIVAFPDVDPS I V +G S +V + + +++D PHL
Sbjct: 61 TVYNCTDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDS--SVDRLQDATFDRPHL 118
Query: 328 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 387
WY TSEVI AL+LFIA G++LS+SNTYRYDL+DLTRQALAKY+NELF I++AYQL+D
Sbjct: 119 WYPTSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQ 178
Query: 388 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 447
+ LS+ FLELV D+D LLACH+GFLLGPWL+SAKQLA++EE+EKQYEWNARTQITMWF
Sbjct: 179 TMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWF 238
Query: 448 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 507
DNT+EEASLLRDYGNKYWSGLL DYY PRAAIY K++ ES E+G F L +WRREWIKLT
Sbjct: 239 DNTEEEASLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLT 298
Query: 508 NDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
NDWQ+ R +YPVESNGDAL TS WLYNKYLQ
Sbjct: 299 NDWQSSRKIYPVESNGDALDTSHWLYNKYLQ 329
>gi|414585094|tpg|DAA35665.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
Length = 1202
Score = 501 bits (1291), Expect = e-139, Method: Compositional matrix adjust.
Identities = 248/431 (57%), Positives = 312/431 (72%), Gaps = 25/431 (5%)
Query: 112 DSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPY 171
DS+ W L + DP + V +GK+ + + E + + Y
Sbjct: 308 DSNPRWCCTYLLDASDPLF-----------VEIGKMFIEEQIRE----YGRTSHIYN--- 349
Query: 172 IWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMA 231
WCMLHNFA + EMYG+LD++A GP++AR S+N+TMVGVGMSMEGIEQNP+VYDLMSEMA
Sbjct: 350 -WCMLHNFAADFEMYGVLDALASGPIDARLSDNSTMVGVGMSMEGIEQNPIVYDLMSEMA 408
Query: 232 FQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVD 291
F H +VD++ W+ Y RRYG+ V +QDAW +LY T+YNCTDG DKNRDVIVAFPDV+
Sbjct: 409 FHHRQVDLQVWVKTYPTRRYGKPVKGLQDAWWILYRTLYNCTDGKNDKNRDVIVAFPDVE 468
Query: 292 PSIISVTEGKYQN----YGKPVSKEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASGN 346
P +I+ T G + N Y SK + K +S +Y+HPHLWY T+ VI ALELF+ G+
Sbjct: 469 PFVIA-TPGLHVNTRQMYSTVPSKNYIRKDVSSDAYEHPHLWYDTNAVIHALELFLQHGD 527
Query: 347 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGL 406
E+S SNT+RYDL+DLTRQ LAKYAN++FL IIE+Y+ N+ + V L + FL LV D+D L
Sbjct: 528 EVSDSNTFRYDLVDLTRQVLAKYANDVFLKIIESYKSNNMNQVTILCQHFLSLVNDLDTL 587
Query: 407 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 466
L+ H+GFLLGPWLESAK LA+N EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWS
Sbjct: 588 LSSHEGFLLGPWLESAKGLARNSEQEIQYEWNARTQITMWFDNTETKASLLRDYANKYWS 647
Query: 467 GLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL 526
GLL+DYYGPRAAIYFK+++ S+E+ F LK+WRREWI LTN+WQ+ R V+ + GD L
Sbjct: 648 GLLQDYYGPRAAIYFKHLLLSMENNAPFALKEWRREWISLTNNWQSDRKVFSTTATGDPL 707
Query: 527 ITSQWLYNKYL 537
SQ LY KYL
Sbjct: 708 NISQSLYTKYL 718
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 58/75 (77%), Positives = 70/75 (93%)
Query: 6 FAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
F VLPAFSGN+PAAL++ FPSAK+T LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FI
Sbjct: 275 FPVLPAFSGNIPAALKSKFPSAKVTHLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFI 334
Query: 66 EQQLKEYGRTSHIYN 80
E+Q++EYGRTSHIYN
Sbjct: 335 EEQIREYGRTSHIYN 349
>gi|156399499|ref|XP_001638539.1| predicted protein [Nematostella vectensis]
gi|156225660|gb|EDO46476.1| predicted protein [Nematostella vectensis]
Length = 675
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 217/532 (40%), Positives = 320/532 (60%), Gaps = 42/532 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP F+G+VPA L ++P A +++LG+W + S +CCTYLL+ +DPLF +IG AFI++
Sbjct: 179 VLPGFAGHVPAGLLRLYPKANVSKLGDWGNFNST--YCCTYLLEPSDPLFQKIGTAFIKE 236
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q EYG T+HIYN DTF+E P P Y+ + +A+Y GM GD DAVWLMQGWLF +
Sbjct: 237 QTAEYG-TNHIYNADTFNEMRPRSSDPTYLGAASSAVYRGMAGGDPDAVWLMQGWLFVDE 295
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
FW+P Q+KALL+ VP G ++VLDL+AE PIWS ++ FYG P+IWCML NF GNI ++G
Sbjct: 296 GFWKPDQIKALLHGVPQGFMIVLDLWAENSPIWSRTQSFYGTPFIWCMLLNFGGNIGLFG 355
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHE---KVDVKAWIN 244
+ S++ GP +A S N+TM+G G++MEGIEQN ++++LM+EM ++ E VD+ WI
Sbjct: 356 NIKSVSTGPPKAFQSFNSTMIGTGLTMEGIEQNDMMFELMNEMGYRLEPLNPVDLDNWIK 415
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
Y++RRYG + PAI AW +L +VY C D + V P +D
Sbjct: 416 DYALRRYGGTNPAIIQAWRLLIRSVYQCNGYCADHIHSIFVWKPSLD------------- 462
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+ P+LWY +V A + ++ E T+RYDL+D+TRQ
Sbjct: 463 ------------------NKPNLWYDPEDVFNAWDELRSTAAEFMHVETFRYDLVDVTRQ 504
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
AL ++ ++I AY+ A V R LE+ +D+D LL + FLLG WL SAK
Sbjct: 505 ALHLRVIPIYNDLISAYKNRSALNVIHFGSRLLEMFDDLDSLLQTNRNFLLGRWLNSAKA 564
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
L + YE+NAR QIT+W + E DY NK WSGL++ YY PR ++ M
Sbjct: 565 LGTTPAEVALYEFNARNQITLWGPRGEIE-----DYANKMWSGLVKAYYKPRWELFIDEM 619
Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ ++ G+ + ++++ ++ W +G+ YP + +GD+L +++L+NK+
Sbjct: 620 VSAIAQGEELDYEAFKKKLLEQETAWTHGKEEYPDQPSGDSLAAAEFLHNKW 671
>gi|384247107|gb|EIE20595.1| hypothetical protein COCSUDRAFT_37819 [Coccomyxa subellipsoidea
C-169]
Length = 762
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 223/533 (41%), Positives = 317/533 (59%), Gaps = 31/533 (5%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V PAF+G VP AL P+A+I++ NW S + R+CC +LLD +PLF EIG AF++
Sbjct: 236 VFPAFAGFVPGALARERPAARISRSDNWCSFPA--RYCCVHLLDPLEPLFQEIGSAFVKV 293
Query: 68 QLKEYGRTS-HIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
+EYG Y+ DTF+E TPP P Y++S+ +AIY+ M + D A WLMQ WLF Y
Sbjct: 294 LREEYGSDEVGFYSADTFNEMTPPSSDPAYLTSVTSAIYNAMAAADPSARWLMQAWLF-Y 352
Query: 127 D--PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
D FW+PPQ++AL++ VP L++LDL+AEV P+W ++K F+G P+I+CMLHNF GNIE
Sbjct: 353 DNQKFWQPPQIQALVSGVPRDALIMLDLYAEVFPLWKSTKSFFGAPFIYCMLHNFGGNIE 412
Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
MYG L+++A GP E + ++G+GM EGIEQNPVVY+LMSE AF+ + V+V+ WI
Sbjct: 413 MYGALEAVARGPAEGQIDGVAGLIGIGMCPEGIEQNPVVYELMSEWAFRRQPVEVEGWIE 472
Query: 245 QYSVRRYGRSVPAIQ-DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
Y+ RRYG S P AW++L +VYN TDG TD +RD+ + P + P+ + + K
Sbjct: 473 AYARRRYGNSTPPTALVAWDLLLRSVYNATDGHTDHSRDIPTSRPGLSPAEVGLWGLK-- 530
Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 363
PHLWY+ +V+ A L + S EL YRYDL+D+ R
Sbjct: 531 ---------------------PHLWYNEQQVVDAWGLLLRSAGELQQVEGYRYDLVDVGR 569
Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
Q ++K A +++ + EAY + V + R L+L++D++ LLA + GFLLGP LE A
Sbjct: 570 QVISKRATDIWKAVAEAYVDGRSIVVRREGARLLQLLDDLEELLATNRGFLLGPKLEEAS 629
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
E + + YEWN R Q+T+W + S + DY N+ W+GL+ YY PR A++
Sbjct: 630 SAGHTEAEARLYEWNLRKQLTVW-GTSDTGGSEIEDYANREWAGLISSYYKPRWALWLLR 688
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ L G + + WR E + T W R+ P+ GD SQ LY Y
Sbjct: 689 LETDLAQGRRYDPEAWRMECLNFTLGWAYLRDQLPLHPQGDTGGVSQRLYEVY 741
>gi|255553488|ref|XP_002517785.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
gi|223543057|gb|EEF44592.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
Length = 360
Score = 404 bits (1037), Expect = e-110, Method: Compositional matrix adjust.
Identities = 190/359 (52%), Positives = 260/359 (72%), Gaps = 6/359 (1%)
Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
MYGILDSI+ GP+EAR SEN+TMVGVGM MEGIE NPVVY+LMSEMAF+ EKV V W+
Sbjct: 1 MYGILDSISTGPIEARVSENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSEKVQVLEWLK 60
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
YS RRYG++V ++ AW +LYHT+YNCTDG D N D IV FPD DPS+ S ++ Q+
Sbjct: 61 TYSRRRYGKAVHQVEAAWEILYHTIYNCTDGIADHNTDFIVKFPDWDPSVQSGSDTSQQD 120
Query: 305 -----YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
+ S+ + + S+ H+WYS +VI AL+LFI G+ L+ S TYRYDL+
Sbjct: 121 NKHIFLHRSGSRRFLFEGPNSTLPQAHIWYSIQKVINALQLFIDGGSHLTGSLTYRYDLV 180
Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
DLTRQ L+K AN+++++ I A++ NDA + S++F++L++D+D LLA D FL+G WL
Sbjct: 181 DLTRQVLSKLANQVYVDAIIAFRSNDARALNLHSQKFIQLIKDIDVLLASDDNFLIGTWL 240
Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
ESAK+LA N + +QYEWNARTQ+TMW+D T+ S L DY NK+WSGLL DYY PRA+
Sbjct: 241 ESAKELALNPSEMRQYEWNARTQVTMWYDTTKTNQSKLHDYANKFWSGLLEDYYLPRAST 300
Query: 480 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNG-DALITSQWLYNKYL 537
YF ++++SL+ + F+L++WR +WI +N+WQ G +YP++ +G DAL S+ LY+KY
Sbjct: 301 YFDHLVKSLKQNEKFKLQEWREKWIAFSNEWQAGTKLYPMKGSGDDALAISKALYDKYF 359
>gi|14861378|gb|AAK73654.1| lysosomal alpha-N-acetyl glucosaminidase [Dromaius novaehollandiae]
Length = 753
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 215/541 (39%), Positives = 316/541 (58%), Gaps = 51/541 (9%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L VLPAF+G+VP + FP T+LG W D + CTYLLD DP+F IG
Sbjct: 244 LGMITVLPAFAGHVPQGVLRAFPRVNATRLGGWSHF--DCTYSCTYLLDPEDPMFQVIGT 301
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F+++ +KE+G T HIY+ DTF+E P P Y+S + +A++ M D AVWLMQGW
Sbjct: 302 LFLKELIKEFG-TDHIYSADTFNEMNPLSSDPAYLSRVSSAVFRSMTGADPKAVWLMQGW 360
Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
LF + P FW+P Q++ALL+ VPLG+++VLDLFAE +P++ ++ FYG P+IWCMLHNF G
Sbjct: 361 LFQHQPDFWQPAQVRALLHGVPLGRMIVLDLFAESRPVYQWTESFYGQPFIWCMLHNFGG 420
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
N ++G +++I GP AR N+TMVG G+ EGIEQN +VY+LM+E+ ++ E +D+ +
Sbjct: 421 NHGLFGTVEAINHGPFAARRFPNSTMVGTGLVPEGIEQNDMVYELMNELGWRQEPLDLPS 480
Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
W+ +Y+ RRYG A AW +L +VYNCT + NR +V P
Sbjct: 481 WVARYAERRYGAPNAAAASAWQLLLRSVYNCTGVCVNHNRSPLVRRPS------------ 528
Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
L+ +T +WY+ S+V A L +++G EL +S T+ YDL D+
Sbjct: 529 -------------LRMDT------EVWYNKSDVYEAWRLLLSAGAELGSSPTFGYDLADV 569
Query: 362 TRQALAKYANELFLNIIEAYQ------LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
TRQA + +E +L+I +A+Q L A GV +L+ ++DGLL+ H FLL
Sbjct: 570 TRQAAQQLVSEYYLSIRQAFQSRSLPELLTAGGVL-----VYDLLPELDGLLSSHRLFLL 624
Query: 416 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
G WLESA+ +A ++ + +QYE NAR Q+T+W N + DY NK GL+ DYYG
Sbjct: 625 GRWLESARAVATSDREAEQYELNARNQVTLWGPNGN-----ILDYANKQLGGLVLDYYGV 679
Query: 476 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 535
R +++ ++ESL SG F + + ++ + + YP GD L S+ ++ K
Sbjct: 680 RWSLFVSALVESLNSGSPFHQDQFNQAVFQVERGFIYNKKRYPTAPVGDTLEISKKIFLK 739
Query: 536 Y 536
Y
Sbjct: 740 Y 740
>gi|14861380|gb|AAK73655.1| lysosomal alpha-N-acetyl glucosaminidase [Dromaius novaehollandiae]
Length = 753
Score = 393 bits (1010), Expect = e-106, Method: Compositional matrix adjust.
Identities = 215/541 (39%), Positives = 316/541 (58%), Gaps = 51/541 (9%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L VLPAF+G+VP + FP T+LG W D + CTYLLD DP+F IG
Sbjct: 244 LGMITVLPAFAGHVPQGVLRAFPRVNATRLGGWSHF--DCTYSCTYLLDPEDPMFQVIGT 301
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F+++ +KE+G T HIY+ DTF+E P P Y+S + +A++ M D AVWLMQGW
Sbjct: 302 LFLKELIKEFG-TDHIYSADTFNEMNPLSSDPAYLSRVSSAVFRSMTGADPKAVWLMQGW 360
Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
LF + P FW+P Q++ALL+ VPLG+++VLDLFAE +P++ ++ FYG P+IWCMLHNF G
Sbjct: 361 LFQHQPDFWQPAQVRALLHGVPLGRMIVLDLFAESRPVYQWTESFYGQPFIWCMLHNFGG 420
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
N ++G +++I GP AR N+TMVG G+ EGIEQN +VY+LM+E+ ++ E +D+ +
Sbjct: 421 NHGLFGTVEAINHGPFAARRFPNSTMVGTGLVPEGIEQNDMVYELMNELGWRQEPLDLPS 480
Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
W+ +Y+ RRYG A AW +L +VYNCT + NR +V P
Sbjct: 481 WVARYAERRYGAPNAAAASAWXLLLRSVYNCTGVCVNHNRSPLVRRPS------------ 528
Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
L+ +T +WY+ S+V A L +++G EL +S T+ YDL D+
Sbjct: 529 -------------LRMDT------EVWYNKSDVYEAWRLLLSAGAELGSSPTFGYDLADV 569
Query: 362 TRQALAKYANELFLNIIEAYQ------LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
TRQA + +E +L+I +A+Q L A GV +L+ ++DGLL+ H FLL
Sbjct: 570 TRQAAQQLVSEYYLSIRQAFQSRSLPELLTAGGVL-----VYDLLPELDGLLSSHRLFLL 624
Query: 416 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
G WLESA+ +A ++ + +QYE NAR Q+T+W N + DY NK GL+ DYYG
Sbjct: 625 GRWLESARAVATSDREAEQYELNARNQVTLWGPNGN-----ILDYANKQLGGLVLDYYGV 679
Query: 476 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 535
R +++ ++ESL SG F + + ++ + + YP GD L S+ ++ K
Sbjct: 680 RWSLFVSALVESLNSGSPFHQDQFNQAVFQVERGFIYNKKRYPTAPVGDTLEISKKIFLK 739
Query: 536 Y 536
Y
Sbjct: 740 Y 740
>gi|390348210|ref|XP_785272.3| PREDICTED: alpha-N-acetylglucosaminidase [Strongylocentrotus
purpuratus]
Length = 793
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 202/537 (37%), Positives = 300/537 (55%), Gaps = 41/537 (7%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L VLPAF+G+VP + VFP+A I+ LG+W + P +CCT LLD DP+F ++G+
Sbjct: 252 LGMIPVLPAFAGHVPXSFSKVFPNASISNLGDW--GRFGPEYCCTSLLDPQDPMFKQVGK 309
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
AFI+ +E+ T HIY+ DTF+EN P Y+S+ +Y G+ GD VWLM GW
Sbjct: 310 AFIDAMSEEFNGTDHIYSADTFNENKPKSRDSAYLSAASKGVYQGIIEGDPKGVWLMMGW 369
Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
LF FW P Q+KALL VP+G+++VLDL+AE +P + T+ FYG P+IWCMLHNF GN
Sbjct: 370 LFQDTGFWGPTQIKALLQGVPIGRMIVLDLYAEARPFYKTTYSFYGQPFIWCMLHNFGGN 429
Query: 183 IEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
+YG LD++ GP EAR +N+TM+G+G + EGI QN V+Y+ +++M ++ +V W
Sbjct: 430 TGLYGKLDAVNQGPFEARNYDNSTMIGMGTTPEGIFQNYVMYNFLTDMTWRSGSTNVSKW 489
Query: 243 INQYSVRRYG---RSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
I QY+ RRY +AW +L TVYN T D
Sbjct: 490 IEQYAGRRYSNDPNKSEEATEAWVILKETVYNNTGTLQD--------------------- 528
Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
Y PV + + + + +WY ++V +A E + + +L S +RYDL+
Sbjct: 529 ---HQYAVPVRRPSNIMTSP-------VWYDYTKVAKAWEFLLEASTKLGTSPVFRYDLV 578
Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
D+TR L A + ++ ++++ +A V L+ DMD + + H+ +LLG WL
Sbjct: 579 DVTRNVLQDLAFDFQQKLMVSFRIRNAGAVGGNGTLLCNLILDMDNITSSHEDWLLGTWL 638
Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
E AK LA N ++E YE+NA+ QIT+W +EE + DY NK W GLLR YY R +
Sbjct: 639 EDAKSLATNNDEESLYEYNAKNQITIW--GPKEE---ILDYANKQWGGLLRTYYHRRWQL 693
Query: 480 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
Y +Y+ E ++S + + ++W + + +P E GD + S+ LY KY
Sbjct: 694 YVQYLEECIQSHQPYDQNTFNVRSFVAESEWTHSKEKFPTEPVGDTMAISKALYVKY 750
>gi|348533253|ref|XP_003454120.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Oreochromis
niloticus]
Length = 845
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 197/533 (36%), Positives = 319/533 (59%), Gaps = 44/533 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSGN+P + ++P A++T+LG W + C+ +LD DPLF IG ++ Q
Sbjct: 345 VLPAFSGNIPKGILRLYPEARVTRLGPWSHFNCS--YSCSLVLDPQDPLFHHIGSLYLSQ 402
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
LK++G T HIY+ DTF+E TPP P Y+S++ ++++ M + D AVWLMQGWLF D
Sbjct: 403 VLKQFG-TDHIYSTDTFNEMTPPSSDPAYLSAVSRSVFASMTAVDPQAVWLMQGWLFFSD 461
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW+P Q++ALL+ VPLG+++VLDLFAE +PI+S ++ FYG P+IWCML NF GN ++
Sbjct: 462 AAFWKPAQIQALLHGVPLGRMIVLDLFAETEPIFSYTESFYGQPFIWCMLQNFGGNSGLF 521
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G ++SI GP +A N+T+VG+GM+ EGIEQNPV Y+LMSE+A++ E V++ W++ Y
Sbjct: 522 GTVESINSGPFKALHFPNSTLVGIGMTPEGIEQNPVTYELMSELAWRKEPVNLAKWVSLY 581
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
++RRYG + ++ AW +L+ ++YNCTD Y+N+
Sbjct: 582 AIRRYGNTQESLTTAWRLLFASIYNCTDP-------------------------HYRNHN 616
Query: 307 -KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
P+ + + T LWY +++ +A +L + + L + T+RYDL+D+TR+
Sbjct: 617 HSPLVRRPSFQMNTG------LWYDPADLYKAWKLIMDAAPSLMSKETFRYDLVDVTREV 670
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQ 424
L + +I +A++ + + + +L+ +++ LL+ + FLLG WLE A+
Sbjct: 671 LQVLTTSFYRDIADAFKKQNLSELLTAGGVLVYDLLPELNRLLSSNRNFLLGAWLERARS 730
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
LA ++++ + Y+ NAR QIT+W + + + DY +K W GL+ DYY R ++ + +
Sbjct: 731 LAVDDKEAQLYDMNARNQITLWGPSGE-----ILDYASKEWGGLMEDYYAQRWGLFVQTL 785
Query: 485 IESLESGDGFRLKDWRREWIKLTNDW-QNGRNVYPVESNGDALITSQWLYNKY 536
+E L SG F+ + + ++ + NGR YP + GD + ++ KY
Sbjct: 786 VECLNSGQPFKQAAFNQAVFQIEKGFIYNGRK-YPTKPQGDTYEIAYRIFLKY 837
>gi|410930376|ref|XP_003978574.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Takifugu rubripes]
Length = 751
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 202/531 (38%), Positives = 319/531 (60%), Gaps = 40/531 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSGN+P + +FP A++T+L W K + + C+Y+LD DPLF IG ++ Q
Sbjct: 251 VLPAFSGNIPKGILRLFPEARVTRLEPW--SKFNCSFSCSYILDPRDPLFSRIGSLYLSQ 308
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+K++G T+HIYN DTF+E TPP P Y+S++ A+++ M + D AVWLMQGWLF D
Sbjct: 309 VVKQFG-TNHIYNTDTFNEMTPPSSEPTYLSAVSRAVFASMTAVDPQAVWLMQGWLFLSD 367
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW+P Q++ALLN VP+G+++VLDLFAE +P++S ++ FYG P+IWCMLHNF GN +
Sbjct: 368 ALFWKPAQIQALLNGVPVGRMIVLDLFAETEPVFSYTESFYGQPFIWCMLHNFGGNGGFF 427
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G ++SI GP +A N+++VG+GM+ EGIEQNPVVY+LMSE+A++ E V++ W++ Y
Sbjct: 428 GTVESINTGPFKALHFPNSSLVGIGMTPEGIEQNPVVYELMSELAWRKEPVNLLKWVSLY 487
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
RRYG ++ AW +L+ +VYNCT P Y+N+
Sbjct: 488 VTRRYGSMHESVSAAWKILFASVYNCT-------------LP------------HYRNH- 521
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
+ L S + + LWY +++ RA +L + + + T++YDL+D+TRQ +
Sbjct: 522 ----NHSPLVRRPSFHMNSELWYDPADLYRAWKLILEAAPSFMSKETFQYDLVDVTRQVM 577
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQL 425
+ +I++A+Q + + L +L+ +++ LL+ + FLLG WLE A+ L
Sbjct: 578 QVLTTSYYQDIVDAFQKHKMQELLTAGGVLLYDLLPELNRLLSSNHNFLLGTWLEQARSL 637
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
A +E + K Y+ NAR Q+T+W + + + DY NK W GL++DYY R ++ ++
Sbjct: 638 ALDEREAKLYDINARNQLTLWGPSGE-----ILDYANKQWGGLMQDYYAQRWGLFIHTLV 692
Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
E L+SG F+ ++ + ++ + R YP + GD + ++ KY
Sbjct: 693 ECLDSGQPFKQDNFNKVVFQVEKGFIYNRRQYPTKPQGDTFEIAHRIFLKY 743
>gi|375144105|ref|YP_005006546.1| alpha-N-acetylglucosaminidase [Niastella koreensis GR20-10]
gi|361058151|gb|AEV97142.1| Alpha-N-acetylglucosaminidase [Niastella koreensis GR20-10]
Length = 735
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 209/531 (39%), Positives = 310/531 (58%), Gaps = 40/531 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A + +P+AK+ + NW + +D TY+LD+ DPLF E+G+ F+++
Sbjct: 238 VLPAFTGHVPPAFKKKYPNAKL-KATNWTNGFAD-----TYILDSQDPLFAEMGKRFLQK 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q +G T H+Y+ DTF+EN PP D P ++S+L A IY GM+ D+ A W+MQGWLF D
Sbjct: 292 QTSLFG-TDHLYSADTFNENEPPSDDPAFLSALSARIYEGMKQADTAATWVMQGWLFYSD 350
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW+ PQ++ALL +VP K+++LDL AE++P+W + FYG P+IW MLHNF GN+ ++
Sbjct: 351 RKFWKAPQIEALLKAVPDNKMILLDLAAEIEPVWKRTDAFYGKPWIWNMLHNFGGNVNLF 410
Query: 187 GILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
G +D +A P E + + + G+G++ME IEQNPV+Y+LM+ +Q VD+ AWI Q
Sbjct: 411 GRMDGVATQPAETLNDKASGKLWGIGLTMEAIEQNPVMYELMTRHTWQTTPVDLDAWIPQ 470
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
Y + RY + + DAW +L TVYN GA + D SII+
Sbjct: 471 YVLNRYRTNNTNLVDAWQILRKTVYN---GAVIR---------DGAESIIT--------- 509
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
G+P + + T PH E++ A +LF+ + + S+ ++YDL+D+TRQ
Sbjct: 510 GRPTFDSTTVWTRTKLNYAPH------ELLPAWDLFVQAAGKGVNSDGFQYDLVDVTRQV 563
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
LA YA L + A+ D+ + S+ FL+L+ DMD LLA F+LGPWL +A+
Sbjct: 564 LANYAAPLQKKWVTAFNAKDSAAFNKYSKAFLQLISDMDLLLASRKDFMLGPWLSAARSN 623
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
++ YE NAR IT+W D S L +Y N+ WSGLL D+Y PR +F +
Sbjct: 624 GTTPAEKALYEQNARDLITLWGDAN----SPLHEYSNRQWSGLLNDFYKPRWQQFFTLLQ 679
Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+SL +G LK + W N + YPV +G+++ +Q LY KY
Sbjct: 680 QSLRTGSTPDLKQFEENIRSWEWKWVNTQKAYPVVPSGNSVQVAQMLYKKY 730
>gi|373953359|ref|ZP_09613319.1| alpha-N-acetylglucosaminidase [Mucilaginibacter paludis DSM 18603]
gi|373889959|gb|EHQ25856.1| alpha-N-acetylglucosaminidase [Mucilaginibacter paludis DSM 18603]
Length = 733
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 205/533 (38%), Positives = 304/533 (57%), Gaps = 41/533 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPAA +N +P+AK+ + NW + +D TY+LD+ DP+F IG+ F+++
Sbjct: 232 VLPAFTGHVPAAFKNKYPTAKL-KTTNWKNGFAD-----TYILDSADPMFARIGQLFLQK 285
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q G T H+Y+ DTF+EN PP D PEY+ L +Y GM D+ AVW+MQGWLF D
Sbjct: 286 QTALLG-TDHLYSADTFNENEPPSDEPEYLGKLSERVYQGMHQADTAAVWVMQGWLFYSD 344
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW+P Q +ALL +VP K+++LDL E++P+W ++ FYG P+IW ML+NF N ++
Sbjct: 345 RKFWKPEQTRALLKAVPDDKMIILDLATEIEPVWKRTEAFYGKPWIWNMLNNFGANTNLF 404
Query: 187 GILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
G +DS A GP EA ++ M G+G++MEGIEQNPV+YDL+++ ++++ ++V W+ +
Sbjct: 405 GRMDSAAKGPAEAYHDPKSGQMKGIGLTMEGIEQNPVLYDLLTDNTWRNQPINVDEWLPK 464
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQ 303
Y + RYG+ Q AWN+L TVY+ D +I A P D S
Sbjct: 465 YVLNRYGKPNAQAQKAWNILRKTVYSVLADRYIRDGAESIIQARPTTDSS---------- 514
Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 363
S + L Y ++ A + I + +LS S+ +R+DL+DL+R
Sbjct: 515 ----------------SRWARTTLNYEPKALLPAWQAMIKASEDLSTSDGFRFDLVDLSR 558
Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
Q LA YA L + A+Q DA + S F+EL++DMD LLA FLLGPW+ A+
Sbjct: 559 QVLANYAFTLQRRFVLAHQQKDAAAFKKHSAEFIELIQDMDQLLATRKDFLLGPWVADAR 618
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
+ ++ YE NA+ IT+W D + L +Y + WSGLL D+Y PR YF+
Sbjct: 619 RCGATVSEKALYEMNAKDLITLWGD----KDCPLNEYACRQWSGLLNDFYKPRWQQYFEQ 674
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ L F + + R+ W N R YPV+ GD ++ ++ LY KY
Sbjct: 675 INLDLTGKKPFDKEAFERKIKSWEWQWVNARKDYPVKPQGDPVLEARKLYKKY 727
>gi|255533666|ref|YP_003094038.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
gi|255346650|gb|ACU05976.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
Length = 735
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 205/532 (38%), Positives = 311/532 (58%), Gaps = 42/532 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF+G+VP A ++ FP AK+ + NW + S Y+LD D LF IG+ FIE+
Sbjct: 236 ILPAFTGHVPPAFKDKFPKAKLKKT-NWTTFPS------VYILDPEDELFTTIGKRFIEE 288
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY- 126
++K +G T H+Y DTF+ENTPP Y+S++ +Y M D +A W+MQGWLF +
Sbjct: 289 EVKTFG-TDHLYTADTFNENTPPTSDSLYLSNVSKKVYQSMALADPEATWIMQGWLFYHG 347
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
+ FW+P Q+KALLN++P K++VLDL++E P+W + +YG P+IW MLHNF GNI +Y
Sbjct: 348 EKFWKPTQIKALLNAIPNDKMIVLDLWSENHPVWQRTAAYYGKPWIWNMLHNFGGNISLY 407
Query: 187 GILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
G +D +A G ++A+ + N+ MVG+G++ E IEQNPV+Y LM + + E ++V AW+
Sbjct: 408 GRMDEVASGAIKAKQAANSGNMVGIGLTPEAIEQNPVMYQLMLDNIWTDEPINVTAWLKN 467
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
YS +RYG + AW +LY TVY T G P SI++
Sbjct: 468 YSRQRYGAQNALAEQAWQILYKTVY--TGG----------ILPGGPESILT--------- 506
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
G+P +E++ P Y +E+I A E + + +LS ++ ++YDL+D+TRQ
Sbjct: 507 GRPTM------AESTRSTRPKKNYKPAELIPAWEALLKASQQLS-TDGFKYDLVDVTRQV 559
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
L YA+ L +AYQ D +LS FL +++D+D LLA FLLG WL AK++
Sbjct: 560 LVNYADTLQRQFAQAYQGKDGKKFDRLSGDFLAVMDDVDYLLATRKDFLLGKWLNEAKRM 619
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
E++K+YE NAR IT+W D + S L +Y + WSGL+ +Y PR +F Y
Sbjct: 620 GTTAEEKKRYERNARNLITLWAD----QNSSLNEYSCRQWSGLISSFYKPRWQQFFSYAK 675
Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
+ L+SG K + + + DW N +V+ + +G+ + T++ LY KY+
Sbjct: 676 QQLKSGAKLDQKVFEEKMKRWEWDWVNKNDVFTEQPSGNEIKTAESLYKKYI 727
>gi|326679829|ref|XP_688608.3| PREDICTED: alpha-N-acetylglucosaminidase-like [Danio rerio]
Length = 757
Score = 377 bits (967), Expect = e-101, Method: Compositional matrix adjust.
Identities = 201/532 (37%), Positives = 307/532 (57%), Gaps = 42/532 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSG VP + +FP A +T+L W + C Y+LD DPLF IG F+ Q
Sbjct: 251 VLPAFSGIVPEGITRLFPKANVTKLSPWSHFNCT--YSCAYVLDPRDPLFHRIGALFLTQ 308
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
++E+G T HIYN DTF+E P P Y++S+ AI++ M S D A+WLMQGWLF D
Sbjct: 309 VIEEFG-TDHIYNTDTFNEMPPASSDPTYLASISRAIFNTMTSVDPQAIWLMQGWLFISD 367
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW+ Q+KALL+ VPLG+++VLDLFAE P++S++ FYG P+IWCMLHNF GN ++
Sbjct: 368 PSFWKADQVKALLHGVPLGRMIVLDLFAESMPVYSSTNSFYGQPFIWCMLHNFGGNSGLF 427
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G +DSI GP A N+T+VG+GM+ EGIEQNPV+Y+LMSE+A++ + V++ W++ Y
Sbjct: 428 GTVDSINSGPFNAVRFPNSTLVGLGMTPEGIEQNPVIYELMSELAWRKDPVNLYKWVSLY 487
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
++RRYG + AW +L+ +VYNCT P KY+N+
Sbjct: 488 ALRRYGSMDENLALAWQLLFRSVYNCT-------------LP------------KYKNHN 522
Query: 307 K-PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
+ P+ L +T +WY ++ RA +L + L T+RYDL+D+TRQA
Sbjct: 523 RSPLVHRPSLHMQTD------IWYDPADFYRAWKLLFEAAPGLVTLETFRYDLVDVTRQA 576
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQ 424
L E + +I A+Q + + +L+ ++D +L+ ++ FLLG WL+ A+
Sbjct: 577 LQLLTTEFYKDIKSAFQTQKLSDLLTAGGVLVYDLLPELDRILSSNEHFLLGAWLQQAQS 636
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
+E + Y+ NAR QIT+W + + + DY +K W+GL+ DYY R ++ +
Sbjct: 637 QGVDEHEAHLYDINARNQITLWGPDGE-----ILDYASKEWAGLVEDYYLQRWGLFVNTL 691
Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+E L+ G F+ + + ++ + + YP + GD ++ ++ KY
Sbjct: 692 VECLDRGRPFKQDVFNQAVFQVEKGFVFNQRKYPTKPLGDTYDIARRIFLKY 743
>gi|350407422|ref|XP_003488083.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bombus impatiens]
Length = 770
Score = 377 bits (967), Expect = e-101, Method: Compositional matrix adjust.
Identities = 207/544 (38%), Positives = 302/544 (55%), Gaps = 42/544 (7%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L VLPAF+G+VP A +FP A +T+ W S ++CC YLL+ TDPLF +IG
Sbjct: 250 LGIIPVLPAFTGHVPRAFPRLFPEANVTKSATWNSFSD--KYCCPYLLEPTDPLFHKIGD 307
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F+ +KE+G T HIYNCDTF+EN PP +++ ++G +I+ M S D A+WLMQGW
Sbjct: 308 QFLRTYIKEFG-TDHIYNCDTFNENEPPTSELKFLRNVGHSIFQTMLSVDPQAIWLMQGW 366
Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
LF +D FW P++KA L SVPLG+L+VLDL +E P++ K +YG P+IWCMLHNF G
Sbjct: 367 LFVHDAVFWTEPRIKAFLTSVPLGRLIVLDLQSEQFPLYGKLKSYYGQPFIWCMLHNFGG 426
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
+ M+G I E R E +TM+G G++ EGI QN V+Y+LM+EMA++ E V++
Sbjct: 427 TLGMFGSAQIINRRVFEGRNMEGSTMIGTGLTPEGINQNYVIYELMNEMAYRQEPVNLDN 486
Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
W Y+ RRYG AW L TVYN IS GK
Sbjct: 487 WFEDYASRRYGAWNEYAVAAWKNLGSTVYNFRG--------------------ISKIRGK 526
Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
Y +P A L WY + +F+ + + S YR+D++D+
Sbjct: 527 YVITRRPSLNLARLT-----------WYDPEKFYSTWYIFLQARHGRKNSTLYRHDVVDI 575
Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
TRQAL A++++ ++E++ D + R LEL +D++ +LA + FLLG WLE
Sbjct: 576 TRQALQLKADKIYSVLVESFNQKDVTTFKLQAGRLLELFDDLEAILASSEDFLLGTWLEM 635
Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
AK LA ++ + K YE+NAR QIT+W + +RDY NK WSG++ DY+ PR AI+
Sbjct: 636 AKNLATDDAESKLYEYNARNQITLWGPRGE-----IRDYANKQWSGIVSDYFKPRWAIFL 690
Query: 482 KYMIESLESGDGFRLKDW-RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGT 540
+ SL G + R + ++ + R +YP + GD + + + +K+ Q +
Sbjct: 691 DGLTTSLTKGTSLNITRINERIFKEVEKPFTLSRKIYPTNATGDCIDIAMRILSKWYQPS 750
Query: 541 GVFD 544
+FD
Sbjct: 751 -IFD 753
>gi|73965663|ref|XP_548088.2| PREDICTED: alpha-N-acetylglucosaminidase [Canis lupus familiaris]
Length = 747
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 207/533 (38%), Positives = 312/533 (58%), Gaps = 43/533 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSG+VP AL VFP ITQLG+W + C++LL DPLF IG F+ +
Sbjct: 242 VLPAFSGHVPKALTRVFPQINITQLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRE 299
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
++E+G T+HIY DTF+E PP P Y++S A++Y M + DSDAVWL+QGWLF +
Sbjct: 300 LIQEFG-TNHIYGADTFNEMQPPSSEPSYLASATASVYQAMITVDSDAVWLLQGWLFQHQ 358
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+KA+L +VP G+L+VLDLFAE +P++ + F G P+IWCMLHNF GN ++
Sbjct: 359 PQFWGPAQVKAVLEAVPRGRLLVLDLFAESQPVYIQTASFQGQPFIWCMLHNFGGNHGLF 418
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TM+G GM+ EGI QN VVY LM+E+ ++ + V D++AW++
Sbjct: 419 GALEAVNRGPAAARLFPNSTMLGTGMAPEGIGQNEVVYALMAELGWRKDPVADLEAWVSS 478
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ RRYG + + AW +L +VYNC+ + + NR +V PS+ VT
Sbjct: 479 FAARRYGVAHRDTEAAWRLLLRSVYNCSGEACSGHNRSPLVR----RPSLQMVTT----- 529
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+WY+ S+V A L + + L++S T+RYDL+D+TRQ
Sbjct: 530 ----------------------VWYNRSDVFEAWRLLLTAAPTLASSPTFRYDLLDVTRQ 567
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
A + + ++ AY + + + + + EL+ +D +LA FLLG WLE A+
Sbjct: 568 AAQELVSLYYVEARSAYLRKELVPLLRAAGVLVYELLPALDKVLASDSRFLLGRWLEQAR 627
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
A +E + YE N+R Q+T+W E ++L DY NK +GL+ DYY PR ++ +
Sbjct: 628 AAAVSEAEAHLYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYTPRWRLFMEM 682
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ESL G F+ + + +L + G YP + +GD + ++ L+ KY
Sbjct: 683 LVESLVQGIPFQQHQFDKNAFQLEQTFIFGTQRYPSQPDGDTVDLAKKLFIKY 735
>gi|443691318|gb|ELT93213.1| hypothetical protein CAPTEDRAFT_144379, partial [Capitella teleta]
Length = 718
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 203/512 (39%), Positives = 302/512 (58%), Gaps = 42/512 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
LPAF+G+VPA + +FP K+++LG+W S +CCT LLD DPLF EIG+AFI++
Sbjct: 225 ALPAFAGHVPANITRLFPRVKVSKLGDWGRFNST--YCCTTLLDVEDPLFKEIGKAFIDE 282
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+E+G T H+YN DTF+E TP P Y++ G A+YSGM S DS A+WLMQGWLF D
Sbjct: 283 YTREFG-TDHVYNTDTFNEMTPASSDPSYLTKAGQAVYSGMVSSDSKAIWLMQGWLFLSD 341
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
FW+PPQ KALL SVP GK++VLDL++EV P + + +YG P+IWCMLHNF G + MYG
Sbjct: 342 -FWKPPQAKALLTSVPQGKMLVLDLYSEVNPQYPRLQSYYGQPFIWCMLHNFGGTLPMYG 400
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
++S+ GP E R+ N+TMVG+G++ EGI QN V+Y+ M E +F+ + V++ W ++Y+
Sbjct: 401 AIESVNQGPFEGRSFVNSTMVGIGLTPEGINQNEVMYEFMMENSFRSQPVELTEWFDKYA 460
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RRY + AW + TVYNC+DG N+++ V P
Sbjct: 461 TRRYASRNANARAAWQIFKRTVYNCSDGVKHHNKNIPVCRP------------------- 501
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
S + +WY + + +L IA+ E+ S +RYDL+D++RQAL
Sbjct: 502 ------------SRKNKIDVWYDVEDFFKGWDLMIAASKEVD-SPLFRYDLVDVSRQALQ 548
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
+ + I+ +Y+ + + L L++DMD +LA FLLG W+ A +
Sbjct: 549 VISITYYNQILTSYKQKNLTSLASSGNDLLHLLDDMDTVLATDSHFLLGAWIAGAHRNGV 608
Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
E++ YE+NAR Q+T+W +A++L DY NK W+GL+ DYY R ++ + +S
Sbjct: 609 TPEEKALYEFNARNQVTLW----GPDANIL-DYANKQWAGLVADYYHERWELFIDELKKS 663
Query: 488 LESGDGFRLKDWRRE-WIKLTNDWQNGRNVYP 518
LE+ F K ++++ + K + + NVYP
Sbjct: 664 LENKTSFDEKKFQKDVFEKAESPFTYRTNVYP 695
>gi|340717403|ref|XP_003397173.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bombus terrestris]
Length = 770
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 204/538 (37%), Positives = 296/538 (55%), Gaps = 41/538 (7%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L VLPAF+G+VP A +FP A +T+ W S ++CC YLL+ TDPLF +IG
Sbjct: 250 LGIIPVLPAFTGHVPRAFPRLFPEANVTKSATWNSFSD--KYCCPYLLEPTDPLFHKIGD 307
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F+ +KE+G T HIYNCDTF+EN PP +++ ++G +I+ M S D A+WLMQGW
Sbjct: 308 QFLRTYIKEFG-TDHIYNCDTFNENEPPTSELKFLRNVGHSIFQTMLSVDPQAIWLMQGW 366
Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
LF +D FW P++K L SVPLG+L+VLDL +E P++ K +YG P+IWCMLHNF G
Sbjct: 367 LFVHDALFWTEPRIKTFLTSVPLGRLIVLDLQSEQFPLYGKLKSYYGQPFIWCMLHNFGG 426
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
+ M+G I E R E +TM+G G++ EGI QN V+Y+LM+EMA++ E V++
Sbjct: 427 TLGMFGSAQIINRRVFEGRNMEGSTMIGTGLTPEGINQNYVIYELMNEMAYRQEPVNLDN 486
Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
W Y+ RRYG AW L TVYN IS GK
Sbjct: 487 WFEDYASRRYGAWNEYAVAAWKNLGSTVYNFRG--------------------ISKIRGK 526
Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
Y +P A L WY + +F+ + + S YR+D++D+
Sbjct: 527 YVITRRPSLNLARLT-----------WYDPEKFYSTWYIFLQARHGRQNSTLYRHDVVDI 575
Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
TRQAL A++++ ++E++ D + R LEL +D++ +LA + FLLG WLE
Sbjct: 576 TRQALQLKADKIYSALVESFNQKDVTTFKLQADRLLELFDDLEAILASSEDFLLGTWLEM 635
Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
AK LA ++ + K YE+NAR QIT+W + +RDY NK WSG++ DY+ PR AI+
Sbjct: 636 AKNLATDDAESKLYEYNARNQITLWGPRGE-----IRDYANKQWSGIVSDYFKPRWAIFL 690
Query: 482 KYMIESLESGDGFRLKDW-RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
+ SL G + R + ++ + R +YP GD + + + +K+ Q
Sbjct: 691 DALTTSLTKGTSLNITRINERIFKEVEKPFTLSRKIYPTNVTGDCIDIAMRILSKWHQ 748
>gi|449491231|ref|XP_004174728.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase,
partial [Taeniopygia guttata]
Length = 752
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 207/536 (38%), Positives = 314/536 (58%), Gaps = 51/536 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP + VFP T+LG+W D + C YLLD DP+F IG F+++
Sbjct: 248 VLPAFAGHVPQGILRVFPRVNATRLGHWSHF--DCTYSCIYLLDPEDPMFQVIGTLFLKE 305
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+KE+G T H+Y+ DTF+E TP P Y+S + A++ M D A+WLMQGWLF +
Sbjct: 306 LIKEFG-TDHVYSADTFNEMTPLSSDPAYLSRVSNAVFRSMTGADPKALWLMQGWLFQHQ 364
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW+P Q++ALL+ VPLG+++VLDLFAE KP++ ++ FYG P+IWCMLHNF GN ++
Sbjct: 365 PDFWQPAQVRALLHGVPLGRMIVLDLFAESKPVYQWTESFYGQPFIWCMLHNFGGNHGLF 424
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G +++I GP AR N+TMVG G+ EGIEQN +VY+LM+E+ ++ E +D+ +W+ +Y
Sbjct: 425 GTVEAINHGPFAARRFPNSTMVGTGLVPEGIEQNDMVYELMNELGWRQEPLDLPSWVTRY 484
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ RRYG A AW +L +VYNCT + NR
Sbjct: 485 AERRYGAPNAAAASAWRLLLRSVYNCTGVCVNHNRS------------------------ 520
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
P+ + L+ +T LWY+ S+V A L +++G EL +S + YDL+D+TRQA
Sbjct: 521 -PLVRRPSLRMDT------ELWYNASDVFEAWRLLLSAGAELGSSPAFLYDLVDVTRQAA 573
Query: 367 AKYANELFLNIIEAYQ------LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
+ + +L+I +A+Q L A GV +L+ ++D LL+ H FLLG WL+
Sbjct: 574 QQLVSHYYLSIRQAFQSHALPELLTAGGVL-----VYDLLPELDSLLSSHSLFLLGRWLQ 628
Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
SA+ +A ++++ +QYE NAR Q+T+W + + DY N GL+ DYY R +++
Sbjct: 629 SARAVATSDQEAEQYELNARNQVTLWGPSGN-----ILDYANXQLGGLVLDYYAVRWSLF 683
Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ESL SG F + + + ++ + + YP GD + S+ L+ KY
Sbjct: 684 VSVLVESLNSGRPFHQNQFNQVFFQVERGFIYNKKRYPAVPFGDTMEISRKLFLKY 739
>gi|432926094|ref|XP_004080826.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Oryzias latipes]
Length = 882
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 202/537 (37%), Positives = 314/537 (58%), Gaps = 52/537 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSGNVP + + P A +T+LG W + C+Y+LD DPLF++IG ++ Q
Sbjct: 382 VLPAFSGNVPKGILKLHPEANVTRLGPWAHFNCS--FSCSYVLDPRDPLFLQIGSLYLSQ 439
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+K++G T HIYN DTF+E TPP P Y+S++ ++++ M + D A+WLMQGWLF D
Sbjct: 440 VVKQFG-TDHIYNTDTFNEMTPPSSDPAYLSAISRSVFASMTAVDPKAIWLMQGWLFFSD 498
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW+PPQ++ALL+ VPLG+++VLDLFAE +P++S ++ FYG P+IWCMLHNF GN +
Sbjct: 499 AAFWKPPQIRALLHGVPLGRMIVLDLFAETEPVFSYTESFYGQPFIWCMLHNFGGNNGFF 558
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G ++SI GP +A +N+TMVG+GM+ EGI QNPV+Y+LMSE+A++ E V++ W + Y
Sbjct: 559 GTVESINSGPFKALNFKNSTMVGIGMTPEGIHQNPVIYELMSELAWRKESVNLTKWASLY 618
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ RRYG ++ AW +L+ +VYNCT V + N+
Sbjct: 619 AARRYGSMHESLSAAWKLLFSSVYNCT-----------------------VPHYRNHNH- 654
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
P+ + T LWY ++++ +LF+ + L + T+RYDL+D+TRQ L
Sbjct: 655 SPLVRRPSFNMNTG------LWYDPADLLETWKLFMEAAPSLMSKETFRYDLVDVTRQVL 708
Query: 367 AKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
+ +I +A+ +L + GV +L +++ LL FLLG WLE
Sbjct: 709 QDLTTYFYQDIKDAFHSKKMPELLTSGGVL-----IYDLFPELNRLLNSDRNFLLGTWLE 763
Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
A+ A +E + + Y+ NAR Q+T+W + + + DY NK W GL+ DYY R +++
Sbjct: 764 QAQSFALDEPEARLYDLNARNQLTLWGPSGE-----ILDYANKEWGGLVEDYYAQRWSLF 818
Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDW-QNGRNVYPVESNGDALITSQWLYNKY 536
+ +++ L SG F+ + + ++ + NGR YP + GD + ++ KY
Sbjct: 819 VQTLVDCLNSGLPFKQDAFNQAVFRVEKGFISNGRK-YPTKPQGDTYEIAHRIFLKY 874
>gi|149054264|gb|EDM06081.1| rCG33377, isoform CRA_d [Rattus norvegicus]
Length = 580
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 204/533 (38%), Positives = 306/533 (57%), Gaps = 43/533 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP + QLGNW + C++LL DPLF IG F+ +
Sbjct: 80 VLPAFAGHVPKAITRVFPQVNVIQLGNWGHFNCS--YSCSFLLAPGDPLFPLIGTLFLRE 137
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY DTF+E PP P Y+++ AA+Y M + D DAVWL+QGWLF +
Sbjct: 138 LTKEFG-TDHIYGADTFNEMQPPFSDPSYLAAATAAVYEAMVTVDPDAVWLLQGWLFQHQ 196
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+KA+L +VP G+L+VLDLFAE +P++S + F+G P+IWCMLHNF GN ++
Sbjct: 197 PQFWGPSQIKAVLEAVPRGRLLVLDLFAETQPVYSRTASFHGQPFIWCMLHNFGGNHGLF 256
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+ + GP AR N+TMVG G++ EGI QN VVY LM+E+ ++ + V D+ AW++
Sbjct: 257 GALEDVNQGPQAARLFPNSTMVGTGIAPEGIGQNEVVYALMAELGWRKDPVPDLVAWVSS 316
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ RRYG S P AW +L +VYNC+ + + NR
Sbjct: 317 FASRRYGVSQPDAVAAWRLLLRSVYNCSGEACSGHNRS---------------------- 354
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
P+ K L+ T+ +WY+ S+V A L + + L+AS +RYDL+D+TRQ
Sbjct: 355 ---PLVKRPSLQMSTA------VWYNRSDVFEAWRLLLRAAPNLTASPAFRYDLLDVTRQ 405
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
A+ + + + A+ D + + +L+ +D LLA + FLLG WL+ A+
Sbjct: 406 AVQELVSSCYEEARTAFLNQDLDLLLRAGGLLTYKLLPSLDELLASNSHFLLGTWLDQAR 465
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
++A +E + + YE N+R QIT+W E ++L DY NK +GL+ DYY PR ++
Sbjct: 466 EVAVSESEAQFYEQNSRYQITLW----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGT 520
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ SL G F+ + + L + N + YP++ GD + S+ ++ K+
Sbjct: 521 LAHSLARGIPFQQHQFEKSVFPLEQAFINNKKRYPIQPQGDTVDLSKKIFLKF 573
>gi|440903235|gb|ELR53922.1| Alpha-N-acetylglucosaminidase, partial [Bos grunniens mutus]
Length = 614
Score = 370 bits (950), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 209/537 (38%), Positives = 305/537 (56%), Gaps = 51/537 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP AL VFP +TQ+GNW + C++LL DPLF +G F+ +
Sbjct: 114 VLPAFAGHVPKALTRVFPQVNVTQMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRE 171
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY DTF+E PP P Y+++ A+Y M + D DAVWL+QGWLF +
Sbjct: 172 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYQAMTAVDPDAVWLLQGWLFQHQ 230
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+ A+L +VP G+L+VLDLFAE +P++ + F G P+IWCMLHNF GN ++
Sbjct: 231 PEFWGPAQVAAVLGAVPRGRLLVLDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLF 290
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+S+ GP AR N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+
Sbjct: 291 GALESVNQGPTTARHFPNSTMVGTGMAPEGIGQNEVVYALMAELGWKKDPVADLGAWVTS 350
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
++ RRYG S + AW +L +VYNC S E + N+
Sbjct: 351 FAARRYGVSHGDAEAAWRLLLRSVYNC-----------------------SGEECRGHNH 387
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
P+ + L+ T+ +WY+ S+V A L +A+ + L++S +RYDL+D+TRQA
Sbjct: 388 -SPLVRRPSLQMVTT------VWYNRSDVFEAWRLLLAATSTLASSPAFRYDLVDVTRQA 440
Query: 366 LAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
+ + + + + AY L A G+ EL+ +D +LA FLLG WL
Sbjct: 441 VQELVSLYYEEMRTAYLKKELVPLTRAGGILA-----YELLPALDQVLASDCHFLLGSWL 495
Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
E A+Q A +E + YE N+R Q+T+W E ++L DY NK +GL+ DYY PR +
Sbjct: 496 EQARQAAVSETEAHFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLMADYYAPRWRL 550
Query: 480 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ + ++ESL G F+ + R +L + G YP + GD + + L+ KY
Sbjct: 551 FTETLVESLVQGVPFQQHQFDRNAFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKY 607
>gi|109491871|ref|XP_001081442.1| PREDICTED: alpha-N-acetylglucosaminidase [Rattus norvegicus]
gi|392351622|ref|XP_002727861.2| PREDICTED: alpha-N-acetylglucosaminidase [Rattus norvegicus]
gi|149054262|gb|EDM06079.1| rCG33377, isoform CRA_b [Rattus norvegicus]
Length = 739
Score = 370 bits (949), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 204/533 (38%), Positives = 306/533 (57%), Gaps = 43/533 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP + QLGNW + C++LL DPLF IG F+ +
Sbjct: 239 VLPAFAGHVPKAITRVFPQVNVIQLGNWGHFNCS--YSCSFLLAPGDPLFPLIGTLFLRE 296
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY DTF+E PP P Y+++ AA+Y M + D DAVWL+QGWLF +
Sbjct: 297 LTKEFG-TDHIYGADTFNEMQPPFSDPSYLAAATAAVYEAMVTVDPDAVWLLQGWLFQHQ 355
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+KA+L +VP G+L+VLDLFAE +P++S + F+G P+IWCMLHNF GN ++
Sbjct: 356 PQFWGPSQIKAVLEAVPRGRLLVLDLFAETQPVYSRTASFHGQPFIWCMLHNFGGNHGLF 415
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+ + GP AR N+TMVG G++ EGI QN VVY LM+E+ ++ + V D+ AW++
Sbjct: 416 GALEDVNQGPQAARLFPNSTMVGTGIAPEGIGQNEVVYALMAELGWRKDPVPDLVAWVSS 475
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ RRYG S P AW +L +VYNC+ + + NR
Sbjct: 476 FASRRYGVSQPDAVAAWRLLLRSVYNCSGEACSGHNR----------------------- 512
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
P+ K L+ T+ +WY+ S+V A L + + L+AS +RYDL+D+TRQ
Sbjct: 513 --SPLVKRPSLQMSTA------VWYNRSDVFEAWRLLLRAAPNLTASPAFRYDLLDVTRQ 564
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
A+ + + + A+ D + + +L+ +D LLA + FLLG WL+ A+
Sbjct: 565 AVQELVSSCYEEARTAFLNQDLDLLLRAGGLLTYKLLPSLDELLASNSHFLLGTWLDQAR 624
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
++A +E + + YE N+R QIT+W E ++L DY NK +GL+ DYY PR ++
Sbjct: 625 EVAVSESEAQFYEQNSRYQITLW----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGT 679
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ SL G F+ + + L + N + YP++ GD + S+ ++ K+
Sbjct: 680 LAHSLARGIPFQQHQFEKSVFPLEQAFINNKKRYPIQPQGDTVDLSKKIFLKF 732
>gi|307192254|gb|EFN75548.1| Alpha-N-acetylglucosaminidase [Harpegnathos saltator]
Length = 741
Score = 368 bits (945), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 197/537 (36%), Positives = 299/537 (55%), Gaps = 46/537 (8%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L VLPAF+G+VP A +FP+A +T++ W K + ++CC YLL+ TDPLF IG
Sbjct: 226 LGIVPVLPAFAGHVPRAFARLFPNANMTKIEPW--NKFEDKYCCPYLLEPTDPLFQTIGE 283
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F+ + E+G T HIYNCDTF+EN P Y+S++G +++ M + D A+WLMQGW
Sbjct: 284 KFLRMYINEFG-TDHIYNCDTFNENEPGNSELAYLSNVGRSVFQAMSTVDPQAIWLMQGW 342
Query: 123 LFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
LF +D FW P++++ L SVP G+++VLDL +E P + K +YG P+IWCMLHNF G
Sbjct: 343 LFVHDFIFWTEPRVRSFLTSVPTGRMLVLDLQSEQFPQYGRLKSYYGQPFIWCMLHNFGG 402
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
+ M+G I E R +TMVG G++ EGI QN V+Y+LM+EMA++HE VD+ A
Sbjct: 403 TLGMFGSAQIINQRTFEGRHMNGSTMVGTGLTPEGINQNYVIYELMNEMAYRHEPVDLDA 462
Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
W Y+ RRYG AW L T+YN + VI P ++ S
Sbjct: 463 WFESYATRRYGAWNEYAVAAWKHLGRTIYNFVGIERIRGHYVITRRPSLNIS-------- 514
Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
P +WY+ + +F+ + + YR+D++D+
Sbjct: 515 -----------------------PWVWYNREDFYHTWNVFLKARYGRGNNTLYRHDVVDI 551
Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
TRQAL A+ +++N+++ Y+ + G + L+L +D++ +LA FLLG WL
Sbjct: 552 TRQALQLMADNIYMNVVDCYKRKNITGFQSHAAALLDLFDDIEAILASGSNFLLGTWLAQ 611
Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
AK +A +E++ + YE+NAR QIT+W N + +RDY NK WSG++ DY+ PR A +
Sbjct: 612 AKDMAVDEKERQSYEYNARNQITLWGPNGE-----IRDYANKQWSGVVADYFKPRWAFFL 666
Query: 482 KYMIESLESGDGFRLKDWR-REWIKLTNDWQNGRNVYPVESNGDAL-----ITSQWL 532
K + +SL + + R ++++ + +YPV + GD L I S+WL
Sbjct: 667 KALEKSLVERTRLNMTEINDRMFLEVEQAFTFSTKLYPVGTKGDTLDIAVKIISKWL 723
>gi|358419179|ref|XP_003584151.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bos taurus]
Length = 741
Score = 367 bits (942), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 210/537 (39%), Positives = 305/537 (56%), Gaps = 51/537 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP AL VFP +TQ+GNW + C++LL DPLF +G F+ +
Sbjct: 241 VLPAFAGHVPKALTRVFPQVNVTQMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRE 298
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY DTF+E PP P Y+++ AA+Y M + D DAVWL+QGWLF +
Sbjct: 299 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATAAVYQAMTAVDPDAVWLLQGWLFQHQ 357
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+ A+L +VP G+L+VLDLFAE +P++ + F G P+IWCMLHNF GN ++
Sbjct: 358 PEFWGPAQVAAVLGAVPRGRLLVLDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLF 417
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+S+ GP AR N+TMVG GM+ EGI QN VVY LM+E+ +Q + V D+ AW+
Sbjct: 418 GALESVNQGPTTARHFPNSTMVGTGMAPEGIGQNEVVYALMAELGWQKDPVADLGAWVTS 477
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
++ RRYG S + AW +L +VYNC S E + N+
Sbjct: 478 FAARRYGVSHGDAEAAWRLLLRSVYNC-----------------------SGEECRGHNH 514
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
P+ + L+ T+ +WY+ S+V A L + + + L++S +RYDL+D+TRQA
Sbjct: 515 -SPLVRRPSLQMVTT------VWYNRSDVFEAWRLLLTATSTLASSPAFRYDLVDVTRQA 567
Query: 366 LAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
+ + + + + AY L A G+ EL+ +D +LA FLLG WL
Sbjct: 568 VQELVSLYYEEMRTAYLKKELVPLTRAGGILA-----YELLPALDQVLASDCHFLLGSWL 622
Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
E A+Q A +E + YE N+R Q+T+W E ++L DY NK +GL+ DYY PR +
Sbjct: 623 EQARQAAVSETEAHFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRL 677
Query: 480 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ + ++ESL G F+ + R +L + G YP + GD + + L+ KY
Sbjct: 678 FTETLVESLVQGVPFQQHQFDRNAFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKY 734
>gi|270005801|gb|EFA02249.1| hypothetical protein TcasGA2_TC007912 [Tribolium castaneum]
Length = 747
Score = 367 bits (942), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 192/535 (35%), Positives = 314/535 (58%), Gaps = 46/535 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G++P A + ++P A ++++ W + +CC Y LD T+PLF EIG+AF+ +
Sbjct: 238 VLPAFAGHLPRAFKTLYPDANMSKMAPWNGF--NDTYCCPYFLDPTEPLFNEIGKAFLSE 295
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q+ E+G T H+YNCD+F+EN P Y++++G +IY M D DAVWL+QGW+F D
Sbjct: 296 QISEFG-TDHMYNCDSFNENVPTSGDLTYLANVGKSIYKAMTDTDPDAVWLLQGWMFYND 354
Query: 128 PFWRPPQ-MKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW+ + ++++L SVPLGK++VLDL +E P + Q++G PYIWCMLH+F G + M+
Sbjct: 355 NFWQDTERVRSILTSVPLGKMIVLDLQSEQFPQYERLNQYFGQPYIWCMLHDFGGTLGMF 414
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G I P++AR EN+TM+G G++ EGI QN V+Y+LM+E A++ V++ W +Y
Sbjct: 415 GSSTVINEVPIKARHLENSTMIGTGLTPEGINQNYVIYELMTETAWRQAPVNLTEWFEKY 474
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
S RRYG ++AW +L TVY D ++ GKY
Sbjct: 475 STRRYGFPDSDAENAWRILQRTVY--------------------DYQGLNRMRGKY---- 510
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
++K LK + + WYST++++ A + + + L A++ Y +DL+D+TRQ L
Sbjct: 511 -AITKSPSLKIKIWT------WYSTNDLLEAWTSLLEASDNLGANSGYLHDLVDVTRQVL 563
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
Y + + +++ YQ +D+ S++FLE+++D+D +L+ + FLLGPWLE+AK+ A
Sbjct: 564 QVYGDLYYKEMVKNYQSHDSANFQANSKKFLEILDDLDEILSTNSAFLLGPWLEAAKKAA 623
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
+ +E Q+E+NAR QIT+W + + DY NK W+G++ ++ PR ++ Y+
Sbjct: 624 NDSAEEAQFEYNARNQITLWGPRGE-----IMDYANKQWAGVVSHFFAPRWYLFINYLNS 678
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL-----ITSQWLYNKY 536
+ + D + + ++ + R +PVE GDA+ I +W +Y
Sbjct: 679 TFDGAFNQTYID-AKMFKEVEEPFTFDRTEFPVEPIGDAVEIAWKIHKKWTSEEY 732
>gi|126307960|ref|XP_001366343.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Monodelphis
domestica]
Length = 741
Score = 367 bits (942), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 193/533 (36%), Positives = 296/533 (55%), Gaps = 43/533 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G++P A VFP A +T LG W + + C+YLL DPLF +G F+ +
Sbjct: 239 VLPAFAGHIPKAFTRVFPQANVTNLGMWGHFSCN--YSCSYLLAPEDPLFPVVGSLFLRE 296
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY+ D F+E PP +P Y+++ AA+Y M + D DAVWL QGWLF
Sbjct: 297 LTKEFG-TDHIYSADIFNEMDPPSSNPAYLAATTAAVYEAMVAVDVDAVWLFQGWLFQNH 355
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW+PPQMKA+L +VP G+ ++LDLFAE +P++S + FYG P+IWCMLHNF GN ++
Sbjct: 356 PDFWKPPQMKAVLEAVPRGRFLILDLFAESQPVYSRTNSFYGQPFIWCMLHNFGGNHGLF 415
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G+LD++ GP AR N+T+VG G+ EGI QN VVY LM+E+ ++ + D+ AW+
Sbjct: 416 GVLDAVNRGPSTARLFPNSTIVGTGIVPEGINQNEVVYALMAELGWRKDPFPDLGAWVAG 475
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ +RYG P + AW +L +VYNC+ + T N +V P +
Sbjct: 476 FAAQRYGTPHPQAEAAWRLLLRSVYNCSWENCTGHNHSPLVKRPSLHLDF---------- 525
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+WY+ S+V A L + + +L+ S+ +RYDL+D+TRQ
Sbjct: 526 ---------------------SVWYNRSDVFEAWRLLLEAAPQLATSSAFRYDLLDVTRQ 564
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
+ + + + A++ + + +L+ +D LL + FLLG WLE A+
Sbjct: 565 VAQELVSLYYGELKTAFEAGSMPALLSAGGLLVFDLLPSLDELLGTDERFLLGGWLEQAR 624
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
++A +E + YE NAR Q+T+W + DY NK +GL+ YY PR ++ +
Sbjct: 625 EMAVSEAEAWHYEQNARYQLTLWGPTGN-----ILDYANKQLAGLVAGYYAPRWKLFVEM 679
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+++SL G F + E L + +GR +P + GD + ++ + KY
Sbjct: 680 LVKSLAEGTPFHQNQFENEAFLLGQAFVSGREKFPTQPQGDTVDLARKFFLKY 732
>gi|281344539|gb|EFB20123.1| hypothetical protein PANDA_011160 [Ailuropoda melanoleuca]
Length = 619
Score = 367 bits (941), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 201/533 (37%), Positives = 306/533 (57%), Gaps = 43/533 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP AL VFP +TQLG+W + C++LL DPLF IG F+ +
Sbjct: 118 VLPAFAGHVPKALTRVFPQVNVTQLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRE 175
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY DTF+E PP P Y+++ A++Y M + D DAVWL+QGWLF +
Sbjct: 176 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATASVYQAMITVDPDAVWLLQGWLFQHQ 234
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+ A+L +VP G+L+VLDLFAE +P++ + F+G P+IWCMLHNF GN ++
Sbjct: 235 PEFWGPAQVTAVLGAVPRGRLLVLDLFAESQPVYIRTASFHGQPFIWCMLHNFGGNHGLF 294
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TM G GM+ EGI QN +VY LM+E+ ++ + V D++AW++
Sbjct: 295 GALEAVNRGPAAARLFPNSTMAGTGMAPEGIGQNEMVYALMAELGWRKDPVADLEAWVSS 354
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
+ RRYG + + AW +L +VYNC+ + + NR +V P
Sbjct: 355 SAARRYGVTHKDTEAAWRLLLRSVYNCSGEACSGHNRSPLVRRPS--------------- 399
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
L+ T+ +WY+ S+V A L + + L+AS ++RYDL+D+TRQ
Sbjct: 400 ----------LQMATA------VWYNRSDVFEAWRLLLTAAPTLAASPSFRYDLLDVTRQ 443
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
A + + + AY + + + + R + EL+ +D +LA FLLG WLE A+
Sbjct: 444 AAQELVSLYYEEARAAYLNKELVPLLRAAGRLVYELLPALDKVLASDRRFLLGSWLEQAR 503
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
A +E + + YE N+R Q+T+W E ++L DY NK +GL+ DYY PR ++ +
Sbjct: 504 AAAVSEAEARFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYAPRWGLFMEM 558
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ESL G F+ + + +L + YP + GD + ++ L+ KY
Sbjct: 559 LVESLAQGIPFQQHQFDKNAFQLEQAFVFSTQRYPSQPQGDTVDLAKKLFLKY 611
>gi|91080563|ref|XP_973259.1| PREDICTED: similar to alpha-N-acetyl glucosaminidase [Tribolium
castaneum]
Length = 747
Score = 366 bits (940), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 193/535 (36%), Positives = 313/535 (58%), Gaps = 46/535 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G++P A + ++P A ++++ W + +CC Y LD T+PLF EIG+AF+ +
Sbjct: 238 VLPAFAGHLPRAFKTLYPDANMSKMAPWNGF--NDTYCCPYFLDPTEPLFNEIGKAFLSE 295
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q+ E+G T H+YNCD+F+EN P Y++++G +IY M D DAVW+MQGWLF++D
Sbjct: 296 QISEFG-TDHMYNCDSFNENVPTSGDLTYLANVGKSIYKAMTDTDPDAVWVMQGWLFAHD 354
Query: 128 PF-WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
F W + KA+L +VP GK++VLDL +E P + Q++G PYIWCMLH+F G + M+
Sbjct: 355 FFYWTRNRAKAILTAVPKGKMIVLDLQSEQFPQYERLNQYFGQPYIWCMLHDFGGTLGMF 414
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G I P++AR EN+TM+G G++ EGI QN V+Y+LM+E A++ V++ W +Y
Sbjct: 415 GSSTVINEVPIKARHLENSTMIGTGLTPEGINQNYVIYELMTETAWRQAPVNLTEWFEKY 474
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
S RRYG ++AW +L TVY D ++ GKY
Sbjct: 475 STRRYGFPDSDAENAWRILQRTVY--------------------DYQGLNRMRGKY---- 510
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
++K LK + + WYST++++ A + + + L A++ Y +DL+D+TRQ L
Sbjct: 511 -AITKSPSLKIKIWT------WYSTNDLLEAWTSLLEASDNLGANSGYLHDLVDVTRQVL 563
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
Y + + +++ YQ +D+ S++FLE+++D+D +L+ + FLLGPWLE+AK+ A
Sbjct: 564 QVYGDLYYKEMVKNYQSHDSANFQANSKKFLEILDDLDEILSTNSAFLLGPWLEAAKKAA 623
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
+ +E Q+E+NAR QIT+W + + DY NK W+G++ ++ PR ++ Y+
Sbjct: 624 NDSAEEAQFEYNARNQITLWGPRGE-----IMDYANKQWAGVVSHFFAPRWYLFINYLNS 678
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL-----ITSQWLYNKY 536
+ + D + + ++ + R +PVE GDA+ I +W +Y
Sbjct: 679 TFDGAFNQTYID-AKMFKEVEEPFTFDRTEFPVEPIGDAVEIAWKIHKKWTSEEY 732
>gi|301773566|ref|XP_002922216.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Ailuropoda
melanoleuca]
Length = 634
Score = 366 bits (940), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 201/533 (37%), Positives = 306/533 (57%), Gaps = 43/533 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP AL VFP +TQLG+W + C++LL DPLF IG F+ +
Sbjct: 132 VLPAFAGHVPKALTRVFPQVNVTQLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRE 189
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY DTF+E PP P Y+++ A++Y M + D DAVWL+QGWLF +
Sbjct: 190 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATASVYQAMITVDPDAVWLLQGWLFQHQ 248
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+ A+L +VP G+L+VLDLFAE +P++ + F+G P+IWCMLHNF GN ++
Sbjct: 249 PEFWGPAQVTAVLGAVPRGRLLVLDLFAESQPVYIRTASFHGQPFIWCMLHNFGGNHGLF 308
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TM G GM+ EGI QN +VY LM+E+ ++ + V D++AW++
Sbjct: 309 GALEAVNRGPAAARLFPNSTMAGTGMAPEGIGQNEMVYALMAELGWRKDPVADLEAWVSS 368
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
+ RRYG + + AW +L +VYNC+ + + NR +V P
Sbjct: 369 SAARRYGVTHKDTEAAWRLLLRSVYNCSGEACSGHNRSPLVRRPS--------------- 413
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
L+ T+ +WY+ S+V A L + + L+AS ++RYDL+D+TRQ
Sbjct: 414 ----------LQMATA------VWYNRSDVFEAWRLLLTAAPTLAASPSFRYDLLDVTRQ 457
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
A + + + AY + + + + R + EL+ +D +LA FLLG WLE A+
Sbjct: 458 AAQELVSLYYEEARAAYLNKELVPLLRAAGRLVYELLPALDKVLASDRRFLLGSWLEQAR 517
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
A +E + + YE N+R Q+T+W E ++L DY NK +GL+ DYY PR ++ +
Sbjct: 518 AAAVSEAEARFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYAPRWGLFMEM 572
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ESL G F+ + + +L + YP + GD + ++ L+ KY
Sbjct: 573 LVESLAQGIPFQQHQFDKNAFQLEQAFVFSTQRYPSQPQGDTVDLAKKLFLKY 625
>gi|410981277|ref|XP_003996997.1| PREDICTED: alpha-N-acetylglucosaminidase [Felis catus]
Length = 857
Score = 366 bits (939), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 203/533 (38%), Positives = 304/533 (57%), Gaps = 43/533 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP +TQLG+W + C++LL DPLF IG F+ +
Sbjct: 355 VLPAFAGHVPKAITRVFPQVNVTQLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRE 412
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY DTF+E PP P Y++S A++Y M + D DAVWL+QGWLF +
Sbjct: 413 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLASATASVYQAMVTVDPDAVWLLQGWLFQHQ 471
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+ A+L +VP G+L+VLDLFAE +P++ + F G P+IWCMLHNF GN ++
Sbjct: 472 PQFWGPAQVSAVLGAVPRGRLLVLDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLF 531
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D++AW+
Sbjct: 532 GALEAVNRGPAAARLFPNSTMVGTGMAPEGIGQNEVVYALMAELGWRKDPVADLEAWVTG 591
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ RRYG S + AW +L +VYNC+ + + NR +V P
Sbjct: 592 FAARRYGVSHGNTEAAWRLLLRSVYNCSGEACSGHNRSPLVRRPS--------------- 636
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
LK T+ +WY+ S+V A L + + L+ S T+RYDL+D+TRQ
Sbjct: 637 ----------LKMTTT------VWYNRSDVFEAWRLLLTTTPSLATSPTFRYDLLDVTRQ 680
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
A + + + AY + + + + + EL+ +D +LA FLLG WLE A+
Sbjct: 681 AAQELVSLYYGEARTAYLNKELVPLLRAAGILVYELLPSLDKVLASDSRFLLGSWLEQAR 740
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
A +E + YE N+R Q+T+W E ++L DY NK +GL+ DYY PR ++ +
Sbjct: 741 AAAVSEAEAHFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYTPRWRLFMEM 795
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ESL G F+ + + +L + YP + +GD + ++ L+ +Y
Sbjct: 796 LVESLVRGVPFQQHQFDQNAFQLEQTFVLSTQRYPSQPHGDTVDLAKKLFLRY 848
>gi|301626955|ref|XP_002942650.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Xenopus (Silurana)
tropicalis]
Length = 759
Score = 364 bits (935), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 195/541 (36%), Positives = 299/541 (55%), Gaps = 51/541 (9%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L VLPAF+G++P + VFP +++LG W + + C+YLLD DPLF IG
Sbjct: 244 LGMITVLPAFAGHIPEGILRVFPKVTVSRLGGWSNFNCT--YSCSYLLDPEDPLFQWIGE 301
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F+ Q ++ +G T HIY+ DTF+E +P P Y+S++ AI+ M D DA+WLMQGW
Sbjct: 302 LFLSQMVQSFG-TDHIYSADTFNEMSPTSSDPGYLSAVSGAIFKSMAKVDPDAIWLMQGW 360
Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
LF +P FWRP Q KALL+ P+G+++VLDLFAE P++ T++ FYG P+IWCML+NF G
Sbjct: 361 LFINNPSFWRPAQTKALLHGAPIGRIIVLDLFAETVPVYLTTESFYGQPFIWCMLNNFGG 420
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
N ++G ++ + GP +A N+TMVG G++ EGIEQN ++Y+ M+E+ + + +++
Sbjct: 421 NHGLFGNIEGVNRGPFDAAKFPNSTMVGTGLTPEGIEQNDMIYEFMNEIGWSSQPINLTK 480
Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
WI+ YS RRYG+S + AW +L +VYNCT + N +V P ++ +
Sbjct: 481 WISNYSDRRYGQSNTDARMAWQILLRSVYNCTQILHNHNHSPLVRRPSLNMNT------- 533
Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
+ Y+ +++ A + L S T+ YDL+D+
Sbjct: 534 ------------------------DICYNKADIYEAWRFMHNASFALGKSATFLYDLVDI 569
Query: 362 TRQALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
TR+A+ + +E +L I EAY QL A GV +L+ ++D LL+ GFLL
Sbjct: 570 TREAVQQLVSEYYLEIKEAYGKKSLQQLMTAGGVL-----VYDLLPELDSLLSSQPGFLL 624
Query: 416 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
G WL++AK +A + Y+ NAR QIT+W + DY NK + GL++DYY
Sbjct: 625 GSWLKAAKSMASTPAEAALYDMNARNQITLWGPTGN-----ILDYANKQYGGLVQDYYTE 679
Query: 476 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 535
R ++ ++++SL G+ F + + L D+ Y GD L + +Y K
Sbjct: 680 RWGLFVWFLVQSLNKGEHFNQDKFNKAVFVLEEDFVYNGKEYMASPTGDTLEIANKIYLK 739
Query: 536 Y 536
Y
Sbjct: 740 Y 740
>gi|148671928|gb|EDL03875.1| alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB), isoform
CRA_a [Mus musculus]
Length = 538
Score = 364 bits (934), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 198/533 (37%), Positives = 302/533 (56%), Gaps = 43/533 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP + +LG+W + C++LL DP+F IG F+ +
Sbjct: 38 VLPAFAGHVPKAITRVFPQVNVIKLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRE 95
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY DTF+E PP P Y+++ AA+Y M + D DAVWL+QGWLF +
Sbjct: 96 LTKEFG-TDHIYGADTFNEMQPPFSDPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQ 154
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q++A+L +VP G+L+VLDLFAE P++ + F+G P+IWCMLHNF GN ++
Sbjct: 155 PQFWGPSQIRAVLEAVPRGRLLVLDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLF 214
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+ + GP AR N+TMVG G++ EGI QN VVY LM+E+ ++ + V D+ AW++
Sbjct: 215 GALEDVNRGPQAARLFPNSTMVGTGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSS 274
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
+++RRYG S P AW +L +VYNC+ + + NR
Sbjct: 275 FAIRRYGVSQPDAVAAWKLLLRSVYNCSGEACSGHNRS---------------------- 312
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
P+ K L+ T+ +WY+ S+V A L + + L+ S +RYDL+D+TRQ
Sbjct: 313 ---PLVKRPSLQMSTA------VWYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQ 363
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
A+ + + + AY + + + + +L+ +D LLA FLLG WL+ A+
Sbjct: 364 AVQELVSLCYEEARTAYLKQELDLLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQAR 423
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
+ A +E + + YE N+R QIT+W E ++L DY NK +GL+ DYY PR ++
Sbjct: 424 KAAVSEAEAQFYEQNSRYQITLW----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGT 478
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ SL G F+ ++ + L + + YP + GD + S+ ++ KY
Sbjct: 479 LAHSLARGVPFQQHEFEKNVFPLEQAFVYNKKRYPSQPRGDTVDLSKKIFLKY 531
>gi|1171229|gb|AAC50512.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|1171231|gb|AAC50513.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|1197840|gb|AAB06188.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|1479981|gb|AAB36604.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|32450702|gb|AAH53991.1| N-acetylglucosaminidase, alpha- [Homo sapiens]
gi|119581237|gb|EAW60833.1| N-acetylglucosaminidase, alpha- (Sanfilippo disease IIIB), isoform
CRA_b [Homo sapiens]
Length = 743
Score = 363 bits (933), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 198/536 (36%), Positives = 304/536 (56%), Gaps = 43/536 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP +T++G+W + C++LL DP+F IG F+ +
Sbjct: 241 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRE 298
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+KE+G T HIY DTF+E PP P Y+++ A+Y M + D++AVWL+QGWLF +
Sbjct: 299 LIKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQ 357
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q++A+L +VP G+L+VLDLFAE +P+++ + F G P+IWCMLHNF GN ++
Sbjct: 358 PQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 417
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+
Sbjct: 418 GALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTS 477
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ RRYG S P AW +L +VYNC+ + NR +V P
Sbjct: 478 FAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 522
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
L+ TS +WY+ S+V A L + S L+ S +RYDL+DLTRQ
Sbjct: 523 ----------LQMNTS------IWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQ 566
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
A+ + + + AY + + + EL+ +D +LA FLLG WLE A+
Sbjct: 567 AVQELVSLYYEEARSAYLSKELASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQAR 626
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
A +E + YE N+R Q+T+W E ++L DY NK +GL+ +YY PR ++ +
Sbjct: 627 AAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEA 681
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 539
+++S+ G F+ + + +L + + YP + GD + ++ ++ KY G
Sbjct: 682 LVDSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKYYPG 737
>gi|357458269|ref|XP_003599415.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
gi|355488463|gb|AES69666.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
Length = 539
Score = 363 bits (933), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 171/259 (66%), Positives = 199/259 (76%), Gaps = 27/259 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP+FSGNVPAAL +FPSAKIT+LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +
Sbjct: 271 VLPSFSGNVPAALTKIFPSAKITRLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRK 330
Query: 68 QLK--------------------------EYGRTSHIYNCDTFDENTPPVDSPEYISSLG 101
Q+K EYG + IYNCDTF+EN+PP P YIS+LG
Sbjct: 331 QIKATETIHQESEDLGSLIIMDRAVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLG 390
Query: 102 AAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIW 160
AA+Y G+ GD DAVWLMQGWLF D FW+PPQMKALL SVP GK++VLDLFA+VKPIW
Sbjct: 391 AAVYQGISKGDKDAVWLMQGWLFYSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIW 450
Query: 161 STSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQN 220
TS QFYG PYIWCMLHNF GNIEMYG+LD+IA GPV+AR SEN+TMVGVGM MEGIE N
Sbjct: 451 KTSFQFYGTPYIWCMLHNFGGNIEMYGVLDAIASGPVDARVSENSTMVGVGMCMEGIEHN 510
Query: 221 PVVYDLMSEMAFQHEKVDV 239
P+VY+LMSEMAF+ EKV +
Sbjct: 511 PIVYELMSEMAFRDEKVKI 529
>gi|1479983|gb|AAB36605.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|119581236|gb|EAW60832.1| N-acetylglucosaminidase, alpha- (Sanfilippo disease IIIB), isoform
CRA_a [Homo sapiens]
Length = 639
Score = 363 bits (931), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 198/536 (36%), Positives = 304/536 (56%), Gaps = 43/536 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP +T++G+W + C++LL DP+F IG F+ +
Sbjct: 137 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRE 194
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+KE+G T HIY DTF+E PP P Y+++ A+Y M + D++AVWL+QGWLF +
Sbjct: 195 LIKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQ 253
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q++A+L +VP G+L+VLDLFAE +P+++ + F G P+IWCMLHNF GN ++
Sbjct: 254 PQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 313
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+
Sbjct: 314 GALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTS 373
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ RRYG S P AW +L +VYNC+ + NR +V P
Sbjct: 374 FAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 418
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
L+ TS +WY+ S+V A L + S L+ S +RYDL+DLTRQ
Sbjct: 419 ----------LQMNTS------IWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQ 462
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
A+ + + + AY + + + EL+ +D +LA FLLG WLE A+
Sbjct: 463 AVQELVSLYYEEARSAYLSKELASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQAR 522
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
A +E + YE N+R Q+T+W E ++L DY NK +GL+ +YY PR ++ +
Sbjct: 523 AAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEA 577
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 539
+++S+ G F+ + + +L + + YP + GD + ++ ++ KY G
Sbjct: 578 LVDSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKYYPG 633
>gi|66346698|ref|NP_000254.2| alpha-N-acetylglucosaminidase precursor [Homo sapiens]
gi|317373322|sp|P54802.2|ANAG_HUMAN RecName: Full=Alpha-N-acetylglucosaminidase; AltName:
Full=N-acetyl-alpha-glucosaminidase; Short=NAG;
Contains: RecName: Full=Alpha-N-acetylglucosaminidase 82
kDa form; Contains: RecName:
Full=Alpha-N-acetylglucosaminidase 77 kDa form; Flags:
Precursor
Length = 743
Score = 362 bits (930), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 197/533 (36%), Positives = 303/533 (56%), Gaps = 43/533 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP +T++G+W + C++LL DP+F IG F+ +
Sbjct: 241 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRE 298
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+KE+G T HIY DTF+E PP P Y+++ A+Y M + D++AVWL+QGWLF +
Sbjct: 299 LIKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQ 357
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q++A+L +VP G+L+VLDLFAE +P+++ + F G P+IWCMLHNF GN ++
Sbjct: 358 PQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 417
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+
Sbjct: 418 GALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTS 477
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ RRYG S P AW +L +VYNC+ + NR +V P
Sbjct: 478 FAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 522
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
L+ TS +WY+ S+V A L + S L+ S +RYDL+DLTRQ
Sbjct: 523 ----------LQMNTS------IWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQ 566
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
A+ + + + AY + + + EL+ +D +LA FLLG WLE A+
Sbjct: 567 AVQELVSLYYEEARSAYLSKELASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQAR 626
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
A +E + YE N+R Q+T+W E ++L DY NK +GL+ +YY PR ++ +
Sbjct: 627 AAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEA 681
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+++S+ G F+ + + +L + + YP + GD + ++ ++ KY
Sbjct: 682 LVDSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 734
>gi|2660688|gb|AAB88084.1| Naglu [Mus musculus]
Length = 739
Score = 362 bits (930), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 198/533 (37%), Positives = 302/533 (56%), Gaps = 43/533 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP + +LG+W + C++LL DP+F IG F+ +
Sbjct: 239 VLPAFAGHVPKAITRVFPQVNVIKLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRE 296
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY DTF+E PP P Y+++ AA+Y M + D DAVWL+QGWLF +
Sbjct: 297 LTKEFG-TDHIYGADTFNEMQPPFSEPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQ 355
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q++A+L +VP G+L+VLDLFAE P++ + F+G P+IWCMLHNF GN ++
Sbjct: 356 PQFWGPSQIRAVLEAVPRGRLLVLDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLF 415
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+ + GP AR N+TMVG G++ EGI QN VVY LM+E+ ++ + V D+ AW++
Sbjct: 416 GALEDVNRGPQAARLFPNSTMVGTGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSS 475
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
+++RRYG S P AW +L +VYNC+ + + NR
Sbjct: 476 FAIRRYGVSQPDAVAAWKLLLRSVYNCSGEACSGHNRS---------------------- 513
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
P+ K L+ T+ +WY+ S+V A L + + L+ S +RYDL+D+TRQ
Sbjct: 514 ---PLVKRPSLQMSTA------VWYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQ 564
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
A+ + + + AY + + + + +L+ +D LLA FLLG WL+ A+
Sbjct: 565 AVQELVSLCYEEARTAYLKQELDLLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQAR 624
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
+ A +E + + YE N+R QIT+W E ++L DY NK +GL+ DYY PR ++
Sbjct: 625 KAAVSEAEAQFYEQNSRYQITLW----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGT 679
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ SL G F+ ++ + L + + YP + GD + S+ ++ KY
Sbjct: 680 LAHSLARGVPFQQHEFEKNVFPLEQAFVYNKKRYPSQPRGDTVDLSKKIFLKY 732
>gi|254910995|ref|NP_038820.2| alpha-N-acetylglucosaminidase precursor [Mus musculus]
gi|20385160|gb|AAM21194.1|AF363242_1 N-acetyl-glucosaminidase [Mus musculus]
gi|3329361|gb|AAC26842.1| alpha-N-acetylglucosaminidase [Mus musculus]
gi|33585908|gb|AAH55733.1| Alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB) [Mus
musculus]
gi|74211094|dbj|BAE37639.1| unnamed protein product [Mus musculus]
gi|74218052|dbj|BAE42009.1| unnamed protein product [Mus musculus]
gi|148671929|gb|EDL03876.1| alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB), isoform
CRA_b [Mus musculus]
Length = 739
Score = 362 bits (930), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 198/533 (37%), Positives = 302/533 (56%), Gaps = 43/533 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP + +LG+W + C++LL DP+F IG F+ +
Sbjct: 239 VLPAFAGHVPKAITRVFPQVNVIKLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRE 296
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY DTF+E PP P Y+++ AA+Y M + D DAVWL+QGWLF +
Sbjct: 297 LTKEFG-TDHIYGADTFNEMQPPFSDPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQ 355
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q++A+L +VP G+L+VLDLFAE P++ + F+G P+IWCMLHNF GN ++
Sbjct: 356 PQFWGPSQIRAVLEAVPRGRLLVLDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLF 415
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+ + GP AR N+TMVG G++ EGI QN VVY LM+E+ ++ + V D+ AW++
Sbjct: 416 GALEDVNRGPQAARLFPNSTMVGTGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSS 475
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
+++RRYG S P AW +L +VYNC+ + + NR
Sbjct: 476 FAIRRYGVSQPDAVAAWKLLLRSVYNCSGEACSGHNRS---------------------- 513
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
P+ K L+ T+ +WY+ S+V A L + + L+ S +RYDL+D+TRQ
Sbjct: 514 ---PLVKRPSLQMSTA------VWYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQ 564
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
A+ + + + AY + + + + +L+ +D LLA FLLG WL+ A+
Sbjct: 565 AVQELVSLCYEEARTAYLKQELDLLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQAR 624
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
+ A +E + + YE N+R QIT+W E ++L DY NK +GL+ DYY PR ++
Sbjct: 625 KAAVSEAEAQFYEQNSRYQITLW----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGT 679
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ SL G F+ ++ + L + + YP + GD + S+ ++ KY
Sbjct: 680 LAHSLARGVPFQQHEFEKNVFPLEQAFVYNKKRYPSQPRGDTVDLSKKIFLKY 732
>gi|114667172|ref|XP_523654.2| PREDICTED: alpha-N-acetylglucosaminidase isoform 2 [Pan
troglodytes]
gi|410216584|gb|JAA05511.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
gi|410258938|gb|JAA17435.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
gi|410304442|gb|JAA30821.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
gi|410337929|gb|JAA37911.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
Length = 743
Score = 361 bits (927), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 197/533 (36%), Positives = 302/533 (56%), Gaps = 43/533 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP +T++G+W + C++LL DP+F IG F+ +
Sbjct: 241 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRE 298
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+KE+G T HIY DTF+E PP P Y+++ A+Y M + D++AVWL+QGWLF +
Sbjct: 299 LIKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQ 357
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q++A+L +VP G+L+VLDLFAE +P+++ + F G P+IWCMLHNF GN ++
Sbjct: 358 PQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 417
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+
Sbjct: 418 GALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTS 477
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ RRYG S P AW +L +VYNC+ + NR +V P
Sbjct: 478 FAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 522
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
L+ TS +WY+ S+V A L + S L+ S +RYDL+DLTRQ
Sbjct: 523 ----------LQMNTS------IWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQ 566
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
A+ + + + AY + + + EL+ +D +LA FLLG WLE A+
Sbjct: 567 AVQELVSLYYEEARSAYLSKELASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQAR 626
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
A +E + YE N+R Q+T+W E ++L DY NK +GL+ +YY PR ++ +
Sbjct: 627 AAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEA 681
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ +S+ G F+ + + +L + + YP + GD + ++ ++ KY
Sbjct: 682 LADSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 734
>gi|311267179|ref|XP_003131436.1| PREDICTED: alpha-N-acetylglucosaminidase [Sus scrofa]
Length = 744
Score = 361 bits (926), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 204/538 (37%), Positives = 301/538 (55%), Gaps = 53/538 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP AL VFP +TQ+G+W + C++LL DPLF +G F+ +
Sbjct: 242 VLPAFAGHVPKALTRVFPQISVTQMGSWGHFNCS--YSCSFLLAPEDPLFPIVGSLFLRE 299
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY DTF+E PP P Y+++ AA+Y M + D DAVWL+QGWLF +
Sbjct: 300 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATAAVYQAMITVDPDAVWLLQGWLFQHQ 358
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+ A+L +VP G+L+VLDLFAE +P++ + F G P+IWCMLHNF GN ++
Sbjct: 359 PQFWGPAQVGAVLGAVPRGRLLVLDLFAESQPVYVRTASFLGQPFIWCMLHNFGGNHGLF 418
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+S+ GP AR N+TM G GM+ EGI QN VVY LM+E+ ++ + V D+ W+
Sbjct: 419 GALESVNQGPAAARLFPNSTMAGTGMAPEGIGQNEVVYALMAELGWRKDPVADLGTWVTS 478
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ RRYG S + AW +L +VYNC+ +G T NR +V P + +
Sbjct: 479 FAARRYGVSQGDAEAAWRLLLRSVYNCSGEGCTGHNRSPLVRRPSLQMATT--------- 529
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+WY+ S+V A L + + L++S +RYDL+D+TRQ
Sbjct: 530 ----------------------VWYNQSDVFEAWRLLLKATPTLASSPAFRYDLVDITRQ 567
Query: 365 ALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
A+ + + + AY L A G+ EL+ +D +LA FLLG W
Sbjct: 568 AVQELVSLYYEEARTAYLNKELVSLMRAGGILA-----YELLPALDKVLASDSHFLLGSW 622
Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
LE A+ +A +E + YE N+R Q+T+W E ++L DY NK +GL+ DYY PR
Sbjct: 623 LEQARGVAVSEAEALFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYTPRWR 677
Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ + ++ESL G F+ + + +L + G YP + GD + ++ L+ KY
Sbjct: 678 LFMEMLVESLVQGIPFQQHQFDQNVFQLEQTFVLGTRRYPSQPQGDTVDLAKKLFLKY 735
>gi|380030624|ref|XP_003698943.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
[Apis florea]
Length = 769
Score = 360 bits (925), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 197/552 (35%), Positives = 304/552 (55%), Gaps = 49/552 (8%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L VLPAF+G+VP A +FP A +T+ W + ++CC YLL+ DPLF +IG+
Sbjct: 248 LGIIPVLPAFAGHVPRAFLRLFPKANVTKSAVWNNFSD--KYCCPYLLEPMDPLFKQIGQ 305
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F++ ++E+G T H+YNCDTF+EN P +++ ++G +I+ M + DS A+WLMQGW
Sbjct: 306 QFLKTYIEEFG-TDHVYNCDTFNENEPYTSELKFLRNIGHSIFEAMSNVDSKAIWLMQGW 364
Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
LF +D FW P+ + L S+PLG+++VLDL +E P + +YG P+IWCMLHNF G
Sbjct: 365 LFYHDSVFWTEPRTRTFLTSIPLGRMIVLDLQSEQFPQYKRLNSYYGQPFIWCMLHNFGG 424
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
+ M+G + I EAR +TMVG G++ EGI QN V+Y+LM+EMA++ V++
Sbjct: 425 TLGMFGSAEIINHRIFEARNMNGSTMVGTGLTPEGINQNYVIYELMNEMAYRKRPVNLDK 484
Query: 242 WINQYSVRRYG--RSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
W Y+ RRYG + AW +TVYN +D + + I P+++ S
Sbjct: 485 WFENYANRRYGDTKGNEHTVTAWKGFKNTVYNFSDTRRIRGKYAITIRPNLNFS------ 538
Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
P WY+ I + + + + S YR+D++
Sbjct: 539 -------------------------PWRWYNKDAFIHYWYMLLQARDLKRNSTLYRHDVV 573
Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
D+TRQAL A+E++ ++IE++ + Q ++ L L +D++ +LA + FLLG WL
Sbjct: 574 DVTRQALQLIADEIYTDLIESFNKKNIDLFKQNAKLLLALFDDLEEILASSEDFLLGKWL 633
Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
+ AK LA N+E+E YE+NAR QIT+W +RDY NK WSG++ DY+ PR AI
Sbjct: 634 KMAKDLATNDEEEILYEYNARNQITLW-----GPLGEIRDYANKQWSGIVADYFKPRWAI 688
Query: 480 YFKYMIESLESGDGFRLKDWRREWIK-LTNDWQNGRNVYPVESNGDAL-----ITSQWLY 533
+ + SL +G + + + + R +YP ++ GD++ I S+W Y
Sbjct: 689 FLNELETSLTTGTRVNTTKMNEQIFENVEEAFTFSRKIYPTKATGDSIDIAERILSEW-Y 747
Query: 534 NKYLQGTGVFDH 545
+ YL F H
Sbjct: 748 DPYLPFHKTFRH 759
>gi|426348060|ref|XP_004041658.1| PREDICTED: alpha-N-acetylglucosaminidase [Gorilla gorilla gorilla]
Length = 743
Score = 360 bits (924), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 197/533 (36%), Positives = 302/533 (56%), Gaps = 43/533 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP +T++G+W + C++LL DP+F IG F+ +
Sbjct: 241 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRE 298
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+KE+G T HIY DTF+E PP P Y+++ A+Y M + D++AVWL+QGWLF +
Sbjct: 299 LIKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQ 357
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q++A+L +VP G+L+VLDLFAE +P+++ + F G P+IWCMLHNF GN ++
Sbjct: 358 PQFWGPAQIEAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 417
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+
Sbjct: 418 GALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTS 477
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ RRYG S P AW +L +VYNC+ + NR +V P
Sbjct: 478 FAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 522
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
L+ TS +WY+ S+V A L + S L+ S +RYDL+DLTRQ
Sbjct: 523 ----------LQMNTS------IWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQ 566
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
A+ + + + AY + + + EL+ +D +LA FLLG WLE A+
Sbjct: 567 AVQELVSLYYEEARSAYLSKELASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQAR 626
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
A +E + YE N+R Q+T+W E ++L DY NK +GL+ +YY PR ++ +
Sbjct: 627 AAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEA 681
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ +S+ G F+ + + +L + + YP + GD + ++ ++ KY
Sbjct: 682 LADSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPQGDTVDLAKKIFLKY 734
>gi|397485721|ref|XP_003813989.1| PREDICTED: alpha-N-acetylglucosaminidase [Pan paniscus]
Length = 682
Score = 360 bits (924), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 197/533 (36%), Positives = 302/533 (56%), Gaps = 43/533 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP +T++G+W + C++LL DP+F IG F+ +
Sbjct: 180 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRE 237
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+KE+G T HIY DTF+E PP P Y+++ A+Y M + D++AVWL+QGWLF +
Sbjct: 238 LIKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQ 296
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q++A+L +VP G+L+VLDLFAE +P+++ + F G P+IWCMLHNF GN ++
Sbjct: 297 PQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 356
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+
Sbjct: 357 GALEAVNEGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTS 416
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ RRYG S P AW +L +VYNC+ + NR +V P
Sbjct: 417 FAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 461
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
L+ TS +WY+ S+V A L + S L+ S +RYDL+DLTRQ
Sbjct: 462 ----------LQMNTS------IWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQ 505
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
A+ + + + AY + + + EL+ +D +LA FLLG WLE A+
Sbjct: 506 AVQELVSLYYEEARSAYLSKELASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQAR 565
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
A +E + YE N+R Q+T+W E ++L DY NK +GL+ +YY PR ++ +
Sbjct: 566 AAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEA 620
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ +S+ G F+ + + +L + + YP + GD + ++ ++ KY
Sbjct: 621 LADSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 673
>gi|355568706|gb|EHH24987.1| Alpha-N-acetylglucosaminidase, partial [Macaca mulatta]
Length = 711
Score = 360 bits (923), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 202/538 (37%), Positives = 304/538 (56%), Gaps = 53/538 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP +T++G+W + C++LL DP+F IG F+ +
Sbjct: 209 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRE 266
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+KE+G T HIY DTF+E PP +P Y+++ A+Y M + D++AVWL+QGWLF +
Sbjct: 267 LVKEFG-TDHIYGADTFNEMQPPSSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQ 325
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q++A+L +VP G+L+VLDLFAE +P+++ + F G P+IWCMLHNF GN ++
Sbjct: 326 PQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 385
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+
Sbjct: 386 GALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTN 445
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ +RYG S P AW +L +VYNC+ + NR +V P
Sbjct: 446 FAAQRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 490
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
L+ TS +WY+ S V A L + S L+AS +RYDL+DLTRQ
Sbjct: 491 ----------LQMNTS------VWYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQ 534
Query: 365 ALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
A+ + + + AY L A GV EL+ +D LLA FLLG W
Sbjct: 535 AVQELVSLYYEEARSAYLSKELTSLLRAGGVLAY-----ELLPALDELLASDSRFLLGSW 589
Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
LE A+ A +E + YE N+R Q+T+W E ++L DY NK +GL+ +YY PR
Sbjct: 590 LEQARAAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWR 644
Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ + + +S+ G F+ + + +L + + YP + GD + ++ ++ KY
Sbjct: 645 LFLEALADSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 702
>gi|402900329|ref|XP_003913130.1| PREDICTED: alpha-N-acetylglucosaminidase [Papio anubis]
Length = 743
Score = 359 bits (922), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 202/538 (37%), Positives = 303/538 (56%), Gaps = 53/538 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP +T++G+W + C++LL DP+F IG F+ +
Sbjct: 241 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRE 298
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+KE+G T HIY DTF+E PP +P Y+++ A+Y M + D++AVWL+QGWLF +
Sbjct: 299 LVKEFG-TDHIYGADTFNEMQPPSSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQ 357
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+ A+L +VP G+L+VLDLFAE +P+++ + F G P+IWCMLHNF GN ++
Sbjct: 358 PQFWGPAQIGAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 417
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+
Sbjct: 418 GALEAVNRGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTN 477
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ +RYG S P AW +L +VYNC+ + NR +V P
Sbjct: 478 FAAQRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 522
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
L+ TS +WY+ S V A L + S L+AS +RYDL+DLTRQ
Sbjct: 523 ----------LQMNTS------VWYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQ 566
Query: 365 ALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
A+ + + + AY L A GV EL+ +D LLA FLLG W
Sbjct: 567 AVQELVSLYYEEARSAYLSKELTSLLRAGGVLAY-----ELLPALDELLASDSRFLLGSW 621
Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
LE A+ A +E + YE N+R Q+T+W E ++L DY NK +GL+ +YY PR
Sbjct: 622 LEQARAAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWR 676
Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ + + +S+ G F+ + + +L + + YP + GD + ++ ++ KY
Sbjct: 677 LFLEALADSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 734
>gi|332260899|ref|XP_003279518.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
[Nomascus leucogenys]
Length = 736
Score = 359 bits (921), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 197/533 (36%), Positives = 302/533 (56%), Gaps = 43/533 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP +T++G+W + C++LL DP+F IG F+ +
Sbjct: 234 VLPAFAGHVPEAVTRVFPRVNVTKMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRE 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+KE+G T HIY DTF+E PP P Y+++ A+Y M + D++AVWL+QGWLF +
Sbjct: 292 LIKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQ 350
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+ A+L +VP G+L+VLDLFAE +P+++ + F G P+IWCMLHNF GN ++
Sbjct: 351 PQFWGPAQIGAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 410
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+
Sbjct: 411 GALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTS 470
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ +RYG S P AW +L +VYNC+ + NR +V P
Sbjct: 471 FAAQRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 515
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
L+ TS +WY+ S+V A L + S L+AS +RYDL+DLTRQ
Sbjct: 516 ----------LQMNTS------IWYNRSDVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQ 559
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
A+ + + + AY + + + EL+ +D +LA FLLG WLE A+
Sbjct: 560 AVQELVSLYYEEARSAYLRKELASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLELAR 619
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
A +E + YE N+R Q+T+W E ++L DY NK +GL+ +YY PR ++ +
Sbjct: 620 AAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEA 674
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ +S+ G F+ + + +L + + YP + GD + ++ ++ KY
Sbjct: 675 LADSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 727
>gi|328778968|ref|XP_623833.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Apis mellifera]
Length = 752
Score = 358 bits (920), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 196/540 (36%), Positives = 305/540 (56%), Gaps = 52/540 (9%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L VLPAF+G+VP AL +FP A +T+ W + ++CC YLL+ TDPLF +IG+
Sbjct: 230 LGIIPVLPAFAGHVPRALLKLFPKANVTKSAVWNNFSD--KYCCPYLLEPTDPLFKQIGQ 287
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F++ ++E+G T H+YNCDTF+EN P +++ ++G +I+ M S DS A+WLMQGW
Sbjct: 288 QFLKTYIEEFG-TDHVYNCDTFNENEPYTSELKFLRNIGHSIFEAMNSVDSKAIWLMQGW 346
Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
LF +D FW P+ + L SVPLG+++VLDL +E P + +YG P+IWCMLHNF G
Sbjct: 347 LFYHDSVFWTEPRTRTFLTSVPLGRMIVLDLQSEQFPQYKRLNSYYGQPFIWCMLHNFGG 406
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
+ M+G + I EAR +TMVG G++ EGI QN V+Y+LM+EMA++ + V++
Sbjct: 407 TLGMFGSAEIINHRVFEARNMNGSTMVGTGLTPEGINQNYVIYELMNEMAYRKKPVNLDK 466
Query: 242 WINQYSVRRYG--RSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
W ++ RRYG + AW +TVYN +D + + VI P+++
Sbjct: 467 WFENFANRRYGDIKGNEHTVTAWKGFKNTVYNFSDTRRIRGKYVITIRPNLN-------- 518
Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
P WY+ I + + + + S YR+D++
Sbjct: 519 -----------------------FFPWRWYNKDAFIYYWYVLLQARDLKRNSTLYRHDVV 555
Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
D+TRQAL A+E++ ++IE++ + Q ++ L L +D++ +LA + FLLG WL
Sbjct: 556 DVTRQALQLIADEIYTDLIESFNKKNIDLFKQNAKLLLALFDDLEEILASSEDFLLGKWL 615
Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
+ AK LA ++E+E YE+NAR QIT+W +RDY NK WSG++ DY+ PR AI
Sbjct: 616 KMAKDLATDDEEEILYEYNARNQITLW-----GPLGEIRDYANKQWSGIVADYFKPRWAI 670
Query: 480 YFKYMIESLESG---DGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL-----ITSQW 531
+ + SL +G + R+ +R + + + R +YP ++ GD++ I S+W
Sbjct: 671 FLNELETSLTTGTRVNTTRIN--KRIFENVEKAFTFSRKIYPTKATGDSIDIAERILSEW 728
>gi|355754184|gb|EHH58149.1| Alpha-N-acetylglucosaminidase, partial [Macaca fascicularis]
Length = 650
Score = 358 bits (919), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 202/538 (37%), Positives = 303/538 (56%), Gaps = 53/538 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP +T++G+W + C++LL DP+F IG F+ +
Sbjct: 148 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRE 205
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+KE+G T HIY DTF+E PP +P Y+++ A+Y M + D++AVWL+QGWLF +
Sbjct: 206 LVKEFG-TDHIYGADTFNEMQPPSSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQ 264
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+ A+L +VP G+L+VLDLFAE +P+++ + F G P+IWCMLHNF GN ++
Sbjct: 265 PQFWGPAQIGAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 324
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+
Sbjct: 325 GALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTN 384
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ +RYG S P AW +L +VYNC+ + NR +V P
Sbjct: 385 FAAQRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 429
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
L+ TS +WY+ S V A L + S L+AS +RYDL+DLTRQ
Sbjct: 430 ----------LQMNTS------VWYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQ 473
Query: 365 ALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
A+ + + + AY L A GV EL+ +D LLA FLLG W
Sbjct: 474 AVQELVSLYYEEARSAYLSKELTSLLRAGGVLAY-----ELLPALDELLASDSRFLLGSW 528
Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
LE A+ A +E + YE N+R Q+T+W E ++L DY NK +GL+ +YY PR
Sbjct: 529 LEQARAAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWR 583
Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ + + +S+ G F+ + + +L + + YP + GD + ++ ++ KY
Sbjct: 584 LFLEALADSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 641
>gi|297701096|ref|XP_002827555.1| PREDICTED: alpha-N-acetylglucosaminidase [Pongo abelii]
Length = 836
Score = 357 bits (917), Expect = 7e-96, Method: Compositional matrix adjust.
Identities = 200/538 (37%), Positives = 302/538 (56%), Gaps = 53/538 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP +T++G+W + C++LL DP+F IG F+ +
Sbjct: 334 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRE 391
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+KE+G T HI+ DTF+E PP P Y+++ A+Y M + D++AVWL+QGWLF +
Sbjct: 392 LIKEFG-TDHIFGADTFNEMQPPSSEPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQ 450
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+ A+L +VP G+L+VLDLFAE +P+++ + F G P+IWCMLHNF GN ++
Sbjct: 451 PQFWGPAQIGAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 510
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+
Sbjct: 511 GALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTS 570
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ RRYG S P AW +L +VYNC+ + NR +V P
Sbjct: 571 FAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPS--------------- 615
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
L+ TS +WY+ S+V A L + S L+ S +RYDL+DLTRQ
Sbjct: 616 ----------LQMNTS------VWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQ 659
Query: 365 ALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
A+ + + + AY L A GV EL+ +D +LA FLLG W
Sbjct: 660 AVQELVSLYYEEARSAYLSKELASLLRAGGVLAY-----ELLPALDEVLASDSRFLLGSW 714
Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
LE A+ A +E + YE N+R Q+T+W E ++L DY NK +GL+ +YY PR
Sbjct: 715 LEQARAAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWR 769
Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ + + +S+ G F+ + + +L + + YP + GD + ++ ++ KY
Sbjct: 770 LFLEALADSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPQGDTVDLAKKIFLKY 827
>gi|307168312|gb|EFN61518.1| Alpha-N-acetylglucosaminidase [Camponotus floridanus]
Length = 737
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 190/523 (36%), Positives = 286/523 (54%), Gaps = 41/523 (7%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L VLPAF+G+VP A +FP+A +T++ W + + ++CC YLL+ TDPLF IG
Sbjct: 253 LGIVPVLPAFAGHVPRAFARLFPNANMTKINPWNNFED--KYCCPYLLEPTDPLFQIIGE 310
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F+ + E+G T HIYNCDTF+EN P Y+ ++ A+++ + + DS A+WLMQ W
Sbjct: 311 KFLRMYINEFG-TDHIYNCDTFNENEPGSTELIYLRNVSHAVFAAINAVDSKAIWLMQAW 369
Query: 123 LFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
LF +D FW P++K+ L SVP+G++++LDL +E P + K +YG P+IWCMLHNF G
Sbjct: 370 LFVHDFMFWTEPRVKSFLTSVPMGRMLILDLQSEQFPQYGRLKSYYGQPFIWCMLHNFGG 429
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
+ M+G I E R +TMVG G++ EGI QN V+Y+LM+EMA++HE VD+ A
Sbjct: 430 TLGMFGSAQIINQRTFEGRNMNGSTMVGTGLTPEGINQNYVIYELMNEMAYRHEPVDLDA 489
Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
W Y+ RRYG W L TVYN + V+ P ++ S+
Sbjct: 490 WFQNYATRRYGAWNEYAVTTWQYLGRTVYNFIGSQRIRGHYVVTRRPSLNISL------- 542
Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
+WY+ F+ + + S YR+D++D+
Sbjct: 543 ------------------------WIWYNRKNFYSMWNTFLKARHGRRNSTLYRHDVVDI 578
Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
TRQAL ++L+ I+++Y+ + + LEL +D++ +LA FLLG WL
Sbjct: 579 TRQALQLMGDDLYTIILDSYKKRNITAFRSSANALLELFDDLESILASGSNFLLGTWLSQ 638
Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
AK +A NEE+ K YE+NA+ QIT+W N + +RDY NK WSG++ DY+ PR ++
Sbjct: 639 AKDVATNEEERKSYEYNAKNQITLWGPNGE-----IRDYANKQWSGVMADYFKPRWELFL 693
Query: 482 KYMIESLESGDGFRLKDWRREWI-KLTNDWQNGRNVYPVESNG 523
K + +SL F + + + K+ + YPVE G
Sbjct: 694 KALEKSLVENTKFNVTEINNKIFDKVERPFTFSTKFYPVEPKG 736
>gi|256422141|ref|YP_003122794.1| alpha-N-acetylglucosaminidase [Chitinophaga pinensis DSM 2588]
gi|256037049|gb|ACU60593.1| Alpha-N-acetylglucosaminidase [Chitinophaga pinensis DSM 2588]
Length = 728
Score = 355 bits (911), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 191/533 (35%), Positives = 301/533 (56%), Gaps = 44/533 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF+G+VP A ++ +P+ +I + NW D + Y+LD P+F +IG+ F+E
Sbjct: 232 ILPAFTGHVPPAFKDKYPN-EIVKPTNW-----DAGFPDVYILDPNSPMFDKIGKKFLEA 285
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G T H Y+ DTF+EN PP ++ ++ +Y+ M + D AVW+MQGW+F Y+
Sbjct: 286 QTKAFG-TDHFYSADTFNENVPPSSDSSFLDAMSRKVYASMAAADPKAVWVMQGWMFHYN 344
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
+W PQ++ALLN+VP ++VLDL++E P W ++ +YG P+IW MLHNF GN M+
Sbjct: 345 ASYWHQPQIRALLNAVPDDHMIVLDLYSESHPEWRNTQAYYGKPWIWNMLHNFGGNTGMW 404
Query: 187 GILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
G +D+ A P A + M G+G++ EGIEQNP +Y LM + ++ + ++V W+
Sbjct: 405 GGMDAAAHDPATALHDPASGKMSGIGLTPEGIEQNPALYQLMIDNVWRDQPINVDTWLQS 464
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVY--NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
Y+ +RYG A+ AW +LYHTVY T+GA + +IVA P +D
Sbjct: 465 YAKQRYGAENEAVNKAWQILYHTVYIGGPTEGAPES---IIVARPTLD------------ 509
Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 363
++ E V L Y ++V+ A +LFI + +L + ++YDL+DLTR
Sbjct: 510 -----IAAERVKTK---------LEYDPAKVVPAWDLFINAAAQLKPTEGFKYDLVDLTR 555
Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
Q L YA+ L + AY+ D Q S +F+ L++DMD LL +GFLLG W+ A+
Sbjct: 556 QVLGNYASPLQQRVATAYRNKDLAAFKQYSTQFIGLLDDMDMLLGTQEGFLLGKWVSDAR 615
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
++ YE+NA+ +T+W D + S + +Y N+ W+GL++ +Y PR +F
Sbjct: 616 SNGITPAEQDLYEFNAKDLVTLWGD----KDSPVHEYSNRQWNGLIKGFYKPRWQQFFTL 671
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ SL+ G+ LK + + W NG + Y V+ GDA+ + L+ KY
Sbjct: 672 LESSLKKGETADLKAFEEQVKAFEWKWANGHDKYAVKPQGDAVKAAVQLHKKY 724
>gi|395532374|ref|XP_003768245.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Sarcophilus
harrisii]
Length = 726
Score = 355 bits (911), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 197/533 (36%), Positives = 298/533 (55%), Gaps = 43/533 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A VFP A +T LG W + C+YLL DPLF +G F+ +
Sbjct: 224 VLPAFAGHVPKAFTRVFPQAYVTHLGMWGHFNCT--YSCSYLLAPEDPLFPVVGSLFLRE 281
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+E+G T HIY+ DTF+E PP P Y+++ AA+Y M + D DAVWL+QGWLF +
Sbjct: 282 LTQEFG-TDHIYSADTFNEMEPPSSEPAYLAAATAAVYEAMIAVDVDAVWLLQGWLFQHQ 340
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW+PPQ+KA+L +VPLG+L+VLDL+AE KP++S + FYG P+IWCMLHNF GN ++
Sbjct: 341 PDFWKPPQVKAVLKAVPLGRLLVLDLYAESKPVYSRTDSFYGQPFIWCMLHNFGGNHGLF 400
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G LD++ GP +A N+T VG G+ EGI QN VVY LM+E+ +Q + D+ AW+
Sbjct: 401 GALDAVNRGPSDAWLFPNSTFVGTGIVPEGINQNEVVYALMAELGWQKGPLPDLGAWVAG 460
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ +RYG + AW +L +VYNC+ D T NR +V P + I
Sbjct: 461 FAAQRYGTPHSHAEAAWKLLLQSVYNCSGDLCTGHNRSPLVKRPSLHLDI---------- 510
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+WY+ S+V A L + + L++S +RYDL+D+TRQ
Sbjct: 511 ---------------------SVWYNRSDVFEAWRLLLEAAPVLASSPAFRYDLLDVTRQ 549
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
+ + + + A++ + + +L+ +D LLA + FLLG WLE A+
Sbjct: 550 VAQELVSLYYEELRTAFEAGAMPALLTAGGLLVFDLLPSLDELLASDERFLLGAWLEQAR 609
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
++A +E + QY+ NA Q+T+W + DY NK +GL+ YY PR ++ +
Sbjct: 610 EMAVSEAEAWQYKQNALYQLTLWGPTGN-----ILDYANKQLAGLVAGYYAPRWKLFVEM 664
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+++SL G F + E + L ++ GR +P + GD + + + +Y
Sbjct: 665 LVKSLAEGTPFHQNQFESEALLLGQNFVLGREKFPTQPQGDTVDLVKKFFLRY 717
>gi|395827009|ref|XP_003786703.1| PREDICTED: alpha-N-acetylglucosaminidase [Otolemur garnettii]
Length = 756
Score = 355 bits (910), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 196/532 (36%), Positives = 295/532 (55%), Gaps = 41/532 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP +TQL +W + C++LL DP+F IG F+ +
Sbjct: 241 VLPAFAGHVPKAITRVFPQVNVTQLSSWGHFNCS--YSCSFLLAPGDPIFSLIGSLFLRE 298
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY DTF+E PP P Y+++ A+Y M + D DAVWL+QGWLF +
Sbjct: 299 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMIAVDPDAVWLLQGWLFQHQ 357
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+KA+L +VPLG+L+VLDLFAE +P++S + F G P+IWCMLHNF GN ++
Sbjct: 358 PQFWGPTQIKAVLRAVPLGRLLVLDLFAESQPVYSRTASFQGQPFIWCMLHNFGGNHGLF 417
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+
Sbjct: 418 GALEAVNQGPKAARLFPNSTMVGTGMAPEGINQNEVVYALMAELGWRKDPVPDLVAWVTS 477
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
++ RRYG S + AW +L +VYNC+ A +
Sbjct: 478 FADRRYGISHGDAEAAWRLLLRSVYNCSGEACSGHNH----------------------- 514
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
P+ K L+ T+ +WY+ S+V A L + S L+AS +RYDL+D+TRQA
Sbjct: 515 -SPLVKRPSLQMNTT------VWYNRSDVFEAWRLLLTSAPTLAASPIFRYDLLDITRQA 567
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQ 424
+ + + + AY + + + EL+ +D +LA + FLLG WL A+
Sbjct: 568 IQELVSLYYEKARTAYLNKELVPLLRAGGLLAYELLPALDEVLASDNHFLLGSWLAQARA 627
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
+A +E + YE N+R Q+T+W + DY NK +GL+ DYY PR ++ + +
Sbjct: 628 VAISEAEANFYEQNSRYQLTLW-----GPVGNILDYANKQLAGLVADYYAPRWQLFMQAL 682
Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
L G F+ + + + L + YP + G+ + ++ ++ KY
Sbjct: 683 GNCLAQGIPFQQRQFDKNVFPLEQAFVLNSKRYPSQPQGNTMDLAKKIFLKY 734
>gi|426238067|ref|XP_004012979.1| PREDICTED: alpha-N-acetylglucosaminidase isoform 2 [Ovis aries]
Length = 739
Score = 353 bits (907), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 207/544 (38%), Positives = 301/544 (55%), Gaps = 51/544 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP AL VFP +TQ+G+W + C++LL DPLF +G F+ +
Sbjct: 239 VLPAFAGHVPKALTRVFPQVNVTQMGSWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRE 296
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY DTF+E PP P Y+++ AA+Y M + D AVWL+QGWLF
Sbjct: 297 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATAAVYQAMTAVDPGAVWLLQGWLFQNQ 355
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+ A+L +VP G+L+VLDLFAE +P++ + F G P+IWCMLHNF GN ++
Sbjct: 356 PEFWGPAQVAAVLGAVPRGRLLVLDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLF 415
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+S+ GP AR N+T+VG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+
Sbjct: 416 GALESVNQGPATARRFPNSTLVGTGMAPEGIGQNEVVYALMAELGWRKDPVADLGAWVTS 475
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
++ RRYG S + AW +L +VYNC S E + N+
Sbjct: 476 FAARRYGVSHGDAEAAWRLLLRSVYNC-----------------------SGEECRGHNH 512
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
P+ K L T+ +WY+ S+V A L + + L++S +RYDL+D+TRQA
Sbjct: 513 -SPLVKRPSLHMVTT------VWYNRSDVFEAWRLLLTATPTLASSPAFRYDLVDVTRQA 565
Query: 366 LAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
+ + + + + AY L A G+ EL+ +D +LA FLLG WL
Sbjct: 566 VQELVSLYYEEMRTAYLKKELVPLMRAGGILA-----YELLPALDQVLASDCHFLLGSWL 620
Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
E A+ A +E + YE N+R Q+T+W E ++L DY NK +GL+ DYY PR +
Sbjct: 621 EQARLAAVSETEAHFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRL 675
Query: 480 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 539
+ + + ESL G F+ + + +L + G YP + GD + + L+ KY
Sbjct: 676 FAETLAESLVQGVPFQQHQFDKNAFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKYYPR 735
Query: 540 TGVF 543
G F
Sbjct: 736 AGSF 739
>gi|426238065|ref|XP_004012978.1| PREDICTED: alpha-N-acetylglucosaminidase isoform 1 [Ovis aries]
Length = 748
Score = 353 bits (905), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 207/544 (38%), Positives = 301/544 (55%), Gaps = 51/544 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP AL VFP +TQ+G+W + C++LL DPLF +G F+ +
Sbjct: 248 VLPAFAGHVPKALTRVFPQVNVTQMGSWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRE 305
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY DTF+E PP P Y+++ AA+Y M + D AVWL+QGWLF
Sbjct: 306 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATAAVYQAMTAVDPGAVWLLQGWLFQNQ 364
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+ A+L +VP G+L+VLDLFAE +P++ + F G P+IWCMLHNF GN ++
Sbjct: 365 PEFWGPAQVAAVLGAVPRGRLLVLDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLF 424
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+S+ GP AR N+T+VG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+
Sbjct: 425 GALESVNQGPATARRFPNSTLVGTGMAPEGIGQNEVVYALMAELGWRKDPVADLGAWVTS 484
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
++ RRYG S + AW +L +VYNC S E + N+
Sbjct: 485 FAARRYGVSHGDAEAAWRLLLRSVYNC-----------------------SGEECRGHNH 521
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
P+ K L T+ +WY+ S+V A L + + L++S +RYDL+D+TRQA
Sbjct: 522 -SPLVKRPSLHMVTT------VWYNRSDVFEAWRLLLTATPTLASSPAFRYDLVDVTRQA 574
Query: 366 LAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
+ + + + + AY L A G+ EL+ +D +LA FLLG WL
Sbjct: 575 VQELVSLYYEEMRTAYLKKELVPLMRAGGILA-----YELLPALDQVLASDCHFLLGSWL 629
Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
E A+ A +E + YE N+R Q+T+W E ++L DY NK +GL+ DYY PR +
Sbjct: 630 EQARLAAVSETEAHFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRL 684
Query: 480 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 539
+ + + ESL G F+ + + +L + G YP + GD + + L+ KY
Sbjct: 685 FAETLAESLVQGVPFQQHQFDKNAFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKYYPR 744
Query: 540 TGVF 543
G F
Sbjct: 745 AGSF 748
>gi|332018247|gb|EGI58852.1| Alpha-N-acetylglucosaminidase [Acromyrmex echinatior]
Length = 686
Score = 353 bits (905), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 192/539 (35%), Positives = 299/539 (55%), Gaps = 41/539 (7%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L VLPAF+G+VP A +FP+A +T++ W K + ++CC YLL+ TDPLF IG
Sbjct: 184 LGIVPVLPAFAGHVPRAFARLFPNANMTKINPW--NKFEDKYCCPYLLEPTDPLFRTIGE 241
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F++ + E+G T HIYNCDTF+EN P Y+ ++G +I+S M + DS A+WLMQ W
Sbjct: 242 KFLQMYIDEFG-TDHIYNCDTFNENEPGNTELIYLRNVGHSIFSAMNAVDSKAIWLMQAW 300
Query: 123 LFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
LF +D FW +++A L SVP+G+++VLDL +E P + K +YG P+IWCMLHNF G
Sbjct: 301 LFVHDIMFWTKSRVRAFLTSVPIGRMLVLDLQSEQFPQYDRLKSYYGQPFIWCMLHNFGG 360
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
+ M+G I E R ++TMVG G++ EGI QN V+Y+LM+EMA++H V++
Sbjct: 361 TLGMFGSAQIINQRTFEGRNMNDSTMVGTGLTPEGINQNYVIYELMNEMAYRHVPVNLDN 420
Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
W Y+ RRYG AW L TVYN + VI P ++ S+ +
Sbjct: 421 WFESYATRRYGAWNEYAVAAWQHLGRTVYNFIGTQKIRGHYVITRRPSLNISLWT----- 475
Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
WY + +F+ + + YR+D++D+
Sbjct: 476 --------------------------WYDRKDFYAMWNMFLKARYGRGNNTLYRHDVVDI 509
Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
TRQAL A+++++ I++ Y+ + + LEL +D++ +LA + FLLG WL
Sbjct: 510 TRQALQLIADDIYMTILDCYKKKNITAFQSSANALLELFDDLESILASGNNFLLGTWLAQ 569
Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
AK +A NEE+ + YE+NAR QIT+W N + +RDY NK WSG++ DY+ R ++
Sbjct: 570 AKDIAVNEEERRSYEYNARNQITLWGPNGE-----IRDYANKQWSGVVADYFKLRWELFL 624
Query: 482 KYMIESLESGDGFRLKDWR-REWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 539
K + +SL + + R + ++ + +YP+E+ GD + + + +K+ +G
Sbjct: 625 KALEKSLIQRIEPNITEINDRIFHEVERSFTFSTKLYPIETKGDTIDIAMKIISKWYKG 683
>gi|156545487|ref|XP_001606979.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Nasonia vitripennis]
Length = 755
Score = 352 bits (903), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 196/533 (36%), Positives = 297/533 (55%), Gaps = 43/533 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP VFP A +T++ +W + ++CC Y LD TDPLF +GR F++
Sbjct: 258 VLPAFAGHVPRDFIRVFPEANVTKVVSWNGFED--QYCCPYSLDPTDPLFKTVGREFLKA 315
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
E+G T+HIYNCD+F+EN P +Y+S+ G AIYSGM D DA+WLMQGWLF +
Sbjct: 316 YTDEFG-TNHIYNCDSFNENDPHTGDLDYLSNTGKAIYSGMTGADPDAIWLMQGWLFVHS 374
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW P++KA + SVP+GK+++LDL +E P + ++G P+IWCMLHNF G + M+
Sbjct: 375 EYFWTFPRVKAFVTSVPIGKMIILDLQSEQFPQYKRFHSYFGQPFIWCMLHNFGGTLGMF 434
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G I G EART+ +TM+G G++ EGI QN V+Y+ M+EM+++ + V + W Y
Sbjct: 435 GSAGVINKGVFEARTTNGSTMIGTGLTPEGINQNYVIYEFMNEMSYRKKPVVLDNWFENY 494
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+VRRYG++ +I+ +W L +YN DG T G Y
Sbjct: 495 AVRRYGQADESIRTSWQELGRELYN-YDGKTK-------------------IRGHYV--- 531
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
++K L E P WY + F+ +GN + +++DL+D+TRQAL
Sbjct: 532 --ITKRPSLNIE------PWYWYDLKTFLAVWNSFVHAGNGTMKNELFKHDLVDITRQAL 583
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
A+ ++ +I AY + + S L+L +D++ LA FLLG WLE AK +A
Sbjct: 584 QITADFIYADIKAAYTQKNLTQLQIASSHLLDLFDDLEKNLASSKDFLLGSWLEDAKAIA 643
Query: 427 QN--EEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
+ YE+NAR QIT+W + + DY NK WSG++ DY+ PR IY K +
Sbjct: 644 PEGATRDRENYEFNARNQITLWGPRGE-----IVDYANKQWSGVVADYFKPRWEIYLKEL 698
Query: 485 IESLESGDGFRLKDWRRE-WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
ES+ +R + ++ + + +YP + GD+++ ++ LY K+
Sbjct: 699 QESIRKQTAVPTAKLKRMIFNQVELPFSYSKKLYPTQPKGDSILIAKELYAKW 751
>gi|354485058|ref|XP_003504701.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cricetulus griseus]
gi|344251941|gb|EGW08045.1| Alpha-N-acetylglucosaminidase [Cricetulus griseus]
Length = 740
Score = 352 bits (902), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 198/543 (36%), Positives = 303/543 (55%), Gaps = 53/543 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP + QLG+W + C++LL DP+F IG F+ +
Sbjct: 240 VLPAFAGHVPKAITRVFPQVNVFQLGSWGHFNCS--YSCSFLLAPGDPVFPLIGSLFLRE 297
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+KE+G T HIY DTF+E P P ++++ AA+Y M S D DA+WL+QGWLF +
Sbjct: 298 LIKEFG-TDHIYGADTFNEMQPISSDPSFLTAATAAVYEAMISVDPDAIWLLQGWLFQHQ 356
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+KA+L +VP G+L+VLDLFAE P++ + FYG P+IWCMLHNF GN ++
Sbjct: 357 PQFWGPAQVKAVLQAVPRGRLLVLDLFAESHPVYMQTASFYGQPFIWCMLHNFGGNHGLF 416
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG G++ EGI QN +VY LM+E+ ++ + V D++ W+++
Sbjct: 417 GALEAVNQGPRAARIFPNSTMVGTGIAPEGIGQNEMVYALMAELGWRKDPVPDLEVWVSR 476
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ RYG S P + AW +L +VYNC + NR +V P + + I
Sbjct: 477 FASHRYGMSHPDAEAAWRLLLRSVYNCPGETYNGHNRSPLVKRPSLQINTI--------- 527
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+WY+ S+V A L + + L+ S +RYDL+D+TRQ
Sbjct: 528 ----------------------VWYNRSDVFEAWRLLLTAAPNLTTSKAFRYDLLDVTRQ 565
Query: 365 ALAKYANELFLNIIEAYQ------LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
+L + + + A+ L A G+ ++R+ L+ +D LLA FLLG W
Sbjct: 566 SLQELVSLFYEEARIAFMKEELDLLLRAGGI--ITRK---LLPALDELLASDSRFLLGTW 620
Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
L A+ +A +E++ + YE N+ Q+T+W E +++ DY NK +GL+ DYY PR
Sbjct: 621 LNQARAMAVSEDEAQFYELNSLYQLTLW----GPEGNIM-DYANKQLAGLVADYYQPRWG 675
Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
++ + + SL G FR ++ + L + + YP GD + S+ L+ KY
Sbjct: 676 LFMEALAHSLARGVPFRQHEFEKNVFPLELAFIINKKRYPSHPQGDTVDLSKKLFLKYHP 735
Query: 539 GTG 541
G
Sbjct: 736 QAG 738
>gi|444714090|gb|ELW54978.1| Alpha-N-acetylglucosaminidase [Tupaia chinensis]
Length = 724
Score = 351 bits (901), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 201/534 (37%), Positives = 302/534 (56%), Gaps = 43/534 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF G+VP A+ VFP +TQLG+W + C++LL DP+F IG F+ +
Sbjct: 211 VLPAFPGHVPKAITRVFPQVNVTQLGSWGHFNCS--YSCSFLLAPGDPMFPIIGSLFLRE 268
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY DTF+E PP P Y+++ AAIY+ M + D AVWL+QGW+F +
Sbjct: 269 LTKEFG-TDHIYGADTFNELQPPSSEPSYLAAATAAIYAAMTAVDPGAVWLLQGWIFQHQ 327
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+KA+L +VP G+L+VLDLFAE +P++ + F G P+IWCMLHNF GN +Y
Sbjct: 328 PDFWGPAQVKAVLEAVPRGRLLVLDLFAETRPVYLYTASFLGQPFIWCMLHNFGGNHGLY 387
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ +GP AR N++MVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+
Sbjct: 388 GTLEAVNWGPKAARLFPNSSMVGTGMAPEGINQNEVVYALMAELGWRKDPVPDLAAWVTS 447
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDG-ATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
Y+ RRYG S+ + AW +L +VYNC+ + NR
Sbjct: 448 YADRRYGVSLGDAEAAWRLLLRSVYNCSGQMCSGHNRS---------------------- 485
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
P+ K L+ T+ +WY+ S+V A L + + L+AS T+RYDL+D+TRQ
Sbjct: 486 ---PLVKRPSLQMNTT------VWYNRSDVFEAWRLLLTAAPTLAASPTFRYDLLDVTRQ 536
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
A+ + + + AY + + + + EL+ D+D LLA F+LG WLE A+
Sbjct: 537 AVQELVSLYYEEARTAYLNKELVSLLRAGGILVYELLPDLDNLLATDGRFMLGSWLEQAR 596
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
+A +E + + YE N+R Q+T+W + DY NK +GL+ DYY PR ++ +
Sbjct: 597 AVAVSETEAQFYEQNSRYQLTLWGPTGN-----ILDYANKQLAGLVADYYAPRWQLFMEM 651
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
+ SL G F+ + + +L + YP + GD + ++ ++ KY
Sbjct: 652 LANSLTQGIPFQQHQFDQNAFQLEQAFVLSVERYPSQPQGDTVELAKKIFLKYF 705
>gi|348562747|ref|XP_003467170.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
[Cavia porcellus]
Length = 750
Score = 349 bits (896), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 192/532 (36%), Positives = 304/532 (57%), Gaps = 41/532 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP ITQLG+W + C++LL DPLF IG F+ +
Sbjct: 248 VLPAFAGHVPKAIGRVFPQVNITQLGSWGHFNCS--YSCSFLLAPEDPLFPLIGGIFLRE 305
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
++E+G T+HIY DTF+E PP P Y+++ A++ M + DSDAVWL+QGWLF +
Sbjct: 306 LIREFG-TNHIYGADTFNEMQPPSSDPAYLAAATEAVFKAMVAVDSDAVWLLQGWLFQHQ 364
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+ A+L +VP G+L+VLDLFAE +P+++ + F G P+IWCMLHNF GN ++
Sbjct: 365 PEFWGPAQVGAVLGAVPQGRLLVLDLFAESQPVYTRTASFRGQPFIWCMLHNFGGNHGLF 424
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG G++ EGI QN VVY LM+E+ ++ + V D+ AW+++
Sbjct: 425 GALEAVNRGPTAARLFPNSTMVGTGITPEGIGQNEVVYALMAELGWRKDPVPDLLAWVSR 484
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
++ RRYG + P + AW +L +VYNC+ A +
Sbjct: 485 FAERRYGVAQPDAEAAWRLLLRSVYNCSGEACRGHNH----------------------- 521
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
P+ + L+ T+ +WY+ S+V A L + + +L+ S T+RYDL+D+TRQA
Sbjct: 522 -SPLVRRPSLQMNTA------VWYNRSDVFEAWRLLLKASPKLTTSPTFRYDLLDVTRQA 574
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQ 424
L + + + + AY + G+ + +L+ +D +LA FLLG WL A+
Sbjct: 575 LQELVSLYYEEVRAAYLHQELAGLLRAGGVLAYQLLPALDEVLASDHHFLLGSWLAQARA 634
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
A +E + + YE N+R Q+T+W E ++L DY NK +GL+ YY PR ++ + +
Sbjct: 635 AAASETEARLYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVAHYYAPRWQLFIESL 689
Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+SL F+ + ++ L + Y + GD + ++ ++ ++
Sbjct: 690 ADSLARAAPFQQHQFDKDVFLLEQAFVLSSRRYRSQPQGDTVDLARKVFLRF 741
>gi|431890602|gb|ELK01481.1| Alpha-N-acetylglucosaminidase [Pteropus alecto]
Length = 740
Score = 349 bits (895), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 196/533 (36%), Positives = 300/533 (56%), Gaps = 43/533 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP +TQ+ +W + C++LL DPLF +G F+ +
Sbjct: 238 VLPAFAGHVPKAITRVFPQVNVTQMDSWGHFNCS--YSCSFLLAPEDPLFPIVGSLFLRE 295
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY DTF+E PP P Y+++ AA+Y M + D DAVWL+QGWLF +
Sbjct: 296 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATAAVYQAMTTVDPDAVWLLQGWLFQHQ 354
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+ A+L +VP G+L+VLDLFAE +P++ + F G P+IWCMLHNF GN ++
Sbjct: 355 PQFWGPAQVGAVLGAVPRGRLLVLDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLF 414
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG GM+ EGI+QN VVY LM+E+ ++ + V D+ AW+
Sbjct: 415 GALEAVNQGPAAARLFPNSTMVGTGMAPEGIDQNEVVYALMAELGWRKDPVTDLGAWVTS 474
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ RRYG S + AW +L +VYNC+ + N +V PS+ VT
Sbjct: 475 FAARRYGVSHGDAEAAWRLLLRSVYNCSGEDCRGHNHSPLVR----RPSLQMVTT----- 525
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+WY+ S+V A + + + L+ S + Y+L+D+TRQ
Sbjct: 526 ----------------------VWYNQSDVFEAWRMLLTATPTLATSPLFSYELVDITRQ 563
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAK 423
A+ + + + + AY D +F+ + EL+ +D +LA FLLG WLE A+
Sbjct: 564 AIQELVSLYYEEVRTAYLNKDLVTLFRAAGILAYELLPSLDNILATDSHFLLGSWLEQAR 623
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
A ++ + YE N+R Q+T+W E ++L DY NK +GL+ +YY PR ++ +
Sbjct: 624 AAAVSKAEASFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLIANYYTPRWRLFMEM 678
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ESL G F+ + + +L + YP + GD + ++ L+ KY
Sbjct: 679 LVESLVQGIPFQQHQFDKNAFQLEQTFVFSTQRYPNQPQGDTVDLAKKLFLKY 731
>gi|295132875|ref|YP_003583551.1| hypothetical protein ZPR_1010 [Zunongwangia profunda SM-A87]
gi|294980890|gb|ADF51355.1| predicted protein [Zunongwangia profunda SM-A87]
Length = 750
Score = 348 bits (894), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 196/541 (36%), Positives = 304/541 (56%), Gaps = 63/541 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPA+ + FP A + + NW + D TY+LDA DPLF EIG+ F+E+
Sbjct: 243 VLPAFTGHVPASFKKFFPDADLKKT-NWGNDFGD-----TYILDAEDPLFAEIGKRFLEK 296
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
Q + +G T H Y DTF+EN PP D P+Y+ L I+ GM++ D +A W+MQGWLF S+
Sbjct: 297 QEEVFG-TDHFYTADTFNENEPPSDDPKYLGELSEKIFEGMKAADPEATWVMQGWLFYSH 355
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW+ PQ+K LL++VP ++++LDL E++P+W ++ FYG +IW MLHNF GNI M+
Sbjct: 356 KDFWKTPQIKGLLSTVPDDRMIILDLATEIEPVWKQTEAFYGKQWIWNMLHNFGGNISMF 415
Query: 187 GILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
G ++++A P A S + + G+G++ME IEQNPV+Y+LM++ ++ +++K+W+
Sbjct: 416 GRIETVAEQPALALNDSTSGNLKGIGLTMEAIEQNPVLYELMTDNTWRDTPIELKSWLKN 475
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
Y+ RYG +I +AW++L T YN T D +I A P ++ Y
Sbjct: 476 YTRNRYGAVNDSILEAWDILVATAYNGT-TIRDGAESIIAARP------------TFEGY 522
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
+ + + Y +++ A +LFI + + S+ + YDL+DL+RQ
Sbjct: 523 RR--------------WARTKMNYDPLDLLPAWDLFIGARDRFKDSDGFAYDLVDLSRQV 568
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
LA YA + + AY+ ND + S L L+ D+D LLA FLLGPW+ A+
Sbjct: 569 LANYALPVQQQMRIAYENNDKEAFKKHSEELLTLISDLDRLLATRKDFLLGPWIADARSW 628
Query: 426 AQNEEQEKQYEWNARTQITMWF--DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
E++ YE NAR IT+W DN L +Y + WSG+L D+Y PR ++
Sbjct: 629 GTTPEEKALYERNARDLITLWGGPDNP------LHEYSCRQWSGVLDDFYKPR----WQQ 678
Query: 484 MIESLESGDG--------FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 535
I +E+ G ++K+W +W+ N YP + +GD+ ++ LY+K
Sbjct: 679 FIADVEANWGDFDQEVFDEKIKEWEWKWV-------NKEEAYPTQPSGDSYKVAKALYDK 731
Query: 536 Y 536
Y
Sbjct: 732 Y 732
>gi|344285558|ref|XP_003414528.1| PREDICTED: alpha-N-acetylglucosaminidase [Loxodonta africana]
Length = 744
Score = 348 bits (892), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 200/538 (37%), Positives = 295/538 (54%), Gaps = 53/538 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP +TQ+G+W + C++LL DP+F IG F+ +
Sbjct: 242 VLPAFAGHVPKAVTRVFPQVNVTQMGSWGHFNCS--YSCSFLLAPGDPMFPIIGSLFLRE 299
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
E+G T HIY DTF+E PP P Y+++ AA+Y M + D DAVWL+QGWLF +
Sbjct: 300 LTTEFG-TDHIYGADTFNEMQPPSSEPSYLAAATAAVYEAMITVDPDAVWLLQGWLFQHQ 358
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+ A+L +VP G L+VLDLFAE +P++ + F G P+IWCMLHNF GN ++
Sbjct: 359 PQFWGPAQVGAVLGAVPRGHLLVLDLFAETQPVYIRTASFQGQPFIWCMLHNFGGNHGLF 418
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+
Sbjct: 419 GTLETVNQGPAAARLFPNSTMVGTGMAPEGIGQNEVVYALMAELGWRKDPVPDLGAWVAS 478
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ RRYG + AW +L +VYNC+ + + NR
Sbjct: 479 FAARRYGGIHQDAETAWRLLLRSVYNCSGESCSGHNRS---------------------- 516
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
P+ K L+ T+ +WY+ S+V A L +A+ L+AS +RYDL+D+TRQ
Sbjct: 517 ---PLVKRPSLQMNTT------VWYNRSDVFEAWRLLLATTPALAASPAFRYDLLDVTRQ 567
Query: 365 ALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
A + + + + AY L A GV EL+ +D +LA FLLG W
Sbjct: 568 AAQELVSFYYGEVRTAYLNKELVHLLRAGGVLA-----YELLPALDEVLASDSRFLLGSW 622
Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
LE A+ A +E + +E N+R Q+T+W + DY NK +GL+ DYY PR
Sbjct: 623 LEQARVAAVSEAEAHFFEQNSRYQLTLW-----GPVGNILDYANKQLAGLVSDYYTPRWQ 677
Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ ++ESL F+ + + +L + YP + GD + ++ L+ KY
Sbjct: 678 LFVGALVESLVQDVPFQQRQFDENVFQLEQAFVLNTRRYPTQPKGDTVDLAKRLFLKY 735
>gi|291406137|ref|XP_002719212.1| PREDICTED: alpha-N-acetylglucosaminidase [Oryctolagus cuniculus]
Length = 743
Score = 347 bits (891), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 203/539 (37%), Positives = 305/539 (56%), Gaps = 53/539 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP +TQLG+W + C++LL DP+F IG F+ +
Sbjct: 241 VLPAFAGHVPKAVTRVFPHINVTQLGSWGHFNCS--YSCSFLLAPEDPMFPLIGSLFLRE 298
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+E+G T H+Y DTF+E PP P Y+++ AA++ M + D DAVWL+QGWLF +
Sbjct: 299 LTREFG-TDHVYGADTFNEMQPPSSEPSYLAAATAAVFEAMIAVDPDAVWLLQGWLFQHQ 357
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+KA+LN+VP G+L+VLDLFAE +P+++ + F G P+IWCMLHNF GN ++
Sbjct: 358 PQFWGPSQVKAVLNAVPRGRLLVLDLFAENQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 417
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG G++ EGI QN VVY LM+E+ ++ E V D++AW+
Sbjct: 418 GALEAVNRGPAAARLFPNSTMVGTGIAPEGISQNEVVYALMAELGWRKEPVPDLEAWVTS 477
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ RRYG + P AW +L +VYNC+ D NR +V P
Sbjct: 478 FAGRRYGVAHPDAGAAWRLLLRSVYNCSGDACRGHNRSPLVRRPS--------------- 522
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
L+ T+ +WY+ S+V A L + + L++S +RYDL+D+TRQ
Sbjct: 523 ----------LQLNTT------VWYNRSDVFEAWRLLLKATPTLASSPAFRYDLLDVTRQ 566
Query: 365 ALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
A+ + + + AY L A GV EL+ +D +LA FLLG W
Sbjct: 567 AVQELVSLYYEEARTAYLHKELATLLRAGGVLA-----YELLPALDRVLATDSRFLLGSW 621
Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
LE A+ A +E + + YE N+R Q+T+W E ++L DY NK +GL+ YY PR
Sbjct: 622 LEQARAAAASEAEAQLYEQNSRFQLTLW----GPEGNIL-DYANKQLAGLVAQYYSPRWQ 676
Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
++ + + +SL G F+ + + + +L + YP + GD + +Q ++ KY
Sbjct: 677 LFLEALADSLARGVPFQQRLFDKLVFRLEQAFVLSSRRYPTQPQGDTVDLAQKIFLKYF 735
>gi|90399367|emb|CAJ86183.1| H0212B02.15 [Oryza sativa Indica Group]
gi|116311963|emb|CAJ86322.1| OSIGBa0113E10.5 [Oryza sativa Indica Group]
Length = 692
Score = 345 bits (884), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 163/262 (62%), Positives = 205/262 (78%), Gaps = 6/262 (2%)
Query: 208 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 267
+GVGMSMEGIEQNP+VYDLMSEMAF H +VD++ W+ Y RRYG+S+ +QDAW +LY
Sbjct: 427 IGVGMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVETYPTRRYGKSIVGLQDAWKILYQ 486
Query: 268 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAV-LKSETSSY 322
T+YNCTDG DKNRDVIVAFPDV+P +I T G Y + Y +SK + + + Y
Sbjct: 487 TLYNCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSSSKTYSTKLSKNYIAVDASNDEY 545
Query: 323 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 382
+HPHLWY T VIRALELF+ G+E+S SNT+RYDL+DLTRQ LAKYAN++F+ IIE+Y+
Sbjct: 546 EHPHLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVDLTRQTLAKYANQVFVKIIESYK 605
Query: 383 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 442
N+ + V L + F++LV D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQ
Sbjct: 606 ANNVNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLESAKGLARDKEQEMQYEWNARTQ 665
Query: 443 ITMWFDNTQEEASLLRDYGNKY 464
ITMWFDNT+ +ASLLRDYG +
Sbjct: 666 ITMWFDNTKTKASLLRDYGEAH 687
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 106/154 (68%), Positives = 127/154 (82%), Gaps = 3/154 (1%)
Query: 6 FAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
F VLPAFSGN+PAAL++ FPSAK+T LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FI
Sbjct: 296 FPVLPAFSGNIPAALRSKFPSAKVTHLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFI 355
Query: 66 EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
E+Q++EYG TSH+Y+CDTFDENTPP+ P YISSLGAA + GMQSGD DA+WLMQGWLFS
Sbjct: 356 EEQIREYGGTSHVYSCDTFDENTPPLSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFS 415
Query: 126 YDPFWRPPQMKALLNSVPLG---KLVVLDLFAEV 156
YDPFW PPQMK + G +V DL +E+
Sbjct: 416 YDPFWEPPQMKIGVGMSMEGIEQNPIVYDLMSEM 449
>gi|403304646|ref|XP_003942904.1| PREDICTED: alpha-N-acetylglucosaminidase [Saimiri boliviensis
boliviensis]
Length = 754
Score = 344 bits (883), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 196/532 (36%), Positives = 303/532 (56%), Gaps = 41/532 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSG+VP A+ VFP +TQ+G+W + C++LL DP+F +G F+ +
Sbjct: 251 VLPAFSGHVPRAINRVFPRVNVTQMGSWGHFNCS--YSCSFLLAPEDPIFPILGSLFLRE 308
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY DTF+E PP P Y+++ AA+Y M + D+DAVWL+QGWLF +
Sbjct: 309 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATAAVYEAMIAVDTDAVWLLQGWLFQHQ 367
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q++A+L +VP G+L+VLDLFAE +P+++ + F G P+IWCMLHNF GN ++
Sbjct: 368 PQFWGPAQVRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLF 427
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E++++ + V D+ AW+
Sbjct: 428 GALEAVNRGPEAARLFPNSTMVGTGMAPEGINQNEVVYSLMAELSWRKDPVPDLAAWVTS 487
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
++ +RYG S P AW +L +VYNC+ A +
Sbjct: 488 FATQRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNH----------------------- 524
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
P+ + L+ T+ +WY+ S+V A L +++ L+AS T+RYDL+D+TRQA
Sbjct: 525 -SPLVRRPSLQMNTT------VWYNRSDVFEAWRLLLSAAATLAASPTFRYDLLDVTRQA 577
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQ 424
+ + + AY + H + + EL+ +D +LA FLLG WLE A+
Sbjct: 578 VQELVGLYYEEARSAYLSKELHSLLRAGGILAYELLPALDEVLASDSHFLLGSWLEQARA 637
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
+A +E + YE ++R Q+T+W E ++L DY NK +GL+ YY PR ++ + +
Sbjct: 638 VAVSEAEADFYEQSSRYQLTLW----GPEGNIL-DYANKQLAGLVASYYTPRWRLFLEVL 692
Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
S+ G F + + +L + + YP + GD + ++ ++ KY
Sbjct: 693 AASVAQGIPFPQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 744
>gi|320162905|gb|EFW39804.1| lysosomal alpha-N-acetyl glucosaminidase [Capsaspora owczarzaki
ATCC 30864]
Length = 786
Score = 342 bits (877), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 189/534 (35%), Positives = 297/534 (55%), Gaps = 38/534 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP+F+G+VP+AL FP+A ITQ +W + ++CC LDA+DPLF +IG F+
Sbjct: 275 VLPSFAGHVPSALAQHFPNANITQSSDWNNFPD--QYCCVGFLDASDPLFTQIGAEFLRL 332
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YG T+H+YNCD F+E TP Y+ G A+Y M + D AVW+MQGWLF +
Sbjct: 333 QNETYG-TNHLYNCDQFNEMTPASTDLGYLKQAGMAVYQSMTAYDPAAVWVMQGWLFFNE 391
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
+W +++ALL+ VP +++LDLF++V P+W+ + +YG P+IW MLH+F GNI +Y
Sbjct: 392 AAWWSNDRVQALLSGVPDDHMIILDLFSDVTPVWNRLESYYGKPFIWNMLHDFGGNIGLY 451
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
GIL SI GP A + TMVG+G++ EGI QN ++Y+ M E ++ V++ W++ +
Sbjct: 452 GILPSINEGPFAALATPGNTMVGIGLTPEGINQNYILYEFMMENMWRSAPVNLPTWVDAF 511
Query: 247 SVRRYGRSVPAIQD-AWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
RRYG S PA+ A+ L +VYNCT+G + ++ P V+
Sbjct: 512 VGRRYGPSTPAVAKLAYQQLLQSVYNCTNGQYSVTKSLLEIRPAVN-------------- 557
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIA---SGNELSASNTYRYDLIDLT 362
+S+ + + +L+Y VI A++ +A S +L++ +RYD++D T
Sbjct: 558 ---MSRNGFMPT--------NLYYDPGHVILAVDHILAAAKSAPQLASVVPFRYDVVDFT 606
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
RQ L+ A + N+ A AH V + + L+ D+D LL FLLGPWL +A
Sbjct: 607 RQMLSNLAIDFHSNLTLALTSKQAHLVHLYGQGIVGLIADLDELLVSDAHFLLGPWLAAA 666
Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
+ ++N + E+NAR QIT+W N + + DY +K W+GL+ YY PR ++
Sbjct: 667 RSWSENTAAQDLLEFNARNQITLWGPNGE-----ITDYASKQWAGLMSSYYRPRWELFVS 721
Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ + ES F + +++ WQ+ + + V GD++ + L KY
Sbjct: 722 FASAAAESDLPFNDAAFNAAVLEVEKAWQHSHHNFTVTPLGDSIAIATRLRAKY 775
>gi|351699889|gb|EHB02808.1| Alpha-N-acetylglucosaminidase, partial [Heterocephalus glaber]
Length = 652
Score = 342 bits (877), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 193/522 (36%), Positives = 294/522 (56%), Gaps = 41/522 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP +TQLG+W + C++LL DPLF IG F+ +
Sbjct: 151 VLPAFAGHVPKAVTRVFPQVNVTQLGSWGHFNCS--YSCSFLLAPGDPLFPLIGSLFLRE 208
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+E+G T H Y DTF+E PP P Y+++ AA+Y M + D DAVWL+QGWLF +
Sbjct: 209 LNREFG-TDHFYGADTFNEMQPPSSEPAYLAAATAAVYEAMTAVDPDAVWLLQGWLFQHQ 267
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+ A+L +VP G+L+VLDLFAE +P+++ + F G P+IWCMLHNF GN ++
Sbjct: 268 PEFWGPAQVGAVLGAVPQGRLLVLDLFAENQPVYTRTASFGGQPFIWCMLHNFGGNHGLF 327
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+T+VG G++ EGI QN VVY LM+E+ ++ + V D+ AW+ +
Sbjct: 328 GALEAVNRGPAAARLFPNSTVVGTGIAPEGIGQNEVVYALMAELGWRKDPVPDLSAWVAR 387
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
++ +RYG + P AW +L H+VYNC+ A +
Sbjct: 388 FAEQRYGVAQPDAVLAWRLLLHSVYNCSGEACRGHNH----------------------- 424
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
P+ + L+ T+ +WY+ S+V A L + + L+AS +RYDL+D+TRQ
Sbjct: 425 -SPLVRRPSLQMNTT------VWYNRSDVFEAWRLLLKATPNLTASPAFRYDLLDVTRQG 477
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQ 424
L + + + AY + G+ + +L+ +D +LA FLLG WLE A+
Sbjct: 478 LQELVSLYYEEARAAYMRQELEGLLRAGGVLAYKLLPALDEVLASDHRFLLGSWLEQARA 537
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
+A + + YE N+R Q+T+W E ++L DY NK +GL+ DYY PR ++ + +
Sbjct: 538 VAVSSAEADLYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYVPRWRLFVETL 592
Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL 526
SL G F+ + + + L + R YP + GD +
Sbjct: 593 ASSLARGVPFQQQQFNSDVFLLEQAFVLSRKRYPSQPQGDTV 634
>gi|405964692|gb|EKC30145.1| Alpha-N-acetylglucosaminidase [Crassostrea gigas]
Length = 859
Score = 341 bits (875), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 187/561 (33%), Positives = 302/561 (53%), Gaps = 37/561 (6%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRW--------------------CCT 47
VLP F+G+VP A +P A +++L +W W CC
Sbjct: 254 VLPGFAGHVPEATILRYPQANVSRLTDWAGFNQSFCWHYPTANVSRLRDWGHFNKTYCCN 313
Query: 48 YLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSG 107
YLLD DPLF++I FI++ E+G H+Y+ DTF+E P +S EY++ G +Y
Sbjct: 314 YLLDFNDPLFMKIAVRFIKEMENEFG-VDHVYSVDTFNEMRPRSNSTEYLALSGRTVYKS 372
Query: 108 MQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFY 167
++ DS A+WLMQGWLF FW+ PQ+KALL +VP G++++LDL++E+ PI++ ++ +Y
Sbjct: 373 LKEADSKAIWLMQGWLFIDGGFWKQPQIKALLTAVPQGEMIILDLYSEIIPIYTQTESYY 432
Query: 168 GVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLM 227
G P+IWCMLH+F G +E+YG L I GP R N++MVG+GM+ EGI QN VVY+
Sbjct: 433 GQPFIWCMLHDFGGTMELYGALKLINEGPFNGRAFPNSSMVGLGMTPEGIFQNEVVYEFF 492
Query: 228 SEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAF 287
+E ++ D+ WI++Y + RYG++ I AW L ++VYN +D D + + I
Sbjct: 493 TENVWRKAPRDISTWISKYVLNRYGKTNKFIDLAWQYLKNSVYNNSDNLKDHDSNAI--- 549
Query: 288 PDVDPSIISVTE---GKYQN----YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALEL 340
PD PS+ G Y N + ++ + +WY+ ++ A ++
Sbjct: 550 PDHRPSLSPALHPDLGIYNNTDYLHDNSINIIVTTLPRMTPLIQQDVWYNPEDLYVAWDI 609
Query: 341 FIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELV 400
+ +E S S+ + YD++D+TR +L + + + +++ A+ D H V + L L+
Sbjct: 610 MTLNLDEFSNSSLFMYDIVDVTRNSLQILSIKYYTDLVYAFGRGDIHAVESHGNQLLGLL 669
Query: 401 EDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDY 460
DMD +L FLLG W+++A A + + ++NAR QIT+W + +RDY
Sbjct: 670 SDMDTVLGSDSHFLLGRWIKAATDNAMDMQDNWFLQFNARNQITLWGPRGE-----IRDY 724
Query: 461 GNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR-REWIKLTNDWQNGRNVYPV 519
K WSGL++DYY PR I+ Y ++ + + + + K+ + + YP
Sbjct: 725 ACKQWSGLIKDYYLPRWEIFVNYTLDIMAHNKTYNATELDIMIYEKVEFPFSYRLDQYPT 784
Query: 520 ESNGDALITSQWLYNKYLQGT 540
E GD++ + L+ KY T
Sbjct: 785 EPQGDSVAIVKSLHKKYRPDT 805
>gi|325103828|ref|YP_004273482.1| alpha-N-acetylglucosaminidase [Pedobacter saltans DSM 12145]
gi|324972676|gb|ADY51660.1| Alpha-N-acetylglucosaminidase [Pedobacter saltans DSM 12145]
Length = 738
Score = 340 bits (871), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 194/546 (35%), Positives = 295/546 (54%), Gaps = 55/546 (10%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L VLPAF+G+VP + FP AK+ SV + Y+L+ DP+F +IG
Sbjct: 227 LGMIPVLPAFTGHVPKSFVKKFPEAKVD------SVNWQGNFPNIYMLNPNDPMFSKIGE 280
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F+++Q +EYG T H Y+ D F+E PP P+Y+ + +YS M+ D +VW+MQ W
Sbjct: 281 QFLKEQTREYG-TDHYYSSDIFNELNPPSSDPKYLYDISEKVYSSMKKVDPKSVWVMQAW 339
Query: 123 LFSY---DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNF 179
LF FW P +M+A L VP KL++LDL+ E +P W ++ +YG ++W MLHNF
Sbjct: 340 LFVSAHGRKFWTPERMQAFLKPVPDDKLIILDLYTENRPRWKNTEGYYGKKWVWNMLHNF 399
Query: 180 AGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVD 238
GNI ++G +IA P + G+G++MEGIEQNP +Y LM + + +E ++
Sbjct: 400 GGNIGLFGKAQTIASEPARVLSDPMKGNYSGIGLTMEGIEQNPFIYQLMLDHVWNNEPIE 459
Query: 239 VKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVT 298
++ W N+Y RRYG AW +L +TVY D N+D SI+S
Sbjct: 460 LEKWTNKYITRRYGVLDNNAVKAWEILLNTVY------KDNNKD-----QGAPESILS-- 506
Query: 299 EGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 358
G+P ++ S + L+Y E +RA + I S ++L S+ ++YD+
Sbjct: 507 -------GRPTF------AQNSYWTWTDLYYDNREFVRAWDYLIKSADKLRNSDGFQYDI 553
Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
+D+TRQA+A YA L + Y D + + SRRFLEL+ D+D LLA FLLG W
Sbjct: 554 VDITRQAMANYATALQRQLAYTYYAGDVNTYEKESRRFLELLSDLDRLLATRKDFLLGIW 613
Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
++ AK+ A N+ + K YE+NA+ ++MW + + DY + WSGL+ +YY R
Sbjct: 614 IDDAKKWATNDAERKLYEFNAKDLVSMW----GHKDITINDYSARQWSGLVENYYKQRWK 669
Query: 479 IYFKYMIESLESGDGFR-------LKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQW 531
I+F ++ L++ + + +KDW EW +W N R YP + GD + S+
Sbjct: 670 IFFDQSLQKLKNNEIWDQAEFEKYIKDW--EW-----NWVNRRETYPTNTKGDPVNVSKE 722
Query: 532 LYNKYL 537
+YNKY
Sbjct: 723 MYNKYF 728
>gi|196001339|ref|XP_002110537.1| hypothetical protein TRIADDRAFT_54660 [Trichoplax adhaerens]
gi|190586488|gb|EDV26541.1| hypothetical protein TRIADDRAFT_54660 [Trichoplax adhaerens]
Length = 757
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 191/530 (36%), Positives = 290/530 (54%), Gaps = 39/530 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF+GN+P AL ++P AKI + WF R+ T LLD D LFI I + FIE+
Sbjct: 254 ILPAFNGNIPNALTKIYPKAKIVKSSPWFGFSK--RYGETALLDPRDKLFIVISKLFIEE 311
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
++K YG T H+Y+ D F+E P EY++++ + Y + S D+ AVW+MQGW+F D
Sbjct: 312 EIKAYG-TDHLYSLDLFNEIDPQSKELEYLTAVSKSAYLALNSADTKAVWIMQGWMFYND 370
Query: 128 PF-WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
+ W +++A L+ +P G++V+LDLFAEV+P + S F+G P+IWCML+NF GN MY
Sbjct: 371 NYYWENKRIQAFLSPIPKGRIVILDLFAEVEPQYHRSNSFFGHPFIWCMLNNFGGNAGMY 430
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G ++I G + A +N+TM+G GM+ EGI N ++YDLM+EM ++ VDV+ W+ Y
Sbjct: 431 GTFETITEGAISAYDMKNSTMIGTGMAPEGIGNNYIMYDLMAEMGWRKIAVDVRDWVVVY 490
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ RRYG I AW L TVYNC D R A P V PS+ +
Sbjct: 491 TERRYGGLDENIIKAWLRLSETVYNCNDMRQYHCR----ALPAVRPSLKIAND------- 539
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
+WYS ++ A E + + NE + T++YD++D+TRQAL
Sbjct: 540 --------------------VWYSADDIFFAWEHMLRANNEFISEETFQYDIVDVTRQAL 579
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
+ A ++ + + Y N+ + +EL DMD LL + FLLG W+ A Q +
Sbjct: 580 QELAFIMYKKVTQCYHDNNQETLKTAGGELIELFTDMDTLLGTNSHFLLGRWVADALQHS 639
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
N ++Q +NA QIT+W S+L DY NK W+GL+ +Y R ++ K + +
Sbjct: 640 NNISIKQQLRFNALNQITLW----GPSKSILHDYANKMWNGLVDKFYKKRWLMFIKALSD 695
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
S+ + F + + K W + N Y S+G ++ S+ L++KY
Sbjct: 696 SISNNILFDQQKFNLAVQKFEAAWASENNTYATTSSGSSVTVSKQLFSKY 745
>gi|390463730|ref|XP_003733088.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
[Callithrix jacchus]
Length = 830
Score = 336 bits (861), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 196/533 (36%), Positives = 293/533 (54%), Gaps = 50/533 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP F G+VP A+ VFP +TQ+G+W + C++LL DP+F +G F+ +
Sbjct: 334 VLPMFLGHVPKAITRVFPRVSVTQMGSWGHFNCS--YSCSFLLAPEDPIFPILGSLFLRE 391
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY DTF+E PP P Y+++ AA+Y M + D+DAVWL+QGWLF Y
Sbjct: 392 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATAAVYEAMIAVDTDAVWLLQGWLFQYQ 450
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q++A+L S P G L+VLDLFAE +P++ + F G P+IWCMLHNF GN ++
Sbjct: 451 PQFWGPAQVRAVLGSAPHGCLLVLDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLF 510
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+++ + V D+ AW
Sbjct: 511 GALEAMNRGPEAARLFPNSTMVGTGMAPEGISQNXVVYSLMAELSWXKDPVPDLVAWX-- 568
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
YG S P AW +L +VYNC+ A +
Sbjct: 569 -----YGVSHPDTGAAWRLLLRSVYNCSGEACRGHNH----------------------- 600
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
P+ + L+ T+ +WY+ S+V A L ++ L+AS T+RYDL+D+TRQ
Sbjct: 601 -SPLVRRPSLQMNTT------IWYNQSDVFEAWRLLFSAAATLAASPTFRYDLLDVTRQV 653
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFL--ELVEDMDGLLACHDGFLLGPWLESAK 423
+ + + + AY L+ G + L EL+ +D +LA FLLG WLE A+
Sbjct: 654 VQELVSLYYEEARSAY-LSKELGSLLRAGGILAYELLPALDEVLASDSHFLLGSWLEQAR 712
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
+A +E + YE N+R Q+T+W E ++L DY NK +GL+ YY PR ++ +
Sbjct: 713 AVAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVAHYYAPRRRLFLEA 767
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ S+ G F+ + + +L + + YP + GD + ++ ++ KY
Sbjct: 768 LAASVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 820
>gi|301106961|ref|XP_002902563.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
infestans T30-4]
gi|262098437|gb|EEY56489.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
infestans T30-4]
Length = 684
Score = 334 bits (857), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 185/538 (34%), Positives = 294/538 (54%), Gaps = 39/538 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
LPAF+G+VP ++ +FP+AK T+ NW + +CC Y+LD +DPL+ +IG+ F+E+
Sbjct: 160 ALPAFAGHVPEEMKALFPNAKFTRSPNWGDFSDE--FCCVYMLDFSDPLYYDIGKTFLEE 217
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q Y TS +Y CDT++E P P + + A+ M + D++AVWL+QGWLF
Sbjct: 218 QRALYDYTSSLYQCDTYNEMDPDFTDPAKLQAASRAVIDSMTAADANAVWLIQGWLFENS 277
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P +W ++KA L+ V K+++LDL++EV+P+WS ++G +++C+LHNF GN M
Sbjct: 278 PDYWTKNRVKAYLDGVSNEKMIILDLYSEVRPVWSKMDNYFGKSWVYCVLHNFGGNTGMR 337
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G L ++ PV+A N TM+GVG++MEGI QN VVYDL +MA+ +D+ W+ +
Sbjct: 338 GDLATLGTAPVQASRDSNGTMIGVGLTMEGIYQNYVVYDLTLQMAWVDTPLDMDEWVPSF 397
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ +RY + AW L +VYN T G + +I P +
Sbjct: 398 AAQRYHSQDVHTERAWGFLLQSVYNRTLGFGGVTKSLICLIP----------------HW 441
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
K V + S T Y ++ RA + + +G+EL A +TYR+DL+D+TRQ L
Sbjct: 442 KLVRDGFMPTSIT---------YDPMDITRAWKELLLAGSELHAVDTYRHDLVDVTRQFL 492
Query: 367 AKYANELFLNIIEAYQ--LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
+ + +L++ E Y+ AH + + R L +E MD +LA ++ LLG W+ A+
Sbjct: 493 SDHFMAQYLHLKEMYEGKTQPAHQLCAWTERMLLTIERMDEILATNEDSLLGNWIADARA 552
Query: 425 LAQNEEQ------EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
LA+ E + YE+ AR Q+T W DN E + DY K W+GL++ YY PR
Sbjct: 553 LAEESESIESSNLQDYYEYEARNQVTRWGDNNSET---IHDYAGKEWAGLVKGYYLPRWR 609
Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ + ++ G + ++ I WQ YP + GDAL+ SQ +Y+++
Sbjct: 610 MWLGEVCQAYTQGRTINKEVVKKARIAFELKWQLSHEHYPTTTVGDALVVSQRIYDEF 667
>gi|194216885|ref|XP_001917396.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
[Equus caballus]
Length = 744
Score = 332 bits (852), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 200/538 (37%), Positives = 301/538 (55%), Gaps = 53/538 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP +TQLG+W + C++LL DPLF +G F+ +
Sbjct: 242 VLPAFAGHVPKAITRVFPQVNVTQLGSWGHFNCS--YSCSFLLAPEDPLFPVVGSLFLRE 299
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY DTF+E PP P Y+++ AA+Y M + D DAVWL+QGWLF +
Sbjct: 300 LTKEFG-TDHIYGADTFNEMQPPSSEPAYLAAATAAVYQAMTAVDPDAVWLLQGWLFHHQ 358
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW P Q+ A+L +VP G+L+VLDLFAE +P++ + F G P+IWCMLHNF GN ++
Sbjct: 359 RTFWGPAQVGAVLGAVPRGRLLVLDLFAESQPMYIRTASFQGQPFIWCMLHNFGGNQGLF 418
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+ ++ + V D++AW+
Sbjct: 419 GALEAVNRGPAAARLFPNSTMVGTGMTPEGIGQNEVVYALMAELGWRKDPVADLEAWVTS 478
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
++ RRYG S + AW +L +VYNC+ A +
Sbjct: 479 FAARRYGVSHKDAETAWKLLLRSVYNCSAEAYSGHNQ----------------------- 515
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
P+ K L+ T+ +WY+ S+V A L + + L++S + YDL+D+TRQA
Sbjct: 516 -SPLVKRPSLQMGTT------VWYNRSDVFEAWWLLLTAAPALASSPAFLYDLVDVTRQA 568
Query: 366 LAKY-------ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
+ A +LN E L A G+ EL+ +D +LA FLLG W
Sbjct: 569 AQELISLYYEEARTAYLNK-ELVPLLRAGGILAY-----ELLPALDKVLASDSRFLLGSW 622
Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
L+ A+++A +E + YE N+R Q+T+W E ++L DY NK +GL+ DYY PR
Sbjct: 623 LKQAREMAVSEAEAHFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYTPRWQ 677
Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ + +++SL G F+ + + + +L + YP + GD + ++ + KY
Sbjct: 678 LFVEMLVQSLAQGVPFQQQQFDKNAFELEEAFVLSTRRYPSQPQGDTVDLAKKFFLKY 735
>gi|321472423|gb|EFX83393.1| hypothetical protein DAPPUDRAFT_301977 [Daphnia pulex]
Length = 799
Score = 332 bits (850), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 179/494 (36%), Positives = 288/494 (58%), Gaps = 43/494 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A++ V+P+A T L +W + + ++CC L T+PLF EIG FI++
Sbjct: 249 VLPAFAGHVPRAMERVYPNASYTHLTSWLNFQD--QYCCPLFLQPTEPLFTEIGSRFIKE 306
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
E+G + H+YNCD F+E P P ++SS+G A+++ M + D DA+WLMQGWLF D
Sbjct: 307 MALEFG-SDHVYNCDVFNEVRPTQADPVFVSSVGTAVFNAMTTADPDAIWLMQGWLFKSD 365
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
+W KALL SVP G++++LDL AE+ P + FYG P+++C+LHNF G + +
Sbjct: 366 ADYWTADLSKALLTSVPQGRMLILDLQAELDPQYIRLNSFYGQPFVFCLLHNFGGTLGLN 425
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + I+ ++AR N+TMVG G++MEGI+QN VVYD M EM ++ + ++ W ++Y
Sbjct: 426 GAIQIISQRVIDARNFPNSTMVGTGLTMEGIDQNYVVYDKMLEMGWRDKVPNLNQWFDEY 485
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+VRRYG + A+ AW L ++VYN + + + + V+V P +
Sbjct: 486 TVRRYGVNNTAVMSAWRFLQNSVYNDSSRRSFRGQYVLVTRPAL---------------- 529
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIA---SGNELSASNTYRYDLIDLTR 363
+ P +WY+ +VI A + I+ + LS ++ +R+D++DLTR
Sbjct: 530 ---------------WQLPFVWYNPHDVILAWDHLISGLMTEPLLSNASNFRHDMVDLTR 574
Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
Q++ + + L+ ++E Y ++ + ++ + ++L++D+D LL FLLG W+ AK
Sbjct: 575 QSMQEIFHLLYSQLLEVYLEKNSTAIEGIAYKMIDLLQDLDELLQTGKKFLLGKWIADAK 634
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
E ++ QYEWNAR QIT+W + +RDY K W+G++ DYY PR ++ +
Sbjct: 635 SWGTTEGEKLQYEWNARNQITLWGPRGE-----IRDYAAKQWAGVVADYYKPRWEVFIRE 689
Query: 484 MIESLESGDGFRLK 497
M SL+ F K
Sbjct: 690 MQMSLDENRAFNKK 703
>gi|374385255|ref|ZP_09642763.1| hypothetical protein HMPREF9449_01149 [Odoribacter laneus YIT
12061]
gi|373226460|gb|EHP48786.1| hypothetical protein HMPREF9449_01149 [Odoribacter laneus YIT
12061]
Length = 736
Score = 332 bits (850), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 187/540 (34%), Positives = 291/540 (53%), Gaps = 57/540 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF+G+VP + FP A++ Q+ NW + R+ TYLL+A DPLF IG F+E+
Sbjct: 235 ILPAFTGHVPPTFKEHFPEARLRQV-NW-----EGRFDDTYLLEADDPLFQTIGNRFMEE 288
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
Q++ +G T H+Y DTF+E PP + Y+ + A+Y M + D +AVW+MQGWLF
Sbjct: 289 QIRTFG-TDHLYGADTFNEMFPPSEDSTYLDGISKAVYQSMAAVDPEAVWVMQGWLFHDK 347
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW+P QMKA L +VP L+VLDL+ E PIW ++ FYG P+IWCMLHNF G ++
Sbjct: 348 RDFWKPAQMKAYLGAVPDEHLIVLDLWGEEFPIWDRTEAFYGKPWIWCMLHNFGGRNMLF 407
Query: 187 GILDSIAFGPVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
G +A P ++G+G EGIEQNPV+Y L+ +++ V++ W
Sbjct: 408 GNALKLAEEPSRVLADPAKGQLLGLGAVPEGIEQNPVIYSLLFSHIWRNTAVELDEWFET 467
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
Y RYG A++ AW++L TVY EG NY
Sbjct: 468 YLESRYGCRDEAVEKAWDILRKTVY--------------------------ANEG---NY 498
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
++ + +++ + + Y+ EVI+A + + + + L + YRYDLI + +Q
Sbjct: 499 ESAITARPTFEKH-NNWAYTDIPYNPVEVIKAWKYLLQAADRLGENPCYRYDLILVGKQV 557
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
LA YA + E Y+ D + SR F+EL++DMD L+ H+ FLLG WLE A+
Sbjct: 558 LANYATIIQQKFGEDYRTKDLPAFTRNSREFMELIDDMDELMGTHEAFLLGKWLEDARSW 617
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
+ +++ YE NAR QIT+W + ++L DY +K WSGL + +Y R ++ +
Sbjct: 618 GKTASEKQLYEKNARDQITLW----GGKDAVLHDYASKQWSGLFKGFYKGRWQLFIDEVY 673
Query: 486 ESLESGDGF-------RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
+ +++G + R++ W EW+ NG+ YP GD ++ S+ ++ KY++
Sbjct: 674 DCIKTGRKYDHTASDDRVRSWEWEWV-------NGQEKYPAVPQGDPVVVSERMFGKYIK 726
>gi|440800773|gb|ELR21808.1| AlphaN-acetylglucosaminidase (NAGLU) [Acanthamoeba castellanii str.
Neff]
Length = 800
Score = 332 bits (850), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 195/545 (35%), Positives = 291/545 (53%), Gaps = 69/545 (12%)
Query: 18 AALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSH 77
A L+ V+P A IT W R +LLD DP+F +IG AFI+ Q + YG T H
Sbjct: 243 AGLKRVYPHANITLSPTWAHFTDPYR---VWLLDPFDPIFQKIGTAFIDAQTRVYG-TDH 298
Query: 78 IYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKA 137
IYN DTF+E PP P Y+++ A+Y GM + D A+WLMQGWLF +W ++KA
Sbjct: 299 IYNADTFNELDPPSADPTYLAAASNAVYQGMAAADPKALWLMQGWLFR-SVWWSNDRIKA 357
Query: 138 LLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 197
L+ V +++LDL+AEV PIWS ++ ++G P++WCMLH+F GN ++YG L IA PV
Sbjct: 358 YLSGVKNDNMLILDLYAEVDPIWSKTESYFGKPFVWCMLHDFGGNRDLYGNLTHIATAPV 417
Query: 198 EARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA 257
+ART+ +TMVG G++ME IEQNPV+Y+LMSEM ++ VDV W++ Y RYG P+
Sbjct: 418 DARTAPGSTMVGTGLTMEAIEQNPVIYELMSEMGWRSAHVDVDDWLDHYVSFRYGADSPS 477
Query: 258 IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKS 317
+ AW +L+ + Y P + SI + ++
Sbjct: 478 AKKAWRLLHQSAYQN---------------PVIMRSIYTFVPNRH--------------- 507
Query: 318 ETSSYDHPHLWYSTSEVIRALELFIASGNEL----SASNTYRYDLIDLTRQALAKYANEL 373
S H YS ++ A L + S EL + + YDL+D+TRQ L + L
Sbjct: 508 -VSRNHH----YSPDVLVEAWGLLLQSRLELPNPAQPNGPWEYDLVDVTRQVL----DNL 558
Query: 374 FLNIIEAYQLND-AHGVFQLSRR------------FLELVEDMDGLLACHDGFLLGPWLE 420
F +AY L D A+ + +RR ++++ D+D +LA + +LLG W E
Sbjct: 559 FH---DAYGLLDGAYDAYVATRRDPFNQVKTIGAALIQILSDIDTVLATNQNYLLGVWTE 615
Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
A+ A NEE+++ YE+NAR QIT+W N + + DY +K W+GL+ YY PR I+
Sbjct: 616 RARSWATNEEEKRLYEFNARNQITLWGPNGE-----INDYASKEWAGLVGTYYRPRWQIF 670
Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGT 540
Y+ +S+ G + + + W N N +P ++ G+ SQ LY +Y+
Sbjct: 671 VAYLFDSIAKGTVIDPNKYAADLLLWEQRWNNQTNAFPSQATGNVAEVSQALYARYVSAA 730
Query: 541 GVFDH 545
+ H
Sbjct: 731 ELKQH 735
>gi|255533286|ref|YP_003093658.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
gi|255346270|gb|ACU05596.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
Length = 734
Score = 331 bits (849), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 181/538 (33%), Positives = 288/538 (53%), Gaps = 55/538 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LP+F+G+VP + ++ FP K+ + + P Y+L+ P+F EIGR F+
Sbjct: 227 ILPSFTGHVPPSFKDKFPDIKVNTQQ--WGINVSP----AYVLNPETPMFKEIGRKFLTA 280
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+ +G T H+Y+ DTF+E TP + Y++ + IY M + D+ AVW+MQGW+F
Sbjct: 281 LINTFG-TDHLYSADTFNEMTPVSNDSTYLNGMAKKIYESMAAVDTQAVWIMQGWMFLDR 339
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW+P QMKAL ++VP KL+VLDL +E+ P+WS + FYG +IWCMLHNF G + M+
Sbjct: 340 PNFWQPTQMKALFSAVPQDKLIVLDLNSELNPVWSRTDAFYGEKWIWCMLHNFGGRLSMF 399
Query: 187 GILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
G + I P A + + M G+G++MEGIEQNP +Y LM E + + +D+ W+
Sbjct: 400 GDMSRIGNDPAAALKNDQRGKMSGIGLTMEGIEQNPAIYSLMLEHIWNDKPIDLDNWLKG 459
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
Y+ RRYG+ + AW VL +TVY+ +I P D + +
Sbjct: 460 YAQRRYGKRNSNAEKAWEVLKNTVYSHQPWWG--TNTIITGRPTFDAATV---------- 507
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
+ + + YS+ E+++A + + +EL +S+ ++YDL+D+TRQ
Sbjct: 508 ----------------WTYTAIPYSSKELMKAWSYLLTASDELKSSDGFQYDLVDVTRQV 551
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
LA YAN L + +Y+ D + S +FLEL++D+D LL FLLG W+ +AK L
Sbjct: 552 LANYANVLQQDFASSYKQKDMATFNKKSAQFLELIDDIDQLLGTRSDFLLGKWINNAKAL 611
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
N ++K +E NAR IT+W D + + +Y K W+G+++ +Y PR +F +
Sbjct: 612 GDNPAEKKLFERNARDLITLWLD----KDCNIHEYACKEWAGMMKGFYKPRWQQFFDEVR 667
Query: 486 ESLESGDGF-------RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+G +KDW +W+ N Y + G+ + ++ LY KY
Sbjct: 668 LQASAGKEIDQIKFENTIKDWEWKWV-------NANEAYTDKPTGNPVTVAKALYAKY 718
>gi|242011515|ref|XP_002426494.1| alpha-N-acetylglucosaminidase, putative [Pediculus humanus corporis]
gi|212510620|gb|EEB13756.1| alpha-N-acetylglucosaminidase, putative [Pediculus humanus corporis]
Length = 1345
Score = 328 bits (841), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 177/484 (36%), Positives = 282/484 (58%), Gaps = 46/484 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP+F G VP + ++ +P AK+ ++ W D +CC YLLD+ DPLF + R F+++
Sbjct: 873 VLPSFCGIVPRSFKDSYPFAKLLEMPKWNKFSRD--YCCPYLLDSNDPLFSVVSRVFLKE 930
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+ E+G T+HIYNCD F+EN P +S +Y+S++ + IY M S D A WL+QGW+F D
Sbjct: 931 YINEFG-TNHIYNCDVFNENKPASESLDYLSTISSTIYKAMSSVDPRATWLVQGWMF-ID 988
Query: 128 PFWRP-PQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
PFW ++KA +N+VP G++++LDL +++ P + + ++G P+IWC LHNF G + MY
Sbjct: 989 PFWASLKRVKAFINAVPKGRMLILDLQSDLTPQYKRLQSYFGQPFIWCTLHNFGGQLGMY 1048
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G L+ + G + R +N+TMVG+G++ EGI+QN ++YD ++A + + VD+ WI +Y
Sbjct: 1049 GHLNRVNLGVFKGRKFKNSTMVGIGIAPEGIDQNYIMYDFTLDLALRTKPVDLDDWITKY 1108
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
++RRYG I DAW +L +T+YN PD + + S Y
Sbjct: 1109 ALRRYGLIEKNILDAWLILKNTLYNYN--------------PDSNFRLTSSNVKMYTLVK 1154
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELF-IASGNELSASNT-YRYDLIDLTRQ 364
+ +L S + WY+ S ++ E F IAS N + ++++ +++DLID+TRQ
Sbjct: 1155 GEHIAKNILTKFPSLRMNEFTWYNRSIILDIFEKFQIASSNSILSTSSLFQHDLIDVTRQ 1214
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
+ + S FLEL+ ++D +L FLLG WLESAK
Sbjct: 1215 TI--------------------QIAIENSNMFLELLNELDMILNTGKKFLLGNWLESAKN 1254
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
+A N+ ++ YE+NAR QIT+W N + +RDY K W+G++ D+Y PR ++F+ +
Sbjct: 1255 MATNKLEKDNYEFNARNQITLWGSNGE-----IRDYAAKQWAGMIHDFYKPRWKLFFQAL 1309
Query: 485 IESL 488
ES+
Sbjct: 1310 NESI 1313
>gi|383856382|ref|XP_003703688.1| PREDICTED: alpha-N-acetylglucosaminidase [Megachile rotundata]
Length = 744
Score = 327 bits (839), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 181/489 (37%), Positives = 281/489 (57%), Gaps = 44/489 (8%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L VLP+F+G+VP A +FP+A +T+L W + +CC YLL TDPLF +IG+
Sbjct: 226 LGIIPVLPSFAGHVPRAFPRLFPNANVTKLAPWNNFPD--VYCCLYLLAPTDPLFQQIGQ 283
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F++ ++E+G T HIYNCDTF+EN P +++ ++G + + M + D DA+WLMQGW
Sbjct: 284 LFLKTYIEEFG-TDHIYNCDTFNENEPHTSELKFLRNVGHSTFQAMNAVDPDAIWLMQGW 342
Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
LF++D FW P+++A L SVP G+++VLDL +E P + K ++G P+IWCMLHNF G
Sbjct: 343 LFTHDKLFWTEPRVEAFLTSVPRGRMIVLDLQSEQFPQYGRLKSYFGQPFIWCMLHNFGG 402
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
+ M+G I E R +N+TMVG G++ EGI QN V+Y+LM+EMA++ E V++
Sbjct: 403 TLGMFGSAQIINQRVFEGRNMKNSTMVGTGLTPEGINQNYVIYELMNEMAYRKEPVNLNK 462
Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
W Y+ RRYG AW L TVYN + + + VI P ++ S +
Sbjct: 463 WFENYASRRYGVWNEYAVSAWQSLGRTVYNFSGTRKIRGKYVISRRPSLNLSTWT----- 517
Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
WY + +F+ + + S YR+D++DL
Sbjct: 518 --------------------------WYDRDTLYNTWSVFLQARHGRRNSTLYRHDVVDL 551
Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
TRQ L A E++ +I+++ + S + L+L +D++ +LA FLLG WL++
Sbjct: 552 TRQVLQAKAEEIYPVLIDSFNKKNLTAFKYHSDKLLDLFDDLELILASGKDFLLGKWLDA 611
Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
AK+LA N+E+ + Y+ NA+ QI++W + +RDY NK W+G++ DY+ PR +I+
Sbjct: 612 AKKLASNDEELRLYQVNAKYQISLWGPRGE-----IRDYANKQWAGVVADYFKPRWSIF- 665
Query: 482 KYMIESLES 490
+ESLE+
Sbjct: 666 ---LESLEN 671
>gi|157134500|ref|XP_001656341.1| alpha-n-acetylglucosaminidase [Aedes aegypti]
gi|108881379|gb|EAT45604.1| AAEL003150-PA [Aedes aegypti]
Length = 763
Score = 327 bits (838), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 185/529 (34%), Positives = 293/529 (55%), Gaps = 47/529 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
LPAF+G++P ++P AK+T + NW + ++ LD DPLF EIG+ F+ +
Sbjct: 248 ALPAFAGHLPVQFAQLYPEAKLTPVENWNGFPA--QYASPLFLDPIDPLFQEIGKRFLTK 305
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
++ YG ++HIY CD F+E P S +Y+SS A IY M D AVWL+QGW+F +
Sbjct: 306 VIERYG-SNHIYFCDPFNEIQPRSFSAKYLSSASAGIYKAMNDVDPFAVWLLQGWMFVKN 364
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
P+W ++A L +VPLG+++VLDL +E P + ++ ++G P+IWCML NF G + M G
Sbjct: 365 PYWSDVAIRAFLQAVPLGRMLVLDLQSEQFPQYDRTESYHGQPFIWCMLSNFGGTLGMLG 424
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
+D + + RT+++ TM+G G++ EGI QN +Y+ EM + +V+ W Y+
Sbjct: 425 SVDLVFQRIRDVRTNDSMTMIGTGITPEGINQNYGLYEFALEMGWNPNIDNVEEWFRTYA 484
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RYG ++DAW++ +TVY+ + + GKY +
Sbjct: 485 SVRYGTQDKRLKDAWSMFRYTVYSFKEQ--------------------EMMRGKYTFNRR 524
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
P K HP LWY+ + ++L + S S + +R D++DLTRQ L
Sbjct: 525 PSLKL-----------HPWLWYNETLFNAGVQLLLESN---STNTLFRNDVVDLTRQFLQ 570
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
A+ L+LNI+EAY + + V LS F +L+EDMD LL FLLG WLESAK +A+
Sbjct: 571 NTADRLYLNIMEAYNTKNPNSVKYLSILFQKLLEDMDRLLRTDQHFLLGRWLESAKAVAE 630
Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
+ ++YE+NAR QIT+W Q + DY NK W+G+++D++ PR ++ M +
Sbjct: 631 TSLERQKYEYNARNQITLWGPQGQ-----IVDYANKQWAGMVQDFFLPRWKLFLTEMTKD 685
Query: 488 LESGDGF---RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLY 533
+E +++D + ++L N R YP+ +GDAL+ ++ L+
Sbjct: 686 VEQNRTLNEGKVRDKIFKMVELPFCTSNKR--YPIRPDGDALLVARELF 732
>gi|255533285|ref|YP_003093657.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
gi|255346269|gb|ACU05595.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
Length = 749
Score = 327 bits (837), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 191/541 (35%), Positives = 283/541 (52%), Gaps = 60/541 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAFSG+VP + FP+A++ +L NW + R+ TY+L DPLF +I F+ +
Sbjct: 243 ILPAFSGHVPPTFKARFPNARVDRL-NW-----EGRFADTYVLHPDDPLFQQIADKFMAE 296
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
Q K +G T H+Y DTF+E P Y+ +G A+Y GM D +A+W+MQGW+F
Sbjct: 297 QDKAFGNTDHLYGADTFNEMYLPYTDTAYVRKIGTAVYKGMAKADPEAIWVMQGWMFWDK 356
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW+P +K L+ VP L++LDLFA+ +PIW+ ++ F+G +IWCMLHNF G +Y
Sbjct: 357 RDFWKPEVVKNYLSGVPDDNLIMLDLFADEQPIWTKTEAFWGKKWIWCMLHNFGGRNPLY 416
Query: 187 GILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
G L+ I P E N + G+G+ EGIEQNPVVY LM E + + +DVK+W+
Sbjct: 417 GDLNYIGREPAEMVHDPNRGRLSGIGLVPEGIEQNPVVYSLMLEHVWNDQVIDVKSWLVN 476
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
Y+ RRYG+ P + AW +L+ TVY EG Y+
Sbjct: 477 YAQRRYGQRDPQTEKAWQILHQTVY--------------------------AKEGSYETI 510
Query: 306 --GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 363
+P ++ ++ + D P Y +++ A + + N ++ Y++DL+ + R
Sbjct: 511 ISARPTHEK---HADWTGTDLP---YDGDKLVPAWTYLLNASNRFKNNDCYQFDLVTVGR 564
Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
Q LA YA L ++ + + FL L+ DMD L+ FLLG WL AK
Sbjct: 565 QVLANYATVLQRLFARDFRNKNLTAYRAHTAEFLTLIADMDKLMGTRKDFLLGKWLNDAK 624
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
+ A NE + + YE NAR IT+W ++AS L +Y NK W+GL +YG R +
Sbjct: 625 KWATNESESRLYEKNARDLITLW---GGKDAS-LHEYANKQWAGLFNGFYGKRWQTFIAE 680
Query: 484 MIESLESGDGF-------RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+LE G F R+KDW EW +W NGR Y + G+ + S L+ KY
Sbjct: 681 TSTALEQGKSFDQEAFETRMKDW--EW-----NWVNGREQYTDKPQGNPVTVSIQLHKKY 733
Query: 537 L 537
+
Sbjct: 734 I 734
>gi|194759443|ref|XP_001961958.1| GF14678 [Drosophila ananassae]
gi|190615655|gb|EDV31179.1| GF14678 [Drosophila ananassae]
Length = 783
Score = 325 bits (832), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 184/541 (34%), Positives = 295/541 (54%), Gaps = 54/541 (9%)
Query: 7 AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
LPAF+G+VP AL + P+ T + W R+CC ++ T+PLF +I F++
Sbjct: 264 VALPAFAGHVPRALSRLHPNTSFTDVQRWNQFPD--RYCCGLFVEPTEPLFHQIATTFLQ 321
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
+ YG ++HI+ CD F+E PPV PEY+ S AAI++ M + D +A+WL+QGW+F
Sbjct: 322 SVVTIYG-SNHIFFCDPFNELEPPVAKPEYMRSTAAAIHNSMTAVDPEAIWLLQGWMFVK 380
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
+PFW P +A L +VP G+++VLDL +E P + + ++G P+IWCMLHNF G + M+
Sbjct: 381 NPFWTPDMAEAFLTAVPRGRILVLDLQSEQFPQYELTHSYFGQPFIWCMLHNFGGTLGMF 440
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G I G AR+ N+++VG G++ EGI QN VVY L E + +D+ +W +
Sbjct: 441 GSAKLINSGIEAARSMPNSSIVGTGITPEGIGQNYVVYSLTLERGWSRNSIDLDSWFRHF 500
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+V RYG ++ AW +L ++VY+ + G+Y
Sbjct: 501 TVTRYGVKDESLAKAWLLLKNSVYSFHG--------------------LQKMRGQYVVTR 540
Query: 307 KPVSKEAVLKSETSSYDH-PHLWYSTSEVIRALELFIASGN----ELSASNTYRYDLIDL 361
+P S++H P WY+ S+V+ A L +++ E + Y +DL+D+
Sbjct: 541 RP------------SFNHDPFTWYNASDVLEAWHLLLSARVIIPLEDDRYDVYEHDLVDI 588
Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
TRQ L A++L++N+ +++ LS R L+L +D++ +L+ FLLG WLE
Sbjct: 589 TRQFLQITADQLYVNLKSSFRKRQLPRFEFLSTRLLQLFDDLELILSSGRNFLLGNWLEQ 648
Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
AKQ+A + E K +E+NAR QIT W N Q + DY K WSGL++DYY PR +++F
Sbjct: 649 AKQVAPHPEDRKSFEFNARNQITAWGPNGQ-----ILDYACKQWSGLVKDYYKPRWSLFF 703
Query: 482 KYMIESLES-----GDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ +L S G F+ K +R I+L + N ++YP + + S ++ ++
Sbjct: 704 DDVNVALHSQRPFNGSAFKQKVSQR--IELP--FSNKTDIYPTDPVENVWFISHTIFERW 759
Query: 537 L 537
+
Sbjct: 760 M 760
>gi|390334740|ref|XP_003724005.1| PREDICTED: uncharacterized protein LOC100893810 [Strongylocentrotus
purpuratus]
Length = 1043
Score = 324 bits (830), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 176/471 (37%), Positives = 257/471 (54%), Gaps = 41/471 (8%)
Query: 70 KEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF 129
+E+ T HIYN DTF+EN P + Y+S+ +Y G+ GD VWLMQGWLF F
Sbjct: 573 EEFNGTDHIYNADTFNENQPRSNDSAYLSAASRGVYQGIVEGDPQGVWLMQGWLFQKTDF 632
Query: 130 WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGIL 189
W P Q+KALL+ VP+G+++VLDLFAE +PI++ ++ FYG P+IWCMLHNF GN +YG L
Sbjct: 633 WGPSQIKALLHGVPIGRMIVLDLFAEARPIYNATQSFYGQPFIWCMLHNFGGNTGLYGKL 692
Query: 190 DSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVR 249
D++ P EAR ++TM+G+G++ EGI QN V+Y+ +++M ++ E ++V WI +YS R
Sbjct: 693 DAVNKFPFEARQFNSSTMIGMGLTPEGILQNYVMYNFLTDMTWRSESMNVSKWIEEYSGR 752
Query: 250 RY----GRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
RY G S A + AW +L TVYN T D A P V PS N
Sbjct: 753 RYSPESGHSEEAAK-AWAILQATVYNNTGIDKDHQH----AVPVVRPS----------NK 797
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
K V +WY +EV +A + + L S+ +RYDL+D+TR
Sbjct: 798 TKSV-----------------IWYDYTEVAKAWGFLLQASETLGTSSLFRYDLVDVTRNV 840
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
L A + + I+ ++ + + L+ DMD + + H +LLG WLE AK L
Sbjct: 841 LQDLAFDFYEQIMVSFHAKNITAIRGNGTLLCNLILDMDNITSSHQDWLLGTWLEDAKSL 900
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
A N ++E YE+NAR QIT+W + DY NK W GLLR YY R ++ +++
Sbjct: 901 ATNHKEESLYEYNARNQITVWGPRGEH-----LDYANKQWGGLLRSYYYNRWQLFVQFLD 955
Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+E + + + +W N +P + GD + S+ LY+KY
Sbjct: 956 GCIELHVPYDQSKFDMRSFIMETEWTNSTEKFPTKPVGDTVSISRALYSKY 1006
>gi|449675146|ref|XP_002156234.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Hydra
magnipapillata]
Length = 646
Score = 323 bits (827), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 175/530 (33%), Positives = 280/530 (52%), Gaps = 36/530 (6%)
Query: 8 VLPAFSGNVPAAL-QNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
VLP F G++P AL +FP++K +L W K ++ T+LLD DPLF ++G AF+E
Sbjct: 126 VLPGFGGHIPKALVSRLFPTSKYYKLKPW--NKFTGKYGGTFLLDPQDPLFKKVGAAFVE 183
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
Q + Y T H+YN D F+E PP + +I++ +Y+ M + DSDAVWLMQGW+F
Sbjct: 184 MQKQLYNGTDHVYNADIFNEMDPPQLTSAFITNTSIGVYNAMLASDSDAVWLMQGWMF-L 242
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W+P ++A L ++P GKL++LDL +++ P++ + FYG P+IWCM+ NF G +Y
Sbjct: 243 SSVWKPELVEAWLQAIPYGKLIILDLASDIYPLYDQTNAFYGHPFIWCMIENFGGTTRLY 302
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G L + G + AR + + M+G GM+ EGI QN + ++LM+EM +++E+ ++ W Y
Sbjct: 303 GQLTGVMKGVISARKTYKSFMIGTGMTPEGINQNDINFELMNEMGWRNEEFNISDWTLSY 362
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
RRYG + DAW +L T+YNC DG + D + P + P + N
Sbjct: 363 IKRRYGDYPKMVSDAWLILIDTIYNCNDGRENGGYDGRI--PVMRPQL---------NAK 411
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
PV H+WYS ++ A +L + + + +T+R DL+ L Q L
Sbjct: 412 LPV----------------HMWYSIKDLYNAWKLMVKGSDYMPLIDTFRNDLVRLGTQVL 455
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
+ + ++ Y V + + L+ DMD LLA LLG W++SA+ +
Sbjct: 456 EDLSIVFYTQMVSGYFNKSTLNVEKYGSKITVLLTDMDRLLATDQYSLLGRWIQSARSMG 515
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
+ K E+NA+ QIT+W N + +RDY NK W+GL+ +Y R ++ ++ +
Sbjct: 516 DTLNETKLLEYNAKNQITLWGPNGE-----IRDYANKNWAGLVGSFYFERWNMFINFLSD 570
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
SL+ G + + + ++ W N + + GDA S L Y
Sbjct: 571 SLKRGVPYDDSAFVSKLLQFEKKWNNEIKEFSADPTGDAFGISHQLLRAY 620
>gi|328867411|gb|EGG15793.1| alpha-N-acetylglucosaminidase [Dictyostelium fasciculatum]
Length = 1501
Score = 321 bits (823), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 188/551 (34%), Positives = 299/551 (54%), Gaps = 66/551 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP F+G+VP A ++FP+A ITQLG+W + TY LD +DPLF + + F++
Sbjct: 987 VLPGFAGHVPRAFMSLFPTANITQLGDWRAFNG------TYYLDPSDPLFANVSQTFVKV 1040
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q YG T H Y+ D F+E TPP Y+ + +++Y+ + D AVW++Q W F D
Sbjct: 1041 QTAIYG-TDHYYSFDPFNEITPPSSDAGYLQNSSSSMYNALAYADPQAVWVLQAWFFISD 1099
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW+PPQ+KA L VP+G L+VLD +AE P W+ + QF G +IWCMLHNF G MY
Sbjct: 1100 AWFWQPPQVKAFLGGVPIGHLLVLDTWAEESPAWTVTDQFNGHDWIWCMLHNFGGRTGMY 1159
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + I GP++AR ++ M G G++ E IEQN ++YDLMSEM+++ ++ WINQY
Sbjct: 1160 GKIPRITAGPIDAR-KQSPGMKGTGLTPEAIEQNYIMYDLMSEMSWRTTAPNMTEWINQY 1218
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ RRYG VP + AWN L TVYN D + DKN +F + P +
Sbjct: 1219 TQRRYGVFVPELAQAWNSLASTVYNAPD-SIDKNPS---SFVGIRPELNMTN-------- 1266
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE-LSASNTYRYDLIDLTRQA 365
+++Y +S + +A +L+++ +E + +++TY +D+ ++T QA
Sbjct: 1267 -------------------NIYYDSSIIQKAWQLYLSVTDEYVLSTSTYSFDIAEITIQA 1307
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ- 424
L+ E + + +AY+ + + L ++ DMD + + L+G W +A+Q
Sbjct: 1308 LSNLFIETEIAMYDAYKTGKGTEFDEHAMNCLNIITDMDMIASTQQLLLVGTWTANARQW 1367
Query: 425 ----LAQNEEQEK----------QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
L++N+++++ QYE+NAR QIT+W S L DY WSGLL
Sbjct: 1368 ANYNLSRNKDEDRNTDKEQMTIEQYEFNARNQITLW----GPSNSTLHDYAYHLWSGLLN 1423
Query: 471 DYYGPRAAIYFKYM------IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD 524
D+Y R +++ KY+ + ++G GF+ +++ + L W YP G+
Sbjct: 1424 DFYLARWSLFIKYLDSSLSSSSTNDAGTGFKNQEYINDIESLEESWNLQTYQYPTRPTGN 1483
Query: 525 ALITSQWLYNK 535
A S+++ N+
Sbjct: 1484 AYQLSKFINNQ 1494
>gi|198476648|ref|XP_001357424.2| GA12255 [Drosophila pseudoobscura pseudoobscura]
gi|198137793|gb|EAL34493.2| GA12255 [Drosophila pseudoobscura pseudoobscura]
Length = 767
Score = 320 bits (820), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 182/533 (34%), Positives = 291/533 (54%), Gaps = 44/533 (8%)
Query: 9 LPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQ 68
LPAF+G++P A++ ++P+ T++ W S DP +CC +D DP+F + F+ +
Sbjct: 250 LPAFAGHLPTAMRRIYPNGNYTEVERWNSFP-DP-YCCGLFVDPLDPIFDLVAALFLRRV 307
Query: 69 LKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP 128
++ YG ++HI+ CD F+E PPV P+Y+ S AAI++ M+S D +AVWL+QGW+F +
Sbjct: 308 VQRYG-SNHIFFCDPFNELQPPVAEPDYMRSTAAAIHNSMRSVDPEAVWLLQGWMFVKNI 366
Query: 129 FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI 188
FW M+A L +VP+G+L+VLDL +E P + + +YG P++WCMLHNF G + M+G
Sbjct: 367 FWTDAMMEAFLTAVPIGRLIVLDLQSEQFPQYQRTDSYYGQPFVWCMLHNFGGTLGMFGS 426
Query: 189 LDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSV 248
D + G AR N+++VGVG++ EGI QN V+Y L+ E + +D+ +W ++
Sbjct: 427 ADLVNNGIEAARRMPNSSIVGVGITPEGIGQNYVMYSLVLERGWSELPLDLDSWFKHFAR 486
Query: 249 RRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKP 308
RYG +Q AW +L +VY+ K R G
Sbjct: 487 TRYGVDDEGLQQAWQLLRRSVYSFR--GLQKMRG-----------------------GYT 521
Query: 309 VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN----TYRYDLIDLTRQ 364
V++ L + P WY+ S+V+ A +L ++S + + Y +DL+D+TRQ
Sbjct: 522 VTRRPALNLD------PFTWYNASDVLEAWKLLLSSRAIIPLEDDNYAIYEHDLVDITRQ 575
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
L A++L++N+ AY+ L + L+L D++ +LA FLLG WL A++
Sbjct: 576 YLQISADQLYVNLKSAYRKRQVARFEYLGSKLLQLFGDLERILASGSNFLLGTWLADAQR 635
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
A N + +E+NAR QIT W + Q + DY K WSGL+ DYY PR A++ +
Sbjct: 636 AAPNAADKPNFEFNARNQITAWGPDGQ-----ILDYACKQWSGLVLDYYRPRWALFLDDV 690
Query: 485 IESLESGDGFRLKDWR-REWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+L S F ++ R ++ + N +VYPVE G+ SQ +Y ++
Sbjct: 691 TLALHSNRTFNSTAFKLRVSQEVELPFSNKSDVYPVEPMGNTWFISQNIYERW 743
>gi|348681836|gb|EGZ21652.1| hypothetical protein PHYSODRAFT_247428 [Phytophthora sojae]
Length = 991
Score = 319 bits (818), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 181/541 (33%), Positives = 291/541 (53%), Gaps = 43/541 (7%)
Query: 9 LPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQ 68
LPAF+G+VP L+ P+A T+ NW + ++CC Y+++ TDPL+ EIG+AF+E+Q
Sbjct: 462 LPAFAGHVPEDLKLTLPNANFTRSPNWGNFTD--QYCCVYMIEPTDPLYREIGKAFLEEQ 519
Query: 69 LKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP 128
Y TS +Y CDT+ E P + A+ GM + D +AVWLMQGW F DP
Sbjct: 520 RALYNYTSSLYQCDTYMEMAPEFTDLSELKGAARAVIDGMTAADPNAVWLMQGWPFVDDP 579
Query: 129 -FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
+W P++KA L VP KL++LD ++E PIW+ ++G +I+ +LHNF GN M G
Sbjct: 580 HYWTRPRVKAYLEGVPTDKLIILDFYSEAVPIWNKMDNYFGKNWIYSVLHNFGGNTGMRG 639
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
L ++A PV+A+ N TMVGVG++MEGI QN VVYDL +MA++ +DV W+++Y+
Sbjct: 640 DLPTLATAPVQAQRDGNGTMVGVGLTMEGIFQNYVVYDLTLQMAWEDSPLDVDEWVSKYA 699
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RRY ++ AW+ L +VYN T +A+ V S++ + Y +
Sbjct: 700 SRRYHTQNEHVERAWSYLSRSVYNRT-----------LAYGGVTKSLVCLIPHWRLLYDR 748
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
+ + Y +++ A + + +G+EL +TYR+DL+D+T+Q L+
Sbjct: 749 --------------FQPTLIKYDPKDIVLAWKELLLAGDELRNVDTYRHDLVDVTKQFLS 794
Query: 368 KYANELF--LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
E + L +I + + A+ V +L++ L + ++ +LA ++ FLLG W+ A L
Sbjct: 795 NKLLEQYQHLKVIYSAKSAPANEVCELTKTMLTTINRLEEILATNEDFLLGNWVADALNL 854
Query: 426 AQN----------EEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
A + + ++ YE+ AR Q+T W DN E + DY K W+GL++ YY P
Sbjct: 855 AGDLNIGGDSVTRTKLQEYYEYEARNQVTRWGDNNNEA---IHDYAGKEWAGLVKSYYLP 911
Query: 476 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 535
R ++ + + K ++ I WQ YP + GD+ S+ +Y++
Sbjct: 912 RWTMWLTEVCSAYTDRREMDEKGLKKRRIAFELKWQLSHEKYPTTTVGDSFSISKRIYSE 971
Query: 536 Y 536
Y
Sbjct: 972 Y 972
>gi|195155652|ref|XP_002018715.1| GL25802 [Drosophila persimilis]
gi|194114868|gb|EDW36911.1| GL25802 [Drosophila persimilis]
Length = 767
Score = 319 bits (817), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 181/533 (33%), Positives = 291/533 (54%), Gaps = 44/533 (8%)
Query: 9 LPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQ 68
LPAF+G++P A++ ++P+ T++ W S DP +CC +D DP+F + F+ +
Sbjct: 250 LPAFAGHLPTAMRRIYPNGNYTEVERWNSF-PDP-YCCGLFVDPLDPIFDLVAALFLRRV 307
Query: 69 LKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP 128
++ YG ++HI+ CD F+E PPV P+Y+ S AAI++ M+S D +AVWL+QGW+F +
Sbjct: 308 VQRYG-SNHIFFCDPFNELQPPVAEPDYMRSTAAAIHNSMRSVDPEAVWLLQGWMFVKNI 366
Query: 129 FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI 188
+W M+A L +VP+G+L+VLDL +E P + + +YG P++WCMLHNF G + M+G
Sbjct: 367 YWTDAMMEAFLTAVPIGRLIVLDLQSEQFPQYQRTDSYYGQPFVWCMLHNFGGTLGMFGS 426
Query: 189 LDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSV 248
D + G AR N+++VGVG++ EGI QN V+Y L+ E + +D+ +W ++
Sbjct: 427 ADLVNNGIEAARRMPNSSIVGVGITPEGIGQNYVMYSLVLERGWSELPLDLDSWFKHFAR 486
Query: 249 RRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKP 308
RYG +Q AW +L +VY+ K R G
Sbjct: 487 TRYGVDDEGLQQAWQLLRRSVYSFR--GLQKMRG-----------------------GYT 521
Query: 309 VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN----TYRYDLIDLTRQ 364
V++ L + P WY+ S+V+ A +L ++S + + Y +DL+D+TRQ
Sbjct: 522 VTRRPALNLD------PFTWYNASDVLEAWKLLLSSRAIIPLEDDKYAIYEHDLVDITRQ 575
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
L A++L++N+ AY+ L + L+L D++ +LA FLLG WL A++
Sbjct: 576 YLQISADQLYVNLKSAYRKRQVARFEYLGSKLLQLFGDLEHILASGSNFLLGTWLADAQR 635
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
A N + +E+NAR QIT W + Q + DY K WSGL+ DYY PR A++ +
Sbjct: 636 AAPNAADKPNFEFNARNQITAWGPDGQ-----ILDYACKQWSGLVLDYYRPRWALFLDDV 690
Query: 485 IESLESGDGFRLKDWR-REWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+L S F ++ R ++ + N +VYPVE G+ SQ +Y ++
Sbjct: 691 TLALHSNRTFNSTAFKLRVSQEVELPFSNKSDVYPVEPMGNTWFISQNIYERW 743
>gi|348681870|gb|EGZ21686.1| hypothetical protein PHYSODRAFT_495971 [Phytophthora sojae]
Length = 692
Score = 317 bits (812), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 173/538 (32%), Positives = 285/538 (52%), Gaps = 39/538 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
LPAF+G+VP L+ ++P+AK T+ NW + +CC Y+LD DPL+ EIG+ F+E+
Sbjct: 160 ALPAFAGHVPEELKTLYPNAKFTRSPNWGGFSDE--FCCVYMLDPQDPLYYEIGKTFLEE 217
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q Y TS +Y CDT++E P P + + A+ M + D +AVWL+QGWLF
Sbjct: 218 QRALYDYTSSLYQCDTYNEMDPDFTDPAKLQAASRAVIDSMTAADPNAVWLIQGWLFVNS 277
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P +W +++ L+ VP K+++LDL++EV+P+W+ ++G +I+C+LHNF GN M
Sbjct: 278 PNYWTKERVQTYLDGVPNDKMIILDLYSEVRPVWNKMDNYFGKSWIYCVLHNFGGNTGMR 337
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G L ++ PV A + + TM+G+G++MEGI QN VVYDL +MA+ +D+ W+ +
Sbjct: 338 GDLPTLGTAPVLANRASSGTMIGMGLTMEGIFQNYVVYDLTLQMAWVDAPLDMDEWVPSF 397
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ +RY + AW L +VYN T G + ++ P
Sbjct: 398 AAQRYHSQDAHTERAWGFLLQSVYNRTLGYGGVTKSLVCLIPHWK--------------- 442
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
+ ++ + + + Y ++ RA + + +G+EL A +TYR+DL+D+TRQ L
Sbjct: 443 --LVRDGFMPTLIT--------YDPMDITRAWKELLLAGSELHAVDTYRHDLVDVTRQFL 492
Query: 367 AKYANELFLNIIEAYQLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
+ + +L++ + Y + A + + R L +E +D +LA +D FLLG W+ A+
Sbjct: 493 SDHFMAQYLHLEDMYAGKETPADQLCAWTDRMLVTIEWLDEILATNDDFLLGNWVADARA 552
Query: 425 LAQN------EEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
LA + YE+ AR Q+T W DN E + DY K W+GL+ YY PR
Sbjct: 553 LADEVGAAEVTSLQDYYEYEARNQVTRWGDNNSES---IHDYAGKEWAGLVSGYYLPRWR 609
Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ + +S ++ + +WQ YP + GD L S+ +Y ++
Sbjct: 610 MWLTEVCQSYTQKRDVNEAALKKARVDFELNWQLSHERYPTTTTGDTLAVSKRIYEEF 667
>gi|428176410|gb|EKX45295.1| hypothetical protein GUITHDRAFT_51145, partial [Guillardia theta
CCMP2712]
Length = 680
Score = 317 bits (812), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 179/552 (32%), Positives = 304/552 (55%), Gaps = 41/552 (7%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L VLPAF+G VP ++++FP AK T+ GNW CC ++D TDPLF++IG+
Sbjct: 153 LGMLPVLPAFAGGVPEGMKSLFPEAKFTRHGNWGGFAEQH--CCVMMVDPTDPLFLKIGK 210
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSP----EYISSLGAAIYSGMQSGDSDAVWL 118
F+E+ YG ++HIY+CDTF+EN P + +++S A++ M++ D DAVWL
Sbjct: 211 MFVEEVRAVYG-SNHIYSCDTFNENRPRSEHGSVGLDFLSHSSRAVFESMRAADPDAVWL 269
Query: 119 MQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVP-----YI 172
MQGWLF D FW+ ++ A L+ VP ++++LDLF +V P+W P ++
Sbjct: 270 MQGWLFMNDARFWQKRELDAYLSGVPEDRMIILDLFTDVFPVWKRRDLQRPTPIEKRRWV 329
Query: 173 WCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
W MLH+F GN MYG L I+ PV A+ E+ TMVGVG++ EGIEQNPVVY++M+EM +
Sbjct: 330 WNMLHSFGGNSGMYGRLQVISKDPVVAK-KESQTMVGVGITTEGIEQNPVVYEMMAEMRW 388
Query: 233 QHEKVDVKAWINQYSVRRYG----RSVPAI-QDAWNVLYHTVYNCTDGATDKNRDVIVAF 287
+ ++VDV +W+ +++ RR G R A+ ++AW L TVY+C + + ++ +
Sbjct: 389 REQEVDVMSWVEKWADRRLGPEASRERKALGEEAWRELASTVYSCPGTQMGQVKSMVESR 448
Query: 288 PDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE 347
P +D + + + + +EA++++ W ++ L +
Sbjct: 449 PRLDLASGWIPNSDFMPIKRHYPEEALVRA----------W------LKLLRATRGGADG 492
Query: 348 LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLL 407
+ S++ +D+ D+TRQ L+ LF + Q A + L ++ DMD ++
Sbjct: 493 YTCSSSASFDIADVTRQVLSDLFARLFQPLSSFCQTRLAGSAAVRMQTLLGIISDMDKMV 552
Query: 408 ACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSG 467
LLG W+E A+ +++E+E+ E+NAR +T+W + + DY +K W G
Sbjct: 553 GTQPRMLLGKWIEDARAWGKSKEEEEVLEFNARNLVTLWGPRGE-----IADYASKQWQG 607
Query: 468 LLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN-GRNVYPVESNGDAL 526
LL DYY R ++F+++ +++ F + +++E + WQ + +P G+A+
Sbjct: 608 LLSDYYMSRWKLFFEHLQQAIRGTRIFSQQRFQQELLVFEQQWQTRTSSSFPSSPEGNAV 667
Query: 527 ITSQWLYNKYLQ 538
+ L++KY++
Sbjct: 668 ELAWQLHDKYIK 679
>gi|170060634|ref|XP_001865888.1| alpha-N-acetyl glucosaminidase [Culex quinquefasciatus]
gi|167879069|gb|EDS42452.1| alpha-N-acetyl glucosaminidase [Culex quinquefasciatus]
Length = 761
Score = 316 bits (809), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 189/541 (34%), Positives = 290/541 (53%), Gaps = 60/541 (11%)
Query: 1 MLLADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEI 60
M+LA LPAF+G++P + +FP AK+ + W + ++ LD DPLF +I
Sbjct: 244 MILA----LPAFAGHLPVQFKTLFPQAKLNPVEVWNGFPA--QYASPLFLDPVDPLFQKI 297
Query: 61 GRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQ 120
G F+ + + YG T HIY D F+E P +S Y++S A IY M D AVWL+Q
Sbjct: 298 GSKFVAKAIARYG-TDHIYFSDPFNEIQPRSESARYLASAAAGIYQAMVDVDPLAVWLLQ 356
Query: 121 GWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFA 180
GW+ +PFW +KA +VP G+++VLDL +E P + ++ +YG P+IWCML NF
Sbjct: 357 GWMLVKNPFWSDRAIKAFFTAVPNGRMLVLDLQSEQFPQYVRTQSYYGQPFIWCMLSNFG 416
Query: 181 GNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVK 240
G + M G +D + E R++E+ TM+G G++ EGI QN +Y+ EM + + DV
Sbjct: 417 GTLGMLGSVDLVFERIRETRSNESMTMIGTGITPEGINQNYGLYEFALEMGWNPDISDVD 476
Query: 241 AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
W +Y++ RYG +QDAW++ TVY+ + + G
Sbjct: 477 NWFTRYAMVRYGNDDKRLQDAWSIFRSTVYSFKG--------------------MEMMRG 516
Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIA--SGNELSASNTYRYDL 358
KY +P K P +WY+ + +EL +A NEL +++D+
Sbjct: 517 KYTFNRRPSLKL-----------QPWVWYNETRFDEGVELILAVNGSNEL-----FKHDV 560
Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
+DLTRQ L A++L+L I++ Y L +A S F EL++++D LLA + FLLG W
Sbjct: 561 VDLTRQFLQNTADKLYLTIMDTYTLKNAAAFKHYSNLFKELLQNIDRLLATNTHFLLGRW 620
Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
LESAK LA + ++YE+NAR QIT+W Q + DY NK WSG+++D++ PR +
Sbjct: 621 LESAKSLATTSLERQKYEYNARNQITLWGPQGQ-----IVDYANKQWSGVVQDFFLPRWS 675
Query: 479 IYFKYMIESLESGDGF-----RLKDWRREWIKLTNDWQNGRNVYPVESNG-DALITSQWL 532
++ + M +L + R K +R+ + D R YP E++G DAL ++ L
Sbjct: 676 LFLQEMELALATNGTINETKVRDKIFRKVELPFNTD----RKKYPAEASGEDALELAREL 731
Query: 533 Y 533
Y
Sbjct: 732 Y 732
>gi|195577611|ref|XP_002078662.1| GD22403 [Drosophila simulans]
gi|194190671|gb|EDX04247.1| GD22403 [Drosophila simulans]
Length = 778
Score = 315 bits (806), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 183/533 (34%), Positives = 283/533 (53%), Gaps = 52/533 (9%)
Query: 7 AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
LPAF+G+VP AL+ + P + ++ W R+CC ++ TD LF EI F++
Sbjct: 259 VALPAFAGHVPRALKRLHPESTFMEVQRWNQFPD--RYCCGLFVEPTDNLFKEIASRFLQ 316
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
+ +YG ++HI+ CD F+E PPV PEY+ S AAIY M+ D A+WL+QGW+F
Sbjct: 317 NIITKYG-SNHIFFCDPFNELEPPVAKPEYMRSTAAAIYESMRGIDPQAIWLLQGWMFVK 375
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
+PFW +A L + P G+++VLDL +E P + ++ ++G P+IWCMLHNF G + M+
Sbjct: 376 NPFWTTDMAEAFLTAAPRGRILVLDLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMF 435
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G I G EAR N+++VG G++ EGI QN V+Y E + + +D+ +W +
Sbjct: 436 GSAKLINSGIDEARRLPNSSLVGTGITPEGIGQNYVMYSFTLERGWSNTSLDLDSWFTNF 495
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
S RYG ++ AW +L ++VY+ + G+Y
Sbjct: 496 SHTRYGVKDERLEQAWLLLKNSVYSFRG--------------------LQKMRGQY---- 531
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN----TYRYDLIDLT 362
V+ S P WY+ S V+ A L + S + + Y +DL+D+T
Sbjct: 532 -------VVTRRPSFNQEPFTWYNASAVLDAWHLLLTSRAIIPLEDDRYEIYEHDLVDIT 584
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
RQ L A++L++N+ AY+ LS + L+L +DM+ +LA FLLG WL+ A
Sbjct: 585 RQFLQISADQLYVNLRSAYRKRQVARFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQA 644
Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
KQ A N +++ +E+NAR QIT W + Q + DY K WSGL+ DYY PR ++ +
Sbjct: 645 KQAAPNTGEQRNFEFNARNQITAWGPDGQ-----ILDYACKQWSGLVSDYYRPRWRLFLE 699
Query: 483 YMIESLES-----GDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
+ +L + G F+LK + I+L + N +VYPV G+ + SQ
Sbjct: 700 DVTVALHAGRPYNGTAFKLK--VSQEIELP--FSNKADVYPVTPVGNTWLISQ 748
>gi|194863164|ref|XP_001970307.1| GG23441 [Drosophila erecta]
gi|190662174|gb|EDV59366.1| GG23441 [Drosophila erecta]
Length = 778
Score = 313 bits (803), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 184/533 (34%), Positives = 283/533 (53%), Gaps = 52/533 (9%)
Query: 7 AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
LPAF+G+VP AL+ + P + ++ W R+CC L+ TD LF EI F++
Sbjct: 259 VALPAFAGHVPRALKRLHPGSTFMEVQRWNQFPD--RYCCGLFLEPTDNLFNEIALIFLQ 316
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
+ + YG ++HI+ CD F+E PPV PEY+ S AAIY ++ D A+WL+QGW+F
Sbjct: 317 KIITAYG-SNHIFFCDPFNELEPPVAKPEYMRSTAAAIYESIRRLDPQAIWLLQGWMFVK 375
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
+PFW +A L + P G+++VLDL +E P + ++ ++G P+IWCMLHNF G + M+
Sbjct: 376 NPFWTTDMAEAFLTAAPRGRILVLDLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMF 435
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G I G EAR N+++VG G++ EGI QN V+Y E + + +D+ +W +
Sbjct: 436 GSAKLINSGIEEARRLPNSSLVGTGITPEGIGQNYVMYSFTLERGWSNRPLDLDSWFTSF 495
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
S RYG ++ AW L ++VY+ + G+Y
Sbjct: 496 SHARYGVKDERLEQAWLQLKNSVYSFHG--------------------LQKMRGQYVVTR 535
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN----TYRYDLIDLT 362
+P K+ P WY+ S V+ A L ++S + + Y +DL+D+T
Sbjct: 536 RPSFKQ-----------EPFTWYNASAVLDAWHLLLSSRAIIPLEDDRYEMYEHDLVDIT 584
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
RQ L A++L++N+ AY+ LS + L+L +DM+ +LA FLLG WL+ A
Sbjct: 585 RQFLQISADQLYVNLRSAYKKRQVSRFEFLSSKLLKLFDDMELILASSRNFLLGNWLQQA 644
Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
KQ A + +++ YE+NAR QIT W + Q + DY K WSGL+ DYY PR ++ +
Sbjct: 645 KQAAPHPGEQRNYEFNARNQITAWGPDGQ-----ILDYACKQWSGLVSDYYRPRWRLFLE 699
Query: 483 YMIESLES-----GDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
+ +L S G F+LK + I+L + N +VYPV G+ SQ
Sbjct: 700 DVTVALHSLRPFNGTAFKLK--VSQEIELP--FSNKVDVYPVTPVGNTWFISQ 748
>gi|21356587|ref|NP_652045.1| CG13397, isoform A [Drosophila melanogaster]
gi|442626853|ref|NP_001260251.1| CG13397, isoform B [Drosophila melanogaster]
gi|16185856|gb|AAL13967.1| LP03571p [Drosophila melanogaster]
gi|22945953|gb|AAF52672.2| CG13397, isoform A [Drosophila melanogaster]
gi|440213562|gb|AGB92787.1| CG13397, isoform B [Drosophila melanogaster]
Length = 778
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 179/532 (33%), Positives = 282/532 (53%), Gaps = 50/532 (9%)
Query: 7 AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
LPAF+G+VP AL+ + P + ++ W R+CC ++ T+ LF EI F+
Sbjct: 259 VALPAFAGHVPRALKRLNPESTFMEVQRWNQFPD--RYCCGLFVEPTENLFKEIASRFLH 316
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
+ +YG ++HI+ CD F+E PPV PEY+ S AAIY M+ D A+WL+QGW+F
Sbjct: 317 NIITKYG-SNHIFFCDPFNELEPPVAKPEYMRSTAAAIYESMRGIDPQAIWLLQGWMFVK 375
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
+PFW +A L + P G+++VLDL +E P + ++ ++G P+IWCMLHNF G + M+
Sbjct: 376 NPFWTTDMAEAFLTAAPRGRILVLDLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMF 435
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G I G EAR N+++VG G++ EGI QN V+Y E + + +D+ +W +
Sbjct: 436 GSAKLINSGIEEARRLPNSSLVGTGITPEGIGQNYVMYSFTLERGWSNTSLDLDSWFTNF 495
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
S RYG ++ AW +L ++VY+ + G+Y
Sbjct: 496 SHSRYGVKDERLEQAWLLLKNSVYSFRG--------------------LQKMRGQY---- 531
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT----YRYDLIDLT 362
V+ S P WY+ S V+ A L + + + Y +DL+D+T
Sbjct: 532 -------VVTRRPSFNQEPFTWYNASAVLDAWHLLLTFRAIIPLEDNRYEIYEHDLVDIT 584
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
RQ L A++L++N+ AY+ LS + L+L +DM+ +LA FLLG WL+ A
Sbjct: 585 RQFLQISADQLYINLRSAYRKRQVSRFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQA 644
Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
KQ A N Q++ +E+NAR QIT W + Q + DY K WSGL+ DYY PR ++ +
Sbjct: 645 KQAAPNTGQQRNFEFNARNQITAWGPDGQ-----ILDYACKQWSGLVSDYYRPRWRLFLE 699
Query: 483 YMIESLESGDGFRLKDWRREWIKLTND----WQNGRNVYPVESNGDALITSQ 530
+ +L +G F ++ +K++++ + N +VYPV G+ + SQ
Sbjct: 700 DVTVALHAGRPFNGTAFK---LKVSHEIELPFSNKDDVYPVTPVGNTWLISQ 748
>gi|158300970|ref|XP_320760.4| AGAP011750-PA [Anopheles gambiae str. PEST]
gi|157013415|gb|EAA00039.4| AGAP011750-PA [Anopheles gambiae str. PEST]
Length = 770
Score = 312 bits (799), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 178/535 (33%), Positives = 290/535 (54%), Gaps = 49/535 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
LPAF+G++P + ++P+ + W + P++ LD T+PLF IG F++
Sbjct: 250 ALPAFAGHLPVQFRTLYPNVSFANVSVWNNFP--PQYASPLFLDPTEPLFAAIGSRFLQL 307
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+K YG T H+Y D F+E P + S +Y+SS+ AIYS M D DA+WL+QGW+F +
Sbjct: 308 AIKTYG-TDHVYFSDPFNEIDPTLPSGKYLSSVSEAIYSTMVQVDPDAIWLLQGWMFVKN 366
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
PFW +++ L++VPLG+++VLDL +E P + + + G P+IWCML NF G + M G
Sbjct: 367 PFWSDRAIRSFLSAVPLGRMLVLDLQSEQYPQYGRTASYAGQPFIWCMLSNFGGTLGMLG 426
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
+ ++ G E R + T++G G++ EGI QN +Y+ EM + E + W ++Y+
Sbjct: 427 SVGNVFRGIRETRDNSTYTLLGTGITPEGINQNYALYEFALEMGWNAELDSAEQWFSEYA 486
Query: 248 VRRYGR-SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
V RYG S Q AWN+ TVY AF ++ + GKY
Sbjct: 487 VARYGNDSDERAQQAWNIFLRTVY---------------AFEGLE-----LMRGKYTFNR 526
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
+P SK P WY + LEL ++ E S + +YDL+D TRQ L
Sbjct: 527 RPSSK-----------IRPWTWYDVHTFNQGLELLLSFAEEASCNQLCQYDLVDATRQCL 575
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
A+ L+L ++++++ D S FL+L+ D+D LL ++ FLLGPWLESAK A
Sbjct: 576 QHTADALYLTLMDSFKKRDLTSFRLHSSLFLQLLSDLDVLLRTNEHFLLGPWLESAKAHA 635
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
+ + +YE+NAR QIT+W Q + DY NK W+G+++D++ PR ++ + +
Sbjct: 636 ETTLERHKYEYNARIQITLWGPQGQ-----IVDYANKQWAGMVQDFFLPRWRVFLGELDQ 690
Query: 487 SLESG---DGFRLKD--WRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+L + + +++D +R + +D ++ Y + +GD + T++ LY ++
Sbjct: 691 ALATNGTINDLKIRDKIFRTVELPFVSDSKH----YATQPSGDTVRTARTLYERW 741
>gi|195339231|ref|XP_002036223.1| GM12949 [Drosophila sechellia]
gi|194130103|gb|EDW52146.1| GM12949 [Drosophila sechellia]
Length = 778
Score = 311 bits (798), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 181/533 (33%), Positives = 283/533 (53%), Gaps = 52/533 (9%)
Query: 7 AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
LPAF+G+VP AL+ + P + ++ W R+CC ++ T+ LF EI F++
Sbjct: 259 VALPAFAGHVPRALKRLHPESTFMEVQRWNQFPD--RYCCGLFVEPTENLFKEIASRFLQ 316
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
+ +YG ++HI+ CD F+E PPV PEY+ S AAIY M+ D +A+WL+QGW+F
Sbjct: 317 NIITKYG-SNHIFFCDPFNELEPPVAKPEYMRSTAAAIYESMRGIDPEAIWLLQGWMFVK 375
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
+PFW +A L + P G+++VLDL +E P + ++ ++G P+IWCMLHNF G + M+
Sbjct: 376 NPFWTTDMAEAFLTAAPRGRILVLDLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMF 435
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G I G EAR N+++VG G++ EGI QN V+Y E + + +D+ W +
Sbjct: 436 GSAKLINSGIEEARRLPNSSLVGTGITPEGIGQNYVMYSFTLERGWSNTSLDLDGWFTNF 495
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
S RYG ++ AW +L ++VY+ + G+Y
Sbjct: 496 SHTRYGVKDERLEQAWLLLKNSVYSFRG--------------------LQKMRGQY---- 531
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN----TYRYDLIDLT 362
V+ S P WY+ S V+ A L + S + + Y +DL+D+T
Sbjct: 532 -------VVTRRPSFNQEPFTWYNASAVLDAWHLLLTSRAIIPLEDDRYEMYEHDLVDIT 584
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
RQ L A++L++N+ AY+ LS + L+L +DM+ +LA FLLG WL+ A
Sbjct: 585 RQFLQISADQLYVNLRSAYRKRQVARFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQA 644
Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
KQ A N +++ +E+NAR QIT W + Q + DY K WSGL+ +YY PR ++ +
Sbjct: 645 KQAAPNTGEQRNFEFNARNQITAWGPDGQ-----ILDYACKQWSGLVSNYYRPRWRLFLE 699
Query: 483 YMIESLES-----GDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
+ +L + G F+LK + I+L + N +VYPV G+ + SQ
Sbjct: 700 DVTVALHAGRPYNGTAFKLK--VSQEIELP--FSNKIDVYPVTPVGNTWLISQ 748
>gi|195115262|ref|XP_002002183.1| GI17241 [Drosophila mojavensis]
gi|193912758|gb|EDW11625.1| GI17241 [Drosophila mojavensis]
Length = 773
Score = 307 bits (787), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 178/538 (33%), Positives = 290/538 (53%), Gaps = 50/538 (9%)
Query: 7 AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
LPAFSG VP A++ VFP+A TQ W DP +CC ++ DPLF ++G F+
Sbjct: 248 VALPAFSGYVPTAMRRVFPNASFTQSDRWNHF-PDP-YCCVLFVEPQDPLFQQVGAMFLR 305
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
+ ++ YG ++HIY D F+E P V P Y+ AIY+ MQ D+DAVWL+QGW+F
Sbjct: 306 RVIQVYG-SNHIYFSDPFNEMMPRVREPNYVRYTAKAIYNSMQVVDADAVWLIQGWMFLK 364
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
+W ++A L +VP G+++ LDL +E P + + +YG P++WCML+NF GN+ ++
Sbjct: 365 SVYWTNDLIEAYLTAVPRGRILALDLQSEQFPQYERTHSYYGQPFVWCMLNNFGGNLGLF 424
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G I G + AR+ N +MVGVG++ EGI QN ++ L E A+ +++ ++ W +
Sbjct: 425 GSAQLIPSGIIAARSMPNGSMVGVGITPEGIGQNYALFALTLEQAWSPDELQLEDWFEYF 484
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
++ RYG + + W +L +VY+ + R+ + GKY
Sbjct: 485 ALTRYGVNDTRLSQVWQLLRESVYSF------QGRERM--------------RGKY---- 520
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT----YRYDLIDLT 362
L S + +P +WY+ + V A L + + + ++ Y +DL+D+T
Sbjct: 521 -------TLNKRPSLHHYPWVWYNVTMVYEAWRLMLEAKETVPLNDNRRAIYEHDLVDIT 573
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
RQ L + ++N+ A + + V L+ + LEL DM+ +LA + +LLG WLE+A
Sbjct: 574 RQCLQLSFDRFYVNLKSACRHKQLNRVEYLAGKLLELFADMERILASGEHYLLGNWLEAA 633
Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
K+LA +EEQ YE+NAR Q+T W N Q + DY K WSGL+ DY+ PR ++ +
Sbjct: 634 KRLAPSEEQRPIYEFNARNQLTSWGPNYQ-----IPDYATKQWSGLMSDYFQPRWNMFLE 688
Query: 483 YMIESLESGDGFRLKDWRREWIKLTND----WQNGRNVYPVESNGDALITSQWLYNKY 536
+I++L++ F ++++ ++ N+ + N YP G S +Y K+
Sbjct: 689 AVIQALKTQTPFNYSEFKQ---RVENEIELPFSNHTKAYPTSPVGSTWNISHDIYEKW 743
>gi|301107007|ref|XP_002902586.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
infestans T30-4]
gi|262098460|gb|EEY56512.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
infestans T30-4]
Length = 736
Score = 305 bits (781), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 180/533 (33%), Positives = 279/533 (52%), Gaps = 52/533 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
LPAF+G+VP L+ P+A TQ NW + + CC ++++ TD L+ EIG+ F+++
Sbjct: 235 ALPAFAGHVPEELKLRLPNAHFTQSPNWGNFSEEH--CCVFMIEPTDALYREIGKNFLKE 292
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + Y TS +Y CDT+ E P + A+ GM + D +AVWLMQGW F D
Sbjct: 293 QRELYNYTSSLYQCDTYMEMAPEFTDLTELEGAARAVIDGMTAADPNAVWLMQGWPFVDD 352
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P++KA L+ VP KL++LD ++E PIWS ++G +I+ +LHNF GN M
Sbjct: 353 PHFWTKPRVKAYLDGVPTDKLIILDFYSESVPIWSKMDNYFGKSWIYSVLHNFGGNTGMR 412
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G L ++A PV A + N TMVGVG++MEGI QN +VYDL +MA+ +DV WI QY
Sbjct: 413 GDLLTLATAPVLANWAGNGTMVGVGLTMEGIFQNYIVYDLTLQMAWVDNPLDVNTWIPQY 472
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ +RY ++ AW+ L +VYN T +A+ V S++ + Y
Sbjct: 473 AAQRYHTHNEHVEQAWSYLLRSVYNRT-----------LAYGGVTKSLVCLIPHWRLLYD 521
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
+ + ++K Y ++V+ A + + + NEL +TYR+DL+D+T+Q L
Sbjct: 522 R--FQPTLIK------------YDPNDVVLAWKELLLAENELRDVDTYRHDLVDVTKQFL 567
Query: 367 AKYANELFLNIIEAYQLNDA--HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
+ E ++++ Y A + V L++ L +E ++ +LA ++ FLLG W+
Sbjct: 568 SNKLLEQYIHLKGIYNAKKASPNEVCGLTKTMLTTMERLEEILATNEDFLLGNWI----- 622
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
AR Q+T W DN E + DY K W+GL++ YY PR ++ +
Sbjct: 623 --------------ARNQVTRWGDNNNEA---IHDYAGKEWAGLVKGYYIPRWTMWLSEV 665
Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
+ K + + I WQ G YP + GDA S+ YN+Y+
Sbjct: 666 CNAYTDKREMNEKALKEKRIAFELKWQLGHESYPTTTVGDAFTISKRFYNEYI 718
>gi|195398029|ref|XP_002057627.1| GJ18000 [Drosophila virilis]
gi|194141281|gb|EDW57700.1| GJ18000 [Drosophila virilis]
Length = 766
Score = 305 bits (780), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 173/538 (32%), Positives = 291/538 (54%), Gaps = 50/538 (9%)
Query: 7 AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
LPAF+G+VP A++ VFP+A T W + ++CC ++ DPLF ++G F+
Sbjct: 250 VALPAFAGHVPTAMRRVFPNANYTPAERWNNFPD--QYCCDLFVEPHDPLFQQLGAMFLR 307
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
+ ++ YG ++HIY D F+E PP+ P Y+ S AIY+ M+ D +AVWL+QGW+F
Sbjct: 308 RVIQVYG-SNHIYFSDPFNEMQPPLAEPGYMRSTAKAIYNSMREVDGNAVWLLQGWMFLK 366
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
D FW ++A L +VP G+++VLDL +E P + + +YG P++WCML+NF G + ++
Sbjct: 367 DIFWTDELIEAFLTAVPRGRILVLDLQSEQFPQYQRTHSYYGQPFVWCMLNNFGGTLGLF 426
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G I G AR N+++VGVG++ EGI QN ++ L E + ++ + W + +
Sbjct: 427 GSAQFIGSGIASARIMPNSSLVGVGITPEGIGQNYAIFALTLEQGWSASELQLGDWFDHF 486
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
++ RYG + + AW +L VY+ + GKY
Sbjct: 487 ALTRYGVNDTRLAQAWQLLRGGVYSFHG--------------------LQRMRGKY---- 522
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN----TYRYDLIDLT 362
L +P WY+ S V A +L +AS + ++ Y +DL+D+T
Sbjct: 523 -------ALNRRPGLNLNPWTWYNGSSVTDAWQLLLASREMVPLTDDRYAIYEHDLVDIT 575
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
RQ L + +++++N+ AY+ + + L+ + LEL++DM+ +LA +LLG WLE+A
Sbjct: 576 RQFLQQSFDQIYVNLRSAYRKEQLNRLEYLAGKLLELLDDMERILASGVHYLLGTWLEAA 635
Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
K+LA +++ YE+NAR Q+T W N Q + DY K WSGL+ DYY PR A++
Sbjct: 636 KKLAPSDKLRPLYEFNARNQLTSWGPNGQ-----ILDYATKQWSGLMCDYYQPRWAMFLD 690
Query: 483 YMIESLESGDGFRLKDWRREWIKLTND----WQNGRNVYPVESNGDALITSQWLYNKY 536
+ ++++ F D+++ ++ N+ + N +YP + G+ + S +Y K+
Sbjct: 691 AVTRAMQTHRPFNATDFKQ---RVANEIELPFSNLTKMYPTKPMGNTWLISNDIYIKW 745
>gi|195050088|ref|XP_001992825.1| GH13491 [Drosophila grimshawi]
gi|193899884|gb|EDV98750.1| GH13491 [Drosophila grimshawi]
Length = 771
Score = 303 bits (777), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 179/536 (33%), Positives = 295/536 (55%), Gaps = 50/536 (9%)
Query: 9 LPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQ 68
LPAF+G+VP L +FP+A T + W DP +CC ++ +DPLF +G F+ +
Sbjct: 251 LPAFAGHVPTGLPRIFPTANFTSVERWNQFP-DP-YCCALFIEPSDPLFQLVGAQFLRRV 308
Query: 69 LKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP 128
++ YG ++HIY D F+E P + P YISS AIY+ M+ D D VWL+QGW+F +
Sbjct: 309 IQIYG-SNHIYFSDPFNEMQPRIAEPGYISSTARAIYNSMRMVDKDPVWLLQGWMFLDNA 367
Query: 129 FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI 188
+W ++A L +VP G+++VLDL +E P + + +YG P++WCML+NF G + M+G
Sbjct: 368 YWSDELIEAFLTAVPRGRMLVLDLQSEQFPQYQRTFSYYGQPFVWCMLNNFGGTLGMFGS 427
Query: 189 LDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSV 248
I G + AR+ N++MVGVG++ EGI QN ++ L E + K+++ W +Q+++
Sbjct: 428 AHLINAGIMAARSMPNSSMVGVGITPEGIGQNYALFALTLEQGWSGSKLELSDWFDQFTL 487
Query: 249 RRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKP 308
RYG + + AW +L +VY+ + GKY
Sbjct: 488 TRYGVNDTDLILAWQLLRGSVYHFHG--------------------LQRMRGKY------ 521
Query: 309 VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN----TYRYDLIDLTRQ 364
L S P +WY+ S V+ A +L +A+ + + Y++DL+D+TRQ
Sbjct: 522 -----ALNKRPSFNLKPWIWYNASSVVEAWQLLLAANQTIPVEDDRYALYKHDLVDITRQ 576
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
L + +++++N+ AY+ + L+ + LEL+ DM+ +LA + +LLG WLE+AK+
Sbjct: 577 FLQQSFDQVYVNLKSAYRKSQLARFEYLAAKLLELLADMERILASGEHYLLGNWLEAAKE 636
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
LA + +Q YE+NAR Q+T W + Q + DY K WSGL++DYY PR +++ +
Sbjct: 637 LAPSADQRHIYEFNARNQLTAWGPSNQ-----ILDYATKQWSGLMQDYYTPRWSMFLDAV 691
Query: 485 IESLESGDGFRLKDWRREWIKLTND----WQNGRNVYPVESNGDALITSQWLYNKY 536
++ S F +R+ ++ N+ + N VYP E G + SQ +++K+
Sbjct: 692 TLAMHSKRPFNATAFRQ---RVANEIELPFSNLTKVYPTEPVGSTWLISQEIHDKW 744
>gi|195473052|ref|XP_002088810.1| GE10991 [Drosophila yakuba]
gi|194174911|gb|EDW88522.1| GE10991 [Drosophila yakuba]
Length = 778
Score = 299 bits (766), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 177/533 (33%), Positives = 281/533 (52%), Gaps = 52/533 (9%)
Query: 7 AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
LPAF+G+VP AL+ + P + ++ W ++CC ++ + LF EI F++
Sbjct: 259 VALPAFAGHVPRALKRLNPDSTFMEVQRWNQFPD--QYCCGLFVEPKENLFNEIALNFLQ 316
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
+ + YG ++HI+ CD F+E PPV PEY+ S AAIY M+ D A+WL+QGW+F
Sbjct: 317 KIITIYG-SNHIFFCDPFNELEPPVAKPEYMRSTSAAIYESMRRIDPQAIWLLQGWMFVK 375
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
+PFW +A L + P G+++VLDL +E P + ++ ++G P+IWCMLHNF G + M+
Sbjct: 376 NPFWTTDMAEAFLTAAPRGRILVLDLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMF 435
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G I G EAR N+++VG G++ EGI QN V+Y E + ++ +D+ +W +
Sbjct: 436 GSAKLINSGIEEARRLPNSSLVGTGITPEGIGQNYVMYSFTLERGWSNKPLDLDSWFTNF 495
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
S RYG ++ AW L ++VY+ + G+Y
Sbjct: 496 SHTRYGVKDERLEQAWLQLKNSVYSFRG--------------------LQKMRGQY---- 531
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN----TYRYDLIDLT 362
V+ S P WY S V+ A L ++S + + Y +DL+D+T
Sbjct: 532 -------VVTRRPSFNQEPFTWYDASAVLDAWHLLLSSRAIIPLEDDRYEMYEHDLVDIT 584
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
RQ L A++L++N+ A++ LS + L+L +DM+ +LA FLLG WL+ A
Sbjct: 585 RQFLQISADQLYVNLRSAFRKRQVTRFEYLSTKLLKLFDDMELILASSRNFLLGNWLQQA 644
Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
K+ A + ++ +E+NAR QIT W + Q + DY K WSGL+ DYY PR ++ +
Sbjct: 645 KRAAPSPGEQTNFEFNARNQITAWGPDGQ-----ILDYACKQWSGLVSDYYRPRWRLFLE 699
Query: 483 YMIESLES-----GDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
+ +L S G F+LK + I+L + + +VYPV G+ + SQ
Sbjct: 700 DVTVALHSRRPFNGTAFKLK--VSQEIELP--FSHKVDVYPVTPVGNTWLISQ 748
>gi|330791218|ref|XP_003283691.1| hypothetical protein DICPUDRAFT_26247 [Dictyostelium purpureum]
gi|325086434|gb|EGC39824.1| hypothetical protein DICPUDRAFT_26247 [Dictyostelium purpureum]
Length = 712
Score = 298 bits (764), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 176/538 (32%), Positives = 285/538 (52%), Gaps = 52/538 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP F+G++P A+Q +FP+A ++ L W T+ LD +DPLF +I + FI +
Sbjct: 207 VLPGFAGHIPGAIQTLFPTANVSILSTWCEFNG------TFYLDPSDPLFGKITQLFITE 260
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW-LFSY 126
+ +G T H YN D F+E PP ++ +Y+ M + D AVW++QGW + Y
Sbjct: 261 LIGVFG-TDHYYNFDPFNELAPPSSDLGFLKQTSQQMYNNMLAADPKAVWVLQGWFIVDY 319
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW+ Q +A + VP+G +VLDL+++V P W+ ++ FYG ++WCMLHNF G MY
Sbjct: 320 PEFWQANQTQAWFSGVPIGGFIVLDLWSDVAPAWNITEYFYGHYWLWCMLHNFGGRSGMY 379
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + IA P+ AR S + M+G G++ E IEQN VVYDLMSEMA++ D++ WI QY
Sbjct: 380 GRIPFIATNPIIAR-SLSDNMMGTGLTPEAIEQNVVVYDLMSEMAWRSTAPDLEEWITQY 438
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ RRYG+ +P + + W + TV+N T +N +F + PSI N+G
Sbjct: 439 TNRRYGKIMPEVVEVWMSMVDTVFNATAYWARRNMGAPESFIALRPSI---------NFG 489
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELF-IASGNELSASNTYRYDLIDLTRQA 365
+++Y S + A +F + + + + ++ T+++D+ ++T QA
Sbjct: 490 D------------------NVFYDPSVMFNAWHVFSLVNDSYVISTETFQFDISEITMQA 531
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
L+ + + + N+I++Y ++D + S +E + MD + + LG W A+
Sbjct: 532 LSNFFMDTYFNLIKSYNVSDIESFQRESITMMETISFMDLIASTQPELQLGVWTYRARLW 591
Query: 426 A--QNEEQEKQ---------YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 474
A NE Q YE+NAR Q+T+W S+L DY K W GL+ D+YG
Sbjct: 592 AYPDNETPSLQNSSNSATLPYEFNARNQLTLW----GPSDSVLHDYAFKLWGGLISDFYG 647
Query: 475 PRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWL 532
PR ++ K +++SLE+ F ++ L W +YP+ G TS+++
Sbjct: 648 PRWNLFLKTLLQSLENRIPFDANNFISNVQALEQQWVLESTIYPILPFGQGYNTSRYI 705
>gi|440799253|gb|ELR20308.1| alpha-N-acetylglucosaminidase family protein [Acanthamoeba
castellanii str. Neff]
Length = 854
Score = 295 bits (754), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 175/549 (31%), Positives = 279/549 (50%), Gaps = 67/549 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LP F+G VP ++ ++P+A +T+ +W ++ Y L D L+ IGR I +
Sbjct: 277 ILPGFAGFVPDGIKRIYPTANLTKSADWAGFPH--QYTNVYFLSPLDSLYKTIGRMVIRR 334
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
E+G T HIYN DTF+E +PP P Y+++ A+Y GM + D A+W+MQGW F +D
Sbjct: 335 VTAEFG-TDHIYNADTFNEMSPPSADPTYLAAASRAVYEGMAAEDPQALWVMQGWSFVFD 393
Query: 128 PFWR-PPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW ++++ L+ V +++LDL ++ P WS + ++G ++WCMLHN G +Y
Sbjct: 394 KFWEDKSRVRSYLSGVSDKDMLILDLASDNNPEWSKTDSYFGKEFVWCMLHNGGGVRGLY 453
Query: 187 GILDSIAFGPVEARTSENTTM--------VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVD 238
G L + P+ A + TM VGVGM+ME IEQNPVVY+LMSEM ++ E D
Sbjct: 454 GNLTQYSSDPLLALATPGNTMLICGTCEQVGVGMTMEAIEQNPVVYELMSEMGWRSEAFD 513
Query: 239 VKAWINQYSVRRYGRS--VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 296
+ W+ +Y+ RRYG + + ++ +AW +L Y N+ VI D
Sbjct: 514 IVEWVQRYAERRYGLAAGLSSVGEAWELLREATY---------NQSVI------DYGWFG 558
Query: 297 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASG--NELSASNTY 354
T G YG + ++ + AL LF+ S + + +
Sbjct: 559 FTPGLGMGYGGVA--------------------NAAKEVEALRLFLQSALTKGYAPNGPW 598
Query: 355 RYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF------QLSRRFLELVEDMDGLLA 408
+YD +DLTRQ LA +++ AY AH + L L L+ D+D +LA
Sbjct: 599 QYDCVDLTRQVLANTFRDIYAQFDAAYSAYAAHKTYTVDQLKSLGSALLTLIGDIDEILA 658
Query: 409 CHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGL 468
+ +LLG W++SA A +Q Y++NAR QIT+W + Q + DY K+W+ L
Sbjct: 659 TNPNYLLGTWIQSALSWADTPDQALHYQFNARNQITLWGPDGQ-----ITDYATKHWADL 713
Query: 469 LRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALIT 528
+R YY PR ++ +++++ +G +++R E ++L W Y G+ L
Sbjct: 714 VRSYYQPRWTLFITSVLQAVYAG-----REYRGELLQLEQKWNRENTTYATTPTGNTLQV 768
Query: 529 SQWLYNKYL 537
+ L KYL
Sbjct: 769 AYKLAAKYL 777
>gi|195454475|ref|XP_002074254.1| GK18384 [Drosophila willistoni]
gi|194170339|gb|EDW85240.1| GK18384 [Drosophila willistoni]
Length = 743
Score = 291 bits (744), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 168/503 (33%), Positives = 268/503 (53%), Gaps = 43/503 (8%)
Query: 7 AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
LPAF+G+VP AL+ +FP A T+ W + +CC ++ +PLF ++ F+
Sbjct: 267 VALPAFAGHVPRALRRIFPQANFTETERWNRFPNA--YCCDLFVEPQEPLFRQLATTFLR 324
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
+ + YG ++HI+ CD F+E PPV +++ S AAIY+ M+ D A+WL+QGW+F
Sbjct: 325 RVTQRYG-SNHIFFCDPFNELEPPVSQADFMRSTAAAIYASMREVDPKAIWLLQGWMFVK 383
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
+ FW ++A L +VP G L+VLDL +E P + +K +YG P++WCMLHNF G + M
Sbjct: 384 NIFWTDELIEAFLTAVPQGNLLVLDLQSEQFPQYQRTKSYYGQPFVWCMLHNFGGTLGML 443
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G ++ + G AR N++MVG G++ EGI QN V+Y E + K+D W +
Sbjct: 444 GSVELVNSGMDLARQMPNSSMVGAGITPEGIGQNYVMYSFALERGWSDRKLDSAGWFTHF 503
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
++ RYG + AW +L +VY T K R GKY
Sbjct: 504 ALTRYGVQDERLNQAWQLLRTSVY--TFHGLQKMR------------------GKYTITR 543
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT----YRYDLIDLT 362
+P + P WY+ + V+ A +L +++ + + + Y++DL+D+T
Sbjct: 544 RPAINLS-----------PFTWYNVTHVLEAWQLMLSARSIIPLDDNRYDIYQHDLVDIT 592
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
RQ L A++L++N+ +Y+ L + LEL++D++ +L FLLG WLE+A
Sbjct: 593 RQYLQITADQLYVNLNSSYRKRQLARFVYLGNKLLELLDDLERILGSGSNFLLGTWLEAA 652
Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
K LA E + +E+NAR QIT W N + + DY K WSG++ DYY PR A +
Sbjct: 653 KLLAPTVEDQSNFEFNARNQITTWGPNGE-----ILDYACKQWSGMISDYYRPRWARFLD 707
Query: 483 YMIESLESGDGFRLKDWRREWIK 505
+ +L+S F +++ +K
Sbjct: 708 DVTLALQSNQPFNASAYKQHVLK 730
>gi|66801665|ref|XP_629757.1| hypothetical protein DDB_G0291998 [Dictyostelium discoideum AX4]
gi|60463162|gb|EAL61355.1| hypothetical protein DDB_G0291998 [Dictyostelium discoideum AX4]
Length = 798
Score = 289 bits (740), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 173/537 (32%), Positives = 282/537 (52%), Gaps = 50/537 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP F+G++P A+Q +FP A I+ L W + T+ L++TDPLF +I FI +
Sbjct: 292 VLPGFAGHIPGAIQQLFPQANISVLSTWCNFNG------TFYLESTDPLFAKITTMFIGE 345
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+ +G T H YN D F+E PP + +Y+ ++Y + D AVW++QGW
Sbjct: 346 LIDVFG-TDHFYNFDPFNELEPPSNDTDYLRQTSQSMYENVLLADPKAVWVLQGWFIVDA 404
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW+ Q +A + VP+G ++VLDL+++V P W+T+ +YG ++WCMLHNF G MY
Sbjct: 405 PEFWQAKQTEAWFSGVPIGGVLVLDLWSDVIPGWTTTNYYYGHYWVWCMLHNFGGRSGMY 464
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G L I+ P+ AR + MVG+G++ E IEQN VVYD+MSEM+++ + ++ W+ QY
Sbjct: 465 GRLPWISSNPITAR-GLSPNMVGIGLTPEAIEQNVVVYDMMSEMSWRSVQPNLTEWVTQY 523
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ RRYG+ VP I D W L +TV+N T N +F + P + +G
Sbjct: 524 THRRYGKLVPEIVDVWISLVNTVFNATAATARANMGAPESFIALRPQL---------TFG 574
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
+S+ +P++ Y+ V ++ + ++ T+ +D+ + T Q+L
Sbjct: 575 N------------NSFYNPNILYNAWNVFSMVD-----DEYVISTETFEFDISEFTMQSL 617
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
+ Y + + +IEA+ +D + +S L+++ MD + + LG W A+ A
Sbjct: 618 SNYFMDQYFLLIEAFNASDVQTLSTISIELLDIINYMDEIASTQSSLQLGLWTYRARLWA 677
Query: 427 ---------QNEEQEKQ--YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
QN YE+NAR +T+W S+L DY K WSGL+ D+Y P
Sbjct: 678 YPTNDIPTLQNSSNSNTAPYEFNARNVLTLW----GPSNSVLHDYAFKLWSGLVSDFYSP 733
Query: 476 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWL 532
R ++ K +++S+E+ F + + R L W + +YP G A TS+++
Sbjct: 734 RWQLFLKSLVQSVENRKPFNKESFNRMVENLEEQWVVQQTIYPTVPVGQAYNTSKYI 790
>gi|198433857|ref|XP_002122480.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 880
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 167/423 (39%), Positives = 236/423 (55%), Gaps = 37/423 (8%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L VLP F+G++P+A+ N++P A + QL +W + CTYLL DPLF IG
Sbjct: 243 LGMIPVLPGFAGHIPSAILNLYPKADVIQLSHWSHFNCT--YSCTYLLQPHDPLFNTIGS 300
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
FI++Q+ EY T+HIYN DTF+E TPP P Y+S+ A+Y M D DAVWLMQGW
Sbjct: 301 MFIKEQMLEYNGTNHIYNADTFNEMTPPSSDPGYLSNASRAVYDAMAVADPDAVWLMQGW 360
Query: 123 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
LF ++P FW+ Q KALL VP GK++VLDLF+E P + ++G P++WCMLH+F G
Sbjct: 361 LFHHEPTFWKTAQKKALLTGVPKGKMLVLDLFSESYPQY-LPDWYFGQPFLWCMLHDFGG 419
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
N+ YG ++++ P A TS N+TMVG G++ EGI QN ++YD M E F V+V
Sbjct: 420 NMGFYGKINTVNTQPGIALTSVNSTMVGTGVTPEGINQNYMIYDFMLETGFTVHSVNVTN 479
Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
W+ +Y++RRY S P WN+L +T+YN D FP S+I
Sbjct: 480 WLKEYTMRRYNTSSPEAIKTWNILGNTIYN----------DTKPGFP--SKSLIR----- 522
Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHL--WYSTSEVIRALELFIASGNELSASNTYRYDLI 359
G PV + + D+P L WY S + A + F S N L T RYD +
Sbjct: 523 ----GSPVKRPTL--------DNPGLPYWYQYSSLALAWDNFSQSLNTLKDLETVRYDAV 570
Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
D+TRQ L L+ ++E + G +L + L+L++D D +L F +G W+
Sbjct: 571 DITRQMLQAVHRLLYYAMVEEFLWKRDPG--KLGEQLLDLLDDFDKMLCSDAHFSMGKWI 628
Query: 420 ESA 422
+ A
Sbjct: 629 QDA 631
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/197 (29%), Positives = 103/197 (52%), Gaps = 8/197 (4%)
Query: 341 FIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELV 400
F S N L T RYD +D+TRQ L L+ ++E + G +L + L+L+
Sbjct: 684 FSQSLNTLKDLETVRYDAVDITRQMLQAVHRLLYYAMVEEFLWKRDPG--KLGEQLLDLL 741
Query: 401 EDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDY 460
+D D +L F +G W++ AK L E++ YE+NAR Q+T+W N + + DY
Sbjct: 742 DDFDKMLCSDAHFSMGKWIQDAKILGTTAEEKDLYEYNARIQVTLWGPNGE-----ILDY 796
Query: 461 GNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE-WIKLTNDWQNGRNVYPV 519
+K+W L++ YY PR A++ Y+ + + F K + + + + + R+V+P
Sbjct: 797 ASKHWCSLVKHYYRPRWALFVSYLNHAYATKSKFDHKAFASDVFTNVEEPFTKDRSVFPS 856
Query: 520 ESNGDALITSQWLYNKY 536
+ G+A+ ++ +Y K+
Sbjct: 857 TATGNAIELAKDMYIKW 873
>gi|281200618|gb|EFA74836.1| alpha-N-acetylglucosaminidase [Polysphondylium pallidum PN500]
Length = 469
Score = 278 bits (712), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 169/484 (34%), Positives = 259/484 (53%), Gaps = 57/484 (11%)
Query: 7 AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
AVLP F+G+VP AL++ +P+A ITQL +W Y+ + + F+
Sbjct: 37 AVLPGFAGHVPEALKSHYPNANITQLSSWN--------MTVYIHQSPN--------TFMS 80
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
Q YG T H YN D F+E PP P Y+ + ++++ + + D +W++QGWLF Y
Sbjct: 81 IQQDLYG-TDHFYNFDPFNELEPPSSDPAYLKNCSQSMFNNLIAVDPQGIWVLQGWLFVY 139
Query: 127 DP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
D FW+PPQ++A L+ VP+GK++VLDL+A+V W + FYG +IWCMLHNF G M
Sbjct: 140 DTEFWQPPQIEAFLSGVPIGKMIVLDLWADVDAGWKITNYFYGHNWIWCMLHNFGGRSGM 199
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG + I+ P+EAR S + MVG G++ E IEQN +VYDLMSEMA++ D+K W++Q
Sbjct: 200 YGKIPFISTNPIEAR-SLSPNMVGTGLTPEAIEQNVIVYDLMSEMAWRSTPPDLKEWVDQ 258
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
Y RRYG+ + + D W L TV+NC+ VT+G
Sbjct: 259 YVTRRYGKYIEVLADTWYELVGTVFNCS----------------------IVTKGPVTIL 296
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGN-ELSASNTYRYDLIDLTRQ 364
VS L TS Y P + + +A F++ + + ++T+ +DL ++T Q
Sbjct: 297 ---VSVRPQLNFTTSLYYDPIV------ISKAWSAFLSIDDLHVVNTSTFSFDLTEITTQ 347
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
AL+ L + A+ LND++ F LS L +++D++ +++ + L+G W A+
Sbjct: 348 ALSNLFMTTELQMNAAF-LNDSYEEFSLLSDALLSIIQDINTIVSTQEMLLVGNWTARAR 406
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
L E + YE NAR QIT+W S DY K W GL D+Y R ++ +
Sbjct: 407 ALTPANETTELYEMNARNQITLW----GPPDSFDHDYAYKLWGGLTEDFYLARWTLFSQS 462
Query: 484 MIES 487
+ ++
Sbjct: 463 IFKT 466
>gi|288927792|ref|ZP_06421639.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
(NAG) [Prevotella sp. oral taxon 317 str. F0108]
gi|288330626|gb|EFC69210.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
(NAG) [Prevotella sp. oral taxon 317 str. F0108]
Length = 734
Score = 278 bits (711), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 174/541 (32%), Positives = 275/541 (50%), Gaps = 61/541 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP L+ +P AKIT L W + R C + LD D LF +I + +I++
Sbjct: 235 VLPAFAGHVPEILKEKYPKAKITPLSIWGDFEDQYR--C-HFLDPFDSLFTDIQKTYIDE 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K YG T HIY D F+E PP PEY+++ A IY +++ DS AVWL W+FSY
Sbjct: 292 QTKLYG-TDHIYGVDPFNELAPPSWEPEYLANASAKIYDVLKNADSKAVWLQMTWMFSYQ 350
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W ++K+ + +VP K ++LD +AE +W S+ +Y P+IWC L NF GN +
Sbjct: 351 RKDWTDERIKSYITAVPDKKQILLDYYAERTEVWKFSESYYKQPFIWCYLGNFGGNTMIA 410
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + + EA + +MVGVG ++EG + NP++YD + E + + + + W Q+
Sbjct: 411 GNIAEVDRRLNEAFANAE-SMVGVGSTLEGFDVNPIMYDFVFEKVWHKDGISLHDWTVQW 469
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYN----CTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
+ RR G + + AW +L +Y CT+G R PS+
Sbjct: 470 AQRRVGTTDENAEKAWKLLIDKIYVQYSLCTEGTLTNAR----------PSLTGHGNWTT 519
Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 362
+N+ K Y+ +++ A L + S + Y+YD++++
Sbjct: 520 KNWTK---------------------YNNRDLLEAWGLLLRS--KAITKIAYKYDIVNIG 556
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
RQ L Y L +AY+ D + L L+ D++ LL FLLGPWL +A
Sbjct: 557 RQVLGNYFTVLRDEFTQAYERKDISALTIKGNEMLSLLNDLEALLYTSPSFLLGPWLTNA 616
Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
+ + +N E+ + YE NAR IT W +TQ A L DYGN+ W+GLL+ YY PR ++ +
Sbjct: 617 QNMGRNMEESRYYEKNARNIITNW--STQGVA--LNDYGNRTWAGLLQGYYTPRWKMFIE 672
Query: 483 YMIESLESGDGF-------RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 535
+I +++ F ++ D +WI T + YP+++ GD+ + + Y+K
Sbjct: 673 EVISAVKQNKEFNNETFFKKVTDEEWQWISKTEN-------YPIQATGDSYLLANKFYHK 725
Query: 536 Y 536
Y
Sbjct: 726 Y 726
>gi|340617022|ref|YP_004735475.1| alpha-N-acetylglucosaminidase [Zobellia galactanivorans]
gi|339731819|emb|CAZ95084.1| Alpha-N-acetylglucosaminidase, family GH89 [Zobellia
galactanivorans]
Length = 747
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 167/514 (32%), Positives = 271/514 (52%), Gaps = 27/514 (5%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL AF+G+VP L+ ++P A I Q+ +W V+ TY LD TD LF +IG AFI++
Sbjct: 234 VLQAFTGHVPQVLKKLYPEANIFQIEDWAGVEG------TYFLDPTDELFQKIGTAFIKK 287
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YG T H+Y+ D F E PP P ++ + ++Y M+ DS A W++QGW F +
Sbjct: 288 QTELYG-TDHLYDADCFIEVDPPSKDPAFLKQVSESVYKSMELADSKATWVLQGWFFFFK 346
Query: 128 PFWRPPQM-KALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
+ + +A L+ +P + +VLDL+ E P W + FYG P+IW ++ N + M
Sbjct: 347 KDFWTKERGRAFLDGIPKNRAIVLDLYGEKNPTWDKTDAFYGQPWIWNVICNEDQKVNMS 406
Query: 187 GILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
G L+ + EA TSE + G+G+ EG+ NP+V D + E A+ +KV+V+ WI
Sbjct: 407 GDLEEMQRQFQEAYTSEIGNNLKGIGVIPEGLGYNPIVQDFIFEKAWDPQKVNVQEWIED 466
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
Y+ RYG P+++ AW +L +VY G T ++ P ++ EG ++
Sbjct: 467 YATIRYGTKSPSVKKAWQLLGESVY----GRTRTMWSPLI----TTPRLMIFEEGSKEDI 518
Query: 306 GKPVSKEAVLKSETSSYDHPHLW-YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+ V K+ + +ET P W + ++ +A L + NEL TY +DL ++ R+
Sbjct: 519 -RHVRKDFKI-TETD----PFAWDFDVYKLAKAAGLLLGEANELQDVETYNFDLTNVYRE 572
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
L ++ ++ AYQ D + + ++ +L++D++ + ++ FLLG WLE AK
Sbjct: 573 LLFSLTHKSINDVSVAYQEKDRQALDRSAKSLFKLMDDLEAITGANENFLLGKWLEDAKS 632
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
E+++ YEWNART +T+W + LRDY K W+GL YY PR ++ ++
Sbjct: 633 WGSTPEEKEYYEWNARTIVTIWQPYPE---GGLRDYAGKQWNGLFSGYYKPRWQLFVDHL 689
Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYP 518
SL G F K + E ++ W +YP
Sbjct: 690 RRSLTEGVDFDPKAYDAEVREMDYKWTRSHQIYP 723
>gi|440792549|gb|ELR13759.1| peptidase, S8/S53 subfamily protein [Acanthamoeba castellanii str.
Neff]
Length = 981
Score = 275 bits (704), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 168/531 (31%), Positives = 263/531 (49%), Gaps = 71/531 (13%)
Query: 20 LQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIY 79
++ ++P+A +T+ +W ++ Y L D L+ IG I +E+G T HIY
Sbjct: 434 IKRIYPTANLTKSADWAGFPH--QYTNVYFLSPLDSLYKTIGSKVIRLVEEEFG-TDHIY 490
Query: 80 NCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALL 139
N DTF+E +PP P Y+++ A+Y GM + D A+W+MQGW F +DPFW ++KA L
Sbjct: 491 NADTFNEMSPPSADPTYLAAASRAVYEGMATQDPQALWVMQGWSFVFDPFWTKDRIKAYL 550
Query: 140 NSVPLGKLVVLDLFAEVKPIWSTSKQF----YGVPYIWCMLHNFAGNIEMYGILDSIAFG 195
+ V +++LDL ++ P W+ + QF +G ++WCMLHN G +YG L +
Sbjct: 551 SGVDNSDMLILDLASDNSPEWNKTGQFRDSYFGKEFVWCMLHNGGGVRGLYGNLTQYSSD 610
Query: 196 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 255
P+ A + TMVGVGM+ME IEQNPVVY+LMSEM ++ E D+ W+ +Y+ RRYG +
Sbjct: 611 PLIALATPGNTMVGVGMTMEAIEQNPVVYELMSEMGWRSEAFDIVEWVQRYAERRYGLAT 670
Query: 256 PA--IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 313
+ + +AW +L YN + +D + +G +
Sbjct: 671 GSSPVGEAWELLREATYNQS---------------GLDAGLFGFAPALGMGHGGTSN--- 712
Query: 314 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNE--LSASNTYRYDLIDLTRQALAKYAN 371
+T EV AL LF+ S + + ++YD +DLTRQ LA N
Sbjct: 713 ----------------ATKEV-EALRLFLQSAQTEGYAPNGPWQYDCVDLTRQVLANTFN 755
Query: 372 ELFLNIIEAY-----QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
+++ + AY +D L+ L ++ D+D LLA + +LLG W++ A A
Sbjct: 756 DVYSQLDAAYTSYATNKSDTLPFLPLAAELLGIISDLDRLLATNPNYLLGTWIKDAVSWA 815
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
EQ Y++NAR QIT+W + Q + DY K+W+GLL ++
Sbjct: 816 SIPEQALHYQFNARNQITLWGPDGQ-----ISDYATKHWAGLL---------------MK 855
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
++ +G F + E ++L W YP GD L + + KYL
Sbjct: 856 AVGAGVMFNSTAYGTELLQLEQKWNQENTTYPTTPTGDTLQVALRISQKYL 906
>gi|297273081|ref|XP_001095618.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Macaca mulatta]
Length = 691
Score = 275 bits (702), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 157/402 (39%), Positives = 226/402 (56%), Gaps = 50/402 (12%)
Query: 89 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 147
PP +P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L
Sbjct: 286 PPSSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRL 345
Query: 148 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 207
+VLDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 346 LVLDLFAESQPVYTLTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTM 405
Query: 208 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLY 266
VG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ +RYG S P AW +L
Sbjct: 406 VGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLL 465
Query: 267 HTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 325
+VYNC+ + NR +V P L+ TS
Sbjct: 466 RSVYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS----- 495
Query: 326 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY---- 381
+WY+ S V A L + S L+AS +RYDL+DLTRQA+ + + + AY
Sbjct: 496 -VWYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKE 554
Query: 382 --QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 439
L A GV EL+ +D LLA FLLG WLE A+ A +E + YE N+
Sbjct: 555 LTSLLRAGGVLAY-----ELLPALDELLASDSRFLLGSWLEQARAAAVSEAEADFYEQNS 609
Query: 440 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
R Q+T+W E ++L DY NK +GL+ +YY PR ++
Sbjct: 610 RYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFL 646
>gi|156121099|ref|NP_001095696.1| alpha-N-acetylglucosaminidase precursor [Bos taurus]
gi|151554244|gb|AAI48148.1| NAGLU protein [Bos taurus]
gi|296476361|tpg|DAA18476.1| TPA: alpha-N-acetylglucosaminidase [Bos taurus]
Length = 667
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 145/346 (41%), Positives = 205/346 (59%), Gaps = 35/346 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP AL VFP +TQ+GNW + C++LL DPLF +G F+ +
Sbjct: 241 VLPAFAGHVPKALTRVFPQVNVTQMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRE 298
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY DTF+E PP P Y+++ AA+Y M + D DAVWL+QGWLF +
Sbjct: 299 LTKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATAAVYQAMTAVDPDAVWLLQGWLFQHQ 357
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW P Q+ A+L +VP G+L+VLDLFAE +P++ + F G P+IWCMLHNF GN ++
Sbjct: 358 PEFWGPAQVAAVLGAVPRGRLLVLDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLF 417
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQ 245
G L+S+ GP AR N+TMVG GM+ EGI QN VVY LM+E+ +Q + V D+ AW+
Sbjct: 418 GALESVNQGPTTARHFPNSTMVGTGMAPEGIGQNEVVYALMAELGWQKDPVADLGAWVTS 477
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
++ RRYG S + AW +L +VYNC S E + N+
Sbjct: 478 FAARRYGVSHGDAEAAWRLLLRSVYNC-----------------------SGEECRGHNH 514
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS 351
P+ + L+ T+ +WY+ S+V A L + + + L++S
Sbjct: 515 -SPLVRRPSLQMVTT------VWYNRSDVFEAWRLLLTATSTLASS 553
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 60/115 (52%), Gaps = 5/115 (4%)
Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
A A +E + YE N+R Q+T+W E ++L DY NK +GL+ DYY PR ++
Sbjct: 551 ASSPAVSETEAHFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRLFT 605
Query: 482 KYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ ++ESL G F+ + R +L + G YP + GD + + L+ KY
Sbjct: 606 ETLVESLVQGVPFQQHQFDRNAFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKY 660
>gi|392588150|gb|EIW77482.1| glycoside hydrolase family 89 protein [Coniophora puteana
RWD-64-598 SS2]
Length = 761
Score = 272 bits (696), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 162/499 (32%), Positives = 262/499 (52%), Gaps = 45/499 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP A+ N++P+A I W P++ L+ DPLF ++ ++FI +
Sbjct: 244 VLPAFTGFVPPAMHNLYPNASIVNGSAWNDFA--PQFTNDSFLEPFDPLFAQVQQSFISK 301
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
Q +G SHIY D ++EN P P Y++++ AA +S +++ D DA WLMQGWLF S
Sbjct: 302 QQAAFGNVSHIYTLDQYNENDPYSGDPSYLTNISAATFSSLRAADPDATWLMQGWLFFSS 361
Query: 127 DPFWRPPQMKALLNSVPL----GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
FW P +++A L VP +++LDL++E +P W ++G +IWC LH++ GN
Sbjct: 362 ADFWTPERVEAYLAGVPGDDDGSGMLILDLYSEAQPQWQRLSSYFGKRWIWCELHDYGGN 421
Query: 183 IEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
+ G ++ P+ A S N +MVGVG++ EG+E N ++YD++ + A+ ++ +
Sbjct: 422 MGFEGNFANVTEAPLAALASPNVSMVGVGLTPEGMEGNEIIYDVLLDQAWSSSPINKTEY 481
Query: 243 INQYSVRRY-GRSVPAIQ-DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
++ RRY +P +AW L TVY+ TD + I+ +++VT
Sbjct: 482 AQAWATRRYPADELPECAIEAWQTLAATVYSNTDPGSQATVKSILELEPALSGLVNVTG- 540
Query: 301 KYQNYGKPVSKEAVLKSETSSYDHP-HLWYST-SEVIRALELFIASGN---ELSASNTYR 355
HP H++Y T + ++ AL+ + +G+ L A YR
Sbjct: 541 ----------------------HHPTHVFYDTNTTIVPALQQLVQAGHSTPSLLAIPEYR 578
Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQLN--DAHGVFQLSRRFLELVEDMDGLLACHDGF 413
YDL+DLTRQ L +L+ +++ Y + V + LELV D+D +L ++ F
Sbjct: 579 YDLVDLTRQLLVNRFIDLYADLLAVYNTTSASSASVSAAGQPMLELVADLDKVLMTNENF 638
Query: 414 LLGPWLESAKQLAQNEEQEKQY-EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
L W ++A+ A Y E+NAR QIT+W + + DY +K W GL+ DY
Sbjct: 639 QLSRWTDAARSWANGNASYAAYLEYNARNQITLWGPKGE-----INDYASKQWGGLVGDY 693
Query: 473 YGPRAAIYFKYMIESLESG 491
YG R A++ +Y+ S +G
Sbjct: 694 YGKRWAMFIQYLEGSKSNG 712
>gi|160883168|ref|ZP_02064171.1| hypothetical protein BACOVA_01137 [Bacteroides ovatus ATCC 8483]
gi|156111393|gb|EDO13138.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
8483]
Length = 737
Score = 270 bits (691), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 160/530 (30%), Positives = 254/530 (47%), Gaps = 39/530 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF+G+VP+ L+ ++P AKI+++ +W + R ++ LD DPLF I + F+E+
Sbjct: 233 ILPAFAGHVPSELKRIYPEAKISRMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G T HIY D F+E PP PE++++ IY M D DA WL WLF D
Sbjct: 290 QTKLFG-TDHIYGADPFNEVAPPSWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYID 348
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W +++A L +VP KL++LD + E +W + +++G PY+WC L NF GN +
Sbjct: 349 RHLWTNERVEAFLKAVPQNKLLLLDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLA 408
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + T+ G+G ++EG + NP +Y+ + A+ D WI Q
Sbjct: 409 GNTKEVGKRIENVYTNGGENFSGLGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQL 467
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ RR G ++ AW +LY ++Y P+ + G
Sbjct: 468 ADRRIGLRNQQMRRAWKLLYDSIYTA-------------------PAALG--------QG 500
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
++ LK + P + YS + E+ + +G + TY YD++++ RQ L
Sbjct: 501 TLMNARPCLKGNGNWTTTPTVAYSNETLFEVWEMLLKAGEHRHS--TYEYDVVNIGRQVL 558
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
Y +L E Y + Q +L+ D++ LL+ FLLG W+E A+ L
Sbjct: 559 GNYFGKLRDEFAETYSRKQLPLLKQKGAEMKQLLRDVNTLLSTQSSFLLGKWIEDARSLG 618
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
+E + YE NART ++ W D Q L DY N+ W GL+ YY PR ++ +I
Sbjct: 619 IDEASKNYYEENARTIVSTWGDKDQS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIR 674
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
S+ + F + + + DW YP E G+A+ + L NKY
Sbjct: 675 SVSNKQPFNADAFHQRVTQFEIDWVKSHERYPSEPVGNAVEIATLLMNKY 724
>gi|423292430|ref|ZP_17271008.1| hypothetical protein HMPREF1069_06051 [Bacteroides ovatus
CL02T12C04]
gi|423294620|ref|ZP_17272747.1| hypothetical protein HMPREF1070_01412 [Bacteroides ovatus
CL03T12C18]
gi|392661665|gb|EIY55241.1| hypothetical protein HMPREF1069_06051 [Bacteroides ovatus
CL02T12C04]
gi|392675811|gb|EIY69252.1| hypothetical protein HMPREF1070_01412 [Bacteroides ovatus
CL03T12C18]
Length = 727
Score = 270 bits (691), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 160/530 (30%), Positives = 254/530 (47%), Gaps = 39/530 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF+G+VP+ L+ ++P AKI+++ +W + R ++ LD DPLF I + F+E+
Sbjct: 223 ILPAFAGHVPSELKRIYPEAKISRMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEE 279
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G T HIY D F+E PP PE++++ IY M D DA WL WLF D
Sbjct: 280 QTKLFG-TDHIYGADPFNEVAPPSWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYID 338
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W +++A L +VP KL++LD + E +W + +++G PY+WC L NF GN +
Sbjct: 339 RHLWTNERVEAFLKAVPQNKLLLLDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLA 398
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + T+ G+G ++EG + NP +Y+ + A+ D WI Q
Sbjct: 399 GNTKEVGKRIENVYTNGGENFSGLGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQL 457
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ RR G ++ AW +LY ++Y P+ + G
Sbjct: 458 ADRRIGLRNQQMRRAWKLLYDSIYTA-------------------PAALG--------QG 490
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
++ LK + P + YS + E+ + +G + TY YD++++ RQ L
Sbjct: 491 TLMNARPCLKGNGNWTTTPTVAYSNETLFEVWEMLLKAGEHRHS--TYEYDVVNIGRQVL 548
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
Y +L E Y + Q +L+ D++ LL+ FLLG W+E A+ L
Sbjct: 549 GNYFGKLRDEFAETYSRKQLPLLKQKGAEMKQLLRDVNTLLSTQSSFLLGKWIEDARSLG 608
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
+E + YE NART ++ W D Q L DY N+ W GL+ YY PR ++ +I
Sbjct: 609 IDEASKNYYEENARTIVSTWGDKDQS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIR 664
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
S+ + F + + + DW YP E G+A+ + L NKY
Sbjct: 665 SVSNKQPFNADAFHQRVTQFEIDWVKSHERYPSEPVGNAVEIATLLMNKY 714
>gi|295086519|emb|CBK68042.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
XB1A]
Length = 727
Score = 270 bits (690), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 160/530 (30%), Positives = 254/530 (47%), Gaps = 39/530 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF+G+VP+ L+ ++P AKI+++ +W + R ++ LD DPLF I + F+E+
Sbjct: 223 ILPAFAGHVPSELKRIYPEAKISRMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEE 279
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G T HIY D F+E PP PE++++ IY M D DA WL WLF D
Sbjct: 280 QTKLFG-TDHIYGADPFNEVAPPSWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYID 338
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W +++A L +VP KL++LD + E +W + +++G PY+WC L NF GN +
Sbjct: 339 RHLWTNERVEAFLKAVPQDKLLLLDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLA 398
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + T+ G+G ++EG + NP +Y+ + A+ D WI Q
Sbjct: 399 GNTKEVGKRIENVYTNGGENFSGLGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQL 457
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ RR G ++ AW +LY ++Y P+ + G
Sbjct: 458 ADRRIGLRNQQMRRAWKLLYDSIYTV-------------------PAALG--------QG 490
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
++ LK + P + YS + E+ + +G ++ Y YD++++ RQ L
Sbjct: 491 ALMNARPCLKGNGNWTTTPTVAYSNETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVL 548
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
Y +L EAY + Q +L+ D+D LL+ FLLG W+E A+ L
Sbjct: 549 GNYFGKLRDEFAEAYSRKQLPLLKQKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLG 608
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
+E + YE NART ++ W D Q L DY N+ W GL+ YY PR ++ +I
Sbjct: 609 TDEVSKNYYEENARTIVSTWGDKDQS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIR 664
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
S+ + F + + + DW YP E G+ + + L NKY
Sbjct: 665 SVSNKQPFNADAFHQRVTQFEIDWVKSHERYPSEPVGNVVEIATLLMNKY 714
>gi|62088640|dbj|BAD92767.1| huntingtin interacting protein-1-related [Homo sapiens]
Length = 449
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 155/418 (37%), Positives = 232/418 (55%), Gaps = 50/418 (11%)
Query: 112 DSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVP 170
D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+VLDLFAE +P+++ + F G P
Sbjct: 18 DTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQP 77
Query: 171 YIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEM 230
+IWCMLHNF GN ++G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+
Sbjct: 78 FIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAEL 137
Query: 231 AFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFP 288
++ + V D+ AW+ ++ RRYG S P AW +L +VYNC+ + NR +V P
Sbjct: 138 GWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRP 197
Query: 289 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 348
L+ TS +WY+ S+V A L + S L
Sbjct: 198 S-------------------------LQMNTS------IWYNRSDVFEAWRLLLTSAPSL 226
Query: 349 SASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVED 402
+ S +RYDL+DLTRQA+ + + + AY L A GV EL+
Sbjct: 227 ATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELASLLRAGGVLAY-----ELLPA 281
Query: 403 MDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGN 462
+D +LA FLLG WLE A+ A +E + YE N+R Q+T+W E ++L DY N
Sbjct: 282 LDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYAN 336
Query: 463 KYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE 520
K +GL+ +YY PR ++ + +++S+ G F+ + + +L + + YP +
Sbjct: 337 KQLAGLVANYYTPRWRLFLEALVDSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQ 394
>gi|298385999|ref|ZP_06995556.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
1_1_14]
gi|298261227|gb|EFI04094.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
1_1_14]
Length = 715
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 165/533 (30%), Positives = 271/533 (50%), Gaps = 54/533 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V F G VP+ L+ +P A++ + G W S++ P +LD DPLF + + + +
Sbjct: 231 VFQGFFGMVPSLLKEKYPEARLVEQGLWNSLQRPP------VLDPADPLFERMAKVWYAE 284
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
K YG+ + ++ D F E T +D ++ + + M+ + DA W++Q WL +
Sbjct: 285 YEKLYGK-ADLFGGDLFHEGGKTGGID----VTDAARRVQTAMKRYNPDATWVIQAWLGN 339
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
P+ K LL + +++DL AE W K F G P++W + N+ GNI +
Sbjct: 340 -------PK-KELLAGLDRKNTLIVDLAAEFWDNWRKRKGFDGFPWLWSHISNYGGNIGL 391
Query: 186 YGILDSIAFGPVEAR--TSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 243
+G LD+IA GPV+ + ++ + +M G + EGIE NPVV+DL++EM ++ E +D+ W+
Sbjct: 392 HGRLDAIATGPVDGQKDSAASPSMKGTSSTPEGIEVNPVVFDLLNEMRWRSEHLDLDVWL 451
Query: 244 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
+YSVRRYG +++AW + + T Y G + V A P + I+
Sbjct: 452 KEYSVRRYGVEDENLKEAWTIFHRTAYGTYTGHRRPSESVFCAPPSLKRDKITA------ 505
Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 363
S++ ++Y + + LF+ S + L ++TY+YD +D R
Sbjct: 506 ----------------SAWSQCRIFYDPELFAQGVGLFLQSADRLKQTSTYQYDAVDFVR 549
Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
Q LA E + N+++AY+ D S RFL+L++D + LL+ H+ F +G WL+ A+
Sbjct: 550 QYLADLGRETYYNLVDAYRAKDTKQFDYWSERFLQLIKDQNELLSTHERFFVGRWLDMAR 609
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
++ E + YE NAR I W E S +RDY +K W GLL+DYY PR Y Y
Sbjct: 610 LKSKQPELQDLYEHNARMLIGTW----TETLSPVRDYAHKEWGGLLKDYYLPRWTNYIAY 665
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ +LE G + D + W N N Y +E+ D + T++ +Y+KY
Sbjct: 666 LKGTLE-GRSLTVPD----SFQAEKAWVNAHNKYVLEAGVDPVQTAKRMYSKY 713
>gi|383114162|ref|ZP_09934927.1| hypothetical protein BSGG_1664 [Bacteroides sp. D2]
gi|382948607|gb|EFS30964.2| hypothetical protein BSGG_1664 [Bacteroides sp. D2]
Length = 727
Score = 267 bits (682), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 161/530 (30%), Positives = 256/530 (48%), Gaps = 39/530 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF+G+VP+ L+ ++P AKI+++ +W + R ++ LD DPLF I + F+E+
Sbjct: 223 ILPAFAGHVPSELKRIYPEAKISRMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEE 279
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G T HIY D F+E PP PE++++ IY M D DA WL WLF D
Sbjct: 280 QTKLFG-TDHIYGADPFNEVAPPSWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYID 338
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W +++A L +VP KL++LD + E +W + +++G PY+WC L NF GN +
Sbjct: 339 RHLWTNERVEAFLKAVPQDKLLLLDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLA 398
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + T+ G+G ++EG + NP +Y+ + A+ D WI Q
Sbjct: 399 GNTKEVGKRIENVYTNGGENFSGLGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQL 457
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ RR G ++ AW +LY ++Y ++ +G N
Sbjct: 458 ADRRIGLRNQQMRRAWKLLYDSIYTAP---------------------AALGQGTLMN-A 495
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
+P K + TS+ + YS + E+ + +G ++ Y YD++++ RQ L
Sbjct: 496 RPCLKGNGNWTTTST-----VAYSNETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVL 548
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
Y +L EAY + Q +L+ D+D LL+ FLLG W+E A+ L
Sbjct: 549 GNYFGKLRDEFAEAYSRKQLPLLKQKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLG 608
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
+ + YE NART ++ W D Q L DY N+ W GL+ YY PR ++ +I
Sbjct: 609 TDGASKNYYEENARTIVSTWGDKDQS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIR 664
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
S+ + F + + + DW YP E G+A+ + L NKY
Sbjct: 665 SVSNKQPFNADAFHQRVTQFEIDWVKSHERYPSEPVGNAVEIATLLMNKY 714
>gi|237719130|ref|ZP_04549611.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|229451509|gb|EEO57300.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
Length = 737
Score = 267 bits (682), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 161/530 (30%), Positives = 256/530 (48%), Gaps = 39/530 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF+G+VP+ L+ ++P AKI+++ +W + R ++ LD DPLF I + F+E+
Sbjct: 233 ILPAFAGHVPSELKRIYPEAKISRMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G T HIY D F+E PP PE++++ IY M D DA WL WLF D
Sbjct: 290 QTKLFG-TDHIYGADPFNEVAPPSWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYID 348
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W +++A L +VP KL++LD + E +W + +++G PY+WC L NF GN +
Sbjct: 349 RHLWTNERVEAFLKAVPQDKLLLLDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLA 408
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + T+ G+G ++EG + NP +Y+ + A+ D WI Q
Sbjct: 409 GNTKEVGKRIENVYTNGGENFSGLGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQL 467
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ RR G ++ AW +LY ++Y ++ +G N
Sbjct: 468 ADRRIGLRNQQMRRAWKLLYDSIYTAP---------------------AALGQGTLMN-A 505
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
+P K + TS+ + YS + E+ + +G ++ Y YD++++ RQ L
Sbjct: 506 RPCLKGNGNWTTTST-----VAYSNETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVL 558
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
Y +L EAY + Q +L+ D+D LL+ FLLG W+E A+ L
Sbjct: 559 GNYFGKLRDEFAEAYSRKQLPLLKQKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLG 618
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
+E + YE NART ++ W D Q L DY N+ W GL+ YY PR ++ +I
Sbjct: 619 IDEASKNYYEENARTIVSTWGDKDQS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIR 674
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
S+ + F + + + DW YP E +A+ + L NKY
Sbjct: 675 SVSNKQPFNADAFHQRVTQFEIDWVKSHERYPSEPVSNAVEIATLLMNKY 724
>gi|281210062|gb|EFA84230.1| hypothetical protein PPL_03307 [Polysphondylium pallidum PN500]
Length = 744
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 167/529 (31%), Positives = 271/529 (51%), Gaps = 53/529 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V P F+G+VP A + +PSA I +L +W + T L TDP++ + F +
Sbjct: 251 VFPGFAGHVPVAFKQFYPSANIVELPSWHGFNA------TNHLLTTDPMYDIVADRFYQV 304
Query: 68 QLKEYGRTSHI--YNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + YG + I ++ D F+E PP +S ++++ + I++ + + D+ W++Q W +
Sbjct: 305 QNEIYGAYAKIDYFSIDPFNELIPPSNSSQFLNECSSRIFNAINRFNPDSTWVLQNWFLN 364
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
FW Q+ + L VP+G+L+VLDL++E+KP+W+ + + G +IW MLHNF G +
Sbjct: 365 -SAFWGDGQVASFLGGVPIGRLIVLDLWSELKPLWNRTANYQGHKWIWNMLHNFGGRPTI 423
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
G + IA P+EA+ S + TMVG+G++ E IEQN +VYDLMSEM ++ D+ W++
Sbjct: 424 SGRMPIIANEPLEAKAS-SPTMVGIGLTPEAIEQNVIVYDLMSEMGWRSRSFDLNLWVDA 482
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
Y RRYG ++P ++ W +L +TVY P+ P+ NY
Sbjct: 483 YVTRRYGVNLPNLKPVWKMLAYTVYFS---------------PNRSPA----------NY 517
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRA-LELFIASGNELSASNTYRYDLIDLTRQ 364
++K+ L + Y +P + ++ A EL + +S TYRYDL ++T Q
Sbjct: 518 ---IAKKPSLDFQLGLYYNPVV------IVDAWRELLAVDSTIVRSSETYRYDLAEITLQ 568
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLEL-VEDMDGLLACHDGFLLGPWLESAK 423
AL+ Y N + ++Y +D FQ +R+ + MD + LG W A+
Sbjct: 569 ALSNYFNGNLKQLYQSYYASDFQ-TFQSARQNCSFALRAMDAVADTVQLLKLGKWTADAR 627
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
+ A + + + YE+NAR QIT+W DY NK+WSGL+ DYY PR I+F++
Sbjct: 628 KWATDNNERELYEYNARNQITLWGWKDMGNP----DYANKWWSGLIADYYFPRWQIFFEH 683
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYP--VESNGDALITSQ 530
+ ++ F + W N+YP V SN D S+
Sbjct: 684 LEHAIFDKSKFNEHSLAVNTMLHEERWNKQTNIYPSDVNSNDDVHTVSK 732
>gi|380692804|ref|ZP_09857663.1| putative alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
Length = 709
Score = 265 bits (677), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 163/533 (30%), Positives = 267/533 (50%), Gaps = 54/533 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V F G VP++L+ +P A + + G W S++ P +LD DPLF ++ + + +
Sbjct: 225 VFQGFFGMVPSSLKEKYPEAHLVEQGLWNSLQRPP------VLDPADPLFEQMAKVWYTE 278
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
K YG+ + ++ D F E T +D ++ + + M+ + DA W++Q WL +
Sbjct: 279 YEKLYGK-ADLFGGDLFHEGGKTGGID----VTDAARRVQTAMKQYNPDATWVIQAWLGN 333
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
P+ K LL + +++DL AE W K F G P++W + N+ NI +
Sbjct: 334 -------PK-KELLAGLDRKHTLIVDLAAEFWDNWRKRKGFDGFPWLWSHISNYGANIGL 385
Query: 186 YGILDSIAFGPVEART--SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 243
+G LD+IA GP++ R + +M G + EGIE NPVV+DL++EM ++ E +D+ W+
Sbjct: 386 HGRLDAIATGPIDGRKDPEASPSMKGTSSTPEGIEVNPVVFDLLNEMRWRSEYLDIDTWL 445
Query: 244 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
+YSVRRYG ++ AW + + T Y G + V A P + I+
Sbjct: 446 KEYSVRRYGAEDENLKKAWIIFHRTAYGTYSGHRRPSESVFCAPPSLKRDKITA------ 499
Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 363
S++ ++Y + + LF+ S + L ++TY+YD +D R
Sbjct: 500 ----------------SAWSQCRIFYDPDLFAQGVGLFLQSADHLKQTSTYQYDAVDFVR 543
Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
Q LA E + N+++AY+ D S RFL+L++D + LL+ H F +G WL+ A+
Sbjct: 544 QYLADLGREAYYNLVDAYRAKDTKQFDYWSERFLQLIKDQNELLSTHKCFFVGRWLDMAR 603
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
++ E + YE NAR I W E S +RDY +K W GLL+DYY PR Y Y
Sbjct: 604 SKSKQPELQDLYEHNARMLIGTW----TETLSPVRDYAHKEWGGLLKDYYLPRWTNYIAY 659
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ +LE G + + ++ W N N Y +E+ D + T++ +Y KY
Sbjct: 660 LKGTLE-GRSLTVPN----SFQVEKAWVNAHNKYVLETGVDPVETAKRMYRKY 707
>gi|313203962|ref|YP_004042619.1| alpha-N-acetylglucosaminidase [Paludibacter propionicigenes WB4]
gi|312443278|gb|ADQ79634.1| Alpha-N-acetylglucosaminidase [Paludibacter propionicigenes WB4]
Length = 738
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 177/536 (33%), Positives = 265/536 (49%), Gaps = 40/536 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQL-GNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
+L AF+G+VP L+ + PSAKIT++ W + ++ TY LD TD LF EI + F+
Sbjct: 233 ILSAFAGHVPEQLKTLRPSAKITRIEPGWGGMAAE---YTTYFLDPTDNLFGEIQKRFLT 289
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
Q K YG T H+Y+ D F+E TPP P+Y++++G IY M D +A+W W F
Sbjct: 290 VQQKLYG-TDHLYSADPFNEITPPSWEPDYLANVGKTIYETMSQVDKEAIWYQMSWTFYN 348
Query: 127 DP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
DP W P++ A++++VP GKL LD E + + S FYG P+IWC L NF N +
Sbjct: 349 DPTHWTRPRLSAMIHAVPQGKLFFLDYNCEEEEFFRKSDNFYGAPFIWCYLGNFGANTHL 408
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQ-HEKVDVKAWIN 244
L+ + + + + VGVG ++EGI NP +Y+ + EM ++ E V I
Sbjct: 409 VAPLNKVV--NRLGKLTYGSACVGVGSTLEGINVNPEIYETVLEMPWRADETVTADTLIR 466
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
Y+ RR G A+ +AW +L V D A +V V P + +T +
Sbjct: 467 HYAERRAGARDKAVIEAWQLLRQHV--LVDTAVGIWNHCVVF--QVSP-VTDLTRAFWAT 521
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRAL-ELFIASGNELSASNTYRYDLIDLTR 363
+P + Y ++ AL +F AS N ++ YR+D+++LTR
Sbjct: 522 -------------------NPKIPYRNVDLAIALNRMFQASANS-KKTDAYRFDVVNLTR 561
Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
QAL Y L+ ++EAY + + S FL+L +++DGLLA FLLG WL A+
Sbjct: 562 QALGNYGTVLYHKMMEAYSRKNLIDFRKYSGEFLQLGQEIDGLLATRHEFLLGKWLADAR 621
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
++ YE NAR IT W + L DY N+ W+GLLR YY PR +
Sbjct: 622 SWGTTPAEKAYYERNAREIITTW----HKAGGGLTDYSNRQWNGLLRSYYLPRWKEFINR 677
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNG-RNVYPVESNGDALITSQWLYNKYLQ 538
+ SL +G + K + W + + Y GDA+ + L+ KY Q
Sbjct: 678 LDTSLSTGKDYDDKAFAAWCSAFEQHWVDSPSSAYSDTETGDAVKMAFELFGKYKQ 733
>gi|449541595|gb|EMD32578.1| glycoside hydrolase family 89 protein [Ceriporiopsis subvermispora
B]
Length = 752
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 156/514 (30%), Positives = 272/514 (52%), Gaps = 38/514 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP A+ V +A I W + P L+ DPLF + ++FI +
Sbjct: 234 VLPAFTGFVPRAMSTVHSNASIVNGSQW-APGFPPSLTNVSFLEPFDPLFATLQKSFIAK 292
Query: 68 QLKEYG-RTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-S 125
Q + YG SHIY D ++EN P + Y+SS+ ++ +++ D DAVW++QGWLF S
Sbjct: 293 QQEAYGANISHIYTLDQYNENNPFSGNLSYLSSISEGTFTSLRAADPDAVWMLQGWLFFS 352
Query: 126 YDPFWRPPQMKALLNSVPLG-KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
+ FW +++A L VP ++VLDL++E +P W+ + ++G ++WC LH++ G I
Sbjct: 353 SEAFWTNERIEAYLGGVPTNDSMIVLDLYSEAQPQWNRTSSYFGKQWVWCELHDYGGTIG 412
Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
+ G LD+I GP+ A S ++M G+G++MEG E N +VYDL+ + A+ +++ +++
Sbjct: 413 LEGNLDAITTGPIAALNSPGSSMKGMGLTMEGQEGNEIVYDLLLDQAWSSSPINIASYVK 472
Query: 245 QYSVRRY-GRSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
+ RRY +P A Q+AW +L TVYN D ++ I +++P + +
Sbjct: 473 GWVSRRYLVEPLPSAAQEAWRILSTTVYNNQD---PNSQSTIKNIYELEPVLTGL----- 524
Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI---ASGNELSASNTYRYDLI 359
V++ +L + YD + S ++ AL+L + A LS + +D++
Sbjct: 525 ------VNRTGILPT-VIPYD------TNSTIVPALQLLVKAKAQNAALSTVPEFVHDVV 571
Query: 360 DLTRQALAKYANELFLNIIEAYQLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
D++RQ L+ + + +I+ Y + + V + + + ++ +D LLA ++ FLL
Sbjct: 572 DVSRQLLSNRFIDAYTALIDTYNNTNVTSDAVIRAGQPLMTILSQLDALLATNENFLLSS 631
Query: 418 WLESAKQLAQNEEQEKQY-EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
W+ A+ L+ +E Y E+NAR QIT+W + + + DY +K W+GL+ YY R
Sbjct: 632 WIAQARNLSHGDESYAAYLEYNARNQITLWGPDGE-----INDYASKAWAGLISTYYAAR 686
Query: 477 AAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 510
+ Y+ + F + + I L +W
Sbjct: 687 WQTFIDYLASTKRLARPFDTSAFSNQMILLGQEW 720
>gi|119480815|ref|XP_001260436.1| alpha-N-acetylglucosaminidase, putative [Neosartorya fischeri NRRL
181]
gi|119408590|gb|EAW18539.1| alpha-N-acetylglucosaminidase, putative [Neosartorya fischeri NRRL
181]
Length = 748
Score = 262 bits (669), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 153/489 (31%), Positives = 261/489 (53%), Gaps = 45/489 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP A+ V P+A + W D R+ L+ DP F + R+FI++
Sbjct: 233 VLPAFTGFVPRAISRVLPNATVVNGSRWGGF--DERYTNDTFLEPFDPSFTRLQRSFIQK 290
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
Q + YG +H+Y D ++EN P +Y+ ++ + ++S D +AVWLMQGWLF S
Sbjct: 291 QQQAYGNITHVYTLDQYNENDPYSGDLDYLRNVTRNTWLSLKSADPNAVWLMQGWLFYSN 350
Query: 127 DPFWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
FW ++KA L+ V + + ++VLDLF+E +P W ++ +YG P+IWC LH++ GN+ +
Sbjct: 351 SDFWTDERVKAYLSGVEVDQDMLVLDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGL 410
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG + ++ +A + + ++VG G++MEG E N ++YDL+ + A+ + +D + +
Sbjct: 411 YGQVMNVTVNATQALAASD-SLVGFGLTMEGQEGNEIMYDLLLDQAWSRQPIDTDHYFHN 469
Query: 246 YSVRRY-----GRSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
+ RY G +VP + AW++L TVYN T+ + I ++ PSI
Sbjct: 470 WVKTRYSSGVRGSAVPEELHQAWDILRTTVYNNTNLTSTAVSKSIF---ELQPSI----S 522
Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHP-HLWYSTSEVIRALELFIASGNE---LSASNTYR 355
G G DHP + Y + +++A +L ++ ++ L + +
Sbjct: 523 GLLNRTG----------------DHPTTVNYDPAALVQAWQLMDSAASKDRSLWSQPAFL 566
Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
YD++D+TRQ +A +++N++ YQ V ++L+ D+D +L+ +D F L
Sbjct: 567 YDMVDITRQVMANAFIPMYINLVSTYQA--GASVSTDGSNLIQLLRDVDSVLSTNDNFRL 624
Query: 416 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
W+ SA+ A+N+ + YE+NAR QI +W + DY +K W GL+ YY P
Sbjct: 625 STWIRSARSWARNDTEADFYEYNARNQIALW-----GPMGEINDYASKQWGGLVSAYYIP 679
Query: 476 RAAIYFKYM 484
R + Y+
Sbjct: 680 RWQTFLHYL 688
>gi|291515668|emb|CBK64878.1| Alpha-N-acetylglucosaminidase (NAGLU) [Alistipes shahii WAL 8301]
Length = 713
Score = 261 bits (668), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 168/530 (31%), Positives = 260/530 (49%), Gaps = 41/530 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP L+ + P A+IT++ W D R+ C++L D DPLF I R F+ +
Sbjct: 216 VLPAFAGHVPQELKRLHPDARITRVSYWGGF--DDRYRCSFL-DPMDPLFAVIQREFLTE 272
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + +G T HIY D F+E P PE ++ + IY M D +AVWL GWLF D
Sbjct: 273 QTRLFG-TGHIYGADPFNEIDAPTWDPETLAGMSRHIYESMAEVDPEAVWLQMGWLFYAD 331
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P W ++A L +VP +L++LD F E IW +++F+G PY+WC L NF GN +
Sbjct: 332 PTHWTAENIRAFLGAVPQDRLLMLDYFCEFTEIWKQTEKFHGQPYLWCYLGNFGGNTMLS 391
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G +++ +A + GVG ++EG N +Y+ + + A+ D WI +
Sbjct: 392 GNFHTVSARMEDAFAHGGDNLRGVGSTLEGFGVNQFMYEFVLDKAWNTGIAD-DEWIARL 450
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ RR G PA + W L +VY T A + A P + EG +
Sbjct: 451 ADRRTGFRDPAARTGWRTLCDSVY--TLPAQTGQSPLTNAHPAL--------EGNWHWTT 500
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
KP T+ Y P LW R E +A +E +TYR+D++++ RQ L
Sbjct: 501 KP----------TTGYRFPTLW-------RVWEELLAVDSE---RDTYRFDVVNIGRQVL 540
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
Y AY +D + +RR L+ D++ L ACH F L W+ +A+
Sbjct: 541 GDYFLIERDRFAAAYAQHDRKAMDAAARRMTGLLADINLLTACHPEFSLERWIAAARGFG 600
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
+ + YE NAR I++W D+ L DY ++ WSG++ YY PR ++ + ++E
Sbjct: 601 SDNASKDYYETNARMLISVWGDSYH-----LTDYASRTWSGMISTYYAPRWRLFIERVME 655
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ +G F + + RE W + + GDA+ T++ L +KY
Sbjct: 656 AARTGRMFDHEAFDREIRDFECRWADASHPLTFPEAGDAVRTARELASKY 705
>gi|423287380|ref|ZP_17266231.1| hypothetical protein HMPREF1069_01274 [Bacteroides ovatus
CL02T12C04]
gi|392672495|gb|EIY65962.1| hypothetical protein HMPREF1069_01274 [Bacteroides ovatus
CL02T12C04]
Length = 726
Score = 261 bits (667), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 160/536 (29%), Positives = 263/536 (49%), Gaps = 39/536 (7%)
Query: 2 LLADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIG 61
LL VLPAFSG+VPA L+ ++P A ITQ+ W R ++ +D DPLF +I
Sbjct: 223 LLGMTPVLPAFSGHVPAELKRLYPDAAITQMSQWGGYDKKYR---SHFIDPMDPLFGKIQ 279
Query: 62 RAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ ++E+Q K YG T HIY D F+E P +++ ++ I+ ++ DS A W+
Sbjct: 280 KRYLEKQTKLYG-TDHIYGIDPFNEVDSPNWDEDFLRTVSDKIFHSIEQVDSLAHWIQMT 338
Query: 122 WLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFA 180
W+F + W P++KA LNSVP KL++LD + + IW ++Q+YG PYIWC L NF
Sbjct: 339 WMFYHSKDKWSQPRIKAFLNSVPDDKLILLDYYCDSVEIWRETQQYYGKPYIWCYLGNFG 398
Query: 181 GNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVK 240
GN + G +D ++ + GVG ++EG++ NP +Y+ + E A+ H +
Sbjct: 399 GNSMLAGHVDDVSAKLNRLFVEGGKNISGVGATLEGLDVNPFMYEFVLEKAWSHTITNAD 458
Query: 241 AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
W+ +++ R G I DAW LY +Y I T G
Sbjct: 459 -WMKNWALCRGGSKSSHIIDAWQQLYKKIY-----------------------IHHATAG 494
Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
+ ++ +L+ S HP ++Y E+ F+ + N S+ Y++D+I+
Sbjct: 495 Q----AVLMNARPMLEGTDSWNTHPDIYYDNKELWHIWGKFLEAKN--VDSSGYKFDVIN 548
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
+ RQ L ++ + Y+ + G+ + + + L D+D LL+C F +G W++
Sbjct: 549 IGRQVLGNLFSDFRDSFTACYRQKNIEGMKEWAEKMNTLFTDVDRLLSCESSFSIGKWIK 608
Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
A+ +N ++++ YE NAR +T W ++A+ L DY N+ W GL YY R ++
Sbjct: 609 DARDWGKNLKEKEYYEQNARCILTTW----GQKATQLNDYANRGWGGLTDSYYRKRWELF 664
Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+Y I+ + G K + + W NVY S D + + LY KY
Sbjct: 665 TQYAIDEMSHGKEIDEKSFYNLITEFEYQWTLQTNVYSESSGEDPIRIANLLYIKY 720
>gi|357622373|gb|EHJ73879.1| putative alpha-N-acetyl glucosaminidase [Danaus plexippus]
Length = 780
Score = 261 bits (667), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 161/502 (32%), Positives = 255/502 (50%), Gaps = 46/502 (9%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L V PAF+G+VP A + +FP+ + W K D +CC +D +P F I +
Sbjct: 238 LGMIPVFPAFNGHVPKAFEKIFPNTTFHPVETW--NKFDEDYCCNLFVDPREPDFKMISK 295
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F+ + G +SHIY D F+E S + AI+S + D DAVWL+Q W
Sbjct: 296 MFMREITAGLG-SSHIYTADPFNEIKIQPWSTSLVVETAKAIFSSISEYDKDAVWLVQNW 354
Query: 123 LFSYDPF-WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
+F ++P W ++ + L SVP G+++VLDL +E P + + +YG P+IW MLHNF G
Sbjct: 355 MFVHNPLLWPLKRVNSFLTSVPNGRMLVLDLQSEQWPQYDLYQMYYGQPFIWSMLHNFGG 414
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVK 240
+ M+G +I E R EN+TMVG+G++ EGI QN V+YDLM E A++ V D++
Sbjct: 415 TLGMFGNTKTINKDVYEVRKRENSTMVGIGLTPEGINQNYVIYDLMLESAWRKGPVPDLE 474
Query: 241 AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
W++ Y+ RRYG + +I W L +VYN T ++ G
Sbjct: 475 EWVSDYAERRYGCNATSI--GWKYLLRSVYNFTG--------------------LNRIRG 512
Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
KY V+ S P WY ++ AL+ F+ N +++ + +DL+D
Sbjct: 513 KY-----------VMTRRPSFNIRPWAWYKGHDLFEALKNFVYVQNPACSTSGFLHDLVD 561
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLS-RRFLELVEDMDGLLACHDGFLLGPWL 419
+TRQAL ++++N+ N + VF + F++ + DM +LA F + WL
Sbjct: 562 VTRQALQYKIEQIYMNLQNDRYSN--YMVFNYTISSFIDAMTDMQNILATSSDFKITSWL 619
Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
SA+ ++ + Y++NAR QIT+W N + + DY K W+ L + YY PR +I
Sbjct: 620 SSARAISNLPLESSLYDFNARNQITLWGPNGE-----ISDYACKQWAELFKYYYIPRWSI 674
Query: 480 YFKYMIESLESGDGFRLKDWRR 501
+ +++ + F K +R
Sbjct: 675 FLSMALDAKTRNEPFDEKGAQR 696
>gi|423213214|ref|ZP_17199743.1| hypothetical protein HMPREF1074_01275 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693674|gb|EIY86904.1| hypothetical protein HMPREF1074_01275 [Bacteroides xylanisolvens
CL03T12C04]
Length = 726
Score = 261 bits (667), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 160/536 (29%), Positives = 263/536 (49%), Gaps = 39/536 (7%)
Query: 2 LLADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIG 61
LL VLPAFSG+VPA L+ ++P A ITQ+ W R ++ +D DPLF +I
Sbjct: 223 LLGMTPVLPAFSGHVPAELKRLYPDAAITQMSQWGGYDEKYR---SHFIDPMDPLFGKIQ 279
Query: 62 RAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ ++E+Q K YG T HIY D F+E P +++ ++ I+ ++ DS A W+
Sbjct: 280 KRYLEKQTKLYG-TDHIYGIDPFNEVDSPNWDEDFLRTVSDKIFHSIEQVDSLAHWIQMT 338
Query: 122 WLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFA 180
W+F + W P++KA LNSVP KL++LD + + IW ++Q+YG PYIWC L NF
Sbjct: 339 WMFYHSKDKWSQPRIKAFLNSVPDDKLILLDYYCDSVEIWRETQQYYGKPYIWCYLGNFG 398
Query: 181 GNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVK 240
GN + G +D ++ + GVG ++EG++ NP +Y+ + E A+ H +
Sbjct: 399 GNSMLAGHVDDVSAKLNRLFVEGGKNISGVGATLEGLDVNPFMYEFVLEKAWSHTITNAD 458
Query: 241 AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
W+ +++ R G I DAW LY +Y I T G
Sbjct: 459 -WMKNWALCRGGSKSSHIIDAWQQLYKKIY-----------------------IHHATAG 494
Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
+ ++ +L+ S HP ++Y E+ F+ + N S+ Y++D+I+
Sbjct: 495 Q----AVLMNARPMLEGTDSWNTHPDIYYDNKELWHIWGKFLEAKN--VDSSGYKFDVIN 548
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
+ RQ L ++ + Y+ + G+ + + + L D+D LL+C F +G W++
Sbjct: 549 IGRQVLGNLFSDFRDSFTACYRQKNIEGMKEWAEKMNTLFTDVDRLLSCESSFSIGKWIK 608
Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
A+ +N ++++ YE NAR +T W ++A+ L DY N+ W GL YY R ++
Sbjct: 609 DARDWGKNLKEKEYYEQNARCILTTW----GQKATQLNDYANRGWGGLTDSYYRKRWELF 664
Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+Y I+ + G K + + W NVY S D + + LY KY
Sbjct: 665 TQYAIDEMSHGKEIDEKSFYNLITEFEYQWTLQTNVYSESSGEDPIRIANLLYIKY 720
>gi|295087651|emb|CBK69174.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
XB1A]
Length = 703
Score = 261 bits (667), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 160/536 (29%), Positives = 263/536 (49%), Gaps = 39/536 (7%)
Query: 2 LLADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIG 61
LL VLPAFSG+VPA L+ ++P A ITQ+ W R ++ +D DPLF +I
Sbjct: 200 LLGMTPVLPAFSGHVPAELKRLYPDAAITQMSQWGGYDEKYR---SHFIDPMDPLFGKIQ 256
Query: 62 RAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ ++E+Q K YG T HIY D F+E P +++ ++ I+ ++ DS A W+
Sbjct: 257 KRYLEKQTKLYG-TDHIYGIDPFNEVDSPNWDEDFLRTVSDKIFHSIEQVDSLAHWIQMT 315
Query: 122 WLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFA 180
W+F + W P++KA LNSVP KL++LD + + IW ++Q+YG PYIWC L NF
Sbjct: 316 WMFYHSKDKWSQPRIKAFLNSVPDDKLILLDYYCDSVEIWRETQQYYGKPYIWCYLGNFG 375
Query: 181 GNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVK 240
GN + G +D ++ + GVG ++EG++ NP +Y+ + E A+ H +
Sbjct: 376 GNSMLAGHVDDVSAKLNRLFVEGGKNISGVGATLEGLDVNPFMYEFVLEKAWSHTITNAD 435
Query: 241 AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
W+ +++ R G I DAW LY +Y I T G
Sbjct: 436 -WMKNWALCRGGSKSSHIIDAWQQLYKKIY-----------------------IHHATAG 471
Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
+ ++ +L+ S HP ++Y E+ F+ + N S+ Y++D+I+
Sbjct: 472 Q----AVLMNARPMLEGTDSWNTHPDIYYDNKELWHIWGKFLEAKN--VDSSGYKFDVIN 525
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
+ RQ L ++ + Y+ + G+ + + + L D+D LL+C F +G W++
Sbjct: 526 IGRQVLGNLFSDFRDSFTACYRQKNIEGMKEWAEKMNTLFTDVDRLLSCESSFSIGKWIK 585
Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
A+ +N ++++ YE NAR +T W ++A+ L DY N+ W GL YY R ++
Sbjct: 586 DARDWGKNLKEKEYYEQNARCILTTW----GQKATQLNDYANRGWGGLTDSYYRKRWELF 641
Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+Y I+ + G K + + W NVY S D + + LY KY
Sbjct: 642 TQYAIDEMSHGKEIDEKSFYNLITEFEYQWTLQTNVYSESSGEDPIRIANLLYIKY 697
>gi|449541596|gb|EMD32579.1| glycoside hydrolase family 89 protein [Ceriporiopsis subvermispora
B]
Length = 754
Score = 261 bits (666), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 149/516 (28%), Positives = 273/516 (52%), Gaps = 43/516 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
LPAF+G VP A+ ++P+A I W + L+ DPLF + ++FI +
Sbjct: 237 ALPAFTGFVPRAMSTLYPNASIVNGSAWSGFPAS--LTNVSFLEPFDPLFSTLQKSFITK 294
Query: 68 QLKEYG-RTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-S 125
Q + YG +HIY D ++EN P + Y+SS+ A ++ +++ D DA+W++QGWLF S
Sbjct: 295 QQQAYGTNVTHIYTLDQYNENNPFSGNISYLSSVSAGTFASLRAADPDAIWMLQGWLFFS 354
Query: 126 YDPFWRPPQMKALLNSVPLG-KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
+ FW +++A L VP ++VLDL++E +P W+ + ++G ++WC LH + GN+
Sbjct: 355 SETFWTDERIQAYLGGVPTNDSMIVLDLYSEAQPQWNRTSSYFGKQWVWCELHGYGGNMG 414
Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
+ G L++I GP+ A +S+ ++M G+G++MEG E N +VYD++ + A+ +D+ +++
Sbjct: 415 LEGNLNAITAGPIAALSSQGSSMKGMGLTMEGQEGNEIVYDVLLDQAWSSAPIDIASYVK 474
Query: 245 QYSVRRYG-RSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
+ RRY +P A Q+AW +L TVYN D ++ I + +++P++ +
Sbjct: 475 SWVARRYTVEPLPSAAQEAWQILSTTVYNNQD---PNSQATIKSIYELEPTLTGLVN--- 528
Query: 303 QNYGKPVSKEAVLKSETSSYDHPHL--WYSTSEVIRALELFIASGNE---LSASNTYRYD 357
+ HP L + + + V+ AL+L + + + L+A + YD
Sbjct: 529 -----------------RTGHHPTLIPYDTNTTVVPALQLLVKAKEQNAALAAIPEFVYD 571
Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDA--HGVFQLSRRFLELVEDMDGLLACHDGFLL 415
+D++RQ L+ + + +++ Y +A V + + + ++ +D LLA ++ FLL
Sbjct: 572 AVDVSRQLLSNRFIDAYTGLVDTYNNANATSDAVVRAGQPLMVILSQLDALLATNENFLL 631
Query: 416 GPWLESAKQLAQNEEQEKQY-EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 474
W+ A+ + +E Y E+NAR Q+T+W + + + DY +K W+GL+ YY
Sbjct: 632 SSWIAQARNWSHGDESYAAYLEYNARNQVTLWGPDGE-----INDYASKAWAGLISTYYS 686
Query: 475 PRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 510
R + Y+ + F + + I L W
Sbjct: 687 SRWQTFVDYLASTKRLSRPFDSSAFSSQMILLGQQW 722
>gi|71001188|ref|XP_755275.1| alpha-N-acetylglucosaminidase [Aspergillus fumigatus Af293]
gi|66852913|gb|EAL93237.1| alpha-N-acetylglucosaminidase, putative [Aspergillus fumigatus
Af293]
gi|159129357|gb|EDP54471.1| alpha-N-acetylglucosaminidase, putative [Aspergillus fumigatus
A1163]
Length = 756
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 149/485 (30%), Positives = 259/485 (53%), Gaps = 37/485 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP A+ V P+A + W + D R+ L+ DP F+ + R+FI++
Sbjct: 241 VLPAFTGFVPRAVSRVLPNATVVNGSRW--EEFDERYTSDTFLEPFDPSFMRLQRSFIKK 298
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
Q + YG +HIY D ++EN P +Y+ ++ + ++S D +AVWLMQGWLF S
Sbjct: 299 QQQAYGNITHIYTLDQYNENAPYSGDLDYLHNVTHNTWLSLKSADPNAVWLMQGWLFYSS 358
Query: 127 DPFWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
FW ++KA L+ V + + ++VLDLF+E +P W ++ +YG P+IWC LH++ GN+ +
Sbjct: 359 SGFWTDERVKAYLSGVEVDQDMLVLDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGL 418
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG + ++ +A + + ++VG G++MEG E N ++YDL+ + A+ + +D + +
Sbjct: 419 YGQVMNVTVNATQALAASD-SLVGFGLTMEGQEGNEIMYDLLLDQAWSRQPIDTDHYFHN 477
Query: 246 YSVRRY-----GRSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
++ RY G +VP + AW++L T YN T+ + I ++ PSI +
Sbjct: 478 WAKTRYSSGVRGSAVPEELYQAWDILRITAYNNTNLTSTAVSKSIF---ELQPSISGLLN 534
Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
+ T SYD L + R ++ + L + + YD++
Sbjct: 535 RTSHH------------PTTVSYDPAAL----VQAWRLMDSAASKAPSLWSQPAFLYDMV 578
Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
D+TRQ ++ ++ N++ YQ + V ++L+ D+D +L+ +D F L W+
Sbjct: 579 DITRQVMSNAFIPVYTNLVSTYQAGGS--VSTDGSNLIQLLRDLDSVLSTNDNFRLSTWI 636
Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
+SA+ +N+ + YE+NAR Q+T+W + + DY +K W GL+ YY PR
Sbjct: 637 QSARSWVRNDTEADFYEYNARNQVTLWGPKGE-----INDYASKQWGGLVSSYYIPRWQK 691
Query: 480 YFKYM 484
+ Y+
Sbjct: 692 FLNYL 696
>gi|326437768|gb|EGD83338.1| lysosomal alpha-N-acetyl glucosaminidase [Salpingoeca sp. ATCC
50818]
Length = 820
Score = 258 bits (660), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 165/549 (30%), Positives = 259/549 (47%), Gaps = 60/549 (10%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L + LP F+G+VP A++ +FP A +TQ W + S + LL TDPLF+++G
Sbjct: 247 LGMVSALPGFAGHVPTAIKRIFPHANLTQTAGWANFNS--TYSDVSLLQPTDPLFLQLGT 304
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F + +K +G T H++ DT++E P + ++ +Y M + D +AV+LMQGW
Sbjct: 305 KFYKMLIKAFG-TDHVFQMDTYNEMQPSFTNMTLLAESNRVVYQAMANADPEAVYLMQGW 363
Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
LF ++ +W P +K L+ VP K+++LDL E P++S + ++G +IW ML N+ G
Sbjct: 364 LF-HESYWTPEHVKVYLSGVPDDKMIILDLNTEANPVFSLTSDYFGKLWIWNMLLNYGGR 422
Query: 183 IEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
+YG I+ P+ TM G+G++ E IE NPV+++LM EM + D+ W
Sbjct: 423 RGLYGNATDISTRPLLDLHRAQGTMDGIGITPEAIENNPVMFELMLEMGWHATPPDMHDW 482
Query: 243 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
I Y+ RYG+ Q AW +L VY+ D +R + PD +S +E +
Sbjct: 483 IAAYASSRYGKRESLTQSAWQLLLEHVYD----QPDIDRFHMEMVPD-----LSSSESRN 533
Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI--ASGNELSASNTYRYDLID 360
N T+ +++A L + A L + + YDL+D
Sbjct: 534 SN--------------------------TTALVQAWRLLVTAAVNGSLPITGPFSYDLVD 567
Query: 361 LTRQA-----------LAKYANELFLNIIEAYQLNDAH--GVFQLSRRFLELVEDMDGLL 407
+ RQA L + E NI + +H + L L++ D+D LL
Sbjct: 568 VGRQALLNLWSDVRGMLVAHVKEYNANIDSSPSTAASHVPAIKSLFTLLLDITSDLDRLL 627
Query: 408 ACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSG 467
+LLG WLESAK A N ++ E+NAR QIT+W + + + DY K W G
Sbjct: 628 GTDVNYLLGVWLESAKATAANADERATREFNARNQITLWGPDGE-----ITDYAAKQWQG 682
Query: 468 LLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALI 527
L+ DYY R + + +L S ++ +K W N YP D +
Sbjct: 683 LVSDYYVKRWEMMHDATLSALNSSTKIDTSA-PKDTLKFEQAWGNENKTYPTAPQADVVK 741
Query: 528 TSQWLYNKY 536
S + KY
Sbjct: 742 VSAAMLQKY 750
>gi|242809019|ref|XP_002485282.1| alpha-N-acetylglucosaminidase, putative [Talaromyces stipitatus
ATCC 10500]
gi|218715907|gb|EED15329.1| alpha-N-acetylglucosaminidase, putative [Talaromyces stipitatus
ATCC 10500]
Length = 755
Score = 255 bits (652), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 153/492 (31%), Positives = 263/492 (53%), Gaps = 43/492 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF G VP A+ V P A + W + + + ++ TDP F EI ++FI +
Sbjct: 239 ILPAFPGFVPRAITRVLPDADVINGSAWEAFPT--MYTNDTFMEPTDPHFTEIQKSFIAK 296
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
Q++ YG + Y D F+EN P Y+ ++ + +++ DS+AVW+MQGWLF S
Sbjct: 297 QIEAYGNVTTFYTLDQFNENNPSSGDLSYLRNVSQGTWKTLKAADSNAVWVMQGWLFTSN 356
Query: 127 DPFWRPPQMKALLNSVPL-GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
FW +++A L V + L++LDL +E P W + +YG P+IWC +H++ GN+
Sbjct: 357 SAFWTNDRIEAYLGGVAVDSDLLILDLASESSPQWQRTNSYYGKPWIWCEIHDYGGNMGF 416
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG + +I P+ A ++++VG G+SMEG E N +VYDL+ + A+ +D +++ +
Sbjct: 417 YGQVMNITNNPIAA-LHNSSSLVGFGLSMEGQEGNEIVYDLLLDQAWNAAPIDTESYFHD 475
Query: 246 YSVRRYG--RSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
+ RY RS+P ++ AW++L TVYN T+ A + A P +I T G
Sbjct: 476 WVTARYAGSRSIPSSVYSAWDILRTTVYNNTNLAAN-------AVPKAIFELIPSTTGLL 528
Query: 303 QNYGKPVSKEAVLKSETSSYDHP-HLWYSTSEVIRALELFIASGNE---LSASNTYRYDL 358
G HP L Y+T+++++A LF S + L + + +DL
Sbjct: 529 NRTGH----------------HPTKLNYNTADMVQAWNLFYTSAFKEPSLWLNPAFEFDL 572
Query: 359 IDLTRQALAKYANELFLNIIEAYQLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
+D++RQ LA ++ N+I Y ++ + + + + +++ +D +LA + F L
Sbjct: 573 VDMSRQVLANAFIPVYENLISTYNTSNPSSTKLQTIGAELIGILQALDTVLATNKNFKLS 632
Query: 417 PWLESAKQLAQNEEQEKQY-EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
WL +A+ A ++ + + E+NAR QIT+W Q + DY +K W+GL+ YY P
Sbjct: 633 TWLSAARASAGSQHNIEDFLEYNARNQITLWGPTGQ-----ISDYASKSWAGLVSSYYIP 687
Query: 476 RAAIYFKYMIES 487
R ++ +Y+I++
Sbjct: 688 RWKMFVEYLIDT 699
>gi|121698957|ref|XP_001267859.1| alpha-N-acetylglucosaminidase, putative [Aspergillus clavatus NRRL
1]
gi|119396001|gb|EAW06433.1| alpha-N-acetylglucosaminidase, putative [Aspergillus clavatus NRRL
1]
Length = 671
Score = 255 bits (651), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 150/450 (33%), Positives = 245/450 (54%), Gaps = 38/450 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP A+ V P A + W D ++ L+ DP F + R+FI +
Sbjct: 233 VLPAFTGFVPRAITRVLPDATVVNGSRWSGF--DEKYTNDTFLEPFDPNFARLQRSFIHK 290
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
Q + YG +HIY D ++EN P PEY+ ++ + ++S D DA+W+MQGWLF S
Sbjct: 291 QQQAYGNITHIYTLDQYNENDPYSGDPEYLRNVTHNTWQSLKSADPDAIWMMQGWLFYSN 350
Query: 127 DPFWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
FW ++ A L+ V + ++VLDLF+E +P W ++ +YG P+IWC LH++ GN+ +
Sbjct: 351 SDFWTDERVHAYLSGVETDEDMLVLDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGL 410
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG + +I +A ++ +VG G++MEG E N +VYDL+ + A+ +D ++ +
Sbjct: 411 YGQVMNITVNATDALAVSDS-LVGYGLTMEGQEGNEIVYDLLLDQAWSSRPIDTDSYFHD 469
Query: 246 YSVRRYG----RSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
+ RY +VP + AW++L T YN T+ AT V SI +
Sbjct: 470 WVKARYSTARRHNVPHELYQAWDILRTTAYNNTNLATATA---------VSKSIFEL--- 517
Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLW-YSTSEVIRALELFIASGNELSA---SNTYRY 356
+P K L ++T HP + Y S ++R+ +L +++ +E +A +RY
Sbjct: 518 ------QP--KLTGLVNQTGH--HPTVVNYEASSLVRSWKLMVSAASESTALWSHPAFRY 567
Query: 357 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
D++D+TRQ +A ++LN+ YQ + Q + L+ D+D +L+ +D F L
Sbjct: 568 DMVDVTRQVMANAFIPMYLNVTSTYQ--KGGPISQQGDSLIRLLRDLDAVLSTNDNFRLA 625
Query: 417 PWLESAKQLAQNEEQEKQYEWNARTQITMW 446
W+ESA+ A+N+ + YE+NAR QIT+W
Sbjct: 626 TWIESARTWARNDTEADFYEYNARNQITLW 655
>gi|299149196|ref|ZP_07042257.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
gi|298512863|gb|EFI36751.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
Length = 738
Score = 255 bits (651), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 158/532 (29%), Positives = 263/532 (49%), Gaps = 40/532 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPAAL+ V+P+ K T++ W R CT+L + D L+ I + ++ +
Sbjct: 242 VLPAFAGHVPAALKRVYPNIKTTRVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTE 298
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YG T+HIY D F+E PP + + + IY + + D +AVWL WLF D
Sbjct: 299 QTRLYG-TNHIYGIDPFNEIDPPSWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYAD 357
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P++K+ L SVP +L++LD F E IW + ++G PY+WC L NF GN +
Sbjct: 358 IKHWTTPRIKSYLRSVPQDRLILLDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLS 417
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G ++ ++ +A + + + GVG ++EGI+ N +Y+ + + A+ + D K W +
Sbjct: 418 GPVNLVSERLADALKNGGSNLKGVGSTLEGIDLNQFMYEFVLDKAWNGGQTD-KEWFFKL 476
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ RR G+ P + AW +L + VY V P+ V +G N
Sbjct: 477 ADRRIGKISPEARKAWEILANKVY-------------------VQPA--QVGQGTLTN-A 514
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
+P LK P + Y +++ A L ++ + ++Y +DL+++ RQ L
Sbjct: 515 RP-----CLKGNGHWTTKPTIEYQPKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVL 567
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
Y N + AY+ D + + E++ D+D L++CH F L W+ A+ +
Sbjct: 568 GNYFNVVRDEFTLAYEAGDIPMMKNRGNKMREILADLDKLVSCHPTFSLHKWITDARDMG 627
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
+ + YE NAR+ IT+W D+ L DY N+ W+GL YY R + +IE
Sbjct: 628 HDAASKNYYEMNARSLITIWGDSYH-----LTDYANRSWAGLTNQYYSVRWDHFINEVIE 682
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
+ E F +++ + N+W N N GD + ++ +Y KY +
Sbjct: 683 AAEKKKNFDEEEFFNQSRMYENEWVNPSNRISYNEGGDGIKLARQIYKKYAK 734
>gi|237717696|ref|ZP_04548177.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|229453015|gb|EEO58806.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
Length = 729
Score = 255 bits (651), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 158/532 (29%), Positives = 263/532 (49%), Gaps = 40/532 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPAAL+ V+P+ K T++ W R CT+L + D L+ I + ++ +
Sbjct: 233 VLPAFAGHVPAALKRVYPNIKTTRVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YG T+HIY D F+E PP + + + IY + + D +AVWL WLF D
Sbjct: 290 QTRLYG-TNHIYGIDPFNEIDPPSWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYAD 348
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P++K+ L SVP +L++LD F E IW + ++G PY+WC L NF GN +
Sbjct: 349 IKHWTTPRIKSYLRSVPQDRLILLDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLS 408
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G ++ ++ +A + + + GVG ++EGI+ N +Y+ + + A+ + D K W +
Sbjct: 409 GPVNLVSERLADALKNGGSNLKGVGSTLEGIDLNQFMYEFVLDKAWNGGQTD-KEWFFKL 467
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ RR G+ P + AW +L + VY V P+ V +G N
Sbjct: 468 ADRRIGKISPEARKAWEILANKVY-------------------VQPA--QVGQGTLTN-A 505
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
+P LK P + Y +++ A L ++ + ++Y +DL+++ RQ L
Sbjct: 506 RP-----CLKGNGHWTTKPTIEYQPKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVL 558
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
Y N + AY+ D + + E++ D+D L++CH F L W+ A+ +
Sbjct: 559 GNYFNVVRDEFTLAYEAGDIPMMKNRGNKMREILADLDKLVSCHPTFSLHKWITDARDMG 618
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
+ + YE NAR+ IT+W D+ L DY N+ W+GL YY R + +IE
Sbjct: 619 HDAASKNYYEMNARSLITIWGDSYH-----LTDYANRSWAGLTNQYYSVRWDHFINEVIE 673
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
+ E F +++ + N+W N N GD + ++ +Y KY +
Sbjct: 674 AAEKKKNFDEEEFFNQSRMYENEWVNPSNRISYNEGGDGIKLARQIYKKYAK 725
>gi|242077446|ref|XP_002448659.1| hypothetical protein SORBIDRAFT_06g030930 [Sorghum bicolor]
gi|241939842|gb|EES12987.1| hypothetical protein SORBIDRAFT_06g030930 [Sorghum bicolor]
Length = 252
Score = 254 bits (650), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 122/185 (65%), Positives = 145/185 (78%)
Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
YDL+DLTRQ LAKYAN++FL IIE+Y+ N + V L + FL LV D+D LL+ H+GFLL
Sbjct: 51 YDLVDLTRQVLAKYANDVFLKIIESYKSNKMNQVTILCKHFLNLVNDLDTLLSSHEGFLL 110
Query: 416 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
GPWLESAK LA+N EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWSGLLRDYYGP
Sbjct: 111 GPWLESAKGLARNSEQEIQYEWNARTQITMWFDNTETKASLLRDYANKYWSGLLRDYYGP 170
Query: 476 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 535
RAAIYFK+++ S+E F L++WRREWI LTN+WQ+ R V+ GD+L S LY K
Sbjct: 171 RAAIYFKHLLLSMEKNAPFALEEWRREWISLTNNWQSDRKVFSTTPTGDSLNISWSLYIK 230
Query: 536 YLQGT 540
YL T
Sbjct: 231 YLSNT 235
>gi|346324333|gb|EGX93930.1| alpha-N-acetylglucosaminidase, putative [Cordyceps militaris CM01]
Length = 751
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 151/511 (29%), Positives = 266/511 (52%), Gaps = 48/511 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF G VP + V+P+ + + W R+ + DP F E+ +AF+ +
Sbjct: 239 ILPAFPGFVPENITRVWPNVSLAESPIWSGFSG--RFTADKYITPYDPHFAELQKAFLTK 296
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YG + + D F+EN P +Y+ ++ + +++ D AVW+MQGWLF+ D
Sbjct: 297 QNEAYGNVTSFWTLDQFNENKPASGELDYLKNVSHNTWQTLKAADPSAVWVMQGWLFASD 356
Query: 128 P-FWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
+W ++K+ L+ VP+ + +++LDLFAE P W ++ FYG P+IWC LH++ GN+ +
Sbjct: 357 KTYWIDDRVKSFLDGVPVNEDMLLLDLFAESTPQWQRTESFYGKPWIWCQLHDYGGNMGL 416
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG ++++ VEA + ++VG G+SMEG E N ++YDL+ + A++ E ++ + +
Sbjct: 417 YGQIENVTKNAVEA-VQTSKSIVGFGLSMEGQEGNEIMYDLLLDQAWRKEAIETDKYFSD 475
Query: 246 YSVRRYG---RSVPA-IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
+ RYG + +P + AW+ + TVYN TD + V + ++ PSI +
Sbjct: 476 WVTVRYGADHKEIPENLYTAWDKVRSTVYNNTDSSVTA---VTKSIFELAPSISGLV--- 529
Query: 302 YQNYGKPVSKEAVLKSETSSYDHP-HLWYSTSEVIRALELFIASGNE---LSASNTYRYD 357
+ HP + Y T +I A ++G++ L + YRYD
Sbjct: 530 -----------------NRTGHHPTKITYDTKTLISAWNDMFSAGDQARWLFDNEAYRYD 572
Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
L D TRQ LA + ++E Y+ N+ GV R +++ MD +L + F L
Sbjct: 573 LTDWTRQVLANAFEATYNKLVEKYKSNNTKGVKCAGDRLQAILQTMDQVLDTNPSFKLST 632
Query: 418 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 477
W+++A++ E +E+NAR Q+T+W N + + DY +K W+GL+ +YY R
Sbjct: 633 WIQAARK--SGGEAADFFEYNARNQVTLWGPNGE-----IEDYASKQWAGLVGNYYAHRW 685
Query: 478 AIYFKYMI-----ESLESGDGFRLKDWRREW 503
++ Y++ E ++ +L++W +W
Sbjct: 686 QMFVDYLVATDPKEYDQNVFKKKLREWETQW 716
>gi|83775903|dbj|BAE66022.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 633
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 154/494 (31%), Positives = 256/494 (51%), Gaps = 47/494 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF G VP A++ V P A + W + ++ L+ D F ++ ++ I +
Sbjct: 112 VLPAFPGFVPPAIKRVRPHATVVNGSQWSGFQK--KFTEVSFLNPLDETFAQLQKSVISR 169
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + +G +H+Y D F+E P Y+ +L + +++ + AVW+MQGWLF YD
Sbjct: 170 QTRAFGNVTHVYALDQFNEINPASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YD 228
Query: 128 P--FWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
FW P ++ A L+ V +++LDL++E KP W ++ ++G P+IWC LH+F GN+
Sbjct: 229 KKDFWDPNRISAYLSGVERNDDMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMG 288
Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
MYG + +I P+EA +++ ++VG G++MEG E N +VYDL+ + A+ + +D +A+
Sbjct: 289 MYGQIMNITSDPIEA-LNKSDSLVGFGLTMEGQEGNEIVYDLLLDQAWSAKPIDTRAYFQ 347
Query: 245 QYSVRRYGR--SVP-AIQDAWNVLYHTVYNCTDGAT-DKNRDVIVAFPDVDPSIISVTEG 300
+ RY SVP + AW++L TVYN T+ T + + PD+ + V G
Sbjct: 348 SWVRSRYSGNFSVPNELYTAWDLLRKTVYNNTNLTTYSLTKSIFEISPDIAGLVGRV--G 405
Query: 301 KYQ-----NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 355
Y NY V E ++ P LW+S + Y
Sbjct: 406 HYPTPTSINYDPMVLNEVWSLFMNATRKEPSLWHSPA---------------------YE 444
Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDG 412
YD++D+TRQ + ++ ++I +++ N V S R L L+ +D +L+C++
Sbjct: 445 YDMVDITRQLMGNAFVNVYSDLISSWKSETENRTTNVTSQSERLLNLLSAIDKVLSCNEN 504
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
F L W+ SA+ E + +E+NAR QIT+W + + DY +K W+GL+ Y
Sbjct: 505 FSLTTWISSARDWGNTTETKDFFEYNARNQITLWGPTGE-----ISDYASKAWAGLISSY 559
Query: 473 YGPRAAIYFKYMIE 486
Y PR +I+ Y+ E
Sbjct: 560 YKPRWSIFVDYLGE 573
>gi|395331391|gb|EJF63772.1| alpha-N-acetylglucosaminidase [Dichomitus squalens LYAD-421 SS1]
Length = 750
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 161/518 (31%), Positives = 276/518 (53%), Gaps = 40/518 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP+F+G VP AL +++P+A I W + L+ DPLF I +FI +
Sbjct: 233 VLPSFTGFVPRALSSLYPNASIVNGSQWEGFPT--ALTNDSFLEPFDPLFTTIQTSFISK 290
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
Q + YG SHIY D ++EN P P Y++++ A ++ +++ D DAVWLMQGWLF S
Sbjct: 291 QREAYGNVSHIYALDQYNENDPFSGDPAYLANVTAGTFASLRAADPDAVWLMQGWLFFSS 350
Query: 127 DPFWRPPQMKALLNSVPLG-KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
FW +++A L VP +++LDL++E +P W+ + +YG ++WC LH + GNI M
Sbjct: 351 AAFWTNERIEAYLGGVPGNDSMIILDLYSEAQPQWNRTSSYYGKQWVWCELHGYGGNIGM 410
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
G LD++ P+ A + ++M GVG++MEG E N +VYD++ + A+ +++ ++++Q
Sbjct: 411 EGDLDALTQNPIAALHAPGSSMKGVGLTMEGQEGNELVYDILLDQAWSSAPLNLSSYVDQ 470
Query: 246 YSVRRYG-RSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
+ RRY R +P + DAW L TVY+ D T + I + ++ P++ +T
Sbjct: 471 WVARRYNVRRLPKSALDAWRTLATTVYSNKDSGT---QAAIKSIYELAPALTGMT----- 522
Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE---LSASNTYRYDLID 360
+ T + + + S V+ A + + + +E L+ + YD++D
Sbjct: 523 -------------NRTGHHPTAIPYDTNSTVLVAAKALLEARSENPLLATIPEFAYDVVD 569
Query: 361 LTRQALAKYANELFLNIIEAYQLNDA--HGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
+TRQ L+ + + ++ Y N V + L L++D+D LLA ++ FLL W
Sbjct: 570 VTRQLLSNRFIDHYNVLVATYNSNATAPRNVAAAAGPLLALLDDLDELLATNEHFLLSNW 629
Query: 419 LESAKQLAQNEEQ---EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
+ AK+ ++ + E+NAR QIT+W + + + DY +K W+GL+R YY P
Sbjct: 630 IADAKRWTHGADRAAYARLLEYNARNQITLWGPDGE-----INDYASKAWAGLVRTYYKP 684
Query: 476 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNG 513
R + +Y+ ++ E+G + + I + W NG
Sbjct: 685 RWEAFVEYLAQTKEAGAAYDAHVVSAKMIAIGQQWSNG 722
>gi|317158657|ref|XP_001827155.2| alpha-N-acetylglucosaminidase [Aspergillus oryzae RIB40]
Length = 849
Score = 252 bits (643), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 154/494 (31%), Positives = 256/494 (51%), Gaps = 47/494 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF G VP A++ V P A + W + ++ L+ D F ++ ++ I +
Sbjct: 196 VLPAFPGFVPPAIKRVRPHATVVNGSQWSGFQK--KFTEVSFLNPLDETFAQLQKSVISR 253
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + +G +H+Y D F+E P Y+ +L + +++ + AVW+MQGWLF YD
Sbjct: 254 QTRAFGNVTHVYALDQFNEINPASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YD 312
Query: 128 P--FWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
FW P ++ A L+ V +++LDL++E KP W ++ ++G P+IWC LH+F GN+
Sbjct: 313 KKDFWDPNRISAYLSGVERNDDMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMG 372
Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
MYG + +I P+EA +++ ++VG G++MEG E N +VYDL+ + A+ + +D +A+
Sbjct: 373 MYGQIMNITSDPIEA-LNKSDSLVGFGLTMEGQEGNEIVYDLLLDQAWSAKPIDTRAYFQ 431
Query: 245 QYSVRRYGR--SVP-AIQDAWNVLYHTVYNCTDGAT-DKNRDVIVAFPDVDPSIISVTEG 300
+ RY SVP + AW++L TVYN T+ T + + PD+ + V G
Sbjct: 432 SWVRSRYSGNFSVPNELYTAWDLLRKTVYNNTNLTTYSLTKSIFEISPDIAGLVGRV--G 489
Query: 301 KYQ-----NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 355
Y NY V E ++ P LW+S + Y
Sbjct: 490 HYPTPTSINYDPMVLNEVWSLFMNATRKEPSLWHSPA---------------------YE 528
Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDG 412
YD++D+TRQ + ++ ++I +++ N V S R L L+ +D +L+C++
Sbjct: 529 YDMVDITRQLMGNAFVNVYSDLISSWKSETENRTTNVTSQSERLLNLLSAIDKVLSCNEN 588
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
F L W+ SA+ E + +E+NAR QIT+W + + DY +K W+GL+ Y
Sbjct: 589 FSLTTWISSARDWGNTTETKDFFEYNARNQITLWGPTGE-----ISDYASKAWAGLISSY 643
Query: 473 YGPRAAIYFKYMIE 486
Y PR +I+ Y+ E
Sbjct: 644 YKPRWSIFVDYLGE 657
>gi|391873368|gb|EIT82411.1| alpha-N-acetylglucosaminidase [Aspergillus oryzae 3.042]
Length = 633
Score = 252 bits (643), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 153/497 (30%), Positives = 258/497 (51%), Gaps = 51/497 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF G VP A++ V P A + W + ++ L D F ++ ++ I +
Sbjct: 112 VLPAFPGFVPPAIKRVRPHATVVNGSQWSGFQK--KFTEVSFLSPLDRTFADLQKSVISR 169
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q++ +G +H+Y D F+E P Y+ +L + +++ + AVW+MQGWLF YD
Sbjct: 170 QMRAFGNITHVYALDQFNEINPASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YD 228
Query: 128 P--FWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
FW ++ A L+ V +++LDL++E KP W ++ ++G P+IWC LH+F GN+
Sbjct: 229 KKDFWDSNRISAYLSGVERNDDMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMG 288
Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
MYG + +I P+EA N ++VG G++MEG E N +VYDL+ + A+ +D +A+
Sbjct: 289 MYGQIMNITSDPIEALNKSN-SLVGFGLTMEGQEGNEIVYDLLLDQAWSATPIDTRAYFQ 347
Query: 245 QYSVRRYGR--SVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
+ RY R SVP + AW++L TVYN T+ T V + ++ P I + G+
Sbjct: 348 SWVRSRYSRNFSVPNELYTAWDLLRKTVYNNTNLTT---YSVTKSIFEISPDIAGLV-GR 403
Query: 302 YQNYGKPVS--KEAVLKSET------SSYDHPHLWYSTSEVIRALELFIASGNELSASNT 353
+Y P S + ++ +E ++ P LW++ +
Sbjct: 404 VGHYPTPTSINYDPMVLNEVWSLFMNATRKEPSLWHNPA--------------------- 442
Query: 354 YRYDLIDLTRQALAKYANELFLNIIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACH 410
Y YD++D+TRQ + ++ +I +++ N V S R L L+ +D +L+C+
Sbjct: 443 YEYDMVDITRQLMGNAFVNVYSVLITSWKSETENRTTKVTSQSERLLNLLSAIDKVLSCN 502
Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
+ F L W+ SA+ E + +E+NAR QIT+W + + DY +K W+GL+
Sbjct: 503 ENFSLATWISSARDWGNTTETKDFFEYNARNQITLWGPTGE-----ISDYASKAWAGLIS 557
Query: 471 DYYGPRAAIYFKYMIES 487
YY PR +I+ Y+ E+
Sbjct: 558 SYYKPRWSIFVDYLGEN 574
>gi|319900259|ref|YP_004159987.1| alpha-N-acetylglucosaminidase [Bacteroides helcogenes P 36-108]
gi|319415290|gb|ADV42401.1| Alpha-N-acetylglucosaminidase [Bacteroides helcogenes P 36-108]
Length = 718
Score = 251 bits (642), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 169/544 (31%), Positives = 266/544 (48%), Gaps = 69/544 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAF 64
VLP +SG +PA + +LG V +WC L +D F I R +
Sbjct: 224 VLPGYSGMIPANAKE--------KLG--LDVADPGKWCGYRRPAFLQPSDKNFRRIARLY 273
Query: 65 IEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
++ + YG+ ++ Y+ D F E NT VD + + G +I M+ + AVW+ Q W
Sbjct: 274 YKEMTRLYGKANY-YSMDPFHEGGNTKGVD----LDAAGKSIRDAMKEANPQAVWVAQAW 328
Query: 123 -LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST------SKQFYGV-PYIWC 174
YD ++ ++P G ++VLDL++E +P W KQ +G +I+C
Sbjct: 329 GACPYD---------NMIKNLPEGDMIVLDLYSESRPQWGDPASAWYRKQGFGRHGWIYC 379
Query: 175 MLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQ 233
ML NF GN+ +YG ++ + +AR S T+ GVG++MEG E NPV+Y+L+ E+ +
Sbjct: 380 MLLNFGGNVGLYGKMEHVIDEFYKARESAFGGTLQGVGLTMEGSENNPVMYELLCELPWH 439
Query: 234 HEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVD 291
++ W+ Y RYG++ P +AW L +T+YN + +T + + V A P ++
Sbjct: 440 GRRISKDQWLKSYLKARYGKTTPQTVEAWLKLSNTIYNSPNASTQQGTHESVFCARPSLE 499
Query: 292 PSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS 351
YQ SS+ +Y+ +++IRA I + E +
Sbjct: 500 A---------YQ---------------VSSWSEMKDYYAPADIIRAAGKMIEAAEEFRGN 535
Query: 352 NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHD 411
N + YDLID+ RQA+A+ ++ ++ AY+ D S RFLEL+E D LL
Sbjct: 536 NNFEYDLIDVVRQAVAEKGRLVYPIVVSAYKAADKQLFEAASARFLELIELQDKLLGTRR 595
Query: 412 GFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRD 471
F LG W A+ + + + Q+ YEWNAR QIT W + T L DY +K W+GLLRD
Sbjct: 596 EFRLGTWTNYARNMGETDAQKDLYEWNARVQITTWGNRTAANEGGLHDYAHKEWNGLLRD 655
Query: 472 YYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQW 531
+Y R YF + +L +G+ + D + L +W N Y E GDA ++
Sbjct: 656 FYYMRWKAYFDELRSTL-NGNAPKETD----FYTLEENWAGQHNPYSAEPEGDATDIAKE 710
Query: 532 LYNK 535
+Y K
Sbjct: 711 VYGK 714
>gi|336417192|ref|ZP_08597519.1| hypothetical protein HMPREF1017_04627 [Bacteroides ovatus
3_8_47FAA]
gi|423297818|ref|ZP_17275878.1| hypothetical protein HMPREF1070_04543 [Bacteroides ovatus
CL03T12C18]
gi|335936512|gb|EGM98438.1| hypothetical protein HMPREF1017_04627 [Bacteroides ovatus
3_8_47FAA]
gi|392664455|gb|EIY57993.1| hypothetical protein HMPREF1070_04543 [Bacteroides ovatus
CL03T12C18]
Length = 727
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 156/532 (29%), Positives = 259/532 (48%), Gaps = 40/532 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPAAL+ V+P+ K +++ W R CT+L + D L+ I + ++ +
Sbjct: 231 VLPAFAGHVPAALKRVYPNIKTSRVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTE 287
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YG T+HIY D F+E PP + + + IY + + D A+WL WLF D
Sbjct: 288 QTRLYG-TNHIYGIDPFNEIDPPSWDTDSLGMMAKHIYESVAAVDPKAIWLQMTWLFYAD 346
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P++K+ L SVP KL++LD F E IW + ++G PY+WC L NF GN +
Sbjct: 347 IKHWTTPRIKSYLRSVPQDKLILLDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLS 406
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + ++ +A + + + GVG ++EGI+ N +Y+ + + A+ + D K W +
Sbjct: 407 GPVKLVSERLADALKNGGSNLKGVGSTLEGIDLNQFMYEFVLDKAWNSGQTD-KEWFLKL 465
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ RR G+ P + AW +L VY + P+ V +G N
Sbjct: 466 ADRRTGKVSPEARKAWEILADKVY-------------------IQPA--QVGQGTLTN-A 503
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
+P LK P + Y +++ A L + + ++Y +DL+++ RQ L
Sbjct: 504 RP-----CLKGNGHWTTKPTIEYQPKDLVEAWRLLLLVKD--CQRDSYEFDLVNIGRQVL 556
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
Y N + AY+ D + + E++ D+D L++CH F L W+ A+ +
Sbjct: 557 GNYFNVVRDEFTLAYEAGDIMMMKNRGDKMREILADLDKLVSCHPTFSLNKWITDARDMG 616
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
+ + YE NAR+ IT+W D+ L DY N+ W+GL YY R + +I+
Sbjct: 617 HDATSKNYYEMNARSLITIWGDSYH-----LTDYANRSWAGLTNQYYSVRWDRFINEVIK 671
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
++E F + + E N+W N N GD + ++ +Y KY +
Sbjct: 672 AVEKKKAFDEEVFFNESRMYENEWVNPSNRINYNEGGDGIKLARQIYKKYAK 723
>gi|212537509|ref|XP_002148910.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
18224]
gi|210068652|gb|EEA22743.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
18224]
Length = 768
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 152/516 (29%), Positives = 264/516 (51%), Gaps = 43/516 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF G VP A+ V P A + W + + + ++ TDP F EI ++FI +
Sbjct: 248 ILPAFPGFVPRAITRVLPDADVINGSAWEAFPA--MFTSDTFMEPTDPHFTEIQKSFISK 305
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
Q YG + Y D F+EN P Y+ S+ + +++ D AVW+MQGWLF S
Sbjct: 306 QTAAYGNVTTFYTLDQFNENNPSSGDLNYLRSVSHGTWQALKAADPSAVWVMQGWLFFSN 365
Query: 127 DPFWRPPQMKALLNSVPL-GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
FW +++A L V + L+VLDL +E +P W + ++G P+IWC +H++ GN+
Sbjct: 366 SAFWTNDRVEAYLGGVTVDSDLLVLDLASESQPQWQRTNSYFGKPWIWCQIHDYGGNMGF 425
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG + +I P+ A + ++VG G+SMEG E N VVYDL+ + A+ + +D + +
Sbjct: 426 YGQVMNITVNPIAALNNATASLVGFGLSMEGQEGNEVVYDLLLDQAWSAKPIDTATYFHD 485
Query: 246 YSVRRYG--RSVPA-IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
+ RY +S+P + AW++L +VYN T+ A++ A P +I T G
Sbjct: 486 WVTARYAGSKSIPTDVYSAWDMLRTSVYNNTNLASN-------AVPKAIFELIPSTTGLV 538
Query: 303 QNYGKPVSKEAVLKSETSSYDHP-HLWYSTSEVIRALELFIASGNE---LSASNTYRYDL 358
G HP L Y+ +++++A LF ++ + L + Y +DL
Sbjct: 539 NRTGH----------------HPTTLNYNPADMVKAWSLFYSAAFKEPSLWLNPAYEFDL 582
Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVF--QLSRRFLELVEDMDGLLACHDGFLLG 416
+D++RQ LA ++ ++I A+ + + + + +++ +D +L ++ F L
Sbjct: 583 VDMSRQVLANAFIPVYHDLIAAWNTTNPSTIRIQIIGAELIGILQAIDTILDTNEHFKLS 642
Query: 417 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
W+ +A+ A + E E+NA QIT+W Q + DY +K W+GL+ YY PR
Sbjct: 643 TWISAARTSAGEQSLEDFLEYNALNQITLWGPTGQ-----ISDYASKSWAGLVSSYYIPR 697
Query: 477 AAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 512
++ +Y++++ + + ++ E +K WQN
Sbjct: 698 WKMFIEYLVDTKPA--QYNQTAFKAELLKWELQWQN 731
>gi|410100551|ref|ZP_11295511.1| hypothetical protein HMPREF1076_04689 [Parabacteroides goldsteinii
CL02T12C30]
gi|409215586|gb|EKN08585.1| hypothetical protein HMPREF1076_04689 [Parabacteroides goldsteinii
CL02T12C30]
Length = 739
Score = 250 bits (638), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 174/555 (31%), Positives = 266/555 (47%), Gaps = 79/555 (14%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWC----CTYLLDATDPLFIEIGRA 63
+L +F+G VP L+ +P A+I K RWC T LD DPLF E+G+A
Sbjct: 231 ILQSFTGFVPIKLKEKYPDARI---------KDKNRWCNAFTATVQLDPLDPLFKEMGQA 281
Query: 64 FIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL 123
F+E+Q K YG T+HIY D F E P + Y+ ++G I+ D +AV MQ W
Sbjct: 282 FLEEQQKLYG-TNHIYAADPFHEGAAPSNEKSYLEAVGKVIWEVASGFDPEAVIAMQTWS 340
Query: 124 FSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSK--QFYGVPYIWCMLHNFAG 181
+A+ + P +L++LDL W+ K F+ PY+ +LHN+ G
Sbjct: 341 LR----------EAITRTFPQDRLLLLDLGG-----WNVEKFNSFWNYPYVAGVLHNYGG 385
Query: 182 NIEMYGILDSIAFGPVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVK 240
+ M G L A E + S + + G+G+ E IE NPVVY+L +E+ + + D++
Sbjct: 386 RVYMGGNLALYAKNAHELKQSPKGGNIQGIGLFPEAIEHNPVVYELSTEITWMQDAPDLQ 445
Query: 241 AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
WI Y+ RYG+ + W VL TVY G V+ A P + ++
Sbjct: 446 KWITDYARARYGKLPAGAEQGWKVLLETVYGSKAGRLPSTESVMCARPALTIQKVAAN-- 503
Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
+ +P YST + A++ F+ + N+L S+TYRYDL+D
Sbjct: 504 --GDLSRP--------------------YSTVRLWDAVDHFLQASNDLKKSDTYRYDLVD 541
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
+ RQ L+ + L I EAY D + Q +FL L++D D LL FLLG W++
Sbjct: 542 VMRQCLSDLSLPLQKQITEAYLAEDNEKLQQAGEQFLALIDDFDRLLGTRSTFLLGKWIK 601
Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
A+Q EE++ YEWNART +T+W N ++ L +Y N+ W+GL++ YY PR +
Sbjct: 602 EARQWGTTEEEKALYEWNARTLVTVWGPN--HPSAHLFEYSNRQWAGLMKGYYKPRWEKF 659
Query: 481 F---------------KYMIESLESGDGFRLKDWRREWIKLTN---DWQNGRNVYPVESN 522
+Y+ +SL D+ + +LTN DW ++VY
Sbjct: 660 ISYLKAQPKGEWRYDEQYIRKSLAGRPALDASDF---YTRLTNWEYDWAFNKDVYTDTPQ 716
Query: 523 GDALITSQWLYNKYL 537
G+ + + LY K+L
Sbjct: 717 GNEIEIVKELYAKWL 731
>gi|393236266|gb|EJD43816.1| putative alpha-N-acetylglucosaminidase [Auricularia delicata
TFB-10046 SS5]
Length = 778
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 155/531 (29%), Positives = 265/531 (49%), Gaps = 56/531 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
LP+F+G VP A+ V P A + W S D T+L + DP F + ++FIE+
Sbjct: 249 ALPSFTGFVPRAISRVLPGASVVNGSRW-SGFPDALTRVTFL-EPFDPAFARLQKSFIEK 306
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
Q+ YG SH+Y D ++EN P + Y+ + + + +++ D DA+WLMQGWLF S
Sbjct: 307 QIAAYGPVSHVYTLDQYNENDPLKNDVGYLRDVSRSTWQSLKAADPDAIWLMQGWLFYSN 366
Query: 127 DPFWRPPQMKALLNSVPLG-KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
FW +++A L V +++LDLF+E +P W + +YG P+IWC LH++ GN+ +
Sbjct: 367 RGFWTNARVEAFLGGVEKNDDMLILDLFSESEPQWQRTNSYYGKPWIWCQLHDYGGNLGL 426
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG + +I VEA ++ ++VG G++MEG E N ++YDL+ A+ + +D ++
Sbjct: 427 YGQVMNITLNAVEA-LEKSPSLVGFGLTMEGQEGNEIMYDLLLSQAWSRKPIDTASYFRS 485
Query: 246 YSVRRY------GRSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVT 298
++ RRY G +P AI +AW++L TVYN T A++ V + ++ P++ +
Sbjct: 486 WATRRYNAGGIIGSLLPSAIYNAWDILRTTVYNNTKLASNA---VTKSVFELRPALSGI- 541
Query: 299 EGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA----SNTY 354
+ + + + Y T +++A +LF + A + Y
Sbjct: 542 ------------------ANRTGHHATTITYDTQALVKAYDLFDKAAIYTPALWFNNPAY 583
Query: 355 RYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG------------VFQLSRRFLELVED 402
+D +D RQ L+ + + +++ Y G + R + ++
Sbjct: 584 EFDNVDFARQVLSNAFSTQYDDLVATYNEISKPGGSGATLAEAAKIIHDKGERMMGVLAS 643
Query: 403 MDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGN 462
+D +L F L WL+ A+ A+ +E +E+NAR QIT+W Q + DYG+
Sbjct: 644 LDKVLRTSKHFTLKKWLQDARAWARGGHEE-LFEYNARNQITLWGPTGQ-----INDYGS 697
Query: 463 KYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNG 513
K W GL+ +YY R I+F Y+ + +G F L +++ DWQ
Sbjct: 698 KAWGGLVSEYYAQRWRIFFTYLESVVAAGQPFNLTAVGNQFLAFQLDWQTA 748
>gi|404406438|ref|ZP_10998022.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
Length = 726
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 165/540 (30%), Positives = 262/540 (48%), Gaps = 51/540 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V +F+G+VP AL+ +FP A I +L W S + R +Y L+ +PLF I +A++++
Sbjct: 227 VFTSFTGHVPKALKTLFPDADIERLNPWTSFE---RPYNSYYLNPAEPLFNRIQQAYMQE 283
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + +G +S +Y D F+E PP PEY++ Y + D DAVWL W+F +
Sbjct: 284 QRRLFGESS-VYGVDPFNELDPPNWDPEYLARAARLTYESITQFDKDAVWLQMAWVFYHK 342
Query: 128 PF-WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P ++KA L +VP GKL++LD + + +W +++ FYG P+IW L NF GN +
Sbjct: 343 RRDWTPERLKAYLCAVPDGKLLMLDYYCDKVELWRSTESFYGQPFIWSYLGNFGGNTMLA 402
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + ++ A VG+G ++EG++ NP +Y+ + + A+ + D WI++
Sbjct: 403 GDVKDVSRKLDRAYAEAGRNFVGIGCTLEGLDVNPFMYEYVLDRAWT-QLYDDAGWIDRL 461
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R GR + AW +LY +Y G NR V
Sbjct: 462 ADRHSGRIDVHYRQAWRILYDKIYCAPSG----NRSAAVC-------------------A 498
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
+P K S + PHL Y +++R E + E +AS+ R+D +++ RQ L
Sbjct: 499 RPNMK------GRSKWSGPHLDYDNRDLLRVWEQLTLARPERTASS--RFDCVNIPRQCL 550
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
Y L I A + D V +LS R LEL++D+D L+A FLLG W+ A+++
Sbjct: 551 ENYFGNLNERCIAACRGGDRETVARLSARLLELLDDIDRLVAADAYFLLGKWIADARRMG 610
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
++ +E +AR +T W L DY N+ WSGL+ DYY R ++ +
Sbjct: 611 ATPAEKDYFERDARNILTTW----GGRGYSLNDYANRTWSGLVSDYYKERWRRFYDRLQS 666
Query: 487 SLESGDGF---RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 543
E + L+D+ EW+ + GR + GDA + LY KY G F
Sbjct: 667 DGEPDEDALLQELQDFEWEWVG-----RKGR--FAERPRGDAFRLCRSLYTKYAAEIGRF 719
>gi|423346424|ref|ZP_17324112.1| hypothetical protein HMPREF1060_01784 [Parabacteroides merdae
CL03T12C32]
gi|409220242|gb|EKN13198.1| hypothetical protein HMPREF1060_01784 [Parabacteroides merdae
CL03T12C32]
Length = 718
Score = 249 bits (636), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 166/541 (30%), Positives = 263/541 (48%), Gaps = 67/541 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V P +SG VP + ++ G W + L TDP F EI + ++
Sbjct: 224 VFPGYSGMVPHNAKEKL-GLNVSDPGLWNGYRRPA------FLQPTDPRFEEIASLYYKE 276
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
K YG+ + Y+ D F E V + + G AI M+ + AVW+ Q W +
Sbjct: 277 MNKLYGKADY-YSMDPFHEGGSVVGVD--LDAAGKAIMQAMKKNNPKAVWVAQAWQANPR 333
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFA 180
P ++ ++ G L+VLDLFAE +P W K +G +I+CML N+
Sbjct: 334 P--------QMIGNLEAGDLIVLDLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLNYG 385
Query: 181 GNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDV 239
GN+ ++G + + +A+ S T+ GVGM+MEG E NPV+++L++E+ ++ ++ D
Sbjct: 386 GNVGLHGKMKHVIDEFYKAKESPFGKTLKGVGMTMEGSENNPVMFELLTELPWRPQRFDK 445
Query: 240 KAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISV 297
W+ +Y+V RYG+S P +QDAW +L +++YNC D T + + V A P
Sbjct: 446 DQWLREYTVARYGKSNPTVQDAWILLSNSIYNCPDANTQQGTHESVFCARP--------- 496
Query: 298 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 357
TE YQ SS+ +Y + VIRA + ++ +E +N + YD
Sbjct: 497 TEHPYQ---------------VSSWSEMKDYYDPNNVIRAAAMMVSVADEFKGNNNFEYD 541
Query: 358 LIDLTRQALAKYANELFLNIIEA-YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
L+D+ RQA+A+ L ++EA + D S RFL L+ D LLA F +G
Sbjct: 542 LVDIVRQAIAE-KGRLTEKVVEAAFAAGDKKLYKDASDRFLRLILLQDELLATRPEFKVG 600
Query: 417 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
W+ A+ L E+++ YEWNAR QIT W + + LRDY ++ W+G+L+D+Y R
Sbjct: 601 TWIARARSLGSTPEEKELYEWNARVQITTWGNRLAADEGGLRDYAHREWNGILKDFYYMR 660
Query: 477 AAIYFKYMIESLESGDGFRLKDWRR----EWIKLTNDWQNGRNVYPVESNGDALITSQWL 532
+F Y RL D R+ ++ + W NVY E GD + T + +
Sbjct: 661 WKTWFDYQT---------RLLDGRKTAAIDFYAIEERWTKATNVYSSEPEGDCISTVKRI 711
Query: 533 Y 533
+
Sbjct: 712 F 712
>gi|404487028|ref|ZP_11022215.1| hypothetical protein HMPREF9448_02671 [Barnesiella intestinihominis
YIT 11860]
gi|404335524|gb|EJZ61993.1| hypothetical protein HMPREF9448_02671 [Barnesiella intestinihominis
YIT 11860]
Length = 726
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 161/530 (30%), Positives = 251/530 (47%), Gaps = 45/530 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPAAL+ + P A I LG W R + L+ +PLF EI ++F+E+
Sbjct: 233 VLPAFAGHVPAALKRIHPDANIQYLGKWAGFGDSYR---CHFLNPEEPLFAEIQKSFLEE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G T HIY D F+E PP PEY++ + + +Y + + D DAVWL W+F +D
Sbjct: 290 QEKMFG-TDHIYGVDPFNEVDPPSWEPEYLAQVSSDMYKSLAAADPDAVWLQMTWMFYHD 348
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P++KALL VP KLV+LD E +W ++++F+G PYIWC L NF GN +
Sbjct: 349 RKLWTAPRVKALLTGVPSDKLVLLDYHCENVELWKSTEKFHGQPYIWCYLGNFGGNTTLT 408
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + A + + G+G ++EG++ N Y+ + E A+ + V+ + W+ +
Sbjct: 409 GNVKESGDRLDNALINGGDNLKGIGSTLEGLDINQFPYEYIFEKAWTID-VNGQDWVERL 467
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G + ++AW +L+ DV V P + NY
Sbjct: 468 ADRHVGAVSESAREAWQILFD--------------DVFVQVPRTLGILPGYRPKLGDNYN 513
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
K S E Y + ++R EL + + + + D+I RQ L
Sbjct: 514 KRTSNE----------------YDNATLLRVWELLLEVPS--CDRDAFEIDVIMTGRQLL 555
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
Y ++ Y+ + G+ + + E++ D++ L + H+ L W+E A+ L
Sbjct: 556 GNYFLDVKKEFDGFYKKRNVPGLKEKASEMREILSDLELLNSFHNRASLDKWIEDARSLG 615
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
+E + YE NAR IT W L DY ++ W+GLL DYY R IYF +I
Sbjct: 616 DTDELKNYYEKNARNLITTW-------GGSLNDYASRTWAGLLNDYYARRWEIYFDAVIG 668
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ E G + + +W +E NG L TS+ L KY
Sbjct: 669 AAEKGIELDKDELKSRLATFEQEWVESTTPVCIERNGTLLDTSRRLLEKY 718
>gi|400599317|gb|EJP67021.1| alpha-N-acetylglucosaminidase, putative [Beauveria bassiana ARSEF
2860]
Length = 753
Score = 248 bits (634), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 151/511 (29%), Positives = 260/511 (50%), Gaps = 48/511 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF G VP + V+P+ + + W R+ + DP F E+ +AF+ +
Sbjct: 239 ILPAFPGFVPENITRVWPNVSLAESPTWSGFSG--RFTADKYITPYDPRFAELQKAFLTK 296
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YG + + D F+EN P Y+ ++ + ++ D AVW+MQGWLF+ D
Sbjct: 297 QNEAYGNVTSFWTLDQFNENKPASGELGYLRNVSHNTWQTLKDADPSAVWVMQGWLFASD 356
Query: 128 -PFWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
+W ++K+ L+ VP+ + +++LDLFAE P W + FYG P+IWC LH + GN+ +
Sbjct: 357 KAYWTDDRVKSFLDGVPVNEDMLLLDLFAESTPQWQRTDSFYGKPWIWCQLHGYGGNMGL 416
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG ++++ VEA ++ ++VG+G+SMEG E N ++Y+L+ + A+ E ++ + +
Sbjct: 417 YGQIENVTRNAVEA-VQKSPSIVGLGLSMEGQEGNEIMYNLLLDQAWSKEALETDKYFSD 475
Query: 246 YSVRRYG---RSVPA-IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
+ RYG + +P + AW+ + TVYN TD + + A ++ T G
Sbjct: 476 WVTVRYGADQKEIPKDLYTAWDKVRSTVYNNTDSS-------VTAVAKSIFELVPSTSGL 528
Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE---LSASNTYRYDL 358
G +K + Y T +I A +G++ L + Y YDL
Sbjct: 529 VNRTGHHATK---------------ITYDTETLISAWNDMFNAGSQARWLFDNEAYSYDL 573
Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
D TRQ LA + ++E Y+ N+ GV R ++ MD +L + F L W
Sbjct: 574 TDWTRQVLANAFEATYNKLVEKYKSNNIKGVKCAGSRLQAILRTMDQVLETNVHFRLSTW 633
Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
+++A++ + +E+NAR Q+T+W N + + DY +K W+GL+ DYY R
Sbjct: 634 IQAARK--SGGDAADFFEYNARNQVTLWGPNGE-----IEDYASKQWAGLIGDYYAHRWQ 686
Query: 479 IYFKYMIESLESGD----GFR--LKDWRREW 503
++ Y++ + + GD F+ L +W +EW
Sbjct: 687 MFVDYLVAT-DPGDYDQMAFQKTLIEWEKEW 716
>gi|238506383|ref|XP_002384393.1| alpha-N-acetylglucosaminidase, putative [Aspergillus flavus
NRRL3357]
gi|220689106|gb|EED45457.1| alpha-N-acetylglucosaminidase, putative [Aspergillus flavus
NRRL3357]
Length = 669
Score = 248 bits (634), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 151/496 (30%), Positives = 255/496 (51%), Gaps = 51/496 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF G VP A++ V P A + W + ++ L D F ++ ++ I +
Sbjct: 148 VLPAFPGFVPPAIKRVRPHATVVNGSQWSGFQK--KFTEVSFLSPLDRTFADLQKSVISR 205
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q++ +G +H+Y D F+E P Y+ +L + +++ + AVW+MQGWLF YD
Sbjct: 206 QMRAFGNITHVYALDQFNEINPASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YD 264
Query: 128 P--FWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
FW ++ A L+ V +++LDL++E KP W ++ ++G P+IWC LH+F GN+
Sbjct: 265 KKDFWDSNRISAYLSGVERNDDMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMG 324
Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
MYG + +I P+EA +++ ++VG G++MEG E N +VYDL+ + A+ +D +A+
Sbjct: 325 MYGQIMNITSDPIEA-LNKSDSLVGFGLTMEGQEGNEIVYDLLLDQAWSATPIDTRAYFQ 383
Query: 245 QYSVRRYG--RSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
+ RY SVP + AW++L TVYN T+ T V + ++ P I + G+
Sbjct: 384 SWVRSRYSGNLSVPNELYTAWDLLRKTVYNNTNLTT---YSVTKSIFEISPDIAGLV-GR 439
Query: 302 YQNYGKPVS--------KEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 353
+Y P S E + ++ P LW++ +
Sbjct: 440 VGHYPTPTSINYDPMVLNEVLSLFMNATRKEPSLWHNPA--------------------- 478
Query: 354 YRYDLIDLTRQALAKYANELFLNIIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACH 410
Y YD++D+TRQ + ++ +I +++ N V S R L L+ +D +L+C+
Sbjct: 479 YEYDMVDITRQLMGNAFVNVYSVLITSWKSETENRTTKVTSHSERLLNLLSAIDKVLSCN 538
Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
+ F L W+ SA+ E + +E+NAR QIT+W + + DY +K W+GL+
Sbjct: 539 ENFSLATWISSARDWGNTTETKDFFEYNARNQITLWGPTGE-----ISDYASKAWAGLIS 593
Query: 471 DYYGPRAAIYFKYMIE 486
YY PR +I+ Y+ E
Sbjct: 594 SYYKPRWSIFVDYLGE 609
>gi|393782608|ref|ZP_10370791.1| hypothetical protein HMPREF1071_01659 [Bacteroides salyersiae
CL02T12C01]
gi|392672835|gb|EIY66301.1| hypothetical protein HMPREF1071_01659 [Bacteroides salyersiae
CL02T12C01]
Length = 761
Score = 248 bits (634), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 156/485 (32%), Positives = 235/485 (48%), Gaps = 51/485 (10%)
Query: 11 AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK 70
FSG VP L++ +P+A I Q +W K + LD TD LF +GR F+E+Q +
Sbjct: 239 GFSGYVPRELKDKYPTANINQQRSWCGFKGAAQ------LDPTDSLFTRMGRVFLEEQAR 292
Query: 71 EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW 130
+G +Y D F E+ PPVD+PEY+ ++G I+ + D + W MQ W
Sbjct: 293 LFG-AHGVYAADPFHESVPPVDTPEYLKAVGETIHRLFREFDPQSTWAMQSWSLR----- 346
Query: 131 RPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSK-QFYGVPYIWCMLHNFAGNIEMYGIL 189
+A++ +VP L++LDL STSK +F+G P + LHNF G I M+G L
Sbjct: 347 -----EAIVKAVPKEALLILDLRGS-----STSKAEFWGYPTVVGNLHNFGGRINMHGDL 396
Query: 190 DSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVR 249
+A N + G G+ ME IEQNPV Y+L EM + +D++AW+ QY+ R
Sbjct: 397 ALLASNQYSKAKRLNPAVCGSGLFMEAIEQNPVYYELAFEMPCHPDSIDLRAWLKQYATR 456
Query: 250 RYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV 309
RYG PA Q AW +L Y T+K+ ++ A P +D V
Sbjct: 457 RYGAFSPATQKAWMLLLEGPYRQGTNGTEKS-SIVAARPALD-----------------V 498
Query: 310 SKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 369
K YD P L +IRA L + ++LSAS YR+DL+D+ RQ +
Sbjct: 499 KKSGPNAGLEIPYD-PAL------IIRAQSLLLEDADKLSASRPYRFDLVDVQRQMMTNL 551
Query: 370 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 429
+ EA++ D S RFL ++ DMD LL + WL A+ + E
Sbjct: 552 GQLIHRKAAEAFRSKDREAFTLHSGRFLGMLADMDTLLRTRSEYSFDRWLTEARSWGETE 611
Query: 430 EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 489
E++ Q E +A + +T+W + + DY + W+GL+ YY PR ++ + + L+
Sbjct: 612 EEKNQMERDATSLVTIWGADGDPR---IFDYSWREWAGLINGYYLPRWQKFYTMLQQHLD 668
Query: 490 SGDGF 494
G +
Sbjct: 669 EGTSY 673
>gi|154489986|ref|ZP_02030247.1| hypothetical protein PARMER_00215 [Parabacteroides merdae ATCC
43184]
gi|423722990|ref|ZP_17697143.1| hypothetical protein HMPREF1078_01203 [Parabacteroides merdae
CL09T00C40]
gi|154089428|gb|EDN88472.1| Alpha-N-acetylglucosaminidase (NAGLU) [Parabacteroides merdae ATCC
43184]
gi|409241820|gb|EKN34587.1| hypothetical protein HMPREF1078_01203 [Parabacteroides merdae
CL09T00C40]
Length = 718
Score = 248 bits (632), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 168/543 (30%), Positives = 265/543 (48%), Gaps = 71/543 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V P +SG VP + ++ G W + L TDP F EI + ++
Sbjct: 224 VFPGYSGMVPHNAKEKL-GLNVSDPGLWNGYRRPA------FLQPTDPRFEEIASLYYKE 276
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
K YG+ + Y+ D F E + VD + + G AI M+ + AVW+ Q W +
Sbjct: 277 MNKLYGKADY-YSMDPFHEGGSVAGVD----LDAAGKAIMQAMKKNNPKAVWVAQAWQAN 331
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST------SKQFYGV-PYIWCMLHN 178
P ++ ++ G L+VLDLFAE +P W K +G +I+CML N
Sbjct: 332 PRP--------QMIGNLEAGDLIVLDLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLN 383
Query: 179 FAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 237
+ GN+ ++G L + +A+ S T+ GVGM+MEG E NPV+++L++E+ + ++
Sbjct: 384 YGGNVGLHGKLKHVIDEFYKAKESPFGKTLKGVGMTMEGSENNPVMFELLTELPWCPQRF 443
Query: 238 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSII 295
D W+ +Y+V RYG+S P +QDAW +L +++YNC D T + + V A P
Sbjct: 444 DKDQWLREYTVARYGKSNPTVQDAWILLSNSIYNCPDANTQQGTHESVFCARP------- 496
Query: 296 SVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 355
TE YQ SS+ +Y ++VIRA + ++ +E +N +
Sbjct: 497 --TEHPYQ---------------VSSWSEMKDYYDPNDVIRAAAMMVSVADEFKGNNNFE 539
Query: 356 YDLIDLTRQALAKYANELFLNIIEA-YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 414
YDL+D+ RQA+A+ L ++EA + D S RFL L+ D LLA F
Sbjct: 540 YDLVDIVRQAIAE-KGRLTEKVVEAAFAAGDKKLYKDASDRFLRLILLQDELLATRPEFK 598
Query: 415 LGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 474
+G W+ A+ L E+++ YEWNAR QIT W + + LRDY ++ W+G+L+D+Y
Sbjct: 599 VGTWIARARSLGGTPEEKELYEWNARVQITTWGNRLAADEGGLRDYAHREWNGILKDFYY 658
Query: 475 PRAAIYFKYMIESLESGDGFRLKDWRR----EWIKLTNDWQNGRNVYPVESNGDALITSQ 530
R +F Y RL D R+ ++ + W NVY E GD + T +
Sbjct: 659 MRWKTWFDYQT---------RLLDGRKTAAIDFYAIEERWTKATNVYSSEPEGDCISTVK 709
Query: 531 WLY 533
++
Sbjct: 710 RIF 712
>gi|403416059|emb|CCM02759.1| predicted protein [Fibroporia radiculosa]
Length = 705
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 156/542 (28%), Positives = 276/542 (50%), Gaps = 42/542 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP A+ ++P+A I W S + T L+ DPLF + ++FI +
Sbjct: 194 VLPAFTGFVPRAMSTLYPNASIVNGSQWEGFPSTLTY--TTFLEPFDPLFTTMQKSFISK 251
Query: 68 QLKEYG-RTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-S 125
Q YG SH+Y D ++EN P Y++++ A ++ +Q+ D +AVW+MQGWLF +
Sbjct: 252 QQAAYGANVSHVYTLDQYNENDPYSGDVGYLANISAGTFASLQAADPEAVWMMQGWLFFA 311
Query: 126 YDPFWRPPQMKALLNSVPLG-KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
+ FW ++ A L +VP +++LDL++E P W + +YG +IWC LH+F GN+
Sbjct: 312 SEAFWTTERIAAFLGAVPSNDSMIILDLYSEAAPQWQRTDSYYGKQWIWCELHDFGGNMG 371
Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
G L + GP++A S ++M G+G++ EG E N +VYD++ + A+ +D+ +++
Sbjct: 372 FEGNLPELVTGPIQAL-SNASSMRGMGLTPEGQEGNEIVYDILLDQAWSSTSIDIASYVE 430
Query: 245 QYSVRRYG-RSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
+ RRY + +P A Q+AW +L TVY+ +D P+ +I S+ E
Sbjct: 431 AWVARRYTVQDLPSAAQEAWTILSTTVYSNSD-------------PNTQATIKSIFE--- 474
Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTS-EVIRALELFIASGNE---LSASNTYRYDL 358
++ + ++ + + + Y T+ ++ AL+ + + E L + + YD+
Sbjct: 475 ------LAPDLSGLTDRTGHHCTEIPYDTNITIVPALQNLVQAATENPLLLSVPEFMYDV 528
Query: 359 IDLTRQALAKYANELFLNIIEAYQLN--DAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
+D+TRQ LA +++ ++ + A V + L ++ D+D LL +D FLL
Sbjct: 529 VDVTRQLLANRFIDVYNELVSTFYSTGVTAASVKNAGQPLLTILSDVDTLLWTNDNFLLS 588
Query: 417 PWLESAKQLAQNEEQEKQY-EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
W+ A L+ N Y E+NAR QIT+W + + + DY +K W+G + YY
Sbjct: 589 NWILGAINLSDNNGTYADYLEYNARNQITLWGPDGE-----INDYASKQWAGFVGTYYYD 643
Query: 476 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 535
R ++ Y+ + ++G + + + +W +GD + L K
Sbjct: 644 RWNMFITYLEDITQNGTAYNDTAIQTVMLNFGKEWDTQTYSLSATVSGDTMSIVDSLIQK 703
Query: 536 YL 537
+L
Sbjct: 704 WL 705
>gi|224025137|ref|ZP_03643503.1| hypothetical protein BACCOPRO_01871 [Bacteroides coprophilus DSM
18228]
gi|224018373|gb|EEF76371.1| hypothetical protein BACCOPRO_01871 [Bacteroides coprophilus DSM
18228]
Length = 718
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 167/532 (31%), Positives = 265/532 (49%), Gaps = 63/532 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP +SG VP ++ + G W PR L TDP F I + +
Sbjct: 224 VLPGYSGMVPHNAKDRL-GLNVADPGRW---NGYPR---PAFLQPTDPQFERIAALYYRE 276
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
+ YG+ S+ Y+ D F E NT VD + + G AI+ M+ + A W++Q W
Sbjct: 277 MTRLYGKVSY-YSMDPFHEGGNTSGVD----LEAAGKAIWKAMKQANPRAAWVVQAW--G 329
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST------SKQFYGV-PYIWCMLHN 178
+P PQM + ++P G +VVLDLF+E +P W K+ +G +++CML N
Sbjct: 330 ANP---RPQM---IRNLPAGDMVVLDLFSESRPQWGDPASSWYRKEGFGQHDWLFCMLLN 383
Query: 179 FAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 237
+ GN+ ++G + + +A+ S T+ GVGM+MEGIE NPV+Y+L+ E+ ++ ++
Sbjct: 384 YGGNVGLHGKMAHLIEEFYKAKDSSFGKTLKGVGMTMEGIENNPVMYELLCELPWREQRF 443
Query: 238 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSII 295
W+ Y RYG+S + AW +L +T+YNC +T + + ++ A P +
Sbjct: 444 SKDEWLEGYLKARYGKSDSQVSQAWMLLSNTIYNCPAASTQQGTHESILCARPSWKAYQV 503
Query: 296 SVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 355
S SE S Y Y ++VIRA + + + +N +
Sbjct: 504 SSW------------------SEMSDY------YDPADVIRAAGMMVDAAERFRGNNNFE 539
Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSR-RFLELVEDMDGLLACHDGFL 414
YDL+D+ RQA+A+ ++ +++AY+ D +F+LS RFL L+ D LLA F
Sbjct: 540 YDLVDIVRQAVAEKGRLMYRVLVDAYKAGDRE-LFKLSSDRFLRLILMQDRLLATRSEFK 598
Query: 415 LGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 474
+G WLESA+ L EE++ YEWNAR QIT W + + L DY ++ W+GLLRD+Y
Sbjct: 599 VGRWLESARNLGSTEEEKDWYEWNARVQITTWGNRVAADDGGLHDYAHREWNGLLRDFYY 658
Query: 475 PRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL 526
R + ++S E G + ++ L W N Y E+ G+ +
Sbjct: 659 LRWKTWLDEQLKSFEGGQPKAI-----DFYALEEPWTLKHNSYASEAEGNPV 705
>gi|355706271|gb|AES02588.1| N-acetylglucosaminidase [Mustela putorius furo]
Length = 333
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 146/371 (39%), Positives = 210/371 (56%), Gaps = 50/371 (13%)
Query: 112 DSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVP 170
D DAVWL+QGWLF + P FW P Q++A+L +VP G+L++LDLFAE +P++ + F+G P
Sbjct: 3 DPDAVWLLQGWLFQHQPQFWGPAQVRAVLGAVPRGRLLILDLFAESQPVYLRTASFHGQP 62
Query: 171 YIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEM 230
+IWCMLHNF GN ++G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+
Sbjct: 63 FIWCMLHNFGGNHGLFGALEAVNQGPAAARLFPNSTMVGTGMAPEGIGQNEVVYALMAEL 122
Query: 231 AFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFP 288
++ + V D++AW+ ++ RRYG + AW +L +VYNC+ + T NR +V
Sbjct: 123 GWRKDPVADLEAWVTSFAARRYGVDSKETEVAWRLLLGSVYNCSGEACTGHNRSPLVR-- 180
Query: 289 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 348
PS+ VT +WY+ S V A L +A+ L
Sbjct: 181 --RPSLQMVTT---------------------------VWYNRSAVFEAWRLLLAAAPTL 211
Query: 349 SASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVED 402
+ S T+RYDL+D+TRQA + + + AY L A G+ EL+
Sbjct: 212 AKSPTFRYDLLDVTRQAAQELVSLYYTEARTAYLNKELVPLMRAAGIL-----VYELLPA 266
Query: 403 MDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGN 462
+DG+LA FLLG WLE A+ +A +E + YE N R Q+T+W E ++L DY N
Sbjct: 267 LDGVLASDSRFLLGTWLEQARAVAVSETDARFYEQNGRYQLTLW----GPEGNIL-DYAN 321
Query: 463 KYWSGLLRDYY 473
K +GL+ YY
Sbjct: 322 KQLAGLVAGYY 332
>gi|329963073|ref|ZP_08300853.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
gi|328529114|gb|EGF56044.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
Length = 717
Score = 245 bits (625), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 167/546 (30%), Positives = 270/546 (49%), Gaps = 73/546 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
VLP +SG VP + +LG +V SDP C Y L TDP F EI
Sbjct: 224 VLPGYSGMVPHNAKE--------RLG--LNV-SDPGLWCGYPRPAFLQPTDPRFGEIADL 272
Query: 64 FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ ++ + YG+ + Y+ D F E + VD +++ G AI+ M+ + AVW+ Q
Sbjct: 273 YYKEMTRLYGK-ADFYSMDPFHEGGSIAGVD----LNAAGQAIWGAMKKVNPKAVWVAQA 327
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST------SKQFYGV-PYIWC 174
W + P+ K ++ ++P G L+VLDLF+E +P W K+ +G +++C
Sbjct: 328 WQAN-------PRQK-MIENIPQGDLIVLDLFSESRPQWGDPASTWYRKEGFGKHDWLYC 379
Query: 175 MLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQ 233
ML N+ GN+ ++G + + +A+TS T+ GVGM+MEG E N V+++L+ E+ ++
Sbjct: 380 MLLNYGGNVGLHGKMRHVIDEFYKAKTSPFGKTLKGVGMTMEGSENNSVMFELLCELPWR 439
Query: 234 HEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVD 291
+ + W+ Y+ RYG++ +Q AW +L +++YNC D T + + V A P +D
Sbjct: 440 PAQFEKDEWLKNYTAARYGKADATVQQAWLLLSNSIYNCPDANTQQGTHESVFCARPGMD 499
Query: 292 PSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS 351
YQ SS+ +Y EVIRA + +++ + +
Sbjct: 500 ---------VYQ---------------VSSWSEMVKYYEPEEVIRAAGILLSAADRFKGN 535
Query: 352 NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHD 411
N + YDL+D+ RQA+A+ ++ +I+A + + S+RFL L+ D LLA
Sbjct: 536 NNFEYDLVDIVRQAVAEKGRLVYPIMIDALKAGEKELFAAASQRFLNLILLQDRLLATRP 595
Query: 412 GFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRD 471
F +G W+E A+ L +E++K YEWNAR QI W + T + LRDY +K W+G+LRD
Sbjct: 596 EFKVGTWIEKARNLGTTQEEKKLYEWNARVQIATWGNRTAADEGGLRDYAHKEWNGMLRD 655
Query: 472 YYGPRAAIYFKYMIESLES--GDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITS 529
+Y R ++ L GF W TND YP GD + +
Sbjct: 656 FYYHRWKLWIDAQTAQLNGAPAQGFDFYAIEEPWTLQTND-------YPSHPEGDVIEVA 708
Query: 530 QWLYNK 535
+ Y +
Sbjct: 709 RTAYKE 714
>gi|218258436|ref|ZP_03474815.1| hypothetical protein PRABACTJOHN_00470 [Parabacteroides johnsonii
DSM 18315]
gi|423342591|ref|ZP_17320305.1| hypothetical protein HMPREF1077_01735 [Parabacteroides johnsonii
CL02T12C29]
gi|218225494|gb|EEC98144.1| hypothetical protein PRABACTJOHN_00470 [Parabacteroides johnsonii
DSM 18315]
gi|409217508|gb|EKN10484.1| hypothetical protein HMPREF1077_01735 [Parabacteroides johnsonii
CL02T12C29]
Length = 718
Score = 244 bits (624), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 164/539 (30%), Positives = 262/539 (48%), Gaps = 63/539 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V P +SG VP + ++ G W + L TDP F EI + ++
Sbjct: 224 VFPGYSGMVPHNAKEKL-GLNVSDPGLWNGYRRPA------FLQPTDPRFEEIASLYYKE 276
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
K YG+ ++ Y+ D F E + VD + + G AI M+ + AVW+ Q W +
Sbjct: 277 MNKLYGKANY-YSMDPFHEGGSVAGVD----LDAAGKAIMQAMKKNNPKAVWVAQAWQAN 331
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST------SKQFYGV-PYIWCMLHN 178
P ++ ++ G L+ LDLFAE +P W K +G +I+CML N
Sbjct: 332 PRP--------QMIGNLEAGDLIALDLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLN 383
Query: 179 FAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 237
+ GNI ++G + + +A+ S TT+ GVGM+MEG E NPV+++L++E+ ++ ++
Sbjct: 384 YGGNIGLHGKMKHVIDEFYKAKESPFGTTLKGVGMTMEGSENNPVMFELLTELPWRPQRF 443
Query: 238 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSII 295
D W+ Y+V RYG+S P +QDAW +L +++YNC D T + + V A P
Sbjct: 444 DKDQWLKAYTVARYGKSNPVVQDAWILLSNSIYNCPDANTQQGTHESVFCARP------- 496
Query: 296 SVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 355
TE YQ SS+ +Y ++VIRA + ++ ++ +N +
Sbjct: 497 --TEHPYQ---------------VSSWSEMKDYYDPNDVIRAAAMMVSVSDQFKGNNNFE 539
Query: 356 YDLIDLTRQALAKYANELFLNIIEA-YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 414
YDL+D+ RQA+A+ L ++EA + D S RFL L+ D LLA F
Sbjct: 540 YDLVDIVRQAIAE-KGRLTEKVVEAAFAAGDKKLYKDASDRFLRLILLQDELLATRPEFK 598
Query: 415 LGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 474
+G W+ A+ L E++ YEWNAR QIT W + + LRDY ++ W+G+L+D+Y
Sbjct: 599 VGTWIARARSLGNTSEEKDLYEWNARVQITTWGNRLAADEGGLRDYAHREWNGILKDFYY 658
Query: 475 PRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLY 533
R +F Y L+ + ++ + W N Y E GD + T Q ++
Sbjct: 659 MRWKTWFDYQTRLLDGKKTAAI-----DFYAIEEPWTKQTNPYSNEPEGDCIPTVQRIF 712
>gi|345511813|ref|ZP_08791352.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
gi|229443748|gb|EEO49539.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
Length = 720
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 156/530 (29%), Positives = 263/530 (49%), Gaps = 39/530 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF+G+VPA L+ ++P AKI + W R ++ +D D L+ I R F+E+
Sbjct: 220 ILPAFAGHVPAELKELYPEAKIYTMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEE 276
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY- 126
Q K YG T+HIY D F+E P + E++S++ IY +Q DS A WL W+F +
Sbjct: 277 QTKVYG-TNHIYGIDPFNEVDSPNWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHA 335
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P++K+ LN+VP KL++LD + + IW ++Q+YG PYIWC L NF GN +
Sbjct: 336 KEKWTQPRIKSFLNAVPQDKLILLDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLA 395
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G L+ + F + G+G+++EG++ NP++Y+ + E A+Q+ + V WI +
Sbjct: 396 GDLNDVDFKIDRLFKEGGDNVYGLGVTLEGLDVNPLMYEFVFERAWQN-SMPVHQWIANW 454
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G I AW LY +Y T A ++ A P ++ T Y
Sbjct: 455 AQCRGGNVDNHIVKAWKQLYEKIY--TSAALCGQAVLMNARPQLEGVEGWNTLPGY---- 508
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
Y + LW E+++A ++ + Y +D+I++ RQ L
Sbjct: 509 --------------DYKNIDLWEIWKELLKAEGVY---------HSEYHFDVINVGRQVL 545
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
+ + Y+ D G +R +L+ D+D LL C F +G W++ A+ A
Sbjct: 546 GNLFADYRDKFTDCYRKKDLEGTKVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFA 605
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
NE+++K YE NAR +T+W ++ + L DY N+ W GL R +Y R + + +I
Sbjct: 606 VNEQEQKYYEENARCILTVW----GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIA 661
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ F + + ++ + +W +P+ S + + ++ L KY
Sbjct: 662 AMTRHKNFDEEKFHQDITQFEYEWTLKNEDFPIISEENPISLAKELILKY 711
>gi|336374066|gb|EGO02404.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
lacrymans S7.3]
Length = 761
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 152/499 (30%), Positives = 264/499 (52%), Gaps = 47/499 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNW--FSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
VLP+F+G VP A+ ++P+A I W F+++ + L+ DPLF + +FI
Sbjct: 245 VLPSFTGFVPRAMHTLYPNASIVNGSQWNGFTIQ----YTNDSFLEPFDPLFSTLQTSFI 300
Query: 66 EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
+Q+ YG SH+Y D ++EN+P Y++++ AA ++ +++ D AVWLMQGWLF
Sbjct: 301 SKQVAAYGNVSHVYTLDQYNENSPYSGDTSYLANVTAATFASLRAADPQAVWLMQGWLFY 360
Query: 126 YDP-FWRPPQMKALLNSVPLG-KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
D FW +++A L VP +++LDL++E +P W ++G +IWC LH++ GN+
Sbjct: 361 SDSTFWTTERVEAYLGGVPGNDSMIILDLYSEAQPQWQRLNSYFGKQWIWCELHDYGGNM 420
Query: 184 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 243
G +++ P++A + +MVG+G++MEG E N ++YD++ + A+ ++ A+I
Sbjct: 421 GFEGNFENVTTQPIKALATPGNSMVGMGLTMEGQEGNEIIYDVLLDQAWSSTPLNRTAYI 480
Query: 244 NQYSVRRYGRSVP----AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
+ ++ RRY +VP A +AW +L TVYN D T I+ ++ PSI
Sbjct: 481 SAWASRRY--NVPDLPTAALEAWEILGATVYNNQDVTTQSTVKSIL---ELSPSITG--- 532
Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA-SNT--YRY 356
L + T ++ + + + ++ AL+L + + E SA SN ++Y
Sbjct: 533 ---------------LVNRTGTHSTKLFYDTNTTIVPALKLLLQARQEASALSNIPEFQY 577
Query: 357 DLIDLTRQALAKYANELFLNIIEAY--QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 414
D++D+TRQ LA +L+ ++I+ + + + V L L++D+D +L FL
Sbjct: 578 DVVDVTRQLLANRFIDLYTSLIDTFSSTSSSSSAVSAAGAPLLALLQDLDSVLLTDTHFL 637
Query: 415 LGPWLESAKQLAQ--NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
L W+ +A+ N E+NAR Q+T+W + + DY +K W GL+ Y
Sbjct: 638 LARWISAARNWTHGDNATYAAYLEYNARNQVTLWGPRGE-----VNDYASKQWGGLVGTY 692
Query: 473 YGPRAAIYFKYMIESLESG 491
Y R + Y+ S E+
Sbjct: 693 YVQRWETFVGYLAGSKENA 711
>gi|374385779|ref|ZP_09643282.1| hypothetical protein HMPREF9449_01668 [Odoribacter laneus YIT
12061]
gi|373225481|gb|EHP47815.1| hypothetical protein HMPREF9449_01668 [Odoribacter laneus YIT
12061]
Length = 715
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 159/510 (31%), Positives = 242/510 (47%), Gaps = 58/510 (11%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIE 59
+ ++ + P F G + P +LG VK WC L +P F
Sbjct: 215 MREYGIEPVFPGYA-----GMLPHNASEKLG--IEVKDPGLWCGYQRPAFLYPENPAFKR 267
Query: 60 IGRAFIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVW 117
I + + K +G+ + Y D F E N +D +++ ++ M++ + +AVW
Sbjct: 268 IAGLYYMEMEKRFGK-AKFYGMDPFHEGGNVQGID----LAAAAQSVLQAMKTANPEAVW 322
Query: 118 LMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS-------TSKQFYGVP 170
+MQ W + P+ + ++ ++ G +++LDL +E +P+W K F G
Sbjct: 323 VMQAWQAN-------PRHE-MITALQPGNVLILDLSSENRPMWGDKESVWYREKGFEGQD 374
Query: 171 YIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSE 229
+++CML NF GN+ MYG +D + G A N ++ GVG +MEGIE NPV+Y+L+ E
Sbjct: 375 WLYCMLLNFGGNVGMYGRMDRVINGFYAAVQHPNGASLRGVGKTMEGIENNPVMYELLLE 434
Query: 230 MAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 289
+ ++ + W+ Y RYG+ P +Q AW +L YNC V A P
Sbjct: 435 LPWRKIPFTKEEWLKGYVKARYGKDDPRLQQAWQILGKAAYNCPVVQEGTTESVFCARP- 493
Query: 290 VDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELS 349
A S SS+ L+Y+ E + LF+ +
Sbjct: 494 -----------------------AEEISGASSWGTSELYYAPEESKKVAALFLEVSEQYK 530
Query: 350 ASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLAC 409
+N + YDL D+ RQALA N L I EAY+L D LSR FL+L+ D LLA
Sbjct: 531 GNNNFEYDLTDIMRQALADKGNVLQKKITEAYRLKDETAFRNLSREFLQLILWQDTLLAT 590
Query: 410 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 469
F LG WLE AK + EE+++ YEWNAR QIT W + + LRDY ++ W+GLL
Sbjct: 591 RPEFRLGTWLERAKAKGETEEEKRLYEWNARVQITTWGNRQAADKGGLRDYSHREWAGLL 650
Query: 470 RDYYGPRAAIYFKYMIESLESGDGFRLKDW 499
+D+Y PR YF ++E +G+ DW
Sbjct: 651 KDFYYPRWKAYFD-LLEKRLAGEETEDIDW 679
>gi|336386984|gb|EGO28130.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 738
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 152/499 (30%), Positives = 264/499 (52%), Gaps = 47/499 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNW--FSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
VLP+F+G VP A+ ++P+A I W F+++ + L+ DPLF + +FI
Sbjct: 222 VLPSFTGFVPRAMHTLYPNASIVNGSQWNGFTIQ----YTNDSFLEPFDPLFSTLQTSFI 277
Query: 66 EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
+Q+ YG SH+Y D ++EN+P Y++++ AA ++ +++ D AVWLMQGWLF
Sbjct: 278 SKQVAAYGNVSHVYTLDQYNENSPYSGDTSYLANVTAATFASLRAADPQAVWLMQGWLFY 337
Query: 126 YDP-FWRPPQMKALLNSVPLG-KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
D FW +++A L VP +++LDL++E +P W ++G +IWC LH++ GN+
Sbjct: 338 SDSTFWTTERVEAYLGGVPGNDSMIILDLYSEAQPQWQRLNSYFGKQWIWCELHDYGGNM 397
Query: 184 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 243
G +++ P++A + +MVG+G++MEG E N ++YD++ + A+ ++ A+I
Sbjct: 398 GFEGNFENVTTQPIKALATPGNSMVGMGLTMEGQEGNEIIYDVLLDQAWSSTPLNRTAYI 457
Query: 244 NQYSVRRYGRSVP----AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
+ ++ RRY +VP A +AW +L TVYN D T I+ ++ PSI
Sbjct: 458 SAWASRRY--NVPDLPTAALEAWEILGATVYNNQDVTTQSTVKSIL---ELSPSITG--- 509
Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA-SNT--YRY 356
L + T ++ + + + ++ AL+L + + E SA SN ++Y
Sbjct: 510 ---------------LVNRTGTHSTKLFYDTNTTIVPALKLLLQARQEASALSNIPEFQY 554
Query: 357 DLIDLTRQALAKYANELFLNIIEAY--QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 414
D++D+TRQ LA +L+ ++I+ + + + V L L++D+D +L FL
Sbjct: 555 DVVDVTRQLLANRFIDLYTSLIDTFSSTSSSSSAVSAAGAPLLALLQDLDSVLLTDTHFL 614
Query: 415 LGPWLESAKQLAQ--NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
L W+ +A+ N E+NAR Q+T+W + + DY +K W GL+ Y
Sbjct: 615 LARWISAARNWTHGDNATYAAYLEYNARNQVTLWGPRGE-----VNDYASKQWGGLVGTY 669
Query: 473 YGPRAAIYFKYMIESLESG 491
Y R + Y+ S E+
Sbjct: 670 YVQRWETFVGYLAGSKENA 688
>gi|294807833|ref|ZP_06766618.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
gi|294444952|gb|EFG13634.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
Length = 703
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 155/530 (29%), Positives = 262/530 (49%), Gaps = 39/530 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF+G+VPA L+ ++P AKI + W R ++ +D D L+ I R F+E+
Sbjct: 203 ILPAFAGHVPAELKELYPEAKIYTMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEE 259
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY- 126
Q K YG T+HIY D F+E P + E++S++ IY +Q DS A WL W+F +
Sbjct: 260 QTKVYG-TNHIYGIDPFNEVDSPNWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHA 318
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P++K+ LN+VP KL++LD + + IW ++Q+YG PYIWC L NF GN +
Sbjct: 319 KEKWTQPRIKSFLNAVPQDKLILLDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLA 378
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G L+ + F + G+G+++EG++ NP++Y+ + E A+Q+ + V WI +
Sbjct: 379 GDLNDVDFKIDRLFKEGGDNVYGLGVTLEGLDVNPLMYEFVFERAWQN-SMPVHQWIANW 437
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G I AW LY +Y T A ++ A P ++
Sbjct: 438 AQCRGGNVDNHIVKAWKQLYEKIY--TSAALCGQAVLMNARPQLE--------------- 480
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
E Y + LW E+++A ++ + Y +D+I++ RQ L
Sbjct: 481 ---GVEGWNTLPGYDYKNIDLWEIWKELLKAEGVY---------HSEYHFDVINVGRQVL 528
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
+ + Y+ D G +R +L+ D+D LL C F +G W++ A+ A
Sbjct: 529 GNLFADYRDKFTDCYRKKDLEGTKVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFA 588
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
NE+++K YE NAR +T+W ++ + L DY N+ W GL R +Y R + + +I
Sbjct: 589 VNEQEQKYYEENARCILTVW----GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIA 644
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ F + + ++ + +W +P+ S + + ++ L KY
Sbjct: 645 AMTRHKNFDEEKFHQDITQFEYEWTLKNEDFPIISEENPISLAKELILKY 694
>gi|262407713|ref|ZP_06084261.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|262354521|gb|EEZ03613.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
Length = 735
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 155/530 (29%), Positives = 262/530 (49%), Gaps = 39/530 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF+G+VPA L+ ++P AKI + W R ++ +D D L+ I R F+E+
Sbjct: 235 ILPAFAGHVPAELKELYPEAKIYTMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEE 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY- 126
Q K YG T+HIY D F+E P + E++S++ IY +Q DS A WL W+F +
Sbjct: 292 QTKVYG-TNHIYGIDPFNEVDSPNWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHA 350
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P++K+ LN+VP KL++LD + + IW ++Q+YG PYIWC L NF GN +
Sbjct: 351 KEKWTQPRIKSFLNAVPQDKLILLDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLA 410
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G L+ + F + G+G+++EG++ NP++Y+ + E A+Q+ + V WI +
Sbjct: 411 GDLNDVDFKIDRLFKEGGDNVYGLGVTLEGLDVNPLMYEFVFERAWQN-SMPVHQWIANW 469
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G I AW LY +Y T A ++ A P ++
Sbjct: 470 AQCRGGNVDNHIVKAWKQLYEKIY--TSAALCGQAVLMNARPQLE--------------- 512
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
E Y + LW E+++A ++ + Y +D+I++ RQ L
Sbjct: 513 ---GVEGWNTLPGYDYKNIDLWEIWKELLKAEGVY---------HSEYHFDVINVGRQVL 560
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
+ + Y+ D G +R +L+ D+D LL C F +G W++ A+ A
Sbjct: 561 GNLFADYRDKFTDCYRKKDLEGTKVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFA 620
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
NE+++K YE NAR +T+W ++ + L DY N+ W GL R +Y R + + +I
Sbjct: 621 VNEQEQKYYEENARCILTVW----GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIA 676
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ F + + ++ + +W +P+ S + + ++ L KY
Sbjct: 677 AMTRHKNFDEEKFHQDITQFEYEWTLKNEDFPIISEENPISLAKELILKY 726
>gi|294647264|ref|ZP_06724861.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
gi|292637401|gb|EFF55822.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
Length = 733
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 155/530 (29%), Positives = 262/530 (49%), Gaps = 39/530 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF+G+VPA L+ ++P AKI + W R ++ +D D L+ I R F+E+
Sbjct: 233 ILPAFAGHVPAELKELYPEAKIYTMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY- 126
Q K YG T+HIY D F+E P + E++S++ IY +Q DS A WL W+F +
Sbjct: 290 QTKVYG-TNHIYGIDPFNEVDSPNWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHA 348
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P++K+ LN+VP KL++LD + + IW ++Q+YG PYIWC L NF GN +
Sbjct: 349 KEKWTQPRIKSFLNAVPQDKLILLDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLA 408
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G L+ + F + G+G+++EG++ NP++Y+ + E A+Q+ + V WI +
Sbjct: 409 GDLNDVDFKIDRLFKEGGDNVYGLGVTLEGLDVNPLMYEFVFERAWQN-SMPVHQWIANW 467
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G I AW LY +Y T A ++ A P ++
Sbjct: 468 AQCRGGNVDNHIVKAWKQLYEKIY--TSAALCGQAVLMNARPQLE--------------- 510
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
E Y + LW E+++A ++ + Y +D+I++ RQ L
Sbjct: 511 ---GVEGWNTLPGYDYKNIDLWEIWKELLKAEGVY---------HSEYHFDVINVGRQVL 558
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
+ + Y+ D G +R +L+ D+D LL C F +G W++ A+ A
Sbjct: 559 GNLFADYRDKFTDCYRKKDLEGTKVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFA 618
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
NE+++K YE NAR +T+W ++ + L DY N+ W GL R +Y R + + +I
Sbjct: 619 VNEQEQKYYEENARCILTVW----GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIA 674
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ F + + ++ + +W +P+ S + + ++ L KY
Sbjct: 675 AMTRHKNFDEEKFHQDITQFEYEWTLKNEDFPIISEENPISLAKELILKY 724
>gi|29348998|ref|NP_812501.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340905|gb|AAO78695.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
thetaiotaomicron VPI-5482]
Length = 732
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 155/530 (29%), Positives = 259/530 (48%), Gaps = 39/530 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPA L+ ++P+AKI Q+ W R ++ +D D L+ I R F+E+
Sbjct: 233 VLPAFAGHVPAELKTIYPNAKIYQMSQWGGFDEKYR---SHFIDPMDSLYSIIQRRFLEE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K YG T HIY D F+E P S ++++++ + IY + DS A WL W+F YD
Sbjct: 290 QTKVYG-TDHIYGIDPFNEVDSPDWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYD 348
Query: 128 PF-WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P++++ L +VP KL++LD + + IW ++++YG PYIWC L NF GN +
Sbjct: 349 KKKWTQPRIRSFLKAVPDNKLILLDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIA 408
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G L+ I F + G+G ++EG + NP++Y+ + + A+ + V WI +
Sbjct: 409 GNLNDIDFKIKRLFKEGGDNVYGLGATLEGFDVNPLMYEFVFDQAWDY-PVTTDQWITNW 467
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
S+ R G I AW L+ +Y T+ AT ++ A P +T K N
Sbjct: 468 SMCRGGNQDANIIKAWRALHQKIY--TEHATCGQSVLMNARP-------RLTGTKSWNTN 518
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
+ Y + LW E+++A + ++ +R+D+I++ RQ L
Sbjct: 519 PGI-----------HYANNDLWQIWKELLKARNI---------NNSDFRFDVINIGRQVL 558
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
+E Y D G+ + S R L+ D+D LL+C +G WL+ A+
Sbjct: 559 GNLFSEYRDQFTACYNRKDTTGMREWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARNCG 618
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
++ YE NAR +T+W ++ + L DY N+ W GL R +Y R + +I
Sbjct: 619 ATVSEKDYYEENARCILTVW----GQQDTQLNDYANRGWGGLTRSFYRERWKRFTDGVIA 674
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ F + ++ + +W ++ +P+ S D + + L KY
Sbjct: 675 AVSEDKPFDEDKFHQDITQFEYNWTLQKDSFPIVSEEDPIQIADSLILKY 724
>gi|340520426|gb|EGR50662.1| glycoside hydrolase family 89 [Trichoderma reesei QM6a]
Length = 747
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 147/484 (30%), Positives = 259/484 (53%), Gaps = 39/484 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF G VP + VFP ++ W + + ++ DP F ++ + FI +
Sbjct: 239 ILPAFPGFVPRNISRVFPDISLSTSPIWSNFGTTL--SADIYINPFDPRFAQLQKLFINK 296
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YG ++ + D F+EN P +Y+ ++ ++ +++ D +AVW+MQ WLFS D
Sbjct: 297 QQELYGNVTNFWTLDQFNENRPLSGDLDYLRNVSHNTWAALKAADPEAVWVMQAWLFSSD 356
Query: 128 P-FWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
FW +++ALL VP+ + +++LDLFAE P W + FYG P+IWC LHN+ GN+ +
Sbjct: 357 SSFWTNDRVEALLGGVPVNQDMLLLDLFAESAPQWQRTDSFYGKPWIWCELHNYGGNMGL 416
Query: 186 YGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
YG ++++ ++A R S+ ++VG G++MEG E N ++YDL+ + A+ + +D + +
Sbjct: 417 YGQIENVTINSMDAVRNSD--SIVGFGLTMEGQEGNEIMYDLLLDQAWSPKPIDTDTYFH 474
Query: 245 QYSVRRYG-RSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
+ RYG ++V + W +L TV+N T+ + + I+ ++ PSI + G+
Sbjct: 475 DWVSARYGAKNVKGLYKGWEMLRPTVFNNTNLTVNAVQKSIL---ELTPSISGLL-GRTG 530
Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN--TYRYDLIDL 361
+G + + + E S ELF A +L+ N +Y+YDL+D
Sbjct: 531 RHGTTIMYDPAVMVEAWS-----------------ELFKAGLQDLTLFNNPSYQYDLVDW 573
Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLE 420
TRQ L + + ++++AY + + V + + + L++ +D +LA + F L PW++
Sbjct: 574 TRQVLVNSFEDHYKDLVDAYNKSSSPTVIRTRGAKLVTLLKTLDAVLATNKNFQLTPWID 633
Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
A+ A + +E+NAR QIT+W Q E DY +K W+GL+ YY R +
Sbjct: 634 RAR--ASSPSSANFFEFNARNQITLWGPQGQIE-----DYASKQWAGLVGTYYAERWQQF 686
Query: 481 FKYM 484
Y+
Sbjct: 687 VDYL 690
>gi|404487206|ref|ZP_11022393.1| hypothetical protein HMPREF9448_02854 [Barnesiella intestinihominis
YIT 11860]
gi|404335702|gb|EJZ62171.1| hypothetical protein HMPREF9448_02854 [Barnesiella intestinihominis
YIT 11860]
Length = 731
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 165/541 (30%), Positives = 257/541 (47%), Gaps = 63/541 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V F G VP +L+ FP A I + G W + + LD DPLF +I + E+
Sbjct: 233 VFQGFYGMVPNSLKEKFPEANIKEQGEWQTYQRPA------FLDPNDPLFDKIADIYYEE 286
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q K +G+ + + D F E + +D + + I M+ +AVW++QGW
Sbjct: 287 QEKLFGKAVY-FAGDPFHEGGQSEGID----VKAAAKKILKAMRRKTPEAVWIIQGWQ-- 339
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQ--FYGVP------YIWCMLH 177
R P M+ LL + G+ ++LDL A +P W K FY +IWC L
Sbjct: 340 -----RNP-MRDLLEGLEHGEAIILDLMACERPQWGGIKNSLFYKAEGHMHHDWIWCALP 393
Query: 178 NFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEK 236
NF G ++G + S A G V A+ + G+G + EGI PVVYD++ +MA++ +
Sbjct: 394 NFGGKTGLHGKMSSYASGVVFAKNHPLGKNLCGIGTAPEGIGTIPVVYDMVYDMAWREDS 453
Query: 237 VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 296
+D+K W+NQY+ RYG++ P AW +L T+Y C + I A P
Sbjct: 454 IDIKDWVNQYTQYRYGKADPNCNRAWEILSKTIYECHNEIGGPVESYICARP-------- 505
Query: 297 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRY 356
+ SS+ ++Y +E++ A E +E + S TY+Y
Sbjct: 506 ----------------SDTIKHASSWGTAEIFYDPAEIVTAWECMYNVRHEFAQSETYQY 549
Query: 357 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
DL+DLTRQ L YA L + A+ ND G S +FL L+ D D LL+ F +G
Sbjct: 550 DLVDLTRQVLGDYAKYLHKQAVNAFYRNDLKGFQTYSSKFLVLIRDEDKLLSTRKEFNVG 609
Query: 417 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
W+ A+ A ++++++ NA+ QIT W ++ S L DY K WSGL+RD Y PR
Sbjct: 610 TWINQARNAACTPQEQERFVANAKRQITTWTNHD----SKLHDYALKEWSGLMRDMYLPR 665
Query: 477 AAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ Y + +L G+ + D + ++ +W + Y S G+A+ + +Y KY
Sbjct: 666 WKAWVDYKL-ALLRGETAQEPD----YFQIEKNWVDSDTRYDSTSTGNAISAVEEIYKKY 720
Query: 537 L 537
Sbjct: 721 F 721
>gi|298386708|ref|ZP_06996263.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 1_1_14]
gi|298260382|gb|EFI03251.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 1_1_14]
Length = 732
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 154/530 (29%), Positives = 259/530 (48%), Gaps = 39/530 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPA L+ ++P+AKI Q+ W R ++ +D D L+ I R F+E+
Sbjct: 233 VLPAFAGHVPAELKTIYPNAKIYQMSQWGGFDEKYR---SHFIDPMDSLYSIIQRRFLEE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K YG T HIY D F+E P S ++++++ + IY + DS A WL W+F YD
Sbjct: 290 QTKVYG-TDHIYGIDPFNEVDSPDWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYD 348
Query: 128 PF-WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P++++ L +VP KL++LD + + IW ++++YG PYIWC L NF GN +
Sbjct: 349 KKKWTQPRIRSFLKAVPDNKLILLDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIA 408
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G L+ I F + G+G ++EG + NP++Y+ + + A+ + V WI +
Sbjct: 409 GNLNDIDFKIKRLFKEGGDNVYGLGATLEGFDVNPLMYEFVFDQAWDYS-VTTDQWITNW 467
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
S+ R G I AW L+ +Y T+ AT ++ A P +T K N
Sbjct: 468 SMCRGGNQDANIIKAWRALHQKIY--TEHATCGQSVLMNARP-------RLTGTKSWNTN 518
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
+ Y + LW E+++A + ++ +R+D+I++ RQ L
Sbjct: 519 PGI-----------HYANNDLWQIWKELLKARNI---------NNSDFRFDVINIGRQVL 558
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
++ Y D G+ + S R L+ D+D LL+C +G WL+ A+
Sbjct: 559 GNLFSKYRDQFTACYNRKDTTGMREWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARNCG 618
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
++ YE NAR +T+W ++ + L DY N+ W GL R +Y R + +I
Sbjct: 619 ATVSEKDYYEENARCILTVW----GQQDTQLNDYANRGWGGLTRSFYRERWKRFTDGVIA 674
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ F + ++ + +W ++ +P+ S D + + L KY
Sbjct: 675 AVSEDKPFDEDKFHQDITQFEYNWTLQKDSFPIVSEEDPIQIADSLILKY 724
>gi|423259033|ref|ZP_17239956.1| hypothetical protein HMPREF1055_02233 [Bacteroides fragilis
CL07T00C01]
gi|423263996|ref|ZP_17242999.1| hypothetical protein HMPREF1056_00686 [Bacteroides fragilis
CL07T12C05]
gi|387776613|gb|EIK38713.1| hypothetical protein HMPREF1055_02233 [Bacteroides fragilis
CL07T00C01]
gi|392706262|gb|EIY99385.1| hypothetical protein HMPREF1056_00686 [Bacteroides fragilis
CL07T12C05]
Length = 718
Score = 241 bits (615), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 163/540 (30%), Positives = 262/540 (48%), Gaps = 70/540 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
VLP + G VP + +LG +V SDP C Y L +DP F EI
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273
Query: 64 FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ ++ K YG+ ++ Y+ D F E NT VD + + G A+ M+ + AVW+ Q
Sbjct: 274 YYKELEKLYGK-ANFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
W + P ++ ++ G L++LDL +E +P W S K YG +I+C
Sbjct: 329 WQANPRP--------KMIENLKAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYC 380
Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
ML N+ GN+ ++G +D++ F +A + T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHASATLKGVGMTPEGIENNPVMYELVMELPW 440
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
+ ++ + W+ +Y RYG P +Q AW L +++YN T + + V A P
Sbjct: 441 RPDRFTKEEWLKEYVKARYGVDDPVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP-- 498
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
E YQ SS+ +Y EVI A L ++ +
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQEVIEAARLMVSVADRFKG 536
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
+N + YDL+D+ RQALA+ + + AY+ D S +FL+L+ D LL
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTR 596
Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
F +G W+E A+ L E+++ YEWNAR QIT W + + LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGDTPEEKELYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656
Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
D+Y R +YF ++ + +E ++ ++ + W N Y E+ GD + ++
Sbjct: 657 DFYYMRWKLYFDFLSQRMEGKPPAKI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|391338146|ref|XP_003743422.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Metaseiulus
occidentalis]
Length = 665
Score = 241 bits (615), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 139/465 (29%), Positives = 233/465 (50%), Gaps = 44/465 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V+P F+G VP A + + P+ ++ W + + + L T+ F+ + +I
Sbjct: 225 VVPGFNGFVPRAFERLHPAVSWSRASRWNNFPDE--YAMLTFLAPTESFFLNVSSLYITM 282
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
YG + H+Y+ D F+E TP + P ++ + + +Y + D +W+MQGWLF +
Sbjct: 283 YRSIYG-SDHLYSVDLFNEETPDTNDPAALAEMSSNVYESIAKADPKGIWVMQGWLFVHG 341
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
+W ++KA L PLGK++VLDLF+E P + ++G P+IWCMLHN+ G ++
Sbjct: 342 GDYWNHDRVKAFLGGPPLGKMIVLDLFSEQSPQFPRFSNYFGQPFIWCMLHNYGGVSGLF 401
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G L+ I P+ R S M+G+G++ EG QN V+Y+ M+E +++ +V W+ Y
Sbjct: 402 GNLEWINSEPLNVRRSV-PNMIGIGIAPEGTGQNEVIYEFMAENSYRDSSENVSLWLQNY 460
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
RYG S P +++AW +L +VY S+T +N+G
Sbjct: 461 VGARYGLSDPHLENAWELLRKSVY-------------------------SLTSKSIENHG 495
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGN---ELSASNTYRYDLIDLTR 363
+L P +WY+ S+VI A I EL + DL+D+ R
Sbjct: 496 N-----YILTHRPKLNSTPLIWYNGSDVIGAATELIRGATLHRELCHERLFHQDLVDVVR 550
Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
QAL ++ +L ++ ++ N + SRR L + +D +L+ FLLG WL ++
Sbjct: 551 QALQVRVSDEYLQMMSHFKANSLIDFEEHSRRLLHCIRVLDKVLSTDPNFLLGSWLRDSR 610
Query: 424 QLAQ-NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSG 467
+ A + + + Q+E+NAR QIT W N + + DY +K W+G
Sbjct: 611 ESAGLDRDLQDQFEFNARNQITRWGPNGE-----IVDYASKMWNG 650
>gi|260642393|ref|ZP_05415712.2| alpha-N-acetylglucosaminidase [Bacteroides finegoldii DSM 17565]
gi|260622285|gb|EEX45156.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides finegoldii DSM
17565]
Length = 735
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 153/530 (28%), Positives = 261/530 (49%), Gaps = 39/530 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF+G+VPA L+ ++P AKI + W R ++ +D D L+ I F+E+
Sbjct: 235 ILPAFAGHVPAELKELYPEAKIYTMSQWGGYDEKYR---SHFIDPMDSLYSVIQHRFLEE 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY- 126
Q K YG T+HIY D F+E P + E++S++ IY +QS DS A WL W+F +
Sbjct: 292 QTKVYG-TNHIYGIDPFNEVDSPNWNEEFLSNVSDKIYKSIQSVDSAAQWLQMTWMFYHA 350
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P++K+ LN+VP KL++LD + + IW ++Q+YG PYIWC L NF GN +
Sbjct: 351 KEKWTQPRIKSFLNAVPQDKLILLDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLA 410
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G L+ + F + G+G+++EG++ NP++Y+ + E A+++ + V WI +
Sbjct: 411 GDLNDVDFKIDRLFKEGGDNVYGLGVTLEGLDVNPLMYEFVFERAWEN-SIPVHQWIANW 469
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G I AW LY +Y T A ++ A P ++
Sbjct: 470 AQCRGGNVDNHIIKAWKQLYEKIY--TSAALCGQAVLMNARPQLE--------------- 512
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
E Y + LW E+++A ++ + Y +D+I++ RQ L
Sbjct: 513 ---GVEGWNTLPGYDYKNIDLWEIWKELLKAEGVY---------HSEYHFDVINVGRQVL 560
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
+ + Y+ D G +R +L+ D+D LL C +G W++ A+ A
Sbjct: 561 GNLFADYRDKFADCYRKKDLEGTKVWGQRMDQLLLDVDRLLCCSPVLSIGKWIKDARDFA 620
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
NE+++K YE NAR +T+W ++ + L DY N+ W GL R +Y R + + +I
Sbjct: 621 VNEQEQKYYEENARCILTVW----GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIA 676
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ F + + ++ + +W +P+ S + + ++ L KY
Sbjct: 677 AMTRHKNFDEEKFHQDITQFEYEWTLKNEDFPITSGENPISLAKELILKY 726
>gi|383124408|ref|ZP_09945072.1| hypothetical protein BSIG_3565 [Bacteroides sp. 1_1_6]
gi|251839096|gb|EES67180.1| hypothetical protein BSIG_3565 [Bacteroides sp. 1_1_6]
Length = 732
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 154/530 (29%), Positives = 258/530 (48%), Gaps = 39/530 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPA L+ ++P+AKI Q+ W R ++ +D D L+ I R F+E+
Sbjct: 233 VLPAFAGHVPAELKTIYPNAKIYQMSQWGGFDEKYR---SHFIDPMDSLYQVIQRRFLEE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K YG T HIY D F+E P S ++++++ + IY + DS A WL W+F YD
Sbjct: 290 QTKVYG-TDHIYGIDPFNEVDSPDWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYD 348
Query: 128 PF-WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P++++ L +VP KL++LD + + IW ++++YG PYIWC L NF GN +
Sbjct: 349 KKKWTQPRIRSFLKAVPDDKLILLDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIA 408
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G L+ I F + G+G ++EG + NP++Y+ + + A+ + V WI +
Sbjct: 409 GNLNDIDFKIKRLFKEGGDNVYGLGATLEGFDVNPLMYEFVFDQAWDY-PVTTDQWITNW 467
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
S+ R G I AW L+ +Y T+ A ++ A P +T K N
Sbjct: 468 SMCRGGDQDANIIKAWRALHQNIY--TEYAICGQSVLMNARP-------RLTGTKSWNTN 518
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
+ Y + LW E+++A + ++ +R+D+I++ RQ L
Sbjct: 519 PGIH-----------YANNDLWQIWKELLKARNI---------NNSDFRFDVINIGRQVL 558
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
+E Y D G+ + S R L+ D+D LL+C +G WL+ A+
Sbjct: 559 GNLFSEYRDQFTACYNRKDTTGMREWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARDCG 618
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
++ YE NAR +T+W ++ + L DY N+ W GL R +Y R + +I
Sbjct: 619 TTVSEKDYYEENARCILTVW----GQQDTQLNDYANRGWGGLTRSFYRERWKRFTDGVIG 674
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ F + ++ + +W ++ +P+ S D + + L KY
Sbjct: 675 AVSKNKPFDEDKFHQDITQFEYNWTLQKDSFPIVSEEDPIQIADSLILKY 724
>gi|53711968|ref|YP_097960.1| alpha-N-acetylglucosaminidase [Bacteroides fragilis YCH46]
gi|52214833|dbj|BAD47426.1| alpha-N-acetylglucosaminidase precursor [Bacteroides fragilis
YCH46]
Length = 718
Score = 240 bits (612), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 163/540 (30%), Positives = 261/540 (48%), Gaps = 70/540 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
VLP + G VP + +LG +V SDP C Y L +DP F EI
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273
Query: 64 FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ ++ K YG+ ++ Y+ D F E NT VD + + G A+ M+ + AVW+ Q
Sbjct: 274 YYKELEKLYGK-ANFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
W + P ++ ++ G L++LDL +E +P W S K YG +I+C
Sbjct: 329 WQANPRP--------KMIENLKAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYC 380
Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
ML N+ GN+ ++G +D++ F +A + T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHASATLKGVGMTPEGIENNPVMYELVMELPW 440
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
+ ++ + W+ +Y RYG P +Q AW L +++YN T + + V A P
Sbjct: 441 RPDRFTKEEWLKEYVKARYGVDDPVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP-- 498
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
E YQ SS+ +Y EVI A L ++ +
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQEVIEAARLMVSVADRFKG 536
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
+N + YDL+D+ RQALA+ + + AY+ D S +FL+L+ D LL
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTR 596
Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
F +G W+E A+ L E+++ YEWNAR QIT W + + LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGDTPEEKELYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656
Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
D+Y R +YF ++ + +E + ++ + W N Y E+ GD + ++
Sbjct: 657 DFYYMRWKLYFDFLSQRIEGKTPAEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|452988463|gb|EME88218.1| glycoside hydrolase family 89 protein [Pseudocercospora fijiensis
CIRAD86]
Length = 772
Score = 240 bits (612), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 144/509 (28%), Positives = 259/509 (50%), Gaps = 51/509 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP F+G VP + ++P+A W +++ + L+ DPLF + ++FI +
Sbjct: 238 VLPCFTGFVPTQISRLYPNASFVNGSRWNGFQAE--YTNVTFLEPFDPLFTTLQKSFISK 295
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
Q++ YG S IY D ++EN P Y+ ++ + +++ D +A+W +QGWLF S
Sbjct: 296 QIEAYGNVSSIYTLDQYNENDPFSGELAYLKNVTSNTIKSLKAADPEAIWFIQGWLFYSS 355
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW +++A L V +++LDLF+E +P W + ++G P+IWC LH++ GN ++
Sbjct: 356 ADFWTDERVEAYLGGVANEDMLILDLFSESQPQWQRTNSYFGKPWIWCQLHDYGGNQGLH 415
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G ++++ PV+A ++ +TMVG+G +MEG E N ++YD++ + A+ E +D ++ + +
Sbjct: 416 GQVENVTINPVQALANKTSTMVGMGSTMEGQEGNEIIYDILLDQAWSKEPIDSDSYFHDW 475
Query: 247 SVRRY-GRSVPA-IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
RY G +P+ + AW+V+ TVYN TD + V + +++P+ + + +
Sbjct: 476 VTSRYAGSKLPSGLYTAWDVMRQTVYNSTD--IEAAEAVTKSIFELEPNTTGLLNRRGHH 533
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA--SNTYRYDLIDLT 362
+ VL S + +L+ AS +++ Y++DL+D T
Sbjct: 534 STLILYDPNVLVSAWN------------------DLYNASNDDIQLWDVKAYQFDLVDTT 575
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVF------QLSRRFLELVEDMDGLLACHDG--FL 414
RQ LA L+ + + + G + + + + L++D+D +L F
Sbjct: 576 RQVLANAFYPLYTDFVHSAN-KSVQGTYSPTKAEEKGKEMIMLLKDLDSVLEASGNAHFK 634
Query: 415 LGPWLESAKQLAQNEEQEKQ----------YEWNARTQITMWFDNTQEEASLLRDYGNKY 464
L W+ESA+ A E+ YE+ AR QIT+W N + + DY +K
Sbjct: 635 LSSWIESARLWAPAEDYADDKNTTAKIADFYEYTARNQITLWGPNGE-----ISDYASKQ 689
Query: 465 WSGLLRDYYGPRAAIYFKYMIESLESGDG 493
W+GL+R YY PR + + + S S +G
Sbjct: 690 WAGLIRSYYVPRWQRFVDFTLNSTTSMNG 718
>gi|423248659|ref|ZP_17229675.1| hypothetical protein HMPREF1066_00685 [Bacteroides fragilis
CL03T00C08]
gi|423253608|ref|ZP_17234539.1| hypothetical protein HMPREF1067_01183 [Bacteroides fragilis
CL03T12C07]
gi|392655237|gb|EIY48880.1| hypothetical protein HMPREF1067_01183 [Bacteroides fragilis
CL03T12C07]
gi|392657600|gb|EIY51231.1| hypothetical protein HMPREF1066_00685 [Bacteroides fragilis
CL03T00C08]
Length = 718
Score = 240 bits (612), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 163/540 (30%), Positives = 261/540 (48%), Gaps = 70/540 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
VLP + G VP + +LG +V SDP C Y L +DP F EI
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273
Query: 64 FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ ++ K YG+ ++ Y+ D F E NT VD + + G A+ M+ + AVW+ Q
Sbjct: 274 YYKELEKLYGK-ANFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
W + P ++ ++ G L++LDL +E +P W S K YG +I+C
Sbjct: 329 WQANPRP--------KMIENLKAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYC 380
Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
ML N+ GN+ ++G +D++ F +A + T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHASATLKGVGMTPEGIENNPVMYELVMELPW 440
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
+ ++ + W+ +Y RYG P +Q AW L +++YN T + + V A P
Sbjct: 441 RPDRFTKEEWLKEYVKARYGVDDPVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP-- 498
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
E YQ SS+ +Y EVI A L ++ +
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQEVIEAARLMVSVADRFKG 536
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
+N + YDL+D+ RQALA+ + + AY+ D S +FL+L+ D LL
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTR 596
Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
F +G W+E A+ L E+++ YEWNAR QIT W + + LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGDTPEEKELYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656
Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
D+Y R +YF ++ + +E + ++ + W N Y E+ GD + ++
Sbjct: 657 DFYYMRWKLYFDFLSQRIEGKTPAEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|423269418|ref|ZP_17248390.1| hypothetical protein HMPREF1079_01472 [Bacteroides fragilis
CL05T00C42]
gi|423273021|ref|ZP_17251968.1| hypothetical protein HMPREF1080_00621 [Bacteroides fragilis
CL05T12C13]
gi|392701212|gb|EIY94372.1| hypothetical protein HMPREF1079_01472 [Bacteroides fragilis
CL05T00C42]
gi|392708585|gb|EIZ01692.1| hypothetical protein HMPREF1080_00621 [Bacteroides fragilis
CL05T12C13]
Length = 718
Score = 240 bits (612), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 163/540 (30%), Positives = 261/540 (48%), Gaps = 70/540 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
VLP + G VP + +LG +V SDP C Y L +DP F EI
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273
Query: 64 FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ ++ K YG+ ++ Y+ D F E NT VD + + G A+ M+ + AVW+ Q
Sbjct: 274 YYKELEKLYGK-ANFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
W + P ++ ++ G L++LDL +E +P W S K YG +I+C
Sbjct: 329 WQANPRP--------KMIENLKAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYC 380
Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
ML N+ GN+ ++G +D++ F +A + T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHASATLKGVGMTPEGIENNPVMYELVMELPW 440
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
+ ++ + W+ +Y RYG P +Q AW L +++YN T + + V A P
Sbjct: 441 RPDRFTKEEWLKEYVKARYGVDDPVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP-- 498
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
E YQ SS+ +Y EVI A L ++ +
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQEVIEAARLMVSVADRFKG 536
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
+N + YDL+D+ RQALA+ + + AY+ D S +FL+L+ D LL
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTR 596
Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
F +G W+E A+ L E+++ YEWNAR QIT W + + LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGDTPEEKELYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656
Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
D+Y R +YF ++ + +E + ++ + W N Y E+ GD + ++
Sbjct: 657 DFYYMRWKLYFDFLSQRMEGKAPAEI-----DFYAIEEPWTKAANPYSTEAEGDCIEVAK 711
>gi|60680169|ref|YP_210313.1| alpha-N-acetylglucosaminidase [Bacteroides fragilis NCTC 9343]
gi|375357012|ref|YP_005109784.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis 638R]
gi|383116930|ref|ZP_09937677.1| hypothetical protein BSHG_0978 [Bacteroides sp. 3_2_5]
gi|60491603|emb|CAH06355.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis NCTC
9343]
gi|251947777|gb|EES88059.1| hypothetical protein BSHG_0978 [Bacteroides sp. 3_2_5]
gi|301161693|emb|CBW21233.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis 638R]
Length = 718
Score = 240 bits (612), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 163/540 (30%), Positives = 261/540 (48%), Gaps = 70/540 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
VLP + G VP + +LG +V SDP C Y L +DP F EI
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273
Query: 64 FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ ++ K YG+ ++ Y+ D F E NT VD + + G A+ M+ + AVW+ Q
Sbjct: 274 YYKELEKLYGK-ANFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
W + P ++ ++ G L++LDL +E +P W S K YG +I+C
Sbjct: 329 WQANPRP--------KMIENLKAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYC 380
Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
ML N+ GN+ ++G +D++ F +A + T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHASATLKGVGMTPEGIENNPVMYELVMELPW 440
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
+ ++ + W+ +Y RYG P +Q AW L +++YN T + + V A P
Sbjct: 441 RPDRFTKEEWLKEYVKARYGVDDPVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP-- 498
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
E YQ SS+ +Y EVI A L ++ +
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQEVIEAARLMVSVADRFKG 536
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
+N + YDL+D+ RQALA+ + + AY+ D S +FL+L+ D LL
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTR 596
Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
F +G W+E A+ L E+++ YEWNAR QIT W + + LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGDTPEEKELYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656
Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
D+Y R +YF ++ + +E + ++ + W N Y E+ GD + ++
Sbjct: 657 DFYYMRWKLYFDFLSQRIEGKTPAEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|423282107|ref|ZP_17260992.1| hypothetical protein HMPREF1204_00530 [Bacteroides fragilis HMW
615]
gi|404582594|gb|EKA87288.1| hypothetical protein HMPREF1204_00530 [Bacteroides fragilis HMW
615]
Length = 718
Score = 240 bits (612), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 163/540 (30%), Positives = 261/540 (48%), Gaps = 70/540 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
VLP + G VP + +LG +V SDP C Y L +DP F EI
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273
Query: 64 FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ ++ K YG+ ++ Y+ D F E NT VD + + G A+ M+ + AVW+ Q
Sbjct: 274 YYKELEKLYGK-ANFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
W + P ++ ++ G L++LDL +E +P W S K YG +I+C
Sbjct: 329 WQANPRP--------KMIENLKAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYC 380
Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
ML N+ GN+ ++G +D++ F +A + T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHASATLKGVGMTPEGIENNPVMYELVMELPW 440
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
+ ++ + W+ +Y RYG P +Q AW L +++YN T + + V A P
Sbjct: 441 RPDRFTKEEWLKEYVKARYGVDDPVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP-- 498
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
E YQ SS+ +Y EVI A L ++ +
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQEVIEAARLMVSVADRFKG 536
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
+N + YDL+D+ RQALA+ + + AY+ D S +FL+L+ D LL
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTR 596
Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
F +G W+E A+ L E+++ YEWNAR QIT W + + LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGDTPEEKELYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656
Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
D+Y R +YF ++ + +E + ++ + W N Y E+ GD + ++
Sbjct: 657 DFYYMRWKLYFDFLSQRMEGKAPAEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|313145188|ref|ZP_07807381.1| glycoside hydrolase family 89 [Bacteroides fragilis 3_1_12]
gi|313133955|gb|EFR51315.1| glycoside hydrolase family 89 [Bacteroides fragilis 3_1_12]
Length = 718
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 164/540 (30%), Positives = 260/540 (48%), Gaps = 70/540 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
VLP + G VP + +LG +V SDP C Y L +DP F EI
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273
Query: 64 FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ ++ K YG+ + Y+ D F E NT VD + + G A+ M+ + AVW+ Q
Sbjct: 274 YYKELEKLYGK-ADFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
W + P ++ ++ G L++LDL +E +P W S K YG +++C
Sbjct: 329 WQANPRP--------KMIENLGAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYC 380
Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
ML N+ GN+ ++G +D++ F +A +T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHAGSTLKGVGMAPEGIENNPVMYELVMELPW 440
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
+ E+ + W+ +Y RYG P +Q AW L +++YN T + + V A P
Sbjct: 441 RAERFTKEEWLKEYVKARYGADDPVVQAAWTKLANSIYNSPKNLTQQGTHESVFCARP-- 498
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
E YQ SS+ +Y +VI A L ++ +
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQDVIEAARLMVSVADRYKG 536
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
+N + YDL+D+ RQALA+ + + AY+ D S++FL L+ D LL
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTR 596
Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
F +G W+E A+ L E++ YEWNAR QIT W + + LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGGTSEEKALYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656
Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
D+Y R +YF + + LE ++ ++ + W N Y E+ GD + T++
Sbjct: 657 DFYYMRWKLYFDSLSQKLEGKTPEKI-----DFYAVEEPWAKATNPYSAEAEGDCIETAK 711
>gi|393788286|ref|ZP_10376416.1| hypothetical protein HMPREF1068_02696 [Bacteroides nordii
CL02T12C05]
gi|392655959|gb|EIY49600.1| hypothetical protein HMPREF1068_02696 [Bacteroides nordii
CL02T12C05]
Length = 757
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 166/546 (30%), Positives = 259/546 (47%), Gaps = 70/546 (12%)
Query: 11 AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK 70
FSG VP L+ +P+A I Q +W K + LD TD LF +GRAF+E+Q +
Sbjct: 236 GFSGYVPRELKEKYPTANINQQRSWCGFKGAAQ------LDPTDSLFTRMGRAFLEEQAR 289
Query: 71 EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW 130
+G +Y D F E+ PP+D+PEY+ ++G I+ + D + W MQ W D
Sbjct: 290 LFG-AHGVYAADPFHESAPPIDTPEYLKAVGERIHHLFRDFDPHSTWAMQSWSLRED--- 345
Query: 131 RPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQ-FYGVPYIWCMLHNFAGNIEMYGIL 189
++ +VP L++LDL + STSK F+G + LHNF G I M+G L
Sbjct: 346 -------IVKAVPKDALLILDLNGK-----STSKALFWGYSTVVGNLHNFGGRINMHGDL 393
Query: 190 DSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVR 249
+A N + G G+ ME +EQNPV Y+L EM + ++++AW+ QY+ R
Sbjct: 394 KLLASNQYSKAKRLNPAVCGSGLFMEAVEQNPVYYELAFEMPCHADSINLQAWLKQYATR 453
Query: 250 RYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV 309
RYG PA Q+AW +L + Y T+K+ ++ A P +D V
Sbjct: 454 RYGAFSPAAQEAWLLLLNGPYRRGTNGTEKS-SIVAARPALD-----------------V 495
Query: 310 SKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 369
K + YD P L VIRA L + ++LS S YR+D++D+ RQ +
Sbjct: 496 KKSGPNAALEIPYD-PTL------VIRAQSLLLKDIDKLSVSRPYRFDIVDVQRQLMTNL 548
Query: 370 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 429
+ EA++ D S RFLE++ DMD LL + WL A+ +
Sbjct: 549 GQLIHRQAAEAFRKKDQCAFTLHSGRFLEMLADMDKLLRTRSEYSFDRWLTEARSWGDTD 608
Query: 430 EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 489
E++ E +A + +T+W + + DY + WSGL+ YY PR ++ + + L+
Sbjct: 609 EEKNLMERDATSLVTIWGADGDPR---IFDYSWREWSGLISGYYLPRWQKFYAMLQQHLD 665
Query: 490 SGDGFRLKD----WRREWIKLTNDWQN-------------GRNVYPVESNGDALITSQWL 532
G + + RE + ND+ N G+ P+ + GD +I + L
Sbjct: 666 VGTSYEEAGLPLIYGREAFR-ANDFYNGLAEWELAYVDTYGKARTPI-TEGDEIIMVKQL 723
Query: 533 YNKYLQ 538
++KYL+
Sbjct: 724 FDKYLK 729
>gi|424666301|ref|ZP_18103337.1| hypothetical protein HMPREF1205_02176 [Bacteroides fragilis HMW
616]
gi|404573840|gb|EKA78592.1| hypothetical protein HMPREF1205_02176 [Bacteroides fragilis HMW
616]
Length = 718
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 164/540 (30%), Positives = 260/540 (48%), Gaps = 70/540 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
VLP + G VP + +LG +V SDP C Y L +DP F EI
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273
Query: 64 FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ ++ K YG+ + Y+ D F E NT VD + + G A+ M+ + AVW+ Q
Sbjct: 274 YYKELEKLYGK-ADFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
W + P ++ ++ G L++LDL +E +P W S K YG +++C
Sbjct: 329 WQANPRP--------KMIENLKAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYC 380
Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
ML N+ GN+ ++G +D++ F +A +T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHAGSTLKGVGMTPEGIENNPVMYELVMELPW 440
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
+ E+ + W+ +Y RYG P +Q AW L +++YN T + + V A P
Sbjct: 441 RAERFTKEEWLKEYVKARYGADDPVVQAAWTKLANSIYNSPKNLTQQGTHEAVFCARP-- 498
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
E YQ SS+ +Y +VI A L ++ +
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQDVIEAARLMVSVADRYKG 536
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
+N + YDL+D+ RQALA+ + + AY+ D S++FL L+ D LL
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTR 596
Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
F +G W+E A+ L E++ YEWNAR QIT W + + LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGGTSEEKALYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656
Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
D+Y R +YF + + LE ++ ++ + W N Y E+ GD + T++
Sbjct: 657 DFYYMRWKLYFDSLSQKLEGKTPEKI-----DFYAVEEPWAKATNPYSAEAEGDCIETAK 711
>gi|423280158|ref|ZP_17259071.1| hypothetical protein HMPREF1203_03288 [Bacteroides fragilis HMW
610]
gi|404584494|gb|EKA89159.1| hypothetical protein HMPREF1203_03288 [Bacteroides fragilis HMW
610]
Length = 718
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 164/540 (30%), Positives = 260/540 (48%), Gaps = 70/540 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
VLP + G VP + +LG +V SDP C Y L +DP F EI
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273
Query: 64 FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ ++ K YG+ + Y+ D F E NT VD + + G A+ M+ + AVW+ Q
Sbjct: 274 YYKELEKLYGK-ADFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
W + P ++ ++ G L++LDL +E +P W S K YG +++C
Sbjct: 329 WQANPRP--------KMIENLGAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYC 380
Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
ML N+ GN+ ++G +D++ F +A +T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHAGSTLKGVGMTPEGIENNPVMYELVMELPW 440
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
+ E+ + W+ +Y RYG P +Q AW L +++YN T + + V A P
Sbjct: 441 RAERFTKEEWLKEYVKARYGADDPVVQAAWTKLANSIYNSPKNLTQQGTHESVFSARP-- 498
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
E YQ SS+ +Y +VI A L ++ +
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQDVIEAARLMVSVADRYKG 536
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
+N + YDL+D+ RQALA+ + + AY+ D S++FL L+ D LL
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTR 596
Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
F +G W+E A+ L E++ YEWNAR QIT W + + LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGGTSEEKALYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656
Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
D+Y R +YF + + LE ++ ++ + W N Y E+ GD + T++
Sbjct: 657 DFYYMRWKLYFDSLSQKLEGKTPEKI-----DFYAVEEPWTKATNPYSAEAEGDCIETAK 711
>gi|265765312|ref|ZP_06093587.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_16]
gi|263254696|gb|EEZ26130.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_16]
Length = 718
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 163/540 (30%), Positives = 261/540 (48%), Gaps = 70/540 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
VLP + G VP + +LG +V SDP C Y L +DP F EI
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273
Query: 64 FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ ++ K YG+ ++ Y+ D F E NT VD + + G A+ M+ + AVW+ Q
Sbjct: 274 YYKELEKLYGK-ANFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
W + P ++ ++ G L++LDL +E +P W S K YG +I+C
Sbjct: 329 WQANPRP--------KMIENLKAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYC 380
Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
ML N+ GN+ ++G +D++ F +A + T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHASATLKGVGMTPEGIENNPVMYELVMELPW 440
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
+ ++ + W+ +Y RYG P +Q AW L +++YN T + + V A P
Sbjct: 441 RPDRFTKEEWLKEYVKARYGVDDPVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP-- 498
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
E YQ SS+ +Y EVI A L ++ +
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQEVIEAARLMVSVADRFKG 536
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
+N + YDL+D+ RQALA+ + + AY+ D S +FL+L+ D LL
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTR 596
Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
F +G W+E A+ L E+++ YEWNAR QIT W + + LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGDTPEEKEFYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656
Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
D+Y R +YF ++ + +E + ++ + W N Y E+ GD + ++
Sbjct: 657 DFYYMRWKLYFDFLSQRMEGKAPAEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|379334158|gb|AFD03088.1| putative alpha-N-acetylglucosaminidase [uncultured bacterium 8]
Length = 726
Score = 238 bits (608), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 164/524 (31%), Positives = 259/524 (49%), Gaps = 40/524 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNW-FSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
VL FSG+VP L A+ T L W F V +LD DPLF E G +
Sbjct: 218 VLQGFSGHVPQELIAER-GARSTTLPWWDFEVG---------MLDPRDPLFEEFGTTLLT 267
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
+Q + +G T H+Y D F E TPPV P ++ + A++ M + D A W++Q W FSY
Sbjct: 268 EQTRLFG-TDHLYAADPFIETTPPVSDPADLAQVARAVHGVMTAVDDRATWVLQAWPFSY 326
Query: 127 -DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
+W P + A L+++P +++LDL+AE +P+W + + P++WCMLH+ G +
Sbjct: 327 RSRYWTPERTGAFLDAIPDDGMLILDLWAEHRPVWQRTDGYRKKPWVWCMLHSLGGRPGL 386
Query: 186 YGILDSIAFGPVEART-SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
YG LD IA G A+ + ++ G+G SME +PV+Y+L++++A+Q DV+AW+
Sbjct: 387 YGKLDEIATGAARAQADARGGSLSGIGASMEAFGGDPVLYELLADVAWQGSVDDVRAWLE 446
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
++ RYGR+ P + AW++L+ +VY ++G VIV P + EG ++
Sbjct: 447 TWTRARYGRATPGLLRAWDLLHDSVY-ASEGPGPPG-SVIVGRPTL--------EGDLRH 496
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+ V ++ S D P + L + E SA R DL D+T Q
Sbjct: 497 ------ELPVHLADPPSPDVP-----PALAEAWALLADEATQEDSAGPLGR-DLCDVTAQ 544
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
L A E +A DA G + +R L+ +ED+D LLA L WL A+
Sbjct: 545 VLTHVACERQWRAADAALARDADGFQRAARALLDTIEDLDTLLATRPEHRLDGWLADARG 604
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
A + YE +AR +T+W S L DY ++W+GL+ +Y PR +++++
Sbjct: 605 WATTPAEADLYETDARRLLTLWGHTR----SKLHDYSGRHWAGLVGTFYLPRWRSWYEHI 660
Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALIT 528
+LE+G +R +++ + W RN G A T
Sbjct: 661 ARALETGSPYRAEEFEASLLAQEERWVADRNGPTTPEAGTAGAT 704
>gi|336408181|ref|ZP_08588675.1| hypothetical protein HMPREF1018_00690 [Bacteroides sp. 2_1_56FAA]
gi|335939481|gb|EGN01355.1| hypothetical protein HMPREF1018_00690 [Bacteroides sp. 2_1_56FAA]
Length = 718
Score = 238 bits (608), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 162/540 (30%), Positives = 260/540 (48%), Gaps = 70/540 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
+LP + G VP + +LG +V SDP C Y L +DP F EI
Sbjct: 225 MLPGYCGMVPHNAKE--------KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSL 273
Query: 64 FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ ++ K YG+ ++ Y+ D F E NT VD + + G A+ M+ + AVW+ Q
Sbjct: 274 YYKELEKLYGK-ANFYSMDPFHEGGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQA 328
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWC 174
W + P ++ + G L++LDL +E +P W S K YG +I+C
Sbjct: 329 WQANPRP--------KMIEDLKAGDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYC 380
Query: 175 MLHNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
ML N+ GN+ ++G +D++ F +A + T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNYGGNVGLHGKMDNVIDNFYLAKADPHASATLKGVGMTPEGIENNPVMYELVMELPW 440
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
+ ++ + W+ +Y RYG P +Q AW L +++YN T + + V A P
Sbjct: 441 RPDRFTKEEWLKEYVKARYGVDDPVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP-- 498
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
E YQ SS+ +Y EVI A L ++ +
Sbjct: 499 -------AEDVYQ---------------VSSWSEMKDYYRPQEVIEAARLMVSVADRFKG 536
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
+N + YDL+D+ RQALA+ + + AY+ D S +FL+L+ D LL
Sbjct: 537 NNNFEYDLVDIVRQALAEKGRLMQKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTR 596
Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
F +G W+E A+ L E+++ YEWNAR QIT W + + LRDY +K W+GLL+
Sbjct: 597 PEFRVGKWIEEARALGDTPEEKELYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLK 656
Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
D+Y R +YF ++ + +E + ++ + W N Y E+ GD + ++
Sbjct: 657 DFYYMRWKLYFDFLSQRIEGKTPAEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|325299497|ref|YP_004259414.1| alpha-N-acetylglucosaminidase [Bacteroides salanitronis DSM 18170]
gi|324319050|gb|ADY36941.1| Alpha-N-acetylglucosaminidase [Bacteroides salanitronis DSM 18170]
Length = 723
Score = 238 bits (608), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 162/535 (30%), Positives = 263/535 (49%), Gaps = 62/535 (11%)
Query: 8 VLPAFSGNVPA-ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
VLP +SG VP A Q + + +L N F+ + L TD F EI + E
Sbjct: 227 VLPGYSGMVPHDAHQKLGLNVTEPELWNGFTRPA--------FLMPTDKRFAEIAALYYE 278
Query: 67 QQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 124
+Q K +G+ ++ Y+ D F E N VD + G A+ M+ + AVW++QGW
Sbjct: 279 EQEKLFGKANY-YSMDPFHELENAGEVD----FDAAGKAVMDAMKQVNPKAVWVVQGW-- 331
Query: 125 SYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHN 178
+ +P RP MK L N G L++LDLF+E +P IW K + +++CML N
Sbjct: 332 TENP--RPEMMKNLKN----GDLLILDLFSECRPMWGIPSIWKREKGYEQHDWLFCMLEN 385
Query: 179 FAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 237
F N+ ++G +D + + + + G+G++MEG E NPV+++LM E+ ++ EK+
Sbjct: 386 FGANVGLHGRMDQLLNNFYLTKNNPLAAHLKGIGLTMEGSENNPVMFELMCELPWRPEKI 445
Query: 238 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSII 295
++W+ +Y RYG I+ AW +L +YNC G + + + P ++ +
Sbjct: 446 TKESWLKEYLAARYGAKDEKIEQAWMILADGIYNCPFGNNQQGPHESIFCGRPSMNNFQV 505
Query: 296 SVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 355
S + K +NY P S EA A L + + ++ +N +
Sbjct: 506 S-SWSKMENYYDPTSTEA-----------------------AARLMLEAADKFRGNNNFE 541
Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
YDL+D+ RQALA ++ I ++ D + S+ FL L+ D LLA F +
Sbjct: 542 YDLVDIVRQALADRGRIVYNRAIADFKSFDKRSYARHSKEFLNLLLAQDRLLATRSEFRV 601
Query: 416 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
G W+ A+ L E++ YEWNAR QIT W + + LRDY +K W+G+L+D+Y
Sbjct: 602 GRWINQARSLGNTPEEKDLYEWNARVQITTWGNRECADKGGLRDYAHKEWNGILKDFYYK 661
Query: 476 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
R A +++ + L DG ++D +W + W N Y E+ GD + T++
Sbjct: 662 RWAAWWEMLQGVL---DGGEMQD--IDWYAMEEPWTLQHNPYKAEAEGDCIETAR 711
>gi|282877910|ref|ZP_06286719.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
35310]
gi|281299911|gb|EFA92271.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
35310]
Length = 723
Score = 238 bits (606), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 157/506 (31%), Positives = 240/506 (47%), Gaps = 45/506 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSG+VPA L+ ++P A I LG W + R + L+ DPLF +I + ++E+
Sbjct: 236 VLPAFSGHVPAKLKELYPQANIKYLGRWAGFSDNYR---CHFLNPEDPLFAKIQKMYLEE 292
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q +G T HIY D F+E PP PEY+ + IY + S D A W+ W+F ++
Sbjct: 293 QKALFG-TDHIYGIDPFNEVDPPSWKPEYLKEISHNIYRTVTSVDPGAEWMQMSWMFYHN 351
Query: 128 PF-WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P ++KALL V GK+ +LD E +W T+ FYG PYIWC L NF GN +
Sbjct: 352 KKQWTPKRIKALLTGVSRGKMSLLDYHCENVELWKTTNNFYGQPYIWCYLGNFGGNTTIT 411
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + EA +N ++G+G ++EG++ Y+ + A+ D K WI+
Sbjct: 412 GNVKESGQRLNEALNKKNKNLIGIGSTLEGLDVIQFPYEYILTQAWTATPAD-KEWIDNL 470
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G S P ++ AW +L++ +Y T R + + P + P + GKYQ
Sbjct: 471 ADRHVGFSSPKLRQAWQILFNDIY------TQIPRSLGI-LPALRPIL-----GKYQERR 518
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
++ E +W S+V N Y+ DLI + RQ L
Sbjct: 519 TEITYPTKRLEE--------VWKLMSDVSEC------------DRNEYQLDLIAVGRQVL 558
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
+L L + Y D G+ + E++ D+D L A + +G W++ A+
Sbjct: 559 GNKFLKLKLELDSCYVNKDLVGLQRTGNTMKEVLVDLDYLTAGNSRCSIGKWIDDARAYG 618
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
N+ ++ YE NAR IT W L DY N+ WSGL+R YY R ++Y +
Sbjct: 619 NNDLEKAYYEKNARNLITTW-------GGSLNDYANRTWSGLIRTYYVRRWSMYIDELTA 671
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQN 512
S+ SG F + + + +W N
Sbjct: 672 SVMSGKPFDQQQLDKAIGEFEQNWVN 697
>gi|424665881|ref|ZP_18102917.1| hypothetical protein HMPREF1205_01756 [Bacteroides fragilis HMW
616]
gi|404574134|gb|EKA78885.1| hypothetical protein HMPREF1205_01756 [Bacteroides fragilis HMW
616]
Length = 732
Score = 237 bits (605), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 164/540 (30%), Positives = 254/540 (47%), Gaps = 63/540 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL F G VP ++ FP+A I G W + + L +DPLF ++ + F E+
Sbjct: 235 VLQGFYGMVPNSMITKFPNADIRDAGKWITYQRPA------FLVPSDPLFAKVAQIFYEE 288
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q K +G+ S Y D F E N+ ++ I+ + IY M++ + DA+W++QGW
Sbjct: 289 QEKLFGK-SRYYGGDPFHEGGNSEGIN----ITEAASDIYKAMKANNPDAIWVLQGW--G 341
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIW-----STSKQFYGV---PYIWCMLH 177
+P + ALL + G+ ++LDL + +P W S S + G +IWC L
Sbjct: 342 ANPSY------ALLKGLKQGEALILDLMSCARPQWGGDPSSQSHREDGYLDHNWIWCALP 395
Query: 178 NFAGNIEMYGILDSIAFGPVEARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEK 236
NF G I MYG L S A G + A V GVG + EGI NP+ YD++ +MA++ +
Sbjct: 396 NFGGRIGMYGKLQSYATGVIRAEHHPKGKYVCGVGTTPEGIGTNPIDYDMVYDMAWRTDS 455
Query: 237 VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 296
+DVK+WI Y+ RYG + A L +VYNC A A P +
Sbjct: 456 IDVKSWIANYTTYRYGSPNNNAKAAMQQLSTSVYNCPWAADGPQESYFCARPSLKID--- 512
Query: 297 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRY 356
TSS+ HL+Y V++ALE + + NEL +TYRY
Sbjct: 513 ---------------------RTSSWGTAHLYYQPINVLQALEHLLKAENELKEIDTYRY 551
Query: 357 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
D++D+TRQ LA Y + I +AY D + +FL+++ D D LL+ FLLG
Sbjct: 552 DVVDVTRQMLADYGKYIHKCIADAYYGKDTEKFDFYTSKFLQMISDQDLLLSTRKEFLLG 611
Query: 417 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
++ A N +++ + NA+ QIT W S L +Y +K W+G+L Y PR
Sbjct: 612 KFIRQADACGSNPMEKRMFINNAKRQITTW----ASVNSSLHEYAHKEWNGILGTLYAPR 667
Query: 477 AAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
YF Y+ LE + + ++ + DW + + + ++ +Y+ Y
Sbjct: 668 WKAYFDYLRTKLEGKNPKEI-----DFFTMETDWVESKKEFSAVPIKKEIEIAKTIYHNY 722
>gi|380697007|ref|ZP_09861866.1| alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
Length = 703
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 152/530 (28%), Positives = 259/530 (48%), Gaps = 39/530 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF+G+VPA L+ ++P AKI + W R ++ +D D L+ I R F+E+
Sbjct: 203 ILPAFAGHVPAELKELYPEAKIYTMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEE 259
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY- 126
Q K YG T+HIY D F+E P + E++S++ IY +Q DS A WL W+F +
Sbjct: 260 QTKVYG-TNHIYGIDPFNEVDSPNWNEEFLSNVSDKIYKSIQDVDSAAQWLQMTWMFYHA 318
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P++K+ LN+VP KL++LD + + IW ++Q+YG PYIWC L NF GN +
Sbjct: 319 KEKWTQPRIKSFLNAVPQDKLILLDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLA 378
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G L+ + F + G+G+++EG++ NP++Y+ + E A+++ + WI +
Sbjct: 379 GDLNDVDFKIDRLFKEGGDNVYGLGVTLEGLDVNPLMYEFVFERAWEN-SMPAHQWIANW 437
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G I AW LY +Y T A ++ A P ++
Sbjct: 438 AQCRGGNVDNHIVKAWKQLYEKIY--TSAALCGQAVLMNARPQLE--------------- 480
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
E Y + LW E+++A ++ + Y +D+I++ RQ L
Sbjct: 481 ---GVEGWNTLPGYDYKNIDLWEIWKELLKAEGVY---------HSEYHFDVINVGRQVL 528
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
+ + Y+ +R +L+ D+D LL C F +G W++ AK A
Sbjct: 529 GNLFADYRDKFTDCYRKKKLEETKVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDAKDFA 588
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
NE+++K YE NAR +T+W ++ + L DY N+ W GL R +Y R + + +I
Sbjct: 589 VNEQEQKYYEENARCILTVW----GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIA 644
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ F + + ++ + +W +P+ S + + ++ L KY
Sbjct: 645 AMTRHKNFDEEKFHQDITQFEYEWTLKNEDFPITSEENPISLAKELILKY 694
>gi|153808241|ref|ZP_01960909.1| hypothetical protein BACCAC_02529 [Bacteroides caccae ATCC 43185]
gi|423219048|ref|ZP_17205544.1| hypothetical protein HMPREF1061_02317 [Bacteroides caccae
CL03T12C61]
gi|149129144|gb|EDM20360.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
43185]
gi|392625814|gb|EIY19870.1| hypothetical protein HMPREF1061_02317 [Bacteroides caccae
CL03T12C61]
Length = 752
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 162/494 (32%), Positives = 235/494 (47%), Gaps = 54/494 (10%)
Query: 11 AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQ 67
FSG VP L++ +P AKI + P WC LD TD LF +GR F+E+
Sbjct: 233 GFSGYVPRELKDKYPEAKI---------RLQPGWCGFKGAGQLDPTDALFATLGRDFLEE 283
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+ K YG T IY D F E+ PPV++PEY+S++G AIY ++ D A W MQ W
Sbjct: 284 EKKLYG-TYGIYAADPFHESAPPVNTPEYLSAVGHAIYKLIKDFDPKAKWAMQAWSL--- 339
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
R P +KA VP L++LDL E K F+G P + LHNF G I M+G
Sbjct: 340 ---REPIVKA----VPQNDLIILDLNGEK---IKGRKGFWGYPAVEGNLHNFGGRINMHG 389
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
L +A + + G G+ ME IEQNPV YDL EM +V ++ W+ QY+
Sbjct: 390 DLRLLASNQYMTALKQYPNVCGSGLFMEAIEQNPVYYDLAFEMPLHKGEVAIEEWLKQYA 449
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RRYG P+ Q A L Y T+ R I+A P++ G G
Sbjct: 450 NRRYGAVSPSAQQAMICLLEGPYRPGTNGTE--RSSIIA---ARPALNVKKSGPNAGLGI 504
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
P YS VI+A L + ++L S YR+D+ID+ RQ +
Sbjct: 505 P--------------------YSPLLVIQAEGLLLKDADKLKNSEPYRFDVIDVQRQMMT 544
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
+ EA+ D S+RFL+++ED+D LL F WL SA+
Sbjct: 545 NMGQVIHKRAAEAFLNRDKEAFALHSKRFLQMLEDVDELLRTRPEFNFDRWLTSARSWGD 604
Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
EE++ E++A + +T+W + S+ DY + W+GL++ YY PR ++ + E
Sbjct: 605 TEEEKNLLEYDATSLVTIW--GADGDPSIF-DYSWREWTGLIKGYYLPRWTKFYAMLQEH 661
Query: 488 LESGDGFRLKDWRR 501
L++G + + R+
Sbjct: 662 LDNGTTYSEEGLRQ 675
>gi|404406328|ref|ZP_10997912.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
Length = 738
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 146/488 (29%), Positives = 226/488 (46%), Gaps = 39/488 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPA L ++P AKI+++ W + R ++ LD DPLF I R F+ +
Sbjct: 243 VLPAFAGHVPAELGEIYPEAKISRMSKWGGFEDRYR---SHFLDPLDPLFARIQREFLAE 299
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q +G T HIY D F+E PP PE+++ + IY M D +A WL WLF D
Sbjct: 300 QTALFG-TDHIYGADPFNEVDPPSWEPEFLARVSRTIYDTMTEADPEAEWLQMTWLFYLD 358
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W +++A + +VP K+++LD + E +W + ++G PY WC L NF GN +
Sbjct: 359 RDKWHDDRIEAFVTAVPQDKMLLLDYYCENTEVWRQTHSYHGQPYFWCYLGNFGGNTMLV 418
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G D ++ + G+G ++EG++ NP +YD + E A+ VD W +
Sbjct: 419 GNFDEVSKRIDGVLAEGGNNLRGLGSTLEGLDSNPFMYDYVFERAWDF-PVDDDRWFDAL 477
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G + AW+ L VY I S G
Sbjct: 478 ADRYLGYEDTGYRRAWDALRKNVY-----------------------ITSSKYGHCPLLN 514
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
+ E +L T + + Y E+ I +G+ S +TYRY L+++ RQ L
Sbjct: 515 ARPTLEGILTGTTDA----EIKYDNDELFEVWAKMIDAGD--SGRDTYRYWLVNVGRQTL 568
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
L A + D + +L LEL D++ L A H F + W++ ++
Sbjct: 569 GNLFLPLRDGFTAACRAKDLARMKELRSEMLELAADLETLTAQHGAFSMQKWIDDSRSFG 628
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
E+ YE N RT +T W D Q + DY N+ WSGL+ DYY R ++ +
Sbjct: 629 TTPEERDYYEVNGRTLLTTWGDRAQS----INDYANRTWSGLVADYYAERWRMFLDAAVG 684
Query: 487 SLESGDGF 494
++E+G F
Sbjct: 685 AVEAGRKF 692
>gi|392584963|gb|EIW74305.1| glycoside hydrolase family 89 protein [Coniophora puteana
RWD-64-598 SS2]
Length = 772
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 148/504 (29%), Positives = 254/504 (50%), Gaps = 60/504 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF G VP A+ +FP+A I + + + ++ L DPL+ ++ +F+ +
Sbjct: 251 VLPAFPGFVPPAMHTLFPNASIVNGSEYPGIPA--QYSNDSFLAPFDPLYAQLQSSFLAK 308
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + G +H++ D ++EN+P Y++++ + ++ +++ D DA+WLMQGWLF D
Sbjct: 309 QTEALGNVTHVWTIDQYNENSPYSGDLTYLANIANSTFASLRAHDPDAIWLMQGWLFFAD 368
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
PFW ++ A L+ +P +++LDLF++V P W + G ++WC +H+F GN+ +
Sbjct: 369 EPFWTSDRVDAYLDQIPNDGMIILDLFSDVYPQWQRLDSYRGKSWVWCEVHDFGGNMGLE 428
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + GPV+A S N++M GVG++MEG+E N ++YD++ + A+ +D A+ +
Sbjct: 429 GNFSVVTNGPVDALNSPNSSMKGVGLAMEGLEGNEIIYDVLLDQAWSAAPLDRDAYAKAW 488
Query: 247 SVRRY-------------GRSVPAIQ-DAWNVLYHTVYNCTD----GATDKNRDVIVAFP 288
+ RR+ S+PA +AW L TVY+ T+ GAT
Sbjct: 489 ATRRFHLPTANSSTTTATNTSIPASAIEAWQTLASTVYSSTNPNVWGATK---------- 538
Query: 289 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE- 347
S+I + Y P S + TS ++ AL +A+G
Sbjct: 539 ----SLIELAPSLGGMYSAPSSTIIFYDTNTS-------------LVPALRGLVAAGTSA 581
Query: 348 --LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN--DAHGVFQLSRRFLELVEDM 403
L A + +R D ID+ RQ LA + + AY + + + + R +++++D+
Sbjct: 582 PALWALDEFRTDSIDVARQLLANRFADAYTATTGAYNASGPGSAALNATAARMMQIIDDL 641
Query: 404 DGLLACHDGFLLGPWLESAKQLA---QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDY 460
D LL H+ +LL + SA+ A +E E+ AR+Q+T+W S+L DY
Sbjct: 642 DRLLMTHEPYLLSSRIASARAWAGDGGDEAYADYLEYEARSQVTLW----GPVPSVLNDY 697
Query: 461 GNKYWSGLLRDYYGPRAAIYFKYM 484
+K W GL+ YY R + +YM
Sbjct: 698 ASKVWGGLVGTYYRQRWTAFVEYM 721
>gi|400595379|gb|EJP63180.1| alpha-N-acetylglucosaminidase [Beauveria bassiana ARSEF 2860]
Length = 761
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 147/490 (30%), Positives = 248/490 (50%), Gaps = 30/490 (6%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP F G VPAAL+ + P I + W V + T L+ TD + E+ FI+
Sbjct: 245 VLPGFPGFVPAALKKLRPDVNIAEAPVWVDVPRNN--TATAFLNPTDKTYAELQSLFIKN 302
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
Q+KE+G +++Y D F+E P +YI+ + ++ Y G+ + + A+WLMQGWLF S
Sbjct: 303 QIKEFGNVTNVYTVDQFNEINPSSGDTKYITDVSSSTYKGITAANPAAIWLMQGWLFYSS 362
Query: 127 DPFWRPPQMKALLNSVP-LGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
FW ++ A L P +++LDLF+E +P W ++ ++G P+IWC LH+F GN +
Sbjct: 363 QSFWTQQRVDAYLAGPPGQDDMIILDLFSESQPQWQRTRSYFGRPWIWCELHDFGGNQAL 422
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
+G + ++ V+A E+ ++VG G++ EG E N VVYD++ + A++ +D +
Sbjct: 423 HGKITNVTQNSVQA-LKESGSIVGYGLTPEGYEGNEVVYDILLDQAWEGSPIDTANYFRA 481
Query: 246 YSVRRYGRSVPAIQD---AWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
++ RY + +D AW L Y+ D A V V+ + PS+ + +
Sbjct: 482 WARNRYSAAGIIPEDVFTAWEQLRQHAYDVQDNAIPS---VGVSVYQLFPSLKGLVN-RT 537
Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 362
+Y P + + K + + HL+Y+++ I S L + D +D+T
Sbjct: 538 GHYPPPTALQYDPKVMKNIW---HLFYNST---------IDSPGLLQIP-AFHLDFVDVT 584
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLES 421
RQ L +++ +++ +Q V Q L L +ED+D L ++ F WL S
Sbjct: 585 RQVLGNAFIDIYTDLVNQFQATANATVIQDLGNSMLSFIEDLDMALNTNEHFTFKKWLNS 644
Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
A+ Q+ +NAR+Q+T+W E+ L DY K WSG+++ YYG R I+
Sbjct: 645 AESWGQSIGAPDAVAFNARSQVTVW----STESRALDDYAAKAWSGIVKSYYGERWRIFI 700
Query: 482 KYMIESLESG 491
++ + E G
Sbjct: 701 NSLVSAREQG 710
>gi|393783261|ref|ZP_10371436.1| hypothetical protein HMPREF1071_02304 [Bacteroides salyersiae
CL02T12C01]
gi|392669540|gb|EIY63028.1| hypothetical protein HMPREF1071_02304 [Bacteroides salyersiae
CL02T12C01]
Length = 724
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 147/530 (27%), Positives = 253/530 (47%), Gaps = 46/530 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP AL+ +FP A I LG W + R + L+ + LF I + +I++
Sbjct: 235 VLPAFAGHVPGALKRIFPEANIQNLGKWAGFAEEYR---CHFLNPEEALFATIQKQYIKE 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + +G T HIY D F+E PP PEY+S + A +Y + + D A W+ W+F +D
Sbjct: 292 QTRLFG-TDHIYGVDPFNEVDPPSWEPEYLSKVSADMYHTLTAADPKAEWMQMTWMFYFD 350
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P++KA+L VP GK+V+LD E +W T++ F+G PYIWC L NF GN +
Sbjct: 351 RKDWTAPRVKAMLTGVPQGKMVLLDYHCENVELWKTTEHFHGQPYIWCYLGNFGGNTTLT 410
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + + + G+G ++EG++ Y+ + E A+ D ++W+N
Sbjct: 411 GNVKESGARLDNTLINGGSNFKGIGSTLEGLDVMQFPYEYIFEKAWTL-NTDDRSWLNAL 469
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G + +++AW++L++ VY + P++ P +
Sbjct: 470 ADRHTGVTSEPVREAWDILFNQVYVQVP-------RTLAVLPNLRPVM-----------N 511
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
KP ++ ++ Y + +++A + + + + + R D+I + RQ L
Sbjct: 512 KPNNRTSIN-------------YPNTALLQAWQKLLQAPD--CNRDALRLDIITVGRQLL 556
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
Y + + Y+ D + + E++ D++ L A H L W+ A++
Sbjct: 557 GNYFLTVKDDFDRMYEAKDLPALKARAAEMREILNDLERLNAFHSRCSLDKWISDARKYG 616
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
E + YE NAR IT W L DY ++ W+GL++DYY R +Y ++
Sbjct: 617 NTPELKNYYEKNARNLITTW-------GGRLNDYASRTWAGLIKDYYSKRWDMYLDAVVA 669
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++E+ F + E+ + W + V GD LI +++L NKY
Sbjct: 670 AVENNREFDQEKLDGEFRLFEDSWVSSTRPVEVTPEGDLLIYARFLLNKY 719
>gi|392566857|gb|EIW60032.1| alpha-N-acetylglucosaminidase [Trametes versicolor FP-101664 SS1]
Length = 747
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 160/516 (31%), Positives = 267/516 (51%), Gaps = 39/516 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V+P+F+G VP AL + P+A I W + L+ DPLF + ++FI +
Sbjct: 231 VMPSFTGFVPRALAALHPNASIVTGSQWSGFPTS--LTNDSFLEPFDPLFATLQQSFIAK 288
Query: 68 QLKEYGR-TSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
Q YG SH+Y D ++EN P +Y+ ++ A ++ +++ D AVWLMQGWLF
Sbjct: 289 QQAAYGADISHVYTLDQYNENDPFSGDLDYLRNVSAGTFASLRAADPAAVWLMQGWLFFS 348
Query: 127 DP-FWRPPQMKALLNSVPLG-KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
D FW ++ A L VP ++VLDL++E +P W+ + + G ++WC LH++ GNI
Sbjct: 349 DAVFWTDDRVAAYLGGVPGNDSMIVLDLYSEAQPQWNRTASYSGKQWVWCELHDYGGNIG 408
Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
M G LD + P+ A +S ++M GVG++MEG E N +VY ++ + A+ ++ ++++
Sbjct: 409 MEGNLDVLTHAPLTALSSPGSSMKGVGLTMEGQEGNEIVYGVLLDQAWSATSLNTSSYVS 468
Query: 245 QYSVRRYG-RSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
+ RRY + +P A QDAW +L TVYN D T + I ++ P++ +T
Sbjct: 469 SWVSRRYPVKPLPKAAQDAWRILSTTVYNNQDPNT---QATIKGIYELAPALTGMTNRIG 525
Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI---ASGNELSASNTYRYDLI 359
+ P S YD + + ++ AL+L + A LSA + YD++
Sbjct: 526 HH---PTSIP---------YD------TDATMLSALKLLLEARAQHPTLSAVPEFVYDVV 567
Query: 360 DLTRQALAKYANELFLNIIEAYQLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
D+ RQ L+ L+ +I+ Y A V + L L+ D+D LL+ ++ FLL
Sbjct: 568 DVARQLLSNRFIGLYDTLIQTYNSTSSTAQSVSAAGQPLLALLTDLDALLSTNEHFLLSS 627
Query: 418 WLESAKQLAQNEEQEKQY-EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
W+ A++ A Y E+NAR Q+T+W + + + DY +K W+GL+ YY PR
Sbjct: 628 WIADARKWADGSASYGAYLEYNARNQVTLWGPDGE-----INDYASKAWAGLVGTYYKPR 682
Query: 477 AAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 512
A + Y+ E+ +G + + + + +W N
Sbjct: 683 WAAFVDYLAETKGTGQAYNATAVKSTMLAIGQEWGN 718
>gi|429740222|ref|ZP_19273924.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
gi|429153947|gb|EKX96708.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
Length = 730
Score = 235 bits (599), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 152/530 (28%), Positives = 244/530 (46%), Gaps = 44/530 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPA L+ +FP A I LG W D ++ C + L+ +PLF +I + F+E+
Sbjct: 235 VLPAFAGHVPAELKRIFPDANIKSLGKWGGF--DEQYLC-HFLNPGEPLFAKIQKLFLEE 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q +G T HIY D F+E PP P Y+ + +Y + + D A W+ GW+F YD
Sbjct: 292 QTALFG-TDHIYGVDPFNEGEPPSWEPAYLKEISKNMYGTLTAVDPKAEWMQMGWMFYYD 350
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P ++KA L VP GK+ +LD E +W T+ FYG PYIWC L NF GN +
Sbjct: 351 KKVWTPKRVKAFLTGVPQGKMSLLDYHCENVELWKTNDGFYGQPYIWCYLGNFGGNTTLT 410
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + A + M+GVG ++EG++ Y+ + + + H + WI++
Sbjct: 411 GNVKETGKRLDAALKAARRNMLGVGSTLEGLDVIQFPYEYVFDKVWTHSDKGNQQWIDEL 470
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G + P+++ AW +L+ ++ G + PS V +
Sbjct: 471 ADRHAGFTSPSVRKAWQILFDEIFVQVPGTY-----------SILPSRSPVLNDNH---- 515
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
SE + +P E + +L L + N + DLI + RQ L
Sbjct: 516 ----------SERTEIKYPA---QRLEEVWSLLLDVPQ----CERNELQVDLIAVGRQVL 558
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
+ AY D + Q + EL+ D+D L + + + W++ A+ L
Sbjct: 559 GNKFLAVKSEFDAAYAAKDITLLRQKAYEMEELLSDLDCLTSFNTRCTVNKWIDDARALG 618
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
+N E + YE NAR IT+W L DY ++ W GL+ YYG R +Y ++
Sbjct: 619 RNAEMKNYYERNARYLITLW-------GGHLSDYASRAWGGLIGSYYGGRWRLYIHDILA 671
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
S ++G F K + + + W + + D L + +++KY
Sbjct: 672 SAQTGKPFDQKAFDEKRSQFEQTWVHSTTPITLPQRNDLLTFCKMMFSKY 721
>gi|333031143|ref|ZP_08459204.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
gi|332741740|gb|EGJ72222.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
Length = 723
Score = 234 bits (598), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 157/530 (29%), Positives = 243/530 (45%), Gaps = 46/530 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP+ L+++FP A I LG W R C +L + DPLF +I R F+E+
Sbjct: 234 VLPAFAGHVPSELKHLFPEADIQHLGKWAGFADKYR--CNFL-NPNDPLFAKIQRLFLEE 290
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY- 126
Q + +G T HIY D F+E PP PEY+ + A +Y + D A WL WLF +
Sbjct: 291 QTRLFG-TDHIYGVDPFNEVDPPSWEPEYLKKVAADMYRTLTDVDPKAKWLQMTWLFYHG 349
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P+++ALL VP +L +LD E +W T+ F+G PYIWC L NF GN +
Sbjct: 350 KKKWTAPRIEALLTGVPQDELYLLDYHCENVELWKTTDYFHGQPYIWCYLGNFGGNTTIT 409
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + + G+G ++EG++ Y+ + + A+ +D +W+
Sbjct: 410 GNVKESGQRLENTLINGGNNFKGIGSTLEGLDVMQFPYEYIFDKAWTFN-MDDNSWVENL 468
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G+ A ++AW +L++ VY + + P+ P +
Sbjct: 469 ADRHLGKKSEAYREAWKILFNDVYVQVPKS-------LGVLPNFRPEM-----------S 510
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
KP +K V ++ + D +W EV + Y DLI + RQ L
Sbjct: 511 KP-NKRTV--NDYKNKDLVKVWAKLLEVKEC------------TRDAYIIDLITVGRQVL 555
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
Y + + YQ D G+ + E++ D++ L A H+ L W+ A+ L
Sbjct: 556 GNYFLVVKNEFDQMYQFKDLPGLESRGAKLREILNDLENLTAFHNHCTLEKWISDARALG 615
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
E + YE NAR IT W L DY ++ WSGL++DYY R +Y + E
Sbjct: 616 NTIELKDYYEKNARNLITTW-------GGSLNDYASRTWSGLIKDYYAKRWNLYIDSVTE 668
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+L+ F + + L W N GD L S++L++KY
Sbjct: 669 ALKENKKFNQSELNEKLNILEEAWVNKVETVTSYEQGDILELSKYLFDKY 718
>gi|393785791|ref|ZP_10373937.1| hypothetical protein HMPREF1068_00217 [Bacteroides nordii
CL02T12C05]
gi|392661410|gb|EIY54996.1| hypothetical protein HMPREF1068_00217 [Bacteroides nordii
CL02T12C05]
Length = 727
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 153/530 (28%), Positives = 250/530 (47%), Gaps = 46/530 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPAAL+ ++P A I LG W R Y L+ +PLF I + F+++
Sbjct: 235 VLPAFAGHVPAALKRIYPEANIQHLGKWAGFADTYR---CYFLNPEEPLFATIQKHFLQE 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + +G T HIY D F+E PP PEY+S + + +Y + + D A W+ W+F +D
Sbjct: 292 QTRLFG-TDHIYGVDPFNEVDPPSWEPEYLSQVSSDMYRTLTAADPKAEWMQMTWMFYHD 350
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P++KALL VP K+ +LD E +W ++ F+G PYIWC L NF GN +
Sbjct: 351 RKDWTAPRIKALLTGVPQDKMFLLDYHCENVELWKNTEHFHGQPYIWCYLGNFGGNTTLT 410
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + A + + + G+G ++EG++ Y+ + E A+ +D +AW+
Sbjct: 411 GNVKESGDRLDNALINGGSNLRGIGSTLEGLDVMQFPYEYIFEKAWDL-NLDNEAWLQNL 469
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G +++AW++L++ +Y V P T G NY
Sbjct: 470 ADRHAGTVSQPVREAWDILFNQIY--------------VQVPK--------TLGVLPNY- 506
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
+PV + ++ YS + +++A E + + + + R D+I + RQ L
Sbjct: 507 RPVMNKPNRRTVID--------YSNATLLQAWEKLLQATD--CNRDALRLDIITVGRQLL 556
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
Y + + Y + D G+ + E++ D+D L A H L WL A+ L
Sbjct: 557 GNYFLIVKDDFDRMYTVKDLPGLKARAAEMKEILNDLDRLNAFHSRCALDKWLADARALG 616
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
E + YE NAR IT W L DY ++ W+GL++DYY R +Y +I
Sbjct: 617 TTPEVKDYYEKNARNLITTW-------GGSLNDYASRTWAGLIKDYYSKRWDMYMDAVIS 669
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++E F K + W + + V G+ + +++L KY
Sbjct: 670 AVEGNREFDQKKLDESIKNFEDAWVDSTDPILVAPQGELMQYARFLLQKY 719
>gi|261199246|ref|XP_002626024.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis SLH14081]
gi|239594232|gb|EEQ76813.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis SLH14081]
Length = 752
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 140/491 (28%), Positives = 255/491 (51%), Gaps = 40/491 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF G VP A+ V P A++ W + +P++ T L DP + + ++FI +
Sbjct: 240 ILPAFPGYVPRAVTRVLPDAQVVNASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISK 297
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
++ YG +H Y D F+E P PE++ + ++S D +A W+MQGWLF +
Sbjct: 298 SIEAYGNVTHFYTLDQFNEMIPSSGDPEFLRKVSETTMEAIKSVDPEATWVMQGWLFYIF 357
Query: 127 DPFWRPPQMKALLNS-VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
+W +++A L++ +++LDLFAE P+W +K F+G ++WC + F GN +
Sbjct: 358 ADYWTTERIEAYLSAGKKFRDMLILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGL 417
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG + +I GP +A +++ MVGVG + EG N +V+ L+ + + +D + + +
Sbjct: 418 YGHVANITEGPAQA-MAQHPNMVGVGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHD 476
Query: 246 YSVRRY---GRSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
+ RRY GR+VP + +AW +L + YN T+ +V P + ++ + +
Sbjct: 477 WVTRRYSSHGRTVPNELYEAWQLLRLSAYNNTN---------LVDAPLLPHALFAASPSI 527
Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
+K +L E YD +++++A L I L ++Y+YD++D+
Sbjct: 528 N-------AKMPMLFIEGLLYD-------PADMLKAWGLMIKGA--LFGDSSYQYDIVDV 571
Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLE 420
TRQ L+ + ++ Y+ VF + + L +++ +D +L+ ++ F L W+
Sbjct: 572 TRQVLSDAFTLVLQDLKVKYKGGAPASVFMPIGDKLLIILKALDAVLSMNENFWLSSWIS 631
Query: 421 SAKQLAQNEEQEKQ-YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
+A+ A ++ + +E NAR QIT+W E +L DY K W+GL+ YY PR +
Sbjct: 632 AARASAGDDSEAADFFEHNARNQITIW----GSEVGVLDDYAQKQWAGLVSGYYTPRWRM 687
Query: 480 YFKYMIESLES 490
+ +Y+ ++ S
Sbjct: 688 FLEYLKDTPAS 698
>gi|423248233|ref|ZP_17229249.1| hypothetical protein HMPREF1066_00259 [Bacteroides fragilis
CL03T00C08]
gi|423253182|ref|ZP_17234113.1| hypothetical protein HMPREF1067_00757 [Bacteroides fragilis
CL03T12C07]
gi|392657082|gb|EIY50719.1| hypothetical protein HMPREF1067_00757 [Bacteroides fragilis
CL03T12C07]
gi|392660340|gb|EIY53954.1| hypothetical protein HMPREF1066_00259 [Bacteroides fragilis
CL03T00C08]
Length = 732
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 155/496 (31%), Positives = 236/496 (47%), Gaps = 58/496 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL F G VP ++ FP+A I G W + + L +DPLF ++ F E+
Sbjct: 235 VLQGFYGMVPNSMITKFPNADIRNAGKWITYQRPA------FLVPSDPLFAKVAEIFYEE 288
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q K +G S Y D F E N+ ++ I+ + IY M++ + +A+W++QGW S
Sbjct: 289 QKKLFGE-SRYYGGDPFHEGGNSKGIN----ITEAASNIYKAMKTNNPNAIWVLQGW--S 341
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQ--------FYGVPYIWCMLH 177
+P ALL + G+ +VLDL A +P W F +IWC L
Sbjct: 342 GNP------SVALLKGLKHGEALVLDLMACARPQWGGEPSSSFHREDGFLDHNWIWCALP 395
Query: 178 NFAGNIEMYGILDSIAFGPVEARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEK 236
NF G I MYG L S A G ++A V G+G + EGI NP+ YD++ +MA++ +
Sbjct: 396 NFGGRIGMYGKLQSYATGVIKAEHHPKGKYVCGIGTTPEGIGTNPINYDMVYDMAWRTDS 455
Query: 237 VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 296
+D+K+WI Y+ RYG + A L +VYNC A A P + +S
Sbjct: 456 IDIKSWIANYTTYRYGSENSNAKAAMLQLSTSVYNCPWAADGPQESYFCARPSLKIDYVS 515
Query: 297 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRY 356
S+ HL+Y V++ALE + + EL +TYRY
Sbjct: 516 ------------------------SWGTAHLYYQPINVLQALEHLLKAEKELGYIDTYRY 551
Query: 357 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
D++D+TRQ LA Y + I +AY+ + + +FL+++ D D LL+ FLLG
Sbjct: 552 DVVDITRQMLADYGKYIHKCISDAYKEKNIKKFDLYTSKFLQMILDQDLLLSTRKEFLLG 611
Query: 417 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
++ A N +++ + NA+ QIT W S L +Y +K W+G+L Y PR
Sbjct: 612 EYIRQADTCGSNPTEKRMFINNAKRQITSW----TSVNSSLHEYAHKEWNGILSTLYAPR 667
Query: 477 AAIYFKYMIESLESGD 492
+YF Y+ LE +
Sbjct: 668 WKVYFDYLHAKLEGKN 683
>gi|423269877|ref|ZP_17248849.1| hypothetical protein HMPREF1079_01931 [Bacteroides fragilis
CL05T00C42]
gi|423272668|ref|ZP_17251615.1| hypothetical protein HMPREF1080_00268 [Bacteroides fragilis
CL05T12C13]
gi|392700723|gb|EIY93885.1| hypothetical protein HMPREF1079_01931 [Bacteroides fragilis
CL05T00C42]
gi|392708745|gb|EIZ01850.1| hypothetical protein HMPREF1080_00268 [Bacteroides fragilis
CL05T12C13]
Length = 732
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 155/496 (31%), Positives = 236/496 (47%), Gaps = 58/496 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL F G VP ++ FP+A I G W + + L +DPLF ++ F E+
Sbjct: 235 VLQGFYGMVPNSMITKFPNADIRDAGKWITYQRPA------FLVPSDPLFAKVAEIFYEE 288
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q K +G S Y D F E N+ ++ I+ + IY M++ + +A+W++QGW S
Sbjct: 289 QKKLFGE-SRYYGGDPFHEGGNSKGIN----ITEAASNIYKAMKTNNPNAIWVLQGW--S 341
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQ--------FYGVPYIWCMLH 177
+P ALL + G+ +VLDL A +P W F +IWC L
Sbjct: 342 GNP------SVALLKGLKHGEALVLDLMACARPQWGGEPSSSFHREDGFLDHNWIWCALP 395
Query: 178 NFAGNIEMYGILDSIAFGPVEARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEK 236
NF G I MYG L S A G ++A V G+G + EGI NP+ YD++ +MA++ +
Sbjct: 396 NFGGRIGMYGKLQSYATGVIKAEHHPKGKYVCGIGTTPEGIGTNPINYDMVYDMAWRTDS 455
Query: 237 VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 296
+D+K+WI Y+ RYG + A L +VYNC A A P + +S
Sbjct: 456 IDIKSWIANYTTYRYGSENSNAKAAMLQLSTSVYNCPWAADGPQESYFCARPSLKIDYVS 515
Query: 297 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRY 356
S+ HL+Y V++ALE + + EL +TYRY
Sbjct: 516 ------------------------SWGTAHLYYQPINVLQALEHLLKAEKELGYIDTYRY 551
Query: 357 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
D++D+TRQ LA Y + I +AY+ + + +FL+++ D D LL+ FLLG
Sbjct: 552 DVVDITRQMLADYGKYIHKCISDAYKEKNIKKFDLYTSKFLQMILDQDLLLSTRKEFLLG 611
Query: 417 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
++ A N +++ + NA+ QIT W S L +Y +K W+G+L Y PR
Sbjct: 612 EYIRQADTCGSNPTEKRMFINNAKRQITSW----TSVNSSLHEYAHKEWNGILSTLYAPR 667
Query: 477 AAIYFKYMIESLESGD 492
+YF Y+ LE +
Sbjct: 668 WKVYFDYLHAKLEGKN 683
>gi|383122982|ref|ZP_09943669.1| hypothetical protein BSIG_0276 [Bacteroides sp. 1_1_6]
gi|251841923|gb|EES70003.1| hypothetical protein BSIG_0276 [Bacteroides sp. 1_1_6]
Length = 730
Score = 232 bits (591), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 148/533 (27%), Positives = 262/533 (49%), Gaps = 41/533 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSG+VP L+ ++P AKI ++ W S R ++ ++ D LF I + ++E+
Sbjct: 234 VLPAFSGHVPKELKAIYPDAKIHEMSQWGGYDSKYR---SHFIEPMDSLFNIIQKMYLEE 290
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q YG T HIY D F+E P + ++++ + IY + D++A WL W+F +D
Sbjct: 291 QTAIYG-TDHIYGIDPFNEVDSPNWNEDFLAKVSKKIYESIYQVDAEAKWLQMTWMFYHD 349
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P++++ L +VP KL++LD + + IW ++ +YG PY+WC L NF GN M
Sbjct: 350 QKKWTQPRIRSFLEAVPDDKLILLDYYCDSTEIWRNTEMYYGKPYMWCYLGNFGGNSMMV 409
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G LD + + + G+G ++EG + NP +Y+ + + A+ + + WI +
Sbjct: 410 GNLDDVDVKIEKLFVEGGENVYGLGATLEGFDVNPFMYEFVFDQAWDY-PLTTDQWIQNW 468
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G I AW+ L+ +Y AT ++ A P +
Sbjct: 469 AKCRGGNQDRHILKAWDSLHKKIYK--KYATAGQAVLMNARPML---------------- 510
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT-YRYDLIDLTRQA 365
V ++ +Y++ LW +E+++A + +NT YR+D+I++ RQ
Sbjct: 511 --VGTDSWNTYPDITYNNRDLWDIWTEMLKASHI----------NNTGYRFDVINVGRQV 558
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
L + + + Y D G+ + + + L+ D D LL+C F +G W++ A+
Sbjct: 559 LGNLFSSFRDHFTQCYSEKDIDGMKKWADQMDSLLIDTDRLLSCETNFSIGKWIDDARSF 618
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
+ E +++ YE NAR +T+W ++A+ L DY N+ W GL YY R + +I
Sbjct: 619 GKTEAEKEYYEENARCILTVW----GQKATQLNDYANRGWGGLTYSYYRERWKRFTTEVI 674
Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
+ SG F K + + +W + +P+ S + ++ ++ L KY+Q
Sbjct: 675 TASLSGQKFDEKQFYQSITDFEYEWTLSKEHHPIISGENPILLAKTLSEKYMQ 727
>gi|239615395|gb|EEQ92382.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis ER-3]
Length = 829
Score = 232 bits (591), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 140/494 (28%), Positives = 252/494 (51%), Gaps = 46/494 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF G VP A+ V P A++ W + +P++ T L DP + + ++FI +
Sbjct: 260 ILPAFPGYVPRAVTRVLPDAQVVNASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISK 317
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
++ YG +H Y D F+E P P+++ + ++S D +A W+MQGWLF +
Sbjct: 318 SIEAYGNVTHFYTLDQFNEMIPSSGDPKFLRKVSETTMEAIKSVDPEATWVMQGWLFYIF 377
Query: 127 DPFWRPPQMKALLNS-VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
+W +++A L++ +++LDLFAE P+W +K F+G ++WC + F GN +
Sbjct: 378 ADYWTTERIEAYLSAGKKFRDMLILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGL 437
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG + +I GP EA +++ MVGVG + EG N +V+ L+ + + +D + + +
Sbjct: 438 YGHVANITEGPAEA-MAQHPNMVGVGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHD 496
Query: 246 YSVRRYG---RSVPA-IQDAWNVLYHTVYNCTD--GATDKNRDVIVAFPDVDPSI-ISVT 298
+ RRY R+VP+ + +AW +L + YN T+ A + A P ++ + +
Sbjct: 497 WVTRRYSSHERTVPSELYEAWQLLRLSAYNNTNLVDAPLLPHALFAASPSINAKMPMLFI 556
Query: 299 EGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 358
EG L Y +++++A L I L ++Y+YD+
Sbjct: 557 EG--------------------------LLYDPADMLKAWGLMIKGA--LFGDSSYQYDI 588
Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGP 417
+D+TRQ L+ + ++ Y+ VF + + L +++ +D +L+ ++ F L
Sbjct: 589 VDVTRQVLSDAFTLVLQDLKVKYKGGAPASVFMPIGDKLLIILKALDAVLSMNENFWLSS 648
Query: 418 WLESAKQLAQNEEQEKQ-YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
W+ +A+ A +E + +E NAR QIT+W E +L DY K W+GL+ YY PR
Sbjct: 649 WISAARASAGDESEAADFFEHNARNQITIW----GSEVGVLDDYAQKQWAGLVSGYYTPR 704
Query: 477 AAIYFKYMIESLES 490
++ +Y+ ++ S
Sbjct: 705 WRMFLEYLKDTPAS 718
>gi|29345848|ref|NP_809351.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
VPI-5482]
gi|29337741|gb|AAO75545.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
thetaiotaomicron VPI-5482]
Length = 730
Score = 232 bits (591), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 149/533 (27%), Positives = 262/533 (49%), Gaps = 41/533 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSG+VP L+ ++P AKI ++ W S R ++ ++ D LF I + ++E+
Sbjct: 234 VLPAFSGHVPKELKAIYPDAKIHEMSQWGGYDSKYR---SHFIEPMDSLFNIIQKMYLEE 290
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q YG T HIY D F+E P + ++++ + IY + D++A WL W+F +D
Sbjct: 291 QTAIYG-TDHIYGIDPFNEVDSPNWNEDFLAKVSKKIYESIYQVDAEAKWLQMTWMFYHD 349
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P++++ L +VP KL++LD + + IW ++ +YG PY+WC L NF GN M
Sbjct: 350 QKKWTQPRIRSFLEAVPDDKLILLDYYCDSTEIWRNTEMYYGKPYMWCYLGNFGGNSMMV 409
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G LD + + + G+G ++EG + NP +Y+ + + A+ + + WI +
Sbjct: 410 GNLDDVDVKIEKLFVEGGENVYGLGATLEGFDVNPFMYEFVFDQAWDY-PLTTDQWIQNW 468
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G I AW+ L+ +Y AT ++ A P + V + Y
Sbjct: 469 AKCRGGNQDRHILKAWDSLHKKIYK--KYATAGQAVLMNARP------MLVGTDSWNTY- 519
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT-YRYDLIDLTRQA 365
+Y++ LW +E+++A + +NT YR+D+I++ RQ
Sbjct: 520 -----------PDITYNNRDLWDIWTEMLKASHI----------NNTGYRFDVINVGRQV 558
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
L + + + Y D G+ + + + L+ D D LL+C F +G W++ A+
Sbjct: 559 LGNLFSSFRDHFTQCYSEKDIDGMKKWADQMDALLIDTDRLLSCETNFSIGKWIDDARSF 618
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
+ E +++ YE NAR +T+W ++A+ L DY N+ W GL YY R + +I
Sbjct: 619 GKTEAEKEYYEENARCILTVW----GQKATQLNDYANRGWGGLTYSYYRERWKRFTTEVI 674
Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
+ SG F K + + +W + +P+ S + ++ ++ L KY+Q
Sbjct: 675 TASLSGQKFDEKQFYQSITDFEYEWTLSKEHHPIISGENPILLAKTLSEKYMQ 727
>gi|281200617|gb|EFA74835.1| alpha-N-acetylglucosaminidase [Polysphondylium pallidum PN500]
Length = 688
Score = 231 bits (589), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 113/248 (45%), Positives = 161/248 (64%), Gaps = 9/248 (3%)
Query: 7 AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
AVLP F+G+VP AL+ +P A ITQLG W + TY L+ DPLF +I +AF+
Sbjct: 249 AVLPGFAGHVPEALETHYPKANITQLGGWGTFSG------TYYLNPDDPLFSKIAQAFVI 302
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
Q + YG T H YN D F+E PP Y+ + ++++ + + D +W++QGW
Sbjct: 303 TQNQLYG-TDHFYNFDPFNELEPPSSDLTYLKNCSQSMFNNLIAADPQGIWVLQGWFLVD 361
Query: 127 DP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
DP FW PPQ +A L+ VP+GK++VLDL+++V P W+++ +YG +IWCMLHNF G M
Sbjct: 362 DPEFWLPPQTEAFLSGVPIGKMIVLDLWSDVIPAWNSTNYYYGHNWIWCMLHNFGGRSGM 421
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG + I+ P+EAR S + MVG G++ E IEQN +VYDLMSEMA++ D+K W++Q
Sbjct: 422 YGKIPFISTNPIEAR-SLSPNMVGTGLTPEAIEQNVIVYDLMSEMAWRSTPPDLKEWVDQ 480
Query: 246 YSVRRYGR 253
Y RRYG+
Sbjct: 481 YVTRRYGK 488
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 52/192 (27%), Positives = 83/192 (43%), Gaps = 8/192 (4%)
Query: 336 RALELFIASGNELSASNTYRYDLIDLTRQALAKY--ANELFLNIIEAYQLNDAHGVFQLS 393
L + ++ ++T+ +DL ++T QAL NEL LN A+ N + S
Sbjct: 489 HGLPFLSINDTSITNTSTFSFDLTEITTQALINLFMTNELQLN--SAFLNNSLEEFNKYS 546
Query: 394 RRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEE 453
L +++D+ + + + L+G W A+ L E YE NAR QIT+W
Sbjct: 547 EALLSIIQDVYTIASTQEMLLVGHWTARARALTPANESTNLYEMNARNQITLW----GPT 602
Query: 454 ASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNG 513
S + DY K W GL D+Y R ++ K + SL S F ++ + W
Sbjct: 603 YSDVHDYAYKLWGGLTEDFYLARWTLFVKELQYSLTSSQPFNSTLFQTNCEAVEEVWNLQ 662
Query: 514 RNVYPVESNGDA 525
YP G++
Sbjct: 663 TYPYPTIPTGNS 674
>gi|327356744|gb|EGE85601.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis ATCC 18188]
Length = 752
Score = 231 bits (589), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 139/494 (28%), Positives = 252/494 (51%), Gaps = 46/494 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF G VP A+ V P A++ W + +P++ T L DP + + ++FI +
Sbjct: 240 ILPAFPGYVPRAVTRVLPDAQVVNASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISK 297
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
++ YG +H Y D F+E P P+++ + ++S D +A W+MQGWLF +
Sbjct: 298 SIEAYGNVTHFYTLDQFNEMIPSSGDPKFLRKVSETTMEAIKSVDPEATWVMQGWLFYIF 357
Query: 127 DPFWRPPQMKALLNS-VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
+W +++A L++ +++LDLFAE P+W +K F+G ++WC + F GN +
Sbjct: 358 ADYWTTERIEAYLSAGKKFRDMLILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGL 417
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG + +I GP EA +++ MVGVG + EG N +V+ L+ + + +D + + +
Sbjct: 418 YGHVANITEGPAEA-MAQHPNMVGVGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHD 476
Query: 246 YSVRRYG---RSVPA-IQDAWNVLYHTVYNCTD--GATDKNRDVIVAFPDVDPSI-ISVT 298
+ RRY R+VP+ + +AW +L + YN T+ A + A P ++ + +
Sbjct: 477 WVTRRYSSHERTVPSELYEAWQLLRLSAYNNTNLVDAPLLPHALFAASPSINAKMPMLFI 536
Query: 299 EGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 358
EG L Y +++++A L I L ++Y+YD+
Sbjct: 537 EG--------------------------LLYDPADMLKAWGLMIKGA--LFGDSSYQYDI 568
Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGP 417
+D+TRQ L+ + ++ Y+ VF + + L +++ +D +L+ ++ F L
Sbjct: 569 VDVTRQVLSDAFTLVLQDLKVKYKGGAPASVFMPIGDKLLIILKALDAVLSMNENFWLSS 628
Query: 418 WLESAKQLAQNEEQEKQ-YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
W+ +A+ A ++ + +E NAR QIT+W E +L DY K W+GL+ YY PR
Sbjct: 629 WISAARASAGDDSEAADFFEHNARNQITIW----GSEVGVLDDYAQKQWAGLVSGYYTPR 684
Query: 477 AAIYFKYMIESLES 490
++ +Y+ ++ S
Sbjct: 685 WRMFLEYLKDTPAS 698
>gi|336371253|gb|EGN99592.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
lacrymans S7.3]
gi|336384013|gb|EGO25161.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 761
Score = 231 bits (589), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 147/503 (29%), Positives = 255/503 (50%), Gaps = 46/503 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNW--FSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
VLP+F+G VP A+ ++P+A I W F+++ L+ DPLF + +F+
Sbjct: 244 VLPSFTGFVPRAMHTLYPNASIVNGSQWSTFTIQH----TNDSFLEPFDPLFSTLQTSFM 299
Query: 66 EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
+ YG SHIY D ++E P + Y+SS+ +A ++ +++ D +AVW+MQGWLF
Sbjct: 300 TKYAAAYGNVSHIYTLDQYNEMMPYSGNTSYLSSISSATFASLRATDPEAVWMMQGWLFY 359
Query: 126 -YDPFWRPPQMKALLNSVPLG-KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
Y FW +++A L VP +++LDLF+E P W ++G +IWC LH+F GN+
Sbjct: 360 IYASFWTDERVEAYLGGVPGNDSMIILDLFSEAYPQWQRLNSYFGKQWIWCELHDFGGNM 419
Query: 184 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 243
G +++ PV+A + TMVG+G++MEG E N ++YD++ + A+ ++ +++
Sbjct: 420 GFEGNFENVTTQPVKALATPGNTMVGMGLTMEGQEGNEIMYDVLFDQAWSPTPINRTSYV 479
Query: 244 NQYSVRRYGRSVP----AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
+ ++ RRY +VP A +AW +L TVYN D I ++++T
Sbjct: 480 SAWTSRRY--NVPNLPTAATEAWEILASTVYNNQDPLLQATIKSIFELEPAINGLVNLTV 537
Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYST-SEVIRALELFIASGNELSASNT---YR 355
+ G P L+Y T + ++ AL+ + + E SA + ++
Sbjct: 538 LQ----GIPTG----------------LFYDTNTTIVPALQSLLQARQESSALDEVPEFQ 577
Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL--SRRFLELVEDMDGLLACHDGF 413
YD++ + RQ LA +L+ ++++ Y + + L++D+D +L F
Sbjct: 578 YDVVYIIRQLLANRFIDLYTSLVDTYNSTTSSSSDVSTAGAPLITLLKDVDSVLLTDTHF 637
Query: 414 LLGPWLESAKQLAQNEEQEKQY-EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
LL W+ +A+ A + Y E+NAR QIT+W + + DY +K W GL+ Y
Sbjct: 638 LLSNWISAARNWAHDNSTYAAYLEYNARNQITLWGPRGE-----VHDYASKQWGGLVGTY 692
Query: 473 YGPRAAIYFKYMIESLESGDGFR 495
Y R + Y+ S +G +
Sbjct: 693 YVQRWEEFVSYLSGSKANGTAYN 715
>gi|423241433|ref|ZP_17222546.1| hypothetical protein HMPREF1065_03169 [Bacteroides dorei
CL03T12C01]
gi|392641326|gb|EIY35103.1| hypothetical protein HMPREF1065_03169 [Bacteroides dorei
CL03T12C01]
Length = 754
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 162/558 (29%), Positives = 262/558 (46%), Gaps = 83/558 (14%)
Query: 8 VLPAFSGNVPAALQ--NVFPSAKITQLGNWFSVKSDPRWCCTY-------LLDATDPLFI 58
VLP +SG +P+ L S K + + S +S + +L DP F
Sbjct: 229 VLPGYSGMIPSKLDLGKRIDSGKEKKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFT 288
Query: 59 EIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWL 118
+I F E+ K YG TS Y+ D F E ++ G AI M+ + AVW+
Sbjct: 289 QIANLFYEETEKLYG-TSDYYSIDPFHEAKSLPAGLDF-GKAGRAIMDAMKKANPKAVWV 346
Query: 119 MQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW----- 173
+QGW + +P RP MKAL G L++LDLF+E +P+W G+P IW
Sbjct: 347 VQGW--TENP--RPEMMKAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKG 391
Query: 174 --------CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVY 224
C+L NF GN+ ++G +D + + + + G+G++MEGIE NPV++
Sbjct: 392 YEEHNWLFCLLENFGGNVGLHGRMDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMF 451
Query: 225 DLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVI 284
+LM E+ ++ EK + WI QY RYG +I+ AW +L + +YNC G
Sbjct: 452 ELMCELPWRAEKFTKEEWIKQYIRARYGTDDESIRQAWQILANGIYNCPAGNNQ------ 505
Query: 285 VAFPDVDPSIISVTEGKYQNY--GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI 342
+G +++ G+P ++ + SS+ +Y + A L +
Sbjct: 506 --------------QGPHESIFCGRP----SLNNFQASSWSKMCNYYDPTTTTEAARLMV 547
Query: 343 ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVED 402
+ ++ +N + YDL+D+TRQA+A A ++ + ++ D +R+FLEL+
Sbjct: 548 SVADKYRGNNNFEYDLVDITRQAIADRARIVYNYAVADFKSFDKKNYATHTRQFLELLMM 607
Query: 403 MDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGN 462
D LL F +G W++ A+ L E++ YEWNAR QIT W + + LRDY +
Sbjct: 608 QDKLLGTRKEFKVGNWIQQARNLGITSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAH 667
Query: 463 KYWSGLLRDYYGPRAAIYFKYMIESLE--------------SGDGFRLKDWRREWIKLTN 508
K W+GLLRD+Y R Y++ + + L+ + D + +W L
Sbjct: 668 KEWNGLLRDFYYKRWEKYWQVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEE 724
Query: 509 DWQNGRNVYPVESNGDAL 526
W +N+Y + GD +
Sbjct: 725 PWTLAKNIYAASAEGDCI 742
>gi|126307952|ref|XP_001365931.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Monodelphis
domestica]
Length = 481
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 107/216 (49%), Positives = 146/216 (67%), Gaps = 4/216 (1%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G++P A VFP A +T+L NW + C+YLL DPLF +G F+ +
Sbjct: 242 VLPAFAGHIPKAFTRVFPQANVTKLDNWIDFNCT--YSCSYLLAPEDPLFPVVGSLFLRE 299
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
KE+G T HIY+ D F+E PP +P Y+++ AA+Y M + D DAVWL QGWLF
Sbjct: 300 LAKEFG-TDHIYSADIFNEMDPPSSNPAYLAATTAAVYEAMVAVDVDAVWLFQGWLFQNH 358
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW+PPQMKA+L +VP G+ ++LDLFAE +P++S + FYG P+IWCMLHNF GN ++
Sbjct: 359 PDFWKPPQMKAVLEAVPRGRFLILDLFAESQPVYSRTNSFYGQPFIWCMLHNFGGNHGLF 418
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPV 222
G+LD++ GP AR N+T+VG G+ EGI QN +
Sbjct: 419 GVLDAVNRGPSTARLFPNSTIVGTGIVPEGINQNEI 454
>gi|322702923|gb|EFY94542.1| alpha-N-acetylglucosaminidase, putative [Metarhizium anisopliae
ARSEF 23]
Length = 589
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 154/507 (30%), Positives = 250/507 (49%), Gaps = 54/507 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF G VP A V P A T+ W + D T+L D + + +AFI +
Sbjct: 61 VLPAFPGFVPPAFSRVQPDANTTKAPRWTGLP-DTNTRDTFL-SPLDTSYARLQQAFISK 118
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q++ +G ++IY D F+E P + P Y+S + Y + + + AVWL+QGWLF
Sbjct: 119 QIEAFGNVTNIYTLDQFNEMPPTSNEPSYLSQVSTYTYKALTAANPAAVWLLQGWLFLNS 178
Query: 128 PFWRPPQMKALLNSVPLG--KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
W ++ A L P G ++VLDL++E +P W +K ++G P+IWC LH+F GN+ M
Sbjct: 179 GLWTEERVTAYLGG-PEGHNSMLVLDLYSESRPQWQRTKGYFGRPWIWCQLHDFGGNMGM 237
Query: 186 YGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
YG + I ++A RTS ++ G GM+ EG E N VVY ++ + A+ +D +
Sbjct: 238 YGQISDITVQSMDALRTSP--SLSGFGMTPEGYEGNEVVYQMLFDQAWTTTPIDTSGYFY 295
Query: 245 QYSVRRYG--RSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
Y VRRY ++ AW++L +Y+ +K+R V V G Y
Sbjct: 296 GYVVRRYAGVSQTNSLFQAWDILRQNIYD------NKDRQVPC-----------VGVGIY 338
Query: 303 QNYGKPVSKEAVLKSETSSYDHP-HLWYSTSEVIRALELFIASGNELSA---SNTYRYDL 358
QN + L + T ++ P ++Y + + +A L I + NE+ T++ D+
Sbjct: 339 QN----APSLSGLVNRTGNWPPPTKVYYDPATLKKAHSLLIQAANEIPQLWDIPTFQLDV 394
Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRR---------------FLELVEDM 403
+D+TRQ ++ N ++ + ++ + + +S R L+ + D+
Sbjct: 395 VDVTRQVMSNAFNTMYTDYVQTFNSQLSRQKSHISNRGGLQRRDDFATKGKQLLDFLTDL 454
Query: 404 DGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNK 463
D +LA + F L WL++A+ A+ +NAR+QIT W E+ L DY K
Sbjct: 455 DRVLATNQHFRLDSWLDAAQYWAKQTGANDLIAFNARSQITTWI----WESEALNDYAVK 510
Query: 464 YWSGLLRDYYGPRAAIYFKYMIESLES 490
WSGL R YY R +I+ + ++L S
Sbjct: 511 EWSGLTRSYYRGRWSIFVDGLNKALAS 537
>gi|374312699|ref|YP_005059129.1| alpha-N-acetylglucosaminidase [Granulicella mallensis MP5ACTX8]
gi|358754709|gb|AEU38099.1| Alpha-N-acetylglucosaminidase [Granulicella mallensis MP5ACTX8]
Length = 754
Score = 229 bits (584), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 160/514 (31%), Positives = 238/514 (46%), Gaps = 63/514 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWF--SVKSDPRWCCTYLLD-ATDPLFIEIGRAF 64
V PAF+G VP + + P + L W K+ PR T++L L+ +IG+ F
Sbjct: 258 VAPAFAGFVPQGFKRLHPEVETFTL-LWLRKEFKTIPRSTRTFILHPGQQELYRQIGKKF 316
Query: 65 IEQQLKEYGRTSHIYNCDTFDENTPPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQG 121
IE+ EYG + Y DTF+E PV E + G ++ +Q+GD W+MQG
Sbjct: 317 IEEYKAEYGEVEY-YLADTFNELEVPVREDHRYEDLERFGRTVFESIQAGDPKGTWVMQG 375
Query: 122 WLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPI---------WSTSKQFYGVPY 171
WLF YD FW ++ALL +P +++++D ++ P W K F+G P+
Sbjct: 376 WLFVYDSDFWNKESVEALLRGIPNDRMLIIDYANDLAPSVQGKYLPGQWKLQKAFFGKPW 435
Query: 172 IWCMLHNFAGNIEMYGILDSIAFGPVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEM 230
I M H F GN + G L +A P S E +VG GM EGIE N VVY+LM++
Sbjct: 436 INGMAHTFGGNNNIKGNLKLMATEPSTVLASPERGNLVGWGMCPEGIENNEVVYELMTDA 495
Query: 231 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDV 290
+Q E +D+ WI Y RYG PA+Q AW +L + Y+ T + A P V
Sbjct: 496 GWQSEAIDLATWIPAYCRSRYGDCPPAMQQAWELLLKSAYSSHIWMT---KQAWQAEPSV 552
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
P SV G P + RA+ELF++ +L+
Sbjct: 553 HPIAASVDAG-------PTFQ------------------------RAVELFLSCAPQLAK 581
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
S YR DLI+ QA+ +E ++A + R +E + +DGL+
Sbjct: 582 SELYRNDLIEFVSQAVGGRVDEALALAVQAGDAKQDEDAVAHAARAVEWMRRIDGLMNLR 641
Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
L W+++ + A+ +++ Y+ NAR IT W L DY ++ WSGL+R
Sbjct: 642 PDRRLETWMQATRAYAKTDDEATFYDENARLLITTW------GWPELSDYASRVWSGLIR 695
Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 504
DYY R +F ES +G F L W++ W+
Sbjct: 696 DYYAARWEAWF----ESRHTGRSFSLDLWQQTWL 725
>gi|265753065|ref|ZP_06088634.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|263236251|gb|EEZ21746.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length = 750
Score = 229 bits (584), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 163/558 (29%), Positives = 260/558 (46%), Gaps = 83/558 (14%)
Query: 8 VLPAFSGNVPAALQ--NVFPSAKITQLGNWFSVKSDPRWCCTY-------LLDATDPLFI 58
VLP +SG +P+ L S K + + S +S + +L DP F
Sbjct: 225 VLPGYSGMIPSKLDLGKRIDSGKEEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFT 284
Query: 59 EIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWL 118
I F E+ K YG TS Y+ D F E + + G AI M+ + AVW+
Sbjct: 285 RIANLFYEETEKLYG-TSDYYSIDPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWV 342
Query: 119 MQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW----- 173
+QGW + +P RP MKAL G L++LDLF+E +P+W G+P IW
Sbjct: 343 VQGW--TENP--RPEMMKAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKG 387
Query: 174 --------CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVY 224
C+L NF GN+ ++G +D + + + + G+G++MEGIE NPV++
Sbjct: 388 YEEHNWLFCLLENFGGNVGLHGRMDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMF 447
Query: 225 DLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVI 284
+LM E+ ++ EK + WI QY RYG +IQ AW +L + +YNC G
Sbjct: 448 ELMCELPWRAEKFTKEEWIKQYIRARYGTDDESIQQAWQILTNGIYNCPAGNNQ------ 501
Query: 285 VAFPDVDPSIISVTEGKYQNY--GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI 342
+G +++ G+P ++ + SS+ +Y + A L +
Sbjct: 502 --------------QGPHESIFCGRP----SLNNFQASSWSKMCNYYDPTTTTEAARLMV 543
Query: 343 ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVED 402
+ ++ +N + YDL+D+TRQA+A A ++ + ++ D +R+FLEL+
Sbjct: 544 SVADKYRGNNNFEYDLVDITRQAIADRARIVYNYAVADFKSFDKKNYNTHTRQFLELLMM 603
Query: 403 MDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGN 462
D LL F +G W++ A+ L E++ YEWNAR QIT W + + LRDY +
Sbjct: 604 QDKLLGTRKEFKVGNWIQQARNLGITPEEKDLYEWNARVQITTWGNRYCADIGKLRDYAH 663
Query: 463 KYWSGLLRDYYGPRAAIYFKYMIESLE--------------SGDGFRLKDWRREWIKLTN 508
K W+GLLRD+Y R Y++ + + L+ + D + +W L
Sbjct: 664 KEWNGLLRDFYYKRWEKYWQVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEE 720
Query: 509 DWQNGRNVYPVESNGDAL 526
W +N Y + GD +
Sbjct: 721 PWTLAKNTYAASAEGDCI 738
>gi|449299394|gb|EMC95408.1| glycoside hydrolase family 89 protein [Baudoinia compniacensis UAMH
10762]
Length = 801
Score = 229 bits (584), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 143/517 (27%), Positives = 251/517 (48%), Gaps = 62/517 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY-LLDATDPLFIEIGRAFIE 66
VLP F G VP + +P+A W PR L+ DPL+ + ++FI
Sbjct: 262 VLPCFPGFVPMQIGRYYPNAMYINGSQW---NGFPRQNTNVSFLEPFDPLYTTLQKSFIS 318
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-S 125
+Q YG S IY D ++EN P Y+ ++ A + +++ D +AVW++QGWLF S
Sbjct: 319 KQTAAYGNVSSIYTLDQYNENNPYSADTTYLRNISAGTIAALKAADPNAVWMLQGWLFFS 378
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
FW ++A L V +++LDLF+E +P W + +YG P+IWC LH++ GN+ +
Sbjct: 379 SATFWTDAAIRAYLGGVNNTDMIILDLFSETQPQWQRTNSYYGKPWIWCELHDYGGNMGL 438
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG ++++ P++A + ++TMVG+G++MEG E N ++YD++ + A+ ++ + +
Sbjct: 439 YGQVENVTINPIQALNNASSTMVGMGLTMEGQEGNEIMYDILLDQAWSSTPLNNSLYFHD 498
Query: 246 YSVRRYGRSV---PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
+ RY + P + AW+ + TVYN T +T + V + ++ P++ +
Sbjct: 499 WVTSRYHGAASLPPGLYTAWDTMRQTVYNNTQISTIQ--SVTKSIWELTPNVTGLLN--- 553
Query: 303 QNYGKPVSKEAVLKSETSSYDHP-HLWYSTSEVIRALELFIASGNE---LSASNTYRYDL 358
+ HP + Y+TS ++ A + F + + L S Y +DL
Sbjct: 554 -----------------RTGHHPTTIQYNTSTLVGAWKQFYGAAAQEPTLWDSPGYLFDL 596
Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ------LSRRFLELVEDMDGLLACH-- 410
D+TRQ +A L+ + + A + A+ + ++ + L+ +D +LA
Sbjct: 597 TDVTRQVMANAFYPLYTSFVSASN-HSANATYSPGNATIYGQQMVSLLSALDSMLAASPI 655
Query: 411 DGFLLGPWLESAK--------------QLAQNEEQEKQYEWNARTQITMWFDNTQEEASL 456
F L W+ A+ L + + YE+NAR QIT+W Q
Sbjct: 656 PYFHLSTWIAEARSWSAPTATLPNNATNLTSSSQTASFYEYNARNQITLWGPTGQ----- 710
Query: 457 LRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 493
+ DY +K W+GL+ YY PR ++ Y + + +G
Sbjct: 711 ISDYASKQWAGLISSYYVPRWQLFVNYTLNGTTASNG 747
>gi|212695333|ref|ZP_03303461.1| hypothetical protein BACDOR_04880 [Bacteroides dorei DSM 17855]
gi|212662112|gb|EEB22686.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides dorei DSM 17855]
Length = 754
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 164/556 (29%), Positives = 257/556 (46%), Gaps = 79/556 (14%)
Query: 8 VLPAFSGNVPAALQ--NVFPSAKITQLGNWFSVKSDPRWCCTY-------LLDATDPLFI 58
VLP +SG +P+ L S K + S +S + +L DP F
Sbjct: 229 VLPGYSGMIPSKLDLGKRIDSGKEEKTAGDTSSESAQSTLNKWNGFDRPGILLPDDPKFT 288
Query: 59 EIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWL 118
I F E+ K YG TS Y+ D F E + + G AI M+ + AVW+
Sbjct: 289 RIANLFYEETEKLYG-TSDYYSIDPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWV 346
Query: 119 MQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW----- 173
+QGW + +P RP MKAL G L++LDLF+E +P+W G+P IW
Sbjct: 347 VQGW--TENP--RPEMMKAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKG 391
Query: 174 --------CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVY 224
C+L NF GN+ ++G +D + + + + G+G++MEGIE NPV++
Sbjct: 392 YEEHNWLFCLLENFGGNVGLHGRMDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMF 451
Query: 225 DLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVI 284
+LM E+ ++ EK + WI QY RYG +IQ AW +L + +YNC G +
Sbjct: 452 ELMCELPWRAEKFTKEEWIKQYIRARYGTDDESIQQAWQILTNGIYNCPAGNNQQG---- 507
Query: 285 VAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIAS 344
SI G+P ++ + SS+ +Y + A L ++
Sbjct: 508 -----PHESIFC---------GRP----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSV 549
Query: 345 GNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMD 404
++ +N + YDL+D+TRQA+A A ++ + ++ D +R+FLEL+ D
Sbjct: 550 ADKYRGNNNFEYDLVDITRQAIADRARIVYNYAVADFKSFDKKNYNTHTRQFLELLMMQD 609
Query: 405 GLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKY 464
LL F +G W++ A+ L E++ YEWNAR QIT W + + LRDY +K
Sbjct: 610 KLLGTRKEFKVGNWIQQARNLGITPEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKE 669
Query: 465 WSGLLRDYYGPRAAIYFKYMIESLE--------------SGDGFRLKDWRREWIKLTNDW 510
W+GLLRD+Y R Y++ + + L+ + D + +W L W
Sbjct: 670 WNGLLRDFYYKRWEKYWQVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPW 726
Query: 511 QNGRNVYPVESNGDAL 526
+N Y + GD +
Sbjct: 727 TLAKNTYAASAEGDCI 742
>gi|409042145|gb|EKM51629.1| glycoside hydrolase family 89 protein [Phanerochaete carnosa
HHB-10118-sp]
Length = 749
Score = 229 bits (583), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 143/504 (28%), Positives = 257/504 (50%), Gaps = 43/504 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF+G VP+ + +P+A I W S L+ DPL+ ++ ++FI +
Sbjct: 233 ILPAFTGFVPSNMSAHYPNASIIDGSAWSGFPS--TLTNVSFLEPFDPLYPQMQQSFITK 290
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
Q + YG +H Y D ++EN P + Y+SS+ + + +++ D +A W+MQGWLF S
Sbjct: 291 QQEAYGNITHFYTLDQYNENNPFSGNDSYLSSVSTSTIASLRAADPEATWVMQGWLFFSS 350
Query: 127 DPFWRPPQMKALLNSVPLG-KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
+ FW +++A L +++LDL++E +P W+ + ++G ++WC LH++ GN+ +
Sbjct: 351 ETFWTNDRIEAYLGGAQGNDSMLILDLYSEAQPQWNRTDSYFGKQWVWCELHDYGGNMGL 410
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIE-QNPVVYDLMSEMAFQHEKVDVKAWIN 244
G L +I GP+ A S ++MVG+G++MEG+E N +VYD++ + A+ ++V W+
Sbjct: 411 EGNLAAITEGPIAALNSNGSSMVGMGLTMEGMEIGNEIVYDILLDQAWSSTPLNVSDWVA 470
Query: 245 QYSVRRY-GRSVPA-IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
+++ RRY +++P +Q AW +L T+YN D + I+ +++VT
Sbjct: 471 KWAARRYLVKTLPTELQQAWTILSTTIYNNQDPNSQATIKSILELEPATTGLVNVTG--- 527
Query: 303 QNYGKPVSKEAVLKSETSSYDHP-HLWYST-SEVIRALELFI-ASGNELSASNT--YRYD 357
HP + Y T + ++ AL+LF+ AS ++ S + D
Sbjct: 528 --------------------HHPTEIPYDTNTTILHALQLFVNASKSQPSLKQVPEFAVD 567
Query: 358 LIDLTRQALAKYANELFLNIIEAYQLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
+++L+RQ + +L+ ++I + + A V L L+ D+D LL ++ +L
Sbjct: 568 ILELSRQLMVNRFIDLYTDLINTWNSSSSTAQNVTTAGVPLLSLISDLDVLLYTNENYLF 627
Query: 416 GPWLESAKQLAQNEEQEKQY-EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 474
W+ AKQ A Y E+ AR Q T+W + DY +K +GL+ +YY
Sbjct: 628 STWIADAKQWAHGNVSYAAYLEYQARNQQTLWGPQGN-----INDYASKQTAGLVGEYYA 682
Query: 475 PRAAIYFKYMIESLESGDGFRLKD 498
R + + E SG + +
Sbjct: 683 TRWQTFVVMLAEQKTSGQPYNATE 706
>gi|261880010|ref|ZP_06006437.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
gi|270333326|gb|EFA44112.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
Length = 719
Score = 229 bits (583), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 155/533 (29%), Positives = 244/533 (45%), Gaps = 51/533 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSG+VP ++ ++P + I LG W + SD C Y + DPLF +I R ++E+
Sbjct: 232 VLPAFSGHVPGQIKQLYPESNIQHLGRWAAF-SDQYRC--YFMSPQDPLFAKIQRMYLEE 288
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q YG T HIY D F+E PP P+Y+ + IY + D A WL WLF +
Sbjct: 289 QRAIYG-TDHIYGIDPFNEVDPPSWDPDYLFQISKGIYQTLAHVDPKAEWLQMSWLFYHK 347
Query: 128 PF-WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P ++KAL+ V GK+V+LD F + IW + +FYG PYIWC L NF GN +
Sbjct: 348 KKKWTPERVKALITGVETGKMVLLDYFCDRNEIWKMTDKFYGQPYIWCYLGNFGGNTTVA 407
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + + T + GVG+++EG + Y+ + + + + WI+
Sbjct: 408 GNVKACGAKLDSTLTLGGKNLQGVGLTLEGFDVCQFPYEYILDKVWSGNSSE-NQWIDAL 466
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ G + P+ + AW +LYH V+ + G+ P P + S+
Sbjct: 467 ADSHVGYASPSFRKAWQLLYHDVFVQSAGSNG-------ILPCYRPELNSL--------- 510
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
++ + H+ Y ++I A +L + S + DLI RQ L
Sbjct: 511 --------------NWHYTHVDYDRQKLIEAWKLMQHDAD--SKRTAAQLDLIHYGRQVL 554
Query: 367 AKYANELFLN---IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
NE + AY D G+ + ++ D+D L A H L W++ A+
Sbjct: 555 ---GNEFLTHKQLFDSAYAHCDLAGMMAQAASMRHIMLDIDTLTAYHPRCTLAGWIDGAR 611
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
Q+A + YE NAR+ IT W L DY K W+GL+ DYY R YF +
Sbjct: 612 QMAPDSVCADYYEDNARSLITTW-------GGKLNDYACKGWAGLMSDYYLTRWERYFAH 664
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
I ++ + F + + +E + W + R++ VE++ + + + KY
Sbjct: 665 AINAVRAHRKFDQQAYDKEIARFELSWASHRDIPRVETHESLALYCKKIIQKY 717
>gi|345513909|ref|ZP_08793424.1| alpha-N-acetylglucosaminidase [Bacteroides dorei 5_1_36/D4]
gi|345456132|gb|EEO45798.2| alpha-N-acetylglucosaminidase [Bacteroides dorei 5_1_36/D4]
Length = 754
Score = 228 bits (582), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 163/556 (29%), Positives = 258/556 (46%), Gaps = 79/556 (14%)
Query: 8 VLPAFSGNVPAALQ--NVFPSAKITQLGNWFSVKSDPRWCCTY-------LLDATDPLFI 58
VLP +SG +P+ L S K + + S +S + +L DP F
Sbjct: 229 VLPGYSGMIPSKLDLGKRIDSGKEEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFT 288
Query: 59 EIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWL 118
I F E+ K YG TS Y+ D F E + + G AI M+ + AVW+
Sbjct: 289 RIANLFYEETEKLYG-TSDYYSIDPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWV 346
Query: 119 MQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW----- 173
+QGW + +P RP MKAL G L++LDLF+E +P+W G+P IW
Sbjct: 347 VQGW--TENP--RPEMMKAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKG 391
Query: 174 --------CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVY 224
C+L NF GN+ ++G +D + + + + G+G++MEGIE NPV++
Sbjct: 392 YEEHNWLFCLLENFGGNVGLHGRMDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMF 451
Query: 225 DLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVI 284
+LM E+ ++ EK + WI QY RYG +I+ AW +L + +YNC G +
Sbjct: 452 ELMCELPWRAEKFTKEEWIKQYIRARYGTDDESIRQAWQILANGIYNCPAGNNQQG---- 507
Query: 285 VAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIAS 344
SI G+P ++ + SS+ +Y + A L ++
Sbjct: 508 -----PHESIFC---------GRP----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSV 549
Query: 345 GNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMD 404
++ +N + YDL+D+TRQA+A A ++ + ++ D +R+FLEL+ D
Sbjct: 550 ADKYRGNNNFEYDLVDITRQAIADRARIVYNYAVADFKSFDKKNYATHTRQFLELLMMQD 609
Query: 405 GLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKY 464
LL F +G W++ A+ L E++ YEWNAR QIT W + + LRDY +K
Sbjct: 610 KLLGTRKEFKVGNWIQQARNLGITSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKE 669
Query: 465 WSGLLRDYYGPRAAIYFKYMIESLE--------------SGDGFRLKDWRREWIKLTNDW 510
W+GLLRD+Y R Y++ + + L+ + D + +W L W
Sbjct: 670 WNGLLRDFYYKRWEKYWQVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPW 726
Query: 511 QNGRNVYPVESNGDAL 526
+N Y + GD +
Sbjct: 727 TLAKNTYAASAEGDCI 742
>gi|423299508|ref|ZP_17277533.1| hypothetical protein HMPREF1057_00674 [Bacteroides finegoldii
CL09T03C10]
gi|408473317|gb|EKJ91839.1| hypothetical protein HMPREF1057_00674 [Bacteroides finegoldii
CL09T03C10]
Length = 727
Score = 228 bits (582), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 149/535 (27%), Positives = 261/535 (48%), Gaps = 49/535 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPA L+ ++P+AKI + W R ++ +D D L+ I + F+E+
Sbjct: 231 VLPAFAGHVPAELKKIYPNAKIYTMSQWGGFDKQYR---SHFIDPMDSLYSVIQKRFLEE 287
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY- 126
Q K YG T HIY D F+E P + E++S++ IY + S D +A WL W+F Y
Sbjct: 288 QTKIYG-TDHIYGIDPFNEVDSPDWNEEFLSNVSRKIYESLHSVDPEAQWLQMTWMFYYA 346
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P ++K+ L +VP KL++LD + + IW ++ +YG PYIWC L NF GN +
Sbjct: 347 KDKWTPSRIKSFLRAVPQDKLILLDYYCDHTEIWKKTEGYYGQPYIWCYLGNFGGNTMLA 406
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G L+ + + G+G+++E + NP++Y+ + E A++ + WI +
Sbjct: 407 GNLNDTYEKIHQVLAEGGQNIHGLGVTLEAFDVNPMMYEFVFEQAWEGAQ-PTDEWIATW 465
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G++ PA+ AW L+ +Y + PS+
Sbjct: 466 AKCRGGQTCPAVLKAWKELHEKIY-------------------IAPSLCG---------- 496
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEV-IRALELFIASGNELSASNT----YRYDLIDL 361
+AVL + + W + E +L++ G+ L + + +D++++
Sbjct: 497 -----QAVLMNARPQLEGVQGWNTFPEYKYDNKDLWVIWGSLLQVGSIDKPGHAFDVVNV 551
Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
RQ L ++ Y+ D G + ++R L+ D+D LLAC F +G W++
Sbjct: 552 GRQVLGNLFSDYRAQFTACYKRKDVKGAQEWAQRMDALLLDVDRLLACSPLFSMGKWIQD 611
Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
A+ EE++K YE NAR +T+W ++ + L DY N+ W+GL + +Y R +
Sbjct: 612 ARDCGTTEEEKKYYEENARCILTIW----GQKDTQLNDYANRSWAGLTKGFYRERWKRFT 667
Query: 482 KYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
++ ++++ F K + ++ +W + V S DA+ + L+NKY
Sbjct: 668 DSVLTAMQANRSFDAKKFHKDITDFEYEWTLQHETFSVSSGEDAVKVANELWNKY 722
>gi|358391826|gb|EHK41230.1| glycoside hydrolase family 89 protein [Trichoderma atroviride IMI
206040]
Length = 751
Score = 228 bits (582), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 148/506 (29%), Positives = 260/506 (51%), Gaps = 40/506 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF G VP + VFP ++ W + D TY+ + DP F ++ + FI +
Sbjct: 240 ILPAFPGFVPRNISRVFPGISLSTSPLWENFAEDLS-ADTYV-NPFDPHFTQLQKLFIGK 297
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YG + + D F+EN P Y+ ++ ++ ++S DA+W+MQ WLFS D
Sbjct: 298 QQELYGNVTKFWTLDQFNENQPLSSDLGYLRNVSQNTWTALKSASPDAIWVMQAWLFSAD 357
Query: 128 P-FWRPPQMKALLNSVPL-GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
FW ++A L + +++LDLFAE P W + FYG P+IWC LH++ GN+ +
Sbjct: 358 SSFWTNDAIEAFLGGITEDSDMLLLDLFAESAPQWLRTNSFYGKPWIWCELHDYGGNMGL 417
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG ++++ ++A ++++VG G++MEG E N ++YDL+ + A+ + +D + + +
Sbjct: 418 YGQIENVTINAMQA-VRNSSSLVGFGLTMEGQEGNEIMYDLLLDQAWSPKPIDTETYFHD 476
Query: 246 YSVRRYG-RSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
+ RYG +V ++ W +L TV+N T+ + I+ ++ P+I + G+
Sbjct: 477 WVSARYGTENVKSLYTGWELLRPTVFNNTNLTVNAVPKSIL---ELTPNINGLL-GRVGR 532
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRA-LELFIASGNELSASNTYRYDLIDLTR 363
+G + +YD + + +E+ +A LE GN Y+YDL+D TR
Sbjct: 533 HGTTI-----------NYDPAVMVDAWTELFKAGLEDVKLFGNP-----AYQYDLVDWTR 576
Query: 364 QALAKYANELFLNIIEAYQLN-DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
Q L + L+ +++ AY + +A + + L++ +D +LA ++ F L W+ +A
Sbjct: 577 QVLVNSFDGLYKDLVTAYNSSANAAEIRSRGSKLTALLKTLDAVLATNENFQLATWIAAA 636
Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
+ A N E+NAR Q+T+W Q E DY +K W+GL+ DYY R +
Sbjct: 637 R--ASNPSNTSFLEYNARNQVTLWGPTGQIE-----DYASKQWAGLVGDYYLGRWQQFID 689
Query: 483 YMIESLESGDG-----FRLKDWRREW 503
Y+ + S +L+ W +W
Sbjct: 690 YLATTKHSSYNQTAFYHKLQAWEIQW 715
>gi|237711645|ref|ZP_04542126.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 9_1_42FAA]
gi|229454340|gb|EEO60061.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 9_1_42FAA]
Length = 732
Score = 228 bits (582), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 163/560 (29%), Positives = 260/560 (46%), Gaps = 79/560 (14%)
Query: 8 VLPAFSGNVPAALQ--NVFPSAKITQLGNWFSVKSDPRWCCTY-------LLDATDPLFI 58
VLP +SG +P+ L S K + + S +S + +L DP F
Sbjct: 207 VLPGYSGMIPSKLDLGKRIDSGKEEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFT 266
Query: 59 EIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWL 118
I F E+ K YG TS Y+ D F E + + G AI M+ + AVW+
Sbjct: 267 RIANLFYEETEKLYG-TSDYYSIDPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWV 324
Query: 119 MQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW----- 173
+QGW + +P RP MKAL G L++LDLF+E +P+W G+P IW
Sbjct: 325 VQGW--TENP--RPEMMKAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKG 369
Query: 174 --------CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVY 224
C+L NF GN+ ++G +D + + + + G+G++MEGIE NPV++
Sbjct: 370 YEEHNWLFCLLENFGGNVGLHGRMDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMF 429
Query: 225 DLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVI 284
+LM E+ ++ EK + WI QY RYG +I+ AW +L + +YNC G +
Sbjct: 430 ELMCELPWRAEKFTKEEWIKQYIRARYGTDDESIRQAWQILANGIYNCPAGNNQQG---- 485
Query: 285 VAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIAS 344
SI G+P ++ + SS+ +Y + A L ++
Sbjct: 486 -----PHESIFC---------GRP----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSV 527
Query: 345 GNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMD 404
++ +N + YDL+D+TRQA+A A ++ + ++ D +R+FLEL+ D
Sbjct: 528 ADKYRGNNNFEYDLVDITRQAIADRARIVYNYAVADFKSFDKKNYATHTRQFLELLMMQD 587
Query: 405 GLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKY 464
LL F +G W++ A+ L E++ YEWNAR QIT W + + LRDY +K
Sbjct: 588 KLLGTRKEFKVGNWIQQARNLGITSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKE 647
Query: 465 WSGLLRDYYGPRAAIYFKYMIESLE--------------SGDGFRLKDWRREWIKLTNDW 510
W+GLLRD+Y R Y++ + + L+ + D + +W L W
Sbjct: 648 WNGLLRDFYYKRWEKYWQVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPW 704
Query: 511 QNGRNVYPVESNGDALITSQ 530
+N Y + GD + ++
Sbjct: 705 TLAKNTYAASAEGDCIEVAK 724
>gi|423230938|ref|ZP_17217342.1| hypothetical protein HMPREF1063_03162 [Bacteroides dorei
CL02T00C15]
gi|423244649|ref|ZP_17225724.1| hypothetical protein HMPREF1064_01930 [Bacteroides dorei
CL02T12C06]
gi|392630058|gb|EIY24060.1| hypothetical protein HMPREF1063_03162 [Bacteroides dorei
CL02T00C15]
gi|392641498|gb|EIY35274.1| hypothetical protein HMPREF1064_01930 [Bacteroides dorei
CL02T12C06]
Length = 754
Score = 228 bits (581), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 165/558 (29%), Positives = 259/558 (46%), Gaps = 83/558 (14%)
Query: 8 VLPAFSGNVPAALQ--NVFPSAKITQLGNWFSVKSDPRWCCTY-------LLDATDPLFI 58
VLP +SG +P+ L S K + + S +S + +L DP F
Sbjct: 229 VLPGYSGMIPSKLDLGKRIDSGKEEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFT 288
Query: 59 EIGRAFIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAV 116
I F E+ K YG TS Y+ D F E N P + G AI M+ + AV
Sbjct: 289 RIANLFYEETEKLYG-TSDYYSIDPFHEAKNLP---AELDFGKAGRAIMDAMKKANPKAV 344
Query: 117 WLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW--- 173
W++QGW + +P RP MKAL G L++LDLF+E +P+W G+P IW
Sbjct: 345 WVVQGW--TENP--RPEMMKAL----NPGDLLILDLFSECRPMW-------GIPSIWKRD 389
Query: 174 ----------CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPV 222
C+L NF GN+ ++G +D + + + + G+G++MEGIE NPV
Sbjct: 390 KGYEEHNWLFCLLENFGGNVGLHGRMDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPV 449
Query: 223 VYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRD 282
+++LM E+ ++ EK + WI QY RYG +I+ AW +L + +YNC G +
Sbjct: 450 MFELMCELPWRAEKFTKEEWIKQYIRARYGTDDESIRQAWQILANGIYNCPAGNNQQG-- 507
Query: 283 VIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI 342
SI G+P ++ + SS+ +Y + A L +
Sbjct: 508 -------PHESIFC---------GRP----SLNNFQASSWSKMCNYYDPTTTTEAARLMV 547
Query: 343 ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVED 402
+ ++ +N + YDL+D+TRQA+A A ++ + ++ D +R+FLEL+
Sbjct: 548 SVADKYRGNNNFEYDLVDITRQAIADRARIVYNYAVADFKSFDKKNYATHTRQFLELLMM 607
Query: 403 MDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGN 462
D LL F +G W++ A+ L E++ YEWNAR QIT W + + LRDY +
Sbjct: 608 QDKLLGTRKEFKVGNWIQQARNLGITSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAH 667
Query: 463 KYWSGLLRDYYGPRAAIYFKYMIESLE--------------SGDGFRLKDWRREWIKLTN 508
K W+GLLRD+Y R Y++ + + L+ + D + +W L
Sbjct: 668 KEWNGLLRDFYYKRWEKYWQVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEE 724
Query: 509 DWQNGRNVYPVESNGDAL 526
W +N Y + GD +
Sbjct: 725 PWTLAKNTYAASAEGDCI 742
>gi|319640296|ref|ZP_07995021.1| hypothetical protein HMPREF9011_00618 [Bacteroides sp. 3_1_40A]
gi|317388071|gb|EFV68925.1| hypothetical protein HMPREF9011_00618 [Bacteroides sp. 3_1_40A]
Length = 752
Score = 228 bits (581), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 162/564 (28%), Positives = 257/564 (45%), Gaps = 95/564 (16%)
Query: 8 VLPAFSGNVPA---------------ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA 52
VLP +SG +P+ L N + + L W +L
Sbjct: 227 VLPGYSGMIPSKLDLGKRIDGGKEEKTLSNTSSESAQSTLNKWNGFDRPG------ILLP 280
Query: 53 TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGD 112
DP F +I F E+ K YG TS Y+ D F E + + G AI M+ +
Sbjct: 281 DDPKFTQIASLFYEETEKLYG-TSDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKAN 338
Query: 113 SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYI 172
AVW++QGW + +P RP MKAL G L++LDLF+E +P+W G+P I
Sbjct: 339 PKAVWVVQGW--TENP--RPEMMKAL----NPGDLLILDLFSECRPMW-------GIPSI 383
Query: 173 W-------------CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIE 218
W C+L NF GN+ ++G +D + + + + G+G++MEGIE
Sbjct: 384 WKRDKGYEEHNWLFCLLENFGGNVGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIE 443
Query: 219 QNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATD 278
NPV+++LM E+ ++ EK + WI QY RYG +I AW +L + +YNC G
Sbjct: 444 NNPVMFELMCELPWRAEKFTKEEWIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQ 503
Query: 279 K--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIR 336
+ + + P ++ + + + K NY P + T+E R
Sbjct: 504 QGPHESIFCGRPSLN-NFQASSWSKMCNYYDPTT--------------------TAEAAR 542
Query: 337 ALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRF 396
L ++ ++ +N + YDL+D+TRQA+A A ++ + ++ D +R+F
Sbjct: 543 ---LMVSVAHKYRGNNNFEYDLVDITRQAIADRARIVYNYAVADFKSFDKKSYATHTRQF 599
Query: 397 LELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASL 456
LEL+ D LL F +G W++ A+ L E++ YEWNAR QIT W + +
Sbjct: 600 LELLIMQDKLLGTRKEFKVGNWIQQARNLGSTSEEKDLYEWNARVQITTWGNRYCADIGK 659
Query: 457 LRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE--------------SGDGFRLKDWRRE 502
LRDY +K W+GLLRD+Y R Y++ + + L+ + D + +
Sbjct: 660 LRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQLDGKLPVLPVGNSSTPTADNPAMT---ID 716
Query: 503 WIKLTNDWQNGRNVYPVESNGDAL 526
W L W +N Y + GD +
Sbjct: 717 WYALEEPWTLAKNTYAASAEGDCI 740
>gi|393784337|ref|ZP_10372502.1| hypothetical protein HMPREF1071_03370 [Bacteroides salyersiae
CL02T12C01]
gi|392666113|gb|EIY59630.1| hypothetical protein HMPREF1071_03370 [Bacteroides salyersiae
CL02T12C01]
Length = 728
Score = 228 bits (580), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 143/485 (29%), Positives = 228/485 (47%), Gaps = 54/485 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V F G VP L+ +P A+I + G W + + LD TDPLF + + E+
Sbjct: 230 VFQGFYGMVPNVLKKKYPDARIKEQGTWQTYQRPA------FLDPTDPLFDRVAAIYYEE 283
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G + + D F E ++ I M+ + AVW++QGW ++
Sbjct: 284 QKKLFG-DAEFFGGDPFHEGG--TSEGIHVKLAAQKILQAMRKVNPKAVWVLQGW--QHN 338
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS--TSKQFYGVP------YIWCMLHNF 179
P +K L++ + G+ ++LDL A +P W T+ F+ +IWC L NF
Sbjct: 339 P------VKDLMDGLNPGETIILDLMACERPQWGGVTTSMFHKPEGHQDHRWIWCALPNF 392
Query: 180 AGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVD 238
G ++G + S A G V A+ + G+G + EGI PVVYD++ +MA++ + +
Sbjct: 393 GGKTGLHGKMSSYASGAVFAKEHPMGRNICGIGTAPEGIGTVPVVYDMVYDMAWRTDSIQ 452
Query: 239 VKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVT 298
+ W+ Y+ RYG AW +L TVY C + I A P
Sbjct: 453 IPQWLTNYTYYRYGMEDTNCDKAWKILSETVYECHNELGGPVESYICARP---------- 502
Query: 299 EGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 358
A S++ + ++Y +++ A E S N + +TY YDL
Sbjct: 503 --------------ADTIDHVSTWGNARIFYEPVKMVEAWEFLYQSRNRFNHCDTYEYDL 548
Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
+D+TRQ L+ YA L ++EA+ + +G + S FL++++D D LL+ F+LG W
Sbjct: 549 VDVTRQVLSDYAKYLHKEMVEAFHQKNENGFMKYSTEFLDVIKDEDRLLSTRKEFMLGTW 608
Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
L A+ E+++++ NA+ +T W D + L DY NK WSGLL D+Y PR
Sbjct: 609 LTEAENAGCTPEEKRRFVTNAKRLVTTWTDRDSD----LHDYANKEWSGLLSDFYLPRWE 664
Query: 479 IYFKY 483
Y Y
Sbjct: 665 AYVTY 669
>gi|345517325|ref|ZP_08796802.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 4_3_47FAA]
gi|345457718|gb|EET14396.2| alpha-N-acetylglucosaminidase [Bacteroides sp. 4_3_47FAA]
Length = 754
Score = 228 bits (580), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 162/562 (28%), Positives = 255/562 (45%), Gaps = 91/562 (16%)
Query: 8 VLPAFSGNVPA---------------ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA 52
VLP +SG +P+ L N + + L W +L
Sbjct: 229 VLPGYSGMIPSKLDLGKRIDGGKEEKTLSNTSSESAQSTLNKWNGFDRPG------ILLP 282
Query: 53 TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGD 112
DP F +I F E+ K YG TS Y+ D F E + + G AI M+ +
Sbjct: 283 DDPKFTQIASLFYEETEKLYG-TSDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKAN 340
Query: 113 SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYI 172
AVW++QGW + +P RP MKAL G L++LDLF+E +P+W G+P I
Sbjct: 341 PKAVWVVQGW--TENP--RPEMMKAL----NPGDLLILDLFSECRPMW-------GIPSI 385
Query: 173 W-------------CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIE 218
W C+L NF GN+ ++G +D + + + + G+G++MEGIE
Sbjct: 386 WKRDKGYEEHNWLFCLLENFGGNVGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIE 445
Query: 219 QNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATD 278
NPV+++LM E+ ++ EK + WI QY RYG +I AW +L + +YNC G
Sbjct: 446 NNPVMFELMCELPWRAEKFTKEEWIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQ 505
Query: 279 KNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRAL 338
+ SI G+P ++ + SS+ +Y + A
Sbjct: 506 QG---------PHESIFC---------GRP----SLNNFQASSWSKMCNYYDPTTTAEAA 543
Query: 339 ELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLE 398
L ++ ++ +N + YDL+D+TRQA+A A ++ + ++ D +R+FLE
Sbjct: 544 RLMVSVAHKYRGNNNFEYDLVDITRQAIADRARIVYNYAVADFKSFDKKSYATHTRQFLE 603
Query: 399 LVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLR 458
L+ D LL F +G W++ A+ L E++ YEWNAR QIT W + + LR
Sbjct: 604 LLIMQDKLLGTRKEFKVGNWIQQARNLGSTSEEKDLYEWNARVQITTWGNRYCADIGKLR 663
Query: 459 DYGNKYWSGLLRDYYGPRAAIYFKYMIESLE--------------SGDGFRLKDWRREWI 504
DY +K W+GLLRD+Y R Y++ + + L+ + D + +W
Sbjct: 664 DYAHKEWNGLLRDFYYKRWEKYWQVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWY 720
Query: 505 KLTNDWQNGRNVYPVESNGDAL 526
L W +N Y + GD +
Sbjct: 721 ALEEPWTLAKNTYAASAEGDCI 742
>gi|320106778|ref|YP_004182368.1| alpha-N-acetylglucosaminidase [Terriglobus saanensis SP1PR4]
gi|319925299|gb|ADV82374.1| Alpha-N-acetylglucosaminidase [Terriglobus saanensis SP1PR4]
Length = 754
Score = 228 bits (580), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 163/514 (31%), Positives = 239/514 (46%), Gaps = 63/514 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWF--SVKSDPRWCCTYLLDATDP-LFIEIGRAF 64
V PAFSG VP + + P A+ L W K+ PR T++L + L+ IG+ F
Sbjct: 257 VAPAFSGFVPQGFKRLHPKAETFTL-LWLPEEFKTIPRSTRTFILHPGEQDLYRLIGKKF 315
Query: 65 IEQQLKEYGRTSHIYNCDTFDENTPPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQG 121
IE+ EYG + Y DTF+E PV E + G +Y G+ +GD + W+MQG
Sbjct: 316 IEEYKAEYGEVQY-YLADTFNELAVPVREEHRFEDLERFGRTVYEGILAGDPNGTWVMQG 374
Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPI---------WSTSKQFYGVPY 171
WLF YD FW + ALL +P +++++D ++ P W T K F+G +
Sbjct: 375 WLFVYDVAFWNSESVAALLRGIPNDRMLIIDYANDLAPAVKGKYAPGQWKTQKAFFGKQW 434
Query: 172 IWCMLHNFAGNIEMYGILDSIAFGPVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEM 230
I M H F GN + G L +A P TS E +VG GM EGIE N VVY+LM++
Sbjct: 435 INGMAHTFGGNNNVKGNLKLMASEPASVLTSPERGNLVGWGMCPEGIETNEVVYELMTDA 494
Query: 231 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDV 290
+Q E +D+K WI Y RYG P + +AW +L + Y+ T + P +
Sbjct: 495 GWQREAIDLKQWIPAYCRSRYGACPPVMLEAWTLLMQSAYSAHIWMTHQAWQTE---PSL 551
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
P+ SV G P + RA+ LF++ EL
Sbjct: 552 APAAASVDAG-------PTFR------------------------RAVALFLSCAPELGQ 580
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
YR DLI+L QA ++ F ++A Q + + + L + MD LL
Sbjct: 581 KELYRNDLIELVVQAAGGSVDQTFSLAVQAGQSHQNEVATEYAAHALGWMGRMDALLNLR 640
Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
L W+++A+ A+++++ Y+ NAR IT W L DY ++ WSGL R
Sbjct: 641 PDRRLETWMQAARSYAKSDDEAAYYDENARRLITTW------GWPELSDYASRAWSGLTR 694
Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 504
DYY R +F SL +G F L W++ W+
Sbjct: 695 DYYASRWEAWFA----SLHAGRPFSLDIWQQTWL 724
>gi|294777713|ref|ZP_06743164.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
gi|294448781|gb|EFG17330.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
Length = 752
Score = 228 bits (580), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 162/568 (28%), Positives = 259/568 (45%), Gaps = 95/568 (16%)
Query: 8 VLPAFSGNVPA---------------ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA 52
VLP +SG +P+ L N + + L W +L
Sbjct: 227 VLPGYSGMIPSKLDLGKRIDGGKEEKTLSNTSSESAQSTLNKWNGFDRPG------ILLP 280
Query: 53 TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGD 112
DP F +I F E+ K YG TS Y+ D F E + + G AI M+ +
Sbjct: 281 DDPKFTQIASLFYEETEKLYG-TSDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKAN 338
Query: 113 SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYI 172
AVW++QGW + +P RP MKAL G L++LDLF+E +P+W G+P I
Sbjct: 339 PKAVWVVQGW--TENP--RPEMMKAL----NPGDLLILDLFSECRPMW-------GIPSI 383
Query: 173 W-------------CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIE 218
W C+L NF GN+ ++G +D + + + + G+G++MEGIE
Sbjct: 384 WKRDKGYEEHNWLFCLLENFGGNVGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIE 443
Query: 219 QNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATD 278
NPV+++LM E+ ++ EK + WI QY RYG +I AW +L + +YNC G
Sbjct: 444 NNPVMFELMCELPWRAEKFTKEEWIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQ 503
Query: 279 K--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIR 336
+ + + P ++ + + + K NY P + T+E R
Sbjct: 504 QGPHESIFCGRPSLN-NFQASSWSKMCNYYDPTT--------------------TAEAAR 542
Query: 337 ALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRF 396
L ++ ++ +N + YDL+D+TRQA+A A ++ + ++ D +R+F
Sbjct: 543 ---LMVSVAHKYRGNNNFEYDLVDITRQAIADRARIVYNYAVADFKSFDKKSYATHTRQF 599
Query: 397 LELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASL 456
LEL+ D LL F +G W++ A+ L E++ YEWNAR QIT W + +
Sbjct: 600 LELLIMQDKLLGTRKEFKVGNWIQQARNLGSTSEEKDLYEWNARVQITTWGNRYCADIGK 659
Query: 457 LRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE--------------SGDGFRLKDWRRE 502
LRDY +K W+GLLRD+Y R Y++ + + L+ + D + +
Sbjct: 660 LRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQLDGKLPVLPVGNSSTPTADNPAMT---ID 716
Query: 503 WIKLTNDWQNGRNVYPVESNGDALITSQ 530
W L W +N Y + GD + ++
Sbjct: 717 WYALEEPWTLAKNTYAASAEGDCIEVAK 744
>gi|237719043|ref|ZP_04549524.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|229451821|gb|EEO57612.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
Length = 713
Score = 228 bits (580), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 160/540 (29%), Positives = 258/540 (47%), Gaps = 60/540 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPA L+ ++P A I LG W R C +L + D LF +I + F+++
Sbjct: 218 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 274
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G T HIY D F+E PP PEY+ + + +Y+ + + D A W+ W+F +D
Sbjct: 275 QKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFD 333
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W +MKALL VP K+++LD E +W ++ F+ PYIWC L NF GN +
Sbjct: 334 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLT 393
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + A + + G+G ++EG++ Y+ + E A+ + VD WI
Sbjct: 394 GNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECL 452
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G +++DAW L++ +Y V P T G Y
Sbjct: 453 ADRHVGCVSQSVRDAWKRLFNDIY--------------VQVPR--------TLGTLPGY- 489
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
+P + K ++ Y + L EV R L NE S+ + +R DLI + RQ
Sbjct: 490 RPALNKNSEKRTSNVYSNVEL----LEVWRKL-------NEASSDRRDAFRLDLITVGRQ 538
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
L Y ++ + + D + + E++ D+D L A H L W++ A++
Sbjct: 539 VLGNYFLDVKMEFDRMVETKDHQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARK 598
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
+ + + + YE NAR IT W L DY ++ W+GL+ DYY R +Y
Sbjct: 599 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYINTF 651
Query: 485 IESLESG---DGFRLKDWRRE----WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 536
I++ E G D +L+D +E W+ T+ ++V+ S D L++ S +L++KY
Sbjct: 652 IKAAEKGVEVDQKQLEDELKEIEEGWVNATDREDTRKDVH---STTDGLLSFSTFLFSKY 708
>gi|212541222|ref|XP_002150766.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
18224]
gi|210068065|gb|EEA22157.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
18224]
Length = 787
Score = 228 bits (580), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 156/489 (31%), Positives = 245/489 (50%), Gaps = 44/489 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP+F+G VP A+ V P+AK+ W S+ + C L+ D F + ++ I +
Sbjct: 241 VLPSFTGFVPRAITRVLPNAKVVPGSRWNVFSSN--YTCDTFLEPFDDNFALLQKSTISK 298
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q YG SHIY D ++EN P +P+Y+ ++ +++ D DAVWLMQ WLF
Sbjct: 299 QQAYYGNISHIYALDQYNENNPFSSNPDYLRNISRTTSQSLKAADPDAVWLMQSWLFLDA 358
Query: 128 PFWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW + A L+ V +++LDLFAE +P+W + +YG P+IWC +H++ GN+ +Y
Sbjct: 359 TFWNNVTICAYLSGVENNSDMLILDLFAESQPVWQLTDSYYGKPWIWCQVHDYGGNMGLY 418
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + +I A S +MVG G +ME E N +VYDL+ + A+ ++ + +
Sbjct: 419 GQIMNITENATAALASSG-SMVGFGHTMESQEGNEIVYDLLLDQAWSETPINTSQYFEDW 477
Query: 247 SVRRYG--RSVP-AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV--TEGK 301
RY + VP + DAW +L + YN T+ A+ I+ +++PSI + EG
Sbjct: 478 VTVRYAGTQHVPQQLFDAWEILRWSAYNNTNLASSSVPKSIL---ELEPSISGLLNREGH 534
Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALEL-FIASGNELSA--SNTYRYDL 358
+ T +YD P L V+ A L + A+ ELS + + YDL
Sbjct: 535 HPT--------------TINYD-PEL------VVEAWALTYEAALLELSLWDNPAFNYDL 573
Query: 359 IDLTRQALAKYANELFLNIIEAYQLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
I LTRQ L + +I Y + + R+ ++L++ +D +L ++ F L
Sbjct: 574 IFLTRQVLVNAFIPRYELLISFYNNENYSVPAIVSAGRQLIDLLQSLDTVLGTNECFQLA 633
Query: 417 PWLESAKQLAQ-NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
W+ A A N YE+NAR QIT+W N + + DY +K W+GL+ YY P
Sbjct: 634 QWINKAVSRAHGNTTLAAYYEYNARNQITLWGPNGE-----ISDYASKQWAGLISSYYVP 688
Query: 476 RAAIYFKYM 484
R I Y+
Sbjct: 689 RWQILVDYL 697
>gi|393788556|ref|ZP_10376683.1| hypothetical protein HMPREF1068_02963 [Bacteroides nordii
CL02T12C05]
gi|392654236|gb|EIY47884.1| hypothetical protein HMPREF1068_02963 [Bacteroides nordii
CL02T12C05]
Length = 732
Score = 227 bits (579), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 162/515 (31%), Positives = 244/515 (47%), Gaps = 63/515 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V F G VP AL+ FP A+I G W + + LD TDPLF ++ + E+
Sbjct: 233 VFQGFYGMVPNALKEKFPDARIKDQGIWGTYQRPA------FLDPTDPLFDKLAAIYYEE 286
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAA--IYSGMQSGDSDAVWLMQGWLFS 125
Q +G + + D F E + E I+ AA I M+ + AVW++QGW
Sbjct: 287 QKNLFGE-AQFFGGDPFHEG----GTSEGINVKLAAQKILQAMRKVNPQAVWVLQGW--Q 339
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSK-QFYGVP-------YIWCMLH 177
++P +K L+ V G+ ++LDL A +P W K + P +IWC L
Sbjct: 340 HNP------VKELMEGVKPGETIILDLMACERPQWGGVKTSMFHKPEGHWNHQWIWCALP 393
Query: 178 NFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEK 236
NF G ++G + S A GPV A+ + G+G + EGI PVVYD++ +MA++ +
Sbjct: 394 NFGGKTGLHGKMSSYASGPVFAKHHPMGKNICGIGTAPEGIGTIPVVYDMVYDMAWRTDS 453
Query: 237 VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 296
+ + W++ Y+ RYG AW +L T+Y C + I A P S
Sbjct: 454 IHIPQWLDNYTYYRYGTEDNNCNRAWKLLSETIYECHNELGGPVESYICARP-------S 506
Query: 297 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRY 356
T +G V ++Y +V++A +L S + S+TY Y
Sbjct: 507 DTIQHVSTWGNAV-----------------MFYDPMKVVKAWDLLYQSRKRFNHSDTYEY 549
Query: 357 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
DL D+TRQ L+ YA L ++ A+Q D + S +FL +++D D LL+ F+LG
Sbjct: 550 DLTDVTRQVLSDYAKYLHERMVLAFQKKDKERFMEYSGKFLNIIKDEDRLLSTRKEFMLG 609
Query: 417 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
WL A++ E+++++ NA+ IT W D + L DY NK WSGLL D+Y PR
Sbjct: 610 TWLAEAEKAGGTPEEKRRFVTNAKRLITTWTDTDSD----LHDYANKEWSGLLIDFYLPR 665
Query: 477 AAIYFKYMIESLESGDGFRLKDWRR---EWIKLTN 508
Y Y SL G D+ + EW+ LTN
Sbjct: 666 WEAYVTYKT-SLLYGKKLPYPDYSKMEQEWV-LTN 698
>gi|410095990|ref|ZP_11290981.1| hypothetical protein HMPREF1076_00159 [Parabacteroides goldsteinii
CL02T12C30]
gi|409227396|gb|EKN20294.1| hypothetical protein HMPREF1076_00159 [Parabacteroides goldsteinii
CL02T12C30]
Length = 753
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 167/550 (30%), Positives = 246/550 (44%), Gaps = 81/550 (14%)
Query: 11 AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQ 67
FSG VP L+N +P AKI + P WC LD TD LF GR F+E+
Sbjct: 233 GFSGYVPRELKNKYPDAKI---------QLQPSWCGFTGAAQLDPTDSLFSAFGRDFLEE 283
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+ K +G +Y D F E+ PP+D+PEY+S++G +IY Q D A+W MQ W
Sbjct: 284 EKKLFG-AHGVYAADPFHESRPPIDTPEYLSAVGNSIYKLFQDFDPSAIWAMQAWSL--- 339
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
R P +KA VP L++LDL +T +G P + LHNF G I ++G
Sbjct: 340 ---REPIVKA----VPKEHLLILDLNGGRSRQENTC---WGYPVVAGNLHNFGGRINLHG 389
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
L +A ++ + G G+ ME IEQNPV YDL EM ++VD++ W+ Y+
Sbjct: 390 DLRLLASNQYAVAKQKSPNVCGSGLFMESIEQNPVYYDLAFEMPLHADEVDIEEWLGDYA 449
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RRYG + AW L Y G R I+A P++ G G
Sbjct: 450 ERRYGAASENAHKAWLHLLEGPYR--PGTNGTERSSIIA---ARPALNVKKSGPNAGLGI 504
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
P YS VI+A L + ++L+AS YR+D++D+ RQ ++
Sbjct: 505 P--------------------YSPLLVIQAQGLLLKDADKLNASTPYRFDVVDIQRQLMS 544
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
+ EA+ D S RFLE++ D+D LL F WL A+
Sbjct: 545 NLGQAIHKKAAEAFVKKDKAAFTLHSNRFLEMLRDVDVLLRTRPEFNFDKWLTDARSWGT 604
Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
E++ E +A +T+W + L+ DY + W+GL+ YY R ++ + E
Sbjct: 605 TNEEKDLLEKDATALVTVW---GADGDPLIFDYSWREWTGLIDSYYLKRWEKFYAMLQEH 661
Query: 488 LESG--------------DGFR-------LKDWRREWIKLTNDWQNGRNVYPVESNGDAL 526
L+ G + FR L DW E++ TN + P+ + GD +
Sbjct: 662 LDEGNEYSEKGLPMTHGREAFRANDFYSELGDWELEFVSRTNKART-----PI-TQGDEI 715
Query: 527 ITSQWLYNKY 536
T+ +Y KY
Sbjct: 716 ETALKMYKKY 725
>gi|423223006|ref|ZP_17209475.1| hypothetical protein HMPREF1062_01661 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392640582|gb|EIY34381.1| hypothetical protein HMPREF1062_01661 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 755
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 162/549 (29%), Positives = 249/549 (45%), Gaps = 76/549 (13%)
Query: 11 AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK 70
FSG VP LQ +P AKI+ W T LD TDPLF E+G AF+E+Q K
Sbjct: 235 GFSGYVPRELQAKYPQAKISMKRKWCGFDG------TAQLDPTDPLFHEMGLAFLEEQDK 288
Query: 71 EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW 130
+G + +Y D F E+ PP+D+PEY++ +G I+ Q+ D+ A+W+MQ W D
Sbjct: 289 LFG-SYGVYAADPFHESAPPIDTPEYLTGVGQTIHKLFQTFDAGALWVMQAWSMRED--- 344
Query: 131 RPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 190
++ +VP L++LDL + + +G P I LHNF G I M+G L
Sbjct: 345 -------IVKAVPKESLLILDLNGSK----TAANGGWGYPVIAGNLHNFGGRINMHGDLA 393
Query: 191 SIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRR 250
+A + + + G G+ ME IEQNPV Y+L EM + + ++AW+ Y+ RR
Sbjct: 394 LLASNQYQKAKARYPNVCGSGLFMEAIEQNPVYYELAFEMPNHADSIPLQAWLAAYAERR 453
Query: 251 YGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS 310
YG A AW L Y G R IVA P++ G G P
Sbjct: 454 YGAKSAAAGKAWMYLLEGPYR--QGTNGTERSSIVA---ARPALNVKKSGPNAGLGIP-- 506
Query: 311 KEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYA 370
Y VIRA + ++L+ S YR+D++D+ RQ +
Sbjct: 507 ------------------YEPMLVIRAQSQLLKDADKLAFSKPYRFDIVDVQRQMMTNLG 548
Query: 371 NELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 430
+ EA+ D S RFLEL+ DMD LL + WL A+ + +E
Sbjct: 549 QLVHKKAAEAFASKDKAAFVLHSGRFLELLRDMDELLYTRSEYSFDRWLTEARSWGETKE 608
Query: 431 QEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 490
++ E +A + +T+W + + DY + W+GL+ YY PR ++ + L++
Sbjct: 609 EKDLMERDATSLVTIWGADGDPR---IFDYSWREWAGLINGYYLPRWQKFYTMLQGHLDA 665
Query: 491 GDGF---------------------RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITS 529
G + RL +W ++ Q G+ PV ++GD L+ +
Sbjct: 666 GTDYQEEGLSLAYGREDFRANDFYNRLAEWELAYVD-----QTGKARTPV-THGDELVVT 719
Query: 530 QWLYNKYLQ 538
+ L++KYL+
Sbjct: 720 RRLFDKYLK 728
>gi|336412606|ref|ZP_08592959.1| hypothetical protein HMPREF1017_00067 [Bacteroides ovatus
3_8_47FAA]
gi|335942652|gb|EGN04494.1| hypothetical protein HMPREF1017_00067 [Bacteroides ovatus
3_8_47FAA]
Length = 727
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 159/540 (29%), Positives = 258/540 (47%), Gaps = 60/540 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPA L+ ++P A I LG W R C +L + D LF +I + F+++
Sbjct: 232 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 288
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G T HIY D F+E PP PEY+ + + +Y+ + + D A W+ W+F +D
Sbjct: 289 QKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFD 347
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W +MKALL VP K+++LD E +W ++ F+ PYIWC L NF GN +
Sbjct: 348 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLT 407
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + A + + G+G ++EG++ Y+ + E A+ + VD WI
Sbjct: 408 GNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECL 466
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G +++DAW L++ +Y V P T G Y
Sbjct: 467 ADRHVGCVSQSVRDAWKRLFNDIY--------------VQVPR--------TLGTLPGY- 503
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
+P + K ++ Y + L EV R L NE + + +R DLI + RQ
Sbjct: 504 RPALNKNSEKRTSNVYSNVEL----LEVWRKL-------NEAPSDRRDAFRLDLITVGRQ 552
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
L Y ++ + + D + + + E++ D+D L A H L W++ A++
Sbjct: 553 VLGNYFLDVKMEFDRMVEAKDHQALKACAEKMKEILNDLDKLNAFHPYCSLDKWIDDARK 612
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
+ + + + YE NAR IT W L DY ++ W+GL+ DYY R +Y
Sbjct: 613 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYINTF 665
Query: 485 IESLESG---DGFRLKDWRRE----WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 536
I++ E G D +L+D +E W+ T+ ++V+ S D L++ S +L++KY
Sbjct: 666 IKAAEKGVEVDQKQLEDELKEIEEGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722
>gi|423722278|ref|ZP_17696454.1| hypothetical protein HMPREF1078_00517 [Parabacteroides merdae
CL09T00C40]
gi|409242419|gb|EKN35181.1| hypothetical protein HMPREF1078_00517 [Parabacteroides merdae
CL09T00C40]
Length = 752
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 162/544 (29%), Positives = 241/544 (44%), Gaps = 69/544 (12%)
Query: 11 AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQ 67
FSG VP L+ +P AKI + P WC LD TD LF IGR F+E+
Sbjct: 232 GFSGYVPRELKEKYPDAKI---------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEE 282
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+ K YG +Y D F E+ PPVD+PEY+ ++G AI+ D +++W MQ W
Sbjct: 283 EKKLYG-AHGVYAADPFHESQPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSL--- 338
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
R P +KA VP L++LDL +G P + LHNF G I ++G
Sbjct: 339 ---REPIVKA----VPKENLLILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHG 388
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
L +A +N + G G+ ME IEQNPV YDL EM ++V+++ W+ +Y+
Sbjct: 389 DLRLLASNQYVNAVKKNPNVCGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYA 448
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RRYG+ AW L Y G R I+A P++ G G
Sbjct: 449 DRRYGKPSENAHQAWLHLLEGPYR--PGTNGTERSSIIA---ARPAVNVKKSGPNAGLGI 503
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
P YS V++A L + L S+ YR+D++D+ RQ ++
Sbjct: 504 P--------------------YSPLSVVQAEGLLLKDAGRLKGSDPYRFDIVDIQRQLMS 543
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
+ EA++ D S RFLE++ D D LL F WL A+
Sbjct: 544 NLGQAIHKQAAEAFRKKDKEAFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGD 603
Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
N E++ +E +A +T+W + L+ DY + W+GL+ YY R ++ + +
Sbjct: 604 NSEEKDLFEKDATALVTVW---GADGDPLIFDYSWREWTGLIDGYYLKRWEKFYAMLQDH 660
Query: 488 LESGDGFRLKDWR----REWIKLTN------DWQ-----NGRNVYPVESNGDALITSQWL 532
L++G + KD RE + + DW+ V + GD + T+ L
Sbjct: 661 LDAGTNYSEKDLPQTHGRESFRANDFYSTLGDWELQFVSTPDKVRTPITQGDEVETATRL 720
Query: 533 YNKY 536
Y KY
Sbjct: 721 YKKY 724
>gi|294648124|ref|ZP_06725667.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
gi|292636508|gb|EFF54983.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
Length = 499
Score = 225 bits (573), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 159/540 (29%), Positives = 256/540 (47%), Gaps = 60/540 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPA L+ ++P A I LG W R C +L + D LF +I + F+++
Sbjct: 4 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 60
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G T HIY D F+E PP PEY+ + + +Y+ + + D A W+ W+F +D
Sbjct: 61 QKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFD 119
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W +MKALL VP K+++LD E +W ++ F+ PYIWC L NF GN +
Sbjct: 120 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLT 179
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + A + + G+G ++EG++ Y+ + E A+ + VD WI
Sbjct: 180 GNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECL 238
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G ++DAW L++ +Y V P T G Y
Sbjct: 239 ADRHVGCVSQPVRDAWKRLFNDIY--------------VQVPR--------TLGTLPGY- 275
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
+P + K ++ Y + L EV R L NE + + +R DLI + RQ
Sbjct: 276 RPALNKNSEKRTSNVYSNVELL----EVWRKL-------NEAPSDRRDAFRLDLITVGRQ 324
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
L Y ++ + + D + + E++ D+D L A H L W++ A++
Sbjct: 325 VLGNYFLDVKMEFDRMVEAKDHQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARK 384
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
+ + + + YE NAR IT W L DY ++ W+GL+ DYY R +Y
Sbjct: 385 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYVNTF 437
Query: 485 IESLESG---DGFRLKDWRRE----WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 536
I++ E G D +L+D +E W+ T+ ++V+ S D L++ S +L++KY
Sbjct: 438 IKAAEEGVEVDQKQLEDELKEIEEGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 494
>gi|224537227|ref|ZP_03677766.1| hypothetical protein BACCELL_02104 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521150|gb|EEF90255.1| hypothetical protein BACCELL_02104 [Bacteroides cellulosilyticus
DSM 14838]
Length = 755
Score = 225 bits (573), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 162/549 (29%), Positives = 249/549 (45%), Gaps = 76/549 (13%)
Query: 11 AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK 70
FSG VP LQ +P AKI+ W T LD TDPLF E+G AF+E+Q K
Sbjct: 235 GFSGYVPRELQAKYPQAKISMKRKWCGFDG------TAQLDPTDPLFHEMGLAFLEEQDK 288
Query: 71 EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW 130
+G + +Y D F E+ PP+D+PEY++ +G I+ Q+ D+ A+W+MQ W D
Sbjct: 289 LFG-SYGVYAADPFHESAPPIDTPEYLTGVGQTIHKLFQTFDAGALWVMQAWSMRED--- 344
Query: 131 RPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 190
++ +VP L++LDL + + +G P I LHNF G I M+G L
Sbjct: 345 -------IVKAVPKESLLILDLNGSK----TAANGGWGYPVIAGNLHNFGGRINMHGDLA 393
Query: 191 SIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRR 250
+A + + + G G+ ME IEQNPV Y+L EM + + ++AW+ Y+ RR
Sbjct: 394 LLASNQYQKAKARYPNVCGSGLFMEAIEQNPVYYELAFEMPNHADSIPLQAWLAAYAERR 453
Query: 251 YGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS 310
YG A AW L Y G R IVA P++ G G P
Sbjct: 454 YGAKSAAAGKAWMYLLEGPYR--RGTNGTERSSIVA---ARPALNVKKSGPNAGLGIP-- 506
Query: 311 KEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYA 370
Y VIRA + ++L+ S YR+D++D+ RQ +
Sbjct: 507 ------------------YEPMLVIRAQSQLLKDADKLAFSKPYRFDIVDVQRQMMTNLG 548
Query: 371 NELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 430
+ EA+ D S RFLEL+ DMD LL + WL A+ + +E
Sbjct: 549 QLVHKKAAEAFASKDKAAFALHSGRFLELLRDMDELLYTRSEYSFDRWLTEARSWGETKE 608
Query: 431 QEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 490
++ E +A + +T+W + + DY + W+GL+ YY PR ++ + L++
Sbjct: 609 EKDLMERDATSLVTIWGADGDPR---IFDYSWREWAGLINGYYLPRWQKFYTMLQGHLDA 665
Query: 491 GDGF---------------------RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITS 529
G + RL +W ++ Q G+ PV ++GD L+ +
Sbjct: 666 GTDYQEEGLSLAYGREDFRANDFYNRLAEWELAYVD-----QTGKARTPV-THGDELVVT 719
Query: 530 QWLYNKYLQ 538
+ L++KYL+
Sbjct: 720 RRLFDKYLK 728
>gi|237708859|ref|ZP_04539340.1| glycoside hydrolase family 89 protein [Bacteroides sp. 9_1_42FAA]
gi|345513372|ref|ZP_08792893.1| glycoside hydrolase family 89 protein [Bacteroides dorei 5_1_36/D4]
gi|423228941|ref|ZP_17215347.1| hypothetical protein HMPREF1063_01167 [Bacteroides dorei
CL02T00C15]
gi|423242228|ref|ZP_17223337.1| hypothetical protein HMPREF1065_03960 [Bacteroides dorei
CL03T12C01]
gi|423247755|ref|ZP_17228803.1| hypothetical protein HMPREF1064_05009 [Bacteroides dorei
CL02T12C06]
gi|229457285|gb|EEO63006.1| glycoside hydrolase family 89 protein [Bacteroides sp. 9_1_42FAA]
gi|345456211|gb|EEO47557.2| glycoside hydrolase family 89 protein [Bacteroides dorei 5_1_36/D4]
gi|392631297|gb|EIY25272.1| hypothetical protein HMPREF1064_05009 [Bacteroides dorei
CL02T12C06]
gi|392635177|gb|EIY29082.1| hypothetical protein HMPREF1063_01167 [Bacteroides dorei
CL02T00C15]
gi|392639514|gb|EIY33330.1| hypothetical protein HMPREF1065_03960 [Bacteroides dorei
CL03T12C01]
Length = 717
Score = 225 bits (573), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 156/501 (31%), Positives = 248/501 (49%), Gaps = 67/501 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
VLP + G VP + +LG +V +DP + C+Y L D F EI
Sbjct: 224 VLPGYCGMVPHNAKE--------KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISAL 272
Query: 64 FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ ++ K YG+T Y D F E +T V+ + + G AI M+ + DAVW+ Q
Sbjct: 273 YYKELTKLYGKTG-FYAIDPFHEGGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQA 327
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWC 174
W + +++ + G L+VLDL +E +P W K YG +++C
Sbjct: 328 W--------QDNPRTSMIEHLEAGDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYC 379
Query: 175 MLHNFAGNIEMYGILDSIAFGPVEARTSENT--TMVGVGMSMEGIEQNPVVYDLMSEMAF 232
ML NF GNI ++G +D++ G +A+T + T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 380 MLLNFGGNIGLHGKMDALINGFYDAKTDNHAGKTLCGVGMTPEGIENNPVMYELVMELPW 439
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
+ + W+ Y RYG A+Q AW++L + +YN + + V A P +
Sbjct: 440 REHRFTRDEWLKGYVYARYGVEDEALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGL 499
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
D YQ SS+ +Y+ +VI A L ++ ++
Sbjct: 500 DV---------YQ---------------VSSWSEMKEYYNPQDVIEAARLMVSVADKYQG 535
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLAC 409
+N + +DL+D+ RQALA+ + + A++ D VF+L S+ FL L+ D LL
Sbjct: 536 NNNFEFDLVDVLRQALAEKGRLMQKVVTAAFRAGDKQ-VFELASQHFLHLILLQDHLLGT 594
Query: 410 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 469
F +G W+E+A+ Q +E++ YEWNAR QIT W + + LRDY +K W+G+L
Sbjct: 595 RKEFKVGTWIEAARSAGQTQEEKALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGIL 654
Query: 470 RDYYGPRAAIYFKYMIESLES 490
+D+Y R YF Y+ L+
Sbjct: 655 KDFYFMRWKAYFDYLACVLDG 675
>gi|212693694|ref|ZP_03301822.1| hypothetical protein BACDOR_03214 [Bacteroides dorei DSM 17855]
gi|265755881|ref|ZP_06090348.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_33FAA]
gi|212663753|gb|EEB24327.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides dorei DSM 17855]
gi|263233959|gb|EEZ19560.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_33FAA]
Length = 718
Score = 225 bits (573), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 156/501 (31%), Positives = 248/501 (49%), Gaps = 67/501 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
VLP + G VP + +LG +V +DP + C+Y L D F EI
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISAL 273
Query: 64 FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ ++ K YG+T Y D F E +T V+ + + G AI M+ + DAVW+ Q
Sbjct: 274 YYKELTKLYGKTG-FYAIDPFHEGGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQA 328
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWC 174
W + +++ + G L+VLDL +E +P W K YG +++C
Sbjct: 329 W--------QDNPRTSMIEHLEAGDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYC 380
Query: 175 MLHNFAGNIEMYGILDSIAFGPVEARTSENT--TMVGVGMSMEGIEQNPVVYDLMSEMAF 232
ML NF GNI ++G +D++ G +A+T + T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNFGGNIGLHGKMDALINGFYDAKTDNHAGKTLCGVGMTPEGIENNPVMYELVMELPW 440
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
+ + W+ Y RYG A+Q AW++L + +YN + + V A P +
Sbjct: 441 REHRFTRDEWLKGYVYARYGVEDEALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGL 500
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
D YQ SS+ +Y+ +VI A L ++ ++
Sbjct: 501 DV---------YQ---------------VSSWSEMKEYYNPQDVIEAARLMVSVADKYQG 536
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLAC 409
+N + +DL+D+ RQALA+ + + A++ D VF+L S+ FL L+ D LL
Sbjct: 537 NNNFEFDLVDVLRQALAEKGRLMQKVVTAAFRAGDKQ-VFELASQHFLHLILLQDHLLGT 595
Query: 410 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 469
F +G W+E+A+ Q +E++ YEWNAR QIT W + + LRDY +K W+G+L
Sbjct: 596 RKEFKVGTWIEAARSAGQTQEEKALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGIL 655
Query: 470 RDYYGPRAAIYFKYMIESLES 490
+D+Y R YF Y+ L+
Sbjct: 656 KDFYFMRWKAYFDYLACVLDG 676
>gi|154492110|ref|ZP_02031736.1| hypothetical protein PARMER_01741 [Parabacteroides merdae ATCC
43184]
gi|154087335|gb|EDN86380.1| Alpha-N-acetylglucosaminidase (NAGLU) [Parabacteroides merdae ATCC
43184]
Length = 752
Score = 224 bits (572), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 158/544 (29%), Positives = 241/544 (44%), Gaps = 69/544 (12%)
Query: 11 AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQ 67
FSG VP L+ +P AKI + P WC LD TD LF IGR F+E+
Sbjct: 232 GFSGYVPRELKEKYPDAKI---------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEE 282
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+ K YG +Y D F E+ PPVD+PEY+ ++G AI+ D +++W MQ W
Sbjct: 283 EKKLYG-AHGVYAADPFHESQPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSLR-- 339
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
++++ +VP L++LDL +G P + LHNF G I ++G
Sbjct: 340 --------ESIVKAVPKENLLILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHG 388
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
L +A +N + G G+ ME IEQNPV YDL EM ++V+++ W+ +Y+
Sbjct: 389 DLRLLASNQYVNAVKKNPNVCGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYA 448
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RRYG+ AW L Y G R I+A P++ G G
Sbjct: 449 DRRYGKPSENAHQAWLHLLEGPYR--PGTNGTERSSIIA---ARPAVNVKKSGPNAGLGI 503
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
P YS V++A L + L S+ YR+D++D+ RQ ++
Sbjct: 504 P--------------------YSPLSVVQAEGLLLKDAGRLKGSDPYRFDIVDIQRQLMS 543
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
+ EA++ D S RFLE++ D D LL F WL A+
Sbjct: 544 NLGQAIHKQAAEAFRKKDKEAFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGD 603
Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
N E++ +E +A +T+W + L+ DY + W+GL+ YY R ++ + +
Sbjct: 604 NSEEKDLFEKDATALVTVW---GADGDPLIFDYSWREWTGLIDGYYLKRWEKFYAMLQDH 660
Query: 488 LESGDGFRLKDWR----REWIKLTN------DWQ-----NGRNVYPVESNGDALITSQWL 532
L++G + KD RE + + DW+ V + GD + T+ L
Sbjct: 661 LDAGTNYSEKDLPQTHGRESFRANDFYSTLGDWELQFVSTPDKVRTPITQGDEVETATRL 720
Query: 533 YNKY 536
Y KY
Sbjct: 721 YKKY 724
>gi|29349767|ref|NP_813270.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
VPI-5482]
gi|29341678|gb|AAO79464.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
thetaiotaomicron VPI-5482]
Length = 744
Score = 224 bits (571), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 146/489 (29%), Positives = 238/489 (48%), Gaps = 59/489 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAF 64
V P ++G VP + +LG + + +WC L D F +
Sbjct: 242 VFPGYAGMVPRNIGE--------KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMY 291
Query: 65 IEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
E+ K YG+ + Y+ D F E NT VD ++ G +I S M+ + +AVW+MQ W
Sbjct: 292 YEELEKLYGKAKY-YSMDPFHEGGNTEGVD----LAKAGTSIMSAMKKANPEAVWVMQAW 346
Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCM 175
+ +A+++++ G L+VLDL++E P +W K F +++CM
Sbjct: 347 --------QANPREAMVSTLDSGDLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCM 398
Query: 176 LHNFAGNIEMYGILDSIAFGPVEARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 234
L NF GN+ ++G ++ + G A N T+ GVG + EGIE NPV+++L+ E+ ++
Sbjct: 399 LLNFGGNVGLHGRMEQLVNGYYNACAHVNGKTLRGVGATPEGIENNPVMFELLYELPWRE 458
Query: 235 EKVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPS 293
E+ AW+ Y RYG + P + +AW L HTVYN + S
Sbjct: 459 ERFAPDAWLQAYLKARYGNDLSPEVAEAWRALEHTVYNAPKNYQGEG---------TVES 509
Query: 294 IISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 353
++ G +Q+ TS++ + L+YS +A L ++ ++ +N
Sbjct: 510 LLCARPGFHQD-------------RTSTWGYAKLFYSPDSTAKAARLLLSVADQYKGNNN 556
Query: 354 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 413
+ YDL+D+ RQ+LA N L I ++Y D + S++FLEL+ D LL+ F
Sbjct: 557 FEYDLVDVVRQSLADKGNVLLEEISQSYDRKDKDSFGKQSQQFLELILAQDSLLSTRKEF 616
Query: 414 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 473
+ WL +A+ L EE++K YEWNA IT+W D+ L DY ++ WSG+L+D Y
Sbjct: 617 SVSSWLNAARSLGTTEEEKKLYEWNASALITVWGDSIAANRGGLHDYSHREWSGILKDLY 676
Query: 474 GPRAAIYFK 482
R +F+
Sbjct: 677 YQRWKTFFE 685
>gi|383120707|ref|ZP_09941431.1| hypothetical protein BSIG_2292 [Bacteroides sp. 1_1_6]
gi|382984934|gb|EES68331.2| hypothetical protein BSIG_2292 [Bacteroides sp. 1_1_6]
Length = 736
Score = 224 bits (571), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 146/489 (29%), Positives = 238/489 (48%), Gaps = 59/489 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAF 64
V P ++G VP + +LG + + +WC L D F +
Sbjct: 234 VFPGYAGMVPRNIGE--------KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMY 283
Query: 65 IEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
E+ K YG+ + Y+ D F E NT VD ++ G +I S M+ + +AVW+MQ W
Sbjct: 284 YEELEKLYGKAKY-YSMDPFHEGGNTEGVD----LAKAGTSIMSAMKKANPEAVWVMQAW 338
Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCM 175
+ +A+++++ G L+VLDL++E P +W K F +++CM
Sbjct: 339 --------QANPREAMVSTLDSGDLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCM 390
Query: 176 LHNFAGNIEMYGILDSIAFGPVEARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 234
L NF GN+ ++G ++ + G A N T+ GVG + EGIE NPV+++L+ E+ ++
Sbjct: 391 LLNFGGNVGLHGRMEQLVNGYYNACAHVNGKTLRGVGATPEGIENNPVMFELLYELPWRE 450
Query: 235 EKVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPS 293
E+ AW+ Y RYG + P + +AW L HTVYN + S
Sbjct: 451 ERFAPDAWLQAYLKARYGNDLSPEVAEAWRALEHTVYNAPKNYQGEG---------TVES 501
Query: 294 IISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 353
++ G +Q+ TS++ + L+YS +A L ++ ++ +N
Sbjct: 502 LLCARPGFHQD-------------RTSTWGYAKLFYSPDSTAKAARLLLSVADQYKGNNN 548
Query: 354 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 413
+ YDL+D+ RQ+LA N L I ++Y D + S++FLEL+ D LL+ F
Sbjct: 549 FEYDLVDVVRQSLADKGNVLLEEISQSYDRKDKDSFGKQSQQFLELILAQDSLLSTRKEF 608
Query: 414 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 473
+ WL +A+ L EE++K YEWNA IT+W D+ L DY ++ WSG+L+D Y
Sbjct: 609 SVSSWLNAARSLGTTEEEKKLYEWNASALITVWGDSIAANRGGLHDYSHREWSGILKDLY 668
Query: 474 GPRAAIYFK 482
R +F+
Sbjct: 669 YQRWKTFFE 677
>gi|262406058|ref|ZP_06082608.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|294806855|ref|ZP_06765680.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
gi|345510563|ref|ZP_08790130.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
gi|262356933|gb|EEZ06023.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|294445884|gb|EFG14526.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
gi|345454460|gb|EEO49066.2| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
Length = 727
Score = 224 bits (571), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 159/540 (29%), Positives = 256/540 (47%), Gaps = 60/540 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPA L+ ++P A I LG W R C +L + D LF +I + F+++
Sbjct: 232 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 288
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G T HIY D F+E PP PEY+ + + +Y+ + + D A W+ W+F +D
Sbjct: 289 QKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFD 347
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W +MKALL VP K+++LD E +W ++ F+ PYIWC L NF GN +
Sbjct: 348 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLT 407
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + A + + G+G ++EG++ Y+ + E A+ + VD WI
Sbjct: 408 GNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECL 466
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G ++DAW L++ +Y V P T G Y
Sbjct: 467 ADRHVGCVSQPVRDAWKRLFNDIY--------------VQVPR--------TLGTLPGY- 503
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
+P + K ++ Y + L EV R L NE + + +R DLI + RQ
Sbjct: 504 RPALNKNSEKRTSNVYSNVEL----LEVWRKL-------NEAPSDRRDAFRLDLITVGRQ 552
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
L Y ++ + + D + + E++ D+D L A H L W++ A++
Sbjct: 553 VLGNYFLDVKMEFDRMVEAKDHQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARK 612
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
+ + + + YE NAR IT W L DY ++ W+GL+ DYY R +Y
Sbjct: 613 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYVNTF 665
Query: 485 IESLESG---DGFRLKDWRRE----WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 536
I++ E G D +L+D +E W+ T+ ++V+ S D L++ S +L++KY
Sbjct: 666 IKAAEEGVEVDQKQLEDELKEIEEGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722
>gi|358378969|gb|EHK16650.1| glycoside hydrolase family 89 protein [Trichoderma virens Gv29-8]
Length = 748
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 141/485 (29%), Positives = 255/485 (52%), Gaps = 41/485 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF G VP + VFP ++ W + ++ ++ DP F ++ + FI +
Sbjct: 239 ILPAFPGFVPRNISRVFPDISLSTSPIWSNFGTEL--SADIYINPFDPRFAQLQKLFISK 296
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YG ++ + D F+EN P Y+ ++ +S +++ D +AVW+MQ WLFS D
Sbjct: 297 QQELYGNVTNFWTLDQFNENQPLSGDLGYLQNVSHNTWSALKAADPEAVWVMQAWLFSSD 356
Query: 128 -PFWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
FW ++++ L +P+ +++LDLFAE P W + FYG P+IWC LH++ GN+ +
Sbjct: 357 SAFWTNDRIESFLGGIPVNSDMLLLDLFAESAPQWLRTNSFYGKPWIWCELHDYGGNMGL 416
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG ++++ ++A + ++VG G++MEG E N ++YDL+ + A+ + +D + + +
Sbjct: 417 YGQIENVTINSMDA-VRNSGSLVGFGLTMEGQEGNEIMYDLLLDQAWSPKPIDTETYFHD 475
Query: 246 YSVRRYG-RSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
+ RYG ++V ++ W +L TV+N T+ + + I+ ++ PS + G+ +
Sbjct: 476 WVSTRYGTKNVKSLYTGWELLRPTVFNNTNLTMNAVQKSIL---ELVPSTTGLL-GRVGH 531
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRA-LELFIASGNELS--ASNTYRYDLIDL 361
+G ++ Y+ + ++ A ELF A ++ + Y+YDL+D
Sbjct: 532 HGTTIT------------------YNPAVMVEAWTELFKAGLQDIKLFTNPAYQYDLVDW 573
Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSR--RFLELVEDMDGLLACHDGFLLGPWL 419
TRQ L L+ +++ AY + V + SR + + L+ +D +LA ++ F L PW+
Sbjct: 574 TRQVLVNSFEGLYKDLVAAYNSAASSSVIK-SRGAKLIALLRTLDAVLATNEHFQLTPWI 632
Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
A+ A + E+NAR QIT+W E DY +K W+GL+ YY R
Sbjct: 633 NEAR--ASSPSTADFLEYNARNQITLWGPQGNIE-----DYASKQWAGLVGTYYVERWQQ 685
Query: 480 YFKYM 484
+ Y+
Sbjct: 686 FIDYL 690
>gi|346323119|gb|EGX92717.1| alpha-N-acetylglucosaminidase, putative [Cordyceps militaris CM01]
Length = 742
Score = 223 bits (568), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 147/496 (29%), Positives = 248/496 (50%), Gaps = 38/496 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF G VP A + P A + + W + +D L D + E+ R F+E
Sbjct: 223 ILPAFPGFVPDAFARLRPGADLVRAPAWGGLPADSPNTRALFLSPLDDAYAELQRLFVEA 282
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF--S 125
Q++ YG +++Y D F+E P + +Y+S++ Y+ + + + AVWLMQGWLF S
Sbjct: 283 QIEAYGNVTNVYAMDQFNEINPVSGATDYLSAVSRRSYAALAAANPAAVWLMQGWLFYLS 342
Query: 126 YDPFWRPPQMKALLNSVP-LGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
FW +++A L +V+LDLF+E P W + + G P+IWC +H+F GN
Sbjct: 343 EGNFWTQERIEAYLRGPEDRAGMVILDLFSETAPQWQRTGSYAGRPWIWCQVHDFGGNQN 402
Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
++G + + P+EA E+ +MVG+G++ E E N V+YDL + + +D ++ +
Sbjct: 403 LFGKITNTTVNPMEA-LRESDSMVGLGIATEAYEGNEVLYDLFFDQGWSATPIDTVSYFH 461
Query: 245 QYSVRRYG--RSVPA-IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
++ RRY R +PA + AW +L TVY+ A+D V V+ ++P++ +
Sbjct: 462 DWTTRRYSGVRQLPASLYQAWELLRVTVYDYR--ASDLI-GVPVSVYQLEPNLTGL---- 514
Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGN---ELSASNTYRYDL 358
Y K L + ++ P +W LF+A+ L A +R DL
Sbjct: 515 ---YNTTTGKPTALHYDPAAL--PPIW----------RLFVAAAAAQPRLWAEPGFRLDL 559
Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHG-VFQLSRRFLELVEDMDGLLACHDGFLLGP 417
+D+ RQ L+ L+ +++ A+ + Q +R ++ D+D LLA F L
Sbjct: 560 VDVMRQVLSNAFGRLYADLVAAFTGGAPPSEIAQRGQRMRAVLGDVDALLATQPHFSLRR 619
Query: 418 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 477
WL +A+ ++ + + AR+Q+T+W T LL DY K WSGL+ YY R
Sbjct: 620 WLNAARAWGESTGENAAIAYEARSQVTIWAPGT-----LLNDYAAKAWSGLIATYYDERW 674
Query: 478 AIYFKYMIESLESGDG 493
I+ ++++ E+ G
Sbjct: 675 RIFVDRLVDAAENHGG 690
>gi|410096483|ref|ZP_11291470.1| hypothetical protein HMPREF1076_00648 [Parabacteroides goldsteinii
CL02T12C30]
gi|409226447|gb|EKN19356.1| hypothetical protein HMPREF1076_00648 [Parabacteroides goldsteinii
CL02T12C30]
Length = 718
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 162/545 (29%), Positives = 265/545 (48%), Gaps = 75/545 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
V P +SG VP + +LG +V SDP C Y L TDP F EI
Sbjct: 224 VFPGYSGMVPHNAKE--------KLG--LNV-SDPGLWCGYHRPAFLQPTDPRFQEIASL 272
Query: 64 FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ ++ K YG+ ++ Y+ D F E + VD + + G AI M+ + AVW+ Q
Sbjct: 273 YYKELNKLYGK-ANFYSMDPFHEGGSVAGVD----LDAAGKAIMQAMKKNNPKAVWVAQA 327
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQ-------FYGVPYIWC 174
W + ++ ++ G ++VLDLF+E +P W + F +I+C
Sbjct: 328 W--------QANPRSQMIENLKAGDMIVLDLFSESRPQWGDPESTWHRKDGFGQHDWIYC 379
Query: 175 MLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQ 233
ML N+ GN+ ++G + + +A+ S T+ GVGM+MEG E NPV+++L++E+ ++
Sbjct: 380 MLLNYGGNVGLHGKMAHVIDEYYKAKESSFGKTLCGVGMTMEGSENNPVMFELLTELPWR 439
Query: 234 HEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVD 291
D W+ Y+V RYG++ P +Q+AW +L +++YNC T + + + A P
Sbjct: 440 PVHFDKNEWLKNYTVARYGKANPTVQEAWILLSNSIYNCPPENTQQGTHESIFCARPSDH 499
Query: 292 PSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS 351
P ++S SE S Y Y+ +VIRA + ++ ++ + +
Sbjct: 500 PYLVSSW------------------SEMSDY------YNPDDVIRAAAMMVSVADQFTGN 535
Query: 352 NTYRYDLIDLTRQALAKYANELFLNIIEA-YQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
N + YDL+D+ RQA+A+ L ++EA + D + RFL+L+ D LL
Sbjct: 536 NNFEYDLVDIVRQAIAE-KGRLVEKVVEASFASGDKQLYNTAANRFLQLLLLQDELLGTR 594
Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
F +G W+ + L E++ YEWNAR QIT W + + LRDY +K W+G+L+
Sbjct: 595 PEFKVGNWIARTRSLGNTPEEKDLYEWNARVQITTWGNRNAADKGGLRDYAHKEWNGILK 654
Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRRE--WIKLTNDWQNGRNVYPVESNGDALIT 528
D+Y R +F Y E L+ + + E W KLT+ Y E GD + T
Sbjct: 655 DFYYMRWKTWFDYQNELLDGKKPTAIDFYALEEPWTKLTDS-------YSSEPEGDCIST 707
Query: 529 SQWLY 533
+ ++
Sbjct: 708 VKRIF 712
>gi|298480128|ref|ZP_06998327.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
gi|336404356|ref|ZP_08585054.1| hypothetical protein HMPREF0127_02367 [Bacteroides sp. 1_1_30]
gi|298273937|gb|EFI15499.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
gi|335943684|gb|EGN05523.1| hypothetical protein HMPREF0127_02367 [Bacteroides sp. 1_1_30]
Length = 727
Score = 222 bits (566), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 156/537 (29%), Positives = 250/537 (46%), Gaps = 54/537 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPA L+ ++P A I LG W R C +L + D LF +I + F+++
Sbjct: 232 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 288
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G T HIY D F+E PP PEY+ + + +Y+ + + D A W+ W+F +D
Sbjct: 289 QKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFD 347
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W +MKALL VP K+++LD E +W ++ F+ PYIWC L NF GN +
Sbjct: 348 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLT 407
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + A + + G+G ++EG++ Y+ + E A+ + VD WI
Sbjct: 408 GNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECL 466
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G +++DAW L++ +Y V P T G Y
Sbjct: 467 ADRHVGCVSQSVRDAWKRLFNDIY--------------VQVPR--------TLGTLPGY- 503
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
+P + K ++ Y + L EV R L NE + + +R DLI + RQ
Sbjct: 504 RPALNKNSEKRTSNVYSNVEL----LEVWRKL-------NEAPSDRRDAFRLDLITVGRQ 552
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
L Y ++ + + D + + E++ D+D L A H L W++ A++
Sbjct: 553 VLGNYFLDVKMEFDRMVEAKDHQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARK 612
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
+ + + + YE NAR IT W L DY ++ W+GL+ DYY R +Y
Sbjct: 613 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYIDTF 665
Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE----SNGDALIT-SQWLYNKY 536
I+++ G K E ++ W N + V S D L++ S +L++KY
Sbjct: 666 IKAVGEGVEVDQKQLEDELKEIEEGWVNATDRKDVRKDVHSTTDGLLSFSTFLFSKY 722
>gi|423293377|ref|ZP_17271504.1| hypothetical protein HMPREF1070_00169 [Bacteroides ovatus
CL03T12C18]
gi|392678320|gb|EIY71728.1| hypothetical protein HMPREF1070_00169 [Bacteroides ovatus
CL03T12C18]
Length = 727
Score = 222 bits (566), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 156/540 (28%), Positives = 256/540 (47%), Gaps = 60/540 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPA L+ ++P A I LG W R C +L + D LF +I + F+++
Sbjct: 232 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 288
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G T HIY D F+E PP PEY+ + + +Y+ + + D A W+ W+F +D
Sbjct: 289 QKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFD 347
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W +MKALL VP K+++LD E +W ++ F+ PYIWC L NF GN +
Sbjct: 348 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLT 407
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + A + + G+G ++EG++ Y+ + E A+ + VD WI
Sbjct: 408 GNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECL 466
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G ++DAW L++ +Y + + T G Y
Sbjct: 467 ADRHVGCVSQPVRDAWKRLFNDIY----------------------AQVPRTLGTLPGY- 503
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
+P + K ++ Y + L EV R L NE + + +R DLI + RQ
Sbjct: 504 RPALNKNSEKRTSNVYSNVEL----LEVWRKL-------NEAPSDRRDAFRLDLITVGRQ 552
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
L Y ++ + + D + + E++ D+D L A H L W++ A++
Sbjct: 553 VLGNYFLDVKMEFDRMVEAKDYQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARK 612
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
+ + + + YE NAR IT W L DY ++ W+GL+ DYY R +Y
Sbjct: 613 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYIDTF 665
Query: 485 IESLESG---DGFRLKDWRRE----WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 536
I+++ G D +L+D +E W+ T+ ++V+ S D L++ S +L++KY
Sbjct: 666 IKAVGEGVEVDQKQLEDELKEIEEGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722
>gi|383115203|ref|ZP_09935961.1| hypothetical protein BSGG_2915 [Bacteroides sp. D2]
gi|313695380|gb|EFS32215.1| hypothetical protein BSGG_2915 [Bacteroides sp. D2]
Length = 727
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 157/537 (29%), Positives = 249/537 (46%), Gaps = 54/537 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPA L+ ++P A I LG W R C +L + D LF +I + F+++
Sbjct: 232 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 288
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G T HIY D F+E PP PEY+ + + +Y+ + + D A W+ W+F +D
Sbjct: 289 QKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFD 347
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W +MKALL VP K+++LD E +W ++ F+ PYIWC L NF GN +
Sbjct: 348 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLT 407
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + A + + G+G ++EG++ Y+ + E A+ D K WI
Sbjct: 408 GNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAWNLNADDNK-WIECL 466
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G +++DAW L++ +Y V P T G Y
Sbjct: 467 ADRHVGCVSQSVRDAWKRLFNDIY--------------VQVPR--------TLGTLPGY- 503
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
+P + K ++ Y + L EV R L NE + + +R DLI + RQ
Sbjct: 504 RPALNKNSEKRTSNVYSNVEL----LEVWRKL-------NEAPSDRRDAFRLDLITVGRQ 552
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
L Y ++ + + D + + E++ D+D L A H L W++ A++
Sbjct: 553 VLGNYFFDVKVEFDRMVEAKDYQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARK 612
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
+ + + + YE NAR IT W L DY ++ W+GL+ DYY R +Y
Sbjct: 613 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYINTF 665
Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE----SNGDALIT-SQWLYNKY 536
I++ E G K E ++ W N + V S D L++ S +L++KY
Sbjct: 666 IKAAEKGVEVDQKQLEDELKEIEEGWVNATDRKDVRKDIHSATDGLLSFSTFLFSKY 722
>gi|423345423|ref|ZP_17323112.1| hypothetical protein HMPREF1060_00784 [Parabacteroides merdae
CL03T12C32]
gi|409223209|gb|EKN16146.1| hypothetical protein HMPREF1060_00784 [Parabacteroides merdae
CL03T12C32]
Length = 752
Score = 222 bits (565), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 161/544 (29%), Positives = 241/544 (44%), Gaps = 69/544 (12%)
Query: 11 AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQ 67
FSG VP L+ +P AKI + P WC LD TD LF IGR F+E+
Sbjct: 232 GFSGYVPRELKEKYPDAKI---------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEE 282
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+ K YG +Y D F E+ PPVD+PEY+ ++G AI+ D +++W MQ W
Sbjct: 283 EKKLYG-AHGVYAADPFHESQPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSL--- 338
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
R P +KA VP L++LDL +G P + LHNF G I ++G
Sbjct: 339 ---REPIVKA----VPKENLLILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHG 388
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
L +A +N + G G+ ME IEQNPV YDL EM ++V+++ W+ +Y+
Sbjct: 389 DLRLLASNQYVNAVKKNPNVCGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYA 448
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RRYG+ AW L Y G R I+A P++ G G
Sbjct: 449 DRRYGKPSENAHQAWLHLLEGPYR--PGTNGTERSSIIA---ARPAVNVKKSGPNAGLGI 503
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
P YS V++A L + L S+ YR+D++D+ RQ ++
Sbjct: 504 P--------------------YSPLSVVQAEGLLLKDAARLEDSDPYRFDIVDIQRQLMS 543
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
+ +A++ D S RFLE++ D D LL F WL A+
Sbjct: 544 NLGQVIHKQAAKAFRKKDKEAFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGD 603
Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
N E++ +E +A +T+W + L+ DY + W+GL+ YY R ++ + +
Sbjct: 604 NSEEKDLFEKDATALVTVW---GADGDPLIFDYSWREWTGLIDGYYLKRWEKFYAMLQDH 660
Query: 488 LESGDGFRLKDWR----REWIKLTN------DWQ-----NGRNVYPVESNGDALITSQWL 532
L++G + KD RE + + DW+ V + GD + T+ L
Sbjct: 661 LDAGTNYSEKDLPQTHGRESFRANDFYSTLGDWELQFVSTPDKVRTPITQGDEVETATRL 720
Query: 533 YNKY 536
Y KY
Sbjct: 721 YKKY 724
>gi|299144715|ref|ZP_07037783.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
gi|298515206|gb|EFI39087.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
Length = 727
Score = 222 bits (565), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 156/540 (28%), Positives = 257/540 (47%), Gaps = 60/540 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPA L+ ++P A I LG W R C +L + D LF +I + F+++
Sbjct: 232 VLPAFAGHVPADLKRLYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 288
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G HIY D F+E PP PEY+ + + +Y+ + + D A W+ W+F +D
Sbjct: 289 QKKLFG-IDHIYGLDPFNEVDPPSFEPEYLRKIVSDMYATLTAADPKAQWMQMTWMFYFD 347
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W +MKALL VP K+++LD E +W ++ F+ PYIWC L NF GN +
Sbjct: 348 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLT 407
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + A + + G+G ++EG++ Y+ + E A+ + VD WI
Sbjct: 408 GNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECL 466
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G +++DAW L++ +Y + + T G Y
Sbjct: 467 ADRHVGCVSQSVRDAWKRLFNDIY----------------------AQVPRTLGTLPGY- 503
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
+P + K ++ Y + L EV R L NE + + +R DLI + RQ
Sbjct: 504 RPALNKNSEKRTSNVYSNVEL----LEVWRKL-------NEAPSDRRDAFRLDLITVGRQ 552
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
L Y ++ + + D + + E++ D+D L A H L W++ A++
Sbjct: 553 VLGNYFLDVKMEFDRMVEAKDYQALKACGEKMKEILHDLDKLNAFHPYCSLDKWIDDARK 612
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
+ + + + YE NAR IT W L DY ++ W+GL+RDYY R +Y
Sbjct: 613 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLIRDYYAKRWEVYINTF 665
Query: 485 IESLESG---DGFRLKDWRRE----WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 536
I+++ G D +L+D +E W+ T+ ++V+ S D L++ S +L++KY
Sbjct: 666 IKAVGEGVEVDQKQLEDELKEIEEGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722
>gi|399028591|ref|ZP_10729778.1| Alpha-N-acetylglucosaminidase (NAGLU) [Flavobacterium sp. CF136]
gi|398073682|gb|EJL64846.1| Alpha-N-acetylglucosaminidase (NAGLU) [Flavobacterium sp. CF136]
Length = 727
Score = 221 bits (564), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 152/512 (29%), Positives = 243/512 (47%), Gaps = 49/512 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V+PAFSG VP A P +KI++L +W S + TYLLDA DPLF EIG+ FIE
Sbjct: 246 VVPAFSGYVPKAFAEKHPGSKISELKSW----SGGGFESTYLLDANDPLFKEIGKRFIEI 301
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLF 124
K YG+ + Y D F+E TPPV E +S G I+ + DA W+MQGWLF
Sbjct: 302 YTKLYGQ-ADFYLADAFNEITPPVSKEHKYEELSDYGKTIFETINEASPDATWVMQGWLF 360
Query: 125 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
+ FW KA L+ VP ++++ D + +W + FYG + + +HN+ G+
Sbjct: 361 GDNKEFWTKEATKAFLSKVPNDRMMIQDYANDRHKVWEKQEAFYGKQWTYGYVHNYGGSN 420
Query: 184 EMYGILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
+YG L+ + N +VG G+ EG+ N +VY+ + ++ + K V W
Sbjct: 421 PVYGDLNFYKNELTHLLGNSNKGNVVGYGVMPEGLNNNSIVYEYIYDLPWSQGKESVNDW 480
Query: 243 INQYSVRRYGR--SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
+N+Y RYG+ S P Q AW +L +VY+ T D A+ + +TE
Sbjct: 481 LNKYLSARYGKNISTPVFQ-AWKLLIESVYSTKYWETRWWDDRAGAYLFFKRPTLKITEF 539
Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
K G P K+ ++ +AL++ + ++ Y YDL+D
Sbjct: 540 K----GNPGDKQ--------------------KLKQALDILKRESKSFNKNSLYFYDLLD 575
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
++R + ++L + + AY+L D +L ++ + D+D +L+ L WL+
Sbjct: 576 MSRHYYSLCIDDLLIECVTAYELKDIKKADELFKKIEKQALDIDNMLSGQPLNSLNNWLK 635
Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
SA + E K Y NA+T IT+W L DY ++ W G+ + +Y PR ++
Sbjct: 636 SASDYGSSPEVSKLYVKNAKTLITLWGGEGH-----LNDYASRSWRGMYKGFYWPRWKMF 690
Query: 481 FKYMIESLESGDGF-------RLKDWRREWIK 505
+ ES+ + F +K W +W +
Sbjct: 691 LQAQRESVVNNTSFDELKVRESIKQWEIKWCQ 722
>gi|423214204|ref|ZP_17200732.1| hypothetical protein HMPREF1074_02264 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693149|gb|EIY86384.1| hypothetical protein HMPREF1074_02264 [Bacteroides xylanisolvens
CL03T12C04]
Length = 727
Score = 221 bits (564), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 154/537 (28%), Positives = 249/537 (46%), Gaps = 54/537 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPA L+ ++P A I LG W R C +L + D LF +I + F+++
Sbjct: 232 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 288
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G T H+Y D F+E PP PEY+ + + +Y+ + + D A W+ W+F +D
Sbjct: 289 QKKLFG-TDHVYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFD 347
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W +MKALL VP K+++LD E +W ++ F+ PYIWC L NF GN +
Sbjct: 348 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLT 407
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + A + + G+G ++EG++ Y+ + E A+ + VD WI
Sbjct: 408 GNVKESGERLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDDKWIECL 466
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G ++DAW L++ +Y + + T G Y
Sbjct: 467 ADRHVGCVSQPVRDAWKRLFNDIY----------------------AQVPRTLGTLPGY- 503
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
+P + K ++ Y + L EV R L NE + + +R DLI + RQ
Sbjct: 504 RPALNKNSEKRTSNVYSNIEL----LEVWRKL-------NEAPSDRRDAFRLDLITVGRQ 552
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
L Y ++ + + D + + E++ D+D L A H L W++ A++
Sbjct: 553 VLGNYFLDVKMEFDRMVEAKDYQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARK 612
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
+ + + + YE NAR IT W L DY ++ W+GL+ DYY R +Y
Sbjct: 613 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYIDTF 665
Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE----SNGDALIT-SQWLYNKY 536
I++ E G K E ++ W N + V S D L++ S +L++KY
Sbjct: 666 IKAAEKGVEVDQKQLEDELKEIEEGWVNATDRKDVRKDIHSATDGLLSFSTFLFSKY 722
>gi|429766730|ref|ZP_19298977.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
celatum DSM 1785]
gi|429183354|gb|EKY24416.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
celatum DSM 1785]
Length = 2284
Score = 221 bits (564), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 156/528 (29%), Positives = 255/528 (48%), Gaps = 54/528 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL +SG VP Q P A+I G W P TY+ D F E+ F E+
Sbjct: 415 VLQGYSGMVPLDFQKKNPDAQILSQGGWCGFDR-PNMLKTYVNDGERDYFQEVADVFYEK 473
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + YG + Y D F E NT +DS + + I M D DA+W++Q W +
Sbjct: 474 QKEVYGDITDYYAVDPFHEGGNTGGMDS----ARIYGTIQDKMIEHDEDAIWVIQHWQGN 529
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
D ++ L N + ++LDL +++ P + T +P++W MLHNF G + +
Sbjct: 530 PDN----TKLSGLTNK---EQALILDLNSDLNPDY-TRFDNQDIPWVWNMLHNFGGRMGL 581
Query: 186 YGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
G ++++A EA T+EN M G+G++ E + +P+VY+LM +M + + ++ + W+N
Sbjct: 582 DGQVETVATSITEALATTEN--MKGIGITPEALANSPIVYELMGDMIWTRDPINYREWVN 639
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTD----GATDKNRDVIVAFPDVDPSIISVTEG 300
Y RRYG +AW +L T Y +D GA + SII+
Sbjct: 640 NYIERRYGAVNEDAIEAWEILLETAYKTSDYYYQGAAE--------------SIIN---- 681
Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
++ A + S++ H + Y E+ RA+ELFI+ +EL S+ + YD +D
Sbjct: 682 ---------ARPATSINSASTWGHSKISYDKKELERAMELFISCYDELKDSDAFVYDFLD 732
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
+T+Q LA A E ++ AY DA ++S FL+L+ + +L+ FL+G W+E
Sbjct: 733 VTKQVLANSAQEYHKEMVAAYNSGDAEKFERISEHFLDLIRLQERVLSTSPEFLVGTWIE 792
Query: 421 SAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
++ + + + + +E+NAR IT W D L+DY N+ W+GL D Y R
Sbjct: 793 QSRTMLADADDWTKDLFEFNARALITTWGDYKN---GSLKDYSNRQWAGLTEDLYLKRWE 849
Query: 479 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL 526
++ + LE+G DW + + + + N YP E +G+ L
Sbjct: 850 MWIDGIRTELETGVTAPSIDWHKVEYEWATEKTDESNAYPTEGSGEDL 897
>gi|423312588|ref|ZP_17290525.1| hypothetical protein HMPREF1058_01137 [Bacteroides vulgatus
CL09T03C04]
gi|392688276|gb|EIY81565.1| hypothetical protein HMPREF1058_01137 [Bacteroides vulgatus
CL09T03C04]
Length = 717
Score = 221 bits (564), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 157/495 (31%), Positives = 244/495 (49%), Gaps = 67/495 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
VLP + G VP + +LG +V +DP + C+Y L D F EI
Sbjct: 224 VLPGYCGMVPHNAKE--------KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISAL 272
Query: 64 FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ + K YG+T Y D F E +T V+ + + G AI M+ + DAVW+ Q
Sbjct: 273 YYRELTKLYGKTG-FYAIDPFHEGGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQA 327
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWC 174
W +P R P ++ + G L+VLDL +E +P W K YG +++C
Sbjct: 328 W--QDNP--RTP----MIEHLEAGDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYC 379
Query: 175 MLHNFAGNIEMYGILDSI--AFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
ML NF GNI ++G +D++ F +A T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 380 MLLNFGGNIGLHGKMDALIDGFYDAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPW 439
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
+ + W+ Y RYG A+Q AW++L + +YN + + V A P +
Sbjct: 440 REHRFTRDEWLKGYVYARYGVEDEALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGL 499
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
D YQ SS+ +Y+ +VI A L ++ ++
Sbjct: 500 DV---------YQ---------------VSSWSEMKEYYNPQDVIEAARLMVSVADKYQG 535
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLAC 409
+N + +DL+D+ RQALA+ + + A++ D VF+L S+ FL L+ D LL
Sbjct: 536 NNNFEFDLVDVLRQALAEKGRLMQKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGT 594
Query: 410 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 469
F +G W+E+A+ Q +E++ YEWNAR QIT W + + LRDY +K W+G+L
Sbjct: 595 RKEFKVGTWIEAARSAGQTQEEKALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGIL 654
Query: 470 RDYYGPRAAIYFKYM 484
+D+Y R YF Y+
Sbjct: 655 KDFYFMRWKAYFDYL 669
>gi|373461342|ref|ZP_09553084.1| hypothetical protein HMPREF9944_01348 [Prevotella maculosa OT 289]
gi|371952896|gb|EHO70729.1| hypothetical protein HMPREF9944_01348 [Prevotella maculosa OT 289]
Length = 731
Score = 221 bits (563), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 142/485 (29%), Positives = 236/485 (48%), Gaps = 53/485 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP + + P A+IT++G W R TY L TDP F ++ ++F+++
Sbjct: 231 VLPAFNGSVPLDYKRLHPEARITEVGQWGGFGQAYR---TYFLSPTDPRFGKLQKSFLDE 287
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + +G T H+Y D+F+E PP SP+ + L I++ + D +VW+ GWLF D
Sbjct: 288 QRRMFG-TDHLYCLDSFNEVQPPSWSPDTLCMLARHIHASLDKADPQSVWVQMGWLFYND 346
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P ++A L+ +P + ++LD + + +W ++ FYG PYI C+L NF GN +
Sbjct: 347 RKHWTPDVIRAYLSGIPKDRALLLDYYIDHTELWRLTESFYGRPYIACVLGNFGGNTMLQ 406
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + ++ ++A +++ M GVG +MEG NP Y + + A+ D + W+ +
Sbjct: 407 GDVGKVS-SRLDAAIAQDGNMAGVGATMEGFGVNPDFYAFVFDKAWDCGTTD-RDWLCRM 464
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G + A + AW VL+ + +++ V+ A P E +Y N
Sbjct: 465 ADRHVGFASAAGRTAWQVLFDRI---MPSYVNESGTVVCARPSF--------EARYLNTT 513
Query: 307 KPVSKEAVLKS--ETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
P V K + S HL YD++++ RQ
Sbjct: 514 YPAELLGVWKMLLDIDSDKREHL----------------------------YDVVNVGRQ 545
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
L + + AY + V +RR ++++D+D LLAC + F L W+E A+
Sbjct: 546 VLGDFFAFERDGLHRAYLSQRSDSVDYYARRMDKMLDDLDRLLACSEEFSLRKWIEDARG 605
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
++ YE NART IT+W D+ Q L DY N+ W+GL+ YY R I+ ++
Sbjct: 606 FGATAAEKDYYERNARTLITVWGDSRQ-----LTDYANRTWAGLVSSYYKQRWHIFTAHV 660
Query: 485 IESLE 489
++
Sbjct: 661 RRAVR 665
>gi|160884062|ref|ZP_02065065.1| hypothetical protein BACOVA_02038 [Bacteroides ovatus ATCC 8483]
gi|423291477|ref|ZP_17270325.1| hypothetical protein HMPREF1069_05368 [Bacteroides ovatus
CL02T12C04]
gi|156110404|gb|EDO12149.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
8483]
gi|392663477|gb|EIY57027.1| hypothetical protein HMPREF1069_05368 [Bacteroides ovatus
CL02T12C04]
Length = 727
Score = 221 bits (563), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 158/540 (29%), Positives = 255/540 (47%), Gaps = 60/540 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPA L+ ++P A I LG W R C +L + D LF +I + F+++
Sbjct: 232 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 288
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G T HIY D F+E PP PEY+ + + +Y+ + + D A W+ W+F +D
Sbjct: 289 QKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFD 347
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W +MKALL VP K+++LD E +W ++ F+ PYIWC L NF GN +
Sbjct: 348 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLT 407
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + A + + G+G ++EG++ Y+ + E A+ + VD WI
Sbjct: 408 GNVKESGERLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDDKWIECL 466
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G ++DAW L++ +Y V P T G Y
Sbjct: 467 ADRHVGCVSQPVRDAWKRLFNDIY--------------VQVPR--------TLGTLPGY- 503
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
+P K ++ Y + L EV R L NE + + +R DLI + RQ
Sbjct: 504 RPALNRNSEKRTSNVYSNVEL----LEVWRKL-------NEAPSDRRDAFRLDLITVGRQ 552
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
L Y ++ + + D + + E++ D+D L A H L W++ A++
Sbjct: 553 VLGNYFLDVKVEFDRMVEAKDHQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARK 612
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
+ + + + YE NAR IT W L DY ++ W+GL+ DYY R +Y
Sbjct: 613 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYINTF 665
Query: 485 IESLESG---DGFRLKDWRRE----WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 536
I+++ G D +L+D +E W+ T+ ++V+ S D L++ S +L++KY
Sbjct: 666 IKAVGEGVEVDQKQLEDELKEIEEGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722
>gi|189465172|ref|ZP_03013957.1| hypothetical protein BACINT_01517 [Bacteroides intestinalis DSM
17393]
gi|189437446|gb|EDV06431.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides intestinalis DSM
17393]
Length = 723
Score = 221 bits (562), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 153/550 (27%), Positives = 261/550 (47%), Gaps = 76/550 (13%)
Query: 8 VLPAFSGNVPA-ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
V P +SG VP A + + + + L N F+ + L TD F EI +
Sbjct: 229 VFPGYSGMVPHDADEKLGLNLTKSDLWNGFTRPA--------FLQPTDARFAEIADLYYR 280
Query: 67 QQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 124
+Q K +G+ + Y+ D F E N VD + G AI + M+ + A W++QGW
Sbjct: 281 EQEKLFGKADY-YSMDPFHEAENAASVD----FDAAGKAIMTAMKKVNPKATWVVQGW-- 333
Query: 125 SYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW----------- 173
+ +P RP +K + N G L++LDLF+E +P+W G+P IW
Sbjct: 334 TENP--RPEMIKNMQN----GDLLILDLFSECRPMW-------GIPSIWKRDKGYEQHDW 380
Query: 174 --CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEM 230
CML NF GN+ ++G +D + + + T + G+G++MEG E N ++++LM E+
Sbjct: 381 LFCMLLNFGGNVGLHGRMDQLLNNFYLTKNNPLATHLKGIGLTMEGSENNAMMFELMCEL 440
Query: 231 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFP 288
++ EK + W+ Y RYG I+ AW +L +T+YNC G + + + P
Sbjct: 441 PWRPEKFTKEEWLKDYLFARYGVRDEKIEQAWTLLANTIYNCPFGNNQQGPHESIFCGRP 500
Query: 289 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 348
++ + + + K +NY P E A L + ++
Sbjct: 501 SLN-NFQASSWSKMKNYYDPTVTE-----------------------EAARLMLEVADKY 536
Query: 349 SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLA 408
+N + YDL+D+ RQ+L+ ++ I ++ D + SR+FL+++ D LL
Sbjct: 537 RGNNNFEYDLVDIVRQSLSDKGRIVYNRTIADFKSFDKRSFARDSRKFLDILLLQDKLLG 596
Query: 409 CHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGL 468
F +G W+E A++L E++ YEWNAR QIT W + + LRDY +K W+G+
Sbjct: 597 TRSEFRVGRWIEQARKLGTTPEEKDLYEWNARVQITTWGNRVCADDGGLRDYAHKEWNGI 656
Query: 469 LRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALIT 528
LRD+Y R A Y++ + + L+ +L ++ + W +N Y S G+ +
Sbjct: 657 LRDFYYKRWAAYWQTLQDQLDGKPEVKL-----DYYAMEEPWTLAKNPYGSTSEGNCVDV 711
Query: 529 SQWLYNKYLQ 538
++ + K +
Sbjct: 712 AKEAFEKVFE 721
>gi|423226735|ref|ZP_17213200.1| hypothetical protein HMPREF1062_05386 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392627008|gb|EIY21049.1| hypothetical protein HMPREF1062_05386 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 718
Score = 221 bits (562), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 152/550 (27%), Positives = 261/550 (47%), Gaps = 76/550 (13%)
Query: 8 VLPAFSGNVPA-ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
V P +SG VP A + + + + L N F+ + L TD F EI + +
Sbjct: 226 VFPGYSGMVPHDADEKLGLNLTKSDLWNGFTRPA--------FLQPTDVRFAEIADLYYQ 277
Query: 67 QQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 124
+Q K +G+ + Y+ D F E N VD + G AI + M+ + A W++QGW
Sbjct: 278 EQEKLFGKVDY-YSMDPFHEAENAASVD----FDAAGKAIMAAMKKVNPKATWVVQGW-- 330
Query: 125 SYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW----------- 173
+ +P RP +K + N G L++LDLF+E +P+W G+P IW
Sbjct: 331 TENP--RPEMIKNMQN----GDLLILDLFSECRPMW-------GIPSIWKRDKGYEQHNW 377
Query: 174 --CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEM 230
CML NF GN+ ++G +D + + + + G+G++MEG E NP++++LM E+
Sbjct: 378 LFCMLLNFGGNVGLHGRMDQLLDNFYLTKNNPLAVHLKGIGLTMEGAENNPMMFELMCEL 437
Query: 231 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFP 288
++ EK + W+ Y RYG I+ AW +L +T+YNC G + + + P
Sbjct: 438 PWRPEKFTKEEWLKDYLFARYGVRDEKIEKAWTLLANTIYNCPFGNNQQGPHESIFCGRP 497
Query: 289 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 348
++ + + + K +NY P E A L + ++
Sbjct: 498 SLN-NFQASSWSKMKNYYDPTVTE-----------------------EAARLMVEVADKY 533
Query: 349 SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLA 408
+N + YDL+D+ RQ+L+ ++ I ++ D + SR+FL+++ D LL
Sbjct: 534 RGNNNFEYDLVDIVRQSLSDKGRIVYNRTIADFKSFDKRSFARDSRKFLDILLLQDKLLG 593
Query: 409 CHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGL 468
F +G W+E A+ L E++ YEWNAR QIT W + + LRDY +K W+G+
Sbjct: 594 TRSEFRVGRWIEQARNLGTTPEEKDLYEWNARVQITTWGNRVCADDGGLRDYAHKEWNGI 653
Query: 469 LRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALIT 528
LRD+Y R A Y++ + + L+ +L ++ + W +N Y G +
Sbjct: 654 LRDFYYKRWAAYWQTLQDQLDGKPEVKL-----DYYAMEEPWTLAKNPYSSVPEGSCVDV 708
Query: 529 SQWLYNKYLQ 538
++ ++ K ++
Sbjct: 709 AKEVFEKAMR 718
>gi|224537466|ref|ZP_03678005.1| hypothetical protein BACCELL_02345 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520904|gb|EEF90009.1| hypothetical protein BACCELL_02345 [Bacteroides cellulosilyticus
DSM 14838]
Length = 721
Score = 221 bits (562), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 152/550 (27%), Positives = 261/550 (47%), Gaps = 76/550 (13%)
Query: 8 VLPAFSGNVPA-ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
V P +SG VP A + + + + L N F+ + L TD F EI + +
Sbjct: 229 VFPGYSGMVPHDADEKLGLNLTKSDLWNGFTRPA--------FLQPTDVRFAEIADLYYQ 280
Query: 67 QQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 124
+Q K +G+ + Y+ D F E N VD + G AI + M+ + A W++QGW
Sbjct: 281 EQEKLFGKVDY-YSMDPFHEAENAASVD----FDAAGKAIMAAMKKVNPKATWVVQGW-- 333
Query: 125 SYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW----------- 173
+ +P RP +K + N G L++LDLF+E +P+W G+P IW
Sbjct: 334 TENP--RPEMIKNMQN----GDLLILDLFSECRPMW-------GIPSIWKRDKGYEQHNW 380
Query: 174 --CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEM 230
CML NF GN+ ++G +D + + + + G+G++MEG E NP++++LM E+
Sbjct: 381 LFCMLLNFGGNVGLHGRMDQLLDNFYLTKNNPLAVHLKGIGLTMEGAENNPMMFELMCEL 440
Query: 231 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFP 288
++ EK + W+ Y RYG I+ AW +L +T+YNC G + + + P
Sbjct: 441 PWRPEKFTKEEWLKDYLFARYGVRDEKIEKAWTLLANTIYNCPFGNNQQGPHESIFCGRP 500
Query: 289 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 348
++ + + + K +NY P E A L + ++
Sbjct: 501 SLN-NFQASSWSKMKNYYDPTVTE-----------------------EAARLMVEVADKY 536
Query: 349 SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLA 408
+N + YDL+D+ RQ+L+ ++ I ++ D + SR+FL+++ D LL
Sbjct: 537 RGNNNFEYDLVDIVRQSLSDKGRIVYNRTIADFKSFDKRSFARDSRKFLDILLLQDKLLG 596
Query: 409 CHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGL 468
F +G W+E A+ L E++ YEWNAR QIT W + + LRDY +K W+G+
Sbjct: 597 TRSEFRVGRWIEQARNLGTTPEEKDLYEWNARVQITTWGNRVCADDGGLRDYAHKEWNGI 656
Query: 469 LRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALIT 528
LRD+Y R A Y++ + + L+ +L ++ + W +N Y G +
Sbjct: 657 LRDFYYKRWAAYWQTLQDQLDGKPEVKL-----DYYAMEEPWTLAKNPYSSVPEGSCVDV 711
Query: 529 SQWLYNKYLQ 538
++ ++ K ++
Sbjct: 712 AKEVFEKAMR 721
>gi|393786624|ref|ZP_10374756.1| hypothetical protein HMPREF1068_01036 [Bacteroides nordii
CL02T12C05]
gi|392657859|gb|EIY51489.1| hypothetical protein HMPREF1068_01036 [Bacteroides nordii
CL02T12C05]
Length = 717
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 159/544 (29%), Positives = 257/544 (47%), Gaps = 72/544 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
VLP + G VP + +LG +V SDP C Y L D F EI
Sbjct: 224 VLPGYCGMVPHNAK--------AKLG--LNV-SDPGVWCGYRRPAFLQPDDSRFEEISSL 272
Query: 64 FIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL 123
+ ++ K YG+ ++ Y+ D F E +D + ++G A+ M+ + AVW++Q W
Sbjct: 273 YYKELEKLYGKANY-YSMDPFHEGGS-IDGVN-LDAVGKAVMKAMKKANPKAVWVIQAWQ 329
Query: 124 FSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST------SKQFYGVP-YIWCML 176
+ P L+ ++ G L++LDL +E +P W K YG +++CML
Sbjct: 330 ANPRP--------ELIRNLETGDLLILDLTSECRPQWGDPESEWYRKDGYGKHNWVYCML 381
Query: 177 HNFAGNIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 234
N+ N+ ++G +D++ + + T+ GVGM+ EGIE NPV+Y+L+ E+ ++
Sbjct: 382 LNYGANVGLHGKMDNVIDNYYLAKENLRARATLKGVGMTPEGIENNPVMYELLMELPWRP 441
Query: 235 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDP 292
E+ + W+ Y RYG+ P +Q AW L +++YN T + + V A P +D
Sbjct: 442 ERFTKEDWLKGYVKARYGKDEPVLQLAWGKLANSIYNAPKELTQQGTHESVFCARPGLDV 501
Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
YQ SS+ +Y EVI A L ++ + +
Sbjct: 502 ---------YQ---------------VSSWSEMKDYYDPQEVIEAARLMVSVADRYRGNT 537
Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
+ YDL+D+ RQA+A+ + + AY+ D S++FL L+ D LL
Sbjct: 538 NFEYDLVDVVRQAIAEKGRLMQKAVTTAYRAGDKELFAMASQKFLNLILLQDQLLGTRTE 597
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
F LG W+ SA+ L E++ YEWN R Q+T W + E LRDY +K W+GLL+D+
Sbjct: 598 FRLGRWINSARALGVTPEEKALYEWNTRVQVTTWGNRNAAERGGLRDYAHKEWNGLLKDF 657
Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDW---RREWIKLTNDWQNGRNVYPVESNGDALITS 529
Y R +YF + +E G+ D+ W+K TN +Q E GD + T+
Sbjct: 658 YYMRWKLYFDNLACKME-GETIPEIDFYAVEEAWVKRTNPYQ-------AEPEGDCVDTA 709
Query: 530 QWLY 533
+ ++
Sbjct: 710 KLIF 713
>gi|160887167|ref|ZP_02068170.1| hypothetical protein BACOVA_05183 [Bacteroides ovatus ATCC 8483]
gi|423295093|ref|ZP_17273220.1| hypothetical protein HMPREF1070_01885 [Bacteroides ovatus
CL03T12C18]
gi|156107578|gb|EDO09323.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
8483]
gi|392673999|gb|EIY67450.1| hypothetical protein HMPREF1070_01885 [Bacteroides ovatus
CL03T12C18]
Length = 711
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 146/489 (29%), Positives = 241/489 (49%), Gaps = 59/489 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAF 64
V P ++G VP + +LG + + +WC L D F +
Sbjct: 233 VFPGYAGMVPRNIGE--------KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMY 282
Query: 65 IEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
E+ K YG+ ++ Y+ D F E NT VD ++ GA+I + M+ + AVW++Q W
Sbjct: 283 YEELEKLYGKANY-YSMDPFHEGGNTEGVD----LAKTGASIMAAMKKANPKAVWIIQAW 337
Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCM 175
S P+ + ++ S+ G L+VLDL++E +P +W K F +++CM
Sbjct: 338 QAS-------PR-EEMIASLNQGDLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCM 389
Query: 176 LHNFAGNIEMYGILDSIAFGPVEARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQH 234
L NF GN+ ++G ++ + G +A N M+ GVG + EGIE NPV+++L+ E+ ++
Sbjct: 390 LLNFGGNVGLHGRMNQLVNGYYDACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWRE 449
Query: 235 EKVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPS 293
E+ W+ Y RYGR V P I +AW L HTVYN D + + S
Sbjct: 450 ERFSSDEWLQTYLKARYGREVSPEIMEAWRALEHTVYNA---PKDYQGEGTIE------S 500
Query: 294 IISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 353
++ G + + TS++ + L+Y+ +A LF + ++ +N
Sbjct: 501 LLCARPGFHLD-------------RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNN 547
Query: 354 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 413
+ YDL+D+ RQ+ A N L I ++Y D + +++FL+L+ D LL+ F
Sbjct: 548 FEYDLVDIVRQSNADKGNVLLEEISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEF 607
Query: 414 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 473
+ WL +A+ L EE+++ YEWNA IT+W D+ L DY ++ WSGLL+D Y
Sbjct: 608 SVSSWLNAARSLGTTEEEKRLYEWNASALITVWGDSIAANQGGLHDYSHREWSGLLKDLY 667
Query: 474 GPRAAIYFK 482
R +F+
Sbjct: 668 YQRWKAFFE 676
>gi|422873453|ref|ZP_16919938.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens F262]
gi|380305838|gb|EIA18115.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens F262]
Length = 2104
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 157/528 (29%), Positives = 256/528 (48%), Gaps = 51/528 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL +SG VP + P A+ G W P TY+ + F ++ F E+
Sbjct: 415 VLQGYSGMVPRDFKEKNPEAQTISQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEK 473
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + +G ++ Y D F E NT +D+ + I + M D+DAVW++Q W +
Sbjct: 474 QKEVFGDVTNFYGVDPFHEGGNTGNLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 529
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
P L + +VLDLF+EV P W+ ++ +P+IW MLHNF G + M
Sbjct: 530 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 581
Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
+ +A P SE+ MVG+G++ E I NP+ Y+L+ +MA+ ++++ + W
Sbjct: 582 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTE 639
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
Y RRYG++ I DAWN++ T Y K D + SII+ G
Sbjct: 640 DYIERRYGKTNKEILDAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 685
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+G +KS S++ H + Y SE +A+E+F + +E S+ + YD D+ +Q
Sbjct: 686 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQ 736
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
LA A E + + AY DA +S +FLEL++ + +L+ FL+G W+E A+
Sbjct: 737 LLANSAQEYYEVMCNAYNNRDAEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 796
Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
+ ++ + + +E+NAR +T W + L+DY N+ WSGL DYY R +
Sbjct: 797 MLKDSDDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWIN 856
Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
+ L+ G K +W K+ DW N ++ +YP E++ + L
Sbjct: 857 GLQTELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 900
>gi|345519733|ref|ZP_08799147.1| glycoside hydrolase family 89 [Bacteroides sp. 4_3_47FAA]
gi|345457107|gb|EET15964.2| glycoside hydrolase family 89 [Bacteroides sp. 4_3_47FAA]
Length = 717
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 156/495 (31%), Positives = 243/495 (49%), Gaps = 67/495 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
VLP + G VP + +LG +V +DP + C+Y L D F EI
Sbjct: 224 VLPGYCGMVPHNAKE--------KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISAL 272
Query: 64 FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ + K YG+T Y D F E +T V+ + + G AI M+ + DAVW+ Q
Sbjct: 273 YYRELTKLYGKTG-FYAIDPFHEGGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQA 327
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWC 174
W +P R P ++ + G L+VLDL +E +P W K YG +++C
Sbjct: 328 W--QDNP--RTP----MIEHLEAGDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYC 379
Query: 175 MLHNFAGNIEMYGILDSI--AFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
ML NF GNI ++G +D++ F +A T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 380 MLLNFGGNIGLHGKMDALIDGFYDAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPW 439
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
+ + W+ Y RYG A+Q W++L + +YN + + V A P +
Sbjct: 440 REHRFTRDEWLKGYVYARYGVEDEALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGL 499
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
D YQ SS+ +Y+ +VI A L ++ ++
Sbjct: 500 DV---------YQ---------------VSSWSEMKEYYNPQDVIEAARLMVSVADKYQG 535
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLAC 409
+N + +DL+D+ RQALA+ + + A++ D VF+L S+ FL L+ D LL
Sbjct: 536 NNNFEFDLVDVLRQALAEKGRLMQKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGT 594
Query: 410 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 469
F +G W+E+A+ Q +E++ YEWNAR QIT W + + LRDY +K W+G+L
Sbjct: 595 RKEFKVGTWIEAARSAGQTQEEKALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGIL 654
Query: 470 RDYYGPRAAIYFKYM 484
+D+Y R YF Y+
Sbjct: 655 KDFYFMRWKAYFDYL 669
>gi|319643377|ref|ZP_07998003.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_40A]
gi|317385006|gb|EFV65959.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_40A]
Length = 718
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 156/495 (31%), Positives = 243/495 (49%), Gaps = 67/495 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
VLP + G VP + +LG +V +DP + C+Y L D F EI
Sbjct: 225 VLPGYCGMVPHNAKE--------KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISAL 273
Query: 64 FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ + K YG+T Y D F E +T V+ + + G AI M+ + DAVW+ Q
Sbjct: 274 YYRELTKLYGKTG-FYAIDPFHEGGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQA 328
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWC 174
W +P R P ++ + G L+VLDL +E +P W K YG +++C
Sbjct: 329 W--QDNP--RTP----MIEHLEAGDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYC 380
Query: 175 MLHNFAGNIEMYGILDSI--AFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
ML NF GNI ++G +D++ F +A T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 381 MLLNFGGNIGLHGKMDALIDGFYDAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPW 440
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
+ + W+ Y RYG A+Q W++L + +YN + + V A P +
Sbjct: 441 REHRFTRDEWLKGYVYARYGVEDEALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGL 500
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
D YQ SS+ +Y+ +VI A L ++ ++
Sbjct: 501 DV---------YQ---------------VSSWSEMKEYYNPQDVIEAARLMVSVADKYQG 536
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLAC 409
+N + +DL+D+ RQALA+ + + A++ D VF+L S+ FL L+ D LL
Sbjct: 537 NNNFEFDLVDVLRQALAEKGRLMQKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGT 595
Query: 410 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 469
F +G W+E+A+ Q +E++ YEWNAR QIT W + + LRDY +K W+G+L
Sbjct: 596 RKEFKVGTWIEAARSAGQTQEEKALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGIL 655
Query: 470 RDYYGPRAAIYFKYM 484
+D+Y R YF Y+
Sbjct: 656 KDFYFMRWKAYFDYL 670
>gi|150004413|ref|YP_001299157.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
gi|149932837|gb|ABR39535.1| glycoside hydrolase family 89 [Bacteroides vulgatus ATCC 8482]
Length = 717
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 156/495 (31%), Positives = 243/495 (49%), Gaps = 67/495 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
VLP + G VP + +LG +V +DP + C+Y L D F EI
Sbjct: 224 VLPGYCGMVPHNAKE--------KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISAL 272
Query: 64 FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ + K YG+T Y D F E +T V+ + + G AI M+ + DAVW+ Q
Sbjct: 273 YYRELTKLYGKTG-FYAIDPFHEGGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQA 327
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWC 174
W +P R P ++ + G L+VLDL +E +P W K YG +++C
Sbjct: 328 W--QDNP--RTP----MIEHLEAGDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYC 379
Query: 175 MLHNFAGNIEMYGILDSI--AFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
ML NF GNI ++G +D++ F +A T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 380 MLLNFGGNIGLHGKMDALIDGFYDAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPW 439
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
+ + W+ Y RYG A+Q W++L + +YN + + V A P +
Sbjct: 440 REHRFTRDEWLKGYVYARYGVEDEALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGL 499
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
D YQ SS+ +Y+ +VI A L ++ ++
Sbjct: 500 DV---------YQ---------------VSSWSEMKEYYNPQDVIEAARLMVSVADKYQG 535
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLAC 409
+N + +DL+D+ RQALA+ + + A++ D VF+L S+ FL L+ D LL
Sbjct: 536 NNNFEFDLVDVLRQALAEKGRLMQKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGT 594
Query: 410 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 469
F +G W+E+A+ Q +E++ YEWNAR QIT W + + LRDY +K W+G+L
Sbjct: 595 RKEFKVGTWIEAARSAGQTQEEKALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGIL 654
Query: 470 RDYYGPRAAIYFKYM 484
+D+Y R YF Y+
Sbjct: 655 KDFYFMRWKAYFDYL 669
>gi|299148671|ref|ZP_07041733.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
3_1_23]
gi|383114572|ref|ZP_09935334.1| hypothetical protein BSGG_1257 [Bacteroides sp. D2]
gi|298513432|gb|EFI37319.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
3_1_23]
gi|313693722|gb|EFS30557.1| hypothetical protein BSGG_1257 [Bacteroides sp. D2]
Length = 711
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 144/489 (29%), Positives = 239/489 (48%), Gaps = 59/489 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAF 64
V P ++G VP + +LG + + +WC L D F +
Sbjct: 233 VFPGYAGMVPRNIGE--------KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMY 282
Query: 65 IEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
E+ K YG+ ++ Y+ D F E NT VD ++ GA+I + M+ + AVW++Q W
Sbjct: 283 YEELEKLYGKANY-YSMDPFHEGGNTEGVD----LAKTGASIMAAMKKANPKAVWIIQAW 337
Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCM 175
+ + ++ S+ G L+VLDL++E +P +W K F +++CM
Sbjct: 338 --------QANPREEMIASLNQGDLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCM 389
Query: 176 LHNFAGNIEMYGILDSIAFGPVEARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQH 234
L NF GN+ ++G ++ + G +A N M+ GVG + EGIE NPV+++L+ E+ ++
Sbjct: 390 LLNFGGNVGLHGRMNQLVNGYYDACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWRE 449
Query: 235 EKVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPS 293
E+ W+ Y RYGR V P I +AW L HTVYN D + + S
Sbjct: 450 ERFSSDEWLQTYLKARYGREVSPEIMEAWRALEHTVYNA---PKDYQGEGTIE------S 500
Query: 294 IISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 353
++ G + + TS++ + L+Y+ +A LF + ++ +N
Sbjct: 501 LLCARPGFHLD-------------RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNN 547
Query: 354 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 413
+ YDL+D+ RQ+ A N L I ++Y D + +++FL+L+ D LL+ F
Sbjct: 548 FEYDLVDIVRQSNADKGNVLLEEISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEF 607
Query: 414 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 473
+ WL +A+ L EE+++ YEWNA IT+W D+ L DY ++ WSGLL+D Y
Sbjct: 608 SVSSWLNAARSLGTTEEEKRLYEWNASALITVWGDSIAANQGGLHDYSHREWSGLLKDLY 667
Query: 474 GPRAAIYFK 482
R +F+
Sbjct: 668 YQRWKAFFE 676
>gi|294775488|ref|ZP_06741000.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
gi|294450633|gb|EFG19121.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
Length = 712
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 156/495 (31%), Positives = 243/495 (49%), Gaps = 67/495 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
VLP + G VP + +LG +V +DP + C+Y L D F EI
Sbjct: 219 VLPGYCGMVPHNAKE--------KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISAL 267
Query: 64 FIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
+ + K YG+T Y D F E +T V+ + + G AI M+ + DAVW+ Q
Sbjct: 268 YYRELTKLYGKTG-FYAIDPFHEGGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQA 322
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWC 174
W +P R P ++ + G L+VLDL +E +P W K YG +++C
Sbjct: 323 W--QDNP--RTP----MIEHLEAGDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYC 374
Query: 175 MLHNFAGNIEMYGILDSI--AFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
ML NF GNI ++G +D++ F +A T+ GVGM+ EGIE NPV+Y+L+ E+ +
Sbjct: 375 MLLNFGGNIGLHGKMDALIDGFYDAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPW 434
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDV 290
+ + W+ Y RYG A+Q W++L + +YN + + V A P +
Sbjct: 435 REHRFTRDEWLKGYVYARYGVEDEALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGL 494
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
D YQ SS+ +Y+ +VI A L ++ ++
Sbjct: 495 DV---------YQ---------------VSSWSEMKEYYNPQDVIEAARLMVSVADKYQG 530
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLAC 409
+N + +DL+D+ RQALA+ + + A++ D VF+L S+ FL L+ D LL
Sbjct: 531 NNNFEFDLVDVLRQALAEKGRLMQKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGT 589
Query: 410 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 469
F +G W+E+A+ Q +E++ YEWNAR QIT W + + LRDY +K W+G+L
Sbjct: 590 RKEFKVGTWIEAARSAGQTQEEKALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGIL 649
Query: 470 RDYYGPRAAIYFKYM 484
+D+Y R YF Y+
Sbjct: 650 KDFYFMRWKAYFDYL 664
>gi|295085509|emb|CBK67032.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
XB1A]
Length = 716
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 155/537 (28%), Positives = 249/537 (46%), Gaps = 54/537 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPA L+ ++P A I LG W R C +L + D LF +I + F+++
Sbjct: 221 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDE 277
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G T HIY D F+E PP PEY+ + + +Y+ + + D A W+ W+F +D
Sbjct: 278 QKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFD 336
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W +MKALL VP K+++LD E +W ++ F+ PYIWC L NF GN +
Sbjct: 337 KDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLT 396
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + A + + G+G ++EG++ Y+ + E A+ + VD WI
Sbjct: 397 GNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECL 455
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R G +++DAW L++ +Y V P T G Y
Sbjct: 456 ADRHVGCVSQSVRDAWKRLFNDIY--------------VQVPR--------TLGTLPGY- 492
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQ 364
+P + K ++ Y + L EV R L NE + + +R DLI + RQ
Sbjct: 493 RPALNKNSEKRTSNVYSNVEL----LEVWRKL-------NEAPSDRRDAFRLDLITVGRQ 541
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
L Y ++ + + D + + E++ D+D L A H L W++ A++
Sbjct: 542 VLGNYFLDVKMEFDRMVEAKDHQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARK 601
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
+ + + + YE NAR IT W L DY ++ W+GL+ DYY R +Y
Sbjct: 602 MGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYIDTF 654
Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE----SNGDALIT-SQWLYNKY 536
I+++ K E ++ W N + V S D L++ S +L++KY
Sbjct: 655 IKAVGEDVEVDQKQLEDELKEIEEGWVNATDRKDVRKDVHSTTDGLLSFSTFLFSKY 711
>gi|329962235|ref|ZP_08300241.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
gi|328530343|gb|EGF57220.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
Length = 726
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 146/480 (30%), Positives = 239/480 (49%), Gaps = 60/480 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRA 63
VLP +SG +P + ++ G W C Y L TD F EI
Sbjct: 225 VLPGYSGMLPHNAKEKL-GVNVSDPGTW----------CGYNRPAFLQPTDTRFGEIAAL 273
Query: 64 FIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL 123
+ E+ + YG+ + Y+ D F E + + G AI+ M+ ++VW++Q W
Sbjct: 274 YYEEMNRLYGK-ADFYSMDPFHEGGKVAGVN--LDAAGQAIWQAMKKNSRNSVWVVQAWG 330
Query: 124 FSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQ-------FYGVPYIWCML 176
+ P+ + ++ +VP G ++VLDL++E +P W + F G +++CML
Sbjct: 331 AN-------PRAQ-MIKNVPRGDMLVLDLYSESRPQWGEPESSWYRENGFDGHQWLYCML 382
Query: 177 HNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHE 235
N+ GN+ ++G + + +A R+S T+ GVGM+MEG E NPV+Y+L+ E+ ++
Sbjct: 383 LNYGGNVGLHGKMQHVIDAYYKASRSSFGNTLKGVGMTMEGSENNPVMYELLCELPWRPS 442
Query: 236 KVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPS 293
W+ Y RYG+ P +++AW +L +++YNC +T + + + A P +
Sbjct: 443 TFSKDEWLEGYIAARYGKCTPRLREAWVLLGNSIYNCPPRSTQQGTHESIFCARPSLK-- 500
Query: 294 IISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 353
+ +A SE S Y P +VIRA LF+ + ++
Sbjct: 501 ----------------AYQASSWSEMSDYYRPQ------DVIRAAGLFLEEAGQFKGNDN 538
Query: 354 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 413
+ YDL+D+TRQA+A+ ++ I +Y+ D + Q S RFLEL+ D LLA F
Sbjct: 539 FEYDLVDITRQAVAEKGRLIYKVIQASYEAGDKPLLRQASDRFLELLLLQDRLLATRPEF 598
Query: 414 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 473
+G W+E A+ L ++ EWNAR QIT W + T + LRDY +K W+GLL+D+Y
Sbjct: 599 KVGRWIEQARNLGHTPAEKDWLEWNARVQITTWGNRTASDRGGLRDYAHKEWNGLLKDFY 658
>gi|374384144|ref|ZP_09641670.1| hypothetical protein HMPREF9449_00056 [Odoribacter laneus YIT
12061]
gi|373228751|gb|EHP51054.1| hypothetical protein HMPREF9449_00056 [Odoribacter laneus YIT
12061]
Length = 835
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 150/518 (28%), Positives = 250/518 (48%), Gaps = 70/518 (13%)
Query: 2 LLADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIG 61
LL + PAF+G VP AL+ ++P KI + W + ++L + LF IG
Sbjct: 223 LLGMKPICPAFAGFVPLALKRLYPDVKIIET-TWAGFHN-------WMLSPEEELFTRIG 274
Query: 62 RAFIEQQLKEYGRTSHIYNCDTFDENT---PPVDSPEYISSL---GAAIYSGMQSGDSDA 115
+ FIE+ KE+G+ + Y D+F+E PP+ + E L G +Y G+++G+ DA
Sbjct: 275 QLFIEEWEKEFGK-NDFYLADSFNEMDVPFPPIGTKERYDMLAFYGEQVYKGIKAGNPDA 333
Query: 116 VWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTS------KQFY 167
VW+MQGW+F Y W ++AL++ VP K+++LDL A+ K +W K F+
Sbjct: 334 VWVMQGWMFGYQRDIWDYETLQALVSKVPDDKMMLLDLAADYNKNVWGNGMNWEFYKGFF 393
Query: 168 GVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDL 226
+++ ++ N G GIL A G +EA S N + G GM+ EG E N VVY++
Sbjct: 394 NKLWVYSVIPNMGGKTGATGILSFYANGHLEALNSPNRGRLFGFGMAPEGTENNEVVYEM 453
Query: 227 MSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVA 286
+ + + ++DVK W+ YS+ RYG++ P + + W L +VY
Sbjct: 454 ICDAGWSSSEIDVKQWLKDYSLCRYGKTCPEMDEVWEGLCKSVYG--------------T 499
Query: 287 FPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGN 346
F D + + G+ S + + ++++ Y RA+E
Sbjct: 500 FTDHPRFLWQLRPGR--------SGKGTVNTDSNFY-------------RAVEKMAECAP 538
Query: 347 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGL 406
+++ S ++ D +++T L L I ++Y + ++ ++F EL E +D L
Sbjct: 539 KMTESPLFKADFLEMTAFYLGGKMEALASAIGKSYLYGNTADALKMQQQFEELGEGLDSL 598
Query: 407 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 466
L H + L W++ A++ E+ + YE NAR +T+W + DY K WS
Sbjct: 599 LESHPVYRLQRWIDFARKHGDTEKLKDYYEMNARRIVTIW-------GPPVSDYACKLWS 651
Query: 467 GLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 504
GL+RDYY PR YF+ E+G + L W +W+
Sbjct: 652 GLIRDYYLPRWREYFR----CKETGSKYDLASWESDWV 685
>gi|453081268|gb|EMF09317.1| glycoside hydrolase family 89 protein [Mycosphaerella populorum
SO2202]
Length = 784
Score = 219 bits (557), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 147/512 (28%), Positives = 253/512 (49%), Gaps = 51/512 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP F+G VP + ++P+A W ++ ++ L+ DPLF + ++FI +
Sbjct: 248 VLPCFTGFVPTQISRLYPNASFVNGSQWNGFQA--QYTNVTFLEPFDPLFTTLQKSFISK 305
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
YG S +Y D ++EN P + Y+ + + +++ D +A+W +QGWLF S
Sbjct: 306 LDAAYGNVSSVYTLDQYNENDPFSGNVTYLEDVASNTIKSLKAADPEAIWFIQGWLFYSA 365
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
FW ++KA L V +++LDLF+E +P W + ++G P+IWC LH++ GN ++
Sbjct: 366 ADFWDEERIKAYLGGVEDKDMLILDLFSESQPQWQRTNSYFGKPWIWCQLHDYGGNQGLH 425
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G ++++ P+ A +E +TMVG+G++MEG E N ++YD++ + A+ E ++ + + +
Sbjct: 426 GQVENVTMNPILALANETSTMVGIGLTMEGQEGNEIIYDILLDQAWTPEPIESAGYFDDW 485
Query: 247 SVRRY--GRSVPAI-QD---AWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
RY +V + QD AW+++ T+YN TD D V + ++ P+ + +
Sbjct: 486 VTSRYHCDDAVAGLPQDLYIAWDMMRQTIYNNTD--IDTAEAVTKSIFELQPNTTGLLDR 543
Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
+ + + +L S H + ++ E + EL +YR+DL+D
Sbjct: 544 TGHHSTRILYDPEILVSAWK-----HFYSASQETPQLWEL-----------ESYRFDLVD 587
Query: 361 LTRQALAKYANEL---FLNIIEAYQLNDAHGVF--QLSRRFLELVEDMDGLLACHDG--F 413
+TRQ LA L F+N+ L + Q R L L+ D+D +L F
Sbjct: 588 ITRQVLANAFYPLYGEFVNMTANSSLPSSSTASAEQTGARMLSLLLDLDSVLEASGNAHF 647
Query: 414 LLGPWLESAKQLAQNEEQEKQ------------YEWNARTQITMWFDNTQEEASLLRDYG 461
L W+ SA+ A E YE+NAR QIT+W + + DY
Sbjct: 648 SLESWIHSARLWAPTETNAADGDNMTAAAIADFYEYNARNQITLWGPGGE-----ISDYA 702
Query: 462 NKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 493
+K W+GL++ YY PR + + + S S DG
Sbjct: 703 SKQWAGLIKTYYVPRWERFVHFTLNSSTSADG 734
>gi|340347658|ref|ZP_08670763.1| alpha-N-acetylglucosaminidase [Prevotella dentalis DSM 3688]
gi|433652542|ref|YP_007296396.1| Alpha-N-acetylglucosaminidase (NAGLU) [Prevotella dentalis DSM
3688]
gi|339608852|gb|EGQ13735.1| alpha-N-acetylglucosaminidase [Prevotella dentalis DSM 3688]
gi|433303075|gb|AGB28890.1| Alpha-N-acetylglucosaminidase (NAGLU) [Prevotella dentalis DSM
3688]
Length = 781
Score = 218 bits (556), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 153/503 (30%), Positives = 232/503 (46%), Gaps = 64/503 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP + G +P + +T G W + L ATDP F EI + +
Sbjct: 244 VLPGYCGMMPHDAKQKL-GLDVTPGGTWNG------YVRPANLSATDPRFDEIADLYYRE 296
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYI--SSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + YG+ SH Y+ D F E + V YI + G + + M+ + A W++QGW +
Sbjct: 297 QTRLYGK-SHYYSMDPFHETSDDV----YIDYAQAGRKLMAAMKRENPKANWVIQGWTEN 351
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNF 179
P A+ + +P G L VLDLF+E +P IW ++ + +++CML NF
Sbjct: 352 PRP--------AMTDGLPAGSLTVLDLFSECRPMFGAPSIWKRAEGYGQHDWLFCMLENF 403
Query: 180 AGNIEMYGILDSI------AFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQ 233
GN+ ++G +D + A P + G+G +MEG E NP++++LMSE+ ++
Sbjct: 404 GGNVGLHGRMDQLIGNFRLATSPQSPLQQARRHLRGIGFTMEGSENNPIMFELMSELPWR 463
Query: 234 HEKVDVKA--------WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIV 285
++V A W+ Y RYG P Q AW +L T+YNC G +
Sbjct: 464 TDEVAQAADARTFRTEWVRGYVKARYGTDDPHAQQAWQLLAETIYNCPAGNNQQG----- 518
Query: 286 AFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASG 345
SI G+P +KS + ++ Y S + A L A+
Sbjct: 519 ----PHESIFD---------GRPSLNNFQVKSWSKMRNY----YEPSATLEAARLMAAAA 561
Query: 346 NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDG 405
+ L +N Y YDL+D+ RQA+ A +++L+ I Y D + S RFL L+ D
Sbjct: 562 DRLKGNNNYEYDLVDIVRQAIDDQARQVYLHAIADYNGFDRRAFSRDSARFLGLLLMQDR 621
Query: 406 LLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYW 465
LL F LG W E+A+ L ++ YEWNAR QIT W + + LRDY +K W
Sbjct: 622 LLGTRREFRLGRWTEAARSLGTTPAEKDLYEWNARVQITTWGNRACADQGGLRDYAHKEW 681
Query: 466 SGLLRDYYGPRAAIYFKYMIESL 488
GLL D+Y R Y + +
Sbjct: 682 QGLLADFYYMRWHTYLDALSRQM 704
>gi|380694112|ref|ZP_09858971.1| alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
Length = 736
Score = 218 bits (555), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 146/503 (29%), Positives = 240/503 (47%), Gaps = 73/503 (14%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAF 64
V P ++G VP + +LG + + +WC L D F +
Sbjct: 236 VFPGYAGMVPRNIGE--------KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMY 285
Query: 65 IEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
E+ K YG+ + Y+ D F E NT VD ++ G +I M+ + +AVW+MQ W
Sbjct: 286 YEELEKLYGKAKY-YSMDPFHEGGNTEGVD----LAKAGTSIMGAMKKANPEAVWVMQAW 340
Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCM 175
+ +A++N++ G L+VLDL++E P +W K F +++CM
Sbjct: 341 --------QANPREAMVNTLDSGDLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCM 392
Query: 176 LHNFAGNIEMYGILDSIAFGPVEARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 234
L NF GN+ ++G ++ + G A N T+ GVG + EGIE NP++++L+ E+ ++
Sbjct: 393 LLNFGGNVGLHGRMEQLVNGYYNACAHINGKTLRGVGATPEGIENNPMMFELLYELPWRE 452
Query: 235 EKVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPS 293
E+ W+ Y RYG + P + +AW L HTVYN
Sbjct: 453 ERFSPDIWLQGYLKARYGDDLSPEVTEAWRALEHTVYNAP-------------------- 492
Query: 294 IISVTEGKYQNYGKPVSKEAVLKS-------ETSSYDHPHLWYSTSEVIRALELFIASGN 346
+NY + E++L + TS++ + L+YS +A +L ++ +
Sbjct: 493 ---------KNYQGEGTVESLLCARPGFHLDRTSTWGYAKLFYSPDSTAKAAQLLLSVAD 543
Query: 347 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGL 406
+N + YDL+D+ RQ+LA AN L I ++Y D + +++FL L+ D L
Sbjct: 544 RYKGNNNFEYDLVDIVRQSLADKANVLLEEISQSYDRKDKDSFRKQTQQFLGLILSQDSL 603
Query: 407 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 466
L+ F + WL +A+ L EE++K YEWNA IT+W D+ L DY ++ WS
Sbjct: 604 LSTRKEFSVSSWLSAARSLGTTEEEKKLYEWNASALITVWGDSIAANQGGLHDYSHREWS 663
Query: 467 GLLRDYYGPRAAIYFKYMIESLE 489
GLL+D Y R +F+ + L+
Sbjct: 664 GLLKDLYYQRWNTFFEQKQQELD 686
>gi|410097657|ref|ZP_11292638.1| hypothetical protein HMPREF1076_01816 [Parabacteroides goldsteinii
CL02T12C30]
gi|409223747|gb|EKN16682.1| hypothetical protein HMPREF1076_01816 [Parabacteroides goldsteinii
CL02T12C30]
Length = 740
Score = 218 bits (555), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 153/544 (28%), Positives = 250/544 (45%), Gaps = 66/544 (12%)
Query: 11 AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK 70
FSG VP + FP AKI + +W+ + C LD DPLF E+G+ F+E++ K
Sbjct: 235 GFSGAVPRKMMEKFPEAKIQKQPDWYGFEG----ICQ--LDPLDPLFTELGKTFLEEEQK 288
Query: 71 EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW 130
YG T +Y D F E+ PPVD+PEY++++G++I+ M++ D DA+W+MQ W F D
Sbjct: 289 LYG-TYGLYAADPFHESKPPVDTPEYLNAVGSSIHKLMKTFDPDALWVMQAWSFRKD--- 344
Query: 131 RPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 190
+ + VP L+VL L + F ++ LHNF G + ++G L
Sbjct: 345 -------IASVVPKHDLLVLSLNGAL----GGEDHFCNHDFVVGNLHNFGGRVNLHGDLP 393
Query: 191 SIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRR 250
++ + +VG G+ ME I QNPV Y+L EM + V ++ W+N+Y+ RR
Sbjct: 394 LVSSNQFMKAKQKTPNVVGSGLFMESIGQNPVFYELAFEMPVHQDSVKLEEWLNKYAERR 453
Query: 251 YGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS 310
YG A AW +L Y + + +I A P VD G + P
Sbjct: 454 YGAFSDAANKAWELLLAGPYRAGTNGVESS-SIICARPAVDVK----KSGPNAGFNIPYD 508
Query: 311 KEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYA 370
++++++E + +L S YR+D++D+ RQ ++
Sbjct: 509 PQSLIEAEVC--------------------LLQDAEQLKGSGPYRFDIVDVQRQIMSNLG 548
Query: 371 NELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 430
E+ EA++ D S RFLEL++D+D LL F WL A+ +E
Sbjct: 549 QEIHKKAAEAFKKKDKEAFALHSGRFLELLKDVDILLRTRTEFNFDQWLTDARAWGTTDE 608
Query: 431 QEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 490
+ +E NA + +T+W DY + W+GL+ YY R ++ + L++
Sbjct: 609 ERNLFEKNASSLVTIWGGQVDVRQF---DYSWREWTGLIEGYYLQRWKQFYDMLQGHLDN 665
Query: 491 GDGFRLKDWR----------REWIKLTNDWQ------NGRNVYPVESNGDALITSQWLYN 534
G +R +D + E+ DW+ G+ PV + GD + ++ + +
Sbjct: 666 GTIYREEDAKMDLGRQAFRANEFYDSLADWELAFVDRPGKARTPV-TEGDEVAVARRMLD 724
Query: 535 KYLQ 538
KY Q
Sbjct: 725 KYKQ 728
>gi|323344412|ref|ZP_08084637.1| alpha-N-acetylglucosaminidase [Prevotella oralis ATCC 33269]
gi|323094539|gb|EFZ37115.1| alpha-N-acetylglucosaminidase [Prevotella oralis ATCC 33269]
Length = 730
Score = 218 bits (554), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 150/531 (28%), Positives = 239/531 (45%), Gaps = 46/531 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPA L+ +FP A I LG W + ++ C + L +PLF +I + ++E+
Sbjct: 235 VLPAFAGHVPAELKRIFPDANIKSLGKWGGFEE--KYLC-HFLSPEEPLFSKIQKLYLEE 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q +G T HIY D F+E PP P Y+ + +Y + + D A W+ GW+FSYD
Sbjct: 292 QTALFG-TDHIYGVDPFNEVEPPSWEPAYLRKVSKNMYGTLTAVDPKAEWMQMGWMFSYD 350
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P +++A L VP GK+ +LD + E +W T+ FYG PYIWC L NF GN +
Sbjct: 351 NKHWTPDRVQAFLTGVPKGKMSLLDYYCENVELWKTTDGFYGQPYIWCYLGNFGGNTTLM 410
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G + A + M+G G ++EG++ Y+ + + H D + WI+
Sbjct: 411 GNVKESGRRLDNALANGQRNMLGAGSTLEGLDVIQFPYEYLYNKLWSHAVADSR-WIDDL 469
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ R YG P+++ AW++L++ +Y + + +G N+
Sbjct: 470 ADRHYGGVSPSVRKAWHILFNDIY---------------------VQVSASMQGVLTNF- 507
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIR-ALELFIASGNELSASNTYRYDLIDLTRQA 365
+P Y L EV R L++ NEL + D+I + RQ
Sbjct: 508 RPALNNNYPHRTAIEYPAERL----EEVWRLLLDVPRCDRNEL------QLDIIAVGRQV 557
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
L + AY D + + EL+ D+D L + + + W++ A++L
Sbjct: 558 LGNRFAVVKTQFDSAYANKDIPRLKAKACEMEELLGDLDRLTSFNSRCSINRWIDDARKL 617
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
+E + YE NAR IT W N + DY ++ W GL+ YY R +Y ++
Sbjct: 618 GSTKELKDYYEKNARNLITTWGGN-------INDYASRTWGGLIGSYYAHRWRLYIDDIL 670
Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 536
+ E+ F + + K W V D L + L KY
Sbjct: 671 AAAEANKEFDQNAFNEKVSKFEQAWIISTEPITVPKRTDLLTFCRILIQKY 721
>gi|156046298|ref|XP_001589681.1| hypothetical protein SS1G_09403 [Sclerotinia sclerotiorum 1980]
gi|154693798|gb|EDN93536.1| hypothetical protein SS1G_09403 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 795
Score = 218 bits (554), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 156/520 (30%), Positives = 260/520 (50%), Gaps = 72/520 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP+AL+ + P+A I G+W ++ T+L TDPLF + F+
Sbjct: 245 VLPAFTGFVPSALRRIAPNANIINGGDWGNIFPVEYSNDTFLY-PTDPLFTTLQHKFLSF 303
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
Q + YG +HIY D ++EN P Y+ ++ Y +QS D AVW++QGWLF S
Sbjct: 304 QSEYYGNVTHIYTLDQYNENNPASGDLSYLRNVSRGTYESLQSFDPCAVWMLQGWLFYSL 363
Query: 127 DPFWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
FW +++A + VP + +++LDLF+E P W + +YG P+IWC L ++ G + +
Sbjct: 364 SSFWTQDRIEAYIGGVPKNESMLILDLFSESFPQWERTHYYYGKPWIWCQLRDYGGTLGL 423
Query: 186 YGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDV----K 240
YG + +I +EA R SEN MVGVG +MEG N ++Y+L+ + A+ + +D K
Sbjct: 424 YGQIYNITNSLIEAFRESEN--MVGVGNTMEGQGGNGLMYELLLDQAWNIDPIDTEDYFK 481
Query: 241 AWINQ-YSVRRYGRSVPA-IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVT 298
+W+ + Y ++ + +P I +AW++L T YN T+ + V + ++ P+I
Sbjct: 482 SWVRKRYHIKGAKKRLPGEIYEAWDILRRTAYNNTNLTLADS--VPKSLHELQPNIT--- 536
Query: 299 EGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI---ASGNELSASNTYR 355
+N+G+ ++S+ D Y ++ RA EL S EL ++
Sbjct: 537 ----ENHGR--------LGQSSTIDL----YDPDDLFRAWELLYNASVSVPELWEDKGWK 580
Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
+D++D+TRQ LA+ ++ +IE Y+ + + ++E +D +L+ F L
Sbjct: 581 FDMVDITRQVLAERFKLEYVELIEKYK--KGADISCDGDILIGILESLDDVLSASPHFRL 638
Query: 416 GPWLESAKQ----------------------------LAQNEEQEKQ-YEWNARTQITMW 446
W+ +A L N +Q + +NA QIT+W
Sbjct: 639 DTWVNAAVSSAPLPASTNCSSTSINNSSLLFNSSTSILTSNLTPTQQFFAYNAINQITIW 698
Query: 447 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
Q + DY +K W GL+R YY PR ++ +Y+ E
Sbjct: 699 GPTGQ-----IDDYASKSWGGLVRGYYLPRWKMFLEYIDE 733
>gi|427385205|ref|ZP_18881710.1| hypothetical protein HMPREF9447_02743 [Bacteroides oleiciplenus YIT
12058]
gi|425727373|gb|EKU90233.1| hypothetical protein HMPREF9447_02743 [Bacteroides oleiciplenus YIT
12058]
Length = 719
Score = 218 bits (554), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 150/549 (27%), Positives = 261/549 (47%), Gaps = 76/549 (13%)
Query: 8 VLPAFSGNVPA-ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
V P +SG VP A + + + + L N F+ + L TD F EI +
Sbjct: 227 VFPGYSGMVPHDADEKLGLNLTKSDLWNGFTRPA--------FLQPTDTRFAEIANLYYR 278
Query: 67 QQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 124
+Q K +G+ + Y+ D F E N VD + G AI M+ + A W++QGW
Sbjct: 279 EQEKLFGKADY-YSMDPFHEAENAASVD----FDAAGKAIMQAMKKVNPKATWVVQGW-- 331
Query: 125 SYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW----------- 173
+ +P RP ++ + N G L++LDLF+E +P+W G+P IW
Sbjct: 332 TENP--RPEMIENMKN----GDLLILDLFSECRPMW-------GIPSIWKRDKGYEQHDW 378
Query: 174 --CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEM 230
CML NF GN+ ++G +D + + + + T + G+G++MEG E NPV+++LM E+
Sbjct: 379 LFCMLLNFGGNVGLHGRMDQLLDNFYQTKDNPLATHLKGIGLTMEGSENNPVMFELMCEL 438
Query: 231 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFP 288
++ EK + W+ Y RYG I+ AW +L +++YNC G + + + P
Sbjct: 439 PWRPEKFTKEEWLKDYLFARYGVKDEKIEKAWTLLANSIYNCPFGNNQQGPHESIFCGRP 498
Query: 289 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 348
++ + + + K +NY P E A L + ++
Sbjct: 499 SMN-NFQASSWSKMKNYYDPTVTE-----------------------EAARLMLEVADKY 534
Query: 349 SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLA 408
+N + YDL+D+ RQ+L+ ++ I ++ D + S++FL+++ D LL
Sbjct: 535 RGNNNFEYDLVDIVRQSLSDKGRIVYNQTIADFKSFDKRSFARDSQKFLDILLLQDRLLG 594
Query: 409 CHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGL 468
F +G W+E A+ L E++ YEWNAR QIT W + + LRDY +K W+G+
Sbjct: 595 TRSEFRVGRWIEQARNLGTTPEEKDLYEWNARVQITTWGNRVCADDGGLRDYAHKEWNGI 654
Query: 469 LRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALIT 528
LRD+Y R A Y++ + + L+ +L ++ + W + Y G+++
Sbjct: 655 LRDFYYKRWAAYWQTLQDQLDGKPEVKL-----DYYAMEEPWTLAKTPYDSTPEGNSVDV 709
Query: 529 SQWLYNKYL 537
++ ++ K +
Sbjct: 710 AKEVFEKAM 718
>gi|262406054|ref|ZP_06082604.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|294648118|ref|ZP_06725661.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
gi|294806859|ref|ZP_06765684.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
gi|345510559|ref|ZP_08790126.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
gi|229443271|gb|EEO49062.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
gi|262356929|gb|EEZ06019.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|292636502|gb|EFF54977.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
gi|294445888|gb|EFG14530.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
Length = 718
Score = 218 bits (554), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 148/531 (27%), Positives = 255/531 (48%), Gaps = 73/531 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+ PAF+G VP A P + + W + Y+L P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEAFAQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
KE+G ++ Y D+F+E P+D + ++ G IY + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLTEYGETIYKSITAGNPDAVWVTQG 348
Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
W F Y FW +KALL++VP K++++DL + K +W+T + FYG +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408
Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+ NF G M G LD A V+A R + ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
+ +D+ W+ Y RYG A+++AW + T Y+ + ++P
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSS-----------LYSYPRFTW 517
Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
+ + + +SK + + + ++A+ L+ + +EL +S
Sbjct: 518 QTVVPDQRR-------ISKIDL----------------SDDYLQAIRLYASCADELKSSE 554
Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
YR DLI+ LA A + ++ N + ++ ++L+ D+D LLA H
Sbjct: 555 LYRNDLIEFVSYYLAAKAENFYKQALKDDSENRVFAAQRNLQQTVDLLMDVDRLLASHPL 614
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
+ L W+E A+ +++ YE NA+ IT W + DY ++WSGL++DY
Sbjct: 615 YRLEEWVEFARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667
Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
Y PR +YF + D ++++W +WI T+ W N + PV++
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSNSTTPFDDPVKA 708
>gi|168216263|ref|ZP_02641888.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens NCTC 8239]
gi|182381741|gb|EDT79220.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens NCTC 8239]
Length = 2104
Score = 218 bits (554), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 155/528 (29%), Positives = 256/528 (48%), Gaps = 51/528 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL +SG VP + P A+ G W P TY+ + F ++ F E+
Sbjct: 415 VLQGYSGMVPRDFKEKNPEAQTISQGGWCGFDR-PDMLKTYVNEEEVDYFQKVADVFYEK 473
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + +G ++ Y D F E NT +D+ + I + M D+DAVW++Q W +
Sbjct: 474 QKEVFGDVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 529
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
P L G+ +VLDLF+EV P W+ ++ +P+IW MLHNF G + M
Sbjct: 530 -------PSNNKLEGLTKKGQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 581
Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
+ +A P SE+ MVG+G++ E I NP+ Y+L+ +MA+ ++++ + W
Sbjct: 582 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTE 639
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
Y RRYG++ I +AWN++ T Y K D + SII+ G
Sbjct: 640 DYIERRYGKTNKEILEAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 685
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+G +KS S++ H + Y SE +A+E+F + +E S+ + YD D+ +Q
Sbjct: 686 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQ 736
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
LA A E + + AY + +S +FLEL++ + +L+ FL+G W+E A+
Sbjct: 737 LLANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 796
Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
+ ++ + + +E+NAR +T W + L+DY N+ WSGL DYY R +
Sbjct: 797 MLKDSDDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWIN 856
Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
+ L+ G K +W K+ DW N ++ +YP E++ + L
Sbjct: 857 GLQTELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 900
>gi|422345314|ref|ZP_16426228.1| hypothetical protein HMPREF9476_00301 [Clostridium perfringens
WAL-14572]
gi|373228039|gb|EHP50349.1| hypothetical protein HMPREF9476_00301 [Clostridium perfringens
WAL-14572]
Length = 1842
Score = 218 bits (554), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 155/528 (29%), Positives = 255/528 (48%), Gaps = 51/528 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL +SG VP + P A+ G W P TY+ + F ++ F E+
Sbjct: 415 VLQGYSGMVPRDFKEKNPEAQTISQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEK 473
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + +G ++ Y D F E NT +D+ + I + M D+DAVW++Q W +
Sbjct: 474 QKEVFGDVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 529
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
P L + +VLDLF+EV P W+ ++ +P+IW MLHNF G + M
Sbjct: 530 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 581
Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
+ +A P SE+ MVG+G++ E I NP+ Y+L+ +MA+ ++++ + W
Sbjct: 582 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTE 639
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
Y RRYG++ I DAWN++ T Y K D + SII+ G
Sbjct: 640 DYIERRYGKTNKEILDAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 685
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+G +KS S++ H + Y SE +A+E+F + +E S+ + YD D+ +Q
Sbjct: 686 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQ 736
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
LA A E + + AY + +S +FLEL++ + +L+ FL+G W+E A+
Sbjct: 737 LLANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 796
Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
+ ++ + + +E+NAR +T W + L+DY N+ WSGL DYY R +
Sbjct: 797 MLKDSDDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWIN 856
Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
+ L+ G K +W K+ DW N ++ +YP E++ + L
Sbjct: 857 GLQTELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 900
>gi|168207628|ref|ZP_02633633.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens E str. JGS1987]
gi|170661027|gb|EDT13710.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens E str. JGS1987]
Length = 2104
Score = 217 bits (553), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 155/528 (29%), Positives = 255/528 (48%), Gaps = 51/528 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL +SG VP + P A+ G W P TY+ + F ++ F E+
Sbjct: 415 VLQGYSGMVPRDFKEKNPEAQTISQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEK 473
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + +G ++ Y D F E NT +D+ + I + M D+DAVW++Q W +
Sbjct: 474 QEEVFGEVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 529
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
P L + +VLDLF+EV P W+ ++ +P+IW MLHNF G + M
Sbjct: 530 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 581
Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
+ +A P SE+ MVG+G++ E I NP+ Y+L+ +MA+ ++++ + W
Sbjct: 582 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTE 639
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
Y RRYG++ I DAWN++ T Y K D + SII+ G
Sbjct: 640 DYIERRYGKTNKEILDAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 685
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+G +KS S++ H + Y SE +A+E+F + +E S+ + YD D+ +Q
Sbjct: 686 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQ 736
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
LA A E + + AY + +S +FLEL++ + +L+ FL+G W+E A+
Sbjct: 737 LLANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 796
Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
+ ++ + + +E+NAR +T W + L+DY N+ WSGL DYY R +
Sbjct: 797 MLKDSDDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWIN 856
Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
+ L+ G K +W K+ DW N ++ +YP E++ + L
Sbjct: 857 GLQTELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 900
>gi|340514474|gb|EGR44736.1| glycoside hydrolase family 89 [Trichoderma reesei QM6a]
Length = 762
Score = 217 bits (553), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 142/490 (28%), Positives = 250/490 (51%), Gaps = 42/490 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF G VP A V P A++ W + LD DPLF ++ R+FI +
Sbjct: 245 ILPAFPGFVPRAAPRVLPDARLLHSIQWAGFPE--IFTEDTFLDPVDPLFAQMQRSFITK 302
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YG ++ Y D F+E PP Y+ ++ + + ++S D +A+W+ Q WLF+ +
Sbjct: 303 QKQAYGNVTNFYTLDQFNEMIPPSGDVAYLRNVSSNTWKALKSADPNAIWVFQAWLFAQN 362
Query: 128 -PFWRPPQMKALLNSVPL-GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
FW +++A L V +++LD+++E P W ++ +YG P+IWC L N+ I +
Sbjct: 363 TTFWTNERIEAYLGGVTADSDMLILDIWSESMPQWQRAQSYYGKPWIWCELQNYGATINL 422
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG + ++ P+ A E+T++ G G+SMEG + N +VYDL+ A+ E +D +A+ +
Sbjct: 423 YGQIQNVTNSPILA-LQESTSLSGFGLSMEGQQNNEIVYDLLLAQAWSSEPLDTEAYFHN 481
Query: 246 YSVRRYG--RSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
++ RY + I DAW + TVY+ T+ + P V SII +
Sbjct: 482 WASARYSSDQRPGFIHDAWETVRTTVYDNTN---------LTLMPSVPKSIIEL------ 526
Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASG---NELSASNTYRYDLID 360
V + + + ++ + L Y + ++ A + +G L ++ Y+YDL+D
Sbjct: 527 -----VPRTSNM-ADITGILGTKLPYDPAVMVSAWKQLYHAGLQDTSLFNNSAYQYDLVD 580
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSR------RFLELVEDMDGLLACHDGFL 414
TRQ LA ++ NI++ Y ++ ++ R + +L+ +D +L+ + F
Sbjct: 581 WTRQVLANAFIPIYKNIVDIYYNSNQTAGSRIQRLKAQGQQVTKLLLSLDLVLSSNRNFR 640
Query: 415 LGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 474
L WL +A+ A + +E+ AR QIT+W + Q L DY +K WSGL++ Y+
Sbjct: 641 LSTWLSAARSSAPSPAYVDSFEYEARNQITLWGPSGQ-----LIDYASKAWSGLMKTYHL 695
Query: 475 PRAAIYFKYM 484
R ++ +Y+
Sbjct: 696 KRWQMFVEYL 705
>gi|169346867|ref|ZP_02865815.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens C str. JGS1495]
gi|169296926|gb|EDS79050.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens C str. JGS1495]
Length = 2104
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 155/528 (29%), Positives = 255/528 (48%), Gaps = 51/528 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL +SG VP + P A+ G W P TY+ + F ++ F E+
Sbjct: 415 VLQGYSGMVPRDFKEKNPEAQTISQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEK 473
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + +G ++ Y D F E NT +D+ + I + M D+DAVW++Q W +
Sbjct: 474 QKEVFGDVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 529
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
P L + +VLDLF+EV P W+ ++ +P+IW MLHNF G + M
Sbjct: 530 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 581
Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
+ +A P SE+ MVG+G++ E I NP+ Y+L+ +MA+ ++++ + W
Sbjct: 582 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTE 639
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
Y RRYG++ I DAWN++ T Y K D + SII+ G
Sbjct: 640 DYIERRYGKTNKEILDAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 685
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+G +KS S++ H + Y SE +A+E+F + +E S+ + YD D+ +Q
Sbjct: 686 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQ 736
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
LA A E + + AY + +S +FLEL++ + +L+ FL+G W+E A+
Sbjct: 737 LLANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 796
Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
+ ++ + + +E+NAR +T W + L+DY N+ WSGL DYY R +
Sbjct: 797 MLKDSDDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWIN 856
Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
+ L+ G K +W K+ DW N ++ +YP E++ + L
Sbjct: 857 GLQAELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 900
>gi|225875033|ref|YP_002756492.1| alpha-N-acetylglucosaminidase [Acidobacterium capsulatum ATCC
51196]
gi|225793771|gb|ACO33861.1| alpha-N-acetylglucosaminidase [Acidobacterium capsulatum ATCC
51196]
Length = 800
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 151/521 (28%), Positives = 238/521 (45%), Gaps = 45/521 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V P + G VP P A + GNW + P W LD DPLF + +F +
Sbjct: 278 VFPGYFGMVPEDFARRHPGAHVIPQGNWNGFRR-PAW-----LDPRDPLFAAVAASFYKH 331
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + +G +S IY+ + F E D P +SS AI + A+W+ W +
Sbjct: 332 QQELFGDSS-IYDIELFQEGGSAADVP--VSSAAKAIQKALLRAHPQAMWMTLAW--QNN 386
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
P +ALL++V L+V+D+ P + + F G Y++ L +F G +
Sbjct: 387 P------SRALLSAVDRSHLLVVDIDQGRTPHENRERDFMGAAYLFGGLWDFGGRTTLGA 440
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
L A +TM G + EG++ NP +DL +EMA++ VD++ W +Y+
Sbjct: 441 NLYDYAVRLPRMGLRAGSTMKGTALFSEGLDNNPAAFDLFTEMAWRTSPVDLRTWSREYA 500
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNC-TDGATDKN-RDVIV-AFPDVDPSIISVTEGKYQN 304
RRYG P + AW +L T Y DG ++ RD + D PS+ +V+
Sbjct: 501 RRRYGMDDPHTRRAWRILMETAYGTRADGVSNHGERDAPPESLFDAQPSLDAVS------ 554
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
SS+ L Y + AL + + + TY+YDL+D+ RQ
Sbjct: 555 --------------ASSWSPDRLRYDPKKFEAALTELLQAPPGMREMPTYQYDLVDVARQ 600
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
LA ++ + I +AY L +++L ++ D LLA + F++GPWL +
Sbjct: 601 TLANWSRKTLPEIKDAYDHRHEARFETLEKQWLCMMMLQDKLLATNTSFMVGPWLNAVSP 660
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
A ++++ +++AR+ +T W + T EA LRDYGNK W+GL RDYY R IYF +
Sbjct: 661 WAATATEQRRLDYDARSILTTWGNRTASEAG-LRDYGNKDWAGLTRDYYYRRWQIYFNDL 719
Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDA 525
SL++G D W W + Y ++ GD+
Sbjct: 720 DRSLKTGTPPHPID----WFAFGEKWNRAQTHYATQARGDS 756
>gi|182624959|ref|ZP_02952737.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens D str. JGS1721]
gi|177909756|gb|EDT72174.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens D str. JGS1721]
Length = 2104
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 155/528 (29%), Positives = 255/528 (48%), Gaps = 51/528 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL +SG VP + P A+ G W P TY+ + F ++ F E+
Sbjct: 415 VLQGYSGMVPRDFKEKNPEAQTISQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEK 473
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + +G ++ Y D F E NT +D+ + I + M D+DAVW++Q W +
Sbjct: 474 QKEVFGDVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 529
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
P L + +VLDLF+EV P W+ ++ +P+IW MLHNF G + M
Sbjct: 530 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 581
Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
+ +A P SE+ MVG+G++ E I NP+ Y+L+ +MA+ ++++ + W
Sbjct: 582 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTE 639
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
Y RRYG++ I DAWN++ T Y K D + SII+ G
Sbjct: 640 DYIERRYGKTNKEILDAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 685
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+G +KS S++ H + Y SE +A+E+F + +E S+ + YD D+ +Q
Sbjct: 686 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQ 736
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
LA A E + + AY + +S +FLEL++ + +L+ FL+G W+E A+
Sbjct: 737 LLANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 796
Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
+ ++ + + +E+NAR +T W + L+DY N+ WSGL DYY R +
Sbjct: 797 MLKDSDDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWIN 856
Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
+ L+ G K +W K+ DW N ++ +YP E++ + L
Sbjct: 857 GLQTELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 900
>gi|168212494|ref|ZP_02638119.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens CPE str. F4969]
gi|170716100|gb|EDT28282.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens CPE str. F4969]
Length = 2104
Score = 217 bits (552), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 155/528 (29%), Positives = 255/528 (48%), Gaps = 51/528 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL +SG VP + P A+ G W P TY+ + F ++ F E+
Sbjct: 415 VLQGYSGMVPRDFKEKNPEAQTISQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEK 473
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + +G ++ Y D F E NT +D+ + I + M D+DAVW++Q W +
Sbjct: 474 QKEVFGDVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 529
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
P L + +VLDLF+EV P W+ ++ +P+IW MLHNF G + M
Sbjct: 530 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 581
Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
+ +A P SE+ MVG+G++ E I NP+ Y+L+ +MA+ ++++ + W
Sbjct: 582 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTE 639
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
Y RRYG++ I DAWN++ T Y K D + SII+ G
Sbjct: 640 DYIERRYGKTNKEILDAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 685
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+G +KS S++ H + Y SE +A+E+F + +E S+ + YD D+ +Q
Sbjct: 686 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQ 736
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
LA A E + + AY + +S +FLEL++ + +L+ FL+G W+E A+
Sbjct: 737 LLANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 796
Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
+ ++ + + +E+NAR +T W + L+DY N+ WSGL DYY R +
Sbjct: 797 MLKDADDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWIN 856
Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
+ L+ G K +W K+ DW N ++ +YP E++ + L
Sbjct: 857 GLQTELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 900
>gi|423214208|ref|ZP_17200736.1| hypothetical protein HMPREF1074_02268 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693153|gb|EIY86388.1| hypothetical protein HMPREF1074_02268 [Bacteroides xylanisolvens
CL03T12C04]
Length = 718
Score = 217 bits (552), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 148/531 (27%), Positives = 255/531 (48%), Gaps = 73/531 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+ PAF+G VP A P + + W + Y+L P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEAFAQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
KE+G ++ Y D+F+E P+D + ++ G IY + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLAGYGETIYKSIAAGNPDAVWVTQG 348
Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
W F Y FW +KALL++VP K++++DL + K +W+T + FYG +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408
Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+ NF G M G LD A V+A R + ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNAMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
+ +D+ W+ Y RYG A+++AW + T Y+ + ++P
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSS-----------LYSYPRFTW 517
Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
+ + + +SK + + + ++A+ L+ + +EL +S
Sbjct: 518 QTVVPDQRR-------ISKIDL----------------SDDYLQAIRLYASCADELKSSE 554
Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
YR DLI+ +A A + ++ N + ++ ++L+ D+D LLA H
Sbjct: 555 LYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPL 614
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
+ L W+E A+ +++ YE NA+ IT W + DY ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667
Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
Y PR +YF + D ++++W +WI T+ W N + PVE+
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSNSTTPFDDPVEA 708
>gi|224027030|ref|ZP_03645396.1| hypothetical protein BACCOPRO_03789 [Bacteroides coprophilus DSM
18228]
gi|224020266|gb|EEF78264.1| hypothetical protein BACCOPRO_03789 [Bacteroides coprophilus DSM
18228]
Length = 837
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 153/515 (29%), Positives = 236/515 (45%), Gaps = 73/515 (14%)
Query: 8 VLPAFSGNVPAALQNVFPSAKI--TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
+ P F G +P A + ++P I T G F +++ T+PLF +I AFI
Sbjct: 230 ICPGFPGFIPEAFKRIYPDLHIVETHWGGAFH---------NWMISPTEPLFAKISEAFI 280
Query: 66 EQQLKEYGRTSHIYNCDTFDENTPPVDSP------EYISSLGAAIYSGMQSGDSDAVWLM 119
++ KE+G+ + Y D+F+E P E +S G +YS ++ + DAVW+M
Sbjct: 281 KEWEKEFGKCDY-YLVDSFNEMDIPFPEKGNPARYEMAASYGEKVYSSIKRANKDAVWVM 339
Query: 120 QGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDL-------FAEVKPIWSTSKQFYGVPY 171
QGW+F Y W + AL++ VP K+++LDL F + W K FY +
Sbjct: 340 QGWMFGYQRHIWDYETLGALVSRVPDDKMLLLDLAVDYNRHFWHSEVNWEYYKGFYNKQW 399
Query: 172 IWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEM 230
++ ++ N G M G+LD A G +EA +S N +V G++ EGIE N V+Y+L+++
Sbjct: 400 VYSVIPNMGGKTGMTGVLDFYANGHLEALSSSNRGNLVAHGLAPEGIENNEVLYELVTDA 459
Query: 231 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDV 290
+ ++DV+ W+ QYS+ RYG++ + AW+ L +VY F D
Sbjct: 460 GWSDHRMDVRDWLKQYSINRYGKAPAQLMKAWDYLLKSVYG--------------TFTDH 505
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
G +N +S + + LE F+A+ EL
Sbjct: 506 PRFNWQFRPGLVKNGSINIS---------------------DDYFKGLESFVAASEELKD 544
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
S Y DL ++T L A L I + Y L D L RF + MD +L+ H
Sbjct: 545 SPYYLTDLCEMTAHYLGSKAEILTRQIDQEYLLGDTLQAHFLQSRFETFMLGMDRILSQH 604
Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
L W+ A + A+ E Q KQYE NAR +T+W + DY + WSGL+
Sbjct: 605 PTLRLDRWVSFASKAARTEAQRKQYEMNARRIVTVW-------GPPVDDYSARMWSGLVG 657
Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 505
YY R Y+K +SG L W R+W++
Sbjct: 658 SYYLGRWKEYYK----GRDSGKSADLSSWERKWVE 688
>gi|299140550|ref|ZP_07033688.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
(NAG) [Prevotella oris C735]
gi|298577516|gb|EFI49384.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
(NAG) [Prevotella oris C735]
Length = 741
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 146/490 (29%), Positives = 232/490 (47%), Gaps = 49/490 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP F G +P + +T G W L TD F +I + +
Sbjct: 223 VLPGFCGMMPHDAKAKL-GLNVTDGGIWNGYTRPAN------LSPTDAHFDKIADLYYAE 275
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
K YG+ ++ Y+ D F E T ++ +Y S G + M+ + A W++QGW + +
Sbjct: 276 LTKLYGKANY-YSMDPFHE-TNDDETIDY-SKAGCKVMEAMKRVNPKATWVIQGW--TEN 330
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAG 181
P RP +K + N G L+VLDLF+E +P IW K + +++CML NF
Sbjct: 331 P--RPQMIKNMKN----GDLLVLDLFSECRPMFGIPSIWKREKGYEQHDWLFCMLENFGA 384
Query: 182 NIEMYGILDSIAFGPVEARTSENTT--MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDV 239
N+ ++G +D + + S T + G+G +MEG E NPV+++LMSE+ ++ E
Sbjct: 385 NVGLHGRMDQLLHNFYSTKQSSPNTQHLKGIGFTMEGSENNPVMFELMSELPWRTE-CKK 443
Query: 240 KAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
+ WI Y RYG++ P I+ AW +L T+YNC G + SI
Sbjct: 444 EDWIKGYVKARYGKTSPEIERAWQLLSETIYNCPAGNNQQG---------PHESIFC--- 491
Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
G+P +KS + ++ Y + A +L ++ +N + YDL+
Sbjct: 492 ------GRPSLNNFQVKSWSKMRNY----YDPQATLEAAQLMTGIADQYKGNNNFEYDLV 541
Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
D+ RQALA +L I Y + + RFLE++ D LL F LG W
Sbjct: 542 DICRQALADQGRLQYLKTIADYNGFSRKAFAKDAHRFLEMILLQDKLLGTRTEFRLGHWT 601
Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
E+A++L ++++ YEWNAR QIT W + + L DY +K W G+L+D+Y R I
Sbjct: 602 EAARKLGTTQQEKDLYEWNARVQITTWGNRICADKGGLHDYAHKEWQGILKDFYYKRWKI 661
Query: 480 YFKYMIESLE 489
+ + + +E
Sbjct: 662 FMDALAKQME 671
>gi|322703040|gb|EFY94656.1| alpha-N-acetylglucosaminidase, putative [Metarhizium anisopliae
ARSEF 23]
Length = 774
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 146/507 (28%), Positives = 254/507 (50%), Gaps = 51/507 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF G VP +++ V P+A +T NWF+ D ++ LD D + E+ + F+ +
Sbjct: 244 VLPAFPGFVPESIKKVRPNANLTVSPNWFAPAPD-KYTRDLFLDPLDDTYAELQKLFVTK 302
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSY 126
Q+ +G +++Y D F+E +P Y+ + Y+G+ + + AVWL+QGWL FS
Sbjct: 303 QIDAFGNVTNVYTLDQFNELSPASGDTAYLRGIARNTYAGLTAANPAAVWLLQGWLFFSS 362
Query: 127 DPFWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
FW P++ A L V + ++VLDL++EV P W + + G P+IWC LH+F GN+ +
Sbjct: 363 RNFWTQPRIDAYLGGVEDHQGMLVLDLYSEVNPQWQRTNSYSGKPWIWCQLHDFGGNMAL 422
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
G + ++ P++A +++ ++VG G++ E E N VVYD++ + A+ +D +A+
Sbjct: 423 EGRVQTLTSAPIDA-LAQSKSLVGFGLTPEAYEGNEVVYDILLDQAWSATPLDTQAYFAS 481
Query: 246 YSVRRYG--RSVPA-IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
+ +RY S+P+ + AW +L VY+ T TD + V VA + P++ +
Sbjct: 482 WVTKRYAGISSIPSELYRAWEILRTDVYSNTR--TDIPQ-VPVATYQLRPALSGIA---- 534
Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRA-----LELFIASGNELSASNTYRYD 357
+ T + HP + V++ LE G+ L ++ D
Sbjct: 535 --------------NRTGHFPHPTALHYDPLVLQGVWKLMLEALTRQGS-LWKVPAFQLD 579
Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDAHG-------------VFQLSRRFLELVEDMD 404
+D++RQ L+ + L+ +++ AY+ + G V R L L+ +D
Sbjct: 580 FVDVSRQMLSNQFDVLYADLVNAYKCSTGAGGSRELRSNTPNCDVKAAGARLLFLLSTLD 639
Query: 405 GLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKY 464
L F L W+++A + E + +NAR+Q+T+W Q A+ L DY K
Sbjct: 640 LTLLTSRHFALQSWVDAASAWGKAAGNEDLFTFNARSQVTVW----QVNATNLNDYAAKA 695
Query: 465 WSGLLRDYYGPRAAIYFKYMIESLESG 491
W GL+ YY R +I+ ++ + SG
Sbjct: 696 WGGLVGSYYKGRWSIFVDALVAASSSG 722
>gi|295085513|emb|CBK67036.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
XB1A]
Length = 718
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 148/531 (27%), Positives = 254/531 (47%), Gaps = 73/531 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+ PAF+G VP P + + W + Y+L P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEGFVQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
KE+G ++ Y D+F+E P+D + ++ G IY + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQG 348
Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
W F Y FW +KALL++VP K++++DL + K +WST + FYG +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWSTEQTWKVHDGFYGKKWIF 408
Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+ NF G M G LD A V+A R + ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
+ +D+ W+ Y RYG A+++AW + T Y+ + ++P
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSS-----------LYSYPRFTW 517
Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
+ + + +SK + + + ++A+ L+ + +EL +S
Sbjct: 518 QTVIPDQRR-------ISKIDL----------------SDDYLQAIRLYASCADELKSSE 554
Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
YR DLI+ +A A + ++ N + ++ ++L+ D+D LLA H
Sbjct: 555 LYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPL 614
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
+ L W+E A+ +++ YE NA+ IT W + DY ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667
Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
Y PR +YF + D ++++W +WI T+ W N + PVE+
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSNSTTPFDDPVEA 708
>gi|423217398|ref|ZP_17203894.1| hypothetical protein HMPREF1061_00667 [Bacteroides caccae
CL03T12C61]
gi|392628557|gb|EIY22583.1| hypothetical protein HMPREF1061_00667 [Bacteroides caccae
CL03T12C61]
Length = 707
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 144/496 (29%), Positives = 237/496 (47%), Gaps = 59/496 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAF 64
V P ++G VP + +LG + + +WC L + D F +
Sbjct: 233 VFPGYAGMVPRNIGE--------KLG--YQIADPGKWCGFPRPAFLSSEDEHFDSFAAMY 282
Query: 65 IEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
E+ K YG+ + Y+ D F E NT VD ++ G +I M+ + +AVW++Q W
Sbjct: 283 YEELEKLYGKAKY-YSMDPFHEGGNTEGVD----LAKAGTSIMKAMKKANPEAVWVIQAW 337
Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS-------KQFYGVPYIWCM 175
+ P A+++ + G ++VLDL++E +P W S K F +++CM
Sbjct: 338 QANPRP--------AMIDVLNAGDMLVLDLYSEKRPQWGDSDSMWYREKGFGKHDWLYCM 389
Query: 176 LHNFAGNIEMYGILDSIAFGPVEARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 234
L NF GN+ ++G ++ + G +A N M GVG + EGIE NPV+++L+ E+ ++
Sbjct: 390 LLNFGGNVGLHGRMNQLVNGYYDACAHVNGKRMRGVGATPEGIENNPVMFELLYELPWRA 449
Query: 235 EKVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPS 293
E+ W+ Y RYG + P + +AW L HTVYN P P
Sbjct: 450 ERFSPDVWLQGYLKARYGGELSPEVMEAWRALEHTVYNA---------------PKNSPG 494
Query: 294 IISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 353
EG ++ ++ TS++ + L+YS +A +L ++ + N
Sbjct: 495 -----EGTLESL--LCARPGFHLDRTSTWGYSKLFYSPDSTSKAADLMLSVAEQYKGDNN 547
Query: 354 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 413
+ YDL+D+ RQ+ A N L I ++Y D + +++FLEL+ D LL+ F
Sbjct: 548 FEYDLVDIVRQSNADKGNALLDEISQSYDRKDKENFRKQTQQFLELILSQDSLLSTRKEF 607
Query: 414 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 473
+ WL +A+ L + ++K YEWNA IT+W D+ L DY ++ WSGLL+D Y
Sbjct: 608 SVSSWLAAARSLGNTDAEKKLYEWNASALITVWGDSIASNQGGLHDYSHREWSGLLKDLY 667
Query: 474 GPRAAIYFKYMIESLE 489
R +F+ + LE
Sbjct: 668 YLRWKTFFEQKQQELE 683
>gi|160884066|ref|ZP_02065069.1| hypothetical protein BACOVA_02042 [Bacteroides ovatus ATCC 8483]
gi|423291473|ref|ZP_17270321.1| hypothetical protein HMPREF1069_05364 [Bacteroides ovatus
CL02T12C04]
gi|156110408|gb|EDO12153.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
8483]
gi|392663473|gb|EIY57023.1| hypothetical protein HMPREF1069_05364 [Bacteroides ovatus
CL02T12C04]
Length = 718
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 148/531 (27%), Positives = 255/531 (48%), Gaps = 73/531 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+ PAF+G VP A P + + W + Y+L P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEAFAQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
KE+G ++ Y D+F+E P+D + ++ G IY + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQG 348
Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
W F Y FW +KALL++VP K++++DL + K +W+T + FYG +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408
Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+ NF G M G LD A V+A R + ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
+ +D+ W+ Y RYG A+++AW + T Y+ + ++P
Sbjct: 469 SSDSIDLDDWMTIYCEARYGGYPDAMEEAWKLFRKTAYSS-----------LYSYPRFTW 517
Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
+ + + +SK + + + ++A+ L+ + +EL +S
Sbjct: 518 QTVIPDQRR-------ISKIDL----------------SDDYLQAIRLYASCADELKSSE 554
Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
YR DLI+ +A A + ++ N + ++ ++L+ D+D LLA H
Sbjct: 555 LYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPL 614
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
+ L W+E A+ +++ YE NA+ IT W + DY ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667
Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
Y PR +YF + D ++++W +WI T+ W N + PVE+
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWINSTTPFDDPVEA 708
>gi|110801838|ref|YP_698175.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens SM101]
gi|110682339|gb|ABG85709.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens SM101]
Length = 2095
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 156/528 (29%), Positives = 255/528 (48%), Gaps = 51/528 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL +SG VP + P A+ G W P TY+ + F + F E+
Sbjct: 406 VLQGYSGMVPRDFKEKNPEAQTISQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEK 464
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + +G ++ Y D F E NT +D+ + I + M D+DAVW++Q W +
Sbjct: 465 QKEVFGDVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 520
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
P L + +VLDLF+EV P W+ ++ +P+IW MLHNF G + M
Sbjct: 521 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 572
Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
+ +A P SE+ MVG+G++ E I NP+ Y+L+ +MA+ ++++ + W
Sbjct: 573 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTE 630
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
Y RRYG++ I +AWN++ T Y K D + SII+ G
Sbjct: 631 DYIERRYGKTNEEILEAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 676
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+G +KS S++ H + Y SE +A+E+F + +E S+ + YD D+ +Q
Sbjct: 677 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFSKNYDEFKDSDAFLYDFADILKQ 727
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
LA A E + + AY DA +S +FLEL++ + +L+ FL+G W+E A+
Sbjct: 728 LLANSAQEYYEVMCNAYNNRDAEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 787
Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
+ ++ + + +E+NAR +T W + L+DY N+ WSGL DYY R +
Sbjct: 788 MLKDADDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTGDYYYARWEKWIN 847
Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
+ L+ G K +W K+ DW N ++ +YP E++ + L
Sbjct: 848 GLQIELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 891
>gi|293371915|ref|ZP_06618319.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
gi|292633161|gb|EFF51738.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
Length = 718
Score = 215 bits (548), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 150/531 (28%), Positives = 253/531 (47%), Gaps = 73/531 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+ PAF+G VP P + + W + Y+L P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEGFVQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
KE+G ++ Y D+F+E P+D + ++ G IY + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQG 348
Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
W F Y FW +KALL++VP K++++DL + K +W+T + FYG +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408
Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+ NF G M G LD A V+A R + ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
+ +D+ W+ Y RYG A+++AW + T Y+
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYS--------------------- 507
Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
S+ S +Q +S + + S D+ ++A+ L+ + +EL +S
Sbjct: 508 SLYSYPRFTWQTV---ISDQRRISKIDLSDDY----------LQAIRLYASCADELKSSE 554
Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
YR DLI+ +A A + ++ N + ++ ++L+ D+D LLA H
Sbjct: 555 LYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPL 614
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
+ L W+E A+ +++ YE NA+ IT W + DY ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667
Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
Y PR +YF + D ++++W +WI T+ W N + PVE+
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSNSTTPFDDPVEA 708
>gi|168209163|ref|ZP_02634788.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens B str. ATCC 3626]
gi|170712640|gb|EDT24822.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens B str. ATCC 3626]
Length = 2104
Score = 215 bits (548), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 155/528 (29%), Positives = 253/528 (47%), Gaps = 51/528 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL +SG VP + P A+ G W P TY+ + F + F E+
Sbjct: 415 VLQGYSGMVPRDFKEKNPEAQTISQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEK 473
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + +G ++ Y D F E NT +D+ + I + M D+DAVW++Q W +
Sbjct: 474 QKEVFGDVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 529
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
P L + +VLDLF+EV P W+ ++ +P+IW MLHNF G + M
Sbjct: 530 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 581
Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
+ +A P SE+ MVG+G++ E I NP+ Y+L+ +MA+ ++++ + W
Sbjct: 582 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTE 639
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
Y RRYG++ I DAWN++ T Y K D + SII+ G
Sbjct: 640 DYIERRYGKTNKEILDAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 685
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+G +KS S++ H + Y SE +A+E+F + +E S+ + YD D+ +Q
Sbjct: 686 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQ 736
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
LA A E + + AY + +S +FLEL++ + +L+ FL+G W+E A+
Sbjct: 737 LLANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 796
Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
+ ++ + + +E+NAR +T W L+DY N+ WSGL DYY R +
Sbjct: 797 MLKDADDWTKDLFEFNARALVTTWGSRNNANGGGLKDYSNRQWSGLTEDYYYARWEKWIN 856
Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
+ L+ G K +W K+ DW N ++ +YP E++ + L
Sbjct: 857 GLQTELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 900
>gi|336404352|ref|ZP_08585050.1| hypothetical protein HMPREF0127_02363 [Bacteroides sp. 1_1_30]
gi|335943680|gb|EGN05519.1| hypothetical protein HMPREF0127_02363 [Bacteroides sp. 1_1_30]
Length = 718
Score = 215 bits (548), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 150/531 (28%), Positives = 253/531 (47%), Gaps = 73/531 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+ PAF+G VP P + + W + Y+L P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEGFVQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
KE+G ++ Y D+F+E P+D + ++ G IY + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQG 348
Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
W F Y FW +KALL++VP K++++DL + K +W+T + FYG +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408
Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+ NF G M G LD A V+A R + ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
+ +D+ W+ Y RYG A+++AW + T Y+
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYS--------------------- 507
Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
S+ S +Q +S + + S D+ ++A+ L+ + +EL +S
Sbjct: 508 SLYSYPRFTWQTV---ISDQRRISKIDLSDDY----------LQAIRLYASCADELKSSE 554
Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
YR DLI+ +A A + ++ N + ++ ++L+ D+D LLA H
Sbjct: 555 LYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPL 614
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
+ L W+E A+ +++ YE NA+ IT W + DY ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667
Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
Y PR +YF + D ++++W +WI T+ W N + PVE+
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSNSTTPFDDPVEA 708
>gi|170292392|pdb|2VC9|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
In Complex With 2-Acetamido-1,2-Dideoxynojirmycin
gi|170292393|pdb|2VCA|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
In Complex With Beta-N-Acetyl-D-Glucosamine
gi|170292394|pdb|2VCB|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
In Complex With Pugnac
gi|170292395|pdb|2VCC|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
Length = 891
Score = 215 bits (548), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 150/527 (28%), Positives = 253/527 (48%), Gaps = 49/527 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL +SG VP + A+ G W P TY+ + F ++ F E+
Sbjct: 381 VLQGYSGMVPRDFKEKNQEAQTISQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEK 439
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + +G ++ Y D F E NT +D+ + I + M D+DAVW++Q W +
Sbjct: 440 QKEVFGDVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 495
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
P L + +VLDLF+EV P W+ ++ +P+IW MLHNF G + M
Sbjct: 496 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 547
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
+ +A + + + MVG+G++ E I NP+ Y+L+ +MA+ ++++ + W
Sbjct: 548 DAAPEKLA-TEIPKALANSEHMVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTED 606
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
Y RRYG++ I +AWN++ T Y K D + SII+ G +
Sbjct: 607 YIERRYGKTNKEILEAWNIILDTAYK-------KRNDY---YQGAAESIINARPG----F 652
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
G +KS S++ H + Y SE +A+E+F + +E S+ + YD D+ +Q
Sbjct: 653 G--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQL 703
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
LA A E + + AY + +S +FLEL++ + +L+ FL+G W+E A+ +
Sbjct: 704 LANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTM 763
Query: 426 AQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
++ + + +E+NAR +T W + L+DY N+ WSGL DYY R +
Sbjct: 764 LKDSDDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWING 823
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
+ L+ G K +W K+ DW N ++ +YP E++ + L
Sbjct: 824 LQAELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 866
>gi|18309848|ref|NP_561782.1| alpha-N-acetylglucosaminidase [Clostridium perfringens str. 13]
gi|18144526|dbj|BAB80572.1| probable alpha-N-acetylglucosaminidase [Clostridium perfringens
str. 13]
gi|288872041|dbj|BAI70446.1| alpha-N-acetylglucosaminidase [Clostridium perfringens]
Length = 2104
Score = 215 bits (547), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 154/528 (29%), Positives = 254/528 (48%), Gaps = 51/528 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL +SG VP + P A+ G W P TY+ + F + F E+
Sbjct: 415 VLQGYSGMVPRDFKEKNPEAQTISQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEK 473
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + +G ++ Y D F E NT +D+ + I + M D+DAVW++Q W +
Sbjct: 474 QKEVFGDVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 529
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
P L + +VLDLF+EV P W+ ++ +P+IW MLHNF G + M
Sbjct: 530 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 581
Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
+ +A P SE+ MVG+G++ E I NP+ ++L+ +MA+ ++++ + W
Sbjct: 582 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAHELLFDMAWTRDQINFRTWTE 639
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
Y RRYG++ I DAWN++ T Y K D + SII+ G
Sbjct: 640 DYIERRYGKTNKEILDAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 685
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+G +KS S++ H + Y SE +A+E+F + +E S+ + YD D+ +Q
Sbjct: 686 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQ 736
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
LA A E + + AY + +S +FLEL++ + +L+ FL+G W+E A+
Sbjct: 737 LLANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 796
Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
+ ++ + + +E+NAR +T W + L+DY N+ WSGL DYY R +
Sbjct: 797 MLKDSDDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWIN 856
Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
+ L+ G K +W K+ DW N ++ +YP E++ + L
Sbjct: 857 GLQAELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 900
>gi|237721435|ref|ZP_04551916.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|293370838|ref|ZP_06617383.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
gi|229449231|gb|EEO55022.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|292634054|gb|EFF52598.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
Length = 711
Score = 215 bits (547), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 141/480 (29%), Positives = 236/480 (49%), Gaps = 59/480 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAF 64
V P ++G VP + +LG + + +WC L D F +
Sbjct: 233 VFPGYAGMVPRNIGE--------KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMY 282
Query: 65 IEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
E+ K YG+ ++ Y+ D F E NT VD ++ GA+I + M+ + +AVW++Q W
Sbjct: 283 YEELEKLYGKANY-YSMDPFHEGGNTEGVD----LAKTGASIMAAMKKANPEAVWIIQAW 337
Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCM 175
+ + ++ S+ G L+VLDL++E +P +W K F +++CM
Sbjct: 338 --------QANPREEMIASLNQGDLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCM 389
Query: 176 LHNFAGNIEMYGILDSIAFGPVEARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQH 234
L NF GN+ ++G ++ + G +A N M+ GVG + EGIE NPV+++L+ E+ ++
Sbjct: 390 LLNFGGNVGLHGRMNQLVNGYYDACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWRE 449
Query: 235 EKVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPS 293
E+ W+ Y RYGR V P I +AW L +TVYN D + + S
Sbjct: 450 ERFSSDEWLQTYLKARYGREVSPEIMEAWRALEYTVYNA---PKDYQGEGTIE------S 500
Query: 294 IISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 353
++ G + + TS++ + L+Y+ +A LF + ++ +N
Sbjct: 501 LLCARPGFHLD-------------RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNN 547
Query: 354 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 413
+ YDL+D+ RQ+ A N L I ++Y D + +++FL+L+ D LL+ F
Sbjct: 548 FEYDLVDIVRQSNADKGNVLLEEISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEF 607
Query: 414 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 473
+ WL +A+ L EE+++ YEWNA IT+W D+ L DY ++ WSGLL+D Y
Sbjct: 608 SVSSWLNAARSLGTTEEEKRLYEWNASALITVWGDSIAANQGGLHDYSHREWSGLLKDLY 667
>gi|237719039|ref|ZP_04549520.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|229451817|gb|EEO57608.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
Length = 718
Score = 215 bits (547), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 146/531 (27%), Positives = 254/531 (47%), Gaps = 73/531 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+ PAF+G VP P + + W + Y+L P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEGFVQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
KE+G ++ Y D+F+E P+D + ++ G IY + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQG 348
Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
W F Y FW +KALL++VP K++++DL + K +W+T + FYG +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408
Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+ NF G M G LD A V+A R + ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
+ +D+ W+ Y RYG A+++AW + T Y+ + ++P
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSS-----------LYSYPRFTW 517
Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
+ + + +SK + + + ++A+ L+++ +EL S
Sbjct: 518 QTVIPDQRR-------ISKIDL----------------SDDYLQAIRLYVSCADELKGSE 554
Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
YR DLI+ +A A + ++ N + ++ ++L+ D+D LLA H
Sbjct: 555 LYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDKLLASHPL 614
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
+ L W+E A+ +++ YE NA+ IT W + DY ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667
Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
Y PR +YF + D ++++W +WI T+ W N + PV++
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSNSTTPFDDPVKA 708
>gi|373460171|ref|ZP_09551927.1| hypothetical protein HMPREF9944_00191 [Prevotella maculosa OT 289]
gi|371956556|gb|EHO74342.1| hypothetical protein HMPREF9944_00191 [Prevotella maculosa OT 289]
Length = 742
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 142/492 (28%), Positives = 230/492 (46%), Gaps = 52/492 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP F G +P + +T G W L TD F I + +
Sbjct: 223 VLPGFCGMMPHDAKEKL-GLNVTDGGKWNGYTRPAN------LSPTDSQFNRIADLYYAE 275
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+ YG+ ++ Y+ D F E+ D+ +Y G+ + M+ + A W++QGW + +
Sbjct: 276 LTRLYGKANY-YSMDPFHESNDD-DALDY-GKAGSVMLEAMKRINPKATWVIQGW--TEN 330
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAG 181
P RP ++ + N G L++LDLF+E +P +W K + +++CML NF
Sbjct: 331 P--RPRMIQDMKN----GDLLILDLFSECRPMFGIPSVWKREKGYEQHDWLFCMLENFGA 384
Query: 182 NIEMYGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDV 239
N+ ++G +D + F + R+ + G+G +MEG E NPV+++LMSE+ ++ E
Sbjct: 385 NVGLHGRMDQLIHNFYSTKKRSPNTQHLKGIGFTMEGSENNPVMFELMSELPWRPEIFKK 444
Query: 240 KAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISV 297
+ W+ Y RYGR I+ AW +L T+YNC G + + V P ++ +
Sbjct: 445 EDWVRGYVKARYGRKDETIERAWLLLAETIYNCPAGNNQQGPHESVFCGRPGLNNFQVK- 503
Query: 298 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 357
+ K +NY Y + A L + + +N + YD
Sbjct: 504 SWSKMRNY-----------------------YDPQATLEAARLMASVSSRYKGNNNFEYD 540
Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
LID+ RQALA +L I Y + ++RFL+++ D LL F LG
Sbjct: 541 LIDICRQALADQGRLQYLKTIADYNGFSRAAFAKDAKRFLDMILLQDRLLGTRKEFRLGH 600
Query: 418 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 477
W E+A+ L + ++ YEWNAR QIT W + T + LRDY +K W G+L+D+Y R
Sbjct: 601 WTEAARSLGTTQAEKDLYEWNARVQITTWGNRTCADNGGLRDYAHKEWQGILKDFYYKRW 660
Query: 478 AIYFKYMIESLE 489
IY + + +E
Sbjct: 661 KIYMDALAKQME 672
>gi|153807690|ref|ZP_01960358.1| hypothetical protein BACCAC_01972 [Bacteroides caccae ATCC 43185]
gi|149130052|gb|EDM21264.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
43185]
Length = 707
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 144/503 (28%), Positives = 240/503 (47%), Gaps = 73/503 (14%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAF 64
V P ++G VP + +LG + + +WC L + D F +
Sbjct: 233 VFPGYAGMVPRNIGE--------KLG--YQIADPGKWCGFPRPAFLSSEDEHFDSFAAMY 282
Query: 65 IEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
E+ K YG+ + Y+ D F E NT VD ++ G +I M+ + +AVW++Q W
Sbjct: 283 YEELEKLYGKAKY-YSMDPFHEGGNTEGVD----LAKAGTSIMKAMKKANPEAVWVIQAW 337
Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCM 175
+ P A+++ + G ++VLDL++E P +W K F +++CM
Sbjct: 338 QANPRP--------AMVDVLNAGDMLVLDLYSERLPQWGDPDSMWYREKGFGKHDWLYCM 389
Query: 176 LHNFAGNIEMYGILDSIAFGPVEARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 234
L NF GN+ ++G ++ + G +A T N T+ GVG + EGIE NPV+++L+ E+ ++
Sbjct: 390 LLNFGGNVGLHGRMNQLVNGYYDACTHANGKTLRGVGTTPEGIENNPVMFELLYELPWRA 449
Query: 235 EKVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPS 293
E+ W+ Y RYG + P + +AW L HTVYN
Sbjct: 450 ERFSPDTWLQGYLKARYGGELSPEVMEAWRALEHTVYNAP-------------------- 489
Query: 294 IISVTEGKYQNYGKPVSKEAVLKS-------ETSSYDHPHLWYSTSEVIRALELFIASGN 346
+NY + E++L + TS++ + L+YS +A +L ++
Sbjct: 490 ---------KNYQGEGTVESLLCARPGFHLDRTSTWGYSKLFYSPDSTSKAADLMLSVAE 540
Query: 347 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGL 406
+ +N + YDL+D+ RQ+ A N L I ++Y D + +++FLEL+ D L
Sbjct: 541 QYKGNNNFEYDLVDIVRQSNADKGNALLDEISQSYDRKDKENFRKQTQQFLELILSQDSL 600
Query: 407 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 466
L+ F + WL +A+ L + ++K YEWNA IT+W D+ L DY ++ WS
Sbjct: 601 LSTRKEFSVSSWLTAARSLGNTDAEKKLYEWNASALITVWGDSIASNQGGLHDYSHREWS 660
Query: 467 GLLRDYYGPRAAIYFKYMIESLE 489
GLL+D Y R +F+ + LE
Sbjct: 661 GLLKDLYYLRWKTFFEQKQQELE 683
>gi|299144719|ref|ZP_07037787.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
gi|298515210|gb|EFI39091.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
Length = 718
Score = 214 bits (546), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 147/531 (27%), Positives = 254/531 (47%), Gaps = 73/531 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+ PAF+G VP P + + W + Y+L P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEGFVQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
KE+G ++ Y D+F+E P+D + ++ G IY + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKADKEAKYKLLAEYGETIYKSITAGNPDAVWVTQG 348
Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
W F Y FW +KALL++VP K++++DL + K +W+T + FYG +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408
Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+ NF G M G LD A V+A R + ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
+ +D+ W+ Y RYG A+++AW + T Y+ + ++P
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSS-----------LYSYPRFTW 517
Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
+ + + +SK + + + ++A+ L+ + +EL +S
Sbjct: 518 QTVVPDQRR-------ISKIDL----------------SDDYLQAIRLYASCADELKSSE 554
Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
YR DLI+ +A A + ++ N + ++ ++L+ D+D LLA H
Sbjct: 555 LYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPL 614
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
+ L W+E A+ +++ YE NA+ IT W + DY ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667
Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
Y PR +YF + D ++++W +WI T+ W N + PVE+
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSNSTTPFDDPVEA 708
>gi|298480124|ref|ZP_06998323.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
gi|298273933|gb|EFI15495.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
Length = 718
Score = 214 bits (546), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 147/531 (27%), Positives = 254/531 (47%), Gaps = 73/531 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+ PAF+G VP P + + W + Y+L P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEGFVQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
KE+G ++ Y D+F+E P+D + ++ G IY + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLAGYGETIYKSIAAGNPDAVWVTQG 348
Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
W F Y FW +KALL++VP K++++DL + K +W+T + FYG +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408
Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+ NF G M G LD A V+A R + ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
+ +D+ W+ Y RYG A+++AW + T Y+ + ++P
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSS-----------LYSYPRFTW 517
Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
+ + + +SK + + + ++A+ L+ + +EL +S
Sbjct: 518 QTVIPDQRR-------ISKIDL----------------SDDYLQAIRLYASCADELKSSE 554
Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
YR DLI+ +A A + ++ N + ++ ++L+ D+D LLA H
Sbjct: 555 LYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPL 614
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
+ L W+E A+ +++ YE NA+ IT W + DY ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667
Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
Y PR +YF + D ++++W +WI T+ W N + PVE+
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSNSTTPFDDPVEA 708
>gi|198277542|ref|ZP_03210073.1| hypothetical protein BACPLE_03764 [Bacteroides plebeius DSM 17135]
gi|198270040|gb|EDY94310.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides plebeius DSM
17135]
Length = 722
Score = 214 bits (545), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 152/547 (27%), Positives = 255/547 (46%), Gaps = 76/547 (13%)
Query: 8 VLPAFSGNVPA-ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
V P +SG VP A + + + L N F+ + L TD F EI + +
Sbjct: 228 VFPGYSGMVPHDANKKLGLNVTEPALWNGFTRPA--------FLLPTDSRFNEIASLYYK 279
Query: 67 QQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 124
+ K +G+ ++ Y+ D F E + VD + G A+ M++ + A W++QGW
Sbjct: 280 ELEKLFGKANY-YSMDPFHELEDAGSVD----FDAAGKAVLKAMKNVNPKATWVIQGW-- 332
Query: 125 SYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW----------- 173
+ +P RP +K L N G +++LDLF+E +P+W G+P IW
Sbjct: 333 TENP--RPEMIKNLNN----GDILILDLFSECRPMW-------GIPSIWKREKGYEQHDW 379
Query: 174 --CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEM 230
CM+ NF GN+ ++G +D + + + + G+G++MEG E NPV+++LM E+
Sbjct: 380 LFCMIENFGGNVGLHGRMDQLLNNFYLTKNNPLAAHLKGIGLTMEGSENNPVMFELMCEL 439
Query: 231 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFP 288
++ EK + W+ Y RYG I AW++L +YNC G + + + P
Sbjct: 440 PWRPEKFTKEEWLKDYLFARYGVRDEKITQAWSILADGIYNCPFGNNQQGPHESIFCGRP 499
Query: 289 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 348
++ + + + K QNY P S EA A L + ++
Sbjct: 500 GLN-NFQASSWSKMQNYYDPTSTEA-----------------------AARLMLEVADKY 535
Query: 349 SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLA 408
+N + YDL+D+ RQ+L+ ++ I ++ D S+ FL ++ D LL
Sbjct: 536 KGNNNFEYDLVDIVRQSLSDRGRIVYNQTIADFKSFDKKSFATHSQEFLNILLAQDRLLG 595
Query: 409 CHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGL 468
F +G W+E A+ L E++ YEWNAR QIT W + LRDY +K W+GL
Sbjct: 596 TRSEFRVGRWIEQARNLGTTPEEKDLYEWNARVQITTWGNRVCANDGGLRDYAHKEWNGL 655
Query: 469 LRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALIT 528
L+D+Y R A Y++ + + L+ L ++ + W N Y + GD +
Sbjct: 656 LKDFYYKRWAAYWQTLQDVLDGKPMVEL-----DYYAMEEPWTLAHNPYASQPEGDCVSV 710
Query: 529 SQWLYNK 535
++ ++NK
Sbjct: 711 AKEVFNK 717
>gi|322699924|gb|EFY91682.1| alpha-N-acetylglucosaminidase, putative [Metarhizium acridum CQMa
102]
Length = 775
Score = 214 bits (545), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 151/531 (28%), Positives = 260/531 (48%), Gaps = 51/531 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF G VP +++ V P +T NWF+ D ++ LD D + E+ R F+ +
Sbjct: 246 VLPAFPGFVPESIKKVRPDVNLTVSPNWFAPAPD-KYTRDLFLDPLDDTYAELQRLFVSK 304
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSY 126
Q+ +G ++IY D F+E +P Y+ + Y+G+ + + AVWL+QGWL FS
Sbjct: 305 QMDAFGNVTNIYTLDQFNELSPASGDTAYLRGIARNTYAGLTAANPAAVWLLQGWLFFSS 364
Query: 127 DPFWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
FW P++ A L V + ++VLDL++E P W + + G P+IWC LH+F GN+ +
Sbjct: 365 RRFWTQPRIDAYLGGVEDDQGMLVLDLYSEANPQWQRTNSYSGKPWIWCQLHDFGGNMAL 424
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
G + ++ P++A +++ ++VG G++ E E N VVYD++ + A+ +D + +
Sbjct: 425 EGRVQTLTSAPIDA-LAQSESLVGFGLTPEAYEGNEVVYDILLDQAWSATPLDTQTYFAS 483
Query: 246 YSVRRYG--RSVPA-IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
+ +RY S+P+ + AW +L VY+ T TD + V VA + P++ +
Sbjct: 484 WVTKRYAGVSSIPSELYRAWEMLRTDVYSNTR--TDIPQ-VPVATYQLRPALSGIA---- 536
Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIR-----ALELFIASGNELSASNTYRYD 357
+ T + HP + V++ LE G+ L ++ D
Sbjct: 537 --------------NRTGHFPHPTALHYDPLVLQEAWKLMLEAMTRQGS-LWKVPAFQLD 581
Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL------------SRRFLELVEDMDG 405
+D++RQ L+ + L+ +++ AY+ + A G +L R L L+ +D
Sbjct: 582 FVDVSRQMLSNQFDVLYADLVNAYKCSAAGGSRELRSSAPSCDVEAAGARLLSLLSTLDL 641
Query: 406 LLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYW 465
L F L W+++A + E + +NAR+Q+T+W Q +A+ L DY K W
Sbjct: 642 TLLTSRHFTLQSWVDAAGSWGKAAGNEDLFTFNARSQVTVW----QVDATNLNDYAAKAW 697
Query: 466 SGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNV 516
GL+ YY R +I+ ++ + SG RE +WQ G +
Sbjct: 698 GGLVGSYYKGRWSIFVDALVAASNSGS-LDEGALTRELEAFEAEWQAGEHA 747
>gi|423293381|ref|ZP_17271508.1| hypothetical protein HMPREF1070_00173 [Bacteroides ovatus
CL03T12C18]
gi|392678324|gb|EIY71732.1| hypothetical protein HMPREF1070_00173 [Bacteroides ovatus
CL03T12C18]
Length = 718
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 147/531 (27%), Positives = 254/531 (47%), Gaps = 73/531 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+ PAF+G VP P + + W + Y+L P F EIG+ F+E+
Sbjct: 234 ITPAFAGFVPEGFVQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
KE+G ++ Y D+F+E P+D + ++ G IY + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQG 348
Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
W F Y FW +KALL++VP K++++DL + K +W+T + FYG +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408
Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+ NF G M G LD A V+A R + ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
+ +D+ W+ Y RYG A+++AW + T Y+ + ++P
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSS-----------LYSYPRFTW 517
Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
+ + + +SK + + + ++A+ L+ + +EL +S
Sbjct: 518 QTVIPDQRR-------ISKIDL----------------SDDYLQAIRLYASCADELKSSE 554
Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
YR DLI+ +A A + ++ N + ++ ++L+ D+D LLA H
Sbjct: 555 LYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPL 614
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
+ L W+E A+ +++ YE NA+ IT W + DY ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667
Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
Y PR +YF + D ++++W +WI T+ W N + PVE+
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWINSTTPFDDPVEA 708
>gi|383115207|ref|ZP_09935965.1| hypothetical protein BSGG_2911 [Bacteroides sp. D2]
gi|313695376|gb|EFS32211.1| hypothetical protein BSGG_2911 [Bacteroides sp. D2]
Length = 718
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 146/531 (27%), Positives = 253/531 (47%), Gaps = 73/531 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+ PAF+G VP P + + W + Y+L P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEGFVQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
KE+G ++ Y D+F+E P+D + ++ G IY + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQG 348
Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
W F Y FW +KALL++VP K++++DL + K +W+T + FYG +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408
Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+ NF G M G LD A V+A R + ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
+ +D+ W+ Y RYG A+++AW + T Y+ + ++P
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSS-----------LYSYPRFTW 517
Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
+ + + +SK + + + ++A+ L+ + +EL S
Sbjct: 518 QTVVPDQRR-------ISKIDL----------------SDDYLQAIRLYASCADELKGSE 554
Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
YR DLI+ +A A + ++ N + ++ ++L+ D+D LLA H
Sbjct: 555 LYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPL 614
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
+ L W+E A+ +++ YE NA+ IT W + DY ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667
Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
Y PR +YF + D ++++W +WI T+ W N + PV++
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSNSTTPFDDPVKA 708
>gi|194695302|gb|ACF81735.1| unknown [Zea mays]
Length = 173
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 100/152 (65%), Positives = 119/152 (78%)
Query: 392 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 451
L + FL LV D+D LL+ H+GFLLGPWLESAK LA+N EQE QYEWNARTQITMWFDNT+
Sbjct: 7 LCQHFLSLVNDLDTLLSSHEGFLLGPWLESAKGLARNSEQEIQYEWNARTQITMWFDNTE 66
Query: 452 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQ 511
+ASLLRDY NKYWSGLL+DYYGPRAAIYFK+++ S+E+ F LK+WRREWI LTN+WQ
Sbjct: 67 TKASLLRDYANKYWSGLLQDYYGPRAAIYFKHLLLSMENNAPFALKEWRREWISLTNNWQ 126
Query: 512 NGRNVYPVESNGDALITSQWLYNKYLQGTGVF 543
+ R V+ + GD L SQ LY KYL +
Sbjct: 127 SDRKVFSTTATGDPLNISQSLYTKYLSNADLL 158
>gi|423212382|ref|ZP_17198911.1| hypothetical protein HMPREF1074_00443 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694828|gb|EIY88054.1| hypothetical protein HMPREF1074_00443 [Bacteroides xylanisolvens
CL03T12C04]
Length = 705
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 145/490 (29%), Positives = 233/490 (47%), Gaps = 61/490 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V P ++G VP + +I G W S PR L D F + E+
Sbjct: 233 VFPGYAGMVPRNIGEKL-GYQIADPGKWCSF---PR---PAFLSTEDEHFESFAAMYYEE 285
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
K YG+ ++ Y+ D F E NT VD ++ GA+I + M+ + AVW++Q W
Sbjct: 286 LEKLYGKANY-YSMDPFHEGGNTEGVD----LAKTGASIMAAMKKANPKAVWVIQAW--- 337
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS-------TSKQFYGVPYIWCMLHN 178
+ + +++S+ G ++VLDL++E P W K F +++CML N
Sbjct: 338 -----QANPREEMISSLNQGDMLVLDLYSERLPQWGDPDSKWYREKGFGKHDWLYCMLLN 392
Query: 179 FAGNIEMYGILDSIAFGPVEARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 237
F N+ ++G +D + G +A N T+ GVG + EGIE NPV+++L+ E+ ++ E+
Sbjct: 393 FGANVGLHGRMDLLVNGYYDACAHANGKTLRGVGATPEGIENNPVMFELLYELPWREERF 452
Query: 238 DVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 296
W+ Y RYG+ V P + +AW L HTVYN RD
Sbjct: 453 SPDEWLQGYLKARYGKDVSPEVMEAWRALEHTVYNAP-------RD-------------- 491
Query: 297 VTEGKYQNYGKPVS----KEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
YQ G S + TS++ + L+YS +A L + + SN
Sbjct: 492 -----YQGEGTVESLLCARPGFHLDRTSTWGYAKLFYSPDSTAKAARLLTSVAKQYEGSN 546
Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
+ YDL+D+ RQ+ A N L +I ++Y D + +++FL+L+ D LL+
Sbjct: 547 NFEYDLVDIVRQSNADKGNVLLEDISQSYDRKDKENFRKQTQQFLDLIVSQDSLLSTRKE 606
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
F + WL++A+ L + ++K YEWNA IT+W D+ L DY ++ WSG+L+D
Sbjct: 607 FSVSTWLDAARSLGTTDAEKKLYEWNASALITVWGDSIASNQGGLHDYSHREWSGILKDL 666
Query: 473 YGPRAAIYFK 482
Y R +F+
Sbjct: 667 YYQRWKAFFE 676
>gi|383280354|pdb|4A4A|A Chain A, Cpgh89 (E483q, E601q), From Clostridium Perfringens, In
Complex With Its Substrate Glcnac-Alpha-1,4-Galactose
Length = 914
Score = 213 bits (541), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 148/527 (28%), Positives = 253/527 (48%), Gaps = 49/527 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL +SG VP + A+ G W P TY+ + F ++ F E+
Sbjct: 404 VLQGYSGMVPRDFKEKNQEAQTISQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEK 462
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + +G ++ Y D F + NT +D+ + I + M D+DAVW++Q W +
Sbjct: 463 QKEVFGDVTNFYGVDPFHQGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 518
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
P L + +VLDLF+EV P W+ ++ +P+IW MLHNF G + M
Sbjct: 519 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 570
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
+ +A + + + MVG+G++ + I NP+ Y+L+ +MA+ ++++ + W
Sbjct: 571 DAAPEKLA-TEIPKALANSEHMVGIGITPQAINTNPLAYELLFDMAWTRDQINFRTWTED 629
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
Y RRYG++ I +AWN++ T Y K D + SII+ G +
Sbjct: 630 YIERRYGKTNKEILEAWNIILDTAYK-------KRNDY---YQGAAESIINARPG----F 675
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
G +KS S++ H + Y SE +A+E+F + +E S+ + YD D+ +Q
Sbjct: 676 G--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQL 726
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
LA A E + + AY + +S +FLEL++ + +L+ FL+G W+E A+ +
Sbjct: 727 LANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTM 786
Query: 426 AQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
++ + + +E+NAR +T W + L+DY N+ WSGL DYY R +
Sbjct: 787 LKDSDDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWING 846
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
+ L+ G K +W K+ DW N ++ +YP E++ + L
Sbjct: 847 LQAELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 889
>gi|260910505|ref|ZP_05917173.1| periplasmic beta-glucosidase [Prevotella sp. oral taxon 472 str.
F0295]
gi|260635347|gb|EEX53369.1| periplasmic beta-glucosidase [Prevotella sp. oral taxon 472 str.
F0295]
Length = 1566
Score = 212 bits (540), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 149/522 (28%), Positives = 249/522 (47%), Gaps = 50/522 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V+ F G VP + FP A+I + G W S + LL D +F ++ A+ +
Sbjct: 236 VVQGFPGIVPTFFKERFPQARIIEQGKWGSFQRP-----AVLLPNNDGVFEKVAEAYYQS 290
Query: 68 QLKEYGRTSHIYNCDTFDEN--TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
K +G D F E T VD + S+ A + M A W++QGW +
Sbjct: 291 LTKLFGTDFEFLGGDLFHEGGITTGVD----VGSVAAQVQRQMLRFFPRAKWVLQGW--N 344
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
+P PQ+ +L+ ++++L E+ W +S +F G P++W +++F G +M
Sbjct: 345 KNP---SPQLLRVLDK---RHTLLVNLSGEIAASWESSDEFGGTPWLWGSVNHFGGKTDM 398
Query: 186 YGILDSIAFGPVEART-SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
G L I P A + ++ M G+G+ EGI NPVVYDL + A+ DV + +
Sbjct: 399 GGQLPVIVTEPHRALALTVDSVMQGIGILPEGIGTNPVVYDLALKTAWHTATPDVDSMLV 458
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
QY RYG P + AW ++ +VY EG +++
Sbjct: 459 QYLGYRYGEVHPDLLAAWRIMLKSVYG---------------------EFAIKGEGTFES 497
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
++ ++ + S++ + Y +++ RAL LF+ + +L S TY+YDL+DL RQ
Sbjct: 498 VF--CARPSLRVTSVSTWGPKQMQYQPADLYRALGLFLKAAPKLRDSETYQYDLVDLARQ 555
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
+LA YA + ++++AY+ +A + Q ++RF L+ D LL + FLLG WL+ A Q
Sbjct: 556 SLANYARTAYADVVKAYEAKNAEQLQQATQRFERLIVLQDSLLLTNRHFLLGNWLQQATQ 615
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
A NE + NA+T I+ W E + + DY NK W+G+L YY PR +F+ +
Sbjct: 616 YAPNEADRQLCLHNAQTLISYW--GPDEPTTKVHDYANKEWAGMLSTYYLPRWQAFFRVL 673
Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL 526
S+ +G+ + ++ ++ W N + GDA+
Sbjct: 674 QASINTGNPPAI-----DFFEMEKRWANTPQPINTKPQGDAV 710
>gi|110800516|ref|YP_695309.1| alpha-N-acetylglucosaminidase [Clostridium perfringens ATCC 13124]
gi|110675163|gb|ABG84150.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens ATCC 13124]
Length = 2095
Score = 212 bits (540), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 153/528 (28%), Positives = 254/528 (48%), Gaps = 51/528 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL +SG VP + A+ G W P TY+ + F ++ F E+
Sbjct: 406 VLQGYSGMVPRDFKEKNQEAQTISQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEK 464
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + +G ++ Y D F E NT +D+ + I + M D+DAVW++Q W +
Sbjct: 465 QKEVFGDVTNFYGVDPFHEGGNTGDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN 520
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
P L + +VLDLF+EV P W+ ++ +P+IW MLHNF G + M
Sbjct: 521 -------PSNNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGM 572
Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
+ +A P SE+ MVG+G++ E I NP+ Y+L+ +MA+ ++++ + W
Sbjct: 573 DAAPEKLATEIPKALANSEH--MVGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTE 630
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
Y RRYG++ I +AWN++ T Y K D + SII+ G
Sbjct: 631 DYIERRYGKTNKEILEAWNIILDTAYK-------KRNDY---YQGAAESIINARPG---- 676
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+G +KS S++ H + Y SE +A+E+F + +E S+ + YD D+ +Q
Sbjct: 677 FG--------IKS-ASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQ 727
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
LA A E + + AY + +S +FLEL++ + +L+ FL+G W+E A+
Sbjct: 728 LLANSAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDART 787
Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
+ ++ + + +E+NAR +T W + L+DY N+ WSGL DYY R +
Sbjct: 788 MLKDSDDWTKDLFEFNARALVTTWGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWIN 847
Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRN----VYPVESNGDAL 526
+ L+ G K +W K+ DW N ++ +YP E++ + L
Sbjct: 848 GLQAELDGG----AKAPNIDWFKMEYDWVNKKSDTDKLYPTEASNENL 891
>gi|429740221|ref|ZP_19273923.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
gi|429153946|gb|EKX96707.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
Length = 721
Score = 212 bits (540), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 151/520 (29%), Positives = 243/520 (46%), Gaps = 70/520 (13%)
Query: 2 LLADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIG 61
LL + PAF+G VP + P ++ F + Y+L P F++IG
Sbjct: 228 LLGMHPITPAFAGFVPEGFVKLHPEVRVKH----FEWGGFDKSLNAYMLPPDSPYFLQIG 283
Query: 62 RAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPE-------YISSLGAAIYSGMQSGDSD 114
+ FIE+ KE+ + ++ Y D+F+E PV SP+ +S G AIY + +G+ +
Sbjct: 284 KLFIEEWEKEFSKNTY-YLSDSFNEMELPV-SPDDTDGKHRLLSKYGEAIYQSIVAGNPN 341
Query: 115 AVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAE-------VKPIWSTSKQF 166
AVW+ QGW F Y FW ++ALL VP KL+++DL + + W T K F
Sbjct: 342 AVWITQGWTFGYQHRFWDKESLQALLERVPNDKLIIVDLANDYPKWVWKTEQTWKTHKGF 401
Query: 167 YGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS-ENTTMVGVGMSMEGIEQNPVVYD 225
YG +I + NF G + G L+ A EA + ++G G + EG+E N VVY+
Sbjct: 402 YGKRWILSYVPNFGGKTLLTGDLNLYASCSAEALAHPDKGRLIGFGSAPEGLENNEVVYE 461
Query: 226 LMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIV 285
L+++M +Q++ +D+ W+ +Y RYG A+Q AW L +VY+
Sbjct: 462 LLADMGWQNQPIDLDHWLIEYCRSRYGSCPNAMQKAWKGLCRSVYS-------------- 507
Query: 286 AFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASG 345
S+ S +Q V + + KS+ YD ++ RA+E F+
Sbjct: 508 -------SLYSYPRFTWQT----VIPDTLRKSK---YDFNDTYF------RAVEDFLLCA 547
Query: 346 NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDG 405
+L S YR D + Q + A+ L+ ++A + + QL + ++L+ D
Sbjct: 548 PQLKDSPLYRSDALLFAAQYIGAKADNLYRKALQAKAVGNRARAKQLVDKVIQLLLQADK 607
Query: 406 LLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYW 465
LLA H L W+++A+ A ++ QYE +A+ IT W + +DY +YW
Sbjct: 608 LLASHPTDRLSRWVDAARTAAATPQERMQYEMDAKRLITSW-------GGIQQDYAARYW 660
Query: 466 SGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 505
SGL++ YY PR +YF L +W W+K
Sbjct: 661 SGLIKTYYVPRIKLYF-------AGSKKKELNNWEENWLK 693
>gi|326934230|ref|XP_003213195.1| PREDICTED: hypothetical protein LOC100549752 [Meleagris gallopavo]
Length = 650
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 97/202 (48%), Positives = 137/202 (67%), Gaps = 4/202 (1%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP + VFP T+LGNW D C YLL +P+F IG F+++
Sbjct: 9 VLPAFAGHVPPGVLRVFPRINATRLGNWSHF--DCTLSCAYLLSPEEPMFQVIGTLFLKE 66
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+KE+G T HIY+ DTF+E +P P Y++ + A++ M D +A WLMQGWLF +
Sbjct: 67 LIKEFG-TDHIYSADTFNEMSPLSSDPAYLAGITNAVFRAMTGADPEAQWLMQGWLFQHQ 125
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P FW+PPQ++A+L +VPLG+++VLDLFAE KP++ ++ FYG P+IWCMLHNF GN ++
Sbjct: 126 PAFWQPPQVQAVLRAVPLGRMIVLDLFAESKPVYEWTESFYGQPFIWCMLHNFGGNHGLF 185
Query: 187 GILDSIAFGPVEARTSENTTMV 208
G +++I GP AR N+TMV
Sbjct: 186 GAVEAINRGPFVARRFPNSTMV 207
>gi|336412611|ref|ZP_08592964.1| hypothetical protein HMPREF1017_00072 [Bacteroides ovatus
3_8_47FAA]
gi|335942657|gb|EGN04499.1| hypothetical protein HMPREF1017_00072 [Bacteroides ovatus
3_8_47FAA]
Length = 718
Score = 211 bits (536), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 146/531 (27%), Positives = 253/531 (47%), Gaps = 73/531 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+ PAF+G VP P + + W + Y+L P F EIG+ F+E+
Sbjct: 234 IAPAFAGFVPEGFVQKHPDTQFRHM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
KE+G ++ Y D+F+E P+D + ++ G IY + +G+ DAVW+ QG
Sbjct: 290 WEKEFGENTY-YLSDSFNEMELPIDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQG 348
Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
W F Y FW +KALL++VP K++++DL + K +W+T + FYG +I+
Sbjct: 349 WTFGYQHSFWDKESLKALLSNVPDDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIF 408
Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+ NF G M G LD A V+A R + ++G G + EG+E N VVY+L+++M +
Sbjct: 409 SYVPNFGGKNTMTGDLDMYASSSVKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGW 468
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
+ +D+ W+ Y RYG A+++AW + T Y+ + ++P
Sbjct: 469 SSDSIDLDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSS-----------LYSYPRFTW 517
Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
+ + + +SK + + + ++A+ L+ + +EL S
Sbjct: 518 QTVIPDQRR-------ISKIDL----------------SDDYLQAIRLYASCADELKNSE 554
Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
YR DLI+ +A A + ++ N + ++ ++L+ D+D LLA H
Sbjct: 555 LYRNDLIEFVSYYVAAKAEIFYKQALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPL 614
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
+ L W+E A+ +++ YE NA+ IT W + DY ++WSGL++DY
Sbjct: 615 YRLEEWVELARNSGTTLQEKDAYEANAKRLITSW-------GGIQEDYAARFWSGLIKDY 667
Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
Y PR +YF + D ++++W +WI T+ W N + PV++
Sbjct: 668 YIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSNSTTPFDDPVKA 708
>gi|224026593|ref|ZP_03644959.1| hypothetical protein BACCOPRO_03350 [Bacteroides coprophilus DSM
18228]
gi|224019829|gb|EEF77827.1| hypothetical protein BACCOPRO_03350 [Bacteroides coprophilus DSM
18228]
Length = 635
Score = 211 bits (536), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 141/436 (32%), Positives = 204/436 (46%), Gaps = 45/436 (10%)
Query: 11 AFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK 70
FSG VP L+ +P AKI +W K + LD TD LF IGR F+E++ K
Sbjct: 236 GFSGYVPRELKEKYPEAKIQPQPSWCGFKGAAQ------LDPTDSLFQVIGRDFLEEEKK 289
Query: 71 EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW 130
+G +Y D F E+ PPVD+PEY+S++G +I++ Q D ++W MQ W
Sbjct: 290 LFG-AHGVYAADPFHESRPPVDTPEYLSAVGRSIHTLFQEFDPYSLWAMQAWSL------ 342
Query: 131 RPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 190
R P +KA VP L++LDL + +G P + LHNF G I M+G L
Sbjct: 343 REPIVKA----VPEEHLLILDLNGSK---CTQRNACWGYPVVAGNLHNFGGRINMHGDLP 395
Query: 191 SIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRR 250
+A EA S + + G G+ MEGIEQNP+ Y+L EM Q KV++ W+ +Y++RR
Sbjct: 396 LLAGNQYEAAVSLSPNVCGSGLFMEGIEQNPLYYELAFEMPLQKGKVELDGWLKEYALRR 455
Query: 251 YGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS 310
YG A +L Y T+ + +I A P++ G G P
Sbjct: 456 YGSKWENTHKALLLLLEGPYRPGTNGTELS-SIIAA----RPALHVKKSGPNAGLGIP-- 508
Query: 311 KEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYA 370
YS +I A + L S YR+D++DL RQ +
Sbjct: 509 ------------------YSPWLLIEAQAFMLKDAGILKTSEAYRFDIMDLQRQIMTNLG 550
Query: 371 NELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 430
+ +A++ D G SRR+LEL+ D+D LL F WL A+ EE
Sbjct: 551 QAIHKEAAKAFEAGDEKGFELHSRRYLELLTDVDTLLRTRPEFNFDRWLADARSWGDTEE 610
Query: 431 QEKQYEWNARTQITMW 446
++ Q+E NA +T+W
Sbjct: 611 EKNQFERNATALVTIW 626
>gi|126347839|emb|CAJ89559.1| putative alpha-N-acetylglucosaminidase [Streptomyces ambofaciens
ATCC 23877]
Length = 740
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 140/519 (26%), Positives = 228/519 (43%), Gaps = 44/519 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP + G VP P + G+W + P W LD +F + AF
Sbjct: 250 VLPGYYGTVPPGFTERNPVGPVVPQGDWVGFER-PDW-----LDPRSAVFPRVAAAFYRH 303
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + +G TS +Y D E P + P + A+ +Q+ AVW + GW +
Sbjct: 304 QRELFG-TSTMYKMDLLHEGGRPGNVP--VRDAAQAVMKALQTARPGAVWTLIGWQNN-- 358
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
P + +++++ +L+++D ++ ++G PY + + NF G+ M
Sbjct: 359 -----PSTQ-IIDAIDKRRLLIVDGLSDRYDGLDREATWHGAPYAFGTIPNFGGHTTMGA 412
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
A + RT + + G+ EG NPV Y+L +E+A++ E VD + W +Y+
Sbjct: 413 NTAVWAERFDQWRTKAGSALAGIAYMPEGTGGNPVAYELFTELAWRTEPVDQRKWFAEYA 472
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RRYG + P AW +L Y+ G +++D +
Sbjct: 473 QRRYGGADPHAASAWELLRSGPYSTPSGTWSESQDSLF---------------------- 510
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
++ + + +S+ + Y V RAL + L A++ YR+DL+D+ RQ LA
Sbjct: 511 -TARPRLTATNAASWSPGAMRYDPGTVRRALTELVRVAPALRATDAYRFDLVDVARQVLA 569
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
+ L I AY D + + + +D LLA FLLGPWLE AK +
Sbjct: 570 NRSRTLLPQIKAAYDAEDLPRFRARAAEWKNCLSLLDRLLATDARFLLGPWLEDAKSWGR 629
Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
E + E++AR+ +T W + +A LRDY N+ WSGL+ D+Y R Y + +
Sbjct: 630 TEAERAAAEFDARSILTTWGHRSGSDAGGLRDYANREWSGLVSDFYAMRWTKYLDSLDTA 689
Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL 526
L +G D W L +DW R+ YPV +GD +
Sbjct: 690 LVTGRPPVAID----WFALEDDWNRQRDGYPVRPSGDPV 724
>gi|261880159|ref|ZP_06006586.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
gi|270333130|gb|EFA43916.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
Length = 772
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 140/506 (27%), Positives = 231/506 (45%), Gaps = 65/506 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP + G +P + +T G W L ATDP F I + +
Sbjct: 230 VLPGYCGMMPHDAKAKL-GLDVTDGGTWNGYTRPAN------LSATDPKFDHIADLYYRE 282
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+ YG+ + Y+ D F E+ P D+ + G + + M+ + + W++QGW+ +
Sbjct: 283 LTRLYGKADY-YSMDPFHES--PDDASVDYAEAGRKLLAAMKRANGKSNWVIQGWMENPR 339
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAG 181
P ++ ++P G +++LDLF+E +P IW + + +++CML NF
Sbjct: 340 P--------QMIEALPEGDIIILDLFSECRPMFGAPSIWQRKEGYGRHNWLFCMLENFGA 391
Query: 182 NIEMYGILD------SIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF--- 232
N+ ++G +D +A P + + G+G +MEG E NP++++LMSE+ +
Sbjct: 392 NVGLHGRMDQLVHNFKLAASPSTPYQNARKHLKGIGFTMEGSENNPIMFELMSELVWRAN 451
Query: 233 -----QHEKVDVK-AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVI 284
+ ++ D K W Y RYG P IQ+AW +L ++YNC G + + +
Sbjct: 452 DLVSAERDRRDFKEGWTRNYVKARYGIDNPKIQEAWQLLIGSIYNCPVGNNQQGPHESIF 511
Query: 285 VAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIAS 344
P +D + + K +NY Y + +RA +L +
Sbjct: 512 NGRPSLDNFQVK-SWSKMRNY-----------------------YDPNVTLRAAQLMTSV 547
Query: 345 GNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMD 404
+ +N + YDL+D+ RQA+ A +L I Y+ D S RFL ++ D
Sbjct: 548 ADRYRGNNNFEYDLVDIVRQAMDDQARLQYLRTIADYKGFDRTAFSADSARFLNMLLLQD 607
Query: 405 GLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKY 464
LL F LG +E A+ L+ E++ YEWNAR QIT W + T LRDY +K
Sbjct: 608 KLLGTRQEFRLGTRIEQARSLSTTLEEKNLYEWNARVQITTWGNRTCANEGGLRDYAHKE 667
Query: 465 WSGLLRDYYGPRAAIYFKYMIESLES 490
W GLLRD+Y R Y + + + +
Sbjct: 668 WQGLLRDFYFMRWHTYLDALSKQMTA 693
>gi|187734575|ref|YP_001876687.1| alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
BAA-835]
gi|187424627|gb|ACD03906.1| Alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
BAA-835]
Length = 848
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 147/518 (28%), Positives = 245/518 (47%), Gaps = 74/518 (14%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLG--NWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
+ PAFSG VP + ++P AK+ +LG W P+ + L +PLF++IGR ++
Sbjct: 239 ICPAFSGFVPRGILRLYPEAKLHRLGWGGW------PQKNHAHFLSPEEPLFLKIGRLYM 292
Query: 66 EQQLKEYGRTSHIYNCDTFDENTPPVDSP------EYISSLGAAIYSGMQSGDSDAVWLM 119
++ KE+G+ ++ + D+F+E P + +SSLG IY + S + DAVW+M
Sbjct: 293 QEWQKEFGKNTY-FLADSFNEMELPENKGGVEARNNMLSSLGEQIYRSISSTNPDAVWVM 351
Query: 120 QGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPI-------WSTSKQFYGVPY 171
QGW+F Y W +KALL+ VP K+++LDL A+ W K F+ P+
Sbjct: 352 QGWMFGYQRNIWNADTLKALLSKVPDDKMLLLDLAADYNKTFWRNGMNWDVFKGFFNKPW 411
Query: 172 IWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEM 230
++ ++ N G M G++D A G +EA +S + G+GM+ EGIE N V+Y+L+++
Sbjct: 412 VYSVVPNMGGKCAMTGVMDFYANGHLEALNSSSRGRLSGMGMAPEGIENNDVIYELITDA 471
Query: 231 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY-NCTDGATDKNRDVIVAFPD 289
A+++ + +V+ ++ Y RYG ++++AWN+ T Y N D P
Sbjct: 472 AWRNRQENVEQYLENYCRARYGNYPDSMKEAWNLFRRTAYSNLKD------------HPR 519
Query: 290 VDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELS 349
+ + T G N ++ + ++ L LF+ + L
Sbjct: 520 FNWQMKPGTRGCSVN-------------------------TSEDFLKGLSLFVNT-RGLE 553
Query: 350 ASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLAC 409
S +R D +++ L NE EA D + F + D LL
Sbjct: 554 QSPLFRQDAVEMAVHYLGIRMNEAIRAAQEALDEQDQENAEKCMAYFRKYALLADSLLEG 613
Query: 410 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 469
H + L W+ A+ + E++ +YE NAR +T W + DY K WSGL+
Sbjct: 614 HPTWRLSRWISFARSHGTSPEEKNKYEQNARRLVTRW-------GPPVDDYAAKIWSGLI 666
Query: 470 RDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 507
RDYY PR +++ I+S S + W +W++ T
Sbjct: 667 RDYYLPR----WEHFIQSRLSEKNPDMGAWEEKWVRST 700
>gi|395804724|ref|ZP_10483959.1| alpha-N-acetylglucosaminidase [Flavobacterium sp. F52]
gi|395433112|gb|EJF99070.1| alpha-N-acetylglucosaminidase [Flavobacterium sp. F52]
Length = 722
Score = 209 bits (531), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 147/513 (28%), Positives = 244/513 (47%), Gaps = 52/513 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V+PAFSG VP A P AKIT+L +W S + T+LLD+ DPLF +IG+ FIE
Sbjct: 245 VVPAFSGYVPKAFAEKHPEAKITELKSW----SGGGFASTFLLDSKDPLFKQIGKRFIEI 300
Query: 68 QLKEYGRTSHIYNCDTFDENTPPV---DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 124
K YG+ S+ Y D+F+E PPV + E +S+ G+A+Y + AVW+MQGWLF
Sbjct: 301 YTKMYGK-SNFYLADSFNEIEPPVSEHNKYEELSNYGSAVYETIDEAAPGAVWVMQGWLF 359
Query: 125 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
+ FW KA L+ VP K++V D + +W + FYG + + +HN+ G+
Sbjct: 360 GDNKEFWTKEATKAFLSKVPNEKVMVQDYANDRYKVWENQEAFYGKQWTYGYVHNYGGSN 419
Query: 184 EMYGIL----DSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDV 239
+YG L D +A + +VG G EG+ N +VY+ + ++ + + +
Sbjct: 420 PVYGDLNFYKDELA---SLLKNPNRGNIVGYGAMPEGLNNNSIVYEYIYDLPWTKAEQPL 476
Query: 240 KAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
W+ +Y RYG++ ++ AW +L +VYN T D A+ + +TE
Sbjct: 477 NDWMAKYLNARYGQTSESVFHAWELLLKSVYNVKYWETRWWNDWAGAYLLFKRPTVKITE 536
Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 359
K G P K ++ AL++ + + +N +YDLI
Sbjct: 537 FK----GNPGDK--------------------IKLKEALDILKKEAKKYNKNNLIQYDLI 572
Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
D++R + +E + I+AYQ + QL ++ + V + D +++ L W+
Sbjct: 573 DVSRHYNSLSIDEELIECIKAYQEKNIAKGDQLFKQIEKQVLETDKMMSGQPLNNLNQWV 632
Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 479
+SA + E Y NA+T IT+W L DY ++ W G+ + +Y PR +
Sbjct: 633 KSASDYGSSPEVSSLYAKNAKTLITLWGGEGH-----LNDYASRSWKGMYKGFYWPRWKM 687
Query: 480 YFKYMIESLESGDGF-------RLKDWRREWIK 505
+ + + ++ + F +K+W W +
Sbjct: 688 FLEALKKAAVTNTSFDENKERESIKNWEINWTE 720
>gi|404487024|ref|ZP_11022211.1| hypothetical protein HMPREF9448_02667 [Barnesiella intestinihominis
YIT 11860]
gi|404335520|gb|EJZ61989.1| hypothetical protein HMPREF9448_02667 [Barnesiella intestinihominis
YIT 11860]
Length = 722
Score = 208 bits (529), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 157/533 (29%), Positives = 245/533 (45%), Gaps = 77/533 (14%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNW--FSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
+ PAF+G VP A P + L W F K + Y+L P F EIG+ FI
Sbjct: 237 IAPAFAGFVPTAFAERHPEIQFKHL-EWGGFDEKYN-----AYVLPPETPYFKEIGKLFI 290
Query: 66 EQQLKEYGRTSHIYNCDTFDENTPPV------DSPEYISSLGAAIYSGMQSGDSDAVWLM 119
E+ KE+G+ ++ Y D+F+E PV + ++ G +IY + +G+ DAVW+
Sbjct: 291 EEWEKEFGKNTY-YLSDSFNEMKLPVAEGDDDGKHKLLAQYGESIYHSIAAGNPDAVWVT 349
Query: 120 QGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPY 171
QGW F Y FW ++ALL+ VP K++++DL + K +W T + FYG +
Sbjct: 350 QGWTFGYQHDFWDKASLQALLSRVPDDKMIIIDLGNDYPKWVWGTEQTWKNHDGFYGKKW 409
Query: 172 IWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEM 230
I+ + NF G M G L A EA S N +VG G + EG+E N VVY+L+++M
Sbjct: 410 IFSYVPNFGGKTPMTGDLQMYATSSAEALHSANAGNLVGFGSAPEGLENNEVVYELLADM 469
Query: 231 AFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDV 290
+ + +D+ +W+ Y RYG A+ AW T Y+ +V PD
Sbjct: 470 GWTADSIDLDSWLPVYCKARYGGCPAAMDSAWQRFKETAYSSLYSYPRFTWQTVV--PDT 527
Query: 291 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 350
+ +SK V S ++ +ELF++ + L +
Sbjct: 528 ----------------RRISKLDVSDS----------------FLQGVELFLSCADSLES 555
Query: 351 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 410
S Y D I+ LA A++ + ++ L + Q R +E++ D+D LLA H
Sbjct: 556 SPLYVNDAIEYASYYLAAKADDCYKRALKEDSLGNRVAAMQQLDRSVEILLDVDKLLASH 615
Query: 411 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 470
+ L W++ A+ + + ++ YE NA+ IT W DY ++WSGL++
Sbjct: 616 PLYRLEEWVDMARDWGKTDLEKDAYEANAKRLITTW-------GGFQEDYAARFWSGLIK 668
Query: 471 DYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
DYY PR +YF S + D L W WIK W N + P++S
Sbjct: 669 DYYIPRMKLYF-----SEQRAD---LDRWEENWIKAP--WHNTSTSFEDPLQS 711
>gi|32564213|ref|NP_496948.2| Protein K09E4.4 [Caenorhabditis elegans]
gi|25814792|emb|CAB70170.2| Protein K09E4.4 [Caenorhabditis elegans]
Length = 715
Score = 208 bits (529), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 136/485 (28%), Positives = 226/485 (46%), Gaps = 48/485 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LP F+G VP L+ +FP++K +L W + S+ C + DPLF +IG F+
Sbjct: 247 ILPTFAGFVPDHLETLFPASKFNRLPRWNNFTSET--SCMLSVSPFDPLFQKIGSTFLRH 304
Query: 68 QLKEYG-RTSHIYNCDTFDENTPPVDS---PEYISSLGAAIYSGMQSGDSDAVWLMQGWL 123
Q K +G +++Y+ D F+E P + +++ AI + + D + VW++Q W
Sbjct: 305 QKKMFGGDVTNMYSADPFNEILPSESAKFDAKFVKQTAQAIMNSCKKVDKNCVWVLQSWS 364
Query: 124 FSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
F+YD W +K+ L+++P+G L++LDL+AEV P W + F G ++WC+LHNF G+
Sbjct: 365 FTYDQ-WPAWAIKSFLSAIPVGNLLILDLYAEVVPAWQMTSSFQGHHFVWCLLHNFGGSR 423
Query: 184 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 243
E+ G L I G A + +VG G+SME I+QN VVY M + + E + + W+
Sbjct: 424 ELRGNLQKIDKGYQLALMKAGSNLVGAGLSMEAIDQNYVVYQFMIDRMWSPEPLPLNNWL 483
Query: 244 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
YS RY Q W +L T YN + V +
Sbjct: 484 KAYSESRYSADFKVAQKFWTLLAGTFYNQPEKWGTPRFSVFL------------------ 525
Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVI-RALELFIASGNELSASNTYRYDLIDLT 362
Y +P + W+ E R EL A + L +R DL D+
Sbjct: 526 -YHRPGFGRKI-----------EYWFPVEETFSRFRELLPALVHVLGEHPLFREDLNDVM 573
Query: 363 RQALA-KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
R+ + NE L++ EA+ + D V +E+ + ++ + + W+E+
Sbjct: 574 REMTQFEMGNEAALSMSEAFLMEDKQQVGASCEMLMEMFQKLES----YSNRDVRQWIEN 629
Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
AK +A E+ + + A +T+W Q DY ++ W+GL+ YYG R +
Sbjct: 630 AKSIAPTSEERQVFPVTAGDILTVWGPTGQN-----LDYAHREWAGLMSGYYGRRWQYFC 684
Query: 482 KYMIE 486
+++E
Sbjct: 685 DWILE 689
>gi|296115989|ref|ZP_06834611.1| alpha-N-acetylglucosaminidase [Gluconacetobacter hansenii ATCC
23769]
gi|295977458|gb|EFG84214.1| alpha-N-acetylglucosaminidase [Gluconacetobacter hansenii ATCC
23769]
Length = 758
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 147/544 (27%), Positives = 246/544 (45%), Gaps = 62/544 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP F G VP FP A + G W + P W LD DP+F ++ + ++
Sbjct: 259 VLPGFYGMVPDDFGKRFPQAHVIGQGEWNRFRR-PAW-----LDPRDPMFAKVAAIYYDE 312
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G + +Y+ F E P D P ++ G I + + A+W++ W D
Sbjct: 313 QKKLFG-DAPVYDIQPFQEGGTPGDVP--LADAGQGIQKALDTAHPGAMWMLMAWYEEPD 369
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
+L V +L ++DL + + F G P+++ L +F G + G
Sbjct: 370 --------ARMLAGVDRKRLFIVDLEQNTRVRENRDADFQGAPFLYGGLWDFGGRTSLGG 421
Query: 188 ILDSIAFG---PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
S +G P RT +N M+G + EG++ NP ++DL +E A++ + VD W
Sbjct: 422 --SSYDYGVRLPGLWRTQKN--MIGTAVFPEGMDNNPYIFDLFTEAAWRRDGVDTTQWTR 477
Query: 245 QYSVRRYGRSVPA-IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
Y+ RRYG+ + AW++L H+ + S Q
Sbjct: 478 DYADRRYGQPGDVHARKAWDLLLHSAF-------------------------SYRATGIQ 512
Query: 304 NYGKP-VSKEAVLKSETSSYDHPHLW-------YSTSEVIRALELFIASGNELSASNTYR 355
++G+ + +++ ++ S H W Y V A+ + + + A+ YR
Sbjct: 513 DFGEASAAPDSLFNAQPSLDTHSAAWNGMKVLPYDPHLVEAAMAELLQASDATRATEAYR 572
Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
YDL+D+TRQA+A A + I +A+ D + L+ R+LEL++ D LLA + F +
Sbjct: 573 YDLVDVTRQAVANQARAMLPQIGDAFAARDRAKLHALTTRWLELMDRQDSLLATNTFFRV 632
Query: 416 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
G WL + + + Q K +++AR +T W T + LRDY NK W+GL +DYY
Sbjct: 633 GTWLSWPQAWSDDPAQRKLMDYDARVILTNWGGRTASQVGHLRDYANKDWAGLTKDYYRV 692
Query: 476 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 535
R ++F + SL +G R D W K+ +W + VY GD+ ++ +++
Sbjct: 693 RWQLFFDSLETSLATGRPPREID----WYKVGEEWCHNGRVYSPTPEGDSYTVARDIHDY 748
Query: 536 YLQG 539
QG
Sbjct: 749 LTQG 752
>gi|393783265|ref|ZP_10371440.1| hypothetical protein HMPREF1071_02308 [Bacteroides salyersiae
CL02T12C01]
gi|392669544|gb|EIY63032.1| hypothetical protein HMPREF1071_02308 [Bacteroides salyersiae
CL02T12C01]
Length = 723
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 148/535 (27%), Positives = 246/535 (45%), Gaps = 73/535 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+ PAF+G VP A P K L W Y+L P F EIG+ F+++
Sbjct: 236 IAPAFAGFVPMAFAEKHPDIKFKHL-KWGGFDDK---FNAYVLPPDSPFFEEIGKRFVKE 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
KE+G+ ++ Y D+F+E PV + ++ G +IY + +G+ DA+W+ QG
Sbjct: 292 WEKEFGKNTY-YLSDSFNEMELPVAKDDVEGKHKLLAQYGESIYRSITAGNPDAIWVTQG 350
Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
W F Y FW ++ALL+ VP K++++DL + K +W T + FYG +I+
Sbjct: 351 WTFGYQHDFWDKASLQALLSHVPDDKMIIIDLGNDYPKWVWGTEQTWKVHDGFYGKKWIF 410
Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+ NF G + G L A EA + + ++G G + EG+E N VVY+L+++M +
Sbjct: 411 SYVPNFGGKTPLTGDLQMYATSSAEALKAPSHGNLIGFGSAPEGLENNEVVYELLADMGW 470
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
+ +D + W+ Y RYG ++++AW + T Y+ + ++P
Sbjct: 471 TDQAIDPEQWMPSYCTARYGAYPESMKNAWELFRKTAYSS-----------LYSYPRFTW 519
Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
+ + + +SK V + + + +ELF+AS + L+ S
Sbjct: 520 QTVIPDQRR-------ISKIDV----------------SDDFLHGIELFLASADSLNRSK 556
Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
Y D I+ +A A++L+ + +Q + ++L+ ++D LLA H
Sbjct: 557 LYVNDAIEFASYYIAAQADKLYKQALTEDTAGKPVAAYQHLNQAIDLLLNVDKLLASHPL 616
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
+ L W+E A+ ++ YE NA+ IT W DY ++WSGL++DY
Sbjct: 617 YRLEEWVELARNSGTTPAEKDAYEANAKRLITTW-------GGFQEDYAARFWSGLIKDY 669
Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALI 527
Y PR +YF S + GD L +W EWI+ W N P E D I
Sbjct: 670 YIPRLKLYF-----SKQRGD---LDNWEEEWIR--TPWHN--TTTPFEKPLDMAI 712
>gi|210631701|ref|ZP_03296968.1| hypothetical protein COLSTE_00853, partial [Collinsella stercoris
DSM 13279]
gi|210159960|gb|EEA90931.1| F5/8 type C domain protein, partial [Collinsella stercoris DSM
13279]
Length = 1906
Score = 207 bits (526), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 150/499 (30%), Positives = 238/499 (47%), Gaps = 42/499 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGR 62
V+ F G VP Q P++ G+W S + P TYL DA + F ++G
Sbjct: 391 VIQGFGGQVPTDFQQKNPNSVAASSGSW-SGFARPYMIKTYLTDADRAAGKEDYFQKVGT 449
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F E Q + +G+ SH Y D F E V I + + M D AVW+MQ W
Sbjct: 450 TFYEAQERIFGKVSHFYAVDPFHEGGT-VPQGFNIVDIYRTVQQKMLDYDPQAVWVMQQW 508
Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
+ D L + +VLDL ++++ ++ + VP++W MLHNF G
Sbjct: 509 QWGIDE-------NKLSGLAKKEQSLVLDLQSDLRS-QASPMENQQVPWVWNMLHNFGGR 560
Query: 183 IEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
+ M G+ + +A +A S N M G+G++ E I+ +P+VY+L+ +M ++ + VD +AW
Sbjct: 561 MGMDGVPEVLAIKIPQAYNS-NRYMRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRAW 619
Query: 243 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
Y RRYG + IQ+AW++L T Y DG + SI++
Sbjct: 620 TRSYIERRYGGTDAKIQEAWDILLDTAYKHVDGEY---------YQGASESIMNA----- 665
Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 362
+P + S++ H + Y E RA +LFI S + S +RYD +D+
Sbjct: 666 ----RPSDNKI---GSASTWGHSDIDYDKKEFERAAQLFIESYDTYKDSEAFRYDFVDVM 718
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
RQ LA E +AY+ DA L+ + LE+++ D +L+ F+LG W+E+A
Sbjct: 719 RQVLANAFQEYQPLAGDAYKQRDAERFELLANQMLEMLDAQDRMLSTSSDFMLGTWIENA 778
Query: 423 KQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
+ L ++ + +E NAR+ IT W ++ SL+ DY N+ WSGL YY PR +
Sbjct: 779 RTLLEDADDWTADLFELNARSLITTW--GLEKNGSLI-DYSNRQWSGLTGSYYKPRWESW 835
Query: 481 FKYMIESLESGDGFRLKDW 499
++LE G + +W
Sbjct: 836 ANARKKALEDGGSAQDLNW 854
>gi|146300873|ref|YP_001195464.1| alpha-N-acetylglucosaminidase [Flavobacterium johnsoniae UW101]
gi|146155291|gb|ABQ06145.1| Candidate alpha-glycosidase; Glycoside hydrolase family 89
[Flavobacterium johnsoniae UW101]
Length = 723
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 148/511 (28%), Positives = 242/511 (47%), Gaps = 47/511 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V+PAFSG VP A P AKIT+L +W S + T+LLD+ DPLF +IG+ FIE
Sbjct: 245 VVPAFSGYVPKAFAEKHPEAKITELNSW----SGGGFESTFLLDSKDPLFKKIGKRFIEI 300
Query: 68 QLKEYGRTSHIYNCDTFDENTPPV---DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 124
K YG+ S+ Y D+F+E PPV + E +++ G+AIY ++ AVW+MQGWLF
Sbjct: 301 YTKMYGK-SNFYLADSFNEIEPPVTEHNKYEELANYGSAIYETIEEAAPGAVWVMQGWLF 359
Query: 125 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
+ FW A L+ VP +L+V D + +W + FYG + + +HN+ G+
Sbjct: 360 GDNKNFWTKEATSAFLSKVPNDRLMVQDYANDRYKVWENQEAFYGKQWTYGYVHNYGGSN 419
Query: 184 EMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
+YG L+ V + +VG G EG+ N +VY+ + ++ + + VK W
Sbjct: 420 PVYGDLNFYKNELVSLLKNPHRGNVVGYGAMPEGLNNNAIVYEFIYDLPWSKGEQSVKDW 479
Query: 243 INQYSVRRY-GRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
+ Y RY ++ ++ AW +L +VY+ T D A+ ++TE K
Sbjct: 480 LTNYLNARYEQKTSDSVFKAWELLLESVYSTKYWETRWWNDRAGAYLLFKRPTATITEFK 539
Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
G P K+ ++ AL++ A + N +YDLID
Sbjct: 540 ----GNPGDKD--------------------KLKEALDILKAEAKKYDKKNFIQYDLIDA 575
Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
+R + +E + ++AYQ D QL ++ + V ++D ++ L W++S
Sbjct: 576 SRHYYSLSIDEDLVECVKAYQQKDITKGDQLFKKIEKQVLEIDKSMSGQPLNSLNYWVKS 635
Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
A + E K Y NA+T IT+W L DY ++ W G+ + +Y PR ++
Sbjct: 636 ASEYGSTPEVSKLYVKNAKTLITLWGGEGH-----LNDYASRSWQGMYKGFYWPRWKMFL 690
Query: 482 KYMIESL-------ESGDGFRLKDWRREWIK 505
++ E+ + +K+W +W K
Sbjct: 691 TAFKKTAVNNTPFDETKEREEIKNWEIKWTK 721
>gi|160914140|ref|ZP_02076362.1| hypothetical protein EUBDOL_00149 [Eubacterium dolichum DSM 3991]
gi|158433951|gb|EDP12240.1| hypothetical protein EUBDOL_00149 [Eubacterium dolichum DSM 3991]
Length = 2150
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 136/478 (28%), Positives = 237/478 (49%), Gaps = 50/478 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLD-----ATDPLFIEIGR 62
V+ F G VP + A +T + W S + P TYL F ++ +
Sbjct: 424 VIQGFCGQVPMSFVEKNEGAVLTPIDEWPSF-TRPAMIKTYLSQEEIAAGKKDYFKDVAK 482
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQ 120
F E+Q +G S Y D F E NT +D ++++ + M ++DA+W+MQ
Sbjct: 483 TFYEKQKNVFGDVSDYYASDPFHEGGNTQGLD----VTNIFKTVQEEMLKSNADAIWVMQ 538
Query: 121 GWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFA 180
W + D ++ L V + + LDL +++ P S+ + G+ +IWCMLHNF
Sbjct: 539 QWQGNLDH----AKLSGL---VKPEQALALDLQSDMNP--SSVMENEGISWIWCMLHNFG 589
Query: 181 GNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVK 240
G + + G ++ IA P A S N M G+G++ E +E +P+VY+++ +M + + +D +
Sbjct: 590 GRMGLDGEVEVIAKEPAIA-ASNNQYMKGIGITPEALENSPIVYEMLFDMTWSKDPIDYQ 648
Query: 241 AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
AW+++Y+ RR G S ++Q+AW++L T Y +D + + ++I+ G
Sbjct: 649 AWVDKYATRRAGGSSDSLQEAWDMLLETAY----------KDKGIYYQGAGETVINARPG 698
Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
N+ S S++ H ++ Y E+ + L L I + + +AS YRYDL D
Sbjct: 699 --TNF-----------SSASTWGHSNILYDKEELDKVLSLLIENYDAFAASEAYRYDLAD 745
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
+ Q L A E +++A D+ ++S FLEL++ D +L + F+LG W+
Sbjct: 746 VAEQVLCNAAIEYHALMVQALNNKDSAEFKRISTHFLELIDLSDRILGSSEEFMLGTWIH 805
Query: 421 SAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
A+++ N + + +E+NAR +T W E + L+DY N+ W+GL +Y R
Sbjct: 806 DAREMLDNADDWTKDLFEFNARAVVTTW---GGERSGSLKDYSNRKWAGLTSSFYKER 860
>gi|169351448|ref|ZP_02868386.1| hypothetical protein CLOSPI_02228 [Clostridium spiroforme DSM 1552]
gi|169291670|gb|EDS73803.1| LPXTG-motif cell wall anchor domain protein [Clostridium spiroforme
DSM 1552]
Length = 1990
Score = 205 bits (521), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 150/527 (28%), Positives = 251/527 (47%), Gaps = 48/527 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL FSG VP ++ + + G+W + P TY+ + F ++ F +
Sbjct: 397 VLSGFSGQVPTNFKDKYQDVQYVAQGSWCGYER-PDMLRTYVDNGGTDYFSQMADVFYKA 455
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q +G ++IY D F E D + + + M D DA+WL+Q W S
Sbjct: 456 QRDIFGDVTNIYAVDPFHEGGKIGDMN--YTKVYETVQKKMMENDEDAIWLIQEWSGSIA 513
Query: 128 PFWRPPQMKALLNSVPLGK--LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
P ++ + L K ++VLDLF+EV P +++ + P+IW MLHNF G + +
Sbjct: 514 S--NPSKL------INLDKEHVIVLDLFSEVSP-RNSALEAADTPWIWNMLHNFGGRMGL 564
Query: 186 YGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
+ ++ P + SE+ MVG+GM+ E IE +P+ Y+L+ +M + + +D + W
Sbjct: 565 DANPEKVSQNIPNTYQNSEH--MVGIGMTPEAIENSPMAYELLWDMTWTKDPIDFRQWCQ 622
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
Y+ R YG + I++ WN+L T YN D + S+I+
Sbjct: 623 DYAKRIYGGTNEDIEEVWNILLDTGYNRKDN----------YYQGAPESVIN-------- 664
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+P + + SS+ H + Y E+ RA+ L + +E S + YDL D+TRQ
Sbjct: 665 -ARPTTN----FTSASSWGHSTINYDKEELERAVYLMAKNYDEFKDSPAFIYDLSDITRQ 719
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
++ A E ++ AYQ + LS +FLE++ D +L+ + FL+G W+E A+
Sbjct: 720 LISNSAQEYHKAMVNAYQAGNLSEFEVLSDKFLEMILLQDQILSTNSDFLVGKWIEQART 779
Query: 425 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
+ ++ + + +E+NAR IT W LRDY N+ W+GL +DYY PR +
Sbjct: 780 MIEDSDDWTKDLFEFNARDLITTWGGLKNANGGGLRDYSNRQWAGLTKDYYYPRWQKWIN 839
Query: 483 YMIESLESGDGFRLKDW---RREWIKLTNDWQNGRNVYPVESNGDAL 526
+ E++++ DW EW L +D N Y +E++ AL
Sbjct: 840 DVKEAMKNNTAVPSTDWFLMEWEWANLKSDEGNE---YSIEASNLAL 883
>gi|294674521|ref|YP_003575137.1| alpha-N-acetylglucosaminidase [Prevotella ruminicola 23]
gi|294472030|gb|ADE81419.1| putative alpha-N-acetylglucosaminidase [Prevotella ruminicola 23]
Length = 754
Score = 204 bits (520), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 140/495 (28%), Positives = 232/495 (46%), Gaps = 54/495 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP + G VP + + G W + L TD F EI + +
Sbjct: 223 VLPGYCGMVPHDAKEKL-GLNVADAGLWNGFQRPAN------LLPTDARFSEIATLYYNE 275
Query: 68 QLKEYGRTSHIYNCDTFDENT--PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
K +G+ + Y+ D F E+ P +D + G A+ M+ + AVW++QGW +
Sbjct: 276 LTKLFGKADY-YSMDPFHESNDDPNID----YAKAGQAMMQAMKRVNPKAVWVIQGW--T 328
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNF 179
+P +A+++ + G L+VLDLF+E +P IW + + +++C+L NF
Sbjct: 329 ENP------REAMVDDMKTGDLLVLDLFSECRPMFGIPSIWKREQGYKQHQWLFCLLENF 382
Query: 180 AGNIEMYG----ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHE 235
N+ ++G +LD+ ++++ + G+G +MEG E NPV+++LMSE+ ++ E
Sbjct: 383 GANVGLHGRMDQLLDNFYMLQSSKFQAQSSKLKGIGFTMEGSENNPVMFELMSELPWRPE 442
Query: 236 KVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSII 295
K + W+ Y RYG AI+ AW L ++YNC G + SI
Sbjct: 443 KFTKEQWVKNYVKARYGVEDEAIEKAWLTLAKSIYNCPAGNNQQG---------PHESIF 493
Query: 296 SVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 355
G+P + + SS+ +Y + +A +L + + +N +
Sbjct: 494 C---------GRPT----LNNFQASSWSKMKNYYDPAMTKKAAKLMNSVAEKYRGNNNFE 540
Query: 356 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 415
YDL+D+TRQALA A + I Y+ + + RFL+++ D LL F +
Sbjct: 541 YDLVDITRQALADQARLQYQKTIADYKAFSRKQFDRDAERFLKMLLLQDKLLGTRTEFRV 600
Query: 416 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 475
G W + A E++K YEWNAR QIT W + + LRDY +K W GLL+D+Y
Sbjct: 601 GHWTQDAVNAGNTAEEKKLYEWNARVQITTWGNRYCADTGGLRDYAHKEWQGLLKDFYYV 660
Query: 476 RAAIYFKYMIESLES 490
R YF + +++
Sbjct: 661 RWKSYFDALAAQMKA 675
>gi|341892319|gb|EGT48254.1| hypothetical protein CAEBREN_28412 [Caenorhabditis brenneri]
Length = 713
Score = 204 bits (519), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 133/486 (27%), Positives = 227/486 (46%), Gaps = 50/486 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LP F+G VP L+ +FPS+K T+L W + S+ C + DPLF +IG F+
Sbjct: 245 ILPTFAGFVPDQLEKLFPSSKFTRLPCWNNFTSET--SCLLSVSPFDPLFQKIGSLFLRH 302
Query: 68 QLKEYG-RTSHIYNCDTFDENTPPVDS---PEYISSLGAAIYSGMQSGDSDAVWLMQGWL 123
Q K +G +++Y+ D F+E P + +++ AI + + D + +W++Q W
Sbjct: 303 QKKMFGGDITNLYSADPFNEILPSDSAKFDAKFVKQTAQAIMNSCRKVDKNCIWVLQSWS 362
Query: 124 FSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
F+YD W +K+ L++VP+G L++LDL++EV P W ++ F+G YIWCMLH+F G+
Sbjct: 363 FTYDE-WPSWAIKSFLSAVPIGNLLILDLYSEVVPAWQSTSSFHGHNYIWCMLHSFGGSR 421
Query: 184 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 243
E+ G L + G A + ++G G++ME I+QN V+Y M + + E + + WI
Sbjct: 422 ELRGNLQKVDKGYQLALMKGGSNLIGAGLTMEAIDQNYVIYQFMVDRMWSSEPLPLNTWI 481
Query: 244 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
YS RY W +L + YN + + V +
Sbjct: 482 KSYSESRYSADFKVSHKFWTLLAFSFYNQPEKWGNPRFSVFL------------------ 523
Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIAS-GNELSASNTYRYDLIDLT 362
Y +P + + W+ E L+ I S + L ++ DL D+
Sbjct: 524 -YHRPAFGKKI-----------EYWFPVEETFGHLQSLIPSLIHVLGDHPLFKEDLNDVM 571
Query: 363 RQALAKY--ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
R A+ ++ N+ L + EA+ + D + + DM L + + W+E
Sbjct: 572 R-AITQFEVGNDAALTLTEAFLMEDKQQIGSTCENLM----DMFLKLESYSNRDMKHWIE 626
Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
+K +A E+ + + A +T+W Q DY ++ W GLL YYG R +
Sbjct: 627 DSKSIAATSEERQVFPATAADILTVWGPEGQN-----LDYAHREWEGLLSGYYGRRWQYF 681
Query: 481 FKYMIE 486
+++E
Sbjct: 682 CDWILE 687
>gi|410634789|ref|ZP_11345419.1| alpha-N-acetylglucosaminidase [Glaciecola arctica BSs20135]
gi|410145665|dbj|GAC22286.1| alpha-N-acetylglucosaminidase [Glaciecola arctica BSs20135]
Length = 750
Score = 204 bits (519), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 149/548 (27%), Positives = 264/548 (48%), Gaps = 73/548 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
++ AF+G VP A +FP A+I +L +W P YLL +PLF++IG+ +IE+
Sbjct: 250 IVHAFAGFVPPATSELFPEAQIRRL-SWGG--GLPESTYGYLLSPENPLFVKIGKMYIEE 306
Query: 68 QLKEYGRTSHIYNCDTFDE-NTPPVDS-PEYISSL---GAAIYSGMQSGDSDAVWLMQGW 122
KE+G+ + Y D+F+E + PP D+ E ++ L G +Y +++ + DA W+MQGW
Sbjct: 307 WQKEFGKNEY-YLADSFNEMDVPPADTEAELLTELAGYGDRVYQSIKAANPDATWVMQGW 365
Query: 123 LFSYDP------FWRPPQMKALLNSVPLGKLVVLDLFAE-------VKPIWSTSKQFYGV 169
F Y FW P ++ AL++ VP KL++LDL E + P W F+
Sbjct: 366 TFPYHKDENRQLFWTPERLHALVSKVPDDKLLILDLANEYNKLWWKIDPSWKMYSGFFNK 425
Query: 170 PYIWCMLHNFAGNIEMYGILDSIAFGPVEART-SENTTMVGVGMSMEGIEQNPVVYDLMS 228
+I+ + N G + G D A P++A + ++G G + EGIE N ++Y+L++
Sbjct: 426 KWIYSFIPNMGGKTPLNGRFDIYAELPIDALNYKDKGNLIGFGFAPEGIENNEMIYELLT 485
Query: 229 EMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFP 288
+MA+Q + +DV W +Y+++RYG +++ A++ L N+ + +F
Sbjct: 486 DMAWQRKAIDVDQWQAKYAMQRYGAYPGSLEKAFSYL--------------NKSALGSF- 530
Query: 289 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 348
VD I Y+N P E DH + + + + I+A LF+ + +L
Sbjct: 531 -VDHPIHRFQLRPYRN---PEGVE----------DHATV-HESEDFIKATGLFLQASEQL 575
Query: 349 SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLA 408
+ Y++D +++T L+ + L + + V + + ++ MD LLA
Sbjct: 576 KDNKLYQHDAMEITTLFLSLVTDNLLTKFLAKDVEQRDYSVLDEA---ISVMHTMDKLLA 632
Query: 409 CHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGL 468
H L W++ A+ ++ YE NA+ +T W + + DY + WSGL
Sbjct: 633 EHPNHQLVTWVDYARTWGSTTAEKDYYESNAKRLLTTWGGDP------VNDYAGRVWSGL 686
Query: 469 LRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALIT 528
+ +YY PR Y +++++ F ++ W W+ ++N Y D +
Sbjct: 687 IGNYYAPRWQSYH----DAVKNNQTFDVRQWEENWV--MTPYKNTSTAY-----QDPVRV 735
Query: 529 SQWLYNKY 536
+Q +Y KY
Sbjct: 736 AQAMYFKY 743
>gi|365104185|ref|ZP_09333846.1| hypothetical protein HMPREF9428_02927 [Citrobacter freundii
4_7_47CFAA]
gi|363644798|gb|EHL84079.1| hypothetical protein HMPREF9428_02927 [Citrobacter freundii
4_7_47CFAA]
Length = 1049
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 142/519 (27%), Positives = 240/519 (46%), Gaps = 43/519 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V P F+G VP P A++ G W P TY+ D F ++ + +
Sbjct: 398 VFPGFAGQVPDTFAAKNPQAQVIDQGEWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQT 455
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+G SH Y D F E D + + + + M D DAVW++Q W
Sbjct: 456 LKTTFGDISHYYAVDPFHEGGNRADLD--MVKVAQTVQNKMLEHDKDAVWIIQNW----- 508
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
+ A LN + ++LDL+A+ KP + +F P+IW MLH F G + G
Sbjct: 509 ---QENPTDAFLNGLKKDHALILDLYADNKPNHAMRHEFSNTPWIWNMLHAFGGRMGFSG 565
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
+ + +A + +E+ M GVG++ E + NP++Y+++ +MA++ + A+I+ +
Sbjct: 566 MPEVLA-QEIPQSLAESKKMKGVGVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWL 624
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RYG P I+ AW+++ T Y+ D+ R + SII G +G
Sbjct: 625 TSRYGAQSPEIEQAWDIMVKTAYHRR---KDRQR--------AEDSIIDAKPG----FG- 668
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
V++ + YD +E + L L+++ + A+ Y++DL+D+TRQ LA
Sbjct: 669 -VTRACTYYTALIDYDK-------AEFEKILPLYLSVYDHFKANPAYQHDLVDITRQVLA 720
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL-- 425
+ E + +A+ D QLS +FL L++ D +L+ F+LG W+ SA+ +
Sbjct: 721 NASYEYYRAFEDAWIAKDYSAFNQLSGKFLRLIKLQDQVLSTRPEFMLGTWINSARTMLD 780
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
++ Q+E+NAR +T W +A LRDY N+ W GL D+Y R A + + +
Sbjct: 781 GMDDWTRDQFEFNARAMVTTWGTEQAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQALK 839
Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQN-GRNVYPVESNG 523
+ +G + + W L W N N YP + +G
Sbjct: 840 SAAATGQ--KQDAIKVNWFPLEYRWVNQSGNGYPTQPSG 876
>gi|421734750|ref|ZP_16173809.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum LMG 13195]
gi|407077324|gb|EKE50171.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum LMG 13195]
Length = 1919
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 148/501 (29%), Positives = 237/501 (47%), Gaps = 46/501 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGR 62
V+ F G VPA Q P++ G W P TYL DA + F ++G
Sbjct: 399 VIQGFGGQVPADFQEKNPTSVAASSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVGD 457
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F + Q +G+ S+ Y D F E D + I + + M D AVW+MQ W
Sbjct: 458 TFYKAQENVFGKVSNYYAVDPFHEGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQW 516
Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
+ D ++ L + G+ +VLDL ++++ +++ + GVP++W MLHNF G
Sbjct: 517 QWGIDE----TKLSGLADK---GQTLVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGR 568
Query: 183 IEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
+ + G+ + I+ +A S M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W
Sbjct: 569 MGLDGVPEVISQDITKAYNSSGY-MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSW 627
Query: 243 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEG 300
+Y+ RRYG + I+ AW++L T Y TDG + +I A P D +I S
Sbjct: 628 TQEYAERRYGGTDGTIEKAWDILLDTAYKHTDGEYYQGASESIINARPS-DNTIGSA--- 683
Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
S++ H + Y + +A LF + + S +RYD +D
Sbjct: 684 -------------------STWGHSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVD 724
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
+ RQ LA E +AY+ D LS R L++++ D LL+ D FL+G W++
Sbjct: 725 VMRQVLANSFQEYQPLAGQAYKSGDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWID 784
Query: 421 SAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
A+ + + +E NAR +T W N + SL+ DY N+ W+GL DYY R
Sbjct: 785 DARTMLDGADDWTADLFELNARALVTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWK 841
Query: 479 IYFKYMIESLESGDGFRLKDW 499
Y + LE G F DW
Sbjct: 842 TYVDNRLNKLEHGTDFTDPDW 862
>gi|393785795|ref|ZP_10373941.1| hypothetical protein HMPREF1068_00221 [Bacteroides nordii
CL02T12C05]
gi|392661414|gb|EIY55000.1| hypothetical protein HMPREF1068_00221 [Bacteroides nordii
CL02T12C05]
Length = 724
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 143/512 (27%), Positives = 234/512 (45%), Gaps = 69/512 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+ PAF+G VP A P K L W Y+L P F EIG+ F+++
Sbjct: 236 IAPAFAGFVPMAFAEKHPDIKFKHL-KWGGFDDK---FNAYVLPPDSPFFEEIGKRFVKE 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
KE+G+ ++ Y D+F+E PV + ++ G +IY + +G+ DA+W+ QG
Sbjct: 292 WEKEFGKNTY-YLSDSFNEMELPVAKDDVEGKHKLLAQYGESIYRSITAGNPDAIWVTQG 350
Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIW 173
W F Y FW ++ALL+ VP K++++DL + K +W T + FYG +I+
Sbjct: 351 WTFGYQHSFWDKASLQALLSHVPDDKMIIIDLGNDYPKWVWGTEQTWKVHDGFYGKKWIF 410
Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+ NF G M G L A EA SE + ++G G + EG+E N VVY+L+++M +
Sbjct: 411 SYVPNFGGKTPMTGDLQMYASSSAEALQSESHGNLIGFGSAPEGLENNEVVYELLADMGW 470
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
+ +D+ W+ Y + RYG ++DAW++ T Y+ ++ PD
Sbjct: 471 TDQAIDLDKWMPSYCMARYGAYPETMKDAWDLFRKTAYSSLYSYPRFTWQTVI--PD--- 525
Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
K + K + S + + +ELF+ S + L S
Sbjct: 526 ------------------KRRISKIDVS-----------DDFLHGVELFLNSADSLKNSK 556
Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
Y D I+ +A A++L+ + + + Q + ++++ ++D LLA H
Sbjct: 557 LYVNDAIEFASYYIAAKADKLYGKALAEDTVGRSAVAQQYLNQTIDMLLNVDKLLASHPL 616
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
+ L W+ A+ ++ YE NA+ IT W DY ++WSGL++DY
Sbjct: 617 YRLEEWVNFARNSGTTPAEKDAYEINAKRLITTW-------GGFQEDYAARFWSGLIKDY 669
Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWI 504
Y PR IYF SL+ +W EWI
Sbjct: 670 YIPRLKIYFSKQRGSLD--------NWEEEWI 693
>gi|311064845|ref|YP_003971571.1| beta-N-hexosaminidase [Bifidobacterium bifidum PRL2010]
gi|310867165|gb|ADP36534.1| Beta-N-hexosaminidase [Bifidobacterium bifidum PRL2010]
Length = 1923
Score = 203 bits (517), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 148/501 (29%), Positives = 237/501 (47%), Gaps = 46/501 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGR 62
V+ F G VPA Q P++ G W P TYL DA + F ++G
Sbjct: 403 VIQGFGGQVPADFQEKNPTSVAASSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGD 461
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F + Q +G+ S+ Y D F E D + I + + M D AVW+MQ W
Sbjct: 462 TFYKAQENVFGKVSNYYAVDPFHEGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQW 520
Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
+ D ++ L + G+ +VLDL ++++ +++ + GVP++W MLHNF G
Sbjct: 521 QWGIDE----TKLSGLADK---GQALVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGR 572
Query: 183 IEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
+ + G+ + I+ +A S M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W
Sbjct: 573 MGLDGVPEVISQDITKAYNSSGY-MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSW 631
Query: 243 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEG 300
+Y+ RRYG + I+ AW++L T Y TDG + +I A P D +I S
Sbjct: 632 TQEYAERRYGGTDGTIEKAWDILLDTAYKHTDGEYYQGASESIINARPS-DNTIGSA--- 687
Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
S++ H + Y + +A LF + + S +RYD +D
Sbjct: 688 -------------------STWGHSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVD 728
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
+ RQ LA E +AY+ D LS R L++++ D LL+ D FL+G W++
Sbjct: 729 VMRQVLANSFQEYQPLAGQAYKSGDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWID 788
Query: 421 SAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
A+ + + +E NAR +T W N + SL+ DY N+ W+GL DYY R
Sbjct: 789 DARTMLDGADDWTADLFELNARALVTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWK 845
Query: 479 IYFKYMIESLESGDGFRLKDW 499
Y + LE G F DW
Sbjct: 846 TYVDNRLNKLEHGTDFTDPDW 866
>gi|313140918|ref|ZP_07803111.1| alpha-N-acetylglucosaminidase family protein [Bifidobacterium
bifidum NCIMB 41171]
gi|313133428|gb|EFR51045.1| alpha-N-acetylglucosaminidase family protein [Bifidobacterium
bifidum NCIMB 41171]
Length = 2005
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 148/501 (29%), Positives = 237/501 (47%), Gaps = 46/501 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGR 62
V+ F G VPA Q P++ G W P TYL DA + F ++G
Sbjct: 485 VIQGFGGQVPADFQEKNPTSVAASSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGD 543
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F + Q +G+ S+ Y D F E D + I + + M D AVW+MQ W
Sbjct: 544 TFYKAQESVFGKVSNYYAVDPFHEGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQW 602
Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
+ D ++ L + G+ +VLDL ++++ +++ + GVP++W MLHNF G
Sbjct: 603 QWGIDE----TKLSGLADK---GQALVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGR 654
Query: 183 IEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
+ + G+ + I+ +A S M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W
Sbjct: 655 MGLDGVPEVISQDITKAYNSSGY-MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSW 713
Query: 243 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEG 300
+Y+ RRYG + I+ AW++L T Y TDG + +I A P D +I S
Sbjct: 714 TQEYAERRYGGTDGTIEKAWDILLDTAYKHTDGEYYQGASESIINARPS-DNTIGSA--- 769
Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
S++ H + Y + +A LF + + S +RYD +D
Sbjct: 770 -------------------STWGHSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVD 810
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
+ RQ LA E +AY+ D LS R L++++ D LL+ D FL+G W++
Sbjct: 811 VMRQVLANSFQEYQPLAGQAYKSGDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWID 870
Query: 421 SAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
A+ + + +E NAR +T W N + SL+ DY N+ W+GL DYY R
Sbjct: 871 DARTMLDGADDWTADLFELNARALVTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWK 927
Query: 479 IYFKYMIESLESGDGFRLKDW 499
Y + LE G F DW
Sbjct: 928 TYVDNRLNKLEHGTDFTDPDW 948
>gi|390937398|ref|YP_006394957.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum BGN4]
gi|389891011|gb|AFL05078.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum BGN4]
Length = 1957
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 148/501 (29%), Positives = 236/501 (47%), Gaps = 46/501 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGR 62
V+ F G VPA Q P++ G W P TYL DA + F ++G
Sbjct: 437 VIQGFGGQVPADFQEKNPTSVAASSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGD 495
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F + Q +G+ S+ Y D F E D + I + + M D AVW+MQ W
Sbjct: 496 TFYKAQENVFGKVSNYYAVDPFHEGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQW 554
Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
+ D ++ L + G+ +VLDL ++++ ++ + GVP++W MLHNF G
Sbjct: 555 QWGIDE----TKLSGLADK---GQALVLDLQSDLRS-QASPMENQGVPWVWNMLHNFGGR 606
Query: 183 IEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
+ + G+ + I+ +A S M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W
Sbjct: 607 MGLDGVPEVISQDITKAYNSSGY-MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSW 665
Query: 243 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEG 300
+Y+ RRYG + I+ AW++L T Y TDG + +I A P D +I S
Sbjct: 666 TQEYAERRYGGTDGTIEKAWDILLDTAYKHTDGEYYQGASESIINARPS-DNTIGSA--- 721
Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
S++ H + Y + +A LF + + S +RYD +D
Sbjct: 722 -------------------STWGHSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVD 762
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
+ RQ LA E +AY+ D LS R L++++ D LL+ D FL+G W++
Sbjct: 763 VMRQVLANSFQEYQPLAGQAYKSGDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWID 822
Query: 421 SAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
A+ + + +E NAR +T W N + SL+ DY N+ W+GL DYY R
Sbjct: 823 DARTMLDGADDWTADLFELNARALVTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWK 879
Query: 479 IYFKYMIESLESGDGFRLKDW 499
Y + LE G F DW
Sbjct: 880 TYVDNRLNKLEHGTDFTDPDW 900
>gi|291086028|ref|ZP_06354661.2| alpha-N-acetylglucosaminidase family protein [Citrobacter youngae
ATCC 29220]
gi|291069185|gb|EFE07294.1| alpha-N-acetylglucosaminidase family protein [Citrobacter youngae
ATCC 29220]
Length = 1014
Score = 202 bits (515), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 142/519 (27%), Positives = 239/519 (46%), Gaps = 43/519 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V P F+G VP P A++ G+W P TY+ D F ++ + +
Sbjct: 363 VFPGFAGQVPDTFAAKNPQAQVIDQGDWVGFVRPPM-LRTYVKQGAD-YFSKVADVYYQT 420
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+G SH Y D F E D + + + + M D DAVW++Q W
Sbjct: 421 LKTTFGDISHYYAVDPFHEGGNRADLD--MVKVAQTVQNKMLEHDKDAVWIIQNW----- 473
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
+ A LN + ++LDL+A+ KP + +F P+IW MLH F G + G
Sbjct: 474 ---QENPTDAFLNGLKKDHALILDLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSG 530
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
+ + +A + +E+ M GVG++ E + NP++Y+++ +MA++ + A+I+ +
Sbjct: 531 MPEVLA-QEIPQSLAESKYMKGVGVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWL 589
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RYG P I+ AW+++ T Y+ D+ R + SII G +G
Sbjct: 590 TSRYGAQSPEIEQAWDIMVKTAYHRR---KDRQR--------AEDSIIDAKPG----FG- 633
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
V++ + YD +E + L L+++ + + Y++DL+D+TRQ LA
Sbjct: 634 -VTRACTYYTALIDYDK-------AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLA 685
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL-- 425
+ E + +A+ D QLS +FL L++ D +L F+LG WL SA+ +
Sbjct: 686 NASYEYYRAFEDAWMAKDYSAFNQLSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTMLD 745
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
++ Q+E+NAR +T W +A LRDY N+ W GL D+Y R A + + +
Sbjct: 746 GMDDWTRDQFEFNARAMVTTWGIEQAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQALK 804
Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGR-NVYPVESNG 523
+ +G + + W L W N N YP + +G
Sbjct: 805 NAAATGQ--KQDAIKVNWFPLEYRWVNQTGNGYPTQPSG 841
>gi|268533054|ref|XP_002631655.1| Hypothetical protein CBG20846 [Caenorhabditis briggsae]
Length = 712
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 128/486 (26%), Positives = 232/486 (47%), Gaps = 50/486 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LP FSG VP L+ +FP++K +L W + S+ C + DPLF +IG +F+
Sbjct: 244 ILPTFSGFVPDRLEKLFPTSKFNRLPCWNNFTSET--SCLLSVSPFDPLFQKIGSSFLRH 301
Query: 68 QLKEYG-RTSHIYNCDTFDENTPPVDS---PEYISSLGAAIYSGMQSGDSDAVWLMQGWL 123
Q K G +++Y+ D F+E P + +++ AI + + D + +W++Q W
Sbjct: 302 QKKMLGGDITNLYSADPFNEVLPSDSAKFDAKFVKQTAQAIMNSCRKVDKNCIWVLQSWS 361
Query: 124 FSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
F+YD W +K+ L++VP+G++++LDL++EV P W + F+G ++WCMLHNF G+
Sbjct: 362 FTYDQ-WPNWAIKSFLSAVPIGQMLILDLYSEVVPAWQMTSSFHGHNFVWCMLHNFGGSR 420
Query: 184 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 243
E+ G + + G A + +VG G+SME I+QN ++Y M + + E + + +W+
Sbjct: 421 ELRGNVQKVDKGYQLALMKAGSNLVGAGLSMEAIDQNYMMYQFMIDRMWTQEPIPLNSWL 480
Query: 244 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
YS RY W +L + YN + + V +
Sbjct: 481 KSYSESRYSADFKVAHKFWTILAGSFYNQPEKWGNPRFSVFL------------------ 522
Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALE-LFIASGNELSASNTYRYDLIDLT 362
Y +P + + W+ E LE L ++ + L ++ DL D+
Sbjct: 523 -YHRPAFGKKI-----------EYWFPVEETFTHLESLVLSLLHILGDHPLFKEDLNDVM 570
Query: 363 RQALAKY--ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
R A+ ++ NE L++ EA+ + D + + + + ++ + + W+E
Sbjct: 571 R-AITQFEIGNEAALSLTEAFLMEDKQQIGTTCENLMGMFQKLEP----YSNRDVRDWIE 625
Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
AK +A E+ + + +A +T+W Q DY ++ W+GLL YYG R +
Sbjct: 626 DAKSIAPTTEEREVFPISASDILTVWGPTGQN-----LDYAHREWAGLLSGYYGRRWQYF 680
Query: 481 FKYMIE 486
+++E
Sbjct: 681 CDWILE 686
>gi|440731409|ref|ZP_20911430.1| N-acetylglucosaminidase, partial [Xanthomonas translucens DAR61454]
gi|440373101|gb|ELQ09870.1| N-acetylglucosaminidase, partial [Xanthomonas translucens DAR61454]
Length = 732
Score = 202 bits (513), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 156/557 (28%), Positives = 239/557 (42%), Gaps = 83/557 (14%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSG VP A P A+I ++ W TY LD DPLF +I + FI+
Sbjct: 205 VLPAFSGYVPKAFAQAHPQARIYRMRAWEGFHE------TYWLDPADPLFTKIAQRFIQL 258
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDS----------------------PEY--------I 97
+ YG+ ++ Y D F+E PP+ + PE +
Sbjct: 259 YDRTYGKGTY-YLADAFNEMLPPIAADGSDARLASYGDSTANTAKTAPPEVSPAQRDKRL 317
Query: 98 SSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEV 156
+ G A+Y + + DAVW+MQGWLF D FW P + A L VP KL+VLD+ +
Sbjct: 318 ADYGRALYESIHRANPDAVWVMQGWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDR 377
Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
P W S F G +I+ +HN+ G+ +YG +AF + R + +VG G
Sbjct: 378 YPGTWKLSDAFDGKQWIYGYVHNYGGSNPVYG---DLAFYRDDLRALLADKDKQQLVGFG 434
Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
EG+ N VVY+ M +A+ ++ ++ W+ Y+ RYG + PA++ AW+ L V +
Sbjct: 435 AFPEGLHDNSVVYEYMYTLAWGGQQRSLQDWLGDYTRARYGHTSPALRAAWDDLQAAVLS 494
Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
P G Y + +P + + E + D P L
Sbjct: 495 TRYWT---------------PRWWRSRAGAYLLFKRPTLD--IGEFEGAPGDPPRL---- 533
Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 391
RAL+ +A E + + YRYDL+D R + + AY+ D
Sbjct: 534 ---RRALDQLLALAPEYADAPLYRYDLVDFARHYATGRVDAQLQQAVAAYRRGDVAAGDA 590
Query: 392 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 451
R V+ +DGL+ +L WL A+ A+ + Y +A+ QI++W
Sbjct: 591 AFARVQAAVQQLDGLVGGQQE-ILSSWLGDAEGDAKTPQDAAYYRRDAKAQISVWGGEGN 649
Query: 452 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREWI 504
L DY +K W G+ DYY PR A+ + + + SG RL+ W R+W+
Sbjct: 650 -----LGDYASKAWQGMYADYYLPRWALAMQALRAAAVSGGSVDEAALQQRLRVWERDWV 704
Query: 505 KLTNDWQNGRNVYPVES 521
+ PV +
Sbjct: 705 ACETPYTRRAPADPVAA 721
>gi|210611122|ref|ZP_03288736.1| hypothetical protein CLONEX_00926, partial [Clostridium nexile DSM
1787]
gi|210152109|gb|EEA83116.1| hypothetical protein CLONEX_00926 [Clostridium nexile DSM 1787]
Length = 1662
Score = 201 bits (512), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 152/541 (28%), Positives = 249/541 (46%), Gaps = 71/541 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL +SG VP + + PSA++ + G W S + +L F + + F +
Sbjct: 722 VLQGYSGMVPVDITDKDPSAQVIKQGTWCSFQRPS------MLKTDSETFDKYAQLFYKV 775
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + YG S Y D F E NT + SP I+ + + M D + +W++Q W
Sbjct: 776 QKEVYGDVSDYYATDPFHEGGNTGGM-SPTVIAE---KVLANMMEADENGIWIIQSW--- 828
Query: 126 YDPFWRPPQMKALLNSVPLGK--LVVLDLFAEVKPIWSTS-----------KQFYGVPYI 172
+ ALL + + +VLDL+AE P W+ + +F P++
Sbjct: 829 -----QGNPSTALLQGLDAARDHALVLDLYAEKTPHWNETDPGSYGGAEGGGEFLNTPWV 883
Query: 173 WCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+CML+NF G + ++G +++ G +A N M G+G++ E NPV+YDL E +
Sbjct: 884 YCMLNNFGGRLGLHGHIENFVNGVAQAAAQAN-HMAGIGITPEASVNNPVLYDLFFETIW 942
Query: 233 QHE-----KVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKN--RDVIV 285
+ +++ W Y+ RRYG + +A +L TVYN + V+
Sbjct: 943 SDDGENLSAINLDEWFKDYTTRRYGAESQSAYEAMQILNDTVYNPEMNMKGQGAPESVVN 1002
Query: 286 AFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASG 345
A P +D G +G V + Y +E+ +A L +
Sbjct: 1003 ARPGLDI-------GAASTWGNAV-----------------IDYDKAELEKAAALLLKDY 1038
Query: 346 NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDG 405
++L S Y+YDL ++ Q L+ A E + +A++ DA ++S FLE++ ++
Sbjct: 1039 DKLKDSAGYQYDLANVLEQVLSNTAQEYQKKMADAFREGDAEKFEKMSNSFLEIITKVEE 1098
Query: 406 LLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNK 463
+ + F+LG WLESAK LA+N + ++ YE NAR IT W Q + L DY N+
Sbjct: 1099 VTGTQEEFMLGTWLESAKALAKNADDFTKELYELNARGLITTWGSIEQANSGGLIDYSNR 1158
Query: 464 YWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR-REWIKLTNDWQNGRNVYPVESN 522
WSGL DYY PR + K++ E + G K++ +W ++ W N YP ++N
Sbjct: 1159 QWSGLTSDYYKPR---WEKWIAERKKELAGEESKNYSAADWFEMEWAWARSNNEYPTKAN 1215
Query: 523 G 523
G
Sbjct: 1216 G 1216
>gi|288927801|ref|ZP_06421648.1| putative alpha-N-acetylglucosaminidase
(N-acetyl-alpha-glucosaminidase) (NAG) [Prevotella sp.
oral taxon 317 str. F0108]
gi|288330635|gb|EFC69219.1| putative alpha-N-acetylglucosaminidase
(N-acetyl-alpha-glucosaminidase) (NAG) [Prevotella sp.
oral taxon 317 str. F0108]
Length = 723
Score = 201 bits (511), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 147/513 (28%), Positives = 238/513 (46%), Gaps = 69/513 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+ P F+G VP AL+ ++P+A I Q W + Y+L DPLF +IG FI++
Sbjct: 235 ICPGFAGFVPEALKELYPTADI-QYTTWEKAFHN------YILSPADPLFHKIGVMFIQE 287
Query: 68 QLKEYGRTSHIYNCDTFDENT---PPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQG 121
KE+GR Y D+F+E PP D P E+++ G +Y ++ + A W+MQG
Sbjct: 288 WEKEFGRCD-FYLIDSFNEMDIPFPPKDDPKRYEFMADFGKKVYQCIKEANPSATWVMQG 346
Query: 122 WLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTS------KQFYGVPYIW 173
W+F Y P W + AL++ VP K+++LDL A+ K +W T K F G +I+
Sbjct: 347 WMFGYQPEIWDYKTLNALVSQVPDNKMIMLDLAADYNKFLWKTPFNWDFYKGFCGKQWIY 406
Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAF 232
++ N G + G LD A G +EA S+N ++G G + EGIE N VVY+L+ + +
Sbjct: 407 SVIPNMGGKSALTGALDFYAKGHLEALNSQNRGKLIGFGFAPEGIENNEVVYELLCDAGW 466
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
+ V+++ W+ Y+ RYG ++ WN + +VY
Sbjct: 467 AKQGVELRPWLRNYTYSRYGCYPIGMEQYWNEMIQSVY---------------------- 504
Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
G ++++ + + K + S D + +Y E++ + L GN+L
Sbjct: 505 -------GSFKSHPRFNWQFRPGKEKYGSVDLDNHFYHAVEIMAGM-LSQMKGNKL---- 552
Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
+ D ++ L L I +AY+ D QL RF L+ MD +L H
Sbjct: 553 -FEADFKEMAANYLGGKVEILVRQIDKAYESQDTINANQLETRFYRLMTGMDLVLQGHPT 611
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
+ W++ A+ + + YE NAR +T+W + DY + W+GL+RDY
Sbjct: 612 KDMQKWIDYARARGVSYNKADCYESNARRIVTVW-------GPPIDDYSARIWAGLIRDY 664
Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 505
Y PR YF SG F W ++++
Sbjct: 665 YLPRWKHYF----NQKRSGKPFDFSTWELDFVE 693
>gi|380512475|ref|ZP_09855882.1| N-acetylglucosaminidase [Xanthomonas sacchari NCPPB 4393]
Length = 785
Score = 201 bits (511), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 149/540 (27%), Positives = 235/540 (43%), Gaps = 83/540 (15%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP A P A+I ++ W TY LD DPLF +I + FI+
Sbjct: 258 VLPAFAGYVPKAFAQAHPQARIYRMRAWEGFHE------TYWLDPADPLFAQIAQRFIQL 311
Query: 68 QLKEYGRTSHIYNCDTFDE------------------------------NTPPVDSPEYI 97
+ YG+ ++ Y D F+E PPV + +
Sbjct: 312 YDRTYGKGTY-YLADAFNEMLPPIAADGSDARLASYGDSTANTAKTKPPEVPPVQRDKRL 370
Query: 98 SSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEV 156
++ G A+Y+ + + DAVW+MQGWLF D FW P + A L VP KL+VLD+ +
Sbjct: 371 AAYGRALYASIHRANPDAVWVMQGWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDR 430
Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
P W S F G +I+ +HN+ G+ +YG +AF + R + +VG G
Sbjct: 431 YPGTWKLSDAFDGKQWIYGYVHNYGGSNPVYG---DLAFYREDLRALLADKDKQQLVGFG 487
Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
EG+ VVY+ M +A+ ++ ++ W++ Y+ RYG + PA++ AW+ L +V +
Sbjct: 488 AFPEGLHTTSVVYEYMYALAWGAQQRPLQDWLDDYTRARYGHTSPALRAAWDDLQASVLS 547
Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
P G Y + +P + + E + D P L
Sbjct: 548 TRYWT---------------PRWWRSRAGAYLLFKRPTLD--IGEFEGAPGDPPRL---- 586
Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 391
RAL+ +A E + + YRYDL+D R + + AY+ D
Sbjct: 587 ---RRALQQLLALAPEYADAPLYRYDLVDFARHYATGRVDVQLQQAVAAYRRGDVAAGDA 643
Query: 392 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 451
+ R E V +D L+ L WL++A A + Y +A+ Q+++W
Sbjct: 644 ATARVREAVTQLDSLVGGQQD-TLSSWLDAAAGYATTPQDAAYYRRDAKAQVSVWGGEGN 702
Query: 452 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREWI 504
L DY +K W G+ DYY PR + + + E+ +G RL+ W R+W+
Sbjct: 703 -----LGDYASKAWQGMYADYYLPRWTLALQMLSEAAVAGGSVDEAQLQQRLRAWERDWV 757
>gi|282881077|ref|ZP_06289764.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella timonensis CRIS
5C-B1]
gi|281304881|gb|EFA96954.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella timonensis CRIS
5C-B1]
Length = 688
Score = 201 bits (511), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 140/485 (28%), Positives = 225/485 (46%), Gaps = 46/485 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V+ F G VP+ L + +P A + G W + +L LF + +A+ +
Sbjct: 201 VVQGFPGIVPSFLHDKYPKACVVSQGKWNGFQRPS------ILLPQSQLFYCMAKAYYDN 254
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
+ YG + D F E N VD +SS + + M S DA W++QGW +
Sbjct: 255 MKRYYGTDLRYFGGDLFHEGGNAKGVD----LSSTASKVQKCMLSHFPDAKWVLQGWNGN 310
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
P ALL + +++++L E+ W S +F P+IW +++F G +M
Sbjct: 311 PSP--------ALLAGLDKKHVLLINLAGEIDASWKQSDEFGQTPWIWGSVNHFGGKTDM 362
Query: 186 YGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
G L + P A S++ + G+G+ EGI NPVVYDL + A+ V +
Sbjct: 363 GGQLPVLVEQPHRALAASQHGRLKGLGILPEGIHTNPVVYDLALQTAWSDTVPSVDHLLR 422
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
QY RYG + AW +L +VY EG Y++
Sbjct: 423 QYIWYRYGTWNDDLYRAWQLLASSVYG---------------------EFEVKGEGTYES 461
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
++ ++ S S++ + Y ++++AL LF + S TY YDL+DL RQ
Sbjct: 462 VF--CARPSLHVSSVSTWGPKKMQYQPEKLLQALVLFRKAAVHFKGSETYEYDLVDLARQ 519
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
+A A ++ ++ AY D+ + + S FL L++ D LL+ + FLLG WL++A+Q
Sbjct: 520 VMANNARNVYNQVVHAYNEKDSLALNRYSSTFLHLIDLQDSLLSTNKFFLLGKWLQAARQ 579
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
+NE+ ++Q NART I+ W + + L DY NK W+GLL+ YY PR +F +
Sbjct: 580 YGENEQDQRQALVNARTLISYW--GPDDATTRLHDYANKEWAGLLKQYYAPRWRAFFAML 637
Query: 485 IESLE 489
L
Sbjct: 638 AGQLR 642
>gi|333031147|ref|ZP_08459208.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
gi|332741744|gb|EGJ72226.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
Length = 721
Score = 200 bits (509), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 140/531 (26%), Positives = 243/531 (45%), Gaps = 73/531 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+ PAF+G VP A P + W DP + Y+L P F EIG+ FIE+
Sbjct: 236 IAPAFAGFVPMAFAEKHPDINFKHM-RWGGF--DPEYNA-YVLPPDSPFFEEIGKLFIEE 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQG 121
E+G ++ Y D+F+E P+D + + G +IY + +G+ +A+W+ QG
Sbjct: 292 WENEFGSNTY-YLSDSFNEMELPIDKDDTEGKYRLLRQYGESIYKSISAGNPEAIWVTQG 350
Query: 122 WLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIW 173
W F Y FW ++ALL++VP K++++DL + + W FYG +I+
Sbjct: 351 WTFGYQHSFWDTTSLQALLSNVPNEKMIIIDLGNDYPKWVWNTEQTWKVQNGFYGKGWIF 410
Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+ NF G M G + A EA S N ++G G + EG+E N V+Y+L+++M +
Sbjct: 411 SYVPNFGGKTTMTGDMQMYATSSAEALASPNKGNLIGFGSAPEGLENNEVIYELLADMGW 470
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
E +++ W+ Y + RYG +Q AW + TVY+ + ++P
Sbjct: 471 TSESINLDEWMQSYCLSRYGGYPENVQKAWELFRKTVYSN-----------LYSYP---- 515
Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
Y E L+ + ++ E + +ELF+++ NEL S
Sbjct: 516 -----------RYTWQTVVEDTLRINKIN--------TSDEFLIGVELFVSAVNELKDSE 556
Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
Y DLI+ + A A++++ + ++ + L + ++++ +D LLA H
Sbjct: 557 LYVNDLIEFSSFYAAAKADKIYKEALILFERGNKKEARSLLNQSIQILLKVDKLLASHPI 616
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
+ L W++ A+ ++ +E NA+ IT W + DY ++WSGL++DY
Sbjct: 617 YRLEEWVKYARNSGSTVAEKDAFEANAKRLITTW-------GGIQDDYAARFWSGLIKDY 669
Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY--PVES 521
Y PR + F S + L+ W W+ + W N + P+E+
Sbjct: 670 YIPRMELNF--------SSERNSLRQWEENWV--STPWNNPTQPFDNPIEA 710
>gi|310287970|ref|YP_003939229.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum S17]
gi|309251907|gb|ADO53655.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum S17]
Length = 1923
Score = 200 bits (508), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 147/501 (29%), Positives = 235/501 (46%), Gaps = 46/501 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGR 62
V+ F G VPA Q P++ G W P TYL DA + F ++G
Sbjct: 403 VIQGFGGQVPADFQEKNPTSVAASSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVGD 461
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F + Q +G+ S+ Y D F E D + I + + M D AVW+MQ W
Sbjct: 462 TFYKAQESVFGKVSNYYAVDPFHEGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQW 520
Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
+ D ++ L + G+ +VLDL ++++ ++ + GVP++W MLHNF G
Sbjct: 521 QWGIDE----TKLSGLADK---GQALVLDLQSDLRS-QASPMENQGVPWVWNMLHNFGGR 572
Query: 183 IEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
+ + G+ + I+ +A S M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W
Sbjct: 573 MGLDGVPEVISQDITKAYNSSGY-MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSW 631
Query: 243 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEG 300
+Y+ RRYG + I+ AW++L T Y DG + +I A P D +I S
Sbjct: 632 TQEYAERRYGGTDGTIEKAWDILLDTAYKHMDGEYYQGASESIINARPS-DNTIGSA--- 687
Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
S++ H + Y + +A LF + + S +RYD +D
Sbjct: 688 -------------------STWGHSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVD 728
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
+ RQ LA E +AY+ D LS R L++++ D LL+ D FL+G W++
Sbjct: 729 VMRQVLANSFQEYQPLAGQAYKSGDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWID 788
Query: 421 SAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
A+ + + +E NAR +T W N + SL+ DY N+ W+GL DYY R
Sbjct: 789 DARTMLDGADDWTADLFELNARALVTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWK 845
Query: 479 IYFKYMIESLESGDGFRLKDW 499
Y + LE G F DW
Sbjct: 846 TYVDNRLNKLEHGTDFTDPDW 866
>gi|421736727|ref|ZP_16175487.1| alpha-N-acetylglucosaminidase, partial [Bifidobacterium bifidum
IPLA 20015]
gi|407295984|gb|EKF15606.1| alpha-N-acetylglucosaminidase, partial [Bifidobacterium bifidum
IPLA 20015]
Length = 1044
Score = 200 bits (508), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 146/501 (29%), Positives = 234/501 (46%), Gaps = 46/501 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGR 62
V+ F G VPA Q P++ G W P TYL DA + F ++
Sbjct: 375 VIQGFGGQVPADFQEKNPTSVAASSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVCD 433
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
F + Q +G+ S+ Y D F E D + I + + M D AVW+MQ W
Sbjct: 434 TFYKAQENVFGKVSNYYAVDPFHEGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQW 492
Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
+ D ++ L + G+ +VLDL ++++ ++ + GVP++W MLHNF G
Sbjct: 493 QWGIDE----TKLSGLADK---GQTLVLDLQSDLRSQ-ASPMENQGVPWVWNMLHNFGGR 544
Query: 183 IEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
+ + G+ + I+ +A S M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W
Sbjct: 545 MGLDGVPEVISQDITKAYNSSGY-MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSW 603
Query: 243 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEG 300
+Y+ RRYG + I+ W++L T Y TDG + +I A P D +I S
Sbjct: 604 TQEYAERRYGGTDGTIEKVWDILLDTAYKHTDGEYYQGASESIINARPS-DNTIGSA--- 659
Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
S++ H + Y + +A LF + + S +RYD +D
Sbjct: 660 -------------------STWGHSDIDYDKRQFEKAAALFEQAYDSYKNSAGFRYDYVD 700
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
+ RQ LA E +AY+ D LS R L++++ D LL+ D FL+G W++
Sbjct: 701 VMRQVLANSFQEYQPLAGQAYKSGDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWID 760
Query: 421 SAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
A+ + + +E NAR +T W N + SL+ DY N+ W+GL DYY R
Sbjct: 761 DARTMLDGADDWTADLFELNARALVTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWK 817
Query: 479 IYFKYMIESLESGDGFRLKDW 499
Y + LE G F DW
Sbjct: 818 TYVDNRLNKLEHGTDFTDPDW 838
>gi|281424178|ref|ZP_06255091.1| N-acetylglucosaminidase [Prevotella oris F0302]
gi|281401447|gb|EFB32278.1| N-acetylglucosaminidase [Prevotella oris F0302]
Length = 723
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 146/513 (28%), Positives = 237/513 (46%), Gaps = 69/513 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+ P F+G VP AL+ ++P+A I Q W + Y+L DPLF +IG FI++
Sbjct: 235 ICPGFAGFVPEALKELYPTADI-QYTTWEKAFHN------YILSPADPLFHKIGVMFIQE 287
Query: 68 QLKEYGRTSHIYNCDTFDENT---PPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQG 121
KE+GR Y D+F+E PP D P E+++ G +Y ++ + A W+MQG
Sbjct: 288 WEKEFGRCD-FYLIDSFNEMDIPFPPKDDPKRYEFMADFGKKVYQCIKEANPSATWVMQG 346
Query: 122 WLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEV-KPIWSTS------KQFYGVPYIW 173
W+F Y P W + AL++ VP K+++LDL + K +W T K F G +I+
Sbjct: 347 WMFGYQPEIWDYKTLNALVSQVPDNKMIMLDLAVDYNKFLWKTPFNWDFYKGFCGKQWIY 406
Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAF 232
++ N G + G LD A G +EA S+N ++G G + EGIE N VVY+L+ + +
Sbjct: 407 SVIPNMGGKSALTGALDFYAKGHLEALNSQNRGKLIGFGFAPEGIENNEVVYELLCDAGW 466
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
+ V+++ W+ Y+ RYG ++ WN + +VY
Sbjct: 467 AKQGVELRPWLRNYTYSRYGCYPIGMEQYWNEMLQSVY---------------------- 504
Query: 293 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 352
G ++++ + + K + S D + +Y E++ + L GN+L
Sbjct: 505 -------GSFKSHPRFNWQFRPGKEKYGSVDLDNHFYHAVEIMAGM-LSQMKGNKL---- 552
Query: 353 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 412
+ D ++ L L I +AY+ D QL RF L+ MD +L H
Sbjct: 553 -FEADFKEMAANYLGGKVEILVRQIDKAYESQDTINANQLETRFYRLMTGMDLVLQGHPT 611
Query: 413 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 472
+ W++ A+ + + YE NAR +T+W + DY + W+GL+RDY
Sbjct: 612 KDMQKWIDYARARGVSYNKADCYESNARRIVTVW-------GPPIDDYSARIWAGLIRDY 664
Query: 473 YGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 505
Y PR YF SG F W ++++
Sbjct: 665 YLPRWKHYF----NQKRSGKPFDFSTWELDFVE 693
>gi|432896403|ref|ZP_20107613.1| hypothetical protein A13U_00343 [Escherichia coli KTE192]
gi|433031274|ref|ZP_20219108.1| hypothetical protein WIA_04388 [Escherichia coli KTE109]
gi|431432398|gb|ELH14169.1| hypothetical protein A13U_00343 [Escherichia coli KTE192]
gi|431538475|gb|ELI14460.1| hypothetical protein WIA_04388 [Escherichia coli KTE109]
Length = 1049
Score = 199 bits (506), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 140/519 (26%), Positives = 238/519 (45%), Gaps = 43/519 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V P F+G VP P A++ G+W P TY+ D F ++ + +
Sbjct: 398 VFPGFAGQVPDTFAAKNPQAQVIDQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQT 455
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+G SH Y D F E D + + + + M D +AVW++Q W
Sbjct: 456 LKTTFGDISHYYAVDPFHEGGNRADLD--MVKVAQTVQNKMLEHDKNAVWIIQNW----- 508
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
+ LN + ++LDL+A+ KP + +F P+IW MLH F G + G
Sbjct: 509 ---QENPTDDFLNDLKKDHALILDLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSG 565
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
+ + +A + +E+ M GVG++ E + NP++Y+++ +MA++ + A+I+ +
Sbjct: 566 MQEVLA-QEIPQSLAESKYMKGVGVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWL 624
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RYG P I+ AW+++ T Y+ D+ R + SII G +G
Sbjct: 625 TSRYGAQSPEIEQAWDIMVKTAYHRR---KDRQR--------AEDSIIDAKPG----FG- 668
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
V++ + YD +E + L L+++ + + Y++DL+D+TRQ LA
Sbjct: 669 -VTRACTYYTALIDYDK-------AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLA 720
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL-- 425
+ E + +A+ D QLS +FL L++ D +L F+LG WL SA+ +
Sbjct: 721 NASYEYYRAFEDAWMAKDYSAFNQLSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTMLD 780
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
++ Q+E+NAR +T W +A LRDY N+ W GL D+Y R A + + +
Sbjct: 781 GMDDWTRDQFEFNARAMVTTWGTEQAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQTLK 839
Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGR-NVYPVESNG 523
+ +G + + W L W N N YP + +G
Sbjct: 840 SAAATGQ--KQDAIKVHWFPLEYRWVNQTGNGYPTQPSG 876
>gi|331660873|ref|ZP_08361805.1| alpha-N-acetylglucosaminidase family protein [Escherichia coli
TA206]
gi|422369309|ref|ZP_16449711.1| f5/8 type C domain protein [Escherichia coli MS 16-3]
gi|315298924|gb|EFU58178.1| f5/8 type C domain protein [Escherichia coli MS 16-3]
gi|331051915|gb|EGI23954.1| alpha-N-acetylglucosaminidase family protein [Escherichia coli
TA206]
Length = 1052
Score = 199 bits (506), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 140/519 (26%), Positives = 238/519 (45%), Gaps = 43/519 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V P F+G VP P A++ G+W P TY+ D F ++ + +
Sbjct: 401 VFPGFAGQVPDTFAAKNPQAQVIDQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQT 458
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+G SH Y D F E D + + + + M D +AVW++Q W
Sbjct: 459 LKTTFGDISHYYAVDPFHEGGNRADLD--MVKVAQTVQNKMLEHDKNAVWIIQNW----- 511
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
+ LN + ++LDL+A+ KP + +F P+IW MLH F G + G
Sbjct: 512 ---QENPTDDFLNGLKKDHALILDLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSG 568
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
+ + +A + +E+ M GVG++ E + NP++Y+++ +MA++ + A+I+ +
Sbjct: 569 MQEVLA-QEIPQSLAESKYMKGVGVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWL 627
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RYG P I+ AW+++ T Y+ D+ R + SII G +G
Sbjct: 628 TSRYGAQSPEIEQAWDIMVKTAYHRR---KDRQR--------AEDSIIDAKPG----FG- 671
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
V++ + YD +E + L L+++ + + Y++DL+D+TRQ LA
Sbjct: 672 -VTRACTYYTALIDYDK-------AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLA 723
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL-- 425
+ E + +A+ D QLS +FL L++ D +L F+LG WL SA+ +
Sbjct: 724 NASYEYYRAFEDAWMAKDYSAFNQLSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTMLD 783
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
++ Q+E+NAR +T W +A LRDY N+ W GL D+Y R A + + +
Sbjct: 784 GMDDWTRDQFEFNARAMVTTWGTEQAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQTLK 842
Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGR-NVYPVESNG 523
+ +G + + W L W N N YP + +G
Sbjct: 843 SAAATGQ--KQDAIKVHWFPLEYRWVNQTGNGYPTQPSG 879
>gi|161505009|ref|YP_001572121.1| hypothetical protein SARI_03139 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:- str. RSK2980]
gi|160866356|gb|ABX22979.1| hypothetical protein SARI_03139 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:-]
Length = 1014
Score = 198 bits (504), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 138/519 (26%), Positives = 240/519 (46%), Gaps = 43/519 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V P F+G VP P A++ G+W P TY+ D F ++ + +
Sbjct: 363 VFPGFAGQVPDTFAAKNPQAQVIDQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQT 420
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+G SH Y D F E D + + + + M D DAVW++Q W
Sbjct: 421 LKTTFGDISHYYAVDPFYEGGNRADL--NMVKVAQTVQNKMLEHDKDAVWIIQNW----- 473
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
+ A LN + ++LDL+A+ KP + +F P+IW MLH F G + G
Sbjct: 474 ---QENPTDAFLNGLKKDHALILDLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSG 530
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
+ + +A + +E+ M GVG++ E + NP++Y+++ +MA++ + A+I+ +
Sbjct: 531 MPEVLA-QEIPQSLAESKYMKGVGVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHNWL 589
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RYG P I+ AW+++ T Y+ D+ R + SII G +G
Sbjct: 590 TSRYGAQSPEIEQAWDIMVKTAYHRR---KDRQR--------AEDSIIDAKPG----FGV 634
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
+ +Y + + Y +E + L L+++ + + Y++DL+D+TRQ LA
Sbjct: 635 ---------TRACTYYNALIDYDKAEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLA 685
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL-- 425
+ E + +A+ D QLS +FL L++ D +L+ F+LG W+ +++ +
Sbjct: 686 NASYEYYRAFEDAWMAQDYSAFNQLSGKFLRLIKLQDKVLSTRPEFMLGNWINNSRTMLD 745
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
++ Q+E+NAR +T W +A LRDY N+ W GL D+Y R A + + +
Sbjct: 746 GMDDWTRDQFEFNARAMVTTWGTEQAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQALK 804
Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGR-NVYPVESNG 523
+ +G + + W L W N N YP + +G
Sbjct: 805 TAAATGQ--KQDAIKVSWFPLEYRWVNQTGNGYPTQPSG 841
>gi|293402122|ref|ZP_06646261.1| alpha-N-acetylglucosaminidase family protein [Erysipelotrichaceae
bacterium 5_2_54FAA]
gi|291304514|gb|EFE45764.1| alpha-N-acetylglucosaminidase family protein [Erysipelotrichaceae
bacterium 5_2_54FAA]
Length = 2295
Score = 198 bits (503), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 138/499 (27%), Positives = 245/499 (49%), Gaps = 57/499 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGR 62
V+ FSG VP P+A IT++ +W + P Y+ ++ + L+ ++ +
Sbjct: 435 VVQGFSGQVPDNFDKKQPTALITEMKDWVGY-TRPSIIQPYITESDAAKGKENLYPQVAK 493
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQ 120
F + Q +G ++ Y D F E P +D E + + M + AVW+MQ
Sbjct: 494 DFYDAQKNVFGNVTNYYATDPFHEGGNPSGLDFAETFKQ----VQTEMLKANEKAVWVMQ 549
Query: 121 GWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP---IWSTSKQFYGVPYIWCMLH 177
W + D L V + + LDL ++ P + S+ P++WCMLH
Sbjct: 550 QWQGNLDA-------TKLSGLVKPSQALALDLQTDLNPQNGVMENSE----TPWLWCMLH 598
Query: 178 NFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 237
NF G + M G L ++A P A +E+ M G+G++ E +E +PV Y+L+ +M + + +
Sbjct: 599 NFGGRMGMDGNLPNVAKNPAIA-MNESKYMKGIGITPEALENSPVAYELLFDMTWTKDPI 657
Query: 238 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 297
D AWI +Y+ RR G + +Q+AW +L T Y GA + ++ +II+
Sbjct: 658 DEDAWIAKYAQRRAGGTSEKLQEAWKILNETAY----GAKQE------SYQGAAETIINA 707
Query: 298 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 357
T +++ + S++ H ++ Y E +AL+L I + ++ AS YRYD
Sbjct: 708 T-----------PRDSFRSA--STWGHSNITYDKKEFEKALQLLIDNYDDFKASPAYRYD 754
Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
L D+ Q L A E +++A ++A + S++FLE+++ D +L + F++G
Sbjct: 755 LADVANQVLCNVAIEYHSLMVKAKNESNADDFRKYSKKFLEIIDLSDEILGSSEEFMVGN 814
Query: 418 WLESAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASL--LRDYGNKYWSGLLRDYY 473
W+ A+ + + + + +E+NAR +T W + E +SL L DY N+ W+GL +D+Y
Sbjct: 815 WINDARNMMSDGDDWTKDLFEFNARAMVTTW---SGERSSLNNLNDYSNRKWNGLTKDFY 871
Query: 474 GPRAAIYFKYMIESLESGD 492
G R ++ + L+ D
Sbjct: 872 GKRWKVWIENRQAELDGKD 890
>gi|62318937|dbj|BAD94027.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
Length = 182
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 94/181 (51%), Positives = 128/181 (70%), Gaps = 1/181 (0%)
Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
++DLTRQ L+K AN+++ + A+ D + QLS +FLEL++DMD LLA D LLG
Sbjct: 1 MVDLTRQVLSKLANQVYTEAVTAFVKKDIGSLGQLSEKFLELIKDMDVLLASDDNCLLGT 60
Query: 418 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 477
WLESAK+LA+N ++ KQYEWNARTQ+TMW+D+ S L DY NK+WSGLL DYY PRA
Sbjct: 61 WLESAKKLAKNGDERKQYEWNARTQVTMWYDSNDVNQSKLHDYANKFWSGLLEDYYLPRA 120
Query: 478 AIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRN-VYPVESNGDALITSQWLYNKY 536
+YF M++SL F+++ WRREWI +++ WQ + VYPV++ GDAL S+ L +KY
Sbjct: 121 RLYFNEMLKSLRDKKIFKVEKWRREWIMMSHKWQQSSSEVYPVKAKGDALAISRHLLSKY 180
Query: 537 L 537
Sbjct: 181 F 181
>gi|373451393|ref|ZP_09543318.1| hypothetical protein HMPREF0984_00360, partial [Eubacterium sp.
3_1_31]
gi|371968665|gb|EHO86120.1| hypothetical protein HMPREF0984_00360, partial [Eubacterium sp.
3_1_31]
Length = 2190
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 139/499 (27%), Positives = 246/499 (49%), Gaps = 57/499 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLD-----ATDPLFIEIGR 62
V+ FSG VP P+A IT++ +W + P Y+ + + L+ ++ +
Sbjct: 427 VVQGFSGQVPDNFDKKQPTALITEMKDWVGY-TRPSIIQPYITENDAAKGKENLYPQVAK 485
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQ 120
F + Q +G ++ Y D F E P +D E + + M + AVW+MQ
Sbjct: 486 DFYDAQKNVFGNVTNYYATDPFHEGGNPSGLDFAETFKQ----VQTEMLKANEKAVWVMQ 541
Query: 121 GWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKP---IWSTSKQFYGVPYIWCMLH 177
W + D ++ LL + + LDL ++ P + S+ P++WCMLH
Sbjct: 542 QWQGNLDA----TKLSGLLKP---SQALALDLQTDLNPQNGVMENSE----TPWLWCMLH 590
Query: 178 NFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 237
NF G + M G L ++A P A +E+ M G+G++ E +E +PV Y+L+ +M + + +
Sbjct: 591 NFGGRMGMDGNLPNVAKNPAIA-MNESKYMKGIGITPEALENSPVAYELLFDMTWTKDPI 649
Query: 238 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 297
D AWI +Y+ RR G + +Q+AW +L T Y GA + ++ +II+
Sbjct: 650 DEDAWIAKYAQRRAGGTSEKLQEAWKILNETAY----GAKQE------SYQGAAETIINA 699
Query: 298 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 357
T +++ + S++ H ++ Y E +AL+L I + ++ AS YRYD
Sbjct: 700 T-----------PRDSFRSA--STWGHSNITYDKKEFEKALQLLIDNYDDFKASPAYRYD 746
Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
L D+ Q L A E +++A ++A + S++FLE+++ D +L + F++G
Sbjct: 747 LADVADQVLCNVAIEYHSLMVKAKNESNADDFRKYSKKFLEIIDLSDEILGSSEEFMVGN 806
Query: 418 WLESAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASL--LRDYGNKYWSGLLRDYY 473
W+ A+ + + + + +E+NAR +T W + E +SL L DY N+ W+GL +D+Y
Sbjct: 807 WINDARNMMSDGDDWTKDLFEFNARAMVTTW---SGERSSLNNLNDYSNRKWNGLTKDFY 863
Query: 474 GPRAAIYFKYMIESLESGD 492
G R I+ + L+ D
Sbjct: 864 GKRWKIWIENRQAELDGKD 882
>gi|345014586|ref|YP_004816940.1| alpha-N-acetylglucosaminidase [Streptomyces violaceusniger Tu 4113]
gi|344040935|gb|AEM86660.1| alpha-N-acetylglucosaminidase [Streptomyces violaceusniger Tu 4113]
Length = 1044
Score = 196 bits (498), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 142/526 (26%), Positives = 232/526 (44%), Gaps = 45/526 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V P + G VP + P A+ G+W ++ P W LD F ++ AF
Sbjct: 253 VFPGYFGTVPDGFADRNPEARTVPQGDWNGLRR-PDW-----LDPRTESFRKVAAAFYRH 306
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + +G + ++ D E P D P + A+ + +++ A+W++ GW +
Sbjct: 307 QRELFGE-AGLFKMDLLHEGGDPGDVP--VPDAARAVETALRTARPGAIWVILGW--QEN 361
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
P + LL++V +++V+D +++ + K + VPY + + NF G +
Sbjct: 362 P------RRDLLDAVDHDRMLVVDGLSDLDTVTDREKDWGAVPYAFGTIPNFGGRTTIGA 415
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
R + +VG E E++P ++L SE+A++ E VD W Y+
Sbjct: 416 KTHMWTKRFTVWRDKPGSKLVGTAYMPEAAERDPAAFELFSELAWREEAVDRAEWFRSYA 475
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RYG ++A+ L T Y + + V A P S+T NY
Sbjct: 476 EMRYGGRDAKAREAFAALRDTAYEISSKDGRPHDSVFAARP-------SLTARSGTNYA- 527
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
+ T ++D +V A L + +G L S+ YR+DL D+ RQALA
Sbjct: 528 ---------THTPAFDPAGF-----DVAFAALLGVRAG--LRDSDAYRHDLTDIARQALA 571
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
+ +L + +AY D L+R +L+L+ D + H FLLGPWLE AK++A
Sbjct: 572 NRSWQLIPQLQDAYDRKDRTAFRTLARLWLKLMRLSDDMTGAHRRFLLGPWLEDAKRMAS 631
Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
+E+ + E ART IT W D + L +Y N+ WSGL+ D++ P+ Y + ++
Sbjct: 632 GDEESARLERAARTLITTWADRATADGGKLANYANRDWSGLIADFHLPQWQSYLDELEDA 691
Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLY 533
L R D W + W R YPV DA T+Q +Y
Sbjct: 692 LAENRPPRAFD----WFAVEEPWTRERTSYPVRPTTDAHRTAQRVY 733
>gi|308480701|ref|XP_003102557.1| hypothetical protein CRE_04113 [Caenorhabditis remanei]
gi|308261289|gb|EFP05242.1| hypothetical protein CRE_04113 [Caenorhabditis remanei]
Length = 718
Score = 195 bits (495), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 128/489 (26%), Positives = 227/489 (46%), Gaps = 53/489 (10%)
Query: 8 VLPAFSGNVPAALQN---VFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAF 64
+LP F+G VP L+ +FP++K +L W + S+ C + DPLF +IG F
Sbjct: 247 ILPTFAGFVPDQLEKDFRLFPTSKFNRLPCWNNFTSET--SCLLSVSPFDPLFQKIGSTF 304
Query: 65 IEQQLKEYG-RTSHIYNCDTFDENTPPVDSPEYISSL----GAAIYSGMQSGDSDAVWLM 119
+ Q K G +++Y+ D F+E P DS ++ +S +I + + D + +W++
Sbjct: 305 LRHQKKMLGGDITNLYSADPFNE-ILPSDSSKFDASFMKQTAQSIMNSCRKVDKNCIWVL 363
Query: 120 QGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNF 179
Q W F+YD W +K+ L++VP+G L++LDL++EV P W + F+G ++WC+LHNF
Sbjct: 364 QSWSFTYDQ-WPNWAIKSFLSAVPIGNLLILDLYSEVVPAWQMTSSFHGHNFVWCLLHNF 422
Query: 180 AGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDV 239
G+ E+ G L + G A + +VG G+SME I+QN VVY M + + E + +
Sbjct: 423 GGSRELRGNLQKVDKGYQLALMKAGSNLVGAGLSMEAIDQNYVVYQFMIDRMWSQEPIPL 482
Query: 240 KAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 299
W+ YS RY W +L + Y+ + + V +
Sbjct: 483 NNWLKSYSESRYSADFKVSHKFWTILAGSFYSQPEKWGNPRFSVFL-------------- 528
Query: 300 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIAS-GNELSASNTYRYDL 358
Y +P + + W+ E L+ + S + L ++ DL
Sbjct: 529 -----YHRPAFAKKI-----------EYWFPVEETFNHLQSLMPSLMHVLGDHPLFKEDL 572
Query: 359 IDLTRQALA-KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
D+ R + + NE L++ EA+ + D + +++ + ++ +
Sbjct: 573 NDVMRAVIQFEIGNEAALSLTEAFLMEDKQQIGASCENLMDMFQKLES----YSNRDFKE 628
Query: 418 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 477
W+E +K +A E+ + + A +T+W Q DY ++ W+GLL YYG R
Sbjct: 629 WIEDSKSIAPTSEERQVFPVTASDILTVWGPTGQN-----LDYAHREWAGLLSGYYGRRW 683
Query: 478 AIYFKYMIE 486
+ +++E
Sbjct: 684 QYFCDWILE 692
>gi|326440885|ref|ZP_08215619.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
27064]
Length = 1038
Score = 194 bits (494), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 143/527 (27%), Positives = 229/527 (43%), Gaps = 54/527 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V+P + G VP P A++ G W + P W LD P+F EI A+
Sbjct: 246 VVPGYFGTVPDGFVARNPGARVIPQGVWNGLPR-PDW-----LDPRTPVFAEIAAAYYRH 299
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + +G H + D E P D P + A+ + +++ A W++ GW +
Sbjct: 300 QEELFGEIDH-FKMDLLHEGGTPGDVP--VPDAARAVETALRAARPAATWVILGWQSNPR 356
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
P ALL+++ K++++D +++ + ++ G PY + + NF G +
Sbjct: 357 P--------ALLDAIDTSKVLIVDGLSDLDTVRDREAEWGGAPYAFGTIPNFGGRTTIGA 408
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
D R N+ +VG E +++P +L +E+A++ EK+D AW Y+
Sbjct: 409 NTDRWTEKFTAWRDKPNSALVGTAYMPEAADRDPAALELFTELAWRREKIDRSAWFAGYA 468
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RYG PA ++A+ L T Y T T R + F
Sbjct: 469 QFRYGAKDPAAEEAFAALAGTAYQLT---TTDGRPIDSLF-------------------- 505
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
+ + ++ S +++D + ++R E EL S+ YRYDL DL RQALA
Sbjct: 506 -LRRPSMSSSVATAFDQAAFDRGFAALLRVNE-------ELRGSDAYRYDLTDLARQALA 557
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
+ L L + AY D ++ +L L+ D + CH FLLGPWLE AK+ A
Sbjct: 558 LRSRTLQLALRAAYATKDVTAFRGVAALWLRLMRLADTVAGCHKAFLLGPWLEEAKRFAT 617
Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
+ E+ + E AR IT W D + A L +Y N+ W GL+ D + P+ YF + +
Sbjct: 618 STEEAVELERTARVLITTWGD--RAAAVELSNYANRDWQGLIGDVHVPQWEQYFTEVATA 675
Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYN 534
L G + D W W R YPV GD +Q +++
Sbjct: 676 LAEGRAPKAID----WYPGEETWTKDRRPYPVRPTGDVHKVAQRVHD 718
>gi|294812279|ref|ZP_06770922.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
27064]
gi|294324878|gb|EFG06521.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
27064]
Length = 1086
Score = 194 bits (494), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 143/527 (27%), Positives = 229/527 (43%), Gaps = 54/527 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V+P + G VP P A++ G W + P W LD P+F EI A+
Sbjct: 294 VVPGYFGTVPDGFVARNPGARVIPQGVWNGLPR-PDW-----LDPRTPVFAEIAAAYYRH 347
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + +G H + D E P D P + A+ + +++ A W++ GW +
Sbjct: 348 QEELFGEIDH-FKMDLLHEGGTPGDVP--VPDAARAVETALRAARPAATWVILGWQSNPR 404
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
P ALL+++ K++++D +++ + ++ G PY + + NF G +
Sbjct: 405 P--------ALLDAIDTSKVLIVDGLSDLDTVRDREAEWGGAPYAFGTIPNFGGRTTIGA 456
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
D R N+ +VG E +++P +L +E+A++ EK+D AW Y+
Sbjct: 457 NTDRWTEKFTAWRDKPNSALVGTAYMPEAADRDPAALELFTELAWRREKIDRSAWFAGYA 516
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RYG PA ++A+ L T Y T T R + F
Sbjct: 517 QFRYGAKDPAAEEAFAALAGTAYQLT---TTDGRPIDSLF-------------------- 553
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
+ + ++ S +++D + ++R E EL S+ YRYDL DL RQALA
Sbjct: 554 -LRRPSMSSSVATAFDQAAFDRGFAALLRVNE-------ELRGSDAYRYDLTDLARQALA 605
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
+ L L + AY D ++ +L L+ D + CH FLLGPWLE AK+ A
Sbjct: 606 LRSRTLQLALRAAYATKDVTAFRGVAALWLRLMRLADTVAGCHKAFLLGPWLEEAKRFAT 665
Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
+ E+ + E AR IT W D + A L +Y N+ W GL+ D + P+ YF + +
Sbjct: 666 STEEAVELERTARVLITTWGD--RAAAVELSNYANRDWQGLIGDVHVPQWEQYFTEVATA 723
Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYN 534
L G + D W W R YPV GD +Q +++
Sbjct: 724 LAEGRAPKAID----WYPGEETWTKDRRPYPVRPTGDVHKVAQRVHD 766
>gi|408676293|ref|YP_006876120.1| Alpha-N-acetylglucosaminidase [Streptomyces venezuelae ATCC 10712]
gi|328880622|emb|CCA53861.1| Alpha-N-acetylglucosaminidase [Streptomyces venezuelae ATCC 10712]
Length = 855
Score = 192 bits (487), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 138/519 (26%), Positives = 220/519 (42%), Gaps = 48/519 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP + G VP P A G W P W LD T P+F + +
Sbjct: 364 VLPGYFGTVPPDFTARNPGAHTVPQGRWVGF-GRPDW-----LDPTGPVFARLAAVYYRH 417
Query: 68 QLKEYGRTSHIYNCDTFDENTPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + +G S ++ D E P VD +S+ A+ +++ A W+M GW +
Sbjct: 418 QRQRFG-DSDMFKMDLLHEGGAPGTVD----VSAAAGAVQRALEAARPGATWVMLGWQLN 472
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
P ALL+ V +L+++D ++ ++ G PY + + NF G+ +
Sbjct: 473 PTP--------ALLHGVDRRRLLIVDGLSDRYDELDRETRWGGTPYAFGTIPNFGGHTSI 524
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
+ ++ + G+ E NPV + L +E+A+Q +D + W
Sbjct: 525 GANTGAWVSRFHAWLAKPDSALRGIAYLPEATGTNPVAFGLFTELAWQPGPIDQQRWFAG 584
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
Y+ RRYG + AW L Y+ G+ + +D + A PS+ + T ++
Sbjct: 585 YAARRYGGADRHAAAAWEALRLGPYSMRTGSWSEPQDSLFA---ARPSLTASTAARW--- 638
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
S +A+ Y + V RAL + L S+ YR+D++D+ RQA
Sbjct: 639 ----SPKAMR-------------YDAATVERALAELLRVAPRLRTSDAYRFDVVDVARQA 681
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
L A L I AY+ D L R + E + L+ FL+GPWL +A+
Sbjct: 682 LTNRARVLLPRIRAAYEARDLDAFRALVREWGAAEELLGRLVGSDRRFLVGPWLAAARSW 741
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
+ + + E++AR+ +T W D E+ L DY N+ WSGL+RD Y PR A YF +
Sbjct: 742 GADPAERDRLEYDARSILTTWADRVPSESGGLHDYANREWSGLVRDVYAPRWAAYFASLD 801
Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD 524
+L +G D W + W G YP +GD
Sbjct: 802 RALVNGTEPVAID----WFARDDAWARGHRSYPTLPSGD 836
>gi|197302378|ref|ZP_03167435.1| hypothetical protein RUMLAC_01107 [Ruminococcus lactaris ATCC 29176]
gi|197298557|gb|EDY33100.1| F5/8 type C domain protein [Ruminococcus lactaris ATCC 29176]
Length = 1655
Score = 192 bits (487), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 152/543 (27%), Positives = 244/543 (44%), Gaps = 75/543 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL +SG VP + + PSA++ + G W S + +L F + F +
Sbjct: 717 VLQGYSGMVPVDITSKDPSAEVIKQGTWCSFQRPS------MLRTDSESFTKYAALFYKV 770
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + YG ++H Y D F E NT +DS + + + + M + D A W++Q W
Sbjct: 771 QKEVYGDSAHYYATDPFHEGGNTGGMDS----AVISQKVLASMMTADPHATWVIQSW--- 823
Query: 126 YDPFWRPPQMKALLNSVPLGK--LVVLDLFAEVKPIWSTSK-----------QFYGVPYI 172
+ ALL + + +VLDL+AE P W+ + +F P++
Sbjct: 824 -----QGNPTTALLQGLGDNRDHALVLDLYAEKTPHWNETNPGYYGGAEGGGEFLNTPWV 878
Query: 173 WCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+CML+NF G + ++G +D+ G V A + + M G+G++ E NPV+YDL E +
Sbjct: 879 YCMLNNFGGRLGLHGHIDNYVEGIVNA-SKQAEHMAGIGITPEASVNNPVLYDLFFETIW 937
Query: 233 QH-----EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKN--RDVIV 285
+K+++ W Y RRYG + A +L+ TVYN + V+
Sbjct: 938 ADDGNNLQKINLDEWFKNYVTRRYGADSDSAYQAMEILHDTVYNPAYNMKGQGAPESVVN 997
Query: 286 AFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASG 345
A P +D G +G V + Y ++ +A EL +A
Sbjct: 998 ARPGLDI-------GAASTWGNAV-----------------VDYDKKKLEKAAELLLADY 1033
Query: 346 NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDG 405
++L S Y+YDL ++ Q L+ A E + A++ DA LS +FL +++ ++
Sbjct: 1034 DKLKNSAGYQYDLANVLEQVLSNTAQEYQKKMAAAFRSGDAEEFSTLSDKFLSIIDMVEK 1093
Query: 406 LLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNK 463
+ FL+G W+ AK+LA+N + ++ YE NAR+ IT W Q + L DY N+
Sbjct: 1094 VTGTQKEFLVGTWINGAKKLAKNSDDFTKELYELNARSLITTWGSYDQAISGGLIDYSNR 1153
Query: 464 YWSGLLRDYYGPRAAIYFKYMIESL--ESGDGFRLKDW-RREWIKLTNDWQNGRNVYPVE 520
W+GL DYY R + + L ES + +DW EW W G N Y
Sbjct: 1154 QWAGLTNDYYKMRWEKWITERKKELAGESYTNYSAQDWFEMEWA-----WARGTNKYSGT 1208
Query: 521 SNG 523
NG
Sbjct: 1209 PNG 1211
>gi|433678127|ref|ZP_20510026.1| alpha-N-acetylglucosaminidase [Xanthomonas translucens pv.
translucens DSM 18974]
gi|430816763|emb|CCP40478.1| alpha-N-acetylglucosaminidase [Xanthomonas translucens pv.
translucens DSM 18974]
Length = 691
Score = 192 bits (487), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 154/558 (27%), Positives = 235/558 (42%), Gaps = 85/558 (15%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF G VP A P A+I ++ W TY LD DPLF +I FI+
Sbjct: 164 VLPAFVGYVPKAFAQAHPQARIYRMRAWEGFHE------TYWLDPADPLFAKIALRFIQL 217
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDS----------------------PEY--------I 97
+ YG+ ++ Y D F+E PP+ + PE +
Sbjct: 218 YDRTYGKGTY-YLADAFNEMLPPIAADGSDARLASYGDSTANTAKTAPPEVSPAQRDKRL 276
Query: 98 SSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEV 156
+ G A+Y + + DAVW+MQGWLF D FW P + A L VP KL+VLD+ +
Sbjct: 277 ADYGRALYESIHRANPDAVWVMQGWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDR 336
Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
P W S F G +I+ +HN+ G+ +YG +AF + R + +VG G
Sbjct: 337 YPGTWKLSDAFDGKQWIYGYVHNYGGSNPVYG---DLAFYRDDLRALLADKDKQQLVGFG 393
Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
EG+ N VVY+ M +A+ ++ ++ W+ Y RYG + PA++ AW+ L V +
Sbjct: 394 AFPEGLHDNSVVYEYMYALAWGGQQRSLQDWLGDYIRARYGHTSPALRAAWDDLQAAVLS 453
Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
P G Y + +P + + E + D P L
Sbjct: 454 TRYWT---------------PRWWRSRAGAYLLFKRPTLD--IGEFEGAPGDPPRL---- 492
Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 391
RAL+ +A E + + YRYDL+D R + + AY+ D
Sbjct: 493 ---RRALDQLLALAPEYADAPLYRYDLVDFARHYATGRVDAQLQQAVAAYRRGDVAAGDA 549
Query: 392 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 451
R V+ +DGL+ L WL A+ A+ + Y +A+ Q+++W
Sbjct: 550 AFARVQAAVQQLDGLVGGQQE-TLSSWLGDAEGDAKTPQDAAYYRRDAKAQVSVWGGEGN 608
Query: 452 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF--------RLKDWRREW 503
L DY +K W G+ DYY PR A+ + + + G G RL+ W +W
Sbjct: 609 -----LGDYASKAWQGMYADYYLPRWALAMQ-ALRAAAVGSGSVDEAALQQRLRAWELDW 662
Query: 504 IKLTNDWQNGRNVYPVES 521
+K + PV +
Sbjct: 663 VKRETPYTRQAPADPVAA 680
>gi|424795356|ref|ZP_18221218.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
gi|422795515|gb|EKU24196.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
Length = 1105
Score = 191 bits (485), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 146/508 (28%), Positives = 223/508 (43%), Gaps = 76/508 (14%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP A P A+I ++ W TY LD DPLF +I + FI+
Sbjct: 201 VLPAFAGYVPKAFAQAHPQARIYRMRAWEGFHE------TYWLDPADPLFAKIAQRFIQL 254
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDS----------------------PEY--------I 97
+ YG+ ++ Y D F+E PP+ + PE +
Sbjct: 255 YDRTYGKGTY-YLADAFNEMLPPIAADGSDARLASYGDSTANTANTAPPEVSPAQRDKRL 313
Query: 98 SSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEV 156
+ G A+Y + + DAVW+MQGWLF D FW P + A L VP KL+VLD+ +
Sbjct: 314 ADYGRALYESIHRANPDAVWVMQGWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDR 373
Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
P W S F G +I+ +HN+ G+ +YG +AF + R + +VG G
Sbjct: 374 YPGTWKLSDAFDGKQWIYGYVHNYGGSNPVYG---DLAFYRDDLRALLADKDKQQLVGFG 430
Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
EG+ N VVY+ M +A+ ++ ++ W+ Y+ RYG S PA++ AW+ L +V +
Sbjct: 431 AFPEGLHTNSVVYEYMYALAWGGQQRSLQDWLGDYTRARYGHSSPALRAAWDDLQASVLS 490
Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
P G Y + +P + + E + D P L
Sbjct: 491 TR---------------YWTPRWWRSRAGAYLLFKRPTLD--IGEFEGAPGDPPRL---- 529
Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 391
RAL+ +A E + + YRYDL+D R + + AY+ D
Sbjct: 530 ---RRALDQLLALAPEYADAPLYRYDLVDFARHYATGRVDTQLQQALAAYKRGDVAAGDA 586
Query: 392 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 451
R V +DGL+ L WL++A+ A+ + Y +A+ Q+++W
Sbjct: 587 AFARVQAAVRQLDGLVGGQQE-TLSSWLDAAEGDAKTPQDAAYYRRDAKAQVSVWGGEGN 645
Query: 452 EEASLLRDYGNKYWSGLLRDYYGPRAAI 479
L DY +K W G+ DYY PR A+
Sbjct: 646 -----LGDYASKAWQGMYADYYLPRWAL 668
>gi|423219557|ref|ZP_17206053.1| hypothetical protein HMPREF1061_02826 [Bacteroides caccae
CL03T12C61]
gi|392624762|gb|EIY18840.1| hypothetical protein HMPREF1061_02826 [Bacteroides caccae
CL03T12C61]
Length = 715
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 141/503 (28%), Positives = 219/503 (43%), Gaps = 48/503 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
++P F G VP+ L+N A I G W + +LD DP F + F ++
Sbjct: 232 LMPGFYGMVPSNLKNK-SKAHIIPQGTWGAFTRPD------ILDPMDPEFDRVAAIFYDE 284
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+ YG ++ D F E + G AI MQ ++W++QGW +
Sbjct: 285 TRRLYGSDIRFFSGDPFHEGG--ATDGVALGDAGRAIQKTMQKHFPGSIWVLQGWQDNPK 342
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
P LL + ++V +LF E W T K + G P+IW + NF + G
Sbjct: 343 P--------GLLEKLDKRYVLVQELFGENTNNWETRKGYEGTPFIWATVTNFGERPGING 394
Query: 188 ILDSIAFGPVEARTSENTT-MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
L A A SE M GVG+ EGI NPV Y+L+ E+ + ++VDV WI Y
Sbjct: 395 KLQRFADEVYRASNSEYAKYMKGVGILPEGINNNPVTYELLLELVWHKDRVDVDQWIESY 454
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATD-KNRDVIVAFPDVDPSIISVTEGKYQNY 305
RYGR I+ AW ++ ++Y+ G + +++ A P ++ +S + Y
Sbjct: 455 VTARYGRITDEIRTAWKMMLKSIYSSEVGYQEGPPENILCARPALELKSVSSWGRLAKKY 514
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
+ + K+A LF + E + TYR DLI RQ
Sbjct: 515 DRDLYKKAAF------------------------LFAKAMPEFNEVRTYRIDLIHFLRQV 550
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
+A A+ +F ++I AYQ Q +FL +++ + LLA F L W + AK
Sbjct: 551 IANEADSVFYDMITAYQEKKVEKFEQEVSKFLMMIDTENELLAQDPFFRLSTWQQQAKDA 610
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
++K N IT W ++ E + L DY K W+G++ YY R +YF Y+
Sbjct: 611 GNTAAEKKNNFHNLMMLITYWGEHVTSEDN-LHDYAYKEWAGMMNTYYKERWLVYFDYL- 668
Query: 486 ESLESGDGFRLKD---WRREWIK 505
+L G+ + D W REW++
Sbjct: 669 RALLRGEEAKAPDYFHWEREWVE 691
>gi|153806010|ref|ZP_01958678.1| hypothetical protein BACCAC_00255 [Bacteroides caccae ATCC 43185]
gi|149130687|gb|EDM21893.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
43185]
Length = 715
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 141/503 (28%), Positives = 219/503 (43%), Gaps = 48/503 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
++P F G VP+ L+N A I G W + +LD DP F + F ++
Sbjct: 232 LMPGFYGMVPSNLKNK-SKAHIIPQGTWGAFTRPD------ILDPMDPEFDRVAAIFYDE 284
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
+ YG ++ D F E + G AI MQ ++W++QGW +
Sbjct: 285 TRRLYGSDIRFFSGDPFHEGG--ATDGVALGDAGRAIQKTMQKHFPGSIWVLQGWQDNPK 342
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
P LL + ++V +LF E W T K + G P+IW + NF + G
Sbjct: 343 P--------GLLEKLDKRYVLVQELFGENTNNWETRKGYEGTPFIWATVTNFGERPGING 394
Query: 188 ILDSIAFGPVEARTSENTT-MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
L A A SE M GVG+ EGI NPV Y+L+ E+ + ++VDV WI Y
Sbjct: 395 KLQRFADEVYRASNSEYAKYMKGVGILPEGINNNPVTYELLLELVWHKDRVDVDQWIESY 454
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATD-KNRDVIVAFPDVDPSIISVTEGKYQNY 305
RYGR I+ AW ++ ++Y+ G + +++ A P ++ +S + Y
Sbjct: 455 VTARYGRITDEIRTAWKMMLKSIYSSEVGYQEGPPENILCARPALELKSVSSWGRLAKKY 514
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
+ + K+A LF + E + TYR DLI RQ
Sbjct: 515 DRDLYKKAAF------------------------LFAKAMPEFNEVRTYRIDLIHFLRQV 550
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
+A A+ +F ++I AYQ Q +FL +++ + LLA F L W + AK
Sbjct: 551 IANEADSVFYDMITAYQEKKVEKFEQEVSKFLMMIDTENELLAQDPFFRLSTWQQQAKDA 610
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
++K N IT W ++ E + L DY K W+G++ YY R +YF Y+
Sbjct: 611 GNTAAEKKNNFHNLMMLITYWGEHVTSEDN-LHDYAYKEWAGMMNTYYKERWLVYFDYL- 668
Query: 486 ESLESGDGFRLKD---WRREWIK 505
+L G+ + D W REW++
Sbjct: 669 RALLRGEEAKAPDYFHWEREWVE 691
>gi|374990497|ref|YP_004965992.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
gi|297161149|gb|ADI10861.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
Length = 1001
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 139/526 (26%), Positives = 224/526 (42%), Gaps = 45/526 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V P + G VP + P A+ G W + + P W LD F ++ AF
Sbjct: 211 VYPGYFGTVPDGFVDRNPGARTVPQGTWNGL-ARPDW-----LDPRTESFGQVAAAFYRH 264
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + +G ++ D E D P ++ A+ + +Q+ A W++ GW +
Sbjct: 265 QQELFGECD-LFKMDLLHEGGAAGDVP--VADAARAVETALQTARPGATWVILGWQAN-- 319
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
+ LL++V ++V+D +++ I + + VPY + + NF G +
Sbjct: 320 ------PRRELLDAVNHDHMLVVDGLSDLDSIGDREQDWGSVPYAFGTIPNFGGRTTIGA 373
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
A + R + +VG E + ++P ++L SE+A+++ VD W Y+
Sbjct: 374 KTHIWARRFTQWRDKPGSKLVGTAYMAEAVGRDPAAFELFSELAWRNTAVDRDEWFRTYA 433
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
R G +DA+ L T Y T + V A PDV T NY
Sbjct: 434 DVRLGGRDERARDAYAALRDTAYQITSSDGRPHDSVFSARPDV-------TARSGTNYAT 486
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
+ + ++ AL + L S+ YR+DL D+ RQALA
Sbjct: 487 RIPA-----------------FDLADFDPALAALLDVRPSLRDSDAYRHDLTDIARQALA 529
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
+ L ++ +AY+ D L+R +L+L+ D + H GFLLGPWLE AK+LA
Sbjct: 530 DRSWTLIPHLHDAYERKDLEAFRTLARLWLKLMRLSDDMTGAHRGFLLGPWLEDAKRLAS 589
Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
+E + E ART IT W D + L +Y N+ W+GL+ D++ P+ Y + ++
Sbjct: 590 DEAEAAHLEHLARTLITTWADRVTADTGKLANYANRDWNGLIGDFHLPQWQSYLDELEDA 649
Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLY 533
L G R D W + W R YPV DA T + +Y
Sbjct: 650 LAEGREPRDFD----WFAVEEPWTRERKSYPVRPTTDAHRTGRRVY 691
>gi|404403947|ref|ZP_10995531.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
Length = 828
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 152/516 (29%), Positives = 241/516 (46%), Gaps = 79/516 (15%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V F+G VP A++ + P +T+ W K+ ++L DPLF EIG AF+
Sbjct: 221 VYQGFAGFVPPAMKRIHPETTLTET-KWSGFKN-------WMLSPLDPLFSEIGTAFVRA 272
Query: 68 QLKEYGRTSHIYNCDTFDE-NTP--PVDSPEYISSL---GAAIYSGMQSGDSDAVWLMQG 121
+E+G+ + Y D+F+E + P P SPE ++L G IY + + DAVW+MQG
Sbjct: 273 WEEEFGKGKY-YLIDSFNEMDVPFGPKGSPERAATLRHYGETIYRSLAEANPDAVWVMQG 331
Query: 122 WLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQ------FYGVPYIW 173
W+F Y W P ++ALL P G++++LDL + IW + K F+G +I+
Sbjct: 332 WMFGYQRNSWDPESVRALLEGAPDGRMMILDLAVDFNNFIWRSEKSWNHLQGFFGREWIY 391
Query: 174 CMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+ NF G + G L+ A G +EA +S N + G G S EG+E N +VY++++ +
Sbjct: 392 STVPNFGGRTALIGNLEFYANGHLEALSSPNRGRLTGYGTSPEGVESNEIVYEIIAAAGW 451
Query: 233 QHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN-CTDGATDKNRDVIVAFPDVD 291
+++D+K +++ YS RYG I W+ + + YN CT+ A
Sbjct: 452 SDDRIDLKKFLHDYSAARYGGCPEGIDRFWSGMLQSSYNECTNNA--------------- 496
Query: 292 PSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS 351
+Y+ +P S +Y A+E F+A EL +
Sbjct: 497 ---------RYRWQLRPYSHRMPTMGINENY------------YTAIEQFLACAGELGGN 535
Query: 352 NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG----VFQLSRRFLELVEDMDGLL 407
YR D I LA A+ L +EA D +G + + R EL+ D D LL
Sbjct: 536 ELYRTDAIQYAALYLASKADML----LEAANWADLYGAREEAYDCAMRIEELLLDADRLL 591
Query: 408 ACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSG 467
A H L W A++ EE+++++ +R I++W + L DY + WSG
Sbjct: 592 ASHPLLRLDRWSGMARKAGCTEEEKERFVGESRRLISVWGGPS------LSDYSARVWSG 645
Query: 468 LLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 503
++RDYY PR Y +E+ G F + W +W
Sbjct: 646 VIRDYYVPRLNKY----LEAKTDGTVFDFRTWDEQW 677
>gi|291302495|ref|YP_003513773.1| alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
44728]
gi|290571715|gb|ADD44680.1| Alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
44728]
Length = 696
Score = 189 bits (479), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 149/537 (27%), Positives = 241/537 (44%), Gaps = 61/537 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP F+G VP + + +A I Q G WF P W T T + E+ F +
Sbjct: 214 VLPGFAGLVPGDIGD---TAAIPQ-GQWFGFDR-PAWLPT-----TTRAYAEVAEVFYAK 263
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + G T D E + VD + + AA M+ D +W++Q W
Sbjct: 264 QTERLGAT-RAQAVDLLHEGGTSGGVDLADATRGIAAA----MERAHDDYLWVLQAW--- 315
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
W P + L + L L W +K ++G P+ L NF G +
Sbjct: 316 ----WDNPLPEVLAAT----DSDHLLLLDLTGEGWRKTKGWHGKPWARGSLTNFGGRTVL 367
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
+G L IA P + +++VG + E + NPVV+ L ++ ++ +D+ AW+ +
Sbjct: 368 FGGLPEIAELPSLKDDPKASSLVGTALVEEAWQVNPVVWSLFTQTSWADGDIDLNAWVPE 427
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
Y RYG++ P AW+ L T Y DG ++ A P +D
Sbjct: 428 YVAARYGKAHPRAVRAWHGLLATAYRSMDGRPGGAESLLCAMPSLD-------------- 473
Query: 306 GKPVSKEAVLKSETSSYDHPH-LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
++ +S + PH L Y + A +A+ L ++T+R+DL+D+TRQ
Sbjct: 474 -----------ADRASMNGPHSLPYPAEALEVAWRDLLAAREALGGADTFRFDLVDVTRQ 522
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
++ A L + AY + + LS F++L E +D +LA + FL+G WL A+
Sbjct: 523 VISNRARPLLPLLRTAYAMKELDRFIALSHSFIDLFELLDPVLATREEFLVGRWLADARA 582
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
LA +E++ E++ART IT W D+ + A+L+ DY N W+GL+ DYY PR Y K +
Sbjct: 583 LAADEDEADALEFDARTIITTWGDSPESSATLI-DYANHEWAGLIADYYRPRWEKYLKSL 641
Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK--YLQG 539
L G D+ + W + YP E +GDA+ + + +++ Y +G
Sbjct: 642 ETELREGKPAEPIDFYAD----AAAWARSHDTYPTEPSGDAVSSCRAVHHALPYFEG 694
>gi|375146756|ref|YP_005009197.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Niastella
koreensis GR20-10]
gi|361060802|gb|AEV99793.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Niastella
koreensis GR20-10]
Length = 1147
Score = 188 bits (477), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 140/515 (27%), Positives = 219/515 (42%), Gaps = 47/515 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V+P F G VP N A++ GNW + + +LD TD F + F E+
Sbjct: 228 VMPGFYGMVPHNF-NTKSKARVITQGNWGA------FIRPAILDPTDTAFDRVAGIFYEE 280
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
K YGR ++ D F E + + + GA I MQ A+W++QGW
Sbjct: 281 TKKLYGRNIRFFSGDPFHEGG--ITNGVNLGKAGANIQKAMQQYFPGAIWVLQGW----- 333
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
+ K LL L++ +LF E W T + G P+IWC ++NF + G
Sbjct: 334 ---QDNPKKELLAETDKSALLIQELFGENTNNWETRNGYEGTPFIWCCVNNFGERPGLNG 390
Query: 188 ILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
L+ A A T M GVG+ EGI NP YDL+ E+ + ++ V+ WIN Y
Sbjct: 391 KLERYAGEVYRAATGPFREYMKGVGIMPEGINNNPASYDLVLELGWHNQPVETGKWINDY 450
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
RYG++ I AW + T+Y+ +++ A P
Sbjct: 451 VKARYGKANDQIATAWTLFLQTIYSNPGYQEGPPENILCARP------------------ 492
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
A+ SS+ Y T+ + ++ F A+ S TY+ DLI+ TRQ L
Sbjct: 493 ------ALQVKSVSSWGKLKKGYDTALFEKGVQAFAAAAPLFGNSETYKIDLINFTRQVL 546
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
+ A+ +F +++ AY+ + + FL L + LL H + L + + A +
Sbjct: 547 SNRADTVFASLVTAYKEENTVAFNAAAEAFLSLHALTNELLNSHSYYRLTSYQQQALRSG 606
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
+ K NA IT W +N ++E L +Y K W G++ +Y R +YF Y+
Sbjct: 607 NTPIERKNNLHNAMMLITYWGENNRQE-DYLHEYAYKEWGGMMTTFYQQRWKLYFDYLRN 665
Query: 487 SLESGDGFRLKD---WRREWIKLTNDWQNGRNVYP 518
+L +G D W REW+ ++ YP
Sbjct: 666 NL-AGKSVTPPDFFAWEREWVTQNEQVKSEVQPYP 699
>gi|229818803|ref|YP_002880329.1| alpha-N-acetylglucosaminidase [Beutenbergia cavernae DSM 12333]
gi|229564716|gb|ACQ78567.1| Alpha-N-acetylglucosaminidase [Beutenbergia cavernae DSM 12333]
Length = 751
Score = 188 bits (477), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 162/528 (30%), Positives = 252/528 (47%), Gaps = 37/528 (7%)
Query: 7 AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
AVLPAF G+VP L A+ G FS T LL D F + F
Sbjct: 230 AVLPAFGGHVPDGLGA---GARTHWQG--FS---------TALLGPDDDAFAVVAAEFAR 275
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
QQ + +G T H+Y D F E+ PP PE +++ AA Y+GM + D +A W+MQ W F Y
Sbjct: 276 QQRELFG-TDHLYAADPFIESVPPSGEPEDLAAFAAATYAGMSAADPEATWVMQAWPFHY 334
Query: 127 D-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
FW ++ A+ ++VP +L++LDL+AE P+W + ++WC +HNF G +
Sbjct: 335 HRRFWTAERIAAVTDAVPRDRLLLLDLWAEHAPVWDDGRGIAEHQWLWCAVHNFGGRFSV 394
Query: 186 YGILDSIA--FGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 243
+G L +A G V + GVGM+ME +E NPV Y+L++++ + E+ DV AW+
Sbjct: 395 HGDLHGLARDLGGVLDDGARTGGFTGVGMAMEALENNPVFYELLTDLVW--ERPDVDAWV 452
Query: 244 NQYSVRRYG-----RSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD--VDPSIIS 296
++ +RYG + A+ AW +L T+Y G T ++A P V P
Sbjct: 453 GRFVDQRYGFADGTAARDAVHGAWAILLRTLYGP--GMTRSIPSPVIARPADVVAPFHTQ 510
Query: 297 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRY 356
G++ + PV A + +E P + E+ RA L + A +
Sbjct: 511 RLAGEFLDPDAPVIVSANIDAEAD----PRVEGDLPEIARAAALLREAAGSSDAGGPLAH 566
Query: 357 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 416
DL DL +A+ I+ A + DA V + D+D + A LLG
Sbjct: 567 DLADLLTHVVAQRTRAPIRAIVAAARAGDADAVRANGALLAAAIADLDAVAATQPDRLLG 626
Query: 417 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
WL +A++ A ++ + + +AR Q+T+W E+ S L DY ++WSGLL +Y PR
Sbjct: 627 TWLAAAQRWADDDGERRVLLRDARRQLTVW----GEQTSGLHDYSGRHWSGLLGGFYAPR 682
Query: 477 AAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD 524
++ ++ E+ ESG ++ RR + L W P + GD
Sbjct: 683 WQLWVDWLAEAAESGSEPDPQELRRAVVALEESWVARDETGPTDPAGD 730
>gi|401885538|gb|EJT49648.1| alpha-N-acetylglucosaminidase, putative [Trichosporon asahii var.
asahii CBS 2479]
Length = 781
Score = 188 bits (477), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 138/539 (25%), Positives = 241/539 (44%), Gaps = 42/539 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP F G VP L N W S ++ + +D + + AF+ +
Sbjct: 274 VLPGFCGFVPPELHNYIGGPDFKTYPTWMSFPAE--YTKVRAIDPEWDTWNVVQSAFLRK 331
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YG TS Y D F E+ P P Y+ + A+ + + +A W+MQGW+F D
Sbjct: 332 QKELYGFTSDYYMVDLFTESKPTSTDPTYLKGIATAVRESIHAVAPNATWIMQGWIFVND 391
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P W KA L+ L+VLDL AE P W K F+G ++WC L N+ N +Y
Sbjct: 392 PKSWTETASKAFLDGAG-ESLLVLDLAAESYPQWKRLKNFFGRRWLWCTLINYGQNDGLY 450
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G LD ++A+ + + G+G+ EGI N +++L ++ + + +D+K W +
Sbjct: 451 GALDKWNHDIMDAK-ANGGRLSGMGIVPEGINNNEHLFELATDQGWSSQAIDLKQWTQNW 509
Query: 247 SVRRY-GRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
RRY G+++ Q AW +L ++VY + A ++ D+ P++
Sbjct: 510 VKRRYRGQNLDLAQKAWELLDNSVYKSNNTALKCTTRSLI---DLRPAV----------- 555
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
+ L T +Y + Y +V+ AL+ + S + + + YDL+D+ RQ
Sbjct: 556 -------SGLIGTTGNYLATAITYEPRDVVAALDNLLQSWSG-AGGQQFDYDLVDVARQV 607
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
A ++ +I A+ ++ + R + L+ D+D L+A F L W+ A+
Sbjct: 608 FVNAAIPIYQAMINAWNGSNKADTEKYGRELVGLINDIDRLMATSRHFRLESWVGDARNW 667
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
AQ+ + E+ AR Q+ +W T L R Y K+W G++ + Y + ++ ++
Sbjct: 668 AQDAGAKDDMEFQARNQLILWGPATFAPWPLDR-YAAKHWHGIMSEVYAKGWELLYQNLL 726
Query: 486 ESLESGDGFRLKDWRREWI------KLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
++ K W + K+ W+N ++ GD++ + L KY Q
Sbjct: 727 KT-------EPKAWNKTAFASELMEKVEKPWENVKSGGVQGPQGDSVAVIRELREKYKQ 778
>gi|406693970|gb|EKC97309.1| alpha-N-acetylglucosaminidase, putative [Trichosporon asahii var.
asahii CBS 8904]
Length = 781
Score = 187 bits (476), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 138/539 (25%), Positives = 241/539 (44%), Gaps = 42/539 (7%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP F G VP L N W S ++ + +D + + AF+ +
Sbjct: 274 VLPGFCGFVPPELHNYIGGPDFKTYPTWMSFPAE--YTKVRAIDPEWDTWNVVQSAFLRK 331
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YG TS Y D F E+ P P Y+ + A+ + + +A W+MQGW+F D
Sbjct: 332 QKELYGFTSDYYMVDLFTESKPTSTDPTYLKGIATAVRESIHAVAPNATWIMQGWIFVND 391
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P W KA L+ L+VLDL AE P W K F+G ++WC L N+ N +Y
Sbjct: 392 PKSWTETASKAFLDGAG-ESLLVLDLAAESYPQWKRLKNFFGRRWLWCTLINYGQNDGLY 450
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G LD ++A+ + + G+G+ EGI N +++L ++ + + +D+K W +
Sbjct: 451 GALDKWNHDIMDAK-ANGGRLSGMGIVPEGINNNEHLFELATDQGWSSQAIDLKQWTQNW 509
Query: 247 SVRRY-GRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
RRY G+++ Q AW +L ++VY + A ++ D+ P++
Sbjct: 510 VKRRYRGQNLDLAQKAWELLDNSVYKSNNTALKCTTRSLI---DLRPAV----------- 555
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
+ L T +Y + Y +V+ AL+ + S + + + YDL+D+ RQ
Sbjct: 556 -------SGLIGTTGNYLATAITYEPRDVVAALDNLLQSWSG-AGGQQFDYDLVDVARQV 607
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
A ++ +I A+ ++ + R + L+ D+D L+A F L W+ A+
Sbjct: 608 FVNAAIPIYQAMINAWNGSNKADTEKYGRELVGLINDIDRLMATSRHFRLESWVGDARNW 667
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
AQ+ + E+ AR Q+ +W T L R Y K+W G++ + Y + ++ ++
Sbjct: 668 AQDAGAKDDMEFQARNQLILWGAATFAPWPLDR-YAAKHWHGIMSEVYAKGWELLYQNLL 726
Query: 486 ESLESGDGFRLKDWRREWI------KLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
++ K W + K+ W+N ++ GD++ + L KY Q
Sbjct: 727 KT-------EPKAWNKTAFASELMEKVEKPWENVKSGGVQGPQGDSVAVIRELREKYKQ 778
>gi|345881765|ref|ZP_08833275.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
gi|343918424|gb|EGV29187.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
Length = 1552
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 135/488 (27%), Positives = 233/488 (47%), Gaps = 45/488 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V+ F G VP+ + FP+A++ G W P LL + LF ++ +A+ E
Sbjct: 233 VVQGFPGLVPSFFKERFPAAQLVLQGRWGHFNRPP-----MLLPSDKDLFQQVAKAYYES 287
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
++ YGR D F E NT VD +++ + S A W++QGW +
Sbjct: 288 LIRCYGRDFKFLGGDLFHEGGNTKGVDVAATAAAVQQTMLRYFPS----AKWVLQGWNNN 343
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
P LL+ + +++++L E+ W +S +F G P++W +++F G +M
Sbjct: 344 PSP--------TLLSKLDKQHVLLINLSGEIAASWESSNEFGGTPWLWGSVNHFGGKTDM 395
Query: 186 YGILDSIAFGPVEART-SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
G L + P A + ++N M G+G+ EGI NPVVYDL + A+ D+ +
Sbjct: 396 GGQLPVLVAEPHRAFSQTKNGVMQGIGILPEGINSNPVVYDLALKTAWYTTTPDLDRLLR 455
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
Y RYG ++ AW++L H+VY EG +++
Sbjct: 456 DYIAYRYGHVDESLVQAWHILSHSVYG---------------------EFKIKGEGTFES 494
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
++ + + S++ + Y+ ++ +AL LF ++ S TY+YDL+DL RQ
Sbjct: 495 IF--CARPGLHVTSVSTWGPKQMQYNPKDLEKALGLFRRVADQYKGSATYQYDLVDLARQ 552
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
+A +A +++ ++AY+ DA + + + F+ L++ D LL FLLG WL A
Sbjct: 553 VMANHARDVYAAAMQAYRNKDAALLHEKGQEFMHLLQLQDRLLQTDTHFLLGNWLAQAAN 612
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
++Q NA+ IT W ++ A+ + DY NK W+GLL+ YY PR +F +
Sbjct: 613 YGVTAADKQQALHNAKMLITYWGPDS--AATRVHDYANKEWAGLLKSYYEPRWQKFFYAL 670
Query: 485 IESLESGD 492
+S+ +G+
Sbjct: 671 YQSVNTGE 678
>gi|317501265|ref|ZP_07959469.1| hypothetical protein HMPREF1026_01412 [Lachnospiraceae bacterium
8_1_57FAA]
gi|316897332|gb|EFV19399.1| hypothetical protein HMPREF1026_01412 [Lachnospiraceae bacterium
8_1_57FAA]
Length = 1847
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 138/497 (27%), Positives = 230/497 (46%), Gaps = 72/497 (14%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL +SG VP + + +A++ + G W S + +L T F + + F +
Sbjct: 695 VLQGYSGMVPTNIHDYDKNAEVIEQGEWCSFQR------PTMLKTTSSTFEKYAKKFYQC 748
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YG S+ Y D F E + S + + + M + D DAVW++Q W
Sbjct: 749 QKEVYGDVSNYYATDPFHEG--GITGGMNASDISEKVLTEMITADKDAVWIIQSW----- 801
Query: 128 PFWRPPQMKALLNSVP-----LGKLVVLDLFAEVKP----------IWSTSKQFYGVPYI 172
+ ALLN + ++LDL+AE P + ++F P++
Sbjct: 802 ---QGNPTTALLNGLDRVEKGTDHALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWL 858
Query: 173 WCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+CML+NF G + ++G LD++A + +E + G+G++ E NPV+YD + E +
Sbjct: 859 FCMLNNFGGRLGLHGHLDNLA-NNIPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIW 917
Query: 233 QH------EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNR 281
Q E +D+ W++ Y+ RRYG + AW++L TVY + GA +
Sbjct: 918 QDDASQKMEVIDLDTWLDDYATRRYGAESESANQAWDILKETVYKASLNGLGQGAPES-- 975
Query: 282 DVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELF 341
V+ A P++ T G +G V + Y ++ A L
Sbjct: 976 -VVNARPNL-------TIGAASTWGNAV-----------------ISYEKGDLEEAAALL 1010
Query: 342 IASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVE 401
+A ++L S Y+YDL ++ +Q L+ A E + A+ D S +F+ ++E
Sbjct: 1011 LADYDKLKDSAGYQYDLANVLQQVLSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIE 1070
Query: 402 DMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRD 459
DM+ + + FLLG W+E AK LA N + ++ YE+NA+ +T W Q E L+D
Sbjct: 1071 DMEKVTGTSEYFLLGRWVEQAKALANNADDFTKELYEFNAKALVTTWGSKNQAEKGGLKD 1130
Query: 460 YGNKYWSGLLRDYYGPR 476
Y N+ WSGL+ D+Y R
Sbjct: 1131 YSNRQWSGLIGDFYKAR 1147
>gi|373461651|ref|ZP_09553390.1| hypothetical protein HMPREF9944_01654 [Prevotella maculosa OT 289]
gi|371951955|gb|EHO69797.1| hypothetical protein HMPREF9944_01654 [Prevotella maculosa OT 289]
Length = 713
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 137/472 (29%), Positives = 207/472 (43%), Gaps = 53/472 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNW-FSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
VL F G V ++++ +P+A + Q G W F + D +L T+ LF EI +
Sbjct: 231 VLQGFYGMVSRSIRDRYPNAVMPQ-GMWGFFERPD-------ILKPTEKLFDEIADTYYR 282
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
+ K YG H + D F E ++ G A+ MQ + W++QGW +
Sbjct: 283 EIKKHYGTGFHYFGGDLFHEGGQ--TGTLNVADCGLAVQQAMQRNFPGSTWVLQGWSGNP 340
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
+P LL + K++V+DLF E W+ +K + G P++WC++ NF MY
Sbjct: 341 NPL--------LLTKLDREKVLVVDLFGENDEAWNRTKAYQGTPFLWCIVSNFGEQCGMY 392
Query: 187 GILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
G L IA + R S+ + GVG+ EGI NPVVYD++ K++V+AW+
Sbjct: 393 GKLQRIALQIDKVRKSDYKAYLKGVGIMPEGINNNPVVYDMVLHAPLTDRKINVEAWLKS 452
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
Y RYG I AW + T+Y SV E Y
Sbjct: 453 YITYRYGSYNADIYAAWLIFLQTIY------------------------ASVPE----KY 484
Query: 306 GKPVS----KEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
G P S + V ++TSS+ +Y + LF+ + S TY YD+ DL
Sbjct: 485 GLPESVFCARPGVKVTQTSSWGVRARYYDMDFFKEGVRLFLKAKTSFEDSETYAYDMFDL 544
Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
RQ + N ++ ++I A + + Q S RFL + D LLA GF L WL
Sbjct: 545 LRQVQSDKGNRVYDDMIAAIDAKNPNRFEQTSDRFLHELLRQDTLLAQSKGFTLERWLGQ 604
Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 473
A + + NA+ Q+T W + + + DY K W+G+LR Y
Sbjct: 605 ASRFGKTVYDRDLALKNAKMQLTFWGPDWNPTTT-VHDYAAKEWAGMLRTLY 655
>gi|153814573|ref|ZP_01967241.1| hypothetical protein RUMTOR_00787 [Ruminococcus torques ATCC 27756]
gi|331089988|ref|ZP_08338878.1| hypothetical protein HMPREF1025_02461 [Lachnospiraceae bacterium
3_1_46FAA]
gi|145848067|gb|EDK24985.1| F5/8 type C domain protein [Ruminococcus torques ATCC 27756]
gi|330402902|gb|EGG82468.1| hypothetical protein HMPREF1025_02461 [Lachnospiraceae bacterium
3_1_46FAA]
Length = 1863
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 138/497 (27%), Positives = 230/497 (46%), Gaps = 72/497 (14%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL +SG VP + + +A++ + G W S + +L T F + + F +
Sbjct: 711 VLQGYSGMVPTNIHDYDKNAEVIEQGEWCSFQR------PTMLKTTSSTFEKYAKKFYQC 764
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YG S+ Y D F E + S + + + M + D DAVW++Q W
Sbjct: 765 QKEVYGDVSNYYATDPFHEG--GITGGMNASDISEKVLTEMITADKDAVWIIQSW----- 817
Query: 128 PFWRPPQMKALLNSVP-----LGKLVVLDLFAEVKP----------IWSTSKQFYGVPYI 172
+ ALLN + ++LDL+AE P + ++F P++
Sbjct: 818 ---QGNPTTALLNGLDRVEKGTDHALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWL 874
Query: 173 WCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+CML+NF G + ++G LD++A + +E + G+G++ E NPV+YD + E +
Sbjct: 875 FCMLNNFGGRLGLHGHLDNLA-NNIPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIW 933
Query: 233 QH------EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNR 281
Q E +D+ W++ Y+ RRYG + AW++L TVY + GA +
Sbjct: 934 QDDASQKMEVIDLDTWLDDYATRRYGAESESANQAWDILKETVYKASLNGLGQGAPES-- 991
Query: 282 DVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELF 341
V+ A P++ T G +G V + Y ++ A L
Sbjct: 992 -VVNARPNL-------TIGAASTWGNAV-----------------ISYEKGDLEEAAALL 1026
Query: 342 IASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVE 401
+A ++L S Y+YDL ++ +Q L+ A E + A+ D S +F+ ++E
Sbjct: 1027 LADYDKLKDSAGYQYDLANVLQQVLSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIE 1086
Query: 402 DMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRD 459
DM+ + + FLLG W+E AK LA N + ++ YE+NA+ +T W Q E L+D
Sbjct: 1087 DMEKVTGTSEYFLLGRWVEQAKALANNADDFTKELYEFNAKALVTTWGSKNQAEKGGLKD 1146
Query: 460 YGNKYWSGLLRDYYGPR 476
Y N+ WSGL+ D+Y R
Sbjct: 1147 YSNRQWSGLIGDFYKAR 1163
>gi|336439030|ref|ZP_08618649.1| hypothetical protein HMPREF0990_01043 [Lachnospiraceae bacterium
1_1_57FAA]
gi|336017072|gb|EGN46842.1| hypothetical protein HMPREF0990_01043 [Lachnospiraceae bacterium
1_1_57FAA]
Length = 1863
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 138/497 (27%), Positives = 230/497 (46%), Gaps = 72/497 (14%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL +SG VP + + +A++ + G W S + +L T F + + F +
Sbjct: 711 VLQGYSGMVPTNIHDYDKNAEVIEQGEWCSFQR------PTMLKTTSSTFEKYAKKFYQC 764
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YG S+ Y D F E + S + + + M + D DAVW++Q W
Sbjct: 765 QKEVYGDVSNYYATDPFHEG--GITGGMNASDISEKVLTEMITADKDAVWIIQSW----- 817
Query: 128 PFWRPPQMKALLNSVP-----LGKLVVLDLFAEVKP----------IWSTSKQFYGVPYI 172
+ ALLN + ++LDL+AE P + ++F P++
Sbjct: 818 ---QGNPTTALLNGLDRVEKGTDHALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWL 874
Query: 173 WCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF 232
+CML+NF G + ++G LD++A + +E + G+G++ E NPV+YD + E +
Sbjct: 875 FCMLNNFGGRLGLHGHLDNLA-NNIPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIW 933
Query: 233 QH------EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNR 281
Q E +D+ W++ Y+ RRYG + AW++L TVY + GA +
Sbjct: 934 QDDASQKMEVIDLDTWLDDYATRRYGAESESANQAWDILKETVYKASLNGLGQGAPES-- 991
Query: 282 DVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELF 341
V+ A P++ T G +G V + Y ++ A L
Sbjct: 992 -VVNARPNL-------TIGAASTWGNAV-----------------ISYEKGDLEEAAALL 1026
Query: 342 IASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVE 401
+A ++L S Y+YDL ++ +Q L+ A E + A+ D S +F+ ++E
Sbjct: 1027 LADYDKLKDSAGYQYDLANVLQQVLSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIE 1086
Query: 402 DMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRD 459
DM+ + + FLLG W+E AK LA N + ++ YE+NA+ +T W Q E L+D
Sbjct: 1087 DMEKVTGTSEYFLLGRWVEQAKALANNADDFTKELYEFNAKALVTTWGSKNQAEKGGLKD 1146
Query: 460 YGNKYWSGLLRDYYGPR 476
Y N+ WSGL+ D+Y R
Sbjct: 1147 YSNRQWSGLIGDFYKAR 1163
>gi|403512485|ref|YP_006644123.1| alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain protein
[Nocardiopsis alba ATCC BAA-2165]
gi|402798758|gb|AFR06168.1| alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain protein
[Nocardiopsis alba ATCC BAA-2165]
Length = 718
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 133/477 (27%), Positives = 214/477 (44%), Gaps = 48/477 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP F+G+VP +L ++ Q T++L TDPL+ + +E
Sbjct: 197 VLPGFTGHVPPSLAPGRTGSRTWQG------------LVTHVLVPTDPLYTTLCAEIVET 244
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q KE T H Y D F E P P + + A G+ D AVW +Q W FSY
Sbjct: 245 Q-KELFDTDHQYAIDPFIEMIPVDSDPGFPGLVARATIEGLTRADPRAVWFLQTWPFSYQ 303
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI--- 183
FW P +++A L+++P L +LDL+AE P WS F G P+ WC L NF G
Sbjct: 304 SDFWSPERVEAFLDAIPDDHLHLLDLWAEYDPQWSRFHAFGGTPWTWCALLNFGGRTDPM 363
Query: 184 -EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
++ G D I A E G+G+SME NP ++L+ + A+ + W
Sbjct: 364 ADLQGAADRIGAAKDSAHPPE-----GIGLSMEATRNNPAFFELVVDQAWTRTGRVEEEW 418
Query: 243 INQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
+ + +RYG PA+ + W L TV + + FP+ ++++
Sbjct: 419 LPDFVAQRYGPGHDPALLEGWRGLLRTVLGASG---------VRIFPEQFNGVLTL---- 465
Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASG--NELSASNTYRYDLI 359
+ + + + + L++E ++ +WY +++ A E +A + L+ +DL+
Sbjct: 466 -RPHYRHLEDSSALRAEVTAL----VWYPWPDLLAAWERLVAGAETDPLAVEGPLGHDLV 520
Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 419
D+ L++ A+ +L ++E + L RFLE+ +D+D LL + W
Sbjct: 521 DVAMAVLSRVADHRYLEMVEHLDHHPELPEGDLE-RFLEVFDDLDALLETRPEYRYRTWE 579
Query: 420 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 476
A A E + NAR +T+W T + L DY + WSGL+ YY PR
Sbjct: 580 AKATSWATGTEDHRVLTDNARRILTVW---TTLDDPRLDDYAGRLWSGLVGGYYRPR 633
>gi|315500594|ref|YP_004089396.1| Alpha-N-acetylglucosaminidase [Asticcacaulis excentricus CB 48]
gi|315418606|gb|ADU15245.1| Alpha-N-acetylglucosaminidase [Asticcacaulis excentricus CB 48]
Length = 765
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 144/538 (26%), Positives = 221/538 (41%), Gaps = 76/538 (14%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF G VP A P A+I + W TY LD DPLF +I FI
Sbjct: 239 ILPAFGGYVPKAFAQKHPQARIYPMRPWEGFHE------TYWLDPADPLFAKIAARFIAL 292
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVD-----------------------------SPEYIS 98
+ YG + Y D+F+E PP+ E ++
Sbjct: 293 YTETYGEGRY-YLADSFNEMLPPISHDGSDVKNAKYGDSTANTKETETVVDPAVKAERLA 351
Query: 99 SLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVK 157
+ G AIY ++ DAVW MQGWLF D FW P + A L VP KL++LD+ +
Sbjct: 352 AYGKAIYDSIRQARPDAVWTMQGWLFGADKHFWTPDAIGAFLRDVPQDKLMILDIGNDRY 411
Query: 158 P-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGIL----DSIAFGPVEARTSENTTMVGVGM 212
P +W +S F G P+I+ +HN+ + +YG L D I T + + G G+
Sbjct: 412 PGVWQSSNAFQGKPWIYGYVHNYGASNPVYGDLGFYRDDIRGLLARKDTGD---LKGFGL 468
Query: 213 SMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNC 272
EG+ N +VY+ ++A+ V W+ Y RYG+ PA+ AW+ ++
Sbjct: 469 FPEGLHNNSIVYEYAYDLAWGQANQTVTEWLTTYLKSRYGQVTPALILAWSTYVEAAFST 528
Query: 273 TDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTS 332
+ P G Y +P + + HP
Sbjct: 529 RYWS---------------PRWWRSKAGAYLLCKRPTADMVEFEG------HPG---DRK 564
Query: 333 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 392
++ RA++ + S S YR+D+ID R +++ ++ + ++AY+ D L
Sbjct: 565 KLRRAIDALL-SLKGFGGSALYRHDVIDAVRHLVSEEIDDRLIAAMKAYKSGDVKTGDGL 623
Query: 393 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 452
+ LV +D L+ L W++ A E++ Y NA+ Q+T+W
Sbjct: 624 REEVIALVTQVDTLMGAQPD-TLASWIDEASAYGDTSEEKAYYVMNAKAQVTVWGGKGN- 681
Query: 453 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 510
L DY +K W GL +D+Y PR + S G F K + RE I W
Sbjct: 682 ----LNDYASKAWQGLYKDFYLPRWMKLLAALRASASGGAPFDQKTFTRELIDWEQAW 735
>gi|302526099|ref|ZP_07278441.1| alpha-N-acetylglucosaminidase [Streptomyces sp. AA4]
gi|302434994|gb|EFL06810.1| alpha-N-acetylglucosaminidase [Streptomyces sp. AA4]
Length = 860
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 134/524 (25%), Positives = 223/524 (42%), Gaps = 52/524 (9%)
Query: 8 VLPAFSGNVPAALQN------VFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIG 61
VLP + G VP + A++ G W P W LD + ++
Sbjct: 237 VLPGYFGTVPRGFADKSKKADASSDARVIGQGTWVGFDR-PDW-----LDPRTSSYRKVA 290
Query: 62 RAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 121
AF + Q +G TS +Y D E D P + + + +Q+ A W++ G
Sbjct: 291 AAFYQAQHDLFGDTS-MYKMDLLHEGGKSGDVP--VGDAARGVMTALQTARPGATWVLLG 347
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
W + PP +A++++V KL V+D ++ Q+ PY + ++NF G
Sbjct: 348 WQNN------PP--RAIVDAVDKSKLFVVDGLSDRYGQRDPDSQWNNTPYAFGTIYNFGG 399
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
+ + + RT + + + G+ EG NP ++L +E+A++ + A
Sbjct: 400 HTTIGANTGVWTQRFPQWRTKQGSALTGIAYLPEGTGTNPAAFELFTELAWRQTPIHQAA 459
Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNC-TDGATDKNRDVIVAFPDVDPSIISVTEG 300
W Y+ RRYG AW++L T Y+ G ++ + A P++D
Sbjct: 460 WFADYASRRYGGPDTRAATAWDLLRQTAYSMPASGWSEAQDSLYAARPNLD--------- 510
Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
+ +++ L Y + +AL+ + L ++ YR+DL+D
Sbjct: 511 ---------------AATAATWSPASLRYQQATFGKALDELLNVDPALRGTDAYRFDLVD 555
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
+ RQAL + L I AY D L+ R++ + +D LLA FLLGPWLE
Sbjct: 556 VARQALTNTSRTLLPQIKTAYTNRDRTQFTTLTSRWMSNMTLLDKLLATDSRFLLGPWLE 615
Query: 421 SAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 480
+AK A + ++ + E++AR+ IT W + L DY N+ WSGL+ D+Y R Y
Sbjct: 616 AAKSWAGTDTEQARLEYDARSLITTWGPRAGSDDGRLHDYANREWSGLVSDFYAKRWKQY 675
Query: 481 FKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD 524
F + ++ +G D W + W RN YP GD
Sbjct: 676 FDSLNTAMNTGGQPASID----WFAAEDGWAKQRNPYPTTPAGD 715
>gi|282877909|ref|ZP_06286718.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
35310]
gi|281299910|gb|EFA92270.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
35310]
Length = 717
Score = 186 bits (471), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 140/518 (27%), Positives = 230/518 (44%), Gaps = 71/518 (13%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L + PAF+G VP A P L W Y+L F ++G+
Sbjct: 227 LGMHPIAPAFAGFVPKAFAKKHPEINFKHL-RWGGFADS---LNAYVLPPESSYFKQLGK 282
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPE------YISSLGAAIYSGMQSGDSDAV 116
FIE+ +E+G ++ Y D+F+E PV+ + ++ G AIY + +G+ A+
Sbjct: 283 LFIEEWEREFGENTY-YLSDSFNEMKLPVNPNDEEEKCRLLAEYGKAIYQSINAGNPHAI 341
Query: 117 WLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAE-------VKPIWSTSKQFYG 168
W+ QGW F Y FW + ALL+ VP +++++DL + + W FYG
Sbjct: 342 WVTQGWTFGYQHDFWNRKSLSALLSQVPNDRMIIIDLGNDYPKWVWHTEQTWKRHNGFYG 401
Query: 169 VPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLM 227
+I+ + NF G + G L+ A A ++ N +VG+G + EG+E N VVY+L+
Sbjct: 402 KQWIFSYVPNFGGKTLLTGDLEMYATDASLALSAANKGNLVGIGSAPEGLENNEVVYELL 461
Query: 228 SEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAF 287
S+ A+ + +++ WI Y + RYG+ ++ AWN +VY+ + ++
Sbjct: 462 SDAAWTDKGINLDEWIANYCMARYGKYPDKMKAAWNGFRKSVYSS-----------LYSY 510
Query: 288 PDVD-PSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGN 346
P ++I T K S +D ++ +A+E F++ +
Sbjct: 511 PRFTWQTVIPDTRRK------------------SRHDLNETYF------KAVEDFLSCAD 546
Query: 347 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGL 406
EL + Y+ D I Q L A+ + N + LN + + +EL+ D +
Sbjct: 547 ELGGAKFYQDDAILFAAQYLGAKADIYYENALRYGSLNKHVEANKQLSKAIELLLFADKI 606
Query: 407 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 466
LA H L W+ A+ +++ QYE NA+ IT W + + DY + WS
Sbjct: 607 LASHPTDRLDVWIAKARSQGHTPQEKNQYEANAKRLITTWGGHQE-------DYAARCWS 659
Query: 467 GLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 504
GL++DYY PR IYF S L W WI
Sbjct: 660 GLIKDYYIPRIQIYF--------SNQRKMLDQWEENWI 689
>gi|331092442|ref|ZP_08341267.1| hypothetical protein HMPREF9477_01910 [Lachnospiraceae bacterium
2_1_46FAA]
gi|330401285|gb|EGG80874.1| hypothetical protein HMPREF9477_01910 [Lachnospiraceae bacterium
2_1_46FAA]
Length = 1598
Score = 185 bits (469), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 140/533 (26%), Positives = 245/533 (45%), Gaps = 70/533 (13%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL +SG VP ++ SA++ + G W S + +L F + + F +
Sbjct: 717 VLQGYSGMVPTNIREKDSSAEVIEQGTWCSFRRPD------MLKTDSASFDKYAKLFYQA 770
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVD-SPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
Q + YG ++H Y D F E +P I+ + M D D +W++Q W
Sbjct: 771 QKEVYGESAHYYATDPFHEGGDTGGLNPTVIA---GKVLDAMLEADKDGIWIIQSW---- 823
Query: 127 DPFWRPPQMKALLNSVPLGK--LVVLDLFAEVKPIWSTSK-------QFYGVPYIWCMLH 177
+ ALL + K +VLDL+AE P W+ + +F P+++CML+
Sbjct: 824 ----QGNPTTALLKGLEGRKEHALVLDLYAEKTPHWNETNPNEYGGGEFNDTPWVFCMLN 879
Query: 178 NFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQ---H 234
NF G + ++G LD++A + A + M G+G++ E NP++YD + E +
Sbjct: 880 NFGGRLGLHGHLDNLAKN-IPAALNSAKHMEGIGITPEASVNNPLLYDFLFETVWTDNAK 938
Query: 235 EK---VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVA 286
EK +D+ W+ Y+ RRYG+ + +A ++ TVY GA + V+ A
Sbjct: 939 EKLPVIDLDKWLKDYAKRRYGKESQSAYEALLIMKDTVYKAELNMKGQGAPES---VVNA 995
Query: 287 FPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGN 346
P +D S +G V + Y +++ +A EL + +
Sbjct: 996 RPALDIGAAST-------WGNAV-----------------ISYDKAKLEKAAELLLKDYD 1031
Query: 347 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGL 406
+L S+ Y YDL + +Q L+ A E + A++ N+ + +FL +++ M+ +
Sbjct: 1032 KLKDSDGYMYDLATMLQQVLSNSAQEYQRKMANAFKENNKEEFNTYADKFLSIIDSMEKV 1091
Query: 407 LACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKY 464
+ +LLG W+E AK LA+N + + YE+NA+ +T W Q E L+DY N+
Sbjct: 1092 TSTSKYYLLGTWVEQAKALAKNADDFTKDLYEFNAKALVTTWGSINQAEGGGLKDYSNRQ 1151
Query: 465 WSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK--DWRREWIKLTNDWQNGRN 515
WSGLL+D+Y R + + + L+ + +W +W++ ++ N N
Sbjct: 1152 WSGLLKDFYKVRWQKWIQARNDELDGKQPENINWFEWEWKWVRENTEYTNTPN 1204
>gi|187735714|ref|YP_001877826.1| alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
BAA-835]
gi|187425766|gb|ACD05045.1| Alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
BAA-835]
Length = 852
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 137/537 (25%), Positives = 236/537 (43%), Gaps = 52/537 (9%)
Query: 8 VLPAFSGNVPAALQ-NV-FPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
VL + G VP+ Q NV K+ G W + + +++D T F ++ +
Sbjct: 235 VLQGYVGFVPSDFQENVRIDGLKLIPQGEWVNFRR------PWVVDPTCEAFPKLAADWY 288
Query: 66 EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
+ K YG ++ D F E D ++ + MQ A W++Q W
Sbjct: 289 KALRKVYGIPGKMFGGDLFHEGGRKGDID--VTQAAQEVQKAMQKASPGAFWVIQAW--G 344
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
+P + LL+ + + +VL L ++ + F G+P++WC L NF GN M
Sbjct: 345 GNP------TRELLSGLDPERALVLQLTKDMANGGKNLRTFNGIPWVWCELANFGGNTGM 398
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
YG + ++ E ++ +VG+G EG+E NP+ Y L S+ + E + V+ W+ +
Sbjct: 399 YGGVPLLSRLGSELSGYKDKGLVGMGTLSEGLETNPLHYALFSDRLWTREDISVREWLGK 458
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
Y+ +RYG + A+ A VL ++YN +I A P +
Sbjct: 459 YARQRYGFAPKAVVKALEVLSFSIYNPVRSQEGCTESIICARPSWN-------------- 504
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLT 362
+ S++ +Y ++++A ++ + N+ L T+RYDL+D+
Sbjct: 505 ----------VRKASTWSSGERYYHLGDIVKAARGYLKAANDQPNLVKKETFRYDLVDVV 554
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
RQALA A + A+ D + +RFL L+ DMD LLA FLLG W + A
Sbjct: 555 RQALADAAFYQLQQVRSAFDSGDLAAYRKQVKRFLSLISDMDALLATDSQFLLGTWQKRA 614
Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
+ +++ + +A+ IT W D L DY N+ W+GL+ D+Y PR +F+
Sbjct: 615 LDWGDSRQEKALMDKSAKMLITTWIDQVPRS---LNDYSNRQWAGLVSDFYLPRWKNFFE 671
Query: 483 YMIESLESGDGFRLKDWRREWIKLTND---WQNGRNVYPVESNGDALITSQWLYNKY 536
+ ++ L +G R K+ D + +Y V+ GD L + + N +
Sbjct: 672 FQMDVL-TGKKTRDAAHAAFMDKMVRDELAFAGNGKIYSVKPAGDTLAVANRVMNTH 727
>gi|302546018|ref|ZP_07298360.1| LOW QUALITY PROTEIN: putative alpha-N-acetylglucosaminidase
[Streptomyces hygroscopicus ATCC 53653]
gi|302463636|gb|EFL26729.1| LOW QUALITY PROTEIN: putative alpha-N-acetylglucosaminidase
[Streptomyces himastatinicus ATCC 53653]
Length = 679
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 138/513 (26%), Positives = 214/513 (41%), Gaps = 49/513 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V P + G VP + P A G+W ++ P W LD F E+ AF
Sbjct: 214 VFPGYFGTVPDGFADRNPGAHTVPQGDWNGLRR-PDW-----LDPRTDAFHEVAAAFYRH 267
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q +G ++ D E N V P+ A+ +Q+ A+W++ GW +
Sbjct: 268 QHDLFG-ACDLFKMDLLHEGGNAGDVSVPDAAR----AVEKALQTSRPGAIWVILGWQSN 322
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
+ LL++V ++V+D +++ I K + VPY + + NF G +
Sbjct: 323 --------PRRDLLDAVDHDHMLVVDGLSDLDTITDREKDWGSVPYAFGTIPNFGGRTTI 374
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
R + +VG E +E++P Y+L SE+A++ VD AW
Sbjct: 375 GAKTHMWTERFTVWRDKPGSKLVGTAYMPEAVERDPAAYELFSELAWRDTAVDRDAWFRD 434
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
Y+ RYG ++A+ L T Y + + V A P S+T NY
Sbjct: 435 YADVRYGARDAKAREAFAALRDTAYQISSKDGRPHDSVFAARP-------SLTARSGTNY 487
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
+ T ++D + AL + L S+ YRYDL D RQA
Sbjct: 488 A----------THTPAFD-------PARFDAALAALLGVRAGLRDSDAYRYDLADTARQA 530
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
LA + +L + +AY D LSR +L+L+ D + H LLGPWLE AK++
Sbjct: 531 LANRSWQLIGQLADAYARKDLDTFRALSRLWLKLMRLSDDITGTHRLLLLGPWLEDAKRM 590
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
A E+ Q E+ AR IT W D + L +Y N+ W+GL+ D++ P+ Y +
Sbjct: 591 ASGAEESAQLEFAARALITTWADRGAADPGKLANYANRDWNGLIGDFHVPQWQTYLDELE 650
Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYP 518
++L G R D W + W R YP
Sbjct: 651 DALAEGRAPRTFD----WYTVEEPWTRERKSYP 679
>gi|386386798|ref|ZP_10071901.1| alpha-N-acetylglucosaminidase [Streptomyces tsukubaensis NRRL18488]
gi|385665738|gb|EIF89378.1| alpha-N-acetylglucosaminidase [Streptomyces tsukubaensis NRRL18488]
Length = 1033
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 137/522 (26%), Positives = 230/522 (44%), Gaps = 54/522 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP + G VP + P A++ G W ++ P W LD +F ++ AF
Sbjct: 250 VLPGYFGTVPDGFADRNPGARVVAQGLWNGLRR-PDW-----LDPRTTVFPKVAAAFYRH 303
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q K +G ++ D E D P + A+ +++ +AVW++ GW +
Sbjct: 304 QTKLFG-ACDLFKMDLLHEGGNAGDVP--VPDAARAVEKALRTARPNAVWVILGWQSN-- 358
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
+ALL++V +++++D +++ ++ G PY + + NF G +
Sbjct: 359 ------PRRALLDAVDKRRMLIVDGLSDLDTTGDRESEWGGTPYAFGTIPNFGGRTTLGA 412
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
D R + +VG E E++P ++L SE+A++ E++D +AW +Y+
Sbjct: 413 NTDRWTDRFTVWRDRPGSALVGTAYMPEAAERDPAAFELFSELAWRRERIDREAWFTEYA 472
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RYG + A+ L T Y ++ T+G+ Y
Sbjct: 473 QIRYGSDDASAAAAFGALAATAYR-----------------------LASTDGR--PYDS 507
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
+ L S + P + + A +A+G EL S+TYR+DL +L RQALA
Sbjct: 508 HFLRRPSLTSSIGTAFDPAGFDT------AFAALLAAGPELRDSDTYRHDLTELARQALA 561
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
+ L + A D +S +L+L+ D + CH FLLGPWLE AK+LA
Sbjct: 562 NRSRTLQFALRAARASKDVAAFRGVSALWLKLMRLADTMAGCHRSFLLGPWLEDAKRLAT 621
Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
+ + + E AR IT W D + A+ L +Y N+ W+GL+ D + P+ + + ++
Sbjct: 622 SPAEAVELERTARALITTWAD--RPAANALSNYANRDWNGLIADVHVPQWDAFLTEVADA 679
Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITS 529
LE+G + DW + W R VYP GD T+
Sbjct: 680 LEAGRAPKSFDWYPQ----EEAWTKDRRVYPSAPTGDPYATA 717
>gi|147860882|emb|CAN83148.1| hypothetical protein VITISV_031934 [Vitis vinifera]
Length = 562
Score = 181 bits (459), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 82/92 (89%), Positives = 89/92 (96%)
Query: 82 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 141
DTFDENTPPVD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYDPFWRPPQMKALL+S
Sbjct: 429 DTFDENTPPVDDPEYISSLGAAIFKGMQSGDSNAIWLMQGWLFSYDPFWRPPQMKALLHS 488
Query: 142 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW 173
VP+G+LVVLDLFAEVKPIW TS+QFYGVPYIW
Sbjct: 489 VPMGRLVVLDLFAEVKPIWITSEQFYGVPYIW 520
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 54/63 (85%), Positives = 60/63 (95%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSGNVPAAL+ +FPSAKIT+LGNWF+V +PRWCCTYLLDATDPLFIEIGRAFI+Q
Sbjct: 272 VLPAFSGNVPAALKYIFPSAKITRLGNWFTVGGNPRWCCTYLLDATDPLFIEIGRAFIQQ 331
Query: 68 QLK 70
QLK
Sbjct: 332 QLK 334
>gi|261880009|ref|ZP_06006436.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
gi|270333325|gb|EFA44111.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
Length = 722
Score = 181 bits (459), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 136/518 (26%), Positives = 217/518 (41%), Gaps = 69/518 (13%)
Query: 3 LADFAVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 62
L + PAF+G VP P ++ +L W D R Y+L P F +IGR
Sbjct: 226 LGMHPIAPAFAGFVPEQFVEAHPGLQVKKL-TWGGF--DDR-LNAYVLSPESPYFKQIGR 281
Query: 63 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISS------LGAAIYSGMQSGDSDAV 116
F+E+ KE+G+ + Y D+F+E PV+ + I G IY + + DAV
Sbjct: 282 LFVEEWEKEFGKNT-FYQSDSFNEMEIPVEPGDSIGKWKLLEQYGDVIYRSIAEANPDAV 340
Query: 117 WLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDL-------FAEVKPIWSTSKQFYG 168
W+ QGW F Y W ++ALL VP K++++DL + + W +YG
Sbjct: 341 WVTQGWTFGYQHKMWDSKSLQALLRHVPDDKMLIIDLANDYPKWIWKTQQTWKVQHGYYG 400
Query: 169 VPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLM 227
+++ + NF G G + A EA SE MVG G + EGIE N V+Y+L+
Sbjct: 401 KQWVFSYVPNFGGKTLPTGDMQMYASASAEALHHSERGNMVGFGSAPEGIENNDVIYELL 460
Query: 228 SEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAF 287
++M + + VD+ WI Y RYG +Q AW + +VY + ++
Sbjct: 461 ADMGWTDKAVDLDLWIKDYCEARYGGYPSDMQKAWQCMLRSVYGS-----------LYSY 509
Query: 288 PDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE 347
P ++ + VS A+ + S H F+ +
Sbjct: 510 PRFTWQTVTPDSRR-------VSTHALNDTFLSGVAH----------------FLRCARQ 546
Query: 348 LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLL 407
L +S YR D I L L A+ + ++ + ++L+ D LL
Sbjct: 547 LGSSPLYRSDAISLASLYLGTKADRHYTKALDLKASGKQQAASAELHQTIDLLTKADRLL 606
Query: 408 ACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSG 467
A H L W++ A+ ++ +YE +A+ IT+W DY ++W+G
Sbjct: 607 ASHPTHRLDRWIQFARNHGITTAEKNRYESDAKRLITIW-------GGFQEDYAARFWNG 659
Query: 468 LLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 505
L+ YY PR YF + + L W +W+K
Sbjct: 660 LIAHYYIPRIRYYFDHGRPA--------LMQWEEQWVK 689
>gi|297194750|ref|ZP_06912148.1| alpha-N-acetylglucosaminidase [Streptomyces pristinaespiralis ATCC
25486]
gi|297152431|gb|EFH31740.1| alpha-N-acetylglucosaminidase [Streptomyces pristinaespiralis ATCC
25486]
Length = 816
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 140/529 (26%), Positives = 229/529 (43%), Gaps = 55/529 (10%)
Query: 8 VLPAFSGNVPAALQNVFPS--AKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
VLP + G VP P A++ G W P W LD F ++ AF
Sbjct: 28 VLPGYFGTVPDDFPGHNPGSDARVIPQGTWGGGMRRPDW-----LDPRTQAFSDVAAAFY 82
Query: 66 EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q + +G SH + D E D P + A+ + +Q+ A W++ GW +
Sbjct: 83 RHQGELFGDVSH-FKMDLLHEGGTAGDVP--VPDAARAVETSLQTARPGATWVILGWQSN 139
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
P +L+S+ +++++D +++ + + G PY + + NF G +
Sbjct: 140 PRPV--------MLDSIDTSRVLIVDGLSDLDTVTDREADWGGAPYAFGTIPNFGGRTTI 191
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
D R + +VG E E++P +L SE+A++ EK+D +AW +
Sbjct: 192 GANTDRWTEKFTAWRDKPGSALVGTAYMPEAAERDPAALELFSELAWREEKIDREAWFAE 251
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
Y+ RYG + ++A+ L T Y T T+G+ Y
Sbjct: 252 YAQIRYGGVDHSAREAFAALAATAYKLTS-----------------------TDGR--PY 286
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
S+ L + + P + RA +A L S+ YR+DL D+ RQA
Sbjct: 287 DSLFSRRPSLTTAIGTAFDP------AGFDRAFAALLAVRAPLRDSDAYRHDLTDVARQA 340
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
LA + L L + AY+ D +S +L+++ D + CH FLLGPWLE AK+L
Sbjct: 341 LANRSRTLQLALRAAYRNKDVATFRAVSALWLKVMRLSDTMAGCHRQFLLGPWLEDAKRL 400
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
A + E+ Q E ART IT W D + A+ L +Y N+ W GL+ D + P+ +
Sbjct: 401 ATSPEEAVQLERTARTLITTWAD--RPTANSLSNYANRDWQGLMADVHVPQWEAFLTEQA 458
Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYN 534
+++ +G + DW + W R+ YPV GDA T+ +++
Sbjct: 459 DAMAAGRAPKSFDWYPQ----EEAWTQERHTYPVRPTGDAYSTALRVFD 503
>gi|289667570|ref|ZP_06488645.1| N-acetylglucosaminidase [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 798
Score = 177 bits (450), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 150/541 (27%), Positives = 232/541 (42%), Gaps = 86/541 (15%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP A P A+I ++ W TY LD DPLF ++ R F+E
Sbjct: 238 VLPAFAGYVPKAFAQAHPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFMEL 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVD-------SPEYISSL-------------------- 100
+ YG T Y D F+E PPV + +Y S+
Sbjct: 292 YTQAYG-TGEFYLADAFNEMLPPVADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARL 350
Query: 101 ---GAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
G A+Y + + A W+MQGWLF D FW+P + A L VP +L+VLD+ +
Sbjct: 351 AEYGQALYRSIAQVNPKATWVMQGWLFGADREFWQPQAIAAFLGKVPDARLLVLDIGNDR 410
Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
P W S+ F +I+ +HN+ + +YG AF + + SE + G G
Sbjct: 411 YPGTWKASQAFDNKQWIYGYVHNYGASNPLYG---DFAFYRQDLQALLADSEKRNLRGFG 467
Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
+ EG+ N VVY+ + +A++ + W+ QY RYGRS A+ AW+ L +Y
Sbjct: 468 IFPEGLHSNSVVYEYLYALAWEGPQQPWSQWLTQYLRARYGRSDAALLSAWSDLEAGIYQ 527
Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
+ P + G Y + +P + ++K + D L
Sbjct: 528 TRYWS---------------PRWWNKRAGAYLLFKRPTAD--IVKFDDRPGDPQRL---- 566
Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVF 390
RA++ + + + YRYDLI+ R L+ A+ +++AY D A G
Sbjct: 567 ---RRAIDALLQQAERYADAPLYRYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDV 623
Query: 391 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ-EKQYEWNARTQITMWFDN 449
QL+ R +LV+ +D L+ L W A A N+ + Y NAR Q+++W +
Sbjct: 624 QLA-RITQLVQGLDALVGGQHE-TLADWTGQAAAAAGNDAGLRRAYVGNARAQVSVWGGD 681
Query: 450 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRRE 502
L DY +K W G+ D+Y R + + ++G F +L W R
Sbjct: 682 GN-----LADYASKAWQGMYADFYLQRWTRFLSAYRAARKAGTPFEAAAVDQQLATWERH 736
Query: 503 W 503
W
Sbjct: 737 W 737
>gi|255079272|ref|XP_002503216.1| GH family 89 protein [Micromonas sp. RCC299]
gi|226518482|gb|ACO64474.1| GH family 89 protein [Micromonas sp. RCC299]
Length = 1260
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 157/554 (28%), Positives = 231/554 (41%), Gaps = 140/554 (25%)
Query: 28 KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 87
K +LG + K D + LD +D LF +G AF +Q ++++G T H+Y DTF E
Sbjct: 396 KDAELGKY--AKKDDSVRSVHFLDPSDALFQSLGAAFTKQLVEDFG-TDHLYLADTFREI 452
Query: 88 TPPVD--SPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPL 144
P D S ++ +GAA + M+S D A W+ Q F +P FW + ALL SV +
Sbjct: 453 RDPNDDFSETHVVRVGAATLAAMRSADPRATWVFQSDAFRRNPRFWNEGRRGALLRSVDI 512
Query: 145 GKLVVLDLFAEVKPIW-STSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA---- 199
G ++VLD AE P + F G P++WC+ HN GN+ M G L +IA GP A
Sbjct: 513 GDMLVLDSAAETDPYYLREPVHFAGQPFVWCVKHNHGGNLGMRGRLSAIATGPAAAMDSL 572
Query: 200 -----------------------------RTSENTT----------MVGVGMSMEGIEQN 220
R S T +VG G++ EG+EQN
Sbjct: 573 ASRRDGERGTTHGRGTRVGSSRRMLADNKRVSREATHGSRKVGKSQLVGFGITAEGVEQN 632
Query: 221 PVVYDLMSEMAFQHEKVDVKAWINQ--------YSVRRYGRSV----------------- 255
PVVY+L + + + VDV +++ YSVR+ +
Sbjct: 633 PVVYELAALTSQSEKGVDVDWFLSDYSRRRYGGYSVRQPAPTTLPVGTGQGAFLAGFIVG 692
Query: 256 ------------------PA-------------IQDAWNVLYHTVYNCTDGATDKN--RD 282
PA ++AW +L TVY D++ RD
Sbjct: 693 NNPIAGSPGYLGPGEWYDPAKHGEMGKEEAYDRAREAWEILGKTVYGARAKGEDEDHVRD 752
Query: 283 VIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI 342
P + +S Y + K V Y+ +I A
Sbjct: 753 ACSWQPSLRADELSP---DYFDAAKVVD------------------YAFKPLIDAAPTLR 791
Query: 343 ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVED 402
A+G A YD++D+ RQ LA+ +N L I ++ N+A + LEL++D
Sbjct: 792 ANG----AGTRVDYDIVDVGRQLLARQSNVLATQIRDSLNSNNASEAKMYGTQMLELLDD 847
Query: 403 MDGLLACHDGFLLGPWLESAKQLA---QNEEQEKQYEWNARTQITMWFDNTQEEASLL-- 457
MD LL H GFLLG ++ESAK A E E E +AR+ I+ + + + + L
Sbjct: 848 MDALLRSHKGFLLGNYIESAKSWAGKRNKESDEANLERSARSLISGFGPSGSKLGAPLGH 907
Query: 458 --RDYGNKYWSGLL 469
DY N+ WSG+L
Sbjct: 908 PMHDYSNRQWSGML 921
>gi|118370728|ref|XP_001018564.1| alpha-N-acetylglucosaminidase precursor [Tetrahymena thermophila]
gi|89300331|gb|EAR98319.1| alpha-N-acetylglucosaminidase precursor [Tetrahymena thermophila
SB210]
Length = 879
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 150/514 (29%), Positives = 241/514 (46%), Gaps = 60/514 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+L F G VP +L+ FP +KI +W + LDA D LF I F +
Sbjct: 255 ILQGFYGMVPNSLKAKFPLSKIYGDQSWLGFRRPA------FLDANDELFSNIANIFYSE 308
Query: 68 QLKEYGRTSHIYNCDTFDEN--TPPVDSPEYISSLGAAIYSGMQ----SGDSDAVWLMQG 121
K YGR + Y D F E P ++ ++S +IY MQ D W++Q
Sbjct: 309 SEKLYGR-AKFYGGDPFHEGAIVPGLN----LTSQAQSIYRAMQYTDNPKDEKVKWILQS 363
Query: 122 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 181
W + P Q LL + + ++LDL AE + W T+ F G ++W L NF
Sbjct: 364 WQEN------PSQQ--LLQGLQNDECIILDLMAEARSKWQTND-FSGHDFLWTSLPNFGL 414
Query: 182 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVD--- 238
I YG+++ P A + +N+TM G+G EGI N + Y+++ + A+ D
Sbjct: 415 RIGQYGMIEQYVSQPPLAYSIKNSTMKGIGSIPEGILTNVLDYEILFDKAWIQPNQDTNL 474
Query: 239 -----VKAWINQYSVRRYG-RSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 292
V ++ + RYG ++ + AW++L +++YN T+ + V++A P
Sbjct: 475 TPRQQVLQYLGDFIRYRYGEQNNKNLFSAWSLLTNSIYNSTNPWDGPSESVMLARPASYI 534
Query: 293 SIISVTEGKYQNYGKPVSKEA------VLKSETSSYDHPHLWYSTSEVIRAL-------E 339
+S Y + EA +K + HL E+ + L E
Sbjct: 535 DKVSSWGTSYIYWNTTNVLEAWKLFTNYVKEKKQKNRSQHL-QKLEEINKKLGRSDDDME 593
Query: 340 LFIA-SGNELSA--SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRF 396
F+ S NE +T+ YDL+D+ RQ LA Y+ L+ ++ A+ D S++F
Sbjct: 594 AFVEISQNEERNIFKDTFLYDLVDVARQNLASYSYLLYNKVMLAFNQTDTIKFALYSQQF 653
Query: 397 LELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASL 456
LEL++D D LL+ F+LG +LES +L +++++ + + QIT+W D S
Sbjct: 654 LELIKDQDQLLSSRKEFMLGYYLESVSKLGTTDQEKQNFIEQIKRQITVWSD----FPSD 709
Query: 457 LRDYGNKYWSGLLRDYYGPRAAIYFK----YMIE 486
L DY NK W+G+L+D+Y PR +YFK Y++E
Sbjct: 710 LHDYANKEWNGILKDFYLPRWELYFKSLQSYIVE 743
>gi|289663931|ref|ZP_06485512.1| N-acetylglucosaminidase [Xanthomonas campestris pv. vasculorum
NCPPB 702]
Length = 798
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 150/541 (27%), Positives = 232/541 (42%), Gaps = 86/541 (15%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP A P A+I ++ W TY LD DPLF ++ R F+E
Sbjct: 238 VLPAFAGYVPKAFAQAHPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFMEL 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVD-------SPEYISSL-------------------- 100
+ YG T Y D F+E PPV + +Y S+
Sbjct: 292 YTQAYG-TGEFYLADAFNEMLPPVADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARL 350
Query: 101 ---GAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
G A+Y + + A W+MQGWLF D FW+P + A L VP +L+VLD+ +
Sbjct: 351 AEYGQALYRSIAQVNPKATWVMQGWLFGADREFWQPQAIAAFLGKVPDARLLVLDIGNDR 410
Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
P W S+ F +I+ +HN+ + +YG AF + + SE + G G
Sbjct: 411 YPGTWKASQAFDNKQWIYGYVHNYGASNPLYG---DFAFYRQDLQALLADSEKRNLRGFG 467
Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
+ EG+ N VVY+ + +A++ + W+ QY RYGRS A+ AW+ L +Y
Sbjct: 468 IFPEGLHSNSVVYEYLYALAWEGPQQPWSQWLMQYLRARYGRSDAALLSAWSDLEAGIYQ 527
Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
+ P + G Y + +P + ++K + D L
Sbjct: 528 TRYWS---------------PRWWNKRAGAYLLFKRPTAD--IVKFDDRPGDPQRL---- 566
Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVF 390
RA++ + + + YRYDLI+ R L+ A+ +++AY D A G
Sbjct: 567 ---RRAIDALLQQAERYADAPLYRYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDV 623
Query: 391 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ-EKQYEWNARTQITMWFDN 449
QL+ R +LV+ +D L+ L W A A N+ + Y NAR Q+++W +
Sbjct: 624 QLA-RITQLVQGLDALVGGQHE-TLADWTGQAAAAAGNDAGLRRAYVGNARAQVSVWGGD 681
Query: 450 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRRE 502
L DY +K W G+ D+Y R + + ++G F +L W R
Sbjct: 682 GN-----LADYASKAWQGMYADFYLQRWTRFLSAYRAARKAGTPFDAAAVDQQLATWERH 736
Query: 503 W 503
W
Sbjct: 737 W 737
>gi|418473272|ref|ZP_13042874.1| putative alpha-N-acetylglucosaminidase, partial [Streptomyces
coelicoflavus ZG0656]
gi|371546106|gb|EHN74664.1| putative alpha-N-acetylglucosaminidase, partial [Streptomyces
coelicoflavus ZG0656]
Length = 716
Score = 176 bits (445), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 131/511 (25%), Positives = 216/511 (42%), Gaps = 44/511 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP + G VP P + G W + P W LD +F + +F
Sbjct: 250 VLPGYYGTVPPDFTARNPGGTVVPQGQWVGFER-PDW-----LDPRTGVFSRVAASFYRH 303
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + +G S +Y D E P + P + A+ + +Q+ AVW + GW +
Sbjct: 304 QRELFG-DSTMYKMDLLHEGGRPGNVP--VGDAARAVMNALQTARPGAVWTLIGWQNN-- 358
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
P + ++++V +L+++D ++ ++G PY + + NF G+ +
Sbjct: 359 -----PSTQ-IIDAVDKSRLLIVDGLSDRYDGLDRETAWHGAPYAFGTIPNFGGHTTVGA 412
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
A RT + + G+ EG NPV Y+L +E+A++ E VD W Y+
Sbjct: 413 NTAVWAERFDRWRTEPGSALAGIAYLPEGTGGNPVAYELFTELAWRTEPVDHSGWFAAYA 472
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RRYGR P AW +L Y+ G + +D +
Sbjct: 473 ERRYGRPDPHAARAWELLRTGPYSMPSGTWSEAQDSLF---------------------- 510
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
++ + + +S+ + Y V AL + L ++ YR+DL+D+ RQALA
Sbjct: 511 -TARPRLTATSAASWSPGAMRYDPDTVRAALAELLKVAPALRTTDAYRFDLVDVARQALA 569
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
+ L I AY D + + + ++ +D LLA FLLGPWL A+ +
Sbjct: 570 NRSRSLLPEIKAAYDAGDLSRFRAGAAEWKDDLDLLDRLLATDSRFLLGPWLADARSWGR 629
Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
++ E++AR+ +T W + +A LRDY N+ WSGL+ D+Y R Y + +
Sbjct: 630 TAAEKDAAEFDARSLLTTWGHRSGSDAGGLRDYANREWSGLVSDFYAMRWTTYLDSLDTA 689
Query: 488 LESGDGFRLKDWRREWIKLTNDWQNGRNVYP 518
L +G D W L N W + YP
Sbjct: 690 LVTGRPPAAID----WFSLENAWNQRHDDYP 716
>gi|257067709|ref|YP_003153964.1| Alpha-N-acetylglucosaminidase (NAGLU) [Brachybacterium faecium DSM
4810]
gi|256558527|gb|ACU84374.1| Alpha-N-acetylglucosaminidase (NAGLU) [Brachybacterium faecium DSM
4810]
Length = 768
Score = 175 bits (444), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 141/547 (25%), Positives = 232/547 (42%), Gaps = 60/547 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP F G +PA L + ++ W + + P DPLF E +
Sbjct: 238 VLPGFGGQLPAELVG---TERMIDWQGWHNALAAP----------GDPLFAEAAASLHRH 284
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + G T H Y D + E+ PP SP+ ++ AI++ M+ D AVW++QGW F Y
Sbjct: 285 QRQLLG-TDHHYAVDPYIESLPPTTSPQQLAEHAEAIFTAMRDADPQAVWILQGWPFHYR 343
Query: 128 P-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
+W ++ +LL+ VP +L++LDL+ E P+W + YG ++WC+ H F G ++
Sbjct: 344 AAYWTEERVHSLLSRVPEDRLILLDLWGEHAPMWHRTAAMYGRRWLWCLAHTFGGRFGLF 403
Query: 187 GIL----DSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
G L D + A + G G++ E ++ N VVY+L + A + W
Sbjct: 404 GDLAALDDDLRGLRTAAEAGTRGRLEGFGITSEALDDNAVVYELATR-ALWSPMPPRERW 462
Query: 243 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
+ ++ +RRYG + P +Q AW V+ HT+Y G T ++A P
Sbjct: 463 LEEHIIRRYGTAAPEVQQAWQVIAHTLYGP--GRTRSTPSPLIARP-------------- 506
Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSE--------------VIRALELFIASGNEL 348
G P + + + D P +E +R+L + SG
Sbjct: 507 WTRGLPFASQRLAGEALPDADGPPSANIDAENDAEMLGALAPLAHAVRSLLPVLRSGEH- 565
Query: 349 SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLA 408
+ DL L A+ A I+ A D + + L+ +D + A
Sbjct: 566 --RDALARDLAQLAIHVGAQSARAPLRAIVAAAAEADGERLRAEASTLEALLRAVDAVAA 623
Query: 409 CHDGFLLGPWLESAKQLAQNEEQ-EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSG 467
L+G W+ A+ A +E+ E +AR+ I++W TQ+ S L DY ++WSG
Sbjct: 624 TRPDMLVGRWIADARAGAGTDERLADALERDARSLISVW--GTQD--SGLHDYSARHWSG 679
Query: 468 LLRDYYGPRAAIYFKYMIESLES-GDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD-A 525
L D + R + ++ + E L+ + + DW++ YP G+ A
Sbjct: 680 SLTDLHLARWRAWTDWLARTAEEPSTPPDLEQLHAQIRGIEEDWRDSTAPYPTTPRGEPA 739
Query: 526 LITSQWL 532
SQ L
Sbjct: 740 AAISQLL 746
>gi|429198382|ref|ZP_19190217.1| alpha-N-acetylglucosaminidase (NAGLU) [Streptomyces ipomoeae 91-03]
gi|428665917|gb|EKX65105.1| alpha-N-acetylglucosaminidase (NAGLU) [Streptomyces ipomoeae 91-03]
Length = 747
Score = 175 bits (443), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 130/518 (25%), Positives = 230/518 (44%), Gaps = 44/518 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V P + G VP P A+ G+W + P W LD F + AF
Sbjct: 257 VFPGYFGTVPPGFAERNPGARTVPQGDWMGF-ARPDW-----LDPRTNEFKRVAAAFYRA 310
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + +G S +Y D E P D P ++ + +++ DA W++ GW +
Sbjct: 311 QDELFGGPSTLYKMDLLHEGGDPGDVP--VADAAKGVERALRAAHPDATWVILGWQHN-- 366
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYG-VPYIWCMLHNFAGNIEMY 186
PP +A++++V +++V+D ++ P + +G PY + + NF G+ +
Sbjct: 367 ----PP--RAIVDAVDKKRMLVVDGLSDRFPTVIDREADWGDTPYAFGSIWNFGGHTALG 420
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
A + RT + + + G+ + E + NP + L SE+A++ +++D+K W +++
Sbjct: 421 ANTPVWAELYEKWRTKDGSKLRGIALMPEAADNNPAAFALFSELAWRKDELDLKTWFSEW 480
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ RYG P + AW++L T Y G T +R +EG +G
Sbjct: 481 AHARYGARDPHAEAAWDILRRTAY----GTTRADR---------------WSEGADGLFG 521
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
S+ A+ + + L Y +E AL ++ L +S+ YR DL+D+ RQ L
Sbjct: 522 ---SRPALNTVRAARWSPKQLRYDAAEFEPALGELLSVRPGLRSSSAYRRDLLDVARQTL 578
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
+ + L I AY D +L+ +L L++ +D LLA LLG W+ A+
Sbjct: 579 SNRSRVLLPRIRGAYDARDTARFDELTGTWLSLMDLLDRLLATDSAHLLGRWVADARAWG 638
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
++ + ++ ++ + +T+W +A LRDY N+ W+GL+ Y R + YF+ +
Sbjct: 639 ASDAERERLAYDNLSLLTVWGTRKGADAG-LRDYANREWAGLVGGLYRLRWSTYFEELRA 697
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD 524
+L G + D W L + W E GD
Sbjct: 698 ALREGRTPKKID----WFALEDRWTRAPGRLATEPTGD 731
>gi|453051703|gb|EME99203.1| alpha-N-acetylglucosaminidase [Streptomyces mobaraensis NBRC 13819
= DSM 40847]
Length = 763
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 130/526 (24%), Positives = 222/526 (42%), Gaps = 48/526 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP ++G VP A+ G+W P W LD F + R +
Sbjct: 276 VLPGYAGTVPPDFTRRNKGARTVPQGDWAGFPR-PDW-----LDPRTAHFARVARTYYRV 329
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YG S +Y D E P P + + A+ +++ DA W + GW +
Sbjct: 330 QRELYG-ASSMYKIDLLHEGGTP--GPVPVGAAAKAVEKALRAAHPDATWAILGWQTN-- 384
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
+ +L++V K++VLD + P + K + G PY + + NF G+ M
Sbjct: 385 ------PRREILDAVDRSKMLVLDGIPDHYPRVTDREKDWGGTPYAFGTIWNFGGHTAMG 438
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
RT + + + G+ + E + NP L S++A+ ++D+K W ++
Sbjct: 439 ANTQDWVSLFHRWRTKKGSALRGIALMPEAADNNPAALALFSDLAWTEGRLDLKDWFARW 498
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCT--DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
V+RYG + P + AW+VL T Y T DG ++ + A PD+
Sbjct: 499 PVQRYGAADPNARRAWDVLRRTAYGTTRADGWSEAADGLFGARPDL-------------- 544
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
AV + +++ L Y + AL +A L S+ YR DL D+ RQ
Sbjct: 545 --------AV--NRAAAWSPRQLRYDAAAFDEALPALLAVAPALRGSSAYRCDLTDVARQ 594
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
++ + L I AY D L+R++L+ + ++ +A + LLG W+ A+
Sbjct: 595 CVSNRSRLLLPRIKAAYDAGDRTRFRTLTRQWLDWMTLLEETVATSERHLLGRWIAEARA 654
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
+ + E +A + +T+W + L DY N+ W+GL+ Y R YF +
Sbjct: 655 WGGTAAERDRLEHDAVSLLTVWGPRASADGGKLHDYANREWAGLVGGLYRLRWKTYFTEL 714
Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 530
+L + R K +W L + W R YP + +GD + ++
Sbjct: 715 EAALTA----RRKPKPIDWYALEDRWTRKRPAYPAKPSGDIVAVAR 756
>gi|290956360|ref|YP_003487542.1| alpha-N-acetylglucosaminidase [Streptomyces scabiei 87.22]
gi|260645886|emb|CBG68977.1| putative alpha-N-acetylglucosaminidase [Streptomyces scabiei 87.22]
Length = 732
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 137/519 (26%), Positives = 228/519 (43%), Gaps = 47/519 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V P + G VPA P A+ G W + P W LD F + AF
Sbjct: 243 VFPGYFGTVPAGFAERVPGARTVPQGRWMGF-ARPDW-----LDPRTDEFARVAAAFYRT 296
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + +G S +Y D E P D P ++ + +Q A W+M GW +
Sbjct: 297 QDEMFG-PSALYKMDLLHEGGDPGDVP--VADAAKGVERALQRAHPGATWVMLGWQHN-- 351
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
PP +A++++V ++V+D ++ P + + G PY + + NF G+ +
Sbjct: 352 ----PP--RAIVDAVDKQHMLVVDGLSDRFPTVTDREADWGGTPYAFGSIWNFGGHTALG 405
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
A + RT + +T+ G+ + E + NP + L SE+A++ ++D++ W ++
Sbjct: 406 ANTPDWAALYEKWRTKDGSTLHGIALMPEAADNNPAAFALFSELAWREGELDLETWFAEW 465
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ RYG P + AW++L T Y T S +EG +G
Sbjct: 466 AHARYGARDPHAEAAWDILRRTAYGTT-------------------RADSWSEGADGLFG 506
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
S+ A+ + L Y+ ++ AL + EL AS+ YR DL+D+ RQAL
Sbjct: 507 ---SRPALTAVRAGRWSPKQLRYNAADFEPALGEMLKVRPELRASSAYRRDLLDVARQAL 563
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
+ + + + AY DA + + SR +L L++ +D L+A LLG W+ A+ A
Sbjct: 564 SNRSRVMLPQLKAAYDAKDAARLAKGSRDWLSLMDLLDELVATDSRHLLGRWVADARSWA 623
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASL-LRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
+ + ++A + +T+W T+E A LRDY N+ W+GL+ Y R A YF+ +
Sbjct: 624 VGSTERTELAYDALSLLTVW--GTREGADAGLRDYANREWAGLVGGLYRLRWATYFEELR 681
Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD 524
+L G + D W L + W E GD
Sbjct: 682 AALAEGRAPKKID----WFALEDRWARNPGTLATEPAGD 716
>gi|333023613|ref|ZP_08451677.1| putative alpha-N-acetylglucosaminidase [Streptomyces sp. Tu6071]
gi|332743465|gb|EGJ73906.1| putative alpha-N-acetylglucosaminidase [Streptomyces sp. Tu6071]
Length = 741
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 138/521 (26%), Positives = 231/521 (44%), Gaps = 53/521 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP + G VPA + P AK G W + P W LD LF E+ AF E
Sbjct: 252 VLPGYFGTVPAGFADRNPGAKTVPQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEI 305
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YGR + +Y D E + P ++ G + +++ DAVW++ GW +
Sbjct: 306 QEELYGRGT-LYKMDLLHEGGSAGNVPVGDATRG--VQRALRAARPDAVWVILGWQKN-- 360
Query: 128 PFWRPPQMKALLNSVPLGKLVVLD----LFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
PP K ++ + ++V+D F+EV + G PY + + NF G+
Sbjct: 361 ----PP--KEVVAAADREAMLVVDGLSDRFSEVN---DRESDWQGTPYAFGSIWNFGGHT 411
Query: 184 EMYG-ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAW 242
+ D + P R + + G+ + E + NP ++L +E+ + VD+ W
Sbjct: 412 ALGANARDWVDLYP-RWRDRSGSRLSGIALMPEAADNNPAAFELFAELPWTEGPVDLTDW 470
Query: 243 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 302
+Y+ RYG S + AW++L T Y D+ + P++ +V+ GK+
Sbjct: 471 FREYARVRYGGSDAHAEAAWDILRTTAYGTRR--DDRWSEPADGLFGARPALDAVSAGKW 528
Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 362
L+ +S++ AL+ +A EL S TYR DL+D+
Sbjct: 529 --------SPKALRYPAASFEP------------ALDELLAVRAELRDSATYRRDLLDVA 568
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
RQALA + L + AYQ + +L RR++ L++ ++ L+A + LLG W+ESA
Sbjct: 569 RQALANRSRTLLPRLAAAYQAKNQAEFARLGRRWIALMDLLEQLVATDENHLLGRWVESA 628
Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
+ + ++ Q +++A + +T W +A LRDY N+ WSGL+ Y R + Y
Sbjct: 629 RAWGGSAREKSQLQYDALSLLTTWGTRQGADAG-LRDYANREWSGLVGGLYRLRWSTYID 687
Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNG 523
+ +L+ G K +W L + W + G
Sbjct: 688 ELSAALKEGR----KPVAVDWFALEDRWTRNPGALATQPRG 724
>gi|295690503|ref|YP_003594196.1| alpha-N-acetylglucosaminidase [Caulobacter segnis ATCC 21756]
gi|295432406|gb|ADG11578.1| Alpha-N-acetylglucosaminidase [Caulobacter segnis ATCC 21756]
Length = 770
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 140/509 (27%), Positives = 211/509 (41%), Gaps = 73/509 (14%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF G VP A P A+I ++ W TY LD DPLF +I F+
Sbjct: 239 ILPAFGGYVPKAFAQKNPKARIYRMRPWEGFHE------TYWLDPADPLFAKIAGRFLAL 292
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVD-------------------------------SPEY 96
+ YG T Y D+F+E PP++ +
Sbjct: 293 YTQTYG-TGTYYLADSFNEMLPPINADGADARDAAYGDGAANTAATKTKVEVDPALKAQR 351
Query: 97 ISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAE 155
+++ G AIY ++ DAVW+MQGWLF D FW P + A L+ VP KL++LD+ +
Sbjct: 352 LAAYGKAIYDSIRQARPDAVWVMQGWLFGADSHFWDPTAISAYLSLVPDDKLMILDIGND 411
Query: 156 VKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTMVGVGMS 213
P +W +K F G P+I+ +HN+ G+ +YG LD P A E + G GM
Sbjct: 412 RYPAVWKNAKAFGGKPWIYGYVHNYGGSNPVYGDLDYYRRDIPAIAANPEAGKLAGFGMF 471
Query: 214 MEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT 273
EG+ N +VYD + ++A+ + + AW++ Y+ RYG++ P + A L Y+
Sbjct: 472 PEGLHNNSIVYDAVYDLAWGAGRESLSAWLSTYARARYGKTSPELDAALGQLVEAAYSTR 531
Query: 274 DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSE 333
+ P G Y + +P + HP
Sbjct: 532 YWS---------------PRWWKSKAGAYLFFKRPTATIGEFPP------HPGDRAKLEA 570
Query: 334 VIRAL-ELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 392
++AL L A NE + DL D TR ++L + AY+ D Q
Sbjct: 571 AVKALTALAPAYANE----PLFVLDLTDATRHLATMKIDDLLQAAVAAYRRGDVASGDQA 626
Query: 393 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 452
L +D LL L W++ A+ Y NA+ Q+T+W
Sbjct: 627 RVEIAALALSIDKLLGVQPE-TLATWIDDARAYGDTPADAAAYVANAKAQVTVWGGEGN- 684
Query: 453 EASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
L DY +K W GL R +Y PR +++
Sbjct: 685 ----LNDYASKAWQGLYRGFYLPRWSMFL 709
>gi|293369246|ref|ZP_06615836.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
gi|292635671|gb|EFF54173.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
Length = 521
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 93/264 (35%), Positives = 150/264 (56%), Gaps = 6/264 (2%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPAAL+ V+P+ K T++ W R CT+L + D L+ I + ++ +
Sbjct: 233 VLPAFAGHVPAALKRVYPNIKTTRVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTE 289
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YG T+HIY D F+E PP + + + IY + + D +AVWL WLF D
Sbjct: 290 QTRLYG-TNHIYGIDPFNEIDPPSWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYAD 348
Query: 128 -PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
W P++K+ L SVP +L++LD F E IW + ++G PY+WC L NF GN +
Sbjct: 349 IKHWTTPRIKSYLRSVPQDRLILLDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLS 408
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
G ++ ++ +A + + + GVG ++EGI+ N +Y+ + + A+ + D K W +
Sbjct: 409 GPVNLVSERLADALKNGGSNLKGVGSTLEGIDLNQFMYEFVLDKAWNGGQTD-KEWFFKL 467
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVY 270
+ RR G+ P + AW +L + VY
Sbjct: 468 ADRRIGKISPEARKAWEILANKVY 491
>gi|318057780|ref|ZP_07976503.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SA3_actG]
Length = 741
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 134/518 (25%), Positives = 231/518 (44%), Gaps = 47/518 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP + G VPA + P AK G W + P W LD LF E+ AF E
Sbjct: 252 VLPGYFGTVPAGFADRNPGAKTVPQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEI 305
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YGR + +Y D E + P ++ G + +++ DAVW++ GW +
Sbjct: 306 QEELYGRGT-LYKMDLLHEGGSAGNVPVGDATRG--VQRALRAARPDAVWVILGWQKN-- 360
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
PP K ++ + ++V+D ++ P + + G PY + + NF G+ +
Sbjct: 361 ----PP--KEVVAAADREAMLVVDGLSDRFPEVNDRESDWQGTPYAFGSIWNFGGHTALG 414
Query: 187 G-ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
D + P R + + G+ + E + NP ++L +E+ + VD+ W +
Sbjct: 415 ANTRDWVDLYP-RWRDRSGSRLSGIALMPEAADNNPAAFELFAELPWTEGPVDLTDWFRE 473
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
Y+ RYG S + AW++L T Y D+ + P++ +V+ GK+
Sbjct: 474 YARVRYGGSDAHAEAAWDILRTTAYGTRR--DDRWSEPADGLFGARPALDAVSAGKW--- 528
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
L+ +S++ AL+ ++ EL S TYR DL+D+ RQA
Sbjct: 529 -----SPKALRYPAASFEP------------ALDELLSVRAELRDSATYRRDLLDVARQA 571
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
LA + L + AY+ + +L RR++ L++ ++ L+A + LLG W+ESA+
Sbjct: 572 LANRSRTLLPRLAAAYKAKNQAEFARLGRRWIALIDLLEQLVATDENHLLGRWVESARAW 631
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
+ ++ Q +++A + +T W +A LRDY N+ WSGL+ Y R + Y +
Sbjct: 632 GGSAREKNQLQYDALSLLTTWGTRQGADAG-LRDYANREWSGLVGGLYRLRWSTYIDELS 690
Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNG 523
+L+ G K +W L + W + G
Sbjct: 691 AALKEGR----KPVAVDWFALEDRWTRNPGTLATQPRG 724
>gi|318078904|ref|ZP_07986236.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SA3_actF]
Length = 719
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 131/494 (26%), Positives = 225/494 (45%), Gaps = 43/494 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP + G VPA + P AK G W + P W LD LF E+ AF E
Sbjct: 230 VLPGYFGTVPAGFADRNPGAKTVPQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEI 283
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YGR + +Y D E + P ++ G + +++ DAVW++ GW +
Sbjct: 284 QEELYGRGT-LYKMDLLHEGGSAGNVPVGDATRG--VQRALRAARPDAVWVILGWQKN-- 338
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEM- 185
PP K ++ + ++V+D ++ P + + G PY + + NF G+ +
Sbjct: 339 ----PP--KEVVAAADREAMLVVDGLSDRFPEVNDRESDWQGTPYAFGSIWNFGGHTALG 392
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
D + P R + + G+ + E + NP ++L +E+ + VD+ W +
Sbjct: 393 ANTRDWVDLYP-RWRDRSGSRLSGIALMPEAADNNPAAFELFAELPWTEGPVDLTDWFRE 451
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 305
Y+ RYG S + AW++L T Y D+ + P++ +V+ GK+
Sbjct: 452 YARVRYGGSDAHAEAAWDILRTTAYGTRR--DDRWSEPADGLFGARPALDAVSAGKW--- 506
Query: 306 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 365
L+ +S++ AL+ ++ EL S TYR DL+D+ RQA
Sbjct: 507 -----SPKALRYPAASFEP------------ALDELLSVRAELRDSATYRRDLLDVARQA 549
Query: 366 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 425
LA + L + AY+ + +L RR++ L++ ++ L+A + LLG W+ESA+
Sbjct: 550 LANRSRTLLPRLAAAYKAKNQAEFARLGRRWIALIDLLEQLVATDENHLLGRWVESARAW 609
Query: 426 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
+ ++ Q +++A + +T W +A LRDY N+ WSGL+ Y R + Y +
Sbjct: 610 GGSAREKNQLQYDALSLLTTWGTRQGADAG-LRDYANREWSGLVGGLYRLRWSTYIDELS 668
Query: 486 ESLESGDGFRLKDW 499
+L+ G DW
Sbjct: 669 AALKEGRKPVAVDW 682
>gi|329851961|ref|ZP_08266642.1| alpha-N-acetylglucosaminidase NAGLU family protein [Asticcacaulis
biprosthecum C19]
gi|328839810|gb|EGF89383.1| alpha-N-acetylglucosaminidase NAGLU family protein [Asticcacaulis
biprosthecum C19]
Length = 731
Score = 172 bits (435), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 142/542 (26%), Positives = 226/542 (41%), Gaps = 98/542 (18%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF+G VP A P A+I ++ W TY LD DPLF ++ F++
Sbjct: 219 ILPAFAGYVPKAFAESHPQARIYRMRAWEGFHE------TYWLDPADPLFAKLAGRFLDL 272
Query: 68 QLKEYGRTSHIYNCDTFDENTPPV-DSP------------------------EYISSLGA 102
+ YG+ Y D F+E PPV D P E +++ G
Sbjct: 273 YDQTYGK-GRFYLADAFNEMLPPVGDGPVEGGYGDSTANKEAVAEVDPAVKAERLAAYGQ 331
Query: 103 AIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IW 160
++ ++S DAVW+MQGWLF D FW + A L +VP L+VLD+ + P +
Sbjct: 332 RLHDSIRSARPDAVWVMQGWLFGADQGFWTGDAIAAFLRNVPDDGLMVLDIGNDRYPKVR 391
Query: 161 STSKQFYGVPYIWCMLHNFAGNIEMYGILD-------SIAFGPVEARTSENTTMVGVGMS 213
T++ F+G +I+ +HN+ + +YG L +I P R + G G+
Sbjct: 392 QTAQAFHGKGWIYGYVHNYGASNPIYGDLGFYRRDMAAITSDPARGR------LQGFGVF 445
Query: 214 MEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT 273
EG++ N +VY + ++A+ + W+ Y+ RYG S P + AW + VY
Sbjct: 446 PEGLDSNSIVYAYLYDLAWNGGTKSLSDWLAGYTRARYGISSPEVVTAWLDIVKGVYGTR 505
Query: 274 DGATDKNRDVIVAF-----PDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 328
R A+ PD+ + G +++ A ++ +D P L
Sbjct: 506 YWTPRWWRSTAGAYLLCKRPDIAMADFEGAPGDRAALRAGLARLAAIR-----HDSPLL- 559
Query: 329 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 388
RYD+I+ TR + + + L + AY+ D
Sbjct: 560 --------------------------RYDVIEFTRHLASLHLDNLIRTALVAYRDGDVAA 593
Query: 389 VFQLS---RRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 445
+ + RR ++D+ G CH L W+E A+ ++ YE NAR Q+T+
Sbjct: 594 GDRSATEVRRVTIAIDDLMGAQPCH----LAGWIEQARAYGDTATEKPYYERNARAQVTV 649
Query: 446 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESL--ESGDGFRLKDWRREW 503
W L DY +K W GL RD+Y PR + F + + RL W W
Sbjct: 650 WGGKGN-----LHDYASKAWQGLYRDFYLPRWEMLFAALRTGTYNPAATTERLIAWENAW 704
Query: 504 IK 505
++
Sbjct: 705 VE 706
>gi|374985456|ref|YP_004960951.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
gi|297156108|gb|ADI05820.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
Length = 1039
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 140/522 (26%), Positives = 219/522 (41%), Gaps = 53/522 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP + G VP P A + G W + K P W LD LF E+ AF
Sbjct: 251 VLPGYYGTVPDDFLAKNPGASLVAQGTWGAFKR-PDW-----LDPRTDLFAEVAAAFYRH 304
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + YG +S +Y D E P D P + A+ + +Q + AVW + GW +
Sbjct: 305 QRERYGDSS-MYKMDLLHEGGNPGDVP--VGEAAKAVEAALQKAHAGAVWAILGW--QTN 359
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
P + +L +V ++V+D ++ + + G PY + + NF G+ +
Sbjct: 360 P------SREILGAVDKSMMLVVDGLSDRYTTVIDRESDWDGTPYAFGSIWNFGGHTPIG 413
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
+ R + + G+ M EG + NP L +++A+ + + W Y
Sbjct: 414 ANAPDWVEQYPKWRDKTGSALTGIAMMPEGADNNPAAMALFTDLAWTPGAIGLDDWFASY 473
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD----VDPSIISVTEGKY 302
+V RYG P AW + T YN + D PD PS+ +
Sbjct: 474 AVSRYGGEDPHAVAAWKAIRDTAYNMS------RADAWSEAPDGLFGARPSLGA------ 521
Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 362
+K A E YD + +E+++ +A G L S+ Y YDL D+
Sbjct: 522 -------NKAAAWGPEADRYDTTAFDAALTELLQ-----VAPG--LRDSSAYAYDLADVA 567
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
RQ L+ + L I AY+ D +L++ +L ++ MD +LA LLG WL A
Sbjct: 568 RQVLSNRSRVLLPQIKTAYEAGDRGRFDRLTKTWLSWMKLMDKVLATSGQHLLGRWLADA 627
Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
+ ++ Q E++AR+ IT W E L DY N+ WSGLL Y R YF
Sbjct: 628 RSWGATRAEKDQLEYDARSIITTWGGRASSEEG-LHDYANREWSGLLGGLYHLRWKTYFD 686
Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD 524
+ +L +G + +W L + W + YPV ++GD
Sbjct: 687 ELSTALAAG----RQPAGIDWFALEDHWARRHDSYPVRTSGD 724
>gi|29828556|ref|NP_823190.1| alpha-N-acetylglucosaminidase [Streptomyces avermitilis MA-4680]
gi|29605660|dbj|BAC69725.1| putative alpha-N-acetylglucosaminidase [Streptomyces avermitilis
MA-4680]
Length = 728
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 128/518 (24%), Positives = 219/518 (42%), Gaps = 44/518 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V P + G VP + A G W + P W LD F + AF
Sbjct: 238 VFPGYFGTVPPGFADRNAGAHTVPQGTWMGF-ARPDW-----LDPRTEHFTRVAAAFYRI 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + +G S Y D E P D P + + +++ AVW++ GW +
Sbjct: 292 QDEMFGGASTRYKMDLLHEGGSPGDVP--VGDAAKGVERALRAAHPGAVWVILGWQHN-- 347
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMY 186
PP +A++++V +++V+D + P + ++G PY + + NF G+ +
Sbjct: 348 ----PP--RAIVDAVDKDRMLVVDGLCDRFPKVTDREADWHGTPYAFGSIWNFGGHTTLG 401
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
A RT +T+ GV + E + NP + L SE+A++ +D++AW ++
Sbjct: 402 ANTPDWASLYERWRTRPGSTLRGVALLPEAADNNPAAFALFSELAWREGDLDLRAWFARW 461
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ RYG P + AW++L T Y T S +EG +G
Sbjct: 462 ARSRYGGRDPHAEAAWDILRRTAYGTT-------------------RADSWSEGADGLFG 502
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
++ ++ ++ +S+ L Y E AL + L S+ YR DL+D+ RQAL
Sbjct: 503 ---ARPSLAATKAASWSPKRLRYRPEEFEPALGELLKVRPGLRGSSAYRRDLLDVARQAL 559
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
+ + L I AY+ D +L+ +L L++ ++ LLA LLG W+ A+
Sbjct: 560 SNRSRVLLPQIRTAYEAKDTARFDRLTGVWLALMDLLEALLATDSRHLLGRWVADARAWG 619
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
+ + + ++A + +T+W +A LRDY N+ W+GL+ Y R + YF +
Sbjct: 620 ASAAERDRLAYDALSLLTVWGTRAGADAG-LRDYANREWAGLVGGLYRLRWSTYFAELRS 678
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD 524
+ G + D W L + W GD
Sbjct: 679 ASREGRTPKKTD----WFALEDRWTRNPGGLATRPTGD 712
>gi|384417770|ref|YP_005627130.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzicola BLS256]
gi|353460684|gb|AEQ94963.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzicola BLS256]
Length = 798
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 142/540 (26%), Positives = 231/540 (42%), Gaps = 84/540 (15%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP A P A+I ++ W TY LD DPLF ++ R F+E
Sbjct: 238 VLPAFAGYVPKAFAQAHPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLEL 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVD-------SPEY-----------------------I 97
+ YG Y D F+E PPV + +Y +
Sbjct: 292 YTQAYG-AGEFYLADAFNEMLPPVADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARL 350
Query: 98 SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
++ G A+Y + + A W+MQGWLF D FW+P + A L VP +L+VLD+ +
Sbjct: 351 AAYGQALYRSIAQVNPKATWVMQGWLFGADRAFWQPQAIAAFLGKVPDARLMVLDIGNDR 410
Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVG 211
P W S+ F +I+ +HN+ + +YG +AF ++A ++ + G G
Sbjct: 411 YPGTWKASQAFDNKQWIYGYVHNYGASNPLYG---DVAFYRQDLQALLADPGKRNLRGFG 467
Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
+ EG+ N VVY+ + +A++ + W+ +Y RYGRS A+ AW L +Y
Sbjct: 468 VFPEGLHSNSVVYEYLYALAWEGPQHPWSQWLARYLRARYGRSDAALLSAWTDLEAGIYQ 527
Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
+ P + G Y + +P + D P
Sbjct: 528 TRYWS---------------PRWWNTHAGAYLLFKRPTADIVNFD------DRPG---DP 563
Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVF 390
+ RA++ + + + + YRYDLI+ R L+ A+ +++AY D A G
Sbjct: 564 QRLRRAIDALLQQADRYADAPLYRYDLIEDARHYLSLQADRQLQTVVQAYNAGDFARGDA 623
Query: 391 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 450
QL+R +LV+ +D L+ L ++A + + Y NAR Q+++W +
Sbjct: 624 QLART-TQLVQGLDALVGGQHETLAAWTGQAAAAAGNDARLRRAYVGNARAQVSVWGGDG 682
Query: 451 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 503
L DY +K W G+ D+Y R + + ++G F +L W R+W
Sbjct: 683 N-----LADYASKAWQGMYADFYLQRWTRFLSAYRAARKAGTPFDAQTVDQQLATWERQW 737
>gi|390989490|ref|ZP_10259787.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
gi|372555759|emb|CCF66762.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
Length = 798
Score = 168 bits (426), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 139/540 (25%), Positives = 222/540 (41%), Gaps = 84/540 (15%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP A P A+I ++ W TY LD DPLF ++ R F+E
Sbjct: 238 VLPAFAGYVPRAFAQAHPHARIYRMRAWEGFHE------TYWLDPRDPLFAKLARRFLEL 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPV------------------------------DSPEYI 97
+ YG Y D F+E PPV +
Sbjct: 292 YAQTYG-AGEFYLADAFNEMLPPVADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARL 350
Query: 98 SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
+ G A+Y + + A W+MQGWLF D FW+ + A L VP +L+VLD+ +
Sbjct: 351 AEYGQALYRSIAQVNPKATWVMQGWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDR 410
Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
P W S+ F +I+ +HN+ + +YG AF + + + + G G
Sbjct: 411 YPGTWKASQAFDNKQWIYGYVHNYGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFG 467
Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
+ EG+ N V+Y+ + +A++ + W+ Y RYGRS A+ AW+ L +Y
Sbjct: 468 VFPEGLHSNSVIYEYLYALAWEGPQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQ 527
Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
+ P + G Y + +P + D P
Sbjct: 528 TRYWS---------------PRWWNKRAGAYLLFKRPTADIVDFD------DRPG---DP 563
Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVF 390
+ RA++ + N + + YRYDLI+ R L+ A+ +++AY D A G
Sbjct: 564 QRLRRAIDALLRQANRYADAPLYRYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDA 623
Query: 391 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 450
QL+R +LV +D L+ L ++A + + Y NAR Q+++W +
Sbjct: 624 QLART-TQLVRGLDALVGGQHETLADWTGQAAAATGHDAGLRRAYVGNARAQVSVWGGDG 682
Query: 451 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 503
L DY +K W G+ D+Y R + + ++G F +L W R+W
Sbjct: 683 N-----LADYASKAWQGMYADFYLQRWTRFLSAYRAARKAGTPFDAVAVDHQLATWERQW 737
>gi|329934959|ref|ZP_08285000.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
gi|329305781|gb|EGG49637.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
Length = 1017
Score = 167 bits (424), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 129/526 (24%), Positives = 215/526 (40%), Gaps = 50/526 (9%)
Query: 8 VLPAFSGNVPAALQNVFPS-AKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
VLP + G+VP + A + G W P W LD F E+ +F
Sbjct: 228 VLPGYYGHVPKDFADRRGGDAHVVPQGTWHGFDR-PSW-----LDPRTDAFAEVAASFYR 281
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
Q +G + D E D P ++ G + +++ A W++ GW +
Sbjct: 282 HQEDVFGPAGD-FKMDLLHEGGTAGDVPVPDAARG--VEKALRAARPGATWVILGWEAN- 337
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
+ LL++V +++++D ++ + + + G PY + + NF G +
Sbjct: 338 -------PLPELLDAVDKKRMLIVDGVSDRYTSVTDREEDWGGTPYAFGTIPNFGGRTTI 390
Query: 186 YGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
G I A R + + G E +++P ++L SE+A+ E VD W
Sbjct: 391 -GARTHIWREKFFAWRDKPGSALAGTAYLPEAADRDPAAFELFSELAWTDEPVDRARWFT 449
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
Y+ RYG + AW L+ T Y + +D + + A PD+
Sbjct: 450 GYADFRYGGRDAGARRAWRALHDTAYQQHANERSDPHDSLFCARPDL------------- 496
Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 363
+ + Y L Y + AL +A YRYDL+D+ R
Sbjct: 497 -----------AATRAARYAPAALTYDPARFDAALSGLLAVAAHRRGGAAYRYDLVDVAR 545
Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
QALA + + + A+ DA L+ ++L L+ + + H FLLGPW+E A+
Sbjct: 546 QALAHRSRQYLPQLKAAFDREDAATFKALATQWLTLMRLSEDITGTHPAFLLGPWIEDAR 605
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
++A N + ++E A+ +T+W D +A L +YGN+ W GLL D+Y PR +
Sbjct: 606 RMATNPRERAEFERTAKALVTVWGDRATSDAGNLHEYGNREWHGLLSDFYLPRWQKWLDA 665
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITS 529
++L +G D W W R YP+ GDA T+
Sbjct: 666 CEDALATGTAPAAVD----WFAFEEPWTRERKDYPLRPVGDAYRTA 707
>gi|291301158|ref|YP_003512436.1| alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
44728]
gi|290570378|gb|ADD43343.1| Alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
44728]
Length = 734
Score = 167 bits (424), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 129/518 (24%), Positives = 226/518 (43%), Gaps = 48/518 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP + G +P A+ G W+ S P W LD T F ++ +F
Sbjct: 246 VLPGYFGTIPGGFAKRNQQARTVPQGVWYGF-SRPDW-----LDPTGNEFAKVAASFYRH 299
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + G + +Y D E P P ++ G A+ +Q A W+M GW
Sbjct: 300 QAQLLGE-ADMYKMDLMHEGGDPGGIPIPDAAKGVAL--ALQRARPGATWVMLGW----- 351
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
R +L + +++++D ++ + G PY + + NF G+ +
Sbjct: 352 ---RKNPRTDILTDIDTSRVLIVDGISDRFDDLDREHTWPGTPYAFGTIPNFGGHTTIGA 408
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
A + RT+ ++ + G+ EG ++P ++L +E+A++ + +D+ W Y+
Sbjct: 409 NAKVWAKRFGQWRTAPDSAVSGIAWMPEGAGRDPAAFELFAELAWR-DSIDLGEWFADYA 467
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIV-AFPDVDPSIISVTEGKYQNYG 306
RRYG + + AW+ L + Y G + D + A P +D VT Y
Sbjct: 468 DRRYGGADDNARTAWDALRRSAYAMPSGRWAEAADGLFGARPGLD-----VTHADY---- 518
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
+ L Y + +AL + L ++ YR+DL+D+ RQ+L
Sbjct: 519 ---------------FSPEFLRYDAAVFAQALPALLDVDKSLH-NDAYRFDLVDVARQSL 562
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
EL + A+ + + +R +L+ + +D LL FLLGPWLE+A++ A
Sbjct: 563 VNAGRELLPRVKSAFVNQNKKQFDKHTRTWLDWMRLLDRLLETDRRFLLGPWLEAARRSA 622
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 486
+ ++ K E++ART +++W + + L DY N+ +GL+ D Y R YF + E
Sbjct: 623 RTADEAKDLEYDARTIVSVWGHRSGSDEGRLHDYANRELAGLVSDLYAMRWRRYFDSLAE 682
Query: 487 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD 524
SL+SG + D W L ++W + + + E GD
Sbjct: 683 SLDSGQAPQHID----WFALEHEWASKTDDHATEPKGD 716
>gi|21241480|ref|NP_641062.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. citri str. 306]
gi|21106823|gb|AAM35598.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. citri str. 306]
Length = 798
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 139/540 (25%), Positives = 222/540 (41%), Gaps = 84/540 (15%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP A P A+I ++ W TY LD DPLF ++ R F+E
Sbjct: 238 VLPAFAGYVPRAFAQAHPHARIYRMRAWEGFHE------TYWLDPRDPLFAKLARRFLEL 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPV------------------------------DSPEYI 97
+ YG Y D F+E PPV +
Sbjct: 292 YAQTYG-AGEFYLADAFNEMLPPVADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARL 350
Query: 98 SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
+ G A+Y + + A W+MQGWLF D FW+ + A L VP +L+VLD+ +
Sbjct: 351 AEYGQALYRSIAQVNPKATWVMQGWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDR 410
Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
P W S+ F +I+ +HN+ + +YG AF + + + + G G
Sbjct: 411 YPGTWKASRAFDNKQWIYGYVHNYGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFG 467
Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
+ EG+ N V+Y+ + +A++ + W+ Y RYGRS A+ AW+ L +Y
Sbjct: 468 VFPEGLHSNSVIYEYLYALAWEGPQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQ 527
Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
+ P + G Y + +P + D P
Sbjct: 528 TRYWS---------------PRWWNKRAGAYLLFKRPTADIVDFD------DRPG---DP 563
Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVF 390
+ RA++ + N + + YRYDLI+ R L+ A+ +++AY D A G
Sbjct: 564 QRLRRAIDALLRQANRYADAPLYRYDLIEDARHYLSLQADRQLQAVVQAYDAGDFARGDA 623
Query: 391 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 450
QL+R +LV +D L+ L ++A + + Y NAR Q+++W +
Sbjct: 624 QLART-TQLVRGLDALVGGQHETLADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDG 682
Query: 451 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 503
L DY +K W G+ D+Y R + + ++G F +L W R+W
Sbjct: 683 N-----LADYASKAWQGMYADFYLQRWTRFLSAYRAARKAGTPFDAVAVDHQLATWERQW 737
>gi|372221472|ref|ZP_09499893.1| alpha-N-acetylglucosaminidase [Mesoflavibacter zeaxanthinifaciens
S86]
Length = 712
Score = 165 bits (418), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 144/513 (28%), Positives = 217/513 (42%), Gaps = 57/513 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V+PAFSG +PAAL FP+AKI++L W D TYLLD DPLF EIG+ FIE
Sbjct: 237 VVPAFSGYIPAALAEKFPNAKISELNGWSGGGFD----STYLLDPKDPLFKEIGKRFIEL 292
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSL---GAAIYSGMQSGDSDAVWLMQGWLF 124
+EYG+ + Y D+F+E TPPV + + L G IY + A W+MQGWLF
Sbjct: 293 YNQEYGKAEY-YLADSFNEVTPPVSTENKLDELAAYGQVIYETLNEAAPGATWVMQGWLF 351
Query: 125 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 183
+D FW + A L+ VP KL++ D + +W FYG + + +HN+ G+
Sbjct: 352 GHDAYFWEKDAVIAFLSKVPNDKLIIQDFGNDRYKVWEKQDAFYGKQWTYGYVHNYGGSN 411
Query: 184 EMYGILDSIAFGPVEA----RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDV 239
+YG D F E ++T ++G G+ EG+ QN +VY+ + ++ + K+ V
Sbjct: 412 PIYGDFD---FYKEEINYLLEHDKSTKVLGYGVMPEGLHQNSMVYEYLYDLPWD-SKIPV 467
Query: 240 KAWINQYSVRRYGRS-VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVT 298
K W+ RYG+ AW L VY+ P +
Sbjct: 468 KDWLKTNIKARYGKDFTKETLTAWIKLDSAVYSTKYWT---------------PRWWNDQ 512
Query: 299 EGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 358
G Y + +P + K HP T+ + + N+ + + D
Sbjct: 513 AGAYLLFKQPSKEITAFKG------HP-----TNLKLLEEANLLLEKNK-ENNPLIQEDF 560
Query: 359 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 418
I R L+ + L AY ND L +F L++ + LL L W
Sbjct: 561 IAHKRHELSLKIDTLLQQATYAYINNDFEKGDSLQLQFHTLIDSTEQLLENSKLDRLDYW 620
Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
++ A E + Y+ NAR I W L +Y ++ W + Y R
Sbjct: 621 VQEATNYGDTPETKAFYKKNARLLINQWGG-----VGNLNNYASRAWKDQYQLLYKTRWD 675
Query: 479 IYFKYMIESLESGDGF-------RLKDWRREWI 504
IY + + E G +K+W W+
Sbjct: 676 IYLGSLRVNSELGGELNQERIEQNIKEWDELWL 708
>gi|381169859|ref|ZP_09879021.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380689629|emb|CCG35508.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 798
Score = 165 bits (418), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 139/540 (25%), Positives = 222/540 (41%), Gaps = 84/540 (15%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP A P A+I ++ W TY LD DPLF ++ R F+E
Sbjct: 238 VLPAFAGYVPRAFAQAHPHARIYRMRAWEGFHE------TYWLDPRDPLFAKLARRFLEL 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPV------------------------------DSPEYI 97
+ YG Y D F+E PPV +
Sbjct: 292 YAQTYG-AGEFYLADAFNEMLPPVADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARL 350
Query: 98 SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
+ G A+Y + + A W+MQGWLF D FW+ + A L VP +L+VLD+ +
Sbjct: 351 AEYGQALYRSIAQVNPKATWVMQGWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDR 410
Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
P W S+ F +I+ +HN+ + +YG AF + + + + G G
Sbjct: 411 YPGTWKASQAFDNKQWIYGYVHNYGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFG 467
Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
+ EG+ N V+Y+ + +A++ + W+ Y RYGRS A+ AW+ L +Y
Sbjct: 468 VFPEGLHSNSVIYEYLYALAWESPQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQ 527
Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
+ P + G Y + +P + D P
Sbjct: 528 TRYWS---------------PRWWNKRAGAYLLFKRPTADIVDFD------DRPG---DP 563
Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVF 390
+ RA++ + N + + YRYDLI+ R L+ A+ +++AY D A G
Sbjct: 564 QRLRRAIDALLRQANRYADAPLYRYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDA 623
Query: 391 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 450
QL+R +LV +D L+ L ++A + + Y NAR Q+++W +
Sbjct: 624 QLART-TQLVRGLDALIGGQYETLADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDG 682
Query: 451 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 503
L DY +K W G+ D+Y R + + ++G F +L W R+W
Sbjct: 683 N-----LADYASKAWQGMYADFYLQRWTRFLSAYRAARKAGTPFDAVAVDHQLATWERQW 737
>gi|418520969|ref|ZP_13087015.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410702945|gb|EKQ61442.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
Length = 798
Score = 165 bits (417), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 140/540 (25%), Positives = 222/540 (41%), Gaps = 84/540 (15%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP A P A+I ++ W TY LD DPLF ++ R F+E
Sbjct: 238 VLPAFAGYVPRAFAQAHPHARIYRMRAWEGFHE------TYWLDPRDPLFAKLARRFLEL 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPV------------------------------DSPEYI 97
+ YG Y D F+E PPV +
Sbjct: 292 YAQTYG-AGEFYLADAFNEMLPPVADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARL 350
Query: 98 SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
+ G A+Y + + A W+MQGWLF D FW+ + A L VP +L+VLD+ +
Sbjct: 351 AEYGQALYRSIAQVNPKATWVMQGWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDR 410
Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
P W S+ F +I+ +HN+ + +YG AF + + + + G G
Sbjct: 411 YPGTWKASQAFDNKQWIYGYVHNYGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFG 467
Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
+ EG+ N V+Y+ + +A++ + W+ Y RYGRS A+ AW+ L +Y
Sbjct: 468 VFPEGLHSNSVIYEYLYALAWEGPQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQ 527
Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
+ P + G Y + +P + A D P
Sbjct: 528 TRYWS---------------PRWWNKRAGAYLLFKRPTADIADFD------DRPG---DP 563
Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVF 390
+ RA++ + N + + YRYDLI+ R L+ A+ +++AY D A G
Sbjct: 564 QRLRRAIDALLQQANRYADAPLYRYDLIEDARHYLSLQADRQLQAVVQAYDAGDFARGDA 623
Query: 391 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 450
QL+R +LV +D L+ L ++A + + Y NAR Q+++W +
Sbjct: 624 QLART-TQLVRGLDALVGGQYETLADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDG 682
Query: 451 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 503
L DY +K W G+ D+Y R + + +G F +L W R+W
Sbjct: 683 N-----LADYASKAWQGMYADFYLQRWTRFLSAYRAARMAGTPFDAVAMDHQLATWERQW 737
>gi|456388164|gb|EMF53654.1| alpha-N-acetylglucosaminidase [Streptomyces bottropensis ATCC
25435]
Length = 732
Score = 165 bits (417), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 133/519 (25%), Positives = 226/519 (43%), Gaps = 47/519 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
V P + G VPA P A+ G W + P W LD F + AF
Sbjct: 243 VFPGYFGTVPAGFAERVPGARTVPQGEWMGF-ARPDW-----LDPRTDDFARVAAAFYRV 296
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + +G +S +Y D E P D P ++ + ++ A W++ GW +
Sbjct: 297 QEEMFGPSS-LYKMDLLHEGGDPGDVP--VADAAKGVERALRRSRPGATWVILGWQHN-- 351
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYG-VPYIWCMLHNFAGNIEMY 186
PP +A++++V ++V+D ++ P + + +G PY + + NF G+ +
Sbjct: 352 ----PP--RAIVDAVDKQHMLVVDGLSDRFPTVTDREADWGDTPYAFGSIWNFGGHTALG 405
Query: 187 GILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQY 246
A + RT + + + G+ + E + NP + L SE+A++ ++D+K W ++
Sbjct: 406 ANTPDWAALYEKWRTKDGSRLHGIALMPEAADNNPAAFALFSELAWREGELDLKTWFAEW 465
Query: 247 SVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG 306
+ RYG P + AW++L T Y T S +EG +G
Sbjct: 466 AHARYGGRDPHAEAAWDILRRTAYGTT-------------------RADSWSEGADGLFG 506
Query: 307 KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQAL 366
S+ A+ + L Y ++ AL + EL AS+ YR DL+D+ RQAL
Sbjct: 507 ---SRPALNAVRAGRWSPKQLRYDAADFEPALGEMLRVRPELRASSAYRRDLLDVARQAL 563
Query: 367 AKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 426
+ + + I AY DA + SR +L L++ +D L+A LLG W+ A+
Sbjct: 564 SNRSRVMLPQIKAAYDAKDATRLAAASRDWLSLMDLLDELVATDSRHLLGRWVADARSWG 623
Query: 427 QNEEQEKQYEWNARTQITMWFDNTQEEASL-LRDYGNKYWSGLLRDYYGPRAAIYFKYMI 485
+ + ++ + +T+W T+E A LRDY N+ W+GL+ Y R + YF+ +
Sbjct: 624 AGAAERTELGYDNLSLLTVW--GTREGADAGLRDYANREWAGLVGGLYRLRWSTYFEELR 681
Query: 486 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD 524
+L +G + D W L + W E GD
Sbjct: 682 AALAAGRAPKKID----WFALEDRWARNPGPLATEPTGD 716
>gi|418515337|ref|ZP_13081518.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
gi|410708056|gb|EKQ66505.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
Length = 782
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 140/540 (25%), Positives = 222/540 (41%), Gaps = 84/540 (15%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP A P A+I ++ W TY LD DPLF ++ R F+E
Sbjct: 222 VLPAFAGYVPRAFAQAHPHARIYRMRAWEGFHE------TYWLDPRDPLFAKLARRFLEL 275
Query: 68 QLKEYGRTSHIYNCDTFDENTPPV------------------------------DSPEYI 97
+ YG Y D F+E PPV +
Sbjct: 276 YAQTYG-AGEFYLADAFNEMLPPVADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARL 334
Query: 98 SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
+ G A+Y + + A W+MQGWLF D FW+ + A L VP +L+VLD+ +
Sbjct: 335 AEYGQALYRSIAQVNPKATWVMQGWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDR 394
Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
P W S+ F +I+ +HN+ + +YG AF + + + + G G
Sbjct: 395 YPGTWKASQAFDNKQWIYGYVHNYGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFG 451
Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
+ EG+ N V+Y+ + +A++ + W+ Y RYGRS A+ AW+ L +Y
Sbjct: 452 VFPEGLHSNSVIYEYLYALAWEGPQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQ 511
Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
+ P + G Y + +P + A D P
Sbjct: 512 TRYWS---------------PRWWNKRAGAYLLFKRPTADIADFD------DRPG---DP 547
Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVF 390
+ RA++ + N + + YRYDLI+ R L+ A+ +++AY D A G
Sbjct: 548 QRLRRAIDALLQQANRYADAPLYRYDLIEDARHYLSLQADRQLQAVVQAYDAGDFARGDA 607
Query: 391 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 450
QL+R +LV +D L+ L ++A + + Y NAR Q+++W +
Sbjct: 608 QLART-TQLVRGLDALVGGQYETLADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDG 666
Query: 451 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 503
L DY +K W G+ D+Y R + + +G F +L W R+W
Sbjct: 667 N-----LADYASKAWQGMYADFYLQRWTRFLSAYRAARMAGTPFDAVAMDHQLATWERQW 721
>gi|294627661|ref|ZP_06706243.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
str. ICPB 11122]
gi|292598013|gb|EFF42168.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
str. ICPB 11122]
Length = 798
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 143/542 (26%), Positives = 228/542 (42%), Gaps = 88/542 (16%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP A P A+I ++ W TY LD DPLF ++ R F+E
Sbjct: 238 VLPAFAGYVPRAFAQAHPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLEL 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPV------------------------------DSPEYI 97
+ YG Y D F+E PPV +
Sbjct: 292 YTQTYG-AGEFYLADAFNEMLPPVADDGSDVAAAKYGDSVANSDAARAKAVPPAQRDARL 350
Query: 98 SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
+ G A+Y + + A W+MQGWLF D FW+ + A L VP +L+VLD+ +
Sbjct: 351 AEYGQALYRSIAQVNPKATWVMQGWLFGADREFWQAQAIAAFLGKVPDARLMVLDIGNDR 410
Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVG 211
P W S+ F +I+ +HN+ + +YG AF ++A ++ + G G
Sbjct: 411 YPGTWKASQAFDNKQWIYGYVHNYGASNPLYG---DFAFYRQDLQALLADPGKRNLRGFG 467
Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
+ EG+ N V+Y+ + +A++ + W+ Y RYGRS A+ AW L +Y
Sbjct: 468 VFPEGLHSNSVIYEYLYALAWEGPQQSWSQWLTHYLRARYGRSDAALLGAWADLEAGIYQ 527
Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
+ P + G Y + +P + ++ + D L
Sbjct: 528 TRYWS---------------PRWWNKRAGAYLLFKRPTAD--IVDFDDCPGDPQRL---- 566
Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVF 390
RA++ + N + + YRYDLI+ R L+ A+ +++AY D A G
Sbjct: 567 ---RRAIDALLQQANRYADAPLYRYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDA 623
Query: 391 QLSRRFLELVEDMDGLLAC-HDGFLLGPWL-ESAKQLAQNEEQEKQYEWNARTQITMWFD 448
QL+R +LV +D L+ HD L W ++A + + Y NAR Q+++W
Sbjct: 624 QLART-TQLVRGLDALVGGQHD--TLADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGG 680
Query: 449 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRR 501
+ L DY +K W G+ D+Y R + + ++G F +L W R
Sbjct: 681 DGN-----LADYASKAWQGMYADFYLQRWTRFLSAYRAARKAGTPFDAVAVDHQLAAWER 735
Query: 502 EW 503
+W
Sbjct: 736 QW 737
>gi|398786493|ref|ZP_10549210.1| alpha-N-acetylglucosaminidase [Streptomyces auratus AGR0001]
gi|396993639|gb|EJJ04702.1| alpha-N-acetylglucosaminidase [Streptomyces auratus AGR0001]
Length = 1048
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 126/528 (23%), Positives = 217/528 (41%), Gaps = 47/528 (8%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP + G VP P A+ G W + + P W LD +F + AF
Sbjct: 243 VLPGYFGTVPGGFAARNPGARTVPQGTWSGL-ARPDW-----LDPRTEVFAKTAAAFYRH 296
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q G H + D E P D P + A+ +++ A W++ GW +
Sbjct: 297 QEHLLGPADH-FKMDLLHEGGDPGDVP--VPDAARAVEKALRTARPGATWVILGWQNN-- 351
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG 187
+ LL++V +++++D ++++ + + + GVPY + + NF G +
Sbjct: 352 ------PRRDLLDAVDHDRMLIVDGLSDLETVTDRERDWGGVPYAFGSIPNFGGRTTIGA 405
Query: 188 ILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 247
A R + + G E E++P ++L SE+A++ VD AW + Y+
Sbjct: 406 KTHVWAERFPAWRDKPGSRLAGTAYMPEAAERDPAAFELFSELAWRERPVDRAAWFDGYA 465
Query: 248 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 307
RYG + A+ L + Y + + V A PD+
Sbjct: 466 DLRYGARDKGARAAFAALGTSAYEISSKDGRPHDSVFAARPDLA---------------- 509
Query: 308 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 367
V + T ++D + A + L S+ YR DL D RQALA
Sbjct: 510 -ARSGTVYATHTPAFD-------PAAFDTAFAALLTVRPALRGSDAYRRDLTDTARQALA 561
Query: 368 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 427
+ +L + +AY+ D LS +L L+ + + H FLLGPWL A+ +A
Sbjct: 562 NRSWQLIGQLQDAYRRKDRATFRALSGLWLHLMRLSEDVTGAHRQFLLGPWLTDARAMAS 621
Query: 428 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
E+E + E +AR +T W D + L +Y N+ W GL+ + + P+ Y + ++
Sbjct: 622 GPEEEARLEHSARALLTTWADRPTADGGSLANYANRDWHGLIGEVHLPQWQAYLGELADA 681
Query: 488 LESGDGFRLKDW-RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYN 534
L + + DW RE W + R P+ +A T++ +++
Sbjct: 682 LAADRPPKPFDWYARE-----EPWTHERTTPPLHPTTEAYRTARRVHD 724
>gi|329940646|ref|ZP_08289927.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
gi|329300707|gb|EGG44604.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
Length = 798
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 129/502 (25%), Positives = 211/502 (42%), Gaps = 50/502 (9%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP + G VP A+ G W P W LD LF + AF
Sbjct: 255 VLPGYFGTVPPGFAARNRGARTVPQGTWMGFDR-PDW-----LDPRTDLFARVAAAFYRV 308
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
Q + +G ++H Y D E D P + + ++ DAVW++ GW +
Sbjct: 309 QGELFGASTH-YKMDLLHEGGTAGDVP--VGEAAKGVERALRRARPDAVWVLLGWRHN-- 363
Query: 128 PFWRPPQMKALLNSVPLG---------KLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLH 177
PP +A+L++V G +L+V+D ++ P + + GVPY + +
Sbjct: 364 ----PP--RAILDAVASGGPDGAAGRERLLVVDGLSDRFPTVTDREADWGGVPYAFGSIW 417
Query: 178 NFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 237
NF G+ + A RT E + + G+ + E + NP + L SE+ + ++
Sbjct: 418 NFGGHTTLGANTPDWARLYEAWRTKEGSALRGIALLPEAADNNPAAFALFSELPWHEGEL 477
Query: 238 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 297
D+KAW +++ RYG + AW+VL T Y T S
Sbjct: 478 DLKAWFARWARSRYGAYDAHAEAAWDVLRRTAYGTT-------------------RADSW 518
Query: 298 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 357
+EG +G ++ ++ +S+ L Y E RAL+ + L S+ YR D
Sbjct: 519 SEGADGLFG---ARPSLTARRAASWSPKELRYDAHEFERALDELLKVRPGLRESSAYRRD 575
Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
L+D+ RQ L+ + L I A D S +L L++ ++ L+ LLG
Sbjct: 576 LLDVARQCLSNRSRALLPRIARACAARDVKAFDAASGDWLSLMDLLERLVGTDARHLLGR 635
Query: 418 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 477
W A+ +E + + +++A + +T+W EA LRDY N+ W+GL+ Y R
Sbjct: 636 WTAQARAWGADEAERDRLQYDALSLLTVWGTRQGAEAG-LRDYANREWAGLVGGLYRLRW 694
Query: 478 AIYFKYMIESLESGDGFRLKDW 499
+ YF + +L G DW
Sbjct: 695 STYFTELRAALTEGRAPAAVDW 716
>gi|429201402|ref|ZP_19192867.1| Tat pathway signal sequence domain protein [Streptomyces ipomoeae
91-03]
gi|428663010|gb|EKX62401.1| Tat pathway signal sequence domain protein [Streptomyces ipomoeae
91-03]
Length = 1042
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 128/522 (24%), Positives = 214/522 (40%), Gaps = 50/522 (9%)
Query: 8 VLPAFSGNVPAA-LQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
VLP + G+VP ++ A + G W + P W LD F ++ +F
Sbjct: 249 VLPGYYGHVPKEFVERNGGDAHVVPQGVWHGFER-PDW-----LDPRTDSFAKVAASFYG 302
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
Q +G +H + D E D P + + +Q A W++ GW
Sbjct: 303 HQEDVFGEAAH-FKMDLLHEGGTAGDVP--VPGAAQGVERALQKARPGATWVILGW--QE 357
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
+P + LL+++ +++++D ++ + + + G PY + + NF G +
Sbjct: 358 NP------LPELLDAIDKSRMLIVDGVSDRYTSVTDRERDWGGTPYCFGTIPNFGGRTTI 411
Query: 186 YGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
G I A R N+ + G E +++P ++L SE+A+ K+D AW +
Sbjct: 412 -GARAHIWNEKFFAWRDKANSALAGTAFMPEATDRDPAAFELFSELAWTPTKIDRAAWFS 470
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTD-GATDKNRDVIVAFPDVDPSIISVTEGKYQ 303
Y+ RYG + + AW L+ T Y +D + + A PD+
Sbjct: 471 AYADYRYGARDDSARRAWRALHDTAYQQRAVERSDPHDSLFCARPDL------------- 517
Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 363
+ Y L Y AL + L S Y+YD++D+ R
Sbjct: 518 -----------AADRAAEYAPRALTYDPGRFDAALAGLLGVAGGLRGSAAYKYDVVDVAR 566
Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
QALA + + + AYQ D LS +L L+ D + + FLLGPW+ A+
Sbjct: 567 QALAHRSRQYLPQLRAAYQRKDLATFRALSTLWLRLMRLSDEVTGANSAFLLGPWVNDAR 626
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
LA N+ + ++E A+ IT+W +A L +YGN+ W GL+ D+Y PR +
Sbjct: 627 LLATNDAERAEFERTAKVLITVWGGRATSDAGDLHEYGNREWHGLMADFYVPRWEKWLDT 686
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDA 525
+ ++L +G D W W R Y + GDA
Sbjct: 687 LEDALATGTAPAAVD----WFAFEEPWTRERKDYALRPVGDA 724
>gi|294667089|ref|ZP_06732314.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
str. ICPB 10535]
gi|292603099|gb|EFF46525.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
str. ICPB 10535]
Length = 798
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 145/542 (26%), Positives = 228/542 (42%), Gaps = 88/542 (16%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP A P A+I ++ W TY LD DPLF ++ R F+E
Sbjct: 238 VLPAFAGYVPRAFAQAHPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLEL 291
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVD-------SPEYISSL-------------------- 100
+ YG Y D F+E PPV + +Y S+
Sbjct: 292 YTQTYG-AGEFYLADAFNEMLPPVADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARL 350
Query: 101 ---GAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
G A+Y + + A W+MQGWLF D FW+ + A L VP +L+VLD+ +
Sbjct: 351 AEYGQALYRSIAQVNPKATWVMQGWLFGADREFWQAQAIAAFLGKVPDARLMVLDIGNDR 410
Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVG 211
P W S+ F +I+ +HN+ + +YG AF ++A ++ + G G
Sbjct: 411 YPGTWKASQAFDNKQWIYGYVHNYGASNPLYG---DFAFYRQDLQALLADPGKRNLRGFG 467
Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
+ EG+ N V+Y + +A++ + W+ Y RYGRS A+ AW L +Y
Sbjct: 468 VFPEGLHSNSVIYAYLYALAWEGPQQSWSQWLTHYLRARYGRSDAALLGAWADLEAGIYQ 527
Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 331
+ P + G Y + +P + D P
Sbjct: 528 TRYWS---------------PRWWNKRAGAYLLFKRPTADIVDFD------DRPG---DP 563
Query: 332 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVF 390
+ RA++ + N + + YRYDLI+ R L+ A+ +++AY D A G
Sbjct: 564 QRLRRAIDALLQQANRYADAPLYRYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDA 623
Query: 391 QLSRRFLELVEDMDGLLA-CHDGFLLGPWL-ESAKQLAQNEEQEKQYEWNARTQITMWFD 448
QL+R +LV +D L+ HD L W ++A + + Y NAR Q+++W
Sbjct: 624 QLART-TQLVRGLDALVGDQHD--TLADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGG 680
Query: 449 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRR 501
+ L DY +K W G+ D+Y R + + ++G F +L W R
Sbjct: 681 DGN-----LADYASKAWQGMYADFYLQRWTRFLSAYRAARKAGTPFDAVTVDHQLAAWER 735
Query: 502 EW 503
+W
Sbjct: 736 QW 737
>gi|325922205|ref|ZP_08183992.1| Alpha-N-acetylglucosaminidase (NAGLU) [Xanthomonas gardneri ATCC
19865]
gi|325547324|gb|EGD18391.1| Alpha-N-acetylglucosaminidase (NAGLU) [Xanthomonas gardneri ATCC
19865]
Length = 807
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 138/545 (25%), Positives = 225/545 (41%), Gaps = 90/545 (16%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP A P+A+I ++ W TY LD DPLF ++ R F+E
Sbjct: 239 VLPAFAGYVPKAFAQAHPNARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLEL 292
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVD-------SPEYISSL-------------------- 100
+ YG Y D F+E PPV + +Y S+
Sbjct: 293 YTQTYG-AGEFYLADAFNEMLPPVADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARL 351
Query: 101 ---GAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
G A+Y + + A W+MQGWLF D FW+P + A L VP +L+VLD+ +
Sbjct: 352 AEYGQALYRSIAQVNPQATWVMQGWLFGADREFWQPQAIAAFLGKVPDARLMVLDIGNDR 411
Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART----SENTTMVGVG 211
P W S+ F +I+ +HN+ + +YG AF + + + + G G
Sbjct: 412 YPGTWKASQAFDNKQWIYGYVHNYGASNPLYG---DFAFYRQDLQALLADPDKRNLRGFG 468
Query: 212 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 271
+ EG+ N VVY+ + +A++ + W+ QY+ RYG S A+ AW+
Sbjct: 469 VFPEGLHSNSVVYEYLYALAWEGPQQSWSQWLTQYTRARYGHSDAALLQAWS-------- 520
Query: 272 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY----DHPHL 327
D+D I + + K + K T+ D P
Sbjct: 521 -----------------DLDAGIYQTRYWSLRWWNKRAGAYLLFKRPTADIVGFDDRPG- 562
Query: 328 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 387
+ RA++ + + + + YRYDLI+ R L+ +A+ +++AY D
Sbjct: 563 --DPQRLRRAIDALLQQADRYADAPLYRYDLIEDARHYLSLHADRQLQAVVQAYGTGDFA 620
Query: 388 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 447
L R LV+ +D L+ L ++A + + Y NAR Q+++W
Sbjct: 621 RGDALLARTTRLVQGLDALVGGQHETLADWTDQAAAAAGDDAALRRVYVGNARAQVSVWG 680
Query: 448 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWR 500
+ L DY +K W G+ ++Y R + + ++G F +L W
Sbjct: 681 GDGN-----LADYASKAWQGMYAEFYLQRWTRFLSAYRAARKAGTPFDEAAFNKQLAAWE 735
Query: 501 REWIK 505
R+W +
Sbjct: 736 RQWAE 740
>gi|409097333|ref|ZP_11217357.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Pedobacter agri
PB92]
Length = 724
Score = 162 bits (411), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 134/520 (25%), Positives = 217/520 (41%), Gaps = 50/520 (9%)
Query: 8 VLPAFSGNVPAALQN---VFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAF 64
+L F G VP L A+I GNW + + +L T+ F + +
Sbjct: 232 ILQGFYGMVPHDLNKKVAALKDAQIIDQGNWVFTE----FIRPAILAPTNDKFNTVADVY 287
Query: 65 IEQQLKEYGRTSHIYNCDTFDENTPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 122
+ K YG + + F E VD I+++ ++ MQ ++ W++QGW
Sbjct: 288 YSELKKLYGSDIKFFGGEPFHEGGKKGGVD----ITAVAKSVQDVMQKNFPNSTWVLQGW 343
Query: 123 LFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN 182
+ ALL + ++++LF E W K + G +IW + NF
Sbjct: 344 --------QNNPADALLAGLKKENTLIIELFGENTSNWEQRKGYGGTNFIWSNVSNFGEK 395
Query: 183 IEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
+YG L + S + GVG+ EGI NPV YDLM ++A+++EK +
Sbjct: 396 NGLYGRLQRFLDEVYRIKQSPYKDYLKGVGIIPEGINNNPVAYDLMLDIAWRNEKPPLDK 455
Query: 242 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 301
WI Y+ RYG + DAW V TVY+ +K + V P +E
Sbjct: 456 WITDYTTYRYGSYNKDVADAWKVFTETVYSSP--VNEKGKIVYQEGP---------SESI 504
Query: 302 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 361
Y ++ ++ + SS+ Y T +A+ LFI + + S TY+ D D
Sbjct: 505 Y------CARPSLKVNPVSSWGTRKRNYDTKLFKQAVALFIKAETQFKNSETYQTDKTDF 558
Query: 362 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 421
RQ +A ++ + +I A Q D + + + FL ++ D LL + F L WL
Sbjct: 559 LRQVMADKGDQAYDELINAIQAKDKNAIKEKGNHFLTMILQQDSLLNNNHFFTLNRWLNQ 618
Query: 422 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
A L + K +NA+ QIT W + + + LRDY +K W GLL Y R ++
Sbjct: 619 AVALGKGLPDAKNILFNAKAQITFWGPDNNPKTT-LRDYAHKEWGGLLSSLYYNRWKLFI 677
Query: 482 KYMI-ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE 520
+ + + S F + W N+YP++
Sbjct: 678 DDALNDKITSASTF---------YDMEVKWSKDSNLYPIK 708
>gi|365876979|ref|ZP_09416485.1| alpha-N-acetylglucosaminidase [Elizabethkingia anophelis Ag1]
gi|442587289|ref|ZP_21006107.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Elizabethkingia
anophelis R26]
gi|365755253|gb|EHM97186.1| alpha-N-acetylglucosaminidase [Elizabethkingia anophelis Ag1]
gi|442562959|gb|ELR80176.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Elizabethkingia
anophelis R26]
Length = 712
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 131/514 (25%), Positives = 220/514 (42%), Gaps = 54/514 (10%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL F G VP L+N AK+ + G W P +LD T LF +I + +
Sbjct: 230 VLQGFYGMVPHDLKNKISEAKVIEQGKWAGEFQRPG-----ILDPTTKLFSKIADTYYTE 284
Query: 68 QLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
YG H + + F E T +D + S I + MQ ++ W++QGW
Sbjct: 285 MKNLYGEDIHYFGGEPFHEGGKTNGLDLKNVVES----IQTSMQKSYPNSTWVLQGW--Q 338
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
+P LL + ++++LF E W K + G +IW + NF +
Sbjct: 339 QNP------SDGLLAGLKKENTLIIELFGENTANWEKRKGYGGTSFIWSNVSNFGEKNGL 392
Query: 186 YGILDSIAFGPVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
YG L A+ S + G+G+ EGI NPV YDLM ++A+ EK + W+
Sbjct: 393 YGKLQRFIDEVFRAKESIYGANLKGIGIIPEGIFNNPVAYDLMLDIAWYSEKPILDQWLT 452
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATD-KNRDVIVAFPDVDPSIISVTEGKYQ 303
+Y+ RYG+ + AW T+Y+ D + + + A P ++ + +S + +
Sbjct: 453 EYTKYRYGKENQDVIQAWKEFAQTIYSSPDVYQEGPSESIYCARPSLNVNPVSSWGTRKR 512
Query: 304 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 363
NY + KEAV ++F+ + + S TY+ D D R
Sbjct: 513 NYDQSRFKEAV------------------------KVFVKADTDFKDSETYQTDKTDFLR 548
Query: 364 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
Q A + ++ +I+A + + +FLE++ + LL + F L L+ A+
Sbjct: 549 QVWANKGDVVYDELIKAIHEKKTTKIQKSGHQFLEMISIQNMLLGNNRYFTLNRLLKEAE 608
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
+ + +NA++Q+T W + + LRDY +K W+GLL Y R +K
Sbjct: 609 HFGEKLPDAQNVMFNAKSQLTYWGPDNNPKTD-LRDYAHKEWNGLLSSLYYNR----WKV 663
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVY 517
IE +SG + + + +W G+N+Y
Sbjct: 664 FIEQAQSG----IITAPEVFYNMEVEWSKGKNMY 693
>gi|16124795|ref|NP_419359.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus CB15]
gi|221233511|ref|YP_002515947.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus NA1000]
gi|13421729|gb|AAK22527.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus CB15]
gi|220962683|gb|ACL94039.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus NA1000]
Length = 770
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 128/503 (25%), Positives = 205/503 (40%), Gaps = 71/503 (14%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAF G VP A P A+I ++ W TY LD DPLF +I F+
Sbjct: 239 ILPAFGGYVPKAFAEKNPKARIYRMRPWEGFHE------TYWLDPADPLFAKIAARFLAL 292
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVD-------------------------------SPEY 96
+ +G ++ Y D+F+E PP++ +
Sbjct: 293 YTETFGAGTY-YLADSFNEMLPPINADGADARDAAYGDGTANTAVTKTKVEVDPALKAQR 351
Query: 97 ISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAE 155
+++ G AIY ++ DAVW+MQGWLF D FW P + A L+ VP KL++LD+ +
Sbjct: 352 LAAYGKAIYDSIRQTRPDAVWVMQGWLFGADSHFWDPAAISAYLSLVPDDKLMILDIGND 411
Query: 156 VKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTMVGVGMS 213
P +W +K F G P+I+ +HN+ G+ +YG L P A + + G GM
Sbjct: 412 RYPNVWKNAKAFGGKPWIYGYVHNYGGSNPVYGDLGFYRQDIPAIAANPDAGKLAGFGMF 471
Query: 214 MEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT 273
EG+ N +VY+ + ++A+ + W+ +Y+ RYG++ PA+ A L ++
Sbjct: 472 PEGLHNNSIVYEAVYDLAWSEGQASPATWLTRYARARYGKTSPALDAALGQLVEAAFSTR 531
Query: 274 DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSE 333
+ P G Y + +P + D P ++
Sbjct: 532 YWS---------------PRWWKSKAGAYLFFKRPTATVG---------DFPQHPGDRAK 567
Query: 334 VIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLS 393
+ A++ A + DL D TR ++L + AY+ D
Sbjct: 568 LEAAVKALTALAPTYGQEPLFVLDLTDATRHLATMKIDDLLQVAVAAYRRGDTAAGDAAR 627
Query: 394 RRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEE 453
L +D LL L W++ A+ Y NA+ Q+T+W
Sbjct: 628 VEIEALALSIDKLLGVQPD-TLATWIDEARAYGDTPADAAAYVANAKAQVTIWGGEGN-- 684
Query: 454 ASLLRDYGNKYWSGLLRDYYGPR 476
L DY +K W GL + +Y PR
Sbjct: 685 ---LNDYASKAWQGLYKSFYLPR 704
>gi|29832531|ref|NP_827165.1| alpha-N-acetylglucosaminidase [Streptomyces avermitilis MA-4680]
gi|29609651|dbj|BAC73700.1| putative alpha-N-acetylglucosaminidase, secreted [Streptomyces
avermitilis MA-4680]
Length = 1038
Score = 156 bits (395), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 125/526 (23%), Positives = 215/526 (40%), Gaps = 50/526 (9%)
Query: 8 VLPAFSGNVPAA-LQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
VLP + G+VP ++ A + G W + P W LD F + ++F
Sbjct: 249 VLPGYYGHVPKGFVERNGGDAHVVPQGIWHGFER-PDW-----LDPRTASFAAVAKSFYR 302
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
Q +G+ +H + D E D P + + +Q+ A W++ GW
Sbjct: 303 HQKDVFGKAAH-FKMDLLHEGGTAGDVP--VPGAARGVEKALQAAHPGATWVILGW--EA 357
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
+P + ALL+++ K++++D ++ + K + G PY + + NF G +
Sbjct: 358 NP------LPALLDAIDKKKMLIVDGVSDRYTSVTDREKDWGGTPYAFGTIPNFGGRTTI 411
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
R + + G E +++P ++L SE+A+ K+D AW +
Sbjct: 412 GARAHLWNEKFFAWRDKAGSALAGTAYLPEAADRDPAAFELFSELAWSAGKIDRAAWFSS 471
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
Y+ RYG + Q AW L+ T Y +D + + A PD+
Sbjct: 472 YADFRYGGRDASAQKAWRALHDTAYQQHAVERSDAHDSLFCARPDL-------------- 517
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+ + Y L Y AL + L S Y YDL+D+ RQ
Sbjct: 518 ----------AANRAAEYAPRALTYDPGRFDAALSGLLGVAGGLRGSAAYTYDLVDVARQ 567
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVF-QLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 423
ALA + + +L ++ A F L+ +L L+ D + H FLLGPW+ A+
Sbjct: 568 ALAHRSRQ-YLPLLRAAYARKDAAAFTSLATLWLRLMGLSDEVTGTHPAFLLGPWINDAR 626
Query: 424 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 483
LA + + ++E A+ +T+W +A L +Y + W+GL+ D+Y PR +
Sbjct: 627 LLATDAGERAEFERTAKVLLTVWGGRATSDAGDLHEYAGREWNGLMADFYLPRWKKWLDA 686
Query: 484 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITS 529
+ ++L +G D W + W R YP+ GD T+
Sbjct: 687 LADALATGTPPAAVD----WFAVEEPWTRERKDYPLRPVGDPYRTA 728
>gi|383643231|ref|ZP_09955637.1| N-acetylglucosaminidase [Sphingomonas elodea ATCC 31461]
Length = 778
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 134/557 (24%), Positives = 215/557 (38%), Gaps = 82/557 (14%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
+LPAFSG VP A P AKI Q+ W TY LD +DPLF + F++
Sbjct: 246 ILPAFSGYVPEAFAKAHPEAKIYQMRQWEGFPG------TYWLDPSDPLFARLAARFLQL 299
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDS-------------------------PEYI----- 97
YG Y D F+E PP+ P+ +
Sbjct: 300 YTATYG-PGEYYLADAFNEMVPPIAEDGSDARAATYGDAIANTAATRAAALPKEVRDARL 358
Query: 98 SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 156
++ G +Y + + +A W+MQGWLF D FW P + A L+ VP ++++LD+ +
Sbjct: 359 AAYGERLYRSITAAAPNATWVMQGWLFGADKAFWTPDAIAAFLSKVPDERMLILDIGNDR 418
Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSM 214
P IW+ ++ FYG + + +HN+ G+ +YG L A + + M G G+
Sbjct: 419 YPGIWNATRAFYGKGWAYGYVHNYGGSNPVYGDLAFYRSDITAALANPGHGRMRGFGLFP 478
Query: 215 EGIEQNPVVYDLMSEMAFQHEKVDVK-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 269
EG+ N + Y ++A+ K AWI Y+ RYG++ PA+ AW+
Sbjct: 479 EGLHSNGIAYAYAYDLAWGEIDATGKARPLDAWIGDYTRARYGKTSPALVAAWDKAIAGA 538
Query: 270 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 329
Y P G Y + P A D+P
Sbjct: 539 YTTR---------------YWTPRWWHEQAGGYLFFKFPSLDGA---------DYPAAPG 574
Query: 330 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 389
+ + +E +A + Y YD++DL R + ++ + AY+ D
Sbjct: 575 DPAALRAGIEALLAQAPQHGGEPLYTYDVVDLVRHYASVQLDDRLKTAVAAYKAGDLAAG 634
Query: 390 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 449
+ + L +D LA + LG WL A ++ + A+ +T+W
Sbjct: 635 DRATAAAERLARHIDA-LAGNQQETLGSWLADAAAYGDTPAEKAAFVEQAKAVVTVWGGT 693
Query: 450 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRRE 502
L DY ++ W GL YY PR + + + F ++ W+
Sbjct: 694 GH-----LSDYASRAWQGLYAGYYWPRWQRFLAAQRAAAAAHTPFDAKATSDAIRTWQAA 748
Query: 503 WIKLTNDWQNGRNVYPV 519
W+K W R P+
Sbjct: 749 WLKDGRMWPRQRPAAPL 765
>gi|328867426|gb|EGG15808.1| alpha-N-acetylglucosaminidase [Dictyostelium fasciculatum]
Length = 992
Score = 155 bits (392), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 105/329 (31%), Positives = 160/329 (48%), Gaps = 38/329 (11%)
Query: 207 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 266
M G G++ E IEQN ++YDLM+EMA++ ++ WINQY+ RRYG VP + AWN+L
Sbjct: 219 MKGTGLTPEAIEQNYMMYDLMNEMAWRTTAPNMTEWINQYTQRRYGVFVPELAQAWNLLI 278
Query: 267 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 326
TV+N T G YG P S + + D
Sbjct: 279 PTVFNATLGY----------------------------YGPPSSFVGMRPQLNMTND--- 307
Query: 327 LWYSTSEVIRALELFIASGNELSASN-TYRYDLIDLTRQALAKYANELFLNIIEAYQLND 385
L+Y S V +A +L++ +E S T+ +D+ ++T QAL+ + + + +AY N
Sbjct: 308 LYYDPSVVQQAWQLYLGVTDEYVLSTATFSFDVSEITLQALSNLFMDTQMAMYDAYLTNQ 367
Query: 386 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE--QEKQYEWNARTQI 443
+ + + L ++ DMD + A L+G W +A+Q A N + +E+NAR QI
Sbjct: 368 STVFEERATSCLNIITDMDTIAATQQMLLVGTWTANARQWALNTSSGETAPFEFNARNQI 427
Query: 444 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 503
T+W S L DY WSGLL D+Y R A++ KYM SL + F D+ +
Sbjct: 428 TLW----GPPNSSLHDYAYHLWSGLLNDFYFARWALFIKYMDTSLSTNTTFNNTDYTNDI 483
Query: 504 IKLTNDWQNGRNVYPVESNGDALITSQWL 532
L W N YP G+A + S+++
Sbjct: 484 ESLEESWNNQNYQYPTLPTGNAYLLSKFI 512
>gi|154321596|ref|XP_001560113.1| hypothetical protein BC1G_00945 [Botryotinia fuckeliana B05.10]
Length = 701
Score = 155 bits (391), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 86/219 (39%), Positives = 126/219 (57%), Gaps = 6/219 (2%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP L+ V P+A I +W ++ T+L DPLF + F+
Sbjct: 223 VLPAFTGFVPRDLRRVAPNANIINGSDWGNLFPFEYSNDTFLY-PIDPLFKTLQHTFLSL 281
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SY 126
Q + YG SHIY D F+EN P P Y+ ++ Y +QS DS+A W++QGWLF +
Sbjct: 282 QSEYYGNVSHIYTLDQFNENLPASGDPLYLGNISRGTYDSLQSFDSNATWMLQGWLFYAA 341
Query: 127 DPFWRPPQMKALLNSVPLGK-LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
FW +++A L VP + +++LDLF+E P W + Q+YG P+IWC LH + G +
Sbjct: 342 SSFWTQDRVEAYLGGVPKNESMLILDLFSESFPEWENTHQYYGKPWIWCQLHGYGGTPGI 401
Query: 186 YGILDSIAFGPVEA-RTSENTTMVGVGMSMEGIEQNPVV 223
YG + +I +EA R SE MVG+G +MEG + N ++
Sbjct: 402 YGQIYNITNSSIEAFRNSEK--MVGMGNTMEGQDGNGLI 438
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 60/250 (24%), Positives = 111/250 (44%), Gaps = 46/250 (18%)
Query: 265 LYHTVYNCTDGATD--KNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 322
+Y +YN T+ + + +N + +V + EG+ N G + K + +
Sbjct: 401 IYGQIYNITNSSIEAFRNSEKMVGMGNT-------MEGQDGN-GLILPKSLLEMQPNITE 452
Query: 323 DHPHLWYS-TSEVIRALELFIASGNELSAS---------NTYRYDLIDLTRQALAKYANE 372
+H L S T ++ +LF A G +AS N +++D++D+TRQ L++
Sbjct: 453 NHGRLGQSLTIDLFNPADLFRAWGLLYNASISVPELWYDNGWKFDMVDVTRQVLSERFKL 512
Query: 373 LFLNIIEAYQLNDAHGVFQLSRRFLELV-EDMDGLLACHDGFLLGPWLESAKQLAQNE-- 429
++++IE Y A F+ + L ++ ++D +L+ F L W+ +A + N
Sbjct: 513 EYVDLIEKYT---AEIDFEATSENLSMILRELDDILSTSPHFRLDTWINAAIASSPNSST 569
Query: 430 ---------------EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 474
+ + + +NA QIT+W Q + DY +K W GL+R YY
Sbjct: 570 YPIPSSDGSSELNITQTQHLFAYNAINQITIWGPTGQ-----INDYASKSWGGLVRGYYL 624
Query: 475 PRAAIYFKYM 484
R I+ Y+
Sbjct: 625 KRWEIFLDYI 634
>gi|84625358|ref|YP_452730.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
gi|84369298|dbj|BAE70456.1| putative N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae
MAFF 311018]
Length = 590
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 132/524 (25%), Positives = 218/524 (41%), Gaps = 84/524 (16%)
Query: 24 FPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDT 83
P A+I ++ W TY LD DPLF ++ R F+E + YG Y D
Sbjct: 46 LPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQAYG-AGEFYLADA 98
Query: 84 FDENTPPV------------------------------DSPEYISSLGAAIYSGMQSGDS 113
F+E PPV +++ G A+Y + +
Sbjct: 99 FNEMLPPVADDGSDVAAAKYGDSIANFDAARAKAVPPAQRDARLAAYGQALYRSIAQVNP 158
Query: 114 DAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPY 171
A W+MQGWLF D FW+P + A L VP +L+VLD+ + P W S+ F +
Sbjct: 159 KATWVMQGWLFGADCAFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQW 218
Query: 172 IWCMLHNFAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVGMSMEGIEQNPVVYDLM 227
I+ +HN+ + +YG +AF ++A ++ + G G+ EG+ N VVY+ +
Sbjct: 219 IYGYVHNYGASNPLYG---DVAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVVYEYL 275
Query: 228 SEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAF 287
+A++ + W+ QY RYGRS A+ AW L +Y +
Sbjct: 276 YALAWEGPQHPWSQWLAQYLRARYGRSDAALLSAWTDLGAGIYQTRYWS----------- 324
Query: 288 PDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE 347
P + G Y + +P + D P + A++ + +
Sbjct: 325 ----PRWWNTHAGAYLLFKRPTADIVNFD------DRPG---DPQRLRSAIDALLQQADR 371
Query: 348 LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGL 406
+ + YRYDLI+ R L+ A+ +++AY D A G QL+R +LV+ +D L
Sbjct: 372 YADAPLYRYDLIEDARHYLSLQADRQLQTVVQAYNAGDFARGDAQLART-TQLVQGLDAL 430
Query: 407 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 466
+ L ++A + + + Y NAR Q+++W + L DY +K W
Sbjct: 431 VGGQHETLAAWTGQAAAAVGNDARLLRAYVGNARAQVSVWGGDGN-----LADYASKAWQ 485
Query: 467 GLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 503
G+ D+Y R + + ++G F +L W R+W
Sbjct: 486 GMYADFYLQRWTRFLSAYRAARKAGTPFDAQTVDQQLATWERQW 529
>gi|293371910|ref|ZP_06618314.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292633156|gb|EFF51733.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
Length = 411
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 119/449 (26%), Positives = 202/449 (44%), Gaps = 56/449 (12%)
Query: 99 SLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVK 157
+ + +Y+ + + D A W+ W+F +D W +MKALL VP K+++LD E
Sbjct: 3 KIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMILLDYHCENV 62
Query: 158 PIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGI 217
+W ++ F+ PYIWC L NF GN + G + A + + G+G ++EG+
Sbjct: 63 ELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKGIGSTLEGL 122
Query: 218 EQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGAT 277
+ Y+ + E A+ + VD WI + R G +++DAW L++ +Y
Sbjct: 123 DVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDIY------- 174
Query: 278 DKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRA 337
V P T G Y +P + K ++ Y + L EV R
Sbjct: 175 -------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL----LEVWRK 214
Query: 338 LELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRR 395
L NE + + +R DLI + RQ L Y ++ + + D + +
Sbjct: 215 L-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQALKACGEK 267
Query: 396 FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEAS 455
E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 268 MKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW-------GG 320
Query: 456 LLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE----WIKLTN 508
L DY ++ W+GL+ DYY R +Y I+ + G D +L+D +E W+ T+
Sbjct: 321 SLNDYASRSWAGLISDYYAKRWEVYINTFIKVVGEGVEVDQKQLEDELKEIEEGWVNATD 380
Query: 509 DWQNGRNVYPVESNGDALIT-SQWLYNKY 536
++V+ S D L++ S +L++KY
Sbjct: 381 RKDTRKDVH---STTDGLLSFSTFLFSKY 406
>gi|58583545|ref|YP_202561.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae KACC 10331]
gi|58428139|gb|AAW77176.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae KACC 10331]
Length = 753
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 131/524 (25%), Positives = 219/524 (41%), Gaps = 84/524 (16%)
Query: 24 FPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDT 83
P A+I ++ W TY LD DPLF ++ R F+E + YG Y D
Sbjct: 209 LPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQAYG-AGEFYLADA 261
Query: 84 FDENTPPV------------------------------DSPEYISSLGAAIYSGMQSGDS 113
F+E PPV +++ G A+Y + +
Sbjct: 262 FNEMLPPVADDGSDVAAAKYGDSIANFDAARAKAVPPAQRDARLAAYGQALYRSIAQVNP 321
Query: 114 DAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPY 171
A W+MQGWLF D FW+P + A L VP +L+VLD+ + P W S+ F +
Sbjct: 322 KATWVMQGWLFGADCAFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQW 381
Query: 172 IWCMLHNFAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVGMSMEGIEQNPVVYDLM 227
I+ +HN+ + +YG +AF ++A ++ + G G+ EG+ N VVY+ +
Sbjct: 382 IYGYVHNYGASNPLYG---DVAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVVYEYL 438
Query: 228 SEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAF 287
+A++ + W+ QY RYGRS A+ AW L +Y
Sbjct: 439 YALAWEGPQHPWSQWLAQYLRARYGRSDAALLSAWTDLGAGIYQTR-------------- 484
Query: 288 PDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE 347
P + G Y + +P + ++ + D + A++ + +
Sbjct: 485 -YWSPRWWNTHAGAYLLFKRPTAD--IVNFDDRPGD-------PQRLRSAIDALLQQADR 534
Query: 348 LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGL 406
+ + YRYDLI+ R L+ A+ +++AY D A G QL+ R +LV+ +D L
Sbjct: 535 YADAPLYRYDLIEDARHYLSLQADRQLQTVVQAYNAGDFARGDAQLA-RTTQLVQGLDAL 593
Query: 407 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 466
+ L ++A + + + Y NAR Q+++W + L DY +K W
Sbjct: 594 VGGQHETLAAWTGQAAAAVGNDARLLRAYVGNARAQVSVWGGDGN-----LADYASKAWQ 648
Query: 467 GLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 503
G+ D+Y R + + ++G F +L W R+W
Sbjct: 649 GMYADFYLQRWTRFLSAYRAARKAGTPFDAQTVDQQLATWERQW 692
>gi|440695019|ref|ZP_20877582.1| Tat pathway signal sequence domain protein [Streptomyces
turgidiscabies Car8]
gi|440282912|gb|ELP70302.1| Tat pathway signal sequence domain protein [Streptomyces
turgidiscabies Car8]
Length = 1050
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 135/532 (25%), Positives = 226/532 (42%), Gaps = 52/532 (9%)
Query: 8 VLPAFSGNVPAAL--QNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFI 65
VLP + G VP +N +A + Q G+W + K P W LD F E+ AF
Sbjct: 253 VLPGYFGTVPDEFVARNGGDAAVVPQ-GDWGAFKR-PDW-----LDPRTTAFGEVAAAFY 305
Query: 66 EQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
+ Q + +G S +Y D E P D P + A+ + ++ AVW + GW +
Sbjct: 306 QAQSERFG-DSTMYKMDLLHEGGNPGDVP--VGRAAQAVEAALRKAHPGAVWAILGWQNN 362
Query: 126 YDPFWRPPQMKALLNSVPLGKLVVLDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIE 184
P + +L++V ++ V+D ++ + + G PY + + NF G+
Sbjct: 363 -------PSGE-ILDAVDKSRMFVVDGLSDRYTTVTDRESDWGGTPYAFGSIWNFGGHTP 414
Query: 185 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 244
M + R E++ + G+ E + N L++++A+ +D+ W
Sbjct: 415 MGANAPDWVEQYPKWRDKEDSALAGIAAMPEAADNNHAALALLTDLAWTPGTIDLDDWFA 474
Query: 245 QYSVRRYGRSVPAIQDAWNVLYHTVYNCT--DGATDKNRDVIVAFPDVDPSIISVTEGKY 302
Y+V RYG P AW ++ T Y + DG ++ + A P +
Sbjct: 475 SYAVSRYGAEDPHALAAWKIIGDTAYGMSRADGWSEAPDGLFGARPSLG----------- 523
Query: 303 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 362
+K A E YD T+ AL + L ++ YRYDL D+
Sbjct: 524 ------ANKAAAWGPEADRYD-------TTAFDLALTELLQVAPALRGNSAYRYDLADVA 570
Query: 363 RQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 422
RQ L+ + L I AY D +L+ +L+ + MD +LA LLG WL A
Sbjct: 571 RQVLSNRSRMLLPQIRAAYDTADRVRFDELTGVWLDWMRLMDKVLATSGQHLLGRWLADA 630
Query: 423 KQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 482
+ ++ Q E++AR+ IT W E L DY N+ WSGL+ Y R +YF+
Sbjct: 631 RSWGATRGEKDQLEYDARSIITTWGGRASSEEG-LHDYANREWSGLVGGLYLTRWTLYFR 689
Query: 483 YMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYN 534
+ +L + D W L +DW + + +P +++GD ++ ++N
Sbjct: 690 ELSRALRQNRPPKTVD----WFTLEDDWAHRHDSHPTKTSGDVHKLARRVHN 737
>gi|456390168|gb|EMF55563.1| alpha-N-acetylglucosaminidase [Streptomyces bottropensis ATCC
25435]
Length = 1042
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 126/525 (24%), Positives = 218/525 (41%), Gaps = 48/525 (9%)
Query: 8 VLPAFSGNVPAA-LQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
VLP + G+VP ++ A + G W + P W LD F + +F
Sbjct: 249 VLPGYYGHVPKQFVERNGGDAHVVPQGLWHGFER-PDW-----LDPRTDSFARVAASFYG 302
Query: 67 QQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY 126
+G +H + D E D P + + + DA+W++ GW
Sbjct: 303 HVRDVFGAAAH-FKMDLLHEGGTAGDVP--VPDAARGVERALHKAHPDAIWVILGW--QE 357
Query: 127 DPFWRPPQMKALLNSVPLGKLVVLDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 185
+P + LL+++ +++++D ++ + + + G PY + + NF G +
Sbjct: 358 NP------LPELLDAIDRSRMLIVDGVSDRYASVTDRERDWGGTPYCFGTIPNFGGRTTI 411
Query: 186 YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQ 245
R ++ +VG E +++P ++L SE+A+ K+D AW +
Sbjct: 412 GARAHLWTDKFFAWRDKPDSALVGTAYMPEATDRDPAAFELFSELAWTPGKIDRAAWFSA 471
Query: 246 YSVRRYGRSVPAIQDAWNVLYHTVYNCTD-GATDKNRDVIVAFPDVDPSIISVTEGKYQN 304
Y+ RYG A + AW L+ T Y +D + + A PD+
Sbjct: 472 YADFRYGGRDDAARAAWRALHETAYQQRAVERSDPHDSLFCARPDL-------------- 517
Query: 305 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 364
+ A T +YD + A L +A G + + YRYD++DL RQ
Sbjct: 518 ---AADRAAEYAPRTLTYDPGRF-----DAAFAGLLDVAGGRRRNPA--YRYDVVDLARQ 567
Query: 365 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 424
ALA + + + A++ D LS +L L+ D + FLLGPW+ A+
Sbjct: 568 ALAHRSRQYLPQLRAAHRRKDLTTFRALSTLWLRLMRLSDEVTGTDGAFLLGPWVNDARL 627
Query: 425 LAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 484
LA ++ + ++E A+ IT+W + L +YGN+ W+GL+ D+Y PR + +
Sbjct: 628 LATDDAERAEFERTAKVLITVWGGRATSDTGDLHEYGNREWNGLMADFYVPRWQKWLDAL 687
Query: 485 IESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITS 529
++L +G D W W R YP+ GDA T+
Sbjct: 688 EDALATGTAPAAVD----WFAFEEPWTRERKDYPLRPVGDAYRTA 728
>gi|169351438|ref|ZP_02868376.1| hypothetical protein CLOSPI_02218 [Clostridium spiroforme DSM 1552]
gi|169291660|gb|EDS73793.1| F5/8 type C domain protein [Clostridium spiroforme DSM 1552]
Length = 1762
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 136/529 (25%), Positives = 226/529 (42%), Gaps = 80/529 (15%)
Query: 7 AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
VL ++G VP + + +I + GNW V PR ++ L+ + + F E
Sbjct: 852 TVLQGYAGMVPNNFTD-YQDVEILEQGNWCGV---PR---PDMIRTDGELYDQYAKLFYE 904
Query: 67 QQLKEYGRTSHIYNCDTFDENTP-PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q +G+TS+ Y D F E P D + + + + + + D +AVW++Q W
Sbjct: 905 AQEWAFGKTSNYYAVDPFHEGGKRPSDLTDDV--ISREVLNSLLEYDQEAVWMVQAW--- 959
Query: 126 YDPFWRPPQMKAL--LNSVPLGKLVVLDLFA------------EVKPIWSTSKQFYGVPY 171
W P L + +++LDL E S +F +
Sbjct: 960 ----WSNPTNDLLKGMGDDREDHVIILDLNGLNDAYDSYWDKTEYNGTVLESDEFNSTSW 1015
Query: 172 IWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMA 231
+WCML N+ GN M G I + +++ M G+G E NP++Y+L+ +MA
Sbjct: 1016 VWCMLENYGGNPSMDGRPKEI-INRINKASTQAEHMKGIGFISEATYDNPMIYELLLDMA 1074
Query: 232 FQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVD 291
+Q + +D+ W+++Y +RRYG + +AW++L TVY+ + TD ++A D
Sbjct: 1075 WQQDTIDLDDWLDEYVLRRYGDYSESAGEAWDILLKTVYSRSGKTTD-----VIARS--D 1127
Query: 292 PSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS 351
PS++ YG P Y+ SE+ ALEL ++LSAS
Sbjct: 1128 PSLVQ--------YGLP--------------------YTASELEEALELLYKDYDKLSAS 1159
Query: 352 NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHD 411
YRYDL ++ RQ + YA ++ AY + L ++L ++ ++ +
Sbjct: 1160 EAYRYDLTEIMRQVVNNYAVVRLGDLKTAYDAKEIDNFKSLKEQYLNAIDLLNEVCGTQQ 1219
Query: 412 GFLLGPWLESAKQLAQNEEQEK-QYE---WNARTQITMWFDNTQEEASLLRDYGNKYWSG 467
L+G W+ A A++ + Y+ NA+T IT+W +T L Y + + G
Sbjct: 1220 DLLIGEWVGRAVDWAKDTNSDDFAYDSMIINAKTLITVWAPSTT-----LGTYAYRNYEG 1274
Query: 468 LLRDYYGPRAAIYFKYMIESLESGDG----FRLKDWRREWIKLTNDWQN 512
++ D Y Y E LE G D +WI D QN
Sbjct: 1275 MINDIYKVIWQAYLDQSEEILEFGSAKTNLVNYHDLCMDWIYADWDLQN 1323
>gi|212722968|ref|NP_001131519.1| uncharacterized protein LOC100192858 [Zea mays]
gi|194691748|gb|ACF79958.1| unknown [Zea mays]
Length = 114
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 61/99 (61%), Positives = 75/99 (75%)
Query: 445 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 504
MWFDNT+ +ASLLRDY NKYWSGLL+DYYGPRAAIYFK+++ S+E+ F LK+WRREWI
Sbjct: 1 MWFDNTETKASLLRDYANKYWSGLLQDYYGPRAAIYFKHLLLSMENNAPFALKEWRREWI 60
Query: 505 KLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 543
LTN+WQ+ R V+ + GD L SQ LY KYL +
Sbjct: 61 SLTNNWQSDRKVFSTTATGDPLNISQSLYTKYLSNADLL 99
>gi|302522684|ref|ZP_07275026.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SPB78]
gi|302431579|gb|EFL03395.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SPB78]
Length = 355
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 89/333 (26%), Positives = 155/333 (46%), Gaps = 25/333 (7%)
Query: 168 GVPYIWCMLHNFAGNIEM-YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDL 226
G PY + + NF G+ + D + P R + + G+ + E + NP ++L
Sbjct: 10 GTPYAFGSIWNFGGHTALGANTRDWVDLYP-RWRDRSGSRLSGIALMPEAADNNPAAFEL 68
Query: 227 MSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVA 286
+E+ + VD+ W +Y+ RYG S + AW++L TVY D+ +
Sbjct: 69 FAELPWTEGPVDLTDWFREYARVRYGGSDAHAEAAWDILRTTVYGTRR--DDRWSEPADG 126
Query: 287 FPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGN 346
P++ +V+ GK+ L+ +S++ AL+ ++
Sbjct: 127 LFGARPALDAVSAGKWS--------PKALRYPAASFEP------------ALDELLSVRA 166
Query: 347 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGL 406
EL S TYR DL+D+ RQALA + L + AY+ + +L RR++ L++ ++ L
Sbjct: 167 ELRDSATYRRDLLDVARQALANRSRTLLPRLAAAYKAKNQAEFARLGRRWIALIDLLEQL 226
Query: 407 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 466
+A + LLG W+ESA+ + ++ Q +++A + +T W +A LRDY N+ WS
Sbjct: 227 VATDENHLLGRWVESARAWGGSAREKSQLQYDALSLLTTWGTRQGADAG-LRDYANREWS 285
Query: 467 GLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 499
GL+ Y R Y + +L+ G DW
Sbjct: 286 GLVGGLYRLRWGTYIDELSAALKEGRKPVAVDW 318
>gi|339238239|ref|XP_003380674.1| GDP-L-fucose synthetase [Trichinella spiralis]
gi|316976398|gb|EFV59699.1| GDP-L-fucose synthetase [Trichinella spiralis]
Length = 1203
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 75/220 (34%), Positives = 113/220 (51%), Gaps = 7/220 (3%)
Query: 59 EIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWL 118
+G + + L+ Y H Y+ D F+E P ++ ++ AIY+ M S D +VW+
Sbjct: 801 HVGNEVVWKSLENYFGLFHAYSADPFNEMVPNTFDVMFLRNVSFAIYNVMLSVDPKSVWV 860
Query: 119 MQGWLFSYDPFWRPPQ-MKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLH 177
+Q W+F W + K L +VP G ++V+DL+AE P++ FY P+IWCMLH
Sbjct: 861 LQSWMFLSSERWLENENAKHFLTAVPTGSILVVDLYAEEYPLYEKFSGFYNQPFIWCMLH 920
Query: 178 NFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF--QHE 235
NF G +YG L I + T N MVG G+SMEGI+QN VVY + + + ++
Sbjct: 921 NFGGVQGLYGNLARINQKLADVSTVSNINMVGTGLSMEGIDQNYVVYQMALDRFWSPNNQ 980
Query: 236 KVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTD 274
KVD+ AW Y G + +I AW + C +
Sbjct: 981 KVDLAAW---YIYIHLGVGITKSIYTAWGAFLQSSRTCQE 1017
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 52/209 (24%), Positives = 94/209 (44%), Gaps = 10/209 (4%)
Query: 331 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 390
T + A F+ S + Y DL++LT+ AL +L+ + +Y
Sbjct: 998 TKSIYTAWGAFLQSSRTCQENEIYINDLVELTKHALMLTGAKLYEKLQASYIRKCGQEFL 1057
Query: 391 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 450
+ + +++ D++ + H +L W+E A+ + Q Q E N R Q+T+W
Sbjct: 1058 ENAAAVEQVLSDLEWISKTHSRSMLSKWIEIARANGKTAAQSDQLEENLRMQVTIW--GP 1115
Query: 451 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM---IESLESGDGFRLKDWRREWIKLT 507
Q E + DY K W+ L +YY PR ++F ++ I LE+ + L I+L
Sbjct: 1116 QGE---IVDYARKQWAALFSEYYLPRWRLFFAHLYADILQLETFNQTLLNSRLFHEIELP 1172
Query: 508 NDWQNGRNVYPVESNGDALITSQWLYNKY 536
Q N+ + G+ ++ S+ LY++Y
Sbjct: 1173 FALQKIPNI--DQPTGNTVVVSKILYSRY 1199
>gi|293402299|ref|ZP_06646437.1| putative alpha-N-acetylglucosaminidase [Erysipelotrichaceae bacterium
5_2_54FAA]
gi|291304406|gb|EFE45657.1| putative alpha-N-acetylglucosaminidase [Erysipelotrichaceae bacterium
5_2_54FAA]
Length = 2330
Score = 128 bits (322), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 117/493 (23%), Positives = 209/493 (42%), Gaps = 60/493 (12%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VL ++G VP P+ +T +W + + P T P + E + F E
Sbjct: 707 VLQGYAGMVPTNFNEFQPNVPLTAQKSWGGL-ARPSMIPT-----DSPYYDEYAKLFYEA 760
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYIS--SLGAAIYSGMQSGDSDAVWLMQGWLFS 125
Q YG TS Y D + E PE +S ++ + + + D DAVW++Q W
Sbjct: 761 QEYIYGATSDYYAVDPYHEGGT---RPEGLSDETVAREVLNSLLDYDKDAVWVVQAW--- 814
Query: 126 YDPFWRPPQMKALLNSV---PLGKLVVLDLFAEVKPIWS--TSKQFYGVPYIWCMLHNFA 180
+ LLN + ++++DL W+ +F G + W +L F
Sbjct: 815 -----QSNPTDGLLNGMGEYRENHVLIVDLIKYPIKSWTKYNKSEFKGTSWAWGLLGGFG 869
Query: 181 GNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVK 240
GN M G + ++ ++ E T M G+G+ E NPV+YDL+ ++A+ + +
Sbjct: 870 GNPTMNGEMQTM-VNDIQTAKKERTHMAGLGIISEAQYDNPVLYDLIFDLAWVDDDFSLD 928
Query: 241 AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEG 300
W+N+Y RRYG + ++AW ++ + YN V F + +
Sbjct: 929 QWLNKYIERRYGGTSDNAKEAWKIMKNANYNHG-----------VRFT---AQVYGMKGK 974
Query: 301 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLID 360
Q+YGK ++ Y ++ A L I ++ S YRYDL +
Sbjct: 975 SPQDYGK-----------------QNISYGADKLETAFRLLIEDYDKFKDSECYRYDLTE 1017
Query: 361 LTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLE 420
+ RQ ++ Y+ + N+I+A + + + +FL+ + ++ + L G W+
Sbjct: 1018 IMRQMVSNYSTLTYNNVIDAREDKNIEKFKEEKAKFLKSFDVLNDIQETQVDQLAGEWIG 1077
Query: 421 SAKQLAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
A+ A + + + +E NA+ IT W ++ A L+DY + + G+ D Y
Sbjct: 1078 KAQDRAADYDDFAKDAFEMNAKALITSWA--SRSSAGGLKDYAWRNYQGMFIDLYKQNWI 1135
Query: 479 IYFKYMIESLESG 491
Y + +LE+G
Sbjct: 1136 DYLDQVEANLENG 1148
>gi|281423204|ref|ZP_06254117.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
gi|281402540|gb|EFB33371.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
Length = 291
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 70/240 (29%), Positives = 112/240 (46%), Gaps = 22/240 (9%)
Query: 250 RYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV 309
RYG++ P I+ AW +L T+YNC G + SI G+P
Sbjct: 4 RYGKTSPEIERAWQLLSETIYNCPAGNNQQG---------PHESIFC---------GRP- 44
Query: 310 SKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 369
++ + S+ +Y + A +L ++ +N + YDL+D+ RQALA
Sbjct: 45 ---SLNNFQVKSWSKMRNYYDLQATLEAAQLMTGIADQYKGNNNFEYDLVDICRQALADQ 101
Query: 370 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 429
+L I Y + + RFLE++ D LL F LG W E+A++L +
Sbjct: 102 GRLQYLKTIADYNGFSRKAFAKDAHRFLEMILLQDKLLGTRTEFRLGHWTEAARKLGTTQ 161
Query: 430 EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 489
+++ YEWNAR QIT W + + L DY +K W G+L+D+Y R I+ + + +E
Sbjct: 162 QEKDLYEWNARVQITTWGNRMCADKGGLHDYAHKEWQGILKDFYYKRWKIFMDALAKQME 221
>gi|260821254|ref|XP_002605948.1| hypothetical protein BRAFLDRAFT_132235 [Branchiostoma floridae]
gi|229291285|gb|EEN61958.1| hypothetical protein BRAFLDRAFT_132235 [Branchiostoma floridae]
Length = 673
Score = 112 bits (280), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 66/201 (32%), Positives = 93/201 (46%), Gaps = 51/201 (25%)
Query: 91 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 150
+ P Y+S GAA+Y+GM +GD A+WLMQGWLF FW+P Q KALL SVP G
Sbjct: 442 LSEPNYLSKAGAAVYAGMLAGDPQAIWLMQGWLFQARDFWQPAQTKALLQSVPEG----- 496
Query: 151 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 210
P AR +TMVG
Sbjct: 497 ---------------------------------------------PFLARKYLGSTMVGT 511
Query: 211 GMSMEGIEQNPVVYDLMSEMAFQHEKVDV-KAWINQYSVRRYGRSVPAIQDAWNVLYHTV 269
G++ EGI+QN ++Y+LM+E+A+ + + W + Y+ RYG W +L +V
Sbjct: 512 GLTPEGIDQNYIMYELMNEVAWMPQPFQILDNWASDYAWSRYGVKNSNASLGWQILLKSV 571
Query: 270 YNCTDGATDKNRDVIVAFPDV 290
Y+C +G D V+V PD+
Sbjct: 572 YDCENGFKDHCDSVVVHRPDL 592
>gi|358381741|gb|EHK19415.1| hypothetical protein TRIVIDRAFT_224650 [Trichoderma virens Gv29-8]
Length = 217
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 58/214 (27%), Positives = 110/214 (51%), Gaps = 11/214 (5%)
Query: 88 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPL-G 145
TPP Y+ + + + ++S D +A+W+ Q WLF+ + FW +++ + +
Sbjct: 2 TPPSGELNYLRNASSNTWKALKSADPEAIWVFQAWLFAQNTTFWTNDRIEVYPGGITIDS 61
Query: 146 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 205
+++LD++ E W ++ +Y P+IWC L N+ I MYG + ++ P+ A E+
Sbjct: 62 DMLILDIWLESMSQWQCAQSYYSKPWIWCELQNYGATINMYGQIQNLTKSPILA-LQESQ 120
Query: 206 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY--GRSVPAIQDAWN 263
++VG+G+SME + N +V+DL+ A+ +D + ++ RY + +I AW
Sbjct: 121 SLVGLGLSMEAQQSNEIVFDLLLSQAWNCTPIDTNIYFKSWAAARYLSSKRPASIYTAWE 180
Query: 264 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 297
+ TVY+ T N +++ + P S I V
Sbjct: 181 AVRATVYDNT------NLNMMSSVPKSRSSEIKV 208
>gi|47212645|emb|CAF95026.1| unnamed protein product [Tetraodon nigroviridis]
Length = 121
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 47/103 (45%), Positives = 72/103 (69%), Gaps = 3/103 (2%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAFSGNVP + ++P A++T+LG W K + + C+Y+LD DPLF+ IG ++ Q
Sbjct: 22 VLPAFSGNVPKGILRLYPEARVTRLGPW--SKFNCSFSCSYILDPRDPLFLRIGSLYLAQ 79
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQS 110
+K++G T+HIYN DTF+E TPP P Y+S++ A+++ M +
Sbjct: 80 VVKQFG-TNHIYNTDTFNEMTPPSSEPNYLSAVSRAVFAAMTA 121
>gi|347541919|ref|YP_004856555.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-rat-Yit]
gi|346984954|dbj|BAK80629.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-rat-Yit]
Length = 912
Score = 99.8 bits (247), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 98/407 (24%), Positives = 184/407 (45%), Gaps = 56/407 (13%)
Query: 108 MQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFY 167
+ + +++W++Q W S P + + N + ++LDL +++ W +F
Sbjct: 490 IDNNGENSIWIIQSWSHS-------PSSETIEN-LNRNNTLILDLHSQLNTRWKGISKFN 541
Query: 168 GVPY----------IWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGI 217
+ + I+ +L+NF G +YG + +A+ + N + GV + EGI
Sbjct: 542 NMSWKDREFDRSNWIFGVLNNFGGRSGLYGHTRHLLNQFYDAKYNSNY-LKGVAHTSEGI 600
Query: 218 EQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT---- 273
N + +L++E+ F +K+D+ ++++Y RYG+S + A+N+L TVYN
Sbjct: 601 GFNNFIDELVTEIIFS-DKLDIDEFVSRYLRNRYGKSDNDLLKAFNILLDTVYNPVINIY 659
Query: 274 -DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTS 332
+GA++ VI A P +D +KS S + H Y++
Sbjct: 660 HEGASES---VINARPSLD-----------------------VKS-ASKWGSIHKNYNSE 692
Query: 333 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 392
++ AL ++ + NE S Y DLID+ + + +NE + N+ + Y N F+L
Sbjct: 693 KLEEALRIYFSKYNEFKDSKGYMTDLIDIASEVIINLSNEYYKNLQDYYN-NGEIEFFKL 751
Query: 393 -SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 451
S+RFL ++ +L ++ L ++ L ++ E N +T +T W+D
Sbjct: 752 NSQRFLNMILLQANILYYNERKSLQKLIDKLDDLNYDDYFEDTLIINKKTILTTWYDKQV 811
Query: 452 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG--DGFRL 496
E LRDY N + ++ Y R +F + E+ +G D +R
Sbjct: 812 SEDDGLRDYANTDFYDIVGTLYYNRWKRFFDNIQENAVNGFYDDYRF 858
>gi|293369245|ref|ZP_06615835.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
gi|292635670|gb|EFF54172.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
Length = 221
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 57/214 (26%), Positives = 102/214 (47%), Gaps = 7/214 (3%)
Query: 325 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 384
P + Y +++ A L ++ + ++Y +DL+++ RQ L Y N + AY+
Sbjct: 11 PTIEYQPKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAG 68
Query: 385 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 444
D + + E++ D+D L++CH F L W+ A+ + + + YE NAR+ IT
Sbjct: 69 DIPMMKNRGNKMREILADLDKLVSCHPTFSLHKWITDARDMGHDAASKNYYEMNARSLIT 128
Query: 445 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 504
+W D+ L DY N+ W+GL YY R + +IE+ E F +++ +
Sbjct: 129 IWGDSYH-----LTDYANRSWAGLTNQYYSVRWDHFINEVIEAAEKKKNFDEEEFFNQSR 183
Query: 505 KLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 538
N+W N N GD + ++ +Y KY +
Sbjct: 184 MYENEWVNPSNRISYNEGGDGIKLARQIYKKYAK 217
>gi|380804373|gb|AFE74062.1| alpha-N-acetylglucosaminidase precursor, partial [Macaca mulatta]
Length = 265
Score = 95.5 bits (236), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 44/108 (40%), Positives = 68/108 (62%), Gaps = 3/108 (2%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VP A+ VFP +T++G+W + C++LL DP+F IG F+ +
Sbjct: 161 VLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRE 218
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDA 115
+KE+G T HIY DTF+E PP +P Y+++ A+Y M + D++A
Sbjct: 219 LVKEFG-TDHIYGADTFNEMQPPSSAPSYLAAATTAVYEAMIAVDTEA 265
>gi|342731751|ref|YP_004770590.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-Japan]
gi|342329206|dbj|BAK55848.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-mouse-Japan]
Length = 898
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 100/440 (22%), Positives = 196/440 (44%), Gaps = 67/440 (15%)
Query: 60 IGRAFIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGD-SDAV 116
I + E+Q + G++ + + D F E N D+ E L + S +++ D+V
Sbjct: 432 ISSVYYEKQRELLGKSKY-FAGDLFHEGANLYGYDAGE----LSNRVLSLLKNNTGEDSV 486
Query: 117 WLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS----------TSKQF 166
W++Q W P +++ N + +++LDL +++ W +K+F
Sbjct: 487 WIIQSWA-------HNPSSESIEN-LNKDNILILDLHSQLNTRWKGISKFNYMSWDNKEF 538
Query: 167 YGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDL 226
+I+ +L+NF G +YG + + +A+ + + + G+ + EG+ N + +L
Sbjct: 539 DNSNWIFGILNNFGGRNGLYGHSNHLLRQFYDAKYNSDY-LSGIANTSEGVGFNNFIDEL 597
Query: 227 MSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN-CTD----GATDKNR 281
+E+ F E V++ ++ +Y RYG+S + A+N+L TVYN TD GA++
Sbjct: 598 STELIFSDE-VNMDEFVKRYLKNRYGKSDRDLLVAFNILLDTVYNPVTDIYHEGASES-- 654
Query: 282 DVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELF 341
VI A P ++ + S + H Y + ++ R +E++
Sbjct: 655 -VINARPSLEIN------------------------SASKWGTIHKNYDSRKLERVIEIY 689
Query: 342 IASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVE 401
I+ +E + Y DLID+ + + A+E + I E Y + + +S++FL L+
Sbjct: 690 ISKYDEFKDNEGYIIDLIDIASEVIINLASEYYQIIQEYYNNGNIKYLQLISKKFLNLIL 749
Query: 402 DMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYG 461
+L+ +D L + L ++ + ++N + +T W+D E LRDY
Sbjct: 750 LQANILSYNDKKSLQKIINKLDALDYDDYFKDTLKYNKKMILTTWYDKLVSEDGGLRDYA 809
Query: 462 NKYWSGLLRDYYGPRAAIYF 481
N D+Y +Y+
Sbjct: 810 NT-------DFYDIVGTLYY 822
>gi|402824586|ref|ZP_10873940.1| N-acetylglucosaminidase, partial [Sphingomonas sp. LH128]
gi|402261896|gb|EJU11905.1| N-acetylglucosaminidase, partial [Sphingomonas sp. LH128]
Length = 486
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/215 (28%), Positives = 100/215 (46%), Gaps = 39/215 (18%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G VP A P A+I ++ W TY LD +DPLF ++ F+
Sbjct: 260 VLPAFAGYVPEAFAKAHPKARIYKMRAWEGFPP------TYWLDPSDPLFTQLAARFVTL 313
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVD-------SPEY-----------------------I 97
+ YG + Y D F+E PP+ + EY +
Sbjct: 314 YNRTYGEGEY-YLADAFNEMIPPIAEDGSDAAAAEYGDSIANTAATRAAALPPAVRDARL 372
Query: 98 SSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQ-MKALLNSVPLGKLVVLDLFAEV 156
++ G +Y + + A W+MQGWLF D +R P+ + A L+ VP ++++LD+ +
Sbjct: 373 AAYGERLYGSITAAAPKATWVMQGWLFGADKAFRTPEAIAAFLSRVPDDRMLILDIGNDR 432
Query: 157 KP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 190
P IW + F G + + +HN+ G+ +YG L+
Sbjct: 433 YPGIWQKTDAFDGKAWTYGYVHNYGGSNPVYGDLE 467
>gi|417965571|ref|ZP_12607078.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-4]
gi|380336329|gb|EIA26351.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-4]
Length = 685
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 102/463 (22%), Positives = 203/463 (43%), Gaps = 65/463 (14%)
Query: 33 GNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NTPP 90
G W +K R LD + I + E+Q + G++ + + D F E N
Sbjct: 205 GYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANLYG 257
Query: 91 VDSPEYISSLGAAIYSGMQSGDS-DAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 149
D+ E L + S +++ D+VW++Q W P +++ N + +++
Sbjct: 258 YDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNILI 305
Query: 150 LDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA 199
LDL +++ W +K+F +I+ +L+NF G +YG + + +A
Sbjct: 306 LDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFYDA 365
Query: 200 RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQ 259
+ + + + G+ + EG+ N + +L +E+ F E V++ ++ +Y RYG+S +
Sbjct: 366 KYNSDY-LSGIANTSEGVGFNNFIDELSTELIFSDE-VNMDEFVKRYLKNRYGKSDRDLL 423
Query: 260 DAWNVLYHTVYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 318
A+N+L TVYN TD + S+I+ ++ ++ +
Sbjct: 424 VAFNILLDTVYNPVTD----------IYHEGASESVIN-------------ARPSLGINS 460
Query: 319 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 378
S + H Y + ++ R +E++I+ +E + Y DLID+ + + A+E + I
Sbjct: 461 ASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASEYYQIIQ 520
Query: 379 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWN 438
E Y + + +S++FL L+ +L+ +D L + L ++ + ++N
Sbjct: 521 EYYNNGNIKYLQLISKKFLNLILLQANILSYNDKKSLQKIINKLDALDYDDYFKDTLKYN 580
Query: 439 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 481
+ +T W+D E LRDY N D+Y +Y+
Sbjct: 581 KKMILTTWYDKLVSEDGGLRDYANT-------DFYDIVGTLYY 616
>gi|417967717|ref|ZP_12608785.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-co]
gi|380340884|gb|EIA29424.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-co]
Length = 741
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 96/436 (22%), Positives = 195/436 (44%), Gaps = 59/436 (13%)
Query: 60 IGRAFIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGDS-DAV 116
I + E+Q + G++ + + D F E N D+ E L + S +++ D+V
Sbjct: 275 ISSVYYEKQRELLGKSKY-FAGDLFHEGANLYGYDAGE----LSNRVLSLLKNNTGEDSV 329
Query: 117 WLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS----------TSKQF 166
W++Q W P +++ N + +++LDL +++ W +K+F
Sbjct: 330 WIIQSWA-------HNPSSESIEN-LNKDNILILDLHSQLNTRWKGISKFNYMSWDNKEF 381
Query: 167 YGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDL 226
+I+ +L+NF G +YG + + +A+ + + + G+ + EG+ N + +L
Sbjct: 382 DNSNWIFGILNNFGGRNGLYGHSNHLLRQFYDAKYNSDY-LSGIANTSEGVGFNNFIDEL 440
Query: 227 MSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN-CTDGATDKNRDVIV 285
+E+ F E V++ ++ +Y RYG+S + A+N+L TVYN TD +
Sbjct: 441 STELIFSDE-VNMDEFVKRYLKNRYGKSDRDLLVAFNILLDTVYNPVTD----------I 489
Query: 286 AFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASG 345
S+I+ ++ ++ + S + H Y + ++ R +E++I+
Sbjct: 490 YHEGASESVIN-------------ARPSLGINSASKWGTIHKNYDSRKLERVIEIYISKY 536
Query: 346 NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDG 405
+E + Y DLID+ + + A+E + I E Y + + +S++FL L+
Sbjct: 537 DEFKDNEGYIIDLIDIASEVIINLASEYYQIIQEYYNNGNIKYLQLISKKFLNLILLQAN 596
Query: 406 LLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYW 465
+L+ +D L + L ++ + ++N + +T W+D E LRDY N
Sbjct: 597 ILSYNDKKSLQKIINKLDALDYDDYFKDTLKYNKKMILTTWYDKLVSEDGGLRDYANT-- 654
Query: 466 SGLLRDYYGPRAAIYF 481
D+Y +Y+
Sbjct: 655 -----DFYDIVGTLYY 665
>gi|384455191|ref|YP_005667784.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-mouse-Yit]
gi|418016862|ref|ZP_12656425.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-NYU]
gi|418371995|ref|ZP_12964091.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-SU]
gi|345505596|gb|EGX27892.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-NYU]
gi|346983532|dbj|BAK79208.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-mouse-Yit]
gi|380342872|gb|EIA31299.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-SU]
Length = 898
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 100/440 (22%), Positives = 195/440 (44%), Gaps = 67/440 (15%)
Query: 60 IGRAFIEQQLKEYGRTSHIYNCDTFDE--NTPPVDSPEYISSLGAAIYSGMQSGD-SDAV 116
I + E+Q + G++ + + D F E N D+ E L + S +++ D+V
Sbjct: 432 ISSVYYEKQRELLGKSKY-FAGDLFHEGANLYGYDAGE----LSNRVLSLLKNNTGEDSV 486
Query: 117 WLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWS----------TSKQF 166
W++Q W P +++ N + +++LDL +++ W +K+F
Sbjct: 487 WIIQSWA-------HNPSSESIEN-LNKDNILILDLHSQLNTRWKGISKFNYMSWDNKEF 538
Query: 167 YGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDL 226
+I+ +L+NF G +YG + + +A+ + + + G+ + EG+ N + +L
Sbjct: 539 DNSNWIFGILNNFGGRNGLYGHSNHLLRQFYDAKYNSDY-LSGIANTSEGVGFNNFIDEL 597
Query: 227 MSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN-CTD----GATDKNR 281
+E+ F E V++ ++ +Y RYG+S + A+N+L TVYN TD GA++
Sbjct: 598 STELIFSDE-VNMDEFVKRYLKNRYGKSDRDLLVAFNILLDTVYNPVTDIYHEGASES-- 654
Query: 282 DVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELF 341
VI A P + + S + H Y + ++ R +E++
Sbjct: 655 -VINARPSLGIN------------------------SASKWGTIHKNYDSRKLERVIEIY 689
Query: 342 IASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVE 401
I+ +E + Y DLID+ + + A+E + I E Y + + +S++FL L+
Sbjct: 690 ISKYDEFKDNEGYIIDLIDIASEVIINLASEYYQIIQEYYNNGNIKYLQLISKKFLNLIL 749
Query: 402 DMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYG 461
+L+ +D L + L ++ + ++N + +T W+D E LRDY
Sbjct: 750 LQANILSYNDKKSLQKIINKLDALDYDDYFKDTLKYNKKMILTTWYDKLVSEDGGLRDYA 809
Query: 462 NKYWSGLLRDYYGPRAAIYF 481
N D+Y +Y+
Sbjct: 810 NT-------DFYDIVGTLYY 822
>gi|321458423|gb|EFX69492.1| hypothetical protein DAPPUDRAFT_35389 [Daphnia pulex]
Length = 132
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 78/137 (56%), Gaps = 5/137 (3%)
Query: 358 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 417
++DLTRQ++ + + L+ ++E Y ++ + ++ + + L++D+D L+ FLLG
Sbjct: 1 MVDLTRQSMQEIFHLLYSKLLEVYLEKNSTAIEGIAYKMINLLQDLDELIQTGKTFLLGK 60
Query: 418 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 477
W+ AK E ++ QYEWNAR QIT+W + +RDY K W+G++ DYY P
Sbjct: 61 WIADAKSWGTTEGEKLQYEWNARNQITLWGPRGE-----IRDYAAKKWAGVVADYYKPHW 115
Query: 478 AIYFKYMIESLESGDGF 494
++ + M SL+ F
Sbjct: 116 EVFIREMQMSLDENRAF 132
>gi|281423203|ref|ZP_06254116.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
gi|281402539|gb|EFB33370.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
Length = 450
Score = 89.7 bits (221), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 68/230 (29%), Positives = 107/230 (46%), Gaps = 26/230 (11%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLP F G +P + +T G W L TD +I + +
Sbjct: 223 VLPGFCGMMPHDAKAKL-GLNVTDGGIWNGYTRPAN------LSPTDAHSDKIADLYYAE 275
Query: 68 QLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD 127
YG+ ++ Y+ D F E D S G + M+ + +A W++QGW + +
Sbjct: 276 LTNLYGKANY-YSMDPFHETND--DEAIDYSKAGRKVMEAMKRVNPNATWVIQGW--TEN 330
Query: 128 PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAG 181
P RP +K + N G L+VLDLF+E +P IW K + +++CML NF
Sbjct: 331 P--RPQMIKNMKN----GDLLVLDLFSECRPMFGIPSIWKREKGYEQHDWLFCMLENFGA 384
Query: 182 NIEMYGILDSIAFGPVEARTSENTT--MVGVGMSMEGIEQNPVVYDLMSE 229
N+ ++G +D + + S T + G+G +MEG E NPV+++LMSE
Sbjct: 385 NVGLHGRMDLLLHNFYSTKQSSPNTQHLKGIGFTMEGSENNPVMFELMSE 434
>gi|322792283|gb|EFZ16267.1| hypothetical protein SINV_02225 [Solenopsis invicta]
Length = 87
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 42/92 (45%), Positives = 61/92 (66%), Gaps = 5/92 (5%)
Query: 396 FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEAS 455
LEL +D++ +LA FLLG WL AK++A NEE+ + YE+NAR QIT+W N +
Sbjct: 1 LLELFDDLESILASGSNFLLGTWLTQAKEMADNEEERRSYEYNARNQITLWGPNGE---- 56
Query: 456 LLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 487
+RDY NK WSG++ DY+ PR ++ K + +S
Sbjct: 57 -IRDYANKQWSGVVADYFKPRWELFLKALEKS 87
>gi|326435733|gb|EGD81303.1| alpha-N-acetylglucosaminidase [Salpingoeca sp. ATCC 50818]
Length = 696
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 45/126 (35%), Positives = 71/126 (56%), Gaps = 2/126 (1%)
Query: 104 IYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST 162
+Y+ M D A+W+ QGW+ D M ++VP G+LV+LD+ AE IW+
Sbjct: 500 VYTTMTKRDPHAIWVYQGWIWLDLDNAQGFSFMSGFTSAVPRGRLVILDMEAEFDEIWAW 559
Query: 163 SKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPV 222
S+ F+ +IW + NF GN MYG + + F +++ +VGVG++MEGI+QNP
Sbjct: 560 SQSFFNTTFIWAAMDNFGGNNGMYGDIQ-LVFDRTRRVFAQSDAVVGVGITMEGIDQNPA 618
Query: 223 VYDLMS 228
Y ++
Sbjct: 619 YYQAIA 624
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 43/85 (50%), Gaps = 18/85 (21%)
Query: 7 AVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIE 66
++LPAF GNVPAAL ++P A I+ W LD DPLF I ++
Sbjct: 277 SLLPAFQGNVPAALAQLYPHANISN--GW--------------LDGLDPLFATIADLTMQ 320
Query: 67 QQLKEYGRTSHIYNCDT-FDENTPP 90
+ + ++G T H Y D FD +T P
Sbjct: 321 ELIADFGAT-HFYQADGFFDHSTGP 344
>gi|315131339|emb|CBM69278.1| venom protein Ci-120 [Chelonus inanitus]
Length = 165
Score = 88.6 bits (218), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 79/131 (60%), Gaps = 7/131 (5%)
Query: 360 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPW 418
D+TRQ+L A ++L + +++ D VF+ L +L D++ +L+ + FL+G W
Sbjct: 1 DVTRQSLQLIAEHVYLKLQQSFHQKDL-AVFKAHANLLMQLFSDLESILSTNKHFLVGKW 59
Query: 419 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 478
+++A+ L N +++K YE NAR QIT+W N + +RDY NK W+G++ Y+G R +
Sbjct: 60 IKNARSLGTNVQEQKLYELNARNQITLWGPNGE-----IRDYANKQWAGVMSQYFGARWS 114
Query: 479 IYFKYMIESLE 489
+Y + +LE
Sbjct: 115 LYLSVLEFALE 125
>gi|224135741|ref|XP_002322149.1| predicted protein [Populus trichocarpa]
gi|222869145|gb|EEF06276.1| predicted protein [Populus trichocarpa]
Length = 173
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/34 (76%), Positives = 29/34 (85%)
Query: 208 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 241
VGVGM M+GI+QNPVV DLMS+MAF H KVDVK
Sbjct: 30 VGVGMPMDGIKQNPVVSDLMSKMAFHHNKVDVKG 63
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 21/60 (35%), Positives = 33/60 (55%), Gaps = 2/60 (3%)
Query: 480 YFKYMIESLESGD--GFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 537
++ + I++ E G F +K + K + R + PVESNG+AL S+WL+ KYL
Sbjct: 97 HYTFQIQNTEFGKWPRFPVKGLEKRMDKASKQLAESRKIIPVESNGNALNISRWLFYKYL 156
>gi|147798252|emb|CAN69797.1| hypothetical protein VITISV_036335 [Vitis vinifera]
Length = 273
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/47 (55%), Positives = 31/47 (65%), Gaps = 3/47 (6%)
Query: 207 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVD---VKAWINQYSVRR 250
MVGVG+ MEGIEQNPVVY+ M EMAF E V + + N + RR
Sbjct: 112 MVGVGVCMEGIEQNPVVYESMFEMAFHSENVQLVVISSTCNTMARRR 158
>gi|297723521|ref|NP_001174124.1| Os04g0650900 [Oryza sativa Japonica Group]
gi|255675839|dbj|BAH92852.1| Os04g0650900, partial [Oryza sativa Japonica Group]
Length = 128
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/29 (75%), Positives = 26/29 (89%)
Query: 6 FAVLPAFSGNVPAALQNVFPSAKITQLGN 34
F VLPAFSGN+PAAL++ FPSAK+T LGN
Sbjct: 99 FPVLPAFSGNIPAALRSKFPSAKVTHLGN 127
>gi|293371911|ref|ZP_06618315.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
gi|292633157|gb|EFF51734.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
Length = 289
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 37/61 (60%), Gaps = 3/61 (4%)
Query: 8 VLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQ 67
VLPAF+G+VPA L+ ++P A I LG W R C + L+ D LF +I + F+++
Sbjct: 221 VLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDE 277
Query: 68 Q 68
Q
Sbjct: 278 Q 278
>gi|440799252|gb|ELR20307.1| AlphaN-acetylglucosaminidase, putative, partial [Acanthamoeba
castellanii str. Neff]
Length = 389
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 47/101 (46%), Gaps = 19/101 (18%)
Query: 2 LLADFAVLPAFSGNVPAA----------------LQNVFPSAKITQLGNWFSVKSDPRWC 45
+ VLPAF+G VP A ++ ++P+A +T+ +W ++
Sbjct: 292 MFGMLPVLPAFAGFVPEASVKFTLGRGGGCGEQGIKRIYPTANLTKSADWAGFPH--QYT 349
Query: 46 CTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 86
Y L D L+ IG I +E+G T HIYN DTF+E
Sbjct: 350 NVYFLSPLDSLYKTIGSKVIRLVEEEFG-TDHIYNADTFNE 389
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.135 0.425
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,162,292,582
Number of Sequences: 23463169
Number of extensions: 394314948
Number of successful extensions: 856777
Number of sequences better than 100.0: 512
Number of HSP's better than 100.0 without gapping: 496
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 853567
Number of HSP's gapped (non-prelim): 810
length of query: 545
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 397
effective length of database: 8,886,646,355
effective search space: 3527998602935
effective search space used: 3527998602935
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)