BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 027682
(220 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224121634|ref|XP_002318632.1| predicted protein [Populus trichocarpa]
gi|222859305|gb|EEE96852.1| predicted protein [Populus trichocarpa]
Length = 812
Score = 340 bits (871), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 163/217 (75%), Positives = 186/217 (85%), Gaps = 4/217 (1%)
Query: 1 MDSP-FRSVSLILFFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYA 59
MDSP +VSL+L L + AQSSTIGV YIS++L+IQDRERA PSVQ+AAA
Sbjct: 1 MDSPPVAAVSLVLL---ASFLLFFSFAQSSTIGVGYISKILEIQDRERALPSVQVAAARG 57
Query: 60 VLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLH 119
VLQRLLPSH S+F+FRI+SK+QCGGE CFI++NHPS RG P+I+ISGVTGVEVLAGLH
Sbjct: 58 VLQRLLPSHSSSFEFRIVSKEQCGGESCFIIKNHPSFTRRGAPQILISGVTGVEVLAGLH 117
Query: 120 WYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFA 179
WYL+YWCGSHISWDKTGGVQ+ S+PKLGS PR+QD + V+RP+P NYYQNAVTSSY+FA
Sbjct: 118 WYLKYWCGSHISWDKTGGVQLNSIPKLGSLPRLQDDSILVQRPVPWNYYQNAVTSSYSFA 177
Query: 180 WWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
WWDWKRWEKEIDWMALQGINLPLAFTGQE IWQKVFQ
Sbjct: 178 WWDWKRWEKEIDWMALQGINLPLAFTGQEAIWQKVFQ 214
>gi|297733843|emb|CBI15090.3| unnamed protein product [Vitis vinifera]
Length = 846
Score = 335 bits (860), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 160/216 (74%), Positives = 185/216 (85%), Gaps = 5/216 (2%)
Query: 1 MDSPFRSVSLILFFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAV 60
M SPF +VS ++ S L+ AQSSTIGV YISRLL+IQDRERAPPSVQ+AAAY V
Sbjct: 1 MASPFAAVS-----LSFFLFSFLSFAQSSTIGVTYISRLLEIQDRERAPPSVQIAAAYGV 55
Query: 61 LQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHW 120
L RLLPSH S+F+F I+SK+QCGG+ CF++ NHPSS G PEI+I+GVTGVE++AGLHW
Sbjct: 56 LHRLLPSHSSSFEFGIVSKEQCGGDSCFMISNHPSSSGHGAPEILITGVTGVEIMAGLHW 115
Query: 121 YLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAW 180
YL+YWCGSHISWDKTGG Q+ S+P GSFPRVQ+AG+ ++RPIP NYYQNAVTSSYTFAW
Sbjct: 116 YLKYWCGSHISWDKTGGAQLLSVPDSGSFPRVQEAGILIQRPIPWNYYQNAVTSSYTFAW 175
Query: 181 WDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
WDWKRWEKEIDWMALQGINLPLAFTGQE IWQKVF+
Sbjct: 176 WDWKRWEKEIDWMALQGINLPLAFTGQEAIWQKVFR 211
>gi|225457148|ref|XP_002280399.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Vitis vinifera]
Length = 813
Score = 335 bits (860), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 160/216 (74%), Positives = 185/216 (85%), Gaps = 5/216 (2%)
Query: 1 MDSPFRSVSLILFFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAV 60
M SPF +VS ++ S L+ AQSSTIGV YISRLL+IQDRERAPPSVQ+AAAY V
Sbjct: 1 MASPFAAVS-----LSFFLFSFLSFAQSSTIGVTYISRLLEIQDRERAPPSVQIAAAYGV 55
Query: 61 LQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHW 120
L RLLPSH S+F+F I+SK+QCGG+ CF++ NHPSS G PEI+I+GVTGVE++AGLHW
Sbjct: 56 LHRLLPSHSSSFEFGIVSKEQCGGDSCFMISNHPSSSGHGAPEILITGVTGVEIMAGLHW 115
Query: 121 YLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAW 180
YL+YWCGSHISWDKTGG Q+ S+P GSFPRVQ+AG+ ++RPIP NYYQNAVTSSYTFAW
Sbjct: 116 YLKYWCGSHISWDKTGGAQLLSVPDSGSFPRVQEAGILIQRPIPWNYYQNAVTSSYTFAW 175
Query: 181 WDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
WDWKRWEKEIDWMALQGINLPLAFTGQE IWQKVF+
Sbjct: 176 WDWKRWEKEIDWMALQGINLPLAFTGQEAIWQKVFR 211
>gi|255540793|ref|XP_002511461.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
gi|223550576|gb|EEF52063.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
Length = 809
Score = 332 bits (850), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 161/216 (74%), Positives = 178/216 (82%), Gaps = 6/216 (2%)
Query: 1 MDSPFRSVSLILFFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAV 60
MDS ++S+ L F S A SSTIGV YISRLL+IQ+RERA PSVQLAAA V
Sbjct: 1 MDSLLPAISVFLLF------SIFAFTHSSTIGVGYISRLLEIQERERASPSVQLAAARGV 54
Query: 61 LQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHW 120
L RLLPSH SAF+FRIISK+QCGG+ CFI+ N+P S TPEI+ISGV G+EV+AGLHW
Sbjct: 55 LHRLLPSHSSAFEFRIISKEQCGGQSCFIIENYPFSTGPVTPEIIISGVNGMEVVAGLHW 114
Query: 121 YLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAW 180
YL+YWCGSHISWDKTGG Q+ S+PKLGS P VQDAGV V RPIP NYYQNAVTSSYTFAW
Sbjct: 115 YLKYWCGSHISWDKTGGAQLNSIPKLGSLPHVQDAGVLVLRPIPWNYYQNAVTSSYTFAW 174
Query: 181 WDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
WDWKRWEKEIDWMALQGINLPLAFTGQE IWQKVF+
Sbjct: 175 WDWKRWEKEIDWMALQGINLPLAFTGQEAIWQKVFK 210
>gi|147860882|emb|CAN83148.1| hypothetical protein VITISV_031934 [Vitis vinifera]
Length = 562
Score = 332 bits (850), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 159/216 (73%), Positives = 183/216 (84%), Gaps = 5/216 (2%)
Query: 1 MDSPFRSVSLILFFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAV 60
M SPF +VS + S L+ AQSSTIGV YISRLL+IQDRERAPPSVQ+AAAY V
Sbjct: 1 MASPFAAVSF-----SFFLFSFLSFAQSSTIGVTYISRLLEIQDRERAPPSVQIAAAYGV 55
Query: 61 LQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHW 120
L RLLPSH S+F+F I+SK+QCGG+ CF++ NHPSS G PEI+I+GVTGVE++AGLHW
Sbjct: 56 LHRLLPSHSSSFEFGIVSKEQCGGDSCFMISNHPSSSGHGAPEILITGVTGVEIMAGLHW 115
Query: 121 YLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAW 180
YL+YWCGSHISWDKTGG Q+ S+P GSFP VQ+AG+ ++RPIP NYYQNAVTSSYTFAW
Sbjct: 116 YLKYWCGSHISWDKTGGAQLLSVPDSGSFPHVQEAGILIQRPIPWNYYQNAVTSSYTFAW 175
Query: 181 WDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
WDWKRWEKEIDWMALQGINLPLAFTGQE IWQKVF+
Sbjct: 176 WDWKRWEKEIDWMALQGINLPLAFTGQEAIWQKVFR 211
>gi|449441031|ref|XP_004138287.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis sativus]
Length = 808
Score = 309 bits (791), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 140/204 (68%), Positives = 169/204 (82%)
Query: 13 FFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAF 72
F I + + ++SSTIGV+YISRLL+IQDRER P VQ+AAA VL+RLLPSH +F
Sbjct: 9 FLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRLLPSHLPSF 68
Query: 73 QFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISW 132
F+I+SK +CGGE CF++RNH + G PEI+I+GVTGVE+LAGLHWYL++WCG+HISW
Sbjct: 69 DFQIVSKDKCGGESCFVIRNHRAFRKSGDPEILIAGVTGVEILAGLHWYLKHWCGAHISW 128
Query: 133 DKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDW 192
DKTGG Q+ S+PK G PR+Q V V+RPIPLNYYQNAVTSSY+FAWWDWKRWEKEIDW
Sbjct: 129 DKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAVTSSYSFAWWDWKRWEKEIDW 188
Query: 193 MALQGINLPLAFTGQETIWQKVFQ 216
MALQGIN+PLAFTGQE IW+KVF+
Sbjct: 189 MALQGINMPLAFTGQEAIWRKVFR 212
>gi|356519003|ref|XP_003528164.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Glycine max]
Length = 812
Score = 293 bits (751), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 138/217 (63%), Positives = 167/217 (76%), Gaps = 9/217 (4%)
Query: 1 MDSPFRSVSLILFFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAV 60
M PF ++ LI F + ++ G+ I RL+ IQDRERAPPSVQ AAA V
Sbjct: 1 MKLPFPAIFLIFIFFLP--------SSTTGAGIDTIFRLIRIQDRERAPPSVQEAAARGV 52
Query: 61 LQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHW 120
L RLLPSH S+F+FRI+SKKQCGGEYCF ++NHPS G P+I+I G TGV+++AGLHW
Sbjct: 53 LLRLLPSHSSSFEFRILSKKQCGGEYCFKIKNHPSFTKAGDPQILIEGTTGVDIVAGLHW 112
Query: 121 YLRYWCGSHISWDKTGGVQVASMPKLG-SFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFA 179
YL++WCGSHISWDKTGG Q+ S+P +G PRV AGV V+RP+P +YYQNAVTSSY+FA
Sbjct: 113 YLKHWCGSHISWDKTGGSQLFSVPNVGLLLPRVHHAGVSVQRPVPWSYYQNAVTSSYSFA 172
Query: 180 WWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
WWDW+RWE+EIDWM L G+NLPLAFTGQE IWQKVFQ
Sbjct: 173 WWDWERWEREIDWMVLHGVNLPLAFTGQEAIWQKVFQ 209
>gi|357458271|ref|XP_003599416.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
gi|355488464|gb|AES69667.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
Length = 807
Score = 268 bits (685), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 118/183 (64%), Positives = 148/183 (80%), Gaps = 1/183 (0%)
Query: 34 QYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNH 93
+ I LL D +RA PSVQ +AA VL+RLLP+H+S+F+F I+SK CGG+ CFI+ N+
Sbjct: 29 EAIQSLLHRLDSKRALPSVQESAAKGVLKRLLPTHFSSFEFIIVSKDACGGDSCFIINNY 88
Query: 94 PSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQ 153
S +G PEI+I G TGVE+ +GLHWYL+YWCG+H+SWDKTGG+Q S+PK GS P ++
Sbjct: 89 NKSSQKG-PEIIIRGTTGVEIASGLHWYLKYWCGAHVSWDKTGGIQTTSIPKPGSLPLLK 147
Query: 154 DAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQK 213
D GV +KRP+P NYYQN VTSSY+F WWDW+RWEKE+DWMALQG+NLPLAFTGQE IWQK
Sbjct: 148 DGGVKIKRPVPWNYYQNVVTSSYSFVWWDWERWEKEVDWMALQGVNLPLAFTGQEAIWQK 207
Query: 214 VFQ 216
VF+
Sbjct: 208 VFK 210
>gi|357458267|ref|XP_003599414.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
gi|355488462|gb|AES69665.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
Length = 832
Score = 268 bits (684), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 118/183 (64%), Positives = 148/183 (80%), Gaps = 1/183 (0%)
Query: 34 QYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNH 93
+ I LL D +RA PSVQ +AA VL+RLLP+H+S+F+F I+SK CGG+ CFI+ N+
Sbjct: 29 EAIQSLLHRLDSKRALPSVQESAAKGVLKRLLPTHFSSFEFIIVSKDACGGDSCFIINNY 88
Query: 94 PSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQ 153
S +G PEI+I G TGVE+ +GLHWYL+YWCG+H+SWDKTGG+Q S+PK GS P ++
Sbjct: 89 NKSSQKG-PEIIIRGTTGVEIASGLHWYLKYWCGAHVSWDKTGGIQTTSIPKPGSLPLLK 147
Query: 154 DAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQK 213
D GV +KRP+P NYYQN VTSSY+F WWDW+RWEKE+DWMALQG+NLPLAFTGQE IWQK
Sbjct: 148 DGGVKIKRPVPWNYYQNVVTSSYSFVWWDWERWEKEVDWMALQGVNLPLAFTGQEAIWQK 207
Query: 214 VFQ 216
VF+
Sbjct: 208 VFK 210
>gi|357458269|ref|XP_003599415.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
gi|355488463|gb|AES69666.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
Length = 539
Score = 267 bits (683), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 118/183 (64%), Positives = 148/183 (80%), Gaps = 1/183 (0%)
Query: 34 QYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNH 93
+ I LL D +RA PSVQ +AA VL+RLLP+H+S+F+F I+SK CGG+ CFI+ N+
Sbjct: 29 EAIQSLLHRLDSKRALPSVQESAAKGVLKRLLPTHFSSFEFIIVSKDACGGDSCFIINNY 88
Query: 94 PSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQ 153
S +G PEI+I G TGVE+ +GLHWYL+YWCG+H+SWDKTGG+Q S+PK GS P ++
Sbjct: 89 NKSSQKG-PEIIIRGTTGVEIASGLHWYLKYWCGAHVSWDKTGGIQTTSIPKPGSLPLLK 147
Query: 154 DAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQK 213
D GV +KRP+P NYYQN VTSSY+F WWDW+RWEKE+DWMALQG+NLPLAFTGQE IWQK
Sbjct: 148 DGGVKIKRPVPWNYYQNVVTSSYSFVWWDWERWEKEVDWMALQGVNLPLAFTGQEAIWQK 207
Query: 214 VFQ 216
VF+
Sbjct: 208 VFK 210
>gi|356534602|ref|XP_003535842.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Glycine max]
Length = 807
Score = 264 bits (674), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 114/183 (62%), Positives = 147/183 (80%), Gaps = 1/183 (0%)
Query: 34 QYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNH 93
+ I LL D +RAPPSVQ AAA +L+RLLP H+S+FQF+I+SK CGG+ CF++ NH
Sbjct: 25 EAIEPLLQRLDSKRAPPSVQEAAAIGLLKRLLPIHFSSFQFKIVSKDVCGGDSCFLINNH 84
Query: 94 PSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQ 153
S + PEI+I G T VE+ +GLHWYL+YWCG+H+SWDKTGG+Q S+P+ GS P ++
Sbjct: 85 NKSS-QNEPEIIIRGTTAVEIASGLHWYLKYWCGAHVSWDKTGGIQTTSIPEPGSLPSLK 143
Query: 154 DAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQK 213
D G+ +KRP+P NYYQN VTSSY++ WW+W+RWEKE+DWMALQG+NLPLAFTGQE IWQK
Sbjct: 144 DEGLKIKRPVPWNYYQNVVTSSYSYVWWNWERWEKELDWMALQGVNLPLAFTGQEAIWQK 203
Query: 214 VFQ 216
VF+
Sbjct: 204 VFK 206
>gi|326519955|dbj|BAK03902.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 829
Score = 263 bits (672), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 115/181 (63%), Positives = 147/181 (81%)
Query: 36 ISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPS 95
+ RL ++ RER P + Q+ AA +L RLLPSH ++F+FR++S +QCGG+ CF + NHPS
Sbjct: 35 LDRLRELHQRERRPAAEQVDAARGLLARLLPSHSASFEFRVVSTEQCGGKACFNINNHPS 94
Query: 96 SYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDA 155
+ GTPEI+I G +GVE+ AGLHWYL+++C +HISW KTGG Q++S+P GS PRV
Sbjct: 95 FHGEGTPEILILGASGVEISAGLHWYLKHYCAAHISWAKTGGAQLSSVPYPGSLPRVPAG 154
Query: 156 GVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
G+ ++RP+ +YYQNAVTSSY+FAWWDW+RWEKEIDWMALQGINLPLAFTGQETIWQKVF
Sbjct: 155 GILIQRPVDWSYYQNAVTSSYSFAWWDWERWEKEIDWMALQGINLPLAFTGQETIWQKVF 214
Query: 216 Q 216
Q
Sbjct: 215 Q 215
>gi|326515664|dbj|BAK07078.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 829
Score = 263 bits (672), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 115/181 (63%), Positives = 147/181 (81%)
Query: 36 ISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPS 95
+ RL ++ RER P + Q+ AA +L RLLPSH ++F+FR++S +QCGG+ CF + NHPS
Sbjct: 35 LDRLRELHQRERRPAAEQVDAARGLLARLLPSHSASFEFRVVSTEQCGGKACFNINNHPS 94
Query: 96 SYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDA 155
+ GTPEI+I G +GVE+ AGLHWYL+++C +HISW KTGG Q++S+P GS PRV
Sbjct: 95 FHGEGTPEILILGASGVEISAGLHWYLKHYCAAHISWAKTGGAQLSSVPYPGSLPRVPAG 154
Query: 156 GVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
G+ ++RP+ +YYQNAVTSSY+FAWWDW+RWEKEIDWMALQGINLPLAFTGQETIWQKVF
Sbjct: 155 GILIQRPVDWSYYQNAVTSSYSFAWWDWERWEKEIDWMALQGINLPLAFTGQETIWQKVF 214
Query: 216 Q 216
Q
Sbjct: 215 Q 215
>gi|357166414|ref|XP_003580702.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Brachypodium
distachyon]
Length = 829
Score = 262 bits (670), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 117/205 (57%), Positives = 151/205 (73%)
Query: 12 LFFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSA 71
+ + +V + S + RL ++ RER PP+ Q+ AA +L RLLPSH ++
Sbjct: 10 FLLLVAVIACAASVTRCSGRRFAQLDRLRELHQRERRPPAEQVYAAEGLLARLLPSHSTS 69
Query: 72 FQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHIS 131
F+FR+IS +QCGG+ CFI+ NHP GTPEI+I GV+GVE+ AGLHWYL+++C +HIS
Sbjct: 70 FEFRVISTEQCGGKACFIINNHPLFDGEGTPEILILGVSGVEISAGLHWYLKHYCAAHIS 129
Query: 132 WDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEID 191
W KTGG Q++S+P GS P V G+ ++RP+ +YYQNAVTSSY+FAWWDW+RWE EID
Sbjct: 130 WAKTGGAQLSSVPHPGSLPHVPAGGILIRRPVDWSYYQNAVTSSYSFAWWDWERWENEID 189
Query: 192 WMALQGINLPLAFTGQETIWQKVFQ 216
WMALQGINLPLAFTGQE IWQKVFQ
Sbjct: 190 WMALQGINLPLAFTGQEAIWQKVFQ 214
>gi|297736304|emb|CBI24942.3| unnamed protein product [Vitis vinifera]
Length = 868
Score = 261 bits (668), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 123/213 (57%), Positives = 159/213 (74%), Gaps = 4/213 (1%)
Query: 3 SPFRSVSLILFFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQ 62
+PF+++S L ++ + L L ++ SS + I LL +RA PSVQ +AA AVLQ
Sbjct: 61 NPFQTMSKFLLWVLML-LPFLPLSSSSH--SEAIEALLSRLATKRAAPSVQESAAKAVLQ 117
Query: 63 RLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYL 122
RLLP+H +FQF I+SK CGG+ CF + N+ S G PEI+I G T VE+ +GLHWY+
Sbjct: 118 RLLPTHLDSFQFEIVSKDVCGGKSCFWISNYNVSSKNG-PEIMIKGTTAVEIASGLHWYI 176
Query: 123 RYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWD 182
+YWCG+H+SWDKTG +Q+AS+PK GS P V+D GV ++RP+P NYYQN VTSSY++ WWD
Sbjct: 177 KYWCGAHVSWDKTGSIQIASIPKPGSLPLVKDEGVLIQRPVPWNYYQNVVTSSYSYVWWD 236
Query: 183 WKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
W+RWEKEIDWMALQG+NLPLAF GQE IWQKVF
Sbjct: 237 WERWEKEIDWMALQGVNLPLAFNGQEAIWQKVF 269
>gi|225450036|ref|XP_002273084.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Vitis vinifera]
Length = 803
Score = 258 bits (658), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 117/188 (62%), Positives = 145/188 (77%), Gaps = 1/188 (0%)
Query: 28 SSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYC 87
SS+ + I LL +RA PSVQ +AA AVLQRLLP+H +FQF I+SK CGG+ C
Sbjct: 18 SSSSHSEAIEALLSRLATKRAAPSVQESAAKAVLQRLLPTHLDSFQFEIVSKDVCGGKSC 77
Query: 88 FILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLG 147
F + N+ S G PEI+I G T VE+ +GLHWY++YWCG+H+SWDKTG +Q+AS+PK G
Sbjct: 78 FWISNYNVSSKNG-PEIMIKGTTAVEIASGLHWYIKYWCGAHVSWDKTGSIQIASIPKPG 136
Query: 148 SFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQ 207
S P V+D GV ++RP+P NYYQN VTSSY++ WWDW+RWEKEIDWMALQG+NLPLAF GQ
Sbjct: 137 SLPLVKDEGVLIQRPVPWNYYQNVVTSSYSYVWWDWERWEKEIDWMALQGVNLPLAFNGQ 196
Query: 208 ETIWQKVF 215
E IWQKVF
Sbjct: 197 EAIWQKVF 204
>gi|168060822|ref|XP_001782392.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666123|gb|EDQ52786.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 801
Score = 256 bits (654), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 118/205 (57%), Positives = 151/205 (73%), Gaps = 9/205 (4%)
Query: 13 FFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAF 72
F + +T S V Q T+ L+ E+AP S Q A+Y VL RLLPSH S+F
Sbjct: 14 FLLVVVTFSDCHVPQYETV--------LNKIQSEKAPASTQERASYDVLARLLPSHLSSF 65
Query: 73 QFRIISKKQCGGEYCFILRNHPSSYIRG-TPEIVISGVTGVEVLAGLHWYLRYWCGSHIS 131
+FRII+K++CGG++CF+L+NHPS+ + G PEI ISG TGVE+ AGLHWYL++WC HIS
Sbjct: 66 EFRIITKEECGGKFCFLLQNHPSAGVYGGAPEIRISGTTGVELCAGLHWYLKHWCNGHIS 125
Query: 132 WDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEID 191
WDKTGG Q+ ++ + G PR+Q + ++RP+ NYYQN VTSSY++ WW W+RWEKEID
Sbjct: 126 WDKTGGAQLHTVHRPGFLPRLQGDTLTIQRPVDWNYYQNVVTSSYSYVWWTWERWEKEID 185
Query: 192 WMALQGINLPLAFTGQETIWQKVFQ 216
WMALQGINLPLAFTGQE +WQKVFQ
Sbjct: 186 WMALQGINLPLAFTGQEAVWQKVFQ 210
>gi|224106113|ref|XP_002314048.1| predicted protein [Populus trichocarpa]
gi|222850456|gb|EEE88003.1| predicted protein [Populus trichocarpa]
Length = 806
Score = 256 bits (653), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 114/182 (62%), Positives = 140/182 (76%)
Query: 34 QYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNH 93
+ I LL D +RA S Q +AA AVL+RLLPSH +F F+I+SK CGG CF++ N+
Sbjct: 27 EAIDSLLKRLDSKRASSSDQESAAKAVLKRLLPSHIHSFLFKIVSKDVCGGHSCFLINNY 86
Query: 94 PSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQ 153
PEI I G T VE+ +GLHWYL+YWCG+H+SWDKTGGVQ+AS+PK GS P V+
Sbjct: 87 YKESSGNGPEISIKGTTAVEIASGLHWYLKYWCGAHVSWDKTGGVQIASIPKPGSLPHVK 146
Query: 154 DAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQK 213
D GV ++RP+P NYYQN VTSSY++ WW+W+RWEKE+DWMALQGINLPLAFTGQE IWQK
Sbjct: 147 DKGVMIQRPVPWNYYQNVVTSSYSYVWWNWERWEKELDWMALQGINLPLAFTGQEAIWQK 206
Query: 214 VF 215
VF
Sbjct: 207 VF 208
>gi|414585094|tpg|DAA35665.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
Length = 1202
Score = 254 bits (650), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 111/184 (60%), Positives = 147/184 (79%), Gaps = 2/184 (1%)
Query: 35 YISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISK--KQCGGEYCFILRN 92
++ R+ ++Q RE + + Q AAA +L RLLPSH ++F+FR+IS + CGG+ CF++ N
Sbjct: 33 HLDRVRELQRREGSSSAEQEAAARGLLARLLPSHSTSFEFRVISTSTELCGGKTCFVINN 92
Query: 93 HPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRV 152
HP GTPEI+I G +GVE+ AG HWYL+++C +HISWDKTGG Q++++P+ GS PRV
Sbjct: 93 HPLFDGEGTPEILILGASGVEISAGFHWYLKHYCAAHISWDKTGGAQLSTIPRPGSLPRV 152
Query: 153 QDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQ 212
D GV ++RPI +YYQNAVTSSY+FAWWDW RWEKEIDWMALQGINLPLAFTGQE+IWQ
Sbjct: 153 PDGGVLIQRPIDWSYYQNAVTSSYSFAWWDWDRWEKEIDWMALQGINLPLAFTGQESIWQ 212
Query: 213 KVFQ 216
++F+
Sbjct: 213 RIFE 216
>gi|414585093|tpg|DAA35664.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
Length = 721
Score = 252 bits (643), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 118/215 (54%), Positives = 157/215 (73%), Gaps = 3/215 (1%)
Query: 4 PFRSVSLILFFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQR 63
P S L L + I + V + S ++ R+ ++Q RE + + Q AAA +L R
Sbjct: 3 PRPSRFLQLLLVVVIACAVPGV-RCSDRRFPHLDRVRELQRREGSSSAEQEAAARGLLAR 61
Query: 64 LLPSHYSAFQFRIISK--KQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWY 121
LLPSH ++F+FR+IS + CGG+ CF++ NHP GTPEI+I G +GVE+ AG HWY
Sbjct: 62 LLPSHSTSFEFRVISTSTELCGGKTCFVINNHPLFDGEGTPEILILGASGVEISAGFHWY 121
Query: 122 LRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWW 181
L+++C +HISWDKTGG Q++++P+ GS PRV D GV ++RPI +YYQNAVTSSY+FAWW
Sbjct: 122 LKHYCAAHISWDKTGGAQLSTIPRPGSLPRVPDGGVLIQRPIDWSYYQNAVTSSYSFAWW 181
Query: 182 DWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
DW RWEKEIDWMALQGINLPLAFTGQE+IWQ++F+
Sbjct: 182 DWDRWEKEIDWMALQGINLPLAFTGQESIWQRIFE 216
>gi|414585092|tpg|DAA35663.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
Length = 831
Score = 252 bits (643), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 118/215 (54%), Positives = 157/215 (73%), Gaps = 3/215 (1%)
Query: 4 PFRSVSLILFFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQR 63
P S L L + I + V + S ++ R+ ++Q RE + + Q AAA +L R
Sbjct: 3 PRPSRFLQLLLVVVIACAVPGV-RCSDRRFPHLDRVRELQRREGSSSAEQEAAARGLLAR 61
Query: 64 LLPSHYSAFQFRIISK--KQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWY 121
LLPSH ++F+FR+IS + CGG+ CF++ NHP GTPEI+I G +GVE+ AG HWY
Sbjct: 62 LLPSHSTSFEFRVISTSTELCGGKTCFVINNHPLFDGEGTPEILILGASGVEISAGFHWY 121
Query: 122 LRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWW 181
L+++C +HISWDKTGG Q++++P+ GS PRV D GV ++RPI +YYQNAVTSSY+FAWW
Sbjct: 122 LKHYCAAHISWDKTGGAQLSTIPRPGSLPRVPDGGVLIQRPIDWSYYQNAVTSSYSFAWW 181
Query: 182 DWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
DW RWEKEIDWMALQGINLPLAFTGQE+IWQ++F+
Sbjct: 182 DWDRWEKEIDWMALQGINLPLAFTGQESIWQRIFE 216
>gi|218195716|gb|EEC78143.1| hypothetical protein OsI_17702 [Oryza sativa Indica Group]
Length = 829
Score = 251 bits (640), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 109/182 (59%), Positives = 143/182 (78%)
Query: 35 YISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHP 94
++ R+ ++ E P + Q AAA +L RLLPSH +F FR+IS QCGG+ CFI+ NHP
Sbjct: 32 HLGRVRELHRGEGRPAAEQEAAARGLLARLLPSHSGSFDFRVISADQCGGKACFIVDNHP 91
Query: 95 SSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQD 154
GTP++++ G +GVE+ AGLHWYL+++C +H+SWDKTGG Q++S+P+ GS PR+
Sbjct: 92 LFDGEGTPQVLLLGTSGVEISAGLHWYLKHYCAAHVSWDKTGGAQLSSVPRPGSLPRLPS 151
Query: 155 AGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
G+ ++RP+ +YYQNAVTSSY+FAWWDW+RWEKEIDWMALQGINLPLAFTGQE IWQKV
Sbjct: 152 GGILIQRPVGWSYYQNAVTSSYSFAWWDWERWEKEIDWMALQGINLPLAFTGQEAIWQKV 211
Query: 215 FQ 216
FQ
Sbjct: 212 FQ 213
>gi|4160292|emb|CAA77084.1| alpha-N-acetylglucosaminidase [Nicotiana tabacum]
Length = 811
Score = 247 bits (631), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 118/208 (56%), Positives = 151/208 (72%), Gaps = 3/208 (1%)
Query: 8 VSLILFFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPS 67
V+L +F+ I ++ + +SS + I +L + APP VQ +AA VLQRLLP+
Sbjct: 2 VNLKFWFLLAIL--SVFLCKSSAVESDAIESVLRRLHSKEAPPIVQESAAKGVLQRLLPA 59
Query: 68 HYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCG 127
H +F+F+I+SK CGG CF + N+ S R +PEI+I G T VE+ +GLHWYL+Y CG
Sbjct: 60 HLHSFEFKIVSKDLCGGRSCFRITNYKVSS-RNSPEILIQGTTAVEITSGLHWYLKYRCG 118
Query: 128 SHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWE 187
+HISWDKTGGVQ+AS+PK GS P V+ + ++RP+P NYYQN VTSSY+F WWDW+RWE
Sbjct: 119 AHISWDKTGGVQLASVPKPGSLPLVKQMELTIQRPVPWNYYQNVVTSSYSFVWWDWQRWE 178
Query: 188 KEIDWMALQGINLPLAFTGQETIWQKVF 215
KEIDWM L GINLPLAFTGQE IWQKVF
Sbjct: 179 KEIDWMTLPGINLPLAFTGQEAIWQKVF 206
>gi|449436325|ref|XP_004135943.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis sativus]
Length = 774
Score = 242 bits (618), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 107/173 (61%), Positives = 138/173 (79%), Gaps = 1/173 (0%)
Query: 44 DRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPE 103
D + PS+Q AAA A+L+RLLP+H +F+F+I+S+ CGG CF++ N SS R E
Sbjct: 34 DSKALSPSIQEAAAKALLRRLLPTHVDSFEFQIVSRDVCGGGSCFLISNFKSSS-RNGAE 92
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
I+I G T VE+ +GL+WYL+YWCG+H+SWDKTGGVQ+AS+PK GS P ++ GV +KRP+
Sbjct: 93 ILIRGTTAVEITSGLYWYLKYWCGAHVSWDKTGGVQLASIPKPGSLPFLKGNGVVIKRPV 152
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
P NYYQN VTSSY++ WWDW+RWEKEIDWMAL GINLPLAFTGQE+IW+ VF+
Sbjct: 153 PWNYYQNVVTSSYSYVWWDWERWEKEIDWMALHGINLPLAFTGQESIWRNVFR 205
>gi|15240689|ref|NP_196873.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
gi|9758035|dbj|BAB08696.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
gi|19423948|gb|AAL87291.1| putative alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
gi|21436231|gb|AAM51254.1| putative alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
gi|332004545|gb|AED91928.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
Length = 806
Score = 241 bits (615), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 109/181 (60%), Positives = 136/181 (75%), Gaps = 1/181 (0%)
Query: 36 ISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPS 95
I LLD D SVQ +AA +LQRLLP+H +F+ RIISK CGG CF++ N+
Sbjct: 28 IDGLLDRLDSLLPTSSVQESAAKGLLQRLLPTHSQSFELRIISKDACGGTSCFVIENYDG 87
Query: 96 SYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDA 155
R PEI+I G TGVE+ +GLHWYL+Y C +H+SWDKTGG+QVAS+P+ G PR+
Sbjct: 88 PG-RIGPEILIKGTTGVEIASGLHWYLKYKCNAHVSWDKTGGIQVASVPQPGHLPRIDSK 146
Query: 156 GVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+F++RP+P NYYQN VTSSY++ WW W+RWE+EIDWMALQGINLPLAFTGQE IWQKVF
Sbjct: 147 RIFIRRPVPWNYYQNVVTSSYSYVWWGWERWEREIDWMALQGINLPLAFTGQEAIWQKVF 206
Query: 216 Q 216
+
Sbjct: 207 K 207
>gi|222629680|gb|EEE61812.1| hypothetical protein OsJ_16433 [Oryza sativa Japonica Group]
Length = 1129
Score = 241 bits (614), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 110/187 (58%), Positives = 144/187 (77%)
Query: 30 TIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFI 89
T G ++ R+ ++ E P + Q AAA +L LLPSH +F FR+IS QCGG+ CFI
Sbjct: 327 TRGSPHLGRVRELHRGEGRPAAEQEAAARGLLALLLPSHSGSFDFRVISADQCGGKACFI 386
Query: 90 LRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSF 149
+ NHP GTP++++ G +GVE+ AGLHWYL+++C +H+SWDKTGG Q++S+P+ GS
Sbjct: 387 VDNHPLFDGEGTPQVLLLGTSGVEISAGLHWYLKHYCAAHVSWDKTGGAQLSSVPRPGSL 446
Query: 150 PRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQET 209
PR+ G+ ++RP+ +YYQNAVTSSY+FAWWDW+RWEKEIDWMALQGINLPLAFTGQE
Sbjct: 447 PRLPSGGILIQRPVGWSYYQNAVTSSYSFAWWDWERWEKEIDWMALQGINLPLAFTGQEA 506
Query: 210 IWQKVFQ 216
IWQKVFQ
Sbjct: 507 IWQKVFQ 513
>gi|297807393|ref|XP_002871580.1| alpha-N-acetylglucosaminidase family [Arabidopsis lyrata subsp.
lyrata]
gi|297317417|gb|EFH47839.1| alpha-N-acetylglucosaminidase family [Arabidopsis lyrata subsp.
lyrata]
Length = 806
Score = 240 bits (613), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 108/181 (59%), Positives = 136/181 (75%), Gaps = 1/181 (0%)
Query: 36 ISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPS 95
I RLL+ D SVQ +AA +LQRLLP+H+ +F+FRI SK CGG CF++ N+
Sbjct: 28 IDRLLNRLDSLLPTSSVQESAAKGLLQRLLPTHFHSFEFRIFSKDVCGGTSCFLIENYDD 87
Query: 96 SYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDA 155
R PEI I G TGVE+ +GLHWYL+Y C +H+SWDKTGG+Q+AS+P+ G PR+
Sbjct: 88 PR-RIGPEIRIKGTTGVEIASGLHWYLKYKCNAHVSWDKTGGIQIASVPQPGHLPRLDSK 146
Query: 156 GVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+ ++RPIP NYYQN VTSSY++ WW W+RWE+EIDWMALQGINLPLAFTGQE IWQKVF
Sbjct: 147 RILIRRPIPWNYYQNVVTSSYSYVWWGWERWEREIDWMALQGINLPLAFTGQEAIWQKVF 206
Query: 216 Q 216
+
Sbjct: 207 K 207
>gi|90399367|emb|CAJ86183.1| H0212B02.15 [Oryza sativa Indica Group]
gi|116311963|emb|CAJ86322.1| OSIGBa0113E10.5 [Oryza sativa Indica Group]
Length = 692
Score = 238 bits (608), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 109/206 (52%), Positives = 144/206 (69%), Gaps = 24/206 (11%)
Query: 35 YISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIIS---------------- 78
++ R+ ++ E P + Q AAA +L RLLPSH +F FR+IS
Sbjct: 32 HLGRVRELHRGEGRPAAEQEAAARGLLARLLPSHSGSFDFRVISADFPLPGTVVVLICKD 91
Query: 79 --------KKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHI 130
+ QCGG+ CFI+ NHP GTP++++ G +GVE+ AGLHWYL+++C +H+
Sbjct: 92 SQSVPWNPQDQCGGKACFIVDNHPLFDGEGTPQVLLLGTSGVEISAGLHWYLKHYCAAHV 151
Query: 131 SWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEI 190
SWDKTGG Q++S+P+ GS PR+ G+ ++RP+ +YYQNAVTSSY+FAWWDW+RWEKEI
Sbjct: 152 SWDKTGGAQLSSVPRPGSLPRLPSGGILIQRPVGWSYYQNAVTSSYSFAWWDWERWEKEI 211
Query: 191 DWMALQGINLPLAFTGQETIWQKVFQ 216
DWMALQGINLPLAFTGQE IWQKVFQ
Sbjct: 212 DWMALQGINLPLAFTGQEAIWQKVFQ 237
>gi|222624949|gb|EEE59081.1| hypothetical protein OsJ_10898 [Oryza sativa Japonica Group]
Length = 812
Score = 238 bits (606), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 111/170 (65%), Positives = 132/170 (77%), Gaps = 2/170 (1%)
Query: 48 APPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQ-CGGEYCFILRNHPSSYIRGTPEIVI 106
A P Q AAA +L+RLLPSH +F+F+I+SK CGG CF + N S R EI+I
Sbjct: 51 ASPGEQEAAAAGLLRRLLPSHARSFRFQIVSKGGVCGGSSCFRISNADGSR-RNGAEILI 109
Query: 107 SGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLN 166
G T VE+ +GLHWYL+YWCG+HISWDKTGG Q+AS+P GS P+V+ GV ++RP+P N
Sbjct: 110 QGTTAVELASGLHWYLKYWCGAHISWDKTGGAQLASVPLPGSLPQVKGTGVKIERPVPWN 169
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YYQN VTSSY+F WWDWKRWEKEIDWMALQGINLPLAFTGQE IWQKVF+
Sbjct: 170 YYQNVVTSSYSFVWWDWKRWEKEIDWMALQGINLPLAFTGQEAIWQKVFK 219
>gi|357112065|ref|XP_003557830.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Brachypodium
distachyon]
Length = 809
Score = 238 bits (606), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 112/173 (64%), Positives = 130/173 (75%), Gaps = 2/173 (1%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQ-CGGEYCFILRNHPSSYIRGTPE 103
R A P+VQ AA VL RLLPSH +F+F+I +K CG CF + N S RG E
Sbjct: 45 RRAASPAVQEGAAAGVLHRLLPSHAHSFRFQIDTKGGVCGESSCFRISNVDGSG-RGGAE 103
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
I+I G T VE+ +GLHWYL+YWCG HISWDKTGG Q+AS+P GS PRV GV +KRP+
Sbjct: 104 IMIQGTTAVELASGLHWYLKYWCGVHISWDKTGGAQLASVPSPGSLPRVNGTGVKIKRPV 163
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
P NYYQN VTSSY+F WWDW+RWEKEIDWMALQGINLPLAFTGQE +WQKVF+
Sbjct: 164 PWNYYQNVVTSSYSFVWWDWRRWEKEIDWMALQGINLPLAFTGQEAVWQKVFK 216
>gi|242035709|ref|XP_002465249.1| hypothetical protein SORBIDRAFT_01g034960 [Sorghum bicolor]
gi|241919103|gb|EER92247.1| hypothetical protein SORBIDRAFT_01g034960 [Sorghum bicolor]
Length = 777
Score = 236 bits (603), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 109/171 (63%), Positives = 132/171 (77%), Gaps = 1/171 (0%)
Query: 47 RAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQ-CGGEYCFILRNHPSSYIRGTPEIV 105
+A P+ Q AAA +L+RLLPSH +F F+I SK CG CF + N +G EI+
Sbjct: 50 KASPAEQEAAAAGLLRRLLPSHAGSFSFQIDSKGAVCGQSSCFRISNVVDGSGKGGAEIL 109
Query: 106 ISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPL 165
I G TGVE+ +GLHWYL+YWCG+HISWD+TGG Q+AS+P GS PRVQ GV ++RP+
Sbjct: 110 IQGTTGVELASGLHWYLKYWCGAHISWDRTGGAQLASIPSPGSLPRVQGKGVKIERPVLW 169
Query: 166 NYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
NYYQN VTSSY++ WWDWKRWEKEIDWMALQGINLPLAFTGQE+IWQKVF+
Sbjct: 170 NYYQNVVTSSYSYVWWDWKRWEKEIDWMALQGINLPLAFTGQESIWQKVFK 220
>gi|38345908|emb|CAE04506.2| OSJNBb0059K02.16 [Oryza sativa Japonica Group]
Length = 829
Score = 236 bits (602), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 108/182 (59%), Positives = 142/182 (78%)
Query: 35 YISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHP 94
++ R+ ++ E P + Q AAA +L LLPSH +F FR+IS QCGG+ CFI+ NHP
Sbjct: 32 HLGRVRELHRGEGRPAAEQEAAARGLLALLLPSHSGSFDFRVISADQCGGKACFIVDNHP 91
Query: 95 SSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQD 154
GTP++++ G +GVE+ AGLHWYL+++C +H+SWDKTGG Q++S+P+ GS PR+
Sbjct: 92 LFDGEGTPQVLLLGTSGVEISAGLHWYLKHYCAAHVSWDKTGGAQLSSVPRPGSLPRLPS 151
Query: 155 AGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
G+ ++RP+ +YYQNAVTSSY+FAWWDW+RWEKEIDWMALQGINLPLAFTGQE IWQKV
Sbjct: 152 GGILIQRPVGWSYYQNAVTSSYSFAWWDWERWEKEIDWMALQGINLPLAFTGQEAIWQKV 211
Query: 215 FQ 216
FQ
Sbjct: 212 FQ 213
>gi|413955691|gb|AFW88340.1| hypothetical protein ZEAMMB73_315381 [Zea mays]
Length = 814
Score = 235 bits (600), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 109/171 (63%), Positives = 130/171 (76%), Gaps = 1/171 (0%)
Query: 47 RAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQ-CGGEYCFILRNHPSSYIRGTPEIV 105
+A P+ Q AA +L+RLLP H +F F+I SK CG CF + N +G EI+
Sbjct: 50 KASPAEQEGAAAGLLRRLLPFHSGSFSFQIDSKGGVCGQSSCFRISNVVDGSGKGGAEIL 109
Query: 106 ISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPL 165
I G T VE+ +GLHWYL+YWCG+HISWDKTGG Q+AS+P GS PRVQ GV ++RP+P
Sbjct: 110 IQGTTAVELASGLHWYLKYWCGAHISWDKTGGAQLASIPSPGSLPRVQGKGVKIERPVPW 169
Query: 166 NYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
NYYQN VTSSY+F WWDWKRWEKEIDWMALQGINLPLAFTGQE+IWQKVF+
Sbjct: 170 NYYQNVVTSSYSFVWWDWKRWEKEIDWMALQGINLPLAFTGQESIWQKVFK 220
>gi|218192858|gb|EEC75285.1| hypothetical protein OsI_11626 [Oryza sativa Indica Group]
Length = 812
Score = 234 bits (596), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 110/170 (64%), Positives = 131/170 (77%), Gaps = 2/170 (1%)
Query: 48 APPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQ-CGGEYCFILRNHPSSYIRGTPEIVI 106
A P Q AAA +L+RLLPSH +F+F+I+SK CGG CF + N S R EI+I
Sbjct: 51 ASPGEQEAAAAGLLRRLLPSHARSFRFQIVSKGGVCGGSSCFRISNADGSR-RNGAEILI 109
Query: 107 SGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLN 166
G T VE+ +GLHWYL+YWCG+HISWDKTGG Q+AS+P GS P+V+ V ++RP+P N
Sbjct: 110 QGTTAVELASGLHWYLKYWCGAHISWDKTGGAQLASVPLPGSLPQVKRTVVKIERPVPWN 169
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YYQN VTSSY+F WWDWKRWEKEIDWMALQGINLPLAFTGQE IWQKVF+
Sbjct: 170 YYQNVVTSSYSFVWWDWKRWEKEIDWMALQGINLPLAFTGQEAIWQKVFK 219
>gi|302791289|ref|XP_002977411.1| hypothetical protein SELMODRAFT_107285 [Selaginella moellendorffii]
gi|300154781|gb|EFJ21415.1| hypothetical protein SELMODRAFT_107285 [Selaginella moellendorffii]
Length = 761
Score = 215 bits (547), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 100/171 (58%), Positives = 128/171 (74%), Gaps = 3/171 (1%)
Query: 49 PPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISG 108
P QL AA +VL+RL+P H SAFQ +II+ + CGG+ CF L S G+ +I I G
Sbjct: 3 PARDQLRAAASVLRRLIPGHVSAFQLQIITLEDCGGKSCFQLS---SLMDEGSRKISIHG 59
Query: 109 VTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYY 168
+GVE+ AGL+WYL+ WCG+HISWD TGG Q+ ++P+ + P + GV ++R +P +YY
Sbjct: 60 TSGVELCAGLYWYLKNWCGAHISWDLTGGSQLDTIPEPQALPALPPGGVKIQRSVPWSYY 119
Query: 169 QNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVAF 219
QN VTSSY++AWW+WKRW KEIDWMALQGINLPLAFTGQETIWQKVF+ F
Sbjct: 120 QNVVTSSYSYAWWNWKRWRKEIDWMALQGINLPLAFTGQETIWQKVFESKF 170
>gi|302786446|ref|XP_002974994.1| hypothetical protein SELMODRAFT_102402 [Selaginella moellendorffii]
gi|300157153|gb|EFJ23779.1| hypothetical protein SELMODRAFT_102402 [Selaginella moellendorffii]
Length = 761
Score = 215 bits (547), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 100/171 (58%), Positives = 128/171 (74%), Gaps = 3/171 (1%)
Query: 49 PPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISG 108
P QL AA +VL+RL+P H SAFQ +II+ + CGG+ CF L S G+ +I I G
Sbjct: 3 PARDQLRAAASVLRRLIPGHASAFQLQIITLEDCGGKSCFQLS---SLMDEGSRKISIHG 59
Query: 109 VTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYY 168
+GVE+ AGL+WYL+ WCG+HISWD TGG Q+ ++P+ + P + GV ++R +P +YY
Sbjct: 60 TSGVELCAGLYWYLKNWCGAHISWDLTGGSQLDTIPEPQALPALPPGGVKIQRSVPWSYY 119
Query: 169 QNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVAF 219
QN VTSSY++AWW+WKRW KEIDWMALQGINLPLAFTGQETIWQKVF+ F
Sbjct: 120 QNVVTSSYSYAWWNWKRWRKEIDWMALQGINLPLAFTGQETIWQKVFESKF 170
>gi|255553490|ref|XP_002517786.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
gi|223543058|gb|EEF44593.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
Length = 174
Score = 172 bits (436), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 79/143 (55%), Positives = 102/143 (71%), Gaps = 1/143 (0%)
Query: 33 VQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRN 92
VQ I LL D +R+ PSVQ +AA VLQRLLP+H +F+F I+ K CGG CF++ N
Sbjct: 30 VQPIEALLSRLDSKRSSPSVQESAAKGVLQRLLPTHVHSFEFEIVPKDVCGGHSCFLINN 89
Query: 93 HPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRV 152
+ + PEI I G + VE+ +GLHWYL+YWCG+HISWDKTGG+Q AS+P GS P V
Sbjct: 90 S-NQLSQDGPEIFIKGTSAVELTSGLHWYLKYWCGAHISWDKTGGIQKASIPNPGSLPPV 148
Query: 153 QDAGVFVKRPIPLNYYQNAVTSS 175
+D G+ ++RP+P NYYQN VTSS
Sbjct: 149 KDEGIVIQRPVPWNYYQNVVTSS 171
>gi|384247107|gb|EIE20595.1| hypothetical protein COCSUDRAFT_37819 [Coccomyxa subellipsoidea
C-169]
Length = 762
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/167 (44%), Positives = 101/167 (60%), Gaps = 9/167 (5%)
Query: 51 SVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVT 110
S Q+ A +L R+LP + F+ R + CF + IV+ G +
Sbjct: 15 SQQIDAVNGLLDRILPEYSHLFELRALPDCDSIQSACFEVHTD-------GKRIVVKGAS 67
Query: 111 GVEVLAGLHWYLRYWCGSHISWDKTGGVQV-ASMPKLGSFPRVQDAGVFV-KRPIPLNYY 168
G+E+ +G+HW+L+Y C S SW TGG + +S K S R+ AG R +P +YY
Sbjct: 68 GIEMASGIHWFLKYACNSSTSWHLTGGNNIDSSCFKPESLERISLAGPLRGARSVPWSYY 127
Query: 169 QNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
QN VT SY+ A+WDW+RWE+EIDWMALQGINLPLAFTGQE +WQKV+
Sbjct: 128 QNVVTPSYSMAFWDWERWEQEIDWMALQGINLPLAFTGQEYVWQKVW 174
>gi|255079272|ref|XP_002503216.1| GH family 89 protein [Micromonas sp. RCC299]
gi|226518482|gb|ACO64474.1| GH family 89 protein [Micromonas sp. RCC299]
Length = 1260
Score = 141 bits (356), Expect = 2e-31, Method: Composition-based stats.
Identities = 72/160 (45%), Positives = 98/160 (61%), Gaps = 10/160 (6%)
Query: 60 VLQRLLPSHYSAFQFRIISKKQCGG----EYCFILRNHPSSYIRGTPEIVISGVTGVEVL 115
+++RLLP+H + + + GG CF ++N GT I I+G TGVE
Sbjct: 30 LMERLLPNHAGSNLIEFRGRHRDGGCAEGARCFSIQNGDKP---GT--ISIAGSTGVEQA 84
Query: 116 AGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSS 175
AGLH YLR +CG+H+ W+ TGG Q+ S+P+ GS P V DAGV V P Y N T S
Sbjct: 85 AGLHHYLRRFCGAHLGWEATGGHQLHSVPR-GSLPPVDDAGVVVNLPFERTVYMNPETFS 143
Query: 176 YTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
Y+ A+WD++RWEKEI+WMAL G+N P+A G E +W +V
Sbjct: 144 YSTAFWDYERWEKEIEWMALHGVNTPMALNGVEQVWMRVL 183
>gi|325103828|ref|YP_004273482.1| alpha-N-acetylglucosaminidase [Pedobacter saltans DSM 12145]
gi|324972676|gb|ADY51660.1| Alpha-N-acetylglucosaminidase [Pedobacter saltans DSM 12145]
Length = 738
Score = 128 bits (321), Expect = 2e-27, Method: Composition-based stats.
Identities = 66/165 (40%), Positives = 100/165 (60%), Gaps = 16/165 (9%)
Query: 52 VQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTG 111
+ +AA+ ++QR++P HY F+ + ++K + +N +IV+SG G
Sbjct: 23 LNIAASKKLIQRIVPQHYQQFEVKYLTKASEEDRFEIESKNG---------KIVLSGNNG 73
Query: 112 VEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNA 171
V V + L++YL+ +C I+W+ + +MPK+ P +++ + P L YY N
Sbjct: 74 VAVASALNYYLKEYCNFEITWNGSN----RNMPKV--LPVIKEK-ISKTSPHELRYYLNY 126
Query: 172 VTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
T SY+ AWWDWKRWE+EIDWMAL GIN+PLA TGQE IWQKV++
Sbjct: 127 CTFSYSMAWWDWKRWEEEIDWMALNGINMPLAITGQEAIWQKVYK 171
>gi|449681189|ref|XP_004209763.1| PREDICTED: alpha-N-acetylglucosaminidase-like, partial [Hydra
magnipapillata]
Length = 220
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 66/172 (38%), Positives = 97/172 (56%), Gaps = 12/172 (6%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEI 104
R ++ PS Q + +++RL+P+H S F I +K E+ ++ I
Sbjct: 41 RTKSSPSEQKESVIQLIKRLVPAHASKFIISI-NKNYVDSEFADYFEIVSTT----NGNI 95
Query: 105 VISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIP 164
++G TGV AG + YL+YWC +HISW ++P + S + VF +R
Sbjct: 96 KVTGSTGVAAAAGFYHYLKYWCFAHISWSGNHLNIPINLPLVHSPVK----KVFYER--- 148
Query: 165 LNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YYQN T SY+ +W+W RWE+EIDWMA+ GIN PLAFTGQE++WQ V++
Sbjct: 149 FRYYQNVCTVSYSMVFWNWTRWEQEIDWMAMNGINFPLAFTGQESVWQIVYK 200
>gi|348533253|ref|XP_003454120.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Oreochromis
niloticus]
Length = 845
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 67/173 (38%), Positives = 91/173 (52%), Gaps = 13/173 (7%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEI 104
R RA Q A +L+RLL + + F + LR+ ++ +
Sbjct: 126 RPRASDRAQGRAVTELLRRLLGNRSAEFIVSVNGSLSNDSLDVCELRSTKNN------RV 179
Query: 105 VISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIP 164
V +G TGV V +G++ YL+Y+C H+SW +PKL R+ P
Sbjct: 180 VATGSTGVAVASGIYNYLKYFCNCHVSWSGNQLDLPRPLPKLTGVLRI-------STPHR 232
Query: 165 LNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQV 217
YYQN T SY+ WWDW RWE+EIDWMAL GINLPLAFTGQE +WQ+V++
Sbjct: 233 FRYYQNVCTFSYSSVWWDWPRWEREIDWMALNGINLPLAFTGQEALWQEVYRA 285
>gi|440799253|gb|ELR20308.1| alpha-N-acetylglucosaminidase family protein [Acanthamoeba
castellanii str. Neff]
Length = 854
Score = 120 bits (300), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 63/174 (36%), Positives = 91/174 (52%), Gaps = 4/174 (2%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHY-SAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPE 103
R PP VQ+ A ++ RLL + Y FQ I+ + + G +
Sbjct: 47 RSATPPEVQVQAVQGLIGRLLGASYVPLFQLSILPSPYASSSTDHDYYTLSTRTVGGVTQ 106
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ I G +GV + +GLHWYL+Y C +SW G ++P P + + ++ P+
Sbjct: 107 VQIQGSSGVALASGLHWYLKYSCNCSVSWGVAGSGNNLALPS--RLPALSKP-LAMQSPV 163
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQV 217
YY N T SY+ WW W RWE+EIDWMAL GINLPL+ TGQE ++ +VF+
Sbjct: 164 KYRYYLNVCTHSYSSVWWQWDRWEQEIDWMALHGINLPLSSTGQEYVFAEVFKA 217
>gi|340717403|ref|XP_003397173.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bombus terrestris]
Length = 770
Score = 119 bits (299), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 70/173 (40%), Positives = 95/173 (54%), Gaps = 18/173 (10%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCG--GEYCFILRNHPSSYIRGTP 102
+ R P VQ AA V +RLL +A +F ++ G G F+++ +
Sbjct: 37 KPRTSPEVQAEAARGVAERLLGRERAA-KFLMVVNPDLGPTGLDTFLIKKN------SLD 89
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++ I G +GV V GLH+YL+ +C HISW+ G Q+ +P+ + D V V
Sbjct: 90 QVEIFGSSGVAVAWGLHYYLKTYCNVHISWE---GNQI-ELPQT-----LPDVRVKVTSN 140
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN T YTFAWW W +WE+ IDWMAL GINL LAF GQE IWQ+V+
Sbjct: 141 DRFRYYQNVCTLGYTFAWWQWDQWERNIDWMALNGINLALAFNGQEAIWQRVY 193
>gi|350407422|ref|XP_003488083.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bombus impatiens]
Length = 770
Score = 119 bits (298), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 70/173 (40%), Positives = 94/173 (54%), Gaps = 18/173 (10%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCG--GEYCFILRNHPSSYIRGTP 102
+ + P VQ AA V +RLL H A +F ++ G G F+++ +
Sbjct: 37 KPQTSPEVQAEAARGVAERLL-GHERAAKFLMVVNPDLGPTGLDTFLIKKN------SLD 89
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++ I G +GV V GLH+YL+ +C HISW+ G Q+ +P+ + D V V
Sbjct: 90 QVEIFGSSGVAVAWGLHYYLKTYCNVHISWE---GNQI-ELPQT-----LPDVRVKVTSN 140
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN T YTFAWW W WE+ IDWMAL GINL LAF GQE IWQ+V+
Sbjct: 141 DRFRYYQNVCTLGYTFAWWQWDHWERNIDWMALNGINLALAFNGQEAIWQRVY 193
>gi|390348210|ref|XP_785272.3| PREDICTED: alpha-N-acetylglucosaminidase [Strongylocentrotus
purpuratus]
Length = 793
Score = 119 bits (298), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 79/205 (38%), Positives = 111/205 (54%), Gaps = 21/205 (10%)
Query: 11 ILFFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYS 70
I+ FIT +TL +S G + + L Q A S Q +AA V++RLLP+ Y
Sbjct: 12 IVAFITALTL-----GHASANGFENELKHLKPQ----ASASTQTSAAQEVIKRLLPTRYQ 62
Query: 71 AFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHI 130
F+ +++ G + + + T ++ V GV+ YL ++C HI
Sbjct: 63 NFKV-VVNPAPSGDNLDTFEISADGTTVNVTGTTGVAAVWGVQH------YLVHYCNCHI 115
Query: 131 SWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEI 190
SW+ G Q+ +P G +P + + V P +YQN T+SYTFAWWDW+RWE+ I
Sbjct: 116 SWN---GDQLY-LPPDGQWPVIHPP-LKVTSPNRFRFYQNVCTASYTFAWWDWERWERHI 170
Query: 191 DWMALQGINLPLAFTGQETIWQKVF 215
DWMAL GINLPLAF GQE IWQKV+
Sbjct: 171 DWMALSGINLPLAFNGQEAIWQKVY 195
>gi|301626955|ref|XP_002942650.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Xenopus (Silurana)
tropicalis]
Length = 759
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 70/165 (42%), Positives = 89/165 (53%), Gaps = 13/165 (7%)
Query: 51 SVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVT 110
VQ AA +L RLL S F + S GG + L SS GT +++ G +
Sbjct: 36 DVQTAAVRELLARLLGGRASDFSVTVNSSLAEGGRDTYRL----SSGAGGT--VLVVGSS 89
Query: 111 GVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQN 170
GV +G + YL+ +CG+H+SW +G + + P YYQN
Sbjct: 90 GVAAASGCYTYLKAFCGAHLSW--SGAQLQLPPALPPVPAPLTRSA-----PHRFRYYQN 142
Query: 171 AVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
TSSY+F WWDW RWEKEIDWMAL GIN+PLAFTGQE IW KV+
Sbjct: 143 VCTSSYSFVWWDWARWEKEIDWMALSGINMPLAFTGQEAIWYKVY 187
>gi|390353486|ref|XP_003728120.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Strongylocentrotus
purpuratus]
Length = 385
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 82/205 (40%), Positives = 112/205 (54%), Gaps = 22/205 (10%)
Query: 13 FFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAF 72
FIT + +A A T G + + L + +A S Q +AA +++RLLP+ Y
Sbjct: 9 LFITAVIHGHVADA---TTGFEKELKHL----KPQASASTQTSAAQELIKRLLPTRYQ-- 59
Query: 73 QFRIISKKQCGGEYCFILRNHPSSY--IRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHI 130
F + G + +H S+ + ++G TGV + G+ YL ++C HI
Sbjct: 60 HFTVTVNPAIGSD------DHLDSFEISADGTTVQVTGTTGVATVWGVQHYLVHYCNCHI 113
Query: 131 SWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEI 190
SWD G Q+ +P G +P +Q + V P YYQN T SYTFAWWDW+RWE+ I
Sbjct: 114 SWD---GDQL-YLPPDGQWPVIQPL-LRVTSPNRFRYYQNVCTVSYTFAWWDWERWERHI 168
Query: 191 DWMALQGINLPLAFTGQETIWQKVF 215
DWMAL GINLPLAF GQE IWQKV+
Sbjct: 169 DWMALSGINLPLAFNGQEAIWQKVY 193
>gi|440800773|gb|ELR21808.1| AlphaN-acetylglucosaminidase (NAGLU) [Acanthamoeba castellanii str.
Neff]
Length = 800
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 61/162 (37%), Positives = 92/162 (56%), Gaps = 14/162 (8%)
Query: 56 AAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVL 115
A +V+ + +H +F F ++ + G F L P+ + G+ GV +
Sbjct: 29 GATSVVDPQVEAHVDSFDFGLVPPQPHSGNDLFQLL--PTHTVNGS---------GVALA 77
Query: 116 AGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSS 175
+GL+WYL+Y+C + ++W + G +P P+V A V + P+ YY N T S
Sbjct: 78 SGLYWYLKYYCNASVTWGENGTGDNLVLPD--PLPQVATA-VVMNAPVRWRYYLNVCTVS 134
Query: 176 YTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQV 217
Y+ AWW+W RWE+EIDWMAL GINLPLAFTGQE +W +V++
Sbjct: 135 YSSAWWNWTRWEREIDWMALHGINLPLAFTGQELVWTEVWKA 176
>gi|326679829|ref|XP_688608.3| PREDICTED: alpha-N-acetylglucosaminidase-like [Danio rerio]
Length = 757
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 68/172 (39%), Positives = 95/172 (55%), Gaps = 15/172 (8%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEI 104
R +A Q A +L+RLL + F + G LR+ ++ ++
Sbjct: 32 RAKASDKSQSRAVLDLLRRLLGNRAREFIVSVNRTLSADGLDVCELRSAKNN------KV 85
Query: 105 VISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVF-VKRPI 163
V G TGV V G++ YL+Y+C H+SW G Q+ ++P+ P GV + P
Sbjct: 86 VAVGSTGVAVATGIYNYLKYFCNCHVSW---AGDQL-NLPR----PLPALTGVLRISTPH 137
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN T+SY+ WWDW RW++EIDWMAL GINLPLAFTGQE +WQ+V+
Sbjct: 138 RFRYYQNVCTASYSSVWWDWPRWQREIDWMALNGINLPLAFTGQEVLWQEVY 189
>gi|326435733|gb|EGD81303.1| alpha-N-acetylglucosaminidase [Salpingoeca sp. ATCC 50818]
Length = 696
Score = 117 bits (292), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 62/162 (38%), Positives = 92/162 (56%), Gaps = 7/162 (4%)
Query: 55 AAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEV 114
AA V +RL + F+F +I+ CF L N ++ + I G + VE+
Sbjct: 63 AAQALVRRRLGSDALNHFEFYLINTTH-DETLCFELANSNTN----ASVVTIGGTSPVEM 117
Query: 115 LAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTS 174
GL+++LR + SW+KTGG + +P + P V V + YYQN VT
Sbjct: 118 ATGLNFFLREYLHMSFSWNKTGGDVIHPLP--ATMPVVHSHHAHVNKTTFYTYYQNVVTV 175
Query: 175 SYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
SY+ WWDW+RWE+EIDWMA+ G+NL LA+TGQE +++KV++
Sbjct: 176 SYSMVWWDWQRWEEEIDWMAMNGVNLALAYTGQEYVYRKVYE 217
>gi|322792330|gb|EFZ16314.1| hypothetical protein SINV_06335 [Solenopsis invicta]
Length = 187
Score = 116 bits (290), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 67/172 (38%), Positives = 93/172 (54%), Gaps = 14/172 (8%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEI 104
+ R P +Q AA V +RLL S F +I G I ++ +I
Sbjct: 7 KSRTSPEIQTKAAKDVAERLLGKETSKL-FIMILDTDLGP----IDKDTFEITKNSLGKI 61
Query: 105 VISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIP 164
I G +GV + GL++YL+++C H+SWD G Q+ S+P + RV+ +
Sbjct: 62 EIRGTSGVAITWGLNYYLKHYCNVHVSWD---GTQI-SLPNTLPYVRVK-----ITSNDR 112
Query: 165 LNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YYQN T Y+ AWW W++WEK IDWMAL GINL LAFT QE IWQ+++Q
Sbjct: 113 FRYYQNVCTVGYSSAWWQWRQWEKSIDWMALNGINLALAFTAQEAIWQRLYQ 164
>gi|307168312|gb|EFN61518.1| Alpha-N-acetylglucosaminidase [Camponotus floridanus]
Length = 737
Score = 116 bits (290), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 68/174 (39%), Positives = 93/174 (53%), Gaps = 18/174 (10%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCG--GEYCFILRNHPSSYIRGTP 102
R P +Q AA V +RLL S F ++ G + F + +P
Sbjct: 40 RSHTSPEIQAKAAKDVAERLLGKEISKL-FIMMVDTDLGPIDKDTFKVTKNPLG------ 92
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
EI I G +GV + GLH+YL+ +C HISWD G Q+ ++P + P V+ + +
Sbjct: 93 EIEIRGTSGVAITWGLHYYLKNYCNVHISWD---GTQI-TLPN--TLPEVR---IKITSN 143
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YYQN T Y+ WW WK+WE+ IDWMAL GINL LAFT QE IWQ+++Q
Sbjct: 144 DRFRYYQNVCTVGYSSTWWQWKQWERNIDWMALNGINLALAFTAQEAIWQRLYQ 197
>gi|328778968|ref|XP_623833.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Apis mellifera]
Length = 752
Score = 116 bits (290), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 72/191 (37%), Positives = 99/191 (51%), Gaps = 21/191 (10%)
Query: 26 AQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLL-PSHYSAFQFRIISKKQCGG 84
++ S++ V +S + R R Q AA V++RLL P F + G
Sbjct: 3 SRPSSLSVNTLSHI-----RPRISSEEQANAAAGVVERLLGPERAKNFVMMVNPNFTSPG 57
Query: 85 EYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMP 144
+ F+++ + ++ I G +GV GLH+YL+ +C HISW+ G QV +P
Sbjct: 58 KDSFLIKKNSMG------QVEILGTSGVVAAWGLHYYLKTYCNVHISWE---GNQV-ELP 107
Query: 145 KLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAF 204
+ + D V + YYQN T YT AWW W+ WEK IDWMAL GINL LAF
Sbjct: 108 DI-----LPDVRVKISSNDRFRYYQNVCTLGYTSAWWQWEDWEKNIDWMALNGINLALAF 162
Query: 205 TGQETIWQKVF 215
TGQE IWQKV+
Sbjct: 163 TGQEAIWQKVY 173
>gi|410930376|ref|XP_003978574.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Takifugu rubripes]
Length = 751
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 54/114 (47%), Positives = 72/114 (63%), Gaps = 7/114 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+IV SG +GV V +G++ Y++Y+C H+SW +PKL R+ P
Sbjct: 84 KIVASGSSGVAVASGIYNYMKYFCNCHVSWSGDQLDLPRPLPKLTGVLRINT-------P 136
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YYQN T SY+ WW+W RWE+EIDWMAL GINLPLAFTGQE +WQ+V++
Sbjct: 137 HRFRYYQNVCTFSYSSVWWNWPRWEREIDWMALNGINLPLAFTGQEALWQEVYR 190
>gi|198433857|ref|XP_002122480.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 880
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 55/117 (47%), Positives = 72/117 (61%), Gaps = 5/117 (4%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ I+G V V + YL+Y+C ISW G Q++ +P P V GV +K +
Sbjct: 80 LKITGSDAVSVAFAFNHYLKYYCRKQISW---AGDQISDIPN--PLPPVPAEGVTIKAGV 134
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVAFC 220
YYQN T SY+ WW+W RWE+EIDWMAL GINLPLAFTGQE IW++V++ C
Sbjct: 135 KYRYYQNVCTVSYSSVWWNWTRWEREIDWMALNGINLPLAFTGQEAIWERVYKKLGC 191
>gi|443691318|gb|ELT93213.1| hypothetical protein CAPTEDRAFT_144379, partial [Capitella teleta]
Length = 718
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 64/172 (37%), Positives = 88/172 (51%), Gaps = 14/172 (8%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRI-ISKKQCGGEYCFILRNHPSSYIRGTPE 103
R A Q A +++RL+ F+ I + F L N + +
Sbjct: 5 RAEADVDTQAEATRDLIRRLIDDRADEFEVDIQLDFNHADDRDIFKLANIGA-------K 57
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ I+ TGV G ++YLR +CG ++W GVQ +P + P + G+ +
Sbjct: 58 VHITASTGVAAAWGFYYYLREYCGCQVTW---AGVQT-ELPAV--LPSLPPTGITITAND 111
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN T SY+F WWDWKRWE+EIDWMAL INLPLAF QE IWQ+V+
Sbjct: 112 RFRYYQNVCTVSYSFVWWDWKRWEREIDWMALHSINLPLAFNAQEAIWQRVY 163
>gi|380030624|ref|XP_003698943.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
[Apis florea]
Length = 769
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 74/211 (35%), Positives = 104/211 (49%), Gaps = 19/211 (9%)
Query: 14 FITTITLSTLAVAQSSTIGVQYISRLLDIQD--RERAPPSVQLAAAYAVLQRLL-PSHYS 70
I ++ +S + ++ S T+ + + + D R + Q AA V++RLL P
Sbjct: 2 LIYSLLVSIIFLSGSLTVSCKNSNVFQNTLDHIRPQISSEEQANAAAGVVERLLGPERAK 61
Query: 71 AFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHI 130
F + G+ F ++ + +I I G +GV GLH+YL+ +C HI
Sbjct: 62 NFVMMVNPNFTSPGKDTFSIKKN------SVGQIEILGTSGVAAAWGLHYYLKTYCNVHI 115
Query: 131 SWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEI 190
SW+ T ++P D V + YYQN T YT AWW W+ WEK I
Sbjct: 116 SWEGTKXQLPDTLP---------DVRVKISSNDRFRYYQNVCTLGYTSAWWQWEDWEKNI 166
Query: 191 DWMALQGINLPLAFTGQETIWQKVF-QVAFC 220
DWMAL GINL LAFTGQE IWQKV+ Q+ F
Sbjct: 167 DWMALNGINLALAFTGQEAIWQKVYLQLNFT 197
>gi|321472423|gb|EFX83393.1| hypothetical protein DAPPUDRAFT_301977 [Daphnia pulex]
Length = 799
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 66/174 (37%), Positives = 88/174 (50%), Gaps = 20/174 (11%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPE- 103
R R +Q A +++RLLP S F ++ E + + + +PE
Sbjct: 31 RTRTSVDIQAQAVRDLIERLLPERASEFNVKV--------EPGLLEEENGYFQVIKSPEN 82
Query: 104 --IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKR 161
+ + G TGV G H+YL C +SWD +P + P V V +
Sbjct: 83 TTVFLQGSTGVMAAWGFHYYLTQLCHCQVSWD----ADQLDLPVI--LPEV---SVRITS 133
Query: 162 PIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN T SY+FAWW W RWE+EIDWMA+ GINLPLAFTGQE IWQ+V+
Sbjct: 134 LDKFRYYQNTCTFSYSFAWWKWPRWEREIDWMAMNGINLPLAFTGQEIIWQRVY 187
>gi|73965663|ref|XP_548088.2| PREDICTED: alpha-N-acetylglucosaminidase [Canis lupus familiaris]
Length = 747
Score = 114 bits (284), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 55/113 (48%), Positives = 71/113 (62%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+++ G TGV AGLH YLR +CG H++W G Q+ L + P+V P
Sbjct: 75 RVLVLGSTGVAAAAGLHRYLRDFCGCHVAWS---GSQLRLPEPLPAVPQVLTEAT----P 127
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN T SY+F WWDW RWE+E+DWMAL GINL LA++GQE IWQ+V+
Sbjct: 128 NRYRYYQNVCTHSYSFVWWDWARWERELDWMALNGINLALAWSGQEAIWQRVY 180
>gi|109491871|ref|XP_001081442.1| PREDICTED: alpha-N-acetylglucosaminidase [Rattus norvegicus]
gi|392351622|ref|XP_002727861.2| PREDICTED: alpha-N-acetylglucosaminidase [Rattus norvegicus]
gi|149054262|gb|EDM06079.1| rCG33377, isoform CRA_b [Rattus norvegicus]
Length = 739
Score = 114 bits (284), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 72/116 (62%), Gaps = 8/116 (6%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G P +++ G +GV AGLH YLR +CG HI+W + +P P V D G+
Sbjct: 70 GVP-VLVRGSSGVAAAAGLHRYLRDFCGCHIAWSSS----QLHLPS--PLPAVPD-GLTE 121
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P YYQN T SY+F WWDW RWE+EIDWMAL GINL LA+ GQE IWQ+V+
Sbjct: 122 ATPNRYRYYQNVCTHSYSFVWWDWARWEQEIDWMALNGINLALAWNGQEAIWQRVY 177
>gi|320162905|gb|EFW39804.1| lysosomal alpha-N-acetyl glucosaminidase [Capsaspora owczarzaki
ATCC 30864]
Length = 786
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 69/230 (30%), Positives = 122/230 (53%), Gaps = 32/230 (13%)
Query: 5 FRSVSLILFFITTITLSTLAVAQSSTIGVQYIS---------------RLLDIQDRE--R 47
R++ L+L T + +++LA A + T V + + R +D +E R
Sbjct: 1 MRNLMLLLVAATVVCVASLASALAPTSFVVHETFLSAHWQDATTEAKQRFVDQASQEAIR 60
Query: 48 APPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVIS 107
P VQ+ AA ++ R++P+H F +I G E+ F + ++ ++ I +
Sbjct: 61 DIPDVQVLAADLLIHRIIPNHDKDFVLSLIPLAN-GTEHVFEVESNSTA-------IFLR 112
Query: 108 GVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNY 167
G +GV + + ++YL+Y C + +SW ++P++ + +V+ +F NY
Sbjct: 113 GSSGVALSSAFNYYLKYVCNASLSWSGDQLNIPTTLPRVPA--KVRSVSLF-----EWNY 165
Query: 168 YQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQV 217
Y N T SY+ WW++ RWE+EIDWMAL G+ +PLAFTGQE +W+++F +
Sbjct: 166 YMNVCTVSYSSVWWEFARWEREIDWMALNGVTMPLAFTGQEYVWRRLFHL 215
>gi|307192254|gb|EFN75548.1| Alpha-N-acetylglucosaminidase [Harpegnathos saltator]
Length = 741
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 68/168 (40%), Positives = 94/168 (55%), Gaps = 18/168 (10%)
Query: 50 PSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCG--GEYCFILRNHPSSYIRGTPEIVIS 107
P VQ AA V++RLL + +A F + G G+ F + +P I I
Sbjct: 18 PEVQTEAAKDVVRRLL-GNKTAELFVVAVDNDLGPAGKDTFKVTKNPLG------NIEIR 70
Query: 108 GVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNY 167
G +GV + GL++YL+ +C H+SWD G Q+ S+P + P V+ + + Y
Sbjct: 71 GTSGVAITWGLNYYLKNYCNVHVSWD---GTQI-SLPNI--LPEVR---IKITSNDRFRY 121
Query: 168 YQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YQN T Y+ AWW W++WEK IDWMAL GINL LAFT QE IWQ++F
Sbjct: 122 YQNVCTVGYSSAWWQWQQWEKNIDWMALNGINLALAFTAQEAIWQRLF 169
>gi|344285558|ref|XP_003414528.1| PREDICTED: alpha-N-acetylglucosaminidase [Loxodonta africana]
Length = 744
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 68/163 (41%), Positives = 92/163 (56%), Gaps = 10/163 (6%)
Query: 53 QLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGV 112
+ AA A+L RLL +A F + ++ E L + S + ++G TGV
Sbjct: 28 EAAAVQALLARLLGPRPAA-DFSVSVERSLAAESG--LDTYRLSGGGAGARVRVAGSTGV 84
Query: 113 EVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAV 172
AGLH YLR +CG H++W G Q+ +P+ P V + + P YYQN
Sbjct: 85 AAAAGLHRYLRDFCGCHVAWS---GSQL-RLPR--PLPAVPEE-LTEATPNRYRYYQNVC 137
Query: 173 TSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 138 TQSYSFVWWDWARWEREIDWMALNGINLALAWSGQEAIWQRVY 180
>gi|149054263|gb|EDM06080.1| rCG33377, isoform CRA_c [Rattus norvegicus]
Length = 239
Score = 113 bits (282), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 72/116 (62%), Gaps = 8/116 (6%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G P +++ G +GV AGLH YLR +CG HI+W + +P P V D G+
Sbjct: 70 GVP-VLVRGSSGVAAAAGLHRYLRDFCGCHIAWSSS----QLHLPS--PLPAVPD-GLTE 121
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P YYQN T SY+F WWDW RWE+EIDWMAL GINL LA+ GQE IWQ+V+
Sbjct: 122 ATPNRYRYYQNVCTHSYSFVWWDWARWEQEIDWMALNGINLALAWNGQEAIWQRVY 177
>gi|402900329|ref|XP_003913130.1| PREDICTED: alpha-N-acetylglucosaminidase [Papio anubis]
Length = 743
Score = 112 bits (280), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 56/116 (48%), Positives = 70/116 (60%), Gaps = 7/116 (6%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G + + G TGV AGLH YLR +CG H++W G Q+ L + P +
Sbjct: 71 GAARVRVRGSTGVAAAAGLHRYLRDFCGCHVAWS---GSQLRLPRPLPAVP----GELTE 123
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 124 ATPNRYRYYQNVCTQSYSFVWWDWARWEREIDWMALNGINLALAWSGQEAIWQRVY 179
>gi|297701096|ref|XP_002827555.1| PREDICTED: alpha-N-acetylglucosaminidase [Pongo abelii]
Length = 836
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/116 (48%), Positives = 70/116 (60%), Gaps = 7/116 (6%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G + + G TGV AGLH YLR +CG H++W G Q+ L + P +
Sbjct: 164 GAARVRVRGSTGVAAAAGLHRYLRDFCGCHVAWS---GSQLRLPRPLPAVP----GELTE 216
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 217 ATPNRYRYYQNVCTQSYSFVWWDWARWEREIDWMALNGINLALAWSGQEAIWQRVY 272
>gi|297273081|ref|XP_001095618.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Macaca mulatta]
Length = 691
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/116 (48%), Positives = 70/116 (60%), Gaps = 7/116 (6%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G + + G TGV AGLH YLR +CG H++W G Q+ L + P +
Sbjct: 71 GAARVRVRGSTGVAAAAGLHRYLRDFCGCHVAWS---GSQLRLPRPLPAVP----GELTE 123
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 124 ATPNRYRYYQNVCTQSYSFVWWDWARWEREIDWMALNGINLALAWSGQEAIWQRVY 179
>gi|426348060|ref|XP_004041658.1| PREDICTED: alpha-N-acetylglucosaminidase [Gorilla gorilla gorilla]
Length = 743
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/116 (48%), Positives = 70/116 (60%), Gaps = 7/116 (6%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G + + G TGV AGLH YLR +CG H++W G Q+ L + P +
Sbjct: 71 GAARVRVRGSTGVAAAAGLHRYLRDFCGCHVAWS---GSQLRLPRPLPAVP----GELTE 123
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 124 ATPNRYRYYQNVCTQSYSFVWWDWARWEREIDWMALNGINLALAWSGQEAIWQRVY 179
>gi|114667172|ref|XP_523654.2| PREDICTED: alpha-N-acetylglucosaminidase isoform 2 [Pan
troglodytes]
gi|410216584|gb|JAA05511.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
gi|410258938|gb|JAA17435.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
gi|410304442|gb|JAA30821.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
gi|410337929|gb|JAA37911.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
Length = 743
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/116 (48%), Positives = 70/116 (60%), Gaps = 7/116 (6%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G + + G TGV AGLH YLR +CG H++W G Q+ L + P +
Sbjct: 71 GAARVRVRGSTGVAAAAGLHRYLRDFCGCHVAWS---GSQLRLPRPLPAVP----GELTE 123
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 124 ATPNRYRYYQNVCTQSYSFVWWDWARWEREIDWMALNGINLALAWSGQEAIWQRVY 179
>gi|1171229|gb|AAC50512.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|1171231|gb|AAC50513.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|1197840|gb|AAB06188.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|1479981|gb|AAB36604.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|32450702|gb|AAH53991.1| N-acetylglucosaminidase, alpha- [Homo sapiens]
gi|119581237|gb|EAW60833.1| N-acetylglucosaminidase, alpha- (Sanfilippo disease IIIB), isoform
CRA_b [Homo sapiens]
Length = 743
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/116 (48%), Positives = 70/116 (60%), Gaps = 7/116 (6%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G + + G TGV AGLH YLR +CG H++W G Q+ L + P +
Sbjct: 71 GAARVRVRGSTGVAAAAGLHRYLRDFCGCHVAWS---GSQLRLPRPLPAVP----GELTE 123
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 124 ATPNRYRYYQNVCTQSYSFVWWDWARWEREIDWMALNGINLALAWSGQEAIWQRVY 179
>gi|66346698|ref|NP_000254.2| alpha-N-acetylglucosaminidase precursor [Homo sapiens]
gi|317373322|sp|P54802.2|ANAG_HUMAN RecName: Full=Alpha-N-acetylglucosaminidase; AltName:
Full=N-acetyl-alpha-glucosaminidase; Short=NAG;
Contains: RecName: Full=Alpha-N-acetylglucosaminidase 82
kDa form; Contains: RecName:
Full=Alpha-N-acetylglucosaminidase 77 kDa form; Flags:
Precursor
Length = 743
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/116 (48%), Positives = 70/116 (60%), Gaps = 7/116 (6%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G + + G TGV AGLH YLR +CG H++W G Q+ L + P +
Sbjct: 71 GAARVRVRGSTGVAAAAGLHRYLRDFCGCHVAWS---GSQLRLPRPLPAVP----GELTE 123
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 124 ATPNRYRYYQNVCTQSYSFVWWDWARWEREIDWMALNGINLALAWSGQEAIWQRVY 179
>gi|262406058|ref|ZP_06082608.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|294806855|ref|ZP_06765680.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
gi|345510563|ref|ZP_08790130.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
gi|262356933|gb|EEZ06023.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|294445884|gb|EFG14526.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
gi|345454460|gb|EEO49066.2| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
Length = 727
Score = 112 bits (279), Expect = 1e-22, Method: Composition-based stats.
Identities = 65/181 (35%), Positives = 97/181 (53%), Gaps = 24/181 (13%)
Query: 35 YISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHP 94
++S +L +D++ +A A A+L+RLLPS+ +FQF +K G + CF + +
Sbjct: 14 FMSLVLQAKDKD-------VAVAEALLKRLLPSYIESFQF----QKLKGEKDCFTIESVK 62
Query: 95 SSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQD 154
+IVI G + GL+ YL+Y+C + +SW V++ P V D
Sbjct: 63 D-------KIVIGGNNANSMAMGLNHYLKYYCLTTVSWYADIAVEIPE-----ELPMV-D 109
Query: 155 AGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
V + + ++ N T YT WW WK WE+ IDWMAL GIN+PLA TGQE +W KV
Sbjct: 110 EKVVSEARVDTRFFLNYCTYGYTMPWWQWKEWERFIDWMALNGINMPLAITGQEAVWYKV 169
Query: 215 F 215
+
Sbjct: 170 W 170
>gi|336412606|ref|ZP_08592959.1| hypothetical protein HMPREF1017_00067 [Bacteroides ovatus
3_8_47FAA]
gi|335942652|gb|EGN04494.1| hypothetical protein HMPREF1017_00067 [Bacteroides ovatus
3_8_47FAA]
Length = 727
Score = 112 bits (279), Expect = 1e-22, Method: Composition-based stats.
Identities = 65/181 (35%), Positives = 97/181 (53%), Gaps = 24/181 (13%)
Query: 35 YISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHP 94
++S +L +D++ +A A A+L+RLLPS+ +FQF +K G + CF + +
Sbjct: 14 FMSLVLQAKDKD-------VAVAEALLKRLLPSYIESFQF----QKLKGEKDCFTIESVK 62
Query: 95 SSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQD 154
+IVI G + GL+ YL+Y+C + +SW V++ P V D
Sbjct: 63 D-------KIVIGGNNANSMAMGLNHYLKYYCLTTVSWYADIAVEIPE-----ELPMV-D 109
Query: 155 AGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
V + + ++ N T YT WW WK WE+ IDWMAL GIN+PLA TGQE +W KV
Sbjct: 110 EKVVSEARVDTRFFLNYCTYGYTMPWWQWKEWERFIDWMALNGINMPLAITGQEAVWYKV 169
Query: 215 F 215
+
Sbjct: 170 W 170
>gi|326437768|gb|EGD83338.1| lysosomal alpha-N-acetyl glucosaminidase [Salpingoeca sp. ATCC
50818]
Length = 820
Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 69/178 (38%), Positives = 95/178 (53%), Gaps = 21/178 (11%)
Query: 48 APPSVQLAA--------AYAVLQRLLPSHYS-AFQFRIISKKQCGGEYCFILRNHPSSYI 98
APP V L+A A V +R+L + F+ I G F + +
Sbjct: 24 APPKVSLSAPVHDAVAGAEGVARRVLGEEVALNIVFQEIPPNVADGHDVFEIET-----V 78
Query: 99 RGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWD-KTGGVQVASMPKLGSFPRVQDAGV 157
G ++VI G T V + +GL+WYLRY+C S ++W G Q+ +L P V+
Sbjct: 79 DG--QLVIRGNTAVAMSSGLYWYLRYYCNSQVTWGVNDSGRQINVSAQL---PVVKPMVR 133
Query: 158 FVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
V + + YY N T SY+ AWW+W RWEKE+DWMAL G+NLPLAFTGQE IW + +
Sbjct: 134 KVNQ-VRFRYYMNVCTVSYSMAWWNWTRWEKELDWMALHGVNLPLAFTGQEYIWYEFY 190
>gi|332260899|ref|XP_003279518.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
[Nomascus leucogenys]
Length = 736
Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/115 (48%), Positives = 69/115 (60%), Gaps = 7/115 (6%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G + + G TGV AGLH YLR +CG H++W G Q+ L + P +
Sbjct: 71 GAARVRVRGSTGVAAAAGLHRYLRDFCGCHVAWS---GSQLRLPRPLPAVP----GELTE 123
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
P YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V
Sbjct: 124 ATPNRYRYYQNVCTQSYSFVWWDWARWEREIDWMALNGINLALAWSGQEAIWQRV 178
>gi|383856382|ref|XP_003703688.1| PREDICTED: alpha-N-acetylglucosaminidase [Megachile rotundata]
Length = 744
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/173 (37%), Positives = 89/173 (51%), Gaps = 18/173 (10%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCG--GEYCFILRNHPSSYIRGTP 102
+ + P Q AA V +RLL + F + CG G+ F++ S+ I+
Sbjct: 13 KPQVAPKAQAVAAKGVAERLLGVERARL-FVMDVDPDCGPVGKDTFLIEKTSSNQIK--- 68
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
I G +GV V GLH+YL+++C +HISW+ +++P D V V
Sbjct: 69 ---IVGTSGVAVSWGLHYYLKHYCNAHISWEGNQLHLPSTLP---------DVHVKVTSN 116
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN + YT WW W WEK IDWMAL G NL LAFTGQE IW++V+
Sbjct: 117 DRFRYYQNVCSFGYTSVWWQWDDWEKSIDWMALNGYNLALAFTGQEAIWERVY 169
>gi|410100551|ref|ZP_11295511.1| hypothetical protein HMPREF1076_04689 [Parabacteroides goldsteinii
CL02T12C30]
gi|409215586|gb|EKN08585.1| hypothetical protein HMPREF1076_04689 [Parabacteroides goldsteinii
CL02T12C30]
Length = 739
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 93/163 (57%), Gaps = 20/163 (12%)
Query: 56 AAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVL 115
A+ A+++R++P S+F+ + K G+ CF + + + +I++ G GV V
Sbjct: 26 ASEALIKRVIPEKASSFKVEMFQPKD--GKDCFEIESKGN-------KIILRGNNGVSVA 76
Query: 116 AGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNY--YQNAVT 173
+ L+ YL+Y+C +HISW+ G + +L P + RP ++ Y N T
Sbjct: 77 SALYHYLKYYCNAHISWN---GDNLNLPSRLPEVPEK------IIRPTAFDHRVYLNYCT 127
Query: 174 SSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
SYT AWW+W+RW++EIDWMA+ GIN+PLA GQE +WQ +
Sbjct: 128 ISYTMAWWNWERWQREIDWMAMHGINMPLAVIGQEAVWQNTLR 170
>gi|291406137|ref|XP_002719212.1| PREDICTED: alpha-N-acetylglucosaminidase [Oryctolagus cuniculus]
Length = 743
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 55/116 (47%), Positives = 69/116 (59%), Gaps = 7/116 (6%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G + + G +GV AGLH YLR +CG H++W G Q+ L + P
Sbjct: 71 GAARVRVRGSSGVAAAAGLHRYLRDFCGCHVAWS---GSQLHLPHPLPAVPEELTEAT-- 125
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 126 --PHRYRYYQNVCTQSYSFVWWDWARWEQEIDWMALNGINLALAWSGQEAIWQRVY 179
>gi|156399499|ref|XP_001638539.1| predicted protein [Nematostella vectensis]
gi|156225660|gb|EDO46476.1| predicted protein [Nematostella vectensis]
Length = 675
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 59/119 (49%), Positives = 75/119 (63%), Gaps = 7/119 (5%)
Query: 97 YIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAG 156
Y ++VI+G +GV L +L+ +C +HISW G Q+ +P FP V D
Sbjct: 6 YKTNDSKLVITGTSGVACAMALQHFLKAFCFAHISW---SGDQL-KIPT--PFPTVLDE- 58
Query: 157 VFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
V V P YYQN T SY+F WW+W RW++EIDWMAL GINLPLAFTGQE IWQ+V+
Sbjct: 59 VSVGIPYRFRYYQNVCTPSYSFVWWNWTRWQREIDWMALNGINLPLAFTGQEAIWQRVY 117
>gi|196001339|ref|XP_002110537.1| hypothetical protein TRIADDRAFT_54660 [Trichoplax adhaerens]
gi|190586488|gb|EDV26541.1| hypothetical protein TRIADDRAFT_54660 [Trichoplax adhaerens]
Length = 757
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 53/114 (46%), Positives = 71/114 (62%), Gaps = 7/114 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+I+I+G +GV G+H YL+Y+C H+SW ++PKL + V+
Sbjct: 87 QILITGSSGVAAAFGVHHYLKYYCKVHLSWTGDRLKLPQTLPKLSQ-------KIHVEIN 139
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YY NA T SY+ WWDW+RW+KEIDWMAL GIN+PLA TGQE IW KV++
Sbjct: 140 QKFRYYFNACTFSYSMVWWDWQRWKKEIDWMALNGINMPLALTGQEGIWTKVYK 193
>gi|296237182|ref|XP_002763645.1| PREDICTED: alpha-N-acetylglucosaminidase-like, partial [Callithrix
jacchus]
Length = 249
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 55/117 (47%), Positives = 69/117 (58%), Gaps = 7/117 (5%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G + + G TGV AGLH YLR +CG H++W G Q+ L + P +
Sbjct: 69 GAARVRVRGSTGVAAAAGLHRYLRDFCGCHVAWS---GSQLRLPRPLPAVP----GELTE 121
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
P YYQN T SY+F WWDW RWE+EIDWM L GINL LA++GQE IWQ++ Q
Sbjct: 122 ATPNRYRYYQNVCTQSYSFVWWDWARWEREIDWMVLNGINLALAWSGQEAIWQRLLQ 178
>gi|395827009|ref|XP_003786703.1| PREDICTED: alpha-N-acetylglucosaminidase [Otolemur garnettii]
Length = 756
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 56/116 (48%), Positives = 72/116 (62%), Gaps = 7/116 (6%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G + + G TGV AGLH YLR +CG H++W G Q+ +P+ P V + +
Sbjct: 71 GGARVRVRGSTGVAAAAGLHRYLRDFCGCHVAWS---GSQL-RLPR--PLPAVPEE-LTE 123
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P YYQN T SY+F WWDW RWE+EIDWM L GINL LA++GQE IWQ+V+
Sbjct: 124 ATPNRYRYYQNVCTQSYSFVWWDWARWEREIDWMVLNGINLALAWSGQEAIWQRVY 179
>gi|2660688|gb|AAB88084.1| Naglu [Mus musculus]
Length = 739
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 57/116 (49%), Positives = 70/116 (60%), Gaps = 8/116 (6%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G P +++ G TGV AGLH YLR +CG ++W A + P V D G+
Sbjct: 70 GVP-VLVRGSTGVAAAAGLHRYLRDFCGCQVAWSS------AQLHLPWPLPAVPD-GLTE 121
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P YYQN T SY+F WWDW RWE+EIDWMAL GINL LA+ GQE IWQ+V+
Sbjct: 122 TTPNRYRYYQNVCTHSYSFVWWDWARWEQEIDWMALNGINLALAWNGQEAIWQRVY 177
>gi|254910995|ref|NP_038820.2| alpha-N-acetylglucosaminidase precursor [Mus musculus]
gi|20385160|gb|AAM21194.1|AF363242_1 N-acetyl-glucosaminidase [Mus musculus]
gi|3329361|gb|AAC26842.1| alpha-N-acetylglucosaminidase [Mus musculus]
gi|33585908|gb|AAH55733.1| Alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB) [Mus
musculus]
gi|74211094|dbj|BAE37639.1| unnamed protein product [Mus musculus]
gi|74218052|dbj|BAE42009.1| unnamed protein product [Mus musculus]
gi|148671929|gb|EDL03876.1| alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB), isoform
CRA_b [Mus musculus]
Length = 739
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 57/116 (49%), Positives = 70/116 (60%), Gaps = 8/116 (6%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G P +++ G TGV AGLH YLR +CG ++W A + P V D G+
Sbjct: 70 GVP-VLVRGSTGVAAAAGLHRYLRDFCGCQVAWSS------AQLHLPWPLPAVPD-GLTE 121
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P YYQN T SY+F WWDW RWE+EIDWMAL GINL LA+ GQE IWQ+V+
Sbjct: 122 TTPNRYRYYQNVCTHSYSFVWWDWARWEQEIDWMALNGINLALAWNGQEAIWQRVY 177
>gi|237719043|ref|ZP_04549524.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|229451821|gb|EEO57612.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
Length = 713
Score = 110 bits (276), Expect = 3e-22, Method: Composition-based stats.
Identities = 62/162 (38%), Positives = 88/162 (54%), Gaps = 17/162 (10%)
Query: 54 LAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVE 113
+A A A+L+RLLPS+ +FQF +K G + CF + + +IVI G
Sbjct: 12 VAVAEALLKRLLPSYIESFQF----QKLKGEKDCFTIESVKD-------KIVIGGNNANS 60
Query: 114 VLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVT 173
+ GL+ YL+Y+C + +SW V++ P V + V R + ++ N T
Sbjct: 61 MAMGLNHYLKYYCLTTVSWYADIAVEIPE-----ELPMVGEKVVSEAR-VDTRFFLNYCT 114
Query: 174 SSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YT WW WK WE+ IDWMAL GIN+PLA TGQE +W KV+
Sbjct: 115 YGYTMPWWQWKEWERFIDWMALNGINMPLAITGQEAVWHKVW 156
>gi|423293377|ref|ZP_17271504.1| hypothetical protein HMPREF1070_00169 [Bacteroides ovatus
CL03T12C18]
gi|392678320|gb|EIY71728.1| hypothetical protein HMPREF1070_00169 [Bacteroides ovatus
CL03T12C18]
Length = 727
Score = 110 bits (276), Expect = 3e-22, Method: Composition-based stats.
Identities = 62/162 (38%), Positives = 88/162 (54%), Gaps = 17/162 (10%)
Query: 54 LAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVE 113
+A A A+L+RLLPS+ +FQF +K G + CF + + +IVI G
Sbjct: 26 VAVAEALLKRLLPSYIESFQF----QKLKGEKDCFTIESVKD-------KIVIGGNNANS 74
Query: 114 VLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVT 173
+ GL+ YL+Y+C + +SW V++ P V + V R + ++ N T
Sbjct: 75 MAMGLNHYLKYYCLTTVSWYADIAVEIPE-----ELPMVGEKVVSEAR-VDTRFFLNYCT 128
Query: 174 SSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YT WW WK WE+ IDWMAL GIN+PLA TGQE +W KV+
Sbjct: 129 YGYTMPWWQWKEWERFIDWMALNGINMPLAITGQEAVWHKVW 170
>gi|311267179|ref|XP_003131436.1| PREDICTED: alpha-N-acetylglucosaminidase [Sus scrofa]
Length = 744
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 55/113 (48%), Positives = 69/113 (61%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++ + G TGV AGLH YLR +CG H++W G Q+ L + P P
Sbjct: 75 QVRVIGSTGVAAAAGLHRYLRDFCGCHVAWS---GSQLRLPQPLPAVPEELTEAT----P 127
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 128 NRYRYYQNVCTQSYSFVWWDWARWEQEIDWMALNGINLALAWSGQEAIWQRVY 180
>gi|270005801|gb|EFA02249.1| hypothetical protein TcasGA2_TC007912 [Tribolium castaneum]
Length = 747
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 59/172 (34%), Positives = 84/172 (48%), Gaps = 16/172 (9%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEI 104
R +A Q A +++RL+P F +I + F + + I
Sbjct: 22 RPKASLKTQTDAVKDLIKRLIPGQAQFFDIKIQPDLNHEDKDAFKIE-------KIGDLI 74
Query: 105 VISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIP 164
I+G TGV G + YL+Y+C H+SW+ T +P++ + D
Sbjct: 75 QITGTTGVAAATGFNHYLKYFCNCHVSWETTQLKVPEVLPEVNLTITLNDR--------- 125
Query: 165 LNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YYQN T+SY+F WWDW +WEK IDWM L G NL LAF GQE IW +V++
Sbjct: 126 FRYYQNVCTTSYSFVWWDWTQWEKHIDWMVLNGFNLVLAFNGQEAIWDRVYK 177
>gi|91080563|ref|XP_973259.1| PREDICTED: similar to alpha-N-acetyl glucosaminidase [Tribolium
castaneum]
Length = 747
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 59/172 (34%), Positives = 84/172 (48%), Gaps = 16/172 (9%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEI 104
R +A Q A +++RL+P F +I + F + + I
Sbjct: 22 RPKASLKTQTDAVKDLIKRLIPGQAQFFDIKIQPDLNHEDKDAFKIE-------KIGDLI 74
Query: 105 VISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIP 164
I+G TGV G + YL+Y+C H+SW+ T +P++ + D
Sbjct: 75 QITGTTGVAAATGFNHYLKYFCNCHVSWETTQLKVPEVLPEVNLTITLNDR--------- 125
Query: 165 LNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YYQN T+SY+F WWDW +WEK IDWM L G NL LAF GQE IW +V++
Sbjct: 126 FRYYQNVCTTSYSFVWWDWTQWEKHIDWMVLNGFNLVLAFNGQEAIWDRVYK 177
>gi|431890602|gb|ELK01481.1| Alpha-N-acetylglucosaminidase [Pteropus alecto]
Length = 740
Score = 110 bits (275), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 55/112 (49%), Positives = 68/112 (60%), Gaps = 7/112 (6%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ + G TGV AGLH YLR +CG H++W G Q+ L + P P
Sbjct: 72 VQVLGSTGVAAAAGLHRYLRDFCGCHVAWS---GSQLRLPQPLPAVPEELTEAT----PN 124
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 125 RYRYYQNVCTQSYSFVWWDWARWEQEIDWMALNGINLALAWSGQEAIWQRVY 176
>gi|299144715|ref|ZP_07037783.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
gi|298515206|gb|EFI39087.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
Length = 727
Score = 110 bits (274), Expect = 5e-22, Method: Composition-based stats.
Identities = 61/163 (37%), Positives = 89/163 (54%), Gaps = 19/163 (11%)
Query: 54 LAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVE 113
+A A A+L+RLLPS+ +FQF +K G + CF + + +IVI G
Sbjct: 26 VAVAEALLKRLLPSYIESFQF----QKLKGEKDCFTIESVKD-------KIVIGGNNANS 74
Query: 114 VLAGLHWYLRYWCGSHISWDKTGGVQVAS-MPKLGSFPRVQDAGVFVKRPIPLNYYQNAV 172
+ GL+ YL+Y+C + +SW V++ +P +G V + + ++ N
Sbjct: 75 MAMGLNHYLKYYCLTTVSWYADIAVEIPEELPMVGE-------KVISEARVDTRFFLNYC 127
Query: 173 TSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
T YT WW WK WE+ IDWMAL GIN+PLA TGQE +W KV+
Sbjct: 128 TYGYTMPWWQWKEWERFIDWMALNGINMPLAITGQEAVWYKVW 170
>gi|298480128|ref|ZP_06998327.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
gi|336404356|ref|ZP_08585054.1| hypothetical protein HMPREF0127_02367 [Bacteroides sp. 1_1_30]
gi|298273937|gb|EFI15499.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
gi|335943684|gb|EGN05523.1| hypothetical protein HMPREF0127_02367 [Bacteroides sp. 1_1_30]
Length = 727
Score = 110 bits (274), Expect = 5e-22, Method: Composition-based stats.
Identities = 62/162 (38%), Positives = 88/162 (54%), Gaps = 17/162 (10%)
Query: 54 LAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVE 113
+A A A+L+RLLPS+ +FQF +K G + CF + + +IVI G
Sbjct: 26 VAVAEALLKRLLPSYIESFQF----QKLKGEKDCFTIESVKD-------KIVIGGNNANS 74
Query: 114 VLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVT 173
+ GL+ YL+Y+C + +SW V++ P V + V R + ++ N T
Sbjct: 75 MAMGLNHYLKYYCLTTVSWYADIAVEIPE-----ELPMVGEKVVSEAR-VDTRFFLNYCT 128
Query: 174 SSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YT WW WK WE+ IDWMAL GIN+PLA TGQE +W KV+
Sbjct: 129 YGYTMPWWQWKEWERFIDWMALNGINMPLAITGQEAVWYKVW 170
>gi|295085509|emb|CBK67032.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
XB1A]
Length = 716
Score = 110 bits (274), Expect = 5e-22, Method: Composition-based stats.
Identities = 62/162 (38%), Positives = 88/162 (54%), Gaps = 17/162 (10%)
Query: 54 LAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVE 113
+A A A+L+RLLPS+ +FQF +K G + CF + + +IVI G
Sbjct: 15 VAVAEALLKRLLPSYIESFQF----QKLKGEKDCFTIESVKD-------KIVIGGNNANS 63
Query: 114 VLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVT 173
+ GL+ YL+Y+C + +SW V++ P V + V R + ++ N T
Sbjct: 64 MAMGLNHYLKYYCLTTVSWYADIAVEIPE-----ELPMVGEKVVSEAR-VDTRFFLNYCT 117
Query: 174 SSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YT WW WK WE+ IDWMAL GIN+PLA TGQE +W KV+
Sbjct: 118 YGYTMPWWQWKEWERFIDWMALNGINMPLAITGQEAVWYKVW 159
>gi|160884062|ref|ZP_02065065.1| hypothetical protein BACOVA_02038 [Bacteroides ovatus ATCC 8483]
gi|423291477|ref|ZP_17270325.1| hypothetical protein HMPREF1069_05368 [Bacteroides ovatus
CL02T12C04]
gi|156110404|gb|EDO12149.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
8483]
gi|392663477|gb|EIY57027.1| hypothetical protein HMPREF1069_05368 [Bacteroides ovatus
CL02T12C04]
Length = 727
Score = 110 bits (274), Expect = 5e-22, Method: Composition-based stats.
Identities = 62/162 (38%), Positives = 88/162 (54%), Gaps = 17/162 (10%)
Query: 54 LAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVE 113
+A A A+L+RLLPS+ +FQF +K G + CF + + +IVI G
Sbjct: 26 VAVAEALLKRLLPSYIESFQF----QKLKGEKDCFTIESVKD-------KIVIGGNNANS 74
Query: 114 VLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVT 173
+ GL+ YL+Y+C + +SW V++ P V + V R + ++ N T
Sbjct: 75 MAMGLNHYLKYYCLTTVSWYADIAVEIPE-----ELPMVGEKVVSEAR-VDTRFFLNYCT 128
Query: 174 SSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YT WW WK WE+ IDWMAL GIN+PLA TGQE +W KV+
Sbjct: 129 YGYTMPWWQWKEWERFIDWMALNGINMPLAITGQEAVWYKVW 170
>gi|423214204|ref|ZP_17200732.1| hypothetical protein HMPREF1074_02264 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693149|gb|EIY86384.1| hypothetical protein HMPREF1074_02264 [Bacteroides xylanisolvens
CL03T12C04]
Length = 727
Score = 110 bits (274), Expect = 5e-22, Method: Composition-based stats.
Identities = 62/162 (38%), Positives = 88/162 (54%), Gaps = 17/162 (10%)
Query: 54 LAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVE 113
+A A A+L+RLLPS+ +FQF +K G + CF + + +IVI G
Sbjct: 26 VAVAEALLKRLLPSYIESFQF----QKLKGEKDCFTIESVKD-------KIVIGGNNANS 74
Query: 114 VLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVT 173
+ GL+ YL+Y+C + +SW V++ P V + V R + ++ N T
Sbjct: 75 MAMGLNHYLKYYCLTTVSWYADIAVEIPE-----ELPMVGEKVVSEAR-VDTRFFLNYCT 128
Query: 174 SSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YT WW WK WE+ IDWMAL GIN+PLA TGQE +W KV+
Sbjct: 129 YGYTMPWWQWKEWERFIDWMALNGINMPLAITGQEAVWYKVW 170
>gi|358419179|ref|XP_003584151.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bos taurus]
Length = 741
Score = 110 bits (274), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 55/112 (49%), Positives = 68/112 (60%), Gaps = 7/112 (6%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ + G TGV AGLH YLR +CG H++W G Q+ L + P P
Sbjct: 75 VQVLGSTGVAAAAGLHRYLRDFCGCHVAWS---GSQLRLPQPLPAVPEELTEAT----PN 127
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 128 RYRYYQNVCTQSYSFLWWDWARWEQEIDWMALNGINLALAWSGQEAIWQRVY 179
>gi|383115203|ref|ZP_09935961.1| hypothetical protein BSGG_2915 [Bacteroides sp. D2]
gi|313695380|gb|EFS32215.1| hypothetical protein BSGG_2915 [Bacteroides sp. D2]
Length = 727
Score = 110 bits (274), Expect = 6e-22, Method: Composition-based stats.
Identities = 62/162 (38%), Positives = 88/162 (54%), Gaps = 17/162 (10%)
Query: 54 LAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVE 113
+A A A+L+RLLPS+ +FQF +K G + CF + + +IVI G
Sbjct: 26 VAVAEALLKRLLPSYIESFQF----QKLKGEKDCFTIESVKD-------KIVIGGNNANS 74
Query: 114 VLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVT 173
+ GL+ YL+Y+C + +SW V++ P V + V R + ++ N T
Sbjct: 75 MAMGLNHYLKYYCLTTVSWYADIAVEIPE-----ELPMVGEKVVSEAR-VDTRFFLNYCT 128
Query: 174 SSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YT WW WK WE+ IDWMAL GIN+PLA TGQE +W KV+
Sbjct: 129 YGYTMPWWQWKEWERFIDWMALNGINMPLAITGQEAVWYKVW 170
>gi|294648123|ref|ZP_06725666.1| conserved domain protein [Bacteroides ovatus SD CC 2a]
gi|292636507|gb|EFF54982.1| conserved domain protein [Bacteroides ovatus SD CC 2a]
Length = 215
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 65/181 (35%), Positives = 97/181 (53%), Gaps = 24/181 (13%)
Query: 35 YISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHP 94
++S +L +D++ +A A A+L+RLLPS+ +FQF+ K G + CF + +
Sbjct: 3 FMSLVLQAKDKD-------VAVAEALLKRLLPSYIESFQFQ----KLKGEKDCFTIESVK 51
Query: 95 SSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQD 154
+IVI G + GL+ YL+Y+C + +SW V++ P V D
Sbjct: 52 D-------KIVIGGNNANSMAMGLNHYLKYYCLTTVSWYADIAVEIPE-----ELPMV-D 98
Query: 155 AGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
V + + ++ N T YT WW WK WE+ IDWMAL GIN+PLA TGQE +W KV
Sbjct: 99 EKVVSEARVDTRFFLNYCTYGYTMPWWQWKEWERFIDWMALNGINMPLAITGQEAVWYKV 158
Query: 215 F 215
+
Sbjct: 159 W 159
>gi|156121099|ref|NP_001095696.1| alpha-N-acetylglucosaminidase precursor [Bos taurus]
gi|151554244|gb|AAI48148.1| NAGLU protein [Bos taurus]
gi|296476361|tpg|DAA18476.1| TPA: alpha-N-acetylglucosaminidase [Bos taurus]
Length = 667
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 55/112 (49%), Positives = 68/112 (60%), Gaps = 7/112 (6%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ + G TGV AGLH YLR +CG H++W G Q+ L + P P
Sbjct: 75 VQVLGSTGVAAAAGLHRYLRDFCGCHVAWS---GSQLRLPQPLPAVPEELTEAT----PN 127
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 128 RYRYYQNVCTQSYSFLWWDWARWEQEIDWMALNGINLALAWSGQEAIWQRVY 179
>gi|242011515|ref|XP_002426494.1| alpha-N-acetylglucosaminidase, putative [Pediculus humanus
corporis]
gi|212510620|gb|EEB13756.1| alpha-N-acetylglucosaminidase, putative [Pediculus humanus
corporis]
Length = 1345
Score = 109 bits (273), Expect = 7e-22, Method: Composition-based stats.
Identities = 52/113 (46%), Positives = 72/113 (63%), Gaps = 9/113 (7%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++ +SG + ++ G+ +YL+ +C HISWD G Q+ +P + P V + V
Sbjct: 712 KVSVSGPNAISLVWGVGFYLKNFCNVHISWD---GDQL-RIPV--TLPVVN---ISVNSV 762
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN TSSY+F WWDW RWE+EIDWMA+ GINL LAFTGQE IW++ +
Sbjct: 763 EKFRYYQNVCTSSYSFVWWDWNRWEREIDWMAINGINLALAFTGQEAIWKRTY 815
>gi|66801665|ref|XP_629757.1| hypothetical protein DDB_G0291998 [Dictyostelium discoideum AX4]
gi|60463162|gb|EAL61355.1| hypothetical protein DDB_G0291998 [Dictyostelium discoideum AX4]
Length = 798
Score = 109 bits (272), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 94/183 (51%), Gaps = 12/183 (6%)
Query: 39 LLDIQ-DRERAPPSVQLAAAYAVLQRLLPSHYSAF-QFRIISKKQCGGEYCFI----LRN 92
LLD+ +R+ + Q++ Y +++RL + F + +I + GEY I + N
Sbjct: 54 LLDLHMERDYKDGNKQISTVYGLIERLFNFEMTLFFKLKIEESEWMTGEYYEISTESVEN 113
Query: 93 HPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRV 152
SS + + I +GV + GL +YL+Y+C +W G Q S+ P V
Sbjct: 114 EDSS--KNITFVTIRADSGVNLAMGLQYYLKYYCFCSYTW---SGDQ-CSITSYSQLPAV 167
Query: 153 QDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQ 212
+ V + YY N T Y+ WW+W RWE+EIDWMAL G NLPLAF GQE IW
Sbjct: 168 TEGSVSIPVISAYRYYMNVCTFGYSTTWWNWSRWEREIDWMALNGYNLPLAFVGQEYIWY 227
Query: 213 KVF 215
+VF
Sbjct: 228 RVF 230
>gi|260821254|ref|XP_002605948.1| hypothetical protein BRAFLDRAFT_132235 [Branchiostoma floridae]
gi|229291285|gb|EEN61958.1| hypothetical protein BRAFLDRAFT_132235 [Branchiostoma floridae]
Length = 673
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/115 (51%), Positives = 69/115 (60%), Gaps = 11/115 (9%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAG--VFVK 160
+IVI G TGV G + YL+Y+ +SW VQ +P G P V A V V
Sbjct: 238 KIVIRGTTGVAAAHGFNHYLKYYARCQVSWT----VQQVHLP--GVLPPVTPAVTIVTVN 291
Query: 161 RPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
R YYQN T SY+F WWDW RWE+ +DWMAL GINLPLAF GQE IWQKV+
Sbjct: 292 R---YRYYQNVCTVSYSFVWWDWSRWERHLDWMALSGINLPLAFNGQEAIWQKVY 343
>gi|340617022|ref|YP_004735475.1| alpha-N-acetylglucosaminidase [Zobellia galactanivorans]
gi|339731819|emb|CAZ95084.1| Alpha-N-acetylglucosaminidase, family GH89 [Zobellia
galactanivorans]
Length = 747
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 96/183 (52%), Gaps = 15/183 (8%)
Query: 33 VQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRN 92
V I L I +A S +AY +L R+LP S+F F+ I+ + + F + N
Sbjct: 5 VSLIVLLFTIATYSQARQSGAEKSAYGILNRILPERASSFVFKEITASK--DKDVFEVSN 62
Query: 93 HPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRV 152
+ +IVISG + V + +GL WYL+ +C +HIS + ++P+L S R
Sbjct: 63 GHAG------KIVISGNSAVAMASGLDWYLKNYCNAHISLNYNQLDLPKTLPRLTSKIRK 116
Query: 153 QDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQ 212
+ P Y N T YT WW+W RWE+ ID+MAL+G+N+PLA GQE +WQ
Sbjct: 117 E-------TPFDYRYLFNYCTYGYTMPWWNWDRWEEMIDYMALKGVNMPLAIIGQEAVWQ 169
Query: 213 KVF 215
+V
Sbjct: 170 EVL 172
>gi|390463730|ref|XP_003733088.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
[Callithrix jacchus]
Length = 830
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 54/116 (46%), Positives = 68/116 (58%), Gaps = 7/116 (6%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G + + G TGV AGLH YLR +CG H++W G Q+ L + P +
Sbjct: 164 GAARVRVRGSTGVAAAAGLHGYLRDFCGCHVAWS---GSQLRLPRPLSAVP----GELIE 216
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P YYQN T SY+F WWDW WE+EIDWMAL GIN LA++GQE IWQ+V+
Sbjct: 217 ATPNRYRYYQNVCTQSYSFVWWDWAHWEREIDWMALNGINPALAWSGQEAIWQRVY 272
>gi|156545487|ref|XP_001606979.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Nasonia vitripennis]
Length = 755
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 63/165 (38%), Positives = 90/165 (54%), Gaps = 15/165 (9%)
Query: 51 SVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVT 110
VQ +A ++ R+L F+ + G+ F L+ + +++I+G +
Sbjct: 47 EVQTSAVENLVVRVLGKDADLFKVTLDPDLGPKGKDTFNLKKLKDN------KVIITGTS 100
Query: 111 GVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQN 170
GV V GLH+YL +C H+SW+ G Q+ ++PK+ P V + V YYQN
Sbjct: 101 GVAVAWGLHYYLTTYCNCHVSWE---GSQL-NLPKI--LPDVNET---VSSNDRFRYYQN 151
Query: 171 AVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
T+ Y+ AWW W +WEK IDWMAL INL LAF GQE IWQKV+
Sbjct: 152 VCTAGYSSAWWKWPQWEKNIDWMALNSINLALAFHGQEAIWQKVY 196
>gi|198277542|ref|ZP_03210073.1| hypothetical protein BACPLE_03764 [Bacteroides plebeius DSM 17135]
gi|198270040|gb|EDY94310.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides plebeius DSM
17135]
Length = 722
Score = 108 bits (270), Expect = 1e-21, Method: Composition-based stats.
Identities = 51/114 (44%), Positives = 70/114 (61%), Gaps = 7/114 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++VI G V + GL+WYL+Y+ G H+SW+ S+PK+ + R +
Sbjct: 61 KVVIRGNNYVNIATGLNWYLKYYAGIHLSWNGMTAELPESLPKVSTPVRKETN------- 113
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+ L Y N T SYT A+WDW+RWEKEIDWMAL GINLPLA GQE +W+ + +
Sbjct: 114 LSLRYDFNYCTYSYTMAFWDWERWEKEIDWMALHGINLPLAVVGQECVWKNMLE 167
>gi|325299497|ref|YP_004259414.1| alpha-N-acetylglucosaminidase [Bacteroides salanitronis DSM 18170]
gi|324319050|gb|ADY36941.1| Alpha-N-acetylglucosaminidase [Bacteroides salanitronis DSM 18170]
Length = 723
Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats.
Identities = 52/113 (46%), Positives = 69/113 (61%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+IV+ G T V + +GL+WYL+Y+ G H+SW+ AS+P + R +
Sbjct: 60 KIVVRGNTYVNIASGLNWYLKYYAGIHLSWNNMTAELPASLPPVTKPERHET-------D 112
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+ L Y N T SYT A+WDW RWEKEIDWMAL GINLPLA GQE +W+ +
Sbjct: 113 LSLRYDFNYCTYSYTMAFWDWARWEKEIDWMALHGINLPLAAVGQECVWRNML 165
>gi|393786624|ref|ZP_10374756.1| hypothetical protein HMPREF1068_01036 [Bacteroides nordii
CL02T12C05]
gi|392657859|gb|EIY51489.1| hypothetical protein HMPREF1068_01036 [Bacteroides nordii
CL02T12C05]
Length = 717
Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats.
Identities = 51/113 (45%), Positives = 66/113 (58%), Gaps = 7/113 (6%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+VI G V + GL+WYL+Y+ G H+SW+ +P + R + +
Sbjct: 58 VVIRGNNSVSIATGLNWYLKYYAGIHLSWNGMTATLPDVLPPVMKKERHETK-------L 110
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
P Y N T SY+ A+WDW+RWEKEIDWMAL GINLPLA TG ET+W V Q
Sbjct: 111 PWRYDLNYCTYSYSMAFWDWERWEKEIDWMALHGINLPLAITGTETVWYNVLQ 163
>gi|293371911|ref|ZP_06618315.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
gi|292633157|gb|EFF51734.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
Length = 289
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 62/162 (38%), Positives = 87/162 (53%), Gaps = 17/162 (10%)
Query: 53 QLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGV 112
+A A A+L+RLLPS+ +FQF+ K G + CF + + +IVI G
Sbjct: 14 DVAVAEALLKRLLPSYIESFQFQ----KLKGEKDCFTIESVKD-------KIVIGGNNAN 62
Query: 113 EVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAV 172
+ GL+ YL+Y+C + +SW V++ P V + V R + ++ N
Sbjct: 63 SMAMGLNHYLKYYCLTTVSWYADIAVEIPE-----ELPMVGEKVVSEAR-VDTRFFLNYC 116
Query: 173 TSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
T YT WW WK WE+ IDWMAL GIN+PLA TGQE +W KV
Sbjct: 117 TYGYTMPWWQWKEWERFIDWMALNGINMPLAITGQEAVWYKV 158
>gi|14861380|gb|AAK73655.1| lysosomal alpha-N-acetyl glucosaminidase [Dromaius novaehollandiae]
Length = 753
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 72/171 (42%), Positives = 95/171 (55%), Gaps = 11/171 (6%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEI 104
R A + Q AA A+ +RLL +A + GG + + + P + + +
Sbjct: 28 RAGAEDARQEAAVRALARRLLGPRAAAVALSVDGSLAAGGLDTYRVHSPPGAAV----AV 83
Query: 105 VISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIP 164
++G +GV AGLH YLR CG H+SW G Q+ +P PRV A + P
Sbjct: 84 AVAGSSGVAAAAGLHRYLRDLCGCHLSWS---GRQL-RLPD--PLPRVP-AEIRATAPGR 136
Query: 165 LNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN T SY+FAWWDW RWE+EIDWMAL GINL LAF GQE +WQ+V+
Sbjct: 137 FRYYQNVCTQSYSFAWWDWARWEEEIDWMALSGINLALAFAGQEAVWQRVY 187
>gi|14861378|gb|AAK73654.1| lysosomal alpha-N-acetyl glucosaminidase [Dromaius novaehollandiae]
Length = 753
Score = 107 bits (266), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 72/171 (42%), Positives = 95/171 (55%), Gaps = 11/171 (6%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEI 104
R A + Q AA A+ +RLL +A + GG + + + P + + +
Sbjct: 28 RAGAEDARQEAAVRALARRLLGPRAAAVALSVDGSLAAGGLDTYRVHSPPGAAV----AV 83
Query: 105 VISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIP 164
++G +GV AGLH YLR CG H+SW G Q+ +P PRV A + P
Sbjct: 84 AVAGSSGVAAAAGLHRYLRDLCGCHLSWS---GRQL-RLPD--PLPRV-PAEIRATAPGR 136
Query: 165 LNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN T SY+FAWWDW RWE+EIDWMAL GINL LAF GQE +WQ+V+
Sbjct: 137 FRYYQNVCTQSYSFAWWDWARWEEEIDWMALSGINLALAFAGQEAVWQRVY 187
>gi|380804373|gb|AFE74062.1| alpha-N-acetylglucosaminidase precursor, partial [Macaca mulatta]
Length = 265
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 66/106 (62%), Gaps = 7/106 (6%)
Query: 110 TGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQ 169
TGV AGLH YLR +CG H++W G Q+ L + P + P YYQ
Sbjct: 1 TGVAAAAGLHRYLRDFCGCHVAWS---GSQLRLPRPLPAVP----GELTEATPNRYRYYQ 53
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
N T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 54 NVCTQSYSFVWWDWARWEREIDWMALNGINLALAWSGQEAIWQRVY 99
>gi|126307952|ref|XP_001365931.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Monodelphis
domestica]
Length = 481
Score = 105 bits (263), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 66/167 (39%), Positives = 87/167 (52%), Gaps = 19/167 (11%)
Query: 53 QLAAAYAVLQRLL-PSHYSAFQF---RIISKKQCGGEYCFILRNHPSSYIRGTPEIVISG 108
+ AA +L RLL P + F R ++ YC S GTP + ++G
Sbjct: 29 EAAAVRGLLTRLLGPRAAAGFSVSVERALAPAPDADTYCL-------SGGGGTP-VQVTG 80
Query: 109 VTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYY 168
+GV AGLH YLR +CG H++W G Q+ L P+ + P YY
Sbjct: 81 SSGVAAAAGLHRYLRDFCGCHVAWS---GAQLRLPEPLPEVPKK----LIEITPNRYRYY 133
Query: 169 QNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
N T SY+F WW W+RWE+EIDWMAL GINL LA GQE IW++V+
Sbjct: 134 LNICTPSYSFVWWGWERWEREIDWMALNGINLVLAPVGQEAIWRRVY 180
>gi|332018247|gb|EGI58852.1| Alpha-N-acetylglucosaminidase [Acromyrmex echinatior]
Length = 686
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 53/114 (46%), Positives = 69/114 (60%), Gaps = 9/114 (7%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+I I G +GV + GLH+YL+ +C HISWD T +P + P V+ + R
Sbjct: 24 QIEIRGTSGVAITWGLHYYLKNYCNVHISWDGTH----IELPN--TLPEVRVKIISNDR- 76
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YYQN T Y+ AWW W +WEK IDWMAL GINL LAF+ QE IWQ+++Q
Sbjct: 77 --FRYYQNVCTVGYSSAWWQWHQWEKNIDWMALNGINLALAFSAQEAIWQRLYQ 128
>gi|294674521|ref|YP_003575137.1| alpha-N-acetylglucosaminidase [Prevotella ruminicola 23]
gi|294472030|gb|ADE81419.1| putative alpha-N-acetylglucosaminidase [Prevotella ruminicola 23]
Length = 754
Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats.
Identities = 48/113 (42%), Positives = 70/113 (61%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++VI G + V + +GL+WYL+Y+ G H+SW++ A +P + R +
Sbjct: 56 KVVIRGNSWVNIASGLNWYLKYYAGVHLSWNQMQTKLPAKLPAVTKKERRET-------D 108
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+ L Y N T SY+ A+WDWKRWEKEIDWMAL GIN+PLA G+E +W+ +
Sbjct: 109 LSLRYDFNYCTFSYSMAFWDWKRWEKEIDWMALHGINMPLAIVGEECVWRNML 161
>gi|170060634|ref|XP_001865888.1| alpha-N-acetyl glucosaminidase [Culex quinquefasciatus]
gi|167879069|gb|EDS42452.1| alpha-N-acetyl glucosaminidase [Culex quinquefasciatus]
Length = 761
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 70/201 (34%), Positives = 101/201 (50%), Gaps = 18/201 (8%)
Query: 15 ITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQF 74
++ + L+ LA SST G S + + + P VQ A ++ R++P S QF
Sbjct: 3 LSVLVLTFLAGTISSTHGEATFSSHIFKHVKTKTSPEVQQTATAELISRIVPERAS--QF 60
Query: 75 RIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDK 134
++ I + SS + +I G +GV G + YL+Y+CG H+SWD
Sbjct: 61 KVTVDSALKPNSFKITKTSDSSVV------LIVGSSGVAASKGFYHYLKYYCGCHVSWD- 113
Query: 135 TGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMA 194
G Q+ +P P V V V+ P + YYQN T SY+F+WW W W + IDWMA
Sbjct: 114 --GDQL-DLPD--DLPNV---NVSVEAPSSIVYYQNVCTWSYSFSWWTWTEWRRHIDWMA 165
Query: 195 LQGINLPLAFTGQETIWQKVF 215
LQGI L LA QE +W +V+
Sbjct: 166 LQGITLSLA-PFQEDLWTEVY 185
>gi|393783261|ref|ZP_10371436.1| hypothetical protein HMPREF1071_02304 [Bacteroides salyersiae
CL02T12C01]
gi|392669540|gb|EIY63028.1| hypothetical protein HMPREF1071_02304 [Bacteroides salyersiae
CL02T12C01]
Length = 724
Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats.
Identities = 61/163 (37%), Positives = 92/163 (56%), Gaps = 14/163 (8%)
Query: 54 LAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVE 113
+ A ++++RL+P++ FQFR I K G+ CF L +I+ISG
Sbjct: 26 VVAMQSMVKRLIPAYAENFQFRKI--KPIDGKDCFRLDASDPR------KIIISGNNANS 77
Query: 114 VLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVT 173
+ GL+ YLRY+C + ISW V+ MP+ + P V+ + ++ + ++ N T
Sbjct: 78 MAMGLNHYLRYYCLTTISWYADIPVE---MPE--TLPAVEKP-IQIEAKVDRRFFLNYCT 131
Query: 174 SSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YT +W WK WE+ IDWMAL GIN+PLA TGQE +W KV++
Sbjct: 132 YGYTMPFWQWKDWERLIDWMALNGINMPLAITGQEAVWYKVWK 174
>gi|354485058|ref|XP_003504701.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cricetulus griseus]
gi|344251941|gb|EGW08045.1| Alpha-N-acetylglucosaminidase [Cricetulus griseus]
Length = 740
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/118 (45%), Positives = 72/118 (61%), Gaps = 8/118 (6%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G P + + G TGV AGL+ YLR +CG ++W + +P L +Q+ +
Sbjct: 71 GIP-VRVRGSTGVAAAAGLYRYLRDFCGCQVAWSGSQLRLPRPLPAL-----LQE--LTE 122
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQV 217
P YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+ +
Sbjct: 123 ATPNRYRYYQNVCTHSYSFVWWDWARWEQEIDWMALNGINLALAWSGQEAIWQRVYLI 180
>gi|53711968|ref|YP_097960.1| alpha-N-acetylglucosaminidase [Bacteroides fragilis YCH46]
gi|52214833|dbj|BAD47426.1| alpha-N-acetylglucosaminidase precursor [Bacteroides fragilis
YCH46]
Length = 718
Score = 104 bits (260), Expect = 3e-20, Method: Composition-based stats.
Identities = 49/113 (43%), Positives = 65/113 (57%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++VI G V + GL+WYL+Y+ G H+SW+ A +P + R +
Sbjct: 58 KVVIRGNNYVSIATGLNWYLKYYAGIHLSWNGMQAKLPAVLPPVTKKERRETT------- 110
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+P Y N T SY+ A+WDW RWEKEIDWMAL GIN+PLA TG E +W V
Sbjct: 111 LPYRYDLNYCTFSYSMAFWDWNRWEKEIDWMALHGINIPLAVTGAEAVWHNVL 163
>gi|423282107|ref|ZP_17260992.1| hypothetical protein HMPREF1204_00530 [Bacteroides fragilis HMW
615]
gi|404582594|gb|EKA87288.1| hypothetical protein HMPREF1204_00530 [Bacteroides fragilis HMW
615]
Length = 718
Score = 104 bits (260), Expect = 3e-20, Method: Composition-based stats.
Identities = 49/113 (43%), Positives = 65/113 (57%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++VI G V + GL+WYL+Y+ G H+SW+ A +P + R +
Sbjct: 58 KVVIRGNNYVSIATGLNWYLKYYAGIHLSWNGMQAKLPAVLPPVTKKERRETT------- 110
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+P Y N T SY+ A+WDW RWEKEIDWMAL GIN+PLA TG E +W V
Sbjct: 111 LPYRYDLNYCTFSYSMAFWDWNRWEKEIDWMALHGINIPLAVTGAEAVWHNVL 163
>gi|423269418|ref|ZP_17248390.1| hypothetical protein HMPREF1079_01472 [Bacteroides fragilis
CL05T00C42]
gi|423273021|ref|ZP_17251968.1| hypothetical protein HMPREF1080_00621 [Bacteroides fragilis
CL05T12C13]
gi|392701212|gb|EIY94372.1| hypothetical protein HMPREF1079_01472 [Bacteroides fragilis
CL05T00C42]
gi|392708585|gb|EIZ01692.1| hypothetical protein HMPREF1080_00621 [Bacteroides fragilis
CL05T12C13]
Length = 718
Score = 104 bits (260), Expect = 3e-20, Method: Composition-based stats.
Identities = 49/113 (43%), Positives = 65/113 (57%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++VI G V + GL+WYL+Y+ G H+SW+ A +P + R +
Sbjct: 58 KVVIRGNNYVSIATGLNWYLKYYAGIHLSWNGMQAKLPAVLPPVTKKERRETT------- 110
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+P Y N T SY+ A+WDW RWEKEIDWMAL GIN+PLA TG E +W V
Sbjct: 111 LPYRYDLNYCTFSYSMAFWDWNRWEKEIDWMALHGINIPLAVTGAEAVWHNVL 163
>gi|336408181|ref|ZP_08588675.1| hypothetical protein HMPREF1018_00690 [Bacteroides sp. 2_1_56FAA]
gi|335939481|gb|EGN01355.1| hypothetical protein HMPREF1018_00690 [Bacteroides sp. 2_1_56FAA]
Length = 718
Score = 104 bits (260), Expect = 3e-20, Method: Composition-based stats.
Identities = 49/113 (43%), Positives = 65/113 (57%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++VI G V + GL+WYL+Y+ G H+SW+ A +P + R +
Sbjct: 58 KVVIRGNNYVSIATGLNWYLKYYAGIHLSWNGMQAKLPAVLPPVTKKERRETT------- 110
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+P Y N T SY+ A+WDW RWEKEIDWMAL GIN+PLA TG E +W V
Sbjct: 111 LPYRYDLNYCTFSYSMAFWDWNRWEKEIDWMALHGINIPLAVTGAEAVWHNVL 163
>gi|265765312|ref|ZP_06093587.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_16]
gi|263254696|gb|EEZ26130.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_16]
Length = 718
Score = 104 bits (260), Expect = 3e-20, Method: Composition-based stats.
Identities = 49/113 (43%), Positives = 65/113 (57%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++VI G V + GL+WYL+Y+ G H+SW+ A +P + R +
Sbjct: 58 KVVIRGNNYVSIATGLNWYLKYYAGIHLSWNGMQAKLPAVLPPVTKKERRETT------- 110
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+P Y N T SY+ A+WDW RWEKEIDWMAL GIN+PLA TG E +W V
Sbjct: 111 LPYRYDLNYCTFSYSMAFWDWNRWEKEIDWMALHGINIPLAVTGAEAVWHNVL 163
>gi|60680169|ref|YP_210313.1| alpha-N-acetylglucosaminidase [Bacteroides fragilis NCTC 9343]
gi|375357012|ref|YP_005109784.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis 638R]
gi|383116930|ref|ZP_09937677.1| hypothetical protein BSHG_0978 [Bacteroides sp. 3_2_5]
gi|60491603|emb|CAH06355.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis NCTC
9343]
gi|251947777|gb|EES88059.1| hypothetical protein BSHG_0978 [Bacteroides sp. 3_2_5]
gi|301161693|emb|CBW21233.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis 638R]
Length = 718
Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats.
Identities = 49/113 (43%), Positives = 65/113 (57%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++VI G V + GL+WYL+Y+ G H+SW+ A +P + R +
Sbjct: 58 KVVIRGNNYVSIATGLNWYLKYYAGIHLSWNGMQAKLPAVLPPVTKKERRETT------- 110
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+P Y N T SY+ A+WDW RWEKEIDWMAL GIN+PLA TG E +W V
Sbjct: 111 LPYRYDLNYCTFSYSMAFWDWNRWEKEIDWMALHGINIPLAVTGAEAVWHNVL 163
>gi|423248659|ref|ZP_17229675.1| hypothetical protein HMPREF1066_00685 [Bacteroides fragilis
CL03T00C08]
gi|423253608|ref|ZP_17234539.1| hypothetical protein HMPREF1067_01183 [Bacteroides fragilis
CL03T12C07]
gi|392655237|gb|EIY48880.1| hypothetical protein HMPREF1067_01183 [Bacteroides fragilis
CL03T12C07]
gi|392657600|gb|EIY51231.1| hypothetical protein HMPREF1066_00685 [Bacteroides fragilis
CL03T00C08]
Length = 718
Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats.
Identities = 49/113 (43%), Positives = 65/113 (57%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++VI G V + GL+WYL+Y+ G H+SW+ A +P + R +
Sbjct: 58 KVVIRGNNYVSIATGLNWYLKYYAGIHLSWNGMQAKLPAVLPPVTKKERRETT------- 110
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+P Y N T SY+ A+WDW RWEKEIDWMAL GIN+PLA TG E +W V
Sbjct: 111 LPYRYDLNYCTFSYSMAFWDWNRWEKEIDWMALHGINIPLAVTGAEAVWHNVL 163
>gi|423259033|ref|ZP_17239956.1| hypothetical protein HMPREF1055_02233 [Bacteroides fragilis
CL07T00C01]
gi|423263996|ref|ZP_17242999.1| hypothetical protein HMPREF1056_00686 [Bacteroides fragilis
CL07T12C05]
gi|387776613|gb|EIK38713.1| hypothetical protein HMPREF1055_02233 [Bacteroides fragilis
CL07T00C01]
gi|392706262|gb|EIY99385.1| hypothetical protein HMPREF1056_00686 [Bacteroides fragilis
CL07T12C05]
Length = 718
Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats.
Identities = 49/113 (43%), Positives = 65/113 (57%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++VI G V + GL+WYL+Y+ G H+SW+ A +P + R +
Sbjct: 58 KVVIRGNNYVSIATGLNWYLKYYAGIHLSWNGMQAKLPAVLPPVTKKERRETT------- 110
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+P Y N T SY+ A+WDW RWEKEIDWMAL GIN+PLA TG E +W V
Sbjct: 111 LPYRYDLNYCTFSYSMAFWDWNRWEKEIDWMALHGINIPLAVTGAEAVWHNVL 163
>gi|299140550|ref|ZP_07033688.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
(NAG) [Prevotella oris C735]
gi|298577516|gb|EFI49384.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
(NAG) [Prevotella oris C735]
Length = 741
Score = 103 bits (258), Expect = 4e-20, Method: Composition-based stats.
Identities = 49/112 (43%), Positives = 67/112 (59%), Gaps = 7/112 (6%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
IVI G T V + GL+WYL+Y+ G H++W+ AS+P++ S R +
Sbjct: 57 IVIKGNTWVNIAVGLNWYLKYYAGIHLTWNNMNTHLPASLPRVTSPERHAT-------DL 109
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
L Y N T SY+ A+WDWKRW+ EIDWMAL G+NLPLA G+E W+ +
Sbjct: 110 KLRYDFNYCTFSYSMAFWDWKRWQTEIDWMALHGVNLPLAIVGEEVAWRNML 161
>gi|424666301|ref|ZP_18103337.1| hypothetical protein HMPREF1205_02176 [Bacteroides fragilis HMW
616]
gi|404573840|gb|EKA78592.1| hypothetical protein HMPREF1205_02176 [Bacteroides fragilis HMW
616]
Length = 718
Score = 103 bits (258), Expect = 4e-20, Method: Composition-based stats.
Identities = 49/113 (43%), Positives = 65/113 (57%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++VI G V + GL+WYL+Y+ G H+SW+ A +P + R +
Sbjct: 58 KVVIRGNNYVSIATGLNWYLKYYAGIHLSWNGMQAKLPAVLPPVTKKERRETT------- 110
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+P Y N T SY+ A+WDW RWEKEIDWMAL GIN+PLA TG E +W V
Sbjct: 111 LPYRYDLNYCTFSYSMAFWDWDRWEKEIDWMALHGINIPLAVTGAEAVWYNVL 163
>gi|313145188|ref|ZP_07807381.1| glycoside hydrolase family 89 [Bacteroides fragilis 3_1_12]
gi|313133955|gb|EFR51315.1| glycoside hydrolase family 89 [Bacteroides fragilis 3_1_12]
Length = 718
Score = 103 bits (258), Expect = 4e-20, Method: Composition-based stats.
Identities = 49/113 (43%), Positives = 65/113 (57%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++VI G V + GL+WYL+Y+ G H+SW+ A +P + R +
Sbjct: 58 KVVIRGNNYVSIATGLNWYLKYYAGIHLSWNGMQAKLPAVLPPVTKKERRETT------- 110
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+P Y N T SY+ A+WDW RWEKEIDWMAL GIN+PLA TG E +W V
Sbjct: 111 LPYRYDLNYCTFSYSMAFWDWDRWEKEIDWMALHGINIPLAVTGAEAVWYNVL 163
>gi|423280158|ref|ZP_17259071.1| hypothetical protein HMPREF1203_03288 [Bacteroides fragilis HMW
610]
gi|404584494|gb|EKA89159.1| hypothetical protein HMPREF1203_03288 [Bacteroides fragilis HMW
610]
Length = 718
Score = 103 bits (258), Expect = 4e-20, Method: Composition-based stats.
Identities = 49/113 (43%), Positives = 65/113 (57%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++VI G V + GL+WYL+Y+ G H+SW+ A +P + R +
Sbjct: 58 KVVIRGNNYVSIATGLNWYLKYYAGIHLSWNGMQAKLPAVLPPVTKKERRETT------- 110
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+P Y N T SY+ A+WDW RWEKEIDWMAL GIN+PLA TG E +W V
Sbjct: 111 LPYRYDLNYCTFSYSMAFWDWDRWEKEIDWMALHGINIPLAVTGAEAVWYNVL 163
>gi|255533666|ref|YP_003094038.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
gi|255346650|gb|ACU05976.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
Length = 735
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 88/163 (53%), Gaps = 17/163 (10%)
Query: 55 AAAYAVLQRLLPSHYSAFQFRIISKKQCGGE-YCFILRNHPSSYIRGTPEIVISGVTGVE 113
AA A L+R++ H + F+ I+ G + Y +N+ +IV+SG +
Sbjct: 29 GAAEAFLKRIVKDHAADFEISYIAAAANGNDRYELESKNN---------KIVLSGNNNIS 79
Query: 114 VLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVT 173
+ + L+ YLRY+ G ISW+ G + KL + P V P YY N T
Sbjct: 80 IASALNHYLRYYAGCLISWN---GSNLKLPAKLPAIP----VKVSKTSPYKYRYYLNYCT 132
Query: 174 SSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+Y+ +WWDW+RW+ EID+MAL GIN+PLA TGQ +W +V++
Sbjct: 133 FNYSMSWWDWQRWQWEIDFMALNGINMPLAITGQNAVWSRVYK 175
>gi|427385205|ref|ZP_18881710.1| hypothetical protein HMPREF9447_02743 [Bacteroides oleiciplenus YIT
12058]
gi|425727373|gb|EKU90233.1| hypothetical protein HMPREF9447_02743 [Bacteroides oleiciplenus YIT
12058]
Length = 719
Score = 103 bits (256), Expect = 6e-20, Method: Composition-based stats.
Identities = 46/114 (40%), Positives = 69/114 (60%), Gaps = 7/114 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++V+ G V + G++WYL+Y+ G H+SW+ ++P + R + +
Sbjct: 60 KVVVRGNNYVNIATGINWYLKYYAGIHLSWNGMQAQLPPALPPVSQKERHETS------- 112
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+ L Y N T SYT A+WDW+RWE+E+DWMAL GIN+PLA GQE IW+ + Q
Sbjct: 113 LSLRYDFNYCTYSYTMAFWDWERWERELDWMALHGINMPLAAVGQECIWRNMLQ 166
>gi|373953359|ref|ZP_09613319.1| alpha-N-acetylglucosaminidase [Mucilaginibacter paludis DSM 18603]
gi|373889959|gb|EHQ25856.1| alpha-N-acetylglucosaminidase [Mucilaginibacter paludis DSM 18603]
Length = 733
Score = 102 bits (255), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 55/159 (34%), Positives = 86/159 (54%), Gaps = 16/159 (10%)
Query: 57 AYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLA 116
+YA+++RL+P A F + + G CF + + +I++ G GV + +
Sbjct: 28 SYALIKRLIPQR--AQSFVVQTTGMSSGHDCFEIESK-------NGKIILRGNNGVAIAS 78
Query: 117 GLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSY 176
L++YL + ++W+ T A +P + + R + P YY N T +Y
Sbjct: 79 ALYYYLTEYGHCQVTWNGTNLKLPAKLPLVKNKIRKE-------TPYQYRYYLNYCTFNY 131
Query: 177 TFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+ +WWDW+RWEKEIDWMAL GIN+PLA TG+E W KV+
Sbjct: 132 SMSWWDWQRWEKEIDWMALHGINMPLAITGEEYTWYKVY 170
>gi|224537466|ref|ZP_03678005.1| hypothetical protein BACCELL_02345 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520904|gb|EEF90009.1| hypothetical protein BACCELL_02345 [Bacteroides cellulosilyticus
DSM 14838]
Length = 721
Score = 102 bits (255), Expect = 8e-20, Method: Composition-based stats.
Identities = 55/119 (46%), Positives = 75/119 (63%), Gaps = 10/119 (8%)
Query: 99 RGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPK-LGSFPRVQDAGV 157
+GT ++V+ G V + G++WYL+Y+ G +SW+ G+Q A +P+ L PR +
Sbjct: 59 KGT-KVVVRGNNYVNIATGINWYLKYYAGIQLSWN---GMQ-AKLPQVLPPVPRKERH-- 111
Query: 158 FVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+ + L Y N T SYT A+WDW+RWEKEIDWMAL GINLPLA GQE IW + Q
Sbjct: 112 --ETSLSLRYDFNYCTYSYTMAFWDWERWEKEIDWMALHGINLPLAAVGQECIWFNMLQ 168
>gi|295132875|ref|YP_003583551.1| hypothetical protein ZPR_1010 [Zunongwangia profunda SM-A87]
gi|294980890|gb|ADF51355.1| predicted protein [Zunongwangia profunda SM-A87]
Length = 750
Score = 102 bits (255), Expect = 9e-20, Method: Composition-based stats.
Identities = 58/174 (33%), Positives = 93/174 (53%), Gaps = 19/174 (10%)
Query: 43 QDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTP 102
++ ++AP + L ++L R++P H AF II G+ F +
Sbjct: 28 KETKQAPAA--LMPYQSLLNRVVPEHADAF---IIDSLPGAGDQAFEIAAQDG------- 75
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+I + G GV + + L++Y++ + I+W+ T S+P + + V + P
Sbjct: 76 KISLRGNNGVAIASALYYYIKEYAHGQITWNGTNLNFPESLP-------LPEQPVRKESP 128
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YY N T +Y+ +WWDW+RWEKEIDWMAL GIN+PLA TG+E IW +V++
Sbjct: 129 YEYRYYLNYCTFNYSMSWWDWERWEKEIDWMALHGINMPLAITGEEYIWDEVYK 182
>gi|212693694|ref|ZP_03301822.1| hypothetical protein BACDOR_03214 [Bacteroides dorei DSM 17855]
gi|265755881|ref|ZP_06090348.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_33FAA]
gi|212663753|gb|EEB24327.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides dorei DSM 17855]
gi|263233959|gb|EEZ19560.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_33FAA]
Length = 718
Score = 102 bits (255), Expect = 9e-20, Method: Composition-based stats.
Identities = 48/113 (42%), Positives = 67/113 (59%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+++I G V + GL+WYL+Y+ G H+SW+ A +P + R +
Sbjct: 58 KVIIRGNDYVNIATGLNWYLKYYAGIHLSWNGMTAKLPAVLPPVTKKERHET-------D 110
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+P Y N T SY+ A+WDW+RWEKEIDWMAL GINL LA TG E++W+ V
Sbjct: 111 LPYRYDLNYCTFSYSMAFWDWERWEKEIDWMALHGINLSLALTGTESVWRNVL 163
>gi|237708859|ref|ZP_04539340.1| glycoside hydrolase family 89 protein [Bacteroides sp. 9_1_42FAA]
gi|345513372|ref|ZP_08792893.1| glycoside hydrolase family 89 protein [Bacteroides dorei 5_1_36/D4]
gi|423228941|ref|ZP_17215347.1| hypothetical protein HMPREF1063_01167 [Bacteroides dorei
CL02T00C15]
gi|423242228|ref|ZP_17223337.1| hypothetical protein HMPREF1065_03960 [Bacteroides dorei
CL03T12C01]
gi|423247755|ref|ZP_17228803.1| hypothetical protein HMPREF1064_05009 [Bacteroides dorei
CL02T12C06]
gi|229457285|gb|EEO63006.1| glycoside hydrolase family 89 protein [Bacteroides sp. 9_1_42FAA]
gi|345456211|gb|EEO47557.2| glycoside hydrolase family 89 protein [Bacteroides dorei 5_1_36/D4]
gi|392631297|gb|EIY25272.1| hypothetical protein HMPREF1064_05009 [Bacteroides dorei
CL02T12C06]
gi|392635177|gb|EIY29082.1| hypothetical protein HMPREF1063_01167 [Bacteroides dorei
CL02T00C15]
gi|392639514|gb|EIY33330.1| hypothetical protein HMPREF1065_03960 [Bacteroides dorei
CL03T12C01]
Length = 717
Score = 102 bits (255), Expect = 9e-20, Method: Composition-based stats.
Identities = 48/113 (42%), Positives = 67/113 (59%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+++I G V + GL+WYL+Y+ G H+SW+ A +P + R +
Sbjct: 57 KVIIRGNDYVNIATGLNWYLKYYAGIHLSWNGMTAKLPAVLPPVTKKERHET-------D 109
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+P Y N T SY+ A+WDW+RWEKEIDWMAL GINL LA TG E++W+ V
Sbjct: 110 LPYRYDLNYCTFSYSMAFWDWERWEKEIDWMALHGINLSLALTGTESVWRNVL 162
>gi|237717696|ref|ZP_04548177.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|229453015|gb|EEO58806.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
Length = 729
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 57/164 (34%), Positives = 85/164 (51%), Gaps = 16/164 (9%)
Query: 51 SVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVT 110
+ + + QRL P S+F FR+++ F +++ + +I+ISG
Sbjct: 23 DTNITTMHKMTQRLFPQQASSFDFRLLNDTSAD---TFTIKSEGN-------KIIISGNN 72
Query: 111 GVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQN 170
+ GL+ YL+ +C + ISW K ++ +PK + P + V VK +P ++ N
Sbjct: 73 ANSMAVGLNHYLKNYCLTTISWYKDDPIE---LPK--TLPSIS-TEVTVKANVPTRFFLN 126
Query: 171 AVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
T Y+ WW W WE IDWMAL GIN+PLA TGQE IW KV
Sbjct: 127 YCTFGYSMTWWKWSDWEHFIDWMALNGINMPLAITGQEAIWYKV 170
>gi|299149196|ref|ZP_07042257.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
gi|298512863|gb|EFI36751.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
Length = 738
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 57/164 (34%), Positives = 85/164 (51%), Gaps = 16/164 (9%)
Query: 51 SVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVT 110
+ + + QRL P S+F FR+++ F +++ + +I+ISG
Sbjct: 32 DTNITTMHKMTQRLFPQQASSFDFRLLNDTSAD---TFTIKSEGN-------KIIISGNN 81
Query: 111 GVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQN 170
+ GL+ YL+ +C + ISW K ++ +PK + P + V VK +P ++ N
Sbjct: 82 ANSMAVGLNHYLKNYCLTTISWYKDDPIE---LPK--TLPSIS-TEVTVKANVPTRFFLN 135
Query: 171 AVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
T Y+ WW W WE IDWMAL GIN+PLA TGQE IW KV
Sbjct: 136 YCTFGYSMTWWKWSDWEHFIDWMALNGINMPLAITGQEAIWYKV 179
>gi|319643377|ref|ZP_07998003.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_40A]
gi|317385006|gb|EFV65959.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_40A]
Length = 718
Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats.
Identities = 48/113 (42%), Positives = 67/113 (59%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+++I G V + GL+WYL+Y+ G H+SW+ A +P + R +
Sbjct: 58 KVIIRGNDYVNIATGLNWYLKYYAGIHLSWNGMTAKLPAVLPPVTKKERHET-------D 110
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+P Y N T SY+ A+WDW+RWEKEIDWMAL GINL LA TG E++W+ V
Sbjct: 111 LPYRYDLNYCTFSYSMAFWDWERWEKEIDWMALHGINLSLALTGTESVWRNVL 163
>gi|150004413|ref|YP_001299157.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
gi|149932837|gb|ABR39535.1| glycoside hydrolase family 89 [Bacteroides vulgatus ATCC 8482]
Length = 717
Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats.
Identities = 48/113 (42%), Positives = 67/113 (59%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+++I G V + GL+WYL+Y+ G H+SW+ A +P + R +
Sbjct: 57 KVIIRGNDYVNIATGLNWYLKYYAGIHLSWNGMTAKLPAVLPPVTKKERHET-------D 109
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+P Y N T SY+ A+WDW+RWEKEIDWMAL GINL LA TG E++W+ V
Sbjct: 110 LPYRYDLNYCTFSYSMAFWDWERWEKEIDWMALHGINLSLALTGTESVWRNVL 162
>gi|345519733|ref|ZP_08799147.1| glycoside hydrolase family 89 [Bacteroides sp. 4_3_47FAA]
gi|345457107|gb|EET15964.2| glycoside hydrolase family 89 [Bacteroides sp. 4_3_47FAA]
Length = 717
Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats.
Identities = 48/113 (42%), Positives = 67/113 (59%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+++I G V + GL+WYL+Y+ G H+SW+ A +P + R +
Sbjct: 57 KVIIRGNDYVNIATGLNWYLKYYAGIHLSWNGMTAKLPAVLPPVTKKERHET-------D 109
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+P Y N T SY+ A+WDW+RWEKEIDWMAL GINL LA TG E++W+ V
Sbjct: 110 LPYRYDLNYCTFSYSMAFWDWERWEKEIDWMALHGINLSLALTGTESVWRNVL 162
>gi|293369246|ref|ZP_06615836.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
gi|292635671|gb|EFF54173.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
Length = 521
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 57/165 (34%), Positives = 86/165 (52%), Gaps = 16/165 (9%)
Query: 51 SVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVT 110
+ + + QRL P S+F FR+++ F +++ + +I+ISG
Sbjct: 23 DTNITTMHKMTQRLFPQQASSFDFRLLNDTSAD---TFTIKSEGN-------KIIISGNN 72
Query: 111 GVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQN 170
+ GL+ YL+ +C + ISW K ++ +PK + P + V VK +P ++ N
Sbjct: 73 ANSMAVGLNHYLKNYCLTTISWYKDDPIE---LPK--TLPSIS-TEVTVKANVPTRFFLN 126
Query: 171 AVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
T Y+ WW W WE IDWMAL GIN+PLA TGQE IW KV+
Sbjct: 127 YCTFGYSMTWWKWSDWEHFIDWMALNGINMPLAITGQEAIWYKVW 171
>gi|157134500|ref|XP_001656341.1| alpha-n-acetylglucosaminidase [Aedes aegypti]
gi|108881379|gb|EAT45604.1| AAEL003150-PA [Aedes aegypti]
Length = 763
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 66/206 (32%), Positives = 110/206 (53%), Gaps = 27/206 (13%)
Query: 10 LILFFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHY 69
++L F +++++ S+ I ++++ +D+ S+Q AA ++ R+LP
Sbjct: 8 MLLVFAYSVSIAEADPYHSAHI-LKHVKSNVDL--------SIQKQAAQELIGRILPER- 57
Query: 70 SAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSH 129
A QF + + IL+ + S+ + I+G GV G ++YL+Y+CG H
Sbjct: 58 -AHQFEVELNGRLKLNSFKILKTNDST------SVSITGSNGVAAAKGFYYYLKYYCGCH 110
Query: 130 ISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKE 189
+SWD G Q+ ++P P V V ++ P + YYQN T SY+F+WW WK W +
Sbjct: 111 VSWD---GDQL-NLPD--DLPEVD---VEIQAPSSIVYYQNVCTWSYSFSWWTWKEWRRH 161
Query: 190 IDWMALQGINLPLAFTGQETIWQKVF 215
IDWMA+QGI L LA QE +W +++
Sbjct: 162 IDWMAMQGITLSLA-PFQEDLWAELY 186
>gi|256422141|ref|YP_003122794.1| alpha-N-acetylglucosaminidase [Chitinophaga pinensis DSM 2588]
gi|256037049|gb|ACU60593.1| Alpha-N-acetylglucosaminidase [Chitinophaga pinensis DSM 2588]
Length = 728
Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats.
Identities = 61/170 (35%), Positives = 94/170 (55%), Gaps = 18/170 (10%)
Query: 49 PPSVQL--AAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVI 106
P QL AA A L+R++ +AF + G+ F + ++ S I++
Sbjct: 18 PAFAQLDKAATEAFLKRVVKDRATAFTCEYLPADN--GKDVFEIESNGS-------RIML 68
Query: 107 SGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLN 166
G GV V + L++YLR +C S I+W+ T +P + P V++ V P
Sbjct: 69 RGNNGVSVASALNYYLRNYCNSIITWNGTN----LHLPAV--LPVVKEKEHHVT-PYRYR 121
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YY N T Y+ +WW+W+RW++EIDWMA+ GIN+PLA TG+E IWQ+V++
Sbjct: 122 YYINYCTFQYSMSWWNWERWQQEIDWMAMNGINMPLALTGEEAIWQEVYK 171
>gi|440799252|gb|ELR20307.1| AlphaN-acetylglucosaminidase, putative, partial [Acanthamoeba
castellanii str. Neff]
Length = 389
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/191 (33%), Positives = 96/191 (50%), Gaps = 23/191 (12%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHY-SAFQFRIISKKQ----CGGEYCFILRNHPSSYIR 99
+ PP VQ+ A ++ RLL + Y FQ I Q ++ F + ++ +
Sbjct: 53 KSTTPPEVQVQAVQGLIGRLLGASYVPRFQLSISPTPQEDESSTSDHDFYTLS--TTTVG 110
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G ++ I G +GV + +GLHWYL+Y C +SW G + L S + + +
Sbjct: 111 GVTQVQIQGSSGVALASGLHWYLKYSCNCSVSWGVAGS---GNNLALPSSLPLLSKPLTM 167
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWM-------------ALQGINLPLAFTG 206
+ P+ YY N T SY+ WW W RWE+EIDWM AL GI+LPL+ TG
Sbjct: 168 QSPVKYRYYLNTCTHSYSSVWWQWDRWEQEIDWMARNHGLLTTHRLLALHGISLPLSSTG 227
Query: 207 QETIWQKVFQV 217
QE+I+ +VF+
Sbjct: 228 QESIFAEVFKA 238
>gi|375144105|ref|YP_005006546.1| alpha-N-acetylglucosaminidase [Niastella koreensis GR20-10]
gi|361058151|gb|AEV97142.1| Alpha-N-acetylglucosaminidase [Niastella koreensis GR20-10]
Length = 735
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 55/161 (34%), Positives = 86/161 (53%), Gaps = 16/161 (9%)
Query: 56 AAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVL 115
A+Y +++R++P + ++F + + G+ F + N S IV+ G G +
Sbjct: 33 ASYGLIKRVVPRYATSFIVEDLPAE--NGKDVFEVENRGKS-------IVLRGNNGTSIA 83
Query: 116 AGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSS 175
L+ YL +C I+W+ T AS+P V + P YY N T +
Sbjct: 84 CALYHYLTEYCHCQITWNGTNLQLPASLP-------VVKEKIHKVTPYTYRYYLNYCTFN 136
Query: 176 YTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y+ +WWDW+RW+KEIDWMAL GIN+PLA TG+E W +V++
Sbjct: 137 YSMSWWDWERWQKEIDWMALHGINMPLAITGEEYTWYEVYK 177
>gi|329962235|ref|ZP_08300241.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
gi|328530343|gb|EGF57220.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
Length = 726
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/146 (38%), Positives = 78/146 (53%), Gaps = 13/146 (8%)
Query: 76 IISKKQCGGEYCFILRNHPSSYI------RGTPEIVISGVTGVEVLAGLHWYLRYWCGSH 129
+I + G FI+ PSS I + +IVI G V + AGL+WYL+Y+ G H
Sbjct: 25 LIERVAPGSSGKFIIEQKPSSGIDFFELDQKGEKIVIRGNNPVSIAAGLNWYLKYYAGIH 84
Query: 130 ISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKE 189
+SW+K +P + R + A V+ Y N T SYT A+WDW RWE+E
Sbjct: 85 LSWNKMTAELPGVLPPVTRKERHETASVY-------RYDLNYCTFSYTMAFWDWARWEQE 137
Query: 190 IDWMALQGINLPLAFTGQETIWQKVF 215
+DWMAL GINL LA G + +W+ +
Sbjct: 138 LDWMALHGINLSLALVGTDAVWRNML 163
>gi|291515668|emb|CBK64878.1| Alpha-N-acetylglucosaminidase (NAGLU) [Alistipes shahii WAL 8301]
Length = 713
Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats.
Identities = 58/161 (36%), Positives = 88/161 (54%), Gaps = 17/161 (10%)
Query: 55 AAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEV 114
AA A+ +R++P H F+F ++ CF L + RG+ +IV+ G + +
Sbjct: 11 AAVRALARRIIPEHEKQFRFERLTDTT----ECFELES------RGS-KIVVRGNSSASI 59
Query: 115 LAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTS 174
GL+ YL+ +C + +SW V+ MP + PR+ D V V+ ++ N T
Sbjct: 60 AVGLNHYLKNYCLTTVSWHACNPVE---MPAV--LPRI-DEKVRVESRARERFFLNYCTF 113
Query: 175 SYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YT WW W WE+ IDWMALQG+ +PLA TGQE +WQ+V+
Sbjct: 114 GYTMPWWGWTEWERFIDWMALQGVTMPLAITGQEAVWQRVW 154
>gi|423312588|ref|ZP_17290525.1| hypothetical protein HMPREF1058_01137 [Bacteroides vulgatus
CL09T03C04]
gi|392688276|gb|EIY81565.1| hypothetical protein HMPREF1058_01137 [Bacteroides vulgatus
CL09T03C04]
Length = 717
Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats.
Identities = 47/113 (41%), Positives = 66/113 (58%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+++I G V + GL+WYL+Y+ G H+SW+ A +P + R +
Sbjct: 57 KVIIRGNDYVNIATGLNWYLKYYAGIHLSWNGMTAKLPAVLPPVTKKERHET-------D 109
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+P Y N T SY+ +WDW+RWEKEIDWMAL GINL LA TG E++W+ V
Sbjct: 110 LPYRYDLNYCTFSYSMTFWDWERWEKEIDWMALHGINLSLALTGTESVWRNVL 162
>gi|336417192|ref|ZP_08597519.1| hypothetical protein HMPREF1017_04627 [Bacteroides ovatus
3_8_47FAA]
gi|423297818|ref|ZP_17275878.1| hypothetical protein HMPREF1070_04543 [Bacteroides ovatus
CL03T12C18]
gi|335936512|gb|EGM98438.1| hypothetical protein HMPREF1017_04627 [Bacteroides ovatus
3_8_47FAA]
gi|392664455|gb|EIY57993.1| hypothetical protein HMPREF1070_04543 [Bacteroides ovatus
CL03T12C18]
Length = 727
Score = 100 bits (250), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 57/164 (34%), Positives = 84/164 (51%), Gaps = 16/164 (9%)
Query: 51 SVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVT 110
+ + QRL P S F FR+++ F +++ + +I+ISG
Sbjct: 21 DTNITTMRKMTQRLFPQQASFFDFRLLNDTSTDS---FTIKSEGN-------KIIISGNN 70
Query: 111 GVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQN 170
+ GL+ YL+ +C + ISW K ++ +PK + P + A V +K +P ++ N
Sbjct: 71 ANSMAVGLNHYLKNYCLTTISWYKDDPIE---LPK--TLPNIP-AEVTIKANVPTRFFLN 124
Query: 171 AVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
T Y+ WW W WE IDWMAL GIN+PLA TGQE IW KV
Sbjct: 125 YCTFGYSMTWWKWSDWEHFIDWMALNGINMPLAITGQEAIWYKV 168
>gi|423226735|ref|ZP_17213200.1| hypothetical protein HMPREF1062_05386 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392627008|gb|EIY21049.1| hypothetical protein HMPREF1062_05386 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 718
Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats.
Identities = 54/118 (45%), Positives = 74/118 (62%), Gaps = 8/118 (6%)
Query: 99 RGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVF 158
+GT ++V+ G V + G++WYL+Y+ G +SW+ G+Q A +P++ P V
Sbjct: 56 KGT-KVVVRGNNYVNIATGINWYLKYYAGIQLSWN---GMQ-AKLPQV--LPPVSRKERH 108
Query: 159 VKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+ + L Y N T SYT A+WDW+RWEKEIDWMAL GINLPLA GQE IW + Q
Sbjct: 109 -ETSLSLRYDFNYCTYSYTMAFWDWERWEKEIDWMALHGINLPLAAVGQECIWFNMLQ 165
>gi|189465172|ref|ZP_03013957.1| hypothetical protein BACINT_01517 [Bacteroides intestinalis DSM
17393]
gi|189437446|gb|EDV06431.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides intestinalis DSM
17393]
Length = 723
Score = 100 bits (249), Expect = 5e-19, Method: Composition-based stats.
Identities = 53/114 (46%), Positives = 71/114 (62%), Gaps = 7/114 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++V+ G V + G++WYL+Y+ G +SW+ G+Q AS+P + P V +
Sbjct: 62 KVVVRGNNYVNIATGINWYLKYYAGIQLSWN---GMQ-ASLPVV--LPPVTRKERH-ETS 114
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+ L Y N T SYT A+WDW+RWEKEIDWMAL GINLPLA GQE IW + Q
Sbjct: 115 LSLRYDFNYCTYSYTMAFWDWERWEKEIDWMALHGINLPLAAVGQECIWFNMLQ 168
>gi|281423203|ref|ZP_06254116.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
gi|281402539|gb|EFB33370.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
Length = 450
Score = 100 bits (248), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 49/112 (43%), Positives = 67/112 (59%), Gaps = 7/112 (6%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
IVI G T V + GL+WYL+Y+ G H++W+ AS+P++ S R +
Sbjct: 57 IVIKGNTWVNIAVGLNWYLKYYAGIHLTWNNMNTHLPASLPRVTSPERHA-------TDL 109
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
L Y N T SY+ A+WDWKRW+ EIDWMAL G+NLPLA G+E W+ +
Sbjct: 110 KLRYDFNYCTFSYSMAFWDWKRWQTEIDWMALHGVNLPLAIVGEEVAWRNML 161
>gi|391338146|ref|XP_003743422.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Metaseiulus
occidentalis]
Length = 665
Score = 100 bits (248), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 56/164 (34%), Positives = 84/164 (51%), Gaps = 16/164 (9%)
Query: 59 AVLQRLLPSHYSAFQFRIIS--KKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLA 116
+L+R++ F+ R+ K G+ F + + + +I I+G G
Sbjct: 19 GLLERVVEEGSPIFELRVDPSLKSVLDGKEGFRVEGN-------STKIFITGTVGYAAAN 71
Query: 117 GLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSY 176
GL++Y+R G SW +P P Q A + V P + YY+N T+SY
Sbjct: 72 GLNFYIRKLLGGQFSWSGK------RIPLPEKMPAPQRA-LLVTLPDQVRYYENVCTASY 124
Query: 177 TFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVAFC 220
+F WW W RW++EIDWMA+ GINLPLAF+GQE + +VF+ C
Sbjct: 125 SFVWWQWPRWQREIDWMAMNGINLPLAFSGQEIVAAEVFKTFGC 168
>gi|29349767|ref|NP_813270.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
VPI-5482]
gi|29341678|gb|AAO79464.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
thetaiotaomicron VPI-5482]
Length = 744
Score = 99.8 bits (247), Expect = 6e-19, Method: Composition-based stats.
Identities = 45/116 (38%), Positives = 71/116 (61%), Gaps = 11/116 (9%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFP--RVQDAGVFVK 160
+++I+G + + + GL+WYL+Y G H+SW+ +P++ P +++ A
Sbjct: 75 KVLITGNSDLSLATGLNWYLKYVAGIHLSWNNPS----QKLPEVLPLPQKKIRQATAMKN 130
Query: 161 RPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
R YY N T SY+ A+WDW+RWEKEIDWMA+ GIN+PL+ TG E +W + +
Sbjct: 131 R-----YYLNYCTYSYSMAFWDWERWEKEIDWMAMHGINMPLSITGMEVVWYNLLK 181
>gi|383120707|ref|ZP_09941431.1| hypothetical protein BSIG_2292 [Bacteroides sp. 1_1_6]
gi|382984934|gb|EES68331.2| hypothetical protein BSIG_2292 [Bacteroides sp. 1_1_6]
Length = 736
Score = 99.8 bits (247), Expect = 7e-19, Method: Composition-based stats.
Identities = 45/116 (38%), Positives = 71/116 (61%), Gaps = 11/116 (9%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFP--RVQDAGVFVK 160
+++I+G + + + GL+WYL+Y G H+SW+ +P++ P +++ A
Sbjct: 67 KVLITGNSDLSLATGLNWYLKYVAGIHLSWNNPS----QKLPEVLPLPQKKIRQATAMKN 122
Query: 161 RPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
R YY N T SY+ A+WDW+RWEKEIDWMA+ GIN+PL+ TG E +W + +
Sbjct: 123 R-----YYLNYCTYSYSMAFWDWERWEKEIDWMAMHGINMPLSITGMEVVWYNLLK 173
>gi|449491231|ref|XP_004174728.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase,
partial [Taeniopygia guttata]
Length = 752
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 50/99 (50%), Positives = 64/99 (64%), Gaps = 7/99 (7%)
Query: 118 LHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYT 177
L+ YLR +CG H+SW G Q+ +P PR++ A + P YYQN SY+
Sbjct: 96 LYRYLRDFCGCHLSWS---GAQL-RLPD--PLPRLR-AEIRAAAPGRYRYYQNVCAQSYS 148
Query: 178 FAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
FAWWDW RWE+EIDWMAL GINL AF GQE +WQ+V++
Sbjct: 149 FAWWDWARWEREIDWMALSGINLAPAFAGQEAVWQRVYR 187
>gi|255533285|ref|YP_003093657.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
gi|255346269|gb|ACU05595.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
Length = 749
Score = 99.8 bits (247), Expect = 8e-19, Method: Composition-based stats.
Identities = 62/163 (38%), Positives = 95/163 (58%), Gaps = 20/163 (12%)
Query: 56 AAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVL 115
A+Y +++R+LP+H A +F I G+ F L + RG +IV+ G TG+ V
Sbjct: 38 ASYDLIKRILPNH--ADRFVIEYLPAANGKDIFELES------RGN-QIVLRGNTGISVA 88
Query: 116 AGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNY--YQNAVT 173
+ L+++L+ + ISW+ T ++PK FP V V++ P Y Y N T
Sbjct: 89 SALNYWLKNYAHCDISWNGTN----LNIPK--PFPMVPGK---VRKVTPHEYRHYFNYCT 139
Query: 174 SSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+YT +WWDW+RWE EID+MAL G+N+PLA TGQ +W +V++
Sbjct: 140 FNYTSSWWDWQRWEWEIDFMALNGVNMPLAMTGQNALWDRVYR 182
>gi|393785791|ref|ZP_10373937.1| hypothetical protein HMPREF1068_00217 [Bacteroides nordii
CL02T12C05]
gi|392661410|gb|EIY54996.1| hypothetical protein HMPREF1068_00217 [Bacteroides nordii
CL02T12C05]
Length = 727
Score = 98.6 bits (244), Expect = 1e-18, Method: Composition-based stats.
Identities = 61/164 (37%), Positives = 88/164 (53%), Gaps = 16/164 (9%)
Query: 54 LAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVE 113
+A ++++RL+P++ FQF+ K G+ F L S +IVI G
Sbjct: 26 VATMQSMVKRLIPAYAENFQFK--KTKPVDGKDYFQLDTSDSG------KIVIGGNNANS 77
Query: 114 VLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFP-RVQDAGVFVKRPIPLNYYQNAV 172
+ GL++YL+Y+C + ISW V+ MP+ P + A V R LNY
Sbjct: 78 MAMGLNYYLKYYCLTTISWYADIPVE---MPETLPVPEKTFSAEAKVDRRFFLNY----C 130
Query: 173 TSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
T YT +W WK WE+ IDWMAL G+N+PLA TGQE +W KV++
Sbjct: 131 TYGYTMPYWQWKDWERFIDWMALNGVNMPLAITGQEAVWYKVWK 174
>gi|294775488|ref|ZP_06741000.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
gi|294450633|gb|EFG19121.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
Length = 712
Score = 98.6 bits (244), Expect = 1e-18, Method: Composition-based stats.
Identities = 48/114 (42%), Positives = 66/114 (57%), Gaps = 9/114 (7%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWD-KTGGVQVASMPKLGSFPRVQDAGVFVKR 161
+++I G V + GL+WYL+Y+ G H+SW+ T + +P D
Sbjct: 52 KVIIRGNDYVNIATGLNWYLKYYAGIHLSWNGMTAKLPAVLLPVTKKERHETD------- 104
Query: 162 PIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+P Y N T SY+ A+WDW+RWEKEIDWMAL GINL LA TG E++W+ V
Sbjct: 105 -LPYRYDLNYCTFSYSMAFWDWERWEKEIDWMALHGINLSLALTGTESVWRNVL 157
>gi|299148671|ref|ZP_07041733.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
3_1_23]
gi|383114572|ref|ZP_09935334.1| hypothetical protein BSGG_1257 [Bacteroides sp. D2]
gi|298513432|gb|EFI37319.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
3_1_23]
gi|313693722|gb|EFS30557.1| hypothetical protein BSGG_1257 [Bacteroides sp. D2]
Length = 711
Score = 98.6 bits (244), Expect = 2e-18, Method: Composition-based stats.
Identities = 43/114 (37%), Positives = 71/114 (62%), Gaps = 7/114 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+++I+G + + + GL+WYL+Y G H+SW+ +P++ P+ + + +
Sbjct: 66 KVLITGNSDLSLATGLNWYLKYVAGIHLSWNNLS----QKLPEILPLPQEK---IRKETS 118
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+ YY N T SY+ A+WDW+RWEKEIDWMA+ GIN+PL+ TG E +W + +
Sbjct: 119 MQNRYYLNYCTYSYSMAFWDWERWEKEIDWMAMHGINMPLSITGMEVVWYNLLK 172
>gi|160887167|ref|ZP_02068170.1| hypothetical protein BACOVA_05183 [Bacteroides ovatus ATCC 8483]
gi|423295093|ref|ZP_17273220.1| hypothetical protein HMPREF1070_01885 [Bacteroides ovatus
CL03T12C18]
gi|156107578|gb|EDO09323.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
8483]
gi|392673999|gb|EIY67450.1| hypothetical protein HMPREF1070_01885 [Bacteroides ovatus
CL03T12C18]
Length = 711
Score = 98.2 bits (243), Expect = 2e-18, Method: Composition-based stats.
Identities = 43/114 (37%), Positives = 71/114 (62%), Gaps = 7/114 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+++I+G + + + GL+WYL+Y G H+SW+ +P++ P+ + + +
Sbjct: 66 KVLITGNSDLSLATGLNWYLKYVAGIHLSWNNLS----QKLPEILPLPQEK---IRKETS 118
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+ YY N T SY+ A+WDW+RWEKEIDWMA+ GIN+PL+ TG E +W + +
Sbjct: 119 MQNRYYLNYCTYSYSMAFWDWERWEKEIDWMAMHGINMPLSITGMEVVWYNLLK 172
>gi|212695333|ref|ZP_03303461.1| hypothetical protein BACDOR_04880 [Bacteroides dorei DSM 17855]
gi|212662112|gb|EEB22686.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides dorei DSM 17855]
Length = 754
Score = 98.2 bits (243), Expect = 2e-18, Method: Composition-based stats.
Identities = 55/138 (39%), Positives = 76/138 (55%), Gaps = 12/138 (8%)
Query: 83 GGEYCFILRNHPSS-----YIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGG 137
G F++ H SS R ++VI G T + + G++WYL+Y G H+SW+
Sbjct: 37 GASNKFVVELHKSSNDFFELDRKGDKVVIRGNTYINIATGINWYLKYHAGIHLSWNG--- 93
Query: 138 VQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQG 197
AS+P + P V + + L Y N T SY+ A+WDWKRWEKE+DWMAL G
Sbjct: 94 -MYASLPDV--LPPVLRKERH-ETNLALRYDFNYCTYSYSMAFWDWKRWEKELDWMALHG 149
Query: 198 INLPLAFTGQETIWQKVF 215
INLPLA G E +W+ +
Sbjct: 150 INLPLAAVGHECVWRNLL 167
>gi|345513909|ref|ZP_08793424.1| alpha-N-acetylglucosaminidase [Bacteroides dorei 5_1_36/D4]
gi|345456132|gb|EEO45798.2| alpha-N-acetylglucosaminidase [Bacteroides dorei 5_1_36/D4]
Length = 754
Score = 98.2 bits (243), Expect = 2e-18, Method: Composition-based stats.
Identities = 55/138 (39%), Positives = 76/138 (55%), Gaps = 12/138 (8%)
Query: 83 GGEYCFILRNHPSS-----YIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGG 137
G F++ H SS R ++VI G T + + G++WYL+Y G H+SW+
Sbjct: 37 GASNKFVVELHKSSNDFFELDRKGDKVVIRGNTYINIATGINWYLKYHAGIHLSWNG--- 93
Query: 138 VQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQG 197
AS+P + P V + + L Y N T SY+ A+WDWKRWEKE+DWMAL G
Sbjct: 94 -MYASLPDV--LPPVLRKERH-ETNLALRYDFNYCTYSYSMAFWDWKRWEKELDWMALHG 149
Query: 198 INLPLAFTGQETIWQKVF 215
INLPLA G E +W+ +
Sbjct: 150 INLPLAAVGHECVWRNLL 167
>gi|265753065|ref|ZP_06088634.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|263236251|gb|EEZ21746.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length = 750
Score = 98.2 bits (243), Expect = 2e-18, Method: Composition-based stats.
Identities = 56/138 (40%), Positives = 78/138 (56%), Gaps = 12/138 (8%)
Query: 83 GGEYCFILRNHPSS-----YIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGG 137
G F++ H SS R ++VI G T + + G++WYL+Y G H+SW+ G
Sbjct: 33 GASNKFVVELHKSSNDFFELDRKGDKVVIRGNTYINIATGINWYLKYHAGIHLSWN---G 89
Query: 138 VQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQG 197
+ AS+P + P V + + L Y N T SY+ A+WDWKRWEKE+DWMAL G
Sbjct: 90 MH-ASLPDV--LPPVLRKERH-ETNLALRYDFNYCTYSYSMAFWDWKRWEKELDWMALHG 145
Query: 198 INLPLAFTGQETIWQKVF 215
INLPLA G E +W+ +
Sbjct: 146 INLPLAAVGHECVWRNLL 163
>gi|423241433|ref|ZP_17222546.1| hypothetical protein HMPREF1065_03169 [Bacteroides dorei
CL03T12C01]
gi|392641326|gb|EIY35103.1| hypothetical protein HMPREF1065_03169 [Bacteroides dorei
CL03T12C01]
Length = 754
Score = 97.8 bits (242), Expect = 3e-18, Method: Composition-based stats.
Identities = 56/138 (40%), Positives = 78/138 (56%), Gaps = 12/138 (8%)
Query: 83 GGEYCFILRNHPSS-----YIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGG 137
G F++ H SS R ++VI G T + + G++WYL+Y G H+SW+ G
Sbjct: 37 GASNKFVVELHKSSNDFFELDRKGDKVVIRGNTYINIATGINWYLKYHAGIHLSWN---G 93
Query: 138 VQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQG 197
+ AS+P + P V + + L Y N T SY+ A+WDWKRWEKE+DWMAL G
Sbjct: 94 MH-ASLPDV--LPPVLRKERH-ETNLALRYDFNYCTYSYSMAFWDWKRWEKELDWMALHG 149
Query: 198 INLPLAFTGQETIWQKVF 215
INLPLA G E +W+ +
Sbjct: 150 INLPLAAVGHECVWRNLL 167
>gi|423346424|ref|ZP_17324112.1| hypothetical protein HMPREF1060_01784 [Parabacteroides merdae
CL03T12C32]
gi|409220242|gb|EKN13198.1| hypothetical protein HMPREF1060_01784 [Parabacteroides merdae
CL03T12C32]
Length = 718
Score = 97.8 bits (242), Expect = 3e-18, Method: Composition-based stats.
Identities = 51/113 (45%), Positives = 69/113 (61%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++VI G V + GL+WYL+Y+ G H+SW+ G+Q A +P++ P V+ +
Sbjct: 57 KVVIRGNNYVSIATGLNWYLKYYAGIHLSWN---GMQ-AELPEV--LPAVKQKERH-ETD 109
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+ Y N T SYT A+WDW RWEKEIDWMAL GINLPLA G + +W V
Sbjct: 110 MKYRYDFNYCTFSYTMAFWDWTRWEKEIDWMALHGINLPLAMVGTDGVWYNVL 162
>gi|237711645|ref|ZP_04542126.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 9_1_42FAA]
gi|229454340|gb|EEO60061.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 9_1_42FAA]
Length = 732
Score = 97.8 bits (242), Expect = 3e-18, Method: Composition-based stats.
Identities = 53/139 (38%), Positives = 76/139 (54%), Gaps = 10/139 (7%)
Query: 80 KQCGGEYCFILRNHPSSYI---RGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTG 136
K ++ L P+ + R ++VI G T + + G++WYL+Y G H+SW+
Sbjct: 14 KGASNKFVVELHKSPNDFFELDRKGDKVVIRGNTYINIATGINWYLKYHAGIHLSWNG-- 71
Query: 137 GVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQ 196
AS+P + P V + + L Y N T SY+ A+WDWKRWEKE+DWMAL
Sbjct: 72 --MYASLPDV--LPPVLRKERH-ETNLALRYDFNYCTYSYSMAFWDWKRWEKELDWMALH 126
Query: 197 GINLPLAFTGQETIWQKVF 215
GINLPLA G E +W+ +
Sbjct: 127 GINLPLAAVGHECVWRNLL 145
>gi|423230938|ref|ZP_17217342.1| hypothetical protein HMPREF1063_03162 [Bacteroides dorei
CL02T00C15]
gi|423244649|ref|ZP_17225724.1| hypothetical protein HMPREF1064_01930 [Bacteroides dorei
CL02T12C06]
gi|392630058|gb|EIY24060.1| hypothetical protein HMPREF1063_03162 [Bacteroides dorei
CL02T00C15]
gi|392641498|gb|EIY35274.1| hypothetical protein HMPREF1064_01930 [Bacteroides dorei
CL02T12C06]
Length = 754
Score = 97.8 bits (242), Expect = 3e-18, Method: Composition-based stats.
Identities = 53/139 (38%), Positives = 76/139 (54%), Gaps = 10/139 (7%)
Query: 80 KQCGGEYCFILRNHPSSYI---RGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTG 136
K ++ L P+ + R ++VI G T + + G++WYL+Y G H+SW+
Sbjct: 36 KGASNKFVVELHKSPNDFFELDRKGDKVVIRGNTYINIATGINWYLKYHAGIHLSWNG-- 93
Query: 137 GVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQ 196
AS+P + P V + + L Y N T SY+ A+WDWKRWEKE+DWMAL
Sbjct: 94 --MYASLPDV--LPPVLRKERH-ETNLALRYDFNYCTYSYSMAFWDWKRWEKELDWMALH 148
Query: 197 GINLPLAFTGQETIWQKVF 215
GINLPLA G E +W+ +
Sbjct: 149 GINLPLAAVGHECVWRNLL 167
>gi|410096483|ref|ZP_11291470.1| hypothetical protein HMPREF1076_00648 [Parabacteroides goldsteinii
CL02T12C30]
gi|409226447|gb|EKN19356.1| hypothetical protein HMPREF1076_00648 [Parabacteroides goldsteinii
CL02T12C30]
Length = 718
Score = 97.8 bits (242), Expect = 3e-18, Method: Composition-based stats.
Identities = 52/117 (44%), Positives = 67/117 (57%), Gaps = 13/117 (11%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKR- 161
++VI G V + GL+WYL+Y G H+SW+ G+Q P V A V +R
Sbjct: 57 KVVIRGNNPVNIAVGLNWYLKYHAGIHLSWN---GMQ-------AKLPEVLPAVVRKERH 106
Query: 162 --PIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+ Y N T SYT A+WDW+RWEKEIDWMAL GINLPLA G + +W V +
Sbjct: 107 ETDMKYRYDFNYCTYSYTMAFWDWERWEKEIDWMALHGINLPLAMVGTDGVWYNVLK 163
>gi|255533286|ref|YP_003093658.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
gi|255346270|gb|ACU05596.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
Length = 734
Score = 97.8 bits (242), Expect = 3e-18, Method: Composition-based stats.
Identities = 57/163 (34%), Positives = 91/163 (55%), Gaps = 20/163 (12%)
Query: 56 AAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVL 115
AAY +++R+LP++ F+ + K+ F L + +IV+ G GV V
Sbjct: 22 AAYELIKRILPAYAHKFEVAYVPKEN--DSDVFELESKAG-------KIVLRGNNGVAVA 72
Query: 116 AGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNY--YQNAVT 173
+ L+++L+ + I+W+ T ++PK FP V +++ P Y Y N T
Sbjct: 73 SALNYWLKNYAHCEITWNGTN----LNIPK--PFPMVSKK---IRKVTPYEYRHYFNYCT 123
Query: 174 SSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+YT WWDW+RW+ EID+MAL G+N+PLA TGQ +IW KV++
Sbjct: 124 FNYTATWWDWERWQWEIDFMALNGVNMPLALTGQNSIWDKVYR 166
>gi|397485721|ref|XP_003813989.1| PREDICTED: alpha-N-acetylglucosaminidase [Pan paniscus]
Length = 682
Score = 97.4 bits (241), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 48/97 (49%), Positives = 60/97 (61%), Gaps = 7/97 (7%)
Query: 119 HWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTF 178
H YLR +CG H++W G Q+ L + P + P YYQN T SY+F
Sbjct: 29 HRYLRDFCGCHVAWS---GSQLRLPRPLPAVP----GELTEATPNRYRYYQNVCTQSYSF 81
Query: 179 AWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 82 VWWDWARWEREIDWMALNGINLALAWSGQEAIWQRVY 118
>gi|423217398|ref|ZP_17203894.1| hypothetical protein HMPREF1061_00667 [Bacteroides caccae
CL03T12C61]
gi|392628557|gb|EIY22583.1| hypothetical protein HMPREF1061_00667 [Bacteroides caccae
CL03T12C61]
Length = 707
Score = 97.4 bits (241), Expect = 3e-18, Method: Composition-based stats.
Identities = 45/116 (38%), Positives = 69/116 (59%), Gaps = 11/116 (9%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFP--RVQDAGVFVK 160
+++I G + + + GL+WYL+Y G H+SW+ +P++ P +++
Sbjct: 66 KVLIVGNSDLSLATGLNWYLKYVAGIHLSWNNPS----QKLPEVLPLPTGKIRQETAMQN 121
Query: 161 RPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
R YY N T SY+ A+WDW+RWEKEIDWMAL GIN+PL+ TG E +W + +
Sbjct: 122 R-----YYLNYCTYSYSMAFWDWERWEKEIDWMALHGINMPLSITGMEVVWYNLLK 172
>gi|153807690|ref|ZP_01960358.1| hypothetical protein BACCAC_01972 [Bacteroides caccae ATCC 43185]
gi|149130052|gb|EDM21264.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
43185]
Length = 707
Score = 97.4 bits (241), Expect = 4e-18, Method: Composition-based stats.
Identities = 45/116 (38%), Positives = 69/116 (59%), Gaps = 11/116 (9%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFP--RVQDAGVFVK 160
+++I G + + + GL+WYL+Y G H+SW+ +P++ P +++
Sbjct: 66 KVLIVGNSDLSLATGLNWYLKYVAGIHLSWNNPS----QKLPEVLPLPTGKIRQETAMQN 121
Query: 161 RPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
R YY N T SY+ A+WDW+RWEKEIDWMAL GIN+PL+ TG E +W + +
Sbjct: 122 R-----YYLNYCTYSYSMAFWDWERWEKEIDWMALHGINMPLSITGMEVVWYNLLK 172
>gi|404487206|ref|ZP_11022393.1| hypothetical protein HMPREF9448_02854 [Barnesiella intestinihominis
YIT 11860]
gi|404335702|gb|EJZ62171.1| hypothetical protein HMPREF9448_02854 [Barnesiella intestinihominis
YIT 11860]
Length = 731
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 63/171 (36%), Positives = 91/171 (53%), Gaps = 20/171 (11%)
Query: 47 RAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVI 106
R P AA A +R++PSH F F ++ ++ F LR + +I I
Sbjct: 21 RTPEYTDSAARLA--ERIVPSHSDNFIFTLVPAEKD----FFELRQNGD-------KIEI 67
Query: 107 SGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLN 166
G G+ + GL+ YLR++C + +SW G +AS+P + P V + V ++ PL
Sbjct: 68 QGNNGISMARGLNHYLRHYCHASVSW---CGDNLASIPD--TLPAVGEP-VHIEASQPLR 121
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPL-AFTGQETIWQKVFQ 216
YY N T SY+ A+W W+ WEKEID MALQG+NLPL A Q +WQ +
Sbjct: 122 YYLNYCTYSYSMAFWGWEEWEKEIDRMALQGVNLPLMAVNSQYAVWQNTLK 172
>gi|330791218|ref|XP_003283691.1| hypothetical protein DICPUDRAFT_26247 [Dictyostelium purpureum]
gi|325086434|gb|EGC39824.1| hypothetical protein DICPUDRAFT_26247 [Dictyostelium purpureum]
Length = 712
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 58/148 (39%), Positives = 82/148 (55%), Gaps = 11/148 (7%)
Query: 72 FQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIV--ISGVTGVEVLAGLHWYLRYWCGSH 129
F+ +I + + G+Y +I + +S + + ++ I+ GV + GL YL+Y+C
Sbjct: 5 FELKISNTQYSKGQYYYISTDVITSQLSNSSTVIVHITADNGVNLAMGLQHYLKYYCQCS 64
Query: 130 ISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLN--YYQNAVTSSYTFAWWDWKRWE 187
SW+ G Q A + SF ++ V P+ N YY N T Y+ WW+W RWE
Sbjct: 65 YSWN---GDQCA----IQSFDQLPPVPEPVLVPVVSNFRYYMNVCTFGYSTVWWNWSRWE 117
Query: 188 KEIDWMALQGINLPLAFTGQETIWQKVF 215
+EIDWMAL G NLPLAF GQE IW KVF
Sbjct: 118 REIDWMALNGYNLPLAFVGQEYIWYKVF 145
>gi|405964692|gb|EKC30145.1| Alpha-N-acetylglucosaminidase [Crassostrea gigas]
Length = 859
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/171 (34%), Positives = 89/171 (52%), Gaps = 13/171 (7%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEI 104
+ +A Q AA + QR+L F +I S + F++ N Y+
Sbjct: 35 KPKASLGKQEDAAKDLYQRVLGPRAGEFVVKIQSNIGPKEKDTFLI-NTKEKYVE----- 88
Query: 105 VISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIP 164
I+G TGV GL++YL +C I+W GG Q+A L P+V+ V +
Sbjct: 89 -ITGTTGVAAAMGLYYYLTNYCNCQITW---GGRQMAIPSPL---PKVEGGSVNITTNDK 141
Query: 165 LNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+ ++QN T SY+ W+ W WE +DWMA++GIN+ LAFTGQE I+Q+V+
Sbjct: 142 IRFFQNVCTVSYSSVWFKWADWEAYLDWMAMRGINMALAFTGQEAIFQRVY 192
>gi|194216885|ref|XP_001917396.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
[Equus caballus]
Length = 744
Score = 97.1 bits (240), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 52/115 (45%), Positives = 63/115 (54%), Gaps = 7/115 (6%)
Query: 101 TPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVK 160
P V S T G YLR +C H++W G Q+ L + P V +
Sbjct: 73 APVAVCSAQTAXARSGGCTAYLRDFCCFHVAWS---GXQLRLPQPLPAVPEVLTETM--- 126
Query: 161 RPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P YYQN T SY+F WWDW RWE+EIDWMAL GINL LA+ GQE IWQ+V+
Sbjct: 127 -PNRYRYYQNVCTHSYSFVWWDWARWEREIDWMALNGINLALAWNGQEAIWQRVY 180
>gi|374385779|ref|ZP_09643282.1| hypothetical protein HMPREF9449_01668 [Odoribacter laneus YIT
12061]
gi|373225481|gb|EHP47815.1| hypothetical protein HMPREF9449_01668 [Odoribacter laneus YIT
12061]
Length = 715
Score = 97.1 bits (240), Expect = 5e-18, Method: Composition-based stats.
Identities = 57/168 (33%), Positives = 89/168 (52%), Gaps = 22/168 (13%)
Query: 51 SVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVT 110
S + + Y +L+R+ P F+ + GE F + ++V+ G
Sbjct: 15 SAKSSPVYELLERIDPGASQKFKIELSE-----GEPDFFELDQAGE------KVVVRGNN 63
Query: 111 GVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQ- 169
V + G++WYL+Y+ G H+SW+ G+Q A +P + P V++ +R L Y
Sbjct: 64 YVSIATGINWYLKYYAGIHLSWN---GMQ-AKLPAV--LPPVKEKE---RREASLKYRYD 114
Query: 170 -NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
N T SY+ A+WDW+RWE+E+DWMA+ GIN+ LA TG E +W V Q
Sbjct: 115 FNYCTYSYSMAFWDWERWERELDWMAMHGINMALALTGMEVVWHNVLQ 162
>gi|373460171|ref|ZP_09551927.1| hypothetical protein HMPREF9944_00191 [Prevotella maculosa OT 289]
gi|371956556|gb|EHO74342.1| hypothetical protein HMPREF9944_00191 [Prevotella maculosa OT 289]
Length = 742
Score = 97.1 bits (240), Expect = 5e-18, Method: Composition-based stats.
Identities = 44/113 (38%), Positives = 65/113 (57%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++VI G T V + GL+WYL+Y G H++W+ +P++ R +
Sbjct: 56 KVVIKGNTWVNIATGLNWYLKYHAGIHLTWNNMTANLPERLPQVTQTERHET-------D 108
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+ L Y N T SY+ A+WDW+RW+ EIDWMAL G+NLPLA G+E W+ +
Sbjct: 109 LKLRYDFNYCTFSYSMAFWDWERWQTEIDWMALHGVNLPLAIVGEEVAWRNML 161
>gi|351699889|gb|EHB02808.1| Alpha-N-acetylglucosaminidase, partial [Heterocephalus glaber]
Length = 652
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 47/95 (49%), Positives = 59/95 (62%), Gaps = 7/95 (7%)
Query: 121 YLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAW 180
YLR +CG H++W G Q+ L + P + P YYQN T SY+F W
Sbjct: 2 YLRDFCGCHVAWS---GAQLRLPRPLPAVP----GELTEASPYRYRYYQNVCTHSYSFVW 54
Query: 181 WDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
WDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 55 WDWARWEREIDWMALHGINLALAWSGQEAIWQRVY 89
>gi|423212382|ref|ZP_17198911.1| hypothetical protein HMPREF1074_00443 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694828|gb|EIY88054.1| hypothetical protein HMPREF1074_00443 [Bacteroides xylanisolvens
CL03T12C04]
Length = 705
Score = 97.1 bits (240), Expect = 5e-18, Method: Composition-based stats.
Identities = 43/114 (37%), Positives = 69/114 (60%), Gaps = 7/114 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+++I+G + + + GL+WYL+Y G H+SW+ +P++ P+ + +
Sbjct: 66 KVLITGNSDLSLSTGLNWYLKYVAGIHLSWNNLS----QKLPEVLPLPQ---ETIRKETS 118
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+ YY N SY+ A+WDW+RWEKEIDWMAL GIN+PL+ TG E +W + +
Sbjct: 119 MQARYYLNYCAYSYSMAFWDWERWEKEIDWMALHGINMPLSITGMEVVWYNLLK 172
>gi|237721435|ref|ZP_04551916.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|293370838|ref|ZP_06617383.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
gi|229449231|gb|EEO55022.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|292634054|gb|EFF52598.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
Length = 711
Score = 96.7 bits (239), Expect = 6e-18, Method: Composition-based stats.
Identities = 42/114 (36%), Positives = 70/114 (61%), Gaps = 7/114 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+++I+G + + + GL+WYL+Y G H+SW+ +P++ P+ + + +
Sbjct: 66 KVLITGNSDLSLATGLNWYLKYVAGIHLSWNNLS----QKLPEILPLPQEK---IRKETS 118
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+ YY N T SY+ +WDW+RWEKEIDWMA+ GIN+PL+ TG E +W + +
Sbjct: 119 MQNRYYLNYCTYSYSMVFWDWERWEKEIDWMAMHGINMPLSITGMEVVWYNLLK 172
>gi|333031143|ref|ZP_08459204.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
gi|332741740|gb|EGJ72222.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
Length = 723
Score = 96.7 bits (239), Expect = 6e-18, Method: Composition-based stats.
Identities = 53/156 (33%), Positives = 86/156 (55%), Gaps = 17/156 (10%)
Query: 62 QRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWY 121
QRL+P++ S+F F+ I K F + + + +I+I G + GL+ Y
Sbjct: 34 QRLIPTYSSSFSFKKI--KAINQRDFFTVESKQN-------KILIEGNNANSMAMGLNHY 84
Query: 122 LRYWCGSHISWDKTGGVQVASMPK-LGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAW 180
L+++C + +SW ++ MPK L + P+ + V+ + ++ N T YT W
Sbjct: 85 LKHYCLTTVSWYADKAIE---MPKELPALPQ----KISVEAKVDTRFFLNYCTYGYTMPW 137
Query: 181 WDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
W WK WE+ IDWMAL G+N+PLA TGQE++W V++
Sbjct: 138 WQWKDWERFIDWMALNGVNMPLAITGQESVWYNVWK 173
>gi|288927792|ref|ZP_06421639.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
(NAG) [Prevotella sp. oral taxon 317 str. F0108]
gi|288330626|gb|EFC69210.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
(NAG) [Prevotella sp. oral taxon 317 str. F0108]
Length = 734
Score = 96.7 bits (239), Expect = 6e-18, Method: Composition-based stats.
Identities = 57/170 (33%), Positives = 89/170 (52%), Gaps = 17/170 (10%)
Query: 48 APPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPE-IVI 106
A +V L +++R+LP HY +I+ EY + N + + I+I
Sbjct: 21 AETNVSLVPVRELVKRILPEHY----LKIVV------EYMPDVTNDERFELYSQADKIII 70
Query: 107 SGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLN 166
G T + GL++YL+Y+C +++SW ++ PK+ P V + V R +P
Sbjct: 71 RGTTKSAIGVGLNYYLKYYCKTYVSWYSFDKIET---PKV--LPVVPEKVVRSAR-VPER 124
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
++ N T YT WW W WE+ IDWMAL GIN+PLA GQE++W V++
Sbjct: 125 FFLNYCTYGYTLTWWGWHEWERLIDWMALNGINMPLAIAGQESVWLNVWK 174
>gi|379334158|gb|AFD03088.1| putative alpha-N-acetylglucosaminidase [uncultured bacterium 8]
Length = 726
Score = 96.3 bits (238), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 61/116 (52%), Gaps = 7/116 (6%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G + + G +GV V + L WYLR CG+ I+WD +P+ G+ R
Sbjct: 48 GPGGVALRGSSGVAVASALRWYLRTACGTQITWDDPEPRLPDRLPRTGTTARTTS----- 102
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P P Y+ N T YT A+WDW RWE+ IDWMAL G+ PLA TG E WQ+
Sbjct: 103 --PHPHRYHFNVCTFGYTTAFWDWARWERHIDWMALHGVTTPLAMTGLEAAWQRAL 156
>gi|261880010|ref|ZP_06006437.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
gi|270333326|gb|EFA44112.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
Length = 719
Score = 96.3 bits (238), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 82/162 (50%), Gaps = 15/162 (9%)
Query: 53 QLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGV 112
+ A + RL+P H SAF F+ I K+ Y RN +I+ISG
Sbjct: 23 DIVTANQLTMRLIPEHASAFTFKKIRKRNDKDIYTLQTRNR---------KIIISGNNAN 73
Query: 113 EVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAV 172
+ GL+ YL ++C +++SW V MP+ P V+ V + + ++ N
Sbjct: 74 AMAVGLNKYLTHYCHTNVSWYAEEKV---DMPQ--QLPEVKGV-VKGEAKVARRFFLNYC 127
Query: 173 TSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
T YT ++DW WE+ IDWMAL G+NLPLA TGQE IW V
Sbjct: 128 TFGYTLPFFDWHAWERVIDWMALHGVNLPLAITGQEYIWYNV 169
>gi|345517325|ref|ZP_08796802.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 4_3_47FAA]
gi|345457718|gb|EET14396.2| alpha-N-acetylglucosaminidase [Bacteroides sp. 4_3_47FAA]
Length = 754
Score = 96.3 bits (238), Expect = 9e-18, Method: Composition-based stats.
Identities = 50/117 (42%), Positives = 69/117 (58%), Gaps = 15/117 (12%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKR- 161
++VI G T + + G++WYL+Y G H+SW+ AS+P + P V F K
Sbjct: 62 KVVIRGNTYINIATGINWYLKYHAGIHLSWNSMH----ASLPNV--LPPV-----FRKER 110
Query: 162 ---PIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+ L Y N T SY+ A+WDWKRWE+E+DWMAL GINLPLA G E +W+ +
Sbjct: 111 HETNLALRYDFNYCTYSYSMAFWDWKRWEEELDWMALHGINLPLAAVGHECVWRNLL 167
>gi|294777713|ref|ZP_06743164.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
gi|294448781|gb|EFG17330.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
Length = 752
Score = 96.3 bits (238), Expect = 9e-18, Method: Composition-based stats.
Identities = 50/117 (42%), Positives = 69/117 (58%), Gaps = 15/117 (12%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKR- 161
++VI G T + + G++WYL+Y G H+SW+ AS+P + P V F K
Sbjct: 60 KVVIRGNTYINIATGINWYLKYHAGIHLSWNSMH----ASLPNV--LPPV-----FRKER 108
Query: 162 ---PIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+ L Y N T SY+ A+WDWKRWE+E+DWMAL GINLPLA G E +W+ +
Sbjct: 109 HETNLALRYDFNYCTYSYSMAFWDWKRWEEELDWMALHGINLPLAAVGHECVWRNLL 165
>gi|319640296|ref|ZP_07995021.1| hypothetical protein HMPREF9011_00618 [Bacteroides sp. 3_1_40A]
gi|317388071|gb|EFV68925.1| hypothetical protein HMPREF9011_00618 [Bacteroides sp. 3_1_40A]
Length = 752
Score = 96.3 bits (238), Expect = 9e-18, Method: Composition-based stats.
Identities = 50/117 (42%), Positives = 69/117 (58%), Gaps = 15/117 (12%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKR- 161
++VI G T + + G++WYL+Y G H+SW+ AS+P + P V F K
Sbjct: 60 KVVIRGNTYINIATGINWYLKYHAGIHLSWNSMH----ASLPNV--LPPV-----FRKER 108
Query: 162 ---PIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+ L Y N T SY+ A+WDWKRWE+E+DWMAL GINLPLA G E +W+ +
Sbjct: 109 HETNLALRYDFNYCTYSYSMAFWDWKRWEEELDWMALHGINLPLAAVGHECVWRNLL 165
>gi|154489986|ref|ZP_02030247.1| hypothetical protein PARMER_00215 [Parabacteroides merdae ATCC
43184]
gi|423722990|ref|ZP_17697143.1| hypothetical protein HMPREF1078_01203 [Parabacteroides merdae
CL09T00C40]
gi|154089428|gb|EDN88472.1| Alpha-N-acetylglucosaminidase (NAGLU) [Parabacteroides merdae ATCC
43184]
gi|409241820|gb|EKN34587.1| hypothetical protein HMPREF1078_01203 [Parabacteroides merdae
CL09T00C40]
Length = 718
Score = 95.9 bits (237), Expect = 1e-17, Method: Composition-based stats.
Identities = 51/113 (45%), Positives = 68/113 (60%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++VI G V + GL+WYL+Y G H+SW+ G+Q A +P++ P V+ +
Sbjct: 57 KVVIRGNNYVSIATGLNWYLKYHVGIHLSWN---GMQ-AELPEV--LPAVKQKERH-ETD 109
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+ Y N T SYT A+WDW RWEKEIDWMAL GINLPLA G + +W V
Sbjct: 110 MKYRYDFNYCTFSYTMAFWDWTRWEKEIDWMALHGINLPLAMVGTDGVWYNVL 162
>gi|423299508|ref|ZP_17277533.1| hypothetical protein HMPREF1057_00674 [Bacteroides finegoldii
CL09T03C10]
gi|408473317|gb|EKJ91839.1| hypothetical protein HMPREF1057_00674 [Bacteroides finegoldii
CL09T03C10]
Length = 727
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/163 (34%), Positives = 86/163 (52%), Gaps = 15/163 (9%)
Query: 53 QLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGV 112
Q + + QRL P H S+FQF ++ FIL + + +I I G
Sbjct: 22 QETVIHQLCQRLFPQHASSFQFDLLQDSLKTDH--FILTSQ-------SGKICIQGNNYN 72
Query: 113 EVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAV 172
+ GL++YL+++C +H+SW + + MPK P V++ V + ++ N
Sbjct: 73 SIAVGLNYYLKHYCHTHVSWYASDSI---VMPK--ELPMVKEP-VSRQAKCDSRFFLNYC 126
Query: 173 TSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
T YT +W W+ WE+ IDWMAL GI +PLA TGQE+IW KV+
Sbjct: 127 TFGYTMPYWKWQDWERLIDWMALNGITMPLAITGQESIWYKVW 169
>gi|428176410|gb|EKX45295.1| hypothetical protein GUITHDRAFT_51145, partial [Guillardia theta
CCMP2712]
Length = 680
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/104 (47%), Positives = 64/104 (61%), Gaps = 8/104 (7%)
Query: 114 VLAGLHWYLRYWCGSHISW-DKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAV 172
V + L+WYLRY C SW K V +P++G + + + YY+N
Sbjct: 1 VASALNWYLRYHCSVDTSWMSKFPLVLPRQLPRVGR-------SIVKQSLVKWGYYENVC 53
Query: 173 TSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
T SYT AWWDW+RWE+EIDWMA+ GINLPL+ TGQE I Q+VF+
Sbjct: 54 TFSYTQAWWDWERWEREIDWMAMSGINLPLSLTGQEYISQRVFR 97
>gi|319900259|ref|YP_004159987.1| alpha-N-acetylglucosaminidase [Bacteroides helcogenes P 36-108]
gi|319415290|gb|ADV42401.1| Alpha-N-acetylglucosaminidase [Bacteroides helcogenes P 36-108]
Length = 718
Score = 95.5 bits (236), Expect = 1e-17, Method: Composition-based stats.
Identities = 47/117 (40%), Positives = 66/117 (56%), Gaps = 7/117 (5%)
Query: 99 RGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVF 158
R ++V+ G V + G++WYL+Y G H+ W+ S+P + P +
Sbjct: 53 RQKGKVVVRGNNYVSIATGINWYLKYHAGIHLCWNNMQAKLPDSLPPV---PEKERHDTD 109
Query: 159 VKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+K LNY T SYT A+WDW+RWEKEIDWMAL GINLPL+ G ++W+ V
Sbjct: 110 LKYRYNLNY----CTYSYTMAFWDWERWEKEIDWMALHGINLPLSIVGTGSVWRNVL 162
>gi|380694112|ref|ZP_09858971.1| alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
Length = 736
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 70/114 (61%), Gaps = 7/114 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+++I+G + + + GL+WYL+Y G H+SW+ +P++ P+ + +
Sbjct: 69 KVLITGNSDLSLATGLNWYLKYVAGIHLSWNNPS----QKLPEVLPLPQKK---IRQTTA 121
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+ YY N T SY+ A+WDW+RWEKEIDWMA+ GIN+PL+ TG E +W + +
Sbjct: 122 MKNRYYLNYCTYSYSMAFWDWERWEKEIDWMAMHGINMPLSITGMEVVWYNLLK 175
>gi|224025137|ref|ZP_03643503.1| hypothetical protein BACCOPRO_01871 [Bacteroides coprophilus DSM
18228]
gi|224018373|gb|EEF76371.1| hypothetical protein BACCOPRO_01871 [Bacteroides coprophilus DSM
18228]
Length = 718
Score = 95.1 bits (235), Expect = 2e-17, Method: Composition-based stats.
Identities = 48/114 (42%), Positives = 72/114 (63%), Gaps = 7/114 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++V+ G V + G++WYL+Y+ G ++SW+ G+Q A +P++ P V +
Sbjct: 57 KVVVRGNNYVSIATGINWYLKYYAGINLSWN---GMQ-ADLPEV--LPPVLKKERH-ETD 109
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+ L Y N T SY+ A+WDWKRWE+EIDWMAL GINLPLA G + +W+ V +
Sbjct: 110 LKLRYDFNYCTFSYSMAFWDWKRWEQEIDWMALHGINLPLAMVGTDVVWKNVLE 163
>gi|126307960|ref|XP_001366343.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Monodelphis
domestica]
Length = 741
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 50/112 (44%), Positives = 70/112 (62%), Gaps = 7/112 (6%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ ++G +GV AGL+ YLR +CG ++W G Q+ +P+ P V + + + P
Sbjct: 73 VQVTGSSGVAAAAGLYRYLRDFCGCQVAWS---GAQL-RLPE--PLPAVPEELIEIT-PN 125
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YY N T SY+F WW W+RWE+EIDWMAL GINL LA GQE IW++V+
Sbjct: 126 WYRYYLNICTPSYSFVWWGWERWEREIDWMALNGINLVLAPVGQEAIWRRVY 177
>gi|218258436|ref|ZP_03474815.1| hypothetical protein PRABACTJOHN_00470 [Parabacteroides johnsonii
DSM 18315]
gi|423342591|ref|ZP_17320305.1| hypothetical protein HMPREF1077_01735 [Parabacteroides johnsonii
CL02T12C29]
gi|218225494|gb|EEC98144.1| hypothetical protein PRABACTJOHN_00470 [Parabacteroides johnsonii
DSM 18315]
gi|409217508|gb|EKN10484.1| hypothetical protein HMPREF1077_01735 [Parabacteroides johnsonii
CL02T12C29]
Length = 718
Score = 94.7 bits (234), Expect = 2e-17, Method: Composition-based stats.
Identities = 49/113 (43%), Positives = 69/113 (61%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++VI G V + GL+WYL+Y G H+SW+ G+Q A +P++ ++++ +
Sbjct: 57 KVVIRGNNYVSIATGLNWYLKYHAGIHLSWN---GMQ-AELPEVLPAVKLKERH---ETD 109
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+ Y N T SYT A+WDW RWEKEIDWMAL GINLPLA G + +W V
Sbjct: 110 MKYRYDFNYCTFSYTMAFWDWARWEKEIDWMALHGINLPLAMVGTDGVWFNVL 162
>gi|348562747|ref|XP_003467170.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
[Cavia porcellus]
Length = 750
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 67/166 (40%), Positives = 88/166 (53%), Gaps = 15/166 (9%)
Query: 53 QLAAAYAVLQRLL-PSHYSAFQFRIISKKQCG-GEYCFILRNHPSSYIRGTPEIVISGVT 110
+ AA A+ RLL P +AF+ + G G F L S+ R E V + +
Sbjct: 33 ETAAVRALAVRLLGPGPAAAFELSVERTLAAGPGADVFEL----SAEARRR-EGVCAAPS 87
Query: 111 GVEVLA-GLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQ 169
GV V A +CG H++W G Q+ S+P+ P V + P YYQ
Sbjct: 88 GVAVAARAAQTTCGTFCGCHVAWS---GAQL-SLPR--PLPPV-PGQLTEASPYRYRYYQ 140
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
N T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 141 NVCTHSYSFVWWDWPRWEREIDWMALNGINLALAWSGQEAIWQRVY 186
>gi|158300970|ref|XP_320760.4| AGAP011750-PA [Anopheles gambiae str. PEST]
gi|157013415|gb|EAA00039.4| AGAP011750-PA [Anopheles gambiae str. PEST]
Length = 770
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 61/173 (35%), Positives = 85/173 (49%), Gaps = 20/173 (11%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNH--PSSYIRGTP 102
R + +Q AA V+ RLLP A QFR+ Y + RN S +
Sbjct: 34 RPQVSEVIQQQAALEVIARLLPQQ-QAEQFRVTI-------YSSMERNSFSVSEEVSDPS 85
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
E+ I+ +GV + YLRYWC ++W+ + A +P + V ++ P
Sbjct: 86 EVQITASSGVAATKAFYHYLRYWCRVLVAWEGSQLNLPAVLPPVN---------VTIQAP 136
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+ YYQN T SY+F WW W +W + IDWMALQGI L LA QE +W +VF
Sbjct: 137 SSIIYYQNVCTWSYSFTWWSWPQWRRHIDWMALQGITLSLA-PFQEDLWTQVF 188
>gi|262407713|ref|ZP_06084261.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|262354521|gb|EEZ03613.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
Length = 735
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 55/170 (32%), Positives = 86/170 (50%), Gaps = 18/170 (10%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEI 104
R + P + L+ + +RL P H +FQF++++ + N +I
Sbjct: 21 RNQDPATNTLSE---MCERLFPQHAHSFQFQLLTDSVDIDRFTLESDN---------GKI 68
Query: 105 VISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIP 164
+I G + GL+ YL+Y+C +H+SW + V MP P V+ A V ++
Sbjct: 69 LIKGNNRNSLAVGLNHYLKYYCQAHVSWYASDSV---VMP--AQLPEVE-APVILRSKCK 122
Query: 165 LNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
++ N T Y+ +W W WE+ IDWMAL G+ +PLA TGQE+IW KV
Sbjct: 123 NRFFLNYCTFGYSMPYWKWSDWERLIDWMALNGVTMPLAITGQESIWYKV 172
>gi|294647264|ref|ZP_06724861.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
gi|292637401|gb|EFF55822.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
Length = 733
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 55/170 (32%), Positives = 86/170 (50%), Gaps = 18/170 (10%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEI 104
R + P + L+ + +RL P H +FQF++++ + N +I
Sbjct: 19 RNQDPATNTLSE---MCERLFPQHAHSFQFQLLTDSVDIDRFTLESDN---------GKI 66
Query: 105 VISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIP 164
+I G + GL+ YL+Y+C +H+SW + V MP P V+ A V ++
Sbjct: 67 LIKGNNRNSLAVGLNHYLKYYCQAHVSWYASDSV---VMP--AQLPEVE-APVILRSKCK 120
Query: 165 LNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
++ N T Y+ +W W WE+ IDWMAL G+ +PLA TGQE+IW KV
Sbjct: 121 NRFFLNYCTFGYSMPYWKWSDWERLIDWMALNGVTMPLAITGQESIWYKV 170
>gi|261880159|ref|ZP_06006586.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
gi|270333130|gb|EFA43916.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
Length = 772
Score = 93.2 bits (230), Expect = 6e-17, Method: Composition-based stats.
Identities = 44/114 (38%), Positives = 69/114 (60%), Gaps = 11/114 (9%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ I G + V + +GL+WYL+Y+ G H++W+ ++V + + P+V+ +R
Sbjct: 64 VCIKGNSWVNIASGLNWYLKYYAGIHLTWNN---MKVKAPVTWPTVPKVE------RRET 114
Query: 164 PLN--YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
L Y N T SY+ A+WDW+RWE+EIDWMAL G+N+PLA G E W+ +
Sbjct: 115 DLTWRYNFNYCTFSYSMAFWDWERWEQEIDWMALHGVNMPLAVVGAEVAWRNML 168
>gi|345511813|ref|ZP_08791352.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
gi|229443748|gb|EEO49539.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
Length = 720
Score = 93.2 bits (230), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 55/171 (32%), Positives = 87/171 (50%), Gaps = 18/171 (10%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEI 104
R + P + L+ + +RL P H +FQF++++ + N +I
Sbjct: 6 RNQDPATNTLSE---MCERLFPQHAHSFQFQLLTDSVDIDRFTLESDN---------GKI 53
Query: 105 VISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIP 164
+I G + GL+ YL+Y+C +H+SW + V MP P V+ A V ++
Sbjct: 54 LIKGNNRNSLAVGLNHYLKYYCQAHVSWYASDSV---VMP--AQLPEVE-APVILRSKCK 107
Query: 165 LNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
++ N T Y+ +W W WE+ IDWMAL G+ +PLA TGQE+IW KV+
Sbjct: 108 NRFFLNYCTFGYSMPYWKWSDWERLIDWMALNGVTMPLAITGQESIWYKVW 158
>gi|395804724|ref|ZP_10483959.1| alpha-N-acetylglucosaminidase [Flavobacterium sp. F52]
gi|395433112|gb|EJF99070.1| alpha-N-acetylglucosaminidase [Flavobacterium sp. F52]
Length = 722
Score = 92.8 bits (229), Expect = 1e-16, Method: Composition-based stats.
Identities = 55/161 (34%), Positives = 78/161 (48%), Gaps = 17/161 (10%)
Query: 56 AAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVL 115
+A+ V++RL+ S + FQ + K+ E+ I + I+G+ I L
Sbjct: 41 SAFGVIERLVGSRANEFQLSLDENKKSNSEWFEIETENNLVKIKGSTNTAIC-YAAYNFL 99
Query: 116 AGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSS 175
+ L W G+ I KT +P+ G + P Y NA T
Sbjct: 100 RDIGAVLTSWEGNRIDLPKT-------------WPKYSKKG---ETPFQYREYLNACTFG 143
Query: 176 YTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YT WWDWKRWE+EIDWMAL GINLP A GQE +WQ++++
Sbjct: 144 YTTPWWDWKRWEQEIDWMALHGINLPTAMEGQEAVWQELWK 184
>gi|393784337|ref|ZP_10372502.1| hypothetical protein HMPREF1071_03370 [Bacteroides salyersiae
CL02T12C01]
gi|392666113|gb|EIY59630.1| hypothetical protein HMPREF1071_03370 [Bacteroides salyersiae
CL02T12C01]
Length = 728
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/160 (35%), Positives = 85/160 (53%), Gaps = 17/160 (10%)
Query: 58 YAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAG 117
Y + +RLLP++ +F F +Q + F + +I+I G GV + G
Sbjct: 26 YGLAERLLPANNPSFVF-----EQLPSDSDFFELEQAAE-----DKIIIRGNNGVSMARG 75
Query: 118 LHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYT 177
L+ YLR +C +SW G ++ +P+ P V++ + + P YY N T SY+
Sbjct: 76 LNHYLRNYCHRSVSW---CGNNLSDLPE--PLPPVKEK-IRIDASFPYRYYLNYCTYSYS 129
Query: 178 FAWWDWKRWEKEIDWMALQGINLPL-AFTGQETIWQKVFQ 216
A+WDW+RWE EID MALQGIN+PL A G+ +WQ +
Sbjct: 130 MAFWDWERWEAEIDRMALQGINMPLMAVYGEYAVWQNTLR 169
>gi|380697007|ref|ZP_09861866.1| alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
Length = 703
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 52/156 (33%), Positives = 81/156 (51%), Gaps = 15/156 (9%)
Query: 60 VLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLH 119
+ +RL P H +FQF++++ + N +I+I G + GL+
Sbjct: 1 MCERLFPQHAHSFQFQLLTDSVDIDRFTLESDN---------GKILIKGNNRNSLAVGLN 51
Query: 120 WYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFA 179
YL+Y+C +H+SW + V MP P V+ A V ++ ++ N T Y+
Sbjct: 52 HYLKYYCQAHVSWYASDSV---VMP--AQLPEVE-APVILRSKCKNRFFLNYCTFGYSMP 105
Query: 180 WWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+W W WE+ IDWMAL G+ +PLA TGQE+IW KV+
Sbjct: 106 YWKWSDWERLIDWMALNGVTMPLAITGQESIWYKVW 141
>gi|242809019|ref|XP_002485282.1| alpha-N-acetylglucosaminidase, putative [Talaromyces stipitatus
ATCC 10500]
gi|218715907|gb|EED15329.1| alpha-N-acetylglucosaminidase, putative [Talaromyces stipitatus
ATCC 10500]
Length = 755
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 57/158 (36%), Positives = 86/158 (54%), Gaps = 11/158 (6%)
Query: 59 AVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGL 118
A+LQR LP H F+F I++ G +++ + + IR I G + + GL
Sbjct: 31 ALLQRRLPQHADKFEFGIVNATSLGENDVYVVLSAENGSIR------IEGSSLSALATGL 84
Query: 119 HWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTF 178
H YL I W G ++ + G FP++ + + +P Y+ N VT+SYT
Sbjct: 85 HRYLSDVAHVDIYW--FIGSRLDQID--GQFPKLNEP-LTGSSVVPYRYHFNTVTTSYTS 139
Query: 179 AWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
A+W W+ WE ++DWMAL+GINLPLA+ G E I+ +VFQ
Sbjct: 140 AFWSWEDWELQLDWMALRGINLPLAWIGIERIFIEVFQ 177
>gi|294807833|ref|ZP_06766618.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
gi|294444952|gb|EFG13634.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
Length = 703
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 52/156 (33%), Positives = 81/156 (51%), Gaps = 15/156 (9%)
Query: 60 VLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLH 119
+ +RL P H +FQF++++ + N +I+I G + GL+
Sbjct: 1 MCERLFPQHAHSFQFQLLTDSVDIDRFTLESDN---------GKILIKGNNRNSLAVGLN 51
Query: 120 WYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFA 179
YL+Y+C +H+SW + V MP P V+ A V ++ ++ N T Y+
Sbjct: 52 HYLKYYCQAHVSWYASDSV---VMP--AQLPEVE-APVILRSKCKNRFFLNYCTFGYSMP 105
Query: 180 WWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+W W WE+ IDWMAL G+ +PLA TGQE+IW KV+
Sbjct: 106 YWKWSDWERLIDWMALNGVTMPLAITGQESIWYKVW 141
>gi|374385255|ref|ZP_09642763.1| hypothetical protein HMPREF9449_01149 [Odoribacter laneus YIT
12061]
gi|373226460|gb|EHP48786.1| hypothetical protein HMPREF9449_01149 [Odoribacter laneus YIT
12061]
Length = 736
Score = 92.0 bits (227), Expect = 2e-16, Method: Composition-based stats.
Identities = 52/161 (32%), Positives = 87/161 (54%), Gaps = 16/161 (9%)
Query: 56 AAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVL 115
AAYA+++R++ F+ ++++ G+ F L +IV+ G G+ V
Sbjct: 30 AAYALIKRVVGEKADHFKVEYVAREN--GKDVFELEQKGK-------KIVLRGNNGISVA 80
Query: 116 AGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSS 175
+ L+ YL + W+ + +P++ +V+ +V R +Y N T S
Sbjct: 81 SALNHYLNEYAHCQYGWNASHMQLPDPLPEVKQ--KVRKVTPYVYR-----HYFNYCTFS 133
Query: 176 YTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YT AWWDW+RW+KEID+MAL G+N+PLA TGQ +W +V++
Sbjct: 134 YTAAWWDWERWQKEIDYMALHGVNMPLAMTGQNAVWDRVYR 174
>gi|260642393|ref|ZP_05415712.2| alpha-N-acetylglucosaminidase [Bacteroides finegoldii DSM 17565]
gi|260622285|gb|EEX45156.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides finegoldii DSM
17565]
Length = 735
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/170 (31%), Positives = 85/170 (50%), Gaps = 18/170 (10%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEI 104
R + P + L+ + +RL P H +FQF++++ + N +I
Sbjct: 21 RNQDPATNTLSE---MCERLFPQHAHSFQFQLLTDSVDIDRFTLESDN---------GKI 68
Query: 105 VISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIP 164
+I G + GL+ YL+Y+C +H+SW + V MP P V+ V ++
Sbjct: 69 LIKGNNRNSLAVGLNHYLKYYCQTHVSWYASDSV---VMP--AQLPEVE-TPVILRSKCK 122
Query: 165 LNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
++ N T Y+ +W W WE+ IDWMAL G+ +PLA TGQE+IW KV
Sbjct: 123 NRFFLNYCTFGYSMPYWKWSDWERLIDWMALNGVTMPLAITGQESIWYKV 172
>gi|404487028|ref|ZP_11022215.1| hypothetical protein HMPREF9448_02671 [Barnesiella intestinihominis
YIT 11860]
gi|404335524|gb|EJZ61993.1| hypothetical protein HMPREF9448_02671 [Barnesiella intestinihominis
YIT 11860]
Length = 726
Score = 92.0 bits (227), Expect = 2e-16, Method: Composition-based stats.
Identities = 55/161 (34%), Positives = 86/161 (53%), Gaps = 16/161 (9%)
Query: 56 AAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVL 115
A + RL+P + F F I+ + R++ S + +I ISG +
Sbjct: 28 AVEQLTHRLIPQYADRFVFEIVPDED---------RDYYSLESKDG-KIHISGNNANSMA 77
Query: 116 AGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSS 175
AGL++YL+Y+C + +SW V+ MP + P V+ V V+ + ++ N T
Sbjct: 78 AGLNYYLKYYCLTTVSWYADIPVE---MPDV--LPMVEKP-VVVEAKVDNRFFLNYCTYG 131
Query: 176 YTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y+ +W WK WE+ IDWMAL G+N+PLA TGQE +W KV++
Sbjct: 132 YSMPFWKWKDWERFIDWMALNGVNMPLAITGQEMVWYKVWK 172
>gi|355754184|gb|EHH58149.1| Alpha-N-acetylglucosaminidase, partial [Macaca fascicularis]
Length = 650
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 57/93 (61%), Gaps = 7/93 (7%)
Query: 123 RYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWD 182
R +CG H++W G Q+ L + P + P YYQN T SY+F WWD
Sbjct: 1 RDFCGCHVAWS---GSQLRLPRPLPAVP----GELTEATPNRYRYYQNVCTQSYSFVWWD 53
Query: 183 WKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
W RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 54 WARWEREIDWMALNGINLALAWSGQEAIWQRVY 86
>gi|298386708|ref|ZP_06996263.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 1_1_14]
gi|298260382|gb|EFI03251.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 1_1_14]
Length = 732
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 87/170 (51%), Gaps = 20/170 (11%)
Query: 46 ERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIV 105
E PP ++ + QRL P H +F F +++ FIL + +I
Sbjct: 22 ETGPPVLK-----EMCQRLFPRHAQSFLFELLTDSIDTDR--FILESSQG-------KIR 67
Query: 106 ISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPL 165
I G + AGL+ YL+ +C +H+SW + V+ MP + P + V+++
Sbjct: 68 IKGNNRNSLAAGLNHYLKNYCHTHVSWYASETVE---MPDV--LPEIPQP-VYIRSKCDN 121
Query: 166 NYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
++ N T YT +W W+ WE+ IDWMAL G+ +PLA TGQE+IW KV+
Sbjct: 122 RFFLNYCTFGYTMPYWKWQDWERLIDWMALNGVTMPLAITGQESIWYKVW 171
>gi|29348998|ref|NP_812501.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340905|gb|AAO78695.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
thetaiotaomicron VPI-5482]
Length = 732
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 56/169 (33%), Positives = 86/169 (50%), Gaps = 20/169 (11%)
Query: 46 ERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIV 105
E PP ++ + QRL P H +F F +++ FIL + +I
Sbjct: 22 ETGPPVLK-----EMCQRLFPRHAQSFLFELLTDSIDTDR--FILESSQG-------KIR 67
Query: 106 ISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPL 165
I G + AGL+ YL+ +C +H+SW + V+ MP + P + V+++
Sbjct: 68 IKGNNRNSLAAGLNHYLKNYCHTHVSWYASETVE---MPDV--LPEIPQP-VYIRSKCDN 121
Query: 166 NYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
++ N T YT +W W+ WE+ IDWMAL G+ +PLA TGQE+IW KV
Sbjct: 122 RFFLNYCTFGYTMPYWKWQDWERLIDWMALNGVTMPLAITGQESIWYKV 170
>gi|383124408|ref|ZP_09945072.1| hypothetical protein BSIG_3565 [Bacteroides sp. 1_1_6]
gi|251839096|gb|EES67180.1| hypothetical protein BSIG_3565 [Bacteroides sp. 1_1_6]
Length = 732
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 87/170 (51%), Gaps = 20/170 (11%)
Query: 46 ERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIV 105
E PP ++ + QRL P H +F F +++ FIL + +I
Sbjct: 22 ETGPPVLK-----EMCQRLFPRHAQSFLFELLTDSIDTDR--FILESSQG-------KIR 67
Query: 106 ISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPL 165
I G + AGL+ YL+ +C +H+SW + V+ MP + P + V+++
Sbjct: 68 IKGNNRNSLAAGLNHYLKNYCHTHVSWYASETVE---MPDV--LPEIPQP-VYIRSKCDN 121
Query: 166 NYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
++ N T YT +W W+ WE+ IDWMAL G+ +PLA TGQE+IW KV+
Sbjct: 122 RFFLNYCTFGYTMPYWKWQDWERLIDWMALNGVTMPLAITGQESIWYKVW 171
>gi|329963073|ref|ZP_08300853.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
gi|328529114|gb|EGF56044.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
Length = 717
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 48/113 (42%), Positives = 64/113 (56%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++VI G V + GL+WYL+Y G +SW+ +P + R + +K
Sbjct: 57 KVVIRGNNYVSIATGLNWYLKYHAGIQLSWNGMKAKLPDVLPAVTQKERHE---TDLKYR 113
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
LNY T SYT A+WDWKRWE+EIDWMAL GINLPLA G + +W+ V
Sbjct: 114 YDLNY----CTYSYTMAFWDWKRWEQEIDWMALHGINLPLAIIGTDVVWRNVL 162
>gi|423287380|ref|ZP_17266231.1| hypothetical protein HMPREF1069_01274 [Bacteroides ovatus
CL02T12C04]
gi|392672495|gb|EIY65962.1| hypothetical protein HMPREF1069_01274 [Bacteroides ovatus
CL02T12C04]
Length = 726
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 55/169 (32%), Positives = 80/169 (47%), Gaps = 15/169 (8%)
Query: 48 APPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVIS 107
+P + A + RL P H F F + L N + +I IS
Sbjct: 15 SPVFAENRAMKDMCNRLFPEHSGNFTFELAPDS---------LENDFFTIESINDKIKIS 65
Query: 108 GVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNY 167
G + GL+ YL+Y+C +H+SW T ++ MP+ P + D + V +
Sbjct: 66 GNNNNSLATGLNHYLKYYCHTHVSWYATDKIE---MPR--QLPVLLDK-ITVLAKCKTRF 119
Query: 168 YQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+ N T Y+ +W WK WE+ IDWMAL G+N PLA TGQE IW V++
Sbjct: 120 FLNYCTFGYSMPYWKWKDWERLIDWMALNGVNTPLAITGQEAIWYDVWK 168
>gi|423213214|ref|ZP_17199743.1| hypothetical protein HMPREF1074_01275 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693674|gb|EIY86904.1| hypothetical protein HMPREF1074_01275 [Bacteroides xylanisolvens
CL03T12C04]
Length = 726
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 81/170 (47%), Gaps = 17/170 (10%)
Query: 48 APPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVIS 107
+P + A + RL P H F F + L N + +I IS
Sbjct: 15 SPVFAENRAMKDMCNRLFPEHSGNFTFELAPDS---------LENDFFTIESINDKIKIS 65
Query: 108 GVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDA-GVFVKRPIPLN 166
G + GL+ YL+Y+C +H+SW T ++ MP+ P + D +F K
Sbjct: 66 GNNNNSLATGLNHYLKYYCHTHVSWYATDKIE---MPR--QLPVLLDKITIFAK--CKTR 118
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
++ N T Y+ +W WK WE+ IDWMAL G+N PLA TGQE IW V++
Sbjct: 119 FFLNYCTFGYSMPYWKWKDWERLIDWMALNGVNTPLAITGQEAIWYDVWK 168
>gi|212537509|ref|XP_002148910.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
18224]
gi|210068652|gb|EEA22743.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
18224]
Length = 768
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 52/161 (32%), Positives = 82/161 (50%), Gaps = 16/161 (9%)
Query: 59 AVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGL 118
++LQR LP H F+F +++ G + + N + I+ + G + + GL
Sbjct: 39 SLLQRRLPQHAHKFEFSVVNATSLGENDAYTVANAENGRIK------VEGSSLSALATGL 92
Query: 119 HWYLRYWCGSHISW---DKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSS 175
H YL I W + + +PKL + + +P Y+ N VT+S
Sbjct: 93 HRYLADVAHVDIYWFIGSRLDQISAGQLPKL-------NETLTGSSVVPYRYHFNTVTTS 145
Query: 176 YTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YT A+W W+ WE ++DWMAL+G+NLPLA+ G E I+ +VFQ
Sbjct: 146 YTSAFWSWEDWELQLDWMALRGVNLPLAWIGVEKIFIEVFQ 186
>gi|432926094|ref|XP_004080826.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Oryzias latipes]
Length = 882
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 37/58 (63%), Positives = 44/58 (75%)
Query: 159 VKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+ P YYQN T SY+ WWDW RWE+EIDWMAL GINLPLAFTGQE +WQ+V++
Sbjct: 264 INTPHRFRYYQNVCTVSYSSVWWDWPRWEREIDWMALNGINLPLAFTGQEALWQEVYR 321
>gi|339238239|ref|XP_003380674.1| GDP-L-fucose synthetase [Trichinella spiralis]
gi|316976398|gb|EFV59699.1| GDP-L-fucose synthetase [Trichinella spiralis]
Length = 1203
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 66/112 (58%), Gaps = 7/112 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+ +I G +GV + G++ +LR HI+WD G++V +P + S P Q + R
Sbjct: 707 QTLIKGTSGVALAFGVNAHLRNMYDVHIAWD---GIRV-ELPAIVSPPHKQMVFKSIGR- 761
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
Y+ N T SY+FAWW+W RWE IDWMAL GINLPLA G E +W+ +
Sbjct: 762 --YRYFGNVCTFSYSFAWWNWSRWEYFIDWMALNGINLPLAHVGNEVVWKSL 811
>gi|340347658|ref|ZP_08670763.1| alpha-N-acetylglucosaminidase [Prevotella dentalis DSM 3688]
gi|433652542|ref|YP_007296396.1| Alpha-N-acetylglucosaminidase (NAGLU) [Prevotella dentalis DSM
3688]
gi|339608852|gb|EGQ13735.1| alpha-N-acetylglucosaminidase [Prevotella dentalis DSM 3688]
gi|433303075|gb|AGB28890.1| Alpha-N-acetylglucosaminidase (NAGLU) [Prevotella dentalis DSM
3688]
Length = 781
Score = 90.5 bits (223), Expect = 5e-16, Method: Composition-based stats.
Identities = 41/113 (36%), Positives = 64/113 (56%), Gaps = 7/113 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++ I G T V + +GL+WYL+Y G H+SW+ A +P + R +
Sbjct: 77 KVYIKGNTWVNIASGLNWYLKYHAGIHLSWNNMTAELPARLPCIARPVRRET-------D 129
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+ Y N T SY+ +W+W RW++EIDWMAL G+N+PLA G+E +W+ +
Sbjct: 130 LAERYDFNYCTFSYSMVFWNWDRWQQEIDWMALHGVNMPLAIVGEEVVWRNML 182
>gi|393782608|ref|ZP_10370791.1| hypothetical protein HMPREF1071_01659 [Bacteroides salyersiae
CL02T12C01]
gi|392672835|gb|EIY66301.1| hypothetical protein HMPREF1071_01659 [Bacteroides salyersiae
CL02T12C01]
Length = 761
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 83/157 (52%), Gaps = 18/157 (11%)
Query: 56 AAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGT-PEIVISGVTGVEV 114
A Y +++R+ P + S ++ II+ Y + G EI++ G T V +
Sbjct: 31 AIYELIERVTPGYSSQYRLEIIAPDNGVDVYE----------VDGDGKEIILRGNTPVAL 80
Query: 115 LAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTS 174
+WYL+Y C +H+SW G Q+ ++P+ PR ++ V R Y N T
Sbjct: 81 ATAFNWYLKYTCQAHVSW---FGNQL-NLPEKLPQPRERERRVINGR---YRVYMNYCTV 133
Query: 175 SYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIW 211
SYT AWWDW+RW+KE+D+MA+ +N+PL G + +W
Sbjct: 134 SYTAAWWDWERWQKELDFMAMNSVNMPLFTIGLDAVW 170
>gi|348681836|gb|EGZ21652.1| hypothetical protein PHYSODRAFT_247428 [Phytophthora sojae]
Length = 991
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 65/221 (29%), Positives = 99/221 (44%), Gaps = 27/221 (12%)
Query: 3 SPFRSVSLILFFITTITLSTLAVAQSSTI-----GVQY-ISRLLDIQDRERAPPSVQLAA 56
+ F+ L L I+ ST+ +S + GV+ ++RL ++ A P +AA
Sbjct: 197 TTFKKAGLTLLEHVPISGSTMTSTRSFRLFGLGGGVEASVARL-----KKNAEPHDPVAA 251
Query: 57 AYAVLQRLL-PSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVL 115
++ R L P + R++ + F L + ++ I+ + +
Sbjct: 252 TRGLINRRLGPKYNDQISLRVLPSDSDELD-VFELGSDGD-------KLEIAANSATAMA 303
Query: 116 AGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSS 175
GL WY + + W+ +PK+ D V KR +YYQN T S
Sbjct: 304 YGLQWYFKSVLRTQTDWENHKLQLPDKLPKV-------DERVRHKRSSKFSYYQNVDTGS 356
Query: 176 YTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y+ W W +WEK IDWMAL GIN+PLAFTGQE +WQ F
Sbjct: 357 YSLWAWSWPQWEKHIDWMALNGINMPLAFTGQEKVWQNTFH 397
>gi|29345848|ref|NP_809351.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
VPI-5482]
gi|29337741|gb|AAO75545.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
thetaiotaomicron VPI-5482]
Length = 730
Score = 89.7 bits (221), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 49/155 (31%), Positives = 79/155 (50%), Gaps = 15/155 (9%)
Query: 60 VLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLH 119
+ QRL P H +F+F +++ FIL + +I I G + GL+
Sbjct: 32 MCQRLFPQHADSFEFELLNDSNQMDR--FILES-------AHKKICIKGNNNNSLAVGLN 82
Query: 120 WYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFA 179
YL+Y+C +++SW + + + +L P+ + ++ ++ N T YT
Sbjct: 83 HYLKYYCHTNVSWYASDSIDMPE--ELPVIPQ----PISIRSKCDNRFFLNYCTFGYTMP 136
Query: 180 WWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
+W W WE+ IDWMAL GI +PLA +GQET+W KV
Sbjct: 137 YWKWSDWERLIDWMALNGITMPLAISGQETVWYKV 171
>gi|301107007|ref|XP_002902586.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
infestans T30-4]
gi|262098460|gb|EEY56512.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
infestans T30-4]
Length = 736
Score = 89.7 bits (221), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 57/173 (32%), Positives = 82/173 (47%), Gaps = 16/173 (9%)
Query: 45 RERAPPSVQLAAAYAVLQRLLPSHYSA-FQFRIISKKQCGGEYCFILRNHPSSYIRGTPE 103
++ A P +AA ++ RLL YS R++ + F L + +
Sbjct: 14 KKNAEPHDPVAATRGLIHRLLGDKYSNQITLRVLPSDADDLD-VFELGSDGD-------K 65
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ I+ + + GL WY + + WD +P P+V++ V KR
Sbjct: 66 LEIAANSATAMAYGLQWYFKSVLHAQTDWDN----HKLQLPD--KLPKVEER-VRHKRSS 118
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+YYQN T SY+ W W +WEK IDWMAL GIN+PLAFTGQE +WQ F
Sbjct: 119 KFSYYQNVDTGSYSLWAWSWPQWEKHIDWMALNGINMPLAFTGQEKVWQITFH 171
>gi|383122982|ref|ZP_09943669.1| hypothetical protein BSIG_0276 [Bacteroides sp. 1_1_6]
gi|251841923|gb|EES70003.1| hypothetical protein BSIG_0276 [Bacteroides sp. 1_1_6]
Length = 730
Score = 89.7 bits (221), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 49/155 (31%), Positives = 79/155 (50%), Gaps = 15/155 (9%)
Query: 60 VLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLH 119
+ QRL P H +F+F +++ FIL + +I I G + GL+
Sbjct: 32 MCQRLFPQHADSFEFELLNDSNQMDR--FILES-------AHKKICIKGNNNNSLAVGLN 82
Query: 120 WYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFA 179
YL+Y+C +++SW + + + +L P+ + ++ ++ N T YT
Sbjct: 83 HYLKYYCHTNVSWYASDSIDMPE--ELPVIPQ----PISIRSKCDNRFFLNYCTFGYTMP 136
Query: 180 WWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
+W W WE+ IDWMAL GI +PLA +GQET+W KV
Sbjct: 137 YWKWSDWERLIDWMALNGITMPLAISGQETVWYKV 171
>gi|328867411|gb|EGG15793.1| alpha-N-acetylglucosaminidase [Dictyostelium fasciculatum]
Length = 1501
Score = 89.4 bits (220), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 53/163 (32%), Positives = 79/163 (48%), Gaps = 16/163 (9%)
Query: 58 YAVLQRLLPSHYSAF---QFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEV 114
Y++++R++ F + S GGEY ++L + + I + V +
Sbjct: 774 YSLIERVIGYKQLPFIKLELANTSSSGVGGEYFYLL----TRTLDNVTYIYLQANNAVNL 829
Query: 115 LAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI--PLNYYQNAV 172
++YL+Y+ +W LG + + V V PI L YY N V
Sbjct: 830 AQAFNYYLKYYAMCVYTW-------TGDQCNLGDGRSLPEMPVPVTIPIASDLRYYMNVV 882
Query: 173 TSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
T Y+ WWDW+RW++EIDWMAL G NLPLAF GQE +W V+
Sbjct: 883 TFGYSTVWWDWERWQREIDWMALNGYNLPLAFVGQEYVWFAVY 925
>gi|295087651|emb|CBK69174.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
XB1A]
Length = 703
Score = 89.4 bits (220), Expect = 9e-16, Method: Composition-based stats.
Identities = 54/155 (34%), Positives = 76/155 (49%), Gaps = 17/155 (10%)
Query: 63 RLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYL 122
RL P H F F + L N + +I ISG + GL+ YL
Sbjct: 7 RLFPEHSGNFTFELAPDS---------LENDFFTIESINDKIKISGNNNNSLATGLNHYL 57
Query: 123 RYWCGSHISWDKTGGVQVASMPKLGSFPRVQDA-GVFVKRPIPLNYYQNAVTSSYTFAWW 181
+Y+C +H+SW T ++ MP+ P + D +F K ++ N T Y+ +W
Sbjct: 58 KYYCHTHVSWYATDKIE---MPR--QLPVLLDKITIFAK--CKTRFFLNYCTFGYSMPYW 110
Query: 182 DWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
WK WE+ IDWMAL G+N PLA TGQE IW V++
Sbjct: 111 KWKDWERLIDWMALNGVNTPLAITGQEAIWYDVWK 145
>gi|146300873|ref|YP_001195464.1| alpha-N-acetylglucosaminidase [Flavobacterium johnsoniae UW101]
gi|146155291|gb|ABQ06145.1| Candidate alpha-glycosidase; Glycoside hydrolase family 89
[Flavobacterium johnsoniae UW101]
Length = 723
Score = 89.4 bits (220), Expect = 9e-16, Method: Composition-based stats.
Identities = 56/162 (34%), Positives = 81/162 (50%), Gaps = 19/162 (11%)
Query: 56 AAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIV-ISGVTGVEV 114
+A AV++RL+ + F+ I+ N I T +V I +
Sbjct: 41 SAAAVIERLVGRRVNEFELHIVENNN---------ENKDWFEIETTDNLVKIKASNNTTI 91
Query: 115 LAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTS 174
+ +LR +SW+ G ++ ++PK S+P+ G P P Y NA T
Sbjct: 92 CYAAYNFLRDIGAVLVSWE---GNRI-NLPK--SWPKYSKKG---DTPFPYREYLNACTF 142
Query: 175 SYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YT WWDWKRWE+EIDWMAL GINLP A GQE +WQ++++
Sbjct: 143 GYTTPWWDWKRWEQEIDWMALHGINLPTAMEGQEAVWQELWK 184
>gi|323344412|ref|ZP_08084637.1| alpha-N-acetylglucosaminidase [Prevotella oralis ATCC 33269]
gi|323094539|gb|EFZ37115.1| alpha-N-acetylglucosaminidase [Prevotella oralis ATCC 33269]
Length = 730
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 56/159 (35%), Positives = 84/159 (52%), Gaps = 17/159 (10%)
Query: 59 AVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGL 118
+ +RL+P + FQF I G+ CF L + +IVISG + GL
Sbjct: 32 GLAKRLIPQQSAQFQF--IKLPAHKGKDCFTLASKGG-------KIVISGNNANSMAVGL 82
Query: 119 HWYLRYWCGSHISW-DKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYT 177
++YL +C + +SW + V A +P + S V + +R ++ N T YT
Sbjct: 83 NYYLNNYCRTTVSWYAEVPVVLPAKLPDIKS--PVSSSAKVARR-----FFFNYCTYGYT 135
Query: 178 FAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+++WK WE+ IDWMAL GIN+PLA TGQET+W V++
Sbjct: 136 VPFFNWKEWERLIDWMALNGINMPLAITGQETVWYNVWK 174
>gi|410095990|ref|ZP_11290981.1| hypothetical protein HMPREF1076_00159 [Parabacteroides goldsteinii
CL02T12C30]
gi|409227396|gb|EKN20294.1| hypothetical protein HMPREF1076_00159 [Parabacteroides goldsteinii
CL02T12C30]
Length = 753
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 84/161 (52%), Gaps = 20/161 (12%)
Query: 53 QLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGV 112
Q+ AAY +++R+ P + F+ +I + GE + + + +I++ G V
Sbjct: 22 QIQAAYDLIERVTPGYGQQFKLELI--EPVNGEDAYEIAS-------ADEKILLRGNNTV 72
Query: 113 EVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNY--YQN 170
+ + YL+Y C +H+SW G Q+ +P+ P +K I Y Y N
Sbjct: 73 SLATAYNQYLKYTCNAHVSW---FGSQL-DLPEQLPLPEK-----VIKNTINGKYRVYMN 123
Query: 171 AVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIW 211
T SYT +WWDW+RW+KE+D+MA+ IN+PL+ G E +W
Sbjct: 124 YCTVSYTASWWDWERWQKELDYMAMNSINMPLSVVGLEAVW 164
>gi|281200617|gb|EFA74835.1| alpha-N-acetylglucosaminidase [Polysphondylium pallidum PN500]
Length = 688
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/174 (32%), Positives = 83/174 (47%), Gaps = 12/174 (6%)
Query: 42 IQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGT 101
I D P + + ++ R+L + + F ++ G+ F + + S T
Sbjct: 27 IDDEVNEDPGILVVQG--LISRILGPQFIPY-FNLVLINTTTGQQYFEVSSDTSGR---T 80
Query: 102 PEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKR 161
I I T V + +G+++YL+Y+ +W AS + P V + V +
Sbjct: 81 TTITIKADTPVNLASGINFYLKYYGQCSFTWTGDQCNLTAS-----NLPVVNNP-VSIDM 134
Query: 162 PIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YY N T Y+ WWDW RWE+EIDWMAL G NLPLAF GQE +W +VF
Sbjct: 135 LSQYRYYMNVCTFGYSTVWWDWPRWEREIDWMALNGYNLPLAFVGQEYVWYQVF 188
>gi|348681870|gb|EGZ21686.1| hypothetical protein PHYSODRAFT_495971 [Phytophthora sojae]
Length = 692
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 57/103 (55%), Gaps = 7/103 (6%)
Query: 117 GLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSY 176
GL WYL+ + WD +PK+ D V KR +YYQN T SY
Sbjct: 4 GLQWYLKTALHTQTDWDDHKLQLPYVLPKV-------DKRVRHKRSAKFSYYQNVCTVSY 56
Query: 177 TFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVAF 219
+ W W++WEK IDWMAL GIN+PLAFTGQE +WQ F+ +
Sbjct: 57 SSWAWGWQKWEKHIDWMALNGINMPLAFTGQEKVWQNTFKKHY 99
>gi|429740222|ref|ZP_19273924.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
gi|429153947|gb|EKX96708.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
Length = 730
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 78/162 (48%), Gaps = 15/162 (9%)
Query: 53 QLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGV 112
L ++ RL+P FQF + ++ G+ CF L +I+ISG
Sbjct: 26 DLTTIQGLVTRLIPQKADHFQF--VKQQSKKGKDCFTLTGKDG-------KIIISGNNAN 76
Query: 113 EVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAV 172
+ GL++YL +C + +SW V V K+ + + V R LNY
Sbjct: 77 AMAVGLNYYLNRYCHTTVSW--YAEVPVVLPNKMPEVKGIITSSAKVDRRFFLNY----C 130
Query: 173 TSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
T YT ++ WK WE+ IDWMAL GIN+PLA TGQE +W V
Sbjct: 131 TYGYTLPFFGWKDWERLIDWMALNGINMPLAITGQEMVWYNV 172
>gi|301106961|ref|XP_002902563.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
infestans T30-4]
gi|262098437|gb|EEY56489.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
infestans T30-4]
Length = 684
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 117 GLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSY 176
GL WYL+ + WD +PK+ R KR +YYQN T SY
Sbjct: 4 GLQWYLKTALHTQTDWDNHKLQLPYVLPKVKQHVRH-------KRSAKFSYYQNVCTVSY 56
Query: 177 TFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVAF 219
+ W W +WEK IDWMAL GIN+PLAFTGQE +WQ FQ +
Sbjct: 57 SSWTWGWAQWEKHIDWMALNGINMPLAFTGQEKVWQNTFQKHY 99
>gi|404406328|ref|ZP_10997912.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
Length = 738
Score = 87.4 bits (215), Expect = 4e-15, Method: Composition-based stats.
Identities = 52/168 (30%), Positives = 85/168 (50%), Gaps = 16/168 (9%)
Query: 50 PSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGV 109
P +AAA + R++P + S +F ++ + R + R ++V+ G
Sbjct: 30 PDADVAAAQGLAGRIVPGYASKIRFEKLAPAADSTD-----RFELETVGR---KLVVRGN 81
Query: 110 TGVEVLAGLHWYLRYWCGSHISWDKTGGVQV-ASMPKLGSFPRVQDAGVFVKRPIPLNYY 168
+ GL+ +L+Y+ + +SW V+V A MP + +V + + R ++
Sbjct: 82 NANSMAVGLNHFLKYYARTSVSWFAGQPVEVPAEMPVVAE--KVAHSALLDDR-----FF 134
Query: 169 QNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
N T YT WW W+ WE+ IDWMAL G+ LPLA TGQE +W +V+Q
Sbjct: 135 LNYCTFGYTMVWWQWRDWERLIDWMALNGVTLPLAITGQEAVWARVWQ 182
>gi|282877910|ref|ZP_06286719.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
35310]
gi|281299911|gb|EFA92271.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
35310]
Length = 723
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 55/158 (34%), Positives = 80/158 (50%), Gaps = 14/158 (8%)
Query: 59 AVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGL 118
A++ R+LP H S F+F SK + + F L + I I+G + L
Sbjct: 32 ALVSRILPKHESQFEF-TKSKSKVKAD-AFTLESGKDGRIH------IAGNNANSMAVAL 83
Query: 119 HWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTF 178
++YL +C + +SW V V KL P + + V+R LNY T YT
Sbjct: 84 NYYLNNYCHTTVSW--YAEVPVVVPDKLPGVPTLVKSNAKVERRFFLNY----CTYGYTM 137
Query: 179 AWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
++ WK WE+ IDWMAL G+N+PLA TGQE +W V++
Sbjct: 138 PFFSWKEWERLIDWMALNGVNMPLAITGQEAVWYAVWE 175
>gi|374312699|ref|YP_005059129.1| alpha-N-acetylglucosaminidase [Granulicella mallensis MP5ACTX8]
gi|358754709|gb|AEU38099.1| Alpha-N-acetylglucosaminidase [Granulicella mallensis MP5ACTX8]
Length = 754
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 56/169 (33%), Positives = 80/169 (47%), Gaps = 17/169 (10%)
Query: 47 RAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVI 106
R+ + A VLQR L S + F ++ G + L + +
Sbjct: 45 RSSAETAIEACRGVLQRQLGSRAAEFDLHLLPA--VDGHEVYELSASGG-------RVAV 95
Query: 107 SGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLN 166
+G +GV + G++ YLR C + I+W A +P + RV FV+
Sbjct: 96 AGSSGVSLCRGIYIYLREHCQAMIAWSGRHLDLPARLPDV-EHHRVVCPYKFVQ------ 148
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
Y N T YT A+WDW RWE+E+DWMAL GI +PLA GQE IW +V+
Sbjct: 149 -YLNPCTYGYTMAFWDWPRWERELDWMALHGITMPLALEGQEVIWNRVW 196
>gi|393788556|ref|ZP_10376683.1| hypothetical protein HMPREF1068_02963 [Bacteroides nordii
CL02T12C05]
gi|392654236|gb|EIY47884.1| hypothetical protein HMPREF1068_02963 [Bacteroides nordii
CL02T12C05]
Length = 732
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 49/114 (42%), Positives = 66/114 (57%), Gaps = 7/114 (6%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
I+I G GV + GL+ YLR +C +SW G ++ +P P V++ V V
Sbjct: 65 IIIRGNNGVSIARGLNHYLRNYCHKSVSW---CGNNLSELPV--PLPPVREK-VRVTASF 118
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPL-AFTGQETIWQKVFQ 216
P YY N T SY+ A+WDW++WEKEID MALQGIN+PL A Q +WQ +
Sbjct: 119 PYRYYLNYCTYSYSMAFWDWEQWEKEIDRMALQGINMPLMAVYSQYAVWQNTLR 172
>gi|426238067|ref|XP_004012979.1| PREDICTED: alpha-N-acetylglucosaminidase isoform 2 [Ovis aries]
Length = 739
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 37/57 (64%), Positives = 43/57 (75%)
Query: 159 VKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
V P YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 121 VPAPPQYRYYQNVCTQSYSFLWWDWARWEQEIDWMALNGINLALAWSGQEAIWQRVY 177
>gi|380692804|ref|ZP_09857663.1| putative alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
Length = 709
Score = 86.7 bits (213), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 67/117 (57%), Gaps = 10/117 (8%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLG-SFPRVQDAGVF 158
G ++ + V G++WYL+Y+C +H+S+ + +P L P+V++
Sbjct: 56 GGRKVTVRANNYVSAAFGINWYLKYYCHAHVSFCED------RLPDLPVDLPQVKERHET 109
Query: 159 VKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
V + N+Y N T SYT A+WDWKRWE+EID MAL GIN+P+A G E +W+
Sbjct: 110 V---LSDNFYMNYCTFSYTTAFWDWKRWEREIDLMALSGINMPMAMVGAEVVWRNTL 163
>gi|384417770|ref|YP_005627130.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzicola BLS256]
gi|353460684|gb|AEQ94963.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzicola BLS256]
Length = 798
Score = 86.7 bits (213), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 58/171 (33%), Positives = 86/171 (50%), Gaps = 19/171 (11%)
Query: 48 APPSVQLAA--AYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIV 105
A P LAA A AV+QRL+ + + F+ R+ S+ Y G +
Sbjct: 24 ATPVTALAATPAQAVVQRLIGARATQFEMRVASRGDGADWYRIEA---------GGDAVR 74
Query: 106 ISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPL 165
I+G + V + G + YL + +SW+ G +VA + ++ Q V+ P
Sbjct: 75 IAGSSQVALARGAYAYLGQAGAASMSWE---GDRVALPAQWPAYSSGQ-----VRTPFAH 126
Query: 166 NYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y N T YT +WDW RW++EIDWMAL GI++PLA GQE IWQ +++
Sbjct: 127 RAYLNTCTYGYTTPFWDWPRWQREIDWMALHGIDMPLAMEGQEAIWQALWR 177
>gi|410981277|ref|XP_003996997.1| PREDICTED: alpha-N-acetylglucosaminidase [Felis catus]
Length = 857
Score = 86.7 bits (213), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 35/49 (71%), Positives = 41/49 (83%)
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN T SY+F WWDW RWEKE+DWMAL GINL LA++GQE IWQ+V+
Sbjct: 245 YYQNVCTHSYSFVWWDWARWEKELDWMALNGINLALAWSGQEAIWQRVY 293
>gi|298385999|ref|ZP_06995556.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
1_1_14]
gi|298261227|gb|EFI04094.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
1_1_14]
Length = 715
Score = 86.7 bits (213), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 67/116 (57%), Gaps = 8/116 (6%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G ++ + V G++WYL+Y+C +H+S+ G Q+ +P P+V++
Sbjct: 62 GGRKVTVRANNYVSAAFGINWYLKYYCHAHVSF---CGDQLPQLPV--DLPQVKERHA-- 114
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+ N+Y N T SYT A+W+WKRWE+EID MAL GIN+P+A G E +W+
Sbjct: 115 -TKLSDNFYMNYCTFSYTTAFWNWKRWEREIDLMALSGINMPMAMVGVEAVWRNTL 169
>gi|403304646|ref|XP_003942904.1| PREDICTED: alpha-N-acetylglucosaminidase [Saimiri boliviensis
boliviensis]
Length = 754
Score = 86.3 bits (212), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 36/54 (66%), Positives = 42/54 (77%)
Query: 162 PIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 136 PNRYRYYQNVCTQSYSFVWWDWARWEREIDWMALNGINLALAWSGQEAIWQRVY 189
>gi|355568706|gb|EHH24987.1| Alpha-N-acetylglucosaminidase, partial [Macaca mulatta]
Length = 711
Score = 86.3 bits (212), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 35/49 (71%), Positives = 41/49 (83%)
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 99 YYQNVCTQSYSFVWWDWARWEREIDWMALNGINLALAWSGQEAIWQRVY 147
>gi|1479983|gb|AAB36605.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|119581236|gb|EAW60832.1| N-acetylglucosaminidase, alpha- (Sanfilippo disease IIIB), isoform
CRA_a [Homo sapiens]
Length = 639
Score = 86.3 bits (212), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 35/49 (71%), Positives = 41/49 (83%)
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 27 YYQNVCTQSYSFVWWDWARWEREIDWMALNGINLALAWSGQEAIWQRVY 75
>gi|444714090|gb|ELW54978.1| Alpha-N-acetylglucosaminidase [Tupaia chinensis]
Length = 724
Score = 85.9 bits (211), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 36/54 (66%), Positives = 42/54 (77%)
Query: 162 PIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 96 PNRYRYYQNVCTQSYSFVWWDWARWEQEIDWMALNGINLALAWSGQEAIWQRVY 149
>gi|281200619|gb|EFA74837.1| alpha-N-acetylglucosaminidase [Polysphondylium pallidum PN500]
Length = 199
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 94/194 (48%), Gaps = 10/194 (5%)
Query: 15 ITTITLSTLAVAQSSTIGVQYISRLLDIQD-RERAPPSVQLAAAYAVLQRLLPSHYSAFQ 73
+T + L ++++ +T + DI++ R + P + + ++ R+L S + +
Sbjct: 1 MTRVCLESISIKNINTNIIDIKIEKNDIENNRFQEDPGILVVQG--LISRILGSQFIPY- 57
Query: 74 FRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWD 133
F + K G F + + S+ I IS T V + +G+++YL+Y+ +W
Sbjct: 58 FNLNLKNSTNGSQYFEITSSYSTTNSNNVIINISADTPVNLASGINYYLKYYGQCSFTWT 117
Query: 134 KTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWM 193
AS + P V + V ++ YY N T Y+ WWDW RWE+EIDWM
Sbjct: 118 GDQCNLTAS-----NLPVVNNP-VNIEILSQYRYYMNVCTFGYSTVWWDWPRWEREIDWM 171
Query: 194 ALQGINLPLAFTGQ 207
AL G NLPLAF GQ
Sbjct: 172 ALNGYNLPLAFVGQ 185
>gi|320106778|ref|YP_004182368.1| alpha-N-acetylglucosaminidase [Terriglobus saanensis SP1PR4]
gi|319925299|gb|ADV82374.1| Alpha-N-acetylglucosaminidase [Terriglobus saanensis SP1PR4]
Length = 754
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 64/195 (32%), Positives = 93/195 (47%), Gaps = 19/195 (9%)
Query: 24 AVAQSSTIGVQYISRLLDIQDRERAPP--SVQLAAAYAVLQRLLPSHYSAFQFRIISKKQ 81
A +S+ G + + +E + P + AA A+L R L S F+ ++I +
Sbjct: 19 AALRSNEAGAAVPALPAERTKKESSKPLSETPVQAARALLYRQLGERASQFELKLIPLE- 77
Query: 82 CGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVA 141
GG + + S G + ++G + V + G + YLR C I+W G V
Sbjct: 78 -GGNEVYEI-----SAANG--RVYLAGSSAVAMCRGAYSYLRATCNVMITWS---GRHV- 125
Query: 142 SMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLP 201
+P +FP V P Y N T YT A+WDW RWEKE+DWMAL GI +P
Sbjct: 126 ELP--AAFPDAPMQRVVC--PYKFVQYFNPCTYGYTMAFWDWARWEKELDWMALHGITMP 181
Query: 202 LAFTGQETIWQKVFQ 216
LA GQE IW +V++
Sbjct: 182 LALEGQEAIWDRVWR 196
>gi|357622373|gb|EHJ73879.1| putative alpha-N-acetyl glucosaminidase [Danaus plexippus]
Length = 780
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 51/167 (30%), Positives = 80/167 (47%), Gaps = 16/167 (9%)
Query: 50 PSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGV 109
P VQ AA V+ LL + YS + + F LR ++ I
Sbjct: 32 PEVQRQAALDVINSLLRNQYSFVAVDVNPILFNDHKDVFSLRT-------DNGKLNIRAS 84
Query: 110 TGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQ 169
+GV + G ++YL+ +C S ++W QV+ + + P + + R YYQ
Sbjct: 85 SGVAAVWGFNYYLKKYCKSQVAW------QVSRVVIPSTIPEADETVIANDR---FRYYQ 135
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
N T+SY+F WWD + W++ + WMAL GIN+ LA QE W +V++
Sbjct: 136 NVCTASYSFVWWDTEDWKRHVTWMALNGINMALAPVAQEAAWTRVYK 182
>gi|426238065|ref|XP_004012978.1| PREDICTED: alpha-N-acetylglucosaminidase isoform 1 [Ovis aries]
Length = 748
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 35/49 (71%), Positives = 41/49 (83%)
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 138 YYQNVCTQSYSFLWWDWARWEQEIDWMALNGINLALAWSGQEAIWQRVY 186
>gi|297723521|ref|NP_001174124.1| Os04g0650900 [Oryza sativa Japonica Group]
gi|255675839|dbj|BAH92852.1| Os04g0650900, partial [Oryza sativa Japonica Group]
Length = 128
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 37/40 (92%), Positives = 39/40 (97%)
Query: 177 TFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+FAWWDW+RWEKEIDWMALQGINLPLAFTGQE IWQKVFQ
Sbjct: 1 SFAWWDWERWEKEIDWMALQGINLPLAFTGQEAIWQKVFQ 40
>gi|301773566|ref|XP_002922216.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Ailuropoda
melanoleuca]
Length = 634
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 34/49 (69%), Positives = 41/49 (83%)
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN T SY+F WWDW RWE+E+DWMAL GINL LA++GQE IWQ+V+
Sbjct: 22 YYQNVCTHSYSFVWWDWARWERELDWMALNGINLALAWSGQEAIWQRVY 70
>gi|281344539|gb|EFB20123.1| hypothetical protein PANDA_011160 [Ailuropoda melanoleuca]
Length = 619
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 34/49 (69%), Positives = 41/49 (83%)
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN T SY+F WWDW RWE+E+DWMAL GINL LA++GQE IWQ+V+
Sbjct: 8 YYQNVCTHSYSFVWWDWARWERELDWMALNGINLALAWSGQEAIWQRVY 56
>gi|423248233|ref|ZP_17229249.1| hypothetical protein HMPREF1066_00259 [Bacteroides fragilis
CL03T00C08]
gi|423253182|ref|ZP_17234113.1| hypothetical protein HMPREF1067_00757 [Bacteroides fragilis
CL03T12C07]
gi|392657082|gb|EIY50719.1| hypothetical protein HMPREF1067_00757 [Bacteroides fragilis
CL03T12C07]
gi|392660340|gb|EIY53954.1| hypothetical protein HMPREF1066_00259 [Bacteroides fragilis
CL03T00C08]
Length = 732
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/115 (38%), Positives = 63/115 (54%), Gaps = 8/115 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+++I G G+ + GL+ Y+RY+ SW +S+P V + V
Sbjct: 67 KLIIRGNNGISMARGLNHYMRYYLRKTTSWTGHNLSYTSSLP-------VVKEKIRVNAT 119
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPL-AFTGQETIWQKVFQ 216
+PL YY N T SY+ A+WDW +WE EID MA+QGIN+PL A GQ +WQ +
Sbjct: 120 LPLRYYLNYCTYSYSMAFWDWTQWEDEIDRMAMQGINMPLVAVIGQYAVWQNTLR 174
>gi|423269877|ref|ZP_17248849.1| hypothetical protein HMPREF1079_01931 [Bacteroides fragilis
CL05T00C42]
gi|423272668|ref|ZP_17251615.1| hypothetical protein HMPREF1080_00268 [Bacteroides fragilis
CL05T12C13]
gi|392700723|gb|EIY93885.1| hypothetical protein HMPREF1079_01931 [Bacteroides fragilis
CL05T00C42]
gi|392708745|gb|EIZ01850.1| hypothetical protein HMPREF1080_00268 [Bacteroides fragilis
CL05T12C13]
Length = 732
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/115 (38%), Positives = 63/115 (54%), Gaps = 8/115 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+++I G G+ + GL+ Y+RY+ SW +S+P V + V
Sbjct: 67 KLIIRGNNGISMARGLNHYMRYYLRKTTSWTGHNLSYTSSLP-------VVKEKIRVNAT 119
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPL-AFTGQETIWQKVFQ 216
+PL YY N T SY+ A+WDW +WE EID MA+QGIN+PL A GQ +WQ +
Sbjct: 120 LPLRYYLNYCTYSYSMAFWDWTQWEDEIDRMAMQGINMPLVAVIGQYAVWQNTLR 174
>gi|440903235|gb|ELR53922.1| Alpha-N-acetylglucosaminidase, partial [Bos grunniens mutus]
Length = 614
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 35/49 (71%), Positives = 41/49 (83%)
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 4 YYQNVCTQSYSFLWWDWARWEQEIDWMALNGINLALAWSGQEAIWQRVY 52
>gi|393788286|ref|ZP_10376416.1| hypothetical protein HMPREF1068_02696 [Bacteroides nordii
CL02T12C05]
gi|392655959|gb|EIY49600.1| hypothetical protein HMPREF1068_02696 [Bacteroides nordii
CL02T12C05]
Length = 757
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/160 (29%), Positives = 83/160 (51%), Gaps = 18/160 (11%)
Query: 53 QLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGT-PEIVISGVTG 111
++ A Y +++R+ P + S ++ +I+ Y + G ++++ G T
Sbjct: 25 RVKAVYDLIERVTPGYSSQYRLELIAPDNGSDVY----------EVDGNGKQVILRGNTS 74
Query: 112 VEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNA 171
V + +WYL+Y C +H+SW G Q+ +PK P ++ + R Y N
Sbjct: 75 VALATAFNWYLKYTCQAHVSW---FGNQL-QLPKKLPQPLAKERRLINGR---YRVYMNY 127
Query: 172 VTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIW 211
T SYT AWW+W+RW++E+D+MA+ IN+PL G + +W
Sbjct: 128 CTVSYTAAWWNWERWQRELDFMAMNSINMPLFTIGLDAVW 167
>gi|424665881|ref|ZP_18102917.1| hypothetical protein HMPREF1205_01756 [Bacteroides fragilis HMW
616]
gi|404574134|gb|EKA78885.1| hypothetical protein HMPREF1205_01756 [Bacteroides fragilis HMW
616]
Length = 732
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 43/115 (37%), Positives = 64/115 (55%), Gaps = 8/115 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+++I G G+ + GL+ Y+RY+ S SW +S+P + + V
Sbjct: 67 KLIIRGNNGISMARGLNHYMRYYLHSTTSWTGHNLSYTSSLP-------IIKEKIHVNAT 119
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPL-AFTGQETIWQKVFQ 216
+PL YY N T SY+ A+W+W +WE EID MA+QGIN+PL A GQ +WQ +
Sbjct: 120 LPLRYYLNYCTYSYSMAFWNWTQWEDEIDRMAMQGINMPLVAVIGQYAVWQNTLR 174
>gi|423345423|ref|ZP_17323112.1| hypothetical protein HMPREF1060_00784 [Parabacteroides merdae
CL03T12C32]
gi|409223209|gb|EKN16146.1| hypothetical protein HMPREF1060_00784 [Parabacteroides merdae
CL03T12C32]
Length = 752
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/167 (30%), Positives = 82/167 (49%), Gaps = 23/167 (13%)
Query: 47 RAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVI 106
+ P +Q AAY +++R+ P + F+ ++ Y N ++V+
Sbjct: 18 KNDPRIQ--AAYDLIERVTPGYGEQFKLELMEPIDGMDAYEITSDNG---------KVVL 66
Query: 107 SGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLN 166
G + + + YL+Y C +H+SW G Q+ +PK P VK I
Sbjct: 67 RGNNTISLATAFNQYLKYTCNAHVSW---FGNQL-DLPKQLPMP------APVKNTINGK 116
Query: 167 Y--YQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIW 211
Y Y N T SY+ AWWDW+RW++E+D+MA+ IN+PL+ G E +W
Sbjct: 117 YRVYMNYCTVSYSAAWWDWERWQRELDFMAMNSINMPLSVVGLEAVW 163
>gi|154492110|ref|ZP_02031736.1| hypothetical protein PARMER_01741 [Parabacteroides merdae ATCC
43184]
gi|154087335|gb|EDN86380.1| Alpha-N-acetylglucosaminidase (NAGLU) [Parabacteroides merdae ATCC
43184]
Length = 752
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/167 (30%), Positives = 82/167 (49%), Gaps = 23/167 (13%)
Query: 47 RAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVI 106
+ P +Q AAY +++R+ P + F+ ++ Y N ++V+
Sbjct: 18 KNDPRIQ--AAYDLIERVTPGYGEQFKLELMEPIDGMDAYEITSDNG---------KVVL 66
Query: 107 SGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLN 166
G + + + YL+Y C +H+SW G Q+ +PK P VK I
Sbjct: 67 RGNNTISLATAFNQYLKYTCNAHVSW---FGNQL-DLPKQLPMP------APVKNTINGK 116
Query: 167 Y--YQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIW 211
Y Y N T SY+ AWWDW+RW++E+D+MA+ IN+PL+ G E +W
Sbjct: 117 YRVYMNYCTVSYSAAWWDWERWQRELDFMAMNSINMPLSVVGLEAVW 163
>gi|423722278|ref|ZP_17696454.1| hypothetical protein HMPREF1078_00517 [Parabacteroides merdae
CL09T00C40]
gi|409242419|gb|EKN35181.1| hypothetical protein HMPREF1078_00517 [Parabacteroides merdae
CL09T00C40]
Length = 752
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/167 (30%), Positives = 82/167 (49%), Gaps = 23/167 (13%)
Query: 47 RAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVI 106
+ P +Q AAY +++R+ P + F+ ++ Y N ++V+
Sbjct: 18 KNDPRIQ--AAYDLIERVTPGYGEQFKLELMEPIDGMDAYEITSDNG---------KVVL 66
Query: 107 SGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLN 166
G + + + YL+Y C +H+SW G Q+ +PK P VK I
Sbjct: 67 RGNNTISLATAFNQYLKYTCNAHVSW---FGNQL-DLPKQLPMP------APVKNTINGK 116
Query: 167 Y--YQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIW 211
Y Y N T SY+ AWWDW+RW++E+D+MA+ IN+PL+ G E +W
Sbjct: 117 YRVYMNYCTVSYSAAWWDWERWQRELDFMAMNSINMPLSVVGLEAVW 163
>gi|373461342|ref|ZP_09553084.1| hypothetical protein HMPREF9944_01348 [Prevotella maculosa OT 289]
gi|371952896|gb|EHO70729.1| hypothetical protein HMPREF9944_01348 [Prevotella maculosa OT 289]
Length = 731
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 80/158 (50%), Gaps = 17/158 (10%)
Query: 59 AVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGL 118
++ R+ P++ F+F + + F +R+ IVI+G + + + GL
Sbjct: 30 GLVSRIAPAYADRFRFELTADTV----ERFTVRSEGR-------RIVITGRSAIAMAVGL 78
Query: 119 HWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTF 178
+ YLR +C + +SW + + +P+ PRV R + ++ N T YT
Sbjct: 79 NHYLRRYCLTEVSWRRINPI---VLPR--ELPRVSRTMTGRAR-VAERFFLNYCTFGYTM 132
Query: 179 AWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
WW W WE+ IDWMAL G+ +PLA TG E +WQ+V++
Sbjct: 133 PWWRWADWERLIDWMALNGVTMPLAITGTEAVWQRVWR 170
>gi|195473052|ref|XP_002088810.1| GE10991 [Drosophila yakuba]
gi|194174911|gb|EDW88522.1| GE10991 [Drosophila yakuba]
Length = 778
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 62/194 (31%), Positives = 92/194 (47%), Gaps = 21/194 (10%)
Query: 22 TLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQ 81
L A + +G+Q ++RL P VQ +AA AV+ R++ S F++ KQ
Sbjct: 26 NLDAASGAEMGLQMVARLA-----PSTPTDVQESAAMAVISRVIGEQSSQL-FKVHVNKQ 79
Query: 82 CGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVA 141
LR+ + + I++ G GV V H YL+Y + W K
Sbjct: 80 MD------LRSFQITMLNDG-RILLMGWDGVSVCKAFHHYLKYVLNKDVDWFKMR----I 128
Query: 142 SMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLP 201
+P P V + K P Y+QN T SY+FAWW +K+W + +DWMAL GI+L
Sbjct: 129 ELPTNLQLPNVT---IESKSASPFIYHQNVCTWSYSFAWWSFKQWRRHLDWMALMGISLT 185
Query: 202 LAFTGQETIWQKVF 215
+A QE IW +V+
Sbjct: 186 IAPV-QEDIWVEVY 198
>gi|118370728|ref|XP_001018564.1| alpha-N-acetylglucosaminidase precursor [Tetrahymena thermophila]
gi|89300331|gb|EAR98319.1| alpha-N-acetylglucosaminidase precursor [Tetrahymena thermophila
SB210]
Length = 879
Score = 84.3 bits (207), Expect = 3e-14, Method: Composition-based stats.
Identities = 42/115 (36%), Positives = 64/115 (55%), Gaps = 8/115 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++VI T + + GL ++++ + S ISW+ Q+ +P + R+Q P
Sbjct: 87 QLVIQSNTKIGLSKGLFYFMKNYMNSSISWNGDNIQQLEYLPTVSEQIRIQT-------P 139
Query: 163 IPLNYYQNAVTSSYTF-AWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y N T SY+ ++WDW+RWE+EID+MALQGIN+PLA G IWQ +
Sbjct: 140 YQFRYMFNYCTYSYSLMSFWDWQRWEREIDYMALQGINMPLAIIGTSKIWQNTLK 194
>gi|410097657|ref|ZP_11292638.1| hypothetical protein HMPREF1076_01816 [Parabacteroides goldsteinii
CL02T12C30]
gi|409223747|gb|EKN16682.1| hypothetical protein HMPREF1076_01816 [Parabacteroides goldsteinii
CL02T12C30]
Length = 740
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/161 (32%), Positives = 78/161 (48%), Gaps = 20/161 (12%)
Query: 53 QLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGV 112
Q+ AAY +++R+ P + F +I + Y N ++V+ G V
Sbjct: 24 QIQAAYDLIERITPGYSQQFSLELIEPENGNDVYEVAGENG---------KVVLRGNNTV 74
Query: 113 EVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNY--YQN 170
+ + YL+Y C +H+SW G Q+ ++P P R I Y Y N
Sbjct: 75 SLATAYNQYLKYHCNAHVSW---FGDQL-NLPATLPVP-----AETTHRIINGKYRVYFN 125
Query: 171 AVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIW 211
T SYT AWWDW+RW++EID+MA+ IN PL+ G E +W
Sbjct: 126 YCTLSYTGAWWDWERWQREIDYMAMNSINTPLSVVGLEGVW 166
>gi|160883168|ref|ZP_02064171.1| hypothetical protein BACOVA_01137 [Bacteroides ovatus ATCC 8483]
gi|156111393|gb|EDO13138.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
8483]
Length = 737
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 78/159 (49%), Gaps = 16/159 (10%)
Query: 56 AAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVL 115
A +++RL P + S F F K I ++ +G + I G +
Sbjct: 28 AMQQMVERLFPEYASQFSFEQSEK---------IDKDWYEIEAQGGT-VRIRGNNANSMA 77
Query: 116 AGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSS 175
GL++YL ++C + +SW V+ MP++ P A + ++ N T
Sbjct: 78 VGLNYYLNHYCLTSVSWYVNDTVE---MPEVLPMP---PAKIISTARCKNRFFLNYCTFG 131
Query: 176 YTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
YT WW WK WE+ IDWMAL GIN+PLA TGQE++W +V
Sbjct: 132 YTMPWWTWKDWERLIDWMALNGINMPLAITGQESVWYRV 170
>gi|194759443|ref|XP_001961958.1| GF14678 [Drosophila ananassae]
gi|190615655|gb|EDV31179.1| GF14678 [Drosophila ananassae]
Length = 783
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/168 (33%), Positives = 82/168 (48%), Gaps = 18/168 (10%)
Query: 49 PPSVQLAAAYAVLQRLL-PSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVIS 107
PP VQ+ AA AV++RL+ P FQ + + L N I+++
Sbjct: 53 PPEVQVEAARAVIRRLVGPKSAKRFQVEVDKDLELRSFKITKLDNG---------RILLT 103
Query: 108 GVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNY 167
G GV H YL+Y+ + W K +L R+ +A V P+ Y
Sbjct: 104 GWDGVCACKAFHHYLKYYLNKDVDWFKM-------RIELPDNLRLPNATVVSISASPIIY 156
Query: 168 YQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+QN T SY+FAWW +++W + +DWMAL GINL +A QE IW +V+
Sbjct: 157 HQNVCTWSYSFAWWSFQQWRRHLDWMALMGINLSIAPI-QEAIWVEVY 203
>gi|423292430|ref|ZP_17271008.1| hypothetical protein HMPREF1069_06051 [Bacteroides ovatus
CL02T12C04]
gi|423294620|ref|ZP_17272747.1| hypothetical protein HMPREF1070_01412 [Bacteroides ovatus
CL03T12C18]
gi|392661665|gb|EIY55241.1| hypothetical protein HMPREF1069_06051 [Bacteroides ovatus
CL02T12C04]
gi|392675811|gb|EIY69252.1| hypothetical protein HMPREF1070_01412 [Bacteroides ovatus
CL03T12C18]
Length = 727
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 79/160 (49%), Gaps = 16/160 (10%)
Query: 56 AAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVL 115
A +++RL P + S F F ++ ++ I + IRG +
Sbjct: 18 AMQQMVERLFPEYASQFSFE--QSEKIDKDWYEIEAQGGTVRIRGN--------NANSMA 67
Query: 116 AGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSS 175
GL++YL ++C + +SW V+ MP++ P A + ++ N T
Sbjct: 68 VGLNYYLNHYCLTSVSWYVNDTVE---MPEVLPMP---PAKIISTARCKNRFFLNYCTFG 121
Query: 176 YTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YT WW WK WE+ IDWMAL GIN+PLA TGQE++W +V+
Sbjct: 122 YTMPWWTWKDWERLIDWMALNGINMPLAITGQESVWYRVW 161
>gi|372221472|ref|ZP_09499893.1| alpha-N-acetylglucosaminidase [Mesoflavibacter zeaxanthinifaciens
S86]
Length = 712
Score = 84.0 bits (206), Expect = 4e-14, Method: Composition-based stats.
Identities = 42/114 (36%), Positives = 60/114 (52%), Gaps = 9/114 (7%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
E+ I + + + G ++YL+ + SW+ T A PK + + P
Sbjct: 72 EVRIKASSKIALTYGAYYYLKQIGAAFNSWEGTALDLPAKWPK---------KKLRKQTP 122
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y N VT YT WWDW+RW +EI+WMAL GIN+P A GQE IWQK+++
Sbjct: 123 FKFRKYLNVVTYGYTTPWWDWERWNQEINWMALHGINMPTAMEGQEYIWQKLWK 176
>gi|237719130|ref|ZP_04549611.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|229451509|gb|EEO57300.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
Length = 737
Score = 83.6 bits (205), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 54/164 (32%), Positives = 80/164 (48%), Gaps = 26/164 (15%)
Query: 56 AAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVL 115
A +++RL P + S F F K I ++ +G + I G +
Sbjct: 28 AMQQMVERLFPEYASQFSFEQSEK---------IDKDWYEIEAQGGT-VRIRGNNANSMA 77
Query: 116 AGLHWYLRYWCGSHISW---DKTGGVQVASMP--KLGSFPRVQDAGVFVKRPIPLNYYQN 170
GL++YL ++C + +SW D +V MP K+ S R ++ ++ N
Sbjct: 78 VGLNYYLNHYCLTSVSWYVNDTVEMPEVLPMPPAKITSTARCKN-----------RFFLN 126
Query: 171 AVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
T YT WW WK WE+ IDWMAL GIN+PLA TGQE++W +V
Sbjct: 127 YCTFGYTMPWWTWKDWERLIDWMALNGINMPLAITGQESVWYRV 170
>gi|295086519|emb|CBK68042.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
XB1A]
Length = 727
Score = 83.6 bits (205), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 53/165 (32%), Positives = 81/165 (49%), Gaps = 26/165 (15%)
Query: 56 AAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVL 115
A +++RL P + S F F ++ ++ I + IRG +
Sbjct: 18 AMQQMVERLFPEYASQFSFE--QSEKIDKDWYEIEAQGGTVRIRGN--------NANSMA 67
Query: 116 AGLHWYLRYWCGSHISW---DKTGGVQVASMP--KLGSFPRVQDAGVFVKRPIPLNYYQN 170
GL++YL ++C + +SW D +V MP K+ S R ++ ++ N
Sbjct: 68 VGLNYYLNHYCLTSVSWYVNDTVEMPEVLPMPPAKITSTARCKN-----------RFFLN 116
Query: 171 AVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
T YT WW WK WE+ IDWMAL GIN+PLA TGQE++W +V+
Sbjct: 117 YCTFGYTMPWWTWKDWERLIDWMALNGINMPLAITGQESVWYRVW 161
>gi|224026593|ref|ZP_03644959.1| hypothetical protein BACCOPRO_03350 [Bacteroides coprophilus DSM
18228]
gi|224019829|gb|EEF77827.1| hypothetical protein BACCOPRO_03350 [Bacteroides coprophilus DSM
18228]
Length = 635
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 50/164 (30%), Positives = 79/164 (48%), Gaps = 16/164 (9%)
Query: 48 APPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVIS 107
A P + AAYA+++R+ P + F +IS ++ GE + + + ++V+
Sbjct: 20 AKPVSPVEAAYALIERVTPGYGDQFVLELISPQE--GEDVYEISSKDG-------KVVLR 70
Query: 108 GVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNY 167
G V + + YL+Y C HISW +P R G +
Sbjct: 71 GNNPVALATAFNQYLKYVCKVHISWLGDHLDLPEKLPLPSETVRNTIHGKY-------RV 123
Query: 168 YQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIW 211
Y N T SY+ AWW+W RW++EID+MA+ IN+PL+ G E +W
Sbjct: 124 YMNYCTLSYSAAWWNWDRWQREIDYMAMNSINMPLSVVGLEAVW 167
>gi|383114162|ref|ZP_09934927.1| hypothetical protein BSGG_1664 [Bacteroides sp. D2]
gi|382948607|gb|EFS30964.2| hypothetical protein BSGG_1664 [Bacteroides sp. D2]
Length = 727
Score = 83.2 bits (204), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 54/165 (32%), Positives = 81/165 (49%), Gaps = 26/165 (15%)
Query: 56 AAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVL 115
A +++RL P + S F F K I ++ +G + I G +
Sbjct: 18 AMQQMVERLFPEYASQFSFEQSEK---------IDKDWYEIEAQGGT-VRIRGNNANSMA 67
Query: 116 AGLHWYLRYWCGSHISW---DKTGGVQVASMP--KLGSFPRVQDAGVFVKRPIPLNYYQN 170
GL++YL ++C + +SW D +V MP K+ S R ++ ++ N
Sbjct: 68 VGLNYYLNHYCLTSVSWYVNDTVEMPEVLPMPPAKITSTARCKN-----------RFFLN 116
Query: 171 AVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
T YT WW WK WE+ IDWMAL GIN+PLA TGQE++W +V+
Sbjct: 117 YCTFGYTMPWWTWKDWERLIDWMALNGINMPLAITGQESVWYRVW 161
>gi|195115262|ref|XP_002002183.1| GI17241 [Drosophila mojavensis]
gi|193912758|gb|EDW11625.1| GI17241 [Drosophila mojavensis]
Length = 773
Score = 83.2 bits (204), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)
Query: 50 PSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSY---IRGTPEIVI 106
P VQ AA ++QR++ S + F++ E IL PSSY +++I
Sbjct: 38 PDVQEDAAKQLIQRVIGLRASQW-FQV--------EVNRIL--DPSSYQISRLDNGKVLI 86
Query: 107 SGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLN 166
+G GV V G H+YL++ W KT +P+ P V K PL
Sbjct: 87 TGSNGVSVCKGFHYYLKFVLKKDFDWFKTH----IELPEDWDPPNVTHRS---KSSSPLI 139
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YYQN SY+F WW + W + IDWMA+ GINL +A QETIWQ V+
Sbjct: 140 YYQNPCVWSYSFVWWQFDEWRRHIDWMAMMGINLVIA-PNQETIWQDVY 187
>gi|47188476|emb|CAF93158.1| unnamed protein product [Tetraodon nigroviridis]
Length = 52
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 36/50 (72%), Positives = 40/50 (80%)
Query: 165 LNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
YYQN T SY+ WWDW RWE+EIDWMAL GINLPLAFTGQE +WQ+V
Sbjct: 2 FRYYQNVCTFSYSSVWWDWPRWEREIDWMALNGINLPLAFTGQEALWQEV 51
>gi|29832531|ref|NP_827165.1| alpha-N-acetylglucosaminidase [Streptomyces avermitilis MA-4680]
gi|29609651|dbj|BAC73700.1| putative alpha-N-acetylglucosaminidase, secreted [Streptomyces
avermitilis MA-4680]
Length = 1038
Score = 82.8 bits (203), Expect = 8e-14, Method: Composition-based stats.
Identities = 48/167 (28%), Positives = 79/167 (47%), Gaps = 16/167 (9%)
Query: 50 PSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGV 109
P++ A A + L RLLP H F+ ++ + + + GT I +SG
Sbjct: 38 PALDTAPARSALNRLLPGHAGQFRLSLVGRTRGRDRF---------RVTGGTGRIQVSGT 88
Query: 110 TGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQ 169
T +L G+HWYL+Y CG+H++W+ G Q+ +L + R + + LN
Sbjct: 89 TPAVLLTGVHWYLKYVCGAHLAWN---GGQLDLPRRLPAPARPLERSTALSHRFALNDTN 145
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+ T+ Y DW WE +ID +AL G N + G E ++ +V +
Sbjct: 146 DGYTAPYA----DWSYWEHQIDLLALHGCNEVMVIAGTEAVYHRVLK 188
>gi|449541596|gb|EMD32579.1| glycoside hydrolase family 89 protein [Ceriporiopsis subvermispora
B]
Length = 754
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 92/177 (51%), Gaps = 26/177 (14%)
Query: 48 APPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVIS 107
A S L YA+++R LP H S+F FR+ + G+ F++ + P+++ RG I +
Sbjct: 21 ATTSATLDGLYALVKRRLPDHASSFTFRL---TESDGD-SFVISDAPATH-RG---ITVE 72
Query: 108 GVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI---- 163
T GL+ YL + I W + +LG PR A V +P+
Sbjct: 73 CTTVSACARGLYTYLTDFGEVDIWWTGS---------RLGQLPRELPA---VGKPVHGAA 120
Query: 164 --PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVA 218
P Y+ N VT YT A+W +++WE E+DW+AL+G+NLPLA+ G E I +V + A
Sbjct: 121 IVPYRYHFNTVTFDYTAAFWTFEQWELELDWLALRGVNLPLAWVGYEYILIEVLRDA 177
>gi|399028591|ref|ZP_10729778.1| Alpha-N-acetylglucosaminidase (NAGLU) [Flavobacterium sp. CF136]
gi|398073682|gb|EJL64846.1| Alpha-N-acetylglucosaminidase (NAGLU) [Flavobacterium sp. CF136]
Length = 727
Score = 82.8 bits (203), Expect = 9e-14, Method: Composition-based stats.
Identities = 52/163 (31%), Positives = 81/163 (49%), Gaps = 20/163 (12%)
Query: 56 AAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVL 115
++ AVL RL+ + F+ II K F + + + I + +
Sbjct: 41 SSMAVLARLIGNRADEFELSIIENKDQTNSDWFEVEATGN-------RVKIQATSNTAIC 93
Query: 116 AGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNY--YQNAVT 173
+ +LR +SW+ G ++ +PK ++P+ K+ P Y Y NA T
Sbjct: 94 YAAYTFLRDIGAVLVSWE---GNRI-DLPK--AWPKYSK-----KQSTPFKYREYLNACT 142
Query: 174 SSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YT WWDW+RWE+EIDWMAL G+NLP A GQE +WQ++++
Sbjct: 143 FGYTTPWWDWERWEQEIDWMALHGVNLPTAMEGQEAVWQQLWK 185
>gi|329851961|ref|ZP_08266642.1| alpha-N-acetylglucosaminidase NAGLU family protein [Asticcacaulis
biprosthecum C19]
gi|328839810|gb|EGF89383.1| alpha-N-acetylglucosaminidase NAGLU family protein [Asticcacaulis
biprosthecum C19]
Length = 731
Score = 82.4 bits (202), Expect = 1e-13, Method: Composition-based stats.
Identities = 56/173 (32%), Positives = 80/173 (46%), Gaps = 23/173 (13%)
Query: 48 APPSVQLAA----AYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPE 103
A PSV A A AVL RL+ + F I Q G + H
Sbjct: 5 ALPSVAAATPAHPAQAVLTRLIGRRANDFMLDI----QPGSAPAYAYDAHGG-------R 53
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ ++ + V + G + YL + +H SW+ +P+ GS P V P
Sbjct: 54 VTVTASSPVAAVRGAYAYLSDFGLAHTSWEGDRVALPRRLPQGGSGP--------VTSPF 105
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y N T YT +WDWKRW +EIDWMAL GI++PLA GQE +W+++++
Sbjct: 106 AHRVYMNTCTFGYTTPFWDWKRWRREIDWMALHGIDMPLAMEGQEWVWRELWR 158
>gi|281210062|gb|EFA84230.1| hypothetical protein PPL_03307 [Polysphondylium pallidum PN500]
Length = 744
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 46/113 (40%), Positives = 62/113 (54%), Gaps = 8/113 (7%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGG-VQVASMPKLGSFPRVQDAGVFVKRP 162
+ + G V + ++YL+Y I+W TG Q+ + KL P A V V+ P
Sbjct: 84 VTLLGNNAVNLAQAFNYYLKYTAKCSITW--TGNQCQLRAGTKL-PLP----ANVQVEIP 136
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YY N T Y+ WWDW+RWE+EIDWMAL G NLPLA GQE +W ++
Sbjct: 137 HKYRYYMNTCTFGYSTVWWDWERWEREIDWMALNGYNLPLAQVGQEYVWNELM 189
>gi|84625359|ref|YP_452731.1| hypothetical protein XOO_3702 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|84369299|dbj|BAE70457.1| truncated N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae
MAFF 311018]
Length = 369
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 56/171 (32%), Positives = 83/171 (48%), Gaps = 19/171 (11%)
Query: 48 APPSVQLAA--AYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIV 105
A P +AA A AV+QRL+ + + F+ R+ + G ++ I G +
Sbjct: 24 ATPVTAMAAPPAQAVVQRLIGARATQFEMRVAPRGD-GADWYRI--------DAGGDTVR 74
Query: 106 ISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPL 165
I+G + V + G + YL + +SW+ A P S V+ P
Sbjct: 75 IAGSSQVALARGAYAYLGQAGAASMSWEGDRLALPAQWPAYSSG--------QVRTPFAH 126
Query: 166 NYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y N T YT +WDW RW++EIDWMAL GI++PLA GQE IWQ +++
Sbjct: 127 RAYLNTCTYGYTTPFWDWPRWQREIDWMALHGIDMPLAMEGQEAIWQALWR 177
>gi|358391826|gb|EHK41230.1| glycoside hydrolase family 89 protein [Trichoderma atroviride IMI
206040]
Length = 751
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 56/163 (34%), Positives = 86/163 (52%), Gaps = 13/163 (7%)
Query: 55 AAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEV 114
A ++++R LP+H +F+F + S ++ S R +I I G T +
Sbjct: 26 AGVESLVKRRLPNHAKSFEFALESPSHGS---TLTNDSYTVSSTR-DGKIRIEGTTTSAL 81
Query: 115 LAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQD--AGVFVKRPIPLNYYQNAV 172
L+GLH YL I W G Q+ P+ + P+++ G V +P Y+ N V
Sbjct: 82 LSGLHKYLSSEANVDIWW--YIGSQLDQAPR--NLPKLKSPLTGTSV---VPYRYHWNTV 134
Query: 173 TSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
T+SYT A+W W+ WE ++DWMAL+G+NL LA+ G E I+ VF
Sbjct: 135 TTSYTSAFWTWEDWETQLDWMALRGVNLALAWIGVEKIFIDVF 177
>gi|380512475|ref|ZP_09855882.1| N-acetylglucosaminidase [Xanthomonas sacchari NCPPB 4393]
Length = 785
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 56/166 (33%), Positives = 83/166 (50%), Gaps = 17/166 (10%)
Query: 51 SVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVT 110
Q A VL R L A + R+ + + GG + + + + +SG +
Sbjct: 49 DTQGGTAREVLLRTLGPR--AAELRLQRQPRGGGNDWYQIAAEAGT-------LRVSGSS 99
Query: 111 GVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQN 170
V + G + YL+ + +SW+ G +VA +P ++ VQ G V P Y N
Sbjct: 100 EVALAHGAYSYLQSIGAASVSWE---GSRVA-LP--AAYADVQ--GPRVATPFAHRAYLN 151
Query: 171 AVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
T YT WWDW RWE+EIDWMAL GI++PLA GQE +WQ +++
Sbjct: 152 VCTYGYTTPWWDWPRWEREIDWMALHGIDMPLAMEGQEYVWQALWR 197
>gi|153808241|ref|ZP_01960909.1| hypothetical protein BACCAC_02529 [Bacteroides caccae ATCC 43185]
gi|423219048|ref|ZP_17205544.1| hypothetical protein HMPREF1061_02317 [Bacteroides caccae
CL03T12C61]
gi|149129144|gb|EDM20360.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
43185]
gi|392625814|gb|EIY19870.1| hypothetical protein HMPREF1061_02317 [Bacteroides caccae
CL03T12C61]
Length = 752
Score = 82.4 bits (202), Expect = 1e-13, Method: Composition-based stats.
Identities = 47/161 (29%), Positives = 84/161 (52%), Gaps = 20/161 (12%)
Query: 53 QLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGV 112
+ A YA+++R+ P + + ++I GE + + + ++++ G +
Sbjct: 22 HIKAVYALIERVTPGYGKQIKLQLIDPAN--GEDVYEISSEKG-------KVLLKGNNAI 72
Query: 113 EVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNY--YQN 170
+ + YL+Y C +H+SW G Q+ ++P+ P+ ++ I Y Y N
Sbjct: 73 ALSTAFNQYLKYTCNAHVSWL---GNQL-NLPENLPLPQKT-----IRNTINGKYRVYMN 123
Query: 171 AVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIW 211
T SYT A+WDW+RW++EID+MA+ IN+PLA G E +W
Sbjct: 124 YCTVSYTAAYWDWERWQREIDFMAMNSINMPLATVGLEAVW 164
>gi|340514474|gb|EGR44736.1| glycoside hydrolase family 89 [Trichoderma reesei QM6a]
Length = 762
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 64/207 (30%), Positives = 96/207 (46%), Gaps = 32/207 (15%)
Query: 14 FITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQ 73
F +T+S A+ S++I A + A A+++R LP H +F
Sbjct: 9 FAGLVTISAAAITTSASI----------------ASATCSTAGIEALVRRRLPQHEHSFT 52
Query: 74 FRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWD 133
F + + K+ G E + H + + G + +L GLH YL I W
Sbjct: 53 FSLSANKESGLESYAVTTTHDGKF-------RVEGTSLSSILYGLHSYLFDVVHVDIWWY 105
Query: 134 KTGGVQVASM--PKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEID 191
++ A + PKL F + + V +P Y N VT+SYT A+W WK WE ++D
Sbjct: 106 AGSQLEDAPLVLPKL--FLPLNGSSV-----VPYRYDFNTVTTSYTAAFWTWKDWELQLD 158
Query: 192 WMALQGINLPLAFTGQETIWQKVFQVA 218
WMAL+GIN+ A+ G E I +VFQ A
Sbjct: 159 WMALRGINMAPAWIGIEKILIEVFQEA 185
>gi|423223006|ref|ZP_17209475.1| hypothetical protein HMPREF1062_01661 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392640582|gb|EIY34381.1| hypothetical protein HMPREF1062_01661 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 755
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 45/157 (28%), Positives = 77/157 (49%), Gaps = 18/157 (11%)
Query: 56 AAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPE-IVISGVTGVEV 114
A Y +++R+ P + S ++ +I + Y + G + I++ G V +
Sbjct: 27 AVYDLIERVTPGYASQYRLEMIQPENGSDVYE----------VDGDGQRIILRGNNAVSL 76
Query: 115 LAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTS 174
+WYL+Y C +H+SW A +P+ + R G + Y N T
Sbjct: 77 ATAFNWYLKYTCHAHVSWFGNQLKLPAKLPQPANKERRIINGKY-------RVYMNYCTV 129
Query: 175 SYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIW 211
SYT AWW+W+RW++E+D+MA+ IN+PL G + +W
Sbjct: 130 SYTAAWWNWERWQQELDYMAMNAINMPLFSVGLDGVW 166
>gi|224537227|ref|ZP_03677766.1| hypothetical protein BACCELL_02104 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521150|gb|EEF90255.1| hypothetical protein BACCELL_02104 [Bacteroides cellulosilyticus
DSM 14838]
Length = 755
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 45/157 (28%), Positives = 77/157 (49%), Gaps = 18/157 (11%)
Query: 56 AAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPE-IVISGVTGVEV 114
A Y +++R+ P + S ++ +I + Y + G + I++ G V +
Sbjct: 27 AVYDLIERVTPGYASQYRLEMIQPENGSDVYE----------VDGDGQRIILRGNNAVSL 76
Query: 115 LAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTS 174
+WYL+Y C +H+SW A +P+ + R G + Y N T
Sbjct: 77 ATAFNWYLKYTCHAHVSWFGNQLKLPAKLPQPANKERRIINGKY-------RVYMNYCTV 129
Query: 175 SYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIW 211
SYT AWW+W+RW++E+D+MA+ IN+PL G + +W
Sbjct: 130 SYTAAWWNWERWQQELDYMAMNAINMPLFSVGLDGVW 166
>gi|340520426|gb|EGR50662.1| glycoside hydrolase family 89 [Trichoderma reesei QM6a]
Length = 747
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/158 (34%), Positives = 84/158 (53%), Gaps = 10/158 (6%)
Query: 59 AVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGL 118
++++R LP H +F+F + + G SS G ++ I G T +L+GL
Sbjct: 30 SLVKRRLPQHVDSFEFVL---EPPHGSVVVNDSYTVSSTKNG--KVRIEGTTTSALLSGL 84
Query: 119 HWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTF 178
H YL + I W G Q+ PK + P+++ A + +P YY N VT+SYT
Sbjct: 85 HKYLSSEANADIWW--FIGSQLDQTPK--NLPKLR-APIKGSSVVPYRYYWNTVTTSYTS 139
Query: 179 AWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
A+W W+ WE ++DWMAL+G+NL LA+ G E I+ F
Sbjct: 140 AFWTWEDWETQLDWMALRGVNLALAWIGVEKIFIDAFH 177
>gi|449541595|gb|EMD32578.1| glycoside hydrolase family 89 protein [Ceriporiopsis subvermispora
B]
Length = 752
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 58/171 (33%), Positives = 88/171 (51%), Gaps = 14/171 (8%)
Query: 48 APPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVIS 107
A S L YA++QR LP H S+F F++ G +I+ + P+ + I +
Sbjct: 18 ATTSSPLDGLYALVQRRLPDHASSFTFQLTE----GEGDSYIISDTPAKH----HGITVE 69
Query: 108 GVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNY 167
T GL+ YL + G I W G ++ +P+ P + D V +P Y
Sbjct: 70 CTTVSACARGLYAYLTDFGGVDIWWT---GSRLDRLPR--ELPVIGDP-VHGAAIVPYRY 123
Query: 168 YQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVA 218
+ N VT YT A+W ++WE E+DW+AL+G+NLPLA+ G E I +VF+ A
Sbjct: 124 HFNTVTFDYTAAFWTVEQWELELDWLALRGVNLPLAWVGYEYILIEVFREA 174
>gi|429201402|ref|ZP_19192867.1| Tat pathway signal sequence domain protein [Streptomyces ipomoeae
91-03]
gi|428663010|gb|EKX62401.1| Tat pathway signal sequence domain protein [Streptomyces ipomoeae
91-03]
Length = 1042
Score = 82.0 bits (201), Expect = 2e-13, Method: Composition-based stats.
Identities = 46/167 (27%), Positives = 77/167 (46%), Gaps = 16/167 (9%)
Query: 50 PSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGV 109
P+ A A A L RLLP H F+ ++ ++ G + GT + +
Sbjct: 38 PAFDTAPARAALNRLLPGHADQFRLTLVRREGTGDRF---------RVSGGTGRVQVQAT 88
Query: 110 TGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQ 169
T +L G+HWYL+Y CG+H++W+ G Q+ +L + R + +P +
Sbjct: 89 TPATLLMGVHWYLKYVCGAHLAWN---GSQLDLPRRLPAPARPLERST----ALPHRFAL 141
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
N YT + DW WE++ID +A G N L G E ++ ++ +
Sbjct: 142 NDTNDGYTAPYADWPYWERQIDLLAAHGCNEVLVIAGTEAVYHRLLK 188
>gi|195050088|ref|XP_001992825.1| GH13491 [Drosophila grimshawi]
gi|193899884|gb|EDV98750.1| GH13491 [Drosophila grimshawi]
Length = 771
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 56/166 (33%), Positives = 80/166 (48%), Gaps = 16/166 (9%)
Query: 50 PSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGV 109
P VQ A ++QR+L S F + K LR++ + +++I G
Sbjct: 39 PDVQEEAVRQLMQRVLGDRASNL-FEVEVNK------ALELRSYQFRQLDNG-KVLIVGW 90
Query: 110 TGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQ 169
GV V G H+YL+Y + W KT A + + A F+ YYQ
Sbjct: 91 DGVSVCKGFHYYLKYVLNMDLDWFKTHIELPAQLQPKNVTHSSESASWFI-------YYQ 143
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
N T SY+F WW++ +W + IDWMA+ GINL LA QE IWQ+V+
Sbjct: 144 NVCTWSYSFVWWNFGQWRRHIDWMAMMGINLSLA-PNQEAIWQEVY 188
>gi|358378969|gb|EHK16650.1| glycoside hydrolase family 89 protein [Trichoderma virens Gv29-8]
Length = 748
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/163 (33%), Positives = 88/163 (53%), Gaps = 14/163 (8%)
Query: 55 AAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPE--IVISGVTGV 112
A ++++R LP H ++F+F ++ Q G + S + T + + I G T
Sbjct: 26 AGVESLVKRRLPQHVNSFEF-VLEPTQGSG------LTNDSYTVSSTKDGKVRIEGTTTS 78
Query: 113 EVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAV 172
+++GLH YL I W G Q+ PK + P+++ + + +P Y+ N V
Sbjct: 79 ALISGLHKYLSSEANVDIWW--YLGSQLDQAPK--NLPKLK-SPLKGSSVVPYRYHWNTV 133
Query: 173 TSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
T+SYT A+W W+ WE ++DWMAL+G+NL LA+ G E I+ VF
Sbjct: 134 TTSYTSAFWSWEDWETQLDWMALRGVNLALAWIGVEKIFIDVF 176
>gi|121698957|ref|XP_001267859.1| alpha-N-acetylglucosaminidase, putative [Aspergillus clavatus NRRL
1]
gi|119396001|gb|EAW06433.1| alpha-N-acetylglucosaminidase, putative [Aspergillus clavatus NRRL
1]
Length = 671
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 80/159 (50%), Gaps = 12/159 (7%)
Query: 58 YAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAG 117
Y +++R +P H +F+F ++ G + + SS T I++ G T + G
Sbjct: 25 YDLVRRRIPRHADSFRFSLVDFNSTGNDQFVV-----SSTDNDT--ILVEGSTLSALSYG 77
Query: 118 LHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYT 177
LH YL I W + +A P + + KR Y+ N VT SYT
Sbjct: 78 LHRYLADIAHVDIYWYIGSQLHLAPTPLPHLSQPLTGSSTVEKR-----YHFNTVTFSYT 132
Query: 178 FAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
A+W W+ WE ++DWMAL+GINLPLA+ GQE I +VF+
Sbjct: 133 TAFWSWEDWELQLDWMALRGINLPLAWVGQEKILVEVFR 171
>gi|315500594|ref|YP_004089396.1| Alpha-N-acetylglucosaminidase [Asticcacaulis excentricus CB 48]
gi|315418606|gb|ADU15245.1| Alpha-N-acetylglucosaminidase [Asticcacaulis excentricus CB 48]
Length = 765
Score = 81.6 bits (200), Expect = 2e-13, Method: Composition-based stats.
Identities = 42/113 (37%), Positives = 66/113 (58%), Gaps = 8/113 (7%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ I+G + V + G + YL+ +++SW+ G +VA +L + +G V+ P
Sbjct: 74 VTIAGNSPVALTHGAYSYLKSAGAANLSWE---GNRVALPTRLPN-----ASGTKVESPF 125
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y N T YT +WDW RWE+E+DWMAL GI++PLA GQE +WQ +++
Sbjct: 126 AHRVYMNTCTFGYTTPFWDWARWERELDWMALHGIDMPLAMEGQEYVWQALWR 178
>gi|294627661|ref|ZP_06706243.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
str. ICPB 11122]
gi|292598013|gb|EFF42168.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
str. ICPB 11122]
Length = 798
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/167 (31%), Positives = 85/167 (50%), Gaps = 17/167 (10%)
Query: 50 PSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGV 109
P + + A AV+QRL+ + + F ++ + G ++ I G + I+G
Sbjct: 28 PMLAASPAQAVVQRLIGARAAQFDM-TVAPRGDGADWYRI--------DAGGDTVRIAGS 78
Query: 110 TGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQ 169
+ V + G + YL + +SW+ G +VA + ++ Q V+ P Y
Sbjct: 79 SQVALARGAYAYLGQAGAASMSWE---GDRVALPAQWPAYSSGQ-----VRTPFAHRAYL 130
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
N T YT +WDW RW++EIDWMAL GI++PLA GQE IWQ +++
Sbjct: 131 NTCTYGYTTPFWDWPRWQREIDWMALHGIDMPLAMEGQEAIWQALWR 177
>gi|239615395|gb|EEQ92382.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis ER-3]
Length = 829
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 80/160 (50%), Gaps = 12/160 (7%)
Query: 61 LQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHW 120
L+R LP+H + F+F ++ + ++ + + + +I + + + GLH
Sbjct: 50 LERRLPNHVNDFKFSLVGPMRTSDDWT---NDKYTVFTGCNGKINVQANSLSGLFQGLHR 106
Query: 121 YLRYWCGSHISWDKTGGVQVA--SMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTF 178
YL I W + +A +PKL D + + IP Y+ N VT SY
Sbjct: 107 YLSDVVHVDIFWFIGNRLSLAPRKLPKL-------DKPLKGESSIPWRYHLNTVTFSYAT 159
Query: 179 AWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVA 218
WW W+ WE E+DW+A++G+NLPLA+TG E I VFQ A
Sbjct: 160 PWWTWEDWELELDWLAIRGVNLPLAWTGYEKILISVFQEA 199
>gi|325922205|ref|ZP_08183992.1| Alpha-N-acetylglucosaminidase (NAGLU) [Xanthomonas gardneri ATCC
19865]
gi|325547324|gb|EGD18391.1| Alpha-N-acetylglucosaminidase (NAGLU) [Xanthomonas gardneri ATCC
19865]
Length = 807
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/169 (32%), Positives = 86/169 (50%), Gaps = 23/169 (13%)
Query: 48 APPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVIS 107
APP A AV+QRL+ + + F+ ++ + G ++ I G + I+
Sbjct: 33 APP------AQAVVQRLIGARAAQFEM-TMAPRGDGADWYRI--------DAGGDTVRIA 77
Query: 108 GVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNY 167
G + V + G + YL + +SW+ G +VA + ++ Q V+ P
Sbjct: 78 GSSQVALARGAYAYLGQAGAASMSWE---GDRVALRAQWPAYSSGQ-----VRTPFAHRA 129
Query: 168 YQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y N T YT +WDW RW++EIDWMAL GI++PLA GQE IWQ +++
Sbjct: 130 YLNTCTYGYTTPFWDWPRWQREIDWMALHGIDMPLAMEGQEAIWQTLWR 178
>gi|449299394|gb|EMC95408.1| glycoside hydrolase family 89 protein [Baudoinia compniacensis UAMH
10762]
Length = 801
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/166 (31%), Positives = 84/166 (50%), Gaps = 7/166 (4%)
Query: 55 AAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRG--TPEIVISGVTGV 112
A ++QR LP+H + F F + + I + + + I I+G + +
Sbjct: 42 AGIQGLVQRRLPNHVNDFTFNLTNSSFVTHATTGISKQNDQFTVSNGVNNTINIAGNSPI 101
Query: 113 EVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAV 172
+ +GL WYL + I W G ++ P+ P + ++ +P Y+ N V
Sbjct: 102 ALASGLRWYLTNYVHVDIYWYI--GSRLHLTPQ--RLPAI-NSTYHGSSVVPWRYHFNTV 156
Query: 173 TSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVA 218
T SYT A+W W+ WE E+DW+AL+G+NLPLA+ G E I +VFQ A
Sbjct: 157 TFSYTAAFWTWEDWELELDWLALRGVNLPLAWVGYEQILMQVFQDA 202
>gi|294667089|ref|ZP_06732314.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
str. ICPB 10535]
gi|292603099|gb|EFF46525.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
str. ICPB 10535]
Length = 798
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 53/167 (31%), Positives = 85/167 (50%), Gaps = 17/167 (10%)
Query: 50 PSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGV 109
P + + A AV+QRL+ + + F ++ + G ++ I G + I+G
Sbjct: 28 PVLAASPAQAVVQRLIGARAAQFDM-TVAPRGDGADWYRI--------DAGGDTVRIAGS 78
Query: 110 TGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQ 169
+ V + G + YL + +SW+ G +VA + ++ Q V+ P Y
Sbjct: 79 SQVALARGAYAYLGQAGAASMSWE---GDRVALPAQWPAYSSGQ-----VRTPFAHRAYL 130
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
N T YT +WDW RW++EIDWMAL GI++PLA GQE IWQ +++
Sbjct: 131 NTCTYGYTTPFWDWPRWQREIDWMALHGIDMPLAMEGQEAIWQALWR 177
>gi|327356744|gb|EGE85601.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis ATCC 18188]
Length = 752
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 80/160 (50%), Gaps = 12/160 (7%)
Query: 61 LQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHW 120
L+R LP+H + F+F ++ + ++ + + + +I + + + GLH
Sbjct: 30 LERRLPNHVNDFKFSLVGPMRTSDDWT---NDKYTVFTGCNGKINVQANSLSGLFQGLHR 86
Query: 121 YLRYWCGSHISWDKTGGVQVA--SMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTF 178
YL I W + +A +PKL D + + IP Y+ N VT SY
Sbjct: 87 YLSDVVHVDIFWFIGNRLSLAPRKLPKL-------DKPLKGESSIPWRYHLNTVTFSYAT 139
Query: 179 AWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVA 218
WW W+ WE E+DW+A++G+NLPLA+TG E I VFQ A
Sbjct: 140 PWWTWEDWELELDWLAIRGVNLPLAWTGYEKILISVFQEA 179
>gi|261199246|ref|XP_002626024.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis SLH14081]
gi|239594232|gb|EEQ76813.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis SLH14081]
Length = 752
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 80/160 (50%), Gaps = 12/160 (7%)
Query: 61 LQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHW 120
L+R LP+H + F+F ++ + ++ + + + +I + + + GLH
Sbjct: 30 LERRLPNHVNDFKFSLVGPMRTSDDWT---NDKYTVFTGCNGKINVQANSLSGLFQGLHR 86
Query: 121 YLRYWCGSHISWDKTGGVQVA--SMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTF 178
YL I W + +A +PKL D + + IP Y+ N VT SY
Sbjct: 87 YLSDVVHVDIFWFIGNRLSLAPRKLPKL-------DKPLKGESSIPWRYHLNTVTFSYAT 139
Query: 179 AWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVA 218
WW W+ WE E+DW+A++G+NLPLA+TG E I VFQ A
Sbjct: 140 PWWTWEDWELELDWLAIRGVNLPLAWTGYEKILISVFQEA 179
>gi|398786493|ref|ZP_10549210.1| alpha-N-acetylglucosaminidase [Streptomyces auratus AGR0001]
gi|396993639|gb|EJJ04702.1| alpha-N-acetylglucosaminidase [Streptomyces auratus AGR0001]
Length = 1048
Score = 80.9 bits (198), Expect = 3e-13, Method: Composition-based stats.
Identities = 54/175 (30%), Positives = 81/175 (46%), Gaps = 17/175 (9%)
Query: 41 DIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRG 100
D + +ERAP + A A A L RLLP H FQ ++ G + R
Sbjct: 24 DGRHQERAP-AYDTAPARAALTRLLPRHADQFQLVALAPDGTGDRFRVEGR--------- 73
Query: 101 TPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVK 160
I ++G T +L G+HWYL+Y C +H+SW G QV +L + + G
Sbjct: 74 AGRITVAGTTPAVLLTGVHWYLKYTCRAHLSW---AGDQVELPGRLPAPASPVERGT--- 127
Query: 161 RPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+P + N YT + +W RWE+ ID +AL G+N L G E ++ ++
Sbjct: 128 -ALPHRFALNDTHDGYTAPYANWPRWERMIDVLALHGVNEVLVTPGAEAVYHRLL 181
>gi|289667570|ref|ZP_06488645.1| N-acetylglucosaminidase [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 798
Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 55/169 (32%), Positives = 86/169 (50%), Gaps = 23/169 (13%)
Query: 48 APPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVIS 107
APP A AV+QRL+ + + F+ ++ + G ++ I G + I+
Sbjct: 32 APP------AQAVVQRLIGARAAQFEM-TVAPRGDGADWYRI--------DAGGDTVRIA 76
Query: 108 GVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNY 167
G + V + G + YL + +SW+ G +VA + ++ Q V+ P
Sbjct: 77 GSSQVALARGAYAYLGQVGAASMSWE---GDRVALPAQWPAYSSGQ-----VRTPFAHRA 128
Query: 168 YQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y N T YT +WDW RW++EIDWMAL GI++PLA GQE IWQ +++
Sbjct: 129 YLNTCTYGYTTPFWDWPRWQREIDWMALHGIDMPLAMEGQEAIWQALWR 177
>gi|295690503|ref|YP_003594196.1| alpha-N-acetylglucosaminidase [Caulobacter segnis ATCC 21756]
gi|295432406|gb|ADG11578.1| Alpha-N-acetylglucosaminidase [Caulobacter segnis ATCC 21756]
Length = 770
Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 61/114 (53%), Gaps = 8/114 (7%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++ ISG T V ++ G + YL +H+SW+ Q ++P +G V+ P
Sbjct: 73 DLSISGDTPVALVRGAYAYLNQAGLAHVSWEGDRIAQSGALPAA--------SGARVETP 124
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y N T YT WW W RW +EIDWMA G+++PLA GQE +W+ +++
Sbjct: 125 FRHRAYLNTCTYGYTTPWWGWDRWTREIDWMAAHGVDMPLAMEGQEYVWRALWR 178
>gi|289663931|ref|ZP_06485512.1| N-acetylglucosaminidase [Xanthomonas campestris pv. vasculorum
NCPPB 702]
Length = 798
Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 55/169 (32%), Positives = 86/169 (50%), Gaps = 23/169 (13%)
Query: 48 APPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVIS 107
APP A AV+QRL+ + + F+ ++ + G ++ I G + I+
Sbjct: 32 APP------AQAVVQRLIGARAAQFEM-TVAPRGDGADWYRI--------DAGGDTVRIA 76
Query: 108 GVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNY 167
G + V + G + YL + +SW+ G +VA + ++ Q V+ P
Sbjct: 77 GSSQVALARGAYAYLGQVGAASMSWE---GDRVALPAQWPAYSSGQ-----VRTPFAHRA 128
Query: 168 YQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y N T YT +WDW RW++EIDWMAL GI++PLA GQE IWQ +++
Sbjct: 129 YLNTCTYGYTTPFWDWPRWQREIDWMALHGIDMPLAMEGQEAIWQALWR 177
>gi|21241480|ref|NP_641062.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. citri str. 306]
gi|21106823|gb|AAM35598.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. citri str. 306]
Length = 798
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 57/171 (33%), Positives = 87/171 (50%), Gaps = 19/171 (11%)
Query: 48 APPSVQLAA--AYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIV 105
A P LAA A AV+QRL+ + + F ++ + G ++ I G +
Sbjct: 24 ATPVSVLAASPAQAVVQRLIGARAAQFDM-TVAPRGDGADWYRI--------DAGGDTVR 74
Query: 106 ISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPL 165
I+G + V + G + YL + +SW+ G +VA + ++ Q V+ P
Sbjct: 75 IAGSSQVALARGAYAYLGQAGAASMSWE---GDRVALPAQWPAYRSGQ-----VRTPFAH 126
Query: 166 NYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y N T YT +WDW RW++EIDWMAL GI++PLA GQE IWQ +++
Sbjct: 127 RAYLNTCTYGYTTPFWDWPRWQREIDWMALHGIDMPLAMEGQEAIWQALWR 177
>gi|381169859|ref|ZP_09879021.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380689629|emb|CCG35508.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 798
Score = 80.1 bits (196), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 53/167 (31%), Positives = 84/167 (50%), Gaps = 17/167 (10%)
Query: 50 PSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGV 109
P + A AV+QRL+ + + F ++ + G ++ I G + I+G
Sbjct: 28 PVLAAPPAQAVVQRLIGARAAQFDM-TVAPRGDGADWYRI--------DAGGDTVRIAGS 78
Query: 110 TGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQ 169
+ V + G + YL + +SW+ G +VA + ++ Q V+ P Y
Sbjct: 79 SQVALARGAYAYLGQAGAASMSWE---GDRVALPAQWPAYRSGQ-----VRTPFAHRAYL 130
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
N T YT +WDW RW++EIDWMAL GI++PLA GQE IWQ +++
Sbjct: 131 NTCTYGYTTPFWDWPRWQREIDWMALHGIDMPLAMEGQEAIWQALWR 177
>gi|390989490|ref|ZP_10259787.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
gi|372555759|emb|CCF66762.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
Length = 798
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 53/167 (31%), Positives = 84/167 (50%), Gaps = 17/167 (10%)
Query: 50 PSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGV 109
P + A AV+QRL+ + + F ++ + G ++ I G + I+G
Sbjct: 28 PVLAAPPAQAVVQRLIGARAAQFDM-TVAPRGDGADWYRI--------EAGGDTVRIAGS 78
Query: 110 TGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQ 169
+ V + G + YL + +SW+ G +VA + ++ Q V+ P Y
Sbjct: 79 SQVALARGAYAYLGQAGAASMSWE---GDRVALPAQWPAYRSGQ-----VRTPFAHRAYL 130
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
N T YT +WDW RW++EIDWMAL GI++PLA GQE IWQ +++
Sbjct: 131 NTCTYGYTTPFWDWPRWQREIDWMALHGIDMPLAMEGQEAIWQALWR 177
>gi|333031147|ref|ZP_08459208.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
gi|332741744|gb|EGJ72226.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
Length = 721
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 51/164 (31%), Positives = 77/164 (46%), Gaps = 17/164 (10%)
Query: 51 SVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVT 110
S +AAA V++R + + F I + GG+ F + + ++ I G +
Sbjct: 27 SEYIAAAKRVIERQIGKKVNEIHFESI--EPIGGKETFDV-------VAKDGKLSIKGSS 77
Query: 111 GVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQN 170
V + G H YL+ C S +W + + P+ +V P L Y+ N
Sbjct: 78 AVAICYGFHTYLKNACNSIKTWSGSHTEALDIWPEYELKNQVT--------PYDLRYFLN 129
Query: 171 AVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
T YT +WDW RWEKEIDWMA +G+N+PLA E I ++V
Sbjct: 130 VCTFGYTTPFWDWNRWEKEIDWMAFRGVNMPLATVASEAIAERV 173
>gi|418520969|ref|ZP_13087015.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410702945|gb|EKQ61442.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
Length = 798
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 53/167 (31%), Positives = 84/167 (50%), Gaps = 17/167 (10%)
Query: 50 PSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGV 109
P + A AV+QRL+ + + F ++ + G ++ I G + I+G
Sbjct: 28 PVLAAPPAQAVVQRLIGARAAQFDM-TVAPRGDGADWYRI--------EAGGDTVRIAGS 78
Query: 110 TGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQ 169
+ V + G + YL + +SW+ G +VA + ++ Q V+ P Y
Sbjct: 79 SQVALARGAYAYLGQAGAASMSWE---GDRVALPAQWPAYRSGQ-----VRTPFAHRAYL 130
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
N T YT +WDW RW++EIDWMAL GI++PLA GQE IWQ +++
Sbjct: 131 NTCTYGYTTPFWDWPRWQREIDWMALHGIDMPLAMEGQEAIWQALWR 177
>gi|418515337|ref|ZP_13081518.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
gi|410708056|gb|EKQ66505.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
Length = 782
Score = 80.1 bits (196), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 53/167 (31%), Positives = 84/167 (50%), Gaps = 17/167 (10%)
Query: 50 PSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGV 109
P + A AV+QRL+ + + F ++ + G ++ I G + I+G
Sbjct: 12 PVLAAPPAQAVVQRLIGARAAQFDM-TVAPRGDGADWYRI--------EAGGDTVRIAGS 62
Query: 110 TGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQ 169
+ V + G + YL + +SW+ G +VA + ++ Q V+ P Y
Sbjct: 63 SQVALARGAYAYLGQAGAASMSWE---GDRVALPAQWPAYRSGQ-----VRTPFAHRAYL 114
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
N T YT +WDW RW++EIDWMAL GI++PLA GQE IWQ +++
Sbjct: 115 NTCTYGYTTPFWDWPRWQREIDWMALHGIDMPLAMEGQEAIWQALWR 161
>gi|119480815|ref|XP_001260436.1| alpha-N-acetylglucosaminidase, putative [Neosartorya fischeri NRRL
181]
gi|119408590|gb|EAW18539.1| alpha-N-acetylglucosaminidase, putative [Neosartorya fischeri NRRL
181]
Length = 748
Score = 79.7 bits (195), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 55/164 (33%), Positives = 83/164 (50%), Gaps = 14/164 (8%)
Query: 54 LAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVE 113
L Y +++R +P H +F+F ++ + + S+ GT I++ G +
Sbjct: 21 LDGIYDLVRRRMPKHADSFRFSLVDFNSTNNDQFVV-----STAANGT--ILVQGNSLSA 73
Query: 114 VLAGLHWYLRYWCGSHISWDKTGGVQVASMP-KLGSFPRVQDAGVFVKRPIPLNYYQNAV 172
+ GLH YL I W + +A P L S P + V + Y+ N V
Sbjct: 74 LSYGLHRYLADVALVDIYWFVGSQLHLAPTPLPLLSKPLTGSSTVERR------YHFNTV 127
Query: 173 TSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
T SYT A+W W+ WE ++DWMAL+GINLPLA+ GQE I +VF+
Sbjct: 128 TFSYTTAFWSWEDWELQLDWMALRGINLPLAWVGQEKILVEVFR 171
>gi|229818803|ref|YP_002880329.1| alpha-N-acetylglucosaminidase [Beutenbergia cavernae DSM 12333]
gi|229564716|gb|ACQ78567.1| Alpha-N-acetylglucosaminidase [Beutenbergia cavernae DSM 12333]
Length = 751
Score = 79.7 bits (195), Expect = 8e-13, Method: Composition-based stats.
Identities = 41/120 (34%), Positives = 62/120 (51%), Gaps = 8/120 (6%)
Query: 96 SYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDA 155
+Y E+V+ AGL+ YL++ G ++WD+ ++ ++ +L P + +
Sbjct: 58 AYAASGGELVVRASDTPAAAAGLYHYLKHAGGRQVTWDR---LRPEALTRLPDAPPTRRS 114
Query: 156 GVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P Y N VT+ YT +WDW RWE+EIDWMAL GI PL G ET+ + F
Sbjct: 115 -----TPAAARYDLNVVTTGYTAPYWDWARWEREIDWMALHGITTPLMVVGHETVLLRTF 169
>gi|440731409|ref|ZP_20911430.1| N-acetylglucosaminidase, partial [Xanthomonas translucens DAR61454]
gi|440373101|gb|ELQ09870.1| N-acetylglucosaminidase, partial [Xanthomonas translucens DAR61454]
Length = 732
Score = 79.7 bits (195), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 61/114 (53%), Gaps = 8/114 (7%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+ +SG + V + G + YL+ + +SW+ G +VA F G V P
Sbjct: 39 RLRVSGSSEVALAHGAYNYLQSIGAASVSWE---GSRVALPAAYADF-----RGQRVVTP 90
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y N T YT WWDW RWE+EIDWMAL GI++PLA GQ+ +WQ +++
Sbjct: 91 FAYRAYLNVCTYGYTTPWWDWARWEREIDWMALHGIDMPLAMEGQDYVWQALWR 144
>gi|400599317|gb|EJP67021.1| alpha-N-acetylglucosaminidase, putative [Beauveria bassiana ARSEF
2860]
Length = 753
Score = 79.3 bits (194), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 59/163 (36%), Positives = 82/163 (50%), Gaps = 21/163 (12%)
Query: 59 AVLQRLLPSHYSAFQFRI----ISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEV 114
A ++R LP H ++FQ I + G Y SS G +I I G + V
Sbjct: 29 AFIKRRLPDHAASFQVSIDNAAATSATSGDRYTV------SSGSNG--KIQIQGTSPSAV 80
Query: 115 LAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDA--GVFVKRPIPLNYYQNAV 172
L GLH YL I W G ++ PK + PR+ G + +P Y+ N V
Sbjct: 81 LQGLHRYLADVAHVDIWW--FVGSELDKAPK--TLPRLNKTLEGASI---VPYRYHWNTV 133
Query: 173 TSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
T+SYT A+W W+ WE ++DWMAL+G+NL LA+ G E I+ VF
Sbjct: 134 TTSYTSAFWSWEDWELQLDWMALRGVNLALAWIGVEKIFTDVF 176
>gi|424795356|ref|ZP_18221218.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
gi|422795515|gb|EKU24196.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
Length = 1105
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 61/113 (53%), Gaps = 8/113 (7%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ +SG + V + G + YL+ + +SW+ G +VA F G V P
Sbjct: 36 LRVSGSSEVALAHGAYSYLQSIGAASVSWE---GSRVALPAAYADF-----NGQRVVTPF 87
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y N T YT WWDW RWE+EIDWMAL GI++PLA GQ+ +WQ +++
Sbjct: 88 AYRAYLNVCTYGYTTPWWDWARWEREIDWMALHGIDMPLAMEGQDYVWQALWR 140
>gi|433678127|ref|ZP_20510026.1| alpha-N-acetylglucosaminidase [Xanthomonas translucens pv.
translucens DSM 18974]
gi|430816763|emb|CCP40478.1| alpha-N-acetylglucosaminidase [Xanthomonas translucens pv.
translucens DSM 18974]
Length = 691
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 43/111 (38%), Positives = 60/111 (54%), Gaps = 8/111 (7%)
Query: 106 ISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPL 165
+SG + V + G + YL+ + +SW+ G +VA F G V P
Sbjct: 1 MSGSSEVALAHGAYSYLQSIGAASVSWE---GSRVALPAAYADF-----RGQRVVTPFAY 52
Query: 166 NYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y N T YT WWDW RWE+EIDWMAL GI++PLA GQ+ +WQ +++
Sbjct: 53 RAYLNVCTYGYTTPWWDWARWEREIDWMALHGIDMPLAMEGQDYVWQALWR 103
>gi|453081268|gb|EMF09317.1| glycoside hydrolase family 89 protein [Mycosphaerella populorum
SO2202]
Length = 784
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 75/162 (46%), Gaps = 6/162 (3%)
Query: 55 AAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIV-ISGVTGVE 113
A ++++R LP + F F + + N + G ++ ISG + +
Sbjct: 28 AGIRSLVERRLPDYVDYFSFSLTTDNDSSIGSSTRPPNDEYTVSNGENGVIAISGNSPIA 87
Query: 114 VLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVT 173
+ +GL WYL I W + G PR+ +P Y+ N VT
Sbjct: 88 LASGLRWYLANILHVDIYWFVGSNFNLIH----GDLPRLNQT-YHGSSIVPYRYWANTVT 142
Query: 174 SSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
SYT A+W W W+ ++DWMAL+GINLPLA+ G E I Q VF
Sbjct: 143 FSYTAAFWSWDDWQIQLDWMALRGINLPLAWVGVEKIIQDVF 184
>gi|395532374|ref|XP_003768245.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Sarcophilus
harrisii]
Length = 726
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 33/60 (55%), Positives = 40/60 (66%)
Query: 156 GVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
G + P YYQN T SY+F WW W RWE+EIDWMAL GINL A GQE +W++V+
Sbjct: 103 GSSPRPPCRYRYYQNVCTQSYSFVWWGWARWEREIDWMALNGINLARAAVGQEAVWRRVY 162
>gi|313203962|ref|YP_004042619.1| alpha-N-acetylglucosaminidase [Paludibacter propionicigenes WB4]
gi|312443278|gb|ADQ79634.1| Alpha-N-acetylglucosaminidase [Paludibacter propionicigenes WB4]
Length = 738
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/158 (29%), Positives = 81/158 (51%), Gaps = 15/158 (9%)
Query: 59 AVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGL 118
A+++R+L H F + IS ++ + +N +IV+ G GV +
Sbjct: 30 ALIRRVLAGHADDFVVKEISSEKGLDVFEVEAQN---------GKIVLRGNNGVSIATAF 80
Query: 119 HWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTF 178
+WYL+ +H+S+D + + G+ P + A + ++ N T YTF
Sbjct: 81 NWYLKE--TAHLSYDWQA---IKPLTISGNLP-LPAAKIRKTCAAQQRFFNNTCTFGYTF 134
Query: 179 AWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
A+W+W +W++ IDWMA+ G+N PL GQE +WQ+V++
Sbjct: 135 AFWNWGQWQRFIDWMAMNGVNRPLMLAGQEAVWQEVWK 172
>gi|336374066|gb|EGO02404.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
lacrymans S7.3]
Length = 761
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 54/164 (32%), Positives = 86/164 (52%), Gaps = 11/164 (6%)
Query: 58 YAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIR-GTPEIVISGVTGVEVLA 116
Y+++QR +P+H AF F++ S F L + S+ + G +I++ T
Sbjct: 30 YSLVQRRIPAHAGAFTFKLSSGNSSNVTDTFTLSDQISATTQAGGAKILVECSTISACAR 89
Query: 117 GLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP--IPLNYYQNAVTS 174
GL+ Y+ + G I W G ++ +P P V G V R +P Y+ N VT
Sbjct: 90 GLYTYVTEFGGVDIWWT---GSRLNELPS--ELPSV---GQPVTRSSLVPYRYHFNTVTF 141
Query: 175 SYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVA 218
SYT A++++ W +DW+AL+G+NLPLA+ G E I +VF+ A
Sbjct: 142 SYTAAFYNFDDWSLLLDWLALRGVNLPLAWVGNEYILVQVFREA 185
>gi|195454475|ref|XP_002074254.1| GK18384 [Drosophila willistoni]
gi|194170339|gb|EDW85240.1| GK18384 [Drosophila willistoni]
Length = 743
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 53/167 (31%), Positives = 82/167 (49%), Gaps = 18/167 (10%)
Query: 50 PSVQLAAAYAVLQRLLPSHYS-AFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISG 108
P VQ AA V++R++ S +F+ + +R+ S + + +I+I+G
Sbjct: 52 PDVQEQAARDVIKRVIGKKASESFEVEV--------NRLLDVRSFQISML-DSGKILIAG 102
Query: 109 VTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYY 168
G V G H YL+Y + W K +P P V + P+ YY
Sbjct: 103 WDGTSVCKGFHQYLKYQLNKDVDWFKM----TIELPVNQHLPNVSMSSTSAS---PIIYY 155
Query: 169 QNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
QN T Y+FAWW++++W + IDWMAL GI+L +A QE IWQ ++
Sbjct: 156 QNVCTWGYSFAWWNFQQWRRNIDWMALMGISLTIAPI-QEFIWQDIY 201
>gi|195339231|ref|XP_002036223.1| GM12949 [Drosophila sechellia]
gi|194130103|gb|EDW52146.1| GM12949 [Drosophila sechellia]
Length = 778
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 57/167 (34%), Positives = 80/167 (47%), Gaps = 16/167 (9%)
Query: 49 PPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISG 108
P VQ AA AV+ R++ S F + K LR+ S + I++ G
Sbjct: 48 PKDVQETAAMAVISRVIGERSSQL-FNVQVNKNMD------LRSFQISML-DDGRILLMG 99
Query: 109 VTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYY 168
GV V LH YL+Y G + W K +P P V + K P+ Y+
Sbjct: 100 WDGVSVCKALHHYLKYVLGKDVDWFKMR----IELPTNLQLPNVT---IESKSASPIIYH 152
Query: 169 QNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
QN T SY+FAWW ++W + +DWMAL GI+L +A QE IW +V+
Sbjct: 153 QNVCTWSYSFAWWGIEQWRRHLDWMALMGISLTIAPV-QEAIWVEVY 198
>gi|195398029|ref|XP_002057627.1| GJ18000 [Drosophila virilis]
gi|194141281|gb|EDW57700.1| GJ18000 [Drosophila virilis]
Length = 766
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 58/186 (31%), Positives = 89/186 (47%), Gaps = 23/186 (12%)
Query: 31 IGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSA-FQFRIISKKQCGGEYCFI 89
+G + ++RL P VQ AA ++QR++ S FQ + +
Sbjct: 26 LGPELVARLT-----PETTPDVQEAAVRQLMQRVIGLRASNLFQVEVNKLLETSSYQISR 80
Query: 90 LRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSF 149
L N +++I+G GV V G H+YL+Y + W KT ++++ +L
Sbjct: 81 LDNG---------KVLIAGWDGVSVCKGFHYYLKYVLNQDLDWFKTR-IELSEQLQL--- 127
Query: 150 PRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQET 209
P V + K + YYQN T SY++ WW ++ W IDWMA+ GINL +A QE
Sbjct: 128 PNVT---LTSKSASSIIYYQNVCTWSYSYVWWSFEEWRAHIDWMAMMGINLVIA-PNQEA 183
Query: 210 IWQKVF 215
IWQ V+
Sbjct: 184 IWQAVY 189
>gi|329934959|ref|ZP_08285000.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
gi|329305781|gb|EGG49637.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
Length = 1017
Score = 77.8 bits (190), Expect = 3e-12, Method: Composition-based stats.
Identities = 51/173 (29%), Positives = 78/173 (45%), Gaps = 28/173 (16%)
Query: 50 PSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGV 109
P+ A A + L RLLP H A QFR+ + GG F + T + ++G
Sbjct: 17 PAFDTAPARSALNRLLPDH--AGQFRLTPLRPSGGRERFRVTG-------TTGRVEVAGT 67
Query: 110 TGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP------I 163
+ +L G HWYL+Y CG+H++W+ GG +P+ P RP +
Sbjct: 68 SPAVLLHGAHWYLKYVCGAHLAWN--GG--TLDLPRRLPAP---------ARPLERSTAL 114
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
P + N YT + DW W+ EID +A G N + G E ++ +V +
Sbjct: 115 PHRFALNDTNDGYTAPYADWPYWQHEIDLLAAHGCNEVMVIAGMEAVYHRVLK 167
>gi|21356587|ref|NP_652045.1| CG13397, isoform A [Drosophila melanogaster]
gi|442626853|ref|NP_001260251.1| CG13397, isoform B [Drosophila melanogaster]
gi|16185856|gb|AAL13967.1| LP03571p [Drosophila melanogaster]
gi|22945953|gb|AAF52672.2| CG13397, isoform A [Drosophila melanogaster]
gi|440213562|gb|AGB92787.1| CG13397, isoform B [Drosophila melanogaster]
Length = 778
Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 57/167 (34%), Positives = 80/167 (47%), Gaps = 16/167 (9%)
Query: 49 PPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISG 108
P VQ AA AV+ R++ S F++ K LR+ S + I++ G
Sbjct: 48 PKDVQETAAMAVISRVIGERSSQL-FKVQVNKNMD------LRSFQISML-DDGRILLMG 99
Query: 109 VTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYY 168
GV V LH YL+Y + W K +P P V + K P+ Y+
Sbjct: 100 WDGVSVCKALHHYLKYVLNKDVDWFKMR----IELPTNLQLPNVT---IESKSASPIIYH 152
Query: 169 QNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
QN T SY+FAWW ++W + +DWMAL GI+L +A QE IW KV+
Sbjct: 153 QNVCTWSYSFAWWGIEQWRRHLDWMALMGISLTIAPV-QEAIWVKVY 198
>gi|345014586|ref|YP_004816940.1| alpha-N-acetylglucosaminidase [Streptomyces violaceusniger Tu 4113]
gi|344040935|gb|AEM86660.1| alpha-N-acetylglucosaminidase [Streptomyces violaceusniger Tu 4113]
Length = 1044
Score = 77.4 bits (189), Expect = 4e-12, Method: Composition-based stats.
Identities = 56/176 (31%), Positives = 85/176 (48%), Gaps = 18/176 (10%)
Query: 41 DIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRG 100
D QD A + A A A L+RLLP+H A QFR+++ + G E F +
Sbjct: 35 DTQDP--AVTAFDTAPARAALRRLLPAH--AEQFRLVALAKRGDEERFEVDGSAG----- 85
Query: 101 TPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVK 160
+ +SG + +L G+HWYL+Y C + ++W G Q+ ++PK P + +
Sbjct: 86 --RVTVSGTSPAVLLTGVHWYLKYTCHAQLTWS---GDQL-NLPKKLPAPARR---LTHS 136
Query: 161 RPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+P + N YT + DW RWE ID AL G N L GQE ++ ++ Q
Sbjct: 137 TALPHRFAFNDTHDGYTAPFADWDRWEHLIDVAALHGCNELLVTAGQEAVYHRLLQ 192
>gi|195577611|ref|XP_002078662.1| GD22403 [Drosophila simulans]
gi|194190671|gb|EDX04247.1| GD22403 [Drosophila simulans]
Length = 778
Score = 77.4 bits (189), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 57/167 (34%), Positives = 80/167 (47%), Gaps = 16/167 (9%)
Query: 49 PPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISG 108
P VQ AA AV+ R++ S F + K LR+ S + I++ G
Sbjct: 48 PKDVQKTAAMAVISRVIGERSSQL-FNVQVNKNMD------LRSFQISMLDDG-RILLMG 99
Query: 109 VTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYY 168
GV V LH YL+Y G + W K +P P V + K P+ Y+
Sbjct: 100 WDGVSVCKALHHYLKYVLGKDVDWFKMR----IELPTNLQLPNVT---IESKSASPIIYH 152
Query: 169 QNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
QN T SY+FAWW ++W + +DWMAL GI+L +A QE IW +V+
Sbjct: 153 QNVCTWSYSFAWWGIEQWRRHLDWMALMGISLTIAPV-QEAIWVEVY 198
>gi|336371253|gb|EGN99592.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
lacrymans S7.3]
gi|336384013|gb|EGO25161.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 761
Score = 77.4 bits (189), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 52/170 (30%), Positives = 86/170 (50%), Gaps = 11/170 (6%)
Query: 51 SVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVT 110
S QL Y++++R +P+H AF F + + F L + + G I+I T
Sbjct: 24 SSQLEGLYSLVKRQIPAHAGAFTFELSAANTTNATDTFTLSDIGKN-ATGNATILIECST 82
Query: 111 GVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP--IPLNYY 168
GL+ Y+ + G I W G ++ +P ++ + G V R +P Y+
Sbjct: 83 ISACARGLYTYVTEFGGVDIWWT---GSRLNELPS-----QLPEIGEPVTRTSLVPYRYH 134
Query: 169 QNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVA 218
N VT SYT +++D+ W +DW+AL+G+NLPLA+ G E + +VF+ A
Sbjct: 135 FNTVTFSYTASFYDFDDWSFLLDWLALRGVNLPLAWVGNEYVLVQVFREA 184
>gi|198476648|ref|XP_001357424.2| GA12255 [Drosophila pseudoobscura pseudoobscura]
gi|198137793|gb|EAL34493.2| GA12255 [Drosophila pseudoobscura pseudoobscura]
Length = 767
Score = 77.4 bits (189), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 54/166 (32%), Positives = 80/166 (48%), Gaps = 17/166 (10%)
Query: 50 PSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGV 109
P VQ AA AV+QR++ S +S+ R+ S + + +I+I+G
Sbjct: 39 PDVQEEAARAVIQRVIGPRSSQLFLVEVSR-------ILDFRSFQISML-DSGQILIAGW 90
Query: 110 TGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQ 169
GV V G H YL+Y + W K +P P V + Y+Q
Sbjct: 91 DGVSVCKGFHHYLKYVLERDVDWFKMR----IELPTNLQLPNV----TIRSTSTTILYHQ 142
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
N T SY+FAWW +++W + IDWMA+ GI+L +A QE +WQ V+
Sbjct: 143 NVCTWSYSFAWWGFEQWRRHIDWMAMMGISLTIAPV-QEAVWQDVY 187
>gi|195155652|ref|XP_002018715.1| GL25802 [Drosophila persimilis]
gi|194114868|gb|EDW36911.1| GL25802 [Drosophila persimilis]
Length = 767
Score = 77.0 bits (188), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 54/166 (32%), Positives = 80/166 (48%), Gaps = 17/166 (10%)
Query: 50 PSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGV 109
P VQ AA AV+QR++ S +S+ R+ S + + +I+I+G
Sbjct: 39 PDVQEEAARAVIQRVIGPRSSQLFLVEVSR-------ILDFRSFQISML-DSGQILIAGW 90
Query: 110 TGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQ 169
GV V G H YL+Y + W K +P P V + Y+Q
Sbjct: 91 DGVSVCKGFHHYLKYVLERDVDWFKMR----IELPTNLQLPNV----TIRSTSTTILYHQ 142
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
N T SY+FAWW +++W + IDWMA+ GI+L +A QE +WQ V+
Sbjct: 143 NVCTWSYSFAWWGFEQWRRHIDWMAMMGISLTIAPV-QEAVWQDVY 187
>gi|452988463|gb|EME88218.1| glycoside hydrolase family 89 protein [Pseudocercospora fijiensis
CIRAD86]
Length = 772
Score = 77.0 bits (188), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 56/164 (34%), Positives = 79/164 (48%), Gaps = 12/164 (7%)
Query: 55 AAAYAVLQRLLPSHYSAFQFRII-SKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVE 113
A A+++R LP H F F I S R ++ GT I ISG + +
Sbjct: 21 AGIEALVKRRLPRHVDGFSFSIDPSYPSASSSTPPNDRYTVANGDNGT--ICISGNSPIA 78
Query: 114 VLAGLHWYLRYWCGSHISWDKTGGVQVA--SMPKLGSFPRVQDAGVFVKRPIPLNYYQNA 171
+ + L WYL + W +A ++P L S +P Y+ N
Sbjct: 79 LASALRWYLVNAMHVDLYWFVGSNFDLAPEALPPLKS-------TYHGSSIVPYRYHFNT 131
Query: 172 VTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
VT SYT A+W W+ WE+++DWMAL+GINLPLA+ G E + Q VF
Sbjct: 132 VTFSYTAAFWSWEEWEQQLDWMALRGINLPLAWVGFEKLLQDVF 175
>gi|71001188|ref|XP_755275.1| alpha-N-acetylglucosaminidase [Aspergillus fumigatus Af293]
gi|66852913|gb|EAL93237.1| alpha-N-acetylglucosaminidase, putative [Aspergillus fumigatus
Af293]
gi|159129357|gb|EDP54471.1| alpha-N-acetylglucosaminidase, putative [Aspergillus fumigatus
A1163]
Length = 756
Score = 76.6 bits (187), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 51/165 (30%), Positives = 81/165 (49%), Gaps = 8/165 (4%)
Query: 54 LAAAYAVLQRLLPSHYSAFQFRIISKKQCGGE-YCFILRNHPSSYIRGTPEIVISGVTGV 112
L Y +++R +P H +F+F ++ + + + + ++G +S +
Sbjct: 21 LDGIYDLVRRRMPKHVDSFRFSLVDFNSTNNDRFVVSTAANGTILVQGNSLSALSYGNAI 80
Query: 113 EVLAGLHWYLRYWCGSHISWDKTGGVQVASMP-KLGSFPRVQDAGVFVKRPIPLNYYQNA 171
+ LH YL I W + +A P L S P + V + Y+ N
Sbjct: 81 DDKDSLHRYLADVALVDIYWFVGSQLHLAPTPLPLLSKPLTGSSTVERR------YHFNT 134
Query: 172 VTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
VT SYT A+W W+ WE ++DWMAL+GINLPLA+ GQE I +VF+
Sbjct: 135 VTFSYTTAFWSWEDWELQLDWMALRGINLPLAWVGQEKILVEVFR 179
>gi|374384144|ref|ZP_09641670.1| hypothetical protein HMPREF9449_00056 [Odoribacter laneus YIT
12061]
gi|373228751|gb|EHP51054.1| hypothetical protein HMPREF9449_00056 [Odoribacter laneus YIT
12061]
Length = 835
Score = 76.6 bits (187), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 61/113 (53%), Gaps = 8/113 (7%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ I G +GV + G + +++ +W + AS+P ++ V P
Sbjct: 64 LEIRGSSGVALCRGFYDFIKSNHAGISAWSGNRCIFPASLP--------ENMEKRVVSPF 115
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
P +YY N VT YT +WDW+RWE+EIDWMAL G+++PLA E I +V++
Sbjct: 116 PRHYYYNVVTYGYTLPYWDWQRWEEEIDWMALHGVDMPLALVANEAITARVWK 168
>gi|404406438|ref|ZP_10998022.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
Length = 726
Score = 76.3 bits (186), Expect = 8e-12, Method: Composition-based stats.
Identities = 40/115 (34%), Positives = 62/115 (53%), Gaps = 8/115 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGV-QVASMPKLGSFPRVQDAGVFVKR 161
E+ ISG + V + GL+ YLR +C ++W + + + +P + RV+ R
Sbjct: 59 ELHISGNSAVSMAFGLNCYLREYCRVTVTWYRRDKIDEPKRLPAVEG--RVEREARVGNR 116
Query: 162 PIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
++ N T Y+ WW W WE IDWMAL G+ + LA TGQE +WQ+V++
Sbjct: 117 -----FFLNYCTYGYSLNWWQWDEWEHFIDWMALNGVTMALATTGQEAVWQRVWR 166
>gi|456390168|gb|EMF55563.1| alpha-N-acetylglucosaminidase [Streptomyces bottropensis ATCC
25435]
Length = 1042
Score = 76.3 bits (186), Expect = 9e-12, Method: Composition-based stats.
Identities = 45/168 (26%), Positives = 76/168 (45%), Gaps = 18/168 (10%)
Query: 50 PSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTP-EIVISG 108
P+ A A + L RLLP H F+ ++ + + + GT +I +
Sbjct: 38 PAFDTAPARSALNRLLPGHAGQFRLSLLDRSGTVDRF----------RVTGTKRDIQVQA 87
Query: 109 VTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYY 168
T +L G+HWYL+Y CG+H++W+ G Q+ +L + R + +P +
Sbjct: 88 TTPATLLMGVHWYLKYVCGAHLAWN---GSQLDLPSRLPAPARPLEGST----ALPHRFA 140
Query: 169 QNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
N YT + DW WE +ID +A G N L G E ++ ++ +
Sbjct: 141 LNDTNDGYTAPYADWPYWEHQIDLLAAHGCNEVLVIAGMEAVYHRLLK 188
>gi|302546018|ref|ZP_07298360.1| LOW QUALITY PROTEIN: putative alpha-N-acetylglucosaminidase
[Streptomyces hygroscopicus ATCC 53653]
gi|302463636|gb|EFL26729.1| LOW QUALITY PROTEIN: putative alpha-N-acetylglucosaminidase
[Streptomyces himastatinicus ATCC 53653]
Length = 679
Score = 75.9 bits (185), Expect = 1e-11, Method: Composition-based stats.
Identities = 54/164 (32%), Positives = 79/164 (48%), Gaps = 19/164 (11%)
Query: 55 AAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEV 114
A A A L+RLLP H A QFR+++ G + T I ++G + +
Sbjct: 7 APARAALRRLLPDH--AEQFRLVALPGGKG------KKERFEVTGSTGRITVAGTSPAVL 58
Query: 115 LAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP--IPLNYYQNAV 172
L G+HWYL+Y C + I+W G Q+ +PK ++ G V+R +P + N
Sbjct: 59 LTGVHWYLKYTCHTQITW---AGDQL-ELPK-----KLPAPGGTVERSTALPHRFAFNDT 109
Query: 173 TSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YT + DW RWE ID AL G N L GQE ++ ++ Q
Sbjct: 110 HDGYTAPFADWDRWEHLIDVAALHGSNELLVTAGQEVVYHRLLQ 153
>gi|328867426|gb|EGG15808.1| alpha-N-acetylglucosaminidase [Dictyostelium fasciculatum]
Length = 992
Score = 75.9 bits (185), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 42/107 (39%), Positives = 56/107 (52%), Gaps = 10/107 (9%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ + V + ++YL+Y+ +W TG G+ P V V V +PI
Sbjct: 115 VYLQANNAVNLAQAFNYYLKYYAMCVYTW--TGD---QCNLNDGALPEV---PVPVTQPI 166
Query: 164 PLN--YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQE 208
YY N T Y+ WWDW+RW++EIDWMAL G NLPLAF GQE
Sbjct: 167 ASQWRYYMNVCTFGYSTVWWDWQRWQREIDWMALNGYNLPLAFVGQE 213
>gi|197302378|ref|ZP_03167435.1| hypothetical protein RUMLAC_01107 [Ruminococcus lactaris ATCC
29176]
gi|197298557|gb|EDY33100.1| F5/8 type C domain protein [Ruminococcus lactaris ATCC 29176]
Length = 1655
Score = 75.5 bits (184), Expect = 2e-11, Method: Composition-based stats.
Identities = 43/111 (38%), Positives = 65/111 (58%), Gaps = 7/111 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+I I G GV + GL++YL+Y+C +IS G QV +MPK S V++ V +
Sbjct: 550 KIHIKGNNGVSLATGLNYYLKYYCNVNIS---QVGDQV-TMPK--SIIPVEET-VHKETK 602
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQK 213
P+ Y N T SY+ A+W + W E+DW+AL G+N+ L T QE +W++
Sbjct: 603 FPVRYSYNYCTLSYSMAFWGEEEWRNELDWLALNGVNVVLDATAQEEVWRR 653
>gi|346324333|gb|EGX93930.1| alpha-N-acetylglucosaminidase, putative [Cordyceps militaris CM01]
Length = 751
Score = 74.7 bits (182), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/160 (33%), Positives = 78/160 (48%), Gaps = 13/160 (8%)
Query: 59 AVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGL 118
A ++R LP H +FQ I + + G +I I G + ++ GL
Sbjct: 29 AFIKRRLPDHVDSFQVSIAKAPSA----SIAANDSYNVSSGGNGKIHIQGTSPSAIMQGL 84
Query: 119 HWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQD--AGVFVKRPIPLNYYQNAVTSSY 176
H YL I W G ++ PK + PR+ G + R Y+ N VT+SY
Sbjct: 85 HRYLADVAHVDIWW--FVGSELDKAPK--ALPRLVQPLEGSSIVR---FRYHWNTVTTSY 137
Query: 177 TFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
T A+W W+ WE ++DWMAL+G+NL LA+ G E I+ VF+
Sbjct: 138 TSAFWSWEDWELQLDWMALRGVNLALAWIGVEKIFTDVFR 177
>gi|402824586|ref|ZP_10873940.1| N-acetylglucosaminidase, partial [Sphingomonas sp. LH128]
gi|402261896|gb|EJU11905.1| N-acetylglucosaminidase, partial [Sphingomonas sp. LH128]
Length = 486
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 61/119 (51%), Gaps = 9/119 (7%)
Query: 98 IRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGV 157
IRG I G + V ++ G+ L H+SW+ + ++P + D G
Sbjct: 90 IRGG-HIEAEGTSPVALVRGVATTLEKLGRLHVSWEGRRLGALDALPDM-------DTGE 141
Query: 158 FVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
V P L Y N T YT WW W RWE+EIDWMA G++ PLA GQE +W+++++
Sbjct: 142 -VASPFALRAYLNTCTYGYTTPWWGWDRWEQEIDWMAAHGVDTPLAMEGQEYVWRELWR 199
>gi|16124795|ref|NP_419359.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus CB15]
gi|221233511|ref|YP_002515947.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus NA1000]
gi|13421729|gb|AAK22527.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus CB15]
gi|220962683|gb|ACL94039.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus NA1000]
Length = 770
Score = 74.3 bits (181), Expect = 3e-11, Method: Composition-based stats.
Identities = 42/113 (37%), Positives = 61/113 (53%), Gaps = 8/113 (7%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
I ISG + V ++ G + +LR +H+SW+ VQ ++ AG V+ P
Sbjct: 74 ISISGDSPVALVRGAYAHLRQAGLAHVSWEGDRVVQAGAV--------PAGAGARVETPF 125
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y N T YT WW W RW +EIDWMA GI++PLA GQE +W+ +++
Sbjct: 126 RHRAYLNTCTYGYTTPWWGWGRWTREIDWMAAHGIDMPLAMEGQEYVWRALWR 178
>gi|210611122|ref|ZP_03288736.1| hypothetical protein CLONEX_00926, partial [Clostridium nexile DSM
1787]
gi|210152109|gb|EEA83116.1| hypothetical protein CLONEX_00926 [Clostridium nexile DSM 1787]
Length = 1662
Score = 74.3 bits (181), Expect = 3e-11, Method: Composition-based stats.
Identities = 39/111 (35%), Positives = 58/111 (52%), Gaps = 7/111 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+I I G GV + GL+ YL+Y+C +IS QV K+ + V +
Sbjct: 555 KIHIKGNDGVSLATGLNHYLKYYCNVNIS-------QVGDQVKMPKSIVPIEGTVHKETK 607
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQK 213
P+ Y N T SY+ A+W K W E+DW+AL G+N+ L T QE +W++
Sbjct: 608 FPVRYSYNYCTLSYSMAFWGEKEWRNELDWLALNGVNVVLDATAQEEVWRR 658
>gi|224027030|ref|ZP_03645396.1| hypothetical protein BACCOPRO_03789 [Bacteroides coprophilus DSM
18228]
gi|224020266|gb|EEF78264.1| hypothetical protein BACCOPRO_03789 [Bacteroides coprophilus DSM
18228]
Length = 837
Score = 73.9 bits (180), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 52/160 (32%), Positives = 72/160 (45%), Gaps = 18/160 (11%)
Query: 57 AYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLA 116
A+ VLQR+ S S+ ++ KK G Y Y + + G V +
Sbjct: 28 AFRVLQRV--SKQSSVPVKLELKKGVGNMYY--------EYSVKDGILHLKGSDNVSLCR 77
Query: 117 GLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSY 176
G + Y++ +W +P S V V+ P +YY N T Y
Sbjct: 78 GFYDYVKTHRLGMYTWSGNNIRLPEELPAASS--------VRVESPFRNHYYFNVCTYGY 129
Query: 177 TFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
T +WDW RWE+EIDWMAL GIN+PLA G E I +V+Q
Sbjct: 130 TMPYWDWARWEQEIDWMALHGINMPLALVGYEAILARVWQ 169
>gi|374990497|ref|YP_004965992.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
gi|297161149|gb|ADI10861.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
Length = 1001
Score = 73.9 bits (180), Expect = 5e-11, Method: Composition-based stats.
Identities = 56/162 (34%), Positives = 79/162 (48%), Gaps = 16/162 (9%)
Query: 55 AAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEV 114
A A A L+RLLP H A QFR++ + GGE F + P I ++G + +
Sbjct: 5 APARAALKRLLPDH--AEQFRLVPVEGEGGEERFEVSGRPG-------RIEVAGTSPAVL 55
Query: 115 LAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTS 174
LAG+HWYL+Y C + I+W G Q+ +PK PR A + +P + N
Sbjct: 56 LAGVHWYLKYTCHALITW---AGDQL-DLPKTLPAPR---ARIEQSTALPHRFAFNDTHD 108
Query: 175 SYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YT + W WE+ ID AL G N L GQE ++ + Q
Sbjct: 109 GYTAPFAGWDHWERLIDVAALHGCNELLVTAGQEAVYHLLLQ 150
>gi|288927801|ref|ZP_06421648.1| putative alpha-N-acetylglucosaminidase
(N-acetyl-alpha-glucosaminidase) (NAG) [Prevotella sp.
oral taxon 317 str. F0108]
gi|288330635|gb|EFC69219.1| putative alpha-N-acetylglucosaminidase
(N-acetyl-alpha-glucosaminidase) (NAG) [Prevotella sp.
oral taxon 317 str. F0108]
Length = 723
Score = 73.9 bits (180), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 32/58 (55%), Positives = 39/58 (67%)
Query: 159 VKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
V P +YY NAVT YT +WDW RWE+EIDWMA GI++PLA T E I +VF+
Sbjct: 117 VTSPFQYHYYFNAVTFGYTMPYWDWNRWEQEIDWMAFHGIDMPLALTANEAILARVFK 174
>gi|281424178|ref|ZP_06255091.1| N-acetylglucosaminidase [Prevotella oris F0302]
gi|281401447|gb|EFB32278.1| N-acetylglucosaminidase [Prevotella oris F0302]
Length = 723
Score = 73.9 bits (180), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 32/58 (55%), Positives = 39/58 (67%)
Query: 159 VKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
V P +YY NAVT YT +WDW RWE+EIDWMA GI++PLA T E I +VF+
Sbjct: 117 VTSPFQYHYYFNAVTFGYTMPYWDWNRWEQEIDWMAFHGIDMPLALTANEAILARVFK 174
>gi|290956360|ref|YP_003487542.1| alpha-N-acetylglucosaminidase [Streptomyces scabiei 87.22]
gi|260645886|emb|CBG68977.1| putative alpha-N-acetylglucosaminidase [Streptomyces scabiei 87.22]
Length = 732
Score = 73.6 bits (179), Expect = 6e-11, Method: Composition-based stats.
Identities = 55/181 (30%), Positives = 87/181 (48%), Gaps = 22/181 (12%)
Query: 37 SRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSS 96
+R + D RA +++ A+A A +RLLP H+ F G EY +
Sbjct: 23 ARAIPSDDTPRADRALRTASAAA--RRLLPDHWRQITF------AEGREY-------DTF 67
Query: 97 YIRGTP-EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDA 155
+ G+ I ++G T L GL+WYLR+ + I+W G Q S+P+ P + D
Sbjct: 68 RVSGSAGRITVAGDTPATQLTGLNWYLRHIAYAEINW---AGEQTDSLPR--ELPGL-DG 121
Query: 156 GVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
V + + + N YT A+ DW WE+E+D +AL G N L + G + ++ +VF
Sbjct: 122 AVVRRANVTHRFALNDTNDGYTGAYLDWTYWERELDVLALHGYNEVLVYAGADALYHRVF 181
Query: 216 Q 216
Q
Sbjct: 182 Q 182
>gi|169351438|ref|ZP_02868376.1| hypothetical protein CLOSPI_02218 [Clostridium spiroforme DSM 1552]
gi|169291660|gb|EDS73793.1| F5/8 type C domain protein [Clostridium spiroforme DSM 1552]
Length = 1762
Score = 73.6 bits (179), Expect = 6e-11, Method: Composition-based stats.
Identities = 57/229 (24%), Positives = 107/229 (46%), Gaps = 30/229 (13%)
Query: 2 DSPFRSVSLILFFITTITLSTLAVAQ---------SSTIGVQYISRLLDIQDRERAPPSV 52
+S +R V + L +++ + S L+ + + I +L I+ ++ +
Sbjct: 574 NSTYRIVRVYLEYVSDTSTSLLSEVRIHGKATNENKEVLRTGTIDEILGIEPYDKTSYAT 633
Query: 53 QLAAA------YAVLQRLLPSHYSA-FQFRIISK-KQCGGEYCFILRNHPSSYIRGTPEI 104
++ A Y ++ R L Y F F++ KQ EY + ++ + +I
Sbjct: 634 EITEAETIENVYGIIDRNLGEKYRDWFTFKLAKNPKQEYAEYDYYTVSNAND------KI 687
Query: 105 VISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIP 164
+I+G G+ + GL++YL+ +C +IS + A+ K+ D ++ P
Sbjct: 688 LITGNDGISLATGLNYYLKNYCKVNIS-------EQANQTKMPDKLIKVDKTIYQYTPYE 740
Query: 165 LNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQK 213
+ Y N T +YTF++ D K ++KE DW+AL G+N+ L GQE +W K
Sbjct: 741 IRYGFNYCTLNYTFSYADAKMFQKEYDWLALNGVNVVLDLAGQEAVWIK 789
>gi|194863164|ref|XP_001970307.1| GG23441 [Drosophila erecta]
gi|190662174|gb|EDV59366.1| GG23441 [Drosophila erecta]
Length = 778
Score = 73.6 bits (179), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 53/167 (31%), Positives = 78/167 (46%), Gaps = 16/167 (9%)
Query: 49 PPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISG 108
P VQ AA AV+ R++ S ++K LR+ + + I++ G
Sbjct: 48 PTDVQETAAMAVISRVIGERSSQLFMVQVNKH-------MDLRSFQITKL-DDGRILLMG 99
Query: 109 VTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYY 168
GV V G H YL+Y + W K +P P V + K P Y+
Sbjct: 100 WDGVSVCKGFHHYLKYVLNKDVDWFKMR----IELPTNLQLPNVT---IESKSASPFIYH 152
Query: 169 QNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
QN T SY+F WW +++W + +DWMAL GI+L +A QE IW +V+
Sbjct: 153 QNVCTWSYSFVWWGFEQWRRHLDWMALMGISLTIAPV-QEAIWVEVY 198
>gi|409097333|ref|ZP_11217357.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Pedobacter agri
PB92]
Length = 724
Score = 73.6 bits (179), Expect = 6e-11, Method: Composition-based stats.
Identities = 39/109 (35%), Positives = 60/109 (55%), Gaps = 7/109 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
E+ I+ GL WYL+Y+C +S G ++ + KL P ++D V + P
Sbjct: 65 EVFIAATNANAAAQGLGWYLKYYCNKSLS---HLGNNLSPVSKL---PVIKDK-VKISSP 117
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIW 211
Y N T +Y+F+++ W+ WEKE+DWMAL G+N+ LA G E +W
Sbjct: 118 FKYRYALNYCTINYSFSFYRWEDWEKELDWMALNGVNIMLAPMGTELVW 166
>gi|168207628|ref|ZP_02633633.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens E str. JGS1987]
gi|170661027|gb|EDT13710.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens E str. JGS1987]
Length = 2104
Score = 72.8 bits (177), Expect = 9e-11, Method: Composition-based stats.
Identities = 37/119 (31%), Positives = 64/119 (53%), Gaps = 12/119 (10%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVA---SMPKLGSFPRVQDAG 156
G +++I G GV + +G ++YL+ +C ++S++ G + +MP +G
Sbjct: 244 GDGKVLIKGNNGVSLASGFNYYLKNYC--NVSYNPIMGSNLKIPETMPSVGE-------R 294
Query: 157 VFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
V + P Y N T SYT A+WDW ++E+ +DW A+ G+NL L GQE + ++
Sbjct: 295 VVIDTPYEHRYALNFCTYSYTMAFWDWDQYEEFLDWCAMNGVNLVLDIIGQEEVLRRTL 353
>gi|375146756|ref|YP_005009197.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Niastella
koreensis GR20-10]
gi|361060802|gb|AEV99793.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Niastella
koreensis GR20-10]
Length = 1147
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 60/114 (52%), Gaps = 7/114 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+I+I+ GL+WYL+Y+C H S G ++ + S P V +A V +
Sbjct: 61 KIMIAATGPNAAAVGLNWYLKYYC--HRSMSHMGD----NLSPVSSLPVVTEA-VTIDAA 113
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y N T +YT +++ W WE E+DWMAL G+NL L G+E +WQ V +
Sbjct: 114 SQYRYALNYCTYNYTMSFYSWSDWEHELDWMALNGVNLMLVANGEEAVWQNVLR 167
>gi|169346867|ref|ZP_02865815.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens C str. JGS1495]
gi|169296926|gb|EDS79050.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens C str. JGS1495]
Length = 2104
Score = 72.4 bits (176), Expect = 1e-10, Method: Composition-based stats.
Identities = 37/116 (31%), Positives = 66/116 (56%), Gaps = 6/116 (5%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G +++I G GV + +G ++YL+ +C ++S++ G + MP+ + P V + V +
Sbjct: 244 GDGKVLIKGNNGVSLASGFNYYLKNYC--NVSYNPIMGSNL-KMPE--TMPSVGER-VVI 297
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P Y N T SYT ++WDW ++E+ +DW A+ G+NL L GQE + ++
Sbjct: 298 DTPYEHRYALNFCTYSYTMSFWDWDQYEEFLDWCAMNGVNLVLDIIGQEEVLRRTL 353
>gi|110800516|ref|YP_695309.1| alpha-N-acetylglucosaminidase [Clostridium perfringens ATCC 13124]
gi|110675163|gb|ABG84150.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens ATCC 13124]
Length = 2095
Score = 72.4 bits (176), Expect = 1e-10, Method: Composition-based stats.
Identities = 37/116 (31%), Positives = 66/116 (56%), Gaps = 6/116 (5%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G +++I G GV + +G ++YL+ +C ++S++ G + MP+ + P V + V +
Sbjct: 235 GDGKVLIKGNNGVSLASGFNYYLKNYC--NVSYNPIMGSNL-KMPE--TMPSVGER-VVI 288
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P Y N T SYT ++WDW ++E+ +DW A+ G+NL L GQE + ++
Sbjct: 289 DTPYEHRYALNFCTYSYTMSFWDWDQYEEFLDWCAMNGVNLVLDIIGQEEVLRRTL 344
>gi|18309848|ref|NP_561782.1| alpha-N-acetylglucosaminidase [Clostridium perfringens str. 13]
gi|18144526|dbj|BAB80572.1| probable alpha-N-acetylglucosaminidase [Clostridium perfringens
str. 13]
gi|288872041|dbj|BAI70446.1| alpha-N-acetylglucosaminidase [Clostridium perfringens]
Length = 2104
Score = 72.0 bits (175), Expect = 2e-10, Method: Composition-based stats.
Identities = 36/113 (31%), Positives = 65/113 (57%), Gaps = 6/113 (5%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+++I G GV + +G ++YL+ +C ++S++ G + MP+ + P V + V + P
Sbjct: 247 KVLIKGNNGVSLASGFNYYLKNYC--NVSYNPIMGSNL-KMPE--TMPSVGER-VVIDTP 300
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
Y N T SYT ++WDW ++E+ +DW A+ G+NL L GQE + ++
Sbjct: 301 YEYRYALNFCTYSYTMSFWDWDQYEEFLDWCAMNGVNLVLDIIGQEEVLRRTL 353
>gi|408676293|ref|YP_006876120.1| Alpha-N-acetylglucosaminidase [Streptomyces venezuelae ATCC 10712]
gi|328880622|emb|CCA53861.1| Alpha-N-acetylglucosaminidase [Streptomyces venezuelae ATCC 10712]
Length = 855
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/160 (29%), Positives = 72/160 (45%), Gaps = 15/160 (9%)
Query: 57 AYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLA 116
A A L+RLLPSH + F + + + G + S G +IV+ G + +L
Sbjct: 159 ARAGLERLLPSHAAQFTLVPVRESEAGASF---------SLTGGAGDIVVRGTSPATLLT 209
Query: 117 GLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSY 176
G+ WYL+ G + W G + +P G+ P V V +P Y N Y
Sbjct: 210 GVGWYLKQVAGVDVGWP---GDSLGRLP--GTLPAVSGT-VTRSAAVPHRYALNDTDEGY 263
Query: 177 TFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+ + D+ +E+EID MAL G+N TG E + + Q
Sbjct: 264 SGPYRDFASYEREIDLMALHGVNEVFVPTGAEYPYYRALQ 303
>gi|293371915|ref|ZP_06618319.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
gi|292633161|gb|EFF51738.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
Length = 718
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 54/111 (48%), Gaps = 8/111 (7%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ + G + V + H Y++ C S +W V P + +V P
Sbjct: 69 LTLRGSSSVAICYAFHTYMKEACKSMKTWSGEHITSVMPWPDYELYEQVS--------PY 120
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
L Y+ N T YT +WDW+RWEKEID MAL GIN+PLA E I ++V
Sbjct: 121 ELRYFLNVCTFGYTTPYWDWERWEKEIDRMALYGINMPLATVASEAIAERV 171
>gi|422873453|ref|ZP_16919938.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens F262]
gi|380305838|gb|EIA18115.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens F262]
Length = 2104
Score = 72.0 bits (175), Expect = 2e-10, Method: Composition-based stats.
Identities = 36/119 (30%), Positives = 64/119 (53%), Gaps = 12/119 (10%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVA---SMPKLGSFPRVQDAG 156
G +++I G GV + +G ++YL+ +C ++S++ G + +MP +G
Sbjct: 244 GDGKVLIKGNNGVSLASGFNYYLKNYC--NVSYNPIMGSNLKMPETMPSIGE-------R 294
Query: 157 VFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
V + P Y N T SYT ++WDW ++E+ +DW A+ G+NL L GQE + ++
Sbjct: 295 VVIDTPYEHRYALNFCTYSYTMSFWDWDQYEEFLDWCAMNGVNLVLDIIGQEEVLRRTL 353
>gi|168216263|ref|ZP_02641888.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens NCTC 8239]
gi|182381741|gb|EDT79220.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens NCTC 8239]
Length = 2104
Score = 72.0 bits (175), Expect = 2e-10, Method: Composition-based stats.
Identities = 37/116 (31%), Positives = 66/116 (56%), Gaps = 6/116 (5%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G +++I G GV + +G ++YL+ +C ++S++ G + MP+ + P V + V +
Sbjct: 244 GDGKVLIKGNNGVSLASGFNYYLKNYC--NVSYNPIIGSNL-KMPE--TMPSVGER-VVI 297
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P Y N T SYT ++WDW ++E+ +DW A+ G+NL L GQE + ++
Sbjct: 298 DTPYEHRYALNFCTYSYTMSFWDWDQYEEFLDWCAMNGVNLVLDIIGQEEVLRRTL 353
>gi|393785795|ref|ZP_10373941.1| hypothetical protein HMPREF1068_00221 [Bacteroides nordii
CL02T12C05]
gi|392661414|gb|EIY55000.1| hypothetical protein HMPREF1068_00221 [Bacteroides nordii
CL02T12C05]
Length = 724
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 70/165 (42%), Gaps = 17/165 (10%)
Query: 50 PSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGV 109
P+ + A V++R + + F+ I G+ F + E+ + G
Sbjct: 26 PADIITPAKKVIERQIGDRVNGISFQQIDP--VNGKETFEIEAKEG-------ELTLRGS 76
Query: 110 TGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQ 169
+ V + H Y R C + +W + S P + P Y+
Sbjct: 77 SVVALCYAFHIYQREACHAMTTWGGKYSESITSWPDYSLKKQTT--------PYEYRYFL 128
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
N T YT +WDW RWEKEIDWMAL+G+N+PLA E I ++V
Sbjct: 129 NVCTYGYTTPYWDWARWEKEIDWMALRGVNMPLATVASEAIAERV 173
>gi|170292392|pdb|2VC9|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
In Complex With 2-Acetamido-1,2-Dideoxynojirmycin
gi|170292393|pdb|2VCA|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
In Complex With Beta-N-Acetyl-D-Glucosamine
gi|170292394|pdb|2VCB|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
In Complex With Pugnac
gi|170292395|pdb|2VCC|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
Length = 891
Score = 71.6 bits (174), Expect = 2e-10, Method: Composition-based stats.
Identities = 37/116 (31%), Positives = 66/116 (56%), Gaps = 6/116 (5%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G +++I G GV + +G ++YL+ +C ++S++ G + MP+ + P V + V +
Sbjct: 210 GDGKVLIKGNNGVSLASGFNYYLKNYC--NVSYNPIMGSNL-KMPE--TMPSVGER-VVI 263
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P Y N T SYT ++WDW ++E+ +DW A+ G+NL L GQE + ++
Sbjct: 264 DTPYEHRYALNFCTYSYTMSFWDWDQYEEFLDWCAMNGVNLVLDIIGQEEVLRRTL 319
>gi|182624959|ref|ZP_02952737.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens D str. JGS1721]
gi|177909756|gb|EDT72174.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens D str. JGS1721]
Length = 2104
Score = 71.6 bits (174), Expect = 2e-10, Method: Composition-based stats.
Identities = 36/113 (31%), Positives = 65/113 (57%), Gaps = 6/113 (5%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+++I G GV + +G ++YL+ +C ++S++ G + MP+ + P V + V + P
Sbjct: 247 KVLIKGNNGVSLASGFNYYLKNYC--NVSYNPIMGSNL-KMPE--TMPSVGER-VVIDTP 300
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
Y N T SYT ++WDW ++E+ +DW A+ G+NL L GQE + ++
Sbjct: 301 YEHRYALNFCTYSYTMSFWDWNQYEEFLDWCAMNGVNLVLDIIGQEEVLRRTL 353
>gi|386386798|ref|ZP_10071901.1| alpha-N-acetylglucosaminidase [Streptomyces tsukubaensis NRRL18488]
gi|385665738|gb|EIF89378.1| alpha-N-acetylglucosaminidase [Streptomyces tsukubaensis NRRL18488]
Length = 1033
Score = 71.6 bits (174), Expect = 2e-10, Method: Composition-based stats.
Identities = 39/113 (34%), Positives = 59/113 (52%), Gaps = 7/113 (6%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
I + G + +L G+HWYL+Y C +HISW G +V +P + PR G+ +
Sbjct: 84 IAVEGTSPAVLLTGVHWYLKYVCRAHISWS---GSRV-DLPDVLPPPR---PGLERSATV 136
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
P + N YT + DW RWE+ ID +AL G N L GQE ++ ++ +
Sbjct: 137 PHRFAYNDTHDGYTAPYADWARWERLIDVLALHGCNEVLVTPGQEAVYHRLLK 189
>gi|395331391|gb|EJF63772.1| alpha-N-acetylglucosaminidase [Dichomitus squalens LYAD-421 SS1]
Length = 750
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/168 (29%), Positives = 81/168 (48%), Gaps = 21/168 (12%)
Query: 51 SVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVT 110
+ L YA+ +R +P+H +F F + G F++ + P I + T
Sbjct: 23 TTSLDGIYALAKRRVPAHAESFSFSL----STGEADSFVVSDVPGG-------IHVECTT 71
Query: 111 GVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP--IPLNYY 168
G + YL + + I W G ++ +P P + G +K +P Y+
Sbjct: 72 VSACARGFYTYLTQYGCTDIWWT---GSRLGQLP-----PDLPKVGTPIKGQAIVPYRYH 123
Query: 169 QNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
N VT SYT +W++ +WE E+DW+AL+G+NLPLA+ G E I + F+
Sbjct: 124 FNTVTFSYTTPFWNFDQWELELDWLALRGVNLPLAWNGYEYILIETFR 171
>gi|383280354|pdb|4A4A|A Chain A, Cpgh89 (E483q, E601q), From Clostridium Perfringens, In
Complex With Its Substrate Glcnac-Alpha-1,4-Galactose
Length = 914
Score = 71.6 bits (174), Expect = 2e-10, Method: Composition-based stats.
Identities = 37/116 (31%), Positives = 66/116 (56%), Gaps = 6/116 (5%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G +++I G GV + +G ++YL+ +C ++S++ G + MP+ + P V + V +
Sbjct: 233 GDGKVLIKGNNGVSLASGFNYYLKNYC--NVSYNPIMGSNL-KMPE--TMPSVGER-VVI 286
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P Y N T SYT ++WDW ++E+ +DW A+ G+NL L GQE + ++
Sbjct: 287 DTPYEHRYALNFCTYSYTMSFWDWDQYEEFLDWCAMNGVNLVLDIIGQEEVLRRTL 342
>gi|295085513|emb|CBK67036.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
XB1A]
Length = 718
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 54/111 (48%), Gaps = 8/111 (7%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ + G + V + H Y++ C S +W V P + +V P
Sbjct: 69 LTLRGSSSVAICYAFHTYMKEACKSMKTWSGEHITSVMPWPDYELYEQVS--------PY 120
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
L Y+ N T YT +WDW+RWEKEID MAL G+N+PLA E I ++V
Sbjct: 121 ELRYFLNVCTFGYTTPYWDWERWEKEIDRMALYGVNMPLATVASEAIAERV 171
>gi|336404352|ref|ZP_08585050.1| hypothetical protein HMPREF0127_02363 [Bacteroides sp. 1_1_30]
gi|335943680|gb|EGN05519.1| hypothetical protein HMPREF0127_02363 [Bacteroides sp. 1_1_30]
Length = 718
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 54/111 (48%), Gaps = 8/111 (7%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ + G + V + H Y++ C S +W V P + +V P
Sbjct: 69 LTLRGSSSVAICYAFHTYMKEACKSMKTWSGEHITSVMPWPDYELYEQVS--------PY 120
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
L Y+ N T YT +WDW+RWEKEID MAL G+N+PLA E I ++V
Sbjct: 121 ELRYFLNVCTFGYTTPYWDWERWEKEIDRMALYGVNMPLATVASEAIAERV 171
>gi|423214208|ref|ZP_17200736.1| hypothetical protein HMPREF1074_02268 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693153|gb|EIY86388.1| hypothetical protein HMPREF1074_02268 [Bacteroides xylanisolvens
CL03T12C04]
Length = 718
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 54/111 (48%), Gaps = 8/111 (7%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ + G + V + H Y++ C S +W V P + +V P
Sbjct: 69 LTLRGSSSVAICYAFHTYMKEACKSMKTWSGEHITSVMPWPDYELYEQVS--------PY 120
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
L Y+ N T YT +WDW+RWEKEID MAL G+N+PLA E I ++V
Sbjct: 121 ELRYFLNVCTFGYTTPYWDWERWEKEIDRMALYGVNMPLATVASEAIAERV 171
>gi|168209163|ref|ZP_02634788.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens B str. ATCC 3626]
gi|170712640|gb|EDT24822.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens B str. ATCC 3626]
Length = 2104
Score = 71.2 bits (173), Expect = 3e-10, Method: Composition-based stats.
Identities = 36/113 (31%), Positives = 65/113 (57%), Gaps = 6/113 (5%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+++I G GV + +G ++YL+ +C ++S++ G + MP+ + P V + V + P
Sbjct: 247 KVLIKGNNGVSLASGFNYYLKNYC--NVSYNPIMGSNL-KMPE--TMPSVGER-VVIDTP 300
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
Y N T SYT ++WDW ++E+ +DW A+ G+NL L GQE + ++
Sbjct: 301 YEHRYALNFCTYSYTMSFWDWDQYEEFLDWCAMNGVNLVLDIIGQEEVLRRTL 353
>gi|336412611|ref|ZP_08592964.1| hypothetical protein HMPREF1017_00072 [Bacteroides ovatus
3_8_47FAA]
gi|335942657|gb|EGN04499.1| hypothetical protein HMPREF1017_00072 [Bacteroides ovatus
3_8_47FAA]
Length = 718
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 54/111 (48%), Gaps = 8/111 (7%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ + G + V + H Y++ C S +W V P + +V P
Sbjct: 69 LTLRGSSSVAICYAFHTYMKEACKSMKTWSGEHITSVMPWPDYELYEQVS--------PY 120
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
L Y+ N T YT +WDW+RWEKEID MAL G+N+PLA E I ++V
Sbjct: 121 ELRYFLNVCTFGYTTPYWDWERWEKEIDRMALYGVNMPLATVASEAIAERV 171
>gi|341892319|gb|EGT48254.1| hypothetical protein CAEBREN_28412 [Caenorhabditis brenneri]
Length = 713
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 53/160 (33%), Positives = 77/160 (48%), Gaps = 16/160 (10%)
Query: 59 AVLQRLLPS-HYSAFQ-FRIISKKQCGGEY-CFILRNHPSSYIRGTPEIVISGVTGVEVL 115
VL+R+ PS F+ R+ S + G + C +R H +YI G ++ + T + L
Sbjct: 37 GVLRRVAPSWVLEEFEDIRVTSAAENGFQVRCTSIRGH--TYIHGRKTVIAN--TPTDAL 92
Query: 116 AGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSS 175
++ YLR C S +S S G D F + I Y+ N T S
Sbjct: 93 NAINTYLRTECLSQVS-------WSNSSFSSGCRKHSNDVINFESKQI--RYFGNMCTFS 143
Query: 176 YTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
Y+F WW+W +WE+ IDW+AL G N L GQE IW+ +F
Sbjct: 144 YSFVWWEWPQWERFIDWIALNGFNTVLMPLGQEAIWRDIF 183
>gi|423293381|ref|ZP_17271508.1| hypothetical protein HMPREF1070_00173 [Bacteroides ovatus
CL03T12C18]
gi|392678324|gb|EIY71732.1| hypothetical protein HMPREF1070_00173 [Bacteroides ovatus
CL03T12C18]
Length = 718
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 54/111 (48%), Gaps = 8/111 (7%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ + G + V + H Y++ C S +W V P + +V P
Sbjct: 69 LTLRGSSSVAICYAFHTYMKEACKSMKTWSGEHITSVMPWPDYELYEQVS--------PY 120
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
L Y+ N T YT +WDW+RWEKEID MAL G+N+PLA E I ++V
Sbjct: 121 ELRYFLNVCTFGYTTPYWDWERWEKEIDRMALYGVNMPLATVASEAIAERV 171
>gi|298480124|ref|ZP_06998323.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
gi|298273933|gb|EFI15495.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
Length = 718
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 54/111 (48%), Gaps = 8/111 (7%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ + G + V + H Y++ C S +W V P + +V P
Sbjct: 69 LTLRGSSSVAICYAFHTYMKEACKSMKTWSGEHITSVMPWPDYELYEQVS--------PY 120
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
L Y+ N T YT +WDW+RWEKEID MAL G+N+PLA E I ++V
Sbjct: 121 ELRYFLNVCTFGYTTPYWDWERWEKEIDRMALYGVNMPLATVASEAIAERV 171
>gi|187734575|ref|YP_001876687.1| alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
BAA-835]
gi|187424627|gb|ACD03906.1| Alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
BAA-835]
Length = 848
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 59/113 (52%), Gaps = 8/113 (7%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ I G +G + G + +L+ +SW+ A +P + PR + P
Sbjct: 74 LTIRGSSGTALCRGFYDFLKTNRLGMVSWENKDIRWPAQLPD--TAPRR------IVSPF 125
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+YY N VT YT +W W+RWEKEIDWMAL GIN+PLA E I +V++
Sbjct: 126 RNHYYFNVVTYGYTMPYWTWERWEKEIDWMALHGINMPLALVATEGIAVRVWK 178
>gi|237719039|ref|ZP_04549520.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|229451817|gb|EEO57608.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
Length = 718
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 54/111 (48%), Gaps = 8/111 (7%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ + G + V + H Y++ C S +W V P + +V P
Sbjct: 69 LTLRGSSSVAICYAFHTYMKEACKSMKTWSGEHITSVMPWPDYELYEQVS--------PY 120
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
L Y+ N T YT +WDW+RWEKEID MAL G+N+PLA E I ++V
Sbjct: 121 ELRYFLNVCTFGYTTPYWDWERWEKEIDRMALYGVNMPLATVASEAIAERV 171
>gi|160884066|ref|ZP_02065069.1| hypothetical protein BACOVA_02042 [Bacteroides ovatus ATCC 8483]
gi|423291473|ref|ZP_17270321.1| hypothetical protein HMPREF1069_05364 [Bacteroides ovatus
CL02T12C04]
gi|156110408|gb|EDO12153.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
8483]
gi|392663473|gb|EIY57023.1| hypothetical protein HMPREF1069_05364 [Bacteroides ovatus
CL02T12C04]
Length = 718
Score = 70.9 bits (172), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 57/114 (50%), Gaps = 14/114 (12%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASM---PKLGSFPRVQDAGVFVK 160
+ + G + V + H Y++ C S +W G + SM P + +V
Sbjct: 69 LTLRGSSSVAICYAFHTYMKEACKSMKTW---SGEHITSMMPWPDYELYEQVS------- 118
Query: 161 RPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
P L Y+ N T YT +WDW+RWEKEID MAL G+N+PLA E I ++V
Sbjct: 119 -PYELRYFLNVCTFGYTTPYWDWERWEKEIDRMALYGVNMPLATVASEAIAERV 171
>gi|422345314|ref|ZP_16426228.1| hypothetical protein HMPREF9476_00301 [Clostridium perfringens
WAL-14572]
gi|373228039|gb|EHP50349.1| hypothetical protein HMPREF9476_00301 [Clostridium perfringens
WAL-14572]
Length = 1842
Score = 70.9 bits (172), Expect = 3e-10, Method: Composition-based stats.
Identities = 35/116 (30%), Positives = 63/116 (54%), Gaps = 12/116 (10%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVA---SMPKLGSFPRVQDAGVFV 159
+++I G GV + +G ++YL+ +C ++S++ G + +MP +G V +
Sbjct: 247 KVLIKGNNGVSLASGFNYYLKNYC--NVSYNPIMGSNLKMPETMPSIGE-------RVVI 297
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P Y N T SYT ++WDW ++E+ +DW A+ G+NL L GQE + ++
Sbjct: 298 DTPYEHRYALNFCTYSYTMSFWDWDQYEEFLDWCAMNGVNLVLDIIGQEEVLRRTL 353
>gi|336439030|ref|ZP_08618649.1| hypothetical protein HMPREF0990_01043 [Lachnospiraceae bacterium
1_1_57FAA]
gi|336017072|gb|EGN46842.1| hypothetical protein HMPREF0990_01043 [Lachnospiraceae bacterium
1_1_57FAA]
Length = 1863
Score = 70.9 bits (172), Expect = 3e-10, Method: Composition-based stats.
Identities = 36/111 (32%), Positives = 57/111 (51%), Gaps = 7/111 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++ I G GV + GL+ YL+Y+C ++S QV + V VF +
Sbjct: 544 QVKIKGNDGVSLATGLNHYLKYFCQVNLS-------QVGDQADMPENKPVVTEKVFKETK 596
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQK 213
+ Y N T SY+ A+W + W E+DW+AL G+N+ L T QE +W++
Sbjct: 597 AEVRYSYNYCTLSYSMAFWGEQEWRDELDWLALNGVNVVLDATAQEEVWRR 647
>gi|317501265|ref|ZP_07959469.1| hypothetical protein HMPREF1026_01412 [Lachnospiraceae bacterium
8_1_57FAA]
gi|316897332|gb|EFV19399.1| hypothetical protein HMPREF1026_01412 [Lachnospiraceae bacterium
8_1_57FAA]
Length = 1847
Score = 70.9 bits (172), Expect = 3e-10, Method: Composition-based stats.
Identities = 36/111 (32%), Positives = 57/111 (51%), Gaps = 7/111 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++ I G GV + GL+ YL+Y+C ++S QV + V VF +
Sbjct: 528 QVKIKGNDGVSLATGLNHYLKYFCQVNLS-------QVGDQADMPENKPVVTEKVFKETK 580
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQK 213
+ Y N T SY+ A+W + W E+DW+AL G+N+ L T QE +W++
Sbjct: 581 AEVRYSYNYCTLSYSMAFWGEQEWRDELDWLALNGVNVVLDATAQEEVWRR 631
>gi|168212494|ref|ZP_02638119.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens CPE str. F4969]
gi|170716100|gb|EDT28282.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens CPE str. F4969]
Length = 2104
Score = 70.9 bits (172), Expect = 3e-10, Method: Composition-based stats.
Identities = 35/116 (30%), Positives = 63/116 (54%), Gaps = 12/116 (10%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVA---SMPKLGSFPRVQDAGVFV 159
+++I G GV + +G ++YL+ +C ++S++ G + +MP +G V +
Sbjct: 247 KVLIKGNNGVSLASGFNYYLKNYC--NVSYNPIMGSNLKMPETMPSIGE-------RVVI 297
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P Y N T SYT ++WDW ++E+ +DW A+ G+NL L GQE + ++
Sbjct: 298 DTPYEHRYALNFCTYSYTMSFWDWDQYEEFLDWCAMNGVNLVLDIIGQEEVLRRTL 353
>gi|153814573|ref|ZP_01967241.1| hypothetical protein RUMTOR_00787 [Ruminococcus torques ATCC 27756]
gi|331089988|ref|ZP_08338878.1| hypothetical protein HMPREF1025_02461 [Lachnospiraceae bacterium
3_1_46FAA]
gi|145848067|gb|EDK24985.1| F5/8 type C domain protein [Ruminococcus torques ATCC 27756]
gi|330402902|gb|EGG82468.1| hypothetical protein HMPREF1025_02461 [Lachnospiraceae bacterium
3_1_46FAA]
Length = 1863
Score = 70.9 bits (172), Expect = 3e-10, Method: Composition-based stats.
Identities = 36/111 (32%), Positives = 57/111 (51%), Gaps = 7/111 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++ I G GV + GL+ YL+Y+C ++S QV + V VF +
Sbjct: 544 QVKIKGNDGVSLATGLNHYLKYFCQVNLS-------QVGDQADMPENKPVVTEKVFKETK 596
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQK 213
+ Y N T SY+ A+W + W E+DW+AL G+N+ L T QE +W++
Sbjct: 597 AEVRYSYNYCTLSYSMAFWGEQEWRDELDWLALNGVNVVLDATAQEEVWRR 647
>gi|299144719|ref|ZP_07037787.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
gi|298515210|gb|EFI39091.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
Length = 718
Score = 70.9 bits (172), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 54/111 (48%), Gaps = 8/111 (7%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ + G + V + H Y++ C S +W V P + +V P
Sbjct: 69 LTLRGSSSVAICYAFHTYMKEACKSMKTWSGEHITSVMLWPDYELYEQVS--------PY 120
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
L Y+ N T YT +WDW+RWEKEID MAL G+N+PLA E I ++V
Sbjct: 121 ELRYFLNVCTFGYTTPYWDWERWEKEIDRMALYGVNMPLATVASEAIAERV 171
>gi|323456608|gb|EGB12475.1| hypothetical protein AURANDRAFT_20306 [Aureococcus anophagefferens]
Length = 243
Score = 70.9 bits (172), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 35/90 (38%), Positives = 54/90 (60%), Gaps = 5/90 (5%)
Query: 130 ISWDKTGGVQVASMPKLGS----FPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKR 185
WD+ GG QVA +L + +P ++ + +R ++Y N T+SYT AWW W
Sbjct: 3 FGWDRAGGDQVAIAQRLANGLPAWPPLKGPTTY-RRLRDVSYGWNVCTASYTQAWWAWDD 61
Query: 186 WEKEIDWMALQGINLPLAFTGQETIWQKVF 215
W + +DW+AL G+NL LA+TGQE ++ V+
Sbjct: 62 WSRHLDWLALNGVNLALAYTGQERLYADVY 91
>gi|392566857|gb|EIW60032.1| alpha-N-acetylglucosaminidase [Trametes versicolor FP-101664 SS1]
Length = 747
Score = 70.5 bits (171), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 54/169 (31%), Positives = 81/169 (47%), Gaps = 19/169 (11%)
Query: 51 SVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVT 110
S L YA+ +R P+H F F + + E I H +I+ T T
Sbjct: 21 SQDLDGIYALAKRRAPAHAHEFTFALFEGEADAFE---ISDAHSGIHIQCT--------T 69
Query: 111 GVEVLAGLHWYLRYWCGSHISWDKTGGVQVAS-MPKLGSFPRVQDAGVFVKRPIPLNYYQ 169
G + YL + G I W + Q+ S +PK+G + Q + Y+
Sbjct: 70 VSACARGFYTYLTQYGGVDIWWTGSRLNQLPSRLPKVGKTVKGQAI-------VSYRYHF 122
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVA 218
N VT YT A+WD+++WE E+DW+AL+G+NLPLA+ G E I + F+ A
Sbjct: 123 NTVTFDYTAAFWDFEQWELELDWLALRGVNLPLAWVGYEYILIETFREA 171
>gi|429198382|ref|ZP_19190217.1| alpha-N-acetylglucosaminidase (NAGLU) [Streptomyces ipomoeae 91-03]
gi|428665917|gb|EKX65105.1| alpha-N-acetylglucosaminidase (NAGLU) [Streptomyces ipomoeae 91-03]
Length = 747
Score = 70.5 bits (171), Expect = 4e-10, Method: Composition-based stats.
Identities = 47/155 (30%), Positives = 74/155 (47%), Gaps = 18/155 (11%)
Query: 62 QRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWY 121
+RLLP+H+ F +++ F + H T I ++G T L GL+WY
Sbjct: 60 RRLLPNHWRQITFVRGTERDT-----FRVSGH-------TGRITVTGDTPATQLTGLNWY 107
Query: 122 LRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWW 181
L + + I+W G Q +P+ P V D V + +P + N YT A+
Sbjct: 108 LNHTAHAEINW---AGEQTNLLPR--DLPGV-DGTVSRRANVPHRFALNDTNDGYTGAYL 161
Query: 182 DWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
DW WE+E+D +AL G N L + G + ++ +VFQ
Sbjct: 162 DWTYWERELDVLALHGYNEVLVYAGADALYHRVFQ 196
>gi|282877909|ref|ZP_06286718.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
35310]
gi|281299910|gb|EFA92270.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
35310]
Length = 717
Score = 70.1 bits (170), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 76/172 (44%), Gaps = 24/172 (13%)
Query: 50 PSVQLAA-------AYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTP 102
P V LAA A VL R+L + ++++ +++ + N+
Sbjct: 15 PQVFLAADNDITVPAKRVLDRILEVPENRIVYKVLPRQEGLDAFSISCENNC-------- 66
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+ +SG + + + Y+R C +W +P S GV + P
Sbjct: 67 -LTLSGTSTTALTYAFYTYVRNACHVMATWSGNNIKLPKQLPNYHS-------GV-INSP 117
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
L Y+ N T YT +WDW+RWEKE+D MAL G+N+PLA E I ++V
Sbjct: 118 FKLRYFLNVCTFGYTMPYWDWERWEKELDLMALHGVNMPLASVASEAIAERV 169
>gi|262406054|ref|ZP_06082604.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|294648118|ref|ZP_06725661.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
gi|294806859|ref|ZP_06765684.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
gi|345510559|ref|ZP_08790126.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
gi|229443271|gb|EEO49062.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
gi|262356929|gb|EEZ06019.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|292636502|gb|EFF54977.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
gi|294445888|gb|EFG14530.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
Length = 718
Score = 70.1 bits (170), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 54/111 (48%), Gaps = 8/111 (7%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ + G + V + H Y++ C S +W V P + ++ P
Sbjct: 69 LTLRGSSSVAICYAFHTYMKEACKSMKTWSGEHITSVMPWPDYELYEQMS--------PY 120
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
L Y+ N T YT +WDW+RWEKEID MAL G+N+PLA E I ++V
Sbjct: 121 ELRYFLNVCTFGYTTPYWDWERWEKEIDRMALYGVNMPLATVASEAIAERV 171
>gi|404487024|ref|ZP_11022211.1| hypothetical protein HMPREF9448_02667 [Barnesiella intestinihominis
YIT 11860]
gi|404335520|gb|EJZ61989.1| hypothetical protein HMPREF9448_02667 [Barnesiella intestinihominis
YIT 11860]
Length = 722
Score = 70.1 bits (170), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 57/114 (50%), Gaps = 12/114 (10%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ + G + V + H YLR C + +W G + P V ++
Sbjct: 72 LTLRGSSSVAICYAFHTYLREACSAMKTW---SGEHM-------ELPETWPDFVLKEQTT 121
Query: 164 PLNY--YQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
P Y + N T YT +WDW+RWEKEIDWMAL+G+N+PLA E I ++V+
Sbjct: 122 PYEYRYFLNVCTFGYTTPYWDWERWEKEIDWMALRGVNMPLATVASEAIAERVW 175
>gi|383115207|ref|ZP_09935965.1| hypothetical protein BSGG_2911 [Bacteroides sp. D2]
gi|313695376|gb|EFS32211.1| hypothetical protein BSGG_2911 [Bacteroides sp. D2]
Length = 718
Score = 70.1 bits (170), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 54/111 (48%), Gaps = 8/111 (7%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ + G + V + H Y++ C S +W V P + ++ P
Sbjct: 69 LTLRGSSSVAICYAFHTYMKEACKSMKTWSGEHITSVMPWPDYELYEQMS--------PY 120
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
L Y+ N T YT +WDW+RWEKEID MAL G+N+PLA E I ++V
Sbjct: 121 ELRYFLNVCTFGYTTPYWDWERWEKEIDRMALYGVNMPLATVASEAIAERV 171
>gi|268533054|ref|XP_002631655.1| Hypothetical protein CBG20846 [Caenorhabditis briggsae]
Length = 712
Score = 70.1 bits (170), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 43/111 (38%), Positives = 54/111 (48%), Gaps = 9/111 (8%)
Query: 105 VISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIP 164
VI T + L +H YLR C S +S S G D F + I
Sbjct: 81 VIIANTPTDALNAIHTYLRSQCLSQVS-------WSNSSFSSGCRKHTSDLLTFESKQI- 132
Query: 165 LNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
Y+ N T SY+FAWW+W +WE+ IDW+AL G N L GQE IW+ VF
Sbjct: 133 -RYFGNMCTFSYSFAWWEWPQWERFIDWIALNGFNTVLMPLGQEAIWRDVF 182
>gi|421734750|ref|ZP_16173809.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum LMG 13195]
gi|407077324|gb|EKE50171.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum LMG 13195]
Length = 1919
Score = 70.1 bits (170), Expect = 6e-10, Method: Composition-based stats.
Identities = 41/116 (35%), Positives = 61/116 (52%), Gaps = 6/116 (5%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G I I G GV + +GL++YLR+WC + ++ G Q+ SMP+ S P V +
Sbjct: 228 GDDRISIRGNNGVSLASGLNYYLRHWC--KVDYNPLFGSQL-SMPE--SLPAVGRK-ILK 281
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
Y N T SYT A+W+W +E +DW A+ G+NL L GQE + ++
Sbjct: 282 YTNYEYRYALNFCTYSYTMAFWNWDDYEPFLDWAAMNGVNLMLDIVGQEEVLRETL 337
>gi|311064845|ref|YP_003971571.1| beta-N-hexosaminidase [Bifidobacterium bifidum PRL2010]
gi|310867165|gb|ADP36534.1| Beta-N-hexosaminidase [Bifidobacterium bifidum PRL2010]
Length = 1923
Score = 69.7 bits (169), Expect = 7e-10, Method: Composition-based stats.
Identities = 41/116 (35%), Positives = 61/116 (52%), Gaps = 6/116 (5%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G I I G GV + +GL++YLR+WC + ++ G Q+ SMP+ S P V +
Sbjct: 232 GDGRISIRGNNGVSLASGLNYYLRHWC--KVDYNPLFGSQL-SMPE--SLPAVGRK-ILK 285
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
Y N T SYT A+W+W +E +DW A+ G+NL L GQE + ++
Sbjct: 286 YTNYEYRYALNFCTYSYTMAFWNWDDYEPFLDWAAMNGVNLMLDIVGQEEVLRETL 341
>gi|310287970|ref|YP_003939229.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum S17]
gi|309251907|gb|ADO53655.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum S17]
Length = 1923
Score = 69.7 bits (169), Expect = 8e-10, Method: Composition-based stats.
Identities = 41/116 (35%), Positives = 61/116 (52%), Gaps = 6/116 (5%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G I I G GV + +GL++YLR+WC + ++ G Q+ SMP+ S P V +
Sbjct: 232 GDGRISIRGNNGVSLASGLNYYLRHWC--KVDYNPLFGSQL-SMPE--SLPAVGRK-ILK 285
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
Y N T SYT A+W+W +E +DW A+ G+NL L GQE + ++
Sbjct: 286 YTNYEYRYALNFCTYSYTMAFWNWDDYEPFLDWAAMNGVNLMLDIVGQEEVLRETL 341
>gi|331092442|ref|ZP_08341267.1| hypothetical protein HMPREF9477_01910 [Lachnospiraceae bacterium
2_1_46FAA]
gi|330401285|gb|EGG80874.1| hypothetical protein HMPREF9477_01910 [Lachnospiraceae bacterium
2_1_46FAA]
Length = 1598
Score = 69.7 bits (169), Expect = 8e-10, Method: Composition-based stats.
Identities = 46/174 (26%), Positives = 77/174 (44%), Gaps = 15/174 (8%)
Query: 44 DRERAPPSVQLAAAYAVLQRLLPSHY-SAFQFRIISKKQCGGEYCFILRNHPSSYIRGTP 102
+ E+ P +++R L + Y S F ++ G F L +
Sbjct: 497 NTEKVTPEQTYEEVRGIIERRLGAEYKSWFDLKLQENPNETGYDYFELNDKDG------- 549
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++ I+G GV + GL+ YL+Y+C +IS A + +G V +
Sbjct: 550 KVQITGNDGVSLAMGLNHYLKYFCNVNISQVGDQADMPAKIVPIGE-------KVHKETK 602
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+ Y N T SY+ A+W K W E+DW+AL G+N+ L T QE +W++ +
Sbjct: 603 VGTRYSYNYCTLSYSMAFWGEKEWRNELDWLALNGVNVVLDATAQEEVWRRFLE 656
>gi|313140918|ref|ZP_07803111.1| alpha-N-acetylglucosaminidase family protein [Bifidobacterium
bifidum NCIMB 41171]
gi|313133428|gb|EFR51045.1| alpha-N-acetylglucosaminidase family protein [Bifidobacterium
bifidum NCIMB 41171]
Length = 2005
Score = 69.7 bits (169), Expect = 8e-10, Method: Composition-based stats.
Identities = 41/116 (35%), Positives = 61/116 (52%), Gaps = 6/116 (5%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G I I G GV + +GL++YLR+WC + ++ G Q+ SMP+ S P V +
Sbjct: 314 GDGRISIRGNNGVSLASGLNYYLRHWC--KVDYNPLFGSQL-SMPE--SLPAVGRK-ILK 367
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
Y N T SYT A+W+W +E +DW A+ G+NL L GQE + ++
Sbjct: 368 YTNYEYRYALNFCTYSYTMAFWNWDDYEPFLDWAAMNGVNLMLDIVGQEEVLRETL 423
>gi|390937398|ref|YP_006394957.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum BGN4]
gi|389891011|gb|AFL05078.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum BGN4]
Length = 1957
Score = 69.7 bits (169), Expect = 8e-10, Method: Composition-based stats.
Identities = 41/116 (35%), Positives = 61/116 (52%), Gaps = 6/116 (5%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G I I G GV + +GL++YLR+WC + ++ G Q+ SMP+ S P V +
Sbjct: 266 GDGRISIRGNNGVSLASGLNYYLRHWC--KVDYNPLFGSQL-SMPE--SLPAVGRK-ILK 319
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
Y N T SYT A+W+W +E +DW A+ G+NL L GQE + ++
Sbjct: 320 YTNYEYRYALNFCTYSYTMAFWNWDDYEPFLDWAAMNGVNLMLDIVGQEEVLRETL 375
>gi|393236266|gb|EJD43816.1| putative alpha-N-acetylglucosaminidase [Auricularia delicata
TFB-10046 SS5]
Length = 778
Score = 69.7 bits (169), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 74/159 (46%), Gaps = 8/159 (5%)
Query: 60 VLQRLLPSHYSAFQFRI-ISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAG- 117
++QR +P H F F + + G + + +R + + TG G
Sbjct: 31 LVQRRIPLHARDFDFVLSTAPSNATGYDAYTVSTLSLGRVRIESDSLSGLATGSLNAIGC 90
Query: 118 -LHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSY 176
LH YL I W + +A + PRV + + +P Y+ N VT SY
Sbjct: 91 SLHRYLTDIVNVDIWWYIGSRLHLAPL----ILPRV-NGKLTGAATVPWRYHFNTVTFSY 145
Query: 177 TFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
T AWW W+ WE E+DW+AL+G+NLPLA+ G E I VF
Sbjct: 146 TTAWWTWEDWELELDWLALRGVNLPLAWVGVERIIYDVF 184
>gi|294812279|ref|ZP_06770922.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
27064]
gi|294324878|gb|EFG06521.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
27064]
Length = 1086
Score = 69.3 bits (168), Expect = 1e-09, Method: Composition-based stats.
Identities = 54/169 (31%), Positives = 77/169 (45%), Gaps = 20/169 (11%)
Query: 47 RAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVI 106
RA P A A + L+RLLP H F+ R + G E R SS I +
Sbjct: 84 RATP-FDTAPAASALRRLLPRHAGQFRLRALD----GAER---FRVTGSS-----GRIEV 130
Query: 107 SGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLN 166
+G + +L G HWYL+Y C + ISW G Q+ +P P A + +P
Sbjct: 131 AGTSPAVLLTGAHWYLKYVCHAMISWS---GSQL-DLPDRLPAPA---APLERSATVPHR 183
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+ N YT + DW RWE+ +D +AL G N L GQE ++ ++
Sbjct: 184 FAYNDTHDGYTGPYADWARWERLLDVLALHGCNEVLVTPGQEAVYHRLL 232
>gi|383643231|ref|ZP_09955637.1| N-acetylglucosaminidase [Sphingomonas elodea ATCC 31461]
Length = 778
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 30/63 (47%), Positives = 39/63 (61%), Gaps = 1/63 (1%)
Query: 154 DAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQK 213
D+G V P L Y N T YT WW W RWE+EIDWMA G++ PLA GQE +W+
Sbjct: 124 DSGT-VASPFSLRTYLNTCTYGYTTPWWTWSRWEREIDWMAAHGVDTPLAMEGQEHVWRA 182
Query: 214 VFQ 216
+++
Sbjct: 183 LWR 185
>gi|110801838|ref|YP_698175.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens SM101]
gi|110682339|gb|ABG85709.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens SM101]
Length = 2095
Score = 69.3 bits (168), Expect = 1e-09, Method: Composition-based stats.
Identities = 36/113 (31%), Positives = 64/113 (56%), Gaps = 6/113 (5%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+++I G GV + +G ++YL+ +C ++S++ + MP+ + P V + V + P
Sbjct: 238 KVLIKGNNGVSLASGFNYYLKNYC--NVSYNPIMESNL-KMPE--TMPSVGER-VVIDTP 291
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
Y N T SYT A+WDW ++E+ +DW A+ G+NL L GQE + ++
Sbjct: 292 YEHRYALNFCTYSYTMAFWDWDQYEEFLDWCAMNGVNLVLDIIGQEEVLRRTL 344
>gi|161505009|ref|YP_001572121.1| hypothetical protein SARI_03139 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:- str. RSK2980]
gi|160866356|gb|ABX22979.1| hypothetical protein SARI_03139 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:-]
Length = 1014
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 49/165 (29%), Positives = 83/165 (50%), Gaps = 18/165 (10%)
Query: 54 LAAAYAVLQRLLPSHY-SAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGV 112
++ A +++R+L + Y + F F I+ G+ F ++ +I ISG G+
Sbjct: 154 VSEAQELVERVLGAQYQNRFIFAIVPS--TTGKDSFTVK-------AADGKISISGPNGI 204
Query: 113 EVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVK-RPIPLNYYQNA 171
+ +GL+WYL+ + H+++D V ++P P+ GV K P Y N
Sbjct: 205 SLASGLNWYLKNYL--HVNYDPLN-VSSLTIPTNWPMPK----GVTEKATPYQYKYALNF 257
Query: 172 VTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
T SYT A+W W +EK +DW A+ G+NL L GQE + +++
Sbjct: 258 CTPSYTMAFWRWHDYEKFLDWAAMNGVNLMLDIVGQEEVQRRMLH 302
>gi|423219557|ref|ZP_17206053.1| hypothetical protein HMPREF1061_02826 [Bacteroides caccae
CL03T12C61]
gi|392624762|gb|EIY18840.1| hypothetical protein HMPREF1061_02826 [Bacteroides caccae
CL03T12C61]
Length = 715
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 66/144 (45%), Gaps = 17/144 (11%)
Query: 73 QFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISW 132
QF+ I+ CFIL+ ++V+ G++WYL+Y+C +S
Sbjct: 45 QFKEIASSDGD---CFILQTVGK-------KLVVQATGANAAAVGVNWYLKYYCHRSMSH 94
Query: 133 DKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDW 192
V +P +G V+ ++ Y N T +YT +++DW W+ E+DW
Sbjct: 95 LGDQLAPVTELPVIGQPVTVKTTSIY-------RYALNYCTFNYTMSFYDWDDWQWELDW 147
Query: 193 MALQGINLPLAFTGQETIWQKVFQ 216
MAL G+NL L G E +WQ +
Sbjct: 148 MALNGVNLMLVANGSEAVWQNTLR 171
>gi|153806010|ref|ZP_01958678.1| hypothetical protein BACCAC_00255 [Bacteroides caccae ATCC 43185]
gi|149130687|gb|EDM21893.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
43185]
Length = 715
Score = 68.9 bits (167), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 66/144 (45%), Gaps = 17/144 (11%)
Query: 73 QFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISW 132
QF+ I+ CFIL+ ++V+ G++WYL+Y+C +S
Sbjct: 45 QFKEIASSDGD---CFILQTVGK-------KLVVQATGANAAAVGVNWYLKYYCHRSMSH 94
Query: 133 DKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDW 192
V +P +G V+ ++ Y N T +YT +++DW W+ E+DW
Sbjct: 95 LGDQLAPVTELPVIGQPVTVKTTSIY-------RYALNYCTFNYTMSFYDWDDWQWELDW 147
Query: 193 MALQGINLPLAFTGQETIWQKVFQ 216
MAL G+NL L G E +WQ +
Sbjct: 148 MALNGVNLMLVANGSEAVWQNTLR 171
>gi|326440885|ref|ZP_08215619.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
27064]
Length = 1038
Score = 68.9 bits (167), Expect = 1e-09, Method: Composition-based stats.
Identities = 54/169 (31%), Positives = 77/169 (45%), Gaps = 20/169 (11%)
Query: 47 RAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVI 106
RA P A A + L+RLLP H F+ R + G E R SS I +
Sbjct: 36 RATP-FDTAPAASALRRLLPRHAGQFRLRALD----GAER---FRVTGSS-----GRIEV 82
Query: 107 SGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLN 166
+G + +L G HWYL+Y C + ISW G Q+ +P P A + +P
Sbjct: 83 AGTSPAVLLTGAHWYLKYVCHAMISWS---GSQL-DLPDRLPAPA---APLERSATVPHR 135
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+ N YT + DW RWE+ +D +AL G N L GQE ++ ++
Sbjct: 136 FAYNDTHDGYTGPYADWARWERLLDVLALHGCNEVLVTPGQEAVYHRLL 184
>gi|212541222|ref|XP_002150766.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
18224]
gi|210068065|gb|EEA22157.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
18224]
Length = 787
Score = 68.9 bits (167), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 51/164 (31%), Positives = 81/164 (49%), Gaps = 15/164 (9%)
Query: 58 YAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPE---IVISGVTGVEV 114
Y +++R +P H +FQF + Q ++ + + Y+ TPE ++I G + +
Sbjct: 26 YDLVKRRMPQHADSFQFSPLETVQNSPDHTDQVTDQ---YVVSTPENGTVLIEGNSLSAL 82
Query: 115 LAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQD--AGVFVKRPIPLNYYQNAV 172
GLH YL + W + +A P+ P ++ G V + Y+ N V
Sbjct: 83 SYGLHRYLADVAHVDMWWFIGNRLDIA--PE--ELPTIEQPITGSSVAQ---WRYHFNTV 135
Query: 173 TSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
T Y A+W W WE ++DWMAL+GINL LA+ G E I +VF+
Sbjct: 136 TFDYMAAFWSWSDWEFQLDWMALRGINLSLAWVGYEKILLEVFK 179
>gi|365104185|ref|ZP_09333846.1| hypothetical protein HMPREF9428_02927 [Citrobacter freundii
4_7_47CFAA]
gi|363644798|gb|EHL84079.1| hypothetical protein HMPREF9428_02927 [Citrobacter freundii
4_7_47CFAA]
Length = 1049
Score = 68.9 bits (167), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/163 (28%), Positives = 82/163 (50%), Gaps = 16/163 (9%)
Query: 55 AAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEV 114
+ A A+++R+L + Y +F G+ F ++ +I ISG G+ +
Sbjct: 190 SEAQALVERVLGAQYQN-RFTFAVTPSTTGKDSFTVK-------EADGKISISGPNGISL 241
Query: 115 LAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVK-RPIPLNYYQNAVT 173
+GL+WYL+ + H+++D + V+++ +P Q G+ K P Y N T
Sbjct: 242 ASGLNWYLKNYL--HVNYDP---LNVSNLTLPAEWPMPQ--GITEKATPYQYKYALNFCT 294
Query: 174 SSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
SYT A+W W +E+ +DW A+ G+NL L GQE + +++
Sbjct: 295 PSYTMAFWRWHDYEQFLDWAAMNGVNLMLDVVGQEEVQRRMLH 337
>gi|432896403|ref|ZP_20107613.1| hypothetical protein A13U_00343 [Escherichia coli KTE192]
gi|433031274|ref|ZP_20219108.1| hypothetical protein WIA_04388 [Escherichia coli KTE109]
gi|431432398|gb|ELH14169.1| hypothetical protein A13U_00343 [Escherichia coli KTE192]
gi|431538475|gb|ELI14460.1| hypothetical protein WIA_04388 [Escherichia coli KTE109]
Length = 1049
Score = 68.9 bits (167), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 82/164 (50%), Gaps = 16/164 (9%)
Query: 54 LAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVE 113
++ A+++R+L + Y +F G+ F ++ +I ISG G+
Sbjct: 189 ISETQALVERVLGAQYQN-RFTFTVTPSTTGKDSFTVK-------AADGKISISGPNGIS 240
Query: 114 VLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVK-RPIPLNYYQNAV 172
+ +GL+WYL+ + H+++D + V+++ +P Q GV K P Y N
Sbjct: 241 LASGLNWYLKNYL--HVNYDP---LNVSNLAIPTDWPMPQ--GVTEKATPYQYKYALNFC 293
Query: 173 TSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
T SYT A+W W +EK +DW A+ G+NL L GQE + +++
Sbjct: 294 TPSYTMAFWRWHDYEKFLDWAAMNGVNLMLDIIGQEEVQRRMLH 337
>gi|331660873|ref|ZP_08361805.1| alpha-N-acetylglucosaminidase family protein [Escherichia coli
TA206]
gi|422369309|ref|ZP_16449711.1| f5/8 type C domain protein [Escherichia coli MS 16-3]
gi|315298924|gb|EFU58178.1| f5/8 type C domain protein [Escherichia coli MS 16-3]
gi|331051915|gb|EGI23954.1| alpha-N-acetylglucosaminidase family protein [Escherichia coli
TA206]
Length = 1052
Score = 68.9 bits (167), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 82/164 (50%), Gaps = 16/164 (9%)
Query: 54 LAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVE 113
++ A+++R+L + Y +F G+ F ++ +I ISG G+
Sbjct: 192 ISETQALVERVLGAQYQN-RFTFTVTPSTTGKDSFTVK-------AADGKISISGPNGIS 243
Query: 114 VLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVK-RPIPLNYYQNAV 172
+ +GL+WYL+ + H+++D + V+++ +P Q GV K P Y N
Sbjct: 244 LASGLNWYLKNYL--HVNYDP---LNVSNLAIPTDWPMPQ--GVTEKATPYQYKYALNFC 296
Query: 173 TSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
T SYT A+W W +EK +DW A+ G+NL L GQE + +++
Sbjct: 297 TPSYTMAFWRWHDYEKFLDWAAMNGVNLMLDIIGQEEVQRRMLH 340
>gi|393783265|ref|ZP_10371440.1| hypothetical protein HMPREF1071_02308 [Bacteroides salyersiae
CL02T12C01]
gi|392669544|gb|EIY63032.1| hypothetical protein HMPREF1071_02308 [Bacteroides salyersiae
CL02T12C01]
Length = 723
Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 55/112 (49%), Gaps = 8/112 (7%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++ + G + V + H YLR C S +W + +P L ++ P
Sbjct: 70 KLTLKGSSTVALCYAFHTYLREACHSMKTWSG----EHLELPSLWPDYSLRKQTT----P 121
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
Y+ N T YT +WDW RWE EIDWMAL+G+N+PLA E I ++V
Sbjct: 122 YDFRYFLNVCTYGYTTPYWDWARWENEIDWMALRGVNMPLATVASEAIAERV 173
>gi|392588150|gb|EIW77482.1| glycoside hydrolase family 89 protein [Coniophora puteana
RWD-64-598 SS2]
Length = 761
Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 52/163 (31%), Positives = 78/163 (47%), Gaps = 11/163 (6%)
Query: 54 LAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVE 113
L YA++QR +PSH +F F + + F L + + T I + T
Sbjct: 31 LQGLYALVQRRIPSHAHSFTFSFTNSSDSELD-TFTLTDA----VNKTRAIHVECNTVSA 85
Query: 114 VLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVT 173
GL+ YL + G I W G ++ +P P V+ A V +P Y N VT
Sbjct: 86 CSRGLYTYLTEYGGVDIWWT---GSRLDQLP--APLPAVKTA-VTRSAIVPYRYMFNTVT 139
Query: 174 SSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YT A+W + WE +DW++L+G+NLPLA+ G E +VF+
Sbjct: 140 FDYTSAFWTFDDWELMLDWLSLRGVNLPLAWVGFEHTLVEVFR 182
>gi|429740221|ref|ZP_19273923.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
gi|429153946|gb|EKX96707.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
Length = 721
Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 72/169 (42%), Gaps = 23/169 (13%)
Query: 51 SVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVT 110
S + A V++RL + + F ++ + + Y +N ++ + G +
Sbjct: 25 SDMVTPARQVIERLTGASAAHINFEVLPQHENKDGYAVTCQNG---------KLTLKGSS 75
Query: 111 GVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVK---RPIPLNY 167
+ Y R C S SW + +P Q A V P Y
Sbjct: 76 PTAFTYAFYCYARQACHSMASWSGS----TLHLP-------AQSADFEVPVTYSPYQFRY 124
Query: 168 YQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+ N T YT +WDWKRWEKEID MAL GIN+PLA E I ++V++
Sbjct: 125 FLNVCTFGYTTPYWDWKRWEKEIDLMALHGINMPLATVASEAIAERVWK 173
>gi|400595379|gb|EJP63180.1| alpha-N-acetylglucosaminidase [Beauveria bassiana ARSEF 2860]
Length = 761
Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 90/209 (43%), Gaps = 41/209 (19%)
Query: 15 ITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQF 74
+T I LS LA A T+ Q + + IQ + +RL H + F+F
Sbjct: 5 LTQILLSALAAA---TVNAQQAASVTGIQ---------------GLAKRLFKGHDAQFEF 46
Query: 75 RIISKKQC-------GGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCG 127
++ K + G + + N+ ++GT L+ L LR++
Sbjct: 47 QLTQKHENWSRWNTPGNDNYTVSSNNGKIRVQGT------------TLSALARALRHYAK 94
Query: 128 SHISWDKTGGVQ-VASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRW 186
++ D+ V V ++P+ P+ G +P Y N VT YTF W+ W W
Sbjct: 95 DALNMDEHWFVDGVYNIPEQLPLPKEVLTGA---STVPWRYNLNTVTFGYTFVWYSWSDW 151
Query: 187 EKEIDWMALQGINLPLAFTGQETIWQKVF 215
EK +DW AL+GINL LA+ G E I+ F
Sbjct: 152 EKLLDWAALRGINLQLAWVGYEKIFLDSF 180
>gi|291086028|ref|ZP_06354661.2| alpha-N-acetylglucosaminidase family protein [Citrobacter youngae
ATCC 29220]
gi|291069185|gb|EFE07294.1| alpha-N-acetylglucosaminidase family protein [Citrobacter youngae
ATCC 29220]
Length = 1014
Score = 68.2 bits (165), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 80/164 (48%), Gaps = 16/164 (9%)
Query: 54 LAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVE 113
++ A +++R+L + Y +F G+ F ++ +I ISG G+
Sbjct: 154 ISEAQVLVERVLGAQYQN-RFTFAVTPSTTGKDSFTVK-------EADGKISISGPNGIS 205
Query: 114 VLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVK-RPIPLNYYQNAV 172
+ +GL+WYL+ + H+++D +A +P P+ GV K P Y N
Sbjct: 206 LASGLNWYLKNYL--HVNYDPLNASNLA-IPTDWPMPQ----GVTEKATPYQYKYALNFC 258
Query: 173 TSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
T SYT A+W W +E+ +DW A+ G+NL L GQE + +++
Sbjct: 259 TPSYTMAFWRWHDYEQFLDWAAMNGVNLMLDVVGQEEVQRRMLH 302
>gi|257067709|ref|YP_003153964.1| Alpha-N-acetylglucosaminidase (NAGLU) [Brachybacterium faecium DSM
4810]
gi|256558527|gb|ACU84374.1| Alpha-N-acetylglucosaminidase (NAGLU) [Brachybacterium faecium DSM
4810]
Length = 768
Score = 68.2 bits (165), Expect = 2e-09, Method: Composition-based stats.
Identities = 31/90 (34%), Positives = 50/90 (55%), Gaps = 7/90 (7%)
Query: 127 GSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRW 186
G+ +SWD + V + + + RV+ P+ + Y+ N VT YT A+WDW+RW
Sbjct: 95 GADLSWDAAPHIDVPASLRDRAPQRVET-------PLQIRYHLNPVTFGYTMAFWDWERW 147
Query: 187 EKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
E+ IDWMAL G+ PL G + + ++ +
Sbjct: 148 EQHIDWMALHGVTHPLNLVGHDLVLVRMLR 177
>gi|421736727|ref|ZP_16175487.1| alpha-N-acetylglucosaminidase, partial [Bifidobacterium bifidum
IPLA 20015]
gi|407295984|gb|EKF15606.1| alpha-N-acetylglucosaminidase, partial [Bifidobacterium bifidum
IPLA 20015]
Length = 1044
Score = 67.8 bits (164), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 61/116 (52%), Gaps = 6/116 (5%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G I I G GV + +GL++YLR+WC + ++ G Q+ SMP+ S P V +
Sbjct: 204 GDGRISIRGNNGVSLASGLNYYLRHWC--KVDYNPLFGSQL-SMPE--SLPAVG-RKILK 257
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
Y N T SYT A+W+W +E +DW A+ G+NL L GQE + ++
Sbjct: 258 YTNYEYRYALNFCTYSYTMAFWNWDDYEPFLDWAAMNGVNLMLDIVGQEEVLRETL 313
>gi|409042145|gb|EKM51629.1| glycoside hydrolase family 89 protein [Phanerochaete carnosa
HHB-10118-sp]
Length = 749
Score = 67.8 bits (164), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 47/160 (29%), Positives = 79/160 (49%), Gaps = 16/160 (10%)
Query: 58 YAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAG 117
Y +++R +P H F F + G F++ S + T I ++ T G
Sbjct: 27 YGIVERRMPQHARDFVFTSME----GDGDVFVV----SDTVNKTGSITVTCTTTSACARG 78
Query: 118 LHWYLRYWCGSHISWDKTGGVQVAS-MPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSY 176
L+ Y + G I W + ++ S +PK+G+ + + + R Y+ N VT Y
Sbjct: 79 LYTYATEFGGVDIFWTGSRLHELPSPLPKVGT--PINGSAIVQYR-----YHFNTVTFGY 131
Query: 177 TFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
T A+W + +W +DW+AL+G+N+PLA+ G E I +VFQ
Sbjct: 132 TTAFWTFDKWSLLLDWLALRGVNMPLAWDGYEAILTEVFQ 171
>gi|456388164|gb|EMF53654.1| alpha-N-acetylglucosaminidase [Streptomyces bottropensis ATCC
25435]
Length = 732
Score = 67.8 bits (164), Expect = 3e-09, Method: Composition-based stats.
Identities = 50/166 (30%), Positives = 74/166 (44%), Gaps = 24/166 (14%)
Query: 54 LAAAYAVLQRLLPSHYSAFQF---RIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVT 110
L A A +RLLP H+ F R + + G T I I+G T
Sbjct: 38 LRTASAAARRLLPDHWRQITFTDTRELDAFRVSGS---------------TGRITIAGDT 82
Query: 111 GVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQN 170
L GL+WYLR+ + I+W G Q +P+ P + D V + + + N
Sbjct: 83 PATQLTGLNWYLRHIAYAEINW---AGEQTNLLPR--ELPGL-DGTVARQANVIHRFALN 136
Query: 171 AVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YT A+ DW WE+E+D +AL G N L + G + ++ +VFQ
Sbjct: 137 DTNDGYTGAYLDWTYWERELDVLALHGYNEVLVYAGADALYHRVFQ 182
>gi|410634789|ref|ZP_11345419.1| alpha-N-acetylglucosaminidase [Glaciecola arctica BSs20135]
gi|410145665|dbj|GAC22286.1| alpha-N-acetylglucosaminidase [Glaciecola arctica BSs20135]
Length = 750
Score = 67.4 bits (163), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 54/113 (47%), Gaps = 9/113 (7%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+ I G + V + + YL+ + +SW G L F + + P
Sbjct: 84 VFIEGNSPVALTRASYSYLKQEHNNLLSW---SGENFHLPTTLSPFNKSEST------PY 134
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YY N VT Y+ +WDW RWEKEIDWMA+ G+N+PL E I +VF+
Sbjct: 135 QYRYYFNVVTHGYSTPYWDWSRWEKEIDWMAMHGMNMPLIGGAHEAILHRVFR 187
>gi|156046298|ref|XP_001589681.1| hypothetical protein SS1G_09403 [Sclerotinia sclerotiorum 1980]
gi|154693798|gb|EDN93536.1| hypothetical protein SS1G_09403 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 795
Score = 67.4 bits (163), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 73/149 (48%), Gaps = 10/149 (6%)
Query: 60 VLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLH 119
+L+R LP+H + F+F+I G + S T I++ G + + GL
Sbjct: 37 LLKRRLPAHVNDFEFKIRGNHTLGAT-----NDDYSVSQTCTGVILVEGNSLSALATGLR 91
Query: 120 WYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFA 179
Y I W G+Q+ PK FP + ++ + +P Y+ N VT SYT A
Sbjct: 92 KYFTDILHVDIWW--YIGLQLHLAPK--EFPPL-NSTLRGSSIVPWRYHFNTVTFSYTTA 146
Query: 180 WWDWKRWEKEIDWMALQGINLPLAFTGQE 208
+W W WE ++DWM+L GINL LA+ G E
Sbjct: 147 FWTWDDWELQLDWMSLHGINLSLAWVGYE 175
>gi|32564213|ref|NP_496948.2| Protein K09E4.4 [Caenorhabditis elegans]
gi|25814792|emb|CAB70170.2| Protein K09E4.4 [Caenorhabditis elegans]
Length = 715
Score = 67.4 bits (163), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 41/118 (34%), Positives = 57/118 (48%), Gaps = 9/118 (7%)
Query: 98 IRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGV 157
I + ++ T + L ++ YLR C S +S S G R D
Sbjct: 77 INKSGRKTVTANTPTDALNAINTYLRTECLSQVS-------WSNSSFSSGCRKRTSDFIN 129
Query: 158 FVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
F + I Y+ N T SY+FAWW+W +WE+ IDW+AL G N L GQE IW+ +F
Sbjct: 130 FESKQI--RYFGNMCTFSYSFAWWEWPQWERFIDWIALNGFNTVLMPLGQEIIWRDIF 185
>gi|154321596|ref|XP_001560113.1| hypothetical protein BC1G_00945 [Botryotinia fuckeliana B05.10]
Length = 701
Score = 67.0 bits (162), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/163 (31%), Positives = 79/163 (48%), Gaps = 15/163 (9%)
Query: 49 PPSVQ-LAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVIS 107
P ++Q + +L+R LP+H F+FRI + G + + T +I+I
Sbjct: 3 PTTLQSIQGLVNLLKRRLPAHVGDFEFRIQNNHTPG-----VTNDEYVVSQSATGKILIE 57
Query: 108 GVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVA--SMPKLGSFPRVQDAGVFVKRPIPL 165
G + + +GL Y I W + +A S+P L S + G + +P
Sbjct: 58 GNSISGIASGLRKYFIDILHVDIWWYIGSQLHLAPESLPTLNSILK----GTSI---VPW 110
Query: 166 NYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQE 208
Y+ N VT SY A+W W+ WE ++DWM+L GINL LA+ G E
Sbjct: 111 RYHFNTVTFSYQAAFWTWEDWELQLDWMSLHGINLSLAWVGYE 153
>gi|225875033|ref|YP_002756492.1| alpha-N-acetylglucosaminidase [Acidobacterium capsulatum ATCC
51196]
gi|225793771|gb|ACO33861.1| alpha-N-acetylglucosaminidase [Acidobacterium capsulatum ATCC
51196]
Length = 800
Score = 66.6 bits (161), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 54/171 (31%), Positives = 76/171 (44%), Gaps = 20/171 (11%)
Query: 48 APPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFI--LRNHPSSYIRGTPEIV 105
A ++ L A A L+RL+PS + +K GGE I R H IV
Sbjct: 65 AKDAIPLQPARAALRRLIPSLQQQITLLPLQRK-GGGERFRICGTRGH----------IV 113
Query: 106 ISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPL 165
++ + V L ++WYL+Y IS T G Q+ L + PR G + P P
Sbjct: 114 VAATSNVAALFAVNWYLKYTAHLQIS---TNGDQLTLRGPLPA-PRHVIEG---RTPYPY 166
Query: 166 NYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y N T Y+ +W W RW++EID A GIN L G E + + F+
Sbjct: 167 RYALNENTDGYSTPYWSWPRWQREIDIFAASGINAMLVERGMEAVLYETFR 217
>gi|336386984|gb|EGO28130.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 738
Score = 66.6 bits (161), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 50/163 (30%), Positives = 76/163 (46%), Gaps = 32/163 (19%)
Query: 58 YAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAG 117
Y+++QR +P+H AF F++ S F L + IS T G
Sbjct: 30 YSLVQRRIPAHAGAFTFKLSSGNSSNVTDTFTLSDQ------------ISATTQAGEFGG 77
Query: 118 LHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP--IPLNYYQNAVTSS 175
+ +W GS ++ +P P V G V R +P Y+ N VT S
Sbjct: 78 VD---IWWTGSRLN----------ELPS--ELPSV---GQPVTRSSLVPYRYHFNTVTFS 119
Query: 176 YTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVA 218
YT A++++ W +DW+AL+G+NLPLA+ G E I +VF+ A
Sbjct: 120 YTAAFYNFDDWSLLLDWLALRGVNLPLAWVGNEYILVQVFREA 162
>gi|187735714|ref|YP_001877826.1| alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
BAA-835]
gi|187425766|gb|ACD05045.1| Alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
BAA-835]
Length = 852
Score = 66.2 bits (160), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 58/189 (30%), Positives = 86/189 (45%), Gaps = 23/189 (12%)
Query: 24 AVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCG 83
A A + +G IS L+ + ER+P +AAA V+ R+ PS + F + +
Sbjct: 5 AFAAGALLGTA-ISPLVCVSAAERSP----VAAAEDVVARVTPSAKGSVLFHLNPDQ--- 56
Query: 84 GEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASM 143
G I S I G + S V ++AG WYL+ H SW+ + +
Sbjct: 57 GNKITI------SGIAGGIRVEASDVR--RLVAGYGWYLKNIAKVHFSWNGNRITLPSPL 108
Query: 144 PKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLA 203
P V + V V+ P + + N T SYT A+WDW RW++EID++AL G L
Sbjct: 109 P-------VPASPVTVESPWNIVFAYNYCTLSYTAAFWDWNRWQREIDFLALNGFTHALV 161
Query: 204 FTGQETIWQ 212
G E W+
Sbjct: 162 TAGLEKTWE 170
>gi|29828556|ref|NP_823190.1| alpha-N-acetylglucosaminidase [Streptomyces avermitilis MA-4680]
gi|29605660|dbj|BAC69725.1| putative alpha-N-acetylglucosaminidase [Streptomyces avermitilis
MA-4680]
Length = 728
Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats.
Identities = 49/156 (31%), Positives = 73/156 (46%), Gaps = 21/156 (13%)
Query: 62 QRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWY 121
+RLLP H+ FR GG F + T + ++G T L GL+WY
Sbjct: 42 RRLLPRHWRQLTFR-----TAGGRDTFQVSGR-------TGRVTVTGGTPATQLTGLNWY 89
Query: 122 LRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP-IPLNYYQNAVTSSYTFAW 180
LR + I+W G Q+ +P+ + P + AG +R +P + N YT +
Sbjct: 90 LRNIADADINW---AGRQL-RLPR--ALPGL--AGTVTRRANVPHRFALNDTNDGYTGPY 141
Query: 181 WDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
DW WE+E+D +AL G N L TG + + +VFQ
Sbjct: 142 HDWTYWERELDVLALHGYNEVLVQTGADALHHRVFQ 177
>gi|308480701|ref|XP_003102557.1| hypothetical protein CRE_04113 [Caenorhabditis remanei]
gi|308261289|gb|EFP05242.1| hypothetical protein CRE_04113 [Caenorhabditis remanei]
Length = 718
Score = 65.9 bits (159), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 52/106 (49%), Gaps = 9/106 (8%)
Query: 110 TGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQ 169
T + L ++ YLR C S +S S G D F + I Y+
Sbjct: 89 TPTDALNAINTYLRTQCLSQVS-------WSNSSFSSGCRKHSNDILNFESKQI--RYFG 139
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
N T SY+FAWW+W +WEK IDW+AL G N L GQE IW+ VF
Sbjct: 140 NMCTFSYSFAWWEWPQWEKFIDWIALNGFNTVLMPLGQEAIWRDVF 185
>gi|169351448|ref|ZP_02868386.1| hypothetical protein CLOSPI_02228 [Clostridium spiroforme DSM 1552]
gi|169291670|gb|EDS73803.1| LPXTG-motif cell wall anchor domain protein [Clostridium spiroforme
DSM 1552]
Length = 1990
Score = 65.5 bits (158), Expect = 1e-08, Method: Composition-based stats.
Identities = 35/113 (30%), Positives = 62/113 (54%), Gaps = 6/113 (5%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+IVI G GV + + ++YLRY+C ++ ++ Q+ MP+ + P++ ++ + +
Sbjct: 229 KIVIRGNDGVSMASAFNYYLRYYC--NVDYNPLFASQL-KMPE--TLPQL-NSKIVKETQ 282
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
Y N T SYT A+W+W +E +DW A+ GIN L GQE + ++
Sbjct: 283 YDYRYALNFCTYSYTMAFWNWDEYEAFLDWAAMSGINTMLDIVGQEEVIRRTL 335
>gi|418473272|ref|ZP_13042874.1| putative alpha-N-acetylglucosaminidase, partial [Streptomyces
coelicoflavus ZG0656]
gi|371546106|gb|EHN74664.1| putative alpha-N-acetylglucosaminidase, partial [Streptomyces
coelicoflavus ZG0656]
Length = 716
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/167 (25%), Positives = 74/167 (44%), Gaps = 15/167 (8%)
Query: 50 PSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGV 109
P +A A L RLLP H + F + + G + R + ++R + G
Sbjct: 38 PGFSTHSARAALTRLLPRHVTQFTLVPVRRPASGDWFGVSGR---AGHVR------VRGT 88
Query: 110 TGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQ 169
+ +L+G++WYL+Y + W G A +P++ P + V +P +
Sbjct: 89 SPAAILSGVNWYLKYTAKVDLGWP---GRSTAKLPRVLPAP---EGTVRRHASVPHRFAL 142
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
N Y+ A+ DW +EK+ID +AL G+N G + ++ + Q
Sbjct: 143 NDTDDGYSGAYRDWASYEKQIDLLALHGVNEVFVQMGADAVYYETLQ 189
>gi|291301158|ref|YP_003512436.1| alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
44728]
gi|290570378|gb|ADD43343.1| Alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
44728]
Length = 734
Score = 65.5 bits (158), Expect = 2e-08, Method: Composition-based stats.
Identities = 50/174 (28%), Positives = 80/174 (45%), Gaps = 24/174 (13%)
Query: 48 APPSV------QLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGT 101
APP V + A L+RLLP H++ F + G F + ++
Sbjct: 29 APPDVAYVEPTNVRPARESLKRLLPKHHTQFALSLT-----GEVDRFTVSGREAA----- 78
Query: 102 PEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKR 161
I ++G T VL G++WYL++ G +SW G ++++ K PR + V R
Sbjct: 79 --IEVTGSTPAVVLTGVNWYLKHVAGVDVSWP---GSSLSALAKRLPAPRQRIEHTAVVR 133
Query: 162 PIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+ N YT + DW WE+E+D +AL G N L TG + ++++VF
Sbjct: 134 H---RFAFNDTHEGYTGPYRDWTAWERELDLLALHGYNEVLLTTGTDAVYREVF 184
>gi|365876979|ref|ZP_09416485.1| alpha-N-acetylglucosaminidase [Elizabethkingia anophelis Ag1]
gi|442587289|ref|ZP_21006107.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Elizabethkingia
anophelis R26]
gi|365755253|gb|EHM97186.1| alpha-N-acetylglucosaminidase [Elizabethkingia anophelis Ag1]
gi|442562959|gb|ELR80176.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Elizabethkingia
anophelis R26]
Length = 712
Score = 65.5 bits (158), Expect = 2e-08, Method: Composition-based stats.
Identities = 31/95 (32%), Positives = 52/95 (54%), Gaps = 7/95 (7%)
Query: 117 GLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSY 176
GL WY++++ IS + +P + P ++ + +P Y N T +Y
Sbjct: 77 GLDWYVKHYAHQSISHFGNNTRNINKLPVVS--PLLKKTSL-----VPYRYALNYCTINY 129
Query: 177 TFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIW 211
+F+++ W+ WEKE+DWMAL G+N+ LA G E +W
Sbjct: 130 SFSFYTWEDWEKELDWMALNGVNIMLAPVGTELVW 164
>gi|210631701|ref|ZP_03296968.1| hypothetical protein COLSTE_00853, partial [Collinsella stercoris
DSM 13279]
gi|210159960|gb|EEA90931.1| F5/8 type C domain protein, partial [Collinsella stercoris DSM
13279]
Length = 1906
Score = 65.5 bits (158), Expect = 2e-08, Method: Composition-based stats.
Identities = 36/119 (30%), Positives = 60/119 (50%), Gaps = 12/119 (10%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQV---ASMPKLGSFPRVQDAG 156
G I + G G+ + +GL++YLR +C + ++ G Q+ A++P +G G
Sbjct: 220 GGGRIKVRGNDGISLASGLNYYLRNYC--KVDYNPLFGSQLDMPATLPSVGE-------G 270
Query: 157 VFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
+ Y N T SYT A+W+W +E +DW A+ G+NL L GQE + ++
Sbjct: 271 ILKFTDYEYRYALNFCTYSYTMAFWNWDEYEPFLDWCAMNGVNLVLDIVGQEEVLRQTL 329
>gi|126347839|emb|CAJ89559.1| putative alpha-N-acetylglucosaminidase [Streptomyces ambofaciens
ATCC 23877]
Length = 740
Score = 65.1 bits (157), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/168 (27%), Positives = 75/168 (44%), Gaps = 17/168 (10%)
Query: 50 PSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGV 109
P+ A A L RLLP H S F + + + G +Y + +RGT V+
Sbjct: 38 PAFATQPARAALARLLPRHASQFTLVAVDRPESG-DYFDVSGRAGRVRVRGTSPAVL--- 93
Query: 110 TGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAG-VFVKRPIPLNYY 168
L+G++WYL++ + W G A +P+ P AG V + +P +
Sbjct: 94 -----LSGVNWYLKHTAEVDLGWP---GRSTARLPRTLPAP----AGPVHRQATVPHRFA 141
Query: 169 QNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
N Y+ A+ DW +EK+ID +AL G+N G + ++ + Q
Sbjct: 142 LNDTDDGYSGAYRDWASYEKQIDLLALHGVNEVFVQMGADAVYYETLQ 189
>gi|322699924|gb|EFY91682.1| alpha-N-acetylglucosaminidase, putative [Metarhizium acridum CQMa
102]
Length = 775
Score = 64.3 bits (155), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 47/150 (31%), Positives = 73/150 (48%), Gaps = 9/150 (6%)
Query: 62 QRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWY 121
RL SAF+F + ++ + + N + R + +I + G T L L
Sbjct: 35 DRLFNGQGSAFEFVLTTRHDDWSRWNPPV-NDNYTVQRASGKIRVEGTT----LNALARG 89
Query: 122 LRYWCGSHISWDKTGGVQV-ASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAW 180
LR++ S + D+ + + P+ P+ +G V +P Y N VT SYTF W
Sbjct: 90 LRHYANSVLQMDEFWFIDTYKTAPERLPAPKEPLSGASV---VPWRYNLNTVTFSYTFPW 146
Query: 181 WDWKRWEKEIDWMALQGINLPLAFTGQETI 210
+ W+ WEK +DW AL+G+NL LA+ G E I
Sbjct: 147 YQWEDWEKLLDWAALRGVNLQLAWVGYEKI 176
>gi|453051703|gb|EME99203.1| alpha-N-acetylglucosaminidase [Streptomyces mobaraensis NBRC 13819
= DSM 40847]
Length = 763
Score = 64.3 bits (155), Expect = 4e-08, Method: Composition-based stats.
Identities = 41/155 (26%), Positives = 65/155 (41%), Gaps = 16/155 (10%)
Query: 61 LQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHW 120
L RLLP H+ FR + ++ Y R +++ G T L G H
Sbjct: 76 LARLLPRHHRQVTFRTVPRRSGADVYRVTGR---------AGHLLVEGSTPAVQLTGFHR 126
Query: 121 YLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAW 180
YLR +H SW A++P + + + + + N YT +
Sbjct: 127 YLRDVAHAHFSWSGERTALPATLPAV-------RGAIAAEANVRHRFALNDTNDGYTGPY 179
Query: 181 WDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
DW WE+EID +AL G N L +TG + ++++ F
Sbjct: 180 RDWAYWEREIDVLALHGFNEVLVYTGADAVYRRTF 214
>gi|373461651|ref|ZP_09553390.1| hypothetical protein HMPREF9944_01654 [Prevotella maculosa OT 289]
gi|371951955|gb|EHO69797.1| hypothetical protein HMPREF9944_01654 [Prevotella maculosa OT 289]
Length = 713
Score = 63.2 bits (152), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 58/118 (49%), Gaps = 7/118 (5%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++VI L++YL +C H S G ++ + S P +++ +
Sbjct: 64 KLVIQASGDNVACMALNYYLENYC--HRSMSHMGD----NLKPVQSLPEIKEP-LTKHIE 116
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVAFC 220
+PL Y N T +YT +++ W+ WE E+DWMAL G+NL L G E +WQ + C
Sbjct: 117 LPLRYALNYCTFNYTMSFYQWQDWEHELDWMALHGVNLMLMPVGMEKVWQNTLRKFGC 174
>gi|322703040|gb|EFY94656.1| alpha-N-acetylglucosaminidase, putative [Metarhizium anisopliae
ARSEF 23]
Length = 774
Score = 63.2 bits (152), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 46/138 (33%), Positives = 66/138 (47%), Gaps = 13/138 (9%)
Query: 83 GGEYCFILRNHPSSYIRGTPEI----VISGVTG---VE--VLAGLHWYLRYWCGSHISWD 133
G + F+L P + R P + + G G VE L L LR++ + D
Sbjct: 40 GSAFEFVLTTRPEDWSRWNPPVNDNYTVQGARGKIRVEGTSLNALARGLRHYANDVLQMD 99
Query: 134 KTGGVQV-ASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDW 192
+ V + P+ P+ +G V +P Y N VT SYTF W+ W+ WEK +DW
Sbjct: 100 EFWFVDTYKTAPQHLPAPKEPLSGASV---VPWRYNLNTVTFSYTFPWYQWEDWEKLLDW 156
Query: 193 MALQGINLPLAFTGQETI 210
AL+G+NL LA+ G E I
Sbjct: 157 AALRGVNLQLAWVGYEKI 174
>gi|293402299|ref|ZP_06646437.1| putative alpha-N-acetylglucosaminidase [Erysipelotrichaceae
bacterium 5_2_54FAA]
gi|291304406|gb|EFE45657.1| putative alpha-N-acetylglucosaminidase [Erysipelotrichaceae
bacterium 5_2_54FAA]
Length = 2330
Score = 62.8 bits (151), Expect = 9e-08, Method: Composition-based stats.
Identities = 40/111 (36%), Positives = 62/111 (55%), Gaps = 7/111 (6%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+I I G G + GL++Y + + IS ++T +QV +MPK +V+ V + P
Sbjct: 540 KIHIKGNVGTSITYGLNYYFKNYLKVQIS-EQT--MQV-NMPK--EIVKVEGT-VRKETP 592
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQK 213
+ Y N T SYTFA++ + W++E DW+AL G+N+ L GQE W K
Sbjct: 593 YKVRYAYNYCTLSYTFAFFGEEEWQRENDWLALNGVNVVLDVAGQEATWIK 643
>gi|374985456|ref|YP_004960951.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
gi|297156108|gb|ADI05820.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
Length = 1039
Score = 62.8 bits (151), Expect = 1e-07, Method: Composition-based stats.
Identities = 47/162 (29%), Positives = 75/162 (46%), Gaps = 17/162 (10%)
Query: 55 AAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEV 114
A A A L+RL+P+H+ R ++ GG F + S I + G + +
Sbjct: 46 APATAALRRLVPAHHQQVTLRAVA---AGGGDHFRVTGRAGS-------ITVEGTSPAVL 95
Query: 115 LAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTS 174
L G + YL + ISW+ G Q+ ++P+L P + G + + N
Sbjct: 96 LTGFNRYLAEAARADISWN---GEQL-NLPRLLPAPGAEITGA---ANVAHRFAYNDTNE 148
Query: 175 SYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
YT A+ WK WE+EID +AL GIN L + G + ++ F+
Sbjct: 149 GYTGAYRGWKAWEREIDVLALHGINEVLVYIGADAVYYDTFR 190
>gi|429766730|ref|ZP_19298977.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
celatum DSM 1785]
gi|429183354|gb|EKY24416.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
celatum DSM 1785]
Length = 2284
Score = 62.8 bits (151), Expect = 1e-07, Method: Composition-based stats.
Identities = 34/113 (30%), Positives = 60/113 (53%), Gaps = 6/113 (5%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
+I+I G GV + +G ++YL+ + + + ++ G + +MP+ P V + + + P
Sbjct: 247 KILIRGNNGVSLASGFNYYLKNY--AKVMYNPIMGSNL-NMPE--ELPAVGER-IIIDTP 300
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
Y N T SYT ++WDW +E +DW A+ G NL L GQE + ++
Sbjct: 301 YEERYALNFCTYSYTMSFWDWDEYEAFLDWSAMNGYNLMLDIVGQEEVLRRTL 353
>gi|255944663|ref|XP_002563099.1| Pc20g05650 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211587834|emb|CAP85894.1| Pc20g05650 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 248
Score = 61.6 bits (148), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/45 (60%), Positives = 35/45 (77%)
Query: 172 VTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
VT SYT A+W W+ WE ++DWMAL+G+NLPLA+ GQE I VF+
Sbjct: 151 VTFSYTTAFWTWEDWESQLDWMALRGVNLPLAWVGQEKIILAVFR 195
>gi|403512485|ref|YP_006644123.1| alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain protein
[Nocardiopsis alba ATCC BAA-2165]
gi|402798758|gb|AFR06168.1| alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain protein
[Nocardiopsis alba ATCC BAA-2165]
Length = 718
Score = 61.2 bits (147), Expect = 3e-07, Method: Composition-based stats.
Identities = 23/44 (52%), Positives = 30/44 (68%)
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETI 210
Y+ N T+ YT A+W+W WE+EIDWMAL G+ PL TG E +
Sbjct: 87 YHLNFCTTGYTAAYWNWADWEREIDWMALHGVTTPLTLTGHEAV 130
>gi|329940646|ref|ZP_08289927.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
gi|329300707|gb|EGG44604.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
Length = 798
Score = 61.2 bits (147), Expect = 3e-07, Method: Composition-based stats.
Identities = 43/152 (28%), Positives = 68/152 (44%), Gaps = 19/152 (12%)
Query: 66 PSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYW 125
P H+ F FR GG F + + I+G T L GL+WYLR+
Sbjct: 55 PRHWHQFSFRTE-----GGRDTFTVSGEAG-------RLRITGGTPGTQLTGLNWYLRHV 102
Query: 126 CGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP-IPLNYYQNAVTSSYTFAWWDWK 184
+ ++W G Q+ +P++ P + +R +P + N YT + DW
Sbjct: 103 ADADVNW---AGEQL-DLPRI--LPGMPGGTAVTRRAGVPHRFALNDTNEGYTGPYHDWS 156
Query: 185 RWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
WE+E+D +AL G N L G + ++ +VFQ
Sbjct: 157 YWERELDVLALHGYNQVLVTVGADALYHRVFQ 188
>gi|392584963|gb|EIW74305.1| glycoside hydrolase family 89 protein [Coniophora puteana
RWD-64-598 SS2]
Length = 772
Score = 60.8 bits (146), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 75/167 (44%), Gaps = 13/167 (7%)
Query: 54 LAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVE 113
LA +++R + +H + F F + + G + + S G I I T
Sbjct: 28 LAGIRDLVKRRISNHVNDFTFTLSTANDTGSDLDTFTISDASDSSGG---IDIECSTASA 84
Query: 114 VLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP--IPLNYYQNA 171
GL+ Y I W G ++ ++P PRV G + R + Y+ N
Sbjct: 85 CARGLYTYATEVGNVDIWWT---GSRLDALPS--PLPRV---GEPITRSAIVEYRYFLNT 136
Query: 172 VTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVA 218
VT YT WW +++WE +DW AL+G+NLPLA+ G E + F+ A
Sbjct: 137 VTFGYTANWWTFEQWELLLDWAALRGVNLPLAWVGYEHTLAETFRDA 183
>gi|403416059|emb|CCM02759.1| predicted protein [Fibroporia radiculosa]
Length = 705
Score = 60.5 bits (145), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 27/50 (54%), Positives = 38/50 (76%)
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
Y+ N VT YT A+W++ +WE E+DW+AL+G+NLPLA+ G E I +VFQ
Sbjct: 83 YHFNTVTFDYTAAFWNFDQWELELDWLALRGVNLPLAWVGYEYILVQVFQ 132
>gi|260910505|ref|ZP_05917173.1| periplasmic beta-glucosidase [Prevotella sp. oral taxon 472 str.
F0295]
gi|260635347|gb|EEX53369.1| periplasmic beta-glucosidase [Prevotella sp. oral taxon 472 str.
F0295]
Length = 1566
Score = 60.5 bits (145), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 58/116 (50%), Gaps = 11/116 (9%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
+V+ + GL+ YL ++C HIS + G ++P +Q V+R
Sbjct: 70 LVVKATSPSAAAVGLNHYLNHYC--HISISRCGN----NLPTHFRLVPIQGT---VRRTT 120
Query: 164 PLNYYQ--NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQV 217
P Y N T +Y++A++ W +E E+DWMAL G+NL LA G E +W + +
Sbjct: 121 PFKYRYALNYCTYNYSYAFYRWPDFEWELDWMALNGVNLMLAPLGMEAVWAETLKT 176
>gi|373451393|ref|ZP_09543318.1| hypothetical protein HMPREF0984_00360, partial [Eubacterium sp.
3_1_31]
gi|371968665|gb|EHO86120.1| hypothetical protein HMPREF0984_00360, partial [Eubacterium sp.
3_1_31]
Length = 2190
Score = 60.1 bits (144), Expect = 6e-07, Method: Composition-based stats.
Identities = 45/169 (26%), Positives = 78/169 (46%), Gaps = 27/169 (15%)
Query: 54 LAAAYAVLQRLLPSHY-SAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGV 112
L A A++ R++ S + S+F+F + SK G F + + G I++ G G+
Sbjct: 217 LNEASALVGRVIGSKWQSSFKFELRSKMN--GRDVFEIEDG------GNNTIIVRGNNGI 268
Query: 113 EVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI------PLN 166
+ +G ++YL+ + V P S ++ V V + I
Sbjct: 269 SLASGFNYYLKNY------------AMVDYNPLFDSNTEMKKGIVPVGKKIVKDTQYEYR 316
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
Y N T SYT ++W+W ++E+ IDW A+ G+NL L GQE + ++
Sbjct: 317 YALNFCTYSYTMSFWNWDQYEEFIDWAAMNGVNLMLDIVGQEEVLRQTL 365
>gi|296115989|ref|ZP_06834611.1| alpha-N-acetylglucosaminidase [Gluconacetobacter hansenii ATCC
23769]
gi|295977458|gb|EFG84214.1| alpha-N-acetylglucosaminidase [Gluconacetobacter hansenii ATCC
23769]
Length = 758
Score = 60.1 bits (144), Expect = 7e-07, Method: Composition-based stats.
Identities = 42/162 (25%), Positives = 72/162 (44%), Gaps = 15/162 (9%)
Query: 54 LAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVE 113
+ AA V+ RLLP+H + E+ + + H +I ++ +
Sbjct: 51 IQAARNVVARLLPAHAGQVTLTLRPASGTTAEH-YRISGHKG-------DIHVTASSDSA 102
Query: 114 VLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVT 173
+L GL+WYL+Y + IS + +A +P G ++ F P Y N T
Sbjct: 103 LLFGLNWYLKYVAHAGISPNGITPPAMAQLPAPGH--PIEGEATF-----PYRYALNENT 155
Query: 174 SSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
YT +W+W+RW+ E+D A+ G+N L G + + + F
Sbjct: 156 DGYTSPYWNWERWQHELDIYAMNGLNTLLIERGTDAVLYRTF 197
>gi|345881765|ref|ZP_08833275.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
gi|343918424|gb|EGV29187.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
Length = 1552
Score = 59.7 bits (143), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 61/120 (50%), Gaps = 10/120 (8%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAG-VFVKR 161
+++I T L+ YL +C H+S ++P +F V G V ++
Sbjct: 66 QLLIRANTTSAAAMALNHYLNTYC--HVSLSHNAD----NLPP--TFQLVPIKGEVRMET 117
Query: 162 PIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF-QVAFC 220
P Y N T +YT+++++W +E+E+DWMAL GINL LA G E +W + Q+ F
Sbjct: 118 PFKYRYALNYCTYNYTYSFYNWSDFERELDWMALNGINLMLAPMGMEKVWMETLTQLGFS 177
>gi|347541919|ref|YP_004856555.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-rat-Yit]
gi|346984954|dbj|BAK80629.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-rat-Yit]
Length = 912
Score = 58.9 bits (141), Expect = 2e-06, Method: Composition-based stats.
Identities = 31/111 (27%), Positives = 58/111 (52%), Gaps = 6/111 (5%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++++ G + L++Y ++ ++++ G ++ ++ L PRV D +
Sbjct: 233 KVMLKGNNINSICVALNYYFEHYLEQ--TFERFGDSKIKAILPL---PRV-DNKIEKNID 286
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQK 213
+ Y N V YT A+WD+K WE+EIDWM+L G N+ L G E + ++
Sbjct: 287 MEFRYNYNYVAYGYTMAYWDFKDWEREIDWMSLNGFNMALNLVGYEEVVRR 337
>gi|293402122|ref|ZP_06646261.1| alpha-N-acetylglucosaminidase family protein [Erysipelotrichaceae
bacterium 5_2_54FAA]
gi|291304514|gb|EFE45764.1| alpha-N-acetylglucosaminidase family protein [Erysipelotrichaceae
bacterium 5_2_54FAA]
Length = 2295
Score = 58.5 bits (140), Expect = 2e-06, Method: Composition-based stats.
Identities = 45/169 (26%), Positives = 77/169 (45%), Gaps = 27/169 (15%)
Query: 54 LAAAYAVLQRLLPSHY-SAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGV 112
L A A++ R++ S + S+F+F + SK G F + + G I++ G G+
Sbjct: 225 LNEASALVGRVIGSKWQSSFKFELRSKMN--GRDVFEIEDG------GNNTIIVRGNNGI 276
Query: 113 EVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI------PLN 166
+ +G ++YL+ + V P S ++ V V + I
Sbjct: 277 SLASGFNYYLKNY------------AMVDYNPLFDSNTEMKKGIVPVGKKIVKDTQYEYR 324
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
Y N T SYT ++W+W ++E+ IDW A+ G NL L GQE + ++
Sbjct: 325 YALNFCTYSYTMSFWNWDQYEEFIDWAAMNGANLMLDIVGQEEVLRQTL 373
>gi|282881077|ref|ZP_06289764.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella timonensis CRIS
5C-B1]
gi|281304881|gb|EFA96954.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella timonensis CRIS
5C-B1]
Length = 688
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 38/61 (62%)
Query: 157 VFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+ + P Y N T +Y++A++ WK +E+E+DWMAL GINL LA G E +WQ+ +
Sbjct: 81 IRISTPFRYRYSLNYCTYNYSYAFYKWKDFERELDWMALHGINLMLAPLGMEKVWQETLR 140
Query: 217 V 217
Sbjct: 141 A 141
>gi|440695019|ref|ZP_20877582.1| Tat pathway signal sequence domain protein [Streptomyces
turgidiscabies Car8]
gi|440282912|gb|ELP70302.1| Tat pathway signal sequence domain protein [Streptomyces
turgidiscabies Car8]
Length = 1050
Score = 58.2 bits (139), Expect = 3e-06, Method: Composition-based stats.
Identities = 35/113 (30%), Positives = 57/113 (50%), Gaps = 7/113 (6%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPI 163
IV+ G + +L G++ YL + ISW+ G Q+ +P+L P + G +
Sbjct: 87 IVVEGTSPAVLLTGVNTYLARVAQADISWN---GAQL-DLPRLLPAPHREITG---SANV 139
Query: 164 PLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
P + N YT + DW WE+E+D +AL GIN L +TG + ++ F+
Sbjct: 140 PHRFALNDTNDGYTGPYRDWDAWERELDVLALHGINEVLVYTGGDAVYYDTFR 192
>gi|346323119|gb|EGX92717.1| alpha-N-acetylglucosaminidase, putative [Cordyceps militaris CM01]
Length = 742
Score = 57.4 bits (137), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 34/54 (62%)
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+P Y N VT +Y+FAW+ W WEK +DW AL G+N LA+ G E I+ F+
Sbjct: 103 VPWRYNLNTVTFAYSFAWYQWSDWEKLLDWAALHGVNFQLAWVGYEKIYLDSFR 156
>gi|291302495|ref|YP_003513773.1| alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
44728]
gi|290571715|gb|ADD44680.1| Alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
44728]
Length = 696
Score = 57.4 bits (137), Expect = 4e-06, Method: Composition-based stats.
Identities = 46/166 (27%), Positives = 67/166 (40%), Gaps = 22/166 (13%)
Query: 54 LAAAYAVLQRLLP----SHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGV 109
L +AYA ++RL P +H F I + G + + +SG
Sbjct: 6 LESAYAAMRRLAPRLWETHREKFTVDITGEAADGYRFA-----------ADGERLSLSGN 54
Query: 110 TGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQ 169
VL G YL SH+ GG + ++P+ P + V P Y
Sbjct: 55 DIGSVLTGFRHYLE---SSHLGHISRGGDRF-TVPETLPLP---ERPVSRTSPHRFRYAT 107
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
N + YT +W W RWE+E+D +A GINL L G + +W F
Sbjct: 108 NFTVTGYTSPYWQWPRWERELDLLAASGINLSLVTVGTDAVWLDTF 153
>gi|417963469|ref|ZP_12605415.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-3]
gi|380332825|gb|EIA23540.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-3]
Length = 315
Score = 57.0 bits (136), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 31/119 (26%), Positives = 63/119 (52%), Gaps = 12/119 (10%)
Query: 101 TPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQV-ASMPKLGSFPRVQDAGVFV 159
++++ G + L++Y ++ ++++ G ++ ++P P+++ GV +
Sbjct: 185 NDKVILKGNNINSISVALNYYFEHYLEQ--TFERFGNSKIKVTLP----LPQIE--GV-I 235
Query: 160 KRPIPLNY-YQ-NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
++ I + Y Y N V YT A+W + WE+EIDWMAL G N+ L G E + ++ +
Sbjct: 236 EKSIDMKYRYNYNYVAYGYTMAYWTFDEWEREIDWMALNGFNMALNLVGHEEVVRRFLK 294
>gi|406693970|gb|EKC97309.1| alpha-N-acetylglucosaminidase, putative [Trichosporon asahii var.
asahii CBS 8904]
Length = 781
Score = 56.6 bits (135), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 35/47 (74%)
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
N VT SY+ A++DW +WE +DW L G NLPLA+TGQE ++ +V++
Sbjct: 166 NLVTHSYSLAFYDWAKWEYLLDWAVLHGYNLPLAYTGQEYVYAQVWK 212
>gi|401885538|gb|EJT49648.1| alpha-N-acetylglucosaminidase, putative [Trichosporon asahii var.
asahii CBS 2479]
Length = 781
Score = 56.6 bits (135), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 35/47 (74%)
Query: 170 NAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
N VT SY+ A++DW +WE +DW L G NLPLA+TGQE ++ +V++
Sbjct: 166 NLVTHSYSLAFYDWAKWEYLLDWAVLHGYNLPLAYTGQEYVYAQVWK 212
>gi|384455191|ref|YP_005667784.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-mouse-Yit]
gi|418016862|ref|ZP_12656425.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-NYU]
gi|418371995|ref|ZP_12964091.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-SU]
gi|345505596|gb|EGX27892.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-NYU]
gi|346983532|dbj|BAK79208.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-mouse-Yit]
gi|380342872|gb|EIA31299.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-SU]
Length = 898
Score = 56.2 bits (134), Expect = 8e-06, Method: Composition-based stats.
Identities = 26/82 (31%), Positives = 39/82 (47%), Gaps = 13/82 (15%)
Query: 148 SFPRVQDAGVFVKRPIP-------------LNYYQNAVTSSYTFAWWDWKRWEKEIDWMA 194
+F R ++ + V P+P Y N V YT A+W + WE+EIDWMA
Sbjct: 247 TFERFGNSKIKVTLPLPQIEGVIEKSIDMKYRYNYNYVAYGYTMAYWTFDEWEREIDWMA 306
Query: 195 LQGINLPLAFTGQETIWQKVFQ 216
L G N+ L G E + ++ +
Sbjct: 307 LNGFNMALNLVGHEEVVRRFLK 328
>gi|342731751|ref|YP_004770590.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-Japan]
gi|342329206|dbj|BAK55848.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-mouse-Japan]
Length = 898
Score = 56.2 bits (134), Expect = 8e-06, Method: Composition-based stats.
Identities = 26/82 (31%), Positives = 39/82 (47%), Gaps = 13/82 (15%)
Query: 148 SFPRVQDAGVFVKRPIP-------------LNYYQNAVTSSYTFAWWDWKRWEKEIDWMA 194
+F R ++ + V P+P Y N V YT A+W + WE+EIDWMA
Sbjct: 247 TFERFGNSKIKVTLPLPQIEGVIEKSIDMKYRYNYNYVAYGYTMAYWTFDEWEREIDWMA 306
Query: 195 LQGINLPLAFTGQETIWQKVFQ 216
L G N+ L G E + ++ +
Sbjct: 307 LNGFNMALNLVGHEEVVRRFLK 328
>gi|417967717|ref|ZP_12608785.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-co]
gi|380340884|gb|EIA29424.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-co]
Length = 741
Score = 56.2 bits (134), Expect = 9e-06, Method: Composition-based stats.
Identities = 26/82 (31%), Positives = 39/82 (47%), Gaps = 13/82 (15%)
Query: 148 SFPRVQDAGVFVKRPIP-------------LNYYQNAVTSSYTFAWWDWKRWEKEIDWMA 194
+F R ++ + V P+P Y N V YT A+W + WE+EIDWMA
Sbjct: 90 TFERFGNSKIKVTLPLPQIEGVIEKSIDMKYRYNYNYVAYGYTMAYWTFDEWEREIDWMA 149
Query: 195 LQGINLPLAFTGQETIWQKVFQ 216
L G N+ L G E + ++ +
Sbjct: 150 LNGFNMALNLVGHEEVVRRFLK 171
>gi|160914140|ref|ZP_02076362.1| hypothetical protein EUBDOL_00149 [Eubacterium dolichum DSM 3991]
gi|158433951|gb|EDP12240.1| hypothetical protein EUBDOL_00149 [Eubacterium dolichum DSM 3991]
Length = 2150
Score = 56.2 bits (134), Expect = 9e-06, Method: Composition-based stats.
Identities = 38/121 (31%), Positives = 59/121 (48%), Gaps = 7/121 (5%)
Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
G I+I G GV + +G ++YL+ + D + KLG P V++ +
Sbjct: 253 GDNSIIIRGNDGVSLASGFNYYLKNFAMV----DYNPLYDSNTEMKLGLVP-VKNK-IVK 306
Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQ-ETIWQKVFQVA 218
+ + Y N T SYT A+W W +++ IDW A+ G+NL L GQ E I Q + +
Sbjct: 307 ETQYDVRYALNFCTYSYTMAFWSWDEYQEFIDWAAMNGVNLMLDIVGQEEVIRQTLLEYG 366
Query: 219 F 219
F
Sbjct: 367 F 367
>gi|417965571|ref|ZP_12607078.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-4]
gi|380336329|gb|EIA26351.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-4]
Length = 685
Score = 56.2 bits (134), Expect = 1e-05, Method: Composition-based stats.
Identities = 26/82 (31%), Positives = 39/82 (47%), Gaps = 13/82 (15%)
Query: 148 SFPRVQDAGVFVKRPIP-------------LNYYQNAVTSSYTFAWWDWKRWEKEIDWMA 194
+F R ++ + V P+P Y N V YT A+W + WE+EIDWMA
Sbjct: 41 TFERFGNSKIKVTLPLPQIEGVIEKSIDMKYRYNYNYVAYGYTMAYWTFDEWEREIDWMA 100
Query: 195 LQGINLPLAFTGQETIWQKVFQ 216
L G N+ L G E + ++ +
Sbjct: 101 LNGFNMALNLVGHEEVVRRFLK 122
>gi|318078904|ref|ZP_07986236.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SA3_actF]
Length = 719
Score = 55.8 bits (133), Expect = 1e-05, Method: Composition-based stats.
Identities = 34/116 (29%), Positives = 52/116 (44%), Gaps = 10/116 (8%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISW--DKTGGVQVASMPKLGSFPRVQDAGVFVK 160
+I I+G LAGL YL+ + +W + G + P+ G D
Sbjct: 62 QITIAGTNPATQLAGLRRYLKQAGLAQFTWAGSRAGLPERVPAPEGGRLEGRADT----- 116
Query: 161 RPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
P + N YT + DW WE+EID +AL G N L + G + ++Q++FQ
Sbjct: 117 ---PHRFVLNDTNDGYTGPYHDWAYWEREIDLLALHGFNEVLVYAGADAVYQRLFQ 169
>gi|318057780|ref|ZP_07976503.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SA3_actG]
Length = 741
Score = 55.8 bits (133), Expect = 1e-05, Method: Composition-based stats.
Identities = 34/116 (29%), Positives = 52/116 (44%), Gaps = 10/116 (8%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISW--DKTGGVQVASMPKLGSFPRVQDAGVFVK 160
+I I+G LAGL YL+ + +W + G + P+ G D
Sbjct: 84 QITIAGTNPATQLAGLRRYLKQAGLAQFTWAGSRAGLPERVPAPEGGRLEGRADT----- 138
Query: 161 RPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
P + N YT + DW WE+EID +AL G N L + G + ++Q++FQ
Sbjct: 139 ---PHRFVLNDTNDGYTGPYHDWAYWEREIDLLALHGFNEVLVYAGADAVYQRLFQ 191
>gi|333023613|ref|ZP_08451677.1| putative alpha-N-acetylglucosaminidase [Streptomyces sp. Tu6071]
gi|332743465|gb|EGJ73906.1| putative alpha-N-acetylglucosaminidase [Streptomyces sp. Tu6071]
Length = 741
Score = 55.8 bits (133), Expect = 1e-05, Method: Composition-based stats.
Identities = 34/116 (29%), Positives = 52/116 (44%), Gaps = 10/116 (8%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISW--DKTGGVQVASMPKLGSFPRVQDAGVFVK 160
+I I+G LAGL YL+ + +W + G + P+ G D
Sbjct: 84 QITIAGTNPATQLAGLRRYLKQAGLAQFTWAGSRAGLPERVPAPEGGRLEGRADT----- 138
Query: 161 RPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
P + N YT + DW WE+EID +AL G N L + G + ++Q++FQ
Sbjct: 139 ---PHRFVLNDTNDGYTGPYHDWAYWEREIDLLALHGFNEVLVYAGADAVYQRLFQ 191
>gi|404403947|ref|ZP_10995531.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
Length = 828
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 32/58 (55%)
Query: 159 VKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
V P Y N T YT +W W WE+EIDWMAL G ++PLA E I +V++
Sbjct: 103 VTSPFERRLYMNVCTFGYTSPFWGWDEWEREIDWMALHGFDMPLAPIAGEAILARVWR 160
>gi|261880009|ref|ZP_06006436.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
gi|270333325|gb|EFA44111.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
Length = 722
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 47/112 (41%), Gaps = 8/112 (7%)
Query: 103 EIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRP 162
++ I G + + + YL+ +W T Q V + P
Sbjct: 65 QLTIQGSSPTAMTYAFYQYLKRTRQGMFTWSGTNARQTVEF--------VDHPLTTGRSP 116
Query: 163 IPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKV 214
Y+ N T YT +WDW RWE+E+D MAL G N+ LA E I ++V
Sbjct: 117 YSYRYFLNVCTFGYTMPYWDWARWEQELDLMALHGTNMILASVASEAIAERV 168
>gi|417960062|ref|ZP_12602729.1| alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-1]
gi|380331910|gb|EIA22854.1| alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-1]
Length = 155
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/117 (25%), Positives = 55/117 (47%), Gaps = 8/117 (6%)
Query: 101 TPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVK 160
++++ G + L++Y ++ ++++ G ++ + P Q GV K
Sbjct: 20 NDKVILKGNNINSISVALNYYFEHYLEQ--TFERFGNSKIKV-----TLPLPQIEGVIEK 72
Query: 161 R-PIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
+ Y N V YT A+W + WE+EIDWMAL G N+ L G E + ++ +
Sbjct: 73 SIDMKYRYNYNYVAYGYTMAYWTFDEWEREIDWMALNGFNMALNLVGHEEVVRRFLK 129
>gi|317158657|ref|XP_001827155.2| alpha-N-acetylglucosaminidase [Aspergillus oryzae RIB40]
Length = 849
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 31/44 (70%)
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETI 210
Y + VT SYT A+W W+ WE E+DW AL+G+NL LA+ G E +
Sbjct: 84 YMHHTVTFSYTTAFWTWEDWELELDWAALRGVNLILAWVGYEKV 127
>gi|238506383|ref|XP_002384393.1| alpha-N-acetylglucosaminidase, putative [Aspergillus flavus
NRRL3357]
gi|220689106|gb|EED45457.1| alpha-N-acetylglucosaminidase, putative [Aspergillus flavus
NRRL3357]
Length = 669
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 31/44 (70%)
Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETI 210
Y + VT SYT A+W W+ WE E+DW AL+G+N+ LA+ G E +
Sbjct: 36 YMHHTVTFSYTTAFWTWEDWELELDWAALRGVNVILAWVGYEKV 79
>gi|302526099|ref|ZP_07278441.1| alpha-N-acetylglucosaminidase [Streptomyces sp. AA4]
gi|302434994|gb|EFL06810.1| alpha-N-acetylglucosaminidase [Streptomyces sp. AA4]
Length = 860
Score = 50.4 bits (119), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/175 (22%), Positives = 70/175 (40%), Gaps = 21/175 (12%)
Query: 48 APPSVQLAAAYAVLQRL------LPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGT 101
AP S Q A A+ LP + F + ++ G + R
Sbjct: 17 APGSAQAAPAFDTSAASAAVARVLPRQAAQFTLTPVQRRPSGDYFTVKGRAG-------- 68
Query: 102 PEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKR 161
+I + G + +L G++WYL+Y +SW ++ + S P Q A V
Sbjct: 69 -KIEVQGTSPGVLLTGVNWYLKYTAKVDVSWPGDSLSRLPATLPAPSQPVTQSAVV---- 123
Query: 162 PIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
P + N S Y+ + W +E++ID +AL G+N G + ++ + F+
Sbjct: 124 --PNRFALNDTDSGYSGPYKSWADYERQIDVLALHGVNEVFVDIGTDAVYDRTFR 176
>gi|391873368|gb|EIT82411.1| alpha-N-acetylglucosaminidase [Aspergillus oryzae 3.042]
Length = 633
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 30/42 (71%)
Query: 169 QNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETI 210
+ VT SYT A+W W+ WE E+DW AL+G+NL LA+ G E +
Sbjct: 2 HHTVTFSYTTAFWTWEDWELELDWAALRGVNLILAWVGYEKV 43
>gi|83775903|dbj|BAE66022.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 633
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 30/42 (71%)
Query: 169 QNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETI 210
+ VT SYT A+W W+ WE E+DW AL+G+NL LA+ G E +
Sbjct: 2 HHTVTFSYTTAFWTWEDWELELDWAALRGVNLILAWVGYEKV 43
>gi|149054261|gb|EDM06078.1| rCG33377, isoform CRA_a [Rattus norvegicus]
Length = 178
Score = 40.4 bits (93), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 15/32 (46%), Positives = 22/32 (68%)
Query: 104 IVISGVTGVEVLAGLHWYLRYWCGSHISWDKT 135
+++ G +GV AGLH YLR +CG HI+W +
Sbjct: 73 VLVRGSSGVAAAAGLHRYLRDFCGCHIAWSSS 104
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.325 0.137 0.446
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,462,810,934
Number of Sequences: 23463169
Number of extensions: 141657064
Number of successful extensions: 384935
Number of sequences better than 100.0: 484
Number of HSP's better than 100.0 without gapping: 479
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 383973
Number of HSP's gapped (non-prelim): 505
length of query: 220
length of database: 8,064,228,071
effective HSP length: 137
effective length of query: 83
effective length of database: 9,144,741,214
effective search space: 759013520762
effective search space used: 759013520762
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 74 (33.1 bits)