BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781071|ref|YP_003065484.1| hypothetical protein CLIBASIA_04865 [Candidatus Liberibacter asiaticus str. psy62] (71 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781071|ref|YP_003065484.1| hypothetical protein CLIBASIA_04865 [Candidatus Liberibacter asiaticus str. psy62] gi|254040748|gb|ACT57544.1| hypothetical protein CLIBASIA_04865 [Candidatus Liberibacter asiaticus str. psy62] Length = 71 Score = 146 bits (369), Expect = 8e-34, Method: Compositional matrix adjust. Identities = 71/71 (100%), Positives = 71/71 (100%) Query: 1 MRWAFKALLALIACKWNLSRIIAVYNAGADQQDPADVIELIYAHVKSGEIKPSRIESAYQ 60 MRWAFKALLALIACKWNLSRIIAVYNAGADQQDPADVIELIYAHVKSGEIKPSRIESAYQ Sbjct: 1 MRWAFKALLALIACKWNLSRIIAVYNAGADQQDPADVIELIYAHVKSGEIKPSRIESAYQ 60 Query: 61 RIIYLKNKMKT 71 RIIYLKNKMKT Sbjct: 61 RIIYLKNKMKT 71 >gi|238898650|ref|YP_002924331.1| putative glycosy hydrolase family protein [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] gi|229466409|gb|ACQ68183.1| putative glycosy hydrolase family protein [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] Length = 158 Score = 72.0 bits (175), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 41/73 (56%), Positives = 51/73 (69%), Gaps = 12/73 (16%) Query: 9 LALIACKWNLSRIIAVY-NAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIE 56 ++ IA ++LS + + NAGAD Q+PADVI+LIYAHVKSGEIK SRIE Sbjct: 72 MSAIADNYSLSEALKLSINAGADMLIFSNQQSPVWQNPADVIDLIYAHVKSGEIKSSRIE 131 Query: 57 SAYQRIIYLKNKM 69 SAYQRII+LK K+ Sbjct: 132 SAYQRIIHLKKKL 144 >gi|270157239|ref|ZP_06185896.1| glycosyl hydrolase [Legionella longbeachae D-4968] gi|289164364|ref|YP_003454502.1| N-acetyl-beta-glucosaminidase [Legionella longbeachae NSW150] gi|269989264|gb|EEZ95518.1| glycosyl hydrolase [Legionella longbeachae D-4968] gi|288857537|emb|CBJ11375.1| putative N-acetyl-beta-glucosaminidase [Legionella longbeachae NSW150] Length = 380 Score = 48.9 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 27/61 (44%), Positives = 32/61 (52%), Gaps = 11/61 (18%) Query: 21 IIAVYNAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 ++ NAGAD QDP VI+LI A V SGEI RI AYQ I+ LK + Sbjct: 318 LVLAINAGADMLIFGNNLPAPPQDPKQVIDLIEAKVNSGEISQERINEAYQHIVTLKKSL 377 Query: 70 K 70 K Sbjct: 378 K 378 >gi|254495931|ref|ZP_05108839.1| glycosy hydrolase family protein [Legionella drancourtii LLAP12] gi|254354809|gb|EET13436.1| glycosy hydrolase family protein [Legionella drancourtii LLAP12] Length = 379 Score = 45.4 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 25/60 (41%), Positives = 32/60 (53%), Gaps = 11/60 (18%) Query: 21 IIAVYNAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 ++ NAGAD Q+P VI LI A V+SGEI RI+ AYQ I+ LK + Sbjct: 318 LVLAINAGADMFIFGNTLTAKAQNPEQVINLIAAKVQSGEISQQRIDEAYQHIVTLKQSL 377 >gi|304413318|ref|ZP_07394791.1| glycosyl hydrolase domain-containing hypothetical protein [Candidatus Regiella insecticola LSR1] gi|304284161|gb|EFL92554.1| glycosyl hydrolase domain-containing hypothetical protein [Candidatus Regiella insecticola LSR1] Length = 353 Score = 43.5 bits (101), Expect = 0.009, Method: Composition-based stats. Identities = 23/43 (53%), Positives = 29/43 (67%), Gaps = 1/43 (2%) Query: 30 DQQDPADVIELIY-AHVKSGEIKPSRIESAYQRIIYLKNKMKT 71 D Q+P +I+LIY A V SG+I P IE YQRI+ LK KM + Sbjct: 304 DWQNPEAIIDLIYRAVVISGKIHPDIIEDNYQRILQLKKKMTS 346 >gi|54294135|ref|YP_126550.1| hypothetical protein lpl1199 [Legionella pneumophila str. Lens] gi|53753967|emb|CAH15438.1| hypothetical protein lpl1199 [Legionella pneumophila str. Lens] Length = 382 Score = 42.7 bits (99), Expect = 0.015, Method: Composition-based stats. Identities = 22/52 (42%), Positives = 29/52 (55%), Gaps = 11/52 (21%) Query: 26 NAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLK 66 NAGAD QD ++I++I V+SGEI RI AYQRI+ +K Sbjct: 326 NAGADMLIFGNQLVEKFQDSTEIIDMIEQKVRSGEISEQRINEAYQRIVKMK 377 >gi|54297148|ref|YP_123517.1| hypothetical protein lpp1193 [Legionella pneumophila str. Paris] gi|53750933|emb|CAH12344.1| hypothetical protein lpp1193 [Legionella pneumophila str. Paris] Length = 382 Score = 42.7 bits (99), Expect = 0.015, Method: Composition-based stats. Identities = 22/52 (42%), Positives = 29/52 (55%), Gaps = 11/52 (21%) Query: 26 NAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLK 66 NAGAD QD ++I++I V+SGEI RI AYQRI+ +K Sbjct: 326 NAGADMLIFGNQLVEKFQDSTEIIDMIEQKVRSGEISEQRINEAYQRIVKMK 377 >gi|296106822|ref|YP_003618522.1| beta-N-acetylhexosaminidase [Legionella pneumophila 2300/99 Alcoy] gi|295648723|gb|ADG24570.1| beta-N-acetylhexosaminidase [Legionella pneumophila 2300/99 Alcoy] Length = 378 Score = 42.7 bits (99), Expect = 0.017, Method: Composition-based stats. Identities = 22/52 (42%), Positives = 29/52 (55%), Gaps = 11/52 (21%) Query: 26 NAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLK 66 NAGAD QD ++I++I V+SGEI RI AYQRI+ +K Sbjct: 322 NAGADMLIFGNQLVEKFQDSTEIIDMIEQKVRSGEISEQRINEAYQRIVKMK 373 >gi|148358777|ref|YP_001249984.1| glycosyl hydrolase family transporter 3 [Legionella pneumophila str. Corby] gi|148280550|gb|ABQ54638.1| glycosyl hydrolase family 3 [Legionella pneumophila str. Corby] Length = 382 Score = 42.7 bits (99), Expect = 0.017, Method: Composition-based stats. Identities = 22/52 (42%), Positives = 29/52 (55%), Gaps = 11/52 (21%) Query: 26 NAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLK 66 NAGAD QD ++I++I V+SGEI RI AYQRI+ +K Sbjct: 326 NAGADMLIFGNQLVEKFQDSTEIIDMIEQKVRSGEISEQRINEAYQRIVKMK 377 >gi|52841424|ref|YP_095223.1| glycosy hydrolase family protein [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|52628535|gb|AAU27276.1| glycosyl hydrolase family 3 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] Length = 395 Score = 42.4 bits (98), Expect = 0.019, Method: Composition-based stats. Identities = 22/52 (42%), Positives = 29/52 (55%), Gaps = 11/52 (21%) Query: 26 NAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLK 66 NAGAD QD ++I++I V+SGEI RI AYQRI+ +K Sbjct: 339 NAGADMLIFGNQLVEKFQDSTEIIDMIEQKVRSGEISEQRINEAYQRIVKMK 390 >gi|307609946|emb|CBW99474.1| hypothetical protein LPW_12471 [Legionella pneumophila 130b] Length = 378 Score = 42.4 bits (98), Expect = 0.020, Method: Composition-based stats. Identities = 22/52 (42%), Positives = 29/52 (55%), Gaps = 11/52 (21%) Query: 26 NAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLK 66 NAGAD QD ++I++I V+SGEI RI AYQRI+ +K Sbjct: 322 NAGADMLIFGNQLVEKFQDSTEIIDMIEQKVQSGEISEQRINEAYQRIVKMK 373 >gi|307150649|ref|YP_003886033.1| glycoside hydrolase family 3 domain-containing protein [Cyanothece sp. PCC 7822] gi|306980877|gb|ADN12758.1| glycoside hydrolase family 3 domain protein [Cyanothece sp. PCC 7822] Length = 533 Score = 42.4 bits (98), Expect = 0.023, Method: Composition-based stats. Identities = 23/53 (43%), Positives = 28/53 (52%), Gaps = 4/53 (7%) Query: 22 IAVYNAGAD----QQDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKMK 70 + AGAD DP IE +Y V+SG I P RI + QRII K K+K Sbjct: 291 VKAVEAGADILLMPDDPLVAIEAVYQAVQSGRISPERIAQSVQRIIEAKEKLK 343 >gi|268679756|ref|YP_003304187.1| glycoside hydrolase [Sulfurospirillum deleyianum DSM 6946] gi|268617787|gb|ACZ12152.1| glycoside hydrolase family 3 domain protein [Sulfurospirillum deleyianum DSM 6946] Length = 345 Score = 41.2 bits (95), Expect = 0.042, Method: Composition-based stats. Identities = 20/42 (47%), Positives = 26/42 (61%) Query: 28 GADQQDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 G D P V +I V+ GEI+P RIE +Y+RI+ LK KM Sbjct: 302 GDDASIPFTVQRIIMEGVRKGEIRPQRIELSYKRIMALKQKM 343 >gi|78189925|ref|YP_380263.1| glycosy hydrolase family protein [Chlorobium chlorochromatii CaD3] gi|78172124|gb|ABB29220.1| glycosyl hydrolase, family 3 [Chlorobium chlorochromatii CaD3] Length = 374 Score = 40.0 bits (92), Expect = 0.11, Method: Composition-based stats. Identities = 17/43 (39%), Positives = 28/43 (65%) Query: 26 NAGADQQDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNK 68 N D+ + +I+A ++ GEI+PSRIE +Y+RI+ LK + Sbjct: 327 NTAYDEAIAEKALAIIHALIERGEIQPSRIEESYRRIMALKQR 369 >gi|115359049|ref|YP_776187.1| Beta-glucosidase [Burkholderia ambifaria AMMD] gi|115284337|gb|ABI89853.1| Beta-glucosidase [Burkholderia ambifaria AMMD] Length = 669 Score = 37.4 bits (85), Expect = 0.64, Method: Composition-based stats. Identities = 25/74 (33%), Positives = 38/74 (51%), Gaps = 14/74 (18%) Query: 6 KALLALIACKWNLS------RIIAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRI 55 KA A IA W + R + NAG +Q DP ++EL VKSG + +R+ Sbjct: 389 KATPADIAMPWGVENLPKAERFLKALNAGVNQFGGIDDPTPIVEL----VKSGRLSETRL 444 Query: 56 ESAYQRIIYLKNKM 69 +++ RI+ LK K+ Sbjct: 445 DASVTRILELKFKL 458 >gi|159030295|emb|CAO91190.1| unnamed protein product [Microcystis aeruginosa PCC 7806] Length = 526 Score = 35.8 bits (81), Expect = 2.0, Method: Composition-based stats. Identities = 15/37 (40%), Positives = 24/37 (64%) Query: 33 DPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 DP + IE +Y+ V++G I +RI ++QRI K K+ Sbjct: 306 DPIEAIEAVYSAVQAGTISEARINDSWQRIQRAKEKL 342 >gi|163755898|ref|ZP_02163015.1| b-N-acetylglucosaminidase, glycoside hydrolase family 3 protein [Kordia algicida OT-1] gi|161324069|gb|EDP95401.1| b-N-acetylglucosaminidase, glycoside hydrolase family 3 protein [Kordia algicida OT-1] Length = 366 Score = 35.8 bits (81), Expect = 2.1, Method: Composition-based stats. Identities = 15/42 (35%), Positives = 24/42 (57%) Query: 28 GADQQDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 G D P D+I+++ + GEI RI+ +YQRI+ K + Sbjct: 325 GRDMILPQDIIDIVKKMIADGEISEKRIDESYQRILKFKKGL 366 >gi|166363718|ref|YP_001655991.1| putative beta-glucosidase [Microcystis aeruginosa NIES-843] gi|166086091|dbj|BAG00799.1| putative beta-glucosidase [Microcystis aeruginosa NIES-843] Length = 526 Score = 35.4 bits (80), Expect = 2.3, Method: Composition-based stats. Identities = 15/37 (40%), Positives = 24/37 (64%) Query: 33 DPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 DP + IE +Y+ V++G I +RI ++QRI K K+ Sbjct: 306 DPIEAIEAVYSAVQAGTISEARINDSWQRIQRAKEKL 342 >gi|290962043|ref|YP_003493225.1| glycosyl hydrolase [Streptomyces scabiei 87.22] gi|260651569|emb|CBG74693.1| putative glycosyl hydrolase [Streptomyces scabiei 87.22] Length = 611 Score = 35.0 bits (79), Expect = 3.0, Method: Composition-based stats. Identities = 15/50 (30%), Positives = 30/50 (60%) Query: 20 RIIAVYNAGADQQDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 R++ + NAGADQ +L+ V+ G + SRI+ + +R++ +K ++ Sbjct: 354 RMVKILNAGADQFGGEQCTDLLLELVRDGVVPESRIDESARRVLLIKFRL 403 >gi|256377084|ref|YP_003100744.1| xylan 1,4-beta-xylosidase [Actinosynnema mirum DSM 43827] gi|255921387|gb|ACU36898.1| Xylan 1,4-beta-xylosidase [Actinosynnema mirum DSM 43827] Length = 609 Score = 34.7 bits (78), Expect = 3.9, Method: Composition-based stats. Identities = 15/50 (30%), Positives = 32/50 (64%) Query: 20 RIIAVYNAGADQQDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 R++ V AG DQ + ++L+ V+SGE+ +RI+ + +R++ +K ++ Sbjct: 352 RMLKVLEAGCDQFGGEECVDLLLDLVRSGEVGEARIDVSARRLLLVKFRL 401 >gi|110636892|ref|YP_677099.1| b-N-acetylglucosaminidase [Cytophaga hutchinsonii ATCC 33406] gi|110279573|gb|ABG57759.1| b-N-acetylglucosaminidase, glycoside hydrolase family 3 protein [Cytophaga hutchinsonii ATCC 33406] Length = 395 Score = 34.7 bits (78), Expect = 4.1, Method: Composition-based stats. Identities = 17/42 (40%), Positives = 27/42 (64%) Query: 27 AGADQQDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNK 68 + +D+ +++ +I V SG+I SRI AY+RI+ LKNK Sbjct: 351 SASDRIKASEIHAIIKKLVLSGDIPESRINEAYERILALKNK 392 >gi|209525402|ref|ZP_03273942.1| glycoside hydrolase family 3 domain protein [Arthrospira maxima CS-328] gi|209494082|gb|EDZ94397.1| glycoside hydrolase family 3 domain protein [Arthrospira maxima CS-328] Length = 524 Score = 34.7 bits (78), Expect = 4.4, Method: Composition-based stats. Identities = 23/57 (40%), Positives = 31/57 (54%), Gaps = 5/57 (8%) Query: 19 SRIIAVYNAGAD----QQDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKMKT 71 S ++AV AGAD +DP I I V++GEI P RI ++ RI K K+ T Sbjct: 284 SSVLAV-KAGADILLMPEDPEVTIRAIVQAVENGEISPERIAASCDRINKAKEKIST 339 >gi|156048580|ref|XP_001590257.1| hypothetical protein SS1G_09021 [Sclerotinia sclerotiorum 1980] gi|154693418|gb|EDN93156.1| hypothetical protein SS1G_09021 [Sclerotinia sclerotiorum 1980 UF-70] Length = 553 Score = 33.9 bits (76), Expect = 6.8, Method: Composition-based stats. Identities = 17/31 (54%), Positives = 20/31 (64%) Query: 38 IELIYAHVKSGEIKPSRIESAYQRIIYLKNK 68 IE + A VKSGEI IES+ R+I LK K Sbjct: 327 IEAVIAAVKSGEISQEMIESSVNRVIRLKTK 357 >gi|284052770|ref|ZP_06382980.1| glycoside hydrolase family 3 domain protein [Arthrospira platensis str. Paraca] Length = 517 Score = 33.9 bits (76), Expect = 7.9, Method: Composition-based stats. Identities = 22/57 (38%), Positives = 32/57 (56%), Gaps = 5/57 (8%) Query: 19 SRIIAVYNAGAD----QQDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKMKT 71 S ++AV AGAD +DP I+ + V++GEI P RI ++ RI K K+ T Sbjct: 274 SSVLAV-KAGADILLMPEDPEITIKAVCQAVENGEISPERIAASCDRINKAKEKIAT 329 >gi|291565862|dbj|BAI88134.1| glycoside hydrolase, family 3 [Arthrospira platensis NIES-39] Length = 527 Score = 33.9 bits (76), Expect = 8.1, Method: Composition-based stats. Identities = 22/57 (38%), Positives = 32/57 (56%), Gaps = 5/57 (8%) Query: 19 SRIIAVYNAGAD----QQDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKMKT 71 S ++AV AGAD +DP I+ + V++GEI P RI ++ RI K K+ T Sbjct: 284 SSVLAV-KAGADILLMPEDPEITIKAVCQAVENGEISPERIAASCDRINKAKEKIAT 339 Searching..................................................done Results from round 2 >gi|254781071|ref|YP_003065484.1| hypothetical protein CLIBASIA_04865 [Candidatus Liberibacter asiaticus str. psy62] gi|254040748|gb|ACT57544.1| hypothetical protein CLIBASIA_04865 [Candidatus Liberibacter asiaticus str. psy62] Length = 71 Score = 111 bits (278), Expect = 3e-23, Method: Composition-based stats. Identities = 71/71 (100%), Positives = 71/71 (100%) Query: 1 MRWAFKALLALIACKWNLSRIIAVYNAGADQQDPADVIELIYAHVKSGEIKPSRIESAYQ 60 MRWAFKALLALIACKWNLSRIIAVYNAGADQQDPADVIELIYAHVKSGEIKPSRIESAYQ Sbjct: 1 MRWAFKALLALIACKWNLSRIIAVYNAGADQQDPADVIELIYAHVKSGEIKPSRIESAYQ 60 Query: 61 RIIYLKNKMKT 71 RIIYLKNKMKT Sbjct: 61 RIIYLKNKMKT 71 >gi|238898650|ref|YP_002924331.1| putative glycosy hydrolase family protein [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] gi|229466409|gb|ACQ68183.1| putative glycosy hydrolase family protein [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] Length = 158 Score = 80.5 bits (197), Expect = 6e-14, Method: Composition-based stats. Identities = 41/73 (56%), Positives = 51/73 (69%), Gaps = 12/73 (16%) Query: 9 LALIACKWNLSRIIAV-YNAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIE 56 ++ IA ++LS + + NAGAD Q+PADVI+LIYAHVKSGEIK SRIE Sbjct: 72 MSAIADNYSLSEALKLSINAGADMLIFSNQQSPVWQNPADVIDLIYAHVKSGEIKSSRIE 131 Query: 57 SAYQRIIYLKNKM 69 SAYQRII+LK K+ Sbjct: 132 SAYQRIIHLKKKL 144 >gi|270157239|ref|ZP_06185896.1| glycosyl hydrolase [Legionella longbeachae D-4968] gi|289164364|ref|YP_003454502.1| N-acetyl-beta-glucosaminidase [Legionella longbeachae NSW150] gi|269989264|gb|EEZ95518.1| glycosyl hydrolase [Legionella longbeachae D-4968] gi|288857537|emb|CBJ11375.1| putative N-acetyl-beta-glucosaminidase [Legionella longbeachae NSW150] Length = 380 Score = 80.1 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 29/74 (39%), Positives = 38/74 (51%), Gaps = 12/74 (16%) Query: 9 LALIACKWNLSR-IIAVYNAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIE 56 + I+ + L + ++ NAGAD QDP VI+LI A V SGEI RI Sbjct: 305 MKAISDNYGLEQALVLAINAGADMLIFGNNLPAPPQDPKQVIDLIEAKVNSGEISQERIN 364 Query: 57 SAYQRIIYLKNKMK 70 AYQ I+ LK +K Sbjct: 365 EAYQHIVTLKKSLK 378 >gi|254495931|ref|ZP_05108839.1| glycosy hydrolase family protein [Legionella drancourtii LLAP12] gi|254354809|gb|EET13436.1| glycosy hydrolase family protein [Legionella drancourtii LLAP12] Length = 379 Score = 66.2 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 27/73 (36%), Positives = 37/73 (50%), Gaps = 12/73 (16%) Query: 9 LALIACKWNLSR-IIAVYNAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIE 56 + I+ + L ++ NAGAD Q+P VI LI A V+SGEI RI+ Sbjct: 305 MKAISEHYGLDEALVLAINAGADMFIFGNTLTAKAQNPEQVINLIAAKVQSGEISQQRID 364 Query: 57 SAYQRIIYLKNKM 69 AYQ I+ LK + Sbjct: 365 EAYQHIVTLKQSL 377 >gi|54294135|ref|YP_126550.1| hypothetical protein lpl1199 [Legionella pneumophila str. Lens] gi|53753967|emb|CAH15438.1| hypothetical protein lpl1199 [Legionella pneumophila str. Lens] Length = 382 Score = 56.6 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 24/73 (32%), Positives = 35/73 (47%), Gaps = 12/73 (16%) Query: 9 LALIACKWNLSRIIAV-YNAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIE 56 + I + L + + NAGAD QD ++I++I V+SGEI RI Sbjct: 308 MKAITNYYGLETAVTLSINAGADMLIFGNQLVEKFQDSTEIIDMIEQKVRSGEISEQRIN 367 Query: 57 SAYQRIIYLKNKM 69 AYQRI+ +K Sbjct: 368 EAYQRIVKMKRSF 380 >gi|54297148|ref|YP_123517.1| hypothetical protein lpp1193 [Legionella pneumophila str. Paris] gi|53750933|emb|CAH12344.1| hypothetical protein lpp1193 [Legionella pneumophila str. Paris] Length = 382 Score = 56.2 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 24/73 (32%), Positives = 35/73 (47%), Gaps = 12/73 (16%) Query: 9 LALIACKWNLSRIIAV-YNAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIE 56 + I + L + + NAGAD QD ++I++I V+SGEI RI Sbjct: 308 MKAITNYYGLETAVTLSINAGADMLIFGNQLVEKFQDSTEIIDMIEQKVRSGEISEQRIN 367 Query: 57 SAYQRIIYLKNKM 69 AYQRI+ +K Sbjct: 368 EAYQRIVKMKRSF 380 >gi|296106822|ref|YP_003618522.1| beta-N-acetylhexosaminidase [Legionella pneumophila 2300/99 Alcoy] gi|295648723|gb|ADG24570.1| beta-N-acetylhexosaminidase [Legionella pneumophila 2300/99 Alcoy] Length = 378 Score = 56.2 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 24/73 (32%), Positives = 35/73 (47%), Gaps = 12/73 (16%) Query: 9 LALIACKWNLSRIIAV-YNAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIE 56 + I + L + + NAGAD QD ++I++I V+SGEI RI Sbjct: 304 MKAITNYYGLETAVTLSINAGADMLIFGNQLVEKFQDSTEIIDMIEQKVRSGEISEQRIN 363 Query: 57 SAYQRIIYLKNKM 69 AYQRI+ +K Sbjct: 364 EAYQRIVKMKRSF 376 >gi|307609946|emb|CBW99474.1| hypothetical protein LPW_12471 [Legionella pneumophila 130b] Length = 378 Score = 55.8 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 24/73 (32%), Positives = 35/73 (47%), Gaps = 12/73 (16%) Query: 9 LALIACKWNLSRIIAV-YNAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIE 56 + I + L + + NAGAD QD ++I++I V+SGEI RI Sbjct: 304 MKAITNYYGLETAVTLSINAGADMLIFGNQLVEKFQDSTEIIDMIEQKVQSGEISEQRIN 363 Query: 57 SAYQRIIYLKNKM 69 AYQRI+ +K Sbjct: 364 EAYQRIVKMKRSF 376 >gi|148358777|ref|YP_001249984.1| glycosyl hydrolase family transporter 3 [Legionella pneumophila str. Corby] gi|148280550|gb|ABQ54638.1| glycosyl hydrolase family 3 [Legionella pneumophila str. Corby] Length = 382 Score = 55.8 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 24/73 (32%), Positives = 35/73 (47%), Gaps = 12/73 (16%) Query: 9 LALIACKWNLSRIIAV-YNAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIE 56 + I + L + + NAGAD QD ++I++I V+SGEI RI Sbjct: 308 MKAITNYYGLETAVTLSINAGADMLIFGNQLVEKFQDSTEIIDMIEQKVRSGEISEQRIN 367 Query: 57 SAYQRIIYLKNKM 69 AYQRI+ +K Sbjct: 368 EAYQRIVKMKRSF 380 >gi|52841424|ref|YP_095223.1| glycosy hydrolase family protein [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|52628535|gb|AAU27276.1| glycosyl hydrolase family 3 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] Length = 395 Score = 55.8 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 24/73 (32%), Positives = 35/73 (47%), Gaps = 12/73 (16%) Query: 9 LALIACKWNLSRIIAV-YNAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIE 56 + I + L + + NAGAD QD ++I++I V+SGEI RI Sbjct: 321 MKAITNYYGLETAVTLSINAGADMLIFGNQLVEKFQDSTEIIDMIEQKVRSGEISEQRIN 380 Query: 57 SAYQRIIYLKNKM 69 AYQRI+ +K Sbjct: 381 EAYQRIVKMKRSF 393 >gi|304413318|ref|ZP_07394791.1| glycosyl hydrolase domain-containing hypothetical protein [Candidatus Regiella insecticola LSR1] gi|304284161|gb|EFL92554.1| glycosyl hydrolase domain-containing hypothetical protein [Candidatus Regiella insecticola LSR1] Length = 353 Score = 48.5 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 27/81 (33%), Positives = 37/81 (45%), Gaps = 18/81 (22%) Query: 9 LALIACKWNLSRII-AVYNAGADQ----------------QDPADVIELIY-AHVKSGEI 50 + I + L + NAG + Q+P +I+LIY A V SG+I Sbjct: 266 MKAIQDNYTLEEALELSINAGVNMLIFGHPSVSNQPAEDWQNPEAIIDLIYRAVVISGKI 325 Query: 51 KPSRIESAYQRIIYLKNKMKT 71 P IE YQRI+ LK KM + Sbjct: 326 HPDIIEDNYQRILQLKKKMTS 346 >gi|254495607|ref|ZP_05108529.1| glycosyl hydrolase [Legionella drancourtii LLAP12] gi|254355177|gb|EET13790.1| glycosyl hydrolase [Legionella drancourtii LLAP12] Length = 358 Score = 45.8 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 10/71 (14%) Query: 9 LALIACKWNLSRIIAV-YNAGADQQ---------DPADVIELIYAHVKSGEIKPSRIESA 58 + IA ++L + + NAGAD +VIE I V +I P RIE A Sbjct: 281 MQAIADHYSLDEALRLTINAGADMIIFANQLDTITAPEVIERIECLVLEHKIDPHRIEEA 340 Query: 59 YQRIIYLKNKM 69 Y+R+I LK ++ Sbjct: 341 YRRVIRLKQQI 351 >gi|307609676|emb|CBW99184.1| hypothetical protein LPW_09671 [Legionella pneumophila 130b] Length = 358 Score = 45.0 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 20/71 (28%), Positives = 34/71 (47%), Gaps = 10/71 (14%) Query: 9 LALIACKWNLSRII-AVYNAGADQQ---------DPADVIELIYAHVKSGEIKPSRIESA 58 + I+ ++L + NAGAD +VI++I V +I RI+ A Sbjct: 280 MHAISNHYSLEEALCLTINAGADMVIFANQLGTITAPEVIDVIEKLVIDKQISSQRIDEA 339 Query: 59 YQRIIYLKNKM 69 Y+RI+ LK ++ Sbjct: 340 YRRIVRLKQQI 350 >gi|54293860|ref|YP_126275.1| hypothetical protein lpl0916 [Legionella pneumophila str. Lens] gi|53753692|emb|CAH15150.1| hypothetical protein lpl0916 [Legionella pneumophila str. Lens] Length = 358 Score = 45.0 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 20/71 (28%), Positives = 34/71 (47%), Gaps = 10/71 (14%) Query: 9 LALIACKWNLSRII-AVYNAGADQQ---------DPADVIELIYAHVKSGEIKPSRIESA 58 + I+ ++L + NAGAD +VI++I V +I RI+ A Sbjct: 280 MHAISNHYSLEEALCLTINAGADMVIFANQLGTITAPEVIDVIEKLVIDKQISSQRIDEA 339 Query: 59 YQRIIYLKNKM 69 Y+RI+ LK ++ Sbjct: 340 YRRIVRLKQQI 350 >gi|52841120|ref|YP_094919.1| glycosyl hydrolase [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|54296905|ref|YP_123274.1| hypothetical protein lpp0946 [Legionella pneumophila str. Paris] gi|52628231|gb|AAU26972.1| glycosyl hydrolase [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|53750690|emb|CAH12097.1| hypothetical protein lpp0946 [Legionella pneumophila str. Paris] Length = 358 Score = 45.0 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 21/71 (29%), Positives = 35/71 (49%), Gaps = 10/71 (14%) Query: 9 LALIACKWNLSRII-AVYNAGADQQ---------DPADVIELIYAHVKSGEIKPSRIESA 58 + I+ ++L + NAGAD P +VI++I V +I RI+ A Sbjct: 280 MHAISNHYSLEDALCLTINAGADMVIFANQLGTITPPEVIDVIEKLVIDKQIPYQRIDEA 339 Query: 59 YQRIIYLKNKM 69 Y+RI+ LK ++ Sbjct: 340 YRRIVRLKQQI 350 >gi|307150649|ref|YP_003886033.1| glycoside hydrolase family 3 domain-containing protein [Cyanothece sp. PCC 7822] gi|306980877|gb|ADN12758.1| glycoside hydrolase family 3 domain protein [Cyanothece sp. PCC 7822] Length = 533 Score = 44.2 bits (103), Expect = 0.005, Method: Composition-based stats. Identities = 23/54 (42%), Positives = 28/54 (51%), Gaps = 4/54 (7%) Query: 21 IIAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKMK 70 + AGAD DP IE +Y V+SG I P RI + QRII K K+K Sbjct: 290 AVKAVEAGADILLMPDDPLVAIEAVYQAVQSGRISPERIAQSVQRIIEAKEKLK 343 >gi|148360469|ref|YP_001251676.1| glycosyl hydrolase [Legionella pneumophila str. Corby] gi|296106464|ref|YP_003618164.1| beta-N-acetylhexosaminidase [Legionella pneumophila 2300/99 Alcoy] gi|148282242|gb|ABQ56330.1| glycosyl hydrolase [Legionella pneumophila str. Corby] gi|295648365|gb|ADG24212.1| beta-N-acetylhexosaminidase [Legionella pneumophila 2300/99 Alcoy] Length = 358 Score = 44.2 bits (103), Expect = 0.006, Method: Composition-based stats. Identities = 20/71 (28%), Positives = 34/71 (47%), Gaps = 10/71 (14%) Query: 9 LALIACKWNLSRII-AVYNAGADQQ---------DPADVIELIYAHVKSGEIKPSRIESA 58 + I+ ++L + NAGAD +VI++I V +I RI+ A Sbjct: 280 MHAISNHYSLEEALCLTINAGADMVIFANQLGTITAPEVIDVIEKLVIDKQIPSQRIDEA 339 Query: 59 YQRIIYLKNKM 69 Y+RI+ LK ++ Sbjct: 340 YRRIVRLKQQI 350 >gi|163755898|ref|ZP_02163015.1| b-N-acetylglucosaminidase, glycoside hydrolase family 3 protein [Kordia algicida OT-1] gi|161324069|gb|EDP95401.1| b-N-acetylglucosaminidase, glycoside hydrolase family 3 protein [Kordia algicida OT-1] Length = 366 Score = 41.9 bits (97), Expect = 0.025, Method: Composition-based stats. Identities = 15/42 (35%), Positives = 24/42 (57%) Query: 28 GADQQDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 G D P D+I+++ + GEI RI+ +YQRI+ K + Sbjct: 325 GRDMILPQDIIDIVKKMIADGEISEKRIDESYQRILKFKKGL 366 >gi|295094525|emb|CBK83616.1| Beta-glucosidase-related glycosidases [Coprococcus sp. ART55/1] Length = 416 Score = 41.6 bits (96), Expect = 0.034, Method: Composition-based stats. Identities = 17/65 (26%), Positives = 30/65 (46%), Gaps = 5/65 (7%) Query: 9 LALIACKWNLSRIIAVY-NAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRII 63 + IA +++ + + AG D +D A + I V GE+ RI+ + +RI+ Sbjct: 347 MESIADNYSVDDAVVMSVKAGMDMILQPKDMASAVNSIEQAVADGELSEDRIDESVRRIL 406 Query: 64 YLKNK 68 LK Sbjct: 407 TLKES 411 >gi|209525402|ref|ZP_03273942.1| glycoside hydrolase family 3 domain protein [Arthrospira maxima CS-328] gi|209494082|gb|EDZ94397.1| glycoside hydrolase family 3 domain protein [Arthrospira maxima CS-328] Length = 524 Score = 41.6 bits (96), Expect = 0.039, Method: Composition-based stats. Identities = 24/69 (34%), Positives = 35/69 (50%), Gaps = 5/69 (7%) Query: 8 LLALIACKWNL-SRIIAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRI 62 ++ IA ++L S + AGAD +DP I I V++GEI P RI ++ RI Sbjct: 271 VMGAIARGYSLASSSVLAVKAGADILLMPEDPEVTIRAIVQAVENGEISPERIAASCDRI 330 Query: 63 IYLKNKMKT 71 K K+ T Sbjct: 331 NKAKEKIST 339 >gi|270159405|ref|ZP_06188061.1| glycosyl hydrolase family 3 protein [Legionella longbeachae D-4968] gi|289165783|ref|YP_003455921.1| glycosyl hydrolase [Legionella longbeachae NSW150] gi|269987744|gb|EEZ93999.1| glycosyl hydrolase family 3 protein [Legionella longbeachae D-4968] gi|288858956|emb|CBJ12882.1| putative glycosyl hydrolase [Legionella longbeachae NSW150] Length = 357 Score = 41.2 bits (95), Expect = 0.049, Method: Composition-based stats. Identities = 22/71 (30%), Positives = 34/71 (47%), Gaps = 10/71 (14%) Query: 9 LALIACKWNLSRII-AVYNAGADQQ---------DPADVIELIYAHVKSGEIKPSRIESA 58 + I ++L + NAGAD ++IE+I V +I+ RIE A Sbjct: 280 MQAITDHYSLEDALCLTINAGADMIIFANQLAQITAPEIIEIIERLVLEQKIEYQRIEDA 339 Query: 59 YQRIIYLKNKM 69 Y+RII LK ++ Sbjct: 340 YRRIIRLKQQI 350 >gi|254457746|ref|ZP_05071174.1| B-N-acetylglucosaminidase, glycoside hydrolase family 3 protein [Campylobacterales bacterium GD 1] gi|207086538|gb|EDZ63822.1| B-N-acetylglucosaminidase, glycoside hydrolase family 3 protein [Campylobacterales bacterium GD 1] Length = 555 Score = 40.8 bits (94), Expect = 0.060, Method: Composition-based stats. Identities = 22/70 (31%), Positives = 32/70 (45%), Gaps = 10/70 (14%) Query: 9 LALIACKWNLSRII-AVYNAGADQ---------QDPADVIELIYAHVKSGEIKPSRIESA 58 + I +NL + N+G D D +++E+IYA VKSG I RI + Sbjct: 287 MKAITDHYNLKESVTLAINSGVDILLFGNQLASNDVKELVEIIYAQVKSGAISKKRIIES 346 Query: 59 YQRIIYLKNK 68 +RI L K Sbjct: 347 NRRIENLHTK 356 >gi|291333723|gb|ADD93410.1| b N acetylglucosaminidase glycoside hydrolase family 3 protein [uncultured marine bacterium MedDCM-OCT-S04-C102] Length = 262 Score = 40.4 bits (93), Expect = 0.073, Method: Composition-based stats. Identities = 20/74 (27%), Positives = 33/74 (44%), Gaps = 13/74 (17%) Query: 9 LALIACKWNLSR-IIAVYNAGAD------------QQDPADVIELIYAHVKSGEIKPSRI 55 + I + L ++ NAG D P +VI +I V++GE+ RI Sbjct: 185 MGAITRNFGLKESVVYAINAGVDVLIFSNNQVYKDLVYPEEVINIIEEGVRNGEVSLLRI 244 Query: 56 ESAYQRIIYLKNKM 69 +++RI LK K+ Sbjct: 245 NESFRRIQNLKKKI 258 >gi|284052770|ref|ZP_06382980.1| glycoside hydrolase family 3 domain protein [Arthrospira platensis str. Paraca] Length = 517 Score = 40.4 bits (93), Expect = 0.083, Method: Composition-based stats. Identities = 23/69 (33%), Positives = 36/69 (52%), Gaps = 5/69 (7%) Query: 8 LLALIACKWNL-SRIIAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRI 62 ++ IA ++L S + AGAD +DP I+ + V++GEI P RI ++ RI Sbjct: 261 VMGAIARGYSLASSSVLAVKAGADILLMPEDPEITIKAVCQAVENGEISPERIAASCDRI 320 Query: 63 IYLKNKMKT 71 K K+ T Sbjct: 321 NKAKEKIAT 329 >gi|218438839|ref|YP_002377168.1| glycoside hydrolase [Cyanothece sp. PCC 7424] gi|218171567|gb|ACK70300.1| glycoside hydrolase family 3 domain protein [Cyanothece sp. PCC 7424] Length = 537 Score = 40.4 bits (93), Expect = 0.091, Method: Composition-based stats. Identities = 20/53 (37%), Positives = 25/53 (47%), Gaps = 4/53 (7%) Query: 21 IIAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 + AG D +DP IE IY V+SG I RI + QRI K K+ Sbjct: 290 AVKAVEAGTDILLMPKDPVVAIEAIYQAVESGRISQERIAESAQRIWRAKQKL 342 >gi|21672931|ref|NP_660996.1| glycosy hydrolase family protein [Chlorobium tepidum TLS] gi|21645987|gb|AAM71338.1| glycosyl hydrolase, family 3 [Chlorobium tepidum TLS] Length = 372 Score = 40.4 bits (93), Expect = 0.091, Method: Composition-based stats. Identities = 21/72 (29%), Positives = 32/72 (44%), Gaps = 12/72 (16%) Query: 9 LALIACKWNLSRII-AVYNAGADQ--------QDP---ADVIELIYAHVKSGEIKPSRIE 56 + IA ++ L I +AG D DP + +I V+ G I P RI Sbjct: 295 MKAIADRYGLEEAIRLAIDAGVDVLIFGNNVSYDPEIASKATSIIRHLVEKGAISPERIN 354 Query: 57 SAYQRIIYLKNK 68 +Y+RI+ LK + Sbjct: 355 ESYRRIMTLKTR 366 >gi|291565862|dbj|BAI88134.1| glycoside hydrolase, family 3 [Arthrospira platensis NIES-39] Length = 527 Score = 40.4 bits (93), Expect = 0.094, Method: Composition-based stats. Identities = 23/69 (33%), Positives = 36/69 (52%), Gaps = 5/69 (7%) Query: 8 LLALIACKWNL-SRIIAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRI 62 ++ IA ++L S + AGAD +DP I+ + V++GEI P RI ++ RI Sbjct: 271 VMGAIARGYSLASSSVLAVKAGADILLMPEDPEITIKAVCQAVENGEISPERIAASCDRI 330 Query: 63 IYLKNKMKT 71 K K+ T Sbjct: 331 NKAKEKIAT 339 >gi|307353677|ref|YP_003894728.1| glycoside hydrolase family 3 domain-containing protein [Methanoplanus petrolearius DSM 11571] gi|307156910|gb|ADN36290.1| glycoside hydrolase family 3 domain protein [Methanoplanus petrolearius DSM 11571] Length = 399 Score = 40.0 bits (92), Expect = 0.094, Method: Composition-based stats. Identities = 20/72 (27%), Positives = 29/72 (40%), Gaps = 12/72 (16%) Query: 9 LALIACKWNLSRII-AVYNAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIE 56 + I+ + + + NAG D + + LI V GEI RI Sbjct: 323 MGAISDNYGIKEALNLSINAGCDIILFANNIVYDERIAENATGLIKELVLDGEIPEERIN 382 Query: 57 SAYQRIIYLKNK 68 +Y+RII LK K Sbjct: 383 ESYERIIRLKMK 394 >gi|257054254|ref|YP_003132086.1| beta-glucosidase-like glycosyl hydrolase [Saccharomonospora viridis DSM 43017] gi|256584126|gb|ACU95259.1| beta-glucosidase-like glycosyl hydrolase [Saccharomonospora viridis DSM 43017] Length = 383 Score = 40.0 bits (92), Expect = 0.11, Method: Composition-based stats. Identities = 13/65 (20%), Positives = 28/65 (43%), Gaps = 7/65 (10%) Query: 9 LALIACKWNLSRIIA-VYNAGADQQ------DPADVIELIYAHVKSGEIKPSRIESAYQR 61 + + + L + +GADQ D V++ + A + G++ R++ A R Sbjct: 315 MRAVTDNYTLDEAVLLALQSGADQPLWSSGGDVGPVLDKLEAAMADGQLSQERVDEALTR 374 Query: 62 IIYLK 66 ++ K Sbjct: 375 VLTAK 379 >gi|220906275|ref|YP_002481586.1| Beta-N-acetylhexosaminidase [Cyanothece sp. PCC 7425] gi|219862886|gb|ACL43225.1| Beta-N-acetylhexosaminidase [Cyanothece sp. PCC 7425] Length = 552 Score = 39.6 bits (91), Expect = 0.15, Method: Composition-based stats. Identities = 23/67 (34%), Positives = 36/67 (53%), Gaps = 5/67 (7%) Query: 8 LLALIACKWNLSRI-IAVYNAGADQQ----DPADVIELIYAHVKSGEIKPSRIESAYQRI 62 ++ IA ++ L+ + AGAD DPA I+ I A V+SG I RI+++ +RI Sbjct: 280 VMGAIANRYGLNEAPLMALEAGADILLMPLDPAGAIQAICAAVESGRITVDRIKTSVERI 339 Query: 63 IYLKNKM 69 K K+ Sbjct: 340 WRAKQKV 346 >gi|78189925|ref|YP_380263.1| glycosy hydrolase family protein [Chlorobium chlorochromatii CaD3] gi|78172124|gb|ABB29220.1| glycosyl hydrolase, family 3 [Chlorobium chlorochromatii CaD3] Length = 374 Score = 39.2 bits (90), Expect = 0.19, Method: Composition-based stats. Identities = 23/72 (31%), Positives = 35/72 (48%), Gaps = 12/72 (16%) Query: 9 LALIACKWNLSRII-AVYNAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIE 56 + IA + L I NAG D + +I+A ++ GEI+PSRIE Sbjct: 298 MGAIAAHYGLESAIRLALNAGVDILLFGNNTAYDEAIAEKALAIIHALIERGEIQPSRIE 357 Query: 57 SAYQRIIYLKNK 68 +Y+RI+ LK + Sbjct: 358 ESYRRIMALKQR 369 >gi|332982104|ref|YP_004463545.1| glycoside hydrolase family 3 domain-containing protein [Mahella australiensis 50-1 BON] gi|332699782|gb|AEE96723.1| glycoside hydrolase family 3 domain protein [Mahella australiensis 50-1 BON] Length = 471 Score = 38.5 bits (88), Expect = 0.30, Method: Composition-based stats. Identities = 19/68 (27%), Positives = 30/68 (44%), Gaps = 8/68 (11%) Query: 9 LALIACKWNLSRI-IAVYNAGADQ-------QDPADVIELIYAHVKSGEIKPSRIESAYQ 60 + I +N+ + AGAD ++ + IE I V G I RI+ + + Sbjct: 371 MGAIVKHYNIGDAAVKAIEAGADIILVCHTYKNQIEAIEAISEAVNDGRISQQRIDQSVR 430 Query: 61 RIIYLKNK 68 RI+ LK K Sbjct: 431 RIVMLKQK 438 >gi|119486755|ref|ZP_01620730.1| beta-glucosidase [Lyngbya sp. PCC 8106] gi|119456048|gb|EAW37181.1| beta-glucosidase [Lyngbya sp. PCC 8106] Length = 535 Score = 38.1 bits (87), Expect = 0.39, Method: Composition-based stats. Identities = 22/67 (32%), Positives = 31/67 (46%), Gaps = 5/67 (7%) Query: 8 LLALIACKWNLSRI-IAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRI 62 ++ IA +++ + AGAD DP I + V SG I P RI S+ QRI Sbjct: 272 VMGAIANQYSPEEAPVLAVEAGADIILMPVDPEVAIRSVCDAVVSGRISPERIRSSVQRI 331 Query: 63 IYLKNKM 69 K K+ Sbjct: 332 CSAKAKV 338 >gi|327405500|ref|YP_004346338.1| beta-N-acetylhexosaminidase [Fluviicola taffensis DSM 16823] gi|327321008|gb|AEA45500.1| Beta-N-acetylhexosaminidase [Fluviicola taffensis DSM 16823] Length = 985 Score = 38.1 bits (87), Expect = 0.42, Method: Composition-based stats. Identities = 17/63 (26%), Positives = 33/63 (52%), Gaps = 5/63 (7%) Query: 9 LALIACKWNLSRII-AVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRII 63 + ++ K+ S ++ Y AG D ++ D I+LI++ V+SGE+ I +R++ Sbjct: 315 MKAVSDKYGKSEVVAKAYIAGCDILLFPENVEDAIKLIHSKVESGELTKEVINEHCKRVL 374 Query: 64 YLK 66 K Sbjct: 375 RAK 377 >gi|163814622|ref|ZP_02206011.1| hypothetical protein COPEUT_00773 [Coprococcus eutactus ATCC 27759] gi|158450257|gb|EDP27252.1| hypothetical protein COPEUT_00773 [Coprococcus eutactus ATCC 27759] Length = 416 Score = 37.7 bits (86), Expect = 0.48, Method: Composition-based stats. Identities = 18/63 (28%), Positives = 32/63 (50%), Gaps = 5/63 (7%) Query: 9 LALIACKWNLSR-IIAVYNAGADQQ-DPADV---IELIYAHVKSGEIKPSRIESAYQRII 63 + IA + ++ + AG D PAD+ + I V++G+I RI+ + +RI+ Sbjct: 347 MESIADNYGVADSAVMAVQAGMDMLLQPADLAVAVNAIVTAVQNGDITEPRIDESVRRIL 406 Query: 64 YLK 66 LK Sbjct: 407 TLK 409 >gi|206896505|ref|YP_002246485.1| beta-N-Acetylglucosaminidase [Coprothermobacter proteolyticus DSM 5265] gi|206739122|gb|ACI18200.1| beta-N-Acetylglucosaminidase [Coprothermobacter proteolyticus DSM 5265] Length = 392 Score = 37.7 bits (86), Expect = 0.49, Method: Composition-based stats. Identities = 17/67 (25%), Positives = 27/67 (40%), Gaps = 8/67 (11%) Query: 9 LALIACKWNLSRI-IAVYNAGADQ-------QDPADVIELIYAHVKSGEIKPSRIESAYQ 60 + I +++ I AGAD + I + VK+G+I RI + + Sbjct: 292 MEAITKHYSVGDAAIKAVQAGADMVLICHSLDEQKQAINALVHAVKTGQISEERINESIK 351 Query: 61 RIIYLKN 67 RI LK Sbjct: 352 RIAMLKR 358 >gi|78357601|ref|YP_389050.1| Beta-N-acetylhexosaminidase [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78220006|gb|ABB39355.1| Beta-N-acetylhexosaminidase [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 398 Score = 37.7 bits (86), Expect = 0.49, Method: Composition-based stats. Identities = 18/72 (25%), Positives = 34/72 (47%), Gaps = 12/72 (16%) Query: 9 LALIACKWNLSRIIA-VYNAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIE 56 + I ++ L + +A NAG D +V+++I + V+ G + SRIE Sbjct: 309 MKAITDRYGLEQAVALALNAGVDILLFGNNLTYDADIVPEVVDMIESLVERGVVPRSRIE 368 Query: 57 SAYQRIIYLKNK 68 ++ R++ LK Sbjct: 369 ESFARVLRLKES 380 >gi|16331026|ref|NP_441754.1| beta-glucosidase [Synechocystis sp. PCC 6803] gi|1653521|dbj|BAA18434.1| beta-glucosidase [Synechocystis sp. PCC 6803] Length = 538 Score = 37.7 bits (86), Expect = 0.57, Method: Composition-based stats. Identities = 18/53 (33%), Positives = 25/53 (47%), Gaps = 4/53 (7%) Query: 21 IIAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 + AG D DP VI I V+SG + RIE + QR++ K K+ Sbjct: 292 AVRALEAGVDILLMPPDPVTVIAAIAEAVESGRLTEERIEQSLQRVLTAKEKL 344 >gi|261416143|ref|YP_003249826.1| glycoside hydrolase family 3 domain protein [Fibrobacter succinogenes subsp. succinogenes S85] gi|261372599|gb|ACX75344.1| glycoside hydrolase family 3 domain protein [Fibrobacter succinogenes subsp. succinogenes S85] gi|302327825|gb|ADL27026.1| glycosyl hydrolase, family 3 [Fibrobacter succinogenes subsp. succinogenes S85] Length = 384 Score = 37.7 bits (86), Expect = 0.61, Method: Composition-based stats. Identities = 19/65 (29%), Positives = 32/65 (49%), Gaps = 5/65 (7%) Query: 9 LALIACKWNLSRI-IAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRII 63 + I +++ + I AG D ++ V + + VK GEIK SRI+ + +RI+ Sbjct: 319 MGAITKQFSNAEAAIKSIQAGVDVVLCSREFTQVFDAVVKAVKKGEIKESRIDESVKRIL 378 Query: 64 YLKNK 68 LK Sbjct: 379 KLKKS 383 >gi|290962043|ref|YP_003493225.1| glycosyl hydrolase [Streptomyces scabiei 87.22] gi|260651569|emb|CBG74693.1| putative glycosyl hydrolase [Streptomyces scabiei 87.22] Length = 611 Score = 37.3 bits (85), Expect = 0.61, Method: Composition-based stats. Identities = 15/50 (30%), Positives = 30/50 (60%) Query: 20 RIIAVYNAGADQQDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 R++ + NAGADQ +L+ V+ G + SRI+ + +R++ +K ++ Sbjct: 354 RMVKILNAGADQFGGEQCTDLLLELVRDGVVPESRIDESARRVLLIKFRL 403 >gi|298529404|ref|ZP_07016807.1| glycoside hydrolase family 3 domain protein [Desulfonatronospira thiodismutans ASO3-1] gi|298510840|gb|EFI34743.1| glycoside hydrolase family 3 domain protein [Desulfonatronospira thiodismutans ASO3-1] Length = 372 Score = 37.3 bits (85), Expect = 0.73, Method: Composition-based stats. Identities = 21/74 (28%), Positives = 37/74 (50%), Gaps = 12/74 (16%) Query: 9 LALIACKWNLSRII-AVYNAGADQ----------QDPA-DVIELIYAHVKSGEIKPSRIE 56 + I ++ L + AGAD QD A ++I V++G+I RI+ Sbjct: 299 MGAIHDEYGLETALEQTIKAGADIIIFGNNLVYDQDIAWKARDIILDLVRAGQIPRERID 358 Query: 57 SAYQRIIYLKNKMK 70 +Y+RI+ LK+K++ Sbjct: 359 ESYERIMQLKSKLQ 372 >gi|291557973|emb|CBL35090.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum V10Sc8a] Length = 397 Score = 37.3 bits (85), Expect = 0.73, Method: Composition-based stats. Identities = 19/67 (28%), Positives = 29/67 (43%), Gaps = 5/67 (7%) Query: 9 LALIACKWNLS-RIIAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRII 63 + +A + + AGAD + + V SGEI SRIE + +R++ Sbjct: 329 MGAVADSYTSDIAAVMAVKAGADIILMPESLEKSFNAVLNAVNSGEISISRIEESAERVL 388 Query: 64 YLKNKMK 70 LK K K Sbjct: 389 TLKAKYK 395 >gi|317484721|ref|ZP_07943622.1| glycosyl hydrolase family 3 N terminal domain-containing protein [Bilophila wadsworthia 3_1_6] gi|316924077|gb|EFV45262.1| glycosyl hydrolase family 3 N terminal domain-containing protein [Bilophila wadsworthia 3_1_6] Length = 379 Score = 37.3 bits (85), Expect = 0.77, Method: Composition-based stats. Identities = 22/73 (30%), Positives = 37/73 (50%), Gaps = 12/73 (16%) Query: 9 LALIACKWNLSRIIA-VYNAGADQ--------QDPA---DVIELIYAHVKSGEIKPSRIE 56 + IA ++ L ++ AGAD DPA V +I V+ G I +R+E Sbjct: 306 MDAIAAEYTLEEVVLRAIGAGADILLFGNNLEYDPAIVAKVQAVIVRAVEDGTISRARLE 365 Query: 57 SAYQRIIYLKNKM 69 ++++RI+ LK +M Sbjct: 366 ASWRRILKLKQQM 378 >gi|294673403|ref|YP_003574019.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23] gi|294473687|gb|ADE83076.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23] Length = 391 Score = 37.3 bits (85), Expect = 0.79, Method: Composition-based stats. Identities = 16/67 (23%), Positives = 34/67 (50%), Gaps = 5/67 (7%) Query: 9 LALIACKWNLSRI-IAVYNAGADQ-QDPADVI---ELIYAHVKSGEIKPSRIESAYQRII 63 + I ++ + + AGAD DP +++ + + A V G + +RI + +RI+ Sbjct: 321 MGAITKQYTNAEAAVGCIKAGADIVLDPRNLVEAFDAVIAAVNDGTLSEARINQSVRRIL 380 Query: 64 YLKNKMK 70 LK +++ Sbjct: 381 TLKQQIR 387 >gi|193213634|ref|YP_001999587.1| glycoside hydrolase family 3 domain-containing protein [Chlorobaculum parvum NCIB 8327] gi|193087111|gb|ACF12387.1| glycoside hydrolase family 3 domain protein [Chlorobaculum parvum NCIB 8327] Length = 373 Score = 36.9 bits (84), Expect = 0.88, Method: Composition-based stats. Identities = 22/76 (28%), Positives = 36/76 (47%), Gaps = 13/76 (17%) Query: 9 LALIACKWNLSRII-AVYNAGAD---------QQDPA---DVIELIYAHVKSGEIKPSRI 55 + IA ++ L + I +AG D DP +I V+ G++ P RI Sbjct: 295 MKAIADRYGLEQAIRLAIDAGVDVLLFGNNVGIYDPEIAEKANAIIRRLVEKGDVTPERI 354 Query: 56 ESAYQRIIYLKNKMKT 71 +++Y+RII LK + T Sbjct: 355 DASYRRIIALKQRTIT 370 >gi|95929642|ref|ZP_01312384.1| glycoside hydrolase, family 3-like [Desulfuromonas acetoxidans DSM 684] gi|95134339|gb|EAT15996.1| glycoside hydrolase, family 3-like [Desulfuromonas acetoxidans DSM 684] Length = 411 Score = 36.9 bits (84), Expect = 0.91, Method: Composition-based stats. Identities = 16/73 (21%), Positives = 31/73 (42%), Gaps = 12/73 (16%) Query: 9 LALIACKWNLSRII-AVYNAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIE 56 + I+ + L I NAG D + +I +K G++ +RI+ Sbjct: 311 MKAISAHYGLETAIEKALNAGVDMLVFGNNLSYNEHSVEQAVTIIQRLIKQGKVSEARID 370 Query: 57 SAYQRIIYLKNKM 69 +++RI LK ++ Sbjct: 371 ESWRRITMLKRRL 383 >gi|317130448|ref|YP_004096730.1| glycoside hydrolase [Bacillus cellulosilyticus DSM 2522] gi|315475396|gb|ADU31999.1| glycoside hydrolase family 3 domain protein [Bacillus cellulosilyticus DSM 2522] Length = 384 Score = 36.9 bits (84), Expect = 0.98, Method: Composition-based stats. Identities = 16/69 (23%), Positives = 33/69 (47%), Gaps = 8/69 (11%) Query: 8 LLALIACKWNLSRII-AVYNAG-------ADQQDPADVIELIYAHVKSGEIKPSRIESAY 59 ++ IA +++ + AG +D + ++ + V GEI+ RI+ + Sbjct: 289 VMEAIAANFSVEEAVYKGIQAGIDIFLISSDVEAQQQAMDELLRMVHDGEIQEERIDESV 348 Query: 60 QRIIYLKNK 68 +RI+ +KNK Sbjct: 349 KRILQVKNK 357 >gi|21674933|ref|NP_662998.1| beta-N-acetylglucosaminidase [Chlorobium tepidum TLS] gi|21648162|gb|AAM73340.1| beta-N-acetylglucosaminidase [Chlorobium tepidum TLS] Length = 564 Score = 36.9 bits (84), Expect = 0.98, Method: Composition-based stats. Identities = 16/53 (30%), Positives = 29/53 (54%), Gaps = 4/53 (7%) Query: 21 IIAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 + AG D +DP V + + A V++GEI +I+ + QRI+ +K+ + Sbjct: 309 AVRAVQAGNDMLLFPEDPELVFDAVCAAVENGEISEQQIDHSVQRILQMKHWL 361 >gi|28188982|dbj|BAC56177.1| beta-N-acetylglucosaminidase [Clostridium paraputrificum] Length = 413 Score = 36.5 bits (83), Expect = 1.1, Method: Composition-based stats. Identities = 19/70 (27%), Positives = 32/70 (45%), Gaps = 8/70 (11%) Query: 9 LALIACKWNLSRI-IAVYNAGADQ-------QDPADVIELIYAHVKSGEIKPSRIESAYQ 60 + I+ W+L I AGAD ++ V + V G+I +RI+ + + Sbjct: 307 MQAISKNWDLGEAAIKSVEAGADILLVCHTIENQQKVYNAVVQGVNDGKIDENRIDESVR 366 Query: 61 RIIYLKNKMK 70 RI+ LK + K Sbjct: 367 RILRLKYQYK 376 >gi|78778263|ref|YP_394578.1| Beta-N-acetylhexosaminidase [Sulfurimonas denitrificans DSM 1251] gi|78498803|gb|ABB45343.1| Beta-N-acetylhexosaminidase [Sulfurimonas denitrificans DSM 1251] Length = 358 Score = 36.5 bits (83), Expect = 1.2, Method: Composition-based stats. Identities = 20/72 (27%), Positives = 34/72 (47%), Gaps = 10/72 (13%) Query: 9 LALIACKWNLSRIIAV-YNAGADQ---------QDPADVIELIYAHVKSGEIKPSRIESA 58 + I ++L I+A+ N+G D QD ++++I+ +K+G I RIE + Sbjct: 285 MKAILSHYSLEEIVALSINSGVDMLLFANQLTTQDIDALVDVIFQEIKNGNIPMDRIEES 344 Query: 59 YQRIIYLKNKMK 70 RI L K Sbjct: 345 NARIEQLYKTYK 356 >gi|332977733|gb|EGK14496.1| glycosyl hydrolase domain protein [Desmospora sp. 8437] Length = 577 Score = 36.5 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 19/57 (33%), Positives = 27/57 (47%), Gaps = 9/57 (15%) Query: 21 IIAVYNAGADQ---------QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNK 68 + +AGAD Q+ +V E I V+SGEI RI+ + RI+ K K Sbjct: 327 AVKAVDAGADMILLTPSLSAQEQIEVFEAIVDAVRSGEISEKRIDRSVHRILQKKKK 383 >gi|159030295|emb|CAO91190.1| unnamed protein product [Microcystis aeruginosa PCC 7806] Length = 526 Score = 36.5 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 15/37 (40%), Positives = 24/37 (64%) Query: 33 DPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 DP + IE +Y+ V++G I +RI ++QRI K K+ Sbjct: 306 DPIEAIEAVYSAVQAGTISEARINDSWQRIQRAKEKL 342 >gi|78187897|ref|YP_375940.1| glycosy hydrolase family protein [Chlorobium luteolum DSM 273] gi|78167799|gb|ABB24897.1| glycosyl hydrolase, family 3 [Chlorobium luteolum DSM 273] Length = 378 Score = 36.5 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 21/74 (28%), Positives = 30/74 (40%), Gaps = 13/74 (17%) Query: 9 LALIACKWNLSRIIAV-YNAGADQ---------QDPA---DVIELIYAHVKSGEIKPSRI 55 + IA + + + AGAD DP +I V G I P RI Sbjct: 300 MGAIAQNFGFEEAVRLSIEAGADILVFANNTAVYDPKIAEKASGIIRRMVDEGIISPLRI 359 Query: 56 ESAYQRIIYLKNKM 69 E +Y+RI+ LK + Sbjct: 360 EESYRRIMTLKETV 373 >gi|317499174|ref|ZP_07957451.1| glycosyl hydrolase family 3 N terminal domain-containing protein [Lachnospiraceae bacterium 5_1_63FAA] gi|316893587|gb|EFV15792.1| glycosyl hydrolase family 3 N terminal domain-containing protein [Lachnospiraceae bacterium 5_1_63FAA] Length = 399 Score = 36.2 bits (82), Expect = 1.4, Method: Composition-based stats. Identities = 19/65 (29%), Positives = 31/65 (47%), Gaps = 5/65 (7%) Query: 9 LALIACKWNLSRI-IAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRII 63 + I ++ + AG D + + + I +KSG+IK SRI+ + +RII Sbjct: 320 MKAITDNYSSGEAAVKAIQAGVDLIVMPDNYKEAYKAIKKALKSGKIKESRIDKSVRRII 379 Query: 64 YLKNK 68 Y K K Sbjct: 380 YTKLK 384 >gi|147678616|ref|YP_001212831.1| beta-glucosidase-related glycosidases and D-alanyl-D-alanine dipeptidase [Pelotomaculum thermopropionicum SI] gi|146274713|dbj|BAF60462.1| beta-glucosidase-related glycosidases and D-alanyl-D-alanine dipeptidase [Pelotomaculum thermopropionicum SI] Length = 1139 Score = 36.2 bits (82), Expect = 1.4, Method: Composition-based stats. Identities = 20/63 (31%), Positives = 29/63 (46%), Gaps = 5/63 (7%) Query: 9 LALIACKWNLSRIIA-VYNAGADQQ----DPADVIELIYAHVKSGEIKPSRIESAYQRII 63 + IA + + AGAD D + A V+SGEI SRI+ + +R+I Sbjct: 841 MKAIADHFGPREAVIGAVKAGADIALMPADLDQAYNGLLAAVRSGEIPESRIDESVKRLI 900 Query: 64 YLK 66 LK Sbjct: 901 RLK 903 >gi|268679756|ref|YP_003304187.1| glycoside hydrolase [Sulfurospirillum deleyianum DSM 6946] gi|268617787|gb|ACZ12152.1| glycoside hydrolase family 3 domain protein [Sulfurospirillum deleyianum DSM 6946] Length = 345 Score = 36.2 bits (82), Expect = 1.5, Method: Composition-based stats. Identities = 20/42 (47%), Positives = 26/42 (61%) Query: 28 GADQQDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 G D P V +I V+ GEI+P RIE +Y+RI+ LK KM Sbjct: 302 GDDASIPFTVQRIIMEGVRKGEIRPQRIELSYKRIMALKQKM 343 >gi|166363718|ref|YP_001655991.1| putative beta-glucosidase [Microcystis aeruginosa NIES-843] gi|166086091|dbj|BAG00799.1| putative beta-glucosidase [Microcystis aeruginosa NIES-843] Length = 526 Score = 36.2 bits (82), Expect = 1.5, Method: Composition-based stats. Identities = 15/37 (40%), Positives = 24/37 (64%) Query: 33 DPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 DP + IE +Y+ V++G I +RI ++QRI K K+ Sbjct: 306 DPIEAIEAVYSAVQAGTISEARINDSWQRIQRAKEKL 342 >gi|313904834|ref|ZP_07838206.1| glycoside hydrolase family 3 domain protein [Eubacterium cellulosolvens 6] gi|313470267|gb|EFR65597.1| glycoside hydrolase family 3 domain protein [Eubacterium cellulosolvens 6] Length = 454 Score = 36.2 bits (82), Expect = 1.6, Method: Composition-based stats. Identities = 19/67 (28%), Positives = 34/67 (50%), Gaps = 5/67 (7%) Query: 9 LALIACKWNLSRII-AVYNAGADQ-QDPADVIE---LIYAHVKSGEIKPSRIESAYQRII 63 + + + + + AGAD Q P D+ E + V++GEI+ SRI+ + +RI+ Sbjct: 377 MKAVTDHYTSAEAVVMAVKAGADMVQRPTDLSEAYQTLLKAVRNGEIEESRIDESVKRIL 436 Query: 64 YLKNKMK 70 K M+ Sbjct: 437 RAKYAMQ 443 >gi|167767602|ref|ZP_02439655.1| hypothetical protein CLOSS21_02135 [Clostridium sp. SS2/1] gi|167710619|gb|EDS21198.1| hypothetical protein CLOSS21_02135 [Clostridium sp. SS2/1] Length = 421 Score = 36.2 bits (82), Expect = 1.6, Method: Composition-based stats. Identities = 19/65 (29%), Positives = 31/65 (47%), Gaps = 5/65 (7%) Query: 9 LALIACKWNLSRI-IAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRII 63 + I ++ + AG D + + + I +KSG+IK SRI+ + +RII Sbjct: 342 MKAITDNYSSGEAAVKAIQAGVDLIVMPDNYKEAYKAIKKALKSGKIKESRIDKSVRRII 401 Query: 64 YLKNK 68 Y K K Sbjct: 402 YTKLK 406 >gi|126656850|ref|ZP_01728028.1| beta-glucosidase [Cyanothece sp. CCY0110] gi|126621688|gb|EAZ92397.1| beta-glucosidase [Cyanothece sp. CCY0110] Length = 539 Score = 36.2 bits (82), Expect = 1.7, Method: Composition-based stats. Identities = 17/53 (32%), Positives = 25/53 (47%), Gaps = 4/53 (7%) Query: 21 IIAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 + AGAD DP I +Y V++G + RI+ + QRI K K+ Sbjct: 294 AVKAVEAGADILLMPDDPEIAINAVYDAVETGRLTTERIDESLQRIWQAKQKL 346 >gi|153872350|ref|ZP_02001271.1| conserved hypothetical protein [Beggiatoa sp. PS] gi|152071182|gb|EDN68727.1| conserved hypothetical protein [Beggiatoa sp. PS] Length = 403 Score = 36.2 bits (82), Expect = 1.7, Method: Composition-based stats. Identities = 19/74 (25%), Positives = 31/74 (41%), Gaps = 13/74 (17%) Query: 9 LALIACKWNLSRII-AVYNAGADQ------------QDPADVIELIYAHVKSGEIKPSRI 55 + IA + L + +AG D A +I V++G I +RI Sbjct: 330 MKAIASHYGLETAVHKAIDAGVDILVIGNNTGDFVPDIAAQAFNIIKRLVQNGTISEARI 389 Query: 56 ESAYQRIIYLKNKM 69 E +YQRI +K ++ Sbjct: 390 EESYQRIQQMKRRI 403 >gi|268609208|ref|ZP_06142935.1| beta-N-acetylhexosaminidase [Ruminococcus flavefaciens FD-1] Length = 632 Score = 35.8 bits (81), Expect = 1.8, Method: Composition-based stats. Identities = 19/51 (37%), Positives = 25/51 (49%), Gaps = 4/51 (7%) Query: 20 RIIAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLK 66 ++I NAG D DV +I V SG+I RI A +RII +K Sbjct: 343 QLITSINAGIDMLMEVDTFEDVYNIIIDAVHSGDISEERINDAAERIIRVK 393 >gi|302669326|ref|YP_003832476.1| beta-N-acetylhexosaminidase Bhx3C [Butyrivibrio proteoclasticus B316] gi|302396990|gb|ADL35894.1| beta-N-acetylhexosaminidase Bhx3C [Butyrivibrio proteoclasticus B316] Length = 666 Score = 35.8 bits (81), Expect = 1.8, Method: Composition-based stats. Identities = 17/33 (51%), Positives = 20/33 (60%) Query: 36 DVIELIYAHVKSGEIKPSRIESAYQRIIYLKNK 68 D I I A V SGEI RI+ + QRI+ LK K Sbjct: 377 DYIAGIGAKVTSGEISQDRIDESVQRILTLKAK 409 >gi|124002449|ref|ZP_01687302.1| glycosyl hydrolase, family 3 [Microscilla marina ATCC 23134] gi|123992278|gb|EAY31646.1| glycosyl hydrolase, family 3 [Microscilla marina ATCC 23134] Length = 383 Score = 35.8 bits (81), Expect = 1.9, Method: Composition-based stats. Identities = 19/76 (25%), Positives = 31/76 (40%), Gaps = 14/76 (18%) Query: 9 LALIACKWNLSRII-AVYNAGADQ-------------QDPADVIELIYAHVKSGEIKPSR 54 + IA + + + NAG D + I +I +K G+I R Sbjct: 308 MNAIAKNFGIEEALEKSINAGVDIVLFSNNGRIFYNKNIVPEAINIIKKLIKQGKISRKR 367 Query: 55 IESAYQRIIYLKNKMK 70 I+ +YQRI +K +K Sbjct: 368 IDESYQRIKKMKQGLK 383 >gi|115359049|ref|YP_776187.1| Beta-glucosidase [Burkholderia ambifaria AMMD] gi|115284337|gb|ABI89853.1| Beta-glucosidase [Burkholderia ambifaria AMMD] Length = 669 Score = 35.8 bits (81), Expect = 2.0, Method: Composition-based stats. Identities = 25/74 (33%), Positives = 38/74 (51%), Gaps = 14/74 (18%) Query: 6 KALLALIACKWNLS------RIIAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRI 55 KA A IA W + R + NAG +Q DP ++EL VKSG + +R+ Sbjct: 389 KATPADIAMPWGVENLPKAERFLKALNAGVNQFGGIDDPTPIVEL----VKSGRLSETRL 444 Query: 56 ESAYQRIIYLKNKM 69 +++ RI+ LK K+ Sbjct: 445 DASVTRILELKFKL 458 >gi|308208213|gb|ADO20357.1| beta-D-xylosidase/alpha-L-arabinosidase [uncultured rumen bacterium] Length = 775 Score = 35.8 bits (81), Expect = 2.1, Method: Composition-based stats. Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 11/56 (19%) Query: 21 IIAVYNAGADQ-QDPAD------VIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 + NAG D DP D +I+L VKSGEI RI+ A +RI+ LK ++ Sbjct: 334 LALGINAGIDMIMDPYDPECCTAIIDL----VKSGEIPMERIDDAVRRILRLKVRL 385 >gi|302671153|ref|YP_003831113.1| beta-N-acetylhexosaminidase Bhx3B [Butyrivibrio proteoclasticus B316] gi|302395626|gb|ADL34531.1| beta-N-acetylhexosaminidase Bhx3B [Butyrivibrio proteoclasticus B316] Length = 426 Score = 35.8 bits (81), Expect = 2.1, Method: Composition-based stats. Identities = 17/62 (27%), Positives = 27/62 (43%), Gaps = 5/62 (8%) Query: 10 ALIACKWNLSRI-IAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRIIY 64 A I + + + NAGAD ++ + + + V SG I RI + +RI Sbjct: 356 AAITENYTSAEAAVNAINAGADMIFLPENFEEAYQGVLDAVNSGAITEDRINESIKRIYR 415 Query: 65 LK 66 LK Sbjct: 416 LK 417 >gi|148271261|ref|YP_001220822.1| putative beta-glucosidase/beta-xylosidase [Clavibacter michiganensis subsp. michiganensis NCPPB 382] gi|147829191|emb|CAN00102.1| putative beta-glucosidase/beta-xylosidase [Clavibacter michiganensis subsp. michiganensis NCPPB 382] Length = 612 Score = 35.8 bits (81), Expect = 2.2, Method: Composition-based stats. Identities = 14/50 (28%), Positives = 28/50 (56%) Query: 20 RIIAVYNAGADQQDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 R++ + AGADQ +L+ V G I +RI+ + +R++ +K ++ Sbjct: 354 RMVKIIEAGADQFGGEQCTDLLLDLVHDGSISEARIDESARRLLLVKFQL 403 >gi|218885938|ref|YP_002435259.1| beta-N-acetylhexosaminidase [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218756892|gb|ACL07791.1| Beta-N-acetylhexosaminidase [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 586 Score = 35.4 bits (80), Expect = 2.4, Method: Composition-based stats. Identities = 19/74 (25%), Positives = 31/74 (41%), Gaps = 12/74 (16%) Query: 9 LALIACKWNLSRII-AVYNAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIE 56 + I ++ L ++ +AGAD A V + V+SG I RI Sbjct: 473 MGAITDRYPLEEVVFRAVDAGADILLFGNNLSWQPDLTARVHATLTGLVQSGRISEDRIR 532 Query: 57 SAYQRIIYLKNKMK 70 +YQR+ LK ++ Sbjct: 533 QSYQRVTRLKGLLR 546 >gi|268611122|ref|ZP_06144849.1| beta-N-acetylhexosaminidase [Ruminococcus flavefaciens FD-1] Length = 825 Score = 35.4 bits (80), Expect = 2.5, Method: Composition-based stats. Identities = 17/52 (32%), Positives = 24/52 (46%), Gaps = 4/52 (7%) Query: 20 RIIAVYNAGADQQDPADVIE----LIYAHVKSGEIKPSRIESAYQRIIYLKN 67 ++I NAG D D + +I V SG+I R+ A RII +K Sbjct: 337 QVIKSINAGIDMLMETDNFDEAKQIIVDAVGSGDISEERVNDAVTRIIKVKK 388 >gi|172039586|ref|YP_001806087.1| beta-glucosidase [Cyanothece sp. ATCC 51142] gi|171701040|gb|ACB54021.1| beta-glucosidase [Cyanothece sp. ATCC 51142] Length = 539 Score = 35.4 bits (80), Expect = 2.5, Method: Composition-based stats. Identities = 17/53 (32%), Positives = 25/53 (47%), Gaps = 4/53 (7%) Query: 21 IIAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 I AGAD DP I +Y V++G + RI+ + Q+I K K+ Sbjct: 294 AIKAVEAGADILLMPDDPEMAINAVYNAVETGRLTTERIDESLQKIWQAKQKL 346 >gi|254787629|ref|YP_003075058.1| glycoside hydrolase family 3 domain-containing protein [Teredinibacter turnerae T7901] gi|237683422|gb|ACR10686.1| glycoside hydrolase family 3 domain protein [Teredinibacter turnerae T7901] Length = 1064 Score = 35.4 bits (80), Expect = 2.6, Method: Composition-based stats. Identities = 19/54 (35%), Positives = 28/54 (51%), Gaps = 4/54 (7%) Query: 17 NLSRIIAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLK 66 +S NAG D D +I+ A V+SGEI +RI+ A +RI+ +K Sbjct: 335 TVSSCAQAINAGIDLVMVPNDWKALIKNTIAQVESGEISQARIDDAVRRILRVK 388 >gi|268607950|ref|ZP_06141681.1| glycoside hydrolase family 3 domain protein [Ruminococcus flavefaciens FD-1] Length = 419 Score = 35.4 bits (80), Expect = 2.8, Method: Composition-based stats. Identities = 14/65 (21%), Positives = 28/65 (43%), Gaps = 5/65 (7%) Query: 9 LALIACKWNLSRI-IAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRII 63 + I ++ + + AG D + + + I V SG++ RI + +RI+ Sbjct: 350 MGAITNSYSSADAAVMAVQAGNDILLTPDNFLEAVNGIEEAVNSGKLTEERINESVRRIL 409 Query: 64 YLKNK 68 LK + Sbjct: 410 TLKKE 414 >gi|192359054|ref|YP_001981636.1| putative 1,4-beta-D-glucan glucohydrolase cel3D [Cellvibrio japonicus Ueda107] gi|190685219|gb|ACE82897.1| putative 1,4-beta-D-glucan glucohydrolase cel3D [Cellvibrio japonicus Ueda107] Length = 1069 Score = 35.4 bits (80), Expect = 2.8, Method: Composition-based stats. Identities = 19/47 (40%), Positives = 26/47 (55%), Gaps = 4/47 (8%) Query: 24 VYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLK 66 NAG D D D+I A VKSGEI +R++ A +RI+ +K Sbjct: 339 AINAGIDLVMVTYDWKDMITNTLAQVKSGEISQARLDDAVRRILRVK 385 >gi|299536575|ref|ZP_07049887.1| lipoprotein ybbD precursor [Lysinibacillus fusiformis ZC1] gi|298728059|gb|EFI68622.1| lipoprotein ybbD precursor [Lysinibacillus fusiformis ZC1] Length = 562 Score = 35.4 bits (80), Expect = 2.9, Method: Composition-based stats. Identities = 18/70 (25%), Positives = 30/70 (42%), Gaps = 8/70 (11%) Query: 9 LALIACKWNLSRI-IAVYNAGADQ-------QDPADVIELIYAHVKSGEIKPSRIESAYQ 60 + I + + + + AG+D + IE + V +GEI RI + Q Sbjct: 466 MKAITNHYAIGQAAVDSIKAGSDIILIAHEYANMTAAIEAVKVAVSNGEITEERINESVQ 525 Query: 61 RIIYLKNKMK 70 RI+ LK K + Sbjct: 526 RILKLKEKYQ 535 >gi|332974899|gb|EGK11812.1| glycosyl hydrolase domain protein [Desmospora sp. 8437] Length = 587 Score = 35.4 bits (80), Expect = 2.9, Method: Composition-based stats. Identities = 20/68 (29%), Positives = 28/68 (41%), Gaps = 8/68 (11%) Query: 9 LALIACKWNLSRI-IAVYNAGAD-------QQDPADVIELIYAHVKSGEIKPSRIESAYQ 60 + I + I AGAD I I VK GEI +RI+ + + Sbjct: 321 MGAIVDNFPAEEAAIRAVKAGADILLISHDLNRQQASIRGIRDAVKRGEISEARIDRSLR 380 Query: 61 RIIYLKNK 68 RI++LK K Sbjct: 381 RILHLKGK 388 >gi|55377095|ref|YP_134945.1| beta-glucosidase [Haloarcula marismortui ATCC 43049] gi|55229820|gb|AAV45239.1| beta-glucosidase [Haloarcula marismortui ATCC 43049] Length = 854 Score = 35.4 bits (80), Expect = 3.0, Method: Composition-based stats. Identities = 14/46 (30%), Positives = 23/46 (50%) Query: 24 VYNAGADQQDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 + G D P I+ + + V+ G I RI+ A +RI+ LK + Sbjct: 347 MIGNGGDAPGPVQFIDTVVSLVEDGAIPMERIDEAVRRILELKADL 392 >gi|296119065|ref|ZP_06837637.1| glycosyl hydrolase, family 3 [Corynebacterium ammoniagenes DSM 20306] gi|295967900|gb|EFG81153.1| glycosyl hydrolase, family 3 [Corynebacterium ammoniagenes DSM 20306] Length = 335 Score = 35.0 bits (79), Expect = 3.0, Method: Composition-based stats. Identities = 20/69 (28%), Positives = 35/69 (50%), Gaps = 7/69 (10%) Query: 9 LALIACKWNLSRII-AVYNAGADQ------QDPADVIELIYAHVKSGEIKPSRIESAYQR 61 +A I+ ++ + + AGADQ D VI+ V++GEI P R++S QR Sbjct: 267 MAAISNTMSIEQAVPKALAAGADQALWSSASDINAVIDACVHAVETGEIHPYRLQSGAQR 326 Query: 62 IIYLKNKMK 70 + + + M+ Sbjct: 327 VAHRLDSMQ 335 >gi|119502835|ref|ZP_01624920.1| Beta-glucosidase [marine gamma proteobacterium HTCC2080] gi|119461181|gb|EAW42271.1| Beta-glucosidase [marine gamma proteobacterium HTCC2080] Length = 824 Score = 35.0 bits (79), Expect = 3.1, Method: Composition-based stats. Identities = 18/47 (38%), Positives = 26/47 (55%), Gaps = 4/47 (8%) Query: 24 VYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLK 66 NAG D +D +E + A V+SGEI +RI+ A RI+ +K Sbjct: 317 AINAGIDMVMVPEDWLSALENLVAQVQSGEISEARIDEAVLRILKVK 363 >gi|15613238|ref|NP_241541.1| beta-hexosamidase A precursor [Bacillus halodurans C-125] gi|10173289|dbj|BAB04394.1| beta-hexosamidase A precursor [Bacillus halodurans C-125] Length = 686 Score = 35.0 bits (79), Expect = 3.1, Method: Composition-based stats. Identities = 18/63 (28%), Positives = 28/63 (44%), Gaps = 5/63 (7%) Query: 9 LALIACKWNLSRI-IAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRII 63 + I+ + + I NAGAD V + V++GEI R+ A +RI+ Sbjct: 412 MNAISDHFGPTDAVIRSINAGADIILMPVGLQTVFPAVVEAVENGEISEERVNEAVKRIL 471 Query: 64 YLK 66 LK Sbjct: 472 TLK 474 >gi|332706080|ref|ZP_08426152.1| beta-glucosidase-related protein [Lyngbya majuscula 3L] gi|332355172|gb|EGJ34640.1| beta-glucosidase-related protein [Lyngbya majuscula 3L] Length = 586 Score = 35.0 bits (79), Expect = 3.3, Method: Composition-based stats. Identities = 18/67 (26%), Positives = 31/67 (46%), Gaps = 5/67 (7%) Query: 8 LLALIACKWNLSRI-IAVYNAGADQQ----DPADVIELIYAHVKSGEIKPSRIESAYQRI 62 ++ IA ++ + AGAD +P IE + V+ G I RI+++ +RI Sbjct: 283 VMGAIANRYGADEATVMAVEAGADILLMPVNPETAIEAVCQAVEQGRISRQRIQASVERI 342 Query: 63 IYLKNKM 69 K K+ Sbjct: 343 SRAKRKV 349 >gi|125972843|ref|YP_001036753.1| glycoside hydrolase family protein [Clostridium thermocellum ATCC 27405] gi|256005885|ref|ZP_05430832.1| glycoside hydrolase family 3 domain protein [Clostridium thermocellum DSM 2360] gi|125713068|gb|ABN51560.1| glycoside hydrolase, family 3-like protein [Clostridium thermocellum ATCC 27405] gi|255990154|gb|EEU00289.1| glycoside hydrolase family 3 domain protein [Clostridium thermocellum DSM 2360] gi|316940921|gb|ADU74955.1| glycoside hydrolase family 3 domain protein [Clostridium thermocellum DSM 1313] Length = 444 Score = 35.0 bits (79), Expect = 3.3, Method: Composition-based stats. Identities = 19/63 (30%), Positives = 30/63 (47%), Gaps = 5/63 (7%) Query: 9 LALIACKWNLSRI-IAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRII 63 + I+ W+ S+ + + AGAD + + I VK GEI R+ + QRI+ Sbjct: 341 MKAISNYWSSSKAAVMAFKAGADIILMPESFEEAYNGILKAVKDGEITEERLNQSLQRIL 400 Query: 64 YLK 66 LK Sbjct: 401 ALK 403 >gi|281417041|ref|ZP_06248061.1| glycoside hydrolase family 3 domain protein [Clostridium thermocellum JW20] gi|281408443|gb|EFB38701.1| glycoside hydrolase family 3 domain protein [Clostridium thermocellum JW20] Length = 444 Score = 35.0 bits (79), Expect = 3.4, Method: Composition-based stats. Identities = 19/63 (30%), Positives = 30/63 (47%), Gaps = 5/63 (7%) Query: 9 LALIACKWNLSRI-IAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRII 63 + I+ W+ S+ + + AGAD + + I VK GEI R+ + QRI+ Sbjct: 341 MKAISNYWSSSKAAVMAFKAGADIILMPESFEEAYNGILKAVKDGEITEERLNQSLQRIL 400 Query: 64 YLK 66 LK Sbjct: 401 ALK 403 >gi|152992043|ref|YP_001357764.1| glycosy hydrolase family protein [Sulfurovum sp. NBC37-1] gi|151423904|dbj|BAF71407.1| glycosyl hydrolase, family 3 [Sulfurovum sp. NBC37-1] Length = 361 Score = 35.0 bits (79), Expect = 3.4, Method: Composition-based stats. Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 13/74 (17%) Query: 9 LALIACKWNLSRIIA-VYNAGAD------QQDP------ADVIELIYAHVKSGEIKPSRI 55 + I+ K+ L + NAG D Q DP ++E I + +K GE+ P I Sbjct: 288 MGAISKKYGLKNTLKLAINAGDDILLFGNQLDPRKTVSTKKLVETIKSLLKRGEVNPKSI 347 Query: 56 ESAYQRIIYLKNKM 69 + AY RI LK K+ Sbjct: 348 DYAYIRIQNLKRKL 361 >gi|291303007|ref|YP_003514285.1| glycoside hydrolase family 3 domain-containing protein [Stackebrandtia nassauensis DSM 44728] gi|290572227|gb|ADD45192.1| glycoside hydrolase family 3 domain protein [Stackebrandtia nassauensis DSM 44728] Length = 612 Score = 35.0 bits (79), Expect = 3.6, Method: Composition-based stats. Identities = 17/55 (30%), Positives = 24/55 (43%), Gaps = 4/55 (7%) Query: 19 SRIIAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 S + NAG D D I + + V +G I RI+ A RI+ K K+ Sbjct: 319 SDVRTSINAGVDMVMVPYDYKTFISTLISEVNAGRIPMERIDDAVTRILTAKEKL 373 >gi|158317032|ref|YP_001509540.1| glycoside hydrolase family 3 protein [Frankia sp. EAN1pec] gi|158112437|gb|ABW14634.1| glycoside hydrolase family 3 domain protein [Frankia sp. EAN1pec] Length = 656 Score = 35.0 bits (79), Expect = 3.9, Method: Composition-based stats. Identities = 14/53 (26%), Positives = 28/53 (52%), Gaps = 4/53 (7%) Query: 21 IIAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 + AG D D ++ + + V+SG I P RI+++ +RI+ +K ++ Sbjct: 413 AVRAVQAGVDMLLMPPDLTQALDAVVSAVRSGAIVPERIDASVRRILRMKWRL 465 >gi|256377084|ref|YP_003100744.1| xylan 1,4-beta-xylosidase [Actinosynnema mirum DSM 43827] gi|255921387|gb|ACU36898.1| Xylan 1,4-beta-xylosidase [Actinosynnema mirum DSM 43827] Length = 609 Score = 35.0 bits (79), Expect = 3.9, Method: Composition-based stats. Identities = 15/50 (30%), Positives = 32/50 (64%) Query: 20 RIIAVYNAGADQQDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 R++ V AG DQ + ++L+ V+SGE+ +RI+ + +R++ +K ++ Sbjct: 352 RMLKVLEAGCDQFGGEECVDLLLDLVRSGEVGEARIDVSARRLLLVKFRL 401 >gi|251797155|ref|YP_003011886.1| glycoside hydrolase [Paenibacillus sp. JDR-2] gi|247544781|gb|ACT01800.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp. JDR-2] Length = 403 Score = 34.6 bits (78), Expect = 4.2, Method: Composition-based stats. Identities = 21/51 (41%), Positives = 28/51 (54%), Gaps = 7/51 (13%) Query: 27 AGADQ----QDPAD---VIELIYAHVKSGEIKPSRIESAYQRIIYLKNKMK 70 AGAD DP VI+ + A +SGEI PS ++++ RII LK K Sbjct: 329 AGADIILVGHDPVQQQTVIDALTAAAQSGEISPSVLDASVYRIIKLKQSFK 379 >gi|116671694|ref|YP_832627.1| Beta-glucosidase [Arthrobacter sp. FB24] gi|116611803|gb|ABK04527.1| Beta-glucosidase [Arthrobacter sp. FB24] Length = 663 Score = 34.6 bits (78), Expect = 4.2, Method: Composition-based stats. Identities = 14/50 (28%), Positives = 29/50 (58%) Query: 20 RIIAVYNAGADQQDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 R++ + NAG DQ + EL+ V+ G + RI+ + +R++ +K ++ Sbjct: 388 RMLKILNAGVDQFGGEECTELLLGLVRDGLVSEERIDESARRLLLVKFQL 437 >gi|323490781|ref|ZP_08095983.1| glycoside hydrolase family 3 domain protein [Planococcus donghaensis MPA1U2] gi|323395663|gb|EGA88507.1| glycoside hydrolase family 3 domain protein [Planococcus donghaensis MPA1U2] Length = 701 Score = 34.6 bits (78), Expect = 4.3, Method: Composition-based stats. Identities = 20/65 (30%), Positives = 32/65 (49%), Gaps = 5/65 (7%) Query: 9 LALIACKWN-LSRIIAVYNAGADQQ----DPADVIELIYAHVKSGEIKPSRIESAYQRII 63 + IA + + +I NAG D V +Y V+SGE+ RI+++ +RI+ Sbjct: 431 MQAIADHFGPVDAVIRAVNAGTDIVLMPVGLEQVATGLYEAVRSGEVTEERIDASAKRIL 490 Query: 64 YLKNK 68 LK K Sbjct: 491 SLKMK 495 >gi|238852774|ref|ZP_04643180.1| beta-N-acetylhexosaminidase [Lactobacillus gasseri 202-4] gi|238834624|gb|EEQ26855.1| beta-N-acetylhexosaminidase [Lactobacillus gasseri 202-4] Length = 641 Score = 34.6 bits (78), Expect = 4.3, Method: Composition-based stats. Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 3/56 (5%) Query: 17 NLSRIIAVYNAGADQ---QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 N S + AG D D A I I + VK+GEI S+I ++ RI+ LKNK+ Sbjct: 321 NASVDVLAVKAGNDMIMTTDYATGINEIVSAVKAGEIPESQINASVTRILQLKNKL 376 >gi|332184589|gb|AEE26843.1| Beta-hexosaminidase [Francisella cf. novicida 3523] Length = 378 Score = 34.6 bits (78), Expect = 4.5, Method: Composition-based stats. Identities = 17/59 (28%), Positives = 28/59 (47%), Gaps = 5/59 (8%) Query: 18 LSRIIAVYNAGADQ-----QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKMKT 71 L + NAG + +P +I+ I V+SGE+ + I+ +Y+ II K T Sbjct: 304 LESLKLAINAGVNMFIFSDANPDTIIDNIAKLVESGEVTEATIKQSYENIIAYKQNYLT 362 >gi|291539206|emb|CBL12317.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis XB6B4] Length = 430 Score = 34.6 bits (78), Expect = 4.6, Method: Composition-based stats. Identities = 15/67 (22%), Positives = 29/67 (43%), Gaps = 5/67 (7%) Query: 9 LALIACKWNLSRI-IAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRII 63 + + + + + NAG D QD + V+ G I RI+ + +RI+ Sbjct: 364 MGAVTGNYTADQAAVMAVNAGVDMILMPQDYETAYNGLLQAVQDGTISEERIDESVERIV 423 Query: 64 YLKNKMK 70 +K +M+ Sbjct: 424 KVKLQMQ 430 >gi|322517772|gb|ADX05691.1| putative carbohydrate-active enzyme [uncultured organism] Length = 574 Score = 34.6 bits (78), Expect = 4.8, Method: Composition-based stats. Identities = 18/50 (36%), Positives = 24/50 (48%), Gaps = 4/50 (8%) Query: 24 VYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 V AG D DP + + + A V SG I R+ A RI+ LK K+ Sbjct: 303 VIAAGCDMLLFFNDPEEDLAYMKAGVDSGIISQERLSDALHRILGLKAKL 352 >gi|256830691|ref|YP_003159419.1| beta-N-acetylhexosaminidase [Desulfomicrobium baculatum DSM 4028] gi|256579867|gb|ACU91003.1| Beta-N-acetylhexosaminidase [Desulfomicrobium baculatum DSM 4028] Length = 382 Score = 34.6 bits (78), Expect = 4.8, Method: Composition-based stats. Identities = 20/75 (26%), Positives = 32/75 (42%), Gaps = 12/75 (16%) Query: 9 LALIACKWNLSR-IIAVYNAGADQ--------QDPADV---IELIYAHVKSGEIKPSRIE 56 + IA + + I+ AG D DP V I+++ V+ G + RI Sbjct: 298 MRAIADHYGQAEAILLAVEAGVDVLVFGNNLDYDPEIVPKAIDILVKAVEDGRLSVERIA 357 Query: 57 SAYQRIIYLKNKMKT 71 ++YQRI K + T Sbjct: 358 ASYQRIQAAKQQFYT 372 >gi|302382873|ref|YP_003818696.1| glycoside hydrolase [Brevundimonas subvibrioides ATCC 15264] gi|302193501|gb|ADL01073.1| glycoside hydrolase family 3 domain protein [Brevundimonas subvibrioides ATCC 15264] Length = 827 Score = 34.6 bits (78), Expect = 4.8, Method: Composition-based stats. Identities = 18/50 (36%), Positives = 28/50 (56%), Gaps = 4/50 (8%) Query: 23 AVYNAGADQQDPAD----VIELIYAHVKSGEIKPSRIESAYQRIIYLKNK 68 A +NAG D D + + A V+SGEI +R++ A +RI+ +K K Sbjct: 346 AAFNAGIDMFMAPDSWKPLFDNTLAQVRSGEIAMTRLDEAVRRILTVKVK 395 >gi|160942077|ref|ZP_02089392.1| hypothetical protein CLOBOL_06965 [Clostridium bolteae ATCC BAA-613] gi|158434968|gb|EDP12735.1| hypothetical protein CLOBOL_06965 [Clostridium bolteae ATCC BAA-613] Length = 447 Score = 34.6 bits (78), Expect = 4.9, Method: Composition-based stats. Identities = 17/66 (25%), Positives = 28/66 (42%), Gaps = 5/66 (7%) Query: 9 LALIACKWNLSRI-IAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRII 63 + I + R + AGAD D + + VK+GE+ RI+ + RI+ Sbjct: 377 MGAIQDNYPPDRAAVMALQAGADLLLMPADFKEAYNGVLDAVKTGELTEERIDQSLTRIL 436 Query: 64 YLKNKM 69 LK + Sbjct: 437 GLKLTL 442 >gi|118580772|ref|YP_902022.1| glycoside hydrolase family 3 protein [Pelobacter propionicus DSM 2379] gi|118503482|gb|ABK99964.1| glycoside hydrolase, family 3 domain protein [Pelobacter propionicus DSM 2379] Length = 393 Score = 34.6 bits (78), Expect = 5.1, Method: Composition-based stats. Identities = 20/72 (27%), Positives = 32/72 (44%), Gaps = 12/72 (16%) Query: 9 LALIACKWNLSRII-AVYNAGADQQDPAD-----------VIELIYAHVKSGEIKPSRIE 56 +A I ++ + NAG D A+ I+L+ V+SG+I RI+ Sbjct: 318 MAAIVQHYSYETAVEKAINAGVDLLILANDKLYSPDIAPRTIDLVVKMVESGKISRERID 377 Query: 57 SAYQRIIYLKNK 68 A RI+ LK + Sbjct: 378 QACGRIMKLKAR 389 >gi|163846652|ref|YP_001634696.1| glycoside hydrolase family 3 protein [Chloroflexus aurantiacus J-10-fl] gi|222524453|ref|YP_002568924.1| glycoside hydrolase family 3 domain-containing protein [Chloroflexus sp. Y-400-fl] gi|163667941|gb|ABY34307.1| glycoside hydrolase family 3 domain protein [Chloroflexus aurantiacus J-10-fl] gi|222448332|gb|ACM52598.1| glycoside hydrolase family 3 domain protein [Chloroflexus sp. Y-400-fl] Length = 619 Score = 34.2 bits (77), Expect = 5.3, Method: Composition-based stats. Identities = 16/53 (30%), Positives = 25/53 (47%), Gaps = 4/53 (7%) Query: 21 IIAVYNAGADQQ----DPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 +I NAG D D I+ + V+ G + RI+ A +RI+ +K M Sbjct: 330 VITAINAGIDMNMVPYDAQRFIDSLTRAVERGAVSEERIDDAVRRILTVKFAM 382 >gi|300362737|ref|ZP_07058912.1| beta-N-acetylhexosaminidase [Lactobacillus gasseri JV-V03] gi|300353165|gb|EFJ69038.1| beta-N-acetylhexosaminidase [Lactobacillus gasseri JV-V03] Length = 641 Score = 34.2 bits (77), Expect = 5.8, Method: Composition-based stats. Identities = 22/56 (39%), Positives = 31/56 (55%), Gaps = 3/56 (5%) Query: 17 NLSRIIAVYNAGADQ---QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 N S + AG D D A I+ I + VK+GEI S+I ++ RI+ LKNK+ Sbjct: 321 NASVDVLAVKAGNDMIMTTDYATGIKEIVSAVKAGEIPESQINASVTRILQLKNKL 376 >gi|225572643|ref|ZP_03781398.1| hypothetical protein RUMHYD_00831 [Blautia hydrogenotrophica DSM 10507] gi|225039997|gb|EEG50243.1| hypothetical protein RUMHYD_00831 [Blautia hydrogenotrophica DSM 10507] Length = 441 Score = 34.2 bits (77), Expect = 5.9, Method: Composition-based stats. Identities = 18/67 (26%), Positives = 30/67 (44%), Gaps = 5/67 (7%) Query: 9 LALIACKWNLSRI-IAVYNAGADQ-QDPAD---VIELIYAHVKSGEIKPSRIESAYQRII 63 + IA ++ + + AG D PAD + V S EI R+ A +RI+ Sbjct: 373 MGAIAENYSSAEAAVQAIQAGIDMVLMPADFEAAYNGVLQAVSSQEISQERLHDALRRIL 432 Query: 64 YLKNKMK 70 +K +M+ Sbjct: 433 TVKLEMQ 439 >gi|282850714|ref|ZP_06260089.1| LPXTG-motif cell wall anchor domain protein [Lactobacillus gasseri 224-1] gi|282558122|gb|EFB63709.1| LPXTG-motif cell wall anchor domain protein [Lactobacillus gasseri 224-1] Length = 390 Score = 34.2 bits (77), Expect = 6.2, Method: Composition-based stats. Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 3/56 (5%) Query: 17 NLSRIIAVYNAGADQ---QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 N S + AG D D A I I + VK+GEI S+I ++ RI+ LKNK+ Sbjct: 70 NASVDVLAVKAGNDMIMTTDYATGINEIVSAVKAGEIPESQINASVTRILQLKNKL 125 >gi|108803474|ref|YP_643411.1| glycosyl hydrolase [Rubrobacter xylanophilus DSM 9941] gi|108764717|gb|ABG03599.1| beta-N-acetylhexosaminidase. Glycosyl Hydrolase family 3 [Rubrobacter xylanophilus DSM 9941] Length = 604 Score = 34.2 bits (77), Expect = 6.2, Method: Composition-based stats. Identities = 19/49 (38%), Positives = 27/49 (55%), Gaps = 4/49 (8%) Query: 22 IAVYNAGADQQDPADVIELIY----AHVKSGEIKPSRIESAYQRIIYLK 66 + AGAD I+L Y V+SGEIK RI+++ +RI+ LK Sbjct: 339 VEAIKAGADMLLMPPDIDLAYNAVLEAVRSGEIKRRRIDASVRRILALK 387 >gi|319788699|ref|YP_004090014.1| glycoside hydrolase family 3 domain protein [Ruminococcus albus 7] gi|315450566|gb|ADU24128.1| glycoside hydrolase family 3 domain protein [Ruminococcus albus 7] Length = 453 Score = 34.2 bits (77), Expect = 6.3, Method: Composition-based stats. Identities = 17/65 (26%), Positives = 34/65 (52%), Gaps = 5/65 (7%) Query: 9 LALIACKWNLSRI-IAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRII 63 + +A ++ I +AV AG D +D ++ I VK+G++ RI+ + +R++ Sbjct: 384 MGALANYYSSDEIAVAVLKAGGDLLLMPEDLDSAVKGIEKAVKNGDLTEKRIDESLERVL 443 Query: 64 YLKNK 68 LK + Sbjct: 444 RLKKE 448 >gi|303241469|ref|ZP_07327971.1| glycoside hydrolase family 3 domain protein [Acetivibrio cellulolyticus CD2] gi|302590978|gb|EFL60724.1| glycoside hydrolase family 3 domain protein [Acetivibrio cellulolyticus CD2] Length = 724 Score = 34.2 bits (77), Expect = 6.3, Method: Composition-based stats. Identities = 18/52 (34%), Positives = 28/52 (53%), Gaps = 3/52 (5%) Query: 21 IIAVYNAGADQ-QDPADV--IELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 ++ N G D P D+ ++LI +V +G I SRI+ A +RI+ K K Sbjct: 308 LVKTINNGVDMIMAPVDLNYVDLIEQNVNNGRIPLSRIDDAVRRILKAKFKF 359 >gi|307719143|ref|YP_003874675.1| glycosyl hydrolase [Spirochaeta thermophila DSM 6192] gi|306532868|gb|ADN02402.1| putative glycosyl hydrolase [Spirochaeta thermophila DSM 6192] Length = 560 Score = 34.2 bits (77), Expect = 6.7, Method: Composition-based stats. Identities = 15/51 (29%), Positives = 27/51 (52%), Gaps = 4/51 (7%) Query: 23 AVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 +AG D +DP + ++++ VKSG + P R++ A RI+ K + Sbjct: 301 RALDAGCDIILFSEDPEEDVQIVLDAVKSGRVAPERLDEAVLRILAWKAAL 351 >gi|167751746|ref|ZP_02423873.1| hypothetical protein EUBSIR_02755 [Eubacterium siraeum DSM 15702] gi|167655554|gb|EDR99683.1| hypothetical protein EUBSIR_02755 [Eubacterium siraeum DSM 15702] Length = 406 Score = 34.2 bits (77), Expect = 6.7, Method: Composition-based stats. Identities = 15/67 (22%), Positives = 28/67 (41%), Gaps = 5/67 (7%) Query: 9 LALIACKWNLSRI-IAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRII 63 + +A + + + AG D Q+ + + V GEI R++ + RI+ Sbjct: 338 MGAVADNYTSAEAAVTAVKAGVDIVLMPQNLDEAFNGVMNAVTDGEISMERLDESVLRIL 397 Query: 64 YLKNKMK 70 +K K K Sbjct: 398 KMKAKYK 404 >gi|302754618|ref|XP_002960733.1| hypothetical protein SELMODRAFT_74114 [Selaginella moellendorffii] gi|300171672|gb|EFJ38272.1| hypothetical protein SELMODRAFT_74114 [Selaginella moellendorffii] Length = 619 Score = 33.8 bits (76), Expect = 6.9, Method: Composition-based stats. Identities = 18/56 (32%), Positives = 28/56 (50%), Gaps = 5/56 (8%) Query: 15 KWNLSRIIAVYNAGADQQ----DPADVIELIYAHVKSGEIKPSRIESAYQRIIYLK 66 + S ++ NAG D D + I ++ VKSG + SRI+ A RI+ +K Sbjct: 315 NYTYS-VLTSVNAGIDMIMVPFDYQNFINILTGLVKSGAVSQSRIDDAVTRILRVK 369 >gi|325956120|ref|YP_004286730.1| beta-N-acetylhexosaminidase [Lactobacillus acidophilus 30SC] gi|325332685|gb|ADZ06593.1| beta-N-acetylhexosaminidase [Lactobacillus acidophilus 30SC] Length = 584 Score = 33.8 bits (76), Expect = 6.9, Method: Composition-based stats. Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 3/51 (5%) Query: 22 IAVYNAGADQ---QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 + AG D D A I I A VK GEI ++I ++ +RI+ +KNK+ Sbjct: 342 VLAVKAGNDMIMTTDYATGIPEIAAAVKKGEISKTQINNSVRRILNMKNKL 392 >gi|302804372|ref|XP_002983938.1| hypothetical protein SELMODRAFT_119324 [Selaginella moellendorffii] gi|300148290|gb|EFJ14950.1| hypothetical protein SELMODRAFT_119324 [Selaginella moellendorffii] Length = 601 Score = 33.8 bits (76), Expect = 7.0, Method: Composition-based stats. Identities = 18/56 (32%), Positives = 28/56 (50%), Gaps = 5/56 (8%) Query: 15 KWNLSRIIAVYNAGADQQ----DPADVIELIYAHVKSGEIKPSRIESAYQRIIYLK 66 + S ++ NAG D D + I ++ VKSG + SRI+ A RI+ +K Sbjct: 297 NYTYS-VLTSVNAGIDMIMVPFDYQNFINILTGLVKSGAVSQSRIDDAVTRILRVK 351 >gi|219848593|ref|YP_002463026.1| glycoside hydrolase family 3 domain-containing protein [Chloroflexus aggregans DSM 9485] gi|219542852|gb|ACL24590.1| glycoside hydrolase family 3 domain protein [Chloroflexus aggregans DSM 9485] Length = 619 Score = 33.8 bits (76), Expect = 7.2, Method: Composition-based stats. Identities = 16/53 (30%), Positives = 25/53 (47%), Gaps = 4/53 (7%) Query: 21 IIAVYNAGADQQ----DPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 ++ NAG D D IE + V +G + +RI+ A +RI+ K M Sbjct: 330 VVTAINAGIDMNMVPYDAVRFIETLTRAVNTGMVSETRIDDAVRRILTTKFAM 382 >gi|167461151|ref|ZP_02326240.1| Beta-glucosidase-related glycosidase [Paenibacillus larvae subsp. larvae BRL-230010] gi|322384898|ref|ZP_08058554.1| beta-hexosaminidase-like protein [Paenibacillus larvae subsp. larvae B-3650] gi|321150195|gb|EFX43702.1| beta-hexosaminidase-like protein [Paenibacillus larvae subsp. larvae B-3650] Length = 536 Score = 33.8 bits (76), Expect = 7.4, Method: Composition-based stats. Identities = 16/70 (22%), Positives = 28/70 (40%), Gaps = 8/70 (11%) Query: 9 LALIACKWNLSR-IIAVYNAGADQ-------QDPADVIELIYAHVKSGEIKPSRIESAYQ 60 + I + ++ + AG D I I ++SG + RI+ + Sbjct: 268 MKAIDDHYGVAEGAVKAIEAGVDLVLVSHTLTKQVAAIGAILQALESGRLTEERIDESVD 327 Query: 61 RIIYLKNKMK 70 RI+ LK K+K Sbjct: 328 RILRLKQKLK 337 >gi|323700659|ref|ZP_08112571.1| glycoside hydrolase family 3 domain protein [Desulfovibrio sp. ND132] gi|323460591|gb|EGB16456.1| glycoside hydrolase family 3 domain protein [Desulfovibrio desulfuricans ND132] Length = 380 Score = 33.8 bits (76), Expect = 7.5, Method: Composition-based stats. Identities = 19/73 (26%), Positives = 30/73 (41%), Gaps = 12/73 (16%) Query: 9 LALIACKWNLSRIIA-VYNAGADQ-----------QDPADVIELIYAHVKSGEIKPSRIE 56 + IA ++ + AGAD V LI + V G I +RIE Sbjct: 308 MGAIADEYGRREAVRRAIEAGADILLFGNNLSFDEHIVEKVHALIRSMVDDGTIPKARIE 367 Query: 57 SAYQRIIYLKNKM 69 +++ RI+ LK + Sbjct: 368 ASFARIMRLKRSL 380 >gi|157165607|ref|YP_001466722.1| glycoside hydrolase family 3 protein [Campylobacter concisus 13826] gi|112800169|gb|EAT97513.1| beta-hexosaminidase A (N-acetyl-beta-glucosaminidase) (Beta-N-acetylhexosaminidase) (Chitobiase) [Campylobacter concisus 13826] Length = 351 Score = 33.8 bits (76), Expect = 7.7, Method: Composition-based stats. Identities = 20/73 (27%), Positives = 36/73 (49%), Gaps = 11/73 (15%) Query: 8 LLALIACKWNLSRIIAVYNAGADQ----------QDPADVI-ELIYAHVKSGEIKPSRIE 56 L+ + + +++ NAG D Q AD+I ++I V +I RI+ Sbjct: 279 LMKGVGDEALAQKVVKFINAGGDILLFSEFKINNQRTADLITQIIIDAVNEKKISKERID 338 Query: 57 SAYQRIIYLKNKM 69 ++Y+RI+ LK K+ Sbjct: 339 ASYKRIMALKAKL 351 >gi|169827957|ref|YP_001698115.1| lipoprotein ybbD [Lysinibacillus sphaericus C3-41] gi|168992445|gb|ACA39985.1| Hypothetical lipoprotein ybbD precursor [Lysinibacillus sphaericus C3-41] Length = 566 Score = 33.8 bits (76), Expect = 8.0, Method: Composition-based stats. Identities = 19/70 (27%), Positives = 32/70 (45%), Gaps = 8/70 (11%) Query: 9 LALIACKWNLSRI-IAVYNAGADQ-------QDPADVIELIYAHVKSGEIKPSRIESAYQ 60 + I +N+ + + AG D + I+ + A VK+GEI +I + + Sbjct: 470 MKAITNHFNIGQAAVDSVKAGNDIILIAHEFANVTAAIDALKAAVKNGEITEQQINDSVR 529 Query: 61 RIIYLKNKMK 70 RII LK K + Sbjct: 530 RIIQLKEKYQ 539 >gi|317154368|ref|YP_004122416.1| glycoside hydrolase family 3 domain-containing protein [Desulfovibrio aespoeensis Aspo-2] gi|316944619|gb|ADU63670.1| glycoside hydrolase family 3 domain protein [Desulfovibrio aespoeensis Aspo-2] Length = 379 Score = 33.8 bits (76), Expect = 8.1, Method: Composition-based stats. Identities = 19/72 (26%), Positives = 28/72 (38%), Gaps = 12/72 (16%) Query: 9 LALIACKWNLSRII-AVYNAGADQ--------QDP---ADVIELIYAHVKSGEIKPSRIE 56 + I ++ + AGAD DP LI A V+ G I +RI Sbjct: 307 MKAITERYGRDEAVRLAIEAGADILLFGNNLTYDPDVVRQTHALIKAMVRDGVISQTRIR 366 Query: 57 SAYQRIIYLKNK 68 ++ RI+ LK Sbjct: 367 QSHDRIMRLKGS 378 >gi|254411786|ref|ZP_05025562.1| Glycosyl hydrolase family 3 N terminal domain protein [Microcoleus chthonoplastes PCC 7420] gi|196181508|gb|EDX76496.1| Glycosyl hydrolase family 3 N terminal domain protein [Microcoleus chthonoplastes PCC 7420] Length = 548 Score = 33.8 bits (76), Expect = 8.2, Method: Composition-based stats. Identities = 18/67 (26%), Positives = 31/67 (46%), Gaps = 5/67 (7%) Query: 8 LLALIACKWNLSRI-IAVYNAGADQQ----DPADVIELIYAHVKSGEIKPSRIESAYQRI 62 ++ IA ++ + + AGAD +P I+ I V +G I RI ++ +RI Sbjct: 279 IMGAIANRYGATEAPVKAVEAGADILLMPVNPETTIQAICEAVTAGRISRDRILASVERI 338 Query: 63 IYLKNKM 69 K K+ Sbjct: 339 WQAKAKI 345 >gi|115314432|ref|YP_763155.1| glycosyl hydrolase [Francisella tularensis subsp. holarctica OSU18] gi|156501943|ref|YP_001428008.1| glycosy hydrolase family protein [Francisella tularensis subsp. holarctica FTNF002-00] gi|290954606|ref|ZP_06559227.1| glycosy hydrolase family protein [Francisella tularensis subsp. holarctica URFT1] gi|295311949|ref|ZP_06802773.1| glycosy hydrolase family protein [Francisella tularensis subsp. holarctica URFT1] gi|115129331|gb|ABI82518.1| probable glycosyl hydrolase [Francisella tularensis subsp. holarctica OSU18] gi|156252546|gb|ABU61052.1| glycosyl hydrolase family 3 [Francisella tularensis subsp. holarctica FTNF002-00] Length = 347 Score = 33.8 bits (76), Expect = 8.6, Method: Composition-based stats. Identities = 16/59 (27%), Positives = 28/59 (47%), Gaps = 5/59 (8%) Query: 18 LSRIIAVYNAGADQ-----QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKMKT 71 L + NAG + +P +I+ I V+SGE+ + I+ +Y+ I+ K T Sbjct: 273 LESLKLAINAGVNIFIFSDGNPDTIIDNIAKLVESGEVAEATIKQSYENIVTYKQNYLT 331 >gi|224457774|ref|ZP_03666247.1| glycosy hydrolase family protein [Francisella tularensis subsp. tularensis MA00-2987] gi|254371223|ref|ZP_04987225.1| hypothetical protein [Francisella tularensis subsp. tularensis FSC033] gi|254875454|ref|ZP_05248164.1| glycosyl hydrolase [Francisella tularensis subsp. tularensis MA00-2987] gi|151569463|gb|EDN35117.1| hypothetical protein FTBG_00992 [Francisella tularensis subsp. tularensis FSC033] gi|254841453|gb|EET19889.1| glycosyl hydrolase [Francisella tularensis subsp. tularensis MA00-2987] gi|282159820|gb|ADA79211.1| glycosyl hydrolase family 3 [Francisella tularensis subsp. tularensis NE061598] Length = 347 Score = 33.8 bits (76), Expect = 8.7, Method: Composition-based stats. Identities = 16/59 (27%), Positives = 28/59 (47%), Gaps = 5/59 (8%) Query: 18 LSRIIAVYNAGADQ-----QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKMKT 71 L + NAG + +P +I+ I V+SGE+ + I+ +Y+ I+ K T Sbjct: 273 LESLKLAINAGVNIFIFSDGNPDTIIDNIAKLVESGEVAEATIKQSYENIVTYKQNYLT 331 >gi|167009145|ref|ZP_02274076.1| glycosyl hydrolase family 3 [Francisella tularensis subsp. holarctica FSC200] gi|254367306|ref|ZP_04983332.1| glycosyl hydrolase [Francisella tularensis subsp. holarctica 257] gi|134253122|gb|EBA52216.1| glycosyl hydrolase [Francisella tularensis subsp. holarctica 257] Length = 347 Score = 33.8 bits (76), Expect = 8.7, Method: Composition-based stats. Identities = 16/59 (27%), Positives = 28/59 (47%), Gaps = 5/59 (8%) Query: 18 LSRIIAVYNAGADQ-----QDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKMKT 71 L + NAG + +P +I+ I V+SGE+ + I+ +Y+ I+ K T Sbjct: 273 LESLKLAINAGVNIFIFSDGNPDTIIDNIAKLVESGEVAEATIKQSYENIVTYKQNYLT 331 >gi|253575841|ref|ZP_04853176.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14] gi|251844884|gb|EES72897.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14] Length = 544 Score = 33.8 bits (76), Expect = 8.8, Method: Composition-based stats. Identities = 18/68 (26%), Positives = 32/68 (47%), Gaps = 8/68 (11%) Query: 9 LALIACKWNLSR-IIAVYNAGADQ-------QDPADVIELIYAHVKSGEIKPSRIESAYQ 60 + IA + ++ + AGAD + IE + V+SG I +R++ + + Sbjct: 268 MQAIAGYYGIAEGAVQAIEAGADLVLVSHTLAEQRAAIERVAEAVRSGRISEARLDRSLE 327 Query: 61 RIIYLKNK 68 RI+ LK K Sbjct: 328 RILALKAK 335 >gi|110597288|ref|ZP_01385576.1| Beta-N-acetylhexosaminidase [Chlorobium ferrooxidans DSM 13031] gi|110341124|gb|EAT59592.1| Beta-N-acetylhexosaminidase [Chlorobium ferrooxidans DSM 13031] Length = 389 Score = 33.5 bits (75), Expect = 8.8, Method: Composition-based stats. Identities = 19/72 (26%), Positives = 33/72 (45%), Gaps = 12/72 (16%) Query: 9 LALIACKWNLSRII-AVYNAGADQ--------QDPA---DVIELIYAHVKSGEIKPSRIE 56 + IA ++ L I +AG D DP E+I + V+ + P RI+ Sbjct: 312 MKAIADRYGLEEAIRLAIDAGVDLLLFGNNTSWDPEIATKATEIIRSLVEKRVVTPRRID 371 Query: 57 SAYQRIIYLKNK 68 +Y+R++ LK + Sbjct: 372 LSYRRVMELKKQ 383 >gi|332883417|gb|EGK03700.1| hypothetical protein HMPREF9456_01767 [Dysgonomonas mossii DSM 22836] Length = 770 Score = 33.5 bits (75), Expect = 8.9, Method: Composition-based stats. Identities = 15/58 (25%), Positives = 28/58 (48%), Gaps = 5/58 (8%) Query: 17 NLSRIIAV-YNAGADQQ----DPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 ++ I NAG D + + +L+ V G++ SRI+ A R++ +K K+ Sbjct: 322 SIKEAIKAGINAGIDMSMIPYNYKEFCDLLTELVNEGQVPMSRIDDAATRVLTVKIKL 379 >gi|315498957|ref|YP_004087761.1| glycoside hydrolase family 3 domain protein [Asticcacaulis excentricus CB 48] gi|315416969|gb|ADU13610.1| glycoside hydrolase family 3 domain protein [Asticcacaulis excentricus CB 48] Length = 863 Score = 33.5 bits (75), Expect = 9.4, Method: Composition-based stats. Identities = 20/49 (40%), Positives = 25/49 (51%), Gaps = 4/49 (8%) Query: 24 VYNAGADQQDPADVIELIY----AHVKSGEIKPSRIESAYQRIIYLKNK 68 +NAG D D + IY A VKSGEI R+ A +RI+ K K Sbjct: 364 AFNAGIDMFMAPDSWKGIYENTLAQVKSGEISEDRLNDAVRRILRAKIK 412 >gi|197302730|ref|ZP_03167784.1| hypothetical protein RUMLAC_01460 [Ruminococcus lactaris ATCC 29176] gi|197298312|gb|EDY32858.1| hypothetical protein RUMLAC_01460 [Ruminococcus lactaris ATCC 29176] Length = 408 Score = 33.5 bits (75), Expect = 9.4, Method: Composition-based stats. Identities = 14/66 (21%), Positives = 34/66 (51%), Gaps = 5/66 (7%) Query: 9 LALIACKWNLSRI-IAVYNAGADQ----QDPADVIELIYAHVKSGEIKPSRIESAYQRII 63 + I+ ++ + + + V AG D ++ + + I V++G + RI+ + +RI+ Sbjct: 339 MGAISQHYSSAEVSVKVIEAGGDMLLMPENFQEAYQGILEAVQNGTLTEERIDESVRRIL 398 Query: 64 YLKNKM 69 +K K+ Sbjct: 399 KVKEKL 404 >gi|110636892|ref|YP_677099.1| b-N-acetylglucosaminidase [Cytophaga hutchinsonii ATCC 33406] gi|110279573|gb|ABG57759.1| b-N-acetylglucosaminidase, glycoside hydrolase family 3 protein [Cytophaga hutchinsonii ATCC 33406] Length = 395 Score = 33.5 bits (75), Expect = 9.4, Method: Composition-based stats. Identities = 17/42 (40%), Positives = 27/42 (64%) Query: 27 AGADQQDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNK 68 + +D+ +++ +I V SG+I SRI AY+RI+ LKNK Sbjct: 351 SASDRIKASEIHAIIKKLVLSGDIPESRINEAYERILALKNK 392 >gi|254418604|ref|ZP_05032328.1| Glycosyl hydrolase family 3 N terminal domain protein [Brevundimonas sp. BAL3] gi|196184781|gb|EDX79757.1| Glycosyl hydrolase family 3 N terminal domain protein [Brevundimonas sp. BAL3] Length = 627 Score = 33.5 bits (75), Expect = 9.7, Method: Composition-based stats. Identities = 18/50 (36%), Positives = 27/50 (54%), Gaps = 4/50 (8%) Query: 23 AVYNAGADQQDPADVIELIY----AHVKSGEIKPSRIESAYQRIIYLKNK 68 NAG D D + +Y A V+SGEI +R++ A +RI+ +K K Sbjct: 294 LAVNAGIDMLMAPDSWKPLYQNTLAQVRSGEIPTARLDEAVRRILRVKVK 343 >gi|118464303|ref|YP_884041.1| glycosyl hydrolase family protein 3 [Mycobacterium avium 104] gi|254777359|ref|ZP_05218875.1| glycosyl hydrolase family protein 3 [Mycobacterium avium subsp. avium ATCC 25291] gi|118165590|gb|ABK66487.1| Glycosyl hydrolase family protein 3 [Mycobacterium avium 104] Length = 388 Score = 33.5 bits (75), Expect = 9.7, Method: Composition-based stats. Identities = 16/68 (23%), Positives = 33/68 (48%), Gaps = 9/68 (13%) Query: 9 LALIACKWNLSRIIA-VYNAGADQ-------QDPADVIELIYAHVKSGEIKPSRIESAYQ 60 +A I+ ++ +S + AG D + PA V++ + V SGE+ R++ + Sbjct: 316 MAAISDRYGVSEAVLRSLLAGVDVALWVTTDEVPA-VLDRLQKAVASGELPAQRVDESLV 374 Query: 61 RIIYLKNK 68 R+ +K + Sbjct: 375 RVATMKGR 382 >gi|41409786|ref|NP_962622.1| LpqI [Mycobacterium avium subsp. paratuberculosis K-10] gi|41398618|gb|AAS06238.1| LpqI [Mycobacterium avium subsp. paratuberculosis K-10] Length = 388 Score = 33.5 bits (75), Expect = 9.7, Method: Composition-based stats. Identities = 16/68 (23%), Positives = 33/68 (48%), Gaps = 9/68 (13%) Query: 9 LALIACKWNLSRIIA-VYNAGADQ-------QDPADVIELIYAHVKSGEIKPSRIESAYQ 60 +A I+ ++ +S + AG D + PA V++ + V SGE+ R++ + Sbjct: 316 MAAISDRYGVSEAVLRSLLAGVDVALWVTTDEVPA-VLDRLQKAVASGELPAQRVDESLV 374 Query: 61 RIIYLKNK 68 R+ +K + Sbjct: 375 RVATMKGR 382 >gi|323358895|ref|YP_004225291.1| beta-glucosidase-related glycosidase [Microbacterium testaceum StLB037] gi|323275266|dbj|BAJ75411.1| beta-glucosidase-related glycosidase [Microbacterium testaceum StLB037] Length = 619 Score = 33.5 bits (75), Expect = 9.8, Method: Composition-based stats. Identities = 12/53 (22%), Positives = 32/53 (60%) Query: 17 NLSRIIAVYNAGADQQDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNKM 69 + R+ + +AG+DQ + +E++ V++G + RI+ + +R++ +K ++ Sbjct: 354 AVERMEKILDAGSDQFGGEECVEMLVDLVRAGRVSEERIDESVRRLLRVKFQL 406 >gi|224543194|ref|ZP_03683733.1| hypothetical protein CATMIT_02394 [Catenibacterium mitsuokai DSM 15897] gi|224523981|gb|EEF93086.1| hypothetical protein CATMIT_02394 [Catenibacterium mitsuokai DSM 15897] Length = 790 Score = 33.5 bits (75), Expect = 9.8, Method: Composition-based stats. Identities = 20/74 (27%), Positives = 33/74 (44%), Gaps = 14/74 (18%) Query: 9 LALIACKWNLSRIIA-VYNAGAD----------QQDPAD---VIELIYAHVKSGEIKPSR 54 + IA + S+ + AGAD Q+D +I+ + VK GEI SR Sbjct: 323 MKAIADTFGESQAVKLAIEAGADLICMPTVLYNQEDVKKLDTIIDYVEDAVKKGEISESR 382 Query: 55 IESAYQRIIYLKNK 68 ++ +RI+ +K Sbjct: 383 LDDGCRRILTVKEN 396 >gi|156048580|ref|XP_001590257.1| hypothetical protein SS1G_09021 [Sclerotinia sclerotiorum 1980] gi|154693418|gb|EDN93156.1| hypothetical protein SS1G_09021 [Sclerotinia sclerotiorum 1980 UF-70] Length = 553 Score = 33.5 bits (75), Expect = 9.8, Method: Composition-based stats. Identities = 17/38 (44%), Positives = 21/38 (55%) Query: 31 QQDPADVIELIYAHVKSGEIKPSRIESAYQRIIYLKNK 68 + IE + A VKSGEI IES+ R+I LK K Sbjct: 320 MKAQVGAIEAVIAAVKSGEISQEMIESSVNRVIRLKTK 357 >gi|303248130|ref|ZP_07334395.1| glycoside hydrolase family 3 domain protein [Desulfovibrio fructosovorans JJ] gi|302490529|gb|EFL50437.1| glycoside hydrolase family 3 domain protein [Desulfovibrio fructosovorans JJ] Length = 572 Score = 33.5 bits (75), Expect = 9.9, Method: Composition-based stats. Identities = 17/72 (23%), Positives = 32/72 (44%), Gaps = 12/72 (16%) Query: 9 LALIACKWNLSRI-IAVYNAGADQ-------QDPAD----VIELIYAHVKSGEIKPSRIE 56 + +A W + + NAGAD PA ++ + V+SG + R++ Sbjct: 302 MGAVADTWGTAEAAVLALNAGADILLVGADAGRPASERLLAMDAVVQAVRSGRVPVKRLD 361 Query: 57 SAYQRIIYLKNK 68 +A R++ LK + Sbjct: 362 AAVARVLRLKQR 373 >gi|77918922|ref|YP_356737.1| putative glycosyl hydrolase [Pelobacter carbinolicus DSM 2380] gi|77545005|gb|ABA88567.1| putative glycosyl hydrolase [Pelobacter carbinolicus DSM 2380] Length = 382 Score = 33.5 bits (75), Expect = 9.9, Method: Composition-based stats. Identities = 19/72 (26%), Positives = 33/72 (45%), Gaps = 9/72 (12%) Query: 9 LALIACKWNLSRII-AVYNAGADQQDPAD--------VIELIYAHVKSGEIKPSRIESAY 59 + IA ++ L + NAG D AD +I ++ + SG + RI A Sbjct: 305 MGAIADQYRLEDAVEKALNAGVDILLLADNSPDTTSRMIAIMQKLIDSGRVTRKRIVQAL 364 Query: 60 QRIIYLKNKMKT 71 +RI LK+ +++ Sbjct: 365 KRIDDLKSHLRS 376 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.325 0.137 0.373 Lambda K H 0.267 0.0413 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,183,665,958 Number of Sequences: 14124377 Number of extensions: 35418292 Number of successful extensions: 93854 Number of sequences better than 10.0: 279 Number of HSP's better than 10.0 without gapping: 94 Number of HSP's successfully gapped in prelim test: 211 Number of HSP's that attempted gapping in prelim test: 93745 Number of HSP's gapped (non-prelim): 308 length of query: 71 length of database: 4,842,793,630 effective HSP length: 43 effective length of query: 28 effective length of database: 4,235,445,419 effective search space: 118592471732 effective search space used: 118592471732 T: 11 A: 40 X1: 16 ( 7.5 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (22.1 bits) S2: 75 (33.5 bits)