BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy5277
         (314 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|432110769|gb|ELK34246.1| Non-lysosomal glucosylceramidase [Myotis davidii]
          Length = 868

 Score =  282 bits (722), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 139/292 (47%), Positives = 190/292 (65%), Gaps = 12/292 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 560 PDDEPWLRVNAYVIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMK 619

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q++FS+
Sbjct: 620 FDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAALCGTQDIQEKFSS 679

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT--EAIFPSAN 195
            L++  Q Y  LLW G YY +D S    S SIM+DQ  G  FL+ASG+      +FP+ +
Sbjct: 680 ILSRGQQAYERLLWNGRYYNYDCSSRPQSCSIMSDQCAGQWFLRASGLGEGETEVFPTQH 739

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
           + RAL TIF  NV+ F  G+MGAVNGM+P G  DRS++QS+EVW GV Y L+A M+ EG 
Sbjct: 740 VVRALQTIFEFNVQAFAGGTMGAVNGMQPQGVPDRSSVQSDEVWVGVVYGLAATMIQEGL 799

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
             E + TA G YRTV+ER GLGF+TPE     + +RS  YMR L+++AMQ A
Sbjct: 800 TWEGFQTAEGCYRTVWERLGLGFQTPEAYCQKRVFRSLAYMRPLSIWAMQLA 851


>gi|328709489|ref|XP_001943760.2| PREDICTED: non-lysosomal glucosylceramidase-like [Acyrthosiphon
           pisum]
          Length = 776

 Score =  280 bits (717), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 136/290 (46%), Positives = 188/290 (64%), Gaps = 11/290 (3%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           PEE+PF+L+N Y V +VS+W+DLNLKF++  YRDY +  D ++L  +YP    ++ K  S
Sbjct: 481 PEEDPFNLVNAYLVHNVSEWRDLNLKFVLQCYRDYTMFGDNKYLADMYPQIKNVMEKSLS 540

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           +D  G G+++N G  DQT          AYCGGL +AAL C V++S++L      +++S 
Sbjct: 541 WDTDGDGMIENSGTADQTFDTWIMTGISAYCGGLWLAALYCTVQISNILKQDDISEKYST 600

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIA 197
            L KA + ++  LW G Y+ FDSS   HS SIMADQLCG  +LK  GVN E + P   + 
Sbjct: 601 ILNKAKESFNAALWNGGYFNFDSSGQHHSKSIMADQLCGEWYLKCCGVN-EEVLPVDRVK 659

Query: 198 RALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
           +AL TI+  NV GF  G MGAVNGM P+G+ D  ++QSEEVWTGVTY LS+ M+  G  D
Sbjct: 660 KALHTIYKMNVLGFNDGKMGAVNGMMPDGNPDTFSLQSEEVWTGVTYALSSLMIAHGLRD 719

Query: 258 EAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
           E + TA G+Y TVY   G+ ++TPE +     YRS GYMR L+++++Q A
Sbjct: 720 EGFNTAKGIYNTVYNNIGMAYQTPEAIYSKNAYRSVGYMRPLSIWSIQTA 769


>gi|444729906|gb|ELW70309.1| Non-lysosomal glucosylceramidase [Tupaia chinensis]
          Length = 1001

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 138/290 (47%), Positives = 191/290 (65%), Gaps = 14/290 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 672 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMK 731

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q++F++
Sbjct: 732 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDVQEKFTS 791

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            LT+  + Y  LLW G YY +DSS    S SIM+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 792 ILTRGQEAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLKACGLGEGDTE-VFPTP 850

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 851 HVVRALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 910

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++AM
Sbjct: 911 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIWAM 960


>gi|291241331|ref|XP_002740565.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 792

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 135/291 (46%), Positives = 194/291 (66%), Gaps = 12/291 (4%)

Query: 29  EEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESF 88
           ++EP+  +N Y ++D +DWKDLNLKF++ +YRDY + KD  +L  ++P    +++   ++
Sbjct: 494 DDEPWVRVNAYLMYDTADWKDLNLKFVLQVYRDYYVTKDFIYLQDMWPKVKVVMQTSLAW 553

Query: 89  DKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAK 138
           D  G GI+ N G+ DQT          AYCGGL +AA+  M+E++ +L++  Q +++S  
Sbjct: 554 DTDGDGIIDNSGYADQTYDAWSVTGASAYCGGLWLAAVRAMMEIAKILDEEDQFKKYSDV 613

Query: 139 LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE--AIFPSANI 196
           L +    Y   LW G YY +DSS+ +HS+SIM DQ  G  +L+A G+  E   +FP  +I
Sbjct: 614 LIQGKSSYEKKLWNGKYYNYDSSKQKHSNSIMCDQAAGQWYLRACGLVQENDEVFPKDHI 673

Query: 197 ARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNV 256
             AL  I+  NV GFE G+MGAVNGM+P+G  D ++ QSEEVWTGVTY L+A M+ EG +
Sbjct: 674 LCALRAIYKMNVMGFEKGTMGAVNGMRPDGSPDTTSFQSEEVWTGVTYALAANMIQEGLL 733

Query: 257 DEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
           +EA+TTA G+Y+T YER GLGF+TPE    +  YRS GYMR LA++AMQ A
Sbjct: 734 EEAFTTASGVYKTCYERCGLGFQTPEAYFANNCYRSLGYMRPLAIWAMQYA 784


>gi|351707038|gb|EHB09957.1| Non-lysosomal glucosylceramidase [Heterocephalus glaber]
          Length = 922

 Score =  277 bits (709), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 140/300 (46%), Positives = 193/300 (64%), Gaps = 14/300 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 593 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMK 652

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A  V+M+ L      Q +FS+
Sbjct: 653 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVTVQMAVLCAAQDVQDKFSS 712

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 713 ILSRGQEAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTT 771

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 772 HVVRALQTIFELNVQSFAGGAMGAVNGMQPHGIPDRSSVQSDEVWVGVVYGLAATMIQEG 831

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 314
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++AMQ A  + K K
Sbjct: 832 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIWAMQLALQQQKHK 891


>gi|334333160|ref|XP_003341682.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal
           glucosylceramidase-like [Monodelphis domestica]
          Length = 950

 Score =  276 bits (706), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 135/293 (46%), Positives = 192/293 (65%), Gaps = 14/293 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  IN Y + D +DWKDLNLKF++ IYRD+ L  +  FL  ++P C  ++     
Sbjct: 603 PDDEPWLRINAYLIHDTADWKDLNLKFVLQIYRDFYLTGNESFLRDMWPVCQAVMESEMK 662

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+PDQT          AYCGGL +AA+A MV+M+++      Q++F +
Sbjct: 663 FDKDQDGLIENGGYPDQTYDGWITTGPSAYCGGLWLAAVAVMVQMATVCGTQDIQEKFLS 722

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S  IM+DQ  G  FL+A G+   +TE +FP+ 
Sbjct: 723 ILSRGREAYERLLWNGRYYNYDSSTQPQSRIIMSDQCAGQWFLRACGLGEGDTE-VFPTP 781

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+GF  G+MGAVNGM+P+G  D S++QS+EVW GV Y L+A M+ EG
Sbjct: 782 HVIRALQTIFEVNVQGFAGGAMGAVNGMQPDGIPDTSSVQSDEVWVGVVYGLAATMIQEG 841

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
            + E + TA G YRTV+ER G+ F+TPE     + +RS  YMR L+++AMQ A
Sbjct: 842 LIQEGFRTAEGCYRTVWERLGMAFQTPEAYCQHRVFRSLAYMRPLSIWAMQLA 894


>gi|156554493|ref|XP_001604835.1| PREDICTED: non-lysosomal glucosylceramidase-like [Nasonia
           vitripennis]
          Length = 826

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 133/291 (45%), Positives = 186/291 (63%), Gaps = 11/291 (3%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P EEPF LIN Y + DVS W+DLN KF++S YR Y  ++++  L   + T   ++    +
Sbjct: 512 PGEEPFDLINAYPIHDVSQWRDLNSKFVLSCYRIYFFNRNLEQLRDFWSTIKLVLEHSLT 571

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FD+   G+++NGGFPDQT          AYCGGL IAAL C V+M+ LL +++ + ++  
Sbjct: 572 FDEDNDGLIENGGFPDQTYDCWVMSGPSAYCGGLWIAALHCAVKMAELLGESEDEARYKG 631

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIA 197
            L +    + D LW G YY FD  +     SIM+DQLCG+  L+A G + E +FP   + 
Sbjct: 632 ILDRGKVAFQDKLWNGKYYNFDCGKDESKLSIMSDQLCGHWLLRACGFSYE-VFPQDRVR 690

Query: 198 RALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
            +L T+F  NV  ++ G  GAVNG  P G  D S IQSEE+WTGV Y L+A +++EG ++
Sbjct: 691 SSLETVFQNNVMKYKNGQQGAVNGFSPTGSIDYSCIQSEEMWTGVAYGLAALLIHEGMIE 750

Query: 258 EAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAY 308
           EA+ TA G+YRTVYE+ G+GFETPE L   K YR+ GYMR L+++AMQ A+
Sbjct: 751 EAFRTAEGVYRTVYEKIGMGFETPEALYEHKVYRAIGYMRPLSIWAMQHAW 801


>gi|156356077|ref|XP_001623757.1| predicted protein [Nematostella vectensis]
 gi|156210486|gb|EDO31657.1| predicted protein [Nematostella vectensis]
          Length = 783

 Score =  274 bits (701), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 130/300 (43%), Positives = 194/300 (64%), Gaps = 13/300 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P E P+  +N Y++ D S WKDLNLKF++ +YRDY    D+ +L  ++P    ++ K  +
Sbjct: 480 PSEAPWDHVNAYHIHDTSKWKDLNLKFVLQVYRDYVFTNDVYYLQDMWPITKTVMTKSMT 539

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           +D  G G+++N G  DQT          AYCGGL +AAL  M E++++L+   ++ ++  
Sbjct: 540 YDSDGDGLIENSGLADQTFDAWPVTGPSAYCGGLWLAALRVMAEIATILDFPDERGKYEK 599

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L +  + Y  LLW G YY +DSS S++ +SIMADQL G  +L A  +   + + +FPS 
Sbjct: 600 ILARGKKAYERLLWNGKYYNYDSSTSKYHNSIMADQLSGQWYLHACDLAQTSNDRVFPSE 659

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           N+  AL T+F+ NV  F+ G+MGAVNG++P+G  D S++Q+EEVWTGVTY ++A+M+ EG
Sbjct: 660 NVISALRTVFNFNVMKFQEGTMGAVNGIRPDGQLDTSSLQAEEVWTGVTYAVAASMIQEG 719

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 314
            VDE + TA G+Y T +ER G+ F+TPE +  +  YRS  YMR L+++AMQ A  K K K
Sbjct: 720 LVDEGFKTASGIYNTCFERLGMNFQTPEAIVANGNYRSLAYMRPLSIWAMQWALEKRKNK 779


>gi|343959578|dbj|BAK63646.1| bile acid beta-glucosidase [Pan troglodytes]
          Length = 514

 Score =  274 bits (701), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 185 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 244

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 245 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 304

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 305 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 363

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 364 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 423

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 424 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 471


>gi|194669510|ref|XP_001790122.1| PREDICTED: non-lysosomal glucosylceramidase [Bos taurus]
 gi|297477943|ref|XP_002689726.1| PREDICTED: non-lysosomal glucosylceramidase [Bos taurus]
 gi|296484715|tpg|DAA26830.1| TPA: hypothetical protein BOS_8918 [Bos taurus]
          Length = 918

 Score =  274 bits (700), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 134/289 (46%), Positives = 184/289 (63%), Gaps = 12/289 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D  DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 588 PDDEPWLRVNAYEIHDTGDWKDLNLKFVLQVYRDYYLTGDQCFLRDMWPVCLAVMESEMK 647

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L    + Q +FS+
Sbjct: 648 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQEAQNKFSS 707

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT--EAIFPSAN 195
            L++  + Y  LLW G YY +D S    S SIM+DQ  G  FL+ASG+      +FP+  
Sbjct: 708 ILSRGQEAYDRLLWNGRYYNYDCSSQPQSCSIMSDQCAGQWFLRASGLGEGDSEVFPTQQ 767

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
           + RAL TIF  NV+ F  G+MGAVNGM+P G  D S++QS+EVW GV Y L+A M+ EG 
Sbjct: 768 VVRALQTIFEFNVQAFAGGAMGAVNGMQPCGVPDTSSVQSDEVWVGVVYGLAATMIQEGL 827

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
             E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++AM
Sbjct: 828 TREGFRTAEGCYRTVWERLGLAFQTPEAYCQRRVFRSLAYMRPLSIWAM 876


>gi|395740303|ref|XP_003777396.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Pongo
           abelii]
          Length = 933

 Score =  274 bits (700), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 604 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 663

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 664 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 723

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 724 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTP 782

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 783 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 842

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 843 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890


>gi|385648282|ref|NP_001245314.1| glucosidase, beta (bile acid) 2 [Sus scrofa]
          Length = 915

 Score =  274 bits (700), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 135/288 (46%), Positives = 189/288 (65%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y V D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 590 PDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYYLTGDQVFLRDMWPVCLAVMESEMK 649

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L    + Q QFS+
Sbjct: 650 FDKDQDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGTREVQDQFSS 709

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +D S    + SIM+DQ  G  FL+ASG+   +TE +FP+ 
Sbjct: 710 ILSRGREAYERLLWNGRYYNYDCSSQPQAYSIMSDQCAGQWFLRASGLGEGDTE-VFPTP 768

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ +AL TIF  NV+ F  G+MGAVNGM+P+G  DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 769 HVVQALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYALAATMIQEG 828

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 829 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 876


>gi|194386992|dbj|BAG59862.1| unnamed protein product [Homo sapiens]
          Length = 755

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 426 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 485

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 486 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 545

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 546 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 604

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 605 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 664

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 665 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 712


>gi|297684086|ref|XP_002819683.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Pongo
           abelii]
          Length = 927

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 717

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 718 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTP 776

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 777 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 836

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 837 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|332228461|ref|XP_003263408.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal glucosylceramidase
           [Nomascus leucogenys]
          Length = 919

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 590 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 649

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 650 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 709

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 710 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTP 768

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 769 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 828

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 829 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 876


>gi|426361725|ref|XP_004048050.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Gorilla
           gorilla gorilla]
          Length = 927

 Score =  273 bits (699), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 717

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 718 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 776

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 777 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 836

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 837 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|14042883|dbj|BAB55430.1| unnamed protein product [Homo sapiens]
          Length = 927

 Score =  273 bits (699), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 717

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 718 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 776

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 777 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 836

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 837 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|332831849|ref|XP_001167923.2| PREDICTED: non-lysosomal glucosylceramidase isoform 4 [Pan
           troglodytes]
          Length = 933

 Score =  273 bits (699), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 604 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 663

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 664 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 723

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 724 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 782

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 783 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 842

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 843 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890


>gi|440901933|gb|ELR52793.1| Non-lysosomal glucosylceramidase, partial [Bos grunniens mutus]
          Length = 927

 Score =  273 bits (699), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 134/289 (46%), Positives = 184/289 (63%), Gaps = 12/289 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D  DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 597 PDDEPWLRVNAYEIHDTGDWKDLNLKFVLQVYRDYYLTGDQCFLRDMWPVCLAVMESEMK 656

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L    + Q +FS+
Sbjct: 657 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQEAQNKFSS 716

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT--EAIFPSAN 195
            L++  + Y  LLW G YY +D S    S SIM+DQ  G  FL+ASG+      +FP+  
Sbjct: 717 ILSRGQEAYDRLLWNGRYYNYDCSSQPQSCSIMSDQCAGQWFLRASGLGEGDSEVFPTQQ 776

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
           + RAL TIF  NV+ F  G+MGAVNGM+P G  D S++QS+EVW GV Y L+A M+ EG 
Sbjct: 777 VVRALQTIFEFNVQAFAGGAMGAVNGMQPRGVPDTSSVQSDEVWVGVVYGLAATMIQEGL 836

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
             E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++AM
Sbjct: 837 TWEGFRTAEGCYRTVWERLGLAFQTPEAYCQRRVFRSLAYMRPLSIWAM 885


>gi|24308251|ref|NP_065995.1| non-lysosomal glucosylceramidase [Homo sapiens]
 gi|143018392|sp|Q9HCG7.2|GBA2_HUMAN RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
           AltName: Full=Beta-glucocerebrosidase 2;
           Short=Beta-glucosidase 2; AltName:
           Full=Glucosylceramidase 2
 gi|15030210|gb|AAH11363.1| Glucosidase, beta (bile acid) 2 [Homo sapiens]
 gi|16215453|emb|CAC83792.1| bile acid beta-glucosidase [Homo sapiens]
 gi|21739906|emb|CAD38976.1| hypothetical protein [Homo sapiens]
 gi|119578752|gb|EAW58348.1| glucosidase, beta (bile acid) 2, isoform CRA_d [Homo sapiens]
 gi|119578753|gb|EAW58349.1| glucosidase, beta (bile acid) 2, isoform CRA_d [Homo sapiens]
          Length = 927

 Score =  273 bits (699), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 717

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 718 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 776

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 777 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 836

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 837 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|426220232|ref|XP_004004320.1| PREDICTED: non-lysosomal glucosylceramidase [Ovis aries]
          Length = 918

 Score =  273 bits (699), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 134/289 (46%), Positives = 184/289 (63%), Gaps = 12/289 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D  DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 588 PDDEPWLRVNAYEIHDTGDWKDLNLKFVLQVYRDYYLTGDQCFLRDMWPVCLAVMESEMK 647

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L    + Q +FS+
Sbjct: 648 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQEAQNKFSS 707

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT--EAIFPSAN 195
            L++  + Y  LLW G YY +D S    S SIM+DQ  G  FL+ASG+      +FP+  
Sbjct: 708 ILSRGQEAYDRLLWNGRYYNYDCSSQPQSYSIMSDQCAGQWFLRASGLGEGDSEVFPTQQ 767

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
           + RAL TIF  NV+ F  G+MGAVNGM+P G  D S++QS+EVW GV Y L+A M+ EG 
Sbjct: 768 VVRALQTIFEFNVQAFAGGAMGAVNGMQPRGVPDTSSVQSDEVWVGVVYGLAATMIQEGL 827

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
             E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++AM
Sbjct: 828 TWEGFRTAEGCYRTVWERLGLAFQTPEAYCQRRVFRSLAYMRPLSIWAM 876


>gi|221041110|dbj|BAH12232.1| unnamed protein product [Homo sapiens]
          Length = 933

 Score =  273 bits (699), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 604 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 663

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 664 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 723

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 724 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 782

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 783 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 842

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 843 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890


>gi|426361727|ref|XP_004048051.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Gorilla
           gorilla gorilla]
          Length = 933

 Score =  273 bits (699), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 604 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 663

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 664 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 723

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 724 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 782

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 783 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 842

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 843 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890


>gi|55633173|ref|XP_520565.1| PREDICTED: non-lysosomal glucosylceramidase isoform 6 [Pan
           troglodytes]
 gi|410227830|gb|JAA11134.1| talin 1 [Pan troglodytes]
          Length = 927

 Score =  273 bits (698), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 717

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 718 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 776

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 777 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 836

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 837 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|395515264|ref|XP_003761826.1| PREDICTED: non-lysosomal glucosylceramidase [Sarcophilus harrisii]
          Length = 954

 Score =  273 bits (698), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 137/293 (46%), Positives = 191/293 (65%), Gaps = 14/293 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  IN Y + D +DWKDLNLKF++ IYRD+ L  +  FL  ++P C  ++     
Sbjct: 607 PDDEPWLRINAYLIHDTADWKDLNLKFVLQIYRDFYLTGNESFLRDMWPVCQAVMESEMK 666

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M+++      Q++F +
Sbjct: 667 FDKDQDGLIENGGYADQTYDGWITTGPSAYCGGLWLAAVAVMVQMAAVCGAQDVQEKFLS 726

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S  IM+DQ  G  FL+A G+   +TE +FPSA
Sbjct: 727 ILSRGREAYERLLWNGRYYNYDSSSQPQSRIIMSDQCAGQWFLRACGLGKGDTE-VFPSA 785

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+GF  G+MGAVNGM P+G  D S++QS+EVW GV Y L+A M+ EG
Sbjct: 786 HVIRALQTIFELNVQGFAGGAMGAVNGMHPDGVPDTSSVQSDEVWVGVVYGLAATMIQEG 845

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
            V E + TA G YRTV+ER G+ F+TPE     + +RS  YMR L+++AMQ A
Sbjct: 846 LVQEGFLTAEGCYRTVWERLGMAFQTPEAYCQRRVFRSLAYMRPLSIWAMQLA 898


>gi|403306662|ref|XP_003943842.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Saimiri
           boliviensis boliviensis]
          Length = 923

 Score =  273 bits (698), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 133/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M+++      Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAAVCGAQDIQDKFSS 717

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 718 ILSRGQEAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTP 776

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 777 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 836

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 837 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|403306664|ref|XP_003943843.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Saimiri
           boliviensis boliviensis]
          Length = 929

 Score =  273 bits (698), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 133/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 604 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 663

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M+++      Q +FS+
Sbjct: 664 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAAVCGAQDIQDKFSS 723

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 724 ILSRGQEAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTP 782

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 783 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 842

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 843 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890


>gi|397519483|ref|XP_003829888.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Pan
           paniscus]
          Length = 755

 Score =  273 bits (698), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 133/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 426 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 485

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 486 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 545

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 546 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 604

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL T+F  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 605 HVVRALQTVFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 664

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 665 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 712


>gi|73971791|ref|XP_531991.2| PREDICTED: non-lysosomal glucosylceramidase [Canis lupus familiaris]
          Length = 1136

 Score =  273 bits (697), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 138/290 (47%), Positives = 187/290 (64%), Gaps = 14/290 (4%)

Query: 28   PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
            P++EP+  +N Y V D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 809  PDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMK 868

Query: 88   FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
            FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M+ L      Q +FS+
Sbjct: 869  FDKDQDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAILCGAHDVQDKFSS 928

Query: 138  KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
             L +  + Y  LLW G YY +D S    S SIM+DQ  G  FL+ASG+   +TE +FP+ 
Sbjct: 929  ILRRGQEAYERLLWNGRYYNYDCSPQPQSCSIMSDQCAGQWFLRASGLGEGDTE-VFPTR 987

Query: 195  NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
            ++  AL TIF  NVR F  G+MGAVNGM+P+G  DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 988  HVVCALQTIFEFNVRAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 1047

Query: 255  NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
               E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++AM
Sbjct: 1048 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIWAM 1097


>gi|397519481|ref|XP_003829887.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Pan
           paniscus]
          Length = 927

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 133/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 717

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 718 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 776

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL T+F  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 777 HVVRALQTVFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 836

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 837 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|410925034|ref|XP_003975986.1| PREDICTED: non-lysosomal glucosylceramidase-like [Takifugu
           rubripes]
          Length = 827

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 132/292 (45%), Positives = 184/292 (63%), Gaps = 12/292 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           PE+EP+  +N Y + + +DWKDLNLKF++ +YRD+ L +D ++L  ++P C  ++     
Sbjct: 526 PEDEPWQRVNAYLIHNTADWKDLNLKFVLQVYRDFHLTQDSQYLQDMWPICQTVMESALK 585

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FD  G G+++N GF DQT          AYCGGL +A+L  M +M+ LL +  + Q +  
Sbjct: 586 FDLDGDGLIENSGFADQTYDGWTVTGPSAYCGGLWLASLVVMCKMARLLENEMKYQYYRD 645

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEA--IFPSAN 195
            L + S  Y  +LW G YY +DSS   HS+S+M+DQ  G+ FL+ASG+       FP   
Sbjct: 646 ILDRGSVAYDKVLWNGKYYNYDSSGKDHSNSVMSDQCAGHWFLRASGLGDGEYQAFPKEK 705

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
           I RAL T+F  NV  F  G MGAVNGM+P G  DRS++QS+EVW GV Y L+A M++EG 
Sbjct: 706 IQRALKTVFDLNVMSFAGGQMGAVNGMRPEGVPDRSSVQSDEVWIGVVYGLAATMIHEGM 765

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
           ++E   TA G YRTV+ER G+ F+TPE       YRS  YMR L+++AMQ A
Sbjct: 766 LEEGMRTAEGCYRTVWERLGMAFQTPEAYCEKGIYRSLAYMRPLSIWAMQLA 817


>gi|354485725|ref|XP_003505033.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
           [Cricetulus griseus]
          Length = 898

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 135/288 (46%), Positives = 186/288 (64%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 569 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMK 628

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M+ L      Q +FS+
Sbjct: 629 FDKDQDGLIENGGYADQTYDGWVATGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSS 688

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L +  + Y  LLW G YY +DSS    S SIM+DQ  G  FL+A G+   +TE +FP+ 
Sbjct: 689 ILCRGREAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLRACGLGEGDTE-VFPTL 747

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 748 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 807

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 808 LTREGFRTAEGCYRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 855


>gi|390458158|ref|XP_002743170.2| PREDICTED: non-lysosomal glucosylceramidase [Callithrix jacchus]
          Length = 1065

 Score =  272 bits (696), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 133/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)

Query: 28   PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
            P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 740  PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 799

Query: 88   FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
            FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M+++      Q +FS+
Sbjct: 800  FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAAVCGAQDIQDKFSS 859

Query: 138  KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
             L++  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 860  ILSRGQEAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTP 918

Query: 195  NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
            ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 919  HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 978

Query: 255  NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
               E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 979  LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 1026


>gi|355689821|gb|AER98956.1| glucosidase, beta 2 [Mustela putorius furo]
          Length = 550

 Score =  272 bits (696), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 137/288 (47%), Positives = 185/288 (64%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y V D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 220 PDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYHLTGDQSFLRDMWPVCLAVMESEMK 279

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M+ L      Q +FS+
Sbjct: 280 FDKDQDGLIENGGYADQTYDGWITTGPSAYCGGLWLAAVAVMVQMAVLCGAKDIQDKFSS 339

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L +  + Y  LLW G YY +D S    S SIM+DQ  G  FLKASG+   +TE +FP+ 
Sbjct: 340 ILRRGQEAYERLLWNGRYYNYDCSPQPQSCSIMSDQCAGQWFLKASGLGEGDTE-VFPTQ 398

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++  AL TIF  NVR F  G+MGAVNGM+P+G  DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 399 HVVCALQTIFEFNVRAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 458

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 459 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 506


>gi|109111050|ref|XP_001084705.1| PREDICTED: non-lysosomal glucosylceramidase [Macaca mulatta]
          Length = 927

 Score =  272 bits (696), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 132/287 (45%), Positives = 184/287 (64%), Gaps = 12/287 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 717

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT--EAIFPSAN 195
            L +  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+      +FP+ +
Sbjct: 718 ILHRGQEAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGETEVFPTPH 777

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
           + RAL TIF  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ EG 
Sbjct: 778 VVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGL 837

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
             E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 838 TWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|355567647|gb|EHH23988.1| Non-lysosomal glucosylceramidase [Macaca mulatta]
 gi|355753217|gb|EHH57263.1| Non-lysosomal glucosylceramidase [Macaca fascicularis]
 gi|380789499|gb|AFE66625.1| non-lysosomal glucosylceramidase [Macaca mulatta]
 gi|384943046|gb|AFI35128.1| non-lysosomal glucosylceramidase [Macaca mulatta]
          Length = 927

 Score =  272 bits (696), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 132/287 (45%), Positives = 184/287 (64%), Gaps = 12/287 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 717

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT--EAIFPSAN 195
            L +  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+      +FP+ +
Sbjct: 718 ILHRGQEAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGETEVFPTPH 777

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
           + RAL TIF  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ EG 
Sbjct: 778 VVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGL 837

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
             E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 838 TWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|148670516|gb|EDL02463.1| glucosidase beta 2, isoform CRA_c [Mus musculus]
          Length = 467

 Score =  271 bits (694), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 135/288 (46%), Positives = 186/288 (64%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ IYRDY L  D  FL  ++P CL ++     
Sbjct: 138 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMK 197

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M+ L      Q++F++
Sbjct: 198 FDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFAS 257

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L +  + Y  LLW G YY +DSS    S SIM+DQ  G  FL+A G+   +TE +FP+ 
Sbjct: 258 ILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTE-VFPTL 316

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM P+G  DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 317 HVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 376

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 377 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 424


>gi|395855828|ref|XP_003800351.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal glucosylceramidase
            [Otolemur garnettii]
          Length = 1147

 Score =  271 bits (694), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 135/288 (46%), Positives = 187/288 (64%), Gaps = 14/288 (4%)

Query: 28   PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
            P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 818  PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMK 877

Query: 88   FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
            FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 878  FDKDHDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDLQDKFSS 937

Query: 138  KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
             L +  + Y  LLW G YY +DSS    S SIM+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 938  ILRQGQEAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLKACGLGEGDTE-VFPTQ 996

Query: 195  NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
            ++ RAL TI+  NV+ F  G+MGAVNGM+P+G  DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 997  HVVRALQTIYELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 1056

Query: 255  NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
               E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 1057 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 1104


>gi|189217893|ref|NP_001013109.2| non-lysosomal glucosylceramidase [Rattus norvegicus]
 gi|149045757|gb|EDL98757.1| glucosidase beta 2 [Rattus norvegicus]
          Length = 904

 Score =  271 bits (694), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 134/288 (46%), Positives = 186/288 (64%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 589 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMK 648

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M+ L      Q +FS+
Sbjct: 649 FDKDQDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSS 708

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L +  + Y  LLW G YY +DSS    S S+M+DQ  G  FL+A G+   +TE +FP+ 
Sbjct: 709 ILCRGREAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLRACGLGEGDTE-VFPTL 767

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 768 HVVRALKTIFELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 827

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 828 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 875


>gi|338720173|ref|XP_001497796.3| PREDICTED: non-lysosomal glucosylceramidase [Equus caballus]
          Length = 983

 Score =  271 bits (693), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 135/288 (46%), Positives = 187/288 (64%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ IYRDY L  D  FL  ++P CL ++     
Sbjct: 654 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLRDMWPVCLAVMESEMK 713

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 714 FDKDQDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDVQDRFSS 773

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +D S    S SIM+DQ  G  FL+A G+   +TE +FP+ 
Sbjct: 774 ILSRGQEAYERLLWNGRYYNYDCSSQPQSRSIMSDQCAGQWFLRACGLGEGDTE-VFPTR 832

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           +I RAL TIF  NV+ F  G+MGAVNGM+P+G  DR+++QS+EVW GV Y L+A M+ EG
Sbjct: 833 HIVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDRTSVQSDEVWVGVVYGLAATMIQEG 892

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 893 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 940


>gi|344271640|ref|XP_003407645.1| PREDICTED: non-lysosomal glucosylceramidase [Loxodonta africana]
          Length = 967

 Score =  271 bits (693), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 133/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 642 PDDEPWLRVNAYVIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEME 701

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q++FS+
Sbjct: 702 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMATLCEAQDIQEKFSS 761

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S SIM+DQ  G  FL+A G+   +TE +FP+ 
Sbjct: 762 ILSRGKEAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLRACGLGEGDTE-VFPTP 820

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  D S++QS+EVW GV Y L+A M+ EG
Sbjct: 821 HVVRALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDTSSVQSDEVWVGVVYGLAATMIQEG 880

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              + + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 881 LTSKGFWTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 928


>gi|402897018|ref|XP_003911574.1| PREDICTED: non-lysosomal glucosylceramidase [Papio anubis]
          Length = 927

 Score =  271 bits (692), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 131/287 (45%), Positives = 183/287 (63%), Gaps = 12/287 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT           YCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSVYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 717

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT--EAIFPSAN 195
            L +  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+      +FP+ +
Sbjct: 718 ILHRGQEAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGETEVFPTPH 777

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
           + RAL TIF  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ EG 
Sbjct: 778 VVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGL 837

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
             E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 838 TWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|143018484|sp|Q5M868.2|GBA2_RAT RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
           AltName: Full=Beta-glucocerebrosidase 2;
           Short=Beta-glucosidase 2; AltName:
           Full=Glucosylceramidase 2
          Length = 912

 Score =  271 bits (692), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 134/288 (46%), Positives = 186/288 (64%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 589 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMK 648

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M+ L      Q +FS+
Sbjct: 649 FDKDQDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSS 708

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L +  + Y  LLW G YY +DSS    S S+M+DQ  G  FL+A G+   +TE +FP+ 
Sbjct: 709 ILCRGREAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLRACGLGEGDTE-VFPTL 767

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 768 HVVRALKTIFELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 827

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 828 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 875


>gi|50511011|dbj|BAD32491.1| mKIAA1605 protein [Mus musculus]
          Length = 940

 Score =  271 bits (692), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 135/288 (46%), Positives = 186/288 (64%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ IYRDY L  D  FL  ++P CL ++     
Sbjct: 611 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMK 670

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M+ L      Q++F++
Sbjct: 671 FDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFAS 730

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L +  + Y  LLW G YY +DSS    S SIM+DQ  G  FL+A G+   +TE +FP+ 
Sbjct: 731 ILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTE-VFPTL 789

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM P+G  DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 790 HVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 849

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 850 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 897


>gi|26354314|dbj|BAC40785.1| unnamed protein product [Mus musculus]
          Length = 918

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 135/288 (46%), Positives = 186/288 (64%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ IYRDY L  D  FL  ++P CL ++     
Sbjct: 589 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMK 648

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M+ L      Q++F++
Sbjct: 649 FDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFAS 708

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L +  + Y  LLW G YY +DSS    S SIM+DQ  G  FL+A G+   +TE +FP+ 
Sbjct: 709 ILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTE-VFPTL 767

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM P+G  DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 768 HVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 827

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 828 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 875


>gi|148670514|gb|EDL02461.1| glucosidase beta 2, isoform CRA_a [Mus musculus]
          Length = 918

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 135/288 (46%), Positives = 186/288 (64%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ IYRDY L  D  FL  ++P CL ++     
Sbjct: 589 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMK 648

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M+ L      Q++F++
Sbjct: 649 FDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFAS 708

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L +  + Y  LLW G YY +DSS    S SIM+DQ  G  FL+A G+   +TE +FP+ 
Sbjct: 709 ILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTE-VFPTL 767

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM P+G  DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 768 HVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 827

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 828 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 875


>gi|240120073|ref|NP_766280.2| non-lysosomal glucosylceramidase [Mus musculus]
 gi|143018443|sp|Q69ZF3.2|GBA2_MOUSE RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
           AltName: Full=Beta-glucocerebrosidase 2;
           Short=Beta-glucosidase 2; AltName:
           Full=Glucosylceramidase 2
 gi|34784135|gb|AAH56935.1| Glucosidase beta 2 [Mus musculus]
          Length = 918

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 135/288 (46%), Positives = 186/288 (64%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ IYRDY L  D  FL  ++P CL ++     
Sbjct: 589 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMK 648

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M+ L      Q++F++
Sbjct: 649 FDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFAS 708

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L +  + Y  LLW G YY +DSS    S SIM+DQ  G  FL+A G+   +TE +FP+ 
Sbjct: 709 ILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTE-VFPTL 767

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM P+G  DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 768 HVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 827

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 828 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 875


>gi|410978559|ref|XP_003995657.1| PREDICTED: non-lysosomal glucosylceramidase [Felis catus]
          Length = 917

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 136/288 (47%), Positives = 185/288 (64%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y V D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 590 PDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMK 649

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M+ L      Q +FS+
Sbjct: 650 FDKDQDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSS 709

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L +  + Y  LLW G YY +D S    S SIM+DQ  G  FL+ASG+   +TE +FP+ 
Sbjct: 710 ILRRGQEAYERLLWNGRYYNYDCSPPPQSCSIMSDQCAGQWFLRASGLGEGDTE-VFPTP 768

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++  AL TIF  NVR F  G+MGAVNGM+P+G  DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 769 HVVCALQTIFEFNVRAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 828

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 829 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 876


>gi|281339788|gb|EFB15372.1| hypothetical protein PANDA_007478 [Ailuropoda melanoleuca]
          Length = 923

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 135/288 (46%), Positives = 185/288 (64%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y V D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 598 PDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYYLTGDQSFLRDMWPVCLAVMESEMK 657

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+++ L      Q +FS+
Sbjct: 658 FDKDQDGLIENGGYADQTYDGWITTGPSAYCGGLWLAAVAVMVQIAVLCGAQDVQDKFSS 717

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L +  + Y  LLW G YY +D S    S SIM+DQ  G  FLKASG+   +TE +FP+ 
Sbjct: 718 ILRRGREAYERLLWNGRYYNYDCSPQPQSCSIMSDQCAGQWFLKASGLGEGDTE-VFPTP 776

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++  AL TIF  NV+ F  G+MGAVNGM+P+G  DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 777 HVVCALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 836

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 837 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|301767028|ref|XP_002918945.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
           [Ailuropoda melanoleuca]
          Length = 928

 Score =  268 bits (685), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 135/288 (46%), Positives = 185/288 (64%), Gaps = 14/288 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y V D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 601 PDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYYLTGDQSFLRDMWPVCLAVMESEMK 660

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+++ L      Q +FS+
Sbjct: 661 FDKDQDGLIENGGYADQTYDGWITTGPSAYCGGLWLAAVAVMVQIAVLCGAQDVQDKFSS 720

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L +  + Y  LLW G YY +D S    S SIM+DQ  G  FLKASG+   +TE +FP+ 
Sbjct: 721 ILRRGREAYERLLWNGRYYNYDCSPQPQSCSIMSDQCAGQWFLKASGLGEGDTE-VFPTP 779

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++  AL TIF  NV+ F  G+MGAVNGM+P+G  DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 780 HVVCALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 839

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++
Sbjct: 840 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 887


>gi|405977906|gb|EKC42333.1| Non-lysosomal glucosylceramidase [Crassostrea gigas]
          Length = 1464

 Score =  267 bits (682), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 138/315 (43%), Positives = 186/315 (59%), Gaps = 12/315 (3%)

Query: 6    TMGEVKFRSAANTHERYYTRWFPEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH 65
            T GEV  R     H   +    PE+EP++ +N Y +    +WKDLN+KF++  YRDY+  
Sbjct: 1144 TSGEVGIRK--RLHAVPHDMGDPEDEPWNRVNCYVIHPTCNWKDLNMKFVLQTYRDYSAT 1201

Query: 66   KDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAA 115
            KD  +L  +YP    ++   + +D    GI+ NGGF DQT          AYCGG+ +AA
Sbjct: 1202 KDDAYLRHMYPVAKYVMETAKKWDVDDDGIIDNGGFADQTFDAWTMTGASAYCGGMWLAA 1261

Query: 116  LACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLC 175
            L  M+EM+++L        +   L K    Y + LW GSYY +D S   H DSIMADQL 
Sbjct: 1262 LRMMIEMATILKKEDDVLHYKEILDKGKVSYQNKLWNGSYYNYDCSSGGHHDSIMADQLA 1321

Query: 176  GYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQS 235
            G+ FLKASG+  + IFP   +  +L  IF  NV  FE G+MGA+NG +P+G +D S+ QS
Sbjct: 1322 GHWFLKASGLQDDDIFPPDRVKSSLKKIFENNVMMFEGGNMGAINGTRPDGSKDISSCQS 1381

Query: 236  EEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGY 295
            EE W GVTY L+A M+ EG +D+A+ TA G Y   +E  GL F+TPE    D  YRS GY
Sbjct: 1382 EEFWVGVTYGLAANMIQEGLLDQAFQTAWGAYHVCWEWYGLAFQTPEAYMTDNIYRSLGY 1441

Query: 296  MRALAVYAMQDAYLK 310
            MR LA+++MQ A  K
Sbjct: 1442 MRPLAIWSMQWALEK 1456


>gi|125821356|ref|XP_687652.2| PREDICTED: non-lysosomal glucosylceramidase [Danio rerio]
          Length = 851

 Score =  266 bits (680), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 129/292 (44%), Positives = 183/292 (62%), Gaps = 12/292 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L +D ++L  ++P C  ++     
Sbjct: 546 PDDEPWVRVNAYLIHDTADWKDLNLKFVLQVYRDYHLTQDQQYLKDMWPVCQTVMENELK 605

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK G G+++N G+ DQT          AYCGGL +A++  M +M+ +LN     Q++  
Sbjct: 606 FDKDGDGLIENSGYADQTYDGWKVTGPSAYCGGLWLASVCMMCKMARVLNCESVYQRYRD 665

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEA--IFPSAN 195
            L + S  +  LLW G YY +DSS    S+S+M+DQ  G+ FL+ASG+  +    FP   
Sbjct: 666 ILERGSAAFDKLLWNGKYYNYDSSGRSLSNSVMSDQCAGHWFLRASGLGDDEYQAFPKEK 725

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
           I  AL ++F  NV  F  G MGAVNGM+P G  DRS++QS+EVW GV Y L+A M++EG 
Sbjct: 726 ICSALKSVFDLNVMSFAGGQMGAVNGMRPEGVPDRSSVQSDEVWVGVVYGLAATMIHEGM 785

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
           V+E   TA G YR V+ER G+ F+TPE       YRS  YMR L+++AMQ A
Sbjct: 786 VEEGLRTAEGCYRAVWERMGMAFQTPEAYCEKGIYRSLAYMRPLSIWAMQLA 837


>gi|443710865|gb|ELU04898.1| hypothetical protein CAPTEDRAFT_216398 [Capitella teleta]
          Length = 449

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 129/296 (43%), Positives = 180/296 (60%), Gaps = 14/296 (4%)

Query: 29  EEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESF 88
           + EP+S +N Y +   ++WKDLN KF++ +YRDY   KD ++L  +YP C  ++     +
Sbjct: 143 DNEPWSRLNAYIIHPTNEWKDLNPKFVLQVYRDYVFTKDEQYLYEMYPQCKAVMDHSLRW 202

Query: 89  DKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAK 138
           D    GI+ N GF DQT          AYCGGL +AA+    EM+  L   +   ++   
Sbjct: 203 DVDRDGIIDNSGFADQTYDAWTVTGASAYCGGLWLAAVKAFTEMAHRLGVVEDLMKYEEI 262

Query: 139 LTKASQVYHDLLWTG----SYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSA 194
           L +  + + + LW G     YY +D S+S H DSIMADQ+CG  FLKA GV   A+FPSA
Sbjct: 263 LERGKKNFEEKLWNGWVDCEYYNYDCSESGHHDSIMADQMCGQWFLKACGVADNAVFPSA 322

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           N+ RAL  +F  NV  F+ G MGA+NGM+PN  +D ++ QS+E WTGVTY L A M+  G
Sbjct: 323 NVKRALEAVFKHNVLPFDGGRMGAINGMRPNAKKDVTSCQSDEFWTGVTYALGATMIQVG 382

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 310
            +D+ + TA G Y T +ER GL F+TPE    ++ +RS GYMR LA++A+Q A  K
Sbjct: 383 MIDKGFQTAYGAYHTCWERYGLAFQTPEAYFDNRRFRSLGYMRPLAIWAIQHAVEK 438


>gi|169642399|gb|AAI60640.1| LOC100145361 protein [Xenopus (Silurana) tropicalis]
          Length = 571

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 124/290 (42%), Positives = 181/290 (62%), Gaps = 12/290 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D ++WKDLNLKF++ +YRDY + KD  +L  ++  C  ++     
Sbjct: 263 PDDEPWQKLNAYLIHDTANWKDLNLKFVLQVYRDYHITKDTTYLRDMWSVCQTVMETSLK 322

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FD+ G G+++N GF DQT          +YCGGL +AA+  M +M+ +L D   Q++F+ 
Sbjct: 323 FDEDGDGLIENSGFADQTYDDWVMTGPSSYCGGLWLAAVCMMCKMAEVLGDGAAQEKFTD 382

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT--EAIFPSAN 195
            L +    +  LLW G YY FD     +S+S+M DQ  G  FL+A G+      +FP  +
Sbjct: 383 ILHRGKAAFEKLLWNGKYYNFDCGDQPYSNSVMVDQCAGNWFLRACGLGAGESEVFPKDH 442

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
           +  AL TIF  NV+ F  G MGAVNGM+P+G  D S++QS+EVW GV Y L+A M++EG 
Sbjct: 443 VVSALRTIFELNVKQFADGQMGAVNGMRPDGTTDTSSVQSDEVWIGVVYGLAATMIHEGL 502

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           V E + TA G YRTV+ER G+ F+TPE     + +RS  YMR L+++AMQ
Sbjct: 503 VQEGFATAEGCYRTVWERLGMSFQTPEAYCEKQVFRSLAYMRPLSIWAMQ 552


>gi|443709232|gb|ELU03988.1| hypothetical protein CAPTEDRAFT_186839 [Capitella teleta]
          Length = 802

 Score =  263 bits (672), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 129/296 (43%), Positives = 180/296 (60%), Gaps = 14/296 (4%)

Query: 29  EEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESF 88
           + EP+S +N Y +   ++WKDLN KF++ +YRDY   KD ++L  +YP C  ++     +
Sbjct: 496 DNEPWSRLNAYIIHPTNEWKDLNPKFVLQVYRDYVFTKDEQYLYEMYPQCKAVMDHSLRW 555

Query: 89  DKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAK 138
           D    GI+ N GF DQT          AYCGGL +AA+    EM+  L   +   ++   
Sbjct: 556 DVDRDGIIDNSGFADQTYDAWTVTGASAYCGGLWLAAVKAFTEMAHRLGVVEDLMKYEEI 615

Query: 139 LTKASQVYHDLLWTG----SYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSA 194
           L +  + + + LW G     YY +D S+S H DSIMADQ+CG  FLKA GV   A+FPSA
Sbjct: 616 LERGKKNFEEKLWNGWVDCEYYNYDCSESGHHDSIMADQMCGQWFLKACGVADNAVFPSA 675

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           N+ RAL  +F  NV  F+ G MGA+NGM+PN  +D ++ QS+E WTGVTY L A M+  G
Sbjct: 676 NVKRALEAVFKHNVLPFDGGRMGAINGMRPNAKKDVTSCQSDEFWTGVTYALGATMIQVG 735

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 310
            +D+ + TA G Y T +ER GL F+TPE    ++ +RS GYMR LA++A+Q A  K
Sbjct: 736 MIDKGFQTAYGAYHTCWERYGLAFQTPEAYFDNRRFRSLGYMRPLAIWAIQHAVEK 791


>gi|198424051|ref|XP_002127036.1| PREDICTED: similar to Non-lysosomal glucosylceramidase (NLGase)
           (Glucosylceramidase 2) (Beta-glucocerebrosidase 2)
           (Beta-glucosidase 2) [Ciona intestinalis]
          Length = 861

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 127/291 (43%), Positives = 185/291 (63%), Gaps = 13/291 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           PE+EP+  +N Y V D +DW+DLN KF++  YRDY + KDI FL  ++P C  ++ +   
Sbjct: 567 PEDEPWLRVNTYFVHDTADWRDLNPKFVLQAYRDYYITKDIDFLKAMWPICKIVMEQSMR 626

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
            DK   G+++N G  DQT          AYCGGL +AAL CM E + +L++   Q +F  
Sbjct: 627 HDKDNDGLIENSGAADQTFDGWCVTGPSAYCGGLWLAALRCMEEAADILHEKDDQVKFHD 686

Query: 138 KLTKASQVYHDLLWTGSYYKFD-SSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANI 196
            L +A++ Y D LW G Y+ +D S+  +H  S+MADQ  G  FL ASG+    + P   +
Sbjct: 687 ILVRATKAYQDRLWNGKYFNYDCSNNPQHRTSVMADQCAGQWFLLASGICD--VLPDDQV 744

Query: 197 ARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNV 256
             +L TIFS NV  ++ G  GAVNGM+P+G  D S++QS+EVWTGVTY L+A M+++G +
Sbjct: 745 MSSLKTIFSLNVEQYDGGRHGAVNGMRPHGAVDTSSVQSDEVWTGVTYALAATMIHKGML 804

Query: 257 DEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
            E + TA G+YR+ ++R G+ F+TPE +  + TYRS  YMR L+++ MQ A
Sbjct: 805 KEGFRTASGIYRSCWQRYGMAFQTPEAMRKNHTYRSLAYMRPLSIWGMQYA 855


>gi|432920219|ref|XP_004079895.1| PREDICTED: non-lysosomal glucosylceramidase-like [Oryzias latipes]
          Length = 858

 Score =  261 bits (668), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 127/292 (43%), Positives = 179/292 (61%), Gaps = 12/292 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D + WKDLNLKF++ +YRD+ + +D ++L  ++P C  ++     
Sbjct: 547 PDDEPWHRLNAYLIHDTAGWKDLNLKFVLQVYRDFHITQDNQYLKDMWPICQAVMESEIK 606

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FD  G G+++N G+ DQT          AYCGGL +A+L  M +M+ L++  +  Q +  
Sbjct: 607 FDLDGDGLIENSGYADQTYDGWTVTGPSAYCGGLWLASLCVMCKMARLVDTERTYQHYKD 666

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE--AIFPSAN 195
            L + S  +  LLW G YY +DSS   HS+S+M+DQ  G+ FLKAS    E    FP   
Sbjct: 667 ILDRGSAAFDKLLWNGKYYNYDSSGRYHSNSVMSDQCAGHWFLKASACGEEDYQAFPKEK 726

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
           I   L +IF  NV  F  G MGAVNGM+P G  DRS++QS+EVW GV Y L+A M++EG 
Sbjct: 727 IQSTLKSIFDLNVMAFAGGQMGAVNGMRPEGVPDRSSVQSDEVWIGVVYGLAATMIHEGM 786

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
            +E   TA G YRTV+ER G+ F+TPE       YRS  YMR L+++AMQ A
Sbjct: 787 REEGMHTAEGCYRTVWERLGMAFQTPEAYCEKNIYRSLAYMRPLSIWAMQLA 838


>gi|242015137|ref|XP_002428230.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212512791|gb|EEB15492.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 812

 Score =  261 bits (668), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 128/291 (43%), Positives = 180/291 (61%), Gaps = 12/291 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P EEPF  IN Y + DVS+W+DL+LKF++  YRDY L+ D+  L  ++ +   L+     
Sbjct: 518 PFEEPFLKINSYPIHDVSEWRDLDLKFVLQSYRDYFLNNDLNQLKYLWKSVCVLMNNALR 577

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           +DK G G+++N G PDQT          AYCG L +AAL C  +M+ +L+D     +FS 
Sbjct: 578 WDKDGDGLIENVGEPDQTYDAWTMEGPSAYCGSLWLAALQCTCKMALILSDVDTYNKFSE 637

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIA 197
            L  A Q +   LW G YY FD S+ +   +IM+DQLCG  +L   G+  E + P A + 
Sbjct: 638 TLKMAQQSFDKKLWNGKYYNFDCSE-KWGTTIMSDQLCGLWYLYCCGIKNE-VLPPAKVN 695

Query: 198 RALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
            +L  I+  NV     G++G VNGM PNG+ D   +QSEE+W GV Y L++ M++EG V+
Sbjct: 696 SSLRCIYENNVLKLGKGNLGCVNGMLPNGEIDIHTLQSEEIWVGVVYALASLMIFEGMVE 755

Query: 258 EAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAY 308
           E W TA G+Y TVY+R G+GFETPE L   K YR+ GYMR L++++MQ A+
Sbjct: 756 EGWKTARGIYETVYDRIGMGFETPEALRESKKYRAIGYMRPLSIWSMQLAW 806


>gi|224090236|ref|XP_002190896.1| PREDICTED: non-lysosomal glucosylceramidase [Taeniopygia guttata]
          Length = 826

 Score =  258 bits (659), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 133/292 (45%), Positives = 184/292 (63%), Gaps = 12/292 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P +EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  +L  ++P C  ++     
Sbjct: 516 PGDEPWQRVNAYLMHDTADWKDLNLKFVLQVYRDYYLTHDSLYLQDMWPVCQAVMESELK 575

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FD    G+++NGGF DQT          AYCGGL +AA+  M +M+ +L DT+ +Q++  
Sbjct: 576 FDTDNDGLIENGGFADQTYDAWVVIRRCAYCGGLWLAAVCMMCKMAEVLGDTEIRQKYLG 635

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEA--IFPSAN 195
            L K  + +  +LW G YY +DSS S  S SIM+DQ  G  FL A G++     +FP ++
Sbjct: 636 ILNKGKEAFERMLWNGKYYNYDSSGSDTSSSIMSDQCAGQWFLGACGLDQGEFEVFPKSH 695

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
           I  AL TIF  NV  F  G+MGAVNGM+P+G  D S++QS EVW GV Y L+A M+ EG 
Sbjct: 696 ILSALRTIFEKNVLSFAGGTMGAVNGMRPDGVPDTSSVQSNEVWVGVVYSLAATMIQEGM 755

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
           V+E + TA G YRTV+ER G+ F+TPE     K YRS  YMR L++++MQ A
Sbjct: 756 VEEGFRTAEGCYRTVWERLGMAFQTPEAYREKKVYRSLAYMRPLSIWSMQLA 807


>gi|390344669|ref|XP_001200431.2| PREDICTED: non-lysosomal glucosylceramidase-like, partial
           [Strongylocentrotus purpuratus]
          Length = 347

 Score =  258 bits (659), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 127/290 (43%), Positives = 179/290 (61%), Gaps = 12/290 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           PEE+P+  +N Y   D ++WKDLNLK ++ +YRDY   KD  FL  ++P C+ ++ K  S
Sbjct: 41  PEEDPWLKLNCYWFHDTAEWKDLNLKMVLMVYRDYFATKDREFLDFMWPKCMTVMAKANS 100

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
            DK G G++ + G  DQT          AYCGGL++AAL CM EM+ LL D + ++Q+S 
Sbjct: 101 QDKDGDGLIDHFGTADQTYDVWTAKGCSAYCGGLYLAALKCMCEMADLLGDDKAKRQYSE 160

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE--AIFPSAN 195
            L +    Y   LW G Y+ +DSS   + DSIMADQ CG+ +L+A  +  E   +F   +
Sbjct: 161 ILQRGKATYEKKLWNGRYFNYDSSDQSYHDSIMADQTCGHWYLRACDLVPERDQVFQVDH 220

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
           +  AL TIF  NV G + G+ GA+NG++P+G  D +++Q EEVW G TY L+  M+ EG 
Sbjct: 221 VRSALRTIFDMNVMGVKEGNFGAMNGIRPSGKPDHTSLQGEEVWVGTTYGLAGNMIQEGM 280

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
            +E + TA G Y T YE+ GL ++ PE     K YRS GYMR LA++AMQ
Sbjct: 281 WEEGFRTAKGCYTTCYEQAGLAYQVPEAYMSKKIYRSLGYMRPLAIWAMQ 330


>gi|363744064|ref|XP_003642968.1| PREDICTED: non-lysosomal glucosylceramidase-like [Gallus gallus]
          Length = 853

 Score =  256 bits (655), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 133/292 (45%), Positives = 183/292 (62%), Gaps = 12/292 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D ++WKDLNLKF++ +YRDY L  D  +L  ++P C  ++     
Sbjct: 541 PDDEPWQRVNAYLMHDTANWKDLNLKFVLQVYRDYYLTHDALYLQDMWPVCQAVMESELK 600

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FD    G+++NGG  DQT          AYCGGL +AA+  M EM+ +L DT+ +Q++ A
Sbjct: 601 FDTDNDGLIENGGIADQTYDAWVVDGASAYCGGLWLAAVRMMCEMAEVLGDTETRQKYDA 660

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAI--FPSAN 195
            L K  + +  LLW G YY +DSS S  S SIM+DQ  G  FL A G++ + +  FP ++
Sbjct: 661 ILQKGKESFERLLWNGKYYNYDSSGSSTSSSIMSDQCAGQWFLGACGLDQKEVEVFPKSH 720

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
           I  AL TIF  NV  F  G MGAVNGM+P+G  D S++QS EVW GV Y L+A M+ EG 
Sbjct: 721 IVSALKTIFEKNVMSFAGGKMGAVNGMRPDGVPDTSSVQSSEVWVGVVYALAATMIQEGL 780

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
           V E + TA G YRTV+E  G+ F+TPE     K YRS  YMR L++++MQ A
Sbjct: 781 VQEGFHTAEGCYRTVWEHLGMAFQTPEAYCEKKVYRSLAYMRPLSIWSMQLA 832


>gi|348520215|ref|XP_003447624.1| PREDICTED: non-lysosomal glucosylceramidase-like [Oreochromis
           niloticus]
          Length = 856

 Score =  256 bits (654), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 123/292 (42%), Positives = 179/292 (61%), Gaps = 12/292 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D + WKDLNLKF++ +YRD+ L +D ++L  ++P C  ++     
Sbjct: 547 PDDEPWMRVNAYLIHDTAGWKDLNLKFVLQVYRDFHLTQDRQYLQDMWPICQAVMESELK 606

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FD  G G+++N G+ DQT          AYCGG+ +A+L  M +M+ LL+  +  + +  
Sbjct: 607 FDLDGDGLIENSGYADQTYDGWIVTGPSAYCGGMWVASLCVMCKMARLLDSEESYRHYKD 666

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAI--FPSAN 195
            L + S  +  LLW G YY +DSS    S+S+M+DQ  G+ FL+ASG+       FP   
Sbjct: 667 LLDRGSAAFEKLLWNGKYYNYDSSGRELSNSVMSDQCAGHWFLRASGLGDGEFQAFPKEK 726

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
           I  AL ++F  NV  F  G MGAVNGM+P G  D S++QS+EVW GV Y L+A M++EG 
Sbjct: 727 IQSALKSVFDLNVMSFAGGQMGAVNGMRPEGVPDHSSVQSDEVWIGVVYGLAATMIHEGM 786

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
            +E   TA G YRTV+E+ G+ F+TPE       YRS  YMR L+++AMQ A
Sbjct: 787 QEEGMRTAEGCYRTVWEKLGMAFQTPEAYCEKNIYRSLAYMRPLSIWAMQLA 838


>gi|326934726|ref|XP_003213436.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
           [Meleagris gallopavo]
          Length = 639

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 133/292 (45%), Positives = 183/292 (62%), Gaps = 12/292 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D ++WKDLNLKF++ +YRDY L  D  +L  ++P C  ++     
Sbjct: 327 PDDEPWQRVNAYLMHDTANWKDLNLKFVLQVYRDYYLTHDALYLQDMWPVCQAVMESELK 386

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FD    G+++NGG  DQT          AYCGGL +AA+  M EM+ +L D + QQ++ A
Sbjct: 387 FDTDNDGLIENGGTADQTYDAWVVNGASAYCGGLWLAAVRMMCEMAEVLGDAETQQKYGA 446

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEA--IFPSAN 195
            L K  + +  LLW G YY +DSS S  S SIM+DQ  G  FL A G++ +   +FP ++
Sbjct: 447 ILQKGKESFERLLWNGKYYNYDSSGSSTSSSIMSDQCAGQWFLGACGLDQKELEVFPKSH 506

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
           I  AL TIF  NV  F  G+MGAVNGM+P+G  D S++QS EVW GV Y L+A M+ EG 
Sbjct: 507 IVSALKTIFEKNVMSFAGGTMGAVNGMRPDGVPDTSSVQSSEVWVGVVYALAATMIQEGL 566

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
           V E + TA G YRTV+E  G+ F+TPE     K YRS  YMR L++++MQ A
Sbjct: 567 VQEGFHTAEGCYRTVWENLGMAFQTPEAYCEKKVYRSLAYMRPLSIWSMQLA 618


>gi|449275451|gb|EMC84314.1| Non-lysosomal glucosylceramidase [Columba livia]
          Length = 692

 Score =  253 bits (645), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 129/290 (44%), Positives = 180/290 (62%), Gaps = 14/290 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P +EP+  +N Y + D ++WKDLNLKF++ +YRDY L  D  +L  ++P C  ++     
Sbjct: 386 PADEPWQRVNAYLIHDTANWKDLNLKFVLQVYRDYYLTHDSLYLRDMWPVCQAVMESELK 445

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FD    G+++NGGF DQT          AYCGGL +AA+  M +M+ +L D + QQ++  
Sbjct: 446 FDMDNDGLIENGGFADQTYDAWVVNGASAYCGGLWLAAVCMMCKMAEVLGDAEIQQKYMD 505

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIA 197
            L K  + +  LLW G YY +DSS S  S SIM+DQ  G+     S      +FP +++ 
Sbjct: 506 ILRKGKETFERLLWNGKYYNYDSSGSNTSSSIMSDQCAGHPKPPLS----MQVFPKSHVV 561

Query: 198 RALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
            AL TIF  NV GF  G+MGAVNGM+P+G  D S++QS EVW GV Y L+A M+ EG V+
Sbjct: 562 SALKTIFEKNVMGFAGGTMGAVNGMRPSGVPDTSSVQSNEVWVGVVYALAATMIQEGLVE 621

Query: 258 EAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
           E + TA G YRTV+E+ G+ F+TPE     K YRS  YMR L++++MQ A
Sbjct: 622 EGFRTAEGCYRTVWEQLGMAFQTPEAYREKKVYRSLAYMRPLSIWSMQLA 671


>gi|196002317|ref|XP_002111026.1| hypothetical protein TRIADDRAFT_54543 [Trichoplax adhaerens]
 gi|190586977|gb|EDV27030.1| hypothetical protein TRIADDRAFT_54543 [Trichoplax adhaerens]
          Length = 843

 Score =  250 bits (638), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 125/294 (42%), Positives = 180/294 (61%), Gaps = 14/294 (4%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P +EP+  IN Y + DV DW DLN KF++ IYRDY    ++ FL  ++P    ++    S
Sbjct: 538 PCDEPWKRINAYCISDVQDWTDLNTKFVLQIYRDYCATGNLEFLKDMWPCAKHVMETAIS 597

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
            D  G GI+ N G  DQT          AYCGG+ +AAL  M +++ L+   + + ++S 
Sbjct: 598 QDTDGDGIIDNNG-ADQTYDVWRMYGASAYCGGIFLAALYIMYKIADLIGCKEDKAKYSK 656

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L +    +   LW G YY +DSS+  HSDSIMADQ+CG+ +L+A G+   N + IFP +
Sbjct: 657 ILLRGRDSFQQKLWNGDYYLYDSSKGNHSDSIMADQMCGHWYLQACGLVNNNEDDIFPKS 716

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
            + +AL  +F  NV  F  G+ GAVNGMKPNG+ D S++QSEEVWTGVTY L+A ++ +G
Sbjct: 717 QVDKALRKVFDYNVMKFMNGTWGAVNGMKPNGEIDTSSVQSEEVWTGVTYALAANLIQQG 776

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAY 308
            + E +  A G+Y T +   GLGF+TPE       YR  GYMR L+++A++ A+
Sbjct: 777 MIKEGFKVAEGMYDTTFNVMGLGFQTPEAYYKINRYRCTGYMRPLSIWAIEWAF 830


>gi|260813884|ref|XP_002601646.1| hypothetical protein BRAFLDRAFT_124315 [Branchiostoma floridae]
 gi|229286945|gb|EEN57658.1| hypothetical protein BRAFLDRAFT_124315 [Branchiostoma floridae]
          Length = 871

 Score =  246 bits (628), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 122/296 (41%), Positives = 183/296 (61%), Gaps = 19/296 (6%)

Query: 29  EEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESF 88
           ++EPF  +N Y + D SDW+DLNLKF++ I+RDY + +D  FL  ++P    ++ K  ++
Sbjct: 562 DDEPFIHVNAYVMHDTSDWRDLNLKFVLQIFRDYFVGRDKGFLAFMWPRAQAVMTKALTW 621

Query: 89  DKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAK 138
           D    G++ N G  DQT          AYCGGL +AAL CMVE++++L   +++Q + A 
Sbjct: 622 DTDQDGMIDNSGAADQTYDAWIVTGASAYCGGLWLAALRCMVEIANILGKEEEEQHYLAV 681

Query: 139 LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE---------A 189
           + +A   Y + LW G YY +D S   +S+SIMA    G+ +L+A G+  E          
Sbjct: 682 MQRAKTAYEEKLWNGEYYNYDCSGRSYSNSIMAGAGSGHWYLRACGLVPEDNEVMDSDRE 741

Query: 190 IFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAA 249
           + P  ++A +L  ++  NV  F  G+MGAVNGM+PNG +D +++QSEEVWTG+TY L+A+
Sbjct: 742 VLPLDHVASSLRKVYDFNVMQFHQGTMGAVNGMRPNGKKDLTSMQSEEVWTGITYALAAS 801

Query: 250 MLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           M+ EG V+E + TA G+Y   Y + G  F+TPE       +RS GYMR LA++AMQ
Sbjct: 802 MIQEGMVEEGFQTAYGVYNMCYLQCGFAFQTPEAYLERDFFRSLGYMRPLAIWAMQ 857


>gi|189239183|ref|XP_966847.2| PREDICTED: similar to bile acid beta-glucosidase, putative
           [Tribolium castaneum]
 gi|270010940|gb|EFA07388.1| hypothetical protein TcasGA2_TC016367 [Tribolium castaneum]
          Length = 818

 Score =  244 bits (623), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 135/314 (42%), Positives = 186/314 (59%), Gaps = 19/314 (6%)

Query: 8   GEVKFRSAANTHERYYTRWFPEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--- 64
           GEV  R   NT         P EEPF L+N Y + DVS W+DLN KF++ ++RD  +   
Sbjct: 501 GEVCERKVPNTVPHDIGD--PGEEPFILLNSYPIHDVSQWRDLNSKFVLQVFRDAFITGL 558

Query: 65  -HKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHI 113
             + I +L  +Y  C  ++ K   FD  G G+++N G PDQT          AYCGGL +
Sbjct: 559 DDRSIAYLNDMYNACYTVMHKSLDFDVDGDGLIENSGSPDQTFDTWVMTGASAYCGGLWL 618

Query: 114 AALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQ 173
           AAL  M +++  L  T+ +++F   L K +  +   LW G  Y FD S  +   SIMADQ
Sbjct: 619 AALFAMTKIADALQKTEDKEKFQELLDKGTAAFERKLWNGKCYNFDCS-DKECRSIMADQ 677

Query: 174 LCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAI 233
           LCG  +L++ G N E +FP   +  +L TI+  NV+ F  G MGAVNG   +G  D   I
Sbjct: 678 LCGQWYLRSCGFNYE-VFPQDRVKTSLKTIYENNVQSFCDGRMGAVNGFI-DGVIDEFTI 735

Query: 234 QSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSG 293
           QS+EVWTGVTY L+A+ML EG   EA+ TAGG+++++ ER GL F+TPE L   K YR+ 
Sbjct: 736 QSQEVWTGVTYALAASMLQEGMRTEAFNTAGGMFKSMSERFGLSFDTPEALYAAKYYRAI 795

Query: 294 GYMRALAVYAMQDA 307
           GYMR L++++MQ A
Sbjct: 796 GYMRPLSIWSMQLA 809


>gi|71992731|ref|NP_001021682.1| Protein HPO-13, isoform c [Caenorhabditis elegans]
 gi|15130776|emb|CAC48128.1| Protein HPO-13, isoform c [Caenorhabditis elegans]
          Length = 922

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 124/294 (42%), Positives = 187/294 (63%), Gaps = 19/294 (6%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL----HKDIRFLTRVYPTC-LELI 82
           P +EP+   N Y + D   WKDLN+K++++ +RDY +    H+D  FL   +P   + ++
Sbjct: 562 PADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSNEHED--FLFHTWPAVRMIML 619

Query: 83  RKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQ 132
              E++D+ G G+++N G  DQT          AYCG L +A+L   +EM+ L+ D + Q
Sbjct: 620 EALENWDQDGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVAIEMAGLMKDDETQ 679

Query: 133 QQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKA-SGVNTEAIF 191
           Q F   L KA +V+ D LWTG+Y++F   +SR  +++MADQLCGY FL++ S    + + 
Sbjct: 680 QLFRNTLEKAKKVFIDTLWTGTYFRF-CERSRSRETVMADQLCGYWFLQSVSPELADELL 738

Query: 192 PSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAML 251
           P+  +  AL TI+  NV  F  G MGAVNGMKP+G  DR  IQ++E+WTGVTY +++ ++
Sbjct: 739 PNHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWTGVTYAVASLLI 798

Query: 252 YEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
            +G V++A+ TA G Y T +E TGL ++TPE L   K YR+ GYMR L+++AMQ
Sbjct: 799 QQGEVEKAFHTASGSYLTCFEETGLQYQTPEALYESKFYRAIGYMRPLSIWAMQ 852


>gi|71992723|ref|NP_001021681.1| Protein HPO-13, isoform b [Caenorhabditis elegans]
 gi|50511739|emb|CAH04764.1| Protein HPO-13, isoform b [Caenorhabditis elegans]
          Length = 930

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 124/294 (42%), Positives = 187/294 (63%), Gaps = 19/294 (6%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL----HKDIRFLTRVYPTC-LELI 82
           P +EP+   N Y + D   WKDLN+K++++ +RDY +    H+D  FL   +P   + ++
Sbjct: 570 PADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSNEHED--FLFHTWPAVRMIML 627

Query: 83  RKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQ 132
              E++D+ G G+++N G  DQT          AYCG L +A+L   +EM+ L+ D + Q
Sbjct: 628 EALENWDQDGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVAIEMAGLMKDDETQ 687

Query: 133 QQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKA-SGVNTEAIF 191
           Q F   L KA +V+ D LWTG+Y++F   +SR  +++MADQLCGY FL++ S    + + 
Sbjct: 688 QLFRNTLEKAKKVFIDTLWTGTYFRF-CERSRSRETVMADQLCGYWFLQSVSPELADELL 746

Query: 192 PSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAML 251
           P+  +  AL TI+  NV  F  G MGAVNGMKP+G  DR  IQ++E+WTGVTY +++ ++
Sbjct: 747 PNHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWTGVTYAVASLLI 806

Query: 252 YEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
            +G V++A+ TA G Y T +E TGL ++TPE L   K YR+ GYMR L+++AMQ
Sbjct: 807 QQGEVEKAFHTASGSYLTCFEETGLQYQTPEALYESKFYRAIGYMRPLSIWAMQ 860


>gi|71992715|ref|NP_001021680.1| Protein HPO-13, isoform a [Caenorhabditis elegans]
 gi|18376555|emb|CAD21661.1| Protein HPO-13, isoform a [Caenorhabditis elegans]
          Length = 959

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 124/294 (42%), Positives = 187/294 (63%), Gaps = 19/294 (6%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL----HKDIRFLTRVYPTC-LELI 82
           P +EP+   N Y + D   WKDLN+K++++ +RDY +    H+D  FL   +P   + ++
Sbjct: 599 PADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSNEHED--FLFHTWPAVRMIML 656

Query: 83  RKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQ 132
              E++D+ G G+++N G  DQT          AYCG L +A+L   +EM+ L+ D + Q
Sbjct: 657 EALENWDQDGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVAIEMAGLMKDDETQ 716

Query: 133 QQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKA-SGVNTEAIF 191
           Q F   L KA +V+ D LWTG+Y++F   +SR  +++MADQLCGY FL++ S    + + 
Sbjct: 717 QLFRNTLEKAKKVFIDTLWTGTYFRF-CERSRSRETVMADQLCGYWFLQSVSPELADELL 775

Query: 192 PSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAML 251
           P+  +  AL TI+  NV  F  G MGAVNGMKP+G  DR  IQ++E+WTGVTY +++ ++
Sbjct: 776 PNHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWTGVTYAVASLLI 835

Query: 252 YEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
            +G V++A+ TA G Y T +E TGL ++TPE L   K YR+ GYMR L+++AMQ
Sbjct: 836 QQGEVEKAFHTASGSYLTCFEETGLQYQTPEALYESKFYRAIGYMRPLSIWAMQ 889


>gi|268569852|ref|XP_002640631.1| Hypothetical protein CBG08749 [Caenorhabditis briggsae]
          Length = 899

 Score =  235 bits (599), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 121/292 (41%), Positives = 186/292 (63%), Gaps = 15/292 (5%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTC-LELIRK 84
           P +EP+   N Y + D   WKDLN+K++++ +RDY +   +   FL   +P   + ++  
Sbjct: 547 PADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSEEHQEFLHHTWPAVKMIMLEA 606

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQ 134
             ++D+ G G+++N G  DQT          AYCG L +A+L   +EM+ LL D + ++ 
Sbjct: 607 LANWDQNGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVSIEMAGLLEDGETKKL 666

Query: 135 FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKA-SGVNTEAIFPS 193
           F   L KA +V+ D+LWTG+Y++F   +SR  +++MADQLCGY FL++ S    + + PS
Sbjct: 667 FLETLNKAKKVFVDVLWTGTYFRF-CERSRSRETVMADQLCGYWFLQSVSPEMADDLLPS 725

Query: 194 ANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYE 253
             +  AL TI+  NV  F  G MGAVNGMKP+G  DR  IQ++E+WTGVTY +++ ++ +
Sbjct: 726 HMVRSALDTIYRLNVCQFGNGQMGAVNGMKPSGVVDREYIQADEMWTGVTYAVASLLIQQ 785

Query: 254 GNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           G V++A+ TA G Y T +E+TGL ++TPE L   K YR+ GYMR L+++AMQ
Sbjct: 786 GEVEKAFHTASGSYLTCFEQTGLQYQTPEALYESKFYRAIGYMRPLSIWAMQ 837


>gi|308485740|ref|XP_003105068.1| hypothetical protein CRE_20734 [Caenorhabditis remanei]
 gi|308257013|gb|EFP00966.1| hypothetical protein CRE_20734 [Caenorhabditis remanei]
          Length = 933

 Score =  234 bits (598), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 120/292 (41%), Positives = 184/292 (63%), Gaps = 15/292 (5%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTC-LELIRK 84
           P +EP+   N Y + D   WKDLN+K++++ +RDY +   +   FL   +P   + ++  
Sbjct: 575 PADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSEEHQEFLFHTWPAVKMIMLEA 634

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQ 134
            E++D+ G G+++N G  DQT          AYCG L +A+L   +EM+ L+ +   +  
Sbjct: 635 LENWDQNGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVSIEMARLMGEEATENL 694

Query: 135 FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKA-SGVNTEAIFPS 193
           F   L KA +V+ D LWTG+Y++F   +SR  +++MADQLCGY FL++ S    + + P+
Sbjct: 695 FRVTLNKAKKVFIDTLWTGTYFRF-CERSRSRETVMADQLCGYWFLQSVSPEMVDDLLPN 753

Query: 194 ANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYE 253
             +  AL TI+  NV  F  G MGAVNGMKPNG  DR  IQ++E+WTGVTY +++ ++ +
Sbjct: 754 HMVRSALDTIYRLNVCRFGNGQMGAVNGMKPNGVVDREYIQADEMWTGVTYAVASLLIQQ 813

Query: 254 GNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           G V++A+ TA G Y T +E+TGL ++TPE L   K YR+ GYMR L+++AMQ
Sbjct: 814 GEVEKAFHTASGSYLTCFEQTGLQYQTPEALYESKFYRAIGYMRPLSIWAMQ 865


>gi|341901617|gb|EGT57552.1| hypothetical protein CAEBREN_15139 [Caenorhabditis brenneri]
          Length = 925

 Score =  234 bits (596), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 120/292 (41%), Positives = 185/292 (63%), Gaps = 15/292 (5%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIR--FLTRVYPTC-LELIRK 84
           P +EP+   N Y + D   WKDLN+K++++ +RDY +  +    FL   +P   + ++  
Sbjct: 551 PADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSEAHQEFLHHTWPAIKMIMLEA 610

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQ 134
            E++D+ G G+++N G  DQT          AYCG L +A+L   VEM++++ D   +  
Sbjct: 611 LENWDQNGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVSVEMAAIVKDEITENL 670

Query: 135 FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKA-SGVNTEAIFPS 193
           F   L KA +V+ DLLWTG+Y++F   +SR  +++MADQLCGY FL++ S    + + P+
Sbjct: 671 FKGTLEKAKKVFIDLLWTGTYFRF-CERSRSRETVMADQLCGYWFLQSVSPEMADDLLPN 729

Query: 194 ANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYE 253
             +  AL TI+  NV  F  G MGAVNGMKP+G  DR  IQ++E+WTGVTY +++ ++ +
Sbjct: 730 HMVRSALDTIYRLNVCKFGNGQMGAVNGMKPSGVVDREYIQADEMWTGVTYAVASLLIQQ 789

Query: 254 GNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           G V++A+ TA G Y T +E+ GL ++TPE L   K YR+ GYMR L+++AMQ
Sbjct: 790 GEVEKAFHTASGSYLTCFEQAGLQYQTPEALYESKFYRAIGYMRPLSIWAMQ 841


>gi|297795757|ref|XP_002865763.1| hypothetical protein ARALYDRAFT_331392 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311598|gb|EFH42022.1| hypothetical protein ARALYDRAFT_331392 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 956

 Score =  229 bits (584), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 123/296 (41%), Positives = 171/296 (57%), Gaps = 14/296 (4%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +NGY + +   WKDLN KF++ +YRD     D +F + V+P+    +     FDK
Sbjct: 605 DPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMAYMAQFDK 664

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N GFPDQT          AYCGGL +AAL     ++ ++ D   Q  F +K  
Sbjct: 665 DGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASVLARVVGDKNSQDYFWSKFQ 724

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY   LW GSY+ +D+S S++S SI ADQL G  + +ASG+    I        AL
Sbjct: 725 KAKVVYEKKLWNGSYFNYDNSGSQYSSSIQADQLAGQWYARASGL--LPIVDEDKARTAL 782

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
             +++ NV   + G  GAVNGM PNG  D +++QS E+W+GVTY LSA M+ EG V+ A+
Sbjct: 783 EKVYNFNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMAF 842

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 314
            TA G+Y   +   GLG  F+TPE    +  YRS  YMR LA++AMQ A  K   K
Sbjct: 843 QTASGVYEAAWSEAGLGYSFQTPEAWNTNDEYRSLTYMRPLAIWAMQWALTKTSQK 898


>gi|10047285|dbj|BAB13431.1| KIAA1605 protein [Homo sapiens]
          Length = 922

 Score =  228 bits (581), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 111/239 (46%), Positives = 157/239 (65%), Gaps = 14/239 (5%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 643 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 702

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 703 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 762

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 763 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 821

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYE 253
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ E
Sbjct: 822 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 880


>gi|119578751|gb|EAW58347.1| glucosidase, beta (bile acid) 2, isoform CRA_c [Homo sapiens]
 gi|168270622|dbj|BAG10104.1| non-lysosomal glucosylceramidase [synthetic construct]
          Length = 877

 Score =  228 bits (580), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 111/239 (46%), Positives = 157/239 (65%), Gaps = 14/239 (5%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 717

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +DSS    S S+M+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 718 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 776

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYE 253
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  D+S++QS+EVW GV Y L+A M+ E
Sbjct: 777 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 835


>gi|148670515|gb|EDL02462.1| glucosidase beta 2, isoform CRA_b [Mus musculus]
          Length = 377

 Score =  226 bits (575), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 112/239 (46%), Positives = 155/239 (64%), Gaps = 14/239 (5%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ IYRDY L  D  FL  ++P CL ++     
Sbjct: 89  PDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMK 148

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M+ L      Q++F++
Sbjct: 149 FDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFAS 208

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L +  + Y  LLW G YY +DSS    S SIM+DQ  G  FL+A G+   +TE +FP+ 
Sbjct: 209 ILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTE-VFPTL 267

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYE 253
           ++ RAL TIF  NV+ F  G+MGAVNGM P+G  DRS++QS+EVW GV Y L+A M+ E
Sbjct: 268 HVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQE 326


>gi|312382731|gb|EFR28085.1| hypothetical protein AND_04399 [Anopheles darlingi]
          Length = 777

 Score =  226 bits (575), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 114/263 (43%), Positives = 168/263 (63%), Gaps = 18/263 (6%)

Query: 68  IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
           + +L  +YP C  ++ +   +DK   G+++N   PDQT          AYCGGL +A+L 
Sbjct: 515 MTYLKAMYPACKLVLERSREWDKDDDGLIENSRSPDQTYDSWVMDGPSAYCGGLWLASLH 574

Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
           CM  M+SLL+ T+    + A L K  + + + LW G+YYKFD+ QS   +SIM+DQLCG+
Sbjct: 575 CMATMASLLDQTEDAANYQAILDKGRRSFEEKLWNGTYYKFDA-QSASKNSIMSDQLCGH 633

Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN------GDRDRS 231
            +L+A G + + +FP  N+  A+ TI+  NV  F  G++GAVNG  PN      G  D S
Sbjct: 634 WYLRACGFDYD-VFPKENVRLAMRTIYENNVMRFCGGNLGAVNGYVPNAQPNKEGRPDVS 692

Query: 232 AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYR 291
            IQ EEVWTGVTY L+A M++EG  +EA+ TAGGLYR++ E+ G+ FETPE +  ++ YR
Sbjct: 693 NIQGEEVWTGVTYALAATMIHEGMFEEAFQTAGGLYRSLSEKIGMNFETPEAVYAERHYR 752

Query: 292 SGGYMRALAVYAMQDAYLKGKVK 314
           + GYMR L++++MQ A+   K++
Sbjct: 753 AIGYMRPLSIWSMQTAWELRKLR 775



 Score = 60.5 bits (145), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 23/35 (65%), Positives = 29/35 (82%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
           P EEPF LIN Y + DVS+W+DLN+KFI+ +YRDY
Sbjct: 405 PAEEPFDLINAYPIHDVSEWRDLNIKFILQVYRDY 439


>gi|357630375|gb|EHJ78536.1| hypothetical protein KGM_16417 [Danaus plexippus]
          Length = 912

 Score =  225 bits (574), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 119/253 (47%), Positives = 168/253 (66%), Gaps = 15/253 (5%)

Query: 69  RFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALAC 118
           ++L+ +YP+C+ L+R+   +D+ G G+++NGGFPDQT          AYCGGL +A+++ 
Sbjct: 641 QYLSDMYPSCVTLLRRGLDWDRDGDGLIENGGFPDQTYDAWVMTGPSAYCGGLWVASVSA 700

Query: 119 MVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
           +  M+ +L  T  +++FS  L KA   Y   LW GSYYKFD+    +S+ +MADQL G  
Sbjct: 701 VHAMAKILGFTDDEKEFSTLLEKARDSYERKLWNGSYYKFDTKPC-NSEVVMADQLAGQW 759

Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGM--KPNGDRDRSAIQSE 236
           FL+ASG  TE +FP AN+ +AL TI+  NV+ F  G MGAVNG    P    D +AIQSE
Sbjct: 760 FLRASGW-TEPVFPEANVKKALHTIYENNVQRFLNGRMGAVNGFVRGPRPGIDTTAIQSE 818

Query: 237 EVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYM 296
           EVWTGVTY L+A M+YEG  ++A++TAGGLY T+  + GL FETPE L  +  +RS  YM
Sbjct: 819 EVWTGVTYGLAALMIYEGMHEQAFSTAGGLYNTLM-KMGLAFETPEALYENGNHRSVAYM 877

Query: 297 RALAVYAMQDAYL 309
           R L++++M  A +
Sbjct: 878 RPLSIWSMYHAII 890



 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 23/37 (62%), Positives = 29/37 (78%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL 64
           P+  PFS IN YN+ DVS+W+DLNLKFI+ + RDY L
Sbjct: 492 PDGVPFSHINAYNIHDVSEWRDLNLKFILQVMRDYRL 528


>gi|168014916|ref|XP_001759997.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162688747|gb|EDQ75122.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 970

 Score =  223 bits (569), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 121/287 (42%), Positives = 167/287 (58%), Gaps = 12/287 (4%)

Query: 29  EEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESF 88
           + +P++ +N YN+ D S WKDLN KF++ +YRD     D  F   V+P     +   + F
Sbjct: 620 QHDPWAEVNAYNIHDTSRWKDLNAKFVLQVYRDVVATGDKVFAKAVWPAVYAAMAYLDQF 679

Query: 89  DKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAK 138
           D+   G+++N GFPDQT          AYCGGL IAAL     M+ LL +    + F +K
Sbjct: 680 DRDRDGLIENDGFPDQTYDTWTVHGVSAYCGGLWIAALQAAEAMAVLLEEKNAARYFQSK 739

Query: 139 LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIAR 198
             +A + Y   LW GSY+ +DS  S +S SI ADQ+ G  +  ASG+    +F  A    
Sbjct: 740 FIQAREAYLKKLWNGSYFSYDSGNSSNSKSIQADQMAGQWYAWASGL--PPLFDDAKARS 797

Query: 199 ALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 258
           AL  IF  NV   + G +GAVNGM PNG  D + +QS EVWTGVTY L+AAM++EG +D+
Sbjct: 798 ALQKIFDFNVMKVKNGHLGAVNGMHPNGKVDETCMQSREVWTGVTYALAAAMIHEGMLDQ 857

Query: 259 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           A+  A G++   +   G  F+TPE  T D  +RS  YMR LA++AMQ
Sbjct: 858 AFRAAEGVFIAGWSDLGYWFQTPEAWTMDGYFRSLAYMRPLAIWAMQ 904


>gi|431902810|gb|ELK09025.1| Non-lysosomal glucosylceramidase [Pteropus alecto]
          Length = 970

 Score =  223 bits (568), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 111/242 (45%), Positives = 157/242 (64%), Gaps = 14/242 (5%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P CL ++     
Sbjct: 566 PDDEPWLRVNAYMIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMK 625

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L        +FS+
Sbjct: 626 FDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAALCGTQDILDKFSS 685

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            L++  + Y  LLW G YY +D S    S SIM+DQ  G  FL+ASG+   +TE +FP+ 
Sbjct: 686 ILSRGREAYERLLWNGHYYNYDCSSQPQSYSIMSDQCAGQWFLRASGLGEGDTE-VFPTP 744

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++  AL TIF  NV+ F  G+MGAVNGM+P+G  DRS++QS+EVW GV Y L+A M+ E 
Sbjct: 745 HVVCALQTIFKFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQET 804

Query: 255 NV 256
           N+
Sbjct: 805 NL 806



 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 20/38 (52%), Positives = 27/38 (71%)

Query: 267 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           YRTV+ER GL F+TPE     + +RS  YMR L+++AM
Sbjct: 891 YRTVWERLGLAFQTPEAYCQHQVFRSLAYMRPLSIWAM 928


>gi|110742221|dbj|BAE99037.1| hypothetical protein [Arabidopsis thaliana]
          Length = 957

 Score =  222 bits (566), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 123/296 (41%), Positives = 172/296 (58%), Gaps = 14/296 (4%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +NGY + +   WKDLN KF++ +YRD     D +F + V+P+    +     FDK
Sbjct: 605 DPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMAYMAQFDK 664

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N GFPDQT          AYCGGL +AAL     ++ ++ D   Q  F +K  
Sbjct: 665 DGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGDKNSQDYFWSKFQ 724

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY   LW GSY+ +D+S S++S +I ADQL G  + +ASG+    I        AL
Sbjct: 725 KAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGL--LPIVDEDKARTAL 782

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
             +++ NV   + G  GAVNGM PNG  D +++QS E+W+GVTY LSA M+ EG V+ A+
Sbjct: 783 EKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMAF 842

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 314
            TA G+Y   + +TGLG  F+TPE       YRS  YMR LA++AMQ A  K   K
Sbjct: 843 QTASGIYEAAWSKTGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQK 898


>gi|8777421|dbj|BAA97011.1| unnamed protein product [Arabidopsis thaliana]
          Length = 928

 Score =  221 bits (564), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 123/296 (41%), Positives = 171/296 (57%), Gaps = 14/296 (4%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +NGY + +   WKDLN KF++ +YRD     D +F + V+P+    +     FDK
Sbjct: 576 DPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMAYMAQFDK 635

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N GFPDQT          AYCGGL +AAL     ++ ++ D   Q  F +K  
Sbjct: 636 DGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGDKNSQDYFWSKFQ 695

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY   LW GSY+ +D+S S++S +I ADQL G  + +ASG+    I        AL
Sbjct: 696 KAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGL--LPIVDEDKARTAL 753

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
             +++ NV   + G  GAVNGM PNG  D +++QS E+W+GVTY LSA M+ EG V+ A+
Sbjct: 754 EKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMAF 813

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 314
            TA G+Y   +  TGLG  F+TPE       YRS  YMR LA++AMQ A  K   K
Sbjct: 814 QTASGIYEAAWSETGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQK 869


>gi|222423179|dbj|BAH19567.1| AT5G49900 [Arabidopsis thaliana]
          Length = 957

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 123/296 (41%), Positives = 171/296 (57%), Gaps = 14/296 (4%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +NGY + +   WKDLN KF++ +YRD     D +F + V+P+    +     FDK
Sbjct: 605 DPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMAYMAQFDK 664

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N GFPDQT          AYCGGL +AAL     ++ ++ D   Q  F +K  
Sbjct: 665 DGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGDKNSQDYFWSKFQ 724

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY   LW GSY+ +D+S S++S +I ADQL G  + +ASG+    I        AL
Sbjct: 725 KAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGL--LPIVDEDKARTAL 782

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
             +++ NV   + G  GAVNGM PNG  D +++QS E+W+GVTY LSA M+ EG V+ A+
Sbjct: 783 EKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMAF 842

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 314
            TA G+Y   +  TGLG  F+TPE       YRS  YMR LA++AMQ A  K   K
Sbjct: 843 QTASGIYEAAWSETGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQK 898


>gi|30695777|ref|NP_199801.2| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
 gi|27311753|gb|AAO00842.1| putative protein [Arabidopsis thaliana]
 gi|34365725|gb|AAQ65174.1| At5g49900 [Arabidopsis thaliana]
 gi|332008486|gb|AED95869.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
          Length = 957

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 123/296 (41%), Positives = 171/296 (57%), Gaps = 14/296 (4%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +NGY + +   WKDLN KF++ +YRD     D +F + V+P+    +     FDK
Sbjct: 605 DPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMAYMAQFDK 664

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N GFPDQT          AYCGGL +AAL     ++ ++ D   Q  F +K  
Sbjct: 665 DGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGDKNSQDYFWSKFQ 724

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY   LW GSY+ +D+S S++S +I ADQL G  + +ASG+    I        AL
Sbjct: 725 KAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGL--LPIVDEDKARTAL 782

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
             +++ NV   + G  GAVNGM PNG  D +++QS E+W+GVTY LSA M+ EG V+ A+
Sbjct: 783 EKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMAF 842

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 314
            TA G+Y   +  TGLG  F+TPE       YRS  YMR LA++AMQ A  K   K
Sbjct: 843 QTASGIYEAAWSETGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQK 898


>gi|157119417|ref|XP_001653371.1| bile acid beta-glucosidase, putative [Aedes aegypti]
 gi|108883154|gb|EAT47379.1| AAEL001478-PA, partial [Aedes aegypti]
          Length = 898

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 111/261 (42%), Positives = 162/261 (62%), Gaps = 18/261 (6%)

Query: 68  IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
           + +L  +YP C +++ K   +DK   GI++N   PDQT          AYCGGL +A+L 
Sbjct: 636 MTYLKSMYPACKQVLEKTLEWDKDNDGIIENSKCPDQTYDTWVMDGPSAYCGGLWLASLH 695

Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
           CM  M++LL+      +F   L K    + + LW G+YYKFD  QS   ++IM+DQLCG+
Sbjct: 696 CMTAMANLLDQNDDCNKFREILEKGKTSFEEKLWNGTYYKFDG-QSASKNTIMSDQLCGH 754

Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP------NGDRDRS 231
            +L+  G + + IFP  N+  AL TI+  NV  F  G+MGAVNG  P      +G  D S
Sbjct: 755 WYLRCCGFDYD-IFPKENVRTALKTIYDNNVMRFCGGNMGAVNGYVPSTQPNKDGRADAS 813

Query: 232 AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYR 291
            +Q EEVWTGVTY L++ M++EG   EA+ TAGG+Y+T+ E+ G+ FETPE L  ++ YR
Sbjct: 814 TVQGEEVWTGVTYALASTMIHEGMFTEAFQTAGGMYQTLSEKIGMNFETPEALYAERHYR 873

Query: 292 SGGYMRALAVYAMQDAYLKGK 312
           + GYMR L+++++Q A+ + K
Sbjct: 874 AIGYMRPLSIWSLQTAWEQKK 894



 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 23/35 (65%), Positives = 29/35 (82%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
           P+EEPF LIN Y + DVS+W+DLN KFI+ +YRDY
Sbjct: 533 PDEEPFDLINAYPIHDVSEWRDLNTKFILQVYRDY 567


>gi|168037390|ref|XP_001771187.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677567|gb|EDQ64036.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 981

 Score =  221 bits (562), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 120/294 (40%), Positives = 165/294 (56%), Gaps = 12/294 (4%)

Query: 29  EEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESF 88
           + +P+  +N YN+ D S WKDLN KF++ +YRD     D +F   V+P     +   + F
Sbjct: 615 QHDPWVEVNAYNIHDTSHWKDLNPKFVLQVYRDVVATGDRQFAKAVWPAVYAAMAYMDQF 674

Query: 89  DKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAK 138
           D+   G+++N GFPDQT          AYCGGL IAAL     M+ L++D      F  K
Sbjct: 675 DRDRDGLIENDGFPDQTYDTWSVHGVSAYCGGLWIAALQAAAAMADLVDDKDAATYFKGK 734

Query: 139 LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIAR 198
             +A  VY   LW G Y+ +DS  S +S+S+ ADQL G  +  ASG+    +F       
Sbjct: 735 FNQARDVYERKLWNGEYFNYDSGTSSNSNSVQADQLAGQWYAWASGL--PPLFDDYKARS 792

Query: 199 ALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 258
           AL  +F  NV   + G  GA NGM PNG  D + +QS E+WTGVTY  SAAM++EG V++
Sbjct: 793 ALQKVFDFNVMKVKGGKWGAANGMHPNGKVDETCMQSREIWTGVTYAASAAMIHEGMVEQ 852

Query: 259 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 312
           A+T A G++   +   G  F+TPE  T D  +RS  YMR LA++AMQ A    K
Sbjct: 853 AFTAAQGVFLAGWSDLGYWFQTPEAWTIDGYFRSLAYMRPLAIWAMQWALYPPK 906


>gi|170036781|ref|XP_001846240.1| bile acid beta-glucosidase [Culex quinquefasciatus]
 gi|167879683|gb|EDS43066.1| bile acid beta-glucosidase [Culex quinquefasciatus]
          Length = 913

 Score =  220 bits (560), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 112/261 (42%), Positives = 165/261 (63%), Gaps = 18/261 (6%)

Query: 68  IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
           + +L  +YP C +++ K   +DK   G+++N   PDQT          +YCGGL +A+L 
Sbjct: 651 MTYLKSMYPACKQVLEKTLEWDKDNDGLIENSKSPDQTYDTWVMDGPSSYCGGLWLASLH 710

Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
           CM  M++LL+ ++   ++   L K    + + LW GSYYKFDS QS   +SIM+DQLCG+
Sbjct: 711 CMSVMANLLDQSEDCIKYREILEKGKASFEEKLWNGSYYKFDS-QSSSKNSIMSDQLCGH 769

Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP------NGDRDRS 231
            +L+  G + + I P  N+  AL TI+  NV  F  G+MGAVNG  P      +G  D S
Sbjct: 770 WYLRCCGFDYD-IVPKENVRTALKTIYDNNVMRFCGGNMGAVNGYIPSSQPNKDGRADSS 828

Query: 232 AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYR 291
            +Q+EEVWTGVTY L++ M++EG   EA+ TAGGLY+T+ E+ G+ FETPE L  ++ YR
Sbjct: 829 TLQAEEVWTGVTYALASTMIHEGMFTEAFQTAGGLYQTLSEKIGMSFETPEALYAERHYR 888

Query: 292 SGGYMRALAVYAMQDAYLKGK 312
           + GYMR L++++MQ A+ + K
Sbjct: 889 AIGYMRPLSIWSMQTAWEQKK 909



 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 29/35 (82%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
           P+EEPF LIN Y + DVS+WKDLN KFI+ +YRDY
Sbjct: 548 PDEEPFELINAYPIHDVSEWKDLNTKFILQVYRDY 582


>gi|444729905|gb|ELW70308.1| Non-lysosomal glucosylceramidase [Tupaia chinensis]
          Length = 279

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 112/230 (48%), Positives = 152/230 (66%), Gaps = 14/230 (6%)

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FDK   G+++NGG+ DQT          AYCGGL +AA+A MV+M++L      Q++F++
Sbjct: 10  FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDVQEKFTS 69

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
            LT+  + Y  LLW G YY +DSS    S SIM+DQ  G  FLKA G+   +TE +FP+ 
Sbjct: 70  ILTRGQEAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLKACGLGEGDTE-VFPTP 128

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
           ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 129 HVVRALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 188

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
              E + TA G YRTV+ER GL F+TPE     + +RS  YMR L+++AM
Sbjct: 189 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIWAM 238


>gi|158299437|ref|XP_319575.4| AGAP008830-PA [Anopheles gambiae str. PEST]
 gi|157013519|gb|EAA14848.4| AGAP008830-PA [Anopheles gambiae str. PEST]
          Length = 913

 Score =  218 bits (556), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 108/263 (41%), Positives = 166/263 (63%), Gaps = 18/263 (6%)

Query: 68  IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
           + +L  +YP C  ++     +DK G G+++N   PDQT          AYCGGL +A+L 
Sbjct: 651 LTYLKAMYPACRVVLEHSLEWDKDGDGLIENSKAPDQTYDTWVMDGPSAYCGGLWLASLH 710

Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
           CM  M+SLL+  +   ++ A L K    + + LW G+YY+FD+ QS   +SIM+DQLCG+
Sbjct: 711 CMSAMASLLDQNEDSDRYKAVLDKGRASFEEKLWNGTYYRFDA-QSASKNSIMSDQLCGH 769

Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS------ 231
            +L++ G + + +FP  N+  A+ TI+  NV  F  G +GAVNG  P+G  ++       
Sbjct: 770 WYLRSCGFDYD-VFPKENVRLAMRTIYENNVMRFCGGQLGAVNGYVPSGQPNKDGRPDTV 828

Query: 232 AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYR 291
           +IQ EEVWTGVTY L++ M++EG  +EA+ TAGGLY+ + ER G+ FETPE +  ++ YR
Sbjct: 829 SIQGEEVWTGVTYALASTMIHEGMFEEAFKTAGGLYQALSERIGMNFETPEAVYAERHYR 888

Query: 292 SGGYMRALAVYAMQDAYLKGKVK 314
           + GYMR L++++MQ A+   K++
Sbjct: 889 AIGYMRPLSIWSMQTAWEMKKLR 911



 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 29/35 (82%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
           P EEPF LIN Y + DVS+W+DLNLKFI+ +YRDY
Sbjct: 541 PAEEPFDLINAYPIHDVSEWRDLNLKFILQVYRDY 575


>gi|324504438|gb|ADY41918.1| Non-lysosomal glucosylceramidase [Ascaris suum]
          Length = 534

 Score =  218 bits (554), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 120/301 (39%), Positives = 165/301 (54%), Gaps = 22/301 (7%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRF----------LTRVYPT 77
           P +EP+  +N Y + D  +WKDLNLKF+++ YRDY     I F          L R Y  
Sbjct: 226 PMDEPWLHLNAYALSDTCEWKDLNLKFVLTCYRDYEKIVKIYFNDDNEMKGCLLRRFYDL 285

Query: 78  CLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLN 127
              +I   +++D  G  +++N G PDQT          AYCGGL + AL C+  M+  L 
Sbjct: 286 SSGIIADAKAWDVDGDDLIENAGQPDQTYDVWSMHGSSAYCGGLWLCALECVRRMALTLG 345

Query: 128 DTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKAS-GVN 186
           +    Q+F+ KL  A + Y   LW G Y+ FD   + H  SIMADQLCG+ F+  + G  
Sbjct: 346 EVVDAQKFANKLNNARKAYERKLWNGKYFDFDEHSTDHK-SIMADQLCGFWFMCITDGKV 404

Query: 187 TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLL 246
            + I     I  +L TIF  NV  F  G +G VN M P+G  D + IQSEEVW GV Y L
Sbjct: 405 DDVIITRQQICASLKTIFEYNVEKFANGQLGPVNAMMPSGVVDSTGIQSEEVWGGVAYAL 464

Query: 247 SAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQD 306
           ++  L     + A+ TA G YR+ +ER GL +++PE +     YR+ GYMR LA++AMQ 
Sbjct: 465 ASFHLLVEENESAFKTAEGWYRSCWERYGLQYQSPEAINESSYYRAIGYMRPLAIWAMQS 524

Query: 307 A 307
           A
Sbjct: 525 A 525


>gi|326516802|dbj|BAJ96393.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 970

 Score =  218 bits (554), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 123/288 (42%), Positives = 164/288 (56%), Gaps = 14/288 (4%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ D S WKDLN KF++ +YRD+A   D+ F   V+P     +   + FD
Sbjct: 630 HDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMTFGRDVWPAVSAAMDYMDQFD 689

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           + G G+++N GFPDQT          AYCGGL +AAL     M+  L D    +++  K 
Sbjct: 690 RDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPYAEKYKLKF 749

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            KA  VY   LW GSY+ +DS  S +S SI ADQL G  +  +SG+    IF    I  A
Sbjct: 750 MKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPIFDEHKIRSA 807

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  IF  NV   + G MGAVNGM P G  D + +QS E+WTGVTY ++A ML  G   + 
Sbjct: 808 LQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQG 867

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           + TA G++   +   G G  F+TPEG T D  YRS  YMR LA++AMQ
Sbjct: 868 FITAEGIFLAGWSEDGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAMQ 915


>gi|414871104|tpg|DAA49661.1| TPA: hypothetical protein ZEAMMB73_422318 [Zea mays]
          Length = 649

 Score =  217 bits (553), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 123/290 (42%), Positives = 166/290 (57%), Gaps = 14/290 (4%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ D S WKDLN KF++ IYRD+A   D++F   V+P     +   + FD
Sbjct: 309 HDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAATGDMQFGRDVWPAVCAAMDYMDQFD 368

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           + G G+++N GFPDQT          AYCG L +AAL     M+  L D    +++  K 
Sbjct: 369 RDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRHFAEKYKLKF 428

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            KA  VY   LW GSY+ +DS  S +S SI ADQL G  +  +SG+    +F    I  A
Sbjct: 429 IKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPLFDEHKIRTA 486

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  IF  NV   + G MGAVNGM P G  D + +QS E+WTGVTY ++A ML  G   + 
Sbjct: 487 LQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLLHGMEHQG 546

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
           +TTA G++   +   G G  F+TPEG T D  YRS  YMR LA++A+Q A
Sbjct: 547 FTTAEGIFTAGWSEEGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAIQYA 596


>gi|330844075|ref|XP_003293963.1| hypothetical protein DICPUDRAFT_158887 [Dictyostelium purpureum]
 gi|325075658|gb|EGC29519.1| hypothetical protein DICPUDRAFT_158887 [Dictyostelium purpureum]
          Length = 1205

 Score =  217 bits (553), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 116/296 (39%), Positives = 168/296 (56%), Gaps = 13/296 (4%)

Query: 28   PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELI-RKCE 86
            P E+P+  +N YN+ D+S WKDL  KF++ +YRDY + +D  FL +VY    E+I R  E
Sbjct: 885  PGEDPWKRVNSYNIQDISRWKDLPSKFVLQVYRDYLVVEDKNFLLQVYNVVEEVIQRTLE 944

Query: 87   SFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFS 136
            SFD    G+V N GFPDQT          AY GGL +A+L    EM+ +L   + +  ++
Sbjct: 945  SFDTDHDGVVDNEGFPDQTYDVWPAVGCSAYSGGLWLASLKVASEMAKILGFKEDESIYN 1004

Query: 137  AKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANI 196
            A   K S  Y   LW G Y+K+D S S H+DSIM+D L G+ +L + G+ +   F  A  
Sbjct: 1005 AIFEKGSASYTKKLWNGYYFKYDCSNSVHADSIMSDMLAGHWYLLSCGLPSYMTFDQA-- 1062

Query: 197  ARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNV 256
              +L+ I   NV  +     GAVNGM+P G  D + +QS EVW G ++ L+A M+     
Sbjct: 1063 LSSLSIINEYNVNSYGKERCGAVNGMRPEGVVDNTCLQSSEVWIGTSFSLAATMIQHHMD 1122

Query: 257  DEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 312
             +AW    G+  + Y R G  ++TPE    +  YR+G YMR L+++++Q A LK K
Sbjct: 1123 KDAWELVKGIVNSSYNRWGFQYQTPEAWDSNGCYRAGAYMRPLSIWSIQWALLKKK 1178


>gi|16905165|gb|AAL31035.1|AC078948_19 unknown protein [Oryza sativa Japonica Group]
          Length = 967

 Score =  217 bits (553), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 122/288 (42%), Positives = 166/288 (57%), Gaps = 14/288 (4%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ D S WKDLN KF++ +YRD+A   D+ F   V+P    ++     FD
Sbjct: 627 HDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVWPAVCAVMDYMNQFD 686

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           + G G+++N GFPDQT          AYCGGL +AAL     M+  L D    +++  K 
Sbjct: 687 RDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPFAEKYKLKF 746

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            +A  VY   LW GSY+ +DS  S +S SI ADQL G  +  +SG+    +F    I  A
Sbjct: 747 IQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPLFDENKIRSA 804

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  IF  NV   + G +GAVNGM PNG  D + +QS E+WTGVTY ++A ML  G   + 
Sbjct: 805 LQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQG 864

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           +TTA G++   +   G G  F+TPEG T D  YRS  YMR LA++AMQ
Sbjct: 865 FTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 912


>gi|222612993|gb|EEE51125.1| hypothetical protein OsJ_31868 [Oryza sativa Japonica Group]
          Length = 935

 Score =  217 bits (553), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 122/288 (42%), Positives = 166/288 (57%), Gaps = 14/288 (4%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ D S WKDLN KF++ +YRD+A   D+ F   V+P    ++     FD
Sbjct: 595 HDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVWPAVCAVMDYMNQFD 654

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           + G G+++N GFPDQT          AYCGGL +AAL     M+  L D    +++  K 
Sbjct: 655 RDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPFAEKYKLKF 714

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            +A  VY   LW GSY+ +DS  S +S SI ADQL G  +  +SG+    +F    I  A
Sbjct: 715 IQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPLFDENKIRSA 772

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  IF  NV   + G +GAVNGM PNG  D + +QS E+WTGVTY ++A ML  G   + 
Sbjct: 773 LQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQG 832

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           +TTA G++   +   G G  F+TPEG T D  YRS  YMR LA++AMQ
Sbjct: 833 FTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 880


>gi|115482480|ref|NP_001064833.1| Os10g0473400 [Oryza sativa Japonica Group]
 gi|110289241|gb|AAP54244.2| expressed protein [Oryza sativa Japonica Group]
 gi|113639442|dbj|BAF26747.1| Os10g0473400 [Oryza sativa Japonica Group]
 gi|215695411|dbj|BAG90602.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 974

 Score =  217 bits (553), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 122/288 (42%), Positives = 166/288 (57%), Gaps = 14/288 (4%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ D S WKDLN KF++ +YRD+A   D+ F   V+P    ++     FD
Sbjct: 634 HDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVWPAVCAVMDYMNQFD 693

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           + G G+++N GFPDQT          AYCGGL +AAL     M+  L D    +++  K 
Sbjct: 694 RDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPFAEKYKLKF 753

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            +A  VY   LW GSY+ +DS  S +S SI ADQL G  +  +SG+    +F    I  A
Sbjct: 754 IQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPLFDENKIRSA 811

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  IF  NV   + G +GAVNGM PNG  D + +QS E+WTGVTY ++A ML  G   + 
Sbjct: 812 LQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQG 871

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           +TTA G++   +   G G  F+TPEG T D  YRS  YMR LA++AMQ
Sbjct: 872 FTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 919


>gi|226499590|ref|NP_001146319.1| uncharacterized protein LOC100279895 [Zea mays]
 gi|219886613|gb|ACL53681.1| unknown [Zea mays]
          Length = 649

 Score =  217 bits (553), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 123/290 (42%), Positives = 166/290 (57%), Gaps = 14/290 (4%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ D S WKDLN KF++ IYRD+A   D++F   V+P     +   + FD
Sbjct: 309 HDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAATGDMQFGRGVWPAVCAAMDYMDQFD 368

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           + G G+++N GFPDQT          AYCG L +AAL     M+  L D    +++  K 
Sbjct: 369 RDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRHFAEKYKLKF 428

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            KA  VY   LW GSY+ +DS  S +S SI ADQL G  +  +SG+    +F    I  A
Sbjct: 429 IKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPLFDEHKIRTA 486

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  IF  NV   + G MGAVNGM P G  D + +QS E+WTGVTY ++A ML  G   + 
Sbjct: 487 LQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLLHGMEHQG 546

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
           +TTA G++   +   G G  F+TPEG T D  YRS  YMR LA++A+Q A
Sbjct: 547 FTTAEGIFTAGWSEEGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAIQYA 596


>gi|356535470|ref|XP_003536268.1| PREDICTED: non-lysosomal glucosylceramidase-like [Glycine max]
          Length = 949

 Score =  217 bits (552), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 121/294 (41%), Positives = 170/294 (57%), Gaps = 14/294 (4%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN+ D S WKDLN KF++ +YRD+A   D++F   V+P     +   E FD+
Sbjct: 609 DPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFATTGDLQFGVDVWPAVRAAMEYMEQFDR 668

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N GFPDQT           YCG L +AAL     M+  L D +  +    K  
Sbjct: 669 DGDGLIENDGFPDQTYDTWTVHGVSTYCGCLWLAALQAAAVMALELGDREFAETCKRKFL 728

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA   + + LW G+Y+ +DS  S +S SI ADQL G  +  +SG+   ++F  + I  AL
Sbjct: 729 KAKPAFEEKLWNGTYFNYDSGSSGNSKSIQADQLAGQWYTASSGL--PSLFEDSKIKSAL 786

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
             ++  NV   + G MGAVNGM PNG  D + +QS EVWTGVTY L+A M++ G  +EA+
Sbjct: 787 RKVYDFNVMKVKGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGLAATMIHAGMEEEAF 846

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 312
           TTA G++   +   G G  F+TPE  T D  YRS  YMR LA++ MQ A  + K
Sbjct: 847 TTAEGIFLAGWSEDGYGYWFQTPEAWTMDGHYRSLMYMRPLAIWGMQYAINRPK 900


>gi|10440626|gb|AAG16864.1|AC069145_13 unknown protein [Oryza sativa Japonica Group]
          Length = 444

 Score =  217 bits (552), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 122/288 (42%), Positives = 166/288 (57%), Gaps = 14/288 (4%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ D S WKDLN KF++ +YRD+A   D+ F   V+P    ++     FD
Sbjct: 104 HDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVWPAVCAVMDYMNQFD 163

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           + G G+++N GFPDQT          AYCGGL +AAL     M+  L D    +++  K 
Sbjct: 164 RDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPFAEKYKLKF 223

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            +A  VY   LW GSY+ +DS  S +S SI ADQL G  +  +SG+    +F    I  A
Sbjct: 224 IQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPLFDENKIRSA 281

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  IF  NV   + G +GAVNGM PNG  D + +QS E+WTGVTY ++A ML  G   + 
Sbjct: 282 LQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQG 341

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           +TTA G++   +   G G  F+TPEG T D  YRS  YMR LA++AMQ
Sbjct: 342 FTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 389


>gi|327288418|ref|XP_003228923.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Anolis
           carolinensis]
          Length = 667

 Score =  216 bits (551), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 104/240 (43%), Positives = 149/240 (62%), Gaps = 12/240 (5%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P+EEP+  +N Y + D +DWKDLNLKF++ +YRD+ L +D  +L  ++P C  ++     
Sbjct: 397 PDEEPWQRVNAYLIHDTADWKDLNLKFVLQVYRDFFLTEDCTYLRDMWPVCQAVMESELK 456

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FD  G G+++N GF DQT          AYCGGL +AA+  M  M+ +L +    Q++SA
Sbjct: 457 FDTDGDGLIENSGFADQTYDAWVVTGASAYCGGLWLAAVCMMCRMAEILGEGAALQRYSA 516

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEA--IFPSAN 195
            L+  +  +  LLW G YY +DS     S+SIM+DQL G  FL+A G+      +FP  +
Sbjct: 517 ILSSGAAAFERLLWNGRYYNYDSGGGPSSNSIMSDQLAGQWFLRACGLGEGKYQVFPQEH 576

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
           +  AL TI+  NV GF  G+MGAVNGM+P+G  D S++QS+EVW GV Y L+A M+ EG 
Sbjct: 577 VQSALRTIYEMNVLGFSEGAMGAVNGMRPDGVPDTSSVQSDEVWVGVVYSLAATMIQEGQ 636


>gi|218184729|gb|EEC67156.1| hypothetical protein OsI_34005 [Oryza sativa Indica Group]
          Length = 974

 Score =  216 bits (551), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 122/288 (42%), Positives = 165/288 (57%), Gaps = 14/288 (4%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ D S WKDLN KF++ +YRD+A   D+ F   V+P     +     FD
Sbjct: 634 HDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVWPAVCAAMDYMNQFD 693

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           + G G+++N GFPDQT          AYCGGL +AAL     M+  L D    +++  K 
Sbjct: 694 RDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPFAEKYKLKF 753

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            +A  VY   LW GSY+ +DS  S +S SI ADQL G  +  +SG+    +F    I  A
Sbjct: 754 IQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPLFDENKIRSA 811

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  IF  NV   + G +GAVNGM PNG  D + +QS E+WTGVTY ++A ML  G   + 
Sbjct: 812 LQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQG 871

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           +TTA G++   +   G G  F+TPEG T D  YRS  YMR LA++AMQ
Sbjct: 872 FTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 919


>gi|333898016|ref|YP_004471890.1| glucosylceramidase [Thermoanaerobacterium xylanolyticum LX-11]
 gi|333113281|gb|AEF18218.1| Glucosylceramidase [Thermoanaerobacterium xylanolyticum LX-11]
          Length = 806

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 119/291 (40%), Positives = 166/291 (57%), Gaps = 17/291 (5%)

Query: 32  PFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKCESFD 89
           P+  IN Y+  + + WKDLN K+++ +YRDY L    D  FL   + +    + K +  D
Sbjct: 516 PWIKINAYDWQNPNIWKDLNSKYVLLVYRDYVLTGKTDKEFLKYTWKSVKTALDKLKEMD 575

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           K   GI  N G PDQT          AYCG L +AAL    E+  +L D +   +++   
Sbjct: 576 KDNDGIPDNEGIPDQTYDTWSMKGTSAYCGSLWLAALKAAQEIGKVLKDNEAYIKYNEWY 635

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
             A Q +   LW G YY FD+ +S H DSIMADQL G  +  A  +    I P  ++ +A
Sbjct: 636 KIAQQNFEKELWNGEYYNFDT-ESDHKDSIMADQLAGQWY--ADILRLGDILPKDHVQKA 692

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  I+  NV  FE G MGAVNGM+P+G  D S IQ++EVWTGVTY L++ M Y G  +EA
Sbjct: 693 LKKIYEFNVMKFENGKMGAVNGMRPDGIVDESDIQAQEVWTGVTYALASFMKYRGMTEEA 752

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAY 308
           + TA G+Y+  Y+++G G  F TPE  T D  YR+  YMR L++++M+  Y
Sbjct: 753 YNTAYGVYKMTYDKSGKGYWFRTPEAWTKDGNYRASMYMRPLSIWSMEVNY 803


>gi|195164866|ref|XP_002023267.1| GL21266 [Drosophila persimilis]
 gi|194105352|gb|EDW27395.1| GL21266 [Drosophila persimilis]
          Length = 867

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 115/257 (44%), Positives = 156/257 (60%), Gaps = 19/257 (7%)

Query: 68  IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
           I +L  +Y  C  ++ +   +DK   G+++N   PDQT          AYC GL +AAL 
Sbjct: 604 IGYLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQ 663

Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
            M  M+++L+      +F   L K  +   + LW GSYY+FD S S H DSIMADQLCG+
Sbjct: 664 AMSAMATILDQPNDCLRFQDILEKGKRSLEEKLWNGSYYRFDLSPS-HRDSIMADQLCGH 722

Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
            +LK+ G + E I+P  N+  AL TI+  NV GF  G++GA NG   N       G  D 
Sbjct: 723 WYLKSCGFDYE-IYPKENVRTALKTIYDNNVMGFHEGNIGAANGFIANAADPSKPGHVDN 781

Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
           S IQ+EEVW GV Y L+A M+ EG  +EA+ TAGG+Y+T+ ER G+ FETPE L G+K Y
Sbjct: 782 SNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSERIGMNFETPEALYGEKRY 841

Query: 291 RSGGYMRALAVYAMQDA 307
           RS GYMR L++++MQ A
Sbjct: 842 RSIGYMRPLSIWSMQVA 858



 Score = 63.9 bits (154), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 31/35 (88%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
           P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 501 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 535


>gi|198473190|ref|XP_002133205.1| GA28784 [Drosophila pseudoobscura pseudoobscura]
 gi|198139343|gb|EDY70607.1| GA28784 [Drosophila pseudoobscura pseudoobscura]
          Length = 865

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 115/257 (44%), Positives = 156/257 (60%), Gaps = 19/257 (7%)

Query: 68  IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
           I +L  +Y  C  ++ +   +DK   G+++N   PDQT          AYC GL +AAL 
Sbjct: 602 IGYLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQ 661

Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
            M  M+++L+      +F   L K  +   + LW GSYY+FD S S H DSIMADQLCG+
Sbjct: 662 AMSAMATILDQPNDCLRFQDILEKGKRSLEEKLWNGSYYRFDLSPS-HRDSIMADQLCGH 720

Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
            +LK+ G + E I+P  N+  AL TI+  NV GF  G++GA NG   N       G  D 
Sbjct: 721 WYLKSCGFDYE-IYPKENVRTALKTIYDNNVMGFHEGNIGAANGFIANAADPSKPGHVDN 779

Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
           S IQ+EEVW GV Y L+A M+ EG  +EA+ TAGG+Y+T+ ER G+ FETPE L G+K Y
Sbjct: 780 SNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSERIGMNFETPEALYGEKRY 839

Query: 291 RSGGYMRALAVYAMQDA 307
           RS GYMR L++++MQ A
Sbjct: 840 RSIGYMRPLSIWSMQVA 856



 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 31/35 (88%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
           P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 499 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 533


>gi|357146604|ref|XP_003574050.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
           distachyon]
          Length = 962

 Score =  214 bits (546), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 122/288 (42%), Positives = 163/288 (56%), Gaps = 14/288 (4%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ D S WKDLN KF++ +YRD+A   D+ F   V+P     +   + FD
Sbjct: 622 HDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMTFGRDVWPAVCAAMDYMDQFD 681

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           + G G+++N GFPDQT          AYCG L +AAL     M+  L D    +++  K 
Sbjct: 682 RDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRPYAEKYKLKF 741

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            KA  VY   LW GSY+ +DS  S +S SI ADQL G  +  +SG+    IF    I  A
Sbjct: 742 IKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPIFDEHKIRSA 799

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  IF  NV   + G MGAVNGM P G  D + +QS E+WTGVTY ++A ML  G   + 
Sbjct: 800 LQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQG 859

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           + TA G++   +   G G  F+TPEG T D  YRS  YMR LA++AMQ
Sbjct: 860 FITAEGIFLAGWSEEGYGYWFQTPEGWTTDGHYRSLIYMRPLAIWAMQ 907


>gi|324500616|gb|ADY40283.1| Non-lysosomal glucosylceramidase [Ascaris suum]
          Length = 890

 Score =  214 bits (546), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 125/298 (41%), Positives = 183/298 (61%), Gaps = 17/298 (5%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY--ALHKDIRFLTRVYPTCLELIRK- 84
           P  +P+ L N Y + D   WKDLNLKFI+  +RDY   L KD RFL  V+P    LI + 
Sbjct: 536 PAVDPWLLTNAYVMHDTGKWKDLNLKFILISFRDYTCVLGKDQRFLNLVWPAVKSLINEG 595

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQ 134
            +++D+ G G+++N G  DQT          AYCG L +AAL   + M+  + D+    +
Sbjct: 596 LKNWDQDGDGMIENFGAADQTYDAWKMSGVSAYCGCLWLAALRAALVMAREVGDSACAHR 655

Query: 135 FSAKLTKASQVYHDLLWTGSYYKFDS-SQSRHSDSIMADQLCGYLFLKA-SGVNTEAIFP 192
           ++  L++A + Y   LW G YY FD  S+SRH+  IMADQLCGY FL++ S    + + P
Sbjct: 656 YAETLSRAREAYITKLWNGEYYNFDERSRSRHT--IMADQLCGYWFLQSISPELADELLP 713

Query: 193 SANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLY 252
           +A++  AL  I++ NV  F  G +GAVNGM+ +G  DR+ IQ++E+WTGVTY L+A ++ 
Sbjct: 714 AAHVKSALNIIYNYNVLRFAGGRLGAVNGMRRDGTVDRNYIQADEMWTGVTYALAAFLIQ 773

Query: 253 EGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 310
           +G V   + TA G Y   + R GL ++TPE L   + YR+ GYMR L+++AMQ A ++
Sbjct: 774 QGAVQRGFDTAWGCYDGCFNRFGLQYQTPEALYEQRFYRAIGYMRPLSIWAMQWALIR 831


>gi|357116998|ref|XP_003560263.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
           distachyon]
          Length = 932

 Score =  214 bits (545), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 122/288 (42%), Positives = 162/288 (56%), Gaps = 14/288 (4%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ D S WKDLN KF++ +YRD+A   D+ F   V+P     +   E FD
Sbjct: 593 HDPWHELNAYNIHDTSRWKDLNPKFVLQVYRDFAATGDMSFGKDVWPAVCTAMEYMEQFD 652

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           + G G+++N GFPDQT          AYCG L +A+L     M+  L      ++   K 
Sbjct: 653 RDGDGMIENDGFPDQTYDAWTVLGVSAYCGCLWLASLQAAAAMARGLGHNDYAERCMVKF 712

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            KA  V+   LW GSY+ +DS  S  S SI ADQL G  +  +SG+    +F    I   
Sbjct: 713 AKAKHVFETKLWNGSYFNYDSGASYSSRSIQADQLAGQWYAASSGL--PPLFDEGRIKCT 770

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  IF  NV   + G MGAVNGM PNG  D + +QS E+WTGVTY L+A ML  G  D+A
Sbjct: 771 LQKIFDYNVMRVKGGRMGAVNGMYPNGKVDETCMQSREIWTGVTYSLAATMLLHGMEDQA 830

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           +TTA G++   +   G G  F+TPE  T D  YRS  YMR LA++AMQ
Sbjct: 831 FTTAEGIFLAGWSEEGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQ 878


>gi|195436933|ref|XP_002066400.1| GK18271 [Drosophila willistoni]
 gi|194162485|gb|EDW77386.1| GK18271 [Drosophila willistoni]
          Length = 955

 Score =  214 bits (544), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 111/263 (42%), Positives = 157/263 (59%), Gaps = 19/263 (7%)

Query: 68  IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
           I +L  +Y  C  ++ +   +DK   G+++N   PDQT          AYC GL +A+L 
Sbjct: 692 IGYLKSMYAACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLASLQ 751

Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
            M  M+++L+      ++   L K      + LW GSYY+FD S S H D+IMADQLCG+
Sbjct: 752 AMSAMATILDQPNDCLRYQDILEKGKHSLEEKLWNGSYYRFDLSHS-HRDTIMADQLCGH 810

Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
            +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N       G  D 
Sbjct: 811 WYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANVSEPSKAGHVDN 869

Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
           S IQ+EEVW GV Y L+A M+ EG  DEA+ TAGG+Y+T+ +R G+ FETPE L G+K Y
Sbjct: 870 SNIQAEEVWPGVVYALAATMIQEGMFDEAFQTAGGMYKTISQRIGMNFETPEALYGEKRY 929

Query: 291 RSGGYMRALAVYAMQDAYLKGKV 313
           RS GYMR L++++MQ A+ + + 
Sbjct: 930 RSIGYMRPLSIWSMQVAWERRRA 952



 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 23/35 (65%), Positives = 31/35 (88%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
           P+EEPF+LIN YN+ DV++WKDLN KF++ +YRDY
Sbjct: 589 PDEEPFTLINCYNIHDVNEWKDLNTKFVLQVYRDY 623


>gi|288558766|gb|ADC53514.1| MIP16949p [Drosophila melanogaster]
          Length = 432

 Score =  213 bits (543), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 111/257 (43%), Positives = 156/257 (60%), Gaps = 19/257 (7%)

Query: 68  IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
           I +L  +Y +C  ++ +   +DK   G+++N   PDQT          AYC GL +AAL 
Sbjct: 169 IGYLKAMYASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQ 228

Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
            M  M+++L+      ++   L K  +   + LW GSYY+FD S S H D+IMADQLCG+
Sbjct: 229 AMSAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLSHS-HRDTIMADQLCGH 287

Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
            +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N       G  D 
Sbjct: 288 WYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVDN 346

Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
           S IQ+EEVW GV Y L+A M+ EG  +EA+ TAGG+Y+T+ +R G+ FETPE L G+K Y
Sbjct: 347 SNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMNFETPEALYGEKRY 406

Query: 291 RSGGYMRALAVYAMQDA 307
           RS GYMR L++++MQ A
Sbjct: 407 RSIGYMRPLSIWSMQVA 423



 Score = 63.9 bits (154), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 31/35 (88%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
           P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 66  PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 100


>gi|195338407|ref|XP_002035816.1| GM15538 [Drosophila sechellia]
 gi|194129696|gb|EDW51739.1| GM15538 [Drosophila sechellia]
          Length = 947

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 111/257 (43%), Positives = 156/257 (60%), Gaps = 19/257 (7%)

Query: 68  IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
           I +L  +Y +C  ++ +   +DK   G+++N   PDQT          AYC GL +AAL 
Sbjct: 684 IGYLKAMYASCKAILERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQ 743

Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
            M  M+++L+      ++   L K  +   + LW GSYY+FD S S H D+IMADQLCG+
Sbjct: 744 AMSAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLSHS-HRDTIMADQLCGH 802

Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
            +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N       G  D 
Sbjct: 803 WYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDN 861

Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
           S IQ+EEVW GV Y L+A M+ EG  +EA+ TAGG+Y+T+ +R G+ FETPE L G+K Y
Sbjct: 862 SNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMNFETPEALYGEKRY 921

Query: 291 RSGGYMRALAVYAMQDA 307
           RS GYMR L++++MQ A
Sbjct: 922 RSIGYMRPLSIWSMQVA 938



 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 31/35 (88%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
           P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 581 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 615


>gi|45552032|ref|NP_788055.2| CG33090 [Drosophila melanogaster]
 gi|74876618|sp|Q7KT91.1|C3390_DROME RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase
 gi|45445134|gb|AAO41192.2| CG33090 [Drosophila melanogaster]
 gi|201065481|gb|ACH92150.1| FI02015p [Drosophila melanogaster]
          Length = 948

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 111/257 (43%), Positives = 156/257 (60%), Gaps = 19/257 (7%)

Query: 68  IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
           I +L  +Y +C  ++ +   +DK   G+++N   PDQT          AYC GL +AAL 
Sbjct: 685 IGYLKAMYASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQ 744

Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
            M  M+++L+      ++   L K  +   + LW GSYY+FD S S H D+IMADQLCG+
Sbjct: 745 AMSAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLSHS-HRDTIMADQLCGH 803

Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
            +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N       G  D 
Sbjct: 804 WYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVDN 862

Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
           S IQ+EEVW GV Y L+A M+ EG  +EA+ TAGG+Y+T+ +R G+ FETPE L G+K Y
Sbjct: 863 SNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMNFETPEALYGEKRY 922

Query: 291 RSGGYMRALAVYAMQDA 307
           RS GYMR L++++MQ A
Sbjct: 923 RSIGYMRPLSIWSMQVA 939



 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 31/35 (88%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
           P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 582 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 616


>gi|358254610|dbj|GAA55931.1| non-lysosomal glucosylceramidase [Clonorchis sinensis]
          Length = 1200

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 119/311 (38%), Positives = 170/311 (54%), Gaps = 31/311 (9%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIR-KCE 86
           PE EP+   N Y ++   DWKDLN KF++ ++RD+ L +D  +L  + P    +I+   +
Sbjct: 540 PEGEPWRDANAYVMYPTKDWKDLNPKFVLQVWRDWKLTQDNDYLLYMLPIVNRIIQVSLQ 599

Query: 87  SFDKQGLGIVQNGGFPDQT----------AYCGGLHIAAL--AC-MVEMSSLLNDT---- 129
           S+D  G G+++N GFPDQT          AY GGL +A L  AC M+E +   N      
Sbjct: 600 SWDSNGDGMIENSGFPDQTYDTWNAQGISAYTGGLWLACLYSACEMLEYALSANSPLKAF 659

Query: 130 -------------QQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCG 176
                        Q Q++    L  A  VY   LWTG+YY + ++   + ++IMADQL G
Sbjct: 660 LIASNSEDEVSWPQVQKELQILLESAKTVYDRALWTGTYYIYQNTPVGNHEAIMADQLSG 719

Query: 177 YLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSE 236
           + FL+  G    AI PS ++  AL TI STN    + GS+G +NG  P   RD S +Q+E
Sbjct: 720 HGFLRVGGAPPGAILPSEHVIMALKTIQSTNWESIQNGSLGVINGGLPGAKRDLSNVQAE 779

Query: 237 EVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYM 296
           E W GV Y L+A M+ EG   E ++ A   Y T+Y R GL ++TPE    D  +R  GYM
Sbjct: 780 EFWVGVNYSLAATMILEGMSQEGFSLAEACYNTIYNRFGLQYQTPEAYMIDGRFRCPGYM 839

Query: 297 RALAVYAMQDA 307
           R LA++++Q A
Sbjct: 840 RPLAIWSIQQA 850


>gi|356576421|ref|XP_003556330.1| PREDICTED: non-lysosomal glucosylceramidase-like [Glycine max]
          Length = 950

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 120/294 (40%), Positives = 167/294 (56%), Gaps = 14/294 (4%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN+ D S WKDLN KF++ +YRD+A   D+ F   V+P     +   E FD+
Sbjct: 610 DPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFATTGDLEFGVDVWPAVRAAMEYMEQFDR 669

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N GFPDQT           YCG L +AAL     M+  L D +  ++   K  
Sbjct: 670 DGDGLIENDGFPDQTYDTWTVHGVSTYCGCLWLAALQAAAAMALDLGDREFAEKCKRKFL 729

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA   + + LW G+Y+ +DS  S +S SI ADQL G  +  +SG+    +F  + I  AL
Sbjct: 730 KAKPAFEEKLWNGTYFNYDSGSSGNSKSIQADQLAGQWYTASSGL--PPLFEDSKIKSAL 787

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
             ++  NV   + G MGAVNGM PNG  D + +QS EVWTGVTY L+A M+  G  +EA+
Sbjct: 788 RKVYDFNVMKVKGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGLAATMILAGMEEEAF 847

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 312
            TA G++   +   G G  F+TPE  T D  YRS  YMR LA++ MQ A  + K
Sbjct: 848 ATAEGIFLAGWSEDGYGYWFQTPEAWTMDGHYRSLMYMRPLAIWGMQYAINRPK 901


>gi|224072655|ref|XP_002303825.1| predicted protein [Populus trichocarpa]
 gi|222841257|gb|EEE78804.1| predicted protein [Populus trichocarpa]
          Length = 966

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 117/288 (40%), Positives = 164/288 (56%), Gaps = 14/288 (4%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P++ +N YN+ D S WKDLN KF++ +YRD+A   D+ F   V+P     +   E FD
Sbjct: 625 HDPWNEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGVDVWPAVRTAMEYMEQFD 684

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           +   G+V+N GFPDQT          AYCG L +A+L     M+  L D    +   +K 
Sbjct: 685 RDDDGLVENDGFPDQTYDAWTVHGVSAYCGCLWLASLQAAAAMAMQLGDKYFAELCKSKF 744

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            KA   +   LW GSY+ +DS  S +S SI ADQL G  ++ +SG+   ++F    I  A
Sbjct: 745 AKAKSAFESKLWNGSYFNYDSGSSNNSKSIQADQLAGEWYMASSGL--PSLFDDVKIRSA 802

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  I+  NV     G MGAVNGM PNG  D + +QS E+W+GVTY ++A M+  G  D+A
Sbjct: 803 LNKIYDFNVMKVRGGKMGAVNGMHPNGKVDETCMQSREIWSGVTYAVAATMILSGMEDKA 862

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           +TTA G++   +   G G  F+TPE  T D  +RS  YMR LA++ MQ
Sbjct: 863 FTTAEGIFTAGWSEEGYGYWFQTPEAWTIDGHFRSLIYMRPLAIWGMQ 910


>gi|195474003|ref|XP_002089281.1| GE19029 [Drosophila yakuba]
 gi|194175382|gb|EDW88993.1| GE19029 [Drosophila yakuba]
          Length = 947

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 111/257 (43%), Positives = 155/257 (60%), Gaps = 19/257 (7%)

Query: 68  IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
           I +L  +Y  C  ++ +   +DK   G+++N   PDQT          AYC GL +AAL 
Sbjct: 684 IGYLKAMYAPCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQ 743

Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
            M  M+++L+      ++   L K  +   + LW GSYY+FD S S H D+IMADQLCG+
Sbjct: 744 AMSAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLSHS-HRDTIMADQLCGH 802

Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
            +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N       G  D 
Sbjct: 803 WYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDN 861

Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
           S IQ+EEVW GV Y L+A M+ EG  +EA+ TAGG+Y+T+ +R G+ FETPE L G+K Y
Sbjct: 862 SNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMNFETPEALYGEKRY 921

Query: 291 RSGGYMRALAVYAMQDA 307
           RS GYMR L++++MQ A
Sbjct: 922 RSIGYMRPLSIWSMQVA 938



 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 31/35 (88%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
           P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 581 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 615


>gi|222636955|gb|EEE67087.1| hypothetical protein OsJ_24071 [Oryza sativa Japonica Group]
          Length = 838

 Score =  211 bits (538), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 121/288 (42%), Positives = 161/288 (55%), Gaps = 14/288 (4%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ D S WKDLN KF++ +YRD+A   D+ F   V+P     +   E FD
Sbjct: 498 HDPWHELNAYNIHDTSRWKDLNPKFVLQVYRDFAATGDMSFGKDVWPAVCTAMEYMEQFD 557

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
               G+++N GFPDQT          AYCG L +AAL     ++  L      ++   K 
Sbjct: 558 HDDDGMIENDGFPDQTYDAWTVRGVSAYCGCLWLAALQTAAALARTLGHHDYAERCMLKF 617

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            KA  V+   LW GSY+ +DS  S +S SI ADQL G  +  +SG+    +F    I   
Sbjct: 618 AKAKPVFEAKLWNGSYFNYDSGTSYNSRSIHADQLAGQWYTASSGL--PPLFDEGRIKCT 675

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  IF  NV   + G MGAVNGM PNG  D + +QS E+WTGVTY L+A ML  G  D+A
Sbjct: 676 LQKIFDYNVMRVKGGRMGAVNGMHPNGKVDETCMQSREIWTGVTYSLAATMLLHGMDDQA 735

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           +TTA G++   +   G G  F+TPE  T D  YRS  YMR LA++AMQ
Sbjct: 736 FTTAEGIFTAGWSEEGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQ 783


>gi|195579186|ref|XP_002079443.1| GD23959 [Drosophila simulans]
 gi|194191452|gb|EDX05028.1| GD23959 [Drosophila simulans]
          Length = 947

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 111/257 (43%), Positives = 155/257 (60%), Gaps = 19/257 (7%)

Query: 68  IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
           I +L  +Y +C  ++ +   +DK   G+++N   PDQT          AYC GL +AAL 
Sbjct: 684 IGYLKAMYASCKAILERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQ 743

Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
            M  M+++L+      ++   L K  +   + LW GSYY+FD S S H D+IMADQLCG+
Sbjct: 744 AMSAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLSHS-HRDTIMADQLCGH 802

Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
            +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N       G  D 
Sbjct: 803 WYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDN 861

Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
           S IQ+EEVW GV Y L+A M+ EG  +EA+ TAGG+Y+T+ +R G+ FETPE L G+K Y
Sbjct: 862 SNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMNFETPEALYGEKRY 921

Query: 291 RSGGYMRALAVYAMQDA 307
           RS GYMR L++ +MQ A
Sbjct: 922 RSIGYMRPLSILSMQVA 938



 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 31/35 (88%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
           P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 581 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 615


>gi|194860149|ref|XP_001969520.1| GG23916 [Drosophila erecta]
 gi|190661387|gb|EDV58579.1| GG23916 [Drosophila erecta]
          Length = 947

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 111/257 (43%), Positives = 155/257 (60%), Gaps = 19/257 (7%)

Query: 68  IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
           I +L  +Y  C  ++ +   +DK   G+++N   PDQT          AYC GL +AAL 
Sbjct: 684 IGYLKAMYAPCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQ 743

Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
            M  M+++L+      ++   L K  +   + LW GSYY+FD S S H D+IMADQLCG+
Sbjct: 744 AMSAMATILDQPNDCLRYQDILEKGKRSLDEKLWNGSYYRFDLSHS-HRDTIMADQLCGH 802

Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
            +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N       G  D 
Sbjct: 803 WYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDN 861

Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
           S IQ+EEVW GV Y L+A M+ EG  +EA+ TAGG+Y+T+ +R G+ FETPE L G+K Y
Sbjct: 862 SNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMNFETPEALYGEKRY 921

Query: 291 RSGGYMRALAVYAMQDA 307
           RS GYMR L++++MQ A
Sbjct: 922 RSIGYMRPLSIWSMQVA 938



 Score = 63.9 bits (154), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 31/35 (88%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
           P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 581 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 615


>gi|194760205|ref|XP_001962332.1| GF15415 [Drosophila ananassae]
 gi|190616029|gb|EDV31553.1| GF15415 [Drosophila ananassae]
          Length = 894

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 110/257 (42%), Positives = 155/257 (60%), Gaps = 19/257 (7%)

Query: 68  IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
           I +L  +Y +C  ++ +   +DK   G+++N   PDQT          +YC GL +AAL 
Sbjct: 631 IGYLKAMYASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSSYCSGLWLAALQ 690

Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
            M  M+++L+      ++   L K  +   + LW GSYY+FD S   H D+IMADQLCG+
Sbjct: 691 AMSAMATILDQPNDCMRYQDILEKGKRSLEEKLWNGSYYRFDLSHG-HRDTIMADQLCGH 749

Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
            +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N       G  D 
Sbjct: 750 WYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANASDPSKPGHVDN 808

Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
           S IQSEEVW GV Y L+A M+ EG  +EA+ TAGG+Y+T+ +R G+ FETPE L G+K Y
Sbjct: 809 SNIQSEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTISQRIGMNFETPEALYGEKRY 868

Query: 291 RSGGYMRALAVYAMQDA 307
           RS GYMR L++++MQ A
Sbjct: 869 RSIGYMRPLSIWSMQVA 885



 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 31/35 (88%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
           P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 528 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 562


>gi|195033785|ref|XP_001988762.1| GH11342 [Drosophila grimshawi]
 gi|193904762|gb|EDW03629.1| GH11342 [Drosophila grimshawi]
          Length = 869

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 109/258 (42%), Positives = 155/258 (60%), Gaps = 19/258 (7%)

Query: 68  IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
           + +L  +Y  C  ++ +   +DK   G+++N   PDQT          AYC GL +AAL 
Sbjct: 606 MSYLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCAGLWLAALQ 665

Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
            M  M++LL+      ++   L +      + LW GSYY+FD S + H D+IMADQLCG+
Sbjct: 666 TMSVMATLLDQPNDCLRYQDILERGKNSLEEKLWNGSYYRFDQSHN-HRDTIMADQLCGH 724

Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
            +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N       G  D 
Sbjct: 725 WYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANANEPSKPGHVDN 783

Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
           S IQ+EEVW GV Y L+A M+ EG  +EA+ TAGG+Y+T+ +R G+ FETPE L G+K Y
Sbjct: 784 SNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTISQRIGMNFETPEALYGEKRY 843

Query: 291 RSGGYMRALAVYAMQDAY 308
           RS GYMR L++++MQ A+
Sbjct: 844 RSIGYMRPLSIWSMQVAW 861



 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 23/35 (65%), Positives = 31/35 (88%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
           P+EEPF+LIN YN+ DV++WKDLN KF++ +YRDY
Sbjct: 503 PDEEPFTLINCYNIHDVNEWKDLNTKFVLQVYRDY 537


>gi|195115764|ref|XP_002002426.1| GI12858 [Drosophila mojavensis]
 gi|193913001|gb|EDW11868.1| GI12858 [Drosophila mojavensis]
          Length = 877

 Score =  210 bits (535), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 110/261 (42%), Positives = 157/261 (60%), Gaps = 19/261 (7%)

Query: 70  FLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM 119
           +L  +Y  C  ++ +   +DK   G+++N   PDQT          AYC GL +AAL  M
Sbjct: 616 YLKSMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDAWVMDGPSAYCSGLWLAALQAM 675

Query: 120 VEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLF 179
             M++LL+      ++   L K  +   + LW GSYY+FD S + H D+IMADQLCG+ +
Sbjct: 676 SVMATLLDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLSHN-HRDTIMADQLCGHWY 734

Query: 180 LKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDRSA 232
           LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N       G  D S 
Sbjct: 735 LKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANVSEPSKPGHVDNSN 793

Query: 233 IQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRS 292
           IQ+EEVW GV Y L+A M+ EG  DEA+ TAGG+Y+T+ +R G+ +ETPE L G+K YRS
Sbjct: 794 IQAEEVWPGVVYALAATMIQEGMFDEAFQTAGGMYKTLSQRIGMNYETPEALYGEKRYRS 853

Query: 293 GGYMRALAVYAMQDAYLKGKV 313
            GYMR L++++MQ A+ + + 
Sbjct: 854 IGYMRPLSIWSMQVAWERRRA 874



 Score = 63.9 bits (154), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 31/35 (88%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
           P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 511 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 545


>gi|218199525|gb|EEC81952.1| hypothetical protein OsI_25831 [Oryza sativa Indica Group]
          Length = 850

 Score =  210 bits (534), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 121/290 (41%), Positives = 161/290 (55%), Gaps = 14/290 (4%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ D S WKDLN KF++ +YRD+    D+ F   V+P     +   E FD
Sbjct: 510 HDPWHELNAYNIHDTSRWKDLNPKFVLQVYRDFVATGDMSFGKDVWPAVCTAMEYMEQFD 569

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
               G+++N GFPDQT          AYCG L +AAL     ++  L      ++   K 
Sbjct: 570 HDDDGMIENDGFPDQTYDAWTVRGVSAYCGCLWLAALQAAAALARTLGHHDYAERCMLKF 629

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            KA  V+   LW GSY+ +DS  S +S SI ADQL G  +  +SG+    +F    I   
Sbjct: 630 AKAKPVFEAKLWNGSYFNYDSGTSYNSRSIHADQLAGQWYTASSGL--PPLFDEGRIKCT 687

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  IF  NV   + G MGAVNGM PNG  D + +QS E+WTGVTY L+A ML  G  D+A
Sbjct: 688 LQKIFDYNVMRVKGGRMGAVNGMHPNGKVDETCMQSREIWTGVTYSLAATMLLHGMDDQA 747

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
           +TTA G++   +   G G  F+TPE  T D  YRS  YMR LA++AMQ A
Sbjct: 748 FTTAEGIFTAGWSEEGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQYA 797


>gi|225460125|ref|XP_002275782.1| PREDICTED: non-lysosomal glucosylceramidase-like [Vitis vinifera]
          Length = 969

 Score =  209 bits (533), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 123/295 (41%), Positives = 169/295 (57%), Gaps = 15/295 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN++D   WKDLN KF++ +YRD     D  F   V+P     I   + FDK
Sbjct: 620 DPWFELNAYNLYDTDRWKDLNSKFVLQVYRDMVATGDKNFARAVWPAVYIAIAFLDQFDK 679

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N GFPDQT          AYCGGL +AAL     M+  + D+     F  K  
Sbjct: 680 DGDGMIENDGFPDQTYDAWSVTGVSAYCGGLWVAALQAASAMAREVGDSMTADYFWFKFQ 739

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY D LW GSY+ +D+S    S SI ADQL G  + +A G+  + I        AL
Sbjct: 740 KAKAVY-DKLWNGSYFNYDNSGGSSSSSIQADQLAGQWYARACGL--QPIVDDEKARSAL 796

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
             +++ NV   + G  GAVNGM P+G  D SA+QS E+W GVTY ++A M++EG V+ A+
Sbjct: 797 EKVYNFNVLKVKEGKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETAF 856

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
            TA G+Y   + + GLG  F+TPE    D+ YRS  YMR LA++AMQ A  K ++
Sbjct: 857 NTASGIYDAAWSQEGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALSKPEL 911


>gi|195397814|ref|XP_002057523.1| GJ18180 [Drosophila virilis]
 gi|194141177|gb|EDW57596.1| GJ18180 [Drosophila virilis]
          Length = 869

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 109/261 (41%), Positives = 156/261 (59%), Gaps = 19/261 (7%)

Query: 70  FLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM 119
           +L  +Y  C  ++ +   +DK   G+++N   PDQT          AYC GL +AAL  M
Sbjct: 608 YLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCAGLWLAALQTM 667

Query: 120 VEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLF 179
             M++LL+      ++   L K      + LW GSYY+FD S + H D+IMADQLCG+ +
Sbjct: 668 SVMATLLDQPNDCLRYQDILEKGKHSLEEKLWNGSYYRFDLSHN-HRDTIMADQLCGHWY 726

Query: 180 LKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDRSA 232
           LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N       G  D S 
Sbjct: 727 LKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPSKPGHVDNSN 785

Query: 233 IQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRS 292
           IQ+EEVW GV Y L+A M+ EG  +EA+ TAGG+Y+T+ +R G+ +ETPE L G+K YRS
Sbjct: 786 IQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTISQRIGMNYETPEALYGEKRYRS 845

Query: 293 GGYMRALAVYAMQDAYLKGKV 313
            GYMR L++++MQ A+ + + 
Sbjct: 846 IGYMRPLSIWSMQVAWERRRA 866



 Score = 63.9 bits (154), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 31/35 (88%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
           P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 503 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 537


>gi|297741007|emb|CBI31319.3| unnamed protein product [Vitis vinifera]
          Length = 953

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 123/295 (41%), Positives = 169/295 (57%), Gaps = 15/295 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN++D   WKDLN KF++ +YRD     D  F   V+P     I   + FDK
Sbjct: 604 DPWFELNAYNLYDTDRWKDLNSKFVLQVYRDMVATGDKNFARAVWPAVYIAIAFLDQFDK 663

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N GFPDQT          AYCGGL +AAL     M+  + D+     F  K  
Sbjct: 664 DGDGMIENDGFPDQTYDAWSVTGVSAYCGGLWVAALQAASAMAREVGDSMTADYFWFKFQ 723

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY D LW GSY+ +D+S    S SI ADQL G  + +A G+  + I        AL
Sbjct: 724 KAKAVY-DKLWNGSYFNYDNSGGSSSSSIQADQLAGQWYARACGL--QPIVDDEKARSAL 780

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
             +++ NV   + G  GAVNGM P+G  D SA+QS E+W GVTY ++A M++EG V+ A+
Sbjct: 781 EKVYNFNVLKVKEGKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETAF 840

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
            TA G+Y   + + GLG  F+TPE    D+ YRS  YMR LA++AMQ A  K ++
Sbjct: 841 NTASGIYDAAWSQEGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALSKPEL 895


>gi|256082537|ref|XP_002577511.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
          Length = 892

 Score =  209 bits (532), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 112/311 (36%), Positives = 170/311 (54%), Gaps = 31/311 (9%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKC-E 86
           PE+EP+   N Y +F    WKDLN KFI+ ++RD+ + +D ++L  + P  L ++RK   
Sbjct: 560 PEDEPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLYMLPIVLRILRKSLV 619

Query: 87  SFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSS------------ 124
           ++D    G+++N GFPDQT          AY GG+ ++ L    +M S            
Sbjct: 620 AWDSDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYATFDMLSWCLKSDSPVYDQ 679

Query: 125 LLNDTQQQQQFSAKL--------TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCG 176
           ++N+T   Q+  +++        TKA   Y+  LWTGSYY F +  +   + IMADQL G
Sbjct: 680 IVNNTDDTQRSWSEIKDEVQTLFTKARDSYNAKLWTGSYYAFQTYCTPRREVIMADQLSG 739

Query: 177 YLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSE 236
           Y F + +GV    I P  ++ + L TI + N  G + G++GA+NG  P    D S++Q+E
Sbjct: 740 YWFSRITGVPPNLILPKNHVVKTLQTISNCNWHGIKNGTIGAINGCLPVCKPDLSSVQAE 799

Query: 237 EVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYM 296
           E W  V Y LS+ M+ EG +DE        Y T+Y   GL ++TPE    D  +R  GYM
Sbjct: 800 EFWVAVNYSLSSLMIAEGMIDEGLALGEKCYDTIYNLYGLQYQTPEAYMSDGRFRCPGYM 859

Query: 297 RALAVYAMQDA 307
           RALA++++Q A
Sbjct: 860 RALAIWSIQQA 870


>gi|30692911|ref|NP_174631.2| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
 gi|79319136|ref|NP_001031135.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
 gi|332193493|gb|AEE31614.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
 gi|332193494|gb|AEE31615.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
          Length = 947

 Score =  209 bits (532), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 119/294 (40%), Positives = 169/294 (57%), Gaps = 15/294 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN+F+   WKDLN KF++ +YRD     D+ F   V+P+    I   + FDK
Sbjct: 598 DPWFELNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 657

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N GFPDQT          AYCGGL +AAL     ++  + D      F+AK  
Sbjct: 658 DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALAREIGDNGAAVYFNAKYE 717

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY + LW GSY+ +D+S+S  S SI+ADQ+ G  + +A G+  + I     I +AL
Sbjct: 718 KARSVY-EKLWNGSYFNYDNSRSGSSSSILADQMAGQWYARACGL--KPIAKEEWIKKAL 774

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
            T++  NV     G+ GAVNGM P+G  D S + S EVW G TY ++A M+ EG  D+ +
Sbjct: 775 ETVYDFNVMRVRDGTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGF 834

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 312
            TA G+Y   +   GLG  F+TPE  T +  YRS  YMR LA++ +Q A+   K
Sbjct: 835 RTASGIYEAAWSDRGLGCAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPK 888


>gi|353232147|emb|CCD79502.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
          Length = 900

 Score =  209 bits (532), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 112/311 (36%), Positives = 170/311 (54%), Gaps = 31/311 (9%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKC-E 86
           PE+EP+   N Y +F    WKDLN KFI+ ++RD+ + +D ++L  + P  L ++RK   
Sbjct: 568 PEDEPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLYMLPIVLRILRKSLV 627

Query: 87  SFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSS------------ 124
           ++D    G+++N GFPDQT          AY GG+ ++ L    +M S            
Sbjct: 628 AWDSDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYATFDMLSWCLKSDSPVYDQ 687

Query: 125 LLNDTQQQQQFSAKL--------TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCG 176
           ++N+T   Q+  +++        TKA   Y+  LWTGSYY F +  +   + IMADQL G
Sbjct: 688 IVNNTDDTQRSWSEIKDEVQTLFTKARDSYNAKLWTGSYYAFQTYCTPRREVIMADQLSG 747

Query: 177 YLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSE 236
           Y F + +GV    I P  ++ + L TI + N  G + G++GA+NG  P    D S++Q+E
Sbjct: 748 YWFSRITGVPPNLILPKNHVVKTLQTISNCNWHGIKNGTIGAINGCLPVCKPDLSSVQAE 807

Query: 237 EVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYM 296
           E W  V Y LS+ M+ EG +DE        Y T+Y   GL ++TPE    D  +R  GYM
Sbjct: 808 EFWVAVNYSLSSLMIAEGMIDEGLALGEKCYDTIYNLYGLQYQTPEAYMSDGRFRCPGYM 867

Query: 297 RALAVYAMQDA 307
           RALA++++Q A
Sbjct: 868 RALAIWSIQQA 878


>gi|28393605|gb|AAO42222.1| unknown protein [Arabidopsis thaliana]
          Length = 947

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 119/294 (40%), Positives = 169/294 (57%), Gaps = 15/294 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN+F+   WKDLN KF++ +YRD     D+ F   V+P+    I   + FDK
Sbjct: 598 DPWFELNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 657

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N GFPDQT          AYCGGL +AAL     ++  + D      F+AK  
Sbjct: 658 DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALAREIGDNGAAVYFNAKYE 717

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY + LW GSY+ +D+S+S  S SI+ADQ+ G  + +A G+  + I     I +AL
Sbjct: 718 KARSVY-EKLWNGSYFNYDNSRSGSSSSILADQMAGQWYARACGL--KPIAKEEWIKKAL 774

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
            T++  NV     G+ GAVNGM P+G  D S + S EVW G TY ++A M+ EG  D+ +
Sbjct: 775 ETVYDFNVMRVRDGTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGF 834

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 312
            TA G+Y   +   GLG  F+TPE  T +  YRS  YMR LA++ +Q A+   K
Sbjct: 835 RTASGIYEAAWSDRGLGCAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPK 888


>gi|326489639|dbj|BAK01800.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 797

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 117/295 (39%), Positives = 166/295 (56%), Gaps = 14/295 (4%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N Y + +   WKDLN KF++ +YRD     D  F   V+P+    +   E FDK
Sbjct: 453 DPWIKVNAYTLHNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYMAMAYMEQFDK 512

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
              G+++N  FPDQT          AYCGGL +AAL     ++  + D   ++ F  K  
Sbjct: 513 DKDGMIENEDFPDQTYDVWSMAGVSAYCGGLWVAALQAASALAHEVGDKASEKLFWNKYE 572

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY   LW GSY+ +D + ++ S SI ADQL G  + KA G+++  I        AL
Sbjct: 573 KAKSVYDKKLWNGSYFNYDDAGTKASTSIHADQLAGQWYAKACGLSS--IVDKDKSQSAL 630

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
             I++ NV  F+ G+ GA+NGM P+G  D S +QS E+W GVTY L+A+M+ EG VDE +
Sbjct: 631 EKIYAFNVMKFKDGNRGAINGMWPDGTLDMSTMQSREIWPGVTYALAASMIQEGMVDEGF 690

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
            TA G+Y   +   GLG  F+TPE    D  YRS  YMR LA++++Q A    K+
Sbjct: 691 KTAEGIYNAAWSTEGLGYAFQTPESWNNDDEYRSLCYMRPLAIWSIQWALSNPKL 745


>gi|255569126|ref|XP_002525532.1| conserved hypothetical protein [Ricinus communis]
 gi|223535211|gb|EEF36890.1| conserved hypothetical protein [Ricinus communis]
          Length = 948

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 123/295 (41%), Positives = 167/295 (56%), Gaps = 18/295 (6%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN+   + WKDLN KF++ IYRD     D  F   V+P+    +   + FDK
Sbjct: 603 DPWFEVNAYNLISTARWKDLNPKFVLQIYRDVVATGDKSFALAVWPSVYVAMAYMDQFDK 662

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N GFPDQT          AYCGGL +AAL     ++  + D++    F  K  
Sbjct: 663 DGDGMIENEGFPDQTYDTWSVTGVSAYCGGLWVAALQAASALACEVGDSESASFFWVKYQ 722

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY   LW GSY+ +DSS +    SI ADQL G  + +A G+++  I     +  AL
Sbjct: 723 KAKAVY-STLWNGSYFNYDSSGN---SSIHADQLAGQWYARACGLSS--IVDEEKVRSAL 776

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
             I+  NV   + G  GAVNGM P+G  D S +QS E+W GVTY LSA+M+ EG  + A+
Sbjct: 777 EKIYKFNVLKVKEGKRGAVNGMLPDGKVDMSVMQSREIWPGVTYALSASMIQEGMAEMAF 836

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
            TA G+Y   + + GLG  F+ PEG   D  YRS  YMR LA++AMQ A  K KV
Sbjct: 837 QTASGIYEAAWSQEGLGYSFQIPEGWNTDDQYRSLCYMRPLAIWAMQWALSKPKV 891


>gi|449442371|ref|XP_004138955.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
 gi|449520930|ref|XP_004167485.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
          Length = 993

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 119/287 (41%), Positives = 165/287 (57%), Gaps = 14/287 (4%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P++ +N YN+ D S WKDLN KF++ +YRD+A  +D+ F   V+P+    I   E FD+
Sbjct: 653 DPWNEMNAYNIHDTSRWKDLNTKFVLQVYRDFAATRDMSFGVDVWPSVRAAIEYMEQFDR 712

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N GFPDQT          AYCG L +AAL     M+  L D +  +   +K  
Sbjct: 713 DGDGVIENDGFPDQTYDTWTVHGISAYCGCLWVAALQAAAAMAHELGDQEFAETCKSKFL 772

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  V    LW GSY+ +DS  S +S SI ADQL G  +  +SG+    +F    I  AL
Sbjct: 773 KARPVLEAELWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASSGL--PPLFDDFKIKSAL 830

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
             I+  NV     G MGAVNGM PNG  D + +QS E+WTGVTY ++A M+  G  +EA+
Sbjct: 831 RKIYDFNVMKVRGGRMGAVNGMHPNGKIDETCMQSREIWTGVTYGVAATMILAGMEEEAF 890

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
            TA G++   +   G G  F+TPE  + D  YRS  YMR L+++ MQ
Sbjct: 891 KTAEGIFLAGWSEEGFGYWFQTPEAWSTDGHYRSLIYMRPLSIWGMQ 937


>gi|9665098|gb|AAF97289.1|AC010164_11 Hypothetical protein [Arabidopsis thaliana]
          Length = 790

 Score =  208 bits (530), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 119/294 (40%), Positives = 169/294 (57%), Gaps = 15/294 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN+F+   WKDLN KF++ +YRD     D+ F   V+P+    I   + FDK
Sbjct: 441 DPWFELNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 500

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N GFPDQT          AYCGGL +AAL     ++  + D      F+AK  
Sbjct: 501 DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALAREIGDNGAAVYFNAKYE 560

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY + LW GSY+ +D+S+S  S SI+ADQ+ G  + +A G+  + I     I +AL
Sbjct: 561 KARSVY-EKLWNGSYFNYDNSRSGSSSSILADQMAGQWYARACGL--KPIAKEEWIKKAL 617

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
            T++  NV     G+ GAVNGM P+G  D S + S EVW G TY ++A M+ EG  D+ +
Sbjct: 618 ETVYDFNVMRVRDGTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGF 677

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 312
            TA G+Y   +   GLG  F+TPE  T +  YRS  YMR LA++ +Q A+   K
Sbjct: 678 RTASGIYEAAWSDRGLGCAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPK 731


>gi|414589150|tpg|DAA39721.1| TPA: hypothetical protein ZEAMMB73_266944 [Zea mays]
          Length = 583

 Score =  208 bits (530), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 121/288 (42%), Positives = 159/288 (55%), Gaps = 14/288 (4%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ D S WKDLN KF++ IYRD+A   D  F   V+P     +   E FD
Sbjct: 244 HDPWHELNAYNIHDTSRWKDLNPKFVLQIYRDFAATGDKSFGKDVWPAVCTAMEYMEQFD 303

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
             G G+++N GFPDQT          AYCG L +AAL     ++  L      ++   + 
Sbjct: 304 HDGDGMIENDGFPDQTYDAWTVQGVSAYCGCLWLAALQAAAALAGSLGHDDYAERCMTRF 363

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            KA  V+   LW GS++ +DS  S  S SI ADQL G  +  +SG+    +F    I   
Sbjct: 364 VKAKTVFEARLWNGSHFNYDSGTSYSSRSIQADQLAGQWYTASSGL--PPLFDEDRIKCT 421

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  IF  NV   + G MGAVNGM PNG  D + +QS E+WTGVTY L+A ML  G   +A
Sbjct: 422 LQKIFDYNVMRVKGGRMGAVNGMHPNGKVDETCMQSREIWTGVTYSLAATMLLHGMEHQA 481

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           +TTA G+Y   +   G G  F+TPE  T D  YRS  YMR LA++AMQ
Sbjct: 482 FTTAEGIYTAGWSEEGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQ 529


>gi|440799031|gb|ELR20092.1| glucosylceramidase [Acanthamoeba castellanii str. Neff]
          Length = 860

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 118/309 (38%), Positives = 173/309 (55%), Gaps = 31/309 (10%)

Query: 28  PEEEPFSLINGYNVFDVSDWK---------------------DLNLKFIVSIYRDYALHK 66
           P ++P+  +N YN+ + S WK                     DLN KF++ +YRD+   +
Sbjct: 543 PADDPWVALNAYNIHETSSWKGKLGLLPASHLSPSTELTEAADLNSKFVLGVYRDFVATQ 602

Query: 67  DIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT--AYCGGLHIAALACMVEMSS 124
           D RFL  V+P     +R  + FD+   G+++N G PDQT  ++ GGL +AAL+    M+ 
Sbjct: 603 DKRFLADVWPAVDAALRHLQQFDRNKDGMIENEGLPDQTYHSWTGGLWVAALSAACAMAQ 662

Query: 125 LLN--DTQQQQQFSAKLTKASQVYHDLLWTGS-YYKFDSSQSRHSDSIMADQLCGYLFLK 181
            L   D    Q +      A++VY++ LW  S Y+ +D+S S H DSI ADQLCG  + +
Sbjct: 663 ALQLADDSVAQHYRDLHQSAARVYNESLWNPSGYFNYDTSTSTHHDSIQADQLCGQWYAR 722

Query: 182 ASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTG 241
           A G+ +  I P   I  AL T+F+ N   F  G  GAVNGM+P+G  D + + S E W G
Sbjct: 723 ACGLPS--IAPDKRIVTALKTVFANN---FNEGKRGAVNGMRPDGRVDYACLHSSEAWPG 777

Query: 242 VTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAV 301
            T+ L+A M+ EG VD+A+ TA G+  ++YE  G  F TPE   G   YR+ G+MR LA+
Sbjct: 778 TTFALAACMMGEGLVDQAFITAAGVCGSIYEDLGYWFRTPEAWDGSGNYRALGHMRPLAI 837

Query: 302 YAMQDAYLK 310
           +AMQ A+ +
Sbjct: 838 WAMQWAWTQ 846


>gi|297851790|ref|XP_002893776.1| hypothetical protein ARALYDRAFT_473529 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339618|gb|EFH70035.1| hypothetical protein ARALYDRAFT_473529 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 947

 Score =  208 bits (529), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 118/290 (40%), Positives = 168/290 (57%), Gaps = 15/290 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN+F+   WKDLN KF++ +YRD     D+ F   V+P+    I   + FDK
Sbjct: 598 DPWFEVNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 657

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N GFPDQT          AYCGGL +AAL     ++  + D      F+AK  
Sbjct: 658 DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALAREVGDNGAAVYFNAKYE 717

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY + LW GSY+ +D+S+S  S SI+ADQL G  + +A G+  + I     I +AL
Sbjct: 718 KARSVY-EKLWNGSYFNYDNSRSGSSSSILADQLAGQWYARACGL--KPIAKEEWIKKAL 774

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
            T++  NV     G+ GAVNGM P+G  D S + S EVW G TY ++A M+ EG  D+ +
Sbjct: 775 ETVYDFNVMRVRDGTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGF 834

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAY 308
            TA G+Y   +   GLG  F+TPE  T +  +RS  YMR LA++ +Q A+
Sbjct: 835 RTASGIYEAAWSDRGLGCAFQTPEAWTTNDEFRSLCYMRPLAIWGIQWAH 884


>gi|242070593|ref|XP_002450573.1| hypothetical protein SORBIDRAFT_05g007160 [Sorghum bicolor]
 gi|241936416|gb|EES09561.1| hypothetical protein SORBIDRAFT_05g007160 [Sorghum bicolor]
          Length = 951

 Score =  208 bits (529), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 121/298 (40%), Positives = 168/298 (56%), Gaps = 21/298 (7%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N Y +++   WKDLN KF++ +YRD     D  F   V+P+    +   E FDK
Sbjct: 608 DPWFKVNAYTLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYIAMAYMEQFDK 667

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
              G+++N  FPDQT          AYCGGL +AAL     ++  + D   ++ F  K  
Sbjct: 668 DKDGMIENEDFPDQTYDVWSMAGISAYCGGLWVAALQAASALAREVGDKASEKLFWDKYE 727

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA- 199
           KA  VY  L W GSY+ +D   ++ S SI ADQL G  + KA G     +FP  +  +A 
Sbjct: 728 KAKSVYSKL-WNGSYFSYDDGDNKVSTSIQADQLAGQWYAKACG-----LFPIVDKDKAQ 781

Query: 200 --LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
             L  IFS NV  F+ G+ GA+NGM P+G  D SA+QS E+W GVTY L+A+M+ EG V+
Sbjct: 782 SALEKIFSFNVMKFKDGTRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAASMIQEGMVE 841

Query: 258 EAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
           + + TA G+Y   +   GLG  F+TPE    D  YRS  YMR LA++AMQ A    K+
Sbjct: 842 QGFKTAEGIYHAAWSPEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAMQWALSNPKL 899


>gi|224129460|ref|XP_002328722.1| predicted protein [Populus trichocarpa]
 gi|222839020|gb|EEE77371.1| predicted protein [Populus trichocarpa]
          Length = 922

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 117/288 (40%), Positives = 163/288 (56%), Gaps = 15/288 (5%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
           ++P+  +N YN+ +   WKDLN KF++ +YRD     D +F    +P+    +   + FD
Sbjct: 576 DDPWFEVNAYNLHNTDRWKDLNPKFVLQVYRDVVATGDKKFAQAAWPSVYVAMAYMDQFD 635

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           K G G+++N GFPDQT          AYCGGL +AAL     ++  + D    + F  + 
Sbjct: 636 KDGDGMIENDGFPDQTYDTWSVSGVSAYCGGLWVAALQAASALAWEVGDKDSAEYFWFRF 695

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            KA  VY D LW GSY+ +D S  R+S SI ADQL G  + +A G+    I        A
Sbjct: 696 QKAKVVY-DKLWNGSYFNYDDSNGRNSPSIQADQLAGQWYARACGL--LPIVDEDKARSA 752

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  I++ N      G  GAVNGM P+G  D S +QS E+W+GVTY ++A M+ EG +D A
Sbjct: 753 LEKIYNYNFLKVHDGKRGAVNGMLPDGTVDMSDMQSREIWSGVTYAVAATMMQEGLMDMA 812

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           + TA G+Y   +   GLG  F+TPEG   +  YRS GYMR LA++AMQ
Sbjct: 813 FHTASGVYEAAWAEQGLGYSFQTPEGWNTNGQYRSLGYMRPLAIWAMQ 860


>gi|449482327|ref|XP_004156248.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Cucumis
           sativus]
          Length = 508

 Score =  207 bits (527), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 119/291 (40%), Positives = 169/291 (58%), Gaps = 15/291 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN+ +VS WKDL  KF++ +YRD     D  F   V+P+    +   E FDK
Sbjct: 160 DPWLEVNAYNLLNVSRWKDLGSKFVLQVYRDVVATGDKNFAKSVWPSVYVALAFMEQFDK 219

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
              G+++N GFPDQT          AYCGGL +AAL     ++S ++D      F  K  
Sbjct: 220 DKDGMIENEGFPDQTYDTWTVKGVSAYCGGLWVAALQAASALASEVDDEAAAHYFWIKYQ 279

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY + LW GSY+ +D+S+   S SI ADQL G  + +A G+    I     I  AL
Sbjct: 280 KARSVY-ETLWNGSYFNYDNSKGPWSSSIQADQLAGQWYARACGLCP--IADEEKIRVAL 336

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
             I++ NV   + G+ GAVNGM P+G  D+S +Q +E+W GVTY ++A M+ EG V+  +
Sbjct: 337 EKIYNFNVMKVKGGTRGAVNGMFPDGSVDKSILQPKEIWAGVTYSVAATMIQEGMVETGF 396

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYL 309
            TA G+Y+  + + GLG  F+TPE    D  +RS GYMR LA++AMQ A +
Sbjct: 397 QTAMGIYQAAWAQDGLGYSFQTPEAWDVDDRFRSIGYMRPLAIWAMQWAMM 447


>gi|339252284|ref|XP_003371365.1| AP-1 complex subunit gamma-1 [Trichinella spiralis]
 gi|316968412|gb|EFV52690.1| AP-1 complex subunit gamma-1 [Trichinella spiralis]
          Length = 1615

 Score =  207 bits (527), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 114/277 (41%), Positives = 175/277 (63%), Gaps = 17/277 (6%)

Query: 31   EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY-ALHKDIRFLTRVYPTCLELIRKC-ESF 88
            EP+  +N Y   D S+WKDLNLKF+++ YRDY  L+K   FL  V+P   E+I    +++
Sbjct: 1341 EPWLEVNAYAFHDTSEWKDLNLKFVLTSYRDYLMLNKSESFLRHVWPAVKEIIETALQNW 1400

Query: 89   DKQGLGIVQNGGFPDQTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHD 148
            D  G  +V+N G  DQT     +H   L   VE++  L + Q++   S +          
Sbjct: 1401 DHDGDKLVENDGTCDQTYDAWCMHWTLL---VELAKRL-EMQKKSLISEQC--------- 1447

Query: 149  LLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNV 208
             +  G+YY+FD  QS +S ++MADQLCGY +LK  G+ T+ I P  N+ + L TI++ NV
Sbjct: 1448 WMMRGTYYQFDQ-QSCNSSTVMADQLCGYAYLKICGLQTD-ILPLDNVKKVLETIYNLNV 1505

Query: 209  RGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYR 268
              F  G++GAVNGM  +G++D S++Q++E WTGVTY LSA M+ EG V++ ++TA G+Y+
Sbjct: 1506 CSFGNGTLGAVNGMLYSGEKDTSSLQADEAWTGVTYFLSAHMISEGFVEQGFSTASGIYK 1565

Query: 269  TVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
            + +E  G+ ++TPE L   K +R+ GYMR L+++A+Q
Sbjct: 1566 SCFESFGMHYQTPEALYEKKWFRAVGYMRPLSIWAIQ 1602


>gi|253761389|ref|XP_002489100.1| hypothetical protein SORBIDRAFT_0073s002050 [Sorghum bicolor]
 gi|241947399|gb|EES20544.1| hypothetical protein SORBIDRAFT_0073s002050 [Sorghum bicolor]
          Length = 603

 Score =  207 bits (526), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 121/288 (42%), Positives = 159/288 (55%), Gaps = 14/288 (4%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ D S WKDLN KF++ IYRD+A   D  F   V+P     +   E FD
Sbjct: 264 HDPWHELNAYNIHDTSRWKDLNPKFVLQIYRDFAATGDKSFGKDVWPAVCTAMEYMEQFD 323

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
             G  +++N GFPDQT          AYCG L +AAL     ++  L      ++   + 
Sbjct: 324 HDGDDMIENDGFPDQTYDAWTVQGVSAYCGCLWLAALQAAAALARSLGHDDYAERCMTRF 383

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            KA  V+   LW GSY+ +DS  S +S SI ADQL G  +  +SG+    +F    I   
Sbjct: 384 AKAKSVFEARLWNGSYFNYDSGTSYNSRSIQADQLAGQWYTASSGL--PPLFDEDRIKCT 441

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  IF  NV   + G MGAVNGM PNG  D + +QS E+WTGVTY L+A ML  G   +A
Sbjct: 442 LQKIFDYNVMRVKGGRMGAVNGMHPNGKVDDTCMQSREIWTGVTYSLAATMLLHGMEHQA 501

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           +TTA G+Y   +   G G  F+TPE  T D  YRS  YMR LA++AMQ
Sbjct: 502 FTTAEGIYIAGWSEEGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQ 549


>gi|66801399|ref|XP_629625.1| hypothetical protein DDB_G0292446 [Dictyostelium discoideum AX4]
 gi|60463006|gb|EAL61202.1| hypothetical protein DDB_G0292446 [Dictyostelium discoideum AX4]
          Length = 1302

 Score =  207 bits (526), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 110/296 (37%), Positives = 167/296 (56%), Gaps = 13/296 (4%)

Query: 28   PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELI-RKCE 86
            P E+P+  +N Y + DVS WKDL  KF++ +YRDY + +D  FL + Y     +I R  E
Sbjct: 964  PGEDPWKRVNSYCIQDVSRWKDLPSKFVLQVYRDYLVMEDKNFLLQCYNVVEGVIQRTLE 1023

Query: 87   SFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFS 136
             FD    G+V N GFPDQT          AY GGL +AAL    E++ +L   + +  ++
Sbjct: 1024 HFDFDHDGVVDNEGFPDQTYDVWPATGCSAYSGGLWLAALKVAAEIAKILGFNEDESIYN 1083

Query: 137  AKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANI 196
            A   K S+ +   LW G Y+ +DSS+S HSDSIM+D L G+ +L + G+ +   F  A  
Sbjct: 1084 AIFEKGSKSFTKKLWNGHYFNYDSSKSVHSDSIMSDMLAGHWYLLSCGLKSYMTFDQA-- 1141

Query: 197  ARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNV 256
              +L+ I   NV+ +  G  GAVNGM+P G  D + +QS EVW G ++ L+A M+     
Sbjct: 1142 LSSLSIINEYNVKSYSKGVCGAVNGMRPEGTVDTTCLQSSEVWIGTSFSLAATMILHHMD 1201

Query: 257  DEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 312
             +AW    G+  + Y++ G  ++TPE    +  +R+  YMR L+++++Q A  K K
Sbjct: 1202 SDAWDLVKGIVNSSYQKWGFQYQTPEAWDQNGCFRAASYMRPLSIWSIQWALQKRK 1257


>gi|302762052|ref|XP_002964448.1| hypothetical protein SELMODRAFT_166634 [Selaginella moellendorffii]
 gi|300168177|gb|EFJ34781.1| hypothetical protein SELMODRAFT_166634 [Selaginella moellendorffii]
          Length = 916

 Score =  207 bits (526), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 113/288 (39%), Positives = 165/288 (57%), Gaps = 18/288 (6%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P++ +N YN+ D S WKDLN KF+V + RD A   D+ F   V+P     +   + FD
Sbjct: 565 HDPWNTLNSYNLHDTSKWKDLNCKFVVQVNRDAAATGDLEFARAVWPAAYAAMAVTDQFD 624

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           + G G+++N G PDQT          +YCGGL +AAL     ++ L+ D    + F  K 
Sbjct: 625 RDGDGMIENDGVPDQTYDLWTVDGVSSYCGGLWVAALQAASALADLVGDKPSSRMFHGKF 684

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            +A  VY   LW GSY+ +D    ++S S+ +DQL G  +  ASG+    I        A
Sbjct: 685 LRAKAVYEKSLWNGSYFHYD----KNSTSVQSDQLAGQWYSLASGL--PGIVREDQATSA 738

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L T+F+TNV  ++ G+ GAVNGM P+G  DR+++QS EVW G TY ++AAM+ +G  +E 
Sbjct: 739 LGTVFATNVMRYKGGNEGAVNGMLPSGGEDRTSLQSREVWAGTTYAVAAAMIQQGMREEG 798

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           + TA G++   +   G G  F+TPE    D  YRS  YMR L+++AMQ
Sbjct: 799 FRTAKGVFLNGWSDQGHGYAFQTPEAWDNDGKYRSLAYMRPLSIWAMQ 846


>gi|301628717|ref|XP_002943495.1| PREDICTED: non-lysosomal glucosylceramidase [Xenopus (Silurana)
           tropicalis]
          Length = 692

 Score =  207 bits (526), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 102/230 (44%), Positives = 143/230 (62%), Gaps = 12/230 (5%)

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           FD+ G G+++N GF DQT          +YCGGL +AA+  M +M+ +L D   Q++F+ 
Sbjct: 444 FDEDGDGLIENSGFADQTYDDWVMTGPSSYCGGLWLAAVCMMCKMAEVLGDGAAQEKFTD 503

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT--EAIFPSAN 195
            L +    +  LLW G YY FD     +S+S+M DQ  G  FL+A G+      +FP  +
Sbjct: 504 ILHRGKAAFEKLLWNGKYYNFDCGDQPYSNSVMVDQCAGNWFLRACGLGAGESEVFPKDH 563

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
           +  AL TIF  NV+ F  G MGAVNGM+P+G  D S++QS+EVW GV Y L+A M++EG 
Sbjct: 564 VVSALRTIFELNVKQFADGQMGAVNGMRPDGTTDTSSVQSDEVWIGVVYGLAATMIHEGL 623

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           V E + TA G YRTV+ER G+ F+TPE     + +RS  YMR L+++AMQ
Sbjct: 624 VQEGFATAEGCYRTVWERLGMSFQTPEAYCEKQVFRSLAYMRPLSIWAMQ 673


>gi|449447581|ref|XP_004141546.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
          Length = 944

 Score =  207 bits (526), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 119/291 (40%), Positives = 169/291 (58%), Gaps = 15/291 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN+ +VS WKDL  KF++ +YRD     D  F   V+P+    +   E FDK
Sbjct: 596 DPWLEVNAYNLLNVSRWKDLGSKFVLQVYRDVVATGDKNFAKSVWPSVYVALAFMEQFDK 655

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
              G+++N GFPDQT          AYCGGL +AAL     ++S ++D      F  K  
Sbjct: 656 DKDGMIENEGFPDQTYDTWTVKGVSAYCGGLWVAALQAASALASEVDDEAAAHYFWIKYQ 715

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY + LW GSY+ +D+S+   S SI ADQL G  + +A G+    I     I  AL
Sbjct: 716 KARSVY-ETLWNGSYFNYDNSKGPWSSSIQADQLAGQWYARACGLC--PIADEEKIRVAL 772

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
             I++ NV   + G+ GAVNGM P+G  D+S +Q +E+W GVTY ++A M+ EG V+  +
Sbjct: 773 EKIYNFNVMKVKGGTRGAVNGMFPDGSVDKSILQPKEIWAGVTYSVAATMIQEGMVETGF 832

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYL 309
            TA G+Y+  + + GLG  F+TPE    D  +RS GYMR LA++AMQ A +
Sbjct: 833 QTAMGIYQAAWAQDGLGYSFQTPEAWDVDDRFRSIGYMRPLAIWAMQWAMM 883


>gi|302787158|ref|XP_002975349.1| hypothetical protein SELMODRAFT_174907 [Selaginella moellendorffii]
 gi|300156923|gb|EFJ23550.1| hypothetical protein SELMODRAFT_174907 [Selaginella moellendorffii]
          Length = 916

 Score =  206 bits (525), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 113/288 (39%), Positives = 166/288 (57%), Gaps = 18/288 (6%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P++ +N YN++D S WKDLN KF+V + RD A   D+ F   V+P     +   + FD
Sbjct: 565 HDPWNTLNSYNLYDTSKWKDLNCKFVVQVNRDAAATGDLDFARAVWPAAYAAMAFTDQFD 624

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           + G G+++N G PDQT          +YCGGL +AAL     ++ L+ D    + F  K 
Sbjct: 625 RDGDGMIENDGVPDQTYDLWTVDGVSSYCGGLWVAALQAASALADLVGDKPSSRMFHDKF 684

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            +A  VY   LW GSY+ +D    ++S S+ +DQL G  +  ASG+    I        A
Sbjct: 685 LRAKAVYEKSLWNGSYFHYD----KNSTSVQSDQLAGQWYSLASGL--PGIVREDQATSA 738

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L T+F+TNV  ++ G+ GAVNGM P+G  DR+++QS EVW G TY ++AAM+ +G  +E 
Sbjct: 739 LGTVFATNVMRYKGGNEGAVNGMLPSGGEDRTSLQSREVWVGTTYAVAAAMIQQGMREEG 798

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           + TA G++   +   G G  F+TPE    D  YRS  YMR L+++AMQ
Sbjct: 799 FRTAKGVFLNGWSDQGHGYAFQTPEAWDNDGKYRSLAYMRPLSIWAMQ 846


>gi|302773445|ref|XP_002970140.1| hypothetical protein SELMODRAFT_92385 [Selaginella moellendorffii]
 gi|300162651|gb|EFJ29264.1| hypothetical protein SELMODRAFT_92385 [Selaginella moellendorffii]
          Length = 983

 Score =  206 bits (525), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 118/288 (40%), Positives = 168/288 (58%), Gaps = 14/288 (4%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ D S WKDLN KF++ IYRD    +D+ F    +P     +   + FD
Sbjct: 634 HDPWVEVNAYNIHDTSRWKDLNSKFVLQIYRDVVATEDLNFARVTWPAVYTAMAYMDRFD 693

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           +   G+++N GFPDQT          AYCGGL +AAL     M+  +N+      F  K 
Sbjct: 694 RDRDGMIENDGFPDQTYDTWTVHGISAYCGGLWLAALQAAAAMAEKVNEPNAANYFLRKF 753

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
             A +VY   LW GSY+ +DS  S +S+SI ADQ+ G  +  ASG+    +F       A
Sbjct: 754 RDARKVYDKKLWNGSYFNYDSGSSSNSNSIQADQMAGQWYAWASGL--PPLFDDYKARSA 811

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  I+  NV   + G +GAVNGM PNG  D +A+QS E+W+GVTY  +AAM++EG +++A
Sbjct: 812 LQRIYDYNVMKVKGGKLGAVNGMHPNGRVDETAMQSREIWSGVTYAAAAAMIHEGMLEQA 871

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           +TTA G+Y   +  +G G  F+TPE  T D  +R+  YMR LA++AMQ
Sbjct: 872 FTTAEGIYNAGWSDSGYGYWFQTPEAWTTDGHFRALTYMRPLAIWAMQ 919


>gi|413920603|gb|AFW60535.1| hypothetical protein ZEAMMB73_659971 [Zea mays]
          Length = 951

 Score =  206 bits (525), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 120/298 (40%), Positives = 169/298 (56%), Gaps = 21/298 (7%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N Y +++   WKDLN KF++ +YRD     D  F   V+P+    +   E FDK
Sbjct: 608 DPWFKVNAYTLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYIAMAYMEQFDK 667

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
              G+++N  FPDQT          AYCGGL +AAL     ++  + D   ++ F  K  
Sbjct: 668 DKDGMIENEDFPDQTYDVWSMAGISAYCGGLWVAALQAASALAREVGDKASEKLFWDKYE 727

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA- 199
           KA  VY  L W GSY+ +D   ++ S SI ADQL G  + KA G     +FP  +  +A 
Sbjct: 728 KAKFVYSKL-WNGSYFSYDDGDNKMSTSIQADQLAGQWYAKACG-----LFPIVDKDKAQ 781

Query: 200 --LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
             L  ++S NV  F+ G+ GA+NGM P+G  D SA+QS E+W GVTY L+A+M+ EG V+
Sbjct: 782 SALEKVYSFNVMKFKGGTRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAASMIQEGMVE 841

Query: 258 EAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
           + + TA G+Y   +   GLG  F+TPE    D  YRS  YMR LA++AMQ A  K K+
Sbjct: 842 QGFKTAEGVYHAAWSPGGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAMQWALSKPKL 899


>gi|356520149|ref|XP_003528727.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal
           glucosylceramidase-like [Glycine max]
          Length = 944

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 121/287 (42%), Positives = 162/287 (56%), Gaps = 15/287 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +NGYN+++   WKDLN KF++ IYRD     D +F   V+P     I   + FDK
Sbjct: 599 DPWFEVNGYNLYNTDRWKDLNPKFVLQIYRDVVATGDKKFAQAVWPAVYIAIAYMDQFDK 658

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N GFPDQT          AY GGL +AAL     ++  + D   +  F  K  
Sbjct: 659 DGDGMIENEGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALAREVGDKGSEDYFWLKFQ 718

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY +L W GSY+ +DSS    S SI ADQL G  + +A G+    I        AL
Sbjct: 719 KAKSVYENL-WNGSYFNYDSSGGSSSLSIQADQLAGQWYARACGL--LPIVEEKKSRSAL 775

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
             +++ NV     G  GAVNGM P+G  D S +QS E+W+GVTY L+A M+ E  +D A+
Sbjct: 776 QIVYNHNVMKVHXGRRGAVNGMLPDGKVDMSTMQSREIWSGVTYALAATMIQENMIDMAF 835

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
            TAGG+Y T +   GLG  F+TPE  T    YRS  YMR LA++AMQ
Sbjct: 836 QTAGGVYETAWSNNGLGYSFQTPEAWTTKDEYRSLCYMRPLAIWAMQ 882


>gi|302807226|ref|XP_002985326.1| hypothetical protein SELMODRAFT_121820 [Selaginella moellendorffii]
 gi|300147154|gb|EFJ13820.1| hypothetical protein SELMODRAFT_121820 [Selaginella moellendorffii]
          Length = 983

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 118/288 (40%), Positives = 168/288 (58%), Gaps = 14/288 (4%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ D S WKDLN KF++ IYRD    +D+ F    +P     +   + FD
Sbjct: 634 HDPWVEVNAYNIHDTSRWKDLNSKFVLQIYRDVVATEDLNFARVTWPAVYTAMAYMDRFD 693

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           +   G+++N GFPDQT          AYCGGL +AAL     M+  +N+      F  K 
Sbjct: 694 RDRDGMIENDGFPDQTYDTWTVHGISAYCGGLWLAALQAAAAMAEKVNEPNAANYFLRKF 753

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
             A +VY   LW GSY+ +DS  S +S+SI ADQ+ G  +  ASG+    +F       A
Sbjct: 754 RDARKVYDKKLWNGSYFNYDSGSSSNSNSIQADQMAGQWYAWASGL--PPLFDDYKARSA 811

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  I+  NV   + G +GAVNGM PNG  D +A+QS E+W+GVTY  +AAM++EG +++A
Sbjct: 812 LQRIYDYNVMKVKGGKLGAVNGMHPNGRVDETAMQSREIWSGVTYAAAAAMIHEGMLEQA 871

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           +TTA G+Y   +  +G G  F+TPE  T D  +R+  YMR LA++AMQ
Sbjct: 872 FTTAEGIYNAGWSDSGYGYWFQTPEAWTTDGHFRALTYMRPLAIWAMQ 919


>gi|225433442|ref|XP_002285674.1| PREDICTED: non-lysosomal glucosylceramidase-like [Vitis vinifera]
          Length = 978

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 119/288 (41%), Positives = 167/288 (57%), Gaps = 14/288 (4%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ D S WKDLN KF++ +YRD+A  +D  F   V+P     +   E FD
Sbjct: 638 HDPWHEMNAYNIHDTSQWKDLNPKFVLQVYRDFAATRDFSFGADVWPAVRAAMEYMEQFD 697

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           +   G+++N GFPDQT          AYCG L +AAL     M+  L D    ++  +K 
Sbjct: 698 RDSDGLIENDGFPDQTYDTWTVHGISAYCGCLWLAALQAAAAMALQLGDKPFAEKCKSKF 757

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            KA  V+ + LW GSY+ +DS  S +S SI ADQL G  +  +SG+   ++F    I  +
Sbjct: 758 FKAKLVFEEKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASSGL--PSLFDDYKIKSS 815

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  I+  NV   + G MGAVNGM PNG  D S +QS E+WTGVTY ++A M+  G  ++A
Sbjct: 816 LHKIYDFNVMKVKGGKMGAVNGMHPNGKVDESCMQSREIWTGVTYGVAATMILSGMEEQA 875

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           +TTA G++   +   G G  F+TPEG T D  +RS  YMR LA++ MQ
Sbjct: 876 FTTAEGIFTAGWSEEGYGYWFQTPEGWTIDGHFRSLIYMRPLAIWGMQ 923


>gi|224146269|ref|XP_002325943.1| predicted protein [Populus trichocarpa]
 gi|222862818|gb|EEF00325.1| predicted protein [Populus trichocarpa]
          Length = 891

 Score =  205 bits (522), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 119/294 (40%), Positives = 165/294 (56%), Gaps = 18/294 (6%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  IN YN+F  + WKDLN KF++ +YRD     D  F   V+P+    +   + FDK
Sbjct: 549 DPWFEINAYNLFSTARWKDLNCKFVLQVYRDVVATGDKDFARAVWPSVYVAMAYMDQFDK 608

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N G PDQT          AYCGGL +AAL     M+  + D      F  K  
Sbjct: 609 DGDGMIENEGIPDQTYDTWSVTGVSAYCGGLWVAALQATSAMAHEVGDDGSANYFWIKYQ 668

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY  L W GSY+ +DSS ++   SI ADQL G  + +A G++   +       +AL
Sbjct: 669 KAKTVYGKL-WNGSYFNYDSSGNK---SIQADQLAGQWYARACGLS--PVVDGDKARKAL 722

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
             I+  NV   + G+ GAVNGM P+G  D +A+Q++E+W GVTY LSA+M+ EG  + A+
Sbjct: 723 EKIYEFNVLKVKGGTRGAVNGMLPDGRVDLTAMQTKEIWPGVTYALSASMIQEGLEEMAF 782

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 312
            TA G+Y   +   GLG  F+ PE    +  YRS  YMR LA++AMQ A  K K
Sbjct: 783 QTAVGIYNAAWSEEGLGYSFQIPESWDMNDQYRSLCYMRPLAIWAMQWALSKPK 836


>gi|357440847|ref|XP_003590701.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
 gi|355479749|gb|AES60952.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
          Length = 992

 Score =  204 bits (519), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 124/313 (39%), Positives = 173/313 (55%), Gaps = 18/313 (5%)

Query: 9   EVKFRSAANTHERYYTRWFPEE----EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL 64
           +VKF +  N   R      P +    +P+  +N YN+ D S WKDLN KF++ +YRD++ 
Sbjct: 627 KVKFLAEGNWGTRKVYGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFSA 686

Query: 65  HKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIA 114
             D++F   V+P     +   E FD+   G+++N GFPDQT          AYCGGL +A
Sbjct: 687 TGDLQFGVDVWPAVRAAMEYMEQFDRDADGLIENDGFPDQTYDTWTVHGVSAYCGGLWLA 746

Query: 115 ALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQL 174
           AL     M+  L D    +    K  KA  VY   LW GSY+ +DS  S +S SI ADQL
Sbjct: 747 ALQAAAAMALQLGDRDFAETCKRKFLKAKPVYEQKLWNGSYFNYDSGSSSNSKSIQADQL 806

Query: 175 CGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQ 234
            G  +  +SG+   ++F    I  +L  +F  NV   + G MGAVNGM PNG  D + +Q
Sbjct: 807 AGQWYTASSGL--PSLFDDFKIKSSLRKVFDFNVMKVKGGRMGAVNGMHPNGKVDETCMQ 864

Query: 235 SEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRS 292
           S E+W GVTY ++A M+  G  +EA+TTA G++   +   G G  F+TPE  T D  YRS
Sbjct: 865 SREIWAGVTYGVAATMILAGMEEEAFTTAEGIFLAGWSEEGSGYWFQTPEAFTIDGHYRS 924

Query: 293 GGYMRALAVYAMQ 305
             YMR L+++ MQ
Sbjct: 925 LIYMRPLSIWGMQ 937


>gi|357440839|ref|XP_003590697.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
 gi|355479745|gb|AES60948.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
          Length = 1103

 Score =  204 bits (519), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 125/315 (39%), Positives = 174/315 (55%), Gaps = 18/315 (5%)

Query: 9    EVKFRSAANTHERYYTRWFPEE----EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL 64
            +VKF +  N   R      P +    +P+  +N YN+ D S WKDLN KF++ +YRD++ 
Sbjct: 738  KVKFLAEGNWGTRKVYGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFSA 797

Query: 65   HKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIA 114
              D++F   V+P     +   E FD+   G+++N GFPDQT          AYCGGL +A
Sbjct: 798  TGDLQFGVDVWPAVRAAMEYMEQFDRDADGLIENDGFPDQTYDTWTVHGVSAYCGGLWLA 857

Query: 115  ALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQL 174
            AL     M+  L D    +    K  KA  VY   LW GSY+ +DS  S +S SI ADQL
Sbjct: 858  ALQAAAAMALQLGDRDFAETCKRKFLKAKPVYEQKLWNGSYFNYDSGSSSNSKSIQADQL 917

Query: 175  CGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQ 234
             G  +  +SG+   ++F    I  +L  +F  NV   + G MGAVNGM PNG  D + +Q
Sbjct: 918  AGQWYTASSGL--PSLFDDFKIKSSLRKVFDFNVMKVKGGRMGAVNGMHPNGKVDETCMQ 975

Query: 235  SEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRS 292
            S E+W GVTY ++A M+  G  +EA+TTA G++   +   G G  F+TPE  T D  YRS
Sbjct: 976  SREIWAGVTYGVAATMILAGMEEEAFTTAEGIFLAGWSEEGSGYWFQTPEAFTIDGHYRS 1035

Query: 293  GGYMRALAVYAMQDA 307
              YMR L+++ MQ A
Sbjct: 1036 LIYMRPLSIWGMQYA 1050


>gi|255575898|ref|XP_002528846.1| conserved hypothetical protein [Ricinus communis]
 gi|223531697|gb|EEF33520.1| conserved hypothetical protein [Ricinus communis]
          Length = 952

 Score =  203 bits (517), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 116/298 (38%), Positives = 171/298 (57%), Gaps = 21/298 (7%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N Y++++   WKDLN KF++ +YRD     D +F   V+P+    +   + FD+
Sbjct: 607 DPWYEVNAYSLYNTDRWKDLNPKFVLQVYRDVVATGDKKFAEAVWPSVYIAMAYMDQFDR 666

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N GFPDQT          AY GGL +AAL     ++  + D   +  F A+  
Sbjct: 667 DGDGMIENDGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALAREVGDKGSEDYFWARFQ 726

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA- 199
           KA  VY D LW GSY+ +D+S  R+S SI ADQL G  + +A G     +FP  +  +A 
Sbjct: 727 KAKDVY-DKLWNGSYFNYDNSGGRNSSSIQADQLAGQWYARACG-----LFPIVDKDKAR 780

Query: 200 --LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
             L  +++ NV   + G  GA+NGM P+G  D S++QS E+W+GVTY L+A M+ E  +D
Sbjct: 781 SALEKVYNYNVLKVKDGKRGAINGMLPDGKVDLSSMQSREIWSGVTYALAATMIQEDMLD 840

Query: 258 EAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
            A+ TA G+Y   +   GLG  F+TPE       YRS  YMR LA++AMQ A  + K+
Sbjct: 841 MAFHTASGIYEAAWSERGLGYSFQTPEAWNNVDQYRSLCYMRPLAIWAMQWALSRPKL 898


>gi|255554158|ref|XP_002518119.1| conserved hypothetical protein [Ricinus communis]
 gi|223542715|gb|EEF44252.1| conserved hypothetical protein [Ricinus communis]
          Length = 968

 Score =  203 bits (517), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 121/313 (38%), Positives = 172/313 (54%), Gaps = 18/313 (5%)

Query: 9   EVKFRSAANTHERYYTRWFPEE----EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL 64
           +VKF +  N   R      P +    +P++ +N YN+ D S WKDLN KF++ +YRD+A 
Sbjct: 603 KVKFLAEGNVGIRKVRGAVPHDLGTHDPWNEMNAYNIHDTSKWKDLNPKFVLQVYRDFAA 662

Query: 65  HKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIA 114
            +D+ F   V+P     +   E FD+ G  +++N GFPDQT          AYCG L +A
Sbjct: 663 TQDMSFGVDVWPAVRSAMEYMEQFDRDGDALIENDGFPDQTYDAWTVHGVSAYCGCLWLA 722

Query: 115 ALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQL 174
           AL     M+  + D    +   +K  KA   +   LW GSY+ +DS  S +S SI ADQL
Sbjct: 723 ALEAAAAMALQVGDKYFAELCRSKFVKAKSAFEAKLWNGSYFNYDSGSSSNSKSIQADQL 782

Query: 175 CGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQ 234
            G  ++ +SG+    +F  + I   L  I+  NV     G MGAVNGM PNG  D + +Q
Sbjct: 783 AGQWYVASSGL--PPLFDDSKIKSTLQKIYDFNVMKVRGGRMGAVNGMHPNGKVDETCMQ 840

Query: 235 SEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRS 292
           S E+WTGVTY ++A M+  G  D+A+  A G++   +   G G  F+TPEG T D  +RS
Sbjct: 841 SREIWTGVTYAVAATMILAGMEDKAFAAAEGIFLAGWSEDGYGYWFQTPEGWTTDGHFRS 900

Query: 293 GGYMRALAVYAMQ 305
             YMR LA++ MQ
Sbjct: 901 LIYMRPLAIWGMQ 913


>gi|281212396|gb|EFA86556.1| hypothetical protein PPL_00357 [Polysphondylium pallidum PN500]
          Length = 4775

 Score =  203 bits (516), Expect = 9e-50,   Method: Composition-based stats.
 Identities = 108/285 (37%), Positives = 162/285 (56%), Gaps = 13/285 (4%)

Query: 28   PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIR-FLTRVYPTCLELIRKCE 86
            P E+P+  +N Y++ DVS WKDL  KF++ IYRDY ++ + R FL +++    E+IR+  
Sbjct: 871  PGEDPWKRVNAYHIQDVSRWKDLPSKFVLQIYRDYLINGNDRTFLLQMWGVVEEVIRRAF 930

Query: 87   SFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFS 136
             +D    G++ N G PDQT          AY GGL +AA+    EM+ +L   + +  + 
Sbjct: 931  EYDIDIDGVIDNEGVPDQTYDAWSALGCSAYSGGLWLAAVKVASEMARILGLKEDETVYK 990

Query: 137  AKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANI 196
                K  + Y   LW G Y+ +DSS++ H DSIM+DQL G+ +L A G+ +      A  
Sbjct: 991  KIFEKGKKSYSTKLWNGHYFNYDSSKNPHFDSIMSDQLAGHWYLLACGLPSYITLDQA-- 1048

Query: 197  ARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNV 256
              +L+ I   NV+ +  GS GAVNGM+P G  D +++QS EVW G +Y L++ ML     
Sbjct: 1049 LSSLSIINEYNVKSYSNGSCGAVNGMRPEGGPDTTSLQSCEVWIGTSYGLASTMLLHFMD 1108

Query: 257  DEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAV 301
            +EAW    GL  + Y + G  ++TPE    +  YR+G YMR LA+
Sbjct: 1109 NEAWELIKGLVDSTYNKWGFQYQTPEAWDQNGLYRAGTYMRPLAI 1153


>gi|224120058|ref|XP_002331126.1| predicted protein [Populus trichocarpa]
 gi|222872854|gb|EEF09985.1| predicted protein [Populus trichocarpa]
          Length = 948

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 116/288 (40%), Positives = 163/288 (56%), Gaps = 15/288 (5%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ +   WKDLN KF++ +YRD     D +F   V+P+    +   + FD
Sbjct: 602 HDPWFEVNAYNLHNTDRWKDLNPKFVLQVYRDVIATGDKKFARAVWPSVYVAMAYMDQFD 661

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           + G G+++N GFPDQT          AYCGGL +AAL     ++  + D    + F  + 
Sbjct: 662 RDGDGMIENDGFPDQTYDTWSLSGVSAYCGGLWVAALQAASALAWEVGDKDSAEYFWCRF 721

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            KA  VY  L W GSY+ +D+S   +S SI ADQL G  + +A G++   I        A
Sbjct: 722 QKAKIVYGKL-WNGSYFNYDNSGGINSSSIQADQLAGQWYARACGLS--PIVDEDKARCA 778

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  I++ NV   + G  GAVNGM P+G  D S +QS E+W+GVTY ++A M+ EG  D A
Sbjct: 779 LEKIYNYNVLKVQDGKRGAVNGMLPDGTVDLSCLQSREIWSGVTYAVAATMIQEGLTDMA 838

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           + TA G+Y  V+   GLG  F+TPEG      YRS  YMR LA++AMQ
Sbjct: 839 FHTASGVYEAVWAEQGLGYSFQTPEGWNTTDQYRSLCYMRPLAIWAMQ 886


>gi|449463809|ref|XP_004149624.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
 gi|449494604|ref|XP_004159595.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
          Length = 951

 Score =  201 bits (511), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 117/290 (40%), Positives = 169/290 (58%), Gaps = 15/290 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +NGYN+++   WKDLN KF++ IYRD     D++F   V+P     I   + FD+
Sbjct: 609 DPWFEVNGYNLYNTDRWKDLNPKFVLQIYRDVVATGDMKFAKAVWPPVYLAIAYMDQFDR 668

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N GFPDQT          AY GGL +AAL     ++ + ++   +  F  K  
Sbjct: 669 DGDGMIENDGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALARVADEKDAEHYFWFKFQ 728

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA + Y + LW GSY+ +DSS    S SI ADQL G  + +ASG+    I        AL
Sbjct: 729 KAKRAY-EKLWNGSYFNYDSSGGSSSSSIQADQLAGQWYARASGL--LPIVDEDKAKSAL 785

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
           A +++ NV   + G  GAVNGM P+G  D S++QS E+W+GVTY ++A+M++E   D A+
Sbjct: 786 AKVYNYNVMKVKDGKRGAVNGMLPDGTIDFSSMQSREIWSGVTYAVAASMIHEDMTDMAF 845

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAY 308
            TA G++   +   GLG  F+TPE  T    YRS  YMR LA++AMQ A+
Sbjct: 846 RTAEGIHEAAWSEDGLGYNFQTPEAWTTTDRYRSLCYMRPLAIWAMQWAF 895


>gi|115474457|ref|NP_001060825.1| Os08g0111200 [Oryza sativa Japonica Group]
 gi|42408344|dbj|BAD09496.1| putative Bile acid beta-glucosidase [Oryza sativa Japonica Group]
 gi|42408391|dbj|BAD09542.1| putative Bile acid beta-glucosidase [Oryza sativa Japonica Group]
 gi|113622794|dbj|BAF22739.1| Os08g0111200 [Oryza sativa Japonica Group]
 gi|215701490|dbj|BAG92914.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 928

 Score =  200 bits (509), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 114/287 (39%), Positives = 163/287 (56%), Gaps = 14/287 (4%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N Y + D   WKDLN KF++ +YRD     +  F    +P     +   + FD+
Sbjct: 580 DPWFEVNAYMLHDPVRWKDLNPKFVLQVYRDVVATGNAGFAEAAWPAVYLAMAYMDQFDR 639

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+V+N G PDQT          AY GGL +AAL     M+ ++ D   +  F  +  
Sbjct: 640 DGDGMVENEGRPDQTYDLWSVSGVSAYTGGLWVAALQAAAAMAGIVGDGAAEAYFRGRYH 699

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           +A +VY D LW G Y+ +D+S    S SI ADQL G  + +A G+  E I       RAL
Sbjct: 700 RARRVYTDELWNGGYFNYDNSGGATSSSIQADQLAGQWYARACGL--EPIVDGDKARRAL 757

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
           AT+   NV   + G++GAVNGM+P+G  D S+ QS+EVW GVTY ++AAM++EG  + A+
Sbjct: 758 ATVLDYNVMRVKGGAIGAVNGMRPDGAVDASSTQSKEVWPGVTYAVAAAMIHEGMPEAAF 817

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
            TA G++   + + G G  F+TPE  T D  YR+  YMR L V+AMQ
Sbjct: 818 KTAKGIHDAGWGKHGFGYAFQTPESWTADGGYRALHYMRPLGVWAMQ 864


>gi|357157041|ref|XP_003577664.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
           distachyon]
          Length = 951

 Score =  200 bits (508), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 114/287 (39%), Positives = 163/287 (56%), Gaps = 15/287 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N Y + +   WKDLN KF++ +YRD     +  F   V+P+    +   E FDK
Sbjct: 608 DPWFKVNAYTLHNTDRWKDLNPKFVLQVYRDVVATGNKSFARAVWPSVYMAMAYMEQFDK 667

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
              G+++N  FPDQT          AYCGGL +AAL     ++  + D   ++ F  K  
Sbjct: 668 DKDGMIENEDFPDQTYDVWSMAGVSAYCGGLWVAALQAASALAREVGDKASEELFWNKYE 727

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY  L W GSY+ +D + ++ S SI ADQL G  + K+ G+++  I        AL
Sbjct: 728 KAKSVYGKL-WNGSYFNYDDAGTKVSTSIHADQLAGQWYAKSCGLSS--IVDKDKSQSAL 784

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
             I+S NV  F+ G  GA+NGM P+G  D S +QS E+W GVTY L+A+M+ EG V+E +
Sbjct: 785 EKIYSFNVMKFKDGKRGAMNGMWPDGTVDMSTMQSREIWPGVTYALAASMIQEGMVEEGF 844

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
            TA G+Y   +   GLG  F+TPE  T D  YRS  YMR LA++++Q
Sbjct: 845 KTAEGIYHAAWSPEGLGYAFQTPEAWTNDDGYRSLCYMRPLAIWSIQ 891


>gi|115484889|ref|NP_001067588.1| Os11g0242100 [Oryza sativa Japonica Group]
 gi|77549531|gb|ABA92328.1| expressed protein [Oryza sativa Japonica Group]
 gi|113644810|dbj|BAF27951.1| Os11g0242100 [Oryza sativa Japonica Group]
 gi|215704397|dbj|BAG93831.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222615769|gb|EEE51901.1| hypothetical protein OsJ_33494 [Oryza sativa Japonica Group]
          Length = 950

 Score =  200 bits (508), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 118/298 (39%), Positives = 165/298 (55%), Gaps = 21/298 (7%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N Y +++   WKDLN KF++ +YRD     D  F   V+P+    +   E FD+
Sbjct: 608 DPWFKVNAYTLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYMAMAYMEQFDR 667

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
              G+++N  FPDQT          AYCGGL +AAL     ++  + D   ++ F  K  
Sbjct: 668 DKDGMIENEDFPDQTYDVWSMAGISAYCGGLWVAALQAASALAHEVGDKASEKLFWDKYE 727

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA- 199
           KA  VY  L W GSY+ +D   +  S SI ADQL G  + KA G     +FP  +  +A 
Sbjct: 728 KAKSVYGKL-WNGSYFNYDDGDNIMSASIHADQLAGQWYAKACG-----LFPIVDKDKAE 781

Query: 200 --LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
             L  I+S NV  F+ G  GA+NGM P+G  D SA+QS E+W GVTY L+A M+ EG V+
Sbjct: 782 SALEKIYSFNVMKFKDGKRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAATMIQEGMVE 841

Query: 258 EAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
           + + TA G+Y   +   GLG  F+TPE    D  YRS  YMR LA++A+Q A    K+
Sbjct: 842 KGFKTAEGIYHAAWSPEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAIQWALSNPKL 899


>gi|218185509|gb|EEC67936.1| hypothetical protein OsI_35656 [Oryza sativa Indica Group]
          Length = 950

 Score =  200 bits (508), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 118/298 (39%), Positives = 165/298 (55%), Gaps = 21/298 (7%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N Y +++   WKDLN KF++ +YRD     D  F   V+P+    +   E FD+
Sbjct: 608 DPWFKVNAYTLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYMAMAYMEQFDR 667

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
              G+++N  FPDQT          AYCGGL +AAL     ++  + D   ++ F  K  
Sbjct: 668 DKDGMIENEDFPDQTYDVWSMAGISAYCGGLWVAALQAASALAHEVGDKASEKLFWDKYE 727

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA- 199
           KA  VY  L W GSY+ +D   +  S SI ADQL G  + KA G     +FP  +  +A 
Sbjct: 728 KAKSVYGKL-WNGSYFNYDDGDNIMSASIHADQLAGQWYAKACG-----LFPIVDKDKAE 781

Query: 200 --LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
             L  I+S NV  F+ G  GA+NGM P+G  D SA+QS E+W GVTY L+A M+ EG V+
Sbjct: 782 SALEKIYSFNVMKFKDGKRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAATMIQEGMVE 841

Query: 258 EAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
           + + TA G+Y   +   GLG  F+TPE    D  YRS  YMR LA++A+Q A    K+
Sbjct: 842 KGFKTAEGIYHAAWSPEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAIQWALSNPKL 899


>gi|62733283|gb|AAX95400.1| At5g49900 [Oryza sativa Japonica Group]
          Length = 931

 Score =  200 bits (508), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 118/298 (39%), Positives = 165/298 (55%), Gaps = 21/298 (7%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N Y +++   WKDLN KF++ +YRD     D  F   V+P+    +   E FD+
Sbjct: 589 DPWFKVNAYTLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYMAMAYMEQFDR 648

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
              G+++N  FPDQT          AYCGGL +AAL     ++  + D   ++ F  K  
Sbjct: 649 DKDGMIENEDFPDQTYDVWSMAGISAYCGGLWVAALQAASALAHEVGDKASEKLFWDKYE 708

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA- 199
           KA  VY  L W GSY+ +D   +  S SI ADQL G  + KA G     +FP  +  +A 
Sbjct: 709 KAKSVYGKL-WNGSYFNYDDGDNIMSASIHADQLAGQWYAKACG-----LFPIVDKDKAE 762

Query: 200 --LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
             L  I+S NV  F+ G  GA+NGM P+G  D SA+QS E+W GVTY L+A M+ EG V+
Sbjct: 763 SALEKIYSFNVMKFKDGKRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAATMIQEGMVE 822

Query: 258 EAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
           + + TA G+Y   +   GLG  F+TPE    D  YRS  YMR LA++A+Q A    K+
Sbjct: 823 KGFKTAEGIYHAAWSPEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAIQWALSNPKL 880


>gi|297831192|ref|XP_002883478.1| hypothetical protein ARALYDRAFT_479911 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329318|gb|EFH59737.1| hypothetical protein ARALYDRAFT_479911 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 950

 Score =  200 bits (508), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 119/313 (38%), Positives = 166/313 (53%), Gaps = 18/313 (5%)

Query: 9   EVKFRSAANTHERYYTRWFPEE----EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL 64
           +VKF +  N   R      P +    +P++ +N YN+ D S WKDLN KF++ +YRD+A 
Sbjct: 587 KVKFLAEGNVGIRKVRGAVPHDLGMHDPWNEMNAYNIHDTSKWKDLNPKFVLQVYRDFAA 646

Query: 65  HKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIA 114
             D +F   V+P     +   E FD+    +++N GFPDQT          AYCG L +A
Sbjct: 647 TGDYQFGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLA 706

Query: 115 ALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQL 174
           AL     M+  + D    +    K   A       LW GSY+ +DS  S +S SI  DQL
Sbjct: 707 ALQAAAAMALQIGDKFFAELCKNKFLNAKAALETKLWNGSYFNYDSGASSNSKSIQTDQL 766

Query: 175 CGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQ 234
            G  +  +SG+    IF  + I   +  IF  NV   + G MGAVNGM P+G  D + +Q
Sbjct: 767 AGQWYAASSGL--PPIFEESKIRSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDTCMQ 824

Query: 235 SEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRS 292
           S E+WTGVTY  +A M+  G  ++ +TTA G++   +   G G  F+TPEG T D  YRS
Sbjct: 825 SREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGWSEEGFGYWFQTPEGWTMDGHYRS 884

Query: 293 GGYMRALAVYAMQ 305
             YMR LA++ MQ
Sbjct: 885 LIYMRPLAIWGMQ 897


>gi|297739499|emb|CBI29681.3| unnamed protein product [Vitis vinifera]
          Length = 949

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 118/295 (40%), Positives = 166/295 (56%), Gaps = 15/295 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +NGYN+++   WKDLN KF++ +YRD     D  F   V+P+    +     FDK
Sbjct: 605 DPWFEVNGYNLYNTDRWKDLNPKFVLQVYRDVVATGDKNFAKAVWPSVYVALAYMNQFDK 664

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N GFPDQT          AY GGL +AAL     ++ ++ D   +  F  K  
Sbjct: 665 DGDGMIENEGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALARVVGDKGSEDYFWFKFQ 724

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY  L W GSY+ +D S    S SI ADQL G  + +A G++   I        AL
Sbjct: 725 KAKGVYQKL-WNGSYFNYDDSDGSSSSSIQADQLAGQWYARACGLS--PIVDEDKAKSAL 781

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
             ++  NV     G  GAVNGM P+G  D + +QS E+W+GVTY ++A M++EG VD A+
Sbjct: 782 EKVYHYNVLKVGGGKRGAVNGMLPDGKVDTTTMQSREIWSGVTYGVAATMIHEGLVDMAF 841

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
            TA G+Y   + + GLG  F+TPE    D  YRS  YMR LA++AMQ A+ + K+
Sbjct: 842 QTASGVYEAAWSQEGLGYSFQTPESWNTDDQYRSLCYMRPLAIWAMQWAFSQPKL 896


>gi|359486233|ref|XP_002264575.2| PREDICTED: non-lysosomal glucosylceramidase [Vitis vinifera]
          Length = 960

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 118/295 (40%), Positives = 166/295 (56%), Gaps = 15/295 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +NGYN+++   WKDLN KF++ +YRD     D  F   V+P+    +     FDK
Sbjct: 616 DPWFEVNGYNLYNTDRWKDLNPKFVLQVYRDVVATGDKNFAKAVWPSVYVALAYMNQFDK 675

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+++N GFPDQT          AY GGL +AAL     ++ ++ D   +  F  K  
Sbjct: 676 DGDGMIENEGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALARVVGDKGSEDYFWFKFQ 735

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY  L W GSY+ +D S    S SI ADQL G  + +A G++   I        AL
Sbjct: 736 KAKGVYQKL-WNGSYFNYDDSDGSSSSSIQADQLAGQWYARACGLS--PIVDEDKAKSAL 792

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
             ++  NV     G  GAVNGM P+G  D + +QS E+W+GVTY ++A M++EG VD A+
Sbjct: 793 EKVYHYNVLKVGGGKRGAVNGMLPDGKVDTTTMQSREIWSGVTYGVAATMIHEGLVDMAF 852

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
            TA G+Y   + + GLG  F+TPE    D  YRS  YMR LA++AMQ A+ + K+
Sbjct: 853 QTASGVYEAAWSQEGLGYSFQTPESWNTDDQYRSLCYMRPLAIWAMQWAFSQPKL 907


>gi|326517703|dbj|BAK03770.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 821

 Score =  199 bits (506), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 118/289 (40%), Positives = 167/289 (57%), Gaps = 16/289 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N Y + D S WKDLN KF++ +YRD A   ++ F T  +P     +   + FD+
Sbjct: 466 DPWFELNAYMIHDPSRWKDLNPKFVLQVYRDVAATGNVTFATAAWPAVYLAMAYMDQFDR 525

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+V+N G PDQT          AY GGL +AAL     M+ ++ D   +  F  +  
Sbjct: 526 DGDGMVENEGRPDQTYDLWSVSGVSAYTGGLWLAALQAAAAMARIVGDRGAEGYFLERYK 585

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           +A +VY   LW GSY+ +D+S    S SIMADQL G  + +A G+  E I        AL
Sbjct: 586 RAQRVYDGELWNGSYFDYDNSGCATSKSIMADQLAGQWYARACGL--EPIVEEEKARSAL 643

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
            T+   NV   + G++GAVNGM+P+G  D S++QS+EVW GVTY ++AAML+EG  + A+
Sbjct: 644 GTVLDYNVMRVQGGAVGAVNGMRPDGAVDASSLQSKEVWVGVTYGVAAAMLHEGMTEAAF 703

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDK--TYRSGGYMRALAVYAMQ 305
            TA G +   + R G G  F+TPE  T D    YRS  YMR L+++AMQ
Sbjct: 704 RTAKGAHDAGWGRDGFGYAFQTPEAWTSDAGGGYRSLHYMRPLSIWAMQ 752


>gi|413941636|gb|AFW74285.1| hypothetical protein ZEAMMB73_466373 [Zea mays]
          Length = 966

 Score =  198 bits (503), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 116/288 (40%), Positives = 169/288 (58%), Gaps = 15/288 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N Y + D + WKDLN KF++ +YRD     D  F   V+P     +   + FD+
Sbjct: 613 DPWVELNAYMLHDAARWKDLNPKFVLQVYRDAVATGDAAFAEAVWPAVYVAMAYMDQFDR 672

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+V+N G PDQT          AY GG+ +AAL     M+ ++ D   +  F A+  
Sbjct: 673 DGDGMVENEGVPDQTYDMWSVSGVSAYTGGVWVAALQAAAGMARVVGDRPAEAYFRARRD 732

Query: 141 KASQVYHDLLWTGSYYKFDSSQSR-HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
           +A++VY   LWTG+Y+++D+S     S+SIMADQL G  + +A G+  E +   A    A
Sbjct: 733 RAARVYDAELWTGAYFRYDNSGGGGASESIMADQLAGQWYARACGL--EPVVEDAKARSA 790

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           LAT+   NV   + G++GAVNGM+P+G  D S+ QS+EVW GVTY ++AAM++EG  + A
Sbjct: 791 LATVLDYNVMRVKGGAIGAVNGMRPDGAVDASSTQSKEVWPGVTYAVAAAMVHEGMTEAA 850

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           + TA G +   +   G G  F+TPE  T D  YRS  YMR L ++AMQ
Sbjct: 851 FRTAKGAHDAAWGNDGFGYAFQTPEAWTEDGGYRSLHYMRPLGIWAMQ 898


>gi|341888271|gb|EGT44206.1| hypothetical protein CAEBREN_07747 [Caenorhabditis brenneri]
          Length = 820

 Score =  197 bits (501), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 118/297 (39%), Positives = 164/297 (55%), Gaps = 23/297 (7%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIR-----FLTRVYPTCLELI 82
           P  +P+   N Y + D   WKDLNLKF++S YRDY +    R      L      C +++
Sbjct: 523 PMADPWLHTNAYILHDTGSWKDLNLKFVISCYRDYKMIAKDREDKDQLLEFFLGKCTKIV 582

Query: 83  RKC-ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQ 131
               ES+DK   G+++N GF DQT          AYCG L +AAL C +EM  L      
Sbjct: 583 EDALESWDKDKDGMIENDGFADQTYDVWKMTGTSAYCGSLWLAALTCYIEM--LKQAGSP 640

Query: 132 QQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIF 191
            + +  KL  A   Y   LW G ++KFD     +S  +MADQLCG+  L A  ++     
Sbjct: 641 SKHYEGKLLNAYDAYTTKLWNGRFFKFDELPD-NSKIVMADQLCGFWALTA--MDEPVQV 697

Query: 192 PSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAML 251
               +  AL TIF  NV+ +  G  GAVNG   +   D S+IQSEEVW G+TY LSA M+
Sbjct: 698 SEEKMKLALETIFKYNVQMYNDGKCGAVNGFLTSERVDGSSIQSEEVWAGITYALSAMMI 757

Query: 252 YEGNVDE-AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
            E N+DE A+ T+ GL+ +++ R  L ++TPE +T D  YR+ GYMR L+++A+Q A
Sbjct: 758 -EKNMDEMAFKTSEGLFESIWNRFPLQYQTPEAITSDGMYRALGYMRPLSIWAIQHA 813


>gi|328869576|gb|EGG17953.1| hypothetical protein DFA_06619 [Dictyostelium fasciculatum]
          Length = 1200

 Score =  197 bits (500), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 108/312 (34%), Positives = 164/312 (52%), Gaps = 36/312 (11%)

Query: 28   PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY------------------------A 63
            P E+P+  +N YN+ D+S WKDL  KFI+ +YRDY                         
Sbjct: 872  PGEDPWKRVNAYNIQDISRWKDLPCKFILQVYRDYLLTSVDSSGQKSNATSSSFGQEYIP 931

Query: 64   LHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHI 113
            L  D  FL +++ T  E+IRK   FD    G++ N GFPDQT          AY GGL +
Sbjct: 932  LDGDRGFLLQMWGTVEEVIRKAFEFDTDDDGVIDNEGFPDQTYDTWSASGCSAYTGGLWL 991

Query: 114  AALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQ 173
            AA+     M+ +L     ++ +S    K  + Y+  LW G Y+ +DSS+  H +SIMADQ
Sbjct: 992  AAIKATSAMARILGLRDDEEVYSKLFEKGKKSYNKKLWNGHYFNYDSSKQSHYNSIMADQ 1051

Query: 174  LCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAI 233
            L G+ +L + G+++      A     L+ I   N++ +  GS GAVNG+ P    D++ +
Sbjct: 1052 LAGHWYLTSCGLSSYMTLDQA--LSTLSIINEYNIKSYSNGSCGAVNGISPLAIVDQTCL 1109

Query: 234  QSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSG 293
            QS EVW G +Y L++  L      EAW+   GL  + YE+ G  ++TPE    + ++R+ 
Sbjct: 1110 QSSEVWIGTSYSLASTFLLHYMDKEAWSLIKGLVNSSYEKWGFQYQTPEAWDMNGSFRAS 1169

Query: 294  GYMRALAVYAMQ 305
             YMR LA++++Q
Sbjct: 1170 TYMRPLAIWSVQ 1181


>gi|341899054|gb|EGT54989.1| hypothetical protein CAEBREN_31176 [Caenorhabditis brenneri]
          Length = 831

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 117/294 (39%), Positives = 163/294 (55%), Gaps = 23/294 (7%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIR-----FLTRVYPTCLELIRKC 85
           +P+   N Y + D   WKDLNLKF++S YRDY +    R      L      C +++   
Sbjct: 537 DPWLHTNAYILHDTGSWKDLNLKFVISCYRDYKMIAKDREDKDQLLEFFLGKCTKIVEDA 596

Query: 86  -ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQ 134
            ES+DK   G+++N GF DQT          AYCG L +AAL C +EM  L       + 
Sbjct: 597 LESWDKDKDGMIENDGFADQTYDVWKMTGTSAYCGSLWLAALTCYIEM--LKQAGSPSKN 654

Query: 135 FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSA 194
           +  KL  A   Y   LW G ++KFD     +S  +MADQLCG+  L A  ++        
Sbjct: 655 YEGKLLNAYDAYITKLWNGRFFKFDELPD-NSKIVMADQLCGFWALTA--MDEPVQVSEE 711

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
            +  AL TIF  NV+ +  G  GAVNG   +   D S+IQSEEVW G+TY LSA M+ E 
Sbjct: 712 KMKLALETIFKYNVQMYNDGKCGAVNGFLTSERVDGSSIQSEEVWAGITYALSAMMI-EK 770

Query: 255 NVDE-AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
           N+DE A+ T+ GL+ +++ R  L ++TPE +T D  YR+ GYMR L+++A+Q A
Sbjct: 771 NMDEMAFKTSEGLFESIWNRFPLQYQTPEAITSDGMYRALGYMRPLSIWAIQHA 824


>gi|308471410|ref|XP_003097936.1| hypothetical protein CRE_12977 [Caenorhabditis remanei]
 gi|308239241|gb|EFO83193.1| hypothetical protein CRE_12977 [Caenorhabditis remanei]
          Length = 845

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 115/297 (38%), Positives = 166/297 (55%), Gaps = 18/297 (6%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIR-KCES 87
           +P+   N Y + D   WKDLNLKF++S YRD+ +  + D   L      C +++    ES
Sbjct: 554 DPWLHTNAYILHDTGCWKDLNLKFVISCYRDWKMISNGDQEILEFFIGKCTKIVDGALES 613

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
           +DK   G+++N GF DQT          AYCG L +AAL C ++M  L       + +  
Sbjct: 614 WDKDQDGMIENDGFADQTYDVWKMTGTSAYCGSLWLAALTCYIQM--LKKAGSPTKFYDE 671

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIA 197
           KL  A + Y   LW G ++KFD     +S  +MADQLCG+  L A  ++         + 
Sbjct: 672 KLLMAYEAYTTKLWNGKFFKFDELPD-NSKIVMADQLCGFWALTA--MDEPVQVSEEKMK 728

Query: 198 RALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
            AL TIF  NV+ ++ G  GAVNG   +   D S+IQSEEVW G+TY LSA M+ +G  +
Sbjct: 729 SALETIFKYNVQMYDGGKCGAVNGYLTSERVDGSSIQSEEVWAGITYALSAMMIEKGMDE 788

Query: 258 EAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 314
            A+ T+ GL+ +++ R  L F+TPE +T D  YR+ GYMR L+++A+Q A  K   K
Sbjct: 789 MAFKTSEGLFDSIWNRFPLQFQTPEAITSDGMYRALGYMRPLSIWAIQHALEKRNRK 845


>gi|20453181|gb|AAM19831.1| AT4g10060/T5L19_190 [Arabidopsis thaliana]
 gi|24111431|gb|AAN46866.1| At4g10060/T5L19_190 [Arabidopsis thaliana]
          Length = 922

 Score =  194 bits (492), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 113/287 (39%), Positives = 163/287 (56%), Gaps = 15/287 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN F+   WKDLN KF++ +YRD     D  F   V+P+    +   + FDK
Sbjct: 571 DPWLELNEYNFFNTDRWKDLNAKFVLQVYRDVVATNDQSFAKAVWPSVYTAVAYLDQFDK 630

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
              G+++N GFPDQT          AYCGGL +AAL      +S++ +      F+AK  
Sbjct: 631 DEDGMIENEGFPDQTYDAWSVTGVSAYCGGLWVAALQAASAFASIVGENAVAIYFNAKYE 690

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY + LW GSY+ +D S S  S SI+ADQL G  + +A G+  + I     I +AL
Sbjct: 691 KAKIVY-EKLWNGSYFNYDDSGSGSSSSILADQLAGQWYARACGL--KPITKEEWIKKAL 747

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
            TI+  NV   + G+ GAVNGM   G  D +++ S+EVW G TY ++A M+ EG  ++ +
Sbjct: 748 ETIYEFNVMKVKGGTRGAVNGMSTEGKVDTNSLVSKEVWAGTTYSVAACMIQEGQREKGF 807

Query: 261 TTAGGLYRTVYERTGL--GFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
            TA G+Y  ++   GL   F+TPE    +  YRS  YMR LA++A+Q
Sbjct: 808 QTASGIYEAIWSDRGLSCSFQTPEAWNMNDEYRSLCYMRPLAIWAIQ 854


>gi|240255774|ref|NP_192744.6| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
 gi|332657434|gb|AEE82834.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
          Length = 922

 Score =  193 bits (491), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 113/287 (39%), Positives = 163/287 (56%), Gaps = 15/287 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN F+   WKDLN KF++ +YRD     D  F   V+P+    +   + FDK
Sbjct: 571 DPWLELNEYNFFNTDRWKDLNAKFVLQVYRDVVATNDQSFAKAVWPSVYTAVAYLDQFDK 630

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
              G+++N GFPDQT          AYCGGL +AAL      +S++ +      F+AK  
Sbjct: 631 DEDGMIENEGFPDQTYDAWSVTGVSAYCGGLWVAALQAASAFASIVGENAVAIYFNAKYE 690

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY + LW GSY+ +D S S  S SI+ADQL G  + +A G+  + I     I +AL
Sbjct: 691 KAKIVY-EKLWNGSYFNYDDSGSGSSSSILADQLAGQWYARACGL--KPITKEEWIKKAL 747

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
            TI+  NV   + G+ GAVNGM   G  D +++ S+EVW G TY ++A M+ EG  ++ +
Sbjct: 748 ETIYEFNVMKVKGGTRGAVNGMSTEGKVDTNSLVSKEVWAGTTYSVAACMIQEGQREKGF 807

Query: 261 TTAGGLYRTVYERTGL--GFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
            TA G+Y  ++   GL   F+TPE    +  YRS  YMR LA++A+Q
Sbjct: 808 QTASGIYEAIWSDRGLSCSFQTPEAWNMNDEYRSLCYMRPLAIWAIQ 854


>gi|159896834|ref|YP_001543081.1| hypothetical protein Haur_0301 [Herpetosiphon aurantiacus DSM 785]
 gi|159889873|gb|ABX02953.1| protein of unknown function DUF608 [Herpetosiphon aurantiacus DSM
           785]
          Length = 774

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 108/293 (36%), Positives = 161/293 (54%), Gaps = 19/293 (6%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVY---PTCLELIRK 84
           P+E+P    N Y+  D+++WKDLNLK+I+ IYRD +L  D   L   +   PT LE + +
Sbjct: 480 PKEQPLIKTNAYDFQDINNWKDLNLKYILRIYRDVSLWNDQAMLEATWDTIPTALEYVHQ 539

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQ 134
              FD  G G++ + G  DQT          +Y   L I AL   + ++  + D  Q   
Sbjct: 540 ---FDSDGDGLLDHSG-ADQTYDTWAMSGAASYSASLLICALEAAIRLAQRMGDHAQADA 595

Query: 135 FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSA 194
           +S  L  A Q +   LW G+Y+++ ++ +   + IMADQL G  +  A G+   A+ P  
Sbjct: 596 WSEWLAAARQSFETKLWNGTYFRYHTADTDLREVIMADQLVGQWYAGAIGL--PAVAPRE 653

Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
            I  AL T++  NV  +  G++GAVNGM P+G  D S+ Q+ EVW+G TY ++A ML EG
Sbjct: 654 MIRSALQTVYRFNVMQYANGALGAVNGMHPDGTVDTSSNQASEVWSGTTYAIAAMMLQEG 713

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
              E W TA G Y   Y   GL F TPE    ++T+R+  YMR  +++A++ A
Sbjct: 714 LDLEGWQTAWGAYNATYNELGLWFRTPEAWGIERTFRASMYMRPQSIWAIEHA 766


>gi|147788372|emb|CAN61187.1| hypothetical protein VITISV_019326 [Vitis vinifera]
          Length = 900

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 119/324 (36%), Positives = 167/324 (51%), Gaps = 50/324 (15%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ D S WKDLN KF++ +YRD+A  +D  F   V+P     +   E FD
Sbjct: 524 HDPWHEMNAYNIHDTSQWKDLNPKFVLQVYRDFAATRDFSFGADVWPAVRAAMEYMEQFD 583

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           +   G+++N GFPDQT          AYCG L +AAL     M+  L D    ++  +K 
Sbjct: 584 RDSDGLIENDGFPDQTYDTWTVHGISAYCGCLWLAALQAAAAMALQLGDKPFAEKCKSKF 643

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            KA  V+ + LW GSY+ +DS  S +S SI ADQL G  +  +SG+   ++F    I  +
Sbjct: 644 FKAKLVFEEKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASSGL--PSLFDDCKIKSS 701

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  I+  NV   + G MGAVNGM PNG  D S +QS E+WTGVTY ++A M+  G  ++A
Sbjct: 702 LHKIYDFNVMKVKGGKMGAVNGMHPNGKVDESCMQSREIWTGVTYGVAATMILSGMEEQA 761

Query: 260 WTTAGGLYRTVYERTGLG--------------------------------------FETP 281
           +TTA G++   +   G G                                      F+TP
Sbjct: 762 FTTAEGIFTAGWSEEGYGTLDFIDDMHCVVQILNCFVHSGLCVACVILLYCTCRYWFQTP 821

Query: 282 EGLTGDKTYRSGGYMRALAVYAMQ 305
           EG T D  +RS  YMR LA++ MQ
Sbjct: 822 EGWTIDGHFRSLIYMRPLAIWGMQ 845


>gi|4539009|emb|CAB39630.1| putative protein [Arabidopsis thaliana]
 gi|7267702|emb|CAB78129.1| putative protein [Arabidopsis thaliana]
          Length = 750

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 113/287 (39%), Positives = 163/287 (56%), Gaps = 15/287 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN F+   WKDLN KF++ +YRD     D  F   V+P+    +   + FDK
Sbjct: 399 DPWLELNEYNFFNTDRWKDLNAKFVLQVYRDVVATNDQSFAKAVWPSVYTAVAYLDQFDK 458

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
              G+++N GFPDQT          AYCGGL +AAL      +S++ +      F+AK  
Sbjct: 459 DEDGMIENEGFPDQTYDAWSVTGVSAYCGGLWVAALQAASAFASIVGENAVAIYFNAKYE 518

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY + LW GSY+ +D S S  S SI+ADQL G  + +A G+  + I     I +AL
Sbjct: 519 KAKIVY-EKLWNGSYFNYDDSGSGSSSSILADQLAGQWYARACGL--KPITKEEWIKKAL 575

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
            TI+  NV   + G+ GAVNGM   G  D +++ S+EVW G TY ++A M+ EG  ++ +
Sbjct: 576 ETIYEFNVMKVKGGTRGAVNGMSTEGKVDTNSLVSKEVWAGTTYSVAACMIQEGQREKGF 635

Query: 261 TTAGGLYRTVYERTGL--GFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
            TA G+Y  ++   GL   F+TPE    +  YRS  YMR LA++A+Q
Sbjct: 636 QTASGIYEAIWSDRGLSCSFQTPEAWNMNDEYRSLCYMRPLAIWAIQ 682


>gi|297809209|ref|XP_002872488.1| AT4g10060/T5L19_190 [Arabidopsis lyrata subsp. lyrata]
 gi|297318325|gb|EFH48747.1| AT4g10060/T5L19_190 [Arabidopsis lyrata subsp. lyrata]
          Length = 921

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 112/287 (39%), Positives = 162/287 (56%), Gaps = 15/287 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN F+   WKDLN KF++ +YRD     D  F   V+P+    +   + FDK
Sbjct: 571 DPWLELNEYNFFNTDRWKDLNAKFVLQVYRDVVSTGDQSFAKAVWPSVYTAVAYLDQFDK 630

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
               +++N GFPDQT          AYCGGL +AAL      +S++ +      F+ K  
Sbjct: 631 DEDEMIENEGFPDQTYDAWSVKGVSAYCGGLWVAALQAASAFASIVGENGVAIYFNTKYE 690

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA  VY + LW GSY+ +D S S  S SI+ADQL G  + +A G+  + I     I +AL
Sbjct: 691 KAKSVY-EKLWNGSYFNYDDSGSGSSSSILADQLAGQWYARACGL--KPITKEEWIKKAL 747

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
            TI+  NV   + G+ GAVNGM P+G  D +++ S+EVW G TY ++A M+ EG  +  +
Sbjct: 748 ETIYEFNVMKVKGGTRGAVNGMSPDGQVDTNSLVSKEVWAGTTYSVAACMIQEGQRERGF 807

Query: 261 TTAGGLYRTVYERTGL--GFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
            TA G+Y  V+   GL   F+TPE    +  YRS  YMR LA++++Q
Sbjct: 808 QTASGIYEAVWSDRGLSCSFQTPEAWNMNDEYRSLCYMRPLAIWSIQ 854


>gi|268555576|ref|XP_002635777.1| Hypothetical protein CBG10433 [Caenorhabditis briggsae]
          Length = 821

 Score =  191 bits (485), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 114/301 (37%), Positives = 163/301 (54%), Gaps = 21/301 (6%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL-----HKDIRFLTRVYPTCLELI 82
           P  +P+   N Y + D   WKDLNLKF++S YRD+ +           L      C +++
Sbjct: 523 PMADPWIHTNAYILHDTGHWKDLNLKFVISCYRDWKMIAEGEEDSQEILEFFLGKCTKIV 582

Query: 83  R-KCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQ 131
               E++DK   G+++N GF DQT          AYCG L IAAL   ++M  L      
Sbjct: 583 DGALENWDKDKDGMIENDGFADQTYDVWKMTGTSAYCGSLWIAALTSYIQM--LKRAGIP 640

Query: 132 QQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIF 191
            + +  KL  A + Y   LW G ++KFD     +S  +MADQLCG+  L A  ++     
Sbjct: 641 TKDYEEKLLMAYEAYTTKLWNGKFFKFDELPD-NSKIVMADQLCGFWALTA--MDEPIQV 697

Query: 192 PSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAML 251
               I  AL TIF  NV  ++ G  GAVNG   +   D S+IQSEEVW G+TY LSA M+
Sbjct: 698 SEGKIQSALETIFKYNVEMYDGGKCGAVNGFLASERVDGSSIQSEEVWAGITYSLSAMMI 757

Query: 252 YEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKG 311
            +G  + A+ T+ GL+ +++ R  L ++TPE +T D  YR+ GYMR L+++A+Q A  K 
Sbjct: 758 EKGMDEMAFKTSEGLFHSIWNRFPLQYQTPEAITADGMYRALGYMRPLSIWAIQHALDKK 817

Query: 312 K 312
           K
Sbjct: 818 K 818


>gi|22331303|ref|NP_189060.2| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
 gi|79313355|ref|NP_001030757.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
 gi|17529232|gb|AAL38843.1| unknown protein [Arabidopsis thaliana]
 gi|110742225|dbj|BAE99039.1| hypothetical protein [Arabidopsis thaliana]
 gi|332643347|gb|AEE76868.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
 gi|332643348|gb|AEE76869.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
          Length = 950

 Score =  191 bits (484), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 118/313 (37%), Positives = 166/313 (53%), Gaps = 18/313 (5%)

Query: 9   EVKFRSAANTHERYYTRWFPEE----EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL 64
           +VKF +  N   R      P +    +P++ +N YN+ D S WKDLN KF++ +YRD+A 
Sbjct: 587 KVKFLAEGNVGIRKVRGAVPHDLGMHDPWNEMNAYNIHDTSRWKDLNPKFVLQVYRDFAA 646

Query: 65  HKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIA 114
             D +F   V+P     +   E FD+    +++N GFPDQT          AYCG L +A
Sbjct: 647 TGDYQFGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLA 706

Query: 115 ALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQL 174
           AL     M+  + D    +    K   A       LW GSY+ +DS  S +S SI  DQL
Sbjct: 707 ALQAAAAMALQIGDKFFAELCKNKFLNAKAALETKLWNGSYFNYDSGSSSNSKSIQTDQL 766

Query: 175 CGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQ 234
            G  +  +SG+    +F  + I   +  IF  NV   + G MGAVNGM P+G  D + +Q
Sbjct: 767 AGQWYAASSGL--PPLFEESKIRSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDTCMQ 824

Query: 235 SEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRS 292
           S E+WTGVTY  +A M+  G  ++ +TTA G++   +   G G  F+TPEG T D  YRS
Sbjct: 825 SREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGWSEEGFGYWFQTPEGWTMDGHYRS 884

Query: 293 GGYMRALAVYAMQ 305
             YMR LA++ MQ
Sbjct: 885 LIYMRPLAIWGMQ 897


>gi|11994237|dbj|BAB01359.1| unnamed protein product [Arabidopsis thaliana]
          Length = 937

 Score =  191 bits (484), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 118/313 (37%), Positives = 166/313 (53%), Gaps = 18/313 (5%)

Query: 9   EVKFRSAANTHERYYTRWFPEE----EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL 64
           +VKF +  N   R      P +    +P++ +N YN+ D S WKDLN KF++ +YRD+A 
Sbjct: 574 KVKFLAEGNVGIRKVRGAVPHDLGMHDPWNEMNAYNIHDTSRWKDLNPKFVLQVYRDFAA 633

Query: 65  HKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIA 114
             D +F   V+P     +   E FD+    +++N GFPDQT          AYCG L +A
Sbjct: 634 TGDYQFGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLA 693

Query: 115 ALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQL 174
           AL     M+  + D    +    K   A       LW GSY+ +DS  S +S SI  DQL
Sbjct: 694 ALQAAAAMALQIGDKFFAELCKNKFLNAKAALETKLWNGSYFNYDSGSSSNSKSIQTDQL 753

Query: 175 CGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQ 234
            G  +  +SG+    +F  + I   +  IF  NV   + G MGAVNGM P+G  D + +Q
Sbjct: 754 AGQWYAASSGL--PPLFEESKIRSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDTCMQ 811

Query: 235 SEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRS 292
           S E+WTGVTY  +A M+  G  ++ +TTA G++   +   G G  F+TPEG T D  YRS
Sbjct: 812 SREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGWSEEGFGYWFQTPEGWTMDGHYRS 871

Query: 293 GGYMRALAVYAMQ 305
             YMR LA++ MQ
Sbjct: 872 LIYMRPLAIWGMQ 884


>gi|374311163|ref|YP_005057593.1| glucosylceramidase [Granulicella mallensis MP5ACTX8]
 gi|358753173|gb|AEU36563.1| Glucosylceramidase [Granulicella mallensis MP5ACTX8]
          Length = 811

 Score =  190 bits (483), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 110/290 (37%), Positives = 150/290 (51%), Gaps = 17/290 (5%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           PE +PF  +N     D +DWKDLN KF++ +YRDY L   KD  FL   +P     I   
Sbjct: 520 PEGDPFVAVNEPGWQDTNDWKDLNSKFVLMVYRDYVLTGRKDTAFLRETWPAVKAAIEYL 579

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
             FD  G G+ +N G+PDQT          AY GGL +AAL    E + ++ DT+   ++
Sbjct: 580 RQFDHGG-GVPENSGYPDQTYDDWVVRGVSAYSGGLWLAALRAGEETARVVGDTKTTAEY 638

Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
            A   K  + Y   LW G Y+++D+S S   D I ADQL G  +   +G+    I P   
Sbjct: 639 HALFLKGQKTYISQLWNGEYFRYDTS-SESKDDIQADQLAGQWYANLTGLGE--IVPHTM 695

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
              A   IF  NV  F  G MGA NGM  +G    +A +++EVW G T   +  ++ EG 
Sbjct: 696 QVSAAKKIFDVNVMKFGHGEMGAANGMTADGAILTNA-EAKEVWVGTTLGYAGLLMQEGM 754

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
            DEAW T  GLY  +YE  G  F TPE       +R+G YMR  A++A++
Sbjct: 755 NDEAWKTTRGLYHVIYEDKGYWFRTPEAWDITGNFRAGMYMRPTAIWALE 804


>gi|414871105|tpg|DAA49662.1| TPA: hypothetical protein ZEAMMB73_422318 [Zea mays]
          Length = 589

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 105/259 (40%), Positives = 145/259 (55%), Gaps = 12/259 (4%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ D S WKDLN KF++ IYRD+A   D++F   V+P     +   + FD
Sbjct: 309 HDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAATGDMQFGRDVWPAVCAAMDYMDQFD 368

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           + G G+++N GFPDQT          AYCG L +AAL     M+  L D    +++  K 
Sbjct: 369 RDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRHFAEKYKLKF 428

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            KA  VY   LW GSY+ +DS  S +S SI ADQL G  +  +SG+    +F    I  A
Sbjct: 429 IKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPLFDEHKIRTA 486

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  IF  NV   + G MGAVNGM P G  D + +QS E+WTGVTY ++A ML  G   + 
Sbjct: 487 LQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLLHGMEHQG 546

Query: 260 WTTAGGLYRTVYERTGLGF 278
           +TTA G++   +   G G+
Sbjct: 547 FTTAEGIFTAGWSEEGYGY 565


>gi|392918721|ref|NP_504047.2| Protein R08F11.1 [Caenorhabditis elegans]
 gi|387910741|emb|CCD72306.2| Protein R08F11.1 [Caenorhabditis elegans]
          Length = 830

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 111/292 (38%), Positives = 162/292 (55%), Gaps = 20/292 (6%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL----HKDIRFLTRVYPTCLELIR-KC 85
           +P+   N Y + D   WKDLNLKF++S YRD+ L     +  + L      C +++    
Sbjct: 537 DPWIHTNAYILHDTGRWKDLNLKFVISCYRDWKLIELGSEKGQVLEFFLGKCTKIVDGAL 596

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
           E +DK   G+++N GF DQT          AYCG L IAAL+  +EM  L       + +
Sbjct: 597 ECWDKDNDGMIENDGFADQTYDVWKMTGTSAYCGSLWIAALSSYIEM--LKQSGLPTKHY 654

Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
             KL  A   Y   LW G+++KFD     +S  +MADQLCG  F   + ++         
Sbjct: 655 EEKLEMAYDAYIGKLWNGTFFKFDE-LPENSKIVMADQLCG--FWAMTAMDEPVQISKDK 711

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
           +  AL TIF  NV+ +  G  GAVNG   +   D S+IQSEEVW G+TY LSA M+ +G 
Sbjct: 712 MKSALDTIFKYNVQMYNNGRCGAVNGYLTSERVDGSSIQSEEVWAGITYALSAMMIEKGM 771

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
            ++A+ T+ GL+ +++ R  L ++TPE +T D  YR+ GYMR L+++A+Q A
Sbjct: 772 DEQAFKTSEGLFESIWHRFPLQYQTPEAITSDGMYRALGYMRPLSIWAIQHA 823


>gi|224028315|gb|ACN33233.1| unknown [Zea mays]
          Length = 384

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 105/259 (40%), Positives = 145/259 (55%), Gaps = 12/259 (4%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
            +P+  +N YN+ D S WKDLN KF++ IYRD+A   D++F   V+P     +   + FD
Sbjct: 104 HDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAATGDMQFGRDVWPAVCAAMDYMDQFD 163

Query: 90  KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
           + G G+++N GFPDQT          AYCG L +AAL     M+  L D    +++  K 
Sbjct: 164 RDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRHFAEKYKLKF 223

Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
            KA  VY   LW GSY+ +DS  S +S SI ADQL G  +  +SG+    +F    I  A
Sbjct: 224 IKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPLFDEHKIRTA 281

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           L  IF  NV   + G MGAVNGM P G  D + +QS E+WTGVTY ++A ML  G   + 
Sbjct: 282 LQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLLHGMEHQG 341

Query: 260 WTTAGGLYRTVYERTGLGF 278
           +TTA G++   +   G G+
Sbjct: 342 FTTAEGIFTAGWSEEGYGY 360


>gi|357144426|ref|XP_003573288.1| PREDICTED: non-lysosomal glucosylceramidase-like isoform 1
           [Brachypodium distachyon]
          Length = 932

 Score =  187 bits (474), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 105/287 (36%), Positives = 162/287 (56%), Gaps = 15/287 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N Y + D + WKDLN KF++ +YR   +  +  F    +P     +   + +D+
Sbjct: 585 DPWFELNAYMIHDPARWKDLNPKFVLQVYRAVVVTGNAAFAKAAWPAVYLAMAYMDQYDR 644

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+V+N G PDQT          AY GG+ + AL     M+ ++ D   +  F  +  
Sbjct: 645 DGDGMVENEG-PDQTYDLWSVTGVSAYTGGIWVTALQATAAMARIVGDGDAECYFHGRYL 703

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           +A QVY   LW G+Y+++D+S    S+SIMADQL G  +    G+  E +        AL
Sbjct: 704 RAKQVYDAELWNGTYFRYDNSGGETSESIMADQLAGQWYAHVCGL--EPVVEEDKARSAL 761

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
           AT+   NV   + G++GAVNGM+P+G  D S+ QS+E+W+G TY ++AAM++EG  + A+
Sbjct: 762 ATVLDYNVMRVKGGAVGAVNGMRPDGGIDMSSTQSKEIWSGTTYAVAAAMVHEGMPEGAF 821

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
            TA G +   + + G G  F+TPE  T +  YR   YMR L+V+AMQ
Sbjct: 822 RTAKGAHDASWGKAGFGYAFQTPEAWTAEGGYRGLHYMRPLSVWAMQ 868


>gi|357144429|ref|XP_003573289.1| PREDICTED: non-lysosomal glucosylceramidase-like isoform 2
           [Brachypodium distachyon]
          Length = 919

 Score =  186 bits (473), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 105/287 (36%), Positives = 162/287 (56%), Gaps = 15/287 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N Y + D + WKDLN KF++ +YR   +  +  F    +P     +   + +D+
Sbjct: 572 DPWFELNAYMIHDPARWKDLNPKFVLQVYRAVVVTGNAAFAKAAWPAVYLAMAYMDQYDR 631

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+V+N G PDQT          AY GG+ + AL     M+ ++ D   +  F  +  
Sbjct: 632 DGDGMVENEG-PDQTYDLWSVTGVSAYTGGIWVTALQATAAMARIVGDGDAECYFHGRYL 690

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           +A QVY   LW G+Y+++D+S    S+SIMADQL G  +    G+  E +        AL
Sbjct: 691 RAKQVYDAELWNGTYFRYDNSGGETSESIMADQLAGQWYAHVCGL--EPVVEEDKARSAL 748

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
           AT+   NV   + G++GAVNGM+P+G  D S+ QS+E+W+G TY ++AAM++EG  + A+
Sbjct: 749 ATVLDYNVMRVKGGAVGAVNGMRPDGGIDMSSTQSKEIWSGTTYAVAAAMVHEGMPEGAF 808

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
            TA G +   + + G G  F+TPE  T +  YR   YMR L+V+AMQ
Sbjct: 809 RTAKGAHDASWGKAGFGYAFQTPEAWTAEGGYRGLHYMRPLSVWAMQ 855


>gi|242080271|ref|XP_002444904.1| hypothetical protein SORBIDRAFT_07g001140 [Sorghum bicolor]
 gi|241941254|gb|EES14399.1| hypothetical protein SORBIDRAFT_07g001140 [Sorghum bicolor]
          Length = 929

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 112/288 (38%), Positives = 165/288 (57%), Gaps = 15/288 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N Y + D + WKDLN KF++ +YRD     D  F    +P     +     FD+
Sbjct: 575 DPWVEMNEYMLHDPALWKDLNPKFVLQVYRDAVATGDAAFAEAAWPAVYIAMAYMHQFDR 634

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+V+N G PDQT          AY GGL +AAL     M+ ++ D   +  F  +  
Sbjct: 635 DGDGMVENEGIPDQTYDIWSVSGVSAYTGGLWVAALEAAAGMARVVGDRHAEAYFRGRRD 694

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRH-SDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
           +A++VY   LW G+Y+++D+S     S+SIMADQ+ G  + +A G+  E +        A
Sbjct: 695 RAARVYDAELWNGTYFRYDNSGGGACSESIMADQMAGQWYARACGM--EPVVEEGKARSA 752

Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
           LAT+   NV   + G++GAVNGM+P+G  D S+ QS+EVW GVTY ++AAM++EG  + A
Sbjct: 753 LATVLDYNVMRVKGGAVGAVNGMRPDGAVDASSGQSKEVWPGVTYAVAAAMVHEGMHEAA 812

Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           + TA G +   + + G G  F+TPE  T D  YRS  YMR L ++AMQ
Sbjct: 813 FRTAKGAHDAGWGKDGFGYAFQTPEAWTEDGGYRSLHYMRPLGIWAMQ 860


>gi|326507888|dbj|BAJ86687.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 676

 Score =  185 bits (469), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 107/273 (39%), Positives = 153/273 (56%), Gaps = 14/273 (5%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N Y + D + WKDLN KF++ +YRD A   D  F T V+P     +   + FD+
Sbjct: 348 DPWFQLNAYMIHDPARWKDLNTKFVLQVYRDGAATGDAAFATAVWPAVYLAMAYMDQFDR 407

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+V+N G PDQT          AY GGL +AAL     M+ ++ D   +  F  +  
Sbjct: 408 DGDGMVENEGRPDQTYDFWSVSGVSAYTGGLWVAALQAAAVMARVVGDRGSEGYFVERYE 467

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           KA +VY   LW G+Y+ +D+S   +S SIMADQL G  + +A G+  E I        AL
Sbjct: 468 KARRVYDGELWNGAYFDYDNSGGTNSKSIMADQLAGQWYARACGL--EPIVEEEKARSAL 525

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
            T+   NV   + G++GAVNGM+P+G  D S+IQS EVW GVTY ++AAM +EG  + A+
Sbjct: 526 GTVLDYNVMRVQGGAIGAVNGMRPDGTVDTSSIQSREVWPGVTYGVAAAMAHEGMPEAAF 585

Query: 261 TTAGGLYRTVYERTGLGFETPE--GLTGDKTYR 291
            TA G +   + R G G   P+  G+ G +  R
Sbjct: 586 RTAKGAHDAGWGRDGFGVRVPDAGGVDGRRRQR 618


>gi|94968715|ref|YP_590763.1| hypothetical protein Acid345_1688 [Candidatus Koribacter versatilis
           Ellin345]
 gi|94550765|gb|ABF40689.1| protein of unknown function DUF608 [Candidatus Koribacter
           versatilis Ellin345]
          Length = 811

 Score =  183 bits (465), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 104/290 (35%), Positives = 159/290 (54%), Gaps = 15/290 (5%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
           P E+   L N YN   VS WKDLN K+++ ++RDY +   KD  FL   Y    + ++  
Sbjct: 519 PLEDALILPNQYNYQSVSFWKDLNSKYVLMVWRDYVMSGSKDKAFLQYNYNAVKQSMQFL 578

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
             FDK   G+++N G+PDQT          AY G L++AAL    EM+ LL D +     
Sbjct: 579 RQFDKNNDGLIENDGYPDQTYDNWSARGESAYSGSLYLAALRATEEMAKLLGDQRTAADT 638

Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
           +A   +A   +   LW G+Y+ +D   S + D+IMA+QL G  +   +G+    + P   
Sbjct: 639 AALFKRAQDSFVKKLWNGTYFNYDVGSS-YKDAIMAEQLAGQWYASLTGLGD--LVPRDM 695

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
              AL  ++  NV   + G+MGA+NG+  +G+  +   Q+EEVWTGVT+ ++A ML  G 
Sbjct: 696 QHSALKKVYDFNVMKLQNGTMGALNGISASGEVLKDNEQTEEVWTGVTFAVAATMLQNGL 755

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
            +E + TA G+Y  VY++ G  F TPE    +  YR+  YMR  A+++M+
Sbjct: 756 REEGFNTAKGVYNVVYDQKGYWFRTPEAYDTNGMYRASMYMRPGAIWSME 805


>gi|324503709|gb|ADY41606.1| Non-lysosomal glucosylceramidase [Ascaris suum]
          Length = 261

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 100/249 (40%), Positives = 137/249 (55%), Gaps = 12/249 (4%)

Query: 70  FLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM 119
            L R Y     +I   +++D  G  +++N G PDQT          AYCGGL + AL C+
Sbjct: 5   LLRRFYDLSSGIIADAKAWDVDGDDLIENAGQPDQTYDVWSMHGSSAYCGGLWLCALECV 64

Query: 120 VEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLF 179
             M+  L +    Q+F+ KL  A + Y   LW G Y+ FD   + H  SIMADQLCG+ F
Sbjct: 65  RRMALTLGEVVDAQKFANKLNNARKAYERKLWNGKYFDFDEHSTDHK-SIMADQLCGFWF 123

Query: 180 LKAS-GVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEV 238
           +  + G   + I     I  +L TIF  NV  F  G +G VN M P+G  D + IQSEEV
Sbjct: 124 MCITDGKVDDVIITRQQICASLKTIFEYNVEKFANGQLGPVNAMMPSGVVDSTGIQSEEV 183

Query: 239 WTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRA 298
           W GV Y L++  L     + A+ TA G YR+ +ER GL +++PE +     YR+ GYMR 
Sbjct: 184 WGGVAYALASFHLLVEENESAFKTAEGWYRSCWERYGLQYQSPEAINESSYYRAIGYMRP 243

Query: 299 LAVYAMQDA 307
           LA++AMQ A
Sbjct: 244 LAIWAMQSA 252


>gi|344251774|gb|EGW07878.1| Non-lysosomal glucosylceramidase [Cricetulus griseus]
          Length = 226

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 88/184 (47%), Positives = 119/184 (64%), Gaps = 4/184 (2%)

Query: 122 MSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLK 181
           M+ L      Q +FS+ L +  + Y  LLW G YY +DSS    S SIM+DQ  G  FL+
Sbjct: 1   MAVLCGAQDVQDKFSSILCRGREAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLR 60

Query: 182 ASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEV 238
           A G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G  DRS++QS+EV
Sbjct: 61  ACGLGEGDTE-VFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEV 119

Query: 239 WTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRA 298
           W GV Y L+A M+ EG   E + TA G YRTV+ER GL F+TPE     + +RS  YMR 
Sbjct: 120 WVGVVYGLAATMIQEGLTREGFRTAEGCYRTVWERLGLAFQTPEAYCQQQVFRSLAYMRP 179

Query: 299 LAVY 302
           L+++
Sbjct: 180 LSIW 183


>gi|22297737|ref|NP_680984.1| hypothetical protein tlr0193 [Thermosynechococcus elongatus BP-1]
 gi|22293914|dbj|BAC07746.1| tlr0193 [Thermosynechococcus elongatus BP-1]
          Length = 806

 Score =  174 bits (442), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 107/290 (36%), Positives = 153/290 (52%), Gaps = 19/290 (6%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
           P E P+   N     D + WKDL   F++ +YRD+      D+ F    +P  +  +   
Sbjct: 513 PNEHPWEKTNYTAYQDCNLWKDLASDFVLLVYRDFLFTGGTDLNFARECWPAVVAALDHL 572

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
           + FD+ G G+ +NGG PDQT          AYCGGL +AAL   + + +LL   Q +  +
Sbjct: 573 KQFDQDGDGLPENGGAPDQTYDDWKLQGVSAYCGGLWLAALEAAIALGTLLQQPQVEI-Y 631

Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
              L++A   YH LLW G YY+ D+     SD IMADQLCG  + +  G+    I P   
Sbjct: 632 RQWLSQARPRYHQLLWNGEYYRLDTGSG--SDVIMADQLCGQFYAQLLGLVD--IVPPDC 687

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYLLSAAMLYEG 254
             RAL  I+ T    F  G  GA NG+ PNG  +   A    EVWTG+ + L+A +   G
Sbjct: 688 CDRALRKIYDTCFLKFHNGQFGAANGLLPNGQPENPHATHPLEVWTGINFGLAAFLWQRG 747

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
            +DEAW  A  + R +YE  GL F TPE +T + T+R+  Y+R +A++A+
Sbjct: 748 MIDEAWRLAEVVVRQIYE-NGLQFRTPEAITANGTFRACMYLRPMAIWAL 796


>gi|257060089|ref|YP_003137977.1| glucosylceramidase [Cyanothece sp. PCC 8802]
 gi|256590255|gb|ACV01142.1| Glucosylceramidase [Cyanothece sp. PCC 8802]
          Length = 805

 Score =  174 bits (440), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 105/307 (34%), Positives = 155/307 (50%), Gaps = 35/307 (11%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L    D  FL   +P   E +   
Sbjct: 495 PNEHPWQKTNYTSYQDCNLWKDLGSDFVLQVYRDYLLTGSDDTDFLWECWPAITETLDYL 554

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSS-LLNDTQQQQQ 134
           ++FD    GI +N G PDQT          AYCGGL IAAL   ++++  LL +    ++
Sbjct: 555 KTFDLDNDGIPENSGAPDQTFDDWKLQGISAYCGGLWIAALEAAIKIAEILLKNVPTTEE 614

Query: 135 FSAK----------------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
             ++                L ++  +YHD LW G YYK DS     SD +MADQLCG  
Sbjct: 615 LQSRNNPESIKHYVKNHRDWLEQSRSIYHDTLWNGEYYKLDSQSG--SDVVMADQLCGQF 672

Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEE 237
           +  A  +N   +  +     AL  ++      F+ G  GA NGMKP+G   D ++   +E
Sbjct: 673 Y--ARLLNLPDVVETQYTESALNKVYEACFLKFQDGKYGAANGMKPDGTPEDPNSTHPQE 730

Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
           VWTG+ + L+A +L  G  D A+     + + VYE  GL F TPE +T   T+R+  Y+R
Sbjct: 731 VWTGINFGLAAFLLQMGRKDAAFKLTEAVVKQVYE-NGLQFRTPEAITAVGTFRASHYLR 789

Query: 298 ALAVYAM 304
           A+A++ +
Sbjct: 790 AMAIWGI 796


>gi|357139475|ref|XP_003571307.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
           distachyon]
          Length = 946

 Score =  173 bits (439), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 116/292 (39%), Positives = 165/292 (56%), Gaps = 19/292 (6%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N Y + D S WKDLN KF++ +YRD A   D+ F    +P+    +   + FD+
Sbjct: 586 DPWHELNAYMIHDPSRWKDLNPKFVLQVYRDVAATGDLDFARSAWPSVYVALAYMDQFDR 645

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
              G+V+N G PDQT          AY GGL +AAL     M++L+ D   +  F  +  
Sbjct: 646 DRDGMVENEGRPDQTYDLWSVSGVSAYTGGLWVAALRAATAMAALVGDLPAEAVFLERYN 705

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           +A++VY   LWTG Y+++D+S   +S+S+MADQL G  + +A G+  E +       RAL
Sbjct: 706 RANKVYDSELWTGDYFRYDNSGGGNSESVMADQLAGQWYARACGL--EPVVGRDKARRAL 763

Query: 201 ATIFSTNVRGFEAGSMGAVNGMK-PN--GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
           A +   NV   + G +GAVNG + P   G  D S+ QS+EVWTG TY ++AAM+ EG  +
Sbjct: 764 AAVLEHNVMQVQGGGVGAVNGARLPEHGGGVDESSTQSKEVWTGTTYAVAAAMIGEGMRE 823

Query: 258 EAWTTAGGLYRTVYERTGLG--FETPEGLTGDKT--YRSGGYMRALAVYAMQ 305
           E +T A G Y   +   G G  F+ PE  T D    YRS  YMR LAV+AMQ
Sbjct: 824 EGFTAAKGAYGAGWGEDGYGYAFQMPESWTADGAGGYRSLHYMRPLAVWAMQ 875


>gi|218247014|ref|YP_002372385.1| hypothetical protein PCC8801_2201 [Cyanothece sp. PCC 8801]
 gi|218167492|gb|ACK66229.1| protein of unknown function DUF608 [Cyanothece sp. PCC 8801]
          Length = 805

 Score =  173 bits (439), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 104/307 (33%), Positives = 155/307 (50%), Gaps = 35/307 (11%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L    D  FL   +P   E +   
Sbjct: 495 PNEHPWQKTNYTSYQDCNLWKDLGSDFVLQVYRDYLLTGSDDTDFLWECWPAITETLDYL 554

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSS-LLNDTQQQQQ 134
           ++FD    GI +N G PDQT          AYCGGL IAAL   ++++  LL +    ++
Sbjct: 555 KTFDLDNDGIPENSGAPDQTFDDWKLQGISAYCGGLWIAALEAAIKIAEILLKNVPTTEE 614

Query: 135 FSAK----------------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
             ++                L ++  +YHD LW G YYK DS     SD +MADQLCG  
Sbjct: 615 LQSRNNPESIKHYVKNHRDWLEQSRSIYHDTLWNGEYYKLDSQSG--SDVVMADQLCGQF 672

Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEE 237
           + +   +N   +  +     AL  ++      F+ G  GA NGMKP+G   D ++   +E
Sbjct: 673 YARL--LNFPDVVETQYTESALNKVYEACFLKFQDGKYGAANGMKPDGTPEDPNSTHPQE 730

Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
           VWTG+ + L+A +L  G  D A+     + + VYE  GL F TPE +T   T+R+  Y+R
Sbjct: 731 VWTGINFGLAAFLLQMGRKDAAFKLTEAVVKQVYE-NGLQFRTPEAITAVGTFRASHYLR 789

Query: 298 ALAVYAM 304
           A+A++ +
Sbjct: 790 AMAIWGI 796


>gi|37522029|ref|NP_925406.1| hypothetical protein gll2460 [Gloeobacter violaceus PCC 7421]
 gi|35213028|dbj|BAC90401.1| gll2460 [Gloeobacter violaceus PCC 7421]
          Length = 867

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 111/292 (38%), Positives = 152/292 (52%), Gaps = 19/292 (6%)

Query: 28  PEEEPFSLINGYNVF-DVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTC-LELIR 83
           P E+P+ + N Y  + D + WKDL   F++ +YRDY     +D+ F+   +P     L+R
Sbjct: 564 PNEDPW-IKNNYTTYQDCNLWKDLPCDFVLQVYRDYVDTGKRDLGFVRYCWPASKAALVR 622

Query: 84  KCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQ 133
              +FD+ G G  +NGG PD T          AYCGGL +AAL   VEM  L+ D    Q
Sbjct: 623 LKTTFDQDGDGFPENGGPPDCTYDAWPLKGISAYCGGLWLAALLAAVEMGKLVGDAAAVQ 682

Query: 134 QFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPS 193
           QF A   +A  +Y   LW G YY+ D SQS++S++IMADQLCG  +  A   N   I P 
Sbjct: 683 QFGAWYAQAQPLYEKTLWNGRYYRID-SQSKNSEAIMADQLCGEYY--AQVCNLADIVPE 739

Query: 194 ANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYE 253
                AL T++ T    F  G  G  NG   +G          EVWTG+ + L+A M+  
Sbjct: 740 PQARSALETVYQTCFVKFYGGRFGCANGTNADGSFIGDTEHPSEVWTGINFGLAAFMIRN 799

Query: 254 GNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
           G   E    A  +   VY   GL F TPE LT  +T+R+  Y+R +A++AMQ
Sbjct: 800 GMRREGMAIAEAVVANVYG-GGLQFRTPEALTPARTFRACMYLRPMAIWAMQ 850


>gi|290990718|ref|XP_002677983.1| predicted protein [Naegleria gruberi]
 gi|284091593|gb|EFC45239.1| predicted protein [Naegleria gruberi]
          Length = 937

 Score =  171 bits (432), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 99/313 (31%), Positives = 163/313 (52%), Gaps = 27/313 (8%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPT-CLELIRKCE 86
           P E P+ L+N YN  DV+ WKDLN +FI++I+RD+ + K+  FL   +P+  L L     
Sbjct: 625 PYERPWKLVNAYNFQDVNKWKDLNSQFILTIFRDFMITKNREFLEEAFPSIILALNYSLT 684

Query: 87  SFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFS 136
           +FDK G G+++N  FPD T          AY G L IA+L  ++EM+  L+D   +Q+  
Sbjct: 685 NFDKDGDGLIENESFPDTTYDAWKVTGVSAYSGMLWIASLKAILEMAKELDDENTRQKVE 744

Query: 137 AKLTKASQVYHDLLW--TGSYYKFDSSQSRHSDSIMADQLCGYLFL-------KASGVNT 187
             +    + +   LW  T  YY +D+S     DSIM+D L G   L       K SG   
Sbjct: 745 PLIESGLKTFESKLWDETNQYYHYDASTEPQHDSIMSDHLHGTFMLLLIGNYVKNSGRCE 804

Query: 188 EA----IFPSANIARALATIFSTNVRGFEAGSM--GAVNGMKPNGDRDRSAIQSEEVWTG 241
           ++     F    ++++L  I   N   ++  +   G VNGM+P  + D++++QS E+WTG
Sbjct: 805 DSHYLIPFNMERVSKSLNKILENNFVKYQQITKLGGCVNGMRPTQEVDKTSLQSREMWTG 864

Query: 242 VTYLLSAAMLYEGNVDEAWTTAGGLYRTVY-ERTGLGFETPEGLTGDKTYRSGGYMRALA 300
            +Y+++A  +  G  +E       ++   +   +   F+TPE +T    YR+ GY R L+
Sbjct: 865 TSYVVAALTILLGRREEGIDLMKSVFDKCWSSNSAFWFQTPEAVTETGEYRALGYCRPLS 924

Query: 301 VYAMQDAYLKGKV 313
           +++   A +  K+
Sbjct: 925 IWSCLSAMMNKKI 937


>gi|332711224|ref|ZP_08431157.1| putative bile acid beta-glucosidase [Moorea producens 3L]
 gi|332350038|gb|EGJ29645.1| putative bile acid beta-glucosidase [Moorea producens 3L]
          Length = 815

 Score =  170 bits (431), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 103/307 (33%), Positives = 155/307 (50%), Gaps = 35/307 (11%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHK--DIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRD+  +   DI FL   + + ++ +   
Sbjct: 505 PNEHPWQQTNYTSYQDCNQWKDLASDFVLQVYRDFVFYSATDIEFLWECWSSIVKALAYL 564

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLN-------- 127
           + FD+ G GI +N G PDQT          AYCGGL +AAL   + +  +L         
Sbjct: 565 KRFDQDGDGIPENYGAPDQTFDDWRLQGVSAYCGGLWLAALESAIAIGEILTKNYPPTSP 624

Query: 128 -------DTQQQ--QQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
                  D+ Q     +   L +A  +Y + LW G YY+ DS     SD +MADQLCG  
Sbjct: 625 FVEVPELDSIQDTIDCYRQWLAQAMPIYQEKLWNGQYYQLDSESG--SDVVMADQLCGQF 682

Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEE 237
           + +  G+    I P      AL T++      F  G +GAVNG++P+G   D  A    E
Sbjct: 683 YAQLLGLPD--IVPPECTLSALKTVYDACFLKFHQGQLGAVNGVRPDGTPEDPDATHPME 740

Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
           VWTG+ + L+A ++ +G  DEA      + R VY+  GL F TPE +T   T+R+  Y+R
Sbjct: 741 VWTGINFGLAAFLIQQGMKDEALRITEAVVRQVYDH-GLQFRTPEAITAAGTFRASHYLR 799

Query: 298 ALAVYAM 304
           A+ ++A+
Sbjct: 800 AMGIWAV 806


>gi|425455967|ref|ZP_18835678.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9807]
 gi|389803038|emb|CCI17984.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9807]
          Length = 784

 Score =  170 bits (431), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 100/290 (34%), Positives = 151/290 (52%), Gaps = 20/290 (6%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L   KD  FL   +P+ +  ++  
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
           ++FD    GI +N G PDQT          AYCGGL IAAL   +++ ++L   +    F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATLKIGAILG--EDTAIF 610

Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
           ++ L ++  +YHD LW G YY  DS     SD +M DQLCG  + +   ++   +  +  
Sbjct: 611 ASWLQQSRAIYHDTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYARL--LSLPDVVENQY 666

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
              AL+ I+      F  G  GA NG+KP+G   +  A    EVW G+ + L A +L  G
Sbjct: 667 TQSALSKIYEACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMG 726

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
              + W     + R +YE  GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVRQIYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775


>gi|428308994|ref|YP_007119971.1| bile acid beta-glucosidase [Microcoleus sp. PCC 7113]
 gi|428250606|gb|AFZ16565.1| putative bile acid beta-glucosidase [Microcoleus sp. PCC 7113]
          Length = 819

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 99/307 (32%), Positives = 155/307 (50%), Gaps = 35/307 (11%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRD+      D  FL   + + ++ +   
Sbjct: 512 PNEHPWEQTNYTSYQDCNQWKDLPCDFVLQVYRDFVFTGANDTEFLWECWSSVVKTLAYL 571

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL--------- 126
           ++FDK G GI +N G PDQT          AYCGGL +AAL   + M  +L         
Sbjct: 572 KTFDKDGDGIPENSGAPDQTFDDWRLQGISAYCGGLWLAALEAAIAMGEILINHYPPTSE 631

Query: 127 ----NDTQQQQQ----FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
                D +  Q+    +   L ++  +Y + LW G YY+ DS     SD +M DQLCG  
Sbjct: 632 LIEVPDPESIQETLKLYRQWLEQSQPIYQEKLWNGQYYRLDSESG--SDIVMTDQLCGQF 689

Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEE 237
           + +  G+    I P   +  AL +++ +    F+   +GA NG++P+G  +   A    E
Sbjct: 690 YAQLLGLPD--IVPPECVQSALKSVYDSCFLKFQDAQLGAANGVRPDGSPENPDATHPLE 747

Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
           VWTG+ + L+A ++ +G   EA      + R VYE  GL F TPE +T   T+R+  Y+R
Sbjct: 748 VWTGINFGLAAFLVQQGMKQEALQLTEAVVRQVYE-NGLQFRTPEAITAAGTFRASHYLR 806

Query: 298 ALAVYAM 304
           A+A++A+
Sbjct: 807 AMAIWAV 813


>gi|425471589|ref|ZP_18850441.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9701]
 gi|389882505|emb|CCI37035.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9701]
          Length = 784

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 99/290 (34%), Positives = 149/290 (51%), Gaps = 20/290 (6%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L   KD  FL   +P+ +  ++  
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
           ++FD    GI +N G PDQT          AYCGGL IAAL   +++ ++L   +  + F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATIKIGAILG--EDTEIF 610

Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
           ++ L ++  +YH  LW G YY  DS     SD +M DQLCG  + +   +    +  +  
Sbjct: 611 ASWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYARLLAL--PDVVENQY 666

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
              AL  I+      F  G  GA NG+KP+G   +  A    EVW G+ + L A +L  G
Sbjct: 667 TQSALGKIYQACFLKFHGGKFGAANGLKPDGTPENPQATHPLEVWVGINFGLVAFLLQMG 726

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
              + W     + + VYE  GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVKQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775


>gi|425450263|ref|ZP_18830094.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 7941]
 gi|389768962|emb|CCI06046.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 7941]
          Length = 784

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 99/290 (34%), Positives = 149/290 (51%), Gaps = 20/290 (6%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L   KD  FL   +P+ +  ++  
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
           ++FD    GI +N G PDQT          AYCGGL IAAL   +++ ++L   +    F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATIKIGAILG--EDTAIF 610

Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
           ++ L ++  +YH  LW G YY  DS     SD +M DQLCG  + +   +    +  +  
Sbjct: 611 ASWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYARLLAL--PDVVENQY 666

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
              AL+ I+      F  G  GA NG+KP+G   +  A    EVW G+ + L A +L  G
Sbjct: 667 TQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMG 726

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
              + W     + R +YE  GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 727 MEKQGWQITQAVVRQIYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775


>gi|422303239|ref|ZP_16390593.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9806]
 gi|389791807|emb|CCI12404.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9806]
          Length = 784

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 98/290 (33%), Positives = 150/290 (51%), Gaps = 20/290 (6%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L   KD  FL   +P+ +  ++  
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
           ++FD    GI +N G PDQT          AYCGGL IAAL   +++  ++   +    F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATIKIGEIIG--EDTAIF 610

Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
           ++ L ++  +YH  LW G YY  DS     SD +M DQLCG  + +   ++   +  +  
Sbjct: 611 TSWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYARL--LSLPDVVENQY 666

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
              AL+ I+ +    F  G  GA NG+KP+G   +  A    EVW G+ + L A ++  G
Sbjct: 667 TQSALSKIYQSCFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLMQMG 726

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
              + W     + R VYE  GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVRQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775


>gi|425435596|ref|ZP_18816046.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9432]
 gi|389679839|emb|CCH91411.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9432]
          Length = 784

 Score =  167 bits (422), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 99/290 (34%), Positives = 149/290 (51%), Gaps = 20/290 (6%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L   KD  FL   +P+ +  ++  
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
           ++FD    GI +N G PDQT          AYCGGL IAAL   +++ ++L   +    F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATLKIGAILG--EDTAIF 610

Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
           ++ L ++  +YH  LW G YY  DS     SD +M DQLCG  + +   +    +  +  
Sbjct: 611 ASWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYARLLAL--PDVVENQY 666

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
              AL+ I+      F  G  GA NG+KP+G   +  A    EVW G+ + L A +L  G
Sbjct: 667 TQSALSKIYEACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMG 726

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
              + W     + R +YE  GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVRQIYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775


>gi|427727368|ref|YP_007073605.1| putative bile acid beta-glucosidase [Nostoc sp. PCC 7524]
 gi|427363287|gb|AFY46008.1| putative bile acid beta-glucosidase [Nostoc sp. PCC 7524]
          Length = 803

 Score =  167 bits (422), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 100/297 (33%), Positives = 153/297 (51%), Gaps = 25/297 (8%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E  +   N  +  D + WKDL   F++ +YRDY      D+ FL   +   ++ +   
Sbjct: 504 PNEHVWQKTNYTSYQDCNLWKDLGCDFVLQVYRDYLFTGANDVEFLQDCWDAIVQTLDYV 563

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQ----- 130
           ++FD  G GI +N G PDQT          AYCGGL +AAL   + +S +L+  Q     
Sbjct: 564 KTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWMAALEAAIAISEILSQHQPDSAL 623

Query: 131 --QQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
             Q+  ++  L K+  +Y + LW G YY+ DS+    SD +MADQLCG  +  A+ +N  
Sbjct: 624 LKQKSIYATWLEKSLPIYQEKLWNGQYYRLDSNSG--SDVVMADQLCGQFY--AALLNLP 679

Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYLLS 247
            I P      AL T++      F  G  GA NG++P+G  +   A    EVWTG+ + L+
Sbjct: 680 DIVPGDRAISALRTVYDACFLKFYNGQFGAANGVRPDGSPENPQATHPLEVWTGINFGLA 739

Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           A ++  G  DEA      + + +Y   GL F TPE +T   T+R+  Y+RA+A++ +
Sbjct: 740 AFLVQMGMKDEALRLTEAVVQQIYSN-GLQFRTPEAITPSGTFRASTYLRAMAIWGV 795


>gi|425442611|ref|ZP_18822852.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9717]
 gi|389716273|emb|CCH99457.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9717]
          Length = 784

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 98/290 (33%), Positives = 149/290 (51%), Gaps = 20/290 (6%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L   KD  FL   +P+ +  ++  
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
           ++FD    GI +N G PDQT          AYCGGL IAAL   +++  ++   +    F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATIKIGEIIG--EDTAIF 610

Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
           ++ L ++  +YH  LW G YY  DS     SD +M DQLCG  + +   ++   +  +  
Sbjct: 611 TSWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYARL--LSLPDVVENQY 666

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
              AL+ I+      F  G  GA NG+KP+G   +  A    EVW G+ + L A +L  G
Sbjct: 667 TQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMG 726

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
              + W     + + VYE  GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVKQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775


>gi|440753145|ref|ZP_20932348.1| hypothetical protein O53_1523 [Microcystis aeruginosa TAIHU98]
 gi|440177638|gb|ELP56911.1| hypothetical protein O53_1523 [Microcystis aeruginosa TAIHU98]
          Length = 789

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 99/290 (34%), Positives = 149/290 (51%), Gaps = 20/290 (6%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L   KD  FL   +P+ +  ++  
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
           ++FD    GI +N G PDQT          AYCGGL IAAL   +++ ++L   +    F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATLKIGAILG--EDTAIF 610

Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
           ++ L ++  +YH  LW G YY  DS     SD +M DQLCG  + +   +    +  +  
Sbjct: 611 ASWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYARLLAL--PDVVENQY 666

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
              AL+ I+      F  G  GA NG+KP+G   +  A    EVW G+ + L A +L  G
Sbjct: 667 TQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMG 726

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
              + W     + + VYE  GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVKQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775


>gi|443669591|ref|ZP_21134796.1| hypothetical protein C789_5336 [Microcystis aeruginosa DIANCHI905]
 gi|159029778|emb|CAO87856.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443330115|gb|ELS44858.1| hypothetical protein C789_5336 [Microcystis aeruginosa DIANCHI905]
          Length = 784

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 99/290 (34%), Positives = 148/290 (51%), Gaps = 20/290 (6%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L   KD  FL   +P+ +  ++  
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
           ++FD    GI +N G PDQT          AYCGGL IAAL   +++  ++   +    F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATLKIGEIIG--EDTAIF 610

Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
           ++ L ++  +YH  LW G YY  DS     SD +M DQLCG  + +   +    +  +  
Sbjct: 611 TSWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYARLLAL--PDVVENQY 666

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
              AL+ I+      F  G  GA NG+KP+G   +  A    EVW G+ + L A +L  G
Sbjct: 667 TQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMG 726

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
              + W     + R VYE  GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVRQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775


>gi|425447791|ref|ZP_18827773.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9443]
 gi|389731620|emb|CCI04385.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9443]
          Length = 784

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 99/290 (34%), Positives = 150/290 (51%), Gaps = 20/290 (6%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L   KD  FL   +P+ +  ++  
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
           ++FD    GI +N G PDQT          AYCGGL IAAL   +++ ++L   +    F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATLKIGAILG--EDTAIF 610

Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
           ++ L ++  +YH  LW G YY  DS     SD +M DQLCG  +  A  ++   +  +  
Sbjct: 611 ASWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFY--ARLLSLPDVVENQY 666

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
              AL+ I+      F  G  GA NG+KP+G   +  A    EVW G+ + L A +L  G
Sbjct: 667 TQSALSKIYEACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMG 726

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
              + W     + + +YE  GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVQQIYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775


>gi|425464233|ref|ZP_18843555.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9809]
 gi|389833806|emb|CCI21373.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9809]
          Length = 784

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 98/290 (33%), Positives = 149/290 (51%), Gaps = 20/290 (6%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L   KD  FL   +P+ +  ++  
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
           ++FD    GI +N G PDQT          AYCGGL IAAL   +++ ++L   +    F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATIKIGAILG--EDTAIF 610

Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
           ++ L ++  +YH  LW G YY  DS     SD +M DQLCG  + +   +    +  +  
Sbjct: 611 ASWLQQSRTIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYSRLLAL--PDVVENQY 666

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
              AL+ I+      F  G  GA NG+KP+G   +  A    EVW G+ + L A ++  G
Sbjct: 667 TQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPQATHPLEVWVGINFGLVAFLMQMG 726

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
              + W     + + VYE  GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVKQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775


>gi|166363582|ref|YP_001655855.1| glycosidase-like protein [Microcystis aeruginosa NIES-843]
 gi|166085955|dbj|BAG00663.1| glycosidase-like protein [Microcystis aeruginosa NIES-843]
          Length = 784

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 98/290 (33%), Positives = 150/290 (51%), Gaps = 20/290 (6%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L   KD  FL   +P+ +  ++  
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
           ++FD    GI +N G PDQT          AYCGGL IAAL   +++ ++L   +  + F
Sbjct: 553 KTFDLDKDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATLKIGAILG--EDTEIF 610

Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
           ++ L ++  +YH  LW G YY  DS     SD +M DQLCG  + +   +    +  +  
Sbjct: 611 ASWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYSRLLAL--PDVVENQY 666

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
              AL+ I+      F  G  GA NG+KP+G   +  A    EVW G+ + L A ++  G
Sbjct: 667 TQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPQATHPLEVWVGINFGLVAFLMQMG 726

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
              + W     + + VYE  GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVKQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775


>gi|390437958|ref|ZP_10226465.1| Glycosidase-like protein [Microcystis sp. T1-4]
 gi|389838618|emb|CCI30589.1| Glycosidase-like protein [Microcystis sp. T1-4]
          Length = 784

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 99/290 (34%), Positives = 148/290 (51%), Gaps = 20/290 (6%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L   KD  FL   +P+ +  ++  
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
           ++FD    GI +N G PDQT          AYCGGL IAAL   +++ ++L   +    F
Sbjct: 553 KTFDLDKDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATLKIGAILG--EDTAIF 610

Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
           +  L ++  +YH  LW G YY  DS     SD +M DQLCG  + +   +    +  +  
Sbjct: 611 AGWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYARLLAL--PDVVENQY 666

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
              AL+ I+      F  G  GA NG+KP+G   +  A    EVW G+ + L A ++  G
Sbjct: 667 TQSALSKIYQACFLKFHEGKFGAANGLKPDGTPENPQATHPLEVWVGINFGLVAFLMQMG 726

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
              + W     + R VYE  GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVRQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775


>gi|119493962|ref|ZP_01624522.1| hypothetical protein L8106_04106 [Lyngbya sp. PCC 8106]
 gi|119452283|gb|EAW33479.1| hypothetical protein L8106_04106 [Lyngbya sp. PCC 8106]
          Length = 799

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 104/303 (34%), Positives = 154/303 (50%), Gaps = 30/303 (9%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRD+ L   +D  FL   +   +E +   
Sbjct: 499 PNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDFVLTGSEDYEFLQDCWLAIVETLDYL 558

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
           + FDK   GI +N G PDQT          AYCGGL +AAL   +E+   L  T+     
Sbjct: 559 KIFDKDEDGIPENSGAPDQTFDDWKLQGVSAYCGGLWLAALEAAIEIGKCLQKTEFSDNV 618

Query: 136 SAK-----------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASG 184
           S             L KA  +Y +L W G Y++ DS     SD +MADQLCG  + K  G
Sbjct: 619 SPSISDKISTYETWLEKAKPIYQNL-WNGQYFQLDSQSG--SDVVMADQLCGQFYAKLLG 675

Query: 185 VNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVT 243
           +  E I        AL TI+ +  + F  G  GA NG++ NG+  + +     EVWTG+ 
Sbjct: 676 L--EDIVSPEKTISALQTIYQSCFQNFYQGQFGAANGVRINGEPINPNDTHPLEVWTGIN 733

Query: 244 YLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYA 303
           + L+A ++  G  +E +  A  + + +YE  GL F TPE +T   T+R+  Y+RA+A++A
Sbjct: 734 FGLAAFLIQMGMKEEGFKLAEVVVQQIYE-NGLQFRTPEAITAAGTFRASHYLRAMAIWA 792

Query: 304 MQD 306
           + D
Sbjct: 793 IYD 795


>gi|425460273|ref|ZP_18839755.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9808]
 gi|389827051|emb|CCI21967.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9808]
          Length = 784

 Score =  163 bits (413), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 98/290 (33%), Positives = 148/290 (51%), Gaps = 20/290 (6%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L   KD  FL   + + +  ++  
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWSSIVLTLQYL 552

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
           ++FD    GI +N G PDQT          AYCGGL IAAL   +++ ++L   +    F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATLKIGAILG--EDTAIF 610

Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
           ++ L ++  +YH  LW G YY  DS     SD +M DQLCG  + +   +    +  +  
Sbjct: 611 ASWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYARLLAL--PDVVENQY 666

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
              AL+ I+      F  G  GA NG+KP+G   +  A    EVW G+ + L A +L  G
Sbjct: 667 TQSALSKIYEACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMG 726

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
              + W     + R +YE  GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVRQIYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775


>gi|416392732|ref|ZP_11685894.1| hypothetical protein CWATWH0003_2700 [Crocosphaera watsonii WH
           0003]
 gi|357263592|gb|EHJ12579.1| hypothetical protein CWATWH0003_2700 [Crocosphaera watsonii WH
           0003]
          Length = 804

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 101/307 (32%), Positives = 154/307 (50%), Gaps = 35/307 (11%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRD+ L    DI FL   +    E +   
Sbjct: 495 PNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFLLTGSDDIDFLWECWAAIPETLNYL 554

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL-----NDTQ 130
           + FD    GI +N G PDQT          AYCGGL IAAL   +++  +L     N+ Q
Sbjct: 555 KEFDLDNDGIPENSGAPDQTFDDWKLSGISAYCGGLWIAALEAAIKIGEILLENSPNNPQ 614

Query: 131 QQQQ------------FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
            + +            ++  L ++  +YH  LW G YY  DS     SD +MADQLCG  
Sbjct: 615 LEPENYPESIEKELDKYNNWLQQSRAIYHSTLWNGEYYNLDSKSG--SDIVMADQLCGQF 672

Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEE 237
           + +  G++   +  S     AL  ++      F+ G  GA NGMKP+G   D ++   +E
Sbjct: 673 YARLLGLSD--VVESEYTLSALNKVYEACFLKFQNGKYGAANGMKPDGTPEDPNSTHPQE 730

Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
           VWTG+ + +++ ++  G  +EA      + + VYE  GL F TPE +T   TYR+  Y+R
Sbjct: 731 VWTGINFGIASFLIQMGMKNEALKLTETVVKQVYE-NGLQFRTPEAITAVGTYRACHYLR 789

Query: 298 ALAVYAM 304
           A+A++ +
Sbjct: 790 AMAIWGV 796


>gi|67922703|ref|ZP_00516206.1| Protein of unknown function DUF608 [Crocosphaera watsonii WH 8501]
 gi|67855413|gb|EAM50669.1| Protein of unknown function DUF608 [Crocosphaera watsonii WH 8501]
          Length = 804

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 101/307 (32%), Positives = 154/307 (50%), Gaps = 35/307 (11%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRD+ L    DI FL   +    E +   
Sbjct: 495 PNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFLLTGSDDIDFLWECWAAIPETLNYL 554

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL-----NDTQ 130
           + FD    GI +N G PDQT          AYCGGL IAAL   +++  +L     N+ Q
Sbjct: 555 KEFDLDNDGIPENSGAPDQTFDDWKLSGISAYCGGLWIAALEAAIKIGEILLENSPNNPQ 614

Query: 131 QQQQ------------FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
            + +            ++  L ++  +YH  LW G YY  DS     SD +MADQLCG  
Sbjct: 615 LEPENYPESIEKELDKYNNWLQQSRAIYHSTLWNGEYYNLDSKSG--SDIVMADQLCGQF 672

Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEE 237
           + +  G++   +  S     AL  ++      F+ G  GA NGMKP+G   D ++   +E
Sbjct: 673 YARLLGLSD--VVESEYTLSALNKVYEACFLKFQNGKYGAANGMKPDGTPEDPNSTHPQE 730

Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
           VWTG+ + +++ ++  G  +EA      + + VYE  GL F TPE +T   TYR+  Y+R
Sbjct: 731 VWTGINFGIASFLIQMGMKNEALKLTETVVKQVYE-NGLQFRTPEAITAVGTYRACHYLR 789

Query: 298 ALAVYAM 304
           A+A++ +
Sbjct: 790 AMAIWGV 796


>gi|162452686|ref|YP_001615053.1| hypothetical protein sce4410 [Sorangium cellulosum So ce56]
 gi|161163268|emb|CAN94573.1| hypothetical protein sce4410 [Sorangium cellulosum So ce56]
          Length = 782

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 98/291 (33%), Positives = 150/291 (51%), Gaps = 16/291 (5%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKCES 87
           E  F   N Y   D ++WKDLN K ++ +YRD+AL    D  FL   +P     ++K + 
Sbjct: 492 ESVFRKWNAYTYRDSTNWKDLNSKLVLMVYRDWALTGKTDKAFLDYCWPAVKMAMQKVKG 551

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
            D  G G+ ++ G  DQT          AYCGGL IAA     E+++ + D      +  
Sbjct: 552 QDGDGDGLPESNGI-DQTYDDMDLHGNTAYCGGLFIAASEAAKELATAVGDASLASTYQT 610

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIA 197
            + ++   +   LWTGSYYK D+  SR ++ IM+DQL G  + +A G+    I   ++ A
Sbjct: 611 WVEQSKGGFESKLWTGSYYKIDTG-SRDTNRIMSDQLAGQWYARALGL--PPIVDPSHAA 667

Query: 198 RALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
            A   I+  N + F+ G+ G VN M  NG  D ++ Q+ E W G ++ + A M+ EG   
Sbjct: 668 SAFTKIYENNFKRFDGGTRGVVNVMTANGSVDGTSNQTRECWVGTSWGVVAGMIQEGLAA 727

Query: 258 EAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAY 308
           +A      L  T+++  GL F TPE   G+ + R+  YMRA  ++A + AY
Sbjct: 728 QAGEIGASLVDTIWKTDGLWFRTPEAWEGNGSIRAPYYMRATTLWAAKHAY 778


>gi|320105716|ref|YP_004181306.1| glucosylceramidase [Terriglobus saanensis SP1PR4]
 gi|319924237|gb|ADV81312.1| Glucosylceramidase [Terriglobus saanensis SP1PR4]
          Length = 794

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 98/290 (33%), Positives = 145/290 (50%), Gaps = 17/290 (5%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           PE +PF ++N     D +DWKDLN KF++ +YRDY L    D  FL   +P     I+  
Sbjct: 504 PEGDPFYVVNEPGWQDTNDWKDLNSKFVLMVYRDYVLTGRTDKAFLQETWPAVKAAIQYL 563

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
             FD  G G+ +N G+PDQT          AY GGL + AL    E + +L D   Q ++
Sbjct: 564 RQFD-HGRGVPENSGYPDQTYDSWVVRGVSAYSGGLWLGALRAGEETARVLGDWTAQAEY 622

Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
            A    A + Y   LW G Y+ +D+  S +   I ADQL G  +   +G+    + P A 
Sbjct: 623 QALFAMAQKTYVATLWNGEYFLYDTDAS-NKGVIQADQLAGQWYAHMTGLGD--LVPRAM 679

Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
               L  I++ NV  F  G++GAVNGM  +G    + +++ EVW G T   +A +   G 
Sbjct: 680 QRSVLKKIYAFNVNQFGDGNLGAVNGMNADGTVVDN-VEAREVWAGTTLGYAALLKSLGI 738

Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
            +E++    GL+  +YE  G  F TPE       +R+  YMR   ++ M+
Sbjct: 739 EEESYHVVHGLFHVIYESKGYWFRTPEAWDVTGNFRASMYMRPAGLWGME 788


>gi|209526014|ref|ZP_03274547.1| protein of unknown function DUF608 [Arthrospira maxima CS-328]
 gi|376002240|ref|ZP_09780079.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|423067988|ref|ZP_17056778.1| hypothetical protein SPLC1_S630380 [Arthrospira platensis C1]
 gi|209493540|gb|EDZ93862.1| protein of unknown function DUF608 [Arthrospira maxima CS-328]
 gi|375329367|emb|CCE15832.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|406710526|gb|EKD05737.1| hypothetical protein SPLC1_S630380 [Arthrospira platensis C1]
          Length = 799

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 106/299 (35%), Positives = 155/299 (51%), Gaps = 27/299 (9%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L    DI+FL   +P  +E +   
Sbjct: 499 PNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDYLLTGADDIQFLVECWPAIVETLDYL 558

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL--------- 126
           ++FD+   GI +NGG PDQT          AYCGGL +AAL   +E++++L         
Sbjct: 559 KTFDRDQDGIPENGGAPDQTFDDWRLVGISAYCGGLWLAALEAAIEIANILLSHNRDLTP 618

Query: 127 NDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVN 186
           + T+    ++  L  A  +Y   LW GSYY  DS     SD +MADQLCG  +    G+ 
Sbjct: 619 DTTKAIATWTNWLETAKPLYDQTLWNGSYYNLDSQSG--SDVVMADQLCGQFYAALLGL- 675

Query: 187 TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYL 245
              I P      AL TI+    + F  G  GA NG++  G+  + +     EVWTG+ + 
Sbjct: 676 -PDIVPPHRAQVALQTIYQACFQNFHNGKFGAANGVRTTGEPMNPNDTHPLEVWTGINFG 734

Query: 246 LSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           L A  +  G  +EA+  A  +   VY+  GL F TPE +T   T+R+  Y+RA+A++AM
Sbjct: 735 LGAFFIQMGMKEEAFQLAEAVITQVYQ-NGLQFRTPEAITAAGTFRASHYLRAMAIWAM 792


>gi|334116939|ref|ZP_08491031.1| Glucosylceramidase [Microcoleus vaginatus FGP-2]
 gi|333461759|gb|EGK90364.1| Glucosylceramidase [Microcoleus vaginatus FGP-2]
          Length = 818

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 102/299 (34%), Positives = 153/299 (51%), Gaps = 27/299 (9%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRD+ L    D  FL   +P  +E +   
Sbjct: 517 PNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDFLLTGGTDSEFLQECWPAIVEALAYL 576

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQ--- 132
           ++FD+ G GI +N G PDQT          AYCGGL +AAL   + +  +L D   Q   
Sbjct: 577 KTFDEDGDGIPENSGAPDQTFDDWQMRGISAYCGGLWLAALEAAIAIGEILEDAAPQIIP 636

Query: 133 ------QQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVN 186
                   + + L +A   Y + LW G YY+ DS    +S+ +MADQLCG  + K  G+ 
Sbjct: 637 NPQAKIALYHSWLLQARPSYQEKLWNGQYYRLDSES--NSEVVMADQLCGQFYAKLLGL- 693

Query: 187 TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYL 245
              I P      AL T++ +    F  G  GA NG+  NG   + +A    EVWTG+ + 
Sbjct: 694 -PDIVPPECAVSALETVYESCFLKFNEGEFGAANGVMLNGSPENPNATHPLEVWTGINFG 752

Query: 246 LSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           L+A ++  G  D+A+     +   +YE  GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 753 LAAFLVQMGMEDKAFKLTDAVVNQIYE-NGLQFRTPEAITAGGTFRASHYLRAMAIWAI 810


>gi|172036524|ref|YP_001803025.1| hypothetical protein cce_1609 [Cyanothece sp. ATCC 51142]
 gi|354553307|ref|ZP_08972614.1| Glucosylceramidase [Cyanothece sp. ATCC 51472]
 gi|171697978|gb|ACB50959.1| unknown [Cyanothece sp. ATCC 51142]
 gi|353555137|gb|EHC24526.1| Glucosylceramidase [Cyanothece sp. ATCC 51472]
          Length = 818

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 102/312 (32%), Positives = 151/312 (48%), Gaps = 35/312 (11%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRD+ L    DI FL   +    E +   
Sbjct: 509 PNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFLLTGSDDIEFLWECWEAIPETLNYL 568

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSS-LLNDTQQQQQ 134
           ++FD    GI +N G PDQT          AYCG L IAAL   +++   LL +  Q  Q
Sbjct: 569 KAFDLDNDGIPENSGAPDQTFDDWELRGISAYCGALWIAALEAAIKIGEILLENAPQNPQ 628

Query: 135 FSAK----------------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
              +                L ++  +YH  LW G YYK DS     SD +MADQL G  
Sbjct: 629 LEPENYPKSIEKELDKYQQWLQQSRAIYHSTLWNGEYYKLDSESG--SDVVMADQLSGQF 686

Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEE 237
           + +  G+    +        AL  ++      F+ G  GA NGMKP+G   D ++   +E
Sbjct: 687 YARLLGL--PDVVEQQYALSALKKVYEACFLKFQNGKYGAANGMKPDGTPEDPNSTHPQE 744

Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
           VWTG+ + L++ ++  G  DEA      + + VYE  GL F TPE +T   T+R+  Y+R
Sbjct: 745 VWTGINFGLASFLIQMGMKDEALKLTEAVVKQVYE-NGLQFRTPEAITAVGTFRACHYLR 803

Query: 298 ALAVYAMQDAYL 309
           A+A++ +   +L
Sbjct: 804 AMAIWGVYYQFL 815


>gi|427712531|ref|YP_007061155.1| putative bile acid beta-glucosidase [Synechococcus sp. PCC 6312]
 gi|427376660|gb|AFY60612.1| putative bile acid beta-glucosidase [Synechococcus sp. PCC 6312]
          Length = 799

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 102/295 (34%), Positives = 157/295 (53%), Gaps = 23/295 (7%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L    D+ FLT  +P  +  +   
Sbjct: 499 PNEHPWQASNYTSYQDCNLWKDLASDFVLLVYRDYLLTGATDLEFLTDCWPAIVSSLDYL 558

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL----NDTQ- 130
           ++FD  G G+ +N G PDQT          AYCGGL IAAL   + ++ +L    N+T+ 
Sbjct: 559 KTFDTDGDGLPENSGAPDQTYDDWQLKGVSAYCGGLWIAALEAALAIAQILEKQGNNTKL 618

Query: 131 QQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAI 190
           +Q+++ A L+++   Y   LW G +Y+ DS    +S  +MADQLCG  + +   +N   +
Sbjct: 619 EQEKWQAWLSQSRPTYQAKLWNGQFYRLDSDS--NSQVVMADQLCGQYYARL--LNLPDV 674

Query: 191 FPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR-SAIQSEEVWTGVTYLLSAA 249
            P      AL T++      F  G  GA NG+ P+G  ++ +A    EVW G+ Y L+A 
Sbjct: 675 VPIECAESALKTVYDVCFLKFHEGQFGAANGLLPDGSPEKPNATHPLEVWIGINYGLAAF 734

Query: 250 MLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           +   G  D AW     + R +Y+  GL F TPE +T   T+R+  Y+R +A++AM
Sbjct: 735 LYQMGLEDGAWRLTQTVIRQIYD-NGLQFRTPEAITAVGTFRACMYLRPMAIWAM 788


>gi|443320525|ref|ZP_21049620.1| putative bile acid beta-glucosidase [Gloeocapsa sp. PCC 73106]
 gi|442789771|gb|ELR99409.1| putative bile acid beta-glucosidase [Gloeocapsa sp. PCC 73106]
          Length = 806

 Score =  160 bits (406), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 102/307 (33%), Positives = 154/307 (50%), Gaps = 35/307 (11%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL+  F++ +YRD+ L   +D  FL   +P+  + +   
Sbjct: 495 PNEHPWEKTNYTSYQDCNLWKDLSSDFVLQVYRDFVLTGKEDTDFLWECWPSITQTLNYL 554

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDT---QQQ 132
           +SFD  G GI +N G PDQT          AYCGGL IAAL   + + ++L D      Q
Sbjct: 555 KSFDLDGDGIPENAGAPDQTFDDWKLEGISAYCGGLWIAALEASIAIGAVLLDNPPMNPQ 614

Query: 133 QQFS--------------AKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
            Q S              + LT A  +YH  LW G YY  D+     S  +MADQL G  
Sbjct: 615 LQPSDYPEGIEATLKTHQSWLTLARSIYHQQLWNGQYYSLDTKSG--SVVVMADQLAGQF 672

Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-RDRSAIQSEE 237
           + +  G+    +     +   L TI+ +    F  G +GA NG+K +G   +  +    E
Sbjct: 673 YAQLLGL--PDVVEKEYVQSTLKTIYQSCFLNFHQGKIGAANGVKLDGTGENPQSTHPLE 730

Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
           VWTG+ + L A +++ G  DE    A  + + +Y++ GL F TPE LT   T+R+  Y+R
Sbjct: 731 VWTGINFALGAFLIHSGMRDEGLKLAETIVKQIYDQ-GLQFRTPEALTTSGTFRASHYLR 789

Query: 298 ALAVYAM 304
           A+A++A+
Sbjct: 790 AMAIWAI 796


>gi|170079447|ref|YP_001736085.1| hypothetical protein SYNPCC7002_A2862 [Synechococcus sp. PCC 7002]
 gi|169887116|gb|ACB00830.1| Conserved hypothetical protein (DUF608) [Synechococcus sp. PCC
           7002]
          Length = 803

 Score =  160 bits (405), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 105/310 (33%), Positives = 157/310 (50%), Gaps = 35/310 (11%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N     D + WKDL   F++ +YRD+ +   +D  FL   + + L  +   
Sbjct: 495 PNEHPWEATNYTAYQDCNLWKDLGSDFVLQVYRDFVMTGAEDTEFLWSCWGSVLTALDYL 554

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQ----- 130
           ++FD  G GI +NGG PDQT          AYCGGL IAAL   + M+ +L  +      
Sbjct: 555 KTFDLDGDGIPENGGAPDQTFDDWKLKGISAYCGGLWIAALEAAIAMAKILQQSPSHFEA 614

Query: 131 QQQQFSAK------------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
            Q Q SA             L +A  +Y D+LW GSYY  D+    +S  +MADQLCG  
Sbjct: 615 HQNQVSAAEFEQAIATYQTWLDQARPLYQDILWNGSYYTLDTGS--NSRVVMADQLCGQY 672

Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR-SAIQSEE 237
           + +   ++   + P    A  LATI++     F  G  GA NG+ P+G  ++ +     E
Sbjct: 673 YTQL--LSLPDVNPGDRTATTLATIYTACFEKFHGGQFGAANGLNPDGTPEKENDTHPLE 730

Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
           VWTG+ + ++A M+  G   EA      +   VY+  GL F TPE +T + T+R+  Y+R
Sbjct: 731 VWTGINFGIAALMIRNGMQTEALRMVEAVVEQVYD-NGLQFRTPEAITANGTFRACHYLR 789

Query: 298 ALAVYAMQDA 307
           A+ ++A+ DA
Sbjct: 790 AMGIWAIYDA 799


>gi|440680630|ref|YP_007155425.1| Glucosylceramidase [Anabaena cylindrica PCC 7122]
 gi|428677749|gb|AFZ56515.1| Glucosylceramidase [Anabaena cylindrica PCC 7122]
          Length = 802

 Score =  160 bits (405), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 98/306 (32%), Positives = 157/306 (51%), Gaps = 31/306 (10%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
           P E  +   N     D + WKDL   F++ +YRD+     +D+ FL   +   ++ +   
Sbjct: 504 PNEHVWEKTNYTCYQDCNLWKDLGSDFVLQVYRDFIFTGGQDVEFLADCWDAIVQTLDYL 563

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQ----- 130
           ++FD  G GI +N G PDQT          AYCGGL +AAL   + +  +L + +     
Sbjct: 564 KTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALESAIAICDILTNRRGAENA 623

Query: 131 -----QQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV 185
                Q+  + + L ++  +Y + LW G YY+ DS     S+ +MADQLCG  + +  G+
Sbjct: 624 EKIRSQKSVYQSWLQQSLPIYQEKLWNGQYYRLDSESG--SNVVMADQLCGQFYARLLGL 681

Query: 186 NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTY 244
               I P+ N   AL T++      F  G  GA NG++P+G  +   A    EVWTG+ +
Sbjct: 682 --PDIVPNDNALSALQTVYDACFVKFCNGQFGAANGVRPDGSPENPQATHPLEVWTGINF 739

Query: 245 LLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
            L+A ++  G  +EA      + + +Y+  GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 740 GLAAFLIQMGMKNEAMRLTQAVVQQIYDN-GLQFRTPEAITSAGTFRASTYLRAMAIWAI 798

Query: 305 QDAYLK 310
              YLK
Sbjct: 799 ---YLK 801


>gi|126657976|ref|ZP_01729128.1| hypothetical protein CY0110_05157 [Cyanothece sp. CCY0110]
 gi|126620614|gb|EAZ91331.1| hypothetical protein CY0110_05157 [Cyanothece sp. CCY0110]
          Length = 804

 Score =  160 bits (405), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 102/307 (33%), Positives = 148/307 (48%), Gaps = 35/307 (11%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRD+ L    DI FL   +    E +   
Sbjct: 495 PNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFLLTGADDIEFLWECWEAVPEALNYL 554

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSS-LLNDTQQQQQ 134
           + FD    GI +N G PDQT          AYCG L IAAL   +++   LL +     Q
Sbjct: 555 KEFDLDNDGIPENSGAPDQTFDDWELRGMSAYCGALWIAALEAAIKIGEILLENRPNNPQ 614

Query: 135 FSAK----------------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
              K                L ++  VYH  LW G YYK DS     SD +MADQL G  
Sbjct: 615 LEPKNYPECIEKELDKYQNWLQQSRSVYHSTLWNGEYYKLDSESG--SDVVMADQLSGQF 672

Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEE 237
           + +  G+    +  +     AL  ++      F+ G  GA NGMKP+G   D ++   +E
Sbjct: 673 YARLLGL--PDVVENQYALSALKKVYEACFLKFQNGKYGAANGMKPDGTPEDPNSTHPQE 730

Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
           VWTG+ + L++ ++  G  DEA      + + VYE  GL F TPE +T   T+R+  Y+R
Sbjct: 731 VWTGINFGLASFLIQMGMKDEALKLTEAVVKQVYE-NGLQFRTPEAITAVGTFRACHYLR 789

Query: 298 ALAVYAM 304
           A+A++ +
Sbjct: 790 AMAIWGI 796


>gi|220909879|ref|YP_002485190.1| hypothetical protein Cyan7425_4519 [Cyanothece sp. PCC 7425]
 gi|219866490|gb|ACL46829.1| protein of unknown function DUF608 [Cyanothece sp. PCC 7425]
          Length = 814

 Score =  160 bits (404), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 101/298 (33%), Positives = 151/298 (50%), Gaps = 26/298 (8%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L    D  FLT  +P     +   
Sbjct: 503 PNEHPWEKTNYTSYQDCNLWKDLPADFVILVYRDYLLTGGTDYDFLTTCWPAIHATLAYL 562

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA--------CMVEMSSLLN 127
           ++FD  G GI +N G PDQT          AYCGGL IAAL          +V+   L +
Sbjct: 563 KTFDLDGDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEAAIAIIKLLLVQAPDLPH 622

Query: 128 DTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT 187
             +    +   L ++  +Y   LW G YY+ DS     SD +MADQLCG  + +  G+  
Sbjct: 623 LQESLTTYQTWLAQSRPLYQQTLWNGQYYRLDSGSG--SDVVMADQLCGQFYARLVGL-- 678

Query: 188 EAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYLL 246
             + P      AL  ++ T    F  G++GA NG++P G  ++  A    EVWTG+ + L
Sbjct: 679 PDVVPEDCARAALDMVYQTCFLQFHNGTLGAANGLRPGGLAEKPDATHPLEVWTGINFGL 738

Query: 247 SAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           +A ++  G   EA      +   +YE  GL F TPE +T ++T+R+G Y+R +A++A+
Sbjct: 739 AAFLVQMGRKAEAMQLTAAVVYQIYEH-GLQFRTPEAITAEQTFRAGMYLRPMAIWAI 795


>gi|428317101|ref|YP_007114983.1| Glucosylceramidase [Oscillatoria nigro-viridis PCC 7112]
 gi|428240781|gb|AFZ06567.1| Glucosylceramidase [Oscillatoria nigro-viridis PCC 7112]
          Length = 809

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 100/299 (33%), Positives = 154/299 (51%), Gaps = 27/299 (9%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRD+ L    D  FL   +P  +E +   
Sbjct: 508 PNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDFLLTGGTDCEFLQECWPAIVEALAYL 567

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQ--- 132
           ++FD+ G GI +N G PDQT          AYCGGL +AAL   + +  +L D   +   
Sbjct: 568 KTFDEDGDGIPENSGAPDQTFDDWQMRGISAYCGGLWLAALEAAIAIGEILEDAAPEIIP 627

Query: 133 ------QQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVN 186
                   + + L +A   Y + LW G YY+ DS    +S+ +MADQLCG  + K  G+ 
Sbjct: 628 NPQAKIALYHSWLLQARPSYQEKLWNGQYYRLDSES--NSEVVMADQLCGQFYAKLLGLP 685

Query: 187 TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYL 245
              I P      AL T++ +    F  G  GA NG+  NG   + +A    EVWTG+ + 
Sbjct: 686 D--IVPPECAVSALETVYESCFLKFNDGEFGAANGVMLNGSPENPNATHPLEVWTGINFG 743

Query: 246 LSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           L+A ++  G  ++A+     + + +YE  GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 744 LAAFLVQMGMEEKAFKLTDAVVKQIYE-NGLQFRTPEAITAGGTFRASHYLRAMAIWAI 801


>gi|428200465|ref|YP_007079054.1| putative bile acid beta-glucosidase [Pleurocapsa sp. PCC 7327]
 gi|427977897|gb|AFY75497.1| putative bile acid beta-glucosidase [Pleurocapsa sp. PCC 7327]
          Length = 808

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 105/307 (34%), Positives = 152/307 (49%), Gaps = 35/307 (11%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRD+ L   KD+ FL   + + +E +   
Sbjct: 496 PNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFVLTGAKDVAFLWECWQSIVETLAYL 555

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQ--- 132
           + FD+ G GI +N G PDQT          AYCGGL IAAL   + +  +L +       
Sbjct: 556 KKFDRDGDGIPENSGAPDQTFDDWRLRGISAYCGGLWIAALEAAIAIGKILIENPPSNPD 615

Query: 133 --------------QQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
                         Q +   L  A  +YH+ LW G YY  DS     SD +MADQLCG  
Sbjct: 616 LELENYPQSLQNTIQTYQNWLQGARSLYHEKLWNGEYYHLDSESG--SDIVMADQLCGQF 673

Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-RDRSAIQSEE 237
           + +  G+    +        AL  ++      F+ G  GA NG+KP+G   D  A    E
Sbjct: 674 YARLLGLPD--VVERQYAESALKKVYEACFLKFQDGKFGAANGVKPDGSPEDPKATHPLE 731

Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
           VWTG+ + L+A +L  G  DEA+  A  + R VY+  GL F TPE +T   T+R+  Y+R
Sbjct: 732 VWTGINFGLAAFLLQMGMKDEAFKLAETVVRQVYD-NGLQFRTPEAITAVGTFRASHYLR 790

Query: 298 ALAVYAM 304
           A+A++ +
Sbjct: 791 AMAIWGI 797


>gi|119510228|ref|ZP_01629365.1| hypothetical protein N9414_13977 [Nodularia spumigena CCY9414]
 gi|119465077|gb|EAW45977.1| hypothetical protein N9414_13977 [Nodularia spumigena CCY9414]
          Length = 814

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 102/298 (34%), Positives = 157/298 (52%), Gaps = 26/298 (8%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E  +   N     D + WKDL   F++ +YRD+ L    D++FL   +   ++ +   
Sbjct: 504 PNEHVWEKTNYTCYQDCNLWKDLGSDFVLQVYRDFLLTGADDVQFLADCWAGIVQTLDYL 563

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL-------ND 128
           +SFD  G GI +N G PDQT          AYCGGL +AAL   + +S +L        D
Sbjct: 564 KSFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALEAAIAISDILLTNHRGAED 623

Query: 129 TQQ-QQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT 187
           T++  Q++   L ++  +Y + LW G Y++ DS     SD +MADQLCG  + +  G+  
Sbjct: 624 TEELGQRYRVWLAQSRTIYQEKLWNGQYFRLDSESG--SDVVMADQLCGQFYARLLGL-- 679

Query: 188 EAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLL 246
             I PS     AL T++      F  G  GA NG++P+G   + +A    EVWTG+ + L
Sbjct: 680 PDIVPSDRALSALQTVYHACFVKFCNGEFGAANGVRPDGSAENPNATHPLEVWTGINFGL 739

Query: 247 SAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           +A ++  G  DEA    G + + +Y   GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 740 AAFLVQMGMQDEALKLTGAVVQQIY-HNGLQFRTPEAITASGTFRASTYLRAMAIWAI 796


>gi|428770169|ref|YP_007161959.1| glucosylceramidase [Cyanobacterium aponinum PCC 10605]
 gi|428684448|gb|AFZ53915.1| Glucosylceramidase [Cyanobacterium aponinum PCC 10605]
          Length = 802

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 103/307 (33%), Positives = 147/307 (47%), Gaps = 35/307 (11%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHK--DIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY      D  FL   + + +E +   
Sbjct: 494 PNEHPWEKSNYTSYQDCNLWKDLGCDFVLLVYRDYLFTGGLDQDFLWECWDSVVETLHYV 553

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL------NDT 129
           ++FD  G GI +N G PDQT          AYCGGL IA L   + + + L      N +
Sbjct: 554 KTFDLDGDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAGLEAAIAIGNTLIANPPMNPS 613

Query: 130 QQQQQ-----------FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
            Q              F   LT+A  +YH+ LW G YY+ DS     SD +MADQLCG  
Sbjct: 614 LQPDDYPQGIKDAIAVFQQWLTQARSLYHETLWNGEYYRLDSESG--SDVVMADQLCGQF 671

Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEE 237
           + +  G+    +  S  +   L  ++      F  G  G  NG+KP+G     +     E
Sbjct: 672 YAQLLGLPD--VVESKYVKSTLRKVYDACFLKFHDGKFGIANGVKPDGKPVKENDTHPLE 729

Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
           VWTG+ Y + A M+  G   E    A  + + VYE  GL F TPE +T   T+R+  Y+R
Sbjct: 730 VWTGINYGIVAFMILNGMKQEGLRVAETVVKQVYE-NGLQFRTPEAITAVGTFRASHYLR 788

Query: 298 ALAVYAM 304
           A+AV+A+
Sbjct: 789 AMAVWAI 795


>gi|254431466|ref|ZP_05045169.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
 gi|197625919|gb|EDY38478.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
          Length = 837

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 98/297 (32%), Positives = 144/297 (48%), Gaps = 25/297 (8%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
           P E P+   N     D + WKDL   F++ ++R + L    +D+RFL   +P  +E +R 
Sbjct: 527 PNERPWDATNYTAYQDCNLWKDLASDFVLQVWRTFRLAPTGEDLRFLADCWPAAVEALRY 586

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSL------LND 128
            + FD    G+  NGG PDQT          AYCG L IAAL   + M         L+ 
Sbjct: 587 LKGFDANSDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAMGQRLQLELGLDT 646

Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
           + QQ+ FS+ L ++   +  LLW G YY  D+     +  +MADQLCG  + +   +   
Sbjct: 647 SSQQRDFSSWLEQSRANFDRLLWNGEYYNIDADSG--TPVVMADQLCGDFYARL--LELP 702

Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
            +        AL  I      GF+ GS+G  NG++ +G   D       EVWTG+ + L+
Sbjct: 703 PVVAEERARSALRAIREACFEGFQGGSLGVANGLRRDGTPLDPDGTHPLEVWTGINFGLA 762

Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           A     G  D A    G +   VY   GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 763 AYYRLMGETDTALAITGAVVHQVYG-GGLQFRTPEAITAVNTFRACHYLRAMAIWAL 818


>gi|75907319|ref|YP_321615.1| hypothetical protein Ava_1096 [Anabaena variabilis ATCC 29413]
 gi|75701044|gb|ABA20720.1| Protein of unknown function DUF608 [Anabaena variabilis ATCC 29413]
          Length = 804

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 103/301 (34%), Positives = 153/301 (50%), Gaps = 29/301 (9%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E  +   N  +  D + WKDL   F++ +YRD+ L    D++FL   +   +E +   
Sbjct: 504 PNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFLLTGADDVQFLRDCWDAIVETLDYV 563

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
           ++FD  G GI +N G PDQT          AYCGGL +AALA  + +S +L    Q  + 
Sbjct: 564 KTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWMAALAAAIAISDILLQNHQDSET 623

Query: 136 SAKL-----------TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASG 184
             KL           TK+  +Y + LW G YY+ DS     SD +MADQLCG  +  A+ 
Sbjct: 624 KEKLLLQKSTYETWLTKSLPIYQEKLWNGKYYRLDSESG--SDVVMADQLCGQFY--ANL 679

Query: 185 VNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVT 243
           +    I PS     AL T++      F  G  GA NG++P+G  +   A    EVWTG+ 
Sbjct: 680 LELPDIVPSDRAISALQTVYDACFLKFYDGQFGAANGVRPDGSPENPKATHPLEVWTGIN 739

Query: 244 YLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYA 303
           + L+A ++  G  DE +     +   +Y   GL F TPE +T   T+R+  Y+RA+A++A
Sbjct: 740 FGLAAFLVQMGMKDEGFRLTQAVVAQIY-NNGLQFRTPEAITAAGTFRASTYLRAMAIWA 798

Query: 304 M 304
           +
Sbjct: 799 I 799


>gi|254410084|ref|ZP_05023864.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196183120|gb|EDX78104.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 814

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 99/304 (32%), Positives = 143/304 (47%), Gaps = 34/304 (11%)

Query: 30  EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKCES 87
           E P+   N  +  D + WKDL   F++ +YR +     KDI FL   +   +  +   +S
Sbjct: 509 EHPWEKTNYTSYQDCNQWKDLPCDFVLQVYRTFVFTGSKDIEFLAECWSAIVRALAYLKS 568

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGG----------------LHIAALACMVE 121
           FD  G GI +N G PDQT          AYCGG                L    L   + 
Sbjct: 569 FDHDGDGIPENSGAPDQTFDDWQLQGVSAYCGGLWLAALEAASAIAQILLDNYTLVSAIN 628

Query: 122 MSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLK 181
           +    +       +   L +A  +Y + LW G YY+ DS     SD +MADQLCG  + +
Sbjct: 629 LPDPESIQHTLNTYHKWLQQAQPIYQEKLWNGQYYRLDSDSD--SDVVMADQLCGQFYAQ 686

Query: 182 ASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWT 240
             G+    I P      AL T++ +    F     GA NG+KP+G  +   A    EVWT
Sbjct: 687 LLGL--PDIVPPECTQSALKTVYESCFLKFHNAQFGAANGVKPDGSPEHPDATHPLEVWT 744

Query: 241 GVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALA 300
           G+ + L+A ++  G  DEA      + R VYE  GL F TPE +T   T+R+G Y+RA+A
Sbjct: 745 GINFGLAAFLIQLGMKDEALKLTESVVRQVYE-NGLQFRTPEAITPVGTFRAGHYLRAMA 803

Query: 301 VYAM 304
           ++A+
Sbjct: 804 IWAV 807


>gi|443325334|ref|ZP_21054034.1| putative bile acid beta-glucosidase [Xenococcus sp. PCC 7305]
 gi|442795063|gb|ELS04450.1| putative bile acid beta-glucosidase [Xenococcus sp. PCC 7305]
          Length = 808

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 101/311 (32%), Positives = 157/311 (50%), Gaps = 43/311 (13%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHK--DIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY   +  +  FL   + + +  +   
Sbjct: 499 PNEHPWEATNYTSYQDCNQWKDLPSDFVLQVYRDYLFTEGDNTEFLWECWDSVVCTLAYL 558

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL--NDTQQQQ 133
           + FD+   GI +N G PDQT          +YCGGL IAAL   +++  +L  N T+  +
Sbjct: 559 KEFDEDNDGIPENSGAPDQTFDDWRLQGISSYCGGLWIAALEAAIKIGEILIANPTKNPE 618

Query: 134 QFSAK---------------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
             +A                L ++ ++YHD LW G YY+ DS    +S+ +MADQLCG  
Sbjct: 619 LQTADFPQGIEKTISDYREWLQRSRELYHDTLWNGEYYRLDSKS--NSEVVMADQLCGQF 676

Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSE-- 236
           + +  G+    +  +     AL+ I+      F  G  GA NG+ P+G    SA+  +  
Sbjct: 677 YARLLGLPD--VVENQYTKSALSKIYDACFLKFHDGKFGAANGVLPDG----SAVNPDDT 730

Query: 237 ---EVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSG 293
              EVWTG+ + L+A M+  G  DEA+     + R VYE  GL F TPE +T   T+R+ 
Sbjct: 731 HPLEVWTGINFGLAAFMVQMGMKDEAFRLTETVVRQVYE-NGLQFRTPEAITAVGTFRAS 789

Query: 294 GYMRALAVYAM 304
            Y+RA+A++ +
Sbjct: 790 HYLRAMAIWGV 800


>gi|307151416|ref|YP_003886800.1| glucosylceramidase [Cyanothece sp. PCC 7822]
 gi|306981644|gb|ADN13525.1| Glucosylceramidase [Cyanothece sp. PCC 7822]
          Length = 805

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 103/307 (33%), Positives = 155/307 (50%), Gaps = 35/307 (11%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YR +      +  +L   + +  E +   
Sbjct: 495 PNEHPWEKTNYTSYQDCNLWKDLGSDFVLLLYRSFLWTGGNNQDYLWEHWHSVTETLAYL 554

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL-----NDTQ 130
           ++FD  G GI +N G PDQT          AYCG L IAAL   ++M  +L      + Q
Sbjct: 555 KAFDLDGDGIPENSGAPDQTFDDWQLRGISAYCGALWIAALEAAIKMGEILLQRPPMNPQ 614

Query: 131 QQQQFSAK------------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
            Q Q +A+            L +A  +YHD LW G YY+ DS     SD +M+DQLCG  
Sbjct: 615 LQPQNAAESIEEAITTYRHWLQQARSLYHDTLWNGEYYRLDSESG--SDVVMSDQLCGQF 672

Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEE 237
           + +  G+  + + P    + AL  I+      F  G  GA NG+KP+G   +  A    E
Sbjct: 673 YARLLGL-PDVVEPQYTTS-ALRKIYEACFLKFHGGKYGAANGVKPDGTPENPDATHPLE 730

Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
           VWTG+ + L+A M+  G  +EA+     + + VYE  GL F TPE +T   T+R+  Y+R
Sbjct: 731 VWTGINFGLAAFMIQMGMKEEAFKITEAVVKQVYE-NGLQFRTPEAITAVGTFRASHYLR 789

Query: 298 ALAVYAM 304
           A+A++A+
Sbjct: 790 AMAIWAV 796


>gi|427716943|ref|YP_007064937.1| glucosylceramidase [Calothrix sp. PCC 7507]
 gi|427349379|gb|AFY32103.1| Glucosylceramidase [Calothrix sp. PCC 7507]
          Length = 800

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 101/299 (33%), Positives = 154/299 (51%), Gaps = 27/299 (9%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E  +   N     D + WKDL   F++ +YRD+ L    D+ FLT  +P  +E +   
Sbjct: 499 PNEHVWEKTNYTCYQDCNLWKDLGSDFVLQVYRDFLLTGADDVEFLTDCWPAIVETLNYL 558

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL--------- 126
           ++FD+   GI +N G PDQT          AYCGGL +AAL   + +S++L         
Sbjct: 559 KTFDQDDDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALEAAIAISNILLNQNLADLG 618

Query: 127 NDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVN 186
           N   Q+  +   L ++  +Y + LW G YY+ DS     SD +MADQLCG  + +  G+ 
Sbjct: 619 NLALQKSIYETWLAQSRPIYEEKLWNGQYYRLDSKSG--SDVVMADQLCGQFYARLLGL- 675

Query: 187 TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYL 245
              I  S     AL T++      F  G  GA NG++P+G  +   A    EVWTG+ + 
Sbjct: 676 -PDIVASDRALSALQTVYEACFLKFYDGQFGAANGVRPDGSPENPQATHPLEVWTGINFG 734

Query: 246 LSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           L+A ++  G  DEA      + + +Y+  GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 735 LAAFLVQMGMQDEALKLTQAVVKQIYD-NGLQFRTPEAITTVGTFRASTYLRAMAIWAI 792


>gi|222639789|gb|EEE67921.1| hypothetical protein OsJ_25789 [Oryza sativa Japonica Group]
          Length = 919

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 102/287 (35%), Positives = 146/287 (50%), Gaps = 36/287 (12%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N Y + D   WKDLN KF++ +YRD     +  F    +P     +   + FD+
Sbjct: 593 DPWFEVNAYMLHDPVRWKDLNPKFVLQVYRDVVATGNAGFAEAAWPAVYLAMAYMDQFDR 652

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G G+V+N G PDQT          AY GGL +AAL     M+ ++ D   +  F  +  
Sbjct: 653 DGDGMVENEGRPDQTYDLWSVSGVSAYTGGLWVAALQAAAAMAGIVGDGAAEAYFRGRYH 712

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           +A +VY D LW G Y+ +D+S    S SI ADQL G  + +A G+  E I       RAL
Sbjct: 713 RARRVYTDELWNGGYFNYDNSGGATSSSIQADQLAGQWYARACGL--EPIVDGDKARRAL 770

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
           AT+   NV             M+           S+EV  GVTY ++AAM++EG  + A+
Sbjct: 771 ATVLDYNV-------------MR---------THSKEVCPGVTYAVAAAMIHEGMPEAAF 808

Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
            TA G++   + + G G  F+TPE  T D  YR+  YMR L V+AMQ
Sbjct: 809 KTAKGIHDAGWGKHGFGYAFQTPESWTADGGYRALHYMRPLGVWAMQ 855


>gi|16330256|ref|NP_440984.1| hypothetical protein sll1775 [Synechocystis sp. PCC 6803]
 gi|383321997|ref|YP_005382850.1| hypothetical protein SYNGTI_1088 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383325166|ref|YP_005386019.1| hypothetical protein SYNPCCP_1087 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383491050|ref|YP_005408726.1| hypothetical protein SYNPCCN_1087 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384436317|ref|YP_005651041.1| hypothetical protein SYNGTS_1088 [Synechocystis sp. PCC 6803]
 gi|451814414|ref|YP_007450866.1| hypothetical protein MYO_110980 [Synechocystis sp. PCC 6803]
 gi|1652745|dbj|BAA17664.1| sll1775 [Synechocystis sp. PCC 6803]
 gi|339273349|dbj|BAK49836.1| hypothetical protein SYNGTS_1088 [Synechocystis sp. PCC 6803]
 gi|359271316|dbj|BAL28835.1| hypothetical protein SYNGTI_1088 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359274486|dbj|BAL32004.1| hypothetical protein SYNPCCN_1087 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359277656|dbj|BAL35173.1| hypothetical protein SYNPCCP_1087 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|407958168|dbj|BAM51408.1| hypothetical protein BEST7613_2477 [Bacillus subtilis BEST7613]
 gi|451780383|gb|AGF51352.1| hypothetical protein MYO_110980 [Synechocystis sp. PCC 6803]
          Length = 817

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 104/303 (34%), Positives = 148/303 (48%), Gaps = 31/303 (10%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDI--RFLTRVYPTCLELIRKC 85
           P E P+   N     D + WKDL   F++ +YRD+ L  D    FL   +P+ +  +   
Sbjct: 495 PNEHPWERSNYTAYQDCNLWKDLGSDFVLLVYRDFLLLPDADGEFLGECWPSIVAALDYL 554

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAAL-ACMVEMSSLLNDTQQQQQ 134
           + FD    GI +N G PDQT          AYCGGL IAAL A +V    LL +  QQ  
Sbjct: 555 KGFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEAALVMGQYLLENPPQQTD 614

Query: 135 FSAK------------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKA 182
             A             L +A  +Y D LW G YY  DS     S  +MADQLCG  +  A
Sbjct: 615 LDAMEIQTAIARYETWLAQARSLYQDTLWNGEYYNLDSQSG--SKVVMADQLCGQFY--A 670

Query: 183 SGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTG 241
             +N   +       +ALA ++ T    F  G +GA NG+ P+G  ++ +     EVWTG
Sbjct: 671 RLLNLPDVVEHHYAEKALAKVYDTCFLKFAHGELGAANGLLPDGSPQNPNDTHPLEVWTG 730

Query: 242 VTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAV 301
           + + L+A ++  G  +EA      +   VY   GL F TPE +T   T+R+  Y+RA+A+
Sbjct: 731 INFGLAAFLIQMGMKEEALKMTETVVNQVY-NNGLQFRTPEAITAVGTFRASHYLRAMAI 789

Query: 302 YAM 304
           +A+
Sbjct: 790 WAV 792


>gi|434407175|ref|YP_007150060.1| putative bile acid beta-glucosidase [Cylindrospermum stagnale PCC
           7417]
 gi|428261430|gb|AFZ27380.1| putative bile acid beta-glucosidase [Cylindrospermum stagnale PCC
           7417]
          Length = 816

 Score =  157 bits (396), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 97/301 (32%), Positives = 149/301 (49%), Gaps = 29/301 (9%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E  +   N     D + WKDL   F++ +YRD+      D+ FL   +   ++ +   
Sbjct: 512 PNEHVWEKTNYTCYQDCNLWKDLGSDFVLQVYRDFLFTGADDVEFLADCWDAIVQTLDYL 571

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL--------- 126
           + FDK G GI +N G PDQT          AYCGGL +AAL C + +  +L         
Sbjct: 572 KGFDKDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALECAIAICDILLTNRQDAKN 631

Query: 127 --NDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASG 184
             N   Q+  + A L ++  VY + LW G YY+ DS     ++ +MADQLCG  + +  G
Sbjct: 632 AKNLETQRSIYGAWLAQSLTVYEEKLWNGQYYRLDSESG--TEVVMADQLCGQFYARLLG 689

Query: 185 VNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVT 243
           +    I        AL T++      F  G  GA NG++P+G   + +A    EVWTG+ 
Sbjct: 690 L--PDIVKCDRALSALQTVYDACFLKFFDGQFGAANGVRPDGVPENPNATHPLEVWTGIN 747

Query: 244 YLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYA 303
           + L+A ++  G  DEA      +   +Y+  GL F TPE +T   T+R+  Y+R +A++A
Sbjct: 748 FGLAAFLVQMGMQDEALRLTQAVVSQIYD-NGLQFRTPEAITTTGTFRASTYLRPMAIWA 806

Query: 304 M 304
           +
Sbjct: 807 I 807


>gi|298490121|ref|YP_003720298.1| glucosylceramidase ['Nostoc azollae' 0708]
 gi|298232039|gb|ADI63175.1| Glucosylceramidase ['Nostoc azollae' 0708]
          Length = 806

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 92/288 (31%), Positives = 148/288 (51%), Gaps = 31/288 (10%)

Query: 43  DVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGG 100
           D + WKDL+  F++ +YRD+      D++FL   +   ++ +   ++FD  G GI +N G
Sbjct: 519 DCNLWKDLSSDFVLQVYRDFLFTGSHDVQFLVDCWDAIVQTLDYLKTFDLDGDGIPENSG 578

Query: 101 FPDQT----------AYCGGLHIAALACMVEMSSLLNDTQ-------------QQQQFSA 137
            PDQT          AYCGGL +AAL   + ++ +L + +             Q+  + +
Sbjct: 579 APDQTFDDWLLNGVSAYCGGLWLAALESAIAIADILTNHRGTENTEKTDKLRSQKSIYQS 638

Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIA 197
            L ++  +Y   LW G YY+ DS     SD +MADQLCG  + +  G+    I PS +  
Sbjct: 639 WLQQSRPIYEQKLWNGQYYRLDSESG--SDVVMADQLCGQFYTRLLGL--PDIVPSNHAH 694

Query: 198 RALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYLLSAAMLYEGNV 256
            AL T++      F  G  GA NG+ P+G  +   A    EVWTG+ + L+A ++  G  
Sbjct: 695 SALQTVYEACFVKFYDGKFGAANGVLPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMK 754

Query: 257 DEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           +EA      +   +Y+  GL F TPE +T   T+R+  Y+R +A++A+
Sbjct: 755 NEAMQLTQAVVEQIYDN-GLQFRTPEAITPVGTFRASTYLRPMAIWAI 801


>gi|434389621|ref|YP_007100232.1| putative bile acid beta-glucosidase [Chamaesiphon minutus PCC 6605]
 gi|428020611|gb|AFY96705.1| putative bile acid beta-glucosidase [Chamaesiphon minutus PCC 6605]
          Length = 818

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 98/305 (32%), Positives = 150/305 (49%), Gaps = 38/305 (12%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N     D + WKDL   F++ +YR Y     KD+ FL   +   ++ +   
Sbjct: 518 PNEHPWEKTNYTCYQDCNLWKDLGSDFVLQVYRAYIFTGKKDLEFLAECWTAIVDTLAYL 577

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQ-- 133
           + FD    GI++N G PDQT          AYCGGL +AAL   + + ++L  TQQ    
Sbjct: 578 KQFDLDNDGIIENSGAPDQTFDDWRLQGISAYCGGLWLAALEAAISIGNIL--TQQYHGE 635

Query: 134 ----------QFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKAS 183
                     Q+ A L     +Y   LW GSYY+ D+     S+ +MADQLCG  + +  
Sbjct: 636 LTPKPAEVIPQYQAWLDSGKPIYQQQLWNGSYYRIDTGSG--SEVVMADQLCGQFYTQL- 692

Query: 184 GVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMK----PNGDRDRSAIQSEEVW 239
            +  + I P      AL T++ +    F+ G  GA NG+K    P+  +D   +   EVW
Sbjct: 693 -LELDDIVPPDCAKTALQTVYQSCFLNFQGGKFGAANGVKIDGSPSNPKDTHPL---EVW 748

Query: 240 TGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRAL 299
           TG+ + ++A ++  G  +EA   A  +   VY   GL F TPE +T   T+R+  Y+RA+
Sbjct: 749 TGINFGIAAFLVQMGMQEEALKLAETIVEQVYS-NGLQFRTPEAITSAGTFRASHYLRAM 807

Query: 300 AVYAM 304
           A++ +
Sbjct: 808 AIWGI 812


>gi|81299663|ref|YP_399871.1| hypothetical protein Synpcc7942_0854 [Synechococcus elongatus PCC
           7942]
 gi|81168544|gb|ABB56884.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 798

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 96/293 (32%), Positives = 148/293 (50%), Gaps = 21/293 (7%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N     D + WKDL   F++ ++RDY L    D  FL+  +P  +  +R  
Sbjct: 496 PNESPWLQTNYTGYQDCNLWKDLGCDFVLQVWRDYQLTGSSDRGFLSDCWPAAVAALRYL 555

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLN---DTQQQ 132
           + FD  G GI +N G PDQT          AYCGGL IAAL   + ++ +L    + + +
Sbjct: 556 KDFDLDGDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEAALAIADVLELSAEDRDR 615

Query: 133 QQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFP 192
           Q F + L +A  +YHD LW G YY+ DS     S  +MADQLCG  + +   +    + P
Sbjct: 616 QDFQSWLAQARSLYHDTLWNGRYYQLDSGSG--SQVVMADQLCGDFYSRL--LQLPPVAP 671

Query: 193 SANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAML 251
                     ++    + F +G  G  NG+ P+G   D       EVWTG+ + ++A  L
Sbjct: 672 LEAAQSTADMVYEACFQKFHSGQFGLANGLLPDGSPVDPKGTHPLEVWTGINFGIAAYWL 731

Query: 252 YEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
             G+ D  +     + R +Y+  GL F TPE +T + T+R+  Y+R +A++A+
Sbjct: 732 LLGHRDRCFEVTETVIRQIYD-NGLQFRTPEAITANATFRASHYLRPMAIWAV 783


>gi|428209901|ref|YP_007094254.1| glucosylceramidase [Chroococcidiopsis thermalis PCC 7203]
 gi|428011822|gb|AFY90385.1| Glucosylceramidase [Chroococcidiopsis thermalis PCC 7203]
          Length = 812

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 103/311 (33%), Positives = 155/311 (49%), Gaps = 33/311 (10%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E  +   N     D + WKDL   F++ +YRDY L    D+ FL   +   ++ +   
Sbjct: 506 PNEHVWEKTNYTAYQDCNLWKDLPSDFVLQVYRDYLLTGATDVEFLASCWEAIVQALNYL 565

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSS-LLNDTQQQQ- 133
           ++FDK G GI +N G PDQT          AYCGGL +AAL   + +   LLN+ ++   
Sbjct: 566 KTFDKDGDGIPENSGAPDQTFDDWKLQGVSAYCGGLWLAALEAAIAIGKILLNNVERLHV 625

Query: 134 -------------QFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFL 180
                         + + L+++  +Y + LW G YY+ DS+    SD +MADQLCG  + 
Sbjct: 626 TSLHVTSLQETIDTYQSWLSRSRPIYQEKLWNGQYYRLDSNSG--SDVVMADQLCGQFYA 683

Query: 181 KASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVW 239
           +   +    I P      AL TI+      F  G  GA NG+KPNG   + +A    EVW
Sbjct: 684 RL--LKLPDIVPQECALSALKTIYDACFLKFHQGKFGAANGLKPNGSPENPNATHPLEVW 741

Query: 240 TGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRAL 299
           TG+ + L+A +L  G   EA      +   +YE  GL F TPE +T   T+R+  Y+RA+
Sbjct: 742 TGINFGLAAFLLQMGMKAEALQLTETVVNQIYE-NGLQFRTPEAITSVGTFRASYYLRAM 800

Query: 300 AVYAMQDAYLK 310
           A++A+   + +
Sbjct: 801 AIWAIYGVWFE 811


>gi|7470943|pir||T31071 conserved hypothetical protein - Anabaena sp. (fragment)
 gi|2828799|gb|AAC38131.1|AAC38131 unknown [Nostoc sp. PCC 7120]
          Length = 439

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 101/301 (33%), Positives = 152/301 (50%), Gaps = 29/301 (9%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E  +   N  +  D + WKDL   F++ +YRD+ L    D++FL   +   +E +   
Sbjct: 139 PNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFLLTGADDVQFLRDCWDAIVETLDYV 198

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQ----- 130
           ++FD  G GI +N G PDQT          AYCGGL +AALA  + +S +L         
Sbjct: 199 KTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALAAAIAISDILLQNHQDSAT 258

Query: 131 ------QQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASG 184
                 Q+  +   LTK+  +Y + LW G YY+ DS     SD +MADQLCG  +  A+ 
Sbjct: 259 KGHLLYQKSTYETWLTKSLPIYQEKLWNGKYYRLDSESG--SDVVMADQLCGQFY--ANL 314

Query: 185 VNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVT 243
           +    I PS     AL T++      F  G  GA NG++P+G  +   A    EVWTG+ 
Sbjct: 315 LELPDIVPSDRAISALQTVYDACFLKFYDGQFGAANGVRPDGSPENPKATHPLEVWTGIN 374

Query: 244 YLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYA 303
           + L+A ++  G  DE +     +   +Y   GL F TPE +T   T+R+  Y+R +A++A
Sbjct: 375 FGLAAFLVQMGMKDEGFRLTQAVVEQIY-NNGLQFRTPEAITAAGTFRASTYLRPMAIWA 433

Query: 304 M 304
           +
Sbjct: 434 I 434


>gi|17230308|ref|NP_486856.1| hypothetical protein alr2816 [Nostoc sp. PCC 7120]
 gi|17131910|dbj|BAB74515.1| alr2816 [Nostoc sp. PCC 7120]
          Length = 804

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 101/301 (33%), Positives = 152/301 (50%), Gaps = 29/301 (9%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E  +   N  +  D + WKDL   F++ +YRD+ L    D++FL   +   +E +   
Sbjct: 504 PNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFLLTGADDVQFLRDCWDAIVETLDYV 563

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQ----- 130
           ++FD  G GI +N G PDQT          AYCGGL +AALA  + +S +L         
Sbjct: 564 KTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALAAAIAISDILLQNHQDSAT 623

Query: 131 ------QQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASG 184
                 Q+  +   LTK+  +Y + LW G YY+ DS     SD +MADQLCG  +  A+ 
Sbjct: 624 KEHLLYQKSTYETWLTKSLPIYQEKLWNGKYYRLDSESG--SDVVMADQLCGQFY--ANL 679

Query: 185 VNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVT 243
           +    I PS     AL T++      F  G  GA NG++P+G  +   A    EVWTG+ 
Sbjct: 680 LELPDIVPSDRAISALQTVYDACFLKFYDGQFGAANGVRPDGSPENPKATHPLEVWTGIN 739

Query: 244 YLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYA 303
           + L+A ++  G  DE +     +   +Y   GL F TPE +T   T+R+  Y+R +A++A
Sbjct: 740 FGLAAFLVQMGMKDEGFRLTQAVVEQIY-NNGLQFRTPEAITAAGTFRASTYLRPMAIWA 798

Query: 304 M 304
           +
Sbjct: 799 I 799


>gi|147812668|emb|CAN61857.1| hypothetical protein VITISV_016690 [Vitis vinifera]
          Length = 521

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 100/294 (34%), Positives = 146/294 (49%), Gaps = 25/294 (8%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN++D   WKDLN KF++ +YRD     D  F   V+P     I   + FDK
Sbjct: 184 DPWFELNAYNLYDTDRWKDLNSKFVLQVYRDMVATGDKNFARAVWPAVYIAIAFLDQFDK 243

Query: 91  QGLGIVQNGGFPDQTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHD-- 148
            G G+++N G P Q A               M+  + D+     F  K  KA  VY    
Sbjct: 244 DGDGMIENDGLPLQAASA-------------MAREVGDSMTADYFWFKFQKAKAVYDKDQ 290

Query: 149 ---LLWTG---SYYKFDSSQSRHS-DSIMADQLCGYLFLKASGVNTEAIFPSANIARALA 201
              ++  G     ++F  + S    D +  D    +    A     + I        AL 
Sbjct: 291 LAAVIGEGRLLXLFRFLGNPSHTGIDGLFQDHR-DWTTRYARACGLQPIVDDEKARSALE 349

Query: 202 TIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWT 261
            +++ NV   + G  GAVNGM P+G  D SA+QS E+W GVTY ++A M++EG V+ A+ 
Sbjct: 350 KVYNFNVLKVKEGKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETAFN 409

Query: 262 TAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
           TA G+Y   + + GLG  F+TPE    D+ YRS  YMR LA++AMQ A  K ++
Sbjct: 410 TASGIYDAAWSQEGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALSKPEL 463


>gi|428299644|ref|YP_007137950.1| glucosylceramidase [Calothrix sp. PCC 6303]
 gi|428236188|gb|AFZ01978.1| Glucosylceramidase [Calothrix sp. PCC 6303]
          Length = 804

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 99/298 (33%), Positives = 150/298 (50%), Gaps = 26/298 (8%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E  +   N  +  D + WKDL   F++ +YRD+     KDI FL   +      +   
Sbjct: 502 PNEHVWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFVFTGSKDIEFLAECWDAIALTLEYL 561

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSS-LLNDTQQQQQ 134
           ++FDK   GI +N G PDQT          AYCGGL +AAL   + +   LL +   Q++
Sbjct: 562 KAFDKDHDGIPENSGAPDQTFDDWKLQGVSAYCGGLWLAALEAAIAICDILLENHGDQEK 621

Query: 135 FSAK-------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT 187
            +A+       L K+  +Y   LW G YY+ DS     SD +MADQLCG  + +   +  
Sbjct: 622 LTAQKIIYTDWLAKSRPIYQQKLWNGQYYRLDSESG--SDVVMADQLCGQFYARL--LKL 677

Query: 188 EAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYLL 246
             I P      A+ T++      F  G  GA NG++P+G  +   A    EVWTG+ + L
Sbjct: 678 ADIVPHDCAISAVRTVYEACFINFNDGKFGAANGLRPDGSPENPKATHPLEVWTGINFGL 737

Query: 247 SAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           +A ++  G  DEAW     + + +Y+  GL F TPE +T   T+R+  Y+RA+A++ +
Sbjct: 738 AAFLMQMGMQDEAWKLTEVVVQQIYD-NGLQFRTPEAITATGTFRACTYLRAMAIWGI 794


>gi|427709805|ref|YP_007052182.1| glucosylceramidase [Nostoc sp. PCC 7107]
 gi|427362310|gb|AFY45032.1| Glucosylceramidase [Nostoc sp. PCC 7107]
          Length = 802

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 99/298 (33%), Positives = 150/298 (50%), Gaps = 26/298 (8%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E  +   N     D + WKDL+  F++ +YRDY L    D+ FL   +   ++ +   
Sbjct: 504 PNEHVWEKTNYTCYQDCNLWKDLSCDFVLQVYRDYLLTGADDVEFLADCWDAVVQTLDYL 563

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL--------N 127
           ++FDK G GI +N G PDQT          AYCGGL +AAL   + +S +L        +
Sbjct: 564 KTFDKDGDGIPENSGAPDQTFDDWRLLGVSAYCGGLWLAALEAAIAISDILITNHKVVES 623

Query: 128 DTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT 187
             +Q+  +   L ++  +Y + LW G YY+ DS     S  +MADQLCG  + +   +  
Sbjct: 624 AEKQKSIYEVWLAQSRPIYQEKLWNGQYYRLDSESG--SAVVMADQLCGQFYARL--LEL 679

Query: 188 EAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYLL 246
             I PS     AL T++      F+ G  GA NG+ P+G  +   A    EVWTG+ + L
Sbjct: 680 PDIVPSDRALSALKTVYDACFLKFQNGEFGAANGVLPDGSPENPKATHPLEVWTGINFGL 739

Query: 247 SAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           +A ++  G  DEA      +   VY   GL F TPE +T   T+R+  Y+RA+A++ +
Sbjct: 740 AAFLVQMGMQDEALRLTQAVVEQVY-NNGLQFRTPEAITATGTFRASTYLRAMAIWGI 796


>gi|427725201|ref|YP_007072478.1| glucosylceramidase [Leptolyngbya sp. PCC 7376]
 gi|427356921|gb|AFY39644.1| Glucosylceramidase [Leptolyngbya sp. PCC 7376]
          Length = 807

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 100/310 (32%), Positives = 151/310 (48%), Gaps = 35/310 (11%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N     D + WKDL   F++ +YRD+ +   KD  FL   +   ++ +   
Sbjct: 501 PNEHPWEATNYTAYQDCNLWKDLGADFVLQVYRDFVMTGSKDTEFLWSCWEGVVQALDYL 560

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGL-------HIAALACMVEMSSLLND 128
           ++FD    GI +NGG PDQT          AYCGGL        IA    +   +S+  +
Sbjct: 561 KTFDLDCDGIPENGGSPDQTFDDWKLKGISAYCGGLWIAALEAAIAIANILKNKASIYEE 620

Query: 129 TQQQ---QQFSAKLTK-------ASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
            Q Q   ++F A + K       +  +Y D LW GSYY  D+     S  +M+DQLCG  
Sbjct: 621 HQNQVSTEEFEAAIAKYQDWLKQSRPLYQDTLWNGSYYTLDTGSD--SKVVMSDQLCGEF 678

Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR-SAIQSEE 237
           + +   +N   + P    A  L TI+      F  G  GA NG+ P+G  ++ +     E
Sbjct: 679 YTQL--LNLPNVNPENRTAITLQTIYEACFLKFHDGQFGAANGLNPDGSPEKENDTHPLE 736

Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
           VWTG+ + ++A M+  G  DEA      +   VY   GL F TPE +T + T+R+  Y+R
Sbjct: 737 VWTGINFGIAAFMIRNGMKDEALKMTEAVIHQVYS-NGLQFRTPEAITANNTFRACHYLR 795

Query: 298 ALAVYAMQDA 307
           A+ ++A+ DA
Sbjct: 796 AMGIWAIYDA 805


>gi|427733995|ref|YP_007053539.1| putative bile acid beta-glucosidase [Rivularia sp. PCC 7116]
 gi|427369036|gb|AFY52992.1| putative bile acid beta-glucosidase [Rivularia sp. PCC 7116]
          Length = 799

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 96/298 (32%), Positives = 151/298 (50%), Gaps = 26/298 (8%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E  +   N  +  D + WKDL   F++ +YRD+      DI FL   +   +E +   
Sbjct: 503 PNEHVWEKTNYTSYQDCNQWKDLGCDFVLQVYRDFIFTGSNDIEFLADCWNAIVETLNYL 562

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQ----- 130
           ++FDK G GI +N G PDQT          AYCGGL +AAL   + ++ +L+  Q     
Sbjct: 563 KAFDKDGDGIPENSGAPDQTFDDWRLLGVSAYCGGLWLAALEAAIAIAEVLSSKQVQVEN 622

Query: 131 ---QQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT 187
              Q+  +   L K+  VY + LW G +Y+ DS     SD +MADQLCG  + +   +  
Sbjct: 623 LAAQKSIYEGWLEKSRPVYQEKLWNGKFYRLDSESG--SDVVMADQLCGQFYARL--LKL 678

Query: 188 EAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYLL 246
             I P      AL +++      F  G +GA NG++ +G  +   A    EVWTG+ + +
Sbjct: 679 PDIVPPECTMSALNSVYDACFLKFNQGKLGAANGLRLDGSPENPDATHPLEVWTGINFGI 738

Query: 247 SAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           +A ++  G   EA+     +   +YE  GL F TPE +T  + +R+  Y+RA+A++A+
Sbjct: 739 AAFLVQMGMKSEAFKLTEAVVNQIYEH-GLQFRTPEAITATRNFRACVYLRAMAIWAI 795


>gi|366162568|ref|ZP_09462323.1| hypothetical protein AcelC_02756 [Acetivibrio cellulolyticus CD2]
          Length = 842

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 98/289 (33%), Positives = 150/289 (51%), Gaps = 17/289 (5%)

Query: 33  FSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKCESFDK 90
           F   N Y   D ++WKDLN K ++ +YRD+AL    D+ FL   +    + + K +S D 
Sbjct: 469 FETWNAYVYRDSTNWKDLNSKLVLMVYRDWALTGKTDMDFLKYCWIPVQKAMDKVKSQDS 528

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
            G  +  + G  DQT          AYCGGL +AA     E+++ + D+ +  Q+ +   
Sbjct: 529 DGDYLPNSSGI-DQTYDDMQLQGNTAYCGGLFVAACNAAKEIATAMGDSTKATQYQSWYE 587

Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
           K+ + Y   LW GSY+K D+  S  +  IM+DQLCG+ + KA G+  E I P     ++ 
Sbjct: 588 KSKENYIAKLWNGSYFKIDTG-SYDTSRIMSDQLCGHWYAKACGL--EGIVPDEYAKKSY 644

Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
           + ++  N + F+ G  G VN MK +G+ D S  Q+ E W G +Y + A M+ EG   EA 
Sbjct: 645 SKVYEFNNKKFDNGLHGFVNIMKADGNVDMSHAQTAEAWVGTSYGVIAGMIQEGLETEAS 704

Query: 261 TTAGGLYRTVYERTGLGFETPEGL-TGDKTYRSGGYMRALAVYAMQDAY 308
                L  TV++   + F TPE    G    R+  YMRA  ++A++ AY
Sbjct: 705 QVGSNLADTVWKTNDMWFRTPEAWRQGVSEVRAPYYMRANCIWAVKHAY 753


>gi|409991122|ref|ZP_11274412.1| hypothetical protein APPUASWS_08895 [Arthrospira platensis str.
           Paraca]
 gi|291568810|dbj|BAI91082.1| hypothetical protein [Arthrospira platensis NIES-39]
 gi|409938009|gb|EKN79383.1| hypothetical protein APPUASWS_08895 [Arthrospira platensis str.
           Paraca]
          Length = 799

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 104/299 (34%), Positives = 150/299 (50%), Gaps = 27/299 (9%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L    DI+FL   +P  ++ +   
Sbjct: 499 PNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDYLLTGADDIQFLVECWPAIVQTLDYL 558

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMV--------EMSSLLN 127
           ++FD+    I +NGG PDQT          AYCGGL +AAL   +          S +  
Sbjct: 559 KTFDRDRDCIPENGGAPDQTFDDWRMMGISAYCGGLWLAALEAAIAIANILLSHHSDITP 618

Query: 128 DTQQQ-QQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVN 186
           DT +    ++  L  A  +Y   LW GSYY  DS     SD +MADQLCG  +    G+ 
Sbjct: 619 DTTKAIATWTNWLETAKPLYDQTLWNGSYYNLDSQSG--SDVVMADQLCGQFYAALLGL- 675

Query: 187 TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYL 245
              I P      AL TI+    + F  G  GA NG++  G+  +       EVWTG+ + 
Sbjct: 676 -PDIVPHHRARVALQTIYQACFQNFHNGKFGAANGVRTTGEPINPQDTHPLEVWTGINFG 734

Query: 246 LSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           L A ++  G  +EA+  A  +   VY+  GL F TPE +T   T+R+  Y+RA+A++AM
Sbjct: 735 LGAFLIQMGMKEEAFQLAEAVITQVYQ-NGLQFRTPEAITAAGTFRASHYLRAMAIWAM 792


>gi|428773649|ref|YP_007165437.1| glucosylceramidase [Cyanobacterium stanieri PCC 7202]
 gi|428687928|gb|AFZ47788.1| Glucosylceramidase [Cyanobacterium stanieri PCC 7202]
          Length = 800

 Score =  153 bits (387), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 100/310 (32%), Positives = 147/310 (47%), Gaps = 35/310 (11%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY      D  FL   + + +E +   
Sbjct: 494 PNEHPWIKSNYTSYQDCNLWKDLGSDFVLLVYRDYLYTGKNDQDFLWECWDSVVETLHYV 553

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDT------ 129
           + FD    GI +N G PDQT          AYCGGL I AL   + + ++L D       
Sbjct: 554 KGFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWICALEAAIAIGTILLDNPPVNPA 613

Query: 130 ------QQQQQ-----FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
                 Q+  Q     F   L ++  +YHD LW G YY  DS     SD +MADQLCG  
Sbjct: 614 LEKENAQENIQGAIALFRQWLHQSRAIYHDTLWNGEYYNLDSKSG--SDVVMADQLCGQY 671

Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR-SAIQSEE 237
           +  A  +    +     +   L  I+      F  G++G  NG+KP+G  ++ +     E
Sbjct: 672 Y--AQLLKLPDVVEKEYVTATLNKIYDACYLKFHNGTLGIANGVKPDGSPEKENDTHPLE 729

Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
           +WTG+ Y + A M+  G  +E    A  +   VY   GL F TPE +T   T+R+  Y+R
Sbjct: 730 IWTGINYGIVAFMILNGMKEEGLKVAETVVNQVYS-NGLQFRTPEAITAVNTFRASHYLR 788

Query: 298 ALAVYAMQDA 307
           A+A++A+  A
Sbjct: 789 AMAIWAIYQA 798


>gi|56750695|ref|YP_171396.1| hypothetical protein syc0686_c [Synechococcus elongatus PCC 6301]
 gi|56685654|dbj|BAD78876.1| hypothetical protein [Synechococcus elongatus PCC 6301]
          Length = 798

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 95/293 (32%), Positives = 147/293 (50%), Gaps = 21/293 (7%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N     D + WKDL   F++ ++RDY L    D  FL+  +P  +  +R  
Sbjct: 496 PNESPWLQTNYTGYQDCNLWKDLGCDFVLQVWRDYQLTGSSDRGFLSDCWPAAVAALRYL 555

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLN---DTQQQ 132
           + FD  G GI +N G PDQT          AYCGGL IAAL   + ++ +L    + + +
Sbjct: 556 KDFDLDGDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEAALAIADVLELSAEDRDR 615

Query: 133 QQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFP 192
           Q F + L +A  +YHD LW G YY+ DS     S  +MADQLCG  + +   +    + P
Sbjct: 616 QDFQSWLAQARSLYHDTLWNGRYYQLDSGSG--SQVVMADQLCGDFYSRL--LQLPPVAP 671

Query: 193 SANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAML 251
                     ++    + F +G  G  NG+ P+G           EVWTG+ + ++A  L
Sbjct: 672 LEAAQSTADMVYEACFQKFHSGQFGLANGLLPDGSPVGPKGTHPLEVWTGINFGIAAYWL 731

Query: 252 YEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
             G+ D  +     + R +Y+  GL F TPE +T + T+R+  Y+R +A++A+
Sbjct: 732 LLGHRDRCFEVTETVIRQIYD-NGLQFRTPEAITANATFRASHYLRPMAIWAV 783


>gi|218437284|ref|YP_002375613.1| hypothetical protein PCC7424_0277 [Cyanothece sp. PCC 7424]
 gi|218170012|gb|ACK68745.1| protein of unknown function DUF608 [Cyanothece sp. PCC 7424]
          Length = 801

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 100/309 (32%), Positives = 154/309 (49%), Gaps = 39/309 (12%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY----ALHKDIRFLTRVYPTCLELIR 83
           P E P+   N  +  D + WKDL   F++ +YR +    A ++D  FL   + +  + I 
Sbjct: 495 PNEHPWEKTNYTSYQDCNLWKDLGCDFVLLLYRSFLWTGANNQD--FLWEHWDSVTQTIA 552

Query: 84  KCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEM-SSLLNDTQQQ 132
             + FD  G GI +N G PDQT          AYCG L IA L   +++  +LL      
Sbjct: 553 YVKKFDLDGDGIPENSGAPDQTFDDWQLRGISAYCGALWIAGLEAAIKIGETLLQRPPMN 612

Query: 133 QQFSAK----------------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCG 176
            Q  ++                L +A  +YHD LWTG YY+ DS     SD +M+DQLCG
Sbjct: 613 PQLQSENAPEKIEIAIATYRQWLHQARALYHDTLWTGEYYRLDSESG--SDVVMSDQLCG 670

Query: 177 YLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQS 235
             + +  G+  + + P   ++ AL  I+ +    F  G  GA NG++P+G  +   A   
Sbjct: 671 QFYARLLGL-PDVVEPQYTLS-ALRKIYESCFLQFYGGKYGAANGVRPDGSPENPDATHP 728

Query: 236 EEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGY 295
            EVWTG+ + L+A M+  G   E  +    + + +YE  GL F TPE +T   T+R+  Y
Sbjct: 729 LEVWTGINFGLAAFMIQMGMKQEGLSITEAVVKQIYE-NGLQFRTPEAITAAGTFRASHY 787

Query: 296 MRALAVYAM 304
           +RA+A++A+
Sbjct: 788 LRAMAIWAI 796


>gi|16183333|gb|AAL13692.1| GH26380p [Drosophila melanogaster]
          Length = 391

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 82/204 (40%), Positives = 115/204 (56%), Gaps = 19/204 (9%)

Query: 68  IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
           I +L  +Y +C  ++ +   +DK   G+++N   PDQT          AYC GL +AAL 
Sbjct: 169 IGYLKAMYASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQ 228

Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
            M  M+++L+      ++   L K  +   + LW GSYY+FD S S H D+IMADQLCG+
Sbjct: 229 AMSAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLSHS-HRDTIMADQLCGH 287

Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
            +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N       G  D 
Sbjct: 288 WYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVDN 346

Query: 231 SAIQSEEVWTGVTYLLSAAMLYEG 254
           S IQ+EEVW GV Y L+A M+ EG
Sbjct: 347 SNIQAEEVWPGVVYALAATMIQEG 370



 Score = 63.9 bits (154), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 31/35 (88%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
           P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 66  PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 100


>gi|434391596|ref|YP_007126543.1| Glucosylceramidase [Gloeocapsa sp. PCC 7428]
 gi|428263437|gb|AFZ29383.1| Glucosylceramidase [Gloeocapsa sp. PCC 7428]
          Length = 809

 Score =  151 bits (381), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 97/313 (30%), Positives = 149/313 (47%), Gaps = 44/313 (14%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL+  F++ +YRD+ L    D   L   +   ++ +   
Sbjct: 499 PNEHPWEATNYTSYQDCNLWKDLSCDFVLQVYRDFVLTGGTDWELLWDCWSAVVQTLTYL 558

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGL---------------------HIA 114
           ++FD    GI +N G PDQT          AYCGGL                     H A
Sbjct: 559 KTFDLDEDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALEAAIAIGKTLLSYPDDHPA 618

Query: 115 A--LACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMAD 172
           +  LA   +   +    +    F + L ++  +Y + LW G YY+ DS     SD +MAD
Sbjct: 619 SKILASAPDYPPI---PETLDVFQSWLMRSRPIYQEKLWNGQYYRLDSKSG--SDVVMAD 673

Query: 173 QLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRS 231
           QLCG  + +  G+    I P     +AL T++      F  G  GA NG+KP+G   + +
Sbjct: 674 QLCGQFYARLLGL--PDIVPPECATQALKTVYDACFVKFHNGQFGAANGLKPDGLPENPN 731

Query: 232 AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYR 291
           A    EVWTG+ + L+A ++  G   EAW     + + VY+  GL F TPE +T   T+R
Sbjct: 732 ATHPLEVWTGINFGLAAFLMQMGMKSEAWRITQAVVQQVYD-NGLQFRTPEAITAKGTFR 790

Query: 292 SGGYMRALAVYAM 304
           +  Y+R +A++A+
Sbjct: 791 ACHYLRPMAIWAI 803


>gi|186681980|ref|YP_001865176.1| hypothetical protein Npun_R1544 [Nostoc punctiforme PCC 73102]
 gi|186464432|gb|ACC80233.1| protein of unknown function DUF608 [Nostoc punctiforme PCC 73102]
          Length = 801

 Score =  150 bits (380), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 98/299 (32%), Positives = 151/299 (50%), Gaps = 27/299 (9%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E  +   N  +  D + WKDL   F++ +YRD+ L    D+ FL   +   ++ +   
Sbjct: 503 PNEHVWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFLLTGADDVEFLADCWNAIVQTLDYL 562

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQ----- 130
           ++FD  G GI +N G PDQT          AYCGGL +AAL   + +S LL   +     
Sbjct: 563 KTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALEAAIAISDLLLTHKLGDLG 622

Query: 131 ----QQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVN 186
               Q+  +   L ++  +Y + LW G YY+ DS     SD +MADQLCG  + +   ++
Sbjct: 623 GLAVQKSIYETWLQQSLPIYQEKLWNGQYYQLDSQSG--SDVVMADQLCGQFYARL--LD 678

Query: 187 TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYL 245
              I PS     AL T++      F  G  G  NG++P+G  +   A    EVWTG+ + 
Sbjct: 679 LPDIVPSDRALSALKTVYDACFLKFCNGEFGTANGVRPDGSPENPKATHPLEVWTGINFG 738

Query: 246 LSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           L+A ++  G  DEA      + + +YE  GL F TPE +T   T+R+  Y+RA+A++ +
Sbjct: 739 LAAFLVQMGMKDEALRLTQAVVQQIYEN-GLQFRTPEAITAAGTFRASTYLRAMAIWGI 796


>gi|449691194|ref|XP_004212588.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Hydra
           magnipapillata]
          Length = 169

 Score =  150 bits (379), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 74/160 (46%), Positives = 106/160 (66%), Gaps = 3/160 (1%)

Query: 154 SYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT---EAIFPSANIARALATIFSTNVRG 210
           +YY +D S++ + + IMA Q  G  +L A  +     + +F    +   L  IF  NV  
Sbjct: 1   TYYNYDGSENDYHNCIMAGQCAGQWYLHACDLGQTEDDLVFRPEQVHSTLKVIFQNNVMR 60

Query: 211 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 270
           FE G MGAVNGM+ +G  D++++Q+EEVWTGVTY L+A M+  G V E + TA G+YRTV
Sbjct: 61  FEDGFMGAVNGMRADGTVDKNSLQAEEVWTGVTYALAATMIQHGMVREGFRTAEGIYRTV 120

Query: 271 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 310
           YE+ GLGF+TPE +T +KT+RS GYMR L+++++  AY K
Sbjct: 121 YEKWGLGFQTPEAITANKTFRSRGYMRPLSIWSIYHAYSK 160


>gi|434400779|ref|YP_007134783.1| Glucosylceramidase [Stanieria cyanosphaera PCC 7437]
 gi|428271876|gb|AFZ37817.1| Glucosylceramidase [Stanieria cyanosphaera PCC 7437]
          Length = 804

 Score =  150 bits (378), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 104/307 (33%), Positives = 146/307 (47%), Gaps = 35/307 (11%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L   KD  FL   + + +  +   
Sbjct: 495 PNEHPWQATNYTSYQDCNLWKDLPSDFVLQVYRDYVLTGSKDSEFLWECWDSIVVTLAYL 554

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAA----------LACMVEMSSL 125
           + FD  G GI +N G PDQT          AYCGGL IAA          L     M+  
Sbjct: 555 KEFDLDGDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEAAIAIGQILMTHPPMNPN 614

Query: 126 LNDTQQQQ-------QFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
           L      Q        +   L +A  +YHD LW G YY+ DS    +SD +MADQLCG  
Sbjct: 615 LQPPDYPQGIETVINHYRVWLQQARSLYHDTLWNGEYYRLDSQS--NSDVVMADQLCGQF 672

Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEE 237
           + +  G+    +        AL  I+      F  G  GA NG+ P+G   + +     E
Sbjct: 673 YARLLGL--PDVVEEQYTKSALKKIYQACFLKFHNGKYGAANGVLPDGSPVNPNDTHPLE 730

Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
           VWTG+ + L+A ML  G   EA+     + + VYE  GL F TPE +T   T+R+  Y+R
Sbjct: 731 VWTGINFGLAAFMLQMGMKAEAFQLTETVVKQVYE-NGLQFRTPEAITAVGTFRASHYLR 789

Query: 298 ALAVYAM 304
           A+A++ +
Sbjct: 790 AMAIWGI 796


>gi|284929778|ref|YP_003422300.1| putative bile acid beta-glucosidase [cyanobacterium UCYN-A]
 gi|284810222|gb|ADB95919.1| predicted bile acid beta-glucosidase [cyanobacterium UCYN-A]
          Length = 803

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 95/307 (30%), Positives = 150/307 (48%), Gaps = 35/307 (11%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRD+ L   +DI FL   +P  +E +   
Sbjct: 495 PNEHPWEETNYTSYQDCNLWKDLGSDFVLQVYRDFLLTDSQDINFLQECWPAIVETLDYL 554

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL-----NDTQ 130
           + FD    GI +N G PDQT          +YCG L I AL   + +  +L     N++ 
Sbjct: 555 KKFDLDDDGIPENSGAPDQTFDDWQLHGISSYCGALWITALEAAIGIGKILLKNSENNSH 614

Query: 131 QQQQFSAK------------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
              Q S              L K+  +YH+ LW G YY+ DS     SD+IM DQLCG  
Sbjct: 615 PDLQNSLAFIQENLTKYDDWLKKSRLIYHETLWNGEYYQLDSKSG--SDTIMTDQLCGQF 672

Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEE 237
           +  A  +N   +  ++    AL+ I+      F  G  GA NG++ +G   + ++   +E
Sbjct: 673 Y--ARLLNLPDVVDTSCTISALSKIYEACFLNFYDGQYGAANGVRSDGTPENPNSTHPQE 730

Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
           VW G+ + L+A ++  G  ++A      +   +Y   GL F TPE +T   T+R+  Y+R
Sbjct: 731 VWVGINFGLAAFLIQMGMKNQALELTKVVVEQIYAN-GLQFRTPEAITPVGTFRASHYLR 789

Query: 298 ALAVYAM 304
           A+A++ +
Sbjct: 790 AMAIWGV 796


>gi|317969095|ref|ZP_07970485.1| hypothetical protein SCB02_06140 [Synechococcus sp. CB0205]
          Length = 837

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 96/297 (32%), Positives = 152/297 (51%), Gaps = 25/297 (8%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
           P E+P+   N     D + WKDL   +++ ++R + L    +D+RFL   +P  +E +R 
Sbjct: 530 PNEKPWDATNYTAYQDCNLWKDLASDYVLQVWRTFKLAPNGEDLRFLADCWPAAVEALRY 589

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALAC------MVEMSSLLND 128
            ++FD    G+  NGG PDQT          AYCG L IAAL         +++S+ L+ 
Sbjct: 590 LKTFDINNDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALDAALAIAQQLQLSTGLDT 649

Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
           + +QQ+FSA L +A   +  LLW G YY  D+     +  +MADQLCG  + +  G+  E
Sbjct: 650 SAEQQEFSAWLAQARGNFDRLLWNGEYYDIDAESG--TPVVMADQLCGDFYARLLGL--E 705

Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
            +   AN    L  +  +    FE GS+G  NG++ +G   D +     EVWTG+ + ++
Sbjct: 706 PVVSEANSRSTLNAVKDSCFEKFEGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIA 765

Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           +     G  + A      +   VY   GL F TPE +T   T+R+  Y+RA+A++ +
Sbjct: 766 SYFRLMGETETALAITSAVVNQVYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWGV 821


>gi|225593155|gb|ACN96073.1| pedicted bile acid beta-glucosidase [Fischerella sp. MV11]
          Length = 812

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 101/308 (32%), Positives = 150/308 (48%), Gaps = 36/308 (11%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E  +   N  +  D + WKDL   F++ +YRD+      DI FL   +   ++ +   
Sbjct: 505 PNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFLFTGADDITFLAECWNAIVQTLDYL 564

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL----NDTQQ 131
           ++FD    GI +N G PDQT          AYCGGL +AAL   + +  +L     DT+ 
Sbjct: 565 KTFDIDNDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALEAAIAICDVLITNRRDTEN 624

Query: 132 QQQF--------------SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
            ++F                 L ++  +Y   LW G YY+ DS     SD +MADQLCG 
Sbjct: 625 AEKFLGDLGDLAVQKSIYETWLQQSRPIYEAKLWNGQYYRLDSESG--SDVVMADQLCGQ 682

Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSE 236
            + +  G+    I        AL T+++     F  G  GA NG++P+G  +   A    
Sbjct: 683 FYARLLGLPD--IVKRDRALSALKTVYNACFLKFHHGQFGAANGLRPDGSPENPQATHPL 740

Query: 237 EVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYM 296
           EVWTG+ + L+A M+  G  DEA      + R VYE  GL F TPE +T   T+R+  Y+
Sbjct: 741 EVWTGINFGLAAFMVQMGMKDEAMKLTEAVVRQVYE-NGLQFRTPEAITAVGTFRASTYL 799

Query: 297 RALAVYAM 304
           RA+A++A+
Sbjct: 800 RAMAIWAI 807


>gi|414077215|ref|YP_006996533.1| beta-glucosidase [Anabaena sp. 90]
 gi|413970631|gb|AFW94720.1| putative beta-glucosidase [Anabaena sp. 90]
          Length = 795

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 93/299 (31%), Positives = 153/299 (51%), Gaps = 27/299 (9%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E  +   N  +  D + WKDL   F++ +YRD+      D++FL   +   ++ +   
Sbjct: 497 PNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFLFTGANDLQFLADCWDAIVQTLDYV 556

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQ----- 130
           ++FD  G  I +N G PDQT          AYCGGL +AAL   + +S +L + +     
Sbjct: 557 KTFDLDGDSIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALEAAIAISKILTNRRGAEDA 616

Query: 131 ----QQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVN 186
               Q   +   L+++  +Y + LW G YY+ DS     SD +MADQLCG  + +   ++
Sbjct: 617 EVEVQGSIYQGWLSQSKPIYQEKLWNGEYYRLDSDSG--SDVVMADQLCGQFYARL--LS 672

Query: 187 TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYL 245
              I PS     AL  ++ +    F+ G  GA NG+ PNG   + ++    EVWTG+ + 
Sbjct: 673 LPDIVPSDRALSALTKVYDSCFLKFQNGEFGAANGVLPNGLPENPNSTHPLEVWTGINFG 732

Query: 246 LSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           L+A ++     +EA      + R +Y+  GL F TPE +T + T+R+  Y+RA+A++ +
Sbjct: 733 LAAFLVQMNMKNEAMRLTEAVVRQIYD-NGLQFRTPEAITANGTFRASTYLRAMAIWGI 790


>gi|113477064|ref|YP_723125.1| hypothetical protein Tery_3568 [Trichodesmium erythraeum IMS101]
 gi|110168112|gb|ABG52652.1| protein of unknown function DUF608 [Trichodesmium erythraeum
           IMS101]
          Length = 816

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 100/315 (31%), Positives = 151/315 (47%), Gaps = 43/315 (13%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+ + N  +  D + WKDL+  F++ +YRD+ L    D  FL + +    E +   
Sbjct: 499 PNEHPWEMTNYTSYQDCNQWKDLSSDFVLQVYRDFLLTGADDYEFLWQSWSAITETLAYL 558

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGL-----------------HIAALA- 117
           + FDK   GI +N G PDQT          AYCGGL                 H   +  
Sbjct: 559 KGFDKDNDGIPENEGAPDQTFDDWQLRGVSAYCGGLWLAALEAAIAIGKVLIEHPREIPY 618

Query: 118 -------CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIM 170
                    V+ +S+     Q   +   L K   +Y + LW G YY+ DS    +S+ +M
Sbjct: 619 YPPKGFYSEVDKNSVDAINNQVYLYQGWLKKGLPIYQEKLWNGEYYRLDSES--NSEVVM 676

Query: 171 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-RD 229
           ADQL G  + K   +N E I P+     AL T++++  + F  G  GA NG+ P+G   +
Sbjct: 677 ADQLSGQFYAKL--LNLEDIVPAECALSALKTVYNSCFKNFHNGKFGAANGVLPDGSPEN 734

Query: 230 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKT 289
            +A    EVWTG+ + L+A M+  G   EA      +   +YE  GL F TPE +T   T
Sbjct: 735 PNATHPLEVWTGINFGLAAFMVQIGMKKEALEITEVVVGQIYE-NGLQFRTPEAITVMGT 793

Query: 290 YRSGGYMRALAVYAM 304
           +R+  Y+RA+A++A+
Sbjct: 794 FRASHYLRAMAIWAI 808


>gi|354566897|ref|ZP_08986068.1| Glucosylceramidase [Fischerella sp. JSC-11]
 gi|353544556|gb|EHC14010.1| Glucosylceramidase [Fischerella sp. JSC-11]
          Length = 812

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 101/308 (32%), Positives = 148/308 (48%), Gaps = 36/308 (11%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E  +   N  +  D + WKDL   F++ +YRD+      DI FL   +   +E +   
Sbjct: 505 PNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFLFTGADDITFLAECWNAIVETLDYL 564

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL----NDTQQ 131
           ++FD    GI +N G PDQT          AYCGGL +AAL   + +  +L     DT  
Sbjct: 565 KTFDIDNDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALEAAIAICDVLITNRRDTDN 624

Query: 132 QQQFSAKL--------------TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
            + F   L               ++  +Y   LW G YY+ DS     SD +MADQLCG 
Sbjct: 625 AENFLGDLGDLAVQKSIYETWLQQSRPIYEAKLWNGQYYRLDSESG--SDVVMADQLCGQ 682

Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSE 236
            + +  G+    I        AL T+++     F  G  GA NG++P+G  +   A    
Sbjct: 683 FYARLLGL--PDIVKRDRALSALKTVYNACFLKFHHGQFGAANGLRPDGSPENPQATHPL 740

Query: 237 EVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYM 296
           EVWTG+ + L+A M+  G  DEA      + R +YE  GL F TPE +T   T+R+  Y+
Sbjct: 741 EVWTGINFGLAAFMVQMGMKDEAMKLTETVVRQIYE-NGLQFRTPEAITAVGTFRASTYL 799

Query: 297 RALAVYAM 304
           RA+A++A+
Sbjct: 800 RAMAIWAI 807


>gi|300866990|ref|ZP_07111661.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300335025|emb|CBN56827.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 863

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 101/321 (31%), Positives = 150/321 (46%), Gaps = 49/321 (15%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRD+ L    D  FL   +P  +E +   
Sbjct: 539 PNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDFLLTGSTDYEFLQECWPAIVEALAYL 598

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQ- 134
           ++FD  G GI +N G PDQT          AYCGGL +AAL   + + SLL   Q   + 
Sbjct: 599 KTFDLDGDGIPENSGAPDQTFDDWQMRGVSAYCGGLWLAALEAAIAIGSLLIKHQSSVEE 658

Query: 135 ------------------------------FSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
                                         +   L +A  +Y   LW G YY+ DS    
Sbjct: 659 LNQSPITNFNKDFKDVSLFPITNYQLPITNYQNWLEQAKPIYQQRLWNGQYYRLDSESG- 717

Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
            SD +MADQLCG  + +  G+    I P      AL T++ +    F  G  GA NG+  
Sbjct: 718 -SDVVMADQLCGQFYARLLGL--PDIVPEECAISALETVYESCFIKFNQGEFGAANGVML 774

Query: 225 NGDRDRS-AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEG 283
           +G  ++  A    EVWTG+ + L+A M+  G  ++A      + + +Y   GL F TPE 
Sbjct: 775 DGSPEKPGATHPLEVWTGINFGLAAFMMQMGMKEKALELTEVVVKQIYA-NGLQFRTPEA 833

Query: 284 LTGDKTYRSGGYMRALAVYAM 304
           +T   T+R+  Y+RA+A++A+
Sbjct: 834 ITATGTFRASHYLRAMAIWAI 854


>gi|427419169|ref|ZP_18909352.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 7375]
 gi|425761882|gb|EKV02735.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 7375]
          Length = 835

 Score =  146 bits (369), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 98/325 (30%), Positives = 156/325 (48%), Gaps = 53/325 (16%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YR + L    D+ FL   +P   + ++  
Sbjct: 512 PNENPWIATNYTSYQDCNQWKDLPSDFVLQVYRAFVLTGGDDLEFLAECWPAVTQTLQYL 571

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQ---- 131
           + FD  G G+ +NGG PDQT          AYCGGL +AAL   ++M  LL   ++    
Sbjct: 572 KLFDLDGDGVPENGGAPDQTFDDWQLKGLSAYCGGLWLAALEAAIKMGDLLQQHRKISGS 631

Query: 132 ----QQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT 187
               + Q+   + +   VY + LWTGS+Y+ DS    +SD +MADQLCG    +   ++ 
Sbjct: 632 FAIVKSQYQRWIKQGKIVYQNKLWTGSFYRLDS--ESNSDVVMADQLCGQFCARL--MDL 687

Query: 188 EAIFPSANIARALATIF------------------STNVRGFEAG---------SMGAVN 220
             I     I  AL TI+                  +    G +AG         ++GA N
Sbjct: 688 PDIVDQEFIDIALDTIYDACFVKFNEYAAKHSTPQNQKFEGSQAGFFSAAQLGMAIGAAN 747

Query: 221 GMKPNGDRDRSAIQSE-EVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFE 279
           G+KP+G  +      + EVWTG+ + ++A +   G  D+A      + + +Y   GL F 
Sbjct: 748 GVKPDGSPENPDDTHQLEVWTGINFGVAAFLAQMGKRDQAMAITEAVVQQIYNY-GLQFR 806

Query: 280 TPEGLTGDKTYRSGGYMRALAVYAM 304
           TPE +T   T+R+  Y+R +A++ +
Sbjct: 807 TPEAITAMGTFRACHYLRPMAIWGL 831


>gi|428223619|ref|YP_007107716.1| glucosylceramidase [Geitlerinema sp. PCC 7407]
 gi|427983520|gb|AFY64664.1| Glucosylceramidase [Geitlerinema sp. PCC 7407]
          Length = 821

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 98/304 (32%), Positives = 150/304 (49%), Gaps = 32/304 (10%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRD+ L    D  F+   +P     +   
Sbjct: 504 PNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFRLTGSTDDAFIAECWPAVTRALAYL 563

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL-----NDTQ 130
           ++FD+ G GI +N G PDQT          AYCGGL IAAL   + +   L     +D  
Sbjct: 564 KAFDQDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWIAALEAAIALGERLQQHPVSDHP 623

Query: 131 QQQQ---------FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLK 181
           +  Q         +   L ++  VY D LW G YY+ DS     S+ +MADQLCG  + +
Sbjct: 624 ELSQGAIAHHLSTYRTWLAQSRAVYDDALWNGRYYRLDSGSG--SEVVMADQLCGQFYAR 681

Query: 182 ASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWT 240
              ++   +  S     AL T++ +    F  G  GA NG+KP+G  +   A    EVWT
Sbjct: 682 L--LDLPDVVASDRALSALKTVYDSCFLRFHDGKFGAANGVKPDGSPENPKATHPLEVWT 739

Query: 241 GVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALA 300
           G+ + ++A ++  G  DE +     +   +YE  GL F TPE +T   T+R+  Y+R +A
Sbjct: 740 GINFGIAAFLVQMGLRDEGFRLTQAVVEQIYEH-GLQFRTPEAITAVGTFRACHYLRPMA 798

Query: 301 VYAM 304
           ++A+
Sbjct: 799 IWAV 802


>gi|443311911|ref|ZP_21041533.1| putative bile acid beta-glucosidase [Synechocystis sp. PCC 7509]
 gi|442777986|gb|ELR88257.1| putative bile acid beta-glucosidase [Synechocystis sp. PCC 7509]
          Length = 804

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 96/310 (30%), Positives = 143/310 (46%), Gaps = 38/310 (12%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRD+ L    D   L   +   ++ +   
Sbjct: 492 PNEHPWEKTNYTSYQDCNLWKDLPSDFVLQVYRDFVLTGSTDWELLWDCWGAIVQTLNYL 551

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
           ++FD    GI +N G PDQT          AYCGGL +A L   + +   L    Q+   
Sbjct: 552 KTFDLDNDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAGLESAIAIGKTLISYPQEHPL 611

Query: 136 SAKLT--------------------KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLC 175
              LT                    ++  VY + LW G YYK DS     SD +MADQLC
Sbjct: 612 GEMLTSPLNQPPIEETIGIYQTWLEQSRPVYQEKLWNGQYYKLDSESG--SDVVMADQLC 669

Query: 176 GYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQ 234
           G  + +  GV    I  S     +L T++      F  G+ GA NG+K +G  +   A  
Sbjct: 670 GQFYARLLGV--PDIVESDRTVSSLKTVYEACFLKFHNGAFGAANGLKIDGSPENPDATH 727

Query: 235 SEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGG 294
             EVWTG+ + L+A +L     +EA+     +   +Y   GL F TPE +T   T+R+  
Sbjct: 728 PLEVWTGINFGLAAFLLQMDMKEEAFKLTQTVVEQIY-NNGLQFRTPEAITAVGTFRACH 786

Query: 295 YMRALAVYAM 304
           Y+R +A++A+
Sbjct: 787 YLRPMAIWAV 796


>gi|254423918|ref|ZP_05037636.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
 gi|196191407|gb|EDX86371.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
          Length = 828

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 97/322 (30%), Positives = 154/322 (47%), Gaps = 48/322 (14%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL+  F++ +YR +      D+ FL   +P  +E +   
Sbjct: 503 PNENPWMATNYTSYQDCNQWKDLSSDFVLQVYRAFESTGSVDMDFLLDCWPAIVETLGYT 562

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQ-- 133
           + FD  G G+++N G PDQT          AYCGGL IAAL+  +++  +L ++++ +  
Sbjct: 563 KLFDTDGDGLIENSGAPDQTFDDWKLQGISAYCGGLWIAALSAAIKIGEILQESKRVEGT 622

Query: 134 ------QFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASG--- 184
                 Q+          Y   LW G YY+ D+     SD +MADQLCG       G   
Sbjct: 623 IVILLAQYRRWWQHGRTAYQKQLWNGEYYRLDTGSG--SDVVMADQLCGQFCASTMGLPD 680

Query: 185 -VNTEAIFPSAN-IARALATIF----------STNVRGFEAG---------SMGAVNGMK 223
            V++E + P+   I  A    F          +    G + G          +G  NG++
Sbjct: 681 VVDSEFVEPTLRAIYEACFVRFNQYTAQLGPQNQKFEGAQTGYFSASELGVKVGCANGVR 740

Query: 224 PNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPE 282
           P+G  ++       EVWTG+ + L+A + +EG + EA      + R VYE  GL F TPE
Sbjct: 741 PDGSPQNPDDTHQLEVWTGINFGLAAFLAHEGKLHEAMEITEAVVRQVYEH-GLQFRTPE 799

Query: 283 GLTGDKTYRSGGYMRALAVYAM 304
            +T   T+R+  Y+R +A++A+
Sbjct: 800 AITAVGTFRACHYLRPMAIWAV 821


>gi|78212458|ref|YP_381237.1| hypothetical protein Syncc9605_0920 [Synechococcus sp. CC9605]
 gi|78196917|gb|ABB34682.1| conserved hypothetical protein [Synechococcus sp. CC9605]
          Length = 832

 Score =  144 bits (364), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 94/297 (31%), Positives = 150/297 (50%), Gaps = 25/297 (8%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
           P E P+   N     D + WKDL   F++ ++R + L    +DIRFL   +P  +E +R 
Sbjct: 525 PNEIPWDATNYTAYQDCNLWKDLGSDFVLQVWRTFKLAPSGEDIRFLADCWPAAVEALRY 584

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALAC------MVEMSSLLND 128
            ++FD    G+  NGG PDQT          AYCG L IAAL         +++S+ L+ 
Sbjct: 585 LKTFDVNNDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAIAQTLQLSTGLDT 644

Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
           + +Q++FS  L ++   +  LLW G YY  D+     +  +MADQLCG  + +   +   
Sbjct: 645 SAEQREFSGWLEQSRGNFDKLLWNGEYYDIDAESG--TPVVMADQLCGDFYARL--LALP 700

Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
            +   AN    L  +       F+ GS+G  NG++ +G   D +A    EVWTG+ + ++
Sbjct: 701 PVVSDANSLSTLKAVKEACFEAFDGGSLGVANGLRRDGTPLDPNATHPLEVWTGINFGIA 760

Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           +     G+   A +    +   VY  +GL F TPE +T   TYR+  Y+RA+A++ +
Sbjct: 761 SYYRLMGDKQTAQSICSAVVEQVYS-SGLQFRTPEAITAVNTYRACHYLRAMAIWGL 816


>gi|428210868|ref|YP_007084012.1| putative bile acid beta-glucosidase [Oscillatoria acuminata PCC
           6304]
 gi|427999249|gb|AFY80092.1| putative bile acid beta-glucosidase [Oscillatoria acuminata PCC
           6304]
          Length = 820

 Score =  144 bits (363), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 103/316 (32%), Positives = 151/316 (47%), Gaps = 44/316 (13%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F + +YRD+ L    D  FL   +P+ ++ I   
Sbjct: 499 PNEHPWVKTNYTSYQDCNLWKDLPCDFALLVYRDFLLTGGDDYEFLWDCWPSMVQAIAYL 558

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIA----------ALACMVEMSSL 125
           ++FD  G GI +NGG PDQT          AYCGGL IA           +      S +
Sbjct: 559 KTFDTDGDGIPENGGAPDQTFDDWPMRGVSAYCGGLWIAALQAAIAIGEVILNNPPHSGV 618

Query: 126 LNDTQQQQQFSAKLTKAS----------------QVYHDLLWTGSYYKFDSSQSRHSDSI 169
             DT  +Q+ S  L  A+                 +Y   LW G+YY+ DS     S+ +
Sbjct: 619 FTDTIYEQKRSQLLEPATIQATLSTYQDWLEQSRPLYQHKLWNGTYYRLDSESG--SEVV 676

Query: 170 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 228
           MADQLCG  + +  G+    I P      AL T++ +    F  G  GA NG+  +G   
Sbjct: 677 MADQLCGQFYARLLGLAD--IVPIDCAESALKTVYESCFLKFHNGEFGAANGVLIDGSPV 734

Query: 229 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDK 288
           +  A    EVWTG+ + L+A +L  G  +E +     + + VYE  GL F TPE +T   
Sbjct: 735 NPDATHPLEVWTGINFGLAAFLLQMGMKEEGFKLTETVIQQVYE-NGLQFRTPEAITAVG 793

Query: 289 TYRSGGYMRALAVYAM 304
           T+R+  Y+RA+A++A+
Sbjct: 794 TFRASHYLRAMAIWAV 809


>gi|87125468|ref|ZP_01081313.1| hypothetical protein RS9917_01806 [Synechococcus sp. RS9917]
 gi|86166768|gb|EAQ68030.1| hypothetical protein RS9917_01806 [Synechococcus sp. RS9917]
          Length = 832

 Score =  143 bits (361), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 89/297 (29%), Positives = 140/297 (47%), Gaps = 25/297 (8%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
           P E P+   N     D + WKDL   +++ ++R + L    +DI FL   +P  +E +  
Sbjct: 525 PNENPWDATNYTAYQDCNLWKDLASDYVLQVWRTFKLSPSGEDINFLAECWPAAVEALTY 584

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEM------SSLLND 128
            + FD    G+  NGG PDQT          AYCG L IAAL   + M      ++ L+ 
Sbjct: 585 LKGFDTNNDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAMGQQLQLATGLDT 644

Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
             +Q  FS  L ++   +  LLW G YY  D+     +  +MADQLCG  + +   +   
Sbjct: 645 AAEQHTFSTWLEQSRANFDTLLWNGEYYDIDAESG--TPVVMADQLCGDFYARL--LKLP 700

Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
           A+   AN    L  +  +    F+ GS+G  NG++ +G   D       EVWTG+ + ++
Sbjct: 701 AVVSDANCRSTLKAVKESCFEAFQGGSLGVANGLRRDGTPLDPQGTHPLEVWTGINFGIA 760

Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           +      +   A      +   VY   GL F TPE +T   T+R+  Y+RA+A++ +
Sbjct: 761 SYYRLMDDTSTALAITSAVVNQVYA-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 816


>gi|428307493|ref|YP_007144318.1| glucosylceramidase [Crinalium epipsammum PCC 9333]
 gi|428249028|gb|AFZ14808.1| Glucosylceramidase [Crinalium epipsammum PCC 9333]
          Length = 811

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 97/311 (31%), Positives = 154/311 (49%), Gaps = 39/311 (12%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E P+   N  +  D + WKDL   F++ +YRDY L    DI FL+  +   ++ +   
Sbjct: 501 PNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDYLLTGATDIEFLSECWSAIVKTLAYL 560

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQ- 134
           + FD  G GI +N G PDQT          AYCGGL +AAL   + +  +L +   + + 
Sbjct: 561 KEFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALEAAIAIGEILINHGVELED 620

Query: 135 ---------FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV 185
                    + + L ++  +Y + LW G YY+ DS     S+ +MADQLCG  + +   +
Sbjct: 621 GDFESAIATYKSWLEQSKPLYQEKLWNGQYYQLDSESG--SNVVMADQLCGQFYARL--L 676

Query: 186 NTEAIFPSANIARALATIF-----------STNVRGFEAGSMGAVNGMKPNGDRDRS-AI 233
           N   I P+  +  AL T++           S +        +GA NG+  +G  +   A 
Sbjct: 677 NLPDIVPNDCVESALKTVYDSCFLKFNQKLSQHSESLTPTFIGAANGVLADGSPENPQAT 736

Query: 234 QSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSG 293
              EVWTG+ + L+A M+  G  DEA+     + + +Y   GL F TPE +T   T+R+ 
Sbjct: 737 HPLEVWTGINFGLAAFMIQMGMKDEAFQLTEAVVQQIY-NNGLQFRTPEAITAAGTFRAS 795

Query: 294 GYMRALAVYAM 304
            Y+RA+A++A+
Sbjct: 796 HYLRAMAIWAI 806


>gi|427702545|ref|YP_007045767.1| bile acid beta-glucosidase [Cyanobium gracile PCC 6307]
 gi|427345713|gb|AFY28426.1| putative bile acid beta-glucosidase [Cyanobium gracile PCC 6307]
          Length = 835

 Score =  140 bits (354), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 98/300 (32%), Positives = 146/300 (48%), Gaps = 31/300 (10%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHK---DIRFLTRVYPTCLELIRK 84
           P E PF   N     D + WKDL   +++ ++R Y+L     DIRFL   +P  +  +  
Sbjct: 527 PNERPFDATNYTAYQDCNLWKDLASDYVLQVWRTYSLAPSGPDIRFLADCWPAAVTALDY 586

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM------VEMSSLLND 128
            + FD    G+  NGG PDQT          AYCG L IAAL         +++   L+ 
Sbjct: 587 LKQFDVNDDGLPDNGGAPDQTFDDWPLQGVSAYCGALWIAALEAALAIGQRLQLELGLDT 646

Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
            ++Q++F   L ++   +  LLW G YY  D+     +  +MADQLCG  + +       
Sbjct: 647 AEEQRRFGGWLEQSRSHFDTLLWNGEYYAIDAGSG--TPVVMADQLCGDFYAR-----LL 699

Query: 189 AIFPSANIARALAT---IFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTY 244
           A+ P     RAL++   I       FE G +G  NG++ +G   D       EVWTG+ +
Sbjct: 700 ALPPVVADERALSSLKAIRQACFESFEGGRLGVANGLRRDGTPLDPEGTHPLEVWTGINF 759

Query: 245 LLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
            L+A     G    A+   G + R VYE  GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 760 GLAAYYRLMGQSSTAFAITGAVVRQVYE-GGLQFRTPEAITAAGTFRACHYLRAMAIWAL 818


>gi|318042148|ref|ZP_07974104.1| hypothetical protein SCB01_10580 [Synechococcus sp. CB0101]
          Length = 836

 Score =  140 bits (353), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 92/297 (30%), Positives = 144/297 (48%), Gaps = 25/297 (8%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
           P E P+   N     D + WKDL   +++ ++R + L    +DI FL   +P  +E +  
Sbjct: 529 PNERPWDATNYTAYQDCNLWKDLASDYVLQVWRTFKLAPNGEDINFLAECWPAAVEALHY 588

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALAC------MVEMSSLLND 128
            + FD    G+  NGG PDQT          AYCG L IAAL         +++S+ L+ 
Sbjct: 589 LKGFDANNDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAIAQQLQLSTGLDT 648

Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
             +Q  FS+ L ++   +  LLW G YY  D+     +  +MADQLCG  + +  G+   
Sbjct: 649 ASEQHTFSSWLEQSRANFDHLLWNGEYYDIDAESG--TPVVMADQLCGDFYARLLGL--P 704

Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
           A+   AN    L  +       FE G +G  NG++ +G   D +     EVWTG+ + ++
Sbjct: 705 AVVSEANSRSTLKAVKEACFEQFEGGKLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIA 764

Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           +     G+ D A      +   VY   GL F TPE +T   T+R+  Y+RA+A++ +
Sbjct: 765 SYYRLMGDTDTALAITAAVVNQVYA-GGLQFRTPEAITAVNTFRACHYLRAMAIWGV 820


>gi|260434763|ref|ZP_05788733.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
 gi|260412637|gb|EEX05933.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
          Length = 832

 Score =  140 bits (352), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 92/297 (30%), Positives = 147/297 (49%), Gaps = 25/297 (8%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
           P E P+   N     D + WKDL   F++ ++R + L    +DIRFL   +P  +E +R 
Sbjct: 525 PNEIPWDATNYTAYQDCNLWKDLGSDFVLQVWRSFKLAPSGEDIRFLADCWPAAVEALRF 584

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALAC------MVEMSSLLND 128
            ++FD    G+  NGG PDQT          AYCG L IAAL         +++S+ L+ 
Sbjct: 585 LKTFDVNNDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAIAQTLQLSTGLDT 644

Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
           + +Q++FS  L ++   +  LLW G YY  D+     +  +MADQLCG  + +  G+   
Sbjct: 645 SAEQREFSGWLEQSRGNFDKLLWNGEYYDIDAESG--TPVVMADQLCGDFYARLLGL--P 700

Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
            +   +N    L  +       F+ GS+G  NG++ +G   D +     EVWTG+ + ++
Sbjct: 701 PVVSDSNSRSTLKVVKEACFEAFDGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIA 760

Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           +     G+   A      +   VY   GL F TPE +T   T+R+  Y+RA+A++  
Sbjct: 761 SYYRLMGDKQTAQAICSAVVEQVYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWGF 816


>gi|124023625|ref|YP_001017932.1| bile acid beta-glucosidase [Prochlorococcus marinus str. MIT 9303]
 gi|123963911|gb|ABM78667.1| Predicted bile acid beta-glucosidase [Prochlorococcus marinus str.
           MIT 9303]
          Length = 837

 Score =  140 bits (352), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 99/312 (31%), Positives = 149/312 (47%), Gaps = 29/312 (9%)

Query: 13  RSAANTHERYYTRWFPEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIR 69
           R+AA  H+       P E PF   N     D + WKDL   +++ ++R + L    +D+ 
Sbjct: 515 RAAATPHDLGA----PNESPFDATNYTAYQDCNLWKDLASDYVLQVWRTFLLSPNGEDLS 570

Query: 70  FLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM 119
           FL   +P  ++ +   + FD    G+  NGG PDQT          AYCG L IAAL   
Sbjct: 571 FLAECWPAAVQALSYLKRFDVNHDGLPDNGGAPDQTFDDWPLQGVSAYCGALWIAALEAA 630

Query: 120 ------VEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQ 173
                 +++   LN  ++Q QFS  L ++   +  LLW G YYK D+     +  +MADQ
Sbjct: 631 LAMAQRLQLDLGLNTAEEQHQFSGWLEQSRANFDRLLWNGEYYKIDAESG--TPVVMADQ 688

Query: 174 LCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSA 232
           LCG  + +  G+ +  +        +L  +      GFE G +G  NG+  +G   D   
Sbjct: 689 LCGDFYARLLGLPS--VVADERSRSSLNAVKEACFEGFEGGRLGVANGLCRDGMPLDPKG 746

Query: 233 IQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRS 292
               EVWTG+ + L+A     G+   A      +   VY   GL F TPE +T  KTYR+
Sbjct: 747 THPLEVWTGINFGLAAYYRLMGDATTATAICSAVVNQVYG-GGLQFRTPEAITAVKTYRA 805

Query: 293 GGYMRALAVYAM 304
             Y+RA+A++A+
Sbjct: 806 CHYLRAMAIWAL 817


>gi|33862646|ref|NP_894206.1| hypothetical protein PMT0373 [Prochlorococcus marinus str. MIT
           9313]
 gi|33634562|emb|CAE20548.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9313]
          Length = 837

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 99/312 (31%), Positives = 149/312 (47%), Gaps = 29/312 (9%)

Query: 13  RSAANTHERYYTRWFPEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIR 69
           R+AA  H+       P E PF   N     D + WKDL   +++ ++R + L    +D+ 
Sbjct: 515 RAAATPHDLGA----PNESPFDATNYTAYQDCNLWKDLASDYVLQVWRTFLLSPNGEDLS 570

Query: 70  FLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM 119
           FL   +P  ++ +   + FD    G+  NGG PDQT          AYCG L IAAL   
Sbjct: 571 FLAECWPAVVQALSYLKRFDVNHDGLPDNGGAPDQTFDDWPLQGVSAYCGALWIAALEAA 630

Query: 120 ------VEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQ 173
                 +++   LN  ++Q QFS  L ++   +  LLW G YYK D+     +  +MADQ
Sbjct: 631 LAMAQRLQLDLGLNTAEEQHQFSGWLEQSRVNFDRLLWNGEYYKIDAESG--TPVVMADQ 688

Query: 174 LCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSA 232
           LCG  + +  G+ +  +        +L  +      GFE G +G  NG+  +G   D   
Sbjct: 689 LCGDFYARLLGLPS--VVADERSRSSLNAVKEACFEGFEGGRLGVANGLCRDGMPLDPKG 746

Query: 233 IQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRS 292
               EVWTG+ + L+A     G+   A      +   VY   GL F TPE +T  KTYR+
Sbjct: 747 THPLEVWTGINFGLAAYYRLMGDATTATAICSAVVNQVYG-GGLQFRTPEAITAVKTYRA 805

Query: 293 GGYMRALAVYAM 304
             Y+RA+A++A+
Sbjct: 806 CHYLRAMAIWAL 817


>gi|443318654|ref|ZP_21047901.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 6406]
 gi|442781756|gb|ELR91849.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 6406]
          Length = 857

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 98/325 (30%), Positives = 147/325 (45%), Gaps = 53/325 (16%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E PF   N  +  D + WKDL   F++ +YR Y +    D  FL   +P   E ++  
Sbjct: 523 PNEHPFVKTNYTSYQDCNQWKDLPSDFVIQVYRAYKMTGATDGDFLADCWPAVTETLKYL 582

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL--------N 127
           + FD    GI +NGG PDQT          AYCGGL +AAL   + ++  L        N
Sbjct: 583 KRFDTDSDGIPENGGAPDQTFDDWQLKGLSAYCGGLWMAALESAIAIADTLTAAGRAPGN 642

Query: 128 DTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT 187
                 Q+   L ++ +VYH  LW G YY+ ++     S  +MADQLCG    +   +  
Sbjct: 643 TPILVTQYRRWLEQSRKVYHKRLWNGRYYRLETGSD--SSVVMADQLCGQFCARL--MEL 698

Query: 188 EAIFPSANIARALATIF-----------STNVR-------GFEAGS---------MGAVN 220
             +     +  ALA I+           +T  R       G + G+         +GA N
Sbjct: 699 PDLVEEKYVQSALAAIYDACFLKFNNYAATQARPQEQKFIGSQTGTFRSAQPGIPIGAAN 758

Query: 221 GMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFE 279
           G+ P+G   D       EVWTG+ + L+      G VD+A+     + + +Y   GL F 
Sbjct: 759 GVLPDGSPEDPDGTHQLEVWTGINFGLATFFAQMGQVDKAFAITEAVVQQIYTH-GLQFR 817

Query: 280 TPEGLTGDKTYRSGGYMRALAVYAM 304
           TPE +T   T+R+  Y+R LA++ +
Sbjct: 818 TPEAITALGTFRACHYLRPLAIWGL 842


>gi|116072378|ref|ZP_01469645.1| hypothetical protein BL107_11341 [Synechococcus sp. BL107]
 gi|116064900|gb|EAU70659.1| hypothetical protein BL107_11341 [Synechococcus sp. BL107]
          Length = 820

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 92/297 (30%), Positives = 147/297 (49%), Gaps = 25/297 (8%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
           P E PF   N     D + WKDL   F++ ++R + L    +D+ FL   +P+ ++ +  
Sbjct: 513 PNEVPFDATNYTAYQDCNLWKDLASDFVLQVWRTFQLAPTGEDLNFLAECWPSAVQALDY 572

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM------VEMSSLLND 128
            + FD    G+  NGG PDQT          AYCG L IAAL         +++   L+ 
Sbjct: 573 LKGFDVNDDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAMAQRLQLDLGLDT 632

Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
           T +Q++F   L ++   +  LLW G YY+ D+     +  +MADQLCG  + +  G+   
Sbjct: 633 TVEQRRFGQWLEQSRSNFDRLLWNGEYYEIDAESG--TPVVMADQLCGDFYARLLGL--P 688

Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
           ++   AN    L  +       FEAG++G  NG++ +G   D +     EVWTG+ + ++
Sbjct: 689 SVVSDANALSTLNVVREACFERFEAGTLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIA 748

Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           +     G  D A    G +   VY   GL F TPE +T   T+R+  Y+RA+A++ +
Sbjct: 749 SYFRLMGQSDTALAITGAVVNQVYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 804


>gi|116073260|ref|ZP_01470522.1| hypothetical protein RS9916_32457 [Synechococcus sp. RS9916]
 gi|116068565|gb|EAU74317.1| hypothetical protein RS9916_32457 [Synechococcus sp. RS9916]
          Length = 832

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 93/299 (31%), Positives = 153/299 (51%), Gaps = 29/299 (9%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
           P E P+   N     D + WKDL   +++ ++R + L    +D+RFL   +P+ +E +R 
Sbjct: 525 PNELPWDATNYTAYQDCNLWKDLGSDYVLQVWRTFKLSPSGEDVRFLADCWPSAVEALRY 584

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALAC------MVEMSSLLND 128
            ++FD    G+  NGG PDQT          AYCG L IAAL         +++ + L+ 
Sbjct: 585 LKTFDVNDDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAIAQTLQLKTGLDT 644

Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
           + +Q++FS  L ++   +  LLW G YY  D+     +  +MADQLCG  + +  G+  E
Sbjct: 645 SAEQREFSGWLEQSRGNFDRLLWNGEYYDIDAESG--TPVVMADQLCGDFYARLLGL--E 700

Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
            +   AN    L  +       F+ G +G  NG++ +G   D +     EVWTG+ + ++
Sbjct: 701 PVVSEANSRSTLKAVREACFEKFDGGKLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIA 760

Query: 248 A--AMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           +   ++ EG   EA  +A  +   VY   GL F TPE +T   T+R+  Y+RA+A++ +
Sbjct: 761 SYYRLMGEGQTAEAICSA--VVNQVYA-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 816


>gi|188570649|gb|ACD64360.1| hypothetical protein [Helianthus petiolaris]
          Length = 210

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 92/208 (44%), Positives = 121/208 (58%), Gaps = 4/208 (1%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
           +AYCG L +AAL     M+  LND    Q++  K  KA   +   LW GSY+ +DS  S 
Sbjct: 4   SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 63

Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
           +S SI ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM P
Sbjct: 64  NSKSIQADQLAGQWYTASAGLPN--LFDDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121

Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
           NG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G++   +   G G  F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181

Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
           G T D +YRS  YMR LA++ MQ A  K
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEK 209


>gi|188570647|gb|ACD64359.1| hypothetical protein [Helianthus petiolaris]
          Length = 210

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 92/208 (44%), Positives = 121/208 (58%), Gaps = 4/208 (1%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
           +AYCG L +AAL     M+  LND    Q++  K  KA   +   LW GSY+ +DS  S 
Sbjct: 4   SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 63

Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
           +S SI ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM P
Sbjct: 64  NSKSIQADQLAGQWYTASAGLPN--LFGDDKIPSSLQKIFDFNVMKVRGGRMGAVNGMHP 121

Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
           NG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G++   +   G G  F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181

Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
           G T D +YRS  YMR LA++ MQ A  K
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEK 209


>gi|188570531|gb|ACD64301.1| hypothetical protein [Helianthus annuus]
          Length = 210

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 92/208 (44%), Positives = 122/208 (58%), Gaps = 4/208 (1%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
           +AYCG L +AAL     M+  LND    Q++  K  KA   +   LW GSY+ +DS  S 
Sbjct: 4   SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 63

Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
           +S SI ADQL G  +  ++G+    +F    I  +L  IF+ NV     G MGAVNGM P
Sbjct: 64  NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFNFNVMKVRGGRMGAVNGMHP 121

Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
           NG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G++   +   G G  F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181

Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
           G T D +YRS  YMR LA++ MQ A  K
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEK 209


>gi|188570631|gb|ACD64351.1| hypothetical protein [Helianthus petiolaris]
          Length = 210

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 92/208 (44%), Positives = 121/208 (58%), Gaps = 4/208 (1%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
           +AYCG L +AAL     M+  LND    Q++  K  KA   +   LW GSY+ +DS  S 
Sbjct: 4   SAYCGCLWLAALQAAAAMAVELNDKLAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 63

Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
           +S SI ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM P
Sbjct: 64  NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121

Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
           NG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G++   +   G G  F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181

Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
           G T D +YRS  YMR LA++ MQ A  K
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEK 209


>gi|87301122|ref|ZP_01083963.1| hypothetical protein WH5701_14591 [Synechococcus sp. WH 5701]
 gi|87284090|gb|EAQ76043.1| hypothetical protein WH5701_14591 [Synechococcus sp. WH 5701]
          Length = 830

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 94/297 (31%), Positives = 143/297 (48%), Gaps = 25/297 (8%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
           P E PF   N     D + WKDL   F++ ++R ++L    +D+RFL   +P  +E +R 
Sbjct: 522 PNERPFDATNYTAYQDCNLWKDLASDFVLQVWRTFSLAPTGEDLRFLADCWPAVVEALRY 581

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM------VEMSSLLND 128
            + FD    G+  NGG PDQT          AYCG L IAAL         +++   L+ 
Sbjct: 582 LKQFDVNDDGLPDNGGAPDQTFDDWPLQGVSAYCGALWIAALEAALAMAQRLQLDLGLDT 641

Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
             +Q++F + L  +   +  LLW G YY+ D+     +  +MADQLCG  + +  G+   
Sbjct: 642 GAEQREFGSWLEPSRANFDRLLWNGEYYRIDAGSG--TPVVMADQLCGDFYARLLGLPQ- 698

Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
            +        +L  I       FE G +G  NG++ +G   D       EVWTG+ + L+
Sbjct: 699 -VVSDERALSSLKAIREACFERFEGGRLGVANGLRRDGTPLDPKGTHPLEVWTGINFGLA 757

Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           A     G  D A      +   VY   GL F TPE +T   T+R+  Y+RA+A++A+
Sbjct: 758 AYYRLMGQTDTALAITSAVVGQVYA-GGLQFRTPEAITAVGTFRACHYLRAMAIWAL 813


>gi|292384326|gb|ADE21416.1| unknown [Helianthus neglectus]
          Length = 210

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 92/208 (44%), Positives = 121/208 (58%), Gaps = 4/208 (1%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
           +AYCG L +AAL     M+  LND    Q++  K  KA   +   LW GSY+ +DS  S 
Sbjct: 4   SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 63

Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
           +S SI ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM P
Sbjct: 64  NSKSIQADQLAGQWYTASAGLPN--LFGEDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121

Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
           NG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G++   +   G G  F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181

Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
           G T D +YRS  YMR LA++ MQ A  K
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEK 209


>gi|292384308|gb|ADE21407.1| unknown [Helianthus neglectus]
          Length = 205

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 91/205 (44%), Positives = 120/205 (58%), Gaps = 4/205 (1%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
           +AYCG L +AAL     M+  LND    Q++  K  KA   +   LW GSY+ +DS  S 
Sbjct: 1   SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFMKAKSSFESKLWNGSYFNYDSGSSS 60

Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
           +S SI ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM P
Sbjct: 61  NSKSIQADQLAGQWYTASAGLPN--LFDDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 118

Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
           NG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G++   +   G G  F+TPE
Sbjct: 119 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 178

Query: 283 GLTGDKTYRSGGYMRALAVYAMQDA 307
           G T D +YRS  YMR LA++ MQ A
Sbjct: 179 GWTMDGSYRSLVYMRPLAIWGMQQA 203


>gi|292384322|gb|ADE21414.1| unknown [Helianthus neglectus]
          Length = 207

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 92/208 (44%), Positives = 121/208 (58%), Gaps = 4/208 (1%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
           +AYCG L +AAL     M+  LND    Q++  K  KA   +   LW GSY+ +DS  S 
Sbjct: 1   SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 60

Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
           +S SI ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM P
Sbjct: 61  NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 118

Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
           NG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G++   +   G G  F+TPE
Sbjct: 119 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 178

Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
           G T D +YRS  YMR LA++ MQ A  K
Sbjct: 179 GWTMDGSYRSLVYMRPLAIWGMQQALEK 206


>gi|188570553|gb|ACD64312.1| hypothetical protein [Helianthus annuus]
          Length = 210

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 92/208 (44%), Positives = 121/208 (58%), Gaps = 4/208 (1%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
           +AYCG L +AAL     M+  LND    Q++  K  KA   +   LW GSY+ +DS  S 
Sbjct: 4   SAYCGCLWLAALQAATAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 63

Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
           +S SI ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM P
Sbjct: 64  NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121

Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
           NG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G++   +   G G  F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181

Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
           G T D +YRS  YMR LA++ MQ A  K
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEK 209


>gi|188570503|gb|ACD64287.1| hypothetical protein [Helianthus annuus]
 gi|188570505|gb|ACD64288.1| hypothetical protein [Helianthus annuus]
 gi|188570507|gb|ACD64289.1| hypothetical protein [Helianthus annuus]
 gi|188570509|gb|ACD64290.1| hypothetical protein [Helianthus annuus]
 gi|188570513|gb|ACD64292.1| hypothetical protein [Helianthus annuus]
 gi|188570515|gb|ACD64293.1| hypothetical protein [Helianthus annuus]
 gi|188570517|gb|ACD64294.1| hypothetical protein [Helianthus annuus]
 gi|188570519|gb|ACD64295.1| hypothetical protein [Helianthus annuus]
 gi|188570521|gb|ACD64296.1| hypothetical protein [Helianthus annuus]
 gi|188570523|gb|ACD64297.1| hypothetical protein [Helianthus annuus]
 gi|188570525|gb|ACD64298.1| hypothetical protein [Helianthus annuus]
 gi|188570527|gb|ACD64299.1| hypothetical protein [Helianthus annuus]
 gi|188570529|gb|ACD64300.1| hypothetical protein [Helianthus annuus]
 gi|188570533|gb|ACD64302.1| hypothetical protein [Helianthus annuus]
 gi|188570535|gb|ACD64303.1| hypothetical protein [Helianthus annuus]
 gi|188570537|gb|ACD64304.1| hypothetical protein [Helianthus annuus]
 gi|188570539|gb|ACD64305.1| hypothetical protein [Helianthus annuus]
 gi|188570541|gb|ACD64306.1| hypothetical protein [Helianthus annuus]
 gi|188570543|gb|ACD64307.1| hypothetical protein [Helianthus annuus]
 gi|188570545|gb|ACD64308.1| hypothetical protein [Helianthus annuus]
 gi|188570547|gb|ACD64309.1| hypothetical protein [Helianthus annuus]
 gi|188570549|gb|ACD64310.1| hypothetical protein [Helianthus annuus]
 gi|188570551|gb|ACD64311.1| hypothetical protein [Helianthus annuus]
 gi|188570555|gb|ACD64313.1| hypothetical protein [Helianthus annuus]
 gi|188570557|gb|ACD64314.1| hypothetical protein [Helianthus annuus]
 gi|188570559|gb|ACD64315.1| hypothetical protein [Helianthus annuus]
 gi|188570561|gb|ACD64316.1| hypothetical protein [Helianthus annuus]
 gi|188570563|gb|ACD64317.1| hypothetical protein [Helianthus annuus]
 gi|188570565|gb|ACD64318.1| hypothetical protein [Helianthus annuus]
 gi|188570567|gb|ACD64319.1| hypothetical protein [Helianthus annuus]
 gi|188570569|gb|ACD64320.1| hypothetical protein [Helianthus annuus]
 gi|188570571|gb|ACD64321.1| hypothetical protein [Helianthus annuus]
 gi|188570573|gb|ACD64322.1| hypothetical protein [Helianthus annuus]
 gi|188570575|gb|ACD64323.1| hypothetical protein [Helianthus annuus]
 gi|188570577|gb|ACD64324.1| hypothetical protein [Helianthus annuus]
 gi|188570579|gb|ACD64325.1| hypothetical protein [Helianthus annuus]
 gi|188570581|gb|ACD64326.1| hypothetical protein [Helianthus annuus]
 gi|188570583|gb|ACD64327.1| hypothetical protein [Helianthus annuus]
 gi|188570585|gb|ACD64328.1| hypothetical protein [Helianthus annuus]
 gi|188570587|gb|ACD64329.1| hypothetical protein [Helianthus annuus]
 gi|188570589|gb|ACD64330.1| hypothetical protein [Helianthus annuus]
 gi|188570591|gb|ACD64331.1| hypothetical protein [Helianthus annuus]
 gi|188570593|gb|ACD64332.1| hypothetical protein [Helianthus annuus]
 gi|188570595|gb|ACD64333.1| hypothetical protein [Helianthus annuus]
 gi|188570597|gb|ACD64334.1| hypothetical protein [Helianthus annuus]
 gi|188570599|gb|ACD64335.1| hypothetical protein [Helianthus annuus]
 gi|188570601|gb|ACD64336.1| hypothetical protein [Helianthus annuus]
 gi|188570603|gb|ACD64337.1| hypothetical protein [Helianthus petiolaris]
 gi|188570605|gb|ACD64338.1| hypothetical protein [Helianthus petiolaris]
 gi|188570607|gb|ACD64339.1| hypothetical protein [Helianthus petiolaris]
 gi|188570609|gb|ACD64340.1| hypothetical protein [Helianthus petiolaris]
 gi|188570611|gb|ACD64341.1| hypothetical protein [Helianthus petiolaris]
 gi|188570613|gb|ACD64342.1| hypothetical protein [Helianthus petiolaris]
 gi|188570615|gb|ACD64343.1| hypothetical protein [Helianthus petiolaris]
 gi|188570617|gb|ACD64344.1| hypothetical protein [Helianthus petiolaris]
 gi|188570619|gb|ACD64345.1| hypothetical protein [Helianthus petiolaris]
 gi|188570625|gb|ACD64348.1| hypothetical protein [Helianthus petiolaris]
 gi|188570627|gb|ACD64349.1| hypothetical protein [Helianthus petiolaris]
 gi|188570629|gb|ACD64350.1| hypothetical protein [Helianthus petiolaris]
 gi|188570633|gb|ACD64352.1| hypothetical protein [Helianthus petiolaris]
 gi|188570637|gb|ACD64354.1| hypothetical protein [Helianthus petiolaris]
 gi|188570641|gb|ACD64356.1| hypothetical protein [Helianthus petiolaris]
 gi|188570643|gb|ACD64357.1| hypothetical protein [Helianthus petiolaris]
 gi|188570645|gb|ACD64358.1| hypothetical protein [Helianthus petiolaris]
 gi|188570651|gb|ACD64361.1| hypothetical protein [Helianthus petiolaris]
 gi|188570653|gb|ACD64362.1| hypothetical protein [Helianthus petiolaris]
 gi|188570655|gb|ACD64363.1| hypothetical protein [Helianthus petiolaris]
 gi|188570657|gb|ACD64364.1| hypothetical protein [Helianthus petiolaris]
 gi|188570659|gb|ACD64365.1| hypothetical protein [Helianthus petiolaris]
 gi|188570661|gb|ACD64366.1| hypothetical protein [Helianthus petiolaris]
 gi|188570663|gb|ACD64367.1| hypothetical protein [Helianthus petiolaris]
 gi|188570665|gb|ACD64368.1| hypothetical protein [Helianthus petiolaris]
 gi|292384298|gb|ADE21402.1| unknown [Helianthus neglectus]
 gi|292384300|gb|ADE21403.1| unknown [Helianthus neglectus]
 gi|292384302|gb|ADE21404.1| unknown [Helianthus neglectus]
 gi|292384304|gb|ADE21405.1| unknown [Helianthus neglectus]
 gi|292384312|gb|ADE21409.1| unknown [Helianthus neglectus]
 gi|292384318|gb|ADE21412.1| unknown [Helianthus neglectus]
 gi|292384320|gb|ADE21413.1| unknown [Helianthus neglectus]
 gi|292384328|gb|ADE21417.1| unknown [Helianthus neglectus]
          Length = 210

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 92/208 (44%), Positives = 121/208 (58%), Gaps = 4/208 (1%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
           +AYCG L +AAL     M+  LND    Q++  K  KA   +   LW GSY+ +DS  S 
Sbjct: 4   SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 63

Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
           +S SI ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM P
Sbjct: 64  NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121

Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
           NG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G++   +   G G  F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181

Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
           G T D +YRS  YMR LA++ MQ A  K
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEK 209


>gi|188570671|gb|ACD64371.1| hypothetical protein [Bahiopsis reticulata]
 gi|188570673|gb|ACD64372.1| hypothetical protein [Bahiopsis reticulata]
          Length = 210

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 91/205 (44%), Positives = 120/205 (58%), Gaps = 4/205 (1%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
           +AYCG L +AAL     M+  LND    Q++  K  KA   +   LW GSY+ +DS  S 
Sbjct: 4   SAYCGCLWLAALQAAAAMAVELNDKFSAQKYKKKFMKAKTSFESKLWNGSYFNYDSGSSS 63

Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
           +S SI ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM P
Sbjct: 64  NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121

Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
           NG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G++   +   G G  F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDSAFTTAEGIFTAGWSEEGFGYAFQTPE 181

Query: 283 GLTGDKTYRSGGYMRALAVYAMQDA 307
           G T D +YRS  YMR LA++ MQ A
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQA 206


>gi|292384306|gb|ADE21406.1| unknown [Helianthus neglectus]
          Length = 205

 Score =  137 bits (345), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 91/205 (44%), Positives = 120/205 (58%), Gaps = 4/205 (1%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
           +AYCG L +AAL     M+  LND    Q++  K  KA   +   LW GSY+ +DS  S 
Sbjct: 1   SAYCGCLWLAALQAAAAMAVELNDKFTAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 60

Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
           +S SI ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM P
Sbjct: 61  NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 118

Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
           NG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G++   +   G G  F+TPE
Sbjct: 119 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 178

Query: 283 GLTGDKTYRSGGYMRALAVYAMQDA 307
           G T D +YRS  YMR LA++ MQ A
Sbjct: 179 GWTMDGSYRSLVYMRPLAIWGMQQA 203


>gi|292384314|gb|ADE21410.1| unknown [Helianthus neglectus]
          Length = 205

 Score =  137 bits (345), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 91/205 (44%), Positives = 120/205 (58%), Gaps = 4/205 (1%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
           +AYCG L +AAL     M+  LND    Q++  K  KA   +   LW GSY+ +DS  S 
Sbjct: 1   SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 60

Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
           +S SI ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM P
Sbjct: 61  NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 118

Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
           NG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G++   +   G G  F+TPE
Sbjct: 119 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 178

Query: 283 GLTGDKTYRSGGYMRALAVYAMQDA 307
           G T D +YRS  YMR LA++ MQ A
Sbjct: 179 GWTMDGSYRSLVYMRPLAIWGMQQA 203


>gi|282897193|ref|ZP_06305195.1| Protein of unknown function DUF608 [Raphidiopsis brookii D9]
 gi|281197845|gb|EFA72739.1| Protein of unknown function DUF608 [Raphidiopsis brookii D9]
          Length = 798

 Score =  137 bits (344), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 97/297 (32%), Positives = 144/297 (48%), Gaps = 25/297 (8%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
           P E  +   N  +  D + WKDL   F++ +YRD+      DI FL   +   +  +   
Sbjct: 502 PNEWVWEQTNYTSYQDCNLWKDLGCDFVLQVYRDFLFTGGNDIEFLVDCWYGIVLTLDYL 561

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA-----CMVEMS--SLLND 128
           + FD    GI +N G PDQT          AYCG L +AAL      C V M+   L N 
Sbjct: 562 KQFDIDADGIPENSGAPDQTFDDWRLSGVSAYCGALWLAALEAAIAICDVLMNHPELSNV 621

Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
            +Q+  +   L ++  VY   LW G +Y+ DS     S+ +MADQL G  + +   +N  
Sbjct: 622 GEQRSIYENWLNQSLPVYQQKLWNGKFYRLDSESG--SNVVMADQLSGQFYARL--LNLP 677

Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYLLS 247
            I P      AL TI+      F+ G  GA NG+ P+G  +   A    EVWTG+ + L+
Sbjct: 678 DIVPQDRALSALTTIYDACFLKFQDGKFGAANGVLPDGSPENPQATHPLEVWTGINFGLA 737

Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           A +L  G  D+       + R +Y+  GL F TPE +T   T+R+  Y+R +A++A+
Sbjct: 738 AFLLQMGMKDQGLRLTQAVVRQIYDH-GLQFRTPEAITAAGTFRASTYLRPMAIWAI 793


>gi|188570667|gb|ACD64369.1| hypothetical protein [Bahiopsis lanata]
 gi|188570669|gb|ACD64370.1| hypothetical protein [Bahiopsis lanata]
          Length = 208

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 91/205 (44%), Positives = 120/205 (58%), Gaps = 4/205 (1%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
           +AYCG L +AAL     M+  LND    Q++  K  KA   +   LW GSY+ +DS  S 
Sbjct: 2   SAYCGCLWLAALQAAAAMAVALNDKFSAQKYKKKFMKAKTSFESKLWNGSYFNYDSGSSS 61

Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
           +S SI ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM P
Sbjct: 62  NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLHKIFDFNVMKVRGGRMGAVNGMHP 119

Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
           NG  D + +QS EVWTGVTY ++A M++ G  D+A+TTA G++   +   G G  F+TPE
Sbjct: 120 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDKAFTTAEGIFTAGWSEEGFGYAFQTPE 179

Query: 283 GLTGDKTYRSGGYMRALAVYAMQDA 307
           G T D  YRS  YMR LA++ MQ A
Sbjct: 180 GWTMDGCYRSLVYMRPLAIWGMQQA 204


>gi|78185056|ref|YP_377491.1| hypothetical protein Syncc9902_1489 [Synechococcus sp. CC9902]
 gi|78169350|gb|ABB26447.1| conserved hypothetical protein [Synechococcus sp. CC9902]
          Length = 823

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 91/297 (30%), Positives = 145/297 (48%), Gaps = 25/297 (8%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
           P E PF   N     D + WKDL   F++ ++R + L    +D+ FL   +P+ +  +  
Sbjct: 516 PNELPFDATNYTAYQDCNLWKDLASDFVLQVWRTFQLAPTGEDLNFLADCWPSAVRALDY 575

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM------VEMSSLLND 128
            + FD    G+  NGG PDQT          AYCG L IAAL         +++   L+ 
Sbjct: 576 LKGFDVNDDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAMAQRLQLDLGLDT 635

Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
           T +Q++F   L ++   +  LLW G YY+ D+     +  +MADQLCG  + +  G+   
Sbjct: 636 TVEQRRFGQWLEQSRSNFDRLLWNGEYYEIDAESG--TPVVMADQLCGDFYARLLGL--P 691

Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
           ++   AN    L  +       FE G++G  NG++ +G   D +     EVWTG+ + ++
Sbjct: 692 SVVSDANALSTLNVVRDACFERFEGGTLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIA 751

Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           +     G  D A    G +   VY   GL F TPE +T   T+R+  Y+RA+A++ +
Sbjct: 752 SYFRLMGQSDTALAITGAVVNQVYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 807


>gi|292384310|gb|ADE21408.1| unknown [Helianthus neglectus]
          Length = 210

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 92/208 (44%), Positives = 121/208 (58%), Gaps = 4/208 (1%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
           +AYCG L +AAL     M+  LND    Q++  K  KA   +   LW GSY+ +DS  S 
Sbjct: 4   SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFFKAKSSFESKLWNGSYFNYDSGSSS 63

Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
           +S SI ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM P
Sbjct: 64  NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121

Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
           NG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G++   +   G G  F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181

Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
           G T D +YRS  YMR LA++ MQ A  K
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEK 209


>gi|188570511|gb|ACD64291.1| hypothetical protein [Helianthus annuus]
          Length = 210

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 91/208 (43%), Positives = 121/208 (58%), Gaps = 4/208 (1%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
           +AYCG L +AAL     M+  LND    Q++  K  KA   +   LW GSY+ +DS  S 
Sbjct: 4   SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 63

Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
           +S SI ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM P
Sbjct: 64  NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121

Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
           NG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G++   +   G G  F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181

Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
           G T D +YRS  YMR LA++ +Q A  K
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGIQQALEK 209


>gi|282900195|ref|ZP_06308151.1| protein of unknown function DUF608 [Cylindrospermopsis raciborskii
           CS-505]
 gi|281194944|gb|EFA69885.1| protein of unknown function DUF608 [Cylindrospermopsis raciborskii
           CS-505]
          Length = 789

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 95/297 (31%), Positives = 142/297 (47%), Gaps = 25/297 (8%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
           P E  +   N  +  D + WKDL   F++ +YRD+      DI FL   +   +  +   
Sbjct: 493 PNEWVWEQTNYTSYQDCNLWKDLGCDFVLQVYRDFLFTGANDIEFLVDCWHGMVLTLDYL 552

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA-------CMVEMSSLLND 128
           + FD  G GI +N G PDQT          AYCG L +AAL         ++    L N 
Sbjct: 553 KQFDMDGDGIPENSGAPDQTFDDWRLSGVSAYCGALWLAALEAAIAICDVLINRPELPNV 612

Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
            +Q+  +   L ++  VY   LW G +Y+ DS     S+ +MADQL G  + +   +N  
Sbjct: 613 GEQKSIYEHWLNQSLPVYQQKLWNGKFYRLDSESG--SNVVMADQLSGQFYARL--LNLP 668

Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYLLS 247
            I P      AL TI+      F+ G  GA NG+  +G  +   A    EVWTG+ + L+
Sbjct: 669 DIVPKDRALSALTTIYDACFLKFQDGKFGAANGVLLDGSPENPQATHPLEVWTGINFGLA 728

Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           A +L  G  ++       + R VYE  GL F TPE +T   T+R+  Y+R +A++A+
Sbjct: 729 AFLLQMGMKEQGLRLTEAVVRQVYEH-GLQFRTPEAITAAGTFRASTYLRPMAIWAV 784


>gi|292384316|gb|ADE21411.1| unknown [Helianthus neglectus]
          Length = 205

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 91/205 (44%), Positives = 120/205 (58%), Gaps = 4/205 (1%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
           +AYCG L +AAL     M+  LND    Q++  K  KA   +   LW GSY+ +DS  S 
Sbjct: 1   SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFFKAKSSFESKLWNGSYFNYDSGSSS 60

Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
           +S SI ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM P
Sbjct: 61  NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 118

Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
           NG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G++   +   G G  F+TPE
Sbjct: 119 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 178

Query: 283 GLTGDKTYRSGGYMRALAVYAMQDA 307
           G T D +YRS  YMR LA++ MQ A
Sbjct: 179 GWTMDGSYRSLVYMRPLAIWGMQQA 203


>gi|33866127|ref|NP_897686.1| hypothetical protein SYNW1593 [Synechococcus sp. WH 8102]
 gi|33639102|emb|CAE08108.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
          Length = 833

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 93/300 (31%), Positives = 149/300 (49%), Gaps = 31/300 (10%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
           P E PF   N     D + WKDL   +++ ++R + L    +D+RFL   +P+ ++ +  
Sbjct: 526 PNERPFDATNYTAYQDCNLWKDLASDYMLQVWRTFRLAPTGEDLRFLAECWPSAVQALDY 585

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM------VEMSSLLND 128
            + FD    G+  NGG PDQT          AYCG L IAAL         +++   L+ 
Sbjct: 586 LKGFDVNNDGLPDNGGAPDQTFDDWPLQGVSAYCGALWIAALEAALAMAQRLQLDLGLDT 645

Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
             +Q++F A L ++   + DLLW G YY+ D+     +  +MADQLCG  + +  G+   
Sbjct: 646 AAEQRRFGAWLEQSRANFDDLLWNGEYYRIDAESG--TPVVMADQLCGDFYARLLGLP-- 701

Query: 189 AIFPSANIARALATIFSTN---VRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTY 244
              P  + ARA +T+ +        FE+G +G  NG++ +G   D +     EVWTG+ +
Sbjct: 702 ---PVVSDARARSTLQAVKEACFERFESGRLGVANGLRRDGSPLDPNGTHPLEVWTGINF 758

Query: 245 LLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
            L++     G    A      +   VY   GL F TPE +T   T+R+  Y+RA+A++ +
Sbjct: 759 GLASYYRLMGESKTALAICSAVVGQVYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 817


>gi|188570621|gb|ACD64346.1| hypothetical protein [Helianthus petiolaris]
 gi|188570623|gb|ACD64347.1| hypothetical protein [Helianthus petiolaris]
          Length = 210

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 91/208 (43%), Positives = 120/208 (57%), Gaps = 4/208 (1%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
           +AYCG L +AAL     M+  LND    Q++  K  KA   +   LW GSY+ +DS  S 
Sbjct: 4   SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 63

Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
           +S SI ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM P
Sbjct: 64  NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121

Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
           NG  D + +QS EVWTGVTY ++A M++    D A+TTA G++   +   G G  F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAEMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181

Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
           G T D +YRS  YMR LA++ MQ A  K
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEK 209


>gi|292384324|gb|ADE21415.1| unknown [Helianthus neglectus]
          Length = 210

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 83/187 (44%), Positives = 109/187 (58%), Gaps = 4/187 (2%)

Query: 126 LNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV 185
           LND    Q++  K  KA   +   LW GSY+ +DS  S +S SI ADQL G  +  ++G+
Sbjct: 25  LNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASAGL 84

Query: 186 NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYL 245
               +F    I  +L  IF  NV     G MGAVNGM PNG  D + +QS EVWTGVTY 
Sbjct: 85  PN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYG 142

Query: 246 LSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYA 303
           ++A M++ G  D A+TTA G++   +   G G  F+TPEG T D +YRS  YMR LA++ 
Sbjct: 143 VAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWG 202

Query: 304 MQDAYLK 310
           MQ A  K
Sbjct: 203 MQQALEK 209


>gi|188570635|gb|ACD64353.1| hypothetical protein [Helianthus petiolaris]
 gi|188570639|gb|ACD64355.1| hypothetical protein [Helianthus petiolaris]
          Length = 211

 Score =  134 bits (336), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 92/209 (44%), Positives = 121/209 (57%), Gaps = 5/209 (2%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDT-QQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQS 163
           +AYCG L +AAL     M+  LND     Q++  K  KA   +   LW GSY+ +DS  S
Sbjct: 4   SAYCGCLWLAALQAAAAMAVELNDKFAAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSS 63

Query: 164 RHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMK 223
            +S SI ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM 
Sbjct: 64  SNSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 121

Query: 224 PNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETP 281
           PNG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G++   +   G G  F+TP
Sbjct: 122 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 181

Query: 282 EGLTGDKTYRSGGYMRALAVYAMQDAYLK 310
           EG T D +YRS  YMR LA++ MQ A  K
Sbjct: 182 EGWTMDGSYRSLVYMRPLAIWGMQQALEK 210


>gi|148240031|ref|YP_001225418.1| hypothetical protein SynWH7803_1695 [Synechococcus sp. WH 7803]
 gi|147848570|emb|CAK24121.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
          Length = 833

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 94/299 (31%), Positives = 147/299 (49%), Gaps = 29/299 (9%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
           P E PF   N     D + WKDL   F++ ++R + L    +D++FL   +P  ++ +  
Sbjct: 526 PNEVPFDATNYTAYQDCNLWKDLASDFVLQVWRTFKLAPTGEDLKFLADCWPAAVQALHY 585

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM------VEMSSLLND 128
            + FD    G+  NGG PDQT          AYCG L IAAL         ++    L+ 
Sbjct: 586 LKRFDVNNDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAMAQRLQQELGLDT 645

Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
              Q  FSA L ++   +  LLW G YY  D+     +  +MADQLCG  + +  G+  E
Sbjct: 646 GSDQHTFSAWLEQSRSNFDRLLWNGEYYDIDAESG--TPVVMADQLCGDFYARLLGL--E 701

Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
            +   AN    L  +  +    F+ GS+G  NG++ +G   D +     EVWTG+ + ++
Sbjct: 702 PVVSEANSRSTLKAVKESCFEKFQGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIA 761

Query: 248 A--AMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           +   ++ EG   EA  +A  +   VY   GL F TPE +T   T+R+  Y+RA+A++ +
Sbjct: 762 SYYRLMGEGQTAEAICSA--VVDQVYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 817


>gi|288901049|gb|ADC68237.1| hypothetical protein [Dendrobium nobile]
          Length = 207

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 79/203 (38%), Positives = 106/203 (52%), Gaps = 12/203 (5%)

Query: 67  DIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAAL 116
           D+ F   V+P     +     FD+ G G+++N GFPDQT          AYCG L +A+L
Sbjct: 6   DMAFGREVWPAVCAAMDYMNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLASL 65

Query: 117 ACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCG 176
                M+  L D    ++ + K   A   +   LW GSY+ +DS  S +S SI ADQL G
Sbjct: 66  QAAAAMAHRLGDHAYAEKCTIKFLTAKHAFESKLWNGSYFNYDSGNSSNSKSIQADQLAG 125

Query: 177 YLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSE 236
             +  +SG+    +F    I   L  IF  NV     G +GAVNGM PNG  D S +QS 
Sbjct: 126 QWYAASSGL--PPLFDENKIRSTLQKIFDFNVMKVRGGRIGAVNGMHPNGKVDESCMQSR 183

Query: 237 EVWTGVTYLLSAAMLYEGNVDEA 259
           E+WTGVTY LSA M+  G  ++A
Sbjct: 184 EIWTGVTYALSATMILAGMEEQA 206


>gi|47212590|emb|CAG12815.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 777

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 67/140 (47%), Positives = 85/140 (60%), Gaps = 2/140 (1%)

Query: 168 SIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 227
           S+ A   CG+   +     T   FP   I  AL +IF  NV  F  G MGAVNGM+P G 
Sbjct: 638 SVPATGFCGHPVWETE--TTRLAFPKEKIQTALKSIFDLNVMSFAGGQMGAVNGMRPEGV 695

Query: 228 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGD 287
            DRS++QS+EVW GV Y L+A M++EG ++E   TA G YRTV+ER G+ F+TPE     
Sbjct: 696 PDRSSVQSDEVWIGVVYGLAATMIHEGMLEEGMRTAEGCYRTVWERLGMAFQTPEAYCEK 755

Query: 288 KTYRSGGYMRALAVYAMQDA 307
             YRS  YMR L ++AMQ A
Sbjct: 756 GIYRSLAYMRPLCIWAMQLA 775



 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 55/79 (69%), Gaps = 2/79 (2%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCE- 86
           PE+EP+  +N Y + D + WKDLNLKF++ +YRD+ L +D ++L  ++P C E++ + E 
Sbjct: 535 PEDEPWQRVNAYLIHDTAGWKDLNLKFVLQVYRDFHLTQDAQYLRDMWPIC-EMVTESEL 593

Query: 87  SFDKQGLGIVQNGGFPDQT 105
            FD  G G+++N G+ DQT
Sbjct: 594 QFDLDGDGLIENSGYADQT 612


>gi|88809081|ref|ZP_01124590.1| hypothetical protein WH7805_05296 [Synechococcus sp. WH 7805]
 gi|88787023|gb|EAR18181.1| hypothetical protein WH7805_05296 [Synechococcus sp. WH 7805]
          Length = 849

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 93/299 (31%), Positives = 146/299 (48%), Gaps = 29/299 (9%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
           P E PF   N     D + WKDL   F++ ++R + L    +D+ FL   +P  ++ +  
Sbjct: 541 PNEVPFDATNYTAYQDCNLWKDLASDFVLQVWRTFKLAPTGEDLTFLADCWPAAVQALHY 600

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM------VEMSSLLND 128
            + FD    G+  NGG PDQT          AYCG L IAAL         ++    L+ 
Sbjct: 601 LKRFDVNNDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAMAQRLQQELGLDT 660

Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
             +Q  FS  L ++   +  LLW G YY  D+     +  +MADQLCG  + +  G+  E
Sbjct: 661 GSEQHTFSNWLEQSRANFDALLWNGEYYDIDAESG--TPVVMADQLCGDFYARLLGL--E 716

Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
            +   AN    L  +  +    F+ GS+G  NG++ +G   D +     EVWTG+ + ++
Sbjct: 717 PVVSDANSRSTLKAVKESCFEKFQGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIA 776

Query: 248 A--AMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           +   ++ EG   EA  +A  +   VY   GL F TPE +T   T+R+  Y+RA+A++ +
Sbjct: 777 SYYQLMGEGQTAEAICSA--VVHQVYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 832


>gi|113953483|ref|YP_730032.1| hypothetical protein sync_0818 [Synechococcus sp. CC9311]
 gi|113880834|gb|ABI45792.1| conserved hypothetical protein [Synechococcus sp. CC9311]
          Length = 833

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 91/299 (30%), Positives = 144/299 (48%), Gaps = 29/299 (9%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
           P E PF   N     D + WKDL   +++ ++R + L    +D+ FL   +P  ++ +  
Sbjct: 526 PNEVPFDATNYTAYQDCNLWKDLASDYVLQVWRTFKLAPTGEDLSFLAECWPAAVQALHY 585

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM------VEMSSLLND 128
            + FD    G+  NGG PDQT          AYCG L IAAL         +++   L+ 
Sbjct: 586 LKQFDVNDDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAMAQRLQLELGLDT 645

Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
             +Q  FS  L ++   +  LLW G YY  D+     +  +MADQLCG  + +  G+   
Sbjct: 646 GDEQHTFSGWLEQSRANFDKLLWNGEYYDIDAESG--TPVVMADQLCGDFYARLLGL--P 701

Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
            +   AN    L  +       F  GS+G  NG++ +G   D +     EVWTG+ + ++
Sbjct: 702 PVVSEANSRSTLKAVKEACFDNFAGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIA 761

Query: 248 A--AMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           +   ++ EG   EA  +A  +   VY   GL F TPE +T   T+R+  Y+RA+A++ +
Sbjct: 762 SYYRLMGEGKTAEAICSA--VVTQVYA-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 817


>gi|352096901|ref|ZP_08957615.1| Glucosylceramidase [Synechococcus sp. WH 8016]
 gi|351675485|gb|EHA58646.1| Glucosylceramidase [Synechococcus sp. WH 8016]
          Length = 833

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 89/299 (29%), Positives = 142/299 (47%), Gaps = 29/299 (9%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
           P E PF   N     D + WKDL   +++ ++R + L    +D+ FL   +P  ++ +  
Sbjct: 526 PNEVPFDATNYTAYQDCNLWKDLASDYVLQVWRTFKLAPTGEDLSFLAECWPAAVQALHY 585

Query: 85  CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM------VEMSSLLND 128
            + FD    G+  NGG PDQT          AYCG L IAAL         +++   L+ 
Sbjct: 586 LKQFDVNDDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAMAQRLQLELGLDT 645

Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
             +Q  FS  L ++   +  LLW G YY  D+     +  +MADQLCG  + +   +   
Sbjct: 646 GDEQHTFSGWLEQSRANFDTLLWNGEYYDIDAESG--TPVVMADQLCGDFYARL--LELP 701

Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
            +   AN    L  +       F  G +G  NG++ +G   D +     EVWTG+ + ++
Sbjct: 702 PVVSEANSRSTLKAVKEACFDNFAGGLLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIA 761

Query: 248 A--AMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
           +   ++ EG   EA  +A  +   VY   GL F TPE +T   T+R+  Y+RA+A++ +
Sbjct: 762 SYYRLMGEGPTAEAICSA--VVTQVYA-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 817


>gi|156333782|ref|XP_001619412.1| hypothetical protein NEMVEDRAFT_v1g224207 [Nematostella vectensis]
 gi|156202556|gb|EDO27312.1| predicted protein [Nematostella vectensis]
          Length = 173

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 54/101 (53%), Positives = 75/101 (74%)

Query: 214 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 273
           G+MGAVNG++P+G  D S++Q+EEVWTGVTY ++A+M+ EG VDE + TA G+Y T +ER
Sbjct: 52  GTMGAVNGIRPDGQLDTSSLQAEEVWTGVTYAVAASMIQEGLVDEGFKTASGIYNTCFER 111

Query: 274 TGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 314
            G+ F+TPE +  +  YRS  YMR L+++AMQ A  K K K
Sbjct: 112 LGMNFQTPEAIVANGNYRSLAYMRPLSIWAMQWALEKRKNK 152



 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 33/51 (64%)

Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLE 80
          E P+  +N Y++ D S WKDLNLKF++ +YRDY    D+ +L  ++P   E
Sbjct: 1  EAPWDHVNAYHIHDTSKWKDLNLKFVLQVYRDYVFTNDVYYLQDMWPITKE 51


>gi|295829670|gb|ADG38504.1| AT3G24180-like protein [Capsella grandiflora]
 gi|295829672|gb|ADG38505.1| AT3G24180-like protein [Capsella grandiflora]
 gi|295829674|gb|ADG38506.1| AT3G24180-like protein [Capsella grandiflora]
 gi|295829676|gb|ADG38507.1| AT3G24180-like protein [Capsella grandiflora]
          Length = 190

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 71/192 (36%), Positives = 96/192 (50%), Gaps = 12/192 (6%)

Query: 70  FLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM 119
           F   V+P     +   E FD+    +++N GFPDQT          AYCG L +AAL   
Sbjct: 1   FGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLAALQAA 60

Query: 120 VEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLF 179
             M+  + D    +    K   A       LW GSY+ +DS  S +S SI  DQL G  +
Sbjct: 61  AAMALQIGDKFFAELCKNKFLNAKAALEKKLWNGSYFNYDSGASSNSKSIQTDQLAGQWY 120

Query: 180 LKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVW 239
             +SG+    IF  + I   +  IF  NV   + G MGAVNGM P+G  D + +QS E+W
Sbjct: 121 AASSGL--PPIFEESKIKSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDTCMQSREIW 178

Query: 240 TGVTYLLSAAML 251
           TGVTY  +A M+
Sbjct: 179 TGVTYAAAATMI 190


>gi|345291087|gb|AEN82035.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291089|gb|AEN82036.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291091|gb|AEN82037.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291093|gb|AEN82038.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291095|gb|AEN82039.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291097|gb|AEN82040.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291099|gb|AEN82041.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291101|gb|AEN82042.1| AT3G24180-like protein, partial [Capsella rubella]
          Length = 191

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 71/192 (36%), Positives = 96/192 (50%), Gaps = 12/192 (6%)

Query: 70  FLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM 119
           F   V+P     +   E FD+    +++N GFPDQT          AYCG L +AAL   
Sbjct: 1   FGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLAALQAA 60

Query: 120 VEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLF 179
             M+  + D    +    K   A       LW GSY+ +DS  S +S SI  DQL G  +
Sbjct: 61  AAMALQIGDKFFAELCKNKFLNAKAALEKKLWNGSYFNYDSGASSNSKSIQTDQLAGQWY 120

Query: 180 LKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVW 239
             +SG+    IF  + I   +  IF  NV   + G MGAVNGM P+G  D + +QS E+W
Sbjct: 121 AASSGL--PPIFEESKIKSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDTCMQSREIW 178

Query: 240 TGVTYLLSAAML 251
           TGVTY  +A M+
Sbjct: 179 TGVTYAAAATMI 190


>gi|295829678|gb|ADG38508.1| AT3G24180-like protein [Neslia paniculata]
          Length = 190

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 71/192 (36%), Positives = 96/192 (50%), Gaps = 12/192 (6%)

Query: 70  FLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM 119
           F   V+P     +   E FD+    +++N GFPDQT          AYCG L +AAL   
Sbjct: 1   FGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLAALQAA 60

Query: 120 VEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLF 179
             M+  + D    +    K   A       LW GSY+ +DS  S +S SI  DQL G  +
Sbjct: 61  AAMALQIGDKFFAELCKNKFLNAKAALEKKLWNGSYFNYDSGASSNSKSIQTDQLAGQWY 120

Query: 180 LKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVW 239
             +SG+    IF  + I   +  IF  NV   + G MGAVNGM P+G  D + +QS E+W
Sbjct: 121 AASSGL--PPIFEESKIRSTMQKIFDFNVMKTKGGRMGAVNGMHPDGKVDETCMQSREIW 178

Query: 240 TGVTYLLSAAML 251
           TGVTY  +A M+
Sbjct: 179 TGVTYAAAATMI 190


>gi|148242848|ref|YP_001228005.1| hypothetical protein SynRCC307_1749 [Synechococcus sp. RCC307]
 gi|147851158|emb|CAK28652.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
          Length = 220

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 69/207 (33%), Positives = 103/207 (49%), Gaps = 12/207 (5%)

Query: 105 TAYCGGLHIAALACMVEMSSLL------NDTQQQQQFSAKLTKASQVYHDLLWTGSYYKF 158
           +AYCG L IAAL   + M   L      +   +Q+ FS  L ++   +  LLW G +YK 
Sbjct: 2   SAYCGALWIAALEAALAMGQTLALELGVDTAAEQRDFSQWLQQSRGNFDQLLWNGEFYKI 61

Query: 159 DSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGA 218
           D+     +  +MADQLCG  + +  G+    +   AN   +L  I      GF  G +G 
Sbjct: 62  DAESG--TPVVMADQLCGDFYARLLGL--PPVVSEANARSSLQAIKEACFEGFAGGRLGV 117

Query: 219 VNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG 277
            NG++ +G   D +     EVWTG+ + L+A     G  + A      +   VY   GL 
Sbjct: 118 ANGLRRDGTPLDPNGTHPLEVWTGINFGLAAYYRLMGETNTAVAITQAVVNQVYA-GGLQ 176

Query: 278 FETPEGLTGDKTYRSGGYMRALAVYAM 304
           F TPE LTG  T+R+  Y+RA+A++A+
Sbjct: 177 FRTPEALTGQNTFRACHYLRAMAIWAL 203


>gi|255073149|ref|XP_002500249.1| hypothetical protein MICPUN_55788 [Micromonas sp. RCC299]
 gi|226515511|gb|ACO61507.1| hypothetical protein MICPUN_55788 [Micromonas sp. RCC299]
          Length = 954

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 91/314 (28%), Positives = 139/314 (44%), Gaps = 38/314 (12%)

Query: 28  PEEEPF-SLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIR-----------FLTRVY 75
           P + P  S  N +++ DV+ WKDL  K ++++ R +AL    R            L R++
Sbjct: 621 PFDAPLTSSPNAFDLTDVNRWKDLAPKLMLTLARAHALRGGFRGPNDPGGIPRDVLRRMF 680

Query: 76  PTCLE-LIRKCESFDKQGLGIVQNG----GFPDQT-----------AYCGGLHIAALACM 119
             C E L  +   FD  G G++++       PD +           AY GGL +AAL   
Sbjct: 681 RPCYESLAAQLRRFDGNGDGLIEHDDDDEAGPDHSFVAWRARGDTSAYAGGLWLAALRAG 740

Query: 120 VEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTG------SYYKFDSSQSRHSDSIMADQ 173
             ++  LN+T  +      +  A+  +   LW G       YY+FD+S +   D   A Q
Sbjct: 741 AGLARDLNETHARDSLEQTMRAAAVAHDAALWLGDDDGGRGYYRFDASGTEGGDVSSAAQ 800

Query: 174 LCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAI 233
           + G   L    +    + P+  +  ALA++   NV                +G  D  + 
Sbjct: 801 VMGEWALAV--IGARGVLPARKVRAALASVARLNVAAAGGAGGAVNGAAA-SGSVDAVSA 857

Query: 234 QSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSG 293
            S EVW GV+Y +++ ML  G  DEAW  A G+    YER GL F TPE       +RS 
Sbjct: 858 HSNEVWPGVSYAVASHMLLAGMDDEAWELARGVAAGTYER-GLAFRTPEAWDARGGFRSA 916

Query: 294 GYMRALAVYAMQDA 307
              RA +V+A++ A
Sbjct: 917 MSQRAGSVWAIEHA 930


>gi|226444526|gb|ACO57837.1| unknown [Helianthus annuus]
 gi|226444528|gb|ACO57838.1| unknown [Helianthus annuus]
 gi|226444560|gb|ACO57854.1| unknown [Helianthus petiolaris]
 gi|226444568|gb|ACO57858.1| unknown [Helianthus petiolaris]
          Length = 103

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 55/106 (51%), Positives = 70/106 (66%), Gaps = 5/106 (4%)

Query: 207 NVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 266
            VRG   G MGAVNGM PNG  D + +QS EVWTGVTY ++A M++ G  D+A+TTA G+
Sbjct: 1   KVRG---GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDKAFTTAEGI 57

Query: 267 YRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 310
           +   +   G G  F+TPEG T D +YRS  YMR LA++ MQ A  K
Sbjct: 58  FIAGWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 103


>gi|226444514|gb|ACO57831.1| unknown [Helianthus annuus]
 gi|226444516|gb|ACO57832.1| unknown [Helianthus annuus]
 gi|226444518|gb|ACO57833.1| unknown [Helianthus annuus]
 gi|226444520|gb|ACO57834.1| unknown [Helianthus annuus]
 gi|226444522|gb|ACO57835.1| unknown [Helianthus annuus]
 gi|226444530|gb|ACO57839.1| unknown [Helianthus annuus]
 gi|226444532|gb|ACO57840.1| unknown [Helianthus annuus]
 gi|226444534|gb|ACO57841.1| unknown [Helianthus annuus]
 gi|226444536|gb|ACO57842.1| unknown [Helianthus annuus]
 gi|226444538|gb|ACO57843.1| unknown [Helianthus annuus]
 gi|226444540|gb|ACO57844.1| unknown [Helianthus annuus]
 gi|226444542|gb|ACO57845.1| unknown [Helianthus annuus]
 gi|226444544|gb|ACO57846.1| unknown [Helianthus annuus]
 gi|226444546|gb|ACO57847.1| unknown [Helianthus petiolaris]
 gi|226444548|gb|ACO57848.1| unknown [Helianthus petiolaris]
 gi|226444550|gb|ACO57849.1| unknown [Helianthus petiolaris]
 gi|226444552|gb|ACO57850.1| unknown [Helianthus petiolaris]
 gi|226444554|gb|ACO57851.1| unknown [Helianthus petiolaris]
 gi|226444556|gb|ACO57852.1| unknown [Helianthus petiolaris]
 gi|226444558|gb|ACO57853.1| unknown [Helianthus petiolaris]
 gi|226444562|gb|ACO57855.1| unknown [Helianthus petiolaris]
 gi|226444564|gb|ACO57856.1| unknown [Helianthus petiolaris]
 gi|226444566|gb|ACO57857.1| unknown [Helianthus petiolaris]
 gi|226444570|gb|ACO57859.1| unknown [Helianthus argophyllus]
 gi|226444572|gb|ACO57860.1| unknown [Helianthus argophyllus]
 gi|226444576|gb|ACO57862.1| unknown [Helianthus argophyllus]
 gi|226444578|gb|ACO57863.1| unknown [Helianthus argophyllus]
 gi|226444580|gb|ACO57864.1| unknown [Helianthus argophyllus]
 gi|226444582|gb|ACO57865.1| unknown [Helianthus argophyllus]
 gi|226444584|gb|ACO57866.1| unknown [Helianthus argophyllus]
 gi|226444586|gb|ACO57867.1| unknown [Helianthus argophyllus]
 gi|226444588|gb|ACO57868.1| unknown [Helianthus argophyllus]
 gi|226444590|gb|ACO57869.1| unknown [Helianthus argophyllus]
 gi|226444592|gb|ACO57870.1| unknown [Helianthus argophyllus]
          Length = 103

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 55/106 (51%), Positives = 69/106 (65%), Gaps = 5/106 (4%)

Query: 207 NVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 266
            VRG   G MGAVNGM PNG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G+
Sbjct: 1   KVRG---GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGI 57

Query: 267 YRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 310
           +   +   G G  F+TPEG T D +YRS  YMR LA++ MQ A  K
Sbjct: 58  FIAGWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 103


>gi|226444524|gb|ACO57836.1| unknown [Helianthus annuus]
          Length = 103

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/103 (52%), Positives = 68/103 (66%), Gaps = 5/103 (4%)

Query: 207 NVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 266
            VRG   G MGAVNGM PNG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G+
Sbjct: 1   KVRG---GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGI 57

Query: 267 YRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
           +   +   G G  F+TPEG T D +YRS  YMR LA++ MQ A
Sbjct: 58  FIAGWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQA 100


>gi|226444574|gb|ACO57861.1| unknown [Helianthus argophyllus]
          Length = 103

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 68/106 (64%), Gaps = 5/106 (4%)

Query: 207 NVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 266
            VRG   G MGAVNGM PNG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G+
Sbjct: 1   KVRG---GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGI 57

Query: 267 YRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 310
           +   +   G G  F+TPEG T D +YRS  YMR LA++  Q A  K
Sbjct: 58  FIAGWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGKQQALEK 103


>gi|449674846|ref|XP_002159451.2| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Hydra
           magnipapillata]
          Length = 751

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 46/136 (33%), Positives = 76/136 (55%), Gaps = 10/136 (7%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
           P++EPF  +N Y + D S WKDLNL F++ +YRD+ + K+ +FL  ++P     +    +
Sbjct: 616 PDDEPFVRVNAYCIHDTSQWKDLNLHFVLQVYRDFYITKNKQFLMDMWPAMKAAMAHSLA 675

Query: 88  FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
            D  G GI++N G  D T          AY GGL ++AL C++E++ +L  +Q   ++ +
Sbjct: 676 QDIDGDGIIENSGIADSTFDTWVVTGPSAYTGGLWLSALRCIIEIAHILGLSQSISKYKS 735

Query: 138 KLTKASQVYHDLLWTG 153
            L +  + Y   LW G
Sbjct: 736 ILERGKKSYEKKLWNG 751


>gi|411117353|ref|ZP_11389840.1| putative bile acid beta-glucosidase [Oscillatoriales cyanobacterium
           JSC-12]
 gi|410713456|gb|EKQ70957.1| putative bile acid beta-glucosidase [Oscillatoriales cyanobacterium
           JSC-12]
          Length = 907

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 63/180 (35%), Positives = 87/180 (48%), Gaps = 31/180 (17%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRD--YALHKDIRFLTRVYPTCLELIRKC 85
           P E  +   N  +  D + WKDL   F++ +YRD  Y   KDI FL   +    + +   
Sbjct: 520 PNEHVWQKTNYTSYQDCNLWKDLPCDFVLQVYRDFLYTGSKDIAFLQECWSAIAQTLHYL 579

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALAC--------------MVE 121
           ++FD    GI +N G PDQT          AYCGGL IAAL                M +
Sbjct: 580 KTFDYDFDGIPENSGAPDQTFDDWKLDGISAYCGGLWIAALEAALAIGRILAEYTTNMRQ 639

Query: 122 MSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLK 181
           +++L ND      F   L +A  +YHD LW G YY+ DS+    S+ +MADQLCG  + +
Sbjct: 640 LATLSNDIGN---FQVWLEQARSIYHDTLWNGQYYRLDSNSC--SEVVMADQLCGQFYAR 694



 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 55/91 (60%), Gaps = 2/91 (2%)

Query: 215 SMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 273
            +GA NG+KP+G   + ++    EVWTG+ + L+A ++  G  DEA      + + +Y  
Sbjct: 809 PIGAANGVKPDGSPENPNSTHPLEVWTGINFGLAAFLVQMGMRDEAMELTQAVVQQIY-T 867

Query: 274 TGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
            GL F TPE +T   T+R+  Y+RALA++A+
Sbjct: 868 NGLQFRTPEAITPSGTFRACHYLRALAIWAI 898


>gi|295828610|gb|ADG37974.1| AT1G33700-like protein [Capsella grandiflora]
          Length = 105

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 58/96 (60%), Gaps = 10/96 (10%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN+F+ + WKDLN KF++ +YRD     D+ F   V+P+    I   + FDK
Sbjct: 1   DPWFEVNAYNLFNTNRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 60

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAAL 116
            G G+++N GFPDQT          AYCGGL +AAL
Sbjct: 61  DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAAL 96


>gi|295828604|gb|ADG37971.1| AT1G33700-like protein [Capsella grandiflora]
 gi|295828608|gb|ADG37973.1| AT1G33700-like protein [Capsella grandiflora]
          Length = 105

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 57/96 (59%), Gaps = 10/96 (10%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN+F+   WKDLN KF++ +YRD     D+ F   V+P+    I   + FDK
Sbjct: 1   DPWFEVNAYNLFNTBRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 60

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAAL 116
            G G+++N GFPDQT          AYCGGL +AAL
Sbjct: 61  DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAAL 96


>gi|295828602|gb|ADG37970.1| AT1G33700-like protein [Capsella grandiflora]
 gi|295828606|gb|ADG37972.1| AT1G33700-like protein [Capsella grandiflora]
 gi|295828612|gb|ADG37975.1| AT1G33700-like protein [Capsella grandiflora]
 gi|295828614|gb|ADG37976.1| AT1G33700-like protein [Neslia paniculata]
          Length = 105

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 57/96 (59%), Gaps = 10/96 (10%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN+F+   WKDLN KF++ +YRD     D+ F   V+P+    I   + FDK
Sbjct: 1   DPWFEVNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 60

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAAL 116
            G G+++N GFPDQT          AYCGGL +AAL
Sbjct: 61  DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAAL 96


>gi|345289099|gb|AEN81041.1| AT1G33700-like protein, partial [Capsella rubella]
          Length = 191

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 57/96 (59%), Gaps = 10/96 (10%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN+F+   WKDLN KF++ +YRD     D+ F   V+P+    I   + FDK
Sbjct: 83  DPWFEVNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 142

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAAL 116
            G G+++N GFPDQT          AYCGGL +AAL
Sbjct: 143 DGDGMIENDGFPDQTYDAWSCSGVSAYCGGLWVAAL 178


>gi|345289089|gb|AEN81036.1| AT1G33700-like protein, partial [Capsella rubella]
 gi|345289091|gb|AEN81037.1| AT1G33700-like protein, partial [Capsella rubella]
 gi|345289093|gb|AEN81038.1| AT1G33700-like protein, partial [Capsella rubella]
 gi|345289095|gb|AEN81039.1| AT1G33700-like protein, partial [Capsella rubella]
 gi|345289097|gb|AEN81040.1| AT1G33700-like protein, partial [Capsella rubella]
 gi|345289101|gb|AEN81042.1| AT1G33700-like protein, partial [Capsella rubella]
 gi|345289103|gb|AEN81043.1| AT1G33700-like protein, partial [Capsella rubella]
          Length = 191

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 57/96 (59%), Gaps = 10/96 (10%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +N YN+F+   WKDLN KF++ +YRD     D+ F   V+P+    I   + FDK
Sbjct: 83  DPWFEVNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 142

Query: 91  QGLGIVQNGGFPDQT----------AYCGGLHIAAL 116
            G G+++N GFPDQT          AYCGGL +AAL
Sbjct: 143 DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAAL 178


>gi|256082539|ref|XP_002577512.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
          Length = 717

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 50/157 (31%), Positives = 81/157 (51%), Gaps = 31/157 (19%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRK-CE 86
           PE+EP+   N Y +F    WKDLN KFI+ ++RD+ + +D ++L  + P  L ++RK   
Sbjct: 560 PEDEPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLYMLPIVLRILRKSLV 619

Query: 87  SFDKQGLGIVQNGGFPDQ----------TAYCGGLHIAALACMVEMSS------------ 124
           ++D    G+++N GFPDQ          TAY GG+ ++ L    +M S            
Sbjct: 620 AWDSDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYATFDMLSWCLKSDSPVYDQ 679

Query: 125 LLNDTQQQQQFSAKL--------TKASQVYHDLLWTG 153
           ++N+T   Q+  +++        TKA   Y+  LWTG
Sbjct: 680 IVNNTDDTQRSWSEIKDEVQTLFTKARDSYNAKLWTG 716


>gi|353232146|emb|CCD79501.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
          Length = 725

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 50/157 (31%), Positives = 81/157 (51%), Gaps = 31/157 (19%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRK-CE 86
           PE+EP+   N Y +F    WKDLN KFI+ ++RD+ + +D ++L  + P  L ++RK   
Sbjct: 568 PEDEPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLYMLPIVLRILRKSLV 627

Query: 87  SFDKQGLGIVQNGGFPDQ----------TAYCGGLHIAALACMVEMSS------------ 124
           ++D    G+++N GFPDQ          TAY GG+ ++ L    +M S            
Sbjct: 628 AWDSDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYATFDMLSWCLKSDSPVYDQ 687

Query: 125 LLNDTQQQQQFSAKL--------TKASQVYHDLLWTG 153
           ++N+T   Q+  +++        TKA   Y+  LWTG
Sbjct: 688 IVNNTDDTQRSWSEIKDEVQTLFTKARDSYNAKLWTG 724


>gi|194359640|gb|ACF57667.1| bile acid beta-glucosidase [Ctenopharyngodon idella]
          Length = 89

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 36/89 (40%), Positives = 56/89 (62%), Gaps = 10/89 (11%)

Query: 32  PFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQ 91
           P+  +N Y ++D +DWKDLNLKF++ ++RDY L +D ++L  ++P C  ++     FDK 
Sbjct: 1   PWVRVNAYLIYDTADWKDLNLKFVLQVFRDYHLTQDEQYLKDMWPICQTVMETELKFDKD 60

Query: 92  GLGIVQNGGFPDQT----------AYCGG 110
           G G+++N G+ DQT          AYCGG
Sbjct: 61  GDGLIENSGYADQTYDGWKVTGPSAYCGG 89


>gi|15922862|ref|NP_378531.1| hypothetical protein ST2526 [Sulfolobus tokodaii str. 7]
 gi|15623653|dbj|BAB67640.1| putative glycosidase [Sulfolobus tokodaii str. 7]
          Length = 646

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 63/234 (26%), Positives = 101/234 (43%), Gaps = 18/234 (7%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQN-GGFPDQ- 104
           W DL   +++ IYRDY    D+ FL R Y    E+I    S DK    I  + GGF +  
Sbjct: 378 WTDLGSTWVLMIYRDYKFTNDLAFLKRNYRKMKEVIDWLISLDKDKDCIPDSKGGFDNSY 437

Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
                   ++Y G + + AL   +  S +L    +   +   L +    ++  LW G Y+
Sbjct: 438 DGTYMYGASSYVGSMFLCALRAFISASKILG--MEYSIYEDCLRRGIMTFNS-LWNGKYF 494

Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
               S +R  +S M+ Q+ G  +    G+  E I     I +AL +I+  N +   A   
Sbjct: 495 IAWKSDNRKKESCMSSQILGQFWCDILGL--EPIIDEDKIVQALRSIYELNGK---ASKF 549

Query: 217 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 270
             VN + P+G  D    Q    W+ V + +SA M+  G  +E    A   + T+
Sbjct: 550 CLVNSVNPDGSIDTETDQMRSCWSRVAFAVSAHMIIRGLKNEGIEIANREWETI 603


>gi|47196687|emb|CAF89164.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 239

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 40/86 (46%), Positives = 51/86 (59%), Gaps = 2/86 (2%)

Query: 168 SIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 227
           S+ A   CG+   +     T   FP   I  AL +IF  NV  F  G MGAVNGM+P G 
Sbjct: 155 SVPATGFCGHPVWETE--TTRQAFPKEKIQTALKSIFDLNVMSFAGGQMGAVNGMRPEGV 212

Query: 228 RDRSAIQSEEVWTGVTYLLSAAMLYE 253
            DRS++QS+EVW GV Y L+A M++E
Sbjct: 213 PDRSSVQSDEVWIGVVYGLAATMIHE 238



 Score = 43.1 bits (100), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 33/49 (67%), Gaps = 2/49 (4%)

Query: 58  IYRDYALHKDIRFLTRVYPTCLELIRKCE-SFDKQGLGIVQNGGFPDQT 105
           +YRD+ L +D ++L  ++P C E++ + E  FD  G G+++N G+ DQT
Sbjct: 82  VYRDFHLTQDAQYLRDMWPIC-EMVTESELQFDLDGDGLIENSGYADQT 129


>gi|332796021|ref|YP_004457521.1| hypothetical protein Ahos_0332 [Acidianus hospitalis W1]
 gi|332693756|gb|AEE93223.1| conserved hypothetical protein [Acidianus hospitalis W1]
          Length = 652

 Score = 80.5 bits (197), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 62/237 (26%), Positives = 103/237 (43%), Gaps = 18/237 (7%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQN-GGFPDQ- 104
           W DL   +I+ IYRDY    D+ FL R Y    E+I      D+   GI  + GGF +  
Sbjct: 383 WNDLGPTWILMIYRDYKFTNDLSFLKRNYNKMKEVIDWLIKKDEDNDGIPDSKGGFDNSY 442

Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
                   ++Y G L + AL   +E S +L  +    ++   L KA       LW G Y+
Sbjct: 443 DGTYMYGTSSYIGSLFLCALKAFIESSKIL--SYDYSKYEEILNKAKSSLES-LWNGRYF 499

Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
                + + ++S +  QL G  +    G+    +     I  AL  I+  N +   A   
Sbjct: 500 INWKYKDQKNESCLNSQLLGEFWCNLLGLGN--VIDEDKIKTALKYIYEHNGK---ASKY 554

Query: 217 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 273
             VN + P+G  D S  Q +  W  +++ +++ M+ +G ++E    A   + T+  R
Sbjct: 555 CLVNSVNPDGSIDESTDQMKSCWPRISFAIASHMIMKGMINEGIEIARKEWETISSR 611


>gi|327310049|ref|YP_004336946.1| hypothetical protein TUZN_0133 [Thermoproteus uzoniensis 768-20]
 gi|326946528|gb|AEA11634.1| hypothetical protein TUZN_0133 [Thermoproteus uzoniensis 768-20]
          Length = 660

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 61/239 (25%), Positives = 99/239 (41%), Gaps = 17/239 (7%)

Query: 44  VSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPD 103
           ++ WKDL   +++ IYRDY    D+  L R      + +    S D+ G  +  + G  D
Sbjct: 381 LTPWKDLASTWVLMIYRDYYYTGDVEILRRNIDAMKDAVDWLMSLDRDGDCVPDSRGRND 440

Query: 104 ----------QTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTG 153
                     + AY   L + AL   ++ +  L   + + ++   L +  +   D LW G
Sbjct: 441 NSYDGSNMYGRVAYVASLFLCALTAYIKAAERLG-LKPEPKYLECLERGKRSL-DELWNG 498

Query: 154 SYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEA 213
            YY   +   R  D+ M+ QL G  +     ++   I P   +A AL +I+     G  A
Sbjct: 499 RYYVAWADGPRRKDACMSSQLLGQFWCDM--LDLPPIAPDDKVAAALRSIYEL---GMRA 553

Query: 214 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYE 272
                 N + P+G  D    Q    WT V + ++A ML  G   E    A   +RT+ E
Sbjct: 554 SKYCIPNSVTPDGKPDEETPQLRSCWTRVNFAVAAHMLLRGMEREGMEVASREWRTIAE 612


>gi|345293667|gb|AEN83325.1| AT5G49900-like protein, partial [Capsella rubella]
 gi|345293669|gb|AEN83326.1| AT5G49900-like protein, partial [Capsella rubella]
 gi|345293671|gb|AEN83327.1| AT5G49900-like protein, partial [Capsella rubella]
 gi|345293673|gb|AEN83328.1| AT5G49900-like protein, partial [Capsella rubella]
 gi|345293675|gb|AEN83329.1| AT5G49900-like protein, partial [Capsella rubella]
          Length = 156

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 46/75 (61%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +NGYN+ +   WKDLN KF++ +YRD     D +F   V+P+    +     FDK
Sbjct: 69  DPWFEVNGYNLHNTDRWKDLNPKFVLQVYRDVVATGDKKFAVAVWPSVYVAMAYMAQFDK 128

Query: 91  QGLGIVQNGGFPDQT 105
            G G+++N GFPDQT
Sbjct: 129 DGDGMIENEGFPDQT 143


>gi|269925332|ref|YP_003321955.1| hypothetical protein Tter_0211 [Thermobaculum terrenum ATCC
           BAA-798]
 gi|269788992|gb|ACZ41133.1| protein of unknown function DUF608 [Thermobaculum terrenum ATCC
           BAA-798]
          Length = 811

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 63/247 (25%), Positives = 102/247 (41%), Gaps = 29/247 (11%)

Query: 41  VFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNG- 99
           ++D     D  L  ++ +YR++    D  FL  ++P     +     +  +    +  G 
Sbjct: 457 IWDHEAAADGQLGTVIRVYREWKFSGDDAFLKELWPKVKSAVNYSSLYWDKDQDFILEGR 516

Query: 100 -------GFPDQTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWT 152
                   F       G L + AL    EM+  L  ++    ++     +S+    LLW 
Sbjct: 517 QHNTYDIEFYGPNPLTGILFLGALRAAEEMAKYLG-SESASSYAQAFEASSKKLDQLLWN 575

Query: 153 GSYY--KFDSSQS---RHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTN 207
           G YY  K D       +H+   ++DQL G      +G+    + P  +I+RAL +IF+ N
Sbjct: 576 GEYYIQKLDDPNEHRYQHASGCLSDQLFGQTLASLTGLGY--LLPKEHISRALESIFAYN 633

Query: 208 V-----------RGFEAGS-MGAVNGMKPNGDRDRSAIQ-SEEVWTGVTYLLSAAMLYEG 254
                       R +  G   G V    P GDR       S+EVW+G  Y ++  M+YEG
Sbjct: 634 FKPNFWNHTNTQRVYALGDDAGLVMCTWPFGDRPSFPFPYSDEVWSGTEYQVATLMIYEG 693

Query: 255 NVDEAWT 261
            +DEA T
Sbjct: 694 LLDEALT 700


>gi|345293661|gb|AEN83322.1| AT5G49900-like protein, partial [Capsella grandiflora]
 gi|345293663|gb|AEN83323.1| AT5G49900-like protein, partial [Capsella grandiflora]
 gi|345293665|gb|AEN83324.1| AT5G49900-like protein, partial [Capsella grandiflora]
          Length = 156

 Score = 70.9 bits (172), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 46/75 (61%)

Query: 31  EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           +P+  +NGY++ +   WKDLN KF++ +YRD     D +F   V+P+    +     FDK
Sbjct: 69  DPWFEVNGYSLHNTDRWKDLNPKFVLQVYRDVVATGDKKFAVAVWPSVYVAMAYMAQFDK 128

Query: 91  QGLGIVQNGGFPDQT 105
            G G+++N GFPDQT
Sbjct: 129 DGDGMIENEGFPDQT 143


>gi|159042199|ref|YP_001541451.1| hypothetical protein Cmaq_1638 [Caldivirga maquilingensis IC-167]
 gi|157921034|gb|ABW02461.1| protein of unknown function DUF608 [Caldivirga maquilingensis
           IC-167]
          Length = 661

 Score = 70.9 bits (172), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 63/237 (26%), Positives = 105/237 (44%), Gaps = 23/237 (9%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQN-GGFPDQ- 104
           W DL   +I+ IYRDY L  ++  L R      E+I    S D  G  I  + GGF +  
Sbjct: 388 WNDLGSTWILMIYRDYLLTGNVEVLRRNIDKMREVIDWLISRDYDGDCIPDSRGGFDNSY 447

Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
                   ++Y   L + +L   ++ + +L   +   ++ + L+K  +  +  LW G Y+
Sbjct: 448 DGTNMYGASSYIASLFLCSLQAFIKSAEVLG-VRLSDRYESCLSKGRETLNS-LWNGRYF 505

Query: 157 KFDSSQSRHSDSIMADQLCGYL---FLKASGVNTEAIFPSANIARALATIFSTNVRGFEA 213
               S    ++S M  QL G     FLK   V  E       I  AL +I+  N    ++
Sbjct: 506 MAWKSSGNSNESCMNSQLLGQFWCDFLKLPPVVDE-----DKIKVALRSIYELN---HKS 557

Query: 214 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 270
                 N +KP+G+ D S+ Q    W  V+++++A M+  G V+E    A   + T+
Sbjct: 558 SPHCLPNSVKPSGEIDTSSGQMRSCWPRVSFVVTAHMVLRGMVNEGLEIAKKEWDTI 614


>gi|303285310|ref|XP_003061945.1| hypothetical protein MICPUCDRAFT_48400 [Micromonas pusilla CCMP1545]
 gi|226456356|gb|EEH53657.1| hypothetical protein MICPUCDRAFT_48400 [Micromonas pusilla CCMP1545]
          Length = 1073

 Score = 70.9 bits (172), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 62/253 (24%), Positives = 102/253 (40%), Gaps = 53/253 (20%)

Query: 103  DQTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGS-------- 154
            + ++Y GGL +AAL     ++  L++T  +      L  A++ + + LW G         
Sbjct: 782  NASSYGGGLWLAALRAGAGIARDLDETDARTSLEETLRIAAKSFDEALWRGGGGGGGGGV 841

Query: 155  ----------------YYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIAR 198
                            YY  D+S +   D ++A Q+ G   L    +    +     +  
Sbjct: 842  RKSQSRSRRGTHALEGYYAADASGTDAGDVVLAGQVMGEWALGM--IRAPGVLDPRKVRA 899

Query: 199  ALATIFSTNVRGFEAGSM----------------GAVNGMKPNGDRDRSAI--------- 233
            AL+T +  NV+ F                     GAVNG + +   D +A+         
Sbjct: 900  ALSTTYERNVKAFGRARGGSGSGSDDEPSTYVPSGAVNGARVDDATDAAALGVGAGDGIP 959

Query: 234  -QSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRS 292
             Q+ E   G +Y L++ ++  G  DE W  A G YR  YE  G  F TPE    ++ +R 
Sbjct: 960  AQARESRVGQSYALASHLILAGFSDEGWDVARGAYRVTYE-DGFAFRTPEIFDAERRFRG 1018

Query: 293  GGYMRALAVYAMQ 305
                RA AV+A++
Sbjct: 1019 AISGRAGAVWAIE 1031


>gi|227828867|ref|YP_002830647.1| hypothetical protein M1425_2640 [Sulfolobus islandicus M.14.25]
 gi|227460663|gb|ACP39349.1| protein of unknown function DUF608 [Sulfolobus islandicus M.14.25]
          Length = 661

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 58/236 (24%), Positives = 95/236 (40%), Gaps = 17/236 (7%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
           W DL    ++ +YRDY    D   L + Y    E+I      D     I  + G  D   
Sbjct: 393 WTDLGPTLVLMLYRDYVFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSY 452

Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
                   ++Y   + ++ALA  +++S +L D +   ++   L    + ++  LW G Y+
Sbjct: 453 DGTHMYGASSYVASMFLSALAAFIKISEIL-DVKIDTKYYRFLECGKKTFNS-LWNGKYF 510

Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
                    + S +  QL G  +    G+    I  +  I  AL +I+  N   F A   
Sbjct: 511 VLWKKNDEENRSCLNSQLLGQFWCDILGL--PPITDNDKINTALRSIYELN---FRASKY 565

Query: 217 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYE 272
              N +K + D D S  Q    W  V++ ++A M+  G V E    A   + TV E
Sbjct: 566 CLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHMILRGMVKEGMEVAKREWETVKE 621


>gi|229586074|ref|YP_002844576.1| hypothetical protein M1627_2694 [Sulfolobus islandicus M.16.27]
 gi|228021124|gb|ACP56531.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.27]
          Length = 661

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 57/236 (24%), Positives = 95/236 (40%), Gaps = 17/236 (7%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
           W DL    ++ +YRDY    D   L + Y    E+I      D     I  + G  D   
Sbjct: 393 WTDLGPTLVLMLYRDYVFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSY 452

Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
                   ++Y   + ++ALA  +++S +L D +   ++   L    + ++  LW G Y+
Sbjct: 453 DGTHMYGASSYVASMFLSALAAFIKISEIL-DVKIDTKYYRFLECGKKTFNS-LWNGKYF 510

Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
                    + S +  QL G  +    G+    I  +  I  AL +I+  N   F A   
Sbjct: 511 VLWKKNDEENRSCLNSQLLGQFWCDILGL--PPITDNDKINTALRSIYELN---FRASKY 565

Query: 217 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYE 272
              N +K + D D S  Q    W  V++ ++A M+  G V E    A   + T+ E
Sbjct: 566 CLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHMILRGMVKEGMEVAKREWETIKE 621


>gi|227831601|ref|YP_002833381.1| hypothetical protein LS215_2801 [Sulfolobus islandicus L.S.2.15]
 gi|229580553|ref|YP_002838953.1| hypothetical protein YG5714_2812 [Sulfolobus islandicus Y.G.57.14]
 gi|238621059|ref|YP_002915885.1| hypothetical protein M164_2625 [Sulfolobus islandicus M.16.4]
 gi|284999153|ref|YP_003420921.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
 gi|227458049|gb|ACP36736.1| protein of unknown function DUF608 [Sulfolobus islandicus L.S.2.15]
 gi|228011269|gb|ACP47031.1| protein of unknown function DUF608 [Sulfolobus islandicus
           Y.G.57.14]
 gi|238382129|gb|ACR43217.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.4]
 gi|284447049|gb|ADB88551.1| protein of unknown function DUF608 [Sulfolobus islandicus L.D.8.5]
          Length = 661

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 57/236 (24%), Positives = 95/236 (40%), Gaps = 17/236 (7%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
           W DL    ++ +YRDY    D   L + Y    E+I      D     I  + G  D   
Sbjct: 393 WTDLGPTLVLMLYRDYVFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSY 452

Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
                   ++Y   + ++ALA  +++S +L D +   ++   L    + ++  LW G Y+
Sbjct: 453 DGTHMYGASSYVASMFLSALAAFIKISEIL-DVKIDTKYYRFLECGKKTFNS-LWNGKYF 510

Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
                    + S +  QL G  +    G+    I  +  I  AL +I+  N   F A   
Sbjct: 511 VLWKKNDEENRSCLNSQLLGQFWCDILGL--PPITDNDKINTALRSIYELN---FRASKY 565

Query: 217 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYE 272
              N +K + D D S  Q    W  V++ ++A M+  G V E    A   + T+ E
Sbjct: 566 CLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHMILRGMVKEGMEVAKREWETIKE 621


>gi|385777223|ref|YP_005649791.1| hypothetical protein [Sulfolobus islandicus REY15A]
 gi|323475970|gb|ADX86576.1| conserved hypothetical protein [Sulfolobus islandicus REY15A]
          Length = 661

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 57/236 (24%), Positives = 95/236 (40%), Gaps = 17/236 (7%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
           W DL    ++ +YRDY    D   L + Y    E+I      D     I  + G  D   
Sbjct: 393 WTDLGPTLVLMLYRDYVFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSY 452

Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
                   ++Y   + ++ALA  +++S +L D +   ++   L    + ++  LW G Y+
Sbjct: 453 DGTHMYGASSYVASMFLSALAAFIKISEIL-DVKIDTKYYRFLECGKKTFNS-LWNGKYF 510

Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
                    + S +  QL G  +    G+    I  +  I  AL +I+  N   F A   
Sbjct: 511 VLWKKNDEENRSCLNSQLLGQFWCDILGL--PPITDNDKINTALRSIYELN---FRASKY 565

Query: 217 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYE 272
              N +K + D D S  Q    W  V++ ++A M+  G V E    A   + T+ E
Sbjct: 566 CLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHMILRGMVKEGMEVAKREWETIKE 621


>gi|385774575|ref|YP_005647144.1| hypothetical protein [Sulfolobus islandicus HVE10/4]
 gi|323478691|gb|ADX83929.1| conserved hypothetical protein [Sulfolobus islandicus HVE10/4]
          Length = 661

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 57/236 (24%), Positives = 95/236 (40%), Gaps = 17/236 (7%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
           W DL    ++ +YRDY    D   L + Y    E+I      D     I  + G  D   
Sbjct: 393 WTDLGPTLVLMLYRDYVFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSY 452

Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
                   ++Y   + ++ALA  +++S +L D +   ++   L    + ++  LW G Y+
Sbjct: 453 DGTHMYGASSYVASMFLSALAAFIKISEIL-DVKIDTKYYRFLECGKKTFNS-LWNGKYF 510

Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
                    + S +  QL G  +    G+    I  +  I  AL +I+  N   F A   
Sbjct: 511 VLWKKNDEENRSCLNSQLLGQFWCDILGL--PPITDNDKINTALRSIYELN---FRASKY 565

Query: 217 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYE 272
              N +K + D D S  Q    W  V++ ++A M+  G V E    A   + T+ E
Sbjct: 566 CLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHMILRGMVKEGMEVAKREWETIKE 621


>gi|284173198|ref|ZP_06387167.1| hypothetical protein Ssol98_00875 [Sulfolobus solfataricus 98/2]
          Length = 647

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 55/236 (23%), Positives = 94/236 (39%), Gaps = 17/236 (7%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
           W DL    ++ +YRDY    +   L + Y    E+I      D     I  + G  D   
Sbjct: 379 WTDLGPTLVLMLYRDYVFTSNREILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSY 438

Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
                   ++Y   + ++AL   ++MS +L D +   ++   L    + ++  LW G Y+
Sbjct: 439 DGTHMYGASSYIASMFLSALTAFIKMSEIL-DVKIDDKYYRFLECGKKTFNS-LWNGKYF 496

Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
                    + S +  QL G  +    G+    I     I  AL +I+  N   F+A   
Sbjct: 497 ILWKKNDEENTSCLNSQLLGQFWCDILGL--PPITDHDKINTALRSIYELN---FKASKY 551

Query: 217 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYE 272
              N ++ +G  D S  Q    W  V++ ++A M+  G V E    A   + T+ E
Sbjct: 552 CLTNAVREDGSVDSSTAQLRSCWPRVSFAVAAHMILRGMVKEGIEVAKREWETIKE 607


>gi|15899397|ref|NP_344002.1| hypothetical protein SSO2674 [Sulfolobus solfataricus P2]
 gi|384433011|ref|YP_005642369.1| hypothetical protein [Sulfolobus solfataricus 98/2]
 gi|13815992|gb|AAK42792.1| Conserved hypothetical protein [Sulfolobus solfataricus P2]
 gi|261601165|gb|ACX90768.1| protein of unknown function DUF608 [Sulfolobus solfataricus 98/2]
          Length = 648

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 55/236 (23%), Positives = 94/236 (39%), Gaps = 17/236 (7%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
           W DL    ++ +YRDY    +   L + Y    E+I      D     I  + G  D   
Sbjct: 380 WTDLGPTLVLMLYRDYVFTSNREILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSY 439

Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
                   ++Y   + ++AL   ++MS +L D +   ++   L    + ++  LW G Y+
Sbjct: 440 DGTHMYGASSYIASMFLSALTAFIKMSEIL-DVKIDDKYYRFLECGKKTFNS-LWNGKYF 497

Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
                    + S +  QL G  +    G+    I     I  AL +I+  N   F+A   
Sbjct: 498 ILWKKNDEENTSCLNSQLLGQFWCDILGL--PPITDHDKINTALRSIYELN---FKASKY 552

Query: 217 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYE 272
              N ++ +G  D S  Q    W  V++ ++A M+  G V E    A   + T+ E
Sbjct: 553 CLTNAVREDGSVDSSTAQLRSCWPRVSFAVAAHMILRGMVKEGIEVAKREWETIKE 608


>gi|229583411|ref|YP_002841810.1| hypothetical protein YN1551_3001 [Sulfolobus islandicus Y.N.15.51]
 gi|228014127|gb|ACP49888.1| protein of unknown function DUF608 [Sulfolobus islandicus
           Y.N.15.51]
          Length = 661

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 57/236 (24%), Positives = 95/236 (40%), Gaps = 17/236 (7%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
           W DL    ++ +YRDY    D   L + Y    E+I      D     I  + G  D   
Sbjct: 393 WTDLGPTLVLMLYRDYIFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSY 452

Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
                   ++Y   + ++ALA  +++S +L D +   ++   L    + ++  LW G Y+
Sbjct: 453 DGTHMYGASSYVASMFLSALAAFIKISEIL-DIKIDTKYYRFLECGKKTFNS-LWNGKYF 510

Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
                    + S +  QL G  +    G+    I  +  I  AL +I+  N   F A   
Sbjct: 511 VLWKKNDEENRSCLNSQLLGQFWCDILGL--PPITDNDKINTALRSIYELN---FRASKY 565

Query: 217 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYE 272
              N +K + D D S  Q    W  V++ ++A M+  G V E    A   + T+ E
Sbjct: 566 CLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHMILRGMVKEGMEVAKREWETIKE 621


>gi|12005896|gb|AAG44660.1|AF258662_1 AD035 [Homo sapiens]
          Length = 367

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 25/51 (49%), Positives = 36/51 (70%)

Query: 28  PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTC 78
           P++EP+  +N Y + D +DWKDLNLKF++ +YRDY L  D  FL  ++P C
Sbjct: 311 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVC 361


>gi|160934323|ref|ZP_02081710.1| hypothetical protein CLOLEP_03194 [Clostridium leptum DSM 753]
 gi|156866996|gb|EDO60368.1| hypothetical protein CLOLEP_03194 [Clostridium leptum DSM 753]
          Length = 806

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/167 (25%), Positives = 81/167 (48%), Gaps = 20/167 (11%)

Query: 111 LHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY-----KFDSSQSRH 165
           L +AAL    EM++ L DT   ++++    K S+    LLW G YY       ++ + ++
Sbjct: 535 LFLAALKAGKEMAAYLGDTAHAEKYAEAFRKGSEKMDSLLWGGDYYIQVIDDINAYKYQY 594

Query: 166 SDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMG-----AVN 220
               +ADQ+ G +    + +    + P  +I +A+ ++F  N +      +      A+N
Sbjct: 595 GTGCLADQVFGQMLAHLNHLGY--VLPEEHIKKAIHSVFQYNFKETMEDHVNVQRTYALN 652

Query: 221 GMK-------PNGDRDR-SAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
             K       P G R +   + S+EVW+G+ Y ++A +++EG V+E 
Sbjct: 653 QEKGLVLCSWPQGGRPKLPFVYSDEVWSGIEYQVAAHLIFEGFVEEG 699


>gi|269839491|ref|YP_003324183.1| hypothetical protein Tter_2468 [Thermobaculum terrenum ATCC
           BAA-798]
 gi|269791221|gb|ACZ43361.1| protein of unknown function DUF608 [Thermobaculum terrenum ATCC
           BAA-798]
          Length = 807

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 58/255 (22%), Positives = 105/255 (41%), Gaps = 51/255 (20%)

Query: 49  DLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELI-----------------RKCESFDKQ 91
           D  +  I+ + R++ L  D+ +L RV+P     +                 R+  ++D +
Sbjct: 456 DGQMGSILRVLREWRLSGDMEWLRRVWPGVKSALAYASAQWDGDGDGVLDGRQHNTYDVE 515

Query: 92  GLGIVQNGGFPDQTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLW 151
                    F      CG  ++AAL  + E++ +L +    ++      + S    +LLW
Sbjct: 516 ---------FYGPNPLCGLYYLAALRAVEELAEVLGEANLARRCREVFERGSHRLDELLW 566

Query: 152 TGSYYK----FDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTN 207
            G YY+     D+   +H    +ADQL G   L+A  +    + P  ++ +AL +I+  N
Sbjct: 567 NGEYYEQLGDEDAWPYQHGKGCLADQLLGQ--LQAHVLGMGYLVPEEHVRQALRSIWRYN 624

Query: 208 VRGFEAGSMGAVNGMKPNGDRDRSAI----------------QSEEVWTGVTYLLSAAML 251
              F A     VN  +     D + +                 S+EV+TG  Y ++A M+
Sbjct: 625 ---FRADLSEHVNCQRTFALNDEAGLLMCTWPRGGQPTFPFPYSDEVFTGSEYQVAAHMI 681

Query: 252 YEGNVDEAWTTAGGL 266
           YEG ++E     G +
Sbjct: 682 YEGLLEEGLQIVGAV 696


>gi|160931643|ref|ZP_02079037.1| hypothetical protein CLOLEP_00474 [Clostridium leptum DSM 753]
 gi|156869288|gb|EDO62660.1| hypothetical protein CLOLEP_00474 [Clostridium leptum DSM 753]
          Length = 804

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 51/242 (21%), Positives = 104/242 (42%), Gaps = 37/242 (15%)

Query: 49  DLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGG-------- 100
           D  L  I+ +YR++ +  D  FL +++   +  +    S+       V +G         
Sbjct: 471 DGQLGSILRVYREWKISGDDDFLKKIWNQVVSALEFSASYWDSNQDCVLDGQQHNTYDIE 530

Query: 101 FPDQTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--- 157
           F    +    ++ AAL    EM+  L + ++ Q++ +      +   ++L+ G YY+   
Sbjct: 531 FYGVNSLGNSIYYAALKAGAEMAEYLGEHERSQKWRSMEQAGCKRMDEMLFNGEYYRQVT 590

Query: 158 ---FDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAG 214
               D  + ++ +  ++DQL G       G+    +FP  ++  A+  I+  N +     
Sbjct: 591 DGDIDEYKYQYGEGCLSDQLLGQTLAHLYGLGH--LFPEDHVKSAVFAIYKYNFKE---- 644

Query: 215 SMGAVNGMK----------------PNGDRDRSA-IQSEEVWTGVTYLLSAAMLYEGNVD 257
            MG+   ++                P+G R +   + S+EVWTG+ Y ++A ++YEG + 
Sbjct: 645 RMGSHKSLQRGYAYQDEPGLLLCSWPSGGRPKQPFVYSDEVWTGIEYQVAAGLIYEGFLQ 704

Query: 258 EA 259
           E 
Sbjct: 705 EG 706


>gi|402589209|gb|EJW83141.1| hypothetical protein WUBG_05945 [Wuchereria bancrofti]
          Length = 90

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 26/59 (44%), Positives = 39/59 (66%)

Query: 188 EAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLL 246
            A+ P ++I  AL  I+S+N+  F  G +GAVNGM+ +G   R  +Q++E+W GVTY L
Sbjct: 22  RALLPLSHIKSALNQIYSSNILNFAGGRLGAVNGMRKDGTVGRRHLQADEMWVGVTYAL 80


>gi|86144926|ref|ZP_01063258.1| hypothetical protein MED222_10953 [Vibrio sp. MED222]
 gi|85837825|gb|EAQ55937.1| hypothetical protein MED222_10953 [Vibrio sp. MED222]
          Length = 1024

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 39/121 (32%), Positives = 60/121 (49%), Gaps = 8/121 (6%)

Query: 166 SDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMK 223
           SD+   D L    +LK  G+  E IFP  NI RAL  ++  N   FE  S  +G  N  +
Sbjct: 815 SDNSFGDALLADSYLKLIGL--EGIFPQENIQRALDYVYKHN---FEINSPKLGVANMTQ 869

Query: 224 PNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEG 283
            +G     A Q+++VW GV + ++ A+   G   +A T    +Y  +Y+ + + F  PEG
Sbjct: 870 ADG-SPHEAFQAQDVWIGVQFSVATALSLAGKSQQAETLMDTVYTALYDYSKIPFAAPEG 928

Query: 284 L 284
            
Sbjct: 929 F 929



 Score = 45.8 bits (107), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 36/148 (24%), Positives = 62/148 (41%), Gaps = 14/148 (9%)

Query: 32  PFSL-INGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           PF +  + Y+  +V +WKDL  K+I+ +YR Y   +DI  +   +    E I    +   
Sbjct: 596 PFDIQPDAYSWHNVKEWKDLAPKYILMVYRHYQNTQDISVVKECWQAVTESIDFLSNLIA 655

Query: 91  QGLGIVQNGGFPDQ---------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTK 141
           +G  +    G  D          + YC  L +A L    E++ L+ +++    +  +  K
Sbjct: 656 EGDDLPLTRGTDDTFDNLASHGISIYCASLWVAGLQAASELAQLMGESELASGYLTRSKK 715

Query: 142 ASQVYHDLLWTGS--YYKF--DSSQSRH 165
           A       LW     YY F     Q++H
Sbjct: 716 ALATVEQSLWDDKEGYYHFFVTPVQAKH 743


>gi|402583425|gb|EJW77369.1| hypothetical protein WUBG_11725, partial [Wuchereria bancrofti]
          Length = 109

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 25/57 (43%), Positives = 38/57 (66%)

Query: 190 IFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLL 246
           + P ++I  AL  I+S+N+  F  G +GAVNGM+ +G   R  +Q++E+W GVTY L
Sbjct: 1   LLPLSHIKSALNQIYSSNILNFAGGRLGAVNGMRKDGTVGRRHLQADEMWVGVTYAL 57


>gi|374921957|gb|AFA26156.1| non-lysosomal glucosylceramidase-like protein, partial [Lolium
           perenne]
          Length = 109

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 39/101 (38%), Positives = 51/101 (50%), Gaps = 10/101 (9%)

Query: 86  ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
           E FD+    +++N GFPDQT          AYCG L +A+L     M+  L      ++ 
Sbjct: 7   EQFDRDDDSMIENDGFPDQTYDAWTVLGVSAYCGCLWLASLQAAAAMARSLGHADYAERC 66

Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCG 176
             K  KA  V+   LW GSY+ +DS  S  S SI ADQL G
Sbjct: 67  MVKFAKAKHVFEAKLWNGSYFNYDSGTSYSSRSIQADQLAG 107


>gi|148976285|ref|ZP_01813009.1| hypothetical protein VSWAT3_18783 [Vibrionales bacterium SWAT-3]
 gi|145964379|gb|EDK29634.1| hypothetical protein VSWAT3_18783 [Vibrionales bacterium SWAT-3]
          Length = 982

 Score = 58.2 bits (139), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 50/176 (28%), Positives = 80/176 (45%), Gaps = 29/176 (16%)

Query: 111 LHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIM 170
           ++I  +A  V    LL++T  Q          +Q Y DL+          S +  SD+++
Sbjct: 739 INIGKVAQRVSKKRLLSETAPQ--------AFTQEYLDLV--------PDSDNSFSDALL 782

Query: 171 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMKPNGDR 228
           AD    YL L    +  E IFP   I RAL  ++  N   FE  S  +G  N  + +G  
Sbjct: 783 AD---SYLRL----IGLEGIFPQQRIQRALDYVYKHN---FEINSPKLGVANMTRADG-S 831

Query: 229 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGL 284
              A Q+++VW GV + ++ A+   G   +A T    +Y  +Y+ + + F  PEG 
Sbjct: 832 PHEAFQAQDVWIGVQFSVATALSLAGKSQQAETLMDTVYTALYDYSKIPFAAPEGF 887



 Score = 40.4 bits (93), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 35/148 (23%), Positives = 59/148 (39%), Gaps = 14/148 (9%)

Query: 32  PFSL-INGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           PF +  + Y+  +V +WKDL  K+I+ +YR Y   +D+  +   +    E I    +   
Sbjct: 554 PFDIQPDAYSWHNVKEWKDLAPKYILMVYRHYQNTQDMSVVKECWQAVTESIDFLSNLIA 613

Query: 91  QGLGIVQNGGFPDQ---------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTK 141
            G  +    G  D          + YC  L +A L    E++ L+ +      +  +  K
Sbjct: 614 DGDDLPLTRGTDDTFDNLASHGISIYCASLWVAGLQAASELAKLMGEHDLGAGYLTRSKK 673

Query: 142 ASQVYHDLLWTGS--YYKF--DSSQSRH 165
           A       LW     YY F     Q++H
Sbjct: 674 ALATVEQSLWDEKEGYYHFFVTPVQAKH 701


>gi|84386772|ref|ZP_00989797.1| hypothetical protein V12B01_19101 [Vibrio splendidus 12B01]
 gi|84378300|gb|EAP95158.1| hypothetical protein V12B01_19101 [Vibrio splendidus 12B01]
          Length = 908

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 39/121 (32%), Positives = 59/121 (48%), Gaps = 8/121 (6%)

Query: 166 SDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMK 223
           SD+   D L    +LK  G+  E IFP  NI RAL  ++  N   FE  S  +G  N   
Sbjct: 699 SDNSFGDALLADSYLKLIGL--EGIFPQENIQRALDYVYKHN---FEINSPKLGVANMTL 753

Query: 224 PNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEG 283
            +G     A Q+++VW GV + ++ A+   G   +A T    +Y  +Y+ + + F  PEG
Sbjct: 754 ADG-SPHEAFQAQDVWIGVQFSVATALSLAGKNQQAETLMDTVYTALYDYSKIPFAAPEG 812

Query: 284 L 284
            
Sbjct: 813 F 813



 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 36/148 (24%), Positives = 63/148 (42%), Gaps = 14/148 (9%)

Query: 32  PFSL-INGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           PF +  + Y+  +V +WKDL  K+I+ +YR Y   +DI  +   +    E I    +   
Sbjct: 480 PFDIQPDAYSWHNVKEWKDLAPKYILMVYRHYQNTQDISVVKECWQAVTESIDFLSNLIA 539

Query: 91  QGLGIVQNGGFPDQ---------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTK 141
           +G  +    G  D          + YC  L +A L    E++ L+ ++++   +  +  K
Sbjct: 540 EGDDLPLTRGTDDTFDNLASHGISIYCASLWVAGLQAASELAQLMGESERANGYLTRSKK 599

Query: 142 ASQVYHDLLWTGS--YYKF--DSSQSRH 165
           A       LW     YY F     Q++H
Sbjct: 600 ALATVEQSLWDDKEGYYHFFVTPVQAKH 627


>gi|15899746|ref|NP_344351.1| hypothetical protein SSO3039 [Sulfolobus solfataricus P2]
 gi|13816440|gb|AAK43141.1| Hypothetical protein SSO3039 [Sulfolobus solfataricus P2]
          Length = 803

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 53/180 (29%), Positives = 81/180 (45%), Gaps = 24/180 (13%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
           + Y   L IAAL  + E S LLN+    +    +LT+    Y   LW G YY   +D   
Sbjct: 547 SPYVSFLWIAALEALNEASKLLNNLHNYE--VKELTEKVNKY---LWNGEYYINWYDPIS 601

Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNG- 221
           +   DS+ A Q+ G  +++  G+      P         +IFS+ ++       G  NG 
Sbjct: 602 NLRDDSLNASQMTGDWYVQLLGL------PEFLDHEKRKSIFSSIIKYNYTEEEGVKNGS 655

Query: 222 ----MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG 277
               + P G +   ++QS+  W+G+ Y L++ M Y G  DE    A  + R VYER  L 
Sbjct: 656 SNKEITPLGVK--LSVQSKTPWSGIEYYLASHMFYNG-FDEY---AKKILRNVYERYELA 709


>gi|284174574|ref|ZP_06388543.1| hypothetical protein Ssol98_07941 [Sulfolobus solfataricus 98/2]
 gi|356934758|gb|AET42959.1| hypothetical protein [Sulfolobus solfataricus 98/2]
          Length = 803

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 53/180 (29%), Positives = 81/180 (45%), Gaps = 24/180 (13%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
           + Y   L IAAL  + E S LLN+    +    +LT+    Y   LW G YY   +D   
Sbjct: 547 SPYVSFLWIAALEALNEASKLLNNLHNYE--VKELTEKVNKY---LWNGEYYINWYDPIS 601

Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNG- 221
           +   DS+ A Q+ G  +++  G+      P         +IFS+ ++       G  NG 
Sbjct: 602 NLRDDSLNASQMTGDWYVQLLGL------PEFLDHEKRKSIFSSIIKYNYTEEEGVKNGS 655

Query: 222 ----MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG 277
               + P G +   ++QS+  W+G+ Y L++ M Y G  DE    A  + R VYER  L 
Sbjct: 656 SNKEITPLGVK--LSVQSKTPWSGIEYYLASHMFYNG-FDEY---AKKILRNVYERYELA 709


>gi|218677125|ref|YP_002395944.1| hypothetical protein VS_II1384 [Vibrio splendidus LGP32]
 gi|218325393|emb|CAV27494.1| hypothetical protein VS_II1384 [Vibrio splendidus LGP32]
          Length = 1069

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 40/138 (28%), Positives = 63/138 (45%), Gaps = 8/138 (5%)

Query: 166 SDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMK 223
           SD+   D L    +LK  G+  + IFP  NI RAL  ++  N   FE  S  +G  N   
Sbjct: 860 SDNSFGDALLADSYLKLIGL--DGIFPQENIQRALDYVYKHN---FEINSPKLGVANMTL 914

Query: 224 PNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEG 283
            +G     A Q+++VW GV + ++ A+   G   +A T    +Y  +Y+ + + F  PEG
Sbjct: 915 ADG-SPHEAFQAQDVWIGVQFSVATALSLAGKNQQAETLMDTVYTALYDYSKIPFAAPEG 973

Query: 284 LTGDKTYRSGGYMRALAV 301
                +      M A  +
Sbjct: 974 FNCSVSVNEKDLMEAFKL 991



 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 37/148 (25%), Positives = 63/148 (42%), Gaps = 14/148 (9%)

Query: 32  PFSL-INGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           PF +  + Y+  +V +WKDL  K+I+ +YR Y   +DI  +   +    E I    +   
Sbjct: 641 PFDIQPDAYSWHNVKEWKDLAPKYILMVYRHYQNTQDISVVKECWQAVTESIDFLSNLIA 700

Query: 91  QGLGIVQNGGFPDQ---------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTK 141
           +G  +    G  D          + YC  L +A L    E++ L+N+++    +  +  K
Sbjct: 701 EGDDLPLTRGTDDTFDNLASHGISIYCASLWVAGLQAASELAKLMNESELAAGYLTRSKK 760

Query: 142 ASQVYHDLLWTGS--YYKF--DSSQSRH 165
           A       LW     YY F     Q++H
Sbjct: 761 ALATVEQSLWDDKEGYYHFFVTPVQAKH 788


>gi|374815490|ref|ZP_09719227.1| hypothetical protein TpriZ_16630 [Treponema primitia ZAS-1]
          Length = 814

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 57/237 (24%), Positives = 95/237 (40%), Gaps = 30/237 (12%)

Query: 55  IVSIYRDYALHKDIRFLTRVYPTC-LELIRKCESFDKQGLGIVQNGGFPDQTAYCGGL-- 111
           IV +YRD+ +  D  FL  ++P   L L     ++D  G  ++      D      G+  
Sbjct: 478 IVRLYRDWLISGDDAFLKELWPKAKLALDYGIRTWDTDGDFVLDGMKHVDYDVEFYGIDP 537

Query: 112 -----HIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKF-------D 159
                ++AAL   V+++  L D     ++      AS      +W      F       D
Sbjct: 538 LGNLCYLAALKAAVKIAEYLKDEGAHNRYQTIYDTASVQADKFMWDSKAGYFIQVLKDVD 597

Query: 160 SSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNV----------- 208
             + +H    +ADQL G  +   +G+    +  + +I +A  +IF  N            
Sbjct: 598 EFKYQHGTGCLADQLIGQFYAHLTGLG--PLVDAGHIKQAAQSIFKYNFISDFTDHPNMQ 655

Query: 209 RGFEAGS-MGAVNGMKPNGDRDRSAI-QSEEVWTGVTYLLSAAMLYEGNVDEAWTTA 263
           RG+      G +    P G R R     SEE W+   Y +++ ++YEG +DE  T A
Sbjct: 656 RGYAVNDDKGLLMTTWPLGGRPRYPFFYSEEAWSRTEYHVASTLIYEGFLDEGLTIA 712


>gi|15898194|ref|NP_342799.1| hypothetical protein SSO1353 [Sulfolobus solfataricus P2]
 gi|13814563|gb|AAK41589.1| Hypothetical protein SSO1353 [Sulfolobus solfataricus P2]
          Length = 663

 Score = 57.4 bits (137), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 48/169 (28%), Positives = 76/169 (44%), Gaps = 14/169 (8%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLEL----IRKCESFDKQGLGIVQNGG-- 100
           WKD+N   I+ +YR +    DI FL  VYP  +++    +R+C+       G + N    
Sbjct: 404 WKDMNPSLILLVYRYFKFTNDIEFLKEVYPILVKVMDWELRQCKGNLPFMEGEMDNAFDA 463

Query: 101 --FPDQTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKF 158
                  +Y   L I +L  M E++ L+ D+      S KL+ A + +  +   G Y+K 
Sbjct: 464 TIIKGHDSYTSSLFIGSLIAMREIAKLVGDSNYVDFISEKLSSAREAFRRMF-NGRYFK- 521

Query: 159 DSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTN 207
            +  S  + S +A QL G  F    G+  E I     I +AL +I   N
Sbjct: 522 -AWDSVDNASFLA-QLYGEWFTTLVGL--EDIVEEDIIKKALESIIRLN 566


>gi|417950561|ref|ZP_12593680.1| hypothetical protein VISP3789_13285 [Vibrio splendidus ATCC 33789]
 gi|342806239|gb|EGU41472.1| hypothetical protein VISP3789_13285 [Vibrio splendidus ATCC 33789]
          Length = 1046

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 38/121 (31%), Positives = 58/121 (47%), Gaps = 8/121 (6%)

Query: 166 SDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMK 223
           SD+   D L    +LK  G+  E IFP   I RAL  ++  N   FE  S  +G  N   
Sbjct: 837 SDNSFGDALLADSYLKLIGL--EGIFPKQRIQRALDYVYKHN---FEINSPKLGVANMTL 891

Query: 224 PNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEG 283
            +G     A Q+++VW GV + ++ A+   G   +A T    +Y  +Y+ + + F  PEG
Sbjct: 892 ADG-SPHEAFQAQDVWIGVQFSVATALSLAGKSQQAETLMDTVYTALYDYSKIPFAAPEG 950

Query: 284 L 284
            
Sbjct: 951 F 951



 Score = 43.1 bits (100), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 36/148 (24%), Positives = 60/148 (40%), Gaps = 14/148 (9%)

Query: 32  PFSL-INGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           PF +  + Y+  +V +WKDL  K+I+ +YR Y   +DI  +   +    E I    +   
Sbjct: 618 PFDIQPDAYSWHNVKEWKDLAPKYILMVYRHYQKTQDISVVKECWQAVTESIDFLSNLIA 677

Query: 91  QGLGIVQNGGFPDQ---------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTK 141
           +G  +    G  D          + YC  L +A L    E++ L+ +      +  +  K
Sbjct: 678 EGDDLPLTRGTDDTFDNLASHGISIYCASLWVAGLQAASELAKLMGEHDLGAGYLTRSKK 737

Query: 142 ASQVYHDLLWTGS--YYKF--DSSQSRH 165
           A       LW     YY F     Q++H
Sbjct: 738 ALATVEQSLWDEKEGYYHFFVTPVQAKH 765


>gi|407069309|ref|ZP_11100147.1| hypothetical protein VcycZ_07137 [Vibrio cyclitrophicus ZF14]
          Length = 1024

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 38/121 (31%), Positives = 58/121 (47%), Gaps = 8/121 (6%)

Query: 166 SDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMK 223
           SD+   D L    +LK  G+  E IFP   I RAL  ++  N   FE  S  +G  N   
Sbjct: 815 SDNSFGDALLADSYLKLIGL--EGIFPEDRIQRALDYVYKHN---FEINSPKLGVANMTL 869

Query: 224 PNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEG 283
            +G     A Q+++VW GV + ++ A+   G   +A T    +Y  +Y+ + + F  PEG
Sbjct: 870 ADG-SPHEAFQAQDVWIGVQFSVATALNLAGKSQQAETLMDTVYTALYDYSKIPFAAPEG 928

Query: 284 L 284
            
Sbjct: 929 F 929



 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 37/148 (25%), Positives = 63/148 (42%), Gaps = 14/148 (9%)

Query: 32  PFSL-INGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
           PF +  + Y+  +V +WKDL  K+I+ +YR Y   +D+  +   +    E I    +   
Sbjct: 596 PFDIQPDAYSWHNVKEWKDLAPKYILMVYRHYQNTQDLSVVKECWQAVTESIEFLSNLVA 655

Query: 91  QGLGIVQNGGFPDQ---------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTK 141
           +G  +    G  D          + YC  L +A L    E++ L+N+ +    + A+  K
Sbjct: 656 EGDDLPLTRGTDDTFDNLASHGISIYCASLWVAGLQAANELALLMNENELAADYLARSKK 715

Query: 142 ASQVYHDLLWTGS--YYKF--DSSQSRH 165
           A       LW     YY F     Q++H
Sbjct: 716 ALATVEQSLWDDKEGYYHFFVTPVQAKH 743


>gi|302348409|ref|YP_003816047.1| trehalase [Acidilobus saccharovorans 345-15]
 gi|302328821|gb|ADL19016.1| Putative trehalase [Acidilobus saccharovorans 345-15]
          Length = 928

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 48/173 (27%), Positives = 80/173 (46%), Gaps = 25/173 (14%)

Query: 149 LLWTGSYYK--FDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFST 206
           LLW GSY+   +D        +I++ QL G  +L+  G+       S  +  AL  ++  
Sbjct: 656 LLWNGSYFDDWYDPESGLRDRAILSAQLTGEWYLQVLGLGDGV--DSEKVRSALREVYRA 713

Query: 207 NVRGFEAGSMGAVNGM----------KPNGDR--DRSAIQSEEVWTGVTYLLSAAMLYEG 254
           N R +E    G   G           +PNG +  +R + Q++  WTGV + +++ MLYEG
Sbjct: 714 NFRRWEGLLNGTYPGTPRPSMVGDVEEPNGTKILNRVSSQADTPWTGVEFGVASQMLYEG 773

Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
            V+EA      + R++++R         GL  +     G Y R LA  A+ ++
Sbjct: 774 MVEEAME----VLRSIHDRY-----RSWGLYFNHLECDGHYSRPLAALAIPNS 817


>gi|385774246|ref|YP_005646813.1| bile acid beta-glucosidase [Sulfolobus islandicus HVE10/4]
 gi|323478361|gb|ADX83599.1| bile acid beta-glucosidase [Sulfolobus islandicus HVE10/4]
          Length = 818

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 86/203 (42%), Gaps = 21/203 (10%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
           + Y   L I+AL  + E S LLN+ Q  +     + +  +  +  LW G YY   +D   
Sbjct: 562 SPYVSFLWISALEALNEASKLLNNPQNYE-----VKELIEKVNKYLWNGEYYMDWYDPIS 616

Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNG- 221
           +   DS  A Q+ G  +++   ++              ++I   N  G E    G+ N  
Sbjct: 617 NLRDDSSNASQITGDWYVQM--LDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDD 674

Query: 222 MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETP 281
           + P G +   +IQS+  W+GV Y L++ M Y G  DE    A  + R VYER  +     
Sbjct: 675 ITPLGVK--LSIQSKAPWSGVEYYLASHMFYSG-FDEY---AKKILRNVYERYEIA---- 724

Query: 282 EGLTGDKTYRSGGYMRALAVYAM 304
            G   +       YMR L   +M
Sbjct: 725 -GSFWNHIEWGARYMRPLVALSM 746


>gi|261417086|ref|YP_003250769.1| glucosylceramidase [Fibrobacter succinogenes subsp. succinogenes
           S85]
 gi|385791890|ref|YP_005823013.1| hypothetical protein FSU_3272 [Fibrobacter succinogenes subsp.
           succinogenes S85]
 gi|261373542|gb|ACX76287.1| Glucosylceramidase [Fibrobacter succinogenes subsp. succinogenes
           S85]
 gi|302325406|gb|ADL24607.1| conserved domain protein [Fibrobacter succinogenes subsp.
           succinogenes S85]
          Length = 1047

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 58/121 (47%), Gaps = 4/121 (3%)

Query: 167 DSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 226
           D + AD +    +L+   ++ E I  SA     L  IF+TN +   +  +GA N ++ +G
Sbjct: 834 DDVFADTMLADTYLRL--LDLEPITDSAKAKSNLLRIFNTNYKA-NSPLIGAANLVRKDG 890

Query: 227 DR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLT 285
              D    Q+ +VW G+ Y +  AM++ G   EA   A  +   +Y+   + F  PEG  
Sbjct: 891 SPLDEFNFQAHDVWIGIQYSIMTAMMFHGLEKEASVLADSMIGNLYDEARVPFAAPEGFN 950

Query: 286 G 286
           G
Sbjct: 951 G 951



 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/133 (25%), Positives = 60/133 (45%), Gaps = 11/133 (8%)

Query: 37  NGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIV 96
           + Y+  +V +WKDL  K+++ +YR Y   KD++ L          +   E     G    
Sbjct: 614 DAYDWHNVKEWKDLAPKYVLMVYRHYHKTKDMQCLADCKEAVYAAMEYLEKMVNPGENFP 673

Query: 97  QNGGFPDQ---------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYH 147
              G  D          + YCG L IA L    +++ LL D +Q  +++ K   A++ + 
Sbjct: 674 LTHGTDDTFDNLCSYGISVYCGSLWIAGLRAAAKIAELLGDNEQAAKWNEKSEAANKEFT 733

Query: 148 DLLWTGS--YYKF 158
           + LW  +  Y++F
Sbjct: 734 ESLWDENEGYFRF 746


>gi|284997264|ref|YP_003419031.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
 gi|284445159|gb|ADB86661.1| protein of unknown function DUF608 [Sulfolobus islandicus L.D.8.5]
          Length = 740

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 51/213 (23%), Positives = 90/213 (42%), Gaps = 19/213 (8%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
           WKD N  FI+ +YR Y    D+ FL  +YP  L+ ++   + DK   G+ +  G  D   
Sbjct: 474 WKDTNPTFILLVYRYYKFTNDLDFLKEMYPKMLKAMKWELTQDKDKDGVPELEGQGDTGF 533

Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
                    +Y   ++IA++  + E + +L D     + +  L K+ +++   L+ G Y 
Sbjct: 534 DAMSVKGIDSYTTSIYIASIIALKETAKILKDNDTLNEMTILLEKSRKIFSR-LFNGKY- 591

Query: 157 KFDS-SQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS 215
            FD          I   QL G  + +  G+  E I     I+ AL  + S N     + +
Sbjct: 592 -FDPWIGEPEIKGIFLGQLVGEWWSEILGL--EPIIEEEKISSALEAMLSINAN---SSN 645

Query: 216 MGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSA 248
               N +  +G     + QS   W  + + +S+
Sbjct: 646 FCTPNIVSEDGKIIDISPQSYSSWPRLVFAISS 678


>gi|385776902|ref|YP_005649470.1| bile acid beta-glucosidase [Sulfolobus islandicus REY15A]
 gi|323475650|gb|ADX86256.1| bile acid beta-glucosidase [Sulfolobus islandicus REY15A]
          Length = 818

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 86/203 (42%), Gaps = 21/203 (10%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
           + Y   L I+AL  + E S LLN+ Q  +     + +  +  +  LW G YY   +D   
Sbjct: 562 SPYVSFLWISALEALNEASKLLNNPQNYE-----VKELIEKVNKYLWNGEYYMDWYDPIS 616

Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNG- 221
           +   DS  A Q+ G  +++   ++              ++I   N  G E    G+ N  
Sbjct: 617 NLRDDSSNASQITGDWYVQM--LDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDD 674

Query: 222 MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETP 281
           + P G +   +IQS+  W+GV Y L++ M Y G  DE    A  + R VYER  +     
Sbjct: 675 ITPLGVK--LSIQSKAPWSGVEYYLASHMFYSG-FDEY---AKKILRNVYERYEIA---- 724

Query: 282 EGLTGDKTYRSGGYMRALAVYAM 304
            G   +       YMR L   +M
Sbjct: 725 -GSFWNHIEWGARYMRPLVALSM 746


>gi|15898746|ref|NP_343351.1| hypothetical protein SSO1948 [Sulfolobus solfataricus P2]
 gi|229578524|ref|YP_002836922.1| hypothetical protein YG5714_0710 [Sulfolobus islandicus Y.G.57.14]
 gi|229582969|ref|YP_002841368.1| hypothetical protein YN1551_2495 [Sulfolobus islandicus Y.N.15.51]
 gi|284173504|ref|ZP_06387473.1| hypothetical protein Ssol98_02460 [Sulfolobus solfataricus 98/2]
 gi|384435081|ref|YP_005644439.1| hypothetical protein [Sulfolobus solfataricus 98/2]
 gi|13815223|gb|AAK42141.1| Hypothetical protein SSO1948 [Sulfolobus solfataricus P2]
 gi|228009238|gb|ACP45000.1| protein of unknown function DUF608 [Sulfolobus islandicus
           Y.G.57.14]
 gi|228013685|gb|ACP49446.1| protein of unknown function DUF608 [Sulfolobus islandicus
           Y.N.15.51]
 gi|261603235|gb|ACX92838.1| protein of unknown function DUF608 [Sulfolobus solfataricus 98/2]
          Length = 661

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 56/213 (26%), Positives = 92/213 (43%), Gaps = 27/213 (12%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLEL----IRKCESFDKQGLGIVQ---NG 99
           WKD+N   I+ +YR +    DI FL  VYPT +++    +R+C    + GL  ++   + 
Sbjct: 402 WKDMNPSLILLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQC----RDGLPFMEGEMDN 457

Query: 100 GFPDQT------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTG 153
            F D T      +Y   L IA+L  M E++ L+ D+      + KL  A + +  +   G
Sbjct: 458 AF-DATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLNVAREAFRKMF-NG 515

Query: 154 SYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEA 213
            Y+K  +       S +A QL G  F   + +  E I     I  AL +I   N     A
Sbjct: 516 KYFK--AWDGVDKASFLA-QLYGEWF--TTLLELENIVDENMIKSALESIIRLNGN---A 567

Query: 214 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLL 246
                 N +  NG     ++Q+   W  + + +
Sbjct: 568 SPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAI 600


>gi|227827237|ref|YP_002829016.1| hypothetical protein M1425_0924 [Sulfolobus islandicus M.14.25]
 gi|227459032|gb|ACP37718.1| protein of unknown function DUF608 [Sulfolobus islandicus M.14.25]
          Length = 661

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 55/213 (25%), Positives = 93/213 (43%), Gaps = 27/213 (12%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLEL----IRKCESFDKQGLGIVQ---NG 99
           WKD+N   ++ +YR +    DI FL  VYPT +++    +R+C    + GL  ++   + 
Sbjct: 402 WKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQC----RDGLPFMEGEMDN 457

Query: 100 GFPDQT------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTG 153
            F D T      +Y   L IA+L  M E++ L+ D+      + KL+ A + +  +   G
Sbjct: 458 AF-DATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLSVAREAFRKMF-NG 515

Query: 154 SYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEA 213
            Y+K  +       S +A QL G  F   + +  E I     I  AL +I   N     A
Sbjct: 516 KYFK--AWDGVDKASFLA-QLYGEWF--TTLLELENIVDENMIKSALESIIRLNGN---A 567

Query: 214 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLL 246
                 N +  NG     ++Q+   W  + + +
Sbjct: 568 SPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAI 600


>gi|227829616|ref|YP_002831395.1| hypothetical protein LS215_0672 [Sulfolobus islandicus L.S.2.15]
 gi|227456063|gb|ACP34750.1| protein of unknown function DUF608 [Sulfolobus islandicus L.S.2.15]
          Length = 661

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 56/213 (26%), Positives = 92/213 (43%), Gaps = 27/213 (12%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLEL----IRKCESFDKQGLGIVQ---NG 99
           WKD+N   I+ +YR +    DI FL  VYPT +++    +R+C    + GL  ++   + 
Sbjct: 402 WKDMNPSLILLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQC----RDGLPFMEGEMDN 457

Query: 100 GFPDQT------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTG 153
            F D T      +Y   L IA+L  M E++ L+ D+      + KL  A + +  +   G
Sbjct: 458 AF-DATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLNVAREAFRKMF-NG 515

Query: 154 SYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEA 213
            Y+K  +       S +A QL G  F   + +  E I     I  AL +I   N     A
Sbjct: 516 KYFK--AWDGVDKASFLA-QLYGEWF--TTLLELENIVDENMIKSALESIIRLNGN---A 567

Query: 214 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLL 246
                 N +  NG     ++Q+   W  + + +
Sbjct: 568 SPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAI 600


>gi|159041841|ref|YP_001541093.1| hypothetical protein Cmaq_1276 [Caldivirga maquilingensis IC-167]
 gi|157920676|gb|ABW02103.1| protein of unknown function DUF608 [Caldivirga maquilingensis
           IC-167]
          Length = 688

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 34/121 (28%), Positives = 54/121 (44%), Gaps = 11/121 (9%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
           WKDLN  +I+ ++R +    D+ F+  +YP  ++        DK G G+ +  G  D   
Sbjct: 412 WKDLNSTYILLVHRYFKRSNDVEFIKEIYPKLIKAFEWVLVQDKDGDGVPELSGDGDTGF 471

Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
                    +Y   L IAAL  M E++ L+ D     +  + L KA   Y +  W G  +
Sbjct: 472 DAMSVKGFDSYTTSLWIAALMVMGELAKLMGDQATLSKVESTLLKARDSY-NRRWLGDRF 530

Query: 157 K 157
           K
Sbjct: 531 K 531


>gi|238619389|ref|YP_002914214.1| hypothetical protein M164_0933 [Sulfolobus islandicus M.16.4]
 gi|238380458|gb|ACR41546.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.4]
          Length = 661

 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 54/213 (25%), Positives = 91/213 (42%), Gaps = 27/213 (12%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLEL----IRKCESFDKQGLGIVQ---NG 99
           WKD+N   ++ +YR +    DI FL  VYPT +++    +R+C    + GL  ++   + 
Sbjct: 402 WKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMNWELRQC----RDGLPFMEGEMDN 457

Query: 100 GFPDQT------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTG 153
            F D T      +Y   L IA+L  M E++ L+ D+      + KL+ A + +  +   G
Sbjct: 458 AF-DATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLSVAREAFRKMF-NG 515

Query: 154 SYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEA 213
            Y+K      + S      QL G  F   + +  E I     I  AL +I   N     A
Sbjct: 516 KYFKAWDGVDKAS---FLAQLYGEWF--TTLLELENIVDENMIKSALESIIRLNGN---A 567

Query: 214 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLL 246
                 N +  NG     ++Q+   W  + + +
Sbjct: 568 SPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAI 600


>gi|229584458|ref|YP_002842959.1| hypothetical protein M1627_0991 [Sulfolobus islandicus M.16.27]
 gi|228019507|gb|ACP54914.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.27]
          Length = 661

 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 55/213 (25%), Positives = 93/213 (43%), Gaps = 27/213 (12%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLEL----IRKCESFDKQGLGIVQ---NG 99
           WKD+N   ++ +YR +    DI FL  VYPT +++    +R+C    + GL  ++   + 
Sbjct: 402 WKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQC----RDGLPFMEGEMDN 457

Query: 100 GFPDQT------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTG 153
            F D T      +Y   L IA+L  M E++ L+ D+      + KL+ A + +  +   G
Sbjct: 458 AF-DATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLSVAREAFRKMF-NG 515

Query: 154 SYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEA 213
            Y+K  +       S +A QL G  F   + +  E I     I  AL +I   N     A
Sbjct: 516 KYFK--AWDGVDKASFLA-QLYGEWF--TTLLELENIVDENMIKSALESIIRLNGN---A 567

Query: 214 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLL 246
                 N +  NG     ++Q+   W  + + +
Sbjct: 568 SPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAI 600


>gi|385775519|ref|YP_005648087.1| hypothetical protein [Sulfolobus islandicus REY15A]
 gi|323474267|gb|ADX84873.1| conserved hypothetical protein [Sulfolobus islandicus REY15A]
          Length = 661

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 55/213 (25%), Positives = 93/213 (43%), Gaps = 27/213 (12%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLEL----IRKCESFDKQGLGIVQ---NG 99
           WKD+N   ++ +YR +    DI FL  VYPT +++    +R+C    + GL  ++   + 
Sbjct: 402 WKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQC----RDGLPFMEGEMDN 457

Query: 100 GFPDQT------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTG 153
            F D T      +Y   L IA+L  M E++ L+ D+      + KL+ A + +  +   G
Sbjct: 458 AF-DATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLSVAREAFRKMF-NG 515

Query: 154 SYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEA 213
            Y+K  +       S +A QL G  F   + +  E I     I  AL +I   N     A
Sbjct: 516 KYFK--AWDGVDKASFLA-QLYGEWF--TTLLELENIADENMIKSALESIIRLNGN---A 567

Query: 214 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLL 246
                 N +  NG     ++Q+   W  + + +
Sbjct: 568 SPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAI 600


>gi|385772401|ref|YP_005644967.1| hypothetical protein [Sulfolobus islandicus HVE10/4]
 gi|323476515|gb|ADX81753.1| conserved hypothetical protein [Sulfolobus islandicus HVE10/4]
          Length = 661

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 62/124 (50%), Gaps = 19/124 (15%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLEL----IRKCESFDKQGLGIVQ---NG 99
           WKD+N   ++ +YR +    DI FL  VYPT +++    +R+C    + GL  ++   + 
Sbjct: 402 WKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQC----RDGLPFMEGEMDN 457

Query: 100 GFPDQT------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTG 153
            F D T      +Y   L IA+L  M E++ L+ D+      + KL+ A + +  +   G
Sbjct: 458 AF-DATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLSVAREAFRKMF-NG 515

Query: 154 SYYK 157
            Y+K
Sbjct: 516 KYFK 519


>gi|327309977|ref|YP_004336874.1| hypothetical protein TUZN_0057 [Thermoproteus uzoniensis 768-20]
 gi|326946456|gb|AEA11562.1| hypothetical protein TUZN_0057 [Thermoproteus uzoniensis 768-20]
          Length = 694

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 55/229 (24%), Positives = 93/229 (40%), Gaps = 19/229 (8%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
           WKD N  FI+ +YR Y    DI F+  VY   +   R     D+ G G+ +  G  D   
Sbjct: 414 WKDTNPTFILLVYRYYKRTGDIDFVKSVYGRLVAAYRWMRGQDRDGDGLPELEGSGDSGF 473

Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
                   +AY   L IAAL  + +++  + D++  ++  + L +A   +       SY 
Sbjct: 474 DCAKIRGPSAYTSSLWIAALVALRDLAEAVGDSKTAEEAGSDLERARNSFERSFDGVSYK 533

Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
            +D +      ++ A    G  +L  +G + E       +A AL+ +   N       + 
Sbjct: 534 AWDGAP---EGAVFAALAVGDWWLYLAG-SREGAARRDRLASALSAVLRVNGSASANCAP 589

Query: 217 GAVNGMKPNGD---RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTT 262
            A  G +P  D    +R   Q    W  + + + AA+ Y    D AW +
Sbjct: 590 NASTG-RPEADGEISERVGRQLTASWPRLIFSV-AALGYALTGDAAWLS 636


>gi|15921102|ref|NP_376771.1| hypothetical protein ST0868 [Sulfolobus tokodaii str. 7]
 gi|15621887|dbj|BAB65880.1| putative glycosidase [Sulfolobus tokodaii str. 7]
          Length = 683

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 46/92 (50%), Gaps = 10/92 (10%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
           WKD N  F++ +YR Y   KD  FL  VYP   + +    S DK G G+ +  G  DQ  
Sbjct: 410 WKDTNTTFVLMVYRYYLRTKDKEFLKSVYPYVKKAMSWIISKDKDGDGLPEVDGSTDQGF 469

Query: 105 --------TAYCGGLHIAALACMVEMSSLLND 128
                    +Y   ++IAAL  M++++ ++ D
Sbjct: 470 DCVPIEGVCSYISTVYIAALEAMIKIAEIVGD 501


>gi|229581136|ref|YP_002839535.1| hypothetical protein YN1551_0454 [Sulfolobus islandicus Y.N.15.51]
 gi|228011852|gb|ACP47613.1| protein of unknown function DUF608 [Sulfolobus islandicus
           Y.N.15.51]
          Length = 816

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 54/203 (26%), Positives = 85/203 (41%), Gaps = 21/203 (10%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
           + Y   L I+AL  + E S LL++ Q  +     + +  +  +  LW G YY   +D   
Sbjct: 560 SPYVSFLWISALEALNEASKLLSNPQNYE-----VKELIEKVNKYLWNGEYYMDWYDPIS 614

Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNG- 221
           +   DS  A Q+ G  +++   ++              ++I   N  G E    G+ N  
Sbjct: 615 NLRDDSSNASQITGDWYVQM--LDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDD 672

Query: 222 MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETP 281
           + P G +   +IQS+  W+GV Y L+  M Y G  DE    A  + R VYER  +     
Sbjct: 673 ITPLGVK--LSIQSKAPWSGVEYYLALHMFYSG-FDEY---AKKILRNVYERYEIA---- 722

Query: 282 EGLTGDKTYRSGGYMRALAVYAM 304
            G   +       YMR L   +M
Sbjct: 723 -GNFWNHIEWGARYMRPLVALSM 744


>gi|410638454|ref|ZP_11349015.1| hypothetical protein GLIP_3609 [Glaciecola lipolytica E3]
 gi|410141863|dbj|GAC16220.1| hypothetical protein GLIP_3609 [Glaciecola lipolytica E3]
          Length = 1124

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 49/234 (20%), Positives = 97/234 (41%), Gaps = 26/234 (11%)

Query: 49  DLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIR-KCESFDKQGLGIVQN-------GG 100
           D +   I++ YR++    +  +L   +P   + +      +D   +G + N       G 
Sbjct: 720 DGHFGTILNTYREHQCSANDSWLKSQWPRVKKAVEWGINEWDPNRIGFMSNTQHNTLDGN 779

Query: 101 FPDQTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDS 160
               +++ G L++ +L     M+ ++ + Q  +++ A      ++ +  LW G YY  + 
Sbjct: 780 ITGCSSWIGSLYLTSLEAAARMADIVGEPQTAKEYRAIRESGKKLQNKRLWNGEYYIQEE 839

Query: 161 SQSRHSDSI---MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRG-FEAGSM 216
            + R  D +     DQL G  +  A  +N +  +P     +A++ +   N R  F   S+
Sbjct: 840 GKERFQDYLDGCHIDQLLGEWW--ADQLNIDRNYPRERAQQAMSALLKYNFRADFYGQSL 897

Query: 217 GAVN-------GMK----PNGDRDRSAIQ-SEEVWTGVTYLLSAAMLYEGNVDE 258
                      GMK    P G +    ++  +EV TG  Y  +  M+  G +DE
Sbjct: 898 KPRQYCEIEDGGMKMITWPRGQQPIPGMKYGDEVMTGFEYGAAVTMMQNGMIDE 951


>gi|284998829|ref|YP_003420597.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
 gi|284446725|gb|ADB88227.1| hypothetical protein LD85_2606 [Sulfolobus islandicus L.D.8.5]
          Length = 803

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 54/203 (26%), Positives = 85/203 (41%), Gaps = 21/203 (10%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
           + Y   L I+AL  + E S LL++ Q  +     + +  +  +  LW G YY   +D   
Sbjct: 547 SPYVSFLWISALEALNEASKLLSNPQNYE-----VKELIEKVNKYLWNGEYYMDWYDPIS 601

Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNG- 221
           +   DS  A Q+ G  +++   ++              ++I   N  G E    G+ N  
Sbjct: 602 NLRDDSSNASQITGDWYVQM--LDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDD 659

Query: 222 MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETP 281
           + P G +   +IQS+  W+GV Y L+  M Y G  DE    A  + R VYER  +     
Sbjct: 660 ITPLGVK--LSIQSKAPWSGVEYYLALHMFYSG-FDEY---AKKILRNVYERYEIA---- 709

Query: 282 EGLTGDKTYRSGGYMRALAVYAM 304
            G   +       YMR L   +M
Sbjct: 710 -GNFWNHIEWGARYMRPLVALSM 731


>gi|227831316|ref|YP_002833096.1| hypothetical protein LS215_2489 [Sulfolobus islandicus L.S.2.15]
 gi|229580221|ref|YP_002838621.1| hypothetical protein YG5714_2457 [Sulfolobus islandicus Y.G.57.14]
 gi|227457764|gb|ACP36451.1| protein of unknown function DUF608 [Sulfolobus islandicus L.S.2.15]
 gi|228010937|gb|ACP46699.1| protein of unknown function DUF608 [Sulfolobus islandicus
           Y.G.57.14]
          Length = 818

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 54/203 (26%), Positives = 85/203 (41%), Gaps = 21/203 (10%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
           + Y   L I+AL  + E S LL++ Q  +     + +  +  +  LW G YY   +D   
Sbjct: 562 SPYVSFLWISALEALNEASKLLSNPQNYE-----VKELIEKVNKYLWNGEYYMDWYDPIS 616

Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNG- 221
           +   DS  A Q+ G  +++   ++              ++I   N  G E    G+ N  
Sbjct: 617 NLRDDSSNASQITGDWYVQM--LDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDD 674

Query: 222 MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETP 281
           + P G +   +IQS+  W+GV Y L+  M Y G  DE    A  + R VYER  +     
Sbjct: 675 ITPLGVK--LSIQSKAPWSGVEYYLALHMFYSG-FDEY---AKKILRNVYERYEIA---- 724

Query: 282 EGLTGDKTYRSGGYMRALAVYAM 304
            G   +       YMR L   +M
Sbjct: 725 -GNFWNHIEWGARYMRPLVALSM 746


>gi|229585796|ref|YP_002844298.1| hypothetical protein M1627_2396 [Sulfolobus islandicus M.16.27]
 gi|228020846|gb|ACP56253.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.27]
          Length = 818

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 54/203 (26%), Positives = 85/203 (41%), Gaps = 21/203 (10%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
           + Y   L I+AL  + E S LLN+ Q  +     + +  +  +  LW G YY   +D   
Sbjct: 562 SPYVSFLWISALEALNEASKLLNNPQNYE-----VKELIEKVNKYLWNGEYYMDWYDPIS 616

Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNG- 221
           +   DS  A Q+ G  +++   ++              ++I   N  G E    G+ N  
Sbjct: 617 NLRDDSSNASQITGDWYVQM--LDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDD 674

Query: 222 MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETP 281
           + P G +   +IQS+  W+GV Y L++ M   G  DE    A  + R VYER  +     
Sbjct: 675 ITPLGIK--LSIQSKAPWSGVEYYLASHMFSSG-FDEY---AKKILRNVYERYEIA---- 724

Query: 282 EGLTGDKTYRSGGYMRALAVYAM 304
            G   +       YMR L   +M
Sbjct: 725 -GNFWNHIEWGARYMRPLVALSM 746


>gi|410867532|ref|YP_006982143.1| hypothetical protein PACID_30230 [Propionibacterium acidipropionici
           ATCC 4875]
 gi|410824173|gb|AFV90788.1| hypothetical protein PACID_30230 [Propionibacterium acidipropionici
           ATCC 4875]
          Length = 845

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 54/219 (24%), Positives = 83/219 (37%), Gaps = 26/219 (11%)

Query: 107 YCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY-----KFDSS 161
           + G L +AAL    EM+ +L  T +   +  +   +S  Y   L+ G YY     +    
Sbjct: 558 FMGSLWLAALRASEEMALILGATDRAMGWRERFVLSSAAYDTALFRGGYYVQAADETHGD 617

Query: 162 QSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEA-------- 213
                D  +ADQL G  +  A  ++   + P+ ++  AL  + + N+   +         
Sbjct: 618 DDEFGDGCLADQLIGQWW--AHLLDLGHLLPAEHVRSALQAVVAHNLNPADETRHSQRAY 675

Query: 214 ---GSMGAVNGMKPNGDRDRSAIQ-SEEVWTGVTYLLSAAMLYEG-------NVDEAWTT 262
              G  G V    P G R   A +  +EVWTG  Y ++A  L EG        +D  W  
Sbjct: 676 AVRGEQGLVMCSWPRGGRPARATRYCDEVWTGCEYEVAALCLAEGLEKDGHRILDAIWRR 735

Query: 263 AGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAV 301
             G  R  Y     G      + G      GG  R  A+
Sbjct: 736 HDGRIRNPYNEIECGDHYVRSMAGWSVLELGGAPRWDAI 774


>gi|227828566|ref|YP_002830346.1| hypothetical protein M1425_2317 [Sulfolobus islandicus M.14.25]
 gi|227460362|gb|ACP39048.1| protein of unknown function DUF608 [Sulfolobus islandicus M.14.25]
          Length = 818

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 56/203 (27%), Positives = 85/203 (41%), Gaps = 21/203 (10%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
           + Y   L I+AL  + E S LLN+ Q  +    +L +    Y   LW G YY   +D   
Sbjct: 562 SPYVSFLWISALEALNEASKLLNNPQNYE--VKELIEKVDKY---LWNGEYYMDWYDPIS 616

Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNG- 221
           +   DS  A Q+ G  +++   ++              ++I   N  G E    G+ N  
Sbjct: 617 NLRDDSSNASQITGDWYVQM--LDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDD 674

Query: 222 MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETP 281
           + P G +   +IQS+  W+GV Y L++ M   G  DE    A  + R VYER  +     
Sbjct: 675 ITPLGIK--LSIQSKAPWSGVEYYLASHMFSSG-FDEY---AKKILRNVYERYEIA---- 724

Query: 282 EGLTGDKTYRSGGYMRALAVYAM 304
            G   +       YMR L   +M
Sbjct: 725 -GNFWNHIEWGARYMRPLVALSM 746


>gi|422908697|ref|ZP_16943374.1| hypothetical protein VCHE09_0171 [Vibrio cholerae HE-09]
 gi|341639159|gb|EGS63785.1| hypothetical protein VCHE09_0171 [Vibrio cholerae HE-09]
          Length = 981

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 55/120 (45%), Gaps = 4/120 (3%)

Query: 166 SDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN 225
           SD+   D L    +LK   V+   +F S+ I R L  I+ TN     +  +G  N    +
Sbjct: 771 SDNSFGDALLADSYLKL--VDLPGLFDSSRIERTLDFIYQTNFIT-NSPKLGVANMTLAD 827

Query: 226 GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLT 285
           G R   A Q+++VW GV + ++ A+   G   +A      +Y+ +Y    + F  PEG  
Sbjct: 828 G-RLHDAFQAQDVWIGVQFSVATALRLAGKQAQAERLMDAVYQALYSMAKIPFAAPEGFN 886



 Score = 41.6 bits (96), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 38/148 (25%), Positives = 64/148 (43%), Gaps = 14/148 (9%)

Query: 32  PFSL-INGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLE----LIRKCE 86
           PF +  + Y+  +V +WKDL  KFI+ +YR +    D   +   + +  E    LI   E
Sbjct: 552 PFDIDPDAYSWHNVKEWKDLAPKFILMVYRHWQKTGDREVVESCWSSVKESIDYLINLIE 611

Query: 87  SFDKQGLGIVQNGGFPDQTA-----YCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTK 141
             D   L    +  F +  +     YCG L +A L    E++ L+ + +  Q ++ +   
Sbjct: 612 DGDTLPLTRGTDDTFDNLASHGVSIYCGSLWVAGLRAASELALLVGEEELAQGYTQRSKD 671

Query: 142 ASQVYHDLLW--TGSYYKF--DSSQSRH 165
           A +     LW     YY F     Q++H
Sbjct: 672 ALETLERGLWDEQKGYYHFFITPVQAKH 699


>gi|238620760|ref|YP_002915586.1| hypothetical protein M164_2319 [Sulfolobus islandicus M.16.4]
 gi|238381830|gb|ACR42918.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.4]
          Length = 818

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 53/203 (26%), Positives = 84/203 (41%), Gaps = 21/203 (10%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
           + Y   L I+AL  + E S LLN+ Q  +     + +  +  +  LW G YY   +D   
Sbjct: 562 SPYVSFLWISALEALNEASKLLNNPQNYE-----VKELIEKVNKYLWNGEYYMDWYDPIS 616

Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNG- 221
           +   DS  A Q+ G  +++   ++              ++I   N    E    G+ N  
Sbjct: 617 NLRDDSSNASQITGDWYVQM--LDLPEFLDYERRKSVFSSIMKYNYSAEEGVRNGSSNDD 674

Query: 222 MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETP 281
           + P G +   +IQS+  W+GV Y L++ M   G  DE    A  + R VYER  +     
Sbjct: 675 ITPLGVK--LSIQSKAPWSGVEYYLASHMFSSG-FDEY---AKKILRNVYERYEIA---- 724

Query: 282 EGLTGDKTYRSGGYMRALAVYAM 304
            G   +       YMR L   +M
Sbjct: 725 -GNFWNHIEWGARYMRPLVALSM 746


>gi|326436882|gb|EGD82452.1| hypothetical protein PTSG_11966 [Salpingoeca sp. ATCC 50818]
          Length = 1720

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 63/269 (23%), Positives = 105/269 (39%), Gaps = 41/269 (15%)

Query: 21   RYYTRWFPEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLE 80
            R ++ W P++E  SL       D +   D  +  ++  YR+  +     +    Y   L+
Sbjct: 1302 RQWSFW-PDDEDRSL-------DQTICSDGEICTVIKTYREVLMGAPSSWFKDKYHQVLK 1353

Query: 81   LI-RKCESFDKQGLGIVQNGGFPDQ--------TAYCGGLHIAALACMVEMSSLLNDTQQ 131
            ++ R     D QG G+V+ G  P           ++ G L++ AL    EM+ L ND   
Sbjct: 1354 IMERWLIIMDAQGDGVVR-GAQPSTYDVALHGANSFIGSLYMCALRAAEEMAVLQNDPDS 1412

Query: 132  QQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIM-----ADQLCGYLFLKASGVN 186
              +F A+   +   Y  L +T   +        H  +I+      D L G  + +  G+ 
Sbjct: 1413 AAKFRARFKLSVANYDKLCFTNGKWYTQVVDPAHDTNIIGESTFVDSLLGEWWAQFLGIG 1472

Query: 187  TEAIFPSANIARALATIFSTN-VRGFEAGSMGAVN-------GM------KPNGDRDRSA 232
              A+ P A++   L   F  N V  F+               GM       P     ++A
Sbjct: 1473 --AMLPPAHVLSTLQNCFKYNHVDAFDPARQAPRKFCDSRDAGMYIATWGGPENPPPKNA 1530

Query: 233  I--QSEEVWTGVTYLLSAAMLYEGNVDEA 259
            +   SE +WTG+ Y  S   +  G+VD A
Sbjct: 1531 LLYTSEAIWTGLVYPFSGLCMMAGDVDTA 1559


>gi|15922864|ref|NP_378533.1| hypothetical protein ST2529 [Sulfolobus tokodaii str. 7]
 gi|15623655|dbj|BAB67642.1| putative glycosidase [Sulfolobus tokodaii str. 7]
          Length = 836

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/153 (23%), Positives = 65/153 (42%), Gaps = 25/153 (16%)

Query: 139 LTKASQVYHDLLWTGSYYK--FDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANI 196
           +  A  +Y  +LW G Y+   +D        +  A QL G  +L   G    ++      
Sbjct: 582 VENALSLYESMLWNGEYFDLWYDPISGYRDKTCQASQLLGEFYLNLLGY---SLLDREKT 638

Query: 197 ARALATIFSTNVRGFEAGSMGAVNGMKPNGDR------------DRSAIQSEEVWTGVTY 244
            + L +I   N++  E    G +NG  P+G R              ++I  +  W+GV +
Sbjct: 639 RKTLLSIVKYNLKEEE----GVINGAYPDGYRPLMREYENPLKIKEASIHQDTPWSGVEF 694

Query: 245 LLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG 277
            L++ ++YE  +DE    A  + + VY+R  + 
Sbjct: 695 YLASHLIYEKMIDE----AKKVLKEVYDRYSIA 723


>gi|229820204|ref|YP_002881730.1| hypothetical protein Bcav_1712 [Beutenbergia cavernae DSM 12333]
 gi|229566117|gb|ACQ79968.1| protein of unknown function DUF608 [Beutenbergia cavernae DSM
           12333]
          Length = 865

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 40/152 (26%), Positives = 70/152 (46%), Gaps = 20/152 (13%)

Query: 134 QFSAKLTKASQVYHDLLWTGSYYKF-----DSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
           +  A   + S  Y ++ +TG YY+      + +Q +  D  ++DQL G  +  A  ++  
Sbjct: 611 ELHALFERGSAAYDEICFTGEYYRQVLEPDEPTQFQWGDGCLSDQLIGQWW--AHLLDLG 668

Query: 189 AIFPSANIARALATIFSTNVR-GFEA-----------GSMGAVNGMKPNGDRDRSAIQ-S 235
            + P+A++  AL  +   N+R GF+              +G +    P+G R   A +  
Sbjct: 669 HVLPAAHVRSALRAVVGHNLRRGFDGWEHTQRVYADGDDVGLLMCTWPHGGRPEVATRYC 728

Query: 236 EEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 267
           +EVW+GV Y ++A  L EG   E      GL+
Sbjct: 729 DEVWSGVEYEVAAHCLLEGLTAEGRAILDGLW 760


>gi|160932020|ref|ZP_02079412.1| hypothetical protein CLOLEP_00853 [Clostridium leptum DSM 753]
 gi|156869062|gb|EDO62434.1| hypothetical protein CLOLEP_00853 [Clostridium leptum DSM 753]
          Length = 822

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 46/200 (23%), Positives = 85/200 (42%), Gaps = 28/200 (14%)

Query: 126 LNDTQQQQQFSAKLTKASQVYHDLLWTGSYY-----KFDSSQSRHSDSIMADQLCGYLFL 180
           + D ++ +++     K +++  +  W G YY       D  + ++    ++DQL G    
Sbjct: 570 MGDQERARRYRDGAEKGAKLMDEKCWNGEYYIQLIDDVDKYRYQYGKGCLSDQLLGQFLA 629

Query: 181 KASGVNTEAIFPSANIARALATIFSTNV--RGFEAG----------SMGAVNGMKPNGDR 228
           + +G+    + P  ++ +A  +IF  N   R  + G           MG      P G+R
Sbjct: 630 QEAGLGY--VLPKNHVKKAAESIFRYNFIERASDFGHVQRAYIFNDEMGLTPCTWPLGER 687

Query: 229 DR-SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGD 287
            R   I   EVWTG+ Y ++A ++ E  ++E  T    + + V +R       P      
Sbjct: 688 PRFPFIYFGEVWTGIEYEVAALLIREDLLEEGLT----IVKAVRDRQDGYRRNPWSENES 743

Query: 288 KTYRSGGYMRALAVYAMQDA 307
             Y    Y RA+A Y++ +A
Sbjct: 744 GYY----YTRAMASYSILNA 759


>gi|327311335|ref|YP_004338232.1| hypothetical protein TUZN_1447 [Thermoproteus uzoniensis 768-20]
 gi|326947814|gb|AEA12920.1| hypothetical protein TUZN_1447 [Thermoproteus uzoniensis 768-20]
          Length = 892

 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 59/220 (26%), Positives = 88/220 (40%), Gaps = 40/220 (18%)

Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
           ++Y   L  AAL  MVE + L   T +  +  A+L + +    + LW G Y+   +D   
Sbjct: 587 SSYVLFLWAAALKAMVEGAKL---TGRSPEKYAELLRRAFEGLERLWNGEYFDLWWDPVT 643

Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGM 222
                + MA QL G L    + +    +     +  AL  +   N+    A   G +NGM
Sbjct: 644 GERDRASMAAQLFGQLLAHVADLGY--LSDKQRVISALRAVAKYNL----APDEGLINGM 697

Query: 223 KPNGDRDRSAI------------------QSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 264
            P  DR R +                   Q +  WTGV Y ++  M YEG V+E      
Sbjct: 698 YP--DRRRPSFVGPTLYENFTRGPYLPTWQMDTPWTGVEYAVAGHMFYEGLVEEGTAVLK 755

Query: 265 GLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
            L+   YER G  +   E  T         YMR L+ +A+
Sbjct: 756 ALHER-YERGGHYWNHIEWGT--------HYMRPLSAWAV 786


>gi|149196161|ref|ZP_01873217.1| hypothetical protein LNTAR_23484 [Lentisphaera araneosa HTCC2155]
 gi|149141008|gb|EDM29405.1| hypothetical protein LNTAR_23484 [Lentisphaera araneosa HTCC2155]
          Length = 1052

 Score = 47.0 bits (110), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 33/133 (24%), Positives = 56/133 (42%), Gaps = 11/133 (8%)

Query: 37  NGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIV 96
           + YN  +V++WKDL  K+I+ + R+Y   KD+  L   + +    ++  +    +G  I 
Sbjct: 612 DAYNWHNVAEWKDLAPKYILMLLRNYHFTKDVYLLEECWDSVQASLQYLKDMILEGHSIP 671

Query: 97  QNGGFPDQ---------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYH 147
              G  D          T YCG L +A L    E++ +L               AS  ++
Sbjct: 672 LTNGTDDTFDNLSSFGITLYCGSLWVAGLKAAGEIAKILKIEDVIDDLKELEEAASASFN 731

Query: 148 DLLW--TGSYYKF 158
             LW    +Y+ F
Sbjct: 732 QALWDKENNYFHF 744


>gi|348173841|ref|ZP_08880735.1| hypothetical protein SspiN1_25464 [Saccharopolyspora spinosa NRRL
           18395]
          Length = 995

 Score = 45.8 bits (107), Expect = 0.026,   Method: Composition-based stats.
 Identities = 61/256 (23%), Positives = 108/256 (42%), Gaps = 42/256 (16%)

Query: 55  IVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIV-------QNGGFPDQTAY 107
           I+ IYR++ +  D ++L RV+P   + +    + D +  GI+       Q+  +  + ++
Sbjct: 608 ILRIYREHQMSPDSQWLQRVWPRTRQAVEYLIASDAEPDGILDGAQPNTQDATWFGRNSW 667

Query: 108 CGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSD 167
             GL++AAL     M+  + D    ++ +   T  ++V    L+ G Y+  +   + H  
Sbjct: 668 LSGLYLAALRAGAAMAGEVGDDAFARRCTELATSGTEVIVRDLFNGEYFVHELDPA-HPG 726

Query: 168 SI------MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNV--RGFEAGSMGAV 219
           S+       ADQL G  +  A+ +    + P      AL +I+  N   R  E      +
Sbjct: 727 SVNTNRGCFADQLLGPSW--ATQLGLPRVLPPEPTRSALRSIWRHNFVPRPMEYRQNSPI 784

Query: 220 NGMK---------------PNGDRDR-----SAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
            G +               PNG  D      S     E W G+ Y ++A ++ EG VDE 
Sbjct: 785 EGGRIFYDADVPALVMCTWPNGGGDEAGDNWSVSYFNEAWHGIEYQVAAHLIAEGMVDEG 844

Query: 260 WTTAGGLYRTVYERTG 275
              A    R+V++R G
Sbjct: 845 LAVA----RSVHDRYG 856


>gi|332796016|ref|YP_004457516.1| hypothetical protein Ahos_0327 [Acidianus hospitalis W1]
 gi|332693751|gb|AEE93218.1| conserved hypothetical protein [Acidianus hospitalis W1]
          Length = 662

 Score = 45.8 bits (107), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 51/207 (24%), Positives = 86/207 (41%), Gaps = 24/207 (11%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLE-----LIRKCESFDKQGLGIVQNGGF 101
           WKDLN  FI+ +YR +    D+ FL   YP  L+     L R  E+   +G G       
Sbjct: 407 WKDLNPTFILLVYRYFKFTGDVDFLRETYPKMLKAYEWMLTRSIEA---EGSGDTAFDVL 463

Query: 102 P--DQTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFD 159
           P   +      L IA+   + E   +LN+  +    S  L+K  ++ +  L+ G Y+   
Sbjct: 464 PIKGKNPMLLTLFIASALALRETKKVLNEKDE----STDLSKLREMLNS-LYNGKYFIAW 518

Query: 160 SSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAV 219
             Q    + I   QL G  + +  G+  E +     I+ AL  +   N +   A      
Sbjct: 519 EGQ----EGIFMAQLLGEWWTELLGL--ENVTDEEKISSALRYMLEVNGK---ASEYCTP 569

Query: 220 NGMKPNGDRDRSAIQSEEVWTGVTYLL 246
           N +K NG+  + + Q+   W  + + +
Sbjct: 570 NLVKENGEVVKISPQAYSSWPRLVFAM 596


>gi|284997210|ref|YP_003418977.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
 gi|284445105|gb|ADB86607.1| hypothetical protein LD85_0902 [Sulfolobus islandicus L.D.8.5]
          Length = 498

 Score = 45.8 bits (107), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 49/97 (50%), Gaps = 18/97 (18%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLEL----IRKCESFDKQGLGIVQ---NG 99
           WKD+N   I+ +YR +    DI FL  VYPT +++    +R+C    + GL  ++   + 
Sbjct: 402 WKDMNPSLILLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQC----RDGLPFMEGEMDN 457

Query: 100 GFPDQT------AYCGGLHIAALACMVEMSSLLNDTQ 130
            F D T      +Y   L IA+L  M E++ L+   Q
Sbjct: 458 AF-DATIIKGHDSYTSSLFIASLIAMREIAKLVGRQQ 493


>gi|15922946|ref|NP_378615.1| hypothetical protein ST2609 [Sulfolobus tokodaii str. 7]
 gi|15623737|dbj|BAB67724.1| putative glycosidase [Sulfolobus tokodaii str. 7]
          Length = 661

 Score = 45.4 bits (106), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 46/167 (27%), Positives = 69/167 (41%), Gaps = 16/167 (9%)

Query: 47  WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT- 105
           WKDLN  +I+ IYR Y L  DI FL  VY    +       F + GL    +  F D T 
Sbjct: 388 WKDLNPTYILLIYRYYKLTGDIEFLKSVYDKVKKAFEWELKFSRYGLEGKMDSAF-DVTP 446

Query: 106 -----AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDS 160
                +Y   L+IA+L  M E+S    D     +   ++ +A + +  +   G Y+    
Sbjct: 447 IKGINSYTLSLYIASLFAMREISKTAGDNLNLDE---QIKEAKEAFEKMF-NGKYF---I 499

Query: 161 SQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTN 207
           +     D++   Q+ G  +    G+  E I     I  AL  I   N
Sbjct: 500 AWEGMEDAVFLAQVFGEWWTTLLGL--EPIADEEKIKSALRWIIKVN 544


>gi|297741933|emb|CBI33368.3| unnamed protein product [Vitis vinifera]
          Length = 108

 Score = 45.1 bits (105), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 24/53 (45%), Positives = 34/53 (64%), Gaps = 2/53 (3%)

Query: 257 DEAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
           ++A+TTA G++   +   G G  F+TPEG T D  +RS  YMR LA++ MQ A
Sbjct: 3   EQAFTTAEGIFTAGWSEEGYGYWFQTPEGWTIDGHFRSLIYMRPLAIWGMQWA 55


>gi|374854722|dbj|BAL57596.1| hypothetical conserved protein [uncultured prokaryote]
          Length = 751

 Score = 44.7 bits (104), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 57/255 (22%), Positives = 95/255 (37%), Gaps = 46/255 (18%)

Query: 49  DLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKC---ESFDKQGLGIVQNGG----- 100
           D  +  IV  Y+ Y    ++ +L +VYP   +L+       S+D    G+++        
Sbjct: 456 DGQMGLIVRAYQQYRRDNNLEWLRQVYPKLKQLLSFAWLPGSWDADRDGVMEGAQHNTYD 515

Query: 101 --FPDQTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY-- 156
             F      CG  ++AAL  M EM+  + +T   Q+      + S+   + L+ G YY  
Sbjct: 516 IEFFGPNPMCGVWYLAALLAMEEMAKRVGETDFAQECRQLFERGSRWIDENLFDGEYYVQ 575

Query: 157 -----------------KFDSSQSRHS--DSIMADQLCGYLFLKASGVNTEAIFPSANIA 197
                              D +  R+      + DQL G    KA+      +    +I 
Sbjct: 576 RVQPLQGQPHPMTTAIDPGDPAYQRYQVGTGCLIDQLTGQ--YKANRAGLGDLLKREHIV 633

Query: 198 RALATIFSTNV-RGFE-----------AGSMGAVNGMKPNGDRDRSAIQS-EEVWTGVTY 244
           +AL ++   N  RGF                G +    P G+R  +      E WTG+ Y
Sbjct: 634 KALRSLMRHNFRRGFHQHYNNMRTYALGDEAGVLICSYPRGERPETPFPYWAECWTGLEY 693

Query: 245 LLSAAMLYEGNVDEA 259
           + +  +L  G   EA
Sbjct: 694 MFARLLLDYGLEQEA 708


>gi|332796035|ref|YP_004457535.1| bile acid beta-glucosidase [Acidianus hospitalis W1]
 gi|332693770|gb|AEE93237.1| bile acid beta-glucosidase [Acidianus hospitalis W1]
          Length = 840

 Score = 42.4 bits (98), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 61/279 (21%), Positives = 105/279 (37%), Gaps = 61/279 (21%)

Query: 49  DLNLKFIVSIYRDYALHKDIRFLTRVYPT---CLELIRKCESFDK----------QGLGI 95
           DLN +FI+  Y       D+ FL  ++P     LE   K +++D           + L  
Sbjct: 462 DLNPEFILMTYLIAKTTGDLNFLKEIFPKMKEALESTMKTQTYDGLIYHTLPAGLEWLRY 521

Query: 96  VQNG-GFP---DQTAYCGGLHIAALA-------CMVEMSSL-----------LNDTQQQQ 133
           V N    P   + +A   G ++  L+        M+ ++S            +ND     
Sbjct: 522 VNNKLNLPRGDNNSASILGHNLIPLSMQTFDDWSMIGITSFTSILWISSIQAVNDACSNL 581

Query: 134 QFSAKLTKASQVYH--DLLWTGSYYK--FDSSQSRHSDSIMADQLCGYLFLKASGVNTEA 189
           + +      S V    D LW G Y+   +D        +  A Q+ G+ +    G+    
Sbjct: 582 KINCSYDYESLVKKLIDYLWNGEYFDLWYDPKSKMRDKACNASQILGHWYSTLLGLR--- 638

Query: 190 IFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD-----------RSAIQSEEV 238
               + +   L +I   N++  E    G +NG  PNG R             +  Q +  
Sbjct: 639 FLDDSLVKTTLKSIVKYNLKEEE----GLLNGAYPNGYRPLKRNYQNQLNLPATTQIDTP 694

Query: 239 WTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG 277
           W+GV + +++ ++YE   DE       + R +YER  L 
Sbjct: 695 WSGVEFYVASHLIYEKLRDE----GEKILRNIYERYKLA 729


>gi|170060875|ref|XP_001865995.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167879232|gb|EDS42615.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 183

 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 15/27 (55%), Positives = 20/27 (74%)

Query: 36 INGYNVFDVSDWKDLNLKFIVSIYRDY 62
          +  Y + DV +WKDLN KFI+ +YRDY
Sbjct: 54 VRAYPIHDVYEWKDLNSKFILQVYRDY 80


>gi|156383620|ref|XP_001632931.1| predicted protein [Nematostella vectensis]
 gi|156219994|gb|EDO40868.1| predicted protein [Nematostella vectensis]
          Length = 1090

 Score = 38.1 bits (87), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 49/232 (21%), Positives = 89/232 (38%), Gaps = 30/232 (12%)

Query: 49  DLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ---- 104
           D ++  ++  YR+        +  +++P    ++++  +    G G++  G  P+     
Sbjct: 634 DGDVGTVLKTYREVRQGAPKTWFNKMWPQVKRIMKRWMTDLDNGTGVIP-GPQPNTYDCT 692

Query: 105 ----TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY--KF 158
                 Y  G ++A L    +M+ + N+ +    +  +    S      L+ GS+Y  + 
Sbjct: 693 LYGVNVYISGYYLAGLRAAEKMAKIQNEPELAATYHQRFELGSAKIDKTLFNGSWYIQEV 752

Query: 159 DSSQSRH--SDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTN--------- 207
           DS    +   D+   D L G  +  A G+    I    NI   L  IF  N         
Sbjct: 753 DSEHPVNVVGDATWVDCLVGQWWAHALGLGY--ILSQKNIQSTLQNIFLKNHVERFVPAT 810

Query: 208 ---VRGFEAGSMGAVNGMKPNGDRDRSAI--QSEEVWTGVTYLLSAAMLYEG 254
               + F+    G    + PN D  +S +   SE  WTG+ Y  +    YEG
Sbjct: 811 QAPRQFFDERDAGLRICVYPN-DLPKSPLLYHSEGAWTGLEYEYAELCFYEG 861


>gi|312868970|ref|ZP_07729150.1| glutaminase A [Lactobacillus oris PB013-T2-3]
 gi|311095534|gb|EFQ53798.1| glutaminase A [Lactobacillus oris PB013-T2-3]
          Length = 306

 Score = 37.7 bits (86), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 46/178 (25%), Positives = 78/178 (43%), Gaps = 26/178 (14%)

Query: 117 ACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCG 176
           A  +E++SL+     +Q F   +  A ++ HD   T +   F+ S+ R  D+   D+   
Sbjct: 113 AGAIEVTSLIKPLADRQPFEQIIMFAREICHDPQITLNNEIFE-SEDRTGDT---DRSLA 168

Query: 177 YLFLKASGVNTEAIFPSANIARALATIF---STNVRGFEAGSMGAV---NGMKPNGDRDR 230
           Y +LKA+G+         ++  +L T F   S  V      ++GAV   NG+KP  D+DR
Sbjct: 169 Y-YLKANGIM------QGDVTVSLTTYFKQCSMMVTAVSLANLGAVLANNGVKP-WDQDR 220

Query: 231 SAIQSEEVWTGVTYLLSAAMLYEG-------NVDEAWTTAGGLYRTVYERTGLGFETP 281
             I S+      + +++  +  E         +       GGL   V  R G+G  +P
Sbjct: 221 -LISSQAATYAKSLMMTTGLYNESGTYSSIIGIPTKSGVGGGLMSAVPNRVGVGIFSP 277


>gi|259502028|ref|ZP_05744930.1| glutaminase 1 [Lactobacillus antri DSM 16041]
 gi|259170029|gb|EEW54524.1| glutaminase 1 [Lactobacillus antri DSM 16041]
          Length = 309

 Score = 37.7 bits (86), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 48/179 (26%), Positives = 77/179 (43%), Gaps = 28/179 (15%)

Query: 117 ACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCG 176
           A  +E++SL+     +Q F   +  A ++ HD   T +   F+S + R  D+   D+   
Sbjct: 116 AGAIEVASLIKPLADRQPFEQIIMFAREICHDPQITLNNEIFES-EDRTGDT---DRSLA 171

Query: 177 YLFLKASGVNTEAIFPSANIARALATIF---STNVRGFEAGSMGAV---NGMKPNGDRDR 230
           Y +LKA+G+        A++  +L T F   S  V      ++GAV   NG+KP    D+
Sbjct: 172 Y-YLKANGI------LQADVTASLTTYFKQCSMMVTAVSLANLGAVLANNGVKP---WDQ 221

Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGN--------VDEAWTTAGGLYRTVYERTGLGFETP 281
             + S    T    L+    LY+ +        V       GGL   V  R G+G  +P
Sbjct: 222 ERLISGAAATYAKSLMMTTGLYDESGTYSSIIGVPTKSGVGGGLMSAVPNRVGIGIFSP 280


>gi|320109368|ref|YP_004184958.1| hypothetical protein AciPR4_4219 [Terriglobus saanensis SP1PR4]
 gi|319927889|gb|ADV84964.1| hypothetical protein AciPR4_4219 [Terriglobus saanensis SP1PR4]
          Length = 898

 Score = 37.4 bits (85), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 34/167 (20%), Positives = 65/167 (38%), Gaps = 13/167 (7%)

Query: 54  FIVSIYRDYA-------LHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQTA 106
           +++ + +DY        + K+   + R +      IR  +SF   G     +   P+ + 
Sbjct: 531 WVIQLAKDYQQTGDDTFIKKNADHIRRGFAYMKAQIRD-DSFIPVGGQTYDDFPHPEISV 589

Query: 107 YCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSS---QS 163
           Y G +++AAL   V +   L DTQ   +   +  +        LW G ++ + +      
Sbjct: 590 YTGTVYLAALRAGVVLGDALGDTQMSHEAETQFKQTQAGLIHALWNGRFFAYGTDIGGAH 649

Query: 164 RHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRG 210
           R  D + + QL G    + +G     + P      A+     T+V G
Sbjct: 650 RRDDRLFSGQLAGQFLSRYAGWGD--VLPHDQAKSAIYEQLHTSVLG 694


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.134    0.406 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,943,065,818
Number of Sequences: 23463169
Number of extensions: 199339992
Number of successful extensions: 418074
Number of sequences better than 100.0: 418
Number of HSP's better than 100.0 without gapping: 281
Number of HSP's successfully gapped in prelim test: 137
Number of HSP's that attempted gapping in prelim test: 416248
Number of HSP's gapped (non-prelim): 612
length of query: 314
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 172
effective length of database: 9,027,425,369
effective search space: 1552717163468
effective search space used: 1552717163468
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)