BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy10296
         (116 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|443709232|gb|ELU03988.1| hypothetical protein CAPTEDRAFT_186839 [Capitella teleta]
          Length = 802

 Score =  119 bits (297), Expect = 3e-25,   Method: Composition-based stats.
 Identities = 53/98 (54%), Positives = 68/98 (69%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQ+CG  FLKA GV   A+FPSAN+ RAL  +F  NV  F+ G MGA+NGM+PN  +D
Sbjct: 651 MADQMCGQWFLKACGVADNAVFPSANVKRALEAVFKHNVLPFDGGRMGAINGMRPNAKKD 710

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            ++ QS+E WTGVTY L A M+  G +D+ + TA G Y
Sbjct: 711 VTSCQSDEFWTGVTYALGATMIQVGMIDKGFQTAYGAY 748


>gi|443710865|gb|ELU04898.1| hypothetical protein CAPTEDRAFT_216398 [Capitella teleta]
          Length = 449

 Score =  118 bits (295), Expect = 5e-25,   Method: Composition-based stats.
 Identities = 53/98 (54%), Positives = 68/98 (69%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQ+CG  FLKA GV   A+FPSAN+ RAL  +F  NV  F+ G MGA+NGM+PN  +D
Sbjct: 298 MADQMCGQWFLKACGVADNAVFPSANVKRALEAVFKHNVLPFDGGRMGAINGMRPNAKKD 357

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            ++ QS+E WTGVTY L A M+  G +D+ + TA G Y
Sbjct: 358 VTSCQSDEFWTGVTYALGATMIQVGMIDKGFQTAYGAY 395


>gi|357630375|gb|EHJ78536.1| hypothetical protein KGM_16417 [Danaus plexippus]
          Length = 912

 Score =  113 bits (283), Expect = 1e-23,   Method: Composition-based stats.
 Identities = 60/103 (58%), Positives = 73/103 (70%), Gaps = 3/103 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGM--KPNGD 58
           MADQL G  FL+ASG  TE +FP AN+ +AL TI+  NV+ F  G MGAVNG    P   
Sbjct: 751 MADQLAGQWFLRASGW-TEPVFPEANVKKALHTIYENNVQRFLNGRMGAVNGFVRGPRPG 809

Query: 59  RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRL 101
            D +AIQSEEVWTGVTY L+A M+YEG  ++A++TAGGLY  L
Sbjct: 810 IDTTAIQSEEVWTGVTYGLAALMIYEGMHEQAFSTAGGLYNTL 852


>gi|328709489|ref|XP_001943760.2| PREDICTED: non-lysosomal glucosylceramidase-like [Acyrthosiphon
           pisum]
          Length = 776

 Score =  112 bits (279), Expect = 4e-23,   Method: Composition-based stats.
 Identities = 54/98 (55%), Positives = 67/98 (68%), Gaps = 1/98 (1%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQLCG  +LK  GVN E + P   + +AL TI+  NV GF  G MGAVNGM P+G+ D
Sbjct: 633 MADQLCGEWYLKCCGVNEE-VLPVDRVKKALHTIYKMNVLGFNDGKMGAVNGMMPDGNPD 691

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
             ++QSEEVWTGVTY LS+ M+  G  DE + TA G+Y
Sbjct: 692 TFSLQSEEVWTGVTYALSSLMIAHGLRDEGFNTAKGIY 729


>gi|339252284|ref|XP_003371365.1| AP-1 complex subunit gamma-1 [Trichinella spiralis]
 gi|316968412|gb|EFV52690.1| AP-1 complex subunit gamma-1 [Trichinella spiralis]
          Length = 1615

 Score =  110 bits (275), Expect = 1e-22,   Method: Composition-based stats.
 Identities = 51/103 (49%), Positives = 74/103 (71%), Gaps = 1/103 (0%)

Query: 1    MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
            MADQLCGY +LK  G+ T+ I P  N+ + L TI++ NV  F  G++GAVNGM  +G++D
Sbjct: 1468 MADQLCGYAYLKICGLQTD-ILPLDNVKKVLETIYNLNVCSFGNGTLGAVNGMLYSGEKD 1526

Query: 61   RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             S++Q++E WTGVTY LSA M+ EG V++ ++TA G+Y+   E
Sbjct: 1527 TSSLQADEAWTGVTYFLSAHMISEGFVEQGFSTASGIYKSCFE 1569


>gi|312382731|gb|EFR28085.1| hypothetical protein AND_04399 [Anopheles darlingi]
          Length = 777

 Score =  109 bits (273), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 61/122 (50%), Positives = 78/122 (63%), Gaps = 8/122 (6%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
           M+DQLCG+ +L+A G + + +FP  N+  A+ TI+  NV  F  G++GAVNG  PN    
Sbjct: 626 MSDQLCGHWYLRACGFDYD-VFPKENVRLAMRTIYENNVMRFCGGNLGAVNGYVPNAQPN 684

Query: 57  --GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPE 114
             G  D S IQ EEVWTGVTY L+A M++EG  +EA+ TAGGLYR L E      ET PE
Sbjct: 685 KEGRPDVSNIQGEEVWTGVTYALAATMIHEGMFEEAFQTAGGLYRSLSEKIGMNFET-PE 743

Query: 115 GV 116
            V
Sbjct: 744 AV 745


>gi|291241331|ref|XP_002740565.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 792

 Score =  109 bits (272), Expect = 2e-22,   Method: Composition-based stats.
 Identities = 58/117 (49%), Positives = 77/117 (65%), Gaps = 3/117 (2%)

Query: 1   MADQLCGYLFLKASGVNTE--AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
           M DQ  G  +L+A G+  E   +FP  +I  AL  I+  NV GFE G+MGAVNGM+P+G 
Sbjct: 645 MCDQAAGQWYLRACGLVQENDEVFPKDHILCALRAIYKMNVMGFEKGTMGAVNGMRPDGS 704

Query: 59  RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
            D ++ QSEEVWTGVTY L+A M+ EG ++EA+TTA G+Y+   E    G +T PE 
Sbjct: 705 PDTTSFQSEEVWTGVTYALAANMIQEGLLEEAFTTASGVYKTCYERCGLGFQT-PEA 760


>gi|405977906|gb|EKC42333.1| Non-lysosomal glucosylceramidase [Crassostrea gigas]
          Length = 1464

 Score =  109 bits (272), Expect = 2e-22,   Method: Composition-based stats.
 Identities = 52/98 (53%), Positives = 68/98 (69%)

Query: 1    MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
            MADQL G+ FLKASG+  + IFP   +  +L  IF  NV  FE G+MGA+NG +P+G +D
Sbjct: 1316 MADQLAGHWFLKASGLQDDDIFPPDRVKSSLKKIFENNVMMFEGGNMGAINGTRPDGSKD 1375

Query: 61   RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
             S+ QSEE W GVTY L+A M+ EG +D+A+ TA G Y
Sbjct: 1376 ISSCQSEEFWVGVTYGLAANMIQEGLLDQAFQTAWGAY 1413


>gi|196002317|ref|XP_002111026.1| hypothetical protein TRIADDRAFT_54543 [Trichoplax adhaerens]
 gi|190586977|gb|EDV27030.1| hypothetical protein TRIADDRAFT_54543 [Trichoplax adhaerens]
          Length = 843

 Score =  109 bits (272), Expect = 2e-22,   Method: Composition-based stats.
 Identities = 50/101 (49%), Positives = 71/101 (70%), Gaps = 3/101 (2%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           MADQ+CG+ +L+A G+   N + IFP + + +AL  +F  NV  F  G+ GAVNGMKPNG
Sbjct: 689 MADQMCGHWYLQACGLVNNNEDDIFPKSQVDKALRKVFDYNVMKFMNGTWGAVNGMKPNG 748

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + D S++QSEEVWTGVTY L+A ++ +G + E +  A G+Y
Sbjct: 749 EIDTSSVQSEEVWTGVTYALAANLIQQGMIKEGFKVAEGMY 789


>gi|395515264|ref|XP_003761826.1| PREDICTED: non-lysosomal glucosylceramidase [Sarcophilus harrisii]
          Length = 954

 Score =  108 bits (270), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 57/106 (53%), Positives = 74/106 (69%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FL+A G+   +TE +FPSA++ RAL TIF  NV+GF  G+MGAVNGM P+G
Sbjct: 759 MSDQCAGQWFLRACGLGKGDTE-VFPSAHVIRALQTIFELNVQGFAGGAMGAVNGMHPDG 817

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D S++QS+EVW GV Y L+A M+ EG V E + TA G YR + E
Sbjct: 818 VPDTSSVQSDEVWVGVVYGLAATMIQEGLVQEGFLTAEGCYRTVWE 863


>gi|156356077|ref|XP_001623757.1| predicted protein [Nematostella vectensis]
 gi|156210486|gb|EDO31657.1| predicted protein [Nematostella vectensis]
          Length = 783

 Score =  108 bits (269), Expect = 5e-22,   Method: Composition-based stats.
 Identities = 51/101 (50%), Positives = 73/101 (72%), Gaps = 3/101 (2%)

Query: 1   MADQLCGYLFLKA---SGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           MADQL G  +L A   +  + + +FPS N+  AL T+F+ NV  F+ G+MGAVNG++P+G
Sbjct: 632 MADQLSGQWYLHACDLAQTSNDRVFPSENVISALRTVFNFNVMKFQEGTMGAVNGIRPDG 691

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
             D S++Q+EEVWTGVTY ++A+M+ EG VDE + TA G+Y
Sbjct: 692 QLDTSSLQAEEVWTGVTYAVAASMIQEGLVDEGFKTASGIY 732


>gi|334333160|ref|XP_003341682.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal
           glucosylceramidase-like [Monodelphis domestica]
          Length = 950

 Score =  108 bits (269), Expect = 5e-22,   Method: Composition-based stats.
 Identities = 57/118 (48%), Positives = 78/118 (66%), Gaps = 5/118 (4%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FL+A G+   +TE +FP+ ++ RAL TIF  NV+GF  G+MGAVNGM+P+G
Sbjct: 755 MSDQCAGQWFLRACGLGEGDTE-VFPTPHVIRALQTIFEVNVQGFAGGAMGAVNGMQPDG 813

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
             D S++QS+EVW GV Y L+A M+ EG + E + TA G YR + E R+      PE 
Sbjct: 814 IPDTSSVQSDEVWVGVVYGLAATMIQEGLIQEGFRTAEGCYRTVWE-RLGMAFQTPEA 870


>gi|156554493|ref|XP_001604835.1| PREDICTED: non-lysosomal glucosylceramidase-like [Nasonia
           vitripennis]
          Length = 826

 Score =  107 bits (267), Expect = 8e-22,   Method: Composition-based stats.
 Identities = 54/116 (46%), Positives = 75/116 (64%), Gaps = 2/116 (1%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           M+DQLCG+  L+A G + E +FP   +  +L T+F  NV  ++ G  GAVNG  P G  D
Sbjct: 664 MSDQLCGHWLLRACGFSYE-VFPQDRVRSSLETVFQNNVMKYKNGQQGAVNGFSPTGSID 722

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEGV 116
            S IQSEE+WTGV Y L+A +++EG ++EA+ TA G+YR + E    G ET PE +
Sbjct: 723 YSCIQSEEMWTGVAYGLAALLIHEGMIEEAFRTAEGVYRTVYEKIGMGFET-PEAL 777


>gi|351707038|gb|EHB09957.1| Non-lysosomal glucosylceramidase [Heterocephalus glaber]
          Length = 922

 Score =  107 bits (266), Expect = 1e-21,   Method: Composition-based stats.
 Identities = 55/106 (51%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 745 MSDQCAGQWFLKACGLGEGDTE-VFPTTHVVRALQTIFELNVQSFAGGAMGAVNGMQPHG 803

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             DRS++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 804 IPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 849


>gi|354485725|ref|XP_003505033.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
           [Cricetulus griseus]
          Length = 898

 Score =  107 bits (266), Expect = 1e-21,   Method: Composition-based stats.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FL+A G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 721 MSDQCAGQWFLRACGLGEGDTE-VFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 779

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             DRS++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 780 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTREGFRTAEGCYRTVWE 825


>gi|157119417|ref|XP_001653371.1| bile acid beta-glucosidase, putative [Aedes aegypti]
 gi|108883154|gb|EAT47379.1| AAEL001478-PA, partial [Aedes aegypti]
          Length = 898

 Score =  107 bits (266), Expect = 1e-21,   Method: Composition-based stats.
 Identities = 57/122 (46%), Positives = 76/122 (62%), Gaps = 8/122 (6%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP----- 55
           M+DQLCG+ +L+  G + + IFP  N+  AL TI+  NV  F  G+MGAVNG  P     
Sbjct: 747 MSDQLCGHWYLRCCGFDYD-IFPKENVRTALKTIYDNNVMRFCGGNMGAVNGYVPSTQPN 805

Query: 56  -NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPE 114
            +G  D S +Q EEVWTGVTY L++ M++EG   EA+ TAGG+Y+ L E      ET PE
Sbjct: 806 KDGRADASTVQGEEVWTGVTYALASTMIHEGMFTEAFQTAGGMYQTLSEKIGMNFET-PE 864

Query: 115 GV 116
            +
Sbjct: 865 AL 866


>gi|410925034|ref|XP_003975986.1| PREDICTED: non-lysosomal glucosylceramidase-like [Takifugu
           rubripes]
          Length = 827

 Score =  106 bits (265), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 56/117 (47%), Positives = 72/117 (61%), Gaps = 3/117 (2%)

Query: 1   MADQLCGYLFLKASGVN--TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
           M+DQ  G+ FL+ASG+       FP   I RAL T+F  NV  F  G MGAVNGM+P G 
Sbjct: 678 MSDQCAGHWFLRASGLGDGEYQAFPKEKIQRALKTVFDLNVMSFAGGQMGAVNGMRPEGV 737

Query: 59  RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
            DRS++QS+EVW GV Y L+A M++EG ++E   TA G YR + E R+      PE 
Sbjct: 738 PDRSSVQSDEVWIGVVYGLAATMIHEGMLEEGMRTAEGCYRTVWE-RLGMAFQTPEA 793


>gi|198473190|ref|XP_002133205.1| GA28784 [Drosophila pseudoobscura pseudoobscura]
 gi|198139343|gb|EDY70607.1| GA28784 [Drosophila pseudoobscura pseudoobscura]
          Length = 865

 Score =  106 bits (264), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 57/123 (46%), Positives = 75/123 (60%), Gaps = 9/123 (7%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
           MADQLCG+ +LK+ G + E I+P  N+  AL TI+  NV GF  G++GA NG   N    
Sbjct: 713 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKTIYDNNVMGFHEGNIGAANGFIANAADP 771

Query: 57  ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
              G  D S IQ+EEVW GV Y L+A M+ EG  +EA+ TAGG+Y+ L E R+      P
Sbjct: 772 SKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSE-RIGMNFETP 830

Query: 114 EGV 116
           E +
Sbjct: 831 EAL 833


>gi|195164866|ref|XP_002023267.1| GL21266 [Drosophila persimilis]
 gi|194105352|gb|EDW27395.1| GL21266 [Drosophila persimilis]
          Length = 867

 Score =  106 bits (264), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 57/123 (46%), Positives = 75/123 (60%), Gaps = 9/123 (7%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
           MADQLCG+ +LK+ G + E I+P  N+  AL TI+  NV GF  G++GA NG   N    
Sbjct: 715 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKTIYDNNVMGFHEGNIGAANGFIANAADP 773

Query: 57  ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
              G  D S IQ+EEVW GV Y L+A M+ EG  +EA+ TAGG+Y+ L E R+      P
Sbjct: 774 SKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSE-RIGMNFETP 832

Query: 114 EGV 116
           E +
Sbjct: 833 EAL 835


>gi|221041110|dbj|BAH12232.1| unnamed protein product [Homo sapiens]
          Length = 933

 Score =  105 bits (263), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 756 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 814

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D+S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 815 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 860


>gi|125821356|ref|XP_687652.2| PREDICTED: non-lysosomal glucosylceramidase [Danio rerio]
          Length = 851

 Score =  105 bits (263), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 55/117 (47%), Positives = 72/117 (61%), Gaps = 3/117 (2%)

Query: 1   MADQLCGYLFLKASGVNTE--AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
           M+DQ  G+ FL+ASG+  +    FP   I  AL ++F  NV  F  G MGAVNGM+P G 
Sbjct: 698 MSDQCAGHWFLRASGLGDDEYQAFPKEKICSALKSVFDLNVMSFAGGQMGAVNGMRPEGV 757

Query: 59  RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
            DRS++QS+EVW GV Y L+A M++EG V+E   TA G YR + E R+      PE 
Sbjct: 758 PDRSSVQSDEVWVGVVYGLAATMIHEGMVEEGLRTAEGCYRAVWE-RMGMAFQTPEA 813


>gi|332831849|ref|XP_001167923.2| PREDICTED: non-lysosomal glucosylceramidase isoform 4 [Pan
           troglodytes]
          Length = 933

 Score =  105 bits (263), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 756 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 814

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D+S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 815 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 860


>gi|444729905|gb|ELW70308.1| Non-lysosomal glucosylceramidase [Tupaia chinensis]
          Length = 279

 Score =  105 bits (263), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 55/106 (51%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 102 MSDQCAGQWFLKACGLGEGDTE-VFPTPHVVRALQTIFEFNVQAFAGGAMGAVNGMQPHG 160

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             DRS++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 161 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 206


>gi|344251774|gb|EGW07878.1| Non-lysosomal glucosylceramidase [Cricetulus griseus]
          Length = 226

 Score =  105 bits (262), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FL+A G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 49  MSDQCAGQWFLRACGLGEGDTE-VFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 107

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             DRS++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 108 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTREGFRTAEGCYRTVWE 153


>gi|24308251|ref|NP_065995.1| non-lysosomal glucosylceramidase [Homo sapiens]
 gi|143018392|sp|Q9HCG7.2|GBA2_HUMAN RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
           AltName: Full=Beta-glucocerebrosidase 2;
           Short=Beta-glucosidase 2; AltName:
           Full=Glucosylceramidase 2
 gi|15030210|gb|AAH11363.1| Glucosidase, beta (bile acid) 2 [Homo sapiens]
 gi|16215453|emb|CAC83792.1| bile acid beta-glucosidase [Homo sapiens]
 gi|21739906|emb|CAD38976.1| hypothetical protein [Homo sapiens]
 gi|119578752|gb|EAW58348.1| glucosidase, beta (bile acid) 2, isoform CRA_d [Homo sapiens]
 gi|119578753|gb|EAW58349.1| glucosidase, beta (bile acid) 2, isoform CRA_d [Homo sapiens]
          Length = 927

 Score =  105 bits (262), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 808

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D+S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854


>gi|332228461|ref|XP_003263408.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal glucosylceramidase
           [Nomascus leucogenys]
          Length = 919

 Score =  105 bits (262), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 742 MSDQCAGQWFLKACGLGEGDTE-VFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 800

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D+S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 801 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 846


>gi|55633173|ref|XP_520565.1| PREDICTED: non-lysosomal glucosylceramidase isoform 6 [Pan
           troglodytes]
 gi|410227830|gb|JAA11134.1| talin 1 [Pan troglodytes]
          Length = 927

 Score =  105 bits (262), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 808

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D+S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854


>gi|14042883|dbj|BAB55430.1| unnamed protein product [Homo sapiens]
          Length = 927

 Score =  105 bits (262), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 808

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D+S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854


>gi|426361727|ref|XP_004048051.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Gorilla
           gorilla gorilla]
          Length = 933

 Score =  105 bits (262), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 756 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 814

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D+S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 815 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 860


>gi|355689821|gb|AER98956.1| glucosidase, beta 2 [Mustela putorius furo]
          Length = 550

 Score =  105 bits (262), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 56/106 (52%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKASG+   +TE +FP+ ++  AL TIF  NVR F  G+MGAVNGM+P+G
Sbjct: 372 MSDQCAGQWFLKASGLGEGDTE-VFPTQHVVCALQTIFEFNVRAFAGGAMGAVNGMQPHG 430

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             DRS++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 431 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 476


>gi|403306664|ref|XP_003943843.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Saimiri
           boliviensis boliviensis]
          Length = 929

 Score =  105 bits (262), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 756 MSDQCAGQWFLKACGLGEGDTE-VFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 814

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D+S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 815 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 860


>gi|170036781|ref|XP_001846240.1| bile acid beta-glucosidase [Culex quinquefasciatus]
 gi|167879683|gb|EDS43066.1| bile acid beta-glucosidase [Culex quinquefasciatus]
          Length = 913

 Score =  105 bits (262), Expect = 4e-21,   Method: Composition-based stats.
 Identities = 57/122 (46%), Positives = 76/122 (62%), Gaps = 8/122 (6%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP----- 55
           M+DQLCG+ +L+  G + + I P  N+  AL TI+  NV  F  G+MGAVNG  P     
Sbjct: 762 MSDQLCGHWYLRCCGFDYD-IVPKENVRTALKTIYDNNVMRFCGGNMGAVNGYIPSSQPN 820

Query: 56  -NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPE 114
            +G  D S +Q+EEVWTGVTY L++ M++EG   EA+ TAGGLY+ L E      ET PE
Sbjct: 821 KDGRADSSTLQAEEVWTGVTYALASTMIHEGMFTEAFQTAGGLYQTLSEKIGMSFET-PE 879

Query: 115 GV 116
            +
Sbjct: 880 AL 881


>gi|444729906|gb|ELW70309.1| Non-lysosomal glucosylceramidase [Tupaia chinensis]
          Length = 1001

 Score =  105 bits (262), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 55/106 (51%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 824 MSDQCAGQWFLKACGLGEGDTE-VFPTPHVVRALQTIFEFNVQAFAGGAMGAVNGMQPHG 882

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             DRS++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 883 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 928


>gi|403306662|ref|XP_003943842.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Saimiri
           boliviensis boliviensis]
          Length = 923

 Score =  105 bits (262), Expect = 4e-21,   Method: Composition-based stats.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGDTE-VFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 808

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D+S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854


>gi|385648282|ref|NP_001245314.1| glucosidase, beta (bile acid) 2 [Sus scrofa]
          Length = 915

 Score =  105 bits (261), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 74/106 (69%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FL+ASG+   +TE +FP+ ++ +AL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 742 MSDQCAGQWFLRASGLGEGDTE-VFPTPHVVQALQTIFEFNVQAFAGGAMGAVNGMQPHG 800

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             DRS++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 801 VPDRSSVQSDEVWVGVVYALAATMIQEGLTWEGFRTAEGCYRTVWE 846


>gi|432110769|gb|ELK34246.1| Non-lysosomal glucosylceramidase [Myotis davidii]
          Length = 868

 Score =  105 bits (261), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 59/117 (50%), Positives = 77/117 (65%), Gaps = 5/117 (4%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FL+ASG+    TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P G
Sbjct: 712 MSDQCAGQWFLRASGLGEGETE-VFPTQHVVRALQTIFEFNVQAFAGGTMGAVNGMQPQG 770

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPE 114
             DRS++QS+EVW GV Y L+A M+ EG   E + TA G YR + E    G +T PE
Sbjct: 771 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWERLGLGFQT-PE 826


>gi|195115764|ref|XP_002002426.1| GI12858 [Drosophila mojavensis]
 gi|193913001|gb|EDW11868.1| GI12858 [Drosophila mojavensis]
          Length = 877

 Score =  105 bits (261), Expect = 5e-21,   Method: Composition-based stats.
 Identities = 56/123 (45%), Positives = 74/123 (60%), Gaps = 9/123 (7%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
           MADQLCG+ +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N    
Sbjct: 725 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANVSEP 783

Query: 57  ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
              G  D S IQ+EEVW GV Y L+A M+ EG  DEA+ TAGG+Y+ L + R+      P
Sbjct: 784 SKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFDEAFQTAGGMYKTLSQ-RIGMNYETP 842

Query: 114 EGV 116
           E +
Sbjct: 843 EAL 845


>gi|395855828|ref|XP_003800351.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal glucosylceramidase
            [Otolemur garnettii]
          Length = 1147

 Score =  105 bits (261), Expect = 5e-21,   Method: Composition-based stats.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1    MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
            M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TI+  NV+ F  G+MGAVNGM+P+G
Sbjct: 970  MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIYELNVQAFAGGAMGAVNGMQPHG 1028

Query: 58   DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
              DRS++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 1029 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 1074


>gi|397519481|ref|XP_003829887.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Pan
           paniscus]
          Length = 927

 Score =  105 bits (261), Expect = 5e-21,   Method: Composition-based stats.
 Identities = 53/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL T+F  NV+ F  G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTVFELNVQAFAGGAMGAVNGMQPHG 808

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D+S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854


>gi|402897018|ref|XP_003911574.1| PREDICTED: non-lysosomal glucosylceramidase [Papio anubis]
          Length = 927

 Score =  105 bits (261), Expect = 5e-21,   Method: Composition-based stats.
 Identities = 54/106 (50%), Positives = 72/106 (67%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+    TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGETE-VFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 808

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D+S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854


>gi|355567647|gb|EHH23988.1| Non-lysosomal glucosylceramidase [Macaca mulatta]
 gi|355753217|gb|EHH57263.1| Non-lysosomal glucosylceramidase [Macaca fascicularis]
 gi|380789499|gb|AFE66625.1| non-lysosomal glucosylceramidase [Macaca mulatta]
 gi|384943046|gb|AFI35128.1| non-lysosomal glucosylceramidase [Macaca mulatta]
          Length = 927

 Score =  105 bits (261), Expect = 5e-21,   Method: Composition-based stats.
 Identities = 54/106 (50%), Positives = 72/106 (67%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+    TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGETE-VFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 808

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D+S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854


>gi|109111050|ref|XP_001084705.1| PREDICTED: non-lysosomal glucosylceramidase [Macaca mulatta]
          Length = 927

 Score =  105 bits (261), Expect = 5e-21,   Method: Composition-based stats.
 Identities = 54/106 (50%), Positives = 72/106 (67%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+    TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGETE-VFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 808

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D+S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854


>gi|390458158|ref|XP_002743170.2| PREDICTED: non-lysosomal glucosylceramidase [Callithrix jacchus]
          Length = 1065

 Score =  105 bits (261), Expect = 5e-21,   Method: Composition-based stats.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 892 MSDQCAGQWFLKACGLGEGDTE-VFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 950

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D+S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 951 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 996


>gi|395740303|ref|XP_003777396.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Pongo
           abelii]
          Length = 933

 Score =  104 bits (260), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 756 MSDQCAGQWFLKACGLGEGDTE-VFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 814

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D+S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 815 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 860


>gi|297684086|ref|XP_002819683.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Pongo
           abelii]
          Length = 927

 Score =  104 bits (260), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGDTE-VFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 808

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D+S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854


>gi|426361725|ref|XP_004048050.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Gorilla
           gorilla gorilla]
          Length = 927

 Score =  104 bits (260), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 808

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D+S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854


>gi|410978559|ref|XP_003995657.1| PREDICTED: non-lysosomal glucosylceramidase [Felis catus]
          Length = 917

 Score =  104 bits (260), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 55/106 (51%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FL+ASG+   +TE +FP+ ++  AL TIF  NVR F  G+MGAVNGM+P+G
Sbjct: 742 MSDQCAGQWFLRASGLGEGDTE-VFPTPHVVCALQTIFEFNVRAFAGGAMGAVNGMQPHG 800

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             DRS++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 801 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 846


>gi|194386992|dbj|BAG59862.1| unnamed protein product [Homo sapiens]
          Length = 755

 Score =  104 bits (260), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 578 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 636

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D+S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 637 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 682


>gi|158299437|ref|XP_319575.4| AGAP008830-PA [Anopheles gambiae str. PEST]
 gi|157013519|gb|EAA14848.4| AGAP008830-PA [Anopheles gambiae str. PEST]
          Length = 913

 Score =  104 bits (259), Expect = 8e-21,   Method: Composition-based stats.
 Identities = 54/122 (44%), Positives = 78/122 (63%), Gaps = 8/122 (6%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           M+DQLCG+ +L++ G + + +FP  N+  A+ TI+  NV  F  G +GAVNG  P+G  +
Sbjct: 762 MSDQLCGHWYLRSCGFDYD-VFPKENVRLAMRTIYENNVMRFCGGQLGAVNGYVPSGQPN 820

Query: 61  RS------AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPE 114
           +       +IQ EEVWTGVTY L++ M++EG  +EA+ TAGGLY+ L E R+      PE
Sbjct: 821 KDGRPDTVSIQGEEVWTGVTYALASTMIHEGMFEEAFKTAGGLYQALSE-RIGMNFETPE 879

Query: 115 GV 116
            V
Sbjct: 880 AV 881


>gi|343959578|dbj|BAK63646.1| bile acid beta-glucosidase [Pan troglodytes]
          Length = 514

 Score =  104 bits (259), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 337 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 395

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D+S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 396 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 441


>gi|189217893|ref|NP_001013109.2| non-lysosomal glucosylceramidase [Rattus norvegicus]
 gi|149045757|gb|EDL98757.1| glucosidase beta 2 [Rattus norvegicus]
          Length = 904

 Score =  103 bits (258), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FL+A G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 741 MSDQCAGQWFLRACGLGEGDTE-VFPTLHVVRALKTIFELNVQAFAGGAMGAVNGMQPHG 799

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             DRS++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 800 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 845


>gi|301767028|ref|XP_002918945.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
           [Ailuropoda melanoleuca]
          Length = 928

 Score =  103 bits (258), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 55/106 (51%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKASG+   +TE +FP+ ++  AL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 753 MSDQCAGQWFLKASGLGEGDTE-VFPTPHVVCALQTIFEFNVQAFAGGAMGAVNGMQPHG 811

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             DRS++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 812 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 857


>gi|281339788|gb|EFB15372.1| hypothetical protein PANDA_007478 [Ailuropoda melanoleuca]
          Length = 923

 Score =  103 bits (258), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 55/106 (51%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKASG+   +TE +FP+ ++  AL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKASGLGEGDTE-VFPTPHVVCALQTIFEFNVQAFAGGAMGAVNGMQPHG 808

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             DRS++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 809 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854


>gi|143018484|sp|Q5M868.2|GBA2_RAT RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
           AltName: Full=Beta-glucocerebrosidase 2;
           Short=Beta-glucosidase 2; AltName:
           Full=Glucosylceramidase 2
          Length = 912

 Score =  103 bits (258), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FL+A G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 741 MSDQCAGQWFLRACGLGEGDTE-VFPTLHVVRALKTIFELNVQAFAGGAMGAVNGMQPHG 799

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             DRS++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 800 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 845


>gi|195436933|ref|XP_002066400.1| GK18271 [Drosophila willistoni]
 gi|194162485|gb|EDW77386.1| GK18271 [Drosophila willistoni]
          Length = 955

 Score =  103 bits (258), Expect = 1e-20,   Method: Composition-based stats.
 Identities = 55/123 (44%), Positives = 74/123 (60%), Gaps = 9/123 (7%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
           MADQLCG+ +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N    
Sbjct: 803 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANVSEP 861

Query: 57  ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
              G  D S IQ+EEVW GV Y L+A M+ EG  DEA+ TAGG+Y+ + + R+      P
Sbjct: 862 SKAGHVDNSNIQAEEVWPGVVYALAATMIQEGMFDEAFQTAGGMYKTISQ-RIGMNFETP 920

Query: 114 EGV 116
           E +
Sbjct: 921 EAL 923


>gi|73971791|ref|XP_531991.2| PREDICTED: non-lysosomal glucosylceramidase [Canis lupus familiaris]
          Length = 1136

 Score =  103 bits (258), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 55/106 (51%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1    MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
            M+DQ  G  FL+ASG+   +TE +FP+ ++  AL TIF  NVR F  G+MGAVNGM+P+G
Sbjct: 961  MSDQCAGQWFLRASGLGEGDTE-VFPTRHVVCALQTIFEFNVRAFAGGAMGAVNGMQPHG 1019

Query: 58   DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
              DRS++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 1020 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 1065


>gi|397519483|ref|XP_003829888.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Pan
           paniscus]
          Length = 755

 Score =  103 bits (258), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 53/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL T+F  NV+ F  G+MGAVNGM+P+G
Sbjct: 578 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTVFELNVQAFAGGAMGAVNGMQPHG 636

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D+S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 637 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 682


>gi|338720173|ref|XP_001497796.3| PREDICTED: non-lysosomal glucosylceramidase [Equus caballus]
          Length = 983

 Score =  103 bits (257), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FL+A G+   +TE +FP+ +I RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 806 MSDQCAGQWFLRACGLGEGDTE-VFPTRHIVRALQTIFELNVQAFAGGAMGAVNGMQPHG 864

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             DR+++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 865 VPDRTSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 910


>gi|240120073|ref|NP_766280.2| non-lysosomal glucosylceramidase [Mus musculus]
 gi|143018443|sp|Q69ZF3.2|GBA2_MOUSE RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
           AltName: Full=Beta-glucocerebrosidase 2;
           Short=Beta-glucosidase 2; AltName:
           Full=Glucosylceramidase 2
 gi|34784135|gb|AAH56935.1| Glucosidase beta 2 [Mus musculus]
          Length = 918

 Score =  103 bits (257), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 72/106 (67%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FL+A G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM P+G
Sbjct: 741 MSDQCAGQWFLRACGLGEGDTE-VFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHG 799

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             DRS++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 800 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 845


>gi|26354314|dbj|BAC40785.1| unnamed protein product [Mus musculus]
          Length = 918

 Score =  103 bits (257), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 72/106 (67%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FL+A G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM P+G
Sbjct: 741 MSDQCAGQWFLRACGLGEGDTE-VFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHG 799

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             DRS++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 800 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 845


>gi|50511011|dbj|BAD32491.1| mKIAA1605 protein [Mus musculus]
          Length = 940

 Score =  103 bits (257), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 72/106 (67%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FL+A G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM P+G
Sbjct: 763 MSDQCAGQWFLRACGLGEGDTE-VFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHG 821

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             DRS++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 822 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 867


>gi|148670514|gb|EDL02461.1| glucosidase beta 2, isoform CRA_a [Mus musculus]
          Length = 918

 Score =  103 bits (257), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 72/106 (67%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FL+A G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM P+G
Sbjct: 741 MSDQCAGQWFLRACGLGEGDTE-VFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHG 799

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             DRS++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 800 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 845


>gi|148670516|gb|EDL02463.1| glucosidase beta 2, isoform CRA_c [Mus musculus]
          Length = 467

 Score =  103 bits (257), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 72/106 (67%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FL+A G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM P+G
Sbjct: 290 MSDQCAGQWFLRACGLGEGDTE-VFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHG 348

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             DRS++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 349 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 394


>gi|189239183|ref|XP_966847.2| PREDICTED: similar to bile acid beta-glucosidase, putative
           [Tribolium castaneum]
 gi|270010940|gb|EFA07388.1| hypothetical protein TcasGA2_TC016367 [Tribolium castaneum]
          Length = 818

 Score =  103 bits (256), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 52/103 (50%), Positives = 69/103 (66%), Gaps = 2/103 (1%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQLCG  +L++ G N E +FP   +  +L TI+  NV+ F  G MGAVNG   +G  D
Sbjct: 674 MADQLCGQWYLRSCGFNYE-VFPQDRVKTSLKTIYENNVQSFCDGRMGAVNGFI-DGVID 731

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
              IQS+EVWTGVTY L+A+ML EG   EA+ TAGG+++ + E
Sbjct: 732 EFTIQSQEVWTGVTYALAASMLQEGMRTEAFNTAGGMFKSMSE 774


>gi|45552032|ref|NP_788055.2| CG33090 [Drosophila melanogaster]
 gi|74876618|sp|Q7KT91.1|C3390_DROME RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase
 gi|45445134|gb|AAO41192.2| CG33090 [Drosophila melanogaster]
 gi|201065481|gb|ACH92150.1| FI02015p [Drosophila melanogaster]
          Length = 948

 Score =  102 bits (255), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 55/123 (44%), Positives = 74/123 (60%), Gaps = 9/123 (7%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
           MADQLCG+ +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N    
Sbjct: 796 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEP 854

Query: 57  ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
              G  D S IQ+EEVW GV Y L+A M+ EG  +EA+ TAGG+Y+ L + R+      P
Sbjct: 855 TKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQ-RIGMNFETP 913

Query: 114 EGV 116
           E +
Sbjct: 914 EAL 916


>gi|194860149|ref|XP_001969520.1| GG23916 [Drosophila erecta]
 gi|190661387|gb|EDV58579.1| GG23916 [Drosophila erecta]
          Length = 947

 Score =  102 bits (255), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 55/123 (44%), Positives = 74/123 (60%), Gaps = 9/123 (7%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
           MADQLCG+ +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N    
Sbjct: 795 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEP 853

Query: 57  ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
              G  D S IQ+EEVW GV Y L+A M+ EG  +EA+ TAGG+Y+ L + R+      P
Sbjct: 854 TKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQ-RIGMNFETP 912

Query: 114 EGV 116
           E +
Sbjct: 913 EAL 915


>gi|194760205|ref|XP_001962332.1| GF15415 [Drosophila ananassae]
 gi|190616029|gb|EDV31553.1| GF15415 [Drosophila ananassae]
          Length = 894

 Score =  102 bits (255), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 55/123 (44%), Positives = 74/123 (60%), Gaps = 9/123 (7%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
           MADQLCG+ +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N    
Sbjct: 742 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANASDP 800

Query: 57  ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
              G  D S IQSEEVW GV Y L+A M+ EG  +EA+ TAGG+Y+ + + R+      P
Sbjct: 801 SKPGHVDNSNIQSEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTISQ-RIGMNFETP 859

Query: 114 EGV 116
           E +
Sbjct: 860 EAL 862


>gi|195474003|ref|XP_002089281.1| GE19029 [Drosophila yakuba]
 gi|194175382|gb|EDW88993.1| GE19029 [Drosophila yakuba]
          Length = 947

 Score =  102 bits (255), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 55/123 (44%), Positives = 74/123 (60%), Gaps = 9/123 (7%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
           MADQLCG+ +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N    
Sbjct: 795 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEP 853

Query: 57  ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
              G  D S IQ+EEVW GV Y L+A M+ EG  +EA+ TAGG+Y+ L + R+      P
Sbjct: 854 TKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQ-RIGMNFETP 912

Query: 114 EGV 116
           E +
Sbjct: 913 EAL 915


>gi|195338407|ref|XP_002035816.1| GM15538 [Drosophila sechellia]
 gi|194129696|gb|EDW51739.1| GM15538 [Drosophila sechellia]
          Length = 947

 Score =  102 bits (255), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 55/123 (44%), Positives = 74/123 (60%), Gaps = 9/123 (7%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
           MADQLCG+ +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N    
Sbjct: 795 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEP 853

Query: 57  ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
              G  D S IQ+EEVW GV Y L+A M+ EG  +EA+ TAGG+Y+ L + R+      P
Sbjct: 854 TKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQ-RIGMNFETP 912

Query: 114 EGV 116
           E +
Sbjct: 913 EAL 915


>gi|195579186|ref|XP_002079443.1| GD23959 [Drosophila simulans]
 gi|194191452|gb|EDX05028.1| GD23959 [Drosophila simulans]
          Length = 947

 Score =  102 bits (255), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 55/123 (44%), Positives = 74/123 (60%), Gaps = 9/123 (7%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
           MADQLCG+ +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N    
Sbjct: 795 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEP 853

Query: 57  ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
              G  D S IQ+EEVW GV Y L+A M+ EG  +EA+ TAGG+Y+ L + R+      P
Sbjct: 854 TKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQ-RIGMNFETP 912

Query: 114 EGV 116
           E +
Sbjct: 913 EAL 915


>gi|198424051|ref|XP_002127036.1| PREDICTED: similar to Non-lysosomal glucosylceramidase (NLGase)
           (Glucosylceramidase 2) (Beta-glucocerebrosidase 2)
           (Beta-glucosidase 2) [Ciona intestinalis]
          Length = 861

 Score =  102 bits (255), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 49/99 (49%), Positives = 67/99 (67%), Gaps = 2/99 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQ  G  FL ASG+    + P   +  +L TIFS NV  ++ G  GAVNGM+P+G  D
Sbjct: 720 MADQCAGQWFLLASGICD--VLPDDQVMSSLKTIFSLNVEQYDGGRHGAVNGMRPHGAVD 777

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYR 99
            S++QS+EVWTGVTY L+A M+++G + E + TA G+YR
Sbjct: 778 TSSVQSDEVWTGVTYALAATMIHKGMLKEGFRTASGIYR 816


>gi|195397814|ref|XP_002057523.1| GJ18180 [Drosophila virilis]
 gi|194141177|gb|EDW57596.1| GJ18180 [Drosophila virilis]
          Length = 869

 Score =  102 bits (255), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 54/123 (43%), Positives = 74/123 (60%), Gaps = 9/123 (7%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
           MADQLCG+ +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N    
Sbjct: 717 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEP 775

Query: 57  ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
              G  D S IQ+EEVW GV Y L+A M+ EG  +EA+ TAGG+Y+ + + R+      P
Sbjct: 776 SKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTISQ-RIGMNYETP 834

Query: 114 EGV 116
           E +
Sbjct: 835 EAL 837


>gi|288558766|gb|ADC53514.1| MIP16949p [Drosophila melanogaster]
          Length = 432

 Score =  102 bits (255), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 55/123 (44%), Positives = 74/123 (60%), Gaps = 9/123 (7%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
           MADQLCG+ +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N    
Sbjct: 280 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEP 338

Query: 57  ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
              G  D S IQ+EEVW GV Y L+A M+ EG  +EA+ TAGG+Y+ L + R+      P
Sbjct: 339 TKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQ-RIGMNFETP 397

Query: 114 EGV 116
           E +
Sbjct: 398 EAL 400


>gi|326934726|ref|XP_003213436.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
           [Meleagris gallopavo]
          Length = 639

 Score =  102 bits (254), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 51/104 (49%), Positives = 67/104 (64%), Gaps = 2/104 (1%)

Query: 3   DQLCGYLFLKASGVNTEA--IFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           DQ  G  FL A G++ +   +FP ++I  AL TIF  NV  F  G+MGAVNGM+P+G  D
Sbjct: 481 DQCAGQWFLGACGLDQKELEVFPKSHIVSALKTIFEKNVMSFAGGTMGAVNGMRPDGVPD 540

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
            S++QS EVW GV Y L+A M+ EG V E + TA G YR + E+
Sbjct: 541 TSSVQSSEVWVGVVYALAATMIQEGLVQEGFHTAEGCYRTVWEN 584


>gi|195033785|ref|XP_001988762.1| GH11342 [Drosophila grimshawi]
 gi|193904762|gb|EDW03629.1| GH11342 [Drosophila grimshawi]
          Length = 869

 Score =  102 bits (253), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 54/123 (43%), Positives = 74/123 (60%), Gaps = 9/123 (7%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
           MADQLCG+ +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N    
Sbjct: 717 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANANEP 775

Query: 57  ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
              G  D S IQ+EEVW GV Y L+A M+ EG  +EA+ TAGG+Y+ + + R+      P
Sbjct: 776 SKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTISQ-RIGMNFETP 834

Query: 114 EGV 116
           E +
Sbjct: 835 EAL 837


>gi|169642399|gb|AAI60640.1| LOC100145361 protein [Xenopus (Silurana) tropicalis]
          Length = 571

 Score =  101 bits (252), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 51/105 (48%), Positives = 67/105 (63%), Gaps = 2/105 (1%)

Query: 1   MADQLCGYLFLKASGVNT--EAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
           M DQ  G  FL+A G+      +FP  ++  AL TIF  NV+ F  G MGAVNGM+P+G 
Sbjct: 415 MVDQCAGNWFLRACGLGAGESEVFPKDHVVSALRTIFELNVKQFADGQMGAVNGMRPDGT 474

Query: 59  RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
            D S++QS+EVW GV Y L+A M++EG V E + TA G YR + E
Sbjct: 475 TDTSSVQSDEVWIGVVYGLAATMIHEGLVQEGFATAEGCYRTVWE 519


>gi|432920219|ref|XP_004079895.1| PREDICTED: non-lysosomal glucosylceramidase-like [Oryzias latipes]
          Length = 858

 Score =  101 bits (252), Expect = 5e-20,   Method: Composition-based stats.
 Identities = 55/117 (47%), Positives = 68/117 (58%), Gaps = 3/117 (2%)

Query: 1   MADQLCGYLFLKASGVNTE--AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
           M+DQ  G+ FLKAS    E    FP   I   L +IF  NV  F  G MGAVNGM+P G 
Sbjct: 699 MSDQCAGHWFLKASACGEEDYQAFPKEKIQSTLKSIFDLNVMAFAGGQMGAVNGMRPEGV 758

Query: 59  RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
            DRS++QS+EVW GV Y L+A M++EG  +E   TA G YR + E R+      PE 
Sbjct: 759 PDRSSVQSDEVWIGVVYGLAATMIHEGMREEGMHTAEGCYRTVWE-RLGMAFQTPEA 814


>gi|242015137|ref|XP_002428230.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212512791|gb|EEB15492.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 812

 Score =  101 bits (252), Expect = 6e-20,   Method: Composition-based stats.
 Identities = 49/116 (42%), Positives = 70/116 (60%), Gaps = 2/116 (1%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           M+DQLCG  +L   G+  E + P A +  +L  I+  NV     G++G VNGM PNG+ D
Sbjct: 669 MSDQLCGLWYLYCCGIKNE-VLPPAKVNSSLRCIYENNVLKLGKGNLGCVNGMLPNGEID 727

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEGV 116
              +QSEE+W GV Y L++ M++EG V+E W TA G+Y  + +    G ET PE +
Sbjct: 728 IHTLQSEEIWVGVVYALASLMIFEGMVEEGWKTARGIYETVYDRIGMGFET-PEAL 782


>gi|301628717|ref|XP_002943495.1| PREDICTED: non-lysosomal glucosylceramidase [Xenopus (Silurana)
           tropicalis]
          Length = 692

 Score =  101 bits (251), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 50/101 (49%), Positives = 65/101 (64%), Gaps = 2/101 (1%)

Query: 1   MADQLCGYLFLKASGVNT--EAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
           M DQ  G  FL+A G+      +FP  ++  AL TIF  NV+ F  G MGAVNGM+P+G 
Sbjct: 536 MVDQCAGNWFLRACGLGAGESEVFPKDHVVSALRTIFELNVKQFADGQMGAVNGMRPDGT 595

Query: 59  RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYR 99
            D S++QS+EVW GV Y L+A M++EG V E + TA G YR
Sbjct: 596 TDTSSVQSDEVWIGVVYGLAATMIHEGLVQEGFATAEGCYR 636


>gi|194669510|ref|XP_001790122.1| PREDICTED: non-lysosomal glucosylceramidase [Bos taurus]
 gi|297477943|ref|XP_002689726.1| PREDICTED: non-lysosomal glucosylceramidase [Bos taurus]
 gi|296484715|tpg|DAA26830.1| TPA: hypothetical protein BOS_8918 [Bos taurus]
          Length = 918

 Score =  101 bits (251), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 52/105 (49%), Positives = 68/105 (64%), Gaps = 2/105 (1%)

Query: 1   MADQLCGYLFLKASGVN--TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
           M+DQ  G  FL+ASG+      +FP+  + RAL TIF  NV+ F  G+MGAVNGM+P G 
Sbjct: 740 MSDQCAGQWFLRASGLGEGDSEVFPTQQVVRALQTIFEFNVQAFAGGAMGAVNGMQPCGV 799

Query: 59  RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
            D S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 800 PDTSSVQSDEVWVGVVYGLAATMIQEGLTREGFRTAEGCYRTVWE 844


>gi|440901933|gb|ELR52793.1| Non-lysosomal glucosylceramidase, partial [Bos grunniens mutus]
          Length = 927

 Score =  101 bits (251), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 52/105 (49%), Positives = 68/105 (64%), Gaps = 2/105 (1%)

Query: 1   MADQLCGYLFLKASGVN--TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
           M+DQ  G  FL+ASG+      +FP+  + RAL TIF  NV+ F  G+MGAVNGM+P G 
Sbjct: 749 MSDQCAGQWFLRASGLGEGDSEVFPTQQVVRALQTIFEFNVQAFAGGAMGAVNGMQPRGV 808

Query: 59  RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
            D S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 809 PDTSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 853


>gi|344271640|ref|XP_003407645.1| PREDICTED: non-lysosomal glucosylceramidase [Loxodonta africana]
          Length = 967

 Score =  100 bits (250), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 52/106 (49%), Positives = 72/106 (67%), Gaps = 4/106 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FL+A G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 794 MSDQCAGQWFLRACGLGEGDTE-VFPTPHVVRALQTIFEFNVQAFAGGAMGAVNGMQPHG 852

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
             D S++QS+EVW GV Y L+A M+ EG   + + TA G YR + E
Sbjct: 853 VPDTSSVQSDEVWVGVVYGLAATMIQEGLTSKGFWTAEGCYRTVWE 898


>gi|426220232|ref|XP_004004320.1| PREDICTED: non-lysosomal glucosylceramidase [Ovis aries]
          Length = 918

 Score =  100 bits (250), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 52/105 (49%), Positives = 68/105 (64%), Gaps = 2/105 (1%)

Query: 1   MADQLCGYLFLKASGVN--TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
           M+DQ  G  FL+ASG+      +FP+  + RAL TIF  NV+ F  G+MGAVNGM+P G 
Sbjct: 740 MSDQCAGQWFLRASGLGEGDSEVFPTQQVVRALQTIFEFNVQAFAGGAMGAVNGMQPRGV 799

Query: 59  RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
            D S++QS+EVW GV Y L+A M+ EG   E + TA G YR + E
Sbjct: 800 PDTSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 844


>gi|363744064|ref|XP_003642968.1| PREDICTED: non-lysosomal glucosylceramidase-like [Gallus gallus]
          Length = 853

 Score =  100 bits (249), Expect = 1e-19,   Method: Composition-based stats.
 Identities = 51/103 (49%), Positives = 65/103 (63%), Gaps = 2/103 (1%)

Query: 3   DQLCGYLFLKASGVNTEAI--FPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           DQ  G  FL A G++ + +  FP ++I  AL TIF  NV  F  G MGAVNGM+P+G  D
Sbjct: 695 DQCAGQWFLGACGLDQKEVEVFPKSHIVSALKTIFEKNVMSFAGGKMGAVNGMRPDGVPD 754

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
            S++QS EVW GV Y L+A M+ EG V E + TA G YR + E
Sbjct: 755 TSSVQSSEVWVGVVYALAATMIQEGLVQEGFHTAEGCYRTVWE 797


>gi|224090236|ref|XP_002190896.1| PREDICTED: non-lysosomal glucosylceramidase [Taeniopygia guttata]
          Length = 826

 Score =  100 bits (248), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 52/105 (49%), Positives = 68/105 (64%), Gaps = 2/105 (1%)

Query: 1   MADQLCGYLFLKASGVNT--EAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
           M+DQ  G  FL A G++     +FP ++I  AL TIF  NV  F  G+MGAVNGM+P+G 
Sbjct: 668 MSDQCAGQWFLGACGLDQGEFEVFPKSHILSALRTIFEKNVLSFAGGTMGAVNGMRPDGV 727

Query: 59  RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
            D S++QS EVW GV Y L+A M+ EG V+E + TA G YR + E
Sbjct: 728 PDTSSVQSNEVWVGVVYSLAATMIQEGMVEEGFRTAEGCYRTVWE 772


>gi|348520215|ref|XP_003447624.1| PREDICTED: non-lysosomal glucosylceramidase-like [Oreochromis
           niloticus]
          Length = 856

 Score = 99.8 bits (247), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 50/105 (47%), Positives = 65/105 (61%), Gaps = 2/105 (1%)

Query: 1   MADQLCGYLFLKASGVNTEAI--FPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
           M+DQ  G+ FL+ASG+       FP   I  AL ++F  NV  F  G MGAVNGM+P G 
Sbjct: 699 MSDQCAGHWFLRASGLGDGEFQAFPKEKIQSALKSVFDLNVMSFAGGQMGAVNGMRPEGV 758

Query: 59  RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
            D S++QS+EVW GV Y L+A M++EG  +E   TA G YR + E
Sbjct: 759 PDHSSVQSDEVWIGVVYGLAATMIHEGMQEEGMRTAEGCYRTVWE 803


>gi|308485740|ref|XP_003105068.1| hypothetical protein CRE_20734 [Caenorhabditis remanei]
 gi|308257013|gb|EFP00966.1| hypothetical protein CRE_20734 [Caenorhabditis remanei]
          Length = 933

 Score = 98.6 bits (244), Expect = 4e-19,   Method: Composition-based stats.
 Identities = 49/99 (49%), Positives = 68/99 (68%), Gaps = 1/99 (1%)

Query: 1   MADQLCGYLFLKA-SGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR 59
           MADQLCGY FL++ S    + + P+  +  AL TI+  NV  F  G MGAVNGMKPNG  
Sbjct: 729 MADQLCGYWFLQSVSPEMVDDLLPNHMVRSALDTIYRLNVCRFGNGQMGAVNGMKPNGVV 788

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           DR  IQ++E+WTGVTY +++ ++ +G V++A+ TA G Y
Sbjct: 789 DREYIQADEMWTGVTYAVASLLIQQGEVEKAFHTASGSY 827


>gi|449275451|gb|EMC84314.1| Non-lysosomal glucosylceramidase [Columba livia]
          Length = 692

 Score = 97.8 bits (242), Expect = 6e-19,   Method: Composition-based stats.
 Identities = 44/83 (53%), Positives = 58/83 (69%)

Query: 21  IFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAA 80
           +FP +++  AL TIF  NV GF  G+MGAVNGM+P+G  D S++QS EVW GV Y L+A 
Sbjct: 554 VFPKSHVVSALKTIFEKNVMGFAGGTMGAVNGMRPSGVPDTSSVQSNEVWVGVVYALAAT 613

Query: 81  MLYEGNVDEAWTTAGGLYRRLRE 103
           M+ EG V+E + TA G YR + E
Sbjct: 614 MIQEGLVEEGFRTAEGCYRTVWE 636


>gi|268569852|ref|XP_002640631.1| Hypothetical protein CBG08749 [Caenorhabditis briggsae]
          Length = 899

 Score = 97.4 bits (241), Expect = 8e-19,   Method: Composition-based stats.
 Identities = 49/99 (49%), Positives = 68/99 (68%), Gaps = 1/99 (1%)

Query: 1   MADQLCGYLFLKA-SGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR 59
           MADQLCGY FL++ S    + + PS  +  AL TI+  NV  F  G MGAVNGMKP+G  
Sbjct: 701 MADQLCGYWFLQSVSPEMADDLLPSHMVRSALDTIYRLNVCQFGNGQMGAVNGMKPSGVV 760

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           DR  IQ++E+WTGVTY +++ ++ +G V++A+ TA G Y
Sbjct: 761 DREYIQADEMWTGVTYAVASLLIQQGEVEKAFHTASGSY 799


>gi|341901617|gb|EGT57552.1| hypothetical protein CAEBREN_15139 [Caenorhabditis brenneri]
          Length = 925

 Score = 97.1 bits (240), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 48/99 (48%), Positives = 68/99 (68%), Gaps = 1/99 (1%)

Query: 1   MADQLCGYLFLKA-SGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR 59
           MADQLCGY FL++ S    + + P+  +  AL TI+  NV  F  G MGAVNGMKP+G  
Sbjct: 705 MADQLCGYWFLQSVSPEMADDLLPNHMVRSALDTIYRLNVCKFGNGQMGAVNGMKPSGVV 764

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           DR  IQ++E+WTGVTY +++ ++ +G V++A+ TA G Y
Sbjct: 765 DREYIQADEMWTGVTYAVASLLIQQGEVEKAFHTASGSY 803


>gi|71992715|ref|NP_001021680.1| Protein HPO-13, isoform a [Caenorhabditis elegans]
 gi|18376555|emb|CAD21661.1| Protein HPO-13, isoform a [Caenorhabditis elegans]
          Length = 959

 Score = 96.7 bits (239), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 48/99 (48%), Positives = 68/99 (68%), Gaps = 1/99 (1%)

Query: 1   MADQLCGYLFLKA-SGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR 59
           MADQLCGY FL++ S    + + P+  +  AL TI+  NV  F  G MGAVNGMKP+G  
Sbjct: 753 MADQLCGYWFLQSVSPELADELLPNHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVV 812

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           DR  IQ++E+WTGVTY +++ ++ +G V++A+ TA G Y
Sbjct: 813 DREYIQADEMWTGVTYAVASLLIQQGEVEKAFHTASGSY 851


>gi|71992723|ref|NP_001021681.1| Protein HPO-13, isoform b [Caenorhabditis elegans]
 gi|50511739|emb|CAH04764.1| Protein HPO-13, isoform b [Caenorhabditis elegans]
          Length = 930

 Score = 95.9 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 48/99 (48%), Positives = 68/99 (68%), Gaps = 1/99 (1%)

Query: 1   MADQLCGYLFLKA-SGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR 59
           MADQLCGY FL++ S    + + P+  +  AL TI+  NV  F  G MGAVNGMKP+G  
Sbjct: 724 MADQLCGYWFLQSVSPELADELLPNHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVV 783

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           DR  IQ++E+WTGVTY +++ ++ +G V++A+ TA G Y
Sbjct: 784 DREYIQADEMWTGVTYAVASLLIQQGEVEKAFHTASGSY 822


>gi|71992731|ref|NP_001021682.1| Protein HPO-13, isoform c [Caenorhabditis elegans]
 gi|15130776|emb|CAC48128.1| Protein HPO-13, isoform c [Caenorhabditis elegans]
          Length = 922

 Score = 95.9 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 48/99 (48%), Positives = 68/99 (68%), Gaps = 1/99 (1%)

Query: 1   MADQLCGYLFLKA-SGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR 59
           MADQLCGY FL++ S    + + P+  +  AL TI+  NV  F  G MGAVNGMKP+G  
Sbjct: 716 MADQLCGYWFLQSVSPELADELLPNHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVV 775

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           DR  IQ++E+WTGVTY +++ ++ +G V++A+ TA G Y
Sbjct: 776 DREYIQADEMWTGVTYAVASLLIQQGEVEKAFHTASGSY 814


>gi|431902810|gb|ELK09025.1| Non-lysosomal glucosylceramidase [Pteropus alecto]
          Length = 970

 Score = 94.7 bits (234), Expect = 6e-18,   Method: Composition-based stats.
 Identities = 47/94 (50%), Positives = 67/94 (71%), Gaps = 4/94 (4%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FL+ASG+   +TE +FP+ ++  AL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 718 MSDQCAGQWFLRASGLGEGDTE-VFPTPHVVCALQTIFKFNVQAFAGGAMGAVNGMQPHG 776

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 91
             DRS++QS+EVW GV Y L+A M+ E N+  ++
Sbjct: 777 VPDRSSVQSDEVWVGVVYGLAATMIQETNLMSSY 810


>gi|148670515|gb|EDL02462.1| glucosidase beta 2, isoform CRA_b [Mus musculus]
          Length = 377

 Score = 94.7 bits (234), Expect = 6e-18,   Method: Composition-based stats.
 Identities = 46/87 (52%), Positives = 62/87 (71%), Gaps = 4/87 (4%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FL+A G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM P+G
Sbjct: 241 MSDQCAGQWFLRACGLGEGDTE-VFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHG 299

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYE 84
             DRS++QS+EVW GV Y L+A M+ E
Sbjct: 300 VPDRSSVQSDEVWVGVVYGLAATMIQE 326


>gi|119578751|gb|EAW58347.1| glucosidase, beta (bile acid) 2, isoform CRA_c [Homo sapiens]
 gi|168270622|dbj|BAG10104.1| non-lysosomal glucosylceramidase [synthetic construct]
          Length = 877

 Score = 94.7 bits (234), Expect = 7e-18,   Method: Composition-based stats.
 Identities = 46/87 (52%), Positives = 63/87 (72%), Gaps = 4/87 (4%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 808

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYE 84
             D+S++QS+EVW GV Y L+A M+ E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQE 835


>gi|333898016|ref|YP_004471890.1| glucosylceramidase [Thermoanaerobacterium xylanolyticum LX-11]
 gi|333113281|gb|AEF18218.1| Glucosylceramidase [Thermoanaerobacterium xylanolyticum LX-11]
          Length = 806

 Score = 94.4 bits (233), Expect = 8e-18,   Method: Composition-based stats.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQL G  +  A  +    I P  ++ +AL  I+  NV  FE G MGAVNGM+P+G  D
Sbjct: 665 MADQLAGQWY--ADILRLGDILPKDHVQKALKKIYEFNVMKFENGKMGAVNGMRPDGIVD 722

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYR 99
            S IQ++EVWTGVTY L++ M Y G  +EA+ TA G+Y+
Sbjct: 723 ESDIQAQEVWTGVTYALASFMKYRGMTEEAYNTAYGVYK 761


>gi|10047285|dbj|BAB13431.1| KIAA1605 protein [Homo sapiens]
          Length = 922

 Score = 94.4 bits (233), Expect = 9e-18,   Method: Composition-based stats.
 Identities = 46/87 (52%), Positives = 63/87 (72%), Gaps = 4/87 (4%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           M+DQ  G  FLKA G+   +TE +FP+ ++ RAL TIF  NV+ F  G+MGAVNGM+P+G
Sbjct: 795 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 853

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYE 84
             D+S++QS+EVW GV Y L+A M+ E
Sbjct: 854 VPDKSSVQSDEVWVGVVYGLAATMIQE 880


>gi|260813884|ref|XP_002601646.1| hypothetical protein BRAFLDRAFT_124315 [Branchiostoma floridae]
 gi|229286945|gb|EEN57658.1| hypothetical protein BRAFLDRAFT_124315 [Branchiostoma floridae]
          Length = 871

 Score = 94.0 bits (232), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 46/107 (42%), Positives = 69/107 (64%), Gaps = 9/107 (8%)

Query: 1   MADQLCGYLFLKASGVNTE---------AIFPSANIARALATIFSTNVRGFEAGSMGAVN 51
           MA    G+ +L+A G+  E          + P  ++A +L  ++  NV  F  G+MGAVN
Sbjct: 713 MAGAGSGHWYLRACGLVPEDNEVMDSDREVLPLDHVASSLRKVYDFNVMQFHQGTMGAVN 772

Query: 52  GMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           GM+PNG +D +++QSEEVWTG+TY L+A+M+ EG V+E + TA G+Y
Sbjct: 773 GMRPNGKKDLTSMQSEEVWTGITYALAASMIQEGMVEEGFQTAYGVY 819


>gi|324500616|gb|ADY40283.1| Non-lysosomal glucosylceramidase [Ascaris suum]
          Length = 890

 Score = 92.4 bits (228), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 46/99 (46%), Positives = 67/99 (67%), Gaps = 1/99 (1%)

Query: 1   MADQLCGYLFLKA-SGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR 59
           MADQLCGY FL++ S    + + P+A++  AL  I++ NV  F  G +GAVNGM+ +G  
Sbjct: 690 MADQLCGYWFLQSISPELADELLPAAHVKSALNIIYNYNVLRFAGGRLGAVNGMRRDGTV 749

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           DR+ IQ++E+WTGVTY L+A ++ +G V   + TA G Y
Sbjct: 750 DRNYIQADEMWTGVTYALAAFLIQQGAVQRGFDTAWGCY 788


>gi|47212590|emb|CAG12815.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 777

 Score = 92.0 bits (227), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 50/114 (43%), Positives = 64/114 (56%), Gaps = 3/114 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           A   CG+   +     T   FP   I  AL +IF  NV  F  G MGAVNGM+P G  DR
Sbjct: 641 ATGFCGHPVWETE--TTRLAFPKEKIQTALKSIFDLNVMSFAGGQMGAVNGMRPEGVPDR 698

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
           S++QS+EVW GV Y L+A M++EG ++E   TA G YR + E R+      PE 
Sbjct: 699 SSVQSDEVWIGVVYGLAATMIHEGMLEEGMRTAEGCYRTVWE-RLGMAFQTPEA 751


>gi|327288418|ref|XP_003228923.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Anolis
           carolinensis]
          Length = 667

 Score = 92.0 bits (227), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 44/88 (50%), Positives = 59/88 (67%), Gaps = 2/88 (2%)

Query: 1   MADQLCGYLFLKASGVN--TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
           M+DQL G  FL+A G+      +FP  ++  AL TI+  NV GF  G+MGAVNGM+P+G 
Sbjct: 549 MSDQLAGQWFLRACGLGEGKYQVFPQEHVQSALRTIYEMNVLGFSEGAMGAVNGMRPDGV 608

Query: 59  RDRSAIQSEEVWTGVTYLLSAAMLYEGN 86
            D S++QS+EVW GV Y L+A M+ EG 
Sbjct: 609 PDTSSVQSDEVWVGVVYSLAATMIQEGQ 636


>gi|356535470|ref|XP_003536268.1| PREDICTED: non-lysosomal glucosylceramidase-like [Glycine max]
          Length = 949

 Score = 89.7 bits (221), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 45/97 (46%), Positives = 63/97 (64%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+   ++F  + I  AL  ++  NV   + G MGAVNGM PNG  D 
Sbjct: 759 ADQLAGQWYTASSGL--PSLFEDSKIKSALRKVYDFNVMKVKGGRMGAVNGMHPNGKVDE 816

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY L+A M++ G  +EA+TTA G++
Sbjct: 817 TCMQSREVWTGVTYGLAATMIHAGMEEEAFTTAEGIF 853


>gi|449691194|ref|XP_004212588.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Hydra
           magnipapillata]
          Length = 169

 Score = 89.4 bits (220), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 50/119 (42%), Positives = 69/119 (57%), Gaps = 4/119 (3%)

Query: 1   MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
           MA Q  G  +L A  +     + +F    +   L  IF  NV  FE G MGAVNGM+ +G
Sbjct: 17  MAGQCAGQWYLHACDLGQTEDDLVFRPEQVHSTLKVIFQNNVMRFEDGFMGAVNGMRADG 76

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEGV 116
             D++++Q+EEVWTGVTY L+A M+  G V E + TA G+YR + E    G +T PE +
Sbjct: 77  TVDKNSLQAEEVWTGVTYALAATMIQHGMVREGFRTAEGIYRTVYEKWGLGFQT-PEAI 134


>gi|168014916|ref|XP_001759997.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162688747|gb|EDQ75122.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 970

 Score = 89.0 bits (219), Expect = 4e-16,   Method: Composition-based stats.
 Identities = 46/97 (47%), Positives = 62/97 (63%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQ+ G  +  ASG+    +F  A    AL  IF  NV   + G +GAVNGM PNG  D 
Sbjct: 772 ADQMAGQWYAWASGL--PPLFDDAKARSALQKIFDFNVMKVKNGHLGAVNGMHPNGKVDE 829

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY L+AAM++EG +D+A+  A G++
Sbjct: 830 TCMQSREVWTGVTYALAAAMIHEGMLDQAFRAAEGVF 866


>gi|218199525|gb|EEC81952.1| hypothetical protein OsI_25831 [Oryza sativa Indica Group]
          Length = 850

 Score = 88.2 bits (217), Expect = 6e-16,   Method: Composition-based stats.
 Identities = 46/97 (47%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+    +F    I   L  IF  NV   + G MGAVNGM PNG  D 
Sbjct: 661 ADQLAGQWYTASSGL--PPLFDEGRIKCTLQKIFDYNVMRVKGGRMGAVNGMHPNGKVDE 718

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+WTGVTY L+A ML  G  D+A+TTA G++
Sbjct: 719 TCMQSREIWTGVTYSLAATMLLHGMDDQAFTTAEGIF 755


>gi|358254610|dbj|GAA55931.1| non-lysosomal glucosylceramidase [Clonorchis sinensis]
          Length = 1200

 Score = 87.8 bits (216), Expect = 7e-16,   Method: Composition-based stats.
 Identities = 44/98 (44%), Positives = 57/98 (58%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQL G+ FL+  G    AI PS ++  AL TI STN    + GS+G +NG  P   RD
Sbjct: 713 MADQLSGHGFLRVGGAPPGAILPSEHVIMALKTIQSTNWESIQNGSLGVINGGLPGAKRD 772

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            S +Q+EE W GV Y L+A M+ EG   E ++ A   Y
Sbjct: 773 LSNVQAEEFWVGVNYSLAATMILEGMSQEGFSLAEACY 810


>gi|357116998|ref|XP_003560263.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
           distachyon]
          Length = 932

 Score = 87.8 bits (216), Expect = 7e-16,   Method: Composition-based stats.
 Identities = 46/97 (47%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+    +F    I   L  IF  NV   + G MGAVNGM PNG  D 
Sbjct: 744 ADQLAGQWYAASSGL--PPLFDEGRIKCTLQKIFDYNVMRVKGGRMGAVNGMYPNGKVDE 801

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+WTGVTY L+A ML  G  D+A+TTA G++
Sbjct: 802 TCMQSREIWTGVTYSLAATMLLHGMEDQAFTTAEGIF 838


>gi|390344669|ref|XP_001200431.2| PREDICTED: non-lysosomal glucosylceramidase-like, partial
           [Strongylocentrotus purpuratus]
          Length = 347

 Score = 87.8 bits (216), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 43/100 (43%), Positives = 63/100 (63%), Gaps = 2/100 (2%)

Query: 1   MADQLCGYLFLKASGVNTE--AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
           MADQ CG+ +L+A  +  E   +F   ++  AL TIF  NV G + G+ GA+NG++P+G 
Sbjct: 193 MADQTCGHWYLRACDLVPERDQVFQVDHVRSALRTIFDMNVMGVKEGNFGAMNGIRPSGK 252

Query: 59  RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            D +++Q EEVW G TY L+  M+ EG  +E + TA G Y
Sbjct: 253 PDHTSLQGEEVWVGTTYGLAGNMIQEGMWEEGFRTAKGCY 292


>gi|222636955|gb|EEE67087.1| hypothetical protein OsJ_24071 [Oryza sativa Japonica Group]
          Length = 838

 Score = 87.4 bits (215), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 46/97 (47%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+    +F    I   L  IF  NV   + G MGAVNGM PNG  D 
Sbjct: 649 ADQLAGQWYTASSGL--PPLFDEGRIKCTLQKIFDYNVMRVKGGRMGAVNGMHPNGKVDE 706

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+WTGVTY L+A ML  G  D+A+TTA G++
Sbjct: 707 TCMQSREIWTGVTYSLAATMLLHGMDDQAFTTAEGIF 743


>gi|224072655|ref|XP_002303825.1| predicted protein [Populus trichocarpa]
 gi|222841257|gb|EEE78804.1| predicted protein [Populus trichocarpa]
          Length = 966

 Score = 87.0 bits (214), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 43/97 (44%), Positives = 61/97 (62%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  ++ +SG+   ++F    I  AL  I+  NV     G MGAVNGM PNG  D 
Sbjct: 776 ADQLAGEWYMASSGL--PSLFDDVKIRSALNKIYDFNVMKVRGGKMGAVNGMHPNGKVDE 833

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+W+GVTY ++A M+  G  D+A+TTA G++
Sbjct: 834 TCMQSREIWSGVTYAVAATMILSGMEDKAFTTAEGIF 870


>gi|302773445|ref|XP_002970140.1| hypothetical protein SELMODRAFT_92385 [Selaginella moellendorffii]
 gi|300162651|gb|EFJ29264.1| hypothetical protein SELMODRAFT_92385 [Selaginella moellendorffii]
          Length = 983

 Score = 86.7 bits (213), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 63/97 (64%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQ+ G  +  ASG+    +F       AL  I+  NV   + G +GAVNGM PNG  D 
Sbjct: 785 ADQMAGQWYAWASGL--PPLFDDYKARSALQRIYDYNVMKVKGGKLGAVNGMHPNGRVDE 842

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           +A+QS E+W+GVTY  +AAM++EG +++A+TTA G+Y
Sbjct: 843 TAMQSREIWSGVTYAAAAAMIHEGMLEQAFTTAEGIY 879


>gi|302807226|ref|XP_002985326.1| hypothetical protein SELMODRAFT_121820 [Selaginella moellendorffii]
 gi|300147154|gb|EFJ13820.1| hypothetical protein SELMODRAFT_121820 [Selaginella moellendorffii]
          Length = 983

 Score = 86.7 bits (213), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 63/97 (64%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQ+ G  +  ASG+    +F       AL  I+  NV   + G +GAVNGM PNG  D 
Sbjct: 785 ADQMAGQWYAWASGL--PPLFDDYKARSALQRIYDYNVMKVKGGKLGAVNGMHPNGRVDE 842

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           +A+QS E+W+GVTY  +AAM++EG +++A+TTA G+Y
Sbjct: 843 TAMQSREIWSGVTYAAAAAMIHEGMLEQAFTTAEGIY 879


>gi|326489639|dbj|BAK01800.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 797

 Score = 86.7 bits (213), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 45/97 (46%), Positives = 62/97 (63%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + KA G+++  I        AL  I++ NV  F+ G+ GA+NGM P+G  D 
Sbjct: 603 ADQLAGQWYAKACGLSS--IVDKDKSQSALEKIYAFNVMKFKDGNRGAINGMWPDGTLDM 660

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           S +QS E+W GVTY L+A+M+ EG VDE + TA G+Y
Sbjct: 661 STMQSREIWPGVTYALAASMIQEGMVDEGFKTAEGIY 697


>gi|147788372|emb|CAN61187.1| hypothetical protein VITISV_019326 [Vitis vinifera]
          Length = 900

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 43/97 (44%), Positives = 61/97 (62%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+   ++F    I  +L  I+  NV   + G MGAVNGM PNG  D 
Sbjct: 675 ADQLAGQWYTASSGL--PSLFDDCKIKSSLHKIYDFNVMKVKGGKMGAVNGMHPNGKVDE 732

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           S +QS E+WTGVTY ++A M+  G  ++A+TTA G++
Sbjct: 733 SCMQSREIWTGVTYGVAATMILSGMEEQAFTTAEGIF 769


>gi|414589150|tpg|DAA39721.1| TPA: hypothetical protein ZEAMMB73_266944 [Zea mays]
          Length = 583

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 46/97 (47%), Positives = 58/97 (59%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+    +F    I   L  IF  NV   + G MGAVNGM PNG  D 
Sbjct: 395 ADQLAGQWYTASSGL--PPLFDEDRIKCTLQKIFDYNVMRVKGGRMGAVNGMHPNGKVDE 452

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+WTGVTY L+A ML  G   +A+TTA G+Y
Sbjct: 453 TCMQSREIWTGVTYSLAATMLLHGMEHQAFTTAEGIY 489


>gi|242070593|ref|XP_002450573.1| hypothetical protein SORBIDRAFT_05g007160 [Sorghum bicolor]
 gi|241936416|gb|EES09561.1| hypothetical protein SORBIDRAFT_05g007160 [Sorghum bicolor]
          Length = 951

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 47/100 (47%), Positives = 64/100 (64%), Gaps = 8/100 (8%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARA---LATIFSTNVRGFEAGSMGAVNGMKPNGD 58
           ADQL G  + KA G     +FP  +  +A   L  IFS NV  F+ G+ GA+NGM P+G 
Sbjct: 757 ADQLAGQWYAKACG-----LFPIVDKDKAQSALEKIFSFNVMKFKDGTRGAMNGMWPDGT 811

Query: 59  RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            D SA+QS E+W GVTY L+A+M+ EG V++ + TA G+Y
Sbjct: 812 VDMSAMQSREIWPGVTYALAASMIQEGMVEQGFKTAEGIY 851


>gi|16183333|gb|AAL13692.1| GH26380p [Drosophila melanogaster]
          Length = 391

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 44/92 (47%), Positives = 56/92 (60%), Gaps = 8/92 (8%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
           MADQLCG+ +LK+ G + E I+P  N+  AL  I+  NV GF  G++GA NG   N    
Sbjct: 280 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEP 338

Query: 57  ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEG 85
              G  D S IQ+EEVW GV Y L+A M+ EG
Sbjct: 339 TKPGHVDNSNIQAEEVWPGVVYALAATMIQEG 370


>gi|253761389|ref|XP_002489100.1| hypothetical protein SORBIDRAFT_0073s002050 [Sorghum bicolor]
 gi|241947399|gb|EES20544.1| hypothetical protein SORBIDRAFT_0073s002050 [Sorghum bicolor]
          Length = 603

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 46/97 (47%), Positives = 58/97 (59%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+    +F    I   L  IF  NV   + G MGAVNGM PNG  D 
Sbjct: 415 ADQLAGQWYTASSGL--PPLFDEDRIKCTLQKIFDYNVMRVKGGRMGAVNGMHPNGKVDD 472

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+WTGVTY L+A ML  G   +A+TTA G+Y
Sbjct: 473 TCMQSREIWTGVTYSLAATMLLHGMEHQAFTTAEGIY 509


>gi|225433442|ref|XP_002285674.1| PREDICTED: non-lysosomal glucosylceramidase-like [Vitis vinifera]
          Length = 978

 Score = 85.5 bits (210), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 43/97 (44%), Positives = 61/97 (62%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+   ++F    I  +L  I+  NV   + G MGAVNGM PNG  D 
Sbjct: 789 ADQLAGQWYTASSGL--PSLFDDYKIKSSLHKIYDFNVMKVKGGKMGAVNGMHPNGKVDE 846

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           S +QS E+WTGVTY ++A M+  G  ++A+TTA G++
Sbjct: 847 SCMQSREIWTGVTYGVAATMILSGMEEQAFTTAEGIF 883


>gi|413920603|gb|AFW60535.1| hypothetical protein ZEAMMB73_659971 [Zea mays]
          Length = 951

 Score = 85.1 bits (209), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 45/100 (45%), Positives = 64/100 (64%), Gaps = 8/100 (8%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARA---LATIFSTNVRGFEAGSMGAVNGMKPNGD 58
           ADQL G  + KA G     +FP  +  +A   L  ++S NV  F+ G+ GA+NGM P+G 
Sbjct: 757 ADQLAGQWYAKACG-----LFPIVDKDKAQSALEKVYSFNVMKFKGGTRGAMNGMWPDGT 811

Query: 59  RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            D SA+QS E+W GVTY L+A+M+ EG V++ + TA G+Y
Sbjct: 812 VDMSAMQSREIWPGVTYALAASMIQEGMVEQGFKTAEGVY 851


>gi|413941636|gb|AFW74285.1| hypothetical protein ZEAMMB73_466373 [Zea mays]
          Length = 966

 Score = 84.7 bits (208), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 45/98 (45%), Positives = 64/98 (65%), Gaps = 2/98 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQL G  + +A G+  E +   A    ALAT+   NV   + G++GAVNGM+P+G  D
Sbjct: 763 MADQLAGQWYARACGL--EPVVEDAKARSALATVLDYNVMRVKGGAIGAVNGMRPDGAVD 820

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            S+ QS+EVW GVTY ++AAM++EG  + A+ TA G +
Sbjct: 821 ASSTQSKEVWPGVTYAVAAAMVHEGMTEAAFRTAKGAH 858


>gi|115474457|ref|NP_001060825.1| Os08g0111200 [Oryza sativa Japonica Group]
 gi|42408344|dbj|BAD09496.1| putative Bile acid beta-glucosidase [Oryza sativa Japonica Group]
 gi|42408391|dbj|BAD09542.1| putative Bile acid beta-glucosidase [Oryza sativa Japonica Group]
 gi|113622794|dbj|BAF22739.1| Os08g0111200 [Oryza sativa Japonica Group]
 gi|215701490|dbj|BAG92914.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 928

 Score = 84.7 bits (208), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 45/97 (46%), Positives = 64/97 (65%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + +A G+  E I       RALAT+   NV   + G++GAVNGM+P+G  D 
Sbjct: 730 ADQLAGQWYARACGL--EPIVDGDKARRALATVLDYNVMRVKGGAIGAVNGMRPDGAVDA 787

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           S+ QS+EVW GVTY ++AAM++EG  + A+ TA G++
Sbjct: 788 SSTQSKEVWPGVTYAVAAAMIHEGMPEAAFKTAKGIH 824


>gi|168037390|ref|XP_001771187.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677567|gb|EDQ64036.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 981

 Score = 84.7 bits (208), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  ASG+    +F       AL  +F  NV   + G  GA NGM PNG  D 
Sbjct: 767 ADQLAGQWYAWASGL--PPLFDDYKARSALQKVFDFNVMKVKGGKWGAANGMHPNGKVDE 824

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+WTGVTY  SAAM++EG V++A+T A G++
Sbjct: 825 TCMQSREIWTGVTYAASAAMIHEGMVEQAFTAAQGVF 861


>gi|357440839|ref|XP_003590697.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
 gi|355479745|gb|AES60948.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
          Length = 1103

 Score = 84.3 bits (207), Expect = 8e-15,   Method: Composition-based stats.
 Identities = 42/97 (43%), Positives = 60/97 (61%), Gaps = 2/97 (2%)

Query: 2    ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
            ADQL G  +  +SG+   ++F    I  +L  +F  NV   + G MGAVNGM PNG  D 
Sbjct: 914  ADQLAGQWYTASSGL--PSLFDDFKIKSSLRKVFDFNVMKVKGGRMGAVNGMHPNGKVDE 971

Query: 62   SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            + +QS E+W GVTY ++A M+  G  +EA+TTA G++
Sbjct: 972  TCMQSREIWAGVTYGVAATMILAGMEEEAFTTAEGIF 1008


>gi|297809209|ref|XP_002872488.1| AT4g10060/T5L19_190 [Arabidopsis lyrata subsp. lyrata]
 gi|297318325|gb|EFH48747.1| AT4g10060/T5L19_190 [Arabidopsis lyrata subsp. lyrata]
          Length = 921

 Score = 84.3 bits (207), Expect = 8e-15,   Method: Composition-based stats.
 Identities = 45/116 (38%), Positives = 66/116 (56%), Gaps = 3/116 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           +ADQL G  + +A G+  + I     I +AL TI+  NV   + G+ GAVNGM P+G  D
Sbjct: 719 LADQLAGQWYARACGL--KPITKEEWIKKALETIYEFNVMKVKGGTRGAVNGMSPDGQVD 776

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCE-TAPEG 115
            +++ S+EVW G TY ++A M+ EG  +  + TA G+Y  +  DR   C    PE 
Sbjct: 777 TNSLVSKEVWAGTTYSVAACMIQEGQRERGFQTASGIYEAVWSDRGLSCSFQTPEA 832


>gi|356576421|ref|XP_003556330.1| PREDICTED: non-lysosomal glucosylceramidase-like [Glycine max]
          Length = 950

 Score = 84.3 bits (207), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+    +F  + I  AL  ++  NV   + G MGAVNGM PNG  D 
Sbjct: 760 ADQLAGQWYTASSGL--PPLFEDSKIKSALRKVYDFNVMKVKGGRMGAVNGMHPNGKVDE 817

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY L+A M+  G  +EA+ TA G++
Sbjct: 818 TCMQSREVWTGVTYGLAATMILAGMEEEAFATAEGIF 854


>gi|62733283|gb|AAX95400.1| At5g49900 [Oryza sativa Japonica Group]
          Length = 931

 Score = 84.0 bits (206), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 46/100 (46%), Positives = 62/100 (62%), Gaps = 8/100 (8%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARA---LATIFSTNVRGFEAGSMGAVNGMKPNGD 58
           ADQL G  + KA G     +FP  +  +A   L  I+S NV  F+ G  GA+NGM P+G 
Sbjct: 738 ADQLAGQWYAKACG-----LFPIVDKDKAESALEKIYSFNVMKFKDGKRGAMNGMWPDGT 792

Query: 59  RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            D SA+QS E+W GVTY L+A M+ EG V++ + TA G+Y
Sbjct: 793 VDMSAMQSREIWPGVTYALAATMIQEGMVEKGFKTAEGIY 832


>gi|357157041|ref|XP_003577664.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
           distachyon]
          Length = 951

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 44/97 (45%), Positives = 61/97 (62%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + K+ G+++  I        AL  I+S NV  F+ G  GA+NGM P+G  D 
Sbjct: 757 ADQLAGQWYAKSCGLSS--IVDKDKSQSALEKIYSFNVMKFKDGKRGAMNGMWPDGTVDM 814

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           S +QS E+W GVTY L+A+M+ EG V+E + TA G+Y
Sbjct: 815 STMQSREIWPGVTYALAASMIQEGMVEEGFKTAEGIY 851


>gi|357440847|ref|XP_003590701.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
 gi|355479749|gb|AES60952.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
          Length = 992

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 42/97 (43%), Positives = 60/97 (61%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+   ++F    I  +L  +F  NV   + G MGAVNGM PNG  D 
Sbjct: 803 ADQLAGQWYTASSGL--PSLFDDFKIKSSLRKVFDFNVMKVKGGRMGAVNGMHPNGKVDE 860

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+W GVTY ++A M+  G  +EA+TTA G++
Sbjct: 861 TCMQSREIWAGVTYGVAATMILAGMEEEAFTTAEGIF 897


>gi|326517703|dbj|BAK03770.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 821

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 45/98 (45%), Positives = 63/98 (64%), Gaps = 2/98 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQL G  + +A G+  E I        AL T+   NV   + G++GAVNGM+P+G  D
Sbjct: 615 MADQLAGQWYARACGL--EPIVEEEKARSALGTVLDYNVMRVQGGAVGAVNGMRPDGAVD 672

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            S++QS+EVW GVTY ++AAML+EG  + A+ TA G +
Sbjct: 673 ASSLQSKEVWVGVTYGVAAAMLHEGMTEAAFRTAKGAH 710


>gi|115484889|ref|NP_001067588.1| Os11g0242100 [Oryza sativa Japonica Group]
 gi|77549531|gb|ABA92328.1| expressed protein [Oryza sativa Japonica Group]
 gi|113644810|dbj|BAF27951.1| Os11g0242100 [Oryza sativa Japonica Group]
 gi|215704397|dbj|BAG93831.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222615769|gb|EEE51901.1| hypothetical protein OsJ_33494 [Oryza sativa Japonica Group]
          Length = 950

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 46/100 (46%), Positives = 62/100 (62%), Gaps = 8/100 (8%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARA---LATIFSTNVRGFEAGSMGAVNGMKPNGD 58
           ADQL G  + KA G     +FP  +  +A   L  I+S NV  F+ G  GA+NGM P+G 
Sbjct: 757 ADQLAGQWYAKACG-----LFPIVDKDKAESALEKIYSFNVMKFKDGKRGAMNGMWPDGT 811

Query: 59  RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            D SA+QS E+W GVTY L+A M+ EG V++ + TA G+Y
Sbjct: 812 VDMSAMQSREIWPGVTYALAATMIQEGMVEKGFKTAEGIY 851


>gi|218185509|gb|EEC67936.1| hypothetical protein OsI_35656 [Oryza sativa Indica Group]
          Length = 950

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 46/100 (46%), Positives = 62/100 (62%), Gaps = 8/100 (8%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARA---LATIFSTNVRGFEAGSMGAVNGMKPNGD 58
           ADQL G  + KA G     +FP  +  +A   L  I+S NV  F+ G  GA+NGM P+G 
Sbjct: 757 ADQLAGQWYAKACG-----LFPIVDKDKAESALEKIYSFNVMKFKDGKRGAMNGMWPDGT 811

Query: 59  RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            D SA+QS E+W GVTY L+A M+ EG V++ + TA G+Y
Sbjct: 812 VDMSAMQSREIWPGVTYALAATMIQEGMVEKGFKTAEGIY 851


>gi|188570649|gb|ACD64360.1| hypothetical protein [Helianthus petiolaris]
          Length = 210

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM PNG  D 
Sbjct: 70  ADQLAGQWYTASAGLPN--LFDDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY ++A M++ G  D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 164


>gi|188570667|gb|ACD64369.1| hypothetical protein [Bahiopsis lanata]
 gi|188570669|gb|ACD64370.1| hypothetical protein [Bahiopsis lanata]
          Length = 208

 Score = 83.6 bits (205), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM PNG  D 
Sbjct: 68  ADQLAGQWYTASAGLPN--LFGDDKIQSSLHKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 125

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY ++A M++ G  D+A+TTA G++
Sbjct: 126 TCMQSREVWTGVTYGVAATMIHAGMEDKAFTTAEGIF 162


>gi|292384308|gb|ADE21407.1| unknown [Helianthus neglectus]
          Length = 205

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM PNG  D 
Sbjct: 67  ADQLAGQWYTASAGLPN--LFDDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 124

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY ++A M++ G  D A+TTA G++
Sbjct: 125 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 161


>gi|440799031|gb|ELR20092.1| glucosylceramidase [Acanthamoeba castellanii str. Neff]
          Length = 860

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 62/104 (59%), Gaps = 5/104 (4%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
            ADQLCG  + +A G+ +  I P   I  AL T+F+ N   F  G  GAVNGM+P+G  D
Sbjct: 711 QADQLCGQWYARACGLPS--IAPDKRIVTALKTVFANN---FNEGKRGAVNGMRPDGRVD 765

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
            + + S E W G T+ L+A M+ EG VD+A+ TA G+   + ED
Sbjct: 766 YACLHSSEAWPGTTFALAACMMGEGLVDQAFITAAGVCGSIYED 809


>gi|188570531|gb|ACD64301.1| hypothetical protein [Helianthus annuus]
          Length = 210

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  ++G+    +F    I  +L  IF+ NV     G MGAVNGM PNG  D 
Sbjct: 70  ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFNFNVMKVRGGRMGAVNGMHPNGKVDE 127

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY ++A M++ G  D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 164


>gi|292384324|gb|ADE21415.1| unknown [Helianthus neglectus]
          Length = 210

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM PNG  D 
Sbjct: 70  ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY ++A M++ G  D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 164


>gi|8777421|dbj|BAA97011.1| unnamed protein product [Arabidopsis thaliana]
          Length = 928

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + +ASG+    I        AL  +++ NV   + G  GAVNGM PNG  D 
Sbjct: 726 ADQLAGQWYARASGL--LPIVDEDKARTALEKVYNYNVMKIKDGKRGAVNGMHPNGKVDT 783

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           +++QS E+W+GVTY LSA M+ EG V+ A+ TA G+Y
Sbjct: 784 ASMQSREIWSGVTYALSATMIQEGLVEMAFQTASGIY 820


>gi|188570631|gb|ACD64351.1| hypothetical protein [Helianthus petiolaris]
          Length = 210

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM PNG  D 
Sbjct: 70  ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY ++A M++ G  D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 164


>gi|292384310|gb|ADE21408.1| unknown [Helianthus neglectus]
          Length = 210

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM PNG  D 
Sbjct: 70  ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY ++A M++ G  D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 164


>gi|255554158|ref|XP_002518119.1| conserved hypothetical protein [Ricinus communis]
 gi|223542715|gb|EEF44252.1| conserved hypothetical protein [Ricinus communis]
          Length = 968

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 41/97 (42%), Positives = 58/97 (59%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  ++ +SG+    +F  + I   L  I+  NV     G MGAVNGM PNG  D 
Sbjct: 779 ADQLAGQWYVASSGL--PPLFDDSKIKSTLQKIYDFNVMKVRGGRMGAVNGMHPNGKVDE 836

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+WTGVTY ++A M+  G  D+A+  A G++
Sbjct: 837 TCMQSREIWTGVTYAVAATMILAGMEDKAFAAAEGIF 873


>gi|188570647|gb|ACD64359.1| hypothetical protein [Helianthus petiolaris]
          Length = 210

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM PNG  D 
Sbjct: 70  ADQLAGQWYTASAGLPN--LFGDDKIPSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY ++A M++ G  D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 164


>gi|188570671|gb|ACD64371.1| hypothetical protein [Bahiopsis reticulata]
 gi|188570673|gb|ACD64372.1| hypothetical protein [Bahiopsis reticulata]
          Length = 210

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM PNG  D 
Sbjct: 70  ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY ++A M++ G  D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDSAFTTAEGIF 164


>gi|188570635|gb|ACD64353.1| hypothetical protein [Helianthus petiolaris]
 gi|188570639|gb|ACD64355.1| hypothetical protein [Helianthus petiolaris]
          Length = 211

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM PNG  D 
Sbjct: 71  ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 128

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY ++A M++ G  D A+TTA G++
Sbjct: 129 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 165


>gi|188570553|gb|ACD64312.1| hypothetical protein [Helianthus annuus]
          Length = 210

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM PNG  D 
Sbjct: 70  ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY ++A M++ G  D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 164


>gi|188570503|gb|ACD64287.1| hypothetical protein [Helianthus annuus]
 gi|188570505|gb|ACD64288.1| hypothetical protein [Helianthus annuus]
 gi|188570507|gb|ACD64289.1| hypothetical protein [Helianthus annuus]
 gi|188570509|gb|ACD64290.1| hypothetical protein [Helianthus annuus]
 gi|188570513|gb|ACD64292.1| hypothetical protein [Helianthus annuus]
 gi|188570515|gb|ACD64293.1| hypothetical protein [Helianthus annuus]
 gi|188570517|gb|ACD64294.1| hypothetical protein [Helianthus annuus]
 gi|188570519|gb|ACD64295.1| hypothetical protein [Helianthus annuus]
 gi|188570521|gb|ACD64296.1| hypothetical protein [Helianthus annuus]
 gi|188570523|gb|ACD64297.1| hypothetical protein [Helianthus annuus]
 gi|188570525|gb|ACD64298.1| hypothetical protein [Helianthus annuus]
 gi|188570527|gb|ACD64299.1| hypothetical protein [Helianthus annuus]
 gi|188570529|gb|ACD64300.1| hypothetical protein [Helianthus annuus]
 gi|188570533|gb|ACD64302.1| hypothetical protein [Helianthus annuus]
 gi|188570535|gb|ACD64303.1| hypothetical protein [Helianthus annuus]
 gi|188570537|gb|ACD64304.1| hypothetical protein [Helianthus annuus]
 gi|188570539|gb|ACD64305.1| hypothetical protein [Helianthus annuus]
 gi|188570541|gb|ACD64306.1| hypothetical protein [Helianthus annuus]
 gi|188570543|gb|ACD64307.1| hypothetical protein [Helianthus annuus]
 gi|188570545|gb|ACD64308.1| hypothetical protein [Helianthus annuus]
 gi|188570547|gb|ACD64309.1| hypothetical protein [Helianthus annuus]
 gi|188570549|gb|ACD64310.1| hypothetical protein [Helianthus annuus]
 gi|188570551|gb|ACD64311.1| hypothetical protein [Helianthus annuus]
 gi|188570555|gb|ACD64313.1| hypothetical protein [Helianthus annuus]
 gi|188570557|gb|ACD64314.1| hypothetical protein [Helianthus annuus]
 gi|188570559|gb|ACD64315.1| hypothetical protein [Helianthus annuus]
 gi|188570561|gb|ACD64316.1| hypothetical protein [Helianthus annuus]
 gi|188570563|gb|ACD64317.1| hypothetical protein [Helianthus annuus]
 gi|188570565|gb|ACD64318.1| hypothetical protein [Helianthus annuus]
 gi|188570567|gb|ACD64319.1| hypothetical protein [Helianthus annuus]
 gi|188570569|gb|ACD64320.1| hypothetical protein [Helianthus annuus]
 gi|188570571|gb|ACD64321.1| hypothetical protein [Helianthus annuus]
 gi|188570573|gb|ACD64322.1| hypothetical protein [Helianthus annuus]
 gi|188570575|gb|ACD64323.1| hypothetical protein [Helianthus annuus]
 gi|188570577|gb|ACD64324.1| hypothetical protein [Helianthus annuus]
 gi|188570579|gb|ACD64325.1| hypothetical protein [Helianthus annuus]
 gi|188570581|gb|ACD64326.1| hypothetical protein [Helianthus annuus]
 gi|188570583|gb|ACD64327.1| hypothetical protein [Helianthus annuus]
 gi|188570585|gb|ACD64328.1| hypothetical protein [Helianthus annuus]
 gi|188570587|gb|ACD64329.1| hypothetical protein [Helianthus annuus]
 gi|188570589|gb|ACD64330.1| hypothetical protein [Helianthus annuus]
 gi|188570591|gb|ACD64331.1| hypothetical protein [Helianthus annuus]
 gi|188570593|gb|ACD64332.1| hypothetical protein [Helianthus annuus]
 gi|188570595|gb|ACD64333.1| hypothetical protein [Helianthus annuus]
 gi|188570597|gb|ACD64334.1| hypothetical protein [Helianthus annuus]
 gi|188570599|gb|ACD64335.1| hypothetical protein [Helianthus annuus]
 gi|188570601|gb|ACD64336.1| hypothetical protein [Helianthus annuus]
 gi|188570603|gb|ACD64337.1| hypothetical protein [Helianthus petiolaris]
 gi|188570605|gb|ACD64338.1| hypothetical protein [Helianthus petiolaris]
 gi|188570607|gb|ACD64339.1| hypothetical protein [Helianthus petiolaris]
 gi|188570609|gb|ACD64340.1| hypothetical protein [Helianthus petiolaris]
 gi|188570611|gb|ACD64341.1| hypothetical protein [Helianthus petiolaris]
 gi|188570613|gb|ACD64342.1| hypothetical protein [Helianthus petiolaris]
 gi|188570615|gb|ACD64343.1| hypothetical protein [Helianthus petiolaris]
 gi|188570617|gb|ACD64344.1| hypothetical protein [Helianthus petiolaris]
 gi|188570619|gb|ACD64345.1| hypothetical protein [Helianthus petiolaris]
 gi|188570625|gb|ACD64348.1| hypothetical protein [Helianthus petiolaris]
 gi|188570627|gb|ACD64349.1| hypothetical protein [Helianthus petiolaris]
 gi|188570629|gb|ACD64350.1| hypothetical protein [Helianthus petiolaris]
 gi|188570633|gb|ACD64352.1| hypothetical protein [Helianthus petiolaris]
 gi|188570637|gb|ACD64354.1| hypothetical protein [Helianthus petiolaris]
 gi|188570641|gb|ACD64356.1| hypothetical protein [Helianthus petiolaris]
 gi|188570643|gb|ACD64357.1| hypothetical protein [Helianthus petiolaris]
 gi|188570645|gb|ACD64358.1| hypothetical protein [Helianthus petiolaris]
 gi|188570651|gb|ACD64361.1| hypothetical protein [Helianthus petiolaris]
 gi|188570653|gb|ACD64362.1| hypothetical protein [Helianthus petiolaris]
 gi|188570655|gb|ACD64363.1| hypothetical protein [Helianthus petiolaris]
 gi|188570657|gb|ACD64364.1| hypothetical protein [Helianthus petiolaris]
 gi|188570659|gb|ACD64365.1| hypothetical protein [Helianthus petiolaris]
 gi|188570661|gb|ACD64366.1| hypothetical protein [Helianthus petiolaris]
 gi|188570663|gb|ACD64367.1| hypothetical protein [Helianthus petiolaris]
 gi|188570665|gb|ACD64368.1| hypothetical protein [Helianthus petiolaris]
 gi|292384298|gb|ADE21402.1| unknown [Helianthus neglectus]
 gi|292384300|gb|ADE21403.1| unknown [Helianthus neglectus]
 gi|292384302|gb|ADE21404.1| unknown [Helianthus neglectus]
 gi|292384304|gb|ADE21405.1| unknown [Helianthus neglectus]
 gi|292384312|gb|ADE21409.1| unknown [Helianthus neglectus]
 gi|292384318|gb|ADE21412.1| unknown [Helianthus neglectus]
 gi|292384320|gb|ADE21413.1| unknown [Helianthus neglectus]
 gi|292384328|gb|ADE21417.1| unknown [Helianthus neglectus]
          Length = 210

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM PNG  D 
Sbjct: 70  ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY ++A M++ G  D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 164


>gi|292384322|gb|ADE21414.1| unknown [Helianthus neglectus]
          Length = 207

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM PNG  D 
Sbjct: 67  ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 124

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY ++A M++ G  D A+TTA G++
Sbjct: 125 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 161


>gi|188570511|gb|ACD64291.1| hypothetical protein [Helianthus annuus]
          Length = 210

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM PNG  D 
Sbjct: 70  ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY ++A M++ G  D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 164


>gi|292384326|gb|ADE21416.1| unknown [Helianthus neglectus]
          Length = 210

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM PNG  D 
Sbjct: 70  ADQLAGQWYTASAGLPN--LFGEDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY ++A M++ G  D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 164


>gi|414871104|tpg|DAA49661.1| TPA: hypothetical protein ZEAMMB73_422318 [Zea mays]
          Length = 649

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 43/97 (44%), Positives = 57/97 (58%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+    +F    I  AL  IF  NV   + G MGAVNGM P G  D 
Sbjct: 460 ADQLAGQWYAASSGL--PPLFDEHKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDE 517

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+WTGVTY ++A ML  G   + +TTA G++
Sbjct: 518 TCMQSREIWTGVTYAVAANMLLHGMEHQGFTTAEGIF 554


>gi|224028315|gb|ACN33233.1| unknown [Zea mays]
          Length = 384

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 43/97 (44%), Positives = 57/97 (58%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+    +F    I  AL  IF  NV   + G MGAVNGM P G  D 
Sbjct: 255 ADQLAGQWYAASSGL--PPLFDEHKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDE 312

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+WTGVTY ++A ML  G   + +TTA G++
Sbjct: 313 TCMQSREIWTGVTYAVAANMLLHGMEHQGFTTAEGIF 349


>gi|292384316|gb|ADE21411.1| unknown [Helianthus neglectus]
          Length = 205

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM PNG  D 
Sbjct: 67  ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 124

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY ++A M++ G  D A+TTA G++
Sbjct: 125 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 161


>gi|226499590|ref|NP_001146319.1| uncharacterized protein LOC100279895 [Zea mays]
 gi|219886613|gb|ACL53681.1| unknown [Zea mays]
          Length = 649

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 43/97 (44%), Positives = 57/97 (58%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+    +F    I  AL  IF  NV   + G MGAVNGM P G  D 
Sbjct: 460 ADQLAGQWYAASSGL--PPLFDEHKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDE 517

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+WTGVTY ++A ML  G   + +TTA G++
Sbjct: 518 TCMQSREIWTGVTYAVAANMLLHGMEHQGFTTAEGIF 554


>gi|292384306|gb|ADE21406.1| unknown [Helianthus neglectus]
          Length = 205

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM PNG  D 
Sbjct: 67  ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 124

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY ++A M++ G  D A+TTA G++
Sbjct: 125 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 161


>gi|292384314|gb|ADE21410.1| unknown [Helianthus neglectus]
          Length = 205

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM PNG  D 
Sbjct: 67  ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 124

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY ++A M++ G  D A+TTA G++
Sbjct: 125 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 161


>gi|110742221|dbj|BAE99037.1| hypothetical protein [Arabidopsis thaliana]
          Length = 957

 Score = 82.8 bits (203), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + +ASG+    I        AL  +++ NV   + G  GAVNGM PNG  D 
Sbjct: 755 ADQLAGQWYARASGL--LPIVDEDKARTALEKVYNYNVMKIKDGKRGAVNGMHPNGKVDT 812

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           +++QS E+W+GVTY LSA M+ EG V+ A+ TA G+Y
Sbjct: 813 ASMQSREIWSGVTYALSATMIQEGLVEMAFQTASGIY 849


>gi|222423179|dbj|BAH19567.1| AT5G49900 [Arabidopsis thaliana]
          Length = 957

 Score = 82.8 bits (203), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + +ASG+    I        AL  +++ NV   + G  GAVNGM PNG  D 
Sbjct: 755 ADQLAGQWYARASGL--LPIVDEDKARTALEKVYNYNVMKIKDGKRGAVNGMHPNGKVDT 812

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           +++QS E+W+GVTY LSA M+ EG V+ A+ TA G+Y
Sbjct: 813 ASMQSREIWSGVTYALSATMIQEGLVEMAFQTASGIY 849


>gi|30695777|ref|NP_199801.2| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
 gi|27311753|gb|AAO00842.1| putative protein [Arabidopsis thaliana]
 gi|34365725|gb|AAQ65174.1| At5g49900 [Arabidopsis thaliana]
 gi|332008486|gb|AED95869.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
          Length = 957

 Score = 82.8 bits (203), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + +ASG+    I        AL  +++ NV   + G  GAVNGM PNG  D 
Sbjct: 755 ADQLAGQWYARASGL--LPIVDEDKARTALEKVYNYNVMKIKDGKRGAVNGMHPNGKVDT 812

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           +++QS E+W+GVTY LSA M+ EG V+ A+ TA G+Y
Sbjct: 813 ASMQSREIWSGVTYALSATMIQEGLVEMAFQTASGIY 849


>gi|414871105|tpg|DAA49662.1| TPA: hypothetical protein ZEAMMB73_422318 [Zea mays]
          Length = 589

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 43/97 (44%), Positives = 57/97 (58%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+    +F    I  AL  IF  NV   + G MGAVNGM P G  D 
Sbjct: 460 ADQLAGQWYAASSGL--PPLFDEHKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDE 517

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+WTGVTY ++A ML  G   + +TTA G++
Sbjct: 518 TCMQSREIWTGVTYAVAANMLLHGMEHQGFTTAEGIF 554


>gi|297851790|ref|XP_002893776.1| hypothetical protein ARALYDRAFT_473529 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339618|gb|EFH70035.1| hypothetical protein ARALYDRAFT_473529 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 947

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 45/109 (41%), Positives = 62/109 (56%), Gaps = 2/109 (1%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           +ADQL G  + +A G+  + I     I +AL T++  NV     G+ GAVNGM P+G  D
Sbjct: 746 LADQLAGQWYARACGL--KPIAKEEWIKKALETVYDFNVMRVRDGTRGAVNGMLPDGRVD 803

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGC 109
            S + S EVW G TY ++A M+ EG  D+ + TA G+Y     DR  GC
Sbjct: 804 TSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEAAWSDRGLGC 852


>gi|449442371|ref|XP_004138955.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
 gi|449520930|ref|XP_004167485.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
          Length = 993

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 43/97 (44%), Positives = 58/97 (59%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+    +F    I  AL  I+  NV     G MGAVNGM PNG  D 
Sbjct: 803 ADQLAGQWYTASSGL--PPLFDDFKIKSALRKIYDFNVMKVRGGRMGAVNGMHPNGKIDE 860

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+WTGVTY ++A M+  G  +EA+ TA G++
Sbjct: 861 TCMQSREIWTGVTYGVAATMILAGMEEEAFKTAEGIF 897


>gi|255569126|ref|XP_002525532.1| conserved hypothetical protein [Ricinus communis]
 gi|223535211|gb|EEF36890.1| conserved hypothetical protein [Ricinus communis]
          Length = 948

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 43/97 (44%), Positives = 59/97 (60%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + +A G+++  I     +  AL  I+  NV   + G  GAVNGM P+G  D 
Sbjct: 749 ADQLAGQWYARACGLSS--IVDEEKVRSALEKIYKFNVLKVKEGKRGAVNGMLPDGKVDM 806

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           S +QS E+W GVTY LSA+M+ EG  + A+ TA G+Y
Sbjct: 807 SVMQSREIWPGVTYALSASMIQEGMAEMAFQTASGIY 843


>gi|10440626|gb|AAG16864.1|AC069145_13 unknown protein [Oryza sativa Japonica Group]
          Length = 444

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 43/97 (44%), Positives = 58/97 (59%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+    +F    I  AL  IF  NV   + G +GAVNGM PNG  D 
Sbjct: 255 ADQLAGQWYAASSGL--PPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDE 312

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+WTGVTY ++A ML  G   + +TTA G++
Sbjct: 313 TCMQSREIWTGVTYGVAANMLLHGMEHQGFTTAEGIF 349


>gi|222612993|gb|EEE51125.1| hypothetical protein OsJ_31868 [Oryza sativa Japonica Group]
          Length = 935

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 43/97 (44%), Positives = 58/97 (59%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+    +F    I  AL  IF  NV   + G +GAVNGM PNG  D 
Sbjct: 746 ADQLAGQWYAASSGL--PPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDE 803

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+WTGVTY ++A ML  G   + +TTA G++
Sbjct: 804 TCMQSREIWTGVTYGVAANMLLHGMEHQGFTTAEGIF 840


>gi|16905165|gb|AAL31035.1|AC078948_19 unknown protein [Oryza sativa Japonica Group]
          Length = 967

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 43/97 (44%), Positives = 58/97 (59%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+    +F    I  AL  IF  NV   + G +GAVNGM PNG  D 
Sbjct: 778 ADQLAGQWYAASSGL--PPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDE 835

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+WTGVTY ++A ML  G   + +TTA G++
Sbjct: 836 TCMQSREIWTGVTYGVAANMLLHGMEHQGFTTAEGIF 872


>gi|115482480|ref|NP_001064833.1| Os10g0473400 [Oryza sativa Japonica Group]
 gi|110289241|gb|AAP54244.2| expressed protein [Oryza sativa Japonica Group]
 gi|113639442|dbj|BAF26747.1| Os10g0473400 [Oryza sativa Japonica Group]
 gi|215695411|dbj|BAG90602.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 974

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 43/97 (44%), Positives = 58/97 (59%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+    +F    I  AL  IF  NV   + G +GAVNGM PNG  D 
Sbjct: 785 ADQLAGQWYAASSGL--PPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDE 842

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+WTGVTY ++A ML  G   + +TTA G++
Sbjct: 843 TCMQSREIWTGVTYGVAANMLLHGMEHQGFTTAEGIF 879


>gi|218184729|gb|EEC67156.1| hypothetical protein OsI_34005 [Oryza sativa Indica Group]
          Length = 974

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 43/97 (44%), Positives = 58/97 (59%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+    +F    I  AL  IF  NV   + G +GAVNGM PNG  D 
Sbjct: 785 ADQLAGQWYAASSGL--PPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDE 842

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+WTGVTY ++A ML  G   + +TTA G++
Sbjct: 843 TCMQSREIWTGVTYGVAANMLLHGMEHQGFTTAEGIF 879


>gi|297795757|ref|XP_002865763.1| hypothetical protein ARALYDRAFT_331392 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311598|gb|EFH42022.1| hypothetical protein ARALYDRAFT_331392 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 956

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + +ASG+    I        AL  +++ NV   + G  GAVNGM PNG  D 
Sbjct: 755 ADQLAGQWYARASGL--LPIVDEDKARTALEKVYNFNVMKIKDGKRGAVNGMHPNGKVDT 812

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           +++QS E+W+GVTY LSA M+ EG V+ A+ TA G+Y
Sbjct: 813 ASMQSREIWSGVTYALSATMIQEGLVEMAFQTASGVY 849


>gi|326507888|dbj|BAJ86687.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 676

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 45/98 (45%), Positives = 61/98 (62%), Gaps = 2/98 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQL G  + +A G+  E I        AL T+   NV   + G++GAVNGM+P+G  D
Sbjct: 497 MADQLAGQWYARACGL--EPIVEEEKARSALGTVLDYNVMRVQGGAIGAVNGMRPDGTVD 554

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            S+IQS EVW GVTY ++AAM +EG  + A+ TA G +
Sbjct: 555 TSSIQSREVWPGVTYGVAAAMAHEGMPEAAFRTAKGAH 592


>gi|30692911|ref|NP_174631.2| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
 gi|79319136|ref|NP_001031135.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
 gi|332193493|gb|AEE31614.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
 gi|332193494|gb|AEE31615.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
          Length = 947

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 44/109 (40%), Positives = 62/109 (56%), Gaps = 2/109 (1%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           +ADQ+ G  + +A G+  + I     I +AL T++  NV     G+ GAVNGM P+G  D
Sbjct: 746 LADQMAGQWYARACGL--KPIAKEEWIKKALETVYDFNVMRVRDGTRGAVNGMLPDGRVD 803

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGC 109
            S + S EVW G TY ++A M+ EG  D+ + TA G+Y     DR  GC
Sbjct: 804 TSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEAAWSDRGLGC 852


>gi|297741007|emb|CBI31319.3| unnamed protein product [Vitis vinifera]
          Length = 953

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 42/97 (43%), Positives = 60/97 (61%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + +A G+  + I        AL  +++ NV   + G  GAVNGM P+G  D 
Sbjct: 753 ADQLAGQWYARACGL--QPIVDDEKARSALEKVYNFNVLKVKEGKCGAVNGMLPDGRVDM 810

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           SA+QS E+W GVTY ++A M++EG V+ A+ TA G+Y
Sbjct: 811 SAMQSREIWAGVTYSVAANMIHEGMVETAFNTASGIY 847


>gi|28393605|gb|AAO42222.1| unknown protein [Arabidopsis thaliana]
          Length = 947

 Score = 82.0 bits (201), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 44/109 (40%), Positives = 62/109 (56%), Gaps = 2/109 (1%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           +ADQ+ G  + +A G+  + I     I +AL T++  NV     G+ GAVNGM P+G  D
Sbjct: 746 LADQMAGQWYARACGL--KPIAKEEWIKKALETVYDFNVMRVRDGTRGAVNGMLPDGRVD 803

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGC 109
            S + S EVW G TY ++A M+ EG  D+ + TA G+Y     DR  GC
Sbjct: 804 TSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEAAWSDRGLGC 852


>gi|225460125|ref|XP_002275782.1| PREDICTED: non-lysosomal glucosylceramidase-like [Vitis vinifera]
          Length = 969

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 42/97 (43%), Positives = 60/97 (61%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + +A G+  + I        AL  +++ NV   + G  GAVNGM P+G  D 
Sbjct: 769 ADQLAGQWYARACGL--QPIVDDEKARSALEKVYNFNVLKVKEGKCGAVNGMLPDGRVDM 826

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           SA+QS E+W GVTY ++A M++EG V+ A+ TA G+Y
Sbjct: 827 SAMQSREIWAGVTYSVAANMIHEGMVETAFNTASGIY 863


>gi|159896834|ref|YP_001543081.1| hypothetical protein Haur_0301 [Herpetosiphon aurantiacus DSM 785]
 gi|159889873|gb|ABX02953.1| protein of unknown function DUF608 [Herpetosiphon aurantiacus DSM
           785]
          Length = 774

 Score = 81.3 bits (199), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 45/98 (45%), Positives = 59/98 (60%), Gaps = 2/98 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQL G  +  A G+   A+ P   I  AL T++  NV  +  G++GAVNGM P+G  D
Sbjct: 631 MADQLVGQWYAGAIGL--PAVAPREMIRSALQTVYRFNVMQYANGALGAVNGMHPDGTVD 688

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            S+ Q+ EVW+G TY ++A ML EG   E W TA G Y
Sbjct: 689 TSSNQASEVWSGTTYAIAAMMLQEGLDLEGWQTAWGAY 726


>gi|353232147|emb|CCD79502.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
          Length = 900

 Score = 80.9 bits (198), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 38/90 (42%), Positives = 54/90 (60%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQL GY F + +GV    I P  ++ + L TI + N  G + G++GA+NG  P    D
Sbjct: 741 MADQLSGYWFSRITGVPPNLILPKNHVVKTLQTISNCNWHGIKNGTIGAINGCLPVCKPD 800

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
            S++Q+EE W  V Y LS+ M+ EG +DE 
Sbjct: 801 LSSVQAEEFWVAVNYSLSSLMIAEGMIDEG 830


>gi|256082537|ref|XP_002577511.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
          Length = 892

 Score = 80.9 bits (198), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 38/90 (42%), Positives = 54/90 (60%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQL GY F + +GV    I P  ++ + L TI + N  G + G++GA+NG  P    D
Sbjct: 733 MADQLSGYWFSRITGVPPNLILPKNHVVKTLQTISNCNWHGIKNGTIGAINGCLPVCKPD 792

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
            S++Q+EE W  V Y LS+ M+ EG +DE 
Sbjct: 793 LSSVQAEEFWVAVNYSLSSLMIAEGMIDEG 822


>gi|357146604|ref|XP_003574050.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
           distachyon]
          Length = 962

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 43/97 (44%), Positives = 56/97 (57%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+    IF    I  AL  IF  NV   + G MGAVNGM P G  D 
Sbjct: 773 ADQLAGQWYAASSGL--PPIFDEHKIRSALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDE 830

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+WTGVTY ++A ML  G   + + TA G++
Sbjct: 831 TCMQSREIWTGVTYGVAANMLLHGMEHQGFITAEGIF 867


>gi|20453181|gb|AAM19831.1| AT4g10060/T5L19_190 [Arabidopsis thaliana]
 gi|24111431|gb|AAN46866.1| At4g10060/T5L19_190 [Arabidopsis thaliana]
          Length = 922

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 44/116 (37%), Positives = 65/116 (56%), Gaps = 3/116 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           +ADQL G  + +A G+  + I     I +AL TI+  NV   + G+ GAVNGM   G  D
Sbjct: 719 LADQLAGQWYARACGL--KPITKEEWIKKALETIYEFNVMKVKGGTRGAVNGMSTEGKVD 776

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCE-TAPEG 115
            +++ S+EVW G TY ++A M+ EG  ++ + TA G+Y  +  DR   C    PE 
Sbjct: 777 TNSLVSKEVWAGTTYSVAACMIQEGQREKGFQTASGIYEAIWSDRGLSCSFQTPEA 832


>gi|240255774|ref|NP_192744.6| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
 gi|332657434|gb|AEE82834.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
          Length = 922

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 44/116 (37%), Positives = 65/116 (56%), Gaps = 3/116 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           +ADQL G  + +A G+  + I     I +AL TI+  NV   + G+ GAVNGM   G  D
Sbjct: 719 LADQLAGQWYARACGL--KPITKEEWIKKALETIYEFNVMKVKGGTRGAVNGMSTEGKVD 776

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCE-TAPEG 115
            +++ S+EVW G TY ++A M+ EG  ++ + TA G+Y  +  DR   C    PE 
Sbjct: 777 TNSLVSKEVWAGTTYSVAACMIQEGQREKGFQTASGIYEAIWSDRGLSCSFQTPEA 832


>gi|326516802|dbj|BAJ96393.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 970

 Score = 80.1 bits (196), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 43/97 (44%), Positives = 56/97 (57%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+    IF    I  AL  IF  NV   + G MGAVNGM P G  D 
Sbjct: 781 ADQLAGQWYAASSGL--PPIFDEHKIRSALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDE 838

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+WTGVTY ++A ML  G   + + TA G++
Sbjct: 839 TCMQSREIWTGVTYGVAANMLLHGMEHQGFITAEGIF 875


>gi|356520149|ref|XP_003528727.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal
           glucosylceramidase-like [Glycine max]
          Length = 944

 Score = 80.1 bits (196), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 42/97 (43%), Positives = 57/97 (58%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + +A G+    I        AL  +++ NV     G  GAVNGM P+G  D 
Sbjct: 748 ADQLAGQWYARACGL--LPIVEEKKSRSALQIVYNHNVMKVHXGRRGAVNGMLPDGKVDM 805

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           S +QS E+W+GVTY L+A M+ E  +D A+ TAGG+Y
Sbjct: 806 STMQSREIWSGVTYALAATMIQENMIDMAFQTAGGVY 842


>gi|188570621|gb|ACD64346.1| hypothetical protein [Helianthus petiolaris]
 gi|188570623|gb|ACD64347.1| hypothetical protein [Helianthus petiolaris]
          Length = 210

 Score = 80.1 bits (196), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 43/97 (44%), Positives = 58/97 (59%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  ++G+    +F    I  +L  IF  NV     G MGAVNGM PNG  D 
Sbjct: 70  ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS EVWTGVTY ++A M++    D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAEMEDNAFTTAEGIF 164


>gi|4539009|emb|CAB39630.1| putative protein [Arabidopsis thaliana]
 gi|7267702|emb|CAB78129.1| putative protein [Arabidopsis thaliana]
          Length = 750

 Score = 80.1 bits (196), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 44/116 (37%), Positives = 65/116 (56%), Gaps = 3/116 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           +ADQL G  + +A G+  + I     I +AL TI+  NV   + G+ GAVNGM   G  D
Sbjct: 547 LADQLAGQWYARACGL--KPITKEEWIKKALETIYEFNVMKVKGGTRGAVNGMSTEGKVD 604

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCE-TAPEG 115
            +++ S+EVW G TY ++A M+ EG  ++ + TA G+Y  +  DR   C    PE 
Sbjct: 605 TNSLVSKEVWAGTTYSVAACMIQEGQREKGFQTASGIYEAIWSDRGLSCSFQTPEA 660


>gi|324503709|gb|ADY41606.1| Non-lysosomal glucosylceramidase [Ascaris suum]
          Length = 261

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 48/117 (41%), Positives = 62/117 (52%), Gaps = 2/117 (1%)

Query: 1   MADQLCGYLFLKAS-GVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR 59
           MADQLCG+ F+  + G   + I     I  +L TIF  NV  F  G +G VN M P+G  
Sbjct: 114 MADQLCGFWFMCITDGKVDDVIITRQQICASLKTIFEYNVEKFANGQLGPVNAMMPSGVV 173

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEGV 116
           D + IQSEEVW GV Y L++  L     + A+ TA G YR   E R      +PE +
Sbjct: 174 DSTGIQSEEVWGGVAYALASFHLLVEENESAFKTAEGWYRSCWE-RYGLQYQSPEAI 229


>gi|242080271|ref|XP_002444904.1| hypothetical protein SORBIDRAFT_07g001140 [Sorghum bicolor]
 gi|241941254|gb|EES14399.1| hypothetical protein SORBIDRAFT_07g001140 [Sorghum bicolor]
          Length = 929

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 43/98 (43%), Positives = 63/98 (64%), Gaps = 2/98 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQ+ G  + +A G+  E +        ALAT+   NV   + G++GAVNGM+P+G  D
Sbjct: 725 MADQMAGQWYARACGM--EPVVEEGKARSALATVLDYNVMRVKGGAVGAVNGMRPDGAVD 782

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            S+ QS+EVW GVTY ++AAM++EG  + A+ TA G +
Sbjct: 783 ASSGQSKEVWPGVTYAVAAAMVHEGMHEAAFRTAKGAH 820


>gi|324504438|gb|ADY41918.1| Non-lysosomal glucosylceramidase [Ascaris suum]
          Length = 534

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 45/104 (43%), Positives = 57/104 (54%), Gaps = 1/104 (0%)

Query: 1   MADQLCGYLFLKAS-GVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR 59
           MADQLCG+ F+  + G   + I     I  +L TIF  NV  F  G +G VN M P+G  
Sbjct: 387 MADQLCGFWFMCITDGKVDDVIITRQQICASLKTIFEYNVEKFANGQLGPVNAMMPSGVV 446

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
           D + IQSEEVW GV Y L++  L     + A+ TA G YR   E
Sbjct: 447 DSTGIQSEEVWGGVAYALASFHLLVEENESAFKTAEGWYRSCWE 490


>gi|268555576|ref|XP_002635777.1| Hypothetical protein CBG10433 [Caenorhabditis briggsae]
          Length = 821

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 44/101 (43%), Positives = 59/101 (58%), Gaps = 2/101 (1%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQLCG+  L A  ++         I  AL TIF  NV  ++ G  GAVNG   +   D
Sbjct: 678 MADQLCGFWALTA--MDEPIQVSEGKIQSALETIFKYNVEMYDGGKCGAVNGFLASERVD 735

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRL 101
            S+IQSEEVW G+TY LSA M+ +G  + A+ T+ GL+  +
Sbjct: 736 GSSIQSEEVWAGITYSLSAMMIEKGMDEMAFKTSEGLFHSI 776


>gi|9665098|gb|AAF97289.1|AC010164_11 Hypothetical protein [Arabidopsis thaliana]
          Length = 790

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 44/109 (40%), Positives = 62/109 (56%), Gaps = 2/109 (1%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           +ADQ+ G  + +A G+  + I     I +AL T++  NV     G+ GAVNGM P+G  D
Sbjct: 589 LADQMAGQWYARACGL--KPIAKEEWIKKALETVYDFNVMRVRDGTRGAVNGMLPDGRVD 646

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGC 109
            S + S EVW G TY ++A M+ EG  D+ + TA G+Y     DR  GC
Sbjct: 647 TSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEAAWSDRGLGC 695


>gi|297831192|ref|XP_002883478.1| hypothetical protein ARALYDRAFT_479911 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329318|gb|EFH59737.1| hypothetical protein ARALYDRAFT_479911 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 950

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 40/96 (41%), Positives = 57/96 (59%), Gaps = 2/96 (2%)

Query: 3   DQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS 62
           DQL G  +  +SG+    IF  + I   +  IF  NV   + G MGAVNGM P+G  D +
Sbjct: 764 DQLAGQWYAASSGL--PPIFEESKIRSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDT 821

Query: 63  AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            +QS E+WTGVTY  +A M+  G  ++ +TTA G++
Sbjct: 822 CMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIF 857


>gi|302787158|ref|XP_002975349.1| hypothetical protein SELMODRAFT_174907 [Selaginella moellendorffii]
 gi|300156923|gb|EFJ23550.1| hypothetical protein SELMODRAFT_174907 [Selaginella moellendorffii]
          Length = 916

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 42/97 (43%), Positives = 62/97 (63%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           +DQL G  +  ASG+    I        AL T+F+TNV  ++ G+ GAVNGM P+G  DR
Sbjct: 712 SDQLAGQWYSLASGL--PGIVREDQATSALGTVFATNVMRYKGGNEGAVNGMLPSGGEDR 769

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           +++QS EVW G TY ++AAM+ +G  +E + TA G++
Sbjct: 770 TSLQSREVWVGTTYAVAAAMIQQGMREEGFRTAKGVF 806


>gi|302762052|ref|XP_002964448.1| hypothetical protein SELMODRAFT_166634 [Selaginella moellendorffii]
 gi|300168177|gb|EFJ34781.1| hypothetical protein SELMODRAFT_166634 [Selaginella moellendorffii]
          Length = 916

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 42/97 (43%), Positives = 62/97 (63%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           +DQL G  +  ASG+    I        AL T+F+TNV  ++ G+ GAVNGM P+G  DR
Sbjct: 712 SDQLAGQWYSLASGL--PGIVREDQATSALGTVFATNVMRYKGGNEGAVNGMLPSGGEDR 769

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           +++QS EVW G TY ++AAM+ +G  +E + TA G++
Sbjct: 770 TSLQSREVWAGTTYAVAAAMIQQGMREEGFRTAKGVF 806


>gi|47196687|emb|CAF89164.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 239

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 38/80 (47%), Positives = 48/80 (60%), Gaps = 2/80 (2%)

Query: 5   LCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAI 64
            CG+   +     T   FP   I  AL +IF  NV  F  G MGAVNGM+P G  DRS++
Sbjct: 161 FCGHPVWETE--TTRQAFPKEKIQTALKSIFDLNVMSFAGGQMGAVNGMRPEGVPDRSSV 218

Query: 65  QSEEVWTGVTYLLSAAMLYE 84
           QS+EVW GV Y L+A M++E
Sbjct: 219 QSDEVWIGVVYGLAATMIHE 238


>gi|308471410|ref|XP_003097936.1| hypothetical protein CRE_12977 [Caenorhabditis remanei]
 gi|308239241|gb|EFO83193.1| hypothetical protein CRE_12977 [Caenorhabditis remanei]
          Length = 845

 Score = 78.6 bits (192), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 43/98 (43%), Positives = 59/98 (60%), Gaps = 2/98 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQLCG+  L A  ++         +  AL TIF  NV+ ++ G  GAVNG   +   D
Sbjct: 703 MADQLCGFWALTA--MDEPVQVSEEKMKSALETIFKYNVQMYDGGKCGAVNGYLTSERVD 760

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            S+IQSEEVW G+TY LSA M+ +G  + A+ T+ GL+
Sbjct: 761 GSSIQSEEVWAGITYALSAMMIEKGMDEMAFKTSEGLF 798


>gi|22331303|ref|NP_189060.2| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
 gi|79313355|ref|NP_001030757.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
 gi|17529232|gb|AAL38843.1| unknown protein [Arabidopsis thaliana]
 gi|110742225|dbj|BAE99039.1| hypothetical protein [Arabidopsis thaliana]
 gi|332643347|gb|AEE76868.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
 gi|332643348|gb|AEE76869.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
          Length = 950

 Score = 78.2 bits (191), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 39/96 (40%), Positives = 57/96 (59%), Gaps = 2/96 (2%)

Query: 3   DQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS 62
           DQL G  +  +SG+    +F  + I   +  IF  NV   + G MGAVNGM P+G  D +
Sbjct: 764 DQLAGQWYAASSGL--PPLFEESKIRSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDT 821

Query: 63  AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            +QS E+WTGVTY  +A M+  G  ++ +TTA G++
Sbjct: 822 CMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIF 857


>gi|11994237|dbj|BAB01359.1| unnamed protein product [Arabidopsis thaliana]
          Length = 937

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 39/96 (40%), Positives = 57/96 (59%), Gaps = 2/96 (2%)

Query: 3   DQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS 62
           DQL G  +  +SG+    +F  + I   +  IF  NV   + G MGAVNGM P+G  D +
Sbjct: 751 DQLAGQWYAASSGL--PPLFEESKIRSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDT 808

Query: 63  AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            +QS E+WTGVTY  +A M+  G  ++ +TTA G++
Sbjct: 809 CMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIF 844


>gi|359486233|ref|XP_002264575.2| PREDICTED: non-lysosomal glucosylceramidase [Vitis vinifera]
          Length = 960

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 41/97 (42%), Positives = 58/97 (59%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + +A G++   I        AL  ++  NV     G  GAVNGM P+G  D 
Sbjct: 765 ADQLAGQWYARACGLS--PIVDEDKAKSALEKVYHYNVLKVGGGKRGAVNGMLPDGKVDT 822

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+W+GVTY ++A M++EG VD A+ TA G+Y
Sbjct: 823 TTMQSREIWSGVTYGVAATMIHEGLVDMAFQTASGVY 859


>gi|297739499|emb|CBI29681.3| unnamed protein product [Vitis vinifera]
          Length = 949

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 41/97 (42%), Positives = 58/97 (59%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + +A G++   I        AL  ++  NV     G  GAVNGM P+G  D 
Sbjct: 754 ADQLAGQWYARACGLS--PIVDEDKAKSALEKVYHYNVLKVGGGKRGAVNGMLPDGKVDT 811

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           + +QS E+W+GVTY ++A M++EG VD A+ TA G+Y
Sbjct: 812 TTMQSREIWSGVTYGVAATMIHEGLVDMAFQTASGVY 848


>gi|449463809|ref|XP_004149624.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
 gi|449494604|ref|XP_004159595.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
          Length = 951

 Score = 77.8 bits (190), Expect = 9e-13,   Method: Composition-based stats.
 Identities = 41/97 (42%), Positives = 61/97 (62%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + +ASG+    I        ALA +++ NV   + G  GAVNGM P+G  D 
Sbjct: 758 ADQLAGQWYARASGL--LPIVDEDKAKSALAKVYNYNVMKVKDGKRGAVNGMLPDGTIDF 815

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           S++QS E+W+GVTY ++A+M++E   D A+ TA G++
Sbjct: 816 SSMQSREIWSGVTYAVAASMIHEDMTDMAFRTAEGIH 852


>gi|357144426|ref|XP_003573288.1| PREDICTED: non-lysosomal glucosylceramidase-like isoform 1
           [Brachypodium distachyon]
          Length = 932

 Score = 77.4 bits (189), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 41/98 (41%), Positives = 61/98 (62%), Gaps = 2/98 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQL G  +    G+  E +        ALAT+   NV   + G++GAVNGM+P+G  D
Sbjct: 733 MADQLAGQWYAHVCGL--EPVVEEDKARSALATVLDYNVMRVKGGAVGAVNGMRPDGGID 790

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            S+ QS+E+W+G TY ++AAM++EG  + A+ TA G +
Sbjct: 791 MSSTQSKEIWSGTTYAVAAAMVHEGMPEGAFRTAKGAH 828


>gi|357144429|ref|XP_003573289.1| PREDICTED: non-lysosomal glucosylceramidase-like isoform 2
           [Brachypodium distachyon]
          Length = 919

 Score = 77.4 bits (189), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 41/98 (41%), Positives = 61/98 (62%), Gaps = 2/98 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQL G  +    G+  E +        ALAT+   NV   + G++GAVNGM+P+G  D
Sbjct: 720 MADQLAGQWYAHVCGL--EPVVEEDKARSALATVLDYNVMRVKGGAVGAVNGMRPDGGID 777

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
            S+ QS+E+W+G TY ++AAM++EG  + A+ TA G +
Sbjct: 778 MSSTQSKEIWSGTTYAVAAAMVHEGMPEGAFRTAKGAH 815


>gi|392918721|ref|NP_504047.2| Protein R08F11.1 [Caenorhabditis elegans]
 gi|387910741|emb|CCD72306.2| Protein R08F11.1 [Caenorhabditis elegans]
          Length = 830

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 42/101 (41%), Positives = 60/101 (59%), Gaps = 2/101 (1%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQLCG+  + A  ++         +  AL TIF  NV+ +  G  GAVNG   +   D
Sbjct: 688 MADQLCGFWAMTA--MDEPVQISKDKMKSALDTIFKYNVQMYNNGRCGAVNGYLTSERVD 745

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRL 101
            S+IQSEEVW G+TY LSA M+ +G  ++A+ T+ GL+  +
Sbjct: 746 GSSIQSEEVWAGITYALSAMMIEKGMDEQAFKTSEGLFESI 786


>gi|341888271|gb|EGT44206.1| hypothetical protein CAEBREN_07747 [Caenorhabditis brenneri]
          Length = 820

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 46/102 (45%), Positives = 61/102 (59%), Gaps = 4/102 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQLCG+  L A  ++         +  AL TIF  NV+ +  G  GAVNG   +   D
Sbjct: 678 MADQLCGFWALTA--MDEPVQVSEEKMKLALETIFKYNVQMYNDGKCGAVNGFLTSERVD 735

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDE-AWTTAGGLYRRL 101
            S+IQSEEVW G+TY LS AM+ E N+DE A+ T+ GL+  +
Sbjct: 736 GSSIQSEEVWAGITYALS-AMMIEKNMDEMAFKTSEGLFESI 776


>gi|341899054|gb|EGT54989.1| hypothetical protein CAEBREN_31176 [Caenorhabditis brenneri]
          Length = 831

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 46/102 (45%), Positives = 61/102 (59%), Gaps = 4/102 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQLCG+  L A  ++         +  AL TIF  NV+ +  G  GAVNG   +   D
Sbjct: 689 MADQLCGFWALTA--MDEPVQVSEEKMKLALETIFKYNVQMYNDGKCGAVNGFLTSERVD 746

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDE-AWTTAGGLYRRL 101
            S+IQSEEVW G+TY LS AM+ E N+DE A+ T+ GL+  +
Sbjct: 747 GSSIQSEEVWAGITYALS-AMMIEKNMDEMAFKTSEGLFESI 787


>gi|255575898|ref|XP_002528846.1| conserved hypothetical protein [Ricinus communis]
 gi|223531697|gb|EEF33520.1| conserved hypothetical protein [Ricinus communis]
          Length = 952

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 44/107 (41%), Positives = 64/107 (59%), Gaps = 8/107 (7%)

Query: 2   ADQLCGYLFLKASGVNTEAIFP--SANIAR-ALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
           ADQL G  + +A G     +FP    + AR AL  +++ NV   + G  GA+NGM P+G 
Sbjct: 756 ADQLAGQWYARACG-----LFPIVDKDKARSALEKVYNYNVLKVKDGKRGAINGMLPDGK 810

Query: 59  RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDR 105
            D S++QS E+W+GVTY L+A M+ E  +D A+ TA G+Y     +R
Sbjct: 811 VDLSSMQSREIWSGVTYALAATMIQEDMLDMAFHTASGIYEAAWSER 857


>gi|449482327|ref|XP_004156248.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Cucumis
           sativus]
          Length = 508

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 41/98 (41%), Positives = 60/98 (61%), Gaps = 2/98 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + +A G+    I     I  AL  I++ NV   + G+ GAVNGM P+G  D+
Sbjct: 309 ADQLAGQWYARACGLC--PIADEEKIRVALEKIYNFNVMKVKGGTRGAVNGMFPDGSVDK 366

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYR 99
           S +Q +E+W GVTY ++A M+ EG V+  + TA G+Y+
Sbjct: 367 SILQPKEIWAGVTYSVAATMIQEGMVETGFQTAMGIYQ 404


>gi|449447581|ref|XP_004141546.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
          Length = 944

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 41/98 (41%), Positives = 60/98 (61%), Gaps = 2/98 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + +A G+    I     I  AL  I++ NV   + G+ GAVNGM P+G  D+
Sbjct: 745 ADQLAGQWYARACGLC--PIADEEKIRVALEKIYNFNVMKVKGGTRGAVNGMFPDGSVDK 802

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYR 99
           S +Q +E+W GVTY ++A M+ EG V+  + TA G+Y+
Sbjct: 803 SILQPKEIWAGVTYSVAATMIQEGMVETGFQTAMGIYQ 840


>gi|288901049|gb|ADC68237.1| hypothetical protein [Dendrobium nobile]
          Length = 207

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 41/89 (46%), Positives = 51/89 (57%), Gaps = 2/89 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  +  +SG+    +F    I   L  IF  NV     G +GAVNGM PNG  D 
Sbjct: 120 ADQLAGQWYAASSGL--PPLFDENKIRSTLQKIFDFNVMKVRGGRIGAVNGMHPNGKVDE 177

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
           S +QS E+WTGVTY LSA M+  G  ++A
Sbjct: 178 SCMQSREIWTGVTYALSATMILAGMEEQA 206


>gi|224120058|ref|XP_002331126.1| predicted protein [Populus trichocarpa]
 gi|222872854|gb|EEF09985.1| predicted protein [Populus trichocarpa]
          Length = 948

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 46/115 (40%), Positives = 65/115 (56%), Gaps = 3/115 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + +A G++   I        AL  I++ NV   + G  GAVNGM P+G  D 
Sbjct: 752 ADQLAGQWYARACGLS--PIVDEDKARCALEKIYNYNVLKVQDGKRGAVNGMLPDGTVDL 809

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCE-TAPEG 115
           S +QS E+W+GVTY ++A M+ EG  D A+ TA G+Y  +  ++  G     PEG
Sbjct: 810 SCLQSREIWSGVTYAVAATMIQEGLTDMAFHTASGVYEAVWAEQGLGYSFQTPEG 864


>gi|224146269|ref|XP_002325943.1| predicted protein [Populus trichocarpa]
 gi|222862818|gb|EEF00325.1| predicted protein [Populus trichocarpa]
          Length = 891

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 41/97 (42%), Positives = 61/97 (62%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + +A G++   +       +AL  I+  NV   + G+ GAVNGM P+G  D 
Sbjct: 695 ADQLAGQWYARACGLS--PVVDGDKARKALEKIYEFNVLKVKGGTRGAVNGMLPDGRVDL 752

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           +A+Q++E+W GVTY LSA+M+ EG  + A+ TA G+Y
Sbjct: 753 TAMQTKEIWPGVTYALSASMIQEGLEEMAFQTAVGIY 789


>gi|224129460|ref|XP_002328722.1| predicted protein [Populus trichocarpa]
 gi|222839020|gb|EEE77371.1| predicted protein [Populus trichocarpa]
          Length = 922

 Score = 74.3 bits (181), Expect = 9e-12,   Method: Composition-based stats.
 Identities = 41/97 (42%), Positives = 56/97 (57%), Gaps = 2/97 (2%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + +A G+    I        AL  I++ N      G  GAVNGM P+G  D 
Sbjct: 726 ADQLAGQWYARACGL--LPIVDEDKARSALEKIYNYNFLKVHDGKRGAVNGMLPDGTVDM 783

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
           S +QS E+W+GVTY ++A M+ EG +D A+ TA G+Y
Sbjct: 784 SDMQSREIWSGVTYAVAATMMQEGLMDMAFHTASGVY 820


>gi|147812668|emb|CAN61857.1| hypothetical protein VITISV_016690 [Vitis vinifera]
          Length = 521

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 34/69 (49%), Positives = 48/69 (69%)

Query: 30  ALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 89
           AL  +++ NV   + G  GAVNGM P+G  D SA+QS E+W GVTY ++A M++EG V+ 
Sbjct: 347 ALEKVYNFNVLKVKEGKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVET 406

Query: 90  AWTTAGGLY 98
           A+ TA G+Y
Sbjct: 407 AFNTASGIY 415


>gi|281212396|gb|EFA86556.1| hypothetical protein PPL_00357 [Polysphondylium pallidum PN500]
          Length = 4775

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 39/97 (40%), Positives = 56/97 (57%), Gaps = 2/97 (2%)

Query: 1    MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
            M+DQL G+ +L A G+ +      A    +L+ I   NV+ +  GS GAVNGM+P G  D
Sbjct: 1024 MSDQLAGHWYLLACGLPSYITLDQA--LSSLSIINEYNVKSYSNGSCGAVNGMRPEGGPD 1081

Query: 61   RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 97
             +++QS EVW G +Y L++ ML     +EAW    GL
Sbjct: 1082 TTSLQSCEVWIGTSYGLASTMLLHFMDNEAWELIKGL 1118


>gi|156333782|ref|XP_001619412.1| hypothetical protein NEMVEDRAFT_v1g224207 [Nematostella vectensis]
 gi|156202556|gb|EDO27312.1| predicted protein [Nematostella vectensis]
          Length = 173

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 36/74 (48%), Positives = 52/74 (70%), Gaps = 1/74 (1%)

Query: 43  EAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLR 102
           + G+MGAVNG++P+G  D S++Q+EEVWTGVTY ++A+M+ EG VDE + TA G+Y    
Sbjct: 50  KEGTMGAVNGIRPDGQLDTSSLQAEEVWTGVTYAVAASMIQEGLVDEGFKTASGIYNTCF 109

Query: 103 EDRVSGCETAPEGV 116
           E R+      PE +
Sbjct: 110 E-RLGMNFQTPEAI 122


>gi|22297737|ref|NP_680984.1| hypothetical protein tlr0193 [Thermosynechococcus elongatus BP-1]
 gi|22293914|dbj|BAC07746.1| tlr0193 [Thermosynechococcus elongatus BP-1]
          Length = 806

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 40/105 (38%), Positives = 55/105 (52%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  + +  G+    I P     RAL  I+ T    F  G  GA NG+ PNG   
Sbjct: 664 MADQLCGQFYAQLLGLVD--IVPPDCCDRALRKIYDTCFLKFHNGQFGAANGLLPNGQPE 721

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVWTG+ + L+A +   G +DEAW  A  + R++ E+
Sbjct: 722 NPHATHPLEVWTGINFGLAAFLWQRGMIDEAWRLAEVVVRQIYEN 766


>gi|295829670|gb|ADG38504.1| AT3G24180-like protein [Capsella grandiflora]
 gi|295829672|gb|ADG38505.1| AT3G24180-like protein [Capsella grandiflora]
 gi|295829674|gb|ADG38506.1| AT3G24180-like protein [Capsella grandiflora]
 gi|295829676|gb|ADG38507.1| AT3G24180-like protein [Capsella grandiflora]
          Length = 190

 Score = 70.1 bits (170), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 35/80 (43%), Positives = 47/80 (58%), Gaps = 2/80 (2%)

Query: 3   DQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS 62
           DQL G  +  +SG+    IF  + I   +  IF  NV   + G MGAVNGM P+G  D +
Sbjct: 113 DQLAGQWYAASSGL--PPIFEESKIKSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDT 170

Query: 63  AIQSEEVWTGVTYLLSAAML 82
            +QS E+WTGVTY  +A M+
Sbjct: 171 CMQSREIWTGVTYAAAATMI 190


>gi|345291087|gb|AEN82035.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291089|gb|AEN82036.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291091|gb|AEN82037.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291093|gb|AEN82038.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291095|gb|AEN82039.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291097|gb|AEN82040.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291099|gb|AEN82041.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291101|gb|AEN82042.1| AT3G24180-like protein, partial [Capsella rubella]
          Length = 191

 Score = 70.1 bits (170), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 35/80 (43%), Positives = 47/80 (58%), Gaps = 2/80 (2%)

Query: 3   DQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS 62
           DQL G  +  +SG+    IF  + I   +  IF  NV   + G MGAVNGM P+G  D +
Sbjct: 113 DQLAGQWYAASSGL--PPIFEESKIKSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDT 170

Query: 63  AIQSEEVWTGVTYLLSAAML 82
            +QS E+WTGVTY  +A M+
Sbjct: 171 CMQSREIWTGVTYAAAATMI 190


>gi|295829678|gb|ADG38508.1| AT3G24180-like protein [Neslia paniculata]
          Length = 190

 Score = 69.7 bits (169), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 35/80 (43%), Positives = 47/80 (58%), Gaps = 2/80 (2%)

Query: 3   DQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS 62
           DQL G  +  +SG+    IF  + I   +  IF  NV   + G MGAVNGM P+G  D +
Sbjct: 113 DQLAGQWYAASSGL--PPIFEESKIRSTMQKIFDFNVMKTKGGRMGAVNGMHPDGKVDET 170

Query: 63  AIQSEEVWTGVTYLLSAAML 82
            +QS E+WTGVTY  +A M+
Sbjct: 171 CMQSREIWTGVTYAAAATMI 190


>gi|66801399|ref|XP_629625.1| hypothetical protein DDB_G0292446 [Dictyostelium discoideum AX4]
 gi|60463006|gb|EAL61202.1| hypothetical protein DDB_G0292446 [Dictyostelium discoideum AX4]
          Length = 1302

 Score = 68.9 bits (167), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 34/97 (35%), Positives = 53/97 (54%), Gaps = 2/97 (2%)

Query: 1    MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
            M+D L G+ +L + G+ +   F  A    +L+ I   NV+ +  G  GAVNGM+P G  D
Sbjct: 1117 MSDMLAGHWYLLSCGLKSYMTFDQA--LSSLSIINEYNVKSYSKGVCGAVNGMRPEGTVD 1174

Query: 61   RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 97
             + +QS EVW G ++ L+A M+      +AW    G+
Sbjct: 1175 TTCLQSSEVWIGTSFSLAATMILHHMDSDAWDLVKGI 1211


>gi|226444526|gb|ACO57837.1| unknown [Helianthus annuus]
 gi|226444528|gb|ACO57838.1| unknown [Helianthus annuus]
 gi|226444560|gb|ACO57854.1| unknown [Helianthus petiolaris]
 gi|226444568|gb|ACO57858.1| unknown [Helianthus petiolaris]
          Length = 103

 Score = 67.0 bits (162), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 34/61 (55%), Positives = 44/61 (72%), Gaps = 3/61 (4%)

Query: 38 NVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 97
           VRG   G MGAVNGM PNG  D + +QS EVWTGVTY ++A M++ G  D+A+TTA G+
Sbjct: 1  KVRG---GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDKAFTTAEGI 57

Query: 98 Y 98
          +
Sbjct: 58 F 58


>gi|226444574|gb|ACO57861.1| unknown [Helianthus argophyllus]
          Length = 103

 Score = 66.2 bits (160), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 34/61 (55%), Positives = 43/61 (70%), Gaps = 3/61 (4%)

Query: 38 NVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 97
           VRG   G MGAVNGM PNG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G+
Sbjct: 1  KVRG---GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGI 57

Query: 98 Y 98
          +
Sbjct: 58 F 58


>gi|226444524|gb|ACO57836.1| unknown [Helianthus annuus]
          Length = 103

 Score = 66.2 bits (160), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 34/61 (55%), Positives = 43/61 (70%), Gaps = 3/61 (4%)

Query: 38 NVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 97
           VRG   G MGAVNGM PNG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G+
Sbjct: 1  KVRG---GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGI 57

Query: 98 Y 98
          +
Sbjct: 58 F 58


>gi|226444514|gb|ACO57831.1| unknown [Helianthus annuus]
 gi|226444516|gb|ACO57832.1| unknown [Helianthus annuus]
 gi|226444518|gb|ACO57833.1| unknown [Helianthus annuus]
 gi|226444520|gb|ACO57834.1| unknown [Helianthus annuus]
 gi|226444522|gb|ACO57835.1| unknown [Helianthus annuus]
 gi|226444530|gb|ACO57839.1| unknown [Helianthus annuus]
 gi|226444532|gb|ACO57840.1| unknown [Helianthus annuus]
 gi|226444534|gb|ACO57841.1| unknown [Helianthus annuus]
 gi|226444536|gb|ACO57842.1| unknown [Helianthus annuus]
 gi|226444538|gb|ACO57843.1| unknown [Helianthus annuus]
 gi|226444540|gb|ACO57844.1| unknown [Helianthus annuus]
 gi|226444542|gb|ACO57845.1| unknown [Helianthus annuus]
 gi|226444544|gb|ACO57846.1| unknown [Helianthus annuus]
 gi|226444546|gb|ACO57847.1| unknown [Helianthus petiolaris]
 gi|226444548|gb|ACO57848.1| unknown [Helianthus petiolaris]
 gi|226444550|gb|ACO57849.1| unknown [Helianthus petiolaris]
 gi|226444552|gb|ACO57850.1| unknown [Helianthus petiolaris]
 gi|226444554|gb|ACO57851.1| unknown [Helianthus petiolaris]
 gi|226444556|gb|ACO57852.1| unknown [Helianthus petiolaris]
 gi|226444558|gb|ACO57853.1| unknown [Helianthus petiolaris]
 gi|226444562|gb|ACO57855.1| unknown [Helianthus petiolaris]
 gi|226444564|gb|ACO57856.1| unknown [Helianthus petiolaris]
 gi|226444566|gb|ACO57857.1| unknown [Helianthus petiolaris]
 gi|226444570|gb|ACO57859.1| unknown [Helianthus argophyllus]
 gi|226444572|gb|ACO57860.1| unknown [Helianthus argophyllus]
 gi|226444576|gb|ACO57862.1| unknown [Helianthus argophyllus]
 gi|226444578|gb|ACO57863.1| unknown [Helianthus argophyllus]
 gi|226444580|gb|ACO57864.1| unknown [Helianthus argophyllus]
 gi|226444582|gb|ACO57865.1| unknown [Helianthus argophyllus]
 gi|226444584|gb|ACO57866.1| unknown [Helianthus argophyllus]
 gi|226444586|gb|ACO57867.1| unknown [Helianthus argophyllus]
 gi|226444588|gb|ACO57868.1| unknown [Helianthus argophyllus]
 gi|226444590|gb|ACO57869.1| unknown [Helianthus argophyllus]
 gi|226444592|gb|ACO57870.1| unknown [Helianthus argophyllus]
          Length = 103

 Score = 66.2 bits (160), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 34/61 (55%), Positives = 43/61 (70%), Gaps = 3/61 (4%)

Query: 38 NVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 97
           VRG   G MGAVNGM PNG  D + +QS EVWTGVTY ++A M++ G  D A+TTA G+
Sbjct: 1  KVRG---GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGI 57

Query: 98 Y 98
          +
Sbjct: 58 F 58


>gi|328869576|gb|EGG17953.1| hypothetical protein DFA_06619 [Dictyostelium fasciculatum]
          Length = 1200

 Score = 65.5 bits (158), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 35/97 (36%), Positives = 53/97 (54%), Gaps = 2/97 (2%)

Query: 1    MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
            MADQL G+ +L + G+++      A     L+ I   N++ +  GS GAVNG+ P    D
Sbjct: 1048 MADQLAGHWYLTSCGLSSYMTLDQA--LSTLSIINEYNIKSYSNGSCGAVNGISPLAIVD 1105

Query: 61   RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 97
            ++ +QS EVW G +Y L++  L      EAW+   GL
Sbjct: 1106 QTCLQSSEVWIGTSYSLASTFLLHYMDKEAWSLIKGL 1142


>gi|94968715|ref|YP_590763.1| hypothetical protein Acid345_1688 [Candidatus Koribacter versatilis
           Ellin345]
 gi|94550765|gb|ABF40689.1| protein of unknown function DUF608 [Candidatus Koribacter
           versatilis Ellin345]
          Length = 811

 Score = 65.1 bits (157), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 60/105 (57%), Gaps = 2/105 (1%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MA+QL G  +   +G+    + P      AL  ++  NV   + G+MGA+NG+  +G+  
Sbjct: 672 MAEQLAGQWYASLTGLGD--LVPRDMQHSALKKVYDFNVMKLQNGTMGALNGISASGEVL 729

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDR 105
           +   Q+EEVWTGVT+ ++A ML  G  +E + TA G+Y  + + +
Sbjct: 730 KDNEQTEEVWTGVTFAVAATMLQNGLREEGFNTAKGVYNVVYDQK 774


>gi|332711224|ref|ZP_08431157.1| putative bile acid beta-glucosidase [Moorea producens 3L]
 gi|332350038|gb|EGJ29645.1| putative bile acid beta-glucosidase [Moorea producens 3L]
          Length = 815

 Score = 64.7 bits (156), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 36/102 (35%), Positives = 54/102 (52%), Gaps = 3/102 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  + +  G+    I P      AL T++      F  G +GAVNG++P+G   
Sbjct: 674 MADQLCGQFYAQLLGL--PDIVPPECTLSALKTVYDACFLKFHQGQLGAVNGVRPDGTPE 731

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRL 101
           D  A    EVWTG+ + L+A ++ +G  DEA      + R++
Sbjct: 732 DPDATHPMEVWTGINFGLAAFLIQQGMKDEALRITEAVVRQV 773


>gi|374311163|ref|YP_005057593.1| glucosylceramidase [Granulicella mallensis MP5ACTX8]
 gi|358753173|gb|AEU36563.1| Glucosylceramidase [Granulicella mallensis MP5ACTX8]
          Length = 811

 Score = 64.3 bits (155), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 40/105 (38%), Positives = 53/105 (50%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
            ADQL G  +   +G+    I P      A   IF  NV  F  G MGA NGM  +G   
Sbjct: 672 QADQLAGQWYANLTGLGE--IVPHTMQVSAAKKIFDVNVMKFGHGEMGAANGMTADGAIL 729

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDR 105
            +A +++EVW G T   +  ++ EG  DEAW T  GLY  + ED+
Sbjct: 730 TNA-EAKEVWVGTTLGYAGLLMQEGMNDEAWKTTRGLYHVIYEDK 773


>gi|366162568|ref|ZP_09462323.1| hypothetical protein AcelC_02756 [Acetivibrio cellulolyticus CD2]
          Length = 842

 Score = 62.4 bits (150), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 34/90 (37%), Positives = 51/90 (56%), Gaps = 2/90 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           M+DQLCG+ + KA G+  E I P     ++ + ++  N + F+ G  G VN MK +G+ D
Sbjct: 616 MSDQLCGHWYAKACGL--EGIVPDEYAKKSYSKVYEFNNKKFDNGLHGFVNIMKADGNVD 673

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
            S  Q+ E W G +Y + A M+ EG   EA
Sbjct: 674 MSHAQTAEAWVGTSYGVIAGMIQEGLETEA 703


>gi|330844075|ref|XP_003293963.1| hypothetical protein DICPUDRAFT_158887 [Dictyostelium purpureum]
 gi|325075658|gb|EGC29519.1| hypothetical protein DICPUDRAFT_158887 [Dictyostelium purpureum]
          Length = 1205

 Score = 62.4 bits (150), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 33/97 (34%), Positives = 51/97 (52%), Gaps = 2/97 (2%)

Query: 1    MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
            M+D L G+ +L + G+ +   F  A    +L+ I   NV  +     GAVNGM+P G  D
Sbjct: 1038 MSDMLAGHWYLLSCGLPSYMTFDQA--LSSLSIINEYNVNSYGKERCGAVNGMRPEGVVD 1095

Query: 61   RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 97
             + +QS EVW G ++ L+A M+      +AW    G+
Sbjct: 1096 NTCLQSSEVWIGTSFSLAATMIQHHMDKDAWELVKGI 1132


>gi|434391596|ref|YP_007126543.1| Glucosylceramidase [Gloeocapsa sp. PCC 7428]
 gi|428263437|gb|AFZ29383.1| Glucosylceramidase [Gloeocapsa sp. PCC 7428]
          Length = 809

 Score = 62.4 bits (150), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 34/92 (36%), Positives = 49/92 (53%), Gaps = 3/92 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
           MADQLCG  + +  G+    I P     +AL T++      F  G  GA NG+KP+G   
Sbjct: 671 MADQLCGQFYARLLGL--PDIVPPECATQALKTVYDACFVKFHNGQFGAANGLKPDGLPE 728

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 91
           + +A    EVWTG+ + L+A ++  G   EAW
Sbjct: 729 NPNATHPLEVWTGINFGLAAFLMQMGMKSEAW 760


>gi|402589209|gb|EJW83141.1| hypothetical protein WUBG_05945 [Wuchereria bancrofti]
          Length = 90

 Score = 61.2 bits (147), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 26/63 (41%), Positives = 40/63 (63%)

Query: 19 EAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLS 78
           A+ P ++I  AL  I+S+N+  F  G +GAVNGM+ +G   R  +Q++E+W GVTY L 
Sbjct: 22 RALLPLSHIKSALNQIYSSNILNFAGGRLGAVNGMRKDGTVGRRHLQADEMWVGVTYALQ 81

Query: 79 AAM 81
            +
Sbjct: 82 HHL 84


>gi|257060089|ref|YP_003137977.1| glucosylceramidase [Cyanothece sp. PCC 8802]
 gi|256590255|gb|ACV01142.1| Glucosylceramidase [Cyanothece sp. PCC 8802]
          Length = 805

 Score = 60.8 bits (146), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 34/105 (32%), Positives = 55/105 (52%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  +  A  +N   +  +     AL  ++      F+ G  GA NGMKP+G   
Sbjct: 664 MADQLCGQFY--ARLLNLPDVVETQYTESALNKVYEACFLKFQDGKYGAANGMKPDGTPE 721

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           D ++   +EVWTG+ + L+A +L  G  D A+     + +++ E+
Sbjct: 722 DPNSTHPQEVWTGINFGLAAFLLQMGRKDAAFKLTEAVVKQVYEN 766


>gi|218247014|ref|YP_002372385.1| hypothetical protein PCC8801_2201 [Cyanothece sp. PCC 8801]
 gi|218167492|gb|ACK66229.1| protein of unknown function DUF608 [Cyanothece sp. PCC 8801]
          Length = 805

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 34/105 (32%), Positives = 55/105 (52%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  +  A  +N   +  +     AL  ++      F+ G  GA NGMKP+G   
Sbjct: 664 MADQLCGQFY--ARLLNFPDVVETQYTESALNKVYEACFLKFQDGKYGAANGMKPDGTPE 721

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           D ++   +EVWTG+ + L+A +L  G  D A+     + +++ E+
Sbjct: 722 DPNSTHPQEVWTGINFGLAAFLLQMGRKDAAFKLTEAVVKQVYEN 766


>gi|119510228|ref|ZP_01629365.1| hypothetical protein N9414_13977 [Nodularia spumigena CCY9414]
 gi|119465077|gb|EAW45977.1| hypothetical protein N9414_13977 [Nodularia spumigena CCY9414]
          Length = 814

 Score = 60.1 bits (144), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 55/105 (52%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  + +  G+    I PS     AL T++      F  G  GA NG++P+G   
Sbjct: 664 MADQLCGQFYARLLGL--PDIVPSDRALSALQTVYHACFVKFCNGEFGAANGVRPDGSAE 721

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           + +A    EVWTG+ + L+A ++  G  DEA    G + +++  +
Sbjct: 722 NPNATHPLEVWTGINFGLAAFLVQMGMQDEALKLTGAVVQQIYHN 766


>gi|428209901|ref|YP_007094254.1| glucosylceramidase [Chroococcidiopsis thermalis PCC 7203]
 gi|428011822|gb|AFY90385.1| Glucosylceramidase [Chroococcidiopsis thermalis PCC 7203]
          Length = 812

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 36/91 (39%), Positives = 46/91 (50%), Gaps = 3/91 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  +  A  +    I P      AL TI+      F  G  GA NG+KPNG   
Sbjct: 673 MADQLCGQFY--ARLLKLPDIVPQECALSALKTIYDACFLKFHQGKFGAANGLKPNGSPE 730

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
           + +A    EVWTG+ + L+A +L  G   EA
Sbjct: 731 NPNATHPLEVWTGINFGLAAFLLQMGMKAEA 761


>gi|254410084|ref|ZP_05023864.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196183120|gb|EDX78104.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 814

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQLCG  + +  G+    I P      AL T++ +    F     GA NG+KP+G  +
Sbjct: 675 MADQLCGQFYAQLLGL--PDIVPPECTQSALKTVYESCFLKFHNAQFGAANGVKPDGSPE 732

Query: 61  R-SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
              A    EVWTG+ + L+A ++  G  DEA      + R++ E+
Sbjct: 733 HPDATHPLEVWTGINFGLAAFLIQLGMKDEALKLTESVVRQVYEN 777


>gi|402583425|gb|EJW77369.1| hypothetical protein WUBG_11725, partial [Wuchereria bancrofti]
          Length = 109

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 39/61 (63%)

Query: 21 IFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAA 80
          + P ++I  AL  I+S+N+  F  G +GAVNGM+ +G   R  +Q++E+W GVTY L   
Sbjct: 1  LLPLSHIKSALNQIYSSNILNFAGGRLGAVNGMRKDGTVGRRHLQADEMWVGVTYALQHH 60

Query: 81 M 81
          +
Sbjct: 61 L 61


>gi|225593155|gb|ACN96073.1| pedicted bile acid beta-glucosidase [Fischerella sp. MV11]
          Length = 812

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  + +  G+    I        AL T+++     F  G  GA NG++P+G   
Sbjct: 675 MADQLCGQFYARLLGLPD--IVKRDRALSALKTVYNACFLKFHHGQFGAANGLRPDGSPE 732

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVWTG+ + L+A M+  G  DEA      + R++ E+
Sbjct: 733 NPQATHPLEVWTGINFGLAAFMVQMGMKDEAMKLTEAVVRQVYEN 777


>gi|428200465|ref|YP_007079054.1| putative bile acid beta-glucosidase [Pleurocapsa sp. PCC 7327]
 gi|427977897|gb|AFY75497.1| putative bile acid beta-glucosidase [Pleurocapsa sp. PCC 7327]
          Length = 808

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 35/105 (33%), Positives = 54/105 (51%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  + +  G+    +        AL  ++      F+ G  GA NG+KP+G   
Sbjct: 665 MADQLCGQFYARLLGL--PDVVERQYAESALKKVYEACFLKFQDGKFGAANGVKPDGSPE 722

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           D  A    EVWTG+ + L+A +L  G  DEA+  A  + R++ ++
Sbjct: 723 DPKATHPLEVWTGINFGLAAFLLQMGMKDEAFKLAETVVRQVYDN 767


>gi|354566897|ref|ZP_08986068.1| Glucosylceramidase [Fischerella sp. JSC-11]
 gi|353544556|gb|EHC14010.1| Glucosylceramidase [Fischerella sp. JSC-11]
          Length = 812

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  + +  G+    I        AL T+++     F  G  GA NG++P+G   
Sbjct: 675 MADQLCGQFYARLLGL--PDIVKRDRALSALKTVYNACFLKFHHGQFGAANGLRPDGSPE 732

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVWTG+ + L+A M+  G  DEA      + R++ E+
Sbjct: 733 NPQATHPLEVWTGINFGLAAFMVQMGMKDEAMKLTETVVRQIYEN 777


>gi|428299644|ref|YP_007137950.1| glucosylceramidase [Calothrix sp. PCC 6303]
 gi|428236188|gb|AFZ01978.1| Glucosylceramidase [Calothrix sp. PCC 6303]
          Length = 804

 Score = 58.5 bits (140), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 47/92 (51%), Gaps = 3/92 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  +  A  +    I P      A+ T++      F  G  GA NG++P+G   
Sbjct: 662 MADQLCGQFY--ARLLKLADIVPHDCAISAVRTVYEACFINFNDGKFGAANGLRPDGSPE 719

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 91
           +  A    EVWTG+ + L+A ++  G  DEAW
Sbjct: 720 NPKATHPLEVWTGINFGLAAFLMQMGMQDEAW 751


>gi|427727368|ref|YP_007073605.1| putative bile acid beta-glucosidase [Nostoc sp. PCC 7524]
 gi|427363287|gb|AFY46008.1| putative bile acid beta-glucosidase [Nostoc sp. PCC 7524]
          Length = 803

 Score = 58.5 bits (140), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 48/91 (52%), Gaps = 3/91 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  +  A+ +N   I P      AL T++      F  G  GA NG++P+G   
Sbjct: 663 MADQLCGQFY--AALLNLPDIVPGDRAISALRTVYDACFLKFYNGQFGAANGVRPDGSPE 720

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
           +  A    EVWTG+ + L+A ++  G  DEA
Sbjct: 721 NPQATHPLEVWTGINFGLAAFLVQMGMKDEA 751


>gi|113477064|ref|YP_723125.1| hypothetical protein Tery_3568 [Trichodesmium erythraeum IMS101]
 gi|110168112|gb|ABG52652.1| protein of unknown function DUF608 [Trichodesmium erythraeum
           IMS101]
          Length = 816

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 50/91 (54%), Gaps = 3/91 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQL G  + K   +N E I P+     AL T++++  + F  G  GA NG+ P+G   
Sbjct: 676 MADQLSGQFYAKL--LNLEDIVPAECALSALKTVYNSCFKNFHNGKFGAANGVLPDGSPE 733

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
           + +A    EVWTG+ + L+A M+  G   EA
Sbjct: 734 NPNATHPLEVWTGINFGLAAFMVQIGMKKEA 764


>gi|440680630|ref|YP_007155425.1| Glucosylceramidase [Anabaena cylindrica PCC 7122]
 gi|428677749|gb|AFZ56515.1| Glucosylceramidase [Anabaena cylindrica PCC 7122]
          Length = 802

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 33/91 (36%), Positives = 49/91 (53%), Gaps = 3/91 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  + +  G+    I P+ N   AL T++      F  G  GA NG++P+G   
Sbjct: 666 MADQLCGQFYARLLGL--PDIVPNDNALSALQTVYDACFVKFCNGQFGAANGVRPDGSPE 723

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
           +  A    EVWTG+ + L+A ++  G  +EA
Sbjct: 724 NPQATHPLEVWTGINFGLAAFLIQMGMKNEA 754


>gi|427725201|ref|YP_007072478.1| glucosylceramidase [Leptolyngbya sp. PCC 7376]
 gi|427356921|gb|AFY39644.1| Glucosylceramidase [Leptolyngbya sp. PCC 7376]
          Length = 807

 Score = 57.4 bits (137), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 47/91 (51%), Gaps = 3/91 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           M+DQLCG  + +   +N   + P    A  L TI+      F  G  GA NG+ P+G  +
Sbjct: 670 MSDQLCGEFYTQL--LNLPNVNPENRTAITLQTIYEACFLKFHDGQFGAANGLNPDGSPE 727

Query: 61  R-SAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
           + +     EVWTG+ + ++A M+  G  DEA
Sbjct: 728 KENDTHPLEVWTGINFGIAAFMIRNGMKDEA 758


>gi|427709805|ref|YP_007052182.1| glucosylceramidase [Nostoc sp. PCC 7107]
 gi|427362310|gb|AFY45032.1| Glucosylceramidase [Nostoc sp. PCC 7107]
          Length = 802

 Score = 57.4 bits (137), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 33/91 (36%), Positives = 47/91 (51%), Gaps = 3/91 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  + +   +    I PS     AL T++      F+ G  GA NG+ P+G   
Sbjct: 664 MADQLCGQFYARL--LELPDIVPSDRALSALKTVYDACFLKFQNGEFGAANGVLPDGSPE 721

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
           +  A    EVWTG+ + L+A ++  G  DEA
Sbjct: 722 NPKATHPLEVWTGINFGLAAFLVQMGMQDEA 752


>gi|427712531|ref|YP_007061155.1| putative bile acid beta-glucosidase [Synechococcus sp. PCC 6312]
 gi|427376660|gb|AFY60612.1| putative bile acid beta-glucosidase [Synechococcus sp. PCC 6312]
          Length = 799

 Score = 57.0 bits (136), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 34/105 (32%), Positives = 52/105 (49%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQLCG  +  A  +N   + P      AL T++      F  G  GA NG+ P+G  +
Sbjct: 656 MADQLCGQYY--ARLLNLPDVVPIECAESALKTVYDVCFLKFHEGQFGAANGLLPDGSPE 713

Query: 61  R-SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           + +A    EVW G+ Y L+A +   G  D AW     + R++ ++
Sbjct: 714 KPNATHPLEVWIGINYGLAAFLYQMGLEDGAWRLTQTVIRQIYDN 758


>gi|428308994|ref|YP_007119971.1| bile acid beta-glucosidase [Microcoleus sp. PCC 7113]
 gi|428250606|gb|AFZ16565.1| putative bile acid beta-glucosidase [Microcoleus sp. PCC 7113]
          Length = 819

 Score = 57.0 bits (136), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 31/105 (29%), Positives = 55/105 (52%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           M DQLCG  + +  G+    I P   +  AL +++ +    F+   +GA NG++P+G   
Sbjct: 681 MTDQLCGQFYAQLLGL--PDIVPPECVQSALKSVYDSCFLKFQDAQLGAANGVRPDGSPE 738

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVWTG+ + L+A ++ +G   EA      + R++ E+
Sbjct: 739 NPDATHPLEVWTGINFGLAAFLVQQGMKQEALQLTEAVVRQVYEN 783


>gi|75907319|ref|YP_321615.1| hypothetical protein Ava_1096 [Anabaena variabilis ATCC 29413]
 gi|75701044|gb|ABA20720.1| Protein of unknown function DUF608 [Anabaena variabilis ATCC 29413]
          Length = 804

 Score = 57.0 bits (136), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 33/90 (36%), Positives = 47/90 (52%), Gaps = 3/90 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  +  A+ +    I PS     AL T++      F  G  GA NG++P+G   
Sbjct: 667 MADQLCGQFY--ANLLELPDIVPSDRAISALQTVYDACFLKFYDGQFGAANGVRPDGSPE 724

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 89
           +  A    EVWTG+ + L+A ++  G  DE
Sbjct: 725 NPKATHPLEVWTGINFGLAAFLVQMGMKDE 754


>gi|7470943|pir||T31071 conserved hypothetical protein - Anabaena sp. (fragment)
 gi|2828799|gb|AAC38131.1|AAC38131 unknown [Nostoc sp. PCC 7120]
          Length = 439

 Score = 57.0 bits (136), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 33/92 (35%), Positives = 48/92 (52%), Gaps = 3/92 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  +  A+ +    I PS     AL T++      F  G  GA NG++P+G   
Sbjct: 302 MADQLCGQFY--ANLLELPDIVPSDRAISALQTVYDACFLKFYDGQFGAANGVRPDGSPE 359

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 91
           +  A    EVWTG+ + L+A ++  G  DE +
Sbjct: 360 NPKATHPLEVWTGINFGLAAFLVQMGMKDEGF 391


>gi|17230308|ref|NP_486856.1| hypothetical protein alr2816 [Nostoc sp. PCC 7120]
 gi|17131910|dbj|BAB74515.1| alr2816 [Nostoc sp. PCC 7120]
          Length = 804

 Score = 56.6 bits (135), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 33/90 (36%), Positives = 47/90 (52%), Gaps = 3/90 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  +  A+ +    I PS     AL T++      F  G  GA NG++P+G   
Sbjct: 667 MADQLCGQFY--ANLLELPDIVPSDRAISALQTVYDACFLKFYDGQFGAANGVRPDGSPE 724

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 89
           +  A    EVWTG+ + L+A ++  G  DE
Sbjct: 725 NPKATHPLEVWTGINFGLAAFLVQMGMKDE 754


>gi|162452686|ref|YP_001615053.1| hypothetical protein sce4410 [Sorangium cellulosum So ce56]
 gi|161163268|emb|CAN94573.1| hypothetical protein sce4410 [Sorangium cellulosum So ce56]
          Length = 782

 Score = 56.6 bits (135), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 57/124 (45%), Gaps = 13/124 (10%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           M+DQL G  + +A G+    I   ++ A A   I+  N + F+ G+ G VN M  NG  D
Sbjct: 642 MSDQLAGQWYARALGL--PPIVDPSHAASAFTKIYENNFKRFDGGTRGVVNVMTANGSVD 699

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEG-----------NVDEAWTTAGGLYRRLREDRVSGC 109
            ++ Q+ E W G ++ + A M+ EG            VD  W T G  +R       +G 
Sbjct: 700 GTSNQTRECWVGTSWGVVAGMIQEGLAAQAGEIGASLVDTIWKTDGLWFRTPEAWEGNGS 759

Query: 110 ETAP 113
             AP
Sbjct: 760 IRAP 763


>gi|416392732|ref|ZP_11685894.1| hypothetical protein CWATWH0003_2700 [Crocosphaera watsonii WH
           0003]
 gi|357263592|gb|EHJ12579.1| hypothetical protein CWATWH0003_2700 [Crocosphaera watsonii WH
           0003]
          Length = 804

 Score = 56.6 bits (135), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 31/105 (29%), Positives = 56/105 (53%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  + +  G++   +  S     AL  ++      F+ G  GA NGMKP+G   
Sbjct: 664 MADQLCGQFYARLLGLSD--VVESEYTLSALNKVYEACFLKFQNGKYGAANGMKPDGTPE 721

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           D ++   +EVWTG+ + +++ ++  G  +EA      + +++ E+
Sbjct: 722 DPNSTHPQEVWTGINFGIASFLIQMGMKNEALKLTETVVKQVYEN 766


>gi|170079447|ref|YP_001736085.1| hypothetical protein SYNPCC7002_A2862 [Synechococcus sp. PCC 7002]
 gi|169887116|gb|ACB00830.1| Conserved hypothetical protein (DUF608) [Synechococcus sp. PCC
           7002]
          Length = 803

 Score = 56.6 bits (135), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 48/91 (52%), Gaps = 3/91 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQLCG  + +   ++   + P    A  LATI++     F  G  GA NG+ P+G  +
Sbjct: 664 MADQLCGQYYTQL--LSLPDVNPGDRTATTLATIYTACFEKFHGGQFGAANGLNPDGTPE 721

Query: 61  R-SAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
           + +     EVWTG+ + ++A M+  G   EA
Sbjct: 722 KENDTHPLEVWTGINFGIAALMIRNGMQTEA 752


>gi|67922703|ref|ZP_00516206.1| Protein of unknown function DUF608 [Crocosphaera watsonii WH 8501]
 gi|67855413|gb|EAM50669.1| Protein of unknown function DUF608 [Crocosphaera watsonii WH 8501]
          Length = 804

 Score = 56.6 bits (135), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 31/105 (29%), Positives = 56/105 (53%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  + +  G++   +  S     AL  ++      F+ G  GA NGMKP+G   
Sbjct: 664 MADQLCGQFYARLLGLSD--VVESEYTLSALNKVYEACFLKFQNGKYGAANGMKPDGTPE 721

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           D ++   +EVWTG+ + +++ ++  G  +EA      + +++ E+
Sbjct: 722 DPNSTHPQEVWTGINFGIASFLIQMGMKNEALKLTETVVKQVYEN 766


>gi|186681980|ref|YP_001865176.1| hypothetical protein Npun_R1544 [Nostoc punctiforme PCC 73102]
 gi|186464432|gb|ACC80233.1| protein of unknown function DUF608 [Nostoc punctiforme PCC 73102]
          Length = 801

 Score = 56.2 bits (134), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 53/105 (50%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  +  A  ++   I PS     AL T++      F  G  G  NG++P+G   
Sbjct: 664 MADQLCGQFY--ARLLDLPDIVPSDRALSALKTVYDACFLKFCNGEFGTANGVRPDGSPE 721

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVWTG+ + L+A ++  G  DEA      + +++ E+
Sbjct: 722 NPKATHPLEVWTGINFGLAAFLVQMGMKDEALRLTQAVVQQIYEN 766


>gi|119493962|ref|ZP_01624522.1| hypothetical protein L8106_04106 [Lyngbya sp. PCC 8106]
 gi|119452283|gb|EAW33479.1| hypothetical protein L8106_04106 [Lyngbya sp. PCC 8106]
          Length = 799

 Score = 56.2 bits (134), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 34/105 (32%), Positives = 59/105 (56%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
           MADQLCG  + K  G+  + + P   I+ AL TI+ +  + F  G  GA NG++ NG+  
Sbjct: 661 MADQLCGQFYAKLLGLE-DIVSPEKTIS-ALQTIYQSCFQNFYQGQFGAANGVRINGEPI 718

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           + +     EVWTG+ + L+A ++  G  +E +  A  + +++ E+
Sbjct: 719 NPNDTHPLEVWTGINFGLAAFLIQMGMKEEGFKLAEVVVQQIYEN 763


>gi|298490121|ref|YP_003720298.1| glucosylceramidase ['Nostoc azollae' 0708]
 gi|298232039|gb|ADI63175.1| Glucosylceramidase ['Nostoc azollae' 0708]
          Length = 806

 Score = 55.5 bits (132), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 33/91 (36%), Positives = 48/91 (52%), Gaps = 3/91 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  + +  G+    I PS +   AL T++      F  G  GA NG+ P+G   
Sbjct: 669 MADQLCGQFYTRLLGL--PDIVPSNHAHSALQTVYEACFVKFYDGKFGAANGVLPDGSPE 726

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
           +  A    EVWTG+ + L+A ++  G  +EA
Sbjct: 727 NPKATHPLEVWTGINFGLAAFLVQMGMKNEA 757


>gi|334116939|ref|ZP_08491031.1| Glucosylceramidase [Microcoleus vaginatus FGP-2]
 gi|333461759|gb|EGK90364.1| Glucosylceramidase [Microcoleus vaginatus FGP-2]
          Length = 818

 Score = 55.5 bits (132), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 53/105 (50%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  + K  G+    I P      AL T++ +    F  G  GA NG+  NG   
Sbjct: 678 MADQLCGQFYAKLLGL--PDIVPPECAVSALETVYESCFLKFNEGEFGAANGVMLNGSPE 735

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           + +A    EVWTG+ + L+A ++  G  D+A+     +  ++ E+
Sbjct: 736 NPNATHPLEVWTGINFGLAAFLVQMGMEDKAFKLTDAVVNQIYEN 780


>gi|427716943|ref|YP_007064937.1| glucosylceramidase [Calothrix sp. PCC 7507]
 gi|427349379|gb|AFY32103.1| Glucosylceramidase [Calothrix sp. PCC 7507]
          Length = 800

 Score = 55.5 bits (132), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 33/91 (36%), Positives = 47/91 (51%), Gaps = 3/91 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  + +  G+    I  S     AL T++      F  G  GA NG++P+G   
Sbjct: 660 MADQLCGQFYARLLGL--PDIVASDRALSALQTVYEACFLKFYDGQFGAANGVRPDGSPE 717

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
           +  A    EVWTG+ + L+A ++  G  DEA
Sbjct: 718 NPQATHPLEVWTGINFGLAAFLVQMGMQDEA 748


>gi|307151416|ref|YP_003886800.1| glucosylceramidase [Cyanothece sp. PCC 7822]
 gi|306981644|gb|ADN13525.1| Glucosylceramidase [Cyanothece sp. PCC 7822]
          Length = 805

 Score = 55.1 bits (131), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 33/105 (31%), Positives = 55/105 (52%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           M+DQLCG  + +  G+  + + P    + AL  I+      F  G  GA NG+KP+G   
Sbjct: 664 MSDQLCGQFYARLLGL-PDVVEPQYTTS-ALRKIYEACFLKFHGGKYGAANGVKPDGTPE 721

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVWTG+ + L+A M+  G  +EA+     + +++ E+
Sbjct: 722 NPDATHPLEVWTGINFGLAAFMIQMGMKEEAFKITEAVVKQVYEN 766


>gi|16330256|ref|NP_440984.1| hypothetical protein sll1775 [Synechocystis sp. PCC 6803]
 gi|383321997|ref|YP_005382850.1| hypothetical protein SYNGTI_1088 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383325166|ref|YP_005386019.1| hypothetical protein SYNPCCP_1087 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383491050|ref|YP_005408726.1| hypothetical protein SYNPCCN_1087 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384436317|ref|YP_005651041.1| hypothetical protein SYNGTS_1088 [Synechocystis sp. PCC 6803]
 gi|451814414|ref|YP_007450866.1| hypothetical protein MYO_110980 [Synechocystis sp. PCC 6803]
 gi|1652745|dbj|BAA17664.1| sll1775 [Synechocystis sp. PCC 6803]
 gi|339273349|dbj|BAK49836.1| hypothetical protein SYNGTS_1088 [Synechocystis sp. PCC 6803]
 gi|359271316|dbj|BAL28835.1| hypothetical protein SYNGTI_1088 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359274486|dbj|BAL32004.1| hypothetical protein SYNPCCN_1087 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359277656|dbj|BAL35173.1| hypothetical protein SYNPCCP_1087 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|407958168|dbj|BAM51408.1| hypothetical protein BEST7613_2477 [Bacillus subtilis BEST7613]
 gi|451780383|gb|AGF51352.1| hypothetical protein MYO_110980 [Synechocystis sp. PCC 6803]
          Length = 817

 Score = 54.7 bits (130), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 49/91 (53%), Gaps = 3/91 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  +  A  +N   +       +ALA ++ T    F  G +GA NG+ P+G  +
Sbjct: 660 MADQLCGQFY--ARLLNLPDVVEHHYAEKALAKVYDTCFLKFAHGELGAANGLLPDGSPQ 717

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
           + +     EVWTG+ + L+A ++  G  +EA
Sbjct: 718 NPNDTHPLEVWTGINFGLAAFLIQMGMKEEA 748


>gi|428317101|ref|YP_007114983.1| Glucosylceramidase [Oscillatoria nigro-viridis PCC 7112]
 gi|428240781|gb|AFZ06567.1| Glucosylceramidase [Oscillatoria nigro-viridis PCC 7112]
          Length = 809

 Score = 54.3 bits (129), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 54/105 (51%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  + K  G+    I P      AL T++ +    F  G  GA NG+  NG   
Sbjct: 669 MADQLCGQFYAKLLGLPD--IVPPECAVSALETVYESCFLKFNDGEFGAANGVMLNGSPE 726

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           + +A    EVWTG+ + L+A ++  G  ++A+     + +++ E+
Sbjct: 727 NPNATHPLEVWTGINFGLAAFLVQMGMEEKAFKLTDAVVKQIYEN 771


>gi|357139475|ref|XP_003571307.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
           distachyon]
          Length = 946

 Score = 54.3 bits (129), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 43/101 (42%), Positives = 58/101 (57%), Gaps = 5/101 (4%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMK---PNG 57
           MADQL G  + +A G+  E +       RALA +   NV   + G +GAVNG +     G
Sbjct: 735 MADQLAGQWYARACGL--EPVVGRDKARRALAAVLEHNVMQVQGGGVGAVNGARLPEHGG 792

Query: 58  DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
             D S+ QS+EVWTG TY ++AAM+ EG  +E +T A G Y
Sbjct: 793 GVDESSTQSKEVWTGTTYAVAAAMIGEGMREEGFTAAKGAY 833


>gi|220909879|ref|YP_002485190.1| hypothetical protein Cyan7425_4519 [Cyanothece sp. PCC 7425]
 gi|219866490|gb|ACL46829.1| protein of unknown function DUF608 [Cyanothece sp. PCC 7425]
          Length = 814

 Score = 54.3 bits (129), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 48/91 (52%), Gaps = 3/91 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQLCG  + +  G+    + P      AL  ++ T    F  G++GA NG++P G  +
Sbjct: 663 MADQLCGQFYARLVGL--PDVVPEDCARAALDMVYQTCFLQFHNGTLGAANGLRPGGLAE 720

Query: 61  RS-AIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
           +  A    EVWTG+ + L+A ++  G   EA
Sbjct: 721 KPDATHPLEVWTGINFGLAAFLVQMGRKAEA 751


>gi|172036524|ref|YP_001803025.1| hypothetical protein cce_1609 [Cyanothece sp. ATCC 51142]
 gi|354553307|ref|ZP_08972614.1| Glucosylceramidase [Cyanothece sp. ATCC 51472]
 gi|171697978|gb|ACB50959.1| unknown [Cyanothece sp. ATCC 51142]
 gi|353555137|gb|EHC24526.1| Glucosylceramidase [Cyanothece sp. ATCC 51472]
          Length = 818

 Score = 54.3 bits (129), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 31/105 (29%), Positives = 53/105 (50%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQL G  + +  G+    +        AL  ++      F+ G  GA NGMKP+G   
Sbjct: 678 MADQLSGQFYARLLGL--PDVVEQQYALSALKKVYEACFLKFQNGKYGAANGMKPDGTPE 735

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           D ++   +EVWTG+ + L++ ++  G  DEA      + +++ E+
Sbjct: 736 DPNSTHPQEVWTGINFGLASFLIQMGMKDEALKLTEAVVKQVYEN 780


>gi|126657976|ref|ZP_01729128.1| hypothetical protein CY0110_05157 [Cyanothece sp. CCY0110]
 gi|126620614|gb|EAZ91331.1| hypothetical protein CY0110_05157 [Cyanothece sp. CCY0110]
          Length = 804

 Score = 54.3 bits (129), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 31/105 (29%), Positives = 54/105 (51%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQL G  + +  G+    +  +     AL  ++      F+ G  GA NGMKP+G   
Sbjct: 664 MADQLSGQFYARLLGL--PDVVENQYALSALKKVYEACFLKFQNGKYGAANGMKPDGTPE 721

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           D ++   +EVWTG+ + L++ ++  G  DEA      + +++ E+
Sbjct: 722 DPNSTHPQEVWTGINFGLASFLIQMGMKDEALKLTEAVVKQVYEN 766


>gi|425455967|ref|ZP_18835678.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9807]
 gi|389803038|emb|CCI17984.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9807]
          Length = 784

 Score = 53.9 bits (128), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 49/105 (46%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
           M DQLCG  +  A  ++   +  +     AL+ I+      F  G  GA NG+KP+G   
Sbjct: 643 MTDQLCGQFY--ARLLSLPDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPE 700

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVW G+ + L A +L  G   + W     + R++ E+
Sbjct: 701 NPEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQIYEN 745


>gi|282897193|ref|ZP_06305195.1| Protein of unknown function DUF608 [Raphidiopsis brookii D9]
 gi|281197845|gb|EFA72739.1| Protein of unknown function DUF608 [Raphidiopsis brookii D9]
          Length = 798

 Score = 53.9 bits (128), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 34/102 (33%), Positives = 49/102 (48%), Gaps = 3/102 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQL G  +  A  +N   I P      AL TI+      F+ G  GA NG+ P+G   
Sbjct: 661 MADQLSGQFY--ARLLNLPDIVPQDRALSALTTIYDACFLKFQDGKFGAANGVLPDGSPE 718

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRL 101
           +  A    EVWTG+ + L+A +L  G  D+       + R++
Sbjct: 719 NPQATHPLEVWTGINFGLAAFLLQMGMKDQGLRLTQAVVRQI 760


>gi|434407175|ref|YP_007150060.1| putative bile acid beta-glucosidase [Cylindrospermum stagnale PCC
           7417]
 gi|428261430|gb|AFZ27380.1| putative bile acid beta-glucosidase [Cylindrospermum stagnale PCC
           7417]
          Length = 816

 Score = 53.5 bits (127), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 47/91 (51%), Gaps = 3/91 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
           MADQLCG  + +  G+    I        AL T++      F  G  GA NG++P+G   
Sbjct: 675 MADQLCGQFYARLLGL--PDIVKCDRALSALQTVYDACFLKFFDGQFGAANGVRPDGVPE 732

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
           + +A    EVWTG+ + L+A ++  G  DEA
Sbjct: 733 NPNATHPLEVWTGINFGLAAFLVQMGMQDEA 763


>gi|428223619|ref|YP_007107716.1| glucosylceramidase [Geitlerinema sp. PCC 7407]
 gi|427983520|gb|AFY64664.1| Glucosylceramidase [Geitlerinema sp. PCC 7407]
          Length = 821

 Score = 53.5 bits (127), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 3/90 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  + +   ++   +  S     AL T++ +    F  G  GA NG+KP+G   
Sbjct: 670 MADQLCGQFYARL--LDLPDVVASDRALSALKTVYDSCFLRFHDGKFGAANGVKPDGSPE 727

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 89
           +  A    EVWTG+ + ++A ++  G  DE
Sbjct: 728 NPKATHPLEVWTGINFGIAAFLVQMGLRDE 757


>gi|425450263|ref|ZP_18830094.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 7941]
 gi|389768962|emb|CCI06046.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 7941]
          Length = 784

 Score = 53.5 bits (127), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 48/105 (45%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
           M DQLCG  + +   +    +  +     AL+ I+      F  G  GA NG+KP+G   
Sbjct: 643 MTDQLCGQFYARLLAL--PDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPE 700

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVW G+ + L A +L  G   + W     + R++ E+
Sbjct: 701 NPEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITQAVVRQIYEN 745


>gi|422303239|ref|ZP_16390593.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9806]
 gi|389791807|emb|CCI12404.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9806]
          Length = 784

 Score = 53.5 bits (127), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 50/105 (47%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
           M DQLCG  +  A  ++   +  +     AL+ I+ +    F  G  GA NG+KP+G   
Sbjct: 643 MTDQLCGQFY--ARLLSLPDVVENQYTQSALSKIYQSCFLKFHGGKFGAANGLKPDGTPE 700

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVW G+ + L A ++  G   + W     + R++ E+
Sbjct: 701 NPEATHPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVRQVYEN 745


>gi|414077215|ref|YP_006996533.1| beta-glucosidase [Anabaena sp. 90]
 gi|413970631|gb|AFW94720.1| putative beta-glucosidase [Anabaena sp. 90]
          Length = 795

 Score = 53.5 bits (127), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 54/105 (51%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
           MADQLCG  +  A  ++   I PS     AL  ++ +    F+ G  GA NG+ PNG   
Sbjct: 658 MADQLCGQFY--ARLLSLPDIVPSDRALSALTKVYDSCFLKFQNGEFGAANGVLPNGLPE 715

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           + ++    EVWTG+ + L+A ++     +EA      + R++ ++
Sbjct: 716 NPNSTHPLEVWTGINFGLAAFLVQMNMKNEAMRLTEAVVRQIYDN 760


>gi|218437284|ref|YP_002375613.1| hypothetical protein PCC7424_0277 [Cyanothece sp. PCC 7424]
 gi|218170012|gb|ACK68745.1| protein of unknown function DUF608 [Cyanothece sp. PCC 7424]
          Length = 801

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 31/105 (29%), Positives = 55/105 (52%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           M+DQLCG  + +  G+  + + P   ++ AL  I+ +    F  G  GA NG++P+G   
Sbjct: 664 MSDQLCGQFYARLLGL-PDVVEPQYTLS-ALRKIYESCFLQFYGGKYGAANGVRPDGSPE 721

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVWTG+ + L+A M+  G   E  +    + +++ E+
Sbjct: 722 NPDATHPLEVWTGINFGLAAFMIQMGMKQEGLSITEAVVKQIYEN 766


>gi|425435596|ref|ZP_18816046.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9432]
 gi|389679839|emb|CCH91411.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9432]
          Length = 784

 Score = 52.8 bits (125), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 48/105 (45%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
           M DQLCG  + +   +    +  +     AL+ I+      F  G  GA NG+KP+G   
Sbjct: 643 MTDQLCGQFYARLLAL--PDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPE 700

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVW G+ + L A +L  G   + W     + R++ E+
Sbjct: 701 NPEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQIYEN 745


>gi|425460273|ref|ZP_18839755.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9808]
 gi|389827051|emb|CCI21967.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9808]
          Length = 784

 Score = 52.8 bits (125), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 48/105 (45%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
           M DQLCG  + +   +    +  +     AL+ I+      F  G  GA NG+KP+G   
Sbjct: 643 MTDQLCGQFYARLLAL--PDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPE 700

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVW G+ + L A +L  G   + W     + R++ E+
Sbjct: 701 NPEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQIYEN 745


>gi|443325334|ref|ZP_21054034.1| putative bile acid beta-glucosidase [Xenococcus sp. PCC 7305]
 gi|442795063|gb|ELS04450.1| putative bile acid beta-glucosidase [Xenococcus sp. PCC 7305]
          Length = 808

 Score = 52.8 bits (125), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 11/109 (10%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQLCG  + +  G+    +  +     AL+ I+      F  G  GA NG+ P+G   
Sbjct: 668 MADQLCGQFYARLLGL--PDVVENQYTKSALSKIYDACFLKFHDGKFGAANGVLPDG--- 722

Query: 61  RSAIQSE-----EVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
            SA+  +     EVWTG+ + L+A M+  G  DEA+     + R++ E+
Sbjct: 723 -SAVNPDDTHPLEVWTGINFGLAAFMVQMGMKDEAFRLTETVVRQVYEN 770


>gi|443669591|ref|ZP_21134796.1| hypothetical protein C789_5336 [Microcystis aeruginosa DIANCHI905]
 gi|159029778|emb|CAO87856.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443330115|gb|ELS44858.1| hypothetical protein C789_5336 [Microcystis aeruginosa DIANCHI905]
          Length = 784

 Score = 52.8 bits (125), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 48/105 (45%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
           M DQLCG  + +   +    +  +     AL+ I+      F  G  GA NG+KP+G   
Sbjct: 643 MTDQLCGQFYARLLAL--PDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPE 700

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVW G+ + L A +L  G   + W     + R++ E+
Sbjct: 701 NPEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQVYEN 745


>gi|425442611|ref|ZP_18822852.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9717]
 gi|389716273|emb|CCH99457.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9717]
          Length = 784

 Score = 52.4 bits (124), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 49/105 (46%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
           M DQLCG  +  A  ++   +  +     AL+ I+      F  G  GA NG+KP+G   
Sbjct: 643 MTDQLCGQFY--ARLLSLPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPE 700

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVW G+ + L A +L  G   + W     + +++ E+
Sbjct: 701 NPEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVKQVYEN 745


>gi|425447791|ref|ZP_18827773.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9443]
 gi|389731620|emb|CCI04385.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9443]
          Length = 784

 Score = 52.0 bits (123), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 49/105 (46%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
           M DQLCG  +  A  ++   +  +     AL+ I+      F  G  GA NG+KP+G   
Sbjct: 643 MTDQLCGQFY--ARLLSLPDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPE 700

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVW G+ + L A +L  G   + W     + +++ E+
Sbjct: 701 NPEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVQQIYEN 745


>gi|443311911|ref|ZP_21041533.1| putative bile acid beta-glucosidase [Synechocystis sp. PCC 7509]
 gi|442777986|gb|ELR88257.1| putative bile acid beta-glucosidase [Synechocystis sp. PCC 7509]
          Length = 804

 Score = 52.0 bits (123), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 32/92 (34%), Positives = 47/92 (51%), Gaps = 3/92 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  + +  GV    I  S     +L T++      F  G+ GA NG+K +G   
Sbjct: 664 MADQLCGQFYARLLGV--PDIVESDRTVSSLKTVYEACFLKFHNGAFGAANGLKIDGSPE 721

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 91
           +  A    EVWTG+ + L+A +L     +EA+
Sbjct: 722 NPDATHPLEVWTGINFGLAAFLLQMDMKEEAF 753


>gi|390437958|ref|ZP_10226465.1| Glycosidase-like protein [Microcystis sp. T1-4]
 gi|389838618|emb|CCI30589.1| Glycosidase-like protein [Microcystis sp. T1-4]
          Length = 784

 Score = 51.6 bits (122), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 29/105 (27%), Positives = 48/105 (45%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
           M DQLCG  + +   +    +  +     AL+ I+      F  G  GA NG+KP+G   
Sbjct: 643 MTDQLCGQFYARLLAL--PDVVENQYTQSALSKIYQACFLKFHEGKFGAANGLKPDGTPE 700

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVW G+ + L A ++  G   + W     + R++ E+
Sbjct: 701 NPQATHPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVRQVYEN 745


>gi|300866990|ref|ZP_07111661.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300335025|emb|CBN56827.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 863

 Score = 51.6 bits (122), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 47/91 (51%), Gaps = 3/91 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQLCG  + +  G+    I P      AL T++ +    F  G  GA NG+  +G  +
Sbjct: 722 MADQLCGQFYARLLGL--PDIVPEECAISALETVYESCFIKFNQGEFGAANGVMLDGSPE 779

Query: 61  RS-AIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
           +  A    EVWTG+ + L+A M+  G  ++A
Sbjct: 780 KPGATHPLEVWTGINFGLAAFMMQMGMKEKA 810


>gi|209526014|ref|ZP_03274547.1| protein of unknown function DUF608 [Arthrospira maxima CS-328]
 gi|376002240|ref|ZP_09780079.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|423067988|ref|ZP_17056778.1| hypothetical protein SPLC1_S630380 [Arthrospira platensis C1]
 gi|209493540|gb|EDZ93862.1| protein of unknown function DUF608 [Arthrospira maxima CS-328]
 gi|375329367|emb|CCE15832.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|406710526|gb|EKD05737.1| hypothetical protein SPLC1_S630380 [Arthrospira platensis C1]
          Length = 799

 Score = 51.6 bits (122), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 32/95 (33%), Positives = 47/95 (49%), Gaps = 3/95 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  +  A+ +    I P      AL TI+    + F  G  GA NG++  G+  
Sbjct: 660 MADQLCGQFY--AALLGLPDIVPPHRAQVALQTIYQACFQNFHNGKFGAANGVRTTGEPM 717

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTA 94
           + +     EVWTG+ + L A  +  G  +EA+  A
Sbjct: 718 NPNDTHPLEVWTGINFGLGAFFIQMGMKEEAFQLA 752


>gi|409991122|ref|ZP_11274412.1| hypothetical protein APPUASWS_08895 [Arthrospira platensis str.
           Paraca]
 gi|291568810|dbj|BAI91082.1| hypothetical protein [Arthrospira platensis NIES-39]
 gi|409938009|gb|EKN79383.1| hypothetical protein APPUASWS_08895 [Arthrospira platensis str.
           Paraca]
          Length = 799

 Score = 51.6 bits (122), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 32/95 (33%), Positives = 47/95 (49%), Gaps = 3/95 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
           MADQLCG  +  A+ +    I P      AL TI+    + F  G  GA NG++  G+  
Sbjct: 660 MADQLCGQFY--AALLGLPDIVPHHRARVALQTIYQACFQNFHNGKFGAANGVRTTGEPI 717

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTA 94
           +       EVWTG+ + L A ++  G  +EA+  A
Sbjct: 718 NPQDTHPLEVWTGINFGLGAFLIQMGMKEEAFQLA 752


>gi|440753145|ref|ZP_20932348.1| hypothetical protein O53_1523 [Microcystis aeruginosa TAIHU98]
 gi|440177638|gb|ELP56911.1| hypothetical protein O53_1523 [Microcystis aeruginosa TAIHU98]
          Length = 789

 Score = 51.6 bits (122), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 29/105 (27%), Positives = 48/105 (45%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
           M DQLCG  + +   +    +  +     AL+ I+      F  G  GA NG+KP+G   
Sbjct: 643 MTDQLCGQFYARLLAL--PDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPE 700

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVW G+ + L A +L  G   + W     + +++ E+
Sbjct: 701 NPEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVKQVYEN 745


>gi|425471589|ref|ZP_18850441.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9701]
 gi|389882505|emb|CCI37035.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9701]
          Length = 784

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 29/105 (27%), Positives = 47/105 (44%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
           M DQLCG  + +   +    +  +     AL  I+      F  G  GA NG+KP+G   
Sbjct: 643 MTDQLCGQFYARLLAL--PDVVENQYTQSALGKIYQACFLKFHGGKFGAANGLKPDGTPE 700

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVW G+ + L A +L  G   + W     + +++ E+
Sbjct: 701 NPQATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVKQVYEN 745


>gi|425464233|ref|ZP_18843555.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9809]
 gi|389833806|emb|CCI21373.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9809]
          Length = 784

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 48/105 (45%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
           M DQLCG  + +   +    +  +     AL+ I+      F  G  GA NG+KP+G   
Sbjct: 643 MTDQLCGQFYSRLLAL--PDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPE 700

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVW G+ + L A ++  G   + W     + +++ E+
Sbjct: 701 NPQATHPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVKQVYEN 745


>gi|166363582|ref|YP_001655855.1| glycosidase-like protein [Microcystis aeruginosa NIES-843]
 gi|166085955|dbj|BAG00663.1| glycosidase-like protein [Microcystis aeruginosa NIES-843]
          Length = 784

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 48/105 (45%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
           M DQLCG  + +   +    +  +     AL+ I+      F  G  GA NG+KP+G   
Sbjct: 643 MTDQLCGQFYSRLLAL--PDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPE 700

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVW G+ + L A ++  G   + W     + +++ E+
Sbjct: 701 NPQATHPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVKQVYEN 745


>gi|37522029|ref|NP_925406.1| hypothetical protein gll2460 [Gloeobacter violaceus PCC 7421]
 gi|35213028|dbj|BAC90401.1| gll2460 [Gloeobacter violaceus PCC 7421]
          Length = 867

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 38/85 (44%), Gaps = 2/85 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQLCG  +  A   N   I P      AL T++ T    F  G  G  NG   +G   
Sbjct: 718 MADQLCGEYY--AQVCNLADIVPEPQARSALETVYQTCFVKFYGGRFGCANGTNADGSFI 775

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEG 85
                  EVWTG+ + L+A M+  G
Sbjct: 776 GDTEHPSEVWTGINFGLAAFMIRNG 800


>gi|434389621|ref|YP_007100232.1| putative bile acid beta-glucosidase [Chamaesiphon minutus PCC 6605]
 gi|428020611|gb|AFY96705.1| putative bile acid beta-glucosidase [Chamaesiphon minutus PCC 6605]
          Length = 818

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 31/98 (31%), Positives = 50/98 (51%), Gaps = 9/98 (9%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMK----PN 56
           MADQLCG  + +   +  + I P      AL T++ +    F+ G  GA NG+K    P+
Sbjct: 680 MADQLCGQFYTQL--LELDDIVPPDCAKTALQTVYQSCFLNFQGGKFGAANGVKIDGSPS 737

Query: 57  GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTA 94
             +D   +   EVWTG+ + ++A ++  G  +EA   A
Sbjct: 738 NPKDTHPL---EVWTGINFGIAAFLVQMGMQEEALKLA 772


>gi|269925332|ref|YP_003321955.1| hypothetical protein Tter_0211 [Thermobaculum terrenum ATCC
           BAA-798]
 gi|269788992|gb|ACZ41133.1| protein of unknown function DUF608 [Thermobaculum terrenum ATCC
           BAA-798]
          Length = 811

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/126 (31%), Positives = 57/126 (45%), Gaps = 20/126 (15%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNV-----------RGFEAGS-MG 48
           ++DQL G      +G+    + P  +I+RAL +IF+ N            R +  G   G
Sbjct: 598 LSDQLFGQTLASLTGLGY--LLPKEHISRALESIFAYNFKPNFWNHTNTQRVYALGDDAG 655

Query: 49  AVNGMKPNGDRDRSAIQ-SEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVS 107
            V    P GDR       S+EVW+G  Y ++  M+YEG +DEA T       R   DR  
Sbjct: 656 LVMCTWPFGDRPSFPFPYSDEVWSGTEYQVATLMIYEGLLDEALTII-----RATRDRYD 710

Query: 108 GCETAP 113
           G +  P
Sbjct: 711 GFKRNP 716


>gi|443320525|ref|ZP_21049620.1| putative bile acid beta-glucosidase [Gloeocapsa sp. PCC 73106]
 gi|442789771|gb|ELR99409.1| putative bile acid beta-glucosidase [Gloeocapsa sp. PCC 73106]
          Length = 806

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 29/102 (28%), Positives = 50/102 (49%), Gaps = 3/102 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQL G  + +  G+    +     +   L TI+ +    F  G +GA NG+K +G   
Sbjct: 664 MADQLAGQFYAQLLGL--PDVVEKEYVQSTLKTIYQSCFLNFHQGKIGAANGVKLDGTGE 721

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRL 101
           +  +    EVWTG+ + L A +++ G  DE    A  + +++
Sbjct: 722 NPQSTHPLEVWTGINFALGAFLIHSGMRDEGLKLAETIVKQI 763


>gi|428770169|ref|YP_007161959.1| glucosylceramidase [Cyanobacterium aponinum PCC 10605]
 gi|428684448|gb|AFZ53915.1| Glucosylceramidase [Cyanobacterium aponinum PCC 10605]
          Length = 802

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 30/105 (28%), Positives = 48/105 (45%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
           MADQLCG  + +  G+    +  S  +   L  ++      F  G  G  NG+KP+G   
Sbjct: 663 MADQLCGQFYAQLLGL--PDVVESKYVKSTLRKVYDACFLKFHDGKFGIANGVKPDGKPV 720

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
             +     EVWTG+ Y + A M+  G   E    A  + +++ E+
Sbjct: 721 KENDTHPLEVWTGINYGIVAFMILNGMKQEGLRVAETVVKQVYEN 765


>gi|427733995|ref|YP_007053539.1| putative bile acid beta-glucosidase [Rivularia sp. PCC 7116]
 gi|427369036|gb|AFY52992.1| putative bile acid beta-glucosidase [Rivularia sp. PCC 7116]
          Length = 799

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 50/104 (48%), Gaps = 3/104 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQLCG  +  A  +    I P      AL +++      F  G +GA NG++ +G   
Sbjct: 663 MADQLCGQFY--ARLLKLPDIVPPECTMSALNSVYDACFLKFNQGKLGAANGLRLDGSPE 720

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
           +  A    EVWTG+ + ++A ++  G   EA+     +  ++ E
Sbjct: 721 NPDATHPLEVWTGINFGIAAFLVQMGMKSEAFKLTEAVVNQIYE 764


>gi|428210868|ref|YP_007084012.1| putative bile acid beta-glucosidase [Oscillatoria acuminata PCC
           6304]
 gi|427999249|gb|AFY80092.1| putative bile acid beta-glucosidase [Oscillatoria acuminata PCC
           6304]
          Length = 820

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 32/105 (30%), Positives = 52/105 (49%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
           MADQLCG  + +  G+    I P      AL T++ +    F  G  GA NG+  +G   
Sbjct: 677 MADQLCGQFYARLLGLAD--IVPIDCAESALKTVYESCFLKFHNGEFGAANGVLIDGSPV 734

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           +  A    EVWTG+ + L+A +L  G  +E +     + +++ E+
Sbjct: 735 NPDATHPLEVWTGINFGLAAFLLQMGMKEEGFKLTETVIQQVYEN 779


>gi|282900195|ref|ZP_06308151.1| protein of unknown function DUF608 [Cylindrospermopsis raciborskii
           CS-505]
 gi|281194944|gb|EFA69885.1| protein of unknown function DUF608 [Cylindrospermopsis raciborskii
           CS-505]
          Length = 789

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 33/104 (31%), Positives = 49/104 (47%), Gaps = 3/104 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           MADQL G  +  A  +N   I P      AL TI+      F+ G  GA NG+  +G   
Sbjct: 652 MADQLSGQFY--ARLLNLPDIVPKDRALSALTTIYDACFLKFQDGKFGAANGVLLDGSPE 709

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
           +  A    EVWTG+ + L+A +L  G  ++       + R++ E
Sbjct: 710 NPQATHPLEVWTGINFGLAAFLLQMGMKEQGLRLTEAVVRQVYE 753


>gi|434400779|ref|YP_007134783.1| Glucosylceramidase [Stanieria cyanosphaera PCC 7437]
 gi|428271876|gb|AFZ37817.1| Glucosylceramidase [Stanieria cyanosphaera PCC 7437]
          Length = 804

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 32/105 (30%), Positives = 50/105 (47%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
           MADQLCG  + +  G+    +        AL  I+      F  G  GA NG+ P+G   
Sbjct: 664 MADQLCGQFYARLLGL--PDVVEEQYTKSALKKIYQACFLKFHNGKYGAANGVLPDGSPV 721

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           + +     EVWTG+ + L+A ML  G   EA+     + +++ E+
Sbjct: 722 NPNDTHPLEVWTGINFGLAAFMLQMGMKAEAFQLTETVVKQVYEN 766


>gi|88809081|ref|ZP_01124590.1| hypothetical protein WH7805_05296 [Synechococcus sp. WH 7805]
 gi|88787023|gb|EAR18181.1| hypothetical protein WH7805_05296 [Synechococcus sp. WH 7805]
          Length = 849

 Score = 48.5 bits (114), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 48/93 (51%), Gaps = 5/93 (5%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
           MADQLCG  + +  G+  E +   AN    L  +  +    F+ GS+G  NG++ +G   
Sbjct: 700 MADQLCGDFYARLLGL--EPVVSDANSRSTLKAVKESCFEKFQGGSLGVANGLRRDGTPL 757

Query: 60  DRSAIQSEEVWTGVTYLLSA--AMLYEGNVDEA 90
           D +     EVWTG+ + +++   ++ EG   EA
Sbjct: 758 DPNGTHPLEVWTGINFGIASYYQLMGEGQTAEA 790


>gi|320105716|ref|YP_004181306.1| glucosylceramidase [Terriglobus saanensis SP1PR4]
 gi|319924237|gb|ADV81312.1| Glucosylceramidase [Terriglobus saanensis SP1PR4]
          Length = 794

 Score = 48.1 bits (113), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 52/105 (49%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
            ADQL G  +   +G+    + P A     L  I++ NV  F  G++GAVNGM  +G   
Sbjct: 656 QADQLAGQWYAHMTGLGD--LVPRAMQRSVLKKIYAFNVNQFGDGNLGAVNGMNADGTV- 712

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDR 105
              +++ EVW G T   +A +   G  +E++    GL+  + E +
Sbjct: 713 VDNVEAREVWAGTTLGYAALLKSLGIEEESYHVVHGLFHVIYESK 757


>gi|317969095|ref|ZP_07970485.1| hypothetical protein SCB02_06140 [Synechococcus sp. CB0205]
          Length = 837

 Score = 48.1 bits (113), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 45/91 (49%), Gaps = 3/91 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
           MADQLCG  + +  G+  E +   AN    L  +  +    FE GS+G  NG++ +G   
Sbjct: 689 MADQLCGDFYARLLGL--EPVVSEANSRSTLNAVKDSCFEKFEGGSLGVANGLRRDGTPL 746

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
           D +     EVWTG+ + +++     G  + A
Sbjct: 747 DPNGTHPLEVWTGINFGIASYFRLMGETETA 777


>gi|116072378|ref|ZP_01469645.1| hypothetical protein BL107_11341 [Synechococcus sp. BL107]
 gi|116064900|gb|EAU70659.1| hypothetical protein BL107_11341 [Synechococcus sp. BL107]
          Length = 820

 Score = 48.1 bits (113), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 29/102 (28%), Positives = 49/102 (48%), Gaps = 3/102 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
           MADQLCG  + +  G+   ++   AN    L  +       FEAG++G  NG++ +G   
Sbjct: 672 MADQLCGDFYARLLGL--PSVVSDANALSTLNVVREACFERFEAGTLGVANGLRRDGTPL 729

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRL 101
           D +     EVWTG+ + +++     G  D A    G +  ++
Sbjct: 730 DPNGTHPLEVWTGINFGIASYFRLMGQSDTALAITGAVVNQV 771


>gi|148240031|ref|YP_001225418.1| hypothetical protein SynWH7803_1695 [Synechococcus sp. WH 7803]
 gi|147848570|emb|CAK24121.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
          Length = 833

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 48/93 (51%), Gaps = 5/93 (5%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
           MADQLCG  + +  G+  E +   AN    L  +  +    F+ GS+G  NG++ +G   
Sbjct: 685 MADQLCGDFYARLLGL--EPVVSEANSRSTLKAVKESCFEKFQGGSLGVANGLRRDGTPL 742

Query: 60  DRSAIQSEEVWTGVTYLLSA--AMLYEGNVDEA 90
           D +     EVWTG+ + +++   ++ EG   EA
Sbjct: 743 DPNGTHPLEVWTGINFGIASYYRLMGEGQTAEA 775


>gi|318042148|ref|ZP_07974104.1| hypothetical protein SCB01_10580 [Synechococcus sp. CB0101]
          Length = 836

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 44/91 (48%), Gaps = 3/91 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
           MADQLCG  + +  G+   A+   AN    L  +       FE G +G  NG++ +G   
Sbjct: 688 MADQLCGDFYARLLGL--PAVVSEANSRSTLKAVKEACFEQFEGGKLGVANGLRRDGTPL 745

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
           D +     EVWTG+ + +++     G+ D A
Sbjct: 746 DPNGTHPLEVWTGINFGIASYYRLMGDTDTA 776


>gi|428773649|ref|YP_007165437.1| glucosylceramidase [Cyanobacterium stanieri PCC 7202]
 gi|428687928|gb|AFZ47788.1| Glucosylceramidase [Cyanobacterium stanieri PCC 7202]
          Length = 800

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 28/95 (29%), Positives = 45/95 (47%), Gaps = 3/95 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           MADQLCG  +  A  +    +     +   L  I+      F  G++G  NG+KP+G  +
Sbjct: 663 MADQLCGQYY--AQLLKLPDVVEKEYVTATLNKIYDACYLKFHNGTLGIANGVKPDGSPE 720

Query: 61  R-SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTA 94
           + +     E+WTG+ Y + A M+  G  +E    A
Sbjct: 721 KENDTHPLEIWTGINYGIVAFMILNGMKEEGLKVA 755


>gi|290990718|ref|XP_002677983.1| predicted protein [Naegleria gruberi]
 gi|284091593|gb|EFC45239.1| predicted protein [Naegleria gruberi]
          Length = 937

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 21/71 (29%), Positives = 41/71 (57%), Gaps = 2/71 (2%)

Query: 22  FPSANIARALATIFSTNVRGFEAGSM--GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSA 79
           F    ++++L  I   N   ++  +   G VNGM+P  + D++++QS E+WTG +Y+++A
Sbjct: 812 FNMERVSKSLNKILENNFVKYQQITKLGGCVNGMRPTQEVDKTSLQSREMWTGTSYVVAA 871

Query: 80  AMLYEGNVDEA 90
             +  G  +E 
Sbjct: 872 LTILLGRREEG 882


>gi|254431466|ref|ZP_05045169.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
 gi|197625919|gb|EDY38478.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
          Length = 837

 Score = 47.0 bits (110), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 45/102 (44%), Gaps = 3/102 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
           MADQLCG  + +   +    +        AL  I      GF+ GS+G  NG++ +G   
Sbjct: 686 MADQLCGDFYARL--LELPPVVAEERARSALRAIREACFEGFQGGSLGVANGLRRDGTPL 743

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRL 101
           D       EVWTG+ + L+A     G  D A    G +  ++
Sbjct: 744 DPDGTHPLEVWTGINFGLAAYYRLMGETDTALAITGAVVHQV 785


>gi|86144926|ref|ZP_01063258.1| hypothetical protein MED222_10953 [Vibrio sp. MED222]
 gi|85837825|gb|EAQ55937.1| hypothetical protein MED222_10953 [Vibrio sp. MED222]
          Length = 1024

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 38/116 (32%), Positives = 54/116 (46%), Gaps = 9/116 (7%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMKPNGDR 59
            D L    +LK  G+  E IFP  NI RAL  ++  N   FE  S  +G  N  + +G  
Sbjct: 820 GDALLADSYLKLIGL--EGIFPQENIQRALDYVYKHN---FEINSPKLGVANMTQADGS- 873

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
              A Q+++VW GV + ++ A+   G   +A T    +Y  L  D       APEG
Sbjct: 874 PHEAFQAQDVWIGVQFSVATALSLAGKSQQAETLMDTVYTALY-DYSKIPFAAPEG 928


>gi|78185056|ref|YP_377491.1| hypothetical protein Syncc9902_1489 [Synechococcus sp. CC9902]
 gi|78169350|gb|ABB26447.1| conserved hypothetical protein [Synechococcus sp. CC9902]
          Length = 823

 Score = 46.6 bits (109), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/102 (27%), Positives = 48/102 (47%), Gaps = 3/102 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
           MADQLCG  + +  G+   ++   AN    L  +       FE G++G  NG++ +G   
Sbjct: 675 MADQLCGDFYARLLGL--PSVVSDANALSTLNVVRDACFERFEGGTLGVANGLRRDGTPL 732

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRL 101
           D +     EVWTG+ + +++     G  D A    G +  ++
Sbjct: 733 DPNGTHPLEVWTGINFGIASYFRLMGQSDTALAITGAVVNQV 774


>gi|148242848|ref|YP_001228005.1| hypothetical protein SynRCC307_1749 [Synechococcus sp. RCC307]
 gi|147851158|emb|CAK28652.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
          Length = 220

 Score = 46.2 bits (108), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 43/91 (47%), Gaps = 3/91 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
           MADQLCG  + +  G+    +   AN   +L  I      GF  G +G  NG++ +G   
Sbjct: 71  MADQLCGDFYARLLGL--PPVVSEANARSSLQAIKEACFEGFAGGRLGVANGLRRDGTPL 128

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
           D +     EVWTG+ + L+A     G  + A
Sbjct: 129 DPNGTHPLEVWTGINFGLAAYYRLMGETNTA 159


>gi|428307493|ref|YP_007144318.1| glucosylceramidase [Crinalium epipsammum PCC 9333]
 gi|428249028|gb|AFZ14808.1| Glucosylceramidase [Crinalium epipsammum PCC 9333]
          Length = 811

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 33/103 (32%), Positives = 49/103 (47%), Gaps = 14/103 (13%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIF-----------STNVRGFEAGSMGA 49
           MADQLCG  +  A  +N   I P+  +  AL T++           S +        +GA
Sbjct: 663 MADQLCGQFY--ARLLNLPDIVPNDCVESALKTVYDSCFLKFNQKLSQHSESLTPTFIGA 720

Query: 50  VNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 91
            NG+  +G   +  A    EVWTG+ + L+A M+  G  DEA+
Sbjct: 721 ANGVLADGSPENPQATHPLEVWTGINFGLAAFMIQMGMKDEAF 763


>gi|284929778|ref|YP_003422300.1| putative bile acid beta-glucosidase [cyanobacterium UCYN-A]
 gi|284810222|gb|ADB95919.1| predicted bile acid beta-glucosidase [cyanobacterium UCYN-A]
          Length = 803

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 26/91 (28%), Positives = 47/91 (51%), Gaps = 3/91 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
           M DQLCG  +  A  +N   +  ++    AL+ I+      F  G  GA NG++ +G   
Sbjct: 664 MTDQLCGQFY--ARLLNLPDVVDTSCTISALSKIYEACFLNFYDGQYGAANGVRSDGTPE 721

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
           + ++   +EVW G+ + L+A ++  G  ++A
Sbjct: 722 NPNSTHPQEVWVGINFGLAAFLIQMGMKNQA 752


>gi|87125468|ref|ZP_01081313.1| hypothetical protein RS9917_01806 [Synechococcus sp. RS9917]
 gi|86166768|gb|EAQ68030.1| hypothetical protein RS9917_01806 [Synechococcus sp. RS9917]
          Length = 832

 Score = 45.8 bits (107), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 37/76 (48%), Gaps = 3/76 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
           MADQLCG  + +   +   A+   AN    L  +  +    F+ GS+G  NG++ +G   
Sbjct: 684 MADQLCGDFYARL--LKLPAVVSDANCRSTLKAVKESCFEAFQGGSLGVANGLRRDGTPL 741

Query: 60  DRSAIQSEEVWTGVTY 75
           D       EVWTG+ +
Sbjct: 742 DPQGTHPLEVWTGINF 757


>gi|116073260|ref|ZP_01470522.1| hypothetical protein RS9916_32457 [Synechococcus sp. RS9916]
 gi|116068565|gb|EAU74317.1| hypothetical protein RS9916_32457 [Synechococcus sp. RS9916]
          Length = 832

 Score = 45.8 bits (107), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 46/93 (49%), Gaps = 5/93 (5%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
           MADQLCG  + +  G+  E +   AN    L  +       F+ G +G  NG++ +G   
Sbjct: 684 MADQLCGDFYARLLGL--EPVVSEANSRSTLKAVREACFEKFDGGKLGVANGLRRDGTPL 741

Query: 60  DRSAIQSEEVWTGVTYLLSA--AMLYEGNVDEA 90
           D +     EVWTG+ + +++   ++ EG   EA
Sbjct: 742 DPNGTHPLEVWTGINFGIASYYRLMGEGQTAEA 774


>gi|427702545|ref|YP_007045767.1| bile acid beta-glucosidase [Cyanobium gracile PCC 6307]
 gi|427345713|gb|AFY28426.1| putative bile acid beta-glucosidase [Cyanobium gracile PCC 6307]
          Length = 835

 Score = 45.4 bits (106), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 32/107 (29%), Positives = 50/107 (46%), Gaps = 9/107 (8%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTN---VRGFEAGSMGAVNGMKPNG 57
           MADQLCG  + +       A+ P     RAL+++ +        FE G +G  NG++ +G
Sbjct: 686 MADQLCGDFYAR-----LLALPPVVADERALSSLKAIRQACFESFEGGRLGVANGLRRDG 740

Query: 58  DR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
              D       EVWTG+ + L+A     G    A+   G + R++ E
Sbjct: 741 TPLDPEGTHPLEVWTGINFGLAAYYRLMGQSSTAFAITGAVVRQVYE 787


>gi|81299663|ref|YP_399871.1| hypothetical protein Synpcc7942_0854 [Synechococcus elongatus PCC
           7942]
 gi|81168544|gb|ABB56884.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 798

 Score = 45.4 bits (106), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/105 (24%), Positives = 47/105 (44%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
           MADQLCG  + +   +    + P          ++    + F +G  G  NG+ P+G   
Sbjct: 651 MADQLCGDFYSRL--LQLPPVAPLEAAQSTADMVYEACFQKFHSGQFGLANGLLPDGSPV 708

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
           D       EVWTG+ + ++A  L  G+ D  +     + R++ ++
Sbjct: 709 DPKGTHPLEVWTGINFGIAAYWLLLGHRDRCFEVTETVIRQIYDN 753


>gi|84386772|ref|ZP_00989797.1| hypothetical protein V12B01_19101 [Vibrio splendidus 12B01]
 gi|84378300|gb|EAP95158.1| hypothetical protein V12B01_19101 [Vibrio splendidus 12B01]
          Length = 908

 Score = 44.7 bits (104), Expect = 0.008,   Method: Composition-based stats.
 Identities = 38/116 (32%), Positives = 53/116 (45%), Gaps = 9/116 (7%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMKPNGDR 59
            D L    +LK  G+  E IFP  NI RAL  ++  N   FE  S  +G  N    +G  
Sbjct: 704 GDALLADSYLKLIGL--EGIFPQENIQRALDYVYKHN---FEINSPKLGVANMTLADGS- 757

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
              A Q+++VW GV + ++ A+   G   +A T    +Y  L  D       APEG
Sbjct: 758 PHEAFQAQDVWIGVQFSVATALSLAGKNQQAETLMDTVYTALY-DYSKIPFAAPEG 812


>gi|87301122|ref|ZP_01083963.1| hypothetical protein WH5701_14591 [Synechococcus sp. WH 5701]
 gi|87284090|gb|EAQ76043.1| hypothetical protein WH5701_14591 [Synechococcus sp. WH 5701]
          Length = 830

 Score = 44.3 bits (103), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 40/91 (43%), Gaps = 3/91 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
           MADQLCG  + +  G+    +        +L  I       FE G +G  NG++ +G   
Sbjct: 681 MADQLCGDFYARLLGLPQ--VVSDERALSSLKAIREACFERFEGGRLGVANGLRRDGTPL 738

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
           D       EVWTG+ + L+A     G  D A
Sbjct: 739 DPKGTHPLEVWTGINFGLAAYYRLMGQTDTA 769


>gi|148976285|ref|ZP_01813009.1| hypothetical protein VSWAT3_18783 [Vibrionales bacterium SWAT-3]
 gi|145964379|gb|EDK29634.1| hypothetical protein VSWAT3_18783 [Vibrionales bacterium SWAT-3]
          Length = 982

 Score = 43.9 bits (102), Expect = 0.012,   Method: Composition-based stats.
 Identities = 36/116 (31%), Positives = 54/116 (46%), Gaps = 9/116 (7%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMKPNGDR 59
           +D L    +L+  G+  E IFP   I RAL  ++  N   FE  S  +G  N  + +G  
Sbjct: 778 SDALLADSYLRLIGL--EGIFPQQRIQRALDYVYKHN---FEINSPKLGVANMTRADGS- 831

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
              A Q+++VW GV + ++ A+   G   +A T    +Y  L  D       APEG
Sbjct: 832 PHEAFQAQDVWIGVQFSVATALSLAGKSQQAETLMDTVYTALY-DYSKIPFAAPEG 886


>gi|78212458|ref|YP_381237.1| hypothetical protein Syncc9605_0920 [Synechococcus sp. CC9605]
 gi|78196917|gb|ABB34682.1| conserved hypothetical protein [Synechococcus sp. CC9605]
          Length = 832

 Score = 43.9 bits (102), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 37/76 (48%), Gaps = 3/76 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
           MADQLCG  + +   +    +   AN    L  +       F+ GS+G  NG++ +G   
Sbjct: 684 MADQLCGDFYARLLAL--PPVVSDANSLSTLKAVKEACFEAFDGGSLGVANGLRRDGTPL 741

Query: 60  DRSAIQSEEVWTGVTY 75
           D +A    EVWTG+ +
Sbjct: 742 DPNATHPLEVWTGINF 757


>gi|218677125|ref|YP_002395944.1| hypothetical protein VS_II1384 [Vibrio splendidus LGP32]
 gi|218325393|emb|CAV27494.1| hypothetical protein VS_II1384 [Vibrio splendidus LGP32]
          Length = 1069

 Score = 43.5 bits (101), Expect = 0.015,   Method: Composition-based stats.
 Identities = 37/116 (31%), Positives = 53/116 (45%), Gaps = 9/116 (7%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMKPNGDR 59
            D L    +LK  G+  + IFP  NI RAL  ++  N   FE  S  +G  N    +G  
Sbjct: 865 GDALLADSYLKLIGL--DGIFPQENIQRALDYVYKHN---FEINSPKLGVANMTLADGS- 918

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
              A Q+++VW GV + ++ A+   G   +A T    +Y  L  D       APEG
Sbjct: 919 PHEAFQAQDVWIGVQFSVATALSLAGKNQQAETLMDTVYTALY-DYSKIPFAAPEG 973


>gi|33866127|ref|NP_897686.1| hypothetical protein SYNW1593 [Synechococcus sp. WH 8102]
 gi|33639102|emb|CAE08108.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
          Length = 833

 Score = 43.5 bits (101), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 44/83 (53%), Gaps = 9/83 (10%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTN---VRGFEAGSMGAVNGMKPNG 57
           MADQLCG  + +  G+      P  + ARA +T+ +        FE+G +G  NG++ +G
Sbjct: 685 MADQLCGDFYARLLGLP-----PVVSDARARSTLQAVKEACFERFESGRLGVANGLRRDG 739

Query: 58  DR-DRSAIQSEEVWTGVTYLLSA 79
              D +     EVWTG+ + L++
Sbjct: 740 SPLDPNGTHPLEVWTGINFGLAS 762


>gi|113953483|ref|YP_730032.1| hypothetical protein sync_0818 [Synechococcus sp. CC9311]
 gi|113880834|gb|ABI45792.1| conserved hypothetical protein [Synechococcus sp. CC9311]
          Length = 833

 Score = 43.5 bits (101), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 45/93 (48%), Gaps = 5/93 (5%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
           MADQLCG  + +  G+    +   AN    L  +       F  GS+G  NG++ +G   
Sbjct: 685 MADQLCGDFYARLLGL--PPVVSEANSRSTLKAVKEACFDNFAGGSLGVANGLRRDGTPL 742

Query: 60  DRSAIQSEEVWTGVTYLLSA--AMLYEGNVDEA 90
           D +     EVWTG+ + +++   ++ EG   EA
Sbjct: 743 DPNGTHPLEVWTGINFGIASYYRLMGEGKTAEA 775


>gi|260434763|ref|ZP_05788733.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
 gi|260412637|gb|EEX05933.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
          Length = 832

 Score = 43.5 bits (101), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 37/76 (48%), Gaps = 3/76 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
           MADQLCG  + +  G+    +   +N    L  +       F+ GS+G  NG++ +G   
Sbjct: 684 MADQLCGDFYARLLGL--PPVVSDSNSRSTLKVVKEACFEAFDGGSLGVANGLRRDGTPL 741

Query: 60  DRSAIQSEEVWTGVTY 75
           D +     EVWTG+ +
Sbjct: 742 DPNGTHPLEVWTGINF 757


>gi|302348409|ref|YP_003816047.1| trehalase [Acidilobus saccharovorans 345-15]
 gi|302328821|gb|ADL19016.1| Putative trehalase [Acidilobus saccharovorans 345-15]
          Length = 928

 Score = 43.1 bits (100), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 32/114 (28%), Positives = 54/114 (47%), Gaps = 14/114 (12%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGM------- 53
           ++ QL G  +L+  G+       S  +  AL  ++  N R +E    G   G        
Sbjct: 679 LSAQLTGEWYLQVLGLGDGV--DSEKVRSALREVYRANFRRWEGLLNGTYPGTPRPSMVG 736

Query: 54  ---KPNGDR--DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLR 102
              +PNG +  +R + Q++  WTGV + +++ MLYEG V+EA      ++ R R
Sbjct: 737 DVEEPNGTKILNRVSSQADTPWTGVEFGVASQMLYEGMVEEAMEVLRSIHDRYR 790


>gi|417950561|ref|ZP_12593680.1| hypothetical protein VISP3789_13285 [Vibrio splendidus ATCC 33789]
 gi|342806239|gb|EGU41472.1| hypothetical protein VISP3789_13285 [Vibrio splendidus ATCC 33789]
          Length = 1046

 Score = 42.7 bits (99), Expect = 0.026,   Method: Composition-based stats.
 Identities = 37/116 (31%), Positives = 52/116 (44%), Gaps = 9/116 (7%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMKPNGDR 59
            D L    +LK  G+  E IFP   I RAL  ++  N   FE  S  +G  N    +G  
Sbjct: 842 GDALLADSYLKLIGL--EGIFPKQRIQRALDYVYKHN---FEINSPKLGVANMTLADGS- 895

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
              A Q+++VW GV + ++ A+   G   +A T    +Y  L  D       APEG
Sbjct: 896 PHEAFQAQDVWIGVQFSVATALSLAGKSQQAETLMDTVYTALY-DYSKIPFAAPEG 950


>gi|222639789|gb|EEE67921.1| hypothetical protein OsJ_25789 [Oryza sativa Japonica Group]
          Length = 919

 Score = 42.7 bits (99), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 33/97 (34%), Positives = 47/97 (48%), Gaps = 24/97 (24%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
           ADQL G  + +A G+  E I       RALAT+   NV             M+       
Sbjct: 743 ADQLAGQWYARACGL--EPIVDGDKARRALATVLDYNV-------------MR------- 780

Query: 62  SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
               S+EV  GVTY ++AAM++EG  + A+ TA G++
Sbjct: 781 --THSKEVCPGVTYAVAAAMIHEGMPEAAFKTAKGIH 815


>gi|56750695|ref|YP_171396.1| hypothetical protein syc0686_c [Synechococcus elongatus PCC 6301]
 gi|56685654|dbj|BAD78876.1| hypothetical protein [Synechococcus elongatus PCC 6301]
          Length = 798

 Score = 42.4 bits (98), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 25/105 (23%), Positives = 46/105 (43%), Gaps = 3/105 (2%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
           MADQLCG  + +   +    + P          ++    + F +G  G  NG+ P+G   
Sbjct: 651 MADQLCGDFYSRL--LQLPPVAPLEAAQSTADMVYEACFQKFHSGQFGLANGLLPDGSPV 708

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
                   EVWTG+ + ++A  L  G+ D  +     + R++ ++
Sbjct: 709 GPKGTHPLEVWTGINFGIAAYWLLLGHRDRCFEVTETVIRQIYDN 753


>gi|407069309|ref|ZP_11100147.1| hypothetical protein VcycZ_07137 [Vibrio cyclitrophicus ZF14]
          Length = 1024

 Score = 42.0 bits (97), Expect = 0.045,   Method: Composition-based stats.
 Identities = 37/116 (31%), Positives = 52/116 (44%), Gaps = 9/116 (7%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMKPNGDR 59
            D L    +LK  G+  E IFP   I RAL  ++  N   FE  S  +G  N    +G  
Sbjct: 820 GDALLADSYLKLIGL--EGIFPEDRIQRALDYVYKHN---FEINSPKLGVANMTLADGS- 873

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
              A Q+++VW GV + ++ A+   G   +A T    +Y  L  D       APEG
Sbjct: 874 PHEAFQAQDVWIGVQFSVATALNLAGKSQQAETLMDTVYTALY-DYSKIPFAAPEG 928


>gi|261417086|ref|YP_003250769.1| glucosylceramidase [Fibrobacter succinogenes subsp. succinogenes
           S85]
 gi|385791890|ref|YP_005823013.1| hypothetical protein FSU_3272 [Fibrobacter succinogenes subsp.
           succinogenes S85]
 gi|261373542|gb|ACX76287.1| Glucosylceramidase [Fibrobacter succinogenes subsp. succinogenes
           S85]
 gi|302325406|gb|ADL24607.1| conserved domain protein [Fibrobacter succinogenes subsp.
           succinogenes S85]
          Length = 1047

 Score = 42.0 bits (97), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 56/117 (47%), Gaps = 7/117 (5%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
            AD +    +L+   ++ E I  SA     L  IF+TN +   +  +GA N ++ +G   
Sbjct: 837 FADTMLADTYLRL--LDLEPITDSAKAKSNLLRIFNTNYKA-NSPLIGAANLVRKDGSPL 893

Query: 60  DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED-RVSGCETAPEG 115
           D    Q+ +VW G+ Y +  AM++ G   EA   A  +   L ++ RV     APEG
Sbjct: 894 DEFNFQAHDVWIGIQYSIMTAMMFHGLEKEASVLADSMIGNLYDEARVPFA--APEG 948


>gi|33862646|ref|NP_894206.1| hypothetical protein PMT0373 [Prochlorococcus marinus str. MIT
           9313]
 gi|33634562|emb|CAE20548.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9313]
          Length = 837

 Score = 41.6 bits (96), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 38/80 (47%), Gaps = 3/80 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
           MADQLCG  + +  G+   ++        +L  +      GFE G +G  NG+  +G   
Sbjct: 685 MADQLCGDFYARLLGL--PSVVADERSRSSLNAVKEACFEGFEGGRLGVANGLCRDGMPL 742

Query: 60  DRSAIQSEEVWTGVTYLLSA 79
           D       EVWTG+ + L+A
Sbjct: 743 DPKGTHPLEVWTGINFGLAA 762


>gi|124023625|ref|YP_001017932.1| bile acid beta-glucosidase [Prochlorococcus marinus str. MIT 9303]
 gi|123963911|gb|ABM78667.1| Predicted bile acid beta-glucosidase [Prochlorococcus marinus str.
           MIT 9303]
          Length = 837

 Score = 41.6 bits (96), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 38/80 (47%), Gaps = 3/80 (3%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
           MADQLCG  + +  G+   ++        +L  +      GFE G +G  NG+  +G   
Sbjct: 685 MADQLCGDFYARLLGL--PSVVADERSRSSLNAVKEACFEGFEGGRLGVANGLCRDGMPL 742

Query: 60  DRSAIQSEEVWTGVTYLLSA 79
           D       EVWTG+ + L+A
Sbjct: 743 DPKGTHPLEVWTGINFGLAA 762


>gi|374815490|ref|ZP_09719227.1| hypothetical protein TpriZ_16630 [Treponema primitia ZAS-1]
          Length = 814

 Score = 39.7 bits (91), Expect = 0.22,   Method: Composition-based stats.
 Identities = 34/126 (26%), Positives = 54/126 (42%), Gaps = 20/126 (15%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNV-----------RGFEAGS-MG 48
           +ADQL G  +   +G+    +  + +I +A  +IF  N            RG+      G
Sbjct: 608 LADQLIGQFYAHLTGLG--PLVDAGHIKQAAQSIFKYNFISDFTDHPNMQRGYAVNDDKG 665

Query: 49  AVNGMKPNGDRDRSAI-QSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVS 107
            +    P G R R     SEE W+   Y +++ ++YEG +DE  T A  +      DR  
Sbjct: 666 LLMTTWPLGGRPRYPFFYSEEAWSRTEYHVASTLIYEGFLDEGLTIAKAV-----RDRYD 720

Query: 108 GCETAP 113
           G +  P
Sbjct: 721 GIKRNP 726


>gi|15922864|ref|NP_378533.1| hypothetical protein ST2529 [Sulfolobus tokodaii str. 7]
 gi|15623655|dbj|BAB67642.1| putative glycosidase [Sulfolobus tokodaii str. 7]
          Length = 836

 Score = 39.3 bits (90), Expect = 0.32,   Method: Composition-based stats.
 Identities = 28/111 (25%), Positives = 48/111 (43%), Gaps = 19/111 (17%)

Query: 2   ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-- 59
           A QL G  +L   G    ++       + L +I   N++  E    G +NG  P+G R  
Sbjct: 616 ASQLLGEFYLNLLGY---SLLDREKTRKTLLSIVKYNLKEEE----GVINGAYPDGYRPL 668

Query: 60  ----------DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRR 100
                       ++I  +  W+GV + L++ ++YE  +DEA      +Y R
Sbjct: 669 MREYENPLKIKEASIHQDTPWSGVEFYLASHLIYEKMIDEAKKVLKEVYDR 719


>gi|15922862|ref|NP_378531.1| hypothetical protein ST2526 [Sulfolobus tokodaii str. 7]
 gi|15623653|dbj|BAB67640.1| putative glycosidase [Sulfolobus tokodaii str. 7]
          Length = 646

 Score = 38.9 bits (89), Expect = 0.35,   Method: Composition-based stats.
 Identities = 30/115 (26%), Positives = 48/115 (41%), Gaps = 12/115 (10%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           M+ Q+ G  +    G+  E I     I +AL +I+  N +   A     VN + P+G  D
Sbjct: 508 MSSQILGQFWCDILGL--EPIIDEDKIVQALRSIYELNGK---ASKFCLVNSVNPDGSID 562

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEA-------WTTAGGLYRRLREDRVSG 108
               Q    W+ V + +SA M+  G  +E        W T   L +  +  R+ G
Sbjct: 563 TETDQMRSCWSRVAFAVSAHMIIRGLKNEGIEIANREWETIKSLGKWNQSSRIDG 617


>gi|229820204|ref|YP_002881730.1| hypothetical protein Bcav_1712 [Beutenbergia cavernae DSM 12333]
 gi|229566117|gb|ACQ79968.1| protein of unknown function DUF608 [Beutenbergia cavernae DSM
           12333]
          Length = 865

 Score = 38.9 bits (89), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 53/113 (46%), Gaps = 15/113 (13%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVR-GFEA-----------GSMG 48
           ++DQL G  +  A  ++   + P+A++  AL  +   N+R GF+              +G
Sbjct: 652 LSDQLIGQWW--AHLLDLGHVLPAAHVRSALRAVVGHNLRRGFDGWEHTQRVYADGDDVG 709

Query: 49  AVNGMKPNGDRDRSAIQ-SEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRR 100
            +    P+G R   A +  +EVW+GV Y ++A  L EG   E      GL+ R
Sbjct: 710 LLMCTWPHGGRPEVATRYCDEVWSGVEYEVAAHCLLEGLTAEGRAILDGLWAR 762


>gi|254423918|ref|ZP_05037636.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
 gi|196191407|gb|EDX86371.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
          Length = 828

 Score = 38.9 bits (89), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 55/128 (42%), Gaps = 25/128 (19%)

Query: 1   MADQLCGYLFLKASG----VNTEAIFPSAN-IARALATIFSTNV-------RGFEAGS-- 46
           MADQLCG       G    V++E + P+   I  A    F+          + FE     
Sbjct: 663 MADQLCGQFCASTMGLPDVVDSEFVEPTLRAIYEACFVRFNQYTAQLGPQNQKFEGAQTG 722

Query: 47  ----------MGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 95
                     +G  NG++P+G  ++       EVWTG+ + L+A + +EG + EA     
Sbjct: 723 YFSASELGVKVGCANGVRPDGSPQNPDDTHQLEVWTGINFGLAAFLAHEGKLHEAMEITE 782

Query: 96  GLYRRLRE 103
            + R++ E
Sbjct: 783 AVVRQVYE 790


>gi|327310049|ref|YP_004336946.1| hypothetical protein TUZN_0133 [Thermoproteus uzoniensis 768-20]
 gi|326946528|gb|AEA11634.1| hypothetical protein TUZN_0133 [Thermoproteus uzoniensis 768-20]
          Length = 660

 Score = 38.5 bits (88), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 42/103 (40%), Gaps = 5/103 (4%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           M+ QL G  +     ++   I P   +A AL +I+     G  A      N + P+G  D
Sbjct: 515 MSSQLLGQFWCDM--LDLPPIAPDDKVAAALRSIYEL---GMRASKYCIPNSVTPDGKPD 569

Query: 61  RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
               Q    WT V + ++A ML  G   E    A   +R + E
Sbjct: 570 EETPQLRSCWTRVNFAVAAHMLLRGMEREGMEVASREWRTIAE 612


>gi|160931643|ref|ZP_02079037.1| hypothetical protein CLOLEP_00474 [Clostridium leptum DSM 753]
 gi|156869288|gb|EDO62660.1| hypothetical protein CLOLEP_00474 [Clostridium leptum DSM 753]
          Length = 804

 Score = 38.1 bits (87), Expect = 0.60,   Method: Composition-based stats.
 Identities = 27/107 (25%), Positives = 49/107 (45%), Gaps = 23/107 (21%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMK------ 54
           ++DQL G       G+    +FP  ++  A+  I+  N +      MG+   ++      
Sbjct: 606 LSDQLLGQTLAHLYGLG--HLFPEDHVKSAVFAIYKYNFKE----RMGSHKSLQRGYAYQ 659

Query: 55  ----------PNGDRDRSA-IQSEEVWTGVTYLLSAAMLYEGNVDEA 90
                     P+G R +   + S+EVWTG+ Y ++A ++YEG + E 
Sbjct: 660 DEPGLLLCSWPSGGRPKQPFVYSDEVWTGIEYQVAAGLIYEGFLQEG 706


>gi|160932020|ref|ZP_02079412.1| hypothetical protein CLOLEP_00853 [Clostridium leptum DSM 753]
 gi|156869062|gb|EDO62434.1| hypothetical protein CLOLEP_00853 [Clostridium leptum DSM 753]
          Length = 822

 Score = 37.4 bits (85), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 32/117 (27%), Positives = 53/117 (45%), Gaps = 17/117 (14%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNV--RGFEAGS----------MG 48
           ++DQL G    + +G+    + P  ++ +A  +IF  N   R  + G           MG
Sbjct: 619 LSDQLLGQFLAQEAGLGY--VLPKNHVKKAAESIFRYNFIERASDFGHVQRAYIFNDEMG 676

Query: 49  AVNGMKPNGDRDR-SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
                 P G+R R   I   EVWTG+ Y ++A ++ E  ++E  T    +  R R+D
Sbjct: 677 LTPCTWPLGERPRFPFIYFGEVWTGIEYEVAALLIREDLLEEGLTIVKAV--RDRQD 731


>gi|160934323|ref|ZP_02081710.1| hypothetical protein CLOLEP_03194 [Clostridium leptum DSM 753]
 gi|156866996|gb|EDO60368.1| hypothetical protein CLOLEP_03194 [Clostridium leptum DSM 753]
          Length = 806

 Score = 37.4 bits (85), Expect = 1.1,   Method: Composition-based stats.
 Identities = 27/113 (23%), Positives = 53/113 (46%), Gaps = 15/113 (13%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMG-----AVNGMK- 54
           +ADQ+ G +    + +    + P  +I +A+ ++F  N +      +      A+N  K 
Sbjct: 599 LADQVFGQMLAHLNHLGY--VLPEEHIKKAIHSVFQYNFKETMEDHVNVQRTYALNQEKG 656

Query: 55  ------PNGDRDR-SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRR 100
                 P G R +   + S+EVW+G+ Y ++A +++EG V+E       +  R
Sbjct: 657 LVLCSWPQGGRPKLPFVYSDEVWSGIEYQVAAHLIFEGFVEEGLAVVKAVRER 709


>gi|388492496|gb|AFK34314.1| unknown [Medicago truncatula]
 gi|388521539|gb|AFK48831.1| unknown [Medicago truncatula]
          Length = 183

 Score = 37.4 bits (85), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/64 (32%), Positives = 34/64 (53%), Gaps = 5/64 (7%)

Query: 53  MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETA 112
           M+P+G R  +       W  +T+L++A+ L  G+V  A+ T    YR L  +R   C+T 
Sbjct: 81  MRPSGSRSWAICLFITTW--LTFLIAASCLLAGSVKNAYHTK---YRDLLGERAPSCQTL 135

Query: 113 PEGV 116
            +GV
Sbjct: 136 RKGV 139


>gi|217071402|gb|ACJ84061.1| unknown [Medicago truncatula]
          Length = 183

 Score = 37.4 bits (85), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/64 (32%), Positives = 34/64 (53%), Gaps = 5/64 (7%)

Query: 53  MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETA 112
           M+P+G R  +       W  +T+L++A+ L  G+V  A+ T    YR L  +R   C+T 
Sbjct: 81  MRPSGSRSWAICLFITTW--LTFLIAASCLLAGSVKNAYHTK---YRDLLGERAPSCQTL 135

Query: 113 PEGV 116
            +GV
Sbjct: 136 RKGV 139


>gi|411117353|ref|ZP_11389840.1| putative bile acid beta-glucosidase [Oscillatoriales cyanobacterium
           JSC-12]
 gi|410713456|gb|EKQ70957.1| putative bile acid beta-glucosidase [Oscillatoriales cyanobacterium
           JSC-12]
          Length = 907

 Score = 37.4 bits (85), Expect = 1.3,   Method: Composition-based stats.
 Identities = 18/45 (40%), Positives = 29/45 (64%), Gaps = 1/45 (2%)

Query: 47  MGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
           +GA NG+KP+G   + ++    EVWTG+ + L+A ++  G  DEA
Sbjct: 810 IGAANGVKPDGSPENPNSTHPLEVWTGINFGLAAFLVQMGMRDEA 854


>gi|358347158|ref|XP_003637628.1| hypothetical protein MTR_093s0004 [Medicago truncatula]
 gi|355503563|gb|AES84766.1| hypothetical protein MTR_093s0004 [Medicago truncatula]
          Length = 115

 Score = 37.4 bits (85), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/64 (32%), Positives = 34/64 (53%), Gaps = 5/64 (7%)

Query: 53  MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETA 112
           M+P+G R  +       W  +T+L++A+ L  G+V  A+ T    YR L  +R   C+T 
Sbjct: 13  MRPSGSRSWAICLFITTW--LTFLIAASCLLAGSVKNAYHTK---YRDLLGERAPSCQTL 67

Query: 113 PEGV 116
            +GV
Sbjct: 68  RKGV 71


>gi|363807072|ref|NP_001242586.1| uncharacterized protein LOC100817583 precursor [Glycine max]
 gi|255640558|gb|ACU20564.1| unknown [Glycine max]
          Length = 183

 Score = 37.4 bits (85), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 22/64 (34%), Positives = 34/64 (53%), Gaps = 5/64 (7%)

Query: 53  MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETA 112
           M+P+G R  S       W  VT++++A+ L  G+V  A+ T    YR L  +R   C+T 
Sbjct: 81  MRPSGSRSWSICLFITSW--VTFIIAASCLLAGSVRNAYHTK---YRDLMGERAPSCQTL 135

Query: 113 PEGV 116
            +GV
Sbjct: 136 RKGV 139


>gi|255073149|ref|XP_002500249.1| hypothetical protein MICPUN_55788 [Micromonas sp. RCC299]
 gi|226515511|gb|ACO61507.1| hypothetical protein MICPUN_55788 [Micromonas sp. RCC299]
          Length = 954

 Score = 36.6 bits (83), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 18/42 (42%), Positives = 25/42 (59%)

Query: 56  NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 97
           +G  D  +  S EVW GV+Y +++ ML  G  DEAW  A G+
Sbjct: 849 SGSVDAVSAHSNEVWPGVSYAVASHMLLAGMDDEAWELARGV 890


>gi|317152500|ref|YP_004120548.1| UspA domain-containing protein [Desulfovibrio aespoeensis Aspo-2]
 gi|316942751|gb|ADU61802.1| UspA domain-containing protein [Desulfovibrio aespoeensis Aspo-2]
          Length = 300

 Score = 35.0 bits (79), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 14 SGVNTEAIFPSANIARAL-ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTG 72
          SGV+  A+  +A  AR   A ++ T+V G   G  G++  ++P+G+ +R   Q EE+++ 
Sbjct: 12 SGVDKCAVMAAAEFARKFEAKLYLTHVAGMAQG-WGSMEHLEPSGETERIKSQIEEMYSD 70

Query: 73 VTYLLSAAMLYEGNV 87
             LLS    Y+ NV
Sbjct: 71 ---LLSGIAEYQINV 82


>gi|427419169|ref|ZP_18909352.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 7375]
 gi|425761882|gb|EKV02735.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 7375]
          Length = 835

 Score = 34.7 bits (78), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 32/118 (27%), Positives = 50/118 (42%), Gaps = 30/118 (25%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIF------------------STNVRGF 42
           MADQLCG    +   ++   I     I  AL TI+                  +    G 
Sbjct: 672 MADQLCGQFCARL--MDLPDIVDQEFIDIALDTIYDACFVKFNEYAAKHSTPQNQKFEGS 729

Query: 43  EAG---------SMGAVNGMKPNGDRDRSAIQSE-EVWTGVTYLLSAAMLYEGNVDEA 90
           +AG         ++GA NG+KP+G  +      + EVWTG+ + ++A +   G  D+A
Sbjct: 730 QAGFFSAAQLGMAIGAANGVKPDGSPENPDDTHQLEVWTGINFGVAAFLAQMGKRDQA 787


>gi|269839491|ref|YP_003324183.1| hypothetical protein Tter_2468 [Thermobaculum terrenum ATCC
           BAA-798]
 gi|269791221|gb|ACZ43361.1| protein of unknown function DUF608 [Thermobaculum terrenum ATCC
           BAA-798]
          Length = 807

 Score = 34.7 bits (78), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 33/120 (27%), Positives = 53/120 (44%), Gaps = 23/120 (19%)

Query: 1   MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
           +ADQL G L  +A  +    + P  ++ +AL +I+  N   F A     VN  +     D
Sbjct: 589 LADQLLGQL--QAHVLGMGYLVPEEHVRQALRSIWRYN---FRADLSEHVNCQRTFALND 643

Query: 61  RSAI----------------QSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
            + +                 S+EV+TG  Y ++A M+YEG ++E     G +  R R D
Sbjct: 644 EAGLLMCTWPRGGQPTFPFPYSDEVFTGSEYQVAAHMIYEGLLEEGLQIVGAV--RGRHD 701


>gi|148553136|ref|YP_001260718.1| RNAse III [Sphingomonas wittichii RW1]
 gi|148498326|gb|ABQ66580.1| RNAse III [Sphingomonas wittichii RW1]
          Length = 234

 Score = 34.3 bits (77), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 39/93 (41%), Gaps = 5/93 (5%)

Query: 23  PSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAM 81
           P   ++R L  I S         ++G    MK     RD  A+QS+ V   V   L  A+
Sbjct: 77  PEGKLSRRLNAIVSRETCADVGRAIGVATHMKLGKQARDDGAVQSDNVLGDVVEALIGAL 136

Query: 82  LYEGNVDEAWTTAGGLYRRLREDRVSGCETAPE 114
             EG +D     A    RR  +DR +G   AP+
Sbjct: 137 YLEGGLD----AAQAFVRRAWDDRATGQAQAPK 165


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.316    0.132    0.389 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,776,012,469
Number of Sequences: 23463169
Number of extensions: 62581433
Number of successful extensions: 136644
Number of sequences better than 100.0: 361
Number of HSP's better than 100.0 without gapping: 242
Number of HSP's successfully gapped in prelim test: 119
Number of HSP's that attempted gapping in prelim test: 136126
Number of HSP's gapped (non-prelim): 367
length of query: 116
length of database: 8,064,228,071
effective HSP length: 83
effective length of query: 33
effective length of database: 6,116,785,044
effective search space: 201853906452
effective search space used: 201853906452
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 69 (31.2 bits)