BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy10296
(116 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|443709232|gb|ELU03988.1| hypothetical protein CAPTEDRAFT_186839 [Capitella teleta]
Length = 802
Score = 119 bits (297), Expect = 3e-25, Method: Composition-based stats.
Identities = 53/98 (54%), Positives = 68/98 (69%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQ+CG FLKA GV A+FPSAN+ RAL +F NV F+ G MGA+NGM+PN +D
Sbjct: 651 MADQMCGQWFLKACGVADNAVFPSANVKRALEAVFKHNVLPFDGGRMGAINGMRPNAKKD 710
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
++ QS+E WTGVTY L A M+ G +D+ + TA G Y
Sbjct: 711 VTSCQSDEFWTGVTYALGATMIQVGMIDKGFQTAYGAY 748
>gi|443710865|gb|ELU04898.1| hypothetical protein CAPTEDRAFT_216398 [Capitella teleta]
Length = 449
Score = 118 bits (295), Expect = 5e-25, Method: Composition-based stats.
Identities = 53/98 (54%), Positives = 68/98 (69%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQ+CG FLKA GV A+FPSAN+ RAL +F NV F+ G MGA+NGM+PN +D
Sbjct: 298 MADQMCGQWFLKACGVADNAVFPSANVKRALEAVFKHNVLPFDGGRMGAINGMRPNAKKD 357
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
++ QS+E WTGVTY L A M+ G +D+ + TA G Y
Sbjct: 358 VTSCQSDEFWTGVTYALGATMIQVGMIDKGFQTAYGAY 395
>gi|357630375|gb|EHJ78536.1| hypothetical protein KGM_16417 [Danaus plexippus]
Length = 912
Score = 113 bits (283), Expect = 1e-23, Method: Composition-based stats.
Identities = 60/103 (58%), Positives = 73/103 (70%), Gaps = 3/103 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGM--KPNGD 58
MADQL G FL+ASG TE +FP AN+ +AL TI+ NV+ F G MGAVNG P
Sbjct: 751 MADQLAGQWFLRASGW-TEPVFPEANVKKALHTIYENNVQRFLNGRMGAVNGFVRGPRPG 809
Query: 59 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRL 101
D +AIQSEEVWTGVTY L+A M+YEG ++A++TAGGLY L
Sbjct: 810 IDTTAIQSEEVWTGVTYGLAALMIYEGMHEQAFSTAGGLYNTL 852
>gi|328709489|ref|XP_001943760.2| PREDICTED: non-lysosomal glucosylceramidase-like [Acyrthosiphon
pisum]
Length = 776
Score = 112 bits (279), Expect = 4e-23, Method: Composition-based stats.
Identities = 54/98 (55%), Positives = 67/98 (68%), Gaps = 1/98 (1%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQLCG +LK GVN E + P + +AL TI+ NV GF G MGAVNGM P+G+ D
Sbjct: 633 MADQLCGEWYLKCCGVNEE-VLPVDRVKKALHTIYKMNVLGFNDGKMGAVNGMMPDGNPD 691
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
++QSEEVWTGVTY LS+ M+ G DE + TA G+Y
Sbjct: 692 TFSLQSEEVWTGVTYALSSLMIAHGLRDEGFNTAKGIY 729
>gi|339252284|ref|XP_003371365.1| AP-1 complex subunit gamma-1 [Trichinella spiralis]
gi|316968412|gb|EFV52690.1| AP-1 complex subunit gamma-1 [Trichinella spiralis]
Length = 1615
Score = 110 bits (275), Expect = 1e-22, Method: Composition-based stats.
Identities = 51/103 (49%), Positives = 74/103 (71%), Gaps = 1/103 (0%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQLCGY +LK G+ T+ I P N+ + L TI++ NV F G++GAVNGM +G++D
Sbjct: 1468 MADQLCGYAYLKICGLQTD-ILPLDNVKKVLETIYNLNVCSFGNGTLGAVNGMLYSGEKD 1526
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
S++Q++E WTGVTY LSA M+ EG V++ ++TA G+Y+ E
Sbjct: 1527 TSSLQADEAWTGVTYFLSAHMISEGFVEQGFSTASGIYKSCFE 1569
>gi|312382731|gb|EFR28085.1| hypothetical protein AND_04399 [Anopheles darlingi]
Length = 777
Score = 109 bits (273), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/122 (50%), Positives = 78/122 (63%), Gaps = 8/122 (6%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
M+DQLCG+ +L+A G + + +FP N+ A+ TI+ NV F G++GAVNG PN
Sbjct: 626 MSDQLCGHWYLRACGFDYD-VFPKENVRLAMRTIYENNVMRFCGGNLGAVNGYVPNAQPN 684
Query: 57 --GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPE 114
G D S IQ EEVWTGVTY L+A M++EG +EA+ TAGGLYR L E ET PE
Sbjct: 685 KEGRPDVSNIQGEEVWTGVTYALAATMIHEGMFEEAFQTAGGLYRSLSEKIGMNFET-PE 743
Query: 115 GV 116
V
Sbjct: 744 AV 745
>gi|291241331|ref|XP_002740565.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 792
Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats.
Identities = 58/117 (49%), Positives = 77/117 (65%), Gaps = 3/117 (2%)
Query: 1 MADQLCGYLFLKASGVNTE--AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
M DQ G +L+A G+ E +FP +I AL I+ NV GFE G+MGAVNGM+P+G
Sbjct: 645 MCDQAAGQWYLRACGLVQENDEVFPKDHILCALRAIYKMNVMGFEKGTMGAVNGMRPDGS 704
Query: 59 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
D ++ QSEEVWTGVTY L+A M+ EG ++EA+TTA G+Y+ E G +T PE
Sbjct: 705 PDTTSFQSEEVWTGVTYALAANMIQEGLLEEAFTTASGVYKTCYERCGLGFQT-PEA 760
>gi|405977906|gb|EKC42333.1| Non-lysosomal glucosylceramidase [Crassostrea gigas]
Length = 1464
Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats.
Identities = 52/98 (53%), Positives = 68/98 (69%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQL G+ FLKASG+ + IFP + +L IF NV FE G+MGA+NG +P+G +D
Sbjct: 1316 MADQLAGHWFLKASGLQDDDIFPPDRVKSSLKKIFENNVMMFEGGNMGAINGTRPDGSKD 1375
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
S+ QSEE W GVTY L+A M+ EG +D+A+ TA G Y
Sbjct: 1376 ISSCQSEEFWVGVTYGLAANMIQEGLLDQAFQTAWGAY 1413
>gi|196002317|ref|XP_002111026.1| hypothetical protein TRIADDRAFT_54543 [Trichoplax adhaerens]
gi|190586977|gb|EDV27030.1| hypothetical protein TRIADDRAFT_54543 [Trichoplax adhaerens]
Length = 843
Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats.
Identities = 50/101 (49%), Positives = 71/101 (70%), Gaps = 3/101 (2%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
MADQ+CG+ +L+A G+ N + IFP + + +AL +F NV F G+ GAVNGMKPNG
Sbjct: 689 MADQMCGHWYLQACGLVNNNEDDIFPKSQVDKALRKVFDYNVMKFMNGTWGAVNGMKPNG 748
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ D S++QSEEVWTGVTY L+A ++ +G + E + A G+Y
Sbjct: 749 EIDTSSVQSEEVWTGVTYALAANLIQQGMIKEGFKVAEGMY 789
>gi|395515264|ref|XP_003761826.1| PREDICTED: non-lysosomal glucosylceramidase [Sarcophilus harrisii]
Length = 954
Score = 108 bits (270), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 57/106 (53%), Positives = 74/106 (69%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FL+A G+ +TE +FPSA++ RAL TIF NV+GF G+MGAVNGM P+G
Sbjct: 759 MSDQCAGQWFLRACGLGKGDTE-VFPSAHVIRALQTIFELNVQGFAGGAMGAVNGMHPDG 817
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D S++QS+EVW GV Y L+A M+ EG V E + TA G YR + E
Sbjct: 818 VPDTSSVQSDEVWVGVVYGLAATMIQEGLVQEGFLTAEGCYRTVWE 863
>gi|156356077|ref|XP_001623757.1| predicted protein [Nematostella vectensis]
gi|156210486|gb|EDO31657.1| predicted protein [Nematostella vectensis]
Length = 783
Score = 108 bits (269), Expect = 5e-22, Method: Composition-based stats.
Identities = 51/101 (50%), Positives = 73/101 (72%), Gaps = 3/101 (2%)
Query: 1 MADQLCGYLFLKA---SGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
MADQL G +L A + + + +FPS N+ AL T+F+ NV F+ G+MGAVNG++P+G
Sbjct: 632 MADQLSGQWYLHACDLAQTSNDRVFPSENVISALRTVFNFNVMKFQEGTMGAVNGIRPDG 691
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
D S++Q+EEVWTGVTY ++A+M+ EG VDE + TA G+Y
Sbjct: 692 QLDTSSLQAEEVWTGVTYAVAASMIQEGLVDEGFKTASGIY 732
>gi|334333160|ref|XP_003341682.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal
glucosylceramidase-like [Monodelphis domestica]
Length = 950
Score = 108 bits (269), Expect = 5e-22, Method: Composition-based stats.
Identities = 57/118 (48%), Positives = 78/118 (66%), Gaps = 5/118 (4%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FL+A G+ +TE +FP+ ++ RAL TIF NV+GF G+MGAVNGM+P+G
Sbjct: 755 MSDQCAGQWFLRACGLGEGDTE-VFPTPHVIRALQTIFEVNVQGFAGGAMGAVNGMQPDG 813
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
D S++QS+EVW GV Y L+A M+ EG + E + TA G YR + E R+ PE
Sbjct: 814 IPDTSSVQSDEVWVGVVYGLAATMIQEGLIQEGFRTAEGCYRTVWE-RLGMAFQTPEA 870
>gi|156554493|ref|XP_001604835.1| PREDICTED: non-lysosomal glucosylceramidase-like [Nasonia
vitripennis]
Length = 826
Score = 107 bits (267), Expect = 8e-22, Method: Composition-based stats.
Identities = 54/116 (46%), Positives = 75/116 (64%), Gaps = 2/116 (1%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
M+DQLCG+ L+A G + E +FP + +L T+F NV ++ G GAVNG P G D
Sbjct: 664 MSDQLCGHWLLRACGFSYE-VFPQDRVRSSLETVFQNNVMKYKNGQQGAVNGFSPTGSID 722
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEGV 116
S IQSEE+WTGV Y L+A +++EG ++EA+ TA G+YR + E G ET PE +
Sbjct: 723 YSCIQSEEMWTGVAYGLAALLIHEGMIEEAFRTAEGVYRTVYEKIGMGFET-PEAL 777
>gi|351707038|gb|EHB09957.1| Non-lysosomal glucosylceramidase [Heterocephalus glaber]
Length = 922
Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats.
Identities = 55/106 (51%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 745 MSDQCAGQWFLKACGLGEGDTE-VFPTTHVVRALQTIFELNVQSFAGGAMGAVNGMQPHG 803
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
DRS++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 804 IPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 849
>gi|354485725|ref|XP_003505033.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Cricetulus griseus]
Length = 898
Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FL+A G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 721 MSDQCAGQWFLRACGLGEGDTE-VFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 779
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
DRS++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 780 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTREGFRTAEGCYRTVWE 825
>gi|157119417|ref|XP_001653371.1| bile acid beta-glucosidase, putative [Aedes aegypti]
gi|108883154|gb|EAT47379.1| AAEL001478-PA, partial [Aedes aegypti]
Length = 898
Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats.
Identities = 57/122 (46%), Positives = 76/122 (62%), Gaps = 8/122 (6%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP----- 55
M+DQLCG+ +L+ G + + IFP N+ AL TI+ NV F G+MGAVNG P
Sbjct: 747 MSDQLCGHWYLRCCGFDYD-IFPKENVRTALKTIYDNNVMRFCGGNMGAVNGYVPSTQPN 805
Query: 56 -NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPE 114
+G D S +Q EEVWTGVTY L++ M++EG EA+ TAGG+Y+ L E ET PE
Sbjct: 806 KDGRADASTVQGEEVWTGVTYALASTMIHEGMFTEAFQTAGGMYQTLSEKIGMNFET-PE 864
Query: 115 GV 116
+
Sbjct: 865 AL 866
>gi|410925034|ref|XP_003975986.1| PREDICTED: non-lysosomal glucosylceramidase-like [Takifugu
rubripes]
Length = 827
Score = 106 bits (265), Expect = 2e-21, Method: Composition-based stats.
Identities = 56/117 (47%), Positives = 72/117 (61%), Gaps = 3/117 (2%)
Query: 1 MADQLCGYLFLKASGVN--TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
M+DQ G+ FL+ASG+ FP I RAL T+F NV F G MGAVNGM+P G
Sbjct: 678 MSDQCAGHWFLRASGLGDGEYQAFPKEKIQRALKTVFDLNVMSFAGGQMGAVNGMRPEGV 737
Query: 59 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
DRS++QS+EVW GV Y L+A M++EG ++E TA G YR + E R+ PE
Sbjct: 738 PDRSSVQSDEVWIGVVYGLAATMIHEGMLEEGMRTAEGCYRTVWE-RLGMAFQTPEA 793
>gi|198473190|ref|XP_002133205.1| GA28784 [Drosophila pseudoobscura pseudoobscura]
gi|198139343|gb|EDY70607.1| GA28784 [Drosophila pseudoobscura pseudoobscura]
Length = 865
Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats.
Identities = 57/123 (46%), Positives = 75/123 (60%), Gaps = 9/123 (7%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
MADQLCG+ +LK+ G + E I+P N+ AL TI+ NV GF G++GA NG N
Sbjct: 713 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKTIYDNNVMGFHEGNIGAANGFIANAADP 771
Query: 57 ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
G D S IQ+EEVW GV Y L+A M+ EG +EA+ TAGG+Y+ L E R+ P
Sbjct: 772 SKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSE-RIGMNFETP 830
Query: 114 EGV 116
E +
Sbjct: 831 EAL 833
>gi|195164866|ref|XP_002023267.1| GL21266 [Drosophila persimilis]
gi|194105352|gb|EDW27395.1| GL21266 [Drosophila persimilis]
Length = 867
Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats.
Identities = 57/123 (46%), Positives = 75/123 (60%), Gaps = 9/123 (7%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
MADQLCG+ +LK+ G + E I+P N+ AL TI+ NV GF G++GA NG N
Sbjct: 715 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKTIYDNNVMGFHEGNIGAANGFIANAADP 773
Query: 57 ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
G D S IQ+EEVW GV Y L+A M+ EG +EA+ TAGG+Y+ L E R+ P
Sbjct: 774 SKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSE-RIGMNFETP 832
Query: 114 EGV 116
E +
Sbjct: 833 EAL 835
>gi|221041110|dbj|BAH12232.1| unnamed protein product [Homo sapiens]
Length = 933
Score = 105 bits (263), Expect = 3e-21, Method: Composition-based stats.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 756 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 814
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D+S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 815 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 860
>gi|125821356|ref|XP_687652.2| PREDICTED: non-lysosomal glucosylceramidase [Danio rerio]
Length = 851
Score = 105 bits (263), Expect = 3e-21, Method: Composition-based stats.
Identities = 55/117 (47%), Positives = 72/117 (61%), Gaps = 3/117 (2%)
Query: 1 MADQLCGYLFLKASGVNTE--AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
M+DQ G+ FL+ASG+ + FP I AL ++F NV F G MGAVNGM+P G
Sbjct: 698 MSDQCAGHWFLRASGLGDDEYQAFPKEKICSALKSVFDLNVMSFAGGQMGAVNGMRPEGV 757
Query: 59 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
DRS++QS+EVW GV Y L+A M++EG V+E TA G YR + E R+ PE
Sbjct: 758 PDRSSVQSDEVWVGVVYGLAATMIHEGMVEEGLRTAEGCYRAVWE-RMGMAFQTPEA 813
>gi|332831849|ref|XP_001167923.2| PREDICTED: non-lysosomal glucosylceramidase isoform 4 [Pan
troglodytes]
Length = 933
Score = 105 bits (263), Expect = 3e-21, Method: Composition-based stats.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 756 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 814
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D+S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 815 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 860
>gi|444729905|gb|ELW70308.1| Non-lysosomal glucosylceramidase [Tupaia chinensis]
Length = 279
Score = 105 bits (263), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 55/106 (51%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 102 MSDQCAGQWFLKACGLGEGDTE-VFPTPHVVRALQTIFEFNVQAFAGGAMGAVNGMQPHG 160
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
DRS++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 161 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 206
>gi|344251774|gb|EGW07878.1| Non-lysosomal glucosylceramidase [Cricetulus griseus]
Length = 226
Score = 105 bits (262), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FL+A G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 49 MSDQCAGQWFLRACGLGEGDTE-VFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 107
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
DRS++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 108 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTREGFRTAEGCYRTVWE 153
>gi|24308251|ref|NP_065995.1| non-lysosomal glucosylceramidase [Homo sapiens]
gi|143018392|sp|Q9HCG7.2|GBA2_HUMAN RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
AltName: Full=Beta-glucocerebrosidase 2;
Short=Beta-glucosidase 2; AltName:
Full=Glucosylceramidase 2
gi|15030210|gb|AAH11363.1| Glucosidase, beta (bile acid) 2 [Homo sapiens]
gi|16215453|emb|CAC83792.1| bile acid beta-glucosidase [Homo sapiens]
gi|21739906|emb|CAD38976.1| hypothetical protein [Homo sapiens]
gi|119578752|gb|EAW58348.1| glucosidase, beta (bile acid) 2, isoform CRA_d [Homo sapiens]
gi|119578753|gb|EAW58349.1| glucosidase, beta (bile acid) 2, isoform CRA_d [Homo sapiens]
Length = 927
Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 808
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D+S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854
>gi|332228461|ref|XP_003263408.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal glucosylceramidase
[Nomascus leucogenys]
Length = 919
Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 742 MSDQCAGQWFLKACGLGEGDTE-VFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 800
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D+S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 801 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 846
>gi|55633173|ref|XP_520565.1| PREDICTED: non-lysosomal glucosylceramidase isoform 6 [Pan
troglodytes]
gi|410227830|gb|JAA11134.1| talin 1 [Pan troglodytes]
Length = 927
Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 808
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D+S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854
>gi|14042883|dbj|BAB55430.1| unnamed protein product [Homo sapiens]
Length = 927
Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 808
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D+S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854
>gi|426361727|ref|XP_004048051.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Gorilla
gorilla gorilla]
Length = 933
Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 756 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 814
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D+S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 815 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 860
>gi|355689821|gb|AER98956.1| glucosidase, beta 2 [Mustela putorius furo]
Length = 550
Score = 105 bits (262), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 56/106 (52%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKASG+ +TE +FP+ ++ AL TIF NVR F G+MGAVNGM+P+G
Sbjct: 372 MSDQCAGQWFLKASGLGEGDTE-VFPTQHVVCALQTIFEFNVRAFAGGAMGAVNGMQPHG 430
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
DRS++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 431 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 476
>gi|403306664|ref|XP_003943843.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Saimiri
boliviensis boliviensis]
Length = 929
Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 756 MSDQCAGQWFLKACGLGEGDTE-VFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 814
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D+S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 815 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 860
>gi|170036781|ref|XP_001846240.1| bile acid beta-glucosidase [Culex quinquefasciatus]
gi|167879683|gb|EDS43066.1| bile acid beta-glucosidase [Culex quinquefasciatus]
Length = 913
Score = 105 bits (262), Expect = 4e-21, Method: Composition-based stats.
Identities = 57/122 (46%), Positives = 76/122 (62%), Gaps = 8/122 (6%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP----- 55
M+DQLCG+ +L+ G + + I P N+ AL TI+ NV F G+MGAVNG P
Sbjct: 762 MSDQLCGHWYLRCCGFDYD-IVPKENVRTALKTIYDNNVMRFCGGNMGAVNGYIPSSQPN 820
Query: 56 -NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPE 114
+G D S +Q+EEVWTGVTY L++ M++EG EA+ TAGGLY+ L E ET PE
Sbjct: 821 KDGRADSSTLQAEEVWTGVTYALASTMIHEGMFTEAFQTAGGLYQTLSEKIGMSFET-PE 879
Query: 115 GV 116
+
Sbjct: 880 AL 881
>gi|444729906|gb|ELW70309.1| Non-lysosomal glucosylceramidase [Tupaia chinensis]
Length = 1001
Score = 105 bits (262), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 55/106 (51%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 824 MSDQCAGQWFLKACGLGEGDTE-VFPTPHVVRALQTIFEFNVQAFAGGAMGAVNGMQPHG 882
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
DRS++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 883 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 928
>gi|403306662|ref|XP_003943842.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Saimiri
boliviensis boliviensis]
Length = 923
Score = 105 bits (262), Expect = 4e-21, Method: Composition-based stats.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGDTE-VFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 808
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D+S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854
>gi|385648282|ref|NP_001245314.1| glucosidase, beta (bile acid) 2 [Sus scrofa]
Length = 915
Score = 105 bits (261), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 74/106 (69%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FL+ASG+ +TE +FP+ ++ +AL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 742 MSDQCAGQWFLRASGLGEGDTE-VFPTPHVVQALQTIFEFNVQAFAGGAMGAVNGMQPHG 800
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
DRS++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 801 VPDRSSVQSDEVWVGVVYALAATMIQEGLTWEGFRTAEGCYRTVWE 846
>gi|432110769|gb|ELK34246.1| Non-lysosomal glucosylceramidase [Myotis davidii]
Length = 868
Score = 105 bits (261), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 59/117 (50%), Positives = 77/117 (65%), Gaps = 5/117 (4%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FL+ASG+ TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P G
Sbjct: 712 MSDQCAGQWFLRASGLGEGETE-VFPTQHVVRALQTIFEFNVQAFAGGTMGAVNGMQPQG 770
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPE 114
DRS++QS+EVW GV Y L+A M+ EG E + TA G YR + E G +T PE
Sbjct: 771 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWERLGLGFQT-PE 826
>gi|195115764|ref|XP_002002426.1| GI12858 [Drosophila mojavensis]
gi|193913001|gb|EDW11868.1| GI12858 [Drosophila mojavensis]
Length = 877
Score = 105 bits (261), Expect = 5e-21, Method: Composition-based stats.
Identities = 56/123 (45%), Positives = 74/123 (60%), Gaps = 9/123 (7%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
MADQLCG+ +LK+ G + E I+P N+ AL I+ NV GF G++GA NG N
Sbjct: 725 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANVSEP 783
Query: 57 ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
G D S IQ+EEVW GV Y L+A M+ EG DEA+ TAGG+Y+ L + R+ P
Sbjct: 784 SKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFDEAFQTAGGMYKTLSQ-RIGMNYETP 842
Query: 114 EGV 116
E +
Sbjct: 843 EAL 845
>gi|395855828|ref|XP_003800351.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal glucosylceramidase
[Otolemur garnettii]
Length = 1147
Score = 105 bits (261), Expect = 5e-21, Method: Composition-based stats.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TI+ NV+ F G+MGAVNGM+P+G
Sbjct: 970 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIYELNVQAFAGGAMGAVNGMQPHG 1028
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
DRS++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 1029 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 1074
>gi|397519481|ref|XP_003829887.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Pan
paniscus]
Length = 927
Score = 105 bits (261), Expect = 5e-21, Method: Composition-based stats.
Identities = 53/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL T+F NV+ F G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTVFELNVQAFAGGAMGAVNGMQPHG 808
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D+S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854
>gi|402897018|ref|XP_003911574.1| PREDICTED: non-lysosomal glucosylceramidase [Papio anubis]
Length = 927
Score = 105 bits (261), Expect = 5e-21, Method: Composition-based stats.
Identities = 54/106 (50%), Positives = 72/106 (67%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGETE-VFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 808
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D+S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854
>gi|355567647|gb|EHH23988.1| Non-lysosomal glucosylceramidase [Macaca mulatta]
gi|355753217|gb|EHH57263.1| Non-lysosomal glucosylceramidase [Macaca fascicularis]
gi|380789499|gb|AFE66625.1| non-lysosomal glucosylceramidase [Macaca mulatta]
gi|384943046|gb|AFI35128.1| non-lysosomal glucosylceramidase [Macaca mulatta]
Length = 927
Score = 105 bits (261), Expect = 5e-21, Method: Composition-based stats.
Identities = 54/106 (50%), Positives = 72/106 (67%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGETE-VFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 808
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D+S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854
>gi|109111050|ref|XP_001084705.1| PREDICTED: non-lysosomal glucosylceramidase [Macaca mulatta]
Length = 927
Score = 105 bits (261), Expect = 5e-21, Method: Composition-based stats.
Identities = 54/106 (50%), Positives = 72/106 (67%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGETE-VFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 808
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D+S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854
>gi|390458158|ref|XP_002743170.2| PREDICTED: non-lysosomal glucosylceramidase [Callithrix jacchus]
Length = 1065
Score = 105 bits (261), Expect = 5e-21, Method: Composition-based stats.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 892 MSDQCAGQWFLKACGLGEGDTE-VFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 950
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D+S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 951 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 996
>gi|395740303|ref|XP_003777396.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Pongo
abelii]
Length = 933
Score = 104 bits (260), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 756 MSDQCAGQWFLKACGLGEGDTE-VFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 814
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D+S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 815 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 860
>gi|297684086|ref|XP_002819683.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Pongo
abelii]
Length = 927
Score = 104 bits (260), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGDTE-VFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 808
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D+S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854
>gi|426361725|ref|XP_004048050.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Gorilla
gorilla gorilla]
Length = 927
Score = 104 bits (260), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 808
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D+S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854
>gi|410978559|ref|XP_003995657.1| PREDICTED: non-lysosomal glucosylceramidase [Felis catus]
Length = 917
Score = 104 bits (260), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 55/106 (51%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FL+ASG+ +TE +FP+ ++ AL TIF NVR F G+MGAVNGM+P+G
Sbjct: 742 MSDQCAGQWFLRASGLGEGDTE-VFPTPHVVCALQTIFEFNVRAFAGGAMGAVNGMQPHG 800
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
DRS++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 801 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 846
>gi|194386992|dbj|BAG59862.1| unnamed protein product [Homo sapiens]
Length = 755
Score = 104 bits (260), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 578 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 636
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D+S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 637 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 682
>gi|158299437|ref|XP_319575.4| AGAP008830-PA [Anopheles gambiae str. PEST]
gi|157013519|gb|EAA14848.4| AGAP008830-PA [Anopheles gambiae str. PEST]
Length = 913
Score = 104 bits (259), Expect = 8e-21, Method: Composition-based stats.
Identities = 54/122 (44%), Positives = 78/122 (63%), Gaps = 8/122 (6%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
M+DQLCG+ +L++ G + + +FP N+ A+ TI+ NV F G +GAVNG P+G +
Sbjct: 762 MSDQLCGHWYLRSCGFDYD-VFPKENVRLAMRTIYENNVMRFCGGQLGAVNGYVPSGQPN 820
Query: 61 RS------AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPE 114
+ +IQ EEVWTGVTY L++ M++EG +EA+ TAGGLY+ L E R+ PE
Sbjct: 821 KDGRPDTVSIQGEEVWTGVTYALASTMIHEGMFEEAFKTAGGLYQALSE-RIGMNFETPE 879
Query: 115 GV 116
V
Sbjct: 880 AV 881
>gi|343959578|dbj|BAK63646.1| bile acid beta-glucosidase [Pan troglodytes]
Length = 514
Score = 104 bits (259), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 337 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 395
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D+S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 396 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 441
>gi|189217893|ref|NP_001013109.2| non-lysosomal glucosylceramidase [Rattus norvegicus]
gi|149045757|gb|EDL98757.1| glucosidase beta 2 [Rattus norvegicus]
Length = 904
Score = 103 bits (258), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FL+A G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 741 MSDQCAGQWFLRACGLGEGDTE-VFPTLHVVRALKTIFELNVQAFAGGAMGAVNGMQPHG 799
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
DRS++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 800 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 845
>gi|301767028|ref|XP_002918945.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Ailuropoda melanoleuca]
Length = 928
Score = 103 bits (258), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 55/106 (51%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKASG+ +TE +FP+ ++ AL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 753 MSDQCAGQWFLKASGLGEGDTE-VFPTPHVVCALQTIFEFNVQAFAGGAMGAVNGMQPHG 811
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
DRS++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 812 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 857
>gi|281339788|gb|EFB15372.1| hypothetical protein PANDA_007478 [Ailuropoda melanoleuca]
Length = 923
Score = 103 bits (258), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 55/106 (51%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKASG+ +TE +FP+ ++ AL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKASGLGEGDTE-VFPTPHVVCALQTIFEFNVQAFAGGAMGAVNGMQPHG 808
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
DRS++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 809 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 854
>gi|143018484|sp|Q5M868.2|GBA2_RAT RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
AltName: Full=Beta-glucocerebrosidase 2;
Short=Beta-glucosidase 2; AltName:
Full=Glucosylceramidase 2
Length = 912
Score = 103 bits (258), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FL+A G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 741 MSDQCAGQWFLRACGLGEGDTE-VFPTLHVVRALKTIFELNVQAFAGGAMGAVNGMQPHG 799
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
DRS++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 800 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 845
>gi|195436933|ref|XP_002066400.1| GK18271 [Drosophila willistoni]
gi|194162485|gb|EDW77386.1| GK18271 [Drosophila willistoni]
Length = 955
Score = 103 bits (258), Expect = 1e-20, Method: Composition-based stats.
Identities = 55/123 (44%), Positives = 74/123 (60%), Gaps = 9/123 (7%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
MADQLCG+ +LK+ G + E I+P N+ AL I+ NV GF G++GA NG N
Sbjct: 803 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANVSEP 861
Query: 57 ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
G D S IQ+EEVW GV Y L+A M+ EG DEA+ TAGG+Y+ + + R+ P
Sbjct: 862 SKAGHVDNSNIQAEEVWPGVVYALAATMIQEGMFDEAFQTAGGMYKTISQ-RIGMNFETP 920
Query: 114 EGV 116
E +
Sbjct: 921 EAL 923
>gi|73971791|ref|XP_531991.2| PREDICTED: non-lysosomal glucosylceramidase [Canis lupus familiaris]
Length = 1136
Score = 103 bits (258), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 55/106 (51%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FL+ASG+ +TE +FP+ ++ AL TIF NVR F G+MGAVNGM+P+G
Sbjct: 961 MSDQCAGQWFLRASGLGEGDTE-VFPTRHVVCALQTIFEFNVRAFAGGAMGAVNGMQPHG 1019
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
DRS++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 1020 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 1065
>gi|397519483|ref|XP_003829888.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Pan
paniscus]
Length = 755
Score = 103 bits (258), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 53/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL T+F NV+ F G+MGAVNGM+P+G
Sbjct: 578 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTVFELNVQAFAGGAMGAVNGMQPHG 636
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D+S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 637 VPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 682
>gi|338720173|ref|XP_001497796.3| PREDICTED: non-lysosomal glucosylceramidase [Equus caballus]
Length = 983
Score = 103 bits (257), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 73/106 (68%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FL+A G+ +TE +FP+ +I RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 806 MSDQCAGQWFLRACGLGEGDTE-VFPTRHIVRALQTIFELNVQAFAGGAMGAVNGMQPHG 864
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
DR+++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 865 VPDRTSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 910
>gi|240120073|ref|NP_766280.2| non-lysosomal glucosylceramidase [Mus musculus]
gi|143018443|sp|Q69ZF3.2|GBA2_MOUSE RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
AltName: Full=Beta-glucocerebrosidase 2;
Short=Beta-glucosidase 2; AltName:
Full=Glucosylceramidase 2
gi|34784135|gb|AAH56935.1| Glucosidase beta 2 [Mus musculus]
Length = 918
Score = 103 bits (257), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 72/106 (67%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FL+A G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM P+G
Sbjct: 741 MSDQCAGQWFLRACGLGEGDTE-VFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHG 799
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
DRS++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 800 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 845
>gi|26354314|dbj|BAC40785.1| unnamed protein product [Mus musculus]
Length = 918
Score = 103 bits (257), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 72/106 (67%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FL+A G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM P+G
Sbjct: 741 MSDQCAGQWFLRACGLGEGDTE-VFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHG 799
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
DRS++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 800 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 845
>gi|50511011|dbj|BAD32491.1| mKIAA1605 protein [Mus musculus]
Length = 940
Score = 103 bits (257), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 72/106 (67%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FL+A G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM P+G
Sbjct: 763 MSDQCAGQWFLRACGLGEGDTE-VFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHG 821
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
DRS++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 822 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 867
>gi|148670514|gb|EDL02461.1| glucosidase beta 2, isoform CRA_a [Mus musculus]
Length = 918
Score = 103 bits (257), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 72/106 (67%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FL+A G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM P+G
Sbjct: 741 MSDQCAGQWFLRACGLGEGDTE-VFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHG 799
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
DRS++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 800 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 845
>gi|148670516|gb|EDL02463.1| glucosidase beta 2, isoform CRA_c [Mus musculus]
Length = 467
Score = 103 bits (257), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 72/106 (67%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FL+A G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM P+G
Sbjct: 290 MSDQCAGQWFLRACGLGEGDTE-VFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHG 348
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
DRS++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 349 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 394
>gi|189239183|ref|XP_966847.2| PREDICTED: similar to bile acid beta-glucosidase, putative
[Tribolium castaneum]
gi|270010940|gb|EFA07388.1| hypothetical protein TcasGA2_TC016367 [Tribolium castaneum]
Length = 818
Score = 103 bits (256), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/103 (50%), Positives = 69/103 (66%), Gaps = 2/103 (1%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQLCG +L++ G N E +FP + +L TI+ NV+ F G MGAVNG +G D
Sbjct: 674 MADQLCGQWYLRSCGFNYE-VFPQDRVKTSLKTIYENNVQSFCDGRMGAVNGFI-DGVID 731
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
IQS+EVWTGVTY L+A+ML EG EA+ TAGG+++ + E
Sbjct: 732 EFTIQSQEVWTGVTYALAASMLQEGMRTEAFNTAGGMFKSMSE 774
>gi|45552032|ref|NP_788055.2| CG33090 [Drosophila melanogaster]
gi|74876618|sp|Q7KT91.1|C3390_DROME RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase
gi|45445134|gb|AAO41192.2| CG33090 [Drosophila melanogaster]
gi|201065481|gb|ACH92150.1| FI02015p [Drosophila melanogaster]
Length = 948
Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats.
Identities = 55/123 (44%), Positives = 74/123 (60%), Gaps = 9/123 (7%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
MADQLCG+ +LK+ G + E I+P N+ AL I+ NV GF G++GA NG N
Sbjct: 796 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEP 854
Query: 57 ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
G D S IQ+EEVW GV Y L+A M+ EG +EA+ TAGG+Y+ L + R+ P
Sbjct: 855 TKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQ-RIGMNFETP 913
Query: 114 EGV 116
E +
Sbjct: 914 EAL 916
>gi|194860149|ref|XP_001969520.1| GG23916 [Drosophila erecta]
gi|190661387|gb|EDV58579.1| GG23916 [Drosophila erecta]
Length = 947
Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats.
Identities = 55/123 (44%), Positives = 74/123 (60%), Gaps = 9/123 (7%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
MADQLCG+ +LK+ G + E I+P N+ AL I+ NV GF G++GA NG N
Sbjct: 795 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEP 853
Query: 57 ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
G D S IQ+EEVW GV Y L+A M+ EG +EA+ TAGG+Y+ L + R+ P
Sbjct: 854 TKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQ-RIGMNFETP 912
Query: 114 EGV 116
E +
Sbjct: 913 EAL 915
>gi|194760205|ref|XP_001962332.1| GF15415 [Drosophila ananassae]
gi|190616029|gb|EDV31553.1| GF15415 [Drosophila ananassae]
Length = 894
Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats.
Identities = 55/123 (44%), Positives = 74/123 (60%), Gaps = 9/123 (7%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
MADQLCG+ +LK+ G + E I+P N+ AL I+ NV GF G++GA NG N
Sbjct: 742 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANASDP 800
Query: 57 ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
G D S IQSEEVW GV Y L+A M+ EG +EA+ TAGG+Y+ + + R+ P
Sbjct: 801 SKPGHVDNSNIQSEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTISQ-RIGMNFETP 859
Query: 114 EGV 116
E +
Sbjct: 860 EAL 862
>gi|195474003|ref|XP_002089281.1| GE19029 [Drosophila yakuba]
gi|194175382|gb|EDW88993.1| GE19029 [Drosophila yakuba]
Length = 947
Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats.
Identities = 55/123 (44%), Positives = 74/123 (60%), Gaps = 9/123 (7%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
MADQLCG+ +LK+ G + E I+P N+ AL I+ NV GF G++GA NG N
Sbjct: 795 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEP 853
Query: 57 ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
G D S IQ+EEVW GV Y L+A M+ EG +EA+ TAGG+Y+ L + R+ P
Sbjct: 854 TKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQ-RIGMNFETP 912
Query: 114 EGV 116
E +
Sbjct: 913 EAL 915
>gi|195338407|ref|XP_002035816.1| GM15538 [Drosophila sechellia]
gi|194129696|gb|EDW51739.1| GM15538 [Drosophila sechellia]
Length = 947
Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats.
Identities = 55/123 (44%), Positives = 74/123 (60%), Gaps = 9/123 (7%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
MADQLCG+ +LK+ G + E I+P N+ AL I+ NV GF G++GA NG N
Sbjct: 795 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEP 853
Query: 57 ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
G D S IQ+EEVW GV Y L+A M+ EG +EA+ TAGG+Y+ L + R+ P
Sbjct: 854 TKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQ-RIGMNFETP 912
Query: 114 EGV 116
E +
Sbjct: 913 EAL 915
>gi|195579186|ref|XP_002079443.1| GD23959 [Drosophila simulans]
gi|194191452|gb|EDX05028.1| GD23959 [Drosophila simulans]
Length = 947
Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats.
Identities = 55/123 (44%), Positives = 74/123 (60%), Gaps = 9/123 (7%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
MADQLCG+ +LK+ G + E I+P N+ AL I+ NV GF G++GA NG N
Sbjct: 795 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEP 853
Query: 57 ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
G D S IQ+EEVW GV Y L+A M+ EG +EA+ TAGG+Y+ L + R+ P
Sbjct: 854 TKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQ-RIGMNFETP 912
Query: 114 EGV 116
E +
Sbjct: 913 EAL 915
>gi|198424051|ref|XP_002127036.1| PREDICTED: similar to Non-lysosomal glucosylceramidase (NLGase)
(Glucosylceramidase 2) (Beta-glucocerebrosidase 2)
(Beta-glucosidase 2) [Ciona intestinalis]
Length = 861
Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats.
Identities = 49/99 (49%), Positives = 67/99 (67%), Gaps = 2/99 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQ G FL ASG+ + P + +L TIFS NV ++ G GAVNGM+P+G D
Sbjct: 720 MADQCAGQWFLLASGICD--VLPDDQVMSSLKTIFSLNVEQYDGGRHGAVNGMRPHGAVD 777
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYR 99
S++QS+EVWTGVTY L+A M+++G + E + TA G+YR
Sbjct: 778 TSSVQSDEVWTGVTYALAATMIHKGMLKEGFRTASGIYR 816
>gi|195397814|ref|XP_002057523.1| GJ18180 [Drosophila virilis]
gi|194141177|gb|EDW57596.1| GJ18180 [Drosophila virilis]
Length = 869
Score = 102 bits (255), Expect = 3e-20, Method: Composition-based stats.
Identities = 54/123 (43%), Positives = 74/123 (60%), Gaps = 9/123 (7%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
MADQLCG+ +LK+ G + E I+P N+ AL I+ NV GF G++GA NG N
Sbjct: 717 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEP 775
Query: 57 ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
G D S IQ+EEVW GV Y L+A M+ EG +EA+ TAGG+Y+ + + R+ P
Sbjct: 776 SKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTISQ-RIGMNYETP 834
Query: 114 EGV 116
E +
Sbjct: 835 EAL 837
>gi|288558766|gb|ADC53514.1| MIP16949p [Drosophila melanogaster]
Length = 432
Score = 102 bits (255), Expect = 3e-20, Method: Composition-based stats.
Identities = 55/123 (44%), Positives = 74/123 (60%), Gaps = 9/123 (7%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
MADQLCG+ +LK+ G + E I+P N+ AL I+ NV GF G++GA NG N
Sbjct: 280 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEP 338
Query: 57 ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
G D S IQ+EEVW GV Y L+A M+ EG +EA+ TAGG+Y+ L + R+ P
Sbjct: 339 TKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQ-RIGMNFETP 397
Query: 114 EGV 116
E +
Sbjct: 398 EAL 400
>gi|326934726|ref|XP_003213436.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Meleagris gallopavo]
Length = 639
Score = 102 bits (254), Expect = 3e-20, Method: Composition-based stats.
Identities = 51/104 (49%), Positives = 67/104 (64%), Gaps = 2/104 (1%)
Query: 3 DQLCGYLFLKASGVNTEA--IFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
DQ G FL A G++ + +FP ++I AL TIF NV F G+MGAVNGM+P+G D
Sbjct: 481 DQCAGQWFLGACGLDQKELEVFPKSHIVSALKTIFEKNVMSFAGGTMGAVNGMRPDGVPD 540
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
S++QS EVW GV Y L+A M+ EG V E + TA G YR + E+
Sbjct: 541 TSSVQSSEVWVGVVYALAATMIQEGLVQEGFHTAEGCYRTVWEN 584
>gi|195033785|ref|XP_001988762.1| GH11342 [Drosophila grimshawi]
gi|193904762|gb|EDW03629.1| GH11342 [Drosophila grimshawi]
Length = 869
Score = 102 bits (253), Expect = 3e-20, Method: Composition-based stats.
Identities = 54/123 (43%), Positives = 74/123 (60%), Gaps = 9/123 (7%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
MADQLCG+ +LK+ G + E I+P N+ AL I+ NV GF G++GA NG N
Sbjct: 717 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANANEP 775
Query: 57 ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAP 113
G D S IQ+EEVW GV Y L+A M+ EG +EA+ TAGG+Y+ + + R+ P
Sbjct: 776 SKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTISQ-RIGMNFETP 834
Query: 114 EGV 116
E +
Sbjct: 835 EAL 837
>gi|169642399|gb|AAI60640.1| LOC100145361 protein [Xenopus (Silurana) tropicalis]
Length = 571
Score = 101 bits (252), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 51/105 (48%), Positives = 67/105 (63%), Gaps = 2/105 (1%)
Query: 1 MADQLCGYLFLKASGVNT--EAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
M DQ G FL+A G+ +FP ++ AL TIF NV+ F G MGAVNGM+P+G
Sbjct: 415 MVDQCAGNWFLRACGLGAGESEVFPKDHVVSALRTIFELNVKQFADGQMGAVNGMRPDGT 474
Query: 59 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D S++QS+EVW GV Y L+A M++EG V E + TA G YR + E
Sbjct: 475 TDTSSVQSDEVWIGVVYGLAATMIHEGLVQEGFATAEGCYRTVWE 519
>gi|432920219|ref|XP_004079895.1| PREDICTED: non-lysosomal glucosylceramidase-like [Oryzias latipes]
Length = 858
Score = 101 bits (252), Expect = 5e-20, Method: Composition-based stats.
Identities = 55/117 (47%), Positives = 68/117 (58%), Gaps = 3/117 (2%)
Query: 1 MADQLCGYLFLKASGVNTE--AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
M+DQ G+ FLKAS E FP I L +IF NV F G MGAVNGM+P G
Sbjct: 699 MSDQCAGHWFLKASACGEEDYQAFPKEKIQSTLKSIFDLNVMAFAGGQMGAVNGMRPEGV 758
Query: 59 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
DRS++QS+EVW GV Y L+A M++EG +E TA G YR + E R+ PE
Sbjct: 759 PDRSSVQSDEVWIGVVYGLAATMIHEGMREEGMHTAEGCYRTVWE-RLGMAFQTPEA 814
>gi|242015137|ref|XP_002428230.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212512791|gb|EEB15492.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 812
Score = 101 bits (252), Expect = 6e-20, Method: Composition-based stats.
Identities = 49/116 (42%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
M+DQLCG +L G+ E + P A + +L I+ NV G++G VNGM PNG+ D
Sbjct: 669 MSDQLCGLWYLYCCGIKNE-VLPPAKVNSSLRCIYENNVLKLGKGNLGCVNGMLPNGEID 727
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEGV 116
+QSEE+W GV Y L++ M++EG V+E W TA G+Y + + G ET PE +
Sbjct: 728 IHTLQSEEIWVGVVYALASLMIFEGMVEEGWKTARGIYETVYDRIGMGFET-PEAL 782
>gi|301628717|ref|XP_002943495.1| PREDICTED: non-lysosomal glucosylceramidase [Xenopus (Silurana)
tropicalis]
Length = 692
Score = 101 bits (251), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 65/101 (64%), Gaps = 2/101 (1%)
Query: 1 MADQLCGYLFLKASGVNT--EAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
M DQ G FL+A G+ +FP ++ AL TIF NV+ F G MGAVNGM+P+G
Sbjct: 536 MVDQCAGNWFLRACGLGAGESEVFPKDHVVSALRTIFELNVKQFADGQMGAVNGMRPDGT 595
Query: 59 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYR 99
D S++QS+EVW GV Y L+A M++EG V E + TA G YR
Sbjct: 596 TDTSSVQSDEVWIGVVYGLAATMIHEGLVQEGFATAEGCYR 636
>gi|194669510|ref|XP_001790122.1| PREDICTED: non-lysosomal glucosylceramidase [Bos taurus]
gi|297477943|ref|XP_002689726.1| PREDICTED: non-lysosomal glucosylceramidase [Bos taurus]
gi|296484715|tpg|DAA26830.1| TPA: hypothetical protein BOS_8918 [Bos taurus]
Length = 918
Score = 101 bits (251), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 52/105 (49%), Positives = 68/105 (64%), Gaps = 2/105 (1%)
Query: 1 MADQLCGYLFLKASGVN--TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
M+DQ G FL+ASG+ +FP+ + RAL TIF NV+ F G+MGAVNGM+P G
Sbjct: 740 MSDQCAGQWFLRASGLGEGDSEVFPTQQVVRALQTIFEFNVQAFAGGAMGAVNGMQPCGV 799
Query: 59 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 800 PDTSSVQSDEVWVGVVYGLAATMIQEGLTREGFRTAEGCYRTVWE 844
>gi|440901933|gb|ELR52793.1| Non-lysosomal glucosylceramidase, partial [Bos grunniens mutus]
Length = 927
Score = 101 bits (251), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 52/105 (49%), Positives = 68/105 (64%), Gaps = 2/105 (1%)
Query: 1 MADQLCGYLFLKASGVN--TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
M+DQ G FL+ASG+ +FP+ + RAL TIF NV+ F G+MGAVNGM+P G
Sbjct: 749 MSDQCAGQWFLRASGLGEGDSEVFPTQQVVRALQTIFEFNVQAFAGGAMGAVNGMQPRGV 808
Query: 59 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 809 PDTSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 853
>gi|344271640|ref|XP_003407645.1| PREDICTED: non-lysosomal glucosylceramidase [Loxodonta africana]
Length = 967
Score = 100 bits (250), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 52/106 (49%), Positives = 72/106 (67%), Gaps = 4/106 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FL+A G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 794 MSDQCAGQWFLRACGLGEGDTE-VFPTPHVVRALQTIFEFNVQAFAGGAMGAVNGMQPHG 852
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D S++QS+EVW GV Y L+A M+ EG + + TA G YR + E
Sbjct: 853 VPDTSSVQSDEVWVGVVYGLAATMIQEGLTSKGFWTAEGCYRTVWE 898
>gi|426220232|ref|XP_004004320.1| PREDICTED: non-lysosomal glucosylceramidase [Ovis aries]
Length = 918
Score = 100 bits (250), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 52/105 (49%), Positives = 68/105 (64%), Gaps = 2/105 (1%)
Query: 1 MADQLCGYLFLKASGVN--TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
M+DQ G FL+ASG+ +FP+ + RAL TIF NV+ F G+MGAVNGM+P G
Sbjct: 740 MSDQCAGQWFLRASGLGEGDSEVFPTQQVVRALQTIFEFNVQAFAGGAMGAVNGMQPRGV 799
Query: 59 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D S++QS+EVW GV Y L+A M+ EG E + TA G YR + E
Sbjct: 800 PDTSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE 844
>gi|363744064|ref|XP_003642968.1| PREDICTED: non-lysosomal glucosylceramidase-like [Gallus gallus]
Length = 853
Score = 100 bits (249), Expect = 1e-19, Method: Composition-based stats.
Identities = 51/103 (49%), Positives = 65/103 (63%), Gaps = 2/103 (1%)
Query: 3 DQLCGYLFLKASGVNTEAI--FPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
DQ G FL A G++ + + FP ++I AL TIF NV F G MGAVNGM+P+G D
Sbjct: 695 DQCAGQWFLGACGLDQKEVEVFPKSHIVSALKTIFEKNVMSFAGGKMGAVNGMRPDGVPD 754
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
S++QS EVW GV Y L+A M+ EG V E + TA G YR + E
Sbjct: 755 TSSVQSSEVWVGVVYALAATMIQEGLVQEGFHTAEGCYRTVWE 797
>gi|224090236|ref|XP_002190896.1| PREDICTED: non-lysosomal glucosylceramidase [Taeniopygia guttata]
Length = 826
Score = 100 bits (248), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/105 (49%), Positives = 68/105 (64%), Gaps = 2/105 (1%)
Query: 1 MADQLCGYLFLKASGVNT--EAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
M+DQ G FL A G++ +FP ++I AL TIF NV F G+MGAVNGM+P+G
Sbjct: 668 MSDQCAGQWFLGACGLDQGEFEVFPKSHILSALRTIFEKNVLSFAGGTMGAVNGMRPDGV 727
Query: 59 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D S++QS EVW GV Y L+A M+ EG V+E + TA G YR + E
Sbjct: 728 PDTSSVQSNEVWVGVVYSLAATMIQEGMVEEGFRTAEGCYRTVWE 772
>gi|348520215|ref|XP_003447624.1| PREDICTED: non-lysosomal glucosylceramidase-like [Oreochromis
niloticus]
Length = 856
Score = 99.8 bits (247), Expect = 2e-19, Method: Composition-based stats.
Identities = 50/105 (47%), Positives = 65/105 (61%), Gaps = 2/105 (1%)
Query: 1 MADQLCGYLFLKASGVNTEAI--FPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
M+DQ G+ FL+ASG+ FP I AL ++F NV F G MGAVNGM+P G
Sbjct: 699 MSDQCAGHWFLRASGLGDGEFQAFPKEKIQSALKSVFDLNVMSFAGGQMGAVNGMRPEGV 758
Query: 59 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D S++QS+EVW GV Y L+A M++EG +E TA G YR + E
Sbjct: 759 PDHSSVQSDEVWIGVVYGLAATMIHEGMQEEGMRTAEGCYRTVWE 803
>gi|308485740|ref|XP_003105068.1| hypothetical protein CRE_20734 [Caenorhabditis remanei]
gi|308257013|gb|EFP00966.1| hypothetical protein CRE_20734 [Caenorhabditis remanei]
Length = 933
Score = 98.6 bits (244), Expect = 4e-19, Method: Composition-based stats.
Identities = 49/99 (49%), Positives = 68/99 (68%), Gaps = 1/99 (1%)
Query: 1 MADQLCGYLFLKA-SGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR 59
MADQLCGY FL++ S + + P+ + AL TI+ NV F G MGAVNGMKPNG
Sbjct: 729 MADQLCGYWFLQSVSPEMVDDLLPNHMVRSALDTIYRLNVCRFGNGQMGAVNGMKPNGVV 788
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
DR IQ++E+WTGVTY +++ ++ +G V++A+ TA G Y
Sbjct: 789 DREYIQADEMWTGVTYAVASLLIQQGEVEKAFHTASGSY 827
>gi|449275451|gb|EMC84314.1| Non-lysosomal glucosylceramidase [Columba livia]
Length = 692
Score = 97.8 bits (242), Expect = 6e-19, Method: Composition-based stats.
Identities = 44/83 (53%), Positives = 58/83 (69%)
Query: 21 IFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAA 80
+FP +++ AL TIF NV GF G+MGAVNGM+P+G D S++QS EVW GV Y L+A
Sbjct: 554 VFPKSHVVSALKTIFEKNVMGFAGGTMGAVNGMRPSGVPDTSSVQSNEVWVGVVYALAAT 613
Query: 81 MLYEGNVDEAWTTAGGLYRRLRE 103
M+ EG V+E + TA G YR + E
Sbjct: 614 MIQEGLVEEGFRTAEGCYRTVWE 636
>gi|268569852|ref|XP_002640631.1| Hypothetical protein CBG08749 [Caenorhabditis briggsae]
Length = 899
Score = 97.4 bits (241), Expect = 8e-19, Method: Composition-based stats.
Identities = 49/99 (49%), Positives = 68/99 (68%), Gaps = 1/99 (1%)
Query: 1 MADQLCGYLFLKA-SGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR 59
MADQLCGY FL++ S + + PS + AL TI+ NV F G MGAVNGMKP+G
Sbjct: 701 MADQLCGYWFLQSVSPEMADDLLPSHMVRSALDTIYRLNVCQFGNGQMGAVNGMKPSGVV 760
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
DR IQ++E+WTGVTY +++ ++ +G V++A+ TA G Y
Sbjct: 761 DREYIQADEMWTGVTYAVASLLIQQGEVEKAFHTASGSY 799
>gi|341901617|gb|EGT57552.1| hypothetical protein CAEBREN_15139 [Caenorhabditis brenneri]
Length = 925
Score = 97.1 bits (240), Expect = 1e-18, Method: Composition-based stats.
Identities = 48/99 (48%), Positives = 68/99 (68%), Gaps = 1/99 (1%)
Query: 1 MADQLCGYLFLKA-SGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR 59
MADQLCGY FL++ S + + P+ + AL TI+ NV F G MGAVNGMKP+G
Sbjct: 705 MADQLCGYWFLQSVSPEMADDLLPNHMVRSALDTIYRLNVCKFGNGQMGAVNGMKPSGVV 764
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
DR IQ++E+WTGVTY +++ ++ +G V++A+ TA G Y
Sbjct: 765 DREYIQADEMWTGVTYAVASLLIQQGEVEKAFHTASGSY 803
>gi|71992715|ref|NP_001021680.1| Protein HPO-13, isoform a [Caenorhabditis elegans]
gi|18376555|emb|CAD21661.1| Protein HPO-13, isoform a [Caenorhabditis elegans]
Length = 959
Score = 96.7 bits (239), Expect = 2e-18, Method: Composition-based stats.
Identities = 48/99 (48%), Positives = 68/99 (68%), Gaps = 1/99 (1%)
Query: 1 MADQLCGYLFLKA-SGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR 59
MADQLCGY FL++ S + + P+ + AL TI+ NV F G MGAVNGMKP+G
Sbjct: 753 MADQLCGYWFLQSVSPELADELLPNHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVV 812
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
DR IQ++E+WTGVTY +++ ++ +G V++A+ TA G Y
Sbjct: 813 DREYIQADEMWTGVTYAVASLLIQQGEVEKAFHTASGSY 851
>gi|71992723|ref|NP_001021681.1| Protein HPO-13, isoform b [Caenorhabditis elegans]
gi|50511739|emb|CAH04764.1| Protein HPO-13, isoform b [Caenorhabditis elegans]
Length = 930
Score = 95.9 bits (237), Expect = 2e-18, Method: Composition-based stats.
Identities = 48/99 (48%), Positives = 68/99 (68%), Gaps = 1/99 (1%)
Query: 1 MADQLCGYLFLKA-SGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR 59
MADQLCGY FL++ S + + P+ + AL TI+ NV F G MGAVNGMKP+G
Sbjct: 724 MADQLCGYWFLQSVSPELADELLPNHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVV 783
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
DR IQ++E+WTGVTY +++ ++ +G V++A+ TA G Y
Sbjct: 784 DREYIQADEMWTGVTYAVASLLIQQGEVEKAFHTASGSY 822
>gi|71992731|ref|NP_001021682.1| Protein HPO-13, isoform c [Caenorhabditis elegans]
gi|15130776|emb|CAC48128.1| Protein HPO-13, isoform c [Caenorhabditis elegans]
Length = 922
Score = 95.9 bits (237), Expect = 2e-18, Method: Composition-based stats.
Identities = 48/99 (48%), Positives = 68/99 (68%), Gaps = 1/99 (1%)
Query: 1 MADQLCGYLFLKA-SGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR 59
MADQLCGY FL++ S + + P+ + AL TI+ NV F G MGAVNGMKP+G
Sbjct: 716 MADQLCGYWFLQSVSPELADELLPNHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVV 775
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
DR IQ++E+WTGVTY +++ ++ +G V++A+ TA G Y
Sbjct: 776 DREYIQADEMWTGVTYAVASLLIQQGEVEKAFHTASGSY 814
>gi|431902810|gb|ELK09025.1| Non-lysosomal glucosylceramidase [Pteropus alecto]
Length = 970
Score = 94.7 bits (234), Expect = 6e-18, Method: Composition-based stats.
Identities = 47/94 (50%), Positives = 67/94 (71%), Gaps = 4/94 (4%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FL+ASG+ +TE +FP+ ++ AL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 718 MSDQCAGQWFLRASGLGEGDTE-VFPTPHVVCALQTIFKFNVQAFAGGAMGAVNGMQPHG 776
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 91
DRS++QS+EVW GV Y L+A M+ E N+ ++
Sbjct: 777 VPDRSSVQSDEVWVGVVYGLAATMIQETNLMSSY 810
>gi|148670515|gb|EDL02462.1| glucosidase beta 2, isoform CRA_b [Mus musculus]
Length = 377
Score = 94.7 bits (234), Expect = 6e-18, Method: Composition-based stats.
Identities = 46/87 (52%), Positives = 62/87 (71%), Gaps = 4/87 (4%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FL+A G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM P+G
Sbjct: 241 MSDQCAGQWFLRACGLGEGDTE-VFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHG 299
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYE 84
DRS++QS+EVW GV Y L+A M+ E
Sbjct: 300 VPDRSSVQSDEVWVGVVYGLAATMIQE 326
>gi|119578751|gb|EAW58347.1| glucosidase, beta (bile acid) 2, isoform CRA_c [Homo sapiens]
gi|168270622|dbj|BAG10104.1| non-lysosomal glucosylceramidase [synthetic construct]
Length = 877
Score = 94.7 bits (234), Expect = 7e-18, Method: Composition-based stats.
Identities = 46/87 (52%), Positives = 63/87 (72%), Gaps = 4/87 (4%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 750 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 808
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYE 84
D+S++QS+EVW GV Y L+A M+ E
Sbjct: 809 VPDKSSVQSDEVWVGVVYGLAATMIQE 835
>gi|333898016|ref|YP_004471890.1| glucosylceramidase [Thermoanaerobacterium xylanolyticum LX-11]
gi|333113281|gb|AEF18218.1| Glucosylceramidase [Thermoanaerobacterium xylanolyticum LX-11]
Length = 806
Score = 94.4 bits (233), Expect = 8e-18, Method: Composition-based stats.
Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQL G + A + I P ++ +AL I+ NV FE G MGAVNGM+P+G D
Sbjct: 665 MADQLAGQWY--ADILRLGDILPKDHVQKALKKIYEFNVMKFENGKMGAVNGMRPDGIVD 722
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYR 99
S IQ++EVWTGVTY L++ M Y G +EA+ TA G+Y+
Sbjct: 723 ESDIQAQEVWTGVTYALASFMKYRGMTEEAYNTAYGVYK 761
>gi|10047285|dbj|BAB13431.1| KIAA1605 protein [Homo sapiens]
Length = 922
Score = 94.4 bits (233), Expect = 9e-18, Method: Composition-based stats.
Identities = 46/87 (52%), Positives = 63/87 (72%), Gaps = 4/87 (4%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
M+DQ G FLKA G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G
Sbjct: 795 MSDQCAGQWFLKACGLGEGDTE-VFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHG 853
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYE 84
D+S++QS+EVW GV Y L+A M+ E
Sbjct: 854 VPDKSSVQSDEVWVGVVYGLAATMIQE 880
>gi|260813884|ref|XP_002601646.1| hypothetical protein BRAFLDRAFT_124315 [Branchiostoma floridae]
gi|229286945|gb|EEN57658.1| hypothetical protein BRAFLDRAFT_124315 [Branchiostoma floridae]
Length = 871
Score = 94.0 bits (232), Expect = 1e-17, Method: Composition-based stats.
Identities = 46/107 (42%), Positives = 69/107 (64%), Gaps = 9/107 (8%)
Query: 1 MADQLCGYLFLKASGVNTE---------AIFPSANIARALATIFSTNVRGFEAGSMGAVN 51
MA G+ +L+A G+ E + P ++A +L ++ NV F G+MGAVN
Sbjct: 713 MAGAGSGHWYLRACGLVPEDNEVMDSDREVLPLDHVASSLRKVYDFNVMQFHQGTMGAVN 772
Query: 52 GMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
GM+PNG +D +++QSEEVWTG+TY L+A+M+ EG V+E + TA G+Y
Sbjct: 773 GMRPNGKKDLTSMQSEEVWTGITYALAASMIQEGMVEEGFQTAYGVY 819
>gi|324500616|gb|ADY40283.1| Non-lysosomal glucosylceramidase [Ascaris suum]
Length = 890
Score = 92.4 bits (228), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 67/99 (67%), Gaps = 1/99 (1%)
Query: 1 MADQLCGYLFLKA-SGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR 59
MADQLCGY FL++ S + + P+A++ AL I++ NV F G +GAVNGM+ +G
Sbjct: 690 MADQLCGYWFLQSISPELADELLPAAHVKSALNIIYNYNVLRFAGGRLGAVNGMRRDGTV 749
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
DR+ IQ++E+WTGVTY L+A ++ +G V + TA G Y
Sbjct: 750 DRNYIQADEMWTGVTYALAAFLIQQGAVQRGFDTAWGCY 788
>gi|47212590|emb|CAG12815.1| unnamed protein product [Tetraodon nigroviridis]
Length = 777
Score = 92.0 bits (227), Expect = 4e-17, Method: Composition-based stats.
Identities = 50/114 (43%), Positives = 64/114 (56%), Gaps = 3/114 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
A CG+ + T FP I AL +IF NV F G MGAVNGM+P G DR
Sbjct: 641 ATGFCGHPVWETE--TTRLAFPKEKIQTALKSIFDLNVMSFAGGQMGAVNGMRPEGVPDR 698
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
S++QS+EVW GV Y L+A M++EG ++E TA G YR + E R+ PE
Sbjct: 699 SSVQSDEVWIGVVYGLAATMIHEGMLEEGMRTAEGCYRTVWE-RLGMAFQTPEA 751
>gi|327288418|ref|XP_003228923.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Anolis
carolinensis]
Length = 667
Score = 92.0 bits (227), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 44/88 (50%), Positives = 59/88 (67%), Gaps = 2/88 (2%)
Query: 1 MADQLCGYLFLKASGVN--TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
M+DQL G FL+A G+ +FP ++ AL TI+ NV GF G+MGAVNGM+P+G
Sbjct: 549 MSDQLAGQWFLRACGLGEGKYQVFPQEHVQSALRTIYEMNVLGFSEGAMGAVNGMRPDGV 608
Query: 59 RDRSAIQSEEVWTGVTYLLSAAMLYEGN 86
D S++QS+EVW GV Y L+A M+ EG
Sbjct: 609 PDTSSVQSDEVWVGVVYSLAATMIQEGQ 636
>gi|356535470|ref|XP_003536268.1| PREDICTED: non-lysosomal glucosylceramidase-like [Glycine max]
Length = 949
Score = 89.7 bits (221), Expect = 2e-16, Method: Composition-based stats.
Identities = 45/97 (46%), Positives = 63/97 (64%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ ++F + I AL ++ NV + G MGAVNGM PNG D
Sbjct: 759 ADQLAGQWYTASSGL--PSLFEDSKIKSALRKVYDFNVMKVKGGRMGAVNGMHPNGKVDE 816
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY L+A M++ G +EA+TTA G++
Sbjct: 817 TCMQSREVWTGVTYGLAATMIHAGMEEEAFTTAEGIF 853
>gi|449691194|ref|XP_004212588.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Hydra
magnipapillata]
Length = 169
Score = 89.4 bits (220), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 50/119 (42%), Positives = 69/119 (57%), Gaps = 4/119 (3%)
Query: 1 MADQLCGYLFLKASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 57
MA Q G +L A + + +F + L IF NV FE G MGAVNGM+ +G
Sbjct: 17 MAGQCAGQWYLHACDLGQTEDDLVFRPEQVHSTLKVIFQNNVMRFEDGFMGAVNGMRADG 76
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEGV 116
D++++Q+EEVWTGVTY L+A M+ G V E + TA G+YR + E G +T PE +
Sbjct: 77 TVDKNSLQAEEVWTGVTYALAATMIQHGMVREGFRTAEGIYRTVYEKWGLGFQT-PEAI 134
>gi|168014916|ref|XP_001759997.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688747|gb|EDQ75122.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 970
Score = 89.0 bits (219), Expect = 4e-16, Method: Composition-based stats.
Identities = 46/97 (47%), Positives = 62/97 (63%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQ+ G + ASG+ +F A AL IF NV + G +GAVNGM PNG D
Sbjct: 772 ADQMAGQWYAWASGL--PPLFDDAKARSALQKIFDFNVMKVKNGHLGAVNGMHPNGKVDE 829
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY L+AAM++EG +D+A+ A G++
Sbjct: 830 TCMQSREVWTGVTYALAAAMIHEGMLDQAFRAAEGVF 866
>gi|218199525|gb|EEC81952.1| hypothetical protein OsI_25831 [Oryza sativa Indica Group]
Length = 850
Score = 88.2 bits (217), Expect = 6e-16, Method: Composition-based stats.
Identities = 46/97 (47%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ +F I L IF NV + G MGAVNGM PNG D
Sbjct: 661 ADQLAGQWYTASSGL--PPLFDEGRIKCTLQKIFDYNVMRVKGGRMGAVNGMHPNGKVDE 718
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+WTGVTY L+A ML G D+A+TTA G++
Sbjct: 719 TCMQSREIWTGVTYSLAATMLLHGMDDQAFTTAEGIF 755
>gi|358254610|dbj|GAA55931.1| non-lysosomal glucosylceramidase [Clonorchis sinensis]
Length = 1200
Score = 87.8 bits (216), Expect = 7e-16, Method: Composition-based stats.
Identities = 44/98 (44%), Positives = 57/98 (58%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQL G+ FL+ G AI PS ++ AL TI STN + GS+G +NG P RD
Sbjct: 713 MADQLSGHGFLRVGGAPPGAILPSEHVIMALKTIQSTNWESIQNGSLGVINGGLPGAKRD 772
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
S +Q+EE W GV Y L+A M+ EG E ++ A Y
Sbjct: 773 LSNVQAEEFWVGVNYSLAATMILEGMSQEGFSLAEACY 810
>gi|357116998|ref|XP_003560263.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
distachyon]
Length = 932
Score = 87.8 bits (216), Expect = 7e-16, Method: Composition-based stats.
Identities = 46/97 (47%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ +F I L IF NV + G MGAVNGM PNG D
Sbjct: 744 ADQLAGQWYAASSGL--PPLFDEGRIKCTLQKIFDYNVMRVKGGRMGAVNGMYPNGKVDE 801
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+WTGVTY L+A ML G D+A+TTA G++
Sbjct: 802 TCMQSREIWTGVTYSLAATMLLHGMEDQAFTTAEGIF 838
>gi|390344669|ref|XP_001200431.2| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Strongylocentrotus purpuratus]
Length = 347
Score = 87.8 bits (216), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 43/100 (43%), Positives = 63/100 (63%), Gaps = 2/100 (2%)
Query: 1 MADQLCGYLFLKASGVNTE--AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
MADQ CG+ +L+A + E +F ++ AL TIF NV G + G+ GA+NG++P+G
Sbjct: 193 MADQTCGHWYLRACDLVPERDQVFQVDHVRSALRTIFDMNVMGVKEGNFGAMNGIRPSGK 252
Query: 59 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
D +++Q EEVW G TY L+ M+ EG +E + TA G Y
Sbjct: 253 PDHTSLQGEEVWVGTTYGLAGNMIQEGMWEEGFRTAKGCY 292
>gi|222636955|gb|EEE67087.1| hypothetical protein OsJ_24071 [Oryza sativa Japonica Group]
Length = 838
Score = 87.4 bits (215), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 46/97 (47%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ +F I L IF NV + G MGAVNGM PNG D
Sbjct: 649 ADQLAGQWYTASSGL--PPLFDEGRIKCTLQKIFDYNVMRVKGGRMGAVNGMHPNGKVDE 706
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+WTGVTY L+A ML G D+A+TTA G++
Sbjct: 707 TCMQSREIWTGVTYSLAATMLLHGMDDQAFTTAEGIF 743
>gi|224072655|ref|XP_002303825.1| predicted protein [Populus trichocarpa]
gi|222841257|gb|EEE78804.1| predicted protein [Populus trichocarpa]
Length = 966
Score = 87.0 bits (214), Expect = 1e-15, Method: Composition-based stats.
Identities = 43/97 (44%), Positives = 61/97 (62%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G ++ +SG+ ++F I AL I+ NV G MGAVNGM PNG D
Sbjct: 776 ADQLAGEWYMASSGL--PSLFDDVKIRSALNKIYDFNVMKVRGGKMGAVNGMHPNGKVDE 833
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+W+GVTY ++A M+ G D+A+TTA G++
Sbjct: 834 TCMQSREIWSGVTYAVAATMILSGMEDKAFTTAEGIF 870
>gi|302773445|ref|XP_002970140.1| hypothetical protein SELMODRAFT_92385 [Selaginella moellendorffii]
gi|300162651|gb|EFJ29264.1| hypothetical protein SELMODRAFT_92385 [Selaginella moellendorffii]
Length = 983
Score = 86.7 bits (213), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 63/97 (64%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQ+ G + ASG+ +F AL I+ NV + G +GAVNGM PNG D
Sbjct: 785 ADQMAGQWYAWASGL--PPLFDDYKARSALQRIYDYNVMKVKGGKLGAVNGMHPNGRVDE 842
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+A+QS E+W+GVTY +AAM++EG +++A+TTA G+Y
Sbjct: 843 TAMQSREIWSGVTYAAAAAMIHEGMLEQAFTTAEGIY 879
>gi|302807226|ref|XP_002985326.1| hypothetical protein SELMODRAFT_121820 [Selaginella moellendorffii]
gi|300147154|gb|EFJ13820.1| hypothetical protein SELMODRAFT_121820 [Selaginella moellendorffii]
Length = 983
Score = 86.7 bits (213), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 63/97 (64%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQ+ G + ASG+ +F AL I+ NV + G +GAVNGM PNG D
Sbjct: 785 ADQMAGQWYAWASGL--PPLFDDYKARSALQRIYDYNVMKVKGGKLGAVNGMHPNGRVDE 842
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+A+QS E+W+GVTY +AAM++EG +++A+TTA G+Y
Sbjct: 843 TAMQSREIWSGVTYAAAAAMIHEGMLEQAFTTAEGIY 879
>gi|326489639|dbj|BAK01800.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 797
Score = 86.7 bits (213), Expect = 2e-15, Method: Composition-based stats.
Identities = 45/97 (46%), Positives = 62/97 (63%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + KA G+++ I AL I++ NV F+ G+ GA+NGM P+G D
Sbjct: 603 ADQLAGQWYAKACGLSS--IVDKDKSQSALEKIYAFNVMKFKDGNRGAINGMWPDGTLDM 660
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
S +QS E+W GVTY L+A+M+ EG VDE + TA G+Y
Sbjct: 661 STMQSREIWPGVTYALAASMIQEGMVDEGFKTAEGIY 697
>gi|147788372|emb|CAN61187.1| hypothetical protein VITISV_019326 [Vitis vinifera]
Length = 900
Score = 86.3 bits (212), Expect = 2e-15, Method: Composition-based stats.
Identities = 43/97 (44%), Positives = 61/97 (62%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ ++F I +L I+ NV + G MGAVNGM PNG D
Sbjct: 675 ADQLAGQWYTASSGL--PSLFDDCKIKSSLHKIYDFNVMKVKGGKMGAVNGMHPNGKVDE 732
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
S +QS E+WTGVTY ++A M+ G ++A+TTA G++
Sbjct: 733 SCMQSREIWTGVTYGVAATMILSGMEEQAFTTAEGIF 769
>gi|414589150|tpg|DAA39721.1| TPA: hypothetical protein ZEAMMB73_266944 [Zea mays]
Length = 583
Score = 86.3 bits (212), Expect = 2e-15, Method: Composition-based stats.
Identities = 46/97 (47%), Positives = 58/97 (59%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ +F I L IF NV + G MGAVNGM PNG D
Sbjct: 395 ADQLAGQWYTASSGL--PPLFDEDRIKCTLQKIFDYNVMRVKGGRMGAVNGMHPNGKVDE 452
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+WTGVTY L+A ML G +A+TTA G+Y
Sbjct: 453 TCMQSREIWTGVTYSLAATMLLHGMEHQAFTTAEGIY 489
>gi|242070593|ref|XP_002450573.1| hypothetical protein SORBIDRAFT_05g007160 [Sorghum bicolor]
gi|241936416|gb|EES09561.1| hypothetical protein SORBIDRAFT_05g007160 [Sorghum bicolor]
Length = 951
Score = 86.3 bits (212), Expect = 2e-15, Method: Composition-based stats.
Identities = 47/100 (47%), Positives = 64/100 (64%), Gaps = 8/100 (8%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARA---LATIFSTNVRGFEAGSMGAVNGMKPNGD 58
ADQL G + KA G +FP + +A L IFS NV F+ G+ GA+NGM P+G
Sbjct: 757 ADQLAGQWYAKACG-----LFPIVDKDKAQSALEKIFSFNVMKFKDGTRGAMNGMWPDGT 811
Query: 59 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
D SA+QS E+W GVTY L+A+M+ EG V++ + TA G+Y
Sbjct: 812 VDMSAMQSREIWPGVTYALAASMIQEGMVEQGFKTAEGIY 851
>gi|16183333|gb|AAL13692.1| GH26380p [Drosophila melanogaster]
Length = 391
Score = 85.9 bits (211), Expect = 3e-15, Method: Composition-based stats.
Identities = 44/92 (47%), Positives = 56/92 (60%), Gaps = 8/92 (8%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN---- 56
MADQLCG+ +LK+ G + E I+P N+ AL I+ NV GF G++GA NG N
Sbjct: 280 MADQLCGHWYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEP 338
Query: 57 ---GDRDRSAIQSEEVWTGVTYLLSAAMLYEG 85
G D S IQ+EEVW GV Y L+A M+ EG
Sbjct: 339 TKPGHVDNSNIQAEEVWPGVVYALAATMIQEG 370
>gi|253761389|ref|XP_002489100.1| hypothetical protein SORBIDRAFT_0073s002050 [Sorghum bicolor]
gi|241947399|gb|EES20544.1| hypothetical protein SORBIDRAFT_0073s002050 [Sorghum bicolor]
Length = 603
Score = 85.9 bits (211), Expect = 3e-15, Method: Composition-based stats.
Identities = 46/97 (47%), Positives = 58/97 (59%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ +F I L IF NV + G MGAVNGM PNG D
Sbjct: 415 ADQLAGQWYTASSGL--PPLFDEDRIKCTLQKIFDYNVMRVKGGRMGAVNGMHPNGKVDD 472
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+WTGVTY L+A ML G +A+TTA G+Y
Sbjct: 473 TCMQSREIWTGVTYSLAATMLLHGMEHQAFTTAEGIY 509
>gi|225433442|ref|XP_002285674.1| PREDICTED: non-lysosomal glucosylceramidase-like [Vitis vinifera]
Length = 978
Score = 85.5 bits (210), Expect = 4e-15, Method: Composition-based stats.
Identities = 43/97 (44%), Positives = 61/97 (62%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ ++F I +L I+ NV + G MGAVNGM PNG D
Sbjct: 789 ADQLAGQWYTASSGL--PSLFDDYKIKSSLHKIYDFNVMKVKGGKMGAVNGMHPNGKVDE 846
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
S +QS E+WTGVTY ++A M+ G ++A+TTA G++
Sbjct: 847 SCMQSREIWTGVTYGVAATMILSGMEEQAFTTAEGIF 883
>gi|413920603|gb|AFW60535.1| hypothetical protein ZEAMMB73_659971 [Zea mays]
Length = 951
Score = 85.1 bits (209), Expect = 5e-15, Method: Composition-based stats.
Identities = 45/100 (45%), Positives = 64/100 (64%), Gaps = 8/100 (8%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARA---LATIFSTNVRGFEAGSMGAVNGMKPNGD 58
ADQL G + KA G +FP + +A L ++S NV F+ G+ GA+NGM P+G
Sbjct: 757 ADQLAGQWYAKACG-----LFPIVDKDKAQSALEKVYSFNVMKFKGGTRGAMNGMWPDGT 811
Query: 59 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
D SA+QS E+W GVTY L+A+M+ EG V++ + TA G+Y
Sbjct: 812 VDMSAMQSREIWPGVTYALAASMIQEGMVEQGFKTAEGVY 851
>gi|413941636|gb|AFW74285.1| hypothetical protein ZEAMMB73_466373 [Zea mays]
Length = 966
Score = 84.7 bits (208), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 64/98 (65%), Gaps = 2/98 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQL G + +A G+ E + A ALAT+ NV + G++GAVNGM+P+G D
Sbjct: 763 MADQLAGQWYARACGL--EPVVEDAKARSALATVLDYNVMRVKGGAIGAVNGMRPDGAVD 820
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
S+ QS+EVW GVTY ++AAM++EG + A+ TA G +
Sbjct: 821 ASSTQSKEVWPGVTYAVAAAMVHEGMTEAAFRTAKGAH 858
>gi|115474457|ref|NP_001060825.1| Os08g0111200 [Oryza sativa Japonica Group]
gi|42408344|dbj|BAD09496.1| putative Bile acid beta-glucosidase [Oryza sativa Japonica Group]
gi|42408391|dbj|BAD09542.1| putative Bile acid beta-glucosidase [Oryza sativa Japonica Group]
gi|113622794|dbj|BAF22739.1| Os08g0111200 [Oryza sativa Japonica Group]
gi|215701490|dbj|BAG92914.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 928
Score = 84.7 bits (208), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 45/97 (46%), Positives = 64/97 (65%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +A G+ E I RALAT+ NV + G++GAVNGM+P+G D
Sbjct: 730 ADQLAGQWYARACGL--EPIVDGDKARRALATVLDYNVMRVKGGAIGAVNGMRPDGAVDA 787
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
S+ QS+EVW GVTY ++AAM++EG + A+ TA G++
Sbjct: 788 SSTQSKEVWPGVTYAVAAAMIHEGMPEAAFKTAKGIH 824
>gi|168037390|ref|XP_001771187.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677567|gb|EDQ64036.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 981
Score = 84.7 bits (208), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + ASG+ +F AL +F NV + G GA NGM PNG D
Sbjct: 767 ADQLAGQWYAWASGL--PPLFDDYKARSALQKVFDFNVMKVKGGKWGAANGMHPNGKVDE 824
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+WTGVTY SAAM++EG V++A+T A G++
Sbjct: 825 TCMQSREIWTGVTYAASAAMIHEGMVEQAFTAAQGVF 861
>gi|357440839|ref|XP_003590697.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
gi|355479745|gb|AES60948.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
Length = 1103
Score = 84.3 bits (207), Expect = 8e-15, Method: Composition-based stats.
Identities = 42/97 (43%), Positives = 60/97 (61%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ ++F I +L +F NV + G MGAVNGM PNG D
Sbjct: 914 ADQLAGQWYTASSGL--PSLFDDFKIKSSLRKVFDFNVMKVKGGRMGAVNGMHPNGKVDE 971
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+W GVTY ++A M+ G +EA+TTA G++
Sbjct: 972 TCMQSREIWAGVTYGVAATMILAGMEEEAFTTAEGIF 1008
>gi|297809209|ref|XP_002872488.1| AT4g10060/T5L19_190 [Arabidopsis lyrata subsp. lyrata]
gi|297318325|gb|EFH48747.1| AT4g10060/T5L19_190 [Arabidopsis lyrata subsp. lyrata]
Length = 921
Score = 84.3 bits (207), Expect = 8e-15, Method: Composition-based stats.
Identities = 45/116 (38%), Positives = 66/116 (56%), Gaps = 3/116 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
+ADQL G + +A G+ + I I +AL TI+ NV + G+ GAVNGM P+G D
Sbjct: 719 LADQLAGQWYARACGL--KPITKEEWIKKALETIYEFNVMKVKGGTRGAVNGMSPDGQVD 776
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCE-TAPEG 115
+++ S+EVW G TY ++A M+ EG + + TA G+Y + DR C PE
Sbjct: 777 TNSLVSKEVWAGTTYSVAACMIQEGQRERGFQTASGIYEAVWSDRGLSCSFQTPEA 832
>gi|356576421|ref|XP_003556330.1| PREDICTED: non-lysosomal glucosylceramidase-like [Glycine max]
Length = 950
Score = 84.3 bits (207), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ +F + I AL ++ NV + G MGAVNGM PNG D
Sbjct: 760 ADQLAGQWYTASSGL--PPLFEDSKIKSALRKVYDFNVMKVKGGRMGAVNGMHPNGKVDE 817
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY L+A M+ G +EA+ TA G++
Sbjct: 818 TCMQSREVWTGVTYGLAATMILAGMEEEAFATAEGIF 854
>gi|62733283|gb|AAX95400.1| At5g49900 [Oryza sativa Japonica Group]
Length = 931
Score = 84.0 bits (206), Expect = 1e-14, Method: Composition-based stats.
Identities = 46/100 (46%), Positives = 62/100 (62%), Gaps = 8/100 (8%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARA---LATIFSTNVRGFEAGSMGAVNGMKPNGD 58
ADQL G + KA G +FP + +A L I+S NV F+ G GA+NGM P+G
Sbjct: 738 ADQLAGQWYAKACG-----LFPIVDKDKAESALEKIYSFNVMKFKDGKRGAMNGMWPDGT 792
Query: 59 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
D SA+QS E+W GVTY L+A M+ EG V++ + TA G+Y
Sbjct: 793 VDMSAMQSREIWPGVTYALAATMIQEGMVEKGFKTAEGIY 832
>gi|357157041|ref|XP_003577664.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
distachyon]
Length = 951
Score = 83.6 bits (205), Expect = 1e-14, Method: Composition-based stats.
Identities = 44/97 (45%), Positives = 61/97 (62%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + K+ G+++ I AL I+S NV F+ G GA+NGM P+G D
Sbjct: 757 ADQLAGQWYAKSCGLSS--IVDKDKSQSALEKIYSFNVMKFKDGKRGAMNGMWPDGTVDM 814
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
S +QS E+W GVTY L+A+M+ EG V+E + TA G+Y
Sbjct: 815 STMQSREIWPGVTYALAASMIQEGMVEEGFKTAEGIY 851
>gi|357440847|ref|XP_003590701.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
gi|355479749|gb|AES60952.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
Length = 992
Score = 83.6 bits (205), Expect = 1e-14, Method: Composition-based stats.
Identities = 42/97 (43%), Positives = 60/97 (61%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ ++F I +L +F NV + G MGAVNGM PNG D
Sbjct: 803 ADQLAGQWYTASSGL--PSLFDDFKIKSSLRKVFDFNVMKVKGGRMGAVNGMHPNGKVDE 860
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+W GVTY ++A M+ G +EA+TTA G++
Sbjct: 861 TCMQSREIWAGVTYGVAATMILAGMEEEAFTTAEGIF 897
>gi|326517703|dbj|BAK03770.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 821
Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 63/98 (64%), Gaps = 2/98 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQL G + +A G+ E I AL T+ NV + G++GAVNGM+P+G D
Sbjct: 615 MADQLAGQWYARACGL--EPIVEEEKARSALGTVLDYNVMRVQGGAVGAVNGMRPDGAVD 672
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
S++QS+EVW GVTY ++AAML+EG + A+ TA G +
Sbjct: 673 ASSLQSKEVWVGVTYGVAAAMLHEGMTEAAFRTAKGAH 710
>gi|115484889|ref|NP_001067588.1| Os11g0242100 [Oryza sativa Japonica Group]
gi|77549531|gb|ABA92328.1| expressed protein [Oryza sativa Japonica Group]
gi|113644810|dbj|BAF27951.1| Os11g0242100 [Oryza sativa Japonica Group]
gi|215704397|dbj|BAG93831.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222615769|gb|EEE51901.1| hypothetical protein OsJ_33494 [Oryza sativa Japonica Group]
Length = 950
Score = 83.6 bits (205), Expect = 1e-14, Method: Composition-based stats.
Identities = 46/100 (46%), Positives = 62/100 (62%), Gaps = 8/100 (8%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARA---LATIFSTNVRGFEAGSMGAVNGMKPNGD 58
ADQL G + KA G +FP + +A L I+S NV F+ G GA+NGM P+G
Sbjct: 757 ADQLAGQWYAKACG-----LFPIVDKDKAESALEKIYSFNVMKFKDGKRGAMNGMWPDGT 811
Query: 59 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
D SA+QS E+W GVTY L+A M+ EG V++ + TA G+Y
Sbjct: 812 VDMSAMQSREIWPGVTYALAATMIQEGMVEKGFKTAEGIY 851
>gi|218185509|gb|EEC67936.1| hypothetical protein OsI_35656 [Oryza sativa Indica Group]
Length = 950
Score = 83.6 bits (205), Expect = 1e-14, Method: Composition-based stats.
Identities = 46/100 (46%), Positives = 62/100 (62%), Gaps = 8/100 (8%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARA---LATIFSTNVRGFEAGSMGAVNGMKPNGD 58
ADQL G + KA G +FP + +A L I+S NV F+ G GA+NGM P+G
Sbjct: 757 ADQLAGQWYAKACG-----LFPIVDKDKAESALEKIYSFNVMKFKDGKRGAMNGMWPDGT 811
Query: 59 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
D SA+QS E+W GVTY L+A M+ EG V++ + TA G+Y
Sbjct: 812 VDMSAMQSREIWPGVTYALAATMIQEGMVEKGFKTAEGIY 851
>gi|188570649|gb|ACD64360.1| hypothetical protein [Helianthus petiolaris]
Length = 210
Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + ++G+ +F I +L IF NV G MGAVNGM PNG D
Sbjct: 70 ADQLAGQWYTASAGLPN--LFDDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY ++A M++ G D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 164
>gi|188570667|gb|ACD64369.1| hypothetical protein [Bahiopsis lanata]
gi|188570669|gb|ACD64370.1| hypothetical protein [Bahiopsis lanata]
Length = 208
Score = 83.6 bits (205), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + ++G+ +F I +L IF NV G MGAVNGM PNG D
Sbjct: 68 ADQLAGQWYTASAGLPN--LFGDDKIQSSLHKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 125
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY ++A M++ G D+A+TTA G++
Sbjct: 126 TCMQSREVWTGVTYGVAATMIHAGMEDKAFTTAEGIF 162
>gi|292384308|gb|ADE21407.1| unknown [Helianthus neglectus]
Length = 205
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + ++G+ +F I +L IF NV G MGAVNGM PNG D
Sbjct: 67 ADQLAGQWYTASAGLPN--LFDDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 124
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY ++A M++ G D A+TTA G++
Sbjct: 125 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 161
>gi|440799031|gb|ELR20092.1| glucosylceramidase [Acanthamoeba castellanii str. Neff]
Length = 860
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 62/104 (59%), Gaps = 5/104 (4%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
ADQLCG + +A G+ + I P I AL T+F+ N F G GAVNGM+P+G D
Sbjct: 711 QADQLCGQWYARACGLPS--IAPDKRIVTALKTVFANN---FNEGKRGAVNGMRPDGRVD 765
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ + S E W G T+ L+A M+ EG VD+A+ TA G+ + ED
Sbjct: 766 YACLHSSEAWPGTTFALAACMMGEGLVDQAFITAAGVCGSIYED 809
>gi|188570531|gb|ACD64301.1| hypothetical protein [Helianthus annuus]
Length = 210
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + ++G+ +F I +L IF+ NV G MGAVNGM PNG D
Sbjct: 70 ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFNFNVMKVRGGRMGAVNGMHPNGKVDE 127
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY ++A M++ G D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 164
>gi|292384324|gb|ADE21415.1| unknown [Helianthus neglectus]
Length = 210
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + ++G+ +F I +L IF NV G MGAVNGM PNG D
Sbjct: 70 ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY ++A M++ G D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 164
>gi|8777421|dbj|BAA97011.1| unnamed protein product [Arabidopsis thaliana]
Length = 928
Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats.
Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +ASG+ I AL +++ NV + G GAVNGM PNG D
Sbjct: 726 ADQLAGQWYARASGL--LPIVDEDKARTALEKVYNYNVMKIKDGKRGAVNGMHPNGKVDT 783
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+++QS E+W+GVTY LSA M+ EG V+ A+ TA G+Y
Sbjct: 784 ASMQSREIWSGVTYALSATMIQEGLVEMAFQTASGIY 820
>gi|188570631|gb|ACD64351.1| hypothetical protein [Helianthus petiolaris]
Length = 210
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + ++G+ +F I +L IF NV G MGAVNGM PNG D
Sbjct: 70 ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY ++A M++ G D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 164
>gi|292384310|gb|ADE21408.1| unknown [Helianthus neglectus]
Length = 210
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + ++G+ +F I +L IF NV G MGAVNGM PNG D
Sbjct: 70 ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY ++A M++ G D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 164
>gi|255554158|ref|XP_002518119.1| conserved hypothetical protein [Ricinus communis]
gi|223542715|gb|EEF44252.1| conserved hypothetical protein [Ricinus communis]
Length = 968
Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats.
Identities = 41/97 (42%), Positives = 58/97 (59%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G ++ +SG+ +F + I L I+ NV G MGAVNGM PNG D
Sbjct: 779 ADQLAGQWYVASSGL--PPLFDDSKIKSTLQKIYDFNVMKVRGGRMGAVNGMHPNGKVDE 836
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+WTGVTY ++A M+ G D+A+ A G++
Sbjct: 837 TCMQSREIWTGVTYAVAATMILAGMEDKAFAAAEGIF 873
>gi|188570647|gb|ACD64359.1| hypothetical protein [Helianthus petiolaris]
Length = 210
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + ++G+ +F I +L IF NV G MGAVNGM PNG D
Sbjct: 70 ADQLAGQWYTASAGLPN--LFGDDKIPSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY ++A M++ G D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 164
>gi|188570671|gb|ACD64371.1| hypothetical protein [Bahiopsis reticulata]
gi|188570673|gb|ACD64372.1| hypothetical protein [Bahiopsis reticulata]
Length = 210
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + ++G+ +F I +L IF NV G MGAVNGM PNG D
Sbjct: 70 ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY ++A M++ G D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDSAFTTAEGIF 164
>gi|188570635|gb|ACD64353.1| hypothetical protein [Helianthus petiolaris]
gi|188570639|gb|ACD64355.1| hypothetical protein [Helianthus petiolaris]
Length = 211
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + ++G+ +F I +L IF NV G MGAVNGM PNG D
Sbjct: 71 ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 128
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY ++A M++ G D A+TTA G++
Sbjct: 129 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 165
>gi|188570553|gb|ACD64312.1| hypothetical protein [Helianthus annuus]
Length = 210
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + ++G+ +F I +L IF NV G MGAVNGM PNG D
Sbjct: 70 ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY ++A M++ G D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 164
>gi|188570503|gb|ACD64287.1| hypothetical protein [Helianthus annuus]
gi|188570505|gb|ACD64288.1| hypothetical protein [Helianthus annuus]
gi|188570507|gb|ACD64289.1| hypothetical protein [Helianthus annuus]
gi|188570509|gb|ACD64290.1| hypothetical protein [Helianthus annuus]
gi|188570513|gb|ACD64292.1| hypothetical protein [Helianthus annuus]
gi|188570515|gb|ACD64293.1| hypothetical protein [Helianthus annuus]
gi|188570517|gb|ACD64294.1| hypothetical protein [Helianthus annuus]
gi|188570519|gb|ACD64295.1| hypothetical protein [Helianthus annuus]
gi|188570521|gb|ACD64296.1| hypothetical protein [Helianthus annuus]
gi|188570523|gb|ACD64297.1| hypothetical protein [Helianthus annuus]
gi|188570525|gb|ACD64298.1| hypothetical protein [Helianthus annuus]
gi|188570527|gb|ACD64299.1| hypothetical protein [Helianthus annuus]
gi|188570529|gb|ACD64300.1| hypothetical protein [Helianthus annuus]
gi|188570533|gb|ACD64302.1| hypothetical protein [Helianthus annuus]
gi|188570535|gb|ACD64303.1| hypothetical protein [Helianthus annuus]
gi|188570537|gb|ACD64304.1| hypothetical protein [Helianthus annuus]
gi|188570539|gb|ACD64305.1| hypothetical protein [Helianthus annuus]
gi|188570541|gb|ACD64306.1| hypothetical protein [Helianthus annuus]
gi|188570543|gb|ACD64307.1| hypothetical protein [Helianthus annuus]
gi|188570545|gb|ACD64308.1| hypothetical protein [Helianthus annuus]
gi|188570547|gb|ACD64309.1| hypothetical protein [Helianthus annuus]
gi|188570549|gb|ACD64310.1| hypothetical protein [Helianthus annuus]
gi|188570551|gb|ACD64311.1| hypothetical protein [Helianthus annuus]
gi|188570555|gb|ACD64313.1| hypothetical protein [Helianthus annuus]
gi|188570557|gb|ACD64314.1| hypothetical protein [Helianthus annuus]
gi|188570559|gb|ACD64315.1| hypothetical protein [Helianthus annuus]
gi|188570561|gb|ACD64316.1| hypothetical protein [Helianthus annuus]
gi|188570563|gb|ACD64317.1| hypothetical protein [Helianthus annuus]
gi|188570565|gb|ACD64318.1| hypothetical protein [Helianthus annuus]
gi|188570567|gb|ACD64319.1| hypothetical protein [Helianthus annuus]
gi|188570569|gb|ACD64320.1| hypothetical protein [Helianthus annuus]
gi|188570571|gb|ACD64321.1| hypothetical protein [Helianthus annuus]
gi|188570573|gb|ACD64322.1| hypothetical protein [Helianthus annuus]
gi|188570575|gb|ACD64323.1| hypothetical protein [Helianthus annuus]
gi|188570577|gb|ACD64324.1| hypothetical protein [Helianthus annuus]
gi|188570579|gb|ACD64325.1| hypothetical protein [Helianthus annuus]
gi|188570581|gb|ACD64326.1| hypothetical protein [Helianthus annuus]
gi|188570583|gb|ACD64327.1| hypothetical protein [Helianthus annuus]
gi|188570585|gb|ACD64328.1| hypothetical protein [Helianthus annuus]
gi|188570587|gb|ACD64329.1| hypothetical protein [Helianthus annuus]
gi|188570589|gb|ACD64330.1| hypothetical protein [Helianthus annuus]
gi|188570591|gb|ACD64331.1| hypothetical protein [Helianthus annuus]
gi|188570593|gb|ACD64332.1| hypothetical protein [Helianthus annuus]
gi|188570595|gb|ACD64333.1| hypothetical protein [Helianthus annuus]
gi|188570597|gb|ACD64334.1| hypothetical protein [Helianthus annuus]
gi|188570599|gb|ACD64335.1| hypothetical protein [Helianthus annuus]
gi|188570601|gb|ACD64336.1| hypothetical protein [Helianthus annuus]
gi|188570603|gb|ACD64337.1| hypothetical protein [Helianthus petiolaris]
gi|188570605|gb|ACD64338.1| hypothetical protein [Helianthus petiolaris]
gi|188570607|gb|ACD64339.1| hypothetical protein [Helianthus petiolaris]
gi|188570609|gb|ACD64340.1| hypothetical protein [Helianthus petiolaris]
gi|188570611|gb|ACD64341.1| hypothetical protein [Helianthus petiolaris]
gi|188570613|gb|ACD64342.1| hypothetical protein [Helianthus petiolaris]
gi|188570615|gb|ACD64343.1| hypothetical protein [Helianthus petiolaris]
gi|188570617|gb|ACD64344.1| hypothetical protein [Helianthus petiolaris]
gi|188570619|gb|ACD64345.1| hypothetical protein [Helianthus petiolaris]
gi|188570625|gb|ACD64348.1| hypothetical protein [Helianthus petiolaris]
gi|188570627|gb|ACD64349.1| hypothetical protein [Helianthus petiolaris]
gi|188570629|gb|ACD64350.1| hypothetical protein [Helianthus petiolaris]
gi|188570633|gb|ACD64352.1| hypothetical protein [Helianthus petiolaris]
gi|188570637|gb|ACD64354.1| hypothetical protein [Helianthus petiolaris]
gi|188570641|gb|ACD64356.1| hypothetical protein [Helianthus petiolaris]
gi|188570643|gb|ACD64357.1| hypothetical protein [Helianthus petiolaris]
gi|188570645|gb|ACD64358.1| hypothetical protein [Helianthus petiolaris]
gi|188570651|gb|ACD64361.1| hypothetical protein [Helianthus petiolaris]
gi|188570653|gb|ACD64362.1| hypothetical protein [Helianthus petiolaris]
gi|188570655|gb|ACD64363.1| hypothetical protein [Helianthus petiolaris]
gi|188570657|gb|ACD64364.1| hypothetical protein [Helianthus petiolaris]
gi|188570659|gb|ACD64365.1| hypothetical protein [Helianthus petiolaris]
gi|188570661|gb|ACD64366.1| hypothetical protein [Helianthus petiolaris]
gi|188570663|gb|ACD64367.1| hypothetical protein [Helianthus petiolaris]
gi|188570665|gb|ACD64368.1| hypothetical protein [Helianthus petiolaris]
gi|292384298|gb|ADE21402.1| unknown [Helianthus neglectus]
gi|292384300|gb|ADE21403.1| unknown [Helianthus neglectus]
gi|292384302|gb|ADE21404.1| unknown [Helianthus neglectus]
gi|292384304|gb|ADE21405.1| unknown [Helianthus neglectus]
gi|292384312|gb|ADE21409.1| unknown [Helianthus neglectus]
gi|292384318|gb|ADE21412.1| unknown [Helianthus neglectus]
gi|292384320|gb|ADE21413.1| unknown [Helianthus neglectus]
gi|292384328|gb|ADE21417.1| unknown [Helianthus neglectus]
Length = 210
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + ++G+ +F I +L IF NV G MGAVNGM PNG D
Sbjct: 70 ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY ++A M++ G D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 164
>gi|292384322|gb|ADE21414.1| unknown [Helianthus neglectus]
Length = 207
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + ++G+ +F I +L IF NV G MGAVNGM PNG D
Sbjct: 67 ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 124
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY ++A M++ G D A+TTA G++
Sbjct: 125 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 161
>gi|188570511|gb|ACD64291.1| hypothetical protein [Helianthus annuus]
Length = 210
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + ++G+ +F I +L IF NV G MGAVNGM PNG D
Sbjct: 70 ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY ++A M++ G D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 164
>gi|292384326|gb|ADE21416.1| unknown [Helianthus neglectus]
Length = 210
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + ++G+ +F I +L IF NV G MGAVNGM PNG D
Sbjct: 70 ADQLAGQWYTASAGLPN--LFGEDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY ++A M++ G D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 164
>gi|414871104|tpg|DAA49661.1| TPA: hypothetical protein ZEAMMB73_422318 [Zea mays]
Length = 649
Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats.
Identities = 43/97 (44%), Positives = 57/97 (58%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ +F I AL IF NV + G MGAVNGM P G D
Sbjct: 460 ADQLAGQWYAASSGL--PPLFDEHKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDE 517
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+WTGVTY ++A ML G + +TTA G++
Sbjct: 518 TCMQSREIWTGVTYAVAANMLLHGMEHQGFTTAEGIF 554
>gi|224028315|gb|ACN33233.1| unknown [Zea mays]
Length = 384
Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats.
Identities = 43/97 (44%), Positives = 57/97 (58%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ +F I AL IF NV + G MGAVNGM P G D
Sbjct: 255 ADQLAGQWYAASSGL--PPLFDEHKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDE 312
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+WTGVTY ++A ML G + +TTA G++
Sbjct: 313 TCMQSREIWTGVTYAVAANMLLHGMEHQGFTTAEGIF 349
>gi|292384316|gb|ADE21411.1| unknown [Helianthus neglectus]
Length = 205
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + ++G+ +F I +L IF NV G MGAVNGM PNG D
Sbjct: 67 ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 124
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY ++A M++ G D A+TTA G++
Sbjct: 125 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 161
>gi|226499590|ref|NP_001146319.1| uncharacterized protein LOC100279895 [Zea mays]
gi|219886613|gb|ACL53681.1| unknown [Zea mays]
Length = 649
Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats.
Identities = 43/97 (44%), Positives = 57/97 (58%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ +F I AL IF NV + G MGAVNGM P G D
Sbjct: 460 ADQLAGQWYAASSGL--PPLFDEHKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDE 517
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+WTGVTY ++A ML G + +TTA G++
Sbjct: 518 TCMQSREIWTGVTYAVAANMLLHGMEHQGFTTAEGIF 554
>gi|292384306|gb|ADE21406.1| unknown [Helianthus neglectus]
Length = 205
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + ++G+ +F I +L IF NV G MGAVNGM PNG D
Sbjct: 67 ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 124
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY ++A M++ G D A+TTA G++
Sbjct: 125 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 161
>gi|292384314|gb|ADE21410.1| unknown [Helianthus neglectus]
Length = 205
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + ++G+ +F I +L IF NV G MGAVNGM PNG D
Sbjct: 67 ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 124
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY ++A M++ G D A+TTA G++
Sbjct: 125 TCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIF 161
>gi|110742221|dbj|BAE99037.1| hypothetical protein [Arabidopsis thaliana]
Length = 957
Score = 82.8 bits (203), Expect = 3e-14, Method: Composition-based stats.
Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +ASG+ I AL +++ NV + G GAVNGM PNG D
Sbjct: 755 ADQLAGQWYARASGL--LPIVDEDKARTALEKVYNYNVMKIKDGKRGAVNGMHPNGKVDT 812
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+++QS E+W+GVTY LSA M+ EG V+ A+ TA G+Y
Sbjct: 813 ASMQSREIWSGVTYALSATMIQEGLVEMAFQTASGIY 849
>gi|222423179|dbj|BAH19567.1| AT5G49900 [Arabidopsis thaliana]
Length = 957
Score = 82.8 bits (203), Expect = 3e-14, Method: Composition-based stats.
Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +ASG+ I AL +++ NV + G GAVNGM PNG D
Sbjct: 755 ADQLAGQWYARASGL--LPIVDEDKARTALEKVYNYNVMKIKDGKRGAVNGMHPNGKVDT 812
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+++QS E+W+GVTY LSA M+ EG V+ A+ TA G+Y
Sbjct: 813 ASMQSREIWSGVTYALSATMIQEGLVEMAFQTASGIY 849
>gi|30695777|ref|NP_199801.2| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|27311753|gb|AAO00842.1| putative protein [Arabidopsis thaliana]
gi|34365725|gb|AAQ65174.1| At5g49900 [Arabidopsis thaliana]
gi|332008486|gb|AED95869.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
Length = 957
Score = 82.8 bits (203), Expect = 3e-14, Method: Composition-based stats.
Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +ASG+ I AL +++ NV + G GAVNGM PNG D
Sbjct: 755 ADQLAGQWYARASGL--LPIVDEDKARTALEKVYNYNVMKIKDGKRGAVNGMHPNGKVDT 812
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+++QS E+W+GVTY LSA M+ EG V+ A+ TA G+Y
Sbjct: 813 ASMQSREIWSGVTYALSATMIQEGLVEMAFQTASGIY 849
>gi|414871105|tpg|DAA49662.1| TPA: hypothetical protein ZEAMMB73_422318 [Zea mays]
Length = 589
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 43/97 (44%), Positives = 57/97 (58%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ +F I AL IF NV + G MGAVNGM P G D
Sbjct: 460 ADQLAGQWYAASSGL--PPLFDEHKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDE 517
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+WTGVTY ++A ML G + +TTA G++
Sbjct: 518 TCMQSREIWTGVTYAVAANMLLHGMEHQGFTTAEGIF 554
>gi|297851790|ref|XP_002893776.1| hypothetical protein ARALYDRAFT_473529 [Arabidopsis lyrata subsp.
lyrata]
gi|297339618|gb|EFH70035.1| hypothetical protein ARALYDRAFT_473529 [Arabidopsis lyrata subsp.
lyrata]
Length = 947
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 45/109 (41%), Positives = 62/109 (56%), Gaps = 2/109 (1%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
+ADQL G + +A G+ + I I +AL T++ NV G+ GAVNGM P+G D
Sbjct: 746 LADQLAGQWYARACGL--KPIAKEEWIKKALETVYDFNVMRVRDGTRGAVNGMLPDGRVD 803
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGC 109
S + S EVW G TY ++A M+ EG D+ + TA G+Y DR GC
Sbjct: 804 TSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEAAWSDRGLGC 852
>gi|449442371|ref|XP_004138955.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
gi|449520930|ref|XP_004167485.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
Length = 993
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 43/97 (44%), Positives = 58/97 (59%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ +F I AL I+ NV G MGAVNGM PNG D
Sbjct: 803 ADQLAGQWYTASSGL--PPLFDDFKIKSALRKIYDFNVMKVRGGRMGAVNGMHPNGKIDE 860
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+WTGVTY ++A M+ G +EA+ TA G++
Sbjct: 861 TCMQSREIWTGVTYGVAATMILAGMEEEAFKTAEGIF 897
>gi|255569126|ref|XP_002525532.1| conserved hypothetical protein [Ricinus communis]
gi|223535211|gb|EEF36890.1| conserved hypothetical protein [Ricinus communis]
Length = 948
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 43/97 (44%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +A G+++ I + AL I+ NV + G GAVNGM P+G D
Sbjct: 749 ADQLAGQWYARACGLSS--IVDEEKVRSALEKIYKFNVLKVKEGKRGAVNGMLPDGKVDM 806
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
S +QS E+W GVTY LSA+M+ EG + A+ TA G+Y
Sbjct: 807 SVMQSREIWPGVTYALSASMIQEGMAEMAFQTASGIY 843
>gi|10440626|gb|AAG16864.1|AC069145_13 unknown protein [Oryza sativa Japonica Group]
Length = 444
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 43/97 (44%), Positives = 58/97 (59%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ +F I AL IF NV + G +GAVNGM PNG D
Sbjct: 255 ADQLAGQWYAASSGL--PPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDE 312
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+WTGVTY ++A ML G + +TTA G++
Sbjct: 313 TCMQSREIWTGVTYGVAANMLLHGMEHQGFTTAEGIF 349
>gi|222612993|gb|EEE51125.1| hypothetical protein OsJ_31868 [Oryza sativa Japonica Group]
Length = 935
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 43/97 (44%), Positives = 58/97 (59%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ +F I AL IF NV + G +GAVNGM PNG D
Sbjct: 746 ADQLAGQWYAASSGL--PPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDE 803
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+WTGVTY ++A ML G + +TTA G++
Sbjct: 804 TCMQSREIWTGVTYGVAANMLLHGMEHQGFTTAEGIF 840
>gi|16905165|gb|AAL31035.1|AC078948_19 unknown protein [Oryza sativa Japonica Group]
Length = 967
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 43/97 (44%), Positives = 58/97 (59%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ +F I AL IF NV + G +GAVNGM PNG D
Sbjct: 778 ADQLAGQWYAASSGL--PPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDE 835
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+WTGVTY ++A ML G + +TTA G++
Sbjct: 836 TCMQSREIWTGVTYGVAANMLLHGMEHQGFTTAEGIF 872
>gi|115482480|ref|NP_001064833.1| Os10g0473400 [Oryza sativa Japonica Group]
gi|110289241|gb|AAP54244.2| expressed protein [Oryza sativa Japonica Group]
gi|113639442|dbj|BAF26747.1| Os10g0473400 [Oryza sativa Japonica Group]
gi|215695411|dbj|BAG90602.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 974
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 43/97 (44%), Positives = 58/97 (59%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ +F I AL IF NV + G +GAVNGM PNG D
Sbjct: 785 ADQLAGQWYAASSGL--PPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDE 842
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+WTGVTY ++A ML G + +TTA G++
Sbjct: 843 TCMQSREIWTGVTYGVAANMLLHGMEHQGFTTAEGIF 879
>gi|218184729|gb|EEC67156.1| hypothetical protein OsI_34005 [Oryza sativa Indica Group]
Length = 974
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 43/97 (44%), Positives = 58/97 (59%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ +F I AL IF NV + G +GAVNGM PNG D
Sbjct: 785 ADQLAGQWYAASSGL--PPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDE 842
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+WTGVTY ++A ML G + +TTA G++
Sbjct: 843 TCMQSREIWTGVTYGVAANMLLHGMEHQGFTTAEGIF 879
>gi|297795757|ref|XP_002865763.1| hypothetical protein ARALYDRAFT_331392 [Arabidopsis lyrata subsp.
lyrata]
gi|297311598|gb|EFH42022.1| hypothetical protein ARALYDRAFT_331392 [Arabidopsis lyrata subsp.
lyrata]
Length = 956
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +ASG+ I AL +++ NV + G GAVNGM PNG D
Sbjct: 755 ADQLAGQWYARASGL--LPIVDEDKARTALEKVYNFNVMKIKDGKRGAVNGMHPNGKVDT 812
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+++QS E+W+GVTY LSA M+ EG V+ A+ TA G+Y
Sbjct: 813 ASMQSREIWSGVTYALSATMIQEGLVEMAFQTASGVY 849
>gi|326507888|dbj|BAJ86687.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 676
Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 61/98 (62%), Gaps = 2/98 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQL G + +A G+ E I AL T+ NV + G++GAVNGM+P+G D
Sbjct: 497 MADQLAGQWYARACGL--EPIVEEEKARSALGTVLDYNVMRVQGGAIGAVNGMRPDGTVD 554
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
S+IQS EVW GVTY ++AAM +EG + A+ TA G +
Sbjct: 555 TSSIQSREVWPGVTYGVAAAMAHEGMPEAAFRTAKGAH 592
>gi|30692911|ref|NP_174631.2| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
gi|79319136|ref|NP_001031135.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
gi|332193493|gb|AEE31614.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
gi|332193494|gb|AEE31615.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
Length = 947
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 44/109 (40%), Positives = 62/109 (56%), Gaps = 2/109 (1%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
+ADQ+ G + +A G+ + I I +AL T++ NV G+ GAVNGM P+G D
Sbjct: 746 LADQMAGQWYARACGL--KPIAKEEWIKKALETVYDFNVMRVRDGTRGAVNGMLPDGRVD 803
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGC 109
S + S EVW G TY ++A M+ EG D+ + TA G+Y DR GC
Sbjct: 804 TSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEAAWSDRGLGC 852
>gi|297741007|emb|CBI31319.3| unnamed protein product [Vitis vinifera]
Length = 953
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 42/97 (43%), Positives = 60/97 (61%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +A G+ + I AL +++ NV + G GAVNGM P+G D
Sbjct: 753 ADQLAGQWYARACGL--QPIVDDEKARSALEKVYNFNVLKVKEGKCGAVNGMLPDGRVDM 810
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
SA+QS E+W GVTY ++A M++EG V+ A+ TA G+Y
Sbjct: 811 SAMQSREIWAGVTYSVAANMIHEGMVETAFNTASGIY 847
>gi|28393605|gb|AAO42222.1| unknown protein [Arabidopsis thaliana]
Length = 947
Score = 82.0 bits (201), Expect = 5e-14, Method: Composition-based stats.
Identities = 44/109 (40%), Positives = 62/109 (56%), Gaps = 2/109 (1%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
+ADQ+ G + +A G+ + I I +AL T++ NV G+ GAVNGM P+G D
Sbjct: 746 LADQMAGQWYARACGL--KPIAKEEWIKKALETVYDFNVMRVRDGTRGAVNGMLPDGRVD 803
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGC 109
S + S EVW G TY ++A M+ EG D+ + TA G+Y DR GC
Sbjct: 804 TSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEAAWSDRGLGC 852
>gi|225460125|ref|XP_002275782.1| PREDICTED: non-lysosomal glucosylceramidase-like [Vitis vinifera]
Length = 969
Score = 81.6 bits (200), Expect = 5e-14, Method: Composition-based stats.
Identities = 42/97 (43%), Positives = 60/97 (61%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +A G+ + I AL +++ NV + G GAVNGM P+G D
Sbjct: 769 ADQLAGQWYARACGL--QPIVDDEKARSALEKVYNFNVLKVKEGKCGAVNGMLPDGRVDM 826
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
SA+QS E+W GVTY ++A M++EG V+ A+ TA G+Y
Sbjct: 827 SAMQSREIWAGVTYSVAANMIHEGMVETAFNTASGIY 863
>gi|159896834|ref|YP_001543081.1| hypothetical protein Haur_0301 [Herpetosiphon aurantiacus DSM 785]
gi|159889873|gb|ABX02953.1| protein of unknown function DUF608 [Herpetosiphon aurantiacus DSM
785]
Length = 774
Score = 81.3 bits (199), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 59/98 (60%), Gaps = 2/98 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQL G + A G+ A+ P I AL T++ NV + G++GAVNGM P+G D
Sbjct: 631 MADQLVGQWYAGAIGL--PAVAPREMIRSALQTVYRFNVMQYANGALGAVNGMHPDGTVD 688
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
S+ Q+ EVW+G TY ++A ML EG E W TA G Y
Sbjct: 689 TSSNQASEVWSGTTYAIAAMMLQEGLDLEGWQTAWGAY 726
>gi|353232147|emb|CCD79502.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
Length = 900
Score = 80.9 bits (198), Expect = 1e-13, Method: Composition-based stats.
Identities = 38/90 (42%), Positives = 54/90 (60%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQL GY F + +GV I P ++ + L TI + N G + G++GA+NG P D
Sbjct: 741 MADQLSGYWFSRITGVPPNLILPKNHVVKTLQTISNCNWHGIKNGTIGAINGCLPVCKPD 800
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
S++Q+EE W V Y LS+ M+ EG +DE
Sbjct: 801 LSSVQAEEFWVAVNYSLSSLMIAEGMIDEG 830
>gi|256082537|ref|XP_002577511.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
Length = 892
Score = 80.9 bits (198), Expect = 1e-13, Method: Composition-based stats.
Identities = 38/90 (42%), Positives = 54/90 (60%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQL GY F + +GV I P ++ + L TI + N G + G++GA+NG P D
Sbjct: 733 MADQLSGYWFSRITGVPPNLILPKNHVVKTLQTISNCNWHGIKNGTIGAINGCLPVCKPD 792
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
S++Q+EE W V Y LS+ M+ EG +DE
Sbjct: 793 LSSVQAEEFWVAVNYSLSSLMIAEGMIDEG 822
>gi|357146604|ref|XP_003574050.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
distachyon]
Length = 962
Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats.
Identities = 43/97 (44%), Positives = 56/97 (57%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ IF I AL IF NV + G MGAVNGM P G D
Sbjct: 773 ADQLAGQWYAASSGL--PPIFDEHKIRSALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDE 830
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+WTGVTY ++A ML G + + TA G++
Sbjct: 831 TCMQSREIWTGVTYGVAANMLLHGMEHQGFITAEGIF 867
>gi|20453181|gb|AAM19831.1| AT4g10060/T5L19_190 [Arabidopsis thaliana]
gi|24111431|gb|AAN46866.1| At4g10060/T5L19_190 [Arabidopsis thaliana]
Length = 922
Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats.
Identities = 44/116 (37%), Positives = 65/116 (56%), Gaps = 3/116 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
+ADQL G + +A G+ + I I +AL TI+ NV + G+ GAVNGM G D
Sbjct: 719 LADQLAGQWYARACGL--KPITKEEWIKKALETIYEFNVMKVKGGTRGAVNGMSTEGKVD 776
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCE-TAPEG 115
+++ S+EVW G TY ++A M+ EG ++ + TA G+Y + DR C PE
Sbjct: 777 TNSLVSKEVWAGTTYSVAACMIQEGQREKGFQTASGIYEAIWSDRGLSCSFQTPEA 832
>gi|240255774|ref|NP_192744.6| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|332657434|gb|AEE82834.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
Length = 922
Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats.
Identities = 44/116 (37%), Positives = 65/116 (56%), Gaps = 3/116 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
+ADQL G + +A G+ + I I +AL TI+ NV + G+ GAVNGM G D
Sbjct: 719 LADQLAGQWYARACGL--KPITKEEWIKKALETIYEFNVMKVKGGTRGAVNGMSTEGKVD 776
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCE-TAPEG 115
+++ S+EVW G TY ++A M+ EG ++ + TA G+Y + DR C PE
Sbjct: 777 TNSLVSKEVWAGTTYSVAACMIQEGQREKGFQTASGIYEAIWSDRGLSCSFQTPEA 832
>gi|326516802|dbj|BAJ96393.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 970
Score = 80.1 bits (196), Expect = 1e-13, Method: Composition-based stats.
Identities = 43/97 (44%), Positives = 56/97 (57%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ IF I AL IF NV + G MGAVNGM P G D
Sbjct: 781 ADQLAGQWYAASSGL--PPIFDEHKIRSALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDE 838
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+WTGVTY ++A ML G + + TA G++
Sbjct: 839 TCMQSREIWTGVTYGVAANMLLHGMEHQGFITAEGIF 875
>gi|356520149|ref|XP_003528727.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal
glucosylceramidase-like [Glycine max]
Length = 944
Score = 80.1 bits (196), Expect = 2e-13, Method: Composition-based stats.
Identities = 42/97 (43%), Positives = 57/97 (58%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +A G+ I AL +++ NV G GAVNGM P+G D
Sbjct: 748 ADQLAGQWYARACGL--LPIVEEKKSRSALQIVYNHNVMKVHXGRRGAVNGMLPDGKVDM 805
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
S +QS E+W+GVTY L+A M+ E +D A+ TAGG+Y
Sbjct: 806 STMQSREIWSGVTYALAATMIQENMIDMAFQTAGGVY 842
>gi|188570621|gb|ACD64346.1| hypothetical protein [Helianthus petiolaris]
gi|188570623|gb|ACD64347.1| hypothetical protein [Helianthus petiolaris]
Length = 210
Score = 80.1 bits (196), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 43/97 (44%), Positives = 58/97 (59%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + ++G+ +F I +L IF NV G MGAVNGM PNG D
Sbjct: 70 ADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDE 127
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS EVWTGVTY ++A M++ D A+TTA G++
Sbjct: 128 TCMQSREVWTGVTYGVAATMIHAEMEDNAFTTAEGIF 164
>gi|4539009|emb|CAB39630.1| putative protein [Arabidopsis thaliana]
gi|7267702|emb|CAB78129.1| putative protein [Arabidopsis thaliana]
Length = 750
Score = 80.1 bits (196), Expect = 2e-13, Method: Composition-based stats.
Identities = 44/116 (37%), Positives = 65/116 (56%), Gaps = 3/116 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
+ADQL G + +A G+ + I I +AL TI+ NV + G+ GAVNGM G D
Sbjct: 547 LADQLAGQWYARACGL--KPITKEEWIKKALETIYEFNVMKVKGGTRGAVNGMSTEGKVD 604
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCE-TAPEG 115
+++ S+EVW G TY ++A M+ EG ++ + TA G+Y + DR C PE
Sbjct: 605 TNSLVSKEVWAGTTYSVAACMIQEGQREKGFQTASGIYEAIWSDRGLSCSFQTPEA 660
>gi|324503709|gb|ADY41606.1| Non-lysosomal glucosylceramidase [Ascaris suum]
Length = 261
Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 62/117 (52%), Gaps = 2/117 (1%)
Query: 1 MADQLCGYLFLKAS-GVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR 59
MADQLCG+ F+ + G + I I +L TIF NV F G +G VN M P+G
Sbjct: 114 MADQLCGFWFMCITDGKVDDVIITRQQICASLKTIFEYNVEKFANGQLGPVNAMMPSGVV 173
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEGV 116
D + IQSEEVW GV Y L++ L + A+ TA G YR E R +PE +
Sbjct: 174 DSTGIQSEEVWGGVAYALASFHLLVEENESAFKTAEGWYRSCWE-RYGLQYQSPEAI 229
>gi|242080271|ref|XP_002444904.1| hypothetical protein SORBIDRAFT_07g001140 [Sorghum bicolor]
gi|241941254|gb|EES14399.1| hypothetical protein SORBIDRAFT_07g001140 [Sorghum bicolor]
Length = 929
Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 43/98 (43%), Positives = 63/98 (64%), Gaps = 2/98 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQ+ G + +A G+ E + ALAT+ NV + G++GAVNGM+P+G D
Sbjct: 725 MADQMAGQWYARACGM--EPVVEEGKARSALATVLDYNVMRVKGGAVGAVNGMRPDGAVD 782
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
S+ QS+EVW GVTY ++AAM++EG + A+ TA G +
Sbjct: 783 ASSGQSKEVWPGVTYAVAAAMVHEGMHEAAFRTAKGAH 820
>gi|324504438|gb|ADY41918.1| Non-lysosomal glucosylceramidase [Ascaris suum]
Length = 534
Score = 79.3 bits (194), Expect = 3e-13, Method: Composition-based stats.
Identities = 45/104 (43%), Positives = 57/104 (54%), Gaps = 1/104 (0%)
Query: 1 MADQLCGYLFLKAS-GVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR 59
MADQLCG+ F+ + G + I I +L TIF NV F G +G VN M P+G
Sbjct: 387 MADQLCGFWFMCITDGKVDDVIITRQQICASLKTIFEYNVEKFANGQLGPVNAMMPSGVV 446
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D + IQSEEVW GV Y L++ L + A+ TA G YR E
Sbjct: 447 DSTGIQSEEVWGGVAYALASFHLLVEENESAFKTAEGWYRSCWE 490
>gi|268555576|ref|XP_002635777.1| Hypothetical protein CBG10433 [Caenorhabditis briggsae]
Length = 821
Score = 79.3 bits (194), Expect = 3e-13, Method: Composition-based stats.
Identities = 44/101 (43%), Positives = 59/101 (58%), Gaps = 2/101 (1%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQLCG+ L A ++ I AL TIF NV ++ G GAVNG + D
Sbjct: 678 MADQLCGFWALTA--MDEPIQVSEGKIQSALETIFKYNVEMYDGGKCGAVNGFLASERVD 735
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRL 101
S+IQSEEVW G+TY LSA M+ +G + A+ T+ GL+ +
Sbjct: 736 GSSIQSEEVWAGITYSLSAMMIEKGMDEMAFKTSEGLFHSI 776
>gi|9665098|gb|AAF97289.1|AC010164_11 Hypothetical protein [Arabidopsis thaliana]
Length = 790
Score = 79.3 bits (194), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 44/109 (40%), Positives = 62/109 (56%), Gaps = 2/109 (1%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
+ADQ+ G + +A G+ + I I +AL T++ NV G+ GAVNGM P+G D
Sbjct: 589 LADQMAGQWYARACGL--KPIAKEEWIKKALETVYDFNVMRVRDGTRGAVNGMLPDGRVD 646
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGC 109
S + S EVW G TY ++A M+ EG D+ + TA G+Y DR GC
Sbjct: 647 TSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEAAWSDRGLGC 695
>gi|297831192|ref|XP_002883478.1| hypothetical protein ARALYDRAFT_479911 [Arabidopsis lyrata subsp.
lyrata]
gi|297329318|gb|EFH59737.1| hypothetical protein ARALYDRAFT_479911 [Arabidopsis lyrata subsp.
lyrata]
Length = 950
Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats.
Identities = 40/96 (41%), Positives = 57/96 (59%), Gaps = 2/96 (2%)
Query: 3 DQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS 62
DQL G + +SG+ IF + I + IF NV + G MGAVNGM P+G D +
Sbjct: 764 DQLAGQWYAASSGL--PPIFEESKIRSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDT 821
Query: 63 AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+QS E+WTGVTY +A M+ G ++ +TTA G++
Sbjct: 822 CMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIF 857
>gi|302787158|ref|XP_002975349.1| hypothetical protein SELMODRAFT_174907 [Selaginella moellendorffii]
gi|300156923|gb|EFJ23550.1| hypothetical protein SELMODRAFT_174907 [Selaginella moellendorffii]
Length = 916
Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 62/97 (63%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
+DQL G + ASG+ I AL T+F+TNV ++ G+ GAVNGM P+G DR
Sbjct: 712 SDQLAGQWYSLASGL--PGIVREDQATSALGTVFATNVMRYKGGNEGAVNGMLPSGGEDR 769
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+++QS EVW G TY ++AAM+ +G +E + TA G++
Sbjct: 770 TSLQSREVWVGTTYAVAAAMIQQGMREEGFRTAKGVF 806
>gi|302762052|ref|XP_002964448.1| hypothetical protein SELMODRAFT_166634 [Selaginella moellendorffii]
gi|300168177|gb|EFJ34781.1| hypothetical protein SELMODRAFT_166634 [Selaginella moellendorffii]
Length = 916
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 62/97 (63%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
+DQL G + ASG+ I AL T+F+TNV ++ G+ GAVNGM P+G DR
Sbjct: 712 SDQLAGQWYSLASGL--PGIVREDQATSALGTVFATNVMRYKGGNEGAVNGMLPSGGEDR 769
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+++QS EVW G TY ++AAM+ +G +E + TA G++
Sbjct: 770 TSLQSREVWAGTTYAVAAAMIQQGMREEGFRTAKGVF 806
>gi|47196687|emb|CAF89164.1| unnamed protein product [Tetraodon nigroviridis]
Length = 239
Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 38/80 (47%), Positives = 48/80 (60%), Gaps = 2/80 (2%)
Query: 5 LCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAI 64
CG+ + T FP I AL +IF NV F G MGAVNGM+P G DRS++
Sbjct: 161 FCGHPVWETE--TTRQAFPKEKIQTALKSIFDLNVMSFAGGQMGAVNGMRPEGVPDRSSV 218
Query: 65 QSEEVWTGVTYLLSAAMLYE 84
QS+EVW GV Y L+A M++E
Sbjct: 219 QSDEVWIGVVYGLAATMIHE 238
>gi|308471410|ref|XP_003097936.1| hypothetical protein CRE_12977 [Caenorhabditis remanei]
gi|308239241|gb|EFO83193.1| hypothetical protein CRE_12977 [Caenorhabditis remanei]
Length = 845
Score = 78.6 bits (192), Expect = 4e-13, Method: Composition-based stats.
Identities = 43/98 (43%), Positives = 59/98 (60%), Gaps = 2/98 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQLCG+ L A ++ + AL TIF NV+ ++ G GAVNG + D
Sbjct: 703 MADQLCGFWALTA--MDEPVQVSEEKMKSALETIFKYNVQMYDGGKCGAVNGYLTSERVD 760
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
S+IQSEEVW G+TY LSA M+ +G + A+ T+ GL+
Sbjct: 761 GSSIQSEEVWAGITYALSAMMIEKGMDEMAFKTSEGLF 798
>gi|22331303|ref|NP_189060.2| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|79313355|ref|NP_001030757.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|17529232|gb|AAL38843.1| unknown protein [Arabidopsis thaliana]
gi|110742225|dbj|BAE99039.1| hypothetical protein [Arabidopsis thaliana]
gi|332643347|gb|AEE76868.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|332643348|gb|AEE76869.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
Length = 950
Score = 78.2 bits (191), Expect = 5e-13, Method: Composition-based stats.
Identities = 39/96 (40%), Positives = 57/96 (59%), Gaps = 2/96 (2%)
Query: 3 DQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS 62
DQL G + +SG+ +F + I + IF NV + G MGAVNGM P+G D +
Sbjct: 764 DQLAGQWYAASSGL--PPLFEESKIRSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDT 821
Query: 63 AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+QS E+WTGVTY +A M+ G ++ +TTA G++
Sbjct: 822 CMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIF 857
>gi|11994237|dbj|BAB01359.1| unnamed protein product [Arabidopsis thaliana]
Length = 937
Score = 78.2 bits (191), Expect = 6e-13, Method: Composition-based stats.
Identities = 39/96 (40%), Positives = 57/96 (59%), Gaps = 2/96 (2%)
Query: 3 DQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS 62
DQL G + +SG+ +F + I + IF NV + G MGAVNGM P+G D +
Sbjct: 751 DQLAGQWYAASSGL--PPLFEESKIRSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDT 808
Query: 63 AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+QS E+WTGVTY +A M+ G ++ +TTA G++
Sbjct: 809 CMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIF 844
>gi|359486233|ref|XP_002264575.2| PREDICTED: non-lysosomal glucosylceramidase [Vitis vinifera]
Length = 960
Score = 78.2 bits (191), Expect = 6e-13, Method: Composition-based stats.
Identities = 41/97 (42%), Positives = 58/97 (59%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +A G++ I AL ++ NV G GAVNGM P+G D
Sbjct: 765 ADQLAGQWYARACGLS--PIVDEDKAKSALEKVYHYNVLKVGGGKRGAVNGMLPDGKVDT 822
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+W+GVTY ++A M++EG VD A+ TA G+Y
Sbjct: 823 TTMQSREIWSGVTYGVAATMIHEGLVDMAFQTASGVY 859
>gi|297739499|emb|CBI29681.3| unnamed protein product [Vitis vinifera]
Length = 949
Score = 78.2 bits (191), Expect = 6e-13, Method: Composition-based stats.
Identities = 41/97 (42%), Positives = 58/97 (59%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +A G++ I AL ++ NV G GAVNGM P+G D
Sbjct: 754 ADQLAGQWYARACGLS--PIVDEDKAKSALEKVYHYNVLKVGGGKRGAVNGMLPDGKVDT 811
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+ +QS E+W+GVTY ++A M++EG VD A+ TA G+Y
Sbjct: 812 TTMQSREIWSGVTYGVAATMIHEGLVDMAFQTASGVY 848
>gi|449463809|ref|XP_004149624.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
gi|449494604|ref|XP_004159595.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
Length = 951
Score = 77.8 bits (190), Expect = 9e-13, Method: Composition-based stats.
Identities = 41/97 (42%), Positives = 61/97 (62%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +ASG+ I ALA +++ NV + G GAVNGM P+G D
Sbjct: 758 ADQLAGQWYARASGL--LPIVDEDKAKSALAKVYNYNVMKVKDGKRGAVNGMLPDGTIDF 815
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
S++QS E+W+GVTY ++A+M++E D A+ TA G++
Sbjct: 816 SSMQSREIWSGVTYAVAASMIHEDMTDMAFRTAEGIH 852
>gi|357144426|ref|XP_003573288.1| PREDICTED: non-lysosomal glucosylceramidase-like isoform 1
[Brachypodium distachyon]
Length = 932
Score = 77.4 bits (189), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 41/98 (41%), Positives = 61/98 (62%), Gaps = 2/98 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQL G + G+ E + ALAT+ NV + G++GAVNGM+P+G D
Sbjct: 733 MADQLAGQWYAHVCGL--EPVVEEDKARSALATVLDYNVMRVKGGAVGAVNGMRPDGGID 790
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
S+ QS+E+W+G TY ++AAM++EG + A+ TA G +
Sbjct: 791 MSSTQSKEIWSGTTYAVAAAMVHEGMPEGAFRTAKGAH 828
>gi|357144429|ref|XP_003573289.1| PREDICTED: non-lysosomal glucosylceramidase-like isoform 2
[Brachypodium distachyon]
Length = 919
Score = 77.4 bits (189), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 41/98 (41%), Positives = 61/98 (62%), Gaps = 2/98 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQL G + G+ E + ALAT+ NV + G++GAVNGM+P+G D
Sbjct: 720 MADQLAGQWYAHVCGL--EPVVEEDKARSALATVLDYNVMRVKGGAVGAVNGMRPDGGID 777
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
S+ QS+E+W+G TY ++AAM++EG + A+ TA G +
Sbjct: 778 MSSTQSKEIWSGTTYAVAAAMVHEGMPEGAFRTAKGAH 815
>gi|392918721|ref|NP_504047.2| Protein R08F11.1 [Caenorhabditis elegans]
gi|387910741|emb|CCD72306.2| Protein R08F11.1 [Caenorhabditis elegans]
Length = 830
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 42/101 (41%), Positives = 60/101 (59%), Gaps = 2/101 (1%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQLCG+ + A ++ + AL TIF NV+ + G GAVNG + D
Sbjct: 688 MADQLCGFWAMTA--MDEPVQISKDKMKSALDTIFKYNVQMYNNGRCGAVNGYLTSERVD 745
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRL 101
S+IQSEEVW G+TY LSA M+ +G ++A+ T+ GL+ +
Sbjct: 746 GSSIQSEEVWAGITYALSAMMIEKGMDEQAFKTSEGLFESI 786
>gi|341888271|gb|EGT44206.1| hypothetical protein CAEBREN_07747 [Caenorhabditis brenneri]
Length = 820
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 46/102 (45%), Positives = 61/102 (59%), Gaps = 4/102 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQLCG+ L A ++ + AL TIF NV+ + G GAVNG + D
Sbjct: 678 MADQLCGFWALTA--MDEPVQVSEEKMKLALETIFKYNVQMYNDGKCGAVNGFLTSERVD 735
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDE-AWTTAGGLYRRL 101
S+IQSEEVW G+TY LS AM+ E N+DE A+ T+ GL+ +
Sbjct: 736 GSSIQSEEVWAGITYALS-AMMIEKNMDEMAFKTSEGLFESI 776
>gi|341899054|gb|EGT54989.1| hypothetical protein CAEBREN_31176 [Caenorhabditis brenneri]
Length = 831
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 46/102 (45%), Positives = 61/102 (59%), Gaps = 4/102 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQLCG+ L A ++ + AL TIF NV+ + G GAVNG + D
Sbjct: 689 MADQLCGFWALTA--MDEPVQVSEEKMKLALETIFKYNVQMYNDGKCGAVNGFLTSERVD 746
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDE-AWTTAGGLYRRL 101
S+IQSEEVW G+TY LS AM+ E N+DE A+ T+ GL+ +
Sbjct: 747 GSSIQSEEVWAGITYALS-AMMIEKNMDEMAFKTSEGLFESI 787
>gi|255575898|ref|XP_002528846.1| conserved hypothetical protein [Ricinus communis]
gi|223531697|gb|EEF33520.1| conserved hypothetical protein [Ricinus communis]
Length = 952
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 44/107 (41%), Positives = 64/107 (59%), Gaps = 8/107 (7%)
Query: 2 ADQLCGYLFLKASGVNTEAIFP--SANIAR-ALATIFSTNVRGFEAGSMGAVNGMKPNGD 58
ADQL G + +A G +FP + AR AL +++ NV + G GA+NGM P+G
Sbjct: 756 ADQLAGQWYARACG-----LFPIVDKDKARSALEKVYNYNVLKVKDGKRGAINGMLPDGK 810
Query: 59 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDR 105
D S++QS E+W+GVTY L+A M+ E +D A+ TA G+Y +R
Sbjct: 811 VDLSSMQSREIWSGVTYALAATMIQEDMLDMAFHTASGIYEAAWSER 857
>gi|449482327|ref|XP_004156248.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Cucumis
sativus]
Length = 508
Score = 76.6 bits (187), Expect = 2e-12, Method: Composition-based stats.
Identities = 41/98 (41%), Positives = 60/98 (61%), Gaps = 2/98 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +A G+ I I AL I++ NV + G+ GAVNGM P+G D+
Sbjct: 309 ADQLAGQWYARACGLC--PIADEEKIRVALEKIYNFNVMKVKGGTRGAVNGMFPDGSVDK 366
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYR 99
S +Q +E+W GVTY ++A M+ EG V+ + TA G+Y+
Sbjct: 367 SILQPKEIWAGVTYSVAATMIQEGMVETGFQTAMGIYQ 404
>gi|449447581|ref|XP_004141546.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
Length = 944
Score = 76.6 bits (187), Expect = 2e-12, Method: Composition-based stats.
Identities = 41/98 (41%), Positives = 60/98 (61%), Gaps = 2/98 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +A G+ I I AL I++ NV + G+ GAVNGM P+G D+
Sbjct: 745 ADQLAGQWYARACGLC--PIADEEKIRVALEKIYNFNVMKVKGGTRGAVNGMFPDGSVDK 802
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYR 99
S +Q +E+W GVTY ++A M+ EG V+ + TA G+Y+
Sbjct: 803 SILQPKEIWAGVTYSVAATMIQEGMVETGFQTAMGIYQ 840
>gi|288901049|gb|ADC68237.1| hypothetical protein [Dendrobium nobile]
Length = 207
Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 41/89 (46%), Positives = 51/89 (57%), Gaps = 2/89 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +SG+ +F I L IF NV G +GAVNGM PNG D
Sbjct: 120 ADQLAGQWYAASSGL--PPLFDENKIRSTLQKIFDFNVMKVRGGRIGAVNGMHPNGKVDE 177
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
S +QS E+WTGVTY LSA M+ G ++A
Sbjct: 178 SCMQSREIWTGVTYALSATMILAGMEEQA 206
>gi|224120058|ref|XP_002331126.1| predicted protein [Populus trichocarpa]
gi|222872854|gb|EEF09985.1| predicted protein [Populus trichocarpa]
Length = 948
Score = 75.9 bits (185), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 46/115 (40%), Positives = 65/115 (56%), Gaps = 3/115 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +A G++ I AL I++ NV + G GAVNGM P+G D
Sbjct: 752 ADQLAGQWYARACGLS--PIVDEDKARCALEKIYNYNVLKVQDGKRGAVNGMLPDGTVDL 809
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCE-TAPEG 115
S +QS E+W+GVTY ++A M+ EG D A+ TA G+Y + ++ G PEG
Sbjct: 810 SCLQSREIWSGVTYAVAATMIQEGLTDMAFHTASGVYEAVWAEQGLGYSFQTPEG 864
>gi|224146269|ref|XP_002325943.1| predicted protein [Populus trichocarpa]
gi|222862818|gb|EEF00325.1| predicted protein [Populus trichocarpa]
Length = 891
Score = 75.5 bits (184), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 41/97 (42%), Positives = 61/97 (62%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +A G++ + +AL I+ NV + G+ GAVNGM P+G D
Sbjct: 695 ADQLAGQWYARACGLS--PVVDGDKARKALEKIYEFNVLKVKGGTRGAVNGMLPDGRVDL 752
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
+A+Q++E+W GVTY LSA+M+ EG + A+ TA G+Y
Sbjct: 753 TAMQTKEIWPGVTYALSASMIQEGLEEMAFQTAVGIY 789
>gi|224129460|ref|XP_002328722.1| predicted protein [Populus trichocarpa]
gi|222839020|gb|EEE77371.1| predicted protein [Populus trichocarpa]
Length = 922
Score = 74.3 bits (181), Expect = 9e-12, Method: Composition-based stats.
Identities = 41/97 (42%), Positives = 56/97 (57%), Gaps = 2/97 (2%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +A G+ I AL I++ N G GAVNGM P+G D
Sbjct: 726 ADQLAGQWYARACGL--LPIVDEDKARSALEKIYNYNFLKVHDGKRGAVNGMLPDGTVDM 783
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
S +QS E+W+GVTY ++A M+ EG +D A+ TA G+Y
Sbjct: 784 SDMQSREIWSGVTYAVAATMMQEGLMDMAFHTASGVY 820
>gi|147812668|emb|CAN61857.1| hypothetical protein VITISV_016690 [Vitis vinifera]
Length = 521
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 34/69 (49%), Positives = 48/69 (69%)
Query: 30 ALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 89
AL +++ NV + G GAVNGM P+G D SA+QS E+W GVTY ++A M++EG V+
Sbjct: 347 ALEKVYNFNVLKVKEGKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVET 406
Query: 90 AWTTAGGLY 98
A+ TA G+Y
Sbjct: 407 AFNTASGIY 415
>gi|281212396|gb|EFA86556.1| hypothetical protein PPL_00357 [Polysphondylium pallidum PN500]
Length = 4775
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 39/97 (40%), Positives = 56/97 (57%), Gaps = 2/97 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
M+DQL G+ +L A G+ + A +L+ I NV+ + GS GAVNGM+P G D
Sbjct: 1024 MSDQLAGHWYLLACGLPSYITLDQA--LSSLSIINEYNVKSYSNGSCGAVNGMRPEGGPD 1081
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 97
+++QS EVW G +Y L++ ML +EAW GL
Sbjct: 1082 TTSLQSCEVWIGTSYGLASTMLLHFMDNEAWELIKGL 1118
>gi|156333782|ref|XP_001619412.1| hypothetical protein NEMVEDRAFT_v1g224207 [Nematostella vectensis]
gi|156202556|gb|EDO27312.1| predicted protein [Nematostella vectensis]
Length = 173
Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 36/74 (48%), Positives = 52/74 (70%), Gaps = 1/74 (1%)
Query: 43 EAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLR 102
+ G+MGAVNG++P+G D S++Q+EEVWTGVTY ++A+M+ EG VDE + TA G+Y
Sbjct: 50 KEGTMGAVNGIRPDGQLDTSSLQAEEVWTGVTYAVAASMIQEGLVDEGFKTASGIYNTCF 109
Query: 103 EDRVSGCETAPEGV 116
E R+ PE +
Sbjct: 110 E-RLGMNFQTPEAI 122
>gi|22297737|ref|NP_680984.1| hypothetical protein tlr0193 [Thermosynechococcus elongatus BP-1]
gi|22293914|dbj|BAC07746.1| tlr0193 [Thermosynechococcus elongatus BP-1]
Length = 806
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 40/105 (38%), Positives = 55/105 (52%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + + G+ I P RAL I+ T F G GA NG+ PNG
Sbjct: 664 MADQLCGQFYAQLLGLVD--IVPPDCCDRALRKIYDTCFLKFHNGQFGAANGLLPNGQPE 721
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVWTG+ + L+A + G +DEAW A + R++ E+
Sbjct: 722 NPHATHPLEVWTGINFGLAAFLWQRGMIDEAWRLAEVVVRQIYEN 766
>gi|295829670|gb|ADG38504.1| AT3G24180-like protein [Capsella grandiflora]
gi|295829672|gb|ADG38505.1| AT3G24180-like protein [Capsella grandiflora]
gi|295829674|gb|ADG38506.1| AT3G24180-like protein [Capsella grandiflora]
gi|295829676|gb|ADG38507.1| AT3G24180-like protein [Capsella grandiflora]
Length = 190
Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 35/80 (43%), Positives = 47/80 (58%), Gaps = 2/80 (2%)
Query: 3 DQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS 62
DQL G + +SG+ IF + I + IF NV + G MGAVNGM P+G D +
Sbjct: 113 DQLAGQWYAASSGL--PPIFEESKIKSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDT 170
Query: 63 AIQSEEVWTGVTYLLSAAML 82
+QS E+WTGVTY +A M+
Sbjct: 171 CMQSREIWTGVTYAAAATMI 190
>gi|345291087|gb|AEN82035.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291089|gb|AEN82036.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291091|gb|AEN82037.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291093|gb|AEN82038.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291095|gb|AEN82039.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291097|gb|AEN82040.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291099|gb|AEN82041.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291101|gb|AEN82042.1| AT3G24180-like protein, partial [Capsella rubella]
Length = 191
Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 35/80 (43%), Positives = 47/80 (58%), Gaps = 2/80 (2%)
Query: 3 DQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS 62
DQL G + +SG+ IF + I + IF NV + G MGAVNGM P+G D +
Sbjct: 113 DQLAGQWYAASSGL--PPIFEESKIKSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDT 170
Query: 63 AIQSEEVWTGVTYLLSAAML 82
+QS E+WTGVTY +A M+
Sbjct: 171 CMQSREIWTGVTYAAAATMI 190
>gi|295829678|gb|ADG38508.1| AT3G24180-like protein [Neslia paniculata]
Length = 190
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 35/80 (43%), Positives = 47/80 (58%), Gaps = 2/80 (2%)
Query: 3 DQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS 62
DQL G + +SG+ IF + I + IF NV + G MGAVNGM P+G D +
Sbjct: 113 DQLAGQWYAASSGL--PPIFEESKIRSTMQKIFDFNVMKTKGGRMGAVNGMHPDGKVDET 170
Query: 63 AIQSEEVWTGVTYLLSAAML 82
+QS E+WTGVTY +A M+
Sbjct: 171 CMQSREIWTGVTYAAAATMI 190
>gi|66801399|ref|XP_629625.1| hypothetical protein DDB_G0292446 [Dictyostelium discoideum AX4]
gi|60463006|gb|EAL61202.1| hypothetical protein DDB_G0292446 [Dictyostelium discoideum AX4]
Length = 1302
Score = 68.9 bits (167), Expect = 3e-10, Method: Composition-based stats.
Identities = 34/97 (35%), Positives = 53/97 (54%), Gaps = 2/97 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
M+D L G+ +L + G+ + F A +L+ I NV+ + G GAVNGM+P G D
Sbjct: 1117 MSDMLAGHWYLLSCGLKSYMTFDQA--LSSLSIINEYNVKSYSKGVCGAVNGMRPEGTVD 1174
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 97
+ +QS EVW G ++ L+A M+ +AW G+
Sbjct: 1175 TTCLQSSEVWIGTSFSLAATMILHHMDSDAWDLVKGI 1211
>gi|226444526|gb|ACO57837.1| unknown [Helianthus annuus]
gi|226444528|gb|ACO57838.1| unknown [Helianthus annuus]
gi|226444560|gb|ACO57854.1| unknown [Helianthus petiolaris]
gi|226444568|gb|ACO57858.1| unknown [Helianthus petiolaris]
Length = 103
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/61 (55%), Positives = 44/61 (72%), Gaps = 3/61 (4%)
Query: 38 NVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 97
VRG G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D+A+TTA G+
Sbjct: 1 KVRG---GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDKAFTTAEGI 57
Query: 98 Y 98
+
Sbjct: 58 F 58
>gi|226444574|gb|ACO57861.1| unknown [Helianthus argophyllus]
Length = 103
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/61 (55%), Positives = 43/61 (70%), Gaps = 3/61 (4%)
Query: 38 NVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 97
VRG G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G+
Sbjct: 1 KVRG---GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGI 57
Query: 98 Y 98
+
Sbjct: 58 F 58
>gi|226444524|gb|ACO57836.1| unknown [Helianthus annuus]
Length = 103
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/61 (55%), Positives = 43/61 (70%), Gaps = 3/61 (4%)
Query: 38 NVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 97
VRG G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G+
Sbjct: 1 KVRG---GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGI 57
Query: 98 Y 98
+
Sbjct: 58 F 58
>gi|226444514|gb|ACO57831.1| unknown [Helianthus annuus]
gi|226444516|gb|ACO57832.1| unknown [Helianthus annuus]
gi|226444518|gb|ACO57833.1| unknown [Helianthus annuus]
gi|226444520|gb|ACO57834.1| unknown [Helianthus annuus]
gi|226444522|gb|ACO57835.1| unknown [Helianthus annuus]
gi|226444530|gb|ACO57839.1| unknown [Helianthus annuus]
gi|226444532|gb|ACO57840.1| unknown [Helianthus annuus]
gi|226444534|gb|ACO57841.1| unknown [Helianthus annuus]
gi|226444536|gb|ACO57842.1| unknown [Helianthus annuus]
gi|226444538|gb|ACO57843.1| unknown [Helianthus annuus]
gi|226444540|gb|ACO57844.1| unknown [Helianthus annuus]
gi|226444542|gb|ACO57845.1| unknown [Helianthus annuus]
gi|226444544|gb|ACO57846.1| unknown [Helianthus annuus]
gi|226444546|gb|ACO57847.1| unknown [Helianthus petiolaris]
gi|226444548|gb|ACO57848.1| unknown [Helianthus petiolaris]
gi|226444550|gb|ACO57849.1| unknown [Helianthus petiolaris]
gi|226444552|gb|ACO57850.1| unknown [Helianthus petiolaris]
gi|226444554|gb|ACO57851.1| unknown [Helianthus petiolaris]
gi|226444556|gb|ACO57852.1| unknown [Helianthus petiolaris]
gi|226444558|gb|ACO57853.1| unknown [Helianthus petiolaris]
gi|226444562|gb|ACO57855.1| unknown [Helianthus petiolaris]
gi|226444564|gb|ACO57856.1| unknown [Helianthus petiolaris]
gi|226444566|gb|ACO57857.1| unknown [Helianthus petiolaris]
gi|226444570|gb|ACO57859.1| unknown [Helianthus argophyllus]
gi|226444572|gb|ACO57860.1| unknown [Helianthus argophyllus]
gi|226444576|gb|ACO57862.1| unknown [Helianthus argophyllus]
gi|226444578|gb|ACO57863.1| unknown [Helianthus argophyllus]
gi|226444580|gb|ACO57864.1| unknown [Helianthus argophyllus]
gi|226444582|gb|ACO57865.1| unknown [Helianthus argophyllus]
gi|226444584|gb|ACO57866.1| unknown [Helianthus argophyllus]
gi|226444586|gb|ACO57867.1| unknown [Helianthus argophyllus]
gi|226444588|gb|ACO57868.1| unknown [Helianthus argophyllus]
gi|226444590|gb|ACO57869.1| unknown [Helianthus argophyllus]
gi|226444592|gb|ACO57870.1| unknown [Helianthus argophyllus]
Length = 103
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/61 (55%), Positives = 43/61 (70%), Gaps = 3/61 (4%)
Query: 38 NVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 97
VRG G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G+
Sbjct: 1 KVRG---GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGI 57
Query: 98 Y 98
+
Sbjct: 58 F 58
>gi|328869576|gb|EGG17953.1| hypothetical protein DFA_06619 [Dictyostelium fasciculatum]
Length = 1200
Score = 65.5 bits (158), Expect = 3e-09, Method: Composition-based stats.
Identities = 35/97 (36%), Positives = 53/97 (54%), Gaps = 2/97 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQL G+ +L + G+++ A L+ I N++ + GS GAVNG+ P D
Sbjct: 1048 MADQLAGHWYLTSCGLSSYMTLDQA--LSTLSIINEYNIKSYSNGSCGAVNGISPLAIVD 1105
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 97
++ +QS EVW G +Y L++ L EAW+ GL
Sbjct: 1106 QTCLQSSEVWIGTSYSLASTFLLHYMDKEAWSLIKGL 1142
>gi|94968715|ref|YP_590763.1| hypothetical protein Acid345_1688 [Candidatus Koribacter versatilis
Ellin345]
gi|94550765|gb|ABF40689.1| protein of unknown function DUF608 [Candidatus Koribacter
versatilis Ellin345]
Length = 811
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 60/105 (57%), Gaps = 2/105 (1%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MA+QL G + +G+ + P AL ++ NV + G+MGA+NG+ +G+
Sbjct: 672 MAEQLAGQWYASLTGLGD--LVPRDMQHSALKKVYDFNVMKLQNGTMGALNGISASGEVL 729
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDR 105
+ Q+EEVWTGVT+ ++A ML G +E + TA G+Y + + +
Sbjct: 730 KDNEQTEEVWTGVTFAVAATMLQNGLREEGFNTAKGVYNVVYDQK 774
>gi|332711224|ref|ZP_08431157.1| putative bile acid beta-glucosidase [Moorea producens 3L]
gi|332350038|gb|EGJ29645.1| putative bile acid beta-glucosidase [Moorea producens 3L]
Length = 815
Score = 64.7 bits (156), Expect = 7e-09, Method: Composition-based stats.
Identities = 36/102 (35%), Positives = 54/102 (52%), Gaps = 3/102 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + + G+ I P AL T++ F G +GAVNG++P+G
Sbjct: 674 MADQLCGQFYAQLLGL--PDIVPPECTLSALKTVYDACFLKFHQGQLGAVNGVRPDGTPE 731
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRL 101
D A EVWTG+ + L+A ++ +G DEA + R++
Sbjct: 732 DPDATHPMEVWTGINFGLAAFLIQQGMKDEALRITEAVVRQV 773
>gi|374311163|ref|YP_005057593.1| glucosylceramidase [Granulicella mallensis MP5ACTX8]
gi|358753173|gb|AEU36563.1| Glucosylceramidase [Granulicella mallensis MP5ACTX8]
Length = 811
Score = 64.3 bits (155), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 40/105 (38%), Positives = 53/105 (50%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
ADQL G + +G+ I P A IF NV F G MGA NGM +G
Sbjct: 672 QADQLAGQWYANLTGLGE--IVPHTMQVSAAKKIFDVNVMKFGHGEMGAANGMTADGAIL 729
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDR 105
+A +++EVW G T + ++ EG DEAW T GLY + ED+
Sbjct: 730 TNA-EAKEVWVGTTLGYAGLLMQEGMNDEAWKTTRGLYHVIYEDK 773
>gi|366162568|ref|ZP_09462323.1| hypothetical protein AcelC_02756 [Acetivibrio cellulolyticus CD2]
Length = 842
Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 34/90 (37%), Positives = 51/90 (56%), Gaps = 2/90 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
M+DQLCG+ + KA G+ E I P ++ + ++ N + F+ G G VN MK +G+ D
Sbjct: 616 MSDQLCGHWYAKACGL--EGIVPDEYAKKSYSKVYEFNNKKFDNGLHGFVNIMKADGNVD 673
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
S Q+ E W G +Y + A M+ EG EA
Sbjct: 674 MSHAQTAEAWVGTSYGVIAGMIQEGLETEA 703
>gi|330844075|ref|XP_003293963.1| hypothetical protein DICPUDRAFT_158887 [Dictyostelium purpureum]
gi|325075658|gb|EGC29519.1| hypothetical protein DICPUDRAFT_158887 [Dictyostelium purpureum]
Length = 1205
Score = 62.4 bits (150), Expect = 4e-08, Method: Composition-based stats.
Identities = 33/97 (34%), Positives = 51/97 (52%), Gaps = 2/97 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
M+D L G+ +L + G+ + F A +L+ I NV + GAVNGM+P G D
Sbjct: 1038 MSDMLAGHWYLLSCGLPSYMTFDQA--LSSLSIINEYNVNSYGKERCGAVNGMRPEGVVD 1095
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 97
+ +QS EVW G ++ L+A M+ +AW G+
Sbjct: 1096 NTCLQSSEVWIGTSFSLAATMIQHHMDKDAWELVKGI 1132
>gi|434391596|ref|YP_007126543.1| Glucosylceramidase [Gloeocapsa sp. PCC 7428]
gi|428263437|gb|AFZ29383.1| Glucosylceramidase [Gloeocapsa sp. PCC 7428]
Length = 809
Score = 62.4 bits (150), Expect = 4e-08, Method: Composition-based stats.
Identities = 34/92 (36%), Positives = 49/92 (53%), Gaps = 3/92 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
MADQLCG + + G+ I P +AL T++ F G GA NG+KP+G
Sbjct: 671 MADQLCGQFYARLLGL--PDIVPPECATQALKTVYDACFVKFHNGQFGAANGLKPDGLPE 728
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 91
+ +A EVWTG+ + L+A ++ G EAW
Sbjct: 729 NPNATHPLEVWTGINFGLAAFLMQMGMKSEAW 760
>gi|402589209|gb|EJW83141.1| hypothetical protein WUBG_05945 [Wuchereria bancrofti]
Length = 90
Score = 61.2 bits (147), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 40/63 (63%)
Query: 19 EAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLS 78
A+ P ++I AL I+S+N+ F G +GAVNGM+ +G R +Q++E+W GVTY L
Sbjct: 22 RALLPLSHIKSALNQIYSSNILNFAGGRLGAVNGMRKDGTVGRRHLQADEMWVGVTYALQ 81
Query: 79 AAM 81
+
Sbjct: 82 HHL 84
>gi|257060089|ref|YP_003137977.1| glucosylceramidase [Cyanothece sp. PCC 8802]
gi|256590255|gb|ACV01142.1| Glucosylceramidase [Cyanothece sp. PCC 8802]
Length = 805
Score = 60.8 bits (146), Expect = 9e-08, Method: Composition-based stats.
Identities = 34/105 (32%), Positives = 55/105 (52%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + A +N + + AL ++ F+ G GA NGMKP+G
Sbjct: 664 MADQLCGQFY--ARLLNLPDVVETQYTESALNKVYEACFLKFQDGKYGAANGMKPDGTPE 721
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
D ++ +EVWTG+ + L+A +L G D A+ + +++ E+
Sbjct: 722 DPNSTHPQEVWTGINFGLAAFLLQMGRKDAAFKLTEAVVKQVYEN 766
>gi|218247014|ref|YP_002372385.1| hypothetical protein PCC8801_2201 [Cyanothece sp. PCC 8801]
gi|218167492|gb|ACK66229.1| protein of unknown function DUF608 [Cyanothece sp. PCC 8801]
Length = 805
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 34/105 (32%), Positives = 55/105 (52%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + A +N + + AL ++ F+ G GA NGMKP+G
Sbjct: 664 MADQLCGQFY--ARLLNFPDVVETQYTESALNKVYEACFLKFQDGKYGAANGMKPDGTPE 721
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
D ++ +EVWTG+ + L+A +L G D A+ + +++ E+
Sbjct: 722 DPNSTHPQEVWTGINFGLAAFLLQMGRKDAAFKLTEAVVKQVYEN 766
>gi|119510228|ref|ZP_01629365.1| hypothetical protein N9414_13977 [Nodularia spumigena CCY9414]
gi|119465077|gb|EAW45977.1| hypothetical protein N9414_13977 [Nodularia spumigena CCY9414]
Length = 814
Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 55/105 (52%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + + G+ I PS AL T++ F G GA NG++P+G
Sbjct: 664 MADQLCGQFYARLLGL--PDIVPSDRALSALQTVYHACFVKFCNGEFGAANGVRPDGSAE 721
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ +A EVWTG+ + L+A ++ G DEA G + +++ +
Sbjct: 722 NPNATHPLEVWTGINFGLAAFLVQMGMQDEALKLTGAVVQQIYHN 766
>gi|428209901|ref|YP_007094254.1| glucosylceramidase [Chroococcidiopsis thermalis PCC 7203]
gi|428011822|gb|AFY90385.1| Glucosylceramidase [Chroococcidiopsis thermalis PCC 7203]
Length = 812
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/91 (39%), Positives = 46/91 (50%), Gaps = 3/91 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + A + I P AL TI+ F G GA NG+KPNG
Sbjct: 673 MADQLCGQFY--ARLLKLPDIVPQECALSALKTIYDACFLKFHQGKFGAANGLKPNGSPE 730
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
+ +A EVWTG+ + L+A +L G EA
Sbjct: 731 NPNATHPLEVWTGINFGLAAFLLQMGMKAEA 761
>gi|254410084|ref|ZP_05023864.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196183120|gb|EDX78104.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 814
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQLCG + + G+ I P AL T++ + F GA NG+KP+G +
Sbjct: 675 MADQLCGQFYAQLLGL--PDIVPPECTQSALKTVYESCFLKFHNAQFGAANGVKPDGSPE 732
Query: 61 R-SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
A EVWTG+ + L+A ++ G DEA + R++ E+
Sbjct: 733 HPDATHPLEVWTGINFGLAAFLIQLGMKDEALKLTESVVRQVYEN 777
>gi|402583425|gb|EJW77369.1| hypothetical protein WUBG_11725, partial [Wuchereria bancrofti]
Length = 109
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 39/61 (63%)
Query: 21 IFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAA 80
+ P ++I AL I+S+N+ F G +GAVNGM+ +G R +Q++E+W GVTY L
Sbjct: 1 LLPLSHIKSALNQIYSSNILNFAGGRLGAVNGMRKDGTVGRRHLQADEMWVGVTYALQHH 60
Query: 81 M 81
+
Sbjct: 61 L 61
>gi|225593155|gb|ACN96073.1| pedicted bile acid beta-glucosidase [Fischerella sp. MV11]
Length = 812
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + + G+ I AL T+++ F G GA NG++P+G
Sbjct: 675 MADQLCGQFYARLLGLPD--IVKRDRALSALKTVYNACFLKFHHGQFGAANGLRPDGSPE 732
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVWTG+ + L+A M+ G DEA + R++ E+
Sbjct: 733 NPQATHPLEVWTGINFGLAAFMVQMGMKDEAMKLTEAVVRQVYEN 777
>gi|428200465|ref|YP_007079054.1| putative bile acid beta-glucosidase [Pleurocapsa sp. PCC 7327]
gi|427977897|gb|AFY75497.1| putative bile acid beta-glucosidase [Pleurocapsa sp. PCC 7327]
Length = 808
Score = 58.9 bits (141), Expect = 4e-07, Method: Composition-based stats.
Identities = 35/105 (33%), Positives = 54/105 (51%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + + G+ + AL ++ F+ G GA NG+KP+G
Sbjct: 665 MADQLCGQFYARLLGL--PDVVERQYAESALKKVYEACFLKFQDGKFGAANGVKPDGSPE 722
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
D A EVWTG+ + L+A +L G DEA+ A + R++ ++
Sbjct: 723 DPKATHPLEVWTGINFGLAAFLLQMGMKDEAFKLAETVVRQVYDN 767
>gi|354566897|ref|ZP_08986068.1| Glucosylceramidase [Fischerella sp. JSC-11]
gi|353544556|gb|EHC14010.1| Glucosylceramidase [Fischerella sp. JSC-11]
Length = 812
Score = 58.9 bits (141), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + + G+ I AL T+++ F G GA NG++P+G
Sbjct: 675 MADQLCGQFYARLLGL--PDIVKRDRALSALKTVYNACFLKFHHGQFGAANGLRPDGSPE 732
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVWTG+ + L+A M+ G DEA + R++ E+
Sbjct: 733 NPQATHPLEVWTGINFGLAAFMVQMGMKDEAMKLTETVVRQIYEN 777
>gi|428299644|ref|YP_007137950.1| glucosylceramidase [Calothrix sp. PCC 6303]
gi|428236188|gb|AFZ01978.1| Glucosylceramidase [Calothrix sp. PCC 6303]
Length = 804
Score = 58.5 bits (140), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 47/92 (51%), Gaps = 3/92 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + A + I P A+ T++ F G GA NG++P+G
Sbjct: 662 MADQLCGQFY--ARLLKLADIVPHDCAISAVRTVYEACFINFNDGKFGAANGLRPDGSPE 719
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 91
+ A EVWTG+ + L+A ++ G DEAW
Sbjct: 720 NPKATHPLEVWTGINFGLAAFLMQMGMQDEAW 751
>gi|427727368|ref|YP_007073605.1| putative bile acid beta-glucosidase [Nostoc sp. PCC 7524]
gi|427363287|gb|AFY46008.1| putative bile acid beta-glucosidase [Nostoc sp. PCC 7524]
Length = 803
Score = 58.5 bits (140), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 48/91 (52%), Gaps = 3/91 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + A+ +N I P AL T++ F G GA NG++P+G
Sbjct: 663 MADQLCGQFY--AALLNLPDIVPGDRAISALRTVYDACFLKFYNGQFGAANGVRPDGSPE 720
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
+ A EVWTG+ + L+A ++ G DEA
Sbjct: 721 NPQATHPLEVWTGINFGLAAFLVQMGMKDEA 751
>gi|113477064|ref|YP_723125.1| hypothetical protein Tery_3568 [Trichodesmium erythraeum IMS101]
gi|110168112|gb|ABG52652.1| protein of unknown function DUF608 [Trichodesmium erythraeum
IMS101]
Length = 816
Score = 58.2 bits (139), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 50/91 (54%), Gaps = 3/91 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQL G + K +N E I P+ AL T++++ + F G GA NG+ P+G
Sbjct: 676 MADQLSGQFYAKL--LNLEDIVPAECALSALKTVYNSCFKNFHNGKFGAANGVLPDGSPE 733
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
+ +A EVWTG+ + L+A M+ G EA
Sbjct: 734 NPNATHPLEVWTGINFGLAAFMVQIGMKKEA 764
>gi|440680630|ref|YP_007155425.1| Glucosylceramidase [Anabaena cylindrica PCC 7122]
gi|428677749|gb|AFZ56515.1| Glucosylceramidase [Anabaena cylindrica PCC 7122]
Length = 802
Score = 58.2 bits (139), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 49/91 (53%), Gaps = 3/91 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + + G+ I P+ N AL T++ F G GA NG++P+G
Sbjct: 666 MADQLCGQFYARLLGL--PDIVPNDNALSALQTVYDACFVKFCNGQFGAANGVRPDGSPE 723
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
+ A EVWTG+ + L+A ++ G +EA
Sbjct: 724 NPQATHPLEVWTGINFGLAAFLIQMGMKNEA 754
>gi|427725201|ref|YP_007072478.1| glucosylceramidase [Leptolyngbya sp. PCC 7376]
gi|427356921|gb|AFY39644.1| Glucosylceramidase [Leptolyngbya sp. PCC 7376]
Length = 807
Score = 57.4 bits (137), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 47/91 (51%), Gaps = 3/91 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
M+DQLCG + + +N + P A L TI+ F G GA NG+ P+G +
Sbjct: 670 MSDQLCGEFYTQL--LNLPNVNPENRTAITLQTIYEACFLKFHDGQFGAANGLNPDGSPE 727
Query: 61 R-SAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
+ + EVWTG+ + ++A M+ G DEA
Sbjct: 728 KENDTHPLEVWTGINFGIAAFMIRNGMKDEA 758
>gi|427709805|ref|YP_007052182.1| glucosylceramidase [Nostoc sp. PCC 7107]
gi|427362310|gb|AFY45032.1| Glucosylceramidase [Nostoc sp. PCC 7107]
Length = 802
Score = 57.4 bits (137), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 47/91 (51%), Gaps = 3/91 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + + + I PS AL T++ F+ G GA NG+ P+G
Sbjct: 664 MADQLCGQFYARL--LELPDIVPSDRALSALKTVYDACFLKFQNGEFGAANGVLPDGSPE 721
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
+ A EVWTG+ + L+A ++ G DEA
Sbjct: 722 NPKATHPLEVWTGINFGLAAFLVQMGMQDEA 752
>gi|427712531|ref|YP_007061155.1| putative bile acid beta-glucosidase [Synechococcus sp. PCC 6312]
gi|427376660|gb|AFY60612.1| putative bile acid beta-glucosidase [Synechococcus sp. PCC 6312]
Length = 799
Score = 57.0 bits (136), Expect = 1e-06, Method: Composition-based stats.
Identities = 34/105 (32%), Positives = 52/105 (49%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQLCG + A +N + P AL T++ F G GA NG+ P+G +
Sbjct: 656 MADQLCGQYY--ARLLNLPDVVPIECAESALKTVYDVCFLKFHEGQFGAANGLLPDGSPE 713
Query: 61 R-SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ +A EVW G+ Y L+A + G D AW + R++ ++
Sbjct: 714 KPNATHPLEVWIGINYGLAAFLYQMGLEDGAWRLTQTVIRQIYDN 758
>gi|428308994|ref|YP_007119971.1| bile acid beta-glucosidase [Microcoleus sp. PCC 7113]
gi|428250606|gb|AFZ16565.1| putative bile acid beta-glucosidase [Microcoleus sp. PCC 7113]
Length = 819
Score = 57.0 bits (136), Expect = 1e-06, Method: Composition-based stats.
Identities = 31/105 (29%), Positives = 55/105 (52%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
M DQLCG + + G+ I P + AL +++ + F+ +GA NG++P+G
Sbjct: 681 MTDQLCGQFYAQLLGL--PDIVPPECVQSALKSVYDSCFLKFQDAQLGAANGVRPDGSPE 738
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVWTG+ + L+A ++ +G EA + R++ E+
Sbjct: 739 NPDATHPLEVWTGINFGLAAFLVQQGMKQEALQLTEAVVRQVYEN 783
>gi|75907319|ref|YP_321615.1| hypothetical protein Ava_1096 [Anabaena variabilis ATCC 29413]
gi|75701044|gb|ABA20720.1| Protein of unknown function DUF608 [Anabaena variabilis ATCC 29413]
Length = 804
Score = 57.0 bits (136), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 47/90 (52%), Gaps = 3/90 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + A+ + I PS AL T++ F G GA NG++P+G
Sbjct: 667 MADQLCGQFY--ANLLELPDIVPSDRAISALQTVYDACFLKFYDGQFGAANGVRPDGSPE 724
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 89
+ A EVWTG+ + L+A ++ G DE
Sbjct: 725 NPKATHPLEVWTGINFGLAAFLVQMGMKDE 754
>gi|7470943|pir||T31071 conserved hypothetical protein - Anabaena sp. (fragment)
gi|2828799|gb|AAC38131.1|AAC38131 unknown [Nostoc sp. PCC 7120]
Length = 439
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 33/92 (35%), Positives = 48/92 (52%), Gaps = 3/92 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + A+ + I PS AL T++ F G GA NG++P+G
Sbjct: 302 MADQLCGQFY--ANLLELPDIVPSDRAISALQTVYDACFLKFYDGQFGAANGVRPDGSPE 359
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 91
+ A EVWTG+ + L+A ++ G DE +
Sbjct: 360 NPKATHPLEVWTGINFGLAAFLVQMGMKDEGF 391
>gi|17230308|ref|NP_486856.1| hypothetical protein alr2816 [Nostoc sp. PCC 7120]
gi|17131910|dbj|BAB74515.1| alr2816 [Nostoc sp. PCC 7120]
Length = 804
Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 47/90 (52%), Gaps = 3/90 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + A+ + I PS AL T++ F G GA NG++P+G
Sbjct: 667 MADQLCGQFY--ANLLELPDIVPSDRAISALQTVYDACFLKFYDGQFGAANGVRPDGSPE 724
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 89
+ A EVWTG+ + L+A ++ G DE
Sbjct: 725 NPKATHPLEVWTGINFGLAAFLVQMGMKDE 754
>gi|162452686|ref|YP_001615053.1| hypothetical protein sce4410 [Sorangium cellulosum So ce56]
gi|161163268|emb|CAN94573.1| hypothetical protein sce4410 [Sorangium cellulosum So ce56]
Length = 782
Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 57/124 (45%), Gaps = 13/124 (10%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
M+DQL G + +A G+ I ++ A A I+ N + F+ G+ G VN M NG D
Sbjct: 642 MSDQLAGQWYARALGL--PPIVDPSHAASAFTKIYENNFKRFDGGTRGVVNVMTANGSVD 699
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEG-----------NVDEAWTTAGGLYRRLREDRVSGC 109
++ Q+ E W G ++ + A M+ EG VD W T G +R +G
Sbjct: 700 GTSNQTRECWVGTSWGVVAGMIQEGLAAQAGEIGASLVDTIWKTDGLWFRTPEAWEGNGS 759
Query: 110 ETAP 113
AP
Sbjct: 760 IRAP 763
>gi|416392732|ref|ZP_11685894.1| hypothetical protein CWATWH0003_2700 [Crocosphaera watsonii WH
0003]
gi|357263592|gb|EHJ12579.1| hypothetical protein CWATWH0003_2700 [Crocosphaera watsonii WH
0003]
Length = 804
Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats.
Identities = 31/105 (29%), Positives = 56/105 (53%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + + G++ + S AL ++ F+ G GA NGMKP+G
Sbjct: 664 MADQLCGQFYARLLGLSD--VVESEYTLSALNKVYEACFLKFQNGKYGAANGMKPDGTPE 721
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
D ++ +EVWTG+ + +++ ++ G +EA + +++ E+
Sbjct: 722 DPNSTHPQEVWTGINFGIASFLIQMGMKNEALKLTETVVKQVYEN 766
>gi|170079447|ref|YP_001736085.1| hypothetical protein SYNPCC7002_A2862 [Synechococcus sp. PCC 7002]
gi|169887116|gb|ACB00830.1| Conserved hypothetical protein (DUF608) [Synechococcus sp. PCC
7002]
Length = 803
Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 48/91 (52%), Gaps = 3/91 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQLCG + + ++ + P A LATI++ F G GA NG+ P+G +
Sbjct: 664 MADQLCGQYYTQL--LSLPDVNPGDRTATTLATIYTACFEKFHGGQFGAANGLNPDGTPE 721
Query: 61 R-SAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
+ + EVWTG+ + ++A M+ G EA
Sbjct: 722 KENDTHPLEVWTGINFGIAALMIRNGMQTEA 752
>gi|67922703|ref|ZP_00516206.1| Protein of unknown function DUF608 [Crocosphaera watsonii WH 8501]
gi|67855413|gb|EAM50669.1| Protein of unknown function DUF608 [Crocosphaera watsonii WH 8501]
Length = 804
Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats.
Identities = 31/105 (29%), Positives = 56/105 (53%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + + G++ + S AL ++ F+ G GA NGMKP+G
Sbjct: 664 MADQLCGQFYARLLGLSD--VVESEYTLSALNKVYEACFLKFQNGKYGAANGMKPDGTPE 721
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
D ++ +EVWTG+ + +++ ++ G +EA + +++ E+
Sbjct: 722 DPNSTHPQEVWTGINFGIASFLIQMGMKNEALKLTETVVKQVYEN 766
>gi|186681980|ref|YP_001865176.1| hypothetical protein Npun_R1544 [Nostoc punctiforme PCC 73102]
gi|186464432|gb|ACC80233.1| protein of unknown function DUF608 [Nostoc punctiforme PCC 73102]
Length = 801
Score = 56.2 bits (134), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 53/105 (50%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + A ++ I PS AL T++ F G G NG++P+G
Sbjct: 664 MADQLCGQFY--ARLLDLPDIVPSDRALSALKTVYDACFLKFCNGEFGTANGVRPDGSPE 721
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVWTG+ + L+A ++ G DEA + +++ E+
Sbjct: 722 NPKATHPLEVWTGINFGLAAFLVQMGMKDEALRLTQAVVQQIYEN 766
>gi|119493962|ref|ZP_01624522.1| hypothetical protein L8106_04106 [Lyngbya sp. PCC 8106]
gi|119452283|gb|EAW33479.1| hypothetical protein L8106_04106 [Lyngbya sp. PCC 8106]
Length = 799
Score = 56.2 bits (134), Expect = 3e-06, Method: Composition-based stats.
Identities = 34/105 (32%), Positives = 59/105 (56%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
MADQLCG + K G+ + + P I+ AL TI+ + + F G GA NG++ NG+
Sbjct: 661 MADQLCGQFYAKLLGLE-DIVSPEKTIS-ALQTIYQSCFQNFYQGQFGAANGVRINGEPI 718
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ + EVWTG+ + L+A ++ G +E + A + +++ E+
Sbjct: 719 NPNDTHPLEVWTGINFGLAAFLIQMGMKEEGFKLAEVVVQQIYEN 763
>gi|298490121|ref|YP_003720298.1| glucosylceramidase ['Nostoc azollae' 0708]
gi|298232039|gb|ADI63175.1| Glucosylceramidase ['Nostoc azollae' 0708]
Length = 806
Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats.
Identities = 33/91 (36%), Positives = 48/91 (52%), Gaps = 3/91 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + + G+ I PS + AL T++ F G GA NG+ P+G
Sbjct: 669 MADQLCGQFYTRLLGL--PDIVPSNHAHSALQTVYEACFVKFYDGKFGAANGVLPDGSPE 726
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
+ A EVWTG+ + L+A ++ G +EA
Sbjct: 727 NPKATHPLEVWTGINFGLAAFLVQMGMKNEA 757
>gi|334116939|ref|ZP_08491031.1| Glucosylceramidase [Microcoleus vaginatus FGP-2]
gi|333461759|gb|EGK90364.1| Glucosylceramidase [Microcoleus vaginatus FGP-2]
Length = 818
Score = 55.5 bits (132), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 53/105 (50%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + K G+ I P AL T++ + F G GA NG+ NG
Sbjct: 678 MADQLCGQFYAKLLGL--PDIVPPECAVSALETVYESCFLKFNEGEFGAANGVMLNGSPE 735
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ +A EVWTG+ + L+A ++ G D+A+ + ++ E+
Sbjct: 736 NPNATHPLEVWTGINFGLAAFLVQMGMEDKAFKLTDAVVNQIYEN 780
>gi|427716943|ref|YP_007064937.1| glucosylceramidase [Calothrix sp. PCC 7507]
gi|427349379|gb|AFY32103.1| Glucosylceramidase [Calothrix sp. PCC 7507]
Length = 800
Score = 55.5 bits (132), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 47/91 (51%), Gaps = 3/91 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + + G+ I S AL T++ F G GA NG++P+G
Sbjct: 660 MADQLCGQFYARLLGL--PDIVASDRALSALQTVYEACFLKFYDGQFGAANGVRPDGSPE 717
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
+ A EVWTG+ + L+A ++ G DEA
Sbjct: 718 NPQATHPLEVWTGINFGLAAFLVQMGMQDEA 748
>gi|307151416|ref|YP_003886800.1| glucosylceramidase [Cyanothece sp. PCC 7822]
gi|306981644|gb|ADN13525.1| Glucosylceramidase [Cyanothece sp. PCC 7822]
Length = 805
Score = 55.1 bits (131), Expect = 5e-06, Method: Composition-based stats.
Identities = 33/105 (31%), Positives = 55/105 (52%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
M+DQLCG + + G+ + + P + AL I+ F G GA NG+KP+G
Sbjct: 664 MSDQLCGQFYARLLGL-PDVVEPQYTTS-ALRKIYEACFLKFHGGKYGAANGVKPDGTPE 721
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVWTG+ + L+A M+ G +EA+ + +++ E+
Sbjct: 722 NPDATHPLEVWTGINFGLAAFMIQMGMKEEAFKITEAVVKQVYEN 766
>gi|16330256|ref|NP_440984.1| hypothetical protein sll1775 [Synechocystis sp. PCC 6803]
gi|383321997|ref|YP_005382850.1| hypothetical protein SYNGTI_1088 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383325166|ref|YP_005386019.1| hypothetical protein SYNPCCP_1087 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383491050|ref|YP_005408726.1| hypothetical protein SYNPCCN_1087 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384436317|ref|YP_005651041.1| hypothetical protein SYNGTS_1088 [Synechocystis sp. PCC 6803]
gi|451814414|ref|YP_007450866.1| hypothetical protein MYO_110980 [Synechocystis sp. PCC 6803]
gi|1652745|dbj|BAA17664.1| sll1775 [Synechocystis sp. PCC 6803]
gi|339273349|dbj|BAK49836.1| hypothetical protein SYNGTS_1088 [Synechocystis sp. PCC 6803]
gi|359271316|dbj|BAL28835.1| hypothetical protein SYNGTI_1088 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359274486|dbj|BAL32004.1| hypothetical protein SYNPCCN_1087 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359277656|dbj|BAL35173.1| hypothetical protein SYNPCCP_1087 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407958168|dbj|BAM51408.1| hypothetical protein BEST7613_2477 [Bacillus subtilis BEST7613]
gi|451780383|gb|AGF51352.1| hypothetical protein MYO_110980 [Synechocystis sp. PCC 6803]
Length = 817
Score = 54.7 bits (130), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 49/91 (53%), Gaps = 3/91 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + A +N + +ALA ++ T F G +GA NG+ P+G +
Sbjct: 660 MADQLCGQFY--ARLLNLPDVVEHHYAEKALAKVYDTCFLKFAHGELGAANGLLPDGSPQ 717
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
+ + EVWTG+ + L+A ++ G +EA
Sbjct: 718 NPNDTHPLEVWTGINFGLAAFLIQMGMKEEA 748
>gi|428317101|ref|YP_007114983.1| Glucosylceramidase [Oscillatoria nigro-viridis PCC 7112]
gi|428240781|gb|AFZ06567.1| Glucosylceramidase [Oscillatoria nigro-viridis PCC 7112]
Length = 809
Score = 54.3 bits (129), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 54/105 (51%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + K G+ I P AL T++ + F G GA NG+ NG
Sbjct: 669 MADQLCGQFYAKLLGLPD--IVPPECAVSALETVYESCFLKFNDGEFGAANGVMLNGSPE 726
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ +A EVWTG+ + L+A ++ G ++A+ + +++ E+
Sbjct: 727 NPNATHPLEVWTGINFGLAAFLVQMGMEEKAFKLTDAVVKQIYEN 771
>gi|357139475|ref|XP_003571307.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
distachyon]
Length = 946
Score = 54.3 bits (129), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 43/101 (42%), Positives = 58/101 (57%), Gaps = 5/101 (4%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMK---PNG 57
MADQL G + +A G+ E + RALA + NV + G +GAVNG + G
Sbjct: 735 MADQLAGQWYARACGL--EPVVGRDKARRALAAVLEHNVMQVQGGGVGAVNGARLPEHGG 792
Query: 58 DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
D S+ QS+EVWTG TY ++AAM+ EG +E +T A G Y
Sbjct: 793 GVDESSTQSKEVWTGTTYAVAAAMIGEGMREEGFTAAKGAY 833
>gi|220909879|ref|YP_002485190.1| hypothetical protein Cyan7425_4519 [Cyanothece sp. PCC 7425]
gi|219866490|gb|ACL46829.1| protein of unknown function DUF608 [Cyanothece sp. PCC 7425]
Length = 814
Score = 54.3 bits (129), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 48/91 (52%), Gaps = 3/91 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQLCG + + G+ + P AL ++ T F G++GA NG++P G +
Sbjct: 663 MADQLCGQFYARLVGL--PDVVPEDCARAALDMVYQTCFLQFHNGTLGAANGLRPGGLAE 720
Query: 61 RS-AIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
+ A EVWTG+ + L+A ++ G EA
Sbjct: 721 KPDATHPLEVWTGINFGLAAFLVQMGRKAEA 751
>gi|172036524|ref|YP_001803025.1| hypothetical protein cce_1609 [Cyanothece sp. ATCC 51142]
gi|354553307|ref|ZP_08972614.1| Glucosylceramidase [Cyanothece sp. ATCC 51472]
gi|171697978|gb|ACB50959.1| unknown [Cyanothece sp. ATCC 51142]
gi|353555137|gb|EHC24526.1| Glucosylceramidase [Cyanothece sp. ATCC 51472]
Length = 818
Score = 54.3 bits (129), Expect = 1e-05, Method: Composition-based stats.
Identities = 31/105 (29%), Positives = 53/105 (50%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQL G + + G+ + AL ++ F+ G GA NGMKP+G
Sbjct: 678 MADQLSGQFYARLLGL--PDVVEQQYALSALKKVYEACFLKFQNGKYGAANGMKPDGTPE 735
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
D ++ +EVWTG+ + L++ ++ G DEA + +++ E+
Sbjct: 736 DPNSTHPQEVWTGINFGLASFLIQMGMKDEALKLTEAVVKQVYEN 780
>gi|126657976|ref|ZP_01729128.1| hypothetical protein CY0110_05157 [Cyanothece sp. CCY0110]
gi|126620614|gb|EAZ91331.1| hypothetical protein CY0110_05157 [Cyanothece sp. CCY0110]
Length = 804
Score = 54.3 bits (129), Expect = 1e-05, Method: Composition-based stats.
Identities = 31/105 (29%), Positives = 54/105 (51%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQL G + + G+ + + AL ++ F+ G GA NGMKP+G
Sbjct: 664 MADQLSGQFYARLLGL--PDVVENQYALSALKKVYEACFLKFQNGKYGAANGMKPDGTPE 721
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
D ++ +EVWTG+ + L++ ++ G DEA + +++ E+
Sbjct: 722 DPNSTHPQEVWTGINFGLASFLIQMGMKDEALKLTEAVVKQVYEN 766
>gi|425455967|ref|ZP_18835678.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9807]
gi|389803038|emb|CCI17984.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9807]
Length = 784
Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 49/105 (46%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
M DQLCG + A ++ + + AL+ I+ F G GA NG+KP+G
Sbjct: 643 MTDQLCGQFY--ARLLSLPDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPE 700
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVW G+ + L A +L G + W + R++ E+
Sbjct: 701 NPEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQIYEN 745
>gi|282897193|ref|ZP_06305195.1| Protein of unknown function DUF608 [Raphidiopsis brookii D9]
gi|281197845|gb|EFA72739.1| Protein of unknown function DUF608 [Raphidiopsis brookii D9]
Length = 798
Score = 53.9 bits (128), Expect = 1e-05, Method: Composition-based stats.
Identities = 34/102 (33%), Positives = 49/102 (48%), Gaps = 3/102 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQL G + A +N I P AL TI+ F+ G GA NG+ P+G
Sbjct: 661 MADQLSGQFY--ARLLNLPDIVPQDRALSALTTIYDACFLKFQDGKFGAANGVLPDGSPE 718
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRL 101
+ A EVWTG+ + L+A +L G D+ + R++
Sbjct: 719 NPQATHPLEVWTGINFGLAAFLLQMGMKDQGLRLTQAVVRQI 760
>gi|434407175|ref|YP_007150060.1| putative bile acid beta-glucosidase [Cylindrospermum stagnale PCC
7417]
gi|428261430|gb|AFZ27380.1| putative bile acid beta-glucosidase [Cylindrospermum stagnale PCC
7417]
Length = 816
Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 47/91 (51%), Gaps = 3/91 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
MADQLCG + + G+ I AL T++ F G GA NG++P+G
Sbjct: 675 MADQLCGQFYARLLGL--PDIVKCDRALSALQTVYDACFLKFFDGQFGAANGVRPDGVPE 732
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
+ +A EVWTG+ + L+A ++ G DEA
Sbjct: 733 NPNATHPLEVWTGINFGLAAFLVQMGMQDEA 763
>gi|428223619|ref|YP_007107716.1| glucosylceramidase [Geitlerinema sp. PCC 7407]
gi|427983520|gb|AFY64664.1| Glucosylceramidase [Geitlerinema sp. PCC 7407]
Length = 821
Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 3/90 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + + ++ + S AL T++ + F G GA NG+KP+G
Sbjct: 670 MADQLCGQFYARL--LDLPDVVASDRALSALKTVYDSCFLRFHDGKFGAANGVKPDGSPE 727
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 89
+ A EVWTG+ + ++A ++ G DE
Sbjct: 728 NPKATHPLEVWTGINFGIAAFLVQMGLRDE 757
>gi|425450263|ref|ZP_18830094.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 7941]
gi|389768962|emb|CCI06046.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 7941]
Length = 784
Score = 53.5 bits (127), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 48/105 (45%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
M DQLCG + + + + + AL+ I+ F G GA NG+KP+G
Sbjct: 643 MTDQLCGQFYARLLAL--PDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPE 700
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVW G+ + L A +L G + W + R++ E+
Sbjct: 701 NPEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITQAVVRQIYEN 745
>gi|422303239|ref|ZP_16390593.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9806]
gi|389791807|emb|CCI12404.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9806]
Length = 784
Score = 53.5 bits (127), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 50/105 (47%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
M DQLCG + A ++ + + AL+ I+ + F G GA NG+KP+G
Sbjct: 643 MTDQLCGQFY--ARLLSLPDVVENQYTQSALSKIYQSCFLKFHGGKFGAANGLKPDGTPE 700
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVW G+ + L A ++ G + W + R++ E+
Sbjct: 701 NPEATHPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVRQVYEN 745
>gi|414077215|ref|YP_006996533.1| beta-glucosidase [Anabaena sp. 90]
gi|413970631|gb|AFW94720.1| putative beta-glucosidase [Anabaena sp. 90]
Length = 795
Score = 53.5 bits (127), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 54/105 (51%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
MADQLCG + A ++ I PS AL ++ + F+ G GA NG+ PNG
Sbjct: 658 MADQLCGQFY--ARLLSLPDIVPSDRALSALTKVYDSCFLKFQNGEFGAANGVLPNGLPE 715
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ ++ EVWTG+ + L+A ++ +EA + R++ ++
Sbjct: 716 NPNSTHPLEVWTGINFGLAAFLVQMNMKNEAMRLTEAVVRQIYDN 760
>gi|218437284|ref|YP_002375613.1| hypothetical protein PCC7424_0277 [Cyanothece sp. PCC 7424]
gi|218170012|gb|ACK68745.1| protein of unknown function DUF608 [Cyanothece sp. PCC 7424]
Length = 801
Score = 53.1 bits (126), Expect = 2e-05, Method: Composition-based stats.
Identities = 31/105 (29%), Positives = 55/105 (52%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
M+DQLCG + + G+ + + P ++ AL I+ + F G GA NG++P+G
Sbjct: 664 MSDQLCGQFYARLLGL-PDVVEPQYTLS-ALRKIYESCFLQFYGGKYGAANGVRPDGSPE 721
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVWTG+ + L+A M+ G E + + +++ E+
Sbjct: 722 NPDATHPLEVWTGINFGLAAFMIQMGMKQEGLSITEAVVKQIYEN 766
>gi|425435596|ref|ZP_18816046.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9432]
gi|389679839|emb|CCH91411.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9432]
Length = 784
Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 48/105 (45%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
M DQLCG + + + + + AL+ I+ F G GA NG+KP+G
Sbjct: 643 MTDQLCGQFYARLLAL--PDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPE 700
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVW G+ + L A +L G + W + R++ E+
Sbjct: 701 NPEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQIYEN 745
>gi|425460273|ref|ZP_18839755.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9808]
gi|389827051|emb|CCI21967.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9808]
Length = 784
Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 48/105 (45%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
M DQLCG + + + + + AL+ I+ F G GA NG+KP+G
Sbjct: 643 MTDQLCGQFYARLLAL--PDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPE 700
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVW G+ + L A +L G + W + R++ E+
Sbjct: 701 NPEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQIYEN 745
>gi|443325334|ref|ZP_21054034.1| putative bile acid beta-glucosidase [Xenococcus sp. PCC 7305]
gi|442795063|gb|ELS04450.1| putative bile acid beta-glucosidase [Xenococcus sp. PCC 7305]
Length = 808
Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats.
Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 11/109 (10%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQLCG + + G+ + + AL+ I+ F G GA NG+ P+G
Sbjct: 668 MADQLCGQFYARLLGL--PDVVENQYTKSALSKIYDACFLKFHDGKFGAANGVLPDG--- 722
Query: 61 RSAIQSE-----EVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
SA+ + EVWTG+ + L+A M+ G DEA+ + R++ E+
Sbjct: 723 -SAVNPDDTHPLEVWTGINFGLAAFMVQMGMKDEAFRLTETVVRQVYEN 770
>gi|443669591|ref|ZP_21134796.1| hypothetical protein C789_5336 [Microcystis aeruginosa DIANCHI905]
gi|159029778|emb|CAO87856.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443330115|gb|ELS44858.1| hypothetical protein C789_5336 [Microcystis aeruginosa DIANCHI905]
Length = 784
Score = 52.8 bits (125), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 48/105 (45%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
M DQLCG + + + + + AL+ I+ F G GA NG+KP+G
Sbjct: 643 MTDQLCGQFYARLLAL--PDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPE 700
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVW G+ + L A +L G + W + R++ E+
Sbjct: 701 NPEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQVYEN 745
>gi|425442611|ref|ZP_18822852.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9717]
gi|389716273|emb|CCH99457.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9717]
Length = 784
Score = 52.4 bits (124), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 49/105 (46%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
M DQLCG + A ++ + + AL+ I+ F G GA NG+KP+G
Sbjct: 643 MTDQLCGQFY--ARLLSLPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPE 700
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVW G+ + L A +L G + W + +++ E+
Sbjct: 701 NPEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVKQVYEN 745
>gi|425447791|ref|ZP_18827773.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9443]
gi|389731620|emb|CCI04385.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9443]
Length = 784
Score = 52.0 bits (123), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 49/105 (46%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
M DQLCG + A ++ + + AL+ I+ F G GA NG+KP+G
Sbjct: 643 MTDQLCGQFY--ARLLSLPDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPE 700
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVW G+ + L A +L G + W + +++ E+
Sbjct: 701 NPEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVQQIYEN 745
>gi|443311911|ref|ZP_21041533.1| putative bile acid beta-glucosidase [Synechocystis sp. PCC 7509]
gi|442777986|gb|ELR88257.1| putative bile acid beta-glucosidase [Synechocystis sp. PCC 7509]
Length = 804
Score = 52.0 bits (123), Expect = 4e-05, Method: Composition-based stats.
Identities = 32/92 (34%), Positives = 47/92 (51%), Gaps = 3/92 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + + GV I S +L T++ F G+ GA NG+K +G
Sbjct: 664 MADQLCGQFYARLLGV--PDIVESDRTVSSLKTVYEACFLKFHNGAFGAANGLKIDGSPE 721
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 91
+ A EVWTG+ + L+A +L +EA+
Sbjct: 722 NPDATHPLEVWTGINFGLAAFLLQMDMKEEAF 753
>gi|390437958|ref|ZP_10226465.1| Glycosidase-like protein [Microcystis sp. T1-4]
gi|389838618|emb|CCI30589.1| Glycosidase-like protein [Microcystis sp. T1-4]
Length = 784
Score = 51.6 bits (122), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 48/105 (45%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
M DQLCG + + + + + AL+ I+ F G GA NG+KP+G
Sbjct: 643 MTDQLCGQFYARLLAL--PDVVENQYTQSALSKIYQACFLKFHEGKFGAANGLKPDGTPE 700
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVW G+ + L A ++ G + W + R++ E+
Sbjct: 701 NPQATHPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVRQVYEN 745
>gi|300866990|ref|ZP_07111661.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300335025|emb|CBN56827.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 863
Score = 51.6 bits (122), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 47/91 (51%), Gaps = 3/91 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQLCG + + G+ I P AL T++ + F G GA NG+ +G +
Sbjct: 722 MADQLCGQFYARLLGL--PDIVPEECAISALETVYESCFIKFNQGEFGAANGVMLDGSPE 779
Query: 61 RS-AIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
+ A EVWTG+ + L+A M+ G ++A
Sbjct: 780 KPGATHPLEVWTGINFGLAAFMMQMGMKEKA 810
>gi|209526014|ref|ZP_03274547.1| protein of unknown function DUF608 [Arthrospira maxima CS-328]
gi|376002240|ref|ZP_09780079.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423067988|ref|ZP_17056778.1| hypothetical protein SPLC1_S630380 [Arthrospira platensis C1]
gi|209493540|gb|EDZ93862.1| protein of unknown function DUF608 [Arthrospira maxima CS-328]
gi|375329367|emb|CCE15832.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406710526|gb|EKD05737.1| hypothetical protein SPLC1_S630380 [Arthrospira platensis C1]
Length = 799
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 32/95 (33%), Positives = 47/95 (49%), Gaps = 3/95 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + A+ + I P AL TI+ + F G GA NG++ G+
Sbjct: 660 MADQLCGQFY--AALLGLPDIVPPHRAQVALQTIYQACFQNFHNGKFGAANGVRTTGEPM 717
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTA 94
+ + EVWTG+ + L A + G +EA+ A
Sbjct: 718 NPNDTHPLEVWTGINFGLGAFFIQMGMKEEAFQLA 752
>gi|409991122|ref|ZP_11274412.1| hypothetical protein APPUASWS_08895 [Arthrospira platensis str.
Paraca]
gi|291568810|dbj|BAI91082.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409938009|gb|EKN79383.1| hypothetical protein APPUASWS_08895 [Arthrospira platensis str.
Paraca]
Length = 799
Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats.
Identities = 32/95 (33%), Positives = 47/95 (49%), Gaps = 3/95 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
MADQLCG + A+ + I P AL TI+ + F G GA NG++ G+
Sbjct: 660 MADQLCGQFY--AALLGLPDIVPHHRARVALQTIYQACFQNFHNGKFGAANGVRTTGEPI 717
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTA 94
+ EVWTG+ + L A ++ G +EA+ A
Sbjct: 718 NPQDTHPLEVWTGINFGLGAFLIQMGMKEEAFQLA 752
>gi|440753145|ref|ZP_20932348.1| hypothetical protein O53_1523 [Microcystis aeruginosa TAIHU98]
gi|440177638|gb|ELP56911.1| hypothetical protein O53_1523 [Microcystis aeruginosa TAIHU98]
Length = 789
Score = 51.6 bits (122), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 48/105 (45%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
M DQLCG + + + + + AL+ I+ F G GA NG+KP+G
Sbjct: 643 MTDQLCGQFYARLLAL--PDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPE 700
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVW G+ + L A +L G + W + +++ E+
Sbjct: 701 NPEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVKQVYEN 745
>gi|425471589|ref|ZP_18850441.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9701]
gi|389882505|emb|CCI37035.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9701]
Length = 784
Score = 51.2 bits (121), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 47/105 (44%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
M DQLCG + + + + + AL I+ F G GA NG+KP+G
Sbjct: 643 MTDQLCGQFYARLLAL--PDVVENQYTQSALGKIYQACFLKFHGGKFGAANGLKPDGTPE 700
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVW G+ + L A +L G + W + +++ E+
Sbjct: 701 NPQATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVKQVYEN 745
>gi|425464233|ref|ZP_18843555.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9809]
gi|389833806|emb|CCI21373.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9809]
Length = 784
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 48/105 (45%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
M DQLCG + + + + + AL+ I+ F G GA NG+KP+G
Sbjct: 643 MTDQLCGQFYSRLLAL--PDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPE 700
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVW G+ + L A ++ G + W + +++ E+
Sbjct: 701 NPQATHPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVKQVYEN 745
>gi|166363582|ref|YP_001655855.1| glycosidase-like protein [Microcystis aeruginosa NIES-843]
gi|166085955|dbj|BAG00663.1| glycosidase-like protein [Microcystis aeruginosa NIES-843]
Length = 784
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 48/105 (45%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
M DQLCG + + + + + AL+ I+ F G GA NG+KP+G
Sbjct: 643 MTDQLCGQFYSRLLAL--PDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPE 700
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVW G+ + L A ++ G + W + +++ E+
Sbjct: 701 NPQATHPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVKQVYEN 745
>gi|37522029|ref|NP_925406.1| hypothetical protein gll2460 [Gloeobacter violaceus PCC 7421]
gi|35213028|dbj|BAC90401.1| gll2460 [Gloeobacter violaceus PCC 7421]
Length = 867
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 38/85 (44%), Gaps = 2/85 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQLCG + A N I P AL T++ T F G G NG +G
Sbjct: 718 MADQLCGEYY--AQVCNLADIVPEPQARSALETVYQTCFVKFYGGRFGCANGTNADGSFI 775
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEG 85
EVWTG+ + L+A M+ G
Sbjct: 776 GDTEHPSEVWTGINFGLAAFMIRNG 800
>gi|434389621|ref|YP_007100232.1| putative bile acid beta-glucosidase [Chamaesiphon minutus PCC 6605]
gi|428020611|gb|AFY96705.1| putative bile acid beta-glucosidase [Chamaesiphon minutus PCC 6605]
Length = 818
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 31/98 (31%), Positives = 50/98 (51%), Gaps = 9/98 (9%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMK----PN 56
MADQLCG + + + + I P AL T++ + F+ G GA NG+K P+
Sbjct: 680 MADQLCGQFYTQL--LELDDIVPPDCAKTALQTVYQSCFLNFQGGKFGAANGVKIDGSPS 737
Query: 57 GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTA 94
+D + EVWTG+ + ++A ++ G +EA A
Sbjct: 738 NPKDTHPL---EVWTGINFGIAAFLVQMGMQEEALKLA 772
>gi|269925332|ref|YP_003321955.1| hypothetical protein Tter_0211 [Thermobaculum terrenum ATCC
BAA-798]
gi|269788992|gb|ACZ41133.1| protein of unknown function DUF608 [Thermobaculum terrenum ATCC
BAA-798]
Length = 811
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 57/126 (45%), Gaps = 20/126 (15%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNV-----------RGFEAGS-MG 48
++DQL G +G+ + P +I+RAL +IF+ N R + G G
Sbjct: 598 LSDQLFGQTLASLTGLGY--LLPKEHISRALESIFAYNFKPNFWNHTNTQRVYALGDDAG 655
Query: 49 AVNGMKPNGDRDRSAIQ-SEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVS 107
V P GDR S+EVW+G Y ++ M+YEG +DEA T R DR
Sbjct: 656 LVMCTWPFGDRPSFPFPYSDEVWSGTEYQVATLMIYEGLLDEALTII-----RATRDRYD 710
Query: 108 GCETAP 113
G + P
Sbjct: 711 GFKRNP 716
>gi|443320525|ref|ZP_21049620.1| putative bile acid beta-glucosidase [Gloeocapsa sp. PCC 73106]
gi|442789771|gb|ELR99409.1| putative bile acid beta-glucosidase [Gloeocapsa sp. PCC 73106]
Length = 806
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 29/102 (28%), Positives = 50/102 (49%), Gaps = 3/102 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQL G + + G+ + + L TI+ + F G +GA NG+K +G
Sbjct: 664 MADQLAGQFYAQLLGL--PDVVEKEYVQSTLKTIYQSCFLNFHQGKIGAANGVKLDGTGE 721
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRL 101
+ + EVWTG+ + L A +++ G DE A + +++
Sbjct: 722 NPQSTHPLEVWTGINFALGAFLIHSGMRDEGLKLAETIVKQI 763
>gi|428770169|ref|YP_007161959.1| glucosylceramidase [Cyanobacterium aponinum PCC 10605]
gi|428684448|gb|AFZ53915.1| Glucosylceramidase [Cyanobacterium aponinum PCC 10605]
Length = 802
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 30/105 (28%), Positives = 48/105 (45%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
MADQLCG + + G+ + S + L ++ F G G NG+KP+G
Sbjct: 663 MADQLCGQFYAQLLGL--PDVVESKYVKSTLRKVYDACFLKFHDGKFGIANGVKPDGKPV 720
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ EVWTG+ Y + A M+ G E A + +++ E+
Sbjct: 721 KENDTHPLEVWTGINYGIVAFMILNGMKQEGLRVAETVVKQVYEN 765
>gi|427733995|ref|YP_007053539.1| putative bile acid beta-glucosidase [Rivularia sp. PCC 7116]
gi|427369036|gb|AFY52992.1| putative bile acid beta-glucosidase [Rivularia sp. PCC 7116]
Length = 799
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 50/104 (48%), Gaps = 3/104 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQLCG + A + I P AL +++ F G +GA NG++ +G
Sbjct: 663 MADQLCGQFY--ARLLKLPDIVPPECTMSALNSVYDACFLKFNQGKLGAANGLRLDGSPE 720
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
+ A EVWTG+ + ++A ++ G EA+ + ++ E
Sbjct: 721 NPDATHPLEVWTGINFGIAAFLVQMGMKSEAFKLTEAVVNQIYE 764
>gi|428210868|ref|YP_007084012.1| putative bile acid beta-glucosidase [Oscillatoria acuminata PCC
6304]
gi|427999249|gb|AFY80092.1| putative bile acid beta-glucosidase [Oscillatoria acuminata PCC
6304]
Length = 820
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 32/105 (30%), Positives = 52/105 (49%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
MADQLCG + + G+ I P AL T++ + F G GA NG+ +G
Sbjct: 677 MADQLCGQFYARLLGLAD--IVPIDCAESALKTVYESCFLKFHNGEFGAANGVLIDGSPV 734
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ A EVWTG+ + L+A +L G +E + + +++ E+
Sbjct: 735 NPDATHPLEVWTGINFGLAAFLLQMGMKEEGFKLTETVIQQVYEN 779
>gi|282900195|ref|ZP_06308151.1| protein of unknown function DUF608 [Cylindrospermopsis raciborskii
CS-505]
gi|281194944|gb|EFA69885.1| protein of unknown function DUF608 [Cylindrospermopsis raciborskii
CS-505]
Length = 789
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 33/104 (31%), Positives = 49/104 (47%), Gaps = 3/104 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
MADQL G + A +N I P AL TI+ F+ G GA NG+ +G
Sbjct: 652 MADQLSGQFY--ARLLNLPDIVPKDRALSALTTIYDACFLKFQDGKFGAANGVLLDGSPE 709
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
+ A EVWTG+ + L+A +L G ++ + R++ E
Sbjct: 710 NPQATHPLEVWTGINFGLAAFLLQMGMKEQGLRLTEAVVRQVYE 753
>gi|434400779|ref|YP_007134783.1| Glucosylceramidase [Stanieria cyanosphaera PCC 7437]
gi|428271876|gb|AFZ37817.1| Glucosylceramidase [Stanieria cyanosphaera PCC 7437]
Length = 804
Score = 48.5 bits (114), Expect = 5e-04, Method: Composition-based stats.
Identities = 32/105 (30%), Positives = 50/105 (47%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
MADQLCG + + G+ + AL I+ F G GA NG+ P+G
Sbjct: 664 MADQLCGQFYARLLGL--PDVVEEQYTKSALKKIYQACFLKFHNGKYGAANGVLPDGSPV 721
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ + EVWTG+ + L+A ML G EA+ + +++ E+
Sbjct: 722 NPNDTHPLEVWTGINFGLAAFMLQMGMKAEAFQLTETVVKQVYEN 766
>gi|88809081|ref|ZP_01124590.1| hypothetical protein WH7805_05296 [Synechococcus sp. WH 7805]
gi|88787023|gb|EAR18181.1| hypothetical protein WH7805_05296 [Synechococcus sp. WH 7805]
Length = 849
Score = 48.5 bits (114), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 48/93 (51%), Gaps = 5/93 (5%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
MADQLCG + + G+ E + AN L + + F+ GS+G NG++ +G
Sbjct: 700 MADQLCGDFYARLLGL--EPVVSDANSRSTLKAVKESCFEKFQGGSLGVANGLRRDGTPL 757
Query: 60 DRSAIQSEEVWTGVTYLLSA--AMLYEGNVDEA 90
D + EVWTG+ + +++ ++ EG EA
Sbjct: 758 DPNGTHPLEVWTGINFGIASYYQLMGEGQTAEA 790
>gi|320105716|ref|YP_004181306.1| glucosylceramidase [Terriglobus saanensis SP1PR4]
gi|319924237|gb|ADV81312.1| Glucosylceramidase [Terriglobus saanensis SP1PR4]
Length = 794
Score = 48.1 bits (113), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 52/105 (49%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
ADQL G + +G+ + P A L I++ NV F G++GAVNGM +G
Sbjct: 656 QADQLAGQWYAHMTGLGD--LVPRAMQRSVLKKIYAFNVNQFGDGNLGAVNGMNADGTV- 712
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDR 105
+++ EVW G T +A + G +E++ GL+ + E +
Sbjct: 713 VDNVEAREVWAGTTLGYAALLKSLGIEEESYHVVHGLFHVIYESK 757
>gi|317969095|ref|ZP_07970485.1| hypothetical protein SCB02_06140 [Synechococcus sp. CB0205]
Length = 837
Score = 48.1 bits (113), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 45/91 (49%), Gaps = 3/91 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
MADQLCG + + G+ E + AN L + + FE GS+G NG++ +G
Sbjct: 689 MADQLCGDFYARLLGL--EPVVSEANSRSTLNAVKDSCFEKFEGGSLGVANGLRRDGTPL 746
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
D + EVWTG+ + +++ G + A
Sbjct: 747 DPNGTHPLEVWTGINFGIASYFRLMGETETA 777
>gi|116072378|ref|ZP_01469645.1| hypothetical protein BL107_11341 [Synechococcus sp. BL107]
gi|116064900|gb|EAU70659.1| hypothetical protein BL107_11341 [Synechococcus sp. BL107]
Length = 820
Score = 48.1 bits (113), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 49/102 (48%), Gaps = 3/102 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
MADQLCG + + G+ ++ AN L + FEAG++G NG++ +G
Sbjct: 672 MADQLCGDFYARLLGL--PSVVSDANALSTLNVVREACFERFEAGTLGVANGLRRDGTPL 729
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRL 101
D + EVWTG+ + +++ G D A G + ++
Sbjct: 730 DPNGTHPLEVWTGINFGIASYFRLMGQSDTALAITGAVVNQV 771
>gi|148240031|ref|YP_001225418.1| hypothetical protein SynWH7803_1695 [Synechococcus sp. WH 7803]
gi|147848570|emb|CAK24121.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
Length = 833
Score = 47.8 bits (112), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 48/93 (51%), Gaps = 5/93 (5%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
MADQLCG + + G+ E + AN L + + F+ GS+G NG++ +G
Sbjct: 685 MADQLCGDFYARLLGL--EPVVSEANSRSTLKAVKESCFEKFQGGSLGVANGLRRDGTPL 742
Query: 60 DRSAIQSEEVWTGVTYLLSA--AMLYEGNVDEA 90
D + EVWTG+ + +++ ++ EG EA
Sbjct: 743 DPNGTHPLEVWTGINFGIASYYRLMGEGQTAEA 775
>gi|318042148|ref|ZP_07974104.1| hypothetical protein SCB01_10580 [Synechococcus sp. CB0101]
Length = 836
Score = 47.8 bits (112), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 44/91 (48%), Gaps = 3/91 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
MADQLCG + + G+ A+ AN L + FE G +G NG++ +G
Sbjct: 688 MADQLCGDFYARLLGL--PAVVSEANSRSTLKAVKEACFEQFEGGKLGVANGLRRDGTPL 745
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
D + EVWTG+ + +++ G+ D A
Sbjct: 746 DPNGTHPLEVWTGINFGIASYYRLMGDTDTA 776
>gi|428773649|ref|YP_007165437.1| glucosylceramidase [Cyanobacterium stanieri PCC 7202]
gi|428687928|gb|AFZ47788.1| Glucosylceramidase [Cyanobacterium stanieri PCC 7202]
Length = 800
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 28/95 (29%), Positives = 45/95 (47%), Gaps = 3/95 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
MADQLCG + A + + + L I+ F G++G NG+KP+G +
Sbjct: 663 MADQLCGQYY--AQLLKLPDVVEKEYVTATLNKIYDACYLKFHNGTLGIANGVKPDGSPE 720
Query: 61 R-SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTA 94
+ + E+WTG+ Y + A M+ G +E A
Sbjct: 721 KENDTHPLEIWTGINYGIVAFMILNGMKEEGLKVA 755
>gi|290990718|ref|XP_002677983.1| predicted protein [Naegleria gruberi]
gi|284091593|gb|EFC45239.1| predicted protein [Naegleria gruberi]
Length = 937
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 21/71 (29%), Positives = 41/71 (57%), Gaps = 2/71 (2%)
Query: 22 FPSANIARALATIFSTNVRGFEAGSM--GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSA 79
F ++++L I N ++ + G VNGM+P + D++++QS E+WTG +Y+++A
Sbjct: 812 FNMERVSKSLNKILENNFVKYQQITKLGGCVNGMRPTQEVDKTSLQSREMWTGTSYVVAA 871
Query: 80 AMLYEGNVDEA 90
+ G +E
Sbjct: 872 LTILLGRREEG 882
>gi|254431466|ref|ZP_05045169.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
gi|197625919|gb|EDY38478.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
Length = 837
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 45/102 (44%), Gaps = 3/102 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
MADQLCG + + + + AL I GF+ GS+G NG++ +G
Sbjct: 686 MADQLCGDFYARL--LELPPVVAEERARSALRAIREACFEGFQGGSLGVANGLRRDGTPL 743
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRL 101
D EVWTG+ + L+A G D A G + ++
Sbjct: 744 DPDGTHPLEVWTGINFGLAAYYRLMGETDTALAITGAVVHQV 785
>gi|86144926|ref|ZP_01063258.1| hypothetical protein MED222_10953 [Vibrio sp. MED222]
gi|85837825|gb|EAQ55937.1| hypothetical protein MED222_10953 [Vibrio sp. MED222]
Length = 1024
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 38/116 (32%), Positives = 54/116 (46%), Gaps = 9/116 (7%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMKPNGDR 59
D L +LK G+ E IFP NI RAL ++ N FE S +G N + +G
Sbjct: 820 GDALLADSYLKLIGL--EGIFPQENIQRALDYVYKHN---FEINSPKLGVANMTQADGS- 873
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
A Q+++VW GV + ++ A+ G +A T +Y L D APEG
Sbjct: 874 PHEAFQAQDVWIGVQFSVATALSLAGKSQQAETLMDTVYTALY-DYSKIPFAAPEG 928
>gi|78185056|ref|YP_377491.1| hypothetical protein Syncc9902_1489 [Synechococcus sp. CC9902]
gi|78169350|gb|ABB26447.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 823
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/102 (27%), Positives = 48/102 (47%), Gaps = 3/102 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
MADQLCG + + G+ ++ AN L + FE G++G NG++ +G
Sbjct: 675 MADQLCGDFYARLLGL--PSVVSDANALSTLNVVRDACFERFEGGTLGVANGLRRDGTPL 732
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRL 101
D + EVWTG+ + +++ G D A G + ++
Sbjct: 733 DPNGTHPLEVWTGINFGIASYFRLMGQSDTALAITGAVVNQV 774
>gi|148242848|ref|YP_001228005.1| hypothetical protein SynRCC307_1749 [Synechococcus sp. RCC307]
gi|147851158|emb|CAK28652.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
Length = 220
Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 43/91 (47%), Gaps = 3/91 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
MADQLCG + + G+ + AN +L I GF G +G NG++ +G
Sbjct: 71 MADQLCGDFYARLLGL--PPVVSEANARSSLQAIKEACFEGFAGGRLGVANGLRRDGTPL 128
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
D + EVWTG+ + L+A G + A
Sbjct: 129 DPNGTHPLEVWTGINFGLAAYYRLMGETNTA 159
>gi|428307493|ref|YP_007144318.1| glucosylceramidase [Crinalium epipsammum PCC 9333]
gi|428249028|gb|AFZ14808.1| Glucosylceramidase [Crinalium epipsammum PCC 9333]
Length = 811
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 33/103 (32%), Positives = 49/103 (47%), Gaps = 14/103 (13%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIF-----------STNVRGFEAGSMGA 49
MADQLCG + A +N I P+ + AL T++ S + +GA
Sbjct: 663 MADQLCGQFY--ARLLNLPDIVPNDCVESALKTVYDSCFLKFNQKLSQHSESLTPTFIGA 720
Query: 50 VNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 91
NG+ +G + A EVWTG+ + L+A M+ G DEA+
Sbjct: 721 ANGVLADGSPENPQATHPLEVWTGINFGLAAFMIQMGMKDEAF 763
>gi|284929778|ref|YP_003422300.1| putative bile acid beta-glucosidase [cyanobacterium UCYN-A]
gi|284810222|gb|ADB95919.1| predicted bile acid beta-glucosidase [cyanobacterium UCYN-A]
Length = 803
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 26/91 (28%), Positives = 47/91 (51%), Gaps = 3/91 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-R 59
M DQLCG + A +N + ++ AL+ I+ F G GA NG++ +G
Sbjct: 664 MTDQLCGQFY--ARLLNLPDVVDTSCTISALSKIYEACFLNFYDGQYGAANGVRSDGTPE 721
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
+ ++ +EVW G+ + L+A ++ G ++A
Sbjct: 722 NPNSTHPQEVWVGINFGLAAFLIQMGMKNQA 752
>gi|87125468|ref|ZP_01081313.1| hypothetical protein RS9917_01806 [Synechococcus sp. RS9917]
gi|86166768|gb|EAQ68030.1| hypothetical protein RS9917_01806 [Synechococcus sp. RS9917]
Length = 832
Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 37/76 (48%), Gaps = 3/76 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
MADQLCG + + + A+ AN L + + F+ GS+G NG++ +G
Sbjct: 684 MADQLCGDFYARL--LKLPAVVSDANCRSTLKAVKESCFEAFQGGSLGVANGLRRDGTPL 741
Query: 60 DRSAIQSEEVWTGVTY 75
D EVWTG+ +
Sbjct: 742 DPQGTHPLEVWTGINF 757
>gi|116073260|ref|ZP_01470522.1| hypothetical protein RS9916_32457 [Synechococcus sp. RS9916]
gi|116068565|gb|EAU74317.1| hypothetical protein RS9916_32457 [Synechococcus sp. RS9916]
Length = 832
Score = 45.8 bits (107), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 46/93 (49%), Gaps = 5/93 (5%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
MADQLCG + + G+ E + AN L + F+ G +G NG++ +G
Sbjct: 684 MADQLCGDFYARLLGL--EPVVSEANSRSTLKAVREACFEKFDGGKLGVANGLRRDGTPL 741
Query: 60 DRSAIQSEEVWTGVTYLLSA--AMLYEGNVDEA 90
D + EVWTG+ + +++ ++ EG EA
Sbjct: 742 DPNGTHPLEVWTGINFGIASYYRLMGEGQTAEA 774
>gi|427702545|ref|YP_007045767.1| bile acid beta-glucosidase [Cyanobium gracile PCC 6307]
gi|427345713|gb|AFY28426.1| putative bile acid beta-glucosidase [Cyanobium gracile PCC 6307]
Length = 835
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 50/107 (46%), Gaps = 9/107 (8%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTN---VRGFEAGSMGAVNGMKPNG 57
MADQLCG + + A+ P RAL+++ + FE G +G NG++ +G
Sbjct: 686 MADQLCGDFYAR-----LLALPPVVADERALSSLKAIRQACFESFEGGRLGVANGLRRDG 740
Query: 58 DR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
D EVWTG+ + L+A G A+ G + R++ E
Sbjct: 741 TPLDPEGTHPLEVWTGINFGLAAYYRLMGQSSTAFAITGAVVRQVYE 787
>gi|81299663|ref|YP_399871.1| hypothetical protein Synpcc7942_0854 [Synechococcus elongatus PCC
7942]
gi|81168544|gb|ABB56884.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 798
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/105 (24%), Positives = 47/105 (44%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
MADQLCG + + + + P ++ + F +G G NG+ P+G
Sbjct: 651 MADQLCGDFYSRL--LQLPPVAPLEAAQSTADMVYEACFQKFHSGQFGLANGLLPDGSPV 708
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
D EVWTG+ + ++A L G+ D + + R++ ++
Sbjct: 709 DPKGTHPLEVWTGINFGIAAYWLLLGHRDRCFEVTETVIRQIYDN 753
>gi|84386772|ref|ZP_00989797.1| hypothetical protein V12B01_19101 [Vibrio splendidus 12B01]
gi|84378300|gb|EAP95158.1| hypothetical protein V12B01_19101 [Vibrio splendidus 12B01]
Length = 908
Score = 44.7 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 38/116 (32%), Positives = 53/116 (45%), Gaps = 9/116 (7%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMKPNGDR 59
D L +LK G+ E IFP NI RAL ++ N FE S +G N +G
Sbjct: 704 GDALLADSYLKLIGL--EGIFPQENIQRALDYVYKHN---FEINSPKLGVANMTLADGS- 757
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
A Q+++VW GV + ++ A+ G +A T +Y L D APEG
Sbjct: 758 PHEAFQAQDVWIGVQFSVATALSLAGKNQQAETLMDTVYTALY-DYSKIPFAAPEG 812
>gi|87301122|ref|ZP_01083963.1| hypothetical protein WH5701_14591 [Synechococcus sp. WH 5701]
gi|87284090|gb|EAQ76043.1| hypothetical protein WH5701_14591 [Synechococcus sp. WH 5701]
Length = 830
Score = 44.3 bits (103), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 40/91 (43%), Gaps = 3/91 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
MADQLCG + + G+ + +L I FE G +G NG++ +G
Sbjct: 681 MADQLCGDFYARLLGLPQ--VVSDERALSSLKAIREACFERFEGGRLGVANGLRRDGTPL 738
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
D EVWTG+ + L+A G D A
Sbjct: 739 DPKGTHPLEVWTGINFGLAAYYRLMGQTDTA 769
>gi|148976285|ref|ZP_01813009.1| hypothetical protein VSWAT3_18783 [Vibrionales bacterium SWAT-3]
gi|145964379|gb|EDK29634.1| hypothetical protein VSWAT3_18783 [Vibrionales bacterium SWAT-3]
Length = 982
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 36/116 (31%), Positives = 54/116 (46%), Gaps = 9/116 (7%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMKPNGDR 59
+D L +L+ G+ E IFP I RAL ++ N FE S +G N + +G
Sbjct: 778 SDALLADSYLRLIGL--EGIFPQQRIQRALDYVYKHN---FEINSPKLGVANMTRADGS- 831
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
A Q+++VW GV + ++ A+ G +A T +Y L D APEG
Sbjct: 832 PHEAFQAQDVWIGVQFSVATALSLAGKSQQAETLMDTVYTALY-DYSKIPFAAPEG 886
>gi|78212458|ref|YP_381237.1| hypothetical protein Syncc9605_0920 [Synechococcus sp. CC9605]
gi|78196917|gb|ABB34682.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 832
Score = 43.9 bits (102), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 37/76 (48%), Gaps = 3/76 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
MADQLCG + + + + AN L + F+ GS+G NG++ +G
Sbjct: 684 MADQLCGDFYARLLAL--PPVVSDANSLSTLKAVKEACFEAFDGGSLGVANGLRRDGTPL 741
Query: 60 DRSAIQSEEVWTGVTY 75
D +A EVWTG+ +
Sbjct: 742 DPNATHPLEVWTGINF 757
>gi|218677125|ref|YP_002395944.1| hypothetical protein VS_II1384 [Vibrio splendidus LGP32]
gi|218325393|emb|CAV27494.1| hypothetical protein VS_II1384 [Vibrio splendidus LGP32]
Length = 1069
Score = 43.5 bits (101), Expect = 0.015, Method: Composition-based stats.
Identities = 37/116 (31%), Positives = 53/116 (45%), Gaps = 9/116 (7%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMKPNGDR 59
D L +LK G+ + IFP NI RAL ++ N FE S +G N +G
Sbjct: 865 GDALLADSYLKLIGL--DGIFPQENIQRALDYVYKHN---FEINSPKLGVANMTLADGS- 918
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
A Q+++VW GV + ++ A+ G +A T +Y L D APEG
Sbjct: 919 PHEAFQAQDVWIGVQFSVATALSLAGKNQQAETLMDTVYTALY-DYSKIPFAAPEG 973
>gi|33866127|ref|NP_897686.1| hypothetical protein SYNW1593 [Synechococcus sp. WH 8102]
gi|33639102|emb|CAE08108.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 833
Score = 43.5 bits (101), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 44/83 (53%), Gaps = 9/83 (10%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTN---VRGFEAGSMGAVNGMKPNG 57
MADQLCG + + G+ P + ARA +T+ + FE+G +G NG++ +G
Sbjct: 685 MADQLCGDFYARLLGLP-----PVVSDARARSTLQAVKEACFERFESGRLGVANGLRRDG 739
Query: 58 DR-DRSAIQSEEVWTGVTYLLSA 79
D + EVWTG+ + L++
Sbjct: 740 SPLDPNGTHPLEVWTGINFGLAS 762
>gi|113953483|ref|YP_730032.1| hypothetical protein sync_0818 [Synechococcus sp. CC9311]
gi|113880834|gb|ABI45792.1| conserved hypothetical protein [Synechococcus sp. CC9311]
Length = 833
Score = 43.5 bits (101), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 45/93 (48%), Gaps = 5/93 (5%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
MADQLCG + + G+ + AN L + F GS+G NG++ +G
Sbjct: 685 MADQLCGDFYARLLGL--PPVVSEANSRSTLKAVKEACFDNFAGGSLGVANGLRRDGTPL 742
Query: 60 DRSAIQSEEVWTGVTYLLSA--AMLYEGNVDEA 90
D + EVWTG+ + +++ ++ EG EA
Sbjct: 743 DPNGTHPLEVWTGINFGIASYYRLMGEGKTAEA 775
>gi|260434763|ref|ZP_05788733.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
gi|260412637|gb|EEX05933.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
Length = 832
Score = 43.5 bits (101), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 37/76 (48%), Gaps = 3/76 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
MADQLCG + + G+ + +N L + F+ GS+G NG++ +G
Sbjct: 684 MADQLCGDFYARLLGL--PPVVSDSNSRSTLKVVKEACFEAFDGGSLGVANGLRRDGTPL 741
Query: 60 DRSAIQSEEVWTGVTY 75
D + EVWTG+ +
Sbjct: 742 DPNGTHPLEVWTGINF 757
>gi|302348409|ref|YP_003816047.1| trehalase [Acidilobus saccharovorans 345-15]
gi|302328821|gb|ADL19016.1| Putative trehalase [Acidilobus saccharovorans 345-15]
Length = 928
Score = 43.1 bits (100), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 54/114 (47%), Gaps = 14/114 (12%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGM------- 53
++ QL G +L+ G+ S + AL ++ N R +E G G
Sbjct: 679 LSAQLTGEWYLQVLGLGDGV--DSEKVRSALREVYRANFRRWEGLLNGTYPGTPRPSMVG 736
Query: 54 ---KPNGDR--DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLR 102
+PNG + +R + Q++ WTGV + +++ MLYEG V+EA ++ R R
Sbjct: 737 DVEEPNGTKILNRVSSQADTPWTGVEFGVASQMLYEGMVEEAMEVLRSIHDRYR 790
>gi|417950561|ref|ZP_12593680.1| hypothetical protein VISP3789_13285 [Vibrio splendidus ATCC 33789]
gi|342806239|gb|EGU41472.1| hypothetical protein VISP3789_13285 [Vibrio splendidus ATCC 33789]
Length = 1046
Score = 42.7 bits (99), Expect = 0.026, Method: Composition-based stats.
Identities = 37/116 (31%), Positives = 52/116 (44%), Gaps = 9/116 (7%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMKPNGDR 59
D L +LK G+ E IFP I RAL ++ N FE S +G N +G
Sbjct: 842 GDALLADSYLKLIGL--EGIFPKQRIQRALDYVYKHN---FEINSPKLGVANMTLADGS- 895
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
A Q+++VW GV + ++ A+ G +A T +Y L D APEG
Sbjct: 896 PHEAFQAQDVWIGVQFSVATALSLAGKSQQAETLMDTVYTALY-DYSKIPFAAPEG 950
>gi|222639789|gb|EEE67921.1| hypothetical protein OsJ_25789 [Oryza sativa Japonica Group]
Length = 919
Score = 42.7 bits (99), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 47/97 (48%), Gaps = 24/97 (24%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR 61
ADQL G + +A G+ E I RALAT+ NV M+
Sbjct: 743 ADQLAGQWYARACGL--EPIVDGDKARRALATVLDYNV-------------MR------- 780
Query: 62 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 98
S+EV GVTY ++AAM++EG + A+ TA G++
Sbjct: 781 --THSKEVCPGVTYAVAAAMIHEGMPEAAFKTAKGIH 815
>gi|56750695|ref|YP_171396.1| hypothetical protein syc0686_c [Synechococcus elongatus PCC 6301]
gi|56685654|dbj|BAD78876.1| hypothetical protein [Synechococcus elongatus PCC 6301]
Length = 798
Score = 42.4 bits (98), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 25/105 (23%), Positives = 46/105 (43%), Gaps = 3/105 (2%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
MADQLCG + + + + P ++ + F +G G NG+ P+G
Sbjct: 651 MADQLCGDFYSRL--LQLPPVAPLEAAQSTADMVYEACFQKFHSGQFGLANGLLPDGSPV 708
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
EVWTG+ + ++A L G+ D + + R++ ++
Sbjct: 709 GPKGTHPLEVWTGINFGIAAYWLLLGHRDRCFEVTETVIRQIYDN 753
>gi|407069309|ref|ZP_11100147.1| hypothetical protein VcycZ_07137 [Vibrio cyclitrophicus ZF14]
Length = 1024
Score = 42.0 bits (97), Expect = 0.045, Method: Composition-based stats.
Identities = 37/116 (31%), Positives = 52/116 (44%), Gaps = 9/116 (7%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMKPNGDR 59
D L +LK G+ E IFP I RAL ++ N FE S +G N +G
Sbjct: 820 GDALLADSYLKLIGL--EGIFPEDRIQRALDYVYKHN---FEINSPKLGVANMTLADGS- 873
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETAPEG 115
A Q+++VW GV + ++ A+ G +A T +Y L D APEG
Sbjct: 874 PHEAFQAQDVWIGVQFSVATALNLAGKSQQAETLMDTVYTALY-DYSKIPFAAPEG 928
>gi|261417086|ref|YP_003250769.1| glucosylceramidase [Fibrobacter succinogenes subsp. succinogenes
S85]
gi|385791890|ref|YP_005823013.1| hypothetical protein FSU_3272 [Fibrobacter succinogenes subsp.
succinogenes S85]
gi|261373542|gb|ACX76287.1| Glucosylceramidase [Fibrobacter succinogenes subsp. succinogenes
S85]
gi|302325406|gb|ADL24607.1| conserved domain protein [Fibrobacter succinogenes subsp.
succinogenes S85]
Length = 1047
Score = 42.0 bits (97), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 56/117 (47%), Gaps = 7/117 (5%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 59
AD + +L+ ++ E I SA L IF+TN + + +GA N ++ +G
Sbjct: 837 FADTMLADTYLRL--LDLEPITDSAKAKSNLLRIFNTNYKA-NSPLIGAANLVRKDGSPL 893
Query: 60 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED-RVSGCETAPEG 115
D Q+ +VW G+ Y + AM++ G EA A + L ++ RV APEG
Sbjct: 894 DEFNFQAHDVWIGIQYSIMTAMMFHGLEKEASVLADSMIGNLYDEARVPFA--APEG 948
>gi|33862646|ref|NP_894206.1| hypothetical protein PMT0373 [Prochlorococcus marinus str. MIT
9313]
gi|33634562|emb|CAE20548.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313]
Length = 837
Score = 41.6 bits (96), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 38/80 (47%), Gaps = 3/80 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
MADQLCG + + G+ ++ +L + GFE G +G NG+ +G
Sbjct: 685 MADQLCGDFYARLLGL--PSVVADERSRSSLNAVKEACFEGFEGGRLGVANGLCRDGMPL 742
Query: 60 DRSAIQSEEVWTGVTYLLSA 79
D EVWTG+ + L+A
Sbjct: 743 DPKGTHPLEVWTGINFGLAA 762
>gi|124023625|ref|YP_001017932.1| bile acid beta-glucosidase [Prochlorococcus marinus str. MIT 9303]
gi|123963911|gb|ABM78667.1| Predicted bile acid beta-glucosidase [Prochlorococcus marinus str.
MIT 9303]
Length = 837
Score = 41.6 bits (96), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 38/80 (47%), Gaps = 3/80 (3%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DR 59
MADQLCG + + G+ ++ +L + GFE G +G NG+ +G
Sbjct: 685 MADQLCGDFYARLLGL--PSVVADERSRSSLNAVKEACFEGFEGGRLGVANGLCRDGMPL 742
Query: 60 DRSAIQSEEVWTGVTYLLSA 79
D EVWTG+ + L+A
Sbjct: 743 DPKGTHPLEVWTGINFGLAA 762
>gi|374815490|ref|ZP_09719227.1| hypothetical protein TpriZ_16630 [Treponema primitia ZAS-1]
Length = 814
Score = 39.7 bits (91), Expect = 0.22, Method: Composition-based stats.
Identities = 34/126 (26%), Positives = 54/126 (42%), Gaps = 20/126 (15%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNV-----------RGFEAGS-MG 48
+ADQL G + +G+ + + +I +A +IF N RG+ G
Sbjct: 608 LADQLIGQFYAHLTGLG--PLVDAGHIKQAAQSIFKYNFISDFTDHPNMQRGYAVNDDKG 665
Query: 49 AVNGMKPNGDRDRSAI-QSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVS 107
+ P G R R SEE W+ Y +++ ++YEG +DE T A + DR
Sbjct: 666 LLMTTWPLGGRPRYPFFYSEEAWSRTEYHVASTLIYEGFLDEGLTIAKAV-----RDRYD 720
Query: 108 GCETAP 113
G + P
Sbjct: 721 GIKRNP 726
>gi|15922864|ref|NP_378533.1| hypothetical protein ST2529 [Sulfolobus tokodaii str. 7]
gi|15623655|dbj|BAB67642.1| putative glycosidase [Sulfolobus tokodaii str. 7]
Length = 836
Score = 39.3 bits (90), Expect = 0.32, Method: Composition-based stats.
Identities = 28/111 (25%), Positives = 48/111 (43%), Gaps = 19/111 (17%)
Query: 2 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-- 59
A QL G +L G ++ + L +I N++ E G +NG P+G R
Sbjct: 616 ASQLLGEFYLNLLGY---SLLDREKTRKTLLSIVKYNLKEEE----GVINGAYPDGYRPL 668
Query: 60 ----------DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRR 100
++I + W+GV + L++ ++YE +DEA +Y R
Sbjct: 669 MREYENPLKIKEASIHQDTPWSGVEFYLASHLIYEKMIDEAKKVLKEVYDR 719
>gi|15922862|ref|NP_378531.1| hypothetical protein ST2526 [Sulfolobus tokodaii str. 7]
gi|15623653|dbj|BAB67640.1| putative glycosidase [Sulfolobus tokodaii str. 7]
Length = 646
Score = 38.9 bits (89), Expect = 0.35, Method: Composition-based stats.
Identities = 30/115 (26%), Positives = 48/115 (41%), Gaps = 12/115 (10%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
M+ Q+ G + G+ E I I +AL +I+ N + A VN + P+G D
Sbjct: 508 MSSQILGQFWCDILGL--EPIIDEDKIVQALRSIYELNGK---ASKFCLVNSVNPDGSID 562
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEA-------WTTAGGLYRRLREDRVSG 108
Q W+ V + +SA M+ G +E W T L + + R+ G
Sbjct: 563 TETDQMRSCWSRVAFAVSAHMIIRGLKNEGIEIANREWETIKSLGKWNQSSRIDG 617
>gi|229820204|ref|YP_002881730.1| hypothetical protein Bcav_1712 [Beutenbergia cavernae DSM 12333]
gi|229566117|gb|ACQ79968.1| protein of unknown function DUF608 [Beutenbergia cavernae DSM
12333]
Length = 865
Score = 38.9 bits (89), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 53/113 (46%), Gaps = 15/113 (13%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVR-GFEA-----------GSMG 48
++DQL G + A ++ + P+A++ AL + N+R GF+ +G
Sbjct: 652 LSDQLIGQWW--AHLLDLGHVLPAAHVRSALRAVVGHNLRRGFDGWEHTQRVYADGDDVG 709
Query: 49 AVNGMKPNGDRDRSAIQ-SEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRR 100
+ P+G R A + +EVW+GV Y ++A L EG E GL+ R
Sbjct: 710 LLMCTWPHGGRPEVATRYCDEVWSGVEYEVAAHCLLEGLTAEGRAILDGLWAR 762
>gi|254423918|ref|ZP_05037636.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
gi|196191407|gb|EDX86371.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
Length = 828
Score = 38.9 bits (89), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 55/128 (42%), Gaps = 25/128 (19%)
Query: 1 MADQLCGYLFLKASG----VNTEAIFPSAN-IARALATIFSTNV-------RGFEAGS-- 46
MADQLCG G V++E + P+ I A F+ + FE
Sbjct: 663 MADQLCGQFCASTMGLPDVVDSEFVEPTLRAIYEACFVRFNQYTAQLGPQNQKFEGAQTG 722
Query: 47 ----------MGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 95
+G NG++P+G ++ EVWTG+ + L+A + +EG + EA
Sbjct: 723 YFSASELGVKVGCANGVRPDGSPQNPDDTHQLEVWTGINFGLAAFLAHEGKLHEAMEITE 782
Query: 96 GLYRRLRE 103
+ R++ E
Sbjct: 783 AVVRQVYE 790
>gi|327310049|ref|YP_004336946.1| hypothetical protein TUZN_0133 [Thermoproteus uzoniensis 768-20]
gi|326946528|gb|AEA11634.1| hypothetical protein TUZN_0133 [Thermoproteus uzoniensis 768-20]
Length = 660
Score = 38.5 bits (88), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 42/103 (40%), Gaps = 5/103 (4%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
M+ QL G + ++ I P +A AL +I+ G A N + P+G D
Sbjct: 515 MSSQLLGQFWCDM--LDLPPIAPDDKVAAALRSIYEL---GMRASKYCIPNSVTPDGKPD 569
Query: 61 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRE 103
Q WT V + ++A ML G E A +R + E
Sbjct: 570 EETPQLRSCWTRVNFAVAAHMLLRGMEREGMEVASREWRTIAE 612
>gi|160931643|ref|ZP_02079037.1| hypothetical protein CLOLEP_00474 [Clostridium leptum DSM 753]
gi|156869288|gb|EDO62660.1| hypothetical protein CLOLEP_00474 [Clostridium leptum DSM 753]
Length = 804
Score = 38.1 bits (87), Expect = 0.60, Method: Composition-based stats.
Identities = 27/107 (25%), Positives = 49/107 (45%), Gaps = 23/107 (21%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMK------ 54
++DQL G G+ +FP ++ A+ I+ N + MG+ ++
Sbjct: 606 LSDQLLGQTLAHLYGLG--HLFPEDHVKSAVFAIYKYNFKE----RMGSHKSLQRGYAYQ 659
Query: 55 ----------PNGDRDRSA-IQSEEVWTGVTYLLSAAMLYEGNVDEA 90
P+G R + + S+EVWTG+ Y ++A ++YEG + E
Sbjct: 660 DEPGLLLCSWPSGGRPKQPFVYSDEVWTGIEYQVAAGLIYEGFLQEG 706
>gi|160932020|ref|ZP_02079412.1| hypothetical protein CLOLEP_00853 [Clostridium leptum DSM 753]
gi|156869062|gb|EDO62434.1| hypothetical protein CLOLEP_00853 [Clostridium leptum DSM 753]
Length = 822
Score = 37.4 bits (85), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 32/117 (27%), Positives = 53/117 (45%), Gaps = 17/117 (14%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNV--RGFEAGS----------MG 48
++DQL G + +G+ + P ++ +A +IF N R + G MG
Sbjct: 619 LSDQLLGQFLAQEAGLGY--VLPKNHVKKAAESIFRYNFIERASDFGHVQRAYIFNDEMG 676
Query: 49 AVNGMKPNGDRDR-SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
P G+R R I EVWTG+ Y ++A ++ E ++E T + R R+D
Sbjct: 677 LTPCTWPLGERPRFPFIYFGEVWTGIEYEVAALLIREDLLEEGLTIVKAV--RDRQD 731
>gi|160934323|ref|ZP_02081710.1| hypothetical protein CLOLEP_03194 [Clostridium leptum DSM 753]
gi|156866996|gb|EDO60368.1| hypothetical protein CLOLEP_03194 [Clostridium leptum DSM 753]
Length = 806
Score = 37.4 bits (85), Expect = 1.1, Method: Composition-based stats.
Identities = 27/113 (23%), Positives = 53/113 (46%), Gaps = 15/113 (13%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMG-----AVNGMK- 54
+ADQ+ G + + + + P +I +A+ ++F N + + A+N K
Sbjct: 599 LADQVFGQMLAHLNHLGY--VLPEEHIKKAIHSVFQYNFKETMEDHVNVQRTYALNQEKG 656
Query: 55 ------PNGDRDR-SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRR 100
P G R + + S+EVW+G+ Y ++A +++EG V+E + R
Sbjct: 657 LVLCSWPQGGRPKLPFVYSDEVWSGIEYQVAAHLIFEGFVEEGLAVVKAVRER 709
>gi|388492496|gb|AFK34314.1| unknown [Medicago truncatula]
gi|388521539|gb|AFK48831.1| unknown [Medicago truncatula]
Length = 183
Score = 37.4 bits (85), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/64 (32%), Positives = 34/64 (53%), Gaps = 5/64 (7%)
Query: 53 MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETA 112
M+P+G R + W +T+L++A+ L G+V A+ T YR L +R C+T
Sbjct: 81 MRPSGSRSWAICLFITTW--LTFLIAASCLLAGSVKNAYHTK---YRDLLGERAPSCQTL 135
Query: 113 PEGV 116
+GV
Sbjct: 136 RKGV 139
>gi|217071402|gb|ACJ84061.1| unknown [Medicago truncatula]
Length = 183
Score = 37.4 bits (85), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/64 (32%), Positives = 34/64 (53%), Gaps = 5/64 (7%)
Query: 53 MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETA 112
M+P+G R + W +T+L++A+ L G+V A+ T YR L +R C+T
Sbjct: 81 MRPSGSRSWAICLFITTW--LTFLIAASCLLAGSVKNAYHTK---YRDLLGERAPSCQTL 135
Query: 113 PEGV 116
+GV
Sbjct: 136 RKGV 139
>gi|411117353|ref|ZP_11389840.1| putative bile acid beta-glucosidase [Oscillatoriales cyanobacterium
JSC-12]
gi|410713456|gb|EKQ70957.1| putative bile acid beta-glucosidase [Oscillatoriales cyanobacterium
JSC-12]
Length = 907
Score = 37.4 bits (85), Expect = 1.3, Method: Composition-based stats.
Identities = 18/45 (40%), Positives = 29/45 (64%), Gaps = 1/45 (2%)
Query: 47 MGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 90
+GA NG+KP+G + ++ EVWTG+ + L+A ++ G DEA
Sbjct: 810 IGAANGVKPDGSPENPNSTHPLEVWTGINFGLAAFLVQMGMRDEA 854
>gi|358347158|ref|XP_003637628.1| hypothetical protein MTR_093s0004 [Medicago truncatula]
gi|355503563|gb|AES84766.1| hypothetical protein MTR_093s0004 [Medicago truncatula]
Length = 115
Score = 37.4 bits (85), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/64 (32%), Positives = 34/64 (53%), Gaps = 5/64 (7%)
Query: 53 MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETA 112
M+P+G R + W +T+L++A+ L G+V A+ T YR L +R C+T
Sbjct: 13 MRPSGSRSWAICLFITTW--LTFLIAASCLLAGSVKNAYHTK---YRDLLGERAPSCQTL 67
Query: 113 PEGV 116
+GV
Sbjct: 68 RKGV 71
>gi|363807072|ref|NP_001242586.1| uncharacterized protein LOC100817583 precursor [Glycine max]
gi|255640558|gb|ACU20564.1| unknown [Glycine max]
Length = 183
Score = 37.4 bits (85), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 34/64 (53%), Gaps = 5/64 (7%)
Query: 53 MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLREDRVSGCETA 112
M+P+G R S W VT++++A+ L G+V A+ T YR L +R C+T
Sbjct: 81 MRPSGSRSWSICLFITSW--VTFIIAASCLLAGSVRNAYHTK---YRDLMGERAPSCQTL 135
Query: 113 PEGV 116
+GV
Sbjct: 136 RKGV 139
>gi|255073149|ref|XP_002500249.1| hypothetical protein MICPUN_55788 [Micromonas sp. RCC299]
gi|226515511|gb|ACO61507.1| hypothetical protein MICPUN_55788 [Micromonas sp. RCC299]
Length = 954
Score = 36.6 bits (83), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 25/42 (59%)
Query: 56 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 97
+G D + S EVW GV+Y +++ ML G DEAW A G+
Sbjct: 849 SGSVDAVSAHSNEVWPGVSYAVASHMLLAGMDDEAWELARGV 890
>gi|317152500|ref|YP_004120548.1| UspA domain-containing protein [Desulfovibrio aespoeensis Aspo-2]
gi|316942751|gb|ADU61802.1| UspA domain-containing protein [Desulfovibrio aespoeensis Aspo-2]
Length = 300
Score = 35.0 bits (79), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 14 SGVNTEAIFPSANIARAL-ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTG 72
SGV+ A+ +A AR A ++ T+V G G G++ ++P+G+ +R Q EE+++
Sbjct: 12 SGVDKCAVMAAAEFARKFEAKLYLTHVAGMAQG-WGSMEHLEPSGETERIKSQIEEMYSD 70
Query: 73 VTYLLSAAMLYEGNV 87
LLS Y+ NV
Sbjct: 71 ---LLSGIAEYQINV 82
>gi|427419169|ref|ZP_18909352.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 7375]
gi|425761882|gb|EKV02735.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 7375]
Length = 835
Score = 34.7 bits (78), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 50/118 (42%), Gaps = 30/118 (25%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIF------------------STNVRGF 42
MADQLCG + ++ I I AL TI+ + G
Sbjct: 672 MADQLCGQFCARL--MDLPDIVDQEFIDIALDTIYDACFVKFNEYAAKHSTPQNQKFEGS 729
Query: 43 EAG---------SMGAVNGMKPNGDRDRSAIQSE-EVWTGVTYLLSAAMLYEGNVDEA 90
+AG ++GA NG+KP+G + + EVWTG+ + ++A + G D+A
Sbjct: 730 QAGFFSAAQLGMAIGAANGVKPDGSPENPDDTHQLEVWTGINFGVAAFLAQMGKRDQA 787
>gi|269839491|ref|YP_003324183.1| hypothetical protein Tter_2468 [Thermobaculum terrenum ATCC
BAA-798]
gi|269791221|gb|ACZ43361.1| protein of unknown function DUF608 [Thermobaculum terrenum ATCC
BAA-798]
Length = 807
Score = 34.7 bits (78), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 53/120 (44%), Gaps = 23/120 (19%)
Query: 1 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD 60
+ADQL G L +A + + P ++ +AL +I+ N F A VN + D
Sbjct: 589 LADQLLGQL--QAHVLGMGYLVPEEHVRQALRSIWRYN---FRADLSEHVNCQRTFALND 643
Query: 61 RSAI----------------QSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRRLRED 104
+ + S+EV+TG Y ++A M+YEG ++E G + R R D
Sbjct: 644 EAGLLMCTWPRGGQPTFPFPYSDEVFTGSEYQVAAHMIYEGLLEEGLQIVGAV--RGRHD 701
>gi|148553136|ref|YP_001260718.1| RNAse III [Sphingomonas wittichii RW1]
gi|148498326|gb|ABQ66580.1| RNAse III [Sphingomonas wittichii RW1]
Length = 234
Score = 34.3 bits (77), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 39/93 (41%), Gaps = 5/93 (5%)
Query: 23 PSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAM 81
P ++R L I S ++G MK RD A+QS+ V V L A+
Sbjct: 77 PEGKLSRRLNAIVSRETCADVGRAIGVATHMKLGKQARDDGAVQSDNVLGDVVEALIGAL 136
Query: 82 LYEGNVDEAWTTAGGLYRRLREDRVSGCETAPE 114
EG +D A RR +DR +G AP+
Sbjct: 137 YLEGGLD----AAQAFVRRAWDDRATGQAQAPK 165
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.132 0.389
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,776,012,469
Number of Sequences: 23463169
Number of extensions: 62581433
Number of successful extensions: 136644
Number of sequences better than 100.0: 361
Number of HSP's better than 100.0 without gapping: 242
Number of HSP's successfully gapped in prelim test: 119
Number of HSP's that attempted gapping in prelim test: 136126
Number of HSP's gapped (non-prelim): 367
length of query: 116
length of database: 8,064,228,071
effective HSP length: 83
effective length of query: 33
effective length of database: 6,116,785,044
effective search space: 201853906452
effective search space used: 201853906452
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 69 (31.2 bits)