BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy10295
(115 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|291241331|ref|XP_002740565.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 792
Score = 132 bits (333), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 61/103 (59%), Positives = 79/103 (76%)
Query: 6 RASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 65
+ + GFE G+MGAVNGM+P+G D ++ QSEEVWTGVTY L+A M+ EG ++EA+TTA
Sbjct: 682 KMNVMGFEKGTMGAVNGMRPDGSPDTTSFQSEEVWTGVTYALAANMIQEGLLEEAFTTAS 741
Query: 66 GLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
G+Y+T YER GLGF+TPE + YRS GYMR LA++AMQ A
Sbjct: 742 GVYKTCYERCGLGFQTPEAYFANNCYRSLGYMRPLAIWAMQYA 784
>gi|449691194|ref|XP_004212588.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Hydra
magnipapillata]
Length = 169
Score = 123 bits (309), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 57/100 (57%), Positives = 78/100 (78%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
FE G MGAVNGM+ +G D++++Q+EEVWTGVTY L+A M+ G V E + TA G+YRTV
Sbjct: 61 FEDGFMGAVNGMRADGTVDKNSLQAEEVWTGVTYALAATMIQHGMVREGFRTAEGIYRTV 120
Query: 72 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
YE+ GLGF+TPE +T +KT+RS GYMR L+++++ AY K
Sbjct: 121 YEKWGLGFQTPEAITANKTFRSRGYMRPLSIWSIYHAYSK 160
>gi|242015137|ref|XP_002428230.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212512791|gb|EEB15492.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 812
Score = 122 bits (305), Expect = 4e-26, Method: Composition-based stats.
Identities = 52/95 (54%), Positives = 71/95 (74%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G++G VNGM PNG+ D +QSEE+W GV Y L++ M++EG V+E W TA G+Y TVY+R
Sbjct: 712 GNLGCVNGMLPNGEIDIHTLQSEEIWVGVVYALASLMIFEGMVEEGWKTARGIYETVYDR 771
Query: 75 TGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAY 109
G+GFETPE L K YR+ GYMR L++++MQ A+
Sbjct: 772 IGMGFETPEALRESKKYRAIGYMRPLSIWSMQLAW 806
>gi|156356077|ref|XP_001623757.1| predicted protein [Nematostella vectensis]
gi|156210486|gb|EDO31657.1| predicted protein [Nematostella vectensis]
Length = 783
Score = 120 bits (302), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 55/104 (52%), Positives = 77/104 (74%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
F+ G+MGAVNG++P+G D S++Q+EEVWTGVTY ++A+M+ EG VDE + TA G+Y T
Sbjct: 676 FQEGTMGAVNGIRPDGQLDTSSLQAEEVWTGVTYAVAASMIQEGLVDEGFKTASGIYNTC 735
Query: 72 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 115
+ER G+ F+TPE + + YRS YMR L+++AMQ A K K K
Sbjct: 736 FERLGMNFQTPEAIVANGNYRSLAYMRPLSIWAMQWALEKRKNK 779
>gi|156554493|ref|XP_001604835.1| PREDICTED: non-lysosomal glucosylceramidase-like [Nasonia
vitripennis]
Length = 826
Score = 119 bits (299), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 55/98 (56%), Positives = 73/98 (74%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
++ G GAVNG P G D S IQSEE+WTGV Y L+A +++EG ++EA+ TA G+YRTV
Sbjct: 704 YKNGQQGAVNGFSPTGSIDYSCIQSEEMWTGVAYGLAALLIHEGMIEEAFRTAEGVYRTV 763
Query: 72 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAY 109
YE+ G+GFETPE L K YR+ GYMR L+++AMQ A+
Sbjct: 764 YEKIGMGFETPEALYEHKVYRAIGYMRPLSIWAMQHAW 801
>gi|449275451|gb|EMC84314.1| Non-lysosomal glucosylceramidase [Columba livia]
Length = 692
Score = 118 bits (295), Expect = 6e-25, Method: Composition-based stats.
Identities = 53/98 (54%), Positives = 70/98 (71%)
Query: 11 GFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
GF G+MGAVNGM+P+G D S++QS EVW GV Y L+A M+ EG V+E + TA G YRT
Sbjct: 574 GFAGGTMGAVNGMRPSGVPDTSSVQSNEVWVGVVYALAATMIQEGLVEEGFRTAEGCYRT 633
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
V+E+ G+ F+TPE K YRS YMR L++++MQ A
Sbjct: 634 VWEQLGMAFQTPEAYREKKVYRSLAYMRPLSIWSMQLA 671
>gi|156333782|ref|XP_001619412.1| hypothetical protein NEMVEDRAFT_v1g224207 [Nematostella vectensis]
gi|156202556|gb|EDO27312.1| predicted protein [Nematostella vectensis]
Length = 173
Score = 117 bits (293), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 54/101 (53%), Positives = 75/101 (74%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G+MGAVNG++P+G D S++Q+EEVWTGVTY ++A+M+ EG VDE + TA G+Y T +ER
Sbjct: 52 GTMGAVNGIRPDGQLDTSSLQAEEVWTGVTYAVAASMIQEGLVDEGFKTASGIYNTCFER 111
Query: 75 TGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 115
G+ F+TPE + + YRS YMR L+++AMQ A K K K
Sbjct: 112 LGMNFQTPEAIVANGNYRSLAYMRPLSIWAMQWALEKRKNK 152
>gi|328709489|ref|XP_001943760.2| PREDICTED: non-lysosomal glucosylceramidase-like [Acyrthosiphon
pisum]
Length = 776
Score = 114 bits (286), Expect = 6e-24, Method: Composition-based stats.
Identities = 51/98 (52%), Positives = 68/98 (69%)
Query: 11 GFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
GF G MGAVNGM P+G+ D ++QSEEVWTGVTY LS+ M+ G DE + TA G+Y T
Sbjct: 672 GFNDGKMGAVNGMMPDGNPDTFSLQSEEVWTGVTYALSSLMIAHGLRDEGFNTAKGIYNT 731
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
VY G+ ++TPE + YRS GYMR L+++++Q A
Sbjct: 732 VYNNIGMAYQTPEAIYSKNAYRSVGYMRPLSIWSIQTA 769
>gi|432110769|gb|ELK34246.1| Non-lysosomal glucosylceramidase [Myotis davidii]
Length = 868
Score = 114 bits (285), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 54/101 (53%), Positives = 71/101 (70%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P G DRS++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 751 NVQAFAGGTMGAVNGMQPQGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 810
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
YRTV+ER GLGF+TPE + +RS YMR L+++AMQ A
Sbjct: 811 YRTVWERLGLGFQTPEAYCQKRVFRSLAYMRPLSIWAMQLA 851
>gi|169642399|gb|AAI60640.1| LOC100145361 protein [Xenopus (Silurana) tropicalis]
Length = 571
Score = 113 bits (283), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 51/99 (51%), Positives = 70/99 (70%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G MGAVNGM+P+G D S++QS+EVW GV Y L+A M++EG V E + TA G
Sbjct: 454 NVKQFADGQMGAVNGMRPDGTTDTSSVQSDEVWIGVVYGLAATMIHEGLVQEGFATAEGC 513
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
YRTV+ER G+ F+TPE + +RS YMR L+++AMQ
Sbjct: 514 YRTVWERLGMSFQTPEAYCEKQVFRSLAYMRPLSIWAMQ 552
>gi|395515264|ref|XP_003761826.1| PREDICTED: non-lysosomal glucosylceramidase [Sarcophilus harrisii]
Length = 954
Score = 113 bits (283), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 53/101 (52%), Positives = 71/101 (70%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ +GF G+MGAVNGM P+G D S++QS+EVW GV Y L+A M+ EG V E + TA G
Sbjct: 798 NVQGFAGGAMGAVNGMHPDGVPDTSSVQSDEVWVGVVYGLAATMIQEGLVQEGFLTAEGC 857
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
YRTV+ER G+ F+TPE + +RS YMR L+++AMQ A
Sbjct: 858 YRTVWERLGMAFQTPEAYCQRRVFRSLAYMRPLSIWAMQLA 898
>gi|195436933|ref|XP_002066400.1| GK18271 [Drosophila willistoni]
gi|194162485|gb|EDW77386.1| GK18271 [Drosophila willistoni]
Length = 955
Score = 113 bits (282), Expect = 1e-23, Method: Composition-based stats.
Identities = 56/116 (48%), Positives = 76/116 (65%), Gaps = 7/116 (6%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPN-------GDRDRSAIQSEEVWTGVTYLLSAAMLY 53
+K+ + GF G++GA NG N G D S IQ+EEVW GV Y L+A M+
Sbjct: 832 LKRIYDNNVMGFHDGNIGAANGFIANVSEPSKAGHVDNSNIQAEEVWPGVVYALAATMIQ 891
Query: 54 EGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAY 109
EG DEA+ TAGG+Y+T+ +R G+ FETPE L G+K YRS GYMR L++++MQ A+
Sbjct: 892 EGMFDEAFQTAGGMYKTISQRIGMNFETPEALYGEKRYRSIGYMRPLSIWSMQVAW 947
>gi|260813884|ref|XP_002601646.1| hypothetical protein BRAFLDRAFT_124315 [Branchiostoma floridae]
gi|229286945|gb|EEN57658.1| hypothetical protein BRAFLDRAFT_124315 [Branchiostoma floridae]
Length = 871
Score = 113 bits (282), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 52/106 (49%), Positives = 73/106 (68%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 60
++K + F G+MGAVNGM+PNG +D +++QSEEVWTG+TY L+A+M+ EG V+E
Sbjct: 752 LRKVYDFNVMQFHQGTMGAVNGMRPNGKKDLTSMQSEEVWTGITYALAASMIQEGMVEEG 811
Query: 61 WTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
+ TA G+Y Y + G F+TPE +RS GYMR LA++AMQ
Sbjct: 812 FQTAYGVYNMCYLQCGFAFQTPEAYLERDFFRSLGYMRPLAIWAMQ 857
>gi|224090236|ref|XP_002190896.1| PREDICTED: non-lysosomal glucosylceramidase [Taeniopygia guttata]
Length = 826
Score = 112 bits (281), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 53/97 (54%), Positives = 69/97 (71%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
F G+MGAVNGM+P+G D S++QS EVW GV Y L+A M+ EG V+E + TA G YRTV
Sbjct: 711 FAGGTMGAVNGMRPDGVPDTSSVQSNEVWVGVVYSLAATMIQEGMVEEGFRTAEGCYRTV 770
Query: 72 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
+ER G+ F+TPE K YRS YMR L++++MQ A
Sbjct: 771 WERLGMAFQTPEAYREKKVYRSLAYMRPLSIWSMQLA 807
>gi|334333160|ref|XP_003341682.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal
glucosylceramidase-like [Monodelphis domestica]
Length = 950
Score = 112 bits (281), Expect = 2e-23, Method: Composition-based stats.
Identities = 52/101 (51%), Positives = 72/101 (71%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ +GF G+MGAVNGM+P+G D S++QS+EVW GV Y L+A M+ EG + E + TA G
Sbjct: 794 NVQGFAGGAMGAVNGMQPDGIPDTSSVQSDEVWVGVVYGLAATMIQEGLIQEGFRTAEGC 853
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
YRTV+ER G+ F+TPE + +RS YMR L+++AMQ A
Sbjct: 854 YRTVWERLGMAFQTPEAYCQHRVFRSLAYMRPLSIWAMQLA 894
>gi|198424051|ref|XP_002127036.1| PREDICTED: similar to Non-lysosomal glucosylceramidase (NLGase)
(Glucosylceramidase 2) (Beta-glucocerebrosidase 2)
(Beta-glucosidase 2) [Ciona intestinalis]
Length = 861
Score = 112 bits (281), Expect = 2e-23, Method: Composition-based stats.
Identities = 48/97 (49%), Positives = 72/97 (74%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
++ G GAVNGM+P+G D S++QS+EVWTGVTY L+A M+++G + E + TA G+YR+
Sbjct: 759 YDGGRHGAVNGMRPHGAVDTSSVQSDEVWTGVTYALAATMIHKGMLKEGFRTASGIYRSC 818
Query: 72 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
++R G+ F+TPE + + TYRS YMR L+++ MQ A
Sbjct: 819 WQRYGMAFQTPEAMRKNHTYRSLAYMRPLSIWGMQYA 855
>gi|405977906|gb|EKC42333.1| Non-lysosomal glucosylceramidase [Crassostrea gigas]
Length = 1464
Score = 112 bits (280), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 53/106 (50%), Positives = 71/106 (66%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 60
+KK + FE G+MGA+NG +P+G +D S+ QSEE W GVTY L+A M+ EG +D+A
Sbjct: 1346 LKKIFENNVMMFEGGNMGAINGTRPDGSKDISSCQSEEFWVGVTYGLAANMIQEGLLDQA 1405
Query: 61 WTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
+ TA G Y +E GL F+TPE D YRS GYMR LA+++MQ
Sbjct: 1406 FQTAWGAYHVCWEWYGLAFQTPEAYMTDNIYRSLGYMRPLAIWSMQ 1451
>gi|73971791|ref|XP_531991.2| PREDICTED: non-lysosomal glucosylceramidase [Canis lupus familiaris]
Length = 1136
Score = 112 bits (280), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 52/98 (53%), Positives = 69/98 (70%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ R F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 1000 NVRAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 1059
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
YRTV+ER GL F+TPE + +RS YMR L+++AM
Sbjct: 1060 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIWAM 1097
>gi|351707038|gb|EHB09957.1| Non-lysosomal glucosylceramidase [Heterocephalus glaber]
Length = 922
Score = 112 bits (280), Expect = 3e-23, Method: Composition-based stats.
Identities = 55/108 (50%), Positives = 74/108 (68%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 784 NVQSFAGGAMGAVNGMQPHGIPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 843
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 115
YRTV+ER GL F+TPE + +RS YMR L+++AMQ A + K K
Sbjct: 844 YRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIWAMQLALQQQKHK 891
>gi|170036781|ref|XP_001846240.1| bile acid beta-glucosidase [Culex quinquefasciatus]
gi|167879683|gb|EDS43066.1| bile acid beta-glucosidase [Culex quinquefasciatus]
Length = 913
Score = 112 bits (280), Expect = 3e-23, Method: Composition-based stats.
Identities = 54/108 (50%), Positives = 76/108 (70%), Gaps = 6/108 (5%)
Query: 12 FEAGSMGAVNGMKP------NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 65
F G+MGAVNG P +G D S +Q+EEVWTGVTY L++ M++EG EA+ TAG
Sbjct: 802 FCGGNMGAVNGYIPSSQPNKDGRADSSTLQAEEVWTGVTYALASTMIHEGMFTEAFQTAG 861
Query: 66 GLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 113
GLY+T+ E+ G+ FETPE L ++ YR+ GYMR L++++MQ A+ + K
Sbjct: 862 GLYQTLSEKIGMSFETPEALYAERHYRAIGYMRPLSIWSMQTAWEQKK 909
>gi|301628717|ref|XP_002943495.1| PREDICTED: non-lysosomal glucosylceramidase [Xenopus (Silurana)
tropicalis]
Length = 692
Score = 112 bits (279), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 51/99 (51%), Positives = 70/99 (70%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G MGAVNGM+P+G D S++QS+EVW GV Y L+A M++EG V E + TA G
Sbjct: 575 NVKQFADGQMGAVNGMRPDGTTDTSSVQSDEVWIGVVYGLAATMIHEGLVQEGFATAEGC 634
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
YRTV+ER G+ F+TPE + +RS YMR L+++AMQ
Sbjct: 635 YRTVWERLGMSFQTPEAYCEKQVFRSLAYMRPLSIWAMQ 673
>gi|47212590|emb|CAG12815.1| unnamed protein product [Tetraodon nigroviridis]
Length = 777
Score = 112 bits (279), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 54/108 (50%), Positives = 70/108 (64%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 60
+K + F G MGAVNGM+P G DRS++QS+EVW GV Y L+A M++EG ++E
Sbjct: 668 LKSIFDLNVMSFAGGQMGAVNGMRPEGVPDRSSVQSDEVWIGVVYGLAATMIHEGMLEEG 727
Query: 61 WTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
TA G YRTV+ER G+ F+TPE YRS YMR L ++AMQ A
Sbjct: 728 MRTAEGCYRTVWERLGMAFQTPEAYCEKGIYRSLAYMRPLCIWAMQLA 775
>gi|326934726|ref|XP_003213436.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Meleagris gallopavo]
Length = 639
Score = 112 bits (279), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 53/108 (49%), Positives = 70/108 (64%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 60
+K + F G+MGAVNGM+P+G D S++QS EVW GV Y L+A M+ EG V E
Sbjct: 511 LKTIFEKNVMSFAGGTMGAVNGMRPDGVPDTSSVQSSEVWVGVVYALAATMIQEGLVQEG 570
Query: 61 WTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
+ TA G YRTV+E G+ F+TPE K YRS YMR L++++MQ A
Sbjct: 571 FHTAEGCYRTVWENLGMAFQTPEAYCEKKVYRSLAYMRPLSIWSMQLA 618
>gi|432920219|ref|XP_004079895.1| PREDICTED: non-lysosomal glucosylceramidase-like [Oryzias latipes]
Length = 858
Score = 112 bits (279), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 53/98 (54%), Positives = 67/98 (68%)
Query: 11 GFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G MGAVNGM+P G DRS++QS+EVW GV Y L+A M++EG +E TA G YRT
Sbjct: 741 AFAGGQMGAVNGMRPEGVPDRSSVQSDEVWIGVVYGLAATMIHEGMREEGMHTAEGCYRT 800
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
V+ER G+ F+TPE YRS YMR L+++AMQ A
Sbjct: 801 VWERLGMAFQTPEAYCEKNIYRSLAYMRPLSIWAMQLA 838
>gi|410925034|ref|XP_003975986.1| PREDICTED: non-lysosomal glucosylceramidase-like [Takifugu
rubripes]
Length = 827
Score = 112 bits (279), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 53/97 (54%), Positives = 68/97 (70%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
F G MGAVNGM+P G DRS++QS+EVW GV Y L+A M++EG ++E TA G YRTV
Sbjct: 721 FAGGQMGAVNGMRPEGVPDRSSVQSDEVWIGVVYGLAATMIHEGMLEEGMRTAEGCYRTV 780
Query: 72 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
+ER G+ F+TPE YRS YMR L+++AMQ A
Sbjct: 781 WERLGMAFQTPEAYCEKGIYRSLAYMRPLSIWAMQLA 817
>gi|308485740|ref|XP_003105068.1| hypothetical protein CRE_20734 [Caenorhabditis remanei]
gi|308257013|gb|EFP00966.1| hypothetical protein CRE_20734 [Caenorhabditis remanei]
Length = 933
Score = 111 bits (278), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 52/101 (51%), Positives = 73/101 (72%)
Query: 6 RASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 65
R + F G MGAVNGMKPNG DR IQ++E+WTGVTY +++ ++ +G V++A+ TA
Sbjct: 765 RLNVCRFGNGQMGAVNGMKPNGVVDREYIQADEMWTGVTYAVASLLIQQGEVEKAFHTAS 824
Query: 66 GLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G Y T +E+TGL ++TPE L K YR+ GYMR L+++AMQ
Sbjct: 825 GSYLTCFEQTGLQYQTPEALYESKFYRAIGYMRPLSIWAMQ 865
>gi|125821356|ref|XP_687652.2| PREDICTED: non-lysosomal glucosylceramidase [Danio rerio]
Length = 851
Score = 111 bits (278), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 53/97 (54%), Positives = 67/97 (69%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
F G MGAVNGM+P G DRS++QS+EVW GV Y L+A M++EG V+E TA G YR V
Sbjct: 741 FAGGQMGAVNGMRPEGVPDRSSVQSDEVWVGVVYGLAATMIHEGMVEEGLRTAEGCYRAV 800
Query: 72 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
+ER G+ F+TPE YRS YMR L+++AMQ A
Sbjct: 801 WERMGMAFQTPEAYCEKGIYRSLAYMRPLSIWAMQLA 837
>gi|312382731|gb|EFR28085.1| hypothetical protein AND_04399 [Anopheles darlingi]
Length = 777
Score = 111 bits (278), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 55/110 (50%), Positives = 77/110 (70%), Gaps = 6/110 (5%)
Query: 12 FEAGSMGAVNGMKPN------GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 65
F G++GAVNG PN G D S IQ EEVWTGVTY L+A M++EG +EA+ TAG
Sbjct: 666 FCGGNLGAVNGYVPNAQPNKEGRPDVSNIQGEEVWTGVTYALAATMIHEGMFEEAFQTAG 725
Query: 66 GLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 115
GLYR++ E+ G+ FETPE + ++ YR+ GYMR L++++MQ A+ K++
Sbjct: 726 GLYRSLSEKIGMNFETPEAVYAERHYRAIGYMRPLSIWSMQTAWELRKLR 775
>gi|363744064|ref|XP_003642968.1| PREDICTED: non-lysosomal glucosylceramidase-like [Gallus gallus]
Length = 853
Score = 111 bits (277), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 53/108 (49%), Positives = 69/108 (63%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 60
+K + F G MGAVNGM+P+G D S++QS EVW GV Y L+A M+ EG V E
Sbjct: 725 LKTIFEKNVMSFAGGKMGAVNGMRPDGVPDTSSVQSSEVWVGVVYALAATMIQEGLVQEG 784
Query: 61 WTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
+ TA G YRTV+E G+ F+TPE K YRS YMR L++++MQ A
Sbjct: 785 FHTAEGCYRTVWEHLGMAFQTPEAYCEKKVYRSLAYMRPLSIWSMQLA 832
>gi|443709232|gb|ELU03988.1| hypothetical protein CAPTEDRAFT_186839 [Capitella teleta]
Length = 802
Score = 111 bits (277), Expect = 7e-23, Method: Composition-based stats.
Identities = 50/100 (50%), Positives = 69/100 (69%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
F+ G MGA+NGM+PN +D ++ QS+E WTGVTY L A M+ G +D+ + TA G Y T
Sbjct: 692 FDGGRMGAINGMRPNAKKDVTSCQSDEFWTGVTYALGATMIQVGMIDKGFQTAYGAYHTC 751
Query: 72 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
+ER GL F+TPE ++ +RS GYMR LA++A+Q A K
Sbjct: 752 WERYGLAFQTPEAYFDNRRFRSLGYMRPLAIWAIQHAVEK 791
>gi|195033785|ref|XP_001988762.1| GH11342 [Drosophila grimshawi]
gi|193904762|gb|EDW03629.1| GH11342 [Drosophila grimshawi]
Length = 869
Score = 111 bits (277), Expect = 7e-23, Method: Composition-based stats.
Identities = 55/116 (47%), Positives = 76/116 (65%), Gaps = 7/116 (6%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPN-------GDRDRSAIQSEEVWTGVTYLLSAAMLY 53
+K+ + GF G++GA NG N G D S IQ+EEVW GV Y L+A M+
Sbjct: 746 LKRIYDNNVMGFHEGNIGAANGFIANANEPSKPGHVDNSNIQAEEVWPGVVYALAATMIQ 805
Query: 54 EGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAY 109
EG +EA+ TAGG+Y+T+ +R G+ FETPE L G+K YRS GYMR L++++MQ A+
Sbjct: 806 EGMFEEAFQTAGGMYKTISQRIGMNFETPEALYGEKRYRSIGYMRPLSIWSMQVAW 861
>gi|194760205|ref|XP_001962332.1| GF15415 [Drosophila ananassae]
gi|190616029|gb|EDV31553.1| GF15415 [Drosophila ananassae]
Length = 894
Score = 110 bits (276), Expect = 9e-23, Method: Composition-based stats.
Identities = 56/115 (48%), Positives = 75/115 (65%), Gaps = 7/115 (6%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPN-------GDRDRSAIQSEEVWTGVTYLLSAAMLY 53
+K+ + GF G++GA NG N G D S IQSEEVW GV Y L+A M+
Sbjct: 771 LKRIYDNNVMGFHDGNIGAANGFIANASDPSKPGHVDNSNIQSEEVWPGVVYALAATMIQ 830
Query: 54 EGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
EG +EA+ TAGG+Y+T+ +R G+ FETPE L G+K YRS GYMR L++++MQ A
Sbjct: 831 EGMFEEAFQTAGGMYKTISQRIGMNFETPEALYGEKRYRSIGYMRPLSIWSMQVA 885
>gi|195115764|ref|XP_002002426.1| GI12858 [Drosophila mojavensis]
gi|193913001|gb|EDW11868.1| GI12858 [Drosophila mojavensis]
Length = 877
Score = 110 bits (276), Expect = 9e-23, Method: Composition-based stats.
Identities = 55/116 (47%), Positives = 76/116 (65%), Gaps = 7/116 (6%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPN-------GDRDRSAIQSEEVWTGVTYLLSAAMLY 53
+K+ + GF G++GA NG N G D S IQ+EEVW GV Y L+A M+
Sbjct: 754 LKRIYDNNVMGFHDGNIGAANGFIANVSEPSKPGHVDNSNIQAEEVWPGVVYALAATMIQ 813
Query: 54 EGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAY 109
EG DEA+ TAGG+Y+T+ +R G+ +ETPE L G+K YRS GYMR L++++MQ A+
Sbjct: 814 EGMFDEAFQTAGGMYKTLSQRIGMNYETPEALYGEKRYRSIGYMRPLSIWSMQVAW 869
>gi|198473190|ref|XP_002133205.1| GA28784 [Drosophila pseudoobscura pseudoobscura]
gi|198139343|gb|EDY70607.1| GA28784 [Drosophila pseudoobscura pseudoobscura]
Length = 865
Score = 110 bits (275), Expect = 1e-22, Method: Composition-based stats.
Identities = 55/105 (52%), Positives = 71/105 (67%), Gaps = 7/105 (6%)
Query: 11 GFEAGSMGAVNGMKPN-------GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTT 63
GF G++GA NG N G D S IQ+EEVW GV Y L+A M+ EG +EA+ T
Sbjct: 752 GFHEGNIGAANGFIANAADPSKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQT 811
Query: 64 AGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
AGG+Y+T+ ER G+ FETPE L G+K YRS GYMR L++++MQ A
Sbjct: 812 AGGMYKTLSERIGMNFETPEALYGEKRYRSIGYMRPLSIWSMQVA 856
>gi|444729905|gb|ELW70308.1| Non-lysosomal glucosylceramidase [Tupaia chinensis]
Length = 279
Score = 110 bits (274), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 51/98 (52%), Positives = 69/98 (70%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 141 NVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGC 200
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
YRTV+ER GL F+TPE + +RS YMR L+++AM
Sbjct: 201 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIWAM 238
>gi|195164866|ref|XP_002023267.1| GL21266 [Drosophila persimilis]
gi|194105352|gb|EDW27395.1| GL21266 [Drosophila persimilis]
Length = 867
Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats.
Identities = 55/105 (52%), Positives = 71/105 (67%), Gaps = 7/105 (6%)
Query: 11 GFEAGSMGAVNGMKPN-------GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTT 63
GF G++GA NG N G D S IQ+EEVW GV Y L+A M+ EG +EA+ T
Sbjct: 754 GFHEGNIGAANGFIANAADPSKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQT 813
Query: 64 AGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
AGG+Y+T+ ER G+ FETPE L G+K YRS GYMR L++++MQ A
Sbjct: 814 AGGMYKTLSERIGMNFETPEALYGEKRYRSIGYMRPLSIWSMQVA 858
>gi|288558766|gb|ADC53514.1| MIP16949p [Drosophila melanogaster]
Length = 432
Score = 110 bits (274), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 54/113 (47%), Positives = 74/113 (65%), Gaps = 7/113 (6%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPN-------GDRDRSAIQSEEVWTGVTYLLSAAMLY 53
+K+ + GF G++GA NG N G D S IQ+EEVW GV Y L+A M+
Sbjct: 309 LKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVDNSNIQAEEVWPGVVYALAATMIQ 368
Query: 54 EGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
EG +EA+ TAGG+Y+T+ +R G+ FETPE L G+K YRS GYMR L++++MQ
Sbjct: 369 EGMFEEAFQTAGGMYKTLSQRIGMNFETPEALYGEKRYRSIGYMRPLSIWSMQ 421
>gi|195397814|ref|XP_002057523.1| GJ18180 [Drosophila virilis]
gi|194141177|gb|EDW57596.1| GJ18180 [Drosophila virilis]
Length = 869
Score = 110 bits (274), Expect = 2e-22, Method: Composition-based stats.
Identities = 54/116 (46%), Positives = 76/116 (65%), Gaps = 7/116 (6%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPN-------GDRDRSAIQSEEVWTGVTYLLSAAMLY 53
+K+ + GF G++GA NG N G D S IQ+EEVW GV Y L+A M+
Sbjct: 746 LKRIYDNNVMGFHEGNIGAANGFIANASEPSKPGHVDNSNIQAEEVWPGVVYALAATMIQ 805
Query: 54 EGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAY 109
EG +EA+ TAGG+Y+T+ +R G+ +ETPE L G+K YRS GYMR L++++MQ A+
Sbjct: 806 EGMFEEAFQTAGGMYKTISQRIGMNYETPEALYGEKRYRSIGYMRPLSIWSMQVAW 861
>gi|443710865|gb|ELU04898.1| hypothetical protein CAPTEDRAFT_216398 [Capitella teleta]
Length = 449
Score = 109 bits (273), Expect = 2e-22, Method: Composition-based stats.
Identities = 50/100 (50%), Positives = 69/100 (69%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
F+ G MGA+NGM+PN +D ++ QS+E WTGVTY L A M+ G +D+ + TA G Y T
Sbjct: 339 FDGGRMGAINGMRPNAKKDVTSCQSDEFWTGVTYALGATMIQVGMIDKGFQTAYGAYHTC 398
Query: 72 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
+ER GL F+TPE ++ +RS GYMR LA++A+Q A K
Sbjct: 399 WERYGLAFQTPEAYFDNRRFRSLGYMRPLAIWAIQHAVEK 438
>gi|195474003|ref|XP_002089281.1| GE19029 [Drosophila yakuba]
gi|194175382|gb|EDW88993.1| GE19029 [Drosophila yakuba]
Length = 947
Score = 109 bits (273), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 55/115 (47%), Positives = 75/115 (65%), Gaps = 7/115 (6%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPN-------GDRDRSAIQSEEVWTGVTYLLSAAMLY 53
+K+ + GF G++GA NG N G D S IQ+EEVW GV Y L+A M+
Sbjct: 824 LKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDNSNIQAEEVWPGVVYALAATMIQ 883
Query: 54 EGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
EG +EA+ TAGG+Y+T+ +R G+ FETPE L G+K YRS GYMR L++++MQ A
Sbjct: 884 EGMFEEAFQTAGGMYKTLSQRIGMNFETPEALYGEKRYRSIGYMRPLSIWSMQVA 938
>gi|194860149|ref|XP_001969520.1| GG23916 [Drosophila erecta]
gi|190661387|gb|EDV58579.1| GG23916 [Drosophila erecta]
Length = 947
Score = 109 bits (273), Expect = 2e-22, Method: Composition-based stats.
Identities = 55/115 (47%), Positives = 75/115 (65%), Gaps = 7/115 (6%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPN-------GDRDRSAIQSEEVWTGVTYLLSAAMLY 53
+K+ + GF G++GA NG N G D S IQ+EEVW GV Y L+A M+
Sbjct: 824 LKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDNSNIQAEEVWPGVVYALAATMIQ 883
Query: 54 EGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
EG +EA+ TAGG+Y+T+ +R G+ FETPE L G+K YRS GYMR L++++MQ A
Sbjct: 884 EGMFEEAFQTAGGMYKTLSQRIGMNFETPEALYGEKRYRSIGYMRPLSIWSMQVA 938
>gi|444729906|gb|ELW70309.1| Non-lysosomal glucosylceramidase [Tupaia chinensis]
Length = 1001
Score = 109 bits (273), Expect = 2e-22, Method: Composition-based stats.
Identities = 51/98 (52%), Positives = 69/98 (70%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 863 NVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGC 922
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
YRTV+ER GL F+TPE + +RS YMR L+++AM
Sbjct: 923 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIWAM 960
>gi|268569852|ref|XP_002640631.1| Hypothetical protein CBG08749 [Caenorhabditis briggsae]
Length = 899
Score = 109 bits (273), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 73/101 (72%)
Query: 6 RASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 65
R + F G MGAVNGMKP+G DR IQ++E+WTGVTY +++ ++ +G V++A+ TA
Sbjct: 737 RLNVCQFGNGQMGAVNGMKPSGVVDREYIQADEMWTGVTYAVASLLIQQGEVEKAFHTAS 796
Query: 66 GLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G Y T +E+TGL ++TPE L K YR+ GYMR L+++AMQ
Sbjct: 797 GSYLTCFEQTGLQYQTPEALYESKFYRAIGYMRPLSIWAMQ 837
>gi|157119417|ref|XP_001653371.1| bile acid beta-glucosidase, putative [Aedes aegypti]
gi|108883154|gb|EAT47379.1| AAEL001478-PA, partial [Aedes aegypti]
Length = 898
Score = 109 bits (273), Expect = 2e-22, Method: Composition-based stats.
Identities = 52/108 (48%), Positives = 75/108 (69%), Gaps = 6/108 (5%)
Query: 12 FEAGSMGAVNGMKP------NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 65
F G+MGAVNG P +G D S +Q EEVWTGVTY L++ M++EG EA+ TAG
Sbjct: 787 FCGGNMGAVNGYVPSTQPNKDGRADASTVQGEEVWTGVTYALASTMIHEGMFTEAFQTAG 846
Query: 66 GLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 113
G+Y+T+ E+ G+ FETPE L ++ YR+ GYMR L+++++Q A+ + K
Sbjct: 847 GMYQTLSEKIGMNFETPEALYAERHYRAIGYMRPLSIWSLQTAWEQKK 894
>gi|195338407|ref|XP_002035816.1| GM15538 [Drosophila sechellia]
gi|194129696|gb|EDW51739.1| GM15538 [Drosophila sechellia]
Length = 947
Score = 109 bits (273), Expect = 2e-22, Method: Composition-based stats.
Identities = 55/115 (47%), Positives = 75/115 (65%), Gaps = 7/115 (6%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPN-------GDRDRSAIQSEEVWTGVTYLLSAAMLY 53
+K+ + GF G++GA NG N G D S IQ+EEVW GV Y L+A M+
Sbjct: 824 LKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDNSNIQAEEVWPGVVYALAATMIQ 883
Query: 54 EGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
EG +EA+ TAGG+Y+T+ +R G+ FETPE L G+K YRS GYMR L++++MQ A
Sbjct: 884 EGMFEEAFQTAGGMYKTLSQRIGMNFETPEALYGEKRYRSIGYMRPLSIWSMQVA 938
>gi|45552032|ref|NP_788055.2| CG33090 [Drosophila melanogaster]
gi|74876618|sp|Q7KT91.1|C3390_DROME RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase
gi|45445134|gb|AAO41192.2| CG33090 [Drosophila melanogaster]
gi|201065481|gb|ACH92150.1| FI02015p [Drosophila melanogaster]
Length = 948
Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats.
Identities = 55/115 (47%), Positives = 75/115 (65%), Gaps = 7/115 (6%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPN-------GDRDRSAIQSEEVWTGVTYLLSAAMLY 53
+K+ + GF G++GA NG N G D S IQ+EEVW GV Y L+A M+
Sbjct: 825 LKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVDNSNIQAEEVWPGVVYALAATMIQ 884
Query: 54 EGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
EG +EA+ TAGG+Y+T+ +R G+ FETPE L G+K YRS GYMR L++++MQ A
Sbjct: 885 EGMFEEAFQTAGGMYKTLSQRIGMNFETPEALYGEKRYRSIGYMRPLSIWSMQVA 939
>gi|71992715|ref|NP_001021680.1| Protein HPO-13, isoform a [Caenorhabditis elegans]
gi|18376555|emb|CAD21661.1| Protein HPO-13, isoform a [Caenorhabditis elegans]
Length = 959
Score = 108 bits (271), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 72/101 (71%)
Query: 6 RASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 65
R + F G MGAVNGMKP+G DR IQ++E+WTGVTY +++ ++ +G V++A+ TA
Sbjct: 789 RLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWTGVTYAVASLLIQQGEVEKAFHTAS 848
Query: 66 GLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G Y T +E TGL ++TPE L K YR+ GYMR L+++AMQ
Sbjct: 849 GSYLTCFEETGLQYQTPEALYESKFYRAIGYMRPLSIWAMQ 889
>gi|410978559|ref|XP_003995657.1| PREDICTED: non-lysosomal glucosylceramidase [Felis catus]
Length = 917
Score = 108 bits (271), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 50/96 (52%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ R F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 781 NVRAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 840
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 841 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 876
>gi|355689821|gb|AER98956.1| glucosidase, beta 2 [Mustela putorius furo]
Length = 550
Score = 108 bits (271), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 50/96 (52%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ R F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 411 NVRAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 470
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 471 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 506
>gi|348520215|ref|XP_003447624.1| PREDICTED: non-lysosomal glucosylceramidase-like [Oreochromis
niloticus]
Length = 856
Score = 108 bits (270), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 52/108 (48%), Positives = 69/108 (63%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 60
+K + F G MGAVNGM+P G D S++QS+EVW GV Y L+A M++EG +E
Sbjct: 731 LKSVFDLNVMSFAGGQMGAVNGMRPEGVPDHSSVQSDEVWIGVVYGLAATMIHEGMQEEG 790
Query: 61 WTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
TA G YRTV+E+ G+ F+TPE YRS YMR L+++AMQ A
Sbjct: 791 MRTAEGCYRTVWEKLGMAFQTPEAYCEKNIYRSLAYMRPLSIWAMQLA 838
>gi|71992723|ref|NP_001021681.1| Protein HPO-13, isoform b [Caenorhabditis elegans]
gi|50511739|emb|CAH04764.1| Protein HPO-13, isoform b [Caenorhabditis elegans]
Length = 930
Score = 108 bits (270), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 72/101 (71%)
Query: 6 RASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 65
R + F G MGAVNGMKP+G DR IQ++E+WTGVTY +++ ++ +G V++A+ TA
Sbjct: 760 RLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWTGVTYAVASLLIQQGEVEKAFHTAS 819
Query: 66 GLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G Y T +E TGL ++TPE L K YR+ GYMR L+++AMQ
Sbjct: 820 GSYLTCFEETGLQYQTPEALYESKFYRAIGYMRPLSIWAMQ 860
>gi|71992731|ref|NP_001021682.1| Protein HPO-13, isoform c [Caenorhabditis elegans]
gi|15130776|emb|CAC48128.1| Protein HPO-13, isoform c [Caenorhabditis elegans]
Length = 922
Score = 108 bits (270), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 72/101 (71%)
Query: 6 RASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 65
R + F G MGAVNGMKP+G DR IQ++E+WTGVTY +++ ++ +G V++A+ TA
Sbjct: 752 RLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWTGVTYAVASLLIQQGEVEKAFHTAS 811
Query: 66 GLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G Y T +E TGL ++TPE L K YR+ GYMR L+++AMQ
Sbjct: 812 GSYLTCFEETGLQYQTPEALYESKFYRAIGYMRPLSIWAMQ 852
>gi|344251774|gb|EGW07878.1| Non-lysosomal glucosylceramidase [Cricetulus griseus]
Length = 226
Score = 108 bits (270), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 49/96 (51%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 88 NVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTREGFRTAEGC 147
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 148 YRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 183
>gi|195579186|ref|XP_002079443.1| GD23959 [Drosophila simulans]
gi|194191452|gb|EDX05028.1| GD23959 [Drosophila simulans]
Length = 947
Score = 108 bits (269), Expect = 5e-22, Method: Composition-based stats.
Identities = 55/115 (47%), Positives = 74/115 (64%), Gaps = 7/115 (6%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPN-------GDRDRSAIQSEEVWTGVTYLLSAAMLY 53
+K+ + GF G++GA NG N G D S IQ+EEVW GV Y L+A M+
Sbjct: 824 LKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDNSNIQAEEVWPGVVYALAATMIQ 883
Query: 54 EGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
EG +EA+ TAGG+Y+T+ +R G+ FETPE L G+K YRS GYMR L++ +MQ A
Sbjct: 884 EGMFEEAFQTAGGMYKTLSQRIGMNFETPEALYGEKRYRSIGYMRPLSILSMQVA 938
>gi|385648282|ref|NP_001245314.1| glucosidase, beta (bile acid) 2 [Sus scrofa]
Length = 915
Score = 107 bits (268), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 49/96 (51%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 781 NVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYALAATMIQEGLTWEGFRTAEGC 840
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 841 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 876
>gi|341901617|gb|EGT57552.1| hypothetical protein CAEBREN_15139 [Caenorhabditis brenneri]
Length = 925
Score = 107 bits (268), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 72/101 (71%)
Query: 6 RASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 65
R + F G MGAVNGMKP+G DR IQ++E+WTGVTY +++ ++ +G V++A+ TA
Sbjct: 741 RLNVCKFGNGQMGAVNGMKPSGVVDREYIQADEMWTGVTYAVASLLIQQGEVEKAFHTAS 800
Query: 66 GLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G Y T +E+ GL ++TPE L K YR+ GYMR L+++AMQ
Sbjct: 801 GSYLTCFEQAGLQYQTPEALYESKFYRAIGYMRPLSIWAMQ 841
>gi|143018484|sp|Q5M868.2|GBA2_RAT RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
AltName: Full=Beta-glucocerebrosidase 2;
Short=Beta-glucosidase 2; AltName:
Full=Glucosylceramidase 2
Length = 912
Score = 107 bits (268), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 50/103 (48%), Positives = 69/103 (66%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 60
+K + + F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG E
Sbjct: 773 LKTIFELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEG 832
Query: 61 WTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
+ TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 833 FRTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 875
>gi|189217893|ref|NP_001013109.2| non-lysosomal glucosylceramidase [Rattus norvegicus]
gi|149045757|gb|EDL98757.1| glucosidase beta 2 [Rattus norvegicus]
Length = 904
Score = 107 bits (267), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 50/103 (48%), Positives = 69/103 (66%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 60
+K + + F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG E
Sbjct: 773 LKTIFELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEG 832
Query: 61 WTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
+ TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 833 FRTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 875
>gi|281339788|gb|EFB15372.1| hypothetical protein PANDA_007478 [Ailuropoda melanoleuca]
Length = 923
Score = 107 bits (267), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 49/96 (51%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 849 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|301767028|ref|XP_002918945.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Ailuropoda melanoleuca]
Length = 928
Score = 107 bits (267), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 49/96 (51%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 792 NVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 851
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 852 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 887
>gi|339252284|ref|XP_003371365.1| AP-1 complex subunit gamma-1 [Trichinella spiralis]
gi|316968412|gb|EFV52690.1| AP-1 complex subunit gamma-1 [Trichinella spiralis]
Length = 1615
Score = 107 bits (267), Expect = 9e-22, Method: Composition-based stats.
Identities = 45/95 (47%), Positives = 72/95 (75%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
F G++GAVNGM +G++D S++Q++E WTGVTY LSA M+ EG V++ ++TA G+Y++
Sbjct: 1508 FGNGTLGAVNGMLYSGEKDTSSLQADEAWTGVTYFLSAHMISEGFVEQGFSTASGIYKSC 1567
Query: 72 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
+E G+ ++TPE L K +R+ GYMR L+++A+Q
Sbjct: 1568 FESFGMHYQTPEALYEKKWFRAVGYMRPLSIWAIQ 1602
>gi|158299437|ref|XP_319575.4| AGAP008830-PA [Anopheles gambiae str. PEST]
gi|157013519|gb|EAA14848.4| AGAP008830-PA [Anopheles gambiae str. PEST]
Length = 913
Score = 107 bits (266), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 51/110 (46%), Positives = 76/110 (69%), Gaps = 6/110 (5%)
Query: 12 FEAGSMGAVNGMKPNGDRDRS------AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 65
F G +GAVNG P+G ++ +IQ EEVWTGVTY L++ M++EG +EA+ TAG
Sbjct: 802 FCGGQLGAVNGYVPSGQPNKDGRPDTVSIQGEEVWTGVTYALASTMIHEGMFEEAFKTAG 861
Query: 66 GLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 115
GLY+ + ER G+ FETPE + ++ YR+ GYMR L++++MQ A+ K++
Sbjct: 862 GLYQALSERIGMNFETPEAVYAERHYRAIGYMRPLSIWSMQTAWEMKKLR 911
>gi|354485725|ref|XP_003505033.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Cricetulus griseus]
Length = 898
Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats.
Identities = 49/96 (51%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 760 NVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTREGFRTAEGC 819
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 820 YRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 855
>gi|194669510|ref|XP_001790122.1| PREDICTED: non-lysosomal glucosylceramidase [Bos taurus]
gi|297477943|ref|XP_002689726.1| PREDICTED: non-lysosomal glucosylceramidase [Bos taurus]
gi|296484715|tpg|DAA26830.1| TPA: hypothetical protein BOS_8918 [Bos taurus]
Length = 918
Score = 107 bits (266), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 50/98 (51%), Positives = 67/98 (68%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P G D S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 779 NVQAFAGGAMGAVNGMQPCGVPDTSSVQSDEVWVGVVYGLAATMIQEGLTREGFRTAEGC 838
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
YRTV+ER GL F+TPE + +RS YMR L+++AM
Sbjct: 839 YRTVWERLGLAFQTPEAYCQRRVFRSLAYMRPLSIWAM 876
>gi|333898016|ref|YP_004471890.1| glucosylceramidase [Thermoanaerobacterium xylanolyticum LX-11]
gi|333113281|gb|AEF18218.1| Glucosylceramidase [Thermoanaerobacterium xylanolyticum LX-11]
Length = 806
Score = 107 bits (266), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 53/111 (47%), Positives = 74/111 (66%), Gaps = 2/111 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 60
+KK + FE G MGAVNGM+P+G D S IQ++EVWTGVTY L++ M Y G +EA
Sbjct: 693 LKKIYEFNVMKFENGKMGAVNGMRPDGIVDESDIQAQEVWTGVTYALASFMKYRGMTEEA 752
Query: 61 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAY 109
+ TA G+Y+ Y+++G G F TPE T D YR+ YMR L++++M+ Y
Sbjct: 753 YNTAYGVYKMTYDKSGKGYWFRTPEAWTKDGNYRASMYMRPLSIWSMEVNY 803
>gi|426220232|ref|XP_004004320.1| PREDICTED: non-lysosomal glucosylceramidase [Ovis aries]
Length = 918
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 50/98 (51%), Positives = 67/98 (68%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P G D S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 779 NVQAFAGGAMGAVNGMQPRGVPDTSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGC 838
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
YRTV+ER GL F+TPE + +RS YMR L+++AM
Sbjct: 839 YRTVWERLGLAFQTPEAYCQRRVFRSLAYMRPLSIWAM 876
>gi|440901933|gb|ELR52793.1| Non-lysosomal glucosylceramidase, partial [Bos grunniens mutus]
Length = 927
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 50/98 (51%), Positives = 67/98 (68%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P G D S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 788 NVQAFAGGAMGAVNGMQPRGVPDTSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGC 847
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
YRTV+ER GL F+TPE + +RS YMR L+++AM
Sbjct: 848 YRTVWERLGLAFQTPEAYCQRRVFRSLAYMRPLSIWAM 885
>gi|240120073|ref|NP_766280.2| non-lysosomal glucosylceramidase [Mus musculus]
gi|143018443|sp|Q69ZF3.2|GBA2_MOUSE RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
AltName: Full=Beta-glucocerebrosidase 2;
Short=Beta-glucosidase 2; AltName:
Full=Glucosylceramidase 2
gi|34784135|gb|AAH56935.1| Glucosidase beta 2 [Mus musculus]
Length = 918
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 49/96 (51%), Positives = 66/96 (68%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM P+G DRS++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 780 NVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGC 839
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 840 YRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 875
>gi|148670514|gb|EDL02461.1| glucosidase beta 2, isoform CRA_a [Mus musculus]
Length = 918
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 49/96 (51%), Positives = 66/96 (68%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM P+G DRS++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 780 NVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGC 839
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 840 YRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 875
>gi|26354314|dbj|BAC40785.1| unnamed protein product [Mus musculus]
Length = 918
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 49/96 (51%), Positives = 66/96 (68%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM P+G DRS++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 780 NVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGC 839
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 840 YRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 875
>gi|109111050|ref|XP_001084705.1| PREDICTED: non-lysosomal glucosylceramidase [Macaca mulatta]
Length = 927
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 849 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|403306664|ref|XP_003943843.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Saimiri
boliviensis boliviensis]
Length = 929
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 795 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 854
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 855 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890
>gi|221041110|dbj|BAH12232.1| unnamed protein product [Homo sapiens]
Length = 933
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 795 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 854
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 855 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890
>gi|426361727|ref|XP_004048051.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Gorilla
gorilla gorilla]
Length = 933
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 795 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 854
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 855 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890
>gi|50511011|dbj|BAD32491.1| mKIAA1605 protein [Mus musculus]
Length = 940
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 49/96 (51%), Positives = 66/96 (68%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM P+G DRS++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 802 NVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGC 861
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 862 YRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 897
>gi|355567647|gb|EHH23988.1| Non-lysosomal glucosylceramidase [Macaca mulatta]
gi|355753217|gb|EHH57263.1| Non-lysosomal glucosylceramidase [Macaca fascicularis]
gi|380789499|gb|AFE66625.1| non-lysosomal glucosylceramidase [Macaca mulatta]
gi|384943046|gb|AFI35128.1| non-lysosomal glucosylceramidase [Macaca mulatta]
Length = 927
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 849 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|332831849|ref|XP_001167923.2| PREDICTED: non-lysosomal glucosylceramidase isoform 4 [Pan
troglodytes]
Length = 933
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 795 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 854
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 855 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890
>gi|24308251|ref|NP_065995.1| non-lysosomal glucosylceramidase [Homo sapiens]
gi|143018392|sp|Q9HCG7.2|GBA2_HUMAN RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
AltName: Full=Beta-glucocerebrosidase 2;
Short=Beta-glucosidase 2; AltName:
Full=Glucosylceramidase 2
gi|15030210|gb|AAH11363.1| Glucosidase, beta (bile acid) 2 [Homo sapiens]
gi|16215453|emb|CAC83792.1| bile acid beta-glucosidase [Homo sapiens]
gi|21739906|emb|CAD38976.1| hypothetical protein [Homo sapiens]
gi|119578752|gb|EAW58348.1| glucosidase, beta (bile acid) 2, isoform CRA_d [Homo sapiens]
gi|119578753|gb|EAW58349.1| glucosidase, beta (bile acid) 2, isoform CRA_d [Homo sapiens]
Length = 927
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 849 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|402897018|ref|XP_003911574.1| PREDICTED: non-lysosomal glucosylceramidase [Papio anubis]
Length = 927
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 849 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|397519481|ref|XP_003829887.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Pan
paniscus]
Length = 927
Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 849 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|332228461|ref|XP_003263408.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal glucosylceramidase
[Nomascus leucogenys]
Length = 919
Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 781 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 840
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 841 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 876
>gi|403306662|ref|XP_003943842.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Saimiri
boliviensis boliviensis]
Length = 923
Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 849 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|426361725|ref|XP_004048050.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Gorilla
gorilla gorilla]
Length = 927
Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 849 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|55633173|ref|XP_520565.1| PREDICTED: non-lysosomal glucosylceramidase isoform 6 [Pan
troglodytes]
gi|410227830|gb|JAA11134.1| talin 1 [Pan troglodytes]
Length = 927
Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 849 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|395740303|ref|XP_003777396.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Pongo
abelii]
Length = 933
Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 795 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 854
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 855 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890
>gi|148670516|gb|EDL02463.1| glucosidase beta 2, isoform CRA_c [Mus musculus]
Length = 467
Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 49/96 (51%), Positives = 66/96 (68%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM P+G DRS++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 329 NVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGC 388
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 389 YRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 424
>gi|14042883|dbj|BAB55430.1| unnamed protein product [Homo sapiens]
Length = 927
Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 849 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|297684086|ref|XP_002819683.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Pongo
abelii]
Length = 927
Score = 105 bits (263), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 849 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|194386992|dbj|BAG59862.1| unnamed protein product [Homo sapiens]
Length = 755
Score = 105 bits (263), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 617 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 676
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 677 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 712
>gi|397519483|ref|XP_003829888.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Pan
paniscus]
Length = 755
Score = 105 bits (263), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 617 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 676
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 677 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 712
>gi|390458158|ref|XP_002743170.2| PREDICTED: non-lysosomal glucosylceramidase [Callithrix jacchus]
Length = 1065
Score = 105 bits (263), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 931 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 990
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 991 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 1026
>gi|343959578|dbj|BAK63646.1| bile acid beta-glucosidase [Pan troglodytes]
Length = 514
Score = 105 bits (262), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 376 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 435
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 436 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 471
>gi|196002317|ref|XP_002111026.1| hypothetical protein TRIADDRAFT_54543 [Trichoplax adhaerens]
gi|190586977|gb|EDV27030.1| hypothetical protein TRIADDRAFT_54543 [Trichoplax adhaerens]
Length = 843
Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats.
Identities = 49/98 (50%), Positives = 68/98 (69%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
F G+ GAVNGMKPNG+ D S++QSEEVWTGVTY L+A ++ +G + E + A G+Y T
Sbjct: 733 FMNGTWGAVNGMKPNGEIDTSSVQSEEVWTGVTYALAANLIQQGMIKEGFKVAEGMYDTT 792
Query: 72 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAY 109
+ GLGF+TPE YR GYMR L+++A++ A+
Sbjct: 793 FNVMGLGFQTPEAYYKINRYRCTGYMRPLSIWAIEWAF 830
>gi|395855828|ref|XP_003800351.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal glucosylceramidase
[Otolemur garnettii]
Length = 1147
Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats.
Identities = 49/96 (51%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 1009 NVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGC 1068
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 1069 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 1104
>gi|338720173|ref|XP_001497796.3| PREDICTED: non-lysosomal glucosylceramidase [Equus caballus]
Length = 983
Score = 105 bits (262), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 67/96 (69%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G DR+++QS+EVW GV Y L+A M+ EG E + TA G
Sbjct: 845 NVQAFAGGAMGAVNGMQPHGVPDRTSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGC 904
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 905 YRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 940
>gi|189239183|ref|XP_966847.2| PREDICTED: similar to bile acid beta-glucosidase, putative
[Tribolium castaneum]
gi|270010940|gb|EFA07388.1| hypothetical protein TcasGA2_TC016367 [Tribolium castaneum]
Length = 818
Score = 104 bits (260), Expect = 6e-21, Method: Composition-based stats.
Identities = 53/101 (52%), Positives = 71/101 (70%), Gaps = 1/101 (0%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G MGAVNG +G D IQS+EVWTGVTY L+A+ML EG EA+ TAGG+
Sbjct: 710 NVQSFCDGRMGAVNGFI-DGVIDEFTIQSQEVWTGVTYALAASMLQEGMRTEAFNTAGGM 768
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
++++ ER GL F+TPE L K YR+ GYMR L++++MQ A
Sbjct: 769 FKSMSERFGLSFDTPEALYAAKYYRAIGYMRPLSIWSMQLA 809
>gi|344271640|ref|XP_003407645.1| PREDICTED: non-lysosomal glucosylceramidase [Loxodonta africana]
Length = 967
Score = 103 bits (257), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 47/96 (48%), Positives = 66/96 (68%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + F G+MGAVNGM+P+G D S++QS+EVW GV Y L+A M+ EG + + TA G
Sbjct: 833 NVQAFAGGAMGAVNGMQPHGVPDTSSVQSDEVWVGVVYGLAATMIQEGLTSKGFWTAEGC 892
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 103
YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 893 YRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 928
>gi|168014916|ref|XP_001759997.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688747|gb|EDQ75122.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 970
Score = 102 bits (254), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 48/92 (52%), Positives = 64/92 (69%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G +GAVNGM PNG D + +QS EVWTGVTY L+AAM++EG +D+A+ A G++ +
Sbjct: 813 GHLGAVNGMHPNGKVDETCMQSREVWTGVTYALAAAMIHEGMLDQAFRAAEGVFIAGWSD 872
Query: 75 TGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G F+TPE T D +RS YMR LA++AMQ
Sbjct: 873 LGYWFQTPEAWTMDGYFRSLAYMRPLAIWAMQ 904
>gi|413920603|gb|AFW60535.1| hypothetical protein ZEAMMB73_659971 [Zea mays]
Length = 951
Score = 102 bits (253), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 52/106 (49%), Positives = 70/106 (66%), Gaps = 2/106 (1%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
F+ G+ GA+NGM P+G D SA+QS E+W GVTY L+A+M+ EG V++ + TA G+Y
Sbjct: 795 FKGGTRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAASMIQEGMVEQGFKTAEGVYHAA 854
Query: 72 YERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 115
+ GLG F+TPE D YRS YMR LA++AMQ A K K+
Sbjct: 855 WSPGGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAMQWALSKPKLH 900
>gi|110742221|dbj|BAE99037.1| hypothetical protein [Arabidopsis thaliana]
Length = 957
Score = 101 bits (252), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 53/103 (51%), Positives = 67/103 (65%), Gaps = 2/103 (1%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G GAVNGM PNG D +++QS E+W+GVTY LSA M+ EG V+ A+ TA G+Y + +
Sbjct: 796 GKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMAFQTASGIYEAAWSK 855
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 115
TGLG F+TPE YRS YMR LA++AMQ A K K
Sbjct: 856 TGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQK 898
>gi|356520149|ref|XP_003528727.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal
glucosylceramidase-like [Glycine max]
Length = 944
Score = 101 bits (252), Expect = 5e-20, Method: Composition-based stats.
Identities = 50/94 (53%), Positives = 63/94 (67%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G GAVNGM P+G D S +QS E+W+GVTY L+A M+ E +D A+ TAGG+Y T +
Sbjct: 789 GRRGAVNGMLPDGKVDMSTMQSREIWSGVTYALAATMIQENMIDMAFQTAGGVYETAWSN 848
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
GLG F+TPE T YRS YMR LA++AMQ
Sbjct: 849 NGLGYSFQTPEAWTTKDEYRSLCYMRPLAIWAMQ 882
>gi|255569126|ref|XP_002525532.1| conserved hypothetical protein [Ricinus communis]
gi|223535211|gb|EEF36890.1| conserved hypothetical protein [Ricinus communis]
Length = 948
Score = 101 bits (252), Expect = 6e-20, Method: Composition-based stats.
Identities = 53/102 (51%), Positives = 66/102 (64%), Gaps = 2/102 (1%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G GAVNGM P+G D S +QS E+W GVTY LSA+M+ EG + A+ TA G+Y + +
Sbjct: 790 GKRGAVNGMLPDGKVDMSVMQSREIWPGVTYALSASMIQEGMAEMAFQTASGIYEAAWSQ 849
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 114
GLG F+ PEG D YRS YMR LA++AMQ A K KV
Sbjct: 850 EGLGYSFQIPEGWNTDDQYRSLCYMRPLAIWAMQWALSKPKV 891
>gi|242070593|ref|XP_002450573.1| hypothetical protein SORBIDRAFT_05g007160 [Sorghum bicolor]
gi|241936416|gb|EES09561.1| hypothetical protein SORBIDRAFT_05g007160 [Sorghum bicolor]
Length = 951
Score = 101 bits (251), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 51/106 (48%), Positives = 69/106 (65%), Gaps = 2/106 (1%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
F+ G+ GA+NGM P+G D SA+QS E+W GVTY L+A+M+ EG V++ + TA G+Y
Sbjct: 795 FKDGTRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAASMIQEGMVEQGFKTAEGIYHAA 854
Query: 72 YERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 115
+ GLG F+TPE D YRS YMR LA++AMQ A K+
Sbjct: 855 WSPEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAMQWALSNPKLH 900
>gi|8777421|dbj|BAA97011.1| unnamed protein product [Arabidopsis thaliana]
Length = 928
Score = 101 bits (251), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 53/103 (51%), Positives = 66/103 (64%), Gaps = 2/103 (1%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G GAVNGM PNG D +++QS E+W+GVTY LSA M+ EG V+ A+ TA G+Y +
Sbjct: 767 GKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMAFQTASGIYEAAWSE 826
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 115
TGLG F+TPE YRS YMR LA++AMQ A K K
Sbjct: 827 TGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQK 869
>gi|297739499|emb|CBI29681.3| unnamed protein product [Vitis vinifera]
Length = 949
Score = 100 bits (250), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 50/103 (48%), Positives = 69/103 (66%), Gaps = 2/103 (1%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G GAVNGM P+G D + +QS E+W+GVTY ++A M++EG VD A+ TA G+Y + +
Sbjct: 795 GKRGAVNGMLPDGKVDTTTMQSREIWSGVTYGVAATMIHEGLVDMAFQTASGVYEAAWSQ 854
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 115
GLG F+TPE D YRS YMR LA++AMQ A+ + K+
Sbjct: 855 EGLGYSFQTPESWNTDDQYRSLCYMRPLAIWAMQWAFSQPKLH 897
>gi|302807226|ref|XP_002985326.1| hypothetical protein SELMODRAFT_121820 [Selaginella moellendorffii]
gi|300147154|gb|EFJ13820.1| hypothetical protein SELMODRAFT_121820 [Selaginella moellendorffii]
Length = 983
Score = 100 bits (250), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 68/94 (72%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G +GAVNGM PNG D +A+QS E+W+GVTY +AAM++EG +++A+TTA G+Y +
Sbjct: 826 GKLGAVNGMHPNGRVDETAMQSREIWSGVTYAAAAAMIHEGMLEQAFTTAEGIYNAGWSD 885
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
+G G F+TPE T D +R+ YMR LA++AMQ
Sbjct: 886 SGYGYWFQTPEAWTTDGHFRALTYMRPLAIWAMQ 919
>gi|147812668|emb|CAN61857.1| hypothetical protein VITISV_016690 [Vitis vinifera]
Length = 521
Score = 100 bits (250), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 51/103 (49%), Positives = 69/103 (66%), Gaps = 2/103 (1%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G GAVNGM P+G D SA+QS E+W GVTY ++A M++EG V+ A+ TA G+Y + +
Sbjct: 362 GKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETAFNTASGIYDAAWSQ 421
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 115
GLG F+TPE D+ YRS YMR LA++AMQ A K ++
Sbjct: 422 EGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALSKPELH 464
>gi|302773445|ref|XP_002970140.1| hypothetical protein SELMODRAFT_92385 [Selaginella moellendorffii]
gi|300162651|gb|EFJ29264.1| hypothetical protein SELMODRAFT_92385 [Selaginella moellendorffii]
Length = 983
Score = 100 bits (250), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 68/94 (72%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G +GAVNGM PNG D +A+QS E+W+GVTY +AAM++EG +++A+TTA G+Y +
Sbjct: 826 GKLGAVNGMHPNGRVDETAMQSREIWSGVTYAAAAAMIHEGMLEQAFTTAEGIYNAGWSD 885
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
+G G F+TPE T D +R+ YMR LA++AMQ
Sbjct: 886 SGYGYWFQTPEAWTTDGHFRALTYMRPLAIWAMQ 919
>gi|30695777|ref|NP_199801.2| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|27311753|gb|AAO00842.1| putative protein [Arabidopsis thaliana]
gi|34365725|gb|AAQ65174.1| At5g49900 [Arabidopsis thaliana]
gi|332008486|gb|AED95869.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
Length = 957
Score = 100 bits (250), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 53/103 (51%), Positives = 66/103 (64%), Gaps = 2/103 (1%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G GAVNGM PNG D +++QS E+W+GVTY LSA M+ EG V+ A+ TA G+Y +
Sbjct: 796 GKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMAFQTASGIYEAAWSE 855
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 115
TGLG F+TPE YRS YMR LA++AMQ A K K
Sbjct: 856 TGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQK 898
>gi|359486233|ref|XP_002264575.2| PREDICTED: non-lysosomal glucosylceramidase [Vitis vinifera]
Length = 960
Score = 100 bits (250), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 50/103 (48%), Positives = 69/103 (66%), Gaps = 2/103 (1%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G GAVNGM P+G D + +QS E+W+GVTY ++A M++EG VD A+ TA G+Y + +
Sbjct: 806 GKRGAVNGMLPDGKVDTTTMQSREIWSGVTYGVAATMIHEGLVDMAFQTASGVYEAAWSQ 865
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 115
GLG F+TPE D YRS YMR LA++AMQ A+ + K+
Sbjct: 866 EGLGYSFQTPESWNTDDQYRSLCYMRPLAIWAMQWAFSQPKLH 908
>gi|222423179|dbj|BAH19567.1| AT5G49900 [Arabidopsis thaliana]
Length = 957
Score = 100 bits (250), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 53/103 (51%), Positives = 66/103 (64%), Gaps = 2/103 (1%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G GAVNGM PNG D +++QS E+W+GVTY LSA M+ EG V+ A+ TA G+Y +
Sbjct: 796 GKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMAFQTASGIYEAAWSE 855
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 115
TGLG F+TPE YRS YMR LA++AMQ A K K
Sbjct: 856 TGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQK 898
>gi|225460125|ref|XP_002275782.1| PREDICTED: non-lysosomal glucosylceramidase-like [Vitis vinifera]
Length = 969
Score = 100 bits (250), Expect = 9e-20, Method: Composition-based stats.
Identities = 51/99 (51%), Positives = 67/99 (67%), Gaps = 2/99 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G GAVNGM P+G D SA+QS E+W GVTY ++A M++EG V+ A+ TA G+Y + +
Sbjct: 810 GKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETAFNTASGIYDAAWSQ 869
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
GLG F+TPE D+ YRS YMR LA++AMQ A K
Sbjct: 870 EGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALSK 908
>gi|297741007|emb|CBI31319.3| unnamed protein product [Vitis vinifera]
Length = 953
Score = 100 bits (250), Expect = 9e-20, Method: Composition-based stats.
Identities = 51/99 (51%), Positives = 67/99 (67%), Gaps = 2/99 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G GAVNGM P+G D SA+QS E+W GVTY ++A M++EG V+ A+ TA G+Y + +
Sbjct: 794 GKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETAFNTASGIYDAAWSQ 853
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
GLG F+TPE D+ YRS YMR LA++AMQ A K
Sbjct: 854 EGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALSK 892
>gi|292384324|gb|ADE21415.1| unknown [Helianthus neglectus]
Length = 210
Score = 100 bits (250), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 52/99 (52%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ +
Sbjct: 111 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSE 170
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
G G F+TPEG T D +YRS YMR LA++ MQ A K
Sbjct: 171 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 209
>gi|188570667|gb|ACD64369.1| hypothetical protein [Bahiopsis lanata]
gi|188570669|gb|ACD64370.1| hypothetical protein [Bahiopsis lanata]
Length = 208
Score = 100 bits (250), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 51/96 (53%), Positives = 65/96 (67%), Gaps = 2/96 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D+A+TTA G++ +
Sbjct: 109 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDKAFTTAEGIFTAGWSE 168
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
G G F+TPEG T D YRS YMR LA++ MQ A
Sbjct: 169 EGFGYAFQTPEGWTMDGCYRSLVYMRPLAIWGMQQA 204
>gi|188570531|gb|ACD64301.1| hypothetical protein [Helianthus annuus]
Length = 210
Score = 100 bits (250), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/99 (52%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ +
Sbjct: 111 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSE 170
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
G G F+TPEG T D +YRS YMR LA++ MQ A K
Sbjct: 171 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 209
>gi|188570635|gb|ACD64353.1| hypothetical protein [Helianthus petiolaris]
gi|188570639|gb|ACD64355.1| hypothetical protein [Helianthus petiolaris]
Length = 211
Score = 100 bits (249), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/99 (52%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ +
Sbjct: 112 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSE 171
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
G G F+TPEG T D +YRS YMR LA++ MQ A K
Sbjct: 172 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 210
>gi|188570649|gb|ACD64360.1| hypothetical protein [Helianthus petiolaris]
Length = 210
Score = 100 bits (249), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/99 (52%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ +
Sbjct: 111 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSE 170
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
G G F+TPEG T D +YRS YMR LA++ MQ A K
Sbjct: 171 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 209
>gi|226444526|gb|ACO57837.1| unknown [Helianthus annuus]
gi|226444528|gb|ACO57838.1| unknown [Helianthus annuus]
gi|226444560|gb|ACO57854.1| unknown [Helianthus petiolaris]
gi|226444568|gb|ACO57858.1| unknown [Helianthus petiolaris]
Length = 103
Score = 100 bits (249), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/99 (52%), Positives = 67/99 (67%), Gaps = 2/99 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D+A+TTA G++ +
Sbjct: 5 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDKAFTTAEGIFIAGWSE 64
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
G G F+TPEG T D +YRS YMR LA++ MQ A K
Sbjct: 65 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 103
>gi|188570553|gb|ACD64312.1| hypothetical protein [Helianthus annuus]
Length = 210
Score = 100 bits (249), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/99 (52%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ +
Sbjct: 111 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSE 170
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
G G F+TPEG T D +YRS YMR LA++ MQ A K
Sbjct: 171 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 209
>gi|188570503|gb|ACD64287.1| hypothetical protein [Helianthus annuus]
gi|188570505|gb|ACD64288.1| hypothetical protein [Helianthus annuus]
gi|188570507|gb|ACD64289.1| hypothetical protein [Helianthus annuus]
gi|188570509|gb|ACD64290.1| hypothetical protein [Helianthus annuus]
gi|188570513|gb|ACD64292.1| hypothetical protein [Helianthus annuus]
gi|188570515|gb|ACD64293.1| hypothetical protein [Helianthus annuus]
gi|188570517|gb|ACD64294.1| hypothetical protein [Helianthus annuus]
gi|188570519|gb|ACD64295.1| hypothetical protein [Helianthus annuus]
gi|188570521|gb|ACD64296.1| hypothetical protein [Helianthus annuus]
gi|188570523|gb|ACD64297.1| hypothetical protein [Helianthus annuus]
gi|188570525|gb|ACD64298.1| hypothetical protein [Helianthus annuus]
gi|188570527|gb|ACD64299.1| hypothetical protein [Helianthus annuus]
gi|188570529|gb|ACD64300.1| hypothetical protein [Helianthus annuus]
gi|188570533|gb|ACD64302.1| hypothetical protein [Helianthus annuus]
gi|188570535|gb|ACD64303.1| hypothetical protein [Helianthus annuus]
gi|188570537|gb|ACD64304.1| hypothetical protein [Helianthus annuus]
gi|188570539|gb|ACD64305.1| hypothetical protein [Helianthus annuus]
gi|188570541|gb|ACD64306.1| hypothetical protein [Helianthus annuus]
gi|188570543|gb|ACD64307.1| hypothetical protein [Helianthus annuus]
gi|188570545|gb|ACD64308.1| hypothetical protein [Helianthus annuus]
gi|188570547|gb|ACD64309.1| hypothetical protein [Helianthus annuus]
gi|188570549|gb|ACD64310.1| hypothetical protein [Helianthus annuus]
gi|188570551|gb|ACD64311.1| hypothetical protein [Helianthus annuus]
gi|188570555|gb|ACD64313.1| hypothetical protein [Helianthus annuus]
gi|188570557|gb|ACD64314.1| hypothetical protein [Helianthus annuus]
gi|188570559|gb|ACD64315.1| hypothetical protein [Helianthus annuus]
gi|188570561|gb|ACD64316.1| hypothetical protein [Helianthus annuus]
gi|188570563|gb|ACD64317.1| hypothetical protein [Helianthus annuus]
gi|188570565|gb|ACD64318.1| hypothetical protein [Helianthus annuus]
gi|188570567|gb|ACD64319.1| hypothetical protein [Helianthus annuus]
gi|188570569|gb|ACD64320.1| hypothetical protein [Helianthus annuus]
gi|188570571|gb|ACD64321.1| hypothetical protein [Helianthus annuus]
gi|188570573|gb|ACD64322.1| hypothetical protein [Helianthus annuus]
gi|188570575|gb|ACD64323.1| hypothetical protein [Helianthus annuus]
gi|188570577|gb|ACD64324.1| hypothetical protein [Helianthus annuus]
gi|188570579|gb|ACD64325.1| hypothetical protein [Helianthus annuus]
gi|188570581|gb|ACD64326.1| hypothetical protein [Helianthus annuus]
gi|188570583|gb|ACD64327.1| hypothetical protein [Helianthus annuus]
gi|188570585|gb|ACD64328.1| hypothetical protein [Helianthus annuus]
gi|188570587|gb|ACD64329.1| hypothetical protein [Helianthus annuus]
gi|188570589|gb|ACD64330.1| hypothetical protein [Helianthus annuus]
gi|188570591|gb|ACD64331.1| hypothetical protein [Helianthus annuus]
gi|188570593|gb|ACD64332.1| hypothetical protein [Helianthus annuus]
gi|188570595|gb|ACD64333.1| hypothetical protein [Helianthus annuus]
gi|188570597|gb|ACD64334.1| hypothetical protein [Helianthus annuus]
gi|188570599|gb|ACD64335.1| hypothetical protein [Helianthus annuus]
gi|188570601|gb|ACD64336.1| hypothetical protein [Helianthus annuus]
gi|188570603|gb|ACD64337.1| hypothetical protein [Helianthus petiolaris]
gi|188570605|gb|ACD64338.1| hypothetical protein [Helianthus petiolaris]
gi|188570607|gb|ACD64339.1| hypothetical protein [Helianthus petiolaris]
gi|188570609|gb|ACD64340.1| hypothetical protein [Helianthus petiolaris]
gi|188570611|gb|ACD64341.1| hypothetical protein [Helianthus petiolaris]
gi|188570613|gb|ACD64342.1| hypothetical protein [Helianthus petiolaris]
gi|188570615|gb|ACD64343.1| hypothetical protein [Helianthus petiolaris]
gi|188570617|gb|ACD64344.1| hypothetical protein [Helianthus petiolaris]
gi|188570619|gb|ACD64345.1| hypothetical protein [Helianthus petiolaris]
gi|188570625|gb|ACD64348.1| hypothetical protein [Helianthus petiolaris]
gi|188570627|gb|ACD64349.1| hypothetical protein [Helianthus petiolaris]
gi|188570629|gb|ACD64350.1| hypothetical protein [Helianthus petiolaris]
gi|188570633|gb|ACD64352.1| hypothetical protein [Helianthus petiolaris]
gi|188570637|gb|ACD64354.1| hypothetical protein [Helianthus petiolaris]
gi|188570641|gb|ACD64356.1| hypothetical protein [Helianthus petiolaris]
gi|188570643|gb|ACD64357.1| hypothetical protein [Helianthus petiolaris]
gi|188570645|gb|ACD64358.1| hypothetical protein [Helianthus petiolaris]
gi|188570651|gb|ACD64361.1| hypothetical protein [Helianthus petiolaris]
gi|188570653|gb|ACD64362.1| hypothetical protein [Helianthus petiolaris]
gi|188570655|gb|ACD64363.1| hypothetical protein [Helianthus petiolaris]
gi|188570657|gb|ACD64364.1| hypothetical protein [Helianthus petiolaris]
gi|188570659|gb|ACD64365.1| hypothetical protein [Helianthus petiolaris]
gi|188570661|gb|ACD64366.1| hypothetical protein [Helianthus petiolaris]
gi|188570663|gb|ACD64367.1| hypothetical protein [Helianthus petiolaris]
gi|188570665|gb|ACD64368.1| hypothetical protein [Helianthus petiolaris]
gi|292384298|gb|ADE21402.1| unknown [Helianthus neglectus]
gi|292384300|gb|ADE21403.1| unknown [Helianthus neglectus]
gi|292384302|gb|ADE21404.1| unknown [Helianthus neglectus]
gi|292384304|gb|ADE21405.1| unknown [Helianthus neglectus]
gi|292384312|gb|ADE21409.1| unknown [Helianthus neglectus]
gi|292384318|gb|ADE21412.1| unknown [Helianthus neglectus]
gi|292384320|gb|ADE21413.1| unknown [Helianthus neglectus]
gi|292384328|gb|ADE21417.1| unknown [Helianthus neglectus]
Length = 210
Score = 100 bits (249), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/99 (52%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ +
Sbjct: 111 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSE 170
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
G G F+TPEG T D +YRS YMR LA++ MQ A K
Sbjct: 171 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 209
>gi|292384326|gb|ADE21416.1| unknown [Helianthus neglectus]
Length = 210
Score = 100 bits (249), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/99 (52%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ +
Sbjct: 111 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSE 170
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
G G F+TPEG T D +YRS YMR LA++ MQ A K
Sbjct: 171 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 209
>gi|292384322|gb|ADE21414.1| unknown [Helianthus neglectus]
Length = 207
Score = 100 bits (249), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/99 (52%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ +
Sbjct: 108 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSE 167
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
G G F+TPEG T D +YRS YMR LA++ MQ A K
Sbjct: 168 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 206
>gi|188570647|gb|ACD64359.1| hypothetical protein [Helianthus petiolaris]
Length = 210
Score = 100 bits (249), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/99 (52%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ +
Sbjct: 111 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSE 170
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
G G F+TPEG T D +YRS YMR LA++ MQ A K
Sbjct: 171 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 209
>gi|292384310|gb|ADE21408.1| unknown [Helianthus neglectus]
Length = 210
Score = 100 bits (249), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/99 (52%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ +
Sbjct: 111 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSE 170
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
G G F+TPEG T D +YRS YMR LA++ MQ A K
Sbjct: 171 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 209
>gi|188570631|gb|ACD64351.1| hypothetical protein [Helianthus petiolaris]
Length = 210
Score = 100 bits (249), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/99 (52%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ +
Sbjct: 111 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSE 170
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
G G F+TPEG T D +YRS YMR LA++ MQ A K
Sbjct: 171 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 209
>gi|297795757|ref|XP_002865763.1| hypothetical protein ARALYDRAFT_331392 [Arabidopsis lyrata subsp.
lyrata]
gi|297311598|gb|EFH42022.1| hypothetical protein ARALYDRAFT_331392 [Arabidopsis lyrata subsp.
lyrata]
Length = 956
Score = 100 bits (249), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/103 (50%), Positives = 66/103 (64%), Gaps = 2/103 (1%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G GAVNGM PNG D +++QS E+W+GVTY LSA M+ EG V+ A+ TA G+Y +
Sbjct: 796 GKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMAFQTASGVYEAAWSE 855
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 115
GLG F+TPE + YRS YMR LA++AMQ A K K
Sbjct: 856 AGLGYSFQTPEAWNTNDEYRSLTYMRPLAIWAMQWALTKTSQK 898
>gi|188570671|gb|ACD64371.1| hypothetical protein [Bahiopsis reticulata]
gi|188570673|gb|ACD64372.1| hypothetical protein [Bahiopsis reticulata]
Length = 210
Score = 100 bits (249), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 51/96 (53%), Positives = 65/96 (67%), Gaps = 2/96 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ +
Sbjct: 111 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDSAFTTAEGIFTAGWSE 170
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
G G F+TPEG T D +YRS YMR LA++ MQ A
Sbjct: 171 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQA 206
>gi|292384314|gb|ADE21410.1| unknown [Helianthus neglectus]
Length = 205
Score = 100 bits (248), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 51/96 (53%), Positives = 65/96 (67%), Gaps = 2/96 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ +
Sbjct: 108 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSE 167
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
G G F+TPEG T D +YRS YMR LA++ MQ A
Sbjct: 168 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQA 203
>gi|218199525|gb|EEC81952.1| hypothetical protein OsI_25831 [Oryza sativa Indica Group]
Length = 850
Score = 100 bits (248), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 51/94 (54%), Positives = 63/94 (67%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS E+WTGVTY L+A ML G D+A+TTA G++ +
Sbjct: 702 GRMGAVNGMHPNGKVDETCMQSREIWTGVTYSLAATMLLHGMDDQAFTTAEGIFTAGWSE 761
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPE T D YRS YMR LA++AMQ
Sbjct: 762 EGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQ 795
>gi|222636955|gb|EEE67087.1| hypothetical protein OsJ_24071 [Oryza sativa Japonica Group]
Length = 838
Score = 100 bits (248), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 51/94 (54%), Positives = 63/94 (67%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS E+WTGVTY L+A ML G D+A+TTA G++ +
Sbjct: 690 GRMGAVNGMHPNGKVDETCMQSREIWTGVTYSLAATMLLHGMDDQAFTTAEGIFTAGWSE 749
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPE T D YRS YMR LA++AMQ
Sbjct: 750 EGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQ 783
>gi|292384308|gb|ADE21407.1| unknown [Helianthus neglectus]
Length = 205
Score = 100 bits (248), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 51/96 (53%), Positives = 65/96 (67%), Gaps = 2/96 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ +
Sbjct: 108 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSE 167
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
G G F+TPEG T D +YRS YMR LA++ MQ A
Sbjct: 168 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQA 203
>gi|292384306|gb|ADE21406.1| unknown [Helianthus neglectus]
Length = 205
Score = 100 bits (248), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 51/96 (53%), Positives = 65/96 (67%), Gaps = 2/96 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ +
Sbjct: 108 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSE 167
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
G G F+TPEG T D +YRS YMR LA++ MQ A
Sbjct: 168 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQA 203
>gi|226444514|gb|ACO57831.1| unknown [Helianthus annuus]
gi|226444516|gb|ACO57832.1| unknown [Helianthus annuus]
gi|226444518|gb|ACO57833.1| unknown [Helianthus annuus]
gi|226444520|gb|ACO57834.1| unknown [Helianthus annuus]
gi|226444522|gb|ACO57835.1| unknown [Helianthus annuus]
gi|226444530|gb|ACO57839.1| unknown [Helianthus annuus]
gi|226444532|gb|ACO57840.1| unknown [Helianthus annuus]
gi|226444534|gb|ACO57841.1| unknown [Helianthus annuus]
gi|226444536|gb|ACO57842.1| unknown [Helianthus annuus]
gi|226444538|gb|ACO57843.1| unknown [Helianthus annuus]
gi|226444540|gb|ACO57844.1| unknown [Helianthus annuus]
gi|226444542|gb|ACO57845.1| unknown [Helianthus annuus]
gi|226444544|gb|ACO57846.1| unknown [Helianthus annuus]
gi|226444546|gb|ACO57847.1| unknown [Helianthus petiolaris]
gi|226444548|gb|ACO57848.1| unknown [Helianthus petiolaris]
gi|226444550|gb|ACO57849.1| unknown [Helianthus petiolaris]
gi|226444552|gb|ACO57850.1| unknown [Helianthus petiolaris]
gi|226444554|gb|ACO57851.1| unknown [Helianthus petiolaris]
gi|226444556|gb|ACO57852.1| unknown [Helianthus petiolaris]
gi|226444558|gb|ACO57853.1| unknown [Helianthus petiolaris]
gi|226444562|gb|ACO57855.1| unknown [Helianthus petiolaris]
gi|226444564|gb|ACO57856.1| unknown [Helianthus petiolaris]
gi|226444566|gb|ACO57857.1| unknown [Helianthus petiolaris]
gi|226444570|gb|ACO57859.1| unknown [Helianthus argophyllus]
gi|226444572|gb|ACO57860.1| unknown [Helianthus argophyllus]
gi|226444576|gb|ACO57862.1| unknown [Helianthus argophyllus]
gi|226444578|gb|ACO57863.1| unknown [Helianthus argophyllus]
gi|226444580|gb|ACO57864.1| unknown [Helianthus argophyllus]
gi|226444582|gb|ACO57865.1| unknown [Helianthus argophyllus]
gi|226444584|gb|ACO57866.1| unknown [Helianthus argophyllus]
gi|226444586|gb|ACO57867.1| unknown [Helianthus argophyllus]
gi|226444588|gb|ACO57868.1| unknown [Helianthus argophyllus]
gi|226444590|gb|ACO57869.1| unknown [Helianthus argophyllus]
gi|226444592|gb|ACO57870.1| unknown [Helianthus argophyllus]
Length = 103
Score = 100 bits (248), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/99 (52%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ +
Sbjct: 5 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSE 64
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
G G F+TPEG T D +YRS YMR LA++ MQ A K
Sbjct: 65 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 103
>gi|292384316|gb|ADE21411.1| unknown [Helianthus neglectus]
Length = 205
Score = 100 bits (248), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 51/96 (53%), Positives = 65/96 (67%), Gaps = 2/96 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ +
Sbjct: 108 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSE 167
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
G G F+TPEG T D +YRS YMR LA++ MQ A
Sbjct: 168 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQA 203
>gi|390344669|ref|XP_001200431.2| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Strongylocentrotus purpuratus]
Length = 347
Score = 99.8 bits (247), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/96 (47%), Positives = 64/96 (66%)
Query: 11 GFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
G + G+ GA+NG++P+G D +++Q EEVW G TY L+ M+ EG +E + TA G Y T
Sbjct: 235 GVKEGNFGAMNGIRPSGKPDHTSLQGEEVWVGTTYGLAGNMIQEGMWEEGFRTAKGCYTT 294
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
YE+ GL ++ PE K YRS GYMR LA++AMQ
Sbjct: 295 CYEQAGLAYQVPEAYMSKKIYRSLGYMRPLAIWAMQ 330
>gi|226444524|gb|ACO57836.1| unknown [Helianthus annuus]
Length = 103
Score = 99.8 bits (247), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 51/96 (53%), Positives = 65/96 (67%), Gaps = 2/96 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ +
Sbjct: 5 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSE 64
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
G G F+TPEG T D +YRS YMR LA++ MQ A
Sbjct: 65 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQA 100
>gi|357116998|ref|XP_003560263.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
distachyon]
Length = 932
Score = 99.4 bits (246), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 51/94 (54%), Positives = 63/94 (67%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS E+WTGVTY L+A ML G D+A+TTA G++ +
Sbjct: 785 GRMGAVNGMYPNGKVDETCMQSREIWTGVTYSLAATMLLHGMEDQAFTTAEGIFLAGWSE 844
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPE T D YRS YMR LA++AMQ
Sbjct: 845 EGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQ 878
>gi|168037390|ref|XP_001771187.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677567|gb|EDQ64036.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 981
Score = 99.4 bits (246), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 49/99 (49%), Positives = 64/99 (64%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G GA NGM PNG D + +QS E+WTGVTY SAAM++EG V++A+T A G++ +
Sbjct: 808 GKWGAANGMHPNGKVDETCMQSREIWTGVTYAASAAMIHEGMVEQAFTAAQGVFLAGWSD 867
Query: 75 TGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 113
G F+TPE T D +RS YMR LA++AMQ A K
Sbjct: 868 LGYWFQTPEAWTIDGYFRSLAYMRPLAIWAMQWALYPPK 906
>gi|188570511|gb|ACD64291.1| hypothetical protein [Helianthus annuus]
Length = 210
Score = 99.0 bits (245), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 51/99 (51%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ +
Sbjct: 111 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSE 170
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
G G F+TPEG T D +YRS YMR LA++ +Q A K
Sbjct: 171 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGIQQALEK 209
>gi|324500616|gb|ADY40283.1| Non-lysosomal glucosylceramidase [Ascaris suum]
Length = 890
Score = 98.6 bits (244), Expect = 4e-19, Method: Composition-based stats.
Identities = 46/100 (46%), Positives = 68/100 (68%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
F G +GAVNGM+ +G DR+ IQ++E+WTGVTY L+A ++ +G V + TA G Y
Sbjct: 732 FAGGRLGAVNGMRRDGTVDRNYIQADEMWTGVTYALAAFLIQQGAVQRGFDTAWGCYDGC 791
Query: 72 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
+ R GL ++TPE L + YR+ GYMR L+++AMQ A ++
Sbjct: 792 FNRFGLQYQTPEALYEQRFYRAIGYMRPLSIWAMQWALIR 831
>gi|414589150|tpg|DAA39721.1| TPA: hypothetical protein ZEAMMB73_266944 [Zea mays]
Length = 583
Score = 98.6 bits (244), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 51/94 (54%), Positives = 62/94 (65%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS E+WTGVTY L+A ML G +A+TTA G+Y +
Sbjct: 436 GRMGAVNGMHPNGKVDETCMQSREIWTGVTYSLAATMLLHGMEHQAFTTAEGIYTAGWSE 495
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPE T D YRS YMR LA++AMQ
Sbjct: 496 EGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQ 529
>gi|326489639|dbj|BAK01800.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 797
Score = 98.6 bits (244), Expect = 4e-19, Method: Composition-based stats.
Identities = 48/97 (49%), Positives = 65/97 (67%), Gaps = 2/97 (2%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
F+ G+ GA+NGM P+G D S +QS E+W GVTY L+A+M+ EG VDE + TA G+Y
Sbjct: 641 FKDGNRGAINGMWPDGTLDMSTMQSREIWPGVTYALAASMIQEGMVDEGFKTAEGIYNAA 700
Query: 72 YERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
+ GLG F+TPE D YRS YMR LA++++Q
Sbjct: 701 WSTEGLGYAFQTPESWNNDDEYRSLCYMRPLAIWSIQ 737
>gi|357630375|gb|EHJ78536.1| hypothetical protein KGM_16417 [Danaus plexippus]
Length = 912
Score = 98.6 bits (244), Expect = 4e-19, Method: Composition-based stats.
Identities = 53/101 (52%), Positives = 69/101 (68%), Gaps = 3/101 (2%)
Query: 12 FEAGSMGAVNGM--KPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYR 69
F G MGAVNG P D +AIQSEEVWTGVTY L+A M+YEG ++A++TAGGLY
Sbjct: 791 FLNGRMGAVNGFVRGPRPGIDTTAIQSEEVWTGVTYGLAALMIYEGMHEQAFSTAGGLYN 850
Query: 70 TVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYL 110
T+ + GL FETPE L + +RS YMR L++++M A +
Sbjct: 851 TLM-KMGLAFETPEALYENGNHRSVAYMRPLSIWSMYHAII 890
>gi|357157041|ref|XP_003577664.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
distachyon]
Length = 951
Score = 98.6 bits (244), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 50/106 (47%), Positives = 68/106 (64%), Gaps = 2/106 (1%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
F+ G GA+NGM P+G D S +QS E+W GVTY L+A+M+ EG V+E + TA G+Y
Sbjct: 795 FKDGKRGAMNGMWPDGTVDMSTMQSREIWPGVTYALAASMIQEGMVEEGFKTAEGIYHAA 854
Query: 72 YERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 115
+ GLG F+TPE T D YRS YMR LA++++Q A K+
Sbjct: 855 WSPEGLGYAFQTPEAWTNDDGYRSLCYMRPLAIWSIQWALSSPKLH 900
>gi|253761389|ref|XP_002489100.1| hypothetical protein SORBIDRAFT_0073s002050 [Sorghum bicolor]
gi|241947399|gb|EES20544.1| hypothetical protein SORBIDRAFT_0073s002050 [Sorghum bicolor]
Length = 603
Score = 98.2 bits (243), Expect = 6e-19, Method: Composition-based stats.
Identities = 51/96 (53%), Positives = 63/96 (65%), Gaps = 2/96 (2%)
Query: 13 EAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVY 72
+ G MGAVNGM PNG D + +QS E+WTGVTY L+A ML G +A+TTA G+Y +
Sbjct: 454 KGGRMGAVNGMHPNGKVDDTCMQSREIWTGVTYSLAATMLLHGMEHQAFTTAEGIYIAGW 513
Query: 73 ERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPE T D YRS YMR LA++AMQ
Sbjct: 514 SEEGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQ 549
>gi|224120058|ref|XP_002331126.1| predicted protein [Populus trichocarpa]
gi|222872854|gb|EEF09985.1| predicted protein [Populus trichocarpa]
Length = 948
Score = 97.8 bits (242), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 49/96 (51%), Positives = 63/96 (65%), Gaps = 2/96 (2%)
Query: 13 EAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVY 72
+ G GAVNGM P+G D S +QS E+W+GVTY ++A M+ EG D A+ TA G+Y V+
Sbjct: 791 QDGKRGAVNGMLPDGTVDLSCLQSREIWSGVTYAVAATMIQEGLTDMAFHTASGVYEAVW 850
Query: 73 ERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
GLG F+TPEG YRS YMR LA++AMQ
Sbjct: 851 AEQGLGYSFQTPEGWNTTDQYRSLCYMRPLAIWAMQ 886
>gi|188570621|gb|ACD64346.1| hypothetical protein [Helianthus petiolaris]
gi|188570623|gb|ACD64347.1| hypothetical protein [Helianthus petiolaris]
Length = 210
Score = 97.8 bits (242), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 51/99 (51%), Positives = 65/99 (65%), Gaps = 2/99 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ D A+TTA G++ +
Sbjct: 111 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAEMEDNAFTTAEGIFIAGWSE 170
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
G G F+TPEG T D +YRS YMR LA++ MQ A K
Sbjct: 171 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 209
>gi|356535470|ref|XP_003536268.1| PREDICTED: non-lysosomal glucosylceramidase-like [Glycine max]
Length = 949
Score = 97.8 bits (242), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 52/101 (51%), Positives = 66/101 (65%), Gaps = 2/101 (1%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY L+A M++ G +EA+TTA G++ +
Sbjct: 800 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGLAATMIHAGMEEEAFTTAEGIFLAGWSE 859
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 113
G G F+TPE T D YRS YMR LA++ MQ A + K
Sbjct: 860 DGYGYWFQTPEAWTMDGHYRSLMYMRPLAIWGMQYAINRPK 900
>gi|62733283|gb|AAX95400.1| At5g49900 [Oryza sativa Japonica Group]
Length = 931
Score = 97.4 bits (241), Expect = 9e-19, Method: Composition-based stats.
Identities = 48/97 (49%), Positives = 64/97 (65%), Gaps = 2/97 (2%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
F+ G GA+NGM P+G D SA+QS E+W GVTY L+A M+ EG V++ + TA G+Y
Sbjct: 776 FKDGKRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAATMIQEGMVEKGFKTAEGIYHAA 835
Query: 72 YERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
+ GLG F+TPE D YRS YMR LA++A+Q
Sbjct: 836 WSPEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAIQ 872
>gi|224129460|ref|XP_002328722.1| predicted protein [Populus trichocarpa]
gi|222839020|gb|EEE77371.1| predicted protein [Populus trichocarpa]
Length = 922
Score = 97.4 bits (241), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 49/94 (52%), Positives = 64/94 (68%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G GAVNGM P+G D S +QS E+W+GVTY ++A M+ EG +D A+ TA G+Y +
Sbjct: 767 GKRGAVNGMLPDGTVDMSDMQSREIWSGVTYAVAATMMQEGLMDMAFHTASGVYEAAWAE 826
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
GLG F+TPEG + YRS GYMR LA++AMQ
Sbjct: 827 QGLGYSFQTPEGWNTNGQYRSLGYMRPLAIWAMQ 860
>gi|226444574|gb|ACO57861.1| unknown [Helianthus argophyllus]
Length = 103
Score = 97.4 bits (241), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 51/99 (51%), Positives = 65/99 (65%), Gaps = 2/99 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ +
Sbjct: 5 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSE 64
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
G G F+TPEG T D +YRS YMR LA++ Q A K
Sbjct: 65 EGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGKQQALEK 103
>gi|115484889|ref|NP_001067588.1| Os11g0242100 [Oryza sativa Japonica Group]
gi|77549531|gb|ABA92328.1| expressed protein [Oryza sativa Japonica Group]
gi|113644810|dbj|BAF27951.1| Os11g0242100 [Oryza sativa Japonica Group]
gi|215704397|dbj|BAG93831.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222615769|gb|EEE51901.1| hypothetical protein OsJ_33494 [Oryza sativa Japonica Group]
Length = 950
Score = 96.7 bits (239), Expect = 2e-18, Method: Composition-based stats.
Identities = 48/97 (49%), Positives = 64/97 (65%), Gaps = 2/97 (2%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
F+ G GA+NGM P+G D SA+QS E+W GVTY L+A M+ EG V++ + TA G+Y
Sbjct: 795 FKDGKRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAATMIQEGMVEKGFKTAEGIYHAA 854
Query: 72 YERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
+ GLG F+TPE D YRS YMR LA++A+Q
Sbjct: 855 WSPEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAIQ 891
>gi|218185509|gb|EEC67936.1| hypothetical protein OsI_35656 [Oryza sativa Indica Group]
Length = 950
Score = 96.7 bits (239), Expect = 2e-18, Method: Composition-based stats.
Identities = 48/97 (49%), Positives = 64/97 (65%), Gaps = 2/97 (2%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
F+ G GA+NGM P+G D SA+QS E+W GVTY L+A M+ EG V++ + TA G+Y
Sbjct: 795 FKDGKRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAATMIQEGMVEKGFKTAEGIYHAA 854
Query: 72 YERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
+ GLG F+TPE D YRS YMR LA++A+Q
Sbjct: 855 WSPEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAIQ 891
>gi|225433442|ref|XP_002285674.1| PREDICTED: non-lysosomal glucosylceramidase-like [Vitis vinifera]
Length = 978
Score = 96.3 bits (238), Expect = 2e-18, Method: Composition-based stats.
Identities = 48/96 (50%), Positives = 64/96 (66%), Gaps = 2/96 (2%)
Query: 13 EAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVY 72
+ G MGAVNGM PNG D S +QS E+WTGVTY ++A M+ G ++A+TTA G++ +
Sbjct: 828 KGGKMGAVNGMHPNGKVDESCMQSREIWTGVTYGVAATMILSGMEEQAFTTAEGIFTAGW 887
Query: 73 ERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPEG T D +RS YMR LA++ MQ
Sbjct: 888 SEEGYGYWFQTPEGWTIDGHFRSLIYMRPLAIWGMQ 923
>gi|449482327|ref|XP_004156248.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Cucumis
sativus]
Length = 508
Score = 96.3 bits (238), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 65/94 (69%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G+ GAVNGM P+G D+S +Q +E+W GVTY ++A M+ EG V+ + TA G+Y+ + +
Sbjct: 350 GTRGAVNGMFPDGSVDKSILQPKEIWAGVTYSVAATMIQEGMVETGFQTAMGIYQAAWAQ 409
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
GLG F+TPE D +RS GYMR LA++AMQ
Sbjct: 410 DGLGYSFQTPEAWDVDDRFRSIGYMRPLAIWAMQ 443
>gi|449447581|ref|XP_004141546.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
Length = 944
Score = 95.9 bits (237), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 47/98 (47%), Positives = 67/98 (68%), Gaps = 2/98 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G+ GAVNGM P+G D+S +Q +E+W GVTY ++A M+ EG V+ + TA G+Y+ + +
Sbjct: 786 GTRGAVNGMFPDGSVDKSILQPKEIWAGVTYSVAATMIQEGMVETGFQTAMGIYQAAWAQ 845
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYL 110
GLG F+TPE D +RS GYMR LA++AMQ A +
Sbjct: 846 DGLGYSFQTPEAWDVDDRFRSIGYMRPLAIWAMQWAMM 883
>gi|440799031|gb|ELR20092.1| glucosylceramidase [Acanthamoeba castellanii str. Neff]
Length = 860
Score = 95.9 bits (237), Expect = 3e-18, Method: Composition-based stats.
Identities = 47/103 (45%), Positives = 65/103 (63%)
Query: 9 ARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 68
A F G GAVNGM+P+G D + + S E W G T+ L+A M+ EG VD+A+ TA G+
Sbjct: 744 ANNFNEGKRGAVNGMRPDGRVDYACLHSSEAWPGTTFALAACMMGEGLVDQAFITAAGVC 803
Query: 69 RTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
++YE G F TPE G YR+ G+MR LA++AMQ A+ +
Sbjct: 804 GSIYEDLGYWFRTPEAWDGSGNYRALGHMRPLAIWAMQWAWTQ 846
>gi|308471410|ref|XP_003097936.1| hypothetical protein CRE_12977 [Caenorhabditis remanei]
gi|308239241|gb|EFO83193.1| hypothetical protein CRE_12977 [Caenorhabditis remanei]
Length = 845
Score = 95.1 bits (235), Expect = 4e-18, Method: Composition-based stats.
Identities = 46/100 (46%), Positives = 67/100 (67%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
++ G GAVNG + D S+IQSEEVW G+TY LSA M+ +G + A+ T+ GL+ ++
Sbjct: 742 YDGGKCGAVNGYLTSERVDGSSIQSEEVWAGITYALSAMMIEKGMDEMAFKTSEGLFDSI 801
Query: 72 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
+ R L F+TPE +T D YR+ GYMR L+++A+Q A K
Sbjct: 802 WNRFPLQFQTPEAITSDGMYRALGYMRPLSIWAIQHALEK 841
>gi|268555576|ref|XP_002635777.1| Hypothetical protein CBG10433 [Caenorhabditis briggsae]
Length = 821
Score = 94.7 bits (234), Expect = 5e-18, Method: Composition-based stats.
Identities = 46/102 (45%), Positives = 68/102 (66%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
++ G GAVNG + D S+IQSEEVW G+TY LSA M+ +G + A+ T+ GL+ ++
Sbjct: 717 YDGGKCGAVNGFLASERVDGSSIQSEEVWAGITYSLSAMMIEKGMDEMAFKTSEGLFHSI 776
Query: 72 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 113
+ R L ++TPE +T D YR+ GYMR L+++A+Q A K K
Sbjct: 777 WNRFPLQYQTPEAITADGMYRALGYMRPLSIWAIQHALDKKK 818
>gi|356576421|ref|XP_003556330.1| PREDICTED: non-lysosomal glucosylceramidase-like [Glycine max]
Length = 950
Score = 94.4 bits (233), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 64/101 (63%), Gaps = 2/101 (1%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS EVWTGVTY L+A M+ G +EA+ TA G++ +
Sbjct: 801 GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGLAATMILAGMEEEAFATAEGIFLAGWSE 860
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 113
G G F+TPE T D YRS YMR LA++ MQ A + K
Sbjct: 861 DGYGYWFQTPEAWTMDGHYRSLMYMRPLAIWGMQYAINRPK 901
>gi|449463809|ref|XP_004149624.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
gi|449494604|ref|XP_004159595.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
Length = 951
Score = 94.0 bits (232), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 47/97 (48%), Positives = 65/97 (67%), Gaps = 2/97 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G GAVNGM P+G D S++QS E+W+GVTY ++A+M++E D A+ TA G++ +
Sbjct: 799 GKRGAVNGMLPDGTIDFSSMQSREIWSGVTYAVAASMIHEDMTDMAFRTAEGIHEAAWSE 858
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAY 109
GLG F+TPE T YRS YMR LA++AMQ A+
Sbjct: 859 DGLGYNFQTPEAWTTTDRYRSLCYMRPLAIWAMQWAF 895
>gi|357440839|ref|XP_003590697.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
gi|355479745|gb|AES60948.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
Length = 1103
Score = 94.0 bits (232), Expect = 1e-17, Method: Composition-based stats.
Identities = 47/98 (47%), Positives = 63/98 (64%), Gaps = 2/98 (2%)
Query: 13 EAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVY 72
+ G MGAVNGM PNG D + +QS E+W GVTY ++A M+ G +EA+TTA G++ +
Sbjct: 953 KGGRMGAVNGMHPNGKVDETCMQSREIWAGVTYGVAATMILAGMEEEAFTTAEGIFLAGW 1012
Query: 73 ERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
G G F+TPE T D YRS YMR L+++ MQ A
Sbjct: 1013 SEEGSGYWFQTPEAFTIDGHYRSLIYMRPLSIWGMQYA 1050
>gi|255554158|ref|XP_002518119.1| conserved hypothetical protein [Ricinus communis]
gi|223542715|gb|EEF44252.1| conserved hypothetical protein [Ricinus communis]
Length = 968
Score = 93.6 bits (231), Expect = 1e-17, Method: Composition-based stats.
Identities = 46/95 (48%), Positives = 61/95 (64%), Gaps = 2/95 (2%)
Query: 14 AGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYE 73
G MGAVNGM PNG D + +QS E+WTGVTY ++A M+ G D+A+ A G++ +
Sbjct: 819 GGRMGAVNGMHPNGKVDETCMQSREIWTGVTYAVAATMILAGMEDKAFAAAEGIFLAGWS 878
Query: 74 RTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPEG T D +RS YMR LA++ MQ
Sbjct: 879 EDGYGYWFQTPEGWTTDGHFRSLIYMRPLAIWGMQ 913
>gi|222612993|gb|EEE51125.1| hypothetical protein OsJ_31868 [Oryza sativa Japonica Group]
Length = 935
Score = 93.6 bits (231), Expect = 1e-17, Method: Composition-based stats.
Identities = 48/96 (50%), Positives = 63/96 (65%), Gaps = 2/96 (2%)
Query: 13 EAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVY 72
+ G +GAVNGM PNG D + +QS E+WTGVTY ++A ML G + +TTA G++ +
Sbjct: 785 KGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQGFTTAEGIFIAGW 844
Query: 73 ERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPEG T D YRS YMR LA++AMQ
Sbjct: 845 SEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 880
>gi|224072655|ref|XP_002303825.1| predicted protein [Populus trichocarpa]
gi|222841257|gb|EEE78804.1| predicted protein [Populus trichocarpa]
Length = 966
Score = 93.6 bits (231), Expect = 1e-17, Method: Composition-based stats.
Identities = 46/95 (48%), Positives = 62/95 (65%), Gaps = 2/95 (2%)
Query: 14 AGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYE 73
G MGAVNGM PNG D + +QS E+W+GVTY ++A M+ G D+A+TTA G++ +
Sbjct: 816 GGKMGAVNGMHPNGKVDETCMQSREIWSGVTYAVAATMILSGMEDKAFTTAEGIFTAGWS 875
Query: 74 RTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPE T D +RS YMR LA++ MQ
Sbjct: 876 EEGYGYWFQTPEAWTIDGHFRSLIYMRPLAIWGMQ 910
>gi|22331303|ref|NP_189060.2| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|79313355|ref|NP_001030757.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|17529232|gb|AAL38843.1| unknown protein [Arabidopsis thaliana]
gi|110742225|dbj|BAE99039.1| hypothetical protein [Arabidopsis thaliana]
gi|332643347|gb|AEE76868.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|332643348|gb|AEE76869.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
Length = 950
Score = 93.6 bits (231), Expect = 2e-17, Method: Composition-based stats.
Identities = 46/96 (47%), Positives = 62/96 (64%), Gaps = 2/96 (2%)
Query: 13 EAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVY 72
+ G MGAVNGM P+G D + +QS E+WTGVTY +A M+ G ++ +TTA G++ +
Sbjct: 802 KGGKMGAVNGMHPDGKVDDTCMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGW 861
Query: 73 ERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPEG T D YRS YMR LA++ MQ
Sbjct: 862 SEEGFGYWFQTPEGWTMDGHYRSLIYMRPLAIWGMQ 897
>gi|358254610|dbj|GAA55931.1| non-lysosomal glucosylceramidase [Clonorchis sinensis]
Length = 1200
Score = 93.6 bits (231), Expect = 2e-17, Method: Composition-based stats.
Identities = 42/96 (43%), Positives = 59/96 (61%)
Query: 13 EAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVY 72
+ GS+G +NG P RD S +Q+EE W GV Y L+A M+ EG E ++ A Y T+Y
Sbjct: 755 QNGSLGVINGGLPGAKRDLSNVQAEEFWVGVNYSLAATMILEGMSQEGFSLAEACYNTIY 814
Query: 73 ERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
R GL ++TPE D +R GYMR LA++++Q A
Sbjct: 815 NRFGLQYQTPEAYMIDGRFRCPGYMRPLAIWSIQQA 850
>gi|115482480|ref|NP_001064833.1| Os10g0473400 [Oryza sativa Japonica Group]
gi|110289241|gb|AAP54244.2| expressed protein [Oryza sativa Japonica Group]
gi|113639442|dbj|BAF26747.1| Os10g0473400 [Oryza sativa Japonica Group]
gi|215695411|dbj|BAG90602.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 974
Score = 93.2 bits (230), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 62/94 (65%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G +GAVNGM PNG D + +QS E+WTGVTY ++A ML G + +TTA G++ +
Sbjct: 826 GRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQGFTTAEGIFIAGWSE 885
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPEG T D YRS YMR LA++AMQ
Sbjct: 886 EGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 919
>gi|255575898|ref|XP_002528846.1| conserved hypothetical protein [Ricinus communis]
gi|223531697|gb|EEF33520.1| conserved hypothetical protein [Ricinus communis]
Length = 952
Score = 93.2 bits (230), Expect = 2e-17, Method: Composition-based stats.
Identities = 48/102 (47%), Positives = 65/102 (63%), Gaps = 2/102 (1%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G GA+NGM P+G D S++QS E+W+GVTY L+A M+ E +D A+ TA G+Y +
Sbjct: 797 GKRGAINGMLPDGKVDLSSMQSREIWSGVTYALAATMIQEDMLDMAFHTASGIYEAAWSE 856
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 114
GLG F+TPE YRS YMR LA++AMQ A + K+
Sbjct: 857 RGLGYSFQTPEAWNNVDQYRSLCYMRPLAIWAMQWALSRPKL 898
>gi|218184729|gb|EEC67156.1| hypothetical protein OsI_34005 [Oryza sativa Indica Group]
Length = 974
Score = 93.2 bits (230), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 62/94 (65%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G +GAVNGM PNG D + +QS E+WTGVTY ++A ML G + +TTA G++ +
Sbjct: 826 GRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQGFTTAEGIFIAGWSE 885
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPEG T D YRS YMR LA++AMQ
Sbjct: 886 EGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 919
>gi|392918721|ref|NP_504047.2| Protein R08F11.1 [Caenorhabditis elegans]
gi|387910741|emb|CCD72306.2| Protein R08F11.1 [Caenorhabditis elegans]
Length = 830
Score = 93.2 bits (230), Expect = 2e-17, Method: Composition-based stats.
Identities = 44/97 (45%), Positives = 66/97 (68%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
+ G GAVNG + D S+IQSEEVW G+TY LSA M+ +G ++A+ T+ GL+ ++
Sbjct: 727 YNNGRCGAVNGYLTSERVDGSSIQSEEVWAGITYALSAMMIEKGMDEQAFKTSEGLFESI 786
Query: 72 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
+ R L ++TPE +T D YR+ GYMR L+++A+Q A
Sbjct: 787 WHRFPLQYQTPEAITSDGMYRALGYMRPLSIWAIQHA 823
>gi|297831192|ref|XP_002883478.1| hypothetical protein ARALYDRAFT_479911 [Arabidopsis lyrata subsp.
lyrata]
gi|297329318|gb|EFH59737.1| hypothetical protein ARALYDRAFT_479911 [Arabidopsis lyrata subsp.
lyrata]
Length = 950
Score = 93.2 bits (230), Expect = 2e-17, Method: Composition-based stats.
Identities = 46/96 (47%), Positives = 62/96 (64%), Gaps = 2/96 (2%)
Query: 13 EAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVY 72
+ G MGAVNGM P+G D + +QS E+WTGVTY +A M+ G ++ +TTA G++ +
Sbjct: 802 KGGKMGAVNGMHPDGKVDDTCMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGW 861
Query: 73 ERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPEG T D YRS YMR LA++ MQ
Sbjct: 862 SEEGFGYWFQTPEGWTMDGHYRSLIYMRPLAIWGMQ 897
>gi|16905165|gb|AAL31035.1|AC078948_19 unknown protein [Oryza sativa Japonica Group]
Length = 967
Score = 93.2 bits (230), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 62/94 (65%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G +GAVNGM PNG D + +QS E+WTGVTY ++A ML G + +TTA G++ +
Sbjct: 819 GRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQGFTTAEGIFIAGWSE 878
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPEG T D YRS YMR LA++AMQ
Sbjct: 879 EGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 912
>gi|341899054|gb|EGT54989.1| hypothetical protein CAEBREN_31176 [Caenorhabditis brenneri]
Length = 831
Score = 93.2 bits (230), Expect = 2e-17, Method: Composition-based stats.
Identities = 47/98 (47%), Positives = 67/98 (68%), Gaps = 2/98 (2%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE-AWTTAGGLYRT 70
+ G GAVNG + D S+IQSEEVW G+TY LS AM+ E N+DE A+ T+ GL+ +
Sbjct: 728 YNDGKCGAVNGFLTSERVDGSSIQSEEVWAGITYALS-AMMIEKNMDEMAFKTSEGLFES 786
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
++ R L ++TPE +T D YR+ GYMR L+++A+Q A
Sbjct: 787 IWNRFPLQYQTPEAITSDGMYRALGYMRPLSIWAIQHA 824
>gi|341888271|gb|EGT44206.1| hypothetical protein CAEBREN_07747 [Caenorhabditis brenneri]
Length = 820
Score = 93.2 bits (230), Expect = 2e-17, Method: Composition-based stats.
Identities = 47/98 (47%), Positives = 67/98 (68%), Gaps = 2/98 (2%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE-AWTTAGGLYRT 70
+ G GAVNG + D S+IQSEEVW G+TY LS AM+ E N+DE A+ T+ GL+ +
Sbjct: 717 YNDGKCGAVNGFLTSERVDGSSIQSEEVWAGITYALS-AMMIEKNMDEMAFKTSEGLFES 775
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
++ R L ++TPE +T D YR+ GYMR L+++A+Q A
Sbjct: 776 IWNRFPLQYQTPEAITSDGMYRALGYMRPLSIWAIQHA 813
>gi|115474457|ref|NP_001060825.1| Os08g0111200 [Oryza sativa Japonica Group]
gi|42408344|dbj|BAD09496.1| putative Bile acid beta-glucosidase [Oryza sativa Japonica Group]
gi|42408391|dbj|BAD09542.1| putative Bile acid beta-glucosidase [Oryza sativa Japonica Group]
gi|113622794|dbj|BAF22739.1| Os08g0111200 [Oryza sativa Japonica Group]
gi|215701490|dbj|BAG92914.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 928
Score = 93.2 bits (230), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 66/94 (70%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G++GAVNGM+P+G D S+ QS+EVW GVTY ++AAM++EG + A+ TA G++ + +
Sbjct: 771 GAIGAVNGMRPDGAVDASSTQSKEVWPGVTYAVAAAMIHEGMPEAAFKTAKGIHDAGWGK 830
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPE T D YR+ YMR L V+AMQ
Sbjct: 831 HGFGYAFQTPESWTADGGYRALHYMRPLGVWAMQ 864
>gi|10440626|gb|AAG16864.1|AC069145_13 unknown protein [Oryza sativa Japonica Group]
Length = 444
Score = 92.8 bits (229), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 62/94 (65%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G +GAVNGM PNG D + +QS E+WTGVTY ++A ML G + +TTA G++ +
Sbjct: 296 GRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQGFTTAEGIFIAGWSE 355
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPEG T D YRS YMR LA++AMQ
Sbjct: 356 EGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 389
>gi|226499590|ref|NP_001146319.1| uncharacterized protein LOC100279895 [Zea mays]
gi|219886613|gb|ACL53681.1| unknown [Zea mays]
Length = 649
Score = 92.8 bits (229), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 62/96 (64%), Gaps = 2/96 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM P G D + +QS E+WTGVTY ++A ML G + +TTA G++ +
Sbjct: 501 GRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLLHGMEHQGFTTAEGIFTAGWSE 560
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
G G F+TPEG T D YRS YMR LA++A+Q A
Sbjct: 561 EGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAIQYA 596
>gi|414871104|tpg|DAA49661.1| TPA: hypothetical protein ZEAMMB73_422318 [Zea mays]
Length = 649
Score = 92.8 bits (229), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 61/94 (64%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM P G D + +QS E+WTGVTY ++A ML G + +TTA G++ +
Sbjct: 501 GRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLLHGMEHQGFTTAEGIFTAGWSE 560
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPEG T D YRS YMR LA++A+Q
Sbjct: 561 EGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAIQ 594
>gi|224146269|ref|XP_002325943.1| predicted protein [Populus trichocarpa]
gi|222862818|gb|EEF00325.1| predicted protein [Populus trichocarpa]
Length = 891
Score = 92.8 bits (229), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 66/101 (65%), Gaps = 2/101 (1%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G+ GAVNGM P+G D +A+Q++E+W GVTY LSA+M+ EG + A+ TA G+Y +
Sbjct: 736 GTRGAVNGMLPDGRVDLTAMQTKEIWPGVTYALSASMIQEGLEEMAFQTAVGIYNAAWSE 795
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 113
GLG F+ PE + YRS YMR LA++AMQ A K K
Sbjct: 796 EGLGYSFQIPESWDMNDQYRSLCYMRPLAIWAMQWALSKPK 836
>gi|357146604|ref|XP_003574050.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
distachyon]
Length = 962
Score = 92.8 bits (229), Expect = 2e-17, Method: Composition-based stats.
Identities = 47/96 (48%), Positives = 61/96 (63%), Gaps = 2/96 (2%)
Query: 13 EAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVY 72
+ G MGAVNGM P G D + +QS E+WTGVTY ++A ML G + + TA G++ +
Sbjct: 812 KGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQGFITAEGIFLAGW 871
Query: 73 ERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPEG T D YRS YMR LA++AMQ
Sbjct: 872 SEEGYGYWFQTPEGWTTDGHYRSLIYMRPLAIWAMQ 907
>gi|324504438|gb|ADY41918.1| Non-lysosomal glucosylceramidase [Ascaris suum]
Length = 534
Score = 92.4 bits (228), Expect = 3e-17, Method: Composition-based stats.
Identities = 45/97 (46%), Positives = 61/97 (62%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
F G +G VN M P+G D + IQSEEVW GV Y L++ L + A+ TA G YR+
Sbjct: 429 FANGQLGPVNAMMPSGVVDSTGIQSEEVWGGVAYALASFHLLVEENESAFKTAEGWYRSC 488
Query: 72 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
+ER GL +++PE + YR+ GYMR LA++AMQ A
Sbjct: 489 WERYGLQYQSPEAINESSYYRAIGYMRPLAIWAMQSA 525
>gi|11994237|dbj|BAB01359.1| unnamed protein product [Arabidopsis thaliana]
Length = 937
Score = 92.4 bits (228), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 61/94 (64%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM P+G D + +QS E+WTGVTY +A M+ G ++ +TTA G++ +
Sbjct: 791 GKMGAVNGMHPDGKVDDTCMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGWSE 850
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPEG T D YRS YMR LA++ MQ
Sbjct: 851 EGFGYWFQTPEGWTMDGHYRSLIYMRPLAIWGMQ 884
>gi|413941636|gb|AFW74285.1| hypothetical protein ZEAMMB73_466373 [Zea mays]
Length = 966
Score = 92.0 bits (227), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 64/94 (68%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G++GAVNGM+P+G D S+ QS+EVW GVTY ++AAM++EG + A+ TA G + +
Sbjct: 805 GAIGAVNGMRPDGAVDASSTQSKEVWPGVTYAVAAAMVHEGMTEAAFRTAKGAHDAAWGN 864
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPE T D YRS YMR L ++AMQ
Sbjct: 865 DGFGYAFQTPEAWTEDGGYRSLHYMRPLGIWAMQ 898
>gi|324503709|gb|ADY41606.1| Non-lysosomal glucosylceramidase [Ascaris suum]
Length = 261
Score = 92.0 bits (227), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 45/97 (46%), Positives = 61/97 (62%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
F G +G VN M P+G D + IQSEEVW GV Y L++ L + A+ TA G YR+
Sbjct: 156 FANGQLGPVNAMMPSGVVDSTGIQSEEVWGGVAYALASFHLLVEENESAFKTAEGWYRSC 215
Query: 72 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
+ER GL +++PE + YR+ GYMR LA++AMQ A
Sbjct: 216 WERYGLQYQSPEAINESSYYRAIGYMRPLAIWAMQSA 252
>gi|357440847|ref|XP_003590701.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
gi|355479749|gb|AES60952.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
Length = 992
Score = 91.3 bits (225), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 61/94 (64%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS E+W GVTY ++A M+ G +EA+TTA G++ +
Sbjct: 844 GRMGAVNGMHPNGKVDETCMQSREIWAGVTYGVAATMILAGMEEEAFTTAEGIFLAGWSE 903
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPE T D YRS YMR L+++ MQ
Sbjct: 904 EGSGYWFQTPEAFTIDGHYRSLIYMRPLSIWGMQ 937
>gi|326517703|dbj|BAK03770.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 821
Score = 91.3 bits (225), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 49/98 (50%), Positives = 68/98 (69%), Gaps = 4/98 (4%)
Query: 13 EAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVY 72
+ G++GAVNGM+P+G D S++QS+EVW GVTY ++AAML+EG + A+ TA G + +
Sbjct: 655 QGGAVGAVNGMRPDGAVDASSLQSKEVWVGVTYGVAAAMLHEGMTEAAFRTAKGAHDAGW 714
Query: 73 ERTGLG--FETPEGLTGDK--TYRSGGYMRALAVYAMQ 106
R G G F+TPE T D YRS YMR L+++AMQ
Sbjct: 715 GRDGFGYAFQTPEAWTSDAGGGYRSLHYMRPLSIWAMQ 752
>gi|326516802|dbj|BAJ96393.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 970
Score = 90.9 bits (224), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 60/94 (63%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM P G D + +QS E+WTGVTY ++A ML G + + TA G++ +
Sbjct: 822 GRMGAVNGMTPKGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQGFITAEGIFLAGWSE 881
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPEG T D YRS YMR LA++AMQ
Sbjct: 882 DGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAMQ 915
>gi|449442371|ref|XP_004138955.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
gi|449520930|ref|XP_004167485.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
Length = 993
Score = 90.9 bits (224), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/94 (47%), Positives = 61/94 (64%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM PNG D + +QS E+WTGVTY ++A M+ G +EA+ TA G++ +
Sbjct: 844 GRMGAVNGMHPNGKIDETCMQSREIWTGVTYGVAATMILAGMEEEAFKTAEGIFLAGWSE 903
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPE + D YRS YMR L+++ MQ
Sbjct: 904 EGFGYWFQTPEAWSTDGHYRSLIYMRPLSIWGMQ 937
>gi|353232147|emb|CCD79502.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
Length = 900
Score = 90.1 bits (222), Expect = 1e-16, Method: Composition-based stats.
Identities = 42/104 (40%), Positives = 61/104 (58%)
Query: 5 SRASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTA 64
S + G + G++GA+NG P D S++Q+EE W V Y LS+ M+ EG +DE
Sbjct: 775 SNCNWHGIKNGTIGAINGCLPVCKPDLSSVQAEEFWVAVNYSLSSLMIAEGMIDEGLALG 834
Query: 65 GGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
Y T+Y GL ++TPE D +R GYMRALA++++Q A
Sbjct: 835 EKCYDTIYNLYGLQYQTPEAYMSDGRFRCPGYMRALAIWSIQQA 878
>gi|256082537|ref|XP_002577511.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
Length = 892
Score = 90.1 bits (222), Expect = 2e-16, Method: Composition-based stats.
Identities = 42/104 (40%), Positives = 61/104 (58%)
Query: 5 SRASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTA 64
S + G + G++GA+NG P D S++Q+EE W V Y LS+ M+ EG +DE
Sbjct: 767 SNCNWHGIKNGTIGAINGCLPVCKPDLSSVQAEEFWVAVNYSLSSLMIAEGMIDEGLALG 826
Query: 65 GGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
Y T+Y GL ++TPE D +R GYMRALA++++Q A
Sbjct: 827 EKCYDTIYNLYGLQYQTPEAYMSDGRFRCPGYMRALAIWSIQQA 870
>gi|159896834|ref|YP_001543081.1| hypothetical protein Haur_0301 [Herpetosiphon aurantiacus DSM 785]
gi|159889873|gb|ABX02953.1| protein of unknown function DUF608 [Herpetosiphon aurantiacus DSM
785]
Length = 774
Score = 89.0 bits (219), Expect = 3e-16, Method: Composition-based stats.
Identities = 44/103 (42%), Positives = 63/103 (61%)
Query: 6 RASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 65
R + + G++GAVNGM P+G D S+ Q+ EVW+G TY ++A ML EG E W TA
Sbjct: 664 RFNVMQYANGALGAVNGMHPDGTVDTSSNQASEVWSGTTYAIAAMMLQEGLDLEGWQTAW 723
Query: 66 GLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
G Y Y GL F TPE ++T+R+ YMR +++A++ A
Sbjct: 724 GAYNATYNELGLWFRTPEAWGIERTFRASMYMRPQSIWAIEHA 766
>gi|242080271|ref|XP_002444904.1| hypothetical protein SORBIDRAFT_07g001140 [Sorghum bicolor]
gi|241941254|gb|EES14399.1| hypothetical protein SORBIDRAFT_07g001140 [Sorghum bicolor]
Length = 929
Score = 88.6 bits (218), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 65/94 (69%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G++GAVNGM+P+G D S+ QS+EVW GVTY ++AAM++EG + A+ TA G + + +
Sbjct: 767 GAVGAVNGMRPDGAVDASSGQSKEVWPGVTYAVAAAMVHEGMHEAAFRTAKGAHDAGWGK 826
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPE T D YRS YMR L ++AMQ
Sbjct: 827 DGFGYAFQTPEAWTEDGGYRSLHYMRPLGIWAMQ 860
>gi|302787158|ref|XP_002975349.1| hypothetical protein SELMODRAFT_174907 [Selaginella moellendorffii]
gi|300156923|gb|EFJ23550.1| hypothetical protein SELMODRAFT_174907 [Selaginella moellendorffii]
Length = 916
Score = 87.8 bits (216), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 64/97 (65%), Gaps = 2/97 (2%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
++ G+ GAVNGM P+G DR+++QS EVW G TY ++AAM+ +G +E + TA G++
Sbjct: 750 YKGGNEGAVNGMLPSGGEDRTSLQSREVWVGTTYAVAAAMIQQGMREEGFRTAKGVFLNG 809
Query: 72 YERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
+ G G F+TPE D YRS YMR L+++AMQ
Sbjct: 810 WSDQGHGYAFQTPEAWDNDGKYRSLAYMRPLSIWAMQ 846
>gi|302762052|ref|XP_002964448.1| hypothetical protein SELMODRAFT_166634 [Selaginella moellendorffii]
gi|300168177|gb|EFJ34781.1| hypothetical protein SELMODRAFT_166634 [Selaginella moellendorffii]
Length = 916
Score = 87.8 bits (216), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 64/97 (65%), Gaps = 2/97 (2%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
++ G+ GAVNGM P+G DR+++QS EVW G TY ++AAM+ +G +E + TA G++
Sbjct: 750 YKGGNEGAVNGMLPSGGEDRTSLQSREVWAGTTYAVAAAMIQQGMREEGFRTAKGVFLNG 809
Query: 72 YERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
+ G G F+TPE D YRS YMR L+++AMQ
Sbjct: 810 WSDQGHGYAFQTPEAWDNDGKYRSLAYMRPLSIWAMQ 846
>gi|28393605|gb|AAO42222.1| unknown protein [Arabidopsis thaliana]
Length = 947
Score = 87.8 bits (216), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 46/101 (45%), Positives = 62/101 (61%), Gaps = 2/101 (1%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G+ GAVNGM P+G D S + S EVW G TY ++A M+ EG D+ + TA G+Y +
Sbjct: 788 GTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEAAWSD 847
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 113
GLG F+TPE T + YRS YMR LA++ +Q A+ K
Sbjct: 848 RGLGCAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPK 888
>gi|30692911|ref|NP_174631.2| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
gi|79319136|ref|NP_001031135.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
gi|332193493|gb|AEE31614.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
gi|332193494|gb|AEE31615.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
Length = 947
Score = 87.8 bits (216), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 46/101 (45%), Positives = 62/101 (61%), Gaps = 2/101 (1%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G+ GAVNGM P+G D S + S EVW G TY ++A M+ EG D+ + TA G+Y +
Sbjct: 788 GTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEAAWSD 847
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 113
GLG F+TPE T + YRS YMR LA++ +Q A+ K
Sbjct: 848 RGLGCAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPK 888
>gi|9665098|gb|AAF97289.1|AC010164_11 Hypothetical protein [Arabidopsis thaliana]
Length = 790
Score = 87.8 bits (216), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 46/101 (45%), Positives = 62/101 (61%), Gaps = 2/101 (1%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G+ GAVNGM P+G D S + S EVW G TY ++A M+ EG D+ + TA G+Y +
Sbjct: 631 GTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEAAWSD 690
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 113
GLG F+TPE T + YRS YMR LA++ +Q A+ K
Sbjct: 691 RGLGCAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPK 731
>gi|357144426|ref|XP_003573288.1| PREDICTED: non-lysosomal glucosylceramidase-like isoform 1
[Brachypodium distachyon]
Length = 932
Score = 87.0 bits (214), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/94 (46%), Positives = 65/94 (69%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G++GAVNGM+P+G D S+ QS+E+W+G TY ++AAM++EG + A+ TA G + + +
Sbjct: 775 GAVGAVNGMRPDGGIDMSSTQSKEIWSGTTYAVAAAMVHEGMPEGAFRTAKGAHDASWGK 834
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPE T + YR YMR L+V+AMQ
Sbjct: 835 AGFGYAFQTPEAWTAEGGYRGLHYMRPLSVWAMQ 868
>gi|357144429|ref|XP_003573289.1| PREDICTED: non-lysosomal glucosylceramidase-like isoform 2
[Brachypodium distachyon]
Length = 919
Score = 87.0 bits (214), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/94 (46%), Positives = 65/94 (69%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G++GAVNGM+P+G D S+ QS+E+W+G TY ++AAM++EG + A+ TA G + + +
Sbjct: 762 GAVGAVNGMRPDGGIDMSSTQSKEIWSGTTYAVAAAMVHEGMPEGAFRTAKGAHDASWGK 821
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
G G F+TPE T + YR YMR L+V+AMQ
Sbjct: 822 AGFGYAFQTPEAWTAEGGYRGLHYMRPLSVWAMQ 855
>gi|330844075|ref|XP_003293963.1| hypothetical protein DICPUDRAFT_158887 [Dictyostelium purpureum]
gi|325075658|gb|EGC29519.1| hypothetical protein DICPUDRAFT_158887 [Dictyostelium purpureum]
Length = 1205
Score = 86.7 bits (213), Expect = 2e-15, Method: Composition-based stats.
Identities = 39/96 (40%), Positives = 58/96 (60%)
Query: 18 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGL 77
GAVNGM+P G D + +QS EVW G ++ L+A M+ +AW G+ + Y R G
Sbjct: 1083 GAVNGMRPEGVVDNTCLQSSEVWIGTSFSLAATMIQHHMDKDAWELVKGIVNSSYNRWGF 1142
Query: 78 GFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 113
++TPE + YR+G YMR L+++++Q A LK K
Sbjct: 1143 QYQTPEAWDSNGCYRAGAYMRPLSIWSIQWALLKKK 1178
>gi|297851790|ref|XP_002893776.1| hypothetical protein ARALYDRAFT_473529 [Arabidopsis lyrata subsp.
lyrata]
gi|297339618|gb|EFH70035.1| hypothetical protein ARALYDRAFT_473529 [Arabidopsis lyrata subsp.
lyrata]
Length = 947
Score = 86.3 bits (212), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 61/97 (62%), Gaps = 2/97 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G+ GAVNGM P+G D S + S EVW G TY ++A M+ EG D+ + TA G+Y +
Sbjct: 788 GTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEAAWSD 847
Query: 75 TGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAY 109
GLG F+TPE T + +RS YMR LA++ +Q A+
Sbjct: 848 RGLGCAFQTPEAWTTNDEFRSLCYMRPLAIWGIQWAH 884
>gi|281212396|gb|EFA86556.1| hypothetical protein PPL_00357 [Polysphondylium pallidum PN500]
Length = 4775
Score = 86.3 bits (212), Expect = 2e-15, Method: Composition-based stats.
Identities = 39/95 (41%), Positives = 57/95 (60%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + + GS GAVNGM+P G D +++QS EVW G +Y L++ ML +EAW GL
Sbjct: 1059 NVKSYSNGSCGAVNGMRPEGGPDTTSLQSCEVWIGTSYGLASTMLLHFMDNEAWELIKGL 1118
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAV 102
+ Y + G ++TPE + YR+G YMR LA+
Sbjct: 1119 VDSTYNKWGFQYQTPEAWDQNGLYRAGTYMRPLAI 1153
>gi|297809209|ref|XP_002872488.1| AT4g10060/T5L19_190 [Arabidopsis lyrata subsp. lyrata]
gi|297318325|gb|EFH48747.1| AT4g10060/T5L19_190 [Arabidopsis lyrata subsp. lyrata]
Length = 921
Score = 85.9 bits (211), Expect = 3e-15, Method: Composition-based stats.
Identities = 41/96 (42%), Positives = 61/96 (63%), Gaps = 2/96 (2%)
Query: 13 EAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVY 72
+ G+ GAVNGM P+G D +++ S+EVW G TY ++A M+ EG + + TA G+Y V+
Sbjct: 759 KGGTRGAVNGMSPDGQVDTNSLVSKEVWAGTTYSVAACMIQEGQRERGFQTASGIYEAVW 818
Query: 73 ERTGL--GFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
GL F+TPE + YRS YMR LA++++Q
Sbjct: 819 SDRGLSCSFQTPEAWNMNDEYRSLCYMRPLAIWSIQ 854
>gi|147788372|emb|CAN61187.1| hypothetical protein VITISV_019326 [Vitis vinifera]
Length = 900
Score = 83.6 bits (205), Expect = 1e-14, Method: Composition-based stats.
Identities = 48/132 (36%), Positives = 64/132 (48%), Gaps = 38/132 (28%)
Query: 13 EAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVY 72
+ G MGAVNGM PNG D S +QS E+WTGVTY ++A M+ G ++A+TTA G++ +
Sbjct: 714 KGGKMGAVNGMHPNGKVDESCMQSREIWTGVTYGVAATMILSGMEEQAFTTAEGIFTAGW 773
Query: 73 ERTGLG--------------------------------------FETPEGLTGDKTYRSG 94
G G F+TPEG T D +RS
Sbjct: 774 SEEGYGTLDFIDDMHCVVQILNCFVHSGLCVACVILLYCTCRYWFQTPEGWTIDGHFRSL 833
Query: 95 GYMRALAVYAMQ 106
YMR LA++ MQ
Sbjct: 834 IYMRPLAIWGMQ 845
>gi|66801399|ref|XP_629625.1| hypothetical protein DDB_G0292446 [Dictyostelium discoideum AX4]
gi|60463006|gb|EAL61202.1| hypothetical protein DDB_G0292446 [Dictyostelium discoideum AX4]
Length = 1302
Score = 82.8 bits (203), Expect = 3e-14, Method: Composition-based stats.
Identities = 36/106 (33%), Positives = 61/106 (57%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
+ + + G GAVNGM+P G D + +QS EVW G ++ L+A M+ +AW G+
Sbjct: 1152 NVKSYSKGVCGAVNGMRPEGTVDTTCLQSSEVWIGTSFSLAATMILHHMDSDAWDLVKGI 1211
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 113
+ Y++ G ++TPE + +R+ YMR L+++++Q A K K
Sbjct: 1212 VNSSYQKWGFQYQTPEAWDQNGCFRAASYMRPLSIWSIQWALQKRK 1257
>gi|4539009|emb|CAB39630.1| putative protein [Arabidopsis thaliana]
gi|7267702|emb|CAB78129.1| putative protein [Arabidopsis thaliana]
Length = 750
Score = 82.8 bits (203), Expect = 3e-14, Method: Composition-based stats.
Identities = 40/96 (41%), Positives = 60/96 (62%), Gaps = 2/96 (2%)
Query: 13 EAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVY 72
+ G+ GAVNGM G D +++ S+EVW G TY ++A M+ EG ++ + TA G+Y ++
Sbjct: 587 KGGTRGAVNGMSTEGKVDTNSLVSKEVWAGTTYSVAACMIQEGQREKGFQTASGIYEAIW 646
Query: 73 ERTGL--GFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
GL F+TPE + YRS YMR LA++A+Q
Sbjct: 647 SDRGLSCSFQTPEAWNMNDEYRSLCYMRPLAIWAIQ 682
>gi|20453181|gb|AAM19831.1| AT4g10060/T5L19_190 [Arabidopsis thaliana]
gi|24111431|gb|AAN46866.1| At4g10060/T5L19_190 [Arabidopsis thaliana]
Length = 922
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 40/94 (42%), Positives = 59/94 (62%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G+ GAVNGM G D +++ S+EVW G TY ++A M+ EG ++ + TA G+Y ++
Sbjct: 761 GTRGAVNGMSTEGKVDTNSLVSKEVWAGTTYSVAACMIQEGQREKGFQTASGIYEAIWSD 820
Query: 75 TGL--GFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
GL F+TPE + YRS YMR LA++A+Q
Sbjct: 821 RGLSCSFQTPEAWNMNDEYRSLCYMRPLAIWAIQ 854
>gi|240255774|ref|NP_192744.6| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|332657434|gb|AEE82834.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
Length = 922
Score = 81.6 bits (200), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 40/94 (42%), Positives = 59/94 (62%), Gaps = 2/94 (2%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G+ GAVNGM G D +++ S+EVW G TY ++A M+ EG ++ + TA G+Y ++
Sbjct: 761 GTRGAVNGMSTEGKVDTNSLVSKEVWAGTTYSVAACMIQEGQREKGFQTASGIYEAIWSD 820
Query: 75 TGL--GFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
GL F+TPE + YRS YMR LA++A+Q
Sbjct: 821 RGLSCSFQTPEAWNMNDEYRSLCYMRPLAIWAIQ 854
>gi|94968715|ref|YP_590763.1| hypothetical protein Acid345_1688 [Candidatus Koribacter versatilis
Ellin345]
gi|94550765|gb|ABF40689.1| protein of unknown function DUF608 [Candidatus Koribacter
versatilis Ellin345]
Length = 811
Score = 81.3 bits (199), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 65/106 (61%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 60
+KK + + G+MGA+NG+ +G+ + Q+EEVWTGVT+ ++A ML G +E
Sbjct: 700 LKKVYDFNVMKLQNGTMGALNGISASGEVLKDNEQTEEVWTGVTFAVAATMLQNGLREEG 759
Query: 61 WTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
+ TA G+Y VY++ G F TPE + YR+ YMR A+++M+
Sbjct: 760 FNTAKGVYNVVYDQKGYWFRTPEAYDTNGMYRASMYMRPGAIWSME 805
>gi|328869576|gb|EGG17953.1| hypothetical protein DFA_06619 [Dictyostelium fasciculatum]
Length = 1200
Score = 79.3 bits (194), Expect = 3e-13, Method: Composition-based stats.
Identities = 36/97 (37%), Positives = 58/97 (59%)
Query: 10 RGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYR 69
+ + GS GAVNG+ P D++ +QS EVW G +Y L++ L EAW+ GL
Sbjct: 1085 KSYSNGSCGAVNGISPLAIVDQTCLQSSEVWIGTSYSLASTFLLHYMDKEAWSLIKGLVN 1144
Query: 70 TVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
+ YE+ G ++TPE + ++R+ YMR LA++++Q
Sbjct: 1145 SSYEKWGFQYQTPEAWDMNGSFRASTYMRPLAIWSVQ 1181
>gi|332711224|ref|ZP_08431157.1| putative bile acid beta-glucosidase [Moorea producens 3L]
gi|332350038|gb|EGJ29645.1| putative bile acid beta-glucosidase [Moorea producens 3L]
Length = 815
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 38/95 (40%), Positives = 58/95 (61%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G +GAVNG++P+G D A EVWTG+ + L+A ++ +G DEA + R
Sbjct: 713 FHQGQLGAVNGVRPDGTPEDPDATHPMEVWTGINFGLAAFLIQQGMKDEALRITEAVVRQ 772
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
VY+ GL F TPE +T T+R+ Y+RA+ ++A+
Sbjct: 773 VYDH-GLQFRTPEAITAAGTFRASHYLRAMGIWAV 806
>gi|22297737|ref|NP_680984.1| hypothetical protein tlr0193 [Thermosynechococcus elongatus BP-1]
gi|22293914|dbj|BAC07746.1| tlr0193 [Thermosynechococcus elongatus BP-1]
Length = 806
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 39/95 (41%), Positives = 57/95 (60%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG+ PNG + A EVWTG+ + L+A + G +DEAW A + R
Sbjct: 703 FHNGQFGAANGLLPNGQPENPHATHPLEVWTGINFGLAAFLWQRGMIDEAWRLAEVVVRQ 762
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+YE GL F TPE +T + T+R+ Y+R +A++A+
Sbjct: 763 IYE-NGLQFRTPEAITANGTFRACMYLRPMAIWAL 796
>gi|428200465|ref|YP_007079054.1| putative bile acid beta-glucosidase [Pleurocapsa sp. PCC 7327]
gi|427977897|gb|AFY75497.1| putative bile acid beta-glucosidase [Pleurocapsa sp. PCC 7327]
Length = 808
Score = 77.0 bits (188), Expect = 1e-12, Method: Composition-based stats.
Identities = 43/106 (40%), Positives = 62/106 (58%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+KK A F+ G GA NG+KP+G D A EVWTG+ + L+A +L G DE
Sbjct: 693 LKKVYEACFLKFQDGKFGAANGVKPDGSPEDPKATHPLEVWTGINFGLAAFLLQMGMKDE 752
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
A+ A + R VY+ GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 753 AFKLAETVVRQVYD-NGLQFRTPEAITAVGTFRASHYLRAMAIWGI 797
>gi|172036524|ref|YP_001803025.1| hypothetical protein cce_1609 [Cyanothece sp. ATCC 51142]
gi|354553307|ref|ZP_08972614.1| Glucosylceramidase [Cyanothece sp. ATCC 51472]
gi|171697978|gb|ACB50959.1| unknown [Cyanothece sp. ATCC 51142]
gi|353555137|gb|EHC24526.1| Glucosylceramidase [Cyanothece sp. ATCC 51472]
Length = 818
Score = 76.3 bits (186), Expect = 3e-12, Method: Composition-based stats.
Identities = 41/111 (36%), Positives = 64/111 (57%), Gaps = 2/111 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+KK A F+ G GA NGMKP+G D ++ +EVWTG+ + L++ ++ G DE
Sbjct: 706 LKKVYEACFLKFQNGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGLASFLIQMGMKDE 765
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYL 110
A + + VYE GL F TPE +T T+R+ Y+RA+A++ + +L
Sbjct: 766 ALKLTEAVVKQVYE-NGLQFRTPEAITAVGTFRACHYLRAMAIWGVYYQFL 815
>gi|218247014|ref|YP_002372385.1| hypothetical protein PCC8801_2201 [Cyanothece sp. PCC 8801]
gi|218167492|gb|ACK66229.1| protein of unknown function DUF608 [Cyanothece sp. PCC 8801]
Length = 805
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 38/95 (40%), Positives = 58/95 (61%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F+ G GA NGMKP+G D ++ +EVWTG+ + L+A +L G D A+ + +
Sbjct: 703 FQDGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGLAAFLLQMGRKDAAFKLTEAVVKQ 762
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
VYE GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 763 VYE-NGLQFRTPEAITAVGTFRASHYLRAMAIWGI 796
>gi|257060089|ref|YP_003137977.1| glucosylceramidase [Cyanothece sp. PCC 8802]
gi|256590255|gb|ACV01142.1| Glucosylceramidase [Cyanothece sp. PCC 8802]
Length = 805
Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats.
Identities = 38/95 (40%), Positives = 58/95 (61%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F+ G GA NGMKP+G D ++ +EVWTG+ + L+A +L G D A+ + +
Sbjct: 703 FQDGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGLAAFLLQMGRKDAAFKLTEAVVKQ 762
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
VYE GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 763 VYE-NGLQFRTPEAITAVGTFRASHYLRAMAIWGI 796
>gi|126657976|ref|ZP_01729128.1| hypothetical protein CY0110_05157 [Cyanothece sp. CCY0110]
gi|126620614|gb|EAZ91331.1| hypothetical protein CY0110_05157 [Cyanothece sp. CCY0110]
Length = 804
Score = 75.5 bits (184), Expect = 4e-12, Method: Composition-based stats.
Identities = 40/106 (37%), Positives = 62/106 (58%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+KK A F+ G GA NGMKP+G D ++ +EVWTG+ + L++ ++ G DE
Sbjct: 692 LKKVYEACFLKFQNGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGLASFLIQMGMKDE 751
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
A + + VYE GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 752 ALKLTEAVVKQVYE-NGLQFRTPEAITAVGTFRACHYLRAMAIWGI 796
>gi|225593155|gb|ACN96073.1| pedicted bile acid beta-glucosidase [Fischerella sp. MV11]
Length = 812
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 39/95 (41%), Positives = 56/95 (58%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG++P+G + A EVWTG+ + L+A M+ G DEA + R
Sbjct: 714 FHHGQFGAANGLRPDGSPENPQATHPLEVWTGINFGLAAFMVQMGMKDEAMKLTEAVVRQ 773
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 774 VYE-NGLQFRTPEAITAVGTFRASTYLRAMAIWAI 807
>gi|443669591|ref|ZP_21134796.1| hypothetical protein C789_5336 [Microcystis aeruginosa DIANCHI905]
gi|159029778|emb|CAO87856.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443330115|gb|ELS44858.1| hypothetical protein C789_5336 [Microcystis aeruginosa DIANCHI905]
Length = 784
Score = 73.9 bits (180), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 57/106 (53%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+ K +A F G GA NG+KP+G + A EVW G+ + L A +L G +
Sbjct: 671 LSKIYQACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMGMEKQ 730
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
W + R VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 731 GWQITEAVVRQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|254410084|ref|ZP_05023864.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196183120|gb|EDX78104.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 814
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 39/95 (41%), Positives = 56/95 (58%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGDRDR-SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F GA NG+KP+G + A EVWTG+ + L+A ++ G DEA + R
Sbjct: 714 FHNAQFGAANGVKPDGSPEHPDATHPLEVWTGINFGLAAFLIQLGMKDEALKLTESVVRQ 773
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
VYE GL F TPE +T T+R+G Y+RA+A++A+
Sbjct: 774 VYE-NGLQFRTPEAITPVGTFRAGHYLRAMAIWAV 807
>gi|354566897|ref|ZP_08986068.1| Glucosylceramidase [Fischerella sp. JSC-11]
gi|353544556|gb|EHC14010.1| Glucosylceramidase [Fischerella sp. JSC-11]
Length = 812
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 38/95 (40%), Positives = 56/95 (58%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG++P+G + A EVWTG+ + L+A M+ G DEA + R
Sbjct: 714 FHHGQFGAANGLRPDGSPENPQATHPLEVWTGINFGLAAFMVQMGMKDEAMKLTETVVRQ 773
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+YE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 774 IYE-NGLQFRTPEAITAVGTFRASTYLRAMAIWAI 807
>gi|434391596|ref|YP_007126543.1| Glucosylceramidase [Gloeocapsa sp. PCC 7428]
gi|428263437|gb|AFZ29383.1| Glucosylceramidase [Gloeocapsa sp. PCC 7428]
Length = 809
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 36/95 (37%), Positives = 56/95 (58%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG+KP+G + +A EVWTG+ + L+A ++ G EAW + +
Sbjct: 710 FHNGQFGAANGLKPDGLPENPNATHPLEVWTGINFGLAAFLMQMGMKSEAWRITQAVVQQ 769
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
VY+ GL F TPE +T T+R+ Y+R +A++A+
Sbjct: 770 VYD-NGLQFRTPEAITAKGTFRACHYLRPMAIWAI 803
>gi|425455967|ref|ZP_18835678.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9807]
gi|389803038|emb|CCI17984.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9807]
Length = 784
Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 56/106 (52%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+ K A F G GA NG+KP+G + A EVW G+ + L A +L G +
Sbjct: 671 LSKIYEACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMGMEKQ 730
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
W + R +YE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 731 GWQITEAVVRQIYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|390437958|ref|ZP_10226465.1| Glycosidase-like protein [Microcystis sp. T1-4]
gi|389838618|emb|CCI30589.1| Glycosidase-like protein [Microcystis sp. T1-4]
Length = 784
Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 57/106 (53%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+ K +A F G GA NG+KP+G + A EVW G+ + L A ++ G +
Sbjct: 671 LSKIYQACFLKFHEGKFGAANGLKPDGTPENPQATHPLEVWVGINFGLVAFLMQMGMEKQ 730
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
W + R VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 731 GWQITEAVVRQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|425435596|ref|ZP_18816046.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9432]
gi|389679839|emb|CCH91411.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9432]
Length = 784
Score = 72.8 bits (177), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 56/106 (52%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+ K A F G GA NG+KP+G + A EVW G+ + L A +L G +
Sbjct: 671 LSKIYEACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMGMEKQ 730
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
W + R +YE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 731 GWQITEAVVRQIYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|425460273|ref|ZP_18839755.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9808]
gi|389827051|emb|CCI21967.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9808]
Length = 784
Score = 72.8 bits (177), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 56/106 (52%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+ K A F G GA NG+KP+G + A EVW G+ + L A +L G +
Sbjct: 671 LSKIYEACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMGMEKQ 730
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
W + R +YE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 731 GWQITEAVVRQIYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|374311163|ref|YP_005057593.1| glucosylceramidase [Granulicella mallensis MP5ACTX8]
gi|358753173|gb|AEU36563.1| Glucosylceramidase [Granulicella mallensis MP5ACTX8]
Length = 811
Score = 72.8 bits (177), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 41/105 (39%), Positives = 57/105 (54%), Gaps = 1/105 (0%)
Query: 2 KKKSRASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 61
KK + F G MGA NGM +G +A +++EVW G T + ++ EG DEAW
Sbjct: 701 KKIFDVNVMKFGHGEMGAANGMTADGAILTNA-EAKEVWVGTTLGYAGLLMQEGMNDEAW 759
Query: 62 TTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
T GLY +YE G F TPE +R+G YMR A++A++
Sbjct: 760 KTTRGLYHVIYEDKGYWFRTPEAWDITGNFRAGMYMRPTAIWALE 804
>gi|307151416|ref|YP_003886800.1| glucosylceramidase [Cyanothece sp. PCC 7822]
gi|306981644|gb|ADN13525.1| Glucosylceramidase [Cyanothece sp. PCC 7822]
Length = 805
Score = 72.8 bits (177), Expect = 3e-11, Method: Composition-based stats.
Identities = 40/106 (37%), Positives = 61/106 (57%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
++K A F G GA NG+KP+G + A EVWTG+ + L+A M+ G +E
Sbjct: 692 LRKIYEACFLKFHGGKYGAANGVKPDGTPENPDATHPLEVWTGINFGLAAFMIQMGMKEE 751
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
A+ + + VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 752 AFKITEAVVKQVYE-NGLQFRTPEAITAVGTFRASHYLRAMAIWAV 796
>gi|427725201|ref|YP_007072478.1| glucosylceramidase [Leptolyngbya sp. PCC 7376]
gi|427356921|gb|AFY39644.1| Glucosylceramidase [Leptolyngbya sp. PCC 7376]
Length = 807
Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 56/98 (57%), Gaps = 2/98 (2%)
Query: 12 FEAGSMGAVNGMKPNGDRDR-SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG+ P+G ++ + EVWTG+ + ++A M+ G DEA +
Sbjct: 709 FHDGQFGAANGLNPDGSPEKENDTHPLEVWTGINFGIAAFMIRNGMKDEALKMTEAVIHQ 768
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
VY GL F TPE +T + T+R+ Y+RA+ ++A+ DA
Sbjct: 769 VYS-NGLQFRTPEAITANNTFRACHYLRAMGIWAIYDA 805
>gi|440753145|ref|ZP_20932348.1| hypothetical protein O53_1523 [Microcystis aeruginosa TAIHU98]
gi|440177638|gb|ELP56911.1| hypothetical protein O53_1523 [Microcystis aeruginosa TAIHU98]
Length = 789
Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 57/106 (53%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+ K +A F G GA NG+KP+G + A EVW G+ + L A +L G +
Sbjct: 671 LSKIYQACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMGMEKQ 730
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
W + + VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 731 GWQITEAVVKQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|220909879|ref|YP_002485190.1| hypothetical protein Cyan7425_4519 [Cyanothece sp. PCC 7425]
gi|219866490|gb|ACL46829.1| protein of unknown function DUF608 [Cyanothece sp. PCC 7425]
Length = 814
Score = 71.6 bits (174), Expect = 5e-11, Method: Composition-based stats.
Identities = 35/95 (36%), Positives = 58/95 (61%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGDRDR-SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G++GA NG++P G ++ A EVWTG+ + L+A ++ G EA +
Sbjct: 702 FHNGTLGAANGLRPGGLAEKPDATHPLEVWTGINFGLAAFLVQMGRKAEAMQLTAAVVYQ 761
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+YE GL F TPE +T ++T+R+G Y+R +A++A+
Sbjct: 762 IYEH-GLQFRTPEAITAEQTFRAGMYLRPMAIWAI 795
>gi|119510228|ref|ZP_01629365.1| hypothetical protein N9414_13977 [Nodularia spumigena CCY9414]
gi|119465077|gb|EAW45977.1| hypothetical protein N9414_13977 [Nodularia spumigena CCY9414]
Length = 814
Score = 71.6 bits (174), Expect = 5e-11, Method: Composition-based stats.
Identities = 36/95 (37%), Positives = 57/95 (60%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGDRDR-SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG++P+G + +A EVWTG+ + L+A ++ G DEA G + +
Sbjct: 703 FCNGEFGAANGVRPDGSAENPNATHPLEVWTGINFGLAAFLVQMGMQDEALKLTGAVVQQ 762
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+Y GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 763 IYH-NGLQFRTPEAITASGTFRASTYLRAMAIWAI 796
>gi|428308994|ref|YP_007119971.1| bile acid beta-glucosidase [Microcoleus sp. PCC 7113]
gi|428250606|gb|AFZ16565.1| putative bile acid beta-glucosidase [Microcoleus sp. PCC 7113]
Length = 819
Score = 71.6 bits (174), Expect = 6e-11, Method: Composition-based stats.
Identities = 36/95 (37%), Positives = 57/95 (60%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F+ +GA NG++P+G + A EVWTG+ + L+A ++ +G EA + R
Sbjct: 720 FQDAQLGAANGVRPDGSPENPDATHPLEVWTGINFGLAAFLVQQGMKQEALQLTEAVVRQ 779
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 780 VYE-NGLQFRTPEAITAAGTFRASHYLRAMAIWAV 813
>gi|67922703|ref|ZP_00516206.1| Protein of unknown function DUF608 [Crocosphaera watsonii WH 8501]
gi|67855413|gb|EAM50669.1| Protein of unknown function DUF608 [Crocosphaera watsonii WH 8501]
Length = 804
Score = 71.6 bits (174), Expect = 6e-11, Method: Composition-based stats.
Identities = 36/95 (37%), Positives = 58/95 (61%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F+ G GA NGMKP+G D ++ +EVWTG+ + +++ ++ G +EA + +
Sbjct: 703 FQNGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGIASFLIQMGMKNEALKLTETVVKQ 762
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
VYE GL F TPE +T TYR+ Y+RA+A++ +
Sbjct: 763 VYE-NGLQFRTPEAITAVGTYRACHYLRAMAIWGV 796
>gi|425471589|ref|ZP_18850441.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9701]
gi|389882505|emb|CCI37035.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9701]
Length = 784
Score = 71.6 bits (174), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 57/106 (53%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+ K +A F G GA NG+KP+G + A EVW G+ + L A +L G +
Sbjct: 671 LGKIYQACFLKFHGGKFGAANGLKPDGTPENPQATHPLEVWVGINFGLVAFLLQMGMEKQ 730
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
W + + VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 731 GWQITEAVVKQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|425464233|ref|ZP_18843555.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9809]
gi|389833806|emb|CCI21373.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9809]
Length = 784
Score = 71.6 bits (174), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 57/106 (53%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+ K +A F G GA NG+KP+G + A EVW G+ + L A ++ G +
Sbjct: 671 LSKIYQACFLKFHGGKFGAANGLKPDGTPENPQATHPLEVWVGINFGLVAFLMQMGMEKQ 730
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
W + + VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 731 GWQITEAVVKQVYEN-GLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|416392732|ref|ZP_11685894.1| hypothetical protein CWATWH0003_2700 [Crocosphaera watsonii WH
0003]
gi|357263592|gb|EHJ12579.1| hypothetical protein CWATWH0003_2700 [Crocosphaera watsonii WH
0003]
Length = 804
Score = 71.6 bits (174), Expect = 6e-11, Method: Composition-based stats.
Identities = 36/95 (37%), Positives = 58/95 (61%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F+ G GA NGMKP+G D ++ +EVWTG+ + +++ ++ G +EA + +
Sbjct: 703 FQNGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGIASFLIQMGMKNEALKLTETVVKQ 762
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
VYE GL F TPE +T TYR+ Y+RA+A++ +
Sbjct: 763 VYE-NGLQFRTPEAITAVGTYRACHYLRAMAIWGV 796
>gi|422303239|ref|ZP_16390593.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9806]
gi|389791807|emb|CCI12404.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9806]
Length = 784
Score = 71.6 bits (174), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 57/106 (53%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+ K ++ F G GA NG+KP+G + A EVW G+ + L A ++ G +
Sbjct: 671 LSKIYQSCFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLMQMGMEKQ 730
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
W + R VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 731 GWQITEAVVRQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|166363582|ref|YP_001655855.1| glycosidase-like protein [Microcystis aeruginosa NIES-843]
gi|166085955|dbj|BAG00663.1| glycosidase-like protein [Microcystis aeruginosa NIES-843]
Length = 784
Score = 71.2 bits (173), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 57/106 (53%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+ K +A F G GA NG+KP+G + A EVW G+ + L A ++ G +
Sbjct: 671 LSKIYQACFLKFHGGKFGAANGLKPDGTPENPQATHPLEVWVGINFGLVAFLMQMGMEKQ 730
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
W + + VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 731 GWQITEAVVKQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|428299644|ref|YP_007137950.1| glucosylceramidase [Calothrix sp. PCC 6303]
gi|428236188|gb|AFZ01978.1| Glucosylceramidase [Calothrix sp. PCC 6303]
Length = 804
Score = 71.2 bits (173), Expect = 8e-11, Method: Composition-based stats.
Identities = 35/95 (36%), Positives = 56/95 (58%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG++P+G + A EVWTG+ + L+A ++ G DEAW + +
Sbjct: 701 FNDGKFGAANGLRPDGSPENPKATHPLEVWTGINFGLAAFLMQMGMQDEAWKLTEVVVQQ 760
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+Y+ GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 761 IYD-NGLQFRTPEAITATGTFRACTYLRAMAIWGI 794
>gi|428209901|ref|YP_007094254.1| glucosylceramidase [Chroococcidiopsis thermalis PCC 7203]
gi|428011822|gb|AFY90385.1| Glucosylceramidase [Chroococcidiopsis thermalis PCC 7203]
Length = 812
Score = 70.1 bits (170), Expect = 2e-10, Method: Composition-based stats.
Identities = 38/101 (37%), Positives = 57/101 (56%), Gaps = 2/101 (1%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG+KPNG + +A EVWTG+ + L+A +L G EA +
Sbjct: 712 FHQGKFGAANGLKPNGSPENPNATHPLEVWTGINFGLAAFLLQMGMKAEALQLTETVVNQ 771
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
+YE GL F TPE +T T+R+ Y+RA+A++A+ + +
Sbjct: 772 IYE-NGLQFRTPEAITSVGTFRASYYLRAMAIWAIYGVWFE 811
>gi|326507888|dbj|BAJ86687.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 676
Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 53/82 (64%), Gaps = 2/82 (2%)
Query: 13 EAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVY 72
+ G++GAVNGM+P+G D S+IQS EVW GVTY ++AAM +EG + A+ TA G + +
Sbjct: 537 QGGAIGAVNGMRPDGTVDTSSIQSREVWPGVTYGVAAAMAHEGMPEAAFRTAKGAHDAGW 596
Query: 73 ERTGLGFETPE--GLTGDKTYR 92
R G G P+ G+ G + R
Sbjct: 597 GRDGFGVRVPDAGGVDGRRRQR 618
>gi|425450263|ref|ZP_18830094.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 7941]
gi|389768962|emb|CCI06046.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 7941]
Length = 784
Score = 69.3 bits (168), Expect = 2e-10, Method: Composition-based stats.
Identities = 38/106 (35%), Positives = 57/106 (53%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+ K +A F G GA NG+KP+G + A EVW G+ + L A +L G +
Sbjct: 671 LSKIYQACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMGMEKQ 730
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
W + R +YE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 731 GWQITQAVVRQIYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|170079447|ref|YP_001736085.1| hypothetical protein SYNPCC7002_A2862 [Synechococcus sp. PCC 7002]
gi|169887116|gb|ACB00830.1| Conserved hypothetical protein (DUF608) [Synechococcus sp. PCC
7002]
Length = 803
Score = 69.3 bits (168), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 7 ASARGFEAGSMGAVNGMKPNGDRDR-SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 65
A F G GA NG+ P+G ++ + EVWTG+ + ++A M+ G EA
Sbjct: 698 ACFEKFHGGQFGAANGLNPDGTPEKENDTHPLEVWTGINFGIAALMIRNGMQTEALRMVE 757
Query: 66 GLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
+ VY+ GL F TPE +T + T+R+ Y+RA+ ++A+ DA
Sbjct: 758 AVVEQVYDN-GLQFRTPEAITANGTFRACHYLRAMGIWAIYDA 799
>gi|443320525|ref|ZP_21049620.1| putative bile acid beta-glucosidase [Gloeocapsa sp. PCC 73106]
gi|442789771|gb|ELR99409.1| putative bile acid beta-glucosidase [Gloeocapsa sp. PCC 73106]
Length = 806
Score = 68.9 bits (167), Expect = 3e-10, Method: Composition-based stats.
Identities = 35/95 (36%), Positives = 57/95 (60%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G +GA NG+K +G + + EVWTG+ + L A +++ G DE A + +
Sbjct: 703 FHQGKIGAANGVKLDGTGENPQSTHPLEVWTGINFALGAFLIHSGMRDEGLKLAETIVKQ 762
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+Y++ GL F TPE LT T+R+ Y+RA+A++A+
Sbjct: 763 IYDQ-GLQFRTPEALTTSGTFRASHYLRAMAIWAI 796
>gi|218437284|ref|YP_002375613.1| hypothetical protein PCC7424_0277 [Cyanothece sp. PCC 7424]
gi|218170012|gb|ACK68745.1| protein of unknown function DUF608 [Cyanothece sp. PCC 7424]
Length = 801
Score = 68.9 bits (167), Expect = 4e-10, Method: Composition-based stats.
Identities = 35/95 (36%), Positives = 55/95 (57%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG++P+G + A EVWTG+ + L+A M+ G E + + +
Sbjct: 703 FYGGKYGAANGVRPDGSPENPDATHPLEVWTGINFGLAAFMIQMGMKQEGLSITEAVVKQ 762
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+YE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 763 IYE-NGLQFRTPEAITAAGTFRASHYLRAMAIWAI 796
>gi|425442611|ref|ZP_18822852.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9717]
gi|389716273|emb|CCH99457.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9717]
Length = 784
Score = 68.2 bits (165), Expect = 5e-10, Method: Composition-based stats.
Identities = 38/106 (35%), Positives = 57/106 (53%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+ K +A F G GA NG+KP+G + A EVW G+ + L A +L G +
Sbjct: 671 LSKIYQACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMGMEKQ 730
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
W + + VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 731 GWQITEAVVKQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|427712531|ref|YP_007061155.1| putative bile acid beta-glucosidase [Synechococcus sp. PCC 6312]
gi|427376660|gb|AFY60612.1| putative bile acid beta-glucosidase [Synechococcus sp. PCC 6312]
Length = 799
Score = 68.2 bits (165), Expect = 5e-10, Method: Composition-based stats.
Identities = 36/95 (37%), Positives = 54/95 (56%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGDRDR-SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG+ P+G ++ +A EVW G+ Y L+A + G D AW + R
Sbjct: 695 FHEGQFGAANGLLPDGSPEKPNATHPLEVWIGINYGLAAFLYQMGLEDGAWRLTQTVIRQ 754
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+Y+ GL F TPE +T T+R+ Y+R +A++AM
Sbjct: 755 IYD-NGLQFRTPEAITAVGTFRACMYLRPMAIWAM 788
>gi|428773649|ref|YP_007165437.1| glucosylceramidase [Cyanobacterium stanieri PCC 7202]
gi|428687928|gb|AFZ47788.1| Glucosylceramidase [Cyanobacterium stanieri PCC 7202]
Length = 800
Score = 67.8 bits (164), Expect = 7e-10, Method: Composition-based stats.
Identities = 35/98 (35%), Positives = 56/98 (57%), Gaps = 2/98 (2%)
Query: 12 FEAGSMGAVNGMKPNGDRDR-SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G++G NG+KP+G ++ + E+WTG+ Y + A M+ G +E A +
Sbjct: 702 FHNGTLGIANGVKPDGSPEKENDTHPLEIWTGINYGIVAFMILNGMKEEGLKVAETVVNQ 761
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
VY GL F TPE +T T+R+ Y+RA+A++A+ A
Sbjct: 762 VYS-NGLQFRTPEAITAVNTFRASHYLRAMAIWAIYQA 798
>gi|425447791|ref|ZP_18827773.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9443]
gi|389731620|emb|CCI04385.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9443]
Length = 784
Score = 67.8 bits (164), Expect = 8e-10, Method: Composition-based stats.
Identities = 35/95 (36%), Positives = 53/95 (55%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG+KP+G + A EVW G+ + L A +L G + W + +
Sbjct: 682 FHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVQQ 741
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+YE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 742 IYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|427702545|ref|YP_007045767.1| bile acid beta-glucosidase [Cyanobium gracile PCC 6307]
gi|427345713|gb|AFY28426.1| putative bile acid beta-glucosidase [Cyanobium gracile PCC 6307]
Length = 835
Score = 67.8 bits (164), Expect = 9e-10, Method: Composition-based stats.
Identities = 38/106 (35%), Positives = 57/106 (53%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+K +A FE G +G NG++ +G D EVWTG+ + L+A G
Sbjct: 714 LKAIRQACFESFEGGRLGVANGLRRDGTPLDPEGTHPLEVWTGINFGLAAYYRLMGQSST 773
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
A+ G + R VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 774 AFAITGAVVRQVYE-GGLQFRTPEAITAAGTFRACHYLRAMAIWAL 818
>gi|427716943|ref|YP_007064937.1| glucosylceramidase [Calothrix sp. PCC 7507]
gi|427349379|gb|AFY32103.1| Glucosylceramidase [Calothrix sp. PCC 7507]
Length = 800
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 35/95 (36%), Positives = 56/95 (58%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG++P+G + A EVWTG+ + L+A ++ G DEA + +
Sbjct: 699 FYDGQFGAANGVRPDGSPENPQATHPLEVWTGINFGLAAFLVQMGMQDEALKLTQAVVKQ 758
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+Y+ GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 759 IYD-NGLQFRTPEAITTVGTFRASTYLRAMAIWAI 792
>gi|290990718|ref|XP_002677983.1| predicted protein [Naegleria gruberi]
gi|284091593|gb|EFC45239.1| predicted protein [Naegleria gruberi]
Length = 937
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 32/102 (31%), Positives = 60/102 (58%), Gaps = 9/102 (8%)
Query: 18 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER--- 74
G VNGM+P + D++++QS E+WTG +Y+++A + G +E L ++V+++
Sbjct: 840 GCVNGMRPTQEVDKTSLQSREMWTGTSYVVAALTILLGRREEGID----LMKSVFDKCWS 895
Query: 75 --TGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 114
+ F+TPE +T YR+ GY R L++++ A + K+
Sbjct: 896 SNSAFWFQTPEAVTETGEYRALGYCRPLSIWSCLSAMMNKKI 937
>gi|443325334|ref|ZP_21054034.1| putative bile acid beta-glucosidase [Xenococcus sp. PCC 7305]
gi|442795063|gb|ELS04450.1| putative bile acid beta-glucosidase [Xenococcus sp. PCC 7305]
Length = 808
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 39/99 (39%), Positives = 57/99 (57%), Gaps = 10/99 (10%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSE-----EVWTGVTYLLSAAMLYEGNVDEAWTTAGG 66
F G GA NG+ P+G SA+ + EVWTG+ + L+A M+ G DEA+
Sbjct: 707 FHDGKFGAANGVLPDG----SAVNPDDTHPLEVWTGINFGLAAFMVQMGMKDEAFRLTET 762
Query: 67 LYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+ R VYE GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 763 VVRQVYE-NGLQFRTPEAITAVGTFRASHYLRAMAIWGV 800
>gi|282897193|ref|ZP_06305195.1| Protein of unknown function DUF608 [Raphidiopsis brookii D9]
gi|281197845|gb|EFA72739.1| Protein of unknown function DUF608 [Raphidiopsis brookii D9]
Length = 798
Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats.
Identities = 34/99 (34%), Positives = 55/99 (55%), Gaps = 2/99 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F+ G GA NG+ P+G + A EVWTG+ + L+A +L G D+ + R
Sbjct: 700 FQDGKFGAANGVLPDGSPENPQATHPLEVWTGINFGLAAFLLQMGMKDQGLRLTQAVVRQ 759
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAY 109
+Y+ GL F TPE +T T+R+ Y+R +A++A+ +
Sbjct: 760 IYDH-GLQFRTPEAITAAGTFRASTYLRPMAIWAIYTCF 797
>gi|409991122|ref|ZP_11274412.1| hypothetical protein APPUASWS_08895 [Arthrospira platensis str.
Paraca]
gi|291568810|dbj|BAI91082.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409938009|gb|EKN79383.1| hypothetical protein APPUASWS_08895 [Arthrospira platensis str.
Paraca]
Length = 799
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 2/101 (1%)
Query: 6 RASARGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTA 64
+A + F G GA NG++ G+ + EVWTG+ + L A ++ G +EA+ A
Sbjct: 693 QACFQNFHNGKFGAANGVRTTGEPINPQDTHPLEVWTGINFGLGAFLIQMGMKEEAFQLA 752
Query: 65 GGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+ VY+ GL F TPE +T T+R+ Y+RA+A++AM
Sbjct: 753 EAVITQVYQ-NGLQFRTPEAITAAGTFRASHYLRAMAIWAM 792
>gi|440680630|ref|YP_007155425.1| Glucosylceramidase [Anabaena cylindrica PCC 7122]
gi|428677749|gb|AFZ56515.1| Glucosylceramidase [Anabaena cylindrica PCC 7122]
Length = 802
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 37/101 (36%), Positives = 59/101 (58%), Gaps = 5/101 (4%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG++P+G + A EVWTG+ + L+A ++ G +EA + +
Sbjct: 705 FCNGQFGAANGVRPDGSPENPQATHPLEVWTGINFGLAAFLIQMGMKNEAMRLTQAVVQQ 764
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 111
+Y+ GL F TPE +T T+R+ Y+RA+A++A+ YLK
Sbjct: 765 IYD-NGLQFRTPEAITSAGTFRASTYLRAMAIWAI---YLK 801
>gi|334116939|ref|ZP_08491031.1| Glucosylceramidase [Microcoleus vaginatus FGP-2]
gi|333461759|gb|EGK90364.1| Glucosylceramidase [Microcoleus vaginatus FGP-2]
Length = 818
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 35/95 (36%), Positives = 55/95 (57%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG+ NG + +A EVWTG+ + L+A ++ G D+A+ +
Sbjct: 717 FNEGEFGAANGVMLNGSPENPNATHPLEVWTGINFGLAAFLVQMGMEDKAFKLTDAVVNQ 776
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+YE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 777 IYE-NGLQFRTPEAITAGGTFRASHYLRAMAIWAI 810
>gi|428223619|ref|YP_007107716.1| glucosylceramidase [Geitlerinema sp. PCC 7407]
gi|427983520|gb|AFY64664.1| Glucosylceramidase [Geitlerinema sp. PCC 7407]
Length = 821
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 54/95 (56%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG+KP+G + A EVWTG+ + ++A ++ G DE + +
Sbjct: 709 FHDGKFGAANGVKPDGSPENPKATHPLEVWTGINFGIAAFLVQMGLRDEGFRLTQAVVEQ 768
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+YE GL F TPE +T T+R+ Y+R +A++A+
Sbjct: 769 IYEH-GLQFRTPEAITAVGTFRACHYLRPMAIWAV 802
>gi|119493962|ref|ZP_01624522.1| hypothetical protein L8106_04106 [Lyngbya sp. PCC 8106]
gi|119452283|gb|EAW33479.1| hypothetical protein L8106_04106 [Lyngbya sp. PCC 8106]
Length = 799
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 35/99 (35%), Positives = 59/99 (59%), Gaps = 2/99 (2%)
Query: 10 RGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 68
+ F G GA NG++ NG+ + + EVWTG+ + L+A ++ G +E + A +
Sbjct: 698 QNFYQGQFGAANGVRINGEPINPNDTHPLEVWTGINFGLAAFLIQMGMKEEGFKLAEVVV 757
Query: 69 RTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQD 107
+ +YE GL F TPE +T T+R+ Y+RA+A++A+ D
Sbjct: 758 QQIYE-NGLQFRTPEAITAAGTFRASHYLRAMAIWAIYD 795
>gi|414077215|ref|YP_006996533.1| beta-glucosidase [Anabaena sp. 90]
gi|413970631|gb|AFW94720.1| putative beta-glucosidase [Anabaena sp. 90]
Length = 795
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 33/95 (34%), Positives = 56/95 (58%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F+ G GA NG+ PNG + ++ EVWTG+ + L+A ++ +EA + R
Sbjct: 697 FQNGEFGAANGVLPNGLPENPNSTHPLEVWTGINFGLAAFLVQMNMKNEAMRLTEAVVRQ 756
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+Y+ GL F TPE +T + T+R+ Y+RA+A++ +
Sbjct: 757 IYD-NGLQFRTPEAITANGTFRASTYLRAMAIWGI 790
>gi|209526014|ref|ZP_03274547.1| protein of unknown function DUF608 [Arthrospira maxima CS-328]
gi|376002240|ref|ZP_09780079.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423067988|ref|ZP_17056778.1| hypothetical protein SPLC1_S630380 [Arthrospira platensis C1]
gi|209493540|gb|EDZ93862.1| protein of unknown function DUF608 [Arthrospira maxima CS-328]
gi|375329367|emb|CCE15832.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406710526|gb|EKD05737.1| hypothetical protein SPLC1_S630380 [Arthrospira platensis C1]
Length = 799
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 2/101 (1%)
Query: 6 RASARGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTA 64
+A + F G GA NG++ G+ + + EVWTG+ + L A + G +EA+ A
Sbjct: 693 QACFQNFHNGKFGAANGVRTTGEPMNPNDTHPLEVWTGINFGLGAFFIQMGMKEEAFQLA 752
Query: 65 GGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+ VY+ GL F TPE +T T+R+ Y+RA+A++AM
Sbjct: 753 EAVITQVYQ-NGLQFRTPEAITAAGTFRASHYLRAMAIWAM 792
>gi|434400779|ref|YP_007134783.1| Glucosylceramidase [Stanieria cyanosphaera PCC 7437]
gi|428271876|gb|AFZ37817.1| Glucosylceramidase [Stanieria cyanosphaera PCC 7437]
Length = 804
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 39/106 (36%), Positives = 59/106 (55%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+KK +A F G GA NG+ P+G + + EVWTG+ + L+A ML G E
Sbjct: 692 LKKIYQACFLKFHNGKYGAANGVLPDGSPVNPNDTHPLEVWTGINFGLAAFMLQMGMKAE 751
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
A+ + + VYE GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 752 AFQLTETVVKQVYE-NGLQFRTPEAITAVGTFRASHYLRAMAIWGI 796
>gi|428770169|ref|YP_007161959.1| glucosylceramidase [Cyanobacterium aponinum PCC 10605]
gi|428684448|gb|AFZ53915.1| Glucosylceramidase [Cyanobacterium aponinum PCC 10605]
Length = 802
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 39/106 (36%), Positives = 56/106 (52%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
++K A F G G NG+KP+G + EVWTG+ Y + A M+ G E
Sbjct: 691 LRKVYDACFLKFHDGKFGIANGVKPDGKPVKENDTHPLEVWTGINYGIVAFMILNGMKQE 750
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
A + + VYE GL F TPE +T T+R+ Y+RA+AV+A+
Sbjct: 751 GLRVAETVVKQVYE-NGLQFRTPEAITAVGTFRASHYLRAMAVWAI 795
>gi|75907319|ref|YP_321615.1| hypothetical protein Ava_1096 [Anabaena variabilis ATCC 29413]
gi|75701044|gb|ABA20720.1| Protein of unknown function DUF608 [Anabaena variabilis ATCC 29413]
Length = 804
Score = 66.2 bits (160), Expect = 3e-09, Method: Composition-based stats.
Identities = 34/95 (35%), Positives = 54/95 (56%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG++P+G + A EVWTG+ + L+A ++ G DE + +
Sbjct: 706 FYDGQFGAANGVRPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKDEGFRLTQAVVAQ 765
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+Y GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 766 IYN-NGLQFRTPEAITAAGTFRASTYLRAMAIWAI 799
>gi|427709805|ref|YP_007052182.1| glucosylceramidase [Nostoc sp. PCC 7107]
gi|427362310|gb|AFY45032.1| Glucosylceramidase [Nostoc sp. PCC 7107]
Length = 802
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 35/95 (36%), Positives = 53/95 (55%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F+ G GA NG+ P+G + A EVWTG+ + L+A ++ G DEA +
Sbjct: 703 FQNGEFGAANGVLPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMQDEALRLTQAVVEQ 762
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 763 VYN-NGLQFRTPEAITATGTFRASTYLRAMAIWGI 796
>gi|162452686|ref|YP_001615053.1| hypothetical protein sce4410 [Sorangium cellulosum So ce56]
gi|161163268|emb|CAN94573.1| hypothetical protein sce4410 [Sorangium cellulosum So ce56]
Length = 782
Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 54/98 (55%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
F+ G+ G VN M NG D ++ Q+ E W G ++ + A M+ EG +A L T+
Sbjct: 681 FDGGTRGVVNVMTANGSVDGTSNQTRECWVGTSWGVVAGMIQEGLAAQAGEIGASLVDTI 740
Query: 72 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAY 109
++ GL F TPE G+ + R+ YMRA ++A + AY
Sbjct: 741 WKTDGLWFRTPEAWEGNGSIRAPYYMRATTLWAAKHAY 778
>gi|254431466|ref|ZP_05045169.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
gi|197625919|gb|EDY38478.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
Length = 837
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 37/100 (37%), Positives = 54/100 (54%), Gaps = 2/100 (2%)
Query: 7 ASARGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 65
A GF+ GS+G NG++ +G D EVWTG+ + L+A G D A G
Sbjct: 720 ACFEGFQGGSLGVANGLRRDGTPLDPDGTHPLEVWTGINFGLAAYYRLMGETDTALAITG 779
Query: 66 GLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+ VY GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 780 AVVHQVYG-GGLQFRTPEAITAVNTFRACHYLRAMAIWAL 818
>gi|357139475|ref|XP_003571307.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
distachyon]
Length = 946
Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 51/84 (60%), Gaps = 4/84 (4%)
Query: 27 GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPEG 84
G D S+ QS+EVWTG TY ++AAM+ EG +E +T A G Y + G G F+ PE
Sbjct: 792 GGVDESSTQSKEVWTGTTYAVAAAMIGEGMREEGFTAAKGAYGAGWGEDGYGYAFQMPES 851
Query: 85 LTGDKT--YRSGGYMRALAVYAMQ 106
T D YRS YMR LAV+AMQ
Sbjct: 852 WTADGAGGYRSLHYMRPLAVWAMQ 875
>gi|186681980|ref|YP_001865176.1| hypothetical protein Npun_R1544 [Nostoc punctiforme PCC 73102]
gi|186464432|gb|ACC80233.1| protein of unknown function DUF608 [Nostoc punctiforme PCC 73102]
Length = 801
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 34/95 (35%), Positives = 54/95 (56%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G G NG++P+G + A EVWTG+ + L+A ++ G DEA + +
Sbjct: 703 FCNGEFGTANGVRPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKDEALRLTQAVVQQ 762
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+YE GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 763 IYE-NGLQFRTPEAITAAGTFRASTYLRAMAIWGI 796
>gi|411117353|ref|ZP_11389840.1| putative bile acid beta-glucosidase [Oscillatoriales cyanobacterium
JSC-12]
gi|410713456|gb|EKQ70957.1| putative bile acid beta-glucosidase [Oscillatoriales cyanobacterium
JSC-12]
Length = 907
Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats.
Identities = 34/90 (37%), Positives = 55/90 (61%), Gaps = 2/90 (2%)
Query: 17 MGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERT 75
+GA NG+KP+G + ++ EVWTG+ + L+A ++ G DEA + + +Y
Sbjct: 810 IGAANGVKPDGSPENPNSTHPLEVWTGINFGLAAFLVQMGMRDEAMELTQAVVQQIYT-N 868
Query: 76 GLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
GL F TPE +T T+R+ Y+RALA++A+
Sbjct: 869 GLQFRTPEAITPSGTFRACHYLRALAIWAI 898
>gi|427727368|ref|YP_007073605.1| putative bile acid beta-glucosidase [Nostoc sp. PCC 7524]
gi|427363287|gb|AFY46008.1| putative bile acid beta-glucosidase [Nostoc sp. PCC 7524]
Length = 803
Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats.
Identities = 34/95 (35%), Positives = 54/95 (56%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG++P+G + A EVWTG+ + L+A ++ G DEA + +
Sbjct: 702 FYNGQFGAANGVRPDGSPENPQATHPLEVWTGINFGLAAFLVQMGMKDEALRLTEAVVQQ 761
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+Y GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 762 IYS-NGLQFRTPEAITPSGTFRASTYLRAMAIWGV 795
>gi|222639789|gb|EEE67921.1| hypothetical protein OsJ_25789 [Oryza sativa Japonica Group]
Length = 919
Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 59/100 (59%), Gaps = 15/100 (15%)
Query: 22 GMKP--NGDRDRSAI-----------QSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 68
G++P +GD+ R A+ S+EV GVTY ++AAM++EG + A+ TA G++
Sbjct: 756 GLEPIVDGDKARRALATVLDYNVMRTHSKEVCPGVTYAVAAAMIHEGMPEAAFKTAKGIH 815
Query: 69 RTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 106
+ + G G F+TPE T D YR+ YMR L V+AMQ
Sbjct: 816 DAGWGKHGFGYAFQTPESWTADGGYRALHYMRPLGVWAMQ 855
>gi|7470943|pir||T31071 conserved hypothetical protein - Anabaena sp. (fragment)
gi|2828799|gb|AAC38131.1|AAC38131 unknown [Nostoc sp. PCC 7120]
Length = 439
Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats.
Identities = 33/95 (34%), Positives = 53/95 (55%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG++P+G + A EVWTG+ + L+A ++ G DE + +
Sbjct: 341 FYDGQFGAANGVRPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKDEGFRLTQAVVEQ 400
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+Y GL F TPE +T T+R+ Y+R +A++A+
Sbjct: 401 IYN-NGLQFRTPEAITAAGTFRASTYLRPMAIWAI 434
>gi|113477064|ref|YP_723125.1| hypothetical protein Tery_3568 [Trichodesmium erythraeum IMS101]
gi|110168112|gb|ABG52652.1| protein of unknown function DUF608 [Trichodesmium erythraeum
IMS101]
Length = 816
Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 58/106 (54%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+K + + F G GA NG+ P+G + +A EVWTG+ + L+A M+ G E
Sbjct: 704 LKTVYNSCFKNFHNGKFGAANGVLPDGSPENPNATHPLEVWTGINFGLAAFMVQIGMKKE 763
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
A + +YE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 764 ALEITEVVVGQIYE-NGLQFRTPEAITVMGTFRASHYLRAMAIWAI 808
>gi|428317101|ref|YP_007114983.1| Glucosylceramidase [Oscillatoria nigro-viridis PCC 7112]
gi|428240781|gb|AFZ06567.1| Glucosylceramidase [Oscillatoria nigro-viridis PCC 7112]
Length = 809
Score = 65.5 bits (158), Expect = 5e-09, Method: Composition-based stats.
Identities = 34/95 (35%), Positives = 56/95 (58%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG+ NG + +A EVWTG+ + L+A ++ G ++A+ + +
Sbjct: 708 FNDGEFGAANGVMLNGSPENPNATHPLEVWTGINFGLAAFLVQMGMEEKAFKLTDAVVKQ 767
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+YE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 768 IYE-NGLQFRTPEAITAGGTFRASHYLRAMAIWAI 801
>gi|434407175|ref|YP_007150060.1| putative bile acid beta-glucosidase [Cylindrospermum stagnale PCC
7417]
gi|428261430|gb|AFZ27380.1| putative bile acid beta-glucosidase [Cylindrospermum stagnale PCC
7417]
Length = 816
Score = 65.1 bits (157), Expect = 5e-09, Method: Composition-based stats.
Identities = 34/95 (35%), Positives = 55/95 (57%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG++P+G + +A EVWTG+ + L+A ++ G DEA +
Sbjct: 714 FFDGQFGAANGVRPDGVPENPNATHPLEVWTGINFGLAAFLVQMGMQDEALRLTQAVVSQ 773
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+Y+ GL F TPE +T T+R+ Y+R +A++A+
Sbjct: 774 IYD-NGLQFRTPEAITTTGTFRASTYLRPMAIWAI 807
>gi|17230308|ref|NP_486856.1| hypothetical protein alr2816 [Nostoc sp. PCC 7120]
gi|17131910|dbj|BAB74515.1| alr2816 [Nostoc sp. PCC 7120]
Length = 804
Score = 64.7 bits (156), Expect = 7e-09, Method: Composition-based stats.
Identities = 33/95 (34%), Positives = 53/95 (55%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG++P+G + A EVWTG+ + L+A ++ G DE + +
Sbjct: 706 FYDGQFGAANGVRPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKDEGFRLTQAVVEQ 765
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+Y GL F TPE +T T+R+ Y+R +A++A+
Sbjct: 766 IYN-NGLQFRTPEAITAAGTFRASTYLRPMAIWAI 799
>gi|148242848|ref|YP_001228005.1| hypothetical protein SynRCC307_1749 [Synechococcus sp. RCC307]
gi|147851158|emb|CAK28652.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
Length = 220
Score = 64.3 bits (155), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 55/106 (51%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
++ A GF G +G NG++ +G D + EVWTG+ + L+A G +
Sbjct: 99 LQAIKEACFEGFAGGRLGVANGLRRDGTPLDPNGTHPLEVWTGINFGLAAYYRLMGETNT 158
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
A + VY GL F TPE LTG T+R+ Y+RA+A++A+
Sbjct: 159 AVAITQAVVNQVYA-GGLQFRTPEALTGQNTFRACHYLRAMAIWAL 203
>gi|224028315|gb|ACN33233.1| unknown [Zea mays]
Length = 384
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 42/71 (59%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM P G D + +QS E+WTGVTY ++A ML G + +TTA G++ +
Sbjct: 296 GRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLLHGMEHQGFTTAEGIFTAGWSE 355
Query: 75 TGLGFETPEGL 85
G G+ L
Sbjct: 356 EGYGYALSHTL 366
>gi|414871105|tpg|DAA49662.1| TPA: hypothetical protein ZEAMMB73_422318 [Zea mays]
Length = 589
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/65 (46%), Positives = 41/65 (63%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
G MGAVNGM P G D + +QS E+WTGVTY ++A ML G + +TTA G++ +
Sbjct: 501 GRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLLHGMEHQGFTTAEGIFTAGWSE 560
Query: 75 TGLGF 79
G G+
Sbjct: 561 EGYGY 565
>gi|254423918|ref|ZP_05037636.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
gi|196191407|gb|EDX86371.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
Length = 828
Score = 63.5 bits (153), Expect = 1e-08, Method: Composition-based stats.
Identities = 33/90 (36%), Positives = 54/90 (60%), Gaps = 2/90 (2%)
Query: 17 MGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERT 75
+G NG++P+G ++ EVWTG+ + L+A + +EG + EA + R VYE
Sbjct: 733 VGCANGVRPDGSPQNPDDTHQLEVWTGINFGLAAFLAHEGKLHEAMEITEAVVRQVYEH- 791
Query: 76 GLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
GL F TPE +T T+R+ Y+R +A++A+
Sbjct: 792 GLQFRTPEAITAVGTFRACHYLRPMAIWAV 821
>gi|427733995|ref|YP_007053539.1| putative bile acid beta-glucosidase [Rivularia sp. PCC 7116]
gi|427369036|gb|AFY52992.1| putative bile acid beta-glucosidase [Rivularia sp. PCC 7116]
Length = 799
Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats.
Identities = 32/95 (33%), Positives = 55/95 (57%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G +GA NG++ +G + A EVWTG+ + ++A ++ G EA+ +
Sbjct: 702 FNQGKLGAANGLRLDGSPENPDATHPLEVWTGINFGIAAFLVQMGMKSEAFKLTEAVVNQ 761
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+YE GL F TPE +T + +R+ Y+RA+A++A+
Sbjct: 762 IYEH-GLQFRTPEAITATRNFRACVYLRAMAIWAI 795
>gi|431902810|gb|ELK09025.1| Non-lysosomal glucosylceramidase [Pteropus alecto]
Length = 970
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 27/54 (50%), Positives = 40/54 (74%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 61
+ + F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ E N+ ++
Sbjct: 757 NVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQETNLMSSY 810
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 20/38 (52%), Positives = 27/38 (71%)
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
YRTV+ER GL F+TPE + +RS YMR L+++AM
Sbjct: 891 YRTVWERLGLAFQTPEAYCQHQVFRSLAYMRPLSIWAM 928
>gi|16330256|ref|NP_440984.1| hypothetical protein sll1775 [Synechocystis sp. PCC 6803]
gi|383321997|ref|YP_005382850.1| hypothetical protein SYNGTI_1088 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383325166|ref|YP_005386019.1| hypothetical protein SYNPCCP_1087 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383491050|ref|YP_005408726.1| hypothetical protein SYNPCCN_1087 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384436317|ref|YP_005651041.1| hypothetical protein SYNGTS_1088 [Synechocystis sp. PCC 6803]
gi|451814414|ref|YP_007450866.1| hypothetical protein MYO_110980 [Synechocystis sp. PCC 6803]
gi|1652745|dbj|BAA17664.1| sll1775 [Synechocystis sp. PCC 6803]
gi|339273349|dbj|BAK49836.1| hypothetical protein SYNGTS_1088 [Synechocystis sp. PCC 6803]
gi|359271316|dbj|BAL28835.1| hypothetical protein SYNGTI_1088 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359274486|dbj|BAL32004.1| hypothetical protein SYNPCCN_1087 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359277656|dbj|BAL35173.1| hypothetical protein SYNPCCP_1087 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407958168|dbj|BAM51408.1| hypothetical protein BEST7613_2477 [Bacillus subtilis BEST7613]
gi|451780383|gb|AGF51352.1| hypothetical protein MYO_110980 [Synechocystis sp. PCC 6803]
Length = 817
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 34/95 (35%), Positives = 55/95 (57%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G +GA NG+ P+G ++ + EVWTG+ + L+A ++ G +EA +
Sbjct: 699 FAHGELGAANGLLPDGSPQNPNDTHPLEVWTGINFGLAAFLIQMGMKEEALKMTETVVNQ 758
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
VY GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 759 VYN-NGLQFRTPEAITAVGTFRASHYLRAMAIWAV 792
>gi|81299663|ref|YP_399871.1| hypothetical protein Synpcc7942_0854 [Synechococcus elongatus PCC
7942]
gi|81168544|gb|ABB56884.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 798
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 33/100 (33%), Positives = 55/100 (55%), Gaps = 2/100 (2%)
Query: 7 ASARGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 65
A + F +G G NG+ P+G D EVWTG+ + ++A L G+ D +
Sbjct: 685 ACFQKFHSGQFGLANGLLPDGSPVDPKGTHPLEVWTGINFGIAAYWLLLGHRDRCFEVTE 744
Query: 66 GLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+ R +Y+ GL F TPE +T + T+R+ Y+R +A++A+
Sbjct: 745 TVIRQIYD-NGLQFRTPEAITANATFRASHYLRPMAIWAV 783
>gi|116072378|ref|ZP_01469645.1| hypothetical protein BL107_11341 [Synechococcus sp. BL107]
gi|116064900|gb|EAU70659.1| hypothetical protein BL107_11341 [Synechococcus sp. BL107]
Length = 820
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 33/95 (34%), Positives = 53/95 (55%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
FEAG++G NG++ +G D + EVWTG+ + +++ G D A G +
Sbjct: 711 FEAGTLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYFRLMGQSDTALAITGAVVNQ 770
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 771 VYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 804
>gi|282900195|ref|ZP_06308151.1| protein of unknown function DUF608 [Cylindrospermopsis raciborskii
CS-505]
gi|281194944|gb|EFA69885.1| protein of unknown function DUF608 [Cylindrospermopsis raciborskii
CS-505]
Length = 789
Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats.
Identities = 34/95 (35%), Positives = 53/95 (55%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F+ G GA NG+ +G + A EVWTG+ + L+A +L G ++ + R
Sbjct: 691 FQDGKFGAANGVLLDGSPENPQATHPLEVWTGINFGLAAFLLQMGMKEQGLRLTEAVVRQ 750
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
VYE GL F TPE +T T+R+ Y+R +A++A+
Sbjct: 751 VYEH-GLQFRTPEAITAAGTFRASTYLRPMAIWAV 784
>gi|78212458|ref|YP_381237.1| hypothetical protein Syncc9605_0920 [Synechococcus sp. CC9605]
gi|78196917|gb|ABB34682.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 832
Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 34/106 (32%), Positives = 57/106 (53%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+K A F+ GS+G NG++ +G D +A EVWTG+ + +++ G+
Sbjct: 712 LKAVKEACFEAFDGGSLGVANGLRRDGTPLDPNATHPLEVWTGINFGIASYYRLMGDKQT 771
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
A + + VY +GL F TPE +T TYR+ Y+RA+A++ +
Sbjct: 772 AQSICSAVVEQVYS-SGLQFRTPEAITAVNTYRACHYLRAMAIWGL 816
>gi|428307493|ref|YP_007144318.1| glucosylceramidase [Crinalium epipsammum PCC 9333]
gi|428249028|gb|AFZ14808.1| Glucosylceramidase [Crinalium epipsammum PCC 9333]
Length = 811
Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 33/90 (36%), Positives = 53/90 (58%), Gaps = 2/90 (2%)
Query: 17 MGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERT 75
+GA NG+ +G + A EVWTG+ + L+A M+ G DEA+ + + +Y
Sbjct: 718 IGAANGVLADGSPENPQATHPLEVWTGINFGLAAFMIQMGMKDEAFQLTEAVVQQIYN-N 776
Query: 76 GLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 777 GLQFRTPEAITAAGTFRASHYLRAMAIWAI 806
>gi|300866990|ref|ZP_07111661.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300335025|emb|CBN56827.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 863
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 33/95 (34%), Positives = 54/95 (56%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGDRDR-SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG+ +G ++ A EVWTG+ + L+A M+ G ++A + +
Sbjct: 761 FNQGEFGAANGVMLDGSPEKPGATHPLEVWTGINFGLAAFMMQMGMKEKALELTEVVVKQ 820
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+Y GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 821 IYA-NGLQFRTPEAITATGTFRASHYLRAMAIWAI 854
>gi|33862646|ref|NP_894206.1| hypothetical protein PMT0373 [Prochlorococcus marinus str. MIT
9313]
gi|33634562|emb|CAE20548.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313]
Length = 837
Score = 62.0 bits (149), Expect = 5e-08, Method: Composition-based stats.
Identities = 37/100 (37%), Positives = 52/100 (52%), Gaps = 2/100 (2%)
Query: 7 ASARGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 65
A GFE G +G NG+ +G D EVWTG+ + L+A G+ A
Sbjct: 719 ACFEGFEGGRLGVANGLCRDGMPLDPKGTHPLEVWTGINFGLAAYYRLMGDATTATAICS 778
Query: 66 GLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+ VY GL F TPE +T KTYR+ Y+RA+A++A+
Sbjct: 779 AVVNQVYG-GGLQFRTPEAITAVKTYRACHYLRAMAIWAL 817
>gi|327288418|ref|XP_003228923.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Anolis
carolinensis]
Length = 667
Score = 62.0 bits (149), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 35/46 (76%)
Query: 11 GFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 56
GF G+MGAVNGM+P+G D S++QS+EVW GV Y L+A M+ EG
Sbjct: 591 GFSEGAMGAVNGMRPDGVPDTSSVQSDEVWVGVVYSLAATMIQEGQ 636
>gi|298490121|ref|YP_003720298.1| glucosylceramidase ['Nostoc azollae' 0708]
gi|298232039|gb|ADI63175.1| Glucosylceramidase ['Nostoc azollae' 0708]
Length = 806
Score = 62.0 bits (149), Expect = 5e-08, Method: Composition-based stats.
Identities = 33/95 (34%), Positives = 53/95 (55%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG+ P+G + A EVWTG+ + L+A ++ G +EA +
Sbjct: 708 FYDGKFGAANGVLPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKNEAMQLTQAVVEQ 767
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+Y+ GL F TPE +T T+R+ Y+R +A++A+
Sbjct: 768 IYD-NGLQFRTPEAITPVGTFRASTYLRPMAIWAI 801
>gi|428210868|ref|YP_007084012.1| putative bile acid beta-glucosidase [Oscillatoria acuminata PCC
6304]
gi|427999249|gb|AFY80092.1| putative bile acid beta-glucosidase [Oscillatoria acuminata PCC
6304]
Length = 820
Score = 61.6 bits (148), Expect = 5e-08, Method: Composition-based stats.
Identities = 39/106 (36%), Positives = 57/106 (53%), Gaps = 5/106 (4%)
Query: 12 FEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G GA NG+ +G + A EVWTG+ + L+A +L G +E + + +
Sbjct: 716 FHNGEFGAANGVLIDGSPVNPDATHPLEVWTGINFGLAAFLLQMGMKEEGFKLTETVIQQ 775
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRAL---AVYAMQDAYLKGK 113
VYE GL F TPE +T T+R+ Y+RA+ AVY M + +GK
Sbjct: 776 VYE-NGLQFRTPEAITAVGTFRASHYLRAMAIWAVYGMFTGFKEGK 820
>gi|124023625|ref|YP_001017932.1| bile acid beta-glucosidase [Prochlorococcus marinus str. MIT 9303]
gi|123963911|gb|ABM78667.1| Predicted bile acid beta-glucosidase [Prochlorococcus marinus str.
MIT 9303]
Length = 837
Score = 61.6 bits (148), Expect = 6e-08, Method: Composition-based stats.
Identities = 37/100 (37%), Positives = 52/100 (52%), Gaps = 2/100 (2%)
Query: 7 ASARGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 65
A GFE G +G NG+ +G D EVWTG+ + L+A G+ A
Sbjct: 719 ACFEGFEGGRLGVANGLCRDGMPLDPKGTHPLEVWTGINFGLAAYYRLMGDATTATAICS 778
Query: 66 GLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+ VY GL F TPE +T KTYR+ Y+RA+A++A+
Sbjct: 779 AVVNQVYG-GGLQFRTPEAITAVKTYRACHYLRAMAIWAL 817
>gi|78185056|ref|YP_377491.1| hypothetical protein Syncc9902_1489 [Synechococcus sp. CC9902]
gi|78169350|gb|ABB26447.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 823
Score = 61.2 bits (147), Expect = 7e-08, Method: Composition-based stats.
Identities = 32/95 (33%), Positives = 52/95 (54%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
FE G++G NG++ +G D + EVWTG+ + +++ G D A G +
Sbjct: 714 FEGGTLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYFRLMGQSDTALAITGAVVNQ 773
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 774 VYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 807
>gi|434389621|ref|YP_007100232.1| putative bile acid beta-glucosidase [Chamaesiphon minutus PCC 6605]
gi|428020611|gb|AFY96705.1| putative bile acid beta-glucosidase [Chamaesiphon minutus PCC 6605]
Length = 818
Score = 61.2 bits (147), Expect = 8e-08, Method: Composition-based stats.
Identities = 34/98 (34%), Positives = 55/98 (56%), Gaps = 8/98 (8%)
Query: 12 FEAGSMGAVNGMK----PNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
F+ G GA NG+K P+ +D + EVWTG+ + ++A ++ G +EA A +
Sbjct: 719 FQGGKFGAANGVKIDGSPSNPKDTHPL---EVWTGINFGIAAFLVQMGMQEEALKLAETI 775
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 776 VEQVYS-NGLQFRTPEAITSAGTFRASHYLRAMAIWGI 812
>gi|148670515|gb|EDL02462.1| glucosidase beta 2, isoform CRA_b [Mus musculus]
Length = 377
Score = 61.2 bits (147), Expect = 8e-08, Method: Composition-based stats.
Identities = 26/47 (55%), Positives = 35/47 (74%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYE 54
+ + F G+MGAVNGM P+G DRS++QS+EVW GV Y L+A M+ E
Sbjct: 280 NVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQE 326
>gi|366162568|ref|ZP_09462323.1| hypothetical protein AcelC_02756 [Acetivibrio cellulolyticus CD2]
Length = 842
Score = 60.8 bits (146), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 53/101 (52%), Gaps = 1/101 (0%)
Query: 10 RGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYR 69
+ F+ G G VN MK +G+ D S Q+ E W G +Y + A M+ EG EA L
Sbjct: 653 KKFDNGLHGFVNIMKADGNVDMSHAQTAEAWVGTSYGVIAGMIQEGLETEASQVGSNLAD 712
Query: 70 TVYERTGLGFETPEGL-TGDKTYRSGGYMRALAVYAMQDAY 109
TV++ + F TPE G R+ YMRA ++A++ AY
Sbjct: 713 TVWKTNDMWFRTPEAWRQGVSEVRAPYYMRANCIWAVKHAY 753
>gi|47196687|emb|CAF89164.1| unnamed protein product [Tetraodon nigroviridis]
Length = 239
Score = 60.5 bits (145), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 27/54 (50%), Positives = 36/54 (66%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYE 54
+K + F G MGAVNGM+P G DRS++QS+EVW GV Y L+A M++E
Sbjct: 185 LKSIFDLNVMSFAGGQMGAVNGMRPEGVPDRSSVQSDEVWIGVVYGLAATMIHE 238
>gi|443311911|ref|ZP_21041533.1| putative bile acid beta-glucosidase [Synechocystis sp. PCC 7509]
gi|442777986|gb|ELR88257.1| putative bile acid beta-glucosidase [Synechocystis sp. PCC 7509]
Length = 804
Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats.
Identities = 33/95 (34%), Positives = 53/95 (55%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
F G+ GA NG+K +G + A EVWTG+ + L+A +L +EA+ +
Sbjct: 703 FHNGAFGAANGLKIDGSPENPDATHPLEVWTGINFGLAAFLLQMDMKEEAFKLTQTVVEQ 762
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+Y GL F TPE +T T+R+ Y+R +A++A+
Sbjct: 763 IYN-NGLQFRTPEAITAVGTFRACHYLRPMAIWAV 796
>gi|56750695|ref|YP_171396.1| hypothetical protein syc0686_c [Synechococcus elongatus PCC 6301]
gi|56685654|dbj|BAD78876.1| hypothetical protein [Synechococcus elongatus PCC 6301]
Length = 798
Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats.
Identities = 32/100 (32%), Positives = 54/100 (54%), Gaps = 2/100 (2%)
Query: 7 ASARGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 65
A + F +G G NG+ P+G EVWTG+ + ++A L G+ D +
Sbjct: 685 ACFQKFHSGQFGLANGLLPDGSPVGPKGTHPLEVWTGINFGIAAYWLLLGHRDRCFEVTE 744
Query: 66 GLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+ R +Y+ GL F TPE +T + T+R+ Y+R +A++A+
Sbjct: 745 TVIRQIYD-NGLQFRTPEAITANATFRASHYLRPMAIWAV 783
>gi|37522029|ref|NP_925406.1| hypothetical protein gll2460 [Gloeobacter violaceus PCC 7421]
gi|35213028|dbj|BAC90401.1| gll2460 [Gloeobacter violaceus PCC 7421]
Length = 867
Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/95 (36%), Positives = 48/95 (50%), Gaps = 1/95 (1%)
Query: 12 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 71
F G G NG +G EVWTG+ + L+A M+ G E A + V
Sbjct: 757 FYGGRFGCANGTNADGSFIGDTEHPSEVWTGINFGLAAFMIRNGMRREGMAIAEAVVANV 816
Query: 72 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
Y GL F TPE LT +T+R+ Y+R +A++AMQ
Sbjct: 817 YG-GGLQFRTPEALTPARTFRACMYLRPMAIWAMQ 850
>gi|87301122|ref|ZP_01083963.1| hypothetical protein WH5701_14591 [Synechococcus sp. WH 5701]
gi|87284090|gb|EAQ76043.1| hypothetical protein WH5701_14591 [Synechococcus sp. WH 5701]
Length = 830
Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats.
Identities = 36/106 (33%), Positives = 53/106 (50%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+K A FE G +G NG++ +G D EVWTG+ + L+A G D
Sbjct: 709 LKAIREACFERFEGGRLGVANGLRRDGTPLDPKGTHPLEVWTGINFGLAAYYRLMGQTDT 768
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
A + VY GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 769 ALAITSAVVGQVYA-GGLQFRTPEAITAVGTFRACHYLRAMAIWAL 813
>gi|119578751|gb|EAW58347.1| glucosidase, beta (bile acid) 2, isoform CRA_c [Homo sapiens]
gi|168270622|dbj|BAG10104.1| non-lysosomal glucosylceramidase [synthetic construct]
Length = 877
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 25/47 (53%), Positives = 36/47 (76%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYE 54
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ E
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 835
>gi|318042148|ref|ZP_07974104.1| hypothetical protein SCB01_10580 [Synechococcus sp. CB0101]
Length = 836
Score = 59.3 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 33/106 (31%), Positives = 54/106 (50%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+K A FE G +G NG++ +G D + EVWTG+ + +++ G+ D
Sbjct: 716 LKAVKEACFEQFEGGKLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGDTDT 775
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
A + VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 776 ALAITAAVVNQVYA-GGLQFRTPEAITAVNTFRACHYLRAMAIWGV 820
>gi|288901049|gb|ADC68237.1| hypothetical protein [Dendrobium nobile]
Length = 207
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 33/46 (71%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 60
G +GAVNGM PNG D S +QS E+WTGVTY LSA M+ G ++A
Sbjct: 161 GRIGAVNGMHPNGKVDESCMQSREIWTGVTYALSATMILAGMEEQA 206
>gi|10047285|dbj|BAB13431.1| KIAA1605 protein [Homo sapiens]
Length = 922
Score = 58.9 bits (141), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 36/47 (76%)
Query: 8 SARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYE 54
+ + F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ E
Sbjct: 834 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 880
>gi|443318654|ref|ZP_21047901.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 6406]
gi|442781756|gb|ELR91849.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 6406]
Length = 857
Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 2/90 (2%)
Query: 17 MGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERT 75
+GA NG+ P+G D EVWTG+ + L+ G VD+A+ + + +Y
Sbjct: 754 IGAANGVLPDGSPEDPDGTHQLEVWTGINFGLATFFAQMGQVDKAFAITEAVVQQIYTH- 812
Query: 76 GLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
GL F TPE +T T+R+ Y+R LA++ +
Sbjct: 813 GLQFRTPEAITALGTFRACHYLRPLAIWGL 842
>gi|255073149|ref|XP_002500249.1| hypothetical protein MICPUN_55788 [Micromonas sp. RCC299]
gi|226515511|gb|ACO61507.1| hypothetical protein MICPUN_55788 [Micromonas sp. RCC299]
Length = 954
Score = 58.2 bits (139), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 46/83 (55%), Gaps = 1/83 (1%)
Query: 26 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGL 85
+G D + S EVW GV+Y +++ ML G DEAW A G+ YER GL F TPE
Sbjct: 849 SGSVDAVSAHSNEVWPGVSYAVASHMLLAGMDDEAWELARGVAAGTYER-GLAFRTPEAW 907
Query: 86 TGDKTYRSGGYMRALAVYAMQDA 108
+RS RA +V+A++ A
Sbjct: 908 DARGGFRSAMSQRAGSVWAIEHA 930
>gi|317969095|ref|ZP_07970485.1| hypothetical protein SCB02_06140 [Synechococcus sp. CB0205]
Length = 837
Score = 57.8 bits (138), Expect = 8e-07, Method: Composition-based stats.
Identities = 31/95 (32%), Positives = 51/95 (53%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
FE GS+G NG++ +G D + EVWTG+ + +++ G + A +
Sbjct: 728 FEGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYFRLMGETETALAITSAVVNQ 787
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 788 VYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWGV 821
>gi|320105716|ref|YP_004181306.1| glucosylceramidase [Terriglobus saanensis SP1PR4]
gi|319924237|gb|ADV81312.1| Glucosylceramidase [Terriglobus saanensis SP1PR4]
Length = 794
Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 54/106 (50%), Gaps = 1/106 (0%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 60
+KK + F G++GAVNGM +G + +++ EVW G T +A + G +E+
Sbjct: 684 LKKIYAFNVNQFGDGNLGAVNGMNADGTVVDN-VEAREVWAGTTLGYAALLKSLGIEEES 742
Query: 61 WTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
+ GL+ +YE G F TPE +R+ YMR ++ M+
Sbjct: 743 YHVVHGLFHVIYESKGYWFRTPEAWDVTGNFRASMYMRPAGLWGME 788
>gi|260434763|ref|ZP_05788733.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
gi|260412637|gb|EEX05933.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
Length = 832
Score = 56.2 bits (134), Expect = 2e-06, Method: Composition-based stats.
Identities = 32/105 (30%), Positives = 53/105 (50%), Gaps = 2/105 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+K A F+ GS+G NG++ +G D + EVWTG+ + +++ G+
Sbjct: 712 LKVVKEACFEAFDGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGDKQT 771
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYA 104
A + VY GL F TPE +T T+R+ Y+RA+A++
Sbjct: 772 AQAICSAVVEQVYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWG 815
>gi|427419169|ref|ZP_18909352.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 7375]
gi|425761882|gb|EKV02735.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 7375]
Length = 835
Score = 56.2 bits (134), Expect = 3e-06, Method: Composition-based stats.
Identities = 29/91 (31%), Positives = 51/91 (56%), Gaps = 2/91 (2%)
Query: 16 SMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 74
++GA NG+KP+G + EVWTG+ + ++A + G D+A + + +Y
Sbjct: 742 AIGAANGVKPDGSPENPDDTHQLEVWTGINFGVAAFLAQMGKRDQAMAITEAVVQQIYN- 800
Query: 75 TGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
GL F TPE +T T+R+ Y+R +A++ +
Sbjct: 801 YGLQFRTPEAITAMGTFRACHYLRPMAIWGL 831
>gi|33866127|ref|NP_897686.1| hypothetical protein SYNW1593 [Synechococcus sp. WH 8102]
gi|33639102|emb|CAE08108.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 833
Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats.
Identities = 31/95 (32%), Positives = 50/95 (52%), Gaps = 2/95 (2%)
Query: 12 FEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRT 70
FE+G +G NG++ +G D + EVWTG+ + L++ G A +
Sbjct: 724 FESGRLGVANGLRRDGSPLDPNGTHPLEVWTGINFGLASYYRLMGESKTALAICSAVVGQ 783
Query: 71 VYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 784 VYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 817
>gi|284929778|ref|YP_003422300.1| putative bile acid beta-glucosidase [cyanobacterium UCYN-A]
gi|284810222|gb|ADB95919.1| predicted bile acid beta-glucosidase [cyanobacterium UCYN-A]
Length = 803
Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats.
Identities = 31/106 (29%), Positives = 56/106 (52%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+ K A F G GA NG++ +G + ++ +EVW G+ + L+A ++ G ++
Sbjct: 692 LSKIYEACFLNFYDGQYGAANGVRSDGTPENPNSTHPQEVWVGINFGLAAFLIQMGMKNQ 751
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
A + +Y GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 752 ALELTKVVVEQIYA-NGLQFRTPEAITPVGTFRASHYLRAMAIWGV 796
>gi|113953483|ref|YP_730032.1| hypothetical protein sync_0818 [Synechococcus sp. CC9311]
gi|113880834|gb|ABI45792.1| conserved hypothetical protein [Synechococcus sp. CC9311]
Length = 833
Score = 55.5 bits (132), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 58/108 (53%), Gaps = 6/108 (5%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSA--AMLYEGNV 57
+K A F GS+G NG++ +G D + EVWTG+ + +++ ++ EG
Sbjct: 713 LKAVKEACFDNFAGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGEGKT 772
Query: 58 DEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
EA +A + VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 773 AEAICSA--VVTQVYA-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 817
>gi|116073260|ref|ZP_01470522.1| hypothetical protein RS9916_32457 [Synechococcus sp. RS9916]
gi|116068565|gb|EAU74317.1| hypothetical protein RS9916_32457 [Synechococcus sp. RS9916]
Length = 832
Score = 55.5 bits (132), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 58/108 (53%), Gaps = 6/108 (5%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSA--AMLYEGNV 57
+K A F+ G +G NG++ +G D + EVWTG+ + +++ ++ EG
Sbjct: 712 LKAVREACFEKFDGGKLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGEGQT 771
Query: 58 DEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
EA +A + VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 772 AEAICSA--VVNQVYA-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 816
>gi|88809081|ref|ZP_01124590.1| hypothetical protein WH7805_05296 [Synechococcus sp. WH 7805]
gi|88787023|gb|EAR18181.1| hypothetical protein WH7805_05296 [Synechococcus sp. WH 7805]
Length = 849
Score = 55.1 bits (131), Expect = 6e-06, Method: Composition-based stats.
Identities = 33/97 (34%), Positives = 56/97 (57%), Gaps = 6/97 (6%)
Query: 12 FEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSA--AMLYEGNVDEAWTTAGGLY 68
F+ GS+G NG++ +G D + EVWTG+ + +++ ++ EG EA +A +
Sbjct: 739 FQGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYQLMGEGQTAEAICSA--VV 796
Query: 69 RTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 797 HQVYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 832
>gi|87125468|ref|ZP_01081313.1| hypothetical protein RS9917_01806 [Synechococcus sp. RS9917]
gi|86166768|gb|EAQ68030.1| hypothetical protein RS9917_01806 [Synechococcus sp. RS9917]
Length = 832
Score = 54.7 bits (130), Expect = 7e-06, Method: Composition-based stats.
Identities = 30/106 (28%), Positives = 52/106 (49%), Gaps = 2/106 (1%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 59
+K + F+ GS+G NG++ +G D EVWTG+ + +++ +
Sbjct: 712 LKAVKESCFEAFQGGSLGVANGLRRDGTPLDPQGTHPLEVWTGINFGIASYYRLMDDTST 771
Query: 60 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
A + VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 772 ALAITSAVVNQVYA-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 816
>gi|148240031|ref|YP_001225418.1| hypothetical protein SynWH7803_1695 [Synechococcus sp. WH 7803]
gi|147848570|emb|CAK24121.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
Length = 833
Score = 54.3 bits (129), Expect = 9e-06, Method: Composition-based stats.
Identities = 33/97 (34%), Positives = 56/97 (57%), Gaps = 6/97 (6%)
Query: 12 FEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSA--AMLYEGNVDEAWTTAGGLY 68
F+ GS+G NG++ +G D + EVWTG+ + +++ ++ EG EA +A +
Sbjct: 724 FQGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGEGQTAEAICSA--VV 781
Query: 69 RTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 782 DQVYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 817
>gi|352096901|ref|ZP_08957615.1| Glucosylceramidase [Synechococcus sp. WH 8016]
gi|351675485|gb|EHA58646.1| Glucosylceramidase [Synechococcus sp. WH 8016]
Length = 833
Score = 52.8 bits (125), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 57/108 (52%), Gaps = 6/108 (5%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSA--AMLYEGNV 57
+K A F G +G NG++ +G D + EVWTG+ + +++ ++ EG
Sbjct: 713 LKAVKEACFDNFAGGLLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGEGPT 772
Query: 58 DEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
EA +A + VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 773 AEAICSA--VVTQVYA-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 817
>gi|295829678|gb|ADG38508.1| AT3G24180-like protein [Neslia paniculata]
Length = 190
Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAML 52
G MGAVNGM P+G D + +QS E+WTGVTY +A M+
Sbjct: 153 GRMGAVNGMHPDGKVDETCMQSREIWTGVTYAAAATMI 190
>gi|295829670|gb|ADG38504.1| AT3G24180-like protein [Capsella grandiflora]
gi|295829672|gb|ADG38505.1| AT3G24180-like protein [Capsella grandiflora]
gi|295829674|gb|ADG38506.1| AT3G24180-like protein [Capsella grandiflora]
gi|295829676|gb|ADG38507.1| AT3G24180-like protein [Capsella grandiflora]
Length = 190
Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAML 52
G MGAVNGM P+G D + +QS E+WTGVTY +A M+
Sbjct: 153 GKMGAVNGMHPDGKVDDTCMQSREIWTGVTYAAAATMI 190
>gi|345291087|gb|AEN82035.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291089|gb|AEN82036.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291091|gb|AEN82037.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291093|gb|AEN82038.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291095|gb|AEN82039.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291097|gb|AEN82040.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291099|gb|AEN82041.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291101|gb|AEN82042.1| AT3G24180-like protein, partial [Capsella rubella]
Length = 191
Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 15 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAML 52
G MGAVNGM P+G D + +QS E+WTGVTY +A M+
Sbjct: 153 GKMGAVNGMHPDGKVDDTCMQSREIWTGVTYAAAATMI 190
>gi|303285310|ref|XP_003061945.1| hypothetical protein MICPUCDRAFT_48400 [Micromonas pusilla CCMP1545]
gi|226456356|gb|EEH53657.1| hypothetical protein MICPUCDRAFT_48400 [Micromonas pusilla CCMP1545]
Length = 1073
Score = 50.8 bits (120), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 49/99 (49%), Gaps = 11/99 (11%)
Query: 18 GAVNGMKPNGDRDRSAI----------QSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 67
GAVNG + + D +A+ Q+ E G +Y L++ ++ G DE W A G
Sbjct: 934 GAVNGARVDDATDAAALGVGAGDGIPAQARESRVGQSYALASHLILAGFSDEGWDVARGA 993
Query: 68 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 106
YR YE G F TPE ++ +R RA AV+A++
Sbjct: 994 YRVTYE-DGFAFRTPEIFDAERRFRGAISGRAGAVWAIE 1031
>gi|16183333|gb|AAL13692.1| GH26380p [Drosophila melanogaster]
Length = 391
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 26/62 (41%), Positives = 34/62 (54%), Gaps = 7/62 (11%)
Query: 1 MKKKSRASARGFEAGSMGAVNGMKPN-------GDRDRSAIQSEEVWTGVTYLLSAAMLY 53
+K+ + GF G++GA NG N G D S IQ+EEVW GV Y L+A M+
Sbjct: 309 LKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVDNSNIQAEEVWPGVVYALAATMIQ 368
Query: 54 EG 55
EG
Sbjct: 369 EG 370
>gi|297741933|emb|CBI33368.3| unnamed protein product [Vitis vinifera]
Length = 108
Score = 45.4 bits (106), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 34/53 (64%), Gaps = 2/53 (3%)
Query: 58 DEAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
++A+TTA G++ + G G F+TPEG T D +RS YMR LA++ MQ A
Sbjct: 3 EQAFTTAEGIFTAGWSEEGYGYWFQTPEGWTIDGHFRSLIYMRPLAIWGMQWA 55
>gi|261417086|ref|YP_003250769.1| glucosylceramidase [Fibrobacter succinogenes subsp. succinogenes
S85]
gi|385791890|ref|YP_005823013.1| hypothetical protein FSU_3272 [Fibrobacter succinogenes subsp.
succinogenes S85]
gi|261373542|gb|ACX76287.1| Glucosylceramidase [Fibrobacter succinogenes subsp. succinogenes
S85]
gi|302325406|gb|ADL24607.1| conserved domain protein [Fibrobacter succinogenes subsp.
succinogenes S85]
Length = 1047
Score = 44.7 bits (104), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 36/72 (50%), Gaps = 1/72 (1%)
Query: 17 MGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERT 75
+GA N ++ +G D Q+ +VW G+ Y + AM++ G EA A + +Y+
Sbjct: 880 IGAANLVRKDGSPLDEFNFQAHDVWIGIQYSIMTAMMFHGLEKEASVLADSMIGNLYDEA 939
Query: 76 GLGFETPEGLTG 87
+ F PEG G
Sbjct: 940 RVPFAAPEGFNG 951
>gi|402589209|gb|EJW83141.1| hypothetical protein WUBG_05945 [Wuchereria bancrofti]
Length = 90
Score = 42.7 bits (99), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 26/41 (63%)
Query: 11 GFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAM 51
F G +GAVNGM+ +G R +Q++E+W GVTY L +
Sbjct: 44 NFAGGRLGAVNGMRKDGTVGRRHLQADEMWVGVTYALQHHL 84
>gi|402583425|gb|EJW77369.1| hypothetical protein WUBG_11725, partial [Wuchereria bancrofti]
Length = 109
Score = 42.7 bits (99), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 26/41 (63%)
Query: 11 GFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAM 51
F G +GAVNGM+ +G R +Q++E+W GVTY L +
Sbjct: 21 NFAGGRLGAVNGMRKDGTVGRRHLQADEMWVGVTYALQHHL 61
>gi|302348409|ref|YP_003816047.1| trehalase [Acidilobus saccharovorans 345-15]
gi|302328821|gb|ADL19016.1| Putative trehalase [Acidilobus saccharovorans 345-15]
Length = 928
Score = 38.5 bits (88), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 48/90 (53%), Gaps = 17/90 (18%)
Query: 24 KPNGDR--DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERT---GLG 78
+PNG + +R + Q++ WTGV + +++ MLYEG V+EA + R++++R GL
Sbjct: 740 EPNGTKILNRVSSQADTPWTGVEFGVASQMLYEGMVEEAME----VLRSIHDRYRSWGLY 795
Query: 79 FETPEGLTGDKTYRSGGYMRALAVYAMQDA 108
F E G Y R LA A+ ++
Sbjct: 796 FNHLEC--------DGHYSRPLAALAIPNS 817
>gi|317480931|ref|ZP_07940011.1| hypothetical protein HMPREF1007_03130 [Bacteroides sp. 4_1_36]
gi|316902824|gb|EFV24698.1| hypothetical protein HMPREF1007_03130 [Bacteroides sp. 4_1_36]
Length = 851
Score = 37.0 bits (84), Expect = 1.5, Method: Composition-based stats.
Identities = 24/73 (32%), Positives = 36/73 (49%), Gaps = 10/73 (13%)
Query: 34 IQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRS 93
+ S EVWTG+ Y ++A +++EG V++ + RT +R P Y
Sbjct: 719 VYSNEVWTGIEYQVAAHLMFEGEVEKGLE----IVRTCRDRYNGRVRNPF-----NEYEC 769
Query: 94 GG-YMRALAVYAM 105
G Y RA+A YAM
Sbjct: 770 GAWYARAMASYAM 782
>gi|270294306|ref|ZP_06200508.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270275773|gb|EFA21633.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 851
Score = 37.0 bits (84), Expect = 1.5, Method: Composition-based stats.
Identities = 24/73 (32%), Positives = 36/73 (49%), Gaps = 10/73 (13%)
Query: 34 IQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRS 93
+ S EVWTG+ Y ++A +++EG V++ + RT +R P Y
Sbjct: 719 VYSNEVWTGIEYQVAAHLMFEGEVEKGLE----IVRTCRDRYNGRVRNPF-----NEYEC 769
Query: 94 GG-YMRALAVYAM 105
G Y RA+A YAM
Sbjct: 770 GAWYARAMASYAM 782
>gi|160890984|ref|ZP_02071987.1| hypothetical protein BACUNI_03431 [Bacteroides uniformis ATCC 8492]
gi|156859205|gb|EDO52636.1| hypothetical protein BACUNI_03431 [Bacteroides uniformis ATCC 8492]
Length = 858
Score = 37.0 bits (84), Expect = 1.5, Method: Composition-based stats.
Identities = 24/73 (32%), Positives = 36/73 (49%), Gaps = 10/73 (13%)
Query: 34 IQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRS 93
+ S EVWTG+ Y ++A +++EG V++ + RT +R P Y
Sbjct: 726 VYSNEVWTGIEYQVAAHLMFEGEVEKGLE----IVRTCRDRYNGRVRNPF-----NEYEC 776
Query: 94 GG-YMRALAVYAM 105
G Y RA+A YAM
Sbjct: 777 GAWYARAMASYAM 789
>gi|333382634|ref|ZP_08474302.1| hypothetical protein HMPREF9455_02468 [Dysgonomonas gadei ATCC
BAA-286]
gi|332828576|gb|EGK01276.1| hypothetical protein HMPREF9455_02468 [Dysgonomonas gadei ATCC
BAA-286]
Length = 859
Score = 37.0 bits (84), Expect = 1.6, Method: Composition-based stats.
Identities = 23/73 (31%), Positives = 37/73 (50%), Gaps = 10/73 (13%)
Query: 34 IQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRS 93
+ S+EVWTG+ Y ++A +++EG V++ + RT +R P Y
Sbjct: 723 VYSDEVWTGIEYQVAAHLMFEGEVEKGLE----IVRTCRDRYNGKIRNPF-----NEYEC 773
Query: 94 GG-YMRALAVYAM 105
G Y RA+A YA+
Sbjct: 774 GAWYARAMASYAL 786
>gi|269925332|ref|YP_003321955.1| hypothetical protein Tter_0211 [Thermobaculum terrenum ATCC
BAA-798]
gi|269788992|gb|ACZ41133.1| protein of unknown function DUF608 [Thermobaculum terrenum ATCC
BAA-798]
Length = 811
Score = 35.8 bits (81), Expect = 3.3, Method: Composition-based stats.
Identities = 18/39 (46%), Positives = 24/39 (61%), Gaps = 1/39 (2%)
Query: 25 PNGDRDRSAIQ-SEEVWTGVTYLLSAAMLYEGNVDEAWT 62
P GDR S+EVW+G Y ++ M+YEG +DEA T
Sbjct: 662 PFGDRPSFPFPYSDEVWSGTEYQVATLMIYEGLLDEALT 700
>gi|327311335|ref|YP_004338232.1| hypothetical protein TUZN_1447 [Thermoproteus uzoniensis 768-20]
gi|326947814|gb|AEA12920.1| hypothetical protein TUZN_1447 [Thermoproteus uzoniensis 768-20]
Length = 892
Score = 35.4 bits (80), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 48/118 (40%), Gaps = 29/118 (24%)
Query: 6 RASARGFEAGSMGAVNGMKPNGDRDRSAI------------------QSEEVWTGVTYLL 47
RA A+ A G +NGM P DR R + Q + WTGV Y +
Sbjct: 680 RAVAKYNLAPDEGLINGMYP--DRRRPSFVGPTLYENFTRGPYLPTWQMDTPWTGVEYAV 737
Query: 48 SAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 105
+ M YEG V+E L+ YER G + E T YMR L+ +A+
Sbjct: 738 AGHMFYEGLVEEGTAVLKALHER-YERGGHYWNHIEWGT--------HYMRPLSAWAV 786
>gi|160931643|ref|ZP_02079037.1| hypothetical protein CLOLEP_00474 [Clostridium leptum DSM 753]
gi|156869288|gb|EDO62660.1| hypothetical protein CLOLEP_00474 [Clostridium leptum DSM 753]
Length = 804
Score = 35.4 bits (80), Expect = 4.5, Method: Composition-based stats.
Identities = 15/37 (40%), Positives = 25/37 (67%), Gaps = 1/37 (2%)
Query: 25 PNGDRDRSA-IQSEEVWTGVTYLLSAAMLYEGNVDEA 60
P+G R + + S+EVWTG+ Y ++A ++YEG + E
Sbjct: 670 PSGGRPKQPFVYSDEVWTGIEYQVAAGLIYEGFLQEG 706
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.313 0.130 0.379
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,885,216,453
Number of Sequences: 23463169
Number of extensions: 74454235
Number of successful extensions: 116034
Number of sequences better than 100.0: 356
Number of HSP's better than 100.0 without gapping: 288
Number of HSP's successfully gapped in prelim test: 68
Number of HSP's that attempted gapping in prelim test: 115452
Number of HSP's gapped (non-prelim): 362
length of query: 115
length of database: 8,064,228,071
effective HSP length: 82
effective length of query: 33
effective length of database: 6,140,248,213
effective search space: 202628191029
effective search space used: 202628191029
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 69 (31.2 bits)