BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy5277
(314 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|432110769|gb|ELK34246.1| Non-lysosomal glucosylceramidase [Myotis davidii]
Length = 868
Score = 282 bits (722), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 139/292 (47%), Positives = 190/292 (65%), Gaps = 12/292 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 560 PDDEPWLRVNAYVIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMK 619
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q++FS+
Sbjct: 620 FDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAALCGTQDIQEKFSS 679
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT--EAIFPSAN 195
L++ Q Y LLW G YY +D S S SIM+DQ G FL+ASG+ +FP+ +
Sbjct: 680 ILSRGQQAYERLLWNGRYYNYDCSSRPQSCSIMSDQCAGQWFLRASGLGEGETEVFPTQH 739
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
+ RAL TIF NV+ F G+MGAVNGM+P G DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 740 VVRALQTIFEFNVQAFAGGTMGAVNGMQPQGVPDRSSVQSDEVWVGVVYGLAATMIQEGL 799
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
E + TA G YRTV+ER GLGF+TPE + +RS YMR L+++AMQ A
Sbjct: 800 TWEGFQTAEGCYRTVWERLGLGFQTPEAYCQKRVFRSLAYMRPLSIWAMQLA 851
>gi|328709489|ref|XP_001943760.2| PREDICTED: non-lysosomal glucosylceramidase-like [Acyrthosiphon
pisum]
Length = 776
Score = 280 bits (717), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 136/290 (46%), Positives = 188/290 (64%), Gaps = 11/290 (3%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
PEE+PF+L+N Y V +VS+W+DLNLKF++ YRDY + D ++L +YP ++ K S
Sbjct: 481 PEEDPFNLVNAYLVHNVSEWRDLNLKFVLQCYRDYTMFGDNKYLADMYPQIKNVMEKSLS 540
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
+D G G+++N G DQT AYCGGL +AAL C V++S++L +++S
Sbjct: 541 WDTDGDGMIENSGTADQTFDTWIMTGISAYCGGLWLAALYCTVQISNILKQDDISEKYST 600
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIA 197
L KA + ++ LW G Y+ FDSS HS SIMADQLCG +LK GVN E + P +
Sbjct: 601 ILNKAKESFNAALWNGGYFNFDSSGQHHSKSIMADQLCGEWYLKCCGVN-EEVLPVDRVK 659
Query: 198 RALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
+AL TI+ NV GF G MGAVNGM P+G+ D ++QSEEVWTGVTY LS+ M+ G D
Sbjct: 660 KALHTIYKMNVLGFNDGKMGAVNGMMPDGNPDTFSLQSEEVWTGVTYALSSLMIAHGLRD 719
Query: 258 EAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
E + TA G+Y TVY G+ ++TPE + YRS GYMR L+++++Q A
Sbjct: 720 EGFNTAKGIYNTVYNNIGMAYQTPEAIYSKNAYRSVGYMRPLSIWSIQTA 769
>gi|444729906|gb|ELW70309.1| Non-lysosomal glucosylceramidase [Tupaia chinensis]
Length = 1001
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 138/290 (47%), Positives = 191/290 (65%), Gaps = 14/290 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 672 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMK 731
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q++F++
Sbjct: 732 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDVQEKFTS 791
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
LT+ + Y LLW G YY +DSS S SIM+DQ G FLKA G+ +TE +FP+
Sbjct: 792 ILTRGQEAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLKACGLGEGDTE-VFPTP 850
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 851 HVVRALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 910
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++AM
Sbjct: 911 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIWAM 960
>gi|291241331|ref|XP_002740565.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 792
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 135/291 (46%), Positives = 194/291 (66%), Gaps = 12/291 (4%)
Query: 29 EEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESF 88
++EP+ +N Y ++D +DWKDLNLKF++ +YRDY + KD +L ++P +++ ++
Sbjct: 494 DDEPWVRVNAYLMYDTADWKDLNLKFVLQVYRDYYVTKDFIYLQDMWPKVKVVMQTSLAW 553
Query: 89 DKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAK 138
D G GI+ N G+ DQT AYCGGL +AA+ M+E++ +L++ Q +++S
Sbjct: 554 DTDGDGIIDNSGYADQTYDAWSVTGASAYCGGLWLAAVRAMMEIAKILDEEDQFKKYSDV 613
Query: 139 LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE--AIFPSANI 196
L + Y LW G YY +DSS+ +HS+SIM DQ G +L+A G+ E +FP +I
Sbjct: 614 LIQGKSSYEKKLWNGKYYNYDSSKQKHSNSIMCDQAAGQWYLRACGLVQENDEVFPKDHI 673
Query: 197 ARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNV 256
AL I+ NV GFE G+MGAVNGM+P+G D ++ QSEEVWTGVTY L+A M+ EG +
Sbjct: 674 LCALRAIYKMNVMGFEKGTMGAVNGMRPDGSPDTTSFQSEEVWTGVTYALAANMIQEGLL 733
Query: 257 DEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
+EA+TTA G+Y+T YER GLGF+TPE + YRS GYMR LA++AMQ A
Sbjct: 734 EEAFTTASGVYKTCYERCGLGFQTPEAYFANNCYRSLGYMRPLAIWAMQYA 784
>gi|351707038|gb|EHB09957.1| Non-lysosomal glucosylceramidase [Heterocephalus glaber]
Length = 922
Score = 277 bits (709), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 140/300 (46%), Positives = 193/300 (64%), Gaps = 14/300 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 593 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMK 652
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A V+M+ L Q +FS+
Sbjct: 653 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVTVQMAVLCAAQDVQDKFSS 712
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +TE +FP+
Sbjct: 713 ILSRGQEAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTT 771
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 772 HVVRALQTIFELNVQSFAGGAMGAVNGMQPHGIPDRSSVQSDEVWVGVVYGLAATMIQEG 831
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 314
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++AMQ A + K K
Sbjct: 832 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIWAMQLALQQQKHK 891
>gi|334333160|ref|XP_003341682.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal
glucosylceramidase-like [Monodelphis domestica]
Length = 950
Score = 276 bits (706), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 135/293 (46%), Positives = 192/293 (65%), Gaps = 14/293 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ IN Y + D +DWKDLNLKF++ IYRD+ L + FL ++P C ++
Sbjct: 603 PDDEPWLRINAYLIHDTADWKDLNLKFVLQIYRDFYLTGNESFLRDMWPVCQAVMESEMK 662
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+PDQT AYCGGL +AA+A MV+M+++ Q++F +
Sbjct: 663 FDKDQDGLIENGGYPDQTYDGWITTGPSAYCGGLWLAAVAVMVQMATVCGTQDIQEKFLS 722
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S IM+DQ G FL+A G+ +TE +FP+
Sbjct: 723 ILSRGREAYERLLWNGRYYNYDSSTQPQSRIIMSDQCAGQWFLRACGLGEGDTE-VFPTP 781
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+GF G+MGAVNGM+P+G D S++QS+EVW GV Y L+A M+ EG
Sbjct: 782 HVIRALQTIFEVNVQGFAGGAMGAVNGMQPDGIPDTSSVQSDEVWVGVVYGLAATMIQEG 841
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
+ E + TA G YRTV+ER G+ F+TPE + +RS YMR L+++AMQ A
Sbjct: 842 LIQEGFRTAEGCYRTVWERLGMAFQTPEAYCQHRVFRSLAYMRPLSIWAMQLA 894
>gi|156554493|ref|XP_001604835.1| PREDICTED: non-lysosomal glucosylceramidase-like [Nasonia
vitripennis]
Length = 826
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 133/291 (45%), Positives = 186/291 (63%), Gaps = 11/291 (3%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P EEPF LIN Y + DVS W+DLN KF++S YR Y ++++ L + T ++ +
Sbjct: 512 PGEEPFDLINAYPIHDVSQWRDLNSKFVLSCYRIYFFNRNLEQLRDFWSTIKLVLEHSLT 571
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FD+ G+++NGGFPDQT AYCGGL IAAL C V+M+ LL +++ + ++
Sbjct: 572 FDEDNDGLIENGGFPDQTYDCWVMSGPSAYCGGLWIAALHCAVKMAELLGESEDEARYKG 631
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIA 197
L + + D LW G YY FD + SIM+DQLCG+ L+A G + E +FP +
Sbjct: 632 ILDRGKVAFQDKLWNGKYYNFDCGKDESKLSIMSDQLCGHWLLRACGFSYE-VFPQDRVR 690
Query: 198 RALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
+L T+F NV ++ G GAVNG P G D S IQSEE+WTGV Y L+A +++EG ++
Sbjct: 691 SSLETVFQNNVMKYKNGQQGAVNGFSPTGSIDYSCIQSEEMWTGVAYGLAALLIHEGMIE 750
Query: 258 EAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAY 308
EA+ TA G+YRTVYE+ G+GFETPE L K YR+ GYMR L+++AMQ A+
Sbjct: 751 EAFRTAEGVYRTVYEKIGMGFETPEALYEHKVYRAIGYMRPLSIWAMQHAW 801
>gi|156356077|ref|XP_001623757.1| predicted protein [Nematostella vectensis]
gi|156210486|gb|EDO31657.1| predicted protein [Nematostella vectensis]
Length = 783
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 130/300 (43%), Positives = 194/300 (64%), Gaps = 13/300 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P E P+ +N Y++ D S WKDLNLKF++ +YRDY D+ +L ++P ++ K +
Sbjct: 480 PSEAPWDHVNAYHIHDTSKWKDLNLKFVLQVYRDYVFTNDVYYLQDMWPITKTVMTKSMT 539
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
+D G G+++N G DQT AYCGGL +AAL M E++++L+ ++ ++
Sbjct: 540 YDSDGDGLIENSGLADQTFDAWPVTGPSAYCGGLWLAALRVMAEIATILDFPDERGKYEK 599
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L + + Y LLW G YY +DSS S++ +SIMADQL G +L A + + + +FPS
Sbjct: 600 ILARGKKAYERLLWNGKYYNYDSSTSKYHNSIMADQLSGQWYLHACDLAQTSNDRVFPSE 659
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
N+ AL T+F+ NV F+ G+MGAVNG++P+G D S++Q+EEVWTGVTY ++A+M+ EG
Sbjct: 660 NVISALRTVFNFNVMKFQEGTMGAVNGIRPDGQLDTSSLQAEEVWTGVTYAVAASMIQEG 719
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 314
VDE + TA G+Y T +ER G+ F+TPE + + YRS YMR L+++AMQ A K K K
Sbjct: 720 LVDEGFKTASGIYNTCFERLGMNFQTPEAIVANGNYRSLAYMRPLSIWAMQWALEKRKNK 779
>gi|343959578|dbj|BAK63646.1| bile acid beta-glucosidase [Pan troglodytes]
Length = 514
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 185 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 244
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q +FS+
Sbjct: 245 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 304
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +TE +FP+
Sbjct: 305 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 363
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 364 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 423
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 424 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 471
>gi|194669510|ref|XP_001790122.1| PREDICTED: non-lysosomal glucosylceramidase [Bos taurus]
gi|297477943|ref|XP_002689726.1| PREDICTED: non-lysosomal glucosylceramidase [Bos taurus]
gi|296484715|tpg|DAA26830.1| TPA: hypothetical protein BOS_8918 [Bos taurus]
Length = 918
Score = 274 bits (700), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 134/289 (46%), Positives = 184/289 (63%), Gaps = 12/289 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 588 PDDEPWLRVNAYEIHDTGDWKDLNLKFVLQVYRDYYLTGDQCFLRDMWPVCLAVMESEMK 647
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L + Q +FS+
Sbjct: 648 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQEAQNKFSS 707
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT--EAIFPSAN 195
L++ + Y LLW G YY +D S S SIM+DQ G FL+ASG+ +FP+
Sbjct: 708 ILSRGQEAYDRLLWNGRYYNYDCSSQPQSCSIMSDQCAGQWFLRASGLGEGDSEVFPTQQ 767
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
+ RAL TIF NV+ F G+MGAVNGM+P G D S++QS+EVW GV Y L+A M+ EG
Sbjct: 768 VVRALQTIFEFNVQAFAGGAMGAVNGMQPCGVPDTSSVQSDEVWVGVVYGLAATMIQEGL 827
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++AM
Sbjct: 828 TREGFRTAEGCYRTVWERLGLAFQTPEAYCQRRVFRSLAYMRPLSIWAM 876
>gi|395740303|ref|XP_003777396.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Pongo
abelii]
Length = 933
Score = 274 bits (700), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 604 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 663
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q +FS+
Sbjct: 664 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 723
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +TE +FP+
Sbjct: 724 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTP 782
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 783 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 842
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 843 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890
>gi|385648282|ref|NP_001245314.1| glucosidase, beta (bile acid) 2 [Sus scrofa]
Length = 915
Score = 274 bits (700), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 135/288 (46%), Positives = 189/288 (65%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y V D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 590 PDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYYLTGDQVFLRDMWPVCLAVMESEMK 649
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L + Q QFS+
Sbjct: 650 FDKDQDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGTREVQDQFSS 709
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +D S + SIM+DQ G FL+ASG+ +TE +FP+
Sbjct: 710 ILSRGREAYERLLWNGRYYNYDCSSQPQAYSIMSDQCAGQWFLRASGLGEGDTE-VFPTP 768
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ +AL TIF NV+ F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 769 HVVQALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYALAATMIQEG 828
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 829 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 876
>gi|194386992|dbj|BAG59862.1| unnamed protein product [Homo sapiens]
Length = 755
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 426 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 485
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q +FS+
Sbjct: 486 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 545
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +TE +FP+
Sbjct: 546 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 604
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 605 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 664
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 665 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 712
>gi|297684086|ref|XP_002819683.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Pongo
abelii]
Length = 927
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 717
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +TE +FP+
Sbjct: 718 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTP 776
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 777 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 836
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 837 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|332228461|ref|XP_003263408.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal glucosylceramidase
[Nomascus leucogenys]
Length = 919
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 590 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 649
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q +FS+
Sbjct: 650 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 709
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +TE +FP+
Sbjct: 710 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTP 768
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 769 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 828
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 829 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 876
>gi|426361725|ref|XP_004048050.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Gorilla
gorilla gorilla]
Length = 927
Score = 273 bits (699), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 717
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +TE +FP+
Sbjct: 718 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 776
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 777 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 836
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 837 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|14042883|dbj|BAB55430.1| unnamed protein product [Homo sapiens]
Length = 927
Score = 273 bits (699), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 717
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +TE +FP+
Sbjct: 718 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 776
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 777 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 836
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 837 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|332831849|ref|XP_001167923.2| PREDICTED: non-lysosomal glucosylceramidase isoform 4 [Pan
troglodytes]
Length = 933
Score = 273 bits (699), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 604 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 663
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q +FS+
Sbjct: 664 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 723
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +TE +FP+
Sbjct: 724 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 782
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 783 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 842
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 843 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890
>gi|440901933|gb|ELR52793.1| Non-lysosomal glucosylceramidase, partial [Bos grunniens mutus]
Length = 927
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 134/289 (46%), Positives = 184/289 (63%), Gaps = 12/289 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 597 PDDEPWLRVNAYEIHDTGDWKDLNLKFVLQVYRDYYLTGDQCFLRDMWPVCLAVMESEMK 656
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L + Q +FS+
Sbjct: 657 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQEAQNKFSS 716
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT--EAIFPSAN 195
L++ + Y LLW G YY +D S S SIM+DQ G FL+ASG+ +FP+
Sbjct: 717 ILSRGQEAYDRLLWNGRYYNYDCSSQPQSCSIMSDQCAGQWFLRASGLGEGDSEVFPTQQ 776
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
+ RAL TIF NV+ F G+MGAVNGM+P G D S++QS+EVW GV Y L+A M+ EG
Sbjct: 777 VVRALQTIFEFNVQAFAGGAMGAVNGMQPRGVPDTSSVQSDEVWVGVVYGLAATMIQEGL 836
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++AM
Sbjct: 837 TWEGFRTAEGCYRTVWERLGLAFQTPEAYCQRRVFRSLAYMRPLSIWAM 885
>gi|24308251|ref|NP_065995.1| non-lysosomal glucosylceramidase [Homo sapiens]
gi|143018392|sp|Q9HCG7.2|GBA2_HUMAN RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
AltName: Full=Beta-glucocerebrosidase 2;
Short=Beta-glucosidase 2; AltName:
Full=Glucosylceramidase 2
gi|15030210|gb|AAH11363.1| Glucosidase, beta (bile acid) 2 [Homo sapiens]
gi|16215453|emb|CAC83792.1| bile acid beta-glucosidase [Homo sapiens]
gi|21739906|emb|CAD38976.1| hypothetical protein [Homo sapiens]
gi|119578752|gb|EAW58348.1| glucosidase, beta (bile acid) 2, isoform CRA_d [Homo sapiens]
gi|119578753|gb|EAW58349.1| glucosidase, beta (bile acid) 2, isoform CRA_d [Homo sapiens]
Length = 927
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 717
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +TE +FP+
Sbjct: 718 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 776
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 777 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 836
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 837 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|426220232|ref|XP_004004320.1| PREDICTED: non-lysosomal glucosylceramidase [Ovis aries]
Length = 918
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 134/289 (46%), Positives = 184/289 (63%), Gaps = 12/289 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 588 PDDEPWLRVNAYEIHDTGDWKDLNLKFVLQVYRDYYLTGDQCFLRDMWPVCLAVMESEMK 647
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L + Q +FS+
Sbjct: 648 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQEAQNKFSS 707
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT--EAIFPSAN 195
L++ + Y LLW G YY +D S S SIM+DQ G FL+ASG+ +FP+
Sbjct: 708 ILSRGQEAYDRLLWNGRYYNYDCSSQPQSYSIMSDQCAGQWFLRASGLGEGDSEVFPTQQ 767
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
+ RAL TIF NV+ F G+MGAVNGM+P G D S++QS+EVW GV Y L+A M+ EG
Sbjct: 768 VVRALQTIFEFNVQAFAGGAMGAVNGMQPRGVPDTSSVQSDEVWVGVVYGLAATMIQEGL 827
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++AM
Sbjct: 828 TWEGFRTAEGCYRTVWERLGLAFQTPEAYCQRRVFRSLAYMRPLSIWAM 876
>gi|221041110|dbj|BAH12232.1| unnamed protein product [Homo sapiens]
Length = 933
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 604 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 663
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q +FS+
Sbjct: 664 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 723
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +TE +FP+
Sbjct: 724 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 782
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 783 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 842
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 843 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890
>gi|426361727|ref|XP_004048051.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Gorilla
gorilla gorilla]
Length = 933
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 604 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 663
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q +FS+
Sbjct: 664 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 723
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +TE +FP+
Sbjct: 724 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 782
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 783 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 842
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 843 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890
>gi|55633173|ref|XP_520565.1| PREDICTED: non-lysosomal glucosylceramidase isoform 6 [Pan
troglodytes]
gi|410227830|gb|JAA11134.1| talin 1 [Pan troglodytes]
Length = 927
Score = 273 bits (698), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 134/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 717
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +TE +FP+
Sbjct: 718 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 776
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 777 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 836
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 837 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|395515264|ref|XP_003761826.1| PREDICTED: non-lysosomal glucosylceramidase [Sarcophilus harrisii]
Length = 954
Score = 273 bits (698), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 137/293 (46%), Positives = 191/293 (65%), Gaps = 14/293 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ IN Y + D +DWKDLNLKF++ IYRD+ L + FL ++P C ++
Sbjct: 607 PDDEPWLRINAYLIHDTADWKDLNLKFVLQIYRDFYLTGNESFLRDMWPVCQAVMESEMK 666
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M+++ Q++F +
Sbjct: 667 FDKDQDGLIENGGYADQTYDGWITTGPSAYCGGLWLAAVAVMVQMAAVCGAQDVQEKFLS 726
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S IM+DQ G FL+A G+ +TE +FPSA
Sbjct: 727 ILSRGREAYERLLWNGRYYNYDSSSQPQSRIIMSDQCAGQWFLRACGLGKGDTE-VFPSA 785
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+GF G+MGAVNGM P+G D S++QS+EVW GV Y L+A M+ EG
Sbjct: 786 HVIRALQTIFELNVQGFAGGAMGAVNGMHPDGVPDTSSVQSDEVWVGVVYGLAATMIQEG 845
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
V E + TA G YRTV+ER G+ F+TPE + +RS YMR L+++AMQ A
Sbjct: 846 LVQEGFLTAEGCYRTVWERLGMAFQTPEAYCQRRVFRSLAYMRPLSIWAMQLA 898
>gi|403306662|ref|XP_003943842.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Saimiri
boliviensis boliviensis]
Length = 923
Score = 273 bits (698), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 133/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M+++ Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAAVCGAQDIQDKFSS 717
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +TE +FP+
Sbjct: 718 ILSRGQEAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTP 776
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 777 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 836
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 837 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|403306664|ref|XP_003943843.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Saimiri
boliviensis boliviensis]
Length = 929
Score = 273 bits (698), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 133/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 604 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 663
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M+++ Q +FS+
Sbjct: 664 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAAVCGAQDIQDKFSS 723
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +TE +FP+
Sbjct: 724 ILSRGQEAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTP 782
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 783 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 842
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 843 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890
>gi|397519483|ref|XP_003829888.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Pan
paniscus]
Length = 755
Score = 273 bits (698), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 133/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 426 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 485
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q +FS+
Sbjct: 486 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 545
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +TE +FP+
Sbjct: 546 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 604
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL T+F NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 605 HVVRALQTVFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 664
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 665 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 712
>gi|73971791|ref|XP_531991.2| PREDICTED: non-lysosomal glucosylceramidase [Canis lupus familiaris]
Length = 1136
Score = 273 bits (697), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 138/290 (47%), Positives = 187/290 (64%), Gaps = 14/290 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y V D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 809 PDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMK 868
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M+ L Q +FS+
Sbjct: 869 FDKDQDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAILCGAHDVQDKFSS 928
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L + + Y LLW G YY +D S S SIM+DQ G FL+ASG+ +TE +FP+
Sbjct: 929 ILRRGQEAYERLLWNGRYYNYDCSPQPQSCSIMSDQCAGQWFLRASGLGEGDTE-VFPTR 987
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ AL TIF NVR F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 988 HVVCALQTIFEFNVRAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 1047
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++AM
Sbjct: 1048 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIWAM 1097
>gi|397519481|ref|XP_003829887.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Pan
paniscus]
Length = 927
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 133/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 717
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +TE +FP+
Sbjct: 718 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 776
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL T+F NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 777 HVVRALQTVFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 836
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 837 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|410925034|ref|XP_003975986.1| PREDICTED: non-lysosomal glucosylceramidase-like [Takifugu
rubripes]
Length = 827
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 132/292 (45%), Positives = 184/292 (63%), Gaps = 12/292 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
PE+EP+ +N Y + + +DWKDLNLKF++ +YRD+ L +D ++L ++P C ++
Sbjct: 526 PEDEPWQRVNAYLIHNTADWKDLNLKFVLQVYRDFHLTQDSQYLQDMWPICQTVMESALK 585
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FD G G+++N GF DQT AYCGGL +A+L M +M+ LL + + Q +
Sbjct: 586 FDLDGDGLIENSGFADQTYDGWTVTGPSAYCGGLWLASLVVMCKMARLLENEMKYQYYRD 645
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEA--IFPSAN 195
L + S Y +LW G YY +DSS HS+S+M+DQ G+ FL+ASG+ FP
Sbjct: 646 ILDRGSVAYDKVLWNGKYYNYDSSGKDHSNSVMSDQCAGHWFLRASGLGDGEYQAFPKEK 705
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
I RAL T+F NV F G MGAVNGM+P G DRS++QS+EVW GV Y L+A M++EG
Sbjct: 706 IQRALKTVFDLNVMSFAGGQMGAVNGMRPEGVPDRSSVQSDEVWIGVVYGLAATMIHEGM 765
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
++E TA G YRTV+ER G+ F+TPE YRS YMR L+++AMQ A
Sbjct: 766 LEEGMRTAEGCYRTVWERLGMAFQTPEAYCEKGIYRSLAYMRPLSIWAMQLA 817
>gi|354485725|ref|XP_003505033.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Cricetulus griseus]
Length = 898
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 135/288 (46%), Positives = 186/288 (64%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 569 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMK 628
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M+ L Q +FS+
Sbjct: 629 FDKDQDGLIENGGYADQTYDGWVATGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSS 688
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L + + Y LLW G YY +DSS S SIM+DQ G FL+A G+ +TE +FP+
Sbjct: 689 ILCRGREAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLRACGLGEGDTE-VFPTL 747
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 748 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 807
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 808 LTREGFRTAEGCYRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 855
>gi|390458158|ref|XP_002743170.2| PREDICTED: non-lysosomal glucosylceramidase [Callithrix jacchus]
Length = 1065
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 133/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 740 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 799
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M+++ Q +FS+
Sbjct: 800 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAAVCGAQDIQDKFSS 859
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +TE +FP+
Sbjct: 860 ILSRGQEAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTP 918
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 919 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 978
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 979 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 1026
>gi|355689821|gb|AER98956.1| glucosidase, beta 2 [Mustela putorius furo]
Length = 550
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 137/288 (47%), Positives = 185/288 (64%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y V D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 220 PDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYHLTGDQSFLRDMWPVCLAVMESEMK 279
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M+ L Q +FS+
Sbjct: 280 FDKDQDGLIENGGYADQTYDGWITTGPSAYCGGLWLAAVAVMVQMAVLCGAKDIQDKFSS 339
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L + + Y LLW G YY +D S S SIM+DQ G FLKASG+ +TE +FP+
Sbjct: 340 ILRRGQEAYERLLWNGRYYNYDCSPQPQSCSIMSDQCAGQWFLKASGLGEGDTE-VFPTQ 398
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ AL TIF NVR F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 399 HVVCALQTIFEFNVRAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 458
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 459 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 506
>gi|109111050|ref|XP_001084705.1| PREDICTED: non-lysosomal glucosylceramidase [Macaca mulatta]
Length = 927
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 132/287 (45%), Positives = 184/287 (64%), Gaps = 12/287 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 717
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT--EAIFPSAN 195
L + + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +FP+ +
Sbjct: 718 ILHRGQEAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGETEVFPTPH 777
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
+ RAL TIF NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 778 VVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGL 837
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 838 TWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|355567647|gb|EHH23988.1| Non-lysosomal glucosylceramidase [Macaca mulatta]
gi|355753217|gb|EHH57263.1| Non-lysosomal glucosylceramidase [Macaca fascicularis]
gi|380789499|gb|AFE66625.1| non-lysosomal glucosylceramidase [Macaca mulatta]
gi|384943046|gb|AFI35128.1| non-lysosomal glucosylceramidase [Macaca mulatta]
Length = 927
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 132/287 (45%), Positives = 184/287 (64%), Gaps = 12/287 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 717
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT--EAIFPSAN 195
L + + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +FP+ +
Sbjct: 718 ILHRGQEAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGETEVFPTPH 777
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
+ RAL TIF NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 778 VVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGL 837
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 838 TWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|148670516|gb|EDL02463.1| glucosidase beta 2, isoform CRA_c [Mus musculus]
Length = 467
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 135/288 (46%), Positives = 186/288 (64%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ IYRDY L D FL ++P CL ++
Sbjct: 138 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMK 197
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M+ L Q++F++
Sbjct: 198 FDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFAS 257
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L + + Y LLW G YY +DSS S SIM+DQ G FL+A G+ +TE +FP+
Sbjct: 258 ILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTE-VFPTL 316
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM P+G DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 317 HVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 376
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 377 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 424
>gi|395855828|ref|XP_003800351.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal glucosylceramidase
[Otolemur garnettii]
Length = 1147
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 135/288 (46%), Positives = 187/288 (64%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 818 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMK 877
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q +FS+
Sbjct: 878 FDKDHDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDLQDKFSS 937
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L + + Y LLW G YY +DSS S SIM+DQ G FLKA G+ +TE +FP+
Sbjct: 938 ILRQGQEAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLKACGLGEGDTE-VFPTQ 996
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TI+ NV+ F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 997 HVVRALQTIYELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 1056
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 1057 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 1104
>gi|189217893|ref|NP_001013109.2| non-lysosomal glucosylceramidase [Rattus norvegicus]
gi|149045757|gb|EDL98757.1| glucosidase beta 2 [Rattus norvegicus]
Length = 904
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 134/288 (46%), Positives = 186/288 (64%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 589 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMK 648
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M+ L Q +FS+
Sbjct: 649 FDKDQDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSS 708
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L + + Y LLW G YY +DSS S S+M+DQ G FL+A G+ +TE +FP+
Sbjct: 709 ILCRGREAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLRACGLGEGDTE-VFPTL 767
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 768 HVVRALKTIFELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 827
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 828 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 875
>gi|338720173|ref|XP_001497796.3| PREDICTED: non-lysosomal glucosylceramidase [Equus caballus]
Length = 983
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 135/288 (46%), Positives = 187/288 (64%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ IYRDY L D FL ++P CL ++
Sbjct: 654 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLRDMWPVCLAVMESEMK 713
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q +FS+
Sbjct: 714 FDKDQDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDVQDRFSS 773
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +D S S SIM+DQ G FL+A G+ +TE +FP+
Sbjct: 774 ILSRGQEAYERLLWNGRYYNYDCSSQPQSRSIMSDQCAGQWFLRACGLGEGDTE-VFPTR 832
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
+I RAL TIF NV+ F G+MGAVNGM+P+G DR+++QS+EVW GV Y L+A M+ EG
Sbjct: 833 HIVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDRTSVQSDEVWVGVVYGLAATMIQEG 892
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 893 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 940
>gi|344271640|ref|XP_003407645.1| PREDICTED: non-lysosomal glucosylceramidase [Loxodonta africana]
Length = 967
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 133/288 (46%), Positives = 188/288 (65%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 642 PDDEPWLRVNAYVIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEME 701
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q++FS+
Sbjct: 702 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMATLCEAQDIQEKFSS 761
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S SIM+DQ G FL+A G+ +TE +FP+
Sbjct: 762 ILSRGKEAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLRACGLGEGDTE-VFPTP 820
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G D S++QS+EVW GV Y L+A M+ EG
Sbjct: 821 HVVRALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDTSSVQSDEVWVGVVYGLAATMIQEG 880
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
+ + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 881 LTSKGFWTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 928
>gi|402897018|ref|XP_003911574.1| PREDICTED: non-lysosomal glucosylceramidase [Papio anubis]
Length = 927
Score = 271 bits (692), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 131/287 (45%), Positives = 183/287 (63%), Gaps = 12/287 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT YCGGL +AA+A MV+M++L Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSVYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 717
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT--EAIFPSAN 195
L + + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +FP+ +
Sbjct: 718 ILHRGQEAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGETEVFPTPH 777
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
+ RAL TIF NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ EG
Sbjct: 778 VVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGL 837
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 838 TWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|143018484|sp|Q5M868.2|GBA2_RAT RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
AltName: Full=Beta-glucocerebrosidase 2;
Short=Beta-glucosidase 2; AltName:
Full=Glucosylceramidase 2
Length = 912
Score = 271 bits (692), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 134/288 (46%), Positives = 186/288 (64%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 589 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMK 648
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M+ L Q +FS+
Sbjct: 649 FDKDQDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSS 708
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L + + Y LLW G YY +DSS S S+M+DQ G FL+A G+ +TE +FP+
Sbjct: 709 ILCRGREAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLRACGLGEGDTE-VFPTL 767
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 768 HVVRALKTIFELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 827
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 828 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 875
>gi|50511011|dbj|BAD32491.1| mKIAA1605 protein [Mus musculus]
Length = 940
Score = 271 bits (692), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 135/288 (46%), Positives = 186/288 (64%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ IYRDY L D FL ++P CL ++
Sbjct: 611 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMK 670
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M+ L Q++F++
Sbjct: 671 FDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFAS 730
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L + + Y LLW G YY +DSS S SIM+DQ G FL+A G+ +TE +FP+
Sbjct: 731 ILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTE-VFPTL 789
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM P+G DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 790 HVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 849
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 850 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 897
>gi|26354314|dbj|BAC40785.1| unnamed protein product [Mus musculus]
Length = 918
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 135/288 (46%), Positives = 186/288 (64%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ IYRDY L D FL ++P CL ++
Sbjct: 589 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMK 648
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M+ L Q++F++
Sbjct: 649 FDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFAS 708
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L + + Y LLW G YY +DSS S SIM+DQ G FL+A G+ +TE +FP+
Sbjct: 709 ILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTE-VFPTL 767
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM P+G DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 768 HVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 827
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 828 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 875
>gi|148670514|gb|EDL02461.1| glucosidase beta 2, isoform CRA_a [Mus musculus]
Length = 918
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 135/288 (46%), Positives = 186/288 (64%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ IYRDY L D FL ++P CL ++
Sbjct: 589 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMK 648
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M+ L Q++F++
Sbjct: 649 FDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFAS 708
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L + + Y LLW G YY +DSS S SIM+DQ G FL+A G+ +TE +FP+
Sbjct: 709 ILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTE-VFPTL 767
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM P+G DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 768 HVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 827
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 828 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 875
>gi|240120073|ref|NP_766280.2| non-lysosomal glucosylceramidase [Mus musculus]
gi|143018443|sp|Q69ZF3.2|GBA2_MOUSE RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
AltName: Full=Beta-glucocerebrosidase 2;
Short=Beta-glucosidase 2; AltName:
Full=Glucosylceramidase 2
gi|34784135|gb|AAH56935.1| Glucosidase beta 2 [Mus musculus]
Length = 918
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 135/288 (46%), Positives = 186/288 (64%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ IYRDY L D FL ++P CL ++
Sbjct: 589 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMK 648
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M+ L Q++F++
Sbjct: 649 FDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFAS 708
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L + + Y LLW G YY +DSS S SIM+DQ G FL+A G+ +TE +FP+
Sbjct: 709 ILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTE-VFPTL 767
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM P+G DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 768 HVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 827
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 828 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 875
>gi|410978559|ref|XP_003995657.1| PREDICTED: non-lysosomal glucosylceramidase [Felis catus]
Length = 917
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 136/288 (47%), Positives = 185/288 (64%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y V D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 590 PDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMK 649
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M+ L Q +FS+
Sbjct: 650 FDKDQDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSS 709
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L + + Y LLW G YY +D S S SIM+DQ G FL+ASG+ +TE +FP+
Sbjct: 710 ILRRGQEAYERLLWNGRYYNYDCSPPPQSCSIMSDQCAGQWFLRASGLGEGDTE-VFPTP 768
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ AL TIF NVR F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 769 HVVCALQTIFEFNVRAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 828
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 829 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 876
>gi|281339788|gb|EFB15372.1| hypothetical protein PANDA_007478 [Ailuropoda melanoleuca]
Length = 923
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 135/288 (46%), Positives = 185/288 (64%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y V D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 598 PDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYYLTGDQSFLRDMWPVCLAVMESEMK 657
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+++ L Q +FS+
Sbjct: 658 FDKDQDGLIENGGYADQTYDGWITTGPSAYCGGLWLAAVAVMVQIAVLCGAQDVQDKFSS 717
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L + + Y LLW G YY +D S S SIM+DQ G FLKASG+ +TE +FP+
Sbjct: 718 ILRRGREAYERLLWNGRYYNYDCSPQPQSCSIMSDQCAGQWFLKASGLGEGDTE-VFPTP 776
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ AL TIF NV+ F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 777 HVVCALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 836
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 837 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|301767028|ref|XP_002918945.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Ailuropoda melanoleuca]
Length = 928
Score = 268 bits (685), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 135/288 (46%), Positives = 185/288 (64%), Gaps = 14/288 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y V D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 601 PDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYYLTGDQSFLRDMWPVCLAVMESEMK 660
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+++ L Q +FS+
Sbjct: 661 FDKDQDGLIENGGYADQTYDGWITTGPSAYCGGLWLAAVAVMVQIAVLCGAQDVQDKFSS 720
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L + + Y LLW G YY +D S S SIM+DQ G FLKASG+ +TE +FP+
Sbjct: 721 ILRRGREAYERLLWNGRYYNYDCSPQPQSCSIMSDQCAGQWFLKASGLGEGDTE-VFPTP 779
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ AL TIF NV+ F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 780 HVVCALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 839
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVY 302
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++
Sbjct: 840 LTWEGFQTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIW 887
>gi|405977906|gb|EKC42333.1| Non-lysosomal glucosylceramidase [Crassostrea gigas]
Length = 1464
Score = 267 bits (682), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 138/315 (43%), Positives = 186/315 (59%), Gaps = 12/315 (3%)
Query: 6 TMGEVKFRSAANTHERYYTRWFPEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH 65
T GEV R H + PE+EP++ +N Y + +WKDLN+KF++ YRDY+
Sbjct: 1144 TSGEVGIRK--RLHAVPHDMGDPEDEPWNRVNCYVIHPTCNWKDLNMKFVLQTYRDYSAT 1201
Query: 66 KDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAA 115
KD +L +YP ++ + +D GI+ NGGF DQT AYCGG+ +AA
Sbjct: 1202 KDDAYLRHMYPVAKYVMETAKKWDVDDDGIIDNGGFADQTFDAWTMTGASAYCGGMWLAA 1261
Query: 116 LACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLC 175
L M+EM+++L + L K Y + LW GSYY +D S H DSIMADQL
Sbjct: 1262 LRMMIEMATILKKEDDVLHYKEILDKGKVSYQNKLWNGSYYNYDCSSGGHHDSIMADQLA 1321
Query: 176 GYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQS 235
G+ FLKASG+ + IFP + +L IF NV FE G+MGA+NG +P+G +D S+ QS
Sbjct: 1322 GHWFLKASGLQDDDIFPPDRVKSSLKKIFENNVMMFEGGNMGAINGTRPDGSKDISSCQS 1381
Query: 236 EEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGY 295
EE W GVTY L+A M+ EG +D+A+ TA G Y +E GL F+TPE D YRS GY
Sbjct: 1382 EEFWVGVTYGLAANMIQEGLLDQAFQTAWGAYHVCWEWYGLAFQTPEAYMTDNIYRSLGY 1441
Query: 296 MRALAVYAMQDAYLK 310
MR LA+++MQ A K
Sbjct: 1442 MRPLAIWSMQWALEK 1456
>gi|125821356|ref|XP_687652.2| PREDICTED: non-lysosomal glucosylceramidase [Danio rerio]
Length = 851
Score = 266 bits (680), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 129/292 (44%), Positives = 183/292 (62%), Gaps = 12/292 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L +D ++L ++P C ++
Sbjct: 546 PDDEPWVRVNAYLIHDTADWKDLNLKFVLQVYRDYHLTQDQQYLKDMWPVCQTVMENELK 605
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G G+++N G+ DQT AYCGGL +A++ M +M+ +LN Q++
Sbjct: 606 FDKDGDGLIENSGYADQTYDGWKVTGPSAYCGGLWLASVCMMCKMARVLNCESVYQRYRD 665
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEA--IFPSAN 195
L + S + LLW G YY +DSS S+S+M+DQ G+ FL+ASG+ + FP
Sbjct: 666 ILERGSAAFDKLLWNGKYYNYDSSGRSLSNSVMSDQCAGHWFLRASGLGDDEYQAFPKEK 725
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
I AL ++F NV F G MGAVNGM+P G DRS++QS+EVW GV Y L+A M++EG
Sbjct: 726 ICSALKSVFDLNVMSFAGGQMGAVNGMRPEGVPDRSSVQSDEVWVGVVYGLAATMIHEGM 785
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
V+E TA G YR V+ER G+ F+TPE YRS YMR L+++AMQ A
Sbjct: 786 VEEGLRTAEGCYRAVWERMGMAFQTPEAYCEKGIYRSLAYMRPLSIWAMQLA 837
>gi|443710865|gb|ELU04898.1| hypothetical protein CAPTEDRAFT_216398 [Capitella teleta]
Length = 449
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 129/296 (43%), Positives = 180/296 (60%), Gaps = 14/296 (4%)
Query: 29 EEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESF 88
+ EP+S +N Y + ++WKDLN KF++ +YRDY KD ++L +YP C ++ +
Sbjct: 143 DNEPWSRLNAYIIHPTNEWKDLNPKFVLQVYRDYVFTKDEQYLYEMYPQCKAVMDHSLRW 202
Query: 89 DKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAK 138
D GI+ N GF DQT AYCGGL +AA+ EM+ L + ++
Sbjct: 203 DVDRDGIIDNSGFADQTYDAWTVTGASAYCGGLWLAAVKAFTEMAHRLGVVEDLMKYEEI 262
Query: 139 LTKASQVYHDLLWTG----SYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSA 194
L + + + + LW G YY +D S+S H DSIMADQ+CG FLKA GV A+FPSA
Sbjct: 263 LERGKKNFEEKLWNGWVDCEYYNYDCSESGHHDSIMADQMCGQWFLKACGVADNAVFPSA 322
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
N+ RAL +F NV F+ G MGA+NGM+PN +D ++ QS+E WTGVTY L A M+ G
Sbjct: 323 NVKRALEAVFKHNVLPFDGGRMGAINGMRPNAKKDVTSCQSDEFWTGVTYALGATMIQVG 382
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 310
+D+ + TA G Y T +ER GL F+TPE ++ +RS GYMR LA++A+Q A K
Sbjct: 383 MIDKGFQTAYGAYHTCWERYGLAFQTPEAYFDNRRFRSLGYMRPLAIWAIQHAVEK 438
>gi|169642399|gb|AAI60640.1| LOC100145361 protein [Xenopus (Silurana) tropicalis]
Length = 571
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 124/290 (42%), Positives = 181/290 (62%), Gaps = 12/290 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D ++WKDLNLKF++ +YRDY + KD +L ++ C ++
Sbjct: 263 PDDEPWQKLNAYLIHDTANWKDLNLKFVLQVYRDYHITKDTTYLRDMWSVCQTVMETSLK 322
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FD+ G G+++N GF DQT +YCGGL +AA+ M +M+ +L D Q++F+
Sbjct: 323 FDEDGDGLIENSGFADQTYDDWVMTGPSSYCGGLWLAAVCMMCKMAEVLGDGAAQEKFTD 382
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT--EAIFPSAN 195
L + + LLW G YY FD +S+S+M DQ G FL+A G+ +FP +
Sbjct: 383 ILHRGKAAFEKLLWNGKYYNFDCGDQPYSNSVMVDQCAGNWFLRACGLGAGESEVFPKDH 442
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
+ AL TIF NV+ F G MGAVNGM+P+G D S++QS+EVW GV Y L+A M++EG
Sbjct: 443 VVSALRTIFELNVKQFADGQMGAVNGMRPDGTTDTSSVQSDEVWIGVVYGLAATMIHEGL 502
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
V E + TA G YRTV+ER G+ F+TPE + +RS YMR L+++AMQ
Sbjct: 503 VQEGFATAEGCYRTVWERLGMSFQTPEAYCEKQVFRSLAYMRPLSIWAMQ 552
>gi|443709232|gb|ELU03988.1| hypothetical protein CAPTEDRAFT_186839 [Capitella teleta]
Length = 802
Score = 263 bits (672), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 129/296 (43%), Positives = 180/296 (60%), Gaps = 14/296 (4%)
Query: 29 EEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESF 88
+ EP+S +N Y + ++WKDLN KF++ +YRDY KD ++L +YP C ++ +
Sbjct: 496 DNEPWSRLNAYIIHPTNEWKDLNPKFVLQVYRDYVFTKDEQYLYEMYPQCKAVMDHSLRW 555
Query: 89 DKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAK 138
D GI+ N GF DQT AYCGGL +AA+ EM+ L + ++
Sbjct: 556 DVDRDGIIDNSGFADQTYDAWTVTGASAYCGGLWLAAVKAFTEMAHRLGVVEDLMKYEEI 615
Query: 139 LTKASQVYHDLLWTG----SYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSA 194
L + + + + LW G YY +D S+S H DSIMADQ+CG FLKA GV A+FPSA
Sbjct: 616 LERGKKNFEEKLWNGWVDCEYYNYDCSESGHHDSIMADQMCGQWFLKACGVADNAVFPSA 675
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
N+ RAL +F NV F+ G MGA+NGM+PN +D ++ QS+E WTGVTY L A M+ G
Sbjct: 676 NVKRALEAVFKHNVLPFDGGRMGAINGMRPNAKKDVTSCQSDEFWTGVTYALGATMIQVG 735
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 310
+D+ + TA G Y T +ER GL F+TPE ++ +RS GYMR LA++A+Q A K
Sbjct: 736 MIDKGFQTAYGAYHTCWERYGLAFQTPEAYFDNRRFRSLGYMRPLAIWAIQHAVEK 791
>gi|198424051|ref|XP_002127036.1| PREDICTED: similar to Non-lysosomal glucosylceramidase (NLGase)
(Glucosylceramidase 2) (Beta-glucocerebrosidase 2)
(Beta-glucosidase 2) [Ciona intestinalis]
Length = 861
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 127/291 (43%), Positives = 185/291 (63%), Gaps = 13/291 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
PE+EP+ +N Y V D +DW+DLN KF++ YRDY + KDI FL ++P C ++ +
Sbjct: 567 PEDEPWLRVNTYFVHDTADWRDLNPKFVLQAYRDYYITKDIDFLKAMWPICKIVMEQSMR 626
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
DK G+++N G DQT AYCGGL +AAL CM E + +L++ Q +F
Sbjct: 627 HDKDNDGLIENSGAADQTFDGWCVTGPSAYCGGLWLAALRCMEEAADILHEKDDQVKFHD 686
Query: 138 KLTKASQVYHDLLWTGSYYKFD-SSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANI 196
L +A++ Y D LW G Y+ +D S+ +H S+MADQ G FL ASG+ + P +
Sbjct: 687 ILVRATKAYQDRLWNGKYFNYDCSNNPQHRTSVMADQCAGQWFLLASGICD--VLPDDQV 744
Query: 197 ARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNV 256
+L TIFS NV ++ G GAVNGM+P+G D S++QS+EVWTGVTY L+A M+++G +
Sbjct: 745 MSSLKTIFSLNVEQYDGGRHGAVNGMRPHGAVDTSSVQSDEVWTGVTYALAATMIHKGML 804
Query: 257 DEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
E + TA G+YR+ ++R G+ F+TPE + + TYRS YMR L+++ MQ A
Sbjct: 805 KEGFRTASGIYRSCWQRYGMAFQTPEAMRKNHTYRSLAYMRPLSIWGMQYA 855
>gi|432920219|ref|XP_004079895.1| PREDICTED: non-lysosomal glucosylceramidase-like [Oryzias latipes]
Length = 858
Score = 261 bits (668), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 127/292 (43%), Positives = 179/292 (61%), Gaps = 12/292 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D + WKDLNLKF++ +YRD+ + +D ++L ++P C ++
Sbjct: 547 PDDEPWHRLNAYLIHDTAGWKDLNLKFVLQVYRDFHITQDNQYLKDMWPICQAVMESEIK 606
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FD G G+++N G+ DQT AYCGGL +A+L M +M+ L++ + Q +
Sbjct: 607 FDLDGDGLIENSGYADQTYDGWTVTGPSAYCGGLWLASLCVMCKMARLVDTERTYQHYKD 666
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE--AIFPSAN 195
L + S + LLW G YY +DSS HS+S+M+DQ G+ FLKAS E FP
Sbjct: 667 ILDRGSAAFDKLLWNGKYYNYDSSGRYHSNSVMSDQCAGHWFLKASACGEEDYQAFPKEK 726
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
I L +IF NV F G MGAVNGM+P G DRS++QS+EVW GV Y L+A M++EG
Sbjct: 727 IQSTLKSIFDLNVMAFAGGQMGAVNGMRPEGVPDRSSVQSDEVWIGVVYGLAATMIHEGM 786
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
+E TA G YRTV+ER G+ F+TPE YRS YMR L+++AMQ A
Sbjct: 787 REEGMHTAEGCYRTVWERLGMAFQTPEAYCEKNIYRSLAYMRPLSIWAMQLA 838
>gi|242015137|ref|XP_002428230.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212512791|gb|EEB15492.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 812
Score = 261 bits (668), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 128/291 (43%), Positives = 180/291 (61%), Gaps = 12/291 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P EEPF IN Y + DVS+W+DL+LKF++ YRDY L+ D+ L ++ + L+
Sbjct: 518 PFEEPFLKINSYPIHDVSEWRDLDLKFVLQSYRDYFLNNDLNQLKYLWKSVCVLMNNALR 577
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
+DK G G+++N G PDQT AYCG L +AAL C +M+ +L+D +FS
Sbjct: 578 WDKDGDGLIENVGEPDQTYDAWTMEGPSAYCGSLWLAALQCTCKMALILSDVDTYNKFSE 637
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIA 197
L A Q + LW G YY FD S+ + +IM+DQLCG +L G+ E + P A +
Sbjct: 638 TLKMAQQSFDKKLWNGKYYNFDCSE-KWGTTIMSDQLCGLWYLYCCGIKNE-VLPPAKVN 695
Query: 198 RALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
+L I+ NV G++G VNGM PNG+ D +QSEE+W GV Y L++ M++EG V+
Sbjct: 696 SSLRCIYENNVLKLGKGNLGCVNGMLPNGEIDIHTLQSEEIWVGVVYALASLMIFEGMVE 755
Query: 258 EAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAY 308
E W TA G+Y TVY+R G+GFETPE L K YR+ GYMR L++++MQ A+
Sbjct: 756 EGWKTARGIYETVYDRIGMGFETPEALRESKKYRAIGYMRPLSIWSMQLAW 806
>gi|224090236|ref|XP_002190896.1| PREDICTED: non-lysosomal glucosylceramidase [Taeniopygia guttata]
Length = 826
Score = 258 bits (659), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 133/292 (45%), Positives = 184/292 (63%), Gaps = 12/292 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P +EP+ +N Y + D +DWKDLNLKF++ +YRDY L D +L ++P C ++
Sbjct: 516 PGDEPWQRVNAYLMHDTADWKDLNLKFVLQVYRDYYLTHDSLYLQDMWPVCQAVMESELK 575
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FD G+++NGGF DQT AYCGGL +AA+ M +M+ +L DT+ +Q++
Sbjct: 576 FDTDNDGLIENGGFADQTYDAWVVIRRCAYCGGLWLAAVCMMCKMAEVLGDTEIRQKYLG 635
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEA--IFPSAN 195
L K + + +LW G YY +DSS S S SIM+DQ G FL A G++ +FP ++
Sbjct: 636 ILNKGKEAFERMLWNGKYYNYDSSGSDTSSSIMSDQCAGQWFLGACGLDQGEFEVFPKSH 695
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
I AL TIF NV F G+MGAVNGM+P+G D S++QS EVW GV Y L+A M+ EG
Sbjct: 696 ILSALRTIFEKNVLSFAGGTMGAVNGMRPDGVPDTSSVQSNEVWVGVVYSLAATMIQEGM 755
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
V+E + TA G YRTV+ER G+ F+TPE K YRS YMR L++++MQ A
Sbjct: 756 VEEGFRTAEGCYRTVWERLGMAFQTPEAYREKKVYRSLAYMRPLSIWSMQLA 807
>gi|390344669|ref|XP_001200431.2| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Strongylocentrotus purpuratus]
Length = 347
Score = 258 bits (659), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 127/290 (43%), Positives = 179/290 (61%), Gaps = 12/290 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
PEE+P+ +N Y D ++WKDLNLK ++ +YRDY KD FL ++P C+ ++ K S
Sbjct: 41 PEEDPWLKLNCYWFHDTAEWKDLNLKMVLMVYRDYFATKDREFLDFMWPKCMTVMAKANS 100
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
DK G G++ + G DQT AYCGGL++AAL CM EM+ LL D + ++Q+S
Sbjct: 101 QDKDGDGLIDHFGTADQTYDVWTAKGCSAYCGGLYLAALKCMCEMADLLGDDKAKRQYSE 160
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE--AIFPSAN 195
L + Y LW G Y+ +DSS + DSIMADQ CG+ +L+A + E +F +
Sbjct: 161 ILQRGKATYEKKLWNGRYFNYDSSDQSYHDSIMADQTCGHWYLRACDLVPERDQVFQVDH 220
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
+ AL TIF NV G + G+ GA+NG++P+G D +++Q EEVW G TY L+ M+ EG
Sbjct: 221 VRSALRTIFDMNVMGVKEGNFGAMNGIRPSGKPDHTSLQGEEVWVGTTYGLAGNMIQEGM 280
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+E + TA G Y T YE+ GL ++ PE K YRS GYMR LA++AMQ
Sbjct: 281 WEEGFRTAKGCYTTCYEQAGLAYQVPEAYMSKKIYRSLGYMRPLAIWAMQ 330
>gi|363744064|ref|XP_003642968.1| PREDICTED: non-lysosomal glucosylceramidase-like [Gallus gallus]
Length = 853
Score = 256 bits (655), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 133/292 (45%), Positives = 183/292 (62%), Gaps = 12/292 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D ++WKDLNLKF++ +YRDY L D +L ++P C ++
Sbjct: 541 PDDEPWQRVNAYLMHDTANWKDLNLKFVLQVYRDYYLTHDALYLQDMWPVCQAVMESELK 600
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FD G+++NGG DQT AYCGGL +AA+ M EM+ +L DT+ +Q++ A
Sbjct: 601 FDTDNDGLIENGGIADQTYDAWVVDGASAYCGGLWLAAVRMMCEMAEVLGDTETRQKYDA 660
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAI--FPSAN 195
L K + + LLW G YY +DSS S S SIM+DQ G FL A G++ + + FP ++
Sbjct: 661 ILQKGKESFERLLWNGKYYNYDSSGSSTSSSIMSDQCAGQWFLGACGLDQKEVEVFPKSH 720
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
I AL TIF NV F G MGAVNGM+P+G D S++QS EVW GV Y L+A M+ EG
Sbjct: 721 IVSALKTIFEKNVMSFAGGKMGAVNGMRPDGVPDTSSVQSSEVWVGVVYALAATMIQEGL 780
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
V E + TA G YRTV+E G+ F+TPE K YRS YMR L++++MQ A
Sbjct: 781 VQEGFHTAEGCYRTVWEHLGMAFQTPEAYCEKKVYRSLAYMRPLSIWSMQLA 832
>gi|348520215|ref|XP_003447624.1| PREDICTED: non-lysosomal glucosylceramidase-like [Oreochromis
niloticus]
Length = 856
Score = 256 bits (654), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 123/292 (42%), Positives = 179/292 (61%), Gaps = 12/292 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D + WKDLNLKF++ +YRD+ L +D ++L ++P C ++
Sbjct: 547 PDDEPWMRVNAYLIHDTAGWKDLNLKFVLQVYRDFHLTQDRQYLQDMWPICQAVMESELK 606
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FD G G+++N G+ DQT AYCGG+ +A+L M +M+ LL+ + + +
Sbjct: 607 FDLDGDGLIENSGYADQTYDGWIVTGPSAYCGGMWVASLCVMCKMARLLDSEESYRHYKD 666
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAI--FPSAN 195
L + S + LLW G YY +DSS S+S+M+DQ G+ FL+ASG+ FP
Sbjct: 667 LLDRGSAAFEKLLWNGKYYNYDSSGRELSNSVMSDQCAGHWFLRASGLGDGEFQAFPKEK 726
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
I AL ++F NV F G MGAVNGM+P G D S++QS+EVW GV Y L+A M++EG
Sbjct: 727 IQSALKSVFDLNVMSFAGGQMGAVNGMRPEGVPDHSSVQSDEVWIGVVYGLAATMIHEGM 786
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
+E TA G YRTV+E+ G+ F+TPE YRS YMR L+++AMQ A
Sbjct: 787 QEEGMRTAEGCYRTVWEKLGMAFQTPEAYCEKNIYRSLAYMRPLSIWAMQLA 838
>gi|326934726|ref|XP_003213436.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Meleagris gallopavo]
Length = 639
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 133/292 (45%), Positives = 183/292 (62%), Gaps = 12/292 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D ++WKDLNLKF++ +YRDY L D +L ++P C ++
Sbjct: 327 PDDEPWQRVNAYLMHDTANWKDLNLKFVLQVYRDYYLTHDALYLQDMWPVCQAVMESELK 386
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FD G+++NGG DQT AYCGGL +AA+ M EM+ +L D + QQ++ A
Sbjct: 387 FDTDNDGLIENGGTADQTYDAWVVNGASAYCGGLWLAAVRMMCEMAEVLGDAETQQKYGA 446
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEA--IFPSAN 195
L K + + LLW G YY +DSS S S SIM+DQ G FL A G++ + +FP ++
Sbjct: 447 ILQKGKESFERLLWNGKYYNYDSSGSSTSSSIMSDQCAGQWFLGACGLDQKELEVFPKSH 506
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
I AL TIF NV F G+MGAVNGM+P+G D S++QS EVW GV Y L+A M+ EG
Sbjct: 507 IVSALKTIFEKNVMSFAGGTMGAVNGMRPDGVPDTSSVQSSEVWVGVVYALAATMIQEGL 566
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
V E + TA G YRTV+E G+ F+TPE K YRS YMR L++++MQ A
Sbjct: 567 VQEGFHTAEGCYRTVWENLGMAFQTPEAYCEKKVYRSLAYMRPLSIWSMQLA 618
>gi|449275451|gb|EMC84314.1| Non-lysosomal glucosylceramidase [Columba livia]
Length = 692
Score = 253 bits (645), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 129/290 (44%), Positives = 180/290 (62%), Gaps = 14/290 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P +EP+ +N Y + D ++WKDLNLKF++ +YRDY L D +L ++P C ++
Sbjct: 386 PADEPWQRVNAYLIHDTANWKDLNLKFVLQVYRDYYLTHDSLYLRDMWPVCQAVMESELK 445
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FD G+++NGGF DQT AYCGGL +AA+ M +M+ +L D + QQ++
Sbjct: 446 FDMDNDGLIENGGFADQTYDAWVVNGASAYCGGLWLAAVCMMCKMAEVLGDAEIQQKYMD 505
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIA 197
L K + + LLW G YY +DSS S S SIM+DQ G+ S +FP +++
Sbjct: 506 ILRKGKETFERLLWNGKYYNYDSSGSNTSSSIMSDQCAGHPKPPLS----MQVFPKSHVV 561
Query: 198 RALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
AL TIF NV GF G+MGAVNGM+P+G D S++QS EVW GV Y L+A M+ EG V+
Sbjct: 562 SALKTIFEKNVMGFAGGTMGAVNGMRPSGVPDTSSVQSNEVWVGVVYALAATMIQEGLVE 621
Query: 258 EAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
E + TA G YRTV+E+ G+ F+TPE K YRS YMR L++++MQ A
Sbjct: 622 EGFRTAEGCYRTVWEQLGMAFQTPEAYREKKVYRSLAYMRPLSIWSMQLA 671
>gi|196002317|ref|XP_002111026.1| hypothetical protein TRIADDRAFT_54543 [Trichoplax adhaerens]
gi|190586977|gb|EDV27030.1| hypothetical protein TRIADDRAFT_54543 [Trichoplax adhaerens]
Length = 843
Score = 250 bits (638), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 125/294 (42%), Positives = 180/294 (61%), Gaps = 14/294 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P +EP+ IN Y + DV DW DLN KF++ IYRDY ++ FL ++P ++ S
Sbjct: 538 PCDEPWKRINAYCISDVQDWTDLNTKFVLQIYRDYCATGNLEFLKDMWPCAKHVMETAIS 597
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
D G GI+ N G DQT AYCGG+ +AAL M +++ L+ + + ++S
Sbjct: 598 QDTDGDGIIDNNG-ADQTYDVWRMYGASAYCGGIFLAALYIMYKIADLIGCKEDKAKYSK 656
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L + + LW G YY +DSS+ HSDSIMADQ+CG+ +L+A G+ N + IFP +
Sbjct: 657 ILLRGRDSFQQKLWNGDYYLYDSSKGNHSDSIMADQMCGHWYLQACGLVNNNEDDIFPKS 716
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
+ +AL +F NV F G+ GAVNGMKPNG+ D S++QSEEVWTGVTY L+A ++ +G
Sbjct: 717 QVDKALRKVFDYNVMKFMNGTWGAVNGMKPNGEIDTSSVQSEEVWTGVTYALAANLIQQG 776
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAY 308
+ E + A G+Y T + GLGF+TPE YR GYMR L+++A++ A+
Sbjct: 777 MIKEGFKVAEGMYDTTFNVMGLGFQTPEAYYKINRYRCTGYMRPLSIWAIEWAF 830
>gi|260813884|ref|XP_002601646.1| hypothetical protein BRAFLDRAFT_124315 [Branchiostoma floridae]
gi|229286945|gb|EEN57658.1| hypothetical protein BRAFLDRAFT_124315 [Branchiostoma floridae]
Length = 871
Score = 246 bits (628), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 122/296 (41%), Positives = 183/296 (61%), Gaps = 19/296 (6%)
Query: 29 EEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESF 88
++EPF +N Y + D SDW+DLNLKF++ I+RDY + +D FL ++P ++ K ++
Sbjct: 562 DDEPFIHVNAYVMHDTSDWRDLNLKFVLQIFRDYFVGRDKGFLAFMWPRAQAVMTKALTW 621
Query: 89 DKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAK 138
D G++ N G DQT AYCGGL +AAL CMVE++++L +++Q + A
Sbjct: 622 DTDQDGMIDNSGAADQTYDAWIVTGASAYCGGLWLAALRCMVEIANILGKEEEEQHYLAV 681
Query: 139 LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE---------A 189
+ +A Y + LW G YY +D S +S+SIMA G+ +L+A G+ E
Sbjct: 682 MQRAKTAYEEKLWNGEYYNYDCSGRSYSNSIMAGAGSGHWYLRACGLVPEDNEVMDSDRE 741
Query: 190 IFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAA 249
+ P ++A +L ++ NV F G+MGAVNGM+PNG +D +++QSEEVWTG+TY L+A+
Sbjct: 742 VLPLDHVASSLRKVYDFNVMQFHQGTMGAVNGMRPNGKKDLTSMQSEEVWTGITYALAAS 801
Query: 250 MLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
M+ EG V+E + TA G+Y Y + G F+TPE +RS GYMR LA++AMQ
Sbjct: 802 MIQEGMVEEGFQTAYGVYNMCYLQCGFAFQTPEAYLERDFFRSLGYMRPLAIWAMQ 857
>gi|189239183|ref|XP_966847.2| PREDICTED: similar to bile acid beta-glucosidase, putative
[Tribolium castaneum]
gi|270010940|gb|EFA07388.1| hypothetical protein TcasGA2_TC016367 [Tribolium castaneum]
Length = 818
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 135/314 (42%), Positives = 186/314 (59%), Gaps = 19/314 (6%)
Query: 8 GEVKFRSAANTHERYYTRWFPEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--- 64
GEV R NT P EEPF L+N Y + DVS W+DLN KF++ ++RD +
Sbjct: 501 GEVCERKVPNTVPHDIGD--PGEEPFILLNSYPIHDVSQWRDLNSKFVLQVFRDAFITGL 558
Query: 65 -HKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHI 113
+ I +L +Y C ++ K FD G G+++N G PDQT AYCGGL +
Sbjct: 559 DDRSIAYLNDMYNACYTVMHKSLDFDVDGDGLIENSGSPDQTFDTWVMTGASAYCGGLWL 618
Query: 114 AALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQ 173
AAL M +++ L T+ +++F L K + + LW G Y FD S + SIMADQ
Sbjct: 619 AALFAMTKIADALQKTEDKEKFQELLDKGTAAFERKLWNGKCYNFDCS-DKECRSIMADQ 677
Query: 174 LCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAI 233
LCG +L++ G N E +FP + +L TI+ NV+ F G MGAVNG +G D I
Sbjct: 678 LCGQWYLRSCGFNYE-VFPQDRVKTSLKTIYENNVQSFCDGRMGAVNGFI-DGVIDEFTI 735
Query: 234 QSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSG 293
QS+EVWTGVTY L+A+ML EG EA+ TAGG+++++ ER GL F+TPE L K YR+
Sbjct: 736 QSQEVWTGVTYALAASMLQEGMRTEAFNTAGGMFKSMSERFGLSFDTPEALYAAKYYRAI 795
Query: 294 GYMRALAVYAMQDA 307
GYMR L++++MQ A
Sbjct: 796 GYMRPLSIWSMQLA 809
>gi|71992731|ref|NP_001021682.1| Protein HPO-13, isoform c [Caenorhabditis elegans]
gi|15130776|emb|CAC48128.1| Protein HPO-13, isoform c [Caenorhabditis elegans]
Length = 922
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 124/294 (42%), Positives = 187/294 (63%), Gaps = 19/294 (6%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL----HKDIRFLTRVYPTC-LELI 82
P +EP+ N Y + D WKDLN+K++++ +RDY + H+D FL +P + ++
Sbjct: 562 PADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSNEHED--FLFHTWPAVRMIML 619
Query: 83 RKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQ 132
E++D+ G G+++N G DQT AYCG L +A+L +EM+ L+ D + Q
Sbjct: 620 EALENWDQDGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVAIEMAGLMKDDETQ 679
Query: 133 QQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKA-SGVNTEAIF 191
Q F L KA +V+ D LWTG+Y++F +SR +++MADQLCGY FL++ S + +
Sbjct: 680 QLFRNTLEKAKKVFIDTLWTGTYFRF-CERSRSRETVMADQLCGYWFLQSVSPELADELL 738
Query: 192 PSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAML 251
P+ + AL TI+ NV F G MGAVNGMKP+G DR IQ++E+WTGVTY +++ ++
Sbjct: 739 PNHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWTGVTYAVASLLI 798
Query: 252 YEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+G V++A+ TA G Y T +E TGL ++TPE L K YR+ GYMR L+++AMQ
Sbjct: 799 QQGEVEKAFHTASGSYLTCFEETGLQYQTPEALYESKFYRAIGYMRPLSIWAMQ 852
>gi|71992723|ref|NP_001021681.1| Protein HPO-13, isoform b [Caenorhabditis elegans]
gi|50511739|emb|CAH04764.1| Protein HPO-13, isoform b [Caenorhabditis elegans]
Length = 930
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 124/294 (42%), Positives = 187/294 (63%), Gaps = 19/294 (6%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL----HKDIRFLTRVYPTC-LELI 82
P +EP+ N Y + D WKDLN+K++++ +RDY + H+D FL +P + ++
Sbjct: 570 PADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSNEHED--FLFHTWPAVRMIML 627
Query: 83 RKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQ 132
E++D+ G G+++N G DQT AYCG L +A+L +EM+ L+ D + Q
Sbjct: 628 EALENWDQDGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVAIEMAGLMKDDETQ 687
Query: 133 QQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKA-SGVNTEAIF 191
Q F L KA +V+ D LWTG+Y++F +SR +++MADQLCGY FL++ S + +
Sbjct: 688 QLFRNTLEKAKKVFIDTLWTGTYFRF-CERSRSRETVMADQLCGYWFLQSVSPELADELL 746
Query: 192 PSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAML 251
P+ + AL TI+ NV F G MGAVNGMKP+G DR IQ++E+WTGVTY +++ ++
Sbjct: 747 PNHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWTGVTYAVASLLI 806
Query: 252 YEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+G V++A+ TA G Y T +E TGL ++TPE L K YR+ GYMR L+++AMQ
Sbjct: 807 QQGEVEKAFHTASGSYLTCFEETGLQYQTPEALYESKFYRAIGYMRPLSIWAMQ 860
>gi|71992715|ref|NP_001021680.1| Protein HPO-13, isoform a [Caenorhabditis elegans]
gi|18376555|emb|CAD21661.1| Protein HPO-13, isoform a [Caenorhabditis elegans]
Length = 959
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 124/294 (42%), Positives = 187/294 (63%), Gaps = 19/294 (6%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL----HKDIRFLTRVYPTC-LELI 82
P +EP+ N Y + D WKDLN+K++++ +RDY + H+D FL +P + ++
Sbjct: 599 PADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSNEHED--FLFHTWPAVRMIML 656
Query: 83 RKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQ 132
E++D+ G G+++N G DQT AYCG L +A+L +EM+ L+ D + Q
Sbjct: 657 EALENWDQDGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVAIEMAGLMKDDETQ 716
Query: 133 QQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKA-SGVNTEAIF 191
Q F L KA +V+ D LWTG+Y++F +SR +++MADQLCGY FL++ S + +
Sbjct: 717 QLFRNTLEKAKKVFIDTLWTGTYFRF-CERSRSRETVMADQLCGYWFLQSVSPELADELL 775
Query: 192 PSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAML 251
P+ + AL TI+ NV F G MGAVNGMKP+G DR IQ++E+WTGVTY +++ ++
Sbjct: 776 PNHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWTGVTYAVASLLI 835
Query: 252 YEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+G V++A+ TA G Y T +E TGL ++TPE L K YR+ GYMR L+++AMQ
Sbjct: 836 QQGEVEKAFHTASGSYLTCFEETGLQYQTPEALYESKFYRAIGYMRPLSIWAMQ 889
>gi|268569852|ref|XP_002640631.1| Hypothetical protein CBG08749 [Caenorhabditis briggsae]
Length = 899
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 121/292 (41%), Positives = 186/292 (63%), Gaps = 15/292 (5%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTC-LELIRK 84
P +EP+ N Y + D WKDLN+K++++ +RDY + + FL +P + ++
Sbjct: 547 PADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSEEHQEFLHHTWPAVKMIMLEA 606
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQ 134
++D+ G G+++N G DQT AYCG L +A+L +EM+ LL D + ++
Sbjct: 607 LANWDQNGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVSIEMAGLLEDGETKKL 666
Query: 135 FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKA-SGVNTEAIFPS 193
F L KA +V+ D+LWTG+Y++F +SR +++MADQLCGY FL++ S + + PS
Sbjct: 667 FLETLNKAKKVFVDVLWTGTYFRF-CERSRSRETVMADQLCGYWFLQSVSPEMADDLLPS 725
Query: 194 ANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYE 253
+ AL TI+ NV F G MGAVNGMKP+G DR IQ++E+WTGVTY +++ ++ +
Sbjct: 726 HMVRSALDTIYRLNVCQFGNGQMGAVNGMKPSGVVDREYIQADEMWTGVTYAVASLLIQQ 785
Query: 254 GNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
G V++A+ TA G Y T +E+TGL ++TPE L K YR+ GYMR L+++AMQ
Sbjct: 786 GEVEKAFHTASGSYLTCFEQTGLQYQTPEALYESKFYRAIGYMRPLSIWAMQ 837
>gi|308485740|ref|XP_003105068.1| hypothetical protein CRE_20734 [Caenorhabditis remanei]
gi|308257013|gb|EFP00966.1| hypothetical protein CRE_20734 [Caenorhabditis remanei]
Length = 933
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 120/292 (41%), Positives = 184/292 (63%), Gaps = 15/292 (5%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTC-LELIRK 84
P +EP+ N Y + D WKDLN+K++++ +RDY + + FL +P + ++
Sbjct: 575 PADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSEEHQEFLFHTWPAVKMIMLEA 634
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQ 134
E++D+ G G+++N G DQT AYCG L +A+L +EM+ L+ + +
Sbjct: 635 LENWDQNGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVSIEMARLMGEEATENL 694
Query: 135 FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKA-SGVNTEAIFPS 193
F L KA +V+ D LWTG+Y++F +SR +++MADQLCGY FL++ S + + P+
Sbjct: 695 FRVTLNKAKKVFIDTLWTGTYFRF-CERSRSRETVMADQLCGYWFLQSVSPEMVDDLLPN 753
Query: 194 ANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYE 253
+ AL TI+ NV F G MGAVNGMKPNG DR IQ++E+WTGVTY +++ ++ +
Sbjct: 754 HMVRSALDTIYRLNVCRFGNGQMGAVNGMKPNGVVDREYIQADEMWTGVTYAVASLLIQQ 813
Query: 254 GNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
G V++A+ TA G Y T +E+TGL ++TPE L K YR+ GYMR L+++AMQ
Sbjct: 814 GEVEKAFHTASGSYLTCFEQTGLQYQTPEALYESKFYRAIGYMRPLSIWAMQ 865
>gi|341901617|gb|EGT57552.1| hypothetical protein CAEBREN_15139 [Caenorhabditis brenneri]
Length = 925
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 120/292 (41%), Positives = 185/292 (63%), Gaps = 15/292 (5%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIR--FLTRVYPTC-LELIRK 84
P +EP+ N Y + D WKDLN+K++++ +RDY + + FL +P + ++
Sbjct: 551 PADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSEAHQEFLHHTWPAIKMIMLEA 610
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQ 134
E++D+ G G+++N G DQT AYCG L +A+L VEM++++ D +
Sbjct: 611 LENWDQNGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVSVEMAAIVKDEITENL 670
Query: 135 FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKA-SGVNTEAIFPS 193
F L KA +V+ DLLWTG+Y++F +SR +++MADQLCGY FL++ S + + P+
Sbjct: 671 FKGTLEKAKKVFIDLLWTGTYFRF-CERSRSRETVMADQLCGYWFLQSVSPEMADDLLPN 729
Query: 194 ANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYE 253
+ AL TI+ NV F G MGAVNGMKP+G DR IQ++E+WTGVTY +++ ++ +
Sbjct: 730 HMVRSALDTIYRLNVCKFGNGQMGAVNGMKPSGVVDREYIQADEMWTGVTYAVASLLIQQ 789
Query: 254 GNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
G V++A+ TA G Y T +E+ GL ++TPE L K YR+ GYMR L+++AMQ
Sbjct: 790 GEVEKAFHTASGSYLTCFEQAGLQYQTPEALYESKFYRAIGYMRPLSIWAMQ 841
>gi|297795757|ref|XP_002865763.1| hypothetical protein ARALYDRAFT_331392 [Arabidopsis lyrata subsp.
lyrata]
gi|297311598|gb|EFH42022.1| hypothetical protein ARALYDRAFT_331392 [Arabidopsis lyrata subsp.
lyrata]
Length = 956
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 123/296 (41%), Positives = 171/296 (57%), Gaps = 14/296 (4%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +NGY + + WKDLN KF++ +YRD D +F + V+P+ + FDK
Sbjct: 605 DPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMAYMAQFDK 664
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N GFPDQT AYCGGL +AAL ++ ++ D Q F +K
Sbjct: 665 DGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASVLARVVGDKNSQDYFWSKFQ 724
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY LW GSY+ +D+S S++S SI ADQL G + +ASG+ I AL
Sbjct: 725 KAKVVYEKKLWNGSYFNYDNSGSQYSSSIQADQLAGQWYARASGL--LPIVDEDKARTAL 782
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
+++ NV + G GAVNGM PNG D +++QS E+W+GVTY LSA M+ EG V+ A+
Sbjct: 783 EKVYNFNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMAF 842
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 314
TA G+Y + GLG F+TPE + YRS YMR LA++AMQ A K K
Sbjct: 843 QTASGVYEAAWSEAGLGYSFQTPEAWNTNDEYRSLTYMRPLAIWAMQWALTKTSQK 898
>gi|10047285|dbj|BAB13431.1| KIAA1605 protein [Homo sapiens]
Length = 922
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 111/239 (46%), Positives = 157/239 (65%), Gaps = 14/239 (5%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 643 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 702
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q +FS+
Sbjct: 703 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 762
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +TE +FP+
Sbjct: 763 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 821
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYE 253
++ RAL TIF NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ E
Sbjct: 822 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 880
>gi|119578751|gb|EAW58347.1| glucosidase, beta (bile acid) 2, isoform CRA_c [Homo sapiens]
gi|168270622|dbj|BAG10104.1| non-lysosomal glucosylceramidase [synthetic construct]
Length = 877
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 111/239 (46%), Positives = 157/239 (65%), Gaps = 14/239 (5%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 598 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 657
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q +FS+
Sbjct: 658 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSS 717
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +DSS S S+M+DQ G FLKA G+ +TE +FP+
Sbjct: 718 ILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTE-VFPTQ 776
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYE 253
++ RAL TIF NV+ F G+MGAVNGM+P+G D+S++QS+EVW GV Y L+A M+ E
Sbjct: 777 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 835
>gi|148670515|gb|EDL02462.1| glucosidase beta 2, isoform CRA_b [Mus musculus]
Length = 377
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 112/239 (46%), Positives = 155/239 (64%), Gaps = 14/239 (5%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ IYRDY L D FL ++P CL ++
Sbjct: 89 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMK 148
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M+ L Q++F++
Sbjct: 149 FDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFAS 208
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L + + Y LLW G YY +DSS S SIM+DQ G FL+A G+ +TE +FP+
Sbjct: 209 ILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTE-VFPTL 267
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYE 253
++ RAL TIF NV+ F G+MGAVNGM P+G DRS++QS+EVW GV Y L+A M+ E
Sbjct: 268 HVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQE 326
>gi|312382731|gb|EFR28085.1| hypothetical protein AND_04399 [Anopheles darlingi]
Length = 777
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 114/263 (43%), Positives = 168/263 (63%), Gaps = 18/263 (6%)
Query: 68 IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
+ +L +YP C ++ + +DK G+++N PDQT AYCGGL +A+L
Sbjct: 515 MTYLKAMYPACKLVLERSREWDKDDDGLIENSRSPDQTYDSWVMDGPSAYCGGLWLASLH 574
Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
CM M+SLL+ T+ + A L K + + + LW G+YYKFD+ QS +SIM+DQLCG+
Sbjct: 575 CMATMASLLDQTEDAANYQAILDKGRRSFEEKLWNGTYYKFDA-QSASKNSIMSDQLCGH 633
Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN------GDRDRS 231
+L+A G + + +FP N+ A+ TI+ NV F G++GAVNG PN G D S
Sbjct: 634 WYLRACGFDYD-VFPKENVRLAMRTIYENNVMRFCGGNLGAVNGYVPNAQPNKEGRPDVS 692
Query: 232 AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYR 291
IQ EEVWTGVTY L+A M++EG +EA+ TAGGLYR++ E+ G+ FETPE + ++ YR
Sbjct: 693 NIQGEEVWTGVTYALAATMIHEGMFEEAFQTAGGLYRSLSEKIGMNFETPEAVYAERHYR 752
Query: 292 SGGYMRALAVYAMQDAYLKGKVK 314
+ GYMR L++++MQ A+ K++
Sbjct: 753 AIGYMRPLSIWSMQTAWELRKLR 775
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 23/35 (65%), Positives = 29/35 (82%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
P EEPF LIN Y + DVS+W+DLN+KFI+ +YRDY
Sbjct: 405 PAEEPFDLINAYPIHDVSEWRDLNIKFILQVYRDY 439
>gi|357630375|gb|EHJ78536.1| hypothetical protein KGM_16417 [Danaus plexippus]
Length = 912
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 119/253 (47%), Positives = 168/253 (66%), Gaps = 15/253 (5%)
Query: 69 RFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALAC 118
++L+ +YP+C+ L+R+ +D+ G G+++NGGFPDQT AYCGGL +A+++
Sbjct: 641 QYLSDMYPSCVTLLRRGLDWDRDGDGLIENGGFPDQTYDAWVMTGPSAYCGGLWVASVSA 700
Query: 119 MVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
+ M+ +L T +++FS L KA Y LW GSYYKFD+ +S+ +MADQL G
Sbjct: 701 VHAMAKILGFTDDEKEFSTLLEKARDSYERKLWNGSYYKFDTKPC-NSEVVMADQLAGQW 759
Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGM--KPNGDRDRSAIQSE 236
FL+ASG TE +FP AN+ +AL TI+ NV+ F G MGAVNG P D +AIQSE
Sbjct: 760 FLRASGW-TEPVFPEANVKKALHTIYENNVQRFLNGRMGAVNGFVRGPRPGIDTTAIQSE 818
Query: 237 EVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYM 296
EVWTGVTY L+A M+YEG ++A++TAGGLY T+ + GL FETPE L + +RS YM
Sbjct: 819 EVWTGVTYGLAALMIYEGMHEQAFSTAGGLYNTLM-KMGLAFETPEALYENGNHRSVAYM 877
Query: 297 RALAVYAMQDAYL 309
R L++++M A +
Sbjct: 878 RPLSIWSMYHAII 890
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/37 (62%), Positives = 29/37 (78%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL 64
P+ PFS IN YN+ DVS+W+DLNLKFI+ + RDY L
Sbjct: 492 PDGVPFSHINAYNIHDVSEWRDLNLKFILQVMRDYRL 528
>gi|168014916|ref|XP_001759997.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688747|gb|EDQ75122.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 970
Score = 223 bits (569), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 121/287 (42%), Positives = 167/287 (58%), Gaps = 12/287 (4%)
Query: 29 EEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESF 88
+ +P++ +N YN+ D S WKDLN KF++ +YRD D F V+P + + F
Sbjct: 620 QHDPWAEVNAYNIHDTSRWKDLNAKFVLQVYRDVVATGDKVFAKAVWPAVYAAMAYLDQF 679
Query: 89 DKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAK 138
D+ G+++N GFPDQT AYCGGL IAAL M+ LL + + F +K
Sbjct: 680 DRDRDGLIENDGFPDQTYDTWTVHGVSAYCGGLWIAALQAAEAMAVLLEEKNAARYFQSK 739
Query: 139 LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIAR 198
+A + Y LW GSY+ +DS S +S SI ADQ+ G + ASG+ +F A
Sbjct: 740 FIQAREAYLKKLWNGSYFSYDSGNSSNSKSIQADQMAGQWYAWASGL--PPLFDDAKARS 797
Query: 199 ALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 258
AL IF NV + G +GAVNGM PNG D + +QS EVWTGVTY L+AAM++EG +D+
Sbjct: 798 ALQKIFDFNVMKVKNGHLGAVNGMHPNGKVDETCMQSREVWTGVTYALAAAMIHEGMLDQ 857
Query: 259 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
A+ A G++ + G F+TPE T D +RS YMR LA++AMQ
Sbjct: 858 AFRAAEGVFIAGWSDLGYWFQTPEAWTMDGYFRSLAYMRPLAIWAMQ 904
>gi|431902810|gb|ELK09025.1| Non-lysosomal glucosylceramidase [Pteropus alecto]
Length = 970
Score = 223 bits (568), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 111/242 (45%), Positives = 157/242 (64%), Gaps = 14/242 (5%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P CL ++
Sbjct: 566 PDDEPWLRVNAYMIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMK 625
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L +FS+
Sbjct: 626 FDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAALCGTQDILDKFSS 685
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
L++ + Y LLW G YY +D S S SIM+DQ G FL+ASG+ +TE +FP+
Sbjct: 686 ILSRGREAYERLLWNGHYYNYDCSSQPQSYSIMSDQCAGQWFLRASGLGEGDTE-VFPTP 744
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ AL TIF NV+ F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ E
Sbjct: 745 HVVCALQTIFKFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQET 804
Query: 255 NV 256
N+
Sbjct: 805 NL 806
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 27/38 (71%)
Query: 267 YRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
YRTV+ER GL F+TPE + +RS YMR L+++AM
Sbjct: 891 YRTVWERLGLAFQTPEAYCQHQVFRSLAYMRPLSIWAM 928
>gi|110742221|dbj|BAE99037.1| hypothetical protein [Arabidopsis thaliana]
Length = 957
Score = 222 bits (566), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 123/296 (41%), Positives = 172/296 (58%), Gaps = 14/296 (4%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +NGY + + WKDLN KF++ +YRD D +F + V+P+ + FDK
Sbjct: 605 DPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMAYMAQFDK 664
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N GFPDQT AYCGGL +AAL ++ ++ D Q F +K
Sbjct: 665 DGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGDKNSQDYFWSKFQ 724
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY LW GSY+ +D+S S++S +I ADQL G + +ASG+ I AL
Sbjct: 725 KAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGL--LPIVDEDKARTAL 782
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
+++ NV + G GAVNGM PNG D +++QS E+W+GVTY LSA M+ EG V+ A+
Sbjct: 783 EKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMAF 842
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 314
TA G+Y + +TGLG F+TPE YRS YMR LA++AMQ A K K
Sbjct: 843 QTASGIYEAAWSKTGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQK 898
>gi|8777421|dbj|BAA97011.1| unnamed protein product [Arabidopsis thaliana]
Length = 928
Score = 221 bits (564), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 123/296 (41%), Positives = 171/296 (57%), Gaps = 14/296 (4%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +NGY + + WKDLN KF++ +YRD D +F + V+P+ + FDK
Sbjct: 576 DPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMAYMAQFDK 635
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N GFPDQT AYCGGL +AAL ++ ++ D Q F +K
Sbjct: 636 DGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGDKNSQDYFWSKFQ 695
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY LW GSY+ +D+S S++S +I ADQL G + +ASG+ I AL
Sbjct: 696 KAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGL--LPIVDEDKARTAL 753
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
+++ NV + G GAVNGM PNG D +++QS E+W+GVTY LSA M+ EG V+ A+
Sbjct: 754 EKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMAF 813
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 314
TA G+Y + TGLG F+TPE YRS YMR LA++AMQ A K K
Sbjct: 814 QTASGIYEAAWSETGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQK 869
>gi|222423179|dbj|BAH19567.1| AT5G49900 [Arabidopsis thaliana]
Length = 957
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 123/296 (41%), Positives = 171/296 (57%), Gaps = 14/296 (4%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +NGY + + WKDLN KF++ +YRD D +F + V+P+ + FDK
Sbjct: 605 DPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMAYMAQFDK 664
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N GFPDQT AYCGGL +AAL ++ ++ D Q F +K
Sbjct: 665 DGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGDKNSQDYFWSKFQ 724
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY LW GSY+ +D+S S++S +I ADQL G + +ASG+ I AL
Sbjct: 725 KAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGL--LPIVDEDKARTAL 782
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
+++ NV + G GAVNGM PNG D +++QS E+W+GVTY LSA M+ EG V+ A+
Sbjct: 783 EKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMAF 842
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 314
TA G+Y + TGLG F+TPE YRS YMR LA++AMQ A K K
Sbjct: 843 QTASGIYEAAWSETGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQK 898
>gi|30695777|ref|NP_199801.2| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|27311753|gb|AAO00842.1| putative protein [Arabidopsis thaliana]
gi|34365725|gb|AAQ65174.1| At5g49900 [Arabidopsis thaliana]
gi|332008486|gb|AED95869.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
Length = 957
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 123/296 (41%), Positives = 171/296 (57%), Gaps = 14/296 (4%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +NGY + + WKDLN KF++ +YRD D +F + V+P+ + FDK
Sbjct: 605 DPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMAYMAQFDK 664
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N GFPDQT AYCGGL +AAL ++ ++ D Q F +K
Sbjct: 665 DGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGDKNSQDYFWSKFQ 724
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY LW GSY+ +D+S S++S +I ADQL G + +ASG+ I AL
Sbjct: 725 KAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGL--LPIVDEDKARTAL 782
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
+++ NV + G GAVNGM PNG D +++QS E+W+GVTY LSA M+ EG V+ A+
Sbjct: 783 EKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMAF 842
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 314
TA G+Y + TGLG F+TPE YRS YMR LA++AMQ A K K
Sbjct: 843 QTASGIYEAAWSETGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQK 898
>gi|157119417|ref|XP_001653371.1| bile acid beta-glucosidase, putative [Aedes aegypti]
gi|108883154|gb|EAT47379.1| AAEL001478-PA, partial [Aedes aegypti]
Length = 898
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 111/261 (42%), Positives = 162/261 (62%), Gaps = 18/261 (6%)
Query: 68 IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
+ +L +YP C +++ K +DK GI++N PDQT AYCGGL +A+L
Sbjct: 636 MTYLKSMYPACKQVLEKTLEWDKDNDGIIENSKCPDQTYDTWVMDGPSAYCGGLWLASLH 695
Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
CM M++LL+ +F L K + + LW G+YYKFD QS ++IM+DQLCG+
Sbjct: 696 CMTAMANLLDQNDDCNKFREILEKGKTSFEEKLWNGTYYKFDG-QSASKNTIMSDQLCGH 754
Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP------NGDRDRS 231
+L+ G + + IFP N+ AL TI+ NV F G+MGAVNG P +G D S
Sbjct: 755 WYLRCCGFDYD-IFPKENVRTALKTIYDNNVMRFCGGNMGAVNGYVPSTQPNKDGRADAS 813
Query: 232 AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYR 291
+Q EEVWTGVTY L++ M++EG EA+ TAGG+Y+T+ E+ G+ FETPE L ++ YR
Sbjct: 814 TVQGEEVWTGVTYALASTMIHEGMFTEAFQTAGGMYQTLSEKIGMNFETPEALYAERHYR 873
Query: 292 SGGYMRALAVYAMQDAYLKGK 312
+ GYMR L+++++Q A+ + K
Sbjct: 874 AIGYMRPLSIWSLQTAWEQKK 894
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/35 (65%), Positives = 29/35 (82%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
P+EEPF LIN Y + DVS+W+DLN KFI+ +YRDY
Sbjct: 533 PDEEPFDLINAYPIHDVSEWRDLNTKFILQVYRDY 567
>gi|168037390|ref|XP_001771187.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677567|gb|EDQ64036.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 981
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 120/294 (40%), Positives = 165/294 (56%), Gaps = 12/294 (4%)
Query: 29 EEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESF 88
+ +P+ +N YN+ D S WKDLN KF++ +YRD D +F V+P + + F
Sbjct: 615 QHDPWVEVNAYNIHDTSHWKDLNPKFVLQVYRDVVATGDRQFAKAVWPAVYAAMAYMDQF 674
Query: 89 DKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAK 138
D+ G+++N GFPDQT AYCGGL IAAL M+ L++D F K
Sbjct: 675 DRDRDGLIENDGFPDQTYDTWSVHGVSAYCGGLWIAALQAAAAMADLVDDKDAATYFKGK 734
Query: 139 LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIAR 198
+A VY LW G Y+ +DS S +S+S+ ADQL G + ASG+ +F
Sbjct: 735 FNQARDVYERKLWNGEYFNYDSGTSSNSNSVQADQLAGQWYAWASGL--PPLFDDYKARS 792
Query: 199 ALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDE 258
AL +F NV + G GA NGM PNG D + +QS E+WTGVTY SAAM++EG V++
Sbjct: 793 ALQKVFDFNVMKVKGGKWGAANGMHPNGKVDETCMQSREIWTGVTYAASAAMIHEGMVEQ 852
Query: 259 AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 312
A+T A G++ + G F+TPE T D +RS YMR LA++AMQ A K
Sbjct: 853 AFTAAQGVFLAGWSDLGYWFQTPEAWTIDGYFRSLAYMRPLAIWAMQWALYPPK 906
>gi|170036781|ref|XP_001846240.1| bile acid beta-glucosidase [Culex quinquefasciatus]
gi|167879683|gb|EDS43066.1| bile acid beta-glucosidase [Culex quinquefasciatus]
Length = 913
Score = 220 bits (560), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 112/261 (42%), Positives = 165/261 (63%), Gaps = 18/261 (6%)
Query: 68 IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
+ +L +YP C +++ K +DK G+++N PDQT +YCGGL +A+L
Sbjct: 651 MTYLKSMYPACKQVLEKTLEWDKDNDGLIENSKSPDQTYDTWVMDGPSSYCGGLWLASLH 710
Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
CM M++LL+ ++ ++ L K + + LW GSYYKFDS QS +SIM+DQLCG+
Sbjct: 711 CMSVMANLLDQSEDCIKYREILEKGKASFEEKLWNGSYYKFDS-QSSSKNSIMSDQLCGH 769
Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP------NGDRDRS 231
+L+ G + + I P N+ AL TI+ NV F G+MGAVNG P +G D S
Sbjct: 770 WYLRCCGFDYD-IVPKENVRTALKTIYDNNVMRFCGGNMGAVNGYIPSSQPNKDGRADSS 828
Query: 232 AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYR 291
+Q+EEVWTGVTY L++ M++EG EA+ TAGGLY+T+ E+ G+ FETPE L ++ YR
Sbjct: 829 TLQAEEVWTGVTYALASTMIHEGMFTEAFQTAGGLYQTLSEKIGMSFETPEALYAERHYR 888
Query: 292 SGGYMRALAVYAMQDAYLKGK 312
+ GYMR L++++MQ A+ + K
Sbjct: 889 AIGYMRPLSIWSMQTAWEQKK 909
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 29/35 (82%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
P+EEPF LIN Y + DVS+WKDLN KFI+ +YRDY
Sbjct: 548 PDEEPFELINAYPIHDVSEWKDLNTKFILQVYRDY 582
>gi|444729905|gb|ELW70308.1| Non-lysosomal glucosylceramidase [Tupaia chinensis]
Length = 279
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 112/230 (48%), Positives = 152/230 (66%), Gaps = 14/230 (6%)
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FDK G+++NGG+ DQT AYCGGL +AA+A MV+M++L Q++F++
Sbjct: 10 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDVQEKFTS 69
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV---NTEAIFPSA 194
LT+ + Y LLW G YY +DSS S SIM+DQ G FLKA G+ +TE +FP+
Sbjct: 70 ILTRGQEAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLKACGLGEGDTE-VFPTP 128
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
++ RAL TIF NV+ F G+MGAVNGM+P+G DRS++QS+EVW GV Y L+A M+ EG
Sbjct: 129 HVVRALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEG 188
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
E + TA G YRTV+ER GL F+TPE + +RS YMR L+++AM
Sbjct: 189 LTWEGFRTAEGCYRTVWERLGLAFQTPEAYCQQRVFRSLAYMRPLSIWAM 238
>gi|158299437|ref|XP_319575.4| AGAP008830-PA [Anopheles gambiae str. PEST]
gi|157013519|gb|EAA14848.4| AGAP008830-PA [Anopheles gambiae str. PEST]
Length = 913
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 108/263 (41%), Positives = 166/263 (63%), Gaps = 18/263 (6%)
Query: 68 IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
+ +L +YP C ++ +DK G G+++N PDQT AYCGGL +A+L
Sbjct: 651 LTYLKAMYPACRVVLEHSLEWDKDGDGLIENSKAPDQTYDTWVMDGPSAYCGGLWLASLH 710
Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
CM M+SLL+ + ++ A L K + + LW G+YY+FD+ QS +SIM+DQLCG+
Sbjct: 711 CMSAMASLLDQNEDSDRYKAVLDKGRASFEEKLWNGTYYRFDA-QSASKNSIMSDQLCGH 769
Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS------ 231
+L++ G + + +FP N+ A+ TI+ NV F G +GAVNG P+G ++
Sbjct: 770 WYLRSCGFDYD-VFPKENVRLAMRTIYENNVMRFCGGQLGAVNGYVPSGQPNKDGRPDTV 828
Query: 232 AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYR 291
+IQ EEVWTGVTY L++ M++EG +EA+ TAGGLY+ + ER G+ FETPE + ++ YR
Sbjct: 829 SIQGEEVWTGVTYALASTMIHEGMFEEAFKTAGGLYQALSERIGMNFETPEAVYAERHYR 888
Query: 292 SGGYMRALAVYAMQDAYLKGKVK 314
+ GYMR L++++MQ A+ K++
Sbjct: 889 AIGYMRPLSIWSMQTAWEMKKLR 911
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 29/35 (82%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
P EEPF LIN Y + DVS+W+DLNLKFI+ +YRDY
Sbjct: 541 PAEEPFDLINAYPIHDVSEWRDLNLKFILQVYRDY 575
>gi|324504438|gb|ADY41918.1| Non-lysosomal glucosylceramidase [Ascaris suum]
Length = 534
Score = 218 bits (554), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 120/301 (39%), Positives = 165/301 (54%), Gaps = 22/301 (7%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRF----------LTRVYPT 77
P +EP+ +N Y + D +WKDLNLKF+++ YRDY I F L R Y
Sbjct: 226 PMDEPWLHLNAYALSDTCEWKDLNLKFVLTCYRDYEKIVKIYFNDDNEMKGCLLRRFYDL 285
Query: 78 CLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLN 127
+I +++D G +++N G PDQT AYCGGL + AL C+ M+ L
Sbjct: 286 SSGIIADAKAWDVDGDDLIENAGQPDQTYDVWSMHGSSAYCGGLWLCALECVRRMALTLG 345
Query: 128 DTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKAS-GVN 186
+ Q+F+ KL A + Y LW G Y+ FD + H SIMADQLCG+ F+ + G
Sbjct: 346 EVVDAQKFANKLNNARKAYERKLWNGKYFDFDEHSTDHK-SIMADQLCGFWFMCITDGKV 404
Query: 187 TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLL 246
+ I I +L TIF NV F G +G VN M P+G D + IQSEEVW GV Y L
Sbjct: 405 DDVIITRQQICASLKTIFEYNVEKFANGQLGPVNAMMPSGVVDSTGIQSEEVWGGVAYAL 464
Query: 247 SAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQD 306
++ L + A+ TA G YR+ +ER GL +++PE + YR+ GYMR LA++AMQ
Sbjct: 465 ASFHLLVEENESAFKTAEGWYRSCWERYGLQYQSPEAINESSYYRAIGYMRPLAIWAMQS 524
Query: 307 A 307
A
Sbjct: 525 A 525
>gi|326516802|dbj|BAJ96393.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 970
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 123/288 (42%), Positives = 164/288 (56%), Gaps = 14/288 (4%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ D S WKDLN KF++ +YRD+A D+ F V+P + + FD
Sbjct: 630 HDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMTFGRDVWPAVSAAMDYMDQFD 689
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
+ G G+++N GFPDQT AYCGGL +AAL M+ L D +++ K
Sbjct: 690 RDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPYAEKYKLKF 749
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
KA VY LW GSY+ +DS S +S SI ADQL G + +SG+ IF I A
Sbjct: 750 MKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPIFDEHKIRSA 807
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L IF NV + G MGAVNGM P G D + +QS E+WTGVTY ++A ML G +
Sbjct: 808 LQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQG 867
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+ TA G++ + G G F+TPEG T D YRS YMR LA++AMQ
Sbjct: 868 FITAEGIFLAGWSEDGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAMQ 915
>gi|414871104|tpg|DAA49661.1| TPA: hypothetical protein ZEAMMB73_422318 [Zea mays]
Length = 649
Score = 217 bits (553), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 123/290 (42%), Positives = 166/290 (57%), Gaps = 14/290 (4%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ D S WKDLN KF++ IYRD+A D++F V+P + + FD
Sbjct: 309 HDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAATGDMQFGRDVWPAVCAAMDYMDQFD 368
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
+ G G+++N GFPDQT AYCG L +AAL M+ L D +++ K
Sbjct: 369 RDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRHFAEKYKLKF 428
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
KA VY LW GSY+ +DS S +S SI ADQL G + +SG+ +F I A
Sbjct: 429 IKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPLFDEHKIRTA 486
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L IF NV + G MGAVNGM P G D + +QS E+WTGVTY ++A ML G +
Sbjct: 487 LQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLLHGMEHQG 546
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
+TTA G++ + G G F+TPEG T D YRS YMR LA++A+Q A
Sbjct: 547 FTTAEGIFTAGWSEEGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAIQYA 596
>gi|330844075|ref|XP_003293963.1| hypothetical protein DICPUDRAFT_158887 [Dictyostelium purpureum]
gi|325075658|gb|EGC29519.1| hypothetical protein DICPUDRAFT_158887 [Dictyostelium purpureum]
Length = 1205
Score = 217 bits (553), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 116/296 (39%), Positives = 168/296 (56%), Gaps = 13/296 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELI-RKCE 86
P E+P+ +N YN+ D+S WKDL KF++ +YRDY + +D FL +VY E+I R E
Sbjct: 885 PGEDPWKRVNSYNIQDISRWKDLPSKFVLQVYRDYLVVEDKNFLLQVYNVVEEVIQRTLE 944
Query: 87 SFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFS 136
SFD G+V N GFPDQT AY GGL +A+L EM+ +L + + ++
Sbjct: 945 SFDTDHDGVVDNEGFPDQTYDVWPAVGCSAYSGGLWLASLKVASEMAKILGFKEDESIYN 1004
Query: 137 AKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANI 196
A K S Y LW G Y+K+D S S H+DSIM+D L G+ +L + G+ + F A
Sbjct: 1005 AIFEKGSASYTKKLWNGYYFKYDCSNSVHADSIMSDMLAGHWYLLSCGLPSYMTFDQA-- 1062
Query: 197 ARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNV 256
+L+ I NV + GAVNGM+P G D + +QS EVW G ++ L+A M+
Sbjct: 1063 LSSLSIINEYNVNSYGKERCGAVNGMRPEGVVDNTCLQSSEVWIGTSFSLAATMIQHHMD 1122
Query: 257 DEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 312
+AW G+ + Y R G ++TPE + YR+G YMR L+++++Q A LK K
Sbjct: 1123 KDAWELVKGIVNSSYNRWGFQYQTPEAWDSNGCYRAGAYMRPLSIWSIQWALLKKK 1178
>gi|16905165|gb|AAL31035.1|AC078948_19 unknown protein [Oryza sativa Japonica Group]
Length = 967
Score = 217 bits (553), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 122/288 (42%), Positives = 166/288 (57%), Gaps = 14/288 (4%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ D S WKDLN KF++ +YRD+A D+ F V+P ++ FD
Sbjct: 627 HDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVWPAVCAVMDYMNQFD 686
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
+ G G+++N GFPDQT AYCGGL +AAL M+ L D +++ K
Sbjct: 687 RDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPFAEKYKLKF 746
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
+A VY LW GSY+ +DS S +S SI ADQL G + +SG+ +F I A
Sbjct: 747 IQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPLFDENKIRSA 804
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L IF NV + G +GAVNGM PNG D + +QS E+WTGVTY ++A ML G +
Sbjct: 805 LQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQG 864
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+TTA G++ + G G F+TPEG T D YRS YMR LA++AMQ
Sbjct: 865 FTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 912
>gi|222612993|gb|EEE51125.1| hypothetical protein OsJ_31868 [Oryza sativa Japonica Group]
Length = 935
Score = 217 bits (553), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 122/288 (42%), Positives = 166/288 (57%), Gaps = 14/288 (4%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ D S WKDLN KF++ +YRD+A D+ F V+P ++ FD
Sbjct: 595 HDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVWPAVCAVMDYMNQFD 654
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
+ G G+++N GFPDQT AYCGGL +AAL M+ L D +++ K
Sbjct: 655 RDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPFAEKYKLKF 714
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
+A VY LW GSY+ +DS S +S SI ADQL G + +SG+ +F I A
Sbjct: 715 IQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPLFDENKIRSA 772
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L IF NV + G +GAVNGM PNG D + +QS E+WTGVTY ++A ML G +
Sbjct: 773 LQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQG 832
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+TTA G++ + G G F+TPEG T D YRS YMR LA++AMQ
Sbjct: 833 FTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 880
>gi|115482480|ref|NP_001064833.1| Os10g0473400 [Oryza sativa Japonica Group]
gi|110289241|gb|AAP54244.2| expressed protein [Oryza sativa Japonica Group]
gi|113639442|dbj|BAF26747.1| Os10g0473400 [Oryza sativa Japonica Group]
gi|215695411|dbj|BAG90602.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 974
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 122/288 (42%), Positives = 166/288 (57%), Gaps = 14/288 (4%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ D S WKDLN KF++ +YRD+A D+ F V+P ++ FD
Sbjct: 634 HDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVWPAVCAVMDYMNQFD 693
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
+ G G+++N GFPDQT AYCGGL +AAL M+ L D +++ K
Sbjct: 694 RDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPFAEKYKLKF 753
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
+A VY LW GSY+ +DS S +S SI ADQL G + +SG+ +F I A
Sbjct: 754 IQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPLFDENKIRSA 811
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L IF NV + G +GAVNGM PNG D + +QS E+WTGVTY ++A ML G +
Sbjct: 812 LQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQG 871
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+TTA G++ + G G F+TPEG T D YRS YMR LA++AMQ
Sbjct: 872 FTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 919
>gi|226499590|ref|NP_001146319.1| uncharacterized protein LOC100279895 [Zea mays]
gi|219886613|gb|ACL53681.1| unknown [Zea mays]
Length = 649
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 123/290 (42%), Positives = 166/290 (57%), Gaps = 14/290 (4%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ D S WKDLN KF++ IYRD+A D++F V+P + + FD
Sbjct: 309 HDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAATGDMQFGRGVWPAVCAAMDYMDQFD 368
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
+ G G+++N GFPDQT AYCG L +AAL M+ L D +++ K
Sbjct: 369 RDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRHFAEKYKLKF 428
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
KA VY LW GSY+ +DS S +S SI ADQL G + +SG+ +F I A
Sbjct: 429 IKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPLFDEHKIRTA 486
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L IF NV + G MGAVNGM P G D + +QS E+WTGVTY ++A ML G +
Sbjct: 487 LQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLLHGMEHQG 546
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
+TTA G++ + G G F+TPEG T D YRS YMR LA++A+Q A
Sbjct: 547 FTTAEGIFTAGWSEEGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAIQYA 596
>gi|356535470|ref|XP_003536268.1| PREDICTED: non-lysosomal glucosylceramidase-like [Glycine max]
Length = 949
Score = 217 bits (552), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 121/294 (41%), Positives = 170/294 (57%), Gaps = 14/294 (4%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN+ D S WKDLN KF++ +YRD+A D++F V+P + E FD+
Sbjct: 609 DPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFATTGDLQFGVDVWPAVRAAMEYMEQFDR 668
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N GFPDQT YCG L +AAL M+ L D + + K
Sbjct: 669 DGDGLIENDGFPDQTYDTWTVHGVSTYCGCLWLAALQAAAVMALELGDREFAETCKRKFL 728
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA + + LW G+Y+ +DS S +S SI ADQL G + +SG+ ++F + I AL
Sbjct: 729 KAKPAFEEKLWNGTYFNYDSGSSGNSKSIQADQLAGQWYTASSGL--PSLFEDSKIKSAL 786
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
++ NV + G MGAVNGM PNG D + +QS EVWTGVTY L+A M++ G +EA+
Sbjct: 787 RKVYDFNVMKVKGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGLAATMIHAGMEEEAF 846
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 312
TTA G++ + G G F+TPE T D YRS YMR LA++ MQ A + K
Sbjct: 847 TTAEGIFLAGWSEDGYGYWFQTPEAWTMDGHYRSLMYMRPLAIWGMQYAINRPK 900
>gi|10440626|gb|AAG16864.1|AC069145_13 unknown protein [Oryza sativa Japonica Group]
Length = 444
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 122/288 (42%), Positives = 166/288 (57%), Gaps = 14/288 (4%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ D S WKDLN KF++ +YRD+A D+ F V+P ++ FD
Sbjct: 104 HDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVWPAVCAVMDYMNQFD 163
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
+ G G+++N GFPDQT AYCGGL +AAL M+ L D +++ K
Sbjct: 164 RDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPFAEKYKLKF 223
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
+A VY LW GSY+ +DS S +S SI ADQL G + +SG+ +F I A
Sbjct: 224 IQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPLFDENKIRSA 281
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L IF NV + G +GAVNGM PNG D + +QS E+WTGVTY ++A ML G +
Sbjct: 282 LQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQG 341
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+TTA G++ + G G F+TPEG T D YRS YMR LA++AMQ
Sbjct: 342 FTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 389
>gi|327288418|ref|XP_003228923.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Anolis
carolinensis]
Length = 667
Score = 216 bits (551), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 104/240 (43%), Positives = 149/240 (62%), Gaps = 12/240 (5%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P+EEP+ +N Y + D +DWKDLNLKF++ +YRD+ L +D +L ++P C ++
Sbjct: 397 PDEEPWQRVNAYLIHDTADWKDLNLKFVLQVYRDFFLTEDCTYLRDMWPVCQAVMESELK 456
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FD G G+++N GF DQT AYCGGL +AA+ M M+ +L + Q++SA
Sbjct: 457 FDTDGDGLIENSGFADQTYDAWVVTGASAYCGGLWLAAVCMMCRMAEILGEGAALQRYSA 516
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEA--IFPSAN 195
L+ + + LLW G YY +DS S+SIM+DQL G FL+A G+ +FP +
Sbjct: 517 ILSSGAAAFERLLWNGRYYNYDSGGGPSSNSIMSDQLAGQWFLRACGLGEGKYQVFPQEH 576
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
+ AL TI+ NV GF G+MGAVNGM+P+G D S++QS+EVW GV Y L+A M+ EG
Sbjct: 577 VQSALRTIYEMNVLGFSEGAMGAVNGMRPDGVPDTSSVQSDEVWVGVVYSLAATMIQEGQ 636
>gi|218184729|gb|EEC67156.1| hypothetical protein OsI_34005 [Oryza sativa Indica Group]
Length = 974
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 122/288 (42%), Positives = 165/288 (57%), Gaps = 14/288 (4%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ D S WKDLN KF++ +YRD+A D+ F V+P + FD
Sbjct: 634 HDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVWPAVCAAMDYMNQFD 693
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
+ G G+++N GFPDQT AYCGGL +AAL M+ L D +++ K
Sbjct: 694 RDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPFAEKYKLKF 753
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
+A VY LW GSY+ +DS S +S SI ADQL G + +SG+ +F I A
Sbjct: 754 IQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPLFDENKIRSA 811
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L IF NV + G +GAVNGM PNG D + +QS E+WTGVTY ++A ML G +
Sbjct: 812 LQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQG 871
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+TTA G++ + G G F+TPEG T D YRS YMR LA++AMQ
Sbjct: 872 FTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 919
>gi|333898016|ref|YP_004471890.1| glucosylceramidase [Thermoanaerobacterium xylanolyticum LX-11]
gi|333113281|gb|AEF18218.1| Glucosylceramidase [Thermoanaerobacterium xylanolyticum LX-11]
Length = 806
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 119/291 (40%), Positives = 166/291 (57%), Gaps = 17/291 (5%)
Query: 32 PFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKCESFD 89
P+ IN Y+ + + WKDLN K+++ +YRDY L D FL + + + K + D
Sbjct: 516 PWIKINAYDWQNPNIWKDLNSKYVLLVYRDYVLTGKTDKEFLKYTWKSVKTALDKLKEMD 575
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
K GI N G PDQT AYCG L +AAL E+ +L D + +++
Sbjct: 576 KDNDGIPDNEGIPDQTYDTWSMKGTSAYCGSLWLAALKAAQEIGKVLKDNEAYIKYNEWY 635
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
A Q + LW G YY FD+ +S H DSIMADQL G + A + I P ++ +A
Sbjct: 636 KIAQQNFEKELWNGEYYNFDT-ESDHKDSIMADQLAGQWY--ADILRLGDILPKDHVQKA 692
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L I+ NV FE G MGAVNGM+P+G D S IQ++EVWTGVTY L++ M Y G +EA
Sbjct: 693 LKKIYEFNVMKFENGKMGAVNGMRPDGIVDESDIQAQEVWTGVTYALASFMKYRGMTEEA 752
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAY 308
+ TA G+Y+ Y+++G G F TPE T D YR+ YMR L++++M+ Y
Sbjct: 753 YNTAYGVYKMTYDKSGKGYWFRTPEAWTKDGNYRASMYMRPLSIWSMEVNY 803
>gi|195164866|ref|XP_002023267.1| GL21266 [Drosophila persimilis]
gi|194105352|gb|EDW27395.1| GL21266 [Drosophila persimilis]
Length = 867
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 115/257 (44%), Positives = 156/257 (60%), Gaps = 19/257 (7%)
Query: 68 IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
I +L +Y C ++ + +DK G+++N PDQT AYC GL +AAL
Sbjct: 604 IGYLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQ 663
Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
M M+++L+ +F L K + + LW GSYY+FD S S H DSIMADQLCG+
Sbjct: 664 AMSAMATILDQPNDCLRFQDILEKGKRSLEEKLWNGSYYRFDLSPS-HRDSIMADQLCGH 722
Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
+LK+ G + E I+P N+ AL TI+ NV GF G++GA NG N G D
Sbjct: 723 WYLKSCGFDYE-IYPKENVRTALKTIYDNNVMGFHEGNIGAANGFIANAADPSKPGHVDN 781
Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
S IQ+EEVW GV Y L+A M+ EG +EA+ TAGG+Y+T+ ER G+ FETPE L G+K Y
Sbjct: 782 SNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSERIGMNFETPEALYGEKRY 841
Query: 291 RSGGYMRALAVYAMQDA 307
RS GYMR L++++MQ A
Sbjct: 842 RSIGYMRPLSIWSMQVA 858
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 31/35 (88%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 501 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 535
>gi|198473190|ref|XP_002133205.1| GA28784 [Drosophila pseudoobscura pseudoobscura]
gi|198139343|gb|EDY70607.1| GA28784 [Drosophila pseudoobscura pseudoobscura]
Length = 865
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 115/257 (44%), Positives = 156/257 (60%), Gaps = 19/257 (7%)
Query: 68 IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
I +L +Y C ++ + +DK G+++N PDQT AYC GL +AAL
Sbjct: 602 IGYLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQ 661
Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
M M+++L+ +F L K + + LW GSYY+FD S S H DSIMADQLCG+
Sbjct: 662 AMSAMATILDQPNDCLRFQDILEKGKRSLEEKLWNGSYYRFDLSPS-HRDSIMADQLCGH 720
Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
+LK+ G + E I+P N+ AL TI+ NV GF G++GA NG N G D
Sbjct: 721 WYLKSCGFDYE-IYPKENVRTALKTIYDNNVMGFHEGNIGAANGFIANAADPSKPGHVDN 779
Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
S IQ+EEVW GV Y L+A M+ EG +EA+ TAGG+Y+T+ ER G+ FETPE L G+K Y
Sbjct: 780 SNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSERIGMNFETPEALYGEKRY 839
Query: 291 RSGGYMRALAVYAMQDA 307
RS GYMR L++++MQ A
Sbjct: 840 RSIGYMRPLSIWSMQVA 856
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 31/35 (88%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 499 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 533
>gi|357146604|ref|XP_003574050.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
distachyon]
Length = 962
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 122/288 (42%), Positives = 163/288 (56%), Gaps = 14/288 (4%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ D S WKDLN KF++ +YRD+A D+ F V+P + + FD
Sbjct: 622 HDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMTFGRDVWPAVCAAMDYMDQFD 681
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
+ G G+++N GFPDQT AYCG L +AAL M+ L D +++ K
Sbjct: 682 RDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRPYAEKYKLKF 741
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
KA VY LW GSY+ +DS S +S SI ADQL G + +SG+ IF I A
Sbjct: 742 IKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPIFDEHKIRSA 799
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L IF NV + G MGAVNGM P G D + +QS E+WTGVTY ++A ML G +
Sbjct: 800 LQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQG 859
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+ TA G++ + G G F+TPEG T D YRS YMR LA++AMQ
Sbjct: 860 FITAEGIFLAGWSEEGYGYWFQTPEGWTTDGHYRSLIYMRPLAIWAMQ 907
>gi|324500616|gb|ADY40283.1| Non-lysosomal glucosylceramidase [Ascaris suum]
Length = 890
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 125/298 (41%), Positives = 183/298 (61%), Gaps = 17/298 (5%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY--ALHKDIRFLTRVYPTCLELIRK- 84
P +P+ L N Y + D WKDLNLKFI+ +RDY L KD RFL V+P LI +
Sbjct: 536 PAVDPWLLTNAYVMHDTGKWKDLNLKFILISFRDYTCVLGKDQRFLNLVWPAVKSLINEG 595
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQ 134
+++D+ G G+++N G DQT AYCG L +AAL + M+ + D+ +
Sbjct: 596 LKNWDQDGDGMIENFGAADQTYDAWKMSGVSAYCGCLWLAALRAALVMAREVGDSACAHR 655
Query: 135 FSAKLTKASQVYHDLLWTGSYYKFDS-SQSRHSDSIMADQLCGYLFLKA-SGVNTEAIFP 192
++ L++A + Y LW G YY FD S+SRH+ IMADQLCGY FL++ S + + P
Sbjct: 656 YAETLSRAREAYITKLWNGEYYNFDERSRSRHT--IMADQLCGYWFLQSISPELADELLP 713
Query: 193 SANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLY 252
+A++ AL I++ NV F G +GAVNGM+ +G DR+ IQ++E+WTGVTY L+A ++
Sbjct: 714 AAHVKSALNIIYNYNVLRFAGGRLGAVNGMRRDGTVDRNYIQADEMWTGVTYALAAFLIQ 773
Query: 253 EGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 310
+G V + TA G Y + R GL ++TPE L + YR+ GYMR L+++AMQ A ++
Sbjct: 774 QGAVQRGFDTAWGCYDGCFNRFGLQYQTPEALYEQRFYRAIGYMRPLSIWAMQWALIR 831
>gi|357116998|ref|XP_003560263.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
distachyon]
Length = 932
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 122/288 (42%), Positives = 162/288 (56%), Gaps = 14/288 (4%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ D S WKDLN KF++ +YRD+A D+ F V+P + E FD
Sbjct: 593 HDPWHELNAYNIHDTSRWKDLNPKFVLQVYRDFAATGDMSFGKDVWPAVCTAMEYMEQFD 652
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
+ G G+++N GFPDQT AYCG L +A+L M+ L ++ K
Sbjct: 653 RDGDGMIENDGFPDQTYDAWTVLGVSAYCGCLWLASLQAAAAMARGLGHNDYAERCMVKF 712
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
KA V+ LW GSY+ +DS S S SI ADQL G + +SG+ +F I
Sbjct: 713 AKAKHVFETKLWNGSYFNYDSGASYSSRSIQADQLAGQWYAASSGL--PPLFDEGRIKCT 770
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L IF NV + G MGAVNGM PNG D + +QS E+WTGVTY L+A ML G D+A
Sbjct: 771 LQKIFDYNVMRVKGGRMGAVNGMYPNGKVDETCMQSREIWTGVTYSLAATMLLHGMEDQA 830
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+TTA G++ + G G F+TPE T D YRS YMR LA++AMQ
Sbjct: 831 FTTAEGIFLAGWSEEGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQ 878
>gi|195436933|ref|XP_002066400.1| GK18271 [Drosophila willistoni]
gi|194162485|gb|EDW77386.1| GK18271 [Drosophila willistoni]
Length = 955
Score = 214 bits (544), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 111/263 (42%), Positives = 157/263 (59%), Gaps = 19/263 (7%)
Query: 68 IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
I +L +Y C ++ + +DK G+++N PDQT AYC GL +A+L
Sbjct: 692 IGYLKSMYAACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLASLQ 751
Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
M M+++L+ ++ L K + LW GSYY+FD S S H D+IMADQLCG+
Sbjct: 752 AMSAMATILDQPNDCLRYQDILEKGKHSLEEKLWNGSYYRFDLSHS-HRDTIMADQLCGH 810
Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
+LK+ G + E I+P N+ AL I+ NV GF G++GA NG N G D
Sbjct: 811 WYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANVSEPSKAGHVDN 869
Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
S IQ+EEVW GV Y L+A M+ EG DEA+ TAGG+Y+T+ +R G+ FETPE L G+K Y
Sbjct: 870 SNIQAEEVWPGVVYALAATMIQEGMFDEAFQTAGGMYKTISQRIGMNFETPEALYGEKRY 929
Query: 291 RSGGYMRALAVYAMQDAYLKGKV 313
RS GYMR L++++MQ A+ + +
Sbjct: 930 RSIGYMRPLSIWSMQVAWERRRA 952
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 23/35 (65%), Positives = 31/35 (88%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
P+EEPF+LIN YN+ DV++WKDLN KF++ +YRDY
Sbjct: 589 PDEEPFTLINCYNIHDVNEWKDLNTKFVLQVYRDY 623
>gi|288558766|gb|ADC53514.1| MIP16949p [Drosophila melanogaster]
Length = 432
Score = 213 bits (543), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 111/257 (43%), Positives = 156/257 (60%), Gaps = 19/257 (7%)
Query: 68 IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
I +L +Y +C ++ + +DK G+++N PDQT AYC GL +AAL
Sbjct: 169 IGYLKAMYASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQ 228
Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
M M+++L+ ++ L K + + LW GSYY+FD S S H D+IMADQLCG+
Sbjct: 229 AMSAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLSHS-HRDTIMADQLCGH 287
Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
+LK+ G + E I+P N+ AL I+ NV GF G++GA NG N G D
Sbjct: 288 WYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVDN 346
Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
S IQ+EEVW GV Y L+A M+ EG +EA+ TAGG+Y+T+ +R G+ FETPE L G+K Y
Sbjct: 347 SNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMNFETPEALYGEKRY 406
Query: 291 RSGGYMRALAVYAMQDA 307
RS GYMR L++++MQ A
Sbjct: 407 RSIGYMRPLSIWSMQVA 423
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 31/35 (88%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 66 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 100
>gi|195338407|ref|XP_002035816.1| GM15538 [Drosophila sechellia]
gi|194129696|gb|EDW51739.1| GM15538 [Drosophila sechellia]
Length = 947
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 111/257 (43%), Positives = 156/257 (60%), Gaps = 19/257 (7%)
Query: 68 IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
I +L +Y +C ++ + +DK G+++N PDQT AYC GL +AAL
Sbjct: 684 IGYLKAMYASCKAILERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQ 743
Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
M M+++L+ ++ L K + + LW GSYY+FD S S H D+IMADQLCG+
Sbjct: 744 AMSAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLSHS-HRDTIMADQLCGH 802
Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
+LK+ G + E I+P N+ AL I+ NV GF G++GA NG N G D
Sbjct: 803 WYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDN 861
Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
S IQ+EEVW GV Y L+A M+ EG +EA+ TAGG+Y+T+ +R G+ FETPE L G+K Y
Sbjct: 862 SNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMNFETPEALYGEKRY 921
Query: 291 RSGGYMRALAVYAMQDA 307
RS GYMR L++++MQ A
Sbjct: 922 RSIGYMRPLSIWSMQVA 938
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 31/35 (88%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 581 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 615
>gi|45552032|ref|NP_788055.2| CG33090 [Drosophila melanogaster]
gi|74876618|sp|Q7KT91.1|C3390_DROME RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase
gi|45445134|gb|AAO41192.2| CG33090 [Drosophila melanogaster]
gi|201065481|gb|ACH92150.1| FI02015p [Drosophila melanogaster]
Length = 948
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 111/257 (43%), Positives = 156/257 (60%), Gaps = 19/257 (7%)
Query: 68 IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
I +L +Y +C ++ + +DK G+++N PDQT AYC GL +AAL
Sbjct: 685 IGYLKAMYASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQ 744
Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
M M+++L+ ++ L K + + LW GSYY+FD S S H D+IMADQLCG+
Sbjct: 745 AMSAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLSHS-HRDTIMADQLCGH 803
Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
+LK+ G + E I+P N+ AL I+ NV GF G++GA NG N G D
Sbjct: 804 WYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVDN 862
Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
S IQ+EEVW GV Y L+A M+ EG +EA+ TAGG+Y+T+ +R G+ FETPE L G+K Y
Sbjct: 863 SNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMNFETPEALYGEKRY 922
Query: 291 RSGGYMRALAVYAMQDA 307
RS GYMR L++++MQ A
Sbjct: 923 RSIGYMRPLSIWSMQVA 939
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 31/35 (88%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 582 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 616
>gi|358254610|dbj|GAA55931.1| non-lysosomal glucosylceramidase [Clonorchis sinensis]
Length = 1200
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 119/311 (38%), Positives = 170/311 (54%), Gaps = 31/311 (9%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIR-KCE 86
PE EP+ N Y ++ DWKDLN KF++ ++RD+ L +D +L + P +I+ +
Sbjct: 540 PEGEPWRDANAYVMYPTKDWKDLNPKFVLQVWRDWKLTQDNDYLLYMLPIVNRIIQVSLQ 599
Query: 87 SFDKQGLGIVQNGGFPDQT----------AYCGGLHIAAL--AC-MVEMSSLLNDT---- 129
S+D G G+++N GFPDQT AY GGL +A L AC M+E + N
Sbjct: 600 SWDSNGDGMIENSGFPDQTYDTWNAQGISAYTGGLWLACLYSACEMLEYALSANSPLKAF 659
Query: 130 -------------QQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCG 176
Q Q++ L A VY LWTG+YY + ++ + ++IMADQL G
Sbjct: 660 LIASNSEDEVSWPQVQKELQILLESAKTVYDRALWTGTYYIYQNTPVGNHEAIMADQLSG 719
Query: 177 YLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSE 236
+ FL+ G AI PS ++ AL TI STN + GS+G +NG P RD S +Q+E
Sbjct: 720 HGFLRVGGAPPGAILPSEHVIMALKTIQSTNWESIQNGSLGVINGGLPGAKRDLSNVQAE 779
Query: 237 EVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYM 296
E W GV Y L+A M+ EG E ++ A Y T+Y R GL ++TPE D +R GYM
Sbjct: 780 EFWVGVNYSLAATMILEGMSQEGFSLAEACYNTIYNRFGLQYQTPEAYMIDGRFRCPGYM 839
Query: 297 RALAVYAMQDA 307
R LA++++Q A
Sbjct: 840 RPLAIWSIQQA 850
>gi|356576421|ref|XP_003556330.1| PREDICTED: non-lysosomal glucosylceramidase-like [Glycine max]
Length = 950
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 120/294 (40%), Positives = 167/294 (56%), Gaps = 14/294 (4%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN+ D S WKDLN KF++ +YRD+A D+ F V+P + E FD+
Sbjct: 610 DPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFATTGDLEFGVDVWPAVRAAMEYMEQFDR 669
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N GFPDQT YCG L +AAL M+ L D + ++ K
Sbjct: 670 DGDGLIENDGFPDQTYDTWTVHGVSTYCGCLWLAALQAAAAMALDLGDREFAEKCKRKFL 729
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA + + LW G+Y+ +DS S +S SI ADQL G + +SG+ +F + I AL
Sbjct: 730 KAKPAFEEKLWNGTYFNYDSGSSGNSKSIQADQLAGQWYTASSGL--PPLFEDSKIKSAL 787
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
++ NV + G MGAVNGM PNG D + +QS EVWTGVTY L+A M+ G +EA+
Sbjct: 788 RKVYDFNVMKVKGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGLAATMILAGMEEEAF 847
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 312
TA G++ + G G F+TPE T D YRS YMR LA++ MQ A + K
Sbjct: 848 ATAEGIFLAGWSEDGYGYWFQTPEAWTMDGHYRSLMYMRPLAIWGMQYAINRPK 901
>gi|224072655|ref|XP_002303825.1| predicted protein [Populus trichocarpa]
gi|222841257|gb|EEE78804.1| predicted protein [Populus trichocarpa]
Length = 966
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 117/288 (40%), Positives = 164/288 (56%), Gaps = 14/288 (4%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P++ +N YN+ D S WKDLN KF++ +YRD+A D+ F V+P + E FD
Sbjct: 625 HDPWNEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGVDVWPAVRTAMEYMEQFD 684
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
+ G+V+N GFPDQT AYCG L +A+L M+ L D + +K
Sbjct: 685 RDDDGLVENDGFPDQTYDAWTVHGVSAYCGCLWLASLQAAAAMAMQLGDKYFAELCKSKF 744
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
KA + LW GSY+ +DS S +S SI ADQL G ++ +SG+ ++F I A
Sbjct: 745 AKAKSAFESKLWNGSYFNYDSGSSNNSKSIQADQLAGEWYMASSGL--PSLFDDVKIRSA 802
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L I+ NV G MGAVNGM PNG D + +QS E+W+GVTY ++A M+ G D+A
Sbjct: 803 LNKIYDFNVMKVRGGKMGAVNGMHPNGKVDETCMQSREIWSGVTYAVAATMILSGMEDKA 862
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+TTA G++ + G G F+TPE T D +RS YMR LA++ MQ
Sbjct: 863 FTTAEGIFTAGWSEEGYGYWFQTPEAWTIDGHFRSLIYMRPLAIWGMQ 910
>gi|195474003|ref|XP_002089281.1| GE19029 [Drosophila yakuba]
gi|194175382|gb|EDW88993.1| GE19029 [Drosophila yakuba]
Length = 947
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 111/257 (43%), Positives = 155/257 (60%), Gaps = 19/257 (7%)
Query: 68 IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
I +L +Y C ++ + +DK G+++N PDQT AYC GL +AAL
Sbjct: 684 IGYLKAMYAPCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQ 743
Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
M M+++L+ ++ L K + + LW GSYY+FD S S H D+IMADQLCG+
Sbjct: 744 AMSAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLSHS-HRDTIMADQLCGH 802
Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
+LK+ G + E I+P N+ AL I+ NV GF G++GA NG N G D
Sbjct: 803 WYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDN 861
Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
S IQ+EEVW GV Y L+A M+ EG +EA+ TAGG+Y+T+ +R G+ FETPE L G+K Y
Sbjct: 862 SNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMNFETPEALYGEKRY 921
Query: 291 RSGGYMRALAVYAMQDA 307
RS GYMR L++++MQ A
Sbjct: 922 RSIGYMRPLSIWSMQVA 938
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 31/35 (88%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 581 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 615
>gi|222636955|gb|EEE67087.1| hypothetical protein OsJ_24071 [Oryza sativa Japonica Group]
Length = 838
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 121/288 (42%), Positives = 161/288 (55%), Gaps = 14/288 (4%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ D S WKDLN KF++ +YRD+A D+ F V+P + E FD
Sbjct: 498 HDPWHELNAYNIHDTSRWKDLNPKFVLQVYRDFAATGDMSFGKDVWPAVCTAMEYMEQFD 557
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
G+++N GFPDQT AYCG L +AAL ++ L ++ K
Sbjct: 558 HDDDGMIENDGFPDQTYDAWTVRGVSAYCGCLWLAALQTAAALARTLGHHDYAERCMLKF 617
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
KA V+ LW GSY+ +DS S +S SI ADQL G + +SG+ +F I
Sbjct: 618 AKAKPVFEAKLWNGSYFNYDSGTSYNSRSIHADQLAGQWYTASSGL--PPLFDEGRIKCT 675
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L IF NV + G MGAVNGM PNG D + +QS E+WTGVTY L+A ML G D+A
Sbjct: 676 LQKIFDYNVMRVKGGRMGAVNGMHPNGKVDETCMQSREIWTGVTYSLAATMLLHGMDDQA 735
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+TTA G++ + G G F+TPE T D YRS YMR LA++AMQ
Sbjct: 736 FTTAEGIFTAGWSEEGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQ 783
>gi|195579186|ref|XP_002079443.1| GD23959 [Drosophila simulans]
gi|194191452|gb|EDX05028.1| GD23959 [Drosophila simulans]
Length = 947
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 111/257 (43%), Positives = 155/257 (60%), Gaps = 19/257 (7%)
Query: 68 IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
I +L +Y +C ++ + +DK G+++N PDQT AYC GL +AAL
Sbjct: 684 IGYLKAMYASCKAILERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQ 743
Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
M M+++L+ ++ L K + + LW GSYY+FD S S H D+IMADQLCG+
Sbjct: 744 AMSAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLSHS-HRDTIMADQLCGH 802
Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
+LK+ G + E I+P N+ AL I+ NV GF G++GA NG N G D
Sbjct: 803 WYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDN 861
Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
S IQ+EEVW GV Y L+A M+ EG +EA+ TAGG+Y+T+ +R G+ FETPE L G+K Y
Sbjct: 862 SNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMNFETPEALYGEKRY 921
Query: 291 RSGGYMRALAVYAMQDA 307
RS GYMR L++ +MQ A
Sbjct: 922 RSIGYMRPLSILSMQVA 938
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 31/35 (88%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 581 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 615
>gi|194860149|ref|XP_001969520.1| GG23916 [Drosophila erecta]
gi|190661387|gb|EDV58579.1| GG23916 [Drosophila erecta]
Length = 947
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 111/257 (43%), Positives = 155/257 (60%), Gaps = 19/257 (7%)
Query: 68 IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
I +L +Y C ++ + +DK G+++N PDQT AYC GL +AAL
Sbjct: 684 IGYLKAMYAPCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQ 743
Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
M M+++L+ ++ L K + + LW GSYY+FD S S H D+IMADQLCG+
Sbjct: 744 AMSAMATILDQPNDCLRYQDILEKGKRSLDEKLWNGSYYRFDLSHS-HRDTIMADQLCGH 802
Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
+LK+ G + E I+P N+ AL I+ NV GF G++GA NG N G D
Sbjct: 803 WYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDN 861
Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
S IQ+EEVW GV Y L+A M+ EG +EA+ TAGG+Y+T+ +R G+ FETPE L G+K Y
Sbjct: 862 SNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMNFETPEALYGEKRY 921
Query: 291 RSGGYMRALAVYAMQDA 307
RS GYMR L++++MQ A
Sbjct: 922 RSIGYMRPLSIWSMQVA 938
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 31/35 (88%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 581 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 615
>gi|194760205|ref|XP_001962332.1| GF15415 [Drosophila ananassae]
gi|190616029|gb|EDV31553.1| GF15415 [Drosophila ananassae]
Length = 894
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 110/257 (42%), Positives = 155/257 (60%), Gaps = 19/257 (7%)
Query: 68 IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
I +L +Y +C ++ + +DK G+++N PDQT +YC GL +AAL
Sbjct: 631 IGYLKAMYASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSSYCSGLWLAALQ 690
Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
M M+++L+ ++ L K + + LW GSYY+FD S H D+IMADQLCG+
Sbjct: 691 AMSAMATILDQPNDCMRYQDILEKGKRSLEEKLWNGSYYRFDLSHG-HRDTIMADQLCGH 749
Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
+LK+ G + E I+P N+ AL I+ NV GF G++GA NG N G D
Sbjct: 750 WYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANASDPSKPGHVDN 808
Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
S IQSEEVW GV Y L+A M+ EG +EA+ TAGG+Y+T+ +R G+ FETPE L G+K Y
Sbjct: 809 SNIQSEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTISQRIGMNFETPEALYGEKRY 868
Query: 291 RSGGYMRALAVYAMQDA 307
RS GYMR L++++MQ A
Sbjct: 869 RSIGYMRPLSIWSMQVA 885
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 31/35 (88%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 528 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 562
>gi|195033785|ref|XP_001988762.1| GH11342 [Drosophila grimshawi]
gi|193904762|gb|EDW03629.1| GH11342 [Drosophila grimshawi]
Length = 869
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 109/258 (42%), Positives = 155/258 (60%), Gaps = 19/258 (7%)
Query: 68 IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
+ +L +Y C ++ + +DK G+++N PDQT AYC GL +AAL
Sbjct: 606 MSYLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCAGLWLAALQ 665
Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
M M++LL+ ++ L + + LW GSYY+FD S + H D+IMADQLCG+
Sbjct: 666 TMSVMATLLDQPNDCLRYQDILERGKNSLEEKLWNGSYYRFDQSHN-HRDTIMADQLCGH 724
Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
+LK+ G + E I+P N+ AL I+ NV GF G++GA NG N G D
Sbjct: 725 WYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANANEPSKPGHVDN 783
Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTY 290
S IQ+EEVW GV Y L+A M+ EG +EA+ TAGG+Y+T+ +R G+ FETPE L G+K Y
Sbjct: 784 SNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTISQRIGMNFETPEALYGEKRY 843
Query: 291 RSGGYMRALAVYAMQDAY 308
RS GYMR L++++MQ A+
Sbjct: 844 RSIGYMRPLSIWSMQVAW 861
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 23/35 (65%), Positives = 31/35 (88%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
P+EEPF+LIN YN+ DV++WKDLN KF++ +YRDY
Sbjct: 503 PDEEPFTLINCYNIHDVNEWKDLNTKFVLQVYRDY 537
>gi|195115764|ref|XP_002002426.1| GI12858 [Drosophila mojavensis]
gi|193913001|gb|EDW11868.1| GI12858 [Drosophila mojavensis]
Length = 877
Score = 210 bits (535), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 110/261 (42%), Positives = 157/261 (60%), Gaps = 19/261 (7%)
Query: 70 FLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM 119
+L +Y C ++ + +DK G+++N PDQT AYC GL +AAL M
Sbjct: 616 YLKSMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDAWVMDGPSAYCSGLWLAALQAM 675
Query: 120 VEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLF 179
M++LL+ ++ L K + + LW GSYY+FD S + H D+IMADQLCG+ +
Sbjct: 676 SVMATLLDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLSHN-HRDTIMADQLCGHWY 734
Query: 180 LKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDRSA 232
LK+ G + E I+P N+ AL I+ NV GF G++GA NG N G D S
Sbjct: 735 LKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANVSEPSKPGHVDNSN 793
Query: 233 IQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRS 292
IQ+EEVW GV Y L+A M+ EG DEA+ TAGG+Y+T+ +R G+ +ETPE L G+K YRS
Sbjct: 794 IQAEEVWPGVVYALAATMIQEGMFDEAFQTAGGMYKTLSQRIGMNYETPEALYGEKRYRS 853
Query: 293 GGYMRALAVYAMQDAYLKGKV 313
GYMR L++++MQ A+ + +
Sbjct: 854 IGYMRPLSIWSMQVAWERRRA 874
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 31/35 (88%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 511 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 545
>gi|218199525|gb|EEC81952.1| hypothetical protein OsI_25831 [Oryza sativa Indica Group]
Length = 850
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 121/290 (41%), Positives = 161/290 (55%), Gaps = 14/290 (4%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ D S WKDLN KF++ +YRD+ D+ F V+P + E FD
Sbjct: 510 HDPWHELNAYNIHDTSRWKDLNPKFVLQVYRDFVATGDMSFGKDVWPAVCTAMEYMEQFD 569
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
G+++N GFPDQT AYCG L +AAL ++ L ++ K
Sbjct: 570 HDDDGMIENDGFPDQTYDAWTVRGVSAYCGCLWLAALQAAAALARTLGHHDYAERCMLKF 629
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
KA V+ LW GSY+ +DS S +S SI ADQL G + +SG+ +F I
Sbjct: 630 AKAKPVFEAKLWNGSYFNYDSGTSYNSRSIHADQLAGQWYTASSGL--PPLFDEGRIKCT 687
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L IF NV + G MGAVNGM PNG D + +QS E+WTGVTY L+A ML G D+A
Sbjct: 688 LQKIFDYNVMRVKGGRMGAVNGMHPNGKVDETCMQSREIWTGVTYSLAATMLLHGMDDQA 747
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
+TTA G++ + G G F+TPE T D YRS YMR LA++AMQ A
Sbjct: 748 FTTAEGIFTAGWSEEGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQYA 797
>gi|225460125|ref|XP_002275782.1| PREDICTED: non-lysosomal glucosylceramidase-like [Vitis vinifera]
Length = 969
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 123/295 (41%), Positives = 169/295 (57%), Gaps = 15/295 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN++D WKDLN KF++ +YRD D F V+P I + FDK
Sbjct: 620 DPWFELNAYNLYDTDRWKDLNSKFVLQVYRDMVATGDKNFARAVWPAVYIAIAFLDQFDK 679
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N GFPDQT AYCGGL +AAL M+ + D+ F K
Sbjct: 680 DGDGMIENDGFPDQTYDAWSVTGVSAYCGGLWVAALQAASAMAREVGDSMTADYFWFKFQ 739
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY D LW GSY+ +D+S S SI ADQL G + +A G+ + I AL
Sbjct: 740 KAKAVY-DKLWNGSYFNYDNSGGSSSSSIQADQLAGQWYARACGL--QPIVDDEKARSAL 796
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
+++ NV + G GAVNGM P+G D SA+QS E+W GVTY ++A M++EG V+ A+
Sbjct: 797 EKVYNFNVLKVKEGKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETAF 856
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
TA G+Y + + GLG F+TPE D+ YRS YMR LA++AMQ A K ++
Sbjct: 857 NTASGIYDAAWSQEGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALSKPEL 911
>gi|195397814|ref|XP_002057523.1| GJ18180 [Drosophila virilis]
gi|194141177|gb|EDW57596.1| GJ18180 [Drosophila virilis]
Length = 869
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 109/261 (41%), Positives = 156/261 (59%), Gaps = 19/261 (7%)
Query: 70 FLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM 119
+L +Y C ++ + +DK G+++N PDQT AYC GL +AAL M
Sbjct: 608 YLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCAGLWLAALQTM 667
Query: 120 VEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLF 179
M++LL+ ++ L K + LW GSYY+FD S + H D+IMADQLCG+ +
Sbjct: 668 SVMATLLDQPNDCLRYQDILEKGKHSLEEKLWNGSYYRFDLSHN-HRDTIMADQLCGHWY 726
Query: 180 LKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDRSA 232
LK+ G + E I+P N+ AL I+ NV GF G++GA NG N G D S
Sbjct: 727 LKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPSKPGHVDNSN 785
Query: 233 IQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRS 292
IQ+EEVW GV Y L+A M+ EG +EA+ TAGG+Y+T+ +R G+ +ETPE L G+K YRS
Sbjct: 786 IQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTISQRIGMNYETPEALYGEKRYRS 845
Query: 293 GGYMRALAVYAMQDAYLKGKV 313
GYMR L++++MQ A+ + +
Sbjct: 846 IGYMRPLSIWSMQVAWERRRA 866
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 31/35 (88%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 503 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 537
>gi|297741007|emb|CBI31319.3| unnamed protein product [Vitis vinifera]
Length = 953
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 123/295 (41%), Positives = 169/295 (57%), Gaps = 15/295 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN++D WKDLN KF++ +YRD D F V+P I + FDK
Sbjct: 604 DPWFELNAYNLYDTDRWKDLNSKFVLQVYRDMVATGDKNFARAVWPAVYIAIAFLDQFDK 663
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N GFPDQT AYCGGL +AAL M+ + D+ F K
Sbjct: 664 DGDGMIENDGFPDQTYDAWSVTGVSAYCGGLWVAALQAASAMAREVGDSMTADYFWFKFQ 723
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY D LW GSY+ +D+S S SI ADQL G + +A G+ + I AL
Sbjct: 724 KAKAVY-DKLWNGSYFNYDNSGGSSSSSIQADQLAGQWYARACGL--QPIVDDEKARSAL 780
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
+++ NV + G GAVNGM P+G D SA+QS E+W GVTY ++A M++EG V+ A+
Sbjct: 781 EKVYNFNVLKVKEGKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETAF 840
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
TA G+Y + + GLG F+TPE D+ YRS YMR LA++AMQ A K ++
Sbjct: 841 NTASGIYDAAWSQEGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALSKPEL 895
>gi|256082537|ref|XP_002577511.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
Length = 892
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 112/311 (36%), Positives = 170/311 (54%), Gaps = 31/311 (9%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKC-E 86
PE+EP+ N Y +F WKDLN KFI+ ++RD+ + +D ++L + P L ++RK
Sbjct: 560 PEDEPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLYMLPIVLRILRKSLV 619
Query: 87 SFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSS------------ 124
++D G+++N GFPDQT AY GG+ ++ L +M S
Sbjct: 620 AWDSDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYATFDMLSWCLKSDSPVYDQ 679
Query: 125 LLNDTQQQQQFSAKL--------TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCG 176
++N+T Q+ +++ TKA Y+ LWTGSYY F + + + IMADQL G
Sbjct: 680 IVNNTDDTQRSWSEIKDEVQTLFTKARDSYNAKLWTGSYYAFQTYCTPRREVIMADQLSG 739
Query: 177 YLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSE 236
Y F + +GV I P ++ + L TI + N G + G++GA+NG P D S++Q+E
Sbjct: 740 YWFSRITGVPPNLILPKNHVVKTLQTISNCNWHGIKNGTIGAINGCLPVCKPDLSSVQAE 799
Query: 237 EVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYM 296
E W V Y LS+ M+ EG +DE Y T+Y GL ++TPE D +R GYM
Sbjct: 800 EFWVAVNYSLSSLMIAEGMIDEGLALGEKCYDTIYNLYGLQYQTPEAYMSDGRFRCPGYM 859
Query: 297 RALAVYAMQDA 307
RALA++++Q A
Sbjct: 860 RALAIWSIQQA 870
>gi|30692911|ref|NP_174631.2| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
gi|79319136|ref|NP_001031135.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
gi|332193493|gb|AEE31614.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
gi|332193494|gb|AEE31615.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
Length = 947
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 119/294 (40%), Positives = 169/294 (57%), Gaps = 15/294 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN+F+ WKDLN KF++ +YRD D+ F V+P+ I + FDK
Sbjct: 598 DPWFELNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 657
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N GFPDQT AYCGGL +AAL ++ + D F+AK
Sbjct: 658 DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALAREIGDNGAAVYFNAKYE 717
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY + LW GSY+ +D+S+S S SI+ADQ+ G + +A G+ + I I +AL
Sbjct: 718 KARSVY-EKLWNGSYFNYDNSRSGSSSSILADQMAGQWYARACGL--KPIAKEEWIKKAL 774
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
T++ NV G+ GAVNGM P+G D S + S EVW G TY ++A M+ EG D+ +
Sbjct: 775 ETVYDFNVMRVRDGTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGF 834
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 312
TA G+Y + GLG F+TPE T + YRS YMR LA++ +Q A+ K
Sbjct: 835 RTASGIYEAAWSDRGLGCAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPK 888
>gi|353232147|emb|CCD79502.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
Length = 900
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 112/311 (36%), Positives = 170/311 (54%), Gaps = 31/311 (9%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKC-E 86
PE+EP+ N Y +F WKDLN KFI+ ++RD+ + +D ++L + P L ++RK
Sbjct: 568 PEDEPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLYMLPIVLRILRKSLV 627
Query: 87 SFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSS------------ 124
++D G+++N GFPDQT AY GG+ ++ L +M S
Sbjct: 628 AWDSDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYATFDMLSWCLKSDSPVYDQ 687
Query: 125 LLNDTQQQQQFSAKL--------TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCG 176
++N+T Q+ +++ TKA Y+ LWTGSYY F + + + IMADQL G
Sbjct: 688 IVNNTDDTQRSWSEIKDEVQTLFTKARDSYNAKLWTGSYYAFQTYCTPRREVIMADQLSG 747
Query: 177 YLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSE 236
Y F + +GV I P ++ + L TI + N G + G++GA+NG P D S++Q+E
Sbjct: 748 YWFSRITGVPPNLILPKNHVVKTLQTISNCNWHGIKNGTIGAINGCLPVCKPDLSSVQAE 807
Query: 237 EVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYM 296
E W V Y LS+ M+ EG +DE Y T+Y GL ++TPE D +R GYM
Sbjct: 808 EFWVAVNYSLSSLMIAEGMIDEGLALGEKCYDTIYNLYGLQYQTPEAYMSDGRFRCPGYM 867
Query: 297 RALAVYAMQDA 307
RALA++++Q A
Sbjct: 868 RALAIWSIQQA 878
>gi|28393605|gb|AAO42222.1| unknown protein [Arabidopsis thaliana]
Length = 947
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 119/294 (40%), Positives = 169/294 (57%), Gaps = 15/294 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN+F+ WKDLN KF++ +YRD D+ F V+P+ I + FDK
Sbjct: 598 DPWFELNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 657
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N GFPDQT AYCGGL +AAL ++ + D F+AK
Sbjct: 658 DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALAREIGDNGAAVYFNAKYE 717
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY + LW GSY+ +D+S+S S SI+ADQ+ G + +A G+ + I I +AL
Sbjct: 718 KARSVY-EKLWNGSYFNYDNSRSGSSSSILADQMAGQWYARACGL--KPIAKEEWIKKAL 774
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
T++ NV G+ GAVNGM P+G D S + S EVW G TY ++A M+ EG D+ +
Sbjct: 775 ETVYDFNVMRVRDGTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGF 834
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 312
TA G+Y + GLG F+TPE T + YRS YMR LA++ +Q A+ K
Sbjct: 835 RTASGIYEAAWSDRGLGCAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPK 888
>gi|326489639|dbj|BAK01800.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 797
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 117/295 (39%), Positives = 166/295 (56%), Gaps = 14/295 (4%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N Y + + WKDLN KF++ +YRD D F V+P+ + E FDK
Sbjct: 453 DPWIKVNAYTLHNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYMAMAYMEQFDK 512
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G+++N FPDQT AYCGGL +AAL ++ + D ++ F K
Sbjct: 513 DKDGMIENEDFPDQTYDVWSMAGVSAYCGGLWVAALQAASALAHEVGDKASEKLFWNKYE 572
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY LW GSY+ +D + ++ S SI ADQL G + KA G+++ I AL
Sbjct: 573 KAKSVYDKKLWNGSYFNYDDAGTKASTSIHADQLAGQWYAKACGLSS--IVDKDKSQSAL 630
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
I++ NV F+ G+ GA+NGM P+G D S +QS E+W GVTY L+A+M+ EG VDE +
Sbjct: 631 EKIYAFNVMKFKDGNRGAINGMWPDGTLDMSTMQSREIWPGVTYALAASMIQEGMVDEGF 690
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
TA G+Y + GLG F+TPE D YRS YMR LA++++Q A K+
Sbjct: 691 KTAEGIYNAAWSTEGLGYAFQTPESWNNDDEYRSLCYMRPLAIWSIQWALSNPKL 745
>gi|255569126|ref|XP_002525532.1| conserved hypothetical protein [Ricinus communis]
gi|223535211|gb|EEF36890.1| conserved hypothetical protein [Ricinus communis]
Length = 948
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 123/295 (41%), Positives = 167/295 (56%), Gaps = 18/295 (6%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN+ + WKDLN KF++ IYRD D F V+P+ + + FDK
Sbjct: 603 DPWFEVNAYNLISTARWKDLNPKFVLQIYRDVVATGDKSFALAVWPSVYVAMAYMDQFDK 662
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N GFPDQT AYCGGL +AAL ++ + D++ F K
Sbjct: 663 DGDGMIENEGFPDQTYDTWSVTGVSAYCGGLWVAALQAASALACEVGDSESASFFWVKYQ 722
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY LW GSY+ +DSS + SI ADQL G + +A G+++ I + AL
Sbjct: 723 KAKAVY-STLWNGSYFNYDSSGN---SSIHADQLAGQWYARACGLSS--IVDEEKVRSAL 776
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
I+ NV + G GAVNGM P+G D S +QS E+W GVTY LSA+M+ EG + A+
Sbjct: 777 EKIYKFNVLKVKEGKRGAVNGMLPDGKVDMSVMQSREIWPGVTYALSASMIQEGMAEMAF 836
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
TA G+Y + + GLG F+ PEG D YRS YMR LA++AMQ A K KV
Sbjct: 837 QTASGIYEAAWSQEGLGYSFQIPEGWNTDDQYRSLCYMRPLAIWAMQWALSKPKV 891
>gi|449442371|ref|XP_004138955.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
gi|449520930|ref|XP_004167485.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
Length = 993
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 119/287 (41%), Positives = 165/287 (57%), Gaps = 14/287 (4%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P++ +N YN+ D S WKDLN KF++ +YRD+A +D+ F V+P+ I E FD+
Sbjct: 653 DPWNEMNAYNIHDTSRWKDLNTKFVLQVYRDFAATRDMSFGVDVWPSVRAAIEYMEQFDR 712
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N GFPDQT AYCG L +AAL M+ L D + + +K
Sbjct: 713 DGDGVIENDGFPDQTYDTWTVHGISAYCGCLWVAALQAAAAMAHELGDQEFAETCKSKFL 772
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA V LW GSY+ +DS S +S SI ADQL G + +SG+ +F I AL
Sbjct: 773 KARPVLEAELWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASSGL--PPLFDDFKIKSAL 830
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
I+ NV G MGAVNGM PNG D + +QS E+WTGVTY ++A M+ G +EA+
Sbjct: 831 RKIYDFNVMKVRGGRMGAVNGMHPNGKIDETCMQSREIWTGVTYGVAATMILAGMEEEAF 890
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
TA G++ + G G F+TPE + D YRS YMR L+++ MQ
Sbjct: 891 KTAEGIFLAGWSEEGFGYWFQTPEAWSTDGHYRSLIYMRPLSIWGMQ 937
>gi|9665098|gb|AAF97289.1|AC010164_11 Hypothetical protein [Arabidopsis thaliana]
Length = 790
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 119/294 (40%), Positives = 169/294 (57%), Gaps = 15/294 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN+F+ WKDLN KF++ +YRD D+ F V+P+ I + FDK
Sbjct: 441 DPWFELNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 500
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N GFPDQT AYCGGL +AAL ++ + D F+AK
Sbjct: 501 DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALAREIGDNGAAVYFNAKYE 560
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY + LW GSY+ +D+S+S S SI+ADQ+ G + +A G+ + I I +AL
Sbjct: 561 KARSVY-EKLWNGSYFNYDNSRSGSSSSILADQMAGQWYARACGL--KPIAKEEWIKKAL 617
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
T++ NV G+ GAVNGM P+G D S + S EVW G TY ++A M+ EG D+ +
Sbjct: 618 ETVYDFNVMRVRDGTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGF 677
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 312
TA G+Y + GLG F+TPE T + YRS YMR LA++ +Q A+ K
Sbjct: 678 RTASGIYEAAWSDRGLGCAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPK 731
>gi|414589150|tpg|DAA39721.1| TPA: hypothetical protein ZEAMMB73_266944 [Zea mays]
Length = 583
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 121/288 (42%), Positives = 159/288 (55%), Gaps = 14/288 (4%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ D S WKDLN KF++ IYRD+A D F V+P + E FD
Sbjct: 244 HDPWHELNAYNIHDTSRWKDLNPKFVLQIYRDFAATGDKSFGKDVWPAVCTAMEYMEQFD 303
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
G G+++N GFPDQT AYCG L +AAL ++ L ++ +
Sbjct: 304 HDGDGMIENDGFPDQTYDAWTVQGVSAYCGCLWLAALQAAAALAGSLGHDDYAERCMTRF 363
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
KA V+ LW GS++ +DS S S SI ADQL G + +SG+ +F I
Sbjct: 364 VKAKTVFEARLWNGSHFNYDSGTSYSSRSIQADQLAGQWYTASSGL--PPLFDEDRIKCT 421
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L IF NV + G MGAVNGM PNG D + +QS E+WTGVTY L+A ML G +A
Sbjct: 422 LQKIFDYNVMRVKGGRMGAVNGMHPNGKVDETCMQSREIWTGVTYSLAATMLLHGMEHQA 481
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+TTA G+Y + G G F+TPE T D YRS YMR LA++AMQ
Sbjct: 482 FTTAEGIYTAGWSEEGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQ 529
>gi|440799031|gb|ELR20092.1| glucosylceramidase [Acanthamoeba castellanii str. Neff]
Length = 860
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 118/309 (38%), Positives = 173/309 (55%), Gaps = 31/309 (10%)
Query: 28 PEEEPFSLINGYNVFDVSDWK---------------------DLNLKFIVSIYRDYALHK 66
P ++P+ +N YN+ + S WK DLN KF++ +YRD+ +
Sbjct: 543 PADDPWVALNAYNIHETSSWKGKLGLLPASHLSPSTELTEAADLNSKFVLGVYRDFVATQ 602
Query: 67 DIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT--AYCGGLHIAALACMVEMSS 124
D RFL V+P +R + FD+ G+++N G PDQT ++ GGL +AAL+ M+
Sbjct: 603 DKRFLADVWPAVDAALRHLQQFDRNKDGMIENEGLPDQTYHSWTGGLWVAALSAACAMAQ 662
Query: 125 LLN--DTQQQQQFSAKLTKASQVYHDLLWTGS-YYKFDSSQSRHSDSIMADQLCGYLFLK 181
L D Q + A++VY++ LW S Y+ +D+S S H DSI ADQLCG + +
Sbjct: 663 ALQLADDSVAQHYRDLHQSAARVYNESLWNPSGYFNYDTSTSTHHDSIQADQLCGQWYAR 722
Query: 182 ASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTG 241
A G+ + I P I AL T+F+ N F G GAVNGM+P+G D + + S E W G
Sbjct: 723 ACGLPS--IAPDKRIVTALKTVFANN---FNEGKRGAVNGMRPDGRVDYACLHSSEAWPG 777
Query: 242 VTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAV 301
T+ L+A M+ EG VD+A+ TA G+ ++YE G F TPE G YR+ G+MR LA+
Sbjct: 778 TTFALAACMMGEGLVDQAFITAAGVCGSIYEDLGYWFRTPEAWDGSGNYRALGHMRPLAI 837
Query: 302 YAMQDAYLK 310
+AMQ A+ +
Sbjct: 838 WAMQWAWTQ 846
>gi|297851790|ref|XP_002893776.1| hypothetical protein ARALYDRAFT_473529 [Arabidopsis lyrata subsp.
lyrata]
gi|297339618|gb|EFH70035.1| hypothetical protein ARALYDRAFT_473529 [Arabidopsis lyrata subsp.
lyrata]
Length = 947
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 118/290 (40%), Positives = 168/290 (57%), Gaps = 15/290 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN+F+ WKDLN KF++ +YRD D+ F V+P+ I + FDK
Sbjct: 598 DPWFEVNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 657
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N GFPDQT AYCGGL +AAL ++ + D F+AK
Sbjct: 658 DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALAREVGDNGAAVYFNAKYE 717
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY + LW GSY+ +D+S+S S SI+ADQL G + +A G+ + I I +AL
Sbjct: 718 KARSVY-EKLWNGSYFNYDNSRSGSSSSILADQLAGQWYARACGL--KPIAKEEWIKKAL 774
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
T++ NV G+ GAVNGM P+G D S + S EVW G TY ++A M+ EG D+ +
Sbjct: 775 ETVYDFNVMRVRDGTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGF 834
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAY 308
TA G+Y + GLG F+TPE T + +RS YMR LA++ +Q A+
Sbjct: 835 RTASGIYEAAWSDRGLGCAFQTPEAWTTNDEFRSLCYMRPLAIWGIQWAH 884
>gi|242070593|ref|XP_002450573.1| hypothetical protein SORBIDRAFT_05g007160 [Sorghum bicolor]
gi|241936416|gb|EES09561.1| hypothetical protein SORBIDRAFT_05g007160 [Sorghum bicolor]
Length = 951
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 121/298 (40%), Positives = 168/298 (56%), Gaps = 21/298 (7%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N Y +++ WKDLN KF++ +YRD D F V+P+ + E FDK
Sbjct: 608 DPWFKVNAYTLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYIAMAYMEQFDK 667
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G+++N FPDQT AYCGGL +AAL ++ + D ++ F K
Sbjct: 668 DKDGMIENEDFPDQTYDVWSMAGISAYCGGLWVAALQAASALAREVGDKASEKLFWDKYE 727
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA- 199
KA VY L W GSY+ +D ++ S SI ADQL G + KA G +FP + +A
Sbjct: 728 KAKSVYSKL-WNGSYFSYDDGDNKVSTSIQADQLAGQWYAKACG-----LFPIVDKDKAQ 781
Query: 200 --LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
L IFS NV F+ G+ GA+NGM P+G D SA+QS E+W GVTY L+A+M+ EG V+
Sbjct: 782 SALEKIFSFNVMKFKDGTRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAASMIQEGMVE 841
Query: 258 EAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
+ + TA G+Y + GLG F+TPE D YRS YMR LA++AMQ A K+
Sbjct: 842 QGFKTAEGIYHAAWSPEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAMQWALSNPKL 899
>gi|224129460|ref|XP_002328722.1| predicted protein [Populus trichocarpa]
gi|222839020|gb|EEE77371.1| predicted protein [Populus trichocarpa]
Length = 922
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 117/288 (40%), Positives = 163/288 (56%), Gaps = 15/288 (5%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
++P+ +N YN+ + WKDLN KF++ +YRD D +F +P+ + + FD
Sbjct: 576 DDPWFEVNAYNLHNTDRWKDLNPKFVLQVYRDVVATGDKKFAQAAWPSVYVAMAYMDQFD 635
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
K G G+++N GFPDQT AYCGGL +AAL ++ + D + F +
Sbjct: 636 KDGDGMIENDGFPDQTYDTWSVSGVSAYCGGLWVAALQAASALAWEVGDKDSAEYFWFRF 695
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
KA VY D LW GSY+ +D S R+S SI ADQL G + +A G+ I A
Sbjct: 696 QKAKVVY-DKLWNGSYFNYDDSNGRNSPSIQADQLAGQWYARACGL--LPIVDEDKARSA 752
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L I++ N G GAVNGM P+G D S +QS E+W+GVTY ++A M+ EG +D A
Sbjct: 753 LEKIYNYNFLKVHDGKRGAVNGMLPDGTVDMSDMQSREIWSGVTYAVAATMMQEGLMDMA 812
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+ TA G+Y + GLG F+TPEG + YRS GYMR LA++AMQ
Sbjct: 813 FHTASGVYEAAWAEQGLGYSFQTPEGWNTNGQYRSLGYMRPLAIWAMQ 860
>gi|449482327|ref|XP_004156248.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Cucumis
sativus]
Length = 508
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 119/291 (40%), Positives = 169/291 (58%), Gaps = 15/291 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN+ +VS WKDL KF++ +YRD D F V+P+ + E FDK
Sbjct: 160 DPWLEVNAYNLLNVSRWKDLGSKFVLQVYRDVVATGDKNFAKSVWPSVYVALAFMEQFDK 219
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G+++N GFPDQT AYCGGL +AAL ++S ++D F K
Sbjct: 220 DKDGMIENEGFPDQTYDTWTVKGVSAYCGGLWVAALQAASALASEVDDEAAAHYFWIKYQ 279
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY + LW GSY+ +D+S+ S SI ADQL G + +A G+ I I AL
Sbjct: 280 KARSVY-ETLWNGSYFNYDNSKGPWSSSIQADQLAGQWYARACGLCP--IADEEKIRVAL 336
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
I++ NV + G+ GAVNGM P+G D+S +Q +E+W GVTY ++A M+ EG V+ +
Sbjct: 337 EKIYNFNVMKVKGGTRGAVNGMFPDGSVDKSILQPKEIWAGVTYSVAATMIQEGMVETGF 396
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYL 309
TA G+Y+ + + GLG F+TPE D +RS GYMR LA++AMQ A +
Sbjct: 397 QTAMGIYQAAWAQDGLGYSFQTPEAWDVDDRFRSIGYMRPLAIWAMQWAMM 447
>gi|339252284|ref|XP_003371365.1| AP-1 complex subunit gamma-1 [Trichinella spiralis]
gi|316968412|gb|EFV52690.1| AP-1 complex subunit gamma-1 [Trichinella spiralis]
Length = 1615
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 114/277 (41%), Positives = 175/277 (63%), Gaps = 17/277 (6%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY-ALHKDIRFLTRVYPTCLELIRKC-ESF 88
EP+ +N Y D S+WKDLNLKF+++ YRDY L+K FL V+P E+I +++
Sbjct: 1341 EPWLEVNAYAFHDTSEWKDLNLKFVLTSYRDYLMLNKSESFLRHVWPAVKEIIETALQNW 1400
Query: 89 DKQGLGIVQNGGFPDQTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHD 148
D G +V+N G DQT +H L VE++ L + Q++ S +
Sbjct: 1401 DHDGDKLVENDGTCDQTYDAWCMHWTLL---VELAKRL-EMQKKSLISEQC--------- 1447
Query: 149 LLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNV 208
+ G+YY+FD QS +S ++MADQLCGY +LK G+ T+ I P N+ + L TI++ NV
Sbjct: 1448 WMMRGTYYQFDQ-QSCNSSTVMADQLCGYAYLKICGLQTD-ILPLDNVKKVLETIYNLNV 1505
Query: 209 RGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYR 268
F G++GAVNGM +G++D S++Q++E WTGVTY LSA M+ EG V++ ++TA G+Y+
Sbjct: 1506 CSFGNGTLGAVNGMLYSGEKDTSSLQADEAWTGVTYFLSAHMISEGFVEQGFSTASGIYK 1565
Query: 269 TVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+ +E G+ ++TPE L K +R+ GYMR L+++A+Q
Sbjct: 1566 SCFESFGMHYQTPEALYEKKWFRAVGYMRPLSIWAIQ 1602
>gi|253761389|ref|XP_002489100.1| hypothetical protein SORBIDRAFT_0073s002050 [Sorghum bicolor]
gi|241947399|gb|EES20544.1| hypothetical protein SORBIDRAFT_0073s002050 [Sorghum bicolor]
Length = 603
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 121/288 (42%), Positives = 159/288 (55%), Gaps = 14/288 (4%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ D S WKDLN KF++ IYRD+A D F V+P + E FD
Sbjct: 264 HDPWHELNAYNIHDTSRWKDLNPKFVLQIYRDFAATGDKSFGKDVWPAVCTAMEYMEQFD 323
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
G +++N GFPDQT AYCG L +AAL ++ L ++ +
Sbjct: 324 HDGDDMIENDGFPDQTYDAWTVQGVSAYCGCLWLAALQAAAALARSLGHDDYAERCMTRF 383
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
KA V+ LW GSY+ +DS S +S SI ADQL G + +SG+ +F I
Sbjct: 384 AKAKSVFEARLWNGSYFNYDSGTSYNSRSIQADQLAGQWYTASSGL--PPLFDEDRIKCT 441
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L IF NV + G MGAVNGM PNG D + +QS E+WTGVTY L+A ML G +A
Sbjct: 442 LQKIFDYNVMRVKGGRMGAVNGMHPNGKVDDTCMQSREIWTGVTYSLAATMLLHGMEHQA 501
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+TTA G+Y + G G F+TPE T D YRS YMR LA++AMQ
Sbjct: 502 FTTAEGIYIAGWSEEGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQ 549
>gi|66801399|ref|XP_629625.1| hypothetical protein DDB_G0292446 [Dictyostelium discoideum AX4]
gi|60463006|gb|EAL61202.1| hypothetical protein DDB_G0292446 [Dictyostelium discoideum AX4]
Length = 1302
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 110/296 (37%), Positives = 167/296 (56%), Gaps = 13/296 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELI-RKCE 86
P E+P+ +N Y + DVS WKDL KF++ +YRDY + +D FL + Y +I R E
Sbjct: 964 PGEDPWKRVNSYCIQDVSRWKDLPSKFVLQVYRDYLVMEDKNFLLQCYNVVEGVIQRTLE 1023
Query: 87 SFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFS 136
FD G+V N GFPDQT AY GGL +AAL E++ +L + + ++
Sbjct: 1024 HFDFDHDGVVDNEGFPDQTYDVWPATGCSAYSGGLWLAALKVAAEIAKILGFNEDESIYN 1083
Query: 137 AKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANI 196
A K S+ + LW G Y+ +DSS+S HSDSIM+D L G+ +L + G+ + F A
Sbjct: 1084 AIFEKGSKSFTKKLWNGHYFNYDSSKSVHSDSIMSDMLAGHWYLLSCGLKSYMTFDQA-- 1141
Query: 197 ARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNV 256
+L+ I NV+ + G GAVNGM+P G D + +QS EVW G ++ L+A M+
Sbjct: 1142 LSSLSIINEYNVKSYSKGVCGAVNGMRPEGTVDTTCLQSSEVWIGTSFSLAATMILHHMD 1201
Query: 257 DEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 312
+AW G+ + Y++ G ++TPE + +R+ YMR L+++++Q A K K
Sbjct: 1202 SDAWDLVKGIVNSSYQKWGFQYQTPEAWDQNGCFRAASYMRPLSIWSIQWALQKRK 1257
>gi|302762052|ref|XP_002964448.1| hypothetical protein SELMODRAFT_166634 [Selaginella moellendorffii]
gi|300168177|gb|EFJ34781.1| hypothetical protein SELMODRAFT_166634 [Selaginella moellendorffii]
Length = 916
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 113/288 (39%), Positives = 165/288 (57%), Gaps = 18/288 (6%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P++ +N YN+ D S WKDLN KF+V + RD A D+ F V+P + + FD
Sbjct: 565 HDPWNTLNSYNLHDTSKWKDLNCKFVVQVNRDAAATGDLEFARAVWPAAYAAMAVTDQFD 624
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
+ G G+++N G PDQT +YCGGL +AAL ++ L+ D + F K
Sbjct: 625 RDGDGMIENDGVPDQTYDLWTVDGVSSYCGGLWVAALQAASALADLVGDKPSSRMFHGKF 684
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
+A VY LW GSY+ +D ++S S+ +DQL G + ASG+ I A
Sbjct: 685 LRAKAVYEKSLWNGSYFHYD----KNSTSVQSDQLAGQWYSLASGL--PGIVREDQATSA 738
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L T+F+TNV ++ G+ GAVNGM P+G DR+++QS EVW G TY ++AAM+ +G +E
Sbjct: 739 LGTVFATNVMRYKGGNEGAVNGMLPSGGEDRTSLQSREVWAGTTYAVAAAMIQQGMREEG 798
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+ TA G++ + G G F+TPE D YRS YMR L+++AMQ
Sbjct: 799 FRTAKGVFLNGWSDQGHGYAFQTPEAWDNDGKYRSLAYMRPLSIWAMQ 846
>gi|301628717|ref|XP_002943495.1| PREDICTED: non-lysosomal glucosylceramidase [Xenopus (Silurana)
tropicalis]
Length = 692
Score = 207 bits (526), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 102/230 (44%), Positives = 143/230 (62%), Gaps = 12/230 (5%)
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
FD+ G G+++N GF DQT +YCGGL +AA+ M +M+ +L D Q++F+
Sbjct: 444 FDEDGDGLIENSGFADQTYDDWVMTGPSSYCGGLWLAAVCMMCKMAEVLGDGAAQEKFTD 503
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT--EAIFPSAN 195
L + + LLW G YY FD +S+S+M DQ G FL+A G+ +FP +
Sbjct: 504 ILHRGKAAFEKLLWNGKYYNFDCGDQPYSNSVMVDQCAGNWFLRACGLGAGESEVFPKDH 563
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
+ AL TIF NV+ F G MGAVNGM+P+G D S++QS+EVW GV Y L+A M++EG
Sbjct: 564 VVSALRTIFELNVKQFADGQMGAVNGMRPDGTTDTSSVQSDEVWIGVVYGLAATMIHEGL 623
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
V E + TA G YRTV+ER G+ F+TPE + +RS YMR L+++AMQ
Sbjct: 624 VQEGFATAEGCYRTVWERLGMSFQTPEAYCEKQVFRSLAYMRPLSIWAMQ 673
>gi|449447581|ref|XP_004141546.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
Length = 944
Score = 207 bits (526), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 119/291 (40%), Positives = 169/291 (58%), Gaps = 15/291 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN+ +VS WKDL KF++ +YRD D F V+P+ + E FDK
Sbjct: 596 DPWLEVNAYNLLNVSRWKDLGSKFVLQVYRDVVATGDKNFAKSVWPSVYVALAFMEQFDK 655
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G+++N GFPDQT AYCGGL +AAL ++S ++D F K
Sbjct: 656 DKDGMIENEGFPDQTYDTWTVKGVSAYCGGLWVAALQAASALASEVDDEAAAHYFWIKYQ 715
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY + LW GSY+ +D+S+ S SI ADQL G + +A G+ I I AL
Sbjct: 716 KARSVY-ETLWNGSYFNYDNSKGPWSSSIQADQLAGQWYARACGLC--PIADEEKIRVAL 772
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
I++ NV + G+ GAVNGM P+G D+S +Q +E+W GVTY ++A M+ EG V+ +
Sbjct: 773 EKIYNFNVMKVKGGTRGAVNGMFPDGSVDKSILQPKEIWAGVTYSVAATMIQEGMVETGF 832
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYL 309
TA G+Y+ + + GLG F+TPE D +RS GYMR LA++AMQ A +
Sbjct: 833 QTAMGIYQAAWAQDGLGYSFQTPEAWDVDDRFRSIGYMRPLAIWAMQWAMM 883
>gi|302787158|ref|XP_002975349.1| hypothetical protein SELMODRAFT_174907 [Selaginella moellendorffii]
gi|300156923|gb|EFJ23550.1| hypothetical protein SELMODRAFT_174907 [Selaginella moellendorffii]
Length = 916
Score = 206 bits (525), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 113/288 (39%), Positives = 166/288 (57%), Gaps = 18/288 (6%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P++ +N YN++D S WKDLN KF+V + RD A D+ F V+P + + FD
Sbjct: 565 HDPWNTLNSYNLYDTSKWKDLNCKFVVQVNRDAAATGDLDFARAVWPAAYAAMAFTDQFD 624
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
+ G G+++N G PDQT +YCGGL +AAL ++ L+ D + F K
Sbjct: 625 RDGDGMIENDGVPDQTYDLWTVDGVSSYCGGLWVAALQAASALADLVGDKPSSRMFHDKF 684
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
+A VY LW GSY+ +D ++S S+ +DQL G + ASG+ I A
Sbjct: 685 LRAKAVYEKSLWNGSYFHYD----KNSTSVQSDQLAGQWYSLASGL--PGIVREDQATSA 738
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L T+F+TNV ++ G+ GAVNGM P+G DR+++QS EVW G TY ++AAM+ +G +E
Sbjct: 739 LGTVFATNVMRYKGGNEGAVNGMLPSGGEDRTSLQSREVWVGTTYAVAAAMIQQGMREEG 798
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+ TA G++ + G G F+TPE D YRS YMR L+++AMQ
Sbjct: 799 FRTAKGVFLNGWSDQGHGYAFQTPEAWDNDGKYRSLAYMRPLSIWAMQ 846
>gi|302773445|ref|XP_002970140.1| hypothetical protein SELMODRAFT_92385 [Selaginella moellendorffii]
gi|300162651|gb|EFJ29264.1| hypothetical protein SELMODRAFT_92385 [Selaginella moellendorffii]
Length = 983
Score = 206 bits (525), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 118/288 (40%), Positives = 168/288 (58%), Gaps = 14/288 (4%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ D S WKDLN KF++ IYRD +D+ F +P + + FD
Sbjct: 634 HDPWVEVNAYNIHDTSRWKDLNSKFVLQIYRDVVATEDLNFARVTWPAVYTAMAYMDRFD 693
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
+ G+++N GFPDQT AYCGGL +AAL M+ +N+ F K
Sbjct: 694 RDRDGMIENDGFPDQTYDTWTVHGISAYCGGLWLAALQAAAAMAEKVNEPNAANYFLRKF 753
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
A +VY LW GSY+ +DS S +S+SI ADQ+ G + ASG+ +F A
Sbjct: 754 RDARKVYDKKLWNGSYFNYDSGSSSNSNSIQADQMAGQWYAWASGL--PPLFDDYKARSA 811
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L I+ NV + G +GAVNGM PNG D +A+QS E+W+GVTY +AAM++EG +++A
Sbjct: 812 LQRIYDYNVMKVKGGKLGAVNGMHPNGRVDETAMQSREIWSGVTYAAAAAMIHEGMLEQA 871
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+TTA G+Y + +G G F+TPE T D +R+ YMR LA++AMQ
Sbjct: 872 FTTAEGIYNAGWSDSGYGYWFQTPEAWTTDGHFRALTYMRPLAIWAMQ 919
>gi|413920603|gb|AFW60535.1| hypothetical protein ZEAMMB73_659971 [Zea mays]
Length = 951
Score = 206 bits (525), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 120/298 (40%), Positives = 169/298 (56%), Gaps = 21/298 (7%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N Y +++ WKDLN KF++ +YRD D F V+P+ + E FDK
Sbjct: 608 DPWFKVNAYTLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYIAMAYMEQFDK 667
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G+++N FPDQT AYCGGL +AAL ++ + D ++ F K
Sbjct: 668 DKDGMIENEDFPDQTYDVWSMAGISAYCGGLWVAALQAASALAREVGDKASEKLFWDKYE 727
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA- 199
KA VY L W GSY+ +D ++ S SI ADQL G + KA G +FP + +A
Sbjct: 728 KAKFVYSKL-WNGSYFSYDDGDNKMSTSIQADQLAGQWYAKACG-----LFPIVDKDKAQ 781
Query: 200 --LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
L ++S NV F+ G+ GA+NGM P+G D SA+QS E+W GVTY L+A+M+ EG V+
Sbjct: 782 SALEKVYSFNVMKFKGGTRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAASMIQEGMVE 841
Query: 258 EAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
+ + TA G+Y + GLG F+TPE D YRS YMR LA++AMQ A K K+
Sbjct: 842 QGFKTAEGVYHAAWSPGGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAMQWALSKPKL 899
>gi|356520149|ref|XP_003528727.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal
glucosylceramidase-like [Glycine max]
Length = 944
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 121/287 (42%), Positives = 162/287 (56%), Gaps = 15/287 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +NGYN+++ WKDLN KF++ IYRD D +F V+P I + FDK
Sbjct: 599 DPWFEVNGYNLYNTDRWKDLNPKFVLQIYRDVVATGDKKFAQAVWPAVYIAIAYMDQFDK 658
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N GFPDQT AY GGL +AAL ++ + D + F K
Sbjct: 659 DGDGMIENEGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALAREVGDKGSEDYFWLKFQ 718
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY +L W GSY+ +DSS S SI ADQL G + +A G+ I AL
Sbjct: 719 KAKSVYENL-WNGSYFNYDSSGGSSSLSIQADQLAGQWYARACGL--LPIVEEKKSRSAL 775
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
+++ NV G GAVNGM P+G D S +QS E+W+GVTY L+A M+ E +D A+
Sbjct: 776 QIVYNHNVMKVHXGRRGAVNGMLPDGKVDMSTMQSREIWSGVTYALAATMIQENMIDMAF 835
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
TAGG+Y T + GLG F+TPE T YRS YMR LA++AMQ
Sbjct: 836 QTAGGVYETAWSNNGLGYSFQTPEAWTTKDEYRSLCYMRPLAIWAMQ 882
>gi|302807226|ref|XP_002985326.1| hypothetical protein SELMODRAFT_121820 [Selaginella moellendorffii]
gi|300147154|gb|EFJ13820.1| hypothetical protein SELMODRAFT_121820 [Selaginella moellendorffii]
Length = 983
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 118/288 (40%), Positives = 168/288 (58%), Gaps = 14/288 (4%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ D S WKDLN KF++ IYRD +D+ F +P + + FD
Sbjct: 634 HDPWVEVNAYNIHDTSRWKDLNSKFVLQIYRDVVATEDLNFARVTWPAVYTAMAYMDRFD 693
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
+ G+++N GFPDQT AYCGGL +AAL M+ +N+ F K
Sbjct: 694 RDRDGMIENDGFPDQTYDTWTVHGISAYCGGLWLAALQAAAAMAEKVNEPNAANYFLRKF 753
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
A +VY LW GSY+ +DS S +S+SI ADQ+ G + ASG+ +F A
Sbjct: 754 RDARKVYDKKLWNGSYFNYDSGSSSNSNSIQADQMAGQWYAWASGL--PPLFDDYKARSA 811
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L I+ NV + G +GAVNGM PNG D +A+QS E+W+GVTY +AAM++EG +++A
Sbjct: 812 LQRIYDYNVMKVKGGKLGAVNGMHPNGRVDETAMQSREIWSGVTYAAAAAMIHEGMLEQA 871
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+TTA G+Y + +G G F+TPE T D +R+ YMR LA++AMQ
Sbjct: 872 FTTAEGIYNAGWSDSGYGYWFQTPEAWTTDGHFRALTYMRPLAIWAMQ 919
>gi|225433442|ref|XP_002285674.1| PREDICTED: non-lysosomal glucosylceramidase-like [Vitis vinifera]
Length = 978
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 119/288 (41%), Positives = 167/288 (57%), Gaps = 14/288 (4%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ D S WKDLN KF++ +YRD+A +D F V+P + E FD
Sbjct: 638 HDPWHEMNAYNIHDTSQWKDLNPKFVLQVYRDFAATRDFSFGADVWPAVRAAMEYMEQFD 697
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
+ G+++N GFPDQT AYCG L +AAL M+ L D ++ +K
Sbjct: 698 RDSDGLIENDGFPDQTYDTWTVHGISAYCGCLWLAALQAAAAMALQLGDKPFAEKCKSKF 757
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
KA V+ + LW GSY+ +DS S +S SI ADQL G + +SG+ ++F I +
Sbjct: 758 FKAKLVFEEKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASSGL--PSLFDDYKIKSS 815
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L I+ NV + G MGAVNGM PNG D S +QS E+WTGVTY ++A M+ G ++A
Sbjct: 816 LHKIYDFNVMKVKGGKMGAVNGMHPNGKVDESCMQSREIWTGVTYGVAATMILSGMEEQA 875
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+TTA G++ + G G F+TPEG T D +RS YMR LA++ MQ
Sbjct: 876 FTTAEGIFTAGWSEEGYGYWFQTPEGWTIDGHFRSLIYMRPLAIWGMQ 923
>gi|224146269|ref|XP_002325943.1| predicted protein [Populus trichocarpa]
gi|222862818|gb|EEF00325.1| predicted protein [Populus trichocarpa]
Length = 891
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 119/294 (40%), Positives = 165/294 (56%), Gaps = 18/294 (6%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ IN YN+F + WKDLN KF++ +YRD D F V+P+ + + FDK
Sbjct: 549 DPWFEINAYNLFSTARWKDLNCKFVLQVYRDVVATGDKDFARAVWPSVYVAMAYMDQFDK 608
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N G PDQT AYCGGL +AAL M+ + D F K
Sbjct: 609 DGDGMIENEGIPDQTYDTWSVTGVSAYCGGLWVAALQATSAMAHEVGDDGSANYFWIKYQ 668
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY L W GSY+ +DSS ++ SI ADQL G + +A G++ + +AL
Sbjct: 669 KAKTVYGKL-WNGSYFNYDSSGNK---SIQADQLAGQWYARACGLS--PVVDGDKARKAL 722
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
I+ NV + G+ GAVNGM P+G D +A+Q++E+W GVTY LSA+M+ EG + A+
Sbjct: 723 EKIYEFNVLKVKGGTRGAVNGMLPDGRVDLTAMQTKEIWPGVTYALSASMIQEGLEEMAF 782
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGK 312
TA G+Y + GLG F+ PE + YRS YMR LA++AMQ A K K
Sbjct: 783 QTAVGIYNAAWSEEGLGYSFQIPESWDMNDQYRSLCYMRPLAIWAMQWALSKPK 836
>gi|357440847|ref|XP_003590701.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
gi|355479749|gb|AES60952.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
Length = 992
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 124/313 (39%), Positives = 173/313 (55%), Gaps = 18/313 (5%)
Query: 9 EVKFRSAANTHERYYTRWFPEE----EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL 64
+VKF + N R P + +P+ +N YN+ D S WKDLN KF++ +YRD++
Sbjct: 627 KVKFLAEGNWGTRKVYGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFSA 686
Query: 65 HKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIA 114
D++F V+P + E FD+ G+++N GFPDQT AYCGGL +A
Sbjct: 687 TGDLQFGVDVWPAVRAAMEYMEQFDRDADGLIENDGFPDQTYDTWTVHGVSAYCGGLWLA 746
Query: 115 ALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQL 174
AL M+ L D + K KA VY LW GSY+ +DS S +S SI ADQL
Sbjct: 747 ALQAAAAMALQLGDRDFAETCKRKFLKAKPVYEQKLWNGSYFNYDSGSSSNSKSIQADQL 806
Query: 175 CGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQ 234
G + +SG+ ++F I +L +F NV + G MGAVNGM PNG D + +Q
Sbjct: 807 AGQWYTASSGL--PSLFDDFKIKSSLRKVFDFNVMKVKGGRMGAVNGMHPNGKVDETCMQ 864
Query: 235 SEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRS 292
S E+W GVTY ++A M+ G +EA+TTA G++ + G G F+TPE T D YRS
Sbjct: 865 SREIWAGVTYGVAATMILAGMEEEAFTTAEGIFLAGWSEEGSGYWFQTPEAFTIDGHYRS 924
Query: 293 GGYMRALAVYAMQ 305
YMR L+++ MQ
Sbjct: 925 LIYMRPLSIWGMQ 937
>gi|357440839|ref|XP_003590697.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
gi|355479745|gb|AES60948.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
Length = 1103
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 125/315 (39%), Positives = 174/315 (55%), Gaps = 18/315 (5%)
Query: 9 EVKFRSAANTHERYYTRWFPEE----EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL 64
+VKF + N R P + +P+ +N YN+ D S WKDLN KF++ +YRD++
Sbjct: 738 KVKFLAEGNWGTRKVYGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFSA 797
Query: 65 HKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIA 114
D++F V+P + E FD+ G+++N GFPDQT AYCGGL +A
Sbjct: 798 TGDLQFGVDVWPAVRAAMEYMEQFDRDADGLIENDGFPDQTYDTWTVHGVSAYCGGLWLA 857
Query: 115 ALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQL 174
AL M+ L D + K KA VY LW GSY+ +DS S +S SI ADQL
Sbjct: 858 ALQAAAAMALQLGDRDFAETCKRKFLKAKPVYEQKLWNGSYFNYDSGSSSNSKSIQADQL 917
Query: 175 CGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQ 234
G + +SG+ ++F I +L +F NV + G MGAVNGM PNG D + +Q
Sbjct: 918 AGQWYTASSGL--PSLFDDFKIKSSLRKVFDFNVMKVKGGRMGAVNGMHPNGKVDETCMQ 975
Query: 235 SEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRS 292
S E+W GVTY ++A M+ G +EA+TTA G++ + G G F+TPE T D YRS
Sbjct: 976 SREIWAGVTYGVAATMILAGMEEEAFTTAEGIFLAGWSEEGSGYWFQTPEAFTIDGHYRS 1035
Query: 293 GGYMRALAVYAMQDA 307
YMR L+++ MQ A
Sbjct: 1036 LIYMRPLSIWGMQYA 1050
>gi|255575898|ref|XP_002528846.1| conserved hypothetical protein [Ricinus communis]
gi|223531697|gb|EEF33520.1| conserved hypothetical protein [Ricinus communis]
Length = 952
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 116/298 (38%), Positives = 171/298 (57%), Gaps = 21/298 (7%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N Y++++ WKDLN KF++ +YRD D +F V+P+ + + FD+
Sbjct: 607 DPWYEVNAYSLYNTDRWKDLNPKFVLQVYRDVVATGDKKFAEAVWPSVYIAMAYMDQFDR 666
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N GFPDQT AY GGL +AAL ++ + D + F A+
Sbjct: 667 DGDGMIENDGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALAREVGDKGSEDYFWARFQ 726
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA- 199
KA VY D LW GSY+ +D+S R+S SI ADQL G + +A G +FP + +A
Sbjct: 727 KAKDVY-DKLWNGSYFNYDNSGGRNSSSIQADQLAGQWYARACG-----LFPIVDKDKAR 780
Query: 200 --LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
L +++ NV + G GA+NGM P+G D S++QS E+W+GVTY L+A M+ E +D
Sbjct: 781 SALEKVYNYNVLKVKDGKRGAINGMLPDGKVDLSSMQSREIWSGVTYALAATMIQEDMLD 840
Query: 258 EAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
A+ TA G+Y + GLG F+TPE YRS YMR LA++AMQ A + K+
Sbjct: 841 MAFHTASGIYEAAWSERGLGYSFQTPEAWNNVDQYRSLCYMRPLAIWAMQWALSRPKL 898
>gi|255554158|ref|XP_002518119.1| conserved hypothetical protein [Ricinus communis]
gi|223542715|gb|EEF44252.1| conserved hypothetical protein [Ricinus communis]
Length = 968
Score = 203 bits (517), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 121/313 (38%), Positives = 172/313 (54%), Gaps = 18/313 (5%)
Query: 9 EVKFRSAANTHERYYTRWFPEE----EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL 64
+VKF + N R P + +P++ +N YN+ D S WKDLN KF++ +YRD+A
Sbjct: 603 KVKFLAEGNVGIRKVRGAVPHDLGTHDPWNEMNAYNIHDTSKWKDLNPKFVLQVYRDFAA 662
Query: 65 HKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIA 114
+D+ F V+P + E FD+ G +++N GFPDQT AYCG L +A
Sbjct: 663 TQDMSFGVDVWPAVRSAMEYMEQFDRDGDALIENDGFPDQTYDAWTVHGVSAYCGCLWLA 722
Query: 115 ALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQL 174
AL M+ + D + +K KA + LW GSY+ +DS S +S SI ADQL
Sbjct: 723 ALEAAAAMALQVGDKYFAELCRSKFVKAKSAFEAKLWNGSYFNYDSGSSSNSKSIQADQL 782
Query: 175 CGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQ 234
G ++ +SG+ +F + I L I+ NV G MGAVNGM PNG D + +Q
Sbjct: 783 AGQWYVASSGL--PPLFDDSKIKSTLQKIYDFNVMKVRGGRMGAVNGMHPNGKVDETCMQ 840
Query: 235 SEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRS 292
S E+WTGVTY ++A M+ G D+A+ A G++ + G G F+TPEG T D +RS
Sbjct: 841 SREIWTGVTYAVAATMILAGMEDKAFAAAEGIFLAGWSEDGYGYWFQTPEGWTTDGHFRS 900
Query: 293 GGYMRALAVYAMQ 305
YMR LA++ MQ
Sbjct: 901 LIYMRPLAIWGMQ 913
>gi|281212396|gb|EFA86556.1| hypothetical protein PPL_00357 [Polysphondylium pallidum PN500]
Length = 4775
Score = 203 bits (516), Expect = 9e-50, Method: Composition-based stats.
Identities = 108/285 (37%), Positives = 162/285 (56%), Gaps = 13/285 (4%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIR-FLTRVYPTCLELIRKCE 86
P E+P+ +N Y++ DVS WKDL KF++ IYRDY ++ + R FL +++ E+IR+
Sbjct: 871 PGEDPWKRVNAYHIQDVSRWKDLPSKFVLQIYRDYLINGNDRTFLLQMWGVVEEVIRRAF 930
Query: 87 SFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFS 136
+D G++ N G PDQT AY GGL +AA+ EM+ +L + + +
Sbjct: 931 EYDIDIDGVIDNEGVPDQTYDAWSALGCSAYSGGLWLAAVKVASEMARILGLKEDETVYK 990
Query: 137 AKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANI 196
K + Y LW G Y+ +DSS++ H DSIM+DQL G+ +L A G+ + A
Sbjct: 991 KIFEKGKKSYSTKLWNGHYFNYDSSKNPHFDSIMSDQLAGHWYLLACGLPSYITLDQA-- 1048
Query: 197 ARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNV 256
+L+ I NV+ + GS GAVNGM+P G D +++QS EVW G +Y L++ ML
Sbjct: 1049 LSSLSIINEYNVKSYSNGSCGAVNGMRPEGGPDTTSLQSCEVWIGTSYGLASTMLLHFMD 1108
Query: 257 DEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAV 301
+EAW GL + Y + G ++TPE + YR+G YMR LA+
Sbjct: 1109 NEAWELIKGLVDSTYNKWGFQYQTPEAWDQNGLYRAGTYMRPLAI 1153
>gi|224120058|ref|XP_002331126.1| predicted protein [Populus trichocarpa]
gi|222872854|gb|EEF09985.1| predicted protein [Populus trichocarpa]
Length = 948
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 116/288 (40%), Positives = 163/288 (56%), Gaps = 15/288 (5%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ + WKDLN KF++ +YRD D +F V+P+ + + FD
Sbjct: 602 HDPWFEVNAYNLHNTDRWKDLNPKFVLQVYRDVIATGDKKFARAVWPSVYVAMAYMDQFD 661
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
+ G G+++N GFPDQT AYCGGL +AAL ++ + D + F +
Sbjct: 662 RDGDGMIENDGFPDQTYDTWSLSGVSAYCGGLWVAALQAASALAWEVGDKDSAEYFWCRF 721
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
KA VY L W GSY+ +D+S +S SI ADQL G + +A G++ I A
Sbjct: 722 QKAKIVYGKL-WNGSYFNYDNSGGINSSSIQADQLAGQWYARACGLS--PIVDEDKARCA 778
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L I++ NV + G GAVNGM P+G D S +QS E+W+GVTY ++A M+ EG D A
Sbjct: 779 LEKIYNYNVLKVQDGKRGAVNGMLPDGTVDLSCLQSREIWSGVTYAVAATMIQEGLTDMA 838
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+ TA G+Y V+ GLG F+TPEG YRS YMR LA++AMQ
Sbjct: 839 FHTASGVYEAVWAEQGLGYSFQTPEGWNTTDQYRSLCYMRPLAIWAMQ 886
>gi|449463809|ref|XP_004149624.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
gi|449494604|ref|XP_004159595.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
Length = 951
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 117/290 (40%), Positives = 169/290 (58%), Gaps = 15/290 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +NGYN+++ WKDLN KF++ IYRD D++F V+P I + FD+
Sbjct: 609 DPWFEVNGYNLYNTDRWKDLNPKFVLQIYRDVVATGDMKFAKAVWPPVYLAIAYMDQFDR 668
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N GFPDQT AY GGL +AAL ++ + ++ + F K
Sbjct: 669 DGDGMIENDGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALARVADEKDAEHYFWFKFQ 728
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA + Y + LW GSY+ +DSS S SI ADQL G + +ASG+ I AL
Sbjct: 729 KAKRAY-EKLWNGSYFNYDSSGGSSSSSIQADQLAGQWYARASGL--LPIVDEDKAKSAL 785
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
A +++ NV + G GAVNGM P+G D S++QS E+W+GVTY ++A+M++E D A+
Sbjct: 786 AKVYNYNVMKVKDGKRGAVNGMLPDGTIDFSSMQSREIWSGVTYAVAASMIHEDMTDMAF 845
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAY 308
TA G++ + GLG F+TPE T YRS YMR LA++AMQ A+
Sbjct: 846 RTAEGIHEAAWSEDGLGYNFQTPEAWTTTDRYRSLCYMRPLAIWAMQWAF 895
>gi|115474457|ref|NP_001060825.1| Os08g0111200 [Oryza sativa Japonica Group]
gi|42408344|dbj|BAD09496.1| putative Bile acid beta-glucosidase [Oryza sativa Japonica Group]
gi|42408391|dbj|BAD09542.1| putative Bile acid beta-glucosidase [Oryza sativa Japonica Group]
gi|113622794|dbj|BAF22739.1| Os08g0111200 [Oryza sativa Japonica Group]
gi|215701490|dbj|BAG92914.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 928
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 114/287 (39%), Positives = 163/287 (56%), Gaps = 14/287 (4%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N Y + D WKDLN KF++ +YRD + F +P + + FD+
Sbjct: 580 DPWFEVNAYMLHDPVRWKDLNPKFVLQVYRDVVATGNAGFAEAAWPAVYLAMAYMDQFDR 639
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+V+N G PDQT AY GGL +AAL M+ ++ D + F +
Sbjct: 640 DGDGMVENEGRPDQTYDLWSVSGVSAYTGGLWVAALQAAAAMAGIVGDGAAEAYFRGRYH 699
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
+A +VY D LW G Y+ +D+S S SI ADQL G + +A G+ E I RAL
Sbjct: 700 RARRVYTDELWNGGYFNYDNSGGATSSSIQADQLAGQWYARACGL--EPIVDGDKARRAL 757
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
AT+ NV + G++GAVNGM+P+G D S+ QS+EVW GVTY ++AAM++EG + A+
Sbjct: 758 ATVLDYNVMRVKGGAIGAVNGMRPDGAVDASSTQSKEVWPGVTYAVAAAMIHEGMPEAAF 817
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
TA G++ + + G G F+TPE T D YR+ YMR L V+AMQ
Sbjct: 818 KTAKGIHDAGWGKHGFGYAFQTPESWTADGGYRALHYMRPLGVWAMQ 864
>gi|357157041|ref|XP_003577664.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
distachyon]
Length = 951
Score = 200 bits (508), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 114/287 (39%), Positives = 163/287 (56%), Gaps = 15/287 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N Y + + WKDLN KF++ +YRD + F V+P+ + E FDK
Sbjct: 608 DPWFKVNAYTLHNTDRWKDLNPKFVLQVYRDVVATGNKSFARAVWPSVYMAMAYMEQFDK 667
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G+++N FPDQT AYCGGL +AAL ++ + D ++ F K
Sbjct: 668 DKDGMIENEDFPDQTYDVWSMAGVSAYCGGLWVAALQAASALAREVGDKASEELFWNKYE 727
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY L W GSY+ +D + ++ S SI ADQL G + K+ G+++ I AL
Sbjct: 728 KAKSVYGKL-WNGSYFNYDDAGTKVSTSIHADQLAGQWYAKSCGLSS--IVDKDKSQSAL 784
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
I+S NV F+ G GA+NGM P+G D S +QS E+W GVTY L+A+M+ EG V+E +
Sbjct: 785 EKIYSFNVMKFKDGKRGAMNGMWPDGTVDMSTMQSREIWPGVTYALAASMIQEGMVEEGF 844
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
TA G+Y + GLG F+TPE T D YRS YMR LA++++Q
Sbjct: 845 KTAEGIYHAAWSPEGLGYAFQTPEAWTNDDGYRSLCYMRPLAIWSIQ 891
>gi|115484889|ref|NP_001067588.1| Os11g0242100 [Oryza sativa Japonica Group]
gi|77549531|gb|ABA92328.1| expressed protein [Oryza sativa Japonica Group]
gi|113644810|dbj|BAF27951.1| Os11g0242100 [Oryza sativa Japonica Group]
gi|215704397|dbj|BAG93831.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222615769|gb|EEE51901.1| hypothetical protein OsJ_33494 [Oryza sativa Japonica Group]
Length = 950
Score = 200 bits (508), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 118/298 (39%), Positives = 165/298 (55%), Gaps = 21/298 (7%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N Y +++ WKDLN KF++ +YRD D F V+P+ + E FD+
Sbjct: 608 DPWFKVNAYTLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYMAMAYMEQFDR 667
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G+++N FPDQT AYCGGL +AAL ++ + D ++ F K
Sbjct: 668 DKDGMIENEDFPDQTYDVWSMAGISAYCGGLWVAALQAASALAHEVGDKASEKLFWDKYE 727
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA- 199
KA VY L W GSY+ +D + S SI ADQL G + KA G +FP + +A
Sbjct: 728 KAKSVYGKL-WNGSYFNYDDGDNIMSASIHADQLAGQWYAKACG-----LFPIVDKDKAE 781
Query: 200 --LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
L I+S NV F+ G GA+NGM P+G D SA+QS E+W GVTY L+A M+ EG V+
Sbjct: 782 SALEKIYSFNVMKFKDGKRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAATMIQEGMVE 841
Query: 258 EAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
+ + TA G+Y + GLG F+TPE D YRS YMR LA++A+Q A K+
Sbjct: 842 KGFKTAEGIYHAAWSPEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAIQWALSNPKL 899
>gi|218185509|gb|EEC67936.1| hypothetical protein OsI_35656 [Oryza sativa Indica Group]
Length = 950
Score = 200 bits (508), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 118/298 (39%), Positives = 165/298 (55%), Gaps = 21/298 (7%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N Y +++ WKDLN KF++ +YRD D F V+P+ + E FD+
Sbjct: 608 DPWFKVNAYTLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYMAMAYMEQFDR 667
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G+++N FPDQT AYCGGL +AAL ++ + D ++ F K
Sbjct: 668 DKDGMIENEDFPDQTYDVWSMAGISAYCGGLWVAALQAASALAHEVGDKASEKLFWDKYE 727
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA- 199
KA VY L W GSY+ +D + S SI ADQL G + KA G +FP + +A
Sbjct: 728 KAKSVYGKL-WNGSYFNYDDGDNIMSASIHADQLAGQWYAKACG-----LFPIVDKDKAE 781
Query: 200 --LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
L I+S NV F+ G GA+NGM P+G D SA+QS E+W GVTY L+A M+ EG V+
Sbjct: 782 SALEKIYSFNVMKFKDGKRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAATMIQEGMVE 841
Query: 258 EAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
+ + TA G+Y + GLG F+TPE D YRS YMR LA++A+Q A K+
Sbjct: 842 KGFKTAEGIYHAAWSPEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAIQWALSNPKL 899
>gi|62733283|gb|AAX95400.1| At5g49900 [Oryza sativa Japonica Group]
Length = 931
Score = 200 bits (508), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 118/298 (39%), Positives = 165/298 (55%), Gaps = 21/298 (7%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N Y +++ WKDLN KF++ +YRD D F V+P+ + E FD+
Sbjct: 589 DPWFKVNAYTLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYMAMAYMEQFDR 648
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G+++N FPDQT AYCGGL +AAL ++ + D ++ F K
Sbjct: 649 DKDGMIENEDFPDQTYDVWSMAGISAYCGGLWVAALQAASALAHEVGDKASEKLFWDKYE 708
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA- 199
KA VY L W GSY+ +D + S SI ADQL G + KA G +FP + +A
Sbjct: 709 KAKSVYGKL-WNGSYFNYDDGDNIMSASIHADQLAGQWYAKACG-----LFPIVDKDKAE 762
Query: 200 --LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
L I+S NV F+ G GA+NGM P+G D SA+QS E+W GVTY L+A M+ EG V+
Sbjct: 763 SALEKIYSFNVMKFKDGKRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAATMIQEGMVE 822
Query: 258 EAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
+ + TA G+Y + GLG F+TPE D YRS YMR LA++A+Q A K+
Sbjct: 823 KGFKTAEGIYHAAWSPEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAIQWALSNPKL 880
>gi|297831192|ref|XP_002883478.1| hypothetical protein ARALYDRAFT_479911 [Arabidopsis lyrata subsp.
lyrata]
gi|297329318|gb|EFH59737.1| hypothetical protein ARALYDRAFT_479911 [Arabidopsis lyrata subsp.
lyrata]
Length = 950
Score = 200 bits (508), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 119/313 (38%), Positives = 166/313 (53%), Gaps = 18/313 (5%)
Query: 9 EVKFRSAANTHERYYTRWFPEE----EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL 64
+VKF + N R P + +P++ +N YN+ D S WKDLN KF++ +YRD+A
Sbjct: 587 KVKFLAEGNVGIRKVRGAVPHDLGMHDPWNEMNAYNIHDTSKWKDLNPKFVLQVYRDFAA 646
Query: 65 HKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIA 114
D +F V+P + E FD+ +++N GFPDQT AYCG L +A
Sbjct: 647 TGDYQFGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLA 706
Query: 115 ALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQL 174
AL M+ + D + K A LW GSY+ +DS S +S SI DQL
Sbjct: 707 ALQAAAAMALQIGDKFFAELCKNKFLNAKAALETKLWNGSYFNYDSGASSNSKSIQTDQL 766
Query: 175 CGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQ 234
G + +SG+ IF + I + IF NV + G MGAVNGM P+G D + +Q
Sbjct: 767 AGQWYAASSGL--PPIFEESKIRSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDTCMQ 824
Query: 235 SEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRS 292
S E+WTGVTY +A M+ G ++ +TTA G++ + G G F+TPEG T D YRS
Sbjct: 825 SREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGWSEEGFGYWFQTPEGWTMDGHYRS 884
Query: 293 GGYMRALAVYAMQ 305
YMR LA++ MQ
Sbjct: 885 LIYMRPLAIWGMQ 897
>gi|297739499|emb|CBI29681.3| unnamed protein product [Vitis vinifera]
Length = 949
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 118/295 (40%), Positives = 166/295 (56%), Gaps = 15/295 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +NGYN+++ WKDLN KF++ +YRD D F V+P+ + FDK
Sbjct: 605 DPWFEVNGYNLYNTDRWKDLNPKFVLQVYRDVVATGDKNFAKAVWPSVYVALAYMNQFDK 664
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N GFPDQT AY GGL +AAL ++ ++ D + F K
Sbjct: 665 DGDGMIENEGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALARVVGDKGSEDYFWFKFQ 724
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY L W GSY+ +D S S SI ADQL G + +A G++ I AL
Sbjct: 725 KAKGVYQKL-WNGSYFNYDDSDGSSSSSIQADQLAGQWYARACGLS--PIVDEDKAKSAL 781
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
++ NV G GAVNGM P+G D + +QS E+W+GVTY ++A M++EG VD A+
Sbjct: 782 EKVYHYNVLKVGGGKRGAVNGMLPDGKVDTTTMQSREIWSGVTYGVAATMIHEGLVDMAF 841
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
TA G+Y + + GLG F+TPE D YRS YMR LA++AMQ A+ + K+
Sbjct: 842 QTASGVYEAAWSQEGLGYSFQTPESWNTDDQYRSLCYMRPLAIWAMQWAFSQPKL 896
>gi|359486233|ref|XP_002264575.2| PREDICTED: non-lysosomal glucosylceramidase [Vitis vinifera]
Length = 960
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 118/295 (40%), Positives = 166/295 (56%), Gaps = 15/295 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +NGYN+++ WKDLN KF++ +YRD D F V+P+ + FDK
Sbjct: 616 DPWFEVNGYNLYNTDRWKDLNPKFVLQVYRDVVATGDKNFAKAVWPSVYVALAYMNQFDK 675
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+++N GFPDQT AY GGL +AAL ++ ++ D + F K
Sbjct: 676 DGDGMIENEGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALARVVGDKGSEDYFWFKFQ 735
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY L W GSY+ +D S S SI ADQL G + +A G++ I AL
Sbjct: 736 KAKGVYQKL-WNGSYFNYDDSDGSSSSSIQADQLAGQWYARACGLS--PIVDEDKAKSAL 792
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
++ NV G GAVNGM P+G D + +QS E+W+GVTY ++A M++EG VD A+
Sbjct: 793 EKVYHYNVLKVGGGKRGAVNGMLPDGKVDTTTMQSREIWSGVTYGVAATMIHEGLVDMAF 852
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
TA G+Y + + GLG F+TPE D YRS YMR LA++AMQ A+ + K+
Sbjct: 853 QTASGVYEAAWSQEGLGYSFQTPESWNTDDQYRSLCYMRPLAIWAMQWAFSQPKL 907
>gi|326517703|dbj|BAK03770.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 821
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 118/289 (40%), Positives = 167/289 (57%), Gaps = 16/289 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N Y + D S WKDLN KF++ +YRD A ++ F T +P + + FD+
Sbjct: 466 DPWFELNAYMIHDPSRWKDLNPKFVLQVYRDVAATGNVTFATAAWPAVYLAMAYMDQFDR 525
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+V+N G PDQT AY GGL +AAL M+ ++ D + F +
Sbjct: 526 DGDGMVENEGRPDQTYDLWSVSGVSAYTGGLWLAALQAAAAMARIVGDRGAEGYFLERYK 585
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
+A +VY LW GSY+ +D+S S SIMADQL G + +A G+ E I AL
Sbjct: 586 RAQRVYDGELWNGSYFDYDNSGCATSKSIMADQLAGQWYARACGL--EPIVEEEKARSAL 643
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
T+ NV + G++GAVNGM+P+G D S++QS+EVW GVTY ++AAML+EG + A+
Sbjct: 644 GTVLDYNVMRVQGGAVGAVNGMRPDGAVDASSLQSKEVWVGVTYGVAAAMLHEGMTEAAF 703
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDK--TYRSGGYMRALAVYAMQ 305
TA G + + R G G F+TPE T D YRS YMR L+++AMQ
Sbjct: 704 RTAKGAHDAGWGRDGFGYAFQTPEAWTSDAGGGYRSLHYMRPLSIWAMQ 752
>gi|413941636|gb|AFW74285.1| hypothetical protein ZEAMMB73_466373 [Zea mays]
Length = 966
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 116/288 (40%), Positives = 169/288 (58%), Gaps = 15/288 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N Y + D + WKDLN KF++ +YRD D F V+P + + FD+
Sbjct: 613 DPWVELNAYMLHDAARWKDLNPKFVLQVYRDAVATGDAAFAEAVWPAVYVAMAYMDQFDR 672
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+V+N G PDQT AY GG+ +AAL M+ ++ D + F A+
Sbjct: 673 DGDGMVENEGVPDQTYDMWSVSGVSAYTGGVWVAALQAAAGMARVVGDRPAEAYFRARRD 732
Query: 141 KASQVYHDLLWTGSYYKFDSSQSR-HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
+A++VY LWTG+Y+++D+S S+SIMADQL G + +A G+ E + A A
Sbjct: 733 RAARVYDAELWTGAYFRYDNSGGGGASESIMADQLAGQWYARACGL--EPVVEDAKARSA 790
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
LAT+ NV + G++GAVNGM+P+G D S+ QS+EVW GVTY ++AAM++EG + A
Sbjct: 791 LATVLDYNVMRVKGGAIGAVNGMRPDGAVDASSTQSKEVWPGVTYAVAAAMVHEGMTEAA 850
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+ TA G + + G G F+TPE T D YRS YMR L ++AMQ
Sbjct: 851 FRTAKGAHDAAWGNDGFGYAFQTPEAWTEDGGYRSLHYMRPLGIWAMQ 898
>gi|341888271|gb|EGT44206.1| hypothetical protein CAEBREN_07747 [Caenorhabditis brenneri]
Length = 820
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 118/297 (39%), Positives = 164/297 (55%), Gaps = 23/297 (7%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIR-----FLTRVYPTCLELI 82
P +P+ N Y + D WKDLNLKF++S YRDY + R L C +++
Sbjct: 523 PMADPWLHTNAYILHDTGSWKDLNLKFVISCYRDYKMIAKDREDKDQLLEFFLGKCTKIV 582
Query: 83 RKC-ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQ 131
ES+DK G+++N GF DQT AYCG L +AAL C +EM L
Sbjct: 583 EDALESWDKDKDGMIENDGFADQTYDVWKMTGTSAYCGSLWLAALTCYIEM--LKQAGSP 640
Query: 132 QQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIF 191
+ + KL A Y LW G ++KFD +S +MADQLCG+ L A ++
Sbjct: 641 SKHYEGKLLNAYDAYTTKLWNGRFFKFDELPD-NSKIVMADQLCGFWALTA--MDEPVQV 697
Query: 192 PSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAML 251
+ AL TIF NV+ + G GAVNG + D S+IQSEEVW G+TY LSA M+
Sbjct: 698 SEEKMKLALETIFKYNVQMYNDGKCGAVNGFLTSERVDGSSIQSEEVWAGITYALSAMMI 757
Query: 252 YEGNVDE-AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
E N+DE A+ T+ GL+ +++ R L ++TPE +T D YR+ GYMR L+++A+Q A
Sbjct: 758 -EKNMDEMAFKTSEGLFESIWNRFPLQYQTPEAITSDGMYRALGYMRPLSIWAIQHA 813
>gi|328869576|gb|EGG17953.1| hypothetical protein DFA_06619 [Dictyostelium fasciculatum]
Length = 1200
Score = 197 bits (500), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 108/312 (34%), Positives = 164/312 (52%), Gaps = 36/312 (11%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY------------------------A 63
P E+P+ +N YN+ D+S WKDL KFI+ +YRDY
Sbjct: 872 PGEDPWKRVNAYNIQDISRWKDLPCKFILQVYRDYLLTSVDSSGQKSNATSSSFGQEYIP 931
Query: 64 LHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHI 113
L D FL +++ T E+IRK FD G++ N GFPDQT AY GGL +
Sbjct: 932 LDGDRGFLLQMWGTVEEVIRKAFEFDTDDDGVIDNEGFPDQTYDTWSASGCSAYTGGLWL 991
Query: 114 AALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQ 173
AA+ M+ +L ++ +S K + Y+ LW G Y+ +DSS+ H +SIMADQ
Sbjct: 992 AAIKATSAMARILGLRDDEEVYSKLFEKGKKSYNKKLWNGHYFNYDSSKQSHYNSIMADQ 1051
Query: 174 LCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAI 233
L G+ +L + G+++ A L+ I N++ + GS GAVNG+ P D++ +
Sbjct: 1052 LAGHWYLTSCGLSSYMTLDQA--LSTLSIINEYNIKSYSNGSCGAVNGISPLAIVDQTCL 1109
Query: 234 QSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSG 293
QS EVW G +Y L++ L EAW+ GL + YE+ G ++TPE + ++R+
Sbjct: 1110 QSSEVWIGTSYSLASTFLLHYMDKEAWSLIKGLVNSSYEKWGFQYQTPEAWDMNGSFRAS 1169
Query: 294 GYMRALAVYAMQ 305
YMR LA++++Q
Sbjct: 1170 TYMRPLAIWSVQ 1181
>gi|341899054|gb|EGT54989.1| hypothetical protein CAEBREN_31176 [Caenorhabditis brenneri]
Length = 831
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 117/294 (39%), Positives = 163/294 (55%), Gaps = 23/294 (7%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIR-----FLTRVYPTCLELIRKC 85
+P+ N Y + D WKDLNLKF++S YRDY + R L C +++
Sbjct: 537 DPWLHTNAYILHDTGSWKDLNLKFVISCYRDYKMIAKDREDKDQLLEFFLGKCTKIVEDA 596
Query: 86 -ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQ 134
ES+DK G+++N GF DQT AYCG L +AAL C +EM L +
Sbjct: 597 LESWDKDKDGMIENDGFADQTYDVWKMTGTSAYCGSLWLAALTCYIEM--LKQAGSPSKN 654
Query: 135 FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSA 194
+ KL A Y LW G ++KFD +S +MADQLCG+ L A ++
Sbjct: 655 YEGKLLNAYDAYITKLWNGRFFKFDELPD-NSKIVMADQLCGFWALTA--MDEPVQVSEE 711
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
+ AL TIF NV+ + G GAVNG + D S+IQSEEVW G+TY LSA M+ E
Sbjct: 712 KMKLALETIFKYNVQMYNDGKCGAVNGFLTSERVDGSSIQSEEVWAGITYALSAMMI-EK 770
Query: 255 NVDE-AWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
N+DE A+ T+ GL+ +++ R L ++TPE +T D YR+ GYMR L+++A+Q A
Sbjct: 771 NMDEMAFKTSEGLFESIWNRFPLQYQTPEAITSDGMYRALGYMRPLSIWAIQHA 824
>gi|308471410|ref|XP_003097936.1| hypothetical protein CRE_12977 [Caenorhabditis remanei]
gi|308239241|gb|EFO83193.1| hypothetical protein CRE_12977 [Caenorhabditis remanei]
Length = 845
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 115/297 (38%), Positives = 166/297 (55%), Gaps = 18/297 (6%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIR-KCES 87
+P+ N Y + D WKDLNLKF++S YRD+ + + D L C +++ ES
Sbjct: 554 DPWLHTNAYILHDTGCWKDLNLKFVISCYRDWKMISNGDQEILEFFIGKCTKIVDGALES 613
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
+DK G+++N GF DQT AYCG L +AAL C ++M L + +
Sbjct: 614 WDKDQDGMIENDGFADQTYDVWKMTGTSAYCGSLWLAALTCYIQM--LKKAGSPTKFYDE 671
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIA 197
KL A + Y LW G ++KFD +S +MADQLCG+ L A ++ +
Sbjct: 672 KLLMAYEAYTTKLWNGKFFKFDELPD-NSKIVMADQLCGFWALTA--MDEPVQVSEEKMK 728
Query: 198 RALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
AL TIF NV+ ++ G GAVNG + D S+IQSEEVW G+TY LSA M+ +G +
Sbjct: 729 SALETIFKYNVQMYDGGKCGAVNGYLTSERVDGSSIQSEEVWAGITYALSAMMIEKGMDE 788
Query: 258 EAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 314
A+ T+ GL+ +++ R L F+TPE +T D YR+ GYMR L+++A+Q A K K
Sbjct: 789 MAFKTSEGLFDSIWNRFPLQFQTPEAITSDGMYRALGYMRPLSIWAIQHALEKRNRK 845
>gi|20453181|gb|AAM19831.1| AT4g10060/T5L19_190 [Arabidopsis thaliana]
gi|24111431|gb|AAN46866.1| At4g10060/T5L19_190 [Arabidopsis thaliana]
Length = 922
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 113/287 (39%), Positives = 163/287 (56%), Gaps = 15/287 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN F+ WKDLN KF++ +YRD D F V+P+ + + FDK
Sbjct: 571 DPWLELNEYNFFNTDRWKDLNAKFVLQVYRDVVATNDQSFAKAVWPSVYTAVAYLDQFDK 630
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G+++N GFPDQT AYCGGL +AAL +S++ + F+AK
Sbjct: 631 DEDGMIENEGFPDQTYDAWSVTGVSAYCGGLWVAALQAASAFASIVGENAVAIYFNAKYE 690
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY + LW GSY+ +D S S S SI+ADQL G + +A G+ + I I +AL
Sbjct: 691 KAKIVY-EKLWNGSYFNYDDSGSGSSSSILADQLAGQWYARACGL--KPITKEEWIKKAL 747
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
TI+ NV + G+ GAVNGM G D +++ S+EVW G TY ++A M+ EG ++ +
Sbjct: 748 ETIYEFNVMKVKGGTRGAVNGMSTEGKVDTNSLVSKEVWAGTTYSVAACMIQEGQREKGF 807
Query: 261 TTAGGLYRTVYERTGL--GFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
TA G+Y ++ GL F+TPE + YRS YMR LA++A+Q
Sbjct: 808 QTASGIYEAIWSDRGLSCSFQTPEAWNMNDEYRSLCYMRPLAIWAIQ 854
>gi|240255774|ref|NP_192744.6| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|332657434|gb|AEE82834.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
Length = 922
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 113/287 (39%), Positives = 163/287 (56%), Gaps = 15/287 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN F+ WKDLN KF++ +YRD D F V+P+ + + FDK
Sbjct: 571 DPWLELNEYNFFNTDRWKDLNAKFVLQVYRDVVATNDQSFAKAVWPSVYTAVAYLDQFDK 630
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G+++N GFPDQT AYCGGL +AAL +S++ + F+AK
Sbjct: 631 DEDGMIENEGFPDQTYDAWSVTGVSAYCGGLWVAALQAASAFASIVGENAVAIYFNAKYE 690
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY + LW GSY+ +D S S S SI+ADQL G + +A G+ + I I +AL
Sbjct: 691 KAKIVY-EKLWNGSYFNYDDSGSGSSSSILADQLAGQWYARACGL--KPITKEEWIKKAL 747
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
TI+ NV + G+ GAVNGM G D +++ S+EVW G TY ++A M+ EG ++ +
Sbjct: 748 ETIYEFNVMKVKGGTRGAVNGMSTEGKVDTNSLVSKEVWAGTTYSVAACMIQEGQREKGF 807
Query: 261 TTAGGLYRTVYERTGL--GFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
TA G+Y ++ GL F+TPE + YRS YMR LA++A+Q
Sbjct: 808 QTASGIYEAIWSDRGLSCSFQTPEAWNMNDEYRSLCYMRPLAIWAIQ 854
>gi|159896834|ref|YP_001543081.1| hypothetical protein Haur_0301 [Herpetosiphon aurantiacus DSM 785]
gi|159889873|gb|ABX02953.1| protein of unknown function DUF608 [Herpetosiphon aurantiacus DSM
785]
Length = 774
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 108/293 (36%), Positives = 161/293 (54%), Gaps = 19/293 (6%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVY---PTCLELIRK 84
P+E+P N Y+ D+++WKDLNLK+I+ IYRD +L D L + PT LE + +
Sbjct: 480 PKEQPLIKTNAYDFQDINNWKDLNLKYILRIYRDVSLWNDQAMLEATWDTIPTALEYVHQ 539
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQ 134
FD G G++ + G DQT +Y L I AL + ++ + D Q
Sbjct: 540 ---FDSDGDGLLDHSG-ADQTYDTWAMSGAASYSASLLICALEAAIRLAQRMGDHAQADA 595
Query: 135 FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSA 194
+S L A Q + LW G+Y+++ ++ + + IMADQL G + A G+ A+ P
Sbjct: 596 WSEWLAAARQSFETKLWNGTYFRYHTADTDLREVIMADQLVGQWYAGAIGL--PAVAPRE 653
Query: 195 NIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
I AL T++ NV + G++GAVNGM P+G D S+ Q+ EVW+G TY ++A ML EG
Sbjct: 654 MIRSALQTVYRFNVMQYANGALGAVNGMHPDGTVDTSSNQASEVWSGTTYAIAAMMLQEG 713
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
E W TA G Y Y GL F TPE ++T+R+ YMR +++A++ A
Sbjct: 714 LDLEGWQTAWGAYNATYNELGLWFRTPEAWGIERTFRASMYMRPQSIWAIEHA 766
>gi|147788372|emb|CAN61187.1| hypothetical protein VITISV_019326 [Vitis vinifera]
Length = 900
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 119/324 (36%), Positives = 167/324 (51%), Gaps = 50/324 (15%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ D S WKDLN KF++ +YRD+A +D F V+P + E FD
Sbjct: 524 HDPWHEMNAYNIHDTSQWKDLNPKFVLQVYRDFAATRDFSFGADVWPAVRAAMEYMEQFD 583
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
+ G+++N GFPDQT AYCG L +AAL M+ L D ++ +K
Sbjct: 584 RDSDGLIENDGFPDQTYDTWTVHGISAYCGCLWLAALQAAAAMALQLGDKPFAEKCKSKF 643
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
KA V+ + LW GSY+ +DS S +S SI ADQL G + +SG+ ++F I +
Sbjct: 644 FKAKLVFEEKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASSGL--PSLFDDCKIKSS 701
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L I+ NV + G MGAVNGM PNG D S +QS E+WTGVTY ++A M+ G ++A
Sbjct: 702 LHKIYDFNVMKVKGGKMGAVNGMHPNGKVDESCMQSREIWTGVTYGVAATMILSGMEEQA 761
Query: 260 WTTAGGLYRTVYERTGLG--------------------------------------FETP 281
+TTA G++ + G G F+TP
Sbjct: 762 FTTAEGIFTAGWSEEGYGTLDFIDDMHCVVQILNCFVHSGLCVACVILLYCTCRYWFQTP 821
Query: 282 EGLTGDKTYRSGGYMRALAVYAMQ 305
EG T D +RS YMR LA++ MQ
Sbjct: 822 EGWTIDGHFRSLIYMRPLAIWGMQ 845
>gi|4539009|emb|CAB39630.1| putative protein [Arabidopsis thaliana]
gi|7267702|emb|CAB78129.1| putative protein [Arabidopsis thaliana]
Length = 750
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 113/287 (39%), Positives = 163/287 (56%), Gaps = 15/287 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN F+ WKDLN KF++ +YRD D F V+P+ + + FDK
Sbjct: 399 DPWLELNEYNFFNTDRWKDLNAKFVLQVYRDVVATNDQSFAKAVWPSVYTAVAYLDQFDK 458
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G+++N GFPDQT AYCGGL +AAL +S++ + F+AK
Sbjct: 459 DEDGMIENEGFPDQTYDAWSVTGVSAYCGGLWVAALQAASAFASIVGENAVAIYFNAKYE 518
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY + LW GSY+ +D S S S SI+ADQL G + +A G+ + I I +AL
Sbjct: 519 KAKIVY-EKLWNGSYFNYDDSGSGSSSSILADQLAGQWYARACGL--KPITKEEWIKKAL 575
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
TI+ NV + G+ GAVNGM G D +++ S+EVW G TY ++A M+ EG ++ +
Sbjct: 576 ETIYEFNVMKVKGGTRGAVNGMSTEGKVDTNSLVSKEVWAGTTYSVAACMIQEGQREKGF 635
Query: 261 TTAGGLYRTVYERTGL--GFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
TA G+Y ++ GL F+TPE + YRS YMR LA++A+Q
Sbjct: 636 QTASGIYEAIWSDRGLSCSFQTPEAWNMNDEYRSLCYMRPLAIWAIQ 682
>gi|297809209|ref|XP_002872488.1| AT4g10060/T5L19_190 [Arabidopsis lyrata subsp. lyrata]
gi|297318325|gb|EFH48747.1| AT4g10060/T5L19_190 [Arabidopsis lyrata subsp. lyrata]
Length = 921
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 112/287 (39%), Positives = 162/287 (56%), Gaps = 15/287 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN F+ WKDLN KF++ +YRD D F V+P+ + + FDK
Sbjct: 571 DPWLELNEYNFFNTDRWKDLNAKFVLQVYRDVVSTGDQSFAKAVWPSVYTAVAYLDQFDK 630
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
+++N GFPDQT AYCGGL +AAL +S++ + F+ K
Sbjct: 631 DEDEMIENEGFPDQTYDAWSVKGVSAYCGGLWVAALQAASAFASIVGENGVAIYFNTKYE 690
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA VY + LW GSY+ +D S S S SI+ADQL G + +A G+ + I I +AL
Sbjct: 691 KAKSVY-EKLWNGSYFNYDDSGSGSSSSILADQLAGQWYARACGL--KPITKEEWIKKAL 747
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
TI+ NV + G+ GAVNGM P+G D +++ S+EVW G TY ++A M+ EG + +
Sbjct: 748 ETIYEFNVMKVKGGTRGAVNGMSPDGQVDTNSLVSKEVWAGTTYSVAACMIQEGQRERGF 807
Query: 261 TTAGGLYRTVYERTGL--GFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
TA G+Y V+ GL F+TPE + YRS YMR LA++++Q
Sbjct: 808 QTASGIYEAVWSDRGLSCSFQTPEAWNMNDEYRSLCYMRPLAIWSIQ 854
>gi|268555576|ref|XP_002635777.1| Hypothetical protein CBG10433 [Caenorhabditis briggsae]
Length = 821
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 114/301 (37%), Positives = 163/301 (54%), Gaps = 21/301 (6%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL-----HKDIRFLTRVYPTCLELI 82
P +P+ N Y + D WKDLNLKF++S YRD+ + L C +++
Sbjct: 523 PMADPWIHTNAYILHDTGHWKDLNLKFVISCYRDWKMIAEGEEDSQEILEFFLGKCTKIV 582
Query: 83 R-KCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQ 131
E++DK G+++N GF DQT AYCG L IAAL ++M L
Sbjct: 583 DGALENWDKDKDGMIENDGFADQTYDVWKMTGTSAYCGSLWIAALTSYIQM--LKRAGIP 640
Query: 132 QQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIF 191
+ + KL A + Y LW G ++KFD +S +MADQLCG+ L A ++
Sbjct: 641 TKDYEEKLLMAYEAYTTKLWNGKFFKFDELPD-NSKIVMADQLCGFWALTA--MDEPIQV 697
Query: 192 PSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAML 251
I AL TIF NV ++ G GAVNG + D S+IQSEEVW G+TY LSA M+
Sbjct: 698 SEGKIQSALETIFKYNVEMYDGGKCGAVNGFLASERVDGSSIQSEEVWAGITYSLSAMMI 757
Query: 252 YEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKG 311
+G + A+ T+ GL+ +++ R L ++TPE +T D YR+ GYMR L+++A+Q A K
Sbjct: 758 EKGMDEMAFKTSEGLFHSIWNRFPLQYQTPEAITADGMYRALGYMRPLSIWAIQHALDKK 817
Query: 312 K 312
K
Sbjct: 818 K 818
>gi|22331303|ref|NP_189060.2| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|79313355|ref|NP_001030757.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|17529232|gb|AAL38843.1| unknown protein [Arabidopsis thaliana]
gi|110742225|dbj|BAE99039.1| hypothetical protein [Arabidopsis thaliana]
gi|332643347|gb|AEE76868.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|332643348|gb|AEE76869.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
Length = 950
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 118/313 (37%), Positives = 166/313 (53%), Gaps = 18/313 (5%)
Query: 9 EVKFRSAANTHERYYTRWFPEE----EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL 64
+VKF + N R P + +P++ +N YN+ D S WKDLN KF++ +YRD+A
Sbjct: 587 KVKFLAEGNVGIRKVRGAVPHDLGMHDPWNEMNAYNIHDTSRWKDLNPKFVLQVYRDFAA 646
Query: 65 HKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIA 114
D +F V+P + E FD+ +++N GFPDQT AYCG L +A
Sbjct: 647 TGDYQFGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLA 706
Query: 115 ALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQL 174
AL M+ + D + K A LW GSY+ +DS S +S SI DQL
Sbjct: 707 ALQAAAAMALQIGDKFFAELCKNKFLNAKAALETKLWNGSYFNYDSGSSSNSKSIQTDQL 766
Query: 175 CGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQ 234
G + +SG+ +F + I + IF NV + G MGAVNGM P+G D + +Q
Sbjct: 767 AGQWYAASSGL--PPLFEESKIRSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDTCMQ 824
Query: 235 SEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRS 292
S E+WTGVTY +A M+ G ++ +TTA G++ + G G F+TPEG T D YRS
Sbjct: 825 SREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGWSEEGFGYWFQTPEGWTMDGHYRS 884
Query: 293 GGYMRALAVYAMQ 305
YMR LA++ MQ
Sbjct: 885 LIYMRPLAIWGMQ 897
>gi|11994237|dbj|BAB01359.1| unnamed protein product [Arabidopsis thaliana]
Length = 937
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 118/313 (37%), Positives = 166/313 (53%), Gaps = 18/313 (5%)
Query: 9 EVKFRSAANTHERYYTRWFPEE----EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL 64
+VKF + N R P + +P++ +N YN+ D S WKDLN KF++ +YRD+A
Sbjct: 574 KVKFLAEGNVGIRKVRGAVPHDLGMHDPWNEMNAYNIHDTSRWKDLNPKFVLQVYRDFAA 633
Query: 65 HKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIA 114
D +F V+P + E FD+ +++N GFPDQT AYCG L +A
Sbjct: 634 TGDYQFGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLA 693
Query: 115 ALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQL 174
AL M+ + D + K A LW GSY+ +DS S +S SI DQL
Sbjct: 694 ALQAAAAMALQIGDKFFAELCKNKFLNAKAALETKLWNGSYFNYDSGSSSNSKSIQTDQL 753
Query: 175 CGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQ 234
G + +SG+ +F + I + IF NV + G MGAVNGM P+G D + +Q
Sbjct: 754 AGQWYAASSGL--PPLFEESKIRSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDTCMQ 811
Query: 235 SEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRS 292
S E+WTGVTY +A M+ G ++ +TTA G++ + G G F+TPEG T D YRS
Sbjct: 812 SREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGWSEEGFGYWFQTPEGWTMDGHYRS 871
Query: 293 GGYMRALAVYAMQ 305
YMR LA++ MQ
Sbjct: 872 LIYMRPLAIWGMQ 884
>gi|374311163|ref|YP_005057593.1| glucosylceramidase [Granulicella mallensis MP5ACTX8]
gi|358753173|gb|AEU36563.1| Glucosylceramidase [Granulicella mallensis MP5ACTX8]
Length = 811
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 110/290 (37%), Positives = 150/290 (51%), Gaps = 17/290 (5%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
PE +PF +N D +DWKDLN KF++ +YRDY L KD FL +P I
Sbjct: 520 PEGDPFVAVNEPGWQDTNDWKDLNSKFVLMVYRDYVLTGRKDTAFLRETWPAVKAAIEYL 579
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
FD G G+ +N G+PDQT AY GGL +AAL E + ++ DT+ ++
Sbjct: 580 RQFDHGG-GVPENSGYPDQTYDDWVVRGVSAYSGGLWLAALRAGEETARVVGDTKTTAEY 638
Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
A K + Y LW G Y+++D+S S D I ADQL G + +G+ I P
Sbjct: 639 HALFLKGQKTYISQLWNGEYFRYDTS-SESKDDIQADQLAGQWYANLTGLGE--IVPHTM 695
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
A IF NV F G MGA NGM +G +A +++EVW G T + ++ EG
Sbjct: 696 QVSAAKKIFDVNVMKFGHGEMGAANGMTADGAILTNA-EAKEVWVGTTLGYAGLLMQEGM 754
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
DEAW T GLY +YE G F TPE +R+G YMR A++A++
Sbjct: 755 NDEAWKTTRGLYHVIYEDKGYWFRTPEAWDITGNFRAGMYMRPTAIWALE 804
>gi|414871105|tpg|DAA49662.1| TPA: hypothetical protein ZEAMMB73_422318 [Zea mays]
Length = 589
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 105/259 (40%), Positives = 145/259 (55%), Gaps = 12/259 (4%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ D S WKDLN KF++ IYRD+A D++F V+P + + FD
Sbjct: 309 HDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAATGDMQFGRDVWPAVCAAMDYMDQFD 368
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
+ G G+++N GFPDQT AYCG L +AAL M+ L D +++ K
Sbjct: 369 RDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRHFAEKYKLKF 428
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
KA VY LW GSY+ +DS S +S SI ADQL G + +SG+ +F I A
Sbjct: 429 IKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPLFDEHKIRTA 486
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L IF NV + G MGAVNGM P G D + +QS E+WTGVTY ++A ML G +
Sbjct: 487 LQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLLHGMEHQG 546
Query: 260 WTTAGGLYRTVYERTGLGF 278
+TTA G++ + G G+
Sbjct: 547 FTTAEGIFTAGWSEEGYGY 565
>gi|392918721|ref|NP_504047.2| Protein R08F11.1 [Caenorhabditis elegans]
gi|387910741|emb|CCD72306.2| Protein R08F11.1 [Caenorhabditis elegans]
Length = 830
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 111/292 (38%), Positives = 162/292 (55%), Gaps = 20/292 (6%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL----HKDIRFLTRVYPTCLELIR-KC 85
+P+ N Y + D WKDLNLKF++S YRD+ L + + L C +++
Sbjct: 537 DPWIHTNAYILHDTGRWKDLNLKFVISCYRDWKLIELGSEKGQVLEFFLGKCTKIVDGAL 596
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
E +DK G+++N GF DQT AYCG L IAAL+ +EM L + +
Sbjct: 597 ECWDKDNDGMIENDGFADQTYDVWKMTGTSAYCGSLWIAALSSYIEM--LKQSGLPTKHY 654
Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
KL A Y LW G+++KFD +S +MADQLCG F + ++
Sbjct: 655 EEKLEMAYDAYIGKLWNGTFFKFDE-LPENSKIVMADQLCG--FWAMTAMDEPVQISKDK 711
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
+ AL TIF NV+ + G GAVNG + D S+IQSEEVW G+TY LSA M+ +G
Sbjct: 712 MKSALDTIFKYNVQMYNNGRCGAVNGYLTSERVDGSSIQSEEVWAGITYALSAMMIEKGM 771
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
++A+ T+ GL+ +++ R L ++TPE +T D YR+ GYMR L+++A+Q A
Sbjct: 772 DEQAFKTSEGLFESIWHRFPLQYQTPEAITSDGMYRALGYMRPLSIWAIQHA 823
>gi|224028315|gb|ACN33233.1| unknown [Zea mays]
Length = 384
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 105/259 (40%), Positives = 145/259 (55%), Gaps = 12/259 (4%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFD 89
+P+ +N YN+ D S WKDLN KF++ IYRD+A D++F V+P + + FD
Sbjct: 104 HDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAATGDMQFGRDVWPAVCAAMDYMDQFD 163
Query: 90 KQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKL 139
+ G G+++N GFPDQT AYCG L +AAL M+ L D +++ K
Sbjct: 164 RDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRHFAEKYKLKF 223
Query: 140 TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
KA VY LW GSY+ +DS S +S SI ADQL G + +SG+ +F I A
Sbjct: 224 IKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL--PPLFDEHKIRTA 281
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
L IF NV + G MGAVNGM P G D + +QS E+WTGVTY ++A ML G +
Sbjct: 282 LQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLLHGMEHQG 341
Query: 260 WTTAGGLYRTVYERTGLGF 278
+TTA G++ + G G+
Sbjct: 342 FTTAEGIFTAGWSEEGYGY 360
>gi|357144426|ref|XP_003573288.1| PREDICTED: non-lysosomal glucosylceramidase-like isoform 1
[Brachypodium distachyon]
Length = 932
Score = 187 bits (474), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 105/287 (36%), Positives = 162/287 (56%), Gaps = 15/287 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N Y + D + WKDLN KF++ +YR + + F +P + + +D+
Sbjct: 585 DPWFELNAYMIHDPARWKDLNPKFVLQVYRAVVVTGNAAFAKAAWPAVYLAMAYMDQYDR 644
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+V+N G PDQT AY GG+ + AL M+ ++ D + F +
Sbjct: 645 DGDGMVENEG-PDQTYDLWSVTGVSAYTGGIWVTALQATAAMARIVGDGDAECYFHGRYL 703
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
+A QVY LW G+Y+++D+S S+SIMADQL G + G+ E + AL
Sbjct: 704 RAKQVYDAELWNGTYFRYDNSGGETSESIMADQLAGQWYAHVCGL--EPVVEEDKARSAL 761
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
AT+ NV + G++GAVNGM+P+G D S+ QS+E+W+G TY ++AAM++EG + A+
Sbjct: 762 ATVLDYNVMRVKGGAVGAVNGMRPDGGIDMSSTQSKEIWSGTTYAVAAAMVHEGMPEGAF 821
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
TA G + + + G G F+TPE T + YR YMR L+V+AMQ
Sbjct: 822 RTAKGAHDASWGKAGFGYAFQTPEAWTAEGGYRGLHYMRPLSVWAMQ 868
>gi|357144429|ref|XP_003573289.1| PREDICTED: non-lysosomal glucosylceramidase-like isoform 2
[Brachypodium distachyon]
Length = 919
Score = 186 bits (473), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 105/287 (36%), Positives = 162/287 (56%), Gaps = 15/287 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N Y + D + WKDLN KF++ +YR + + F +P + + +D+
Sbjct: 572 DPWFELNAYMIHDPARWKDLNPKFVLQVYRAVVVTGNAAFAKAAWPAVYLAMAYMDQYDR 631
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+V+N G PDQT AY GG+ + AL M+ ++ D + F +
Sbjct: 632 DGDGMVENEG-PDQTYDLWSVTGVSAYTGGIWVTALQATAAMARIVGDGDAECYFHGRYL 690
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
+A QVY LW G+Y+++D+S S+SIMADQL G + G+ E + AL
Sbjct: 691 RAKQVYDAELWNGTYFRYDNSGGETSESIMADQLAGQWYAHVCGL--EPVVEEDKARSAL 748
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
AT+ NV + G++GAVNGM+P+G D S+ QS+E+W+G TY ++AAM++EG + A+
Sbjct: 749 ATVLDYNVMRVKGGAVGAVNGMRPDGGIDMSSTQSKEIWSGTTYAVAAAMVHEGMPEGAF 808
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
TA G + + + G G F+TPE T + YR YMR L+V+AMQ
Sbjct: 809 RTAKGAHDASWGKAGFGYAFQTPEAWTAEGGYRGLHYMRPLSVWAMQ 855
>gi|242080271|ref|XP_002444904.1| hypothetical protein SORBIDRAFT_07g001140 [Sorghum bicolor]
gi|241941254|gb|EES14399.1| hypothetical protein SORBIDRAFT_07g001140 [Sorghum bicolor]
Length = 929
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 112/288 (38%), Positives = 165/288 (57%), Gaps = 15/288 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N Y + D + WKDLN KF++ +YRD D F +P + FD+
Sbjct: 575 DPWVEMNEYMLHDPALWKDLNPKFVLQVYRDAVATGDAAFAEAAWPAVYIAMAYMHQFDR 634
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+V+N G PDQT AY GGL +AAL M+ ++ D + F +
Sbjct: 635 DGDGMVENEGIPDQTYDIWSVSGVSAYTGGLWVAALEAAAGMARVVGDRHAEAYFRGRRD 694
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRH-SDSIMADQLCGYLFLKASGVNTEAIFPSANIARA 199
+A++VY LW G+Y+++D+S S+SIMADQ+ G + +A G+ E + A
Sbjct: 695 RAARVYDAELWNGTYFRYDNSGGGACSESIMADQMAGQWYARACGM--EPVVEEGKARSA 752
Query: 200 LATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
LAT+ NV + G++GAVNGM+P+G D S+ QS+EVW GVTY ++AAM++EG + A
Sbjct: 753 LATVLDYNVMRVKGGAVGAVNGMRPDGAVDASSGQSKEVWPGVTYAVAAAMVHEGMHEAA 812
Query: 260 WTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+ TA G + + + G G F+TPE T D YRS YMR L ++AMQ
Sbjct: 813 FRTAKGAHDAGWGKDGFGYAFQTPEAWTEDGGYRSLHYMRPLGIWAMQ 860
>gi|326507888|dbj|BAJ86687.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 676
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 107/273 (39%), Positives = 153/273 (56%), Gaps = 14/273 (5%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N Y + D + WKDLN KF++ +YRD A D F T V+P + + FD+
Sbjct: 348 DPWFQLNAYMIHDPARWKDLNTKFVLQVYRDGAATGDAAFATAVWPAVYLAMAYMDQFDR 407
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+V+N G PDQT AY GGL +AAL M+ ++ D + F +
Sbjct: 408 DGDGMVENEGRPDQTYDFWSVSGVSAYTGGLWVAALQAAAVMARVVGDRGSEGYFVERYE 467
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
KA +VY LW G+Y+ +D+S +S SIMADQL G + +A G+ E I AL
Sbjct: 468 KARRVYDGELWNGAYFDYDNSGGTNSKSIMADQLAGQWYARACGL--EPIVEEEKARSAL 525
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
T+ NV + G++GAVNGM+P+G D S+IQS EVW GVTY ++AAM +EG + A+
Sbjct: 526 GTVLDYNVMRVQGGAIGAVNGMRPDGTVDTSSIQSREVWPGVTYGVAAAMAHEGMPEAAF 585
Query: 261 TTAGGLYRTVYERTGLGFETPE--GLTGDKTYR 291
TA G + + R G G P+ G+ G + R
Sbjct: 586 RTAKGAHDAGWGRDGFGVRVPDAGGVDGRRRQR 618
>gi|94968715|ref|YP_590763.1| hypothetical protein Acid345_1688 [Candidatus Koribacter versatilis
Ellin345]
gi|94550765|gb|ABF40689.1| protein of unknown function DUF608 [Candidatus Koribacter
versatilis Ellin345]
Length = 811
Score = 183 bits (465), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 104/290 (35%), Positives = 159/290 (54%), Gaps = 15/290 (5%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
P E+ L N YN VS WKDLN K+++ ++RDY + KD FL Y + ++
Sbjct: 519 PLEDALILPNQYNYQSVSFWKDLNSKYVLMVWRDYVMSGSKDKAFLQYNYNAVKQSMQFL 578
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
FDK G+++N G+PDQT AY G L++AAL EM+ LL D +
Sbjct: 579 RQFDKNNDGLIENDGYPDQTYDNWSARGESAYSGSLYLAALRATEEMAKLLGDQRTAADT 638
Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
+A +A + LW G+Y+ +D S + D+IMA+QL G + +G+ + P
Sbjct: 639 AALFKRAQDSFVKKLWNGTYFNYDVGSS-YKDAIMAEQLAGQWYASLTGLGD--LVPRDM 695
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
AL ++ NV + G+MGA+NG+ +G+ + Q+EEVWTGVT+ ++A ML G
Sbjct: 696 QHSALKKVYDFNVMKLQNGTMGALNGISASGEVLKDNEQTEEVWTGVTFAVAATMLQNGL 755
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+E + TA G+Y VY++ G F TPE + YR+ YMR A+++M+
Sbjct: 756 REEGFNTAKGVYNVVYDQKGYWFRTPEAYDTNGMYRASMYMRPGAIWSME 805
>gi|324503709|gb|ADY41606.1| Non-lysosomal glucosylceramidase [Ascaris suum]
Length = 261
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 100/249 (40%), Positives = 137/249 (55%), Gaps = 12/249 (4%)
Query: 70 FLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM 119
L R Y +I +++D G +++N G PDQT AYCGGL + AL C+
Sbjct: 5 LLRRFYDLSSGIIADAKAWDVDGDDLIENAGQPDQTYDVWSMHGSSAYCGGLWLCALECV 64
Query: 120 VEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLF 179
M+ L + Q+F+ KL A + Y LW G Y+ FD + H SIMADQLCG+ F
Sbjct: 65 RRMALTLGEVVDAQKFANKLNNARKAYERKLWNGKYFDFDEHSTDHK-SIMADQLCGFWF 123
Query: 180 LKAS-GVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEV 238
+ + G + I I +L TIF NV F G +G VN M P+G D + IQSEEV
Sbjct: 124 MCITDGKVDDVIITRQQICASLKTIFEYNVEKFANGQLGPVNAMMPSGVVDSTGIQSEEV 183
Query: 239 WTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRA 298
W GV Y L++ L + A+ TA G YR+ +ER GL +++PE + YR+ GYMR
Sbjct: 184 WGGVAYALASFHLLVEENESAFKTAEGWYRSCWERYGLQYQSPEAINESSYYRAIGYMRP 243
Query: 299 LAVYAMQDA 307
LA++AMQ A
Sbjct: 244 LAIWAMQSA 252
>gi|344251774|gb|EGW07878.1| Non-lysosomal glucosylceramidase [Cricetulus griseus]
Length = 226
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 88/184 (47%), Positives = 119/184 (64%), Gaps = 4/184 (2%)
Query: 122 MSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLK 181
M+ L Q +FS+ L + + Y LLW G YY +DSS S SIM+DQ G FL+
Sbjct: 1 MAVLCGAQDVQDKFSSILCRGREAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLR 60
Query: 182 ASGV---NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEV 238
A G+ +TE +FP+ ++ RAL TIF NV+ F G+MGAVNGM+P+G DRS++QS+EV
Sbjct: 61 ACGLGEGDTE-VFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEV 119
Query: 239 WTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRA 298
W GV Y L+A M+ EG E + TA G YRTV+ER GL F+TPE + +RS YMR
Sbjct: 120 WVGVVYGLAATMIQEGLTREGFRTAEGCYRTVWERLGLAFQTPEAYCQQQVFRSLAYMRP 179
Query: 299 LAVY 302
L+++
Sbjct: 180 LSIW 183
>gi|22297737|ref|NP_680984.1| hypothetical protein tlr0193 [Thermosynechococcus elongatus BP-1]
gi|22293914|dbj|BAC07746.1| tlr0193 [Thermosynechococcus elongatus BP-1]
Length = 806
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 107/290 (36%), Positives = 153/290 (52%), Gaps = 19/290 (6%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
P E P+ N D + WKDL F++ +YRD+ D+ F +P + +
Sbjct: 513 PNEHPWEKTNYTAYQDCNLWKDLASDFVLLVYRDFLFTGGTDLNFARECWPAVVAALDHL 572
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
+ FD+ G G+ +NGG PDQT AYCGGL +AAL + + +LL Q + +
Sbjct: 573 KQFDQDGDGLPENGGAPDQTYDDWKLQGVSAYCGGLWLAALEAAIALGTLLQQPQVEI-Y 631
Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
L++A YH LLW G YY+ D+ SD IMADQLCG + + G+ I P
Sbjct: 632 RQWLSQARPRYHQLLWNGEYYRLDTGSG--SDVIMADQLCGQFYAQLLGLVD--IVPPDC 687
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYLLSAAMLYEG 254
RAL I+ T F G GA NG+ PNG + A EVWTG+ + L+A + G
Sbjct: 688 CDRALRKIYDTCFLKFHNGQFGAANGLLPNGQPENPHATHPLEVWTGINFGLAAFLWQRG 747
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+DEAW A + R +YE GL F TPE +T + T+R+ Y+R +A++A+
Sbjct: 748 MIDEAWRLAEVVVRQIYE-NGLQFRTPEAITANGTFRACMYLRPMAIWAL 796
>gi|257060089|ref|YP_003137977.1| glucosylceramidase [Cyanothece sp. PCC 8802]
gi|256590255|gb|ACV01142.1| Glucosylceramidase [Cyanothece sp. PCC 8802]
Length = 805
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 105/307 (34%), Positives = 155/307 (50%), Gaps = 35/307 (11%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L D FL +P E +
Sbjct: 495 PNEHPWQKTNYTSYQDCNLWKDLGSDFVLQVYRDYLLTGSDDTDFLWECWPAITETLDYL 554
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSS-LLNDTQQQQQ 134
++FD GI +N G PDQT AYCGGL IAAL ++++ LL + ++
Sbjct: 555 KTFDLDNDGIPENSGAPDQTFDDWKLQGISAYCGGLWIAALEAAIKIAEILLKNVPTTEE 614
Query: 135 FSAK----------------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
++ L ++ +YHD LW G YYK DS SD +MADQLCG
Sbjct: 615 LQSRNNPESIKHYVKNHRDWLEQSRSIYHDTLWNGEYYKLDSQSG--SDVVMADQLCGQF 672
Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEE 237
+ A +N + + AL ++ F+ G GA NGMKP+G D ++ +E
Sbjct: 673 Y--ARLLNLPDVVETQYTESALNKVYEACFLKFQDGKYGAANGMKPDGTPEDPNSTHPQE 730
Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
VWTG+ + L+A +L G D A+ + + VYE GL F TPE +T T+R+ Y+R
Sbjct: 731 VWTGINFGLAAFLLQMGRKDAAFKLTEAVVKQVYE-NGLQFRTPEAITAVGTFRASHYLR 789
Query: 298 ALAVYAM 304
A+A++ +
Sbjct: 790 AMAIWGI 796
>gi|357139475|ref|XP_003571307.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
distachyon]
Length = 946
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 116/292 (39%), Positives = 165/292 (56%), Gaps = 19/292 (6%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N Y + D S WKDLN KF++ +YRD A D+ F +P+ + + FD+
Sbjct: 586 DPWHELNAYMIHDPSRWKDLNPKFVLQVYRDVAATGDLDFARSAWPSVYVALAYMDQFDR 645
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G+V+N G PDQT AY GGL +AAL M++L+ D + F +
Sbjct: 646 DRDGMVENEGRPDQTYDLWSVSGVSAYTGGLWVAALRAATAMAALVGDLPAEAVFLERYN 705
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
+A++VY LWTG Y+++D+S +S+S+MADQL G + +A G+ E + RAL
Sbjct: 706 RANKVYDSELWTGDYFRYDNSGGGNSESVMADQLAGQWYARACGL--EPVVGRDKARRAL 763
Query: 201 ATIFSTNVRGFEAGSMGAVNGMK-PN--GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
A + NV + G +GAVNG + P G D S+ QS+EVWTG TY ++AAM+ EG +
Sbjct: 764 AAVLEHNVMQVQGGGVGAVNGARLPEHGGGVDESSTQSKEVWTGTTYAVAAAMIGEGMRE 823
Query: 258 EAWTTAGGLYRTVYERTGLG--FETPEGLTGDKT--YRSGGYMRALAVYAMQ 305
E +T A G Y + G G F+ PE T D YRS YMR LAV+AMQ
Sbjct: 824 EGFTAAKGAYGAGWGEDGYGYAFQMPESWTADGAGGYRSLHYMRPLAVWAMQ 875
>gi|218247014|ref|YP_002372385.1| hypothetical protein PCC8801_2201 [Cyanothece sp. PCC 8801]
gi|218167492|gb|ACK66229.1| protein of unknown function DUF608 [Cyanothece sp. PCC 8801]
Length = 805
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 104/307 (33%), Positives = 155/307 (50%), Gaps = 35/307 (11%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L D FL +P E +
Sbjct: 495 PNEHPWQKTNYTSYQDCNLWKDLGSDFVLQVYRDYLLTGSDDTDFLWECWPAITETLDYL 554
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSS-LLNDTQQQQQ 134
++FD GI +N G PDQT AYCGGL IAAL ++++ LL + ++
Sbjct: 555 KTFDLDNDGIPENSGAPDQTFDDWKLQGISAYCGGLWIAALEAAIKIAEILLKNVPTTEE 614
Query: 135 FSAK----------------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
++ L ++ +YHD LW G YYK DS SD +MADQLCG
Sbjct: 615 LQSRNNPESIKHYVKNHRDWLEQSRSIYHDTLWNGEYYKLDSQSG--SDVVMADQLCGQF 672
Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEE 237
+ + +N + + AL ++ F+ G GA NGMKP+G D ++ +E
Sbjct: 673 YARL--LNFPDVVETQYTESALNKVYEACFLKFQDGKYGAANGMKPDGTPEDPNSTHPQE 730
Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
VWTG+ + L+A +L G D A+ + + VYE GL F TPE +T T+R+ Y+R
Sbjct: 731 VWTGINFGLAAFLLQMGRKDAAFKLTEAVVKQVYE-NGLQFRTPEAITAVGTFRASHYLR 789
Query: 298 ALAVYAM 304
A+A++ +
Sbjct: 790 AMAIWGI 796
>gi|37522029|ref|NP_925406.1| hypothetical protein gll2460 [Gloeobacter violaceus PCC 7421]
gi|35213028|dbj|BAC90401.1| gll2460 [Gloeobacter violaceus PCC 7421]
Length = 867
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 111/292 (38%), Positives = 152/292 (52%), Gaps = 19/292 (6%)
Query: 28 PEEEPFSLINGYNVF-DVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTC-LELIR 83
P E+P+ + N Y + D + WKDL F++ +YRDY +D+ F+ +P L+R
Sbjct: 564 PNEDPW-IKNNYTTYQDCNLWKDLPCDFVLQVYRDYVDTGKRDLGFVRYCWPASKAALVR 622
Query: 84 KCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQ 133
+FD+ G G +NGG PD T AYCGGL +AAL VEM L+ D Q
Sbjct: 623 LKTTFDQDGDGFPENGGPPDCTYDAWPLKGISAYCGGLWLAALLAAVEMGKLVGDAAAVQ 682
Query: 134 QFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPS 193
QF A +A +Y LW G YY+ D SQS++S++IMADQLCG + A N I P
Sbjct: 683 QFGAWYAQAQPLYEKTLWNGRYYRID-SQSKNSEAIMADQLCGEYY--AQVCNLADIVPE 739
Query: 194 ANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYE 253
AL T++ T F G G NG +G EVWTG+ + L+A M+
Sbjct: 740 PQARSALETVYQTCFVKFYGGRFGCANGTNADGSFIGDTEHPSEVWTGINFGLAAFMIRN 799
Query: 254 GNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
G E A + VY GL F TPE LT +T+R+ Y+R +A++AMQ
Sbjct: 800 GMRREGMAIAEAVVANVYG-GGLQFRTPEALTPARTFRACMYLRPMAIWAMQ 850
>gi|290990718|ref|XP_002677983.1| predicted protein [Naegleria gruberi]
gi|284091593|gb|EFC45239.1| predicted protein [Naegleria gruberi]
Length = 937
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 99/313 (31%), Positives = 163/313 (52%), Gaps = 27/313 (8%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPT-CLELIRKCE 86
P E P+ L+N YN DV+ WKDLN +FI++I+RD+ + K+ FL +P+ L L
Sbjct: 625 PYERPWKLVNAYNFQDVNKWKDLNSQFILTIFRDFMITKNREFLEEAFPSIILALNYSLT 684
Query: 87 SFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFS 136
+FDK G G+++N FPD T AY G L IA+L ++EM+ L+D +Q+
Sbjct: 685 NFDKDGDGLIENESFPDTTYDAWKVTGVSAYSGMLWIASLKAILEMAKELDDENTRQKVE 744
Query: 137 AKLTKASQVYHDLLW--TGSYYKFDSSQSRHSDSIMADQLCGYLFL-------KASGVNT 187
+ + + LW T YY +D+S DSIM+D L G L K SG
Sbjct: 745 PLIESGLKTFESKLWDETNQYYHYDASTEPQHDSIMSDHLHGTFMLLLIGNYVKNSGRCE 804
Query: 188 EA----IFPSANIARALATIFSTNVRGFEAGSM--GAVNGMKPNGDRDRSAIQSEEVWTG 241
++ F ++++L I N ++ + G VNGM+P + D++++QS E+WTG
Sbjct: 805 DSHYLIPFNMERVSKSLNKILENNFVKYQQITKLGGCVNGMRPTQEVDKTSLQSREMWTG 864
Query: 242 VTYLLSAAMLYEGNVDEAWTTAGGLYRTVY-ERTGLGFETPEGLTGDKTYRSGGYMRALA 300
+Y+++A + G +E ++ + + F+TPE +T YR+ GY R L+
Sbjct: 865 TSYVVAALTILLGRREEGIDLMKSVFDKCWSSNSAFWFQTPEAVTETGEYRALGYCRPLS 924
Query: 301 VYAMQDAYLKGKV 313
+++ A + K+
Sbjct: 925 IWSCLSAMMNKKI 937
>gi|332711224|ref|ZP_08431157.1| putative bile acid beta-glucosidase [Moorea producens 3L]
gi|332350038|gb|EGJ29645.1| putative bile acid beta-glucosidase [Moorea producens 3L]
Length = 815
Score = 170 bits (431), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 155/307 (50%), Gaps = 35/307 (11%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHK--DIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRD+ + DI FL + + ++ +
Sbjct: 505 PNEHPWQQTNYTSYQDCNQWKDLASDFVLQVYRDFVFYSATDIEFLWECWSSIVKALAYL 564
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLN-------- 127
+ FD+ G GI +N G PDQT AYCGGL +AAL + + +L
Sbjct: 565 KRFDQDGDGIPENYGAPDQTFDDWRLQGVSAYCGGLWLAALESAIAIGEILTKNYPPTSP 624
Query: 128 -------DTQQQ--QQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
D+ Q + L +A +Y + LW G YY+ DS SD +MADQLCG
Sbjct: 625 FVEVPELDSIQDTIDCYRQWLAQAMPIYQEKLWNGQYYQLDSESG--SDVVMADQLCGQF 682
Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEE 237
+ + G+ I P AL T++ F G +GAVNG++P+G D A E
Sbjct: 683 YAQLLGLPD--IVPPECTLSALKTVYDACFLKFHQGQLGAVNGVRPDGTPEDPDATHPME 740
Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
VWTG+ + L+A ++ +G DEA + R VY+ GL F TPE +T T+R+ Y+R
Sbjct: 741 VWTGINFGLAAFLIQQGMKDEALRITEAVVRQVYDH-GLQFRTPEAITAAGTFRASHYLR 799
Query: 298 ALAVYAM 304
A+ ++A+
Sbjct: 800 AMGIWAV 806
>gi|425455967|ref|ZP_18835678.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9807]
gi|389803038|emb|CCI17984.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9807]
Length = 784
Score = 170 bits (431), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 100/290 (34%), Positives = 151/290 (52%), Gaps = 20/290 (6%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L KD FL +P+ + ++
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
++FD GI +N G PDQT AYCGGL IAAL +++ ++L + F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATLKIGAILG--EDTAIF 610
Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
++ L ++ +YHD LW G YY DS SD +M DQLCG + + ++ + +
Sbjct: 611 ASWLQQSRAIYHDTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYARL--LSLPDVVENQY 666
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
AL+ I+ F G GA NG+KP+G + A EVW G+ + L A +L G
Sbjct: 667 TQSALSKIYEACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMG 726
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ W + R +YE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVRQIYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|428308994|ref|YP_007119971.1| bile acid beta-glucosidase [Microcoleus sp. PCC 7113]
gi|428250606|gb|AFZ16565.1| putative bile acid beta-glucosidase [Microcoleus sp. PCC 7113]
Length = 819
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 99/307 (32%), Positives = 155/307 (50%), Gaps = 35/307 (11%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRD+ D FL + + ++ +
Sbjct: 512 PNEHPWEQTNYTSYQDCNQWKDLPCDFVLQVYRDFVFTGANDTEFLWECWSSVVKTLAYL 571
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL--------- 126
++FDK G GI +N G PDQT AYCGGL +AAL + M +L
Sbjct: 572 KTFDKDGDGIPENSGAPDQTFDDWRLQGISAYCGGLWLAALEAAIAMGEILINHYPPTSE 631
Query: 127 ----NDTQQQQQ----FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
D + Q+ + L ++ +Y + LW G YY+ DS SD +M DQLCG
Sbjct: 632 LIEVPDPESIQETLKLYRQWLEQSQPIYQEKLWNGQYYRLDSESG--SDIVMTDQLCGQF 689
Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEE 237
+ + G+ I P + AL +++ + F+ +GA NG++P+G + A E
Sbjct: 690 YAQLLGLPD--IVPPECVQSALKSVYDSCFLKFQDAQLGAANGVRPDGSPENPDATHPLE 747
Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
VWTG+ + L+A ++ +G EA + R VYE GL F TPE +T T+R+ Y+R
Sbjct: 748 VWTGINFGLAAFLVQQGMKQEALQLTEAVVRQVYE-NGLQFRTPEAITAAGTFRASHYLR 806
Query: 298 ALAVYAM 304
A+A++A+
Sbjct: 807 AMAIWAV 813
>gi|425471589|ref|ZP_18850441.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9701]
gi|389882505|emb|CCI37035.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9701]
Length = 784
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 99/290 (34%), Positives = 149/290 (51%), Gaps = 20/290 (6%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L KD FL +P+ + ++
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
++FD GI +N G PDQT AYCGGL IAAL +++ ++L + + F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATIKIGAILG--EDTEIF 610
Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
++ L ++ +YH LW G YY DS SD +M DQLCG + + + + +
Sbjct: 611 ASWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYARLLAL--PDVVENQY 666
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
AL I+ F G GA NG+KP+G + A EVW G+ + L A +L G
Sbjct: 667 TQSALGKIYQACFLKFHGGKFGAANGLKPDGTPENPQATHPLEVWVGINFGLVAFLLQMG 726
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ W + + VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVKQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|425450263|ref|ZP_18830094.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 7941]
gi|389768962|emb|CCI06046.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 7941]
Length = 784
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 99/290 (34%), Positives = 149/290 (51%), Gaps = 20/290 (6%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L KD FL +P+ + ++
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
++FD GI +N G PDQT AYCGGL IAAL +++ ++L + F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATIKIGAILG--EDTAIF 610
Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
++ L ++ +YH LW G YY DS SD +M DQLCG + + + + +
Sbjct: 611 ASWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYARLLAL--PDVVENQY 666
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
AL+ I+ F G GA NG+KP+G + A EVW G+ + L A +L G
Sbjct: 667 TQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMG 726
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ W + R +YE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 727 MEKQGWQITQAVVRQIYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|422303239|ref|ZP_16390593.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9806]
gi|389791807|emb|CCI12404.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9806]
Length = 784
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 98/290 (33%), Positives = 150/290 (51%), Gaps = 20/290 (6%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L KD FL +P+ + ++
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
++FD GI +N G PDQT AYCGGL IAAL +++ ++ + F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATIKIGEIIG--EDTAIF 610
Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
++ L ++ +YH LW G YY DS SD +M DQLCG + + ++ + +
Sbjct: 611 TSWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYARL--LSLPDVVENQY 666
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
AL+ I+ + F G GA NG+KP+G + A EVW G+ + L A ++ G
Sbjct: 667 TQSALSKIYQSCFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLMQMG 726
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ W + R VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVRQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|425435596|ref|ZP_18816046.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9432]
gi|389679839|emb|CCH91411.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9432]
Length = 784
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 99/290 (34%), Positives = 149/290 (51%), Gaps = 20/290 (6%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L KD FL +P+ + ++
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
++FD GI +N G PDQT AYCGGL IAAL +++ ++L + F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATLKIGAILG--EDTAIF 610
Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
++ L ++ +YH LW G YY DS SD +M DQLCG + + + + +
Sbjct: 611 ASWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYARLLAL--PDVVENQY 666
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
AL+ I+ F G GA NG+KP+G + A EVW G+ + L A +L G
Sbjct: 667 TQSALSKIYEACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMG 726
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ W + R +YE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVRQIYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|427727368|ref|YP_007073605.1| putative bile acid beta-glucosidase [Nostoc sp. PCC 7524]
gi|427363287|gb|AFY46008.1| putative bile acid beta-glucosidase [Nostoc sp. PCC 7524]
Length = 803
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 100/297 (33%), Positives = 153/297 (51%), Gaps = 25/297 (8%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E + N + D + WKDL F++ +YRDY D+ FL + ++ +
Sbjct: 504 PNEHVWQKTNYTSYQDCNLWKDLGCDFVLQVYRDYLFTGANDVEFLQDCWDAIVQTLDYV 563
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQ----- 130
++FD G GI +N G PDQT AYCGGL +AAL + +S +L+ Q
Sbjct: 564 KTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWMAALEAAIAISEILSQHQPDSAL 623
Query: 131 --QQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
Q+ ++ L K+ +Y + LW G YY+ DS+ SD +MADQLCG + A+ +N
Sbjct: 624 LKQKSIYATWLEKSLPIYQEKLWNGQYYRLDSNSG--SDVVMADQLCGQFY--AALLNLP 679
Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYLLS 247
I P AL T++ F G GA NG++P+G + A EVWTG+ + L+
Sbjct: 680 DIVPGDRAISALRTVYDACFLKFYNGQFGAANGVRPDGSPENPQATHPLEVWTGINFGLA 739
Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
A ++ G DEA + + +Y GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 740 AFLVQMGMKDEALRLTEAVVQQIYSN-GLQFRTPEAITPSGTFRASTYLRAMAIWGV 795
>gi|425442611|ref|ZP_18822852.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9717]
gi|389716273|emb|CCH99457.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9717]
Length = 784
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 98/290 (33%), Positives = 149/290 (51%), Gaps = 20/290 (6%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L KD FL +P+ + ++
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
++FD GI +N G PDQT AYCGGL IAAL +++ ++ + F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATIKIGEIIG--EDTAIF 610
Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
++ L ++ +YH LW G YY DS SD +M DQLCG + + ++ + +
Sbjct: 611 TSWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYARL--LSLPDVVENQY 666
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
AL+ I+ F G GA NG+KP+G + A EVW G+ + L A +L G
Sbjct: 667 TQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMG 726
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ W + + VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVKQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|440753145|ref|ZP_20932348.1| hypothetical protein O53_1523 [Microcystis aeruginosa TAIHU98]
gi|440177638|gb|ELP56911.1| hypothetical protein O53_1523 [Microcystis aeruginosa TAIHU98]
Length = 789
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 99/290 (34%), Positives = 149/290 (51%), Gaps = 20/290 (6%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L KD FL +P+ + ++
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
++FD GI +N G PDQT AYCGGL IAAL +++ ++L + F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATLKIGAILG--EDTAIF 610
Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
++ L ++ +YH LW G YY DS SD +M DQLCG + + + + +
Sbjct: 611 ASWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYARLLAL--PDVVENQY 666
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
AL+ I+ F G GA NG+KP+G + A EVW G+ + L A +L G
Sbjct: 667 TQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMG 726
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ W + + VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVKQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|443669591|ref|ZP_21134796.1| hypothetical protein C789_5336 [Microcystis aeruginosa DIANCHI905]
gi|159029778|emb|CAO87856.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443330115|gb|ELS44858.1| hypothetical protein C789_5336 [Microcystis aeruginosa DIANCHI905]
Length = 784
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 99/290 (34%), Positives = 148/290 (51%), Gaps = 20/290 (6%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L KD FL +P+ + ++
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
++FD GI +N G PDQT AYCGGL IAAL +++ ++ + F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATLKIGEIIG--EDTAIF 610
Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
++ L ++ +YH LW G YY DS SD +M DQLCG + + + + +
Sbjct: 611 TSWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYARLLAL--PDVVENQY 666
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
AL+ I+ F G GA NG+KP+G + A EVW G+ + L A +L G
Sbjct: 667 TQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMG 726
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ W + R VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVRQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|425447791|ref|ZP_18827773.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9443]
gi|389731620|emb|CCI04385.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9443]
Length = 784
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 99/290 (34%), Positives = 150/290 (51%), Gaps = 20/290 (6%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L KD FL +P+ + ++
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
++FD GI +N G PDQT AYCGGL IAAL +++ ++L + F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATLKIGAILG--EDTAIF 610
Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
++ L ++ +YH LW G YY DS SD +M DQLCG + A ++ + +
Sbjct: 611 ASWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFY--ARLLSLPDVVENQY 666
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
AL+ I+ F G GA NG+KP+G + A EVW G+ + L A +L G
Sbjct: 667 TQSALSKIYEACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMG 726
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ W + + +YE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVQQIYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|425464233|ref|ZP_18843555.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9809]
gi|389833806|emb|CCI21373.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9809]
Length = 784
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 98/290 (33%), Positives = 149/290 (51%), Gaps = 20/290 (6%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L KD FL +P+ + ++
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
++FD GI +N G PDQT AYCGGL IAAL +++ ++L + F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATIKIGAILG--EDTAIF 610
Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
++ L ++ +YH LW G YY DS SD +M DQLCG + + + + +
Sbjct: 611 ASWLQQSRTIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYSRLLAL--PDVVENQY 666
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
AL+ I+ F G GA NG+KP+G + A EVW G+ + L A ++ G
Sbjct: 667 TQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPQATHPLEVWVGINFGLVAFLMQMG 726
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ W + + VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVKQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|166363582|ref|YP_001655855.1| glycosidase-like protein [Microcystis aeruginosa NIES-843]
gi|166085955|dbj|BAG00663.1| glycosidase-like protein [Microcystis aeruginosa NIES-843]
Length = 784
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 98/290 (33%), Positives = 150/290 (51%), Gaps = 20/290 (6%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L KD FL +P+ + ++
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
++FD GI +N G PDQT AYCGGL IAAL +++ ++L + + F
Sbjct: 553 KTFDLDKDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATLKIGAILG--EDTEIF 610
Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
++ L ++ +YH LW G YY DS SD +M DQLCG + + + + +
Sbjct: 611 ASWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYSRLLAL--PDVVENQY 666
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
AL+ I+ F G GA NG+KP+G + A EVW G+ + L A ++ G
Sbjct: 667 TQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPQATHPLEVWVGINFGLVAFLMQMG 726
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ W + + VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVKQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|390437958|ref|ZP_10226465.1| Glycosidase-like protein [Microcystis sp. T1-4]
gi|389838618|emb|CCI30589.1| Glycosidase-like protein [Microcystis sp. T1-4]
Length = 784
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 99/290 (34%), Positives = 148/290 (51%), Gaps = 20/290 (6%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L KD FL +P+ + ++
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVLTLQYL 552
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
++FD GI +N G PDQT AYCGGL IAAL +++ ++L + F
Sbjct: 553 KTFDLDKDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATLKIGAILG--EDTAIF 610
Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
+ L ++ +YH LW G YY DS SD +M DQLCG + + + + +
Sbjct: 611 AGWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYARLLAL--PDVVENQY 666
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
AL+ I+ F G GA NG+KP+G + A EVW G+ + L A ++ G
Sbjct: 667 TQSALSKIYQACFLKFHEGKFGAANGLKPDGTPENPQATHPLEVWVGINFGLVAFLMQMG 726
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ W + R VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVRQVYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|119493962|ref|ZP_01624522.1| hypothetical protein L8106_04106 [Lyngbya sp. PCC 8106]
gi|119452283|gb|EAW33479.1| hypothetical protein L8106_04106 [Lyngbya sp. PCC 8106]
Length = 799
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 104/303 (34%), Positives = 154/303 (50%), Gaps = 30/303 (9%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRD+ L +D FL + +E +
Sbjct: 499 PNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDFVLTGSEDYEFLQDCWLAIVETLDYL 558
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
+ FDK GI +N G PDQT AYCGGL +AAL +E+ L T+
Sbjct: 559 KIFDKDEDGIPENSGAPDQTFDDWKLQGVSAYCGGLWLAALEAAIEIGKCLQKTEFSDNV 618
Query: 136 SAK-----------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASG 184
S L KA +Y +L W G Y++ DS SD +MADQLCG + K G
Sbjct: 619 SPSISDKISTYETWLEKAKPIYQNL-WNGQYFQLDSQSG--SDVVMADQLCGQFYAKLLG 675
Query: 185 VNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVT 243
+ E I AL TI+ + + F G GA NG++ NG+ + + EVWTG+
Sbjct: 676 L--EDIVSPEKTISALQTIYQSCFQNFYQGQFGAANGVRINGEPINPNDTHPLEVWTGIN 733
Query: 244 YLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYA 303
+ L+A ++ G +E + A + + +YE GL F TPE +T T+R+ Y+RA+A++A
Sbjct: 734 FGLAAFLIQMGMKEEGFKLAEVVVQQIYE-NGLQFRTPEAITAAGTFRASHYLRAMAIWA 792
Query: 304 MQD 306
+ D
Sbjct: 793 IYD 795
>gi|425460273|ref|ZP_18839755.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9808]
gi|389827051|emb|CCI21967.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9808]
Length = 784
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 98/290 (33%), Positives = 148/290 (51%), Gaps = 20/290 (6%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L KD FL + + + ++
Sbjct: 493 PNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWSSIVLTLQYL 552
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
++FD GI +N G PDQT AYCGGL IAAL +++ ++L + F
Sbjct: 553 KTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATLKIGAILG--EDTAIF 610
Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
++ L ++ +YH LW G YY DS SD +M DQLCG + + + + +
Sbjct: 611 ASWLQQSRAIYHQTLWNGEYYNLDSGSG--SDVVMTDQLCGQFYARLLAL--PDVVENQY 666
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYLLSAAMLYEG 254
AL+ I+ F G GA NG+KP+G + A EVW G+ + L A +L G
Sbjct: 667 TQSALSKIYEACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAFLLQMG 726
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ W + R +YE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 727 MEKQGWQITEAVVRQIYE-NGLQFRTPEAITSVGTFRACHYLRAMAIWAI 775
>gi|416392732|ref|ZP_11685894.1| hypothetical protein CWATWH0003_2700 [Crocosphaera watsonii WH
0003]
gi|357263592|gb|EHJ12579.1| hypothetical protein CWATWH0003_2700 [Crocosphaera watsonii WH
0003]
Length = 804
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 101/307 (32%), Positives = 154/307 (50%), Gaps = 35/307 (11%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRD+ L DI FL + E +
Sbjct: 495 PNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFLLTGSDDIDFLWECWAAIPETLNYL 554
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL-----NDTQ 130
+ FD GI +N G PDQT AYCGGL IAAL +++ +L N+ Q
Sbjct: 555 KEFDLDNDGIPENSGAPDQTFDDWKLSGISAYCGGLWIAALEAAIKIGEILLENSPNNPQ 614
Query: 131 QQQQ------------FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
+ + ++ L ++ +YH LW G YY DS SD +MADQLCG
Sbjct: 615 LEPENYPESIEKELDKYNNWLQQSRAIYHSTLWNGEYYNLDSKSG--SDIVMADQLCGQF 672
Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEE 237
+ + G++ + S AL ++ F+ G GA NGMKP+G D ++ +E
Sbjct: 673 YARLLGLSD--VVESEYTLSALNKVYEACFLKFQNGKYGAANGMKPDGTPEDPNSTHPQE 730
Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
VWTG+ + +++ ++ G +EA + + VYE GL F TPE +T TYR+ Y+R
Sbjct: 731 VWTGINFGIASFLIQMGMKNEALKLTETVVKQVYE-NGLQFRTPEAITAVGTYRACHYLR 789
Query: 298 ALAVYAM 304
A+A++ +
Sbjct: 790 AMAIWGV 796
>gi|67922703|ref|ZP_00516206.1| Protein of unknown function DUF608 [Crocosphaera watsonii WH 8501]
gi|67855413|gb|EAM50669.1| Protein of unknown function DUF608 [Crocosphaera watsonii WH 8501]
Length = 804
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 101/307 (32%), Positives = 154/307 (50%), Gaps = 35/307 (11%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRD+ L DI FL + E +
Sbjct: 495 PNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFLLTGSDDIDFLWECWAAIPETLNYL 554
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL-----NDTQ 130
+ FD GI +N G PDQT AYCGGL IAAL +++ +L N+ Q
Sbjct: 555 KEFDLDNDGIPENSGAPDQTFDDWKLSGISAYCGGLWIAALEAAIKIGEILLENSPNNPQ 614
Query: 131 QQQQ------------FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
+ + ++ L ++ +YH LW G YY DS SD +MADQLCG
Sbjct: 615 LEPENYPESIEKELDKYNNWLQQSRAIYHSTLWNGEYYNLDSKSG--SDIVMADQLCGQF 672
Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEE 237
+ + G++ + S AL ++ F+ G GA NGMKP+G D ++ +E
Sbjct: 673 YARLLGLSD--VVESEYTLSALNKVYEACFLKFQNGKYGAANGMKPDGTPEDPNSTHPQE 730
Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
VWTG+ + +++ ++ G +EA + + VYE GL F TPE +T TYR+ Y+R
Sbjct: 731 VWTGINFGIASFLIQMGMKNEALKLTETVVKQVYE-NGLQFRTPEAITAVGTYRACHYLR 789
Query: 298 ALAVYAM 304
A+A++ +
Sbjct: 790 AMAIWGV 796
>gi|162452686|ref|YP_001615053.1| hypothetical protein sce4410 [Sorangium cellulosum So ce56]
gi|161163268|emb|CAN94573.1| hypothetical protein sce4410 [Sorangium cellulosum So ce56]
Length = 782
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 98/291 (33%), Positives = 150/291 (51%), Gaps = 16/291 (5%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKCES 87
E F N Y D ++WKDLN K ++ +YRD+AL D FL +P ++K +
Sbjct: 492 ESVFRKWNAYTYRDSTNWKDLNSKLVLMVYRDWALTGKTDKAFLDYCWPAVKMAMQKVKG 551
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
D G G+ ++ G DQT AYCGGL IAA E+++ + D +
Sbjct: 552 QDGDGDGLPESNGI-DQTYDDMDLHGNTAYCGGLFIAASEAAKELATAVGDASLASTYQT 610
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIA 197
+ ++ + LWTGSYYK D+ SR ++ IM+DQL G + +A G+ I ++ A
Sbjct: 611 WVEQSKGGFESKLWTGSYYKIDTG-SRDTNRIMSDQLAGQWYARALGL--PPIVDPSHAA 667
Query: 198 RALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVD 257
A I+ N + F+ G+ G VN M NG D ++ Q+ E W G ++ + A M+ EG
Sbjct: 668 SAFTKIYENNFKRFDGGTRGVVNVMTANGSVDGTSNQTRECWVGTSWGVVAGMIQEGLAA 727
Query: 258 EAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAY 308
+A L T+++ GL F TPE G+ + R+ YMRA ++A + AY
Sbjct: 728 QAGEIGASLVDTIWKTDGLWFRTPEAWEGNGSIRAPYYMRATTLWAAKHAY 778
>gi|320105716|ref|YP_004181306.1| glucosylceramidase [Terriglobus saanensis SP1PR4]
gi|319924237|gb|ADV81312.1| Glucosylceramidase [Terriglobus saanensis SP1PR4]
Length = 794
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 98/290 (33%), Positives = 145/290 (50%), Gaps = 17/290 (5%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
PE +PF ++N D +DWKDLN KF++ +YRDY L D FL +P I+
Sbjct: 504 PEGDPFYVVNEPGWQDTNDWKDLNSKFVLMVYRDYVLTGRTDKAFLQETWPAVKAAIQYL 563
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
FD G G+ +N G+PDQT AY GGL + AL E + +L D Q ++
Sbjct: 564 RQFD-HGRGVPENSGYPDQTYDSWVVRGVSAYSGGLWLGALRAGEETARVLGDWTAQAEY 622
Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSAN 195
A A + Y LW G Y+ +D+ S + I ADQL G + +G+ + P A
Sbjct: 623 QALFAMAQKTYVATLWNGEYFLYDTDAS-NKGVIQADQLAGQWYAHMTGLGD--LVPRAM 679
Query: 196 IARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGN 255
L I++ NV F G++GAVNGM +G + +++ EVW G T +A + G
Sbjct: 680 QRSVLKKIYAFNVNQFGDGNLGAVNGMNADGTVVDN-VEAREVWAGTTLGYAALLKSLGI 738
Query: 256 VDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQ 305
+E++ GL+ +YE G F TPE +R+ YMR ++ M+
Sbjct: 739 EEESYHVVHGLFHVIYESKGYWFRTPEAWDVTGNFRASMYMRPAGLWGME 788
>gi|209526014|ref|ZP_03274547.1| protein of unknown function DUF608 [Arthrospira maxima CS-328]
gi|376002240|ref|ZP_09780079.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423067988|ref|ZP_17056778.1| hypothetical protein SPLC1_S630380 [Arthrospira platensis C1]
gi|209493540|gb|EDZ93862.1| protein of unknown function DUF608 [Arthrospira maxima CS-328]
gi|375329367|emb|CCE15832.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406710526|gb|EKD05737.1| hypothetical protein SPLC1_S630380 [Arthrospira platensis C1]
Length = 799
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 106/299 (35%), Positives = 155/299 (51%), Gaps = 27/299 (9%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L DI+FL +P +E +
Sbjct: 499 PNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDYLLTGADDIQFLVECWPAIVETLDYL 558
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL--------- 126
++FD+ GI +NGG PDQT AYCGGL +AAL +E++++L
Sbjct: 559 KTFDRDQDGIPENGGAPDQTFDDWRLVGISAYCGGLWLAALEAAIEIANILLSHNRDLTP 618
Query: 127 NDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVN 186
+ T+ ++ L A +Y LW GSYY DS SD +MADQLCG + G+
Sbjct: 619 DTTKAIATWTNWLETAKPLYDQTLWNGSYYNLDSQSG--SDVVMADQLCGQFYAALLGL- 675
Query: 187 TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYL 245
I P AL TI+ + F G GA NG++ G+ + + EVWTG+ +
Sbjct: 676 -PDIVPPHRAQVALQTIYQACFQNFHNGKFGAANGVRTTGEPMNPNDTHPLEVWTGINFG 734
Query: 246 LSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
L A + G +EA+ A + VY+ GL F TPE +T T+R+ Y+RA+A++AM
Sbjct: 735 LGAFFIQMGMKEEAFQLAEAVITQVYQ-NGLQFRTPEAITAAGTFRASHYLRAMAIWAM 792
>gi|334116939|ref|ZP_08491031.1| Glucosylceramidase [Microcoleus vaginatus FGP-2]
gi|333461759|gb|EGK90364.1| Glucosylceramidase [Microcoleus vaginatus FGP-2]
Length = 818
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 102/299 (34%), Positives = 153/299 (51%), Gaps = 27/299 (9%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRD+ L D FL +P +E +
Sbjct: 517 PNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDFLLTGGTDSEFLQECWPAIVEALAYL 576
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQ--- 132
++FD+ G GI +N G PDQT AYCGGL +AAL + + +L D Q
Sbjct: 577 KTFDEDGDGIPENSGAPDQTFDDWQMRGISAYCGGLWLAALEAAIAIGEILEDAAPQIIP 636
Query: 133 ------QQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVN 186
+ + L +A Y + LW G YY+ DS +S+ +MADQLCG + K G+
Sbjct: 637 NPQAKIALYHSWLLQARPSYQEKLWNGQYYRLDSES--NSEVVMADQLCGQFYAKLLGL- 693
Query: 187 TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYL 245
I P AL T++ + F G GA NG+ NG + +A EVWTG+ +
Sbjct: 694 -PDIVPPECAVSALETVYESCFLKFNEGEFGAANGVMLNGSPENPNATHPLEVWTGINFG 752
Query: 246 LSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
L+A ++ G D+A+ + +YE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 753 LAAFLVQMGMEDKAFKLTDAVVNQIYE-NGLQFRTPEAITAGGTFRASHYLRAMAIWAI 810
>gi|172036524|ref|YP_001803025.1| hypothetical protein cce_1609 [Cyanothece sp. ATCC 51142]
gi|354553307|ref|ZP_08972614.1| Glucosylceramidase [Cyanothece sp. ATCC 51472]
gi|171697978|gb|ACB50959.1| unknown [Cyanothece sp. ATCC 51142]
gi|353555137|gb|EHC24526.1| Glucosylceramidase [Cyanothece sp. ATCC 51472]
Length = 818
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 102/312 (32%), Positives = 151/312 (48%), Gaps = 35/312 (11%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRD+ L DI FL + E +
Sbjct: 509 PNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFLLTGSDDIEFLWECWEAIPETLNYL 568
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSS-LLNDTQQQQQ 134
++FD GI +N G PDQT AYCG L IAAL +++ LL + Q Q
Sbjct: 569 KAFDLDNDGIPENSGAPDQTFDDWELRGISAYCGALWIAALEAAIKIGEILLENAPQNPQ 628
Query: 135 FSAK----------------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
+ L ++ +YH LW G YYK DS SD +MADQL G
Sbjct: 629 LEPENYPKSIEKELDKYQQWLQQSRAIYHSTLWNGEYYKLDSESG--SDVVMADQLSGQF 686
Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEE 237
+ + G+ + AL ++ F+ G GA NGMKP+G D ++ +E
Sbjct: 687 YARLLGL--PDVVEQQYALSALKKVYEACFLKFQNGKYGAANGMKPDGTPEDPNSTHPQE 744
Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
VWTG+ + L++ ++ G DEA + + VYE GL F TPE +T T+R+ Y+R
Sbjct: 745 VWTGINFGLASFLIQMGMKDEALKLTEAVVKQVYE-NGLQFRTPEAITAVGTFRACHYLR 803
Query: 298 ALAVYAMQDAYL 309
A+A++ + +L
Sbjct: 804 AMAIWGVYYQFL 815
>gi|427712531|ref|YP_007061155.1| putative bile acid beta-glucosidase [Synechococcus sp. PCC 6312]
gi|427376660|gb|AFY60612.1| putative bile acid beta-glucosidase [Synechococcus sp. PCC 6312]
Length = 799
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 102/295 (34%), Positives = 157/295 (53%), Gaps = 23/295 (7%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L D+ FLT +P + +
Sbjct: 499 PNEHPWQASNYTSYQDCNLWKDLASDFVLLVYRDYLLTGATDLEFLTDCWPAIVSSLDYL 558
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL----NDTQ- 130
++FD G G+ +N G PDQT AYCGGL IAAL + ++ +L N+T+
Sbjct: 559 KTFDTDGDGLPENSGAPDQTYDDWQLKGVSAYCGGLWIAALEAALAIAQILEKQGNNTKL 618
Query: 131 QQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAI 190
+Q+++ A L+++ Y LW G +Y+ DS +S +MADQLCG + + +N +
Sbjct: 619 EQEKWQAWLSQSRPTYQAKLWNGQFYRLDSDS--NSQVVMADQLCGQYYARL--LNLPDV 674
Query: 191 FPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR-SAIQSEEVWTGVTYLLSAA 249
P AL T++ F G GA NG+ P+G ++ +A EVW G+ Y L+A
Sbjct: 675 VPIECAESALKTVYDVCFLKFHEGQFGAANGLLPDGSPEKPNATHPLEVWIGINYGLAAF 734
Query: 250 MLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ G D AW + R +Y+ GL F TPE +T T+R+ Y+R +A++AM
Sbjct: 735 LYQMGLEDGAWRLTQTVIRQIYD-NGLQFRTPEAITAVGTFRACMYLRPMAIWAM 788
>gi|443320525|ref|ZP_21049620.1| putative bile acid beta-glucosidase [Gloeocapsa sp. PCC 73106]
gi|442789771|gb|ELR99409.1| putative bile acid beta-glucosidase [Gloeocapsa sp. PCC 73106]
Length = 806
Score = 160 bits (406), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 102/307 (33%), Positives = 154/307 (50%), Gaps = 35/307 (11%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL+ F++ +YRD+ L +D FL +P+ + +
Sbjct: 495 PNEHPWEKTNYTSYQDCNLWKDLSSDFVLQVYRDFVLTGKEDTDFLWECWPSITQTLNYL 554
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDT---QQQ 132
+SFD G GI +N G PDQT AYCGGL IAAL + + ++L D Q
Sbjct: 555 KSFDLDGDGIPENAGAPDQTFDDWKLEGISAYCGGLWIAALEASIAIGAVLLDNPPMNPQ 614
Query: 133 QQFS--------------AKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
Q S + LT A +YH LW G YY D+ S +MADQL G
Sbjct: 615 LQPSDYPEGIEATLKTHQSWLTLARSIYHQQLWNGQYYSLDTKSG--SVVVMADQLAGQF 672
Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-RDRSAIQSEE 237
+ + G+ + + L TI+ + F G +GA NG+K +G + + E
Sbjct: 673 YAQLLGL--PDVVEKEYVQSTLKTIYQSCFLNFHQGKIGAANGVKLDGTGENPQSTHPLE 730
Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
VWTG+ + L A +++ G DE A + + +Y++ GL F TPE LT T+R+ Y+R
Sbjct: 731 VWTGINFALGAFLIHSGMRDEGLKLAETIVKQIYDQ-GLQFRTPEALTTSGTFRASHYLR 789
Query: 298 ALAVYAM 304
A+A++A+
Sbjct: 790 AMAIWAI 796
>gi|170079447|ref|YP_001736085.1| hypothetical protein SYNPCC7002_A2862 [Synechococcus sp. PCC 7002]
gi|169887116|gb|ACB00830.1| Conserved hypothetical protein (DUF608) [Synechococcus sp. PCC
7002]
Length = 803
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 105/310 (33%), Positives = 157/310 (50%), Gaps = 35/310 (11%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N D + WKDL F++ +YRD+ + +D FL + + L +
Sbjct: 495 PNEHPWEATNYTAYQDCNLWKDLGSDFVLQVYRDFVMTGAEDTEFLWSCWGSVLTALDYL 554
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQ----- 130
++FD G GI +NGG PDQT AYCGGL IAAL + M+ +L +
Sbjct: 555 KTFDLDGDGIPENGGAPDQTFDDWKLKGISAYCGGLWIAALEAAIAMAKILQQSPSHFEA 614
Query: 131 QQQQFSAK------------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
Q Q SA L +A +Y D+LW GSYY D+ +S +MADQLCG
Sbjct: 615 HQNQVSAAEFEQAIATYQTWLDQARPLYQDILWNGSYYTLDTGS--NSRVVMADQLCGQY 672
Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR-SAIQSEE 237
+ + ++ + P A LATI++ F G GA NG+ P+G ++ + E
Sbjct: 673 YTQL--LSLPDVNPGDRTATTLATIYTACFEKFHGGQFGAANGLNPDGTPEKENDTHPLE 730
Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
VWTG+ + ++A M+ G EA + VY+ GL F TPE +T + T+R+ Y+R
Sbjct: 731 VWTGINFGIAALMIRNGMQTEALRMVEAVVEQVYD-NGLQFRTPEAITANGTFRACHYLR 789
Query: 298 ALAVYAMQDA 307
A+ ++A+ DA
Sbjct: 790 AMGIWAIYDA 799
>gi|440680630|ref|YP_007155425.1| Glucosylceramidase [Anabaena cylindrica PCC 7122]
gi|428677749|gb|AFZ56515.1| Glucosylceramidase [Anabaena cylindrica PCC 7122]
Length = 802
Score = 160 bits (405), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 98/306 (32%), Positives = 157/306 (51%), Gaps = 31/306 (10%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
P E + N D + WKDL F++ +YRD+ +D+ FL + ++ +
Sbjct: 504 PNEHVWEKTNYTCYQDCNLWKDLGSDFVLQVYRDFIFTGGQDVEFLADCWDAIVQTLDYL 563
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQ----- 130
++FD G GI +N G PDQT AYCGGL +AAL + + +L + +
Sbjct: 564 KTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALESAIAICDILTNRRGAENA 623
Query: 131 -----QQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV 185
Q+ + + L ++ +Y + LW G YY+ DS S+ +MADQLCG + + G+
Sbjct: 624 EKIRSQKSVYQSWLQQSLPIYQEKLWNGQYYRLDSESG--SNVVMADQLCGQFYARLLGL 681
Query: 186 NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTY 244
I P+ N AL T++ F G GA NG++P+G + A EVWTG+ +
Sbjct: 682 --PDIVPNDNALSALQTVYDACFVKFCNGQFGAANGVRPDGSPENPQATHPLEVWTGINF 739
Query: 245 LLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
L+A ++ G +EA + + +Y+ GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 740 GLAAFLIQMGMKNEAMRLTQAVVQQIYDN-GLQFRTPEAITSAGTFRASTYLRAMAIWAI 798
Query: 305 QDAYLK 310
YLK
Sbjct: 799 ---YLK 801
>gi|126657976|ref|ZP_01729128.1| hypothetical protein CY0110_05157 [Cyanothece sp. CCY0110]
gi|126620614|gb|EAZ91331.1| hypothetical protein CY0110_05157 [Cyanothece sp. CCY0110]
Length = 804
Score = 160 bits (405), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 102/307 (33%), Positives = 148/307 (48%), Gaps = 35/307 (11%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRD+ L DI FL + E +
Sbjct: 495 PNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFLLTGADDIEFLWECWEAVPEALNYL 554
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSS-LLNDTQQQQQ 134
+ FD GI +N G PDQT AYCG L IAAL +++ LL + Q
Sbjct: 555 KEFDLDNDGIPENSGAPDQTFDDWELRGMSAYCGALWIAALEAAIKIGEILLENRPNNPQ 614
Query: 135 FSAK----------------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
K L ++ VYH LW G YYK DS SD +MADQL G
Sbjct: 615 LEPKNYPECIEKELDKYQNWLQQSRSVYHSTLWNGEYYKLDSESG--SDVVMADQLSGQF 672
Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEE 237
+ + G+ + + AL ++ F+ G GA NGMKP+G D ++ +E
Sbjct: 673 YARLLGL--PDVVENQYALSALKKVYEACFLKFQNGKYGAANGMKPDGTPEDPNSTHPQE 730
Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
VWTG+ + L++ ++ G DEA + + VYE GL F TPE +T T+R+ Y+R
Sbjct: 731 VWTGINFGLASFLIQMGMKDEALKLTEAVVKQVYE-NGLQFRTPEAITAVGTFRACHYLR 789
Query: 298 ALAVYAM 304
A+A++ +
Sbjct: 790 AMAIWGI 796
>gi|220909879|ref|YP_002485190.1| hypothetical protein Cyan7425_4519 [Cyanothece sp. PCC 7425]
gi|219866490|gb|ACL46829.1| protein of unknown function DUF608 [Cyanothece sp. PCC 7425]
Length = 814
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 101/298 (33%), Positives = 151/298 (50%), Gaps = 26/298 (8%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L D FLT +P +
Sbjct: 503 PNEHPWEKTNYTSYQDCNLWKDLPADFVILVYRDYLLTGGTDYDFLTTCWPAIHATLAYL 562
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA--------CMVEMSSLLN 127
++FD G GI +N G PDQT AYCGGL IAAL +V+ L +
Sbjct: 563 KTFDLDGDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEAAIAIIKLLLVQAPDLPH 622
Query: 128 DTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT 187
+ + L ++ +Y LW G YY+ DS SD +MADQLCG + + G+
Sbjct: 623 LQESLTTYQTWLAQSRPLYQQTLWNGQYYRLDSGSG--SDVVMADQLCGQFYARLVGL-- 678
Query: 188 EAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYLL 246
+ P AL ++ T F G++GA NG++P G ++ A EVWTG+ + L
Sbjct: 679 PDVVPEDCARAALDMVYQTCFLQFHNGTLGAANGLRPGGLAEKPDATHPLEVWTGINFGL 738
Query: 247 SAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+A ++ G EA + +YE GL F TPE +T ++T+R+G Y+R +A++A+
Sbjct: 739 AAFLVQMGRKAEAMQLTAAVVYQIYEH-GLQFRTPEAITAEQTFRAGMYLRPMAIWAI 795
>gi|428317101|ref|YP_007114983.1| Glucosylceramidase [Oscillatoria nigro-viridis PCC 7112]
gi|428240781|gb|AFZ06567.1| Glucosylceramidase [Oscillatoria nigro-viridis PCC 7112]
Length = 809
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 100/299 (33%), Positives = 154/299 (51%), Gaps = 27/299 (9%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRD+ L D FL +P +E +
Sbjct: 508 PNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDFLLTGGTDCEFLQECWPAIVEALAYL 567
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQ--- 132
++FD+ G GI +N G PDQT AYCGGL +AAL + + +L D +
Sbjct: 568 KTFDEDGDGIPENSGAPDQTFDDWQMRGISAYCGGLWLAALEAAIAIGEILEDAAPEIIP 627
Query: 133 ------QQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVN 186
+ + L +A Y + LW G YY+ DS +S+ +MADQLCG + K G+
Sbjct: 628 NPQAKIALYHSWLLQARPSYQEKLWNGQYYRLDSES--NSEVVMADQLCGQFYAKLLGLP 685
Query: 187 TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYL 245
I P AL T++ + F G GA NG+ NG + +A EVWTG+ +
Sbjct: 686 D--IVPPECAVSALETVYESCFLKFNDGEFGAANGVMLNGSPENPNATHPLEVWTGINFG 743
Query: 246 LSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
L+A ++ G ++A+ + + +YE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 744 LAAFLVQMGMEEKAFKLTDAVVKQIYE-NGLQFRTPEAITAGGTFRASHYLRAMAIWAI 801
>gi|428200465|ref|YP_007079054.1| putative bile acid beta-glucosidase [Pleurocapsa sp. PCC 7327]
gi|427977897|gb|AFY75497.1| putative bile acid beta-glucosidase [Pleurocapsa sp. PCC 7327]
Length = 808
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 105/307 (34%), Positives = 152/307 (49%), Gaps = 35/307 (11%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRD+ L KD+ FL + + +E +
Sbjct: 496 PNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFVLTGAKDVAFLWECWQSIVETLAYL 555
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQ--- 132
+ FD+ G GI +N G PDQT AYCGGL IAAL + + +L +
Sbjct: 556 KKFDRDGDGIPENSGAPDQTFDDWRLRGISAYCGGLWIAALEAAIAIGKILIENPPSNPD 615
Query: 133 --------------QQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
Q + L A +YH+ LW G YY DS SD +MADQLCG
Sbjct: 616 LELENYPQSLQNTIQTYQNWLQGARSLYHEKLWNGEYYHLDSESG--SDIVMADQLCGQF 673
Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-RDRSAIQSEE 237
+ + G+ + AL ++ F+ G GA NG+KP+G D A E
Sbjct: 674 YARLLGLPD--VVERQYAESALKKVYEACFLKFQDGKFGAANGVKPDGSPEDPKATHPLE 731
Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
VWTG+ + L+A +L G DEA+ A + R VY+ GL F TPE +T T+R+ Y+R
Sbjct: 732 VWTGINFGLAAFLLQMGMKDEAFKLAETVVRQVYD-NGLQFRTPEAITAVGTFRASHYLR 790
Query: 298 ALAVYAM 304
A+A++ +
Sbjct: 791 AMAIWGI 797
>gi|119510228|ref|ZP_01629365.1| hypothetical protein N9414_13977 [Nodularia spumigena CCY9414]
gi|119465077|gb|EAW45977.1| hypothetical protein N9414_13977 [Nodularia spumigena CCY9414]
Length = 814
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 102/298 (34%), Positives = 157/298 (52%), Gaps = 26/298 (8%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E + N D + WKDL F++ +YRD+ L D++FL + ++ +
Sbjct: 504 PNEHVWEKTNYTCYQDCNLWKDLGSDFVLQVYRDFLLTGADDVQFLADCWAGIVQTLDYL 563
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL-------ND 128
+SFD G GI +N G PDQT AYCGGL +AAL + +S +L D
Sbjct: 564 KSFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALEAAIAISDILLTNHRGAED 623
Query: 129 TQQ-QQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT 187
T++ Q++ L ++ +Y + LW G Y++ DS SD +MADQLCG + + G+
Sbjct: 624 TEELGQRYRVWLAQSRTIYQEKLWNGQYFRLDSESG--SDVVMADQLCGQFYARLLGL-- 679
Query: 188 EAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLL 246
I PS AL T++ F G GA NG++P+G + +A EVWTG+ + L
Sbjct: 680 PDIVPSDRALSALQTVYHACFVKFCNGEFGAANGVRPDGSAENPNATHPLEVWTGINFGL 739
Query: 247 SAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+A ++ G DEA G + + +Y GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 740 AAFLVQMGMQDEALKLTGAVVQQIY-HNGLQFRTPEAITASGTFRASTYLRAMAIWAI 796
>gi|428770169|ref|YP_007161959.1| glucosylceramidase [Cyanobacterium aponinum PCC 10605]
gi|428684448|gb|AFZ53915.1| Glucosylceramidase [Cyanobacterium aponinum PCC 10605]
Length = 802
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 147/307 (47%), Gaps = 35/307 (11%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHK--DIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY D FL + + +E +
Sbjct: 494 PNEHPWEKSNYTSYQDCNLWKDLGCDFVLLVYRDYLFTGGLDQDFLWECWDSVVETLHYV 553
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL------NDT 129
++FD G GI +N G PDQT AYCGGL IA L + + + L N +
Sbjct: 554 KTFDLDGDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAGLEAAIAIGNTLIANPPMNPS 613
Query: 130 QQQQQ-----------FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
Q F LT+A +YH+ LW G YY+ DS SD +MADQLCG
Sbjct: 614 LQPDDYPQGIKDAIAVFQQWLTQARSLYHETLWNGEYYRLDSESG--SDVVMADQLCGQF 671
Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEE 237
+ + G+ + S + L ++ F G G NG+KP+G + E
Sbjct: 672 YAQLLGLPD--VVESKYVKSTLRKVYDACFLKFHDGKFGIANGVKPDGKPVKENDTHPLE 729
Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
VWTG+ Y + A M+ G E A + + VYE GL F TPE +T T+R+ Y+R
Sbjct: 730 VWTGINYGIVAFMILNGMKQEGLRVAETVVKQVYE-NGLQFRTPEAITAVGTFRASHYLR 788
Query: 298 ALAVYAM 304
A+AV+A+
Sbjct: 789 AMAVWAI 795
>gi|254431466|ref|ZP_05045169.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
gi|197625919|gb|EDY38478.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
Length = 837
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 98/297 (32%), Positives = 144/297 (48%), Gaps = 25/297 (8%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
P E P+ N D + WKDL F++ ++R + L +D+RFL +P +E +R
Sbjct: 527 PNERPWDATNYTAYQDCNLWKDLASDFVLQVWRTFRLAPTGEDLRFLADCWPAAVEALRY 586
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSL------LND 128
+ FD G+ NGG PDQT AYCG L IAAL + M L+
Sbjct: 587 LKGFDANSDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAMGQRLQLELGLDT 646
Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
+ QQ+ FS+ L ++ + LLW G YY D+ + +MADQLCG + + +
Sbjct: 647 SSQQRDFSSWLEQSRANFDRLLWNGEYYNIDADSG--TPVVMADQLCGDFYARL--LELP 702
Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
+ AL I GF+ GS+G NG++ +G D EVWTG+ + L+
Sbjct: 703 PVVAEERARSALRAIREACFEGFQGGSLGVANGLRRDGTPLDPDGTHPLEVWTGINFGLA 762
Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
A G D A G + VY GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 763 AYYRLMGETDTALAITGAVVHQVYG-GGLQFRTPEAITAVNTFRACHYLRAMAIWAL 818
>gi|75907319|ref|YP_321615.1| hypothetical protein Ava_1096 [Anabaena variabilis ATCC 29413]
gi|75701044|gb|ABA20720.1| Protein of unknown function DUF608 [Anabaena variabilis ATCC 29413]
Length = 804
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 103/301 (34%), Positives = 153/301 (50%), Gaps = 29/301 (9%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E + N + D + WKDL F++ +YRD+ L D++FL + +E +
Sbjct: 504 PNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFLLTGADDVQFLRDCWDAIVETLDYV 563
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
++FD G GI +N G PDQT AYCGGL +AALA + +S +L Q +
Sbjct: 564 KTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWMAALAAAIAISDILLQNHQDSET 623
Query: 136 SAKL-----------TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASG 184
KL TK+ +Y + LW G YY+ DS SD +MADQLCG + A+
Sbjct: 624 KEKLLLQKSTYETWLTKSLPIYQEKLWNGKYYRLDSESG--SDVVMADQLCGQFY--ANL 679
Query: 185 VNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVT 243
+ I PS AL T++ F G GA NG++P+G + A EVWTG+
Sbjct: 680 LELPDIVPSDRAISALQTVYDACFLKFYDGQFGAANGVRPDGSPENPKATHPLEVWTGIN 739
Query: 244 YLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYA 303
+ L+A ++ G DE + + +Y GL F TPE +T T+R+ Y+RA+A++A
Sbjct: 740 FGLAAFLVQMGMKDEGFRLTQAVVAQIY-NNGLQFRTPEAITAAGTFRASTYLRAMAIWA 798
Query: 304 M 304
+
Sbjct: 799 I 799
>gi|254410084|ref|ZP_05023864.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196183120|gb|EDX78104.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 814
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 99/304 (32%), Positives = 143/304 (47%), Gaps = 34/304 (11%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKCES 87
E P+ N + D + WKDL F++ +YR + KDI FL + + + +S
Sbjct: 509 EHPWEKTNYTSYQDCNQWKDLPCDFVLQVYRTFVFTGSKDIEFLAECWSAIVRALAYLKS 568
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGG----------------LHIAALACMVE 121
FD G GI +N G PDQT AYCGG L L +
Sbjct: 569 FDHDGDGIPENSGAPDQTFDDWQLQGVSAYCGGLWLAALEAASAIAQILLDNYTLVSAIN 628
Query: 122 MSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLK 181
+ + + L +A +Y + LW G YY+ DS SD +MADQLCG + +
Sbjct: 629 LPDPESIQHTLNTYHKWLQQAQPIYQEKLWNGQYYRLDSDSD--SDVVMADQLCGQFYAQ 686
Query: 182 ASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWT 240
G+ I P AL T++ + F GA NG+KP+G + A EVWT
Sbjct: 687 LLGL--PDIVPPECTQSALKTVYESCFLKFHNAQFGAANGVKPDGSPEHPDATHPLEVWT 744
Query: 241 GVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALA 300
G+ + L+A ++ G DEA + R VYE GL F TPE +T T+R+G Y+RA+A
Sbjct: 745 GINFGLAAFLIQLGMKDEALKLTESVVRQVYE-NGLQFRTPEAITPVGTFRAGHYLRAMA 803
Query: 301 VYAM 304
++A+
Sbjct: 804 IWAV 807
>gi|443325334|ref|ZP_21054034.1| putative bile acid beta-glucosidase [Xenococcus sp. PCC 7305]
gi|442795063|gb|ELS04450.1| putative bile acid beta-glucosidase [Xenococcus sp. PCC 7305]
Length = 808
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 101/311 (32%), Positives = 157/311 (50%), Gaps = 43/311 (13%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHK--DIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY + + FL + + + +
Sbjct: 499 PNEHPWEATNYTSYQDCNQWKDLPSDFVLQVYRDYLFTEGDNTEFLWECWDSVVCTLAYL 558
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL--NDTQQQQ 133
+ FD+ GI +N G PDQT +YCGGL IAAL +++ +L N T+ +
Sbjct: 559 KEFDEDNDGIPENSGAPDQTFDDWRLQGISSYCGGLWIAALEAAIKIGEILIANPTKNPE 618
Query: 134 QFSAK---------------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
+A L ++ ++YHD LW G YY+ DS +S+ +MADQLCG
Sbjct: 619 LQTADFPQGIEKTISDYREWLQRSRELYHDTLWNGEYYRLDSKS--NSEVVMADQLCGQF 676
Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSE-- 236
+ + G+ + + AL+ I+ F G GA NG+ P+G SA+ +
Sbjct: 677 YARLLGLPD--VVENQYTKSALSKIYDACFLKFHDGKFGAANGVLPDG----SAVNPDDT 730
Query: 237 ---EVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSG 293
EVWTG+ + L+A M+ G DEA+ + R VYE GL F TPE +T T+R+
Sbjct: 731 HPLEVWTGINFGLAAFMVQMGMKDEAFRLTETVVRQVYE-NGLQFRTPEAITAVGTFRAS 789
Query: 294 GYMRALAVYAM 304
Y+RA+A++ +
Sbjct: 790 HYLRAMAIWGV 800
>gi|307151416|ref|YP_003886800.1| glucosylceramidase [Cyanothece sp. PCC 7822]
gi|306981644|gb|ADN13525.1| Glucosylceramidase [Cyanothece sp. PCC 7822]
Length = 805
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 155/307 (50%), Gaps = 35/307 (11%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YR + + +L + + E +
Sbjct: 495 PNEHPWEKTNYTSYQDCNLWKDLGSDFVLLLYRSFLWTGGNNQDYLWEHWHSVTETLAYL 554
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL-----NDTQ 130
++FD G GI +N G PDQT AYCG L IAAL ++M +L + Q
Sbjct: 555 KAFDLDGDGIPENSGAPDQTFDDWQLRGISAYCGALWIAALEAAIKMGEILLQRPPMNPQ 614
Query: 131 QQQQFSAK------------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
Q Q +A+ L +A +YHD LW G YY+ DS SD +M+DQLCG
Sbjct: 615 LQPQNAAESIEEAITTYRHWLQQARSLYHDTLWNGEYYRLDSESG--SDVVMSDQLCGQF 672
Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEE 237
+ + G+ + + P + AL I+ F G GA NG+KP+G + A E
Sbjct: 673 YARLLGL-PDVVEPQYTTS-ALRKIYEACFLKFHGGKYGAANGVKPDGTPENPDATHPLE 730
Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
VWTG+ + L+A M+ G +EA+ + + VYE GL F TPE +T T+R+ Y+R
Sbjct: 731 VWTGINFGLAAFMIQMGMKEEAFKITEAVVKQVYE-NGLQFRTPEAITAVGTFRASHYLR 789
Query: 298 ALAVYAM 304
A+A++A+
Sbjct: 790 AMAIWAV 796
>gi|427716943|ref|YP_007064937.1| glucosylceramidase [Calothrix sp. PCC 7507]
gi|427349379|gb|AFY32103.1| Glucosylceramidase [Calothrix sp. PCC 7507]
Length = 800
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 101/299 (33%), Positives = 154/299 (51%), Gaps = 27/299 (9%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E + N D + WKDL F++ +YRD+ L D+ FLT +P +E +
Sbjct: 499 PNEHVWEKTNYTCYQDCNLWKDLGSDFVLQVYRDFLLTGADDVEFLTDCWPAIVETLNYL 558
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL--------- 126
++FD+ GI +N G PDQT AYCGGL +AAL + +S++L
Sbjct: 559 KTFDQDDDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALEAAIAISNILLNQNLADLG 618
Query: 127 NDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVN 186
N Q+ + L ++ +Y + LW G YY+ DS SD +MADQLCG + + G+
Sbjct: 619 NLALQKSIYETWLAQSRPIYEEKLWNGQYYRLDSKSG--SDVVMADQLCGQFYARLLGL- 675
Query: 187 TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYL 245
I S AL T++ F G GA NG++P+G + A EVWTG+ +
Sbjct: 676 -PDIVASDRALSALQTVYEACFLKFYDGQFGAANGVRPDGSPENPQATHPLEVWTGINFG 734
Query: 246 LSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
L+A ++ G DEA + + +Y+ GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 735 LAAFLVQMGMQDEALKLTQAVVKQIYD-NGLQFRTPEAITTVGTFRASTYLRAMAIWAI 792
>gi|222639789|gb|EEE67921.1| hypothetical protein OsJ_25789 [Oryza sativa Japonica Group]
Length = 919
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 102/287 (35%), Positives = 146/287 (50%), Gaps = 36/287 (12%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N Y + D WKDLN KF++ +YRD + F +P + + FD+
Sbjct: 593 DPWFEVNAYMLHDPVRWKDLNPKFVLQVYRDVVATGNAGFAEAAWPAVYLAMAYMDQFDR 652
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G G+V+N G PDQT AY GGL +AAL M+ ++ D + F +
Sbjct: 653 DGDGMVENEGRPDQTYDLWSVSGVSAYTGGLWVAALQAAAAMAGIVGDGAAEAYFRGRYH 712
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
+A +VY D LW G Y+ +D+S S SI ADQL G + +A G+ E I RAL
Sbjct: 713 RARRVYTDELWNGGYFNYDNSGGATSSSIQADQLAGQWYARACGL--EPIVDGDKARRAL 770
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
AT+ NV M+ S+EV GVTY ++AAM++EG + A+
Sbjct: 771 ATVLDYNV-------------MR---------THSKEVCPGVTYAVAAAMIHEGMPEAAF 808
Query: 261 TTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQ 305
TA G++ + + G G F+TPE T D YR+ YMR L V+AMQ
Sbjct: 809 KTAKGIHDAGWGKHGFGYAFQTPESWTADGGYRALHYMRPLGVWAMQ 855
>gi|16330256|ref|NP_440984.1| hypothetical protein sll1775 [Synechocystis sp. PCC 6803]
gi|383321997|ref|YP_005382850.1| hypothetical protein SYNGTI_1088 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383325166|ref|YP_005386019.1| hypothetical protein SYNPCCP_1087 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383491050|ref|YP_005408726.1| hypothetical protein SYNPCCN_1087 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384436317|ref|YP_005651041.1| hypothetical protein SYNGTS_1088 [Synechocystis sp. PCC 6803]
gi|451814414|ref|YP_007450866.1| hypothetical protein MYO_110980 [Synechocystis sp. PCC 6803]
gi|1652745|dbj|BAA17664.1| sll1775 [Synechocystis sp. PCC 6803]
gi|339273349|dbj|BAK49836.1| hypothetical protein SYNGTS_1088 [Synechocystis sp. PCC 6803]
gi|359271316|dbj|BAL28835.1| hypothetical protein SYNGTI_1088 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359274486|dbj|BAL32004.1| hypothetical protein SYNPCCN_1087 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359277656|dbj|BAL35173.1| hypothetical protein SYNPCCP_1087 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407958168|dbj|BAM51408.1| hypothetical protein BEST7613_2477 [Bacillus subtilis BEST7613]
gi|451780383|gb|AGF51352.1| hypothetical protein MYO_110980 [Synechocystis sp. PCC 6803]
Length = 817
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 104/303 (34%), Positives = 148/303 (48%), Gaps = 31/303 (10%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDI--RFLTRVYPTCLELIRKC 85
P E P+ N D + WKDL F++ +YRD+ L D FL +P+ + +
Sbjct: 495 PNEHPWERSNYTAYQDCNLWKDLGSDFVLLVYRDFLLLPDADGEFLGECWPSIVAALDYL 554
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAAL-ACMVEMSSLLNDTQQQQQ 134
+ FD GI +N G PDQT AYCGGL IAAL A +V LL + QQ
Sbjct: 555 KGFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEAALVMGQYLLENPPQQTD 614
Query: 135 FSAK------------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKA 182
A L +A +Y D LW G YY DS S +MADQLCG + A
Sbjct: 615 LDAMEIQTAIARYETWLAQARSLYQDTLWNGEYYNLDSQSG--SKVVMADQLCGQFY--A 670
Query: 183 SGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVWTG 241
+N + +ALA ++ T F G +GA NG+ P+G ++ + EVWTG
Sbjct: 671 RLLNLPDVVEHHYAEKALAKVYDTCFLKFAHGELGAANGLLPDGSPQNPNDTHPLEVWTG 730
Query: 242 VTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAV 301
+ + L+A ++ G +EA + VY GL F TPE +T T+R+ Y+RA+A+
Sbjct: 731 INFGLAAFLIQMGMKEEALKMTETVVNQVY-NNGLQFRTPEAITAVGTFRASHYLRAMAI 789
Query: 302 YAM 304
+A+
Sbjct: 790 WAV 792
>gi|434407175|ref|YP_007150060.1| putative bile acid beta-glucosidase [Cylindrospermum stagnale PCC
7417]
gi|428261430|gb|AFZ27380.1| putative bile acid beta-glucosidase [Cylindrospermum stagnale PCC
7417]
Length = 816
Score = 157 bits (396), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 149/301 (49%), Gaps = 29/301 (9%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E + N D + WKDL F++ +YRD+ D+ FL + ++ +
Sbjct: 512 PNEHVWEKTNYTCYQDCNLWKDLGSDFVLQVYRDFLFTGADDVEFLADCWDAIVQTLDYL 571
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL--------- 126
+ FDK G GI +N G PDQT AYCGGL +AAL C + + +L
Sbjct: 572 KGFDKDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALECAIAICDILLTNRQDAKN 631
Query: 127 --NDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASG 184
N Q+ + A L ++ VY + LW G YY+ DS ++ +MADQLCG + + G
Sbjct: 632 AKNLETQRSIYGAWLAQSLTVYEEKLWNGQYYRLDSESG--TEVVMADQLCGQFYARLLG 689
Query: 185 VNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVT 243
+ I AL T++ F G GA NG++P+G + +A EVWTG+
Sbjct: 690 L--PDIVKCDRALSALQTVYDACFLKFFDGQFGAANGVRPDGVPENPNATHPLEVWTGIN 747
Query: 244 YLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYA 303
+ L+A ++ G DEA + +Y+ GL F TPE +T T+R+ Y+R +A++A
Sbjct: 748 FGLAAFLVQMGMQDEALRLTQAVVSQIYD-NGLQFRTPEAITTTGTFRASTYLRPMAIWA 806
Query: 304 M 304
+
Sbjct: 807 I 807
>gi|298490121|ref|YP_003720298.1| glucosylceramidase ['Nostoc azollae' 0708]
gi|298232039|gb|ADI63175.1| Glucosylceramidase ['Nostoc azollae' 0708]
Length = 806
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 92/288 (31%), Positives = 148/288 (51%), Gaps = 31/288 (10%)
Query: 43 DVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGG 100
D + WKDL+ F++ +YRD+ D++FL + ++ + ++FD G GI +N G
Sbjct: 519 DCNLWKDLSSDFVLQVYRDFLFTGSHDVQFLVDCWDAIVQTLDYLKTFDLDGDGIPENSG 578
Query: 101 FPDQT----------AYCGGLHIAALACMVEMSSLLNDTQ-------------QQQQFSA 137
PDQT AYCGGL +AAL + ++ +L + + Q+ + +
Sbjct: 579 APDQTFDDWLLNGVSAYCGGLWLAALESAIAIADILTNHRGTENTEKTDKLRSQKSIYQS 638
Query: 138 KLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIA 197
L ++ +Y LW G YY+ DS SD +MADQLCG + + G+ I PS +
Sbjct: 639 WLQQSRPIYEQKLWNGQYYRLDSESG--SDVVMADQLCGQFYTRLLGL--PDIVPSNHAH 694
Query: 198 RALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYLLSAAMLYEGNV 256
AL T++ F G GA NG+ P+G + A EVWTG+ + L+A ++ G
Sbjct: 695 SALQTVYEACFVKFYDGKFGAANGVLPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMK 754
Query: 257 DEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+EA + +Y+ GL F TPE +T T+R+ Y+R +A++A+
Sbjct: 755 NEAMQLTQAVVEQIYDN-GLQFRTPEAITPVGTFRASTYLRPMAIWAI 801
>gi|434389621|ref|YP_007100232.1| putative bile acid beta-glucosidase [Chamaesiphon minutus PCC 6605]
gi|428020611|gb|AFY96705.1| putative bile acid beta-glucosidase [Chamaesiphon minutus PCC 6605]
Length = 818
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 98/305 (32%), Positives = 150/305 (49%), Gaps = 38/305 (12%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N D + WKDL F++ +YR Y KD+ FL + ++ +
Sbjct: 518 PNEHPWEKTNYTCYQDCNLWKDLGSDFVLQVYRAYIFTGKKDLEFLAECWTAIVDTLAYL 577
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQ-- 133
+ FD GI++N G PDQT AYCGGL +AAL + + ++L TQQ
Sbjct: 578 KQFDLDNDGIIENSGAPDQTFDDWRLQGISAYCGGLWLAALEAAISIGNIL--TQQYHGE 635
Query: 134 ----------QFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKAS 183
Q+ A L +Y LW GSYY+ D+ S+ +MADQLCG + +
Sbjct: 636 LTPKPAEVIPQYQAWLDSGKPIYQQQLWNGSYYRIDTGSG--SEVVMADQLCGQFYTQL- 692
Query: 184 GVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMK----PNGDRDRSAIQSEEVW 239
+ + I P AL T++ + F+ G GA NG+K P+ +D + EVW
Sbjct: 693 -LELDDIVPPDCAKTALQTVYQSCFLNFQGGKFGAANGVKIDGSPSNPKDTHPL---EVW 748
Query: 240 TGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRAL 299
TG+ + ++A ++ G +EA A + VY GL F TPE +T T+R+ Y+RA+
Sbjct: 749 TGINFGIAAFLVQMGMQEEALKLAETIVEQVYS-NGLQFRTPEAITSAGTFRASHYLRAM 807
Query: 300 AVYAM 304
A++ +
Sbjct: 808 AIWGI 812
>gi|81299663|ref|YP_399871.1| hypothetical protein Synpcc7942_0854 [Synechococcus elongatus PCC
7942]
gi|81168544|gb|ABB56884.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 798
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 96/293 (32%), Positives = 148/293 (50%), Gaps = 21/293 (7%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N D + WKDL F++ ++RDY L D FL+ +P + +R
Sbjct: 496 PNESPWLQTNYTGYQDCNLWKDLGCDFVLQVWRDYQLTGSSDRGFLSDCWPAAVAALRYL 555
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLN---DTQQQ 132
+ FD G GI +N G PDQT AYCGGL IAAL + ++ +L + + +
Sbjct: 556 KDFDLDGDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEAALAIADVLELSAEDRDR 615
Query: 133 QQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFP 192
Q F + L +A +YHD LW G YY+ DS S +MADQLCG + + + + P
Sbjct: 616 QDFQSWLAQARSLYHDTLWNGRYYQLDSGSG--SQVVMADQLCGDFYSRL--LQLPPVAP 671
Query: 193 SANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAML 251
++ + F +G G NG+ P+G D EVWTG+ + ++A L
Sbjct: 672 LEAAQSTADMVYEACFQKFHSGQFGLANGLLPDGSPVDPKGTHPLEVWTGINFGIAAYWL 731
Query: 252 YEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
G+ D + + R +Y+ GL F TPE +T + T+R+ Y+R +A++A+
Sbjct: 732 LLGHRDRCFEVTETVIRQIYD-NGLQFRTPEAITANATFRASHYLRPMAIWAV 783
>gi|428209901|ref|YP_007094254.1| glucosylceramidase [Chroococcidiopsis thermalis PCC 7203]
gi|428011822|gb|AFY90385.1| Glucosylceramidase [Chroococcidiopsis thermalis PCC 7203]
Length = 812
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 103/311 (33%), Positives = 155/311 (49%), Gaps = 33/311 (10%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E + N D + WKDL F++ +YRDY L D+ FL + ++ +
Sbjct: 506 PNEHVWEKTNYTAYQDCNLWKDLPSDFVLQVYRDYLLTGATDVEFLASCWEAIVQALNYL 565
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSS-LLNDTQQQQ- 133
++FDK G GI +N G PDQT AYCGGL +AAL + + LLN+ ++
Sbjct: 566 KTFDKDGDGIPENSGAPDQTFDDWKLQGVSAYCGGLWLAALEAAIAIGKILLNNVERLHV 625
Query: 134 -------------QFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFL 180
+ + L+++ +Y + LW G YY+ DS+ SD +MADQLCG +
Sbjct: 626 TSLHVTSLQETIDTYQSWLSRSRPIYQEKLWNGQYYRLDSNSG--SDVVMADQLCGQFYA 683
Query: 181 KASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-RDRSAIQSEEVW 239
+ + I P AL TI+ F G GA NG+KPNG + +A EVW
Sbjct: 684 RL--LKLPDIVPQECALSALKTIYDACFLKFHQGKFGAANGLKPNGSPENPNATHPLEVW 741
Query: 240 TGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRAL 299
TG+ + L+A +L G EA + +YE GL F TPE +T T+R+ Y+RA+
Sbjct: 742 TGINFGLAAFLLQMGMKAEALQLTETVVNQIYE-NGLQFRTPEAITSVGTFRASYYLRAM 800
Query: 300 AVYAMQDAYLK 310
A++A+ + +
Sbjct: 801 AIWAIYGVWFE 811
>gi|7470943|pir||T31071 conserved hypothetical protein - Anabaena sp. (fragment)
gi|2828799|gb|AAC38131.1|AAC38131 unknown [Nostoc sp. PCC 7120]
Length = 439
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 101/301 (33%), Positives = 152/301 (50%), Gaps = 29/301 (9%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E + N + D + WKDL F++ +YRD+ L D++FL + +E +
Sbjct: 139 PNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFLLTGADDVQFLRDCWDAIVETLDYV 198
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQ----- 130
++FD G GI +N G PDQT AYCGGL +AALA + +S +L
Sbjct: 199 KTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALAAAIAISDILLQNHQDSAT 258
Query: 131 ------QQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASG 184
Q+ + LTK+ +Y + LW G YY+ DS SD +MADQLCG + A+
Sbjct: 259 KGHLLYQKSTYETWLTKSLPIYQEKLWNGKYYRLDSESG--SDVVMADQLCGQFY--ANL 314
Query: 185 VNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVT 243
+ I PS AL T++ F G GA NG++P+G + A EVWTG+
Sbjct: 315 LELPDIVPSDRAISALQTVYDACFLKFYDGQFGAANGVRPDGSPENPKATHPLEVWTGIN 374
Query: 244 YLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYA 303
+ L+A ++ G DE + + +Y GL F TPE +T T+R+ Y+R +A++A
Sbjct: 375 FGLAAFLVQMGMKDEGFRLTQAVVEQIY-NNGLQFRTPEAITAAGTFRASTYLRPMAIWA 433
Query: 304 M 304
+
Sbjct: 434 I 434
>gi|17230308|ref|NP_486856.1| hypothetical protein alr2816 [Nostoc sp. PCC 7120]
gi|17131910|dbj|BAB74515.1| alr2816 [Nostoc sp. PCC 7120]
Length = 804
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 101/301 (33%), Positives = 152/301 (50%), Gaps = 29/301 (9%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E + N + D + WKDL F++ +YRD+ L D++FL + +E +
Sbjct: 504 PNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFLLTGADDVQFLRDCWDAIVETLDYV 563
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQ----- 130
++FD G GI +N G PDQT AYCGGL +AALA + +S +L
Sbjct: 564 KTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALAAAIAISDILLQNHQDSAT 623
Query: 131 ------QQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASG 184
Q+ + LTK+ +Y + LW G YY+ DS SD +MADQLCG + A+
Sbjct: 624 KEHLLYQKSTYETWLTKSLPIYQEKLWNGKYYRLDSESG--SDVVMADQLCGQFY--ANL 679
Query: 185 VNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVT 243
+ I PS AL T++ F G GA NG++P+G + A EVWTG+
Sbjct: 680 LELPDIVPSDRAISALQTVYDACFLKFYDGQFGAANGVRPDGSPENPKATHPLEVWTGIN 739
Query: 244 YLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYA 303
+ L+A ++ G DE + + +Y GL F TPE +T T+R+ Y+R +A++A
Sbjct: 740 FGLAAFLVQMGMKDEGFRLTQAVVEQIY-NNGLQFRTPEAITAAGTFRASTYLRPMAIWA 798
Query: 304 M 304
+
Sbjct: 799 I 799
>gi|147812668|emb|CAN61857.1| hypothetical protein VITISV_016690 [Vitis vinifera]
Length = 521
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 100/294 (34%), Positives = 146/294 (49%), Gaps = 25/294 (8%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN++D WKDLN KF++ +YRD D F V+P I + FDK
Sbjct: 184 DPWFELNAYNLYDTDRWKDLNSKFVLQVYRDMVATGDKNFARAVWPAVYIAIAFLDQFDK 243
Query: 91 QGLGIVQNGGFPDQTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHD-- 148
G G+++N G P Q A M+ + D+ F K KA VY
Sbjct: 244 DGDGMIENDGLPLQAASA-------------MAREVGDSMTADYFWFKFQKAKAVYDKDQ 290
Query: 149 ---LLWTG---SYYKFDSSQSRHS-DSIMADQLCGYLFLKASGVNTEAIFPSANIARALA 201
++ G ++F + S D + D + A + I AL
Sbjct: 291 LAAVIGEGRLLXLFRFLGNPSHTGIDGLFQDHR-DWTTRYARACGLQPIVDDEKARSALE 349
Query: 202 TIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWT 261
+++ NV + G GAVNGM P+G D SA+QS E+W GVTY ++A M++EG V+ A+
Sbjct: 350 KVYNFNVLKVKEGKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETAFN 409
Query: 262 TAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKV 313
TA G+Y + + GLG F+TPE D+ YRS YMR LA++AMQ A K ++
Sbjct: 410 TASGIYDAAWSQEGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALSKPEL 463
>gi|428299644|ref|YP_007137950.1| glucosylceramidase [Calothrix sp. PCC 6303]
gi|428236188|gb|AFZ01978.1| Glucosylceramidase [Calothrix sp. PCC 6303]
Length = 804
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 99/298 (33%), Positives = 150/298 (50%), Gaps = 26/298 (8%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E + N + D + WKDL F++ +YRD+ KDI FL + +
Sbjct: 502 PNEHVWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFVFTGSKDIEFLAECWDAIALTLEYL 561
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSS-LLNDTQQQQQ 134
++FDK GI +N G PDQT AYCGGL +AAL + + LL + Q++
Sbjct: 562 KAFDKDHDGIPENSGAPDQTFDDWKLQGVSAYCGGLWLAALEAAIAICDILLENHGDQEK 621
Query: 135 FSAK-------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT 187
+A+ L K+ +Y LW G YY+ DS SD +MADQLCG + + +
Sbjct: 622 LTAQKIIYTDWLAKSRPIYQQKLWNGQYYRLDSESG--SDVVMADQLCGQFYARL--LKL 677
Query: 188 EAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYLL 246
I P A+ T++ F G GA NG++P+G + A EVWTG+ + L
Sbjct: 678 ADIVPHDCAISAVRTVYEACFINFNDGKFGAANGLRPDGSPENPKATHPLEVWTGINFGL 737
Query: 247 SAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+A ++ G DEAW + + +Y+ GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 738 AAFLMQMGMQDEAWKLTEVVVQQIYD-NGLQFRTPEAITATGTFRACTYLRAMAIWGI 794
>gi|427709805|ref|YP_007052182.1| glucosylceramidase [Nostoc sp. PCC 7107]
gi|427362310|gb|AFY45032.1| Glucosylceramidase [Nostoc sp. PCC 7107]
Length = 802
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 99/298 (33%), Positives = 150/298 (50%), Gaps = 26/298 (8%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E + N D + WKDL+ F++ +YRDY L D+ FL + ++ +
Sbjct: 504 PNEHVWEKTNYTCYQDCNLWKDLSCDFVLQVYRDYLLTGADDVEFLADCWDAVVQTLDYL 563
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL--------N 127
++FDK G GI +N G PDQT AYCGGL +AAL + +S +L +
Sbjct: 564 KTFDKDGDGIPENSGAPDQTFDDWRLLGVSAYCGGLWLAALEAAIAISDILITNHKVVES 623
Query: 128 DTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT 187
+Q+ + L ++ +Y + LW G YY+ DS S +MADQLCG + + +
Sbjct: 624 AEKQKSIYEVWLAQSRPIYQEKLWNGQYYRLDSESG--SAVVMADQLCGQFYARL--LEL 679
Query: 188 EAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYLL 246
I PS AL T++ F+ G GA NG+ P+G + A EVWTG+ + L
Sbjct: 680 PDIVPSDRALSALKTVYDACFLKFQNGEFGAANGVLPDGSPENPKATHPLEVWTGINFGL 739
Query: 247 SAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+A ++ G DEA + VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 740 AAFLVQMGMQDEALRLTQAVVEQVY-NNGLQFRTPEAITATGTFRASTYLRAMAIWGI 796
>gi|427725201|ref|YP_007072478.1| glucosylceramidase [Leptolyngbya sp. PCC 7376]
gi|427356921|gb|AFY39644.1| Glucosylceramidase [Leptolyngbya sp. PCC 7376]
Length = 807
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 100/310 (32%), Positives = 151/310 (48%), Gaps = 35/310 (11%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N D + WKDL F++ +YRD+ + KD FL + ++ +
Sbjct: 501 PNEHPWEATNYTAYQDCNLWKDLGADFVLQVYRDFVMTGSKDTEFLWSCWEGVVQALDYL 560
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGL-------HIAALACMVEMSSLLND 128
++FD GI +NGG PDQT AYCGGL IA + +S+ +
Sbjct: 561 KTFDLDCDGIPENGGSPDQTFDDWKLKGISAYCGGLWIAALEAAIAIANILKNKASIYEE 620
Query: 129 TQQQ---QQFSAKLTK-------ASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
Q Q ++F A + K + +Y D LW GSYY D+ S +M+DQLCG
Sbjct: 621 HQNQVSTEEFEAAIAKYQDWLKQSRPLYQDTLWNGSYYTLDTGSD--SKVVMSDQLCGEF 678
Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR-SAIQSEE 237
+ + +N + P A L TI+ F G GA NG+ P+G ++ + E
Sbjct: 679 YTQL--LNLPNVNPENRTAITLQTIYEACFLKFHDGQFGAANGLNPDGSPEKENDTHPLE 736
Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
VWTG+ + ++A M+ G DEA + VY GL F TPE +T + T+R+ Y+R
Sbjct: 737 VWTGINFGIAAFMIRNGMKDEALKMTEAVIHQVYS-NGLQFRTPEAITANNTFRACHYLR 795
Query: 298 ALAVYAMQDA 307
A+ ++A+ DA
Sbjct: 796 AMGIWAIYDA 805
>gi|427733995|ref|YP_007053539.1| putative bile acid beta-glucosidase [Rivularia sp. PCC 7116]
gi|427369036|gb|AFY52992.1| putative bile acid beta-glucosidase [Rivularia sp. PCC 7116]
Length = 799
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 96/298 (32%), Positives = 151/298 (50%), Gaps = 26/298 (8%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E + N + D + WKDL F++ +YRD+ DI FL + +E +
Sbjct: 503 PNEHVWEKTNYTSYQDCNQWKDLGCDFVLQVYRDFIFTGSNDIEFLADCWNAIVETLNYL 562
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQ----- 130
++FDK G GI +N G PDQT AYCGGL +AAL + ++ +L+ Q
Sbjct: 563 KAFDKDGDGIPENSGAPDQTFDDWRLLGVSAYCGGLWLAALEAAIAIAEVLSSKQVQVEN 622
Query: 131 ---QQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT 187
Q+ + L K+ VY + LW G +Y+ DS SD +MADQLCG + + +
Sbjct: 623 LAAQKSIYEGWLEKSRPVYQEKLWNGKFYRLDSESG--SDVVMADQLCGQFYARL--LKL 678
Query: 188 EAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYLL 246
I P AL +++ F G +GA NG++ +G + A EVWTG+ + +
Sbjct: 679 PDIVPPECTMSALNSVYDACFLKFNQGKLGAANGLRLDGSPENPDATHPLEVWTGINFGI 738
Query: 247 SAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+A ++ G EA+ + +YE GL F TPE +T + +R+ Y+RA+A++A+
Sbjct: 739 AAFLVQMGMKSEAFKLTEAVVNQIYEH-GLQFRTPEAITATRNFRACVYLRAMAIWAI 795
>gi|366162568|ref|ZP_09462323.1| hypothetical protein AcelC_02756 [Acetivibrio cellulolyticus CD2]
Length = 842
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 98/289 (33%), Positives = 150/289 (51%), Gaps = 17/289 (5%)
Query: 33 FSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKCESFDK 90
F N Y D ++WKDLN K ++ +YRD+AL D+ FL + + + K +S D
Sbjct: 469 FETWNAYVYRDSTNWKDLNSKLVLMVYRDWALTGKTDMDFLKYCWIPVQKAMDKVKSQDS 528
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLT 140
G + + G DQT AYCGGL +AA E+++ + D+ + Q+ +
Sbjct: 529 DGDYLPNSSGI-DQTYDDMQLQGNTAYCGGLFVAACNAAKEIATAMGDSTKATQYQSWYE 587
Query: 141 KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARAL 200
K+ + Y LW GSY+K D+ S + IM+DQLCG+ + KA G+ E I P ++
Sbjct: 588 KSKENYIAKLWNGSYFKIDTG-SYDTSRIMSDQLCGHWYAKACGL--EGIVPDEYAKKSY 644
Query: 201 ATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAW 260
+ ++ N + F+ G G VN MK +G+ D S Q+ E W G +Y + A M+ EG EA
Sbjct: 645 SKVYEFNNKKFDNGLHGFVNIMKADGNVDMSHAQTAEAWVGTSYGVIAGMIQEGLETEAS 704
Query: 261 TTAGGLYRTVYERTGLGFETPEGL-TGDKTYRSGGYMRALAVYAMQDAY 308
L TV++ + F TPE G R+ YMRA ++A++ AY
Sbjct: 705 QVGSNLADTVWKTNDMWFRTPEAWRQGVSEVRAPYYMRANCIWAVKHAY 753
>gi|409991122|ref|ZP_11274412.1| hypothetical protein APPUASWS_08895 [Arthrospira platensis str.
Paraca]
gi|291568810|dbj|BAI91082.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409938009|gb|EKN79383.1| hypothetical protein APPUASWS_08895 [Arthrospira platensis str.
Paraca]
Length = 799
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 104/299 (34%), Positives = 150/299 (50%), Gaps = 27/299 (9%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L DI+FL +P ++ +
Sbjct: 499 PNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDYLLTGADDIQFLVECWPAIVQTLDYL 558
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMV--------EMSSLLN 127
++FD+ I +NGG PDQT AYCGGL +AAL + S +
Sbjct: 559 KTFDRDRDCIPENGGAPDQTFDDWRMMGISAYCGGLWLAALEAAIAIANILLSHHSDITP 618
Query: 128 DTQQQ-QQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVN 186
DT + ++ L A +Y LW GSYY DS SD +MADQLCG + G+
Sbjct: 619 DTTKAIATWTNWLETAKPLYDQTLWNGSYYNLDSQSG--SDVVMADQLCGQFYAALLGL- 675
Query: 187 TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYL 245
I P AL TI+ + F G GA NG++ G+ + EVWTG+ +
Sbjct: 676 -PDIVPHHRARVALQTIYQACFQNFHNGKFGAANGVRTTGEPINPQDTHPLEVWTGINFG 734
Query: 246 LSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
L A ++ G +EA+ A + VY+ GL F TPE +T T+R+ Y+RA+A++AM
Sbjct: 735 LGAFLIQMGMKEEAFQLAEAVITQVYQ-NGLQFRTPEAITAAGTFRASHYLRAMAIWAM 792
>gi|428773649|ref|YP_007165437.1| glucosylceramidase [Cyanobacterium stanieri PCC 7202]
gi|428687928|gb|AFZ47788.1| Glucosylceramidase [Cyanobacterium stanieri PCC 7202]
Length = 800
Score = 153 bits (387), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 100/310 (32%), Positives = 147/310 (47%), Gaps = 35/310 (11%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY D FL + + +E +
Sbjct: 494 PNEHPWIKSNYTSYQDCNLWKDLGSDFVLLVYRDYLYTGKNDQDFLWECWDSVVETLHYV 553
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDT------ 129
+ FD GI +N G PDQT AYCGGL I AL + + ++L D
Sbjct: 554 KGFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWICALEAAIAIGTILLDNPPVNPA 613
Query: 130 ------QQQQQ-----FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
Q+ Q F L ++ +YHD LW G YY DS SD +MADQLCG
Sbjct: 614 LEKENAQENIQGAIALFRQWLHQSRAIYHDTLWNGEYYNLDSKSG--SDVVMADQLCGQY 671
Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDR-SAIQSEE 237
+ A + + + L I+ F G++G NG+KP+G ++ + E
Sbjct: 672 Y--AQLLKLPDVVEKEYVTATLNKIYDACYLKFHNGTLGIANGVKPDGSPEKENDTHPLE 729
Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
+WTG+ Y + A M+ G +E A + VY GL F TPE +T T+R+ Y+R
Sbjct: 730 IWTGINYGIVAFMILNGMKEEGLKVAETVVNQVYS-NGLQFRTPEAITAVNTFRASHYLR 788
Query: 298 ALAVYAMQDA 307
A+A++A+ A
Sbjct: 789 AMAIWAIYQA 798
>gi|56750695|ref|YP_171396.1| hypothetical protein syc0686_c [Synechococcus elongatus PCC 6301]
gi|56685654|dbj|BAD78876.1| hypothetical protein [Synechococcus elongatus PCC 6301]
Length = 798
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 95/293 (32%), Positives = 147/293 (50%), Gaps = 21/293 (7%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N D + WKDL F++ ++RDY L D FL+ +P + +R
Sbjct: 496 PNESPWLQTNYTGYQDCNLWKDLGCDFVLQVWRDYQLTGSSDRGFLSDCWPAAVAALRYL 555
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLN---DTQQQ 132
+ FD G GI +N G PDQT AYCGGL IAAL + ++ +L + + +
Sbjct: 556 KDFDLDGDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEAALAIADVLELSAEDRDR 615
Query: 133 QQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFP 192
Q F + L +A +YHD LW G YY+ DS S +MADQLCG + + + + P
Sbjct: 616 QDFQSWLAQARSLYHDTLWNGRYYQLDSGSG--SQVVMADQLCGDFYSRL--LQLPPVAP 671
Query: 193 SANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAML 251
++ + F +G G NG+ P+G EVWTG+ + ++A L
Sbjct: 672 LEAAQSTADMVYEACFQKFHSGQFGLANGLLPDGSPVGPKGTHPLEVWTGINFGIAAYWL 731
Query: 252 YEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
G+ D + + R +Y+ GL F TPE +T + T+R+ Y+R +A++A+
Sbjct: 732 LLGHRDRCFEVTETVIRQIYD-NGLQFRTPEAITANATFRASHYLRPMAIWAV 783
>gi|218437284|ref|YP_002375613.1| hypothetical protein PCC7424_0277 [Cyanothece sp. PCC 7424]
gi|218170012|gb|ACK68745.1| protein of unknown function DUF608 [Cyanothece sp. PCC 7424]
Length = 801
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 100/309 (32%), Positives = 154/309 (49%), Gaps = 39/309 (12%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY----ALHKDIRFLTRVYPTCLELIR 83
P E P+ N + D + WKDL F++ +YR + A ++D FL + + + I
Sbjct: 495 PNEHPWEKTNYTSYQDCNLWKDLGCDFVLLLYRSFLWTGANNQD--FLWEHWDSVTQTIA 552
Query: 84 KCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEM-SSLLNDTQQQ 132
+ FD G GI +N G PDQT AYCG L IA L +++ +LL
Sbjct: 553 YVKKFDLDGDGIPENSGAPDQTFDDWQLRGISAYCGALWIAGLEAAIKIGETLLQRPPMN 612
Query: 133 QQFSAK----------------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCG 176
Q ++ L +A +YHD LWTG YY+ DS SD +M+DQLCG
Sbjct: 613 PQLQSENAPEKIEIAIATYRQWLHQARALYHDTLWTGEYYRLDSESG--SDVVMSDQLCG 670
Query: 177 YLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQS 235
+ + G+ + + P ++ AL I+ + F G GA NG++P+G + A
Sbjct: 671 QFYARLLGL-PDVVEPQYTLS-ALRKIYESCFLQFYGGKYGAANGVRPDGSPENPDATHP 728
Query: 236 EEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGY 295
EVWTG+ + L+A M+ G E + + + +YE GL F TPE +T T+R+ Y
Sbjct: 729 LEVWTGINFGLAAFMIQMGMKQEGLSITEAVVKQIYE-NGLQFRTPEAITAAGTFRASHY 787
Query: 296 MRALAVYAM 304
+RA+A++A+
Sbjct: 788 LRAMAIWAI 796
>gi|16183333|gb|AAL13692.1| GH26380p [Drosophila melanogaster]
Length = 391
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 82/204 (40%), Positives = 115/204 (56%), Gaps = 19/204 (9%)
Query: 68 IRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA 117
I +L +Y +C ++ + +DK G+++N PDQT AYC GL +AAL
Sbjct: 169 IGYLKAMYASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQ 228
Query: 118 CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
M M+++L+ ++ L K + + LW GSYY+FD S S H D+IMADQLCG+
Sbjct: 229 AMSAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLSHS-HRDTIMADQLCGH 287
Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN-------GDRDR 230
+LK+ G + E I+P N+ AL I+ NV GF G++GA NG N G D
Sbjct: 288 WYLKSCGFDYE-IYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVDN 346
Query: 231 SAIQSEEVWTGVTYLLSAAMLYEG 254
S IQ+EEVW GV Y L+A M+ EG
Sbjct: 347 SNIQAEEVWPGVVYALAATMIQEG 370
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 31/35 (88%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDY 62
P+EEPF+LIN YN+ DV+DWKDLN KF++ +YRDY
Sbjct: 66 PDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDY 100
>gi|434391596|ref|YP_007126543.1| Glucosylceramidase [Gloeocapsa sp. PCC 7428]
gi|428263437|gb|AFZ29383.1| Glucosylceramidase [Gloeocapsa sp. PCC 7428]
Length = 809
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 97/313 (30%), Positives = 149/313 (47%), Gaps = 44/313 (14%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL+ F++ +YRD+ L D L + ++ +
Sbjct: 499 PNEHPWEATNYTSYQDCNLWKDLSCDFVLQVYRDFVLTGGTDWELLWDCWSAVVQTLTYL 558
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGL---------------------HIA 114
++FD GI +N G PDQT AYCGGL H A
Sbjct: 559 KTFDLDEDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALEAAIAIGKTLLSYPDDHPA 618
Query: 115 A--LACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMAD 172
+ LA + + + F + L ++ +Y + LW G YY+ DS SD +MAD
Sbjct: 619 SKILASAPDYPPI---PETLDVFQSWLMRSRPIYQEKLWNGQYYRLDSKSG--SDVVMAD 673
Query: 173 QLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRS 231
QLCG + + G+ I P +AL T++ F G GA NG+KP+G + +
Sbjct: 674 QLCGQFYARLLGL--PDIVPPECATQALKTVYDACFVKFHNGQFGAANGLKPDGLPENPN 731
Query: 232 AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYR 291
A EVWTG+ + L+A ++ G EAW + + VY+ GL F TPE +T T+R
Sbjct: 732 ATHPLEVWTGINFGLAAFLMQMGMKSEAWRITQAVVQQVYD-NGLQFRTPEAITAKGTFR 790
Query: 292 SGGYMRALAVYAM 304
+ Y+R +A++A+
Sbjct: 791 ACHYLRPMAIWAI 803
>gi|186681980|ref|YP_001865176.1| hypothetical protein Npun_R1544 [Nostoc punctiforme PCC 73102]
gi|186464432|gb|ACC80233.1| protein of unknown function DUF608 [Nostoc punctiforme PCC 73102]
Length = 801
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 98/299 (32%), Positives = 151/299 (50%), Gaps = 27/299 (9%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E + N + D + WKDL F++ +YRD+ L D+ FL + ++ +
Sbjct: 503 PNEHVWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFLLTGADDVEFLADCWNAIVQTLDYL 562
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQ----- 130
++FD G GI +N G PDQT AYCGGL +AAL + +S LL +
Sbjct: 563 KTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALEAAIAISDLLLTHKLGDLG 622
Query: 131 ----QQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVN 186
Q+ + L ++ +Y + LW G YY+ DS SD +MADQLCG + + ++
Sbjct: 623 GLAVQKSIYETWLQQSLPIYQEKLWNGQYYQLDSQSG--SDVVMADQLCGQFYARL--LD 678
Query: 187 TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYL 245
I PS AL T++ F G G NG++P+G + A EVWTG+ +
Sbjct: 679 LPDIVPSDRALSALKTVYDACFLKFCNGEFGTANGVRPDGSPENPKATHPLEVWTGINFG 738
Query: 246 LSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
L+A ++ G DEA + + +YE GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 739 LAAFLVQMGMKDEALRLTQAVVQQIYEN-GLQFRTPEAITAAGTFRASTYLRAMAIWGI 796
>gi|449691194|ref|XP_004212588.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Hydra
magnipapillata]
Length = 169
Score = 150 bits (379), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 74/160 (46%), Positives = 106/160 (66%), Gaps = 3/160 (1%)
Query: 154 SYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT---EAIFPSANIARALATIFSTNVRG 210
+YY +D S++ + + IMA Q G +L A + + +F + L IF NV
Sbjct: 1 TYYNYDGSENDYHNCIMAGQCAGQWYLHACDLGQTEDDLVFRPEQVHSTLKVIFQNNVMR 60
Query: 211 FEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 270
FE G MGAVNGM+ +G D++++Q+EEVWTGVTY L+A M+ G V E + TA G+YRTV
Sbjct: 61 FEDGFMGAVNGMRADGTVDKNSLQAEEVWTGVTYALAATMIQHGMVREGFRTAEGIYRTV 120
Query: 271 YERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 310
YE+ GLGF+TPE +T +KT+RS GYMR L+++++ AY K
Sbjct: 121 YEKWGLGFQTPEAITANKTFRSRGYMRPLSIWSIYHAYSK 160
>gi|434400779|ref|YP_007134783.1| Glucosylceramidase [Stanieria cyanosphaera PCC 7437]
gi|428271876|gb|AFZ37817.1| Glucosylceramidase [Stanieria cyanosphaera PCC 7437]
Length = 804
Score = 150 bits (378), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 104/307 (33%), Positives = 146/307 (47%), Gaps = 35/307 (11%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L KD FL + + + +
Sbjct: 495 PNEHPWQATNYTSYQDCNLWKDLPSDFVLQVYRDYVLTGSKDSEFLWECWDSIVVTLAYL 554
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAA----------LACMVEMSSL 125
+ FD G GI +N G PDQT AYCGGL IAA L M+
Sbjct: 555 KEFDLDGDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEAAIAIGQILMTHPPMNPN 614
Query: 126 LNDTQQQQ-------QFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
L Q + L +A +YHD LW G YY+ DS +SD +MADQLCG
Sbjct: 615 LQPPDYPQGIETVINHYRVWLQQARSLYHDTLWNGEYYRLDSQS--NSDVVMADQLCGQF 672
Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEE 237
+ + G+ + AL I+ F G GA NG+ P+G + + E
Sbjct: 673 YARLLGL--PDVVEEQYTKSALKKIYQACFLKFHNGKYGAANGVLPDGSPVNPNDTHPLE 730
Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
VWTG+ + L+A ML G EA+ + + VYE GL F TPE +T T+R+ Y+R
Sbjct: 731 VWTGINFGLAAFMLQMGMKAEAFQLTETVVKQVYE-NGLQFRTPEAITAVGTFRASHYLR 789
Query: 298 ALAVYAM 304
A+A++ +
Sbjct: 790 AMAIWGI 796
>gi|284929778|ref|YP_003422300.1| putative bile acid beta-glucosidase [cyanobacterium UCYN-A]
gi|284810222|gb|ADB95919.1| predicted bile acid beta-glucosidase [cyanobacterium UCYN-A]
Length = 803
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 95/307 (30%), Positives = 150/307 (48%), Gaps = 35/307 (11%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRD+ L +DI FL +P +E +
Sbjct: 495 PNEHPWEETNYTSYQDCNLWKDLGSDFVLQVYRDFLLTDSQDINFLQECWPAIVETLDYL 554
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL-----NDTQ 130
+ FD GI +N G PDQT +YCG L I AL + + +L N++
Sbjct: 555 KKFDLDDDGIPENSGAPDQTFDDWQLHGISSYCGALWITALEAAIGIGKILLKNSENNSH 614
Query: 131 QQQQFSAK------------LTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYL 178
Q S L K+ +YH+ LW G YY+ DS SD+IM DQLCG
Sbjct: 615 PDLQNSLAFIQENLTKYDDWLKKSRLIYHETLWNGEYYQLDSKSG--SDTIMTDQLCGQF 672
Query: 179 FLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEE 237
+ A +N + ++ AL+ I+ F G GA NG++ +G + ++ +E
Sbjct: 673 Y--ARLLNLPDVVDTSCTISALSKIYEACFLNFYDGQYGAANGVRSDGTPENPNSTHPQE 730
Query: 238 VWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMR 297
VW G+ + L+A ++ G ++A + +Y GL F TPE +T T+R+ Y+R
Sbjct: 731 VWVGINFGLAAFLIQMGMKNQALELTKVVVEQIYAN-GLQFRTPEAITPVGTFRASHYLR 789
Query: 298 ALAVYAM 304
A+A++ +
Sbjct: 790 AMAIWGV 796
>gi|317969095|ref|ZP_07970485.1| hypothetical protein SCB02_06140 [Synechococcus sp. CB0205]
Length = 837
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 96/297 (32%), Positives = 152/297 (51%), Gaps = 25/297 (8%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
P E+P+ N D + WKDL +++ ++R + L +D+RFL +P +E +R
Sbjct: 530 PNEKPWDATNYTAYQDCNLWKDLASDYVLQVWRTFKLAPNGEDLRFLADCWPAAVEALRY 589
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALAC------MVEMSSLLND 128
++FD G+ NGG PDQT AYCG L IAAL +++S+ L+
Sbjct: 590 LKTFDINNDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALDAALAIAQQLQLSTGLDT 649
Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
+ +QQ+FSA L +A + LLW G YY D+ + +MADQLCG + + G+ E
Sbjct: 650 SAEQQEFSAWLAQARGNFDRLLWNGEYYDIDAESG--TPVVMADQLCGDFYARLLGL--E 705
Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
+ AN L + + FE GS+G NG++ +G D + EVWTG+ + ++
Sbjct: 706 PVVSEANSRSTLNAVKDSCFEKFEGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIA 765
Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ G + A + VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 766 SYFRLMGETETALAITSAVVNQVYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWGV 821
>gi|225593155|gb|ACN96073.1| pedicted bile acid beta-glucosidase [Fischerella sp. MV11]
Length = 812
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 101/308 (32%), Positives = 150/308 (48%), Gaps = 36/308 (11%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E + N + D + WKDL F++ +YRD+ DI FL + ++ +
Sbjct: 505 PNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFLFTGADDITFLAECWNAIVQTLDYL 564
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL----NDTQQ 131
++FD GI +N G PDQT AYCGGL +AAL + + +L DT+
Sbjct: 565 KTFDIDNDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALEAAIAICDVLITNRRDTEN 624
Query: 132 QQQF--------------SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
++F L ++ +Y LW G YY+ DS SD +MADQLCG
Sbjct: 625 AEKFLGDLGDLAVQKSIYETWLQQSRPIYEAKLWNGQYYRLDSESG--SDVVMADQLCGQ 682
Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSE 236
+ + G+ I AL T+++ F G GA NG++P+G + A
Sbjct: 683 FYARLLGLPD--IVKRDRALSALKTVYNACFLKFHHGQFGAANGLRPDGSPENPQATHPL 740
Query: 237 EVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYM 296
EVWTG+ + L+A M+ G DEA + R VYE GL F TPE +T T+R+ Y+
Sbjct: 741 EVWTGINFGLAAFMVQMGMKDEAMKLTEAVVRQVYE-NGLQFRTPEAITAVGTFRASTYL 799
Query: 297 RALAVYAM 304
RA+A++A+
Sbjct: 800 RAMAIWAI 807
>gi|414077215|ref|YP_006996533.1| beta-glucosidase [Anabaena sp. 90]
gi|413970631|gb|AFW94720.1| putative beta-glucosidase [Anabaena sp. 90]
Length = 795
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 93/299 (31%), Positives = 153/299 (51%), Gaps = 27/299 (9%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E + N + D + WKDL F++ +YRD+ D++FL + ++ +
Sbjct: 497 PNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFLFTGANDLQFLADCWDAIVQTLDYV 556
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQ----- 130
++FD G I +N G PDQT AYCGGL +AAL + +S +L + +
Sbjct: 557 KTFDLDGDSIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALEAAIAISKILTNRRGAEDA 616
Query: 131 ----QQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVN 186
Q + L+++ +Y + LW G YY+ DS SD +MADQLCG + + ++
Sbjct: 617 EVEVQGSIYQGWLSQSKPIYQEKLWNGEYYRLDSDSG--SDVVMADQLCGQFYARL--LS 672
Query: 187 TEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSAIQSEEVWTGVTYL 245
I PS AL ++ + F+ G GA NG+ PNG + ++ EVWTG+ +
Sbjct: 673 LPDIVPSDRALSALTKVYDSCFLKFQNGEFGAANGVLPNGLPENPNSTHPLEVWTGINFG 732
Query: 246 LSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
L+A ++ +EA + R +Y+ GL F TPE +T + T+R+ Y+RA+A++ +
Sbjct: 733 LAAFLVQMNMKNEAMRLTEAVVRQIYD-NGLQFRTPEAITANGTFRASTYLRAMAIWGI 790
>gi|113477064|ref|YP_723125.1| hypothetical protein Tery_3568 [Trichodesmium erythraeum IMS101]
gi|110168112|gb|ABG52652.1| protein of unknown function DUF608 [Trichodesmium erythraeum
IMS101]
Length = 816
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 100/315 (31%), Positives = 151/315 (47%), Gaps = 43/315 (13%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ + N + D + WKDL+ F++ +YRD+ L D FL + + E +
Sbjct: 499 PNEHPWEMTNYTSYQDCNQWKDLSSDFVLQVYRDFLLTGADDYEFLWQSWSAITETLAYL 558
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGL-----------------HIAALA- 117
+ FDK GI +N G PDQT AYCGGL H +
Sbjct: 559 KGFDKDNDGIPENEGAPDQTFDDWQLRGVSAYCGGLWLAALEAAIAIGKVLIEHPREIPY 618
Query: 118 -------CMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIM 170
V+ +S+ Q + L K +Y + LW G YY+ DS +S+ +M
Sbjct: 619 YPPKGFYSEVDKNSVDAINNQVYLYQGWLKKGLPIYQEKLWNGEYYRLDSES--NSEVVM 676
Query: 171 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD-RD 229
ADQL G + K +N E I P+ AL T++++ + F G GA NG+ P+G +
Sbjct: 677 ADQLSGQFYAKL--LNLEDIVPAECALSALKTVYNSCFKNFHNGKFGAANGVLPDGSPEN 734
Query: 230 RSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKT 289
+A EVWTG+ + L+A M+ G EA + +YE GL F TPE +T T
Sbjct: 735 PNATHPLEVWTGINFGLAAFMVQIGMKKEALEITEVVVGQIYE-NGLQFRTPEAITVMGT 793
Query: 290 YRSGGYMRALAVYAM 304
+R+ Y+RA+A++A+
Sbjct: 794 FRASHYLRAMAIWAI 808
>gi|354566897|ref|ZP_08986068.1| Glucosylceramidase [Fischerella sp. JSC-11]
gi|353544556|gb|EHC14010.1| Glucosylceramidase [Fischerella sp. JSC-11]
Length = 812
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 101/308 (32%), Positives = 148/308 (48%), Gaps = 36/308 (11%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E + N + D + WKDL F++ +YRD+ DI FL + +E +
Sbjct: 505 PNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFLFTGADDITFLAECWNAIVETLDYL 564
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL----NDTQQ 131
++FD GI +N G PDQT AYCGGL +AAL + + +L DT
Sbjct: 565 KTFDIDNDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALEAAIAICDVLITNRRDTDN 624
Query: 132 QQQFSAKL--------------TKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGY 177
+ F L ++ +Y LW G YY+ DS SD +MADQLCG
Sbjct: 625 AENFLGDLGDLAVQKSIYETWLQQSRPIYEAKLWNGQYYRLDSESG--SDVVMADQLCGQ 682
Query: 178 LFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSE 236
+ + G+ I AL T+++ F G GA NG++P+G + A
Sbjct: 683 FYARLLGL--PDIVKRDRALSALKTVYNACFLKFHHGQFGAANGLRPDGSPENPQATHPL 740
Query: 237 EVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYM 296
EVWTG+ + L+A M+ G DEA + R +YE GL F TPE +T T+R+ Y+
Sbjct: 741 EVWTGINFGLAAFMVQMGMKDEAMKLTETVVRQIYE-NGLQFRTPEAITAVGTFRASTYL 799
Query: 297 RALAVYAM 304
RA+A++A+
Sbjct: 800 RAMAIWAI 807
>gi|300866990|ref|ZP_07111661.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300335025|emb|CBN56827.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 863
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 101/321 (31%), Positives = 150/321 (46%), Gaps = 49/321 (15%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRD+ L D FL +P +E +
Sbjct: 539 PNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDFLLTGSTDYEFLQECWPAIVEALAYL 598
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQ- 134
++FD G GI +N G PDQT AYCGGL +AAL + + SLL Q +
Sbjct: 599 KTFDLDGDGIPENSGAPDQTFDDWQMRGVSAYCGGLWLAALEAAIAIGSLLIKHQSSVEE 658
Query: 135 ------------------------------FSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
+ L +A +Y LW G YY+ DS
Sbjct: 659 LNQSPITNFNKDFKDVSLFPITNYQLPITNYQNWLEQAKPIYQQRLWNGQYYRLDSESG- 717
Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
SD +MADQLCG + + G+ I P AL T++ + F G GA NG+
Sbjct: 718 -SDVVMADQLCGQFYARLLGL--PDIVPEECAISALETVYESCFIKFNQGEFGAANGVML 774
Query: 225 NGDRDRS-AIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEG 283
+G ++ A EVWTG+ + L+A M+ G ++A + + +Y GL F TPE
Sbjct: 775 DGSPEKPGATHPLEVWTGINFGLAAFMMQMGMKEKALELTEVVVKQIYA-NGLQFRTPEA 833
Query: 284 LTGDKTYRSGGYMRALAVYAM 304
+T T+R+ Y+RA+A++A+
Sbjct: 834 ITATGTFRASHYLRAMAIWAI 854
>gi|427419169|ref|ZP_18909352.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 7375]
gi|425761882|gb|EKV02735.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 7375]
Length = 835
Score = 146 bits (369), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 98/325 (30%), Positives = 156/325 (48%), Gaps = 53/325 (16%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YR + L D+ FL +P + ++
Sbjct: 512 PNENPWIATNYTSYQDCNQWKDLPSDFVLQVYRAFVLTGGDDLEFLAECWPAVTQTLQYL 571
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQ---- 131
+ FD G G+ +NGG PDQT AYCGGL +AAL ++M LL ++
Sbjct: 572 KLFDLDGDGVPENGGAPDQTFDDWQLKGLSAYCGGLWLAALEAAIKMGDLLQQHRKISGS 631
Query: 132 ----QQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT 187
+ Q+ + + VY + LWTGS+Y+ DS +SD +MADQLCG + ++
Sbjct: 632 FAIVKSQYQRWIKQGKIVYQNKLWTGSFYRLDS--ESNSDVVMADQLCGQFCARL--MDL 687
Query: 188 EAIFPSANIARALATIF------------------STNVRGFEAG---------SMGAVN 220
I I AL TI+ + G +AG ++GA N
Sbjct: 688 PDIVDQEFIDIALDTIYDACFVKFNEYAAKHSTPQNQKFEGSQAGFFSAAQLGMAIGAAN 747
Query: 221 GMKPNGDRDRSAIQSE-EVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFE 279
G+KP+G + + EVWTG+ + ++A + G D+A + + +Y GL F
Sbjct: 748 GVKPDGSPENPDDTHQLEVWTGINFGVAAFLAQMGKRDQAMAITEAVVQQIYNY-GLQFR 806
Query: 280 TPEGLTGDKTYRSGGYMRALAVYAM 304
TPE +T T+R+ Y+R +A++ +
Sbjct: 807 TPEAITAMGTFRACHYLRPMAIWGL 831
>gi|428223619|ref|YP_007107716.1| glucosylceramidase [Geitlerinema sp. PCC 7407]
gi|427983520|gb|AFY64664.1| Glucosylceramidase [Geitlerinema sp. PCC 7407]
Length = 821
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 98/304 (32%), Positives = 150/304 (49%), Gaps = 32/304 (10%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRD+ L D F+ +P +
Sbjct: 504 PNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFRLTGSTDDAFIAECWPAVTRALAYL 563
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL-----NDTQ 130
++FD+ G GI +N G PDQT AYCGGL IAAL + + L +D
Sbjct: 564 KAFDQDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWIAALEAAIALGERLQQHPVSDHP 623
Query: 131 QQQQ---------FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLK 181
+ Q + L ++ VY D LW G YY+ DS S+ +MADQLCG + +
Sbjct: 624 ELSQGAIAHHLSTYRTWLAQSRAVYDDALWNGRYYRLDSGSG--SEVVMADQLCGQFYAR 681
Query: 182 ASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWT 240
++ + S AL T++ + F G GA NG+KP+G + A EVWT
Sbjct: 682 L--LDLPDVVASDRALSALKTVYDSCFLRFHDGKFGAANGVKPDGSPENPKATHPLEVWT 739
Query: 241 GVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALA 300
G+ + ++A ++ G DE + + +YE GL F TPE +T T+R+ Y+R +A
Sbjct: 740 GINFGIAAFLVQMGLRDEGFRLTQAVVEQIYEH-GLQFRTPEAITAVGTFRACHYLRPMA 798
Query: 301 VYAM 304
++A+
Sbjct: 799 IWAV 802
>gi|443311911|ref|ZP_21041533.1| putative bile acid beta-glucosidase [Synechocystis sp. PCC 7509]
gi|442777986|gb|ELR88257.1| putative bile acid beta-glucosidase [Synechocystis sp. PCC 7509]
Length = 804
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 96/310 (30%), Positives = 143/310 (46%), Gaps = 38/310 (12%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRD+ L D L + ++ +
Sbjct: 492 PNEHPWEKTNYTSYQDCNLWKDLPSDFVLQVYRDFVLTGSTDWELLWDCWGAIVQTLNYL 551
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
++FD GI +N G PDQT AYCGGL +A L + + L Q+
Sbjct: 552 KTFDLDNDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAGLESAIAIGKTLISYPQEHPL 611
Query: 136 SAKLT--------------------KASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLC 175
LT ++ VY + LW G YYK DS SD +MADQLC
Sbjct: 612 GEMLTSPLNQPPIEETIGIYQTWLEQSRPVYQEKLWNGQYYKLDSESG--SDVVMADQLC 669
Query: 176 GYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQ 234
G + + GV I S +L T++ F G+ GA NG+K +G + A
Sbjct: 670 GQFYARLLGV--PDIVESDRTVSSLKTVYEACFLKFHNGAFGAANGLKIDGSPENPDATH 727
Query: 235 SEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGG 294
EVWTG+ + L+A +L +EA+ + +Y GL F TPE +T T+R+
Sbjct: 728 PLEVWTGINFGLAAFLLQMDMKEEAFKLTQTVVEQIY-NNGLQFRTPEAITAVGTFRACH 786
Query: 295 YMRALAVYAM 304
Y+R +A++A+
Sbjct: 787 YLRPMAIWAV 796
>gi|254423918|ref|ZP_05037636.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
gi|196191407|gb|EDX86371.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
Length = 828
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 97/322 (30%), Positives = 154/322 (47%), Gaps = 48/322 (14%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL+ F++ +YR + D+ FL +P +E +
Sbjct: 503 PNENPWMATNYTSYQDCNQWKDLSSDFVLQVYRAFESTGSVDMDFLLDCWPAIVETLGYT 562
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQ-- 133
+ FD G G+++N G PDQT AYCGGL IAAL+ +++ +L ++++ +
Sbjct: 563 KLFDTDGDGLIENSGAPDQTFDDWKLQGISAYCGGLWIAALSAAIKIGEILQESKRVEGT 622
Query: 134 ------QFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASG--- 184
Q+ Y LW G YY+ D+ SD +MADQLCG G
Sbjct: 623 IVILLAQYRRWWQHGRTAYQKQLWNGEYYRLDTGSG--SDVVMADQLCGQFCASTMGLPD 680
Query: 185 -VNTEAIFPSAN-IARALATIF----------STNVRGFEAG---------SMGAVNGMK 223
V++E + P+ I A F + G + G +G NG++
Sbjct: 681 VVDSEFVEPTLRAIYEACFVRFNQYTAQLGPQNQKFEGAQTGYFSASELGVKVGCANGVR 740
Query: 224 PNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPE 282
P+G ++ EVWTG+ + L+A + +EG + EA + R VYE GL F TPE
Sbjct: 741 PDGSPQNPDDTHQLEVWTGINFGLAAFLAHEGKLHEAMEITEAVVRQVYEH-GLQFRTPE 799
Query: 283 GLTGDKTYRSGGYMRALAVYAM 304
+T T+R+ Y+R +A++A+
Sbjct: 800 AITAVGTFRACHYLRPMAIWAV 821
>gi|78212458|ref|YP_381237.1| hypothetical protein Syncc9605_0920 [Synechococcus sp. CC9605]
gi|78196917|gb|ABB34682.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 832
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 94/297 (31%), Positives = 150/297 (50%), Gaps = 25/297 (8%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
P E P+ N D + WKDL F++ ++R + L +DIRFL +P +E +R
Sbjct: 525 PNEIPWDATNYTAYQDCNLWKDLGSDFVLQVWRTFKLAPSGEDIRFLADCWPAAVEALRY 584
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALAC------MVEMSSLLND 128
++FD G+ NGG PDQT AYCG L IAAL +++S+ L+
Sbjct: 585 LKTFDVNNDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAIAQTLQLSTGLDT 644
Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
+ +Q++FS L ++ + LLW G YY D+ + +MADQLCG + + +
Sbjct: 645 SAEQREFSGWLEQSRGNFDKLLWNGEYYDIDAESG--TPVVMADQLCGDFYARL--LALP 700
Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
+ AN L + F+ GS+G NG++ +G D +A EVWTG+ + ++
Sbjct: 701 PVVSDANSLSTLKAVKEACFEAFDGGSLGVANGLRRDGTPLDPNATHPLEVWTGINFGIA 760
Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ G+ A + + VY +GL F TPE +T TYR+ Y+RA+A++ +
Sbjct: 761 SYYRLMGDKQTAQSICSAVVEQVYS-SGLQFRTPEAITAVNTYRACHYLRAMAIWGL 816
>gi|428210868|ref|YP_007084012.1| putative bile acid beta-glucosidase [Oscillatoria acuminata PCC
6304]
gi|427999249|gb|AFY80092.1| putative bile acid beta-glucosidase [Oscillatoria acuminata PCC
6304]
Length = 820
Score = 144 bits (363), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 103/316 (32%), Positives = 151/316 (47%), Gaps = 44/316 (13%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F + +YRD+ L D FL +P+ ++ I
Sbjct: 499 PNEHPWVKTNYTSYQDCNLWKDLPCDFALLVYRDFLLTGGDDYEFLWDCWPSMVQAIAYL 558
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIA----------ALACMVEMSSL 125
++FD G GI +NGG PDQT AYCGGL IA + S +
Sbjct: 559 KTFDTDGDGIPENGGAPDQTFDDWPMRGVSAYCGGLWIAALQAAIAIGEVILNNPPHSGV 618
Query: 126 LNDTQQQQQFSAKLTKAS----------------QVYHDLLWTGSYYKFDSSQSRHSDSI 169
DT +Q+ S L A+ +Y LW G+YY+ DS S+ +
Sbjct: 619 FTDTIYEQKRSQLLEPATIQATLSTYQDWLEQSRPLYQHKLWNGTYYRLDSESG--SEVV 676
Query: 170 MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR- 228
MADQLCG + + G+ I P AL T++ + F G GA NG+ +G
Sbjct: 677 MADQLCGQFYARLLGLAD--IVPIDCAESALKTVYESCFLKFHNGEFGAANGVLIDGSPV 734
Query: 229 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDK 288
+ A EVWTG+ + L+A +L G +E + + + VYE GL F TPE +T
Sbjct: 735 NPDATHPLEVWTGINFGLAAFLLQMGMKEEGFKLTETVIQQVYE-NGLQFRTPEAITAVG 793
Query: 289 TYRSGGYMRALAVYAM 304
T+R+ Y+RA+A++A+
Sbjct: 794 TFRASHYLRAMAIWAV 809
>gi|87125468|ref|ZP_01081313.1| hypothetical protein RS9917_01806 [Synechococcus sp. RS9917]
gi|86166768|gb|EAQ68030.1| hypothetical protein RS9917_01806 [Synechococcus sp. RS9917]
Length = 832
Score = 143 bits (361), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 89/297 (29%), Positives = 140/297 (47%), Gaps = 25/297 (8%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
P E P+ N D + WKDL +++ ++R + L +DI FL +P +E +
Sbjct: 525 PNENPWDATNYTAYQDCNLWKDLASDYVLQVWRTFKLSPSGEDINFLAECWPAAVEALTY 584
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEM------SSLLND 128
+ FD G+ NGG PDQT AYCG L IAAL + M ++ L+
Sbjct: 585 LKGFDTNNDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAMGQQLQLATGLDT 644
Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
+Q FS L ++ + LLW G YY D+ + +MADQLCG + + +
Sbjct: 645 AAEQHTFSTWLEQSRANFDTLLWNGEYYDIDAESG--TPVVMADQLCGDFYARL--LKLP 700
Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
A+ AN L + + F+ GS+G NG++ +G D EVWTG+ + ++
Sbjct: 701 AVVSDANCRSTLKAVKESCFEAFQGGSLGVANGLRRDGTPLDPQGTHPLEVWTGINFGIA 760
Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ + A + VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 761 SYYRLMDDTSTALAITSAVVNQVYA-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 816
>gi|428307493|ref|YP_007144318.1| glucosylceramidase [Crinalium epipsammum PCC 9333]
gi|428249028|gb|AFZ14808.1| Glucosylceramidase [Crinalium epipsammum PCC 9333]
Length = 811
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 97/311 (31%), Positives = 154/311 (49%), Gaps = 39/311 (12%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E P+ N + D + WKDL F++ +YRDY L DI FL+ + ++ +
Sbjct: 501 PNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDYLLTGATDIEFLSECWSAIVKTLAYL 560
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQ- 134
+ FD G GI +N G PDQT AYCGGL +AAL + + +L + + +
Sbjct: 561 KEFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALEAAIAIGEILINHGVELED 620
Query: 135 ---------FSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV 185
+ + L ++ +Y + LW G YY+ DS S+ +MADQLCG + + +
Sbjct: 621 GDFESAIATYKSWLEQSKPLYQEKLWNGQYYQLDSESG--SNVVMADQLCGQFYARL--L 676
Query: 186 NTEAIFPSANIARALATIF-----------STNVRGFEAGSMGAVNGMKPNGDRDRS-AI 233
N I P+ + AL T++ S + +GA NG+ +G + A
Sbjct: 677 NLPDIVPNDCVESALKTVYDSCFLKFNQKLSQHSESLTPTFIGAANGVLADGSPENPQAT 736
Query: 234 QSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSG 293
EVWTG+ + L+A M+ G DEA+ + + +Y GL F TPE +T T+R+
Sbjct: 737 HPLEVWTGINFGLAAFMIQMGMKDEAFQLTEAVVQQIY-NNGLQFRTPEAITAAGTFRAS 795
Query: 294 GYMRALAVYAM 304
Y+RA+A++A+
Sbjct: 796 HYLRAMAIWAI 806
>gi|427702545|ref|YP_007045767.1| bile acid beta-glucosidase [Cyanobium gracile PCC 6307]
gi|427345713|gb|AFY28426.1| putative bile acid beta-glucosidase [Cyanobium gracile PCC 6307]
Length = 835
Score = 140 bits (354), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 98/300 (32%), Positives = 146/300 (48%), Gaps = 31/300 (10%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHK---DIRFLTRVYPTCLELIRK 84
P E PF N D + WKDL +++ ++R Y+L DIRFL +P + +
Sbjct: 527 PNERPFDATNYTAYQDCNLWKDLASDYVLQVWRTYSLAPSGPDIRFLADCWPAAVTALDY 586
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM------VEMSSLLND 128
+ FD G+ NGG PDQT AYCG L IAAL +++ L+
Sbjct: 587 LKQFDVNDDGLPDNGGAPDQTFDDWPLQGVSAYCGALWIAALEAALAIGQRLQLELGLDT 646
Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
++Q++F L ++ + LLW G YY D+ + +MADQLCG + +
Sbjct: 647 AEEQRRFGGWLEQSRSHFDTLLWNGEYYAIDAGSG--TPVVMADQLCGDFYAR-----LL 699
Query: 189 AIFPSANIARALAT---IFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTY 244
A+ P RAL++ I FE G +G NG++ +G D EVWTG+ +
Sbjct: 700 ALPPVVADERALSSLKAIRQACFESFEGGRLGVANGLRRDGTPLDPEGTHPLEVWTGINF 759
Query: 245 LLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
L+A G A+ G + R VYE GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 760 GLAAYYRLMGQSSTAFAITGAVVRQVYE-GGLQFRTPEAITAAGTFRACHYLRAMAIWAL 818
>gi|318042148|ref|ZP_07974104.1| hypothetical protein SCB01_10580 [Synechococcus sp. CB0101]
Length = 836
Score = 140 bits (353), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 92/297 (30%), Positives = 144/297 (48%), Gaps = 25/297 (8%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
P E P+ N D + WKDL +++ ++R + L +DI FL +P +E +
Sbjct: 529 PNERPWDATNYTAYQDCNLWKDLASDYVLQVWRTFKLAPNGEDINFLAECWPAAVEALHY 588
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALAC------MVEMSSLLND 128
+ FD G+ NGG PDQT AYCG L IAAL +++S+ L+
Sbjct: 589 LKGFDANNDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAIAQQLQLSTGLDT 648
Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
+Q FS+ L ++ + LLW G YY D+ + +MADQLCG + + G+
Sbjct: 649 ASEQHTFSSWLEQSRANFDHLLWNGEYYDIDAESG--TPVVMADQLCGDFYARLLGL--P 704
Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
A+ AN L + FE G +G NG++ +G D + EVWTG+ + ++
Sbjct: 705 AVVSEANSRSTLKAVKEACFEQFEGGKLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIA 764
Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ G+ D A + VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 765 SYYRLMGDTDTALAITAAVVNQVYA-GGLQFRTPEAITAVNTFRACHYLRAMAIWGV 820
>gi|260434763|ref|ZP_05788733.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
gi|260412637|gb|EEX05933.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
Length = 832
Score = 140 bits (352), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 92/297 (30%), Positives = 147/297 (49%), Gaps = 25/297 (8%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
P E P+ N D + WKDL F++ ++R + L +DIRFL +P +E +R
Sbjct: 525 PNEIPWDATNYTAYQDCNLWKDLGSDFVLQVWRSFKLAPSGEDIRFLADCWPAAVEALRF 584
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALAC------MVEMSSLLND 128
++FD G+ NGG PDQT AYCG L IAAL +++S+ L+
Sbjct: 585 LKTFDVNNDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAIAQTLQLSTGLDT 644
Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
+ +Q++FS L ++ + LLW G YY D+ + +MADQLCG + + G+
Sbjct: 645 SAEQREFSGWLEQSRGNFDKLLWNGEYYDIDAESG--TPVVMADQLCGDFYARLLGL--P 700
Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
+ +N L + F+ GS+G NG++ +G D + EVWTG+ + ++
Sbjct: 701 PVVSDSNSRSTLKVVKEACFEAFDGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIA 760
Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ G+ A + VY GL F TPE +T T+R+ Y+RA+A++
Sbjct: 761 SYYRLMGDKQTAQAICSAVVEQVYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWGF 816
>gi|124023625|ref|YP_001017932.1| bile acid beta-glucosidase [Prochlorococcus marinus str. MIT 9303]
gi|123963911|gb|ABM78667.1| Predicted bile acid beta-glucosidase [Prochlorococcus marinus str.
MIT 9303]
Length = 837
Score = 140 bits (352), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 99/312 (31%), Positives = 149/312 (47%), Gaps = 29/312 (9%)
Query: 13 RSAANTHERYYTRWFPEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIR 69
R+AA H+ P E PF N D + WKDL +++ ++R + L +D+
Sbjct: 515 RAAATPHDLGA----PNESPFDATNYTAYQDCNLWKDLASDYVLQVWRTFLLSPNGEDLS 570
Query: 70 FLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM 119
FL +P ++ + + FD G+ NGG PDQT AYCG L IAAL
Sbjct: 571 FLAECWPAAVQALSYLKRFDVNHDGLPDNGGAPDQTFDDWPLQGVSAYCGALWIAALEAA 630
Query: 120 ------VEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQ 173
+++ LN ++Q QFS L ++ + LLW G YYK D+ + +MADQ
Sbjct: 631 LAMAQRLQLDLGLNTAEEQHQFSGWLEQSRANFDRLLWNGEYYKIDAESG--TPVVMADQ 688
Query: 174 LCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSA 232
LCG + + G+ + + +L + GFE G +G NG+ +G D
Sbjct: 689 LCGDFYARLLGLPS--VVADERSRSSLNAVKEACFEGFEGGRLGVANGLCRDGMPLDPKG 746
Query: 233 IQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRS 292
EVWTG+ + L+A G+ A + VY GL F TPE +T KTYR+
Sbjct: 747 THPLEVWTGINFGLAAYYRLMGDATTATAICSAVVNQVYG-GGLQFRTPEAITAVKTYRA 805
Query: 293 GGYMRALAVYAM 304
Y+RA+A++A+
Sbjct: 806 CHYLRAMAIWAL 817
>gi|33862646|ref|NP_894206.1| hypothetical protein PMT0373 [Prochlorococcus marinus str. MIT
9313]
gi|33634562|emb|CAE20548.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313]
Length = 837
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/312 (31%), Positives = 149/312 (47%), Gaps = 29/312 (9%)
Query: 13 RSAANTHERYYTRWFPEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIR 69
R+AA H+ P E PF N D + WKDL +++ ++R + L +D+
Sbjct: 515 RAAATPHDLGA----PNESPFDATNYTAYQDCNLWKDLASDYVLQVWRTFLLSPNGEDLS 570
Query: 70 FLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM 119
FL +P ++ + + FD G+ NGG PDQT AYCG L IAAL
Sbjct: 571 FLAECWPAVVQALSYLKRFDVNHDGLPDNGGAPDQTFDDWPLQGVSAYCGALWIAALEAA 630
Query: 120 ------VEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQ 173
+++ LN ++Q QFS L ++ + LLW G YYK D+ + +MADQ
Sbjct: 631 LAMAQRLQLDLGLNTAEEQHQFSGWLEQSRVNFDRLLWNGEYYKIDAESG--TPVVMADQ 688
Query: 174 LCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG-DRDRSA 232
LCG + + G+ + + +L + GFE G +G NG+ +G D
Sbjct: 689 LCGDFYARLLGLPS--VVADERSRSSLNAVKEACFEGFEGGRLGVANGLCRDGMPLDPKG 746
Query: 233 IQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRS 292
EVWTG+ + L+A G+ A + VY GL F TPE +T KTYR+
Sbjct: 747 THPLEVWTGINFGLAAYYRLMGDATTATAICSAVVNQVYG-GGLQFRTPEAITAVKTYRA 805
Query: 293 GGYMRALAVYAM 304
Y+RA+A++A+
Sbjct: 806 CHYLRAMAIWAL 817
>gi|443318654|ref|ZP_21047901.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 6406]
gi|442781756|gb|ELR91849.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 6406]
Length = 857
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 98/325 (30%), Positives = 147/325 (45%), Gaps = 53/325 (16%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E PF N + D + WKDL F++ +YR Y + D FL +P E ++
Sbjct: 523 PNEHPFVKTNYTSYQDCNQWKDLPSDFVIQVYRAYKMTGATDGDFLADCWPAVTETLKYL 582
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLL--------N 127
+ FD GI +NGG PDQT AYCGGL +AAL + ++ L N
Sbjct: 583 KRFDTDSDGIPENGGAPDQTFDDWQLKGLSAYCGGLWMAALESAIAIADTLTAAGRAPGN 642
Query: 128 DTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNT 187
Q+ L ++ +VYH LW G YY+ ++ S +MADQLCG + +
Sbjct: 643 TPILVTQYRRWLEQSRKVYHKRLWNGRYYRLETGSD--SSVVMADQLCGQFCARL--MEL 698
Query: 188 EAIFPSANIARALATIF-----------STNVR-------GFEAGS---------MGAVN 220
+ + ALA I+ +T R G + G+ +GA N
Sbjct: 699 PDLVEEKYVQSALAAIYDACFLKFNNYAATQARPQEQKFIGSQTGTFRSAQPGIPIGAAN 758
Query: 221 GMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFE 279
G+ P+G D EVWTG+ + L+ G VD+A+ + + +Y GL F
Sbjct: 759 GVLPDGSPEDPDGTHQLEVWTGINFGLATFFAQMGQVDKAFAITEAVVQQIYTH-GLQFR 817
Query: 280 TPEGLTGDKTYRSGGYMRALAVYAM 304
TPE +T T+R+ Y+R LA++ +
Sbjct: 818 TPEAITALGTFRACHYLRPLAIWGL 842
>gi|116072378|ref|ZP_01469645.1| hypothetical protein BL107_11341 [Synechococcus sp. BL107]
gi|116064900|gb|EAU70659.1| hypothetical protein BL107_11341 [Synechococcus sp. BL107]
Length = 820
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 92/297 (30%), Positives = 147/297 (49%), Gaps = 25/297 (8%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
P E PF N D + WKDL F++ ++R + L +D+ FL +P+ ++ +
Sbjct: 513 PNEVPFDATNYTAYQDCNLWKDLASDFVLQVWRTFQLAPTGEDLNFLAECWPSAVQALDY 572
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM------VEMSSLLND 128
+ FD G+ NGG PDQT AYCG L IAAL +++ L+
Sbjct: 573 LKGFDVNDDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAMAQRLQLDLGLDT 632
Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
T +Q++F L ++ + LLW G YY+ D+ + +MADQLCG + + G+
Sbjct: 633 TVEQRRFGQWLEQSRSNFDRLLWNGEYYEIDAESG--TPVVMADQLCGDFYARLLGL--P 688
Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
++ AN L + FEAG++G NG++ +G D + EVWTG+ + ++
Sbjct: 689 SVVSDANALSTLNVVREACFERFEAGTLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIA 748
Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ G D A G + VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 749 SYFRLMGQSDTALAITGAVVNQVYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 804
>gi|116073260|ref|ZP_01470522.1| hypothetical protein RS9916_32457 [Synechococcus sp. RS9916]
gi|116068565|gb|EAU74317.1| hypothetical protein RS9916_32457 [Synechococcus sp. RS9916]
Length = 832
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 93/299 (31%), Positives = 153/299 (51%), Gaps = 29/299 (9%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
P E P+ N D + WKDL +++ ++R + L +D+RFL +P+ +E +R
Sbjct: 525 PNELPWDATNYTAYQDCNLWKDLGSDYVLQVWRTFKLSPSGEDVRFLADCWPSAVEALRY 584
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALAC------MVEMSSLLND 128
++FD G+ NGG PDQT AYCG L IAAL +++ + L+
Sbjct: 585 LKTFDVNDDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAIAQTLQLKTGLDT 644
Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
+ +Q++FS L ++ + LLW G YY D+ + +MADQLCG + + G+ E
Sbjct: 645 SAEQREFSGWLEQSRGNFDRLLWNGEYYDIDAESG--TPVVMADQLCGDFYARLLGL--E 700
Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
+ AN L + F+ G +G NG++ +G D + EVWTG+ + ++
Sbjct: 701 PVVSEANSRSTLKAVREACFEKFDGGKLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIA 760
Query: 248 A--AMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ ++ EG EA +A + VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 761 SYYRLMGEGQTAEAICSA--VVNQVYA-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 816
>gi|188570649|gb|ACD64360.1| hypothetical protein [Helianthus petiolaris]
Length = 210
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 92/208 (44%), Positives = 121/208 (58%), Gaps = 4/208 (1%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
+AYCG L +AAL M+ LND Q++ K KA + LW GSY+ +DS S
Sbjct: 4 SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 63
Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
+S SI ADQL G + ++G+ +F I +L IF NV G MGAVNGM P
Sbjct: 64 NSKSIQADQLAGQWYTASAGLPN--LFDDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121
Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
NG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ + G G F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181
Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
G T D +YRS YMR LA++ MQ A K
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEK 209
>gi|188570647|gb|ACD64359.1| hypothetical protein [Helianthus petiolaris]
Length = 210
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 92/208 (44%), Positives = 121/208 (58%), Gaps = 4/208 (1%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
+AYCG L +AAL M+ LND Q++ K KA + LW GSY+ +DS S
Sbjct: 4 SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 63
Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
+S SI ADQL G + ++G+ +F I +L IF NV G MGAVNGM P
Sbjct: 64 NSKSIQADQLAGQWYTASAGLPN--LFGDDKIPSSLQKIFDFNVMKVRGGRMGAVNGMHP 121
Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
NG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ + G G F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181
Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
G T D +YRS YMR LA++ MQ A K
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEK 209
>gi|188570531|gb|ACD64301.1| hypothetical protein [Helianthus annuus]
Length = 210
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 92/208 (44%), Positives = 122/208 (58%), Gaps = 4/208 (1%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
+AYCG L +AAL M+ LND Q++ K KA + LW GSY+ +DS S
Sbjct: 4 SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 63
Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
+S SI ADQL G + ++G+ +F I +L IF+ NV G MGAVNGM P
Sbjct: 64 NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFNFNVMKVRGGRMGAVNGMHP 121
Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
NG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ + G G F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181
Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
G T D +YRS YMR LA++ MQ A K
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEK 209
>gi|188570631|gb|ACD64351.1| hypothetical protein [Helianthus petiolaris]
Length = 210
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 92/208 (44%), Positives = 121/208 (58%), Gaps = 4/208 (1%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
+AYCG L +AAL M+ LND Q++ K KA + LW GSY+ +DS S
Sbjct: 4 SAYCGCLWLAALQAAAAMAVELNDKLAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 63
Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
+S SI ADQL G + ++G+ +F I +L IF NV G MGAVNGM P
Sbjct: 64 NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121
Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
NG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ + G G F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181
Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
G T D +YRS YMR LA++ MQ A K
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEK 209
>gi|87301122|ref|ZP_01083963.1| hypothetical protein WH5701_14591 [Synechococcus sp. WH 5701]
gi|87284090|gb|EAQ76043.1| hypothetical protein WH5701_14591 [Synechococcus sp. WH 5701]
Length = 830
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 94/297 (31%), Positives = 143/297 (48%), Gaps = 25/297 (8%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
P E PF N D + WKDL F++ ++R ++L +D+RFL +P +E +R
Sbjct: 522 PNERPFDATNYTAYQDCNLWKDLASDFVLQVWRTFSLAPTGEDLRFLADCWPAVVEALRY 581
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM------VEMSSLLND 128
+ FD G+ NGG PDQT AYCG L IAAL +++ L+
Sbjct: 582 LKQFDVNDDGLPDNGGAPDQTFDDWPLQGVSAYCGALWIAALEAALAMAQRLQLDLGLDT 641
Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
+Q++F + L + + LLW G YY+ D+ + +MADQLCG + + G+
Sbjct: 642 GAEQREFGSWLEPSRANFDRLLWNGEYYRIDAGSG--TPVVMADQLCGDFYARLLGLPQ- 698
Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
+ +L I FE G +G NG++ +G D EVWTG+ + L+
Sbjct: 699 -VVSDERALSSLKAIREACFERFEGGRLGVANGLRRDGTPLDPKGTHPLEVWTGINFGLA 757
Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
A G D A + VY GL F TPE +T T+R+ Y+RA+A++A+
Sbjct: 758 AYYRLMGQTDTALAITSAVVGQVYA-GGLQFRTPEAITAVGTFRACHYLRAMAIWAL 813
>gi|292384326|gb|ADE21416.1| unknown [Helianthus neglectus]
Length = 210
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 92/208 (44%), Positives = 121/208 (58%), Gaps = 4/208 (1%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
+AYCG L +AAL M+ LND Q++ K KA + LW GSY+ +DS S
Sbjct: 4 SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 63
Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
+S SI ADQL G + ++G+ +F I +L IF NV G MGAVNGM P
Sbjct: 64 NSKSIQADQLAGQWYTASAGLPN--LFGEDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121
Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
NG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ + G G F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181
Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
G T D +YRS YMR LA++ MQ A K
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEK 209
>gi|292384308|gb|ADE21407.1| unknown [Helianthus neglectus]
Length = 205
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 91/205 (44%), Positives = 120/205 (58%), Gaps = 4/205 (1%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
+AYCG L +AAL M+ LND Q++ K KA + LW GSY+ +DS S
Sbjct: 1 SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFMKAKSSFESKLWNGSYFNYDSGSSS 60
Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
+S SI ADQL G + ++G+ +F I +L IF NV G MGAVNGM P
Sbjct: 61 NSKSIQADQLAGQWYTASAGLPN--LFDDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 118
Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
NG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ + G G F+TPE
Sbjct: 119 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 178
Query: 283 GLTGDKTYRSGGYMRALAVYAMQDA 307
G T D +YRS YMR LA++ MQ A
Sbjct: 179 GWTMDGSYRSLVYMRPLAIWGMQQA 203
>gi|292384322|gb|ADE21414.1| unknown [Helianthus neglectus]
Length = 207
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 92/208 (44%), Positives = 121/208 (58%), Gaps = 4/208 (1%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
+AYCG L +AAL M+ LND Q++ K KA + LW GSY+ +DS S
Sbjct: 1 SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 60
Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
+S SI ADQL G + ++G+ +F I +L IF NV G MGAVNGM P
Sbjct: 61 NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 118
Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
NG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ + G G F+TPE
Sbjct: 119 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 178
Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
G T D +YRS YMR LA++ MQ A K
Sbjct: 179 GWTMDGSYRSLVYMRPLAIWGMQQALEK 206
>gi|188570553|gb|ACD64312.1| hypothetical protein [Helianthus annuus]
Length = 210
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 92/208 (44%), Positives = 121/208 (58%), Gaps = 4/208 (1%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
+AYCG L +AAL M+ LND Q++ K KA + LW GSY+ +DS S
Sbjct: 4 SAYCGCLWLAALQAATAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 63
Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
+S SI ADQL G + ++G+ +F I +L IF NV G MGAVNGM P
Sbjct: 64 NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121
Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
NG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ + G G F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181
Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
G T D +YRS YMR LA++ MQ A K
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEK 209
>gi|188570503|gb|ACD64287.1| hypothetical protein [Helianthus annuus]
gi|188570505|gb|ACD64288.1| hypothetical protein [Helianthus annuus]
gi|188570507|gb|ACD64289.1| hypothetical protein [Helianthus annuus]
gi|188570509|gb|ACD64290.1| hypothetical protein [Helianthus annuus]
gi|188570513|gb|ACD64292.1| hypothetical protein [Helianthus annuus]
gi|188570515|gb|ACD64293.1| hypothetical protein [Helianthus annuus]
gi|188570517|gb|ACD64294.1| hypothetical protein [Helianthus annuus]
gi|188570519|gb|ACD64295.1| hypothetical protein [Helianthus annuus]
gi|188570521|gb|ACD64296.1| hypothetical protein [Helianthus annuus]
gi|188570523|gb|ACD64297.1| hypothetical protein [Helianthus annuus]
gi|188570525|gb|ACD64298.1| hypothetical protein [Helianthus annuus]
gi|188570527|gb|ACD64299.1| hypothetical protein [Helianthus annuus]
gi|188570529|gb|ACD64300.1| hypothetical protein [Helianthus annuus]
gi|188570533|gb|ACD64302.1| hypothetical protein [Helianthus annuus]
gi|188570535|gb|ACD64303.1| hypothetical protein [Helianthus annuus]
gi|188570537|gb|ACD64304.1| hypothetical protein [Helianthus annuus]
gi|188570539|gb|ACD64305.1| hypothetical protein [Helianthus annuus]
gi|188570541|gb|ACD64306.1| hypothetical protein [Helianthus annuus]
gi|188570543|gb|ACD64307.1| hypothetical protein [Helianthus annuus]
gi|188570545|gb|ACD64308.1| hypothetical protein [Helianthus annuus]
gi|188570547|gb|ACD64309.1| hypothetical protein [Helianthus annuus]
gi|188570549|gb|ACD64310.1| hypothetical protein [Helianthus annuus]
gi|188570551|gb|ACD64311.1| hypothetical protein [Helianthus annuus]
gi|188570555|gb|ACD64313.1| hypothetical protein [Helianthus annuus]
gi|188570557|gb|ACD64314.1| hypothetical protein [Helianthus annuus]
gi|188570559|gb|ACD64315.1| hypothetical protein [Helianthus annuus]
gi|188570561|gb|ACD64316.1| hypothetical protein [Helianthus annuus]
gi|188570563|gb|ACD64317.1| hypothetical protein [Helianthus annuus]
gi|188570565|gb|ACD64318.1| hypothetical protein [Helianthus annuus]
gi|188570567|gb|ACD64319.1| hypothetical protein [Helianthus annuus]
gi|188570569|gb|ACD64320.1| hypothetical protein [Helianthus annuus]
gi|188570571|gb|ACD64321.1| hypothetical protein [Helianthus annuus]
gi|188570573|gb|ACD64322.1| hypothetical protein [Helianthus annuus]
gi|188570575|gb|ACD64323.1| hypothetical protein [Helianthus annuus]
gi|188570577|gb|ACD64324.1| hypothetical protein [Helianthus annuus]
gi|188570579|gb|ACD64325.1| hypothetical protein [Helianthus annuus]
gi|188570581|gb|ACD64326.1| hypothetical protein [Helianthus annuus]
gi|188570583|gb|ACD64327.1| hypothetical protein [Helianthus annuus]
gi|188570585|gb|ACD64328.1| hypothetical protein [Helianthus annuus]
gi|188570587|gb|ACD64329.1| hypothetical protein [Helianthus annuus]
gi|188570589|gb|ACD64330.1| hypothetical protein [Helianthus annuus]
gi|188570591|gb|ACD64331.1| hypothetical protein [Helianthus annuus]
gi|188570593|gb|ACD64332.1| hypothetical protein [Helianthus annuus]
gi|188570595|gb|ACD64333.1| hypothetical protein [Helianthus annuus]
gi|188570597|gb|ACD64334.1| hypothetical protein [Helianthus annuus]
gi|188570599|gb|ACD64335.1| hypothetical protein [Helianthus annuus]
gi|188570601|gb|ACD64336.1| hypothetical protein [Helianthus annuus]
gi|188570603|gb|ACD64337.1| hypothetical protein [Helianthus petiolaris]
gi|188570605|gb|ACD64338.1| hypothetical protein [Helianthus petiolaris]
gi|188570607|gb|ACD64339.1| hypothetical protein [Helianthus petiolaris]
gi|188570609|gb|ACD64340.1| hypothetical protein [Helianthus petiolaris]
gi|188570611|gb|ACD64341.1| hypothetical protein [Helianthus petiolaris]
gi|188570613|gb|ACD64342.1| hypothetical protein [Helianthus petiolaris]
gi|188570615|gb|ACD64343.1| hypothetical protein [Helianthus petiolaris]
gi|188570617|gb|ACD64344.1| hypothetical protein [Helianthus petiolaris]
gi|188570619|gb|ACD64345.1| hypothetical protein [Helianthus petiolaris]
gi|188570625|gb|ACD64348.1| hypothetical protein [Helianthus petiolaris]
gi|188570627|gb|ACD64349.1| hypothetical protein [Helianthus petiolaris]
gi|188570629|gb|ACD64350.1| hypothetical protein [Helianthus petiolaris]
gi|188570633|gb|ACD64352.1| hypothetical protein [Helianthus petiolaris]
gi|188570637|gb|ACD64354.1| hypothetical protein [Helianthus petiolaris]
gi|188570641|gb|ACD64356.1| hypothetical protein [Helianthus petiolaris]
gi|188570643|gb|ACD64357.1| hypothetical protein [Helianthus petiolaris]
gi|188570645|gb|ACD64358.1| hypothetical protein [Helianthus petiolaris]
gi|188570651|gb|ACD64361.1| hypothetical protein [Helianthus petiolaris]
gi|188570653|gb|ACD64362.1| hypothetical protein [Helianthus petiolaris]
gi|188570655|gb|ACD64363.1| hypothetical protein [Helianthus petiolaris]
gi|188570657|gb|ACD64364.1| hypothetical protein [Helianthus petiolaris]
gi|188570659|gb|ACD64365.1| hypothetical protein [Helianthus petiolaris]
gi|188570661|gb|ACD64366.1| hypothetical protein [Helianthus petiolaris]
gi|188570663|gb|ACD64367.1| hypothetical protein [Helianthus petiolaris]
gi|188570665|gb|ACD64368.1| hypothetical protein [Helianthus petiolaris]
gi|292384298|gb|ADE21402.1| unknown [Helianthus neglectus]
gi|292384300|gb|ADE21403.1| unknown [Helianthus neglectus]
gi|292384302|gb|ADE21404.1| unknown [Helianthus neglectus]
gi|292384304|gb|ADE21405.1| unknown [Helianthus neglectus]
gi|292384312|gb|ADE21409.1| unknown [Helianthus neglectus]
gi|292384318|gb|ADE21412.1| unknown [Helianthus neglectus]
gi|292384320|gb|ADE21413.1| unknown [Helianthus neglectus]
gi|292384328|gb|ADE21417.1| unknown [Helianthus neglectus]
Length = 210
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 92/208 (44%), Positives = 121/208 (58%), Gaps = 4/208 (1%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
+AYCG L +AAL M+ LND Q++ K KA + LW GSY+ +DS S
Sbjct: 4 SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 63
Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
+S SI ADQL G + ++G+ +F I +L IF NV G MGAVNGM P
Sbjct: 64 NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121
Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
NG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ + G G F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181
Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
G T D +YRS YMR LA++ MQ A K
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEK 209
>gi|188570671|gb|ACD64371.1| hypothetical protein [Bahiopsis reticulata]
gi|188570673|gb|ACD64372.1| hypothetical protein [Bahiopsis reticulata]
Length = 210
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 91/205 (44%), Positives = 120/205 (58%), Gaps = 4/205 (1%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
+AYCG L +AAL M+ LND Q++ K KA + LW GSY+ +DS S
Sbjct: 4 SAYCGCLWLAALQAAAAMAVELNDKFSAQKYKKKFMKAKTSFESKLWNGSYFNYDSGSSS 63
Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
+S SI ADQL G + ++G+ +F I +L IF NV G MGAVNGM P
Sbjct: 64 NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121
Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
NG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ + G G F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDSAFTTAEGIFTAGWSEEGFGYAFQTPE 181
Query: 283 GLTGDKTYRSGGYMRALAVYAMQDA 307
G T D +YRS YMR LA++ MQ A
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQA 206
>gi|292384306|gb|ADE21406.1| unknown [Helianthus neglectus]
Length = 205
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 91/205 (44%), Positives = 120/205 (58%), Gaps = 4/205 (1%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
+AYCG L +AAL M+ LND Q++ K KA + LW GSY+ +DS S
Sbjct: 1 SAYCGCLWLAALQAAAAMAVELNDKFTAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 60
Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
+S SI ADQL G + ++G+ +F I +L IF NV G MGAVNGM P
Sbjct: 61 NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 118
Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
NG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ + G G F+TPE
Sbjct: 119 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 178
Query: 283 GLTGDKTYRSGGYMRALAVYAMQDA 307
G T D +YRS YMR LA++ MQ A
Sbjct: 179 GWTMDGSYRSLVYMRPLAIWGMQQA 203
>gi|292384314|gb|ADE21410.1| unknown [Helianthus neglectus]
Length = 205
Score = 137 bits (345), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 91/205 (44%), Positives = 120/205 (58%), Gaps = 4/205 (1%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
+AYCG L +AAL M+ LND Q++ K KA + LW GSY+ +DS S
Sbjct: 1 SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 60
Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
+S SI ADQL G + ++G+ +F I +L IF NV G MGAVNGM P
Sbjct: 61 NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 118
Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
NG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ + G G F+TPE
Sbjct: 119 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 178
Query: 283 GLTGDKTYRSGGYMRALAVYAMQDA 307
G T D +YRS YMR LA++ MQ A
Sbjct: 179 GWTMDGSYRSLVYMRPLAIWGMQQA 203
>gi|282897193|ref|ZP_06305195.1| Protein of unknown function DUF608 [Raphidiopsis brookii D9]
gi|281197845|gb|EFA72739.1| Protein of unknown function DUF608 [Raphidiopsis brookii D9]
Length = 798
Score = 137 bits (344), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 97/297 (32%), Positives = 144/297 (48%), Gaps = 25/297 (8%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH--KDIRFLTRVYPTCLELIRKC 85
P E + N + D + WKDL F++ +YRD+ DI FL + + +
Sbjct: 502 PNEWVWEQTNYTSYQDCNLWKDLGCDFVLQVYRDFLFTGGNDIEFLVDCWYGIVLTLDYL 561
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA-----CMVEMS--SLLND 128
+ FD GI +N G PDQT AYCG L +AAL C V M+ L N
Sbjct: 562 KQFDIDADGIPENSGAPDQTFDDWRLSGVSAYCGALWLAALEAAIAICDVLMNHPELSNV 621
Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
+Q+ + L ++ VY LW G +Y+ DS S+ +MADQL G + + +N
Sbjct: 622 GEQRSIYENWLNQSLPVYQQKLWNGKFYRLDSESG--SNVVMADQLSGQFYARL--LNLP 677
Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYLLS 247
I P AL TI+ F+ G GA NG+ P+G + A EVWTG+ + L+
Sbjct: 678 DIVPQDRALSALTTIYDACFLKFQDGKFGAANGVLPDGSPENPQATHPLEVWTGINFGLA 737
Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
A +L G D+ + R +Y+ GL F TPE +T T+R+ Y+R +A++A+
Sbjct: 738 AFLLQMGMKDQGLRLTQAVVRQIYDH-GLQFRTPEAITAAGTFRASTYLRPMAIWAI 793
>gi|188570667|gb|ACD64369.1| hypothetical protein [Bahiopsis lanata]
gi|188570669|gb|ACD64370.1| hypothetical protein [Bahiopsis lanata]
Length = 208
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 91/205 (44%), Positives = 120/205 (58%), Gaps = 4/205 (1%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
+AYCG L +AAL M+ LND Q++ K KA + LW GSY+ +DS S
Sbjct: 2 SAYCGCLWLAALQAAAAMAVALNDKFSAQKYKKKFMKAKTSFESKLWNGSYFNYDSGSSS 61
Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
+S SI ADQL G + ++G+ +F I +L IF NV G MGAVNGM P
Sbjct: 62 NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLHKIFDFNVMKVRGGRMGAVNGMHP 119
Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
NG D + +QS EVWTGVTY ++A M++ G D+A+TTA G++ + G G F+TPE
Sbjct: 120 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDKAFTTAEGIFTAGWSEEGFGYAFQTPE 179
Query: 283 GLTGDKTYRSGGYMRALAVYAMQDA 307
G T D YRS YMR LA++ MQ A
Sbjct: 180 GWTMDGCYRSLVYMRPLAIWGMQQA 204
>gi|78185056|ref|YP_377491.1| hypothetical protein Syncc9902_1489 [Synechococcus sp. CC9902]
gi|78169350|gb|ABB26447.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 823
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 91/297 (30%), Positives = 145/297 (48%), Gaps = 25/297 (8%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
P E PF N D + WKDL F++ ++R + L +D+ FL +P+ + +
Sbjct: 516 PNELPFDATNYTAYQDCNLWKDLASDFVLQVWRTFQLAPTGEDLNFLADCWPSAVRALDY 575
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM------VEMSSLLND 128
+ FD G+ NGG PDQT AYCG L IAAL +++ L+
Sbjct: 576 LKGFDVNDDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAMAQRLQLDLGLDT 635
Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
T +Q++F L ++ + LLW G YY+ D+ + +MADQLCG + + G+
Sbjct: 636 TVEQRRFGQWLEQSRSNFDRLLWNGEYYEIDAESG--TPVVMADQLCGDFYARLLGL--P 691
Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
++ AN L + FE G++G NG++ +G D + EVWTG+ + ++
Sbjct: 692 SVVSDANALSTLNVVRDACFERFEGGTLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIA 751
Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ G D A G + VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 752 SYFRLMGQSDTALAITGAVVNQVYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 807
>gi|292384310|gb|ADE21408.1| unknown [Helianthus neglectus]
Length = 210
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 92/208 (44%), Positives = 121/208 (58%), Gaps = 4/208 (1%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
+AYCG L +AAL M+ LND Q++ K KA + LW GSY+ +DS S
Sbjct: 4 SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFFKAKSSFESKLWNGSYFNYDSGSSS 63
Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
+S SI ADQL G + ++G+ +F I +L IF NV G MGAVNGM P
Sbjct: 64 NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121
Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
NG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ + G G F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181
Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
G T D +YRS YMR LA++ MQ A K
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEK 209
>gi|188570511|gb|ACD64291.1| hypothetical protein [Helianthus annuus]
Length = 210
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 91/208 (43%), Positives = 121/208 (58%), Gaps = 4/208 (1%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
+AYCG L +AAL M+ LND Q++ K KA + LW GSY+ +DS S
Sbjct: 4 SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 63
Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
+S SI ADQL G + ++G+ +F I +L IF NV G MGAVNGM P
Sbjct: 64 NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121
Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
NG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ + G G F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181
Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
G T D +YRS YMR LA++ +Q A K
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGIQQALEK 209
>gi|282900195|ref|ZP_06308151.1| protein of unknown function DUF608 [Cylindrospermopsis raciborskii
CS-505]
gi|281194944|gb|EFA69885.1| protein of unknown function DUF608 [Cylindrospermopsis raciborskii
CS-505]
Length = 789
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 95/297 (31%), Positives = 142/297 (47%), Gaps = 25/297 (8%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYAL--HKDIRFLTRVYPTCLELIRKC 85
P E + N + D + WKDL F++ +YRD+ DI FL + + +
Sbjct: 493 PNEWVWEQTNYTSYQDCNLWKDLGCDFVLQVYRDFLFTGANDIEFLVDCWHGMVLTLDYL 552
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALA-------CMVEMSSLLND 128
+ FD G GI +N G PDQT AYCG L +AAL ++ L N
Sbjct: 553 KQFDMDGDGIPENSGAPDQTFDDWRLSGVSAYCGALWLAALEAAIAICDVLINRPELPNV 612
Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
+Q+ + L ++ VY LW G +Y+ DS S+ +MADQL G + + +N
Sbjct: 613 GEQKSIYEHWLNQSLPVYQQKLWNGKFYRLDSESG--SNVVMADQLSGQFYARL--LNLP 668
Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRS-AIQSEEVWTGVTYLLS 247
I P AL TI+ F+ G GA NG+ +G + A EVWTG+ + L+
Sbjct: 669 DIVPKDRALSALTTIYDACFLKFQDGKFGAANGVLLDGSPENPQATHPLEVWTGINFGLA 728
Query: 248 AAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
A +L G ++ + R VYE GL F TPE +T T+R+ Y+R +A++A+
Sbjct: 729 AFLLQMGMKEQGLRLTEAVVRQVYEH-GLQFRTPEAITAAGTFRASTYLRPMAIWAV 784
>gi|292384316|gb|ADE21411.1| unknown [Helianthus neglectus]
Length = 205
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 91/205 (44%), Positives = 120/205 (58%), Gaps = 4/205 (1%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
+AYCG L +AAL M+ LND Q++ K KA + LW GSY+ +DS S
Sbjct: 1 SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFFKAKSSFESKLWNGSYFNYDSGSSS 60
Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
+S SI ADQL G + ++G+ +F I +L IF NV G MGAVNGM P
Sbjct: 61 NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 118
Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
NG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ + G G F+TPE
Sbjct: 119 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 178
Query: 283 GLTGDKTYRSGGYMRALAVYAMQDA 307
G T D +YRS YMR LA++ MQ A
Sbjct: 179 GWTMDGSYRSLVYMRPLAIWGMQQA 203
>gi|33866127|ref|NP_897686.1| hypothetical protein SYNW1593 [Synechococcus sp. WH 8102]
gi|33639102|emb|CAE08108.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 833
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 93/300 (31%), Positives = 149/300 (49%), Gaps = 31/300 (10%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
P E PF N D + WKDL +++ ++R + L +D+RFL +P+ ++ +
Sbjct: 526 PNERPFDATNYTAYQDCNLWKDLASDYMLQVWRTFRLAPTGEDLRFLAECWPSAVQALDY 585
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM------VEMSSLLND 128
+ FD G+ NGG PDQT AYCG L IAAL +++ L+
Sbjct: 586 LKGFDVNNDGLPDNGGAPDQTFDDWPLQGVSAYCGALWIAALEAALAMAQRLQLDLGLDT 645
Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
+Q++F A L ++ + DLLW G YY+ D+ + +MADQLCG + + G+
Sbjct: 646 AAEQRRFGAWLEQSRANFDDLLWNGEYYRIDAESG--TPVVMADQLCGDFYARLLGLP-- 701
Query: 189 AIFPSANIARALATIFSTN---VRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTY 244
P + ARA +T+ + FE+G +G NG++ +G D + EVWTG+ +
Sbjct: 702 ---PVVSDARARSTLQAVKEACFERFESGRLGVANGLRRDGSPLDPNGTHPLEVWTGINF 758
Query: 245 LLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
L++ G A + VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 759 GLASYYRLMGESKTALAICSAVVGQVYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 817
>gi|188570621|gb|ACD64346.1| hypothetical protein [Helianthus petiolaris]
gi|188570623|gb|ACD64347.1| hypothetical protein [Helianthus petiolaris]
Length = 210
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 91/208 (43%), Positives = 120/208 (57%), Gaps = 4/208 (1%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSR 164
+AYCG L +AAL M+ LND Q++ K KA + LW GSY+ +DS S
Sbjct: 4 SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 63
Query: 165 HSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKP 224
+S SI ADQL G + ++G+ +F I +L IF NV G MGAVNGM P
Sbjct: 64 NSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121
Query: 225 NGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPE 282
NG D + +QS EVWTGVTY ++A M++ D A+TTA G++ + G G F+TPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAEMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181
Query: 283 GLTGDKTYRSGGYMRALAVYAMQDAYLK 310
G T D +YRS YMR LA++ MQ A K
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEK 209
>gi|292384324|gb|ADE21415.1| unknown [Helianthus neglectus]
Length = 210
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 83/187 (44%), Positives = 109/187 (58%), Gaps = 4/187 (2%)
Query: 126 LNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGV 185
LND Q++ K KA + LW GSY+ +DS S +S SI ADQL G + ++G+
Sbjct: 25 LNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASAGL 84
Query: 186 NTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYL 245
+F I +L IF NV G MGAVNGM PNG D + +QS EVWTGVTY
Sbjct: 85 PN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYG 142
Query: 246 LSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYA 303
++A M++ G D A+TTA G++ + G G F+TPEG T D +YRS YMR LA++
Sbjct: 143 VAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWG 202
Query: 304 MQDAYLK 310
MQ A K
Sbjct: 203 MQQALEK 209
>gi|188570635|gb|ACD64353.1| hypothetical protein [Helianthus petiolaris]
gi|188570639|gb|ACD64355.1| hypothetical protein [Helianthus petiolaris]
Length = 211
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 92/209 (44%), Positives = 121/209 (57%), Gaps = 5/209 (2%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDT-QQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQS 163
+AYCG L +AAL M+ LND Q++ K KA + LW GSY+ +DS S
Sbjct: 4 SAYCGCLWLAALQAAAAMAVELNDKFAAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSS 63
Query: 164 RHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMK 223
+S SI ADQL G + ++G+ +F I +L IF NV G MGAVNGM
Sbjct: 64 SNSKSIQADQLAGQWYTASAGLPN--LFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 121
Query: 224 PNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG--FETP 281
PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G++ + G G F+TP
Sbjct: 122 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 181
Query: 282 EGLTGDKTYRSGGYMRALAVYAMQDAYLK 310
EG T D +YRS YMR LA++ MQ A K
Sbjct: 182 EGWTMDGSYRSLVYMRPLAIWGMQQALEK 210
>gi|148240031|ref|YP_001225418.1| hypothetical protein SynWH7803_1695 [Synechococcus sp. WH 7803]
gi|147848570|emb|CAK24121.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
Length = 833
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 94/299 (31%), Positives = 147/299 (49%), Gaps = 29/299 (9%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
P E PF N D + WKDL F++ ++R + L +D++FL +P ++ +
Sbjct: 526 PNEVPFDATNYTAYQDCNLWKDLASDFVLQVWRTFKLAPTGEDLKFLADCWPAAVQALHY 585
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM------VEMSSLLND 128
+ FD G+ NGG PDQT AYCG L IAAL ++ L+
Sbjct: 586 LKRFDVNNDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAMAQRLQQELGLDT 645
Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
Q FSA L ++ + LLW G YY D+ + +MADQLCG + + G+ E
Sbjct: 646 GSDQHTFSAWLEQSRSNFDRLLWNGEYYDIDAESG--TPVVMADQLCGDFYARLLGL--E 701
Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
+ AN L + + F+ GS+G NG++ +G D + EVWTG+ + ++
Sbjct: 702 PVVSEANSRSTLKAVKESCFEKFQGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIA 761
Query: 248 A--AMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ ++ EG EA +A + VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 762 SYYRLMGEGQTAEAICSA--VVDQVYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 817
>gi|288901049|gb|ADC68237.1| hypothetical protein [Dendrobium nobile]
Length = 207
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 79/203 (38%), Positives = 106/203 (52%), Gaps = 12/203 (5%)
Query: 67 DIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAAL 116
D+ F V+P + FD+ G G+++N GFPDQT AYCG L +A+L
Sbjct: 6 DMAFGREVWPAVCAAMDYMNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLASL 65
Query: 117 ACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCG 176
M+ L D ++ + K A + LW GSY+ +DS S +S SI ADQL G
Sbjct: 66 QAAAAMAHRLGDHAYAEKCTIKFLTAKHAFESKLWNGSYFNYDSGNSSNSKSIQADQLAG 125
Query: 177 YLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSE 236
+ +SG+ +F I L IF NV G +GAVNGM PNG D S +QS
Sbjct: 126 QWYAASSGL--PPLFDENKIRSTLQKIFDFNVMKVRGGRIGAVNGMHPNGKVDESCMQSR 183
Query: 237 EVWTGVTYLLSAAMLYEGNVDEA 259
E+WTGVTY LSA M+ G ++A
Sbjct: 184 EIWTGVTYALSATMILAGMEEQA 206
>gi|47212590|emb|CAG12815.1| unnamed protein product [Tetraodon nigroviridis]
Length = 777
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 67/140 (47%), Positives = 85/140 (60%), Gaps = 2/140 (1%)
Query: 168 SIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 227
S+ A CG+ + T FP I AL +IF NV F G MGAVNGM+P G
Sbjct: 638 SVPATGFCGHPVWETE--TTRLAFPKEKIQTALKSIFDLNVMSFAGGQMGAVNGMRPEGV 695
Query: 228 RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGD 287
DRS++QS+EVW GV Y L+A M++EG ++E TA G YRTV+ER G+ F+TPE
Sbjct: 696 PDRSSVQSDEVWIGVVYGLAATMIHEGMLEEGMRTAEGCYRTVWERLGMAFQTPEAYCEK 755
Query: 288 KTYRSGGYMRALAVYAMQDA 307
YRS YMR L ++AMQ A
Sbjct: 756 GIYRSLAYMRPLCIWAMQLA 775
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 55/79 (69%), Gaps = 2/79 (2%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCE- 86
PE+EP+ +N Y + D + WKDLNLKF++ +YRD+ L +D ++L ++P C E++ + E
Sbjct: 535 PEDEPWQRVNAYLIHDTAGWKDLNLKFVLQVYRDFHLTQDAQYLRDMWPIC-EMVTESEL 593
Query: 87 SFDKQGLGIVQNGGFPDQT 105
FD G G+++N G+ DQT
Sbjct: 594 QFDLDGDGLIENSGYADQT 612
>gi|88809081|ref|ZP_01124590.1| hypothetical protein WH7805_05296 [Synechococcus sp. WH 7805]
gi|88787023|gb|EAR18181.1| hypothetical protein WH7805_05296 [Synechococcus sp. WH 7805]
Length = 849
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 93/299 (31%), Positives = 146/299 (48%), Gaps = 29/299 (9%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
P E PF N D + WKDL F++ ++R + L +D+ FL +P ++ +
Sbjct: 541 PNEVPFDATNYTAYQDCNLWKDLASDFVLQVWRTFKLAPTGEDLTFLADCWPAAVQALHY 600
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM------VEMSSLLND 128
+ FD G+ NGG PDQT AYCG L IAAL ++ L+
Sbjct: 601 LKRFDVNNDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAMAQRLQQELGLDT 660
Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
+Q FS L ++ + LLW G YY D+ + +MADQLCG + + G+ E
Sbjct: 661 GSEQHTFSNWLEQSRANFDALLWNGEYYDIDAESG--TPVVMADQLCGDFYARLLGL--E 716
Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
+ AN L + + F+ GS+G NG++ +G D + EVWTG+ + ++
Sbjct: 717 PVVSDANSRSTLKAVKESCFEKFQGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIA 776
Query: 248 A--AMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ ++ EG EA +A + VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 777 SYYQLMGEGQTAEAICSA--VVHQVYS-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 832
>gi|113953483|ref|YP_730032.1| hypothetical protein sync_0818 [Synechococcus sp. CC9311]
gi|113880834|gb|ABI45792.1| conserved hypothetical protein [Synechococcus sp. CC9311]
Length = 833
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 91/299 (30%), Positives = 144/299 (48%), Gaps = 29/299 (9%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
P E PF N D + WKDL +++ ++R + L +D+ FL +P ++ +
Sbjct: 526 PNEVPFDATNYTAYQDCNLWKDLASDYVLQVWRTFKLAPTGEDLSFLAECWPAAVQALHY 585
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM------VEMSSLLND 128
+ FD G+ NGG PDQT AYCG L IAAL +++ L+
Sbjct: 586 LKQFDVNDDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAMAQRLQLELGLDT 645
Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
+Q FS L ++ + LLW G YY D+ + +MADQLCG + + G+
Sbjct: 646 GDEQHTFSGWLEQSRANFDKLLWNGEYYDIDAESG--TPVVMADQLCGDFYARLLGL--P 701
Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
+ AN L + F GS+G NG++ +G D + EVWTG+ + ++
Sbjct: 702 PVVSEANSRSTLKAVKEACFDNFAGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIA 761
Query: 248 A--AMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ ++ EG EA +A + VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 762 SYYRLMGEGKTAEAICSA--VVTQVYA-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 817
>gi|352096901|ref|ZP_08957615.1| Glucosylceramidase [Synechococcus sp. WH 8016]
gi|351675485|gb|EHA58646.1| Glucosylceramidase [Synechococcus sp. WH 8016]
Length = 833
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 89/299 (29%), Positives = 142/299 (47%), Gaps = 29/299 (9%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALH---KDIRFLTRVYPTCLELIRK 84
P E PF N D + WKDL +++ ++R + L +D+ FL +P ++ +
Sbjct: 526 PNEVPFDATNYTAYQDCNLWKDLASDYVLQVWRTFKLAPTGEDLSFLAECWPAAVQALHY 585
Query: 85 CESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM------VEMSSLLND 128
+ FD G+ NGG PDQT AYCG L IAAL +++ L+
Sbjct: 586 LKQFDVNDDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAMAQRLQLELGLDT 645
Query: 129 TQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
+Q FS L ++ + LLW G YY D+ + +MADQLCG + + +
Sbjct: 646 GDEQHTFSGWLEQSRANFDTLLWNGEYYDIDAESG--TPVVMADQLCGDFYARL--LELP 701
Query: 189 AIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDR-DRSAIQSEEVWTGVTYLLS 247
+ AN L + F G +G NG++ +G D + EVWTG+ + ++
Sbjct: 702 PVVSEANSRSTLKAVKEACFDNFAGGLLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIA 761
Query: 248 A--AMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
+ ++ EG EA +A + VY GL F TPE +T T+R+ Y+RA+A++ +
Sbjct: 762 SYYRLMGEGPTAEAICSA--VVTQVYA-GGLQFRTPEAITAVNTFRACHYLRAMAIWGL 817
>gi|156333782|ref|XP_001619412.1| hypothetical protein NEMVEDRAFT_v1g224207 [Nematostella vectensis]
gi|156202556|gb|EDO27312.1| predicted protein [Nematostella vectensis]
Length = 173
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 54/101 (53%), Positives = 75/101 (74%)
Query: 214 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 273
G+MGAVNG++P+G D S++Q+EEVWTGVTY ++A+M+ EG VDE + TA G+Y T +ER
Sbjct: 52 GTMGAVNGIRPDGQLDTSSLQAEEVWTGVTYAVAASMIQEGLVDEGFKTASGIYNTCFER 111
Query: 274 TGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDAYLKGKVK 314
G+ F+TPE + + YRS YMR L+++AMQ A K K K
Sbjct: 112 LGMNFQTPEAIVANGNYRSLAYMRPLSIWAMQWALEKRKNK 152
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 33/51 (64%)
Query: 30 EEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLE 80
E P+ +N Y++ D S WKDLNLKF++ +YRDY D+ +L ++P E
Sbjct: 1 EAPWDHVNAYHIHDTSKWKDLNLKFVLQVYRDYVFTNDVYYLQDMWPITKE 51
>gi|295829670|gb|ADG38504.1| AT3G24180-like protein [Capsella grandiflora]
gi|295829672|gb|ADG38505.1| AT3G24180-like protein [Capsella grandiflora]
gi|295829674|gb|ADG38506.1| AT3G24180-like protein [Capsella grandiflora]
gi|295829676|gb|ADG38507.1| AT3G24180-like protein [Capsella grandiflora]
Length = 190
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 71/192 (36%), Positives = 96/192 (50%), Gaps = 12/192 (6%)
Query: 70 FLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM 119
F V+P + E FD+ +++N GFPDQT AYCG L +AAL
Sbjct: 1 FGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLAALQAA 60
Query: 120 VEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLF 179
M+ + D + K A LW GSY+ +DS S +S SI DQL G +
Sbjct: 61 AAMALQIGDKFFAELCKNKFLNAKAALEKKLWNGSYFNYDSGASSNSKSIQTDQLAGQWY 120
Query: 180 LKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVW 239
+SG+ IF + I + IF NV + G MGAVNGM P+G D + +QS E+W
Sbjct: 121 AASSGL--PPIFEESKIKSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDTCMQSREIW 178
Query: 240 TGVTYLLSAAML 251
TGVTY +A M+
Sbjct: 179 TGVTYAAAATMI 190
>gi|345291087|gb|AEN82035.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291089|gb|AEN82036.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291091|gb|AEN82037.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291093|gb|AEN82038.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291095|gb|AEN82039.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291097|gb|AEN82040.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291099|gb|AEN82041.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291101|gb|AEN82042.1| AT3G24180-like protein, partial [Capsella rubella]
Length = 191
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 71/192 (36%), Positives = 96/192 (50%), Gaps = 12/192 (6%)
Query: 70 FLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM 119
F V+P + E FD+ +++N GFPDQT AYCG L +AAL
Sbjct: 1 FGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLAALQAA 60
Query: 120 VEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLF 179
M+ + D + K A LW GSY+ +DS S +S SI DQL G +
Sbjct: 61 AAMALQIGDKFFAELCKNKFLNAKAALEKKLWNGSYFNYDSGASSNSKSIQTDQLAGQWY 120
Query: 180 LKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVW 239
+SG+ IF + I + IF NV + G MGAVNGM P+G D + +QS E+W
Sbjct: 121 AASSGL--PPIFEESKIKSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDTCMQSREIW 178
Query: 240 TGVTYLLSAAML 251
TGVTY +A M+
Sbjct: 179 TGVTYAAAATMI 190
>gi|295829678|gb|ADG38508.1| AT3G24180-like protein [Neslia paniculata]
Length = 190
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 71/192 (36%), Positives = 96/192 (50%), Gaps = 12/192 (6%)
Query: 70 FLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACM 119
F V+P + E FD+ +++N GFPDQT AYCG L +AAL
Sbjct: 1 FGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLAALQAA 60
Query: 120 VEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLF 179
M+ + D + K A LW GSY+ +DS S +S SI DQL G +
Sbjct: 61 AAMALQIGDKFFAELCKNKFLNAKAALEKKLWNGSYFNYDSGASSNSKSIQTDQLAGQWY 120
Query: 180 LKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVW 239
+SG+ IF + I + IF NV + G MGAVNGM P+G D + +QS E+W
Sbjct: 121 AASSGL--PPIFEESKIRSTMQKIFDFNVMKTKGGRMGAVNGMHPDGKVDETCMQSREIW 178
Query: 240 TGVTYLLSAAML 251
TGVTY +A M+
Sbjct: 179 TGVTYAAAATMI 190
>gi|148242848|ref|YP_001228005.1| hypothetical protein SynRCC307_1749 [Synechococcus sp. RCC307]
gi|147851158|emb|CAK28652.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
Length = 220
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 69/207 (33%), Positives = 103/207 (49%), Gaps = 12/207 (5%)
Query: 105 TAYCGGLHIAALACMVEMSSLL------NDTQQQQQFSAKLTKASQVYHDLLWTGSYYKF 158
+AYCG L IAAL + M L + +Q+ FS L ++ + LLW G +YK
Sbjct: 2 SAYCGALWIAALEAALAMGQTLALELGVDTAAEQRDFSQWLQQSRGNFDQLLWNGEFYKI 61
Query: 159 DSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGA 218
D+ + +MADQLCG + + G+ + AN +L I GF G +G
Sbjct: 62 DAESG--TPVVMADQLCGDFYARLLGL--PPVVSEANARSSLQAIKEACFEGFAGGRLGV 117
Query: 219 VNGMKPNGDR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG 277
NG++ +G D + EVWTG+ + L+A G + A + VY GL
Sbjct: 118 ANGLRRDGTPLDPNGTHPLEVWTGINFGLAAYYRLMGETNTAVAITQAVVNQVYA-GGLQ 176
Query: 278 FETPEGLTGDKTYRSGGYMRALAVYAM 304
F TPE LTG T+R+ Y+RA+A++A+
Sbjct: 177 FRTPEALTGQNTFRACHYLRAMAIWAL 203
>gi|255073149|ref|XP_002500249.1| hypothetical protein MICPUN_55788 [Micromonas sp. RCC299]
gi|226515511|gb|ACO61507.1| hypothetical protein MICPUN_55788 [Micromonas sp. RCC299]
Length = 954
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 91/314 (28%), Positives = 139/314 (44%), Gaps = 38/314 (12%)
Query: 28 PEEEPF-SLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIR-----------FLTRVY 75
P + P S N +++ DV+ WKDL K ++++ R +AL R L R++
Sbjct: 621 PFDAPLTSSPNAFDLTDVNRWKDLAPKLMLTLARAHALRGGFRGPNDPGGIPRDVLRRMF 680
Query: 76 PTCLE-LIRKCESFDKQGLGIVQNG----GFPDQT-----------AYCGGLHIAALACM 119
C E L + FD G G++++ PD + AY GGL +AAL
Sbjct: 681 RPCYESLAAQLRRFDGNGDGLIEHDDDDEAGPDHSFVAWRARGDTSAYAGGLWLAALRAG 740
Query: 120 VEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTG------SYYKFDSSQSRHSDSIMADQ 173
++ LN+T + + A+ + LW G YY+FD+S + D A Q
Sbjct: 741 AGLARDLNETHARDSLEQTMRAAAVAHDAALWLGDDDGGRGYYRFDASGTEGGDVSSAAQ 800
Query: 174 LCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAI 233
+ G L + + P+ + ALA++ NV +G D +
Sbjct: 801 VMGEWALAV--IGARGVLPARKVRAALASVARLNVAAAGGAGGAVNGAAA-SGSVDAVSA 857
Query: 234 QSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSG 293
S EVW GV+Y +++ ML G DEAW A G+ YER GL F TPE +RS
Sbjct: 858 HSNEVWPGVSYAVASHMLLAGMDDEAWELARGVAAGTYER-GLAFRTPEAWDARGGFRSA 916
Query: 294 GYMRALAVYAMQDA 307
RA +V+A++ A
Sbjct: 917 MSQRAGSVWAIEHA 930
>gi|226444526|gb|ACO57837.1| unknown [Helianthus annuus]
gi|226444528|gb|ACO57838.1| unknown [Helianthus annuus]
gi|226444560|gb|ACO57854.1| unknown [Helianthus petiolaris]
gi|226444568|gb|ACO57858.1| unknown [Helianthus petiolaris]
Length = 103
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 55/106 (51%), Positives = 70/106 (66%), Gaps = 5/106 (4%)
Query: 207 NVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 266
VRG G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D+A+TTA G+
Sbjct: 1 KVRG---GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDKAFTTAEGI 57
Query: 267 YRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 310
+ + G G F+TPEG T D +YRS YMR LA++ MQ A K
Sbjct: 58 FIAGWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 103
>gi|226444514|gb|ACO57831.1| unknown [Helianthus annuus]
gi|226444516|gb|ACO57832.1| unknown [Helianthus annuus]
gi|226444518|gb|ACO57833.1| unknown [Helianthus annuus]
gi|226444520|gb|ACO57834.1| unknown [Helianthus annuus]
gi|226444522|gb|ACO57835.1| unknown [Helianthus annuus]
gi|226444530|gb|ACO57839.1| unknown [Helianthus annuus]
gi|226444532|gb|ACO57840.1| unknown [Helianthus annuus]
gi|226444534|gb|ACO57841.1| unknown [Helianthus annuus]
gi|226444536|gb|ACO57842.1| unknown [Helianthus annuus]
gi|226444538|gb|ACO57843.1| unknown [Helianthus annuus]
gi|226444540|gb|ACO57844.1| unknown [Helianthus annuus]
gi|226444542|gb|ACO57845.1| unknown [Helianthus annuus]
gi|226444544|gb|ACO57846.1| unknown [Helianthus annuus]
gi|226444546|gb|ACO57847.1| unknown [Helianthus petiolaris]
gi|226444548|gb|ACO57848.1| unknown [Helianthus petiolaris]
gi|226444550|gb|ACO57849.1| unknown [Helianthus petiolaris]
gi|226444552|gb|ACO57850.1| unknown [Helianthus petiolaris]
gi|226444554|gb|ACO57851.1| unknown [Helianthus petiolaris]
gi|226444556|gb|ACO57852.1| unknown [Helianthus petiolaris]
gi|226444558|gb|ACO57853.1| unknown [Helianthus petiolaris]
gi|226444562|gb|ACO57855.1| unknown [Helianthus petiolaris]
gi|226444564|gb|ACO57856.1| unknown [Helianthus petiolaris]
gi|226444566|gb|ACO57857.1| unknown [Helianthus petiolaris]
gi|226444570|gb|ACO57859.1| unknown [Helianthus argophyllus]
gi|226444572|gb|ACO57860.1| unknown [Helianthus argophyllus]
gi|226444576|gb|ACO57862.1| unknown [Helianthus argophyllus]
gi|226444578|gb|ACO57863.1| unknown [Helianthus argophyllus]
gi|226444580|gb|ACO57864.1| unknown [Helianthus argophyllus]
gi|226444582|gb|ACO57865.1| unknown [Helianthus argophyllus]
gi|226444584|gb|ACO57866.1| unknown [Helianthus argophyllus]
gi|226444586|gb|ACO57867.1| unknown [Helianthus argophyllus]
gi|226444588|gb|ACO57868.1| unknown [Helianthus argophyllus]
gi|226444590|gb|ACO57869.1| unknown [Helianthus argophyllus]
gi|226444592|gb|ACO57870.1| unknown [Helianthus argophyllus]
Length = 103
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 55/106 (51%), Positives = 69/106 (65%), Gaps = 5/106 (4%)
Query: 207 NVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 266
VRG G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G+
Sbjct: 1 KVRG---GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGI 57
Query: 267 YRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 310
+ + G G F+TPEG T D +YRS YMR LA++ MQ A K
Sbjct: 58 FIAGWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 103
>gi|226444524|gb|ACO57836.1| unknown [Helianthus annuus]
Length = 103
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/103 (52%), Positives = 68/103 (66%), Gaps = 5/103 (4%)
Query: 207 NVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 266
VRG G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G+
Sbjct: 1 KVRG---GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGI 57
Query: 267 YRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
+ + G G F+TPEG T D +YRS YMR LA++ MQ A
Sbjct: 58 FIAGWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQA 100
>gi|226444574|gb|ACO57861.1| unknown [Helianthus argophyllus]
Length = 103
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 68/106 (64%), Gaps = 5/106 (4%)
Query: 207 NVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGL 266
VRG G MGAVNGM PNG D + +QS EVWTGVTY ++A M++ G D A+TTA G+
Sbjct: 1 KVRG---GRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGI 57
Query: 267 YRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDAYLK 310
+ + G G F+TPEG T D +YRS YMR LA++ Q A K
Sbjct: 58 FIAGWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGKQQALEK 103
>gi|449674846|ref|XP_002159451.2| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Hydra
magnipapillata]
Length = 751
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 46/136 (33%), Positives = 76/136 (55%), Gaps = 10/136 (7%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCES 87
P++EPF +N Y + D S WKDLNL F++ +YRD+ + K+ +FL ++P + +
Sbjct: 616 PDDEPFVRVNAYCIHDTSQWKDLNLHFVLQVYRDFYITKNKQFLMDMWPAMKAAMAHSLA 675
Query: 88 FDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSA 137
D G GI++N G D T AY GGL ++AL C++E++ +L +Q ++ +
Sbjct: 676 QDIDGDGIIENSGIADSTFDTWVVTGPSAYTGGLWLSALRCIIEIAHILGLSQSISKYKS 735
Query: 138 KLTKASQVYHDLLWTG 153
L + + Y LW G
Sbjct: 736 ILERGKKSYEKKLWNG 751
>gi|411117353|ref|ZP_11389840.1| putative bile acid beta-glucosidase [Oscillatoriales cyanobacterium
JSC-12]
gi|410713456|gb|EKQ70957.1| putative bile acid beta-glucosidase [Oscillatoriales cyanobacterium
JSC-12]
Length = 907
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 87/180 (48%), Gaps = 31/180 (17%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRD--YALHKDIRFLTRVYPTCLELIRKC 85
P E + N + D + WKDL F++ +YRD Y KDI FL + + +
Sbjct: 520 PNEHVWQKTNYTSYQDCNLWKDLPCDFVLQVYRDFLYTGSKDIAFLQECWSAIAQTLHYL 579
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALAC--------------MVE 121
++FD GI +N G PDQT AYCGGL IAAL M +
Sbjct: 580 KTFDYDFDGIPENSGAPDQTFDDWKLDGISAYCGGLWIAALEAALAIGRILAEYTTNMRQ 639
Query: 122 MSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCGYLFLK 181
+++L ND F L +A +YHD LW G YY+ DS+ S+ +MADQLCG + +
Sbjct: 640 LATLSNDIGN---FQVWLEQARSIYHDTLWNGQYYRLDSNSC--SEVVMADQLCGQFYAR 694
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 55/91 (60%), Gaps = 2/91 (2%)
Query: 215 SMGAVNGMKPNGD-RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 273
+GA NG+KP+G + ++ EVWTG+ + L+A ++ G DEA + + +Y
Sbjct: 809 PIGAANGVKPDGSPENPNSTHPLEVWTGINFGLAAFLVQMGMRDEAMELTQAVVQQIY-T 867
Query: 274 TGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
GL F TPE +T T+R+ Y+RALA++A+
Sbjct: 868 NGLQFRTPEAITPSGTFRACHYLRALAIWAI 898
>gi|295828610|gb|ADG37974.1| AT1G33700-like protein [Capsella grandiflora]
Length = 105
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 58/96 (60%), Gaps = 10/96 (10%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN+F+ + WKDLN KF++ +YRD D+ F V+P+ I + FDK
Sbjct: 1 DPWFEVNAYNLFNTNRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 60
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAAL 116
G G+++N GFPDQT AYCGGL +AAL
Sbjct: 61 DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAAL 96
>gi|295828604|gb|ADG37971.1| AT1G33700-like protein [Capsella grandiflora]
gi|295828608|gb|ADG37973.1| AT1G33700-like protein [Capsella grandiflora]
Length = 105
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 57/96 (59%), Gaps = 10/96 (10%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN+F+ WKDLN KF++ +YRD D+ F V+P+ I + FDK
Sbjct: 1 DPWFEVNAYNLFNTBRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 60
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAAL 116
G G+++N GFPDQT AYCGGL +AAL
Sbjct: 61 DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAAL 96
>gi|295828602|gb|ADG37970.1| AT1G33700-like protein [Capsella grandiflora]
gi|295828606|gb|ADG37972.1| AT1G33700-like protein [Capsella grandiflora]
gi|295828612|gb|ADG37975.1| AT1G33700-like protein [Capsella grandiflora]
gi|295828614|gb|ADG37976.1| AT1G33700-like protein [Neslia paniculata]
Length = 105
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 57/96 (59%), Gaps = 10/96 (10%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN+F+ WKDLN KF++ +YRD D+ F V+P+ I + FDK
Sbjct: 1 DPWFEVNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 60
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAAL 116
G G+++N GFPDQT AYCGGL +AAL
Sbjct: 61 DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAAL 96
>gi|345289099|gb|AEN81041.1| AT1G33700-like protein, partial [Capsella rubella]
Length = 191
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 57/96 (59%), Gaps = 10/96 (10%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN+F+ WKDLN KF++ +YRD D+ F V+P+ I + FDK
Sbjct: 83 DPWFEVNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 142
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAAL 116
G G+++N GFPDQT AYCGGL +AAL
Sbjct: 143 DGDGMIENDGFPDQTYDAWSCSGVSAYCGGLWVAAL 178
>gi|345289089|gb|AEN81036.1| AT1G33700-like protein, partial [Capsella rubella]
gi|345289091|gb|AEN81037.1| AT1G33700-like protein, partial [Capsella rubella]
gi|345289093|gb|AEN81038.1| AT1G33700-like protein, partial [Capsella rubella]
gi|345289095|gb|AEN81039.1| AT1G33700-like protein, partial [Capsella rubella]
gi|345289097|gb|AEN81040.1| AT1G33700-like protein, partial [Capsella rubella]
gi|345289101|gb|AEN81042.1| AT1G33700-like protein, partial [Capsella rubella]
gi|345289103|gb|AEN81043.1| AT1G33700-like protein, partial [Capsella rubella]
Length = 191
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 57/96 (59%), Gaps = 10/96 (10%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +N YN+F+ WKDLN KF++ +YRD D+ F V+P+ I + FDK
Sbjct: 83 DPWFEVNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 142
Query: 91 QGLGIVQNGGFPDQT----------AYCGGLHIAAL 116
G G+++N GFPDQT AYCGGL +AAL
Sbjct: 143 DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAAL 178
>gi|256082539|ref|XP_002577512.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
Length = 717
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 81/157 (51%), Gaps = 31/157 (19%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRK-CE 86
PE+EP+ N Y +F WKDLN KFI+ ++RD+ + +D ++L + P L ++RK
Sbjct: 560 PEDEPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLYMLPIVLRILRKSLV 619
Query: 87 SFDKQGLGIVQNGGFPDQ----------TAYCGGLHIAALACMVEMSS------------ 124
++D G+++N GFPDQ TAY GG+ ++ L +M S
Sbjct: 620 AWDSDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYATFDMLSWCLKSDSPVYDQ 679
Query: 125 LLNDTQQQQQFSAKL--------TKASQVYHDLLWTG 153
++N+T Q+ +++ TKA Y+ LWTG
Sbjct: 680 IVNNTDDTQRSWSEIKDEVQTLFTKARDSYNAKLWTG 716
>gi|353232146|emb|CCD79501.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
Length = 725
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 81/157 (51%), Gaps = 31/157 (19%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRK-CE 86
PE+EP+ N Y +F WKDLN KFI+ ++RD+ + +D ++L + P L ++RK
Sbjct: 568 PEDEPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLYMLPIVLRILRKSLV 627
Query: 87 SFDKQGLGIVQNGGFPDQ----------TAYCGGLHIAALACMVEMSS------------ 124
++D G+++N GFPDQ TAY GG+ ++ L +M S
Sbjct: 628 AWDSDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYATFDMLSWCLKSDSPVYDQ 687
Query: 125 LLNDTQQQQQFSAKL--------TKASQVYHDLLWTG 153
++N+T Q+ +++ TKA Y+ LWTG
Sbjct: 688 IVNNTDDTQRSWSEIKDEVQTLFTKARDSYNAKLWTG 724
>gi|194359640|gb|ACF57667.1| bile acid beta-glucosidase [Ctenopharyngodon idella]
Length = 89
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 56/89 (62%), Gaps = 10/89 (11%)
Query: 32 PFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQ 91
P+ +N Y ++D +DWKDLNLKF++ ++RDY L +D ++L ++P C ++ FDK
Sbjct: 1 PWVRVNAYLIYDTADWKDLNLKFVLQVFRDYHLTQDEQYLKDMWPICQTVMETELKFDKD 60
Query: 92 GLGIVQNGGFPDQT----------AYCGG 110
G G+++N G+ DQT AYCGG
Sbjct: 61 GDGLIENSGYADQTYDGWKVTGPSAYCGG 89
>gi|15922862|ref|NP_378531.1| hypothetical protein ST2526 [Sulfolobus tokodaii str. 7]
gi|15623653|dbj|BAB67640.1| putative glycosidase [Sulfolobus tokodaii str. 7]
Length = 646
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 63/234 (26%), Positives = 101/234 (43%), Gaps = 18/234 (7%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQN-GGFPDQ- 104
W DL +++ IYRDY D+ FL R Y E+I S DK I + GGF +
Sbjct: 378 WTDLGSTWVLMIYRDYKFTNDLAFLKRNYRKMKEVIDWLISLDKDKDCIPDSKGGFDNSY 437
Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
++Y G + + AL + S +L + + L + ++ LW G Y+
Sbjct: 438 DGTYMYGASSYVGSMFLCALRAFISASKILG--MEYSIYEDCLRRGIMTFNS-LWNGKYF 494
Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
S +R +S M+ Q+ G + G+ E I I +AL +I+ N + A
Sbjct: 495 IAWKSDNRKKESCMSSQILGQFWCDILGL--EPIIDEDKIVQALRSIYELNGK---ASKF 549
Query: 217 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 270
VN + P+G D Q W+ V + +SA M+ G +E A + T+
Sbjct: 550 CLVNSVNPDGSIDTETDQMRSCWSRVAFAVSAHMIIRGLKNEGIEIANREWETI 603
>gi|47196687|emb|CAF89164.1| unnamed protein product [Tetraodon nigroviridis]
Length = 239
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 40/86 (46%), Positives = 51/86 (59%), Gaps = 2/86 (2%)
Query: 168 SIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGD 227
S+ A CG+ + T FP I AL +IF NV F G MGAVNGM+P G
Sbjct: 155 SVPATGFCGHPVWETE--TTRQAFPKEKIQTALKSIFDLNVMSFAGGQMGAVNGMRPEGV 212
Query: 228 RDRSAIQSEEVWTGVTYLLSAAMLYE 253
DRS++QS+EVW GV Y L+A M++E
Sbjct: 213 PDRSSVQSDEVWIGVVYGLAATMIHE 238
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 58 IYRDYALHKDIRFLTRVYPTCLELIRKCE-SFDKQGLGIVQNGGFPDQT 105
+YRD+ L +D ++L ++P C E++ + E FD G G+++N G+ DQT
Sbjct: 82 VYRDFHLTQDAQYLRDMWPIC-EMVTESELQFDLDGDGLIENSGYADQT 129
>gi|332796021|ref|YP_004457521.1| hypothetical protein Ahos_0332 [Acidianus hospitalis W1]
gi|332693756|gb|AEE93223.1| conserved hypothetical protein [Acidianus hospitalis W1]
Length = 652
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 62/237 (26%), Positives = 103/237 (43%), Gaps = 18/237 (7%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQN-GGFPDQ- 104
W DL +I+ IYRDY D+ FL R Y E+I D+ GI + GGF +
Sbjct: 383 WNDLGPTWILMIYRDYKFTNDLSFLKRNYNKMKEVIDWLIKKDEDNDGIPDSKGGFDNSY 442
Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
++Y G L + AL +E S +L + ++ L KA LW G Y+
Sbjct: 443 DGTYMYGTSSYIGSLFLCALKAFIESSKIL--SYDYSKYEEILNKAKSSLES-LWNGRYF 499
Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
+ + ++S + QL G + G+ + I AL I+ N + A
Sbjct: 500 INWKYKDQKNESCLNSQLLGEFWCNLLGLGN--VIDEDKIKTALKYIYEHNGK---ASKY 554
Query: 217 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYER 273
VN + P+G D S Q + W +++ +++ M+ +G ++E A + T+ R
Sbjct: 555 CLVNSVNPDGSIDESTDQMKSCWPRISFAIASHMIMKGMINEGIEIARKEWETISSR 611
>gi|327310049|ref|YP_004336946.1| hypothetical protein TUZN_0133 [Thermoproteus uzoniensis 768-20]
gi|326946528|gb|AEA11634.1| hypothetical protein TUZN_0133 [Thermoproteus uzoniensis 768-20]
Length = 660
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 61/239 (25%), Positives = 99/239 (41%), Gaps = 17/239 (7%)
Query: 44 VSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPD 103
++ WKDL +++ IYRDY D+ L R + + S D+ G + + G D
Sbjct: 381 LTPWKDLASTWVLMIYRDYYYTGDVEILRRNIDAMKDAVDWLMSLDRDGDCVPDSRGRND 440
Query: 104 ----------QTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTG 153
+ AY L + AL ++ + L + + ++ L + + D LW G
Sbjct: 441 NSYDGSNMYGRVAYVASLFLCALTAYIKAAERLG-LKPEPKYLECLERGKRSL-DELWNG 498
Query: 154 SYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEA 213
YY + R D+ M+ QL G + ++ I P +A AL +I+ G A
Sbjct: 499 RYYVAWADGPRRKDACMSSQLLGQFWCDM--LDLPPIAPDDKVAAALRSIYEL---GMRA 553
Query: 214 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYE 272
N + P+G D Q WT V + ++A ML G E A +RT+ E
Sbjct: 554 SKYCIPNSVTPDGKPDEETPQLRSCWTRVNFAVAAHMLLRGMEREGMEVASREWRTIAE 612
>gi|345293667|gb|AEN83325.1| AT5G49900-like protein, partial [Capsella rubella]
gi|345293669|gb|AEN83326.1| AT5G49900-like protein, partial [Capsella rubella]
gi|345293671|gb|AEN83327.1| AT5G49900-like protein, partial [Capsella rubella]
gi|345293673|gb|AEN83328.1| AT5G49900-like protein, partial [Capsella rubella]
gi|345293675|gb|AEN83329.1| AT5G49900-like protein, partial [Capsella rubella]
Length = 156
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 46/75 (61%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +NGYN+ + WKDLN KF++ +YRD D +F V+P+ + FDK
Sbjct: 69 DPWFEVNGYNLHNTDRWKDLNPKFVLQVYRDVVATGDKKFAVAVWPSVYVAMAYMAQFDK 128
Query: 91 QGLGIVQNGGFPDQT 105
G G+++N GFPDQT
Sbjct: 129 DGDGMIENEGFPDQT 143
>gi|269925332|ref|YP_003321955.1| hypothetical protein Tter_0211 [Thermobaculum terrenum ATCC
BAA-798]
gi|269788992|gb|ACZ41133.1| protein of unknown function DUF608 [Thermobaculum terrenum ATCC
BAA-798]
Length = 811
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 63/247 (25%), Positives = 102/247 (41%), Gaps = 29/247 (11%)
Query: 41 VFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNG- 99
++D D L ++ +YR++ D FL ++P + + + + G
Sbjct: 457 IWDHEAAADGQLGTVIRVYREWKFSGDDAFLKELWPKVKSAVNYSSLYWDKDQDFILEGR 516
Query: 100 -------GFPDQTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWT 152
F G L + AL EM+ L ++ ++ +S+ LLW
Sbjct: 517 QHNTYDIEFYGPNPLTGILFLGALRAAEEMAKYLG-SESASSYAQAFEASSKKLDQLLWN 575
Query: 153 GSYY--KFDSSQS---RHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTN 207
G YY K D +H+ ++DQL G +G+ + P +I+RAL +IF+ N
Sbjct: 576 GEYYIQKLDDPNEHRYQHASGCLSDQLFGQTLASLTGLGY--LLPKEHISRALESIFAYN 633
Query: 208 V-----------RGFEAGS-MGAVNGMKPNGDRDRSAIQ-SEEVWTGVTYLLSAAMLYEG 254
R + G G V P GDR S+EVW+G Y ++ M+YEG
Sbjct: 634 FKPNFWNHTNTQRVYALGDDAGLVMCTWPFGDRPSFPFPYSDEVWSGTEYQVATLMIYEG 693
Query: 255 NVDEAWT 261
+DEA T
Sbjct: 694 LLDEALT 700
>gi|345293661|gb|AEN83322.1| AT5G49900-like protein, partial [Capsella grandiflora]
gi|345293663|gb|AEN83323.1| AT5G49900-like protein, partial [Capsella grandiflora]
gi|345293665|gb|AEN83324.1| AT5G49900-like protein, partial [Capsella grandiflora]
Length = 156
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 46/75 (61%)
Query: 31 EPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
+P+ +NGY++ + WKDLN KF++ +YRD D +F V+P+ + FDK
Sbjct: 69 DPWFEVNGYSLHNTDRWKDLNPKFVLQVYRDVVATGDKKFAVAVWPSVYVAMAYMAQFDK 128
Query: 91 QGLGIVQNGGFPDQT 105
G G+++N GFPDQT
Sbjct: 129 DGDGMIENEGFPDQT 143
>gi|159042199|ref|YP_001541451.1| hypothetical protein Cmaq_1638 [Caldivirga maquilingensis IC-167]
gi|157921034|gb|ABW02461.1| protein of unknown function DUF608 [Caldivirga maquilingensis
IC-167]
Length = 661
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 63/237 (26%), Positives = 105/237 (44%), Gaps = 23/237 (9%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQN-GGFPDQ- 104
W DL +I+ IYRDY L ++ L R E+I S D G I + GGF +
Sbjct: 388 WNDLGSTWILMIYRDYLLTGNVEVLRRNIDKMREVIDWLISRDYDGDCIPDSRGGFDNSY 447
Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
++Y L + +L ++ + +L + ++ + L+K + + LW G Y+
Sbjct: 448 DGTNMYGASSYIASLFLCSLQAFIKSAEVLG-VRLSDRYESCLSKGRETLNS-LWNGRYF 505
Query: 157 KFDSSQSRHSDSIMADQLCGYL---FLKASGVNTEAIFPSANIARALATIFSTNVRGFEA 213
S ++S M QL G FLK V E I AL +I+ N ++
Sbjct: 506 MAWKSSGNSNESCMNSQLLGQFWCDFLKLPPVVDE-----DKIKVALRSIYELN---HKS 557
Query: 214 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTV 270
N +KP+G+ D S+ Q W V+++++A M+ G V+E A + T+
Sbjct: 558 SPHCLPNSVKPSGEIDTSSGQMRSCWPRVSFVVTAHMVLRGMVNEGLEIAKKEWDTI 614
>gi|303285310|ref|XP_003061945.1| hypothetical protein MICPUCDRAFT_48400 [Micromonas pusilla CCMP1545]
gi|226456356|gb|EEH53657.1| hypothetical protein MICPUCDRAFT_48400 [Micromonas pusilla CCMP1545]
Length = 1073
Score = 70.9 bits (172), Expect = 7e-10, Method: Composition-based stats.
Identities = 62/253 (24%), Positives = 102/253 (40%), Gaps = 53/253 (20%)
Query: 103 DQTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGS-------- 154
+ ++Y GGL +AAL ++ L++T + L A++ + + LW G
Sbjct: 782 NASSYGGGLWLAALRAGAGIARDLDETDARTSLEETLRIAAKSFDEALWRGGGGGGGGGV 841
Query: 155 ----------------YYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIAR 198
YY D+S + D ++A Q+ G L + + +
Sbjct: 842 RKSQSRSRRGTHALEGYYAADASGTDAGDVVLAGQVMGEWALGM--IRAPGVLDPRKVRA 899
Query: 199 ALATIFSTNVRGFEAGSM----------------GAVNGMKPNGDRDRSAI--------- 233
AL+T + NV+ F GAVNG + + D +A+
Sbjct: 900 ALSTTYERNVKAFGRARGGSGSGSDDEPSTYVPSGAVNGARVDDATDAAALGVGAGDGIP 959
Query: 234 -QSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRS 292
Q+ E G +Y L++ ++ G DE W A G YR YE G F TPE ++ +R
Sbjct: 960 AQARESRVGQSYALASHLILAGFSDEGWDVARGAYRVTYE-DGFAFRTPEIFDAERRFRG 1018
Query: 293 GGYMRALAVYAMQ 305
RA AV+A++
Sbjct: 1019 AISGRAGAVWAIE 1031
>gi|227828867|ref|YP_002830647.1| hypothetical protein M1425_2640 [Sulfolobus islandicus M.14.25]
gi|227460663|gb|ACP39349.1| protein of unknown function DUF608 [Sulfolobus islandicus M.14.25]
Length = 661
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 58/236 (24%), Positives = 95/236 (40%), Gaps = 17/236 (7%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
W DL ++ +YRDY D L + Y E+I D I + G D
Sbjct: 393 WTDLGPTLVLMLYRDYVFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSY 452
Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
++Y + ++ALA +++S +L D + ++ L + ++ LW G Y+
Sbjct: 453 DGTHMYGASSYVASMFLSALAAFIKISEIL-DVKIDTKYYRFLECGKKTFNS-LWNGKYF 510
Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
+ S + QL G + G+ I + I AL +I+ N F A
Sbjct: 511 VLWKKNDEENRSCLNSQLLGQFWCDILGL--PPITDNDKINTALRSIYELN---FRASKY 565
Query: 217 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYE 272
N +K + D D S Q W V++ ++A M+ G V E A + TV E
Sbjct: 566 CLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHMILRGMVKEGMEVAKREWETVKE 621
>gi|229586074|ref|YP_002844576.1| hypothetical protein M1627_2694 [Sulfolobus islandicus M.16.27]
gi|228021124|gb|ACP56531.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.27]
Length = 661
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 57/236 (24%), Positives = 95/236 (40%), Gaps = 17/236 (7%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
W DL ++ +YRDY D L + Y E+I D I + G D
Sbjct: 393 WTDLGPTLVLMLYRDYVFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSY 452
Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
++Y + ++ALA +++S +L D + ++ L + ++ LW G Y+
Sbjct: 453 DGTHMYGASSYVASMFLSALAAFIKISEIL-DVKIDTKYYRFLECGKKTFNS-LWNGKYF 510
Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
+ S + QL G + G+ I + I AL +I+ N F A
Sbjct: 511 VLWKKNDEENRSCLNSQLLGQFWCDILGL--PPITDNDKINTALRSIYELN---FRASKY 565
Query: 217 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYE 272
N +K + D D S Q W V++ ++A M+ G V E A + T+ E
Sbjct: 566 CLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHMILRGMVKEGMEVAKREWETIKE 621
>gi|227831601|ref|YP_002833381.1| hypothetical protein LS215_2801 [Sulfolobus islandicus L.S.2.15]
gi|229580553|ref|YP_002838953.1| hypothetical protein YG5714_2812 [Sulfolobus islandicus Y.G.57.14]
gi|238621059|ref|YP_002915885.1| hypothetical protein M164_2625 [Sulfolobus islandicus M.16.4]
gi|284999153|ref|YP_003420921.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
gi|227458049|gb|ACP36736.1| protein of unknown function DUF608 [Sulfolobus islandicus L.S.2.15]
gi|228011269|gb|ACP47031.1| protein of unknown function DUF608 [Sulfolobus islandicus
Y.G.57.14]
gi|238382129|gb|ACR43217.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.4]
gi|284447049|gb|ADB88551.1| protein of unknown function DUF608 [Sulfolobus islandicus L.D.8.5]
Length = 661
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 57/236 (24%), Positives = 95/236 (40%), Gaps = 17/236 (7%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
W DL ++ +YRDY D L + Y E+I D I + G D
Sbjct: 393 WTDLGPTLVLMLYRDYVFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSY 452
Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
++Y + ++ALA +++S +L D + ++ L + ++ LW G Y+
Sbjct: 453 DGTHMYGASSYVASMFLSALAAFIKISEIL-DVKIDTKYYRFLECGKKTFNS-LWNGKYF 510
Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
+ S + QL G + G+ I + I AL +I+ N F A
Sbjct: 511 VLWKKNDEENRSCLNSQLLGQFWCDILGL--PPITDNDKINTALRSIYELN---FRASKY 565
Query: 217 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYE 272
N +K + D D S Q W V++ ++A M+ G V E A + T+ E
Sbjct: 566 CLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHMILRGMVKEGMEVAKREWETIKE 621
>gi|385777223|ref|YP_005649791.1| hypothetical protein [Sulfolobus islandicus REY15A]
gi|323475970|gb|ADX86576.1| conserved hypothetical protein [Sulfolobus islandicus REY15A]
Length = 661
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 57/236 (24%), Positives = 95/236 (40%), Gaps = 17/236 (7%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
W DL ++ +YRDY D L + Y E+I D I + G D
Sbjct: 393 WTDLGPTLVLMLYRDYVFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSY 452
Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
++Y + ++ALA +++S +L D + ++ L + ++ LW G Y+
Sbjct: 453 DGTHMYGASSYVASMFLSALAAFIKISEIL-DVKIDTKYYRFLECGKKTFNS-LWNGKYF 510
Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
+ S + QL G + G+ I + I AL +I+ N F A
Sbjct: 511 VLWKKNDEENRSCLNSQLLGQFWCDILGL--PPITDNDKINTALRSIYELN---FRASKY 565
Query: 217 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYE 272
N +K + D D S Q W V++ ++A M+ G V E A + T+ E
Sbjct: 566 CLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHMILRGMVKEGMEVAKREWETIKE 621
>gi|385774575|ref|YP_005647144.1| hypothetical protein [Sulfolobus islandicus HVE10/4]
gi|323478691|gb|ADX83929.1| conserved hypothetical protein [Sulfolobus islandicus HVE10/4]
Length = 661
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 57/236 (24%), Positives = 95/236 (40%), Gaps = 17/236 (7%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
W DL ++ +YRDY D L + Y E+I D I + G D
Sbjct: 393 WTDLGPTLVLMLYRDYVFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSY 452
Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
++Y + ++ALA +++S +L D + ++ L + ++ LW G Y+
Sbjct: 453 DGTHMYGASSYVASMFLSALAAFIKISEIL-DVKIDTKYYRFLECGKKTFNS-LWNGKYF 510
Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
+ S + QL G + G+ I + I AL +I+ N F A
Sbjct: 511 VLWKKNDEENRSCLNSQLLGQFWCDILGL--PPITDNDKINTALRSIYELN---FRASKY 565
Query: 217 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYE 272
N +K + D D S Q W V++ ++A M+ G V E A + T+ E
Sbjct: 566 CLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHMILRGMVKEGMEVAKREWETIKE 621
>gi|284173198|ref|ZP_06387167.1| hypothetical protein Ssol98_00875 [Sulfolobus solfataricus 98/2]
Length = 647
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/236 (23%), Positives = 94/236 (39%), Gaps = 17/236 (7%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
W DL ++ +YRDY + L + Y E+I D I + G D
Sbjct: 379 WTDLGPTLVLMLYRDYVFTSNREILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSY 438
Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
++Y + ++AL ++MS +L D + ++ L + ++ LW G Y+
Sbjct: 439 DGTHMYGASSYIASMFLSALTAFIKMSEIL-DVKIDDKYYRFLECGKKTFNS-LWNGKYF 496
Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
+ S + QL G + G+ I I AL +I+ N F+A
Sbjct: 497 ILWKKNDEENTSCLNSQLLGQFWCDILGL--PPITDHDKINTALRSIYELN---FKASKY 551
Query: 217 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYE 272
N ++ +G D S Q W V++ ++A M+ G V E A + T+ E
Sbjct: 552 CLTNAVREDGSVDSSTAQLRSCWPRVSFAVAAHMILRGMVKEGIEVAKREWETIKE 607
>gi|15899397|ref|NP_344002.1| hypothetical protein SSO2674 [Sulfolobus solfataricus P2]
gi|384433011|ref|YP_005642369.1| hypothetical protein [Sulfolobus solfataricus 98/2]
gi|13815992|gb|AAK42792.1| Conserved hypothetical protein [Sulfolobus solfataricus P2]
gi|261601165|gb|ACX90768.1| protein of unknown function DUF608 [Sulfolobus solfataricus 98/2]
Length = 648
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/236 (23%), Positives = 94/236 (39%), Gaps = 17/236 (7%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
W DL ++ +YRDY + L + Y E+I D I + G D
Sbjct: 380 WTDLGPTLVLMLYRDYVFTSNREILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSY 439
Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
++Y + ++AL ++MS +L D + ++ L + ++ LW G Y+
Sbjct: 440 DGTHMYGASSYIASMFLSALTAFIKMSEIL-DVKIDDKYYRFLECGKKTFNS-LWNGKYF 497
Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
+ S + QL G + G+ I I AL +I+ N F+A
Sbjct: 498 ILWKKNDEENTSCLNSQLLGQFWCDILGL--PPITDHDKINTALRSIYELN---FKASKY 552
Query: 217 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYE 272
N ++ +G D S Q W V++ ++A M+ G V E A + T+ E
Sbjct: 553 CLTNAVREDGSVDSSTAQLRSCWPRVSFAVAAHMILRGMVKEGIEVAKREWETIKE 608
>gi|229583411|ref|YP_002841810.1| hypothetical protein YN1551_3001 [Sulfolobus islandicus Y.N.15.51]
gi|228014127|gb|ACP49888.1| protein of unknown function DUF608 [Sulfolobus islandicus
Y.N.15.51]
Length = 661
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 57/236 (24%), Positives = 95/236 (40%), Gaps = 17/236 (7%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
W DL ++ +YRDY D L + Y E+I D I + G D
Sbjct: 393 WTDLGPTLVLMLYRDYIFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSY 452
Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
++Y + ++ALA +++S +L D + ++ L + ++ LW G Y+
Sbjct: 453 DGTHMYGASSYVASMFLSALAAFIKISEIL-DIKIDTKYYRFLECGKKTFNS-LWNGKYF 510
Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
+ S + QL G + G+ I + I AL +I+ N F A
Sbjct: 511 VLWKKNDEENRSCLNSQLLGQFWCDILGL--PPITDNDKINTALRSIYELN---FRASKY 565
Query: 217 GAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYE 272
N +K + D D S Q W V++ ++A M+ G V E A + T+ E
Sbjct: 566 CLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHMILRGMVKEGMEVAKREWETIKE 621
>gi|12005896|gb|AAG44660.1|AF258662_1 AD035 [Homo sapiens]
Length = 367
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 36/51 (70%)
Query: 28 PEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTC 78
P++EP+ +N Y + D +DWKDLNLKF++ +YRDY L D FL ++P C
Sbjct: 311 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVC 361
>gi|160934323|ref|ZP_02081710.1| hypothetical protein CLOLEP_03194 [Clostridium leptum DSM 753]
gi|156866996|gb|EDO60368.1| hypothetical protein CLOLEP_03194 [Clostridium leptum DSM 753]
Length = 806
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/167 (25%), Positives = 81/167 (48%), Gaps = 20/167 (11%)
Query: 111 LHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY-----KFDSSQSRH 165
L +AAL EM++ L DT ++++ K S+ LLW G YY ++ + ++
Sbjct: 535 LFLAALKAGKEMAAYLGDTAHAEKYAEAFRKGSEKMDSLLWGGDYYIQVIDDINAYKYQY 594
Query: 166 SDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMG-----AVN 220
+ADQ+ G + + + + P +I +A+ ++F N + + A+N
Sbjct: 595 GTGCLADQVFGQMLAHLNHLGY--VLPEEHIKKAIHSVFQYNFKETMEDHVNVQRTYALN 652
Query: 221 GMK-------PNGDRDR-SAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
K P G R + + S+EVW+G+ Y ++A +++EG V+E
Sbjct: 653 QEKGLVLCSWPQGGRPKLPFVYSDEVWSGIEYQVAAHLIFEGFVEEG 699
>gi|269839491|ref|YP_003324183.1| hypothetical protein Tter_2468 [Thermobaculum terrenum ATCC
BAA-798]
gi|269791221|gb|ACZ43361.1| protein of unknown function DUF608 [Thermobaculum terrenum ATCC
BAA-798]
Length = 807
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 58/255 (22%), Positives = 105/255 (41%), Gaps = 51/255 (20%)
Query: 49 DLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELI-----------------RKCESFDKQ 91
D + I+ + R++ L D+ +L RV+P + R+ ++D +
Sbjct: 456 DGQMGSILRVLREWRLSGDMEWLRRVWPGVKSALAYASAQWDGDGDGVLDGRQHNTYDVE 515
Query: 92 GLGIVQNGGFPDQTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLW 151
F CG ++AAL + E++ +L + ++ + S +LLW
Sbjct: 516 ---------FYGPNPLCGLYYLAALRAVEELAEVLGEANLARRCREVFERGSHRLDELLW 566
Query: 152 TGSYYK----FDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTN 207
G YY+ D+ +H +ADQL G L+A + + P ++ +AL +I+ N
Sbjct: 567 NGEYYEQLGDEDAWPYQHGKGCLADQLLGQ--LQAHVLGMGYLVPEEHVRQALRSIWRYN 624
Query: 208 VRGFEAGSMGAVNGMKPNGDRDRSAI----------------QSEEVWTGVTYLLSAAML 251
F A VN + D + + S+EV+TG Y ++A M+
Sbjct: 625 ---FRADLSEHVNCQRTFALNDEAGLLMCTWPRGGQPTFPFPYSDEVFTGSEYQVAAHMI 681
Query: 252 YEGNVDEAWTTAGGL 266
YEG ++E G +
Sbjct: 682 YEGLLEEGLQIVGAV 696
>gi|160931643|ref|ZP_02079037.1| hypothetical protein CLOLEP_00474 [Clostridium leptum DSM 753]
gi|156869288|gb|EDO62660.1| hypothetical protein CLOLEP_00474 [Clostridium leptum DSM 753]
Length = 804
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 51/242 (21%), Positives = 104/242 (42%), Gaps = 37/242 (15%)
Query: 49 DLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGG-------- 100
D L I+ +YR++ + D FL +++ + + S+ V +G
Sbjct: 471 DGQLGSILRVYREWKISGDDDFLKKIWNQVVSALEFSASYWDSNQDCVLDGQQHNTYDIE 530
Query: 101 FPDQTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--- 157
F + ++ AAL EM+ L + ++ Q++ + + ++L+ G YY+
Sbjct: 531 FYGVNSLGNSIYYAALKAGAEMAEYLGEHERSQKWRSMEQAGCKRMDEMLFNGEYYRQVT 590
Query: 158 ---FDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAG 214
D + ++ + ++DQL G G+ +FP ++ A+ I+ N +
Sbjct: 591 DGDIDEYKYQYGEGCLSDQLLGQTLAHLYGLGH--LFPEDHVKSAVFAIYKYNFKE---- 644
Query: 215 SMGAVNGMK----------------PNGDRDRSA-IQSEEVWTGVTYLLSAAMLYEGNVD 257
MG+ ++ P+G R + + S+EVWTG+ Y ++A ++YEG +
Sbjct: 645 RMGSHKSLQRGYAYQDEPGLLLCSWPSGGRPKQPFVYSDEVWTGIEYQVAAGLIYEGFLQ 704
Query: 258 EA 259
E
Sbjct: 705 EG 706
>gi|402589209|gb|EJW83141.1| hypothetical protein WUBG_05945 [Wuchereria bancrofti]
Length = 90
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 26/59 (44%), Positives = 39/59 (66%)
Query: 188 EAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLL 246
A+ P ++I AL I+S+N+ F G +GAVNGM+ +G R +Q++E+W GVTY L
Sbjct: 22 RALLPLSHIKSALNQIYSSNILNFAGGRLGAVNGMRKDGTVGRRHLQADEMWVGVTYAL 80
>gi|86144926|ref|ZP_01063258.1| hypothetical protein MED222_10953 [Vibrio sp. MED222]
gi|85837825|gb|EAQ55937.1| hypothetical protein MED222_10953 [Vibrio sp. MED222]
Length = 1024
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 60/121 (49%), Gaps = 8/121 (6%)
Query: 166 SDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMK 223
SD+ D L +LK G+ E IFP NI RAL ++ N FE S +G N +
Sbjct: 815 SDNSFGDALLADSYLKLIGL--EGIFPQENIQRALDYVYKHN---FEINSPKLGVANMTQ 869
Query: 224 PNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEG 283
+G A Q+++VW GV + ++ A+ G +A T +Y +Y+ + + F PEG
Sbjct: 870 ADG-SPHEAFQAQDVWIGVQFSVATALSLAGKSQQAETLMDTVYTALYDYSKIPFAAPEG 928
Query: 284 L 284
Sbjct: 929 F 929
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 36/148 (24%), Positives = 62/148 (41%), Gaps = 14/148 (9%)
Query: 32 PFSL-INGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
PF + + Y+ +V +WKDL K+I+ +YR Y +DI + + E I +
Sbjct: 596 PFDIQPDAYSWHNVKEWKDLAPKYILMVYRHYQNTQDISVVKECWQAVTESIDFLSNLIA 655
Query: 91 QGLGIVQNGGFPDQ---------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTK 141
+G + G D + YC L +A L E++ L+ +++ + + K
Sbjct: 656 EGDDLPLTRGTDDTFDNLASHGISIYCASLWVAGLQAASELAQLMGESELASGYLTRSKK 715
Query: 142 ASQVYHDLLWTGS--YYKF--DSSQSRH 165
A LW YY F Q++H
Sbjct: 716 ALATVEQSLWDDKEGYYHFFVTPVQAKH 743
>gi|402583425|gb|EJW77369.1| hypothetical protein WUBG_11725, partial [Wuchereria bancrofti]
Length = 109
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 38/57 (66%)
Query: 190 IFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLL 246
+ P ++I AL I+S+N+ F G +GAVNGM+ +G R +Q++E+W GVTY L
Sbjct: 1 LLPLSHIKSALNQIYSSNILNFAGGRLGAVNGMRKDGTVGRRHLQADEMWVGVTYAL 57
>gi|374921957|gb|AFA26156.1| non-lysosomal glucosylceramidase-like protein, partial [Lolium
perenne]
Length = 109
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 51/101 (50%), Gaps = 10/101 (9%)
Query: 86 ESFDKQGLGIVQNGGFPDQT----------AYCGGLHIAALACMVEMSSLLNDTQQQQQF 135
E FD+ +++N GFPDQT AYCG L +A+L M+ L ++
Sbjct: 7 EQFDRDDDSMIENDGFPDQTYDAWTVLGVSAYCGCLWLASLQAAAAMARSLGHADYAERC 66
Query: 136 SAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCG 176
K KA V+ LW GSY+ +DS S S SI ADQL G
Sbjct: 67 MVKFAKAKHVFEAKLWNGSYFNYDSGTSYSSRSIQADQLAG 107
>gi|148976285|ref|ZP_01813009.1| hypothetical protein VSWAT3_18783 [Vibrionales bacterium SWAT-3]
gi|145964379|gb|EDK29634.1| hypothetical protein VSWAT3_18783 [Vibrionales bacterium SWAT-3]
Length = 982
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 50/176 (28%), Positives = 80/176 (45%), Gaps = 29/176 (16%)
Query: 111 LHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIM 170
++I +A V LL++T Q +Q Y DL+ S + SD+++
Sbjct: 739 INIGKVAQRVSKKRLLSETAPQ--------AFTQEYLDLV--------PDSDNSFSDALL 782
Query: 171 ADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMKPNGDR 228
AD YL L + E IFP I RAL ++ N FE S +G N + +G
Sbjct: 783 AD---SYLRL----IGLEGIFPQQRIQRALDYVYKHN---FEINSPKLGVANMTRADG-S 831
Query: 229 DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGL 284
A Q+++VW GV + ++ A+ G +A T +Y +Y+ + + F PEG
Sbjct: 832 PHEAFQAQDVWIGVQFSVATALSLAGKSQQAETLMDTVYTALYDYSKIPFAAPEGF 887
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 35/148 (23%), Positives = 59/148 (39%), Gaps = 14/148 (9%)
Query: 32 PFSL-INGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
PF + + Y+ +V +WKDL K+I+ +YR Y +D+ + + E I +
Sbjct: 554 PFDIQPDAYSWHNVKEWKDLAPKYILMVYRHYQNTQDMSVVKECWQAVTESIDFLSNLIA 613
Query: 91 QGLGIVQNGGFPDQ---------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTK 141
G + G D + YC L +A L E++ L+ + + + K
Sbjct: 614 DGDDLPLTRGTDDTFDNLASHGISIYCASLWVAGLQAASELAKLMGEHDLGAGYLTRSKK 673
Query: 142 ASQVYHDLLWTGS--YYKF--DSSQSRH 165
A LW YY F Q++H
Sbjct: 674 ALATVEQSLWDEKEGYYHFFVTPVQAKH 701
>gi|84386772|ref|ZP_00989797.1| hypothetical protein V12B01_19101 [Vibrio splendidus 12B01]
gi|84378300|gb|EAP95158.1| hypothetical protein V12B01_19101 [Vibrio splendidus 12B01]
Length = 908
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 59/121 (48%), Gaps = 8/121 (6%)
Query: 166 SDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMK 223
SD+ D L +LK G+ E IFP NI RAL ++ N FE S +G N
Sbjct: 699 SDNSFGDALLADSYLKLIGL--EGIFPQENIQRALDYVYKHN---FEINSPKLGVANMTL 753
Query: 224 PNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEG 283
+G A Q+++VW GV + ++ A+ G +A T +Y +Y+ + + F PEG
Sbjct: 754 ADG-SPHEAFQAQDVWIGVQFSVATALSLAGKNQQAETLMDTVYTALYDYSKIPFAAPEG 812
Query: 284 L 284
Sbjct: 813 F 813
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 36/148 (24%), Positives = 63/148 (42%), Gaps = 14/148 (9%)
Query: 32 PFSL-INGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
PF + + Y+ +V +WKDL K+I+ +YR Y +DI + + E I +
Sbjct: 480 PFDIQPDAYSWHNVKEWKDLAPKYILMVYRHYQNTQDISVVKECWQAVTESIDFLSNLIA 539
Query: 91 QGLGIVQNGGFPDQ---------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTK 141
+G + G D + YC L +A L E++ L+ ++++ + + K
Sbjct: 540 EGDDLPLTRGTDDTFDNLASHGISIYCASLWVAGLQAASELAQLMGESERANGYLTRSKK 599
Query: 142 ASQVYHDLLWTGS--YYKF--DSSQSRH 165
A LW YY F Q++H
Sbjct: 600 ALATVEQSLWDDKEGYYHFFVTPVQAKH 627
>gi|15899746|ref|NP_344351.1| hypothetical protein SSO3039 [Sulfolobus solfataricus P2]
gi|13816440|gb|AAK43141.1| Hypothetical protein SSO3039 [Sulfolobus solfataricus P2]
Length = 803
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 81/180 (45%), Gaps = 24/180 (13%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
+ Y L IAAL + E S LLN+ + +LT+ Y LW G YY +D
Sbjct: 547 SPYVSFLWIAALEALNEASKLLNNLHNYE--VKELTEKVNKY---LWNGEYYINWYDPIS 601
Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNG- 221
+ DS+ A Q+ G +++ G+ P +IFS+ ++ G NG
Sbjct: 602 NLRDDSLNASQMTGDWYVQLLGL------PEFLDHEKRKSIFSSIIKYNYTEEEGVKNGS 655
Query: 222 ----MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG 277
+ P G + ++QS+ W+G+ Y L++ M Y G DE A + R VYER L
Sbjct: 656 SNKEITPLGVK--LSVQSKTPWSGIEYYLASHMFYNG-FDEY---AKKILRNVYERYELA 709
>gi|284174574|ref|ZP_06388543.1| hypothetical protein Ssol98_07941 [Sulfolobus solfataricus 98/2]
gi|356934758|gb|AET42959.1| hypothetical protein [Sulfolobus solfataricus 98/2]
Length = 803
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 81/180 (45%), Gaps = 24/180 (13%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
+ Y L IAAL + E S LLN+ + +LT+ Y LW G YY +D
Sbjct: 547 SPYVSFLWIAALEALNEASKLLNNLHNYE--VKELTEKVNKY---LWNGEYYINWYDPIS 601
Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNG- 221
+ DS+ A Q+ G +++ G+ P +IFS+ ++ G NG
Sbjct: 602 NLRDDSLNASQMTGDWYVQLLGL------PEFLDHEKRKSIFSSIIKYNYTEEEGVKNGS 655
Query: 222 ----MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG 277
+ P G + ++QS+ W+G+ Y L++ M Y G DE A + R VYER L
Sbjct: 656 SNKEITPLGVK--LSVQSKTPWSGIEYYLASHMFYNG-FDEY---AKKILRNVYERYELA 709
>gi|218677125|ref|YP_002395944.1| hypothetical protein VS_II1384 [Vibrio splendidus LGP32]
gi|218325393|emb|CAV27494.1| hypothetical protein VS_II1384 [Vibrio splendidus LGP32]
Length = 1069
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 63/138 (45%), Gaps = 8/138 (5%)
Query: 166 SDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMK 223
SD+ D L +LK G+ + IFP NI RAL ++ N FE S +G N
Sbjct: 860 SDNSFGDALLADSYLKLIGL--DGIFPQENIQRALDYVYKHN---FEINSPKLGVANMTL 914
Query: 224 PNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEG 283
+G A Q+++VW GV + ++ A+ G +A T +Y +Y+ + + F PEG
Sbjct: 915 ADG-SPHEAFQAQDVWIGVQFSVATALSLAGKNQQAETLMDTVYTALYDYSKIPFAAPEG 973
Query: 284 LTGDKTYRSGGYMRALAV 301
+ M A +
Sbjct: 974 FNCSVSVNEKDLMEAFKL 991
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 37/148 (25%), Positives = 63/148 (42%), Gaps = 14/148 (9%)
Query: 32 PFSL-INGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
PF + + Y+ +V +WKDL K+I+ +YR Y +DI + + E I +
Sbjct: 641 PFDIQPDAYSWHNVKEWKDLAPKYILMVYRHYQNTQDISVVKECWQAVTESIDFLSNLIA 700
Query: 91 QGLGIVQNGGFPDQ---------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTK 141
+G + G D + YC L +A L E++ L+N+++ + + K
Sbjct: 701 EGDDLPLTRGTDDTFDNLASHGISIYCASLWVAGLQAASELAKLMNESELAAGYLTRSKK 760
Query: 142 ASQVYHDLLWTGS--YYKF--DSSQSRH 165
A LW YY F Q++H
Sbjct: 761 ALATVEQSLWDDKEGYYHFFVTPVQAKH 788
>gi|374815490|ref|ZP_09719227.1| hypothetical protein TpriZ_16630 [Treponema primitia ZAS-1]
Length = 814
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 57/237 (24%), Positives = 95/237 (40%), Gaps = 30/237 (12%)
Query: 55 IVSIYRDYALHKDIRFLTRVYPTC-LELIRKCESFDKQGLGIVQNGGFPDQTAYCGGL-- 111
IV +YRD+ + D FL ++P L L ++D G ++ D G+
Sbjct: 478 IVRLYRDWLISGDDAFLKELWPKAKLALDYGIRTWDTDGDFVLDGMKHVDYDVEFYGIDP 537
Query: 112 -----HIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKF-------D 159
++AAL V+++ L D ++ AS +W F D
Sbjct: 538 LGNLCYLAALKAAVKIAEYLKDEGAHNRYQTIYDTASVQADKFMWDSKAGYFIQVLKDVD 597
Query: 160 SSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNV----------- 208
+ +H +ADQL G + +G+ + + +I +A +IF N
Sbjct: 598 EFKYQHGTGCLADQLIGQFYAHLTGLG--PLVDAGHIKQAAQSIFKYNFISDFTDHPNMQ 655
Query: 209 RGFEAGS-MGAVNGMKPNGDRDRSAI-QSEEVWTGVTYLLSAAMLYEGNVDEAWTTA 263
RG+ G + P G R R SEE W+ Y +++ ++YEG +DE T A
Sbjct: 656 RGYAVNDDKGLLMTTWPLGGRPRYPFFYSEEAWSRTEYHVASTLIYEGFLDEGLTIA 712
>gi|15898194|ref|NP_342799.1| hypothetical protein SSO1353 [Sulfolobus solfataricus P2]
gi|13814563|gb|AAK41589.1| Hypothetical protein SSO1353 [Sulfolobus solfataricus P2]
Length = 663
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 76/169 (44%), Gaps = 14/169 (8%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLEL----IRKCESFDKQGLGIVQNGG-- 100
WKD+N I+ +YR + DI FL VYP +++ +R+C+ G + N
Sbjct: 404 WKDMNPSLILLVYRYFKFTNDIEFLKEVYPILVKVMDWELRQCKGNLPFMEGEMDNAFDA 463
Query: 101 --FPDQTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKF 158
+Y L I +L M E++ L+ D+ S KL+ A + + + G Y+K
Sbjct: 464 TIIKGHDSYTSSLFIGSLIAMREIAKLVGDSNYVDFISEKLSSAREAFRRMF-NGRYFK- 521
Query: 159 DSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTN 207
+ S + S +A QL G F G+ E I I +AL +I N
Sbjct: 522 -AWDSVDNASFLA-QLYGEWFTTLVGL--EDIVEEDIIKKALESIIRLN 566
>gi|417950561|ref|ZP_12593680.1| hypothetical protein VISP3789_13285 [Vibrio splendidus ATCC 33789]
gi|342806239|gb|EGU41472.1| hypothetical protein VISP3789_13285 [Vibrio splendidus ATCC 33789]
Length = 1046
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 58/121 (47%), Gaps = 8/121 (6%)
Query: 166 SDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMK 223
SD+ D L +LK G+ E IFP I RAL ++ N FE S +G N
Sbjct: 837 SDNSFGDALLADSYLKLIGL--EGIFPKQRIQRALDYVYKHN---FEINSPKLGVANMTL 891
Query: 224 PNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEG 283
+G A Q+++VW GV + ++ A+ G +A T +Y +Y+ + + F PEG
Sbjct: 892 ADG-SPHEAFQAQDVWIGVQFSVATALSLAGKSQQAETLMDTVYTALYDYSKIPFAAPEG 950
Query: 284 L 284
Sbjct: 951 F 951
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 36/148 (24%), Positives = 60/148 (40%), Gaps = 14/148 (9%)
Query: 32 PFSL-INGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
PF + + Y+ +V +WKDL K+I+ +YR Y +DI + + E I +
Sbjct: 618 PFDIQPDAYSWHNVKEWKDLAPKYILMVYRHYQKTQDISVVKECWQAVTESIDFLSNLIA 677
Query: 91 QGLGIVQNGGFPDQ---------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTK 141
+G + G D + YC L +A L E++ L+ + + + K
Sbjct: 678 EGDDLPLTRGTDDTFDNLASHGISIYCASLWVAGLQAASELAKLMGEHDLGAGYLTRSKK 737
Query: 142 ASQVYHDLLWTGS--YYKF--DSSQSRH 165
A LW YY F Q++H
Sbjct: 738 ALATVEQSLWDEKEGYYHFFVTPVQAKH 765
>gi|407069309|ref|ZP_11100147.1| hypothetical protein VcycZ_07137 [Vibrio cyclitrophicus ZF14]
Length = 1024
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 58/121 (47%), Gaps = 8/121 (6%)
Query: 166 SDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS--MGAVNGMK 223
SD+ D L +LK G+ E IFP I RAL ++ N FE S +G N
Sbjct: 815 SDNSFGDALLADSYLKLIGL--EGIFPEDRIQRALDYVYKHN---FEINSPKLGVANMTL 869
Query: 224 PNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEG 283
+G A Q+++VW GV + ++ A+ G +A T +Y +Y+ + + F PEG
Sbjct: 870 ADG-SPHEAFQAQDVWIGVQFSVATALNLAGKSQQAETLMDTVYTALYDYSKIPFAAPEG 928
Query: 284 L 284
Sbjct: 929 F 929
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/148 (25%), Positives = 63/148 (42%), Gaps = 14/148 (9%)
Query: 32 PFSL-INGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDK 90
PF + + Y+ +V +WKDL K+I+ +YR Y +D+ + + E I +
Sbjct: 596 PFDIQPDAYSWHNVKEWKDLAPKYILMVYRHYQNTQDLSVVKECWQAVTESIEFLSNLVA 655
Query: 91 QGLGIVQNGGFPDQ---------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTK 141
+G + G D + YC L +A L E++ L+N+ + + A+ K
Sbjct: 656 EGDDLPLTRGTDDTFDNLASHGISIYCASLWVAGLQAANELALLMNENELAADYLARSKK 715
Query: 142 ASQVYHDLLWTGS--YYKF--DSSQSRH 165
A LW YY F Q++H
Sbjct: 716 ALATVEQSLWDDKEGYYHFFVTPVQAKH 743
>gi|302348409|ref|YP_003816047.1| trehalase [Acidilobus saccharovorans 345-15]
gi|302328821|gb|ADL19016.1| Putative trehalase [Acidilobus saccharovorans 345-15]
Length = 928
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 48/173 (27%), Positives = 80/173 (46%), Gaps = 25/173 (14%)
Query: 149 LLWTGSYYK--FDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFST 206
LLW GSY+ +D +I++ QL G +L+ G+ S + AL ++
Sbjct: 656 LLWNGSYFDDWYDPESGLRDRAILSAQLTGEWYLQVLGLGDGV--DSEKVRSALREVYRA 713
Query: 207 NVRGFEAGSMGAVNGM----------KPNGDR--DRSAIQSEEVWTGVTYLLSAAMLYEG 254
N R +E G G +PNG + +R + Q++ WTGV + +++ MLYEG
Sbjct: 714 NFRRWEGLLNGTYPGTPRPSMVGDVEEPNGTKILNRVSSQADTPWTGVEFGVASQMLYEG 773
Query: 255 NVDEAWTTAGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
V+EA + R++++R GL + G Y R LA A+ ++
Sbjct: 774 MVEEAME----VLRSIHDRY-----RSWGLYFNHLECDGHYSRPLAALAIPNS 817
>gi|385774246|ref|YP_005646813.1| bile acid beta-glucosidase [Sulfolobus islandicus HVE10/4]
gi|323478361|gb|ADX83599.1| bile acid beta-glucosidase [Sulfolobus islandicus HVE10/4]
Length = 818
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 86/203 (42%), Gaps = 21/203 (10%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
+ Y L I+AL + E S LLN+ Q + + + + + LW G YY +D
Sbjct: 562 SPYVSFLWISALEALNEASKLLNNPQNYE-----VKELIEKVNKYLWNGEYYMDWYDPIS 616
Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNG- 221
+ DS A Q+ G +++ ++ ++I N G E G+ N
Sbjct: 617 NLRDDSSNASQITGDWYVQM--LDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDD 674
Query: 222 MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETP 281
+ P G + +IQS+ W+GV Y L++ M Y G DE A + R VYER +
Sbjct: 675 ITPLGVK--LSIQSKAPWSGVEYYLASHMFYSG-FDEY---AKKILRNVYERYEIA---- 724
Query: 282 EGLTGDKTYRSGGYMRALAVYAM 304
G + YMR L +M
Sbjct: 725 -GSFWNHIEWGARYMRPLVALSM 746
>gi|261417086|ref|YP_003250769.1| glucosylceramidase [Fibrobacter succinogenes subsp. succinogenes
S85]
gi|385791890|ref|YP_005823013.1| hypothetical protein FSU_3272 [Fibrobacter succinogenes subsp.
succinogenes S85]
gi|261373542|gb|ACX76287.1| Glucosylceramidase [Fibrobacter succinogenes subsp. succinogenes
S85]
gi|302325406|gb|ADL24607.1| conserved domain protein [Fibrobacter succinogenes subsp.
succinogenes S85]
Length = 1047
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 58/121 (47%), Gaps = 4/121 (3%)
Query: 167 DSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNG 226
D + AD + +L+ ++ E I SA L IF+TN + + +GA N ++ +G
Sbjct: 834 DDVFADTMLADTYLRL--LDLEPITDSAKAKSNLLRIFNTNYKA-NSPLIGAANLVRKDG 890
Query: 227 DR-DRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLT 285
D Q+ +VW G+ Y + AM++ G EA A + +Y+ + F PEG
Sbjct: 891 SPLDEFNFQAHDVWIGIQYSIMTAMMFHGLEKEASVLADSMIGNLYDEARVPFAAPEGFN 950
Query: 286 G 286
G
Sbjct: 951 G 951
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/133 (25%), Positives = 60/133 (45%), Gaps = 11/133 (8%)
Query: 37 NGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIV 96
+ Y+ +V +WKDL K+++ +YR Y KD++ L + E G
Sbjct: 614 DAYDWHNVKEWKDLAPKYVLMVYRHYHKTKDMQCLADCKEAVYAAMEYLEKMVNPGENFP 673
Query: 97 QNGGFPDQ---------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYH 147
G D + YCG L IA L +++ LL D +Q +++ K A++ +
Sbjct: 674 LTHGTDDTFDNLCSYGISVYCGSLWIAGLRAAAKIAELLGDNEQAAKWNEKSEAANKEFT 733
Query: 148 DLLWTGS--YYKF 158
+ LW + Y++F
Sbjct: 734 ESLWDENEGYFRF 746
>gi|284997264|ref|YP_003419031.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
gi|284445159|gb|ADB86661.1| protein of unknown function DUF608 [Sulfolobus islandicus L.D.8.5]
Length = 740
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 51/213 (23%), Positives = 90/213 (42%), Gaps = 19/213 (8%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
WKD N FI+ +YR Y D+ FL +YP L+ ++ + DK G+ + G D
Sbjct: 474 WKDTNPTFILLVYRYYKFTNDLDFLKEMYPKMLKAMKWELTQDKDKDGVPELEGQGDTGF 533
Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
+Y ++IA++ + E + +L D + + L K+ +++ L+ G Y
Sbjct: 534 DAMSVKGIDSYTTSIYIASIIALKETAKILKDNDTLNEMTILLEKSRKIFSR-LFNGKY- 591
Query: 157 KFDS-SQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGS 215
FD I QL G + + G+ E I I+ AL + S N + +
Sbjct: 592 -FDPWIGEPEIKGIFLGQLVGEWWSEILGL--EPIIEEEKISSALEAMLSINAN---SSN 645
Query: 216 MGAVNGMKPNGDRDRSAIQSEEVWTGVTYLLSA 248
N + +G + QS W + + +S+
Sbjct: 646 FCTPNIVSEDGKIIDISPQSYSSWPRLVFAISS 678
>gi|385776902|ref|YP_005649470.1| bile acid beta-glucosidase [Sulfolobus islandicus REY15A]
gi|323475650|gb|ADX86256.1| bile acid beta-glucosidase [Sulfolobus islandicus REY15A]
Length = 818
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 86/203 (42%), Gaps = 21/203 (10%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
+ Y L I+AL + E S LLN+ Q + + + + + LW G YY +D
Sbjct: 562 SPYVSFLWISALEALNEASKLLNNPQNYE-----VKELIEKVNKYLWNGEYYMDWYDPIS 616
Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNG- 221
+ DS A Q+ G +++ ++ ++I N G E G+ N
Sbjct: 617 NLRDDSSNASQITGDWYVQM--LDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDD 674
Query: 222 MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETP 281
+ P G + +IQS+ W+GV Y L++ M Y G DE A + R VYER +
Sbjct: 675 ITPLGVK--LSIQSKAPWSGVEYYLASHMFYSG-FDEY---AKKILRNVYERYEIA---- 724
Query: 282 EGLTGDKTYRSGGYMRALAVYAM 304
G + YMR L +M
Sbjct: 725 -GSFWNHIEWGARYMRPLVALSM 746
>gi|15898746|ref|NP_343351.1| hypothetical protein SSO1948 [Sulfolobus solfataricus P2]
gi|229578524|ref|YP_002836922.1| hypothetical protein YG5714_0710 [Sulfolobus islandicus Y.G.57.14]
gi|229582969|ref|YP_002841368.1| hypothetical protein YN1551_2495 [Sulfolobus islandicus Y.N.15.51]
gi|284173504|ref|ZP_06387473.1| hypothetical protein Ssol98_02460 [Sulfolobus solfataricus 98/2]
gi|384435081|ref|YP_005644439.1| hypothetical protein [Sulfolobus solfataricus 98/2]
gi|13815223|gb|AAK42141.1| Hypothetical protein SSO1948 [Sulfolobus solfataricus P2]
gi|228009238|gb|ACP45000.1| protein of unknown function DUF608 [Sulfolobus islandicus
Y.G.57.14]
gi|228013685|gb|ACP49446.1| protein of unknown function DUF608 [Sulfolobus islandicus
Y.N.15.51]
gi|261603235|gb|ACX92838.1| protein of unknown function DUF608 [Sulfolobus solfataricus 98/2]
Length = 661
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 92/213 (43%), Gaps = 27/213 (12%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLEL----IRKCESFDKQGLGIVQ---NG 99
WKD+N I+ +YR + DI FL VYPT +++ +R+C + GL ++ +
Sbjct: 402 WKDMNPSLILLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQC----RDGLPFMEGEMDN 457
Query: 100 GFPDQT------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTG 153
F D T +Y L IA+L M E++ L+ D+ + KL A + + + G
Sbjct: 458 AF-DATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLNVAREAFRKMF-NG 515
Query: 154 SYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEA 213
Y+K + S +A QL G F + + E I I AL +I N A
Sbjct: 516 KYFK--AWDGVDKASFLA-QLYGEWF--TTLLELENIVDENMIKSALESIIRLNGN---A 567
Query: 214 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLL 246
N + NG ++Q+ W + + +
Sbjct: 568 SPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAI 600
>gi|227827237|ref|YP_002829016.1| hypothetical protein M1425_0924 [Sulfolobus islandicus M.14.25]
gi|227459032|gb|ACP37718.1| protein of unknown function DUF608 [Sulfolobus islandicus M.14.25]
Length = 661
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 55/213 (25%), Positives = 93/213 (43%), Gaps = 27/213 (12%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLEL----IRKCESFDKQGLGIVQ---NG 99
WKD+N ++ +YR + DI FL VYPT +++ +R+C + GL ++ +
Sbjct: 402 WKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQC----RDGLPFMEGEMDN 457
Query: 100 GFPDQT------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTG 153
F D T +Y L IA+L M E++ L+ D+ + KL+ A + + + G
Sbjct: 458 AF-DATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLSVAREAFRKMF-NG 515
Query: 154 SYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEA 213
Y+K + S +A QL G F + + E I I AL +I N A
Sbjct: 516 KYFK--AWDGVDKASFLA-QLYGEWF--TTLLELENIVDENMIKSALESIIRLNGN---A 567
Query: 214 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLL 246
N + NG ++Q+ W + + +
Sbjct: 568 SPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAI 600
>gi|227829616|ref|YP_002831395.1| hypothetical protein LS215_0672 [Sulfolobus islandicus L.S.2.15]
gi|227456063|gb|ACP34750.1| protein of unknown function DUF608 [Sulfolobus islandicus L.S.2.15]
Length = 661
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 92/213 (43%), Gaps = 27/213 (12%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLEL----IRKCESFDKQGLGIVQ---NG 99
WKD+N I+ +YR + DI FL VYPT +++ +R+C + GL ++ +
Sbjct: 402 WKDMNPSLILLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQC----RDGLPFMEGEMDN 457
Query: 100 GFPDQT------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTG 153
F D T +Y L IA+L M E++ L+ D+ + KL A + + + G
Sbjct: 458 AF-DATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLNVAREAFRKMF-NG 515
Query: 154 SYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEA 213
Y+K + S +A QL G F + + E I I AL +I N A
Sbjct: 516 KYFK--AWDGVDKASFLA-QLYGEWF--TTLLELENIVDENMIKSALESIIRLNGN---A 567
Query: 214 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLL 246
N + NG ++Q+ W + + +
Sbjct: 568 SPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAI 600
>gi|159041841|ref|YP_001541093.1| hypothetical protein Cmaq_1276 [Caldivirga maquilingensis IC-167]
gi|157920676|gb|ABW02103.1| protein of unknown function DUF608 [Caldivirga maquilingensis
IC-167]
Length = 688
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 54/121 (44%), Gaps = 11/121 (9%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
WKDLN +I+ ++R + D+ F+ +YP ++ DK G G+ + G D
Sbjct: 412 WKDLNSTYILLVHRYFKRSNDVEFIKEIYPKLIKAFEWVLVQDKDGDGVPELSGDGDTGF 471
Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
+Y L IAAL M E++ L+ D + + L KA Y + W G +
Sbjct: 472 DAMSVKGFDSYTTSLWIAALMVMGELAKLMGDQATLSKVESTLLKARDSY-NRRWLGDRF 530
Query: 157 K 157
K
Sbjct: 531 K 531
>gi|238619389|ref|YP_002914214.1| hypothetical protein M164_0933 [Sulfolobus islandicus M.16.4]
gi|238380458|gb|ACR41546.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.4]
Length = 661
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 54/213 (25%), Positives = 91/213 (42%), Gaps = 27/213 (12%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLEL----IRKCESFDKQGLGIVQ---NG 99
WKD+N ++ +YR + DI FL VYPT +++ +R+C + GL ++ +
Sbjct: 402 WKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMNWELRQC----RDGLPFMEGEMDN 457
Query: 100 GFPDQT------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTG 153
F D T +Y L IA+L M E++ L+ D+ + KL+ A + + + G
Sbjct: 458 AF-DATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLSVAREAFRKMF-NG 515
Query: 154 SYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEA 213
Y+K + S QL G F + + E I I AL +I N A
Sbjct: 516 KYFKAWDGVDKAS---FLAQLYGEWF--TTLLELENIVDENMIKSALESIIRLNGN---A 567
Query: 214 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLL 246
N + NG ++Q+ W + + +
Sbjct: 568 SPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAI 600
>gi|229584458|ref|YP_002842959.1| hypothetical protein M1627_0991 [Sulfolobus islandicus M.16.27]
gi|228019507|gb|ACP54914.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.27]
Length = 661
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 55/213 (25%), Positives = 93/213 (43%), Gaps = 27/213 (12%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLEL----IRKCESFDKQGLGIVQ---NG 99
WKD+N ++ +YR + DI FL VYPT +++ +R+C + GL ++ +
Sbjct: 402 WKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQC----RDGLPFMEGEMDN 457
Query: 100 GFPDQT------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTG 153
F D T +Y L IA+L M E++ L+ D+ + KL+ A + + + G
Sbjct: 458 AF-DATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLSVAREAFRKMF-NG 515
Query: 154 SYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEA 213
Y+K + S +A QL G F + + E I I AL +I N A
Sbjct: 516 KYFK--AWDGVDKASFLA-QLYGEWF--TTLLELENIVDENMIKSALESIIRLNGN---A 567
Query: 214 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLL 246
N + NG ++Q+ W + + +
Sbjct: 568 SPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAI 600
>gi|385775519|ref|YP_005648087.1| hypothetical protein [Sulfolobus islandicus REY15A]
gi|323474267|gb|ADX84873.1| conserved hypothetical protein [Sulfolobus islandicus REY15A]
Length = 661
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 55/213 (25%), Positives = 93/213 (43%), Gaps = 27/213 (12%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLEL----IRKCESFDKQGLGIVQ---NG 99
WKD+N ++ +YR + DI FL VYPT +++ +R+C + GL ++ +
Sbjct: 402 WKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQC----RDGLPFMEGEMDN 457
Query: 100 GFPDQT------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTG 153
F D T +Y L IA+L M E++ L+ D+ + KL+ A + + + G
Sbjct: 458 AF-DATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLSVAREAFRKMF-NG 515
Query: 154 SYYKFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEA 213
Y+K + S +A QL G F + + E I I AL +I N A
Sbjct: 516 KYFK--AWDGVDKASFLA-QLYGEWF--TTLLELENIADENMIKSALESIIRLNGN---A 567
Query: 214 GSMGAVNGMKPNGDRDRSAIQSEEVWTGVTYLL 246
N + NG ++Q+ W + + +
Sbjct: 568 SPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAI 600
>gi|385772401|ref|YP_005644967.1| hypothetical protein [Sulfolobus islandicus HVE10/4]
gi|323476515|gb|ADX81753.1| conserved hypothetical protein [Sulfolobus islandicus HVE10/4]
Length = 661
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 62/124 (50%), Gaps = 19/124 (15%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLEL----IRKCESFDKQGLGIVQ---NG 99
WKD+N ++ +YR + DI FL VYPT +++ +R+C + GL ++ +
Sbjct: 402 WKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQC----RDGLPFMEGEMDN 457
Query: 100 GFPDQT------AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTG 153
F D T +Y L IA+L M E++ L+ D+ + KL+ A + + + G
Sbjct: 458 AF-DATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLSVAREAFRKMF-NG 515
Query: 154 SYYK 157
Y+K
Sbjct: 516 KYFK 519
>gi|327309977|ref|YP_004336874.1| hypothetical protein TUZN_0057 [Thermoproteus uzoniensis 768-20]
gi|326946456|gb|AEA11562.1| hypothetical protein TUZN_0057 [Thermoproteus uzoniensis 768-20]
Length = 694
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/229 (24%), Positives = 93/229 (40%), Gaps = 19/229 (8%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
WKD N FI+ +YR Y DI F+ VY + R D+ G G+ + G D
Sbjct: 414 WKDTNPTFILLVYRYYKRTGDIDFVKSVYGRLVAAYRWMRGQDRDGDGLPELEGSGDSGF 473
Query: 105 --------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY 156
+AY L IAAL + +++ + D++ ++ + L +A + SY
Sbjct: 474 DCAKIRGPSAYTSSLWIAALVALRDLAEAVGDSKTAEEAGSDLERARNSFERSFDGVSYK 533
Query: 157 KFDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSM 216
+D + ++ A G +L +G + E +A AL+ + N +
Sbjct: 534 AWDGAP---EGAVFAALAVGDWWLYLAG-SREGAARRDRLASALSAVLRVNGSASANCAP 589
Query: 217 GAVNGMKPNGD---RDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTT 262
A G +P D +R Q W + + + AA+ Y D AW +
Sbjct: 590 NASTG-RPEADGEISERVGRQLTASWPRLIFSV-AALGYALTGDAAWLS 636
>gi|15921102|ref|NP_376771.1| hypothetical protein ST0868 [Sulfolobus tokodaii str. 7]
gi|15621887|dbj|BAB65880.1| putative glycosidase [Sulfolobus tokodaii str. 7]
Length = 683
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 46/92 (50%), Gaps = 10/92 (10%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ-- 104
WKD N F++ +YR Y KD FL VYP + + S DK G G+ + G DQ
Sbjct: 410 WKDTNTTFVLMVYRYYLRTKDKEFLKSVYPYVKKAMSWIISKDKDGDGLPEVDGSTDQGF 469
Query: 105 --------TAYCGGLHIAALACMVEMSSLLND 128
+Y ++IAAL M++++ ++ D
Sbjct: 470 DCVPIEGVCSYISTVYIAALEAMIKIAEIVGD 501
>gi|229581136|ref|YP_002839535.1| hypothetical protein YN1551_0454 [Sulfolobus islandicus Y.N.15.51]
gi|228011852|gb|ACP47613.1| protein of unknown function DUF608 [Sulfolobus islandicus
Y.N.15.51]
Length = 816
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 85/203 (41%), Gaps = 21/203 (10%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
+ Y L I+AL + E S LL++ Q + + + + + LW G YY +D
Sbjct: 560 SPYVSFLWISALEALNEASKLLSNPQNYE-----VKELIEKVNKYLWNGEYYMDWYDPIS 614
Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNG- 221
+ DS A Q+ G +++ ++ ++I N G E G+ N
Sbjct: 615 NLRDDSSNASQITGDWYVQM--LDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDD 672
Query: 222 MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETP 281
+ P G + +IQS+ W+GV Y L+ M Y G DE A + R VYER +
Sbjct: 673 ITPLGVK--LSIQSKAPWSGVEYYLALHMFYSG-FDEY---AKKILRNVYERYEIA---- 722
Query: 282 EGLTGDKTYRSGGYMRALAVYAM 304
G + YMR L +M
Sbjct: 723 -GNFWNHIEWGARYMRPLVALSM 744
>gi|410638454|ref|ZP_11349015.1| hypothetical protein GLIP_3609 [Glaciecola lipolytica E3]
gi|410141863|dbj|GAC16220.1| hypothetical protein GLIP_3609 [Glaciecola lipolytica E3]
Length = 1124
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 49/234 (20%), Positives = 97/234 (41%), Gaps = 26/234 (11%)
Query: 49 DLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIR-KCESFDKQGLGIVQN-------GG 100
D + I++ YR++ + +L +P + + +D +G + N G
Sbjct: 720 DGHFGTILNTYREHQCSANDSWLKSQWPRVKKAVEWGINEWDPNRIGFMSNTQHNTLDGN 779
Query: 101 FPDQTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDS 160
+++ G L++ +L M+ ++ + Q +++ A ++ + LW G YY +
Sbjct: 780 ITGCSSWIGSLYLTSLEAAARMADIVGEPQTAKEYRAIRESGKKLQNKRLWNGEYYIQEE 839
Query: 161 SQSRHSDSI---MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRG-FEAGSM 216
+ R D + DQL G + A +N + +P +A++ + N R F S+
Sbjct: 840 GKERFQDYLDGCHIDQLLGEWW--ADQLNIDRNYPRERAQQAMSALLKYNFRADFYGQSL 897
Query: 217 GAVN-------GMK----PNGDRDRSAIQ-SEEVWTGVTYLLSAAMLYEGNVDE 258
GMK P G + ++ +EV TG Y + M+ G +DE
Sbjct: 898 KPRQYCEIEDGGMKMITWPRGQQPIPGMKYGDEVMTGFEYGAAVTMMQNGMIDE 951
>gi|284998829|ref|YP_003420597.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
gi|284446725|gb|ADB88227.1| hypothetical protein LD85_2606 [Sulfolobus islandicus L.D.8.5]
Length = 803
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 85/203 (41%), Gaps = 21/203 (10%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
+ Y L I+AL + E S LL++ Q + + + + + LW G YY +D
Sbjct: 547 SPYVSFLWISALEALNEASKLLSNPQNYE-----VKELIEKVNKYLWNGEYYMDWYDPIS 601
Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNG- 221
+ DS A Q+ G +++ ++ ++I N G E G+ N
Sbjct: 602 NLRDDSSNASQITGDWYVQM--LDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDD 659
Query: 222 MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETP 281
+ P G + +IQS+ W+GV Y L+ M Y G DE A + R VYER +
Sbjct: 660 ITPLGVK--LSIQSKAPWSGVEYYLALHMFYSG-FDEY---AKKILRNVYERYEIA---- 709
Query: 282 EGLTGDKTYRSGGYMRALAVYAM 304
G + YMR L +M
Sbjct: 710 -GNFWNHIEWGARYMRPLVALSM 731
>gi|227831316|ref|YP_002833096.1| hypothetical protein LS215_2489 [Sulfolobus islandicus L.S.2.15]
gi|229580221|ref|YP_002838621.1| hypothetical protein YG5714_2457 [Sulfolobus islandicus Y.G.57.14]
gi|227457764|gb|ACP36451.1| protein of unknown function DUF608 [Sulfolobus islandicus L.S.2.15]
gi|228010937|gb|ACP46699.1| protein of unknown function DUF608 [Sulfolobus islandicus
Y.G.57.14]
Length = 818
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 85/203 (41%), Gaps = 21/203 (10%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
+ Y L I+AL + E S LL++ Q + + + + + LW G YY +D
Sbjct: 562 SPYVSFLWISALEALNEASKLLSNPQNYE-----VKELIEKVNKYLWNGEYYMDWYDPIS 616
Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNG- 221
+ DS A Q+ G +++ ++ ++I N G E G+ N
Sbjct: 617 NLRDDSSNASQITGDWYVQM--LDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDD 674
Query: 222 MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETP 281
+ P G + +IQS+ W+GV Y L+ M Y G DE A + R VYER +
Sbjct: 675 ITPLGVK--LSIQSKAPWSGVEYYLALHMFYSG-FDEY---AKKILRNVYERYEIA---- 724
Query: 282 EGLTGDKTYRSGGYMRALAVYAM 304
G + YMR L +M
Sbjct: 725 -GNFWNHIEWGARYMRPLVALSM 746
>gi|229585796|ref|YP_002844298.1| hypothetical protein M1627_2396 [Sulfolobus islandicus M.16.27]
gi|228020846|gb|ACP56253.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.27]
Length = 818
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 85/203 (41%), Gaps = 21/203 (10%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
+ Y L I+AL + E S LLN+ Q + + + + + LW G YY +D
Sbjct: 562 SPYVSFLWISALEALNEASKLLNNPQNYE-----VKELIEKVNKYLWNGEYYMDWYDPIS 616
Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNG- 221
+ DS A Q+ G +++ ++ ++I N G E G+ N
Sbjct: 617 NLRDDSSNASQITGDWYVQM--LDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDD 674
Query: 222 MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETP 281
+ P G + +IQS+ W+GV Y L++ M G DE A + R VYER +
Sbjct: 675 ITPLGIK--LSIQSKAPWSGVEYYLASHMFSSG-FDEY---AKKILRNVYERYEIA---- 724
Query: 282 EGLTGDKTYRSGGYMRALAVYAM 304
G + YMR L +M
Sbjct: 725 -GNFWNHIEWGARYMRPLVALSM 746
>gi|410867532|ref|YP_006982143.1| hypothetical protein PACID_30230 [Propionibacterium acidipropionici
ATCC 4875]
gi|410824173|gb|AFV90788.1| hypothetical protein PACID_30230 [Propionibacterium acidipropionici
ATCC 4875]
Length = 845
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 54/219 (24%), Positives = 83/219 (37%), Gaps = 26/219 (11%)
Query: 107 YCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY-----KFDSS 161
+ G L +AAL EM+ +L T + + + +S Y L+ G YY +
Sbjct: 558 FMGSLWLAALRASEEMALILGATDRAMGWRERFVLSSAAYDTALFRGGYYVQAADETHGD 617
Query: 162 QSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEA-------- 213
D +ADQL G + A ++ + P+ ++ AL + + N+ +
Sbjct: 618 DDEFGDGCLADQLIGQWW--AHLLDLGHLLPAEHVRSALQAVVAHNLNPADETRHSQRAY 675
Query: 214 ---GSMGAVNGMKPNGDRDRSAIQ-SEEVWTGVTYLLSAAMLYEG-------NVDEAWTT 262
G G V P G R A + +EVWTG Y ++A L EG +D W
Sbjct: 676 AVRGEQGLVMCSWPRGGRPARATRYCDEVWTGCEYEVAALCLAEGLEKDGHRILDAIWRR 735
Query: 263 AGGLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAV 301
G R Y G + G GG R A+
Sbjct: 736 HDGRIRNPYNEIECGDHYVRSMAGWSVLELGGAPRWDAI 774
>gi|227828566|ref|YP_002830346.1| hypothetical protein M1425_2317 [Sulfolobus islandicus M.14.25]
gi|227460362|gb|ACP39048.1| protein of unknown function DUF608 [Sulfolobus islandicus M.14.25]
Length = 818
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 56/203 (27%), Positives = 85/203 (41%), Gaps = 21/203 (10%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
+ Y L I+AL + E S LLN+ Q + +L + Y LW G YY +D
Sbjct: 562 SPYVSFLWISALEALNEASKLLNNPQNYE--VKELIEKVDKY---LWNGEYYMDWYDPIS 616
Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNG- 221
+ DS A Q+ G +++ ++ ++I N G E G+ N
Sbjct: 617 NLRDDSSNASQITGDWYVQM--LDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDD 674
Query: 222 MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETP 281
+ P G + +IQS+ W+GV Y L++ M G DE A + R VYER +
Sbjct: 675 ITPLGIK--LSIQSKAPWSGVEYYLASHMFSSG-FDEY---AKKILRNVYERYEIA---- 724
Query: 282 EGLTGDKTYRSGGYMRALAVYAM 304
G + YMR L +M
Sbjct: 725 -GNFWNHIEWGARYMRPLVALSM 746
>gi|422908697|ref|ZP_16943374.1| hypothetical protein VCHE09_0171 [Vibrio cholerae HE-09]
gi|341639159|gb|EGS63785.1| hypothetical protein VCHE09_0171 [Vibrio cholerae HE-09]
Length = 981
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 55/120 (45%), Gaps = 4/120 (3%)
Query: 166 SDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPN 225
SD+ D L +LK V+ +F S+ I R L I+ TN + +G N +
Sbjct: 771 SDNSFGDALLADSYLKL--VDLPGLFDSSRIERTLDFIYQTNFIT-NSPKLGVANMTLAD 827
Query: 226 GDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLT 285
G R A Q+++VW GV + ++ A+ G +A +Y+ +Y + F PEG
Sbjct: 828 G-RLHDAFQAQDVWIGVQFSVATALRLAGKQAQAERLMDAVYQALYSMAKIPFAAPEGFN 886
Score = 41.6 bits (96), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 64/148 (43%), Gaps = 14/148 (9%)
Query: 32 PFSL-INGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLE----LIRKCE 86
PF + + Y+ +V +WKDL KFI+ +YR + D + + + E LI E
Sbjct: 552 PFDIDPDAYSWHNVKEWKDLAPKFILMVYRHWQKTGDREVVESCWSSVKESIDYLINLIE 611
Query: 87 SFDKQGLGIVQNGGFPDQTA-----YCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTK 141
D L + F + + YCG L +A L E++ L+ + + Q ++ +
Sbjct: 612 DGDTLPLTRGTDDTFDNLASHGVSIYCGSLWVAGLRAASELALLVGEEELAQGYTQRSKD 671
Query: 142 ASQVYHDLLW--TGSYYKF--DSSQSRH 165
A + LW YY F Q++H
Sbjct: 672 ALETLERGLWDEQKGYYHFFITPVQAKH 699
>gi|238620760|ref|YP_002915586.1| hypothetical protein M164_2319 [Sulfolobus islandicus M.16.4]
gi|238381830|gb|ACR42918.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.4]
Length = 818
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/203 (26%), Positives = 84/203 (41%), Gaps = 21/203 (10%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
+ Y L I+AL + E S LLN+ Q + + + + + LW G YY +D
Sbjct: 562 SPYVSFLWISALEALNEASKLLNNPQNYE-----VKELIEKVNKYLWNGEYYMDWYDPIS 616
Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNG- 221
+ DS A Q+ G +++ ++ ++I N E G+ N
Sbjct: 617 NLRDDSSNASQITGDWYVQM--LDLPEFLDYERRKSVFSSIMKYNYSAEEGVRNGSSNDD 674
Query: 222 MKPNGDRDRSAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETP 281
+ P G + +IQS+ W+GV Y L++ M G DE A + R VYER +
Sbjct: 675 ITPLGVK--LSIQSKAPWSGVEYYLASHMFSSG-FDEY---AKKILRNVYERYEIA---- 724
Query: 282 EGLTGDKTYRSGGYMRALAVYAM 304
G + YMR L +M
Sbjct: 725 -GNFWNHIEWGARYMRPLVALSM 746
>gi|326436882|gb|EGD82452.1| hypothetical protein PTSG_11966 [Salpingoeca sp. ATCC 50818]
Length = 1720
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 63/269 (23%), Positives = 105/269 (39%), Gaps = 41/269 (15%)
Query: 21 RYYTRWFPEEEPFSLINGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLE 80
R ++ W P++E SL D + D + ++ YR+ + + Y L+
Sbjct: 1302 RQWSFW-PDDEDRSL-------DQTICSDGEICTVIKTYREVLMGAPSSWFKDKYHQVLK 1353
Query: 81 LI-RKCESFDKQGLGIVQNGGFPDQ--------TAYCGGLHIAALACMVEMSSLLNDTQQ 131
++ R D QG G+V+ G P ++ G L++ AL EM+ L ND
Sbjct: 1354 IMERWLIIMDAQGDGVVR-GAQPSTYDVALHGANSFIGSLYMCALRAAEEMAVLQNDPDS 1412
Query: 132 QQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIM-----ADQLCGYLFLKASGVN 186
+F A+ + Y L +T + H +I+ D L G + + G+
Sbjct: 1413 AAKFRARFKLSVANYDKLCFTNGKWYTQVVDPAHDTNIIGESTFVDSLLGEWWAQFLGIG 1472
Query: 187 TEAIFPSANIARALATIFSTN-VRGFEAGSMGAVN-------GM------KPNGDRDRSA 232
A+ P A++ L F N V F+ GM P ++A
Sbjct: 1473 --AMLPPAHVLSTLQNCFKYNHVDAFDPARQAPRKFCDSRDAGMYIATWGGPENPPPKNA 1530
Query: 233 I--QSEEVWTGVTYLLSAAMLYEGNVDEA 259
+ SE +WTG+ Y S + G+VD A
Sbjct: 1531 LLYTSEAIWTGLVYPFSGLCMMAGDVDTA 1559
>gi|15922864|ref|NP_378533.1| hypothetical protein ST2529 [Sulfolobus tokodaii str. 7]
gi|15623655|dbj|BAB67642.1| putative glycosidase [Sulfolobus tokodaii str. 7]
Length = 836
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/153 (23%), Positives = 65/153 (42%), Gaps = 25/153 (16%)
Query: 139 LTKASQVYHDLLWTGSYYK--FDSSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANI 196
+ A +Y +LW G Y+ +D + A QL G +L G ++
Sbjct: 582 VENALSLYESMLWNGEYFDLWYDPISGYRDKTCQASQLLGEFYLNLLGY---SLLDREKT 638
Query: 197 ARALATIFSTNVRGFEAGSMGAVNGMKPNGDR------------DRSAIQSEEVWTGVTY 244
+ L +I N++ E G +NG P+G R ++I + W+GV +
Sbjct: 639 RKTLLSIVKYNLKEEE----GVINGAYPDGYRPLMREYENPLKIKEASIHQDTPWSGVEF 694
Query: 245 LLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG 277
L++ ++YE +DE A + + VY+R +
Sbjct: 695 YLASHLIYEKMIDE----AKKVLKEVYDRYSIA 723
>gi|229820204|ref|YP_002881730.1| hypothetical protein Bcav_1712 [Beutenbergia cavernae DSM 12333]
gi|229566117|gb|ACQ79968.1| protein of unknown function DUF608 [Beutenbergia cavernae DSM
12333]
Length = 865
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 70/152 (46%), Gaps = 20/152 (13%)
Query: 134 QFSAKLTKASQVYHDLLWTGSYYKF-----DSSQSRHSDSIMADQLCGYLFLKASGVNTE 188
+ A + S Y ++ +TG YY+ + +Q + D ++DQL G + A ++
Sbjct: 611 ELHALFERGSAAYDEICFTGEYYRQVLEPDEPTQFQWGDGCLSDQLIGQWW--AHLLDLG 668
Query: 189 AIFPSANIARALATIFSTNVR-GFEA-----------GSMGAVNGMKPNGDRDRSAIQ-S 235
+ P+A++ AL + N+R GF+ +G + P+G R A +
Sbjct: 669 HVLPAAHVRSALRAVVGHNLRRGFDGWEHTQRVYADGDDVGLLMCTWPHGGRPEVATRYC 728
Query: 236 EEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLY 267
+EVW+GV Y ++A L EG E GL+
Sbjct: 729 DEVWSGVEYEVAAHCLLEGLTAEGRAILDGLW 760
>gi|160932020|ref|ZP_02079412.1| hypothetical protein CLOLEP_00853 [Clostridium leptum DSM 753]
gi|156869062|gb|EDO62434.1| hypothetical protein CLOLEP_00853 [Clostridium leptum DSM 753]
Length = 822
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 46/200 (23%), Positives = 85/200 (42%), Gaps = 28/200 (14%)
Query: 126 LNDTQQQQQFSAKLTKASQVYHDLLWTGSYY-----KFDSSQSRHSDSIMADQLCGYLFL 180
+ D ++ +++ K +++ + W G YY D + ++ ++DQL G
Sbjct: 570 MGDQERARRYRDGAEKGAKLMDEKCWNGEYYIQLIDDVDKYRYQYGKGCLSDQLLGQFLA 629
Query: 181 KASGVNTEAIFPSANIARALATIFSTNV--RGFEAG----------SMGAVNGMKPNGDR 228
+ +G+ + P ++ +A +IF N R + G MG P G+R
Sbjct: 630 QEAGLGY--VLPKNHVKKAAESIFRYNFIERASDFGHVQRAYIFNDEMGLTPCTWPLGER 687
Query: 229 DR-SAIQSEEVWTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLGFETPEGLTGD 287
R I EVWTG+ Y ++A ++ E ++E T + + V +R P
Sbjct: 688 PRFPFIYFGEVWTGIEYEVAALLIREDLLEEGLT----IVKAVRDRQDGYRRNPWSENES 743
Query: 288 KTYRSGGYMRALAVYAMQDA 307
Y Y RA+A Y++ +A
Sbjct: 744 GYY----YTRAMASYSILNA 759
>gi|327311335|ref|YP_004338232.1| hypothetical protein TUZN_1447 [Thermoproteus uzoniensis 768-20]
gi|326947814|gb|AEA12920.1| hypothetical protein TUZN_1447 [Thermoproteus uzoniensis 768-20]
Length = 892
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 88/220 (40%), Gaps = 40/220 (18%)
Query: 105 TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYK--FDSSQ 162
++Y L AAL MVE + L T + + A+L + + + LW G Y+ +D
Sbjct: 587 SSYVLFLWAAALKAMVEGAKL---TGRSPEKYAELLRRAFEGLERLWNGEYFDLWWDPVT 643
Query: 163 SRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAVNGM 222
+ MA QL G L + + + + AL + N+ A G +NGM
Sbjct: 644 GERDRASMAAQLFGQLLAHVADLGY--LSDKQRVISALRAVAKYNL----APDEGLINGM 697
Query: 223 KPNGDRDRSAI------------------QSEEVWTGVTYLLSAAMLYEGNVDEAWTTAG 264
P DR R + Q + WTGV Y ++ M YEG V+E
Sbjct: 698 YP--DRRRPSFVGPTLYENFTRGPYLPTWQMDTPWTGVEYAVAGHMFYEGLVEEGTAVLK 755
Query: 265 GLYRTVYERTGLGFETPEGLTGDKTYRSGGYMRALAVYAM 304
L+ YER G + E T YMR L+ +A+
Sbjct: 756 ALHER-YERGGHYWNHIEWGT--------HYMRPLSAWAV 786
>gi|149196161|ref|ZP_01873217.1| hypothetical protein LNTAR_23484 [Lentisphaera araneosa HTCC2155]
gi|149141008|gb|EDM29405.1| hypothetical protein LNTAR_23484 [Lentisphaera araneosa HTCC2155]
Length = 1052
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 33/133 (24%), Positives = 56/133 (42%), Gaps = 11/133 (8%)
Query: 37 NGYNVFDVSDWKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIV 96
+ YN +V++WKDL K+I+ + R+Y KD+ L + + ++ + +G I
Sbjct: 612 DAYNWHNVAEWKDLAPKYILMLLRNYHFTKDVYLLEECWDSVQASLQYLKDMILEGHSIP 671
Query: 97 QNGGFPDQ---------TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYH 147
G D T YCG L +A L E++ +L AS ++
Sbjct: 672 LTNGTDDTFDNLSSFGITLYCGSLWVAGLKAAGEIAKILKIEDVIDDLKELEEAASASFN 731
Query: 148 DLLW--TGSYYKF 158
LW +Y+ F
Sbjct: 732 QALWDKENNYFHF 744
>gi|348173841|ref|ZP_08880735.1| hypothetical protein SspiN1_25464 [Saccharopolyspora spinosa NRRL
18395]
Length = 995
Score = 45.8 bits (107), Expect = 0.026, Method: Composition-based stats.
Identities = 61/256 (23%), Positives = 108/256 (42%), Gaps = 42/256 (16%)
Query: 55 IVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIV-------QNGGFPDQTAY 107
I+ IYR++ + D ++L RV+P + + + D + GI+ Q+ + + ++
Sbjct: 608 ILRIYREHQMSPDSQWLQRVWPRTRQAVEYLIASDAEPDGILDGAQPNTQDATWFGRNSW 667
Query: 108 CGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSD 167
GL++AAL M+ + D ++ + T ++V L+ G Y+ + + H
Sbjct: 668 LSGLYLAALRAGAAMAGEVGDDAFARRCTELATSGTEVIVRDLFNGEYFVHELDPA-HPG 726
Query: 168 SI------MADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNV--RGFEAGSMGAV 219
S+ ADQL G + A+ + + P AL +I+ N R E +
Sbjct: 727 SVNTNRGCFADQLLGPSW--ATQLGLPRVLPPEPTRSALRSIWRHNFVPRPMEYRQNSPI 784
Query: 220 NGMK---------------PNGDRDR-----SAIQSEEVWTGVTYLLSAAMLYEGNVDEA 259
G + PNG D S E W G+ Y ++A ++ EG VDE
Sbjct: 785 EGGRIFYDADVPALVMCTWPNGGGDEAGDNWSVSYFNEAWHGIEYQVAAHLIAEGMVDEG 844
Query: 260 WTTAGGLYRTVYERTG 275
A R+V++R G
Sbjct: 845 LAVA----RSVHDRYG 856
>gi|332796016|ref|YP_004457516.1| hypothetical protein Ahos_0327 [Acidianus hospitalis W1]
gi|332693751|gb|AEE93218.1| conserved hypothetical protein [Acidianus hospitalis W1]
Length = 662
Score = 45.8 bits (107), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 51/207 (24%), Positives = 86/207 (41%), Gaps = 24/207 (11%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLE-----LIRKCESFDKQGLGIVQNGGF 101
WKDLN FI+ +YR + D+ FL YP L+ L R E+ +G G
Sbjct: 407 WKDLNPTFILLVYRYFKFTGDVDFLRETYPKMLKAYEWMLTRSIEA---EGSGDTAFDVL 463
Query: 102 P--DQTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFD 159
P + L IA+ + E +LN+ + S L+K ++ + L+ G Y+
Sbjct: 464 PIKGKNPMLLTLFIASALALRETKKVLNEKDE----STDLSKLREMLNS-LYNGKYFIAW 518
Query: 160 SSQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRGFEAGSMGAV 219
Q + I QL G + + G+ E + I+ AL + N + A
Sbjct: 519 EGQ----EGIFMAQLLGEWWTELLGL--ENVTDEEKISSALRYMLEVNGK---ASEYCTP 569
Query: 220 NGMKPNGDRDRSAIQSEEVWTGVTYLL 246
N +K NG+ + + Q+ W + + +
Sbjct: 570 NLVKENGEVVKISPQAYSSWPRLVFAM 596
>gi|284997210|ref|YP_003418977.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
gi|284445105|gb|ADB86607.1| hypothetical protein LD85_0902 [Sulfolobus islandicus L.D.8.5]
Length = 498
Score = 45.8 bits (107), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 49/97 (50%), Gaps = 18/97 (18%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLEL----IRKCESFDKQGLGIVQ---NG 99
WKD+N I+ +YR + DI FL VYPT +++ +R+C + GL ++ +
Sbjct: 402 WKDMNPSLILLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQC----RDGLPFMEGEMDN 457
Query: 100 GFPDQT------AYCGGLHIAALACMVEMSSLLNDTQ 130
F D T +Y L IA+L M E++ L+ Q
Sbjct: 458 AF-DATIIKGHDSYTSSLFIASLIAMREIAKLVGRQQ 493
>gi|15922946|ref|NP_378615.1| hypothetical protein ST2609 [Sulfolobus tokodaii str. 7]
gi|15623737|dbj|BAB67724.1| putative glycosidase [Sulfolobus tokodaii str. 7]
Length = 661
Score = 45.4 bits (106), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 46/167 (27%), Positives = 69/167 (41%), Gaps = 16/167 (9%)
Query: 47 WKDLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQT- 105
WKDLN +I+ IYR Y L DI FL VY + F + GL + F D T
Sbjct: 388 WKDLNPTYILLIYRYYKLTGDIEFLKSVYDKVKKAFEWELKFSRYGLEGKMDSAF-DVTP 446
Query: 106 -----AYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDS 160
+Y L+IA+L M E+S D + ++ +A + + + G Y+
Sbjct: 447 IKGINSYTLSLYIASLFAMREISKTAGDNLNLDE---QIKEAKEAFEKMF-NGKYF---I 499
Query: 161 SQSRHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTN 207
+ D++ Q+ G + G+ E I I AL I N
Sbjct: 500 AWEGMEDAVFLAQVFGEWWTTLLGL--EPIADEEKIKSALRWIIKVN 544
>gi|297741933|emb|CBI33368.3| unnamed protein product [Vitis vinifera]
Length = 108
Score = 45.1 bits (105), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 34/53 (64%), Gaps = 2/53 (3%)
Query: 257 DEAWTTAGGLYRTVYERTGLG--FETPEGLTGDKTYRSGGYMRALAVYAMQDA 307
++A+TTA G++ + G G F+TPEG T D +RS YMR LA++ MQ A
Sbjct: 3 EQAFTTAEGIFTAGWSEEGYGYWFQTPEGWTIDGHFRSLIYMRPLAIWGMQWA 55
>gi|374854722|dbj|BAL57596.1| hypothetical conserved protein [uncultured prokaryote]
Length = 751
Score = 44.7 bits (104), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 57/255 (22%), Positives = 95/255 (37%), Gaps = 46/255 (18%)
Query: 49 DLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKC---ESFDKQGLGIVQNGG----- 100
D + IV Y+ Y ++ +L +VYP +L+ S+D G+++
Sbjct: 456 DGQMGLIVRAYQQYRRDNNLEWLRQVYPKLKQLLSFAWLPGSWDADRDGVMEGAQHNTYD 515
Query: 101 --FPDQTAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY-- 156
F CG ++AAL M EM+ + +T Q+ + S+ + L+ G YY
Sbjct: 516 IEFFGPNPMCGVWYLAALLAMEEMAKRVGETDFAQECRQLFERGSRWIDENLFDGEYYVQ 575
Query: 157 -----------------KFDSSQSRHS--DSIMADQLCGYLFLKASGVNTEAIFPSANIA 197
D + R+ + DQL G KA+ + +I
Sbjct: 576 RVQPLQGQPHPMTTAIDPGDPAYQRYQVGTGCLIDQLTGQ--YKANRAGLGDLLKREHIV 633
Query: 198 RALATIFSTNV-RGFE-----------AGSMGAVNGMKPNGDRDRSAIQS-EEVWTGVTY 244
+AL ++ N RGF G + P G+R + E WTG+ Y
Sbjct: 634 KALRSLMRHNFRRGFHQHYNNMRTYALGDEAGVLICSYPRGERPETPFPYWAECWTGLEY 693
Query: 245 LLSAAMLYEGNVDEA 259
+ + +L G EA
Sbjct: 694 MFARLLLDYGLEQEA 708
>gi|332796035|ref|YP_004457535.1| bile acid beta-glucosidase [Acidianus hospitalis W1]
gi|332693770|gb|AEE93237.1| bile acid beta-glucosidase [Acidianus hospitalis W1]
Length = 840
Score = 42.4 bits (98), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 61/279 (21%), Positives = 105/279 (37%), Gaps = 61/279 (21%)
Query: 49 DLNLKFIVSIYRDYALHKDIRFLTRVYPT---CLELIRKCESFDK----------QGLGI 95
DLN +FI+ Y D+ FL ++P LE K +++D + L
Sbjct: 462 DLNPEFILMTYLIAKTTGDLNFLKEIFPKMKEALESTMKTQTYDGLIYHTLPAGLEWLRY 521
Query: 96 VQNG-GFP---DQTAYCGGLHIAALA-------CMVEMSSL-----------LNDTQQQQ 133
V N P + +A G ++ L+ M+ ++S +ND
Sbjct: 522 VNNKLNLPRGDNNSASILGHNLIPLSMQTFDDWSMIGITSFTSILWISSIQAVNDACSNL 581
Query: 134 QFSAKLTKASQVYH--DLLWTGSYYK--FDSSQSRHSDSIMADQLCGYLFLKASGVNTEA 189
+ + S V D LW G Y+ +D + A Q+ G+ + G+
Sbjct: 582 KINCSYDYESLVKKLIDYLWNGEYFDLWYDPKSKMRDKACNASQILGHWYSTLLGLR--- 638
Query: 190 IFPSANIARALATIFSTNVRGFEAGSMGAVNGMKPNGDRD-----------RSAIQSEEV 238
+ + L +I N++ E G +NG PNG R + Q +
Sbjct: 639 FLDDSLVKTTLKSIVKYNLKEEE----GLLNGAYPNGYRPLKRNYQNQLNLPATTQIDTP 694
Query: 239 WTGVTYLLSAAMLYEGNVDEAWTTAGGLYRTVYERTGLG 277
W+GV + +++ ++YE DE + R +YER L
Sbjct: 695 WSGVEFYVASHLIYEKLRDE----GEKILRNIYERYKLA 729
>gi|170060875|ref|XP_001865995.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167879232|gb|EDS42615.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 183
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 15/27 (55%), Positives = 20/27 (74%)
Query: 36 INGYNVFDVSDWKDLNLKFIVSIYRDY 62
+ Y + DV +WKDLN KFI+ +YRDY
Sbjct: 54 VRAYPIHDVYEWKDLNSKFILQVYRDY 80
>gi|156383620|ref|XP_001632931.1| predicted protein [Nematostella vectensis]
gi|156219994|gb|EDO40868.1| predicted protein [Nematostella vectensis]
Length = 1090
Score = 38.1 bits (87), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 49/232 (21%), Positives = 89/232 (38%), Gaps = 30/232 (12%)
Query: 49 DLNLKFIVSIYRDYALHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQ---- 104
D ++ ++ YR+ + +++P ++++ + G G++ G P+
Sbjct: 634 DGDVGTVLKTYREVRQGAPKTWFNKMWPQVKRIMKRWMTDLDNGTGVIP-GPQPNTYDCT 692
Query: 105 ----TAYCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYY--KF 158
Y G ++A L +M+ + N+ + + + S L+ GS+Y +
Sbjct: 693 LYGVNVYISGYYLAGLRAAEKMAKIQNEPELAATYHQRFELGSAKIDKTLFNGSWYIQEV 752
Query: 159 DSSQSRH--SDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTN--------- 207
DS + D+ D L G + A G+ I NI L IF N
Sbjct: 753 DSEHPVNVVGDATWVDCLVGQWWAHALGLGY--ILSQKNIQSTLQNIFLKNHVERFVPAT 810
Query: 208 ---VRGFEAGSMGAVNGMKPNGDRDRSAI--QSEEVWTGVTYLLSAAMLYEG 254
+ F+ G + PN D +S + SE WTG+ Y + YEG
Sbjct: 811 QAPRQFFDERDAGLRICVYPN-DLPKSPLLYHSEGAWTGLEYEYAELCFYEG 861
>gi|312868970|ref|ZP_07729150.1| glutaminase A [Lactobacillus oris PB013-T2-3]
gi|311095534|gb|EFQ53798.1| glutaminase A [Lactobacillus oris PB013-T2-3]
Length = 306
Score = 37.7 bits (86), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 46/178 (25%), Positives = 78/178 (43%), Gaps = 26/178 (14%)
Query: 117 ACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCG 176
A +E++SL+ +Q F + A ++ HD T + F+ S+ R D+ D+
Sbjct: 113 AGAIEVTSLIKPLADRQPFEQIIMFAREICHDPQITLNNEIFE-SEDRTGDT---DRSLA 168
Query: 177 YLFLKASGVNTEAIFPSANIARALATIF---STNVRGFEAGSMGAV---NGMKPNGDRDR 230
Y +LKA+G+ ++ +L T F S V ++GAV NG+KP D+DR
Sbjct: 169 Y-YLKANGIM------QGDVTVSLTTYFKQCSMMVTAVSLANLGAVLANNGVKP-WDQDR 220
Query: 231 SAIQSEEVWTGVTYLLSAAMLYEG-------NVDEAWTTAGGLYRTVYERTGLGFETP 281
I S+ + +++ + E + GGL V R G+G +P
Sbjct: 221 -LISSQAATYAKSLMMTTGLYNESGTYSSIIGIPTKSGVGGGLMSAVPNRVGVGIFSP 277
>gi|259502028|ref|ZP_05744930.1| glutaminase 1 [Lactobacillus antri DSM 16041]
gi|259170029|gb|EEW54524.1| glutaminase 1 [Lactobacillus antri DSM 16041]
Length = 309
Score = 37.7 bits (86), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 48/179 (26%), Positives = 77/179 (43%), Gaps = 28/179 (15%)
Query: 117 ACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSSQSRHSDSIMADQLCG 176
A +E++SL+ +Q F + A ++ HD T + F+S + R D+ D+
Sbjct: 116 AGAIEVASLIKPLADRQPFEQIIMFAREICHDPQITLNNEIFES-EDRTGDT---DRSLA 171
Query: 177 YLFLKASGVNTEAIFPSANIARALATIF---STNVRGFEAGSMGAV---NGMKPNGDRDR 230
Y +LKA+G+ A++ +L T F S V ++GAV NG+KP D+
Sbjct: 172 Y-YLKANGI------LQADVTASLTTYFKQCSMMVTAVSLANLGAVLANNGVKP---WDQ 221
Query: 231 SAIQSEEVWTGVTYLLSAAMLYEGN--------VDEAWTTAGGLYRTVYERTGLGFETP 281
+ S T L+ LY+ + V GGL V R G+G +P
Sbjct: 222 ERLISGAAATYAKSLMMTTGLYDESGTYSSIIGVPTKSGVGGGLMSAVPNRVGIGIFSP 280
>gi|320109368|ref|YP_004184958.1| hypothetical protein AciPR4_4219 [Terriglobus saanensis SP1PR4]
gi|319927889|gb|ADV84964.1| hypothetical protein AciPR4_4219 [Terriglobus saanensis SP1PR4]
Length = 898
Score = 37.4 bits (85), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 34/167 (20%), Positives = 65/167 (38%), Gaps = 13/167 (7%)
Query: 54 FIVSIYRDYA-------LHKDIRFLTRVYPTCLELIRKCESFDKQGLGIVQNGGFPDQTA 106
+++ + +DY + K+ + R + IR +SF G + P+ +
Sbjct: 531 WVIQLAKDYQQTGDDTFIKKNADHIRRGFAYMKAQIRD-DSFIPVGGQTYDDFPHPEISV 589
Query: 107 YCGGLHIAALACMVEMSSLLNDTQQQQQFSAKLTKASQVYHDLLWTGSYYKFDSS---QS 163
Y G +++AAL V + L DTQ + + + LW G ++ + +
Sbjct: 590 YTGTVYLAALRAGVVLGDALGDTQMSHEAETQFKQTQAGLIHALWNGRFFAYGTDIGGAH 649
Query: 164 RHSDSIMADQLCGYLFLKASGVNTEAIFPSANIARALATIFSTNVRG 210
R D + + QL G + +G + P A+ T+V G
Sbjct: 650 RRDDRLFSGQLAGQFLSRYAGWGD--VLPHDQAKSAIYEQLHTSVLG 694
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.134 0.406
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,943,065,818
Number of Sequences: 23463169
Number of extensions: 199339992
Number of successful extensions: 418074
Number of sequences better than 100.0: 418
Number of HSP's better than 100.0 without gapping: 281
Number of HSP's successfully gapped in prelim test: 137
Number of HSP's that attempted gapping in prelim test: 416248
Number of HSP's gapped (non-prelim): 612
length of query: 314
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 172
effective length of database: 9,027,425,369
effective search space: 1552717163468
effective search space used: 1552717163468
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)