BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 002202
(954 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255575898|ref|XP_002528846.1| conserved hypothetical protein [Ricinus communis]
gi|223531697|gb|EEF33520.1| conserved hypothetical protein [Ricinus communis]
Length = 952
Score = 1513 bits (3917), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 736/952 (77%), Positives = 843/952 (88%), Gaps = 3/952 (0%)
Query: 1 MFEGKILENGLHEEEKEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQL 60
M EG+I NG +E++EP +S +KVDP PASLTWQRKL++ +I LSQF L+++E QL
Sbjct: 1 MSEGEIPANGC-QEDREPSDSLLEKVDPGSPASLTWQRKLNSEDIALSQFNLSFQEKFQL 59
Query: 61 APIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQI 120
AP+G+R+ LIREE AKG+ + I+PF+KR +TS HG+PLGG+GSGSIGRSY+GEFQRWQ+
Sbjct: 60 APVGIRLWRLIREETAKGRVSIINPFLKRFITSCHGIPLGGIGSGSIGRSYKGEFQRWQL 119
Query: 121 FPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKST 180
FPR+CE+KPVLANQFSVFVSRS+G+KYSSVLCP+ PEVL + +GIGSWDWNLKGD ST
Sbjct: 120 FPRICEEKPVLANQFSVFVSRSSGEKYSSVLCPRNPEVLMEPAISGIGSWDWNLKGDNST 179
Query: 181 YHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADIT 240
YHALYPRAWT+++GEPDPELRIVCRQISPIIPHNYKESSYPVSVFT+T+YNSGKT+AD++
Sbjct: 180 YHALYPRAWTIYDGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTFTLYNSGKTTADVS 239
Query: 241 LLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHV 300
LLFTWTNSVGG+SE++GQH+NS T M DGVHAVLLHH+T+ PPVTFA+AAQET+ VHV
Sbjct: 240 LLFTWTNSVGGNSEYSGQHFNSTTMMEDGVHAVLLHHKTAEGFPPVTFAIAAQETNDVHV 299
Query: 301 SLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPP 360
S CP FVISGN G+TAKDMWHE+KEHGSFD L S TS SEPGSSIGAAIAASVT+PP
Sbjct: 300 SKCPRFVISGNCQGITAKDMWHEVKEHGSFDNLKSAGTSGPSEPGSSIGAAIAASVTIPP 359
Query: 361 DSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
D+ VTFSL+WDCPEV FM G+TY+RRYTKFY TH +AAA IA DAILEHG WE QI A
Sbjct: 360 DAIRSVTFSLSWDCPEVYFMGGRTYHRRYTKFYSTHGDAAARIAHDAILEHGLWESQIVA 419
Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDL 480
WQRPILEDKRLPEWYPITLFNELYYLN+GG +WTDGSPP H+LV+I KFSLD S + L
Sbjct: 420 WQRPILEDKRLPEWYPITLFNELYYLNSGGTIWTDGSPPYHNLVSIRGSKFSLDTSGAGL 479
Query: 481 KRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEY 540
K I+DV ++NDTAVNIL RM+S LEQI+ VA NS+FGTNLLQ+GEENIGQFLYLEGIEY
Sbjct: 480 KSIIDVTHENDTAVNILGRMTSALEQIHAHVASNSAFGTNLLQEGEENIGQFLYLEGIEY 539
Query: 541 LMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAV 600
MWNTYDVHFYSSFAL+MLFPK++LS+QRDFAAAVMMHDPSKM+LL DGQWV RKVLGAV
Sbjct: 540 HMWNTYDVHFYSSFALVMLFPKLELSVQRDFAAAVMMHDPSKMQLLHDGQWVCRKVLGAV 599
Query: 601 PHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMA 660
PHDIGI DPW+EVNAY LY+T RWKDLNPKFVLQVYRDVVATGDKKFA+AVWPSVY+AMA
Sbjct: 600 PHDIGINDPWYEVNAYSLYNTDRWKDLNPKFVLQVYRDVVATGDKKFAEAVWPSVYIAMA 659
Query: 661 YMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSED 720
YMDQFDRDGDGMIENDGFPDQTYDTWSVSG+SAYSGGLWVAALQAASALAREVGD+GSED
Sbjct: 660 YMDQFDRDGDGMIENDGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALAREVGDKGSED 719
Query: 721 YFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKA 780
YF +FQKAK VY+KLWNGSYFNYDNSG SSSIQADQLAGQWYARACGL PIVD+DKA
Sbjct: 720 YFWARFQKAKDVYDKLWNGSYFNYDNSGGRNSSSIQADQLAGQWYARACGLFPIVDKDKA 779
Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
RSAL+KVYNYNVLKV GKRGA+NGMLPDG+VD+SSMQSREIWSGVTYA+AA+MI ED+
Sbjct: 780 RSALEKVYNYNVLKVKDGKRGAINGMLPDGKVDLSSMQSREIWSGVTYALAATMIQEDML 839
Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPK 900
D+ F TA GIYEAAWS GLGY+FQTPEAWN DQYRSLCYMRPLAIWAMQWAL+R PK
Sbjct: 840 DMAFHTASGIYEAAWSERGLGYSFQTPEAWNNVDQYRSLCYMRPLAIWAMQWALSR--PK 897
Query: 901 TLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRM 952
+++M+ EV ++ LL +HAGF+KVAR L+LPE + + LLQSLF++TC+++
Sbjct: 898 LEKEEMEMEVNEDYLLPHHAGFTKVARFLRLPEGEESLGLLQSLFNYTCKKL 949
>gi|297739499|emb|CBI29681.3| unnamed protein product [Vitis vinifera]
Length = 949
Score = 1470 bits (3805), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 711/952 (74%), Positives = 824/952 (86%), Gaps = 6/952 (0%)
Query: 6 ILENGLHEEEKEPLNSSFD--KVDPAKPASLTWQRKL-STGEIPLSQFTLNWKETVQLAP 62
+ ENGL E E+EP NSS + KVDP KP SLTWQRKL S G P+ +F +N +ET LAP
Sbjct: 1 MCENGLEEGEREPSNSSIEEIKVDPGKPGSLTWQRKLNSDGNAPV-EFKINLRETFHLAP 59
Query: 63 IGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFP 122
IG R+ +REE AKG+ A IDPF+KR+++SS GVPLGG+G+GSIGRSY+GEFQR+Q+FP
Sbjct: 60 IGFRLWRHVREETAKGRGAMIDPFVKRYISSSQGVPLGGIGAGSIGRSYKGEFQRFQLFP 119
Query: 123 RVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYH 182
E++PVL NQFSVFVSR NG+KYS+VLC ++PE LK+ +GIGSWDWNL G+KSTY
Sbjct: 120 ITSENEPVLENQFSVFVSRPNGEKYSTVLCRQSPEALKECPPSGIGSWDWNLNGNKSTYL 179
Query: 183 ALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLL 242
ALYPRAWTV++GEPDP L+IVCRQISPIIPHNYKESS+PV+VFT+T++NSGKT+ADITLL
Sbjct: 180 ALYPRAWTVYDGEPDPALKIVCRQISPIIPHNYKESSFPVAVFTFTLFNSGKTAADITLL 239
Query: 243 FTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSL 302
FTW NSVGG S +GQH NSK M DGV VLLHH+T++ PPVT+A+AAQE DGVH+S
Sbjct: 240 FTWANSVGGVSGLSGQHLNSKFMMKDGVRGVLLHHKTANGRPPVTYAIAAQEMDGVHISE 299
Query: 303 CPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDS 362
CP F ISG++ G+TAKDMW+EIKEHGSFDRLNS ETS+ SE GSS+GAA+AAS+T+P DS
Sbjct: 300 CPCFFISGDTPGITAKDMWNEIKEHGSFDRLNSAETSMPSELGSSVGAAVAASLTIPSDS 359
Query: 363 EGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQ 422
E VTFSLAWDCPE+NF +TYYRRYTKFYGT +AAA IA DAIL+HG WE QIEAWQ
Sbjct: 360 EQTVTFSLAWDCPEINFSKERTYYRRYTKFYGTRGDAAAKIAHDAILDHGHWESQIEAWQ 419
Query: 423 RPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKR 482
+P+LEDKR PEWYPITLFNELYYLN+GG VWTDGSPPVHS +I RKFSLD S+SDLK
Sbjct: 420 KPVLEDKRFPEWYPITLFNELYYLNSGGTVWTDGSPPVHSFTSIIERKFSLDRSRSDLKN 479
Query: 483 IVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLM 542
V + + NDTAV+ILERM+S+LEQ++TPV NS+FG NLLQ GEENIGQFLYLEG+EY M
Sbjct: 480 TVSISHHNDTAVDILERMTSVLEQVHTPVTSNSAFGPNLLQQGEENIGQFLYLEGVEYFM 539
Query: 543 WNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPH 602
WNT DVHFYSSFALIMLFPK++LSIQRDFAA+VMMHDPSKMKLL +G+WVSRKVLGAVPH
Sbjct: 540 WNTNDVHFYSSFALIMLFPKLELSIQRDFAASVMMHDPSKMKLLCNGKWVSRKVLGAVPH 599
Query: 603 DIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
D+G DPWFEVN Y LY+T RWKDLNPKFVLQVYRDVVATGDK FAKAVWPSVYVA+AYM
Sbjct: 600 DVGNYDPWFEVNGYNLYNTDRWKDLNPKFVLQVYRDVVATGDKNFAKAVWPSVYVALAYM 659
Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
+QFD+DGDGMIEN+GFPDQTYDTWSVSG+SAYSGGLWVAALQAASALAR VGD+GSEDYF
Sbjct: 660 NQFDKDGDGMIENEGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALARVVGDKGSEDYF 719
Query: 723 LFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARS 782
FKFQKAK VY+KLWNGSYFNYD+S S SSSIQADQLAGQWYARACGL PIVDEDKA+S
Sbjct: 720 WFKFQKAKGVYQKLWNGSYFNYDDSDGSSSSSIQADQLAGQWYARACGLSPIVDEDKAKS 779
Query: 783 ALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADI 842
AL+KVY+YNVLKV GGKRGAVNGMLPDG+VD ++MQSREIWSGVTY VAA+MIHE L D+
Sbjct: 780 ALEKVYHYNVLKVGGGKRGAVNGMLPDGKVDTTTMQSREIWSGVTYGVAATMIHEGLVDM 839
Query: 843 GFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTL 902
FQTA G+YEAAWS GLGY+FQTPE+WNTDDQYRSLCYMRPLAIWAMQWA ++PK
Sbjct: 840 AFQTASGVYEAAWSQEGLGYSFQTPESWNTDDQYRSLCYMRPLAIWAMQWAFSQPKLHKY 899
Query: 903 EKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMFI 954
E PE+ ++SL+ HAGFS+VARLLKLP+E ++S LQ ++D+TC+RM++
Sbjct: 900 EAN--PEMNEDSLVLQHAGFSRVARLLKLPDEDVSRSALQVIYDYTCKRMWL 949
>gi|359486233|ref|XP_002264575.2| PREDICTED: non-lysosomal glucosylceramidase [Vitis vinifera]
Length = 960
Score = 1466 bits (3794), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 711/963 (73%), Positives = 824/963 (85%), Gaps = 17/963 (1%)
Query: 6 ILENGLHEEEKEPLNSSFD-------------KVDPAKPASLTWQRKL-STGEIPLSQFT 51
+ ENGL E E+EP NSS + KVDP KP SLTWQRKL S G P+ +F
Sbjct: 1 MCENGLEEGEREPSNSSIEEVGHVLCYMLGRIKVDPGKPGSLTWQRKLNSDGNAPV-EFK 59
Query: 52 LNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSY 111
+N +ET LAPIG R+ +REE AKG+ A IDPF+KR+++SS GVPLGG+G+GSIGRSY
Sbjct: 60 INLRETFHLAPIGFRLWRHVREETAKGRGAMIDPFVKRYISSSQGVPLGGIGAGSIGRSY 119
Query: 112 RGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWD 171
+GEFQR+Q+FP E++PVL NQFSVFVSR NG+KYS+VLC ++PE LK+ +GIGSWD
Sbjct: 120 KGEFQRFQLFPITSENEPVLENQFSVFVSRPNGEKYSTVLCRQSPEALKECPPSGIGSWD 179
Query: 172 WNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYN 231
WNL G+KSTY ALYPRAWTV++GEPDP L+IVCRQISPIIPHNYKESS+PV+VFT+T++N
Sbjct: 180 WNLNGNKSTYLALYPRAWTVYDGEPDPALKIVCRQISPIIPHNYKESSFPVAVFTFTLFN 239
Query: 232 SGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALA 291
SGKT+ADITLLFTW NSVGG S +GQH NSK M DGV VLLHH+T++ PPVT+A+A
Sbjct: 240 SGKTAADITLLFTWANSVGGVSGLSGQHLNSKFMMKDGVRGVLLHHKTANGRPPVTYAIA 299
Query: 292 AQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAA 351
AQE DGVH+S CP F ISG++ G+TAKDMW+EIKEHGSFDRLNS ETS+ SE GSS+GAA
Sbjct: 300 AQEMDGVHISECPCFFISGDTPGITAKDMWNEIKEHGSFDRLNSAETSMPSELGSSVGAA 359
Query: 352 IAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEH 411
+AAS+T+P DSE VTFSLAWDCPE+NF +TYYRRYTKFYGT +AAA IA DAIL+H
Sbjct: 360 VAASLTIPSDSEQTVTFSLAWDCPEINFSKERTYYRRYTKFYGTRGDAAAKIAHDAILDH 419
Query: 412 GSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKF 471
G WE QIEAWQ+P+LEDKR PEWYPITLFNELYYLN+GG VWTDGSPPVHS +I RKF
Sbjct: 420 GHWESQIEAWQKPVLEDKRFPEWYPITLFNELYYLNSGGTVWTDGSPPVHSFTSIIERKF 479
Query: 472 SLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQ 531
SLD S+SDLK V + + NDTAV+ILERM+S+LEQ++TPV NS+FG NLLQ GEENIGQ
Sbjct: 480 SLDRSRSDLKNTVSISHHNDTAVDILERMTSVLEQVHTPVTSNSAFGPNLLQQGEENIGQ 539
Query: 532 FLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQW 591
FLYLEG+EY MWNT DVHFYSSFALIMLFPK++LSIQRDFAA+VMMHDPSKMKLL +G+W
Sbjct: 540 FLYLEGVEYFMWNTNDVHFYSSFALIMLFPKLELSIQRDFAASVMMHDPSKMKLLCNGKW 599
Query: 592 VSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAV 651
VSRKVLGAVPHD+G DPWFEVN Y LY+T RWKDLNPKFVLQVYRDVVATGDK FAKAV
Sbjct: 600 VSRKVLGAVPHDVGNYDPWFEVNGYNLYNTDRWKDLNPKFVLQVYRDVVATGDKNFAKAV 659
Query: 652 WPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAR 711
WPSVYVA+AYM+QFD+DGDGMIEN+GFPDQTYDTWSVSG+SAYSGGLWVAALQAASALAR
Sbjct: 660 WPSVYVALAYMNQFDKDGDGMIENEGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALAR 719
Query: 712 EVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL 771
VGD+GSEDYF FKFQKAK VY+KLWNGSYFNYD+S S SSSIQADQLAGQWYARACGL
Sbjct: 720 VVGDKGSEDYFWFKFQKAKGVYQKLWNGSYFNYDDSDGSSSSSIQADQLAGQWYARACGL 779
Query: 772 LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVA 831
PIVDEDKA+SAL+KVY+YNVLKV GGKRGAVNGMLPDG+VD ++MQSREIWSGVTY VA
Sbjct: 780 SPIVDEDKAKSALEKVYHYNVLKVGGGKRGAVNGMLPDGKVDTTTMQSREIWSGVTYGVA 839
Query: 832 ASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQ 891
A+MIHE L D+ FQTA G+YEAAWS GLGY+FQTPE+WNTDDQYRSLCYMRPLAIWAMQ
Sbjct: 840 ATMIHEGLVDMAFQTASGVYEAAWSQEGLGYSFQTPESWNTDDQYRSLCYMRPLAIWAMQ 899
Query: 892 WALTRPKPKTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRR 951
WA ++PK E PE+ ++SL+ HAGFS+VARLLKLP+E ++S LQ ++D+TC+R
Sbjct: 900 WAFSQPKLHKYEAN--PEMNEDSLVLQHAGFSRVARLLKLPDEDVSRSALQVIYDYTCKR 957
Query: 952 MFI 954
M++
Sbjct: 958 MWL 960
>gi|297795757|ref|XP_002865763.1| hypothetical protein ARALYDRAFT_331392 [Arabidopsis lyrata subsp.
lyrata]
gi|297311598|gb|EFH42022.1| hypothetical protein ARALYDRAFT_331392 [Arabidopsis lyrata subsp.
lyrata]
Length = 956
Score = 1439 bits (3725), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 678/958 (70%), Positives = 797/958 (83%), Gaps = 8/958 (0%)
Query: 1 MFEGKILENGLHEEEKEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQL 60
MFE KI+ G E+ + NSS KVDPA PASLTWQRK+ + P +F L KE QL
Sbjct: 1 MFEEKIITIG---EDVKSFNSSDHKVDPAVPASLTWQRKIDSDAKPPREFNLTAKEIFQL 57
Query: 61 APIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQI 120
AP+G+R+ L REEAAKG+ AFIDPF + +TSSHGVPLGG+G+GSIGRS++GEFQRWQ+
Sbjct: 58 APVGIRLWFLCREEAAKGRLAFIDPFSRHSVTSSHGVPLGGIGAGSIGRSFQGEFQRWQL 117
Query: 121 FPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKST 180
FP CE +PVLANQFS FVSR+NG+KYS+VLCP+ P++ K + +GIGSWDWNLKGDKST
Sbjct: 118 FPPKCEYEPVLANQFSAFVSRANGKKYSTVLCPRNPKMGKQDSESGIGSWDWNLKGDKST 177
Query: 181 YHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADIT 240
YHALYPR+WT++EGEPDPELRI+CRQ+SP IPHNYKESS+PVSVFT+T++N G T+A++T
Sbjct: 178 YHALYPRSWTMYEGEPDPELRIICRQVSPFIPHNYKESSFPVSVFTFTLHNLGNTTANVT 237
Query: 241 LLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHV 300
LLFTW NSVGGDSEF+G HYNSK MNDGV VLLHH+T++ LP +++A++AQ TDGV V
Sbjct: 238 LLFTWANSVGGDSEFSGGHYNSKIMMNDGVQGVLLHHKTANGLPSLSYAISAQATDGVSV 297
Query: 301 SLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPP 360
S CP F++SG G+TAKDMW +KE+GSFD + E S+ SE GSSIGAA+AAS+TV P
Sbjct: 298 SACPFFIVSGKQNGITAKDMWQAVKENGSFDHFKASEASMQSENGSSIGAAVAASLTVLP 357
Query: 361 DSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
VTFSLAWDCPEV F SGK Y RRYTKFYG H +AAA IA DAIL H WE IE
Sbjct: 358 GESRIVTFSLAWDCPEVQFPSGKIYSRRYTKFYGNHGDAAAQIAYDAILGHSQWESWIED 417
Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDL 480
WQRPILEDKRLP WYP+TLFNELYYLN+GG +WTDGS PVHSL + +KFSLD SQ L
Sbjct: 418 WQRPILEDKRLPAWYPVTLFNELYYLNSGGTLWTDGSSPVHSLAGVREKKFSLDKSQLGL 477
Query: 481 KRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEY 540
++VP+QNDTA+++L +M+S LEQ++ A NS+FGT LL++GEENIG FLYLEGIEY
Sbjct: 478 NNDIEVPHQNDTAISVLGKMASTLEQLHASTASNSAFGTKLLEEGEENIGHFLYLEGIEY 537
Query: 541 LMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAV 600
MWNTYDVHFY+SFAL+MLFPK++LSIQRDFAAAVM+HDP+K+K L +GQWV RKVLGAV
Sbjct: 538 RMWNTYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGAV 597
Query: 601 PHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMA 660
PHD+GI DPWFEVN Y L++T RWKDLNPKFVLQVYRDVVATGDKKFA AVWPSVYVAMA
Sbjct: 598 PHDLGINDPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMA 657
Query: 661 YMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSED 720
YM QFD+DGDGMIEN+GFPDQTYDTWS SG+SAY GGLWVAALQAAS LAR VGD+ S+D
Sbjct: 658 YMAQFDKDGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASVLARVVGDKNSQD 717
Query: 721 YFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDK 779
YF KFQKAKVVYE KLWNGSYFNYDNSGS SSSIQADQLAGQWYARA GLLPIVDEDK
Sbjct: 718 YFWSKFQKAKVVYEKKLWNGSYFNYDNSGSQYSSSIQADQLAGQWYARASGLLPIVDEDK 777
Query: 780 ARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDL 839
AR+AL+KVYN+NV+K+ GKRGAVNGM P+G+VD +SMQSREIWSGVTYA++A+MI E L
Sbjct: 778 ARTALEKVYNFNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGL 837
Query: 840 ADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
++ FQTA G+YEAAWS GLGY+FQTPEAWNT+D+YRSL YMRPLAIWAMQWALT+
Sbjct: 838 VEMAFQTASGVYEAAWSEAGLGYSFQTPEAWNTNDEYRSLTYMRPLAIWAMQWALTKTSQ 897
Query: 900 KT----LEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMF 953
K LE + +PEV SL+++ GFS+V+RLL LP E AKS LQ+LFD+TCRR+
Sbjct: 898 KQQQLGLEPEQEPEVETNSLMKHDIGFSRVSRLLNLPNEASAKSTLQTLFDYTCRRLM 955
>gi|224129460|ref|XP_002328722.1| predicted protein [Populus trichocarpa]
gi|222839020|gb|EEE77371.1| predicted protein [Populus trichocarpa]
Length = 922
Score = 1436 bits (3718), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 704/930 (75%), Positives = 805/930 (86%), Gaps = 10/930 (1%)
Query: 25 KVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFID 84
KVDP KPA LTWQR+L T E LSQFTL W+E +++APIG+R+ ++E A K K FID
Sbjct: 1 KVDPGKPAPLTWQRRLDTSETVLSQFTLTWQEILRMAPIGIRLWRYVKENAKKKKGIFID 60
Query: 85 PFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
PF KR++TSSHG+P+GGVGSGSIGRSYRGEFQRWQ+FPRV E+KPVLANQFS+FVSRSNG
Sbjct: 61 PFAKRNVTSSHGIPVGGVGSGSIGRSYRGEFQRWQLFPRV-EEKPVLANQFSIFVSRSNG 119
Query: 145 QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVC 204
+KY SVLC ++P+ L++ +GI SW+WNLKGD STYHALYPRAWTV+EGEPDPELR+VC
Sbjct: 120 KKYCSVLCSRSPDKLEEPAGSGIESWEWNLKGDNSTYHALYPRAWTVYEGEPDPELRVVC 179
Query: 205 RQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKT 264
RQISP+IPHNYKESS+PVSVFT+T+YNSG+T+AD+TLLFTW NSVGG SEF+GQH NS
Sbjct: 180 RQISPVIPHNYKESSFPVSVFTFTLYNSGETAADVTLLFTWANSVGGVSEFSGQHLNSTK 239
Query: 265 KMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEI 324
M+DGVH VLLHH+T+++LPP+TFA+AAQET GVHVS CP FVISGNS GLTAK+MW+E+
Sbjct: 240 MMDDGVHCVLLHHKTANELPPLTFAIAAQETPGVHVSKCPSFVISGNSQGLTAKEMWNEV 299
Query: 325 KEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKT 384
KEHGSFD LNS V SEPGSSIGAAIAA+ TVPPDS VTFSLAWDCPEV F SG+T
Sbjct: 300 KEHGSFDNLNSSGKPVPSEPGSSIGAAIAATSTVPPDSVCTVTFSLAWDCPEVIFASGRT 359
Query: 385 YYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELY 444
Y+RRYTKFYGTH +AAANIA DAIL HG W+ QIEAWQRPILEDKRLPEWYP+TLFNELY
Sbjct: 360 YHRRYTKFYGTHGDAAANIAHDAILGHGHWDSQIEAWQRPILEDKRLPEWYPVTLFNELY 419
Query: 445 YLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSIL 504
YLN+GG +WTDGS P+HSL T+G +KFSLD + SDL +Q DT+V+IL RM+S+L
Sbjct: 420 YLNSGGTIWTDGSSPLHSLATVGGKKFSLDRTGSDLG------HQGDTSVDILGRMTSVL 473
Query: 505 EQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQ 564
EQI+TP+A NS+ GTNLLQ+GEEN+GQFLYLEGIEY MWNTYDVHFY+SFALIMLFPK+Q
Sbjct: 474 EQIHTPLATNSALGTNLLQEGEENVGQFLYLEGIEYPMWNTYDVHFYASFALIMLFPKLQ 533
Query: 565 LSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARW 624
LSIQRDFAAAVMMHDPSKM LL DGQ V+RKVLGAVPHDIGI DPWFEVNAY L++T RW
Sbjct: 534 LSIQRDFAAAVMMHDPSKMHLLLDGQCVTRKVLGAVPHDIGIDDPWFEVNAYNLHNTDRW 593
Query: 625 KDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYD 684
KDLNPKFVLQVYRDVVATGDKKFA+A WPSVYVAMAYMDQFD+DGDGMIENDGFPDQTYD
Sbjct: 594 KDLNPKFVLQVYRDVVATGDKKFAQAAWPSVYVAMAYMDQFDKDGDGMIENDGFPDQTYD 653
Query: 685 TWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNY 744
TWSVSG+SAY GGLWVAALQAASALA EVGD+ S +YF F+FQKAKVVY+KLWNGSYFNY
Sbjct: 654 TWSVSGVSAYCGGLWVAALQAASALAWEVGDKDSAEYFWFRFQKAKVVYDKLWNGSYFNY 713
Query: 745 DNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVN 804
D+S S SIQADQLAGQWYARACGLLPIVDEDKARSAL+K+YNYN LKV GKRGAVN
Sbjct: 714 DDSNGRNSPSIQADQLAGQWYARACGLLPIVDEDKARSALEKIYNYNFLKVHDGKRGAVN 773
Query: 805 GMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAF 864
GMLPDG VDMS MQSREIWSGVTYAVAA+M+ E L D+ F TA G+YEAAW+ GLGY+F
Sbjct: 774 GMLPDGTVDMSDMQSREIWSGVTYAVAATMMQEGLMDMAFHTASGVYEAAWAEQGLGYSF 833
Query: 865 QTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDESLL--RYHAGF 922
QTPE WNT+ QYRSL YMRPLAIWAMQW L+ PK E + ++ D SLL ++HAGF
Sbjct: 834 QTPEGWNTNGQYRSLGYMRPLAIWAMQWTLSSPKLHKQEMNFQVKLED-SLLGHQHHAGF 892
Query: 923 SKVARLLKLPEEQGAKSLLQSLFDHTCRRM 952
+KVAR LKLPEE+ + S LQ+LFD+ C++
Sbjct: 893 AKVARFLKLPEEESSVSYLQALFDYACKKF 922
>gi|110742221|dbj|BAE99037.1| hypothetical protein [Arabidopsis thaliana]
Length = 957
Score = 1420 bits (3677), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 686/959 (71%), Positives = 803/959 (83%), Gaps = 9/959 (0%)
Query: 1 MFEGKILENGLHEEEKEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQL 60
MFE KI++ G E+ +PLNSS KVDPA PASLTWQRK+ + +F L+ KE QL
Sbjct: 1 MFEEKIMDIG---EDVKPLNSSDTKVDPAVPASLTWQRKIDSDVKAPREFNLSVKEIFQL 57
Query: 61 APIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQI 120
AP+G+R+ L REEAAKG+ AFIDPF K +TSSHGVPLGG+G+GSIGRS++GEFQRWQ+
Sbjct: 58 APVGIRLWFLCREEAAKGRLAFIDPFSKHSVTSSHGVPLGGIGAGSIGRSFKGEFQRWQL 117
Query: 121 FPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKST 180
FP CED+PVLANQFS FVSR+NG+KYSSVLCP+ P++ K + +GIGSWDWNLKGDKST
Sbjct: 118 FPPKCEDEPVLANQFSAFVSRANGKKYSSVLCPRNPKLDKQDSESGIGSWDWNLKGDKST 177
Query: 181 YHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADIT 240
YHALYPR+WT++EGEPDPELRIVCRQ+SP IPHNYKESS+PVSVFT+T++N G T+AD+T
Sbjct: 178 YHALYPRSWTMYEGEPDPELRIVCRQVSPFIPHNYKESSFPVSVFTFTLHNLGNTTADVT 237
Query: 241 LLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHV 300
LLFTW NSVGGDSEF+G HYNSK MNDGV VLLHH+T++ LP +++A++AQ TDGV V
Sbjct: 238 LLFTWANSVGGDSEFSGGHYNSKITMNDGVQGVLLHHKTANGLPSLSYAISAQATDGVSV 297
Query: 301 SLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPP 360
S CP F++SG G+TAKDMW +KE+GSFD L + E S+ S+ GSSIGAA+AASVTV P
Sbjct: 298 SACPFFIVSGKQDGITAKDMWQAVKENGSFDHLKASEASMQSDHGSSIGAAVAASVTVLP 357
Query: 361 DSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
VTFSLAWDCPEV F SGK Y RRYTKFYG + +AAA IA DAIL H WE IE
Sbjct: 358 GESRIVTFSLAWDCPEVQFPSGKIYSRRYTKFYGNNGDAAAQIAHDAILGHSQWESWIED 417
Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDL 480
WQRPILEDKRLP WYP+TLFNELYYLN+GG +WTDGS PVHSL + +KFSLD SQ L
Sbjct: 418 WQRPILEDKRLPAWYPVTLFNELYYLNSGGTLWTDGSSPVHSLAGVREKKFSLDKSQLGL 477
Query: 481 KRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEY 540
K +DVP+QNDTAV++LE+M+S LE+++ NS+FGT LL++GEENIG FLYLEGIEY
Sbjct: 478 KNDIDVPHQNDTAVSVLEKMASTLEELHASTTSNSAFGTKLLEEGEENIGHFLYLEGIEY 537
Query: 541 LMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAV 600
MWNTYDVHFY+SFAL+MLFPK++LSIQRDFAAAVM+HDP+K+K L +GQWV RKVLGAV
Sbjct: 538 RMWNTYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGAV 597
Query: 601 PHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMA 660
PHD+GI DPWFEVN Y L++T RWKDLNPKFVLQVYRDVVATGDKKFA AVWPSVYVAMA
Sbjct: 598 PHDLGINDPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMA 657
Query: 661 YMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSED 720
YM QFD+DGDGMIEN+GFPDQTYDTWS SG+SAY GGLWVAALQAASALAR VGD+ S+D
Sbjct: 658 YMAQFDKDGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGDKNSQD 717
Query: 721 YFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDK 779
YF KFQKAKVVYE KLWNGSYFNYDNSGS SS+IQADQLAGQWYARA GLLPIVDEDK
Sbjct: 718 YFWSKFQKAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGLLPIVDEDK 777
Query: 780 ARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDL 839
AR+AL+KVYNYNV+K+ GKRGAVNGM P+G+VD +SMQSREIWSGVTYA++A+MI E L
Sbjct: 778 ARTALEKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGL 837
Query: 840 ADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
++ FQTA GIYEAAWS TGLGY+FQTPE+WNT D+YRSL YMRPLAIWAMQWALT+
Sbjct: 838 VEMAFQTASGIYEAAWSKTGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQ 897
Query: 900 KTLE-----KQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMF 953
K + +Q +PE+ S +++ GFS+V+RLL LP E AKS LQ+LFD+TCRRM
Sbjct: 898 KQEQLGLEPEQQEPELEPSSSMKHDIGFSRVSRLLSLPNEASAKSTLQTLFDYTCRRMM 956
>gi|30695777|ref|NP_199801.2| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|27311753|gb|AAO00842.1| putative protein [Arabidopsis thaliana]
gi|34365725|gb|AAQ65174.1| At5g49900 [Arabidopsis thaliana]
gi|332008486|gb|AED95869.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
Length = 957
Score = 1420 bits (3677), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 686/959 (71%), Positives = 803/959 (83%), Gaps = 9/959 (0%)
Query: 1 MFEGKILENGLHEEEKEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQL 60
MFE KI++ G E+ +PLNSS KVDPA PASLTWQRK+ + +F L+ KE QL
Sbjct: 1 MFEEKIMDIG---EDVKPLNSSDTKVDPAVPASLTWQRKIDSDVKAPREFNLSVKEIFQL 57
Query: 61 APIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQI 120
AP+G+R+ L REEAAKG+ AFIDPF K +TSSHGVPLGG+G+GSIGRS++GEFQRWQ+
Sbjct: 58 APVGIRLWFLCREEAAKGRLAFIDPFSKHSVTSSHGVPLGGIGAGSIGRSFKGEFQRWQL 117
Query: 121 FPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKST 180
FP CED+PVLANQFS FVSR+NG+KYSSVLCP+ P++ K + +GIGSWDWNLKGDKST
Sbjct: 118 FPPKCEDEPVLANQFSAFVSRANGKKYSSVLCPRNPKLDKQDSESGIGSWDWNLKGDKST 177
Query: 181 YHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADIT 240
YHALYPR+WT++EGEPDPELRIVCRQ+SP IPHNYKESS+PVSVFT+T++N G T+AD+T
Sbjct: 178 YHALYPRSWTMYEGEPDPELRIVCRQVSPFIPHNYKESSFPVSVFTFTLHNLGNTTADVT 237
Query: 241 LLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHV 300
LLFTW NSVGGDSEF+G HYNSK MNDGV VLLHH+T++ LP +++A++AQ TDGV V
Sbjct: 238 LLFTWANSVGGDSEFSGGHYNSKITMNDGVQGVLLHHKTANGLPSLSYAISAQATDGVSV 297
Query: 301 SLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPP 360
S CP F++SG G+TAKDMW +KE+GSFD L + E S+ S+ GSSIGAA+AASVTV P
Sbjct: 298 SACPFFIVSGKQDGITAKDMWQAVKENGSFDHLKASEASMQSDHGSSIGAAVAASVTVLP 357
Query: 361 DSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
VTFSLAWDCPEV F SGK Y RRYTKFYG + +AAA IA DAIL H WE IE
Sbjct: 358 GESRIVTFSLAWDCPEVQFPSGKIYSRRYTKFYGNNGDAAAQIAHDAILGHSQWESWIED 417
Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDL 480
WQRPILEDKRLP WYP+TLFNELYYLN+GG +WTDGS PVHSL + +KFSLD SQ L
Sbjct: 418 WQRPILEDKRLPAWYPVTLFNELYYLNSGGTLWTDGSSPVHSLAGVREKKFSLDKSQLGL 477
Query: 481 KRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEY 540
K +DVP+QNDTAV++LE+M+S LE+++ NS+FGT LL++GEENIG FLYLEGIEY
Sbjct: 478 KNDIDVPHQNDTAVSVLEKMASTLEELHASTTSNSAFGTKLLEEGEENIGHFLYLEGIEY 537
Query: 541 LMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAV 600
MWNTYDVHFY+SFAL+MLFPK++LSIQRDFAAAVM+HDP+K+K L +GQWV RKVLGAV
Sbjct: 538 RMWNTYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGAV 597
Query: 601 PHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMA 660
PHD+GI DPWFEVN Y L++T RWKDLNPKFVLQVYRDVVATGDKKFA AVWPSVYVAMA
Sbjct: 598 PHDLGINDPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMA 657
Query: 661 YMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSED 720
YM QFD+DGDGMIEN+GFPDQTYDTWS SG+SAY GGLWVAALQAASALAR VGD+ S+D
Sbjct: 658 YMAQFDKDGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGDKNSQD 717
Query: 721 YFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDK 779
YF KFQKAKVVYE KLWNGSYFNYDNSGS SS+IQADQLAGQWYARA GLLPIVDEDK
Sbjct: 718 YFWSKFQKAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGLLPIVDEDK 777
Query: 780 ARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDL 839
AR+AL+KVYNYNV+K+ GKRGAVNGM P+G+VD +SMQSREIWSGVTYA++A+MI E L
Sbjct: 778 ARTALEKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGL 837
Query: 840 ADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
++ FQTA GIYEAAWS TGLGY+FQTPE+WNT D+YRSL YMRPLAIWAMQWALT+
Sbjct: 838 VEMAFQTASGIYEAAWSETGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQ 897
Query: 900 KTLE-----KQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMF 953
K + +Q +PE+ S +++ GFS+V+RLL LP E AKS LQ+LFD+TCRRM
Sbjct: 898 KQEQLGLEPEQQEPELEPSSSMKHDIGFSRVSRLLSLPNEASAKSTLQTLFDYTCRRMM 956
>gi|224120058|ref|XP_002331126.1| predicted protein [Populus trichocarpa]
gi|222872854|gb|EEF09985.1| predicted protein [Populus trichocarpa]
Length = 948
Score = 1417 bits (3667), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 700/953 (73%), Positives = 800/953 (83%), Gaps = 9/953 (0%)
Query: 1 MFEGKILENGLHEEEKEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQL 60
MFE KI NG EE+ EP N DKVD KPA LTWQRKL E LSQFTL+ +E + +
Sbjct: 1 MFERKISGNGFDEEQGEPSNYPVDKVDAGKPAPLTWQRKLDGDETVLSQFTLSLQEKLLM 60
Query: 61 APIGVRILCLIREEAAKGKRAF-IDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQ 119
APIG+R+ IREE A+ + F +DPF KR++TS G+P+GG+GSGSIGRSY+GEFQRWQ
Sbjct: 61 APIGIRLWRHIREENARKRGGFYMDPFAKRNVTSCLGIPVGGIGSGSIGRSYKGEFQRWQ 120
Query: 120 IFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKS 179
+FPR+CE+KPVLANQFS+FVSRSNG+KYSSVLC ++P+VL++ +GI SWDWNLKG+ S
Sbjct: 121 LFPRICEEKPVLANQFSIFVSRSNGKKYSSVLCSRSPDVLEEAAGSGIESWDWNLKGNNS 180
Query: 180 TYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADI 239
TYHALYPRAWTV+EGEPDPELR+VCRQISPIIPHNYKESS+P SVFT+ +YNSGKTSAD+
Sbjct: 181 TYHALYPRAWTVYEGEPDPELRVVCRQISPIIPHNYKESSFPASVFTFKLYNSGKTSADV 240
Query: 240 TLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVH 299
TLLFTW NSVGG SEF+GQH NS M DGVH LL+H+T++ LP V+FA+AAQET VH
Sbjct: 241 TLLFTWANSVGGVSEFSGQHLNSTKMMEDGVHCALLNHKTANGLPSVSFAIAAQETPVVH 300
Query: 300 VSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVP 359
VS CP FVISGNS G+TAK+MW+E+KEHGSFD LNS V SEPGSSIGAA+AA+ TVP
Sbjct: 301 VSKCPCFVISGNSQGVTAKEMWNEVKEHGSFDNLNSPGKLVPSEPGSSIGAAVAATSTVP 360
Query: 360 PDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIE 419
PD VTFSLAWDCP+V F SG+TY+RRYTKFYGTH +AAANIA DAILEHG WE +IE
Sbjct: 361 PDGVCTVTFSLAWDCPDVKFGSGRTYHRRYTKFYGTHGDAAANIAHDAILEHGLWESEIE 420
Query: 420 AWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSD 479
AWQRPILEDKRLPEWY TLFNELYYLN+GG VWTDGSPP SL TI KFSLD + S+
Sbjct: 421 AWQRPILEDKRLPEWYAGTLFNELYYLNSGGTVWTDGSPPFRSLATIEGSKFSLDRAGSN 480
Query: 480 LKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIE 539
L +Q DTAV+IL RM+S LE+I+TP+ NS+FGTNLLQ+GEENIGQFLYLEGIE
Sbjct: 481 LG------HQGDTAVDILGRMTSALEEIHTPLTTNSAFGTNLLQEGEENIGQFLYLEGIE 534
Query: 540 YLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGA 599
Y MWNTYDVHFY+SFA IMLFPK+QLSIQRDFAAAVMMHDPS M LL DGQ V RKV+GA
Sbjct: 535 YHMWNTYDVHFYASFAFIMLFPKLQLSIQRDFAAAVMMHDPSMMPLLHDGQRVPRKVIGA 594
Query: 600 VPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAM 659
VPHDIGI DPWFEVNAY L++T RWKDLNPKFVLQVYRDV+ATGDKKFA+AVWPSVYVAM
Sbjct: 595 VPHDIGIHDPWFEVNAYNLHNTDRWKDLNPKFVLQVYRDVIATGDKKFARAVWPSVYVAM 654
Query: 660 AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSE 719
AYMDQFDRDGDGMIENDGFPDQTYDTWS+SG+SAY GGLWVAALQAASALA EVGD+ S
Sbjct: 655 AYMDQFDRDGDGMIENDGFPDQTYDTWSLSGVSAYCGGLWVAALQAASALAWEVGDKDSA 714
Query: 720 DYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDK 779
+YF +FQKAK+VY KLWNGSYFNYDNSG SSSIQADQLAGQWYARACGL PIVDEDK
Sbjct: 715 EYFWCRFQKAKIVYGKLWNGSYFNYDNSGGINSSSIQADQLAGQWYARACGLSPIVDEDK 774
Query: 780 ARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDL 839
AR AL+K+YNYNVLKV GKRGAVNGMLPDG VD+S +QSREIWSGVTYAVAA+MI E L
Sbjct: 775 ARCALEKIYNYNVLKVQDGKRGAVNGMLPDGTVDLSCLQSREIWSGVTYAVAATMIQEGL 834
Query: 840 ADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
D+ F TA G+YEA W+ GLGY+FQTPE WNT DQYRSLCYMRPLAIWAMQWAL+R P
Sbjct: 835 TDMAFHTASGVYEAVWAEQGLGYSFQTPEGWNTTDQYRSLCYMRPLAIWAMQWALSR--P 892
Query: 900 KTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRM 952
+ ++MK + ++S+ +HAGF+KVAR LKLP + +KS QSLF++ +
Sbjct: 893 ELHRQEMKLQAEEDSVPVHHAGFAKVARFLKLPHAESSKSHFQSLFEYATTKF 945
>gi|222423179|dbj|BAH19567.1| AT5G49900 [Arabidopsis thaliana]
Length = 957
Score = 1416 bits (3665), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 685/959 (71%), Positives = 801/959 (83%), Gaps = 9/959 (0%)
Query: 1 MFEGKILENGLHEEEKEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQL 60
MFE KI++ G E+ +PLNSS KVDPA PASLTWQRK+ + +F L+ KE QL
Sbjct: 1 MFEEKIMDIG---EDVKPLNSSDTKVDPAVPASLTWQRKIDSDVKAPREFNLSVKEIFQL 57
Query: 61 APIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQI 120
AP+G+R+ L REEAAKG+ AFIDPF K +TSSHGVPLGG+G+GSIGRS++GEFQRWQ+
Sbjct: 58 APVGIRLWFLCREEAAKGRLAFIDPFSKHSVTSSHGVPLGGIGAGSIGRSFKGEFQRWQL 117
Query: 121 FPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKST 180
FP CED+PVLANQFS FVSR+NG+KYSSVLCP+ P++ K + +GIGSWDWNLKGDKST
Sbjct: 118 FPPKCEDEPVLANQFSAFVSRANGKKYSSVLCPRNPKLDKQDSESGIGSWDWNLKGDKST 177
Query: 181 YHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADIT 240
YHALYPR+WT++EGEPDPELRIVCRQ+SP IPHNYKESS+PVSVFT+T++N G T+AD+T
Sbjct: 178 YHALYPRSWTMYEGEPDPELRIVCRQVSPFIPHNYKESSFPVSVFTFTLHNLGNTTADVT 237
Query: 241 LLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHV 300
LLFTW NSVGGDSEF+G HYNSK MNDGV VLLHH+T++ LP +++A++AQ TDGV V
Sbjct: 238 LLFTWANSVGGDSEFSGGHYNSKITMNDGVQGVLLHHKTANGLPSLSYAISAQATDGVSV 297
Query: 301 SLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPP 360
S CP F++SG G+TAKDMW +KE+GSFD L + E S+ S+ GSSIGAA+AASVTV P
Sbjct: 298 SACPFFIVSGKQDGITAKDMWQAVKENGSFDHLKASEASMQSDHGSSIGAAVAASVTVLP 357
Query: 361 DSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
VTFSLAWDCPEV F SGK Y RRYTKFYG + +AAA IA DAIL H W IE
Sbjct: 358 GESRIVTFSLAWDCPEVQFPSGKIYSRRYTKFYGNNGDAAAQIAHDAILGHSQWGSWIED 417
Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDL 480
WQRPILEDKRLP WYP+TLFNELYYLN+GG +WTDGS PVHSL + +KFSLD SQ L
Sbjct: 418 WQRPILEDKRLPAWYPVTLFNELYYLNSGGTLWTDGSSPVHSLAGVREKKFSLDKSQLGL 477
Query: 481 KRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEY 540
K +DVP+QNDTAV++LE+M+S LE ++ NS+FGT LL++GEENIG FLYLEGIEY
Sbjct: 478 KNDIDVPHQNDTAVSVLEKMASTLEGLHASTTSNSAFGTKLLEEGEENIGHFLYLEGIEY 537
Query: 541 LMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAV 600
MWNTYDVHFY+SFAL+MLFPK++LSIQRDFAAAVM+HDP+K+K L +GQWV RKVLGAV
Sbjct: 538 RMWNTYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGAV 597
Query: 601 PHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMA 660
PHD+GI DPWFEVN Y L++T RWKDLNPKFVLQVYRDVVATGDKKFA AVWPSVYVAMA
Sbjct: 598 PHDLGINDPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMA 657
Query: 661 YMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSED 720
YM QFD+DGDGMIEN+GFPDQTYDTWS SG+SAY GGLWVAALQAASALAR VGD+ S+D
Sbjct: 658 YMAQFDKDGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGDKNSQD 717
Query: 721 YFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDK 779
YF KFQKAKVVYE KLWNGSYFNYDNSGS SS+IQADQLAGQWYARA GLLPIVDEDK
Sbjct: 718 YFWSKFQKAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGLLPIVDEDK 777
Query: 780 ARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDL 839
AR+AL+KVYNYNV+K+ GKRGAVNGM P+G+VD +SMQSREIWSGVTYA++A+MI E L
Sbjct: 778 ARTALEKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGL 837
Query: 840 ADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
++ FQTA GIYEAAWS TGLGY+FQTPE+WNT D+YRSL YMRPLAIWAMQWALT+
Sbjct: 838 VEMAFQTASGIYEAAWSETGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQ 897
Query: 900 KTLE-----KQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMF 953
K + +Q +PE+ S +++ GFS+V+RLL LP E AKS LQ+LFD+TCRRM
Sbjct: 898 KQEQLGLEPEQQEPELEPSSSMKHDIGFSRVSRLLSLPNEASAKSTLQTLFDYTCRRMM 956
>gi|225460125|ref|XP_002275782.1| PREDICTED: non-lysosomal glucosylceramidase-like [Vitis vinifera]
Length = 969
Score = 1381 bits (3574), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 684/951 (71%), Positives = 797/951 (83%), Gaps = 7/951 (0%)
Query: 7 LENGLHEEE---KEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPI 63
+ENG H+E+ + P+ SS KV+P KPASLTWQRKL+T L++F L +E LAP+
Sbjct: 17 MENG-HKEDGDMEHPVKSSAHKVNPGKPASLTWQRKLNTKANTLTRFNLKLREIKHLAPL 75
Query: 64 GVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPR 123
GVR+ + EAAKG+ + IDPF KR +TS HGVPLGG+G GSIGRSYRGEFQR+Q+FPR
Sbjct: 76 GVRLWHHVNAEAAKGRISIIDPFSKRLVTSYHGVPLGGIGGGSIGRSYRGEFQRYQLFPR 135
Query: 124 VCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHA 183
+CED PVLANQFSVFVSR NG+K S+VLCP+ PEVLK + ++GIGSWDWNL G+ TYHA
Sbjct: 136 ICEDSPVLANQFSVFVSRPNGKKSSTVLCPRNPEVLKGSASSGIGSWDWNLDGESCTYHA 195
Query: 184 LYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLF 243
LYPRAWTV+EGEPDPE+ I+ QISP IPHNYKESS+PVSVF +T+ NSGKTSADITLLF
Sbjct: 196 LYPRAWTVYEGEPDPEISIISSQISPFIPHNYKESSFPVSVFKFTLSNSGKTSADITLLF 255
Query: 244 TWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLC 303
TW NSVGG SEF+G HYNSK K DGVH VLLHH+T++ PPVTFA+AA+ET VH+S C
Sbjct: 256 TWANSVGGTSEFSGHHYNSKMKTKDGVHGVLLHHKTANGHPPVTFAIAAEETGDVHISEC 315
Query: 304 PHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSE 363
P F+ISGNSLG+TAK+MW EIK+HGSFD L+ +S+ SEPGSSIGAA+AAS+T+PPD+
Sbjct: 316 PCFLISGNSLGVTAKEMWQEIKQHGSFDHLDFDGSSMRSEPGSSIGAAVAASLTLPPDTV 375
Query: 364 GQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQR 423
VTFSLAW CPEV F SGKTY+RRYT+FYGTH +AA IA DAILEH +W +IEAWQ
Sbjct: 376 RTVTFSLAWACPEVRFTSGKTYHRRYTRFYGTHVDAAEEIAHDAILEHANWVSEIEAWQG 435
Query: 424 PILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRI 483
PILED+RLPEWY ITLFNELY+LNAGG +WTDG PP+ SL TI KFSLD S SD K
Sbjct: 436 PILEDRRLPEWYRITLFNELYFLNAGGTIWTDGLPPMQSLATIEQIKFSLDRSISDPKNT 495
Query: 484 VDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMW 543
D+ +QND+ V IL RM+S+LEQI+ P NS+FGT LLQ GEEN+GQFLYLEGIEY MW
Sbjct: 496 TDIVHQNDSTVEILGRMTSMLEQIHNPTTSNSAFGTYLLQSGEENVGQFLYLEGIEYHMW 555
Query: 544 NTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHD 603
NTYDVHFYSSFA+IMLFP+++LSIQRDFAAAVM+HDPS+MK++ DG+WV RKVLGAVPHD
Sbjct: 556 NTYDVHFYSSFAIIMLFPQLELSIQRDFAAAVMVHDPSRMKIMSDGKWVPRKVLGAVPHD 615
Query: 604 IGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMD 663
IGI DPWFE+NAY LYDT RWKDLN KFVLQVYRD+VATGDK FA+AVWP+VY+A+A++D
Sbjct: 616 IGISDPWFELNAYNLYDTDRWKDLNSKFVLQVYRDMVATGDKNFARAVWPAVYIAIAFLD 675
Query: 664 QFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFL 723
QFD+DGDGMIENDGFPDQTYD WSV+G+SAY GGLWVAALQAASA+AREVGD + DYF
Sbjct: 676 QFDKDGDGMIENDGFPDQTYDAWSVTGVSAYCGGLWVAALQAASAMAREVGDSMTADYFW 735
Query: 724 FKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSA 783
FKFQKAK VY+KLWNGSYFNYDNSG S SSSIQADQLAGQWYARACGL PIVD++KARSA
Sbjct: 736 FKFQKAKAVYDKLWNGSYFNYDNSGGSSSSSIQADQLAGQWYARACGLQPIVDDEKARSA 795
Query: 784 LKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIG 843
L+KVYN+NVLKV GK GAVNGMLPDGRVDMS+MQSREIW+GVTY+VAA+MIHE + +
Sbjct: 796 LEKVYNFNVLKVKEGKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETA 855
Query: 844 FQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLE 903
F TA GIY+AAWS GLGY+FQTPEAWNTD++YRSLCYMRPLAIWAMQWAL+ KP+
Sbjct: 856 FNTASGIYDAAWSQEGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALS--KPELHN 913
Query: 904 KQMKPEVTDESL-LRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMF 953
MK E +L +H GF KVA LLKLPEE+ +KS LQ FD TCRR++
Sbjct: 914 HDMKHEEGKGTLNFEHHVGFEKVAHLLKLPEEEASKSFLQLFFDLTCRRLY 964
>gi|356520149|ref|XP_003528727.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal
glucosylceramidase-like [Glycine max]
Length = 944
Score = 1380 bits (3573), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 673/953 (70%), Positives = 798/953 (83%), Gaps = 13/953 (1%)
Query: 6 ILENGLHEEEKEPLNSSFDK---VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAP 62
++ENG +++ P S V+P KPA LTWQRKL+ SQ +L+ KE + LAP
Sbjct: 1 MVENGFVQDDITPPPKSCSNKKMVEPGKPAGLTWQRKLNNNGNASSQISLSLKEIIHLAP 60
Query: 63 IGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFP 122
IG R+ REEAAKG+ IDPF KRH+T HGVPLGG+G+GSIGRS+RGEFQRWQ+FP
Sbjct: 61 IGYRLWRHCREEAAKGRGGMIDPFAKRHVTFCHGVPLGGIGAGSIGRSFRGEFQRWQLFP 120
Query: 123 RVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYH 182
+CE+KPVLANQFSVFVSR +G+KYSSVLCP+ P ++K +GI SWDWN+ G+ STYH
Sbjct: 121 VICEEKPVLANQFSVFVSRPSGEKYSSVLCPRKPVIIKQNPVSGIESWDWNINGNSSTYH 180
Query: 183 ALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLL 242
ALYPRAWTV+E EPDP LRIVCRQISP+IPHNYKESS+PV+VFT+T+ N G T+AD+TLL
Sbjct: 181 ALYPRAWTVYE-EPDPALRIVCRQISPVIPHNYKESSFPVTVFTFTLKNLGNTTADVTLL 239
Query: 243 FTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSL 302
FTWTNSVGG SEFTG H+NSKT NDGVHAVLLHH+T+++ PVTFA+AA+ET+ VH+S
Sbjct: 240 FTWTNSVGGISEFTGDHFNSKTTRNDGVHAVLLHHKTANERSPVTFAIAAEETEDVHISE 299
Query: 303 CPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDS 362
CP FVISG ++AK+MW+E+K+HGSFD LN ETS SEPGSSIGAAIAA+VT+P ++
Sbjct: 300 CPVFVISGAYSSISAKNMWNEVKQHGSFDHLNFAETSAPSEPGSSIGAAIAATVTIPSNA 359
Query: 363 EGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQ 422
+ VTFSLAWDCPEV F GKTYYRRYTKFYGT+ +AAA+IA DAI+EH WE QI+ WQ
Sbjct: 360 QRIVTFSLAWDCPEVKFPEGKTYYRRYTKFYGTNGDAAADIAHDAIMEHCQWEAQIDDWQ 419
Query: 423 RPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKR 482
RPILEDKR PEWYP TL NELYYLN+GG +WTDGS PVH LV G RKFSLD ++L
Sbjct: 420 RPILEDKRFPEWYPTTLLNELYYLNSGGTIWTDGSLPVHGLVNTGERKFSLDGFNNNLS- 478
Query: 483 IVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLM 542
++ND A+NILE +S++EQ ++P A S++G NLLQ+GEENIGQFLYLEGIEY M
Sbjct: 479 -----HKNDIAINILEMFNSVVEQTHSPPASKSAYGVNLLQEGEENIGQFLYLEGIEYQM 533
Query: 543 WNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPH 602
WNTYDVHFYSSF+L+MLFPK++LSIQRDFAAAV+MHDPSKMKLL DGQW RKVLGAVPH
Sbjct: 534 WNTYDVHFYSSFSLVMLFPKLELSIQRDFAAAVLMHDPSKMKLLYDGQWAPRKVLGAVPH 593
Query: 603 DIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
DIG+ DPWFEVN Y LY+T RWKDLNPKFVLQ+YRDVVATGDKKFA+AVWP+VY+A+AYM
Sbjct: 594 DIGLNDPWFEVNGYNLYNTDRWKDLNPKFVLQIYRDVVATGDKKFAQAVWPAVYIAIAYM 653
Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
DQFD+DGDGMIEN+GFPDQTYDTWSVSG+SAYSGGLWVAALQAASALAREVGD+GSEDYF
Sbjct: 654 DQFDKDGDGMIENEGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALAREVGDKGSEDYF 713
Query: 723 LFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARS 782
KFQKAK VYE LWNGSYFNYD+SG S S SIQADQLAGQWYARACGLLPIV+E K+RS
Sbjct: 714 WLKFQKAKSVYENLWNGSYFNYDSSGGSSSLSIQADQLAGQWYARACGLLPIVEEKKSRS 773
Query: 783 ALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADI 842
AL+ VYN+NV+KV G+RGAVNGMLPDG+VDMS+MQSREIWSGVTYA+AA+MI E++ D+
Sbjct: 774 ALQIVYNHNVMKVHXGRRGAVNGMLPDGKVDMSTMQSREIWSGVTYALAATMIQENMIDM 833
Query: 843 GFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTL 902
FQTA G+YE AWS GLGY+FQTPEAW T D+YRSLCYMRPLAIWAMQW L+R K +
Sbjct: 834 AFQTAGGVYETAWSNNGLGYSFQTPEAWTTKDEYRSLCYMRPLAIWAMQWELSR--AKHI 891
Query: 903 EKQMKPEVTDESLL-RYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMFI 954
+ + K ++ ++ +L RYH GFSKVARLLK+ EE ++SL Q ++D TC+RM++
Sbjct: 892 QHESKSDMKEKDMLSRYHDGFSKVARLLKVKEETDSRSLFQVIYDFTCKRMWV 944
>gi|297741007|emb|CBI31319.3| unnamed protein product [Vitis vinifera]
Length = 953
Score = 1380 bits (3571), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 684/951 (71%), Positives = 797/951 (83%), Gaps = 7/951 (0%)
Query: 7 LENGLHEEE---KEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPI 63
+ENG H+E+ + P+ SS KV+P KPASLTWQRKL+T L++F L +E LAP+
Sbjct: 1 MENG-HKEDGDMEHPVKSSAHKVNPGKPASLTWQRKLNTKANTLTRFNLKLREIKHLAPL 59
Query: 64 GVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPR 123
GVR+ + EAAKG+ + IDPF KR +TS HGVPLGG+G GSIGRSYRGEFQR+Q+FPR
Sbjct: 60 GVRLWHHVNAEAAKGRISIIDPFSKRLVTSYHGVPLGGIGGGSIGRSYRGEFQRYQLFPR 119
Query: 124 VCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHA 183
+CED PVLANQFSVFVSR NG+K S+VLCP+ PEVLK + ++GIGSWDWNL G+ TYHA
Sbjct: 120 ICEDSPVLANQFSVFVSRPNGKKSSTVLCPRNPEVLKGSASSGIGSWDWNLDGESCTYHA 179
Query: 184 LYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLF 243
LYPRAWTV+EGEPDPE+ I+ QISP IPHNYKESS+PVSVF +T+ NSGKTSADITLLF
Sbjct: 180 LYPRAWTVYEGEPDPEISIISSQISPFIPHNYKESSFPVSVFKFTLSNSGKTSADITLLF 239
Query: 244 TWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLC 303
TW NSVGG SEF+G HYNSK K DGVH VLLHH+T++ PPVTFA+AA+ET VH+S C
Sbjct: 240 TWANSVGGTSEFSGHHYNSKMKTKDGVHGVLLHHKTANGHPPVTFAIAAEETGDVHISEC 299
Query: 304 PHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSE 363
P F+ISGNSLG+TAK+MW EIK+HGSFD L+ +S+ SEPGSSIGAA+AAS+T+PPD+
Sbjct: 300 PCFLISGNSLGVTAKEMWQEIKQHGSFDHLDFDGSSMRSEPGSSIGAAVAASLTLPPDTV 359
Query: 364 GQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQR 423
VTFSLAW CPEV F SGKTY+RRYT+FYGTH +AA IA DAILEH +W +IEAWQ
Sbjct: 360 RTVTFSLAWACPEVRFTSGKTYHRRYTRFYGTHVDAAEEIAHDAILEHANWVSEIEAWQG 419
Query: 424 PILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRI 483
PILED+RLPEWY ITLFNELY+LNAGG +WTDG PP+ SL TI KFSLD S SD K
Sbjct: 420 PILEDRRLPEWYRITLFNELYFLNAGGTIWTDGLPPMQSLATIEQIKFSLDRSISDPKNT 479
Query: 484 VDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMW 543
D+ +QND+ V IL RM+S+LEQI+ P NS+FGT LLQ GEEN+GQFLYLEGIEY MW
Sbjct: 480 TDIVHQNDSTVEILGRMTSMLEQIHNPTTSNSAFGTYLLQSGEENVGQFLYLEGIEYHMW 539
Query: 544 NTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHD 603
NTYDVHFYSSFA+IMLFP+++LSIQRDFAAAVM+HDPS+MK++ DG+WV RKVLGAVPHD
Sbjct: 540 NTYDVHFYSSFAIIMLFPQLELSIQRDFAAAVMVHDPSRMKIMSDGKWVPRKVLGAVPHD 599
Query: 604 IGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMD 663
IGI DPWFE+NAY LYDT RWKDLN KFVLQVYRD+VATGDK FA+AVWP+VY+A+A++D
Sbjct: 600 IGISDPWFELNAYNLYDTDRWKDLNSKFVLQVYRDMVATGDKNFARAVWPAVYIAIAFLD 659
Query: 664 QFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFL 723
QFD+DGDGMIENDGFPDQTYD WSV+G+SAY GGLWVAALQAASA+AREVGD + DYF
Sbjct: 660 QFDKDGDGMIENDGFPDQTYDAWSVTGVSAYCGGLWVAALQAASAMAREVGDSMTADYFW 719
Query: 724 FKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSA 783
FKFQKAK VY+KLWNGSYFNYDNSG S SSSIQADQLAGQWYARACGL PIVD++KARSA
Sbjct: 720 FKFQKAKAVYDKLWNGSYFNYDNSGGSSSSSIQADQLAGQWYARACGLQPIVDDEKARSA 779
Query: 784 LKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIG 843
L+KVYN+NVLKV GK GAVNGMLPDGRVDMS+MQSREIW+GVTY+VAA+MIHE + +
Sbjct: 780 LEKVYNFNVLKVKEGKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETA 839
Query: 844 FQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLE 903
F TA GIY+AAWS GLGY+FQTPEAWNTD++YRSLCYMRPLAIWAMQWAL+ KP+
Sbjct: 840 FNTASGIYDAAWSQEGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALS--KPELHN 897
Query: 904 KQMKPEVTDESL-LRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMF 953
MK E +L +H GF KVA LLKLPEE+ +KS LQ FD TCRR++
Sbjct: 898 HDMKHEEGKGTLNFEHHVGFEKVAHLLKLPEEEASKSFLQLFFDLTCRRLY 948
>gi|449463809|ref|XP_004149624.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
gi|449494604|ref|XP_004159595.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
Length = 951
Score = 1354 bits (3504), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 683/956 (71%), Positives = 800/956 (83%), Gaps = 8/956 (0%)
Query: 1 MFEGKILENGLHEEEKEPLNSSFDK---VDPAKPASLTWQRKLSTGEIPLSQFTLNWKET 57
M EGK L NGL E++ E +N S + +DP + SLTWQRKL+ L QF L+ KE
Sbjct: 1 MSEGKKLGNGLVEQD-EDINKSLTEKTGIDPGQTPSLTWQRKLNREGTSLWQFRLHLKEI 59
Query: 58 VQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQR 117
+ LAP+G R+ IREE+ KG+ A I+PFI+R +T +H +PLGG+GSGSIGRSYRGEFQR
Sbjct: 60 IHLAPVGYRLFRYIREESVKGRGAMINPFIRRRITDTHAIPLGGLGSGSIGRSYRGEFQR 119
Query: 118 WQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGD 177
WQ+FPR CEDKP+LANQFSVFVSR N +KYS+VLC ++PE ++ ++GIGSWDWNLKG
Sbjct: 120 WQLFPRKCEDKPILANQFSVFVSRPNAEKYSTVLCAQSPETQREVESSGIGSWDWNLKGH 179
Query: 178 KSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSA 237
STYHALYPRAWT+++GEPDPELRIVCRQISP+IPHNYKESSYPVSVFT+T++NSGKT A
Sbjct: 180 SSTYHALYPRAWTIYDGEPDPELRIVCRQISPVIPHNYKESSYPVSVFTFTLHNSGKTDA 239
Query: 238 DITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDG 297
D++LLFTW NSVGG SE++G H NS+TK DGVH VLLHH+T+ PPVT+A+AAQE +G
Sbjct: 240 DVSLLFTWANSVGGLSEYSGNHINSRTKKKDGVHTVLLHHKTASGFPPVTYAIAAQEGNG 299
Query: 298 VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVT 357
VHVS CP FVISGNS G++AKDMW EIKEHGSFDRL + S+ SE GSSIGAA++ASVT
Sbjct: 300 VHVSDCPCFVISGNSQGISAKDMWLEIKEHGSFDRLKFADMSMPSEVGSSIGAAVSASVT 359
Query: 358 VPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQ 417
V DS VTFSL+WDCPEVNF GKTY+RRYTKFYG +AAA+IARDAILEH WE Q
Sbjct: 360 VSSDSVRTVTFSLSWDCPEVNFCRGKTYHRRYTKFYGNLGDAAADIARDAILEHHHWESQ 419
Query: 418 IEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQ 477
I+AWQRP+LEDKR P+WYP+TLFNELYYLNAGG +WTDGS P+ SLV+IG R F LD +
Sbjct: 420 IDAWQRPVLEDKRFPKWYPVTLFNELYYLNAGGTIWTDGSLPIQSLVSIGERGFCLDQCR 479
Query: 478 SDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEG 537
S I + ++NDTA +IL RM+S L+++ VA NS+FG NLLQ GEEN+GQFLYLEG
Sbjct: 480 SGDSVIYE-SHENDTANDILGRMTSRLDELRDSVASNSAFGVNLLQKGEENVGQFLYLEG 538
Query: 538 IEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVL 597
+EY MWNTYDVHFYSSFA+IMLFPK++LSIQRDFAAAVMMHDPSKM+LLD+G+W +R VL
Sbjct: 539 VEYTMWNTYDVHFYSSFAIIMLFPKLELSIQRDFAAAVMMHDPSKMQLLDNGKWEARNVL 598
Query: 598 GAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYV 657
GAVPHDIG+ DPWFEVN Y LY+T RWKDLNPKFVLQ+YRDVVATGD KFAKAVWP VY+
Sbjct: 599 GAVPHDIGVNDPWFEVNGYNLYNTDRWKDLNPKFVLQIYRDVVATGDMKFAKAVWPPVYL 658
Query: 658 AMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
A+AYMDQFDRDGDGMIENDGFPDQTYDTWSVSG+SAYSGGLWVAALQAASALAR ++
Sbjct: 659 AIAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALARVADEKD 718
Query: 718 SEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDE 777
+E YF FKFQKAK YEKLWNGSYFNYD+SG S SSSIQADQLAGQWYARA GLLPIVDE
Sbjct: 719 AEHYFWFKFQKAKRAYEKLWNGSYFNYDSSGGSSSSSIQADQLAGQWYARASGLLPIVDE 778
Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
DKA+SAL KVYNYNV+KV GKRGAVNGMLPDG +D SSMQSREIWSGVTYAVAASMIHE
Sbjct: 779 DKAKSALAKVYNYNVMKVKDGKRGAVNGMLPDGTIDFSSMQSREIWSGVTYAVAASMIHE 838
Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
D+ D+ F+TA GI+EAAWS GLGY FQTPEAW T D+YRSLCYMRPLAIWAMQWA +
Sbjct: 839 DMTDMAFRTAEGIHEAAWSEDGLGYNFQTPEAWTTTDRYRSLCYMRPLAIWAMQWAFSE- 897
Query: 898 KPKTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMF 953
K +E ++K E+ +++ R+HA FSKVAR LKLPE+ + S+LQ+++D+T +R F
Sbjct: 898 KVSVIE-ELK-ELDSDAISRHHARFSKVARYLKLPEDGTSSSVLQTVYDYTLKRFF 951
>gi|224146269|ref|XP_002325943.1| predicted protein [Populus trichocarpa]
gi|222862818|gb|EEF00325.1| predicted protein [Populus trichocarpa]
Length = 891
Score = 1304 bits (3374), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 618/899 (68%), Positives = 748/899 (83%), Gaps = 12/899 (1%)
Query: 60 LAPIGVRILCLIREEAAKGKRAFIDPFIKRH-LTSSHGVPLGGVGSGSIGRSYRGEFQRW 118
+ P+G+R+ I++EAAK +RA I F K+H LTS HG+PLGG+G+GSIGRSYRGEFQ +
Sbjct: 1 MLPMGLRLWRHIKQEAAK-ERATIFDFSKKHILTSDHGIPLGGIGAGSIGRSYRGEFQHF 59
Query: 119 QIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDK 178
++FP +CE+ PVLANQFS FVSR +G+ +SSVLC +TP++ K++T +GI SWDWNL G K
Sbjct: 60 RLFPGICEEGPVLANQFSAFVSRPSGETFSSVLCSRTPDIPKESTGSGIESWDWNLNGQK 119
Query: 179 STYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSAD 238
TYHAL+PRAWT+++G PDPEL IV RQISP IPHNYKESS+PVSVFT+T+ N GKTSAD
Sbjct: 120 CTYHALFPRAWTIYDGAPDPELTIVSRQISPFIPHNYKESSFPVSVFTFTLSNHGKTSAD 179
Query: 239 ITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGV 298
+TL+FTW NSVGG S +G H+NSK DGVHAV LHH+ +++ PPVTFA+AAQET V
Sbjct: 180 VTLMFTWANSVGGVSGLSGHHFNSKMMTKDGVHAVTLHHKMTNRQPPVTFAIAAQETADV 239
Query: 299 HVSLCPHFVISGNS--LGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASV 356
HVS CP F+ISG+S + +TAKDMW I+++G+FD++ +TS SEPGSSIGAAIAASV
Sbjct: 240 HVSECPCFLISGDSQDVSVTAKDMWDVIRKNGTFDQIGCNKTS-PSEPGSSIGAAIAASV 298
Query: 357 TVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWEL 416
TVP S VTFSL+WD PE+ F S ++Y+RRYTKFYGT +AAANIARDAILEH +WE
Sbjct: 299 TVPSGSIRTVTFSLSWDIPEIRF-SERSYHRRYTKFYGTLGDAAANIARDAILEHANWES 357
Query: 417 QIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWS 476
QIEAWQRPILEDKR+PEWYPITLFNELYYLNAGG +WTD SPP+ +L + R+FSL+ S
Sbjct: 358 QIEAWQRPILEDKRVPEWYPITLFNELYYLNAGGTIWTDESPPMQNLTAVRERRFSLERS 417
Query: 477 QSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLE 536
S K + +NDTA+ ILERM+S EQ++ PV+ NS FG NLLQ+GEENIGQ LYLE
Sbjct: 418 SSGYKNGNGIAYKNDTAIEILERMASTYEQLHNPVSSNSVFGANLLQNGEENIGQLLYLE 477
Query: 537 GIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKV 596
G EYLMWNTYDVHFY+SFAL+MLFPK++L++QRDFAAAV+MHDPS+M+++ DG+WV RKV
Sbjct: 478 GTEYLMWNTYDVHFYASFALVMLFPKLELNLQRDFAAAVLMHDPSRMQIMSDGKWVPRKV 537
Query: 597 LGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVY 656
LGAVPHD+G+ DPWFE+NAY L+ TARWKDLN KFVLQVYRDVVATGDK FA+AVWPSVY
Sbjct: 538 LGAVPHDVGLNDPWFEINAYNLFSTARWKDLNCKFVLQVYRDVVATGDKDFARAVWPSVY 597
Query: 657 VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR 716
VAMAYMDQFD+DGDGMIEN+G PDQTYDTWSV+G+SAY GGLWVAALQA SA+A EVGD
Sbjct: 598 VAMAYMDQFDKDGDGMIENEGIPDQTYDTWSVTGVSAYCGGLWVAALQATSAMAHEVGDD 657
Query: 717 GSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVD 776
GS +YF K+QKAK VY KLWNGSYFNYD+SG + SIQADQLAGQWYARACGL P+VD
Sbjct: 658 GSANYFWIKYQKAKTVYGKLWNGSYFNYDSSG---NKSIQADQLAGQWYARACGLSPVVD 714
Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
DKAR AL+K+Y +NVLKV GG RGAVNGMLPDGRVD+++MQ++EIW GVTYA++ASMI
Sbjct: 715 GDKARKALEKIYEFNVLKVKGGTRGAVNGMLPDGRVDLTAMQTKEIWPGVTYALSASMIQ 774
Query: 837 EDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
E L ++ FQTA GIY AAWS GLGY+FQ PE+W+ +DQYRSLCYMRPLAIWAMQWAL+
Sbjct: 775 EGLEEMAFQTAVGIYNAAWSEEGLGYSFQIPESWDMNDQYRSLCYMRPLAIWAMQWALS- 833
Query: 897 PKPKTLEKQMKPE-VTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMFI 954
KPK +++M E + DES L+ HAGFSKVA LL+LPEE+ KS Q++++ TC+RM
Sbjct: 834 -KPKDFKEEMHHEGIEDESYLKQHAGFSKVAHLLRLPEEEAPKSFFQAIYEFTCKRMLF 891
>gi|357157041|ref|XP_003577664.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
distachyon]
Length = 951
Score = 1291 bits (3341), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 618/950 (65%), Positives = 757/950 (79%), Gaps = 10/950 (1%)
Query: 9 NGLHEEEK------EPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAP 62
NG+ E K P ++ D VDP + LTW+ KLS L F L W+E V++A
Sbjct: 4 NGMPEPPKGVSRNSSPSQTNGDSVDPGQLPELTWEHKLSHVRYDLPSFGLKWREAVKMAG 63
Query: 63 IGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFP 122
+G R+ I EE +KG+ A IDP KR S GVPLGG+G+GSIGRSY+GEFQRWQ+FP
Sbjct: 64 LGFRLGQHIVEETSKGRTAIIDPMKKRTAKSGQGVPLGGIGAGSIGRSYKGEFQRWQLFP 123
Query: 123 RVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYH 182
CEDKPVLANQFS F+SR +G+KYS+VL P P++ K T +GIGSWDWNL G KSTYH
Sbjct: 124 GACEDKPVLANQFSAFISRQDGRKYSTVLHPGKPDLPKGTNISGIGSWDWNLSGQKSTYH 183
Query: 183 ALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLL 242
ALYPRAWTV++GEPDP+L+IVCRQ+SPIIPHNY++SSYP +VFT+T+ NSG T+ D+TLL
Sbjct: 184 ALYPRAWTVYDGEPDPDLKIVCRQVSPIIPHNYQQSSYPAAVFTFTVANSGNTAVDVTLL 243
Query: 243 FTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSL 302
FTW NSVGG SE TG H NS DGVH +LLHHRT+ PPVTFA+AAQE + VH+S
Sbjct: 244 FTWANSVGGKSELTGYHCNSSMLEKDGVHGILLHHRTADGQPPVTFAIAAQEKEDVHISE 303
Query: 303 CPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDS 362
CP+FV+SG+S TAKDMW+ +KEHGSFD L+ +E S+ S PG+SIGAAIAASV + P S
Sbjct: 304 CPYFVMSGSSDEFTAKDMWNSVKEHGSFDLLDPIEASICSRPGTSIGAAIAASVKLAPQS 363
Query: 363 EGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQ 422
V+FSLAW CPEV F SGKTY+RRYTKFYGT +AAA++A DAI++H SWE QIE WQ
Sbjct: 364 TKDVSFSLAWACPEVKFSSGKTYHRRYTKFYGTDVDAAASLAHDAIVDHSSWEKQIEEWQ 423
Query: 423 RPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKR 482
PIL+DKR P WYP+TLFNELYYLNAGG++WTDG PP+ SL IG +KFSLD S +
Sbjct: 424 NPILQDKRFPAWYPVTLFNELYYLNAGGSIWTDGLPPIQSLTAIGGKKFSLDMSNGETDD 483
Query: 483 IVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLM 542
+ ++ N+TA +IL++M+SILE+I+ +A NS+ GT LLQ GEENIGQFLYLEGIEY M
Sbjct: 484 VDEMIPHNNTATDILQQMASILERIHASLASNSAIGTTLLQ-GEENIGQFLYLEGIEYYM 542
Query: 543 WNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPH 602
WNTYDVHFY+SFALIMLFPK+QLSIQRDFAAAVMMHDP K+KLL DG+ RKVLGAVPH
Sbjct: 543 WNTYDVHFYASFALIMLFPKLQLSIQRDFAAAVMMHDPEKLKLLHDGKLAPRKVLGAVPH 602
Query: 603 DIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
D+G+ DPWF+VNAY L++T RWKDLNPKFVLQVYRDVVATG+K FA+AVWPSVY+AMAYM
Sbjct: 603 DLGLYDPWFKVNAYTLHNTDRWKDLNPKFVLQVYRDVVATGNKSFARAVWPSVYMAMAYM 662
Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
+QFD+D DGMIEN+ FPDQTYD WS++G+SAY GGLWVAALQAASALAREVGD+ SE+ F
Sbjct: 663 EQFDKDKDGMIENEDFPDQTYDVWSMAGVSAYCGGLWVAALQAASALAREVGDKASEELF 722
Query: 723 LFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARS 782
K++KAK VY KLWNGSYFNYD++G+ S+SI ADQLAGQWYA++CGL IVD+DK++S
Sbjct: 723 WNKYEKAKSVYGKLWNGSYFNYDDAGTKVSTSIHADQLAGQWYAKSCGLSSIVDKDKSQS 782
Query: 783 ALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADI 842
AL+K+Y++NV+K GKRGA+NGM PDG VDMS+MQSREIW GVTYA+AASMI E + +
Sbjct: 783 ALEKIYSFNVMKFKDGKRGAMNGMWPDGTVDMSTMQSREIWPGVTYALAASMIQEGMVEE 842
Query: 843 GFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTL 902
GF+TA GIY AAWS GLGYAFQTPEAW DD YRSLCYMRPLAIW++QWAL+ PK L
Sbjct: 843 GFKTAEGIYHAAWSPEGLGYAFQTPEAWTNDDGYRSLCYMRPLAIWSIQWALSSPK---L 899
Query: 903 EKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRM 952
K+ + ++ +S + ++++A+LL+LPE++ +KS LQ +++ R
Sbjct: 900 HKEPQRDLAQDSFPKNQFSYARIAKLLQLPEDESSKSFLQVIYEIVRNRF 949
>gi|255569126|ref|XP_002525532.1| conserved hypothetical protein [Ricinus communis]
gi|223535211|gb|EEF36890.1| conserved hypothetical protein [Ricinus communis]
Length = 948
Score = 1283 bits (3319), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 645/956 (67%), Positives = 767/956 (80%), Gaps = 14/956 (1%)
Query: 1 MFEGKILENGLHEEEKEPLNSSFDKVDPAKPASLTWQRKL-STGEIPLSQFTLNWKETVQ 59
M E +I E G E+ L+SS K++P P SL+W+R L S G +PL FTL+++E +
Sbjct: 1 MMENRIKEEG---EKDSYLDSSTQKINPGVPPSLSWERPLNSNGNVPLG-FTLSFREILH 56
Query: 60 LAPIGVRILCLIREEAAKGKRAFIDPFIKRH-LTSSHGVPLGGVGSGSIGRSYRGEFQRW 118
+ PIG+R+ +EE KG+ D F K+H +T H VPLGG+G+GSIGRSY+GEFQ +
Sbjct: 57 MLPIGLRLWRYSKEEPTKGRVPIFD-FSKKHVITGDHAVPLGGIGAGSIGRSYKGEFQCF 115
Query: 119 QIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDK 178
++ P CE+ P+LANQFSVFVSR NG+K+SSVLC + PE+ + +GI SWDWNL G+
Sbjct: 116 KLLPLACEEGPILANQFSVFVSRPNGKKFSSVLCSRRPELPTEIKGSGIESWDWNLNGEN 175
Query: 179 STYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSAD 238
TYHAL+PRAWT ++G+PDPEL+IV +QISP IPHNYKESS+PVSVFT+T+ N G+TSAD
Sbjct: 176 CTYHALFPRAWTTYKGQPDPELKIVSQQISPFIPHNYKESSFPVSVFTFTLSNFGRTSAD 235
Query: 239 ITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGV 298
+TLLFTW NS+GG S + H+NS DGVH V LHH+T PP+TFA+AAQET V
Sbjct: 236 VTLLFTWANSIGGVSGSSRHHFNSSIMKKDGVHTVTLHHKTGDGQPPLTFAIAAQETPDV 295
Query: 299 HVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTV 358
HVS CP F+ISGNS G+TAKDMW EIK+HG+FD L+ + S SE GS IGAAIAA++T+
Sbjct: 296 HVSECPCFLISGNSQGVTAKDMWDEIKKHGTFDHLSYNKIS-PSEGGSCIGAAIAATLTI 354
Query: 359 PPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQI 418
PPD+ TFSLAWDCPEV F SG+TY+RRYTKFYGT +AAA+IA DAILEHG WE QI
Sbjct: 355 PPDTIRTATFSLAWDCPEVRF-SGRTYHRRYTKFYGTLGDAAADIAHDAILEHGHWESQI 413
Query: 419 EAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQS 478
EAWQRPILEDKRLP+WYPITLFNELYYLNAGG VWTDGSPP+ S I K +LD S+S
Sbjct: 414 EAWQRPILEDKRLPKWYPITLFNELYYLNAGGTVWTDGSPPMQSFAAIKGGKLTLDKSRS 473
Query: 479 DLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGI 538
+ + V ++NDTAV IL RM+SI E+++ PV N++FGT LLQ+ EENIGQFLYLEG
Sbjct: 474 EFENARPVAHRNDTAVEILNRMASIYEKMHNPVTSNAAFGTYLLQNDEENIGQFLYLEGS 533
Query: 539 EYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLG 598
EYLMWNTYDVHFYSSFAL+MLFPK++LSIQRDFAAAVMMHDPS+M+++ DG+ V RKVLG
Sbjct: 534 EYLMWNTYDVHFYSSFALLMLFPKLELSIQRDFAAAVMMHDPSRMQIMSDGRRVPRKVLG 593
Query: 599 AVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVA 658
AVPHDIG+ DPWFEVNAY L TARWKDLNPKFVLQ+YRDVVATGDK FA AVWPSVYVA
Sbjct: 594 AVPHDIGLNDPWFEVNAYNLISTARWKDLNPKFVLQIYRDVVATGDKSFALAVWPSVYVA 653
Query: 659 MAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGS 718
MAYMDQFD+DGDGMIEN+GFPDQTYDTWSV+G+SAY GGLWVAALQAASALA EVGD S
Sbjct: 654 MAYMDQFDKDGDGMIENEGFPDQTYDTWSVTGVSAYCGGLWVAALQAASALACEVGDSES 713
Query: 719 EDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDED 778
+F K+QKAK VY LWNGSYFNYD+SG +SSI ADQLAGQWYARACGL IVDE+
Sbjct: 714 ASFFWVKYQKAKAVYSTLWNGSYFNYDSSG---NSSIHADQLAGQWYARACGLSSIVDEE 770
Query: 779 KARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHED 838
K RSAL+K+Y +NVLKV GKRGAVNGMLPDG+VDMS MQSREIW GVTYA++ASMI E
Sbjct: 771 KVRSALEKIYKFNVLKVKEGKRGAVNGMLPDGKVDMSVMQSREIWPGVTYALSASMIQEG 830
Query: 839 LADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
+A++ FQTA GIYEAAWS GLGY+FQ PE WNTDDQYRSLCYMRPLAIWAMQWAL++PK
Sbjct: 831 MAEMAFQTASGIYEAAWSQEGLGYSFQIPEGWNTDDQYRSLCYMRPLAIWAMQWALSKPK 890
Query: 899 PKTLEKQMKPEVTDESLL-RYHAGFSKVARLLKLPEEQGA-KSLLQSLFDHTCRRM 952
E ++ D+ L +AGFSKVA LLKLP+++G+ KS LQS ++ TCRR+
Sbjct: 891 VFKEEMKLLSLAADDRLHPSQNAGFSKVAHLLKLPDDEGSNKSFLQSAYEFTCRRL 946
>gi|242070593|ref|XP_002450573.1| hypothetical protein SORBIDRAFT_05g007160 [Sorghum bicolor]
gi|241936416|gb|EES09561.1| hypothetical protein SORBIDRAFT_05g007160 [Sorghum bicolor]
Length = 951
Score = 1269 bits (3284), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 607/930 (65%), Positives = 743/930 (79%), Gaps = 10/930 (1%)
Query: 19 LNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKG 78
+N +KV + LTW+ KL+ G L F L W+ET+QLA IG+R+ I EE +KG
Sbjct: 22 MNGDEEKVYSGQLPVLTWEHKLTRGRHDLPPFRLTWRETMQLAGIGLRLSRHILEETSKG 81
Query: 79 KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
+ A IDP KR S GVPLGG SIGRSY+G+FQRWQ+FP CEDKPVLANQFS F
Sbjct: 82 RIAVIDPMKKRSARSGQGVPLGG----SIGRSYKGDFQRWQLFPGTCEDKPVLANQFSAF 137
Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
+SR +G+KYS+VL P P++ K + +GIGSWDWN+ G++STYHALYPRAWTV++GEPDP
Sbjct: 138 ISRQDGRKYSTVLHPGKPDLPKGSDISGIGSWDWNMSGEQSTYHALYPRAWTVYDGEPDP 197
Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
EL IVCRQISPIIPHNY++SSYPV+VFT+T+ NSG T+AD+TLLFTW NSVGG SE TG
Sbjct: 198 ELNIVCRQISPIIPHNYQQSSYPVAVFTFTVTNSGHTAADVTLLFTWANSVGGKSELTGY 257
Query: 259 HYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG--LT 316
H NS DGVH +LLHHRT+ PPVTF +AAQE + + +S CP+FVISG+S T
Sbjct: 258 HSNSSMIEKDGVHGILLHHRTADGQPPVTFVIAAQEKEDILISQCPYFVISGSSASDEFT 317
Query: 317 AKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPE 376
AKDMW+ +KEHGSFD L+ ++TS+ S PGSSIGAAIAASV + P + ++FSLAW CPE
Sbjct: 318 AKDMWNSVKEHGSFDHLDPIKTSMCSRPGSSIGAAIAASVKLAPKATKDISFSLAWACPE 377
Query: 377 VNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYP 436
V F SGKTY+RRYTKFYGT +AAA +A DAILEH SWE QI+ WQ PIL+D+R P WYP
Sbjct: 378 VKFSSGKTYHRRYTKFYGTDGDAAATLAHDAILEHASWERQIDEWQDPILQDERFPAWYP 437
Query: 437 ITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNI 496
+TLFNELYYLNAGG +WTDG PP+ SL IG +KFSLD + ++ QN+TA +I
Sbjct: 438 VTLFNELYYLNAGGTIWTDGLPPIQSLTAIGWKKFSLDMLNGESDDFNEMVRQNNTASDI 497
Query: 497 LERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFAL 556
L RM+S+ E+++ +A NS+ GT LLQ GEENIGQFLYLEGIEY MWNTYDVHFY+SF+L
Sbjct: 498 LHRMASVFERMHASIASNSAIGTTLLQ-GEENIGQFLYLEGIEYYMWNTYDVHFYASFSL 556
Query: 557 IMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAY 616
IMLFPK+QLS+QRDFAAAVMMHDP K+++L DG+W +RKVLGAVPHD+G+ DPWF+VNAY
Sbjct: 557 IMLFPKLQLSVQRDFAAAVMMHDPEKLRILHDGKWAARKVLGAVPHDLGLYDPWFKVNAY 616
Query: 617 CLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIEND 676
LY+T RWKDLNPKFVLQVYRDVVATGDK FA+AVWPSVY+AMAYM+QFD+D DGMIEN+
Sbjct: 617 TLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYIAMAYMEQFDKDKDGMIENE 676
Query: 677 GFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKL 736
FPDQTYD WS++GISAY GGLWVAALQAASALAREVGD+ SE F K++KAK VY KL
Sbjct: 677 DFPDQTYDVWSMAGISAYCGGLWVAALQAASALAREVGDKASEKLFWDKYEKAKSVYSKL 736
Query: 737 WNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVM 796
WNGSYF+YD+ + S+SIQADQLAGQWYA+ACGL PIVD+DKA+SAL+K++++NV+K
Sbjct: 737 WNGSYFSYDDGDNKVSTSIQADQLAGQWYAKACGLFPIVDKDKAQSALEKIFSFNVMKFK 796
Query: 797 GGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWS 856
G RGA+NGM PDG VDMS+MQSREIW GVTYA+AASMI E + + GF+TA GIY AAWS
Sbjct: 797 DGTRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAASMIQEGMVEQGFKTAEGIYHAAWS 856
Query: 857 GTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDESLL 916
GLGY+FQTPEAWN DD+YRSLCYMRPLAIWAMQWAL+ PK L + ++ +S
Sbjct: 857 PEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAMQWALSNPK---LHNTPQADIPQDSFP 913
Query: 917 RYHAGFSKVARLLKLPEEQGAKSLLQSLFD 946
+ ++++ +LL+LPEE+ +KS L+ +++
Sbjct: 914 KNQFAYARITKLLQLPEEETSKSFLRVVYE 943
>gi|115484889|ref|NP_001067588.1| Os11g0242100 [Oryza sativa Japonica Group]
gi|77549531|gb|ABA92328.1| expressed protein [Oryza sativa Japonica Group]
gi|113644810|dbj|BAF27951.1| Os11g0242100 [Oryza sativa Japonica Group]
gi|215704397|dbj|BAG93831.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222615769|gb|EEE51901.1| hypothetical protein OsJ_33494 [Oryza sativa Japonica Group]
Length = 950
Score = 1259 bits (3259), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 616/952 (64%), Positives = 754/952 (79%), Gaps = 11/952 (1%)
Query: 6 ILENGLHEEEK-----EPLNSSFDK-VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQ 59
++ENG+ E+ K P S D VDP LTW+ KLS L F L W+ET Q
Sbjct: 1 MVENGVLEQPKGVSRNRPRAQSNDHPVDPGYLPELTWEHKLSNIGYDLPSFRLTWRETFQ 60
Query: 60 LAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQ 119
LA +G+R+ I EE +KG+ A IDP KR S GVPLGG+GSGSIGRSY+GEFQRWQ
Sbjct: 61 LAGLGLRLGRHILEETSKGRAAVIDPMKKRIAKSGQGVPLGGIGSGSIGRSYKGEFQRWQ 120
Query: 120 IFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKS 179
+FP CE++PVLANQFS F+SR +G+ YSSVL P P++ K + +GIGSWDWN+ G S
Sbjct: 121 LFPGTCEERPVLANQFSAFISRKDGRNYSSVLHPGKPDLPKGSNISGIGSWDWNMSGQNS 180
Query: 180 TYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADI 239
TYHALYPR+WT++ GEPDP++ IVCRQISPIIPHNY++SSYPVSVFT+T+ NSG T+AD+
Sbjct: 181 TYHALYPRSWTIYNGEPDPDVNIVCRQISPIIPHNYQQSSYPVSVFTFTVTNSGNTAADV 240
Query: 240 TLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVH 299
TLLFTW NSVGG SE TG H NS DGVH +LLHHRT++ PPVTFA+AAQE + +H
Sbjct: 241 TLLFTWANSVGGKSELTGYHSNSPMIEKDGVHGILLHHRTANGQPPVTFAIAAQEKEDIH 300
Query: 300 VSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVP 359
+S CP+F+ISG+S +AKDMW+ +KE+GSFD L+ +TS+ S+PG SIGAAIAASV +P
Sbjct: 301 ISECPYFIISGSSDAFSAKDMWNYVKENGSFDNLDLTKTSMCSKPGLSIGAAIAASVKLP 360
Query: 360 PDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIE 419
P + V+F+LAW CPEV F SGKTY+RRYTKF+GT +AAA++A DAILEH SWE QIE
Sbjct: 361 PQTTQNVSFALAWACPEVKFSSGKTYHRRYTKFHGTDNDAAASLAHDAILEHNSWERQIE 420
Query: 420 AWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSD 479
WQ PIL+D+R P+WYP+TLFNELYYLNAGG +WTDG PP+ SL IG +KFSLD D
Sbjct: 421 EWQNPILQDERFPDWYPVTLFNELYYLNAGGTIWTDGLPPIQSLTGIGEKKFSLDMQNGD 480
Query: 480 LKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIE 539
+ +N+TA +IL +M+S+LE+I+ + NS+ GT LLQ GEENIGQFLYLEGIE
Sbjct: 481 ADDANGIIPRNNTASDILNQMASVLERIHASMESNSAIGTTLLQ-GEENIGQFLYLEGIE 539
Query: 540 YLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGA 599
Y MWNTYDVHFY+SF+LIMLFPK+QLSIQRDFAAAV+MHDP K+++L DG+WV+RKVLGA
Sbjct: 540 YYMWNTYDVHFYASFSLIMLFPKLQLSIQRDFAAAVLMHDPEKLRMLHDGKWVARKVLGA 599
Query: 600 VPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAM 659
VPHD+G+ DPWF+VNAY LY+T RWKDLNPKFVLQVYRDVVATGDK FA+AVWPSVY+AM
Sbjct: 600 VPHDLGLYDPWFKVNAYTLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYMAM 659
Query: 660 AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSE 719
AYM+QFDRD DGMIEN+ FPDQTYD WS++GISAY GGLWVAALQAASALA EVGD+ SE
Sbjct: 660 AYMEQFDRDKDGMIENEDFPDQTYDVWSMAGISAYCGGLWVAALQAASALAHEVGDKASE 719
Query: 720 DYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDK 779
F K++KAK VY KLWNGSYFNYD+ + S+SI ADQLAGQWYA+ACGL PIVD+DK
Sbjct: 720 KLFWDKYEKAKSVYGKLWNGSYFNYDDGDNIMSASIHADQLAGQWYAKACGLFPIVDKDK 779
Query: 780 ARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDL 839
A SAL+K+Y++NV+K GKRGA+NGM PDG VDMS+MQSREIW GVTYA+AA+MI E +
Sbjct: 780 AESALEKIYSFNVMKFKDGKRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAATMIQEGM 839
Query: 840 ADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
+ GF+TA GIY AAWS GLGY+FQTPEAWN DD+YRSLCYMRPLAIWA+QWAL+ PK
Sbjct: 840 VEKGFKTAEGIYHAAWSPEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAIQWALSNPK- 898
Query: 900 KTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRR 951
L KQ ++ +S + ++++A+LL LPE++ KS L+ +++ R
Sbjct: 899 --LHKQTA-DIPQDSFPKNQFSYARIAKLLHLPEDESPKSFLRVIYEIVRNR 947
>gi|218185509|gb|EEC67936.1| hypothetical protein OsI_35656 [Oryza sativa Indica Group]
Length = 950
Score = 1259 bits (3259), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 616/952 (64%), Positives = 754/952 (79%), Gaps = 11/952 (1%)
Query: 6 ILENGLHEEEK-----EPLNSSFDK-VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQ 59
++ENG+ E+ K P S D VDP LTW+ KLS L F L W+ET Q
Sbjct: 1 MVENGVLEQPKGVSRNRPRAQSNDHPVDPGYLPELTWEHKLSNIGYDLPSFRLTWRETFQ 60
Query: 60 LAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQ 119
LA +G+R+ I EE +KG+ A IDP KR S GVPLGG+GSGSIGRSY+GEFQRWQ
Sbjct: 61 LAGLGLRLGRHILEETSKGRAAVIDPMKKRIAKSGQGVPLGGIGSGSIGRSYKGEFQRWQ 120
Query: 120 IFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKS 179
+FP CE++PVLANQFS F+SR +G+ YSSVL P P++ K + +GIGSWDWN+ G S
Sbjct: 121 LFPGTCEERPVLANQFSAFISRKDGRNYSSVLHPGKPDLPKGSNISGIGSWDWNMSGQNS 180
Query: 180 TYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADI 239
TYHALYPR+WT++ GEPDP++ IVCRQISPIIPHNY++SSYPVSVFT+T+ NSG T+AD+
Sbjct: 181 TYHALYPRSWTIYNGEPDPDVNIVCRQISPIIPHNYQQSSYPVSVFTFTVTNSGNTAADV 240
Query: 240 TLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVH 299
TLLFTW NSVGG SE TG H NS DGVH +LLHHRT++ PPVTFA+AAQE + VH
Sbjct: 241 TLLFTWANSVGGKSELTGYHSNSPMIEKDGVHGILLHHRTANGQPPVTFAIAAQEKEDVH 300
Query: 300 VSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVP 359
+S CP+F+ISG+S +AKDMW+ +KE+GSFD L+ +TS+ S+PG SIGAAIAASV +P
Sbjct: 301 ISECPYFIISGSSDAFSAKDMWNYVKENGSFDNLDLTKTSMCSKPGLSIGAAIAASVKLP 360
Query: 360 PDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIE 419
P + V+F+LAW CPEV F SGKTY+RRYTKF+GT +AAA++A DAILEH SWE QIE
Sbjct: 361 PQTTQNVSFALAWACPEVKFSSGKTYHRRYTKFHGTDNDAAASLAHDAILEHNSWERQIE 420
Query: 420 AWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSD 479
WQ PIL+D+R P+WYP+TLFNELYYLNAGG +WTDG PP+ SL IG +KFSLD D
Sbjct: 421 EWQNPILQDERFPDWYPVTLFNELYYLNAGGTIWTDGLPPIQSLTGIGEKKFSLDMQNGD 480
Query: 480 LKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIE 539
+ +N+TA +IL +M+S+LE+I+ + NS+ GT LLQ GEENIGQFLYLEGIE
Sbjct: 481 ADDANGIIPRNNTASDILNQMASVLERIHASMESNSAIGTTLLQ-GEENIGQFLYLEGIE 539
Query: 540 YLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGA 599
Y MWNTYDVHFY+SF+LIMLFPK+QLSIQRDFAAAV+MHDP K+++L DG+WV+RKVLGA
Sbjct: 540 YYMWNTYDVHFYASFSLIMLFPKLQLSIQRDFAAAVLMHDPEKLRMLHDGKWVARKVLGA 599
Query: 600 VPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAM 659
+PHD+G+ DPWF+VNAY LY+T RWKDLNPKFVLQVYRDVVATGDK FA+AVWPSVY+AM
Sbjct: 600 IPHDLGLYDPWFKVNAYTLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYMAM 659
Query: 660 AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSE 719
AYM+QFDRD DGMIEN+ FPDQTYD WS++GISAY GGLWVAALQAASALA EVGD+ SE
Sbjct: 660 AYMEQFDRDKDGMIENEDFPDQTYDVWSMAGISAYCGGLWVAALQAASALAHEVGDKASE 719
Query: 720 DYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDK 779
F K++KAK VY KLWNGSYFNYD+ + S+SI ADQLAGQWYA+ACGL PIVD+DK
Sbjct: 720 KLFWDKYEKAKSVYGKLWNGSYFNYDDGDNIMSASIHADQLAGQWYAKACGLFPIVDKDK 779
Query: 780 ARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDL 839
A SAL+K+Y++NV+K GKRGA+NGM PDG VDMS+MQSREIW GVTYA+AA+MI E +
Sbjct: 780 AESALEKIYSFNVMKFKDGKRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAATMIQEGM 839
Query: 840 ADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
+ GF+TA GIY AAWS GLGY+FQTPEAWN DD+YRSLCYMRPLAIWA+QWAL+ PK
Sbjct: 840 VEKGFKTAEGIYHAAWSPEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAIQWALSNPK- 898
Query: 900 KTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRR 951
L KQ ++ +S + ++++A+LL LPE++ KS L+ +++ R
Sbjct: 899 --LHKQTA-DIPQDSFPKNQFSYARIAKLLHLPEDESPKSFLRVIYEIVRNR 947
>gi|8777421|dbj|BAA97011.1| unnamed protein product [Arabidopsis thaliana]
Length = 928
Score = 1246 bits (3224), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 629/971 (64%), Positives = 745/971 (76%), Gaps = 62/971 (6%)
Query: 1 MFEGKILENGLHEEEKEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQL 60
MFE KI++ G E+ +PLNSS KVDPA PASLTWQRK+ + +F L+ KE QL
Sbjct: 1 MFEEKIMDIG---EDVKPLNSSDTKVDPAVPASLTWQRKIDSDVKAPREFNLSVKEIFQL 57
Query: 61 APIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQI 120
AP+G+R+ L REEAAKG+ AFIDPF K +TSSHGVPLGG+G+GSIGRS++GEFQRWQ+
Sbjct: 58 APVGIRLWFLCREEAAKGRLAFIDPFSKHSVTSSHGVPLGGIGAGSIGRSFKGEFQRWQL 117
Query: 121 FPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKST 180
FP CED+PVLANQFS FVSR+NG+KYSSVLCP+ P++ K + +GIGSWDWNLKGDKST
Sbjct: 118 FPPKCEDEPVLANQFSAFVSRANGKKYSSVLCPRNPKLDKQDSESGIGSWDWNLKGDKST 177
Query: 181 YHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADIT 240
YHALYPR+WT++EG + F Y+
Sbjct: 178 YHALYPRSWTMYEGN--------------------------IYAFEYS------------ 199
Query: 241 LLFTWTNSVGGDSEFTGQH---YNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDG 297
FT N + F + Y + TK + +T++ LP +++A++AQ TDG
Sbjct: 200 --FTLVNLIQSLGLFVVKFHPLYLTTTK-KAAFQCLFSLSQTANGLPSLSYAISAQATDG 256
Query: 298 VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVT 357
V VS CP F++SG G+TAKDMW +KE+GSFD L + E S+ S+ GSSIGAA+AASVT
Sbjct: 257 VSVSACPFFIVSGKQDGITAKDMWQAVKENGSFDHLKASEASMQSDHGSSIGAAVAASVT 316
Query: 358 VPPDSEGQVTFSLAWDCPEVNFMSGKTY---------YRRYTKFYGTHQNAAANIARDAI 408
V P VTFSLAWDCPEV F SGK Y +RRYTKFYG + +AAA IA DAI
Sbjct: 317 VLPGESRIVTFSLAWDCPEVQFPSGKIYSRQPIKQNLFRRYTKFYGNNGDAAAQIAHDAI 376
Query: 409 LEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGH 468
L H WE IE WQRPILEDKRLP WYP+TLFNELYYLN+GG +WTDGS PVHSL +
Sbjct: 377 LGHSQWESWIEDWQRPILEDKRLPAWYPVTLFNELYYLNSGGTLWTDGSSPVHSLAGVRE 436
Query: 469 RKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEEN 528
+KFSLD SQ LK +DVP+QNDTAV++LE+M+S LE+++ NS+FGT LL++GEEN
Sbjct: 437 KKFSLDKSQLGLKNDIDVPHQNDTAVSVLEKMASTLEELHASTTSNSAFGTKLLEEGEEN 496
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
IG FLYLEGIEY MWNTYDVHFY+SFAL+MLFPK++LSIQRDFAAAVM+HDP+K+K L +
Sbjct: 497 IGHFLYLEGIEYRMWNTYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSE 556
Query: 589 GQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFA 648
GQWV RKVLGAVPHD+GI DPWFEVN Y L++T RWKDLNPKFVLQVYRDVVATGDKKFA
Sbjct: 557 GQWVQRKVLGAVPHDLGINDPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFA 616
Query: 649 KAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASA 708
AVWPSVYVAMAYM QFD+DGDGMIEN+GFPDQTYDTWS SG+SAY GGLWVAALQAASA
Sbjct: 617 SAVWPSVYVAMAYMAQFDKDGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASA 676
Query: 709 LAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYAR 767
LAR VGD+ S+DYF KFQKAKVVYE KLWNGSYFNYDNSGS SS+IQADQLAGQWYAR
Sbjct: 677 LARVVGDKNSQDYFWSKFQKAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYAR 736
Query: 768 ACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVT 827
A GLLPIVDEDKAR+AL+KVYNYNV+K+ GKRGAVNGM P+G+VD +SMQSREIWSGVT
Sbjct: 737 ASGLLPIVDEDKARTALEKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVT 796
Query: 828 YAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAI 887
YA++A+MI E L ++ FQTA GIYEAAWS TGLGY+FQTPE+WNT D+YRSL YMRPLAI
Sbjct: 797 YALSATMIQEGLVEMAFQTASGIYEAAWSETGLGYSFQTPESWNTVDEYRSLTYMRPLAI 856
Query: 888 WAMQWALTRPKPKTLE-----KQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQ 942
WAMQWALT+ K + +Q +PE+ S +++ GFS+V+RLL LP E AKS LQ
Sbjct: 857 WAMQWALTKTSQKQEQLGLEPEQQEPELEPSSSMKHDIGFSRVSRLLSLPNEASAKSTLQ 916
Query: 943 SLFDHTCRRMF 953
+LFD+TCRRM
Sbjct: 917 TLFDYTCRRMM 927
>gi|413920603|gb|AFW60535.1| hypothetical protein ZEAMMB73_659971 [Zea mays]
Length = 951
Score = 1244 bits (3220), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 603/930 (64%), Positives = 740/930 (79%), Gaps = 10/930 (1%)
Query: 19 LNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKG 78
+N +KV P + LTW+ KL+ L F L W+E +QLA IG+R+ I EE +KG
Sbjct: 22 MNGDEEKVYPGQLPVLTWEHKLTHVRHDLPPFRLTWREIMQLAGIGLRLSRHILEETSKG 81
Query: 79 KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
+ A IDP KR S GVPLGG SIGRSY+G+FQRWQ+FP CED VLANQFS F
Sbjct: 82 RIAVIDPMKKRAARSGQGVPLGG----SIGRSYKGDFQRWQLFPGSCEDNAVLANQFSAF 137
Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
+SR +G+KYS+VL P P++ K + +GIGSWDWN+ G +STYHALYPRAWTV++GEPDP
Sbjct: 138 ISRQDGKKYSTVLHPGKPDLPKGSDISGIGSWDWNMSGQQSTYHALYPRAWTVYDGEPDP 197
Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
EL IVCRQISPIIPHNY++SSYPV+VFT+T+ NSG T+AD+TLLFTW NSVGG SE TG
Sbjct: 198 ELNIVCRQISPIIPHNYQQSSYPVAVFTFTVTNSGHTAADVTLLFTWANSVGGKSELTGY 257
Query: 259 HYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG--LT 316
H NS DGVH +LLHHRT+ P VTF +AAQE + + +S CP+FVISG+S T
Sbjct: 258 HSNSSMIEKDGVHGILLHHRTADGQPHVTFVIAAQEKEDILISECPYFVISGSSASDEFT 317
Query: 317 AKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPE 376
AKDMW+ +KEHGSFD L+ ++TS+ S PGSSIGAAIAASV + P + ++FSLAW CPE
Sbjct: 318 AKDMWNSVKEHGSFDHLDPIKTSMCSRPGSSIGAAIAASVKLAPKATQDISFSLAWACPE 377
Query: 377 VNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYP 436
V F SGKTY+RRYTKFYGT +AAA +A DAILEH SWE QIE WQ PIL+D+R P WYP
Sbjct: 378 VKFSSGKTYHRRYTKFYGTDGDAAAALAHDAILEHASWERQIEEWQDPILQDERFPAWYP 437
Query: 437 ITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNI 496
+TLFNELYYLNAGG +WTDG PP+ SL +G +KFSLD + ++ QN+TA +I
Sbjct: 438 VTLFNELYYLNAGGTIWTDGLPPIQSLTAVGWKKFSLDMLNGESDDFNEMVRQNNTASDI 497
Query: 497 LERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFAL 556
L RM+S+ E+++ +A NS+ GT LLQ GEENIGQFLYLEGIEY MWNTYDVHFY+SF+L
Sbjct: 498 LHRMASVFERMHASIASNSAIGTTLLQ-GEENIGQFLYLEGIEYYMWNTYDVHFYASFSL 556
Query: 557 IMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAY 616
IMLFPK+QLS+QRDFAAAVMMHDP K+++L DG+W +RKVLGAVPHD+G+ DPWF+VNAY
Sbjct: 557 IMLFPKLQLSVQRDFAAAVMMHDPEKLRILHDGKWAARKVLGAVPHDLGLYDPWFKVNAY 616
Query: 617 CLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIEND 676
LY+T RWKDLNPKFVLQVYRDVVATGDK FA+AVWPSVY+AMAYM+QFD+D DGMIEN+
Sbjct: 617 TLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYIAMAYMEQFDKDKDGMIENE 676
Query: 677 GFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKL 736
FPDQTYD WS++GISAY GGLWVAALQAASALAREVGD+ SE F K++KAK VY KL
Sbjct: 677 DFPDQTYDVWSMAGISAYCGGLWVAALQAASALAREVGDKASEKLFWDKYEKAKFVYSKL 736
Query: 737 WNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVM 796
WNGSYF+YD+ + S+SIQADQLAGQWYA+ACGL PIVD+DKA+SAL+KVY++NV+K
Sbjct: 737 WNGSYFSYDDGDNKMSTSIQADQLAGQWYAKACGLFPIVDKDKAQSALEKVYSFNVMKFK 796
Query: 797 GGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWS 856
GG RGA+NGM PDG VDMS+MQSREIW GVTYA+AASMI E + + GF+TA G+Y AAWS
Sbjct: 797 GGTRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAASMIQEGMVEQGFKTAEGVYHAAWS 856
Query: 857 GTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDESLL 916
GLGY+FQTPEAWN DD+YRSLCYMRPLAIWAMQWAL++PK L + ++ +S
Sbjct: 857 PGGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAMQWALSKPK---LHNTPQADIPQDSFP 913
Query: 917 RYHAGFSKVARLLKLPEEQGAKSLLQSLFD 946
+ ++++ +LL+LPE++ +KS ++ +++
Sbjct: 914 KNQFAYARITKLLQLPEDESSKSFVRVVYE 943
>gi|449447581|ref|XP_004141546.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
Length = 944
Score = 1217 bits (3149), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 599/945 (63%), Positives = 729/945 (77%), Gaps = 12/945 (1%)
Query: 15 EKEPLNSSFDKVDPAKPASLTWQRKLS-TGEIPLSQFTLNWKETVQLAPIGVRILCLIRE 73
EK +S +VDP+KP SLTW+RKL TG+ P S F+ + + G R+ +E
Sbjct: 2 EKGGNGASSTEVDPSKPPSLTWKRKLDFTGKSPES-FSFTLTDAWHMGMTGYRLWRNGKE 60
Query: 74 EAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLAN 133
E AKG+ + F +T HGV LGG+G+GSIGRSYRGEFQR+Q+F CED+PVLAN
Sbjct: 61 EIAKGRIPIYEFFSDVPITCYHGVSLGGIGAGSIGRSYRGEFQRFQMFYGPCEDEPVLAN 120
Query: 134 QFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHE 193
QFSVFVSR NG K+SSVLC P+ KD GI SWDWNL G+ STYHAL+PR+WTV++
Sbjct: 121 QFSVFVSRPNGNKFSSVLCSAKPQKSKDGKQTGIESWDWNLSGENSTYHALFPRSWTVYD 180
Query: 194 GEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS 253
GEPDP+L+IVCRQ+SPIIPHNYKESS+PVSVFT+ + N G+TSA +TLLFTW NSVGG S
Sbjct: 181 GEPDPDLKIVCRQMSPIIPHNYKESSFPVSVFTFKLSNEGQTSAQVTLLFTWANSVGGKS 240
Query: 254 EFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSL 313
FTG H+NSK DG VLLHH++++ P VT+ +AA+ TD VHVSLCP FVISG+S
Sbjct: 241 GFTGHHFNSKMGAEDGAQGVLLHHKSANGRPTVTYGIAAEATDDVHVSLCPCFVISGDSE 300
Query: 314 GLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWD 373
G++AKDMW EIK HGSFD L S+ + S+PG SIGAA+AA++T+PP S VTFSLAWD
Sbjct: 301 GISAKDMWQEIKNHGSFDNLGSVGANEGSKPGCSIGAAVAATLTIPPTSARTVTFSLAWD 360
Query: 374 CPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
CPEV F GKTY+R+Y+KFYGT +AA IARDAI +HG WE +IEAWQRPI+EDKRLP+
Sbjct: 361 CPEVKF-DGKTYHRQYSKFYGTLGDAAEIIARDAISKHGKWEAEIEAWQRPIIEDKRLPD 419
Query: 434 WYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPN---QN 490
WYP+TLFNELY+LN+GG +WTDG PP+ +L TI H+K+ L+ S+S+L PN +
Sbjct: 420 WYPVTLFNELYFLNSGGTIWTDGLPPLQNLSTISHKKYFLERSKSELNG--GAPNGDHRK 477
Query: 491 DTAVNILERMSSILEQIYTPVA-LNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVH 549
D AV+ILERMS IL+Q + N++ GT LL GEEN+G L +EG +YLMWNTYDVH
Sbjct: 478 DVAVDILERMSQILDQTHGGAGPSNAALGTRLLHPGEENVGNLLLVEGSQYLMWNTYDVH 537
Query: 550 FYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDP 609
FYSSFALIMLFPK++LSIQRDFAAAV+MHDP K K + DG WV RKVLGAVPHDIG DP
Sbjct: 538 FYSSFALIMLFPKLELSIQRDFAAAVLMHDPRKAKTMCDGNWVPRKVLGAVPHDIGFNDP 597
Query: 610 WFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDG 669
W EVNAY L + +RWKDL KFVLQVYRDVVATGDK FAK+VWPSVYVA+A+M+QFD+D
Sbjct: 598 WLEVNAYNLLNVSRWKDLGSKFVLQVYRDVVATGDKNFAKSVWPSVYVALAFMEQFDKDK 657
Query: 670 DGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA 729
DGMIEN+GFPDQTYDTW+V G+SAY GGLWVAALQAASALA EV D + YF K+QKA
Sbjct: 658 DGMIENEGFPDQTYDTWTVKGVSAYCGGLWVAALQAASALASEVDDEAAAHYFWIKYQKA 717
Query: 730 KVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
+ VYE LWNGSYFNYDNS SSSIQADQLAGQWYARACGL PI DE+K R AL+K+YN
Sbjct: 718 RSVYETLWNGSYFNYDNSKGPWSSSIQADQLAGQWYARACGLCPIADEEKIRVALEKIYN 777
Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACG 849
+NV+KV GG RGAVNGM PDG VD S +Q +EIW+GVTY+VAA+MI E + + GFQTA G
Sbjct: 778 FNVMKVKGGTRGAVNGMFPDGSVDKSILQPKEIWAGVTYSVAATMIQEGMVETGFQTAMG 837
Query: 850 IYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWA-LTRPKPKTLEKQMKP 908
IY+AAW+ GLGY+FQTPEAW+ DD++RS+ YMRPLAIWAMQWA ++ KP + +
Sbjct: 838 IYQAAWAQDGLGYSFQTPEAWDVDDRFRSIGYMRPLAIWAMQWAMMSDSKPTKVPTKAFS 897
Query: 909 EVTDESLLRYHAGFSKVARLLKLP--EEQGAKSLLQSLFDHTCRR 951
E+ + + HA F KVA LLKLP ++ +SL+++ +D C+R
Sbjct: 898 EMQESAFATQHAAFLKVASLLKLPSNDDTARRSLVEAAYDFICKR 942
>gi|297851790|ref|XP_002893776.1| hypothetical protein ARALYDRAFT_473529 [Arabidopsis lyrata subsp.
lyrata]
gi|297339618|gb|EFH70035.1| hypothetical protein ARALYDRAFT_473529 [Arabidopsis lyrata subsp.
lyrata]
Length = 947
Score = 1186 bits (3067), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 577/940 (61%), Positives = 716/940 (76%), Gaps = 24/940 (2%)
Query: 24 DKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFI 83
DK+ P +W+RKL++ S+F L+ ++ + L P+G R+ ++EAAKG+ +
Sbjct: 15 DKLPP-----FSWERKLNSQAKTPSEFKLSKRDHLHLFPLGYRLWRHTKDEAAKGRASIF 69
Query: 84 DPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSN 143
D F K H+T HGVPLGG+G+GSIGRSY+GEFQ++++FP++CE+ P+L NQFS FVSR
Sbjct: 70 DIFRKHHITGDHGVPLGGIGAGSIGRSYKGEFQQFKLFPKICEEAPILTNQFSAFVSRPG 129
Query: 144 GQKYSSVLCPKTPEVLKDTTA---------AGIGSWDWNLKGDKSTYHALYPRAWTVHEG 194
G K+S+VLCP P+V+KD GI SWDWN+ G+KSTYHALYPR+WTV++G
Sbjct: 130 GVKHSTVLCPSKPQVIKDNGGYLCKGQVPNIGIESWDWNMTGEKSTYHALYPRSWTVYDG 189
Query: 195 EPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSE 254
EPDPELRIV RQ+SP IPHNY+ESS PVSVF +T+ N+G A +TLLFTW NSVGG S
Sbjct: 190 EPDPELRIVSRQVSPFIPHNYEESSLPVSVFNFTVTNTGAEQAIVTLLFTWENSVGGASG 249
Query: 255 FTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG 314
TGQH+NS K DGVHAV L H+T++ PPV++A+AA+ET+ V VS CP F++SG +
Sbjct: 250 LTGQHFNSTMKAKDGVHAVALQHKTANGHPPVSYAIAAKETEDVRVSSCPCFIVSGTTPN 309
Query: 315 -LTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWD 373
+TA DMW EIK++ SFD+L S S S+PG+SIGAAIAA V VPP + VTFSL+WD
Sbjct: 310 KITAGDMWDEIKKNASFDKLTSNACS-PSKPGTSIGAAIAAKVKVPPGCDRTVTFSLSWD 368
Query: 374 CPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
CPE F KTY+RRYT+FYG+ NAA +A DA+L WE QIE WQ PIL D LPE
Sbjct: 369 CPEARF-DEKTYHRRYTRFYGSLGNAAVAMAHDALLNFSEWETQIEEWQAPILADTTLPE 427
Query: 434 WYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTA 493
WY ITLFNELYY N+GG +WTDG PP SL +IG RK SL S +D P+QND A
Sbjct: 428 WYRITLFNELYYFNSGGTMWTDGLPPKQSLDSIGRRKISLSISTTDNTD----PDQNDIA 483
Query: 494 VNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSS 553
++IL R+ ++ QI+ P++ N++ GT ++Q+ ENIGQFLYLEGI+YLM+NTYDVHFYSS
Sbjct: 484 LDILGRIDAVCSQIHAPLSSNAALGTTMIQNTSENIGQFLYLEGIQYLMYNTYDVHFYSS 543
Query: 554 FALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEV 613
FAL+MLFPK++LSIQRDFAAAV+MHD SK +++ G++V+RK LGAVPHDIG+ DPWFEV
Sbjct: 544 FALLMLFPKLELSIQRDFAAAVLMHDSSKKQVMSSGEFVTRKALGAVPHDIGLNDPWFEV 603
Query: 614 NAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMI 673
NAY L++T RWKDLN KFVLQVYRDVVATGD FAKAVWPSVY A+AY+DQFD+DGDGMI
Sbjct: 604 NAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDKDGDGMI 663
Query: 674 ENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY 733
EN+GFPDQTYD WS SG+SAY GGLWVAALQA SALAREVGD G+ YF K++KA+ VY
Sbjct: 664 ENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALAREVGDNGAAVYFNAKYEKARSVY 723
Query: 734 EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL 793
EKLWNGSYFNYDNS S SSSI ADQLAGQWYARACGL PI E+ + AL+ VY++NV+
Sbjct: 724 EKLWNGSYFNYDNSRSGSSSSILADQLAGQWYARACGLKPIAKEEWIKKALETVYDFNVM 783
Query: 794 KVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEA 853
+V G RGAVNGMLPDGRVD S+M SRE+W+G TY+VAA MI E LAD GF+TA GIYEA
Sbjct: 784 RVRDGTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEA 843
Query: 854 AWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP-KTLEKQMKPEVTD 912
AWS GLG AFQTPEAW T+D++RSLCYMRPLAIW +QWA T P+P + E +K + D
Sbjct: 844 AWSDRGLGCAFQTPEAWTTNDEFRSLCYMRPLAIWGIQWAHTMPQPNREQEVSLKQQEED 903
Query: 913 ES--LLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCR 950
+ L + HAGF KVA LK + + ++ LQ+ ++ R
Sbjct: 904 ATSVLFQQHAGFIKVAHYLKNTKGKDHRNRLQTAYETFLR 943
>gi|30692911|ref|NP_174631.2| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
gi|79319136|ref|NP_001031135.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
gi|332193493|gb|AEE31614.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
gi|332193494|gb|AEE31615.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
Length = 947
Score = 1159 bits (2998), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 572/936 (61%), Positives = 714/936 (76%), Gaps = 24/936 (2%)
Query: 24 DKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFI 83
DK+ P +W+RKL++ S+F L+ ++ + L P+G R+ ++EAAKG+ +
Sbjct: 15 DKLPP-----FSWERKLNSQAKTPSEFKLSKRDHMHLFPLGYRLWRHTKDEAAKGRASIF 69
Query: 84 DPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSN 143
D F K H+T HGVPLGG+GSGSIGRSY+GEFQ++++FP++CE+ P+L NQFS FVSR
Sbjct: 70 DIFRKHHITGDHGVPLGGIGSGSIGRSYKGEFQQFKLFPKICEEAPILTNQFSAFVSRPG 129
Query: 144 GQKYSSVLCPKTPEVLKDTTA---------AGIGSWDWNLKGDKSTYHALYPRAWTVHEG 194
G K+S+VLCP P+V+KD GI SWDWN+ G+KSTYHALYPR+WTV++G
Sbjct: 130 GVKHSTVLCPSKPQVVKDNGGYLCQGQAPNMGIESWDWNMTGEKSTYHALYPRSWTVYDG 189
Query: 195 EPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSE 254
EPDPELRIV RQ+SP IPHNY+ESS PVSVF +T+ N+G A +TLLFTW NSVGG S
Sbjct: 190 EPDPELRIVSRQVSPFIPHNYEESSLPVSVFNFTVTNTGAEPAIVTLLFTWENSVGGASG 249
Query: 255 FTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG 314
TGQH+NS K DGVHAV L H+T++ PPV++A+AA+ET+ V VS CP F++SG +
Sbjct: 250 LTGQHFNSTMKAKDGVHAVALQHKTANGHPPVSYAIAAKETEDVRVSSCPCFIVSGTTPN 309
Query: 315 -LTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWD 373
+TA DMW EIK++ SFD+L S S S+PG+SIGAAIAA V VPP + VTFSL+WD
Sbjct: 310 QITAGDMWDEIKKNASFDKLTSNACS-PSKPGTSIGAAIAAKVKVPPGCDRTVTFSLSWD 368
Query: 374 CPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
CPE F KTY+RRYT+FYG+ NAA +A DA+L WE QIE WQ P+L D LPE
Sbjct: 369 CPEARF-DEKTYHRRYTRFYGSLGNAAVAMAHDALLNFPEWETQIEEWQAPVLADTTLPE 427
Query: 434 WYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTA 493
WY +TLFNELYY N+GG +WTDG PP SL +IG RK SL S D + +QN+ A
Sbjct: 428 WYRVTLFNELYYFNSGGTMWTDGLPPKQSLDSIGRRKISLGLSTID----KNDQDQNNVA 483
Query: 494 VNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSS 553
++IL R+ ++ QI+ P+ N++ G ++Q+ ENIGQFLYLEG++YLM+NTYDVHFYSS
Sbjct: 484 LDILGRIDAVCSQIHAPLTSNAALGNTMIQNTTENIGQFLYLEGVQYLMYNTYDVHFYSS 543
Query: 554 FALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEV 613
FAL+MLFPKI+LSIQRDFAAAV+MHD SK +++ G++V+RKVLGAVPHDIG+ DPWFE+
Sbjct: 544 FALLMLFPKIELSIQRDFAAAVLMHDSSKKQVMSSGEFVTRKVLGAVPHDIGLNDPWFEL 603
Query: 614 NAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMI 673
NAY L++T RWKDLN KFVLQVYRDVVATGD FAKAVWPSVY A+AY+DQFD+DGDGMI
Sbjct: 604 NAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDKDGDGMI 663
Query: 674 ENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY 733
EN+GFPDQTYD WS SG+SAY GGLWVAALQA SALARE+GD G+ YF K++KA+ VY
Sbjct: 664 ENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALAREIGDNGAAVYFNAKYEKARSVY 723
Query: 734 EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL 793
EKLWNGSYFNYDNS S SSSI ADQ+AGQWYARACGL PI E+ + AL+ VY++NV+
Sbjct: 724 EKLWNGSYFNYDNSRSGSSSSILADQMAGQWYARACGLKPIAKEEWIKKALETVYDFNVM 783
Query: 794 KVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEA 853
+V G RGAVNGMLPDGRVD S+M SRE+W+G TY+VAA MI E LAD GF+TA GIYEA
Sbjct: 784 RVRDGTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEA 843
Query: 854 AWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP-KTLEKQMKPEVTD 912
AWS GLG AFQTPEAW T+D+YRSLCYMRPLAIW +QWA T PKP + E ++P+ D
Sbjct: 844 AWSDRGLGCAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPKPNREQEVSLRPQEED 903
Query: 913 ES--LLRYHAGFSKVARLLKLPEEQGAKSLLQSLFD 946
+ L + HAGF KVA LK + + ++ LQ+ ++
Sbjct: 904 ATSVLFQQHAGFIKVAHYLKNTKGKDHRNRLQTAYE 939
>gi|28393605|gb|AAO42222.1| unknown protein [Arabidopsis thaliana]
Length = 947
Score = 1157 bits (2992), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 571/936 (61%), Positives = 713/936 (76%), Gaps = 24/936 (2%)
Query: 24 DKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFI 83
DK+ P +W+RKL++ S+F L+ ++ + L P+G R+ ++EAAKG+ +
Sbjct: 15 DKLPP-----FSWERKLNSQAKTPSEFKLSKRDHMHLFPLGYRLWRHTKDEAAKGRASIF 69
Query: 84 DPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSN 143
D F K H+T HGVPLGG+GSGSIGRSY+GEFQ++++FP++CE+ P+L NQFS FVSR
Sbjct: 70 DIFRKHHITGDHGVPLGGIGSGSIGRSYKGEFQQFKLFPKICEEAPILTNQFSAFVSRPG 129
Query: 144 GQKYSSVLCPKTPEVLKDTTA---------AGIGSWDWNLKGDKSTYHALYPRAWTVHEG 194
G K+S+VLCP P+V+KD GI SWDWN+ G+KSTYHALYPR+WTV++G
Sbjct: 130 GVKHSTVLCPSKPQVVKDNGGYLCQGQAPNMGIESWDWNMTGEKSTYHALYPRSWTVYDG 189
Query: 195 EPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSE 254
EPDPELRIV RQ+SP IPHNY+ESS PVSVF +T+ N+G A +TLLFTW NSVGG S
Sbjct: 190 EPDPELRIVSRQVSPFIPHNYEESSLPVSVFNFTVTNTGAEPAIVTLLFTWENSVGGASG 249
Query: 255 FTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG 314
TGQH+NS K DGVHAV L H+T++ PPV++A+AA+ET+ V VS CP F++SG +
Sbjct: 250 LTGQHFNSTMKAKDGVHAVALQHKTANGHPPVSYAIAAKETEDVRVSSCPCFIVSGTTPN 309
Query: 315 -LTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWD 373
+TA DMW EIK++ SFD+L S S S+PG+SIGAAIAA V VPP + VTFSL+WD
Sbjct: 310 QITAGDMWDEIKKNASFDKLTSNACS-PSKPGTSIGAAIAAKVKVPPGCDRTVTFSLSWD 368
Query: 374 CPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
CPE F KTY+RRYT+FYG+ NAA +A DA+L WE QIE WQ P+L D LPE
Sbjct: 369 CPEARF-DEKTYHRRYTRFYGSLGNAAVAMAHDALLNFPEWETQIEEWQAPVLADTTLPE 427
Query: 434 WYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTA 493
WY +TLFNELYY N+GG +WTDG PP SL +IG RK SL S D + +QN+ A
Sbjct: 428 WYRVTLFNELYYFNSGGTMWTDGLPPKQSLDSIGRRKISLGLSTID----KNDQDQNNVA 483
Query: 494 VNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSS 553
++IL R+ ++ QI+ P+ N++ G ++Q+ ENIGQFLYLEG++YLM+NTY VHFYSS
Sbjct: 484 LDILGRIDAVCSQIHAPLTSNAALGNTMIQNTTENIGQFLYLEGVQYLMYNTYGVHFYSS 543
Query: 554 FALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEV 613
FAL+MLFPKI+LSIQRDFAAAV+MHD SK +++ G++V+RKVLGAVPHDIG+ DPWFE+
Sbjct: 544 FALLMLFPKIELSIQRDFAAAVLMHDSSKKQVMSSGEFVTRKVLGAVPHDIGLNDPWFEL 603
Query: 614 NAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMI 673
NAY L++T RWKDLN KFVLQVYRDVVATGD FAKAVWPSVY A+AY+DQFD+DGDGMI
Sbjct: 604 NAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDKDGDGMI 663
Query: 674 ENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY 733
EN+GFPDQTYD WS SG+SAY GGLWVAALQA SALARE+GD G+ YF K++KA+ VY
Sbjct: 664 ENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALAREIGDNGAAVYFNAKYEKARSVY 723
Query: 734 EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL 793
EKLWNGSYFNYDNS S SSSI ADQ+AGQWYARACGL PI E+ + AL+ VY++NV+
Sbjct: 724 EKLWNGSYFNYDNSRSGSSSSILADQMAGQWYARACGLKPIAKEEWIKKALETVYDFNVM 783
Query: 794 KVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEA 853
+V G RGAVNGMLPDGRVD S+M SRE+W+G TY+VAA MI E LAD GF+TA GIYEA
Sbjct: 784 RVRDGTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEA 843
Query: 854 AWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP-KTLEKQMKPEVTD 912
AWS GLG AFQTPEAW T+D+YRSLCYMRPLAIW +QWA T PKP + E ++P+ D
Sbjct: 844 AWSDRGLGCAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPKPNREQEVSLRPQEED 903
Query: 913 ES--LLRYHAGFSKVARLLKLPEEQGAKSLLQSLFD 946
+ L + HAGF KVA LK + + ++ LQ+ ++
Sbjct: 904 ATSVLFQQHAGFIKVAHYLKNTKGKDHRNRLQTAYE 939
>gi|326489639|dbj|BAK01800.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 797
Score = 1111 bits (2873), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 527/794 (66%), Positives = 647/794 (81%), Gaps = 5/794 (0%)
Query: 160 KDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESS 219
+ T +GIGSWDWNL G KSTYHALYPRAWTV++GEPDP+L+IVCRQISPIIPHNY++SS
Sbjct: 6 RGTNISGIGSWDWNLNGGKSTYHALYPRAWTVYDGEPDPDLKIVCRQISPIIPHNYQQSS 65
Query: 220 YPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRT 279
YP +VFT+T+ NSG T+AD+TLLFTW NSVGG SE TG H NS DGVH +LLHHRT
Sbjct: 66 YPAAVFTFTVTNSGNTAADVTLLFTWANSVGGKSELTGNHSNSSMTEKDGVHGILLHHRT 125
Query: 280 SHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETS 339
+ PPV FA+AAQE + +H+S CP+FV+SG+S TAKDMW+ +KEHGSFD L+ ++ S
Sbjct: 126 ADGRPPVAFAIAAQEKEDIHISECPYFVMSGSSDEFTAKDMWNSVKEHGSFDLLDPVKES 185
Query: 340 VTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNA 399
+S PG+SIGAAIAAS+ + P + V+FSLAW PEV F SGKTY+RRYTKFYGT +A
Sbjct: 186 TSSRPGTSIGAAIAASIKLAPQATQNVSFSLAWASPEVKFCSGKTYHRRYTKFYGTDVDA 245
Query: 400 AANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPP 459
AA++A DAIL+H SWE QIE WQ PIL+DKR P WYP+TLFNELYYLNAGG +WTDG PP
Sbjct: 246 AASLAHDAILDHSSWETQIENWQHPILQDKRFPAWYPVTLFNELYYLNAGGTIWTDGLPP 305
Query: 460 VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGT 519
+ SL IG +KFSLD S + + Q +TA +IL +M+S+LE+I+ +A +S+ GT
Sbjct: 306 IQSLTAIGGKKFSLDMSNEETDDDSQMNPQTNTATDILHQMASVLERIHASLASSSAIGT 365
Query: 520 NLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHD 579
LLQ GEENIGQFLYLEGIEY MWNTYDVHFYSSF+LIMLFPK+QLS+QRDFAAAVM+HD
Sbjct: 366 TLLQ-GEENIGQFLYLEGIEYYMWNTYDVHFYSSFSLIMLFPKLQLSVQRDFAAAVMIHD 424
Query: 580 PSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDV 639
P K+KLL DG+ +RKVLGAVPHD+G+ DPW +VNAY L++T RWKDLNPKFVLQVYRDV
Sbjct: 425 PEKLKLLHDGKLAARKVLGAVPHDLGLYDPWIKVNAYTLHNTDRWKDLNPKFVLQVYRDV 484
Query: 640 VATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
VATGDK FA+AVWPSVY+AMAYM+QFD+D DGMIEN+ FPDQTYD WS++G+SAY GGLW
Sbjct: 485 VATGDKSFARAVWPSVYMAMAYMEQFDKDKDGMIENEDFPDQTYDVWSMAGVSAYCGGLW 544
Query: 700 VAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQAD 758
VAALQAASALA EVGD+ SE F K++KAK VY+ KLWNGSYFNYD++G+ S+SI AD
Sbjct: 545 VAALQAASALAHEVGDKASEKLFWNKYEKAKSVYDKKLWNGSYFNYDDAGTKASTSIHAD 604
Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQ 818
QLAGQWYA+ACGL IVD+DK++SAL+K+Y +NV+K G RGA+NGM PDG +DMS+MQ
Sbjct: 605 QLAGQWYAKACGLSSIVDKDKSQSALEKIYAFNVMKFKDGNRGAINGMWPDGTLDMSTMQ 664
Query: 819 SREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRS 878
SREIW GVTYA+AASMI E + D GF+TA GIY AAWS GLGYAFQTPE+WN DD+YRS
Sbjct: 665 SREIWPGVTYALAASMIQEGMVDEGFKTAEGIYNAAWSTEGLGYAFQTPESWNNDDEYRS 724
Query: 879 LCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAK 938
LCYMRPLAIW++QWAL+ PK L K + ++T +S + ++++A+LL+LPE++ +K
Sbjct: 725 LCYMRPLAIWSIQWALSNPK---LHKAPQTDITQDSFPKNQFSYARIAKLLQLPEDESSK 781
Query: 939 SLLQSLFDHTCRRM 952
SL + +++ R
Sbjct: 782 SLPRVIYEIVRNRF 795
>gi|62733283|gb|AAX95400.1| At5g49900 [Oryza sativa Japonica Group]
Length = 931
Score = 1088 bits (2814), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 552/911 (60%), Positives = 680/911 (74%), Gaps = 50/911 (5%)
Query: 81 AFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFS---- 136
A IDP KR S GVPLGG+GSGSIGRSY+GEFQRWQ+FP CE++PVLANQFS
Sbjct: 28 AVIDPMKKRIAKSGQGVPLGGIGSGSIGRSYKGEFQRWQLFPGTCEERPVLANQFSDRYL 87
Query: 137 --------------VFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYH 182
F+SR +G+ YSSVL P P++ K + +GIGSWDWN+ G STYH
Sbjct: 88 ATVNVLTVSRDYTQAFISRKDGRNYSSVLHPGKPDLPKGSNISGIGSWDWNMSGQNSTYH 147
Query: 183 ALYPRAWTVHEGEPDPELRIVCRQISPIIPHN-----YKESSYPVSVFTYTIYNSGKTSA 237
ALYPR+WT++ G V Q + I+ N E V++ T NSG T+A
Sbjct: 148 ALYPRSWTIYNGIAVTNNNRVYFQATTILFSNVGFCLLGEPDPDVNIVT----NSGNTAA 203
Query: 238 DITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHR-----------------TS 280
D+TLLFTW V + F + + DGVH +LLHHR T+
Sbjct: 204 DVTLLFTWA-FVINEHLFYTFNILMNDREKDGVHGILLHHRLSPKQISICLNDMYESRTA 262
Query: 281 HQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSV 340
+ PPVTFA+AAQE + +H+S CP+F+ISG+S +AKDMW+ +KE+GSFD L+ +TS+
Sbjct: 263 NGQPPVTFAIAAQEKEDIHISECPYFIISGSSDAFSAKDMWNYVKENGSFDNLDLTKTSM 322
Query: 341 TSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAA 400
S+PG SIGAAIAASV +PP + V+F+LAW CPEV F SGKTY+RRYTKF+GT +AA
Sbjct: 323 CSKPGLSIGAAIAASVKLPPQTTQNVSFALAWACPEVKFSSGKTYHRRYTKFHGTDNDAA 382
Query: 401 ANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPV 460
A++A DAILEH SWE QIE WQ PIL+D+R P+WYP+TLFNELYYLNAGG +WTDG PP+
Sbjct: 383 ASLAHDAILEHNSWERQIEEWQNPILQDERFPDWYPVTLFNELYYLNAGGTIWTDGLPPI 442
Query: 461 HSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTN 520
SL IG +KFSLD D + +N+TA +IL +M+S+LE+I+ + NS+ GT
Sbjct: 443 QSLTGIGEKKFSLDMQNGDADDANGIIPRNNTASDILNQMASVLERIHASMESNSAIGTT 502
Query: 521 LLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
LLQ GEENIGQFLYLEGIEY MWNTYDVHFY+SF+LIMLFPK+QLSIQRDFAAAV+MHDP
Sbjct: 503 LLQ-GEENIGQFLYLEGIEYYMWNTYDVHFYASFSLIMLFPKLQLSIQRDFAAAVLMHDP 561
Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
K+++L DG+WV+RKVLGAVPHD+G+ DPWF+VNAY LY+T RWKDLNPKFVLQVYRDVV
Sbjct: 562 EKLRMLHDGKWVARKVLGAVPHDLGLYDPWFKVNAYTLYNTDRWKDLNPKFVLQVYRDVV 621
Query: 641 ATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWV 700
ATGDK FA+AVWPSVY+AMAYM+QFDRD DGMIEN+ FPDQTYD WS++GISAY GGLWV
Sbjct: 622 ATGDKSFARAVWPSVYMAMAYMEQFDRDKDGMIENEDFPDQTYDVWSMAGISAYCGGLWV 681
Query: 701 AALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQL 760
AALQAASALA EVGD+ SE F K++KAK VY KLWNGSYFNYD+ + S+SI ADQL
Sbjct: 682 AALQAASALAHEVGDKASEKLFWDKYEKAKSVYGKLWNGSYFNYDDGDNIMSASIHADQL 741
Query: 761 AGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSR 820
AGQWYA+ACGL PIVD+DKA SAL+K+Y++NV+K GKRGA+NGM PDG VDMS+MQSR
Sbjct: 742 AGQWYAKACGLFPIVDKDKAESALEKIYSFNVMKFKDGKRGAMNGMWPDGTVDMSAMQSR 801
Query: 821 EIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLC 880
EIW GVTYA+AA+MI E + + GF+TA GIY AAWS GLGY+FQTPEAWN DD+YRSLC
Sbjct: 802 EIWPGVTYALAATMIQEGMVEKGFKTAEGIYHAAWSPEGLGYSFQTPEAWNNDDEYRSLC 861
Query: 881 YMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSL 940
YMRPLAIWA+QWAL+ PK L KQ ++ +S + ++++A+LL LPE++ KS
Sbjct: 862 YMRPLAIWAIQWALSNPK---LHKQTA-DIPQDSFPKNQFSYARIAKLLHLPEDESPKSF 917
Query: 941 LQSLFDHTCRR 951
L+ +++ R
Sbjct: 918 LRVIYEIVRNR 928
>gi|20453181|gb|AAM19831.1| AT4g10060/T5L19_190 [Arabidopsis thaliana]
gi|24111431|gb|AAN46866.1| At4g10060/T5L19_190 [Arabidopsis thaliana]
Length = 922
Score = 1087 bits (2810), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 552/934 (59%), Positives = 688/934 (73%), Gaps = 36/934 (3%)
Query: 26 VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDP 85
V+ + +TWQRKL++ S+F ++ ++ + L P+G R+ +EEA KG+ + D
Sbjct: 16 VENERLPQVTWQRKLNSKVKNPSEFKMSIRDVLHLFPLGYRLWRYTKEEAKKGRFSMYDI 75
Query: 86 FIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ 145
F KRH+ HGVPLGG+G GSIGRSY+GEFQ++++FP++CE+ P+L NQFSVFVSR G
Sbjct: 76 FKKRHVRGDHGVPLGGIGGGSIGRSYKGEFQQFKLFPKICEEAPILTNQFSVFVSRPGGL 135
Query: 146 KYSSVLCPKTPEVLKDTTA-AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVC 204
YS+VLCP P+ +K T GI SWDWN++GDKSTYHALYPR+WTV+ EPDPELRIV
Sbjct: 136 SYSTVLCPTKPKSVKGKTEDLGIESWDWNMEGDKSTYHALYPRSWTVY-NEPDPELRIVS 194
Query: 205 RQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKT 264
RQ+SP IPHNYKESS PVSVF +T+ N GK A +TLLFTW NSVGG S TG+H+NS
Sbjct: 195 RQVSPFIPHNYKESSLPVSVFEFTMSNLGKEEATVTLLFTWENSVGGASGLTGEHFNSTI 254
Query: 265 KMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNS-LGLTAKDMWHE 323
DGVHA++LHH+T + PPVT+A+AAQET+ VHVS CP F++SG+S +TAK+MW E
Sbjct: 255 MERDGVHAIVLHHKTGNGHPPVTYAIAAQETEDVHVSECPCFLVSGHSPKEITAKEMWDE 314
Query: 324 IKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
IK++ SFD LNS E S PG+SIGAAIAA V VPP + VTFSL+WDCPEV F + K
Sbjct: 315 IKKNKSFDELNS-EPGSPSRPGTSIGAAIAAKVKVPPGCDRTVTFSLSWDCPEVRF-NEK 372
Query: 384 TYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNEL 443
TY+RRYTKFYG +AA +ARDA+L + WE QIEAWQ PIL D LP+WY +TLFNEL
Sbjct: 373 TYHRRYTKFYGNLGDAAVKMARDALLNYVDWESQIEAWQSPILSDTTLPDWYRVTLFNEL 432
Query: 444 YYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSI 503
YY N+GG +WTDG PP S+ + V QND +++ ++++++
Sbjct: 433 YYFNSGGTIWTDGLPPKESIE----------------RSKVTNTEQNDIVIDLFQKINAV 476
Query: 504 LEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKI 563
EQIY+P + NS EENIGQF+YLEGIEYLM+NTYDVHFYSSFAL+ LFPK+
Sbjct: 477 CEQIYSPQSSNS----------EENIGQFIYLEGIEYLMYNTYDVHFYSSFALLSLFPKL 526
Query: 564 QLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTAR 623
LSIQRDFAA V++ DP+K K++ G+WV+RK+LG+VPHDIG+ DPW E+N Y ++T R
Sbjct: 527 ALSIQRDFAATVLIQDPTKKKIMSSGEWVTRKLLGSVPHDIGLNDPWLELNEYNFFNTDR 586
Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTY 683
WKDLN KFVLQVYRDVVAT D+ FAKAVWPSVY A+AY+DQFD+D DGMIEN+GFPDQTY
Sbjct: 587 WKDLNAKFVLQVYRDVVATNDQSFAKAVWPSVYTAVAYLDQFDKDEDGMIENEGFPDQTY 646
Query: 684 DTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFN 743
D WSV+G+SAY GGLWVAALQAASA A VG+ YF K++KAK+VYEKLWNGSYFN
Sbjct: 647 DAWSVTGVSAYCGGLWVAALQAASAFASIVGENAVAIYFNAKYEKAKIVYEKLWNGSYFN 706
Query: 744 YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAV 803
YD+SGS SSSI ADQLAGQWYARACGL PI E+ + AL+ +Y +NV+KV GG RGAV
Sbjct: 707 YDDSGSGSSSSILADQLAGQWYARACGLKPITKEEWIKKALETIYEFNVMKVKGGTRGAV 766
Query: 804 NGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYA 863
NGM +G+VD +S+ S+E+W+G TY+VAA MI E + GFQTA GIYEA WS GL +
Sbjct: 767 NGMSTEGKVDTNSLVSKEVWAGTTYSVAACMIQEGQREKGFQTASGIYEAIWSDRGLSCS 826
Query: 864 FQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQM----KPEVTDESLLRYH 919
FQTPEAWN +D+YRSLCYMRPLAIWA+QWALTR + EKQ E LLR H
Sbjct: 827 FQTPEAWNMNDEYRSLCYMRPLAIWAIQWALTRTQSFGEEKQKLVAGDEEEESNLLLRQH 886
Query: 920 AGFSKVARLLKL-PEEQGAKSLLQSLFDHTCRRM 952
GF VAR +K+ P +S LQ ++ + +
Sbjct: 887 KGFKDVARFVKIVPTSNVHRSRLQHTYETVLKTL 920
>gi|240255774|ref|NP_192744.6| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|332657434|gb|AEE82834.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
Length = 922
Score = 1087 bits (2810), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 551/926 (59%), Positives = 685/926 (73%), Gaps = 36/926 (3%)
Query: 34 LTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTS 93
+TWQRKL++ S+F ++ ++ + L P+G R+ +EEA KG+ + D F KRH+
Sbjct: 24 VTWQRKLNSKVKNPSEFKMSIRDVLHLFPLGYRLWRYTKEEAKKGRFSMYDIFKKRHVRG 83
Query: 94 SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCP 153
HGVPLGG+G GSIGRSY+GEFQ++++FP++CE+ P+L NQFSVFVSR G YS+VLCP
Sbjct: 84 DHGVPLGGIGGGSIGRSYKGEFQQFKLFPKICEEAPILTNQFSVFVSRPGGLSYSTVLCP 143
Query: 154 KTPEVLKDTTA-AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIP 212
P+ +K T GI SWDWN++GDKSTYHALYPR+WTV+ EPDPELRIV RQ+SP IP
Sbjct: 144 TKPKSVKGKTEDLGIESWDWNMEGDKSTYHALYPRSWTVY-NEPDPELRIVSRQVSPFIP 202
Query: 213 HNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHA 272
HNYKESS PVSVF +T+ N GK A +TLLFTW NSVGG S TG+H+NS DGVHA
Sbjct: 203 HNYKESSLPVSVFEFTMSNLGKEEATVTLLFTWENSVGGASGLTGEHFNSTIMERDGVHA 262
Query: 273 VLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNS-LGLTAKDMWHEIKEHGSFD 331
++LHH+T + PPVT+A+AAQET+ VHVS CP F++SG+S +TAK+MW EIK++ SFD
Sbjct: 263 IVLHHKTGNGHPPVTYAIAAQETEDVHVSECPCFLVSGHSPKEITAKEMWDEIKKNKSFD 322
Query: 332 RLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTK 391
LNS E S PG+SIGAAIAA V VPP + VTFSL+WDCPEV F + KTY+RRYTK
Sbjct: 323 ELNS-EPGSPSRPGTSIGAAIAAKVKVPPGCDRTVTFSLSWDCPEVRF-NEKTYHRRYTK 380
Query: 392 FYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGA 451
FYG +AA +ARDA+L + WE QIEAWQ PIL D LP+WY +TLFNELYY N+GG
Sbjct: 381 FYGNLGDAAVKMARDALLNYVDWESQIEAWQSPILSDTTLPDWYRVTLFNELYYFNSGGT 440
Query: 452 VWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPV 511
+WTDG PP S+ + V QND +++ ++++++ EQIY+P
Sbjct: 441 IWTDGLPPKESIE----------------RSKVTNTEQNDIVIDLFQKINAVCEQIYSPQ 484
Query: 512 ALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDF 571
+ NS EENIGQF+YLEGIEYLM+NTYDVHFYSSFAL+ LFPK+ LSIQRDF
Sbjct: 485 SSNS----------EENIGQFIYLEGIEYLMYNTYDVHFYSSFALLSLFPKLALSIQRDF 534
Query: 572 AAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKF 631
AA V++ DP+K K++ G+WV+RK+LG+VPHDIG+ DPW E+N Y ++T RWKDLN KF
Sbjct: 535 AATVLIQDPTKKKIMSSGEWVTRKLLGSVPHDIGLNDPWLELNEYNFFNTDRWKDLNAKF 594
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
VLQVYRDVVAT D+ FAKAVWPSVY A+AY+DQFD+D DGMIEN+GFPDQTYD WSV+G+
Sbjct: 595 VLQVYRDVVATNDQSFAKAVWPSVYTAVAYLDQFDKDEDGMIENEGFPDQTYDAWSVTGV 654
Query: 692 SAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQ 751
SAY GGLWVAALQAASA A VG+ YF K++KAK+VYEKLWNGSYFNYD+SGS
Sbjct: 655 SAYCGGLWVAALQAASAFASIVGENAVAIYFNAKYEKAKIVYEKLWNGSYFNYDDSGSGS 714
Query: 752 SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
SSSI ADQLAGQWYARACGL PI E+ + AL+ +Y +NV+KV GG RGAVNGM +G+
Sbjct: 715 SSSILADQLAGQWYARACGLKPITKEEWIKKALETIYEFNVMKVKGGTRGAVNGMSTEGK 774
Query: 812 VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWN 871
VD +S+ S+E+W+G TY+VAA MI E + GFQTA GIYEA WS GL +FQTPEAWN
Sbjct: 775 VDTNSLVSKEVWAGTTYSVAACMIQEGQREKGFQTASGIYEAIWSDRGLSCSFQTPEAWN 834
Query: 872 TDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQM----KPEVTDESLLRYHAGFSKVAR 927
+D+YRSLCYMRPLAIWA+QWALTR + EKQ E LLR H GF VAR
Sbjct: 835 MNDEYRSLCYMRPLAIWAIQWALTRTQSFGEEKQKLVAGDEEEESNLLLRQHKGFKDVAR 894
Query: 928 LLKL-PEEQGAKSLLQSLFDHTCRRM 952
+K+ P +S LQ ++ + +
Sbjct: 895 FVKIVPTSNVHRSRLQHTYETVLKTL 920
>gi|297809209|ref|XP_002872488.1| AT4g10060/T5L19_190 [Arabidopsis lyrata subsp. lyrata]
gi|297318325|gb|EFH48747.1| AT4g10060/T5L19_190 [Arabidopsis lyrata subsp. lyrata]
Length = 921
Score = 1075 bits (2779), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 548/919 (59%), Positives = 682/919 (74%), Gaps = 35/919 (3%)
Query: 34 LTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTS 93
+TWQRKL++ S+F ++ ++ + L PIG R+ +EEA KG+ + D F KRH+T
Sbjct: 24 VTWQRKLNSKVKNPSEFKMSIRDVLHLFPIGYRLWSHTKEEAKKGRASMYDIFKKRHVTG 83
Query: 94 SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCP 153
HGVPLGG+G GSIGRSY+GEFQ++++FP+VCE+ P+L NQFSVFV RS G YSSVLCP
Sbjct: 84 DHGVPLGGIGGGSIGRSYKGEFQQFKLFPKVCEEAPILTNQFSVFVMRSGGVNYSSVLCP 143
Query: 154 KTPEVLKDTTAA-GIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIP 212
P+ ++ T GI SWDWN+KG KS YHALYPR+WT++ EPDPELRIV RQISP IP
Sbjct: 144 TKPKSVRGKTEELGIESWDWNMKGYKSMYHALYPRSWTIY-NEPDPELRIVSRQISPFIP 202
Query: 213 HNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHA 272
HNYKESS PVSVFT+T+ N GK A +TLLFTW NSVGG S TG+H+NS DGVHA
Sbjct: 203 HNYKESSLPVSVFTFTVTNLGKEEATVTLLFTWENSVGGASGLTGEHFNSTMMERDGVHA 262
Query: 273 VLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNS-LGLTAKDMWHEIKEHGSFD 331
+ LHH+TS+ PPVT+A+AAQET+ VHVS C F++SG+S +TAK+MW EIK++ SFD
Sbjct: 263 IALHHKTSNGHPPVTYAIAAQETEDVHVSECSCFLVSGHSPKDITAKEMWDEIKKNKSFD 322
Query: 332 RLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTK 391
LNS E S PG+SIGAAIAA+V VPP + V FSL+WDCPEV F + KTY+RRYTK
Sbjct: 323 GLNS-EPGSPSRPGTSIGAAIAANVKVPPGCDRTVMFSLSWDCPEVRF-NEKTYHRRYTK 380
Query: 392 FYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGA 451
FYG +AA +ARDA+L + WE QIEAWQ PIL D LP+WY +TLFNELYY N+GG
Sbjct: 381 FYGNLGDAAVKMARDALLNYVDWESQIEAWQSPILSDTALPDWYRVTLFNELYYFNSGGT 440
Query: 452 VWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPV 511
+WTDG PP + + +I++ QN+ + ++++ I QI++P
Sbjct: 441 IWTDGLPPKERI---------------ERSKIINT-EQNNIVTELFKKINVICAQIHSPQ 484
Query: 512 ALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDF 571
+LN+ EE+IGQF+YLEGIEYLM+NTYDVHFYSSFAL+ LFPK++LSIQRDF
Sbjct: 485 SLNA----------EEHIGQFIYLEGIEYLMYNTYDVHFYSSFALLSLFPKLELSIQRDF 534
Query: 572 AAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKF 631
AA V+ DP+K +++ G+WV RK+LG+VPHDIG+ DPW E+N Y ++T RWKDLN KF
Sbjct: 535 AATVLFQDPTKKQIMSSGEWVPRKLLGSVPHDIGLNDPWLELNEYNFFNTDRWKDLNAKF 594
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
VLQVYRDVV+TGD+ FAKAVWPSVY A+AY+DQFD+D D MIEN+GFPDQTYD WSV G+
Sbjct: 595 VLQVYRDVVSTGDQSFAKAVWPSVYTAVAYLDQFDKDEDEMIENEGFPDQTYDAWSVKGV 654
Query: 692 SAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQ 751
SAY GGLWVAALQAASA A VG+ G YF K++KAK VYEKLWNGSYFNYD+SGS
Sbjct: 655 SAYCGGLWVAALQAASAFASIVGENGVAIYFNTKYEKAKSVYEKLWNGSYFNYDDSGSGS 714
Query: 752 SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
SSSI ADQLAGQWYARACGL PI E+ + AL+ +Y +NV+KV GG RGAVNGM PDG+
Sbjct: 715 SSSILADQLAGQWYARACGLKPITKEEWIKKALETIYEFNVMKVKGGTRGAVNGMSPDGQ 774
Query: 812 VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWN 871
VD +S+ S+E+W+G TY+VAA MI E + GFQTA GIYEA WS GL +FQTPEAWN
Sbjct: 775 VDTNSLVSKEVWAGTTYSVAACMIQEGQRERGFQTASGIYEAVWSDRGLSCSFQTPEAWN 834
Query: 872 TDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDES---LLRYHAGFSKVARL 928
+D+YRSLCYMRPLAIW++QWALTR + EK+ + +E LLR H GF VAR
Sbjct: 835 MNDEYRSLCYMRPLAIWSIQWALTRTQSFGEEKEKLVKGEEEESDLLLRQHKGFKDVARF 894
Query: 929 LKL-PEEQGAKSLLQSLFD 946
+K+ P +S LQ ++
Sbjct: 895 VKIVPTRNEHRSRLQHTYE 913
>gi|115474457|ref|NP_001060825.1| Os08g0111200 [Oryza sativa Japonica Group]
gi|42408344|dbj|BAD09496.1| putative Bile acid beta-glucosidase [Oryza sativa Japonica Group]
gi|42408391|dbj|BAD09542.1| putative Bile acid beta-glucosidase [Oryza sativa Japonica Group]
gi|113622794|dbj|BAF22739.1| Os08g0111200 [Oryza sativa Japonica Group]
gi|215701490|dbj|BAG92914.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 928
Score = 1040 bits (2689), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 530/934 (56%), Positives = 671/934 (71%), Gaps = 27/934 (2%)
Query: 26 VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDP 85
+D KP +++W+R ++ F++ + + + P+ +++L L ++ AKG + DP
Sbjct: 15 MDCEKPPAISWERTFDDEGKKVAMFSMTLNDMMAIVPLMIKMLGLNLKDNAKGLASVYDP 74
Query: 86 FIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ 145
K GVPLGG+G+GSIGRSYRG FQ++QIFP + E+KP+LANQFS F+SR +G+
Sbjct: 75 LKKWMDNCYRGVPLGGIGAGSIGRSYRGYFQQFQIFPSIYEEKPILANQFSAFISRPDGK 134
Query: 146 KYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCR 205
+YS+VL +VLK AGIGSWDW LK TYH L+PR+WTV+ GEPDPE++I CR
Sbjct: 135 RYSTVLSAPNADVLKGIDKAGIGSWDWKLKEKNCTYHGLFPRSWTVYNGEPDPEIKITCR 194
Query: 206 QISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTK 265
QISP IPHNYKESS+PV+VFT+T++NSG T AD+TLLFTW NSVGG SE TG H NS+
Sbjct: 195 QISPFIPHNYKESSFPVAVFTFTLHNSGSTPADVTLLFTWANSVGGKSELTGNHKNSRMT 254
Query: 266 MNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVIS-GNSLGLTAKDMWHEI 324
DG PPVTFA+A++ETDGV V+ CP F + NS +TAKDMW +I
Sbjct: 255 TADG-------------RPPVTFAIASRETDGVRVTGCPRFTMGPSNSGDVTAKDMWDQI 301
Query: 325 KEHGSF--DRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
++GSF D + + S PGSSIGAA+AA+ TV V+F+L+W CPEV F +G
Sbjct: 302 NKNGSFVGDGNAAAAATGASRPGSSIGAAVAATTTVAAGGARAVSFALSWSCPEVKFPAG 361
Query: 383 KTYYRRYTKFYGTHQNAAA-NIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFN 441
+TY+RRYTKF+GT ++AAA +A DA+LEH WE QIE WQRPIL+DK LPEWYPITLFN
Sbjct: 362 RTYHRRYTKFHGTDRDAAAERLAHDALLEHMKWESQIEEWQRPILQDKSLPEWYPITLFN 421
Query: 442 ELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMS 501
ELYYLNAGG +WTDG PP ++ ++ F+LD V N IL ++
Sbjct: 422 ELYYLNAGGTIWTDGQPPKNTSLSSATEPFNLD-------TFSTVANGGSAVDGILSTVA 474
Query: 502 SILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFP 561
+ T A ++ GT LL+DGEEN+GQ LYLEG+EY MWNTYDVHFY+SFAL+ LFP
Sbjct: 475 VAAARSNTAAAAAAAMGTALLRDGEENVGQVLYLEGMEYNMWNTYDVHFYASFALLSLFP 534
Query: 562 KIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDT 621
+++L++QRDF V++HDP + L DG V+RKVLGAVPHD+G+ DPWFEVNAY L+D
Sbjct: 535 ELELNLQRDFVRGVLLHDPCLRRTL-DGATVARKVLGAVPHDMGLNDPWFEVNAYMLHDP 593
Query: 622 ARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQ 681
RWKDLNPKFVLQVYRDVVATG+ FA+A WP+VY+AMAYMDQFDRDGDGM+EN+G PDQ
Sbjct: 594 VRWKDLNPKFVLQVYRDVVATGNAGFAEAAWPAVYLAMAYMDQFDRDGDGMVENEGRPDQ 653
Query: 682 TYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGS 740
TYD WSVSG+SAY+GGLWVAALQAA+A+A VGD +E YF ++ +A+ VY ++LWNG
Sbjct: 654 TYDLWSVSGVSAYTGGLWVAALQAAAAMAGIVGDGAAEAYFRGRYHRARRVYTDELWNGG 713
Query: 741 YFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKR 800
YFNYDNSG + SSSIQADQLAGQWYARACGL PIVD DKAR AL V +YNV++V GG
Sbjct: 714 YFNYDNSGGATSSSIQADQLAGQWYARACGLEPIVDGDKARRALATVLDYNVMRVKGGAI 773
Query: 801 GAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGL 860
GAVNGM PDG VD SS QS+E+W GVTYAVAA+MIHE + + F+TA GI++A W G
Sbjct: 774 GAVNGMRPDGAVDASSTQSKEVWPGVTYAVAAAMIHEGMPEAAFKTAKGIHDAGWGKHGF 833
Query: 861 GYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDESLLRYHA 920
GYAFQTPE+W D YR+L YMRPL +WAMQWAL+ P + + E
Sbjct: 834 GYAFQTPESWTADGGYRALHYMRPLGVWAMQWALSPPVLHKEHRVAAVAASPEDAALGQE 893
Query: 921 GFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMFI 954
F KVA +L+LPEEQ K +L++L+D T R++ +
Sbjct: 894 KFDKVASMLRLPEEQQHKGILRALYD-TLRQLLL 926
>gi|357144426|ref|XP_003573288.1| PREDICTED: non-lysosomal glucosylceramidase-like isoform 1
[Brachypodium distachyon]
Length = 932
Score = 1022 bits (2643), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 538/938 (57%), Positives = 676/938 (72%), Gaps = 32/938 (3%)
Query: 26 VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDP 85
+D A+PA+ TWQRK + F++ + + + P+ + L EE KG+ DP
Sbjct: 15 IDCAQPATRTWQRKFDDEGKKIEMFSMTMNDMILIVPMIFKGLVRNAEERGKGRDVIYDP 74
Query: 86 FIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ 145
F K G+PLGG+G+GSIGRSYRG FQ +QIFP + E KP+LANQFS F+SR NG+
Sbjct: 75 FKKWMDNCYRGLPLGGLGAGSIGRSYRGYFQHFQIFPAIYEQKPILANQFSAFISRPNGK 134
Query: 146 KYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCR 205
YS+VL T +VLK AGIGSWDW LK TYHAL+PR+WTV++GEPDP ++I CR
Sbjct: 135 SYSTVLSAPTADVLKGVDKAGIGSWDWKLKEKNCTYHALFPRSWTVYDGEPDPAIKITCR 194
Query: 206 QISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTK 265
QISP IPHNYKESS+PV VFT+T++N+G T AD+TLLFTW NSVGG SE TG H NSK
Sbjct: 195 QISPFIPHNYKESSFPVVVFTFTVHNTGSTPADVTLLFTWANSVGGRSELTGNHTNSKMM 254
Query: 266 MNDGVHAVLLHHRTSHQ-LPPVTFALAAQETDGVHVSLCPHFVISGNSLG-LTAKDMWHE 323
DGVH VLLHHRT++ PPVTFA+A+QET+ V V+ CP F + +S G TAKDMW E
Sbjct: 255 ERDGVHGVLLHHRTANDGHPPVTFAIASQETEDVRVTDCPSFTMGSSSSGDFTAKDMWQE 314
Query: 324 IKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
IK+HGSF + + + S+PGSSIGAA+AA+ TVP V+F+L+W CPEV F +G+
Sbjct: 315 IKKHGSF--ITTDKEPRASKPGSSIGAAVAAATTVPAGGTRVVSFALSWSCPEVKFPAGR 372
Query: 384 TYYRRYTKFYGTHQNAAA-NIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
TY+RRYT F+G ++AAA ++A DA+LEH WE +IE WQRPIL+DKRLP+WYP+ LFNE
Sbjct: 373 TYHRRYTTFHGLDRDAAAESLAHDALLEHMDWESKIEEWQRPILQDKRLPDWYPVALFNE 432
Query: 443 LYYLNAGGAVWTDGSPPVHSLVT--IGHRKFSLDWSQSDLKRIVDVPNQNDTAVN-ILER 499
LYYLNAGG +WTDG PP S+ + +G FSL +P TAV+ IL
Sbjct: 433 LYYLNAGGTIWTDGLPPKMSVSSSGVGTEPFSL--------ATTYLPG---TAVDGILSA 481
Query: 500 MSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIML 559
++S E ++ A FG LL DGEEN+GQFLYLE EY M+NTYDVHFY+SFAL+ L
Sbjct: 482 VASATEDQHSAAA----FGATLLGDGEENVGQFLYLEATEYYMYNTYDVHFYASFALLSL 537
Query: 560 FPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLY 619
FP+++LS+QRDFA AV++HDP M L DG V RKVLGAVPHDIG+ DPWFE+NAY ++
Sbjct: 538 FPELELSLQRDFARAVLIHDPRPMVTL-DGATVLRKVLGAVPHDIGLNDPWFELNAYMIH 596
Query: 620 DTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFP 679
D ARWKDLNPKFVLQVYR VV TG+ FAKA WP+VY+AMAYMDQ+DRDGDGM+EN+G P
Sbjct: 597 DPARWKDLNPKFVLQVYRAVVVTGNAAFAKAAWPAVYLAMAYMDQYDRDGDGMVENEG-P 655
Query: 680 DQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWN 738
DQTYD WSV+G+SAY+GG+WV ALQA +A+AR VGD +E YF ++ +AK VY+ +LWN
Sbjct: 656 DQTYDLWSVTGVSAYTGGIWVTALQATAAMARIVGDGDAECYFHGRYLRAKQVYDAELWN 715
Query: 739 GSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGG 798
G+YF YDNSG S SI ADQLAGQWYA CGL P+V+EDKARSAL V +YNV++V GG
Sbjct: 716 GTYFRYDNSGGETSESIMADQLAGQWYAHVCGLEPVVEEDKARSALATVLDYNVMRVKGG 775
Query: 799 KRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGT 858
GAVNGM PDG +DMSS QS+EIWSG TYAVAA+M+HE + + F+TA G ++A+W
Sbjct: 776 AVGAVNGMRPDGGIDMSSTQSKEIWSGTTYAVAAAMVHEGMPEGAFRTAKGAHDASWGKA 835
Query: 859 GLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDE--SLL 916
G GYAFQTPEAW + YR L YMRPL++WAMQWAL+ P+ L K +
Sbjct: 836 GFGYAFQTPEAWTAEGGYRGLHYMRPLSVWAMQWALSPPE---LHKDLGLASVSPPGDAA 892
Query: 917 RYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMFI 954
R F KVA +L+LPE+ K +L+ H+ R++ +
Sbjct: 893 RGREKFDKVAGMLRLPEKVQHKGFFLALY-HSLRQLVL 929
>gi|222639789|gb|EEE67921.1| hypothetical protein OsJ_25789 [Oryza sativa Japonica Group]
Length = 919
Score = 1020 bits (2638), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 521/934 (55%), Positives = 663/934 (70%), Gaps = 36/934 (3%)
Query: 26 VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDP 85
+D KP +++W+R ++ F++ + + + P+ +++L L ++ AKG + DP
Sbjct: 15 MDCEKPPAISWERTFDDEGKKVAMFSMTLNDMMAIVPLMIKMLGLNLKDNAKGLASVYDP 74
Query: 86 FIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ 145
K GVPLGG+G+GSIGRSYRG FQ++QIFP + E+KP+LANQFS F+SR +G+
Sbjct: 75 LKKWMDNCYRGVPLGGIGAGSIGRSYRGYFQQFQIFPSIYEEKPILANQFSAFISRPDGK 134
Query: 146 KYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCR 205
+YS+VL +VLK AGIGSWDW LK TYH L+PR+WTV+ GEPDPE++I CR
Sbjct: 135 RYSTVLSAPNADVLKGIDKAGIGSWDWKLKEKNCTYHGLFPRSWTVYNGEPDPEIKITCR 194
Query: 206 QISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTK 265
QISP IPHNYKESS+PV+VFT+T++NSG T AD+TLLFTW NSVGG SE TG H NS+
Sbjct: 195 QISPFIPHNYKESSFPVAVFTFTLHNSGSTPADVTLLFTWANSVGGKSELTGNHKNSRMT 254
Query: 266 MNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVIS-GNSLGLTAKDMWHEI 324
DGVH VLLHHRT+ PPVTFA+A++ETDGV V+ CP F + NS +TAKDMW +I
Sbjct: 255 ARDGVHGVLLHHRTADGRPPVTFAIASRETDGVRVTGCPRFTMGPSNSGDVTAKDMWDQI 314
Query: 325 KEHGSF--DRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
++GSF D + + S PGSSIGAA+AA+ TV V+F+L+W CPEV F +G
Sbjct: 315 NKNGSFVGDGNAAAAATGASRPGSSIGAAVAATTTVAAGGARAVSFALSWSCPEVKFPAG 374
Query: 383 KTYYRRYTKFYGTHQNAAA-NIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFN 441
+TY+RRYTKF+GT ++AAA +A DA+LEH WE QIE WQRPIL+DK LPEWYPITLFN
Sbjct: 375 RTYHRRYTKFHGTDRDAAAERLAHDALLEHMKWESQIEEWQRPILQDKSLPEWYPITLFN 434
Query: 442 ELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMS 501
ELYYLNAGG +WTDG PP ++ ++ F+LD V N IL ++
Sbjct: 435 ELYYLNAGGTIWTDGQPPKNTSLSSATEPFNLD-------TFSTVANGGSAVDGILSTVA 487
Query: 502 SILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFP 561
+ T A ++ GT LL+DGEEN+GQ LYLEG+EY MWNTYDVHFY+SFAL+ LFP
Sbjct: 488 VAAARSNTAAAAAAAMGTALLRDGEENVGQVLYLEGMEYNMWNTYDVHFYASFALLSLFP 547
Query: 562 KIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDT 621
+++L++QRDF V++HDP + L DG V+RKVLGAVPHD+G+ DPWFEVNAY L+D
Sbjct: 548 ELELNLQRDFVRGVLLHDPCLRRTL-DGATVARKVLGAVPHDMGLNDPWFEVNAYMLHDP 606
Query: 622 ARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQ 681
RWKDLNPKFVLQVYRDVVATG+ FA+A WP+VY+AMAYMDQFDRDGDGM+EN+G PDQ
Sbjct: 607 VRWKDLNPKFVLQVYRDVVATGNAGFAEAAWPAVYLAMAYMDQFDRDGDGMVENEGRPDQ 666
Query: 682 TYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGS 740
TYD WSVSG+SAY+GGLWVAALQAA+A+A VGD +E YF ++ +A+ VY ++LWNG
Sbjct: 667 TYDLWSVSGVSAYTGGLWVAALQAAAAMAGIVGDGAAEAYFRGRYHRARRVYTDELWNGG 726
Query: 741 YFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKR 800
YFNYDNSG + SSSIQADQLAGQWYARACGL PIVD DKAR AL V +YNV++
Sbjct: 727 YFNYDNSGGATSSSIQADQLAGQWYARACGLEPIVDGDKARRALATVLDYNVMRT----- 781
Query: 801 GAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGL 860
S+E+ GVTYAVAA+MIHE + + F+TA GI++A W G
Sbjct: 782 -----------------HSKEVCPGVTYAVAAAMIHEGMPEAAFKTAKGIHDAGWGKHGF 824
Query: 861 GYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDESLLRYHA 920
GYAFQTPE+W D YR+L YMRPL +WAMQWAL+ P + + E
Sbjct: 825 GYAFQTPESWTADGGYRALHYMRPLGVWAMQWALSPPVLHKEHRVAAVAASPEDAALGQE 884
Query: 921 GFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMFI 954
F KVA +L+LPEEQ K +L++L+D T R++ +
Sbjct: 885 KFDKVASMLRLPEEQQHKGILRALYD-TLRQLLL 917
>gi|242080271|ref|XP_002444904.1| hypothetical protein SORBIDRAFT_07g001140 [Sorghum bicolor]
gi|241941254|gb|EES14399.1| hypothetical protein SORBIDRAFT_07g001140 [Sorghum bicolor]
Length = 929
Score = 1014 bits (2623), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 526/946 (55%), Positives = 665/946 (70%), Gaps = 50/946 (5%)
Query: 26 VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDP 85
+D ++P + TWQR+ ++ ++L + + +GV DP
Sbjct: 15 IDCSQPPARTWQRRFDDEGKKVAMYSLTMNDLTAI--VGV-----------------YDP 55
Query: 86 FIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ 145
K GVPLGG+G+GSIGRSYRG FQ++QIFP E+KP+LANQFS FVSR NG
Sbjct: 56 LRKWMDNCYRGVPLGGIGAGSIGRSYRGYFQQFQIFPATNEEKPILANQFSAFVSRPNGT 115
Query: 146 KYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCR 205
KYS+VL T ++LK+ A IGSWDW LK +K TYHAL+PR+WTV++GEPDPE++I R
Sbjct: 116 KYSTVLSAPTADLLKEVDKASIGSWDWKLKEEKCTYHALFPRSWTVYDGEPDPEIKITSR 175
Query: 206 QISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTK 265
QISP IPHNY+ESS+P +VFT+T++NSG T+AD+TLLFTW NSVGG SE TG H NSK
Sbjct: 176 QISPFIPHNYRESSFPAAVFTFTVHNSGSTAADVTLLFTWANSVGGKSELTGNHVNSKMT 235
Query: 266 MNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVIS-GNSLGLTAKDMWHEI 324
DGVH VLLHHRT+ PPVTFA+A+QETDGV VS+CP F + S TA DMW EI
Sbjct: 236 EQDGVHGVLLHHRTAGGHPPVTFAIASQETDGVRVSVCPSFTMGPSGSRDFTAADMWDEI 295
Query: 325 KEH-GSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
K+H S+ GSS+GAA+AAS TVP V+FSLAW CP++ F +G
Sbjct: 296 KKHGAFGHAGAGNAARAASKAGSSLGAAVAASTTVPAGETRVVSFSLAWACPDIKFPAGS 355
Query: 384 TYYRRYTKFYGTHQNAAA-NIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
TY+RRYTKFYG +AAA +A DA+LEH +WE QIE WQRPIL DKRLPEWYP+ LFNE
Sbjct: 356 TYHRRYTKFYGVDADAAAEQLAHDALLEHMNWESQIEEWQRPILNDKRLPEWYPVALFNE 415
Query: 443 LYYLNAGGAVWTDGSPPVHSLVT---IGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILER 499
LYYLNAGG +WTDG PP + G FS+D + + +AV+ + R
Sbjct: 416 LYYLNAGGTIWTDGQPPKKAGFASSLTGTEPFSID------TLLAEDGGTGGSAVDGVVR 469
Query: 500 MSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIML 559
+ +++ + ++FG LL+DGE+N+GQFLYLEG+EY M+NTYDVHFY+SFAL+ L
Sbjct: 470 AVASAKELSH--SAAAAFGAALLRDGEDNVGQFLYLEGMEYNMYNTYDVHFYASFALLSL 527
Query: 560 FPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLY 619
FP ++LS+QRDFA AV++HDP + + DG+ V RKVLG VPHD+G+ DPW E+N Y L+
Sbjct: 528 FPSLELSLQRDFARAVLLHDPRRRRTF-DGRTVPRKVLGTVPHDVGLNDPWVEMNEYMLH 586
Query: 620 DTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFP 679
D A WKDLNPKFVLQVYRD VATGD FA+A WP+VY+AMAYM QFDRDGDGM+EN+G P
Sbjct: 587 DPALWKDLNPKFVLQVYRDAVATGDAAFAEAAWPAVYIAMAYMHQFDRDGDGMVENEGIP 646
Query: 680 DQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWN 738
DQTYD WSVSG+SAY+GGLWVAAL+AA+ +AR VGDR +E YF + +A VY+ +LWN
Sbjct: 647 DQTYDIWSVSGVSAYTGGLWVAALEAAAGMARVVGDRHAEAYFRGRRDRAARVYDAELWN 706
Query: 739 GSYFNYDNSGSSQ-SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMG 797
G+YF YDNSG S SI ADQ+AGQWYARACG+ P+V+E KARSAL V +YNV++V G
Sbjct: 707 GTYFRYDNSGGGACSESIMADQMAGQWYARACGMEPVVEEGKARSALATVLDYNVMRVKG 766
Query: 798 GKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSG 857
G GAVNGM PDG VD SS QS+E+W GVTYAVAA+M+HE + + F+TA G ++A W
Sbjct: 767 GAVGAVNGMRPDGAVDASSGQSKEVWPGVTYAVAAAMVHEGMHEAAFRTAKGAHDAGWGK 826
Query: 858 TGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK---------PKTLEKQMKP 908
G GYAFQTPEAW D YRSL YMRPL IWAMQWAL+ P+ +++E++ P
Sbjct: 827 DGFGYAFQTPEAWTEDGGYRSLHYMRPLGIWAMQWALSPPELHKDLRAPGARSVEEESSP 886
Query: 909 EVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMFI 954
D +L + A F KVA +L+LPEE+ K L +++ R+M +
Sbjct: 887 --ADAALGQ--AQFEKVANMLRLPEEKQPKGYLWAIY-KVIRQMVL 927
>gi|357144429|ref|XP_003573289.1| PREDICTED: non-lysosomal glucosylceramidase-like isoform 2
[Brachypodium distachyon]
Length = 919
Score = 996 bits (2575), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 530/938 (56%), Positives = 666/938 (71%), Gaps = 45/938 (4%)
Query: 26 VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDP 85
+D A+PA+ TWQRK + F++ + + + P+ + L EE KG+ DP
Sbjct: 15 IDCAQPATRTWQRKFDDEGKKIEMFSMTMNDMILIVPMIFKGLVRNAEERGKGRDVIYDP 74
Query: 86 FIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ 145
F K G+PLGG+G+GSIGRSYRG FQ +QIFP + E KP+LANQFS F+SR NG+
Sbjct: 75 FKKWMDNCYRGLPLGGLGAGSIGRSYRGYFQHFQIFPAIYEQKPILANQFSAFISRPNGK 134
Query: 146 KYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCR 205
YS+VL T +VLK AGIGSWDW LK TYHAL+PR+WTV++GEPDP ++I CR
Sbjct: 135 SYSTVLSAPTADVLKGVDKAGIGSWDWKLKEKNCTYHALFPRSWTVYDGEPDPAIKITCR 194
Query: 206 QISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK-T 264
QISP IPHNYKESS+PV VFT+T++N+G T AD+TLLFTW NSVGG SE TG H NSK
Sbjct: 195 QISPFIPHNYKESSFPVVVFTFTVHNTGSTPADVTLLFTWANSVGGRSELTGNHTNSKMI 254
Query: 265 KMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG-LTAKDMWHE 323
NDG PPVTFA+A+QET+ V V+ CP F + +S G TAKDMW E
Sbjct: 255 TANDG-------------HPPVTFAIASQETEDVRVTDCPSFTMGSSSSGDFTAKDMWQE 301
Query: 324 IKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
IK+HGSF + + + S+PGSSIGAA+AA+ TVP V+F+L+W CPEV F +G+
Sbjct: 302 IKKHGSF--ITTDKEPRASKPGSSIGAAVAAATTVPAGGTRVVSFALSWSCPEVKFPAGR 359
Query: 384 TYYRRYTKFYGTHQNAAA-NIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
TY+RRYT F+G ++AAA ++A DA+LEH WE +IE WQRPIL+DKRLP+WYP+ LFNE
Sbjct: 360 TYHRRYTTFHGLDRDAAAESLAHDALLEHMDWESKIEEWQRPILQDKRLPDWYPVALFNE 419
Query: 443 LYYLNAGGAVWTDGSPPVHSLVT--IGHRKFSLDWSQSDLKRIVDVPNQNDTAVN-ILER 499
LYYLNAGG +WTDG PP S+ + +G FSL +P TAV+ IL
Sbjct: 420 LYYLNAGGTIWTDGLPPKMSVSSSGVGTEPFSL--------ATTYLPG---TAVDGILSA 468
Query: 500 MSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIML 559
++S E ++ A FG LL DGEEN+GQFLYLE EY M+NTYDVHFY+SFAL+ L
Sbjct: 469 VASATEDQHSAAA----FGATLLGDGEENVGQFLYLEATEYYMYNTYDVHFYASFALLSL 524
Query: 560 FPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLY 619
FP+++LS+QRDFA AV++HDP M L DG V RKVLGAVPHDIG+ DPWFE+NAY ++
Sbjct: 525 FPELELSLQRDFARAVLIHDPRPMVTL-DGATVLRKVLGAVPHDIGLNDPWFELNAYMIH 583
Query: 620 DTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFP 679
D ARWKDLNPKFVLQVYR VV TG+ FAKA WP+VY+AMAYMDQ+DRDGDGM+EN+G P
Sbjct: 584 DPARWKDLNPKFVLQVYRAVVVTGNAAFAKAAWPAVYLAMAYMDQYDRDGDGMVENEG-P 642
Query: 680 DQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWN 738
DQTYD WSV+G+SAY+GG+WV ALQA +A+AR VGD +E YF ++ +AK VY+ +LWN
Sbjct: 643 DQTYDLWSVTGVSAYTGGIWVTALQATAAMARIVGDGDAECYFHGRYLRAKQVYDAELWN 702
Query: 739 GSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGG 798
G+YF YDNSG S SI ADQLAGQWYA CGL P+V+EDKARSAL V +YNV++V GG
Sbjct: 703 GTYFRYDNSGGETSESIMADQLAGQWYAHVCGLEPVVEEDKARSALATVLDYNVMRVKGG 762
Query: 799 KRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGT 858
GAVNGM PDG +DMSS QS+EIWSG TYAVAA+M+HE + + F+TA G ++A+W
Sbjct: 763 AVGAVNGMRPDGGIDMSSTQSKEIWSGTTYAVAAAMVHEGMPEGAFRTAKGAHDASWGKA 822
Query: 859 GLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDE--SLL 916
G GYAFQTPEAW + YR L YMRPL++WAMQWAL+ P+ L K +
Sbjct: 823 GFGYAFQTPEAWTAEGGYRGLHYMRPLSVWAMQWALSPPE---LHKDLGLASVSPPGDAA 879
Query: 917 RYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMFI 954
R F KVA +L+LPE+ K +L+ H+ R++ +
Sbjct: 880 RGREKFDKVAGMLRLPEKVQHKGFFLALY-HSLRQLVL 916
>gi|326517703|dbj|BAK03770.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 821
Score = 965 bits (2495), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 490/838 (58%), Positives = 607/838 (72%), Gaps = 40/838 (4%)
Query: 137 VFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEP 196
FVS G+ YS+VL T ++LK AGIGSWDW LK + YH L+PR+WTV++GEP
Sbjct: 1 AFVSHPGGKSYSTVLSAPTADLLKGIDKAGIGSWDWKLKEENCNYHGLFPRSWTVYDGEP 60
Query: 197 DPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFT 256
D +++I CRQISP IPHNYKESS+PV+VFT+T+ NSG T AD+TLLFTW NSVGG SE T
Sbjct: 61 DTKIKITCRQISPFIPHNYKESSFPVTVFTFTVQNSGSTPADVTLLFTWANSVGGRSELT 120
Query: 257 GQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG-- 314
G H NS+ K DGVH VLL HRT+ PPVTFA+A++ET V V+ CP F + +S G
Sbjct: 121 GNHTNSRIKARDGVHGVLLRHRTADGNPPVTFAIASEETSDVRVTCCPTFAMGPSSPGAE 180
Query: 315 -LTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWD 373
TAKDMW E+K GSF + +S PGSSIGAA+A++ VP V+F+L+W
Sbjct: 181 QFTAKDMWDEVKNRGSFGGDAGGAPTASSRPGSSIGAAVASTTKVPAGDTRVVSFALSWS 240
Query: 374 CPEVNFMSGKTYYRRYTKFYGTHQNAAA-NIARDAILEHGSWELQIEAWQRPILEDKRLP 432
CPEV F SG+TY+RRYTKF G ++AAA + DA+LEH WE IE WQ PIL+D+RLP
Sbjct: 241 CPEVKFPSGRTYHRRYTKFLGLDRDAAAEQLVHDALLEHMKWESLIEEWQTPILQDRRLP 300
Query: 433 EWYPITLFNELYYLNAGGAVWTDGSPP-----VHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
EWYP+ LFNELYYLNAGG +WTDG PP S FSLD ++
Sbjct: 301 EWYPVALFNELYYLNAGGTIWTDGMPPKKTSFASSKYGSTAESFSLDGFRA--------- 351
Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
D AV+ + R + E+ P +S+FGT LL DGEEN+GQFLYLEG+EY MWNTYD
Sbjct: 352 --GDPAVDGILRAMATAEERLEP---SSAFGTALLGDGEENVGQFLYLEGMEYHMWNTYD 406
Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC 607
VHFY+SF+L+ LFP+I+LS+QRDFA AV++HDP M+ LD G V RKVLGAVPHDIG+
Sbjct: 407 VHFYASFSLLSLFPEIELSLQRDFARAVLLHDPRPMRTLD-GVDVPRKVLGAVPHDIGLA 465
Query: 608 DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDR 667
DPWFE+NAY ++D +RWKDLNPKFVLQVYRDV ATG+ FA A WP+VY+AMAYMDQFDR
Sbjct: 466 DPWFELNAYMIHDPSRWKDLNPKFVLQVYRDVAATGNVTFATAAWPAVYLAMAYMDQFDR 525
Query: 668 DGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQ 727
DGDGM+EN+G PDQTYD WSVSG+SAY+GGLW+AALQAA+A+AR VGDRG+E YFL +++
Sbjct: 526 DGDGMVENEGRPDQTYDLWSVSGVSAYTGGLWLAALQAAAAMARIVGDRGAEGYFLERYK 585
Query: 728 KAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKK 786
+A+ VY+ +LWNGSYF+YDNSG + S SI ADQLAGQWYARACGL PIV+E+KARSAL
Sbjct: 586 RAQRVYDGELWNGSYFDYDNSGCATSKSIMADQLAGQWYARACGLEPIVEEEKARSALGT 645
Query: 787 VYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQT 846
V +YNV++V GG GAVNGM PDG VD SS+QS+E+W GVTY VAA+M+HE + + F+T
Sbjct: 646 VLDYNVMRVQGGAVGAVNGMRPDGAVDASSLQSKEVWVGVTYGVAAAMLHEGMTEAAFRT 705
Query: 847 ACGIYEAAWSGTGLGYAFQTPEAWNTD--DQYRSLCYMRPLAIWAMQWALTRPK------ 898
A G ++A W G GYAFQTPEAW +D YRSL YMRPL+IWAMQWAL+ P+
Sbjct: 706 AKGAHDAGWGRDGFGYAFQTPEAWTSDAGGGYRSLHYMRPLSIWAMQWALSPPELHRDLR 765
Query: 899 --PKTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMFI 954
P ++ P D + + F KVA +LKLPEE K L+++ R+M +
Sbjct: 766 VVPGSVSAVASPAEVDLAREK----FEKVASMLKLPEEVQHKGYLRAIC-QVLRQMLL 818
>gi|302773445|ref|XP_002970140.1| hypothetical protein SELMODRAFT_92385 [Selaginella moellendorffii]
gi|300162651|gb|EFJ29264.1| hypothetical protein SELMODRAFT_92385 [Selaginella moellendorffii]
Length = 983
Score = 962 bits (2487), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 509/956 (53%), Positives = 667/956 (69%), Gaps = 36/956 (3%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLA-PIGVRILCLIREEAAKGKRAFIDP 85
DP P + W+RKL+ L++F + + E ++L +G+R+ I+ E ++G+ IDP
Sbjct: 27 DPGAPPAHAWRRKLNCHANFLTEFRVTFTEALKLVLGLGLRLWKFIQAERSQGRTPPIDP 86
Query: 86 FIK-RHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
F + ++ HGVPLGG+G GSIGR +RGEF+RWQ+ P V + VLANQFSVFV+R
Sbjct: 87 FKRGTKPSACHGVPLGGMGGGSIGRGFRGEFRRWQLLPSVTSEAAVLANQFSVFVTRDKD 146
Query: 145 ----QKYSSVLCPKTPE--VLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
+K +SVL P+ E V K + I +WDWNL G STYH L+PRAWT+++GEPDP
Sbjct: 147 HGEKKKIASVLSPRPSELDVGKKAQSPEICAWDWNLDGQNSTYHGLFPRAWTIYDGEPDP 206
Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
+L+I CRQ+SP IPH+Y+ESS PV VF+Y I N+GK SA ++LLFTW NSVGGDS +G
Sbjct: 207 DLKISCRQVSPFIPHDYRESSLPVCVFSYVIVNTGKESASVSLLFTWANSVGGDSHLSGG 266
Query: 259 HYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAK 318
H N +DGV V+LHHRT+ P V FA+AA++ V VS+CP F++SG +TAK
Sbjct: 267 HSNKPFLEDDGVAGVVLHHRTAKGQPSVEFAIAARQNADVAVSVCPSFLVSGKGSTMTAK 326
Query: 319 DMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVN 378
+MW++++E G FD E S+ S SS+GAA++ASVT+ P + + F+L+WD PEV
Sbjct: 327 EMWNQMREKGCFDTDTVCEPSLPSLVNSSLGAAVSASVTIAPHGKKVIDFALSWDIPEVK 386
Query: 379 FMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPIT 438
F G++Y+RRYT+FYG++ +AA + DA+L + WE +IE WQ P+L+D LPEWY T
Sbjct: 387 FPKGRSYHRRYTQFYGSYGDAAPRLVHDALLNYPKWEAEIEKWQEPVLKDANLPEWYRFT 446
Query: 439 LFNELYYLNAGGAVWTDGSPP---VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQ------ 489
LFNELYYL AGG VWT P + + + + S+ + +L ++ +
Sbjct: 447 LFNELYYLVAGGTVWTAFVTPSFRSNGFTSRTNSESSITGAVEELVEMIPLNGGTTGNGG 506
Query: 490 -----NDTAVNILERMSSILEQIYTPVALNSS-----FGTNLLQDGEENIGQFLYLEGIE 539
++T I E++ +E +NS+ +GT LL+DGEEN+GQFLYLE +E
Sbjct: 507 VDAILSETEEEIAEKLVKAMEAAILDKQINSAPRDFPYGTYLLEDGEENVGQFLYLESVE 566
Query: 540 YLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGA 599
Y+MW TYDVHFY+SFALI LFPK++LSIQRD+AAA + H+P +K L G+W +KVLGA
Sbjct: 567 YVMWCTYDVHFYASFALISLFPKLELSIQRDYAAATLSHNPELVKFLAGGKWGIKKVLGA 626
Query: 600 VPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAM 659
VPHD+GI DPW EVNAY ++DT+RWKDLN KFVLQ+YRDVVAT D FA+ WP+VY AM
Sbjct: 627 VPHDLGIHDPWVEVNAYNIHDTSRWKDLNSKFVLQIYRDVVATEDLNFARVTWPAVYTAM 686
Query: 660 AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSE 719
AYMD+FDRD DGMIENDGFPDQTYDTW+V GISAY GGLW+AALQAA+A+A +V + +
Sbjct: 687 AYMDRFDRDRDGMIENDGFPDQTYDTWTVHGISAYCGGLWLAALQAAAAMAEKVNEPNAA 746
Query: 720 DYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDED 778
+YFL KF+ A+ VY+ KLWNGSYFNYD+ SS S+SIQADQ+AGQWYA A GL P+ D+
Sbjct: 747 NYFLRKFRDARKVYDKKLWNGSYFNYDSGSSSNSNSIQADQMAGQWYAWASGLPPLFDDY 806
Query: 779 KARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHED 838
KARSAL+++Y+YNV+KV GGK GAVNGM P+GRVD ++MQSREIWSGVTYA AA+MIHE
Sbjct: 807 KARSALQRIYDYNVMKVKGGKLGAVNGMHPNGRVDETAMQSREIWSGVTYAAAAAMIHEG 866
Query: 839 LADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
+ + F TA GIY A WS +G GY FQTPEAW TD +R+L YMRPLAIWAMQWAL P
Sbjct: 867 MLEQAFTTAEGIYNAGWSDSGYGYWFQTPEAWTTDGHFRALTYMRPLAIWAMQWALYPPN 926
Query: 899 -----PKTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTC 949
P+ K+ + S Y AGF+KV+ LK Q + ++SLF C
Sbjct: 927 CVAAGPRIPSKEESAGGSTSSSYLY-AGFAKVSEALKNAPPQIEQ--IRSLFHRLC 979
>gi|302807226|ref|XP_002985326.1| hypothetical protein SELMODRAFT_121820 [Selaginella moellendorffii]
gi|300147154|gb|EFJ13820.1| hypothetical protein SELMODRAFT_121820 [Selaginella moellendorffii]
Length = 983
Score = 960 bits (2482), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 511/958 (53%), Positives = 663/958 (69%), Gaps = 36/958 (3%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLA-PIGVRILCLIREEAAKGKRAFIDP 85
DP P + W+RKL+ L++F + + E ++L +G+R+ I+ E ++G+ IDP
Sbjct: 27 DPGAPPAHAWRRKLNCHANFLTEFRVTFTEALKLVLGLGLRLWKFIQAERSQGRTPPIDP 86
Query: 86 FIK-RHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
F + ++ HGVPLGG+G GSIGR +RGEF+RWQ+ P V + VLANQFSVFV+R
Sbjct: 87 FKRGTKPSACHGVPLGGMGGGSIGRGFRGEFRRWQLLPSVTSEAAVLANQFSVFVTRDKD 146
Query: 145 ----QKYSSVLCPKTPE--VLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
+K +SVL P+ E V K + I +WDWNL G STYH L+PRAWT+++GEPDP
Sbjct: 147 HGEKKKIASVLSPRPSELDVGKKAQSPEICAWDWNLDGQNSTYHGLFPRAWTIYDGEPDP 206
Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
+L+I CRQ+SP IPH+Y+ESS PV VF+Y I N+GK SA ++LLFTW NSVGGD+ +G
Sbjct: 207 DLKISCRQVSPFIPHDYRESSLPVCVFSYVIVNTGKESASVSLLFTWANSVGGDTHLSGG 266
Query: 259 HYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAK 318
H N +DGV V+LHHRT+ P V FA+AA++ V VS+CP F++SG +TAK
Sbjct: 267 HSNKPFLEDDGVAGVVLHHRTAKGQPSVEFAIAARQNADVAVSVCPSFLVSGKGSTMTAK 326
Query: 319 DMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVN 378
+MW++++E G FD E S+ S SS+GAA++ASVT+ P + V F+L+WD PEV
Sbjct: 327 EMWNQMREKGCFDTDTVCEPSLPSLVNSSLGAAVSASVTIAPHGKKVVDFALSWDIPEVK 386
Query: 379 FMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPIT 438
F G++Y+RRYT+FYG++ +AA + DA+L + WE +IE WQ P+L+D LPEWY T
Sbjct: 387 FPKGRSYHRRYTQFYGSYGDAAPRLVHDALLNYPKWEAEIEKWQEPVLKDANLPEWYKFT 446
Query: 439 LFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVD-VPNQNDTAVN-- 495
LFNELYYL AGG VWT P R S ++ +V+ +P T+ N
Sbjct: 447 LFNELYYLVAGGTVWTAFVTPSFRSNGFTSRTNSESSITGAVEELVEMIPLNGGTSGNGG 506
Query: 496 -----------ILERMSSILEQIYTPVALNSS-----FGTNLLQDGEENIGQFLYLEGIE 539
I E++ +E +NS+ +GT LL+DGEEN+GQFLYLE +E
Sbjct: 507 VDAILSETEEEIAEKLVKAMEAAILDKQINSAPRDFPYGTYLLEDGEENVGQFLYLESVE 566
Query: 540 YLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGA 599
Y+MW TYDVHFY+SFALI LFPK++LSIQRD+AAA + H+P +K L G+W +KVLGA
Sbjct: 567 YVMWCTYDVHFYASFALISLFPKLELSIQRDYAAATLSHNPELVKFLAGGKWGIKKVLGA 626
Query: 600 VPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAM 659
VPHD+GI DPW EVNAY ++DT+RWKDLN KFVLQ+YRDVVAT D FA+ WP+VY AM
Sbjct: 627 VPHDLGIHDPWVEVNAYNIHDTSRWKDLNSKFVLQIYRDVVATEDLNFARVTWPAVYTAM 686
Query: 660 AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSE 719
AYMD+FDRD DGMIENDGFPDQTYDTW+V GISAY GGLW+AALQAA+A+A +V + +
Sbjct: 687 AYMDRFDRDRDGMIENDGFPDQTYDTWTVHGISAYCGGLWLAALQAAAAMAEKVNEPNAA 746
Query: 720 DYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDED 778
+YFL KF+ A+ VY+ KLWNGSYFNYD+ SS S+SIQADQ+AGQWYA A GL P+ D+
Sbjct: 747 NYFLRKFRDARKVYDKKLWNGSYFNYDSGSSSNSNSIQADQMAGQWYAWASGLPPLFDDY 806
Query: 779 KARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHED 838
KARSAL+++Y+YNV+KV GGK GAVNGM P+GRVD ++MQSREIWSGVTYA AA+MIHE
Sbjct: 807 KARSALQRIYDYNVMKVKGGKLGAVNGMHPNGRVDETAMQSREIWSGVTYAAAAAMIHEG 866
Query: 839 LADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
+ + F TA GIY A WS +G GY FQTPEAW TD +R+L YMRPLAIWAMQWAL P
Sbjct: 867 MLEQAFTTAEGIYNAGWSDSGYGYWFQTPEAWTTDGHFRALTYMRPLAIWAMQWALYPPN 926
Query: 899 -----PKTLEKQMKPEVTDESLLRYHAGFSKVARLLK--LPEEQGAKSLLQSLFDHTC 949
P+ K+ + S Y AGF+KV+ L P+ + +SL L C
Sbjct: 927 CVAAGPRIPSKEESAGGSTSSSYLY-AGFAKVSEALTNVPPQIEQIRSLFHRLCCCCC 983
>gi|326516802|dbj|BAJ96393.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 970
Score = 959 bits (2479), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 488/895 (54%), Positives = 630/895 (70%), Gaps = 24/895 (2%)
Query: 26 VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDP 85
+D P W+R+L++ L +F++ + E +++ +GVR+ +REEA+ G++A IDP
Sbjct: 30 LDGGSPPEQAWRRRLNSHANILKEFSVTFMEAMKMMTLGVRLWSYVREEASHGRKAPIDP 89
Query: 86 FIKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSN 143
F + ++S GVPLGG+GSGSI R +RGEF+ W I P +CE+ PV+ NQFS+FVSR +
Sbjct: 90 FTRERCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCENSPVMENQFSIFVSRDS 149
Query: 144 G-QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRI 202
G +KYSSVL P E LK +GI SWDWNL G STYHAL+PRAWT+++GEPDP+L+I
Sbjct: 150 GNKKYSSVLAPGHHEGLKKCNDSGISSWDWNLSGQHSTYHALFPRAWTIYDGEPDPDLKI 209
Query: 203 VCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNS 262
CRQISP IPH+YK+SS P SVF YT+ N+G+ A ++LL TW NS+GG S +G H+N
Sbjct: 210 SCRQISPFIPHDYKDSSLPTSVFVYTLVNTGRDRAKVSLLMTWANSIGGFSHHSGGHFNE 269
Query: 263 KTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWH 322
+DGV VLLHH+T+ PPVTF++AA ET V V++ P F +SG + ++AK+MW
Sbjct: 270 PFIGDDGVSGVLLHHKTAKDNPPVTFSIAACETQNVSVTVLPVFGLSGEN-HVSAKEMWD 328
Query: 323 EIKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMS 381
+ + G F R N + S+ S G ++ AA++AS V P V F+LAW P+V F
Sbjct: 329 TMSKDGHFSRENFNAGCSMPSSSGETLCAAVSASTWVEPHGRCTVAFALAWSSPKVKFQK 388
Query: 382 GKTYYRRYTKFYGTHQNAAA-NIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLF 440
G TY RRYT+FYGT + +++ N+ DA+ ++ WE +IE WQ PIL+D++LPEWY TLF
Sbjct: 389 GCTYNRRYTEFYGTSERSSSINLVHDALTKYRLWEEEIEKWQDPILKDEKLPEWYKFTLF 448
Query: 441 NELYYLNAGGAVWTDGSPPVHSLVTIGHR-KFSLDWSQSD-------------LKRIVDV 486
NELY+L AGG VWTDG PP S + ++ K+S ++S+ + D+
Sbjct: 449 NELYFLVAGGTVWTDGQPPAISEASPAYQHKYSKKGAKSESVKDNHVKPAAEQVSDGDDL 508
Query: 487 PNQNDTAVN--ILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWN 544
PN + +V+ + S + EQ + + L L +DG EN+G+FLYLEG+EY+MWN
Sbjct: 509 PNGEERSVSTYVAVHGSQMPEQT-SGLGLQEPIPYLLSKDGPENVGKFLYLEGVEYIMWN 567
Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
TYDVHFY+SFAL+ LFPKI+LSIQRDFA AV+ D ++K L DG RKV GAVPHD+
Sbjct: 568 TYDVHFYASFALLDLFPKIELSIQRDFADAVLYEDRRRVKFLADGTSGIRKVKGAVPHDL 627
Query: 605 GICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
G DPW E+NAY ++DT++WKDLNPKFVLQVYRD ATGD F + VWP+V AM YMDQ
Sbjct: 628 GTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMTFGRDVWPAVSAAMDYMDQ 687
Query: 665 FDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLF 724
FDRDGDG+IENDGFPDQTYD W+V GISAY GGLW+AALQAA+ +A +GDR + +
Sbjct: 688 FDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPYAEKYKL 747
Query: 725 KFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSA 783
KF KAK VYE KLWNGSYFNYD+ SS S SIQADQLAGQWYA + GL PI DE K RSA
Sbjct: 748 KFMKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPIFDEHKIRSA 807
Query: 784 LKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIG 843
L+K++ +NV+KV GG+ GAVNGM P G+VD + MQSREIW+GVTY VAA+M+ + G
Sbjct: 808 LQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQG 867
Query: 844 FQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
F TA GI+ A WS G GY FQTPE W TD YRSL YMRPLAIWAMQWAL+ PK
Sbjct: 868 FITAEGIFLAGWSEDGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAMQWALSPPK 922
>gi|218184729|gb|EEC67156.1| hypothetical protein OsI_34005 [Oryza sativa Indica Group]
Length = 974
Score = 957 bits (2474), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 490/898 (54%), Positives = 619/898 (68%), Gaps = 28/898 (3%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
D P W+R+L++ L +F++ + E +++ +G+R+ +REEA+ G++A IDPF
Sbjct: 31 DGGSPPEQAWRRRLNSHANLLKEFSVTFMEAMRMMSLGLRLWSYVREEASHGRKAPIDPF 90
Query: 87 IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
K ++S GVPLGG+GSGSI R +RGEF+ W I P +CE PV+ NQFS+FVSR G
Sbjct: 91 TKEKCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCETSPVMENQFSIFVSRDGG 150
Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
+KYSSVL P E LK +GI SWDWNL G STYHAL+PRAWTV++GEPDP+L+I
Sbjct: 151 NKKYSSVLSPGHHEGLKKCNDSGISSWDWNLSGQHSTYHALFPRAWTVYDGEPDPDLKIS 210
Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
CRQISP IPH+YK+SS P SVF YT+ N+GK A ++LL TW NS+GG S +G H+N
Sbjct: 211 CRQISPFIPHDYKDSSLPTSVFVYTLVNTGKDRAKVSLLMTWANSIGGFSHHSGGHFNEP 270
Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
DGV VLLHH+T+ PPVTFA+AA ET V+V++ P F +SG +AK MW
Sbjct: 271 FIAEDGVSGVLLHHKTAKDNPPVTFAIAACETQNVNVTVLPVFGLSGEGHD-SAKQMWDR 329
Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
+K++G FDR N TS+ S G ++ AA++AS V P V F LAW P++ F G
Sbjct: 330 MKQNGHFDRENFEAGTSMPSSSGETLCAAVSASTWVEPHGRCTVVFGLAWSSPKIKFQKG 389
Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
TY RRYT+FYGT + +A N+ DA+ ++ WE +IE WQ PIL+++RLPEWY TLFNE
Sbjct: 390 CTYNRRYTEFYGTSERSAVNLVHDALTKYRIWEEEIEKWQNPILKNERLPEWYKFTLFNE 449
Query: 443 LYYLNAGGAVWTDGSPPV-------HSLVTIGHRKFSLDWSQSDLK------RIVDVPNQ 489
LY+L AGG VWTDG PPV S ++ + D Q +K V N
Sbjct: 450 LYFLVAGGTVWTDGQPPVIDEKPSPGSNQQKSSKRGTRDTKQESVKDNHVKLTAEQVTNG 509
Query: 490 NDTAVNILERMSSILEQIYTPVALNSSFGTN--------LLQDGEENIGQFLYLEGIEYL 541
D A N E+ S ++ P ++ G L ++G EN+G+FLYLEG+EY+
Sbjct: 510 GDLA-NGEEQSVSKYAAVHGPQMAKATNGLGSQEPIPYLLSKNGPENVGKFLYLEGVEYI 568
Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVP 601
MW TYDVHFY+SFAL+ LFPKI+LSIQRDFA AV+ D +MK L DG RKV GAVP
Sbjct: 569 MWCTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRRMKFLADGTSGIRKVKGAVP 628
Query: 602 HDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAY 661
HD+G DPW E+NAY ++DT++WKDLNPKFVLQVYRD ATGD F + VWP+V AM Y
Sbjct: 629 HDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVWPAVCAAMDY 688
Query: 662 MDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDY 721
M+QFDRDGDG+IENDGFPDQTYD W+V GISAY GGLW+AALQAA+ +A +GDR +
Sbjct: 689 MNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPFAEK 748
Query: 722 FLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKA 780
+ KF +AK VYE KLWNGSYFNYD+ SS S SIQADQLAGQWYA + GL P+ DE+K
Sbjct: 749 YKLKFIQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPLFDENKI 808
Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
RSAL+K++ +NV+KV GG+ GAVNGM P+G+VD + MQSREIW+GVTY VAA+M+ +
Sbjct: 809 RSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGME 868
Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
GF TA GI+ A WS G GY FQTPE W D YRSL YMRPLAIWAMQWA + PK
Sbjct: 869 HQGFTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQWARSPPK 926
>gi|115482480|ref|NP_001064833.1| Os10g0473400 [Oryza sativa Japonica Group]
gi|110289241|gb|AAP54244.2| expressed protein [Oryza sativa Japonica Group]
gi|113639442|dbj|BAF26747.1| Os10g0473400 [Oryza sativa Japonica Group]
gi|215695411|dbj|BAG90602.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 974
Score = 954 bits (2466), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 488/898 (54%), Positives = 618/898 (68%), Gaps = 28/898 (3%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
D P W+R+L++ L +F++ + E +++ +G+R+ +REEA+ G++A IDPF
Sbjct: 31 DGGSPPEQAWRRRLNSHANLLKEFSVTFMEAMRMMSLGLRLWSYVREEASHGRKAPIDPF 90
Query: 87 IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
K ++S GVPLGG+GSGSI R +RGEF+ W I P +CE PV+ NQFS+FVSR G
Sbjct: 91 TKEKCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCETSPVMENQFSIFVSRDGG 150
Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
+KYSSVL P E LK +GI SWDWNL G STYHAL+PRAWTV++GEPDP+L+I
Sbjct: 151 NKKYSSVLSPGHHEGLKKCNDSGISSWDWNLSGQHSTYHALFPRAWTVYDGEPDPDLKIS 210
Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
CRQISP IPH+YK+SS P SVF YT+ N+GK A ++LL TW NS+GG S +G H+N
Sbjct: 211 CRQISPFIPHDYKDSSLPTSVFVYTLVNTGKDRAKVSLLMTWANSIGGFSHHSGGHFNEP 270
Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
DGV VLLHH+T+ PPVTFA+AA ET V+V++ P F +SG +AK MW
Sbjct: 271 FIAEDGVSGVLLHHKTAKDNPPVTFAIAACETQNVNVTVLPVFGLSGEGHD-SAKQMWDR 329
Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
+K++G FDR N TS+ S G ++ AA++AS V P V F LAW P++ F G
Sbjct: 330 MKQNGHFDRENFEAGTSMPSSSGETLCAAVSASTWVEPHGRCTVVFGLAWSSPKIKFQKG 389
Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
TY RRYT+FYGT + +A N+ DA+ ++ WE +IE WQ PIL++++LPEWY TLFNE
Sbjct: 390 CTYNRRYTEFYGTSERSAVNLVHDALTKYRIWEEEIEKWQNPILKNEKLPEWYKFTLFNE 449
Query: 443 LYYLNAGGAVWTDGSPPV-------HSLVTIGHRKFSLDWSQSDLK------RIVDVPNQ 489
LY+L AGG VWTDG PPV S ++ + D Q +K V N
Sbjct: 450 LYFLVAGGTVWTDGQPPVIDEKPSPGSNQQKSSKRGTRDTKQESVKDNHVKLTAEQVTNG 509
Query: 490 NDTAVNILERMSSILEQIYTPVALNSSFGTN--------LLQDGEENIGQFLYLEGIEYL 541
D A N E+ S ++ P ++ G L ++G EN+G+FLYLEG+EY+
Sbjct: 510 GDLA-NGEEQSVSKYAAVHGPQMAKATNGLGSQEPIPYLLSKNGPENVGKFLYLEGVEYI 568
Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVP 601
MW TYDVHFY+SFAL+ LFPKI+LSIQRDFA AV+ D +MK L DG RKV GAVP
Sbjct: 569 MWCTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRRMKFLADGTSGIRKVKGAVP 628
Query: 602 HDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAY 661
HD+G DPW E+NAY ++DT++WKDLNPKFVLQVYRD ATGD F + VWP+V M Y
Sbjct: 629 HDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVWPAVCAVMDY 688
Query: 662 MDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDY 721
M+QFDRDGDG+IENDGFPDQTYD W+V GISAY GGLW+AALQAA+ +A +GDR +
Sbjct: 689 MNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPFAEK 748
Query: 722 FLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKA 780
+ KF +AK VYE KLWNGSYFNYD+ SS S SIQADQLAGQWYA + GL P+ DE+K
Sbjct: 749 YKLKFIQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPLFDENKI 808
Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
RSAL+K++ +NV+KV GG+ GAVNGM P+G+VD + MQSREIW+GVTY VAA+M+ +
Sbjct: 809 RSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGME 868
Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
GF TA GI+ A WS G GY FQTPE W D YRSL YMRPLAIWAMQWA + PK
Sbjct: 869 HQGFTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQWARSPPK 926
>gi|168037390|ref|XP_001771187.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677567|gb|EDQ64036.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 981
Score = 954 bits (2465), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 514/943 (54%), Positives = 661/943 (70%), Gaps = 49/943 (5%)
Query: 2 FEGKILENGLHEEEKEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLA 61
F G+ L E +P + D DP P++L W+RKL++ L++F + + E V++
Sbjct: 30 FHGRRPSWPLAEYVHKPTLAMLDH-DPGAPSALAWRRKLNSHADFLTEFRVTFMEAVRMI 88
Query: 62 PIGVRILCLIREEAAKGKRAFIDPFIKRHLTSS-HGVPLGGVGSGSIGRSYRGEFQRWQI 120
+G+R+ ++ E A+G+ IDPF + S+ HGVP GG+G GSIGR +RG+F+RWQ+
Sbjct: 89 TLGMRMWFYVKSERAQGRAPPIDPFNRETKPSACHGVPCGGMGGGSIGRGFRGDFRRWQL 148
Query: 121 FPRVCEDKPVLANQFSVFVSRSNG----QKYSSVLCPKTPEVLKDTTA-AGIGSWDWNLK 175
P VCE+ PVLA+QFSVFV R G +K +SVL P P+ L DT + SWDWNL
Sbjct: 149 IPGVCEEAPVLADQFSVFVKRDKGNGVVKKDASVLYPGRPQELSDTKDDTSVSSWDWNLD 208
Query: 176 GDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKT 235
G STYHAL+PRAWT++EGEPDP+L+I CRQ+SP IPHNY+ESS+P VF+Y + N+GK
Sbjct: 209 GQNSTYHALFPRAWTIYEGEPDPDLKISCRQVSPFIPHNYEESSFPCCVFSYVVMNTGKQ 268
Query: 236 SADITLLFTWTNSVGGDSEFTGQHYNS--KTKMNDGVHAVLLHHRTSHQLPPVTFALAAQ 293
+AD+TLLFTW NS+GG+S TG H+N +T G PVTFA+AA+
Sbjct: 269 AADVTLLFTWANSIGGNSATTGGHFNEPFETGAKGGR--------------PVTFAIAAK 314
Query: 294 ETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLN-SMETSVTSEPGSSIGAAI 352
E V ++ CP F + G + +TAKDMW E+KE+GSFD+ N + + S + GS+IGAAI
Sbjct: 315 EMRDVTLTTCPCFSLGGKEVEVTAKDMWTEVKENGSFDKENWNFKPSAPTRQGSAIGAAI 374
Query: 353 AASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHG 412
AA V VPP + V FSLAWD PEV F+ GK+Y+RRYT FYGT +AA+ + RDA+ ++
Sbjct: 375 AARVEVPPGEKRDVVFSLAWDSPEVKFLKGKSYHRRYTSFYGTAGDAASKLTRDALRDYR 434
Query: 413 SWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFS 472
WE+ IE WQ+P+L+D++LPEWY +TLFNELYYL AGG +WT +
Sbjct: 435 KWEIAIEDWQQPVLKDEKLPEWYRVTLFNELYYLVAGGTIWT--------------AELE 480
Query: 473 LDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNS----SFGTNLLQDGEEN 528
L SQ + +R+ + LE + IL+Q+ T + +S + G +LLQ+GEEN
Sbjct: 481 LAASQ-ETERLQSGETADQMQTETLEHL--ILKQMQTAMLGSSEAGIATGPSLLQEGEEN 537
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
+GQFLYLEGIEY+MWNTYDVHFY+SFAL+ LFPK++L+IQRD AAA + +P K+K + D
Sbjct: 538 VGQFLYLEGIEYIMWNTYDVHFYASFALLALFPKLELAIQRDVAAATLSQNPEKVKYMAD 597
Query: 589 GQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFA 648
G RKV GAVPHD+G DPW EVNAY ++DT+ WKDLNPKFVLQVYRDVVATGD++FA
Sbjct: 598 GGVGIRKVFGAVPHDLGQHDPWVEVNAYNIHDTSHWKDLNPKFVLQVYRDVVATGDRQFA 657
Query: 649 KAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASA 708
KAVWP+VY AMAYMDQFDRD DG+IENDGFPDQTYDTWSV G+SAY GGLW+AALQAA+A
Sbjct: 658 KAVWPAVYAAMAYMDQFDRDRDGLIENDGFPDQTYDTWSVHGVSAYCGGLWIAALQAAAA 717
Query: 709 LAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYAR 767
+A V D+ + YF KF +A+ VYE KLWNG YFNYD+ SS S+S+QADQLAGQWYA
Sbjct: 718 MADLVDDKDAATYFKGKFNQARDVYERKLWNGEYFNYDSGTSSNSNSVQADQLAGQWYAW 777
Query: 768 ACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVT 827
A GL P+ D+ KARSAL+KV+++NV+KV GGK GA NGM P+G+VD + MQSREIW+GVT
Sbjct: 778 ASGLPPLFDDYKARSALQKVFDFNVMKVKGGKWGAANGMHPNGKVDETCMQSREIWTGVT 837
Query: 828 YAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAI 887
YA +A+MIHE + + F A G++ A WS LGY FQTPEAW D +RSL YMRPLAI
Sbjct: 838 YAASAAMIHEGMVEQAFTAAQGVFLAGWS--DLGYWFQTPEAWTIDGYFRSLAYMRPLAI 895
Query: 888 WAMQWALTRPKPKTLEKQMKPEVTDESLLRYHAGFSKVARLLK 930
WAMQWAL PK E P + S L H FS +A L+
Sbjct: 896 WAMQWALYPPK-SIAEAPRVPSMDRGSALVTHDRFSSLANALR 937
>gi|357146604|ref|XP_003574050.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
distachyon]
Length = 962
Score = 952 bits (2460), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 489/894 (54%), Positives = 623/894 (69%), Gaps = 30/894 (3%)
Query: 26 VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDP 85
+D A P W+R+L++ L +F++ + E +++ +GVR+ +REEA+ G++A IDP
Sbjct: 30 LDGAAPPEQAWRRRLNSHANILKEFSVTFMEAMKMMSLGVRLWSYVREEASHGRKAPIDP 89
Query: 86 FIKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSN 143
F + ++S GVPLGG+GSGSI R +RGEF+ W I P +CE+ PV+ NQFS+FVSR
Sbjct: 90 FTRESCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCENSPVMENQFSIFVSRDG 149
Query: 144 G-QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRI 202
G +K SSVL P + LK + +GI SWDWNL G STYHAL+PRAWTV++GEPDP+L+I
Sbjct: 150 GNKKCSSVLAPGHHDGLKKYSDSGISSWDWNLSGQHSTYHALFPRAWTVYDGEPDPDLKI 209
Query: 203 VCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNS 262
CRQISP IPH+YKESS P SVF YT+ N+G+ A ++LL TW NS+GG S +G H+N
Sbjct: 210 SCRQISPFIPHDYKESSLPTSVFVYTLVNTGRDRAKVSLLMTWANSIGGFSHHSGGHFNE 269
Query: 263 KTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWH 322
DGV VLLHH+T+ PPVTFA+AA ET V+V++ P F +SG + ++AKDMW
Sbjct: 270 PFIGEDGVSGVLLHHKTAKDNPPVTFAIAACETQNVNVTVLPVFGLSGEN-HVSAKDMWD 328
Query: 323 EIKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMS 381
+K+ G F+ N + S+ S PG ++ AA+ AS V P V F+L+W P+V F
Sbjct: 329 IMKKDGHFNLENFNAGCSMPSSPGETLCAAVTASTWVEPHGRCTVAFALSWSSPKVKFQK 388
Query: 382 GKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFN 441
G TY RRYT+FYGT + ++ N+ DA+ ++ WE +IE WQ PIL D+RLPEWY TLFN
Sbjct: 389 GCTYNRRYTEFYGTSERSSINLVHDALTKYRLWEEEIEKWQNPILRDERLPEWYKFTLFN 448
Query: 442 ELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMS 501
ELY+L AGG VWTDG PP I + Q K+ + ++++ + L R +
Sbjct: 449 ELYFLVAGGTVWTDGQPPA-----IDEKTNPASNQQKHSKKPIK-DTKSESVKDNLPRPT 502
Query: 502 SILEQIYTPVALNSSF-----GTNLL-----------QDGEENIGQFLYLEGIEYLMWNT 545
+ EQ++ L + TN L +DG EN+G+FLYLEG+EY+MWNT
Sbjct: 503 A--EQVFNGDDLTNGGPQMPEQTNGLRVQEPVPCIHSKDGPENVGKFLYLEGVEYIMWNT 560
Query: 546 YDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG 605
YDVHFY+SFAL+ LFPKI+LSIQRDFA AV+ D ++K L DG RKV GAVPHD+G
Sbjct: 561 YDVHFYASFALLDLFPKIELSIQRDFADAVLYEDRRRVKFLADGTSGIRKVKGAVPHDLG 620
Query: 606 ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF 665
DPW E+NAY ++DT++WKDLNPKFVLQVYRD ATGD F + VWP+V AM YMDQF
Sbjct: 621 THDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMTFGRDVWPAVCAAMDYMDQF 680
Query: 666 DRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFK 725
DRDGDG+IENDGFPDQTYD W+V GISAY G LW+AALQAA+ +A +GDR + + K
Sbjct: 681 DRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRPYAEKYKLK 740
Query: 726 FQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSAL 784
F KAK VYE KLWNGSYFNYD+ SS S SIQADQLAGQWYA + GL PI DE K RSAL
Sbjct: 741 FIKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPIFDEHKIRSAL 800
Query: 785 KKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
+K++ +NV+KV GG+ GAVNGM P G+VD + MQSREIW+GVTY VAA+M+ + GF
Sbjct: 801 QKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQGF 860
Query: 845 QTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
TA GI+ A WS G GY FQTPE W TD YRSL YMRPLAIWAMQWAL+ PK
Sbjct: 861 ITAEGIFLAGWSEEGYGYWFQTPEGWTTDGHYRSLIYMRPLAIWAMQWALSPPK 914
>gi|357139475|ref|XP_003571307.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
distachyon]
Length = 946
Score = 949 bits (2453), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 510/919 (55%), Positives = 640/919 (69%), Gaps = 26/919 (2%)
Query: 26 VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDP 85
+D A+P TWQRK ++ F+ + + +AP+ V+++ ++ EE AKGK + DP
Sbjct: 15 IDCAQPPERTWQRKFDDEGKEIAMFSFTMNDIMTVAPLIVKMMRILVEERAKGKASVYDP 74
Query: 86 FIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ 145
F K GVPLG +G+GSIGRSYRG FQ +Q+FP + E KP+LANQFS FVSR G+
Sbjct: 75 FKKWMSNCYRGVPLGTLGAGSIGRSYRGYFQHFQLFPGIYEQKPILANQFSAFVSRPGGK 134
Query: 146 KYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCR 205
YS+VL EVLK A IGSWDWNLK +YH L+PR+WTV++GEPDPE++I CR
Sbjct: 135 SYSTVLSAPPAEVLKGIDKASIGSWDWNLKEKNCSYHGLFPRSWTVYDGEPDPEIKITCR 194
Query: 206 QISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTK 265
QISP IP NYKESS+PV+VFT+T+ NSG T AD+TLLFTW NSVGG SE TG H NS +
Sbjct: 195 QISPFIPRNYKESSFPVAVFTFTVQNSGSTPADVTLLFTWANSVGGKSELTGNHSNSTMR 254
Query: 266 MNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI--SGNSLGLTAKDMWHE 323
DGVH VLL H T+ PPVTFA+A+Q+T V V+ CP F + S S TAK+MW E
Sbjct: 255 ARDGVHGVLLRHSTAEGHPPVTFAIASQDTGDVRVTCCPSFSMGPSSKSGEPTAKEMWEE 314
Query: 324 IKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
IK+HGSF + T S PGSS+GAA+AA+ TV +V+F+L+W CP V F +G+
Sbjct: 315 IKKHGSFGDA-AAGTGRPSRPGSSVGAAVAAATTVTAGCTREVSFALSWSCPVVKFPAGR 373
Query: 384 TYYRRYTKFYGTHQNAAA-NIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
TY+RR+TKF G ++AAA +A A+LEH WE QIEAWQRP+L+DK LP+WYP+ LFNE
Sbjct: 374 TYHRRHTKFVGLDRDAAAEQLAHHALLEHMEWERQIEAWQRPVLQDKSLPDWYPVALFNE 433
Query: 443 LYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSS 502
LYYLNAGG +WTDG P + + +S + + + + A +IL M+
Sbjct: 434 LYYLNAGGTIWTDGMPSKKTSLVSSSSGTMEPFSLAAFHPDPNATSSSTAADDILLAMAR 493
Query: 503 ILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPK 562
E + P A + E+ +G+FLYLEG+EY M+NTYDVHFY+SFAL+ LFP
Sbjct: 494 AEEHL-LPAA-----------EDEKGVGKFLYLEGMEYHMYNTYDVHFYASFALLSLFPD 541
Query: 563 IQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTA 622
++LS+QRDFAAAV+ HDP M L DG++V RKVLGAVPHDIG+ DPW E+NAY ++D +
Sbjct: 542 LELSLQRDFAAAVLRHDPRLMYTL-DGKFVPRKVLGAVPHDIGLNDPWHELNAYMIHDPS 600
Query: 623 RWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQT 682
RWKDLNPKFVLQVYRDV ATGD FA++ WPSVYVA+AYMDQFDRD DGM+EN+G PDQT
Sbjct: 601 RWKDLNPKFVLQVYRDVAATGDLDFARSAWPSVYVALAYMDQFDRDRDGMVENEGRPDQT 660
Query: 683 YDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSY 741
YD WSVSG+SAY+GGLWVAAL+AA+A+A VGD +E FL ++ +A VY+ +LW G Y
Sbjct: 661 YDLWSVSGVSAYTGGLWVAALRAATAMAALVGDLPAEAVFLERYNRANKVYDSELWTGDY 720
Query: 742 FNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRG 801
F YDNSG S S+ ADQLAGQWYARACGL P+V DKAR AL V +NV++V GG G
Sbjct: 721 FRYDNSGGGNSESVMADQLAGQWYARACGLEPVVGRDKARRALAAVLEHNVMQVQGGGVG 780
Query: 802 AVNGM-LPD--GRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGT 858
AVNG LP+ G VD SS QS+E+W+G TYAVAA+MI E + + GF A G Y A W
Sbjct: 781 AVNGARLPEHGGGVDESSTQSKEVWTGTTYAVAAAMIGEGMREEGFTAAKGAYGAGWGED 840
Query: 859 GLGYAFQTPEAWNTDDQ--YRSLCYMRPLAIWAMQWALTRPKPKTLEKQ---MKPEVTDE 913
G GYAFQ PE+W D YRSL YMRPLA+WAMQWAL+ P P E + K + T+E
Sbjct: 841 GYGYAFQMPESWTADGAGGYRSLHYMRPLAVWAMQWALSPPTPVLPELERMAAKSKATEE 900
Query: 914 SLLRYHAGFSKVARLLKLP 932
+ F KVA LL+LP
Sbjct: 901 EVELAKEKFEKVASLLRLP 919
>gi|16905165|gb|AAL31035.1|AC078948_19 unknown protein [Oryza sativa Japonica Group]
Length = 967
Score = 937 bits (2421), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 483/898 (53%), Positives = 612/898 (68%), Gaps = 35/898 (3%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
D P W+R+L++ L +F++ + E +++ +G+R+ +REEA+ G++A IDPF
Sbjct: 31 DGGSPPEQAWRRRLNSHANLLKEFSVTFMEAMRMMSLGLRLWSYVREEASHGRKAPIDPF 90
Query: 87 IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
K ++S GVPLGG+GSGSI R +RGEF+ W I P +CE P+ FVSR G
Sbjct: 91 TKEKCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCETSPI-------FVSRDGG 143
Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
+KYSSVL P E LK +GI SWDWNL G STYHAL+PRAWTV++GEPDP+L+I
Sbjct: 144 NKKYSSVLSPGHHEGLKKCNDSGISSWDWNLSGQHSTYHALFPRAWTVYDGEPDPDLKIS 203
Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
CRQISP IPH+YK+SS P SVF YT+ N+GK A ++LL TW NS+GG S +G H+N
Sbjct: 204 CRQISPFIPHDYKDSSLPTSVFVYTLVNTGKDRAKVSLLMTWANSIGGFSHHSGGHFNEP 263
Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
DGV VLLHH+T+ PPVTFA+AA ET V+V++ P F +SG +AK MW
Sbjct: 264 FIAEDGVSGVLLHHKTAKDNPPVTFAIAACETQNVNVTVLPVFGLSGEGHD-SAKQMWDR 322
Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
+K++G FDR N TS+ S G ++ AA++AS V P V F LAW P++ F G
Sbjct: 323 MKQNGHFDRENFEAGTSMPSSSGETLCAAVSASTWVEPHGRCTVVFGLAWSSPKIKFQKG 382
Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
TY RRYT+FYGT + +A N+ DA+ ++ WE +IE WQ PIL++++LPEWY TLFNE
Sbjct: 383 CTYNRRYTEFYGTSERSAVNLVHDALTKYRIWEEEIEKWQNPILKNEKLPEWYKFTLFNE 442
Query: 443 LYYLNAGGAVWTDGSPPV-------HSLVTIGHRKFSLDWSQSDLK------RIVDVPNQ 489
LY+L AGG VWTDG PPV S ++ + D Q +K V N
Sbjct: 443 LYFLVAGGTVWTDGQPPVIDEKPSPGSNQQKSSKRGTRDTKQESVKDNHVKLTAEQVTNG 502
Query: 490 NDTAVNILERMSSILEQIYTPVALNSSFGTN--------LLQDGEENIGQFLYLEGIEYL 541
D A N E+ S ++ P ++ G L ++G EN+G+FLYLEG+EY+
Sbjct: 503 GDLA-NGEEQSVSKYAAVHGPQMAKATNGLGSQEPIPYLLSKNGPENVGKFLYLEGVEYI 561
Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVP 601
MW TYDVHFY+SFAL+ LFPKI+LSIQRDFA AV+ D +MK L DG RKV GAVP
Sbjct: 562 MWCTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRRMKFLADGTSGIRKVKGAVP 621
Query: 602 HDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAY 661
HD+G DPW E+NAY ++DT++WKDLNPKFVLQVYRD ATGD F + VWP+V M Y
Sbjct: 622 HDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVWPAVCAVMDY 681
Query: 662 MDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDY 721
M+QFDRDGDG+IENDGFPDQTYD W+V GISAY GGLW+AALQAA+ +A +GDR +
Sbjct: 682 MNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPFAEK 741
Query: 722 FLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKA 780
+ KF +AK VYE KLWNGSYFNYD+ SS S SIQADQLAGQWYA + GL P+ DE+K
Sbjct: 742 YKLKFIQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPLFDENKI 801
Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
RSAL+K++ +NV+KV GG+ GAVNGM P+G+VD + MQSREIW+GVTY VAA+M+ +
Sbjct: 802 RSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGME 861
Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
GF TA GI+ A WS G GY FQTPE W D YRSL YMRPLAIWAMQWA + PK
Sbjct: 862 HQGFTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQWARSPPK 919
>gi|357116998|ref|XP_003560263.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
distachyon]
Length = 932
Score = 936 bits (2418), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 474/876 (54%), Positives = 611/876 (69%), Gaps = 31/876 (3%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
D P W+RKLS+ L +F + ++E ++ +G+R+ IREEA+ G++A IDPF
Sbjct: 37 DDGSPPEHAWRRKLSSHANRLKEFNVTFREAFKMMKLGLRLWSYIREEASYGRKAPIDPF 96
Query: 87 IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
+ ++S GVPLGG+G+GSI R +RGEF+ WQI P CE PV+ NQFS+F+ R +
Sbjct: 97 TRESTKPSASQGVPLGGMGTGSISRGFRGEFKHWQITPGSCEMLPVMENQFSIFIIRGS- 155
Query: 145 QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVC 204
+KYSSVL P + LK ++ GI SWDW L+GD+STYHAL+PRAWTV++GEPDPEL++ C
Sbjct: 156 KKYSSVLAPGQHDGLKKSSNDGISSWDWKLRGDRSTYHALFPRAWTVYDGEPDPELKVSC 215
Query: 205 RQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKT 264
RQISP IPHNY+ESS P SVF YT+ N+GK A ++L+ TW NS+GG S+ +G H N
Sbjct: 216 RQISPFIPHNYQESSLPTSVFVYTLVNTGKERAKVSLVMTWANSIGGLSDHSGGHVNEPF 275
Query: 265 KMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEI 324
+GV VLLHH+T++ PPVTFA+AA E V V++ P F +SG S +TA++MW I
Sbjct: 276 IGENGVSGVLLHHKTANNNPPVTFAVAACENQNVDVTVLPVFGLSGES-SVTAREMWGTI 334
Query: 325 KEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
+ G FDR N S+ S G ++ AA+ AS V V F+LAW PEV F G
Sbjct: 335 VQDGCFDRDNFKAGPSMPSSLGETVCAAVCASTWVEAHGRCTVVFALAWSSPEVKFKKGS 394
Query: 384 TYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNEL 443
TY+RRYTKFYGT +A N+ +DA++ + WE +I+ WQ PIL D+RLPEWY ITLFNEL
Sbjct: 395 TYHRRYTKFYGTSPRSAINLVQDALMSYKHWEEEIDKWQTPILRDERLPEWYKITLFNEL 454
Query: 444 YYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSI 503
Y+L AGG VW D SL+ +K + S L D+P + + +
Sbjct: 455 YFLVAGGTVWIDS----ESLMVDADKKVN-----SSLPEDSDLPLHDSNCNSTVP----- 500
Query: 504 LEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKI 563
+ P+ L+ D +EN+G+FLYLEG+EY MW TYDVHFY+SFAL+ LFPKI
Sbjct: 501 ---LIDPMPLDF--------DDKENVGKFLYLEGVEYFMWCTYDVHFYASFALLNLFPKI 549
Query: 564 QLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTAR 623
+LSIQRDFA AV+ D S+++ L DG W +RKV+GAVPHD+G DPW E+NAY ++DT+R
Sbjct: 550 ELSIQRDFARAVLREDRSRVRFLADGTWGTRKVIGAVPHDLGAHDPWHELNAYNIHDTSR 609
Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTY 683
WKDLNPKFVLQVYRD ATGD F K VWP+V AM YM+QFDRDGDGMIENDGFPDQTY
Sbjct: 610 WKDLNPKFVLQVYRDFAATGDMSFGKDVWPAVCTAMEYMEQFDRDGDGMIENDGFPDQTY 669
Query: 684 DTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYF 742
D W+V G+SAY G LW+A+LQAA+A+AR +G + + KF KAK V+E KLWNGSYF
Sbjct: 670 DAWTVLGVSAYCGCLWLASLQAAAAMARGLGHNDYAERCMVKFAKAKHVFETKLWNGSYF 729
Query: 743 NYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGA 802
NYD+ S S SIQADQLAGQWYA + GL P+ DE + + L+K+++YNV++V GG+ GA
Sbjct: 730 NYDSGASYSSRSIQADQLAGQWYAASSGLPPLFDEGRIKCTLQKIFDYNVMRVKGGRMGA 789
Query: 803 VNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
VNGM P+G+VD + MQSREIW+GVTY++AA+M+ + D F TA GI+ A WS G GY
Sbjct: 790 VNGMYPNGKVDETCMQSREIWTGVTYSLAATMLLHGMEDQAFTTAEGIFLAGWSEEGYGY 849
Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
FQTPEAW D YRSL YMRPLAIWAMQ+AL+ PK
Sbjct: 850 WFQTPEAWTIDGHYRSLIYMRPLAIWAMQYALSPPK 885
>gi|225433442|ref|XP_002285674.1| PREDICTED: non-lysosomal glucosylceramidase-like [Vitis vinifera]
Length = 978
Score = 929 bits (2402), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 487/959 (50%), Positives = 636/959 (66%), Gaps = 47/959 (4%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
D A P W+R+L++ L +F++ + E +++ +G+R+ IREEA++G++A IDPF
Sbjct: 31 DSAAPPEQAWRRRLNSHANILKEFSVTFTEAIKMIRLGIRLWSYIREEASQGRKAPIDPF 90
Query: 87 IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
+ ++S GVPLGG+GSGSI R +RGEF+ WQI P C+ P++ANQFS+F+SR G
Sbjct: 91 TRETCKPSASQGVPLGGMGSGSISRGFRGEFRHWQIVPGTCDASPIMANQFSIFISREGG 150
Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
+KY+SVL P E L + GI SW WNL G STYHAL+PRAWT+++GEPDPEL++
Sbjct: 151 NKKYASVLAPGQHEGLGKSGDQGISSWGWNLSGQHSTYHALFPRAWTIYDGEPDPELKVS 210
Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
CRQISP IPHNY++SS P +VF YT+ N+GK A ++LLFTW NS+GG S +G H N
Sbjct: 211 CRQISPFIPHNYRDSSLPTAVFVYTLVNTGKERAKVSLLFTWANSIGGISHLSGDHVNEP 270
Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
DGV VLLHH+T+ + PPVTFA+AA ET V V++ P F +S S +TAKDMW +
Sbjct: 271 FIGEDGVSGVLLHHKTAKENPPVTFAIAACETQNVSVTVLPSFGLSEGS-HITAKDMWGK 329
Query: 324 IKEHGSFDRLNSME-TSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
+ + G FDR N S+ S PG ++ AA++AS V P + V F+LAW P+V F+ G
Sbjct: 330 MVQDGQFDRENCYSGRSMPSSPGETLCAAVSASAWVEPHGKCTVAFALAWSSPKVKFLKG 389
Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
+Y+RRYTK+YGT + AA NI DA+ + WE +IE WQ PIL D RLPEWY TLFNE
Sbjct: 390 SSYHRRYTKYYGTSERAALNIVHDALTNYKQWEEEIEKWQSPILRDDRLPEWYKFTLFNE 449
Query: 443 LYYLNAGGAVWTDGSPP-------VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVN 495
LY+L AGG VW D S P +H + + ++ ++ + +R V N +
Sbjct: 450 LYFLVAGGTVWIDSSLPATSSKNSLHQSAAVENTNVNVTVAKGNSRRGAAVENSVTDGYD 509
Query: 496 ILER--MSSILEQIYT----------PVALNS--SFGTNLL---QDGEENIGQFLYLEGI 538
+ R + E+I+T P NS S + L QD +++G+FLYLEG+
Sbjct: 510 AISRKGLEYDEEEIHTRNTCEEKPVIPQESNSHHSIHKDTLKDPQDETDDVGRFLYLEGV 569
Query: 539 EYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLG 598
EY+MW TYDVHFY+SFAL+ LFPKI+LSIQR+FA AV+ D ++K L +G W RKV G
Sbjct: 570 EYIMWCTYDVHFYASFALLELFPKIELSIQREFAKAVLSEDGRRVKFLAEGNWGIRKVRG 629
Query: 599 AVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVA 658
AVPHD+G DPW E+NAY ++DT++WKDLNPKFVLQVYRD AT D F VWP+V A
Sbjct: 630 AVPHDLGTHDPWHEMNAYNIHDTSQWKDLNPKFVLQVYRDFAATRDFSFGADVWPAVRAA 689
Query: 659 MAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGS 718
M YM+QFDRD DG+IENDGFPDQTYDTW+V GISAY G LW+AALQAA+A+A ++GD+
Sbjct: 690 MEYMEQFDRDSDGLIENDGFPDQTYDTWTVHGISAYCGCLWLAALQAAAAMALQLGDKPF 749
Query: 719 EDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDE 777
+ KF KAK+V+ EKLWNGSYFNYD+ SS S SIQADQLAGQWY + GL + D+
Sbjct: 750 AEKCKSKFFKAKLVFEEKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASSGLPSLFDD 809
Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
K +S+L K+Y++NV+KV GGK GAVNGM P+G+VD S MQSREIW+GVTY VAA+MI
Sbjct: 810 YKIKSSLHKIYDFNVMKVKGGKMGAVNGMHPNGKVDESCMQSREIWTGVTYGVAATMILS 869
Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
+ + F TA GI+ A WS G GY FQTPE W D +RSL YMRPLAIW MQWAL+ P
Sbjct: 870 GMEEQAFTTAEGIFTAGWSEEGYGYWFQTPEGWTIDGHFRSLIYMRPLAIWGMQWALSMP 929
Query: 898 KPKTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQG-------AKSLLQSLFDHTC 949
+ + D + + +LP E G AK S+F +C
Sbjct: 930 RA----------ILDAPTINFMERIHVSPHNARLPHETGVRKIATKAKCFGNSVFHCSC 978
>gi|168014916|ref|XP_001759997.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688747|gb|EDQ75122.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 970
Score = 926 bits (2392), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 496/930 (53%), Positives = 626/930 (67%), Gaps = 49/930 (5%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
DP P W+RKL++ L +F + + E ++ +G+R+ ++ E ++G+ IDPF
Sbjct: 46 DPGAPPHQAWRRKLNSHAEVLKEFRVTFMEGFKMLRLGMRMWTYVQSEKSEGRLPPIDPF 105
Query: 87 IKRHLTSS-HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ 145
+ S+ HGVP GG+G GSIGR +RGEF+RWQI P VC++ PVLANQFSV VS S G+
Sbjct: 106 RRETKPSACHGVPCGGMGGGSIGRGFRGEFRRWQIVPGVCDEAPVLANQFSVHVSLSGGE 165
Query: 146 K-YSSVLCPKTPEVLK--DTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRI 202
K SSVL P P LK GI SWDW L G +TYHAL+PRAWT++EGEPDP L+I
Sbjct: 166 KAVSSVLYPGRPRELKGLKNDENGISSWDWKLDGQNTTYHALFPRAWTIYEGEPDPALKI 225
Query: 203 VCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNS 262
CRQISP IPHNYKESSYP VF+Y N+GK AD+ LLFTW NS+G + TG H N
Sbjct: 226 SCRQISPFIPHNYKESSYPACVFSYVAVNTGKEVADVNLLFTWANSIGSTTGSTGGHINE 285
Query: 263 K--TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDM 320
T + G P+TFA+AA+++ V +S CP F +SG +TAKD+
Sbjct: 286 TFTTSVKGGQ--------------PLTFAIAAKQSPDVSLSACPCFQVSGEGQSITAKDL 331
Query: 321 WHEIKEHGSF--DRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVN 378
W +KEHG D N++ TS+ S+PGS IGAAIAA V +PP+ + + F LAWD P V
Sbjct: 332 WGRMKEHGRLEVDDWNTINTSLPSKPGSVIGAAIAAHVVIPPNEKRTIDFVLAWDSPTVK 391
Query: 379 FMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPIT 438
F+ G++Y RRYT FY NAA IA +A+ + WE +IE WQRPILED++LPEWY +T
Sbjct: 392 FLKGRSYLRRYTTFYDADGNAAPRIAHNALCGYRLWEEEIEKWQRPILEDEKLPEWYRVT 451
Query: 439 LFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILE 498
LFNELYYL AGG VWT S + + H SL ++V + D + L
Sbjct: 452 LFNELYYLVAGGTVWTASSANTSAPGSPLHGDLSLG------GKVVPFSTRRDIDLAALR 505
Query: 499 RMSSILEQIYTPVALN-SSF------GTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFY 551
R + + N SSF G LLQ EEN+GQFLYLEGIEY+MWNTYDVHFY
Sbjct: 506 RDCKDGDVVAALEGSNKSSFNSRLPSGLALLQAEEENVGQFLYLEGIEYIMWNTYDVHFY 565
Query: 552 SSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWF 611
+SFAL+ LFPK++L+IQRDFAA + HDP + K + G+ +KV G+VPHD+G DPW
Sbjct: 566 ASFALLSLFPKLELAIQRDFAAGCLAHDPRRTKFMAGGKKGIKKVFGSVPHDLGQHDPWA 625
Query: 612 EVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDG 671
EVNAY ++DT+RWKDLN KFVLQVYRDVVATGDK FAKAVWP+VY AMAY+DQFDRD DG
Sbjct: 626 EVNAYNIHDTSRWKDLNAKFVLQVYRDVVATGDKVFAKAVWPAVYAAMAYLDQFDRDRDG 685
Query: 672 MIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKV 731
+IENDGFPDQTYDTW+V G+SAY GGLW+AALQAA A+A + ++ + YF KF +A+
Sbjct: 686 LIENDGFPDQTYDTWTVHGVSAYCGGLWIAALQAAEAMAVLLEEKNAARYFQSKFIQARE 745
Query: 732 VY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNY 790
Y +KLWNGSYF+YD+ SS S SIQADQ+AGQWYA A GL P+ D+ KARSAL+K++++
Sbjct: 746 AYLKKLWNGSYFSYDSGNSSNSKSIQADQMAGQWYAWASGLPPLFDDAKARSALQKIFDF 805
Query: 791 NVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGI 850
NV+KV G GAVNGM P+G+VD + MQSRE+W+GVTYA+AA+MIHE + D F+ A G+
Sbjct: 806 NVMKVKNGHLGAVNGMHPNGKVDETCMQSREVWTGVTYALAAAMIHEGMLDQAFRAAEGV 865
Query: 851 YEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK-----PKTLEKQ 905
+ A WS LGY FQTPEAW D +RSL YMRPLAIWAMQWA+ P+ P+
Sbjct: 866 FIAGWS--DLGYWFQTPEAWTMDGYFRSLAYMRPLAIWAMQWAIDPPETIANAPRIPTVD 923
Query: 906 MKPEVTDESLLRYHAGFSKVARLLKLPEEQ 935
P D S GFS + L+ P Q
Sbjct: 924 RLPSECDTS------GFSSMQSTLRNPLPQ 947
>gi|413941636|gb|AFW74285.1| hypothetical protein ZEAMMB73_466373 [Zea mays]
Length = 966
Score = 925 bits (2390), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 507/965 (52%), Positives = 649/965 (67%), Gaps = 78/965 (8%)
Query: 31 PASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRH 90
PAS TWQR+ ++ F++ + V+L V + L+R+ ++D
Sbjct: 22 PASRTWQRRFDDEGKNVAMFSMTMSDLVEL----VAVYDLLRK--------WMD------ 63
Query: 91 LTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSV 150
GVP+GG+GSGSIGRSYRG FQ++QIFP E+KPVLANQFS FVSR NG KYS+V
Sbjct: 64 -NCYRGVPVGGIGSGSIGRSYRGYFQQFQIFPVTNEEKPVLANQFSAFVSRPNGNKYSTV 122
Query: 151 LCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
L T ++LK AGIGSWDW LK D+ TYHAL+PR+WTV++GEPDPE++I CRQISP
Sbjct: 123 LSAPTADLLKGVGKAGIGSWDWRLKEDRCTYHALFPRSWTVYDGEPDPEIKITCRQISPF 182
Query: 211 IPHNYKESSYPVSVFTYTI-------YNSG--------KTSADITLLFTWTNSVG----- 250
IPHNY+ESS+P +VFT+T+ + +G + A + L + T SV
Sbjct: 183 IPHNYRESSFPAAVFTFTVQSVLSVLFKAGEQLWELTCRCDAALHLGRSTTASVAFSSIT 242
Query: 251 GDSEFTGQHYNSKTKMNDGVHAVLLHH-------RTSHQLPPVTFALAAQETDGVHVSLC 303
G S + V+ +L RT+ PPVTFA+A++ETDGV VS+C
Sbjct: 243 GGSGLIFRGARCYRPCRWLVYTPVLETDADRRQCRTAGGQPPVTFAIASRETDGVRVSVC 302
Query: 304 PHFVISG----NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSS----IGAAIAAS 355
P F I+ +S TA MW EIK+HG+F + + S +GAA+AAS
Sbjct: 303 PSFTIASGGPSSSGEPTAAAMWDEIKKHGAFGHAGAGNGNAARAAASKAGSSVGAAVAAS 362
Query: 356 VTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAA-NIARDAILEHGSW 414
VP + V+FSLAW CPEV F +G TYYRRYT+FYGT +AAA ++A DA+LEH +W
Sbjct: 363 TAVPAGATRVVSFSLAWACPEVKFPAGSTYYRRYTRFYGTDADAAAEHLAHDALLEHMNW 422
Query: 415 ELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLD 474
E QIE WQ+PIL D+RLPEWYP+ LFNELYYLNAGG +WT V ++T G
Sbjct: 423 ESQIEEWQKPILHDQRLPEWYPVALFNELYYLNAGGTIWT-----VEEMLTDGQAPKRAG 477
Query: 475 WSQS-------DLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEE 527
++ S + ++ V ++ ++S E+ ++ A FG LL+ GE+
Sbjct: 478 FASSLTATEPFSIDTLLPVAAGGSAVDGVVRAVASATERSHSAAA----FGAALLRGGED 533
Query: 528 NIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLD 587
N+GQFLYLEG+EY M+NTYDVHFY+SFAL+ LFP+++LS+QRDFA AV++HDP + LD
Sbjct: 534 NVGQFLYLEGMEYNMYNTYDVHFYASFALLSLFPRLELSLQRDFARAVLLHDPRPRRTLD 593
Query: 588 DGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKF 647
G+ V+RKVLGAVPHD+G+ DPW E+NAY L+D ARWKDLNPKFVLQVYRD VATGD F
Sbjct: 594 -GRTVARKVLGAVPHDVGLNDPWVELNAYMLHDAARWKDLNPKFVLQVYRDAVATGDAAF 652
Query: 648 AKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAAS 707
A+AVWP+VYVAMAYMDQFDRDGDGM+EN+G PDQTYD WSVSG+SAY+GG+WVAALQAA+
Sbjct: 653 AEAVWPAVYVAMAYMDQFDRDGDGMVENEGVPDQTYDMWSVSGVSAYTGGVWVAALQAAA 712
Query: 708 ALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSS-SIQADQLAGQWY 765
+AR VGDR +E YF + +A VY+ +LW G+YF YDNSG +S SI ADQLAGQWY
Sbjct: 713 GMARVVGDRPAEAYFRARRDRAARVYDAELWTGAYFRYDNSGGGGASESIMADQLAGQWY 772
Query: 766 ARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSG 825
ARACGL P+V++ KARSAL V +YNV++V GG GAVNGM PDG VD SS QS+E+W G
Sbjct: 773 ARACGLEPVVEDAKARSALATVLDYNVMRVKGGAIGAVNGMRPDGAVDASSTQSKEVWPG 832
Query: 826 VTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPL 885
VTYAVAA+M+HE + + F+TA G ++AAW G GYAFQTPEAW D YRSL YMRPL
Sbjct: 833 VTYAVAAAMVHEGMTEAAFRTAKGAHDAAWGNDGFGYAFQTPEAWTEDGGYRSLHYMRPL 892
Query: 886 AIWAMQWALTRPK----PKTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLL 941
IWAMQWAL+ P+ +T E + E + A F KVA +L+LPEEQ K L
Sbjct: 893 GIWAMQWALSPPELHRDLRTPEARSSEEPSPADAALGQAQFDKVASMLRLPEEQQPKGYL 952
Query: 942 QSLFD 946
++++
Sbjct: 953 WAIYN 957
>gi|356576421|ref|XP_003556330.1| PREDICTED: non-lysosomal glucosylceramidase-like [Glycine max]
Length = 950
Score = 920 bits (2379), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 472/885 (53%), Positives = 618/885 (69%), Gaps = 27/885 (3%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
D + P W+R+L++ L +F + + E +++ +G+RI +REEA+ G++A IDPF
Sbjct: 31 DSSAPPEQAWRRRLNSHANLLKEFRVTFMEAIKMVRLGIRIWSYVREEASHGRKAPIDPF 90
Query: 87 IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
+ ++S GVPLGG+GSGSI R +RGEF++WQI P +CE PV+ANQFS+F+SR G
Sbjct: 91 TRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIIPSLCEASPVMANQFSIFISREGG 150
Query: 145 QK-YSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
K ++SVL P E L GI SW WNL G STYHAL+PRAWTV++GEPDPEL+I
Sbjct: 151 NKNFASVLAPGQHEGLGKPDDQGISSWGWNLSGQHSTYHALFPRAWTVYDGEPDPELKIS 210
Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
CRQISP +PHNY+ESS P +VF YT+ N+GK A ++LLFTW NS+GG S +G H N
Sbjct: 211 CRQISPFVPHNYRESSLPAAVFVYTLVNTGKERAKVSLLFTWANSIGGSSHLSGDHVNEP 270
Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
K DGV VLL+H+T+ PPVTFA+AA ET V+VS+ P F +S S TAK MW +
Sbjct: 271 FKAEDGVSGVLLYHKTAKGNPPVTFAIAACETQNVNVSVLPSFGLSEGS-STTAKGMWSK 329
Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
+ + G FD+ N + S+ S PG ++ AA+AAS+ V P + V FSLAW P+V F+ G
Sbjct: 330 MVKDGQFDQENFNSGPSMPSSPGETLCAAVAASMWVEPHGKCTVAFSLAWSSPKVKFVKG 389
Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
T+ RRYTKFYGT + AAA++A DA+ + WE +IE WQ PIL+D+ LPEWY TLFNE
Sbjct: 390 STFNRRYTKFYGTSEKAAADLAHDALTHYNRWEEEIEKWQNPILKDETLPEWYKFTLFNE 449
Query: 443 LYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQ--NDTAVNILERM 500
LY+L AGG +W D SP + S + ++D R+ ++ N +T + +R
Sbjct: 450 LYFLVAGGTIWID-SPLLSSNM------------RNDQDRVRELENTVVKETEDKMSDRK 496
Query: 501 SSILEQIYTPVALNSSF------GTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSF 554
+++E+I ++ L D + ++G+FLYLEG+EY+MW TYDVHFY+SF
Sbjct: 497 RTVVERIMDSTCDSAVITGHDPADEKLSGDDDADVGRFLYLEGVEYIMWCTYDVHFYASF 556
Query: 555 ALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVN 614
AL+ LFPKI+L+IQRDFA AV+ D K+K L +G W RKV GAVPHD+G DPW E+N
Sbjct: 557 ALLELFPKIELNIQRDFARAVLCEDGRKVKFLAEGNWGIRKVYGAVPHDLGTHDPWHEMN 616
Query: 615 AYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIE 674
AY ++DT++WKDLNPKFVLQVYRD TGD +F VWP+V AM YM+QFDRDGDG+IE
Sbjct: 617 AYNIHDTSKWKDLNPKFVLQVYRDFATTGDLEFGVDVWPAVRAAMEYMEQFDRDGDGLIE 676
Query: 675 NDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY- 733
NDGFPDQTYDTW+V G+S Y G LW+AALQAA+A+A ++GDR + KF KAK +
Sbjct: 677 NDGFPDQTYDTWTVHGVSTYCGCLWLAALQAAAAMALDLGDREFAEKCKRKFLKAKPAFE 736
Query: 734 EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL 793
EKLWNG+YFNYD+ S S SIQADQLAGQWY + GL P+ ++ K +SAL+KVY++NV+
Sbjct: 737 EKLWNGTYFNYDSGSSGNSKSIQADQLAGQWYTASSGLPPLFEDSKIKSALRKVYDFNVM 796
Query: 794 KVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEA 853
KV GG+ GAVNGM P+G+VD + MQSRE+W+GVTY +AA+MI + + F TA GI+ A
Sbjct: 797 KVKGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGLAATMILAGMEEEAFATAEGIFLA 856
Query: 854 AWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
WS G GY FQTPEAW D YRSL YMRPLAIW MQ+A+ RPK
Sbjct: 857 GWSEDGYGYWFQTPEAWTMDGHYRSLMYMRPLAIWGMQYAINRPK 901
>gi|297831192|ref|XP_002883478.1| hypothetical protein ARALYDRAFT_479911 [Arabidopsis lyrata subsp.
lyrata]
gi|297329318|gb|EFH59737.1| hypothetical protein ARALYDRAFT_479911 [Arabidopsis lyrata subsp.
lyrata]
Length = 950
Score = 919 bits (2374), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 470/883 (53%), Positives = 609/883 (68%), Gaps = 20/883 (2%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
D A P W+R+L+ L +FT+ ++E +++ +G+R+ +REEA+ G++A IDPF
Sbjct: 31 DCAAPPPHAWRRRLNCHANILKEFTITFREAIKMVRLGIRLWSYVREEASHGRKAPIDPF 90
Query: 87 IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
K + ++S GVPLGG+GSGSI R +RGEF++WQI P C+ P+++NQFS+F+SR G
Sbjct: 91 TKENCKPSASQGVPLGGMGSGSISRGFRGEFKQWQITPGTCDPSPMMSNQFSIFISRDGG 150
Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
+KY+SVL P L + G+ SW WNL G STYHAL+PRAWT+++GEPDPEL+I
Sbjct: 151 HKKYASVLSPGQHGSLGKSRDKGLSSWGWNLNGQHSTYHALFPRAWTIYDGEPDPELKIS 210
Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
CRQISP IP+NY++SS P +VF YT+ N+GK A ++LLFTW NS+GG S +G H N
Sbjct: 211 CRQISPFIPNNYRDSSLPAAVFVYTLVNTGKERAKVSLLFTWANSMGGTSHMSGGHVNEP 270
Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
DGV VLLHH+T PPVTFA+AA ET V+V++ P F +S +S TAKDMW
Sbjct: 271 FIGEDGVSGVLLHHKTGKGNPPVTFAIAASETQNVNVTVLPCFGLSEDS-SFTAKDMWDT 329
Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
+++ G FD+ N + S S G +I AA++AS V + V+F+L+W P+V F G
Sbjct: 330 VEKDGKFDQENFNSGPSTPSLAGDTICAAVSASAWVEAHGKCTVSFALSWSSPKVKFSKG 389
Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
TY RRYTKFYGT AA ++ DA+ + WE IEAWQ PIL D+RLPEWY TLFNE
Sbjct: 390 STYDRRYTKFYGTSPRAALDLVHDALTNYKRWEEDIEAWQNPILRDERLPEWYKFTLFNE 449
Query: 443 LYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQS-----DLK-RIVDVPNQNDTAVNI 496
LY+L AGG VW D S +++ H++ L S D+ + D+ N N V
Sbjct: 450 LYFLVAGGTVWIDSSS-LNANGNSQHQQSGLGNSDGMAGGLDINDQRNDLANGNSVGVKS 508
Query: 497 LERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFAL 556
+ +S+I + N F D +++G+FLYLEG+EY+MW TYDVHFY+S+AL
Sbjct: 509 NDEVSAIHNR-------NGLFVDTRHVDDGDDVGRFLYLEGVEYVMWCTYDVHFYASYAL 561
Query: 557 IMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAY 616
+MLFPKI+L+IQRDFA AV+ D K+K L +G RKV GAVPHD+G+ DPW E+NAY
Sbjct: 562 LMLFPKIELNIQRDFAKAVLSEDGRKVKFLAEGNVGIRKVRGAVPHDLGMHDPWNEMNAY 621
Query: 617 CLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIEND 676
++DT++WKDLNPKFVLQVYRD ATGD +F VWP+V AM YM+QFDRD D +IEND
Sbjct: 622 NIHDTSKWKDLNPKFVLQVYRDFAATGDYQFGIDVWPAVRAAMEYMEQFDRDNDDLIEND 681
Query: 677 GFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-K 735
GFPDQTYDTW+V G+SAY G LW+AALQAA+A+A ++GD+ + KF AK E K
Sbjct: 682 GFPDQTYDTWTVHGVSAYCGCLWLAALQAAAAMALQIGDKFFAELCKNKFLNAKAALETK 741
Query: 736 LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKV 795
LWNGSYFNYD+ SS S SIQ DQLAGQWYA + GL PI +E K RS ++K++++NV+K
Sbjct: 742 LWNGSYFNYDSGASSNSKSIQTDQLAGQWYAASSGLPPIFEESKIRSTMQKIFDFNVMKT 801
Query: 796 MGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAW 855
GGK GAVNGM PDG+VD + MQSREIW+GVTYA AA+MI + + GF TA GI+ A W
Sbjct: 802 KGGKMGAVNGMHPDGKVDDTCMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGW 861
Query: 856 SGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
S G GY FQTPE W D YRSL YMRPLAIW MQWAL+ PK
Sbjct: 862 SEEGFGYWFQTPEGWTMDGHYRSLIYMRPLAIWGMQWALSLPK 904
>gi|255554158|ref|XP_002518119.1| conserved hypothetical protein [Ricinus communis]
gi|223542715|gb|EEF44252.1| conserved hypothetical protein [Ricinus communis]
Length = 968
Score = 919 bits (2374), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 485/921 (52%), Positives = 623/921 (67%), Gaps = 32/921 (3%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
D A P W+R+L++ L +F++ + E +++ +G+R+ +REEA+ G++A IDPF
Sbjct: 31 DSAAPPKHAWRRRLNSHANILKEFSVTFTEAIKMVRLGIRLWSYVREEASHGRKAPIDPF 90
Query: 87 IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
+ ++S GVPLGG+GSGSI R +RGEF++WQI P +CE PV+ANQFS+F+SR G
Sbjct: 91 TRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIVPSICEVSPVMANQFSIFISRDGG 150
Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
+KY+SVL P E L GI SW WNL G STYHAL+PRAWT+++GEPDPEL+I
Sbjct: 151 TKKYASVLAPGQHEGLGKDGDQGISSWGWNLSGQHSTYHALFPRAWTIYDGEPDPELKIS 210
Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
CRQISP IPHNY++SS P +VF YT+ NSGK A ++LLFTW NS+GG S +G H N
Sbjct: 211 CRQISPFIPHNYRDSSLPTAVFVYTLVNSGKERAKVSLLFTWANSIGGVSHLSGDHVNEP 270
Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
DGV VLLHH+T+ PPVTFA+AA ET V V++ P F +S S +TAKDMW +
Sbjct: 271 FIGEDGVSGVLLHHKTAKGNPPVTFAIAACETQNVSVTVLPSFGLSEES-HITAKDMWSK 329
Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
+ + G FDR N ++ S PG ++ AA++AS V P + V F+L+W P++ F G
Sbjct: 330 MVQDGQFDRENFDCGPTMPSSPGETLCAAVSASAWVEPHGKCTVAFALSWSSPKIKFSKG 389
Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
TY+RRYTKFYGT + AA N+ DA+ + WE +IE WQ PIL+D+RLPEWY TLFNE
Sbjct: 390 STYHRRYTKFYGTSERAAQNLVHDALKNYKWWEEEIEKWQNPILKDERLPEWYKFTLFNE 449
Query: 443 LYYLNAGGAVWTDGSPPVHSLVTIGHRK-FSLDWSQSDLKRIVDVPNQNDT-----AVNI 496
LY+L AGG VW D S L+T R+ ++D + + R Q T A
Sbjct: 450 LYFLVAGGTVWIDSS-----LLTEDMRETMNVDVIEVQVSRPKGAEKQIATNGYNVATIG 504
Query: 497 LERMSSILEQIY-----TPVA-----LNSSFGTNLL---QDGEENIGQFLYLEGIEYLMW 543
LE Y PV+ LN S + L Q+ +++G+FLYLEG+EY+MW
Sbjct: 505 LEEKDGASNGNYPSKDELPVSHENGHLNHSLKLSPLMEWQNNSDDVGRFLYLEGVEYIMW 564
Query: 544 NTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHD 603
TYDVHFY+SFAL+ LFPKI+L+IQRDFA AV+ D K+K L +G RKV GAVPHD
Sbjct: 565 CTYDVHFYASFALLELFPKIELNIQRDFAKAVLSEDGRKVKFLAEGNVGIRKVRGAVPHD 624
Query: 604 IGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMD 663
+G DPW E+NAY ++DT++WKDLNPKFVLQVYRD AT D F VWP+V AM YM+
Sbjct: 625 LGTHDPWNEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATQDMSFGVDVWPAVRSAMEYME 684
Query: 664 QFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFL 723
QFDRDGD +IENDGFPDQTYD W+V G+SAY G LW+AAL+AA+A+A +VGD+ +
Sbjct: 685 QFDRDGDALIENDGFPDQTYDAWTVHGVSAYCGCLWLAALEAAAAMALQVGDKYFAELCR 744
Query: 724 FKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARS 782
KF KAK +E KLWNGSYFNYD+ SS S SIQADQLAGQWY + GL P+ D+ K +S
Sbjct: 745 SKFVKAKSAFEAKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYVASSGLPPLFDDSKIKS 804
Query: 783 ALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADI 842
L+K+Y++NV+KV GG+ GAVNGM P+G+VD + MQSREIW+GVTYAVAA+MI + D
Sbjct: 805 TLQKIYDFNVMKVRGGRMGAVNGMHPNGKVDETCMQSREIWTGVTYAVAATMILAGMEDK 864
Query: 843 GFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTL 902
F A GI+ A WS G GY FQTPE W TD +RSL YMRPLAIW MQWAL+ PK +
Sbjct: 865 AFAAAEGIFLAGWSEDGYGYWFQTPEGWTTDGHFRSLIYMRPLAIWGMQWALSL--PKAI 922
Query: 903 EKQMKPEVTDESLLRYHAGFS 923
+ K + D LL FS
Sbjct: 923 LEAPKINIMDRLLLSPSTRFS 943
>gi|22331303|ref|NP_189060.2| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|79313355|ref|NP_001030757.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|17529232|gb|AAL38843.1| unknown protein [Arabidopsis thaliana]
gi|110742225|dbj|BAE99039.1| hypothetical protein [Arabidopsis thaliana]
gi|332643347|gb|AEE76868.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|332643348|gb|AEE76869.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
Length = 950
Score = 910 bits (2352), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 469/883 (53%), Positives = 608/883 (68%), Gaps = 20/883 (2%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
D A P W+R+L+ L +FT+ ++E +++ +G+R+ +REEA+ G++A IDPF
Sbjct: 31 DSAAPPPHAWRRRLNCHANILKEFTITFREAIKMVRLGIRLWSYVREEASHGRKAPIDPF 90
Query: 87 IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
K + ++S GVPLGG+GSGSI R +RGEF++WQI P C+ P+++NQFS+F+SR G
Sbjct: 91 TKENCKPSASQGVPLGGMGSGSISRGFRGEFKQWQITPGTCDPSPMMSNQFSIFISRDGG 150
Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
+KY+SVL P L + G+ SW WNL G STYHAL+PRAWT+++GEPDPEL+I
Sbjct: 151 HKKYASVLSPGQHGSLGKSRDKGLSSWGWNLNGQHSTYHALFPRAWTIYDGEPDPELKIS 210
Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
CRQISP IP+NY++SS P +VF YT+ N+GK A ++LLFTW NS+GG S +G H N
Sbjct: 211 CRQISPFIPNNYRDSSLPAAVFVYTLVNTGKERAKVSLLFTWANSMGGTSHMSGGHVNEP 270
Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
DGV VLLHH+T PPVTFA+AA ET V+V++ P F +S +S TAKDMW
Sbjct: 271 FIGEDGVSGVLLHHKTGKGNPPVTFAIAASETQNVNVTVLPCFGLSEDS-SFTAKDMWDT 329
Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
+++ G FD+ N + S S G +I AA++AS V + V+F+L+W P+V F G
Sbjct: 330 MEQDGKFDQENFNSGPSTPSLAGDTICAAVSASAWVEAHGKCTVSFALSWSSPKVKFSKG 389
Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
TY RRYTKFYGT AA ++ DA+ + WE IEAWQ PIL D+RLPEWY TLFNE
Sbjct: 390 STYDRRYTKFYGTSPRAALDLVHDALTNYKRWEEDIEAWQNPILRDERLPEWYKFTLFNE 449
Query: 443 LYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRI------VDVPNQNDTAVNI 496
LY+L AGG VW D S +++ H++ L S + + D+ N N V
Sbjct: 450 LYFLVAGGTVWIDSSS-LNANGNSQHQQSGLGNSDGKVGGLDINDQRNDLGNGNSVGVKS 508
Query: 497 LERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFAL 556
+ +S+I + N F D +++G+FLYLEG+EY+MW TYDVHFY+S+AL
Sbjct: 509 NDEVSAIHNR-------NGLFVDTPHVDDGDDVGRFLYLEGVEYVMWCTYDVHFYASYAL 561
Query: 557 IMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAY 616
+MLFPKI+L+IQRDFA AV+ D K+K L +G RKV GAVPHD+G+ DPW E+NAY
Sbjct: 562 LMLFPKIELNIQRDFAKAVLSEDGRKVKFLAEGNVGIRKVRGAVPHDLGMHDPWNEMNAY 621
Query: 617 CLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIEND 676
++DT+RWKDLNPKFVLQVYRD ATGD +F VWP+V AM YM+QFDRD D +IEND
Sbjct: 622 NIHDTSRWKDLNPKFVLQVYRDFAATGDYQFGIDVWPAVRAAMEYMEQFDRDNDDLIEND 681
Query: 677 GFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-K 735
GFPDQTYDTW+V G+SAY G LW+AALQAA+A+A ++GD+ + KF AK E K
Sbjct: 682 GFPDQTYDTWTVHGVSAYCGCLWLAALQAAAAMALQIGDKFFAELCKNKFLNAKAALETK 741
Query: 736 LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKV 795
LWNGSYFNYD+ SS S SIQ DQLAGQWYA + GL P+ +E K RS ++K++++NV+K
Sbjct: 742 LWNGSYFNYDSGSSSNSKSIQTDQLAGQWYAASSGLPPLFEESKIRSTMQKIFDFNVMKT 801
Query: 796 MGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAW 855
GGK GAVNGM PDG+VD + MQSREIW+GVTYA AA+MI + + GF TA GI+ A W
Sbjct: 802 KGGKMGAVNGMHPDGKVDDTCMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGW 861
Query: 856 SGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
S G GY FQTPE W D YRSL YMRPLAIW MQWAL+ PK
Sbjct: 862 SEEGFGYWFQTPEGWTMDGHYRSLIYMRPLAIWGMQWALSLPK 904
>gi|449442371|ref|XP_004138955.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
gi|449520930|ref|XP_004167485.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
Length = 993
Score = 909 bits (2349), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 492/976 (50%), Positives = 649/976 (66%), Gaps = 66/976 (6%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
D A P W+RKL+ L +F++ + E +++ +G+R+ +REEA++G++A IDPF
Sbjct: 31 DSASPPEQAWRRKLNGHANLLKEFSVTFVEAIKMVRLGIRLWSYVREEASQGRKAPIDPF 90
Query: 87 IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSR-SN 143
+ ++S GVPLGG+GSGSI R +RGEF++WQI P CE PV+ANQFS+FVSR
Sbjct: 91 TRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIIPGTCEASPVMANQFSIFVSRDGG 150
Query: 144 GQKYSSVLCPKTPEVL---KDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPEL 200
G+KY+SVL P E L + +GI SW WNL G STYHAL+PRAWTV++GEPDPEL
Sbjct: 151 GKKYASVLAPGQHEGLGCCRKDGDSGISSWGWNLDGQHSTYHALFPRAWTVYDGEPDPEL 210
Query: 201 RIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHY 260
++ CRQISP IPHNY++SS P +VF YT+ N+G+ A ++LLFTW NS+GG+S +G H
Sbjct: 211 KVSCRQISPFIPHNYRDSSLPTAVFVYTLVNTGRERAKVSLLFTWANSIGGNSHLSGNHV 270
Query: 261 NSKTKMNDGVHAVLLHH-----------RTSHQLPPVTFALAAQETDGVHVSLCPHFVIS 309
N DGV VLLHH RT+ PPVTFA+AA ET V V++ P F +S
Sbjct: 271 NEPFIDEDGVSGVLLHHKQNSKDLCKCFRTAKGNPPVTFAIAACETQNVSVTVLPSFGLS 330
Query: 310 GNSLGLTAKDMWHEIKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
S +TAKDMW ++ + G FDR N S S+ S PG ++ AA+AAS V P + V F
Sbjct: 331 EGSC-ITAKDMWDKMVQDGQFDRDNFSSGPSMPSSPGETLCAAVAASAWVEPHGKCTVAF 389
Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
SL+W P+V F+ G +Y+RRYTKFYGT AA +A DA+ + WE +IE WQRP+L D
Sbjct: 390 SLSWSSPKVKFLKGFSYHRRYTKFYGTSGKAAQTLAHDALTNYKRWEEEIEKWQRPVLMD 449
Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGS---------------------PPVHSLVTIG 467
+RLPEWY TLFNELY+L AGG VW D S V + V+
Sbjct: 450 ERLPEWYKFTLFNELYFLVAGGTVWIDSSFVGKKASYDQDHLARLKNDDVKAVEAKVSGR 509
Query: 468 HRKFSLDWSQSDLKRI--VDVPNQNDTAVNIL--ERMSSILEQIYTPVALNSSFGT-NLL 522
+ S + + L ++ ++N T+ + + + L++ YT + S+ T +L
Sbjct: 510 GEEVSRTTTTTTLDGFPSIEYDDENSTSSSHASEDELMVPLKRGYT----DRSYQTYKVL 565
Query: 523 QDG--EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
+ G EE++G+FLYLEG+EY+MW TYDVHFY+S+AL+ LFPKI+L+IQRDFA AV+ D
Sbjct: 566 EPGNTEEDVGRFLYLEGVEYVMWCTYDVHFYASYALLELFPKIELNIQRDFAKAVLSEDG 625
Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
K++ L +G++ RKV GAVPHD+G DPW E+NAY ++DT+RWKDLN KFVLQVYRD
Sbjct: 626 RKVRFLAEGKYGIRKVRGAVPHDLGTHDPWNEMNAYNIHDTSRWKDLNTKFVLQVYRDFA 685
Query: 641 ATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWV 700
AT D F VWPSV A+ YM+QFDRDGDG+IENDGFPDQTYDTW+V GISAY G LWV
Sbjct: 686 ATRDMSFGVDVWPSVRAAIEYMEQFDRDGDGVIENDGFPDQTYDTWTVHGISAYCGCLWV 745
Query: 701 AALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQ 759
AALQAA+A+A E+GD+ + KF KA+ V E +LWNGSYFNYD+ SS S SIQADQ
Sbjct: 746 AALQAAAAMAHELGDQEFAETCKSKFLKARPVLEAELWNGSYFNYDSGSSSNSKSIQADQ 805
Query: 760 LAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQS 819
LAGQWY + GL P+ D+ K +SAL+K+Y++NV+KV GG+ GAVNGM P+G++D + MQS
Sbjct: 806 LAGQWYTASSGLPPLFDDFKIKSALRKIYDFNVMKVRGGRMGAVNGMHPNGKIDETCMQS 865
Query: 820 REIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSL 879
REIW+GVTY VAA+MI + + F+TA GI+ A WS G GY FQTPEAW+TD YRSL
Sbjct: 866 REIWTGVTYGVAATMILAGMEEEAFKTAEGIFLAGWSEEGFGYWFQTPEAWSTDGHYRSL 925
Query: 880 CYMRPLAIWAMQWALTRPK-----PK-TLEKQMKPEVTDESLLRYHAGFSKVARLLKLPE 933
YMRPL+IW MQWAL+ PK PK + ++ ++ + G ++A
Sbjct: 926 IYMRPLSIWGMQWALSLPKAILDAPKINVMDRIHVSSSNTKFFNHETGVRRIA------- 978
Query: 934 EQGAKSLLQSLFDHTC 949
AK S+F+ C
Sbjct: 979 -TKAKCFGDSVFNCAC 993
>gi|356535470|ref|XP_003536268.1| PREDICTED: non-lysosomal glucosylceramidase-like [Glycine max]
Length = 949
Score = 907 bits (2345), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 468/882 (53%), Positives = 614/882 (69%), Gaps = 22/882 (2%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
D + P W+R+L++ L +F + + E +++ +G+RI +REEA+ G++A IDPF
Sbjct: 31 DSSAPPEQAWRRRLNSHANLLKEFRVTFTEAIKMVRLGIRIWSYVREEASHGRKAPIDPF 90
Query: 87 IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
+ ++S GVPLGG+GSGSI R +RGEF++WQI P +CE PV++NQFS+F+SR G
Sbjct: 91 TRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIIPSLCEASPVMSNQFSIFISREGG 150
Query: 145 QK-YSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
+K ++SVL P E L GI SW WNL G STYHAL+PRAWTV++GEPDPEL+I
Sbjct: 151 KKKFASVLAPGQHEGLGKPDDQGISSWGWNLSGQHSTYHALFPRAWTVYDGEPDPELKIS 210
Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
CRQISP IPHNY+ESS P +VF YT+ N+GK A ++LLFTW NS+GG S +G H N
Sbjct: 211 CRQISPFIPHNYRESSLPAAVFVYTLVNTGKERAKVSLLFTWANSIGGSSHSSGDHVNEP 270
Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
K DGV VLL+H+T+ PPVTFA+AA ET V+VS+ P F +S S +TAK MW +
Sbjct: 271 FKAEDGVSGVLLYHKTAKGNPPVTFAIAACETQNVNVSVLPSFGLSEES-SMTAKHMWSK 329
Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
+ + G FD+ N + S+ S PG ++ AA+AAS V P + V FSLAW P+V F+ G
Sbjct: 330 MVKDGQFDQENFNSGPSMPSSPGETLCAAVAASTWVEPHGKCTVAFSLAWSSPKVKFVKG 389
Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
T+ RRYTKFYGT + AAA++A DA+ + WE +IE WQ P+L+D+ LPEWY TLFNE
Sbjct: 390 STFNRRYTKFYGTSEKAAADLAHDALTHYNRWEEEIEKWQNPVLKDEALPEWYKFTLFNE 449
Query: 443 LYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSS 502
LY+L AGG +W D SP + S + + D R ++ +T + +R +
Sbjct: 450 LYFLVAGGTIWID-SPVLSSNMR----------NDQDRVRELESAVVKETEDKMSDRKRT 498
Query: 503 ILEQIY-----TPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALI 557
++E + V L + ++++G+FLYLEG+EY+MW TYDVHFY+SFAL+
Sbjct: 499 VVESTTDSTYDSAVITGHDRADEKLYEDDDDVGRFLYLEGVEYIMWCTYDVHFYASFALL 558
Query: 558 MLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYC 617
LFP+I+L+IQRDFA AV+ D K+K L +G W RKV GAVPHD+G DPW E+NAY
Sbjct: 559 ELFPRIELNIQRDFARAVLCEDGRKVKFLAEGNWGIRKVYGAVPHDLGTHDPWHEMNAYN 618
Query: 618 LYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDG 677
++DT++WKDLNPKFVLQVYRD TGD +F VWP+V AM YM+QFDRDGDG+IENDG
Sbjct: 619 IHDTSKWKDLNPKFVLQVYRDFATTGDLQFGVDVWPAVRAAMEYMEQFDRDGDGLIENDG 678
Query: 678 FPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKL 736
FPDQTYDTW+V G+S Y G LW+AALQAA+ +A E+GDR + KF KAK + EKL
Sbjct: 679 FPDQTYDTWTVHGVSTYCGCLWLAALQAAAVMALELGDREFAETCKRKFLKAKPAFEEKL 738
Query: 737 WNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVM 796
WNG+YFNYD+ S S SIQADQLAGQWY + GL + ++ K +SAL+KVY++NV+KV
Sbjct: 739 WNGTYFNYDSGSSGNSKSIQADQLAGQWYTASSGLPSLFEDSKIKSALRKVYDFNVMKVK 798
Query: 797 GGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWS 856
GG+ GAVNGM P+G+VD + MQSRE+W+GVTY +AA+MIH + + F TA GI+ A WS
Sbjct: 799 GGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGLAATMIHAGMEEEAFTTAEGIFLAGWS 858
Query: 857 GTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
G GY FQTPEAW D YRSL YMRPLAIW MQ+A+ RPK
Sbjct: 859 EDGYGYWFQTPEAWTMDGHYRSLMYMRPLAIWGMQYAINRPK 900
>gi|224072655|ref|XP_002303825.1| predicted protein [Populus trichocarpa]
gi|222841257|gb|EEE78804.1| predicted protein [Populus trichocarpa]
Length = 966
Score = 905 bits (2338), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 475/952 (49%), Positives = 612/952 (64%), Gaps = 45/952 (4%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
D A P W+R+L++ L +F++ +KE +Q+ +G+R+ +REEA+ G++A IDPF
Sbjct: 31 DSAAPPEQAWRRRLNSHANILKEFSVTFKEAIQMVRLGIRLWSYVREEASHGRKAPIDPF 90
Query: 87 IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
+ ++S GVPLGG+GSGSI R +RGEF++WQI P +CE P+ F+SR G
Sbjct: 91 TRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIVPGICESSPI-------FISRDGG 143
Query: 145 QK-YSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
K Y+SVL P E + GI SW WNL G STYHAL+PRAWTV++GEPDPEL+I
Sbjct: 144 NKNYASVLAPGQHEGIGKAGDQGISSWGWNLSGQHSTYHALFPRAWTVYDGEPDPELKIS 203
Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
CRQISP IPHNY++SS P +VF YT+ N+GK A ++LLFTW NS+GG S +G H N
Sbjct: 204 CRQISPFIPHNYRDSSLPTAVFVYTLVNTGKERAKVSLLFTWANSIGGISHLSGDHVNEP 263
Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
DGV VLLHH+ + PPVTFA+AA ET V V++ P F +S S TAK MW
Sbjct: 264 FIGEDGVSGVLLHHKQGN--PPVTFAIAACETQNVSVTVLPSFGLSEGSC-TTAKAMWGT 320
Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
+ + G FDR N + S+ S PG ++ AA++AS V P + V F+LAW P++ F+ G
Sbjct: 321 MVQDGHFDRGNFNWGPSMPSSPGETLCAAVSASAWVEPHGKCTVAFALAWSSPKIKFLKG 380
Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
+Y+RRYTKFYGT + AA N+ DA+ + WE +IE WQ PIL+D++LPEWY TLFNE
Sbjct: 381 SSYHRRYTKFYGTSERAAQNLVHDALTNYKQWEEEIEKWQDPILKDEKLPEWYKFTLFNE 440
Query: 443 LYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSS 502
LY+L AGG VW D S HR ++ + +K V P N +
Sbjct: 441 LYFLVAGGTVWIDSSLSSADTRNGHHRSREVE--TTGIK--VTEPQVNCNGGPDHTTTND 496
Query: 503 ILEQIYTPVALNSSFGTNLLQDGE------------------------ENIGQFLYLEGI 538
N +F T + E +++G+FLYLEG+
Sbjct: 497 HNTTSSEQKENNKAFHTKCICKDESAVSRERGNLDHTLDPFTFLDPLSDDVGRFLYLEGV 556
Query: 539 EYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLG 598
EY+MW TYDVHFY+SFAL+ LFPKI+L+IQRDFA AV+ D K++ L DG RK G
Sbjct: 557 EYIMWCTYDVHFYASFALLALFPKIELNIQRDFAKAVLSEDGRKVRFLADGSVGIRKARG 616
Query: 599 AVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVA 658
AVPHD+G DPW E+NAY ++DT++WKDLNPKFVLQVYRD ATGD F VWP+V A
Sbjct: 617 AVPHDLGTHDPWNEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGVDVWPAVRTA 676
Query: 659 MAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGS 718
M YM+QFDRD DG++ENDGFPDQTYD W+V G+SAY G LW+A+LQAA+A+A ++GD+
Sbjct: 677 MEYMEQFDRDDDGLVENDGFPDQTYDAWTVHGVSAYCGCLWLASLQAAAAMAMQLGDKYF 736
Query: 719 EDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDE 777
+ KF KAK +E KLWNGSYFNYD+ S+ S SIQADQLAG+WY + GL + D+
Sbjct: 737 AELCKSKFAKAKSAFESKLWNGSYFNYDSGSSNNSKSIQADQLAGEWYMASSGLPSLFDD 796
Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
K RSAL K+Y++NV+KV GGK GAVNGM P+G+VD + MQSREIWSGVTYAVAA+MI
Sbjct: 797 VKIRSALNKIYDFNVMKVRGGKMGAVNGMHPNGKVDETCMQSREIWSGVTYAVAATMILS 856
Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
+ D F TA GI+ A WS G GY FQTPEAW D +RSL YMRPLAIW MQWAL+
Sbjct: 857 GMEDKAFTTAEGIFTAGWSEEGYGYWFQTPEAWTIDGHFRSLIYMRPLAIWGMQWALSL- 915
Query: 898 KPKTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTC 949
PK + K + + SLL FS + A L S+F +C
Sbjct: 916 -PKAILDAPKINIMERSLLSPSTRFSLIGETGVKKIATKANCLGNSVFHCSC 966
>gi|357440847|ref|XP_003590701.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
gi|355479749|gb|AES60952.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
Length = 992
Score = 897 bits (2319), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 487/932 (52%), Positives = 628/932 (67%), Gaps = 71/932 (7%)
Query: 23 FDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAF 82
FD +D A P W+R+L++ L +F + + E +++ +G+R+ +REEA+ G++A
Sbjct: 28 FD-IDNAAPPEQAWRRRLNSHANLLKEFRVTFMEAIKMVRLGIRMWSYVREEASHGRKAP 86
Query: 83 IDPFIKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVS 140
IDPF + ++S GVPLGG+GSGSI R +RGEF++WQI P +CE PV+ANQFS+FVS
Sbjct: 87 IDPFTRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIIPGLCEPSPVMANQFSIFVS 146
Query: 141 RSNGQK-YSSVLCPKTPE---VLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEP 196
R G K ++SVL P E + GI SW WNL G STYHAL+PRAWTV++GEP
Sbjct: 147 REGGNKSFASVLAPGQHEGVGACRKADDQGISSWGWNLNGQHSTYHALFPRAWTVYDGEP 206
Query: 197 DPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFT 256
DPEL+I CRQISP IPHNY+ESS P +VF YT+ N+GK A ++LLFTW NS+GG+S +
Sbjct: 207 DPELKISCRQISPFIPHNYRESSLPAAVFVYTLVNTGKERAKVSLLFTWANSIGGNSHLS 266
Query: 257 GQHYNSKTKMNDGVHAVLLHH-------RTSHQLPPVTFALAAQETDGVHVSLCPHFVIS 309
G H N DGV VLL+H RT+ PPVTF++AA ET V VS+ P F +S
Sbjct: 267 GDHVNEPFIAEDGVSGVLLYHKQVVEYFRTAKDNPPVTFSIAACETQNVSVSVLPCFGLS 326
Query: 310 GNSLGLTAKDMWHEIKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
S +TAK MW ++ + G FDR N S S+ S PG ++ AA+AAS V P + V F
Sbjct: 327 DRS-SVTAKGMWTKMVKDGQFDRENFSSGPSMPSSPGETLCAAVAASAWVEPHGKCTVAF 385
Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
SLAW P+V F+ G T+ RRYTKFYGT + AA ++A DA+ + WE +I WQ PIL+D
Sbjct: 386 SLAWSSPKVKFVKGSTFNRRYTKFYGTSERAAVHLAHDALTHYTRWEEEIAKWQDPILKD 445
Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFS---LDWSQSDLKRIV- 484
++LPEWY TLFNELY+L AGG +W D + L++ R S L+ S++ + RI
Sbjct: 446 EKLPEWYKFTLFNELYFLVAGGTIWIDST-----LLSSNKRNNSQDQLEESENAVVRITE 500
Query: 485 ------------------------------------DVPNQNDTAVNILERMSSILEQIY 508
D+ +N T VN L + +S
Sbjct: 501 AKVDCRKREVVECTTDNSYDSTAHRGHNHLDEKHNRDISRENGT-VNTLGKGNSA----N 555
Query: 509 TPVALNSSFGTNLLQDGE-ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSI 567
TP + S NL D + ++ G+FLYLEG+EY+MW TYDVHFY+SFAL+MLFP+I+L+I
Sbjct: 556 TP---HHSTMKNLQHDDDNDDGGRFLYLEGVEYVMWCTYDVHFYASFALLMLFPRIELNI 612
Query: 568 QRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDL 627
QRDFA AV+ D K+K L +G W +RKV GAVPHD+G DPW E+NAY ++DT++WKDL
Sbjct: 613 QRDFAQAVLCEDGRKVKFLAEGNWGTRKVYGAVPHDLGTHDPWHEMNAYNIHDTSKWKDL 672
Query: 628 NPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWS 687
NPKFVLQVYRD ATGD +F VWP+V AM YM+QFDRD DG+IENDGFPDQTYDTW+
Sbjct: 673 NPKFVLQVYRDFSATGDLQFGVDVWPAVRAAMEYMEQFDRDADGLIENDGFPDQTYDTWT 732
Query: 688 VSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDN 746
V G+SAY GGLW+AALQAA+A+A ++GDR + KF KAK VYE KLWNGSYFNYD+
Sbjct: 733 VHGVSAYCGGLWLAALQAAAAMALQLGDRDFAETCKRKFLKAKPVYEQKLWNGSYFNYDS 792
Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
SS S SIQADQLAGQWY + GL + D+ K +S+L+KV+++NV+KV GG+ GAVNGM
Sbjct: 793 GSSSNSKSIQADQLAGQWYTASSGLPSLFDDFKIKSSLRKVFDFNVMKVKGGRMGAVNGM 852
Query: 807 LPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQT 866
P+G+VD + MQSREIW+GVTY VAA+MI + + F TA GI+ A WS G GY FQT
Sbjct: 853 HPNGKVDETCMQSREIWAGVTYGVAATMILAGMEEEAFTTAEGIFLAGWSEEGSGYWFQT 912
Query: 867 PEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
PEA+ D YRSL YMRPL+IW MQ+ALT PK
Sbjct: 913 PEAFTIDGHYRSLIYMRPLSIWGMQYALTMPK 944
>gi|11994237|dbj|BAB01359.1| unnamed protein product [Arabidopsis thaliana]
Length = 937
Score = 886 bits (2289), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 461/883 (52%), Positives = 599/883 (67%), Gaps = 33/883 (3%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
D A P W+R+L+ L +FT+ ++E +++ +G+R+ +REEA+ G++A IDPF
Sbjct: 31 DSAAPPPHAWRRRLNCHANILKEFTITFREAIKMVRLGIRLWSYVREEASHGRKAPIDPF 90
Query: 87 IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
K + ++S GVPLGG+GSGSI R +RGEF++WQI P C+ P+++NQFS+F+SR G
Sbjct: 91 TKENCKPSASQGVPLGGMGSGSISRGFRGEFKQWQITPGTCDPSPMMSNQFSIFISRDGG 150
Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
+KY+SVL P L + G+ SW WNL G STYHAL+PRAWT+++GEPDPEL+I
Sbjct: 151 HKKYASVLSPGQHGSLGKSRDKGLSSWGWNLNGQHSTYHALFPRAWTIYDGEPDPELKIS 210
Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
CRQISP IP+NY++SS P +VF YT+ N+GK A ++LLFTW NS+GG S +G H N
Sbjct: 211 CRQISPFIPNNYRDSSLPAAVFVYTLVNTGKERAKVSLLFTWANSMGGTSHMSGGHVNEP 270
Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
G PPVTFA+AA ET V+V++ P F +S +S TAKDMW
Sbjct: 271 FMTGKGN-------------PPVTFAIAASETQNVNVTVLPCFGLSEDS-SFTAKDMWDT 316
Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
+++ G FD+ N + S S G +I AA++AS V + V+F+L+W P+V F G
Sbjct: 317 MEQDGKFDQENFNSGPSTPSLAGDTICAAVSASAWVEAHGKCTVSFALSWSSPKVKFSKG 376
Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
TY RRYTKFYGT AA ++ DA+ + WE IEAWQ PIL D+RLPEWY TLFNE
Sbjct: 377 STYDRRYTKFYGTSPRAALDLVHDALTNYKRWEEDIEAWQNPILRDERLPEWYKFTLFNE 436
Query: 443 LYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRI------VDVPNQNDTAVNI 496
LY+L AGG VW D S +++ H++ L S + + D+ N N V
Sbjct: 437 LYFLVAGGTVWIDSSS-LNANGNSQHQQSGLGNSDGKVGGLDINDQRNDLGNGNSVGVKS 495
Query: 497 LERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFAL 556
+ +S+I + N F D +++G+FLYLEG+EY+MW TYDVHFY+S+AL
Sbjct: 496 NDEVSAIHNR-------NGLFVDTPHVDDGDDVGRFLYLEGVEYVMWCTYDVHFYASYAL 548
Query: 557 IMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAY 616
+MLFPKI+L+IQRDFA AV+ D K+K L +G RKV GAVPHD+G+ DPW E+NAY
Sbjct: 549 LMLFPKIELNIQRDFAKAVLSEDGRKVKFLAEGNVGIRKVRGAVPHDLGMHDPWNEMNAY 608
Query: 617 CLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIEND 676
++DT+RWKDLNPKFVLQVYRD ATGD +F VWP+V AM YM+QFDRD D +IEND
Sbjct: 609 NIHDTSRWKDLNPKFVLQVYRDFAATGDYQFGIDVWPAVRAAMEYMEQFDRDNDDLIEND 668
Query: 677 GFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-K 735
GFPDQTYDTW+V G+SAY G LW+AALQAA+A+A ++GD+ + KF AK E K
Sbjct: 669 GFPDQTYDTWTVHGVSAYCGCLWLAALQAAAAMALQIGDKFFAELCKNKFLNAKAALETK 728
Query: 736 LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKV 795
LWNGSYFNYD+ SS S SIQ DQLAGQWYA + GL P+ +E K RS ++K++++NV+K
Sbjct: 729 LWNGSYFNYDSGSSSNSKSIQTDQLAGQWYAASSGLPPLFEESKIRSTMQKIFDFNVMKT 788
Query: 796 MGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAW 855
GGK GAVNGM PDG+VD + MQSREIW+GVTYA AA+MI + + GF TA GI+ A W
Sbjct: 789 KGGKMGAVNGMHPDGKVDDTCMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGW 848
Query: 856 SGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
S G GY FQTPE W D YRSL YMRPLAIW MQWAL+ PK
Sbjct: 849 SEEGFGYWFQTPEGWTMDGHYRSLIYMRPLAIWGMQWALSLPK 891
>gi|222612993|gb|EEE51125.1| hypothetical protein OsJ_31868 [Oryza sativa Japonica Group]
Length = 935
Score = 882 bits (2278), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 468/898 (52%), Positives = 590/898 (65%), Gaps = 67/898 (7%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
D P W+R+L++ L +F++ + E +++ +A IDPF
Sbjct: 31 DGGSPPEQAWRRRLNSHANLLKEFSVTFMEAMRM-------------------KAPIDPF 71
Query: 87 IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
K ++S GVPLGG+GSGSI R +RGEF+ W I P +CE P+ FVSR G
Sbjct: 72 TKEKCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCETSPI-------FVSRDGG 124
Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
+KYSSVL P E LK +GI SWDWNL G STYHAL+PRAWTV++GEPDP+L+I
Sbjct: 125 NKKYSSVLSPGHHEGLKKCNDSGISSWDWNLSGQHSTYHALFPRAWTVYDGEPDPDLKIS 184
Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
CRQISP IPH+YK+SS P SVF YT+ N+GK A ++LL TW NS+GG S +G H+N
Sbjct: 185 CRQISPFIPHDYKDSSLPTSVFVYTLVNTGKDRAKVSLLMTWANSIGGFSHHSGGHFNEP 244
Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
T+ PPVTFA+AA ET V+V++ P F +SG +AK MW
Sbjct: 245 -------------FITAKDNPPVTFAIAACETQNVNVTVLPVFGLSGEGHD-SAKQMWDR 290
Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
+K++G FDR N TS+ S G ++ AA++AS V P V F LAW P++ F G
Sbjct: 291 MKQNGHFDRENFEAGTSMPSSSGETLCAAVSASTWVEPHGRCTVVFGLAWSSPKIKFQKG 350
Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
TY RRYT+FYGT + +A N+ DA+ ++ WE +IE WQ PIL++++LPEWY TLFNE
Sbjct: 351 CTYNRRYTEFYGTSERSAVNLVHDALTKYRIWEEEIEKWQNPILKNEKLPEWYKFTLFNE 410
Query: 443 LYYLNAGGAVWTDGSPPV-------HSLVTIGHRKFSLDWSQSDLK------RIVDVPNQ 489
LY+L AGG VWTDG PPV S ++ + D Q +K V N
Sbjct: 411 LYFLVAGGTVWTDGQPPVIDEKPSPGSNQQKSSKRGTRDTKQESVKDNHVKLTAEQVTNG 470
Query: 490 NDTAVNILERMSSILEQIYTPVALNSSFGTN--------LLQDGEENIGQFLYLEGIEYL 541
D A N E+ S ++ P ++ G L ++G EN+G+FLYLEG+EY+
Sbjct: 471 GDLA-NGEEQSVSKYAAVHGPQMAKATNGLGSQEPIPYLLSKNGPENVGKFLYLEGVEYI 529
Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVP 601
MW TYDVHFY+SFAL+ LFPKI+LSIQRDFA AV+ D +MK L DG RKV GAVP
Sbjct: 530 MWCTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRRMKFLADGTSGIRKVKGAVP 589
Query: 602 HDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAY 661
HD+G DPW E+NAY ++DT++WKDLNPKFVLQVYRD ATGD F + VWP+V M Y
Sbjct: 590 HDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVWPAVCAVMDY 649
Query: 662 MDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDY 721
M+QFDRDGDG+IENDGFPDQTYD W+V GISAY GGLW+AALQAA+ +A +GDR +
Sbjct: 650 MNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPFAEK 709
Query: 722 FLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKA 780
+ KF +AK VYE KLWNGSYFNYD+ SS S SIQADQLAGQWYA + GL P+ DE+K
Sbjct: 710 YKLKFIQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPLFDENKI 769
Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
RSAL+K++ +NV+KV GG+ GAVNGM P+G+VD + MQSREIW+GVTY VAA+M+ +
Sbjct: 770 RSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGME 829
Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
GF TA GI+ A WS G GY FQTPE W D YRSL YMRPLAIWAMQWA + PK
Sbjct: 830 HQGFTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQWARSPPK 887
>gi|9665098|gb|AAF97289.1|AC010164_11 Hypothetical protein [Arabidopsis thaliana]
Length = 790
Score = 875 bits (2262), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 457/825 (55%), Positives = 569/825 (68%), Gaps = 82/825 (9%)
Query: 138 FVSRSNGQKYSSVLCPKTPEVLKDTTA---------AGIGSWDWNLKGDKSTYHALYPRA 188
FVSR G K+S+VLCP P+V+KD GI SWDWN+ G+KSTYHALYPR+
Sbjct: 24 FVSRPGGVKHSTVLCPSKPQVVKDNGGYLCQGQAPNMGIESWDWNMTGEKSTYHALYPRS 83
Query: 189 WTVHEG---EPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTW 245
WTV++G + + + R I + K + + + N+G A +TLLFTW
Sbjct: 84 WTVYDGIIFDLNVSNHVYYRLIYCKVLMYIK--VMKMFLLDEQVTNTGAEPAIVTLLFTW 141
Query: 246 TNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPH 305
NSVGG S TGQH+NS K +G PPV++A+AA+ET+ V VS CP
Sbjct: 142 ENSVGGASGLTGQHFNSTMKTANG-------------HPPVSYAIAAKETEDVRVSSCPC 188
Query: 306 FVISGNSLG-LTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEG 364
F++SG + +TA DMW EIK++ SFD+L S S S+PG+SIGAAIAA V VPP +
Sbjct: 189 FIVSGTTPNQITAGDMWDEIKKNASFDKLTSNACS-PSKPGTSIGAAIAAKVKVPPGCDR 247
Query: 365 QVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRP 424
VTFSL+WDCPE F KTY+RRYT+FYG+ NAA +A DA+L
Sbjct: 248 TVTFSLSWDCPEARF-DEKTYHRRYTRFYGSLGNAAVAMAHDALL--------------- 291
Query: 425 ILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIV 484
+G PP SL +IG RK SL S D
Sbjct: 292 ------------------------------NGLPPKQSLDSIGRRKISLGLSTIDKND-- 319
Query: 485 DVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWN 544
+QN+ A++IL R+ ++ QI+ P+ N++ G ++Q+ ENIGQFLYLEG++YLM+N
Sbjct: 320 --QDQNNVALDILGRIDAVCSQIHAPLTSNAALGNTMIQNTTENIGQFLYLEGVQYLMYN 377
Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
TYDVHFYSSFAL+MLFPKI+LSIQRDFAAAV+MHD SK +++ G++V+RKVLGAVPHDI
Sbjct: 378 TYDVHFYSSFALLMLFPKIELSIQRDFAAAVLMHDSSKKQVMSSGEFVTRKVLGAVPHDI 437
Query: 605 GICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
G+ DPWFE+NAY L++T RWKDLN KFVLQVYRDVVATGD FAKAVWPSVY A+AY+DQ
Sbjct: 438 GLNDPWFELNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQ 497
Query: 665 FDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLF 724
FD+DGDGMIEN+GFPDQTYD WS SG+SAY GGLWVAALQA SALARE+GD G+ YF
Sbjct: 498 FDKDGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALAREIGDNGAAVYFNA 557
Query: 725 KFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSAL 784
K++KA+ VYEKLWNGSYFNYDNS S SSSI ADQ+AGQWYARACGL PI E+ + AL
Sbjct: 558 KYEKARSVYEKLWNGSYFNYDNSRSGSSSSILADQMAGQWYARACGLKPIAKEEWIKKAL 617
Query: 785 KKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
+ VY++NV++V G RGAVNGMLPDGRVD S+M SRE+W+G TY+VAA MI E LAD GF
Sbjct: 618 ETVYDFNVMRVRDGTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGF 677
Query: 845 QTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP-KTLE 903
+TA GIYEAAWS GLG AFQTPEAW T+D+YRSLCYMRPLAIW +QWA T PKP + E
Sbjct: 678 RTASGIYEAAWSDRGLGCAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPKPNREQE 737
Query: 904 KQMKPEVTDES--LLRYHAGFSKVARLLKLPEEQGAKSLLQSLFD 946
++P+ D + L + HAGF KVA LK + + ++ LQ+ ++
Sbjct: 738 VSLRPQEEDATSVLFQQHAGFIKVAHYLKNTKGKDHRNRLQTAYE 782
>gi|302762052|ref|XP_002964448.1| hypothetical protein SELMODRAFT_166634 [Selaginella moellendorffii]
gi|300168177|gb|EFJ34781.1| hypothetical protein SELMODRAFT_166634 [Selaginella moellendorffii]
Length = 916
Score = 858 bits (2217), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 476/937 (50%), Positives = 618/937 (65%), Gaps = 44/937 (4%)
Query: 33 SLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHL- 91
+L W+R+L T + L++F +E +++ P+GVR+ R E +G+RA IDPF +
Sbjct: 5 ALAWKRRLDTPVVELTEFKPTLREILEMLPMGVRLWKFCRSEHEQGRRAPIDPFYPQDTP 64
Query: 92 -TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSR----SNGQK 146
+S GVPLGG+G GSIGR +RG+F RWQ+ P VC+ VLA+QFSVF++R QK
Sbjct: 65 ASSCQGVPLGGIGGGSIGRGFRGDFSRWQLLPTVCQTSAVLADQFSVFITRHSKDGTTQK 124
Query: 147 YSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQ 206
SSVL P P T +GI SW+WNL G KSTY+ L+PRAWT ++GEPDP+++I CRQ
Sbjct: 125 TSSVLFPGEPVFESKETESGISSWNWNLTGVKSTYYGLFPRAWTTYDGEPDPQMKITCRQ 184
Query: 207 ISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKM 266
+SP +PHNYKESS+P VF Y + N+G A TLLFTW NS+ G SE +G H NS
Sbjct: 185 VSPFVPHNYKESSFPSCVFIYELVNTGDEDATATLLFTWANSIAGGSETSGGHENSP--- 241
Query: 267 NDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG---LTAKDMWHE 323
+ H VTFA+AA+ETD V S+CP F+++G+ G TAKDMW E
Sbjct: 242 ---------FMKEEHS---VTFAIAARETDDVSTSVCPCFLVAGSPEGNSYQTAKDMWSE 289
Query: 324 IKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
I E G F+ NS ++ S+P + +GAA+AASV VP +TF LAWD P+ F+ G
Sbjct: 290 ITEKGCFNAKNSTMSASLSQPETVVGAAVAASVVVPAHGTKSITFGLAWDSPKATFLGGS 349
Query: 384 TYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNEL 443
+YYRRYTKFYG NAA+ + DAIL + WE +I AWQ PIL D LPEWY TLFNEL
Sbjct: 350 SYYRRYTKFYGKSGNAASKLVHDAILNYKRWESEIHAWQEPILNDHSLPEWYRYTLFNEL 409
Query: 444 YYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSI 503
YYL AGG +WT + + FS Q+ L + ++ N + +L+ +++
Sbjct: 410 YYLVAGGTIWTGMNMKRIYTTLLCDLSFSDTCRQACLAQKLNPVKSNGVSEALLDAINAN 469
Query: 504 LEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKI 563
E SF + + E+N+GQFLYLEG+EY WNTYDVHFY+S+AL+ LFP++
Sbjct: 470 SEA--------QSFWSIGIPAVEDNVGQFLYLEGVEYHFWNTYDVHFYASYALLALFPRL 521
Query: 564 QLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTAR 623
+L +QRDFAAA + HDP KM L G R V G+VPHD+G DPW +N+Y L+DT++
Sbjct: 522 ELQLQRDFAAATLSHDPEKMFYLAYGNTGIRAVFGSVPHDLGTHDPWNTLNSYNLHDTSK 581
Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTY 683
WKDLN KFV+QV RD ATGD +FA+AVWP+ Y AMA DQFDRDGDGMIENDG PDQTY
Sbjct: 582 WKDLNCKFVVQVNRDAAATGDLEFARAVWPAAYAAMAVTDQFDRDGDGMIENDGVPDQTY 641
Query: 684 DTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYF 742
D W+V G+S+Y GGLWVAALQAASALA VGD+ S F KF +AK VYEK LWNGSYF
Sbjct: 642 DLWTVDGVSSYCGGLWVAALQAASALADLVGDKPSSRMFHGKFLRAKAVYEKSLWNGSYF 701
Query: 743 NYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGA 802
+YD + S+S+Q+DQLAGQWY+ A GL IV ED+A SAL V+ NV++ GG GA
Sbjct: 702 HYDKN----STSVQSDQLAGQWYSLASGLPGIVREDQATSALGTVFATNVMRYKGGNEGA 757
Query: 803 VNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
VNGMLP G D +S+QSRE+W+G TYAVAA+MI + + + GF+TA G++ WS G GY
Sbjct: 758 VNGMLPSGGEDRTSLQSREVWAGTTYAVAAAMIQQGMREEGFRTAKGVFLNGWSDQGHGY 817
Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDESLLRYHA-- 920
AFQTPEAW+ D +YRSL YMRPL+IWAMQWAL P+ + ++ EV D + A
Sbjct: 818 AFQTPEAWDNDGKYRSLAYMRPLSIWAMQWALDPPE-QLKQRSCSVEVGDITAQEVEACN 876
Query: 921 -GFSKVARLLKL---PEEQGAKSLLQSLFDHTCRRMF 953
F+K+A LK PE + S+ +FD+ R++
Sbjct: 877 KSFAKLASQLKELSPPEAKAYSSVWIWMFDYIRSRVY 913
>gi|302787158|ref|XP_002975349.1| hypothetical protein SELMODRAFT_174907 [Selaginella moellendorffii]
gi|300156923|gb|EFJ23550.1| hypothetical protein SELMODRAFT_174907 [Selaginella moellendorffii]
Length = 916
Score = 858 bits (2216), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 477/937 (50%), Positives = 617/937 (65%), Gaps = 44/937 (4%)
Query: 33 SLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHL- 91
+L W+R+L T + L +F ++E +++ P+GVR+ R E +G+RA IDPF +
Sbjct: 5 ALAWKRRLDTPVMELKEFKPTFREILEMLPMGVRLWKFCRSEHEQGRRAPIDPFYPQDTP 64
Query: 92 -TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSR----SNGQK 146
+S GVPLGG+G GSIGR +RG+F RWQ+ P VC+ VLA+QFSVF++R QK
Sbjct: 65 ASSCQGVPLGGIGGGSIGRGFRGDFSRWQLLPTVCQTSAVLADQFSVFITRHSKDGTTQK 124
Query: 147 YSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQ 206
SSVL P P + T +GI SW+WNL G KSTY+ L+PRAWT ++GEPDP+++I CRQ
Sbjct: 125 TSSVLFPGEPVFESEETESGISSWNWNLTGVKSTYYGLFPRAWTTYDGEPDPQMKITCRQ 184
Query: 207 ISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKM 266
+SP +PHNYKESS+P VF Y + N+G A TLLFTW NS+ G SE +G H NS
Sbjct: 185 VSPFLPHNYKESSFPSCVFIYELVNTGDEDATATLLFTWANSIAGGSETSGGHENSP--- 241
Query: 267 NDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG---LTAKDMWHE 323
+ H VTFA+AA+ETD V S+CP F+++G G TAKDMW E
Sbjct: 242 ---------FMKEEHS---VTFAIAARETDDVSTSVCPCFLVAGAPEGNSYQTAKDMWSE 289
Query: 324 IKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
I E G F+ NS ++ S+P + +GAA+AASV VP +TF LAWD P+ F+ G
Sbjct: 290 IIEKGCFNAKNSTMSASLSQPETVVGAAVAASVVVPAHGTKSITFGLAWDSPKATFLGGS 349
Query: 384 TYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNEL 443
+YYRRYTKFYG NAA+ + DAIL + WE +I AWQ PIL D LPEWY TLFNEL
Sbjct: 350 SYYRRYTKFYGKSGNAASKLVHDAILNYKRWESEIHAWQEPILNDHSLPEWYRYTLFNEL 409
Query: 444 YYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSI 503
YYL AGG +WT + + FS Q+ L + ++ N + +L+ +++
Sbjct: 410 YYLVAGGTIWTGMNMKRIYTTLLCDLSFSDSCRQACLAQKLNPVKSNGVSEALLDAINAN 469
Query: 504 LEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKI 563
E SF + + E+N+GQFLYLEG+EY WNTYDVHFY+S+AL+ LFP++
Sbjct: 470 SEA--------QSFWSFDIPAVEDNVGQFLYLEGVEYHFWNTYDVHFYASYALLALFPRL 521
Query: 564 QLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTAR 623
+L +QRDFAAA + HDP KM L G R V G+VPHD+G DPW +N+Y LYDT++
Sbjct: 522 ELQLQRDFAAATLSHDPEKMFYLAYGNTGIRAVFGSVPHDLGTHDPWNTLNSYNLYDTSK 581
Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTY 683
WKDLN KFV+QV RD ATGD FA+AVWP+ Y AMA+ DQFDRDGDGMIENDG PDQTY
Sbjct: 582 WKDLNCKFVVQVNRDAAATGDLDFARAVWPAAYAAMAFTDQFDRDGDGMIENDGVPDQTY 641
Query: 684 DTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYF 742
D W+V G+S+Y GGLWVAALQAASALA VGD+ S F KF +AK VYEK LWNGSYF
Sbjct: 642 DLWTVDGVSSYCGGLWVAALQAASALADLVGDKPSSRMFHDKFLRAKAVYEKSLWNGSYF 701
Query: 743 NYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGA 802
+YD + S+S+Q+DQLAGQWY+ A GL IV ED+A SAL V+ NV++ GG GA
Sbjct: 702 HYDKN----STSVQSDQLAGQWYSLASGLPGIVREDQATSALGTVFATNVMRYKGGNEGA 757
Query: 803 VNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
VNGMLP G D +S+QSRE+W G TYAVAA+MI + + + GF+TA G++ WS G GY
Sbjct: 758 VNGMLPSGGEDRTSLQSREVWVGTTYAVAAAMIQQGMREEGFRTAKGVFLNGWSDQGHGY 817
Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDESLLRYHA-- 920
AFQTPEAW+ D +YRSL YMRPL+IWAMQWAL P+ + ++ EV D + A
Sbjct: 818 AFQTPEAWDNDGKYRSLAYMRPLSIWAMQWALDPPE-QLKQRSCNVEVGDITAQEVEACN 876
Query: 921 -GFSKVARLLKL---PEEQGAKSLLQSLFDHTCRRMF 953
F+K+A LK PE + S+ +FD+ R++
Sbjct: 877 KSFAKLASQLKELSPPETKAYSSVWIWMFDYIRSRVY 913
>gi|218199525|gb|EEC81952.1| hypothetical protein OsI_25831 [Oryza sativa Indica Group]
Length = 850
Score = 822 bits (2124), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/876 (49%), Positives = 561/876 (64%), Gaps = 100/876 (11%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
D P W+RKLS+ L +F + ++E +++ +G+R+ I+EEA+ G++A IDPF
Sbjct: 23 DDGSPPEYAWRRKLSSHANRLKEFNVTFREAIKMMKLGLRLWSYIQEEASHGRKAPIDPF 82
Query: 87 IKR--HLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
+ ++S GVPLGG+G +SN
Sbjct: 83 TRESDKPSASQGVPLGGMG-------------------------------------KSND 105
Query: 145 QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVC 204
SS WDW L+GD+STYHAL+PRAWTV++GEPDPEL++ C
Sbjct: 106 DGISS--------------------WDWKLRGDRSTYHALFPRAWTVYDGEPDPELKVSC 145
Query: 205 RQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKT 264
RQISP IPHNY+ESS P SVF YTI N+GK A ++L+ TW NS+GG S +G H N
Sbjct: 146 RQISPFIPHNYQESSLPTSVFVYTIVNTGKERAKVSLVMTWANSIGGLSHHSGGHVNEP- 204
Query: 265 KMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEI 324
T++ PPVTFA+AA E V+V++ P F +SG S +TA++MW +
Sbjct: 205 ------------FMTANNNPPVTFAVAACENQNVNVTVLPAFGLSGES-SVTAREMWGTL 251
Query: 325 KEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
+ G FDR N S S+ S G ++ AA+ AS V P V FSLAW P+V F G
Sbjct: 252 SQDGCFDRDNFSAGPSMPSSLGDTVCAAVCASTWVEPHGRCTVVFSLAWSSPKVKFKKGN 311
Query: 384 TYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNEL 443
YYRRYTKFYGT +A N+ +D+++++ WE +I+ WQRPIL D+R PEWY +TLFNEL
Sbjct: 312 AYYRRYTKFYGTSPRSAINLVQDSLMKYKHWEEEIDKWQRPILHDERFPEWYKVTLFNEL 371
Query: 444 YYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSI 503
Y+L +GG VW D + + + + + S D ND ++ R S++
Sbjct: 372 YFLVSGGTVWIDSASLMADADEMMNSRLSED---------------NDLPLHHSSRNSAV 416
Query: 504 LEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKI 563
++P + DG EN+G+FLYLEGIEY MW TYDVHFY+SFAL+ LFPKI
Sbjct: 417 PLIGFSPHII----------DGRENVGKFLYLEGIEYFMWCTYDVHFYASFALLELFPKI 466
Query: 564 QLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTAR 623
+LSIQRDFA AV+ D S+++ L DG W +RKV+GAV HD+G DPW E+NAY ++DT+R
Sbjct: 467 ELSIQRDFATAVLREDKSRVRFLADGTWGTRKVIGAVAHDLGAHDPWHELNAYNIHDTSR 526
Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTY 683
WKDLNPKFVLQVYRD VATGD F K VWP+V AM YM+QFD D DGMIENDGFPDQTY
Sbjct: 527 WKDLNPKFVLQVYRDFVATGDMSFGKDVWPAVCTAMEYMEQFDHDDDGMIENDGFPDQTY 586
Query: 684 DTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYF 742
D W+V G+SAY G LW+AALQAA+ALAR +G + + KF KAK V+E KLWNGSYF
Sbjct: 587 DAWTVRGVSAYCGCLWLAALQAAAALARTLGHHDYAERCMLKFAKAKPVFEAKLWNGSYF 646
Query: 743 NYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGA 802
NYD+ S S SI ADQLAGQWY + GL P+ DE + + L+K+++YNV++V GG+ GA
Sbjct: 647 NYDSGTSYNSRSIHADQLAGQWYTASSGLPPLFDEGRIKCTLQKIFDYNVMRVKGGRMGA 706
Query: 803 VNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
VNGM P+G+VD + MQSREIW+GVTY++AA+M+ + D F TA GI+ A WS G GY
Sbjct: 707 VNGMHPNGKVDETCMQSREIWTGVTYSLAATMLLHGMDDQAFTTAEGIFTAGWSEEGYGY 766
Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
FQTPEAW D YRSL YMRPLAIWAMQ+AL+ P+
Sbjct: 767 WFQTPEAWTIDGHYRSLIYMRPLAIWAMQYALSPPR 802
>gi|222636955|gb|EEE67087.1| hypothetical protein OsJ_24071 [Oryza sativa Japonica Group]
Length = 838
Score = 803 bits (2075), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/876 (48%), Positives = 550/876 (62%), Gaps = 112/876 (12%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
D P W+RKLS+ L +F + ++E +++ +G+R+ IREEA+ G++A IDPF
Sbjct: 23 DDGSPPEYAWRRKLSSHANRLKEFNVTFREAIKMMKLGLRLWSYIREEASHGRKAPIDPF 82
Query: 87 IKR--HLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
+ ++S GVPLGG+G +SN
Sbjct: 83 TRESDKPSASQGVPLGGMG-------------------------------------KSND 105
Query: 145 QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVC 204
SS WDW L+GD+STYHAL+PRAWTV++GEPDPEL++ C
Sbjct: 106 DGISS--------------------WDWKLRGDRSTYHALFPRAWTVYDGEPDPELKVSC 145
Query: 205 RQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKT 264
RQISP IPHNY+ESS P SVF YTI N+GK A ++L+ TW NS+GG S +G H N
Sbjct: 146 RQISPFIPHNYQESSLPTSVFVYTIVNTGKERAKVSLVMTWANSIGGLSHHSGGHVNEP- 204
Query: 265 KMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEI 324
T++ PPVTFA+AA E V+V++ P F +SG S +TA++MW +
Sbjct: 205 ------------FMTANNNPPVTFAVAACENQNVNVTVLPAFGLSGES-SVTAREMWGTL 251
Query: 325 KEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
+ G FDR N S S+ S G ++ AA+ AS V P V FSLAW P+V F G
Sbjct: 252 SQDGCFDRDNFSAGPSMPSSLGDTVCAAVCASTWVEPHGRCTVVFSLAWSSPKVKFKKGN 311
Query: 384 TYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNEL 443
YYRRYTKFYGT + + +WE +I+ WQRPIL D+R PEWY +TLFNEL
Sbjct: 312 AYYRRYTKFYGTSPSIS------------NWEEEIDKWQRPILHDERFPEWYKVTLFNEL 359
Query: 444 YYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSI 503
Y+L AGG VW D + + + + + S D ND ++ R S++
Sbjct: 360 YFLVAGGTVWIDSASLMADADEMMNSRLSED---------------NDLPLHHSSRNSAV 404
Query: 504 LEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKI 563
++P + D EN+G+FLYLEGIEY MW TYDVHFY+SFAL+ LFPKI
Sbjct: 405 PLIGFSPHII----------DDRENVGKFLYLEGIEYFMWCTYDVHFYASFALLELFPKI 454
Query: 564 QLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTAR 623
+LSIQRDFA AV+ D S+++ L DG W +RKV+GAV HD+G DPW E+NAY ++DT+R
Sbjct: 455 ELSIQRDFATAVLREDKSRVRFLADGTWGTRKVIGAVAHDLGAHDPWHELNAYNIHDTSR 514
Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTY 683
WKDLNPKFVLQVYRD ATGD F K VWP+V AM YM+QFD D DGMIENDGFPDQTY
Sbjct: 515 WKDLNPKFVLQVYRDFAATGDMSFGKDVWPAVCTAMEYMEQFDHDDDGMIENDGFPDQTY 574
Query: 684 DTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYF 742
D W+V G+SAY G LW+AALQ A+ALAR +G + + KF KAK V+E KLWNGSYF
Sbjct: 575 DAWTVRGVSAYCGCLWLAALQTAAALARTLGHHDYAERCMLKFAKAKPVFEAKLWNGSYF 634
Query: 743 NYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGA 802
NYD+ S S SI ADQLAGQWY + GL P+ DE + + L+K+++YNV++V GG+ GA
Sbjct: 635 NYDSGTSYNSRSIHADQLAGQWYTASSGLPPLFDEGRIKCTLQKIFDYNVMRVKGGRMGA 694
Query: 803 VNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
VNGM P+G+VD + MQSREIW+GVTY++AA+M+ + D F TA GI+ A WS G GY
Sbjct: 695 VNGMHPNGKVDETCMQSREIWTGVTYSLAATMLLHGMDDQAFTTAEGIFTAGWSEEGYGY 754
Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
FQTPEAW D YRSL YMRPLAIWAMQ+AL+ P+
Sbjct: 755 WFQTPEAWTIDGHYRSLIYMRPLAIWAMQYALSPPR 790
>gi|4539009|emb|CAB39630.1| putative protein [Arabidopsis thaliana]
gi|7267702|emb|CAB78129.1| putative protein [Arabidopsis thaliana]
Length = 750
Score = 748 bits (1932), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/753 (53%), Positives = 504/753 (66%), Gaps = 96/753 (12%)
Query: 221 PVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKT-KMNDGVHAV------ 273
P+ +++ N GK A +TLLFTW NSVGG S TG+H+NS + G+ V
Sbjct: 71 PILTNQFSMSNLGKEEATVTLLFTWENSVGGASGLTGEHFNSTIIHLKRGLVPVSGRFIA 130
Query: 274 --------LLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNS-LGLTAKDMWHEI 324
+ RT + PPVT+A+AAQET+ VHVS CP F++SG+S +TAK+MW EI
Sbjct: 131 SIYCNCDEMKTCRTGNGHPPVTYAIAAQETEDVHVSECPCFLVSGHSPKEITAKEMWDEI 190
Query: 325 KEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKT 384
K++ SFD LNS E S PG+SIGAAIAA V VPP + VTFSL+WDCPEV F + KT
Sbjct: 191 KKNKSFDELNS-EPGSPSRPGTSIGAAIAAKVKVPPGCDRTVTFSLSWDCPEVRF-NEKT 248
Query: 385 YYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELY 444
Y+R Y +TLFNELY
Sbjct: 249 YHR-----------------------------------------------YRVTLFNELY 261
Query: 445 YLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSIL 504
Y N+GG +WTDG PP S+ + V QND +++ ++++++
Sbjct: 262 YFNSGGTIWTDGLPPKESIE----------------RSKVTNTEQNDIVIDLFQKINAVC 305
Query: 505 EQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQ 564
EQIY+P + NS EENIGQF+YLEGIEYLM+NTYDVHFYSSFAL+ LFPK+
Sbjct: 306 EQIYSPQSSNS----------EENIGQFIYLEGIEYLMYNTYDVHFYSSFALLSLFPKLA 355
Query: 565 LSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARW 624
LSIQRDFAA V++ DP+K K++ G+WV+RK+LG+VPHDIG+ DPW E+N Y ++T RW
Sbjct: 356 LSIQRDFAATVLIQDPTKKKIMSSGEWVTRKLLGSVPHDIGLNDPWLELNEYNFFNTDRW 415
Query: 625 KDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYD 684
KDLN KFVLQVYRDVVAT D+ FAKAVWPSVY A+AY+DQFD+D DGMIEN+GFPDQTYD
Sbjct: 416 KDLNAKFVLQVYRDVVATNDQSFAKAVWPSVYTAVAYLDQFDKDEDGMIENEGFPDQTYD 475
Query: 685 TWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNY 744
WSV+G+SAY GGLWVAALQAASA A VG+ YF K++KAK+VYEKLWNGSYFNY
Sbjct: 476 AWSVTGVSAYCGGLWVAALQAASAFASIVGENAVAIYFNAKYEKAKIVYEKLWNGSYFNY 535
Query: 745 DNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVN 804
D+SGS SSSI ADQLAGQWYARACGL PI E+ + AL+ +Y +NV+KV GG RGAVN
Sbjct: 536 DDSGSGSSSSILADQLAGQWYARACGLKPITKEEWIKKALETIYEFNVMKVKGGTRGAVN 595
Query: 805 GMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAF 864
GM +G+VD +S+ S+E+W+G TY+VAA MI E + GFQTA GIYEA WS GL +F
Sbjct: 596 GMSTEGKVDTNSLVSKEVWAGTTYSVAACMIQEGQREKGFQTASGIYEAIWSDRGLSCSF 655
Query: 865 QTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQ--MKPEVTDES--LLRYHA 920
QTPEAWN +D+YRSLCYMRPLAIWA+QWALTR + EKQ + + +ES LLR H
Sbjct: 656 QTPEAWNMNDEYRSLCYMRPLAIWAIQWALTRTQSFGEEKQKLVAGDEEEESNLLLRQHK 715
Query: 921 GFSKVARLLKL-PEEQGAKSLLQSLFDHTCRRM 952
GF VAR +K+ P +S LQ ++ + +
Sbjct: 716 GFKDVARFVKIVPTSNVHRSRLQHTYETVLKTL 748
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 34/66 (51%), Positives = 47/66 (71%), Gaps = 12/66 (18%)
Query: 84 DPFIKRHLTSSHGVPLGGVGS------------GSIGRSYRGEFQRWQIFPRVCEDKPVL 131
D F KRH+ HGVPLGG+G+ GSIGRSY+GEFQ++++FP++CE+ P+L
Sbjct: 14 DIFKKRHVRGDHGVPLGGIGNVVNFTQTRIFSGGSIGRSYKGEFQQFKLFPKICEEAPIL 73
Query: 132 ANQFSV 137
NQFS+
Sbjct: 74 TNQFSM 79
>gi|326507888|dbj|BAJ86687.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 676
Score = 702 bits (1813), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/618 (58%), Positives = 451/618 (72%), Gaps = 16/618 (2%)
Query: 265 KMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG---LTAKDMW 321
+ DGVH VLLHHRT+ PPVTFA+A+QETD V V+ CP F + ++ G AKDMW
Sbjct: 10 RARDGVHGVLLHHRTADGHPPVTFAIASQETDDVRVTCCPSFAMGPSAPGGEQFMAKDMW 69
Query: 322 HEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMS 381
E K GS S +S PGSSIGAA+AA+ TVP V+F+L+W CPEV F +
Sbjct: 70 EEAKNRGSVGVAPG--ASASSRPGSSIGAAVAAATTVPAGGTRAVSFALSWSCPEVKFPA 127
Query: 382 GKTYYRRYTKFYGTHQNAAA-NIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLF 440
G+TY+RRYTKF G ++AAA +A DA+LEH WE IE WQRP+L DKRLP WYP+ LF
Sbjct: 128 GRTYHRRYTKFLGLDRDAAAERLAHDALLEHMKWESLIEEWQRPVLHDKRLPGWYPVALF 187
Query: 441 NELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERM 500
NELYYLNAGG +WTDG PP + + K+ +S + + IL M
Sbjct: 188 NELYYLNAGGTIWTDGLPPKKT--SFASSKYGSTTMESFSLDGFHFHSGDHAPDGILRAM 245
Query: 501 SSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLF 560
++ E+ +S+FGT LL DGEE++G+FLYLEG+EY +WNTYDVHFY+SFAL+ LF
Sbjct: 246 ATAEERTEA----SSAFGTALLGDGEESVGRFLYLEGMEYHLWNTYDVHFYASFALLSLF 301
Query: 561 PKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYD 620
P+I+L +QRDFA AV+ HDP M+ LD G V RKVLGAVPHD+G+ DPWF++NAY ++D
Sbjct: 302 PEIELGLQRDFARAVLHHDPRPMRTLD-GATVPRKVLGAVPHDVGLGDPWFQLNAYMIHD 360
Query: 621 TARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPD 680
ARWKDLN KFVLQVYRD ATGD FA AVWP+VY+AMAYMDQFDRDGDGM+EN+G PD
Sbjct: 361 PARWKDLNTKFVLQVYRDGAATGDAAFATAVWPAVYLAMAYMDQFDRDGDGMVENEGRPD 420
Query: 681 QTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNG 739
QTYD WSVSG+SAY+GGLWVAALQAA+ +AR VGDRGSE YF+ +++KA+ VY+ +LWNG
Sbjct: 421 QTYDFWSVSGVSAYTGGLWVAALQAAAVMARVVGDRGSEGYFVERYEKARRVYDGELWNG 480
Query: 740 SYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGK 799
+YF+YDNSG + S SI ADQLAGQWYARACGL PIV+E+KARSAL V +YNV++V GG
Sbjct: 481 AYFDYDNSGGTNSKSIMADQLAGQWYARACGLEPIVEEEKARSALGTVLDYNVMRVQGGA 540
Query: 800 RGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTG 859
GAVNGM PDG VD SS+QSRE+W GVTY VAA+M HE + + F+TA G ++A W G
Sbjct: 541 IGAVNGMRPDGTVDTSSIQSREVWPGVTYGVAAAMAHEGMPEAAFRTAKGAHDAGWGRDG 600
Query: 860 LGYAFQTPEAWNTDDQYR 877
G + P+A D + R
Sbjct: 601 FG--VRVPDAGGVDGRRR 616
>gi|147812668|emb|CAN61857.1| hypothetical protein VITISV_016690 [Vitis vinifera]
Length = 521
Score = 692 bits (1785), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/531 (64%), Positives = 402/531 (75%), Gaps = 37/531 (6%)
Query: 435 YPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAV 494
Y ITLFNELY+LNAGG +WTDG PP+ SL TI KFSLD S SD K D+ +QND+ V
Sbjct: 11 YRITLFNELYFLNAGGTIWTDGLPPMQSLATIEQIKFSLDRSISDPKNTTDIVHQNDSTV 70
Query: 495 NILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSF 554
IL RM+S+LEQI+ P NS+FGT LLQ GEEN+GQFLYLEGIEY MWNTYDVHFYSSF
Sbjct: 71 EILGRMTSMLEQIHNPTTSNSAFGTYLLQSGEENVGQFLYLEGIEYHMWNTYDVHFYSSF 130
Query: 555 ALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVN 614
A+IMLFP+++LSIQRDFAAAVM+HDPS+MK++ DG+WV RKVLGAVPHDIGI DPWFE+N
Sbjct: 131 AIIMLFPQLELSIQRDFAAAVMVHDPSRMKIMSDGKWVPRKVLGAVPHDIGISDPWFELN 190
Query: 615 AYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIE 674
AY LYDT RWKDLN KFVLQVYRD+VATGDK FA+AVWP+VY+A+A++DQFD+DGDGMIE
Sbjct: 191 AYNLYDTDRWKDLNSKFVLQVYRDMVATGDKNFARAVWPAVYIAIAFLDQFDKDGDGMIE 250
Query: 675 NDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE 734
NDG P LQAASA+AREVGD + DYF FKFQKAK VY+
Sbjct: 251 NDGLP-----------------------LQAASAMAREVGDSMTADYFWFKFQKAKAVYD 287
Query: 735 K-----------LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSA 783
K L F + S + Q + YARACGL PIVD++KARSA
Sbjct: 288 KDQLAAVIGEGRLLXLFRFLGNPSHTGIDGLFQDHRDWTTRYARACGLQPIVDDEKARSA 347
Query: 784 LKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIG 843
L+KVYN+NVLKV GK GAVNGMLPDGRVDMS+MQSREIW+GVTY+VAA+MIHE + +
Sbjct: 348 LEKVYNFNVLKVKEGKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETA 407
Query: 844 FQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLE 903
F TA GIY+AAWS GLGY+FQTPEAWNTD++YRSLCYMRPLAIWAMQWAL+ KP+
Sbjct: 408 FNTASGIYDAAWSQEGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALS--KPELHN 465
Query: 904 KQMKPEVTDESL-LRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMF 953
MK E +L +H GF KVA LLKLPEE+ +KS LQ FD TCRR++
Sbjct: 466 HDMKHEEGKGTLNFEHHVGFEKVAHLLKLPEEEASKSFLQLFFDLTCRRLY 516
>gi|449482327|ref|XP_004156248.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Cucumis
sativus]
Length = 508
Score = 652 bits (1681), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/505 (64%), Positives = 393/505 (77%), Gaps = 9/505 (1%)
Query: 454 TDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPN---QNDTAVNILERMSSILEQIYTP 510
+DG PP+ +L TI H+K+ L+ S+S+L PN + D AV+ILERMS IL+Q +
Sbjct: 4 SDGLPPLQNLSTISHKKYFLERSKSELNG--GAPNGDHRKDVAVDILERMSQILDQTHGG 61
Query: 511 VA-LNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQR 569
N++ GT LL GEEN+G L +EG +YLMWNTYDVHFYSSFALIMLFPK++LSIQR
Sbjct: 62 AGPSNAALGTRLLHPGEENVGNLLLVEGSQYLMWNTYDVHFYSSFALIMLFPKLELSIQR 121
Query: 570 DFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNP 629
DFAAAV+MHDP K K + DG WV RKVLGAVPHDIG DPW EVNAY L + +RWKDL
Sbjct: 122 DFAAAVLMHDPRKAKTMCDGNWVPRKVLGAVPHDIGFNDPWLEVNAYNLLNVSRWKDLGS 181
Query: 630 KFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVS 689
KFVLQVYRDVVATGDK FAK+VWPSVYVA+A+M+QFD+D DGMIEN+GFPDQTYDTW+V
Sbjct: 182 KFVLQVYRDVVATGDKNFAKSVWPSVYVALAFMEQFDKDKDGMIENEGFPDQTYDTWTVK 241
Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGS 749
G+SAY GGLWVAALQAASALA EV D + YF K+QKA+ VYE LWNGSYFNYDNS
Sbjct: 242 GVSAYCGGLWVAALQAASALASEVDDEAAAHYFWIKYQKARSVYETLWNGSYFNYDNSKG 301
Query: 750 SQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD 809
SSSIQADQLAGQWYARACGL PI DE+K R AL+K+YN+NV+KV GG RGAVNGM PD
Sbjct: 302 PWSSSIQADQLAGQWYARACGLCPIADEEKIRVALEKIYNFNVMKVKGGTRGAVNGMFPD 361
Query: 810 GRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEA 869
G VD S +Q +EIW+GVTY+VAA+MI E + + GFQTA GIY+AAW+ GLGY+FQTPEA
Sbjct: 362 GSVDKSILQPKEIWAGVTYSVAATMIQEGMVETGFQTAMGIYQAAWAQDGLGYSFQTPEA 421
Query: 870 WNTDDQYRSLCYMRPLAIWAMQWA-LTRPKPKTLEKQMKPEVTDESLLRYHAGFSKVARL 928
W+ DD++RS+ YMRPLAIWAMQWA ++ KP + + E+ + + HA F KVA L
Sbjct: 422 WDVDDRFRSIGYMRPLAIWAMQWAMMSDSKPTKVPTKAFSEMQESAFATQHAAFLKVASL 481
Query: 929 LKLP--EEQGAKSLLQSLFDHTCRR 951
LKLP ++ +SL+++ +D C+R
Sbjct: 482 LKLPSNDDTARRSLVEAAYDFICKR 506
>gi|226499590|ref|NP_001146319.1| uncharacterized protein LOC100279895 [Zea mays]
gi|219886613|gb|ACL53681.1| unknown [Zea mays]
Length = 649
Score = 643 bits (1659), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/602 (56%), Positives = 420/602 (69%), Gaps = 24/602 (3%)
Query: 320 MWHEIKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVN 378
MW+ + ++G FDR N S S+ S PG + AA++AS V P V F+LAW P+V
Sbjct: 1 MWNTMLQNGHFDRENFSAGPSMPSSPGQKLCAAVSASTWVEPHGRCTVVFALAWSSPKVK 60
Query: 379 FMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPIT 438
F G TY RRYT+FYGT + +A N+ DA+ ++ WE +IE WQ PIL+D+RLPEWY T
Sbjct: 61 FQKGCTYNRRYTQFYGTSEKSAVNLVHDALTKYKLWEEEIEKWQNPILKDERLPEWYKFT 120
Query: 439 LFNELYYLNAGGAVWTDGSPPV---HSLVTIGHRKFSL----DWSQSDLK------RIVD 485
LFNELY+L AGG VWTDG PP H+K S D +Q +K +
Sbjct: 121 LFNELYFLVAGGTVWTDGQPPAIDEKKSPGFNHQKSSKRGTKDTNQGSVKDRHVNLAVEQ 180
Query: 486 VP------NQNDTAVNILERM--SSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEG 537
VP N +D +V+ + S + EQI + + +DG E++G+FLYLEG
Sbjct: 181 VPHGGYMANGDDHSVSKFAAVHGSEMQEQI-NGLKSEEPIPYLISKDGPEHVGKFLYLEG 239
Query: 538 IEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVL 597
+EY+MWNTYDVHFY+SFAL+ LFPKI+LSIQRDFA AV+ D K+K L DG RKV
Sbjct: 240 VEYIMWNTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRKVKFLADGTSGIRKVK 299
Query: 598 GAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYV 657
GAVPHD+G DPW E+NAY ++DT++WKDLNPKFVLQ+YRD ATGD +F + VWP+V
Sbjct: 300 GAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAATGDMQFGRGVWPAVCA 359
Query: 658 AMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
AM YMDQFDRDGDG+IENDGFPDQTYD W+V GISAY G LW+AALQAA+ +A +GDR
Sbjct: 360 AMDYMDQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRH 419
Query: 718 SEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVD 776
+ + KF KAK VYE KLWNGSYFNYD+ SS S SIQADQLAGQWYA + GL P+ D
Sbjct: 420 FAEKYKLKFIKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPLFD 479
Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
E K R+AL+K++ +NV+KV GG+ GAVNGM P G+VD + MQSREIW+GVTYAVAA+M+
Sbjct: 480 EHKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLL 539
Query: 837 EDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
+ GF TA GI+ A WS G GY FQTPE W TD YRSL YMRPLAIWA+Q+A++
Sbjct: 540 HGMEHQGFTTAEGIFTAGWSEEGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAIQYAVSP 599
Query: 897 PK 898
PK
Sbjct: 600 PK 601
>gi|414871104|tpg|DAA49661.1| TPA: hypothetical protein ZEAMMB73_422318 [Zea mays]
Length = 649
Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/602 (56%), Positives = 420/602 (69%), Gaps = 24/602 (3%)
Query: 320 MWHEIKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVN 378
MW+ + ++G FDR N S S+ S PG + AA++AS V P V F+LAW P+V
Sbjct: 1 MWNTMLQNGHFDRENFSAGPSMPSSPGQKLCAAVSASTWVEPHGRCTVVFALAWSSPKVK 60
Query: 379 FMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPIT 438
F G TY RRYT+FYGT + +A N+ DA+ ++ WE +IE WQ PIL+D+RLPEWY T
Sbjct: 61 FQKGCTYNRRYTQFYGTSEKSAVNLVHDALTKYKLWEEEIEKWQNPILKDERLPEWYKFT 120
Query: 439 LFNELYYLNAGGAVWTDGSPPV---HSLVTIGHRKFSL----DWSQSDLK------RIVD 485
LFNELY+L AGG VWTDG PP H+K S D +Q +K +
Sbjct: 121 LFNELYFLVAGGTVWTDGQPPAIDEKKSPGFNHQKSSKRGTKDTNQGSVKDRHVNLAVEQ 180
Query: 486 VP------NQNDTAVNILERM--SSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEG 537
VP N +D +V+ + S + EQI + + +DG E++G+FLYLEG
Sbjct: 181 VPHGGYMANGDDHSVSKFAAVHGSEMQEQI-NGLKSEEPIPYLISKDGPEHVGKFLYLEG 239
Query: 538 IEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVL 597
+EY+MWNTYDVHFY+SFAL+ LFPKI+LSIQRDFA AV+ D K+K L DG RKV
Sbjct: 240 VEYIMWNTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRKVKFLADGTSGIRKVK 299
Query: 598 GAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYV 657
GAVPHD+G DPW E+NAY ++DT++WKDLNPKFVLQ+YRD ATGD +F + VWP+V
Sbjct: 300 GAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAATGDMQFGRDVWPAVCA 359
Query: 658 AMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
AM YMDQFDRDGDG+IENDGFPDQTYD W+V GISAY G LW+AALQAA+ +A +GDR
Sbjct: 360 AMDYMDQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRH 419
Query: 718 SEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVD 776
+ + KF KAK VYE KLWNGSYFNYD+ SS S SIQADQLAGQWYA + GL P+ D
Sbjct: 420 FAEKYKLKFIKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPLFD 479
Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
E K R+AL+K++ +NV+KV GG+ GAVNGM P G+VD + MQSREIW+GVTYAVAA+M+
Sbjct: 480 EHKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLL 539
Query: 837 EDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
+ GF TA GI+ A WS G GY FQTPE W TD YRSL YMRPLAIWA+Q+A++
Sbjct: 540 HGMEHQGFTTAEGIFTAGWSEEGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAIQYAVSP 599
Query: 897 PK 898
PK
Sbjct: 600 PK 601
>gi|147788372|emb|CAN61187.1| hypothetical protein VITISV_019326 [Vitis vinifera]
Length = 900
Score = 629 bits (1621), Expect = e-177, Method: Compositional matrix adjust.
Identities = 348/687 (50%), Positives = 444/687 (64%), Gaps = 63/687 (9%)
Query: 274 LLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRL 333
+L +RT+ + PPVTFA+AA ET V V++ P F +S S +TAKDMW ++ + G FDR
Sbjct: 167 VLANRTAKENPPVTFAIAACETQNVSVTVLPSFGLSEGS-HITAKDMWGKMVQDGQFDRE 225
Query: 334 NSME-TSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKF 392
N S+ S PG ++ AA++AS V P + V F+LAW P+V F+ G +Y+RRYTK+
Sbjct: 226 NCYSGRSMPSSPGETLCAAVSASAWVEPHGKCTVAFALAWSSPKVKFLKGSSYHRRYTKY 285
Query: 393 YGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAV 452
YGT + AA NI DA+ + WE +IE WQ PIL D RLPEWY TLFNELY+L AGG V
Sbjct: 286 YGTSERAALNIVHDALTNYKQWEEEIEKWQSPILRDDRLPEWYKFTLFNELYFLVAGGTV 345
Query: 453 WTDGSPP-------VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILER--MSSI 503
W D S P +H + + ++ ++ + +R V N + R +
Sbjct: 346 WIDSSLPATSSKNSLHQSAAVENTNVNVTVAKGNSRRGAAVENSVTDGYDTTSRKGLEYD 405
Query: 504 LEQIYT----------PVALNS--SFGTNLL---QDGEENIGQFLYLEGIEYLMWNTYDV 548
E+I+T P NS S + L QD +++G+FLYLEG+EY+MW TYDV
Sbjct: 406 EEEIHTRNTCEEKPVIPQESNSHHSIHKDTLKDPQDETDDVGRFLYLEGVEYIMWCTYDV 465
Query: 549 HFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD 608
HFY+SFAL+ LFPKI+LSIQR+FA AV+ D ++K L +G W RKV GAVPHD+G D
Sbjct: 466 HFYASFALLELFPKIELSIQREFAKAVLSEDGRRVKFLAEGNWGIRKVRGAVPHDLGTHD 525
Query: 609 PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRD 668
PW E+NAY ++DT++WKDLNPKFVLQVYRD AT D F VWP+V AM YM+QFDRD
Sbjct: 526 PWHEMNAYNIHDTSQWKDLNPKFVLQVYRDFAATRDFSFGADVWPAVRAAMEYMEQFDRD 585
Query: 669 GDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK 728
DG+IENDGFPDQTYDTW+V GISAY G LW+AALQAA+A+A ++GD+ + KF K
Sbjct: 586 SDGLIENDGFPDQTYDTWTVHGISAYCGCLWLAALQAAAAMALQLGDKPFAEKCKSKFFK 645
Query: 729 AKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKV 787
AK+V+ EKLWNGSYFNYD+ SS S SIQADQLAGQWY + GL + D+ K +S+L K+
Sbjct: 646 AKLVFEEKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASSGLPSLFDDCKIKSSLHKI 705
Query: 788 YNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTA 847
Y++NV+KV GGK GAVNGM P+G+VD S MQSREIW+GVTY VAA+MI + + F TA
Sbjct: 706 YDFNVMKVKGGKMGAVNGMHPNGKVDESCMQSREIWTGVTYGVAATMILSGMEEQAFTTA 765
Query: 848 CGIYEAAWSGTGLG------------------------------------YAFQTPEAWN 871
GI+ A WS G G Y FQTPE W
Sbjct: 766 EGIFTAGWSEEGYGTLDFIDDMHCVVQILNCFVHSGLCVACVILLYCTCRYWFQTPEGWT 825
Query: 872 TDDQYRSLCYMRPLAIWAMQWALTRPK 898
D +RSL YMRPLAIW MQWAL+ P+
Sbjct: 826 IDGHFRSLIYMRPLAIWGMQWALSMPR 852
>gi|357440839|ref|XP_003590697.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
gi|355479745|gb|AES60948.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
Length = 1103
Score = 624 bits (1608), Expect = e-175, Method: Compositional matrix adjust.
Identities = 353/669 (52%), Positives = 451/669 (67%), Gaps = 57/669 (8%)
Query: 273 VLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDR 332
V+ + RT+ PPVTF++AA ET V VS+ P F +S S +TAK MW ++ + G FDR
Sbjct: 401 VVEYFRTAKDNPPVTFSIAACETQNVSVSVLPCFGLSDRS-SVTAKGMWTKMVKDGQFDR 459
Query: 333 LN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTK 391
N S S+ S PG ++ AA+AAS V P + V FSLAW P+V F+ G T+ RRYTK
Sbjct: 460 ENFSSGPSMPSSPGETLCAAVAASAWVEPHGKCTVAFSLAWSSPKVKFVKGSTFNRRYTK 519
Query: 392 FYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGA 451
FYGT + AA ++A DA+ + WE +I WQ PIL+D++LPEWY TLFNELY+L AGG
Sbjct: 520 FYGTSERAAVHLAHDALTHYTRWEEEIAKWQDPILKDEKLPEWYKFTLFNELYFLVAGGT 579
Query: 452 VWTDGSPPVHSLVTIGHRKFS---LDWSQSDLKRIV------------------------ 484
+W D + L++ R S L+ S++ + RI
Sbjct: 580 IWIDST-----LLSSNKRNNSQDQLEESENAVVRITEAKVDCRKREVVECTTDNSYDSTA 634
Query: 485 -------------DVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGE-ENIG 530
D+ +N T VN L + +S TP + S NL D + ++ G
Sbjct: 635 HRGHNHLDEKHNRDISRENGT-VNTLGKGNSA----NTP---HHSTMKNLQHDDDNDDGG 686
Query: 531 QFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQ 590
+FLYLEG+EY+MW TYDVHFY+SFAL+MLFP+I+L+IQRDFA AV+ D K+K L +G
Sbjct: 687 RFLYLEGVEYVMWCTYDVHFYASFALLMLFPRIELNIQRDFAQAVLCEDGRKVKFLAEGN 746
Query: 591 WVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKA 650
W +RKV GAVPHD+G DPW E+NAY ++DT++WKDLNPKFVLQVYRD ATGD +F
Sbjct: 747 WGTRKVYGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFSATGDLQFGVD 806
Query: 651 VWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALA 710
VWP+V AM YM+QFDRD DG+IENDGFPDQTYDTW+V G+SAY GGLW+AALQAA+A+A
Sbjct: 807 VWPAVRAAMEYMEQFDRDADGLIENDGFPDQTYDTWTVHGVSAYCGGLWLAALQAAAAMA 866
Query: 711 REVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARAC 769
++GDR + KF KAK VYE KLWNGSYFNYD+ SS S SIQADQLAGQWY +
Sbjct: 867 LQLGDRDFAETCKRKFLKAKPVYEQKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASS 926
Query: 770 GLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYA 829
GL + D+ K +S+L+KV+++NV+KV GG+ GAVNGM P+G+VD + MQSREIW+GVTY
Sbjct: 927 GLPSLFDDFKIKSSLRKVFDFNVMKVKGGRMGAVNGMHPNGKVDETCMQSREIWAGVTYG 986
Query: 830 VAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWA 889
VAA+MI + + F TA GI+ A WS G GY FQTPEA+ D YRSL YMRPL+IW
Sbjct: 987 VAATMILAGMEEEAFTTAEGIFLAGWSEEGSGYWFQTPEAFTIDGHYRSLIYMRPLSIWG 1046
Query: 890 MQWALTRPK 898
MQ+ALT PK
Sbjct: 1047 MQYALTMPK 1055
Score = 263 bits (671), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 181/314 (57%), Gaps = 59/314 (18%)
Query: 23 FDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAF 82
FD +D A P W+R+L++ L +F + + E +++ +G+R+ +REEA+ G++A
Sbjct: 28 FD-IDNAAPPEQAWRRRLNSHANLLKEFRVTFMEAIKMVRLGIRMWSYVREEASHGRKAP 86
Query: 83 IDPFIKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSV--- 137
IDPF + ++S GVPLGG+GSGSI R +RGEF++WQI P +CE PV+ANQFSV
Sbjct: 87 IDPFTRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIIPGLCEPSPVMANQFSVSLV 146
Query: 138 -------------------------------------------------FVSRSNGQK-Y 147
FVSR G K +
Sbjct: 147 FPCTIDSLYTSLICSQIRSRQNHLWVYFPSIYEDYFVSWGNECSLFAEIFVSREGGNKSF 206
Query: 148 SSVLCPKTPE---VLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVC 204
+SVL P E + GI SW WNL G STYHAL+PRAWTV++GEPDPEL+I C
Sbjct: 207 ASVLAPGQHEGVGACRKADDQGISSWGWNLNGQHSTYHALFPRAWTVYDGEPDPELKISC 266
Query: 205 RQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKT 264
RQISP IPHNY+ESS P +VF YT+ N+GK A ++LLFTW NS+GG+S +G H N
Sbjct: 267 RQISPFIPHNYRESSLPAAVFVYTLVNTGKERAKVSLLFTWANSIGGNSHLSGDHVNEPF 326
Query: 265 KMNDGVHAVLLHHR 278
DGV VLL+H+
Sbjct: 327 IAEDGVSGVLLYHK 340
>gi|125821356|ref|XP_687652.2| PREDICTED: non-lysosomal glucosylceramidase [Danio rerio]
Length = 851
Score = 621 bits (1601), Expect = e-175, Method: Compositional matrix adjust.
Identities = 358/869 (41%), Positives = 490/869 (56%), Gaps = 79/869 (9%)
Query: 46 PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
P ++ ++ +GVR L R+ + K FID F L +GVPLGG+G
Sbjct: 44 PFQAKDVSLSNMLEHVSLGVRYLKWWYRKTQVEKKAPFIDMFRAVPLRQIYGVPLGGIGG 103
Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
GSI R +RGEF RWQ+ P + K V+ANQF+V + R Y VL + P L+
Sbjct: 104 GSITRGWRGEFCRWQLNPGMYHYKTVIANQFTVCLRRDGQTVYQQVLSTERPSTLQ---- 159
Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
W+W G+ + YH L+PRAWTV+ P + + CRQ+SP+IPH+YK+SS PV+V
Sbjct: 160 ----GWNWGYCGEHAFYHGLFPRAWTVYN-LPGQNVTLTCRQVSPVIPHDYKDSSLPVAV 214
Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK---TKMNDGVHAVLLHHRTSH 281
F + I N + DI+++FT N G + +G H+N K + V VLLHH+T
Sbjct: 215 FVWDIENKNDYALDISIMFTMVNGSGQKDDKSGGHWNEPFHLEKEGESVSGVLLHHQTPA 274
Query: 282 QLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVT 341
P T +AA++ G +S F G T +W ++ G D + +S
Sbjct: 275 N--PYTLCIAARQKSGQEISHQTAFSPKG-----TCSAVWCDLMTDGRLD--SPTNSSPP 325
Query: 342 SEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG-KTYYRRYTKFYGTHQNAA 400
+E G + AA+ S +V +S V FSLAWD P++ F S +TY RRYT++YGT +AA
Sbjct: 326 TEKGEEVAAALVVSCSVSANSRNSVDFSLAWDMPKITFGSKERTYVRRYTRYYGTKGDAA 385
Query: 401 ANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPV 460
+I+ A+ + WE IE WQRPIL+D+ LP WY LFNELY++ GG +W
Sbjct: 386 PSISHYALTHYSKWEESIEEWQRPILQDRSLPAWYKSALFNELYFVVDGGTIW------- 438
Query: 461 HSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTN 520
V++P D + + + Q P +
Sbjct: 439 -----------------------VELPEDADISGGLRPEDGGLPAQ---PEVV------- 465
Query: 521 LLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
++ G+F YLEG EY M+NTYDVHFY+SFALIML+PK+ LS+Q D A +V+ HDP
Sbjct: 466 ------KDFGRFAYLEGQEYRMYNTYDVHFYASFALIMLWPKLALSLQYDIAGSVVQHDP 519
Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRD 638
+ L +G++ K G VPHDIG D PW VNAY ++DTA WKDLN KFVLQVYRD
Sbjct: 520 MERLNLMNGRYSPVKTRGVVPHDIGDPDDEPWVRVNAYLIHDTADWKDLNLKFVLQVYRD 579
Query: 639 VVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
T D+++ K +WP M +FD+DGDG+IEN G+ DQTYD W V+G SAY GGL
Sbjct: 580 YHLTQDQQYLKDMWPVCQTVMENELKFDKDGDGLIENSGYADQTYDGWKVTGPSAYCGGL 639
Query: 699 WVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQA 757
W+A++ +AR + + ++ ++K LWNG Y+NYD+SG S S+S+ +
Sbjct: 640 WLASVCMMCKMARVLNCESVYQRYRDILERGSAAFDKLLWNGKYYNYDSSGRSLSNSVMS 699
Query: 758 DQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVD 813
DQ AG W+ RA GL ++K SALK V++ NV+ GG+ GAVNGM P+G D
Sbjct: 700 DQCAGHWFLRASGLGDDEYQAFPKEKICSALKSVFDLNVMSFAGGQMGAVNGMRPEGVPD 759
Query: 814 MSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTD 873
SS+QS E+W GV Y +AA+MIHE + + G +TA G Y A W G+ AFQTPEA+
Sbjct: 760 RSSVQSDEVWVGVVYGLAATMIHEGMVEEGLRTAEGCYRAVWERMGM--AFQTPEAYCEK 817
Query: 874 DQYRSLCYMRPLAIWAMQWAL-TRPKPKT 901
YRSL YMRPL+IWAMQ AL RP T
Sbjct: 818 GIYRSLAYMRPLSIWAMQLALNNRPNHDT 846
>gi|363744064|ref|XP_003642968.1| PREDICTED: non-lysosomal glucosylceramidase-like [Gallus gallus]
Length = 853
Score = 614 bits (1584), Expect = e-173, Method: Compositional matrix adjust.
Identities = 355/844 (42%), Positives = 474/844 (56%), Gaps = 80/844 (9%)
Query: 79 KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
K AFID L +G PLGG+GSG+I R +RGEF RWQ+ P + V+A+QF+V
Sbjct: 73 KSAFIDLLCAVPLQQIYGCPLGGIGSGTITRGWRGEFCRWQLNPGKYHYETVIADQFTVC 132
Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
+ R Y VL + P L+ W+W G + YHALYPRAW V+E P
Sbjct: 133 LRRKGQTVYQQVLSVEKPSALQ--------GWNWGYCGRYAFYHALYPRAWMVYE-LPGQ 183
Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
+ + CRQISP+IPH+YK+SS PV VF + + N+ + D++++FT N G + +G
Sbjct: 184 NVVLTCRQISPVIPHDYKDSSLPVGVFIWEVENNSEEPVDVSIMFTLQNGTGTKGDRSGG 243
Query: 259 HYNSKTKMNDG---VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGL 315
H+N + DG V VLLHH T + P TFA+AA+E G ++ F +G
Sbjct: 244 HWNEPFALQDGGERVAGVLLHHCTP--VNPFTFAVAAREKAGTVITHLTAFDPAG----- 296
Query: 316 TAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCP 375
+ +D+W ++ + G + + S ++ G AA+ AS TVP G + +L WD P
Sbjct: 297 SGRDVWQDLLQDGKLE--SPTGKSKQTQNGEVTAAAVCASCTVPAQGHGTLELALVWDMP 354
Query: 376 EVNFMSG-KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEW 434
V+F S + + RRYT+F+G++ +AA ++ A+ + WE +IEAWQ+PILED +LP W
Sbjct: 355 HVHFGSKERLHLRRYTRFFGSNGDAAPALSHYALTHYKEWERKIEAWQKPILEDSQLPSW 414
Query: 435 YPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAV 494
Y LFNELY++ GG +W D P
Sbjct: 415 YKSALFNELYFMTDGGTIWLDLPP------------------------------------ 438
Query: 495 NILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSF 554
+ + + P A S +L++ G+F YLEG EY M+NTYDVHFY+SF
Sbjct: 439 ------DCLPQDLQGPGAAKLSHLLPVLRE----YGRFAYLEGQEYRMYNTYDVHFYASF 488
Query: 555 ALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFE 612
AL+ML+PK+Q+S+Q D A V+ D + L G+ KV VPHDIG D PW
Sbjct: 489 ALVMLWPKLQISLQYDIAVTVLNEDVQPRQYLVCGKTAQVKVKNVVPHDIGDPDDEPWQR 548
Query: 613 VNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGM 672
VNAY ++DTA WKDLN KFVLQVYRD T D + + +WP M +FD D DG+
Sbjct: 549 VNAYLMHDTANWKDLNLKFVLQVYRDYYLTHDALYLQDMWPVCQAVMESELKFDTDNDGL 608
Query: 673 IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVV 732
IEN G DQTYD W V G SAY GGLW+AA++ +A +GD + + QK K
Sbjct: 609 IENGGIADQTYDAWVVDGASAYCGGLWLAAVRMMCEMAEVLGDTETRQKYDAILQKGKES 668
Query: 733 YEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKARSALKKV 787
+E+ LWNG Y+NYD+SGSS SSSI +DQ AGQW+ ACGL + + + SALK +
Sbjct: 669 FERLLWNGKYYNYDSSGSSTSSSIMSDQCAGQWFLGACGLDQKEVEVFPKSHIVSALKTI 728
Query: 788 YNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTA 847
+ NV+ GGK GAVNGM PDG D SS+QS E+W GV YA+AA+MI E L GF TA
Sbjct: 729 FEKNVMSFAGGKMGAVNGMRPDGVPDTSSVQSSEVWVGVVYALAATMIQEGLVQEGFHTA 788
Query: 848 CGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL---TRPKPKTLEK 904
G Y W LG AFQTPEA+ YRSL YMRPL+IW+MQ AL R P
Sbjct: 789 EGCYRTVWE--HLGMAFQTPEAYCEKKVYRSLAYMRPLSIWSMQLALESKARQAPSPTPP 846
Query: 905 QMKP 908
Q P
Sbjct: 847 QKNP 850
>gi|253761389|ref|XP_002489100.1| hypothetical protein SORBIDRAFT_0073s002050 [Sorghum bicolor]
gi|241947399|gb|EES20544.1| hypothetical protein SORBIDRAFT_0073s002050 [Sorghum bicolor]
Length = 603
Score = 611 bits (1575), Expect = e-172, Method: Compositional matrix adjust.
Identities = 321/581 (55%), Positives = 399/581 (68%), Gaps = 27/581 (4%)
Query: 320 MWHEIKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVN 378
MW + + GSFDR N S S+ G ++ AA++AS V P V F+LAW P+V
Sbjct: 1 MWGTMVQDGSFDRDNFSAGPSMPCLLGDTVCAAVSASTWVEPHGRCTVAFALAWSSPKVK 60
Query: 379 FMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPIT 438
F G TYYRRYTKFYGT +A N+ DA++++ WE I+ WQ PIL D+RLPEWY IT
Sbjct: 61 FKKGSTYYRRYTKFYGTSPRSAVNLVEDALMKYKHWEEAIDKWQTPILRDERLPEWYKIT 120
Query: 439 LFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILE 498
LFNELY+L AGG VW D SLV S +D K P ++ +
Sbjct: 121 LFNELYFLVAGGTVWIDS----ESLV-----------SDADNKLNPSPPEDSNFPFHDST 165
Query: 499 RMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIM 558
S++ + P + D +EN+G+FLYLEGIEY MW TYDVHFY+SFAL+
Sbjct: 166 CNSTVPLIGFDPHEI----------DDKENVGKFLYLEGIEYFMWCTYDVHFYASFALLD 215
Query: 559 LFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCL 618
LFPKI+LSIQRDFA AV+ D +++ L DG W RKV+GAV HD+G DPW E+NAY +
Sbjct: 216 LFPKIELSIQRDFARAVLREDNRRVRFLADGTWGIRKVIGAVAHDLGAHDPWHELNAYNI 275
Query: 619 YDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGF 678
+DT+RWKDLNPKFVLQ+YRD ATGDK F K VWP+V AM YM+QFD DGD MIENDGF
Sbjct: 276 HDTSRWKDLNPKFVLQIYRDFAATGDKSFGKDVWPAVCTAMEYMEQFDHDGDDMIENDGF 335
Query: 679 PDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLW 737
PDQTYD W+V G+SAY G LW+AALQAA+ALAR +G + + +F KAK V+E +LW
Sbjct: 336 PDQTYDAWTVQGVSAYCGCLWLAALQAAAALARSLGHDDYAERCMTRFAKAKSVFEARLW 395
Query: 738 NGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMG 797
NGSYFNYD+ S S SIQADQLAGQWY + GL P+ DED+ + L+K+++YNV++V G
Sbjct: 396 NGSYFNYDSGTSYNSRSIQADQLAGQWYTASSGLPPLFDEDRIKCTLQKIFDYNVMRVKG 455
Query: 798 GKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSG 857
G+ GAVNGM P+G+VD + MQSREIW+GVTY++AA+M+ + F TA GIY A WS
Sbjct: 456 GRMGAVNGMHPNGKVDDTCMQSREIWTGVTYSLAATMLLHGMEHQAFTTAEGIYIAGWSE 515
Query: 858 TGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
G GY FQTPEAW D YRSL YMRPLAIWAMQWAL+ P+
Sbjct: 516 EGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQWALSPPR 556
>gi|414589150|tpg|DAA39721.1| TPA: hypothetical protein ZEAMMB73_266944 [Zea mays]
Length = 583
Score = 608 bits (1568), Expect = e-171, Method: Compositional matrix adjust.
Identities = 311/584 (53%), Positives = 390/584 (66%), Gaps = 53/584 (9%)
Query: 320 MWHEIKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVN 378
MW + + GSFD+ N + S+ S G ++ AA++AS V P V F+LAW P+V
Sbjct: 1 MWGTMVQDGSFDQDNLNTGPSMPSLLGDTVCAAVSASTWVEPHGRCAVVFALAWSSPQVK 60
Query: 379 FMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPIT 438
F G TYYRRYTKFYGT +A N+ DA++++ WE +I+ WQ P+L+D+RLPEWY IT
Sbjct: 61 FKKGSTYYRRYTKFYGTSPRSAVNLVEDALMKYKHWEEEIDKWQTPVLQDERLPEWYKIT 120
Query: 439 LFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILE 498
LFNELY+L AGG VW D SLV+ K + P +++ + I
Sbjct: 121 LFNELYFLVAGGTVWIDSG----SLVSDADNKSN--------------PEESNLSHEI-- 160
Query: 499 RMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIM 558
D +EN+G+FLYLEGIEY MW TYDVHFY+SFAL+
Sbjct: 161 -------------------------DDKENVGKFLYLEGIEYFMWCTYDVHFYASFALLD 195
Query: 559 LFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCL 618
LFPKI+LSIQRDFA AV+ D +++ L DG W RKV+GAV HD+G DPW E+NAY +
Sbjct: 196 LFPKIELSIQRDFARAVLREDNRRVRFLADGTWGIRKVIGAVAHDLGAHDPWHELNAYNI 255
Query: 619 YDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGF 678
+DT+RWKDLNPKFVLQ+YRD ATGDK F K VWP+V AM YM+QFD DGDGMIENDGF
Sbjct: 256 HDTSRWKDLNPKFVLQIYRDFAATGDKSFGKDVWPAVCTAMEYMEQFDHDGDGMIENDGF 315
Query: 679 PDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDY---FLFKFQKAKVVYE- 734
PDQTYD W+V G+SAY G L A A A G G +DY + +F KAK V+E
Sbjct: 316 PDQTYDAWTVQGVSAYCG---CLWLAALQAAAALAGSLGHDDYAERCMTRFVKAKTVFEA 372
Query: 735 KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
+LWNGS+FNYD+ S S SIQADQLAGQWY + GL P+ DED+ + L+K+++YNV++
Sbjct: 373 RLWNGSHFNYDSGTSYSSRSIQADQLAGQWYTASSGLPPLFDEDRIKCTLQKIFDYNVMR 432
Query: 795 VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
V GG+ GAVNGM P+G+VD + MQSREIW+GVTY++AA+M+ + F TA GIY A
Sbjct: 433 VKGGRMGAVNGMHPNGKVDETCMQSREIWTGVTYSLAATMLLHGMEHQAFTTAEGIYTAG 492
Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
WS G GY FQTPEAW D YRSL YMRPLAIWAMQWAL+ P+
Sbjct: 493 WSEEGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQWALSPPR 536
>gi|330844075|ref|XP_003293963.1| hypothetical protein DICPUDRAFT_158887 [Dictyostelium purpureum]
gi|325075658|gb|EGC29519.1| hypothetical protein DICPUDRAFT_158887 [Dictyostelium purpureum]
Length = 1205
Score = 608 bits (1568), Expect = e-171, Method: Compositional matrix adjust.
Identities = 340/912 (37%), Positives = 499/912 (54%), Gaps = 108/912 (11%)
Query: 35 TWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSS 94
+W R+L F ++ + ++L IG R+ +++E + G+ +DPF
Sbjct: 327 SWARRLDHNFPDHKHFQVSMWQGLKLVGIGYRMWKYVKKENSSGRVPMMDPFNLPKPGPI 386
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV--------------- 139
GVP+GG+GSGSI R +RG+F RW + + ++ V ++FSV++
Sbjct: 387 MGVPIGGIGSGSINRGWRGDFVRWNMNSGLVTNETVDVDKFSVYINFDDSNSASSSTGST 446
Query: 140 --------------SRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALY 185
S + ++ ++VL P P+ + W+W LKG+ S Y L+
Sbjct: 447 PSGTFTPNGPNFDPSSTKSKQKATVLYPGKPK-----NNLNLDVWNWGLKGENSCYFGLF 501
Query: 186 PRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGK-TSADITLLFT 244
PRAWTV+E EP P++++VC+QISP+IPHNY+ESSYPV V+ + I N+ K +SAD++L+ T
Sbjct: 502 PRAWTVYE-EPHPDIKLVCKQISPVIPHNYQESSYPVGVYVWKIENNNKNSSADVSLMLT 560
Query: 245 WTNSVGGDSEFTGQHYNSKTKMND----GVHAVLLHHRTSHQLP----------PVTFAL 290
W NS+G S+ G HYN + D + V L HR + P+ +++
Sbjct: 561 WQNSIGTKSDQDGGHYNKYFTLGDEPNKKIKGVTLIHRKEQRCAYNSSNIVYHDPMEYSI 620
Query: 291 AAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGA 350
A ++ V+++ F + L A ++W+ + G L +++ S S P IGA
Sbjct: 621 AVKDDPDVNITFNERFETTSR---LDAANLWYTFNKKG---ELENIQDSRPSAPKKPIGA 674
Query: 351 AIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILE 410
A+AA V VP + F +AWD P F G YY+RYTKFYG + IA DA+
Sbjct: 675 AVAAKVKVPAGGSKTIVFCIAWDQPITRFQLGSAYYKRYTKFYGNKGGNSQRIAYDALQN 734
Query: 411 HGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRK 470
+ SW+ QI WQ PI++D LP +Y + LFNELYYL GG VWT GSP
Sbjct: 735 YKSWDNQIAQWQNPIIQDAELPSFYKMALFNELYYLVDGGTVWTHGSPQ----------- 783
Query: 471 FSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIG 530
+Q+ L + + D +IG
Sbjct: 784 ----------------------------------DQLPARYPLKTQLKPEDISD-PNHIG 808
Query: 531 QFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQ 590
+F YLE +EYLM+NTYDVHFYSS+AL ML+P +++S+Q D A A M+ + + GQ
Sbjct: 809 RFAYLESLEYLMYNTYDVHFYSSYALAMLWPLLEISLQYDIAEATMLDYGITWEGIHSGQ 868
Query: 591 WVSRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFA 648
+ RK VPHD+G DPW VN+Y + D +RWKDL KFVLQVYRD + DK F
Sbjct: 869 QIPRKKRCTVPHDLGNPGEDPWKRVNSYNIQDISRWKDLPSKFVLQVYRDYLVVEDKNFL 928
Query: 649 KAVWPSVY-VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAAS 707
V+ V V ++ FD D DG+++N+GFPDQTYD W G SAYSGGLW+A+L+ AS
Sbjct: 929 LQVYNVVEEVIQRTLESFDTDHDGVVDNEGFPDQTYDVWPAVGCSAYSGGLWLASLKVAS 988
Query: 708 ALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYA 766
+A+ +G + E + F+K Y +KLWNG YF YD S S + SI +D LAG WY
Sbjct: 989 EMAKILGFKEDESIYNAIFEKGSASYTKKLWNGYYFKYDCSNSVHADSIMSDMLAGHWYL 1048
Query: 767 RACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGV 826
+CGL + D+A S+L + YNV + GAVNGM P+G VD + +QS E+W G
Sbjct: 1049 LSCGLPSYMTFDQALSSLSIINEYNVNSYGKERCGAVNGMRPEGVVDNTCLQSSEVWIGT 1108
Query: 827 TYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLA 886
++++AA+MI + ++ GI ++++ G+ +QTPEAW+++ YR+ YMRPL+
Sbjct: 1109 SFSLAATMIQHHMDKDAWELVKGIVNSSYN--RWGFQYQTPEAWDSNGCYRAGAYMRPLS 1166
Query: 887 IWAMQWALTRPK 898
IW++QWAL + K
Sbjct: 1167 IWSIQWALLKKK 1178
>gi|281212396|gb|EFA86556.1| hypothetical protein PPL_00357 [Polysphondylium pallidum PN500]
Length = 4775
Score = 607 bits (1566), Expect = e-171, Method: Compositional matrix adjust.
Identities = 346/876 (39%), Positives = 491/876 (56%), Gaps = 85/876 (9%)
Query: 36 WQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSH 95
W ++L F ++ + +L +G R+ ++ E + G+ +DPF
Sbjct: 339 WTKRLDHQFPEHKAFQVSIGQGFKLMGLGYRMWKYVKRENSAGRVPIMDPFNIPKPGPVM 398
Query: 96 GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ------KYSS 149
GVP+GG+GSGSI R +RG+F RW + + + V AN+FSV++ Q K ++
Sbjct: 399 GVPIGGIGSGSITRGWRGDFVRWNLKNGMVNSEVVDANKFSVYIKMEEQQSQPLSTKRAT 458
Query: 150 VLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISP 209
VLCP P+ + + W+W+LKGD+S+Y +PRAWTV+E EP ++R+VC+Q+SP
Sbjct: 459 VLCPGKPK-----SNFALDVWNWSLKGDRSSYFGQFPRAWTVYE-EPHQDVRLVCKQVSP 512
Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMND- 268
+IP+NYKESSYP VF + I N+ T A+++L+FTW NS G + G H+N + K D
Sbjct: 513 VIPNNYKESSYPCGVFVWKIENTASTDAEVSLMFTWQNSDGTAVDQAGGHHNKRFKYTDE 572
Query: 269 ------GVHAVLLHHRTSHQLP------PVTFALAAQETDGVHVSLCPHFVISGNSLGLT 316
GV S P P+ ++ ++ V S F + L
Sbjct: 573 QGRSINGVTLTTNRDLKSSVDPKKVYQDPLELSIGVRDDADVQFSFVSRFETTNR---LE 629
Query: 317 AKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPE 376
A ++W+ + G D + E S S P IGAAIAA V V + +V FS+AWD P
Sbjct: 630 AANLWYSFNKSGVLD---NSEDSRPSAPKKPIGAAIAAKVLVKAGTTREVAFSVAWDTPV 686
Query: 377 VNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYP 436
F SG YYRRYTKF+GT N + IA DA+ + WE +I WQ PIL D LP +Y
Sbjct: 687 CRFNSGSGYYRRYTKFFGTAGNNSQRIACDAVYNYRKWEEEIVRWQHPILSDPSLPTFYK 746
Query: 437 ITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNI 496
+FNELYY GG VWT G+P Q + K+IV Q D
Sbjct: 747 QAIFNELYYFVDGGTVWTHGAPD----------------DQPNQKKIVSKFQQED----- 785
Query: 497 LERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFAL 556
L D IG+F YLE ++YLM+NTYDVHFY+SFAL
Sbjct: 786 -------------------------LNDPNY-IGRFAYLESVDYLMYNTYDVHFYASFAL 819
Query: 557 IMLFPKIQLSIQRDFAAAVMM-HDPSKMKLLDDGQWVSRKVLGAVPHDIGIC--DPWFEV 613
ML+P+++LS+Q DFA A ++ + ++++ + G+ + RKV GAVPHD+G DPW V
Sbjct: 820 AMLWPRLELSLQTDFADATLLDYSDTQVECIQTGKNMPRKVRGAVPHDLGNPGEDPWKRV 879
Query: 614 NAYCLYDTARWKDLNPKFVLQVYRDVVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGM 672
NAY + D +RWKDL KFVLQ+YRD + G D+ F +W V + ++D D DG+
Sbjct: 880 NAYHIQDVSRWKDLPSKFVLQIYRDYLINGNDRTFLLQMWGVVEEVIRRAFEYDIDIDGV 939
Query: 673 IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVV 732
I+N+G PDQTYD WS G SAYSGGLW+AA++ AS +AR +G + E + F+K K
Sbjct: 940 IDNEGVPDQTYDAWSALGCSAYSGGLWLAAVKVASEMARILGLKEDETVYKKIFEKGKKS 999
Query: 733 YE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYN 791
Y KLWNG YFNYD+S + SI +DQLAG WY ACGL + D+A S+L + YN
Sbjct: 1000 YSTKLWNGHYFNYDSSKNPHFDSIMSDQLAGHWYLLACGLPSYITLDQALSSLSIINEYN 1059
Query: 792 VLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIY 851
V G GAVNGM P+G D +S+QS E+W G +Y +A++M+ + + ++ G+
Sbjct: 1060 VKSYSNGSCGAVNGMRPEGGPDTTSLQSCEVWIGTSYGLASTMLLHFMDNEAWELIKGLV 1119
Query: 852 EAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAI 887
++ ++ G+ +QTPEAW+ + YR+ YMRPLAI
Sbjct: 1120 DSTYN--KWGFQYQTPEAWDQNGLYRAGTYMRPLAI 1153
>gi|291241331|ref|XP_002740565.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 792
Score = 604 bits (1558), Expect = e-170, Method: Compositional matrix adjust.
Identities = 351/887 (39%), Positives = 480/887 (54%), Gaps = 117/887 (13%)
Query: 21 SSFDKVDPAKPASLTWQRKL----STGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAA 76
S D P+ W+ KL + PL L K+ +L P+ R R
Sbjct: 7 SCADSTFPSTVPEFGWRIKLDHVFAEKRAPLRSLRL--KQIPELLPMFFRYAWYCRGVTK 64
Query: 77 KGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFS 136
KG+R +ID T S+GVPLGG+G G I R ++G+F RW + P + V +QF+
Sbjct: 65 KGRRPYIDQIHPLQSTQSYGVPLGGIGCGCINRGWKGDFCRWSLIPGMYTHDTVQTDQFT 124
Query: 137 VFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEP 196
V + R Y VL P T + +W+W +TYHALYPRAWTV+ P
Sbjct: 125 VCIRRGGRTTYQQVLSTSRP------TDQNLNTWNWGFNPSCATYHALYPRAWTVYH-LP 177
Query: 197 DPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFT 256
+ ++CRQ+SPI+PH+YK++S PV+VF + +YN G D+T++F+W N G S+ +
Sbjct: 178 GQNVTLLCRQVSPILPHDYKDTSLPVAVFVWEVYNHGNEPIDVTIMFSWQNGTGQKSDKS 237
Query: 257 GQHYNSKTKMNDG-VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGL 315
G +N MN+G V VLLH+ H T A++A D V VS F SG+ L
Sbjct: 238 GGRWNKPFTMNEGLVDGVLLHN--GHPTISCTMAISAVCKDDVSVSHKTAFDPSGDGL-- 293
Query: 316 TAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCP 375
++W ++ G D + S + PG AA+AA V S F LAWD P
Sbjct: 294 ---ELWQDLISDGELD--SEPGASFRTVPGELTAAAVAAKCHVAASSSCTAEFCLAWDMP 348
Query: 376 EVNFMSG-KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEW 434
V FM+ K + RRYT+++GT +A +++ A+ ++ WE +IE WQ PIL+++ LP W
Sbjct: 349 RVQFMAKEKIHNRRYTRWFGTDGDAGPSLSAYALTQYREWEHRIERWQNPILQNESLPSW 408
Query: 435 YPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAV 494
Y LFNELY+++ GG +W + + + L+ G D+ N
Sbjct: 409 YKSALFNELYFISDGGTIWIEHNKEHNGLINSG-----------------DINN------ 445
Query: 495 NILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSF 554
S+++Q G+F YLEG EY M+NTYDVHFY+SF
Sbjct: 446 -------SVIKQY----------------------GKFAYLEGHEYRMFNTYDVHFYASF 476
Query: 555 ALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVN 614
ALIML+P++QLS+Q D +DD +PW VN
Sbjct: 477 ALIMLWPQLQLSLQYD---------------IDD-------------------EPWVRVN 502
Query: 615 AYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIE 674
AY +YDTA WKDLN KFVLQVYRD T D + + +WP V V M +D DGDG+I+
Sbjct: 503 AYLMYDTADWKDLNLKFVLQVYRDYYVTKDFIYLQDMWPKVKVVMQTSLAWDTDGDGIID 562
Query: 675 NDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE 734
N G+ DQTYD WSV+G SAY GGLW+AA++A +A+ + + + + K YE
Sbjct: 563 NSGYADQTYDAWSVTGASAYCGGLWLAAVRAMMEIAKILDEEDQFKKYSDVLIQGKSSYE 622
Query: 735 -KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDE----DKARSALKKVYN 789
KLWNG Y+NYD+S S+SI DQ AGQWY RACGL+ DE D AL+ +Y
Sbjct: 623 KKLWNGKYYNYDSSKQKHSNSIMCDQAAGQWYLRACGLVQENDEVFPKDHILCALRAIYK 682
Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACG 849
NV+ G GAVNGM PDG D +S QS E+W+GVTYA+AA+MI E L + F TA G
Sbjct: 683 MNVMGFEKGTMGAVNGMRPDGSPDTTSFQSEEVWTGVTYALAANMIQEGLLEEAFTTASG 742
Query: 850 IYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
+Y+ + GLG FQTPEA+ ++ YRSL YMRPLAIWAMQ+A+ R
Sbjct: 743 VYKTCYERCGLG--FQTPEAYFANNCYRSLGYMRPLAIWAMQYAVER 787
>gi|156356077|ref|XP_001623757.1| predicted protein [Nematostella vectensis]
gi|156210486|gb|EDO31657.1| predicted protein [Nematostella vectensis]
Length = 783
Score = 604 bits (1558), Expect = e-170, Method: Compositional matrix adjust.
Identities = 357/849 (42%), Positives = 490/849 (57%), Gaps = 96/849 (11%)
Query: 73 EEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLA 132
+E G+ +IDP +GVPLGG+G G+IGR +RGEF RWQ+ P + V A
Sbjct: 6 KEKKNGRVPYIDPLAANPCRQVYGVPLGGIGCGTIGRGWRGEFNRWQLTPGIYSYNYVEA 65
Query: 133 NQFSVFVSRSNGQKYSSVLCPKTPE-VLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
NQF V V + Y +VL P P VL + G W+W G + YHALYPRAWT
Sbjct: 66 NQFVVCVRKKGRTTYQAVLSPNRPNGVLCGRSLQG---WNWGFSGSNAVYHALYPRAWTR 122
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
+E P ++ +VCRQ+SP+ PH+YK++S PV+VF ++I N+ +++++F++ N G
Sbjct: 123 YE-LPGQDIILVCRQVSPVFPHDYKDTSLPVAVFIWSIENNNNEEVEVSIMFSFENG-DG 180
Query: 252 DSEFTGQHYN-----SKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQ---ETDGVHVSLC 303
+ T HYN + + V VLLHH+ H P T A+AA+ + + V V+
Sbjct: 181 TQDLTAGHYNESADTTGSNTTCDVTGVLLHHK--HHKLPYTLAIAAKALHKKELVTVTTK 238
Query: 304 PHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSE 363
F + +W+++ + G + +S E S S S+ AA+AA+ TV S
Sbjct: 239 TWF-----DSRTPCRRVWNDLMDDGKLN--SSTEPSPPSRSDQSLCAAVAATTTVAARSR 291
Query: 364 GQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQ 422
G++ F+LAWD P + F S K +YR YT+++G NA + A++ + WE +IEAWQ
Sbjct: 292 GELEFALAWDMPVIYFGNSKKRHYRYYTRYFGHQGNAGPALCSHALMSYPDWETKIEAWQ 351
Query: 423 RPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKR 482
+PIL+D+ LP WY LFNELY++ GG VW
Sbjct: 352 KPILQDESLPNWYKSALFNELYFVADGGTVW----------------------------- 382
Query: 483 IVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLM 542
+DV +++ T S GT D IG+F YLEG EY M
Sbjct: 383 -LDVRSESGT-----------------------SKGT---ADIVRKIGRFAYLEGHEYRM 415
Query: 543 WNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPH 602
+NTYDVHFY+SFAL ML+PK+QLS+Q D A AV + D + + +G + +KV G +PH
Sbjct: 416 YNTYDVHFYASFALAMLWPKLQLSLQYDMAHAVNVSDNQVVMTMMNGHYCRKKVPGCIPH 475
Query: 603 DIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMA 660
DIG PW VNAY ++DT++WKDLN KFVLQVYRD V T D + + +WP M
Sbjct: 476 DIGDPSEAPWDHVNAYHIHDTSKWKDLNLKFVLQVYRDYVFTNDVYYLQDMWPITKTVMT 535
Query: 661 YMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREV---GDRG 717
+D DGDG+IEN G DQT+D W V+G SAY GGLW+AAL+ + +A + +RG
Sbjct: 536 KSMTYDSDGDGLIENSGLADQTFDAWPVTGPSAYCGGLWLAALRVMAEIATILDFPDERG 595
Query: 718 SEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVD 776
+ L + +KA YE+ LWNG Y+NYD+S S +SI ADQL+GQWY AC L +
Sbjct: 596 KYEKILARGKKA---YERLLWNGKYYNYDSSTSKYHNSIMADQLSGQWYLHACDLAQTSN 652
Query: 777 E-----DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVA 831
+ + SAL+ V+N+NV+K G GAVNG+ PDG++D SS+Q+ E+W+GVTYAVA
Sbjct: 653 DRVFPSENVISALRTVFNFNVMKFQEGTMGAVNGIRPDGQLDTSSLQAEEVWTGVTYAVA 712
Query: 832 ASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQ 891
ASMI E L D GF+TA GIY + LG FQTPEA + YRSL YMRPL+IWAMQ
Sbjct: 713 ASMIQEGLVDEGFKTASGIYNTCFE--RLGMNFQTPEAIVANGNYRSLAYMRPLSIWAMQ 770
Query: 892 WALTRPKPK 900
WAL + K K
Sbjct: 771 WALEKRKNK 779
>gi|348520215|ref|XP_003447624.1| PREDICTED: non-lysosomal glucosylceramidase-like [Oreochromis
niloticus]
Length = 856
Score = 592 bits (1525), Expect = e-166, Method: Compositional matrix adjust.
Identities = 349/856 (40%), Positives = 478/856 (55%), Gaps = 79/856 (9%)
Query: 63 IGVRILCLIREEAAKGKRA-FIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIF 121
+GVR L ++ K+A FID F + L +G PLGG+G G+I R +RGEF RWQ+
Sbjct: 62 LGVRYLKFWYKKTQVEKKAPFIDMFGAQPLRQIYGAPLGGIGGGTITRGWRGEFCRWQLN 121
Query: 122 PRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTY 181
P + K V+ NQF+V + R Y VL + P L+ W+W G+ + Y
Sbjct: 122 PGMYHYKTVIPNQFTVCLRREGKTVYQQVLSVERPPTLQ--------GWNWGYCGEYAFY 173
Query: 182 HALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITL 241
HALYPRAWTV++ P + + CRQISP+IPH+Y++SS PV+V + I N + D+++
Sbjct: 174 HALYPRAWTVYD-LPGQNVTLTCRQISPVIPHDYQDSSLPVAVLVWDIENKNEYDLDVSI 232
Query: 242 LFTWTNSVGGDSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGV 298
+FT N G + G H+N K + V VLLHH T+ + P T +AA+E
Sbjct: 233 MFTMVNGSGQKDDKCGGHWNEPFHLEKEGEAVSGVLLHHCTA--VNPYTLCIAAREQSDR 290
Query: 299 HVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTV 358
+S F G L W+++ + G D + +S + G + AA+A +V
Sbjct: 291 EISHQTAFSPKGPCFSL-----WNDLIDDGRLD--SPTGSSPPTVKGEKVAAAVAVGCSV 343
Query: 359 PPDSEGQVTFSLAWDCPEVNFMSG-KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQ 417
P + F LAWD P++ F S K + RRYT+++GT +A+ +++ A+ + WE
Sbjct: 344 PAQGHNSLEFCLAWDMPKMTFGSQEKEHTRRYTRYFGTKGDASPSLSHYALTHYKQWERS 403
Query: 418 IEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQ 477
IE WQRPIL+D LP WY LFNELY++ GG VWT+
Sbjct: 404 IEEWQRPILQDSSLPSWYKSALFNELYFVADGGTVWTE---------------------- 441
Query: 478 SDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEG 537
+ D + I + Q P + + G+F YLEG
Sbjct: 442 --------MTEDADISGGICSEDGGLPAQ---PAVI-------------KEYGRFAYLEG 477
Query: 538 IEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVL 597
EY M+NTYDVHFYSSFALIML+PK+ LS+Q D A +V+ D ++ L GQ+ K
Sbjct: 478 QEYRMYNTYDVHFYSSFALIMLWPKLALSVQYDIAGSVVQQDLTERLYLMSGQYSPVKTK 537
Query: 598 GAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSV 655
VPHDIG D PW VNAY ++DTA WKDLN KFVLQVYRD T D+++ + +WP
Sbjct: 538 NVVPHDIGDPDDEPWMRVNAYLIHDTAGWKDLNLKFVLQVYRDFHLTQDRQYLQDMWPIC 597
Query: 656 YVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD 715
M +FD DGDG+IEN G+ DQTYD W V+G SAY GG+WVA+L +AR +
Sbjct: 598 QAVMESELKFDLDGDGLIENSGYADQTYDGWIVTGPSAYCGGMWVASLCVMCKMARLLDS 657
Query: 716 RGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL--- 771
S ++ + +EK LWNG Y+NYD+SG S+S+ +DQ AG W+ RA GL
Sbjct: 658 EESYRHYKDLLDRGSAAFEKLLWNGKYYNYDSSGRELSNSVMSDQCAGHWFLRASGLGDG 717
Query: 772 -LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAV 830
++K +SALK V++ NV+ GG+ GAVNGM P+G D SS+QS E+W GV Y +
Sbjct: 718 EFQAFPKEKIQSALKSVFDLNVMSFAGGQMGAVNGMRPEGVPDHSSVQSDEVWIGVVYGL 777
Query: 831 AASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
AA+MIHE + + G +TA G Y W LG AFQTPEA+ + YRSL YMRPL+IWAM
Sbjct: 778 AATMIHEGMQEEGMRTAEGCYRTVWE--KLGMAFQTPEAYCEKNIYRSLAYMRPLSIWAM 835
Query: 891 QWAL-TRPKPKTLEKQ 905
Q AL T K +T Q
Sbjct: 836 QLALNTSQKDQTTTAQ 851
>gi|414871105|tpg|DAA49662.1| TPA: hypothetical protein ZEAMMB73_422318 [Zea mays]
Length = 589
Score = 590 bits (1522), Expect = e-165, Method: Compositional matrix adjust.
Identities = 314/568 (55%), Positives = 392/568 (69%), Gaps = 24/568 (4%)
Query: 320 MWHEIKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVN 378
MW+ + ++G FDR N S S+ S PG + AA++AS V P V F+LAW P+V
Sbjct: 1 MWNTMLQNGHFDRENFSAGPSMPSSPGQKLCAAVSASTWVEPHGRCTVVFALAWSSPKVK 60
Query: 379 FMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPIT 438
F G TY RRYT+FYGT + +A N+ DA+ ++ WE +IE WQ PIL+D+RLPEWY T
Sbjct: 61 FQKGCTYNRRYTQFYGTSEKSAVNLVHDALTKYKLWEEEIEKWQNPILKDERLPEWYKFT 120
Query: 439 LFNELYYLNAGGAVWTDGSPPV---HSLVTIGHRKFSL----DWSQSDLK------RIVD 485
LFNELY+L AGG VWTDG PP H+K S D +Q +K +
Sbjct: 121 LFNELYFLVAGGTVWTDGQPPAIDEKKSPGFNHQKSSKRGTKDTNQGSVKDRHVNLAVEQ 180
Query: 486 VP------NQNDTAVNILERM--SSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEG 537
VP N +D +V+ + S + EQI + + +DG E++G+FLYLEG
Sbjct: 181 VPHGGYMANGDDHSVSKFAAVHGSEMQEQI-NGLKSEEPIPYLISKDGPEHVGKFLYLEG 239
Query: 538 IEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVL 597
+EY+MWNTYDVHFY+SFAL+ LFPKI+LSIQRDFA AV+ D K+K L DG RKV
Sbjct: 240 VEYIMWNTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRKVKFLADGTSGIRKVK 299
Query: 598 GAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYV 657
GAVPHD+G DPW E+NAY ++DT++WKDLNPKFVLQ+YRD ATGD +F + VWP+V
Sbjct: 300 GAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAATGDMQFGRDVWPAVCA 359
Query: 658 AMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
AM YMDQFDRDGDG+IENDGFPDQTYD W+V GISAY G LW+AALQAA+ +A +GDR
Sbjct: 360 AMDYMDQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRH 419
Query: 718 SEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVD 776
+ + KF KAK VYE KLWNGSYFNYD+ SS S SIQADQLAGQWYA + GL P+ D
Sbjct: 420 FAEKYKLKFIKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPLFD 479
Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
E K R+AL+K++ +NV+KV GG+ GAVNGM P G+VD + MQSREIW+GVTYAVAA+M+
Sbjct: 480 EHKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLL 539
Query: 837 EDLADIGFQTACGIYEAAWSGTGLGYAF 864
+ GF TA GI+ A WS G GYA
Sbjct: 540 HGMEHQGFTTAEGIFTAGWSEEGYGYAL 567
>gi|334333160|ref|XP_003341682.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal
glucosylceramidase-like [Monodelphis domestica]
Length = 950
Score = 589 bits (1519), Expect = e-165, Method: Compositional matrix adjust.
Identities = 342/829 (41%), Positives = 466/829 (56%), Gaps = 80/829 (9%)
Query: 79 KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
K FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+A+QF+V
Sbjct: 138 KIPFIDMLNFLPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGIYQHQTVIADQFTVC 197
Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
+ R Y VL + P VL+ SW+W L G + YHALYPRAWTV++ P
Sbjct: 198 LRRGGQTVYQQVLSLERPSVLR--------SWNWGLCGFFAFYHALYPRAWTVYQ-LPGQ 248
Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
++ + CRQI+PI+PH+Y++SS PV VF + + N+ DI+++F+ N +GG + G
Sbjct: 249 QVTLTCRQITPILPHDYQDSSLPVGVFVWDVENNSDEDLDISIMFSMRNGLGGGHDSPGG 308
Query: 259 HYNSKTKM-NDG--VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGL 315
+N + +DG V +LLHH P T ALAA+ T V+ I+
Sbjct: 309 LWNEPFCLEHDGETVQGLLLHHPNPPN--PYTMALAARLTANTTVTH-----ITAFDPXC 361
Query: 316 TAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCP 375
T + +W ++ D + E ++ G IG A+ A +P Q+ F+LAWD P
Sbjct: 362 TGQKVWQDLLPRXQLDSPLA-EWGPPTQKGQGIGGAVCAGTHLPRRGRCQLEFALAWDMP 420
Query: 376 EVNFMS-GKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEW 434
++ F + G+ +YRRYT+F+G NAA + A+ ++ SWE +IEAWQ P+LED+ LP W
Sbjct: 421 KILFGARGQAHYRRYTRFFGQEGNAAPALCHYALSQYRSWEERIEAWQNPVLEDRSLPAW 480
Query: 435 YPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAV 494
Y LFNELY+L GG +W V+VP + T
Sbjct: 481 YKSALFNELYFLADGGTMW------------------------------VEVPAETTT-- 508
Query: 495 NILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSF 554
E IL Q LQ + G+F YLEG EY M+NTYDVHFY+SF
Sbjct: 509 ---EEFGGILNQ---------------LQPTLQEYGRFAYLEGQEYRMYNTYDVHFYASF 550
Query: 555 ALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFE 612
ALIML+PK++LS+Q D A A + D ++ K L G K +PHD+G D PW
Sbjct: 551 ALIMLWPKLELSLQYDMAVATLSEDLTRRKYLMSGVMAPVKKKNVIPHDVGDPDDEPWLR 610
Query: 613 VNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGM 672
+NAY ++DTA WKDLN KFVLQ+YRD TG++ F + +WP M +FD+D DG+
Sbjct: 611 INAYLIHDTADWKDLNLKFVLQIYRDFYLTGNESFLRDMWPVCQAVMESEMKFDKDQDGL 670
Query: 673 IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVV 732
IEN G+PDQTYD W +G SAY GGLW+AA+ +A G + ++ FL + +
Sbjct: 671 IENGGYPDQTYDGWITTGPSAYCGGLWLAAVAVMVQMATVCGTQDIQEKFLSILSRGREA 730
Query: 733 YEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKARSALKKV 787
YE+ LWNG Y+NYD+S QS I +DQ AGQW+ RACGL + AL+ +
Sbjct: 731 YERLLWNGRYYNYDSSTQPQSRIIMSDQCAGQWFLRACGLGEGDTEVFPTPHVIRALQTI 790
Query: 788 YNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTA 847
+ NV GG GAVNGM PDG D SS+QS E+W GV Y +AA+MI E L GF+TA
Sbjct: 791 FEVNVQGFAGGAMGAVNGMQPDGIPDTSSVQSDEVWVGVVYGLAATMIQEGLIQEGFRTA 850
Query: 848 CGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
G Y W LG AFQTPEA+ +RSL YMRPL+IWAMQ AL +
Sbjct: 851 EGCYRTVWE--RLGMAFQTPEAYCQHRVFRSLAYMRPLSIWAMQLALQQ 897
>gi|410925034|ref|XP_003975986.1| PREDICTED: non-lysosomal glucosylceramidase-like [Takifugu
rubripes]
Length = 827
Score = 589 bits (1518), Expect = e-165, Method: Compositional matrix adjust.
Identities = 351/876 (40%), Positives = 487/876 (55%), Gaps = 85/876 (9%)
Query: 37 QRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRA-FIDPFIKRHLTSSH 95
++ +G++ LSQ W+ +G+R L ++ K+A FID F L +
Sbjct: 22 RKPFQSGDVSLSQV---WEHL----GLGIRYLYWWYKKTRVEKKAPFIDIFGAVPLRQIY 74
Query: 96 GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKT 155
G PLGG+G G+I R +RGEF RWQ+ P + K V+ NQF V + R+ Y VL +
Sbjct: 75 GAPLGGIGGGTITRGWRGEFCRWQLKPGMYHYKTVIENQFMVCLRRNGQTVYQQVLSVER 134
Query: 156 PEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNY 215
P L+ W+W G+ + YHALYPRAW V+ P + + CRQISPIIPH+Y
Sbjct: 135 PPTLQ--------GWNWGFCGEYAYYHALYPRAWNVYH-LPGQNVTLTCRQISPIIPHDY 185
Query: 216 KESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKM---NDGVHA 272
+++S PV+VF + I N + D++++FT N G + +G H+N ++ + V
Sbjct: 186 QDASLPVAVFVWDIENKNDYALDVSIMFTLLNGSGHADDKSGGHWNEPFRLEKNGEAVSG 245
Query: 273 VLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDR 332
VLLHH T+ + P T +AA+E V+ F G GL W ++ G D
Sbjct: 246 VLLHHCTA--VNPYTLCIAAREQPKREVTHQTAFSPKGTCHGL-----WSDLITDGRLD- 297
Query: 333 LNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG-KTYYRRYTK 391
+ +S + G + AA+AA +V S G + F LAWD P++ F SG K + RRYT+
Sbjct: 298 -SPTGSSPPTAKGERVAAALAAGCSVATQSRGSLEFCLAWDMPKITFGSGEKEHSRRYTR 356
Query: 392 FYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGA 451
+YG+ +A ++ A+ + WE IE WQ PIL+D LP WY LFNELY++ GG
Sbjct: 357 YYGSKGDACPLLSHHALTHYRGWEKSIEEWQGPILQDSSLPTWYKSALFNELYFVADGGT 416
Query: 452 VWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPV 511
VWT+ L DV + L S+++
Sbjct: 417 VWTE------------------------LAEDADVSGGLRSEEGGLPAQPSVIK------ 446
Query: 512 ALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDF 571
+ G+F YLEG EY M+NTYDVH+Y+SFALIML+PK+ LS+Q D
Sbjct: 447 ----------------DYGRFAYLEGQEYRMYNTYDVHYYASFALIMLWPKLALSLQYDI 490
Query: 572 AAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNP 629
A +V+ DP++ L +G+ K G VPHDIG + PW VNAY +++TA WKDLN
Sbjct: 491 AGSVVQQDPTERLYLMNGRCSPVKTKGVVPHDIGDPEDEPWQRVNAYLIHNTADWKDLNL 550
Query: 630 KFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVS 689
KFVLQVYRD T D ++ + +WP M +FD DGDG+IEN GF DQTYD W+V+
Sbjct: 551 KFVLQVYRDFHLTQDSQYLQDMWPICQTVMESALKFDLDGDGLIENSGFADQTYDGWTVT 610
Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSG 748
G SAY GGLW+A+L +AR + + Y+ + V Y+K LWNG Y+NYD+SG
Sbjct: 611 GPSAYCGGLWLASLVVMCKMARLLENEMKYQYYRDILDRGSVAYDKVLWNGKYYNYDSSG 670
Query: 749 SSQSSSIQADQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVN 804
S+S+ +DQ AG W+ RA GL ++K + ALK V++ NV+ GG+ GAVN
Sbjct: 671 KDHSNSVMSDQCAGHWFLRASGLGDGEYQAFPKEKIQRALKTVFDLNVMSFAGGQMGAVN 730
Query: 805 GMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAF 864
GM P+G D SS+QS E+W GV Y +AA+MIHE + + G +TA G Y W LG AF
Sbjct: 731 GMRPEGVPDRSSVQSDEVWIGVVYGLAATMIHEGMLEEGMRTAEGCYRTVWE--RLGMAF 788
Query: 865 QTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPK 900
QTPEA+ YRSL YMRPL+IWAMQ AL + K
Sbjct: 789 QTPEAYCEKGIYRSLAYMRPLSIWAMQLALDASRNK 824
>gi|196002317|ref|XP_002111026.1| hypothetical protein TRIADDRAFT_54543 [Trichoplax adhaerens]
gi|190586977|gb|EDV27030.1| hypothetical protein TRIADDRAFT_54543 [Trichoplax adhaerens]
Length = 843
Score = 588 bits (1517), Expect = e-165, Method: Compositional matrix adjust.
Identities = 352/889 (39%), Positives = 492/889 (55%), Gaps = 109/889 (12%)
Query: 55 KETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGE 114
K+ + +G+R I + FID F KR + +GVP+GG+G G+I R + G+
Sbjct: 28 KQVLVSLKMGLRYGMYILHNRFHSREPFIDLFGKRQVDRLYGVPIGGLGCGTIMRGWNGQ 87
Query: 115 FQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNL 174
F RWQ+ P V A+QF+V + ++ Y VL P P ++A +G W W
Sbjct: 88 FCRWQLRPGWYTYDIVHADQFTVSIRKNGEPSYQQVLSPCRP------SSAQLGDWFWGY 141
Query: 175 KGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGK 234
G ++TY ALYPRAWT + P + + CRQISPIIPHNYK+S PV F + I N
Sbjct: 142 HGSQATYRALYPRAWTTYN-LPGQNVTLTCRQISPIIPHNYKDSCLPVGCFIWEIDNQSD 200
Query: 235 TSADITLLFTWTNSVGGDSEFTGQHYN------SKTKMNDG-VHAVLLHHRTSHQLPPVT 287
+++++FT+ N G + + G H N S MN+ + VLLHHR S L P T
Sbjct: 201 DEIEVSIMFTFQNGDGSEHDSAGGHQNETFTRSSNENMNNSKISGVLLHHRAS--LQPCT 258
Query: 288 FALAAQETDGVHVSLCPHF-----VISGNSLGLTAKDMWHEIKE----HGSFDRLNSMET 338
F + A+E CP V NS+ K EI HG + NS T
Sbjct: 259 FTICAKE--------CPETGDYPAVSVSNSVSFDPKGSGREISRQLLSHGKLE--NSSNT 308
Query: 339 SVTSEP----GSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKT-YYRRYTKFY 393
T P G IGAA+A + V P + ++ F+L WD PEV F + + Y RRYT+F+
Sbjct: 309 KATQSPVSKKGEIIGAAVATTAIVKPQGKAKMEFALTWDQPEVYFDAKEIKYRRRYTRFF 368
Query: 394 ---GTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGG 450
G+H+ AA++A A+ SWE +I+ WQ PILE++ P+WY +FNELYY++ G
Sbjct: 369 NPNGSHR--AADLAVYALQNVKSWEEKIDQWQSPILENESYPDWYKSAIFNELYYMSDSG 426
Query: 451 AVW--TDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIY 508
++W TDG D ++ + A + L+ + +QI+
Sbjct: 427 SIWAETDG----------------------------DEDDETNLA-SCLKEVPGKGKQIH 457
Query: 509 TPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQ 568
+ G+F YLEG EYLM+NTYDVHFY+SFAL L+P I+LSIQ
Sbjct: 458 ------------------KEYGRFAYLEGHEYLMYNTYDVHFYASFALAQLWPMIELSIQ 499
Query: 569 RDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG-ICD-PWFEVNAYCLYDTARWKD 626
DFAA++M DP ++ G RK G VPHDIG CD PW +NAYC+ D W D
Sbjct: 500 YDFAASIMHEDPELRLIMWTGLRKPRKSYGFVPHDIGEPCDEPWKRINAYCISDVQDWTD 559
Query: 627 LNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTW 686
LN KFVLQ+YRD ATG+ +F K +WP M D DGDG+I+N+G DQTYD W
Sbjct: 560 LNTKFVLQIYRDYCATGNLEFLKDMWPCAKHVMETAISQDTDGDGIIDNNG-ADQTYDVW 618
Query: 687 SVSGISAYSGGLWVAALQAASALAREVG---DRGSEDYFLFKFQKAKVVYEKLWNGSYFN 743
+ G SAY GG+++AAL +A +G D+ L + + + +KLWNG Y+
Sbjct: 619 RMYGASAYCGGIFLAALYIMYKIADLIGCKEDKAKYSKILLRGRDS--FQQKLWNGDYYL 676
Query: 744 YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDED-----KARSALKKVYNYNVLKVMGG 798
YD+S + S SI ADQ+ G WY +ACGL+ ++D + AL+KV++YNV+K M G
Sbjct: 677 YDSSKGNHSDSIMADQMCGHWYLQACGLVNNNEDDIFPKSQVDKALRKVFDYNVMKFMNG 736
Query: 799 KRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGT 858
GAVNGM P+G +D SS+QS E+W+GVTYA+AA++I + + GF+ A G+Y+ ++
Sbjct: 737 TWGAVNGMKPNGEIDTSSVQSEEVWTGVTYALAANLIQQGMIKEGFKVAEGMYDTTFNVM 796
Query: 859 GLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMK 907
GLG FQTPEA+ ++YR YMRPL+IWA++WA K +K K
Sbjct: 797 GLG--FQTPEAYYKINRYRCTGYMRPLSIWAIEWAFNHVFKKNTKKITK 843
>gi|260813884|ref|XP_002601646.1| hypothetical protein BRAFLDRAFT_124315 [Branchiostoma floridae]
gi|229286945|gb|EEN57658.1| hypothetical protein BRAFLDRAFT_124315 [Branchiostoma floridae]
Length = 871
Score = 584 bits (1505), Expect = e-164, Method: Compositional matrix adjust.
Identities = 333/834 (39%), Positives = 460/834 (55%), Gaps = 120/834 (14%)
Query: 79 KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
KR FID L +GVPLGG+G GSI R ++G+F RW + P + +ANQF+V
Sbjct: 129 KRPFIDALNPVQLQQIYGVPLGGLGCGSICRGWKGDFCRWALLPGIYSYDIKMANQFTVC 188
Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
V R Y VL + P L + W+W+ + YHALYPRAWTV+E P
Sbjct: 189 VRRRGQTTYQQVLSTRHPSCLTE--------WNWSYSPSHAYYHALYPRAWTVYE-LPGQ 239
Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
+ + CRQ+SP++PH+YK+SS P +VF +T N GK DI+L+FT+ NS+G D++ +G
Sbjct: 240 NITLTCRQVSPVLPHDYKDSSIPAAVFVWTAENHGKEEVDISLMFTFQNSIGADTDLSGG 299
Query: 259 HYNSKTKMN----DGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG 314
H+N +N D V +LLHH+ H+ P T AL +G+ V+ F +G
Sbjct: 300 HWNEPFSVNEEGVDPVSGILLHHQ--HKTMPCTIALGIPHKEGIRVTGSTSFNPAG---- 353
Query: 315 LTAKDMWHEIKEHGSFDRLNSMETSVT-SEPGSSIGAAIAASVTVPPDSEGQVTFSLAWD 373
+ K++W ++ G L S E + G ++ A+ AS +P + F L+WD
Sbjct: 354 -SGKEIWRDLMYDG---ELGSPEAPTNKTVKGEALAGAVCASFKLPSGGKQDTDFVLSWD 409
Query: 374 CPEVNFM-SGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLP 432
P + F+ S +TY R YT ++G AA +A A+ + WE IE WQRPIL+++ P
Sbjct: 410 MPVIKFVGSCRTYKRWYTGWFGDDGTAAPKLAAYALSHYPEWESTIEEWQRPILDNQSYP 469
Query: 433 EWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDT 492
EWY LFNELY+++ GG +W V ND
Sbjct: 470 EWYRSALFNELYFMSDGGTIWL-------------------------------VAEDNDG 498
Query: 493 AVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYS 552
V + P L ++++ G+F YLEG EY M+NTYDVH Y+
Sbjct: 499 HV-----------ENNLPPWLREKTPAHIME-----YGRFGYLEGHEYRMYNTYDVHHYA 542
Query: 553 SFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFE 612
SFAL ML+P+++ SIQ D+ DD +P+
Sbjct: 543 SFALAMLWPQLEYSIQYDY---------------DD-------------------EPFIH 568
Query: 613 VNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGM 672
VNAY ++DT+ W+DLN KFVLQ++RD DK F +WP M +D D DGM
Sbjct: 569 VNAYVMHDTSDWRDLNLKFVLQIFRDYFVGRDKGFLAFMWPRAQAVMTKALTWDTDQDGM 628
Query: 673 IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVV 732
I+N G DQTYD W V+G SAY GGLW+AAL+ +A +G E ++L Q+AK
Sbjct: 629 IDNSGAADQTYDAWIVTGASAYCGGLWLAALRCMVEIANILGKEEEEQHYLAVMQRAKTA 688
Query: 733 Y-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLP----IVDEDK-------A 780
Y EKLWNG Y+NYD SG S S+SI A +G WY RACGL+P ++D D+
Sbjct: 689 YEEKLWNGEYYNYDCSGRSYSNSIMAGAGSGHWYLRACGLVPEDNEVMDSDREVLPLDHV 748
Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
S+L+KVY++NV++ G GAVNGM P+G+ D++SMQS E+W+G+TYA+AASMI E +
Sbjct: 749 ASSLRKVYDFNVMQFHQGTMGAVNGMRPNGKKDLTSMQSEEVWTGITYALAASMIQEGMV 808
Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
+ GFQTA G+Y + G+AFQTPEA+ D +RSL YMRPLAIWAMQWAL
Sbjct: 809 EEGFQTAYGVYNMCY--LQCGFAFQTPEAYLERDFFRSLGYMRPLAIWAMQWAL 860
>gi|198424051|ref|XP_002127036.1| PREDICTED: similar to Non-lysosomal glucosylceramidase (NLGase)
(Glucosylceramidase 2) (Beta-glucocerebrosidase 2)
(Beta-glucosidase 2) [Ciona intestinalis]
Length = 861
Score = 583 bits (1502), Expect = e-163, Method: Compositional matrix adjust.
Identities = 334/849 (39%), Positives = 481/849 (56%), Gaps = 75/849 (8%)
Query: 63 IGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFP 122
+ VR L + K K+ +D F + L +GVPLGG+GSG+I R ++G+F +WQ+ P
Sbjct: 74 LAVRYLQYYNQLRRKNKKPCMDFFNQHPLRPIYGVPLGGIGSGTINRGWQGDFCKWQLKP 133
Query: 123 RVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYH 182
+ LA+QF V + R N VL K P + + +W+W G + Y+
Sbjct: 134 GIYTFDCPLADQFIVSIRRHNTVVCQQVLSTKGP------SGEALRTWNWKYCGSYAYYN 187
Query: 183 ALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLL 242
LYPRAWTV++ + + + CRQISP+ P++YK++S P +VF + + N G ++T++
Sbjct: 188 GLYPRAWTVYDLQGQ-NIVLTCRQISPVFPNDYKDTSLPAAVFVWDVENRGNEEVEVTIV 246
Query: 243 FTWTNSVGG---DSEFTG--QHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDG 297
+ N G + +G S T N+ V++H+ T +P T +AA E G
Sbjct: 247 MSLENGTGNPEQQKKVSGLWNEPFSNTDHNNKAKGVVMHNTTYRDMP-YTLTVAAAEKPG 305
Query: 298 VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSM--ETSVTSEPGSSIGAAIAAS 355
V V+ C F + N ++W+ + G RL S ++S+T+ G A+AAS
Sbjct: 306 VDVTHCTSFNPTHN-----GAEVWNSLLNTG---RLQSKPGKSSLTAT-GEHTACAVAAS 356
Query: 356 VTVPPDS-EGQVTFSLAWDCPEVNFM-SGKTYYRRYTKFYGTHQNAAANIARDAILEHGS 413
V P S G + F+L WD P+++F +GK Y RRYT+++ NA A IA A+ + +
Sbjct: 357 CKVKPHSMAGSMDFALTWDMPQIHFGDAGKHYKRRYTRWFNG-DNAGATIASYALDNYQT 415
Query: 414 WELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSL 473
WE +IEAWQRPIL++K+LP+WY +FNELYY+ GG VW
Sbjct: 416 WEDKIEAWQRPILDNKQLPDWYKSAIFNELYYITDGGTVW-------------------- 455
Query: 474 DWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFL 533
++V + + N + E + ++S D + G+F
Sbjct: 456 ----------LEVAEEENGTAN-----GKVPETLQNTTKVDSC-------DFLKEYGKFA 493
Query: 534 YLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVS 593
YLEG EY M+NTYDVHFY+S AL L+PK++LS+Q D A +++ +P K L DG
Sbjct: 494 YLEGQEYKMYNTYDVHFYASIALAYLWPKLELSVQYDIATSILHSNPQPHKYLMDGVTAP 553
Query: 594 RKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAV 651
K VPHD+G + PW VN Y ++DTA W+DLNPKFVLQ YRD T D F KA+
Sbjct: 554 VKTPNVVPHDVGCPEDEPWLRVNTYFVHDTADWRDLNPKFVLQAYRDYYITKDIDFLKAM 613
Query: 652 WPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAR 711
WP + M + D+D DG+IEN G DQT+D W V+G SAY GGLW+AAL+ A
Sbjct: 614 WPICKIVMEQSMRHDKDNDGLIENSGAADQTFDGWCVTGPSAYCGGLWLAALRCMEEAAD 673
Query: 712 EVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQ-SSSIQADQLAGQWYARAC 769
+ ++ + F +A Y+ +LWNG YFNYD S + Q +S+ ADQ AGQW+ A
Sbjct: 674 ILHEKDDQVKFHDILVRATKAYQDRLWNGKYFNYDCSNNPQHRTSVMADQCAGQWFLLAS 733
Query: 770 GLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYA 829
G+ ++ +D+ S+LK +++ NV + GG+ GAVNGM P G VD SS+QS E+W+GVTYA
Sbjct: 734 GICDVLPDDQVMSSLKTIFSLNVEQYDGGRHGAVNGMRPHGAVDTSSVQSDEVWTGVTYA 793
Query: 830 VAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWA 889
+AA+MIH+ + GF+TA GIY + W G+ AFQTPEA + YRSL YMRPL+IW
Sbjct: 794 LAATMIHKGMLKEGFRTASGIYRSCWQRYGM--AFQTPEAMRKNHTYRSLAYMRPLSIWG 851
Query: 890 MQWALTRPK 898
MQ+AL PK
Sbjct: 852 MQYALENPK 860
>gi|351707038|gb|EHB09957.1| Non-lysosomal glucosylceramidase [Heterocephalus glaber]
Length = 922
Score = 582 bits (1500), Expect = e-163, Method: Compositional matrix adjust.
Identities = 349/869 (40%), Positives = 476/869 (54%), Gaps = 83/869 (9%)
Query: 46 PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
P ++ V+ +G+R L R+ + K+ FID L +G PLGG+G
Sbjct: 92 PFQANNISLSNLVKHLGMGIRYLKWWYRKTQVEKKKPFIDMINCVPLRQIYGCPLGGIGG 151
Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
G+I R +RG+F RWQ+ P + + + V+A+QF+V + R Y VL + P VL
Sbjct: 152 GTITRGWRGQFCRWQLNPGIYQHQTVIADQFTVCLRREGRTVYQQVLSMERPSVLH---- 207
Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
SW+W L G + YHALYPRAWTV++ P + + CRQI+PI+PH+Y++SS PV V
Sbjct: 208 ----SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQNITLTCRQITPILPHDYEDSSLPVGV 262
Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSH 281
F + + N G + D++++F+ N +GG+ + G +N + DG V +LLHH T
Sbjct: 263 FVWDVENEGDEAVDVSIMFSMRNGLGGEDDAQGGLWNEPFCLEQDGETVQGLLLHHPT-- 320
Query: 282 QLP-PVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDR-LNSMETS 339
LP P T A+AA+ T+ V+ I+ T +W ++ + G D S
Sbjct: 321 -LPNPYTMAVAARLTEDSTVTH-----ITAFDPDSTGHQVWQDLLQDGQLDSPAGDGGRS 374
Query: 340 VTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQN 398
S+ G+ I A+ S +PP + FSLAWD P + F G+ +YRRYT+F+G +
Sbjct: 375 NPSQKGAGIAGALCVSGKLPPRGRCSLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGPDGD 434
Query: 399 AAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
AA ++ A+ + WE +I AWQ P+L+D+ LP WY LFNELY+L GG VW
Sbjct: 435 AAPALSHYALCHYADWEDRISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW----- 489
Query: 459 PVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFG 518
++VP + P L S G
Sbjct: 490 -------------------------LEVPEDS------------------LPEELVGSMG 506
Query: 519 TNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMH 578
L+ G+F YLEG EY M+NTYDVHFY+SFAL+ML+PKI+LS+Q D A A +
Sbjct: 507 Q--LRPILREYGRFGYLEGQEYRMYNTYDVHFYASFALVMLWPKIELSLQYDMALATLRE 564
Query: 579 DPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVY 636
D ++ + L G K +PHDIG D PW VNAY ++DTA WKDLN KFVLQVY
Sbjct: 565 DLTRRRYLMSGVMGPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVY 624
Query: 637 RDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSG 696
RD TGD+ F K +WP M +FD+D DG+IEN G+ DQTYD W +G SAY G
Sbjct: 625 RDYYLTGDQGFLKDMWPVCLAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCG 684
Query: 697 GLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSI 755
GLW+AA+ +A + +D F + + YE+ LWNG Y+NYD+S QS S+
Sbjct: 685 GLWLAAVAVTVQMAVLCAAQDVQDKFSSILSRGQEAYERLLWNGRYYNYDSSSQPQSRSV 744
Query: 756 QADQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
+DQ AGQW+ +ACGL + AL+ ++ NV GG GAVNGM P G
Sbjct: 745 MSDQCAGQWFLKACGLGEGDTEVFPTTHVVRALQTIFELNVQSFAGGAMGAVNGMQPHGI 804
Query: 812 VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWN 871
D SS+QS E+W GV Y +AA+MI E L GFQTA G Y W GL AFQTPEA+
Sbjct: 805 PDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYC 862
Query: 872 TDDQYRSLCYMRPLAIWAMQWALTRPKPK 900
+RSL YMRPL+IWAMQ AL + K K
Sbjct: 863 QQQVFRSLAYMRPLSIWAMQLALQQQKHK 891
>gi|405977906|gb|EKC42333.1| Non-lysosomal glucosylceramidase [Crassostrea gigas]
Length = 1464
Score = 578 bits (1489), Expect = e-162, Method: Compositional matrix adjust.
Identities = 329/853 (38%), Positives = 474/853 (55%), Gaps = 83/853 (9%)
Query: 56 ETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEF 115
+ + +G+R + +G++ FID + + +G P+GG+G G++GR YRGEF
Sbjct: 675 QVIGFIGLGIRYAKEWWKRRRQGRKMFIDHMDQLFHKAIYGAPIGGIGCGTMGRGYRGEF 734
Query: 116 QRWQIFPRVCEDKPVLANQFSVFVSRSNGQK-YSSVLCPKTPEVLKDTTAAGIGSWDWNL 174
R+Q+ P + + + ANQF + R NGQ Y VL + ++ + SW+W
Sbjct: 735 ARFQMVPGIYDHTVIQANQF-ILCIRKNGQTVYQKVLTGQRE------SSKSLRSWEWQA 787
Query: 175 KGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGK 234
+ YHALYPR+WTV+ + + LR++CRQ+SPI PH+YK++S+P++ F +T N+G
Sbjct: 788 PREGDVYHALYPRSWTVYHLD-EFNLRLICRQVSPIFPHDYKDTSFPMATFIWTAENNGT 846
Query: 235 TSADITLLFTWTNSVGGDSEFTGQHYNSKTKMND---GVHAVLLHHRTSHQLPPVTFALA 291
D++++FT+ N G + G +N N V V ++H Q T+ +
Sbjct: 847 EPLDVSIMFTFKNGRGVKDDGNGGCHNHVFDENSEGRQVSGVSINHNI--QDMKCTYCIG 904
Query: 292 AQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAA 351
D V V+ +F G T +++W ++KE G + +T+ T + G AA
Sbjct: 905 GVHKDDVSVTCKEYFDPRG-----TGQEVWEDLKEDGQLN-TEGGKTTCTGK-GEESAAA 957
Query: 352 IAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYY-RRYTKFYGTHQNAAANIARDAILE 410
+ VPP + F+L WD P ++F + + +Y RRY++++G A ++ A
Sbjct: 958 VCVKANVPPKQCKDLEFTLTWDMPIIHFKAKERFYARRYSRWFGLDGEAGPRMSSYAAAN 1017
Query: 411 HGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRK 470
+ +WE +IE WQ+P+L++K LP WY LFNELYY++ GG VW D PV
Sbjct: 1018 YSTWEQKIEEWQKPVLQNKNLPAWYKSALFNELYYVSDGGTVWID---PVEG-------- 1066
Query: 471 FSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIG 530
L R+ D + P+ +
Sbjct: 1067 -----QGGKLVRVTDPKD--------------------PPIV--------------QEFS 1087
Query: 531 QFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQ 590
+F YLEG EY M+NTYDVH Y+SF+LIML+PK+Q+S+Q D A AV D +++K L G+
Sbjct: 1088 RFGYLEGHEYRMYNTYDVHHYASFSLIMLWPKLQISLQYDIATAVESEDSTRIKFLTSGE 1147
Query: 591 WVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFA 648
RK L AVPHD+G + PW VN Y ++ T WKDLN KFVLQ YRD AT D +
Sbjct: 1148 VGIRKRLHAVPHDMGDPEDEPWNRVNCYVIHPTCNWKDLNMKFVLQTYRDYSATKDDAYL 1207
Query: 649 KAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASA 708
+ ++P M ++D D DG+I+N GF DQT+D W+++G SAY GG+W+AAL+
Sbjct: 1208 RHMYPVAKYVMETAKKWDVDDDGIIDNGGFADQTFDAWTMTGASAYCGGMWLAALRMMIE 1267
Query: 709 LAREVGDRGSEDYFLFK--FQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
+A + + +D +K K KV Y+ KLWNGSY+NYD S SI ADQLAG W+
Sbjct: 1268 MATIL--KKEDDVLHYKEILDKGKVSYQNKLWNGSYYNYDCSSGGHHDSIMADQLAGHWF 1325
Query: 766 ARACGLL--PIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIW 823
+A GL I D+ +S+LKK++ NV+ GG GA+NG PDG D+SS QS E W
Sbjct: 1326 LKASGLQDDDIFPPDRVKSSLKKIFENNVMMFEGGNMGAINGTRPDGSKDISSCQSEEFW 1385
Query: 824 SGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMR 883
GVTY +AA+MI E L D FQTA G Y W GL AFQTPEA+ TD+ YRSL YMR
Sbjct: 1386 VGVTYGLAANMIQEGLLDQAFQTAWGAYHVCWEWYGL--AFQTPEAYMTDNIYRSLGYMR 1443
Query: 884 PLAIWAMQWALTR 896
PLAIW+MQWAL +
Sbjct: 1444 PLAIWSMQWALEK 1456
>gi|354485725|ref|XP_003505033.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Cricetulus griseus]
Length = 898
Score = 576 bits (1485), Expect = e-161, Method: Compositional matrix adjust.
Identities = 334/828 (40%), Positives = 464/828 (56%), Gaps = 84/828 (10%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K FID F L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 101 RKTQVEKKTPFIDMFNSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHQTVI 160
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
A+QF+V + R+ Y VL + P VL+ SW+W L G + YHALYPRAWTV
Sbjct: 161 ADQFTVCLRRNGRTVYQQVLSLERPSVLR--------SWNWGLCGHFAFYHALYPRAWTV 212
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P + + CRQI+PI+PH+Y++S PV VF + + N G + D++++F+ N +GG
Sbjct: 213 YQ-LPGQSVTLTCRQITPILPHDYQDSCLPVGVFVWDVENQGDETLDVSIMFSMRNGLGG 271
Query: 252 DSEFTGQHYNSKTKMN-DG--VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI 308
+ + G +N + DG V +LLHH T P T A+AA++T H++
Sbjct: 272 EDDALGGLWNEPFCLQRDGKTVQGLLLHHPTPPN--PYTMAVAARDTTVTHIT-----AF 324
Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
+S T + +W + + G D + S S+ G + A+ S +PP + + F
Sbjct: 325 DPDS---TGQQVWKHLLQDGQLD--SPAGQSTPSKKGEGVAGAVCISSKLPPRGQCCLEF 379
Query: 369 SLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
SLAWD P + F G+ +YRRYT+F+G+ + A ++ A+ ++ WE +I AWQ P+L+
Sbjct: 380 SLAWDMPRIMFGAKGQIHYRRYTRFFGSDGDVAPALSHYALCQYADWENRISAWQSPVLD 439
Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
D+ LP WY LFNELY+L GG VW ++VP
Sbjct: 440 DRSLPAWYKSALFNELYFLADGGTVW------------------------------LEVP 469
Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
+ + E + + Q+ +LQD G+F YLEG EY M+NTYD
Sbjct: 470 EDS-----LPEELGENMYQL-----------RPILQD----YGRFGYLEGQEYRMYNTYD 509
Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC 607
VHFY+SFAL+ML+PK++LS+Q D A A D ++ + L G K +PHDIG
Sbjct: 510 VHFYASFALVMLWPKLELSLQYDMALATFKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDP 569
Query: 608 D--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF 665
D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP M +F
Sbjct: 570 DDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMKF 629
Query: 666 DRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFK 725
D+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +D F
Sbjct: 630 DKDQDGLIENGGYADQTYDGWVATGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSSI 689
Query: 726 FQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKA 780
+ + YE+ LWNG Y+NYD+S QS SI +DQ AGQW+ RACGL +
Sbjct: 690 LCRGREAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLRACGLGEGDTEVFPTLHV 749
Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
AL+ ++ NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E L
Sbjct: 750 VRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLT 809
Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
GF+TA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 810 REGFRTAEGCYRTVWERLGL--AFQTPEAYCQQQVFRSLAYMRPLSIW 855
>gi|73971791|ref|XP_531991.2| PREDICTED: non-lysosomal glucosylceramidase [Canis lupus familiaris]
Length = 1136
Score = 576 bits (1485), Expect = e-161, Method: Compositional matrix adjust.
Identities = 336/831 (40%), Positives = 463/831 (55%), Gaps = 83/831 (9%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 338 RKTQVEKKTPFIDLINCVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 397
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
A+QF+V + R Y VL + P VL+ SW+W L G + YHALYPRAWTV
Sbjct: 398 ADQFTVCLRREGKTVYQQVLSVERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 449
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+ N +GG
Sbjct: 450 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 508
Query: 252 DSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
+ G +N + DG V +LLHH LP P T A+AA+ T V+
Sbjct: 509 GDDAIGGLWNEPFCLERDGETVQGLLLHHPA---LPNPYTMAVAARLTADTTVTH----- 560
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
I+ T + +W ++ + G D + S S+ G+ I A+ + +PP + ++
Sbjct: 561 ITAFDPDSTGQQVWQDLLQDGQLD--SPAGRSTPSQKGAGIAGAVCVTGKLPPRGQCRLE 618
Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
FSLAWD P + F G+ YYRRYT+F+G NAA ++ A+ ++ WE +I AWQ P+L
Sbjct: 619 FSLAWDMPRIMFGAKGQVYYRRYTRFFGRDGNAAPALSHYALCQYVDWEEKISAWQSPVL 678
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
+D+ LP WY LFNELY+L GG VW ++V
Sbjct: 679 DDRSLPAWYKSALFNELYFLADGGTVW------------------------------LEV 708
Query: 487 PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTY 546
P ++ ++E M + + G+F YLEG EY M+NTY
Sbjct: 709 P-EDSLPEKLVESMCQLRPIL-------------------REYGRFGYLEGQEYRMYNTY 748
Query: 547 DVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGI 606
DVHFY+SFAL+ML+PK++LS+Q D A A + D ++ + L G K +PHDIG
Sbjct: 749 DVHFYASFALVMLWPKLELSLQYDMAVATLREDLTRRQYLMSGVMAPVKRRNVIPHDIGD 808
Query: 607 CD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F + +WP M +
Sbjct: 809 PDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMK 868
Query: 665 FDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLF 724
FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G +D F
Sbjct: 869 FDKDQDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAILCGAHDVQDKFSS 928
Query: 725 KFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDK 779
++ + YE+ LWNG Y+NYD S QS SI +DQ AGQW+ RA GL +
Sbjct: 929 ILRRGQEAYERLLWNGRYYNYDCSPQPQSCSIMSDQCAGQWFLRASGLGEGDTEVFPTRH 988
Query: 780 ARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDL 839
AL+ ++ +NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E L
Sbjct: 989 VVCALQTIFEFNVRAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGL 1048
Query: 840 ADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
GFQTA G Y W LG AFQTPEA+ +RSL YMRPL+IWAM
Sbjct: 1049 TWEGFQTAEGCYRTVWE--RLGLAFQTPEAYCQQRVFRSLAYMRPLSIWAM 1097
>gi|432920219|ref|XP_004079895.1| PREDICTED: non-lysosomal glucosylceramidase-like [Oryzias latipes]
Length = 858
Score = 575 bits (1482), Expect = e-161, Method: Compositional matrix adjust.
Identities = 339/845 (40%), Positives = 470/845 (55%), Gaps = 78/845 (9%)
Query: 63 IGVRILCLIREEAAKGKRA-FIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIF 121
+G+R L ++ K+A FID F L +G PLGG+G G+I R +RGEF RWQ+
Sbjct: 62 LGIRYLRWWYKKTQVEKKAPFIDMFHALPLRQIYGAPLGGIGGGTITRGWRGEFCRWQLN 121
Query: 122 PRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTY 181
P + K V+A+QF+V + R Y VL + P L+ W+W G+ + Y
Sbjct: 122 PGMYHYKTVIADQFTVCLRRGGQTVYQQVLSVEHPSTLQ--------GWNWGYCGEYAFY 173
Query: 182 HALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITL 241
HALYPRAWTV+ P + + CRQISP+IPH+YK+SS PV+VF + I N + ++++
Sbjct: 174 HALYPRAWTVYH-LPGQNVTLTCRQISPVIPHDYKDSSLPVAVFVWDIENKNDYALEVSI 232
Query: 242 LFTWTNSVGGDSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGV 298
+FT N G + +G H+N K + VLLHH T+ + P T ++A+E
Sbjct: 233 MFTMVNGSGHKDDSSGGHWNEPFHLEKEGKALSGVLLHHCTA--VNPYTLCISAREQPDR 290
Query: 299 HVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTV 358
VS F SG GL W ++ G D + +S ++ G + AA+A +V
Sbjct: 291 KVSHQTAFSPSGTCSGL-----WSDLMTDGRLDSPSG--SSPPTQKGEKVAAALAVGCSV 343
Query: 359 PPDSEGQVTFSLAWDCPEVNFMS-GKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQ 417
P V F LAWD P + F S + + RRYT+++G+ A+ +++ A+ + WE +
Sbjct: 344 PAHGHNDVEFCLAWDMPLITFGSRERKHVRRYTRYFGSKGEASPSLSHYALTHYKEWERR 403
Query: 418 IEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQ 477
IE WQ PIL+D LP WY LFNELY++ GG VWT+
Sbjct: 404 IEEWQSPILQDSSLPSWYKSALFNELYFVADGGTVWTE---------------------- 441
Query: 478 SDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEG 537
L DV + L ++++ G+F YLEG
Sbjct: 442 --LAEDADVSGGVRSEDGGLPAQPDVIKEY----------------------GRFAYLEG 477
Query: 538 IEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVL 597
EY M+NTYDVHFY+SFALIML+PK+ LS+Q D A +V+ D ++ L G++ K
Sbjct: 478 QEYRMYNTYDVHFYASFALIMLWPKLALSVQYDIAGSVVQQDLTERLHLMSGRYSPVKTK 537
Query: 598 GAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSV 655
VPHDIG D PW +NAY ++DTA WKDLN KFVLQVYRD T D ++ K +WP
Sbjct: 538 NVVPHDIGDPDDEPWHRLNAYLIHDTAGWKDLNLKFVLQVYRDFHITQDNQYLKDMWPIC 597
Query: 656 YVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD 715
M +FD DGDG+IEN G+ DQTYD W+V+G SAY GGLW+A+L +AR V
Sbjct: 598 QAVMESEIKFDLDGDGLIENSGYADQTYDGWTVTGPSAYCGGLWLASLCVMCKMARLVDT 657
Query: 716 RGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWY--ARACGL- 771
+ ++ + ++K LWNG Y+NYD+SG S+S+ +DQ AG W+ A ACG
Sbjct: 658 ERTYQHYKDILDRGSAAFDKLLWNGKYYNYDSSGRYHSNSVMSDQCAGHWFLKASACGEE 717
Query: 772 -LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAV 830
++K +S LK +++ NV+ GG+ GAVNGM P+G D SS+QS E+W GV Y +
Sbjct: 718 DYQAFPKEKIQSTLKSIFDLNVMAFAGGQMGAVNGMRPEGVPDRSSVQSDEVWIGVVYGL 777
Query: 831 AASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
AA+MIHE + + G TA G Y W LG AFQTPEA+ + YRSL YMRPL+IWAM
Sbjct: 778 AATMIHEGMREEGMHTAEGCYRTVWE--RLGMAFQTPEAYCEKNIYRSLAYMRPLSIWAM 835
Query: 891 QWALT 895
Q AL
Sbjct: 836 QLALN 840
>gi|426220232|ref|XP_004004320.1| PREDICTED: non-lysosomal glucosylceramidase [Ovis aries]
Length = 918
Score = 574 bits (1479), Expect = e-161, Method: Compositional matrix adjust.
Identities = 336/830 (40%), Positives = 464/830 (55%), Gaps = 81/830 (9%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K+ FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 117 RKTQVEKKKPFIDLINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHQTVI 176
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
ANQF+V + R Y VL + P VL+ SW+W L G + YHALYPRAWTV
Sbjct: 177 ANQFTVCLRRRGQTVYQQVLSVERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 228
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + +++++F+ N +GG
Sbjct: 229 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALEVSIMFSMRNGLGG 287
Query: 252 DSEFTGQHYNSKTKMN-DG--VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI 308
+ + G +N + DG V +LLHH T P T A+AA+ + V+ F
Sbjct: 288 EDDAPGGLWNEPFCLERDGETVQGLLLHHPTPPN--PYTMAVAARLSADTTVTHLTAF-- 343
Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
+S G + +W ++ G D L S+ S+ G I A+ S +PP ++ F
Sbjct: 344 DPDSAG---QQVWQDLLLDGQLDSLAG--KSLPSQKGVGIAGAVCVSGKLPPRGRQRLEF 398
Query: 369 SLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
SLAWD P + F G+ YYRRYT+F+G +AA ++ A+ + WE +I AWQ P+LE
Sbjct: 399 SLAWDMPRIMFGAKGRVYYRRYTRFFGPDGDAAPALSHYALSHYADWEEKISAWQNPVLE 458
Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
D+ LP WY LFNELY+L GG VW ++VP
Sbjct: 459 DRSLPAWYKSALFNELYFLADGGTVW------------------------------LEVP 488
Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
+ P AL S G L+ + G+F YLEG EY M+NTYD
Sbjct: 489 EDS------------------LPEALVGSMGQ--LRPLLQEYGRFAYLEGQEYRMYNTYD 528
Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC 607
VHFY+SFAL+ML+PK++LS+Q D A A + D ++ + L G K +PHD+G
Sbjct: 529 VHFYASFALVMLWPKLELSLQYDMALATLREDLTQRRFLMSGVTAPVKRRNVIPHDVGDP 588
Query: 608 D--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF 665
D PW VNAY ++DT WKDLN KFVLQVYRD TGD+ F + +WP M +F
Sbjct: 589 DDEPWLRVNAYEIHDTGDWKDLNLKFVLQVYRDYYLTGDQCFLRDMWPVCLAVMESEMKF 648
Query: 666 DRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFK 725
D+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +++ F
Sbjct: 649 DKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQEAQNKFSSI 708
Query: 726 FQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKA 780
+ + Y++ LWNG Y+NYD S QS SI +DQ AGQW+ RA GL + +
Sbjct: 709 LSRGQEAYDRLLWNGRYYNYDCSSQPQSYSIMSDQCAGQWFLRASGLGEGDSEVFPTQQV 768
Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
AL+ ++ +NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E L
Sbjct: 769 VRALQTIFEFNVQAFAGGAMGAVNGMQPRGVPDTSSVQSDEVWVGVVYGLAATMIQEGLT 828
Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
GF+TA G Y W GL AFQTPEA+ +RSL YMRPL+IWAM
Sbjct: 829 WEGFRTAEGCYRTVWERLGL--AFQTPEAYCQRRVFRSLAYMRPLSIWAM 876
>gi|332228461|ref|XP_003263408.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal glucosylceramidase
[Nomascus leucogenys]
Length = 919
Score = 573 bits (1478), Expect = e-160, Method: Compositional matrix adjust.
Identities = 334/831 (40%), Positives = 458/831 (55%), Gaps = 87/831 (10%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 119 RKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 178
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
A+QF+V + R Y VL + P VL+ SW+W L G HALYPR WTV
Sbjct: 179 ADQFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFCFLHALYPRXWTV 230
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P PE + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+ N +GG
Sbjct: 231 YQ-LPGPECSLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 289
Query: 252 DSEFTGQHYNSKTKMNDG---VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
+ G +N + G V +LLHH T LP P T A+AA+ T V+
Sbjct: 290 GDDAPGGLWNEPFCLERGGETVQGLLLHHPT---LPNPYTMAVAARVTADTTVTH----- 341
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
I+ T + +W ++ + G D + S ++ G I A+ S + P + ++
Sbjct: 342 ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLQPRGQCRLE 399
Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
FSLAWD P + F G+ +YRRYT+F+G +AA ++ A+ ++ WE +I AWQ P+L
Sbjct: 400 FSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPTLSHYALCQYPEWEERISAWQSPVL 459
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
+D+ LP WY LFNELY+L GG VW +
Sbjct: 460 DDRSLPAWYKSALFNELYFLADGGTVWLE------------------------------- 488
Query: 487 PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWN 544
+LE +L G N+ L+ + G+F YLEG EY M+N
Sbjct: 489 ----------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYN 527
Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
TYDVHFY+SFALIML+PK++LS+Q D A A + D ++ + L G K +PHDI
Sbjct: 528 TYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDI 587
Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
G D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP M
Sbjct: 588 GDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESE 647
Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
+FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +D F
Sbjct: 648 MKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKF 707
Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
+ + YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ +ACGL +
Sbjct: 708 SSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPT 767
Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
AL+ ++ NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E
Sbjct: 768 PHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 827
Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
L GFQTA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 828 GLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 876
>gi|440901933|gb|ELR52793.1| Non-lysosomal glucosylceramidase, partial [Bos grunniens mutus]
Length = 927
Score = 573 bits (1476), Expect = e-160, Method: Compositional matrix adjust.
Identities = 336/830 (40%), Positives = 464/830 (55%), Gaps = 81/830 (9%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K+ FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 126 RKTQVEKKKPFIDLINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHQTVI 185
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
ANQF+V + R Y VL + P VL+ SW+W L G + YHALYPRAWTV
Sbjct: 186 ANQFTVCLRRRGQTVYQQVLSVERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 237
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + +++++F+ N +GG
Sbjct: 238 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALEVSIMFSMRNGLGG 296
Query: 252 DSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI 308
+ + G +N + +DG V +LLHH T P T A+AA+ + V+ F
Sbjct: 297 EDDAPGGLWNEPFCLEHDGETVQGLLLHHPTPPN--PYTMAVAARLSADTTVTHLTAF-- 352
Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
+S G + +W ++ G D L S+ S+ G I A+ S +PP ++ F
Sbjct: 353 DPDSAG---QQVWQDLLLDGQLDSLAG--KSLPSQKGVGIAGAVCVSGKLPPRGRHRLEF 407
Query: 369 SLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
SLAWD P + F G+ YYRRYT+F+G +AA ++ A+ + WE +I AWQ P+LE
Sbjct: 408 SLAWDMPRIMFGAKGQVYYRRYTRFFGPDGDAAPALSHYALRHYADWEEKISAWQSPVLE 467
Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
D+ LP WY LFNELY+L GG VW ++VP
Sbjct: 468 DRSLPAWYKSALFNELYFLADGGTVW------------------------------LEVP 497
Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
+ P L S G L+ + G+F YLEG EY M+NTYD
Sbjct: 498 EDS------------------LPEELVGSMGQ--LRPLLQEYGRFAYLEGQEYRMYNTYD 537
Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC 607
VHFY+SFALIML+PK++LS+Q D A A + D ++ + L G K +PHD+G
Sbjct: 538 VHFYASFALIMLWPKLELSLQYDMALATLREDLTQRRFLMSGVTAPVKRRNVIPHDVGDP 597
Query: 608 D--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF 665
D PW VNAY ++DT WKDLN KFVLQVYRD TGD+ F + +WP M +F
Sbjct: 598 DDEPWLRVNAYEIHDTGDWKDLNLKFVLQVYRDYYLTGDQCFLRDMWPVCLAVMESEMKF 657
Query: 666 DRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFK 725
D+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +++ F
Sbjct: 658 DKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQEAQNKFSSI 717
Query: 726 FQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKA 780
+ + Y++ LWNG Y+NYD S QS SI +DQ AGQW+ RA GL + +
Sbjct: 718 LSRGQEAYDRLLWNGRYYNYDCSSQPQSCSIMSDQCAGQWFLRASGLGEGDSEVFPTQQV 777
Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
AL+ ++ +NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E L
Sbjct: 778 VRALQTIFEFNVQAFAGGAMGAVNGMQPRGVPDTSSVQSDEVWVGVVYGLAATMIQEGLT 837
Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
GF+TA G Y W GL AFQTPEA+ +RSL YMRPL+IWAM
Sbjct: 838 WEGFRTAEGCYRTVWERLGL--AFQTPEAYCQRRVFRSLAYMRPLSIWAM 885
>gi|410978559|ref|XP_003995657.1| PREDICTED: non-lysosomal glucosylceramidase [Felis catus]
Length = 917
Score = 573 bits (1476), Expect = e-160, Method: Compositional matrix adjust.
Identities = 340/858 (39%), Positives = 474/858 (55%), Gaps = 88/858 (10%)
Query: 46 PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
P + ++ ++ +G+R L R+ + K FID L +G PLGG+G
Sbjct: 92 PFNANNISLSNLIKHLGMGLRYLQWWYRKTQVEKKTPFIDLINCVPLRQIYGCPLGGIGG 151
Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
G+I R +RG+F RWQ+ P + + + V+A+QF+V + R Y VL + P VL+
Sbjct: 152 GTITRGWRGQFCRWQLNPGMYQHQTVVADQFTVCLRREGTTVYQQVLSVERPSVLR---- 207
Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
SW+W L G + YHALYPRAWTV++ P + + CRQI+PI+PH+Y++SS PV V
Sbjct: 208 ----SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGV 262
Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSH 281
F + + N G + D++++F+ N +GG + G +N + DG V +LLHH T
Sbjct: 263 FVWDVENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERDGETVQGLLLHHPT-- 320
Query: 282 QLP-PVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSV 340
LP P T A+AA+ T V+ I+ T + +W ++ + G D + S
Sbjct: 321 -LPNPYTMAVAARLTADTTVTH-----ITAFDPDSTGQQVWQDLLQDGQLD--SPAGRST 372
Query: 341 TSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNA 399
S+ G I A+ + +PP + ++ FSLAWD P + F G+ YYRRYT+F+G NA
Sbjct: 373 PSQKGVGIAGAVCVTGKLPPRGQCRLEFSLAWDMPRIMFGAKGQVYYRRYTRFFGRDGNA 432
Query: 400 AANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPP 459
A ++ A+ + WE +I AWQ P+L+D+ LP WY LFNELY+L GG VW
Sbjct: 433 APTLSHYALCRYTDWEEKISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW------ 486
Query: 460 VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQN--DTAVNILERMSSILEQIYTPVALNSSF 517
++VP + + V + ++ IL++
Sbjct: 487 ------------------------LEVPEDSLPEELVGSMCQLRPILQEY---------- 512
Query: 518 GTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMM 577
G+F YLEG EY M+NTYDVHFY+SFAL+ML+PK++LS+Q D A A +
Sbjct: 513 ------------GRFGYLEGQEYRMYNTYDVHFYASFALVMLWPKLELSLQYDMALATLR 560
Query: 578 HDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQV 635
D ++ + L G K +PHDIG D PW VNAY ++DTA WKDLN KFVLQV
Sbjct: 561 EDLTRRQYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYVVHDTADWKDLNLKFVLQV 620
Query: 636 YRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYS 695
YRD TGD+ F + +WP M +FD+D DG+IEN G+ DQTYD W +G SAY
Sbjct: 621 YRDYYLTGDQGFLRDMWPVCLAVMESEMKFDKDQDGLIENGGYADQTYDGWVTTGPSAYC 680
Query: 696 GGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSS 754
GGLW+AA+ +A G + +D F ++ + YE+ LWNG Y+NYD S QS S
Sbjct: 681 GGLWLAAVAVMVQMAVLCGAQDVQDKFSSILRRGQEAYERLLWNGRYYNYDCSPPPQSCS 740
Query: 755 IQADQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDG 810
I +DQ AGQW+ RA GL + AL+ ++ +NV GG GAVNGM P G
Sbjct: 741 IMSDQCAGQWFLRASGLGEGDTEVFPTPHVVCALQTIFEFNVRAFAGGAMGAVNGMQPHG 800
Query: 811 RVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAW 870
D SS+QS E+W GV Y +AA+MI E L GFQTA G Y W GL AFQTPEA+
Sbjct: 801 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAY 858
Query: 871 NTDDQYRSLCYMRPLAIW 888
+RSL YMRPL+IW
Sbjct: 859 CQQRVFRSLAYMRPLSIW 876
>gi|344271640|ref|XP_003407645.1| PREDICTED: non-lysosomal glucosylceramidase [Loxodonta africana]
Length = 967
Score = 573 bits (1476), Expect = e-160, Method: Compositional matrix adjust.
Identities = 339/855 (39%), Positives = 460/855 (53%), Gaps = 82/855 (9%)
Query: 46 PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
P ++ ++L +G R ++ + K FID L +G PLGG+G
Sbjct: 144 PFDANNISLSNMIKLVGVGFRYFQWWYQKTRVEKKTPFIDMLNSVPLRQIYGCPLGGIGG 203
Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
G+I R +RG+F RWQ+ P + + + V+A+QF+V + R Y VL + P VL+
Sbjct: 204 GTITRGWRGQFCRWQLNPGMYQHQTVIADQFTVCLRREGQTVYQQVLSLERPSVLR---- 259
Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
SW+W L G + YHALYPRAWTV++ P + + CRQI+PI+PH+Y++SS PV V
Sbjct: 260 ----SWNWGLCGFFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGV 314
Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMN-DG--VHAVLLHHRTSH 281
F + + N G + D++++FT N +GG + G +N + DG V +LLHH T
Sbjct: 315 FVWDVENEGDEALDVSIMFTMRNGLGGRDDAPGGLWNEPFCLERDGETVQGLLLHHPTPP 374
Query: 282 QLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVT 341
P T A+AA+ T V+ I+ T + +W ++ + G D + S
Sbjct: 375 N--PYTMAMAARLTADTTVTH-----ITAFDPDSTGQQVWQDLLQDGQLD--SPAGRSTP 425
Query: 342 SEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAA 400
S+ G I A+ + +PP + FSLAWD P + F G+ YYRRYT+F+G +AA
Sbjct: 426 SQKGVGIAGAVCVAAKLPPRGRCCLEFSLAWDMPRIVFGAKGQVYYRRYTRFFGQDGSAA 485
Query: 401 ANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPV 460
++ A+ + WE +I AWQ PILED+ LP WY LFNELY+L GG VW
Sbjct: 486 PALSHYALCRYADWEERISAWQSPILEDRSLPAWYKSALFNELYFLADGGTVW------- 538
Query: 461 HSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTN 520
++VP T G
Sbjct: 539 -----------------------LEVPEDALTE--------------------EPEGGMR 555
Query: 521 LLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
L+ G+F YLEG EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A + D
Sbjct: 556 QLRPMLREYGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLKEDL 615
Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRD 638
+ + L G K +PHDIG D PW VNAY ++DTA WKDLN KFVLQVYRD
Sbjct: 616 TPRRYLMSGLTAPVKRRNVIPHDIGDPDDEPWLRVNAYVIHDTADWKDLNLKFVLQVYRD 675
Query: 639 VVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TGD+ F + +WP M +FD+D DG+IEN G+ DQTYD W +G SAY GGL
Sbjct: 676 YYLTGDQGFLRDMWPVCLAVMESEMEFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGL 735
Query: 699 WVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQA 757
W+AA+ +A + ++ F + K YE+ LWNG Y+NYD+S QS SI +
Sbjct: 736 WLAAVAVMVQMATLCEAQDIQEKFSSILSRGKEAYERLLWNGRYYNYDSSSQPQSRSIMS 795
Query: 758 DQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVD 813
DQ AGQW+ RACGL + AL+ ++ +NV GG GAVNGM P G D
Sbjct: 796 DQCAGQWFLRACGLGEGDTEVFPTPHVVRALQTIFEFNVQAFAGGAMGAVNGMQPHGVPD 855
Query: 814 MSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTD 873
SS+QS E+W GV Y +AA+MI E L GF TA G Y W LG AFQTPEA+
Sbjct: 856 TSSVQSDEVWVGVVYGLAATMIQEGLTSKGFWTAEGCYRTVWE--RLGLAFQTPEAYCQQ 913
Query: 874 DQYRSLCYMRPLAIW 888
+RSL YMRPL+IW
Sbjct: 914 RVFRSLAYMRPLSIW 928
>gi|355567647|gb|EHH23988.1| Non-lysosomal glucosylceramidase [Macaca mulatta]
gi|355753217|gb|EHH57263.1| Non-lysosomal glucosylceramidase [Macaca fascicularis]
gi|380789499|gb|AFE66625.1| non-lysosomal glucosylceramidase [Macaca mulatta]
gi|384943046|gb|AFI35128.1| non-lysosomal glucosylceramidase [Macaca mulatta]
Length = 927
Score = 572 bits (1475), Expect = e-160, Method: Compositional matrix adjust.
Identities = 338/856 (39%), Positives = 470/856 (54%), Gaps = 84/856 (9%)
Query: 46 PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
P ++ + ++ +G+R L R+ + K FID L +G PLGG+G
Sbjct: 100 PFQANNVSLRNMIKHIGMGLRYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGG 159
Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
G+I R +RG+F RWQ+ P + + + V+A+QF+V + R Y VL + P VL+
Sbjct: 160 GTITRGWRGQFCRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSVLR---- 215
Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
SW+W L G + YHALYPRAWTV++ P ++ + CRQI+PI+PH+Y++SS PV V
Sbjct: 216 ----SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQDVTLTCRQITPILPHDYQDSSLPVGV 270
Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG---VHAVLLHHRTSH 281
F + + N G + D++++F+ N +GG + G +N + G V +LLHH T
Sbjct: 271 FVWDVENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLEHGGETVQGLLLHHPT-- 328
Query: 282 QLP-PVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSV 340
LP P T A+AA+ T V+ I+ T + +W ++ + G D + S
Sbjct: 329 -LPNPYTMAVAARVTADTTVTH-----ITAFDPDSTGQQVWQDLLQDGQLD--SPAGQST 380
Query: 341 TSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNA 399
++ G I A+ S + P + ++ FSLAWD P + F G+ +YRRYT+F+G +A
Sbjct: 381 PTQKGVGIAGAVCVSSKLRPRGQCRLEFSLAWDMPRILFGAKGQVHYRRYTRFFGQDGDA 440
Query: 400 AANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPP 459
A ++ A+ + WE +I AWQ P+L+D+ LP WY LFNELY+L GG VW +
Sbjct: 441 APALSHYALCRYAEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE---- 496
Query: 460 VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGT 519
R S+ E++ N
Sbjct: 497 --------------------------------------VREDSLPEEL----GRNMCHLC 514
Query: 520 NLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHD 579
L+D G+F YLEG EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A + D
Sbjct: 515 PTLRD----YGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLRED 570
Query: 580 PSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
++ + L G K +PHDIG D PW VNAY ++DTA WKDLN KFVLQVYR
Sbjct: 571 LTRRQYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYR 630
Query: 638 DVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGG 697
D TGD+ F K +WP M +FD+D DG+IEN G+ DQTYD W +G SAY GG
Sbjct: 631 DYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGG 690
Query: 698 LWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQ 756
LW+AA+ +A G + +D F + + YE+ LWNG Y+NYD+S QS S+
Sbjct: 691 LWLAAVAVMVQMAALCGAQDIQDKFSSILHRGQEAYERLLWNGRYYNYDSSSQPQSRSVM 750
Query: 757 ADQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRV 812
+DQ AGQW+ +ACGL + AL+ ++ NV GG GAVNGM P G
Sbjct: 751 SDQCAGQWFLKACGLGEGETEVFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVP 810
Query: 813 DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNT 872
D SS+QS E+W GV Y +AA+MI E L GFQTA G Y W GL AFQTPEA+
Sbjct: 811 DKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQ 868
Query: 873 DDQYRSLCYMRPLAIW 888
+RSL YMRPL+IW
Sbjct: 869 QRVFRSLAYMRPLSIW 884
>gi|109111050|ref|XP_001084705.1| PREDICTED: non-lysosomal glucosylceramidase [Macaca mulatta]
Length = 927
Score = 572 bits (1475), Expect = e-160, Method: Compositional matrix adjust.
Identities = 338/856 (39%), Positives = 470/856 (54%), Gaps = 84/856 (9%)
Query: 46 PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
P ++ + ++ +G+R L R+ + K FID L +G PLGG+G
Sbjct: 100 PFQANNVSLRNMIKHIGMGLRYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGG 159
Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
G+I R +RG+F RWQ+ P + + + V+A+QF+V + R Y VL + P VL+
Sbjct: 160 GTITRGWRGQFCRWQLNPGMYQHQTVIADQFTVCLRREGQTVYQQVLSLERPSVLR---- 215
Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
SW+W L G + YHALYPRAWTV++ P ++ + CRQI+PI+PH+Y++SS PV V
Sbjct: 216 ----SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQDVTLTCRQITPILPHDYQDSSLPVGV 270
Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG---VHAVLLHHRTSH 281
F + + N G + D++++F+ N +GG + G +N + G V +LLHH T
Sbjct: 271 FVWDVENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLEHGGETVQGLLLHHPT-- 328
Query: 282 QLP-PVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSV 340
LP P T A+AA+ T V+ I+ T + +W ++ + G D + S
Sbjct: 329 -LPNPYTMAVAARVTADTTVTH-----ITAFDPDSTGQQVWQDLLQDGQLD--SPAGQST 380
Query: 341 TSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNA 399
++ G I A+ S + P + ++ FSLAWD P + F G+ +YRRYT+F+G +A
Sbjct: 381 PTQKGVGIAGAVCVSSKLRPRGQCRLEFSLAWDMPRILFGAKGQVHYRRYTRFFGQDGDA 440
Query: 400 AANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPP 459
A ++ A+ + WE +I AWQ P+L+D+ LP WY LFNELY+L GG VW +
Sbjct: 441 APALSHYALCRYAEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE---- 496
Query: 460 VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGT 519
R S+ E++ N
Sbjct: 497 --------------------------------------VREDSLPEEL----GRNMCHLC 514
Query: 520 NLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHD 579
L+D G+F YLEG EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A + D
Sbjct: 515 PTLRD----YGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLRED 570
Query: 580 PSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
++ + L G K +PHDIG D PW VNAY ++DTA WKDLN KFVLQVYR
Sbjct: 571 LTRRQYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYR 630
Query: 638 DVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGG 697
D TGD+ F K +WP M +FD+D DG+IEN G+ DQTYD W +G SAY GG
Sbjct: 631 DYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGG 690
Query: 698 LWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQ 756
LW+AA+ +A G + +D F + + YE+ LWNG Y+NYD+S QS S+
Sbjct: 691 LWLAAVAVMVQMAALCGAQDIQDKFSSILHRGQEAYERLLWNGRYYNYDSSSQPQSRSVM 750
Query: 757 ADQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRV 812
+DQ AGQW+ +ACGL + AL+ ++ NV GG GAVNGM P G
Sbjct: 751 SDQCAGQWFLKACGLGEGETEVFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVP 810
Query: 813 DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNT 872
D SS+QS E+W GV Y +AA+MI E L GFQTA G Y W GL AFQTPEA+
Sbjct: 811 DKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQ 868
Query: 873 DDQYRSLCYMRPLAIW 888
+RSL YMRPL+IW
Sbjct: 869 QRVFRSLAYMRPLSIW 884
>gi|194669510|ref|XP_001790122.1| PREDICTED: non-lysosomal glucosylceramidase [Bos taurus]
gi|297477943|ref|XP_002689726.1| PREDICTED: non-lysosomal glucosylceramidase [Bos taurus]
gi|296484715|tpg|DAA26830.1| TPA: hypothetical protein BOS_8918 [Bos taurus]
Length = 918
Score = 572 bits (1474), Expect = e-160, Method: Compositional matrix adjust.
Identities = 336/830 (40%), Positives = 463/830 (55%), Gaps = 81/830 (9%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K+ FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 117 RKTQVEKKKPFIDLINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHQTVI 176
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
ANQF+V + R Y VL + P VL SW+W L G + YHALYPRAWTV
Sbjct: 177 ANQFTVCLRRRGQTVYQQVLSVERPSVL--------CSWNWGLCGYFAFYHALYPRAWTV 228
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + +++++F+ N +GG
Sbjct: 229 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALEVSIMFSMRNGLGG 287
Query: 252 DSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI 308
+ + G +N + +DG V +LLHH T P T A+AA+ + V+ F
Sbjct: 288 EDDAPGGLWNEPFCLEHDGETVQGLLLHHPTPPN--PYTMAVAARLSADTTVTHLTAF-- 343
Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
+S G + +W ++ G D L S+ S+ G I A+ S +PP ++ F
Sbjct: 344 DPDSAG---QQVWQDLLLDGQLDSLAG--KSLPSQKGVGIAGAVCVSGKLPPRGRHRLEF 398
Query: 369 SLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
SLAWD P + F G+ YYRRYT+F+G +AA ++ A+ + WE +I AWQ P+LE
Sbjct: 399 SLAWDMPRIMFGAKGQVYYRRYTRFFGPDGDAAPALSHYALRHYADWEEKISAWQSPVLE 458
Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
D+ LP WY LFNELY+L GG VW ++VP
Sbjct: 459 DRSLPAWYKSALFNELYFLADGGTVW------------------------------LEVP 488
Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
+ P L S G L+ + G+F YLEG EY M+NTYD
Sbjct: 489 EDS------------------LPEELVGSMGQ--LRPLLQEYGRFAYLEGQEYRMYNTYD 528
Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC 607
VHFY+SFALIML+PK++LS+Q D A A + D ++ + L G K +PHD+G
Sbjct: 529 VHFYASFALIMLWPKLELSLQYDMALATLREDLTQRRFLMSGVTAPVKRRNVIPHDVGDP 588
Query: 608 D--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF 665
D PW VNAY ++DT WKDLN KFVLQVYRD TGD+ F + +WP M +F
Sbjct: 589 DDEPWLRVNAYEIHDTGDWKDLNLKFVLQVYRDYYLTGDQCFLRDMWPVCLAVMESEMKF 648
Query: 666 DRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFK 725
D+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +++ F
Sbjct: 649 DKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQEAQNKFSSI 708
Query: 726 FQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKA 780
+ + Y++ LWNG Y+NYD S QS SI +DQ AGQW+ RA GL + +
Sbjct: 709 LSRGQEAYDRLLWNGRYYNYDCSSQPQSCSIMSDQCAGQWFLRASGLGEGDSEVFPTQQV 768
Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
AL+ ++ +NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E L
Sbjct: 769 VRALQTIFEFNVQAFAGGAMGAVNGMQPCGVPDTSSVQSDEVWVGVVYGLAATMIQEGLT 828
Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
GF+TA G Y W GL AFQTPEA+ +RSL YMRPL+IWAM
Sbjct: 829 REGFRTAEGCYRTVWERLGL--AFQTPEAYCQRRVFRSLAYMRPLSIWAM 876
>gi|395855828|ref|XP_003800351.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal glucosylceramidase
[Otolemur garnettii]
Length = 1147
Score = 572 bits (1473), Expect = e-160, Method: Compositional matrix adjust.
Identities = 343/862 (39%), Positives = 470/862 (54%), Gaps = 90/862 (10%)
Query: 46 PLSQFTLNWKETVQLAPIGVRILCL-IREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
P L+ ++ +G+R L R+ + K FID L +G PLGG+G
Sbjct: 314 PFHANNLSLSNIIKHLGMGLRYLQWWYRKTQVEKKAPFIDMINCVPLRQIYGCPLGGIGG 373
Query: 105 GSIGRSYRGEFQRWQIFP------RVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEV 158
G+I R +RG+F RWQ+ P V D+P+ +F+V + R Y VL + P V
Sbjct: 374 GTITRGWRGQFCRWQLNPGMYQHRTVIADQPISPTKFTVCLRRKGQTVYQQVLSLECPSV 433
Query: 159 LKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKES 218
L+ SW+W L G + YHALYPRAWTV++ P + + CRQI+PI+PH+Y++S
Sbjct: 434 LR--------SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDS 484
Query: 219 SYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKM-NDG--VHAVLL 275
S PV VF + + N G + D++++F+ N +GG + G +N + DG V +LL
Sbjct: 485 SLPVGVFVWDVENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERDGETVQGLLL 544
Query: 276 HHRTSHQLP-PVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLN 334
HH T LP P T A+AA+ T V+ I+ T + +W ++ + G D +
Sbjct: 545 HHPT---LPNPYTMAVAARCTADTTVTH-----ITAFDPDSTGQQVWQDLLQDGQLD--S 594
Query: 335 SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFY 393
S S+ G + A+ S +PP + ++ FSLAWD P++ F G+ +YRRYT+F+
Sbjct: 595 PAGQSAPSQKGVGVAGAVCVSGKLPPQGQCRLEFSLAWDMPKIMFGAKGQVHYRRYTRFF 654
Query: 394 GTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVW 453
G NAA ++ A+ + WE +I AWQ P+L+D+ LP WY LFNELY+L GG VW
Sbjct: 655 GQDGNAAPALSHYALCRYADWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW 714
Query: 454 TDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVAL 513
+ R SI E++
Sbjct: 715 LE------------------------------------------VREDSISEEL------ 726
Query: 514 NSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAA 573
G L + G+F YLEG EY M+NTYDVHFY+SFALIML+PK++LS+Q D A
Sbjct: 727 --GGGMCQLHPMLRDYGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMAL 784
Query: 574 AVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKF 631
A + D ++ + L +G K +PHDIG D PW VNAY ++DTA WKDLN KF
Sbjct: 785 ATLREDLTRRRYLMNGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKF 844
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
VLQVYRD TGD+ F K +WP M +FD+D DG+IEN G+ DQTYD W +G
Sbjct: 845 VLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMKFDKDHDGLIENGGYADQTYDAWVTTGP 904
Query: 692 SAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSS 750
SAY GGLW+AA+ +A G + +D F ++ + YE+ LWNG Y+NYD+S
Sbjct: 905 SAYCGGLWLAAVAVMVQMAALCGAQDLQDKFSSILRQGQEAYERLLWNGRYYNYDSSSQP 964
Query: 751 QSSSIQADQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
QS SI +DQ AGQW+ +ACGL + AL+ +Y NV GG GAVNGM
Sbjct: 965 QSRSIMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTIYELNVQAFAGGAMGAVNGM 1024
Query: 807 LPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQT 866
P G D SS+QS E+W GV Y +AA+MI E L GF+TA G Y W LG AFQT
Sbjct: 1025 QPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE--RLGLAFQT 1082
Query: 867 PEAWNTDDQYRSLCYMRPLAIW 888
PEA+ +RSL YMRPL+IW
Sbjct: 1083 PEAYCQQRVFRSLAYMRPLSIW 1104
>gi|66801399|ref|XP_629625.1| hypothetical protein DDB_G0292446 [Dictyostelium discoideum AX4]
gi|60463006|gb|EAL61202.1| hypothetical protein DDB_G0292446 [Dictyostelium discoideum AX4]
Length = 1302
Score = 572 bits (1473), Expect = e-160, Method: Compositional matrix adjust.
Identities = 338/945 (35%), Positives = 495/945 (52%), Gaps = 142/945 (15%)
Query: 36 WQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSH 95
W R+L F ++ + ++L +G R+ +++E + G+ +DPF
Sbjct: 373 WARRLDNNYPEHKHFQVSMWQGLKLVGMGYRMWKYVKKETSSGRVPIMDPFNLPKPGPIM 432
Query: 96 GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVS--------------- 140
GVP+GG+GSGSI R ++G+F RW + + ++ V ++FSV+++
Sbjct: 433 GVPIGGIGSGSINRGWKGDFVRWNLNNGLVSNEIVDVDKFSVYINFDNSNTSSSSSSSSS 492
Query: 141 --------------------------------RSNGQKYSSVLCPKTPEVLKDTTAAGIG 168
++ + VL P P+ + +
Sbjct: 493 NSSTPPPQFSNNNNNSYNNSYNNNINNNGFDNNLKSKQKAVVLYPGKPK-----SNLNLD 547
Query: 169 SWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYT 228
W+W LKGD S Y L+PRAWTV+E EP P++R+ C+Q+SP+IPHNY+ESSYP V+ +
Sbjct: 548 VWNWGLKGDNSCYFGLFPRAWTVYE-EPHPDIRLTCKQVSPVIPHNYQESSYPCGVYVWK 606
Query: 229 IYNSGK-TSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG------------------ 269
I N+ + +SAD++L+ TW NS+G S+ G H+N D
Sbjct: 607 IENNNQYSSADVSLMLTWQNSIGTQSDQDGGHFNKYFTFGDNDQQSPSSSSSSSSGNNKK 666
Query: 270 -VHAVLLHHRTSHQ--LP---------PVTFALAAQETDGVHVSLCPHFVISGNSLGLTA 317
+ + + H+ + LP P+ +++A + V +S C F + L A
Sbjct: 667 NIKGITMIHKKEQKCTLPNGNTKVFNDPMEYSIAVMDEPDVEISFCERFETTSR---LDA 723
Query: 318 KDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEV 377
++W+ + G L + + S S P IGAA+AA V V S + F ++WD P
Sbjct: 724 ANLWYTFNKKG---ELENNKDSRPSAPKKPIGAAVAAKVKVAAGSSKTIVFCISWDAPIT 780
Query: 378 NFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPI 437
F G YY+RYTKFYG + IA DAI W+ QI WQ+PIL D LP +Y +
Sbjct: 781 RFQMGSAYYKRYTKFYGNQGGNSQRIAYDAINNWKHWDNQIIQWQQPILSDPELPTFYKM 840
Query: 438 TLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNIL 497
LFNELYYL GG VWT GSP I VP++ T +
Sbjct: 841 ALFNELYYLVDGGTVWTHGSP------------------------IDPVPSRYPTKTQLK 876
Query: 498 ERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALI 557
+ ++ TN +IG+F YLE +EYLM+NTYDV FY+SFAL
Sbjct: 877 QEDTT---------------DTN-------HIGRFAYLESLEYLMYNTYDVMFYASFALT 914
Query: 558 MLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC--DPWFEVNA 615
ML+P +++S+Q D A A + + + G + RK G VPHD+G DPW VN+
Sbjct: 915 MLWPLLEISLQYDIADATQLDYGINWEGIHSGHLMPRKKQGTVPHDLGNPGEDPWKRVNS 974
Query: 616 YCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVY-VAMAYMDQFDRDGDGMIE 674
YC+ D +RWKDL KFVLQVYRD + DK F + V V ++ FD D DG+++
Sbjct: 975 YCIQDVSRWKDLPSKFVLQVYRDYLVMEDKNFLLQCYNVVEGVIQRTLEHFDFDHDGVVD 1034
Query: 675 NDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK-AKVVY 733
N+GFPDQTYD W +G SAYSGGLW+AAL+ A+ +A+ +G E + F+K +K
Sbjct: 1035 NEGFPDQTYDVWPATGCSAYSGGLWLAALKVAAEIAKILGFNEDESIYNAIFEKGSKSFT 1094
Query: 734 EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL 793
+KLWNG YFNYD+S S S SI +D LAG WY +CGL + D+A S+L + YNV
Sbjct: 1095 KKLWNGHYFNYDSSKSVHSDSIMSDMLAGHWYLLSCGLKSYMTFDQALSSLSIINEYNVK 1154
Query: 794 KVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEA 853
G GAVNGM P+G VD + +QS E+W G ++++AA+MI + + GI +
Sbjct: 1155 SYSKGVCGAVNGMRPEGTVDTTCLQSSEVWIGTSFSLAATMILHHMDSDAWDLVKGIVNS 1214
Query: 854 AWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
++ G+ +QTPEAW+ + +R+ YMRPL+IW++QWAL + K
Sbjct: 1215 SYQ--KWGFQYQTPEAWDQNGCFRAASYMRPLSIWSIQWALQKRK 1257
>gi|385648282|ref|NP_001245314.1| glucosidase, beta (bile acid) 2 [Sus scrofa]
Length = 915
Score = 572 bits (1473), Expect = e-160, Method: Compositional matrix adjust.
Identities = 336/831 (40%), Positives = 460/831 (55%), Gaps = 87/831 (10%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K FID L +G PLGG+G G+I R +RG+F RWQ+ P + + V+
Sbjct: 119 RKTRVEKKTPFIDLINCVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYRHQTVI 178
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
A+QF+V + R Y VL + P VL+ SW+W L G + YHALYPRAWTV
Sbjct: 179 ADQFTVCLRRKGQTVYQQVLSVEHPSVLR--------SWNWGLCGSFAFYHALYPRAWTV 230
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P + + CRQI+PI+PH+Y++SS PV VF + N G + D++++F+ N +GG
Sbjct: 231 YQ-LPGQSVTLTCRQITPILPHDYQDSSLPVGVFVWDAENEGDETLDVSIMFSMRNGLGG 289
Query: 252 DSEFTGQHYNSKTKMN-DG--VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
D + G +N + DG + +LLHH T LP P T A+AA+ T V+ F
Sbjct: 290 DEDAPGGLWNEPFCLERDGETIQGLLLHHPT---LPNPYTMAMAARLTADTTVTHITAFD 346
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
G T + +W ++ G D + S S+ G I A+ + +PP ++
Sbjct: 347 PDG-----TGQQVWQDLLXDGQLD--SPAGQSPPSQKGVGIAGAVCVTTRLPPGGRRRLE 399
Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
FSLAWD P + F G+ Y+RRYT+F+G +AA ++ A+ + WE +I AWQ P+L
Sbjct: 400 FSLAWDMPRIMFGAKGQVYHRRYTRFFGPDGDAAPALSHHALSRYADWEERISAWQSPVL 459
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
+D+ LP WY LFNELY+L GG VW ++V
Sbjct: 460 DDRSLPAWYKSALFNELYFLADGGTVW------------------------------LEV 489
Query: 487 PNQN--DTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWN 544
P + + V + ++ IL++ G+F YLEG EY M+N
Sbjct: 490 PEDSLPEDLVGSMCQLRPILQEY----------------------GRFGYLEGQEYRMYN 527
Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
TYDVHFY+SFALIML+PK++LS+Q D A A + D ++ + L G K +PHDI
Sbjct: 528 TYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTERRYLMSGITAPVKRRNVIPHDI 587
Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
G D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F + +WP M
Sbjct: 588 GDPDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYYLTGDQVFLRDMWPVCLAVMESE 647
Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
+FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G R +D F
Sbjct: 648 MKFDKDQDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGTREVQDQF 707
Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
+ + YE+ LWNG Y+NYD S Q+ SI +DQ AGQW+ RA GL +
Sbjct: 708 SSILSRGREAYERLLWNGRYYNYDCSSQPQAYSIMSDQCAGQWFLRASGLGEGDTEVFPT 767
Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
AL+ ++ +NV GG GAVNGM P G D SS+QS E+W GV YA+AA+MI E
Sbjct: 768 PHVVQALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYALAATMIQE 827
Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
L GF+TA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 828 GLTWEGFRTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 876
>gi|426361725|ref|XP_004048050.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Gorilla
gorilla gorilla]
Length = 927
Score = 572 bits (1473), Expect = e-160, Method: Compositional matrix adjust.
Identities = 332/831 (39%), Positives = 459/831 (55%), Gaps = 87/831 (10%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 127 RKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
A+QF+V + R Y VL + P VL+ +W+W L G + YHALYPRAWTV
Sbjct: 187 ADQFTVCLRREGQTVYQQVLSLERPSVLR--------TWNWGLCGYFAFYHALYPRAWTV 238
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+ N +GG
Sbjct: 239 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 297
Query: 252 DSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
+ G +N + + V +LLHH T LP P T A+AA+ T V+
Sbjct: 298 GDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTADTTVTH----- 349
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
I+ T + +W ++ + G D + S ++ G I A+ S + P + ++
Sbjct: 350 ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLRPRGQCRLE 407
Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
FSLAWD P + F G+ +YRRYT+F+G +AA ++ A+ + WE +I AWQ P+L
Sbjct: 408 FSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISAWQSPVL 467
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
+D+ LP WY LFNELY+L GG VW +
Sbjct: 468 DDRSLPAWYKSALFNELYFLADGGTVWLE------------------------------- 496
Query: 487 PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWN 544
+LE +L G N+ L+ + G+F YLEG EY M+N
Sbjct: 497 ----------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYN 535
Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
TYDVHFY+SFALIML+PK++LS+Q D A A + D ++ + L G K +PHDI
Sbjct: 536 TYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDI 595
Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
G D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP M
Sbjct: 596 GDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESE 655
Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
+FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +D F
Sbjct: 656 MKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKF 715
Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
+ + YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ +ACGL +
Sbjct: 716 SSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPT 775
Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
AL+ ++ NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E
Sbjct: 776 QHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 835
Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
L GFQTA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 836 GLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|297684086|ref|XP_002819683.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Pongo
abelii]
Length = 927
Score = 572 bits (1473), Expect = e-160, Method: Compositional matrix adjust.
Identities = 333/824 (40%), Positives = 456/824 (55%), Gaps = 87/824 (10%)
Query: 79 KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
K FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+A+QF+V
Sbjct: 134 KTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVIADQFTVC 193
Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
+ R Y VL + P VL+ SW+W L G + YHALYPRAWTV++ P
Sbjct: 194 LRREGQTVYQQVLSLERPTVLR--------SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQ 244
Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
+ + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+ N +GG + G
Sbjct: 245 NVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGGGDDAPGG 304
Query: 259 HYNSKTKMNDG---VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFVISGNSLG 314
+N + G V +LLHH T LP P T A+AA+ T V+ I+
Sbjct: 305 LWNEPFCLERGRETVQGLLLHHPT---LPNPYTMAVAARVTADTTVTH-----ITAFDPD 356
Query: 315 LTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDC 374
T + +W ++ + G D + S ++ G I A+ S + P + ++ FSLAWD
Sbjct: 357 STGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCISSKLRPRGQCRLEFSLAWDM 414
Query: 375 PEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
P + F G+ +YRRYT+F+G +AA ++ A+ + WE +I AWQ P+L+D+ LP
Sbjct: 415 PRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISAWQSPVLDDRSLPA 474
Query: 434 WYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTA 493
WY LFNELY+L GG VW +
Sbjct: 475 WYKSALFNELYFLADGGTVWLE-------------------------------------- 496
Query: 494 VNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWNTYDVHFY 551
+LE +L G N+ L+ + G+F YLEG EY M+NTYDVHFY
Sbjct: 497 ---------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFY 542
Query: 552 SSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--P 609
+SFALIML+PK++LS+Q D A A + D ++ + L G K +PHDIG D P
Sbjct: 543 ASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEP 602
Query: 610 WFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDG 669
W VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP M +FD+D
Sbjct: 603 WLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDH 662
Query: 670 DGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA 729
DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +D F +
Sbjct: 663 DGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRG 722
Query: 730 KVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKARSAL 784
+ YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ +ACGL + AL
Sbjct: 723 QEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTPHVVRAL 782
Query: 785 KKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
+ ++ NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E L GF
Sbjct: 783 QTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGF 842
Query: 845 QTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
QTA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 843 QTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|402897018|ref|XP_003911574.1| PREDICTED: non-lysosomal glucosylceramidase [Papio anubis]
Length = 927
Score = 572 bits (1473), Expect = e-160, Method: Compositional matrix adjust.
Identities = 337/856 (39%), Positives = 469/856 (54%), Gaps = 84/856 (9%)
Query: 46 PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
P ++ + ++ +G+R L R+ + K FID L +G PLGG+G
Sbjct: 100 PFQANNVSLRNMIKHIGMGLRYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGG 159
Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
G+I R +RG+F RWQ+ P + + + V+A+QF+V + R Y VL + P VL+
Sbjct: 160 GTITRGWRGQFCRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSVLR---- 215
Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
SW+W L G + YHALYPRAWTV++ P + + CRQI+PI+PH+Y++SS PV V
Sbjct: 216 ----SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGV 270
Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG---VHAVLLHHRTSH 281
F + + N G + D++++F+ N +GG + G +N + G V +LLHH T
Sbjct: 271 FVWDVENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLEHGGETVQGLLLHHPT-- 328
Query: 282 QLP-PVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSV 340
LP P T A+AA+ T V+ I+ T + +W ++ + G D + S
Sbjct: 329 -LPNPYTMAVAARVTADTTVTH-----ITAFDPDSTGQQVWQDLLQDGQLD--SPAGQST 380
Query: 341 TSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNA 399
++ G I A+ S + P + ++ FSLAWD P + F G+ +YRRYT+F+G +A
Sbjct: 381 PTQKGVGIAGAVCVSSKLRPRGQCRLEFSLAWDMPRILFGAKGQVHYRRYTRFFGQDGDA 440
Query: 400 AANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPP 459
A ++ A+ ++ WE +I AWQ P+L+D+ LP WY LFNELY+L GG VW +
Sbjct: 441 APALSHYALCQYAEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE---- 496
Query: 460 VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGT 519
R S+ E++ N
Sbjct: 497 --------------------------------------VREDSLPEEL----GRNMCHLC 514
Query: 520 NLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHD 579
L+D G+F YLEG EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A + D
Sbjct: 515 PTLRD----YGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLRED 570
Query: 580 PSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
++ + L G K +PHDIG D PW VNAY ++DTA WKDLN KFVLQVYR
Sbjct: 571 LTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYR 630
Query: 638 DVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGG 697
D TGD+ F K +WP M +FD+D DG+IEN G+ DQTYD W +G S Y GG
Sbjct: 631 DYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSVYCGG 690
Query: 698 LWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQ 756
LW+AA+ +A G + +D F + + YE+ LWNG Y+NYD+S QS S+
Sbjct: 691 LWLAAVAVMVQMAALCGAQDIQDKFSSILHRGQEAYERLLWNGRYYNYDSSSQPQSRSVM 750
Query: 757 ADQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRV 812
+DQ AGQW+ +ACGL + AL+ ++ NV GG GAVNGM P G
Sbjct: 751 SDQCAGQWFLKACGLGEGETEVFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVP 810
Query: 813 DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNT 872
D SS+QS E+W GV Y +AA+MI E L GFQTA G Y W GL AFQTPEA+
Sbjct: 811 DKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQ 868
Query: 873 DDQYRSLCYMRPLAIW 888
+RSL YMRPL+IW
Sbjct: 869 QRVFRSLAYMRPLSIW 884
>gi|444729906|gb|ELW70309.1| Non-lysosomal glucosylceramidase [Tupaia chinensis]
Length = 1001
Score = 571 bits (1472), Expect = e-160, Method: Compositional matrix adjust.
Identities = 352/894 (39%), Positives = 488/894 (54%), Gaps = 74/894 (8%)
Query: 46 PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
P ++ V+ +G+R L R+ + K FID L +G PLGG+G
Sbjct: 92 PFQANNISLSNLVKHLGMGLRYLQWWYRKTQVEKKTPFIDMINCVPLRQIYGCPLGGIGG 151
Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
G+I R +RG+F RWQ+ P + + + V+A+QF+V + R Y VL P VL+
Sbjct: 152 GTITRGWRGQFCRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLDHPSVLR---- 207
Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
SW+W L G + YHALYPRAWTV++ P + + CRQI+PI+PH+Y++SS PV V
Sbjct: 208 ----SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGV 262
Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMN-DG--VHAVLLHHRTSH 281
F + + N G + D++++F+ N +GG+ + G +N + DG V +LLHH T
Sbjct: 263 FVWDVENEGDEALDVSIMFSMRNGLGGEDDAPGGLWNEPFCLERDGETVQGLLLHHPT-- 320
Query: 282 QLP-PVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSV 340
LP P T A+AA+ T V+ F G T + +W ++ + G D + S
Sbjct: 321 -LPNPYTMAVAARLTANTIVTHITAFDPDG-----TGQQVWQDLLQDGQLD--SPAGRST 372
Query: 341 TSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNA 399
S+ G I A+ S +PP + + FSLAWD P++ F G+ +YRRYT+F+ +A
Sbjct: 373 PSQKGVGIAGAVCISGKLPPRGQCCLEFSLAWDMPKIVFGAKGQVHYRRYTRFFSPDGDA 432
Query: 400 AANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPP 459
A ++ A+ + WE +I AWQ PIL+D+ LP WY LFNELY+L GG VW + P
Sbjct: 433 APALSHYALCRYADWEERISAWQSPILDDRSLPAWYKSALFNELYFLADGGTVWLE--VP 490
Query: 460 VHSLVT-IGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILER------MSSILEQIY---- 508
SL +G L D R + + L R S++ ++Y
Sbjct: 491 EDSLPEELGGGMHQLRPILRDYGRFGYLEGMLNPCPLCLRRSLPAWYKSALFNELYFLAD 550
Query: 509 ------------TPVALNSSFGTNLLQDGEENIGQFLYLEGI-------------EYLMW 543
P L G + L+ + G+F YLEG+ EY M+
Sbjct: 551 GGTVWLEVPEDSLPEELGG--GMHQLRPILRDYGRFGYLEGMAAGGLQYGWGWGQEYRMY 608
Query: 544 NTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHD 603
NTYDVHFY+SFAL+ML+PK++LS+Q D A A + D ++ + L G K +PHD
Sbjct: 609 NTYDVHFYASFALVMLWPKLELSLQYDMALATLREDLTRRRYLMSGVIAPVKRRNVIPHD 668
Query: 604 IGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAY 661
IG D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP M
Sbjct: 669 IGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMES 728
Query: 662 MDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDY 721
+FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + ++
Sbjct: 729 EMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDVQEK 788
Query: 722 FLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVD 776
F + + YE+ LWNG Y+NYD+S QS SI +DQ AGQW+ +ACGL +
Sbjct: 789 FTSILTRGQEAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLKACGLGEGDTEVFP 848
Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
AL+ ++ +NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI
Sbjct: 849 TPHVVRALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQ 908
Query: 837 EDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
E L GF+TA G Y W GL AFQTPEA+ +RSL YMRPL+IWAM
Sbjct: 909 EGLTWEGFRTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIWAM 960
>gi|224090236|ref|XP_002190896.1| PREDICTED: non-lysosomal glucosylceramidase [Taeniopygia guttata]
Length = 826
Score = 569 bits (1466), Expect = e-159, Method: Compositional matrix adjust.
Identities = 348/838 (41%), Positives = 469/838 (55%), Gaps = 79/838 (9%)
Query: 71 IREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPV 130
+++ + K AFID L +G PLGG+G G+I R +RGEF RWQ+ P + V
Sbjct: 40 VKKTCIEKKTAFIDLLCAVPLQQIYGCPLGGIGGGTITRGWRGEFCRWQLNPGLYHYGTV 99
Query: 131 LANQFSVFVSRSNGQK-YSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAW 189
+ANQF+V + R GQ Y VL + P L+ W+W G + YHALYPRAW
Sbjct: 100 IANQFTVCL-RCKGQTIYQQVLSMERPSSLQ--------GWNWGYCGHYAFYHALYPRAW 150
Query: 190 TVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSV 249
V+E P ++ + CRQ+SP+IPH+YK+SS PV VF + + N +++++F+ N +
Sbjct: 151 LVYE-LPGQQVVLTCRQVSPVIPHDYKDSSLPVGVFIWEVENRRDEDVEVSIMFSLQNGM 209
Query: 250 GGDSEFTGQHYNSK-TKMNDG--VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHF 306
G + +G H+N T DG V VLLHH + + P T A++A+E G V+
Sbjct: 210 GTKEDGSGGHWNEPFTFQKDGERVAGVLLHHCPA--VNPFTLAISAREKAGTGVT----H 263
Query: 307 VISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQV 366
V + N GL +++W ++ + G D + S +E G AA+ AS VP +
Sbjct: 264 VTAFNPTGL-GREVWQDLLQDGRLD--SPAGKSRPTEKGEVTAAAVCASCRVPAHGHKTL 320
Query: 367 TFSLAWDCPEVNFMSG-KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPI 425
+LAWD P V+F S K + RRYT+F+G+ +AA ++ A+ + WE +IEAWQ PI
Sbjct: 321 EMALAWDMPCVHFGSKEKLHVRRYTRFFGSKGDAAPTLSHYALTHYEEWERKIEAWQNPI 380
Query: 426 LEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVD 485
LE+ +LP WY LFNELY++ GG +W + P D DL+
Sbjct: 381 LENSQLPAWYKSALFNELYFMTDGGTIWVEVPP---------------DCCAEDLQ---- 421
Query: 486 VPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNT 545
P S +L++ G+F YLEG EY M+NT
Sbjct: 422 -----------------------GPAGAGLSHLLPVLRE----YGRFAYLEGQEYRMYNT 454
Query: 546 YDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG 605
YDVHFY+SFALIML+PK+Q+S+Q D A V+ D + L GQ K+ VPHDIG
Sbjct: 455 YDVHFYASFALIMLWPKLQISLQYDIAVTVVNEDVQPRQYLMGGQTAQVKMKNVVPHDIG 514
Query: 606 IC--DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMD 663
+PW VNAY ++DTA WKDLN KFVLQVYRD T D + + +WP M
Sbjct: 515 DPGDEPWQRVNAYLMHDTADWKDLNLKFVLQVYRDYYLTHDSLYLQDMWPVCQAVMESEL 574
Query: 664 QFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFL 723
+FD D DG+IEN GF DQTYD W V AY GGLW+AA+ +A +GD +L
Sbjct: 575 KFDTDNDGLIENGGFADQTYDAWVVIRRCAYCGGLWLAAVCMMCKMAEVLGDTEIRQKYL 634
Query: 724 FKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDED 778
K K +E+ LWNG Y+NYD+SGS SSSI +DQ AGQW+ ACGL + +
Sbjct: 635 GILNKGKEAFERMLWNGKYYNYDSSGSDTSSSIMSDQCAGQWFLGACGLDQGEFEVFPKS 694
Query: 779 KARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHED 838
SAL+ ++ NVL GG GAVNGM PDG D SS+QS E+W GV Y++AA+MI E
Sbjct: 695 HILSALRTIFEKNVLSFAGGTMGAVNGMRPDGVPDTSSVQSNEVWVGVVYSLAATMIQEG 754
Query: 839 LADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
+ + GF+TA G Y W LG AFQTPEA+ YRSL YMRPL+IW+MQ AL R
Sbjct: 755 MVEEGFRTAEGCYRTVWE--RLGMAFQTPEAYREKKVYRSLAYMRPLSIWSMQLALER 810
>gi|301767028|ref|XP_002918945.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Ailuropoda melanoleuca]
Length = 928
Score = 569 bits (1466), Expect = e-159, Method: Compositional matrix adjust.
Identities = 335/831 (40%), Positives = 461/831 (55%), Gaps = 87/831 (10%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 130 RKTQVEKKTPFIDLINCVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 189
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
A+QF+V + R Y VL + P VL+ SW+W L G + YHALYPRAWTV
Sbjct: 190 ADQFTVCLRREGKTVYQQVLSVERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 241
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+ N +G
Sbjct: 242 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGV 300
Query: 252 DSEFTGQHYNSKTKMN-DG--VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
+ G +N + DG V +LLHH T LP P T A+AA+ T V+
Sbjct: 301 GDDAPGGLWNEPFCLERDGETVQGLLLHHPT---LPNPYTMAVAARLTADTTVTH----- 352
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
I+ T + +W ++ + G D + S S+ G I A+ + +PP + ++
Sbjct: 353 ITAFDPDSTGQQVWQDLLQDGQLD--SPAGQSTPSQKGVGIAGAVCVTGKLPPRGQCRLE 410
Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
FSLAWD P + F G+ YYRRYT+F+G NAA ++ A+ + WE +I AWQ PIL
Sbjct: 411 FSLAWDMPRIMFGAKGQVYYRRYTRFFGRDGNAAPALSHYALCRYVDWEEKISAWQSPIL 470
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
+D+ LP WY LFNELY+L GG VW ++V
Sbjct: 471 DDRSLPAWYKSALFNELYFLADGGTVW------------------------------LEV 500
Query: 487 PNQN--DTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWN 544
P + + V + ++ IL + G+F YLEG EY M+N
Sbjct: 501 PEDSLPEELVGSMCQLRPILREY----------------------GRFGYLEGQEYRMYN 538
Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
TYDVHFY+SFAL+ML+PK++LS+Q D A A + D ++ + L G K +PHDI
Sbjct: 539 TYDVHFYASFALVMLWPKLELSLQYDMALATLREDLTRRQYLMSGVTAPVKRRNVIPHDI 598
Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
G D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F + +WP M
Sbjct: 599 GDPDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYYLTGDQSFLRDMWPVCLAVMESE 658
Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
+FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +D F
Sbjct: 659 MKFDKDQDGLIENGGYADQTYDGWITTGPSAYCGGLWLAAVAVMVQIAVLCGAQDVQDKF 718
Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
++ + YE+ LWNG Y+NYD S QS SI +DQ AGQW+ +A GL +
Sbjct: 719 SSILRRGREAYERLLWNGRYYNYDCSPQPQSCSIMSDQCAGQWFLKASGLGEGDTEVFPT 778
Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
AL+ ++ +NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E
Sbjct: 779 PHVVCALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQE 838
Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
L GFQTA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 839 GLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 887
>gi|143018484|sp|Q5M868.2|GBA2_RAT RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
AltName: Full=Beta-glucocerebrosidase 2;
Short=Beta-glucosidase 2; AltName:
Full=Glucosylceramidase 2
Length = 912
Score = 568 bits (1464), Expect = e-159, Method: Compositional matrix adjust.
Identities = 332/828 (40%), Positives = 459/828 (55%), Gaps = 81/828 (9%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K FID F L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 118 RKTQVEKKTPFIDMFNSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHQTVI 177
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
A+QF V + R Y VL + P VL+ SW+W L G + YHALYPRAWTV
Sbjct: 178 ADQFIVCLRRDGRTVYQQVLSLELPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 229
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+ N +GG
Sbjct: 230 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDETLDVSIMFSMRNGLGG 288
Query: 252 DSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI 308
+ + G +N ++ DG V +LLHH T P T A+AA+ H +
Sbjct: 289 EDDAAGGLWNEPFRLEQDGTTVQGLLLHHPTPPN--PYTMAVAAR-----HTADTTVTYT 341
Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
+ T + +W ++ + G D + S ++ G + A+ AS + P + F
Sbjct: 342 TAFDPDSTGQQVWQDLLQDGQLD--SPAGQSTPTQRGEGVAGAVCASSKLLPRGRCCLEF 399
Query: 369 SLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
SLAWD P + F G+ +YRRYT+F+G+ + A ++ A+ ++ WE I AWQ P+L+
Sbjct: 400 SLAWDMPRIMFGAKGQVHYRRYTRFFGSDGDVAPALSHYALCQYAGWENSISAWQNPVLD 459
Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
D+ LP WY LFNELY+L GG VW ++VP
Sbjct: 460 DRSLPAWYKSALFNELYFLADGGTVW------------------------------LEVP 489
Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
+ + E + + Q+ +LQD G+F YLEG EY M+NTYD
Sbjct: 490 EDS-----LPEELGGSMYQL-----------RPILQD----YGRFGYLEGQEYRMYNTYD 529
Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC 607
VHFY+SFAL+ML+PK++LS+Q D A A D ++ + L G K +PHDIG
Sbjct: 530 VHFYASFALVMLWPKLELSLQYDMALATFKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDP 589
Query: 608 D--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF 665
D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP M +F
Sbjct: 590 DDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMKF 649
Query: 666 DRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFK 725
D+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +D F
Sbjct: 650 DKDQDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSSI 709
Query: 726 FQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKA 780
+ + YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ RACGL +
Sbjct: 710 LCRGREAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLRACGLGEGDTEVFPTLHV 769
Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
ALK ++ NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E L
Sbjct: 770 VRALKTIFELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLT 829
Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
GF+TA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 830 WEGFRTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 875
>gi|189217893|ref|NP_001013109.2| non-lysosomal glucosylceramidase [Rattus norvegicus]
gi|149045757|gb|EDL98757.1| glucosidase beta 2 [Rattus norvegicus]
Length = 904
Score = 568 bits (1463), Expect = e-159, Method: Compositional matrix adjust.
Identities = 332/828 (40%), Positives = 459/828 (55%), Gaps = 81/828 (9%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K FID F L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 118 RKTQVEKKTPFIDMFNSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHQTVI 177
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
A+QF V + R Y VL + P VL+ SW+W L G + YHALYPRAWTV
Sbjct: 178 ADQFIVCLRRDGRTVYQQVLSLELPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 229
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+ N +GG
Sbjct: 230 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDETLDVSIMFSMRNGLGG 288
Query: 252 DSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI 308
+ + G +N ++ DG V +LLHH T P T A+AA+ H +
Sbjct: 289 EDDAAGGLWNEPFRLEQDGTTVQGLLLHHPTPPN--PYTMAVAAR-----HTADTTVTYT 341
Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
+ T + +W ++ + G D + S ++ G + A+ AS + P + F
Sbjct: 342 TAFDPDSTGQQVWQDLLQDGQLD--SPAGQSTPTQRGEGVAGAVCASSKLLPRGRCCLEF 399
Query: 369 SLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
SLAWD P + F G+ +YRRYT+F+G+ + A ++ A+ ++ WE I AWQ P+L+
Sbjct: 400 SLAWDMPRIMFGAKGQVHYRRYTRFFGSDGDVAPALSHYALCQYAGWENSISAWQNPVLD 459
Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
D+ LP WY LFNELY+L GG VW ++VP
Sbjct: 460 DRSLPAWYKSALFNELYFLADGGTVW------------------------------LEVP 489
Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
+ + E + + Q+ +LQD G+F YLEG EY M+NTYD
Sbjct: 490 EDS-----LPEELGGSMYQL-----------RPILQD----YGRFGYLEGQEYRMYNTYD 529
Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC 607
VHFY+SFAL+ML+PK++LS+Q D A A D ++ + L G K +PHDIG
Sbjct: 530 VHFYASFALVMLWPKLELSLQYDMALATFKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDP 589
Query: 608 D--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF 665
D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP M +F
Sbjct: 590 DDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMKF 649
Query: 666 DRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFK 725
D+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +D F
Sbjct: 650 DKDQDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSSI 709
Query: 726 FQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKA 780
+ + YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ RACGL +
Sbjct: 710 LCRGREAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLRACGLGEGDTEVFPTLHV 769
Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
ALK ++ NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E L
Sbjct: 770 VRALKTIFELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLT 829
Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
GF+TA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 830 WEGFRTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 875
>gi|281339788|gb|EFB15372.1| hypothetical protein PANDA_007478 [Ailuropoda melanoleuca]
Length = 923
Score = 568 bits (1463), Expect = e-159, Method: Compositional matrix adjust.
Identities = 335/831 (40%), Positives = 461/831 (55%), Gaps = 87/831 (10%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 127 RKTQVEKKTPFIDLINCVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
A+QF+V + R Y VL + P VL+ SW+W L G + YHALYPRAWTV
Sbjct: 187 ADQFTVCLRREGKTVYQQVLSVERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 238
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+ N +G
Sbjct: 239 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGV 297
Query: 252 DSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
+ G +N + DG V +LLHH T LP P T A+AA+ T V+
Sbjct: 298 GDDAPGGLWNEPFCLERDGETVQGLLLHHPT---LPNPYTMAVAARLTADTTVTH----- 349
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
I+ T + +W ++ + G D + S S+ G I A+ + +PP + ++
Sbjct: 350 ITAFDPDSTGQQVWQDLLQDGQLD--SPAGQSTPSQKGVGIAGAVCVTGKLPPRGQCRLE 407
Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
FSLAWD P + F G+ YYRRYT+F+G NAA ++ A+ + WE +I AWQ PIL
Sbjct: 408 FSLAWDMPRIMFGAKGQVYYRRYTRFFGRDGNAAPALSHYALCRYVDWEEKISAWQSPIL 467
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
+D+ LP WY LFNELY+L GG VW ++V
Sbjct: 468 DDRSLPAWYKSALFNELYFLADGGTVW------------------------------LEV 497
Query: 487 PNQN--DTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWN 544
P + + V + ++ IL + G+F YLEG EY M+N
Sbjct: 498 PEDSLPEELVGSMCQLRPILREY----------------------GRFGYLEGQEYRMYN 535
Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
TYDVHFY+SFAL+ML+PK++LS+Q D A A + D ++ + L G K +PHDI
Sbjct: 536 TYDVHFYASFALVMLWPKLELSLQYDMALATLREDLTRRQYLMSGVTAPVKRRNVIPHDI 595
Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
G D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F + +WP M
Sbjct: 596 GDPDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYYLTGDQSFLRDMWPVCLAVMESE 655
Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
+FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +D F
Sbjct: 656 MKFDKDQDGLIENGGYADQTYDGWITTGPSAYCGGLWLAAVAVMVQIAVLCGAQDVQDKF 715
Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
++ + YE+ LWNG Y+NYD S QS SI +DQ AGQW+ +A GL +
Sbjct: 716 SSILRRGREAYERLLWNGRYYNYDCSPQPQSCSIMSDQCAGQWFLKASGLGEGDTEVFPT 775
Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
AL+ ++ +NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E
Sbjct: 776 PHVVCALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQE 835
Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
L GFQTA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 836 GLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|449526389|ref|XP_004170196.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Cucumis
sativus]
Length = 440
Score = 567 bits (1462), Expect = e-158, Method: Compositional matrix adjust.
Identities = 274/441 (62%), Positives = 337/441 (76%), Gaps = 3/441 (0%)
Query: 15 EKEPLNSSFDKVDPAKPASLTWQRKLS-TGEIPLSQFTLNWKETVQLAPIGVRILCLIRE 73
EK +S +VDP+KP SLTW+RKL TG+ P S F+ + + G R+ +E
Sbjct: 2 EKGGNGASSTEVDPSKPPSLTWKRKLDFTGKSPES-FSFTLTDAWHMGMTGYRLWRNGKE 60
Query: 74 EAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLAN 133
E AKG+ + F +T HGV LGG+G+GSIGRSYRGEFQR+Q+F CED+PVLAN
Sbjct: 61 EIAKGRIPIYEFFSDVPITCYHGVSLGGIGAGSIGRSYRGEFQRFQMFYGPCEDEPVLAN 120
Query: 134 QFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHE 193
QFSVFVSR NG K+SSVLC P+ KD GI SWDWNL G+ STYHAL+PR+WTV++
Sbjct: 121 QFSVFVSRPNGNKFSSVLCSAKPQKSKDGKQTGIESWDWNLSGENSTYHALFPRSWTVYD 180
Query: 194 GEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS 253
GEPDP+L+IVCRQ+SPIIPHNYKESS+PVSVFT+ + N G+TSA +TLLFTW NSVGG S
Sbjct: 181 GEPDPDLKIVCRQMSPIIPHNYKESSFPVSVFTFKLSNEGQTSAQVTLLFTWANSVGGKS 240
Query: 254 EFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSL 313
FTG H+NSK DG VLLHH++++ P VT+ +AA+ TD VHVSLCP FVISG+S
Sbjct: 241 GFTGHHFNSKMGAEDGAQGVLLHHKSANGRPTVTYGIAAEATDDVHVSLCPCFVISGDSE 300
Query: 314 GLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWD 373
G++AKDMW EIK HGSFD L S+ + S+PG SIGAA+AA++T+PP S VTFSLAWD
Sbjct: 301 GISAKDMWQEIKNHGSFDNLGSVGANEGSKPGCSIGAAVAATLTIPPTSARTVTFSLAWD 360
Query: 374 CPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
CPEV F GKTY+R+Y+KFYGT +AA IARDAI +HG WE +IEAWQRPI+EDKRLP+
Sbjct: 361 CPEVKF-DGKTYHRQYSKFYGTLGDAAEIIARDAISKHGKWEAEIEAWQRPIIEDKRLPD 419
Query: 434 WYPITLFNELYYLNAGGAVWT 454
WYP+TLFNELY+LN+GG +WT
Sbjct: 420 WYPVTLFNELYFLNSGGTIWT 440
>gi|50511011|dbj|BAD32491.1| mKIAA1605 protein [Mus musculus]
Length = 940
Score = 565 bits (1457), Expect = e-158, Method: Compositional matrix adjust.
Identities = 335/855 (39%), Positives = 471/855 (55%), Gaps = 82/855 (9%)
Query: 46 PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
P ++ V+ +G+R L R+ + K FID L +G PLGG+G
Sbjct: 113 PFQANNISLSNLVKHLGMGLRYLKWWYRKTHVEKKTPFIDMLNSLPLRQIYGCPLGGIGG 172
Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
G+I R +RG+F RWQ+ P + + + V+A+QF V + R Y VL + P VL+
Sbjct: 173 GTITRGWRGQFCRWQLNPGMYQHQTVIADQFIVCLRRDGRTVYQQVLSLELPNVLR---- 228
Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
SW+W L G + YHALYPRAWTV++ P + + CRQ++PI+PH+Y++SS PV V
Sbjct: 229 ----SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQVTPILPHDYQDSSLPVGV 283
Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG---VHAVLLHHRTSH 281
F + + N G + D+++ F+ N +GG+ + G +N ++ G V +LLHH T
Sbjct: 284 FVWDVENEGDETLDVSITFSMRNGLGGEDDAAGSLWNEPFRLEQGGTTVQGLLLHHPTPP 343
Query: 282 QLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVT 341
P T A+AA+ T V+ F +G T + +W ++ + G D + S
Sbjct: 344 N--PYTMAVAARCTADTTVTHTTAFDPNG-----TGQQVWQDLLQDGQLD--SPAGQSTP 394
Query: 342 SEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAA 400
++ G I A+ S + P S + FSLAWD P++ F + +YRRYT+F+G+ + A
Sbjct: 395 TQKGEGIAGAVCVSSKLLPRSRCCLEFSLAWDMPKIMFGAKSQVHYRRYTRFFGSDGDVA 454
Query: 401 ANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPV 460
++ A+ + WE +I AWQ P+L+D+ LP WY LFNELY+L GG VW
Sbjct: 455 PALSHYALCHYADWEDRISAWQNPVLDDRTLPAWYKSALFNELYFLADGGTVW------- 507
Query: 461 HSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTN 520
++VP A ++ E + + Q+ +
Sbjct: 508 -----------------------LEVP-----ADSLPEGLGGSMRQL-----------RS 528
Query: 521 LLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
LQD G+F YLEG EY M+NTYDVHFY+SFAL+ML+PK++LS+Q D A A + D
Sbjct: 529 TLQD----YGRFGYLEGQEYRMYNTYDVHFYASFALVMLWPKLELSLQYDMALATLKEDL 584
Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRD 638
++ + L G K +PHDIG D PW VNAY ++DTA WKDLN KFVLQ+YRD
Sbjct: 585 TRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRD 644
Query: 639 VVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TGD+ F + +WP M +FD+D DG+IEN G+ DQTYD W +G SAY GGL
Sbjct: 645 YYLTGDQGFLEDMWPVCLAVMESEMKFDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGL 704
Query: 699 WVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQA 757
W+AA+ +A G + ++ F + + YE+ LWNG Y+NYD+S QS SI +
Sbjct: 705 WLAAVAVMVQMAVLCGAQDVQERFASILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMS 764
Query: 758 DQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVD 813
DQ AGQW+ RACGL + AL+ ++ NV GG GAVNGM P G D
Sbjct: 765 DQCAGQWFLRACGLGEGDTEVFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPD 824
Query: 814 MSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTD 873
SS+QS E+W GV Y +AA+MI E L GF+TA G Y W GL AFQTPEA+
Sbjct: 825 RSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWERLGL--AFQTPEAYCQQ 882
Query: 874 DQYRSLCYMRPLAIW 888
+RSL YMRPL+IW
Sbjct: 883 QVFRSLAYMRPLSIW 897
>gi|426361727|ref|XP_004048051.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Gorilla
gorilla gorilla]
Length = 933
Score = 565 bits (1457), Expect = e-158, Method: Compositional matrix adjust.
Identities = 332/837 (39%), Positives = 457/837 (54%), Gaps = 93/837 (11%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFP------RVC 125
R+ + K FID L +G PLGG+G G+I R +RG+F RWQ+ P V
Sbjct: 127 RKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186
Query: 126 EDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALY 185
D+P+ +F+V + R Y VL + P VL+ +W+W L G + YHALY
Sbjct: 187 ADQPICPLKFTVCLRREGQTVYQQVLSLERPSVLR--------TWNWGLCGYFAFYHALY 238
Query: 186 PRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTW 245
PRAWTV++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+
Sbjct: 239 PRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSM 297
Query: 246 TNSVGGDSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVHVS 301
N +GG + G +N + + V +LLHH T LP P T A+AA+ T V+
Sbjct: 298 RNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTADTTVT 354
Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
I+ T + +W ++ + G D + S ++ G I A+ S + P
Sbjct: 355 H-----ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLRPR 407
Query: 362 SEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
+ ++ FSLAWD P + F G+ +YRRYT+F+G +AA ++ A+ + WE +I A
Sbjct: 408 GQCRLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISA 467
Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDL 480
WQ P+L+D+ LP WY LFNELY+L GG VW +
Sbjct: 468 WQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE------------------------- 502
Query: 481 KRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGI 538
+LE +L G N+ L+ + G+F YLEG
Sbjct: 503 ----------------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQ 535
Query: 539 EYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLG 598
EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A + D ++ + L G K
Sbjct: 536 EYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRN 595
Query: 599 AVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVY 656
+PHDIG D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP
Sbjct: 596 VIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCL 655
Query: 657 VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR 716
M +FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G +
Sbjct: 656 AVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQ 715
Query: 717 GSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL---- 771
+D F + + YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ +ACGL
Sbjct: 716 DIQDKFSSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGD 775
Query: 772 LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVA 831
+ AL+ ++ NV GG GAVNGM P G D SS+QS E+W GV Y +A
Sbjct: 776 TEVFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLA 835
Query: 832 ASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
A+MI E L GFQTA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 836 ATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 890
>gi|395740303|ref|XP_003777396.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Pongo
abelii]
Length = 933
Score = 565 bits (1457), Expect = e-158, Method: Compositional matrix adjust.
Identities = 334/830 (40%), Positives = 454/830 (54%), Gaps = 93/830 (11%)
Query: 79 KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFP------RVCEDKPVLA 132
K FID L +G PLGG+G G+I R +RG+F RWQ+ P V D+PV
Sbjct: 134 KTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVIADQPVCP 193
Query: 133 NQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVH 192
+F+V + R Y VL + P VL+ SW+W L G + YHALYPRAWTV+
Sbjct: 194 LKFTVCLRREGQTVYQQVLSLERPTVLR--------SWNWGLCGYFAFYHALYPRAWTVY 245
Query: 193 EGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGD 252
+ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+ N +GG
Sbjct: 246 Q-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGGG 304
Query: 253 SEFTGQHYNSKTKMNDG---VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFVI 308
+ G +N + G V +LLHH T LP P T A+AA+ T V+ I
Sbjct: 305 DDAPGGLWNEPFCLERGRETVQGLLLHHPT---LPNPYTMAVAARVTADTTVTH-----I 356
Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
+ T + +W ++ + G D + S ++ G I A+ S + P + ++ F
Sbjct: 357 TAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCISSKLRPRGQCRLEF 414
Query: 369 SLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
SLAWD P + F G+ +YRRYT+F+G +AA ++ A+ + WE +I AWQ P+L+
Sbjct: 415 SLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISAWQSPVLD 474
Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
D+ LP WY LFNELY+L GG VW +
Sbjct: 475 DRSLPAWYKSALFNELYFLADGGTVWLE-------------------------------- 502
Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWNT 545
+LE +L G N+ L+ + G+F YLEG EY M+NT
Sbjct: 503 ---------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNT 542
Query: 546 YDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG 605
YDVHFY+SFALIML+PK++LS+Q D A A + D ++ + L G K +PHDIG
Sbjct: 543 YDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIG 602
Query: 606 ICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMD 663
D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP M
Sbjct: 603 DPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEM 662
Query: 664 QFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFL 723
+FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +D F
Sbjct: 663 KFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFS 722
Query: 724 FKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDED 778
+ + YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ +ACGL +
Sbjct: 723 SILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTP 782
Query: 779 KARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHED 838
AL+ ++ NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E
Sbjct: 783 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 842
Query: 839 LADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
L GFQTA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 843 LTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 890
>gi|148670514|gb|EDL02461.1| glucosidase beta 2, isoform CRA_a [Mus musculus]
Length = 918
Score = 565 bits (1457), Expect = e-158, Method: Compositional matrix adjust.
Identities = 335/855 (39%), Positives = 471/855 (55%), Gaps = 82/855 (9%)
Query: 46 PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
P ++ V+ +G+R L R+ + K FID L +G PLGG+G
Sbjct: 91 PFQANNISLSNLVKHLGMGLRYLKWWYRKTHVEKKTPFIDMLNSLPLRQIYGCPLGGIGG 150
Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
G+I R +RG+F RWQ+ P + + + V+A+QF V + R Y VL + P VL+
Sbjct: 151 GTITRGWRGQFCRWQLNPGMYQHQTVIADQFIVCLRRDGRTVYQQVLSLELPNVLR---- 206
Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
SW+W L G + YHALYPRAWTV++ P + + CRQ++PI+PH+Y++SS PV V
Sbjct: 207 ----SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQVTPILPHDYQDSSLPVGV 261
Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG---VHAVLLHHRTSH 281
F + + N G + D+++ F+ N +GG+ + G +N ++ G V +LLHH T
Sbjct: 262 FVWDVENEGDETLDVSITFSMRNGLGGEDDAAGSLWNEPFRLEQGGTTVQGLLLHHPTPP 321
Query: 282 QLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVT 341
P T A+AA+ T V+ F +G T + +W ++ + G D + S
Sbjct: 322 N--PYTMAVAARCTADTTVTHTTAFDPNG-----TGQQVWQDLLQDGQLD--SPAGQSTP 372
Query: 342 SEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAA 400
++ G I A+ S + P S + FSLAWD P++ F + +YRRYT+F+G+ + A
Sbjct: 373 TQKGEGIAGAVCVSSKLLPRSRCCLEFSLAWDMPKIMFGAKSQVHYRRYTRFFGSDGDVA 432
Query: 401 ANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPV 460
++ A+ + WE +I AWQ P+L+D+ LP WY LFNELY+L GG VW
Sbjct: 433 PALSHYALCHYADWEDRISAWQNPVLDDRTLPAWYKSALFNELYFLADGGTVW------- 485
Query: 461 HSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTN 520
++VP A ++ E + + Q+ +
Sbjct: 486 -----------------------LEVP-----ADSLPEGLGGSMRQL-----------RS 506
Query: 521 LLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
LQD G+F YLEG EY M+NTYDVHFY+SFAL+ML+PK++LS+Q D A A + D
Sbjct: 507 TLQD----YGRFGYLEGQEYRMYNTYDVHFYASFALVMLWPKLELSLQYDMALATLKEDL 562
Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRD 638
++ + L G K +PHDIG D PW VNAY ++DTA WKDLN KFVLQ+YRD
Sbjct: 563 TRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRD 622
Query: 639 VVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TGD+ F + +WP M +FD+D DG+IEN G+ DQTYD W +G SAY GGL
Sbjct: 623 YYLTGDQGFLEDMWPVCLAVMESEMKFDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGL 682
Query: 699 WVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQA 757
W+AA+ +A G + ++ F + + YE+ LWNG Y+NYD+S QS SI +
Sbjct: 683 WLAAVAVMVQMAVLCGAQDVQERFASILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMS 742
Query: 758 DQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVD 813
DQ AGQW+ RACGL + AL+ ++ NV GG GAVNGM P G D
Sbjct: 743 DQCAGQWFLRACGLGEGDTEVFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPD 802
Query: 814 MSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTD 873
SS+QS E+W GV Y +AA+MI E L GF+TA G Y W GL AFQTPEA+
Sbjct: 803 RSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWERLGL--AFQTPEAYCQQ 860
Query: 874 DQYRSLCYMRPLAIW 888
+RSL YMRPL+IW
Sbjct: 861 QVFRSLAYMRPLSIW 875
>gi|338720173|ref|XP_001497796.3| PREDICTED: non-lysosomal glucosylceramidase [Equus caballus]
Length = 983
Score = 565 bits (1456), Expect = e-158, Method: Compositional matrix adjust.
Identities = 332/822 (40%), Positives = 458/822 (55%), Gaps = 83/822 (10%)
Query: 79 KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
K FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+A+QF+V
Sbjct: 190 KTPFIDFMNCVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVIADQFTVC 249
Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
+ R Y VL + P VL+ SW+W L G + YHALYPRAWTV++ P
Sbjct: 250 LRREGQTVYQQVLSVERPRVLR--------SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQ 300
Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
+ + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+ N +GG + G
Sbjct: 301 NVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGGGDDAPGG 360
Query: 259 HYNSKTKM-NDG--VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFVISGNSLG 314
+N + DG +LLHH T LP P T A+AA+ T V+ I+
Sbjct: 361 LWNEPFCLEQDGETTQGLLLHHPT---LPNPYTMAVAARLTADTTVTH-----ITAFDPD 412
Query: 315 LTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDC 374
T + +W ++ + G D + S ++ G I A+ + + P ++ FSLAWD
Sbjct: 413 STGQQVWQDLLQDGQLD--SPAGPSTPTQKGVGIAGAVCVAGKLSPRGRCRLEFSLAWDM 470
Query: 375 PEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
P + F G+ YYRRYT+F+G +AA ++ A+ + WE +I AWQ PIL+D+ LP
Sbjct: 471 PRIMFGAKGQDYYRRYTRFFGPDGDAAPALSHYALCRYADWEKRISAWQSPILDDRSLPA 530
Query: 434 WYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTA 493
WY LFNELY+L GG VW ++VP +
Sbjct: 531 WYKSALFNELYFLADGGTVW------------------------------LEVPEDS--- 557
Query: 494 VNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSS 553
+ E ++ + Q+ +LQ+ G+F YLEG EY M+NTYDVHFY+S
Sbjct: 558 --LPEELAGSMCQL-----------RPILQE----YGRFGYLEGQEYRMYNTYDVHFYAS 600
Query: 554 FALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWF 611
FALIML+PK++LS+Q D A A + D ++ + L G K +PHDIG D PW
Sbjct: 601 FALIMLWPKLELSLQYDMALATLGEDLTRRRYLMSGVMAPVKKRNVIPHDIGDPDDEPWL 660
Query: 612 EVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDG 671
VNAY ++DTA WKDLN KFVLQ+YRD TGD+ F + +WP M +FD+D DG
Sbjct: 661 RVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLRDMWPVCLAVMESEMKFDKDQDG 720
Query: 672 MIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKV 731
+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +D F + +
Sbjct: 721 LIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDVQDRFSSILSRGQE 780
Query: 732 VYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKARSALKK 786
YE+ LWNG Y+NYD S QS SI +DQ AGQW+ RACGL + AL+
Sbjct: 781 AYERLLWNGRYYNYDCSSQPQSRSIMSDQCAGQWFLRACGLGEGDTEVFPTRHIVRALQT 840
Query: 787 VYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQT 846
++ NV GG GAVNGM P G D +S+QS E+W GV Y +AA+MI E L GF+T
Sbjct: 841 IFELNVQAFAGGAMGAVNGMQPHGVPDRTSVQSDEVWVGVVYGLAATMIQEGLTWEGFRT 900
Query: 847 ACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
A G Y W LG AFQTPEA+ +RSL YMRPL+IW
Sbjct: 901 AEGCYRTVWE--RLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 940
>gi|240120073|ref|NP_766280.2| non-lysosomal glucosylceramidase [Mus musculus]
gi|143018443|sp|Q69ZF3.2|GBA2_MOUSE RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
AltName: Full=Beta-glucocerebrosidase 2;
Short=Beta-glucosidase 2; AltName:
Full=Glucosylceramidase 2
gi|34784135|gb|AAH56935.1| Glucosidase beta 2 [Mus musculus]
Length = 918
Score = 565 bits (1456), Expect = e-158, Method: Compositional matrix adjust.
Identities = 335/855 (39%), Positives = 471/855 (55%), Gaps = 82/855 (9%)
Query: 46 PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
P ++ V+ +G+R L R+ + K FID L +G PLGG+G
Sbjct: 91 PFQANNISLSNLVKHLGMGLRYLKWWYRKTHVEKKTPFIDMLNSLPLRQIYGCPLGGIGG 150
Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
G+I R +RG+F RWQ+ P + + + V+A+QF V + R Y VL + P VL+
Sbjct: 151 GTITRGWRGQFCRWQLNPGMYQHQTVIADQFIVCLRRDGRTVYQQVLSLELPNVLR---- 206
Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
SW+W L G + YHALYPRAWTV++ P + + CRQ++PI+PH+Y++SS PV V
Sbjct: 207 ----SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQVTPILPHDYQDSSLPVGV 261
Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG---VHAVLLHHRTSH 281
F + + N G + D+++ F+ N +GG+ + G +N ++ G V +LLHH T
Sbjct: 262 FVWDVENEGDETLDVSITFSMRNGLGGEDDAAGSLWNEPFRLEQGGTTVQGLLLHHPTPP 321
Query: 282 QLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVT 341
P T A+AA+ T V+ F +G T + +W ++ + G D + S
Sbjct: 322 N--PYTMAVAARCTADTTVTHTTAFDPNG-----TGQQVWQDLLQDGQLD--SPAGQSTP 372
Query: 342 SEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAA 400
++ G I A+ S + P S + FSLAWD P++ F + +YRRYT+F+G+ + A
Sbjct: 373 TQKGEGIAGAVCVSSKLLPRSRCCLEFSLAWDMPKIMFGAKSQVHYRRYTRFFGSDGDVA 432
Query: 401 ANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPV 460
++ A+ + WE +I AWQ P+L+D+ LP WY LFNELY+L GG VW
Sbjct: 433 PALSHYALCHYADWEDRISAWQNPVLDDRTLPAWYKSALFNELYFLADGGTVW------- 485
Query: 461 HSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTN 520
++VP A ++ E + + Q+ +
Sbjct: 486 -----------------------LEVP-----ADSLPEGLGGSMRQL-----------RS 506
Query: 521 LLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
LQD G+F YLEG EY M+NTYDVHFY+SFAL+ML+PK++LS+Q D A A + D
Sbjct: 507 TLQD----YGRFGYLEGQEYRMYNTYDVHFYASFALVMLWPKLELSLQYDMALATLKEDL 562
Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRD 638
++ + L G K +PHDIG D PW VNAY ++DTA WKDLN KFVLQ+YRD
Sbjct: 563 TRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRD 622
Query: 639 VVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TGD+ F + +WP M +FD+D DG+IEN G+ DQTYD W +G SAY GGL
Sbjct: 623 YYLTGDQGFLEDMWPVCLAVMESEMKFDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGL 682
Query: 699 WVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQA 757
W+AA+ +A G + ++ F + + YE+ LWNG Y+NYD+S QS SI +
Sbjct: 683 WLAAVAVMVQMAVLCGAQDVQERFASILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMS 742
Query: 758 DQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVD 813
DQ AGQW+ RACGL + AL+ ++ NV GG GAVNGM P G D
Sbjct: 743 DQCAGQWFLRACGLGEGDTEVFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPD 802
Query: 814 MSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTD 873
SS+QS E+W GV Y +AA+MI E L GF+TA G Y W GL AFQTPEA+
Sbjct: 803 RSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWERLGL--AFQTPEAYCQQ 860
Query: 874 DQYRSLCYMRPLAIW 888
+RSL YMRPL+IW
Sbjct: 861 QVFRSLAYMRPLSIW 875
>gi|26354314|dbj|BAC40785.1| unnamed protein product [Mus musculus]
Length = 918
Score = 565 bits (1455), Expect = e-158, Method: Compositional matrix adjust.
Identities = 335/855 (39%), Positives = 471/855 (55%), Gaps = 82/855 (9%)
Query: 46 PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
P ++ V+ +G+R L R+ + K FID L +G PLGG+G
Sbjct: 91 PFQANNISLSNLVKHLGMGLRYLKWWYRKTHVEKKTPFIDMLNSLPLRQIYGCPLGGIGG 150
Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
G+I R +RG+F RWQ+ P + + + V+A+QF V + R Y VL + P VL+
Sbjct: 151 GTITRGWRGQFCRWQLNPGMYQHQTVIADQFIVCLRRDGRTVYQQVLSLELPNVLR---- 206
Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
SW+W L G + YHALYPRAWTV++ P + + CRQ++PI+PH+Y++SS PV V
Sbjct: 207 ----SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQVTPILPHDYQDSSLPVGV 261
Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG---VHAVLLHHRTSH 281
F + + N G + D+++ F+ N +GG+ + G +N ++ G V +LLHH T
Sbjct: 262 FVWDVENEGDETLDVSITFSMRNGLGGEDDAAGSLWNEPFRLEQGGTTVQGLLLHHPTPP 321
Query: 282 QLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVT 341
P T A+AA+ T V+ F +G T + +W ++ + G D + S
Sbjct: 322 N--PYTMAVAARCTADTTVTHTTAFDPNG-----TGQQVWQDLLQDGQLD--SPAGQSTP 372
Query: 342 SEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAA 400
++ G I A+ S + P S + FSLAWD P++ F + +YRRYT+F+G+ + A
Sbjct: 373 TQKGEGIAGAVCISSKLLPRSRCCLEFSLAWDMPKIMFGAKSQVHYRRYTRFFGSDGDVA 432
Query: 401 ANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPV 460
++ A+ + WE +I AWQ P+L+D+ LP WY LFNELY+L GG VW
Sbjct: 433 PALSHYALCHYADWEDRISAWQNPVLDDRTLPAWYKSALFNELYFLADGGTVW------- 485
Query: 461 HSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTN 520
++VP A ++ E + + Q+ +
Sbjct: 486 -----------------------LEVP-----ADSLPEGLGGSMRQL-----------RS 506
Query: 521 LLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
LQD G+F YLEG EY M+NTYDVHFY+SFAL+ML+PK++LS+Q D A A + D
Sbjct: 507 TLQD----YGRFGYLEGQEYRMYNTYDVHFYASFALVMLWPKLELSLQYDMALATLKEDL 562
Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRD 638
++ + L G K +PHDIG D PW VNAY ++DTA WKDLN KFVLQ+YRD
Sbjct: 563 TRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRD 622
Query: 639 VVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TGD+ F + +WP M +FD+D DG+IEN G+ DQTYD W +G SAY GGL
Sbjct: 623 YYLTGDQGFLEDMWPVCLAVMESEMKFDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGL 682
Query: 699 WVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQA 757
W+AA+ +A G + ++ F + + YE+ LWNG Y+NYD+S QS SI +
Sbjct: 683 WLAAVAVMVQMAVLCGAQDVQERFASILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMS 742
Query: 758 DQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVD 813
DQ AGQW+ RACGL + AL+ ++ NV GG GAVNGM P G D
Sbjct: 743 DQCAGQWFLRACGLGEGDTEVFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPD 802
Query: 814 MSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTD 873
SS+QS E+W GV Y +AA+MI E L GF+TA G Y W GL AFQTPEA+
Sbjct: 803 RSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWERLGL--AFQTPEAYCQQ 860
Query: 874 DQYRSLCYMRPLAIW 888
+RSL YMRPL+IW
Sbjct: 861 QVFRSLAYMRPLSIW 875
>gi|395515264|ref|XP_003761826.1| PREDICTED: non-lysosomal glucosylceramidase [Sarcophilus harrisii]
Length = 954
Score = 563 bits (1451), Expect = e-157, Method: Compositional matrix adjust.
Identities = 336/863 (38%), Positives = 472/863 (54%), Gaps = 82/863 (9%)
Query: 46 PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
P + ++ T++ +G R L ++ + K FID L +G PLGG+G
Sbjct: 109 PFNANNVSLSNTLKHLGMGFRYLKWWYQKTQVEKKTPFIDMLNCLPLRQIYGCPLGGIGG 168
Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
G+I R +RG+F RWQ+ P + + + V+A+QF+V + R Y VL + P VL+
Sbjct: 169 GTITRGWRGQFCRWQLNPGIYQHRTVMADQFTVCLRREGQTVYQQVLSLERPNVLR---- 224
Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
SW+W L G + YHALYPRAWTV++ P ++ + C QI+PI+PH+Y++SS PV V
Sbjct: 225 ----SWNWGLCGFFAFYHALYPRAWTVYQ-LPGQQVTLTCLQITPILPHDYQDSSLPVGV 279
Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSH 281
F + + N+G +++++F+ N +GG + G +N + DG V VLLHH
Sbjct: 280 FVWNVENTGDEDLEVSIMFSMRNGLGGGHDNLGGLWNEPFCLERDGETVQGVLLHHPNPP 339
Query: 282 QLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVT 341
P T ALAA+ T V+ F G T + +W ++ G D + + S
Sbjct: 340 N--PYTMALAARRTANTTVTHITAFDPDG-----TGQKVWQDLLLDGQLD--SPLGPSPP 390
Query: 342 SEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAA 400
++ IG A+ A +P Q+ F+L+WD P+V F G+ +YRRYT+F+G NAA
Sbjct: 391 TQKRQGIGGAVCAGAHLPRRGRCQLEFALSWDMPKVLFGAKGQAHYRRYTRFFGQEGNAA 450
Query: 401 ANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPV 460
+ A+ ++ SWE +I AWQ P+LED+ LP WY LFNELY+L GG VW +
Sbjct: 451 PALCHYALTQYRSWEEKIIAWQSPVLEDRTLPAWYKSALFNELYFLADGGTVW------L 504
Query: 461 HSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTN 520
+FS +L ++ L++ +G
Sbjct: 505 EVTEETITEEFS----------------------GMLNQLRPTLQE----------YGRF 532
Query: 521 LLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
+G+E Y M+NTYDVHFY+SFALIML+PK++LS+Q D A A + D
Sbjct: 533 AYLEGQE------------YHMYNTYDVHFYASFALIMLWPKLELSLQYDMAVATLSEDL 580
Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRD 638
++ K L G K +PHD+G D PW +NAY ++DTA WKDLN KFVLQ+YRD
Sbjct: 581 TRRKYLMSGVIAPVKKKNVIPHDVGDPDDEPWLRINAYLIHDTADWKDLNLKFVLQIYRD 640
Query: 639 VVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TG++ F + +WP M +FD+D DG+IEN G+ DQTYD W +G SAY GGL
Sbjct: 641 FYLTGNESFLRDMWPVCQAVMESEMKFDKDQDGLIENGGYADQTYDGWITTGPSAYCGGL 700
Query: 699 WVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQA 757
W+AA+ +A G + ++ FL + + YE+ LWNG Y+NYD+S QS I +
Sbjct: 701 WLAAVAVMVQMAAVCGAQDVQEKFLSILSRGREAYERLLWNGRYYNYDSSSQPQSRIIMS 760
Query: 758 DQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVD 813
DQ AGQW+ RACGL + AL+ ++ NV GG GAVNGM PDG D
Sbjct: 761 DQCAGQWFLRACGLGKGDTEVFPSAHVIRALQTIFELNVQGFAGGAMGAVNGMHPDGVPD 820
Query: 814 MSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTD 873
SS+QS E+W GV Y +AA+MI E L GF TA G Y W LG AFQTPEA+
Sbjct: 821 TSSVQSDEVWVGVVYGLAATMIQEGLVQEGFLTAEGCYRTVWE--RLGMAFQTPEAYCQR 878
Query: 874 DQYRSLCYMRPLAIWAMQWALTR 896
+RSL YMRPL+IWAMQ AL +
Sbjct: 879 RVFRSLAYMRPLSIWAMQLALQQ 901
>gi|24308251|ref|NP_065995.1| non-lysosomal glucosylceramidase [Homo sapiens]
gi|143018392|sp|Q9HCG7.2|GBA2_HUMAN RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
AltName: Full=Beta-glucocerebrosidase 2;
Short=Beta-glucosidase 2; AltName:
Full=Glucosylceramidase 2
gi|15030210|gb|AAH11363.1| Glucosidase, beta (bile acid) 2 [Homo sapiens]
gi|16215453|emb|CAC83792.1| bile acid beta-glucosidase [Homo sapiens]
gi|21739906|emb|CAD38976.1| hypothetical protein [Homo sapiens]
gi|119578752|gb|EAW58348.1| glucosidase, beta (bile acid) 2, isoform CRA_d [Homo sapiens]
gi|119578753|gb|EAW58349.1| glucosidase, beta (bile acid) 2, isoform CRA_d [Homo sapiens]
Length = 927
Score = 561 bits (1446), Expect = e-157, Method: Compositional matrix adjust.
Identities = 333/831 (40%), Positives = 459/831 (55%), Gaps = 87/831 (10%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 127 RKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
A+QF+V + R Y VL + P VL+ SW+W L G + YHALYPRAWTV
Sbjct: 187 ADQFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 238
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+ N +GG
Sbjct: 239 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 297
Query: 252 DSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
+ G +N + + V +LLHH T LP P T A+AA+ T V+
Sbjct: 298 GDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTAATTVTH----- 349
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
I+ T + +W ++ + G D + S ++ G I A+ S + P + ++
Sbjct: 350 ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLRPRGQCRLE 407
Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
FSLAWD P + F G+ +YRRYT+F+G +AA ++ A+ + WE +I AWQ P+L
Sbjct: 408 FSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISAWQSPVL 467
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
+D+ LP WY LFNELY+L GG VW +
Sbjct: 468 DDRSLPAWYKSALFNELYFLADGGTVWLE------------------------------- 496
Query: 487 PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWN 544
+LE +L G N+ L+ + G+F YLEG EY M+N
Sbjct: 497 ----------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYN 535
Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
TYDVHFY+SFALIML+PK++LS+Q D A A + D ++ + L G K +PHDI
Sbjct: 536 TYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDI 595
Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
G D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP M
Sbjct: 596 GDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESE 655
Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
+FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +D F
Sbjct: 656 MKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKF 715
Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
+ + YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ +ACGL +
Sbjct: 716 SSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPT 775
Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
AL+ ++ NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E
Sbjct: 776 QHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 835
Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
L GFQTA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 836 GLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|432110769|gb|ELK34246.1| Non-lysosomal glucosylceramidase [Myotis davidii]
Length = 868
Score = 561 bits (1446), Expect = e-157, Method: Compositional matrix adjust.
Identities = 328/832 (39%), Positives = 448/832 (53%), Gaps = 109/832 (13%)
Query: 79 KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
K+ FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+A+QF+V
Sbjct: 126 KKPFIDLINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVIADQFTVC 185
Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
+ R Y VL + P VL+ SW+W L G + YHALYPRAWTV++ P
Sbjct: 186 LRREGQTVYQQVLSVERPSVLR--------SWNWGLCGFFAFYHALYPRAWTVYQ-LPGQ 236
Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
+ + CRQI+P++PH+Y++SS PV VF + + N G + D++++F+ N +GG + +G
Sbjct: 237 NVTLTCRQITPVLPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGGGDDASGG 296
Query: 259 HYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAK 318
+N L H +H + A + DG T +
Sbjct: 297 LWNEPF--------CLEHTTVTH--------ITAFDPDG------------------TGQ 322
Query: 319 DMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVN 378
+W ++ + G D + S S+ G I A+ + +PP ++ FSLAWD P +
Sbjct: 323 QVWQDLLQDGQLD--SPAGRSTLSQKGVGIAGAVCVTSKLPPRGRCRMEFSLAWDMPRIM 380
Query: 379 F-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPI 437
F G+ YYRRYT+F+G+ +AA ++ A+ + WE +I AWQ P+L+D+ LP WY
Sbjct: 381 FGAQGQVYYRRYTRFFGSDGDAAPALSHYALCRYADWEERISAWQSPVLDDRSLPAWYKS 440
Query: 438 TLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNIL 497
LFNELY+L GG VW ++VP
Sbjct: 441 ALFNELYFLADGGTVW------------------------------LEVPED-------- 462
Query: 498 ERMSSILEQIYTPVALNSSFGTNLLQDGE--ENIGQFLYLEGIEYLMWNTYDVHFYSSFA 555
+L G N+ Q G+F YLEG EY M+NTYDVHFY+SFA
Sbjct: 463 --------------SLPEELGGNMCQLRPLLTEYGRFGYLEGQEYRMYNTYDVHFYASFA 508
Query: 556 LIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEV 613
LIML+PK++LS+Q D A A + D + + L G K +PHDIG D PW V
Sbjct: 509 LIMLWPKLELSLQYDMALATLGEDLTWRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRV 568
Query: 614 NAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMI 673
NAY ++DTA WKDLN KFVLQVYRD TGD+ F + +WP M +FD+D DG+I
Sbjct: 569 NAYVIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMKFDKDQDGLI 628
Query: 674 ENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY 733
EN G+ DQTYD W +G SAY GGLW+AA+ +A G + ++ F + + Y
Sbjct: 629 ENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAALCGTQDIQEKFSSILSRGQQAY 688
Query: 734 EK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKARSALKKVY 788
E+ LWNG Y+NYD S QS SI +DQ AGQW+ RA GL + AL+ ++
Sbjct: 689 ERLLWNGRYYNYDCSSRPQSCSIMSDQCAGQWFLRASGLGEGETEVFPTQHVVRALQTIF 748
Query: 789 NYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTAC 848
+NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E L GFQTA
Sbjct: 749 EFNVQAFAGGTMGAVNGMQPQGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAE 808
Query: 849 GIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPK 900
G Y W GLG FQTPEA+ +RSL YMRPL+IWAMQ AL + K
Sbjct: 809 GCYRTVWERLGLG--FQTPEAYCQKRVFRSLAYMRPLSIWAMQLALQQQHKK 858
>gi|397519481|ref|XP_003829887.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Pan
paniscus]
Length = 927
Score = 561 bits (1445), Expect = e-157, Method: Compositional matrix adjust.
Identities = 334/831 (40%), Positives = 458/831 (55%), Gaps = 87/831 (10%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 127 RKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
A+QF+V + R Y VL + P VL+ SW+W L G + YHALYPRAWTV
Sbjct: 187 ADQFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 238
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+ N +GG
Sbjct: 239 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 297
Query: 252 DSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
+ G +N + + V +LLHH T LP P T A+AA+ T V+
Sbjct: 298 GDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTAATTVTH----- 349
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
I+ T + +W ++ + G D + S ++ G I A+ S + P + ++
Sbjct: 350 ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLRPRGQCRLE 407
Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
FSLAWD P + F G+ +YRRYT+F+G +AA ++ A+ + WE +I AWQ P+L
Sbjct: 408 FSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYTEWEERISAWQSPVL 467
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
+D+ LP WY LFNELY+L GG VW +
Sbjct: 468 DDRSLPAWYKSALFNELYFLADGGTVWLE------------------------------- 496
Query: 487 PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWN 544
+LE +L G N+ L + G+F YLEG EY M+N
Sbjct: 497 ----------------VLED-----SLPEELGRNMCHLHPTLRDYGRFGYLEGQEYRMYN 535
Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
TYDVHFY+SFALIML+PK++LS+Q D A A + D ++ + L G K +PHDI
Sbjct: 536 TYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDI 595
Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
G D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP M
Sbjct: 596 GDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESE 655
Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
+FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +D F
Sbjct: 656 MKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKF 715
Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
+ + YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ +ACGL +
Sbjct: 716 SSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPT 775
Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
AL+ V+ NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E
Sbjct: 776 QHVVRALQTVFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 835
Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
L GFQTA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 836 GLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|14042883|dbj|BAB55430.1| unnamed protein product [Homo sapiens]
Length = 927
Score = 561 bits (1445), Expect = e-157, Method: Compositional matrix adjust.
Identities = 332/831 (39%), Positives = 459/831 (55%), Gaps = 87/831 (10%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 127 RKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
A+QF+V + R Y VL + P VL+ SW+W L+G + YHALYPRAWTV
Sbjct: 187 ADQFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLRGYFAFYHALYPRAWTV 238
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+ N +GG
Sbjct: 239 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 297
Query: 252 DSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
+ G +N + + V +LLHH T LP P T A+AA+ T V+
Sbjct: 298 GDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTAATTVTH----- 349
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
I+ T + +W ++ + G D + S ++ G I A+ S + P + ++
Sbjct: 350 ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLRPRGQCRLE 407
Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
FSLAWD P + F G+ +YRRYT+F+G +AA ++ A+ + WE +I AWQ P+L
Sbjct: 408 FSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISAWQSPVL 467
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
+D+ LP WY LF ELY+L GG VW +
Sbjct: 468 DDRSLPAWYKSALFKELYFLADGGTVWLE------------------------------- 496
Query: 487 PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWN 544
+LE +L G N+ L+ + G+F YLEG EY M+N
Sbjct: 497 ----------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYN 535
Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
TYDVHFY+SFALIML+PK++LS+Q D A A + D ++ + L G K +PHDI
Sbjct: 536 TYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDI 595
Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
G D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP M
Sbjct: 596 GDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESE 655
Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
+FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +D F
Sbjct: 656 MKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKF 715
Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
+ + YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ +ACGL +
Sbjct: 716 SSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPT 775
Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
AL+ ++ NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E
Sbjct: 776 QHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 835
Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
L GFQTA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 836 GLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|55633173|ref|XP_520565.1| PREDICTED: non-lysosomal glucosylceramidase isoform 6 [Pan
troglodytes]
gi|410227830|gb|JAA11134.1| talin 1 [Pan troglodytes]
Length = 927
Score = 560 bits (1444), Expect = e-156, Method: Compositional matrix adjust.
Identities = 333/831 (40%), Positives = 458/831 (55%), Gaps = 87/831 (10%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 127 RKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
A+QF+V + R Y VL + P VL+ SW+W L G + YHALYPRAWTV
Sbjct: 187 ADQFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 238
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+ N +GG
Sbjct: 239 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 297
Query: 252 DSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
+ G +N + + V +LLHH T LP P T A+AA+ T V+
Sbjct: 298 GDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTAATTVTH----- 349
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
I+ T + +W ++ + G D + S + G I A+ S + P + ++
Sbjct: 350 ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTHKGVGIAGAVCVSSKLRPRGQCRLE 407
Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
FSLAWD P + F G+ +YRRYT+F+G +AA ++ A+ + WE +I AWQ P+L
Sbjct: 408 FSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYTEWEERISAWQSPVL 467
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
+D+ LP WY LFNELY+L GG VW +
Sbjct: 468 DDRSLPAWYKSALFNELYFLADGGTVWLE------------------------------- 496
Query: 487 PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWN 544
+LE +L G N+ L+ + G+F YLEG EY M+N
Sbjct: 497 ----------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYN 535
Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
TYDVHFY+SFALIML+PK++LS+Q D A A + D ++ + L G K +PHDI
Sbjct: 536 TYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDI 595
Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
G D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP M
Sbjct: 596 GDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESE 655
Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
+FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +D F
Sbjct: 656 MKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKF 715
Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
+ + YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ +ACGL +
Sbjct: 716 SSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPT 775
Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
AL+ ++ NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E
Sbjct: 776 QHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 835
Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
L GFQTA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 836 GLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|413920604|gb|AFW60536.1| hypothetical protein ZEAMMB73_659971 [Zea mays]
Length = 459
Score = 558 bits (1438), Expect = e-156, Method: Compositional matrix adjust.
Identities = 275/442 (62%), Positives = 333/442 (75%), Gaps = 6/442 (1%)
Query: 19 LNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKG 78
+N +KV P + LTW+ KL+ L F L W+E +QLA IG+R+ I EE +KG
Sbjct: 22 MNGDEEKVYPGQLPVLTWEHKLTHVRHDLPPFRLTWREIMQLAGIGLRLSRHILEETSKG 81
Query: 79 KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
+ A IDP KR S GVPLGG SIGRSY+G+FQRWQ+FP CED VLANQFS F
Sbjct: 82 RIAVIDPMKKRAARSGQGVPLGG----SIGRSYKGDFQRWQLFPGSCEDNAVLANQFSAF 137
Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
+SR +G+KYS+VL P P++ K + +GIGSWDWN+ G +STYHALYPRAWTV++GEPDP
Sbjct: 138 ISRQDGKKYSTVLHPGKPDLPKGSDISGIGSWDWNMSGQQSTYHALYPRAWTVYDGEPDP 197
Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
EL IVCRQISPIIPHNY++SSYPV+VFT+T+ NSG T+AD+TLLFTW NSVGG SE TG
Sbjct: 198 ELNIVCRQISPIIPHNYQQSSYPVAVFTFTVTNSGHTAADVTLLFTWANSVGGKSELTGY 257
Query: 259 HYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG--LT 316
H NS DGVH +LLHHRT+ P VTF +AAQE + + +S CP+FVISG+S T
Sbjct: 258 HSNSSMIEKDGVHGILLHHRTADGQPHVTFVIAAQEKEDILISECPYFVISGSSASDEFT 317
Query: 317 AKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPE 376
AKDMW+ +KEHGSFD L+ ++TS+ S PGSSIGAAIAASV + P + ++FSLAW CPE
Sbjct: 318 AKDMWNSVKEHGSFDHLDPIKTSMCSRPGSSIGAAIAASVKLAPKATQDISFSLAWACPE 377
Query: 377 VNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYP 436
V F SGKTY+RRYTKFYGT +AAA +A DAILEH SWE QIE WQ PIL+D+R P WYP
Sbjct: 378 VKFSSGKTYHRRYTKFYGTDGDAAAALAHDAILEHASWERQIEEWQDPILQDERFPAWYP 437
Query: 437 ITLFNELYYLNAGGAVWTDGSP 458
+TLFNELYYLNAGG +WT P
Sbjct: 438 VTLFNELYYLNAGGTIWTGMHP 459
>gi|390458158|ref|XP_002743170.2| PREDICTED: non-lysosomal glucosylceramidase [Callithrix jacchus]
Length = 1065
Score = 557 bits (1435), Expect = e-155, Method: Compositional matrix adjust.
Identities = 340/858 (39%), Positives = 473/858 (55%), Gaps = 86/858 (10%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K FID L +G PLGG+G G+I R +RG+F RWQ+ P + + V+
Sbjct: 269 RKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYRHQTVI 328
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
A+QF+V + R Y VL + P VL+ SW+W L G + YHALYPRAWTV
Sbjct: 329 ADQFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 380
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P + + CRQI+PI+PH+Y++SS PV VF + + N G D++++F+ N +GG
Sbjct: 381 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEDLDVSIMFSMRNGLGG 439
Query: 252 DSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
+ G +N + DG V +LLHH T LP P T A+AA+ T V+ F
Sbjct: 440 GDDAPGGLWNEPFHLERDGKTVQGLLLHHPT---LPNPYTMAVAARVTADTTVTHITAFD 496
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
G T + +W ++ + G D + S ++ G + A+ S + P + ++
Sbjct: 497 PDG-----TGQQVWQDLLQDGQLD--SPAGQSTPTQKGVGVAGAVCVSSKLRPRGQCRLE 549
Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
FSLAWD P + F G+ +YRRYT+F+G +AA ++ A+ ++ WE +I AWQ P+L
Sbjct: 550 FSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCQYTEWEERISAWQNPVL 609
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
+D+ LP WY LFNELY+L GG VW + R +
Sbjct: 610 DDRLLPAWYKSALFNELYFLADGGTVWLE-------------------------VREDSL 644
Query: 487 PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTY 546
P + D N+ ++ + +G +G+E Y M+NTY
Sbjct: 645 PQELDG--NMCHLRPTLRD-----------YGRFGYLEGQE------------YRMYNTY 679
Query: 547 DVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGI 606
DVHFY+SFALIML+PK++LS+Q D A A + D ++ + L G K +PHDIG
Sbjct: 680 DVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGD 739
Query: 607 CD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP M +
Sbjct: 740 PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 799
Query: 665 FDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLF 724
FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +D F
Sbjct: 800 FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAAVCGAQDIQDKFSS 859
Query: 725 KFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDK 779
+ + YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ +ACGL +
Sbjct: 860 ILSRGQEAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTPH 919
Query: 780 ARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDL 839
AL+ ++ NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E L
Sbjct: 920 VVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGL 979
Query: 840 ADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
GFQTA G Y W GL AFQTPEA+ +RSL YMRPL+IWAMQ AL + +
Sbjct: 980 TWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIWAMQLALQQQQH 1037
Query: 900 KTLEKQMKPEVTDESLLR 917
K K +P+V ++ LR
Sbjct: 1038 K---KASRPKVKQDTGLR 1052
>gi|221041110|dbj|BAH12232.1| unnamed protein product [Homo sapiens]
Length = 933
Score = 555 bits (1431), Expect = e-155, Method: Compositional matrix adjust.
Identities = 333/837 (39%), Positives = 457/837 (54%), Gaps = 93/837 (11%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFP------RVC 125
R+ + K FID L +G PLGG+G G+I R +RG+F RWQ+ P V
Sbjct: 127 RKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186
Query: 126 EDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALY 185
D+P+ +F+V + R Y VL + P VL+ SW+W L G + YHALY
Sbjct: 187 ADQPICPLKFTVCLRRERQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHALY 238
Query: 186 PRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTW 245
PRAWTV++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+
Sbjct: 239 PRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSM 297
Query: 246 TNSVGGDSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVHVS 301
N +GG + G +N + + V +LLHH T LP P T A+AA+ T V+
Sbjct: 298 RNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTAATTVT 354
Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
I+ T + +W ++ + G D + S ++ G I A+ S + P
Sbjct: 355 H-----ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLRPR 407
Query: 362 SEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
+ ++ FSLAWD P + F G+ +YRRYT+F+G +AA ++ A+ + WE +I A
Sbjct: 408 GQCRLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISA 467
Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDL 480
WQ P+L+D+ LP WY LFNELY+L GG VW +
Sbjct: 468 WQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE------------------------- 502
Query: 481 KRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGI 538
+LE +L G N+ L+ + G+F YLEG
Sbjct: 503 ----------------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQ 535
Query: 539 EYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLG 598
EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A + D ++ + L G K
Sbjct: 536 EYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRN 595
Query: 599 AVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVY 656
+PHDIG D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP
Sbjct: 596 VIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCL 655
Query: 657 VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR 716
M +FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G +
Sbjct: 656 AVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQ 715
Query: 717 GSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL---- 771
+D F + + YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ +ACGL
Sbjct: 716 DIQDKFSSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGD 775
Query: 772 LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVA 831
+ AL+ ++ NV GG GAVNGM P G D SS+QS E+W GV Y +A
Sbjct: 776 TEVFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLA 835
Query: 832 ASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
A+MI E L GFQTA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 836 ATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 890
>gi|332831849|ref|XP_001167923.2| PREDICTED: non-lysosomal glucosylceramidase isoform 4 [Pan
troglodytes]
Length = 933
Score = 555 bits (1429), Expect = e-155, Method: Compositional matrix adjust.
Identities = 333/837 (39%), Positives = 457/837 (54%), Gaps = 93/837 (11%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFP------RVC 125
R+ + K FID L +G PLGG+G G+I R +RG+F RWQ+ P V
Sbjct: 127 RKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186
Query: 126 EDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALY 185
D+P+ + +F+V + R Y VL + P VL+ SW+W L G + YHALY
Sbjct: 187 ADQPICSLKFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHALY 238
Query: 186 PRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTW 245
PRAWTV++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+
Sbjct: 239 PRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSM 297
Query: 246 TNSVGGDSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVHVS 301
N +GG + G +N + + V +LLHH T LP P T A+AA+ T V+
Sbjct: 298 RNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTAATTVT 354
Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
I+ T + +W ++ + G D + S + G I A+ S + P
Sbjct: 355 H-----ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTHKGVGIAGAVCVSSKLRPR 407
Query: 362 SEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
+ ++ FSLAWD P + F G+ +YRRYT+F+G +AA ++ A+ + WE +I A
Sbjct: 408 GQCRLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYTEWEERISA 467
Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDL 480
WQ P+L+D+ LP WY LFNELY+L GG VW +
Sbjct: 468 WQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE------------------------- 502
Query: 481 KRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGI 538
+LE +L G N+ L+ + G+F YLEG
Sbjct: 503 ----------------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQ 535
Query: 539 EYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLG 598
EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A + D ++ + L G K
Sbjct: 536 EYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRN 595
Query: 599 AVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVY 656
+PHDIG D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP
Sbjct: 596 VIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCL 655
Query: 657 VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR 716
M +FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G +
Sbjct: 656 AVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQ 715
Query: 717 GSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL---- 771
+D F + + YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ +ACGL
Sbjct: 716 DIQDKFSSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGD 775
Query: 772 LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVA 831
+ AL+ ++ NV GG GAVNGM P G D SS+QS E+W GV Y +A
Sbjct: 776 TEVFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLA 835
Query: 832 ASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
A+MI E L GFQTA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 836 ATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 890
>gi|397519483|ref|XP_003829888.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Pan
paniscus]
Length = 755
Score = 536 bits (1382), Expect = e-149, Method: Compositional matrix adjust.
Identities = 312/779 (40%), Positives = 427/779 (54%), Gaps = 87/779 (11%)
Query: 124 VCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHA 183
V D+P+ + +F+V + R Y VL + P VL+ SW+W L G + YHA
Sbjct: 7 VIADQPICSLKFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHA 58
Query: 184 LYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLF 243
LYPRAWTV++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F
Sbjct: 59 LYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMF 117
Query: 244 TWTNSVGGDSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVH 299
+ N +GG + G +N + + V +LLHH T LP P T A+AA+ T
Sbjct: 118 SMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTAATT 174
Query: 300 VSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVP 359
V+ I+ T + +W ++ + G D + S ++ G I A+ S +
Sbjct: 175 VTH-----ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLR 227
Query: 360 PDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQI 418
P + ++ FSLAWD P + F G+ +YRRYT+F+G +AA ++ A+ + WE +I
Sbjct: 228 PRGQCRLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYTEWEERI 287
Query: 419 EAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQS 478
AWQ P+L+D+ LP WY LFNELY+L GG VW +
Sbjct: 288 SAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE----------------------- 324
Query: 479 DLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLE 536
+LE +L G N+ L + G+F YLE
Sbjct: 325 ------------------------VLED-----SLPEELGRNMCHLHPTLRDYGRFGYLE 355
Query: 537 GIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKV 596
G EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A + D ++ + L G K
Sbjct: 356 GQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKR 415
Query: 597 LGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
+PHDIG D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP
Sbjct: 416 RNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPV 475
Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
M +FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G
Sbjct: 476 CLAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCG 535
Query: 715 DRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL-- 771
+ +D F + + YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ +ACGL
Sbjct: 536 AQDIQDKFSSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGE 595
Query: 772 --LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYA 829
+ AL+ V+ NV GG GAVNGM P G D SS+QS E+W GV Y
Sbjct: 596 GDTEVFPTQHVVRALQTVFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYG 655
Query: 830 VAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
+AA+MI E L GFQTA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 656 LAATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 712
>gi|194386992|dbj|BAG59862.1| unnamed protein product [Homo sapiens]
Length = 755
Score = 536 bits (1380), Expect = e-149, Method: Compositional matrix adjust.
Identities = 311/779 (39%), Positives = 427/779 (54%), Gaps = 87/779 (11%)
Query: 124 VCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHA 183
V D+P+ +F+V + R Y VL + P VL+ SW+W L G + YHA
Sbjct: 7 VIADQPICPLKFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHA 58
Query: 184 LYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLF 243
LYPRAWTV++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F
Sbjct: 59 LYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMF 117
Query: 244 TWTNSVGGDSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVH 299
+ N +GG + G +N + + V +LLHH T LP P T A+AA+ T
Sbjct: 118 SMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPHTMAVAARVTAATT 174
Query: 300 VSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVP 359
V+ I+ T + +W ++ + G D + S ++ G I A+ S +
Sbjct: 175 VTH-----ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLR 227
Query: 360 PDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQI 418
P + ++ FSLAWD P + F G+ +YRRYT+F+G +AA ++ A+ + WE +I
Sbjct: 228 PRGQCRLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERI 287
Query: 419 EAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQS 478
AWQ P+L+D+ LP WY LFNELY+L GG VW +
Sbjct: 288 SAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE----------------------- 324
Query: 479 DLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLE 536
+LE +L G N+ L+ + G+F YLE
Sbjct: 325 ------------------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLE 355
Query: 537 GIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKV 596
G EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A + D ++ + L G K
Sbjct: 356 GQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKR 415
Query: 597 LGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
+PHDIG D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP
Sbjct: 416 RNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPV 475
Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
M +FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G
Sbjct: 476 CLAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCG 535
Query: 715 DRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL-- 771
+ +D F + + YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ +ACGL
Sbjct: 536 AQDIQDKFSSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGE 595
Query: 772 --LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYA 829
+ AL+ ++ NV GG GAVNGM P G D SS+QS E+W GV Y
Sbjct: 596 GDTEVFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYG 655
Query: 830 VAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
+AA+MI E L GFQTA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 656 LAATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 712
>gi|440799031|gb|ELR20092.1| glucosylceramidase [Acanthamoeba castellanii str. Neff]
Length = 860
Score = 530 bits (1364), Expect = e-147, Method: Compositional matrix adjust.
Identities = 359/905 (39%), Positives = 488/905 (53%), Gaps = 138/905 (15%)
Query: 36 WQRKLSTGEIPLSQF---TLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRH-- 90
WQR+LS E P Q T T++ P+G R+ REE +G R DP +
Sbjct: 36 WQRRLS--EQPTRQLRPLTAPLLHTLRNLPLGFRLWRQAREE--RGGR---DPVLDASGG 88
Query: 91 LTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL-ANQFSVFVSRSNGQKYSS 149
SS G+P GG+G G +GRS+RG+F+ I P PV+ ANQFSVF+ ++
Sbjct: 89 QASSSGLPAGGMGGGCVGRSWRGDFEPGLILPNGRGQAPVVPANQFSVFIEPDGEDSEAT 148
Query: 150 VLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISP 209
VL P P + +G+WDW+ G++S Y AL+PRAWTV++ EP+P LR+ RQISP
Sbjct: 149 VLNPGIPPTDR------LGAWDWSFVGEESYYSALFPRAWTVYD-EPEPRLRLTSRQISP 201
Query: 210 IIPHNYKESSYPVSVFTYTIYN-SGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMND 268
+I +NYKESS P VF +T+ N + +L+FTW N G S+ G H N+
Sbjct: 202 VIGNNYKESSLPACVFVWTLENLDQEDEITASLMFTWQNGTGAASDALGGHANTPFTRPH 261
Query: 269 GVHAVLLHH---------RTSHQLPPVTFALAAQETDG-VHVSLCPHFVISGNSLGLTAK 318
V+ H +T P++FA+AA D H+++ + NS G
Sbjct: 262 PERVVMDHTQWRVDERGGKTHASSEPLSFAIAASSPDSDAHITV--RSCWATNSDG---A 316
Query: 319 DMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVN 378
++W + K G+ + N S+ G ++GAAIA VT+PP QV F+LAWD P V
Sbjct: 317 EVWKDFKRSGALEE-NWDALGKASQEGQTLGAAIAVRVTIPPGEHRQVEFALAWDHPVVE 375
Query: 379 FMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPIT 438
F +G+ Y RRYT+FY H ++AA +A DA++++ WE IE WQ PIL+DKRLP
Sbjct: 376 FGAGRRYTRRYTRFYSQHGDSAAQLACDALVQYPQWERLIEQWQAPILDDKRLP------ 429
Query: 439 LFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILE 498
GG VW Q +L + P + + E
Sbjct: 430 --------GCGGTVWL----------------------QDELSAKEEKPRREKRETDGQE 459
Query: 499 RMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIM 558
S ++G++ LE E+ ++N D FY+SFALI
Sbjct: 460 EEQS-------------------------DVGRWACLESQEHRLYNASDALFYASFALIA 494
Query: 559 LFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC--DPWFEVNAY 616
+P++QL++QR+ AAA + D + +L G+ KV GAVP+D+G DPW +NAY
Sbjct: 495 NWPQLQLALQREVAAATLREDSRMVDVLATGRQAPNKVRGAVPYDMGTPADDPWVALNAY 554
Query: 617 CLYDTARWK---------------------DLNPKFVLQVYRDVVATGDKKFAKAVWPSV 655
+++T+ WK DLN KFVL VYRD VAT DK+F VWP+V
Sbjct: 555 NIHETSSWKGKLGLLPASHLSPSTELTEAADLNSKFVLGVYRDFVATQDKRFLADVWPAV 614
Query: 656 YVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASAL--AREV 713
A+ ++ QFDR+ DGMIEN+G PDQTY +W +GGLWVAAL AA A+ A ++
Sbjct: 615 DAALRHLQQFDRNKDGMIENEGLPDQTYHSW--------TGGLWVAALSAACAMAQALQL 666
Query: 714 GDRGSEDYFLFKFQKAKVVY-EKLWNGS-YFNYDNSGSSQSSSIQADQLAGQWYARACGL 771
D ++ Q A VY E LWN S YFNYD S S+ SIQADQL GQWYARACGL
Sbjct: 667 ADDSVAQHYRDLHQSAARVYNESLWNPSGYFNYDTSTSTHHDSIQADQLCGQWYARACGL 726
Query: 772 LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVA 831
I + + +ALK V+ N + GKRGAVNGM PDGRVD + + S E W G T+A+A
Sbjct: 727 PSIAPDKRIVTALKTVFANNFNE---GKRGAVNGMRPDGRVDYACLHSSEAWPGTTFALA 783
Query: 832 ASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQ 891
A M+ E L D F TA G+ + + LGY F+TPEAW+ YR+L +MRPLAIWAMQ
Sbjct: 784 ACMMGEGLVDQAFITAAGVCGSIYE--DLGYWFRTPEAWDGSGNYRALGHMRPLAIWAMQ 841
Query: 892 WALTR 896
WA T+
Sbjct: 842 WAWTQ 846
>gi|413920605|gb|AFW60537.1| hypothetical protein ZEAMMB73_659971, partial [Zea mays]
Length = 449
Score = 521 bits (1341), Expect = e-145, Method: Compositional matrix adjust.
Identities = 260/427 (60%), Positives = 317/427 (74%), Gaps = 6/427 (1%)
Query: 19 LNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKG 78
+N +KV P + LTW+ KL+ L F L W+E +QLA IG+R+ I EE +KG
Sbjct: 22 MNGDEEKVYPGQLPVLTWEHKLTHVRHDLPPFRLTWREIMQLAGIGLRLSRHILEETSKG 81
Query: 79 KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
+ A IDP KR S GVPLGG SIGRSY+G+FQRWQ+FP CED VLANQFS F
Sbjct: 82 RIAVIDPMKKRAARSGQGVPLGG----SIGRSYKGDFQRWQLFPGSCEDNAVLANQFSAF 137
Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
+SR +G+KYS+VL P P++ K + +GIGSWDWN+ G +STYHALYPRAWTV++GEPDP
Sbjct: 138 ISRQDGKKYSTVLHPGKPDLPKGSDISGIGSWDWNMSGQQSTYHALYPRAWTVYDGEPDP 197
Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
EL IVCRQISPIIPHNY++SSYPV+VFT+T+ NSG T+AD+TLLFTW NSVGG SE TG
Sbjct: 198 ELNIVCRQISPIIPHNYQQSSYPVAVFTFTVTNSGHTAADVTLLFTWANSVGGKSELTGY 257
Query: 259 HYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG--LT 316
H NS DGVH +LLHHRT+ P VTF +AAQE + + +S CP+FVISG+S T
Sbjct: 258 HSNSSMIEKDGVHGILLHHRTADGQPHVTFVIAAQEKEDILISECPYFVISGSSASDEFT 317
Query: 317 AKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPE 376
AKDMW+ +KEHGSFD L+ ++TS+ S PGSSIGAAIAASV + P + ++FSLAW CPE
Sbjct: 318 AKDMWNSVKEHGSFDHLDPIKTSMCSRPGSSIGAAIAASVKLAPKATQDISFSLAWACPE 377
Query: 377 VNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYP 436
V F SGKTY+RRYTKFYGT +AAA +A DAILEH SWE QIE WQ PIL+D+R P W+
Sbjct: 378 VKFSSGKTYHRRYTKFYGTDGDAAAALAHDAILEHASWERQIEEWQDPILQDERFPAWFV 437
Query: 437 ITLFNEL 443
TL L
Sbjct: 438 PTLIIHL 444
>gi|119578751|gb|EAW58347.1| glucosidase, beta (bile acid) 2, isoform CRA_c [Homo sapiens]
gi|168270622|dbj|BAG10104.1| non-lysosomal glucosylceramidase [synthetic construct]
Length = 877
Score = 514 bits (1323), Expect = e-142, Method: Compositional matrix adjust.
Identities = 306/782 (39%), Positives = 429/782 (54%), Gaps = 85/782 (10%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 127 RKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
A+QF+V + R Y VL + P VL+ SW+W L G + YHALYPRAWTV
Sbjct: 187 ADQFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 238
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+ N +GG
Sbjct: 239 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 297
Query: 252 DSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
+ G +N + + V +LLHH T LP P T A+AA+ T V+
Sbjct: 298 GDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTAATTVTH----- 349
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
I+ T + +W ++ + G D + S ++ G I A+ S + P + ++
Sbjct: 350 ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLRPRGQCRLE 407
Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
FSLAWD P + F G+ +YRRYT+F+G +AA ++ A+ + WE +I AWQ P+L
Sbjct: 408 FSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISAWQSPVL 467
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
+D+ LP WY LFNELY+L GG VW +
Sbjct: 468 DDRSLPAWYKSALFNELYFLADGGTVWLE------------------------------- 496
Query: 487 PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWN 544
+LE +L G N+ L+ + G+F YLEG EY M+N
Sbjct: 497 ----------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYN 535
Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
TYDVHFY+SFALIML+PK++LS+Q D A A + D ++ + L G K +PHDI
Sbjct: 536 TYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDI 595
Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
G D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP M
Sbjct: 596 GDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESE 655
Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
+FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +D F
Sbjct: 656 MKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKF 715
Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
+ + YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ +ACGL +
Sbjct: 716 SSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPT 775
Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
AL+ ++ NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E
Sbjct: 776 QHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 835
Query: 838 DL 839
L
Sbjct: 836 LL 837
>gi|10047285|dbj|BAB13431.1| KIAA1605 protein [Homo sapiens]
Length = 922
Score = 513 bits (1322), Expect = e-142, Method: Compositional matrix adjust.
Identities = 306/782 (39%), Positives = 429/782 (54%), Gaps = 85/782 (10%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 172 RKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 231
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
A+QF+V + R Y VL + P VL+ SW+W L G + YHALYPRAWTV
Sbjct: 232 ADQFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 283
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+ N +GG
Sbjct: 284 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 342
Query: 252 DSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
+ G +N + + V +LLHH T LP P T A+AA+ T V+
Sbjct: 343 GDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTAATTVTH----- 394
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
I+ T + +W ++ + G D + S ++ G I A+ S + P + ++
Sbjct: 395 ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLRPRGQCRLE 452
Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
FSLAWD P + F G+ +YRRYT+F+G +AA ++ A+ + WE +I AWQ P+L
Sbjct: 453 FSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISAWQSPVL 512
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
+D+ LP WY LFNELY+L GG VW +
Sbjct: 513 DDRSLPAWYKSALFNELYFLADGGTVWLE------------------------------- 541
Query: 487 PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWN 544
+LE +L G N+ L+ + G+F YLEG EY M+N
Sbjct: 542 ----------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYN 580
Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
TYDVHFY+SFALIML+PK++LS+Q D A A + D ++ + L G K +PHDI
Sbjct: 581 TYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDI 640
Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
G D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP M
Sbjct: 641 GDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESE 700
Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
+FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +D F
Sbjct: 701 MKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKF 760
Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
+ + YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ +ACGL +
Sbjct: 761 SSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPT 820
Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
AL+ ++ NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E
Sbjct: 821 QHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 880
Query: 838 DL 839
L
Sbjct: 881 LL 882
>gi|403306662|ref|XP_003943842.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Saimiri
boliviensis boliviensis]
Length = 923
Score = 501 bits (1291), Expect = e-139, Method: Compositional matrix adjust.
Identities = 315/832 (37%), Positives = 436/832 (52%), Gaps = 89/832 (10%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 127 RKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
A+QF+V + R Y VL + P VL+ SW+W L G + YHALYPRAWTV
Sbjct: 187 ADQFTVCLRREGQTVYQQVLSLERPNVLR--------SWNWGLCGYFAFYHALYPRAWTV 238
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+ N +GG
Sbjct: 239 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 297
Query: 252 DSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
+ G +N + DG V +LLHH T LP P T A AA+ T V+ F
Sbjct: 298 GDDAPGGLWNEPFHLERDGETVQGLLLHHPT---LPNPYTMAAAARVTADTMVTHITAFD 354
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
G T + +W ++ + G D + S ++ G + A+ S + P + +
Sbjct: 355 PDG-----TGQQVWQDLLQDGQLD--SPAGQSTPTQKGVGVAGAVCVSSKLRPRGQCCLE 407
Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
FSLAWD P + F G+ +YRRYT+F+G +AA ++ A+ + WE +I AWQ P+L
Sbjct: 408 FSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALSRYTEWEERISAWQSPVL 467
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVW---TDGSPPVHSLVTIGHRKFSLDWSQSDLKRI 483
+D+ LP WY LFNELY+L GG VW + S P + H + +L D R
Sbjct: 468 DDRSLPAWYKSALFNELYFLADGGTVWLEVREDSLPQELDGNMCHLRPTL----RDYGRF 523
Query: 484 VDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMW 543
+ Q N + V +SF +L W
Sbjct: 524 GYLEGQEYRMYNTYD------------VHFYASFALIML--------------------W 551
Query: 544 NTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHD 603
++ AL L + L+ +R + VM P K + +PHD
Sbjct: 552 PKLELSLQYDMALATL--REDLTRRRYLMSGVMA--PVKRR-------------NVIPHD 594
Query: 604 IGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAY 661
IG D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP M
Sbjct: 595 IGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMES 654
Query: 662 MDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDY 721
+FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +D
Sbjct: 655 EMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAAVCGAQDIQDK 714
Query: 722 FLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVD 776
F + + YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ +ACGL +
Sbjct: 715 FSSILSRGQEAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGDTEVFP 774
Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
AL+ ++ NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI
Sbjct: 775 TPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQ 834
Query: 837 EDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
E L GFQTA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 835 EGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|431902810|gb|ELK09025.1| Non-lysosomal glucosylceramidase [Pteropus alecto]
Length = 970
Score = 501 bits (1289), Expect = e-138, Method: Compositional matrix adjust.
Identities = 295/777 (37%), Positives = 418/777 (53%), Gaps = 103/777 (13%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K+ FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 119 RKTQVEKKKPFIDLINSVPLKQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 178
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
A+QF+V + R Y VL + P VL+ SW+W L G + YHALYPRAWTV
Sbjct: 179 ADQFTVCLRREGRTVYQQVLSVERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 230
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P ++ + CRQI+PI+PH+Y++SS PV +F + + N G + D++++F+ N +GG
Sbjct: 231 YQ-LPGQKVTLTCRQITPILPHDYQDSSLPVGIFVWDVENEGDEALDVSIMFSMRNGLGG 289
Query: 252 DSEFTGQHYNSKTKMN-DGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISG 310
+ +G +N + DG A +T H++
Sbjct: 290 GDDASGGLWNEPFCLERDGETA---------------------DTTITHIT-----AFDP 323
Query: 311 NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSL 370
+S T + +W ++ + G D + S S+ G I A+ + +PP + + FSL
Sbjct: 324 DS---TGQQVWQDLLQDGQLD--SPAGRSTPSQKGVGIAGAVCVTGKLPPRGQCCLEFSL 378
Query: 371 AWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDK 429
AWD P + F G+ YYRRYT+F+G+ +AA ++ A+ + WE +I AWQ P+L+D+
Sbjct: 379 AWDMPRIMFGAKGQVYYRRYTRFFGSDGDAAPALSHYALCRYTDWEERISAWQSPVLDDR 438
Query: 430 RLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQ 489
LP WY LFNELY+L GG +W ++VP
Sbjct: 439 SLPSWYKSALFNELYFLADGGTIW------------------------------LEVPED 468
Query: 490 N--DTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
+ V + ++ IL++ G+F YLEG EY M+NTYD
Sbjct: 469 ALPEELVGNMCQLRPILQEY----------------------GRFGYLEGQEYRMYNTYD 506
Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC 607
VHFY+SFALIML+PK++LS+Q D A A + D S + L G K +PHDIG
Sbjct: 507 VHFYASFALIMLWPKLELSLQYDMALATLREDLSWRQYLMSGVVAPVKRRNVIPHDIGDP 566
Query: 608 D--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF 665
D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F + +WP M +F
Sbjct: 567 DDEPWLRVNAYMIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMKF 626
Query: 666 DRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFK 725
D+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + D F
Sbjct: 627 DKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAALCGTQDILDKFSSI 686
Query: 726 FQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKA 780
+ + YE+ LWNG Y+NYD S QS SI +DQ AGQW+ RA GL +
Sbjct: 687 LSRGREAYERLLWNGHYYNYDCSSQPQSYSIMSDQCAGQWFLRASGLGEGDTEVFPTPHV 746
Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
AL+ ++ +NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E
Sbjct: 747 VCALQTIFKFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQE 803
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 26/41 (63%), Gaps = 2/41 (4%)
Query: 850 IYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
Y W GL AFQTPEA+ +RSL YMRPL+IWAM
Sbjct: 890 CYRTVWERLGL--AFQTPEAYCQHQVFRSLAYMRPLSIWAM 928
>gi|10440626|gb|AAG16864.1|AC069145_13 unknown protein [Oryza sativa Japonica Group]
Length = 444
Score = 497 bits (1279), Expect = e-137, Method: Compositional matrix adjust.
Identities = 248/379 (65%), Positives = 296/379 (78%), Gaps = 1/379 (0%)
Query: 521 LLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
L ++G EN+G+FLYLEG+EY+MW TYDVHFY+SFAL+ LFPKI+LSIQRDFA AV+ D
Sbjct: 18 LSKNGPENVGKFLYLEGVEYIMWCTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDR 77
Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
+MK L DG RKV GAVPHD+G DPW E+NAY ++DT++WKDLNPKFVLQVYRD
Sbjct: 78 RRMKFLADGTSGIRKVKGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFA 137
Query: 641 ATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWV 700
ATGD F + VWP+V M YM+QFDRDGDG+IENDGFPDQTYD W+V GISAY GGLW+
Sbjct: 138 ATGDMSFGRDVWPAVCAVMDYMNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWL 197
Query: 701 AALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQ 759
AALQAA+ +A +GDR + + KF +AK VYE KLWNGSYFNYD+ SS S SIQADQ
Sbjct: 198 AALQAAATMAHRLGDRPFAEKYKLKFIQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQ 257
Query: 760 LAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQS 819
LAGQWYA + GL P+ DE+K RSAL+K++ +NV+KV GG+ GAVNGM P+G+VD + MQS
Sbjct: 258 LAGQWYAASSGLPPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQS 317
Query: 820 REIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSL 879
REIW+GVTY VAA+M+ + GF TA GI+ A WS G GY FQTPE W D YRSL
Sbjct: 318 REIWTGVTYGVAANMLLHGMEHQGFTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSL 377
Query: 880 CYMRPLAIWAMQWALTRPK 898
YMRPLAIWAMQWA + PK
Sbjct: 378 IYMRPLAIWAMQWARSPPK 396
>gi|403306664|ref|XP_003943843.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Saimiri
boliviensis boliviensis]
Length = 929
Score = 495 bits (1274), Expect = e-137, Method: Compositional matrix adjust.
Identities = 315/838 (37%), Positives = 434/838 (51%), Gaps = 95/838 (11%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFP------RVC 125
R+ + K FID L +G PLGG+G G+I R +RG+F RWQ+ P V
Sbjct: 127 RKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186
Query: 126 EDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALY 185
D+P+ +F+V + R Y VL + P VL+ SW+W L G + YHALY
Sbjct: 187 ADQPICPLKFTVCLRREGQTVYQQVLSLERPNVLR--------SWNWGLCGYFAFYHALY 238
Query: 186 PRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTW 245
PRAWTV++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+
Sbjct: 239 PRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSM 297
Query: 246 TNSVGGDSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVS 301
N +GG + G +N + DG V +LLHH T LP P T A AA+ T V+
Sbjct: 298 RNGLGGGDDAPGGLWNEPFHLERDGETVQGLLLHHPT---LPNPYTMAAAARVTADTMVT 354
Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
F G T + +W ++ + G D + S ++ G + A+ S + P
Sbjct: 355 HITAFDPDG-----TGQQVWQDLLQDGQLD--SPAGQSTPTQKGVGVAGAVCVSSKLRPR 407
Query: 362 SEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
+ + FSLAWD P + F G+ +YRRYT+F+G +AA ++ A+ + WE +I A
Sbjct: 408 GQCCLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALSRYTEWEERISA 467
Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVW---TDGSPPVHSLVTIGHRKFSLDWSQ 477
WQ P+L+D+ LP WY LFNELY+L GG VW + S P + H + +L
Sbjct: 468 WQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLEVREDSLPQELDGNMCHLRPTL---- 523
Query: 478 SDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEG 537
D R + Q N + V +SF +L
Sbjct: 524 RDYGRFGYLEGQEYRMYNTYD------------VHFYASFALIML--------------- 556
Query: 538 IEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVL 597
W ++ AL L + L+ +R + VM P K +
Sbjct: 557 -----WPKLELSLQYDMALATL--REDLTRRRYLMSGVMA--PVKRR------------- 594
Query: 598 GAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSV 655
+PHDIG D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP
Sbjct: 595 NVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVC 654
Query: 656 YVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD 715
M +FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G
Sbjct: 655 LAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAAVCGA 714
Query: 716 RGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL--- 771
+ +D F + + YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ +ACGL
Sbjct: 715 QDIQDKFSSILSRGQEAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEG 774
Query: 772 -LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAV 830
+ AL+ ++ NV GG GAVNGM P G D SS+QS E+W GV Y +
Sbjct: 775 DTEVFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGL 834
Query: 831 AASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
AA+MI E L GFQTA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 835 AATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 890
>gi|443709232|gb|ELU03988.1| hypothetical protein CAPTEDRAFT_186839 [Capitella teleta]
Length = 802
Score = 489 bits (1258), Expect = e-135, Method: Compositional matrix adjust.
Identities = 309/844 (36%), Positives = 454/844 (53%), Gaps = 130/844 (15%)
Query: 77 KGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFS 136
+GK+ FID +G P+GG+G GSIGR +RGEF R+Q+ P + K V A+QF
Sbjct: 68 QGKKMFIDNLNLVPWKPIYGAPIGGIGCGSIGRGFRGEFCRFQMIPGMYTHKTVEADQFI 127
Query: 137 VFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEP 196
+ V +SN + VL P LK + SW W ++TYHALYPR+WT +E P
Sbjct: 128 LTVRKSNETIHQCVLSP-----LK-LKGKRLSSWRWGFPSQQATYHALYPRSWTRYE-IP 180
Query: 197 DPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFT 256
+ +L ++CRQISPI P++Y+E+ +P SVF + + N G + DI+L FT+ N G + +
Sbjct: 181 ECDLTLICRQISPIFPNDYQETCFPGSVFVWEVINKGSDAVDISLTFTFKNGQGVKEDRS 240
Query: 257 GQHYNSK-----TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGN 311
G ++ T +D V V +H + + P + +AA+ G++VS +F
Sbjct: 241 GGVWSRAFQEGITGTDDCVSGVEIHQKFNDL--PCVYGVAAKHGPGINVSEKSYFNPQS- 297
Query: 312 SLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLA 371
T +++W+++ GS + + S T++ G + A+ A T+ P S +TFSL
Sbjct: 298 ----TGEEVWNDLLAEGSL-KSTPVSGSETAK-GEELACAVCAKTTISPYSHTSLTFSLT 351
Query: 372 WDCPEVNFMSGKT-YYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKR 430
WD P ++F + + Y+RRY +F+G A +A A+ SWE +I WQ+P L+ +
Sbjct: 352 WDMPIIHFGAKRNKYHRRYARFFGHDGCATPRLAAHALNHFTSWEEKIAEWQQPTLKANK 411
Query: 431 LPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQN 490
LP WY LFNE Y+++ GG+VW D +DV +
Sbjct: 412 LPAWYKSALFNETYFISDGGSVWVDS---------------------------IDVKD-- 442
Query: 491 DTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHF 550
++ E + FG +G E Y M+NT DVH+
Sbjct: 443 -----------AVSEHPFV-----QEFGKFAYLEGHE------------YRMYNTLDVHY 474
Query: 551 YSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPW 610
Y+SFAL+ L+P +QLS+Q D +D+ +PW
Sbjct: 475 YASFALMKLWPNLQLSLQYD---------------IDN-------------------EPW 500
Query: 611 FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGD 670
+NAY ++ T WKDLNPKFVLQVYRD V T D+++ ++P M + ++D D D
Sbjct: 501 SRLNAYIIHPTNEWKDLNPKFVLQVYRDYVFTKDEQYLYEMYPQCKAVMDHSLRWDVDRD 560
Query: 671 GMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK-- 728
G+I+N GF DQTYD W+V+G SAY GGLW+AA++A + +A +G ED L K+++
Sbjct: 561 GIIDNSGFADQTYDAWTVTGASAYCGGLWLAAVKAFTEMAHRLG--VVED--LMKYEEIL 616
Query: 729 ---AKVVYEKLWNG----SYFNYDNSGSSQSSSIQADQLAGQWYARACGLL--PIVDEDK 779
K EKLWNG Y+NYD S S SI ADQ+ GQW+ +ACG+ +
Sbjct: 617 ERGKKNFEEKLWNGWVDCEYYNYDCSESGHHDSIMADQMCGQWFLKACGVADNAVFPSAN 676
Query: 780 ARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDL 839
+ AL+ V+ +NVL GG+ GA+NGM P+ + D++S QS E W+GVTYA+ A+MI +
Sbjct: 677 VKRALEAVFKHNVLPFDGGRMGAINGMRPNAKKDVTSCQSDEFWTGVTYALGATMIQVGM 736
Query: 840 ADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
D GFQTA G Y W GL AFQTPEA+ + ++RSL YMRPLAIWA+Q A+ + P
Sbjct: 737 IDKGFQTAYGAYHTCWERYGL--AFQTPEAYFDNRRFRSLGYMRPLAIWAIQHAVEKFHP 794
Query: 900 KTLE 903
+ E
Sbjct: 795 QFFE 798
>gi|297741934|emb|CBI33369.3| unnamed protein product [Vitis vinifera]
Length = 508
Score = 471 bits (1211), Expect = e-129, Method: Compositional matrix adjust.
Identities = 227/452 (50%), Positives = 306/452 (67%), Gaps = 5/452 (1%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
D A P W+R+L++ L +F++ + E +++ +G+R+ IREEA++G++A IDPF
Sbjct: 31 DSAAPPEQAWRRRLNSHANILKEFSVTFTEAIKMIRLGIRLWSYIREEASQGRKAPIDPF 90
Query: 87 IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
+ ++S GVPLGG+GSGSI R +RGEF+ WQI P C+ P++ANQFS+F+SR G
Sbjct: 91 TRETCKPSASQGVPLGGMGSGSISRGFRGEFRHWQIVPGTCDASPIMANQFSIFISREGG 150
Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
+KY+SVL P E L + GI SW WNL G STYHAL+PRAWT+++GEPDPEL++
Sbjct: 151 NKKYASVLAPGQHEGLGKSGDQGISSWGWNLSGQHSTYHALFPRAWTIYDGEPDPELKVS 210
Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
CRQISP IPHNY++SS P +VF YT+ N+GK A ++LLFTW NS+GG S +G H N
Sbjct: 211 CRQISPFIPHNYRDSSLPTAVFVYTLVNTGKERAKVSLLFTWANSIGGISHLSGDHVNEP 270
Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
DGV VLLHH+T+ + PPVTFA+AA ET V V++ P F +S S +TAKDMW +
Sbjct: 271 FIGEDGVSGVLLHHKTAKENPPVTFAIAACETQNVSVTVLPSFGLSEGS-HITAKDMWGK 329
Query: 324 IKEHGSFDRLNSME-TSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
+ + G FDR N S+ S PG ++ AA++AS V P + V F+LAW P+V F+ G
Sbjct: 330 MVQDGQFDRENCYSGRSMPSSPGETLCAAVSASAWVEPHGKCTVAFALAWSSPKVKFLKG 389
Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
+Y+RRYTK+YGT + AA NI DA+ + WE +IE WQ PIL D RLPEWY TLFNE
Sbjct: 390 SSYHRRYTKYYGTSERAALNIVHDALTNYKQWEEEIEKWQSPILRDDRLPEWYKFTLFNE 449
Query: 443 LYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLD 474
LY+L AGG VW D S P S H+ +++
Sbjct: 450 LYFLVAGGTVWIDSSLPATSSKNSLHQSAAVE 481
>gi|324500616|gb|ADY40283.1| Non-lysosomal glucosylceramidase [Ascaris suum]
Length = 890
Score = 470 bits (1210), Expect = e-129, Method: Compositional matrix adjust.
Identities = 323/886 (36%), Positives = 458/886 (51%), Gaps = 117/886 (13%)
Query: 44 EIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVG 103
+P ++ T+ ++ P R + K ++ FI+ F +GVP GG+G
Sbjct: 30 RVPFNRPTI--RQVYDSLPFVRRYFIYWLKHVGKKEKLFINTFQPLKHKPHYGVPCGGIG 87
Query: 104 SGSIGRSYRGEFQRWQIFPRVCEDKP--VLANQFSVFVSRSNGQK----YSSVLC--PKT 155
GSIGR +RG F ++ + P + E K + A+QF V + R NG+ Y VLC P T
Sbjct: 88 CGSIGRDFRGGFCKFGLRPGLIEHKVDVLRADQFIVTLRRKNGEGFATVYQKVLCAAPTT 147
Query: 156 PEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNY 215
P T+ + +WD++ D TY LYPR+WT ++ + + IV RQISP+IP +Y
Sbjct: 148 P-----ATSTELNAWDFSFPPDCLTYRGLYPRSWTRYDIR-EYRVSIVMRQISPVIPGDY 201
Query: 216 KESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------GDSEFTGQHYNSKTK 265
++SS PVSVF + + N G +++ FT+ N G S F +H +S+
Sbjct: 202 RDSSLPVSVFVFDVENGGDEDLYVSITFTFRNGTGYHKWSQEGNCTSSLFESKHAHSR-- 259
Query: 266 MNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIK 325
V V L H S T+A+AA VS C F S + ++W +
Sbjct: 260 ----VEGVCLEHTISRMR--CTYAVAANCNQRTTVSRCAAFDPSK-----SGDEIWQSLW 308
Query: 326 EHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDS--EGQVTFSLAWDCPEVNFMSGK 383
G + + P + A+ A V +S E + F+LAWD P V F + K
Sbjct: 309 STGHLQNVGEVT------PTRELAIAVCAQTIVKANSKAEAPMEFALAWDMPIVQFGARK 362
Query: 384 -TYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
Y RRYT+F+G +AA + ++ + WE +I WQ P+L + LP+WY LFNE
Sbjct: 363 RNYRRRYTRFFGDRGDAAPALCAHSLTVYEQWERRINDWQDPVLNHRNLPDWYKSALFNE 422
Query: 443 LYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSS 502
LY++ GG+VW + F DW+
Sbjct: 423 LYFMTDGGSVWFE---------------FEEDWACE------------------------ 443
Query: 503 ILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPK 562
EQ +P ++ L++D G+F YLE EY M NTYDVHFY+SFAL L+P
Sbjct: 444 --EQHLSPYSIA------LMKD----FGRFGYLESWEYRMVNTYDVHFYASFALAQLWPS 491
Query: 563 IQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG--ICDPWFEVNAYCLYD 620
++ IQ +F + D + +G +K VPHD+G DPW NAY ++D
Sbjct: 492 LEHVIQAEFTDQIEHFDGKPVIYHMEGNRGPKKSRTRVPHDLGNPAVDPWLLTNAYVMHD 551
Query: 621 TARWKDLNPKFVLQVYRD--VVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDG 677
T +WKDLN KF+L +RD V D++F VWP+V + + +D+DGDGMIEN G
Sbjct: 552 TGKWKDLNLKFILISFRDYTCVLGKDQRFLNLVWPAVKSLINEGLKNWDQDGDGMIENFG 611
Query: 678 FPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKL 736
DQTYD W +SG+SAY G LW+AAL+AA +AREVGD + +A+ Y KL
Sbjct: 612 AADQTYDAWKMSGVSAYCGCLWLAALRAALVMAREVGDSACAHRYAETLSRAREAYITKL 671
Query: 737 WNGSYFNYDNSGSSQSSSIQADQLAGQWYARACG------LLPIVDEDKARSALKKVYNY 790
WNG Y+N+D S+ + I ADQL G W+ ++ LLP +SAL +YNY
Sbjct: 672 WNGEYYNFDERSRSRHT-IMADQLCGYWFLQSISPELADELLPAA---HVKSALNIIYNY 727
Query: 791 NVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGI 850
NVL+ GG+ GAVNGM DG VD + +Q+ E+W+GVTYA+AA +I + GF TA G
Sbjct: 728 NVLRFAGGRLGAVNGMRRDGTVDRNYIQADEMWTGVTYALAAFLIQQGAVQRGFDTAWGC 787
Query: 851 YEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
Y+ ++ GL Y QTPEA YR++ YMRPL+IWAMQWAL R
Sbjct: 788 YDGCFNRFGLQY--QTPEALYEQRFYRAIGYMRPLSIWAMQWALIR 831
>gi|392918721|ref|NP_504047.2| Protein R08F11.1 [Caenorhabditis elegans]
gi|387910741|emb|CCD72306.2| Protein R08F11.1 [Caenorhabditis elegans]
Length = 830
Score = 464 bits (1194), Expect = e-127, Method: Compositional matrix adjust.
Identities = 295/869 (33%), Positives = 439/869 (50%), Gaps = 105/869 (12%)
Query: 55 KETVQLAPIGVRILCLIREEAAKGKRAFIDPF-IKRHLTSSHGVPLGGVGSGSIGRSYRG 113
K +++L P+ VR+ E G+ AFID F + RH T + GVPLGG+G GSIG +RG
Sbjct: 36 KHSIKLIPLFVRVALHTFVEWWNGREAFIDIFNVFRHFTYT-GVPLGGIGCGSIGTDFRG 94
Query: 114 EFQRWQIFPRVCEDKPVL-ANQFSVFVSRSNGQK--YSSVL-CPKTPEVLKDTTAAGIGS 169
F R+ I P + E NQF V V + Y S+L C + P A +
Sbjct: 95 GFNRFSIIPGIKEQTETQKCNQFIVTVHSKKTFELIYQSILSCAEFP-------ATVLPK 147
Query: 170 WDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTI 229
WD + + Y L+PRAW + +V +SP+IP +Y +SS P++ F + +
Sbjct: 148 WDTTIPAEDVRYRGLFPRAWQEFR-LGSSGVTVVVEHLSPVIPGDYSDSSLPLANFEFHV 206
Query: 230 YNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFA 289
+N ++++ ++ N G S+ D + L H PVT+A
Sbjct: 207 FNDSYEEVEVSITMSFRNGTGNRKWNDENLCQSQKIQKDTMVVRTLAHTVKGM--PVTYA 264
Query: 290 LAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIG 349
+ +E +G V+ C + N T +W +++ +G + + S+P +G
Sbjct: 265 IGTEEKNGSKVTTC---LFDPNG---TGGRLWSDLEAYGHLSSYDHL----PSKP-KELG 313
Query: 350 AAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYY-RRYTKFYG--THQNAAANIARD 406
A+ +S VPPD FSL W P+V+F + + +Y RRY +F+ A I R
Sbjct: 314 IAVCSSFFVPPDGAHNTQFSLTWYMPQVHFGTAERFYNRRYCRFFNGPDADEVTAAICRH 373
Query: 407 AILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTI 466
+ +W+ +IE WQ P+L D++LPEWY +FNELYY+ G VW + P
Sbjct: 374 GLQNFSTWQKKIEDWQAPVLNDQKLPEWYKSAIFNELYYIVDGSTVWFEYDP-------- 425
Query: 467 GHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGE 526
DW + MS E+ +
Sbjct: 426 -------DWKTEE------------------SLMSEQTEKQF------------------ 442
Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL 586
+ G+F Y+E EY M NTYDVHFYSS+A++ +P+I++S+Q DFA + D + L
Sbjct: 443 KEYGRFGYMESWEYFMINTYDVHFYSSWAILKNWPQIEMSMQLDFADQMDRVDNNIATSL 502
Query: 587 DDGQWVSRKVLGAVPHDIG-------------ICDPWFEVNAYCLYDTARWKDLNPKFVL 633
DG+ ++ K + +PHD+G + DPW NAY L+DT RWKDLN KFV+
Sbjct: 503 ADGEQMTIKSVDRIPHDMGHPTKGCIQEKRLSVADPWIHTNAYILHDTGRWKDLNLKFVI 562
Query: 634 QVYRD--VVATGDKK---FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSV 688
YRD ++ G +K + + ++ +D+D DGMIENDGF DQTYD W +
Sbjct: 563 SCYRDWKLIELGSEKGQVLEFFLGKCTKIVDGALECWDKDNDGMIENDGFADQTYDVWKM 622
Query: 689 SGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNS 747
+G SAY G LW+AAL + + ++ G ++ K + A Y KLWNG++F +D
Sbjct: 623 TGTSAYCGSLWIAALSSYIEMLKQSG--LPTKHYEEKLEMAYDAYIGKLWNGTFFKFDEL 680
Query: 748 GSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGML 807
S + ADQL G W A + +DK +SAL ++ YNV G+ GAVNG L
Sbjct: 681 -PENSKIVMADQLCGFWAMTAMDEPVQISKDKMKSALDTIFKYNVQMYNNGRCGAVNGYL 739
Query: 808 PDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTP 867
RVD SS+QS E+W+G+TYA++A MI + + + F+T+ G++E+ W L Y QTP
Sbjct: 740 TSERVDGSSIQSEEVWAGITYALSAMMIEKGMDEQAFKTSEGLFESIWHRFPLQY--QTP 797
Query: 868 EAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
EA +D YR+L YMRPL+IWA+Q AL R
Sbjct: 798 EAITSDGMYRALGYMRPLSIWAIQHALDR 826
>gi|71992715|ref|NP_001021680.1| Protein HPO-13, isoform a [Caenorhabditis elegans]
gi|18376555|emb|CAD21661.1| Protein HPO-13, isoform a [Caenorhabditis elegans]
Length = 959
Score = 462 bits (1190), Expect = e-127, Method: Compositional matrix adjust.
Identities = 292/849 (34%), Positives = 447/849 (52%), Gaps = 104/849 (12%)
Query: 79 KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKP--VLANQFS 136
++ FI+ F +GVP GG+G G+IGR +RG F ++ + P + E K V A+QF
Sbjct: 119 EKLFINTFQPLKHKPYYGVPCGGIGCGAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFI 178
Query: 137 VFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEP 196
+ V ++ Y VL + + + +WD+ Y L+PR+WT P
Sbjct: 179 LSVRENDRCIYQKVLSAADVQ----RPSGQLSTWDFKFPKKNVHYRGLFPRSWTTFRV-P 233
Query: 197 DPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSAD--ITLLFTWTNSVGG--- 251
+ EL +V RQ+SP++PHNY++++YPV +F + N G S + I++ FT+ N G
Sbjct: 234 ELELTVVIRQVSPVLPHNYEDTTYPVCLFLIDVENGGSASREYEISVAFTFRNGTGNRRW 293
Query: 252 --DSEFTGQHYNSK-------TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSL 302
++E +G + ++ T + + V L+H S P T+ LA + ++
Sbjct: 294 EREAECSGVKFETEEEGLKPTTTAQEKISGVSLYHTISSM--PCTYGLATTHREHTTTTV 351
Query: 303 CPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDS 362
C F S N L W+ +K+ G D + E + + + A+ T+PP+S
Sbjct: 352 CERFDPSRNGAAL-----WNHLKQTG--DVPSCEEECLIN--AREMAVAVCNRFTLPPNS 402
Query: 363 EGQVTFSLAWDCPEVNFMS-GKTYYRRYTKFYGTHQ--NAAANIARDAILEHGSWELQIE 419
++L+WD P+V+F S ++Y+RRYT+F+ + +AA ++ A+ WE +IE
Sbjct: 403 TKTHDYALSWDMPKVHFGSVARSYHRRYTRFFEASEAGSAADSLCVRALRLKDKWESEIE 462
Query: 420 AWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSD 479
AWQ PIL K+LP+WY +FNELY++ GG VW + F +W++ +
Sbjct: 463 AWQSPILNHKKLPDWYKSAIFNELYFITDGGTVWFE---------------FDENWAEHE 507
Query: 480 LKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIE 539
+ ++ +D + IG+F YLE E
Sbjct: 508 ------------------------------------THLSHYTKDKMKEIGRFGYLESWE 531
Query: 540 YLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGA 599
Y M NTYDVHFY+SFAL L+P+++++IQ +F V + +G W + K
Sbjct: 532 YRMVNTYDVHFYASFALAELWPELEITIQSEFTDQVYHSIEKPTRFHMEGDWANVKTASR 591
Query: 600 VPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRD--VVATGDKKFAKAVWPSV 655
VPHD+G +PW NAY ++DT +WKDLN K+VL +RD V++ + F WP+V
Sbjct: 592 VPHDLGNPADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSNEHEDFLFHTWPAV 651
Query: 656 YVAM-AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
+ M ++ +D+DGDGMIEN G DQTYD W + G+SAY G LW+A+L+ A +A +
Sbjct: 652 RMIMLEALENWDQDGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVAIEMAGLMK 711
Query: 715 DRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACG--- 770
D ++ F +KAK V+ + LW G+YF + S+ + + ADQL G W+ ++
Sbjct: 712 DDETQQLFRNTLEKAKKVFIDTLWTGTYFRFCERSRSRET-VMADQLCGYWFLQSVSPEL 770
Query: 771 ---LLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVT 827
LLP RSAL +Y NV K G+ GAVNGM P G VD +Q+ E+W+GVT
Sbjct: 771 ADELLP---NHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWTGVT 827
Query: 828 YAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAI 887
YAVA+ +I + + F TA G Y + TGL Y QTPEA YR++ YMRPL+I
Sbjct: 828 YAVASLLIQQGEVEKAFHTASGSYLTCFEETGLQY--QTPEALYESKFYRAIGYMRPLSI 885
Query: 888 WAMQWALTR 896
WAMQW+L +
Sbjct: 886 WAMQWSLKK 894
>gi|71992723|ref|NP_001021681.1| Protein HPO-13, isoform b [Caenorhabditis elegans]
gi|50511739|emb|CAH04764.1| Protein HPO-13, isoform b [Caenorhabditis elegans]
Length = 930
Score = 462 bits (1189), Expect = e-127, Method: Compositional matrix adjust.
Identities = 292/849 (34%), Positives = 447/849 (52%), Gaps = 104/849 (12%)
Query: 79 KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKP--VLANQFS 136
++ FI+ F +GVP GG+G G+IGR +RG F ++ + P + E K V A+QF
Sbjct: 90 EKLFINTFQPLKHKPYYGVPCGGIGCGAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFI 149
Query: 137 VFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEP 196
+ V ++ Y VL + + + +WD+ Y L+PR+WT P
Sbjct: 150 LSVRENDRCIYQKVLSAADVQ----RPSGQLSTWDFKFPKKNVHYRGLFPRSWTTFRV-P 204
Query: 197 DPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSAD--ITLLFTWTNSVGG--- 251
+ EL +V RQ+SP++PHNY++++YPV +F + N G S + I++ FT+ N G
Sbjct: 205 ELELTVVIRQVSPVLPHNYEDTTYPVCLFLIDVENGGSASREYEISVAFTFRNGTGNRRW 264
Query: 252 --DSEFTGQHYNSK-------TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSL 302
++E +G + ++ T + + V L+H S P T+ LA + ++
Sbjct: 265 EREAECSGVKFETEEEGLKPTTTAQEKISGVSLYHTISSM--PCTYGLATTHREHTTTTV 322
Query: 303 CPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDS 362
C F S N L W+ +K+ G D + E + + + A+ T+PP+S
Sbjct: 323 CERFDPSRNGAAL-----WNHLKQTG--DVPSCEEECLIN--AREMAVAVCNRFTLPPNS 373
Query: 363 EGQVTFSLAWDCPEVNFMS-GKTYYRRYTKFYGTHQ--NAAANIARDAILEHGSWELQIE 419
++L+WD P+V+F S ++Y+RRYT+F+ + +AA ++ A+ WE +IE
Sbjct: 374 TKTHDYALSWDMPKVHFGSVARSYHRRYTRFFEASEAGSAADSLCVRALRLKDKWESEIE 433
Query: 420 AWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSD 479
AWQ PIL K+LP+WY +FNELY++ GG VW + F +W++ +
Sbjct: 434 AWQSPILNHKKLPDWYKSAIFNELYFITDGGTVWFE---------------FDENWAEHE 478
Query: 480 LKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIE 539
+ ++ +D + IG+F YLE E
Sbjct: 479 ------------------------------------THLSHYTKDKMKEIGRFGYLESWE 502
Query: 540 YLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGA 599
Y M NTYDVHFY+SFAL L+P+++++IQ +F V + +G W + K
Sbjct: 503 YRMVNTYDVHFYASFALAELWPELEITIQSEFTDQVYHSIEKPTRFHMEGDWANVKTASR 562
Query: 600 VPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRD--VVATGDKKFAKAVWPSV 655
VPHD+G +PW NAY ++DT +WKDLN K+VL +RD V++ + F WP+V
Sbjct: 563 VPHDLGNPADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSNEHEDFLFHTWPAV 622
Query: 656 YVAM-AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
+ M ++ +D+DGDGMIEN G DQTYD W + G+SAY G LW+A+L+ A +A +
Sbjct: 623 RMIMLEALENWDQDGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVAIEMAGLMK 682
Query: 715 DRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACG--- 770
D ++ F +KAK V+ + LW G+YF + S+ + + ADQL G W+ ++
Sbjct: 683 DDETQQLFRNTLEKAKKVFIDTLWTGTYFRFCERSRSRET-VMADQLCGYWFLQSVSPEL 741
Query: 771 ---LLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVT 827
LLP RSAL +Y NV K G+ GAVNGM P G VD +Q+ E+W+GVT
Sbjct: 742 ADELLP---NHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWTGVT 798
Query: 828 YAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAI 887
YAVA+ +I + + F TA G Y + TGL Y QTPEA YR++ YMRPL+I
Sbjct: 799 YAVASLLIQQGEVEKAFHTASGSYLTCFEETGLQY--QTPEALYESKFYRAIGYMRPLSI 856
Query: 888 WAMQWALTR 896
WAMQW+L +
Sbjct: 857 WAMQWSLKK 865
>gi|71992731|ref|NP_001021682.1| Protein HPO-13, isoform c [Caenorhabditis elegans]
gi|15130776|emb|CAC48128.1| Protein HPO-13, isoform c [Caenorhabditis elegans]
Length = 922
Score = 462 bits (1189), Expect = e-127, Method: Compositional matrix adjust.
Identities = 292/849 (34%), Positives = 447/849 (52%), Gaps = 104/849 (12%)
Query: 79 KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKP--VLANQFS 136
++ FI+ F +GVP GG+G G+IGR +RG F ++ + P + E K V A+QF
Sbjct: 82 EKLFINTFQPLKHKPYYGVPCGGIGCGAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFI 141
Query: 137 VFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEP 196
+ V ++ Y VL + + + +WD+ Y L+PR+WT P
Sbjct: 142 LSVRENDRCIYQKVLSAADVQ----RPSGQLSTWDFKFPKKNVHYRGLFPRSWTTFRV-P 196
Query: 197 DPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSAD--ITLLFTWTNSVGG--- 251
+ EL +V RQ+SP++PHNY++++YPV +F + N G S + I++ FT+ N G
Sbjct: 197 ELELTVVIRQVSPVLPHNYEDTTYPVCLFLIDVENGGSASREYEISVAFTFRNGTGNRRW 256
Query: 252 --DSEFTGQHYNSK-------TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSL 302
++E +G + ++ T + + V L+H S P T+ LA + ++
Sbjct: 257 EREAECSGVKFETEEEGLKPTTTAQEKISGVSLYHTISSM--PCTYGLATTHREHTTTTV 314
Query: 303 CPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDS 362
C F S N L W+ +K+ G D + E + + + A+ T+PP+S
Sbjct: 315 CERFDPSRNGAAL-----WNHLKQTG--DVPSCEEECLIN--AREMAVAVCNRFTLPPNS 365
Query: 363 EGQVTFSLAWDCPEVNFMS-GKTYYRRYTKFYGTHQ--NAAANIARDAILEHGSWELQIE 419
++L+WD P+V+F S ++Y+RRYT+F+ + +AA ++ A+ WE +IE
Sbjct: 366 TKTHDYALSWDMPKVHFGSVARSYHRRYTRFFEASEAGSAADSLCVRALRLKDKWESEIE 425
Query: 420 AWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSD 479
AWQ PIL K+LP+WY +FNELY++ GG VW + F +W++ +
Sbjct: 426 AWQSPILNHKKLPDWYKSAIFNELYFITDGGTVWFE---------------FDENWAEHE 470
Query: 480 LKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIE 539
+ ++ +D + IG+F YLE E
Sbjct: 471 ------------------------------------THLSHYTKDKMKEIGRFGYLESWE 494
Query: 540 YLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGA 599
Y M NTYDVHFY+SFAL L+P+++++IQ +F V + +G W + K
Sbjct: 495 YRMVNTYDVHFYASFALAELWPELEITIQSEFTDQVYHSIEKPTRFHMEGDWANVKTASR 554
Query: 600 VPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRD--VVATGDKKFAKAVWPSV 655
VPHD+G +PW NAY ++DT +WKDLN K+VL +RD V++ + F WP+V
Sbjct: 555 VPHDLGNPADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSNEHEDFLFHTWPAV 614
Query: 656 YVAM-AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
+ M ++ +D+DGDGMIEN G DQTYD W + G+SAY G LW+A+L+ A +A +
Sbjct: 615 RMIMLEALENWDQDGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVAIEMAGLMK 674
Query: 715 DRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACG--- 770
D ++ F +KAK V+ + LW G+YF + S+ + + ADQL G W+ ++
Sbjct: 675 DDETQQLFRNTLEKAKKVFIDTLWTGTYFRFCERSRSRET-VMADQLCGYWFLQSVSPEL 733
Query: 771 ---LLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVT 827
LLP RSAL +Y NV K G+ GAVNGM P G VD +Q+ E+W+GVT
Sbjct: 734 ADELLP---NHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWTGVT 790
Query: 828 YAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAI 887
YAVA+ +I + + F TA G Y + TGL Y QTPEA YR++ YMRPL+I
Sbjct: 791 YAVASLLIQQGEVEKAFHTASGSYLTCFEETGLQY--QTPEALYESKFYRAIGYMRPLSI 848
Query: 888 WAMQWALTR 896
WAMQW+L +
Sbjct: 849 WAMQWSLKK 857
>gi|341901617|gb|EGT57552.1| hypothetical protein CAEBREN_15139 [Caenorhabditis brenneri]
Length = 925
Score = 462 bits (1188), Expect = e-127, Method: Compositional matrix adjust.
Identities = 288/841 (34%), Positives = 439/841 (52%), Gaps = 98/841 (11%)
Query: 79 KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKP--VLANQFS 136
++ FI+ F +GVP GG+G G+IGR +RG F ++ + P + E K V A+QF
Sbjct: 81 EKLFINTFQPLKHKPYYGVPCGGIGCGAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFI 140
Query: 137 VFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEP 196
+ V ++ Y VL D + + SWD+ Y L+PR+WT P
Sbjct: 141 LSVRENDTCIYQKVLSAA------DVHRSQLSSWDFGFPKKHLHYRGLFPRSWTTFRV-P 193
Query: 197 DPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSA-DITLLFTWTNSVGG---- 251
+ L + RQ+SP++PHNY+E++YPV +F + N+ A D+++ FT+ N G
Sbjct: 194 ELGLTVTIRQVSPVLPHNYQETTYPVCLFLVDVENTSNNRAYDVSIAFTFRNGTGNRRWD 253
Query: 252 -DSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISG 310
+SE + ++S + V V L+H S P T+ LA+ H+S+C F S
Sbjct: 254 RESECSAGRFDSIHSDRESVSGVSLYHTISSM--PCTYGLASTHKKDSHLSICERFDPSR 311
Query: 311 NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSL 370
+ W+ +K+ G + + ++ +PP S ++L
Sbjct: 312 SGAAF-----WNHLKQTGDVPEFEEE----CPQNAREMAVSVCNRFHLPPQSSKTHDYAL 362
Query: 371 AWDCPEVNFMS-GKTYYRRYTKFY-GTHQNAAAN-IARDAILEHGSWELQIEAWQRPILE 427
+WD P+V+F S ++Y+RRYT+F+ G+ + A+ + A+L WE +IE WQ PIL
Sbjct: 363 SWDMPKVHFGSVARSYHRRYTRFFEGSEAGSVADSLCVRALLARNQWEEEIEKWQSPILT 422
Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
+LP+WY +FNELY++ GG VW + F +W+
Sbjct: 423 HPKLPDWYKSAIFNELYFMVDGGTVWFE---------------FDENWAD---------- 457
Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
++ +V E+M +NIG+F YLE EY M NTYD
Sbjct: 458 HEAHLSVYTKEKM--------------------------KNIGRFGYLESWEYRMVNTYD 491
Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG-- 605
VHFY+S+AL L+P+++L++Q +F V + +G W + K VPHD+G
Sbjct: 492 VHFYASYALAELWPQLELTVQSEFTDQVYHSIEKPTRFHMEGDWANVKTASRVPHDLGNP 551
Query: 606 ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRD--VVATGDKKFAKAVWPSVYVAM-AYM 662
+PW NAY ++DT +WKDLN K+VL +RD V++ ++F WP++ + M +
Sbjct: 552 ADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSEAHQEFLHHTWPAIKMIMLEAL 611
Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
+ +D++GDGMIEN G DQTYD W + G+SAY G LW+A+L+ + +A V D +E+ F
Sbjct: 612 ENWDQNGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVSVEMAAIVKDEITENLF 671
Query: 723 LFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACG------LLPIV 775
+KAK V+ + LW G+YF + S+ ++ ADQL G W+ ++ LLP
Sbjct: 672 KGTLEKAKKVFIDLLWTGTYFRFCERSRSR-ETVMADQLCGYWFLQSVSPEMADDLLP-- 728
Query: 776 DEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMI 835
RSAL +Y NV K G+ GAVNGM P G VD +Q+ E+W+GVTYAVA+ +I
Sbjct: 729 -NHMVRSALDTIYRLNVCKFGNGQMGAVNGMKPSGVVDREYIQADEMWTGVTYAVASLLI 787
Query: 836 HEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALT 895
+ + F TA G Y + GL Y QTPEA YR++ YMRPL+IWAMQW+L
Sbjct: 788 QQGEVEKAFHTASGSYLTCFEQAGLQY--QTPEALYESKFYRAIGYMRPLSIWAMQWSLK 845
Query: 896 R 896
+
Sbjct: 846 K 846
>gi|169642399|gb|AAI60640.1| LOC100145361 protein [Xenopus (Silurana) tropicalis]
Length = 571
Score = 456 bits (1172), Expect = e-125, Method: Compositional matrix adjust.
Identities = 254/611 (41%), Positives = 343/611 (56%), Gaps = 65/611 (10%)
Query: 293 QETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSME-TSVTSEPGSSIGAA 351
+E GV VS C F +G +++W ++ E G RLNS S ++ G AA
Sbjct: 1 RERPGVCVSHCTQFDPTG-----LGQEVWKDLLEDG---RLNSAPGPSPSTAKGKKTAAA 52
Query: 352 IAASVTVPPDSEGQVTFSLAWDCPEVNFMSG-KTYYRRYTKFYGTHQNAAANIARDAILE 410
+AA +VPP S + F L+WD P + F +G + ++RRYT+F+G +AA + +
Sbjct: 53 VAAGCSVPPRSRETLEFCLSWDMPNIRFGAGQREHHRRYTRFFGCQGDAAPALCHYGLTH 112
Query: 411 HGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRK 470
WE +IEAWQ PIL++ LP WY LFNELY++ GG VW +
Sbjct: 113 FVGWEQKIEAWQEPILQNGDLPAWYKSALFNELYFMADGGTVWVE--------------- 157
Query: 471 FSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIG 530
V + +D N++ R L ++D + G
Sbjct: 158 -------------VSAADDDD---NLMRRGCKDLPG---------------MKDVLQEYG 186
Query: 531 QFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQ 590
+F YLEG EY M+NTYDVHFY+SFALIML+P++++S+Q D AA+V+ DP K L G
Sbjct: 187 RFAYLEGQEYRMYNTYDVHFYASFALIMLWPQLEISLQYDMAASVLQEDPELRKYLMSGT 246
Query: 591 WVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFA 648
K VPHD+G D PW ++NAY ++DTA WKDLN KFVLQVYRD T D +
Sbjct: 247 VAPVKSRSVVPHDVGDPDDEPWQKLNAYLIHDTANWKDLNLKFVLQVYRDYHITKDTTYL 306
Query: 649 KAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASA 708
+ +W M +FD DGDG+IEN GF DQTYD W ++G S+Y GGLW+AA+
Sbjct: 307 RDMWSVCQTVMETSLKFDEDGDGLIENSGFADQTYDDWVMTGPSSYCGGLWLAAVCMMCK 366
Query: 709 LAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYAR 767
+A +GD +++ F + K +EK LWNG Y+N+D S+S+ DQ AG W+ R
Sbjct: 367 MAEVLGDGAAQEKFTDILHRGKAAFEKLLWNGKYYNFDCGDQPYSNSVMVDQCAGNWFLR 426
Query: 768 ACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIW 823
ACGL + +D SAL+ ++ NV + G+ GAVNGM PDG D SS+QS E+W
Sbjct: 427 ACGLGAGESEVFPKDHVVSALRTIFELNVKQFADGQMGAVNGMRPDGTTDTSSVQSDEVW 486
Query: 824 SGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMR 883
GV Y +AA+MIHE L GF TA G Y W LG +FQTPEA+ +RSL YMR
Sbjct: 487 IGVVYGLAATMIHEGLVQEGFATAEGCYRTVWE--RLGMSFQTPEAYCEKQVFRSLAYMR 544
Query: 884 PLAIWAMQWAL 894
PL+IWAMQ AL
Sbjct: 545 PLSIWAMQLAL 555
>gi|308485740|ref|XP_003105068.1| hypothetical protein CRE_20734 [Caenorhabditis remanei]
gi|308257013|gb|EFP00966.1| hypothetical protein CRE_20734 [Caenorhabditis remanei]
Length = 933
Score = 452 bits (1163), Expect = e-124, Method: Compositional matrix adjust.
Identities = 285/861 (33%), Positives = 447/861 (51%), Gaps = 114/861 (13%)
Query: 79 KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKP--VLANQFS 136
++ FI+ F +GVP GG+G G+IGR +RG F ++ + P + E K V A+QF
Sbjct: 81 EKLFINTFQPLKHKPYYGVPCGGIGCGAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFI 140
Query: 137 VFVSRSNGQKYSSVLCPKTPEVLKDTTAAG--IGSWDWNLKGDKSTYHALYPRAWTVHEG 194
+ V ++ Y VL D +G + +WD+ Y L+PR+WT
Sbjct: 141 ISVRENDTCIYQKVLSAA------DVHRSGNQLSTWDFTFPKKNVNYRGLFPRSWTTFRI 194
Query: 195 EPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSA-DITLLFTWTNSVGG-- 251
P+ + + RQ+SP++PHNY++++YPV +F + N+ + ++++ FT+ N G
Sbjct: 195 -PELGITVTIRQVSPVLPHNYEDTTYPVCLFVIDVENTTTSKEYEVSIAFTFRNGTGNRR 253
Query: 252 ---DSEFTGQHYN----SKTKMNDG---VHAVLLHHRTSHQLPPVTFALAAQETDGVHVS 301
+SE + + S ++ N G V + L+H S P T+ LA +G +S
Sbjct: 254 WERESECSAVKFETPGPSDSESNPGQEKVTGISLYHTISSM--PCTYGLATTHKNGSILS 311
Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGS-----------FDRLNSMETSVTSEPGSSIGA 350
+C F + N W+ +++ G F ++NS + + +
Sbjct: 312 VCERFDPARNGSAF-----WNHLQQTGDVPSCVRILHVFFHKINSFQEEECAVNAREMAV 366
Query: 351 AIAASVTVPPDSEGQVTFSLAWDCPEVNFMS-GKTYYRRYTKFYGTHQ--NAAANIARDA 407
A+ + P ++ ++L+WD P+V+F S ++Y+RRYT+F+ + AA ++ A
Sbjct: 367 AVCNRFRLSPGTKKTHDYALSWDMPKVHFGSVARSYHRRYTRFFDAAEAGTAADSLCVRA 426
Query: 408 ILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIG 467
+ WE +IE WQ PIL ++LP+WY +FNELY++ GG VW +
Sbjct: 427 LHLKNKWEEEIEKWQLPILNHEKLPDWYKSAIFNELYFITDGGTVWFE------------ 474
Query: 468 HRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEE 527
F W++ ++ + E+M SI
Sbjct: 475 ---FDEQWAEHEMH----------LSEYTKEKMKSI------------------------ 497
Query: 528 NIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLD 587
G+F YLE EY M NTYDVHFY+S+AL L+P+++L++Q +F V +
Sbjct: 498 --GRFGYLESWEYRMVNTYDVHFYASYALAELWPQLELTVQSEFTDQVYHSIEKPTRFHM 555
Query: 588 DGQWVSRKVLGAVPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGD- 644
+G W + K VPHD+G +PW NAY ++DT +WKDLN K+VL +RD V +
Sbjct: 556 EGDWANVKTASRVPHDLGNPADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSEE 615
Query: 645 -KKFAKAVWPSVYVAM-AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAA 702
++F WP+V + M ++ +D++GDGMIEN G DQTYD W + G+SAY G LW+A+
Sbjct: 616 HQEFLFHTWPAVKMIMLEALENWDQNGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLAS 675
Query: 703 LQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLA 761
L+ + +AR +G+ +E+ F KAK V+ + LW G+YF + S+ ++ ADQL
Sbjct: 676 LRVSIEMARLMGEEATENLFRVTLNKAKKVFIDTLWTGTYFRFCERSRSR-ETVMADQLC 734
Query: 762 GQWYARACG------LLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMS 815
G W+ ++ LLP RSAL +Y NV + G+ GAVNGM P+G VD
Sbjct: 735 GYWFLQSVSPEMVDDLLP---NHMVRSALDTIYRLNVCRFGNGQMGAVNGMKPNGVVDRE 791
Query: 816 SMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQ 875
+Q+ E+W+GVTYAVA+ +I + + F TA G Y + TGL Y QTPEA
Sbjct: 792 YIQADEMWTGVTYAVASLLIQQGEVEKAFHTASGSYLTCFEQTGLQY--QTPEALYESKF 849
Query: 876 YRSLCYMRPLAIWAMQWALTR 896
YR++ YMRPL+IWAMQW+L +
Sbjct: 850 YRAIGYMRPLSIWAMQWSLKK 870
>gi|268569852|ref|XP_002640631.1| Hypothetical protein CBG08749 [Caenorhabditis briggsae]
Length = 899
Score = 449 bits (1155), Expect = e-123, Method: Compositional matrix adjust.
Identities = 289/858 (33%), Positives = 450/858 (52%), Gaps = 96/858 (11%)
Query: 79 KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKP--VLANQFS 136
++ FI+ F +GVP GG+G G+IGR +RG F ++ + P + E K V A+QF
Sbjct: 75 EKLFINTFQPLKHKPYYGVPCGGIGCGAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFI 134
Query: 137 VFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEP 196
+ V ++ Y VL +V + ++ + +WD+ Y L+PR+WT P
Sbjct: 135 LSVRENDRCIYQKVL--SAADVHR--SSGQLSTWDFGFPKKNVHYRGLFPRSWTTFRI-P 189
Query: 197 DPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNS-GKTSADITLLFTWTNSVGG---- 251
D L +V RQ+SP++P+NY++++YPV +F + N+ D+++ FT+ N G
Sbjct: 190 DLGLTVVIRQVSPVLPNNYEDTTYPVCLFLIDVENTTNDREYDVSVAFTFRNGTGNRRWE 249
Query: 252 -DSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISG 310
++E + + + + V V L+HR S T+ LA+ + +S+C F S
Sbjct: 250 REAECSAVMFETPEDSAEQVTGVSLYHRISGM--SCTYGLASTHSKDSVLSVCERFDPSK 307
Query: 311 NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSL 370
+ L W+ + + G L E ++ + + A++ T+PP S ++L
Sbjct: 308 SGGAL-----WNHLMQTGDVPSLEE-ECAINAR---EMAVAVSNRFTLPPLSSKTHDYAL 358
Query: 371 AWDCPEVNFMS-GKTYYRRYTKFYGTHQ--NAAANIARDAILEHGSWELQIEAWQRPILE 427
+WD P+V+F S G++Y+RRYT+F+ + +AA ++ A+ WE +IE WQ PIL
Sbjct: 359 SWDMPKVHFGSTGRSYHRRYTRFFEASEAGSAADSLCVRALRLRHKWEEEIEKWQSPILT 418
Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
+LP+WY +FNELY++ GG VW + F +W
Sbjct: 419 HDKLPDWYKSAIFNELYFITDGGTVWFE---------------FDENW------------ 451
Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
A + + +N ++ +NIG+F YLE EY M NTYD
Sbjct: 452 ------------------------ADHEAHLSNYTKEKMKNIGRFGYLESWEYRMVNTYD 487
Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG-- 605
VHFY+S+AL L+P+++L++Q +F V + +G W + K VPHD+G
Sbjct: 488 VHFYASYALAELWPELELTVQSEFTDQVYHSIEKPTRFHMEGDWANVKTASRVPHDLGNP 547
Query: 606 ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGD--KKFAKAVWPSVYVAM-AYM 662
+PW NAY ++DT +WKDLN K+VL +RD V + ++F WP+V + M +
Sbjct: 548 ADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSEEHQEFLHHTWPAVKMIMLEAL 607
Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
+D++GDGMIEN G DQTYD W + G+SAY G LW+A+L+ + +A + D ++ F
Sbjct: 608 ANWDQNGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVSIEMAGLLEDGETKKLF 667
Query: 723 LFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACG------LLPIV 775
L KAK V+ + LW G+YF + S+ ++ ADQL G W+ ++ LLP
Sbjct: 668 LETLNKAKKVFVDVLWTGTYFRFCERSRSR-ETVMADQLCGYWFLQSVSPEMADDLLP-- 724
Query: 776 DEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMI 835
RSAL +Y NV + G+ GAVNGM P G VD +Q+ E+W+GVTYAVA+ +I
Sbjct: 725 -SHMVRSALDTIYRLNVCQFGNGQMGAVNGMKPSGVVDREYIQADEMWTGVTYAVASLLI 783
Query: 836 HEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALT 895
+ + F TA G Y + TGL Y QTPEA YR++ YMRPL+IWAMQW+L
Sbjct: 784 QQGEVEKAFHTASGSYLTCFEQTGLQY--QTPEALYESKFYRAIGYMRPLSIWAMQWSLK 841
Query: 896 RPKPKTLEKQMKPEVTDE 913
+ K+ P++ E
Sbjct: 842 KHCGLNTSKERLPKLPSE 859
>gi|341888271|gb|EGT44206.1| hypothetical protein CAEBREN_07747 [Caenorhabditis brenneri]
Length = 820
Score = 449 bits (1155), Expect = e-123, Method: Compositional matrix adjust.
Identities = 295/863 (34%), Positives = 437/863 (50%), Gaps = 95/863 (11%)
Query: 55 KETVQLAPIGVRILCLIREEAAKGKRAFIDPF-IKRHLTSSHGVPLGGVGSGSIGRSYRG 113
K +++L P+ VR+ E G+ AFID F + RH T + GVPLGG+GSGSIG +RG
Sbjct: 36 KHSIKLIPLFVRVALHTFVEWWNGREAFIDIFNVFRHFTFT-GVPLGGIGSGSIGTDFRG 94
Query: 114 EFQRWQIFPRVCEDKPVL-ANQF--SVFVSRSNGQKYSSVL-CPKTPEVLKDTTAAGIGS 169
F R+ I P + E NQF SV ++ Y S+L C + P + +
Sbjct: 95 GFNRFSIIPGIKEQTDTQKCNQFIASVHSKKTFELIYQSILSCAEFPSTV-------LPK 147
Query: 170 WDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTI 229
WD + + Y L+PRAW + +V +SP+IP++Y +SS P++ F + +
Sbjct: 148 WDSTIPAEDVRYRGLFPRAWQEFR-LGSSGITVVIEHLSPVIPNDYSDSSLPLANFDFHV 206
Query: 230 YNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFA 289
+N ++++ + N G + ++ +H + K+ V T +P VT+A
Sbjct: 207 FNDSPEEVEVSITMAFRNGTG-NRKWNDEHLCTSQKVQKETQVVRTLSHTVKGMP-VTYA 264
Query: 290 LAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIG 349
+ +E +G V+ C + N G +W +++ +G +++ +G
Sbjct: 265 IGTEEKNGAKVTTC---LFDPNGTG---GRLWSDLEAYGHLSSYDNLPPK-----PKELG 313
Query: 350 AAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYY-RRYTKFYG--THQNAAANIARD 406
A+ +S VPPD FSL+W PEV+F + + +Y RRY +F+ AA I R
Sbjct: 314 IAVCSSFFVPPDGSHTTQFSLSWYMPEVHFGTAERFYNRRYCRFFNGPDADEVAAAICRH 373
Query: 407 AILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTI 466
+ W+ IE WQ PI+ +++LPEWY +FNELYY+ G VW + P
Sbjct: 374 GLQNIEKWQEAIEKWQNPIITNQKLPEWYRSAIFNELYYIVDGSTVWFEYDP-------- 425
Query: 467 GHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGE 526
DW T N++ + + Y
Sbjct: 426 -------DWR---------------TEENLISEQTEKQFREY------------------ 445
Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL 586
G+F Y+E EY M NTYDVHFYSS+A++ +P+I++S+Q DFA V D L
Sbjct: 446 ---GRFGYMESWEYFMLNTYDVHFYSSWAILKNWPEIEMSMQLDFADQVDRIDNGTATSL 502
Query: 587 DDGQWVSRKVLGAVPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGD 644
DG + K +PHD+G + DPW NAY L+DT WKDLN KFV+ YRD
Sbjct: 503 ADGDQMLIKSYDRIPHDMGHPMADPWLHTNAYILHDTGSWKDLNLKFVISCYRDYKMIAK 562
Query: 645 KKFAKAVWPSVYVAMAY------MDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
+ K ++ ++ +D+D DGMIENDGF DQTYD W ++G SAY G L
Sbjct: 563 DREDKDQLLEFFLGKCTKIVEDALESWDKDKDGMIENDGFADQTYDVWKMTGTSAYCGSL 622
Query: 699 WVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQA 757
W+AAL + ++ G ++ K A Y KLWNG +F +D S + A
Sbjct: 623 WLAALTCYIEMLKQAG--SPSKHYEGKLLNAYDAYTTKLWNGRFFKFDEL-PDNSKIVMA 679
Query: 758 DQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSM 817
DQL G W A V E+K + AL+ ++ YNV GK GAVNG L RVD SS+
Sbjct: 680 DQLCGFWALTAMDEPVQVSEEKMKLALETIFKYNVQMYNDGKCGAVNGFLTSERVDGSSI 739
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
QS E+W+G+TYA++A MI +++ ++ F+T+ G++E+ W+ L Y QTPEA +D YR
Sbjct: 740 QSEEVWAGITYALSAMMIEKNMDEMAFKTSEGLFESIWNRFPLQY--QTPEAITSDGMYR 797
Query: 878 SLCYMRPLAIWAMQWALTRPKPK 900
+L YMRPL+IWA+Q AL R K
Sbjct: 798 ALGYMRPLSIWAIQHALDRRHKK 820
>gi|224028315|gb|ACN33233.1| unknown [Zea mays]
Length = 384
Score = 448 bits (1153), Expect = e-123, Method: Compositional matrix adjust.
Identities = 224/343 (65%), Positives = 270/343 (78%), Gaps = 1/343 (0%)
Query: 523 QDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK 582
+DG E++G+FLYLEG+EY+MWNTYDVHFY+SFAL+ LFPKI+LSIQRDFA AV+ D K
Sbjct: 20 KDGPEHVGKFLYLEGVEYIMWNTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRK 79
Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVAT 642
+K L DG RKV GAVPHD+G DPW E+NAY ++DT++WKDLNPKFVLQ+YRD AT
Sbjct: 80 VKFLADGTSGIRKVKGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAAT 139
Query: 643 GDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAA 702
GD +F + VWP+V AM YMDQFDRDGDG+IENDGFPDQTYD W+V GISAY G LW+AA
Sbjct: 140 GDMQFGRDVWPAVCAAMDYMDQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAA 199
Query: 703 LQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLA 761
LQAA+ +A +GDR + + KF KAK VYE KLWNGSYFNYD+ SS S SIQADQLA
Sbjct: 200 LQAAATMAHRLGDRHFAEKYKLKFIKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLA 259
Query: 762 GQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSRE 821
GQWYA + GL P+ DE K R+AL+K++ +NV+KV GG+ GAVNGM P G+VD + MQSRE
Sbjct: 260 GQWYAASSGLPPLFDEHKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSRE 319
Query: 822 IWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAF 864
IW+GVTYAVAA+M+ + GF TA GI+ A WS G GYA
Sbjct: 320 IWTGVTYAVAANMLLHGMEHQGFTTAEGIFTAGWSEEGYGYAL 362
>gi|308471410|ref|XP_003097936.1| hypothetical protein CRE_12977 [Caenorhabditis remanei]
gi|308239241|gb|EFO83193.1| hypothetical protein CRE_12977 [Caenorhabditis remanei]
Length = 845
Score = 446 bits (1147), Expect = e-122, Method: Compositional matrix adjust.
Identities = 292/874 (33%), Positives = 444/874 (50%), Gaps = 92/874 (10%)
Query: 55 KETVQLAPIGVRILCLIREEAAKGKRAFIDPF-IKRHLTSSHGVPLGGVGSGSIGRSYRG 113
K +++L P+ VR+ E G+ AFID F + RH T + GVPLGG+GSGSIG +RG
Sbjct: 36 KHSIKLIPLFVRVALHTFVEWWNGREAFIDIFNVFRHFTYT-GVPLGGIGSGSIGTDFRG 94
Query: 114 EFQRWQIFPRVCEDKPVL-ANQF--SVFVSRSNGQKYSSVL-CPKTPEVLKDTTAAGIGS 169
F R+ I P + E NQF SV ++ Y S+L C + P A +
Sbjct: 95 GFNRFSIIPGIKEQTETQKCNQFIASVHSKKTFELIYQSILSCAEFP-------ATVLPK 147
Query: 170 WDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTI 229
WD + + Y L+PR W + +V +S +IP +Y +SS P++ F + +
Sbjct: 148 WDTTIPAEDVRYRGLFPRGWQEFR-LGSSGITVVIESLSSVIPGDYSDSSLPLANFDFHV 206
Query: 230 YNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFA 289
+N ++++ + N G + ++ +H K+ V T +P VT+A
Sbjct: 207 FNDSPEEVEVSITLAFRNGTG-NRKWNDEHLCQSQKVQRDTMVVRTLSHTVKGMP-VTYA 264
Query: 290 LAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIG 349
+ +E +G V+ C + N G +W +++ +G + + +G
Sbjct: 265 IGTEEKNGAKVTTC---LFDPNGTG---GRLWSDLEAYGHLSSYDHLPPK-----PKELG 313
Query: 350 AAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYY-RRYTKFYG--THQNAAANIARD 406
A+ +S VPPD FSL+W P+V+F + + +Y RRY +F+ AA+I R
Sbjct: 314 IAVCSSFFVPPDGSHTTQFSLSWYMPQVHFGTAERFYNRRYCRFFNGPDADEVAASICRH 373
Query: 407 AILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTI 466
+ W+ IE WQ P++ D++LP+WY +FNELYY+ G VW + P
Sbjct: 374 GLQNISKWQEAIEKWQTPVINDEKLPDWYRSAIFNELYYIVDGSTVWFEYDP-------- 425
Query: 467 GHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGE 526
+W + + + R + + ++ V L +
Sbjct: 426 -------EWKTDESL----ISEHTEKHFKEYGRFGYMECEFFSLVELRT----------- 463
Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL 586
F +L EY M NTYDVHFYSS+A++ +P+I++S+Q DFA V D L
Sbjct: 464 ----LFFFLTAWEYFMLNTYDVHFYSSWAILKNWPQIEMSMQLDFADQVDRIDNGTATSL 519
Query: 587 DDGQWVSRKVLGAVPHDIG-------------ICDPWFEVNAYCLYDTARWKDLNPKFVL 633
DG+ +S K +PHD+G + DPW NAY L+DT WKDLN KFV+
Sbjct: 520 ADGEQMSIKSYDRIPHDMGHPTKYCIQEHRLSVADPWLHTNAYILHDTGCWKDLNLKFVI 579
Query: 634 QVYRD--VVATGDKKFAKA-VWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
YRD +++ GD++ + + + ++ +D+D DGMIENDGF DQTYD W ++G
Sbjct: 580 SCYRDWKMISNGDQEILEFFIGKCTKIVDGALESWDKDQDGMIENDGFADQTYDVWKMTG 639
Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE----KLWNGSYFNYDN 746
SAY G LW+AAL + ++ G F +K + YE KLWNG +F +D
Sbjct: 640 TSAYCGSLWLAALTCYIQMLKKAGSPTK-----FYDEKLLMAYEAYTTKLWNGKFFKFDE 694
Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
S + ADQL G W A V E+K +SAL+ ++ YNV GGK GAVNG
Sbjct: 695 L-PDNSKIVMADQLCGFWALTAMDEPVQVSEEKMKSALETIFKYNVQMYDGGKCGAVNGY 753
Query: 807 LPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQT 866
L RVD SS+QS E+W+G+TYA++A MI + + ++ F+T+ G++++ W+ L FQT
Sbjct: 754 LTSERVDGSSIQSEEVWAGITYALSAMMIEKGMDEMAFKTSEGLFDSIWNRFPL--QFQT 811
Query: 867 PEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPK 900
PEA +D YR+L YMRPL+IWA+Q AL + K
Sbjct: 812 PEAITSDGMYRALGYMRPLSIWAIQHALEKRNRK 845
>gi|341899054|gb|EGT54989.1| hypothetical protein CAEBREN_31176 [Caenorhabditis brenneri]
Length = 831
Score = 445 bits (1145), Expect = e-122, Method: Compositional matrix adjust.
Identities = 294/873 (33%), Positives = 436/873 (49%), Gaps = 104/873 (11%)
Query: 55 KETVQLAPIGVRILCLIREEAAKGKRAFIDPF-IKRHLTSSHGVPLGGVGSGSIGRSYRG 113
K +++L P+ VR+ E G+ AFID F + RH T + GVPLGG+GSGSIG +RG
Sbjct: 36 KHSIKLIPLFVRVALHTFVEWWNGREAFIDIFNVFRHFTFT-GVPLGGIGSGSIGTDFRG 94
Query: 114 EFQRWQIFPRVCEDKPVL-ANQF--SVFVSRSNGQKYSSVL-CPKTPEVLKDTTAAGIGS 169
F R+ I P + E NQF SV ++ Y S+L C + P + +
Sbjct: 95 GFNRFSIIPGIKEQTDTQKCNQFIASVHSKKTFELIYQSILSCAEFPSTV-------LPK 147
Query: 170 WDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTI 229
WD + + Y L+PRAW + +V +SP+IP++Y +SS P++ F + +
Sbjct: 148 WDSTIPAEDVRYRGLFPRAWQEFR-LGSSGITVVIEHLSPVIPNDYSDSSLPLANFDFHV 206
Query: 230 YNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFA 289
+N ++++ + N G + ++ +H + K+ V T +P VT+A
Sbjct: 207 FNDSPEEVEVSITMAFRNGTG-NRKWNDEHLCTSQKVQKETQVVRTLSHTVKGMP-VTYA 264
Query: 290 LAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIG 349
+ +E +G V+ C + N G +W +++ +G +++ +G
Sbjct: 265 IGTEEKNGAKVTTC---LFDPNGTG---GRLWSDLEAYGHLSSYDNLPPK-----PKELG 313
Query: 350 AAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYY-RRYTKFYG--THQNAAANIARD 406
A+ +S VPPD FSL+W PEV+F + + +Y RRY +F+ AA I R
Sbjct: 314 IAVCSSFFVPPDGSHTTQFSLSWYMPEVHFGTAERFYNRRYCRFFNGPDADEVAAAICRH 373
Query: 407 AILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTI 466
+ W+ IE WQ PI+ +++LPEWY +FNELYY+ G VW + P
Sbjct: 374 GLQNIEKWQEAIEKWQNPIINNQKLPEWYRSAIFNELYYIVDGSTVWFEYDP-------- 425
Query: 467 GHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGE 526
DW T N++ + + Y
Sbjct: 426 -------DWR---------------TEENLISEQTEKQFREY------------------ 445
Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL 586
G+F Y+E EY M NTYDVHFYSS+A++ +P+I++S+Q DFA V D L
Sbjct: 446 ---GRFGYMESWEYFMLNTYDVHFYSSWAILKNWPEIEMSMQLDFADQVDRIDNGTATSL 502
Query: 587 DDGQWVSRKVLGAVPHDIG-------------ICDPWFEVNAYCLYDTARWKDLNPKFVL 633
DG + K +PHD+G + DPW NAY L+DT WKDLN KFV+
Sbjct: 503 ADGDQMLIKSYDRIPHDMGHPTKYCIQEHRLSVADPWLHTNAYILHDTGSWKDLNLKFVI 562
Query: 634 QVYRDVVATGDKKFAKAVWPSVYVAMAY------MDQFDRDGDGMIENDGFPDQTYDTWS 687
YRD + K ++ ++ +D+D DGMIENDGF DQTYD W
Sbjct: 563 SCYRDYKMIAKDREDKDQLLEFFLGKCTKIVEDALESWDKDKDGMIENDGFADQTYDVWK 622
Query: 688 VSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNS 747
++G SAY G LW+AAL + ++ G S++Y KLWNG +F +D
Sbjct: 623 MTGTSAYCGSLWLAALTCYIEMLKQAGS-PSKNYEGKLLNAYDAYITKLWNGRFFKFDEL 681
Query: 748 GSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGML 807
S + ADQL G W A V E+K + AL+ ++ YNV GK GAVNG L
Sbjct: 682 -PDNSKIVMADQLCGFWALTAMDEPVQVSEEKMKLALETIFKYNVQMYNDGKCGAVNGFL 740
Query: 808 PDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTP 867
RVD SS+QS E+W+G+TYA++A MI +++ ++ F+T+ G++E+ W+ L Y QTP
Sbjct: 741 TSERVDGSSIQSEEVWAGITYALSAMMIEKNMDEMAFKTSEGLFESIWNRFPLQY--QTP 798
Query: 868 EAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPK 900
EA +D YR+L YMRPL+IWA+Q AL R K
Sbjct: 799 EAITSDGMYRALGYMRPLSIWAIQHALDRRHKK 831
>gi|268555576|ref|XP_002635777.1| Hypothetical protein CBG10433 [Caenorhabditis briggsae]
Length = 821
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 293/864 (33%), Positives = 435/864 (50%), Gaps = 97/864 (11%)
Query: 55 KETVQLAPIGVRILCLIREEAAKGKRAFIDPF-IKRHLTSSHGVPLGGVGSGSIGRSYRG 113
K +++L P+ VR+ E G+ AFID F + RH T + GVPLGG+GSGSIG +RG
Sbjct: 36 KHSIKLIPLFVRVALHTFVEWWNGREAFIDIFNVFRHFTYT-GVPLGGIGSGSIGTDFRG 94
Query: 114 EFQRWQIFPRVCEDKPVL-ANQF--SVFVSRSNGQKYSSVL-CPKTPEVLKDTTAAGIGS 169
F R+ I P + E NQF SV ++ Y S+L C + P A +
Sbjct: 95 GFNRFSIIPGIKEQTETQKCNQFIASVHSKKTFELIYQSILSCAEFP-------ATVLPK 147
Query: 170 WDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTI 229
WD + + Y L+PRAW + +V +SP+IP +Y +SS P++ F + +
Sbjct: 148 WDSTIPAEDVRYRGLFPRAWQEFRLGTSG-ITVVVEHLSPVIPEDYSDSSLPLANFDFHV 206
Query: 230 YNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFA 289
+N ++++ ++ N G S D + L H PVT+A
Sbjct: 207 FNDSPEEVEVSITMSFRNGTGNKKWNDESLCQSHKVQKDTMVVRTLSHTVKGM--PVTYA 264
Query: 290 LAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIG 349
+ +E +G V+ C + N G +W +++ +G + + +G
Sbjct: 265 IGTEEKNGAKVTTC---LFDPNGTG---GRLWSDLEAYGHLSSYDHLPPK-----PKELG 313
Query: 350 AAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYY-RRYTKFYG--THQNAAANIARD 406
A+ +S VPPD FSLAW P+V+F + + +Y RRY +F+ A+I R
Sbjct: 314 IAVCSSFFVPPDGSHTTQFSLAWYMPQVHFGTAERFYNRRYCRFFNGPDADEVGASICRH 373
Query: 407 AILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTI 466
+ +W+ I WQ PI+ D++LPEWY +FNELYY+ G VW + P
Sbjct: 374 GLQNIETWQEAIRKWQDPIINDEKLPEWYRSAIFNELYYIVDGSTVWFEYDP-------- 425
Query: 467 GHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGE 526
+W + + I E +Q
Sbjct: 426 -------EWRTDE--------------ILISEETEKQFKQY------------------- 445
Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL 586
G+F Y+E EY M NTYDVHFYSS+A++ +P+I++S+Q DFA V D L
Sbjct: 446 ---GRFGYMESWEYFMINTYDVHFYSSWAILKNWPQIEMSMQMDFADQVDRVDYGTATSL 502
Query: 587 DDGQWVSRKVLGAVPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRD--VVAT 642
DG ++ K +PHD+G + DPW NAY L+DT WKDLN KFV+ YRD ++A
Sbjct: 503 ADGDTMTIKSYDRIPHDMGHPMADPWIHTNAYILHDTGHWKDLNLKFVISCYRDWKMIAE 562
Query: 643 GDKKFAKAV----WPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
G++ + + + ++ +D+D DGMIENDGF DQTYD W ++G SAY G L
Sbjct: 563 GEEDSQEILEFFLGKCTKIVDGALENWDKDKDGMIENDGFADQTYDVWKMTGTSAYCGSL 622
Query: 699 WVAALQAASALAREVG--DRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQ 756
W+AAL + + + G + E+ L ++ KLWNG +F +D S +
Sbjct: 623 WIAALTSYIQMLKRAGIPTKDYEEKLLMAYEAYTT---KLWNGKFFKFDEL-PDNSKIVM 678
Query: 757 ADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSS 816
ADQL G W A V E K +SAL+ ++ YNV GGK GAVNG L RVD SS
Sbjct: 679 ADQLCGFWALTAMDEPIQVSEGKIQSALETIFKYNVEMYDGGKCGAVNGFLASERVDGSS 738
Query: 817 MQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQY 876
+QS E+W+G+TY+++A MI + + ++ F+T+ G++ + W+ L Y QTPEA D Y
Sbjct: 739 IQSEEVWAGITYSLSAMMIEKGMDEMAFKTSEGLFHSIWNRFPLQY--QTPEAITADGMY 796
Query: 877 RSLCYMRPLAIWAMQWALTRPKPK 900
R+L YMRPL+IWA+Q AL + K +
Sbjct: 797 RALGYMRPLSIWAIQHALDKKKNR 820
>gi|358254610|dbj|GAA55931.1| non-lysosomal glucosylceramidase [Clonorchis sinensis]
Length = 1200
Score = 443 bits (1139), Expect = e-121, Method: Compositional matrix adjust.
Identities = 291/853 (34%), Positives = 422/853 (49%), Gaps = 93/853 (10%)
Query: 71 IREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPV 130
+R+ + + F+DP +GVP+GG+G G+IGR +RGEF R + P + P
Sbjct: 63 LRKRLIEKRLPFLDPSQHVPWKPIYGVPMGGIGCGAIGRGFRGEFVRSSLIPGMYSYDPQ 122
Query: 131 LANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWT 190
A+QF + + + + VL P+T A G+ SW+W + Y LYPR+WT
Sbjct: 123 PADQFILTIRKGGETLFHQVLSPQTS---VPAFANGLRSWNWGFSAQHAHYIGLYPRSWT 179
Query: 191 VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTN-SV 249
V+E L +VC Q+SP++PH+YK S P VF +T++N G + ++T+ F+W S
Sbjct: 180 VYEVA-HFNLILVCHQVSPVLPHDYKASCLPTCVFVWTVFNFGSEALEVTISFSWHGPSP 238
Query: 250 GGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVIS 309
S + S K +G+ +V L + PV L+A+ +H+ + P IS
Sbjct: 239 APRSAPNSNNQVSNHKQPNGLSSVELTQPLVKE--PVADYLSARIQFAMHLLVGPSGYIS 296
Query: 310 GNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFS 369
S G T + W ++ +L M ++ P + A A ++ + Q+ F+
Sbjct: 297 YVS-GYTITN-WPTLRRRTP--KLAVMVSATCIVPPARSANASAGDLS--EAGQSQLEFA 350
Query: 370 LAWDCPEVNFMSGKTYYRR-YTKFYGT-HQNAAANIARDAILEHGSWELQIEAWQRPILE 427
+ W P V F K YRR Y +++ A + A+ W +IE WQ P+L
Sbjct: 351 VVWHSPLVWFRGNKVIYRRRYARWFPEPGMLGAERLLESALSNWKDWVERIEQWQNPVLH 410
Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
+ P+WY LFNELYYL GG VW D P H H W + V
Sbjct: 411 NSSFPDWYKSALFNELYYLTDGGTVWLD---PNH------HGGADFKWQSWKYRSRV--- 458
Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
+ +G F YLEG EY M+NT D
Sbjct: 459 --------------------------------------AQEMGLFAYLEGHEYRMYNTLD 480
Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGI- 606
VH+ SS+ALI L+PK+QL++ D A + D +++ + G++ R AVPHD G
Sbjct: 481 VHYNSSWALIKLWPKLQLAVLLDCADLAIEEDQTQLYFIHQGRYGIRSTESAVPHDFGDP 540
Query: 607 -CDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSV-YVAMAYMDQ 664
+PW + NAY +Y T WKDLNPKFVLQV+RD T D + + P V + +
Sbjct: 541 EGEPWRDANAYVMYPTKDWKDLNPKFVLQVWRDWKLTQDNDYLLYMLPIVNRIIQVSLQS 600
Query: 665 FDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA-----------ALQAASALAREV 713
+D +GDGMIEN GFPDQTYDTW+ GISAY+GGLW+A AL A S L +
Sbjct: 601 WDSNGDGMIENSGFPDQTYDTWNAQGISAYTGGLWLACLYSACEMLEYALSANSPLKAFL 660
Query: 714 GDRGSED---------YFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQ 763
SED + AK VY++ LW G+Y+ Y N+ +I ADQL+G
Sbjct: 661 IASNSEDEVSWPQVQKELQILLESAKTVYDRALWTGTYYIYQNTPVGNHEAIMADQLSGH 720
Query: 764 WYARACGLLP--IVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSRE 821
+ R G P I+ + ALK + + N + G G +NG LP + D+S++Q+ E
Sbjct: 721 GFLRVGGAPPGAILPSEHVIMALKTIQSTNWESIQNGSLGVINGGLPGAKRDLSNVQAEE 780
Query: 822 IWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCY 881
W GV Y++AA+MI E ++ GF A Y ++ GL Y QTPEA+ D ++R Y
Sbjct: 781 FWVGVNYSLAATMILEGMSQEGFSLAEACYNTIYNRFGLQY--QTPEAYMIDGRFRCPGY 838
Query: 882 MRPLAIWAMQWAL 894
MRPLAIW++Q AL
Sbjct: 839 MRPLAIWSIQQAL 851
>gi|326528739|dbj|BAJ97391.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 424
Score = 436 bits (1121), Expect = e-119, Method: Compositional matrix adjust.
Identities = 230/410 (56%), Positives = 292/410 (71%), Gaps = 2/410 (0%)
Query: 26 VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDP 85
++ A+PA+ TWQRK + F++ + + + P+ ++ L + ++ KG+ DP
Sbjct: 15 INCAQPATRTWQRKFDDEGKKIELFSMTMNDMISIIPMILKGLMINADQRGKGRDILYDP 74
Query: 86 FIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ 145
F K G+PLGG+G+GSIGRSYRG FQ +QIFP + E++P+LANQFS FVSR NG+
Sbjct: 75 FRKWMDNCYRGLPLGGLGAGSIGRSYRGYFQHFQIFPALYEERPILANQFSAFVSRPNGK 134
Query: 146 KYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCR 205
YS+VL T + K GIGSWDW LK TYHAL+PR+WTV++GEPDPE+ I CR
Sbjct: 135 SYSTVLSAPTADAFKGVDKVGIGSWDWKLKEKNCTYHALFPRSWTVYDGEPDPEINITCR 194
Query: 206 QISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTK 265
QISPIIPHNYKESS+PV+VFT+T+ NSG T+AD+TLLFTWTNSVGG SE TG H NSK
Sbjct: 195 QISPIIPHNYKESSFPVAVFTFTVQNSGSTAADVTLLFTWTNSVGGRSELTGNHTNSKMI 254
Query: 266 MNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI-SGNSLGLTAKDMWHEI 324
DGV VLL HRT+ PPVTFA+A+QET+ V V+ CP F + S +S TAKDMW EI
Sbjct: 255 ERDGVQGVLLRHRTADGHPPVTFAIASQETEEVRVTDCPFFTMGSSDSGDFTAKDMWEEI 314
Query: 325 KEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKT 384
K+HGSF + + S+PGSSIGAA+AA+ TVP V+F+L+W CPEV F GKT
Sbjct: 315 KQHGSFSETRTDKEPRASKPGSSIGAAVAAATTVPAGGTRVVSFALSWSCPEVKFPDGKT 374
Query: 385 YYRRYTKFYGTHQNAAA-NIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
Y+RRYTKF G +AAA ++A DA+LEH WE +IE WQRPIL+DKRLPE
Sbjct: 375 YHRRYTKFCGLDGDAAAESLAHDALLEHMDWESKIEEWQRPILQDKRLPE 424
>gi|355689821|gb|AER98956.1| glucosidase, beta 2 [Mustela putorius furo]
Length = 550
Score = 433 bits (1113), Expect = e-118, Method: Compositional matrix adjust.
Identities = 240/560 (42%), Positives = 315/560 (56%), Gaps = 64/560 (11%)
Query: 339 SVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQ 397
S S+ G I A+ + +PP + ++ FSLAWD P + F G+ YYRRYT+F+G
Sbjct: 1 STPSQKGVGIAGAVCVTGKLPPRGQCRLEFSLAWDMPRIMFGAKGQVYYRRYTRFFGRDG 60
Query: 398 NAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
NAA ++ A+ + WE +I AWQ P+L+D+ LP WY LFNELY+L GG VW
Sbjct: 61 NAAPALSHYALCRYIDWEEKISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---- 116
Query: 458 PPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQN--DTAVNILERMSSILEQIYTPVALNS 515
++VP + D V + ++ IL++
Sbjct: 117 --------------------------LEVPEDSLPDELVGSMCQLRPILQEY-------- 142
Query: 516 SFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAV 575
G+F YLEG EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A
Sbjct: 143 --------------GRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALAT 188
Query: 576 MMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVL 633
+ D ++ + L G K +PHDIG D PW VNAY ++DTA WKDLN KFVL
Sbjct: 189 LREDLTRRQYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYVVHDTADWKDLNLKFVL 248
Query: 634 QVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISA 693
QVYRD TGD+ F + +WP M +FD+D DG+IEN G+ DQTYD W +G SA
Sbjct: 249 QVYRDYHLTGDQSFLRDMWPVCLAVMESEMKFDKDQDGLIENGGYADQTYDGWITTGPSA 308
Query: 694 YSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQS 752
Y GGLW+AA+ +A G + +D F ++ + YE+ LWNG Y+NYD S QS
Sbjct: 309 YCGGLWLAAVAVMVQMAVLCGAKDIQDKFSSILRRGQEAYERLLWNGRYYNYDCSPQPQS 368
Query: 753 SSIQADQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
SI +DQ AGQW+ +A GL + AL+ ++ +NV GG GAVNGM P
Sbjct: 369 CSIMSDQCAGQWFLKASGLGEGDTEVFPTQHVVCALQTIFEFNVRAFAGGAMGAVNGMQP 428
Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
G D SS+QS E+W GV Y +AA+MI E L GFQTA G Y W GL AFQTPE
Sbjct: 429 HGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPE 486
Query: 869 AWNTDDQYRSLCYMRPLAIW 888
A+ +RSL YMRPL+IW
Sbjct: 487 AYCQQRVFRSLAYMRPLSIW 506
>gi|37522029|ref|NP_925406.1| hypothetical protein gll2460 [Gloeobacter violaceus PCC 7421]
gi|35213028|dbj|BAC90401.1| gll2460 [Gloeobacter violaceus PCC 7421]
Length = 867
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 290/851 (34%), Positives = 408/851 (47%), Gaps = 133/851 (15%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQI--FPRVCEDKPVLANQFSVFVSRSNGQKYSSVLC 152
HG+PLGG G+G IGRS+RG+F W + +PV QF++F + + VLC
Sbjct: 79 HGMPLGGFGAGCIGRSHRGDFTLWHLDSGDHYFRSQPVC--QFALFEQSEGAKPRAKVLC 136
Query: 153 PKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIP 212
+ P +G+WDW TY ALYPR+W +EG + +++ +Q SPIIP
Sbjct: 137 TEKP------ADGSLGAWDWTYPKGAGTYAALYPRSWFTYEGFFNSHVQL--KQFSPIIP 188
Query: 213 HNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMND---- 268
+NYK++SYPV+VF +T +N +++L +W N VG FT Q + K+ D
Sbjct: 189 NNYKQTSYPVAVFEWTAFNPTDQPVTLSILVSWENMVGW---FTNQFKSRAVKVRDDDTP 245
Query: 269 -------------------------GVHAVLLHHRTSHQLP-PVTFALAAQ-----ETDG 297
G+ T Q P P T A++ + E DG
Sbjct: 246 VYDYEPYWGESTGNFNSAREENAYRGLTMSRASGYTEQQKPAPATAAVSERPFVTAEGDG 305
Query: 298 -----------VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGS 346
V ++ F +GN L ++W G RL + + + G
Sbjct: 306 QFVIATTREPDVEITYHTRFDPTGNGL-----EVWQPFATTG---RLGDLVSVRPAAKGE 357
Query: 347 SIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARD 406
I AA+A +T+ P V +LAWD P + F G YR YTK + A IA +
Sbjct: 358 RIAAAMAVRLTLKPGELRVVPMALAWDLPIMEFAQGVRRYRYYTKHFDKSGRNAWKIATE 417
Query: 407 AILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTI 466
A+ + W+ QI+ WQ P+L D LP+WY LFNELYYL +GG +W +G P
Sbjct: 418 ALDNYADWDRQIQEWQNPVLADPSLPDWYKQALFNELYYLTSGGTLWENGPVP------- 470
Query: 467 GHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGE 526
E+++S+ + P
Sbjct: 471 -------------------------------EKITSLADIAKAP---------------- 483
Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL 586
G+F LE I+Y +++ DV Y SFAL++ +P+I ++ R +A AV D S +
Sbjct: 484 PGQGRFAVLESIDYRWYDSLDVRLYGSFALLINWPEIDKAVLRSYADAVPDEDDSTRIIG 543
Query: 587 DDGQWVSRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG- 643
RK A+PHD+G DPW + N D WKDL FVLQVYRD V TG
Sbjct: 544 YTKTVAKRKSRDALPHDLGAPNEDPWIKNNYTTYQDCNLWKDLPCDFVLQVYRDYVDTGK 603
Query: 644 -DKKFAKAVWPSVYVAMAYMDQ-FDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
D F + WP+ A+ + FD+DGDG EN G PD TYD W + GISAY GGLW+A
Sbjct: 604 RDLGFVRYCWPASKAALVRLKTTFDQDGDGFPENGGPPDCTYDAWPLKGISAYCGGLWLA 663
Query: 702 ALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQL 760
AL AA + + VGD + F + +A+ +YEK LWNG Y+ D S S S +I ADQL
Sbjct: 664 ALLAAVEMGKLVGDAAAVQQFGAWYAQAQPLYEKTLWNGRYYRID-SQSKNSEAIMADQL 722
Query: 761 AGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSR 820
G++YA+ C L IV E +ARSAL+ VY +K GG+ G NG DG +
Sbjct: 723 CGEYYAQVCNLADIVPEPQARSALETVYQTCFVKFYGGRFGCANGTNADGSFIGDTEHPS 782
Query: 821 EIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLC 880
E+W+G+ + +AA MI + G A + + G G F+TPEA +R+
Sbjct: 783 EVWTGINFGLAAFMIRNGMRREGMAIAEAVVANVYGG---GLQFRTPEALTPARTFRACM 839
Query: 881 YMRPLAIWAMQ 891
Y+RP+AIWAMQ
Sbjct: 840 YLRPMAIWAMQ 850
>gi|147812670|emb|CAN61859.1| hypothetical protein VITISV_016692 [Vitis vinifera]
Length = 305
Score = 404 bits (1038), Expect = e-109, Method: Compositional matrix adjust.
Identities = 211/323 (65%), Positives = 250/323 (77%), Gaps = 23/323 (7%)
Query: 7 LENGLHEEE---KEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPI 63
+ENG H+E+ + P+ SS KV+P KPASLTWQRKL+T L++F L +E L I
Sbjct: 1 MENG-HKEDGDMEHPVKSSAHKVNPGKPASLTWQRKLNTKANTLTRFNLKLREIKHLISI 59
Query: 64 GVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPR 123
IDPF KR +TS HGVPLGG+G GSIGRSYRGEFQR+Q+FPR
Sbjct: 60 -------------------IDPFSKRLVTSYHGVPLGGIGGGSIGRSYRGEFQRYQLFPR 100
Query: 124 VCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHA 183
+CED PVLANQFSVFVSR NG+K S+VLCP+ PEVLK + ++GIGSWDWNL G+ TYHA
Sbjct: 101 ICEDSPVLANQFSVFVSRPNGKKSSTVLCPRNPEVLKGSASSGIGSWDWNLDGESCTYHA 160
Query: 184 LYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLF 243
LYPRAWTV+EGEPDPE+ I+ QISP IPHNYKESS+PVSVF +T+ NSGKTSADITLLF
Sbjct: 161 LYPRAWTVYEGEPDPEISIISSQISPFIPHNYKESSFPVSVFKFTLSNSGKTSADITLLF 220
Query: 244 TWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLC 303
TW NSVGG SEF+G HYNSK K DGVH VLLHH+T++ PPVTFA+AA+ET VH+S C
Sbjct: 221 TWANSVGGTSEFSGHHYNSKMKTKDGVHGVLLHHKTANGHPPVTFAIAAEETGDVHISEC 280
Query: 304 PHFVISGNSLGLTAKDMWHEIKE 326
P F+ISGNSLG+TAK+MW EIK+
Sbjct: 281 PCFLISGNSLGVTAKEMWQEIKQ 303
>gi|343959578|dbj|BAK63646.1| bile acid beta-glucosidase [Pan troglodytes]
Length = 514
Score = 404 bits (1037), Expect = e-109, Method: Compositional matrix adjust.
Identities = 225/524 (42%), Positives = 292/524 (55%), Gaps = 64/524 (12%)
Query: 375 PEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
P + F G+ +YRRYT+F+G +AA ++ A+ + WE +I AWQ P+L+D+ LP
Sbjct: 2 PRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYTEWEERISAWQSPVLDDRSLPA 61
Query: 434 WYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTA 493
WY LFNELY+L GG VW +
Sbjct: 62 WYKSALFNELYFLADGGTVWLE-------------------------------------- 83
Query: 494 VNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWNTYDVHFY 551
+LE +L G N+ L+ + G+F YLEG EY M+NTYDVHFY
Sbjct: 84 ---------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFY 129
Query: 552 SSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--P 609
+SFALIML+PK++LS+Q D A A + D ++ + L G K +P DIG D P
Sbjct: 130 ASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPRDIGDPDDEP 189
Query: 610 WFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDG 669
W VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +WP M +FD+D
Sbjct: 190 WLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDH 249
Query: 670 DGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA 729
DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + +D F +
Sbjct: 250 DGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRG 309
Query: 730 KVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKARSAL 784
+ YE+ LWNG Y+NYD+S QS S+ +DQ AGQW+ +ACGL + AL
Sbjct: 310 QEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRAL 369
Query: 785 KKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
+ ++ NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E L GF
Sbjct: 370 QTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGF 429
Query: 845 QTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
QTA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 430 QTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 471
>gi|47212590|emb|CAG12815.1| unnamed protein product [Tetraodon nigroviridis]
Length = 777
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 243/660 (36%), Positives = 354/660 (53%), Gaps = 78/660 (11%)
Query: 37 QRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRA-FIDPFIKRHLTSSH 95
++ +G++ LSQ ++ +G+R L ++ K+A FID F L +
Sbjct: 31 RKPFQSGDVSLSQ-------VLKHLGLGIRYLRWWYKKTRVEKKAPFIDMFGALPLRQIY 83
Query: 96 GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKT 155
G PLGG+G G+I R +RGEF RWQ+ P K V+ NQF+V + R Y VL +
Sbjct: 84 GAPLGGIGGGTITRGWRGEFCRWQLNPGKYHYKTVIENQFTVCLRRDGQTVYQQVLSVER 143
Query: 156 PEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNY 215
P L+ W+W G+ + YHALYPRAWTV+ P + + CRQISP+IPH+Y
Sbjct: 144 PPTLQ--------GWNWGFCGEYAYYHALYPRAWTVYH-LPGQNVTLTCRQISPVIPHDY 194
Query: 216 KESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMN---DGVHA 272
+++S PV+VF + + N + D++++FT N G + +G H+N ++ + V
Sbjct: 195 QDASLPVAVFVWDVENKNDYALDVSIMFTALNGSGHRDDKSGGHWNQPFRLEKEGEAVSG 254
Query: 273 VLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDR 332
VLLHH T+ + P T +AA+E VS F G GL W+++ G D
Sbjct: 255 VLLHHCTA--VNPYTLCIAAREQPDRDVSHQTAFSPKGTCNGL-----WNDLITDGRLD- 306
Query: 333 LNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG-KTYYRRYTK 391
+ +S + G + AA+AA V + + FSLAWD P++ F +G + + RRYT+
Sbjct: 307 -SPTGSSPPTAKGERVAAALAAGCAVSAQNRNCLEFSLAWDMPKITFGAGEQQHSRRYTR 365
Query: 392 FYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGA 451
+YG+ +A+ ++ A+ ++ WE +IE WQ P+L+D LP WY LFNELY++ GG
Sbjct: 366 YYGSKGDASPLLSHHALTQYRGWEKRIEEWQTPVLQDSSLPAWYKSALFNELYFVADGGT 425
Query: 452 VWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPV 511
VWT +L DV L +++++
Sbjct: 426 VWT------------------------ELAEDADVSGGLRPEDGGLPAQPAVIKE----- 456
Query: 512 ALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDF 571
G+F YLEG EY M+NTYDVH+Y+SFALIML+PK+ LS+Q D
Sbjct: 457 -----------------YGRFAYLEGQEYRMYNTYDVHYYASFALIMLWPKLALSLQYDI 499
Query: 572 AAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNP 629
A +V+ DP++ L +G+ K G VPHDIG +PW VNAY ++DTA WKDLN
Sbjct: 500 AGSVVQQDPTERLYLMNGRCSPVKTKGVVPHDIGDPEDEPWQRVNAYLIHDTAGWKDLNL 559
Query: 630 KFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVS 689
KFVLQVYRD T D ++ + +WP + QFD DGDG+IEN G+ DQTYD W+ S
Sbjct: 560 KFVLQVYRDFHLTQDAQYLRDMWPICEMVTESELQFDLDGDGLIENSGYADQTYDGWAAS 619
Score = 126 bits (317), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/118 (53%), Positives = 81/118 (68%), Gaps = 2/118 (1%)
Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
++K ++ALK +++ NV+ GG+ GAVNGM P+G D SS+QS E+W GV Y +AA+MIH
Sbjct: 661 KEKIQTALKSIFDLNVMSFAGGQMGAVNGMRPEGVPDRSSVQSDEVWIGVVYGLAATMIH 720
Query: 837 EDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
E + + G +TA G Y W LG AFQTPEA+ YRSL YMRPL IWAMQ AL
Sbjct: 721 EGMLEEGMRTAEGCYRTVWE--RLGMAFQTPEAYCEKGIYRSLAYMRPLCIWAMQLAL 776
>gi|353232147|emb|CCD79502.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
Length = 900
Score = 384 bits (987), Expect = e-103, Method: Compositional matrix adjust.
Identities = 276/868 (31%), Positives = 425/868 (48%), Gaps = 109/868 (12%)
Query: 71 IREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPV 130
I++ + + FID +GVP+GG+GSGSIGR YRGEF R + P +
Sbjct: 77 IKKRFIQHRLPFIDAVSHVPWRPIYGVPIGGIGSGSIGRGYRGEFCRSSLIPGMYSYDVQ 136
Query: 131 LANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWT 190
+QF V V ++ Y VL P T + A + W W +K Y LYPR+WT
Sbjct: 137 PVDQFIVTVRKNGVTVYHQVLSPLTKP---PSNAKKLKKWKWGFDPEKGHYLGLYPRSWT 193
Query: 191 VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG 250
V+E P+ +L +V +QISP+IPH+Y+ + PV+VF + + + K + +T+ F+W
Sbjct: 194 VYE-IPELQLVLVYQQISPVIPHDYQTTCLPVAVFYWKVISWNKENLKVTITFSWHGPNH 252
Query: 251 GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISG 310
+ S+ N + H T+ F +Q + +G
Sbjct: 253 RKRTKSSDFSKSEDGNNSSENTTNTHVSTAEN----AFPTCSQRSTAFS---------TG 299
Query: 311 NSLGLTAKDMWHEIKEHGSFDRLN--------SMETSVTSEPGSSIGAAIAASVTVP--- 359
+ L L + K+ +F L+ ++ ++ + A++A+ VP
Sbjct: 300 DFLKLNCQG-----KKLKTFTDLSWLGWLVGYTVPLDSKAKKSPKLAIAVSATTNVPRCN 354
Query: 360 -----PDSEGQVTFSLAWDCPEVNFMSGKTYY-RRYTKFYGTHQNAAANIA-RDAILEHG 412
S+ ++ F++ W P V F +G Y RRY +++ + + A + AI
Sbjct: 355 ITDNLNPSQSELEFAITWHSPIVKFRTGDVVYTRRYVRWFPVDRISGAKLLLAHAIDNWR 414
Query: 413 SWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFS 472
W +IE WQ PIL ++ LP WY LFNELYYL+ GG VW D P+
Sbjct: 415 QWVQKIEDWQNPILSNQSLPSWYKSALFNELYYLSDGGTVWLD---PIQ----------- 460
Query: 473 LDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQF 532
+D +SDL + + D V+ + + + IG F
Sbjct: 461 VDCFKSDLLNLTE-----DMKVDSWDHRARL----------------------SREIGLF 493
Query: 533 LYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWV 592
YLEG EY M+NTYDVH Y+S+ALI L+PKIQL++ D A + D + + + +G+ +
Sbjct: 494 GYLEGHEYRMYNTYDVHHYASWALIKLWPKIQLALNYDCADLTIAEDSTSVYYIGNGKAL 553
Query: 593 SRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKA 650
R AV HD G + PW NAY ++ T WKDLN KF+LQV+RD T D ++
Sbjct: 554 FRSSECAVVHDFGDPEDEPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLY 613
Query: 651 VWPSVY-VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA--- 706
+ P V + + +D D DG+IEN GFPDQTYD W+ G++AY+GG+W++ L A
Sbjct: 614 MLPIVLRILRKSLVAWDSDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYATFDM 673
Query: 707 --------SALAREVGDRGSE---------DYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
S + ++ + + D F KA+ Y KLW GSY+ +
Sbjct: 674 LSWCLKSDSPVYDQIVNNTDDTQRSWSEIKDEVQTLFTKARDSYNAKLWTGSYYAFQTYC 733
Query: 749 SSQSSSIQADQLAGQWYARACGLLP--IVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
+ + I ADQL+G W++R G+ P I+ ++ L+ + N N + G GA+NG
Sbjct: 734 TPRREVIMADQLSGYWFSRITGVPPNLILPKNHVVKTLQTISNCNWHGIKNGTIGAINGC 793
Query: 807 LPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQT 866
LP + D+SS+Q+ E W V Y++++ MI E + D G Y+ ++ GL Y QT
Sbjct: 794 LPVCKPDLSSVQAEEFWVAVNYSLSSLMIAEGMIDEGLALGEKCYDTIYNLYGLQY--QT 851
Query: 867 PEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
PEA+ +D ++R YMR LAIW++Q AL
Sbjct: 852 PEAYMSDGRFRCPGYMRALAIWSIQQAL 879
>gi|256082537|ref|XP_002577511.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
Length = 892
Score = 381 bits (978), Expect = e-102, Method: Compositional matrix adjust.
Identities = 277/868 (31%), Positives = 422/868 (48%), Gaps = 117/868 (13%)
Query: 71 IREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPV 130
I++ + + FID +GVP+GG+GSGSIGR YRGEF R + P +
Sbjct: 77 IKKRFIQHRLPFIDAVSHVPWRPIYGVPIGGIGSGSIGRGYRGEFCRSSLIPGMYSYDVQ 136
Query: 131 LANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWT 190
+QF V V ++ Y VL P T + A + W W +K Y LYPR+WT
Sbjct: 137 PVDQFIVTVRKNGVTVYHQVLSPLTKP---PSNAKKLKKWKWGFDPEKGHYLGLYPRSWT 193
Query: 191 VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG 250
V+E P+ +L +V +QISP+IPH+Y+ + PV+VF + + + K + +T+ F+W
Sbjct: 194 VYE-IPELQLVLVYQQISPVIPHDYQTTCLPVAVFYWKVISWNKENLKVTITFSWHGPNH 252
Query: 251 GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISG 310
+ S+ N + H T+ F +Q + +G
Sbjct: 253 RKRTKSSDFSKSEDGNNSSENTTNTHVSTAEN----AFPTCSQRSTAFS---------TG 299
Query: 311 NSLGLTAKDMWHEIKEHGSFDRLN--------SMETSVTSEPGSSIGAAIAASVTVP--- 359
+ L L + K+ +F L+ ++ ++ + A++A+ VP
Sbjct: 300 DFLKLNCQG-----KKLKTFTDLSWLGWLVGYTVPLDSKAKKSPKLAIAVSATTNVPRCN 354
Query: 360 -----PDSEGQVTFSLAWDCPEVNFMSGKTYY-RRYTKFYGTHQNAAANIA-RDAILEHG 412
S+ ++ F++ W P V F +G Y RRY +++ + + A + AI
Sbjct: 355 ITDNLNPSQSELEFAITWHSPIVKFRTGDVVYTRRYVRWFPVDRISGAKLLLAHAIDNWR 414
Query: 413 SWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFS 472
W +IE WQ PIL ++ LP WY LFNELYYL+ GG VW D P+
Sbjct: 415 QWVQKIEDWQNPILSNQSLPSWYKSALFNELYYLSDGGTVWLD---PI------------ 459
Query: 473 LDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQF 532
Q DL + V + + A R+S IG F
Sbjct: 460 ----QVDLTEDMKVDSWDHRA-----RLS-------------------------REIGLF 485
Query: 533 LYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWV 592
YLEG EY M+NTYDVH Y+S+ALI L+PKIQL++ D A + D + + + +G+ +
Sbjct: 486 GYLEGHEYRMYNTYDVHHYASWALIKLWPKIQLALNYDCADLTIAEDSTSVYYIGNGKAL 545
Query: 593 SRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKA 650
R AV HD G + PW NAY ++ T WKDLN KF+LQV+RD T D ++
Sbjct: 546 FRSSECAVVHDFGDPEDEPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLY 605
Query: 651 VWPSVY-VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA--- 706
+ P V + + +D D DG+IEN GFPDQTYD W+ G++AY+GG+W++ L A
Sbjct: 606 MLPIVLRILRKSLVAWDSDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYATFDM 665
Query: 707 --------SALAREVGDRGSE---------DYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
S + ++ + + D F KA+ Y KLW GSY+ +
Sbjct: 666 LSWCLKSDSPVYDQIVNNTDDTQRSWSEIKDEVQTLFTKARDSYNAKLWTGSYYAFQTYC 725
Query: 749 SSQSSSIQADQLAGQWYARACGLLP--IVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
+ + I ADQL+G W++R G+ P I+ ++ L+ + N N + G GA+NG
Sbjct: 726 TPRREVIMADQLSGYWFSRITGVPPNLILPKNHVVKTLQTISNCNWHGIKNGTIGAINGC 785
Query: 807 LPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQT 866
LP + D+SS+Q+ E W V Y++++ MI E + D G Y+ ++ GL Y QT
Sbjct: 786 LPVCKPDLSSVQAEEFWVAVNYSLSSLMIAEGMIDEGLALGEKCYDTIYNLYGLQY--QT 843
Query: 867 PEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
PEA+ +D ++R YMR LAIW++Q AL
Sbjct: 844 PEAYMSDGRFRCPGYMRALAIWSIQQAL 871
>gi|312382731|gb|EFR28085.1| hypothetical protein AND_04399 [Anopheles darlingi]
Length = 777
Score = 369 bits (947), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 245/788 (31%), Positives = 395/788 (50%), Gaps = 89/788 (11%)
Query: 153 PKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIP 212
PKTP +GSW+ L + +Y ALYPRAW+ ++ + +R+V RQISPIIP
Sbjct: 31 PKTP----------LGSWESALDASRCSYTALYPRAWSEYD-LSEYGVRLVQRQISPIIP 79
Query: 213 HNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHA 272
H+YKESS P +VF +T+ N +T+ FT+ N G + +
Sbjct: 80 HDYKESSMPCAVFVWTVENVCDRDRQVTITFTFKNGTGTKKQDAEGGSETGAFTQGSARG 139
Query: 273 VLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDR 332
V + + T+ L+ + + ++++ C F SG T + +W+++KEHG
Sbjct: 140 VSIRQTIADM--ACTYCLSCRSSSEINLTRCERFDPSG-----TGEKLWNDLKEHGHLTE 192
Query: 333 LNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMS-GKTYYRRYTK 391
+ ET T + + A++ + V P + ++ F+L WD P+V+F GK Y+R YTK
Sbjct: 193 QSVDETVKTKD----VAVAVSGQILVQPGTTSELQFALVWDMPKVHFQKRGKEYHRYYTK 248
Query: 392 FYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGA 451
++G +A ++ A+ +G WE I+ WQRPIL+D LP+WY +FNELY++ GG+
Sbjct: 249 YFGKTGDAGPTMSDYALTNYGKWERLIDEWQRPILDDADLPDWYKSAIFNELYFIADGGS 308
Query: 452 VW---TDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV------------PN-----QND 491
VW D P R L+ + + DV PN Q D
Sbjct: 309 VWLTMEDSDLPYDDPRLAYGRYSYLEGHEYRMYTTYDVHFYASHALASLWPNLQVSIQYD 368
Query: 492 TAVNILERMSSILEQIY----TPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
+I +S + +Y P + +S +L EE + + N Y
Sbjct: 369 YKDSIGREISEGRKHLYDGKVIPRKIQNSVPHDLGDPAEE-----------PFDLINAYP 417
Query: 548 VHFYSSFALIMLFPKIQLSIQRDFA-----AAVMMHDPSKMKLLDDGQWVSRKVLGAVPH 602
+H S + + + K L + RD+ A + + SK + +++ ++ + +
Sbjct: 418 IHDVSEWRDLNI--KFILQVYRDYYTLNHYAQLNAENASKFSSI---EFIDKESM----Y 468
Query: 603 DIGICDPWFEVNAYCLYDTARWKDLNPK----FVLQVYRDVVATGDKKFAKAVWPSVYVA 658
D I D V+ + K N K ++ + V + KA++P+ +
Sbjct: 469 DTYIQDNRHRVSTPPATNGTENKAANRKSASMYINEANGKVYLMDAMTYLKAMYPACKLV 528
Query: 659 MAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGS 718
+ ++D+D DG+IEN PDQTYD+W + G SAY GGLW+A+L + +A +
Sbjct: 529 LERSREWDKDDDGLIENSRSPDQTYDSWVMDGPSAYCGGLWLASLHCMATMASLLDQTED 588
Query: 719 EDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL-LPIVD 776
+ K + + EKLWNG+Y+ +D +S+ +SI +DQL G WY RACG +
Sbjct: 589 AANYQAILDKGRRSFEEKLWNGTYYKFDAQSASK-NSIMSDQLCGHWYLRACGFDYDVFP 647
Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLP------DGRVDMSSMQSREIWSGVTYAV 830
++ R A++ +Y NV++ GG GAVNG +P +GR D+S++Q E+W+GVTYA+
Sbjct: 648 KENVRLAMRTIYENNVMRFCGGNLGAVNGYVPNAQPNKEGRPDVSNIQGEEVWTGVTYAL 707
Query: 831 AASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
AA+MIHE + + FQTA G+Y + G+ F+TPEA + YR++ YMRPL+IW+M
Sbjct: 708 AATMIHEGMFEEAFQTAGGLYRSLSEKIGMN--FETPEAVYAERHYRAIGYMRPLSIWSM 765
Query: 891 Q--WALTR 896
Q W L +
Sbjct: 766 QTAWELRK 773
>gi|327288418|ref|XP_003228923.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Anolis
carolinensis]
Length = 667
Score = 362 bits (929), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 199/456 (43%), Positives = 257/456 (56%), Gaps = 56/456 (12%)
Query: 389 YTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNA 448
YT+F+G +A +A + + WE +IEAWQRPILED LP WY LFNELY+L+
Sbjct: 228 YTRFFGREGDACPALAHYMLTHYPEWEKRIEAWQRPILEDSDLPAWYKSALFNELYFLSD 287
Query: 449 GGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIY 508
GG +W + P D P
Sbjct: 288 GGTLWVEFQPE-------------------------DAP--------------------- 301
Query: 509 TPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQ 568
+L + G + L + G+F YLEG EY M+NTYDVHFY+SFAL ML+PK++LS+Q
Sbjct: 302 ---SLAAGQGLSGLLPVLKEYGRFAYLEGQEYRMYNTYDVHFYASFALAMLWPKLELSLQ 358
Query: 569 RDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKD 626
D A AV+ DP L +GQ K+ VPHDIG D PW VNAY ++DTA WKD
Sbjct: 359 YDMAVAVLSEDPRPRVYLMNGQTAQVKLRNVVPHDIGEPDEEPWQRVNAYLIHDTADWKD 418
Query: 627 LNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTW 686
LN KFVLQVYRD T D + + +WP M +FD DGDG+IEN GF DQTYD W
Sbjct: 419 LNLKFVLQVYRDFFLTEDCTYLRDMWPVCQAVMESELKFDTDGDGLIENSGFADQTYDAW 478
Query: 687 SVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYD 745
V+G SAY GGLW+AA+ +A +G+ + + +E+ LWNG Y+NYD
Sbjct: 479 VVTGASAYCGGLWLAAVCMMCRMAEILGEGAALQRYSAILSSGAAAFERLLWNGRYYNYD 538
Query: 746 NSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRG 801
+ G S+SI +DQLAGQW+ RACGL + ++ +SAL+ +Y NVL G G
Sbjct: 539 SGGGPSSNSIMSDQLAGQWFLRACGLGEGKYQVFPQEHVQSALRTIYEMNVLGFSEGAMG 598
Query: 802 AVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
AVNGM PDG D SS+QS E+W GV Y++AA+MI E
Sbjct: 599 AVNGMRPDGVPDTSSVQSDEVWVGVVYSLAATMIQE 634
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 84/226 (37%), Positives = 128/226 (56%), Gaps = 14/226 (6%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K+ FID F + L +G PLGG G G+I R ++G+F RWQ+ P + K V+
Sbjct: 7 RKTRLEKKQPFIDLFNSQPLRQIYGCPLGGFGGGTITRGWKGDFCRWQLNPGLYHYKRVV 66
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
A+QF+V + R Y VL + P L+ W+W+ G + YHALYPRAW+V
Sbjct: 67 ADQFTVCLRRKGQTAYQQVLSVEQPSCLQ--------GWNWSFCGHYAFYHALYPRAWSV 118
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P +L + CRQ++PIIPH+YKE+S PV VF + + N +++++FT+ N
Sbjct: 119 YQ-LPGQDLVLTCRQVTPIIPHDYKETSLPVGVFVWEVENLRDEEVEVSIMFTFQNGTET 177
Query: 252 DSEFTGQHYNSKTKMNDGVHA---VLLHHRTSHQLPPVTFALAAQE 294
+ G H+N + +G H VLLHH + T A+AA+E
Sbjct: 178 KEDRRGGHWNEPFALREGGHCVRGVLLHH--CLPVNGYTLAIAARE 221
>gi|428200465|ref|YP_007079054.1| putative bile acid beta-glucosidase [Pleurocapsa sp. PCC 7327]
gi|427977897|gb|AFY75497.1| putative bile acid beta-glucosidase [Pleurocapsa sp. PCC 7327]
Length = 808
Score = 362 bits (929), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 283/856 (33%), Positives = 400/856 (46%), Gaps = 143/856 (16%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W + + + A QFSVF G+ + +C +
Sbjct: 41 HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHVFRSLPACQFSVFEQPEGGEAQAYAMCTE 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
PE D T + W W K K TYHALYPR+W ++G EL VC Q SPI N
Sbjct: 101 PPE---DGT---LSRWLWYPK-KKGTYHALYPRSWYEYKGVFQSEL--VCEQFSPICAGN 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFTGQ 258
Y+E+SYPV++F +T +N ++++ TW N+VG G E+ Q
Sbjct: 152 YQETSYPVAIFEWTAHNPTDKPITLSIMLTWQNTVGWFSNAIKSPSIQVRDDGSPEYEYQ 211
Query: 259 --------HYNSKTKMNDGVHAVL----LHH-RTSHQLPPVTFALAAQETDGVHVSLCPH 305
++N N V +L LH + FA T + S
Sbjct: 212 PRWGDSTGNFNQWIVDNFRVGCLLNRIRLHEDEIGEGEGQICFASITNPT--IEASYLGR 269
Query: 306 FVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQ 365
+ +G+ ++W +GS L E +EPG I AA+A TV P +
Sbjct: 270 WNPNGD-----GSEVWDCFAANGS---LPDTEDETPAEPGEQIAAAMAIRFTVRPGKTRK 321
Query: 366 VTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPI 425
+ F LAWD P F G Y RRYT F+G A I R A L+HG + W+
Sbjct: 322 IPFILAWDLPITEFAEGVKYCRRYTDFFGRTGTNAWAIVRTA-LKHG------DMWK--- 371
Query: 426 LEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVD 485
+R+ EW QS + + D
Sbjct: 372 ---ERIKEW------------------------------------------QSPILKRED 386
Query: 486 VPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNT 545
+P+ A + ++Y +A + T +D + IGQF LE ++Y + +
Sbjct: 387 LPDWFKMA---------LFNELYL-LADGGTLWTAATED--DPIGQFGVLECMDYRWYES 434
Query: 546 YDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG 605
DV Y SF L+ML+ ++ ++ FA A+ D + + + RK GA PHD+G
Sbjct: 435 LDVRLYGSFGLMMLWSRLDKAVLEAFARAIPTGDDTPRIIGYNKAPAIRKAKGATPHDLG 494
Query: 606 ICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--DKKFAKAVWPSVYVAMAY 661
+ PW + N D WKDL FVLQVYRD V TG D F W S+ +AY
Sbjct: 495 APNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFVLTGAKDVAFLWECWQSIVETLAY 554
Query: 662 MDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREV-------G 714
+ +FDRDGDG+ EN G PDQT+D W + GISAY GGLW+AAL+AA A+ + +
Sbjct: 555 LKKFDRDGDGIPENSGAPDQTFDDWRLRGISAYCGGLWIAALEAAIAIGKILIENPPSNP 614
Query: 715 DRGSEDY----------FLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQ 763
D E+Y + Q A+ +Y EKLWNG Y++ D+ S S + ADQL GQ
Sbjct: 615 DLELENYPQSLQNTIQTYQNWLQGARSLYHEKLWNGEYYHLDS--ESGSDIVMADQLCGQ 672
Query: 764 WYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRV-DMSSMQSREI 822
+YAR GL +V+ A SALKKVY LK GK GA NG+ PDG D + E+
Sbjct: 673 FYARLLGLPDVVERQYAESALKKVYEACFLKFQDGKFGAANGVKPDGSPEDPKATHPLEV 732
Query: 823 WSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYM 882
W+G+ + +AA ++ + D F+ A + + G F+TPEA +R+ Y+
Sbjct: 733 WTGINFGLAAFLLQMGMKDEAFKLAETVVRQVYDN---GLQFRTPEAITAVGTFRASHYL 789
Query: 883 RPLAIWAMQWALTRPK 898
R +AIW + LT K
Sbjct: 790 RAMAIWGIYGVLTEFK 805
>gi|148670516|gb|EDL02463.1| glucosidase beta 2, isoform CRA_c [Mus musculus]
Length = 467
Score = 356 bits (914), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 200/467 (42%), Positives = 262/467 (56%), Gaps = 59/467 (12%)
Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPN 488
+ LP WY LFNELY+L GG VW ++VP
Sbjct: 10 RTLPAWYKSALFNELYFLADGGTVW------------------------------LEVP- 38
Query: 489 QNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDV 548
A ++ E + + Q+ + + ++ G+F YLEG EY M+NTYDV
Sbjct: 39 ----ADSLPEGLGGSMRQLRSTL---------------QDYGRFGYLEGQEYRMYNTYDV 79
Query: 549 HFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD 608
HFY+SFAL+ML+PK++LS+Q D A A + D ++ + L G K +PHDIG D
Sbjct: 80 HFYASFALVMLWPKLELSLQYDMALATLKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDPD 139
Query: 609 --PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFD 666
PW VNAY ++DTA WKDLN KFVLQ+YRD TGD+ F + +WP M +FD
Sbjct: 140 DEPWLRVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMKFD 199
Query: 667 RDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKF 726
+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G + ++ F
Sbjct: 200 KDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFASIL 259
Query: 727 QKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKAR 781
+ + YE+ LWNG Y+NYD+S QS SI +DQ AGQW+ RACGL +
Sbjct: 260 CRGREAYERLLWNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTEVFPTLHVV 319
Query: 782 SALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLAD 841
AL+ ++ NV GG GAVNGM P G D SS+QS E+W GV Y +AA+MI E L
Sbjct: 320 RALQTIFELNVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTW 379
Query: 842 IGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
GF+TA G Y W GL AFQTPEA+ +RSL YMRPL+IW
Sbjct: 380 EGFRTAEGCYRTVWERLGL--AFQTPEAYCQQQVFRSLAYMRPLSIW 424
>gi|428223619|ref|YP_007107716.1| glucosylceramidase [Geitlerinema sp. PCC 7407]
gi|427983520|gb|AFY64664.1| Glucosylceramidase [Geitlerinema sp. PCC 7407]
Length = 821
Score = 354 bits (909), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 276/865 (31%), Positives = 387/865 (44%), Gaps = 160/865 (18%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G +GRS RG+F W I + + A QFSVF ++G + LC +
Sbjct: 39 HGMPLGGFGAGCLGRSPRGDFNLWHIDGGEHVFQSMPACQFSVFEQGADGTSQAYALCSE 98
Query: 155 TPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
P A +G W W + TY ALYPR+ +EG R+ C Q SPI
Sbjct: 99 AP-------AESLGQWQWYPASMAERSTGTYSALYPRSRFDYEGVF--RARLACEQFSPI 149
Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMND-- 268
NY+ESSYPV+VF +T +N +++L TW N VG FT + + K+ D
Sbjct: 150 WADNYQESSYPVAVFEWTAHNPTDQLLTLSILLTWQNMVGW---FTNALKSPEVKVRDDG 206
Query: 269 -----------------------GVHAVLLHHRTSHQLPPV----TFALAAQ-ETDGVHV 300
G L R P +A+A Q E G +
Sbjct: 207 SPVYEYQPRLGESEGNFNQWIAEGAWVGCLMDRAGRAEEPQEGEGQWAIATQCEGPGTEI 266
Query: 301 SLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSS---IGAAIAASVT 357
+ +G+ ++W +F + ++E + P IG AIA T
Sbjct: 267 FYHTRWNPAGD-----GGELWR------TFAQTGALENRIDGTPAQGEEQIGGAIALRFT 315
Query: 358 VPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQ 417
+ P +V F LAWD P F +G YRRYT F+G IAR A+ E+
Sbjct: 316 LAPGETRRVPFVLAWDFPVTEFAAGTAAYRRYTDFFGRSGRNGWAIARTALEEY------ 369
Query: 418 IEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQ 477
AWQ I W Q
Sbjct: 370 -SAWQARIAA-----------------------------------------------WQQ 381
Query: 478 SDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEG 537
L R D+PN A + ++Y + + + +D +GQF LE
Sbjct: 382 PILDR-ADLPNWFKMA---------LFNELYDLTSGGTLWSAASDRD---PVGQFAVLEC 428
Query: 538 IEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLD-------DGQ 590
I+Y + + DV Y SF L+ML+P+++ S+ R FA A+ D + +++ +
Sbjct: 429 IDYRWYESLDVRLYGSFGLLMLWPELEKSVMRAFARAIPTAD-DRQRVIGYYYTQGTESP 487
Query: 591 WVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--DKK 646
RK L A PHD+G + PW + N D WKDL FVLQVYRD TG D
Sbjct: 488 LALRKALDATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFRLTGSTDDA 547
Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
F WP+V A+AY+ FD+DGDG+ EN G PDQT+D W + G+SAY GGLW+AAL+AA
Sbjct: 548 FIAECWPAVTRALAYLKAFDQDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWIAALEAA 607
Query: 707 SALAREVGDRGSED---------------YFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQ 751
AL + D Y + Q V + LWNG Y+ D+ S
Sbjct: 608 IALGERLQQHPVSDHPELSQGAIAHHLSTYRTWLAQSRAVYDDALWNGRYYRLDS--GSG 665
Query: 752 SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
S + ADQL GQ+YAR L +V D+A SALK VY+ L+ GK GA NG+ PDG
Sbjct: 666 SEVVMADQLCGQFYARLLDLPDVVASDRALSALKTVYDSCFLRFHDGKFGAANGVKPDGS 725
Query: 812 VDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAW 870
+ + E+W+G+ + +AA ++ L D GF+ + E + G F+TPEA
Sbjct: 726 PENPKATHPLEVWTGINFGIAAFLVQMGLRDEGFRLTQAVVEQIYE---HGLQFRTPEAI 782
Query: 871 NTDDQYRSLCYMRPLAIWAMQWALT 895
+R+ Y+RP+AIWA+ LT
Sbjct: 783 TAVGTFRACHYLRPMAIWAVYGVLT 807
>gi|326934726|ref|XP_003213436.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Meleagris gallopavo]
Length = 639
Score = 351 bits (901), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 204/413 (49%), Positives = 251/413 (60%), Gaps = 21/413 (5%)
Query: 508 YTPVALNSSFGTNLLQDG--EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQL 565
+ P +LLQDG E IGQ EY M+NTYDVHFY+SFAL+ML+PK+Q+
Sbjct: 233 FDPAGSGRDVWQDLLQDGKLESPIGQ-------EYRMYNTYDVHFYASFALVMLWPKLQI 285
Query: 566 SIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTAR 623
S+Q D A V+ D + L G+ K+ VPHDIG D PW VNAY ++DTA
Sbjct: 286 SLQYDIAVTVLNEDMQPRQYLVCGKTAQVKLKNVVPHDIGDPDDEPWQRVNAYLMHDTAN 345
Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTY 683
WKDLN KFVLQVYRD T D + + +WP M +FD D DG+IEN G DQTY
Sbjct: 346 WKDLNLKFVLQVYRDYYLTHDALYLQDMWPVCQAVMESELKFDTDNDGLIENGGTADQTY 405
Query: 684 DTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYF 742
D W V+G SAY GGLW+AA++ +A +GD ++ + QK K +E+ LWNG Y+
Sbjct: 406 DAWVVNGASAYCGGLWLAAVRMMCEMAEVLGDAETQQKYGAILQKGKESFERLLWNGKYY 465
Query: 743 NYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGG 798
NYD+SGSS SSSI +DQ AGQW+ ACGL L + + SALK ++ NV+ GG
Sbjct: 466 NYDSSGSSTSSSIMSDQCAGQWFLGACGLDQKELEVFPKSHIVSALKTIFEKNVMSFAGG 525
Query: 799 KRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGT 858
GAVNGM PDG D SS+QS E+W GV YA+AA+MI E L GF TA G Y W
Sbjct: 526 TMGAVNGMRPDGVPDTSSVQSSEVWVGVVYALAATMIQEGLVQEGFHTAEGCYRTVWE-- 583
Query: 859 GLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL---TRPKPKTLEKQMKP 908
LG AFQTPEA+ YRSL YMRPL+IW+MQ AL TR P Q P
Sbjct: 584 NLGMAFQTPEAYCEKKVYRSLAYMRPLSIWSMQLALESKTRRPPSPTPTQKNP 636
Score = 156 bits (395), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 95/256 (37%), Positives = 139/256 (54%), Gaps = 19/256 (7%)
Query: 79 KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
K AFID L +G PLGG+G G+I R +RGEF RWQ+ P + V+ +QF+V
Sbjct: 14 KSAFIDLLCAVPLQQIYGCPLGGIGGGTITRGWRGEFCRWQLNPGKYHYETVITDQFTVC 73
Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
+ R Y VL + P L+ W+W G + YHALYPRAW V+E P
Sbjct: 74 LRRKGQTVYQQVLSVEKPSALQ--------GWNWGYCGRYAFYHALYPRAWMVYE-LPGQ 124
Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
+ + CRQISP+IPH+YK+SS PV VF + + N+ + D++++FT N G + +G
Sbjct: 125 NVVLTCRQISPVIPHDYKDSSLPVGVFIWEVENNSEEPVDVSIMFTLQNGTGTKGDGSGG 184
Query: 259 HYNSKTKMNDG---VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGL 315
H+N + DG V VLLHH T + P TFA+AA G ++ F +G
Sbjct: 185 HWNEPFALQDGGERVTGVLLHHCTP--INPFTFAVAAXXQAGTVITHLTAFDPAG----- 237
Query: 316 TAKDMWHEIKEHGSFD 331
+ +D+W ++ + G +
Sbjct: 238 SGRDVWQDLLQDGKLE 253
>gi|443710865|gb|ELU04898.1| hypothetical protein CAPTEDRAFT_216398 [Capitella teleta]
Length = 449
Score = 341 bits (874), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 204/541 (37%), Positives = 289/541 (53%), Gaps = 109/541 (20%)
Query: 375 PEVNFMSGKT-YYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
P ++F + + Y+RRY +F+G A +A A+ SWE +I WQ+P L+ +LP
Sbjct: 2 PIIHFGAKRNKYHRRYARFFGHDGCATPRLAAHALNHFTSWEEKIAEWQQPTLKANKLPA 61
Query: 434 WYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTA 493
WY LFNE Y+++ GG+VW D +DV +
Sbjct: 62 WYKSALFNETYFISDGGSVWVDS---------------------------IDVKD----- 89
Query: 494 VNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSS 553
++ E + FG +G E Y M+NT DVH+Y+S
Sbjct: 90 --------AVSEHPFV-----QEFGKFAYLEGHE------------YRMYNTLDVHYYAS 124
Query: 554 FALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEV 613
FAL+ L+P +QLS+Q D +D+ +PW +
Sbjct: 125 FALMKLWPNLQLSLQYD---------------IDN-------------------EPWSRL 150
Query: 614 NAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMI 673
NAY ++ T WKDLNPKFVLQVYRD V T D+++ ++P M + ++D D DG+I
Sbjct: 151 NAYIIHPTNEWKDLNPKFVLQVYRDYVFTKDEQYLYEMYPQCKAVMDHSLRWDVDRDGII 210
Query: 674 ENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK----- 728
+N GF DQTYD W+V+G SAY GGLW+AA++A + +A +G ED L K+++
Sbjct: 211 DNSGFADQTYDAWTVTGASAYCGGLWLAAVKAFTEMAHRLG--VVED--LMKYEEILERG 266
Query: 729 AKVVYEKLWNG----SYFNYDNSGSSQSSSIQADQLAGQWYARACGLL--PIVDEDKARS 782
K EKLWNG Y+NYD S S SI ADQ+ GQW+ +ACG+ + +
Sbjct: 267 KKNFEEKLWNGWVDCEYYNYDCSESGHHDSIMADQMCGQWFLKACGVADNAVFPSANVKR 326
Query: 783 ALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADI 842
AL+ V+ +NVL GG+ GA+NGM P+ + D++S QS E W+GVTYA+ A+MI + D
Sbjct: 327 ALEAVFKHNVLPFDGGRMGAINGMRPNAKKDVTSCQSDEFWTGVTYALGATMIQVGMIDK 386
Query: 843 GFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTL 902
GFQTA G Y W GL AFQTPEA+ + ++RSL YMRPLAIWA+Q A+ + P+
Sbjct: 387 GFQTAYGAYHTCWERYGL--AFQTPEAYFDNRRFRSLGYMRPLAIWAIQHAVEKFHPQFF 444
Query: 903 E 903
E
Sbjct: 445 E 445
>gi|449275451|gb|EMC84314.1| Non-lysosomal glucosylceramidase [Columba livia]
Length = 692
Score = 337 bits (864), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 185/365 (50%), Positives = 233/365 (63%), Gaps = 7/365 (1%)
Query: 535 LEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSR 594
L G EY M+NTYDVHFY+SFALIML+PK+Q+S+Q D A AV+ D + L GQ
Sbjct: 314 LPGQEYRMYNTYDVHFYASFALIMLWPKLQISLQYDIAVAVVNEDVQPRQYLMGGQTAPV 373
Query: 595 KVLGAVPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVW 652
K VPHDIG +PW VNAY ++DTA WKDLN KFVLQVYRD T D + + +W
Sbjct: 374 KTKNVVPHDIGDPADEPWQRVNAYLIHDTANWKDLNLKFVLQVYRDYYLTHDSLYLRDMW 433
Query: 653 PSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALARE 712
P M +FD D DG+IEN GF DQTYD W V+G SAY GGLW+AA+ +A
Sbjct: 434 PVCQAVMESELKFDMDNDGLIENGGFADQTYDAWVVNGASAYCGGLWLAAVCMMCKMAEV 493
Query: 713 VGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL 771
+GD + ++ +K K +E+ LWNG Y+NYD+SGS+ SSSI +DQ AG + +
Sbjct: 494 LGDAEIQQKYMDILRKGKETFERLLWNGKYYNYDSSGSNTSSSIMSDQCAG--HPKPPLS 551
Query: 772 LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVA 831
+ + + SALK ++ NV+ GG GAVNGM P G D SS+QS E+W GV YA+A
Sbjct: 552 MQVFPKSHVVSALKTIFEKNVMGFAGGTMGAVNGMRPSGVPDTSSVQSNEVWVGVVYALA 611
Query: 832 ASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQ 891
A+MI E L + GF+TA G Y W LG AFQTPEA+ YRSL YMRPL+IW+MQ
Sbjct: 612 ATMIQEGLVEEGFRTAEGCYRTVWE--QLGMAFQTPEAYREKKVYRSLAYMRPLSIWSMQ 669
Query: 892 WALTR 896
AL R
Sbjct: 670 LALER 674
Score = 121 bits (304), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 109/202 (53%), Gaps = 21/202 (10%)
Query: 134 QFSVFVSRSNGQK-YSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVH 192
QF+V + R GQ Y VL + P L+ W+W G + YHALYPRAW V+
Sbjct: 6 QFTVCL-RCKGQTVYQQVLSVERPSTLQ--------GWNWGYCGHYAFYHALYPRAWMVY 56
Query: 193 EGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGD 252
E P + + CRQISP+IPH+YK+SS PV VF + + N D++++F+ N G
Sbjct: 57 E-LPGQNVVLTCRQISPVIPHDYKDSSLPVGVFIWEVENGRDEDVDVSIMFSLQNGTGTK 115
Query: 253 SEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVIS 309
+ +G H+N K + V VLLHH T + P TFA++A+E GV V+ F +
Sbjct: 116 EDRSGGHWNEPFTFEKEGERVAGVLLHHCT--HVNPFTFAISAREKAGVVVTHLTAFNPA 173
Query: 310 GNSLGLTAKDMWHEIKEHGSFD 331
G + +++W ++ + G D
Sbjct: 174 G-----SGREVWQDLLQDGQLD 190
>gi|328869576|gb|EGG17953.1| hypothetical protein DFA_06619 [Dictyostelium fasciculatum]
Length = 1200
Score = 337 bits (863), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 177/399 (44%), Positives = 246/399 (61%), Gaps = 30/399 (7%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMM-HDPSKMK 584
++ IGQF YLE EYLM+NTYDVHFY+SFAL L+P ++ S+Q A A M + ++
Sbjct: 790 DKEIGQFAYLESQEYLMYNTYDVHFYASFALATLWPSLEFSLQSGIADATMEDYGGETVE 849
Query: 585 LLDDGQWVSRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVAT 642
+ G+ + RK+ G VPHDIG DPW VNAY + D +RWKDL KF+LQVYRD + T
Sbjct: 850 CIHSGKQIPRKLRGTVPHDIGNPGEDPWKRVNAYNIQDISRWKDLPCKFILQVYRDYLLT 909
Query: 643 ------------------------GDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGF 678
GD+ F +W +V + +FD D DG+I+N+GF
Sbjct: 910 SVDSSGQKSNATSSSFGQEYIPLDGDRGFLLQMWGTVEEVIRKAFEFDTDDDGVIDNEGF 969
Query: 679 PDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKLW 737
PDQTYDTWS SG SAY+GGLW+AA++A SA+AR +G R E+ + F+K K Y +KLW
Sbjct: 970 PDQTYDTWSASGCSAYTGGLWLAAIKATSAMARILGLRDDEEVYSKLFEKGKKSYNKKLW 1029
Query: 738 NGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMG 797
NG YFNYD+S S +SI ADQLAG WY +CGL + D+A S L + YN+
Sbjct: 1030 NGHYFNYDSSKQSHYNSIMADQLAGHWYLTSCGLSSYMTLDQALSTLSIINEYNIKSYSN 1089
Query: 798 GKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSG 857
G GAVNG+ P VD + +QS E+W G +Y++A++ + + + G+ +++
Sbjct: 1090 GSCGAVNGISPLAIVDQTCLQSSEVWIGTSYSLASTFLLHYMDKEAWSLIKGLVNSSYE- 1148
Query: 858 TGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
G+ +QTPEAW+ + +R+ YMRPLAIW++QWAL+R
Sbjct: 1149 -KWGFQYQTPEAWDMNGSFRASTYMRPLAIWSVQWALSR 1186
>gi|254410084|ref|ZP_05023864.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196183120|gb|EDX78104.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 814
Score = 335 bits (860), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 268/864 (31%), Positives = 385/864 (44%), Gaps = 153/864 (17%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G +GRS RG+F W + K + A QFSVF N + LC +
Sbjct: 41 HGMPLGGFGAGCLGRSPRGDFNLWHLDGGEHTFKRLPACQFSVFEQPENQDAQAYALCTE 100
Query: 155 TPEVLKDTTAAGIGSWDW-----NLKGDKS------TYHALYPRAWTVHEGEPDPELRIV 203
PE +G W W N++ TYHALYPR+W V+E EL
Sbjct: 101 PPE------DGSLGKWQWYPTSPNVETRHGASLRTGTYHALYPRSWFVYENVFQTEL--T 152
Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
C+Q SPI Y+ESSYPV+VF +T +N ++++ TW N VG FT + +
Sbjct: 153 CQQFSPIWADCYQESSYPVAVFEWTAHNPTDQPITLSIMLTWQNMVGW---FTNAIQHPQ 209
Query: 264 TKMNDGVHAVLLHH------------------RTSHQLPPVTFALAAQETDGVHVSLCPH 305
++ D V + R L V +E +G LC
Sbjct: 210 VQVRDDGSPVYEYQPRWGNSTGNYNQWIVDNFRVGCLLDRVRLRDEIEEGEG---QLCFA 266
Query: 306 FVISG----------NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAAS 355
V + N G ++W+ GS L+ ++ +EPG I AIA
Sbjct: 267 TVTNPSQEVFYHCRWNPTG-DGSEVWNTFASDGS---LSDVQDETPAEPGEQIAGAIAVR 322
Query: 356 VTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWE 415
V + P ++ F LAWD P F SG YYRRYT F+G N A ++ R A L+H
Sbjct: 323 VRLRPGRTRKIPFILAWDFPVTEFASGVKYYRRYTDFFGRTGNNAWSMVRTA-LKHS--- 378
Query: 416 LQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDW 475
+ WQ IL W
Sbjct: 379 ---DLWQERILS-----------------------------------------------W 388
Query: 476 SQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEEN-IGQFLY 534
L+R D+PN A ++ + L S+ GT E + +G+F
Sbjct: 389 QNPILQR-HDLPNWFKMA-------------LFNELYLLSAGGTIWSSANERDPVGKFGV 434
Query: 535 LEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSR 594
LE +Y + + DV Y SFAL ML+ ++ ++ FA A+ D + + + R
Sbjct: 435 LESFDYRWYESLDVRLYGSFALAMLWSRLDKAVLEAFARAIPTSDDTPRIIGYNQATAIR 494
Query: 595 KVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDK--KFAKA 650
K A PHD+G + PW + N D +WKDL FVLQVYR V TG K +F
Sbjct: 495 KAADATPHDLGAANEHPWEKTNYTSYQDCNQWKDLPCDFVLQVYRTFVFTGSKDIEFLAE 554
Query: 651 VWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA--------- 701
W ++ A+AY+ FD DGDG+ EN G PDQT+D W + G+SAY GGLW+A
Sbjct: 555 CWSAIVRALAYLKSFDHDGDGIPENSGAPDQTFDDWQLQGVSAYCGGLWLAALEAASAIA 614
Query: 702 -------ALQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSS 753
L +A L + + + + Q+A+ +Y EKLWNG Y+ D+ S S
Sbjct: 615 QILLDNYTLVSAINLPDPESIQHTLNTYHKWLQQAQPIYQEKLWNGQYYRLDS--DSDSD 672
Query: 754 SIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVD 813
+ ADQL GQ+YA+ GL IV + +SALK VY LK + GA NG+ PDG +
Sbjct: 673 VVMADQLCGQFYAQLLGLPDIVPPECTQSALKTVYESCFLKFHNAQFGAANGVKPDGSPE 732
Query: 814 M-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNT 872
+ E+W+G+ + +AA +I + D + + + G F+TPEA
Sbjct: 733 HPDATHPLEVWTGINFGLAAFLIQLGMKDEALKLTESVVRQVYEN---GLQFRTPEAITP 789
Query: 873 DDQYRSLCYMRPLAIWAMQWALTR 896
+R+ Y+R +AIWA+ L++
Sbjct: 790 VGTFRAGHYLRAMAIWAVYLTLSQ 813
>gi|156554493|ref|XP_001604835.1| PREDICTED: non-lysosomal glucosylceramidase-like [Nasonia
vitripennis]
Length = 826
Score = 332 bits (850), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 171/386 (44%), Positives = 239/386 (61%), Gaps = 8/386 (2%)
Query: 519 TNLLQ--DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVM 576
+N LQ D G+F YLEG EY M+NTYDVHFY++FAL L+P ++ IQ +F +V
Sbjct: 422 SNELQPDDPRRRYGRFAYLEGHEYRMYNTYDVHFYAAFALAQLWPNLEACIQYEFRDSVH 481
Query: 577 MHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQ 634
D +++ L DG RKV G+VPHDIG +P+ +NAY ++D ++W+DLN KFVL
Sbjct: 482 AEDQQRLQGLYDGSKRHRKVKGSVPHDIGDPGEEPFDLINAYPIHDVSQWRDLNSKFVLS 541
Query: 635 VYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAY 694
YR + + + W ++ + + + FD D DG+IEN GFPDQTYD W +SG SAY
Sbjct: 542 CYRIYFFNRNLEQLRDFWSTIKLVLEHSLTFDEDNDGLIENGGFPDQTYDCWVMSGPSAY 601
Query: 695 SGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSS 753
GGLW+AAL A +A +G+ E + + KV ++ KLWNG Y+N+D
Sbjct: 602 CGGLWIAALHCAVKMAELLGESEDEARYKGILDRGKVAFQDKLWNGKYYNFDCGKDESKL 661
Query: 754 SIQADQLAGQWYARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRV 812
SI +DQL G W RACG + +D+ RS+L+ V+ NV+K G++GAVNG P G +
Sbjct: 662 SIMSDQLCGHWLLRACGFSYEVFPQDRVRSSLETVFQNNVMKYKNGQQGAVNGFSPTGSI 721
Query: 813 DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNT 872
D S +QS E+W+GV Y +AA +IHE + + F+TA G+Y + G+G F+TPEA
Sbjct: 722 DYSCIQSEEMWTGVAYGLAALLIHEGMIEEAFRTAEGVYRTVYEKIGMG--FETPEALYE 779
Query: 873 DDQYRSLCYMRPLAIWAMQWALTRPK 898
YR++ YMRPL+IWAMQ A T K
Sbjct: 780 HKVYRAIGYMRPLSIWAMQHAWTMRK 805
Score = 222 bits (566), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 139/405 (34%), Positives = 220/405 (54%), Gaps = 24/405 (5%)
Query: 53 NWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYR 112
+K+T+Q+ P+ VR + + G+ +D F RH +GVP+GG+G GSIGR +R
Sbjct: 32 KFKQTIQMMPLAVRYVFYYTNVSRYGRPVLMDYFNMRHGQQIYGVPIGGLGGGSIGRGFR 91
Query: 113 GEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQK-YSSVLCPKT-PEVLKDTTAAGIGSW 170
GEF R+ + P + + ANQF + V NG Y+ VL KT P+ L+ SW
Sbjct: 92 GEFCRYALLPGLYSYHVIPANQFILTVRDVNGNTIYNQVLSTKTKPKHLR--------SW 143
Query: 171 DWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIY 230
W +G+K++Y LYPR+WT +E E + EL++ CRQ+SP+IPH+YK+SS P +VF + +
Sbjct: 144 KWGFEGNKASYTGLYPRSWTTYEVE-ELELKLTCRQVSPVIPHDYKDSSLPCAVFVWDVV 202
Query: 231 NSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFAL 290
N I + FT+ + + G + ++++ ++H P T+A+
Sbjct: 203 NRSNRDLRIAITFTFQSGCASKEDAQGDKWTEFFELDES-RGTMIHQEFRGM--PCTYAI 259
Query: 291 AAQ-ETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIG 349
+++ D V V+ F GN L W ++ G FD S E ++S+ I
Sbjct: 260 SSRTRPDDVGVTRMIGFDPRGNGAVL-----WQRLQNAGRFD--ESAEPRISSKTRKPIA 312
Query: 350 AAIAASVTVPPDSEGQVTFSLAWDCPEVNFM-SGKTYYRRYTKFYGTHQNAAANIARDAI 408
A+ A V S G + FSLAW P++ F K + R YT+F+ + + A I R A+
Sbjct: 313 TAVCADTLVKAGSSGFLEFSLAWHMPKIQFFFKKKDHLRYYTQFF-SGEKAGPEICRYAL 371
Query: 409 LEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVW 453
++ WE +IE WQ+ +LED LP+WY +FNELY+++ GG++W
Sbjct: 372 SQYERWEDEIERWQKSVLEDPDLPDWYKSAIFNELYFVSDGGSLW 416
>gi|345289099|gb|AEN81041.1| AT1G33700-like protein, partial [Capsella rubella]
Length = 191
Score = 332 bits (850), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 152/190 (80%), Positives = 173/190 (91%)
Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL 586
ENIGQFLYLEG +YLM+NTYDVHFYSSFAL+MLFPK++LSIQRDFAAAV+MHD S+ +++
Sbjct: 2 ENIGQFLYLEGTQYLMYNTYDVHFYSSFALLMLFPKLELSIQRDFAAAVLMHDSSRKQVM 61
Query: 587 DDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKK 646
G++V+RKVLGAVPHDIG+ DPWFEVNAY L++T RWKDLN KFVLQVYRDVVATGD
Sbjct: 62 SSGEFVTRKVLGAVPHDIGLNDPWFEVNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLN 121
Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
FAKAVWPSVY A+AY+DQFD+DGDGMIENDGFPDQTYD WS SG+SAY GGLWVAALQA
Sbjct: 122 FAKAVWPSVYTAIAYLDQFDKDGDGMIENDGFPDQTYDAWSCSGVSAYCGGLWVAALQAG 181
Query: 707 SALAREVGDR 716
SALARE+GD
Sbjct: 182 SALAREIGDN 191
>gi|390344694|ref|XP_001200450.2| PREDICTED: uncharacterized protein LOC764213 [Strongylocentrotus
purpuratus]
Length = 1283
Score = 331 bits (848), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 194/558 (34%), Positives = 286/558 (51%), Gaps = 78/558 (13%)
Query: 49 QFTLNWKETVQLAPIGVRILCLIREEAAK--------GKRAFIDPFIKRHLTSSHGVPLG 100
+F+ W+ + P ++ L A+ GKR FID +GVP+G
Sbjct: 99 KFSEKWRPLTMVRPHQIKSFVLPSVRYAQMHLKMWRGGKRPFIDVLHPMKFKPVYGVPVG 158
Query: 101 GVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLK 160
G+G G+I R ++G+F RW + P V NQF+V V R Y VL P P K
Sbjct: 159 GIGCGAIDRGWKGDFCRWSLSPGRYTYDNVELNQFTVCVRRGGKTVYQQVLSPNKPHN-K 217
Query: 161 DTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSY 220
D A W+W G++++YH LYPRAWT + P + +VCRQISP+ PHNY+E+S
Sbjct: 218 DLAQA----WEWQFPGNRASYHGLYPRAWTKYL-LPGQNITLVCRQISPVFPHNYQETSL 272
Query: 221 PVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTS 280
PV VF + +YN G +AD++++FTW N G ++ G +N+ + V+ H
Sbjct: 273 PVGVFVWDVYNHGTAAADVSIMFTWKNGTGTSADKKGGRWNTGFS-SASTSGVMFHD--G 329
Query: 281 HQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSME-TS 339
H P T +AA + D V V+ C F S K++W+++KE G +LN+ + +
Sbjct: 330 HPTQPCTMCIAAADGDDVTVTTCASFNPSSQ-----LKELWNDLKEDG---QLNTTKGPT 381
Query: 340 VTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNA 399
+E G ++ A+ +S + P+ +G + F+L+WD P+V F S +Y+RRYT+++G ++
Sbjct: 382 ERTEKGETLAGAVVSSRHIEPNQQGLLEFALSWDMPKVQFDSC-SYFRRYTRWFGRDGDS 440
Query: 400 AANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPP 459
A + A+ E+G WE I WQ PILE+ RLP WY LFNELY++ GG +W D
Sbjct: 441 ALALCSHALEEYGDWEKNINEWQNPILENDRLPAWYKSALFNELYFMTDGGGIWVDVDD- 499
Query: 460 VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGT 519
+ PN + N R SS++ +
Sbjct: 500 -------------------------EYPNGEN---NAHTRASSLVRE------------- 518
Query: 520 NLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHD 579
G+F YLEG EY M+NTYDVHFY+SFALIML+PK++LS+Q DF V D
Sbjct: 519 ---------YGRFGYLEGHEYRMYNTYDVHFYASFALIMLWPKLELSMQYDFGEYVDHKD 569
Query: 580 PSKMKLLDDGQWVSRKVL 597
+ L G R+VL
Sbjct: 570 EDSFQDLSSGLRSIRRVL 587
>gi|115633999|ref|XP_780575.2| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Strongylocentrotus purpuratus]
Length = 597
Score = 330 bits (847), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 194/558 (34%), Positives = 286/558 (51%), Gaps = 78/558 (13%)
Query: 49 QFTLNWKETVQLAPIGVRILCLIREEAAK--------GKRAFIDPFIKRHLTSSHGVPLG 100
+F+ W+ + P ++ L A+ GKR FID +GVP+G
Sbjct: 99 KFSEKWRPLTMVRPHQIKSFVLPSVRYAQMHLKMWRGGKRPFIDVLHPMKFKPVYGVPVG 158
Query: 101 GVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLK 160
G+G G+I R ++G+F RW + P V NQF+V V R Y VL P P K
Sbjct: 159 GIGCGAIDRGWKGDFCRWSLSPGRYTYDNVELNQFTVCVRRGGKTVYQQVLSPNKPHN-K 217
Query: 161 DTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSY 220
D A W+W G++++YH LYPRAWT + P + +VCRQISP+ PHNY+E+S
Sbjct: 218 DLAQA----WEWQFPGNRASYHGLYPRAWTKYL-LPGQNITLVCRQISPVFPHNYQETSL 272
Query: 221 PVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTS 280
PV VF + +YN G +AD++++FTW N G ++ G +N+ + V+ H
Sbjct: 273 PVGVFVWDVYNHGTAAADVSIMFTWKNGTGTSADKKGGRWNTGFS-SASTSGVMFH--DG 329
Query: 281 HQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSME-TS 339
H P T +AA + D V V+ C F S K++W+++KE G +LN+ + +
Sbjct: 330 HPTQPCTMCIAAADGDDVTVTTCASFNPSSQ-----LKELWNDLKEDG---QLNTTKGPT 381
Query: 340 VTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNA 399
+E G ++ A+ +S + P+ +G + F+L+WD P+V F S +Y+RRYT+++G ++
Sbjct: 382 ERTEKGETLAGAVVSSRHIEPNQQGLLEFALSWDMPKVQFDSC-SYFRRYTRWFGRDGDS 440
Query: 400 AANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPP 459
A + A+ E+G WE I WQ PILE+ RLP WY LFNELY++ GG +W D
Sbjct: 441 ALALCSHALEEYGDWEKNINEWQNPILENDRLPAWYKSALFNELYFMTDGGGIWVDVDD- 499
Query: 460 VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGT 519
+ PN + N R SS++ +
Sbjct: 500 -------------------------EYPNGEN---NAHTRASSLVRE------------- 518
Query: 520 NLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHD 579
G+F YLEG EY M+NTYDVHFY+SFALIML+PK++LS+Q DF V D
Sbjct: 519 ---------YGRFGYLEGHEYRMYNTYDVHFYASFALIMLWPKLELSMQYDFGEYVDHKD 569
Query: 580 PSKMKLLDDGQWVSRKVL 597
+ L G R+VL
Sbjct: 570 EDSFQDLSSGLRSIRRVL 587
>gi|345289089|gb|AEN81036.1| AT1G33700-like protein, partial [Capsella rubella]
gi|345289091|gb|AEN81037.1| AT1G33700-like protein, partial [Capsella rubella]
gi|345289093|gb|AEN81038.1| AT1G33700-like protein, partial [Capsella rubella]
gi|345289095|gb|AEN81039.1| AT1G33700-like protein, partial [Capsella rubella]
gi|345289097|gb|AEN81040.1| AT1G33700-like protein, partial [Capsella rubella]
gi|345289101|gb|AEN81042.1| AT1G33700-like protein, partial [Capsella rubella]
gi|345289103|gb|AEN81043.1| AT1G33700-like protein, partial [Capsella rubella]
Length = 191
Score = 330 bits (846), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 151/190 (79%), Positives = 173/190 (91%)
Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL 586
ENIGQFLYLEG +YLM+NTYDVHFYSSFAL+MLFPK++LSIQRDFAAAV+MHD S+ +++
Sbjct: 2 ENIGQFLYLEGTQYLMYNTYDVHFYSSFALLMLFPKLELSIQRDFAAAVLMHDSSRKQVM 61
Query: 587 DDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKK 646
G++V+RKVLGAVPHDIG+ DPWFEVNAY L++T RWKDLN KFVLQVYRDVVATGD
Sbjct: 62 SSGEFVTRKVLGAVPHDIGLNDPWFEVNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLN 121
Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
FAKAVWPSVY A+AY+DQFD+DGDGMIEN+GFPDQTYD WS SG+SAY GGLWVAALQA
Sbjct: 122 FAKAVWPSVYTAIAYLDQFDKDGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAG 181
Query: 707 SALAREVGDR 716
SALARE+GD
Sbjct: 182 SALAREIGDN 191
>gi|333898016|ref|YP_004471890.1| glucosylceramidase [Thermoanaerobacterium xylanolyticum LX-11]
gi|333113281|gb|AEF18218.1| Glucosylceramidase [Thermoanaerobacterium xylanolyticum LX-11]
Length = 806
Score = 330 bits (846), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 170/367 (46%), Positives = 236/367 (64%), Gaps = 10/367 (2%)
Query: 532 FLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQW 591
F LE +Y + T DV FY SF L+ML+P I+ + R FA + + D S+ K+ +G
Sbjct: 437 FGLLECFDYNYYETLDVRFYGSFPLVMLWPDIEKQVMRQFADTINVQDSSEFKVGSNGAM 496
Query: 592 VSRKVLGAVPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--DKKF 647
+KV G +PHD+G PW ++NAY + WKDLN K+VL VYRD V TG DK+F
Sbjct: 497 AVKKVQGMIPHDLGSSYALPWIKINAYDWQNPNIWKDLNSKYVLLVYRDYVLTGKTDKEF 556
Query: 648 AKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAAS 707
K W SV A+ + + D+D DG+ +N+G PDQTYDTWS+ G SAY G LW+AAL+AA
Sbjct: 557 LKYTWKSVKTALDKLKEMDKDNDGIPDNEGIPDQTYDTWSMKGTSAYCGSLWLAALKAAQ 616
Query: 708 ALAREVGDRGSEDYFLFK--FQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQW 764
+ + + D +E Y + ++ A+ +EK LWNG Y+N+D S SI ADQLAGQW
Sbjct: 617 EIGKVLKD--NEAYIKYNEWYKIAQQNFEKELWNGEYYNFDTE-SDHKDSIMADQLAGQW 673
Query: 765 YARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWS 824
YA L I+ +D + ALKK+Y +NV+K GK GAVNGM PDG VD S +Q++E+W+
Sbjct: 674 YADILRLGDILPKDHVQKALKKIYEFNVMKFENGKMGAVNGMRPDGIVDESDIQAQEVWT 733
Query: 825 GVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRP 884
GVTYA+A+ M + + + + TA G+Y+ + +G GY F+TPEAW D YR+ YMRP
Sbjct: 734 GVTYALASFMKYRGMTEEAYNTAYGVYKMTYDKSGKGYWFRTPEAWTKDGNYRASMYMRP 793
Query: 885 LAIWAMQ 891
L+IW+M+
Sbjct: 794 LSIWSME 800
Score = 238 bits (608), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 131/374 (35%), Positives = 202/374 (54%), Gaps = 32/374 (8%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVS-RSNGQKYSSVLCP 153
G PLGG G+G+IGR+Y G F RW + + V ANQFSVF N + VL
Sbjct: 71 QGAPLGGFGAGTIGRTYNGGFSRWHLEIGKNKYTTVYANQFSVFQKVEGNKDGVAQVLYA 130
Query: 154 KTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPH 213
PE + SW W+ + Y+ALYP +W + + P +++ +Q SPIIP+
Sbjct: 131 GEPE------NGYLSSWKWDYPKESGMYYALYPNSWYTYTNKDLP-VQLAVKQFSPIIPY 183
Query: 214 NYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYN-----------S 262
NYKE+SYPV+VF +T YN + D++++FTW N +G F G+ N
Sbjct: 184 NYKETSYPVAVFKWTAYNPTNKNVDVSIMFTWQNMIG----FFGKQVNVNSGNFNKIIKD 239
Query: 263 KTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWH 322
K+K ++ V AV+ + ++ +++ ++ GV +S FV +G+ D+WH
Sbjct: 240 KSKDSEIVAAVMGNISNDNEEWNGEYSIGVKKVPGVDISYKAKFVTTGD-----GSDLWH 294
Query: 323 EIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
E ++G D N + + T + G IG+AIA + + P +V F+L+WD P + F G
Sbjct: 295 EFSKNGILD--NKDDETPTKQDG--IGSAIAVNFKLQPGQTIEVPFALSWDLPIMKFGGG 350
Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
+Y+ YTK++G + + I ++A+ + WE I+ WQ+PIL +K P+WY LFNE
Sbjct: 351 DKWYKMYTKYFGKNGKNSFAILKEALNNYQKWEKMIDDWQKPILSNKSKPDWYKTALFNE 410
Query: 443 LYYLNAGGAVWTDG 456
LYYL GG W +G
Sbjct: 411 LYYLADGGTAWENG 424
>gi|427725201|ref|YP_007072478.1| glucosylceramidase [Leptolyngbya sp. PCC 7376]
gi|427356921|gb|AFY39644.1| Glucosylceramidase [Leptolyngbya sp. PCC 7376]
Length = 807
Score = 330 bits (845), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 255/858 (29%), Positives = 387/858 (45%), Gaps = 148/858 (17%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W + + A QF V+ G+ + + +
Sbjct: 41 HGMPLGGFGAGCIGRSPRGDFNLWHLDAGEHTFRNAGACQFGVYEYVEGGEAQAYAMATE 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
P+ D T + SWDW +K TY ALYPR+W ++ + EL C Q SPI+ +
Sbjct: 101 PPD---DGT---LESWDW-YPTEKGTYSALYPRSWYEYKDVFNVEL--TCEQFSPIVAES 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG------------------------ 250
Y+E+SYP ++F ++ +N ++++ +W N VG
Sbjct: 152 YQETSYPTAIFEWSAHNPTDKPITVSIMMSWQNMVGWFTKATKTPKVELRDDGSPVYEYQ 211
Query: 251 ---GDSEFT-GQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHF 306
GDS Q + ++ + V H + S + FA T + V +
Sbjct: 212 SRWGDSTGNLNQWIQDRYRVGCLLTRVRPHDQISEGEGQICFATVTNPT--LEVFYHNRW 269
Query: 307 VISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQV 366
SG+ +D+W+ +GS L +E + PG I AIA TV P ++
Sbjct: 270 NPSGD-----GRDVWNYFAGNGS---LPDVEDETPASPGEQIAGAIAVRFTVRPGRTKKI 321
Query: 367 TFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
F LAWD P + F TY+RRYT F+G H W + A + L
Sbjct: 322 PFFLAWDFPVMEFAPNITYFRRYTDFFGRSG-------------HNVWSIIRTAMKHSDL 368
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
+R+ EW L S++DL +
Sbjct: 369 WKERIEEWQKPFL------------------------------------SRADLPDWFKM 392
Query: 487 PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTY 546
S+ ++Y +A S T +D + +GQF LE ++Y + +
Sbjct: 393 ---------------SLCNELYL-LAQGGSLWTAATED--DPVGQFGVLECLDYSWYESL 434
Query: 547 DVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM-----KLLD-DGQWVSRKVLGAV 600
DV Y SFAL ML+PK+ S+ FA A+ D ++ KL+ D +RK GA
Sbjct: 435 DVRMYGSFALAMLWPKLDKSVLEAFARAIPSSDDTQRAIGYNKLMGYDNFMAARKRAGAT 494
Query: 601 PHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--DKKFAKAVWPSVY 656
PHD+G + PW N D WKDL FVLQVYRD V TG D +F + W V
Sbjct: 495 PHDLGAPNEHPWEATNYTAYQDCNLWKDLGADFVLQVYRDFVMTGSKDTEFLWSCWEGVV 554
Query: 657 VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW----------VAALQAA 706
A+ Y+ FD D DG+ EN G PDQT+D W + GISAY GGLW L+
Sbjct: 555 QALDYLKTFDLDCDGIPENGGSPDQTFDDWKLKGISAYCGGLWIAALEAAIAIANILKNK 614
Query: 707 SALAREVGDRGSEDYF---LFKFQ----KAKVVYEK-LWNGSYFNYDNSGSSQSSSIQAD 758
+++ E ++ S + F + K+Q +++ +Y+ LWNGSY+ D S S + +D
Sbjct: 615 ASIYEEHQNQVSTEEFEAAIAKYQDWLKQSRPLYQDTLWNGSYYTLDT--GSDSKVVMSD 672
Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSS-M 817
QL G++Y + L + E++ L+ +Y LK G+ GA NG+ PDG + +
Sbjct: 673 QLCGEFYTQLLNLPNVNPENRTAITLQTIYEACFLKFHDGQFGAANGLNPDGSPEKENDT 732
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
E+W+G+ + +AA MI + D + + +S G F+TPEA ++ +R
Sbjct: 733 HPLEVWTGINFGIAAFMIRNGMKDEALKMTEAVIHQVYSN---GLQFRTPEAITANNTFR 789
Query: 878 SLCYMRPLAIWAMQWALT 895
+ Y+R + IWA+ ALT
Sbjct: 790 ACHYLRAMGIWAIYDALT 807
>gi|324504438|gb|ADY41918.1| Non-lysosomal glucosylceramidase [Ascaris suum]
Length = 534
Score = 329 bits (843), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 217/576 (37%), Positives = 289/576 (50%), Gaps = 71/576 (12%)
Query: 342 SEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTK-FYGTHQNAA 400
+E +G AI A S +V F LAWD P V F +G+ YRR F+
Sbjct: 11 TEGEVDLGVAICAEFPAKCKSSAEVEFVLAWDMPIVKFGAGRRQYRRRYARFFPDASKRV 70
Query: 401 ANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPV 460
+ A++ WE +I+AWQ+ IL D LP+WY LFNE Y+L GG W
Sbjct: 71 EQMCSRALMSRIEWERKIDAWQQRILSDDSLPDWYKSALFNESYFLTDGGTCW------- 123
Query: 461 HSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTN 520
F D D R + N++A E
Sbjct: 124 ----------FEYD----DEWRSTERQMSNESAKYFKE---------------------- 147
Query: 521 LLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
G+F YLE EY M NTYDVHFYSSFAL+ +P I+L+IQ DFA V+
Sbjct: 148 --------FGRFAYLEAWEYYMLNTYDVHFYSSFALLENWPLIELAIQLDFADQVLSSCD 199
Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRD 638
K +++ + K LG +PHD+G + +PW +NAY L DT WKDLN KFVL YRD
Sbjct: 200 RKSVNINESTRTAVKRLGRLPHDLGNPMDEPWLHLNAYALSDTCEWKDLNLKFVLTCYRD 259
Query: 639 ------VVATGDKKFAKAVWPSVYV----AMAYMDQFDRDGDGMIENDGFPDQTYDTWSV 688
+ D + + Y +A +D DGD +IEN G PDQTYD WS+
Sbjct: 260 YEKIVKIYFNDDNEMKGCLLRRFYDLSSGIIADAKAWDVDGDDLIENAGQPDQTYDVWSM 319
Query: 689 SGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNS 747
G SAY GGLW+ AL+ +A +G+ F K A+ YE KLWNG YF++D
Sbjct: 320 HGSSAYCGGLWLCALECVRRMALTLGEVVDAQKFANKLNNARKAYERKLWNGKYFDFDEH 379
Query: 748 GSSQSSSIQADQLAGQWYARAC-GLLP--IVDEDKARSALKKVYNYNVLKVMGGKRGAVN 804
S+ SI ADQL G W+ G + I+ + ++LK ++ YNV K G+ G VN
Sbjct: 380 -STDHKSIMADQLCGFWFMCITDGKVDDVIITRQQICASLKTIFEYNVEKFANGQLGPVN 438
Query: 805 GMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAF 864
M+P G VD + +QS E+W GV YA+A+ + + + F+TA G Y + W GL Y
Sbjct: 439 AMMPSGVVDSTGIQSEEVWGGVAYALASFHLLVEENESAFKTAEGWYRSCWERYGLQY-- 496
Query: 865 QTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPK 900
Q+PEA N YR++ YMRPLAIWAMQ AL + K
Sbjct: 497 QSPEAINESSYYRAIGYMRPLAIWAMQSALDALRNK 532
>gi|195033785|ref|XP_001988762.1| GH11342 [Drosophila grimshawi]
gi|193904762|gb|EDW03629.1| GH11342 [Drosophila grimshawi]
Length = 869
Score = 328 bits (841), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 274/928 (29%), Positives = 429/928 (46%), Gaps = 181/928 (19%)
Query: 77 KGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFS 136
+G+R ++D + + +GVP+GG+G GSIGR Y GEF R+Q+ P + E V ANQF
Sbjct: 13 EGRRVYMDYYYMENGKQMYGVPIGGIGGGSIGRGYAGEFCRFQMRPGIYEYNVVQANQFI 72
Query: 137 VFVSRSNGQK-YSSVLC----------PKTPEVLKDTTAAG----------------IGS 169
V + G + S+L P + D A + +
Sbjct: 73 VTIKDHKGCTIFQSLLSKCSTKSIKAKPNNNDAEADEAEAQKSKCQLPNCSSRSRQPLSA 132
Query: 170 WDWNLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVF 225
W N++ + +Y LYPR+WT H G +R++CRQISP+IPH+YKESS P +VF
Sbjct: 133 WHSNIEDARCSYTGLYPRSWTEYDLSHYG-----VRLICRQISPVIPHDYKESSLPCAVF 187
Query: 226 TYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPP 285
++ N +++ FT+ N G + G+ N+++++ +A + R + P
Sbjct: 188 VWSAQNVCDHERKVSITFTFKNGTGNKKQ-DGEG-NAESQLISEGNAKGVAIRQTIGDMP 245
Query: 286 VTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPG 345
++ LA + + ++ CP F +GN + +W ++KEHG + E+ T +
Sbjct: 246 CSYNLACRVLPEISITRCPQFDPAGN-----GEQLWMQLKEHGQLSEQPTAESLKTKD-- 298
Query: 346 SSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG-KTYYRRYTKFYGTHQNAAANIA 404
IG A+ A + + P + + F LAWD P++ F +T+ R YTK++ + I
Sbjct: 299 --IGVAVCAHLALKPKTSHDLEFVLAWDMPKIQFPRKLQTHTRYYTKYFDDSGESGPRIC 356
Query: 405 RDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLV 464
A+ + SWE RL + + + N+ DG P +
Sbjct: 357 EYALKHYASWE--------------RLIDAWQRPILND------------DGLPDWYKCA 390
Query: 465 TIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQD 524
F D LK DT++ P+A + D
Sbjct: 391 IFNQLYFISDGGTLWLK--------CDTSLG-------------KPLAFD---------D 420
Query: 525 GEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPK----IQLSIQRDFAAAV----- 575
G+F YLEG EY M+NTYDVHFY+S AL L+P +Q + AA +
Sbjct: 421 PRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELSDTRK 480
Query: 576 MMHD----PSKMK----------------------LLDDGQW--VSRKVLGAVPHDIGIC 607
M++D P K+K + D +W ++ K + V D +
Sbjct: 481 MLYDGKVMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNEWKDLNTKFVLQVYRDYYVL 540
Query: 608 DPWFEVNA-----------------YCLY-------DTARWKDLNPK----FVLQVYRDV 639
+ + A Y LY ++A K N K ++ + V
Sbjct: 541 NELAQAQADNASKFSSIEFIDKESLYELYTQDNKRKNSADEKQQNRKSASMYINETNGKV 600
Query: 640 VATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
+ KA++ + M ++D+D DG+IEN PDQTYD+W + G SAY GLW
Sbjct: 601 YLMDAMSYLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCAGLW 660
Query: 700 VAALQAASALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQAD 758
+AALQ S +A + + ++ K + EKLWNGSY+ +D S + +I AD
Sbjct: 661 LAALQTMSVMATLLDQPNDCLRYQDILERGKNSLEEKLWNGSYYRFDQS-HNHRDTIMAD 719
Query: 759 QLAGQWYARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------G 810
QL G WY ++CG I ++ R+ALK++Y+ NV+ G GA NG + + G
Sbjct: 720 QLCGHWYLKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANANEPSKPG 779
Query: 811 RVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAW 870
VD S++Q+ E+W GV YA+AA+MI E + + FQTA G+Y+ G+ F+TPEA
Sbjct: 780 HVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTISQRIGMN--FETPEAL 837
Query: 871 NTDDQYRSLCYMRPLAIWAMQWALTRPK 898
+ +YRS+ YMRPL+IW+MQ A R +
Sbjct: 838 YGEKRYRSIGYMRPLSIWSMQVAWERRR 865
>gi|242015137|ref|XP_002428230.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212512791|gb|EEB15492.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 812
Score = 328 bits (840), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 170/368 (46%), Positives = 237/368 (64%), Gaps = 7/368 (1%)
Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDG 589
G+F YLEG EY M+NTYDVHFY+SFAL ML+P +Q +QRDF V D S+ L +G
Sbjct: 441 GRFAYLEGHEYRMYNTYDVHFYASFALAMLWPNLQSCVQRDFCDTVPQEDRSRQWHLFNG 500
Query: 590 QWVSRKVLGAVPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKF 647
RKV +VPHD+G +P+ ++N+Y ++D + W+DL+ KFVLQ YRD D
Sbjct: 501 SIGYRKVKNSVPHDLGDPFEEPFLKINSYPIHDVSEWRDLDLKFVLQSYRDYFLNNDLNQ 560
Query: 648 AKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAAS 707
K +W SV V M ++D+DGDG+IEN G PDQTYD W++ G SAY G LW+AALQ
Sbjct: 561 LKYLWKSVCVLMNNALRWDKDGDGLIENVGEPDQTYDAWTMEGPSAYCGSLWLAALQCTC 620
Query: 708 ALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYA 766
+A + D + + F + A+ ++ KLWNG Y+N+D S ++I +DQL G WY
Sbjct: 621 KMALILSDVDTYNKFSETLKMAQQSFDKKLWNGKYYNFDCS-EKWGTTIMSDQLCGLWYL 679
Query: 767 RACGLL-PIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSG 825
CG+ ++ K S+L+ +Y NVLK+ G G VNGMLP+G +D+ ++QS EIW G
Sbjct: 680 YCCGIKNEVLPPAKVNSSLRCIYENNVLKLGKGNLGCVNGMLPNGEIDIHTLQSEEIWVG 739
Query: 826 VTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPL 885
V YA+A+ MI E + + G++TA GIYE + G+G F+TPEA +YR++ YMRPL
Sbjct: 740 VVYALASLMIFEGMVEEGWKTARGIYETVYDRIGMG--FETPEALRESKKYRAIGYMRPL 797
Query: 886 AIWAMQWA 893
+IW+MQ A
Sbjct: 798 SIWSMQLA 805
Score = 219 bits (558), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 140/402 (34%), Positives = 219/402 (54%), Gaps = 24/402 (5%)
Query: 60 LAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQ 119
+ P+ +R L KG+R +D + +GVPLGG+GSGSIGR ++GEF R+Q
Sbjct: 39 MIPLIIRYLIYSIYILCKGRRPLMDYMKNLNGKQMYGVPLGGIGSGSIGRGFKGEFCRYQ 98
Query: 120 IFPRVCEDKPVLANQFSVFVSRSNGQK-YSSVL-CPKTPEVLKDTTAAGIGSWDWNLKGD 177
+ P E + + NQF + V G + VL C + P+ + + +WDW L G+
Sbjct: 99 LRPGNYEYETIYGNQFILTVRDGEGNAVFQKVLNCGRAPK-------SCLKNWDWTLNGE 151
Query: 178 KSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSA 237
+ Y LYPR+WT + E + +LR++C+QISP+IP+NYK+S P S+F + I N
Sbjct: 152 NANYVGLYPRSWTEYNIE-EQKLRLICKQISPVIPNNYKDSCIPGSMFIWHIENQSNEVK 210
Query: 238 DITLLFTWTNSVG--GDSEFTGQHYNSKTKMNDG-VHAVLLHHRTSHQLPPVTFALAAQE 294
D+ + FT+ N +G GD + K ++G + VL++H S TF ++ E
Sbjct: 211 DVNITFTFQNGIGEKGDKKKGCWSQMFKENCSNGNILGVLINHVISSM--NYTFGISVCE 268
Query: 295 TDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAA 354
GVHVS + N LG KD+W++IK G ++ + + S + G + +A+ A
Sbjct: 269 KAGVHVSCLKKW----NPLG-NGKDIWNKIKT-GKLEKFD-FDPSHHTANGEQLASAVCA 321
Query: 355 SVTVPPDSEGQVTFSLAWDCPEVNFMSGKT-YYRRYTKFYGTHQNAAANIARDAILEHGS 413
+P S + F+L WD P++NF K Y++ YT+F+G +N ++ + +
Sbjct: 322 QTLIPGKSVSTLEFNLIWDMPKINFHENKKQYWKYYTRFFGKEKNGPK-LSAYCFENYKN 380
Query: 414 WELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTD 455
W+L+I+ WQ PIL D LP WY LFNE Y+++ GG VW D
Sbjct: 381 WDLEIDKWQNPILNDDNLPSWYKSALFNETYFVSDGGTVWLD 422
>gi|328709489|ref|XP_001943760.2| PREDICTED: non-lysosomal glucosylceramidase-like [Acyrthosiphon
pisum]
Length = 776
Score = 324 bits (830), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 176/380 (46%), Positives = 231/380 (60%), Gaps = 12/380 (3%)
Query: 522 LQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPS 581
+ D G F YLEG EY M+NTYDVHFY+SF L L+PK+Q IQ F +
Sbjct: 396 VTDPRYQYGYFGYLEGHEYRMYNTYDVHFYASFTLAKLWPKLQACIQYKFRDTIDNALNI 455
Query: 582 KMKLLDDGQWVSRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDV 639
K L G RK +VPHD+G DP+ VNAY +++ + W+DLN KFVLQ YRD
Sbjct: 456 TRKHLSCGTRGLRKYKYSVPHDLGDPEEDPFNLVNAYLVHNVSEWRDLNLKFVLQCYRDY 515
Query: 640 VATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
GD K+ ++P + M +D DGDGMIEN G DQT+DTW ++GISAY GGLW
Sbjct: 516 TMFGDNKYLADMYPQIKNVMEKSLSWDTDGDGMIENSGTADQTFDTWIMTGISAYCGGLW 575
Query: 700 VAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQAD 758
+AAL ++ + + + KAK + LWNG YFN+D+SG S SI AD
Sbjct: 576 LAALYCTVQISNILKQDDISEKYSTILNKAKESFNAALWNGGYFNFDSSGQHHSKSIMAD 635
Query: 759 QLAGQWYARACG----LLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM 814
QL G+WY + CG +LP+ D+ + AL +Y NVL GK GAVNGM+PDG D
Sbjct: 636 QLCGEWYLKCCGVNEEVLPV---DRVKKALHTIYKMNVLGFNDGKMGAVNGMMPDGNPDT 692
Query: 815 SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDD 874
S+QS E+W+GVTYA+++ MI L D GF TA GIY ++ G+ A+QTPEA + +
Sbjct: 693 FSLQSEEVWTGVTYALSSLMIAHGLRDEGFNTAKGIYNTVYNNIGM--AYQTPEAIYSKN 750
Query: 875 QYRSLCYMRPLAIWAMQWAL 894
YRS+ YMRPL+IW++Q AL
Sbjct: 751 AYRSVGYMRPLSIWSIQTAL 770
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 122/409 (29%), Positives = 189/409 (46%), Gaps = 75/409 (18%)
Query: 55 KETVQLAPIGVRILCLIREEAAKGKRAFID---PFIKRHLTSSHGVPLGGVGSGSIGRSY 111
K+ +QL P+ +R + + KG++ +D P +H+ +GVPLGG+G G+IGR +
Sbjct: 40 KQILQLLPLIIRYVFYMLLCYLKGQKPIMDFLFPIKSKHM---YGVPLGGIGGGTIGRGF 96
Query: 112 RGEFQRWQIFPRVCEDKPVLANQFSVFVSRS-NGQKYSSVL-CPKTPEVLKDTTAAGIGS 169
+GEF R+Q+ P + E + V ANQF V + S N Y++VL C K + +
Sbjct: 97 KGEFCRYQVVPGMYEYESVQANQFIVNIQNSENITVYNNVLSCLK-------CNNKTLYN 149
Query: 170 WDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTI 229
W W+ ++ Y LYPR+WT + P+ +R++C+QISPIIPHNYK+S P VF +T+
Sbjct: 150 WKWDFPSNQCKYTGLYPRSWTEYL-IPEFHIRLICKQISPIIPHNYKDSCIPGGVFIWTV 208
Query: 230 YNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFA 289
N+ I+L FT+ + G S +H P
Sbjct: 209 ENNSDQQYSISLTFTFASGNGISS----------------------YHSDEPSCEPFNLN 246
Query: 290 LAAQETDGVHVSLC-----PHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEP 344
E G + C +VI G IKE +F +LN +P
Sbjct: 247 KDGTEVAGSLIHNCFKEMKCTYVIGG------------LIKEGAAFSQLN-------FDP 287
Query: 345 GSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIA 404
+ G I+ S+ L + ++ Y + YTKF+ + A I
Sbjct: 288 KGN-GMTISKSL------------HLFGELRDMQLGPSNIYSKYYTKFFDKNGTAGPEIM 334
Query: 405 RDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVW 453
++ WE +IE WQ PIL + LP+WY +FNE+Y++ GG +W
Sbjct: 335 HYSMQNFKKWEQEIEKWQNPILNNPLLPDWYKSAIFNEVYFVADGGTIW 383
>gi|307151416|ref|YP_003886800.1| glucosylceramidase [Cyanothece sp. PCC 7822]
gi|306981644|gb|ADN13525.1| Glucosylceramidase [Cyanothece sp. PCC 7822]
Length = 805
Score = 322 bits (825), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 257/864 (29%), Positives = 394/864 (45%), Gaps = 141/864 (16%)
Query: 80 RAFIDPFIKRHLTSS-----HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQ 134
+A+ +P+ R+ ++ HGVPLGG G+G IGRS +G+F W + + + A Q
Sbjct: 21 KAWDNPYTVRYNSNLDDGPWHGVPLGGFGAGCIGRSPKGDFNLWHLDGGEHIFRTLAACQ 80
Query: 135 FSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEG 194
FS+F G+ + L + PE D T + W W +K TYHALYPR+W +EG
Sbjct: 81 FSIFEQPEGGEAQAYALSTEPPE---DGT---LSRWSW-YPTEKGTYHALYPRSWYEYEG 133
Query: 195 EPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSE 254
+ +I+C Q SPI NY+E+SYPV++F +TI+N ++++ +W N+VG
Sbjct: 134 VF--QTKIICEQYSPIWAGNYQETSYPVAIFAWTIHNPTDRPITLSIMMSWQNTVGW--- 188
Query: 255 FTGQHYNSKTKMNDGVHAVLLHH------------------RTSHQLPPVTFALAAQETD 296
F+ + K+ D V + R L + QE +
Sbjct: 189 FSNAIKTPEVKVRDDGSPVYEYQPRWGDSTGNYNQWIVDNFRVGCLLNRIQPHPQVQEGE 248
Query: 297 G-------VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIG 349
G ++ S+ + N +G ++W +G L E + PG I
Sbjct: 249 GQLCIASILNPSVEVFYNTRWNPMG-DGSEIWDYFAMNG---MLGDYENESAAAPGEQIA 304
Query: 350 AAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAIL 409
AA+A T+PP GKT R F
Sbjct: 305 AAMAIRFTIPP---------------------GKT---RLIPF----------------- 323
Query: 410 EHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHR 469
I AW PI E + +Y + + + N G VW+ + ++ R
Sbjct: 324 --------ILAWDLPITEFSQGINYY--RRYTDFFGRN-GQNVWSVVRTALKNVDI--WR 370
Query: 470 KFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENI 529
+DW QS + + D+P+ A + ++Y +A + T +D + +
Sbjct: 371 DRIIDW-QSPILKDQDLPDWFKMA---------LFNELYL-LADGGTLWTAATED--DPV 417
Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDG 589
GQF LE I+Y + + DV Y SFAL ML+P++ ++ FA A+ D + + +G
Sbjct: 418 GQFAVLECIDYRWYESLDVRLYGSFALGMLWPRLDKAVMEAFARAIPQADDTPRIIGYNG 477
Query: 590 QWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--DK 645
RK GA PHD+G + PW + N D WKDL FVL +YR + TG ++
Sbjct: 478 AKAIRKAKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVLLLYRSFLWTGGNNQ 537
Query: 646 KFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQA 705
+ W SV +AY+ FD DGDG+ EN G PDQT+D W + GISAY G LW+AAL+A
Sbjct: 538 DYLWEHWHSVTETLAYLKAFDLDGDGIPENSGAPDQTFDDWQLRGISAYCGALWIAALEA 597
Query: 706 ASAL------------------AREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNS 747
A + A E + Y + Q + ++ LWNG Y+ D+
Sbjct: 598 AIKMGEILLQRPPMNPQLQPQNAAESIEEAITTYRHWLQQARSLYHDTLWNGEYYRLDS- 656
Query: 748 GSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGML 807
S S + +DQL GQ+YAR GL +V+ SAL+K+Y LK GGK GA NG+
Sbjct: 657 -ESGSDVVMSDQLCGQFYARLLGLPDVVEPQYTTSALRKIYEACFLKFHGGKYGAANGVK 715
Query: 808 PDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQT 866
PDG + + E+W+G+ + +AA MI + + F+ + + + G F+T
Sbjct: 716 PDGTPENPDATHPLEVWTGINFGLAAFMIQMGMKEEAFKITEAVVKQVYEN---GLQFRT 772
Query: 867 PEAWNTDDQYRSLCYMRPLAIWAM 890
PEA +R+ Y+R +AIWA+
Sbjct: 773 PEAITAVGTFRASHYLRAMAIWAV 796
>gi|158299437|ref|XP_319575.4| AGAP008830-PA [Anopheles gambiae str. PEST]
gi|157013519|gb|EAA14848.4| AGAP008830-PA [Anopheles gambiae str. PEST]
Length = 913
Score = 315 bits (807), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 194/636 (30%), Positives = 317/636 (49%), Gaps = 94/636 (14%)
Query: 53 NWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYR 112
++++ + + P+ +R + + A +G++ +D + + +G PLGG+G+G+IGR +
Sbjct: 35 SFRQILSMVPLALRYIPYYWKVAREGRQVLMDYWYTENGKQIYGAPLGGIGAGTIGRGFA 94
Query: 113 GEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQK-YSSVLCPKTPEVLKDTTA------- 164
GEF R+Q+ P + E V ANQF V + G + S+L V K +
Sbjct: 95 GEFCRYQLKPGLYEYNTVHANQFIVTIKDETGATIFHSLLSTYRSYVQKIDSHRLAKAFV 154
Query: 165 ----AG---------IGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPII 211
AG + SW+ L + +Y ALYPRAW+ ++ + +++V RQISPII
Sbjct: 155 LRVWAGFISYRPKTPLASWESGLDASRCSYTALYPRAWSEYD-LSEHGVKLVQRQISPII 213
Query: 212 PHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVH 271
PH+YKESS P +VF +T+ N +TL FT+ N G + +
Sbjct: 214 PHDYKESSLPCAVFVWTVENVCDKDRQVTLTFTFKNGTGTKKQDAEGGSETSAFTQGNAR 273
Query: 272 AVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFD 331
V + + P T+ ++ + + ++++ C F +GN + +W+++KE+G
Sbjct: 274 GVSIRQTIAEM--PCTYCVSCRSSSEINLTRCERFDPTGN-----GEKLWNDLKENGHLT 326
Query: 332 RLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMS-GKTYYRRYT 390
++ ET + + + A++ + V P + Q+ F+L WD P+V+F GK Y+R YT
Sbjct: 327 EKSNDETLKSKD----VAVAVSGQILVQPGTTSQLEFALVWDMPKVHFQKRGKEYHRYYT 382
Query: 391 KFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGG 450
K++G +A I+ A+ +G WE I+ WQRPILED LP+WY +FNELY++ GG
Sbjct: 383 KYFGKSGDAGPQISDYALNNYGKWERLIDEWQRPILEDADLPDWYKSAIFNELYFIADGG 442
Query: 451 AVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTP 510
+VW F++D Q+DL
Sbjct: 443 SVW-----------------FTMD-DQTDLP----------------------------- 455
Query: 511 VALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRD 570
D G++ YLEG EY M+ TYDVHFY+S AL L+P +Q+SIQ D
Sbjct: 456 -----------FDDPRLAYGRYSYLEGHEYRMYTTYDVHFYASHALASLWPNLQVSIQYD 504
Query: 571 FAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG--ICDPWFEVNAYCLYDTARWKDLN 628
+ ++ K L DG+ + RK+ +VPHD+G +P+ +NAY ++D + W+DLN
Sbjct: 505 YKDSIEREITEGRKHLYDGKVIPRKIKNSVPHDLGDPAEEPFDLINAYPIHDVSEWRDLN 564
Query: 629 PKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
KF+LQVYRD A S + ++ ++D+
Sbjct: 565 LKFILQVYRDYYTLNHYAQLNAENASKFSSIEFIDK 600
Score = 219 bits (558), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 110/255 (43%), Positives = 164/255 (64%), Gaps = 11/255 (4%)
Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
+ KA++P+ V + + ++D+DGDG+IEN PDQTYDTW + G SAY GGLW+A+L
Sbjct: 653 YLKAMYPACRVVLEHSLEWDKDGDGLIENSKAPDQTYDTWVMDGPSAYCGGLWLASLHCM 712
Query: 707 SALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
SA+A + D + K + + EKLWNG+Y+ +D +S+ +SI +DQL G WY
Sbjct: 713 SAMASLLDQNEDSDRYKAVLDKGRASFEEKLWNGTYYRFDAQSASK-NSIMSDQLCGHWY 771
Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP------DGRVDMSSMQ 818
R+CG + ++ R A++ +Y NV++ GG+ GAVNG +P DGR D S+Q
Sbjct: 772 LRSCGFDYDVFPKENVRLAMRTIYENNVMRFCGGQLGAVNGYVPSGQPNKDGRPDTVSIQ 831
Query: 819 SREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRS 878
E+W+GVTYA+A++MIHE + + F+TA G+Y+A G+ F+TPEA + YR+
Sbjct: 832 GEEVWTGVTYALASTMIHEGMFEEAFKTAGGLYQALSERIGMN--FETPEAVYAERHYRA 889
Query: 879 LCYMRPLAIWAMQWA 893
+ YMRPL+IW+MQ A
Sbjct: 890 IGYMRPLSIWSMQTA 904
>gi|189239183|ref|XP_966847.2| PREDICTED: similar to bile acid beta-glucosidase, putative
[Tribolium castaneum]
gi|270010940|gb|EFA07388.1| hypothetical protein TcasGA2_TC016367 [Tribolium castaneum]
Length = 818
Score = 315 bits (806), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 167/373 (44%), Positives = 236/373 (63%), Gaps = 12/373 (3%)
Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDG 589
G+F LEG EY M+NTYDVHFY+SF+ ++ FP +Q +Q D A+ P K+K+L DG
Sbjct: 442 GKFALLEGHEYKMYNTYDVHFYASFSFVLNFPLLQSILQYDMRDAIFTEIPDKVKMLYDG 501
Query: 590 QWVSRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--DK 645
+ RKV VPHDIG +P+ +N+Y ++D ++W+DLN KFVLQV+RD TG D+
Sbjct: 502 EVCERKVPNTVPHDIGDPGEEPFILLNSYPIHDVSQWRDLNSKFVLQVFRDAFITGLDDR 561
Query: 646 KFA--KAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAAL 703
A ++ + Y M FD DGDG+IEN G PDQT+DTW ++G SAY GGLW+AAL
Sbjct: 562 SIAYLNDMYNACYTVMHKSLDFDVDGDGLIENSGSPDQTFDTWVMTGASAYCGGLWLAAL 621
Query: 704 QAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAG 762
A + +A + ++ F K +E KLWNG +N+D S + SI ADQL G
Sbjct: 622 FAMTKIADALQKTEDKEKFQELLDKGTAAFERKLWNGKCYNFDCS-DKECRSIMADQLCG 680
Query: 763 QWYARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSRE 821
QWY R+CG + +D+ +++LK +Y NV G+ GAVNG + DG +D ++QS+E
Sbjct: 681 QWYLRSCGFNYEVFPQDRVKTSLKTIYENNVQSFCDGRMGAVNGFI-DGVIDEFTIQSQE 739
Query: 822 IWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCY 881
+W+GVTYA+AASM+ E + F TA G++++ GL +F TPEA YR++ Y
Sbjct: 740 VWTGVTYALAASMLQEGMRTEAFNTAGGMFKSMSERFGL--SFDTPEALYAAKYYRAIGY 797
Query: 882 MRPLAIWAMQWAL 894
MRPL+IW+MQ A+
Sbjct: 798 MRPLSIWSMQLAI 810
Score = 184 bits (468), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 126/398 (31%), Positives = 195/398 (48%), Gaps = 24/398 (6%)
Query: 60 LAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQ 119
+ P+ +R + GK +D +GVP+GG+G G+IGR YRGEF R+Q
Sbjct: 46 MLPLILRYIWYYIVHRIMGKSIVMDYVHTLRAKQIYGVPIGGIGCGTIGRGYRGEFCRFQ 105
Query: 120 IFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKS 179
+ P + E V ANQF V + + + L P + SW+ + G +
Sbjct: 106 LKPGLYEYNTVDANQFIVTIKDGHNETIFHSLLSTFP-------TKSLKSWESLIDGSEC 158
Query: 180 TYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADI 239
Y LYPR+WT ++ + + CRQI+PIIPHNYK+SS P ++F + + N + +
Sbjct: 159 FYTGLYPRSWTEYDLSK-YGVMLTCRQITPIIPHNYKDSSLPCAMFVWDVKNISEEDRTV 217
Query: 240 TLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVH 299
T+ FT+ N VG S +SK V+L+H +++LAA+ +
Sbjct: 218 TIAFTFKNGVGDKSRDRSSTCSSKAFTLADSEGVILYHTIDKM--QCSYSLAAKTNPQID 275
Query: 300 VSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVP 359
+S C +F NS G+ W ++K +GSFD+++ + + IA T+
Sbjct: 276 ISKCLYF--DPNSDGIKP---WIQLKNNGSFDKISDKNYG---QIFGEMACGIATKATIR 327
Query: 360 PDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTH---QNAAANIARDAILEHGSWEL 416
+ +L WD P V F + ++YT++Y H N I R A + +WE
Sbjct: 328 AGNTVTSEMALVWDMPSVQFPKKQ---KKYTRYYTKHFDGDNTGLRIVRYAFENYQNWER 384
Query: 417 QIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
I WQR IL+D+ LP+WY LFNE YY++ GGA+W
Sbjct: 385 AIYDWQRRILDDENLPDWYKSALFNESYYISDGGAIWV 422
>gi|427419169|ref|ZP_18909352.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 7375]
gi|425761882|gb|EKV02735.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 7375]
Length = 835
Score = 311 bits (796), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 256/909 (28%), Positives = 383/909 (42%), Gaps = 202/909 (22%)
Query: 85 PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
P+I R+ ++ HG+PLGG+G+G IGR+ G F W + + A QF+V+
Sbjct: 22 PYIVRYASNLDDGPEHGMPLGGLGAGCIGRAPNGTFNLWHLDGGEHTFEHFPACQFAVYE 81
Query: 140 SRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKS-------------------- 179
+ + ++ T G+ W W ++S
Sbjct: 82 EAAGASQTYAMA----------TAGDGLERWQWYPAENRSVGTAHPPGQPGNDTTSTIKG 131
Query: 180 -TYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSAD 238
TY+ALYPR+W + L C Q SPI P NY+ESSYPV+VF +T +N
Sbjct: 132 GTYNALYPRSWYSYADVFKANL--TCEQFSPIWPDNYQESSYPVAVFVWTAHNPTDQPIT 189
Query: 239 ITLLFTWTNSVG--GDSEFTGQ-----------HYNSKTKMNDGVHAVLLHHRTSHQL-- 283
+++L +W N VG +SE + + Y + + G L+ + +
Sbjct: 190 LSILLSWQNMVGWFTNSESSPEIQQREDGSPFYDYGPPLRQSAGNFNQLVSDERAIGMVM 249
Query: 284 ----------PPVTFALAAQETDGV--HVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFD 331
F +A T+G+ H PH D+W HG+
Sbjct: 250 DGAWTDSPGEGDGQFCIAVPATEGITYHTRWNPHG---------DGGDLWESFAGHGTLS 300
Query: 332 RLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTK 391
+ ++V E +G AIA T+ P ++ F L WD P F +GK YRRYT
Sbjct: 301 NVADTTSAVKDE---QVGGAIAIKFTLAPGETQELPFVLTWDLPVTEFATGKLAYRRYTD 357
Query: 392 FYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGA 451
F A +A++A+ + P W
Sbjct: 358 FCDCSGRNAWQLAQEALANY--------------------PTW----------------- 380
Query: 452 VWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAV-NILERMSSILEQIYTP 510
R+ +DW Q L R D+P+ A+ N L M+S
Sbjct: 381 -----------------RQNIIDWQQPILDR-PDLPDWFKMALFNELYDMAS-------- 414
Query: 511 VALNSSFGTNLLQDGEE--NIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQ 568
G L + +GQF LE I+Y + + DV Y F+ +ML+P+++ +I
Sbjct: 415 -------GGTLWSAASDIDPVGQFGVLECIDYRWYESLDVRLYGGFSTLMLWPELEKAII 467
Query: 569 RDFAAAVMMHDPSK------MKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYD 620
R FA A+ DP K + + RK+ GA PHD+G + PW N D
Sbjct: 468 RAFARAIPTEDPKKRIIGYYYTVGESEHLAPRKLKGATPHDLGAPNENPWIATNYTSYQD 527
Query: 621 TARWKDLNPKFVLQVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGF 678
+WKDL FVLQVYR V TG D +F WP+V + Y+ FD DGDG+ EN G
Sbjct: 528 CNQWKDLPSDFVLQVYRAFVLTGGDDLEFLAECWPAVTQTLQYLKLFDLDGDGVPENGGA 587
Query: 679 PDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD--RGSEDYFLFKFQ------KAK 730
PDQT+D W + G+SAY GGLW+AAL+AA + + + S + + K Q + K
Sbjct: 588 PDQTFDDWQLKGLSAYCGGLWLAALEAAIKMGDLLQQHRKISGSFAIVKSQYQRWIKQGK 647
Query: 731 VVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
+VY+ KLW GS++ D+ S S + ADQL GQ+ AR L IVD++ AL +Y+
Sbjct: 648 IVYQNKLWTGSFYRLDS--ESNSDVVMADQLCGQFCARLMDLPDIVDQEFIDIALDTIYD 705
Query: 790 YNVLKV---------------------------MGGKRGAVNGMLPDGRVDM-SSMQSRE 821
+K +G GA NG+ PDG + E
Sbjct: 706 ACFVKFNEYAAKHSTPQNQKFEGSQAGFFSAAQLGMAIGAANGVKPDGSPENPDDTHQLE 765
Query: 822 IWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCY 881
+W+G+ + VAA + D + + ++ G F+TPEA +R+ Y
Sbjct: 766 VWTGINFGVAAFLAQMGKRDQAMAITEAVVQQIYN---YGLQFRTPEAITAMGTFRACHY 822
Query: 882 MRPLAIWAM 890
+RP+AIW +
Sbjct: 823 LRPMAIWGL 831
>gi|254423918|ref|ZP_05037636.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
gi|196191407|gb|EDX86371.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
Length = 828
Score = 306 bits (784), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 263/903 (29%), Positives = 390/903 (43%), Gaps = 204/903 (22%)
Query: 85 PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQI--FPRVCEDKPVLANQFSV 137
P+I R+ ++ HG PLGG G+G IGRS G F W + V + P QFSV
Sbjct: 26 PYIVRYASNLDDGPEHGAPLGGFGAGCIGRSPHGTFNLWHLDGGEHVFDSFP--GCQFSV 83
Query: 138 FVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKG----DKSTYHALYPRAWTVHE 193
F ++G+ + + PE + SW W G + TYHALYPR+W ++E
Sbjct: 84 FEQSADGETKAYAMSSDKPET-------ELASWQWYPAGKTTENTGTYHALYPRSWYLYE 136
Query: 194 GEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS 253
+L C Q+SPI +NY ESSYP + F +T +N ++++ +W N VG
Sbjct: 137 NIFKADL--TCEQLSPIWANNYIESSYPTATFIWTAHNHTDQPLTLSIMLSWENMVGW-- 192
Query: 254 EFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSL-CPH-----FV 307
FT NS+T + R P + +E+ G + +L C V
Sbjct: 193 -FT----NSETSPE-------IQQREDGS-PFYDYVPLLRESKGNYNTLVCEEGVARGVV 239
Query: 308 ISGNSLGLTAK-----------------------------DMWHEIKEHGSFDRLNSMET 338
+ GN G A+ ++W++ G RL+++E
Sbjct: 240 LDGNWTGSPAEGQGQLAIAAHHLEAEVSYHTRWHPDGDGSELWNDFSTTG---RLSNLE- 295
Query: 339 SVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQN 398
V G IG AIA T+ P Q+ F+L+WD P F GK YRRYT F+
Sbjct: 296 DVDQSGGERIGGAIALRFTLAPGETKQIPFTLSWDLPVTEFAKGKRAYRRYTDFFDRSGR 355
Query: 399 AAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
A IA A+ + WQ+ I+
Sbjct: 356 NAWVIASTALNNYA-------IWQQNII-------------------------------- 376
Query: 459 PVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFG 518
+W Q L R D+PN A + ++Y + + +
Sbjct: 377 ---------------NWQQPILDR-TDLPNWFKMA---------LFNELYDLASGGTLWS 411
Query: 519 TNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMH 578
D +GQF LE I+Y + + DV Y +AL++L+P+++ SI R FA A+
Sbjct: 412 AASKSD---PVGQFGVLECIDYRWYESLDVRLYGGYALLLLWPELEKSILRAFARAIPTA 468
Query: 579 DPSKMKLL-------DDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNP 629
+ +K +++ + RK+ GA PHD+G + PW N D +WKDL+
Sbjct: 469 N-AKERIIGYYYTVGEADHMAPRKLKGATPHDLGAPNENPWMATNYTSYQDCNQWKDLSS 527
Query: 630 KFVLQVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWS 687
FVLQVYR +TG D F WP++ + Y FD DGDG+IEN G PDQT+D W
Sbjct: 528 DFVLQVYRAFESTGSVDMDFLLDCWPAIVETLGYTKLFDTDGDGLIENSGAPDQTFDDWK 587
Query: 688 VSGISAYSGGLWVAALQAASALAREVGD----RGSEDYFLFKF----QKAKVVYEK-LWN 738
+ GISAY GGLW+AAL AA + + + G+ L ++ Q + Y+K LWN
Sbjct: 588 LQGISAYCGGLWIAALSAAIKIGEILQESKRVEGTIVILLAQYRRWWQHGRTAYQKQLWN 647
Query: 739 GSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKV--- 795
G Y+ D S S + ADQL GQ+ A GL +VD + L+ +Y ++
Sbjct: 648 GEYYRLDT--GSGSDVVMADQLCGQFCASTMGLPDVVDSEFVEPTLRAIYEACFVRFNQY 705
Query: 796 -----------------------MGGKRGAVNGMLPDGR-VDMSSMQSREIWSGVTYAVA 831
+G K G NG+ PDG + E+W+G+ + +A
Sbjct: 706 TAQLGPQNQKFEGAQTGYFSASELGVKVGCANGVRPDGSPQNPDDTHQLEVWTGINFGLA 765
Query: 832 ASMIHE----DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAI 887
A + HE + +I +YE G F+TPEA +R+ Y+RP+AI
Sbjct: 766 AFLAHEGKLHEAMEITEAVVRQVYEH-------GLQFRTPEAITAVGTFRACHYLRPMAI 818
Query: 888 WAM 890
WA+
Sbjct: 819 WAV 821
>gi|301628717|ref|XP_002943495.1| PREDICTED: non-lysosomal glucosylceramidase [Xenopus (Silurana)
tropicalis]
Length = 692
Score = 296 bits (759), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 179/471 (38%), Positives = 237/471 (50%), Gaps = 100/471 (21%)
Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPN 488
K LP WY LFNELY++ GG VW + V +
Sbjct: 301 KDLPAWYKSALFNELYFMADGGTVWVE----------------------------VSAAD 332
Query: 489 QNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDV 548
+D N++ R L + ++LQ+ G+F YLEG EY M+NTYDV
Sbjct: 333 DDD---NLMRRGCKDLPGM-----------KDVLQE----YGRFAYLEGQEYRMYNTYDV 374
Query: 549 HFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD 608
HFY+SFALIML+P++++S+Q D AA+V+ DP K L G
Sbjct: 375 HFYASFALIMLWPQLEISLQYDMAASVLQEDPELRKYLMSGTVAP--------------- 419
Query: 609 PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRD 668
V + Y + P LQ M +FD D
Sbjct: 420 ----VKSRSYYYSG------PSLALQT----------------------VMETSLKFDED 447
Query: 669 GDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK 728
GDG+IEN GF DQTYD W ++G S+Y GGLW+AA+ +A +GD +++ F +
Sbjct: 448 GDGLIENSGFADQTYDDWVMTGPSSYCGGLWLAAVCMMCKMAEVLGDGAAQEKFTDILHR 507
Query: 729 AKVVYEKL-WNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKARSA 783
K +EKL WNG Y+N+D S+S+ DQ AG W+ RACGL + +D SA
Sbjct: 508 GKAAFEKLLWNGKYYNFDCGDQPYSNSVMVDQCAGNWFLRACGLGAGESEVFPKDHVVSA 567
Query: 784 LKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIG 843
L+ ++ NV + G+ GAVNGM PDG D SS+QS E+W GV Y +AA+MIHE L G
Sbjct: 568 LRTIFELNVKQFADGQMGAVNGMRPDGTTDTSSVQSDEVWIGVVYGLAATMIHEGLVQEG 627
Query: 844 FQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
F TA G Y W LG +FQTPEA+ +RSL YMRPL+IWAMQ AL
Sbjct: 628 FATAEGCYRTVWE--RLGMSFQTPEAYCEKQVFRSLAYMRPLSIWAMQLAL 676
>gi|159896834|ref|YP_001543081.1| hypothetical protein Haur_0301 [Herpetosiphon aurantiacus DSM 785]
gi|159889873|gb|ABX02953.1| protein of unknown function DUF608 [Herpetosiphon aurantiacus DSM
785]
Length = 774
Score = 295 bits (754), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 159/371 (42%), Positives = 228/371 (61%), Gaps = 6/371 (1%)
Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL 586
+++G F YLE +Y + T DV FYSS++++ L+P+++ F+ V D + + ++
Sbjct: 400 DDVGLFSYLECYDYPFYGTLDVSFYSSWSILALWPELERGEILAFSKTVNDADDTVVTIV 459
Query: 587 DDGQWVSRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGD 644
RK GA+PHD+G P + NAY D WKDLN K++L++YRDV D
Sbjct: 460 ATQVPAIRKAAGALPHDLGAPKEQPLIKTNAYDFQDINNWKDLNLKYILRIYRDVSLWND 519
Query: 645 KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
+ +A W ++ A+ Y+ QFD DGDG++++ G DQTYDTW++SG ++YS L + AL+
Sbjct: 520 QAMLEATWDTIPTALEYVHQFDSDGDGLLDHSG-ADQTYDTWAMSGAASYSASLLICALE 578
Query: 705 AASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQ 763
AA LA+ +GD D + A+ +E KLWNG+YF Y + + I ADQL GQ
Sbjct: 579 AAIRLAQRMGDHAQADAWSEWLAAARQSFETKLWNGTYFRYHTADTDLREVIMADQLVGQ 638
Query: 764 WYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIW 823
WYA A GL + + RSAL+ VY +NV++ G GAVNGM PDG VD SS Q+ E+W
Sbjct: 639 WYAGAIGLPAVAPREMIRSALQTVYRFNVMQYANGALGAVNGMHPDGTVDTSSNQASEVW 698
Query: 824 SGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMR 883
SG TYA+AA M+ E L G+QTA G Y A ++ GL F+TPEAW + +R+ YMR
Sbjct: 699 SGTTYAIAAMMLQEGLDLEGWQTAWGAYNATYNELGLW--FRTPEAWGIERTFRASMYMR 756
Query: 884 PLAIWAMQWAL 894
P +IWA++ AL
Sbjct: 757 PQSIWAIEHAL 767
Score = 214 bits (545), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 128/367 (34%), Positives = 193/367 (52%), Gaps = 26/367 (7%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+P+GG+GSG+IGR++RG++ RW + + V NQ+SVF ++ Q+ + VLC
Sbjct: 43 HGLPIGGMGSGAIGRNFRGDWSRWHLEVGKHVHRSVWPNQWSVFW-QTASQQAAQVLCTT 101
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
P+ + + SW+WN YHAL+PRAW ++ PD L +V Q SP++ N
Sbjct: 102 QPDTDE------LSSWNWNYPVGAGNYHALFPRAWFDYQ-HPDWPLELVQEQFSPVLAGN 154
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS---EFTGQHYNSKTKMNDGVH 271
KESS+PV VFT+ + N G + + L+ TW ++ ++ QH + NDG
Sbjct: 155 LKESSFPVGVFTWRVTNRGSETVRLGLMLTWEHTRAVEAAGLSLQRQH----SAWNDGNT 210
Query: 272 AVLLHHRTSHQLPPV---TFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHG 328
+ + +TS Q T+ALA Q + VS + ++ ++ L W + G
Sbjct: 211 SGVTLTQTSDQALSSHNGTWALAVQAPESASVSQWTCWDVAQDAAAL-----WQDFASDG 265
Query: 329 SFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRR 388
+L TS AIA ++ + P + + FSLAWD P V F +Y+R
Sbjct: 266 ---QLADYPTSQKVAADQRSATAIAVTLELAPGASAVIPFSLAWDFPIVEFADQSRWYKR 322
Query: 389 YTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNA 448
YT+F+GT+ + A +A ++ +W IEAWQ PIL D + P WY LFNELYYL
Sbjct: 323 YTRFWGTNGDQAQALAVASLTNADAWRTAIEAWQNPILADDQRPFWYKSALFNELYYLVD 382
Query: 449 GGAVWTD 455
GG +W D
Sbjct: 383 GGTLWVD 389
>gi|390344669|ref|XP_001200431.2| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Strongylocentrotus purpuratus]
Length = 347
Score = 291 bits (745), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 152/325 (46%), Positives = 199/325 (61%), Gaps = 9/325 (2%)
Query: 579 DPSKMKLLDDGQWVSRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQVY 636
D + L G RK G VPHD G DPW ++N Y +DTA WKDLN K VL VY
Sbjct: 13 DEDSFQDLSSGLRSIRKSAGCVPHDSGNPEEDPWLKLNCYWFHDTAEWKDLNLKMVLMVY 72
Query: 637 RDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSG 696
RD AT D++F +WP MA + D+DGDG+I++ G DQTYD W+ G SAY G
Sbjct: 73 RDYFATKDREFLDFMWPKCMTVMAKANSQDKDGDGLIDHFGTADQTYDVWTAKGCSAYCG 132
Query: 697 GLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSI 755
GL++AAL+ +A +GD ++ + Q+ K YE KLWNG YFNYD+S S SI
Sbjct: 133 GLYLAALKCMCEMADLLGDDKAKRQYSEILQRGKATYEKKLWNGRYFNYDSSDQSYHDSI 192
Query: 756 QADQLAGQWYARACGLLPIVDE----DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
ADQ G WY RAC L+P D+ D RSAL+ +++ NV+ V G GA+NG+ P G+
Sbjct: 193 MADQTCGHWYLRACDLVPERDQVFQVDHVRSALRTIFDMNVMGVKEGNFGAMNGIRPSGK 252
Query: 812 VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWN 871
D +S+Q E+W G TY +A +MI E + + GF+TA G Y + GL A+Q PEA+
Sbjct: 253 PDHTSLQGEEVWVGTTYGLAGNMIQEGMWEEGFRTAKGCYTTCYEQAGL--AYQVPEAYM 310
Query: 872 TDDQYRSLCYMRPLAIWAMQWALTR 896
+ YRSL YMRPLAIWAMQWA+ +
Sbjct: 311 SKKIYRSLGYMRPLAIWAMQWAVEK 335
>gi|94968715|ref|YP_590763.1| hypothetical protein Acid345_1688 [Candidatus Koribacter versatilis
Ellin345]
gi|94550765|gb|ABF40689.1| protein of unknown function DUF608 [Candidatus Koribacter
versatilis Ellin345]
Length = 811
Score = 290 bits (741), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 165/376 (43%), Positives = 229/376 (60%), Gaps = 24/376 (6%)
Query: 532 FLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQW 591
F YLE +Y + T DV FY SF L +P+I+ R++A + P + W
Sbjct: 438 FSYLECFDYAYYGTLDVRFYGSFPLAKFWPEIEKQEMREYADTI----PENIDQKYIWAW 493
Query: 592 VS----------RKVLGAVPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDV 639
S RKV G+ PHD+G + D N Y + WKDLN K+VL V+RD
Sbjct: 494 KSDNEHKLEPTQRKVAGSAPHDLGSPLEDALILPNQYNYQSVSFWKDLNSKYVLMVWRDY 553
Query: 640 VATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGG 697
V +G DK F + + +V +M ++ QFD++ DG+IENDG+PDQTYD WS G SAYSG
Sbjct: 554 VMSGSKDKAFLQYNYNAVKQSMQFLRQFDKNNDGLIENDGYPDQTYDNWSARGESAYSGS 613
Query: 698 LWVAALQAASALAREVGDR--GSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSI 755
L++AAL+A +A+ +GD+ ++ LFK + V +KLWNG+YFNYD GSS +I
Sbjct: 614 LYLAALRATEEMAKLLGDQRTAADTAALFKRAQDSFV-KKLWNGTYFNYD-VGSSYKDAI 671
Query: 756 QADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMS 815
A+QLAGQWYA GL +V D SALKKVY++NV+K+ G GA+NG+ G V
Sbjct: 672 MAEQLAGQWYASLTGLGDLVPRDMQHSALKKVYDFNVMKLQNGTMGALNGISASGEVLKD 731
Query: 816 SMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQ 875
+ Q+ E+W+GVT+AVAA+M+ L + GF TA G+Y + GY F+TPEA++T+
Sbjct: 732 NEQTEEVWTGVTFAVAATMLQNGLREEGFNTAKGVYNVVYDQK--GYWFRTPEAYDTNGM 789
Query: 876 YRSLCYMRPLAIWAMQ 891
YR+ YMRP AIW+M+
Sbjct: 790 YRASMYMRPGAIWSME 805
Score = 210 bits (534), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 125/386 (32%), Positives = 191/386 (49%), Gaps = 41/386 (10%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
G P+GG G+G+ R+YRG F+RW + + + + V ANQF+VF + VL +
Sbjct: 61 QGAPVGGFGAGTFSRTYRGNFERWHVKAGIHKYQNVPANQFAVFAQADGDAAVAQVLSTQ 120
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
PE + SW+W Y ALYP++W ++ P + + Q SP+IP N
Sbjct: 121 KPE------HGELSSWNWTYPVGAGDYAALYPKSWFAYK-SPQLPITLTLEQFSPVIPDN 173
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTG-----------QHYNSK 263
YKESSYPV+V+ + + N + +++LF+WTN VG + T HY S+
Sbjct: 174 YKESSYPVAVYNWYVTNPSQKRVTVSILFSWTNMVGWFRDTTTGFGAALNSQDINHYKSE 233
Query: 264 TKMNDGVHAVLLHHRTSHQLPPVT------FALAAQETDGVHVSLCPHFVISGNSLGLTA 317
+ + ++ S PV+ FA+A+ GV +S + SG+
Sbjct: 234 AIKSSQMQGIVFDRLRSE---PVSDEWDGQFAIASLAGSGVEISYLTTYRPSGD-----G 285
Query: 318 KDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEV 377
D+W + G RL ++ + S G S+ AIA T+ P + + +LAWD P +
Sbjct: 286 SDVWTPFAKDG---RLPNVAAGLASS-GDSLAGAIAVRFTLAPGEKRTIPMALAWDLPFI 341
Query: 378 NFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPI 437
F SG+ + R YT+F+ A +AR A+ + W QI+AWQ+P + D+ P WY
Sbjct: 342 QFGSGRKWARHYTRFFDRFGTNAWKLARTALEQGNDWSRQIDAWQKPYISDESKPLWYRG 401
Query: 438 TLFNELYYLNAGGAVW-----TDGSP 458
LFNELY L GG +W T G+P
Sbjct: 402 MLFNELYDLVDGGTMWAHEVGTPGNP 427
>gi|148670515|gb|EDL02462.1| glucosidase beta 2, isoform CRA_b [Mus musculus]
Length = 377
Score = 289 bits (740), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 148/308 (48%), Positives = 192/308 (62%), Gaps = 7/308 (2%)
Query: 537 GIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKV 596
G EY M+NTYDVHFY+SFAL+ML+PK++LS+Q D A A + D ++ + L G K
Sbjct: 19 GQEYRMYNTYDVHFYASFALVMLWPKLELSLQYDMALATLKEDLTRRRYLMSGVVAPVKR 78
Query: 597 LGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
+PHDIG D PW VNAY ++DTA WKDLN KFVLQ+YRD TGD+ F + +WP
Sbjct: 79 RNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPV 138
Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
M +FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G
Sbjct: 139 CLAVMESEMKFDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCG 198
Query: 715 DRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL-- 771
+ ++ F + + YE+ LWNG Y+NYD+S QS SI +DQ AGQW+ RACGL
Sbjct: 199 AQDVQERFASILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGE 258
Query: 772 --LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYA 829
+ AL+ ++ NV GG GAVNGM P G D SS+QS E+W GV Y
Sbjct: 259 GDTEVFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYG 318
Query: 830 VAASMIHE 837
+AA+MI E
Sbjct: 319 LAATMIQE 326
>gi|374311163|ref|YP_005057593.1| glucosylceramidase [Granulicella mallensis MP5ACTX8]
gi|358753173|gb|AEU36563.1| Glucosylceramidase [Granulicella mallensis MP5ACTX8]
Length = 811
Score = 287 bits (735), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 165/380 (43%), Positives = 220/380 (57%), Gaps = 18/380 (4%)
Query: 532 FLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQW 591
F LE +Y + T DV FY+S L+ +P I + R+FA V P + + Q
Sbjct: 439 FALLECFDYAYYGTLDVRFYASLPLLKFWPDIDKQVLREFADTVPKEWPEQGLWVWKTQQ 498
Query: 592 VS------RKVLGAVPHDIGI--CDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
V RK +GAVPHD+G+ DP+ VN DT WKDLN KFVL VYRD V TG
Sbjct: 499 VGEPITHKRKKIGAVPHDLGVPEGDPFVAVNEPGWQDTNDWKDLNSKFVLMVYRDYVLTG 558
Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
D F + WP+V A+ Y+ QFD G G+ EN G+PDQTYD W V G+SAYSGGLW+A
Sbjct: 559 RKDTAFLRETWPAVKAAIEYLRQFDHGG-GVPENSGYPDQTYDDWVVRGVSAYSGGLWLA 617
Query: 702 ALQAASALAREVGD-RGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQL 760
AL+A AR VGD + + +Y + K +LWNG YF YD S S+ IQADQL
Sbjct: 618 ALRAGEETARVVGDTKTTAEYHALFLKGQKTYISQLWNGEYFRYDTSSESK-DDIQADQL 676
Query: 761 AGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSR 820
AGQWYA GL IV SA KK+++ NV+K G+ GA NGM DG + +++ +++
Sbjct: 677 AGQWYANLTGLGEIVPHTMQVSAAKKIFDVNVMKFGHGEMGAANGMTADGAI-LTNAEAK 735
Query: 821 EIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLC 880
E+W G T A ++ E + D ++T G+Y + GY F+TPEAW+ +R+
Sbjct: 736 EVWVGTTLGYAGLLMQEGMNDEAWKTTRGLYHVIYEDK--GYWFRTPEAWDITGNFRAGM 793
Query: 881 YMRPLAIWAMQWALTRPKPK 900
YMRP AIWA++ +T P K
Sbjct: 794 YMRPTAIWALE--MTSPTAK 811
Score = 197 bits (500), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 142/438 (32%), Positives = 201/438 (45%), Gaps = 51/438 (11%)
Query: 51 TLNWKETVQLA-PIGVRILCLIREEAAKG---------KRAFIDPFIKRHLTSS------ 94
T W T LA PI LCL A G KRA P +T +
Sbjct: 4 TSKWISTRGLAFPILAFTLCLSMASTAWGGDGIPKAAWKRALGLPLENPGVTRAAGDIDD 63
Query: 95 ---HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV-SRSNGQKYSSV 150
GVP+GG G+G+ RSYRG+F RW I V + +PV ANQF++F S + S
Sbjct: 64 GYWQGVPVGGFGAGTFSRSYRGDFARWHIKAGVHKYEPVYANQFAMFQQSEGDAHGTSQA 123
Query: 151 LCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
L P + + SW W+ Y+ L+P+AW ++ + P + Q SP+
Sbjct: 124 LMNGHP------SNGELSSWQWDYPVGAGDYYGLFPKAWFDYKWDKFPA-HVTLEQFSPV 176
Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFT---------GQH-- 259
+P+NY+ESSYPV+V+ + N + +++L +WTN G +T G H
Sbjct: 177 LPNNYRESSYPVAVYRWHAENPTNKTVVVSVLLSWTNMSGWFRTYTRDFKGAPNQGNHNE 236
Query: 260 YNSKTKMNDGVHAVLLHHRTSHQLPPV----TFALAAQETDGVHVSLCPHFVISGNSLGL 315
Y S+ + G ++ R P FA+AA E+ GV VS + SG+
Sbjct: 237 YVSEKVGDAGTMKGIVFDRNRAGTAPNDWDGQFAIAAMESPGVEVSYQTTYQASGD---- 292
Query: 316 TAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCP 375
K +W + G RL + S S+ + AIA T+ P + V ++WD
Sbjct: 293 -GKAVWSAFSKDG---RLANDGKSWVSDK-EKLAGAIALRFTLRPGEKKVVPMVVSWDFS 347
Query: 376 EVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWY 435
+ F G+ + RRYT FYGT A IARD +L W +I+ WQ P + D+ P WY
Sbjct: 348 LIQFGEGRKWDRRYTDFYGTSGQNAWKIARDGLLHATEWSDEIDKWQAPYITDESKPLWY 407
Query: 436 PITLFNELYYLNAGGAVW 453
LFNELY L GG W
Sbjct: 408 RGMLFNELYALTDGGTFW 425
>gi|353232146|emb|CCD79501.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
Length = 725
Score = 287 bits (735), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 209/657 (31%), Positives = 319/657 (48%), Gaps = 84/657 (12%)
Query: 71 IREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPV 130
I++ + + FID +GVP+GG+GSGSIGR YRGEF R + P +
Sbjct: 77 IKKRFIQHRLPFIDAVSHVPWRPIYGVPIGGIGSGSIGRGYRGEFCRSSLIPGMYSYDVQ 136
Query: 131 LANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWT 190
+QF V V ++ Y VL P T + A + W W +K Y LYPR+WT
Sbjct: 137 PVDQFIVTVRKNGVTVYHQVLSPLTKP---PSNAKKLKKWKWGFDPEKGHYLGLYPRSWT 193
Query: 191 VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG 250
V+E P+ +L +V +QISP+IPH+Y+ + PV+VF + + + K + +T+ F+W
Sbjct: 194 VYEI-PELQLVLVYQQISPVIPHDYQTTCLPVAVFYWKVISWNKENLKVTITFSWHGPNH 252
Query: 251 GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISG 310
+ S+ N + H T+ F +Q + +G
Sbjct: 253 RKRTKSSDFSKSEDGNNSSENTTNTHVSTAEN----AFPTCSQRSTAFS---------TG 299
Query: 311 NSLGLTAKDMWHEIKEHGSFDRLN--------SMETSVTSEPGSSIGAAIAASVTVP--- 359
+ L L + K+ +F L+ ++ ++ + A++A+ VP
Sbjct: 300 DFLKLNCQG-----KKLKTFTDLSWLGWLVGYTVPLDSKAKKSPKLAIAVSATTNVPRCN 354
Query: 360 -----PDSEGQVTFSLAWDCPEVNFMSGKTYY-RRYTKFYGTHQNAAANIA-RDAILEHG 412
S+ ++ F++ W P V F +G Y RRY +++ + + A + AI
Sbjct: 355 ITDNLNPSQSELEFAITWHSPIVKFRTGDVVYTRRYVRWFPVDRISGAKLLLAHAIDNWR 414
Query: 413 SWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFS 472
W +IE WQ PIL ++ LP WY LFNELYYL+ GG VW D P+
Sbjct: 415 QWVQKIEDWQNPILSNQSLPSWYKSALFNELYYLSDGGTVWLD---PIQ----------- 460
Query: 473 LDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQF 532
+D +SDL + + D V+ + + + IG F
Sbjct: 461 VDCFKSDLLNLTE-----DMKVDSWDHRARL----------------------SREIGLF 493
Query: 533 LYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWV 592
YLEG EY M+NTYDVH Y+S+ALI L+PKIQL++ D A + D + + + +G+ +
Sbjct: 494 GYLEGHEYRMYNTYDVHHYASWALIKLWPKIQLALNYDCADLTIAEDSTSVYYIGNGKAL 553
Query: 593 SRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKA 650
R AV HD G +PW NAY ++ T WKDLN KF+LQV+RD T D ++
Sbjct: 554 FRSSECAVVHDFGDPEDEPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLY 613
Query: 651 VWPSVY-VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
+ P V + + +D D DG+IEN GFPDQTYD W+ G++AY+GG+W++ L A
Sbjct: 614 MLPIVLRILRKSLVAWDSDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYAT 670
>gi|345293661|gb|AEN83322.1| AT5G49900-like protein, partial [Capsella grandiflora]
gi|345293663|gb|AEN83323.1| AT5G49900-like protein, partial [Capsella grandiflora]
gi|345293665|gb|AEN83324.1| AT5G49900-like protein, partial [Capsella grandiflora]
Length = 156
Score = 285 bits (728), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 130/155 (83%), Positives = 144/155 (92%)
Query: 540 YLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGA 599
Y MWNTYDVHFY+SFAL+MLFPK++LSIQRDFAAAVM+HDP+K+K L +GQWV RKVLGA
Sbjct: 1 YRMWNTYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGA 60
Query: 600 VPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAM 659
VPHD+GI DPWFEVN Y L++T RWKDLNPKFVLQVYRDVVATGDKKFA AVWPSVYVAM
Sbjct: 61 VPHDLGINDPWFEVNGYSLHNTDRWKDLNPKFVLQVYRDVVATGDKKFAVAVWPSVYVAM 120
Query: 660 AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAY 694
AYM QFD+DGDGMIEN+GFPDQTYDTWS SG+SAY
Sbjct: 121 AYMAQFDKDGDGMIENEGFPDQTYDTWSASGVSAY 155
>gi|345293667|gb|AEN83325.1| AT5G49900-like protein, partial [Capsella rubella]
gi|345293669|gb|AEN83326.1| AT5G49900-like protein, partial [Capsella rubella]
gi|345293671|gb|AEN83327.1| AT5G49900-like protein, partial [Capsella rubella]
gi|345293673|gb|AEN83328.1| AT5G49900-like protein, partial [Capsella rubella]
gi|345293675|gb|AEN83329.1| AT5G49900-like protein, partial [Capsella rubella]
Length = 156
Score = 284 bits (726), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 130/155 (83%), Positives = 144/155 (92%)
Query: 540 YLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGA 599
Y MWNTYDVHFY+SFAL+MLFPK++LSIQRDFAAAVM+HDP+K+K L +GQWV RKVLGA
Sbjct: 1 YRMWNTYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGA 60
Query: 600 VPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAM 659
VPHD+GI DPWFEVN Y L++T RWKDLNPKFVLQVYRDVVATGDKKFA AVWPSVYVAM
Sbjct: 61 VPHDLGINDPWFEVNGYNLHNTDRWKDLNPKFVLQVYRDVVATGDKKFAVAVWPSVYVAM 120
Query: 660 AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAY 694
AYM QFD+DGDGMIEN+GFPDQTYDTWS SG+SAY
Sbjct: 121 AYMAQFDKDGDGMIENEGFPDQTYDTWSASGVSAY 155
>gi|449674846|ref|XP_002159451.2| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Hydra
magnipapillata]
Length = 751
Score = 283 bits (725), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 168/428 (39%), Positives = 238/428 (55%), Gaps = 62/428 (14%)
Query: 317 AKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPE 376
A ++W ++ + G D N S S PG + AA+AA TV ++ + F L+WD P
Sbjct: 381 AAELWEDLNDDGVLD--NKRGDSCYSYPGQLLAAAVAAKTTVHAGAKQDLKFVLSWDQPF 438
Query: 377 VNFMS--GKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEW 434
V F S G +Y RRYT+F+G++ NA+ ++ A+ WE IE WQ PIL D LP W
Sbjct: 439 VTFGSDSGLSYCRRYTRFFGSNGNASPSLCCYALQNIDFWEDNIERWQNPILNDSELPAW 498
Query: 435 YPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAV 494
+ LFNELY+++ GG VW +D+ + D+
Sbjct: 499 FKSALFNELYFISDGGTVW------------------------------LDIKRKKDSHG 528
Query: 495 NILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSF 554
N + S L ++Y G+F YLEG EY M+NTYDVHFY+S+
Sbjct: 529 NFI----SDLVKLY---------------------GRFAYLEGHEYRMYNTYDVHFYASW 563
Query: 555 ALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFE 612
AL +L+P +Q+S+Q D +V+ D K ++ +G RKV+G VPHD+G D P+
Sbjct: 564 ALALLWPNLQISMQYDICNSVLNVDDEKRPIMMEGGIAPRKVIGCVPHDLGDPDDEPFVR 623
Query: 613 VNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGM 672
VNAYC++DT++WKDLN FVLQVYRD T +K+F +WP++ AMA+ D DGDG+
Sbjct: 624 VNAYCIHDTSQWKDLNLHFVLQVYRDFYITKNKQFLMDMWPAMKAAMAHSLAQDIDGDGI 683
Query: 673 IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVV 732
IEN G D T+DTW V+G SAY+GGLW++AL+ +A +G S + ++ K
Sbjct: 684 IENSGIADSTFDTWVVTGPSAYTGGLWLSALRCIIEIAHILGLSQSISKYKSILERGKKS 743
Query: 733 YE-KLWNG 739
YE KLWNG
Sbjct: 744 YEKKLWNG 751
Score = 163 bits (413), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 78/194 (40%), Positives = 114/194 (58%), Gaps = 3/194 (1%)
Query: 70 LIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKP 129
R+ KGK FIDP+ +GVPLGG+G+GSIGR ++G+F RWQ+ P +
Sbjct: 3 FTRKVKKKGKLPFIDPYAVSACKQIYGVPLGGIGAGSIGRGWKGDFNRWQLRPGIYTYDT 62
Query: 130 VLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAW 189
V NQF+V + + Y VL P P +K+ + + SW W+ G + YHALYPRAW
Sbjct: 63 VDVNQFTVCIRKCGKTIYQQVLAPYYPVAIKNVES--LQSWTWSFDGGRGYYHALYPRAW 120
Query: 190 TVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSV 249
TV+ + +++ CRQ+SPI P NYK++S PV VF +TI N G D++++ T+ N
Sbjct: 121 TVYH-ITEHNIKLTCRQVSPIFPDNYKDTSLPVGVFVWTIENIGMHDLDVSIMLTFQNGD 179
Query: 250 GGDSEFTGQHYNSK 263
G + + G H N +
Sbjct: 180 GTNEDCVGGHNNEE 193
>gi|256082539|ref|XP_002577512.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
Length = 717
Score = 283 bits (725), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 210/657 (31%), Positives = 316/657 (48%), Gaps = 92/657 (14%)
Query: 71 IREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPV 130
I++ + + FID +GVP+GG+GSGSIGR YRGEF R + P +
Sbjct: 77 IKKRFIQHRLPFIDAVSHVPWRPIYGVPIGGIGSGSIGRGYRGEFCRSSLIPGMYSYDVQ 136
Query: 131 LANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWT 190
+QF V V ++ Y VL P T + A + W W +K Y LYPR+WT
Sbjct: 137 PVDQFIVTVRKNGVTVYHQVLSPLTKP---PSNAKKLKKWKWGFDPEKGHYLGLYPRSWT 193
Query: 191 VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG 250
V+E P+ +L +V +QISP+IPH+Y+ + PV+VF + + + K + +T+ F+W
Sbjct: 194 VYEI-PELQLVLVYQQISPVIPHDYQTTCLPVAVFYWKVISWNKENLKVTITFSWHGPNH 252
Query: 251 GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISG 310
+ S+ N + H T+ F +Q + +G
Sbjct: 253 RKRTKSSDFSKSEDGNNSSENTTNTHVSTAEN----AFPTCSQRSTAFS---------TG 299
Query: 311 NSLGLTAKDMWHEIKEHGSFDRLN--------SMETSVTSEPGSSIGAAIAASVTVPP-- 360
+ L L + K+ +F L+ ++ ++ + A++A+ VP
Sbjct: 300 DFLKLNCQG-----KKLKTFTDLSWLGWLVGYTVPLDSKAKKSPKLAIAVSATTNVPRCN 354
Query: 361 ------DSEGQVTFSLAWDCPEVNFMSGKTYY-RRYTKFYGTHQNAAANIA-RDAILEHG 412
S+ ++ F++ W P V F +G Y RRY +++ + + A + AI
Sbjct: 355 ITDNLNPSQSELEFAITWHSPIVKFRTGDVVYTRRYVRWFPVDRISGAKLLLAHAIDNWR 414
Query: 413 SWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFS 472
W +IE WQ PIL ++ LP WY LFNELYYL+ GG VW D P+
Sbjct: 415 QWVQKIEDWQNPILSNQSLPSWYKSALFNELYYLSDGGTVWLD---PI------------ 459
Query: 473 LDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQF 532
Q DL + V + + A R+S IG F
Sbjct: 460 ----QVDLTEDMKVDSWDHRA-----RLS-------------------------REIGLF 485
Query: 533 LYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWV 592
YLEG EY M+NTYDVH Y+S+ALI L+PKIQL++ D A + D + + + +G+ +
Sbjct: 486 GYLEGHEYRMYNTYDVHHYASWALIKLWPKIQLALNYDCADLTIAEDSTSVYYIGNGKAL 545
Query: 593 SRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKA 650
R AV HD G +PW NAY ++ T WKDLN KF+LQV+RD T D ++
Sbjct: 546 FRSSECAVVHDFGDPEDEPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLY 605
Query: 651 VWPSVY-VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
+ P V + + +D D DG+IEN GFPDQTYD W+ G++AY+GG+W++ L A
Sbjct: 606 MLPIVLRILRKSLVAWDSDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYAT 662
>gi|290990718|ref|XP_002677983.1| predicted protein [Naegleria gruberi]
gi|284091593|gb|EFC45239.1| predicted protein [Naegleria gruberi]
Length = 937
Score = 283 bits (725), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 156/389 (40%), Positives = 223/389 (57%), Gaps = 24/389 (6%)
Query: 532 FLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQW 591
F YLEG EYLM NTYDVHFY+S A+IM +P+IQLSIQRD AV + ++
Sbjct: 550 FFYLEGHEYLMCNTYDVHFYASHAMIMNWPEIQLSIQRDIIRAVREENNQEVTFYLSNMK 609
Query: 592 VSRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAK 649
RKV G+VPHD+G PW VNAY D +WKDLN +F+L ++RD + T +++F +
Sbjct: 610 NRRKVKGSVPHDVGTPYERPWKLVNAYNFQDVNKWKDLNSQFILTIFRDFMITKNREFLE 669
Query: 650 AVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASA 708
+PS+ +A+ Y + FD+DGDG+IEN+ FPD TYD W V+G+SAYSG LW+A+L+A
Sbjct: 670 EAFPSIILALNYSLTNFDKDGDGLIENESFPDTTYDAWKVTGVSAYSGMLWIASLKAILE 729
Query: 709 LAREVGDRGSEDYFLFKFQKAKVVYE-KLWN--GSYFNYDNSGSSQSSSIQADQLAGQW- 764
+A+E+ D + + +E KLW+ Y++YD S Q SI +D L G +
Sbjct: 730 MAKELDDENTRQKVEPLIESGLKTFESKLWDETNQYYHYDASTEPQHDSIMSDHLHGTFM 789
Query: 765 ------YARACG-------LLPIVDEDKARSALKKVYNYNVLKVMGGKR--GAVNGMLPD 809
Y + G L+P + ++ +L K+ N +K + G VNGM P
Sbjct: 790 LLLIGNYVKNSGRCEDSHYLIPF-NMERVSKSLNKILENNFVKYQQITKLGGCVNGMRPT 848
Query: 810 GRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEA 869
VD +S+QSRE+W+G +Y VAA I + G +++ WS + FQTPEA
Sbjct: 849 QEVDKTSLQSREMWTGTSYVVAALTILLGRREEGIDLMKSVFDKCWSSNS-AFWFQTPEA 907
Query: 870 WNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
+YR+L Y RPL+IW+ A+ K
Sbjct: 908 VTETGEYRALGYCRPLSIWSCLSAMMNKK 936
Score = 119 bits (297), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 121/466 (25%), Positives = 203/466 (43%), Gaps = 67/466 (14%)
Query: 40 LSTGEIPLSQFTLNWKETVQLAPIGVRILCL--IREEAAKGK---------RAFIDPFIK 88
L IPL T + E P +R L + + E+ K + RA +DP K
Sbjct: 87 LQKTTIPLRAQTYSIGEMFHFLPGALRFLTMSYVPEKLIKKRETPFDFIMRRAVVDPRRK 146
Query: 89 RHLTSSHGVPLGGVG-SGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKY 147
G + +G +G RS+ Q I+ D + NQFS+ V R NG+
Sbjct: 147 GPPIGGIGAGMFTLGMNGWFTRSHLIFNQTGGIYS----DTVLPVNQFSLRVER-NGEIE 201
Query: 148 SSVLCP----KTPEVLKDTTAAGIGSW----DWNLKGDKSTY----HALYPRAWTVHEGE 195
S VL P K + KD +W D + +K + ++L+P++WTV++
Sbjct: 202 SFVLNPICEFKDGTMDKDLIKKLKHNWMFINDERRRKNKMQFSQQVYSLFPKSWTVYQT- 260
Query: 196 PDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEF 255
D + ++C + SP+ HNY+ESS+P F +TI N+ A+++LLFT + + +F
Sbjct: 261 SDEKFTVICERYSPVWAHNYQESSFPTCNFEWTIINNTDEEANVSLLFT----IQSEEKF 316
Query: 256 TGQHYNSKTKMN--DGVHAVLLHHRTSHQLP---PVTFALAAQETDGVHVSLCPHFVISG 310
++ S N D V + + + + F + A + G ++ F +
Sbjct: 317 RKKNEQSIIARNETDSVKCLTIETTVTDSFSHKDTIQFCIGANTSLG-SITFLDEFNGTN 375
Query: 311 NSLGLTAKDMWHEIKEHGSF-----DRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQ 365
S D+W E +G F + S+ AI + V +P +S Q
Sbjct: 376 ES---HLHDLWQEFNNNGEFSTEFCENFAKRHDWTKSQSKHLSCHAICSKVQIPSNSSKQ 432
Query: 366 VTFSLAWDCPEVNFMSGKT----YYRRYTKF-------YGTHQNAAA-NIARDAILEHGS 413
V F+L W+ P V F G T Y R YT+F H++ A + + ++
Sbjct: 433 VCFNLNWNAPFVRF--GPTMEFRYLRHYTQFCTDAISSITQHKDTACFEMCKRSLDSSKQ 490
Query: 414 WELQIEAWQRPILEDKRLPEWYPI---TLFNELYYLNAGGAVWTDG 456
W+ +I W + +++ ++ + +P LFNELYY+ GG +WT+G
Sbjct: 491 WKDEISQWHQGVID--QVGKDHPTAVAALFNELYYIVDGGTLWTNG 534
>gi|320105716|ref|YP_004181306.1| glucosylceramidase [Terriglobus saanensis SP1PR4]
gi|319924237|gb|ADV81312.1| Glucosylceramidase [Terriglobus saanensis SP1PR4]
Length = 794
Score = 275 bits (704), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 159/377 (42%), Positives = 216/377 (57%), Gaps = 28/377 (7%)
Query: 532 FLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQW 591
F LE +Y + T DV FY+S L+ +P+I + R+FA V + + + G W
Sbjct: 423 FALLECFDYAYYGTLDVRFYASLPLLKFWPEIDKRVLREFAETV------EKEWSEKGLW 476
Query: 592 V------------SRKVLGAVPHDIGI--CDPWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
V RK +GAVPHD+G+ DP++ VN DT WKDLN KFVL VYR
Sbjct: 477 VWKTAETGSPVLHKRKKVGAVPHDLGVPEGDPFYVVNEPGWQDTNDWKDLNSKFVLMVYR 536
Query: 638 DVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYS 695
D V TG DK F + WP+V A+ Y+ QFD G G+ EN G+PDQTYD+W V G+SAYS
Sbjct: 537 DYVLTGRTDKAFLQETWPAVKAAIQYLRQFDH-GRGVPENSGYPDQTYDSWVVRGVSAYS 595
Query: 696 GGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSS 754
GGLW+ AL+A AR +GD ++ + F A+ Y LWNG YF YD S++
Sbjct: 596 GGLWLGALRAGEETARVLGDWTAQAEYQALFAMAQKTYVATLWNGEYFLYDTDASNK-GV 654
Query: 755 IQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM 814
IQADQLAGQWYA GL +V RS LKK+Y +NV + G GAVNGM DG V +
Sbjct: 655 IQADQLAGQWYAHMTGLGDLVPRAMQRSVLKKIYAFNVNQFGDGNLGAVNGMNADGTV-V 713
Query: 815 SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDD 874
++++RE+W+G T AA + + + + G++ + GY F+TPEAW+
Sbjct: 714 DNVEAREVWAGTTLGYAALLKSLGIEEESYHVVHGLFHVIYESK--GYWFRTPEAWDVTG 771
Query: 875 QYRSLCYMRPLAIWAMQ 891
+R+ YMRP +W M+
Sbjct: 772 NFRASMYMRPAGLWGME 788
Score = 199 bits (505), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 126/370 (34%), Positives = 178/370 (48%), Gaps = 27/370 (7%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
G P+GG G+G+ RSYRG+F RW I V + AN+F++F Q+
Sbjct: 56 QGAPVGGFGAGTFSRSYRGDFARWHIKAGVHKYDVSYANEFAMF------QQVEGEATGV 109
Query: 155 TPEVLKDTTAAG-IGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPH 213
+L D G + SW W+ Y ALYP++W + E P + Q SP+IP
Sbjct: 110 AKVLLNDHPKQGELASWSWDYPVGAGEYAALYPKSWYDYRWEKFPA-HVTLEQYSPVIPD 168
Query: 214 NYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTG------QHYNSKTKMN 267
NY+ESSYP +V+ + N + +++L +WTN G FT N T N
Sbjct: 169 NYRESSYPTAVYRWHAENPTDKTVTVSVLLSWTNMSGWFRTFTHDFKGAPNQGNYDTYRN 228
Query: 268 DGVHAVLLHHRTSHQLPPV----TFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
+ ++ R+ + P FA+AA ET GV V+ F +G+ K +W
Sbjct: 229 EAGMKGIVFDRSRAGVAPNEWDGQFAIAAMETPGVEVTYQTSFQAAGD-----GKAVWSH 283
Query: 324 IKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
G RL + +TS S+ G + AIA T+ P + V +AWD P V F G+
Sbjct: 284 FARDG---RLPNDDTSWVSD-GEKLAGAIAVRFTLKPGEKKVVPMVIAWDLPVVQFGEGR 339
Query: 384 TYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNEL 443
+ R+YT FYGT A IAR+ +L W I+AWQ+P + D+ P WY LFNEL
Sbjct: 340 QWNRKYTDFYGTDGKNAWKIAREGLLHAQEWSDAIDAWQKPTITDESKPLWYRGMLFNEL 399
Query: 444 YYLNAGGAVW 453
Y L GG W
Sbjct: 400 YALTDGGTFW 409
>gi|195115764|ref|XP_002002426.1| GI12858 [Drosophila mojavensis]
gi|193913001|gb|EDW11868.1| GI12858 [Drosophila mojavensis]
Length = 877
Score = 273 bits (697), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 173/463 (37%), Positives = 242/463 (52%), Gaps = 78/463 (16%)
Query: 511 VALNSSFGTNL-LQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQR 569
+ +SS G L D G+F YLEG EY M+NTYDVHFY+S AL L+P +Q+S+Q
Sbjct: 414 LKCDSSLGNQLAFDDPRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQY 473
Query: 570 DFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDL 627
DF A+ K+L DG+ + RK+ VPHD+G D P+ +N Y ++D WKDL
Sbjct: 474 DFKDAIGAELNDTRKMLYDGKVMPRKIKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDL 533
Query: 628 NPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMD------QFDRDG------------ 669
N KFVLQVYRD + A+A S + ++ ++D Q+ +D
Sbjct: 534 NTKFVLQVYRDYYVLNELAQAQADNASKFSSIEFIDKESLYEQYSQDNNRKSSADGKQQN 593
Query: 670 ------------------DGMI---------------------ENDGF------PDQTYD 684
D M+ +NDG PDQTYD
Sbjct: 594 RKSASMYINETNGKVYLMDAMVYLKSMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYD 653
Query: 685 TWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFN 743
W + G SAY GLW+AALQA S +A + + +K K + EKLWNGSY+
Sbjct: 654 AWVMDGPSAYCSGLWLAALQAMSVMATLLDQPNDCLRYQDILEKGKRSLEEKLWNGSYYR 713
Query: 744 YDNSGSSQSSSIQADQLAGQWYARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGA 802
+D S + +I ADQL G WY ++CG I ++ R+ALK++Y+ NV+ G GA
Sbjct: 714 FDLS-HNHRDTIMADQLCGHWYLKSCGFDYEIYPKENVRTALKRIYDNNVMGFHDGNIGA 772
Query: 803 VNGMLPD-------GRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAW 855
NG + + G VD S++Q+ E+W GV YA+AA+MI E + D FQTA G+Y+
Sbjct: 773 ANGFIANVSEPSKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFDEAFQTAGGMYKTLS 832
Query: 856 SGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
G+ Y +TPEA + +YRS+ YMRPL+IW+MQ A R +
Sbjct: 833 QRIGMNY--ETPEALYGEKRYRSIGYMRPLSIWSMQVAWERRR 873
Score = 192 bits (487), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 130/417 (31%), Positives = 208/417 (49%), Gaps = 56/417 (13%)
Query: 77 KGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFS 136
+G+R ++D + + +GVP+GG+G GSIGR Y GEF R+Q+ P + E V ANQF
Sbjct: 13 EGRRVYMDYYYLENGKQIYGVPIGGIGGGSIGRGYAGEFCRFQLRPGIYEYNVVHANQFI 72
Query: 137 VFVSRSNGQK-YSSVL--CP--------KTPEVLKDTTAAG------------------- 166
V + G + S+L C KTP + A
Sbjct: 73 VTIKDHKGCTIFQSLLSRCSTKSFKAKTKTPSTNNNDIDASTDEAEAQSTKCQLPHCSCS 132
Query: 167 -----IGSWDWNLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISPIIPHNYKE 217
+ +W N++ + +Y LYPRAWT H G +R+VCRQISP+IPH+YKE
Sbjct: 133 RNRQPLSTWHSNIEDSRCSYTGLYPRAWTEYDLSHYG-----VRLVCRQISPVIPHDYKE 187
Query: 218 SSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHH 277
SS P +VF +++ N +++ FT+ N G + + ++ V +
Sbjct: 188 SSLPCAVFVWSVQNVCDQERKVSITFTFKNGTGNKKQDAEGNPEAQLIAEGNAKGVAIRQ 247
Query: 278 RTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSME 337
+ P ++ LA + + ++ CP F +G + + +W ++KE+G + + E
Sbjct: 248 TIADM--PCSYNLACRVLPEISITRCPQFDPAG-----SGEQLWAQLKEYGQLNEQPTSE 300
Query: 338 TSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG-KTYYRRYTKFYGTH 396
T T + IG A+ A + + P + ++ F LAWD P++ F +T+ R YTK++
Sbjct: 301 TLKTKD----IGVAVCAQLALKPQASHELEFVLAWDMPKIQFPRKLQTHTRYYTKYFDDS 356
Query: 397 QNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVW 453
I A+ + SWE I+AWQRPIL D+ LP+WY +FN+LYY++ GG +W
Sbjct: 357 GECGPRICEYALKHYASWERLIDAWQRPILNDEGLPDWYKCAIFNQLYYISDGGTIW 413
>gi|255073149|ref|XP_002500249.1| hypothetical protein MICPUN_55788 [Micromonas sp. RCC299]
gi|226515511|gb|ACO61507.1| hypothetical protein MICPUN_55788 [Micromonas sp. RCC299]
Length = 954
Score = 271 bits (692), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 263/934 (28%), Positives = 410/934 (43%), Gaps = 159/934 (17%)
Query: 96 GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKT 155
G P+GG+G+G++GR+YRG+F RW + +P ++F +R +G ++VL
Sbjct: 63 GAPIGGMGAGTMGRTYRGDFARWHMRVGGHTHRPA---AHTLFAARVDGT--ATVLSALP 117
Query: 156 PEVLKDTTAAGIGSWDWNLK------GDKSTYHALYPRAWTVHEGE---PDPELRIVCRQ 206
+ + G GS + + G TYHALYPR+W ++ P +R+ Q
Sbjct: 118 SHAVMEGVPGGGGSDRTHSRRVLPPDGTGGTYHALYPRSWYEYDPSALAPGGRVRLSQTQ 177
Query: 207 ISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG---------------- 250
SP+IP Y+++S PV VF + N G AD+ L F++ N +
Sbjct: 178 FSPVIPGEYRDASMPVGVFRFLASNEGDVPADVALAFSFENVLATTDVAPVDVGPPRGAW 237
Query: 251 --GDSEFTGQHYNSKTKMNDGVHA----VLLHHRTSHQLPPVTFALA---AQETDGVHVS 301
E T + GVH V+ R+ + A+A E G++V+
Sbjct: 238 RLAPHELTHSTFERSGGRVAGVHMHTDDVVYGTRSPSEPWHGGIAVAVGGNGELGGLNVT 297
Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGSFDR-----LNSMETSVTSEPGSSIGAAIAASV 356
P + +G++ + +K W + G F R N + + + P S+ A++A+
Sbjct: 298 TLPLY-DAGDADSIESK-FWRPFERAGGFGRDESAAANEHDGGLPAGPLSAC--AVSAAF 353
Query: 357 TVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGT--HQN------AAANIARDAI 408
T+ P VTF+ AWD P +F G + +RYT+++G+ H+ AA +IA +A+
Sbjct: 354 TLAPGERRNVTFAAAWDLPASSFPRGVRWAKRYTRYHGSFIHRRDVPRGAAAPDIATEAL 413
Query: 409 LEHGSWELQIEAWQRPILEDK----------------------------RLPEWYPITLF 440
L WE I WQR ++ R P W +F
Sbjct: 414 LRADEWEGDIARWQRGFVQSAAAKDANLDDPAGGRGRNYPGRVRDRPTLRRPAWLVGAMF 473
Query: 441 NELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERM 500
NELYYL GG +W G P +T+ + ++ S++ ++ + +
Sbjct: 474 NELYYLVDGGTLW--GRP-----LTVAGLEGDVEESEASAAAPGGWRGEDGEGGDAQD-- 524
Query: 501 SSILEQIYTPVALNSS-FGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIML 559
VA N +G L G+F LE + ++N DVHFY S+ L ML
Sbjct: 525 ----------VAGNGGEWGGTL--------GRFGLLESFDAPVYNALDVHFYGSWPLAML 566
Query: 560 FPKIQLSIQRDFAAAVMMHDPSKMKL----LDDG---QWVSRKVLGAVPHDIGICDPWFE 612
+P + L++ D AA V D + L D G + RKV G+ P D+G F+
Sbjct: 567 WPGLDLAVLADLAAGVDSEDDEERALSWNARDAGRAPEMRKRKVRGSAPRDLGAP---FD 623
Query: 613 V------NAYCLYDTARWKDLNPKFVLQVYRDVVATGDKK------------FAKAVWPS 654
NA+ L D RWKDL PK +L + R G + + P
Sbjct: 624 APLTSSPNAFDLTDVNRWKDLAPKLMLTLARAHALRGGFRGPNDPGGIPRDVLRRMFRPC 683
Query: 655 VYVAMAYMDQFDRDGDGMIEND----GFPDQTYDTWSVSG-ISAYSGGLWVAALQAASAL 709
A + +FD +GDG+IE+D PD ++ W G SAY+GGLW+AAL+A + L
Sbjct: 684 YESLAAQLRRFDGNGDGLIEHDDDDEAGPDHSFVAWRARGDTSAYAGGLWLAALRAGAGL 743
Query: 710 AREVGDRGSEDYFLFKFQKAKVVYEK-LWNG------SYFNYDNSGSSQSSSIQADQLAG 762
AR++ + + D + A V ++ LW G Y+ +D SG+ A Q+ G
Sbjct: 744 ARDLNETHARDSLEQTMRAAAVAHDAALWLGDDDGGRGYYRFDASGTEGGDVSSAAQVMG 803
Query: 763 QWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREI 822
+W G ++ K R+AL V NV G G VD S S E+
Sbjct: 804 EWALAVIGARGVLPARKVRAALASVARLNVAAAGGAGGAVNGAAA-SGSVDAVSAHSNEV 862
Query: 823 WSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYM 882
W GV+YAVA+ M+ + D ++ A G+ + G AF+TPEAW+ +RS
Sbjct: 863 WPGVSYAVASHMLLAGMDDEAWELARGVAAGTYE---RGLAFRTPEAWDARGGFRSAMSQ 919
Query: 883 RPLAIWAMQWALT---RPKPKTLEKQMKPEVTDE 913
R ++WA++ ALT R + EK + V DE
Sbjct: 920 RAGSVWAIEHALTLRHRREVAAWEKAGRGRVKDE 953
>gi|170036781|ref|XP_001846240.1| bile acid beta-glucosidase [Culex quinquefasciatus]
gi|167879683|gb|EDS43066.1| bile acid beta-glucosidase [Culex quinquefasciatus]
Length = 913
Score = 261 bits (668), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 162/444 (36%), Positives = 238/444 (53%), Gaps = 80/444 (18%)
Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDG 589
G++ YLEG EY M+NTYDVHFY+S AL L+P +Q+S+Q D+ AV K L DG
Sbjct: 471 GRYAYLEGHEYRMYNTYDVHFYASHALASLWPNLQVSLQYDYKDAVQREITEGRKHLYDG 530
Query: 590 QWVSRK------------------VLGAVP-HDIG------------ICDPWFEVNAYCL 618
+ + RK ++ A P HD+ + ++ +N Y
Sbjct: 531 KVIPRKIKSSVPHDLGDPDEEPFELINAYPIHDVSEWKDLNTKFILQVYRDYYTLNHYAQ 590
Query: 619 --------YDTARWKDLNPKFVLQVY--RDVVATGDK----------------------- 645
+ + + D F + V R+ V+ DK
Sbjct: 591 LNADNASKFSSIEFIDKESMFEMYVQDNRNKVSPEDKAQNRKSASMYINETNGKVYLMDA 650
Query: 646 -KFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
+ K+++P+ + ++D+D DG+IEN PDQTYDTW + G S+Y GGLW+A+L
Sbjct: 651 MTYLKSMYPACKQVLEKTLEWDKDNDGLIENSKSPDQTYDTWVMDGPSSYCGGLWLASLH 710
Query: 705 AASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLA 761
S +A + SED ++ +K K + EKLWNGSY+ +D S SS +SI +DQL
Sbjct: 711 CMSVMANLLDQ--SEDCIKYREILEKGKASFEEKLWNGSYYKFD-SQSSSKNSIMSDQLC 767
Query: 762 GQWYARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP------DGRVDM 814
G WY R CG IV ++ R+ALK +Y+ NV++ GG GAVNG +P DGR D
Sbjct: 768 GHWYLRCCGFDYDIVPKENVRTALKTIYDNNVMRFCGGNMGAVNGYIPSSQPNKDGRADS 827
Query: 815 SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDD 874
S++Q+ E+W+GVTYA+A++MIHE + FQTA G+Y+ +G +F+TPEA +
Sbjct: 828 STLQAEEVWTGVTYALASTMIHEGMFTEAFQTAGGLYQTL--SEKIGMSFETPEALYAER 885
Query: 875 QYRSLCYMRPLAIWAMQWALTRPK 898
YR++ YMRPL+IW+MQ A + K
Sbjct: 886 HYRAIGYMRPLSIWSMQTAWEQKK 909
Score = 171 bits (434), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 126/424 (29%), Positives = 214/424 (50%), Gaps = 50/424 (11%)
Query: 60 LAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQ 119
+ P+ +R + + + +G++ +D + + +G P+GG+G+G+IGR + GEF R+Q
Sbjct: 50 IIPMALRYIPYYWKVSREGRQVLMDYWYTENGKQIYGAPIGGIGAGTIGRGFAGEFCRFQ 109
Query: 120 IFPRVCEDKPVLANQFSVFVS---------------RSNGQKYSSVLCPKTPEVL----- 159
+ P + E V ANQF V + R+ + V P +L
Sbjct: 110 MRPGLYEYNTVHANQFIVTIKDENNVTIFQSLLSSYRTKDDEVHDVCAPNQTTLLHGLMP 169
Query: 160 -KDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKES 218
+ + SWD + K +Y ALYPRAW+ ++ + +++V RQISPIIPH+YKES
Sbjct: 170 CETRPKTPLASWDSTINSSKCSYTALYPRAWSEYDLS-EHGIKLVQRQISPIIPHDYKES 228
Query: 219 SYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHR 278
S PV+VF +T+ N +T+ FT+ N G N K H
Sbjct: 229 SLPVAVFAWTVENVCGKDRHVTITFTFKNGTG----------NKKQDAEGNAETAAFAHG 278
Query: 279 TSHQLP--------PVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSF 330
T+ + P T+ ++ + + ++++ C F SGN + +W+++KE+G
Sbjct: 279 TAKGVSIKQTIADLPCTYCVSCKTSSEINLTRCEKFDPSGN-----GEKLWNDLKENGHL 333
Query: 331 DRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYT 390
+ E+ T + + A++A + VP +S ++ FSL WD P+++F Y +Y
Sbjct: 334 TEKSGDESLKTKD----VAVAVSAQILVPSESSSELEFSLVWDMPKIHFTKKMKEYYKYY 389
Query: 391 KF-YGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAG 449
+G + +A I+ A+ +G WE I+ WQRPIL+D LP+WY +FNELY++ G
Sbjct: 390 TKYFGKNGDAGPVISDYALRNYGKWESLIDEWQRPILDDSDLPDWYKSAIFNELYFIADG 449
Query: 450 GAVW 453
G++W
Sbjct: 450 GSIW 453
>gi|339252284|ref|XP_003371365.1| AP-1 complex subunit gamma-1 [Trichinella spiralis]
gi|316968412|gb|EFV52690.1| AP-1 complex subunit gamma-1 [Trichinella spiralis]
Length = 1615
Score = 252 bits (643), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 144/361 (39%), Positives = 198/361 (54%), Gaps = 46/361 (12%)
Query: 539 EYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLG 598
EY M+NTYDVHFY+SFAL LFP+I+ SIQ +FA + + + G +V
Sbjct: 1286 EYRMYNTYDVHFYASFALAKLFPQIEHSIQAEFADQFYNVTQTNVYHISPGTYVF----- 1340
Query: 599 AVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRD-VVATGDKKFAKAVWPSVY- 656
+PW EVNAY +DT+ WKDLN KFVL YRD ++ + F + VWP+V
Sbjct: 1341 ---------EPWLEVNAYAFHDTSEWKDLNLKFVLTSYRDYLMLNKSESFLRHVWPAVKE 1391
Query: 657 VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR 716
+ + +D DGD ++ENDG DQTYD W + W ++ A L
Sbjct: 1392 IIETALQNWDHDGDKLVENDGTCDQTYDAWCMH---------WTLLVELAKRL------- 1435
Query: 717 GSEDYFLFKFQKAKVVYEKLW--NGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL-LP 773
+ QK ++ E+ W G+Y+ +D S SS++ ADQL G Y + CGL
Sbjct: 1436 --------EMQKKSLISEQCWMMRGTYYQFDQQ-SCNSSTVMADQLCGYAYLKICGLQTD 1486
Query: 774 IVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAAS 833
I+ D + L+ +YN NV G GAVNGML G D SS+Q+ E W+GVTY ++A
Sbjct: 1487 ILPLDNVKKVLETIYNLNVCSFGNGTLGAVNGMLYSGEKDTSSLQADEAWTGVTYFLSAH 1546
Query: 834 MIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWA 893
MI E + GF TA GIY++ + G+ Y QTPEA +R++ YMRPL+IWA+QW
Sbjct: 1547 MISEGFVEQGFSTASGIYKSCFESFGMHY--QTPEALYEKKWFRAVGYMRPLSIWAIQWY 1604
Query: 894 L 894
L
Sbjct: 1605 L 1605
Score = 176 bits (447), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 117/386 (30%), Positives = 186/386 (48%), Gaps = 22/386 (5%)
Query: 77 KGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCED--KPVLANQ 134
+G ID F L +GVP G +GSGSIGR G F ++ + P + E V A+
Sbjct: 899 RGTSMTIDSFRPLQLKPIYGVPCGTIGSGSIGRHILGGFCKFSLVPGIVEHDVSSVPADN 958
Query: 135 FSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEG 194
F V V + Y VL +K + SW +N + +Y YP +WT ++
Sbjct: 959 FIVTVRDAGVTIYQQVL-----SAVKVENKKFLSSWLFNFPAECMSYLGRYPMSWTTYKL 1013
Query: 195 EPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG-GDS 253
P L ++C Q SP+IPHNYK+S P+++F +T+ + S+D+T+ T+T G G
Sbjct: 1014 -PCHNLILICHQYSPVIPHNYKDSCLPLTLFKWTVMSEESLSSDLTVSITFTFRNGTGRP 1072
Query: 254 EFTGQHYNSKT----KMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVIS 309
QH +M+ VH V + + + +AA T + V+ H +
Sbjct: 1073 RVDDQHQMRANSFLERMSSSVHCVGVSLHQTIDGMNCDYCVAALNTKSISVT---HSTFN 1129
Query: 310 GNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFS 369
G ++W + E G ++ E V + +I + ++ P EG + FS
Sbjct: 1130 PEGCG---SEIWKSLYETGRLSNESTDEFVVVEDRPLAIAVSCCTNIR-SPCQEG-IVFS 1184
Query: 370 LAWDCPEVNFMSGK-TYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
L W PE+ F + + TY RRY+ ++ +AA ++ A+ + +I+ WQ PIL++
Sbjct: 1185 LVWHMPEIYFGNKRRTYLRRYSDWFDESSDAAKRLSIYAMENREIFLSKIDEWQEPILKN 1244
Query: 429 KRLPEWYPITLFNELYYLNAGGAVWT 454
LP+WY +FNELYYL GG+VW
Sbjct: 1245 DNLPKWYKSAIFNELYYLTDGGSVWV 1270
>gi|440680630|ref|YP_007155425.1| Glucosylceramidase [Anabaena cylindrica PCC 7122]
gi|428677749|gb|AFZ56515.1| Glucosylceramidase [Anabaena cylindrica PCC 7122]
Length = 802
Score = 251 bits (642), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 147/385 (38%), Positives = 216/385 (56%), Gaps = 29/385 (7%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLD- 587
IGQF LE ++Y + + DV Y SFAL+MLFP+++ S+ R FA A+ H ++++
Sbjct: 420 IGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKSVIRAFARAIP-HSDDYLRVIGY 478
Query: 588 ------DGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDV 639
D RKV GA PHD+G + W + N C D WKDL FVLQVYRD
Sbjct: 479 YYTIGADTTTAVRKVAGATPHDLGAPNEHVWEKTNYTCYQDCNLWKDLGSDFVLQVYRDF 538
Query: 640 VATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGG 697
+ TG D +F W ++ + Y+ FD DGDG+ EN G PDQT+D W + G+SAY GG
Sbjct: 539 IFTGGQDVEFLADCWDAIVQTLDYLKTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGG 598
Query: 698 LWVAALQAASALAREVGD-RGSEDYFLFKFQKA----------KVVYEKLWNGSYFNYDN 746
LW+AAL++A A+ + + RG+E+ + QK+ + EKLWNG Y+ D+
Sbjct: 599 LWLAALESAIAICDILTNRRGAENAEKIRSQKSVYQSWLQQSLPIYQEKLWNGQYYRLDS 658
Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
S S+ + ADQL GQ+YAR GL IV D A SAL+ VY+ +K G+ GA NG+
Sbjct: 659 --ESGSNVVMADQLCGQFYARLLGLPDIVPNDNALSALQTVYDACFVKFCNGQFGAANGV 716
Query: 807 LPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
PDG + + E+W+G+ + +AA +I + + + + + + G F+
Sbjct: 717 RPDGSPENPQATHPLEVWTGINFGLAAFLIQMGMKNEAMRLTQAVVQQIYDN---GLQFR 773
Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAM 890
TPEA + +R+ Y+R +AIWA+
Sbjct: 774 TPEAITSAGTFRASTYLRAMAIWAI 798
Score = 192 bits (487), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 131/392 (33%), Positives = 189/392 (48%), Gaps = 45/392 (11%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W I A QFS+F S K + L +
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFNTFPACQFSIFESTGISTK-AYALSSQ 98
Query: 155 TPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
PE D T + +W W + + YHALYPR+W V+E +L C Q SPI
Sbjct: 99 PPE---DGT---LQTWQWYPASSTTEETGKYHALYPRSWFVYENVFQTDL--TCEQFSPI 150
Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGV 270
+NY+E+SYPV+VF + +N ++++ TW N VG FT + + ++ D
Sbjct: 151 WANNYQETSYPVAVFVWKAHNPTNAPITLSIMLTWENMVGW---FTNALKSPQVQIRDDG 207
Query: 271 HAVLLHH---------------RTSHQ---LPPVTFALAAQETDGV-------HVSLCPH 305
V + T H L V E DG H +
Sbjct: 208 SPVYEYQPRLGASQGNYNQIVENTDHFSCFLGRVASGEPVGEGDGSWSIATPKHPQVEIF 267
Query: 306 FVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQ 365
+ +S+G T D+W GS L + + + ++ + +GAAIA T+ P +
Sbjct: 268 YHTRWDSVG-TGADIWQSFATDGS---LANNQDATPADENTRLGAAIAIRFTLQPGETLE 323
Query: 366 VTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPI 425
+ F+LAWD P F +G YYRRYT F+G + A IA +LE+ +W I WQ PI
Sbjct: 324 IPFTLAWDFPVTEFSAGINYYRRYTDFFGRNGENAGKIATTGLLEYQNWRSHILTWQNPI 383
Query: 426 LEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
L+ + LP+W+ + LFNELY L +GG +W+ S
Sbjct: 384 LDREDLPKWFKMALFNELYDLTSGGTLWSAAS 415
>gi|434407175|ref|YP_007150060.1| putative bile acid beta-glucosidase [Cylindrospermum stagnale PCC
7417]
gi|428261430|gb|AFZ27380.1| putative bile acid beta-glucosidase [Cylindrospermum stagnale PCC
7417]
Length = 816
Score = 250 bits (639), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 146/385 (37%), Positives = 211/385 (54%), Gaps = 28/385 (7%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
GQF LE ++Y + + DV Y SFAL+MLFP+++ S+ R FA A+ D +
Sbjct: 428 FGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKSVIRAFARAIPQSDDHQRIIGYY 487
Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
+ D RK+ GA PHD+G + W + N C D WKDL FVLQVYRD +
Sbjct: 488 YTIGADTTTAVRKIAGATPHDLGAPNEHVWEKTNYTCYQDCNLWKDLGSDFVLQVYRDFL 547
Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TG D +F W ++ + Y+ FD+DGDG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 548 FTGADDVEFLADCWDAIVQTLDYLKGFDKDGDGIPENSGAPDQTFDDWRLQGVSAYCGGL 607
Query: 699 WVAALQAASAL----------AREVGDRGSED--YFLFKFQKAKVVYEKLWNGSYFNYDN 746
W+AAL+ A A+ A+ + ++ Y + Q V EKLWNG Y+ D+
Sbjct: 608 WLAALECAIAICDILLTNRQDAKNAKNLETQRSIYGAWLAQSLTVYEEKLWNGQYYRLDS 667
Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
S + + ADQL GQ+YAR GL IV D+A SAL+ VY+ LK G+ GA NG+
Sbjct: 668 --ESGTEVVMADQLCGQFYARLLGLPDIVKCDRALSALQTVYDACFLKFFDGQFGAANGV 725
Query: 807 LPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
PDG + ++ E+W+G+ + +AA ++ + D + + + G F+
Sbjct: 726 RPDGVPENPNATHPLEVWTGINFGLAAFLVQMGMQDEALRLTQAVVSQIYDN---GLQFR 782
Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAM 890
TPEA T +R+ Y+RP+AIWA+
Sbjct: 783 TPEAITTTGTFRASTYLRPMAIWAI 807
Score = 195 bits (495), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 139/401 (34%), Positives = 197/401 (49%), Gaps = 55/401 (13%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSR-SNGQKYSSVLCP 153
HG+PLGG G+G IGRS RG+F W I K + A QFSVF S ++ Q Y+ L
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFKNIPACQFSVFESHGASSQAYA--LST 97
Query: 154 KTPEVLKDTTAAGIGSWDWNLKG------------DKSTYHALYPRAWTVHEGEPDPELR 201
+ PE D T + +W W + D TYHALYPR+W V+E +L
Sbjct: 98 EPPE---DGT---LSAWQWYPRSAESKAGKMPTPQDTGTYHALYPRSWFVYENVFQAQLS 151
Query: 202 IVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYN 261
C Q SP+ +NY+E+SYPV+VF + +N ++++ TW N VG FT +
Sbjct: 152 --CEQFSPVWANNYQETSYPVAVFNWKAHNPTDAPITLSIMLTWENMVGW---FTNALKS 206
Query: 262 SKTKMNDGVHAVLLHHR-------TSHQLPPVTFALAA-----------QETDGV-HVSL 302
K ++ D V + HQL T + QE DG ++
Sbjct: 207 PKVQIRDDGSPVYEYQPRLGKSQGNYHQLIENTQQVGCVLGRVASDAPVQEGDGTWCIAT 266
Query: 303 CPHFVIS------GNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASV 356
H +I N +G T ++W GS N ++ + +E + IGAA+A
Sbjct: 267 RKHPLIEVFHHTRWNPVG-TGDEVWQSFAADGSLP--NYVDETAAAE-NTRIGAALAVRF 322
Query: 357 TVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWEL 416
T+ P +V F LAWD P F +G YYRRYT F+G A IA ++ E+ +W
Sbjct: 323 TLQPGECLEVPFVLAWDFPVTEFAAGVNYYRRYTDFFGRSGQNAEAIAFLSLQEYQNWRS 382
Query: 417 QIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
QI+ WQ+PIL + LP W+ + LFNELY L +GG +W+ S
Sbjct: 383 QIQNWQKPILVREDLPPWFKMALFNELYDLTSGGTLWSAAS 423
>gi|298490121|ref|YP_003720298.1| glucosylceramidase ['Nostoc azollae' 0708]
gi|298232039|gb|ADI63175.1| Glucosylceramidase ['Nostoc azollae' 0708]
Length = 806
Score = 249 bits (637), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 147/388 (37%), Positives = 219/388 (56%), Gaps = 32/388 (8%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLD- 587
IGQF LE ++Y + + DV Y SFAL++LFP+++ S+ R FA A+ H ++++
Sbjct: 420 IGQFAVLECLDYRWYESLDVRLYGSFALLILFPELEKSVIRAFARAIP-HSDDHLRVIGY 478
Query: 588 ------DGQWVSRKVLGAVPHDIGICDPWF--EVNAYCLYDTARWKDLNPKFVLQVYRDV 639
+ + RKV A PHD+G + W + N C D WKDL+ FVLQVYRD
Sbjct: 479 YYTIGAETKKAIRKVAKATPHDLGAPNEWVWEKTNYTCYQDCNLWKDLSSDFVLQVYRDF 538
Query: 640 VATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGG 697
+ TG D +F W ++ + Y+ FD DGDG+ EN G PDQT+D W ++G+SAY GG
Sbjct: 539 LFTGSHDVQFLVDCWDAIVQTLDYLKTFDLDGDGIPENSGAPDQTFDDWLLNGVSAYCGG 598
Query: 698 LWVAALQAASALAREVGD-RGSED------------YFLFKFQKAKVVYE-KLWNGSYFN 743
LW+AAL++A A+A + + RG+E+ + Q+++ +YE KLWNG Y+
Sbjct: 599 LWLAALESAIAIADILTNHRGTENTEKTDKLRSQKSIYQSWLQQSRPIYEQKLWNGQYYR 658
Query: 744 YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAV 803
D+ S S + ADQL GQ+Y R GL IV + A SAL+ VY +K GK GA
Sbjct: 659 LDS--ESGSDVVMADQLCGQFYTRLLGLPDIVPSNHAHSALQTVYEACFVKFYDGKFGAA 716
Query: 804 NGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
NG+LPDG + + E+W+G+ + +AA ++ + + Q + E + G
Sbjct: 717 NGVLPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKNEAMQLTQAVVEQIYDN---GL 773
Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
F+TPEA +R+ Y+RP+AIWA+
Sbjct: 774 QFRTPEAITPVGTFRASTYLRPMAIWAI 801
Score = 192 bits (489), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 132/409 (32%), Positives = 195/409 (47%), Gaps = 54/409 (13%)
Query: 85 PFIKRHLTSS-----HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
P+ R++++ HG+PLGG G+G IGRS RG+F W I K + QFSVF
Sbjct: 25 PYTVRYVSNIDDGPWHGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHLFKSFPSCQFSVFE 84
Query: 140 SRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGE 195
S + K ++ + E+ +D T + SW W TYHALYPR+W V+E
Sbjct: 85 STGSSSKAYAL----STEIPEDGT---LQSWQWYPPSTTTQATGTYHALYPRSWFVYENV 137
Query: 196 PDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----- 250
+ ++ C Q SPI +NY+E+SYPV+VF + +N ++++ TW N VG
Sbjct: 138 L--QTQLTCEQFSPIWANNYQETSYPVAVFVWKAHNPTNAPITLSIMLTWENIVGWFTNA 195
Query: 251 -----------GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVH 299
G + Q ++K N H V L V L QE DG
Sbjct: 196 LKSPDLQIRDDGSPVYEYQPRLGESKGNYN-HIVENTEFFGCFLGRVPTGLPIQEGDGSW 254
Query: 300 VSLCP-----------HFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSI 348
P HF N +G + D+W G L + + ++ + +
Sbjct: 255 CIATPKNPRMEVFYHTHF----NPVG-SGADLWESFSTDG---YLPNYLNATPADADTRL 306
Query: 349 GAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAI 408
GAA+A T+ P ++ F+LAWD P F G YYRRYT F+ + + A I +
Sbjct: 307 GAALAVRCTLQPGETLEIPFTLAWDFPVTEFAEGINYYRRYTDFFSRYGDNAFAITSRGL 366
Query: 409 LEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
++ W QI+ WQ+PIL+ + LPEW+ + LFNELY L +GG +W+ S
Sbjct: 367 EQYQVWRAQIQNWQKPILDREDLPEWFKMALFNELYDLTSGGTLWSAAS 415
>gi|425435596|ref|ZP_18816046.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9432]
gi|389679839|emb|CCH91411.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9432]
Length = 784
Score = 249 bits (636), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 142/379 (37%), Positives = 209/379 (55%), Gaps = 17/379 (4%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
++ IGQF LE I+Y + + DV Y SF L+ML+P+++ ++ FA A+ HD + +
Sbjct: 412 QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPTHDDTPRII 471
Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
+ RK A PHD+G + PW + N D WKDL FVL VYRD + TG
Sbjct: 472 GYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTG 531
Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
D+ F + WPS+ + + Y+ FD D DG+ EN G PDQT+D W + GISAY GGLW+A
Sbjct: 532 AKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIA 591
Query: 702 ALQAASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQAD 758
AL+A + +G ED +F Q+++ +Y + LWNG Y+N D+ S S + D
Sbjct: 592 ALEATLKIGAILG----EDTAIFASWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTD 645
Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSM 817
QL GQ+YAR L +V+ +SAL K+Y LK GGK GA NG+ PDG + +
Sbjct: 646 QLCGQFYARLLALPDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPENPEAT 705
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
E+W G+ + + A ++ + G+Q + + G F+TPEA + +R
Sbjct: 706 HPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQIYEN---GLQFRTPEAITSVGTFR 762
Query: 878 SLCYMRPLAIWAMQWALTR 896
+ Y+R +AIWA+ LT
Sbjct: 763 ACHYLRAMAIWAIYGVLTE 781
Score = 185 bits (469), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 116/389 (29%), Positives = 181/389 (46%), Gaps = 47/389 (12%)
Query: 93 SSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLC 152
S HG+PLGG G+G IGRS +G+F W + + A QFS+F +G + +
Sbjct: 39 SWHGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGNAQAYAMA 98
Query: 153 PKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIP 212
++P T + W W +K Y ALYPR+W +E + + +++C Q SP+
Sbjct: 99 TESP------TDGTLSRWSW-YPAEKGNYSALYPRSW--YEYKDVFQTQLICEQFSPVWA 149
Query: 213 HNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG---------------------- 250
HNY+ESSYP+++F +T +N I+++ TW N++G
Sbjct: 150 HNYQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYE 209
Query: 251 -----GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPH 305
G+SE +YN N V ++ + A+A+ + V
Sbjct: 210 YQSKWGESE---GNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEVFYLGK 266
Query: 306 FVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQ 365
+ +G+ ++W +GS L +E ++ G A+A T+ P +
Sbjct: 267 WNPAGD-----GGEVWDYFAMNGS---LPDIEDETPAQKGEQTATAMAIRFTIRPGKTRK 318
Query: 366 VTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPI 425
+ F LAWD P F G YYRRYT F+G + A + R A+ + +W I WQ+PI
Sbjct: 319 IPFILAWDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKQGDTWRENIINWQKPI 378
Query: 426 LEDKRLPEWYPITLFNELYYLNAGGAVWT 454
LE L +W+ + LFNELY L GG +WT
Sbjct: 379 LERSDLADWFKMALFNELYLLTDGGTLWT 407
>gi|443669591|ref|ZP_21134796.1| hypothetical protein C789_5336 [Microcystis aeruginosa DIANCHI905]
gi|159029778|emb|CAO87856.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443330115|gb|ELS44858.1| hypothetical protein C789_5336 [Microcystis aeruginosa DIANCHI905]
Length = 784
Score = 249 bits (636), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 153/423 (36%), Positives = 225/423 (53%), Gaps = 32/423 (7%)
Query: 495 NILERMSSILEQI----YTPVALNSSFGTNLLQDG---------EENIGQFLYLEGIEYL 541
NI+ ILE+I + +AL + LL DG ++ IGQF LE I+Y
Sbjct: 370 NIINWQKPILERIDLADWFKMALFNEL--YLLTDGGTLWTAATEQDPIGQFGVLECIDYR 427
Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVP 601
+ + DV Y SF L+ML+P+++ ++ FA A+ D + + + RK A P
Sbjct: 428 WYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTPRIIGYNQASAIRKAKNATP 487
Query: 602 HDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--DKKFAKAVWPSVYV 657
HD+G + PW + N D WKDL FVL VYRD + TG D+ F + WPS+ +
Sbjct: 488 HDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVL 547
Query: 658 AMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
+ Y+ FD D DG+ EN G PDQT+D W + GISAY GGLW+AAL+A + +G
Sbjct: 548 TLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATLKIGEIIG--- 604
Query: 718 SEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPI 774
ED +F Q+++ +Y + LWNG Y+N D+ S S + DQL GQ+YAR L +
Sbjct: 605 -EDTAIFTSWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTDQLCGQFYARLLALPDV 661
Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAAS 833
V+ +SAL K+Y LK GGK GA NG+ PDG + + E+W G+ + + A
Sbjct: 662 VENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAF 721
Query: 834 MIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWA 893
++ + G+Q + + G F+TPEA + +R+ Y+R +AIWA+
Sbjct: 722 LLQMGMEKQGWQITEAVVRQVYEN---GLQFRTPEAITSVGTFRACHYLRAMAIWAIYGV 778
Query: 894 LTR 896
LT
Sbjct: 779 LTE 781
Score = 188 bits (478), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 119/387 (30%), Positives = 183/387 (47%), Gaps = 47/387 (12%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS +G+F W + + A QFS+F +G + + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGNAQAYAMATE 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
+P KD T + W W +K TY ALYPR+W +E + + +++C Q SP+ HN
Sbjct: 101 SP---KDGT---LSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG------------------------ 250
Y+ESSYP+++F +T++N I+++ TW N++G
Sbjct: 152 YQESSYPIALFEWTVHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211
Query: 251 ---GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFV 307
G+SE +YN N V ++ + A+A+ + V +
Sbjct: 212 SKWGESE---GNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEVFYLGKWN 268
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
+G+ ++W +GS L +E +E G A+A T+ P ++
Sbjct: 269 PAGD-----GGEVWDYFAMNGS---LPDIEDETPAEKGKQTATAMAIRFTIRPGKTRKIP 320
Query: 368 FSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
F LAWD P F G YYRRYT F+G + A + R A+ + W I WQ+PILE
Sbjct: 321 FILAWDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKQGDKWRENIINWQKPILE 380
Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWT 454
L +W+ + LFNELY L GG +WT
Sbjct: 381 RIDLADWFKMALFNELYLLTDGGTLWT 407
>gi|443320525|ref|ZP_21049620.1| putative bile acid beta-glucosidase [Gloeocapsa sp. PCC 73106]
gi|442789771|gb|ELR99409.1| putative bile acid beta-glucosidase [Gloeocapsa sp. PCC 73106]
Length = 806
Score = 248 bits (633), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 154/403 (38%), Positives = 216/403 (53%), Gaps = 30/403 (7%)
Query: 518 GTNLLQDGEEN--IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAV 575
G L EN IGQF LE I+Y + + DV Y SFAL ML+P++ +I FA A+
Sbjct: 404 GGTLWTAATENDPIGQFAVLECIDYRWYESLDVRLYGSFALAMLWPRLDKAILEAFARAI 463
Query: 576 MMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVL 633
D S + + RK+ GA PHD+G + PW + N D WKDL+ FVL
Sbjct: 464 PQSDESMRVIGYNQAEAQRKIAGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLSSDFVL 523
Query: 634 QVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
QVYRD V TG D F WPS+ + Y+ FD DGDG+ EN G PDQT+D W + GI
Sbjct: 524 QVYRDFVLTGKEDTDFLWECWPSITQTLNYLKSFDLDGDGIPENAGAPDQTFDDWKLEGI 583
Query: 692 SAYSGGLWVAALQAASALAREVGDR-------GSEDY-----FLFKFQKA------KVVY 733
SAY GGLW+AAL+A+ A+ + D DY K ++ + +
Sbjct: 584 SAYCGGLWIAALEASIAIGAVLLDNPPMNPQLQPSDYPEGIEATLKTHQSWLTLARSIYH 643
Query: 734 EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL 793
++LWNG Y++ D S S + ADQLAGQ+YA+ GL +V+++ +S LK +Y L
Sbjct: 644 QQLWNGQYYSLDT--KSGSVVVMADQLAGQFYAQLLGLPDVVEKEYVQSTLKTIYQSCFL 701
Query: 794 KVMGGKRGAVNGMLPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYE 852
GK GA NG+ DG + S E+W+G+ +A+ A +IH + D G + A I +
Sbjct: 702 NFHQGKIGAANGVKLDGTGENPQSTHPLEVWTGINFALGAFLIHSGMRDEGLKLAETIVK 761
Query: 853 AAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALT 895
+ G F+TPEA T +R+ Y+R +AIWA+ + L
Sbjct: 762 QIYD---QGLQFRTPEALTTSGTFRASHYLRAMAIWAIDYTLN 801
Score = 195 bits (495), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 131/391 (33%), Positives = 181/391 (46%), Gaps = 53/391 (13%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G +GRS RG+F W + + A QFSVF G+ + +C K
Sbjct: 41 HGMPLGGFGAGCVGRSPRGDFNLWHFDGGEHIFRSLDACQFSVFEHPEQGEVKAYGMCTK 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
PE D T + W W + D+ TYHALYPR+W ++G EL VC Q SPI N
Sbjct: 101 PPE---DGT---LSRWRWYPE-DQGTYHALYPRSWFEYQGVFATEL--VCTQFSPIWAKN 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG------------------------ 250
Y+ESSYP+++F +T YN ++++ TW N VG
Sbjct: 152 YQESSYPLAIFEWTAYNPTDQPITLSIMLTWQNVVGWFTNAIKSPEIRIRDDGSPVYQYQ 211
Query: 251 ---GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSH----QLPPVTFALAAQETDGVHVSLC 303
GDS TG N + DG L +R Q F +A+ + V
Sbjct: 212 PRWGDS--TG---NYNQWIQDGSRVGCLFNRIQPHEELQEGEGQFCIASILNPSLEVFYQ 266
Query: 304 PHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSE 363
+ G+ D+W + GS L + +EPG I AIA TV P
Sbjct: 267 TRWHPRGD-----GSDVWDDFARDGS---LGDRQDETPAEPGEQIAGAIAVRFTVRPGKT 318
Query: 364 GQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQR 423
+ F L WD P + F G YYRRYT F+G + A ++ R A+ W+ +I+ WQ
Sbjct: 319 RLIPFVLTWDLPILEFAPGVNYYRRYTDFFGRNGQNAWSMVRTALKHGDMWQDKIDQWQS 378
Query: 424 PILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
P+++ P W + LFNELY L GG +WT
Sbjct: 379 PVVQRSDYPTWLKMALFNELYLLTQGGTLWT 409
>gi|257060089|ref|YP_003137977.1| glucosylceramidase [Cyanothece sp. PCC 8802]
gi|256590255|gb|ACV01142.1| Glucosylceramidase [Cyanothece sp. PCC 8802]
Length = 805
Score = 247 bits (630), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 149/407 (36%), Positives = 213/407 (52%), Gaps = 30/407 (7%)
Query: 518 GTNLLQDGEEN--IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAV 575
G L EN +GQF LE ++Y + + DV Y SF L+ML+P+++ S+ FA A+
Sbjct: 404 GGTLWTAASENDPVGQFGVLECLDYRWYESLDVRLYGSFGLMMLWPRLEKSVLEAFARAI 463
Query: 576 MMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVL 633
HD + + + RK GA PHD+G + PW + N D WKDL FVL
Sbjct: 464 PTHDDTPRIIGYNQAKAIRKAKGATPHDLGAPNEHPWQKTNYTSYQDCNLWKDLGSDFVL 523
Query: 634 QVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
QVYRD + TG D F WP++ + Y+ FD D DG+ EN G PDQT+D W + GI
Sbjct: 524 QVYRDYLLTGSDDTDFLWECWPAITETLDYLKTFDLDNDGIPENSGAPDQTFDDWKLQGI 583
Query: 692 SAYSGGLWVAALQAASALA----------REVGDRGS--------EDYFLFKFQKAKVVY 733
SAY GGLW+AAL+AA +A E+ R + +++ + Q + +
Sbjct: 584 SAYCGGLWIAALEAAIKIAEILLKNVPTTEELQSRNNPESIKHYVKNHRDWLEQSRSIYH 643
Query: 734 EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL 793
+ LWNG Y+ D+ S S + ADQL GQ+YAR L +V+ SAL KVY L
Sbjct: 644 DTLWNGEYYKLDS--QSGSDVVMADQLCGQFYARLLNLPDVVETQYTESALNKVYEACFL 701
Query: 794 KVMGGKRGAVNGMLPDGRV-DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYE 852
K GK GA NGM PDG D +S +E+W+G+ + +AA ++ D F+ + +
Sbjct: 702 KFQDGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGLAAFLLQMGRKDAAFKLTEAVVK 761
Query: 853 AAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
+ G F+TPEA +R+ Y+R +AIW + LT +P
Sbjct: 762 QVYEN---GLQFRTPEAITAVGTFRASHYLRAMAIWGIYGILTHFRP 805
Score = 195 bits (495), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 128/389 (32%), Positives = 181/389 (46%), Gaps = 41/389 (10%)
Query: 94 SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCP 153
+HG+PLGG GSG IGRS G+F W + + + + QFSVF L
Sbjct: 40 NHGMPLGGFGSGCIGRSPSGDFNLWHLDGGEHVFRSIPSCQFSVFEQPEGEDAQVYALST 99
Query: 154 KTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPH 213
+ PE + W W +K TYHALYPR+W +EG +L +C Q SPIIP+
Sbjct: 100 QPPE------EGTLSRWSW-YPPEKGTYHALYPRSWYQYEGVFKSQL--ICEQFSPIIPN 150
Query: 214 NYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFTG 257
NY+E+SYP+ +F +T +N ++++ TW N VG G E+
Sbjct: 151 NYQETSYPIGIFEWTAHNPTDKPITLSIMVTWQNIVGWFTNAIKSPEITVRDDGSPEYQY 210
Query: 258 QHY--NSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDG-------VHVSLCPHFVI 308
Q +S N V + R L + QE +G + S+ ++
Sbjct: 211 QPRWGDSTGNFNQWVQD---NFRVGFILNRIQLHDQIQEGEGQICIASVTNPSVEVFYLG 267
Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
N G ++W +GS L E +EPG I A+A T+ P ++ F
Sbjct: 268 KWNPKG-DGSEVWDYFAMNGS---LPDTEDETPAEPGEQIAVAMAIRFTIRPGRTKKIPF 323
Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
LAWD P F G YYRRYT FYG + + R A+ W ++E WQ PIL+
Sbjct: 324 ILAWDFPVTEFAPGVQYYRRYTDFYGRNGKNGWAMVRTALKHSDVWYERVEEWQSPILKR 383
Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGS 457
+ LP+W+ + LFNELY L GG +WT S
Sbjct: 384 EDLPDWFKMALFNELYLLTDGGTLWTAAS 412
>gi|422303239|ref|ZP_16390593.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9806]
gi|389791807|emb|CCI12404.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9806]
Length = 784
Score = 247 bits (630), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 141/379 (37%), Positives = 208/379 (54%), Gaps = 17/379 (4%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
++ IGQF LE I+Y + + DV Y SF L+ML+P+++ ++ FA A+ D + +
Sbjct: 412 QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTPRII 471
Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
+ RK A PHD+G + PW + N D WKDL FVL VYRD + TG
Sbjct: 472 GYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTG 531
Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
D+ F + WPS+ + + Y+ FD D DG+ EN G PDQT+D W + GISAY GGLW+A
Sbjct: 532 AKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIA 591
Query: 702 ALQAASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQAD 758
AL+A + +G ED +F Q+++ +Y + LWNG Y+N D+ S S + D
Sbjct: 592 ALEATIKIGEIIG----EDTAIFTSWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTD 645
Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSM 817
QL GQ+YAR L +V+ +SAL K+Y LK GGK GA NG+ PDG + +
Sbjct: 646 QLCGQFYARLLSLPDVVENQYTQSALSKIYQSCFLKFHGGKFGAANGLKPDGTPENPEAT 705
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
E+W G+ + + A ++ + G+Q + + G F+TPEA + +R
Sbjct: 706 HPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVRQVYEN---GLQFRTPEAITSVGTFR 762
Query: 878 SLCYMRPLAIWAMQWALTR 896
+ Y+R +AIWA+ LT
Sbjct: 763 ACHYLRAMAIWAIYGVLTE 781
Score = 184 bits (467), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 119/383 (31%), Positives = 179/383 (46%), Gaps = 39/383 (10%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS +G+F W + + A QFS+F +G + + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
+P T + W W +K Y ALYPR+W +E + + +++C Q SP+ HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGNYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVL 274
Y+ESSYP+++F +T +N I+++ TW N++G FT + K+ D V
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGW---FTNAIKSPVVKVRDDGSPVY 208
Query: 275 LHHR-------TSHQLPPVTFALAA---------QETDG-------VHVSLCPHFVISGN 311
+ +Q F + E DG + SL ++ N
Sbjct: 209 EYQSKWGESEGNYNQWIVDNFRVGCLMGRDYENPGEGDGQMAIASITNPSLEVFYLGKWN 268
Query: 312 SLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLA 371
G ++W +GS L +E +E G A+A T+ P ++ F LA
Sbjct: 269 PAG-DGGEVWDYFAMNGS---LPDIEDETPAEKGEQTATAMAIRFTIRPGKTRKIPFILA 324
Query: 372 WDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRL 431
WD P F G YYRRYT F+G + A + R A+ +W I WQ+PILE L
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDL 384
Query: 432 PEWYPITLFNELYYLNAGGAVWT 454
+W+ + LFNELY L GG +WT
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWT 407
>gi|425442611|ref|ZP_18822852.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9717]
gi|389716273|emb|CCH99457.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9717]
Length = 784
Score = 246 bits (629), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 140/379 (36%), Positives = 209/379 (55%), Gaps = 17/379 (4%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
++ IGQF LE I+Y + + DV Y SF L+ML+P+++ ++ F+ A+ D + +
Sbjct: 412 QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFSRAIPSSDDTPRII 471
Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
+ RK A PHD+G + PW + N D WKDL FVL VYRD + TG
Sbjct: 472 GYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTG 531
Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
D+ F + WPS+ + + Y+ FD D DG+ EN G PDQT+D W + GISAY GGLW+A
Sbjct: 532 AKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIA 591
Query: 702 ALQAASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQAD 758
AL+A + +G ED +F Q+++ +Y + LWNG Y+N D+ S S + D
Sbjct: 592 ALEATIKIGEIIG----EDTAIFTSWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTD 645
Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSM 817
QL GQ+YAR L +V+ +SAL K+Y LK GGK GA NG+ PDG + +
Sbjct: 646 QLCGQFYARLLSLPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPEAT 705
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
E+W G+ + + A ++ + G+Q + + + G F+TPEA + +R
Sbjct: 706 HPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVKQVYEN---GLQFRTPEAITSVGTFR 762
Query: 878 SLCYMRPLAIWAMQWALTR 896
+ Y+R +AIWA+ LT
Sbjct: 763 ACHYLRAMAIWAIYGVLTE 781
Score = 185 bits (470), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 121/383 (31%), Positives = 182/383 (47%), Gaps = 39/383 (10%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS +G+F W + + A QFS+F +G + + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
+P KD T + W W +K +Y ALYPR+W +E + + +++C Q SP+ HN
Sbjct: 101 SP---KDGT---LSRWSW-YPAEKGSYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVL 274
Y+ESSYP+++F +T +N I+++ TW N++G FT + K+ D V
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGW---FTNAIKSPVVKVRDDGSPVY 208
Query: 275 LHHR-------TSHQLPPVTFALAA---------QETDG-------VHVSLCPHFVISGN 311
+ +Q F + E DG + SL ++ N
Sbjct: 209 EYQSKWGESEGNYNQWIVDNFRVGCLMGRDYENPGEGDGQMAIASITNPSLEVFYLGKWN 268
Query: 312 SLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLA 371
G ++W +GS L +E +E G A+A T+ P ++ F LA
Sbjct: 269 PAG-DGGEVWDYFAMNGS---LPDIEDETPAEKGEQTATAMAIRFTIRPGKTRKIPFILA 324
Query: 372 WDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRL 431
WD P F G YYRRYT F+G + A + R A+ +W I WQ+PILE L
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDL 384
Query: 432 PEWYPITLFNELYYLNAGGAVWT 454
+W+ + LFNELY L GG +WT
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWT 407
>gi|425447791|ref|ZP_18827773.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9443]
gi|389731620|emb|CCI04385.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9443]
Length = 784
Score = 246 bits (628), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 141/379 (37%), Positives = 209/379 (55%), Gaps = 17/379 (4%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
++ IGQF LE I+Y + + DV Y SF L+ML+P+++ ++ FA A+ D + +
Sbjct: 412 QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTPRII 471
Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
+ RK A PHD+G + PW + N D WKDL FVL VYRD + TG
Sbjct: 472 GYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTG 531
Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
D+ F + WPS+ + + Y+ FD D DG+ EN G PDQT+D W + GISAY GGLW+A
Sbjct: 532 AKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIA 591
Query: 702 ALQAASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQAD 758
AL+A + +G ED +F Q+++ +Y + LWNG Y+N D+ S S + D
Sbjct: 592 ALEATLKIGAILG----EDTAIFASWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTD 645
Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSM 817
QL GQ+YAR L +V+ +SAL K+Y LK GGK GA NG+ PDG + +
Sbjct: 646 QLCGQFYARLLSLPDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPENPEAT 705
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
E+W G+ + + A ++ + G+Q + + + G F+TPEA + +R
Sbjct: 706 HPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVQQIYEN---GLQFRTPEAITSVGTFR 762
Query: 878 SLCYMRPLAIWAMQWALTR 896
+ Y+R +AIWA+ LT
Sbjct: 763 ACHYLRAMAIWAIYGVLTE 781
Score = 169 bits (427), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 116/387 (29%), Positives = 181/387 (46%), Gaps = 47/387 (12%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G G IGRS +G+F W + + A QFS+F +G + + +
Sbjct: 41 HGMPLGGFGGGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
+P+ A + W W +K TY ALYPR+W +E + + +++C Q SP+ HN
Sbjct: 101 SPKY------ATLSRWLW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG------------------------ 250
Y+ESSYP+++F +T++N I+++ TW N++G
Sbjct: 152 YQESSYPIALFEWTVHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211
Query: 251 ---GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFV 307
G+SE +YN N V ++ + A+A+ + V +
Sbjct: 212 SKWGESE---GNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEVFYLGKWN 268
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
+G+ ++W +GS L +E ++ G A+A T+ P ++
Sbjct: 269 PAGD-----GSEVWDYFAMNGS---LPDIEDETPAQKGEQTATAMAIRFTIRPGKTRKIP 320
Query: 368 FSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
F LAWD P F G YYRRYT F+G + A + R A+ +W I WQ+PILE
Sbjct: 321 FILAWDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILE 380
Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWT 454
L +W+ + LFNELY L GG +WT
Sbjct: 381 RSDLADWFKMALFNELYLLADGGTLWT 407
>gi|425450263|ref|ZP_18830094.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 7941]
gi|389768962|emb|CCI06046.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 7941]
Length = 784
Score = 246 bits (627), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 141/379 (37%), Positives = 208/379 (54%), Gaps = 17/379 (4%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
++ IGQF LE I+Y + + DV Y SF L+ML+P+++ ++ FA A+ D + +
Sbjct: 412 QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTPRII 471
Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
+ RK A PHD+G + PW + N D WKDL FVL VYRD + TG
Sbjct: 472 GYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTG 531
Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
D+ F + WPS+ + + Y+ FD D DG+ EN G PDQT+D W + GISAY GGLW+A
Sbjct: 532 AKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIA 591
Query: 702 ALQAASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQAD 758
AL+A + +G ED +F Q+++ +Y + LWNG Y+N D+ S S + D
Sbjct: 592 ALEATIKIGAILG----EDTAIFASWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTD 645
Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSM 817
QL GQ+YAR L +V+ +SAL K+Y LK GGK GA NG+ PDG + +
Sbjct: 646 QLCGQFYARLLALPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPEAT 705
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
E+W G+ + + A ++ + G+Q + + G F+TPEA + +R
Sbjct: 706 HPLEVWVGINFGLVAFLLQMGMEKQGWQITQAVVRQIYEN---GLQFRTPEAITSVGTFR 762
Query: 878 SLCYMRPLAIWAMQWALTR 896
+ Y+R +AIWA+ LT
Sbjct: 763 ACHYLRAMAIWAIYGVLTE 781
Score = 182 bits (462), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 114/387 (29%), Positives = 180/387 (46%), Gaps = 47/387 (12%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS +G+F W + + A QFS+F +G + + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLPACQFSIFEQTEDGSAQAYAMATE 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
+P T + W W +K Y ALYPR+W +E + + +++C Q SP+ HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGNYGALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG------------------------ 250
Y+ESSYP+++F +T +N I+++ TW N++G
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211
Query: 251 ---GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFV 307
G+SE +YN N V ++ + A+A+ + V +
Sbjct: 212 SKWGESE---GNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEVFYLGKWN 268
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
+G+ ++W +GS L ++ ++ G A+A T+ P ++
Sbjct: 269 PAGD-----GGEVWDYFAMNGS---LPDIDDETPAQKGEQTATAMAIRFTIRPGKTRKIP 320
Query: 368 FSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
F LAWD P F G YYRRYT F+G + A + R A+ + +W I WQ+PILE
Sbjct: 321 FILAWDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKQGDTWRENIINWQKPILE 380
Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWT 454
L +W+ + LFNELY L GG +WT
Sbjct: 381 RSDLADWFKMALFNELYLLTDGGTLWT 407
>gi|218247014|ref|YP_002372385.1| hypothetical protein PCC8801_2201 [Cyanothece sp. PCC 8801]
gi|218167492|gb|ACK66229.1| protein of unknown function DUF608 [Cyanothece sp. PCC 8801]
Length = 805
Score = 246 bits (627), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 148/407 (36%), Positives = 212/407 (52%), Gaps = 30/407 (7%)
Query: 518 GTNLLQDGEEN--IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAV 575
G L EN +GQF LE ++Y + + DV Y SF L+ML+P+++ S+ FA A+
Sbjct: 404 GGTLWTAASENDPVGQFGVLECLDYRWYESLDVRLYGSFGLMMLWPRLEKSVLEAFARAI 463
Query: 576 MMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVL 633
HD + + + RK GA PHD+G + PW + N D WKDL FVL
Sbjct: 464 PTHDDTPRIIGYNQAKAIRKAKGATPHDLGAPNEHPWQKTNYTSYQDCNLWKDLGSDFVL 523
Query: 634 QVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
QVYRD + TG D F WP++ + Y+ FD D DG+ EN G PDQT+D W + GI
Sbjct: 524 QVYRDYLLTGSDDTDFLWECWPAITETLDYLKTFDLDNDGIPENSGAPDQTFDDWKLQGI 583
Query: 692 SAYSGGLWVAALQAASALA----------REVGDRGS--------EDYFLFKFQKAKVVY 733
SAY GGLW+AAL+AA +A E+ R + +++ + Q + +
Sbjct: 584 SAYCGGLWIAALEAAIKIAEILLKNVPTTEELQSRNNPESIKHYVKNHRDWLEQSRSIYH 643
Query: 734 EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL 793
+ LWNG Y+ D+ S S + ADQL GQ+YAR +V+ SAL KVY L
Sbjct: 644 DTLWNGEYYKLDS--QSGSDVVMADQLCGQFYARLLNFPDVVETQYTESALNKVYEACFL 701
Query: 794 KVMGGKRGAVNGMLPDGRV-DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYE 852
K GK GA NGM PDG D +S +E+W+G+ + +AA ++ D F+ + +
Sbjct: 702 KFQDGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGLAAFLLQMGRKDAAFKLTEAVVK 761
Query: 853 AAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
+ G F+TPEA +R+ Y+R +AIW + LT +P
Sbjct: 762 QVYEN---GLQFRTPEAITAVGTFRASHYLRAMAIWGIYGILTHFRP 805
Score = 195 bits (495), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 128/389 (32%), Positives = 181/389 (46%), Gaps = 41/389 (10%)
Query: 94 SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCP 153
+HG+PLGG GSG IGRS G+F W + + + + QFSVF L
Sbjct: 40 NHGMPLGGFGSGCIGRSPSGDFNLWHLDGGEHVFRSIPSCQFSVFEQPEGEDAQVYALST 99
Query: 154 KTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPH 213
+ PE + W W +K TYHALYPR+W +EG +L +C Q SPIIP+
Sbjct: 100 QPPE------EGTLSRWSW-YPPEKGTYHALYPRSWYQYEGVFKSQL--ICEQFSPIIPN 150
Query: 214 NYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFTG 257
NY+E+SYP+ +F +T +N ++++ TW N VG G E+
Sbjct: 151 NYQETSYPIGIFEWTAHNPTDKPITLSIMVTWQNIVGWFTNAIKSPEITVRDDGSPEYQY 210
Query: 258 QHY--NSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDG-------VHVSLCPHFVI 308
Q +S N V + R L + QE +G + S+ ++
Sbjct: 211 QPRWGDSTGNFNQWVQD---NFRVGFILNRIQLHDEIQEGEGQICIASVTNPSVEVFYLG 267
Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
N G ++W +GS L E +EPG I A+A T+ P ++ F
Sbjct: 268 KWNPKG-DGSEVWDYFAMNGS---LPDTEDETPAEPGEQIAVAMAIRFTIRPGRTKKIPF 323
Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
LAWD P F G YYRRYT FYG + + R A+ W ++E WQ PIL+
Sbjct: 324 ILAWDFPVTEFAPGVQYYRRYTDFYGRNGKNGWAMVRTALKHSDVWYERVEEWQSPILKR 383
Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGS 457
+ LP+W+ + LFNELY L GG +WT S
Sbjct: 384 EDLPDWFKMALFNELYLLTDGGTLWTAAS 412
>gi|425471589|ref|ZP_18850441.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9701]
gi|389882505|emb|CCI37035.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9701]
Length = 784
Score = 246 bits (627), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 139/374 (37%), Positives = 206/374 (55%), Gaps = 13/374 (3%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
IGQF LE I+Y + + DV Y SF L+ML+P+++ ++ FA A+ D + + +
Sbjct: 415 IGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTPRIIGYN 474
Query: 589 GQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--D 644
RK A PHD+G + PW + N D WKDL FVL VYRD + TG D
Sbjct: 475 QASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKD 534
Query: 645 KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
+ F + WPS+ + + Y+ FD D DG+ EN G PDQT+D W + GISAY GGLW+AAL+
Sbjct: 535 EDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALE 594
Query: 705 AASALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQ 763
A + +G+ + F Q+++ +Y + LWNG Y+N D+ S S + DQL GQ
Sbjct: 595 ATIKIGAILGE--DTEIFASWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTDQLCGQ 650
Query: 764 WYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSMQSREI 822
+YAR L +V+ +SAL K+Y LK GGK GA NG+ PDG + + E+
Sbjct: 651 FYARLLALPDVVENQYTQSALGKIYQACFLKFHGGKFGAANGLKPDGTPENPQATHPLEV 710
Query: 823 WSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYM 882
W G+ + + A ++ + G+Q + + + G F+TPEA + +R+ Y+
Sbjct: 711 WVGINFGLVAFLLQMGMEKQGWQITEAVVKQVYEN---GLQFRTPEAITSVGTFRACHYL 767
Query: 883 RPLAIWAMQWALTR 896
R +AIWA+ LT
Sbjct: 768 RAMAIWAIYGILTE 781
Score = 185 bits (470), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 120/383 (31%), Positives = 179/383 (46%), Gaps = 39/383 (10%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS +G+F W + + A QFS+F +G + + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
P T + W W +K TY ALYPR+W +E + + +++C Q SP+ HN
Sbjct: 101 RP------TDGTLSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVL 274
Y+ESSYP+++F +T +N I+++ TW N++G FT + K+ D V
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGW---FTNAIKSPVVKVRDDGSPVY 208
Query: 275 LHHR-------TSHQLPPVTFALAA---------QETDG-------VHVSLCPHFVISGN 311
+ +Q F + E DG + SL ++ N
Sbjct: 209 EYQSKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIATITNPSLEVFYLGKWN 268
Query: 312 SLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLA 371
G ++W +GS L +E +E G A+A T+ P ++ F LA
Sbjct: 269 PAG-DGGEVWDYFAMNGS---LPDIEDETPAEKGEQTATAMAIRFTIRPGKTRKIPFILA 324
Query: 372 WDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRL 431
WD P F G YYRRYT F+G + A + R A+ +W I WQ+PILE L
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDL 384
Query: 432 PEWYPITLFNELYYLNAGGAVWT 454
+W+ + LFNELY L GG +WT
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWT 407
>gi|425464233|ref|ZP_18843555.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9809]
gi|389833806|emb|CCI21373.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9809]
Length = 784
Score = 245 bits (626), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 140/379 (36%), Positives = 209/379 (55%), Gaps = 17/379 (4%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
++ IGQF LE I+Y + + DV Y SF L+ML+P+++ ++ FA A+ D + +
Sbjct: 412 QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTPRII 471
Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
+ RK A PHD+G + PW + N D WKDL FVL VYRD + TG
Sbjct: 472 GYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTG 531
Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
D+ F + WPS+ + + Y+ FD D DG+ EN G PDQT+D W + GISAY GGLW+A
Sbjct: 532 AKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIA 591
Query: 702 ALQAASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQAD 758
AL+A + +G ED +F Q+++ +Y + LWNG Y+N D+ S S + D
Sbjct: 592 ALEATIKIGAILG----EDTAIFASWLQQSRTIYHQTLWNGEYYNLDS--GSGSDVVMTD 645
Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSM 817
QL GQ+Y+R L +V+ +SAL K+Y LK GGK GA NG+ PDG + +
Sbjct: 646 QLCGQFYSRLLALPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPQAT 705
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
E+W G+ + + A ++ + G+Q + + + G F+TPEA + +R
Sbjct: 706 HPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVKQVYEN---GLQFRTPEAITSVGTFR 762
Query: 878 SLCYMRPLAIWAMQWALTR 896
+ Y+R +AIWA+ LT
Sbjct: 763 ACHYLRAMAIWAIYGVLTE 781
Score = 185 bits (469), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 120/383 (31%), Positives = 179/383 (46%), Gaps = 39/383 (10%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS +G+F W + + A QFS+F +G + + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
+P T + W W +K TY ALYPR+W +E + + +++C Q SP+ HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVL 274
Y+ESSYP+++F +T +N I+++ TW N++G FT + K+ D V
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGW---FTNAIKSPVVKVRDDGSPVY 208
Query: 275 LHHR-------TSHQLPPVTFALAA---------QETDG-------VHVSLCPHFVISGN 311
+ +Q F + E DG + SL ++ N
Sbjct: 209 EYQSKWGESEGNYNQWIVDNFRVGCLMGRDYENPGEGDGQMAIASITNPSLEVFYLGKWN 268
Query: 312 SLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLA 371
G ++W +GS L +E +E G A+A T+ P ++ F LA
Sbjct: 269 PAG-DGGEVWDYFAMNGS---LPDIEDETPAEKGEQTATAMAIRFTIRPGKTRKIPFILA 324
Query: 372 WDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRL 431
WD P F G YYRRYT F+G + A + R A+ +W I WQ+PILE L
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERTDL 384
Query: 432 PEWYPITLFNELYYLNAGGAVWT 454
W+ + LFNELY L GG +WT
Sbjct: 385 AAWFKMALFNELYLLTDGGTLWT 407
>gi|425455967|ref|ZP_18835678.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9807]
gi|389803038|emb|CCI17984.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9807]
Length = 784
Score = 245 bits (625), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 141/379 (37%), Positives = 208/379 (54%), Gaps = 17/379 (4%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
++ IGQF LE I+Y + + DV Y SF L+ML+P+++ ++ FA A+ D + +
Sbjct: 412 QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTPRII 471
Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
+ RK A PHD+G + PW + N D WKDL FVL VYRD + TG
Sbjct: 472 GYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTG 531
Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
D+ F + WPS+ + + Y+ FD D DG+ EN G PDQT+D W + GISAY GGLW+A
Sbjct: 532 AKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIA 591
Query: 702 ALQAASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQAD 758
AL+A + +G ED +F Q+++ +Y + LWNG Y+N D+ S S + D
Sbjct: 592 ALEATLKIGAILG----EDTAIFASWLQQSRAIYHDTLWNGEYYNLDS--GSGSDVVMTD 645
Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSM 817
QL GQ+YAR L +V+ +SAL K+Y LK GGK GA NG+ PDG + +
Sbjct: 646 QLCGQFYARLLSLPDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPENPEAT 705
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
E+W G+ + + A ++ + G+Q + + G F+TPEA + +R
Sbjct: 706 HPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQIYEN---GLQFRTPEAITSVGTFR 762
Query: 878 SLCYMRPLAIWAMQWALTR 896
+ Y+R +AIWA+ LT
Sbjct: 763 ACHYLRAMAIWAIYGVLTE 781
Score = 187 bits (475), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 118/387 (30%), Positives = 183/387 (47%), Gaps = 47/387 (12%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS +G+F W + + A QFS+F +G + + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
+P KD T + W W +K TY ALYPR+W +E + + +++C Q SP+ HN
Sbjct: 101 SP---KDGT---LSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG------------------------ 250
Y+ESSYP+++F +T++N I+++ TW N++G
Sbjct: 152 YQESSYPIALFEWTVHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211
Query: 251 ---GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFV 307
G+SE +YN N V ++ + A+A+ + V +
Sbjct: 212 SKWGESE---GNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEVFYLGKWN 268
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
+G+ ++W +GS L +E ++ G A+A T+ P ++
Sbjct: 269 PAGD-----GSEVWDYFAMNGS---LPDIEDETPAQKGEQTATAMAIRFTIRPGKTRKIP 320
Query: 368 FSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
F LAWD P F G YYRRYT F+G + A + R A+ +W I WQ+PILE
Sbjct: 321 FILAWDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILE 380
Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWT 454
L +W+ + LFNELY L GG +WT
Sbjct: 381 RTDLADWFKMALFNELYLLADGGTLWT 407
>gi|440753145|ref|ZP_20932348.1| hypothetical protein O53_1523 [Microcystis aeruginosa TAIHU98]
gi|440177638|gb|ELP56911.1| hypothetical protein O53_1523 [Microcystis aeruginosa TAIHU98]
Length = 789
Score = 245 bits (625), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 139/373 (37%), Positives = 207/373 (55%), Gaps = 17/373 (4%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
++ IGQF LE I+Y + + DV Y SF L+ML+P+++ ++ FA A+ D + +
Sbjct: 412 QDAIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTPRII 471
Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
+ RK A PHD+G + PW + N D WKDL FVL VYRD + TG
Sbjct: 472 GYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTG 531
Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
D+ F + WPS+ + + Y+ FD D DG+ EN G PDQT+D W + GISAY GGLW+A
Sbjct: 532 AKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIA 591
Query: 702 ALQAASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQAD 758
AL+A + +G ED +F Q+++ +Y + LWNG Y+N D+ S S + D
Sbjct: 592 ALEATLKIGAILG----EDTAIFASWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTD 645
Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSM 817
QL GQ+YAR L +V+ +SAL K+Y LK GGK GA NG+ PDG + +
Sbjct: 646 QLCGQFYARLLALPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPEAT 705
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
E+W G+ + + A ++ + G+Q + + + G F+TPEA + +R
Sbjct: 706 HPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVKQVYEN---GLQFRTPEAITSVGTFR 762
Query: 878 SLCYMRPLAIWAM 890
+ Y+R +AIWA+
Sbjct: 763 ACHYLRAMAIWAI 775
Score = 185 bits (469), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 116/387 (29%), Positives = 179/387 (46%), Gaps = 47/387 (12%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS +G+F W + + A QFS+F +G + + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGNAQAYAMATE 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
+P T + W W +K Y ALYPR+W +E + + +++C Q SP+ HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGNYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG------------------------ 250
Y+ESSYP+++F +T +N I+++ TW N++G
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211
Query: 251 ---GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFV 307
G+SE +YN N V ++ + A+A+ + V +
Sbjct: 212 SKWGESE---GNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEVFYLGKWN 268
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
+G+ ++W +GS L +E +E G A+A T+ P ++
Sbjct: 269 PAGD-----GGEVWDYFAMNGS---LPDIEDETPAEKGEQTATAMAIRFTIRPGKTRKIP 320
Query: 368 FSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
F LAWD P F G YYRRYT F+G + A + R A+ +W I WQ+PILE
Sbjct: 321 FILAWDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILE 380
Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWT 454
L +W+ + LFNELY L GG +WT
Sbjct: 381 RSDLADWFKMALFNELYLLTDGGTLWT 407
>gi|166363582|ref|YP_001655855.1| glycosidase-like protein [Microcystis aeruginosa NIES-843]
gi|166085955|dbj|BAG00663.1| glycosidase-like protein [Microcystis aeruginosa NIES-843]
Length = 784
Score = 244 bits (624), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 138/374 (36%), Positives = 206/374 (55%), Gaps = 13/374 (3%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
IGQF LE I+Y + + DV Y SF L+ML+P+++ ++ FA A+ D + + +
Sbjct: 415 IGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTPRIIGYN 474
Query: 589 GQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--D 644
RK A PHD+G + PW + N D WKDL FVL VYRD + TG D
Sbjct: 475 QASAIRKTKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKD 534
Query: 645 KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
+ F + WPS+ + + Y+ FD D DG+ EN G PDQT+D W + GISAY GGLW+AAL+
Sbjct: 535 EDFLRECWPSIVLTLQYLKTFDLDKDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALE 594
Query: 705 AASALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQ 763
A + +G+ + F Q+++ +Y + LWNG Y+N D+ S S + DQL GQ
Sbjct: 595 ATLKIGAILGE--DTEIFASWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTDQLCGQ 650
Query: 764 WYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSMQSREI 822
+Y+R L +V+ +SAL K+Y LK GGK GA NG+ PDG + + E+
Sbjct: 651 FYSRLLALPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPQATHPLEV 710
Query: 823 WSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYM 882
W G+ + + A ++ + G+Q + + + G F+TPEA + +R+ Y+
Sbjct: 711 WVGINFGLVAFLMQMGMEKQGWQITEAVVKQVYEN---GLQFRTPEAITSVGTFRACHYL 767
Query: 883 RPLAIWAMQWALTR 896
R +AIWA+ LT
Sbjct: 768 RAMAIWAIYGVLTE 781
Score = 184 bits (466), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 119/381 (31%), Positives = 180/381 (47%), Gaps = 35/381 (9%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS +G+F W + + A QFS+F +G + + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
+P T + W W +K TY ALYPR+W +E + + +++C Q SP+ HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG-------------GDSEFTGQHYN 261
Y+ESSYP+++F +T +N I+++ TW N++G D Y
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNGIKSPVVKVRDDGSPVYEYQ 211
Query: 262 SKTKMNDGVH--------AVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSL 313
SK ++G + V R ++ P A T + SL ++ N
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIATI-TNPSLEVFYLGKWNPA 270
Query: 314 GLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWD 373
G ++W +GS L +E ++ G A+A T+ P ++ F LAWD
Sbjct: 271 G-DGGEVWDYFAMNGS---LPDIEDETPAQKGEQTATAMAIRFTIRPGKTRKIPFILAWD 326
Query: 374 CPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
P F G YYRRYT F+G + A + R A+ +W I WQ+PILE L +
Sbjct: 327 LPVNEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDLAD 386
Query: 434 WYPITLFNELYYLNAGGAVWT 454
W+ + LFNELY L GG +WT
Sbjct: 387 WFKMALFNELYLLTDGGTLWT 407
>gi|425460273|ref|ZP_18839755.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9808]
gi|389827051|emb|CCI21967.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9808]
Length = 784
Score = 242 bits (618), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 140/379 (36%), Positives = 207/379 (54%), Gaps = 17/379 (4%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
++ IGQF LE I+Y + + DV Y SF L+ML+P+++ ++ FA A+ D + +
Sbjct: 412 QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTTRII 471
Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
+ RK A PHD+G + PW + N D WKDL FVL VYRD + TG
Sbjct: 472 GYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTG 531
Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
D+ F + W S+ + + Y+ FD D DG+ EN G PDQT+D W + GISAY GGLW+A
Sbjct: 532 AKDEDFLRECWSSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIA 591
Query: 702 ALQAASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQAD 758
AL+A + +G ED +F Q+++ +Y + LWNG Y+N D+ S S + D
Sbjct: 592 ALEATLKIGAILG----EDTAIFASWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTD 645
Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSM 817
QL GQ+YAR L +V+ +SAL K+Y LK GGK GA NG+ PDG + +
Sbjct: 646 QLCGQFYARLLALPDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPENPEAT 705
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
E+W G+ + + A ++ + G+Q + + G F+TPEA + +R
Sbjct: 706 HPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQIYEN---GLQFRTPEAITSVGTFR 762
Query: 878 SLCYMRPLAIWAMQWALTR 896
+ Y+R +AIWA+ LT
Sbjct: 763 ACHYLRAMAIWAIYGVLTE 781
Score = 188 bits (477), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 118/389 (30%), Positives = 182/389 (46%), Gaps = 47/389 (12%)
Query: 93 SSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLC 152
S HG+PLGG G+G IGRS +G+F W + + A QFS+F +G + +
Sbjct: 39 SWHGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGNAQAYAMA 98
Query: 153 PKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIP 212
++P T + W W +K TY ALYPR+W +E + + +++C Q SP+
Sbjct: 99 TESP------TDGTLSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWA 149
Query: 213 HNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG---------------------- 250
HNY+ESSYP+++F +T +N I+++ TW N++G
Sbjct: 150 HNYQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYE 209
Query: 251 -----GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPH 305
G+SE +YN N V ++ + A+A+ + V
Sbjct: 210 YQSKWGESE---GNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEVFYLGK 266
Query: 306 FVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQ 365
+ +G+ ++W +GS L +E +E G A+A T+ P +
Sbjct: 267 WNPAGD-----GGEVWDYFAMNGS---LPDIEDETPAEKGEQTATAMAIRFTIRPGKTRK 318
Query: 366 VTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPI 425
+ F LAWD P F G YYRRYT F+G + A + R A+ + +W I WQ+PI
Sbjct: 319 IPFILAWDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKQGDTWRENIINWQKPI 378
Query: 426 LEDKRLPEWYPITLFNELYYLNAGGAVWT 454
LE L +W+ + LFNELY L GG +WT
Sbjct: 379 LERSDLADWFKMALFNELYLLTDGGTLWT 407
>gi|416392732|ref|ZP_11685894.1| hypothetical protein CWATWH0003_2700 [Crocosphaera watsonii WH
0003]
gi|357263592|gb|EHJ12579.1| hypothetical protein CWATWH0003_2700 [Crocosphaera watsonii WH
0003]
Length = 804
Score = 242 bits (618), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 142/390 (36%), Positives = 208/390 (53%), Gaps = 28/390 (7%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
+GQF LE ++Y + + DV Y SF L+ML+P+++ ++ FA A+ D + + +
Sbjct: 417 VGQFGVLECLDYCWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTIRIIGYN 476
Query: 589 GQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--D 644
RKV GA PHD+G + PW + N D WKDL FVLQVYRD + TG D
Sbjct: 477 QAEGFRKVKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFLLTGSDD 536
Query: 645 KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
F W ++ + Y+ +FD D DG+ EN G PDQT+D W +SGISAY GGLW+AAL+
Sbjct: 537 IDFLWECWAAIPETLNYLKEFDLDNDGIPENSGAPDQTFDDWKLSGISAYCGGLWIAALE 596
Query: 705 AASALAR------------------EVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDN 746
AA + E ++ + Y + Q + + LWNG Y+N D+
Sbjct: 597 AAIKIGEILLENSPNNPQLEPENYPESIEKELDKYNNWLQQSRAIYHSTLWNGEYYNLDS 656
Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
S S + ADQL GQ+YAR GL +V+ + SAL KVY LK GK GA NGM
Sbjct: 657 --KSGSDIVMADQLCGQFYARLLGLSDVVESEYTLSALNKVYEACFLKFQNGKYGAANGM 714
Query: 807 LPDGRV-DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
PDG D +S +E+W+G+ + +A+ +I + + + + + + G F+
Sbjct: 715 KPDGTPEDPNSTHPQEVWTGINFGIASFLIQMGMKNEALKLTETVVKQVYEN---GLQFR 771
Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAMQWALT 895
TPEA YR+ Y+R +AIW + + L+
Sbjct: 772 TPEAITAVGTYRACHYLRAMAIWGVYYQLS 801
Score = 193 bits (490), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 130/389 (33%), Positives = 179/389 (46%), Gaps = 43/389 (11%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W + K + + QFS+F +G + L +
Sbjct: 41 HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHIFKSIPSCQFSIFEQLEDGTAQAYALATE 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
PE D T + W W +K TY ALYP +W +EG E I C Q SPIIP N
Sbjct: 101 PPE---DKT---LSRWSW-YPTEKGTYSALYPCSWYQYEGVFQAE--IYCEQFSPIIPDN 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFTGQ 258
Y+E+SYP+ VF +T+ N ++++ TW N VG G E+ Q
Sbjct: 152 YQETSYPIGVFEWTVKNPTDKPITLSIMLTWQNIVGWFTNAIKSPEITVRDDGSPEYEYQ 211
Query: 259 --------HYNSKTKMNDGVHAVL--LHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI 308
++N + N V +L L Q +A+ V V +
Sbjct: 212 PRWGHSTGNFNQWVQDNFRVGFILNRLQPHEQVQEGEGQICIASVTNPSVEVFYLGKWNP 271
Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
G+ ++W GS L E +EPG I AA+A TV P ++ F
Sbjct: 272 DGD-----GSEVWDYFAMDGS---LPDQEDETPAEPGEQIAAAMAIRFTVKPGRVKKIPF 323
Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
LAWD P F G YYRRYT F+G + N + R A+ W ++E W IL++
Sbjct: 324 ILAWDLPVTEFTPGVQYYRRYTDFFGRNGNNGWAMVRTALKHSYVWRDKVEEWHNSILKN 383
Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGS 457
LP+W + LFNELY L GG +WT S
Sbjct: 384 DNLPDWLKMALFNELYLLTDGGTLWTAAS 412
>gi|126657976|ref|ZP_01729128.1| hypothetical protein CY0110_05157 [Cyanothece sp. CCY0110]
gi|126620614|gb|EAZ91331.1| hypothetical protein CY0110_05157 [Cyanothece sp. CCY0110]
Length = 804
Score = 242 bits (618), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 144/393 (36%), Positives = 210/393 (53%), Gaps = 28/393 (7%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
IGQF LE ++Y + + DV Y SF L+ML+P+++ ++ FA A+ D + + +
Sbjct: 417 IGQFGVLECMDYRWYESLDVRLYGSFGLLMLWPRLEKAVIEAFARAIPSSDDTIRIIGYN 476
Query: 589 GQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--D 644
RK GA PHD+G + PW + N D WKDL FVLQVYRD + TG D
Sbjct: 477 QAEGIRKAKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFLLTGADD 536
Query: 645 KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
+F W +V A+ Y+ +FD D DG+ EN G PDQT+D W + G+SAY G LW+AAL+
Sbjct: 537 IEFLWECWEAVPEALNYLKEFDLDNDGIPENSGAPDQTFDDWELRGMSAYCGALWIAALE 596
Query: 705 AASALAR------------------EVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDN 746
AA + E ++ + Y + Q V + LWNG Y+ D+
Sbjct: 597 AAIKIGEILLENRPNNPQLEPKNYPECIEKELDKYQNWLQQSRSVYHSTLWNGEYYKLDS 656
Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
S S + ADQL+GQ+YAR GL +V+ A SALKKVY LK GK GA NGM
Sbjct: 657 --ESGSDVVMADQLSGQFYARLLGLPDVVENQYALSALKKVYEACFLKFQNGKYGAANGM 714
Query: 807 LPDGRV-DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
PDG D +S +E+W+G+ + +A+ +I + D + + + + G F+
Sbjct: 715 KPDGTPEDPNSTHPQEVWTGINFGLASFLIQMGMKDEALKLTEAVVKQVYEN---GLQFR 771
Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
TPEA +R+ Y+R +AIW + + L++ K
Sbjct: 772 TPEAITAVGTFRACHYLRAMAIWGIYYQLSKTK 804
Score = 193 bits (490), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 129/389 (33%), Positives = 181/389 (46%), Gaps = 43/389 (11%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W + K + + QFS+F +G + L +
Sbjct: 41 HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHIFKSIPSCQFSIFEQLEDGTAQAYALSTE 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
PE D T + W W + +K +Y ALYP +W +EG E I C Q SPIIP+N
Sbjct: 101 PPE---DNT---LSRWSWYPQ-EKGSYSALYPCSWYQYEGVFQTE--IYCEQFSPIIPNN 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFTGQ 258
Y+E+SYP+ VF +TI N ++++ TW N VG G E+ Q
Sbjct: 152 YQETSYPIGVFEWTIKNPTDKPITLSIMLTWQNIVGWFTNAIKSPEITVRDDGSPEYEYQ 211
Query: 259 --------HYNSKTKMNDGVHAVL--LHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI 308
++N + N V +L L Q +A+ V V +
Sbjct: 212 PRWGHSTGNFNQWIQDNFRVGFILNRLQPHEQVQEGEGQICIASVTNPSVEVFYLGKWNP 271
Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
+G+ ++W +GS L E +EPG I AA+A T+ P ++ F
Sbjct: 272 NGD-----GSEVWDYFAMNGS---LPDQEDETPAEPGEQIAAAMAIRFTIKPGRVKKIPF 323
Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
LAWD P F G YRRYT F+G + + R A+ W Q+E W IL+
Sbjct: 324 ILAWDLPVTEFAQGIQSYRRYTDFFGRNGQNGWAMVRTALKHSDVWRDQVEEWHNSILKQ 383
Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGS 457
K LP W+ + LFNELY L GG +WT S
Sbjct: 384 KDLPNWFKMALFNELYLLTDGGTIWTAAS 412
>gi|427709805|ref|YP_007052182.1| glucosylceramidase [Nostoc sp. PCC 7107]
gi|427362310|gb|AFY45032.1| Glucosylceramidase [Nostoc sp. PCC 7107]
Length = 802
Score = 242 bits (618), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 146/388 (37%), Positives = 215/388 (55%), Gaps = 25/388 (6%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
IGQF LE ++Y + + DV Y SF L++LFP+++ S+ R FA + D +
Sbjct: 420 IGQFAVLECLDYRWYESLDVRLYGSFGLLLLFPELEKSVMRAFARGIPQSDDHQRIIGYY 479
Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
+ D +RKV GA PHD+G + W + N C D WKDL+ FVLQVYRD +
Sbjct: 480 YTIGADTTTAARKVAGATPHDLGAPNEHVWEKTNYTCYQDCNLWKDLSCDFVLQVYRDYL 539
Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TG D +F W +V + Y+ FD+DGDG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 540 LTGADDVEFLADCWDAVVQTLDYLKTFDKDGDGIPENSGAPDQTFDDWRLLGVSAYCGGL 599
Query: 699 WVAALQAASALAR---------EVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGS 749
W+AAL+AA A++ E ++ Y ++ Q + EKLWNG Y+ D+
Sbjct: 600 WLAALEAAIAISDILITNHKVVESAEKQKSIYEVWLAQSRPIYQEKLWNGQYYRLDS--E 657
Query: 750 SQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD 809
S S+ + ADQL GQ+YAR L IV D+A SALK VY+ LK G+ GA NG+LPD
Sbjct: 658 SGSAVVMADQLCGQFYARLLELPDIVPSDRALSALKTVYDACFLKFQNGEFGAANGVLPD 717
Query: 810 GRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
G + + E+W+G+ + +AA ++ + D + + E ++ G F+TPE
Sbjct: 718 GSPENPKATHPLEVWTGINFGLAAFLVQMGMQDEALRLTQAVVEQVYNN---GLQFRTPE 774
Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTR 896
A +R+ Y+R +AIW + +++
Sbjct: 775 AITATGTFRASTYLRAMAIWGIYLVMSK 802
Score = 196 bits (499), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 135/391 (34%), Positives = 188/391 (48%), Gaps = 43/391 (10%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W I + V A QFSVF S SS
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFQNVPACQFSVFESYGT----SSQAYAL 95
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKST----YHALYPRAWTVHEGEPDPELRIVCRQISPI 210
+ + L+D + + +W W +ST YHALYPR+W V+E +L C Q SPI
Sbjct: 96 STQSLEDGS---LNAWQWYPASSESTSTGTYHALYPRSWFVYENVFQSQL--TCEQFSPI 150
Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGV 270
+NY+E+SYPV+VF + +N ++++ TW N VG FT + + K+ D
Sbjct: 151 WANNYQETSYPVAVFLWKAHNPSNAPITLSIMLTWENMVGW---FTNALKSPEVKIRDDG 207
Query: 271 HAVLLHHRTSHQ------------------LPPVTFALAAQETDG---VHVSLCPHFVIS 309
V + + L V QE DG + P I
Sbjct: 208 SPVYEYQPRWGESQGNYNQVVENSEYFGCFLGRVGITAPVQEGDGSWCIATVKHPQVEIF 267
Query: 310 GNSLGL---TAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQV 366
+S T ++W GS N ++T+ E +G+AIA V P ++
Sbjct: 268 HHSKWHPEGTGDEVWQSFAADGSLP--NYLDTNPVGE-NQRLGSAIAVRFNVQPGETLEI 324
Query: 367 TFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
F LAWDCP F +G YYRRYT F+G N A IA A+ + +W QI++WQ PIL
Sbjct: 325 PFVLAWDCPVTEFAAGVNYYRRYTDFFGKDGNQAWEIATTALGNYQTWRSQIQSWQAPIL 384
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
+ LP W+ + LFNELY L +GG +W+ S
Sbjct: 385 NREDLPAWFKMALFNELYDLTSGGTLWSAAS 415
>gi|390437958|ref|ZP_10226465.1| Glycosidase-like protein [Microcystis sp. T1-4]
gi|389838618|emb|CCI30589.1| Glycosidase-like protein [Microcystis sp. T1-4]
Length = 784
Score = 242 bits (617), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 140/379 (36%), Positives = 207/379 (54%), Gaps = 17/379 (4%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
++ IGQF LE I+Y + + DV Y SF L+ML+P+++ ++ FA A+ D + +
Sbjct: 412 QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTPRII 471
Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
+ RK A PHD+G + PW + N D WKDL FVL VYRD + TG
Sbjct: 472 GYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTG 531
Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
D+ F + WPS+ + + Y+ FD D DG+ EN G PDQT+D W + GISAY GGLW+A
Sbjct: 532 AKDEDFLRECWPSIVLTLQYLKTFDLDKDGIPENSGAPDQTFDDWRLQGISAYCGGLWIA 591
Query: 702 ALQAASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQAD 758
AL+A + +G ED +F Q+++ +Y + LWNG Y+N D+ S S + D
Sbjct: 592 ALEATLKIGAILG----EDTAIFAGWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTD 645
Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSM 817
QL GQ+YAR L +V+ +SAL K+Y LK GK GA NG+ PDG + +
Sbjct: 646 QLCGQFYARLLALPDVVENQYTQSALSKIYQACFLKFHEGKFGAANGLKPDGTPENPQAT 705
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
E+W G+ + + A ++ + G+Q + + G F+TPEA + +R
Sbjct: 706 HPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVRQVYEN---GLQFRTPEAITSVGTFR 762
Query: 878 SLCYMRPLAIWAMQWALTR 896
+ Y+R +AIWA+ LT
Sbjct: 763 ACHYLRAMAIWAIYGILTE 781
Score = 184 bits (466), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 115/387 (29%), Positives = 179/387 (46%), Gaps = 47/387 (12%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS +G+F W + + A QFS+F +G + + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
+P T + W W +K Y ALYPR+W +E + + +++C Q SP+ HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGNYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG------------------------ 250
Y+ESSYP+++F +T +N I+++ TW N++G
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211
Query: 251 ---GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFV 307
G+SE +YN N V ++ + A+A+ + V +
Sbjct: 212 SKWGESE---GNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEVFYLGKWN 268
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
+G+ ++W +GS L +E ++ G A+A T+ P ++
Sbjct: 269 PAGD-----GGEVWDYFAMNGS---LPDIEDETPAQKGEQTATAMAIRFTIRPGKTRKIP 320
Query: 368 FSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
F LAWD P F G YYRRYT F+G + A + R A+ +W I WQ+PILE
Sbjct: 321 FILAWDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILE 380
Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWT 454
L +W+ + LFNELY L GG +WT
Sbjct: 381 RSDLADWFKMALFNELYLLTDGGTLWT 407
>gi|119510228|ref|ZP_01629365.1| hypothetical protein N9414_13977 [Nodularia spumigena CCY9414]
gi|119465077|gb|EAW45977.1| hypothetical protein N9414_13977 [Nodularia spumigena CCY9414]
Length = 814
Score = 241 bits (616), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 148/382 (38%), Positives = 211/382 (55%), Gaps = 25/382 (6%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
IGQF LE ++Y + + DV Y SF L+ LFP+++ S+ R FA A+ D +
Sbjct: 420 IGQFAVLECLDYRWYESLDVRLYGSFGLLHLFPELEKSVIRAFARAIPHSDNHQRVIGYY 479
Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
+ D RKV GA PHD+G + W + N C D WKDL FVLQVYRD +
Sbjct: 480 YTIGADTTTAVRKVAGATPHDLGAPNEHVWEKTNYTCYQDCNLWKDLGSDFVLQVYRDFL 539
Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TG D +F W + + Y+ FD DGDG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 540 LTGADDVQFLADCWAGIVQTLDYLKSFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGL 599
Query: 699 WVAALQAASALAREV--GDRGSED-------YFLFKFQKAKVVYEKLWNGSYFNYDNSGS 749
W+AAL+AA A++ + RG+ED Y ++ Q + EKLWNG YF D+
Sbjct: 600 WLAALEAAIAISDILLTNHRGAEDTEELGQRYRVWLAQSRTIYQEKLWNGQYFRLDS--E 657
Query: 750 SQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD 809
S S + ADQL GQ+YAR GL IV D+A SAL+ VY+ +K G+ GA NG+ PD
Sbjct: 658 SGSDVVMADQLCGQFYARLLGLPDIVPSDRALSALQTVYHACFVKFCNGEFGAANGVRPD 717
Query: 810 GRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
G + ++ E+W+G+ + +AA ++ + D + + + + G F+TPE
Sbjct: 718 GSAENPNATHPLEVWTGINFGLAAFLVQMGMQDEALKLTGAVVQQIYHN---GLQFRTPE 774
Query: 869 AWNTDDQYRSLCYMRPLAIWAM 890
A +R+ Y+R +AIWA+
Sbjct: 775 AITASGTFRASTYLRAMAIWAI 796
Score = 191 bits (486), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 130/392 (33%), Positives = 187/392 (47%), Gaps = 45/392 (11%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSR-SNGQKYSSVLCP 153
HG+PLGG G+G IGRS RG+F W I + + A QFSVF S ++ Q Y+ L
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFQNIPACQFSVFESDGTSAQAYA--LAT 97
Query: 154 KTPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISP 209
+ PE + +W W TYHALYPR+W V+E +L C Q SP
Sbjct: 98 EAPE------NGSLSAWKWYPASTATQSTGTYHALYPRSWFVYENVLQAQL--TCEQFSP 149
Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
I NY+ESSYPV+VF + +N ++++ TW N VG FT + + ++ D
Sbjct: 150 IWAENYQESSYPVAVFNWKAHNPTNAPITLSIMLTWQNMVGW---FTNALKSPEVRIRDD 206
Query: 270 VHAVLLHHR------------TSHQLPPVTFALAAQETDGV---HVSLC------PHFVI 308
V + T + F + D V S C P F +
Sbjct: 207 GSPVYEYQPRLGDSQDNYNCITENTENFSCFCSRVVDDDSVAEGDGSWCIATIKHPQFKV 266
Query: 309 SGNSL---GLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQ 365
N+ T D+W GS L + + + + + IG AIA T+ P
Sbjct: 267 FYNTCWNPAGTGADVWETFAADGS---LTNYQDATPALENTQIGVAIALRFTLQPGETLA 323
Query: 366 VTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPI 425
+ F L+WD P F +G YYRRYT F+G + A IA A+ ++ +W IE WQ+PI
Sbjct: 324 IPFVLSWDFPVTEFAAGINYYRRYTDFFGRGGDHAWAIASTALTQYQTWYKNIENWQQPI 383
Query: 426 LEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
++ + LP+W+ + LFNELY L +GG +W+ S
Sbjct: 384 IDREDLPDWFKMALFNELYDLTSGGTLWSAAS 415
>gi|428308994|ref|YP_007119971.1| bile acid beta-glucosidase [Microcoleus sp. PCC 7113]
gi|428250606|gb|AFZ16565.1| putative bile acid beta-glucosidase [Microcoleus sp. PCC 7113]
Length = 819
Score = 241 bits (616), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 147/390 (37%), Positives = 212/390 (54%), Gaps = 28/390 (7%)
Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM 583
D + IGQF LE ++Y + + DV Y SFAL ML+P++ ++ FA A+ + D +
Sbjct: 429 DDRDPIGQFGVLECLDYRWYESLDVRLYGSFALAMLWPRLDKAVLEAFARAIPIGDKTPR 488
Query: 584 KLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVA 641
+ + RK A PHD+G + PW + N D +WKDL FVLQVYRD V
Sbjct: 489 IIGYNQAQAIRKAADATPHDLGAPNEHPWEQTNYTSYQDCNQWKDLPCDFVLQVYRDFVF 548
Query: 642 TG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
TG D +F W SV +AY+ FD+DGDG+ EN G PDQT+D W + GISAY GGLW
Sbjct: 549 TGANDTEFLWECWSSVVKTLAYLKTFDKDGDGIPENSGAPDQTFDDWRLQGISAYCGGLW 608
Query: 700 VAALQAASALAR-------------EVGDRGS--EDYFLFK--FQKAKVVY-EKLWNGSY 741
+AAL+AA A+ EV D S E L++ ++++ +Y EKLWNG Y
Sbjct: 609 LAALEAAIAMGEILINHYPPTSELIEVPDPESIQETLKLYRQWLEQSQPIYQEKLWNGQY 668
Query: 742 FNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRG 801
+ D+ S S + DQL GQ+YA+ GL IV + +SALK VY+ LK + G
Sbjct: 669 YRLDS--ESGSDIVMTDQLCGQFYAQLLGLPDIVPPECVQSALKSVYDSCFLKFQDAQLG 726
Query: 802 AVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGL 860
A NG+ PDG + + E+W+G+ + +AA ++ + + Q + +
Sbjct: 727 AANGVRPDGSPENPDATHPLEVWTGINFGLAAFLVQQGMKQEALQLTEAVVRQVYEN--- 783
Query: 861 GYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
G F+TPEA +R+ Y+R +AIWA+
Sbjct: 784 GLQFRTPEAITAAGTFRASHYLRAMAIWAV 813
Score = 203 bits (516), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 134/401 (33%), Positives = 185/401 (46%), Gaps = 56/401 (13%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W + K + A QFSVF N K + LC +
Sbjct: 41 HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHSFKSLPACQFSVFEQSENAPKQAYALCTE 100
Query: 155 TPEVLKDTTAAGIGSWDW-----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISP 209
P+ D T +G+W W KG+ TY ALYP++W +E EL C Q SP
Sbjct: 101 PPQ---DGT---LGTWQWYPTSQQAKGNTGTYQALYPQSWFTYENVFQTEL--TCEQFSP 152
Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
I Y+E+SYPV+VF +T +N ++++ TW N VG FT + + K+ D
Sbjct: 153 IWAGCYQETSYPVAVFEWTAHNPTDAPITLSIMLTWQNIVGW---FTNAIKSPQVKVRDD 209
Query: 270 VHAV--------------------------LLHHRTSHQLPPVTFALAAQETDGVHVSLC 303
V +L H V A E LC
Sbjct: 210 GSPVYEYQPRWGESTGSYNQWIVDNFRVGCILDRVRLHDSEAVADFRGASEIQEGEGQLC 269
Query: 304 PHFVISGNSLGL----------TAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIA 353
++ S+ + ++W GS L+ M+ + PG I AAIA
Sbjct: 270 -FATVTNPSMEVFYHTRWNPAGNGAEVWQSFAADGS---LSDMQDETAAAPGEQIAAAIA 325
Query: 354 ASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGS 413
T+ P ++ F LAWD P F G YYRRYT F+G N A +I R A+
Sbjct: 326 IRFTIRPGRTRKIPFILAWDFPVTEFSPGVNYYRRYTDFFGRTGNNAWSIVRTALKHADL 385
Query: 414 WELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
W+ +I++WQ PIL+ + LP W+ + LFNELY L GG +WT
Sbjct: 386 WKEKIQSWQEPILKREDLPHWFKMALFNELYLLADGGTLWT 426
>gi|67922703|ref|ZP_00516206.1| Protein of unknown function DUF608 [Crocosphaera watsonii WH 8501]
gi|67855413|gb|EAM50669.1| Protein of unknown function DUF608 [Crocosphaera watsonii WH 8501]
Length = 804
Score = 241 bits (615), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 142/390 (36%), Positives = 207/390 (53%), Gaps = 28/390 (7%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
+GQF LE ++Y + + DV Y SF L+ML+P ++ ++ FA A+ D + + +
Sbjct: 417 VGQFGVLECLDYCWYESLDVRLYGSFGLLMLWPGLEKAVMEAFARAIPSSDDTIRIIGYN 476
Query: 589 GQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--D 644
RKV GA PHD+G + PW + N D WKDL FVLQVYRD + TG D
Sbjct: 477 QAEGFRKVKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFLLTGSDD 536
Query: 645 KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
F W ++ + Y+ +FD D DG+ EN G PDQT+D W +SGISAY GGLW+AAL+
Sbjct: 537 IDFLWECWAAIPETLNYLKEFDLDNDGIPENSGAPDQTFDDWKLSGISAYCGGLWIAALE 596
Query: 705 AASALAR------------------EVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDN 746
AA + E ++ + Y + Q + + LWNG Y+N D+
Sbjct: 597 AAIKIGEILLENSPNNPQLEPENYPESIEKELDKYNNWLQQSRAIYHSTLWNGEYYNLDS 656
Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
S S + ADQL GQ+YAR GL +V+ + SAL KVY LK GK GA NGM
Sbjct: 657 --KSGSDIVMADQLCGQFYARLLGLSDVVESEYTLSALNKVYEACFLKFQNGKYGAANGM 714
Query: 807 LPDGRV-DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
PDG D +S +E+W+G+ + +A+ +I + + + + + + G F+
Sbjct: 715 KPDGTPEDPNSTHPQEVWTGINFGIASFLIQMGMKNEALKLTETVVKQVYEN---GLQFR 771
Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAMQWALT 895
TPEA YR+ Y+R +AIW + + L+
Sbjct: 772 TPEAITAVGTYRACHYLRAMAIWGVYYQLS 801
Score = 193 bits (490), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 130/389 (33%), Positives = 179/389 (46%), Gaps = 43/389 (11%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W + K + + QFS+F +G + L +
Sbjct: 41 HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHIFKSIPSCQFSIFEQLEDGTAQAYALATE 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
PE D T + W W +K TY ALYP +W +EG E I C Q SPIIP N
Sbjct: 101 PPE---DKT---LSRWSW-YPTEKGTYSALYPCSWYQYEGVFQAE--IYCEQFSPIIPDN 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFTGQ 258
Y+E+SYP+ VF +T+ N ++++ TW N VG G E+ Q
Sbjct: 152 YQETSYPIGVFEWTVKNPTDKPITLSIMLTWQNIVGWFTNAIKSPEITVRDDGSPEYEYQ 211
Query: 259 --------HYNSKTKMNDGVHAVL--LHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI 308
++N + N V +L L Q +A+ V V +
Sbjct: 212 PRWGHSTGNFNQWVQDNFRVGFILNRLQPHEQVQEGEGQICIASVTNPSVEVFNLGKWNP 271
Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
G+ ++W GS L E +EPG I AA+A TV P ++ F
Sbjct: 272 DGD-----GSEVWDYFAMDGS---LPDQEDETPAEPGEQIAAAMAIRFTVKPGRVKKIPF 323
Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
LAWD P F G YYRRYT F+G + N + R A+ W ++E W IL++
Sbjct: 324 ILAWDLPVTEFTPGVQYYRRYTDFFGRNGNNGWAMVRTALKHSDVWRDKVEEWHNSILKN 383
Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGS 457
LP+W + LFNELY L GG +WT S
Sbjct: 384 DNLPDWLKMALFNELYLLTDGGTLWTAAS 412
>gi|162452686|ref|YP_001615053.1| hypothetical protein sce4410 [Sorangium cellulosum So ce56]
gi|161163268|emb|CAN94573.1| hypothetical protein sce4410 [Sorangium cellulosum So ce56]
Length = 782
Score = 241 bits (614), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 149/364 (40%), Positives = 194/364 (53%), Gaps = 19/364 (5%)
Query: 532 FLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQW 591
F LE Y + T DV FY S+AL L+P I R F +V +
Sbjct: 427 FTSLESYIYPFYGTSDVRFYGSWALFSLWPDIDKQEVRQFCDSVTTTRSDR--------- 477
Query: 592 VSRKVLGAVPHDIGICDPWFEV-NAYCLYDTARWKDLNPKFVLQVYRDVVATG--DKKFA 648
+ +G HD G + F NAY D+ WKDLN K VL VYRD TG DK F
Sbjct: 478 --PRAIGTTAHDFGDNESVFRKWNAYTYRDSTNWKDLNSKLVLMVYRDWALTGKTDKAFL 535
Query: 649 KAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASA 708
WP+V +AM + D DGDG+ E++G DQTYD + G +AY GGL++AA +AA
Sbjct: 536 DYCWPAVKMAMQKVKGQDGDGDGLPESNGI-DQTYDDMDLHGNTAYCGGLFIAASEAAKE 594
Query: 709 LAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYAR 767
LA VGD + +++K +E KLW GSY+ D +GS ++ I +DQLAGQWYAR
Sbjct: 595 LATAVGDASLASTYQTWVEQSKGGFESKLWTGSYYKID-TGSRDTNRIMSDQLAGQWYAR 653
Query: 768 ACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVT 827
A GL PIVD A SA K+Y N + GG RG VN M +G VD +S Q+RE W G +
Sbjct: 654 ALGLPPIVDPSHAASAFTKIYENNFKRFDGGTRGVVNVMTANGSVDGTSNQTRECWVGTS 713
Query: 828 YAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAI 887
+ V A MI E LA + + + W GL F+TPEAW + R+ YMR +
Sbjct: 714 WGVVAGMIQEGLAAQAGEIGASLVDTIWKTDGL--WFRTPEAWEGNGSIRAPYYMRATTL 771
Query: 888 WAMQ 891
WA +
Sbjct: 772 WAAK 775
Score = 139 bits (351), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 110/368 (29%), Positives = 162/368 (44%), Gaps = 56/368 (15%)
Query: 96 GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV-SRSNGQKYSSVLCPK 154
G P+GG G+G+I + G F + ++ + + + N F+ ++ G +
Sbjct: 97 GAPIGGFGAGTITWRFDGRFYKGRL--DIGSNDQNVDNSAGFFMYQKAEGGD------AQ 148
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
T + KD G ++TY +L+PR+W + G P + Q SPIIP +
Sbjct: 149 TKRLDKDGLGQG-----------QATYSSLFPRSWVDYHGAAFP-CKAKVEQYSPIIPGD 196
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVL 274
YK++SYPV V+ + I N D+ ++ TW N GG N T G L
Sbjct: 197 YKKTSYPVGVYRWEITNPTGAPCDVAVMLTWKNDHGG---------NGATVQTSGDDVGL 247
Query: 275 LHHRTSHQLPPVT-----FALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGS 329
+ R P + F LA++ GV VS S +S+ + E+ G+
Sbjct: 248 VLTRGGGA--PSSEGQGEFTLASRNAPGVVVSYQ-----SADSVAA----LQSELANDGA 296
Query: 330 FDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRY 389
+ G S+G AIA TV P V LAWD P SG +YR Y
Sbjct: 297 L---------ANTTGGHSLG-AIAFKATVEPGKSVIVPIVLAWDIPVTQAGSGDRWYRAY 346
Query: 390 TKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAG 449
T+ +G NA+ IA +A+ EH W I+ WQ +L+D + PEW LFNELYY
Sbjct: 347 TRHFGRTGNASWTIAAEALAEHEPWLWAIQDWQASVLDDPKYPEWLKGALFNELYYYAIA 406
Query: 450 GAVWTDGS 457
G W G+
Sbjct: 407 GTYWEAGA 414
>gi|427727368|ref|YP_007073605.1| putative bile acid beta-glucosidase [Nostoc sp. PCC 7524]
gi|427363287|gb|AFY46008.1| putative bile acid beta-glucosidase [Nostoc sp. PCC 7524]
Length = 803
Score = 241 bits (614), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 143/382 (37%), Positives = 211/382 (55%), Gaps = 26/382 (6%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
IGQF LE ++Y + + DV Y SF L++LFP+++ S+ R FA A+ D +
Sbjct: 420 IGQFAVLECLDYRWYESLDVRLYGSFGLVLLFPELEKSVMRAFARAIPQGDDTPRVIGYY 479
Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
+ + RK +GA PHD+G + W + N D WKDL FVLQVYRD +
Sbjct: 480 YTIGAESPVAVRKAVGATPHDLGAPNEHVWQKTNYTSYQDCNLWKDLGCDFVLQVYRDYL 539
Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TG D +F + W ++ + Y+ FD DGDG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 540 FTGANDVEFLQDCWDAIVQTLDYVKTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGL 599
Query: 699 WVAALQAASALAREVGDRGSEDYFLFK--------FQKAKVVY-EKLWNGSYFNYDNSGS 749
W+AAL+AA A++ E+ + D L K +K+ +Y EKLWNG Y+ D+ +
Sbjct: 600 WMAALEAAIAIS-EILSQHQPDSALLKQKSIYATWLEKSLPIYQEKLWNGQYYRLDS--N 656
Query: 750 SQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD 809
S S + ADQL GQ+YA L IV D+A SAL+ VY+ LK G+ GA NG+ PD
Sbjct: 657 SGSDVVMADQLCGQFYAALLNLPDIVPGDRAISALRTVYDACFLKFYNGQFGAANGVRPD 716
Query: 810 GRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
G + + E+W+G+ + +AA ++ + D + + + +S G F+TPE
Sbjct: 717 GSPENPQATHPLEVWTGINFGLAAFLVQMGMKDEALRLTEAVVQQIYSN---GLQFRTPE 773
Query: 869 AWNTDDQYRSLCYMRPLAIWAM 890
A +R+ Y+R +AIW +
Sbjct: 774 AITPSGTFRASTYLRAMAIWGV 795
Score = 162 bits (410), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 128/386 (33%), Positives = 188/386 (48%), Gaps = 39/386 (10%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSR-SNGQKYSSVLCP 153
HG+PLGG G+G IGRS RG+F W I + + A QFSVF S ++ Q Y+ L
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHLFQNIPACQFSVFESHGTSSQAYA--LAT 97
Query: 154 KTPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISP 209
PE + +W W + YHALYPR+W V+E +L C Q SP
Sbjct: 98 TAPE------DGSLQAWQWYPASTATQETGAYHALYPRSWFVYENVLQAQL--TCEQFSP 149
Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKM-ND 268
+ HNY+E+SYPV+VF + +N ++++ TW N G FT + + K+ +D
Sbjct: 150 VYAHNYQETSYPVAVFVWKAHNPTNAPITLSIMLTWQNMTGW---FTNALKSPEVKVRDD 206
Query: 269 GVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLC----PHFVISGNSLGLTAKDMWHEI 324
G R + + + ++ G + P G+ +T K E+
Sbjct: 207 GSPVYEYQPRLGESQGNYNYLVESPQSLGCFLGRVGITEPLQEGEGSWCIVTQKHPQVEV 266
Query: 325 KEHG-------------SFDRLNSMETSVTSEPGSS---IGAAIAASVTVPPDSEGQVTF 368
H SF S+ + + P IG AIA T+ P ++ F
Sbjct: 267 FHHTRWNAAGKGDEIWQSFAADGSLPDYIDTTPAGENEQIGTAIAVRFTLQPGENLEIPF 326
Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
LAWD P F +G Y+RRYT F+GT+ N A IA A+ ++ +W+ QI+AWQ+PI+E
Sbjct: 327 VLAWDFPVTEFAAGVNYHRRYTDFFGTNGNNAWAIATLALEQYQTWQQQIQAWQQPIIER 386
Query: 429 KRLPEWYPITLFNELYYLNAGGAVWT 454
+ LP W + LFNELY L +GG +W+
Sbjct: 387 EDLPNWLKMALFNELYDLTSGGTLWS 412
>gi|427716943|ref|YP_007064937.1| glucosylceramidase [Calothrix sp. PCC 7507]
gi|427349379|gb|AFY32103.1| Glucosylceramidase [Calothrix sp. PCC 7507]
Length = 800
Score = 240 bits (613), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 149/383 (38%), Positives = 211/383 (55%), Gaps = 26/383 (6%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
+GQF LE ++Y + + DV Y SFAL+MLFP+++ S+ R FA A+ D +
Sbjct: 415 VGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKSVIRAFARAIPHSDAHQRVIGYY 474
Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
+ D RKV GA PHD+G + W + N C D WKDL FVLQVYRD +
Sbjct: 475 YTIGADTTTAVRKVPGATPHDLGAPNEHVWEKTNYTCYQDCNLWKDLGSDFVLQVYRDFL 534
Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TG D +F WP++ + Y+ FD+D DG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 535 LTGADDVEFLTDCWPAIVETLNYLKTFDQDDDGIPENSGAPDQTFDDWRLQGVSAYCGGL 594
Query: 699 WVAALQAASA-----LAREVGDRGS-----EDYFLFKFQKAKVVYEKLWNGSYFNYDNSG 748
W+AAL+AA A L + + D G+ Y + Q + EKLWNG Y+ D+
Sbjct: 595 WLAALEAAIAISNILLNQNLADLGNLALQKSIYETWLAQSRPIYEEKLWNGQYYRLDS-- 652
Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
S S + ADQL GQ+YAR GL IV D+A SAL+ VY LK G+ GA NG+ P
Sbjct: 653 KSGSDVVMADQLCGQFYARLLGLPDIVASDRALSALQTVYEACFLKFYDGQFGAANGVRP 712
Query: 809 DGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTP 867
DG + + E+W+G+ + +AA ++ + D + + + + G F+TP
Sbjct: 713 DGSPENPQATHPLEVWTGINFGLAAFLVQMGMQDEALKLTQAVVKQIYDN---GLQFRTP 769
Query: 868 EAWNTDDQYRSLCYMRPLAIWAM 890
EA T +R+ Y+R +AIWA+
Sbjct: 770 EAITTVGTFRASTYLRAMAIWAI 792
Score = 201 bits (510), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 134/386 (34%), Positives = 192/386 (49%), Gaps = 38/386 (9%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSR-SNGQKYSSVLCP 153
HG+PLGG G+G IGRS RG+F W I + + A QFSVF S ++ Q Y+ L
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFQNIPACQFSVFESHGTSSQAYA--LST 97
Query: 154 KTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPH 213
+ PE D + AG W W TYHALYPR+W V+E +L C Q SP+ +
Sbjct: 98 QAPE---DGSLAG---WKW-YPASGGTYHALYPRSWFVYENVFQAQL--TCEQFSPVWAN 148
Query: 214 NYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG-------------GDSEFTGQHY 260
NY+E+SYPV++F + +N ++++ TW N VG D Y
Sbjct: 149 NYQETSYPVAIFLWKAHNPTDAPITLSIMLTWENMVGWFTNALKSPEVRIRDDGSPVYEY 208
Query: 261 NSKTKMNDGVHAVLLHH--RTSHQLPPVTFALAAQETDGV-------HVSLCPHFVISGN 311
+ + G + L+ + + L V + E DG H + N
Sbjct: 209 QPRLGESQGNYNQLIENSGQVGCFLSQVDRDASVAEGDGSWCIATLKHPQIEVFHHTRWN 268
Query: 312 SLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLA 371
+G T +D+W GS L + + + + + IG AIA T+ P ++ FSLA
Sbjct: 269 PVG-TGEDVWQSFAADGS---LPNYQDATPAIENTRIGGAIAIRFTLQPGQTIEIPFSLA 324
Query: 372 WDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRL 431
WD P F +G YYRRYT F+G A IA A+ E+ +W QI+ WQ+PILE + L
Sbjct: 325 WDFPVTEFAAGVNYYRRYTDFFGRSGENAWAIASTALQEYQNWRSQIQRWQKPILEREDL 384
Query: 432 PEWYPITLFNELYYLNAGGAVWTDGS 457
P+W+ + LFNELY L +GG +W+ S
Sbjct: 385 PQWFKMALFNELYDLTSGGTLWSAAS 410
>gi|172036524|ref|YP_001803025.1| hypothetical protein cce_1609 [Cyanothece sp. ATCC 51142]
gi|354553307|ref|ZP_08972614.1| Glucosylceramidase [Cyanothece sp. ATCC 51472]
gi|171697978|gb|ACB50959.1| unknown [Cyanothece sp. ATCC 51142]
gi|353555137|gb|EHC24526.1| Glucosylceramidase [Cyanothece sp. ATCC 51472]
Length = 818
Score = 239 bits (609), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 140/385 (36%), Positives = 206/385 (53%), Gaps = 28/385 (7%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
IGQF LE ++Y + + DV Y SF L+ML+P+++ ++ FA A+ D + + +
Sbjct: 431 IGQFGVLESMDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPNSDDTVRIIGYN 490
Query: 589 GQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--D 644
RK GA PHD+G + PW + N D WKDL FVLQVYRD + TG D
Sbjct: 491 QAEGIRKTKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFLLTGSDD 550
Query: 645 KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
+F W ++ + Y+ FD D DG+ EN G PDQT+D W + GISAY G LW+AAL+
Sbjct: 551 IEFLWECWEAIPETLNYLKAFDLDNDGIPENSGAPDQTFDDWELRGISAYCGALWIAALE 610
Query: 705 AASALAREVGDRGSE-----------------DYFLFKFQKAKVVYEK-LWNGSYFNYDN 746
AA + + + + D + Q+++ +Y LWNG Y+ D+
Sbjct: 611 AAIKIGEILLENAPQNPQLEPENYPKSIEKELDKYQQWLQQSRAIYHSTLWNGEYYKLDS 670
Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
S S + ADQL+GQ+YAR GL +V++ A SALKKVY LK GK GA NGM
Sbjct: 671 --ESGSDVVMADQLSGQFYARLLGLPDVVEQQYALSALKKVYEACFLKFQNGKYGAANGM 728
Query: 807 LPDGRV-DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
PDG D +S +E+W+G+ + +A+ +I + D + + + + G F+
Sbjct: 729 KPDGTPEDPNSTHPQEVWTGINFGLASFLIQMGMKDEALKLTEAVVKQVYEN---GLQFR 785
Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAM 890
TPEA +R+ Y+R +AIW +
Sbjct: 786 TPEAITAVGTFRACHYLRAMAIWGV 810
Score = 201 bits (511), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 133/389 (34%), Positives = 181/389 (46%), Gaps = 43/389 (11%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+P+GG GSG IGRS RG+F W + K + + QFS+F NG + L +
Sbjct: 55 HGMPIGGFGSGCIGRSPRGDFNLWHLDGGEHIFKSIPSCQFSIFEQPENGTAQAYALATE 114
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
PE D T + W W +K TY ALYP +W +EG E I C Q SPIIP N
Sbjct: 115 PPE---DNT---LSRWAW-YPTEKGTYSALYPCSWYQYEGVFQAE--IYCEQFSPIIPDN 165
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFTGQ 258
Y+E+SYP+ VF +T+ N ++++ TW N VG G E+ Q
Sbjct: 166 YQETSYPIGVFEWTVKNPTDKPITLSIMLTWQNIVGWFTNAIKSPEITVRDDGSPEYEYQ 225
Query: 259 --------HYNSKTKMNDGVHAVL--LHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI 308
++N + N V +L L Q +A+ V V +
Sbjct: 226 PRWGHSTGNFNQWIQDNFRVGFILNRLQPHQQVQEGEGQICIASVTNPSVEVFYLGKWNP 285
Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
+G+ ++W +GS L E +EPG I AA+A TV P ++ F
Sbjct: 286 NGD-----GSEVWDHFAMNGS---LPDQEDETPAEPGEQIAAAMAIRFTVKPGRVKKIPF 337
Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
LAWD P F G YYRRYT F+G + + R A+ W +IE WQ PIL
Sbjct: 338 ILAWDLPVTEFAQGIQYYRRYTDFFGRNGQNGWAMVRTALKHSDVWREKIEEWQNPILNR 397
Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGS 457
LP+W+ + LFNELY L GG +WT S
Sbjct: 398 DDLPDWFKMALFNELYLLTDGGTLWTAAS 426
>gi|332711224|ref|ZP_08431157.1| putative bile acid beta-glucosidase [Moorea producens 3L]
gi|332350038|gb|EGJ29645.1| putative bile acid beta-glucosidase [Moorea producens 3L]
Length = 815
Score = 238 bits (608), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 145/385 (37%), Positives = 208/385 (54%), Gaps = 28/385 (7%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
IGQF LE ++Y + + DV Y SFAL ML+PK+ ++ FA A+ D + + +
Sbjct: 427 IGQFGVLECLDYRWYESLDVRLYGSFALAMLWPKLDKAVLEAFARAIPTSDETLRIIGYN 486
Query: 589 GQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVV--ATGD 644
RK+ A PHD+G + PW + N D +WKDL FVLQVYRD V + D
Sbjct: 487 QAQAVRKIADATPHDLGAPNEHPWQQTNYTSYQDCNQWKDLASDFVLQVYRDFVFYSATD 546
Query: 645 KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
+F W S+ A+AY+ +FD+DGDG+ EN G PDQT+D W + G+SAY GGLW+AAL+
Sbjct: 547 IEFLWECWSSIVKALAYLKRFDQDGDGIPENYGAPDQTFDDWRLQGVSAYCGGLWLAALE 606
Query: 705 AASALAR-------------EVGDRGS-----EDYFLFKFQKAKVVYEKLWNGSYFNYDN 746
+A A+ EV + S + Y + Q + EKLWNG Y+ D+
Sbjct: 607 SAIAIGEILTKNYPPTSPFVEVPELDSIQDTIDCYRQWLAQAMPIYQEKLWNGQYYQLDS 666
Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
S S + ADQL GQ+YA+ GL IV + SALK VY+ LK G+ GAVNG+
Sbjct: 667 --ESGSDVVMADQLCGQFYAQLLGLPDIVPPECTLSALKTVYDACFLKFHQGQLGAVNGV 724
Query: 807 LPDGRV-DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
PDG D + E+W+G+ + +AA +I + + D + + + G F+
Sbjct: 725 RPDGTPEDPDATHPMEVWTGINFGLAAFLIQQGMKDEALRITEAVVRQVYDH---GLQFR 781
Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAM 890
TPEA +R+ Y+R + IWA+
Sbjct: 782 TPEAITAAGTFRASHYLRAMGIWAV 806
Score = 201 bits (510), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 137/397 (34%), Positives = 184/397 (46%), Gaps = 55/397 (13%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W + K + A QFSVF + + LC +
Sbjct: 41 HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHIFKSLPACQFSVFEQPEHSLAQAYALCTQ 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKST---------YHALYPRAWTVHEGEPDPELRIVCR 205
PE K +GSW+W KS Y+ALYPR+W ++ EL C
Sbjct: 101 PPEDGK------LGSWEWYPTQGKSEENSSEVSGHYYALYPRSWFTYQNVFQTEL--TCE 152
Query: 206 QISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTK 265
Q SPI Y+ESSYPV+VF +T +N T ++++ TW N VG FT N + +
Sbjct: 153 QFSPIWAGCYQESSYPVAVFEWTAHNPTDTPITVSIMLTWQNMVGW---FTNGIKNPEVR 209
Query: 266 MNDGVHAV------------------LLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFV 307
+ D V + H+R L V +E +G LC V
Sbjct: 210 VRDDGSPVYEYQPRWGDSTGNLNQWIVDHYRVGFVLDRVRLGDDIKEGEG---QLCLATV 266
Query: 308 ISG----------NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVT 357
+ N +G A ++W GS L + + PG I AIA T
Sbjct: 267 TNPSMEVFYHGRWNPVGDGA-EVWDTFATDGS---LVDHQDETPAAPGEQIACAIAIRFT 322
Query: 358 VPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQ 417
+ P ++ F LAWD P F G YYRRYT F+G + N + R A+ H W Q
Sbjct: 323 LRPGKTRKIPFILAWDFPVTEFKLGINYYRRYTDFFGRNGNNGWPMVRTALKHHDLWREQ 382
Query: 418 IEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
IE+WQ PIL + LP W+ + LFNELY L GG +WT
Sbjct: 383 IESWQEPILNQEDLPNWFKMALFNELYLLTDGGTLWT 419
>gi|12005896|gb|AAG44660.1|AF258662_1 AD035 [Homo sapiens]
Length = 367
Score = 238 bits (608), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 153/421 (36%), Positives = 215/421 (51%), Gaps = 71/421 (16%)
Query: 242 LFTWTNSVGGDSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDG 297
+F+ N +GG + G +N + + V +LLHH T LP P T A+AA+ T
Sbjct: 1 MFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTAA 57
Query: 298 VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVT 357
V+ I+ T + +W ++ + G D + S ++ G I A+ S
Sbjct: 58 TTVT-----HITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSK 110
Query: 358 VPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWEL 416
+ P + ++ FSLAWD P + F G+ +YRRYT+F+G +AA ++ A+ + WE
Sbjct: 111 LRPRGQCRLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEE 170
Query: 417 QIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWS 476
+I AWQ P+L+D+ LP WY LFNELY+L GG VW +
Sbjct: 171 RISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE--------------------- 209
Query: 477 QSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLY 534
+LE +L G N+ L+ + G+F Y
Sbjct: 210 --------------------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGY 238
Query: 535 LEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSR 594
LEG EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A + D ++ + L G
Sbjct: 239 LEGQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPV 298
Query: 595 KVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVW 652
K +PHDIG D PW VNAY ++DTA WKDLN KFVLQVYRD TGD+ F K +W
Sbjct: 299 KRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMW 358
Query: 653 P 653
P
Sbjct: 359 P 359
>gi|428770169|ref|YP_007161959.1| glucosylceramidase [Cyanobacterium aponinum PCC 10605]
gi|428684448|gb|AFZ53915.1| Glucosylceramidase [Cyanobacterium aponinum PCC 10605]
Length = 802
Score = 238 bits (608), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 148/406 (36%), Positives = 213/406 (52%), Gaps = 30/406 (7%)
Query: 516 SFGTNLLQDGEEN--IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAA 573
S G +L EN +GQF LE I+Y + + DV Y SF LIM++P++ SI F+
Sbjct: 401 SEGGSLWTAATENDPVGQFGVLECIDYRWYESLDVRLYGSFPLIMMWPRLDKSIMEAFSR 460
Query: 574 AVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKF 631
A+ D + + + RK GA PHD+G + PW + N D WKDL F
Sbjct: 461 AIPTEDKTPRIIGYNQTQAIRKAKGATPHDLGAPNEHPWEKSNYTSYQDCNLWKDLGCDF 520
Query: 632 VLQVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVS 689
VL VYRD + TG D+ F W SV + Y+ FD DGDG+ EN G PDQT+D W +
Sbjct: 521 VLLVYRDYLFTGGLDQDFLWECWDSVVETLHYVKTFDLDGDGIPENSGAPDQTFDDWRLQ 580
Query: 690 GISAYSGGLWVAALQAASALAREV-----------------GDRGSEDYFLFKFQKAKVV 732
GISAY GGLW+A L+AA A+ + G + + F +A+ +
Sbjct: 581 GISAYCGGLWIAGLEAAIAIGNTLIANPPMNPSLQPDDYPQGIKDAIAVFQQWLTQARSL 640
Query: 733 Y-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYN 791
Y E LWNG Y+ D+ S S + ADQL GQ+YA+ GL +V+ +S L+KVY+
Sbjct: 641 YHETLWNGEYYRLDS--ESGSDVVMADQLCGQFYAQLLGLPDVVESKYVKSTLRKVYDAC 698
Query: 792 VLKVMGGKRGAVNGMLPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGI 850
LK GK G NG+ PDG+ V + E+W+G+ Y + A MI + G + A +
Sbjct: 699 FLKFHDGKFGIANGVKPDGKPVKENDTHPLEVWTGINYGIVAFMILNGMKQEGLRVAETV 758
Query: 851 YEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
+ + G F+TPEA +R+ Y+R +A+WA+ ++L +
Sbjct: 759 VKQVYEN---GLQFRTPEAITAVGTFRASHYLRAMAVWAIYYSLEK 801
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 129/387 (33%), Positives = 187/387 (48%), Gaps = 41/387 (10%)
Query: 93 SSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLC 152
++HG+PLGG G G +GR GEF W I + + A QFSVF +
Sbjct: 38 ANHGMPLGGFGGGCVGRGVNGEFNLWHIDGGNHIYQSLPACQFSVFEQVEGEPPQVYAMS 97
Query: 153 PKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIP 212
+P T + SW W +G K TY ALYPR+W ++G + R++C Q SPI
Sbjct: 98 TNSP------TDGSLSSWAWYPQG-KGTYSALYPRSWFDYQGVF--KSRLICEQFSPIWA 148
Query: 213 HNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG------------------GDSE 254
+Y+ESSYP++VF ++++N ++++FTW NSVG + E
Sbjct: 149 GSYQESSYPLAVFEWSVHNPTDKEITLSIMFTWQNSVGWFTNAIKSPTVKVRDDGSPEYE 208
Query: 255 FTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDG-------VHVSLCPHFV 307
+ + +S +N + + R + QE +G + SL +
Sbjct: 209 YRPKWGDSTGNLNQWIQD---NFRVGCLFDRIRMEEQPQEGEGQWAISSITNPSLEVFYH 265
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
N G A D+W +GS L + +EPG I A+A TV P ++
Sbjct: 266 SRWNPKGDGA-DIWDYFAMNGS---LPDYQDETPAEPGEQIAGAMAIRFTVKPGKTKKIP 321
Query: 368 FSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
F LAWD P F G TYYRRYT F+G + A I R A+ H W+ +I+ WQ+PIL+
Sbjct: 322 FILAWDFPVTEFAQGITYYRRYTDFFGRNGRNAWAIVRTALKHHDMWQNRIKEWQKPILD 381
Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWT 454
LPEW + LFNELY L+ GG++WT
Sbjct: 382 RGDLPEWLKMALFNELYLLSEGGSLWT 408
>gi|78185056|ref|YP_377491.1| hypothetical protein Syncc9902_1489 [Synechococcus sp. CC9902]
gi|78169350|gb|ABB26447.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 823
Score = 238 bits (607), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 223/855 (26%), Positives = 358/855 (41%), Gaps = 144/855 (16%)
Query: 94 SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCP 153
+HG+PLGG+G+G GR+ G W + + QFS++ + + + ++
Sbjct: 44 NHGMPLGGLGAGCFGRAPDGNTNLWHLDGGEHWFGTLPDCQFSLWEKQGDDVRAHALAVA 103
Query: 154 KTPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELR--IVCRQI 207
+ + AA + +W W K TY A YP +WT ++G R ++C
Sbjct: 104 PEGDASRPDAAASLPAWSWYPASTNKRCTGTYAARYPLSWTHYDGV----FRSGVLCEAF 159
Query: 208 SPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------G 251
SPI+P +Y+ SSYPV+VF + + N +++L+ +W N+VG G
Sbjct: 160 SPILPGDYQRSSYPVAVFCWQLANPTDQPLELSLMVSWRNTVGWFTNTDASAEVHFRDDG 219
Query: 252 DSEFT-------GQHYNSKTKMNDGVHAVLLHHRTSH--QLPPVTFALAA-QETDGVHVS 301
E G ++ G+ VL+ R+S Q + LA + GV V
Sbjct: 220 SPEHNYSPAIGRGAGQRNRAHSESGLTGVLMEGRSSEPIQEGEGQWCLAVPDDLAGVEVL 279
Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
C + SG+ D+W G R+ + S G AA+ + P
Sbjct: 280 RCSRWDPSGD-----GSDIWTPFSAEG---RIPESDNDRESRAGEQASAAMVVKFILAPG 331
Query: 362 SEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAW 421
++ + SW+L I A+
Sbjct: 332 ESREIPVVI------------------------------------------SWDLPITAF 349
Query: 422 QRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLK 481
+ +R +++ N W+D + + W Q L+
Sbjct: 350 ASGCADRRRYTDFFGSDGRNAAAIAAEALRDWSDWRGRIEA------------WQQPVLE 397
Query: 482 RIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYL 541
R A+ RM+ + ++Y + S + QD G+F LE ++Y
Sbjct: 398 R---------KALPEPVRMA-LFNELYDLCSGGSLWTAATAQD---PYGRFGVLECLDYA 444
Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL---LDDGQW---VSRK 595
+ + DV Y S AL+ L+P++ ++ R F+ A+ HD ++ + G+ RK
Sbjct: 445 WYESLDVRLYGSLALLQLWPELDKAVLRSFSRAIPAHDATQRPIGWYFTQGRGRVEADRK 504
Query: 596 VLGAVPHDIGICDPW-FEVNAYCLY-DTARWKDLNPKFVLQVYR--DVVATG-DKKFAKA 650
V GA PHD+G + F+ Y Y D WKDL FVLQV+R + TG D F
Sbjct: 505 VAGATPHDLGAPNELPFDATNYTAYQDCNLWKDLASDFVLQVWRTFQLAPTGEDLNFLAD 564
Query: 651 VWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAA-------- 702
WPS A+ Y+ FD + DG+ +N G PDQT+D W + G+SAY G LW+AA
Sbjct: 565 CWPSAVRALDYLKGFDVNDDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAMA 624
Query: 703 --LQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQ 759
LQ L V R F ++++ +++ LWNG Y+ D S + + ADQ
Sbjct: 625 QRLQLDLGLDTTVEQR----RFGQWLEQSRSNFDRLLWNGEYYEID--AESGTPVVMADQ 678
Query: 760 LAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR-VDMSSMQ 818
L G +YAR GL +V + A S L V + + GG G NG+ DG +D +
Sbjct: 679 LCGDFYARLLGLPSVVSDANALSTLNVVRDACFERFEGGTLGVANGLRRDGTPLDPNGTH 738
Query: 819 SREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRS 878
E+W+G+ + +A+ +D + +SG G F+TPEA + +R+
Sbjct: 739 PLEVWTGINFGIASYFRLMGQSDTALAITGAVVNQVYSG---GLQFRTPEAITAVNTFRA 795
Query: 879 LCYMRPLAIWAMQWA 893
Y+R +AIW + WA
Sbjct: 796 CHYLRAMAIWGL-WA 809
>gi|195397814|ref|XP_002057523.1| GJ18180 [Drosophila virilis]
gi|194141177|gb|EDW57596.1| GJ18180 [Drosophila virilis]
Length = 869
Score = 238 bits (606), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 185/623 (29%), Positives = 290/623 (46%), Gaps = 108/623 (17%)
Query: 77 KGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFS 136
+G+R ++D + + +GVP+GG+G GSIGR Y GEF R+Q+ P + E V ANQF
Sbjct: 13 EGRRVYMDYYYMENGKQIYGVPIGGIGGGSIGRGYAGEFCRFQMRPGIYEYNVVHANQFI 72
Query: 137 VFVSRSNGQK-YSSVLCPKTPEVLKDTTAAG--------------------------IGS 169
V + G + S+L + + K T + +
Sbjct: 73 VTIKDHKGCTIFQSLLSRCSTKSFKAKTNNNDAEADEADAQKTKCQLPNCSSRPRQPLSA 132
Query: 170 WDWNLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVF 225
W N++ + +Y LYPRAWT H G +R++CRQISP+IPH+YKESS P +VF
Sbjct: 133 WHSNIEDSRCSYTGLYPRAWTEYDLSHYG-----VRLICRQISPVIPHDYKESSLPCAVF 187
Query: 226 TYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPP 285
+++ N +++ FT+ N G + + S+ V + + P
Sbjct: 188 VWSVQNVCDQERKVSITFTFKNGTGNKKQDADGNAESQLISEGNAKGVAIRQNIADM--P 245
Query: 286 VTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPG 345
++ LA + + ++ CP F +GN + +W ++KEHG + ET + +
Sbjct: 246 CSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHGQLSEQPTAETLRSKD-- 298
Query: 346 SSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG-KTYYRRYTKFYGTHQNAAANIA 404
IG A+ A + + P + ++ F LAWD P++ F +T+ R YTK++ + I
Sbjct: 299 --IGVAVCAQLALKPQAYHELEFVLAWDMPKIQFPRKLQTHTRYYTKYFDDSGESGPRIC 356
Query: 405 RDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLV 464
A+ + SWE RL + + + N+ DG P
Sbjct: 357 EYALKHYASWE--------------RLIDAWQRPILND------------DGLP------ 384
Query: 465 TIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL-LQ 523
DW + I ++ I + + +SS G +L
Sbjct: 385 ---------DWYK----------------CAIFNQLYFISDGGTIWLKCDSSLGQSLAFD 419
Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM 583
D G+F YLEG EY M+NTYDVHFY+S AL L+P +Q+S+Q DF A+
Sbjct: 420 DPRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELNDTR 479
Query: 584 KLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVA 641
K+L DG+ + RKV VPHD+G D P+ +N Y ++D WKDLN KFVLQVYRD
Sbjct: 480 KMLYDGKVMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYV 539
Query: 642 TGDKKFAKAVWPSVYVAMAYMDQ 664
+ A+A S + ++ ++D+
Sbjct: 540 LNELAQAQADNASKFSSIEFIDK 562
Score = 196 bits (499), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 107/261 (40%), Positives = 156/261 (59%), Gaps = 12/261 (4%)
Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
+ KA++ + M ++D+D DG+IEN PDQTYD+W + G SAY GLW+AALQ
Sbjct: 608 YLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCAGLWLAALQTM 667
Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
S +A + + +K K + EKLWNGSY+ +D S + +I ADQL G WY
Sbjct: 668 SVMATLLDQPNDCLRYQDILEKGKHSLEEKLWNGSYYRFDLS-HNHRDTIMADQLCGHWY 726
Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
++CG I ++ R+ALK++Y+ NV+ G GA NG + + G VD S++
Sbjct: 727 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPSKPGHVDNSNI 786
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
Q+ E+W GV YA+AA+MI E + + FQTA G+Y+ G+ Y +TPEA + +YR
Sbjct: 787 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTISQRIGMNY--ETPEALYGEKRYR 844
Query: 878 SLCYMRPLAIWAMQWALTRPK 898
S+ YMRPL+IW+MQ A R +
Sbjct: 845 SIGYMRPLSIWSMQVAWERRR 865
>gi|22297737|ref|NP_680984.1| hypothetical protein tlr0193 [Thermosynechococcus elongatus BP-1]
gi|22293914|dbj|BAC07746.1| tlr0193 [Thermosynechococcus elongatus BP-1]
Length = 806
Score = 238 bits (606), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 146/378 (38%), Positives = 216/378 (57%), Gaps = 19/378 (5%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
+ +GQF LE ++Y + + DV Y SFAL+ L+P+++ S+ R FA A+ DP+ +++
Sbjct: 425 RDPVGQFAVLECLDYRWYESLDVRLYGSFALLQLWPELEKSVMRAFARAIPTADPT-LRI 483
Query: 586 L--------DDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQV 635
+ + RK+ AVPHD+G + PW + N D WKDL FVL V
Sbjct: 484 IGYFYRGDPETAYKAPRKLANAVPHDLGAPNEHPWEKTNYTAYQDCNLWKDLASDFVLLV 543
Query: 636 YRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISA 693
YRD + TG D FA+ WP+V A+ ++ QFD+DGDG+ EN G PDQTYD W + G+SA
Sbjct: 544 YRDFLFTGGTDLNFARECWPAVVAALDHLKQFDQDGDGLPENGGAPDQTYDDWKLQGVSA 603
Query: 694 YSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSS 753
Y GGLW+AAL+AA AL + E Y + Q ++ LWNG Y+ D S S
Sbjct: 604 YCGGLWLAALEAAIALGTLLQQPQVEIYRQWLSQARPRYHQLLWNGEYYRLDT--GSGSD 661
Query: 754 SIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVD 813
I ADQL GQ+YA+ GL+ IV D AL+K+Y+ LK G+ GA NG+LP+G+ +
Sbjct: 662 VIMADQLCGQFYAQLLGLVDIVPPDCCDRALRKIYDTCFLKFHNGQFGAANGLLPNGQPE 721
Query: 814 MS-SMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNT 872
+ E+W+G+ + +AA + + D ++ A + + G F+TPEA
Sbjct: 722 NPHATHPLEVWTGINFGLAAFLWQRGMIDEAWRLAEVVVRQIYEN---GLQFRTPEAITA 778
Query: 873 DDQYRSLCYMRPLAIWAM 890
+ +R+ Y+RP+AIWA+
Sbjct: 779 NGTFRACMYLRPMAIWAL 796
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 127/395 (32%), Positives = 176/395 (44%), Gaps = 59/395 (14%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
+G+PLGG G+G IGRS G+F W + + A QFS+F Q Y+ PK
Sbjct: 56 YGMPLGGFGAGCIGRSSAGDFNLWHVDGGEHIFGTLPACQFSLFEQGEQTQAYALGSAPK 115
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
+ SW W G K TY YPR+W V+EG +I C Q SPI+PHN
Sbjct: 116 D---------GRLSSWQWYPAG-KGTYAVRYPRSWFVYEGVF--RAQITCEQFSPILPHN 163
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFTGQ 258
Y+E+SYPV+VF +T N S ++L+ +W N+VG G +T
Sbjct: 164 YQETSYPVAVFLWTFSNPTDQSLTLSLMLSWQNTVGWFCNTTPSSAIAIRDDGSPVYT-- 221
Query: 259 HYNSKTKMNDGVHAVLLH---------HRTSHQLPPVTFALAAQETDGVHVSLCPHFVIS 309
Y + +DG L+ R H PP QE DG +L P +
Sbjct: 222 -YTPRWGQSDGNFNELIQTESFQGWRLRRMPHPNPP-------QEGDGEWAALIPTGL-- 271
Query: 310 GNSLGLT-------AKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDS 362
G G + +W GS +N + + AA A ++ P
Sbjct: 272 GEFFGCSRWQPEGDGAHLWQSFSVDGSLPFVNDPTPAAAG---EQVAAAFALRFSLAPGE 328
Query: 363 EGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQ 422
Q+ LAWD P F G YYRRYT F H A +A A+ + +W+ QI WQ
Sbjct: 329 RKQIPVVLAWDFPVTEFGKGVIYYRRYTDFCDRHGTNAVTLAAQALAAYATWQEQIRTWQ 388
Query: 423 RPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
PIL P+W+ + L NELY L++GG++W+ S
Sbjct: 389 APILSHPDWPDWFKMALCNELYVLSSGGSLWSAAS 423
>gi|428210868|ref|YP_007084012.1| putative bile acid beta-glucosidase [Oscillatoria acuminata PCC
6304]
gi|427999249|gb|AFY80092.1| putative bile acid beta-glucosidase [Oscillatoria acuminata PCC
6304]
Length = 820
Score = 237 bits (605), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 149/400 (37%), Positives = 213/400 (53%), Gaps = 37/400 (9%)
Query: 523 QDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK 582
QD EE G+F LE I+Y + + DV Y SFAL+ML+P+++ S+ R FA A+ D +
Sbjct: 415 QDAEEPRGKFAVLECIDYRWYESLDVRLYGSFALLMLWPELEKSVMRGFARAIPTGDDTP 474
Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
+ + RKV A PHD+G + PW + N D WKDL F L VYRD +
Sbjct: 475 RIIGYNQAAAIRKVTDATPHDLGAPNEHPWVKTNYTSYQDCNLWKDLPCDFALLVYRDFL 534
Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TG D +F WPS+ A+AY+ FD DGDG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 535 LTGGDDYEFLWDCWPSMVQAIAYLKTFDTDGDGIPENGGAPDQTFDDWPMRGVSAYCGGL 594
Query: 699 WVAAL--------------------------QAASALAREVGDRGSEDYFLFKFQKAKVV 732
W+AAL Q S L + + + ++++ +
Sbjct: 595 WIAALQAAIAIGEVILNNPPHSGVFTDTIYEQKRSQLLEPATIQATLSTYQDWLEQSRPL 654
Query: 733 YE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYN 791
Y+ KLWNG+Y+ D+ S S + ADQL GQ+YAR GL IV D A SALK VY
Sbjct: 655 YQHKLWNGTYYRLDS--ESGSEVVMADQLCGQFYARLLGLADIVPIDCAESALKTVYESC 712
Query: 792 VLKVMGGKRGAVNGMLPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGI 850
LK G+ GA NG+L DG V+ + E+W+G+ + +AA ++ + + GF+ +
Sbjct: 713 FLKFHNGEFGAANGVLIDGSPVNPDATHPLEVWTGINFGLAAFLLQMGMKEEGFKLTETV 772
Query: 851 YEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
+ + G F+TPEA +R+ Y+R +AIWA+
Sbjct: 773 IQQVYEN---GLQFRTPEAITAVGTFRASHYLRAMAIWAV 809
Score = 180 bits (457), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 121/385 (31%), Positives = 180/385 (46%), Gaps = 37/385 (9%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W + + + A QFS+F N Q+++ LC +
Sbjct: 41 HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHTFQSLPACQFSIFEETQN-QRHAYALCTE 99
Query: 155 TPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
P + + SW W + TYHALYPR+W V++ +L C Q SPI
Sbjct: 100 PP------SDGSLNSWQWYPASTEENPTGTYHALYPRSWFVYQNVLKAQL--TCEQFSPI 151
Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG-------------GDSEFTG 257
+P Y+E SYP+++F +T YN +++L TW N VG D
Sbjct: 152 LPGKYQECSYPLAIFEWTAYNPTDEPITLSILLTWENVVGWFTNAIKSPTVKVRDDGSPV 211
Query: 258 QHYNSKTKMNDGVHAVLLH--HRTSHQLPPVTFALAAQETDG---VHVSLCPHFVISGNS 312
Y SK + G + HR + QE +G + + P N+
Sbjct: 212 YEYQSKWGQSQGNSNRYIEDFHRIGCIMAGGHLGEVPQEGEGELAIATATNPFLDTFKNT 271
Query: 313 L---GLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFS 369
+ K++W + GS L+ + + ++ G IG A+A T+ P ++ F
Sbjct: 272 RWNPTGSGKEVWEQFSHDGSL--LDCTDETPAAK-GERIGVALAVRFTIRPGKTRKIPFI 328
Query: 370 LAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDK 429
LAWD P F +G YRRYT F+G H + ++ R A+ W I WQ+ IL+ +
Sbjct: 329 LAWDFPVTEFAAGVWEYRRYTDFFGRHGKNSWSMIRTALKHSEMWRENIITWQQSILDRE 388
Query: 430 RLPEWYPITLFNELYYLNAGGAVWT 454
L + + LFNELY L GG +WT
Sbjct: 389 DLSDSLKMALFNELYILTDGGTLWT 413
>gi|288901049|gb|ADC68237.1| hypothetical protein [Dendrobium nobile]
Length = 207
Score = 237 bits (605), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 123/193 (63%), Positives = 152/193 (78%), Gaps = 1/193 (0%)
Query: 644 DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAAL 703
D F + VWP+V AM YM+QFDRDGDG+IENDGFPDQTYD W+V GISAY G LW+A+L
Sbjct: 6 DMAFGREVWPAVCAAMDYMNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLASL 65
Query: 704 QAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAG 762
QAA+A+A +GD + KF AK +E KLWNGSYFNYD+ SS S SIQADQLAG
Sbjct: 66 QAAAAMAHRLGDHAYAEKCTIKFLTAKHAFESKLWNGSYFNYDSGNSSNSKSIQADQLAG 125
Query: 763 QWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREI 822
QWYA + GL P+ DE+K RS L+K++++NV+KV GG+ GAVNGM P+G+VD S MQSREI
Sbjct: 126 QWYAASSGLPPLFDENKIRSTLQKIFDFNVMKVRGGRIGAVNGMHPNGKVDESCMQSREI 185
Query: 823 WSGVTYAVAASMI 835
W+GVTYA++A+MI
Sbjct: 186 WTGVTYALSATMI 198
>gi|434400779|ref|YP_007134783.1| Glucosylceramidase [Stanieria cyanosphaera PCC 7437]
gi|428271876|gb|AFZ37817.1| Glucosylceramidase [Stanieria cyanosphaera PCC 7437]
Length = 804
Score = 237 bits (604), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 149/385 (38%), Positives = 210/385 (54%), Gaps = 28/385 (7%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
IGQF LE ++Y + + DV Y SF L+ML+ K+ S+ FA A+ D + + +
Sbjct: 417 IGQFGVLECMDYRWYESLDVRLYGSFGLMMLWSKLDKSVLEAFARAIPKGDDTPRIIGYN 476
Query: 589 GQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--D 644
RK GA PHD+G + PW N D WKDL FVLQVYRD V TG D
Sbjct: 477 RAQAIRKAAGATPHDLGAPNEHPWQATNYTSYQDCNLWKDLPSDFVLQVYRDYVLTGSKD 536
Query: 645 KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
+F W S+ V +AY+ +FD DGDG+ EN G PDQT+D W + GISAY GGLW+AAL+
Sbjct: 537 SEFLWECWDSIVVTLAYLKEFDLDGDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALE 596
Query: 705 AASALAREV-----------------GDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDN 746
AA A+ + + G +++ Q+A+ +Y + LWNG Y+ D+
Sbjct: 597 AAIAIGQILMTHPPMNPNLQPPDYPQGIETVINHYRVWLQQARSLYHDTLWNGEYYRLDS 656
Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
S S + ADQL GQ+YAR GL +V+E +SALKK+Y LK GK GA NG+
Sbjct: 657 --QSNSDVVMADQLCGQFYARLLGLPDVVEEQYTKSALKKIYQACFLKFHNGKYGAANGV 714
Query: 807 LPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
LPDG V+ + E+W+G+ + +AA M+ + FQ + + + G F+
Sbjct: 715 LPDGSPVNPNDTHPLEVWTGINFGLAAFMLQMGMKAEAFQLTETVVKQVYEN---GLQFR 771
Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAM 890
TPEA +R+ Y+R +AIW +
Sbjct: 772 TPEAITAVGTFRASHYLRAMAIWGI 796
Score = 202 bits (513), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 132/384 (34%), Positives = 186/384 (48%), Gaps = 40/384 (10%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS+RG+F W + + + A QFSVF S S + + LC +
Sbjct: 42 HGMPLGGFGAGCIGRSHRGDFNLWHLDGGEHVFRSLPACQFSVFES-SEAKTQAYALCTE 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
PE D T + W W K +K YHALYPR+W E + + ++ C Q SPI N
Sbjct: 101 PPE---DGT---LSRWQWYPK-EKGNYHALYPRSW--FEYQDVFQSQLTCEQFSPIWAEN 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVL 274
Y+ESSYPV++F + ++N ++++ TW N+VG FT + K+ D V
Sbjct: 152 YQESSYPVAIFDWNVHNPTDKPITLSIMLTWQNTVGW---FTNAIKSPTVKVRDDGSPVY 208
Query: 275 LH------------------HRTSHQLPPVTFALAAQETDG------VHVSLCPHFVISG 310
+ +R L + QE +G V F +S
Sbjct: 209 EYQPKWGDSTGNFNQWIQDNYRVGCLLNRIQPHAEIQEGEGQIAIATVTNPTLEVFYLSR 268
Query: 311 NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSL 370
D+W GS L + ++PG I +AIA TV P Q+ F L
Sbjct: 269 WHPDGDGADVWDYFAADGS---LPDRQDETPADPGEQIASAIAVRFTVNPGKTRQIPFVL 325
Query: 371 AWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKR 430
AWD P F G YYRRYT F+G N A +I + A+ W+ I+AWQ PIL+ +
Sbjct: 326 AWDLPVTEFAQGIKYYRRYTDFFGRTGNNAWSIVKTALKHSDWWQESIQAWQEPILKRQD 385
Query: 431 LPEWYPITLFNELYYLNAGGAVWT 454
LP+W+ + LFNELY L GG +W+
Sbjct: 386 LPDWFKMALFNELYLLTQGGTLWS 409
>gi|428773649|ref|YP_007165437.1| glucosylceramidase [Cyanobacterium stanieri PCC 7202]
gi|428687928|gb|AFZ47788.1| Glucosylceramidase [Cyanobacterium stanieri PCC 7202]
Length = 800
Score = 237 bits (604), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 141/393 (35%), Positives = 209/393 (53%), Gaps = 28/393 (7%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
++ +GQF LE I+Y + + DV Y SF L+ LFP+++ S+ F+ A+ D + +
Sbjct: 413 DDPVGQFGVLECIDYRWYESLDVRLYGSFGLLQLFPRLEKSVMEAFSRAIPTADDTPRII 472
Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
+G RK GA PHD+G + PW + N D WKDL FVL VYRD + TG
Sbjct: 473 GYNGASAIRKAKGATPHDLGAPNEHPWIKSNYTSYQDCNLWKDLGSDFVLLVYRDYLYTG 532
Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
D+ F W SV + Y+ FD D DG+ EN G PDQT+D W + GISAY GGLW+
Sbjct: 533 KNDQDFLWECWDSVVETLHYVKGFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIC 592
Query: 702 ALQAASALAREVGD-----------------RGSEDYFLFKFQKAKVVY-EKLWNGSYFN 743
AL+AA A+ + D +G+ F +++ +Y + LWNG Y+N
Sbjct: 593 ALEAAIAIGTILLDNPPVNPALEKENAQENIQGAIALFRQWLHQSRAIYHDTLWNGEYYN 652
Query: 744 YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAV 803
D+ S S + ADQL GQ+YA+ L +V+++ + L K+Y+ LK G G
Sbjct: 653 LDS--KSGSDVVMADQLCGQYYAQLLKLPDVVEKEYVTATLNKIYDACYLKFHNGTLGIA 710
Query: 804 NGMLPDGRVDMSS-MQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
NG+ PDG + + EIW+G+ Y + A MI + + G + A + +S G
Sbjct: 711 NGVKPDGSPEKENDTHPLEIWTGINYGIVAFMILNGMKEEGLKVAETVVNQVYSN---GL 767
Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALT 895
F+TPEA + +R+ Y+R +AIWA+ AL+
Sbjct: 768 QFRTPEAITAVNTFRASHYLRAMAIWAIYQALS 800
Score = 187 bits (474), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 121/402 (30%), Positives = 185/402 (46%), Gaps = 48/402 (11%)
Query: 84 DPFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
+P+ R+ ++ +HG+PLGG G+G IGR+ G+F W I + + A QF+VF
Sbjct: 24 NPYTVRYESNLDDGPNHGMPLGGFGAGCIGRAPSGDFNLWHIDGGNHTFETLPACQFAVF 83
Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
L PE + SW W +G K TY ALYPR+W +EG
Sbjct: 84 EHSEGEDAQVYALSTNAPE------DGSLSSWSWYPQG-KGTYSALYPRSWFSYEGVFKS 136
Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG-------- 250
+L VC Q SP+ +Y+ESSYP++VF ++I+N ++++ +W N+VG
Sbjct: 137 QL--VCEQFSPVWAGSYQESSYPMAVFEWSIHNPTDKPITMSIMVSWQNTVGWFTNAIKS 194
Query: 251 ----------GDSEFTGQHYNSKTKMN----DGVHAVLLHHRTSHQLPPV----TFALAA 292
+ E+ + NS +N DG L R + P +A+A
Sbjct: 195 PTVKVRDDGSPEYEYKPRWGNSTGNLNQWIQDGFRVGCLFDRIRLEDEPTEGEGQWAIAT 254
Query: 293 QETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAI 352
+ V + G+ ++W +GS L + +EPG I A+
Sbjct: 255 TTNPTLEVFYHTRWNPKGD-----GSEIWDRFAMNGS---LPDFQDETPAEPGEQIAGAM 306
Query: 353 AASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHG 412
A T+ P ++ F L WD P F G Y+RRYT F+G + ++ R + H
Sbjct: 307 AIRFTIRPGKTKKIPFILGWDFPVTEFAQGVNYFRRYTDFFGRNGRNVWSMIRTGLKHHD 366
Query: 413 SWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
W+ +++ WQ PIL LP+W + LFNELY L GG +WT
Sbjct: 367 MWQNKVKMWQEPILSRSNLPDWLKMALFNELYLLTQGGTLWT 408
>gi|443325334|ref|ZP_21054034.1| putative bile acid beta-glucosidase [Xenococcus sp. PCC 7305]
gi|442795063|gb|ELS04450.1| putative bile acid beta-glucosidase [Xenococcus sp. PCC 7305]
Length = 808
Score = 236 bits (602), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 142/385 (36%), Positives = 205/385 (53%), Gaps = 28/385 (7%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
+GQF E ++Y + + DV Y SF L+ML+ K+ ++ FA A+ D + +
Sbjct: 421 VGQFCVCECMDYRWYESLDVRLYGSFGLMMLWSKLDKAVLEAFARAIPKSDDKPRIIGYN 480
Query: 589 GQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVAT-GDK 645
G RK GA PHD+G + PW N D +WKDL FVLQVYRD + T GD
Sbjct: 481 GATAIRKAAGATPHDLGAPNEHPWEATNYTSYQDCNQWKDLPSDFVLQVYRDYLFTEGDN 540
Query: 646 -KFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
+F W SV +AY+ +FD D DG+ EN G PDQT+D W + GIS+Y GGLW+AAL+
Sbjct: 541 TEFLWECWDSVVCTLAYLKEFDEDNDGIPENSGAPDQTFDDWRLQGISSYCGGLWIAALE 600
Query: 705 AASALAR------------EVGD--RGSE----DYFLFKFQKAKVVYEKLWNGSYFNYDN 746
AA + + D +G E DY + + ++ ++ LWNG Y+ D+
Sbjct: 601 AAIKIGEILIANPTKNPELQTADFPQGIEKTISDYREWLQRSRELYHDTLWNGEYYRLDS 660
Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
S S + ADQL GQ+YAR GL +V+ +SAL K+Y+ LK GK GA NG+
Sbjct: 661 --KSNSEVVMADQLCGQFYARLLGLPDVVENQYTKSALSKIYDACFLKFHDGKFGAANGV 718
Query: 807 LPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
LPDG V+ E+W+G+ + +AA M+ + D F+ + + G F+
Sbjct: 719 LPDGSAVNPDDTHPLEVWTGINFGLAAFMVQMGMKDEAFRLTETVVRQVYEN---GLQFR 775
Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAM 890
TPEA +R+ Y+R +AIW +
Sbjct: 776 TPEAITAVGTFRASHYLRAMAIWGV 800
Score = 200 bits (509), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 129/388 (33%), Positives = 185/388 (47%), Gaps = 39/388 (10%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYS-SVLCP 153
HG PLGG G+G IGRS +G+F W + + A QFSVF S + G+ + ++ C
Sbjct: 37 HGAPLGGFGAGCIGRSPKGDFNLWHLDGGEHLLNSIPACQFSVFESVAGGETQAFAMACE 96
Query: 154 KTPEVLK-DTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIP 212
E+ K + W+W + ++ Y ALYPR+W ++G EL C Q SPI
Sbjct: 97 DDGEIGKWGDREESLSRWNWYPQ-EQGVYAALYPRSWYEYQGVFASEL--TCEQFSPIWK 153
Query: 213 HNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFT 256
NY+E+SYPV++F +T +N ++++ TW N+VG G E+
Sbjct: 154 ENYQETSYPVAIFDWTAHNPTDKPITLSIMLTWQNTVGWFTNAIKSPEVKVRDDGSPEYE 213
Query: 257 GQ--------HYNSKTKMNDGVHAVL--LHHRTSHQLPPVTFALAAQETDGVHVSLCPHF 306
Q +YN + + V +L + + Q A+A V V +
Sbjct: 214 YQPKWGDSTGNYNQWIQDSYRVGCILNRVQPHSEIQEGEGQIAIATVANPAVEVFYLSRW 273
Query: 307 VISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQV 366
G+ ++W GS L E ++PG I AIA VT+PP ++
Sbjct: 274 NPEGD-----GSEVWDSFAADGS---LPDKEDETPADPGEQIACAIAVRVTIPPGKTKKI 325
Query: 367 TFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
F LAWD P F G YYRRYT F+G N +I + A+ W QIEAWQ PIL
Sbjct: 326 PFILAWDFPVTEFAQGINYYRRYTDFFGRTGNNVWSIVKTALKHSDWWREQIEAWQNPIL 385
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWT 454
E LP+W+ + LFNELY L GG +W+
Sbjct: 386 ERPDLPDWFKMALFNELYLLTQGGTLWS 413
>gi|186681980|ref|YP_001865176.1| hypothetical protein Npun_R1544 [Nostoc punctiforme PCC 73102]
gi|186464432|gb|ACC80233.1| protein of unknown function DUF608 [Nostoc punctiforme PCC 73102]
Length = 801
Score = 236 bits (601), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 147/383 (38%), Positives = 207/383 (54%), Gaps = 26/383 (6%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
IGQF LE ++Y + + DV Y SFAL+MLFP+++ S+ R FA A+ D +
Sbjct: 419 IGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKSVMRAFARAIPQGDDTPRIIGYY 478
Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
M + + RKV GA PHD+G + W + N D WKDL FVLQVYRD +
Sbjct: 479 MTIKAESPIAVRKVAGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFL 538
Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TG D +F W ++ + Y+ FD DGDG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 539 LTGADDVEFLADCWNAIVQTLDYLKTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGL 598
Query: 699 WVAALQAASA-----LAREVGDRGS-----EDYFLFKFQKAKVVYEKLWNGSYFNYDNSG 748
W+AAL+AA A L ++GD G Y + Q + EKLWNG Y+ D+
Sbjct: 599 WLAALEAAIAISDLLLTHKLGDLGGLAVQKSIYETWLQQSLPIYQEKLWNGQYYQLDS-- 656
Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
S S + ADQL GQ+YAR L IV D+A SALK VY+ LK G+ G NG+ P
Sbjct: 657 QSGSDVVMADQLCGQFYARLLDLPDIVPSDRALSALKTVYDACFLKFCNGEFGTANGVRP 716
Query: 809 DGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTP 867
DG + + E+W+G+ + +AA ++ + D + + + + G F+TP
Sbjct: 717 DGSPENPKATHPLEVWTGINFGLAAFLVQMGMKDEALRLTQAVVQQIYEN---GLQFRTP 773
Query: 868 EAWNTDDQYRSLCYMRPLAIWAM 890
EA +R+ Y+R +AIW +
Sbjct: 774 EAITAAGTFRASTYLRAMAIWGI 796
Score = 193 bits (491), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 134/392 (34%), Positives = 193/392 (49%), Gaps = 46/392 (11%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSR-SNGQKYSSVLCP 153
HG+PLGG G+G IGRS RG+F W I K V A QFSVF S ++ Q Y+ L
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFKNVPACQFSVFESNGTSTQAYA--LST 97
Query: 154 KTPEVLKDTTAAGIGSWDW--NLKGDKST--YHALYPRAWTVHEGEPDPELRIVCRQISP 209
+ P + +W W L + T YHALYPR+W V+E +L C Q SP
Sbjct: 98 QGP------NDGALKAWQWYPTLPSTEGTGNYHALYPRSWFVYENIFQAQL--TCEQFSP 149
Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
+ NY+E+SYPV++F + +N ++++ TW N VG FT + + ++ D
Sbjct: 150 VWAGNYQETSYPVAIFLWNAHNPTNAPITLSIMLTWQNMVGW---FTNALKSPEVRVRDD 206
Query: 270 VHAVLLHH-------RTSHQL--PPVTFAL--------AAQETDGV-------HVSLCPH 305
V + +Q+ P F A QE DG H +
Sbjct: 207 GSPVYEYQPRWGESQGNYNQIVENPQQFGCLLSRVGSDALQEGDGTWCIATLKHPQVEIF 266
Query: 306 FVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQ 365
+ N+ G T ++W + GS N ++ + E G +GAA+A T+ P +
Sbjct: 267 YHTRWNAEG-TGDEVWQSFAKDGSLS--NYIDANPVGE-GEQLGAAMALRFTLQPGETLE 322
Query: 366 VTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPI 425
+ F LAWD P F +G YYRRYT F+ N A IA A+ E+ +W QI+ WQ+PI
Sbjct: 323 IPFVLAWDLPITEFAAGVNYYRRYTDFFDRSGNNAWAIASTALQEYQTWRSQIQTWQQPI 382
Query: 426 LEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
L+ + LP W+ + LFNELY L +GG +W+ S
Sbjct: 383 LDREDLPNWFKMALFNELYDLTSGGTLWSAAS 414
>gi|334116939|ref|ZP_08491031.1| Glucosylceramidase [Microcoleus vaginatus FGP-2]
gi|333461759|gb|EGK90364.1| Glucosylceramidase [Microcoleus vaginatus FGP-2]
Length = 818
Score = 235 bits (600), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 148/390 (37%), Positives = 212/390 (54%), Gaps = 20/390 (5%)
Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM 583
D + GQF LE I+Y + + DV Y SFAL+ML+P ++ S+ FA A+ D +
Sbjct: 434 DERDPKGQFAVLECIDYRWYESLDVRLYGSFALLMLWPDLEKSVVLAFARAISAGDDTPR 493
Query: 584 KLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVA 641
+ + RK +GA PHD+G + PW + N D WKDL FVLQVYRD +
Sbjct: 494 IIGWNQASAIRKAVGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDFLL 553
Query: 642 TG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
TG D +F + WP++ A+AY+ FD DGDG+ EN G PDQT+D W + GISAY GGLW
Sbjct: 554 TGGTDSEFLQECWPAIVEALAYLKTFDEDGDGIPENSGAPDQTFDDWQMRGISAYCGGLW 613
Query: 700 VAALQAASALAREVGDRGSED----------YFLFKFQKAKVVYEKLWNGSYFNYDNSGS 749
+AAL+AA A+ + D + Y + Q EKLWNG Y+ D+
Sbjct: 614 LAALEAAIAIGEILEDAAPQIIPNPQAKIALYHSWLLQARPSYQEKLWNGQYYRLDS--E 671
Query: 750 SQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD 809
S S + ADQL GQ+YA+ GL IV + A SAL+ VY LK G+ GA NG++ +
Sbjct: 672 SNSEVVMADQLCGQFYAKLLGLPDIVPPECAVSALETVYESCFLKFNEGEFGAANGVMLN 731
Query: 810 GRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
G + ++ E+W+G+ + +AA ++ + D F+ + + G F+TPE
Sbjct: 732 GSPENPNATHPLEVWTGINFGLAAFLVQMGMEDKAFKLTDAVVNQIYEN---GLQFRTPE 788
Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
A +R+ Y+R +AIWA+ LT K
Sbjct: 789 AITAGGTFRASHYLRAMAIWAIYGVLTNFK 818
Score = 188 bits (477), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 134/410 (32%), Positives = 189/410 (46%), Gaps = 70/410 (17%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W I + + A QFSVF + G+K + L +
Sbjct: 42 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFRSLPACQFSVF-EETQGKKQAFALSTE 100
Query: 155 TPEVLKDTTAAGIGSWDW---------------------NLKGDKSTYHALYPRAWTVHE 193
P T + SW W +YHALYPR+W V+E
Sbjct: 101 PP------TDGSLSSWKWYPCEGRRKKEEGRREEEERTGEESNSSGSYHALYPRSWFVYE 154
Query: 194 GEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS 253
+L C Q SPI+P+NY+ESSYPV++F + +N ++++ TW N VG
Sbjct: 155 NVFHAQLS--CEQFSPILPNNYQESSYPVAMFEWVAHNPTDAPITLSIMLTWENIVGW-- 210
Query: 254 EFTGQHYNSKTKMND----------------GVHAVLLH--HRTSHQLPPVTFALAAQET 295
FT + + K + D G +L+ HR + +T QE
Sbjct: 211 -FTNANKSPKVMVRDDGSPFYDYQPRWGESAGNLNLLVEDFHRIGCVMTRLTAEDEPQEG 269
Query: 296 DG-----------VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEP 344
DG V V + +GN +D+W G F ++ ET+ +
Sbjct: 270 DGQWAIATITNPAVEVFYQTRWNPAGN-----GEDIWRNFSAEG-FLLDDNDETAAAA-- 321
Query: 345 GSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIA 404
G + AIA T+ P ++ F +AWD P F SG YYRRYT F+G + A I
Sbjct: 322 GEQLACAIAVRFTLRPGKTRKIPFIIAWDLPVTEFASGVWYYRRYTDFFGRNGKNAWPII 381
Query: 405 RDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
R A+ +W IEAWQ+PILE K L + + + LFNELY L GG +W+
Sbjct: 382 RTAMKHSDTWRENIEAWQKPILERKDLSDSFKMALFNELYALTDGGTLWS 431
>gi|147788373|emb|CAN61188.1| hypothetical protein VITISV_019327 [Vitis vinifera]
Length = 550
Score = 235 bits (599), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 108/217 (49%), Positives = 152/217 (70%), Gaps = 3/217 (1%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
D A P W+R+L++ L +F++ + E +++ +G+R+ IREEA++G++A IDPF
Sbjct: 31 DSAAPPEQAWRRRLNSHANILKEFSVTFTEAIKMIRLGIRLWSYIREEASQGRKAPIDPF 90
Query: 87 IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
+ ++S GVPLGG+GSGSI R +RGEF+ WQI P C+ P++ANQFS+F+SR G
Sbjct: 91 TRETCKPSASQGVPLGGMGSGSISRGFRGEFRHWQIVPGTCDASPIMANQFSIFISREGG 150
Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
+KY+SVL P E L + GI SW WNL G STYHAL+PRAWT+++GEPDPEL++
Sbjct: 151 NKKYASVLAPGQHEGLGKSGDQGISSWGWNLSGQHSTYHALFPRAWTIYDGEPDPELKVS 210
Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADIT 240
CRQISP IPHNY++SS P +VF YT + + S+ T
Sbjct: 211 CRQISPFIPHNYRDSSLPTAVFVYTPLTTFEVSSKFT 247
>gi|428317101|ref|YP_007114983.1| Glucosylceramidase [Oscillatoria nigro-viridis PCC 7112]
gi|428240781|gb|AFZ06567.1| Glucosylceramidase [Oscillatoria nigro-viridis PCC 7112]
Length = 809
Score = 234 bits (598), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 148/390 (37%), Positives = 213/390 (54%), Gaps = 20/390 (5%)
Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM 583
D + GQF LE I+Y + + DV Y SFAL+ML+P ++ S+ FA A+ D +
Sbjct: 425 DERDPKGQFAVLECIDYRWYESLDVRLYGSFALLMLWPDLEKSVVLAFARAISAGDDTPR 484
Query: 584 KLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVA 641
+ + RK +GA PHD+G + PW + N D WKDL FVLQVYRD +
Sbjct: 485 IIGWNQASAIRKAVGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDFLL 544
Query: 642 TG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
TG D +F + WP++ A+AY+ FD DGDG+ EN G PDQT+D W + GISAY GGLW
Sbjct: 545 TGGTDCEFLQECWPAIVEALAYLKTFDEDGDGIPENSGAPDQTFDDWQMRGISAYCGGLW 604
Query: 700 VAALQAASALAREVGDRGSED----------YFLFKFQKAKVVYEKLWNGSYFNYDNSGS 749
+AAL+AA A+ + D E Y + Q EKLWNG Y+ D+
Sbjct: 605 LAALEAAIAIGEILEDAAPEIIPNPQAKIALYHSWLLQARPSYQEKLWNGQYYRLDS--E 662
Query: 750 SQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD 809
S S + ADQL GQ+YA+ GL IV + A SAL+ VY LK G+ GA NG++ +
Sbjct: 663 SNSEVVMADQLCGQFYAKLLGLPDIVPPECAVSALETVYESCFLKFNDGEFGAANGVMLN 722
Query: 810 GRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
G + ++ E+W+G+ + +AA ++ + + F+ + + + G F+TPE
Sbjct: 723 GSPENPNATHPLEVWTGINFGLAAFLVQMGMEEKAFKLTDAVVKQIYEN---GLQFRTPE 779
Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
A +R+ Y+R +AIWA+ LT K
Sbjct: 780 AITAGGTFRASHYLRAMAIWAIYGVLTNFK 809
Score = 192 bits (489), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 135/402 (33%), Positives = 189/402 (47%), Gaps = 62/402 (15%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W I + + A QFSVF + G+K + L +
Sbjct: 41 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFRSLPACQFSVF-EETQGKKQAFALSTE 99
Query: 155 TPEVLKDTTAAGIGSWDW-------------NLKGDKSTYHALYPRAWTVHEGEPDPELR 201
P T + SW W + TYHALYPR+W V+E +L
Sbjct: 100 PP------TDGSLSSWKWYPSEGRRKKEEGRREESTSGTYHALYPRSWFVYENVFQAQLS 153
Query: 202 IVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYN 261
C Q SPI+P+NY+ESSYPV++F + +N ++++ TW N VG FT + +
Sbjct: 154 --CEQFSPILPNNYQESSYPVAMFEWVAHNPTDAPITLSIMLTWENIVGW---FTNANKS 208
Query: 262 SKTKMND----------------GVHAVLLH--HRTSHQLPPVTFALAAQETDG------ 297
K K+ D G +L+ HR + +T QE DG
Sbjct: 209 PKVKVRDDGSPFYDYQPRWGESAGNLNLLVEDFHRIGCVMTRLTADDEPQEGDGQWAIAT 268
Query: 298 -----VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAI 352
V V + +GN D+W G F + ET+ + G + AI
Sbjct: 269 ITNPAVEVFYQTRWNPAGN-----GDDIWRNFSAEG-FLLDDRDETAAAA--GEQLACAI 320
Query: 353 AASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHG 412
A T+ P ++ F +AWD P F SG YYRRYT F+G + A I R A+
Sbjct: 321 AVRFTIRPGKTRKIPFIIAWDLPVTEFASGVWYYRRYTDFFGRNGKNAWPIIRTAMKHSD 380
Query: 413 SWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
+W IEAWQ+PILE + L + + + LFNELY L GG +W+
Sbjct: 381 TWRENIEAWQKPILERQDLSDSFKMALFNELYALTDGGTLWS 422
>gi|443311911|ref|ZP_21041533.1| putative bile acid beta-glucosidase [Synechocystis sp. PCC 7509]
gi|442777986|gb|ELR88257.1| putative bile acid beta-glucosidase [Synechocystis sp. PCC 7509]
Length = 804
Score = 234 bits (597), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 141/393 (35%), Positives = 205/393 (52%), Gaps = 31/393 (7%)
Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM 583
D + +GQF LE ++Y + + DV Y SF L+ML+ K+ SI R +A A+ D +
Sbjct: 409 DERDPVGQFAVLECLDYRWYESLDVRLYGSFGLLMLWSKLDKSILRAYARAISTSDRTLR 468
Query: 584 KLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVA 641
++ + RKV A PHD+G + PW + N D WKDL FVLQVYRD V
Sbjct: 469 EIGYNQATGLRKVANATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLPSDFVLQVYRDFVL 528
Query: 642 TG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
TG D + W ++ + Y+ FD D DG+ EN G PDQT+D W + G+SAY GGLW
Sbjct: 529 TGSTDWELLWDCWGAIVQTLNYLKTFDLDNDGIPENSGAPDQTFDDWRLQGVSAYCGGLW 588
Query: 700 VAALQAASALAREVGDRGSED---------------------YFLFKFQKAKVVYEKLWN 738
+A L++A A+ + + E Y + Q V EKLWN
Sbjct: 589 LAGLESAIAIGKTLISYPQEHPLGEMLTSPLNQPPIEETIGIYQTWLEQSRPVYQEKLWN 648
Query: 739 GSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGG 798
G Y+ D+ S S + ADQL GQ+YAR G+ IV+ D+ S+LK VY LK G
Sbjct: 649 GQYYKLDS--ESGSDVVMADQLCGQFYARLLGVPDIVESDRTVSSLKTVYEACFLKFHNG 706
Query: 799 KRGAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSG 857
GA NG+ DG + + E+W+G+ + +AA ++ D+ + F+ + E ++
Sbjct: 707 AFGAANGLKIDGSPENPDATHPLEVWTGINFGLAAFLLQMDMKEEAFKLTQTVVEQIYNN 766
Query: 858 TGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
G F+TPEA +R+ Y+RP+AIWA+
Sbjct: 767 ---GLQFRTPEAITAVGTFRACHYLRPMAIWAV 796
Score = 186 bits (472), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 133/399 (33%), Positives = 202/399 (50%), Gaps = 46/399 (11%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS +G+F W I K + A QFSVF +SN + LC +
Sbjct: 41 HGMPLGGFGAGCIGRSSKGDFNLWHIDGGEHTFKSIPACQFSVFEQQSN---LAYALCTE 97
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
PE + +W W + +K TYHALYPR+W V+E +L C Q SPI+P N
Sbjct: 98 APE------DGSLKAWQWYPQ-NKGTYHALYPRSWFVYENVFLSQL--TCEQFSPILPDN 148
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVL 274
Y+E+SYPV++F + +N ++++ TW N VG FT + + ++ D V
Sbjct: 149 YRETSYPVAIFVWKAHNPTDQPLTLSIMLTWQNMVGW---FTNSIKSPQVRVRDDGSPVY 205
Query: 275 L-----------HHRTSHQLPPVTFALAAQETDGVHV----------SLCPHFVISG--- 310
+++ + LA +D V + S+ P + IS
Sbjct: 206 EYEPRLGQSAGNYNKLVETADSLGCILARVGSDDVALEGIGQWVIATSINPKYQISHHTR 265
Query: 311 -NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFS 369
N G T D+W GS N ++ + ++ G + +AIA T+ P ++ F
Sbjct: 266 WNPTG-TGADVWQSFSTDGSLP--NYIDETPIAD-GEQLASAIAIKFTLEPGESLEIPFV 321
Query: 370 LAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDK 429
LAWD P F G YYRRYT F+G + + A IA +A+ + +W+ Q++AWQ+PI++
Sbjct: 322 LAWDFPITEFAPGVNYYRRYTDFFGRNGSNAEAIAFNALQHYQTWQQQVQAWQQPIIDRT 381
Query: 430 RLPEWYPITLFNELYYLNAGGAVWT--DGSPPVHSLVTI 466
LP+W+ + LFNELY L +GG +W+ D PV +
Sbjct: 382 DLPDWFKMALFNELYDLTSGGTLWSAADERDPVGQFAVL 420
>gi|434389621|ref|YP_007100232.1| putative bile acid beta-glucosidase [Chamaesiphon minutus PCC 6605]
gi|428020611|gb|AFY96705.1| putative bile acid beta-glucosidase [Chamaesiphon minutus PCC 6605]
Length = 818
Score = 234 bits (597), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 144/400 (36%), Positives = 211/400 (52%), Gaps = 38/400 (9%)
Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK- 582
D + +GQF LE ++Y + + DV Y SF++++L+PK++ SI FA A+ D +K
Sbjct: 418 DDRDPVGQFGVLECLDYRWYESLDVRLYGSFSVLLLWPKLEKSILIAFARAIDTEDTNKR 477
Query: 583 ----------------MKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARW 624
+ D RK L A PHD+G + PW + N C D W
Sbjct: 478 LIGYYVQTAFGYEDRPQNVDRDAMMAPRKTLHATPHDLGAPNEHPWEKTNYTCYQDCNLW 537
Query: 625 KDLNPKFVLQVYRDVVATGDK--KFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQT 682
KDL FVLQVYR + TG K +F W ++ +AY+ QFD D DG+IEN G PDQT
Sbjct: 538 KDLGSDFVLQVYRAYIFTGKKDLEFLAECWTAIVDTLAYLKQFDLDNDGIIENSGAPDQT 597
Query: 683 YDTWSVSGISAYSGGLWVAALQAASALAR--------EVGDRGSEDYFLFK--FQKAKVV 732
+D W + GISAY GGLW+AAL+AA ++ E+ + +E ++ K +
Sbjct: 598 FDDWRLQGISAYCGGLWLAALEAAISIGNILTQQYHGELTPKPAEVIPQYQAWLDSGKPI 657
Query: 733 Y-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYN 791
Y ++LWNGSY+ D S S + ADQL GQ+Y + L IV D A++AL+ VY
Sbjct: 658 YQQQLWNGSYYRIDT--GSGSEVVMADQLCGQFYTQLLELDDIVPPDCAKTALQTVYQSC 715
Query: 792 VLKVMGGKRGAVNGMLPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGI 850
L GGK GA NG+ DG + E+W+G+ + +AA ++ + + + A I
Sbjct: 716 FLNFQGGKFGAANGVKIDGSPSNPKDTHPLEVWTGINFGIAAFLVQMGMQEEALKLAETI 775
Query: 851 YEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
E +S G F+TPEA + +R+ Y+R +AIW +
Sbjct: 776 VEQVYSN---GLQFRTPEAITSAGTFRASHYLRAMAIWGI 812
Score = 182 bits (461), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 124/398 (31%), Positives = 181/398 (45%), Gaps = 47/398 (11%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS+RGEF W I K + A QFSVF +NG + + +
Sbjct: 41 HGMPLGGFGAGCIGRSHRGEFNLWHIDGGEHVFKNLSACQFSVFEQLANGDSQAYAMSTQ 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKS-----TYHALYPRAWTVHEGEPDPELRIVCRQISP 209
P + W W + + +Y ALYPR+W +E +L C Q SP
Sbjct: 101 AP------GDGSLNRWQWYPQTTPTGQSTGSYQALYPRSWYSYENVFQAQL--TCEQFSP 152
Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
I NY+E+SYP++VF +T +N ++++ TW N+VG F +SK ++ D
Sbjct: 153 IWADNYQETSYPIAVFEWTAHNPTDKPITLSIMLTWENTVGW---FANTSPSSKVEIRDD 209
Query: 270 VHAV------------------LLHHRTSHQLPPVTFALAAQETDG-------VHVSLCP 304
V + ++R L + E +G + SL
Sbjct: 210 GSPVYDYQPRIGESEGNFNQWIVDNYRVGCLLDKAHKSDIPSEGEGQWSIASIANPSLEV 269
Query: 305 HFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEG 364
+ N +G ++W GS ++ + E I AI TV P
Sbjct: 270 FYHSRWNPIG-NGGEIWDSFAADGSLPDIDDESEAKQRE---RIATAITIRFTVKPGQTK 325
Query: 365 QVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRP 424
++ F LAWD P F G YYRRYT F+G + + R A+ +W+ QI WQ+P
Sbjct: 326 KIPFILAWDFPVTEFGKGINYYRRYTDFFGRSGENSWAMVRTALKHSDTWQEQIINWQQP 385
Query: 425 ILEDKRLPEWYPITLFNELYYLNAGGAVWT--DGSPPV 460
IL+ LP+W+ + LFNELY L GG +W+ D PV
Sbjct: 386 ILDRSDLPDWFKMALFNELYLLTDGGTIWSAADDRDPV 423
>gi|16330256|ref|NP_440984.1| hypothetical protein sll1775 [Synechocystis sp. PCC 6803]
gi|383321997|ref|YP_005382850.1| hypothetical protein SYNGTI_1088 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383325166|ref|YP_005386019.1| hypothetical protein SYNPCCP_1087 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383491050|ref|YP_005408726.1| hypothetical protein SYNPCCN_1087 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384436317|ref|YP_005651041.1| hypothetical protein SYNGTS_1088 [Synechocystis sp. PCC 6803]
gi|451814414|ref|YP_007450866.1| hypothetical protein MYO_110980 [Synechocystis sp. PCC 6803]
gi|1652745|dbj|BAA17664.1| sll1775 [Synechocystis sp. PCC 6803]
gi|339273349|dbj|BAK49836.1| hypothetical protein SYNGTS_1088 [Synechocystis sp. PCC 6803]
gi|359271316|dbj|BAL28835.1| hypothetical protein SYNGTI_1088 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359274486|dbj|BAL32004.1| hypothetical protein SYNPCCN_1087 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359277656|dbj|BAL35173.1| hypothetical protein SYNPCCP_1087 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407958168|dbj|BAM51408.1| hypothetical protein BEST7613_2477 [Bacillus subtilis BEST7613]
gi|451780383|gb|AGF51352.1| hypothetical protein MYO_110980 [Synechocystis sp. PCC 6803]
Length = 817
Score = 234 bits (596), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 142/394 (36%), Positives = 205/394 (52%), Gaps = 26/394 (6%)
Query: 518 GTNLLQDGEEN--IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAV 575
G L EN +GQF E ++Y + + DV Y SF L+M +PK++ ++ FA A+
Sbjct: 404 GGTLWTAATENDPVGQFGVCECMDYRWYESLDVRLYGSFGLLMHWPKLEQAVMLAFARAI 463
Query: 576 MMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVL 633
DP + + + RKV A PHD+G + PW N D WKDL FVL
Sbjct: 464 PTEDPKERVIGYNLSLAPRKVKNATPHDLGAPNEHPWERSNYTAYQDCNLWKDLGSDFVL 523
Query: 634 QVYRD--VVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
VYRD ++ D +F WPS+ A+ Y+ FD D DG+ EN G PDQT+D W + GI
Sbjct: 524 LVYRDFLLLPDADGEFLGECWPSIVAALDYLKGFDLDNDGIPENSGAPDQTFDDWRLQGI 583
Query: 692 SAYSGGLWVAALQAASALAREVGDRGSE--DYFLFKFQKAKVVYE------------KLW 737
SAY GGLW+AAL+AA + + + + + D + Q A YE LW
Sbjct: 584 SAYCGGLWIAALEAALVMGQYLLENPPQQTDLDAMEIQTAIARYETWLAQARSLYQDTLW 643
Query: 738 NGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMG 797
NG Y+N D+ S S + ADQL GQ+YAR L +V+ A AL KVY+ LK
Sbjct: 644 NGEYYNLDS--QSGSKVVMADQLCGQFYARLLNLPDVVEHHYAEKALAKVYDTCFLKFAH 701
Query: 798 GKRGAVNGMLPDGRV-DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWS 856
G+ GA NG+LPDG + + E+W+G+ + +AA +I + + + + ++
Sbjct: 702 GELGAANGLLPDGSPQNPNDTHPLEVWTGINFGLAAFLIQMGMKEEALKMTETVVNQVYN 761
Query: 857 GTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
G F+TPEA +R+ Y+R +AIWA+
Sbjct: 762 N---GLQFRTPEAITAVGTFRASHYLRAMAIWAV 792
Score = 198 bits (504), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 132/401 (32%), Positives = 190/401 (47%), Gaps = 48/401 (11%)
Query: 85 PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
P+ R+ ++ HG+PLGG G+G IGRS G+F W + K + A QFSV+
Sbjct: 26 PYTVRYASNLDDGPDHGMPLGGFGAGCIGRSPTGKFNLWHLDGGNHIFKNLPACQFSVYE 85
Query: 140 SRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPE 199
+ + + + + PE + W W K +K Y ALYPR+W ++ E
Sbjct: 86 QVAGEEAKAYAMATEAPE------DGSLDRWQWYPK-EKGEYSALYPRSWYEYKDVFKSE 138
Query: 200 LRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG--------- 250
L +C Q SP+ NY+ESSYP++VF +T++N ++++ TW NSVG
Sbjct: 139 L--ICEQFSPVWGGNYQESSYPLAVFDWTVHNPTDKPITLSIMLTWQNSVGWFTNAIKSP 196
Query: 251 -------GDSEFTGQHY------NSKTKMNDGVHAVLLHHRTS-HQL---PPVTFALAAQ 293
G E+ Q N + DG L +R H+ +A+
Sbjct: 197 TVKLRDDGSPEYEYQPRWGDSTGNFNQWIQDGFRVGCLFNRVQPHEALREGEGQMCIASV 256
Query: 294 ETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIA 353
V V + SG+ D+W +GS L ++ +EPG I AI
Sbjct: 257 SNPSVEVFYHNRWNPSGD-----GADVWDWFAGNGS---LPDIQDETAAEPGEQIAGAIC 308
Query: 354 ASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGS 413
TV P ++ F LAWD P F G TYYRRYT FYG + ++ R A+ +
Sbjct: 309 LRFTVRPGKTKKIPFLLAWDLPITEFAQGVTYYRRYTDFYGRNGKNVWSMIRTALKHSDT 368
Query: 414 WELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
W +IEAWQ PILE+ RLP+W ++LFNELY L GG +WT
Sbjct: 369 WRKKIEAWQAPILENPRLPDWLKMSLFNELYLLTQGGTLWT 409
>gi|17230308|ref|NP_486856.1| hypothetical protein alr2816 [Nostoc sp. PCC 7120]
gi|17131910|dbj|BAB74515.1| alr2816 [Nostoc sp. PCC 7120]
Length = 804
Score = 234 bits (596), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 141/385 (36%), Positives = 209/385 (54%), Gaps = 28/385 (7%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
IGQF LE ++Y + + DV Y SF L+ LFP+++ ++ R FA A+ D +
Sbjct: 420 IGQFAVLECLDYRWYESLDVRLYGSFGLLQLFPELEKAVMRAFARAIPQGDDTPRVIGYY 479
Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
+ + RK GA PHD+G + W + N D WKDL FVLQVYRD +
Sbjct: 480 YTIGAESPIAVRKTPGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFL 539
Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TG D +F + W ++ + Y+ FD DGDG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 540 LTGADDVQFLRDCWDAIVETLDYVKTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGL 599
Query: 699 W----VAALQAASALAREVGDRGSEDYFLFK-------FQKAKVVY-EKLWNGSYFNYDN 746
W AA+ + L + D ++++ L++ K+ +Y EKLWNG Y+ D+
Sbjct: 600 WLAALAAAIAISDILLQNHQDSATKEHLLYQKSTYETWLTKSLPIYQEKLWNGKYYRLDS 659
Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
S S + ADQL GQ+YA L IV D+A SAL+ VY+ LK G+ GA NG+
Sbjct: 660 --ESGSDVVMADQLCGQFYANLLELPDIVPSDRAISALQTVYDACFLKFYDGQFGAANGV 717
Query: 807 LPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
PDG + + E+W+G+ + +AA ++ + D GF+ + E ++ G F+
Sbjct: 718 RPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKDEGFRLTQAVVEQIYNN---GLQFR 774
Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAM 890
TPEA +R+ Y+RP+AIWA+
Sbjct: 775 TPEAITAAGTFRASTYLRPMAIWAI 799
Score = 200 bits (509), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 136/396 (34%), Positives = 196/396 (49%), Gaps = 51/396 (12%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQI--FPRVCEDKPVLANQFSVF-VSRSNGQKYSSVL 151
HG+PLGG G+G IGRS RG+F W I + ++ P A QFSVF + ++ Q Y+
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFQNFP--ACQFSVFEANGTSSQAYALST 97
Query: 152 CPKTPEVLKDTTAAGIGSWDWNLKGDKS----TYHALYPRAWTVHEGEPDPELRIVCRQI 207
P E LK SW W ++ TYHALYPR+W V+E EL C Q
Sbjct: 98 QPTDDESLK--------SWQWYPASSETQTTGTYHALYPRSWFVYENVFQAEL--TCEQF 147
Query: 208 SPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMN 267
SPI +NY+E+SYPV+VF + +N ++++ TW N VG FT + ++
Sbjct: 148 SPIWANNYQETSYPVAVFAWQAHNPTNAPITLSIMLTWQNMVGW---FTNALKSPDVRVR 204
Query: 268 D---------------GVHAVLLHHRTSHQ---LPPVTFALAAQETDGV-------HVSL 302
D G + L H L V QE +G H +
Sbjct: 205 DDGSPVYEYQPRLGESGGNYNYLDESPQHLGCFLGRVGMTEPLQEGEGSWCIATLKHSQV 264
Query: 303 CPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDS 362
+ N +G T +++W GS N ++TS +E +GAAIA T+ P
Sbjct: 265 EVFYHTRWNPVG-TGEEVWQSFAADGSL--ANHIDTSSVTE-NEQLGAAIAVRFTLQPGE 320
Query: 363 EGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQ 422
++ F ++WD P F +G YYRRYT F+G + A IA A+ ++ +W+ QIE+WQ
Sbjct: 321 TLEIPFVVSWDLPVTEFAAGVNYYRRYTDFFGKTGDHAWAIATTALEQYKTWQQQIESWQ 380
Query: 423 RPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
PIL LP+W+ + LFNELY L +GG +W+ +P
Sbjct: 381 DPILNRPDLPDWFKMALFNELYDLTSGGTLWSAATP 416
>gi|225593155|gb|ACN96073.1| pedicted bile acid beta-glucosidase [Fischerella sp. MV11]
Length = 812
Score = 233 bits (595), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 142/392 (36%), Positives = 205/392 (52%), Gaps = 35/392 (8%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
IGQF LE ++Y + + DV Y SFAL+MLFP+++ ++ R FA A+ + D +
Sbjct: 421 IGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKAVMRAFARAIPVSDSTTRVIGYY 480
Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
+ + + RKV GA PHD+G + W + N D WKDL FVLQVYRD +
Sbjct: 481 YTVGAESPFAIRKVAGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFL 540
Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TG D F W ++ + Y+ FD D DG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 541 FTGADDITFLAECWNAIVQTLDYLKTFDIDNDGIPENSGAPDQTFDDWRLQGVSAYCGGL 600
Query: 699 W--------------VAALQAASALAREVGDRG----SEDYFLFKFQKAKVVYE-KLWNG 739
W + + + +GD G + + Q+++ +YE KLWNG
Sbjct: 601 WLAALEAAIAICDVLITNRRDTENAEKFLGDLGDLAVQKSIYETWLQQSRPIYEAKLWNG 660
Query: 740 SYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGK 799
Y+ D+ S S + ADQL GQ+YAR GL IV D+A SALK VYN LK G+
Sbjct: 661 QYYRLDS--ESGSDVVMADQLCGQFYARLLGLPDIVKRDRALSALKTVYNACFLKFHHGQ 718
Query: 800 RGAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGT 858
GA NG+ PDG + + E+W+G+ + +AA M+ + D + + +
Sbjct: 719 FGAANGLRPDGSPENPQATHPLEVWTGINFGLAAFMVQMGMKDEAMKLTEAVVRQVYEN- 777
Query: 859 GLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
G F+TPEA +R+ Y+R +AIWA+
Sbjct: 778 --GLQFRTPEAITAVGTFRASTYLRAMAIWAI 807
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 133/389 (34%), Positives = 190/389 (48%), Gaps = 44/389 (11%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W I K + QFSVF R+ + LC +
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHVFKNIPGCQFSVF-ERTEAGTQAYALCTQ 98
Query: 155 TPEVLKDTTAAGIGSWDW-----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISP 209
PE + +W W N +G TYHALYPR+W V+E +L C Q SP
Sbjct: 99 PPE------DGSLAAWQWYPTIKNEEGITGTYHALYPRSWFVYEQVYKTQL--TCEQFSP 150
Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
+ +NY+E+SYPV++F + +N S ++++ TW N VG FT + + +M D
Sbjct: 151 LWANNYQETSYPVAIFVWKAHNPTNASIALSIMLTWENMVGW---FTNALKSPEVRMRDD 207
Query: 270 VHAVL---------------LHHRTSH---QLPPVTFALAAQETDG---VHVSLCPHFVI 308
V L + H L V QE +G + P I
Sbjct: 208 GSPVYEYQPRLGESKDNFNQLVENSEHIGCILGRVGVDQPIQEGEGQWCIATRKHPKLEI 267
Query: 309 SGNSLGL---TAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQ 365
+S T +D+W + GS ++V G I AA+A T+ P +
Sbjct: 268 FYHSRWQPRGTGEDVWQSFAQDGSLPNYLDHASAVE---GEQIAAALAVRFTLQPGETIE 324
Query: 366 VTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPI 425
+ F L WD P F +G YYRRYT F+G A IA A+ ++ +W+ QI+AWQ+PI
Sbjct: 325 IPFVLTWDFPVTEFAAGINYYRRYTDFFGNDGKNAWAIALIALKQYQAWQAQIQAWQKPI 384
Query: 426 LEDKRLPEWYPITLFNELYYLNAGGAVWT 454
L+ LP+W+ + LFNELY L+AGG +W+
Sbjct: 385 LDRADLPDWFKMALFNELYDLSAGGTLWS 413
>gi|119493962|ref|ZP_01624522.1| hypothetical protein L8106_04106 [Lyngbya sp. PCC 8106]
gi|119452283|gb|EAW33479.1| hypothetical protein L8106_04106 [Lyngbya sp. PCC 8106]
Length = 799
Score = 233 bits (595), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 141/391 (36%), Positives = 210/391 (53%), Gaps = 23/391 (5%)
Query: 518 GTNLLQDGEEN--IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAV 575
G L +EN GQF LE ++Y + + DV Y SF L++L+P++ S+ FA ++
Sbjct: 408 GGTLWSAADENNPKGQFAVLECLDYRWYESLDVRLYGSFGLLLLWPELDKSVLLAFARSI 467
Query: 576 MMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVL 633
D + + + RK GA PHD+G + PW + N D WKDL FVL
Sbjct: 468 SKGDKTPRIIGYNQAEAIRKATGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLPCDFVL 527
Query: 634 QVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
QVYRD V TG D +F + W ++ + Y+ FD+D DG+ EN G PDQT+D W + G+
Sbjct: 528 QVYRDFVLTGSEDYEFLQDCWLAIVETLDYLKIFDKDEDGIPENSGAPDQTFDDWKLQGV 587
Query: 692 SAYSGGLWVAALQAASALAR-----EVGDRGSEDY------FLFKFQKAKVVYEKLWNGS 740
SAY GGLW+AAL+AA + + E D S + +KAK +Y+ LWNG
Sbjct: 588 SAYCGGLWLAALEAAIEIGKCLQKTEFSDNVSPSISDKISTYETWLEKAKPIYQNLWNGQ 647
Query: 741 YFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKR 800
YF D+ S S + ADQL GQ+YA+ GL IV +K SAL+ +Y G+
Sbjct: 648 YFQLDS--QSGSDVVMADQLCGQFYAKLLGLEDIVSPEKTISALQTIYQSCFQNFYQGQF 705
Query: 801 GAVNGMLPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTG 859
GA NG+ +G ++ + E+W+G+ + +AA +I + + GF+ A + + +
Sbjct: 706 GAANGVRINGEPINPNDTHPLEVWTGINFGLAAFLIQMGMKEEGFKLAEVVVQQIYEN-- 763
Query: 860 LGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
G F+TPEA +R+ Y+R +AIWA+
Sbjct: 764 -GLQFRTPEAITAAGTFRASHYLRAMAIWAI 793
Score = 197 bits (502), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 123/386 (31%), Positives = 194/386 (50%), Gaps = 39/386 (10%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W + + + A QFS+F NG+ + LC +
Sbjct: 41 HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHIFRSLPACQFSIF-EEVNGKTQTYALCTE 99
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKS----TYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
PE + +W+W S TYHALYPR+W V+E +L C Q SP+
Sbjct: 100 PPE------DESLSAWNWYPASSNSQKTGTYHALYPRSWFVYENVFKAQLS--CEQFSPV 151
Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQH----------- 259
+NY+E+SYP++VF + +N +++L TW N++G + Q
Sbjct: 152 WANNYQETSYPIAVFEWIAHNPTDEPITLSILLTWQNTIGWFTNALEQPLVQVRDDGSPI 211
Query: 260 YNSKTKMNDGV----HAVLLHHRTSHQLPPVTFALAAQETDG-------VHVSLCPHFVI 308
Y + + + V H V HR + ++ + +E +G + ++ ++
Sbjct: 212 YEYQPRWGNSVGNFNHLVEDFHRIGCLMSRLSASEEVREGEGQMAIATITNPAVEVYYHT 271
Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
+ N++G T +++W+ + G+ L E+ + G IG A+A T+ P ++ F
Sbjct: 272 AWNTVG-TGEEIWNYFSQDGT---LLDEESEHPATEGRQIGVALAVKFTIRPGKTRKIPF 327
Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
L+WD P F SG YYRRYT F+G + A ++ R A+ W I+AWQ PIL
Sbjct: 328 LLSWDLPVTEFESGVGYYRRYTDFFGRNGKNAWSMIRTAMKHSDVWLDNIKAWQNPILNR 387
Query: 429 KRLPEWYPITLFNELYYLNAGGAVWT 454
+ LP W+ + LFNELY L GG +W+
Sbjct: 388 QDLPSWFKMALFNELYLLTDGGTLWS 413
>gi|427712531|ref|YP_007061155.1| putative bile acid beta-glucosidase [Synechococcus sp. PCC 6312]
gi|427376660|gb|AFY60612.1| putative bile acid beta-glucosidase [Synechococcus sp. PCC 6312]
Length = 799
Score = 233 bits (594), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 144/381 (37%), Positives = 211/381 (55%), Gaps = 25/381 (6%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL-- 586
+GQ+ LE ++Y + + DV Y SFAL+ L+P+++ + R FA A+ DP K++++
Sbjct: 414 LGQYAVLECLDYRWYESLDVRLYGSFALLYLWPELEKGVMRAFARAIPTEDP-KLRIIGY 472
Query: 587 ------DDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRD 638
+ RK+ A PHD+G + PW N D WKDL FVL VYRD
Sbjct: 473 YYQGDPETAHRAPRKIAHATPHDLGAPNEHPWQASNYTSYQDCNLWKDLASDFVLLVYRD 532
Query: 639 VVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSG 696
+ TG D +F WP++ ++ Y+ FD DGDG+ EN G PDQTYD W + G+SAY G
Sbjct: 533 YLLTGATDLEFLTDCWPAIVSSLDYLKTFDTDGDGLPENSGAPDQTYDDWQLKGVSAYCG 592
Query: 697 GLWVAALQAASALAREVGDRGS------EDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSS 750
GLW+AAL+AA A+A+ + +G+ E + + Q KLWNG ++ D+ +S
Sbjct: 593 GLWIAALEAALAIAQILEKQGNNTKLEQEKWQAWLSQSRPTYQAKLWNGQFYRLDSDSNS 652
Query: 751 QSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDG 810
Q + ADQL GQ+YAR L +V + A SALK VY+ LK G+ GA NG+LPDG
Sbjct: 653 Q--VVMADQLCGQYYARLLNLPDVVPIECAESALKTVYDVCFLKFHEGQFGAANGLLPDG 710
Query: 811 RVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEA 869
+ ++ E+W G+ Y +AA + L D ++ + + G F+TPEA
Sbjct: 711 SPEKPNATHPLEVWIGINYGLAAFLYQMGLEDGAWRLTQTVIRQIYDN---GLQFRTPEA 767
Query: 870 WNTDDQYRSLCYMRPLAIWAM 890
+R+ Y+RP+AIWAM
Sbjct: 768 ITAVGTFRACMYLRPMAIWAM 788
Score = 183 bits (465), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 126/394 (31%), Positives = 184/394 (46%), Gaps = 53/394 (13%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W + + + QFSVF ++ + LC +
Sbjct: 38 HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHIFQTMPGCQFSVF-EQAGDHSQAYALCTE 96
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
P + +G+W+W K TY ALYPR+W ++G + + C Q SPIIP N
Sbjct: 97 PP------SDGTLGAWNWYPTA-KGTYGALYPRSWFQYQGVF--QAGLTCEQFSPIIPDN 147
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFT------------------ 256
Y+E+SYPV+VF + +N ++++ +W N VG S T
Sbjct: 148 YQETSYPVAVFVWAAHNPTDQPITLSIMLSWQNLVGWFSNATPASEIRIRDDGSPVYDYT 207
Query: 257 ---GQ---HYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISG 310
GQ +YNS + +L R H A +E +G P
Sbjct: 208 PRLGQSQGNYNSFHQSQTQASCLLQRQRDPH--------TAIKEGEGEWAFSVP----VR 255
Query: 311 NSLGLTAKDMWHEIKEHG----SFDRLNSMETSVTSEPGS---SIGAAIAASVTVPPDSE 363
+ L + W+ + + G SF R S+ ++P S IGA +A T+ P
Sbjct: 256 DDLEIFYHTRWNPVGDGGELWSSFSRDGSLSNLENNQPASDIEQIGATLAVRFTLAPGES 315
Query: 364 GQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQR 423
++ F LAWD P F Y+RRYT F+ A IA+ A+ + W QI AWQ+
Sbjct: 316 KEIPFGLAWDLPVTEFAPNVNYFRRYTDFWNRSGTNAETIAQVALENYQKWSEQIGAWQQ 375
Query: 424 PILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
PILE P+ + + L NELY L +GG++WT S
Sbjct: 376 PILEHPDWPDDFKMALINELYDLTSGGSLWTAAS 409
>gi|414077215|ref|YP_006996533.1| beta-glucosidase [Anabaena sp. 90]
gi|413970631|gb|AFW94720.1| putative beta-glucosidase [Anabaena sp. 90]
Length = 795
Score = 233 bits (593), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 143/383 (37%), Positives = 212/383 (55%), Gaps = 26/383 (6%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
+GQF LE ++Y + + DV Y SF L+MLFP+++ S+ R FA A+ D
Sbjct: 413 VGQFAVLECLDYRWYESLDVRLYGSFGLLMLFPELEKSVIRAFARAIPQSDDRTRIIGYY 472
Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
+ + + RK+ GA PHD+G + W + N D WKDL FVLQVYRD +
Sbjct: 473 LTIKSESPNAVRKIAGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFL 532
Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TG D +F W ++ + Y+ FD DGD + EN G PDQT+D W + G+SAY GGL
Sbjct: 533 FTGANDLQFLADCWDAIVQTLDYVKTFDLDGDSIPENSGAPDQTFDDWRLQGVSAYCGGL 592
Query: 699 WVAALQAASALAREVGD-RGSED---------YFLFKFQKAKVVYEKLWNGSYFNYDNSG 748
W+AAL+AA A+++ + + RG+ED Y + Q + EKLWNG Y+ D+
Sbjct: 593 WLAALEAAIAISKILTNRRGAEDAEVEVQGSIYQGWLSQSKPIYQEKLWNGEYYRLDS-- 650
Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
S S + ADQL GQ+YAR L IV D+A SAL KVY+ LK G+ GA NG+LP
Sbjct: 651 DSGSDVVMADQLCGQFYARLLSLPDIVPSDRALSALTKVYDSCFLKFQNGEFGAANGVLP 710
Query: 809 DGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTP 867
+G + +S E+W+G+ + +AA ++ ++ + + + + G F+TP
Sbjct: 711 NGLPENPNSTHPLEVWTGINFGLAAFLVQMNMKNEAMRLTEAVVRQIYDN---GLQFRTP 767
Query: 868 EAWNTDDQYRSLCYMRPLAIWAM 890
EA + +R+ Y+R +AIW +
Sbjct: 768 EAITANGTFRASTYLRAMAIWGI 790
Score = 198 bits (503), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 137/391 (35%), Positives = 189/391 (48%), Gaps = 48/391 (12%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W I K V A QFS+F S GQ Y+ P
Sbjct: 38 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFKTVPACQFSIFES---GQAYALSTQPP 94
Query: 155 TPEVLKDTTAAGIGSWDW---NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPII 211
L+ SW W + + + TYHALYPR+W V+E +L C Q SPI
Sbjct: 95 EDHTLQ--------SWQWYPQSQEQNSGTYHALYPRSWFVYENVLKTQL--TCEQFSPIW 144
Query: 212 PHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMND--- 268
NY+ESSYPV+VF + +N I+++ TW N VG FT + + ++ D
Sbjct: 145 ADNYQESSYPVAVFLWKAHNPTNAPITISIMLTWENMVGW---FTNTLKSPEVRVRDDGS 201
Query: 269 -------------GVHAVLL--HHRTSHQLPPVTFALAAQETDG---VHVSLCPHFVI-- 308
G + L H L V+ QE DG + + P I
Sbjct: 202 PVYEYLPRWGESQGNYNCLAEDHQYFGCVLGQVSDHQTVQEGDGTWCIATAKNPQVEIFY 261
Query: 309 --SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQV 366
N +G +D+W +GS N ++ + E + +GAAIA T+ P +V
Sbjct: 262 HSRFNPVG-NGEDVWQSFSANGSLP--NYIDETPADE-NTRLGAAIAVRFTLQPGENLEV 317
Query: 367 TFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
L+WD P F +G YYRRYT F+G + A IA A+ E+ +W I+ WQ PIL
Sbjct: 318 PLVLSWDFPVTEFAAGVNYYRRYTDFFGCNGENAWQIATTALKEYQNWRSLIQDWQNPIL 377
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
LP+W+ + LFNELY L +GG +W+ S
Sbjct: 378 AKSDLPDWFKMALFNELYDLTSGGTLWSAAS 408
>gi|354566897|ref|ZP_08986068.1| Glucosylceramidase [Fischerella sp. JSC-11]
gi|353544556|gb|EHC14010.1| Glucosylceramidase [Fischerella sp. JSC-11]
Length = 812
Score = 232 bits (591), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 145/392 (36%), Positives = 205/392 (52%), Gaps = 35/392 (8%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
IGQF LE ++Y + + DV Y SFAL+MLFP+++ ++ R FA A+ + D +
Sbjct: 421 IGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKAVMRAFARAIPVSDQTTRVIGYY 480
Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
+ + + RKV GA PHD+G + W + N D WKDL FVLQVYRD +
Sbjct: 481 YTVGAESPFAIRKVAGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFL 540
Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TG D F W ++ + Y+ FD D DG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 541 FTGADDITFLAECWNAIVETLDYLKTFDIDNDGIPENSGAPDQTFDDWRLQGVSAYCGGL 600
Query: 699 W----VAALQAASALAREVGDRGSEDYFLFK--------------FQKAKVVYE-KLWNG 739
W AA+ L D + + FL Q+++ +YE KLWNG
Sbjct: 601 WLAALEAAIAICDVLITNRRDTDNAENFLGDLGDLAVQKSIYETWLQQSRPIYEAKLWNG 660
Query: 740 SYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGK 799
Y+ D+ S S + ADQL GQ+YAR GL IV D+A SALK VYN LK G+
Sbjct: 661 QYYRLDS--ESGSDVVMADQLCGQFYARLLGLPDIVKRDRALSALKTVYNACFLKFHHGQ 718
Query: 800 RGAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGT 858
GA NG+ PDG + + E+W+G+ + +AA M+ + D + + +
Sbjct: 719 FGAANGLRPDGSPENPQATHPLEVWTGINFGLAAFMVQMGMKDEAMKLTETVVRQIYEN- 777
Query: 859 GLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
G F+TPEA +R+ Y+R +AIWA+
Sbjct: 778 --GLQFRTPEAITAVGTFRASTYLRAMAIWAI 807
Score = 207 bits (527), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 136/389 (34%), Positives = 190/389 (48%), Gaps = 44/389 (11%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W I K + QFSVF R+ + LC +
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHVFKNIPGCQFSVF-ERTGAGTQAYALCTQ 98
Query: 155 TPEVLKDTTAAGIGSWDW-----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISP 209
PE + +W W N +G TYHALYPR+W V+E +L C Q SP
Sbjct: 99 PPE------DGSLAAWQWYPTIKNEEGITGTYHALYPRSWFVYEQVYKTQL--TCEQFSP 150
Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
I +NY+E+SYPV++F + +N S ++++ TW N VG FT + + KM D
Sbjct: 151 IWANNYQETSYPVAIFVWKAHNPTNASIALSIMLTWENMVGW---FTNALKSPEVKMRDD 207
Query: 270 VHAVL---------------LHHRTSH---QLPPVTFALAAQETDG---VHVSLCPHFVI 308
V L + H L V QE +G + P I
Sbjct: 208 GSPVYEYQPPLGESKDNFNQLVENSEHIGCVLGRVGVDQPIQEGEGQWCIATRKHPKLEI 267
Query: 309 SGNSLGL---TAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQ 365
+S T +D+W + GS ++V G I AA+A T+ P +
Sbjct: 268 FYHSRWQPRGTGEDVWQSFAQDGSLPNYLDHTSAVE---GEQIAAALAVRFTLQPGETLE 324
Query: 366 VTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPI 425
+ F L WD P F +G YYRRYT F+G A IA A+ ++ +W+ QI+AWQ+PI
Sbjct: 325 IPFVLTWDFPVTEFAAGINYYRRYTDFFGNDGKNAWAIALIALKQYQAWQAQIQAWQKPI 384
Query: 426 LEDKRLPEWYPITLFNELYYLNAGGAVWT 454
LE LP+W+ + LFNELY L+AGG +W+
Sbjct: 385 LERADLPDWFKMALFNELYDLSAGGTLWS 413
>gi|7470943|pir||T31071 conserved hypothetical protein - Anabaena sp. (fragment)
gi|2828799|gb|AAC38131.1|AAC38131 unknown [Nostoc sp. PCC 7120]
Length = 439
Score = 231 bits (589), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 141/385 (36%), Positives = 208/385 (54%), Gaps = 28/385 (7%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
IGQF LE ++Y + + DV Y SF L+ LFP+++ ++ R FA A+ D +
Sbjct: 55 IGQFAVLECLDYRWYESLDVRLYGSFGLLQLFPELEKAVMRAFARAIPQGDDTPRVIGYY 114
Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
+ + RK GA PHD+G + W + N D WKDL FVLQVYRD +
Sbjct: 115 YTIGAESPIAVRKTPGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFL 174
Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TG D +F + W ++ + Y+ FD DGDG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 175 LTGADDVQFLRDCWDAIVETLDYVKTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGL 234
Query: 699 W----VAALQAASALAREVGDRGSEDYFLFK-------FQKAKVVY-EKLWNGSYFNYDN 746
W AA+ + L + D ++ + L++ K+ +Y EKLWNG Y+ D+
Sbjct: 235 WLAALAAAIAISDILLQNHQDSATKGHLLYQKSTYETWLTKSLPIYQEKLWNGKYYRLDS 294
Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
S S + ADQL GQ+YA L IV D+A SAL+ VY+ LK G+ GA NG+
Sbjct: 295 --ESGSDVVMADQLCGQFYANLLELPDIVPSDRAISALQTVYDACFLKFYDGQFGAANGV 352
Query: 807 LPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
PDG + + E+W+G+ + +AA ++ + D GF+ + E ++ G F+
Sbjct: 353 RPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKDEGFRLTQAVVEQIYNN---GLQFR 409
Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAM 890
TPEA +R+ Y+RP+AIWA+
Sbjct: 410 TPEAITAAGTFRASTYLRPMAIWAI 434
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 35/49 (71%)
Query: 410 EHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
++ +W+ QIE+WQ PIL LP+W+ + LFNELY L +GG +W+ +P
Sbjct: 3 QYKTWQQQIESWQDPILNRPDLPDWFKMALFNELYDLTSGGTLWSAATP 51
>gi|81299663|ref|YP_399871.1| hypothetical protein Synpcc7942_0854 [Synechococcus elongatus PCC
7942]
gi|81168544|gb|ABB56884.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 798
Score = 229 bits (584), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 140/389 (35%), Positives = 203/389 (52%), Gaps = 26/389 (6%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
+GQF LE ++Y + + DV Y +F L+ML+P+++ ++ R FA A+ D + +++
Sbjct: 412 VGQFAVLECLDYAWYESLDVRLYGAFGLLMLWPELEKAVMRAFARAIPQAD-DRTRVI-- 468
Query: 589 GQWVS---------RKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
G W + RK+ A PHD+G + PW + N D WKDL FVLQV+R
Sbjct: 469 GYWFTIGQESPLAKRKLANATPHDLGAPNESPWLQTNYTGYQDCNLWKDLGCDFVLQVWR 528
Query: 638 DVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYS 695
D TG D+ F WP+ A+ Y+ FD DGDG+ EN G PDQT+D W + GISAY
Sbjct: 529 DYQLTGSSDRGFLSDCWPAAVAALRYLKDFDLDGDGIPENSGAPDQTFDDWRLQGISAYC 588
Query: 696 GGLW----VAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQ 751
GGLW AAL A L DR +D+ + Q + ++ LWNG Y+ D+ SQ
Sbjct: 589 GGLWIAALEAALAIADVLELSAEDRDRQDFQSWLAQARSLYHDTLWNGRYYQLDSGSGSQ 648
Query: 752 SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
+ ADQL G +Y+R L P+ + A+S VY K G+ G NG+LPDG
Sbjct: 649 --VVMADQLCGDFYSRLLQLPPVAPLEAAQSTADMVYEACFQKFHSGQFGLANGLLPDGS 706
Query: 812 -VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAW 870
VD E+W+G+ + +AA + D F+ + + G F+TPEA
Sbjct: 707 PVDPKGTHPLEVWTGINFGIAAYWLLLGHRDRCFEVTETVIRQIYDN---GLQFRTPEAI 763
Query: 871 NTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
+ +R+ Y+RP+AIWA+ LT P
Sbjct: 764 TANATFRASHYLRPMAIWAVYGVLTNFSP 792
Score = 168 bits (425), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 121/396 (30%), Positives = 184/396 (46%), Gaps = 57/396 (14%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG PLGG G+G GRS RG+ W + + A QF+V+ S + S+
Sbjct: 36 HGAPLGGFGAGCWGRSPRGDVTLWHLDGGEHWYGSIPACQFAVYESGTGAYALST----- 90
Query: 155 TPEVLKDTTAAGIGSWDW--NLKGDKST--YHALYPRAWTVHEGEPDPELRIVCRQISPI 210
E D + + SW+W ++ST Y ALYPR+ ++ + E I CRQ SPI
Sbjct: 91 --EAPSDGS---LSSWNWYPASTAERSTGEYSALYPRSQFSYQQVFEAE--IHCRQFSPI 143
Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGV 270
+PH+Y+ +SYP ++F + ++N I++L +W N G FT + + D
Sbjct: 144 LPHDYQATSYPTAIFRWQLHNPSDRPLTISILLSWENLCGW---FTNTNKAPEVVYRDDG 200
Query: 271 HAV-------------------------LLHHRTSHQLPPV---TFALAAQETDGVHVSL 302
V LL +T Q P +A+A E +G+ +
Sbjct: 201 SPVYDYVPALGQSVGNLNQRIAGEGWQGLLLDQTRSQDPEEGDGQWAIAIAEAEGLEIFR 260
Query: 303 CPHFVISGNSLGLTAKDMWHEIKEHGSF-DRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
C + +G+ ++W GS DR +S + G + AIA +T+ P
Sbjct: 261 CDRWDPTGD-----GSELWQSFALDGSIPDRQDSQPAAA----GERLAGAIAVRLTLQPG 311
Query: 362 SEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAW 421
++ FS+AWD P F +G +RRYT F+G AA IA + + WE I AW
Sbjct: 312 ESREIPFSIAWDLPVTEFAAGVKAFRRYTDFFGRDGRNAAAIAATGLKHYDEWEQAIAAW 371
Query: 422 QRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
Q+PIL+ L + + + LFNELY L +GG++WT S
Sbjct: 372 QQPILDRDDLSDTFKLGLFNELYDLCSGGSLWTAAS 407
>gi|366162568|ref|ZP_09462323.1| hypothetical protein AcelC_02756 [Acetivibrio cellulolyticus CD2]
Length = 842
Score = 228 bits (582), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 148/367 (40%), Positives = 190/367 (51%), Gaps = 20/367 (5%)
Query: 532 FLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQW 591
F +LE +Y + T DV FY S+ LI+++P I R FA +V K
Sbjct: 401 FSHLECYKYDFYGTSDVRFYGSWPLILMWPDIDKQAVRQFADSVYHTRTDK--------- 451
Query: 592 VSRKVLGAVPHDIGICDPWFEV-NAYCLYDTARWKDLNPKFVLQVYRDVVATG--DKKFA 648
+G HDIG + FE NAY D+ WKDLN K VL VYRD TG D F
Sbjct: 452 --PAAIGTTAHDIGNANNVFETWNAYVYRDSTNWKDLNSKLVLMVYRDWALTGKTDMDFL 509
Query: 649 KAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASA 708
K W V AM + D DGD + + G DQTYD + G +AY GGL+VAA AA
Sbjct: 510 KYCWIPVQKAMDKVKSQDSDGDYLPNSSGI-DQTYDDMQLQGNTAYCGGLFVAACNAAKE 568
Query: 709 LAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYAR 767
+A +GD + ++K+K Y KLWNGSYF D +GS +S I +DQL G WYA+
Sbjct: 569 IATAMGDSTKATQYQSWYEKSKENYIAKLWNGSYFKID-TGSYDTSRIMSDQLCGHWYAK 627
Query: 768 ACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVT 827
ACGL IV ++ A+ + KVY +N K G G VN M DG VDMS Q+ E W G +
Sbjct: 628 ACGLEGIVPDEYAKKSYSKVYEFNNKKFDNGLHGFVNIMKADGNVDMSHAQTAEAWVGTS 687
Query: 828 YAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTD-DQYRSLCYMRPLA 886
Y V A MI E L Q + + W + F+TPEAW + R+ YMR
Sbjct: 688 YGVIAGMIQEGLETEASQVGSNLADTVWKTNDM--WFRTPEAWRQGVSEVRAPYYMRANC 745
Query: 887 IWAMQWA 893
IWA++ A
Sbjct: 746 IWAVKHA 752
Score = 157 bits (396), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 109/370 (29%), Positives = 176/370 (47%), Gaps = 59/370 (15%)
Query: 96 GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKT 155
G P+GG G+G+I + G F + ++ E F ++ + +GQ S
Sbjct: 70 GAPIGGFGAGTITWKFDGSFTKGRLDIGSNELSSDADCAFYMY-QKPSGQNIS------- 121
Query: 156 PEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNY 215
+K A IG+ ++TY++L+P++W + G + + Q SPIIP NY
Sbjct: 122 ---MKKLDAGSIGA-------GQATYYSLFPKSWVDYSGS-NFYCKAKVTQFSPIIPGNY 170
Query: 216 KESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLL 275
+ +SYPV V+ + I N G +S D+ ++ +W N GG NSK G+ +L
Sbjct: 171 ETTSYPVGVYKWEITNPGSSSCDVAIMLSWKNEFGGSFAEGQTSGNSK-----GI--ILR 223
Query: 276 HHRTSHQL--PPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRL 333
+ TS+ F+LAA++ DGV V+ +GS
Sbjct: 224 RNGTSNATMENQGEFSLAAEQKDGVKVT-------------------------YGSASGT 258
Query: 334 NSMETSVTSEP--GSSIGA----AIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYR 387
+++ + ++ G ++G+ A+A T+ P V +L+WD P SG +YR
Sbjct: 259 STLSSDFGTDGLLGDTVGSNNIGAVAFKATLEPGQTITVPVALSWDIPISQAGSGSKWYR 318
Query: 388 RYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLN 447
+YT+FYG + IA+DAI + SWE I++WQ +L D + P+W T+FNELYY
Sbjct: 319 KYTRFYGRSGLNSWKIAKDAIDNYSSWESSIDSWQNSVLNDTKYPQWLKTTMFNELYYYF 378
Query: 448 AGGAVWTDGS 457
GG W G+
Sbjct: 379 TGGTYWEAGA 388
>gi|284929778|ref|YP_003422300.1| putative bile acid beta-glucosidase [cyanobacterium UCYN-A]
gi|284810222|gb|ADB95919.1| predicted bile acid beta-glucosidase [cyanobacterium UCYN-A]
Length = 803
Score = 227 bits (579), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 135/391 (34%), Positives = 206/391 (52%), Gaps = 28/391 (7%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
+G+F LE I+Y + + DV Y SF L+ L+P ++ ++ + FA ++ D + + D
Sbjct: 417 VGKFGVLECIDYRWYESLDVRLYGSFGLLALWPSLEKAVIKTFAESISKSDHTLRLIGYD 476
Query: 589 GQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--D 644
RKV GAVPHD+G + PW E N D WKDL FVLQVYRD + T D
Sbjct: 477 RSLAPRKVKGAVPHDLGAPNEHPWEETNYTSYQDCNLWKDLGSDFVLQVYRDFLLTDSQD 536
Query: 645 KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
F + WP++ + Y+ +FD D DG+ EN G PDQT+D W + GIS+Y G LW+ AL+
Sbjct: 537 INFLQECWPAIVETLDYLKKFDLDDDGIPENSGAPDQTFDDWQLHGISSYCGALWITALE 596
Query: 705 AASALAREVGDRGSEDYF-------------LFKF----QKAKVVY-EKLWNGSYFNYDN 746
AA + + + + L K+ +K++++Y E LWNG Y+ D+
Sbjct: 597 AAIGIGKILLKNSENNSHPDLQNSLAFIQENLTKYDDWLKKSRLIYHETLWNGEYYQLDS 656
Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
S S +I DQL GQ+YAR L +VD SAL K+Y L G+ GA NG+
Sbjct: 657 --KSGSDTIMTDQLCGQFYARLLNLPDVVDTSCTISALSKIYEACFLNFYDGQYGAANGV 714
Query: 807 LPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
DG + +S +E+W G+ + +AA +I + + + + E ++ G F+
Sbjct: 715 RSDGTPENPNSTHPQEVWVGINFGLAAFLIQMGMKNQALELTKVVVEQIYAN---GLQFR 771
Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
TPEA +R+ Y+R +AIW + + L +
Sbjct: 772 TPEAITPVGTFRASHYLRAMAIWGVYYQLKK 802
Score = 199 bits (505), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 128/391 (32%), Positives = 193/391 (49%), Gaps = 47/391 (12%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+P+GG G+G IGRS G+F W + K + + QFS+F +G+ + L K
Sbjct: 41 HGMPIGGFGAGCIGRSSAGDFNLWHLDGGEHVFKTISSCQFSIFEELKDGKTKAYALSTK 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
PE D T + W+W K +K TY+ALYPR+W +EG E +IVC Q SPIIP N
Sbjct: 101 EPE---DKT---LSKWNWYPK-NKGTYYALYPRSWYQYEGIF--EAKIVCEQFSPIIPDN 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSE---- 254
Y+E+SYP+ +F +T+ N ++++ TW N +G G E
Sbjct: 152 YQETSYPIGIFEWTVSNPTDQPITLSIMLTWQNMIGWFTNSIKTPKIEVRDDGSPEYEYS 211
Query: 255 ----FTGQHYNSKTKMNDGVHAVLLHHRTSHQLPP----VTFALAAQETDGVHVSLCPHF 306
F+ ++N + + +L R + + + FA A + D + S +
Sbjct: 212 PQLGFSQGNFNKWKQSSSQFGFILDRLRVNDSISEGEGQICFA-TAHDPD-LEFSYLGKW 269
Query: 307 VISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQV 366
SGN ++W G+ N+++ ++ ++P I AA+A + P ++
Sbjct: 270 NPSGN-----GSEVWDHFSASGTLP--NTLDETI-AKPDEQIAAAMAVKFVIKPGEVKKI 321
Query: 367 TFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
F+LAWD P F G Y+RRYT F+G + A IA + W QI +Q PIL
Sbjct: 322 PFTLAWDLPITEFSLGNQYHRRYTDFFGINGKNAWMIAETGLKSLNIWREQILEYQLPIL 381
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
DK LP+W+ + LFNELY L GG +WT S
Sbjct: 382 NDKDLPDWFKMALFNELYLLTDGGTLWTTSS 412
>gi|218437284|ref|YP_002375613.1| hypothetical protein PCC7424_0277 [Cyanothece sp. PCC 7424]
gi|218170012|gb|ACK68745.1| protein of unknown function DUF608 [Cyanothece sp. PCC 7424]
Length = 801
Score = 227 bits (578), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 147/429 (34%), Positives = 216/429 (50%), Gaps = 39/429 (9%)
Query: 496 ILERMSSILEQIYTPVALNSSFGTNL--LQDG---------EENIGQFLYLEGIEYLMWN 544
I+E + IL+Q P + L L DG ++ +GQF LE I+Y +
Sbjct: 373 IIEWQAPILKQKELPDWFKMALFNELYLLADGGTLWTAATEDDPVGQFAVLECIDYRWYE 432
Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
+ DV Y SF L ML+P++ ++ FA A+ D + + +G RK GA PHD+
Sbjct: 433 SIDVRLYGSFGLGMLWPRLDKAVMEAFARAIPKGDDTPRIIGYNGAKAIRKAKGATPHDL 492
Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--DKKFAKAVWPSVYVAMA 660
G + PW + N D WKDL FVL +YR + TG ++ F W SV +A
Sbjct: 493 GAPNEHPWEKTNYTSYQDCNLWKDLGCDFVLLLYRSFLWTGANNQDFLWEHWDSVTQTIA 552
Query: 661 YMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR---- 716
Y+ +FD DGDG+ EN G PDQT+D W + GISAY G LW+A L+AA + + R
Sbjct: 553 YVKKFDLDGDGIPENSGAPDQTFDDWQLRGISAYCGALWIAGLEAAIKIGETLLQRPPMN 612
Query: 717 ---GSED-----------YFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAG 762
SE+ Y + Q + ++ LW G Y+ D+ S S + +DQL G
Sbjct: 613 PQLQSENAPEKIEIAIATYRQWLHQARALYHDTLWTGEYYRLDS--ESGSDVVMSDQLCG 670
Query: 763 QWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSMQSRE 821
Q+YAR GL +V+ SAL+K+Y L+ GGK GA NG+ PDG + + E
Sbjct: 671 QFYARLLGLPDVVEPQYTLSALRKIYESCFLQFYGGKYGAANGVRPDGSPENPDATHPLE 730
Query: 822 IWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCY 881
+W+G+ + +AA MI + G + + + G F+TPEA +R+ Y
Sbjct: 731 VWTGINFGLAAFMIQMGMKQEGLSITEAVVKQIYEN---GLQFRTPEAITAAGTFRASHY 787
Query: 882 MRPLAIWAM 890
+R +AIWA+
Sbjct: 788 LRAMAIWAI 796
Score = 209 bits (531), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 130/384 (33%), Positives = 191/384 (49%), Gaps = 39/384 (10%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W + + + QFS+F G+ + LC +
Sbjct: 41 HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHTFRNLPPCQFSIFEHPEGGEAQAYALCTQ 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
PE D T + W W + K TYHALYPR+W +EG + +I+C Q SPI+ N
Sbjct: 101 PPE---DGT---LSRWSWYPQ-KKGTYHALYPRSWYEYEGVF--QSQIICEQYSPILADN 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS------------------EFT 256
Y+ESSYP+++F +TIYN I+++ +W N+VG S E+
Sbjct: 152 YQESSYPLAIFEWTIYNPSDRPVTISIMMSWQNTVGWFSNAIKTPEVKVRDDGSPVYEYQ 211
Query: 257 GQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDG---VHVSLCPHFVISGNSL 313
+ +S N ++ ++R L + +E +G + L P+ + N+
Sbjct: 212 PRWGDSTGNFN---QWIVDNYRVGCLLNRIQPHPQVEEGEGQVCIASILNPNLEVFYNTR 268
Query: 314 GLTAKD---MWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSL 370
D +W +GS L + + PG I AA+A T+PP ++ F L
Sbjct: 269 WNPQGDGSQIWDYFAMNGS---LGDYQNESPAAPGEQIAAAMAIRFTIPPGKTRKIPFIL 325
Query: 371 AWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKR 430
AWD P F G YYRRYT F+G + ++ R A+ H W +I WQ PIL+ K
Sbjct: 326 AWDLPITEFAQGINYYRRYTDFFGRNGQNVWSVVRTALKHHEMWRDRIIEWQAPILKQKE 385
Query: 431 LPEWYPITLFNELYYLNAGGAVWT 454
LP+W+ + LFNELY L GG +WT
Sbjct: 386 LPDWFKMALFNELYLLADGGTLWT 409
>gi|300866990|ref|ZP_07111661.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300335025|emb|CBN56827.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 863
Score = 227 bits (578), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 142/408 (34%), Positives = 215/408 (52%), Gaps = 42/408 (10%)
Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM 583
D + GQF LE I+Y + + DV Y SFAL+ML+P++ ++ FA A+ D +
Sbjct: 456 DERDPKGQFAVLECIDYRWYESLDVRLYGSFALLMLWPELDKAVLLAFARAIFTGDDTPR 515
Query: 584 KLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVA 641
+ + RK+ GA PHD+G + PW + N D WKDL FVLQVYRD +
Sbjct: 516 VIGWNQASAVRKIAGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDFLL 575
Query: 642 TG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
TG D +F + WP++ A+AY+ FD DGDG+ EN G PDQT+D W + G+SAY GGLW
Sbjct: 576 TGSTDYEFLQECWPAIVEALAYLKTFDLDGDGIPENSGAPDQTFDDWQMRGVSAYCGGLW 635
Query: 700 V----------AALQAASALAREVG-------DRGSEDYFLFK--------------FQK 728
+ + L + E+ ++ +D LF ++
Sbjct: 636 LAALEAAIAIGSLLIKHQSSVEELNQSPITNFNKDFKDVSLFPITNYQLPITNYQNWLEQ 695
Query: 729 AKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKV 787
AK +Y+ +LWNG Y+ D+ S S + ADQL GQ+YAR GL IV E+ A SAL+ V
Sbjct: 696 AKPIYQQRLWNGQYYRLDSE--SGSDVVMADQLCGQFYARLLGLPDIVPEECAISALETV 753
Query: 788 YNYNVLKVMGGKRGAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQT 846
Y +K G+ GA NG++ DG + + E+W+G+ + +AA M+ + + +
Sbjct: 754 YESCFIKFNQGEFGAANGVMLDGSPEKPGATHPLEVWTGINFGLAAFMMQMGMKEKALEL 813
Query: 847 ACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
+ + ++ G F+TPEA +R+ Y+R +AIWA+ + L
Sbjct: 814 TEVVVKQIYAN---GLQFRTPEAITATGTFRASHYLRAMAIWAIYYEL 858
Score = 176 bits (446), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 124/401 (30%), Positives = 187/401 (46%), Gaps = 54/401 (13%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS+RG+F W I + + A QFSVF +G+K + L +
Sbjct: 66 HGMPLGGFGAGCIGRSHRGDFNLWHIDGGEHVFRSLPACQFSVF-EEIDGKKQAYALSTE 124
Query: 155 TPEVLKDTTAAGIGSWDW-----NLKGDK--------------STYHALYPRAWTVHEGE 195
P T + +W W +K +K +YHALYPR+W V+E
Sbjct: 125 PP------TDGSLSAWKWYPSELKIKNEKLKSEEQNSDSSLSTGSYHALYPRSWFVYENV 178
Query: 196 PDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----- 250
+L C Q SPI NY+ESSYPV++F + +N ++++ TW N+VG
Sbjct: 179 FQAQLS--CEQFSPIWAGNYQESSYPVAIFEWIAHNPTDRPITLSIMLTWQNTVGWFTNA 236
Query: 251 --------GDSEFTGQHYNSKTKMNDGVHAVLLH--HRTSHQLPPVTFALAAQETDG--- 297
D YNS+ + G L+ HR + ++ + +E +G
Sbjct: 237 NKSPEIKVRDDGSPVYEYNSRWGESGGNFNRLVEDFHRIGCVMTRLSASEEVKEGEGQWA 296
Query: 298 ----VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIA 353
+ ++ + N G + +D+W + G F + ET G + AIA
Sbjct: 297 IATVANAAVEVFYQTRWNPAG-SGEDIWRPFSQEG-FLLDDPDETPAAG--GEQLACAIA 352
Query: 354 ASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGS 413
T+ P ++ F LAWD P F YYRRYT F+G + +I R A+ +
Sbjct: 353 VRFTIRPGKTRKIPFILAWDFPITEFAPEVWYYRRYTDFFGRNGKNVWSIIRTAMKHSDT 412
Query: 414 WELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
W I+AWQ+PIL + LP+ + +FNELY L GG +W+
Sbjct: 413 WRENIQAWQQPILAREDLPDIIKMAMFNELYALTDGGTLWS 453
>gi|220909879|ref|YP_002485190.1| hypothetical protein Cyan7425_4519 [Cyanothece sp. PCC 7425]
gi|219866490|gb|ACL46829.1| protein of unknown function DUF608 [Cyanothece sp. PCC 7425]
Length = 814
Score = 226 bits (577), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 150/410 (36%), Positives = 216/410 (52%), Gaps = 46/410 (11%)
Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHD---- 579
D + IGQF LE ++Y + + DV Y SFAL++L+P+++ ++ FA A+ D
Sbjct: 414 DERDPIGQFAVLECLDYRWYESLDVRLYGSFALLLLWPRLEKAVMEAFARAIPTQDDRTR 473
Query: 580 ----------PSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDL 627
PS L K+ A PHD+G + PW + N D WKDL
Sbjct: 474 VIGYYYTQGMPSPTAL--------HKIANATPHDLGAPNEHPWEKTNYTSYQDCNLWKDL 525
Query: 628 NPKFVLQVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDT 685
FV+ VYRD + TG D F WP+++ +AY+ FD DGDG+ EN G PDQT+D
Sbjct: 526 PADFVILVYRDYLLTGGTDYDFLTTCWPAIHATLAYLKTFDLDGDGIPENSGAPDQTFDD 585
Query: 686 WSVSGISAYSGGLWVAALQAASA----LAREVGDRGSEDYFLFKFQ----KAKVVYEK-L 736
W + GISAY GGLW+AAL+AA A L + D L +Q +++ +Y++ L
Sbjct: 586 WRLQGISAYCGGLWIAALEAAIAIIKLLLVQAPDLPHLQESLTTYQTWLAQSRPLYQQTL 645
Query: 737 WNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVM 796
WNG Y+ D+ S S + ADQL GQ+YAR GL +V ED AR+AL VY L+
Sbjct: 646 WNGQYYRLDS--GSGSDVVMADQLCGQFYARLVGLPDVVPEDCARAALDMVYQTCFLQFH 703
Query: 797 GGKRGAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQ-TACGIYEAA 854
G GA NG+ P G + + E+W+G+ + +AA ++ Q TA +Y+
Sbjct: 704 NGTLGAANGLRPGGLAEKPDATHPLEVWTGINFGLAAFLVQMGRKAEAMQLTAAVVYQI- 762
Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALT---RPKPKT 901
G F+TPEA + +R+ Y+RP+AIWA+ LT P P++
Sbjct: 763 ---YEHGLQFRTPEAITAEQTFRAGMYLRPMAIWAIYHLLTSHANPNPES 809
Score = 196 bits (499), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 136/390 (34%), Positives = 191/390 (48%), Gaps = 50/390 (12%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF--VSRSNGQKYS-SVL 151
HG+PLGG G+G +GRS+RG+F W I + + A QFSVF V +S G+ YS S +
Sbjct: 42 HGMPLGGFGAGCVGRSHRGDFNLWHIDGGEHVFQSLPACQFSVFEQVGQSPGKAYSLSTV 101
Query: 152 CPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPII 211
P+ + SW W + S YHALYPR+W V++ +L VC Q SPI+
Sbjct: 102 APED---------GSLNSWQWYPQSGGS-YHALYPRSWFVYQNVFQADL--VCEQFSPIL 149
Query: 212 PHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVH 271
NY+E+SYPV+VF +T +N ++L+ TW N VG FT + K+ D
Sbjct: 150 TENYQETSYPVAVFHWTAHNPTDQPLTLSLMLTWQNVVGW---FTNADKSPVVKVRDDGS 206
Query: 272 AVLLH-----------HRTSHQLPPVTFAL--------AAQETDG---VHVSLCPH---- 305
V + +R V F L A E +G V P+
Sbjct: 207 PVYTYQPRLGESQGNFNRVIDDNGRVGFVLDRLRSEEAAIAEGEGQWAVATFASPYTSEV 266
Query: 306 -FVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEG 364
+ N +G ++W GS L + + + G I AAIA T+PP
Sbjct: 267 FYYTRWNPVG-DGSELWQTFAADGS---LANCDDETPAGVGEQIAAAIAVRFTLPPGQTR 322
Query: 365 QVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRP 424
Q+ F LAWD P F +G Y+RRYT F+G + IAR + H +W QI+AWQ+P
Sbjct: 323 QIPFVLAWDFPVTEFAAGVNYFRRYTDFFGRTGDHGWAIARTGLKMHQTWTDQIQAWQQP 382
Query: 425 ILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
IL D P+W+ + L NELY L +GG +W+
Sbjct: 383 IL-DSNEPDWFKMALLNELYDLTSGGTLWS 411
>gi|56750695|ref|YP_171396.1| hypothetical protein syc0686_c [Synechococcus elongatus PCC 6301]
gi|56685654|dbj|BAD78876.1| hypothetical protein [Synechococcus elongatus PCC 6301]
Length = 798
Score = 226 bits (577), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 139/389 (35%), Positives = 202/389 (51%), Gaps = 26/389 (6%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
+GQF LE ++Y + + DV Y +F L+ML+P+++ ++ R FA A+ D + +++
Sbjct: 412 VGQFAVLECLDYAWYESLDVRLYGAFGLLMLWPELEKAVMRAFARAIPQAD-DRTRVI-- 468
Query: 589 GQWVS---------RKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
G W + RK+ A PHD+G + PW + N D WKDL FVLQV+R
Sbjct: 469 GYWFTIGQESPLAKRKLANATPHDLGAPNESPWLQTNYTGYQDCNLWKDLGCDFVLQVWR 528
Query: 638 DVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYS 695
D TG D+ F WP+ A+ Y+ FD DGDG+ EN G PDQT+D W + GISAY
Sbjct: 529 DYQLTGSSDRGFLSDCWPAAVAALRYLKDFDLDGDGIPENSGAPDQTFDDWRLQGISAYC 588
Query: 696 GGLW----VAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQ 751
GGLW AAL A L DR +D+ + Q + ++ LWNG Y+ D+ SQ
Sbjct: 589 GGLWIAALEAALAIADVLELSAEDRDRQDFQSWLAQARSLYHDTLWNGRYYQLDSGSGSQ 648
Query: 752 SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
+ ADQL G +Y+R L P+ + A+S VY K G+ G NG+LPDG
Sbjct: 649 --VVMADQLCGDFYSRLLQLPPVAPLEAAQSTADMVYEACFQKFHSGQFGLANGLLPDGS 706
Query: 812 -VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAW 870
V E+W+G+ + +AA + D F+ + + G F+TPEA
Sbjct: 707 PVGPKGTHPLEVWTGINFGIAAYWLLLGHRDRCFEVTETVIRQIYDN---GLQFRTPEAI 763
Query: 871 NTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
+ +R+ Y+RP+AIWA+ LT P
Sbjct: 764 TANATFRASHYLRPMAIWAVYGVLTNFSP 792
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 121/396 (30%), Positives = 184/396 (46%), Gaps = 57/396 (14%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG PLGG G+G GRS RG+ W + + A QF+V+ S + S+
Sbjct: 36 HGAPLGGFGAGCWGRSPRGDVTLWHLDGGEHWYGSIPACQFAVYESGTGAYALST----- 90
Query: 155 TPEVLKDTTAAGIGSWDW--NLKGDKST--YHALYPRAWTVHEGEPDPELRIVCRQISPI 210
E D + + SW+W ++ST Y ALYPR+ ++ + E I CRQ SPI
Sbjct: 91 --EAPSDGS---LSSWNWYPASTAERSTGEYSALYPRSQFSYQQVFEAE--IHCRQFSPI 143
Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGV 270
+PH+Y+ +SYP ++F + ++N I++L +W N G FT + + D
Sbjct: 144 LPHDYQATSYPTAIFRWQLHNPSDRPLTISILLSWENLCGW---FTNTNKAPEVVYRDDG 200
Query: 271 HAV-------------------------LLHHRTSHQLPPV---TFALAAQETDGVHVSL 302
V LL +T Q P +A+A E +G+ +
Sbjct: 201 SPVYDYVPALGQSVGNLNQRIAGEGWQGLLLDQTRSQDPEEGDGQWAIAIAEAEGLEIFR 260
Query: 303 CPHFVISGNSLGLTAKDMWHEIKEHGSF-DRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
C + +G+ ++W GS DR +S + G + AIA +T+ P
Sbjct: 261 CDRWDPTGD-----GSELWQSFALDGSIPDRQDSQPAAA----GERLAGAIAVRLTLQPG 311
Query: 362 SEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAW 421
++ FS+AWD P F +G +RRYT F+G AA IA + + WE I AW
Sbjct: 312 ESREIPFSIAWDLPVTEFAAGVKAFRRYTDFFGRDGRNAAAIAATGLKHYDEWEQAIAAW 371
Query: 422 QRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
Q+PIL+ L + + + LFNELY L +GG++WT S
Sbjct: 372 QQPILDRDDLSDTFKLGLFNELYDLCSGGSLWTAAS 407
>gi|428209901|ref|YP_007094254.1| glucosylceramidase [Chroococcidiopsis thermalis PCC 7203]
gi|428011822|gb|AFY90385.1| Glucosylceramidase [Chroococcidiopsis thermalis PCC 7203]
Length = 812
Score = 226 bits (577), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 142/389 (36%), Positives = 216/389 (55%), Gaps = 32/389 (8%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
IGQF LE I+Y + + DV Y SFAL++LFP+++ ++ R FA A+ D +
Sbjct: 422 IGQFAILECIDYRWYESLDVRLYGSFALLILFPELEKAVMRAFARAIPASDNHRRLIGYY 481
Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
+ + + RKV GA PHD+G + W + N D WKDL FVLQVYRD +
Sbjct: 482 LTIGAEMPLAPRKVAGATPHDLGAPNEHVWEKTNYTAYQDCNLWKDLPSDFVLQVYRDYL 541
Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TG D +F + W ++ A+ Y+ FD+DGDG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 542 LTGATDVEFLASCWEAIVQALNYLKTFDKDGDGIPENSGAPDQTFDDWKLQGVSAYCGGL 601
Query: 699 WVAALQAASALAREVGD---------------RGSEDYFLFKFQKAKVVY-EKLWNGSYF 742
W+AAL+AA A+ + + + + + D + +++ +Y EKLWNG Y+
Sbjct: 602 WLAALEAAIAIGKILLNNVERLHVTSLHVTSLQETIDTYQSWLSRSRPIYQEKLWNGQYY 661
Query: 743 NYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGA 802
D+ +S S + ADQL GQ+YAR L IV ++ A SALK +Y+ LK GK GA
Sbjct: 662 RLDS--NSGSDVVMADQLCGQFYARLLKLPDIVPQECALSALKTIYDACFLKFHQGKFGA 719
Query: 803 VNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLG 861
NG+ P+G + ++ E+W+G+ + +AA ++ + Q + + G
Sbjct: 720 ANGLKPNGSPENPNATHPLEVWTGINFGLAAFLLQMGMKAEALQLTETVVNQIYEN---G 776
Query: 862 YAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
F+TPEA + +R+ Y+R +AIWA+
Sbjct: 777 LQFRTPEAITSVGTFRASYYLRAMAIWAI 805
Score = 202 bits (514), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 133/394 (33%), Positives = 195/394 (49%), Gaps = 54/394 (13%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF--VSRSNGQKYSSVLC 152
HG+PLGG G+G IGRS RG+F WQ+ +P+ A QFSVF VS + + + LC
Sbjct: 41 HGMPLGGFGAGCIGRSSRGDFNVWQVDGGEHIFQPMPACQFSVFESVSERSASRQAYALC 100
Query: 153 PKTPEVLKDTTAAGIGSWDW-NLKGDK----STYHALYPRAWTVHEGEPDPELRIVCRQI 207
+ P+ G+ +W W +G++ TYHALYPR+W V+EG +L C Q
Sbjct: 101 TEAPD-------DGLQNWQWYPSRGEQHAHTGTYHALYPRSWFVYEGVFQSQL--TCEQF 151
Query: 208 SPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMN 267
SPI +NY+ESSYPV VF +T +N ++++ +W N G FT +S+ K+
Sbjct: 152 SPIWANNYQESSYPVGVFVWTAHNPTNAPMTLSIMLSWQNMTGW---FTNASKSSEVKIR 208
Query: 268 DGVHAV-------------------------LLHHRTSHQLPP--VTFALAAQETDGVHV 300
D V +L S + + +A + GV V
Sbjct: 209 DDGSPVYEYKSRWGESQGNFNRLVESEYVGCVLGRDNSEDVEEGDGQWCIATAKQPGVEV 268
Query: 301 SLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPP 360
+ N +G T +D+W + G+ L +++ + G I AIA T+ P
Sbjct: 269 FHHTRW----NPVG-TGEDIWTSFAQSGA---LPNIQDETPAIEGEQIAGAIAIRFTLQP 320
Query: 361 DSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
+ F L WD P F +G YYRR+T F+ A IAR A+ + SW+ QI+A
Sbjct: 321 GETRTIPFVLTWDFPVTEFAAGVKYYRRHTDFFDRTGKNAWAIARTALQAYQSWQSQIQA 380
Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
WQ+PIL+ LP W+ + LFNELY L +GG +W+
Sbjct: 381 WQQPILDKTDLPNWFKMALFNELYDLTSGGTLWS 414
>gi|75907319|ref|YP_321615.1| hypothetical protein Ava_1096 [Anabaena variabilis ATCC 29413]
gi|75701044|gb|ABA20720.1| Protein of unknown function DUF608 [Anabaena variabilis ATCC 29413]
Length = 804
Score = 225 bits (574), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 139/385 (36%), Positives = 206/385 (53%), Gaps = 28/385 (7%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
IGQF LE ++Y + + DV Y SF L+ LFP+++ ++ R FA A+ D +
Sbjct: 420 IGQFAVLECLDYRWYESLDVRLYGSFGLLQLFPELEKAVMRAFARAIPQGDDTPRVIGYY 479
Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
+ + RK GA PHD+G + W + N D WKDL FVLQVYRD +
Sbjct: 480 YTIGAESPIAVRKTPGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFL 539
Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TG D +F + W ++ + Y+ FD DGDG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 540 LTGADDVQFLRDCWDAIVETLDYVKTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGL 599
Query: 699 WV----AALQAASALAREVGDRGSEDYFLFK-------FQKAKVVY-EKLWNGSYFNYDN 746
W+ AA+ + L + D +++ L + K+ +Y EKLWNG Y+ D+
Sbjct: 600 WMAALAAAIAISDILLQNHQDSETKEKLLLQKSTYETWLTKSLPIYQEKLWNGKYYRLDS 659
Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
S S + ADQL GQ+YA L IV D+A SAL+ VY+ LK G+ GA NG+
Sbjct: 660 --ESGSDVVMADQLCGQFYANLLELPDIVPSDRAISALQTVYDACFLKFYDGQFGAANGV 717
Query: 807 LPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
PDG + + E+W+G+ + +AA ++ + D GF+ + ++ G F+
Sbjct: 718 RPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKDEGFRLTQAVVAQIYNN---GLQFR 774
Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAM 890
TPEA +R+ Y+R +AIWA+
Sbjct: 775 TPEAITAAGTFRASTYLRAMAIWAI 799
Score = 186 bits (472), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 141/398 (35%), Positives = 193/398 (48%), Gaps = 55/398 (13%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQI--FPRVCEDKPVLANQFSVFVSRSNGQKYSSVLC 152
HG+PLGG G+G IGRS RG+F W I + ++ P A QFSVF SNG +
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFQNFP--ACQFSVF--ESNGTSSQAYAL 95
Query: 153 PKTPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQIS 208
P T + SW W TYHALYPR+W V+E EL C Q S
Sbjct: 96 STQP-----TDDGSLKSWQWYPASTATQTTGTYHALYPRSWFVYENVFQAEL--TCEQFS 148
Query: 209 PIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMND 268
PI +NY+E+SYPV+VF + +N ++++ TW N VG FT + ++ D
Sbjct: 149 PIWANNYQETSYPVAVFVWQAHNPTNAPITLSIMLTWQNMVGW---FTNALKSPDVRVRD 205
Query: 269 ---------------GVHAVLLHHRTSHQ---LPPVTFALAAQETDGVHVSLC------P 304
G + L H L V A QE +G S C P
Sbjct: 206 DGSPVYEYQPRLGESGGNYNYLDESPQHLGCFLGRVGMAEPLQEGEG---SWCIVTRKHP 262
Query: 305 HFVI----SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPP 360
I N +G T +++W GS N ++TS SE +GAAIA T+ P
Sbjct: 263 QVEIFHHTRWNPVG-TGEEVWQSFAADGSL--ANYIDTSPVSE-NEQLGAAIAVRFTLQP 318
Query: 361 DSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
++ F ++WD P F +G YYRRYT F+G N A IA A+ ++ +W+ QI+A
Sbjct: 319 GETLEIPFVVSWDLPVTEFAAGVNYYRRYTDFFGKSGNHAWAIATIALEQYQTWQQQIQA 378
Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
WQ PIL LP+W+ + LFNELY L +GG +W+ +P
Sbjct: 379 WQDPILNRDDLPDWFKMALFNELYDLTSGGTLWSAATP 416
>gi|295829678|gb|ADG38508.1| AT3G24180-like protein [Neslia paniculata]
Length = 190
Score = 225 bits (574), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 119/190 (62%), Positives = 146/190 (76%), Gaps = 1/190 (0%)
Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
F VWP+V AM YM+QFDRD D +IENDGFPDQTYDTW+V G+SAY G LW+AALQAA
Sbjct: 1 FGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLAALQAA 60
Query: 707 SALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
+A+A ++GD+ + KF AK EK LWNGSYFNYD+ SS S SIQ DQLAGQWY
Sbjct: 61 AAMALQIGDKFFAELCKNKFLNAKAALEKKLWNGSYFNYDSGASSNSKSIQTDQLAGQWY 120
Query: 766 ARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSG 825
A + GL PI +E K RS ++K++++NV+K GG+ GAVNGM PDG+VD + MQSREIW+G
Sbjct: 121 AASSGLPPIFEESKIRSTMQKIFDFNVMKTKGGRMGAVNGMHPDGKVDETCMQSREIWTG 180
Query: 826 VTYAVAASMI 835
VTYA AA+MI
Sbjct: 181 VTYAAAATMI 190
>gi|295829670|gb|ADG38504.1| AT3G24180-like protein [Capsella grandiflora]
gi|295829672|gb|ADG38505.1| AT3G24180-like protein [Capsella grandiflora]
gi|295829674|gb|ADG38506.1| AT3G24180-like protein [Capsella grandiflora]
gi|295829676|gb|ADG38507.1| AT3G24180-like protein [Capsella grandiflora]
Length = 190
Score = 225 bits (573), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 119/190 (62%), Positives = 146/190 (76%), Gaps = 1/190 (0%)
Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
F VWP+V AM YM+QFDRD D +IENDGFPDQTYDTW+V G+SAY G LW+AALQAA
Sbjct: 1 FGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLAALQAA 60
Query: 707 SALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
+A+A ++GD+ + KF AK EK LWNGSYFNYD+ SS S SIQ DQLAGQWY
Sbjct: 61 AAMALQIGDKFFAELCKNKFLNAKAALEKKLWNGSYFNYDSGASSNSKSIQTDQLAGQWY 120
Query: 766 ARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSG 825
A + GL PI +E K +S ++K++++NV+K GGK GAVNGM PDG+VD + MQSREIW+G
Sbjct: 121 AASSGLPPIFEESKIKSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDTCMQSREIWTG 180
Query: 826 VTYAVAASMI 835
VTYA AA+MI
Sbjct: 181 VTYAAAATMI 190
>gi|170079447|ref|YP_001736085.1| hypothetical protein SYNPCC7002_A2862 [Synechococcus sp. PCC 7002]
gi|169887116|gb|ACB00830.1| Conserved hypothetical protein (DUF608) [Synechococcus sp. PCC
7002]
Length = 803
Score = 224 bits (572), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 146/404 (36%), Positives = 211/404 (52%), Gaps = 34/404 (8%)
Query: 518 GTNLLQDGEEN--IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAV 575
G +L EN +GQF LE ++Y + + DV Y SF L ML+P++ ++ FA A+
Sbjct: 404 GGSLWTAATENDPVGQFGVLECLDYCWYESLDVRMYGSFGLTMLWPRLDKAVLEAFARAI 463
Query: 576 MMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVL 633
D ++ + + +RKV GA PHD+G + PW N D WKDL FVL
Sbjct: 464 PTSDDTQRMIGYNKTMATRKVAGATPHDLGAPNEHPWEATNYTAYQDCNLWKDLGSDFVL 523
Query: 634 QVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
QVYRD V TG D +F + W SV A+ Y+ FD DGDG+ EN G PDQT+D W + GI
Sbjct: 524 QVYRDFVMTGAEDTEFLWSCWGSVLTALDYLKTFDLDGDGIPENGGAPDQTFDDWKLKGI 583
Query: 692 SAYSGGLWVAALQAASALAREVGDRGSEDYF--------LFKFQKAKVVYEK-------- 735
SAY GGLW+AAL+AA A+A+ + + S +F +F++A Y+
Sbjct: 584 SAYCGGLWIAALEAAIAMAKIL--QQSPSHFEAHQNQVSAAEFEQAIATYQTWLDQARPL 641
Query: 736 ----LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYN 791
LWNGSY+ D S S + ADQL GQ+Y + L + D+ + L +Y
Sbjct: 642 YQDILWNGSYYTLDT--GSNSRVVMADQLCGQYYTQLLSLPDVNPGDRTATTLATIYTAC 699
Query: 792 VLKVMGGKRGAVNGMLPDGRVDMSS-MQSREIWSGVTYAVAASMIHEDLADIGFQTACGI 850
K GG+ GA NG+ PDG + + E+W+G+ + +AA MI + + +
Sbjct: 700 FEKFHGGQFGAANGLNPDGTPEKENDTHPLEVWTGINFGIAALMIRNGMQTEALRMVEAV 759
Query: 851 YEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
E + G F+TPEA + +R+ Y+R + IWA+ AL
Sbjct: 760 VEQVYDN---GLQFRTPEAITANGTFRACHYLRAMGIWAIYDAL 800
Score = 189 bits (480), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 130/388 (33%), Positives = 178/388 (45%), Gaps = 47/388 (12%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W + + A QFSVF + +
Sbjct: 41 HGMPLGGFGAGCIGRSPRGDFNLWHLDAGEHTFRNAGACQFSVFEQPEGEAAQAYAMSTV 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
PE D T + SW W K TY ALYPR+W ++G EL C Q SPII N
Sbjct: 101 APE---DGT---LDSWQW-YPAKKGTYAALYPRSWYEYQGAFAAEL--TCEQFSPIIAQN 151
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG------------------------ 250
Y+E+SYPV+VF +T +N ++++ TW N VG
Sbjct: 152 YQETSYPVAVFEWTAHNPTDKPITVSIMMTWQNMVGWFTKAQKTPEVVLRDDGSPVYEYQ 211
Query: 251 ---GDSEFT-GQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHF 306
GDS Q + ++ ++ V H S + FA T L +
Sbjct: 212 SRWGDSTGNLNQWVQDRYRVGCLLNRVRPHADLSEGEGQLCFATVTNPT------LEVFY 265
Query: 307 VISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQV 366
N +G +D+W +GS L +E + PG I AIA T+ P ++
Sbjct: 266 HNRWNPVG-DGRDVWDYFAGNGS---LPDIEDETPAAPGEQIAGAIAVRFTLRPGKTKKI 321
Query: 367 TFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
F LAWD P + F TY+RRYT F+G N + R A+ W+ +I AWQ+PIL
Sbjct: 322 PFFLAWDLPVMEFAPDITYFRRYTDFFGRSGNNVWTMIRTAMKHSDLWKERIAAWQQPIL 381
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWT 454
LP+W + LFNELY L GG++WT
Sbjct: 382 GRDDLPDWLKMALFNELYLLTQGGSLWT 409
>gi|345291087|gb|AEN82035.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291089|gb|AEN82036.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291091|gb|AEN82037.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291093|gb|AEN82038.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291095|gb|AEN82039.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291097|gb|AEN82040.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291099|gb|AEN82041.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291101|gb|AEN82042.1| AT3G24180-like protein, partial [Capsella rubella]
Length = 191
Score = 224 bits (572), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 119/190 (62%), Positives = 146/190 (76%), Gaps = 1/190 (0%)
Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
F VWP+V AM YM+QFDRD D +IENDGFPDQTYDTW+V G+SAY G LW+AALQAA
Sbjct: 1 FGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLAALQAA 60
Query: 707 SALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
+A+A ++GD+ + KF AK EK LWNGSYFNYD+ SS S SIQ DQLAGQWY
Sbjct: 61 AAMALQIGDKFFAELCKNKFLNAKAALEKKLWNGSYFNYDSGASSNSKSIQTDQLAGQWY 120
Query: 766 ARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSG 825
A + GL PI +E K +S ++K++++NV+K GGK GAVNGM PDG+VD + MQSREIW+G
Sbjct: 121 AASSGLPPIFEESKIKSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDTCMQSREIWTG 180
Query: 826 VTYAVAASMI 835
VTYA AA+MI
Sbjct: 181 VTYAAAATMI 190
>gi|282897193|ref|ZP_06305195.1| Protein of unknown function DUF608 [Raphidiopsis brookii D9]
gi|281197845|gb|EFA72739.1| Protein of unknown function DUF608 [Raphidiopsis brookii D9]
Length = 798
Score = 224 bits (570), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 152/402 (37%), Positives = 218/402 (54%), Gaps = 42/402 (10%)
Query: 518 GTNLLQDGEEN--IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAV 575
G L G E +GQF LE ++Y + + DV Y SF L++LFP+++ ++ R FA A+
Sbjct: 405 GGTLWSAGTEKDPMGQFAVLECLDYRWYESLDVRLYGSFGLLILFPELEKAVIRAFARAI 464
Query: 576 MMHD--------------PSKMKLLDDGQWVSRKVLGAVPHDIGICDPW-FEVNAYCLY- 619
D PS + L RKV GA PHD+G + W +E Y Y
Sbjct: 465 PQSDDRSRIIGYYLTINSPSPVAL--------RKVAGATPHDLGAPNEWVWEQTNYTSYQ 516
Query: 620 DTARWKDLNPKFVLQVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDG 677
D WKDL FVLQVYRD + TG D +F W + + + Y+ QFD D DG+ EN G
Sbjct: 517 DCNLWKDLGCDFVLQVYRDFLFTGGNDIEFLVDCWYGIVLTLDYLKQFDIDADGIPENSG 576
Query: 678 FPDQTYDTWSVSGISAYSGGLWVAALQAASALAR------EVGDRGSED--YFLFKFQKA 729
PDQT+D W +SG+SAY G LW+AAL+AA A+ E+ + G + Y + Q
Sbjct: 577 APDQTFDDWRLSGVSAYCGALWLAALEAAIAICDVLMNHPELSNVGEQRSIYENWLNQSL 636
Query: 730 KVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
V +KLWNG ++ D+ S S+ + ADQL+GQ+YAR L IV +D+A SAL +Y+
Sbjct: 637 PVYQQKLWNGKFYRLDS--ESGSNVVMADQLSGQFYARLLNLPDIVPQDRALSALTTIYD 694
Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTAC 848
LK GK GA NG+LPDG + + E+W+G+ + +AA ++ + D G +
Sbjct: 695 ACFLKFQDGKFGAANGVLPDGSPENPQATHPLEVWTGINFGLAAFLLQMGMKDQGLRLTQ 754
Query: 849 GIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
+ + G F+TPEA +R+ Y+RP+AIWA+
Sbjct: 755 AVVRQIYDH---GLQFRTPEAITAAGTFRASTYLRPMAIWAI 793
Score = 209 bits (531), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 138/400 (34%), Positives = 197/400 (49%), Gaps = 61/400 (15%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W I K V QFS+F S GQ Y+ L +
Sbjct: 38 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHVFKNVPGCQFSIFTS---GQAYA--LSTQ 92
Query: 155 TPEVLKDTTAAGIGSWDW---------NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCR 205
PE D T + +W W K D TYHALYPR+W V+E L+ C
Sbjct: 93 PPE---DNT---LQTWQWYPNVVHSSTGTKTDGGTYHALYPRSWFVYENLF--PLQFTCE 144
Query: 206 QISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTK 265
Q SP+ HNY+E+SYPV+VF + I+N +++ +W N G FT + + K
Sbjct: 145 QFSPVWAHNYRETSYPVAVFLWNIHNPTNQPITASIMLSWENMTGW---FTNALKSPQVK 201
Query: 266 M--------------------------NDGVHAVLLHHRTSH--QLPPVTFALAAQETDG 297
+ ND +L T+H Q T+ +A +
Sbjct: 202 IRDDGSPVYEYQPRWGESQGNYNWLAENDQYLGCVLGRATNHPVQEGDGTWCIATVKNPQ 261
Query: 298 VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVT 357
V + F GN +++W + ++GS L++ + ++ S +GAA+A T
Sbjct: 262 VELFYHCQFNPVGN-----GEELWRDFSQNGS---LSNYQDETPADVNSRLGAAVAVRFT 313
Query: 358 VPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQ 417
+ P V F L+WD P F +G YYRRYT F+GT + A IA A+ E +W
Sbjct: 314 ISPGETFTVPFVLSWDFPVTEFAAGVNYYRRYTDFFGTTGDNAWQIATCALKECYNWRSH 373
Query: 418 IEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
IE+WQ+PILE + LP W+ + LFNELY L +GG +W+ G+
Sbjct: 374 IESWQKPILEREDLPSWFKMALFNELYDLTSGGTLWSAGT 413
>gi|254431466|ref|ZP_05045169.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
gi|197625919|gb|EDY38478.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
Length = 837
Score = 223 bits (568), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 138/387 (35%), Positives = 207/387 (53%), Gaps = 25/387 (6%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
E+ +G+F LE ++Y + + DV Y SFAL+ L+P++ ++ R FA A+ DP+ +
Sbjct: 440 EDPVGRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVLRSFARAIPAADPTPRPI 499
Query: 586 ---LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
G+ RKV GA PHD+G + PW N D WKDL FVLQV+R
Sbjct: 500 GWYFTQGKGRVEAPRKVAGATPHDLGAPNERPWDATNYTAYQDCNLWKDLASDFVLQVWR 559
Query: 638 D--VVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAY 694
+ TG D +F WP+ A+ Y+ FD + DG+ +N G PDQT+D W + G+SAY
Sbjct: 560 TFRLAPTGEDLRFLADCWPAAVEALRYLKGFDANSDGLPDNGGAPDQTFDDWPLKGVSAY 619
Query: 695 SGGLWVAALQAASALAR----EVG-DRGSEDY-FLFKFQKAKVVYEK-LWNGSYFNYDNS 747
G LW+AAL+AA A+ + E+G D S+ F ++++ +++ LWNG Y+N D
Sbjct: 620 CGALWIAALEAALAMGQRLQLELGLDTSSQQRDFSSWLEQSRANFDRLLWNGEYYNID-- 677
Query: 748 GSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGML 807
S + + ADQL G +YAR L P+V E++ARSAL+ + GG G NG+
Sbjct: 678 ADSGTPVVMADQLCGDFYARLLELPPVVAEERARSALRAIREACFEGFQGGSLGVANGLR 737
Query: 808 PDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQT 866
DG +D E+W+G+ + +AA D + + G G F+T
Sbjct: 738 RDGTPLDPDGTHPLEVWTGINFGLAAYYRLMGETDTALAITGAVVHQVYGG---GLQFRT 794
Query: 867 PEAWNTDDQYRSLCYMRPLAIWAMQWA 893
PEA + +R+ Y+R +AIWA+ WA
Sbjct: 795 PEAITAVNTFRACHYLRAMAIWAL-WA 820
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 118/413 (28%), Positives = 185/413 (44%), Gaps = 49/413 (11%)
Query: 85 PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
P+ R+ ++ +HG+PLGG G+G IGR+ G F W + + QF++F
Sbjct: 37 PYTVRYASNLDDGPNHGMPLGGFGAGCIGRAPDGSFNLWNLDGGEHWFGTLPDCQFALFE 96
Query: 140 SRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGE 195
NG + ++ + + + +W W TY A YP +WT G
Sbjct: 97 RDGNGSRAHALAVAPEADASRPDAGPPLSAWSWYPASTADTSTGTYAARYPLSWTSFSGV 156
Query: 196 PDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----- 250
R+ C SPI+P +Y+ SSYPV+VFT+++ N + +++LL +W N+ G
Sbjct: 157 F--SARVGCEAFSPILPGDYERSSYPVAVFTWSLSNPTRRPLELSLLLSWRNTTGWFTNT 214
Query: 251 -----------GDSEFT-----GQHYNSKTK-MNDG-VHAVLLHHRTSHQLPP------- 285
G E G+ + + ++DG + VLL S +
Sbjct: 215 DPAAAVHFRDDGSPEHNYVPAIGRSEGQRNRWVDDGPLKGVLLEGPISDPIAEGQGQWCI 274
Query: 286 VTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPG 345
T A G+ + C + +G+ ++W GS N+ S ++P
Sbjct: 275 ATDAGLEARHPGLRIHRCSRWNPAGD-----GAELWESFARDGSIPDSNNDRHSGANDPA 329
Query: 346 SSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIAR 405
S AA+A + + P + + ++WD P F +G RRYT +G AA IA
Sbjct: 330 S---AALAVQLRLEPGASIDIPLVISWDLPVTAFATGSRALRRYTDHFGADGTNAAAIAA 386
Query: 406 DAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
+A+ SW QI AWQ+P+LE LPE + LFNELY L +GG +WT SP
Sbjct: 387 EALGSWPSWREQIAAWQQPVLERTDLPEPLRMALFNELYDLASGGTLWTAASP 439
>gi|434391596|ref|YP_007126543.1| Glucosylceramidase [Gloeocapsa sp. PCC 7428]
gi|428263437|gb|AFZ29383.1| Glucosylceramidase [Gloeocapsa sp. PCC 7428]
Length = 809
Score = 223 bits (568), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 145/405 (35%), Positives = 210/405 (51%), Gaps = 39/405 (9%)
Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM 583
D + +GQF LE +Y + + DV Y SFAL+ML+ K++ S+ R FA A+ D +
Sbjct: 410 DDRDPVGQFAVLECFDYRWYESLDVRLYGSFALLMLWSKLEKSVIRAFARAIGQRD-DRT 468
Query: 584 KLLD-------DGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQ 634
+++ + RKV GA PHD+G + PW N D WKDL+ FVLQ
Sbjct: 469 RVIGYYYTQGLESPTALRKVAGATPHDLGAPNEHPWEATNYTSYQDCNLWKDLSCDFVLQ 528
Query: 635 VYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGIS 692
VYRD V TG D + W +V + Y+ FD D DG+ EN G PDQT+D W + G+S
Sbjct: 529 VYRDFVLTGGTDWELLWDCWSAVVQTLTYLKTFDLDEDGIPENSGAPDQTFDDWRLQGVS 588
Query: 693 AYSGGLWVAAL----------------QAASALAREVGDRG----SEDYFLFKFQKAKVV 732
AY GGLW+AAL AS + D + D F +++ +
Sbjct: 589 AYCGGLWLAALEAAIAIGKTLLSYPDDHPASKILASAPDYPPIPETLDVFQSWLMRSRPI 648
Query: 733 Y-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYN 791
Y EKLWNG Y+ D+ S S + ADQL GQ+YAR GL IV + A ALK VY+
Sbjct: 649 YQEKLWNGQYYRLDS--KSGSDVVMADQLCGQFYARLLGLPDIVPPECATQALKTVYDAC 706
Query: 792 VLKVMGGKRGAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGI 850
+K G+ GA NG+ PDG + ++ E+W+G+ + +AA ++ + ++ +
Sbjct: 707 FVKFHNGQFGAANGLKPDGLPENPNATHPLEVWTGINFGLAAFLMQMGMKSEAWRITQAV 766
Query: 851 YEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALT 895
+ + G F+TPEA +R+ Y+RP+AIWA+ LT
Sbjct: 767 VQQVYDN---GLQFRTPEAITAKGTFRACHYLRPMAIWAIYRVLT 808
Score = 172 bits (435), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 131/414 (31%), Positives = 196/414 (47%), Gaps = 52/414 (12%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGR+ RG+F W I K + A QFS+F +++ + + LC
Sbjct: 40 HGMPLGGFGAGCIGRASRGDFNLWHIDGGEHTFKSLPACQFSIFEQQASTSQ-AFALCTL 98
Query: 155 TPEVLKDTTAAGIGSWDWNLK-GDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPH 213
P D T + +W W + + YHALYPR+W V+E L C Q SPI+P
Sbjct: 99 PP---ADGT---LTTWQWYPQHCENGVYHALYPRSWFVYENVFQAHLS--CEQFSPILPE 150
Query: 214 NYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG---- 269
NY+E+SYPV+VF + +N ++++ TW N+VG F N + +M D
Sbjct: 151 NYQETSYPVAVFVWRAHNPTNAPLTLSIMLTWQNTVGW---FQNAIKNPEIRMRDDGSPV 207
Query: 270 -----------------------VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHF 306
VL + Q +A+A + + V +
Sbjct: 208 YEYEPCLGNNLGNFNRLIAKDNLCGCVLERVASDLQDGIGQWAIATPKQPNIEVFYHTRW 267
Query: 307 VISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQV 366
SG+ ++WH GS L + E ++ + AAIA T+ P ++
Sbjct: 268 NPSGD-----GAEIWHSFARDGS---LPNTEDKTSASEQEQLAAAIAVRFTLQPGETREL 319
Query: 367 TFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
F++AWD P F G Y+RRYT F+G + A IA+ A+ + SW+ QI+AWQ+PIL
Sbjct: 320 PFAIAWDFPITEFAPGVKYFRRYTDFFGRGGDNAVAIAQTALQNYQSWQQQIQAWQQPIL 379
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWT--DGSPPVHSLVTIGHRKFSLDWSQS 478
+ LP + + LFNELY L +GG +W+ D PV + F W +S
Sbjct: 380 DRDDLPSVFKMALFNELYDLTSGGTLWSAADDRDPVGQFAVL--ECFDYRWYES 431
>gi|428307493|ref|YP_007144318.1| glucosylceramidase [Crinalium epipsammum PCC 9333]
gi|428249028|gb|AFZ14808.1| Glucosylceramidase [Crinalium epipsammum PCC 9333]
Length = 811
Score = 222 bits (565), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 146/391 (37%), Positives = 211/391 (53%), Gaps = 32/391 (8%)
Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL 586
+ IGQF LE ++Y + + DV Y SFAL ML+PK+ ++ +A A+ D + +
Sbjct: 421 DGIGQFGVLECLDYRWYESLDVRLYGSFALAMLWPKLDKAVLEAYARAIPTSDNTPRIIG 480
Query: 587 DDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG- 643
+ RK A PHD+G + PW + N D WKDL FVLQVYRD + TG
Sbjct: 481 YNQASAIRKAANATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDYLLTGA 540
Query: 644 -DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAA 702
D +F W ++ +AY+ +FD DGDG+ EN G PDQT+D W + G+SAY GGLW+AA
Sbjct: 541 TDIEFLSECWSAIVKTLAYLKEFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAA 600
Query: 703 LQAASALAREVGDRGSE----DY------FLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQ 751
L+AA A+ + + G E D+ + +++K +Y EKLWNG Y+ D+ S
Sbjct: 601 LEAAIAIGEILINHGVELEDGDFESAIATYKSWLEQSKPLYQEKLWNGQYYQLDS--ESG 658
Query: 752 SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK-----------VMGGKR 800
S+ + ADQL GQ+YAR L IV D SALK VY+ LK +
Sbjct: 659 SNVVMADQLCGQFYARLLNLPDIVPNDCVESALKTVYDSCFLKFNQKLSQHSESLTPTFI 718
Query: 801 GAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTG 859
GA NG+L DG + + E+W+G+ + +AA MI + D FQ + + ++
Sbjct: 719 GAANGVLADGSPENPQATHPLEVWTGINFGLAAFMIQMGMKDEAFQLTEAVVQQIYNN-- 776
Query: 860 LGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
G F+TPEA +R+ Y+R +AIWA+
Sbjct: 777 -GLQFRTPEAITAAGTFRASHYLRAMAIWAI 806
Score = 210 bits (535), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 139/393 (35%), Positives = 190/393 (48%), Gaps = 45/393 (11%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G +GRS RG+F W I K + A QFSVF N + +C +
Sbjct: 41 HGMPLGGFGAGCLGRSSRGDFNLWHIDGGEHTFKSLPACQFSVFEQAGNSAPQAYAMCTE 100
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKST-----YHALYPRAWTVHEGEPDPELRIVCRQISP 209
PE D T + +W W K K T YHALYPR+W V++G +L C Q SP
Sbjct: 101 PPE---DGT---LSTWQWYPKTGKGTESSGTYHALYPRSWFVYDGVFQAQL--TCEQFSP 152
Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
I NY+ESSYPV VF +T +N ++++ TW NSVG FT + K+ D
Sbjct: 153 IWADNYQESSYPVGVFEWTAHNPTDQPITLSIMLTWQNSVGW---FTNAIKTPQVKVRDD 209
Query: 270 VHAV------------------LLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGN 311
V + H R L V QE +G I+
Sbjct: 210 GSPVYEYQPRWKDSTGNLNQWIVDHFRAGCLLDRVRIHDEVQEGEGQWAIA----TITNP 265
Query: 312 SLGLTAKDMWH------EIKEHGSFD-RLNSMETSVTSEPGSSIGAAIAASVTVPPDSEG 364
S+ + W+ EI ++ +FD L E ++PG I AA+ TV P
Sbjct: 266 SVEVFYHTRWNPAGSGEEIWDNFAFDGSLRDKEDETPADPGEQIAAAMTIRFTVRPGRTR 325
Query: 365 QVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRP 424
++ F LAWD P F +G YYRRYT F+G N A +I R A+ +W+ QI++WQ
Sbjct: 326 KIPFVLAWDFPVTEFATGVNYYRRYTDFFGRTGNNAWSIVRTALKHDDTWKEQIQSWQEL 385
Query: 425 ILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
L++ LP+W+ + LFNELY L GG +WT S
Sbjct: 386 TLKNANLPDWFKMALFNELYILADGGTLWTAAS 418
>gi|282900195|ref|ZP_06308151.1| protein of unknown function DUF608 [Cylindrospermopsis raciborskii
CS-505]
gi|281194944|gb|EFA69885.1| protein of unknown function DUF608 [Cylindrospermopsis raciborskii
CS-505]
Length = 789
Score = 222 bits (565), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 152/402 (37%), Positives = 218/402 (54%), Gaps = 42/402 (10%)
Query: 518 GTNLLQDGEEN--IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAV 575
G L G E +GQF LE ++Y + + DV Y SF L++LFP+++ ++ R FA A+
Sbjct: 396 GGTLWSAGTEKDPMGQFAVLECLDYRWYESLDVRLYGSFGLLILFPELEKAVIRAFARAI 455
Query: 576 MMHD--------------PSKMKLLDDGQWVSRKVLGAVPHDIGICDPW-FEVNAYCLY- 619
D PS M L RKV GA PHD+G + W +E Y Y
Sbjct: 456 PQSDDRSRIIGYYLTINSPSPMAL--------RKVAGATPHDLGAPNEWVWEQTNYTSYQ 507
Query: 620 DTARWKDLNPKFVLQVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDG 677
D WKDL FVLQVYRD + TG D +F W + + + Y+ QFD DGDG+ EN G
Sbjct: 508 DCNLWKDLGCDFVLQVYRDFLFTGANDIEFLVDCWHGMVLTLDYLKQFDMDGDGIPENSG 567
Query: 678 FPDQTYDTWSVSGISAYSGGLWVAALQAASALAR------EVGDRGSED--YFLFKFQKA 729
PDQT+D W +SG+SAY G LW+AAL+AA A+ E+ + G + Y + Q
Sbjct: 568 APDQTFDDWRLSGVSAYCGALWLAALEAAIAICDVLINRPELPNVGEQKSIYEHWLNQSL 627
Query: 730 KVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
V +KLWNG ++ D+ S S+ + ADQL+GQ+YAR L IV +D+A SAL +Y+
Sbjct: 628 PVYQQKLWNGKFYRLDS--ESGSNVVMADQLSGQFYARLLNLPDIVPKDRALSALTTIYD 685
Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTAC 848
LK GK GA NG+L DG + + E+W+G+ + +AA ++ + + G +
Sbjct: 686 ACFLKFQDGKFGAANGVLLDGSPENPQATHPLEVWTGINFGLAAFLLQMGMKEQGLRLTE 745
Query: 849 GIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
+ + G F+TPEA +R+ Y+RP+AIWA+
Sbjct: 746 AVVRQVYEH---GLQFRTPEAITAAGTFRASTYLRPMAIWAV 784
Score = 215 bits (547), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 138/391 (35%), Positives = 196/391 (50%), Gaps = 52/391 (13%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W I K V QFS+F S GQ Y+ L +
Sbjct: 38 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHVFKNVPGCQFSIFTS---GQAYA--LSTQ 92
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
PE D T + +W W D TYHALYPR+W V+E L+ C Q SP+ HN
Sbjct: 93 PPE---DNT---LQTWQWYPNTDGGTYHALYPRSWFVYENVF--PLQFTCEQFSPVWAHN 144
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKM-------- 266
Y+E+SYPV+VF + I+N +++ +W N G FT + + K+
Sbjct: 145 YRETSYPVAVFLWNIHNPTNQPLTASIMLSWENMTGW---FTNALKSPQVKIRDDGSPVY 201
Query: 267 ------------------NDGVHAVLLHHRTSH--QLPPVTFALAAQETDGVHVSLCPHF 306
ND +L T H Q T+ +A + V + F
Sbjct: 202 EYQPRWGESQGNYNWLAENDQYLGCVLGRATDHPIQEGDGTWCIATVKNPQVELFYHCQF 261
Query: 307 VISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQV 366
GN +++W + ++GS L++ + ++ S +GAA+A T+PP V
Sbjct: 262 NPLGN-----GEEIWRDFSQNGS---LSNYQDETPADVNSRLGAAVAVRFTLPPGETLTV 313
Query: 367 TFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
F L+WD P F +G YYRRYT F+GT + A IA A+ E +W IE+WQ+PIL
Sbjct: 314 PFVLSWDFPVTEFAAGVNYYRRYTDFFGTTGDNAWQIATCALKECYNWRSHIESWQKPIL 373
Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
E + LP W+ + LFNELY L +GG +W+ G+
Sbjct: 374 EREDLPSWFKMALFNELYDLTSGGTLWSAGT 404
>gi|428299644|ref|YP_007137950.1| glucosylceramidase [Calothrix sp. PCC 6303]
gi|428236188|gb|AFZ01978.1| Glucosylceramidase [Calothrix sp. PCC 6303]
Length = 804
Score = 221 bits (563), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 135/386 (34%), Positives = 204/386 (52%), Gaps = 27/386 (6%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
++ +GQF LE ++Y + + DV Y SF L+MLFP+++ ++ R FA A+ D K ++
Sbjct: 415 KDPVGQFAVLECLDYRWYESLDVRLYGSFGLLMLFPELEKAVMRAFARAIPASD-EKTRV 473
Query: 586 LD-------DGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVY 636
+ + RK GA PHD+G + W + N D WKDL FVLQVY
Sbjct: 474 IGYYYTIGAESPLAIRKAAGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLGSDFVLQVY 533
Query: 637 RDVVATGDK--KFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAY 694
RD V TG K +F W ++ + + Y+ FD+D DG+ EN G PDQT+D W + G+SAY
Sbjct: 534 RDFVFTGSKDIEFLAECWDAIALTLEYLKAFDKDHDGIPENSGAPDQTFDDWKLQGVSAY 593
Query: 695 SGGLW----VAALQAASALAREVGDR----GSEDYFLFKFQKAKVVY-EKLWNGSYFNYD 745
GGLW AA+ L GD+ + + K++ +Y +KLWNG Y+ D
Sbjct: 594 CGGLWLAALEAAIAICDILLENHGDQEKLTAQKIIYTDWLAKSRPIYQQKLWNGQYYRLD 653
Query: 746 NSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNG 805
+ S S + ADQL GQ+YAR L IV D A SA++ VY + GK GA NG
Sbjct: 654 S--ESGSDVVMADQLCGQFYARLLKLADIVPHDCAISAVRTVYEACFINFNDGKFGAANG 711
Query: 806 MLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAF 864
+ PDG + + E+W+G+ + +AA ++ + D ++ + + + G F
Sbjct: 712 LRPDGSPENPKATHPLEVWTGINFGLAAFLMQMGMQDEAWKLTEVVVQQIYDN---GLQF 768
Query: 865 QTPEAWNTDDQYRSLCYMRPLAIWAM 890
+TPEA +R+ Y+R +AIW +
Sbjct: 769 RTPEAITATGTFRACTYLRAMAIWGI 794
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 128/387 (33%), Positives = 182/387 (47%), Gaps = 35/387 (9%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS+ G+F W I K + A QFS+F S SS
Sbjct: 40 HGMPLGGFGAGCIGRSHGGDFNFWHIDGGEHVFKNIPACQFSLFESVGE----SSQAYAL 95
Query: 155 TPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
D T + SW + + + + TYHALYPR+W + G + I C+Q SPI
Sbjct: 96 ATAAPADGT---LQSWKYYPQSSPEQNTGTYHALYPRSWFEYGGVY--QANITCQQFSPI 150
Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKM-NDG 269
NY+ESSYPV+VF + ++N ++++ TW N VG FT + + + +DG
Sbjct: 151 WADNYQESSYPVAVFLWNLHNPTNAPITLSIMLTWENMVGW---FTNTSKSPEVIVRDDG 207
Query: 270 VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVIS---GNSLGLTAKDMWHEIKE 326
R + T GV +S V+ G LT+K EI
Sbjct: 208 SPVYEYQPRLGQSQGNFNEIVDNGNTIGVVLSGISGEVVQEGDGQWCFLTSKQAGVEIFS 267
Query: 327 HG-------------SFDRLNSMETSVTSEPGSS--IGAAIAASVTVPPDSEGQVTFSLA 371
H SF S+ + P + + A+A +T+ P +V F L
Sbjct: 268 HSRWNPTGNGEEIWQSFAADGSLSNYQDATPATDERLAGALAVRLTLQPGETLEVPFVLT 327
Query: 372 WDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRL 431
WD P F +G Y RRYT F+G++ A IA A+ ++ W +I+ WQ+PIL L
Sbjct: 328 WDFPVTEFAAGVNYNRRYTDFFGSNGKNALQIAETALEQYRDWLEKIQTWQKPILNRHDL 387
Query: 432 PEWYPITLFNELYYLNAGGAVWTDGSP 458
P+W + LFNELY L AGG +W+ SP
Sbjct: 388 PDWMKMALFNELYDLTAGGTLWSAASP 414
>gi|113477064|ref|YP_723125.1| hypothetical protein Tery_3568 [Trichodesmium erythraeum IMS101]
gi|110168112|gb|ABG52652.1| protein of unknown function DUF608 [Trichodesmium erythraeum
IMS101]
Length = 816
Score = 221 bits (562), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 150/396 (37%), Positives = 216/396 (54%), Gaps = 44/396 (11%)
Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDG 589
GQF LE ++Y + + DV Y SF L++L+P+++ S+ FA AV D + + +
Sbjct: 422 GQFAVLECLDYRWYESLDVRLYGSFGLLILWPELEKSVLVAFARAVSTADDTLRIIGYNQ 481
Query: 590 QWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--DK 645
RKV GA PHD+G + PW N D +WKDL+ FVLQVYRD + TG D
Sbjct: 482 VSAVRKVAGATPHDLGAPNEHPWEMTNYTSYQDCNQWKDLSSDFVLQVYRDFLLTGADDY 541
Query: 646 KFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQA 705
+F W ++ +AY+ FD+D DG+ EN+G PDQT+D W + G+SAY GGLW+AAL+A
Sbjct: 542 EFLWQSWSAITETLAYLKGFDKDNDGIPENEGAPDQTFDDWQLRGVSAYCGGLWLAALEA 601
Query: 706 ASALA-------REVG-----------DRGSED-----YFLFK--FQKAKVVY-EKLWNG 739
A A+ RE+ D+ S D +L++ +K +Y EKLWNG
Sbjct: 602 AIAIGKVLIEHPREIPYYPPKGFYSEVDKNSVDAINNQVYLYQGWLKKGLPIYQEKLWNG 661
Query: 740 SYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGK 799
Y+ D+ S S + ADQL+GQ+YA+ L IV + A SALK VYN GK
Sbjct: 662 EYYRLDS--ESNSEVVMADQLSGQFYAKLLNLEDIVPAECALSALKTVYNSCFKNFHNGK 719
Query: 800 RGAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIH----EDLADIGFQTACGIYEAA 854
GA NG+LPDG + ++ E+W+G+ + +AA M+ ++ +I IYE
Sbjct: 720 FGAANGVLPDGSPENPNATHPLEVWTGINFGLAAFMVQIGMKKEALEITEVVVGQIYEN- 778
Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
G F+TPEA +R+ Y+R +AIWA+
Sbjct: 779 ------GLQFRTPEAITVMGTFRASHYLRAMAIWAI 808
Score = 201 bits (512), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 129/394 (32%), Positives = 196/394 (49%), Gaps = 49/394 (12%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQI--FPRVCEDKPVLANQFSVFVSRSNGQKYSSVLC 152
HG+PLGG G+G+ GRS RG+F W + + ++ P A QFSVF + GQK + L
Sbjct: 41 HGMPLGGFGAGATGRSPRGDFNLWHLDGGEHIYQNLP--ACQFSVF-EETKGQKQAYALS 97
Query: 153 PKTPEVLKDTTAAGIGSWDWNLK---GDKS-TYHALYPRAWTVHEGEPDPELRIVCRQIS 208
+ P T + +W W + G K+ TYHALYPR+W V+E +L C Q S
Sbjct: 98 TELP------TDGSLSAWQWYPREKAGLKTGTYHALYPRSWFVYENVFTAQLS--CEQFS 149
Query: 209 PIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMND 268
PI+ NY+E+SYP+++F +T +N + ++++ +W N+VG FT + ++ D
Sbjct: 150 PILAGNYQETSYPIAIFEWTAHNPTDEAITLSIMLSWQNTVGW---FTNSVKTPEVRVRD 206
Query: 269 GVHAVLLH------------------HRTSHQLPPVTFALAAQETDG-------VHVSLC 303
V + HR + ++ A E +G ++ +
Sbjct: 207 DGSPVYEYKPRWGESTDNFNLLVEDFHRIGCTMTKLSIADEPAEGEGQMAIATFINAGME 266
Query: 304 PHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSE 363
+ N G T +D+WH GS L E + + G IG AI+ T+ P
Sbjct: 267 VFYHTRWNPTG-TGEDIWHYFALDGS---LIDEENELPATEGEQIGVAISVRFTIRPGKN 322
Query: 364 GQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQR 423
++ F L WD P +F +G +YYRRYT FYG + A ++ R A+ + +W IEAWQ
Sbjct: 323 RKIPFFLVWDLPVTDFGNGVSYYRRYTDFYGRNGKNAWSMIRTAMKHYQTWRENIEAWQN 382
Query: 424 PILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
PIL+ + LP W + L NELY L +GG +W S
Sbjct: 383 PILQREDLPNWLKMALLNELYDLTSGGTIWAAAS 416
>gi|209526014|ref|ZP_03274547.1| protein of unknown function DUF608 [Arthrospira maxima CS-328]
gi|376002240|ref|ZP_09780079.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423067988|ref|ZP_17056778.1| hypothetical protein SPLC1_S630380 [Arthrospira platensis C1]
gi|209493540|gb|EDZ93862.1| protein of unknown function DUF608 [Arthrospira maxima CS-328]
gi|375329367|emb|CCE15832.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406710526|gb|EKD05737.1| hypothetical protein SPLC1_S630380 [Arthrospira platensis C1]
Length = 799
Score = 220 bits (560), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 141/389 (36%), Positives = 210/389 (53%), Gaps = 20/389 (5%)
Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM 583
D ++ G+F LE ++Y + + DV Y SF L+M +P+++ S+ R FA A+ D +
Sbjct: 416 DEDQPWGRFAVLECLDYRWYESLDVRLYGSFGLLMSWPELEKSVIRTFAKAIATADHTPR 475
Query: 584 KLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVA 641
+ + RK GA PHD+G + PW + N D WKDL FVLQVYRD +
Sbjct: 476 IIGYNQASAIRKAAGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDYLL 535
Query: 642 TG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
TG D +F WP++ + Y+ FDRD DG+ EN G PDQT+D W + GISAY GGLW
Sbjct: 536 TGADDIQFLVECWPAIVETLDYLKTFDRDQDGIPENGGAPDQTFDDWRLVGISAYCGGLW 595
Query: 700 VAALQAASALA-------REVGDRGSEDYFLFK--FQKAKVVYEK-LWNGSYFNYDNSGS 749
+AAL+AA +A R++ ++ + + AK +Y++ LWNGSY+N D+
Sbjct: 596 LAALEAAIEIANILLSHNRDLTPDTTKAIATWTNWLETAKPLYDQTLWNGSYYNLDS--Q 653
Query: 750 SQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD 809
S S + ADQL GQ+YA GL IV +A+ AL+ +Y GK GA NG+
Sbjct: 654 SGSDVVMADQLCGQFYAALLGLPDIVPPHRAQVALQTIYQACFQNFHNGKFGAANGVRTT 713
Query: 810 GR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
G ++ + E+W+G+ + + A I + + FQ A + + G F+TPE
Sbjct: 714 GEPMNPNDTHPLEVWTGINFGLGAFFIQMGMKEEAFQLAEAVITQVYQN---GLQFRTPE 770
Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
A +R+ Y+R +AIWAM + + P
Sbjct: 771 AITAAGTFRASHYLRAMAIWAMYYQIQHP 799
Score = 187 bits (476), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 122/393 (31%), Positives = 185/393 (47%), Gaps = 53/393 (13%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS+RG+F W + +P+ QFSVF +G++ + L +
Sbjct: 41 HGMPLGGFGAGCIGRSHRGDFNLWHLDGGEHIFRPLPGCQFSVF-EEIDGKRQAYALSTQ 99
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDK----STYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
P+ + +W W D+ TY+ALYPR+W V+E + ++ C Q SP+
Sbjct: 100 PPQ------DGSLSAWSWYPTSDQHTQTGTYYALYPRSWFVYENVF--QTQLTCEQFSPV 151
Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGV 270
NY+E+SYP++VF + +N +++L TW N++G FT N K+ D
Sbjct: 152 WGGNYQETSYPIAVFEWIAHNPTDKLIILSILLTWENTIGW---FTNSLENPTVKVRDDG 208
Query: 271 HAVLLH------------------HRTSHQLPPVT-----------FALAAQETDGVHVS 301
V + HR + + A+A V V
Sbjct: 209 SPVYEYQPRWGDSQDNLNRFVEDFHRVGCVMTRLNINDQPREGEGQIAIATISNPAVEVY 268
Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
+ +GN D+W + GS + +V G IGAA+A + P
Sbjct: 269 YHNRWNPNGNGY-----DIWEYFSKDGSLIDIQDDRPAVE---GERIGAALAVRCILRPG 320
Query: 362 SEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAW 421
++ F +AWD P F +G +YYR YT F+G + A ++ R A+ +W IEAW
Sbjct: 321 KIRKIPFFIAWDLPVCEFEAGVSYYRHYTDFFGRNGRNAWSMIRTAMKHSDTWRENIEAW 380
Query: 422 QRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
Q PIL + LP W+ + LFNELY L GG++WT
Sbjct: 381 QNPILNRQDLPPWFKMALFNELYLLTDGGSLWT 413
>gi|427733995|ref|YP_007053539.1| putative bile acid beta-glucosidase [Rivularia sp. PCC 7116]
gi|427369036|gb|AFY52992.1| putative bile acid beta-glucosidase [Rivularia sp. PCC 7116]
Length = 799
Score = 218 bits (556), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 139/383 (36%), Positives = 206/383 (53%), Gaps = 27/383 (7%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLD- 587
IGQF LE ++Y + + DV Y SF L+ LFP+++ ++ R FA A+ D K +++
Sbjct: 419 IGQFAVLECLDYRWYESLDVRLYGSFGLLNLFPELEKAVMRAFARAIPESD-EKTRVIGY 477
Query: 588 ------DGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDV 639
+ RK GA PHD+G + W + N D +WKDL FVLQVYRD
Sbjct: 478 YYTIGAESPLAVRKAAGATPHDLGAPNEHVWEKTNYTSYQDCNQWKDLGCDFVLQVYRDF 537
Query: 640 VATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGG 697
+ TG D +F W ++ + Y+ FD+DGDG+ EN G PDQT+D W + G+SAY GG
Sbjct: 538 IFTGSNDIEFLADCWNAIVETLNYLKAFDKDGDGIPENSGAPDQTFDDWRLLGVSAYCGG 597
Query: 698 LWVAALQAASALAREVGDR--------GSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSG 748
LW+AAL+AA A+A + + + + +K++ VY EKLWNG ++ D+
Sbjct: 598 LWLAALEAAIAIAEVLSSKQVQVENLAAQKSIYEGWLEKSRPVYQEKLWNGKFYRLDS-- 655
Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
S S + ADQL GQ+YAR L IV + SAL VY+ LK GK GA NG+
Sbjct: 656 ESGSDVVMADQLCGQFYARLLKLPDIVPPECTMSALNSVYDACFLKFNQGKLGAANGLRL 715
Query: 809 DGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTP 867
DG + + E+W+G+ + +AA ++ + F+ + + G F+TP
Sbjct: 716 DGSPENPDATHPLEVWTGINFGIAAFLVQMGMKSEAFKLTEAVVNQIYEH---GLQFRTP 772
Query: 868 EAWNTDDQYRSLCYMRPLAIWAM 890
EA +R+ Y+R +AIWA+
Sbjct: 773 EAITATRNFRACVYLRAMAIWAI 795
Score = 205 bits (521), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 137/390 (35%), Positives = 197/390 (50%), Gaps = 42/390 (10%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
G+PLGG G+G IGRS RG+F W I + A QFS+F S SN +V
Sbjct: 40 QGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFGNIPACQFSLFESSSN----RNVAYAL 95
Query: 155 TPEVLKDTTAAGIGSWDW---NLKGDKS-TYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
+ E D+T + +W+W KGD S TYHALYPR+W V+E + R+ C Q SPI
Sbjct: 96 STEPSNDST---LSAWEWYPSRSKGDSSGTYHALYPRSWFVYENVF--QSRLSCEQFSPI 150
Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGV 270
NY+ESSYPV++F + N ++++ TW N VG FT + + +M D
Sbjct: 151 WADNYQESSYPVAIFLWNASNPTSEPVTLSIMLTWQNMVGW---FTNTLKSPEVEMRDDG 207
Query: 271 HAVLLHH--------------RTSHQLPPVTFALAAQETDGVHVSLC------PHFVISG 310
V + S + V + E S C P I
Sbjct: 208 SPVYDYQPNLGESKGNFNRLVTNSEYIGCVLDKIGTAEPIEGEGSWCIATAKHPKVEIFH 267
Query: 311 NSL---GLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
++ T +D+W + GS ++ ++++E +GAAIA T+ P Q+
Sbjct: 268 HNRWNPTSTGEDIWRSFAKTGSLANISDETPAISNE---QVGAAIAVRFTLQPGESLQIP 324
Query: 368 FSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
F++AWD P F+SG TY RRYT F+G + A NIA A+ E+ +W Q++AWQ+PI+E
Sbjct: 325 FAVAWDFPITEFVSGVTYKRRYTDFFGDNGKNAWNIATTALEEYQNWLSQVQAWQQPIVE 384
Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
LP+W+ + LFNELY L AGG +W+ S
Sbjct: 385 RPDLPDWFKMALFNELYDLTAGGTLWSAAS 414
>gi|157119417|ref|XP_001653371.1| bile acid beta-glucosidase, putative [Aedes aegypti]
gi|108883154|gb|EAT47379.1| AAEL001478-PA, partial [Aedes aegypti]
Length = 898
Score = 217 bits (552), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 109/260 (41%), Positives = 163/260 (62%), Gaps = 11/260 (4%)
Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
+ K+++P+ + ++D+D DG+IEN PDQTYDTW + G SAY GGLW+A+L
Sbjct: 638 YLKSMYPACKQVLEKTLEWDKDNDGIIENSKCPDQTYDTWVMDGPSAYCGGLWLASLHCM 697
Query: 707 SALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
+A+A + + F +K K + EKLWNG+Y+ +D +S+ ++I +DQL G WY
Sbjct: 698 TAMANLLDQNDDCNKFREILEKGKTSFEEKLWNGTYYKFDGQSASK-NTIMSDQLCGHWY 756
Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP------DGRVDMSSMQ 818
R CG I ++ R+ALK +Y+ NV++ GG GAVNG +P DGR D S++Q
Sbjct: 757 LRCCGFDYDIFPKENVRTALKTIYDNNVMRFCGGNMGAVNGYVPSTQPNKDGRADASTVQ 816
Query: 819 SREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRS 878
E+W+GVTYA+A++MIHE + FQTA G+Y+ +G F+TPEA + YR+
Sbjct: 817 GEEVWTGVTYALASTMIHEGMFTEAFQTAGGMYQTL--SEKIGMNFETPEALYAERHYRA 874
Query: 879 LCYMRPLAIWAMQWALTRPK 898
+ YMRPL+IW++Q A + K
Sbjct: 875 IGYMRPLSIWSLQTAWEQKK 894
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 118/364 (32%), Positives = 187/364 (51%), Gaps = 33/364 (9%)
Query: 111 YRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQK-YSSVL------CPKTPEVLKDTT 163
+ GEF R+Q+ P + E V ANQF V + N + S+L C + D
Sbjct: 87 FAGEFCRFQLKPGLYEYNTVHANQFIVTIKDENNVTIFQSLLSSYRYVCTWCNDRYSDLF 146
Query: 164 AAGIG-------------SWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
A +G SW+ + K Y ALYPRAW+ ++ D +++V RQISPI
Sbjct: 147 ARKLGLVPIEIRPKNPLSSWETYINSSKCNYTALYPRAWSEYDLS-DYGIKLVQRQISPI 205
Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGV 270
IPH+YKESS P +VF +T+ N +T+ FT+ N G + NS+T
Sbjct: 206 IPHDYKESSLPAAVFVWTVENVCGKDRQVTITFTFKNGTGNKKQ--DAEGNSETAEFSHG 263
Query: 271 HAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSF 330
A + + S T+ ++ + + ++++ C F +GN + +W+++KE+G
Sbjct: 264 SAKGVSIKQSIVDMQSTYCISCKTSSEINLTRCKKFDPTGN-----GEKLWNDLKENGQL 318
Query: 331 DRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRY 389
+ S+E ++ S+ + A++A + V + FSL WD P++ F K YY+ Y
Sbjct: 319 TEV-SVEENLKSK---DVAVAVSAQILVQSTGSSDLEFSLVWDMPKIRFGKKTKEYYKYY 374
Query: 390 TKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAG 449
TK++G +A +I+ A+ G WE I WQRPIL+D LP+WY +FNELY++ G
Sbjct: 375 TKYFGKSGDAGPSISDYALQNFGKWETLINEWQRPILDDNDLPDWYKSAIFNELYFIADG 434
Query: 450 GAVW 453
G+VW
Sbjct: 435 GSVW 438
Score = 115 bits (287), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 58/137 (42%), Positives = 85/137 (62%), Gaps = 2/137 (1%)
Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDG 589
G+F YLEG EY M+NTYDVHFY+S AL L+P +Q+S+Q D+ + K L DG
Sbjct: 456 GRFAYLEGHEYRMYNTYDVHFYASHALASLWPNLQVSLQYDYKDTINREISEGRKHLYDG 515
Query: 590 QWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKF 647
+ + RK+ +VPHD+G D P+ +NAY ++D + W+DLN KF+LQVYRD
Sbjct: 516 KIIPRKIKNSVPHDLGDPDEEPFDLINAYPIHDVSEWRDLNTKFILQVYRDYYTLNRYAQ 575
Query: 648 AKAVWPSVYVAMAYMDQ 664
A S + ++ ++D+
Sbjct: 576 QNAENASKFSSIEFIDK 592
>gi|188570649|gb|ACD64360.1| hypothetical protein [Helianthus petiolaris]
Length = 210
Score = 217 bits (552), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 127/209 (60%), Positives = 157/209 (75%), Gaps = 1/209 (0%)
Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
GISAY G LW+AALQAA+A+A E+ D+ + + KF KAK +E KLWNGSYFNYD+
Sbjct: 2 GISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGS 61
Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
SS S SIQADQLAGQWY + GL + D+DK +S+L+K++++NV+KV GG+ GAVNGM P
Sbjct: 62 SSNSKSIQADQLAGQWYTASAGLPNLFDDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121
Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A WS G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181
Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
W D YRSL YMRPLAIW MQ AL +P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEKP 210
>gi|409991122|ref|ZP_11274412.1| hypothetical protein APPUASWS_08895 [Arthrospira platensis str.
Paraca]
gi|291568810|dbj|BAI91082.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409938009|gb|EKN79383.1| hypothetical protein APPUASWS_08895 [Arthrospira platensis str.
Paraca]
Length = 799
Score = 216 bits (551), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 143/389 (36%), Positives = 208/389 (53%), Gaps = 20/389 (5%)
Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM 583
D +E G+F LE ++Y + + DV Y SF L+M +P+++ S+ R FA A+ D +
Sbjct: 416 DEDEPWGRFAVLECLDYRWYESLDVRLYGSFGLLMWWPELEKSVMRAFAKAIATDDDTPR 475
Query: 584 KLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVA 641
+ + RK GA PHD+G + PW + N D WKDL FVLQVYRD +
Sbjct: 476 IIGYNQASAIRKAAGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDYLL 535
Query: 642 TG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
TG D +F WP++ + Y+ FDRD D + EN G PDQT+D W + GISAY GGLW
Sbjct: 536 TGADDIQFLVECWPAIVQTLDYLKTFDRDRDCIPENGGAPDQTFDDWRMMGISAYCGGLW 595
Query: 700 VAALQAASALAREVGDRGSE---------DYFLFKFQKAKVVYEK-LWNGSYFNYDNSGS 749
+AAL+AA A+A + S+ + + AK +Y++ LWNGSY+N D+
Sbjct: 596 LAALEAAIAIANILLSHHSDITPDTTKAIATWTNWLETAKPLYDQTLWNGSYYNLDS--Q 653
Query: 750 SQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD 809
S S + ADQL GQ+YA GL IV +AR AL+ +Y GK GA NG+
Sbjct: 654 SGSDVVMADQLCGQFYAALLGLPDIVPHHRARVALQTIYQACFQNFHNGKFGAANGVRTT 713
Query: 810 GR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
G ++ E+W+G+ + + A +I + + FQ A + + G F+TPE
Sbjct: 714 GEPINPQDTHPLEVWTGINFGLGAFLIQMGMKEEAFQLAEAVITQVYQN---GLQFRTPE 770
Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
A +R+ Y+R +AIWAM + + P
Sbjct: 771 AITAAGTFRASHYLRAMAIWAMYYQIQHP 799
Score = 194 bits (493), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 128/393 (32%), Positives = 186/393 (47%), Gaps = 53/393 (13%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS+RG+F W + +P+ QFSVF +G++ + L +
Sbjct: 41 HGMPLGGFGAGCIGRSHRGDFNLWHLDGGEHIFRPLPGCQFSVF-EEIDGKRQAYALSTQ 99
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDK----STYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
PE + +W+W D+ TYHALYPR+W V+E + ++ C Q SP+
Sbjct: 100 PPE------DGSLSTWNWYPSSDQHTQTGTYHALYPRSWFVYENVF--QTQLTCEQFSPV 151
Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG- 269
NY+E+SYPV+VF + +N +++L TW N++G FT + K+ D
Sbjct: 152 WGGNYQETSYPVAVFEWIAHNPTDKLIILSILLTWENTIGW---FTNRLDTPAVKVRDDG 208
Query: 270 -----------------VHAVLLHHRTSHQLPPVTFALAAQETDG-----------VHVS 301
H V HR + + E DG V V
Sbjct: 209 SPVYEYQPRWGDSQDNCNHFVEDFHRVGCVMTRLNINDQLGEGDGQMAIATITNPVVEVY 268
Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
+ SGN D+W + GS L + + G IGAA+A + P
Sbjct: 269 CHNRWRPSGNGY-----DIWEYFSQDGS---LIDIHDDGPALEGERIGAALAVRCILRPG 320
Query: 362 SEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAW 421
++ F +AWD P F +G TYYR YT F+G + A ++ R A+ +W IEAW
Sbjct: 321 KTRKIPFFIAWDLPVTEFEAGVTYYRHYTDFFGRNGRNAWSMIRTAMKHSDTWRENIEAW 380
Query: 422 QRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
Q PIL + LP W+ + LFNELY L GG++WT
Sbjct: 381 QNPILNRQDLPPWFKMALFNELYLLTDGGSLWT 413
>gi|357630375|gb|EHJ78536.1| hypothetical protein KGM_16417 [Danaus plexippus]
Length = 912
Score = 216 bits (551), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 123/297 (41%), Positives = 174/297 (58%), Gaps = 10/297 (3%)
Query: 625 KDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYD 684
+D K Q Y VV K++ ++PS + +DRDGDG+IEN GFPDQTYD
Sbjct: 621 EDCGEKLQPQEYAAVVWNS-KQYLSDMYPSCVTLLRRGLDWDRDGDGLIENGGFPDQTYD 679
Query: 685 TWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFN 743
W ++G SAY GGLWVA++ A A+A+ +G E F +KA+ YE KLWNGSY+
Sbjct: 680 AWVMTGPSAYCGGLWVASVSAVHAMAKILGFTDDEKEFSTLLEKARDSYERKLWNGSYYK 739
Query: 744 YDNSGSSQSSSIQADQLAGQWYARACGLL-PIVDEDKARSALKKVYNYNVLKVMGGKRGA 802
+D + S + ADQLAGQW+ RA G P+ E + AL +Y NV + + G+ GA
Sbjct: 740 FD-TKPCNSEVVMADQLAGQWFLRASGWTEPVFPEANVKKALHTIYENNVQRFLNGRMGA 798
Query: 803 VNGML--PDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGL 860
VNG + P +D +++QS E+W+GVTY +AA MI+E + + F TA G+Y +
Sbjct: 799 VNGFVRGPRPGIDTTAIQSEEVWTGVTYGLAALMIYEGMHEQAFSTAGGLYNTL---MKM 855
Query: 861 GYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWA-LTRPKPKTLEKQMKPEVTDESLL 916
G AF+TPEA + +RS+ YMRPL+IW+M A +T+P +P E+ L
Sbjct: 856 GLAFETPEALYENGNHRSVAYMRPLSIWSMYHAIITKPPQHVSTSNGQPHQAKENHL 912
Score = 203 bits (516), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 162/585 (27%), Positives = 258/585 (44%), Gaps = 108/585 (18%)
Query: 111 YRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQK-YSSVLC--PKTPEVLKDTTAAGI 167
++GEF R+Q++P + E V QF V + + + + SVL K +VL
Sbjct: 51 FKGEFCRFQLYPGLYEYVTVPECQFIVNIRDAKKETIFQSVLSTYSKPKKVLP------- 103
Query: 168 GSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTY 227
SW+WN+KG Y ALYPRAWT ++ ++++CRQISP+IPHNYK+SS P +VF +
Sbjct: 104 -SWEWNIKGADCEYTALYPRAWTTYD-LTKYGIKLICRQISPVIPHNYKDSSLPCAVFVF 161
Query: 228 TIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVT 287
+ N D+++ FTWT +G + G + + + V L + + P T
Sbjct: 162 SAKNISNEQRDVSITFTWTECLGESKKKGGCKLDLERYSAENTCGVTLEQKIADT--PCT 219
Query: 288 FALAAQETDG--VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSF--------------- 330
F +A + D +H C NS T+ +W +K+HG
Sbjct: 220 FTIAKNKPDNETIHEGYC-----LWNSTK-TSGYVWECLKKHGRLEPDPSQTPPPPKVDK 273
Query: 331 -DRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMS-GKTYYRR 388
D+ N + + + A+++++++ P G F L WD P + + K + R
Sbjct: 274 TDKDNKEDKEKAKKIYKNDSIALSSTISLDPRGSGSTEFCLVWDMPVIKYKKDSKVHKRY 333
Query: 389 YTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNA 448
YTK++G+ A A+IA A+ + WE Q+ WQ PIL + +P+W L NELY++
Sbjct: 334 YTKYFGSDGIAGASIASYALRSYRRWEKQLAEWQDPILNNSCIPDWLKSALMNELYFVAD 393
Query: 449 GGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIY 508
GG +W D + D+ +D P LE + Y
Sbjct: 394 GGTIWFD---------------VAEDYPDTD-------PRHEFGVFGYLEGHEYRMYNTY 431
Query: 509 TPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQ 568
V +SF L + + Q+++ E I S + L P
Sbjct: 432 D-VHFYASFALAQLWPNLQVVLQYMFRESI--------------SVEINKLRPS------ 470
Query: 569 RDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKD 626
++D K K ++PHD+G D P+ +NAY ++D + W+D
Sbjct: 471 --------LYDGKSCKF---------KTKDSIPHDLGEPDGVPFSHINAYNIHDVSEWRD 513
Query: 627 LNPKFVLQVYRD--VVATGDKKFAKAVWPSVYVAMAYMDQFDRDG 669
LN KF+LQV RD ++ + F Y +MAY+D RDG
Sbjct: 514 LNLKFILQVMRDYRLLRGHNAPFNN----ERYHSMAYIDDI-RDG 553
>gi|188570531|gb|ACD64301.1| hypothetical protein [Helianthus annuus]
Length = 210
Score = 216 bits (550), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 127/209 (60%), Positives = 156/209 (74%), Gaps = 1/209 (0%)
Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
GISAY G LW+AALQAA+A+A E+ D+ + + KF KAK +E KLWNGSYFNYD+
Sbjct: 2 GISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGS 61
Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
SS S SIQADQLAGQWY + GL + +DK +S+L+K++N+NV+KV GG+ GAVNGM P
Sbjct: 62 SSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFNFNVMKVRGGRMGAVNGMHP 121
Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A WS G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181
Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
W D YRSL YMRPLAIW MQ AL +P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEKP 210
>gi|188570671|gb|ACD64371.1| hypothetical protein [Bahiopsis reticulata]
gi|188570673|gb|ACD64372.1| hypothetical protein [Bahiopsis reticulata]
Length = 210
Score = 216 bits (549), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 127/209 (60%), Positives = 155/209 (74%), Gaps = 1/209 (0%)
Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
GISAY G LW+AALQAA+A+A E+ D+ S + KF KAK +E KLWNGSYFNYD+
Sbjct: 2 GISAYCGCLWLAALQAAAAMAVELNDKFSAQKYKKKFMKAKTSFESKLWNGSYFNYDSGS 61
Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
SS S SIQADQLAGQWY + GL + +DK +S+L+K++++NV+KV GG+ GAVNGM P
Sbjct: 62 SSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121
Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A WS G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDSAFTTAEGIFTAGWSEEGFGYAFQTPE 181
Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
W D YRSL YMRPLAIW MQ AL P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEAP 210
>gi|292384326|gb|ADE21416.1| unknown [Helianthus neglectus]
Length = 210
Score = 216 bits (549), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 127/209 (60%), Positives = 156/209 (74%), Gaps = 1/209 (0%)
Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
GISAY G LW+AALQAA+A+A E+ D+ + + KF KAK +E KLWNGSYFNYD+
Sbjct: 2 GISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGS 61
Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
SS S SIQADQLAGQWY + GL + EDK +S+L+K++++NV+KV GG+ GAVNGM P
Sbjct: 62 SSNSKSIQADQLAGQWYTASAGLPNLFGEDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121
Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A WS G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181
Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
W D YRSL YMRPLAIW MQ AL +P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEKP 210
>gi|188570621|gb|ACD64346.1| hypothetical protein [Helianthus petiolaris]
gi|188570623|gb|ACD64347.1| hypothetical protein [Helianthus petiolaris]
Length = 210
Score = 215 bits (548), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 126/209 (60%), Positives = 157/209 (75%), Gaps = 1/209 (0%)
Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
GISAY G LW+AALQAA+A+A E+ D+ + + KF KAK +E KLWNGSYFNYD+
Sbjct: 2 GISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGS 61
Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
SS S SIQADQLAGQWY + GL + +DK +S+L+K++++NV+KV GG+ GAVNGM P
Sbjct: 62 SSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121
Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
+G+VD + MQSRE+W+GVTY VAA+MIH ++ D F TA GI+ A WS G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAEMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181
Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
W D YRSL YMRPLAIW MQ AL +P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEKP 210
>gi|188570553|gb|ACD64312.1| hypothetical protein [Helianthus annuus]
Length = 210
Score = 214 bits (545), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 126/209 (60%), Positives = 156/209 (74%), Gaps = 1/209 (0%)
Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
GISAY G LW+AALQAA+A+A E+ D+ + + KF KAK +E KLWNGSYFNYD+
Sbjct: 2 GISAYCGCLWLAALQAATAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGS 61
Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
SS S SIQADQLAGQWY + GL + +DK +S+L+K++++NV+KV GG+ GAVNGM P
Sbjct: 62 SSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121
Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A WS G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181
Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
W D YRSL YMRPLAIW MQ AL +P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEKP 210
>gi|188570503|gb|ACD64287.1| hypothetical protein [Helianthus annuus]
gi|188570505|gb|ACD64288.1| hypothetical protein [Helianthus annuus]
gi|188570507|gb|ACD64289.1| hypothetical protein [Helianthus annuus]
gi|188570509|gb|ACD64290.1| hypothetical protein [Helianthus annuus]
gi|188570513|gb|ACD64292.1| hypothetical protein [Helianthus annuus]
gi|188570515|gb|ACD64293.1| hypothetical protein [Helianthus annuus]
gi|188570517|gb|ACD64294.1| hypothetical protein [Helianthus annuus]
gi|188570519|gb|ACD64295.1| hypothetical protein [Helianthus annuus]
gi|188570521|gb|ACD64296.1| hypothetical protein [Helianthus annuus]
gi|188570523|gb|ACD64297.1| hypothetical protein [Helianthus annuus]
gi|188570525|gb|ACD64298.1| hypothetical protein [Helianthus annuus]
gi|188570527|gb|ACD64299.1| hypothetical protein [Helianthus annuus]
gi|188570529|gb|ACD64300.1| hypothetical protein [Helianthus annuus]
gi|188570533|gb|ACD64302.1| hypothetical protein [Helianthus annuus]
gi|188570535|gb|ACD64303.1| hypothetical protein [Helianthus annuus]
gi|188570537|gb|ACD64304.1| hypothetical protein [Helianthus annuus]
gi|188570539|gb|ACD64305.1| hypothetical protein [Helianthus annuus]
gi|188570541|gb|ACD64306.1| hypothetical protein [Helianthus annuus]
gi|188570543|gb|ACD64307.1| hypothetical protein [Helianthus annuus]
gi|188570545|gb|ACD64308.1| hypothetical protein [Helianthus annuus]
gi|188570547|gb|ACD64309.1| hypothetical protein [Helianthus annuus]
gi|188570549|gb|ACD64310.1| hypothetical protein [Helianthus annuus]
gi|188570551|gb|ACD64311.1| hypothetical protein [Helianthus annuus]
gi|188570555|gb|ACD64313.1| hypothetical protein [Helianthus annuus]
gi|188570557|gb|ACD64314.1| hypothetical protein [Helianthus annuus]
gi|188570559|gb|ACD64315.1| hypothetical protein [Helianthus annuus]
gi|188570561|gb|ACD64316.1| hypothetical protein [Helianthus annuus]
gi|188570563|gb|ACD64317.1| hypothetical protein [Helianthus annuus]
gi|188570565|gb|ACD64318.1| hypothetical protein [Helianthus annuus]
gi|188570567|gb|ACD64319.1| hypothetical protein [Helianthus annuus]
gi|188570569|gb|ACD64320.1| hypothetical protein [Helianthus annuus]
gi|188570571|gb|ACD64321.1| hypothetical protein [Helianthus annuus]
gi|188570573|gb|ACD64322.1| hypothetical protein [Helianthus annuus]
gi|188570575|gb|ACD64323.1| hypothetical protein [Helianthus annuus]
gi|188570577|gb|ACD64324.1| hypothetical protein [Helianthus annuus]
gi|188570579|gb|ACD64325.1| hypothetical protein [Helianthus annuus]
gi|188570581|gb|ACD64326.1| hypothetical protein [Helianthus annuus]
gi|188570583|gb|ACD64327.1| hypothetical protein [Helianthus annuus]
gi|188570585|gb|ACD64328.1| hypothetical protein [Helianthus annuus]
gi|188570587|gb|ACD64329.1| hypothetical protein [Helianthus annuus]
gi|188570589|gb|ACD64330.1| hypothetical protein [Helianthus annuus]
gi|188570591|gb|ACD64331.1| hypothetical protein [Helianthus annuus]
gi|188570593|gb|ACD64332.1| hypothetical protein [Helianthus annuus]
gi|188570595|gb|ACD64333.1| hypothetical protein [Helianthus annuus]
gi|188570597|gb|ACD64334.1| hypothetical protein [Helianthus annuus]
gi|188570599|gb|ACD64335.1| hypothetical protein [Helianthus annuus]
gi|188570601|gb|ACD64336.1| hypothetical protein [Helianthus annuus]
gi|188570603|gb|ACD64337.1| hypothetical protein [Helianthus petiolaris]
gi|188570605|gb|ACD64338.1| hypothetical protein [Helianthus petiolaris]
gi|188570607|gb|ACD64339.1| hypothetical protein [Helianthus petiolaris]
gi|188570609|gb|ACD64340.1| hypothetical protein [Helianthus petiolaris]
gi|188570611|gb|ACD64341.1| hypothetical protein [Helianthus petiolaris]
gi|188570613|gb|ACD64342.1| hypothetical protein [Helianthus petiolaris]
gi|188570615|gb|ACD64343.1| hypothetical protein [Helianthus petiolaris]
gi|188570617|gb|ACD64344.1| hypothetical protein [Helianthus petiolaris]
gi|188570619|gb|ACD64345.1| hypothetical protein [Helianthus petiolaris]
gi|188570625|gb|ACD64348.1| hypothetical protein [Helianthus petiolaris]
gi|188570627|gb|ACD64349.1| hypothetical protein [Helianthus petiolaris]
gi|188570629|gb|ACD64350.1| hypothetical protein [Helianthus petiolaris]
gi|188570633|gb|ACD64352.1| hypothetical protein [Helianthus petiolaris]
gi|188570637|gb|ACD64354.1| hypothetical protein [Helianthus petiolaris]
gi|188570641|gb|ACD64356.1| hypothetical protein [Helianthus petiolaris]
gi|188570643|gb|ACD64357.1| hypothetical protein [Helianthus petiolaris]
gi|188570645|gb|ACD64358.1| hypothetical protein [Helianthus petiolaris]
gi|188570651|gb|ACD64361.1| hypothetical protein [Helianthus petiolaris]
gi|188570653|gb|ACD64362.1| hypothetical protein [Helianthus petiolaris]
gi|188570655|gb|ACD64363.1| hypothetical protein [Helianthus petiolaris]
gi|188570657|gb|ACD64364.1| hypothetical protein [Helianthus petiolaris]
gi|188570659|gb|ACD64365.1| hypothetical protein [Helianthus petiolaris]
gi|188570661|gb|ACD64366.1| hypothetical protein [Helianthus petiolaris]
gi|188570663|gb|ACD64367.1| hypothetical protein [Helianthus petiolaris]
gi|188570665|gb|ACD64368.1| hypothetical protein [Helianthus petiolaris]
gi|292384298|gb|ADE21402.1| unknown [Helianthus neglectus]
gi|292384300|gb|ADE21403.1| unknown [Helianthus neglectus]
gi|292384302|gb|ADE21404.1| unknown [Helianthus neglectus]
gi|292384304|gb|ADE21405.1| unknown [Helianthus neglectus]
gi|292384312|gb|ADE21409.1| unknown [Helianthus neglectus]
gi|292384318|gb|ADE21412.1| unknown [Helianthus neglectus]
gi|292384320|gb|ADE21413.1| unknown [Helianthus neglectus]
gi|292384328|gb|ADE21417.1| unknown [Helianthus neglectus]
Length = 210
Score = 214 bits (545), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 126/209 (60%), Positives = 156/209 (74%), Gaps = 1/209 (0%)
Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
GISAY G LW+AALQAA+A+A E+ D+ + + KF KAK +E KLWNGSYFNYD+
Sbjct: 2 GISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGS 61
Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
SS S SIQADQLAGQWY + GL + +DK +S+L+K++++NV+KV GG+ GAVNGM P
Sbjct: 62 SSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121
Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A WS G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181
Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
W D YRSL YMRPLAIW MQ AL +P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEKP 210
>gi|188570631|gb|ACD64351.1| hypothetical protein [Helianthus petiolaris]
Length = 210
Score = 214 bits (544), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 126/209 (60%), Positives = 156/209 (74%), Gaps = 1/209 (0%)
Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
GISAY G LW+AALQAA+A+A E+ D+ + + KF KAK +E KLWNGSYFNYD+
Sbjct: 2 GISAYCGCLWLAALQAAAAMAVELNDKLAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGS 61
Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
SS S SIQADQLAGQWY + GL + +DK +S+L+K++++NV+KV GG+ GAVNGM P
Sbjct: 62 SSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121
Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A WS G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181
Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
W D YRSL YMRPLAIW MQ AL +P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEKP 210
>gi|444729905|gb|ELW70308.1| Non-lysosomal glucosylceramidase [Tupaia chinensis]
Length = 279
Score = 213 bits (542), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 114/240 (47%), Positives = 147/240 (61%), Gaps = 10/240 (4%)
Query: 659 MAYMD---QFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD 715
MA M+ +FD+D DG+IEN G+ DQTYD W +G SAY GGLW+AA+ +A G
Sbjct: 1 MAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGA 60
Query: 716 RGSEDYFLFKFQKAKVVYEKL-WNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL--- 771
+ ++ F + + YE+L WNG Y+NYD+S QS SI +DQ AGQW+ +ACGL
Sbjct: 61 QDVQEKFTSILTRGQEAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLKACGLGEG 120
Query: 772 -LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAV 830
+ AL+ ++ +NV GG GAVNGM P G D SS+QS E+W GV Y +
Sbjct: 121 DTEVFPTPHVVRALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGL 180
Query: 831 AASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
AA+MI E L GF+TA G Y W LG AFQTPEA+ +RSL YMRPL+IWAM
Sbjct: 181 AATMIQEGLTWEGFRTAEGCYRTVWE--RLGLAFQTPEAYCQQRVFRSLAYMRPLSIWAM 238
>gi|292384310|gb|ADE21408.1| unknown [Helianthus neglectus]
Length = 210
Score = 213 bits (541), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 126/209 (60%), Positives = 156/209 (74%), Gaps = 1/209 (0%)
Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
GISAY G LW+AALQAA+A+A E+ D+ + + KF KAK +E KLWNGSYFNYD+
Sbjct: 2 GISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFFKAKSSFESKLWNGSYFNYDSGS 61
Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
SS S SIQADQLAGQWY + GL + +DK +S+L+K++++NV+KV GG+ GAVNGM P
Sbjct: 62 SSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121
Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A WS G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181
Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
W D YRSL YMRPLAIW MQ AL +P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEKP 210
>gi|188570647|gb|ACD64359.1| hypothetical protein [Helianthus petiolaris]
Length = 210
Score = 212 bits (540), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 126/209 (60%), Positives = 155/209 (74%), Gaps = 1/209 (0%)
Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
GISAY G LW+AALQAA+A+A E+ D+ + + KF KAK +E KLWNGSYFNYD+
Sbjct: 2 GISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGS 61
Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
SS S SIQADQLAGQWY + GL + +DK S+L+K++++NV+KV GG+ GAVNGM P
Sbjct: 62 SSNSKSIQADQLAGQWYTASAGLPNLFGDDKIPSSLQKIFDFNVMKVRGGRMGAVNGMHP 121
Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A WS G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181
Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
W D YRSL YMRPLAIW MQ AL +P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEKP 210
>gi|188570511|gb|ACD64291.1| hypothetical protein [Helianthus annuus]
Length = 210
Score = 212 bits (540), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 125/209 (59%), Positives = 156/209 (74%), Gaps = 1/209 (0%)
Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
GISAY G LW+AALQAA+A+A E+ D+ + + KF KAK +E KLWNGSYFNYD+
Sbjct: 2 GISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGS 61
Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
SS S SIQADQLAGQWY + GL + +DK +S+L+K++++NV+KV GG+ GAVNGM P
Sbjct: 62 SSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121
Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A WS G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181
Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
W D YRSL YMRPLAIW +Q AL +P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGIQQALEKP 210
>gi|188570635|gb|ACD64353.1| hypothetical protein [Helianthus petiolaris]
gi|188570639|gb|ACD64355.1| hypothetical protein [Helianthus petiolaris]
Length = 211
Score = 212 bits (539), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 102/163 (62%), Positives = 124/163 (76%)
Query: 735 KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
KLWNGSYFNYD+ SS S SIQADQLAGQWY + GL + +DK +S+L+K++++NV+K
Sbjct: 49 KLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMK 108
Query: 795 VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
V GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A
Sbjct: 109 VRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAG 168
Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
WS G GYAFQTPE W D YRSL YMRPLAIW MQ AL +P
Sbjct: 169 WSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEKP 211
>gi|292384324|gb|ADE21415.1| unknown [Helianthus neglectus]
Length = 210
Score = 212 bits (539), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 102/163 (62%), Positives = 124/163 (76%)
Query: 735 KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
KLWNGSYFNYD+ SS S SIQADQLAGQWY + GL + +DK +S+L+K++++NV+K
Sbjct: 48 KLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMK 107
Query: 795 VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
V GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A
Sbjct: 108 VRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAG 167
Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
WS G GYAFQTPE W D YRSL YMRPLAIW MQ AL +P
Sbjct: 168 WSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210
>gi|292384322|gb|ADE21414.1| unknown [Helianthus neglectus]
Length = 207
Score = 212 bits (539), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 102/163 (62%), Positives = 124/163 (76%)
Query: 735 KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
KLWNGSYFNYD+ SS S SIQADQLAGQWY + GL + +DK +S+L+K++++NV+K
Sbjct: 45 KLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMK 104
Query: 795 VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
V GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A
Sbjct: 105 VRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAG 164
Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
WS G GYAFQTPE W D YRSL YMRPLAIW MQ AL +P
Sbjct: 165 WSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEKP 207
>gi|443318654|ref|ZP_21047901.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 6406]
gi|442781756|gb|ELR91849.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 6406]
Length = 857
Score = 211 bits (538), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 135/409 (33%), Positives = 200/409 (48%), Gaps = 52/409 (12%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
+GQF LE ++Y + + DV Y FA ++L+P+++ ++ R FA A+ D
Sbjct: 439 VGQFGVLECVDYRWYESLDVRLYGGFATLILWPELEKAVIRAFARAIPTADDHPRIIGYY 498
Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
+ + + RK+ GA PHD+G + P+ + N D +WKDL FV+QVYR
Sbjct: 499 YTIGETDHYAPRKLPGATPHDLGAPNEHPFVKTNYTSYQDCNQWKDLPSDFVIQVYRAYK 558
Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TG D F WP+V + Y+ +FD D DG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 559 MTGATDGDFLADCWPAVTETLKYLKRFDTDSDGIPENGGAPDQTFDDWQLKGLSAYCGGL 618
Query: 699 WVAALQAASALAREVGDRGS---------EDYFLFKFQKAKVVYEKLWNGSYFNYDNSGS 749
W+AAL++A A+A + G Y + Q KV +++LWNG Y+ +
Sbjct: 619 WMAALESAIAIADTLTAAGRAPGNTPILVTQYRRWLEQSRKVYHKRLWNGRYYRLET--G 676
Query: 750 SQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVM------------- 796
S SS + ADQL GQ+ AR L +V+E +SAL +Y+ LK
Sbjct: 677 SDSSVVMADQLCGQFCARLMELPDLVEEKYVQSALAAIYDACFLKFNNYAATQARPQEQK 736
Query: 797 --------------GGKRGAVNGMLPDGRV-DMSSMQSREIWSGVTYAVAASMIHEDLAD 841
G GA NG+LPDG D E+W+G+ + +A D
Sbjct: 737 FIGSQTGTFRSAQPGIPIGAANGVLPDGSPEDPDGTHQLEVWTGINFGLATFFAQMGQVD 796
Query: 842 IGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
F + + ++ G F+TPEA +R+ Y+RPLAIW +
Sbjct: 797 KAFAITEAVVQQIYT---HGLQFRTPEAITALGTFRACHYLRPLAIWGL 842
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 129/410 (31%), Positives = 184/410 (44%), Gaps = 67/410 (16%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG PLGG G+G IGRS G+F W + + A QFSVF + Q+ + L +
Sbjct: 39 HGAPLGGFGAGCIGRSPHGDFNLWHLDGGEHTFQTFPACQFSVFEQWGD-QRQAYALSTE 97
Query: 155 TPEVLKDTTAAGIGSWDW------------NLKGDKS-----------TYHALYPRAWTV 191
P T +G W W K S TYHALYPR+W V
Sbjct: 98 AP------TDGTLGQWQWYPATLQNSNQNSKFKIQNSELPASPPLSTGTYHALYPRSWYV 151
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG- 250
+E +L C Q SPI NY+E+SYPV+VF +T +N +++L +W N VG
Sbjct: 152 YENVFQAQL--TCEQFSPIWARNYQEASYPVAVFLWTAHNPTDQPITLSILLSWQNMVGW 209
Query: 251 ---------------GDSEF-----TGQHYNSKTKMNDGVHAV--LLHHRTSHQLPPV-- 286
G + GQ + ++ AV +++ S + P
Sbjct: 210 FTNTAKSPQVLQRDDGSPYYDYVSAIGQSAGNANRLVQTGAAVGYVMNGAISPGITPTEG 269
Query: 287 --TFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEP 344
+A+A GV V + N +G D+W+ GS +NS + +
Sbjct: 270 EGQWAIATLTQPGVEV----FYHTRWNPVG-DGGDLWNGFAIDGSL--VNSGDDRPAAA- 321
Query: 345 GSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIA 404
G IG AIA T+ P Q+ +L WD P + +G YRRYT F+G A IA
Sbjct: 322 GEQIGGAIAVRFTLQPGETRQMPMTLVWDLPVTEYAAGVADYRRYTDFFGREGQNAWVIA 381
Query: 405 RDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
+ A+ + +W QI WQ+PIL+ P+W+ + LFNELY L +GG +WT
Sbjct: 382 QTALANYTTWRQQIINWQQPILDQPDFPDWFKMALFNELYDLTSGGTLWT 431
>gi|292384308|gb|ADE21407.1| unknown [Helianthus neglectus]
Length = 205
Score = 211 bits (537), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 102/160 (63%), Positives = 123/160 (76%)
Query: 735 KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
KLWNGSYFNYD+ SS S SIQADQLAGQWY + GL + D+DK +S+L+K++++NV+K
Sbjct: 45 KLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASAGLPNLFDDDKIQSSLQKIFDFNVMK 104
Query: 795 VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
V GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A
Sbjct: 105 VRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAG 164
Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
WS G GYAFQTPE W D YRSL YMRPLAIW MQ AL
Sbjct: 165 WSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQAL 204
>gi|195436933|ref|XP_002066400.1| GK18271 [Drosophila willistoni]
gi|194162485|gb|EDW77386.1| GK18271 [Drosophila willistoni]
Length = 955
Score = 210 bits (535), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 140/427 (32%), Positives = 219/427 (51%), Gaps = 44/427 (10%)
Query: 55 KETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGE 114
K+T+ + P+ R + + +G+R ++D + + +GVP+GG+G G+IGR Y GE
Sbjct: 81 KQTLPMVPLVCRYAAYYWKVSREGRRVYMDYYYMENGKQIYGVPIGGIGGGTIGRGYAGE 140
Query: 115 FQRWQIFPRVCEDKPVLANQFSVFVSRSNG--------QKYSSVLCPKTPEVLKDTTAAG 166
F R+Q+ P + E VLANQF V + G K SS++ E +D +
Sbjct: 141 FCRFQMRPGIYEYNTVLANQFIVTIKDQKGCTIFQSLLSKCSSIINNNDTEADEDEKSRT 200
Query: 167 ---------------IGSWDWNLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQI 207
+ +W N+ + +Y LYPR+WT H G +R++CRQI
Sbjct: 201 KCQLPNCSSSRSRQPLSTWHSNIDDSRCSYTGLYPRSWTEYDLSHYG-----VRLICRQI 255
Query: 208 SPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMN 267
SP+IPH+YKESS P +VF ++ N +++ FT+ N G + S+ +N
Sbjct: 256 SPVIPHDYKESSLPCAVFVWSAENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQL-IN 314
Query: 268 DGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEH 327
+G HA + R P ++ LA + + ++ CP F +GN L W ++KEH
Sbjct: 315 EG-HAKGVAIRQKIADMPCSYNLACRVLPDISITRCPQFDPAGNGEAL-----WAQLKEH 368
Query: 328 GSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG-KTYY 386
G + ET T + IG A+ A + + P + ++ F LAWD P++ F KT+
Sbjct: 369 GQLSDEPTQETLKTKD----IGVAVCAHIALKPQASHELEFVLAWDMPKIQFPRKLKTHT 424
Query: 387 RRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYL 446
R YTK++ A I A+ + SWE I+AWQRPIL D+ LPEWY +FN+LY++
Sbjct: 425 RYYTKYFDDSGEAGPRICEYALKHYQSWERLIDAWQRPILNDEGLPEWYKCAIFNQLYFI 484
Query: 447 NAGGAVW 453
+ GG +W
Sbjct: 485 SDGGTIW 491
Score = 202 bits (513), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 109/261 (41%), Positives = 158/261 (60%), Gaps = 12/261 (4%)
Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
+ K+++ + M ++D+D DG+IEN PDQTYD+W + G SAY GLW+A+LQA
Sbjct: 694 YLKSMYAACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLASLQAM 753
Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
SA+A + + +K K + EKLWNGSY+ +D S S +I ADQL G WY
Sbjct: 754 SAMATILDQPNDCLRYQDILEKGKHSLEEKLWNGSYYRFDLS-HSHRDTIMADQLCGHWY 812
Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
++CG I ++ R+ALK++Y+ NV+ G GA NG + + G VD S++
Sbjct: 813 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANVSEPSKAGHVDNSNI 872
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
Q+ E+W GV YA+AA+MI E + D FQTA G+Y+ G+ F+TPEA + +YR
Sbjct: 873 QAEEVWPGVVYALAATMIQEGMFDEAFQTAGGMYKTISQRIGMN--FETPEALYGEKRYR 930
Query: 878 SLCYMRPLAIWAMQWALTRPK 898
S+ YMRPL+IW+MQ A R +
Sbjct: 931 SIGYMRPLSIWSMQVAWERRR 951
Score = 122 bits (305), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 68/157 (43%), Positives = 94/157 (59%), Gaps = 3/157 (1%)
Query: 511 VALNSSFGTNLLQDGEE-NIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQR 569
+ +SS G L D G+F YLEG EY M+NTYDVHFY+S AL L+P +Q+S+Q
Sbjct: 492 LKCDSSLGKELAYDDPRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQF 551
Query: 570 DFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDL 627
DF A+ K+L DG+ + RKV VPHD+G D P+ +N Y ++D WKDL
Sbjct: 552 DFKDAIAAELNDTRKMLYDGKIMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNEWKDL 611
Query: 628 NPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
N KFVLQVYRD + A+A S + ++ ++D+
Sbjct: 612 NTKFVLQVYRDYYVLNELAQAQADNASKFSSIEFIDK 648
>gi|188570667|gb|ACD64369.1| hypothetical protein [Bahiopsis lanata]
gi|188570669|gb|ACD64370.1| hypothetical protein [Bahiopsis lanata]
Length = 208
Score = 210 bits (534), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 102/163 (62%), Positives = 122/163 (74%)
Query: 735 KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
KLWNGSYFNYD+ SS S SIQADQLAGQWY + GL + +DK +S+L K++++NV+K
Sbjct: 46 KLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLHKIFDFNVMK 105
Query: 795 VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
V GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A
Sbjct: 106 VRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDKAFTTAEGIFTAG 165
Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
WS G GYAFQTPE W D YRSL YMRPLAIW MQ AL P
Sbjct: 166 WSEEGFGYAFQTPEGWTMDGCYRSLVYMRPLAIWGMQQALEAP 208
>gi|292384306|gb|ADE21406.1| unknown [Helianthus neglectus]
Length = 205
Score = 208 bits (530), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 101/160 (63%), Positives = 122/160 (76%)
Query: 735 KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
KLWNGSYFNYD+ SS S SIQADQLAGQWY + GL + +DK +S+L+K++++NV+K
Sbjct: 45 KLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMK 104
Query: 795 VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
V GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A
Sbjct: 105 VRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAG 164
Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
WS G GYAFQTPE W D YRSL YMRPLAIW MQ AL
Sbjct: 165 WSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQAL 204
>gi|292384314|gb|ADE21410.1| unknown [Helianthus neglectus]
Length = 205
Score = 208 bits (530), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 101/160 (63%), Positives = 122/160 (76%)
Query: 735 KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
KLWNGSYFNYD+ SS S SIQADQLAGQWY + GL + +DK +S+L+K++++NV+K
Sbjct: 45 KLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMK 104
Query: 795 VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
V GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A
Sbjct: 105 VRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAG 164
Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
WS G GYAFQTPE W D YRSL YMRPLAIW MQ AL
Sbjct: 165 WSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQAL 204
>gi|292384316|gb|ADE21411.1| unknown [Helianthus neglectus]
Length = 205
Score = 208 bits (529), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 101/160 (63%), Positives = 122/160 (76%)
Query: 735 KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
KLWNGSYFNYD+ SS S SIQADQLAGQWY + GL + +DK +S+L+K++++NV+K
Sbjct: 45 KLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMK 104
Query: 795 VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
V GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A
Sbjct: 105 VRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAG 164
Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
WS G GYAFQTPE W D YRSL YMRPLAIW MQ AL
Sbjct: 165 WSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQAL 204
>gi|303285310|ref|XP_003061945.1| hypothetical protein MICPUCDRAFT_48400 [Micromonas pusilla CCMP1545]
gi|226456356|gb|EEH53657.1| hypothetical protein MICPUCDRAFT_48400 [Micromonas pusilla CCMP1545]
Length = 1073
Score = 206 bits (525), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 249/995 (25%), Positives = 394/995 (39%), Gaps = 227/995 (22%)
Query: 93 SSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVL- 151
+ HG PLGG+GSGSIGRSY G+F RW + + V A+ F+ R NG ++VL
Sbjct: 74 ARHGAPLGGLGSGSIGRSYLGDFSRWHLKVGSHTHR-VAAHTFAA--CRVNG--VATVLS 128
Query: 152 ----------CPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPD---P 198
P + + G Y ALYPRAW +E PD P
Sbjct: 129 ASPAHASLEGVPSSEDDDDSPPPRTRRRRTLPEDGIGGNYTALYPRAW--YEYAPDALHP 186
Query: 199 ELRIVCRQI--SPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTN--------- 247
+ R+V Q+ SP++P +Y+ S+ P V + N G+ +A+++++ ++ +
Sbjct: 187 DARVVVSQVQFSPVLPGDYRASAVPTGVLRFVARNDGRATANVSIMLSFESVLASADVAP 246
Query: 248 -SVGGDSE------------------------------FTGQHYNSKTKMNDGVHAVLLH 276
V G + G + + DG
Sbjct: 247 LDVSGPRDAWRVASEPTTHETFFSSRESRREGRRRRRIIVGGAHAHTVGVTDGTFTPSAD 306
Query: 277 HRTSHQLPPVTFALAAQETDGVHVSLCPHF-VISGNSLGLTAKDMWHEIKEHGSFDRLNS 335
H A+AA+ DGV VS+ + + G+++ W + +G F
Sbjct: 307 HHGG-------VAIAAEGGDGVEVSVLREYDTLDGDAVAAA----WDAFERNGGFAEDED 355
Query: 336 METSV----------------TSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF 379
E + PG++ AA++A ++ P + F++AWD P F
Sbjct: 356 EEDEAAAEEDAASAASASAAKVASPGAA--AAVSAKFSLKPGERRSIAFAIAWDLPVATF 413
Query: 380 MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDK---------R 430
G + +RY AA++A A+ SWE ++ AWQ P ++ R
Sbjct: 414 PRGPAFLKRYAWRRRRRGGGAADVAMSALRRAASWERRVRAWQAPYVDSAANADSKSPVR 473
Query: 431 LPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQ- 489
P W+ LFNELYYL GG +W G P +++G D + +
Sbjct: 474 RPGWFVTALFNELYYLVDGGTLW--GRP-----LSVGGLDGDEFDDDEDDEVEDALDGDS 526
Query: 490 -NDTAVNIL--------ERMSSILEQIYTPVALNSSFGTNLLQDGEE---NIGQFLYLEG 537
+D V +L ER S + + +++ +G +G+F
Sbjct: 527 ADDAEVRLLPIRPRSRGERRSLRRLGLLGGEDGDGGDARDVIGNGGAWGGTLGRFGSASD 586
Query: 538 IEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL----------- 586
++ +N D +FY S+ L L+P + L++ D A AV D ++ ++L
Sbjct: 587 VDRATYNALDEYFYGSWPLATLWPGLDLAVIGDLAHAVDAADDTERRVLWRALGSAKKSA 646
Query: 587 ---------DDGQWVSRKVLGAVPHDIGICD--PWFEV-NAYCLYDTARWKDLNPKFVLQ 634
Q RK+ G P D+G P NA D+ DL PK +L
Sbjct: 647 GGTGATTTTIASQTKRRKIRGVAPRDLGTSRDAPLTSAPNALSDVDSNALLDLAPKLMLL 706
Query: 635 VYRDVVATG---------------DKKFAKAVWPSVYVAMA-YMDQFDRDGDGMIEN--- 675
V R G ++ + V+ Y A+A + D +GDG+ E+
Sbjct: 707 VARAHALRGGFRGPEAMASFGGGLERDALRRVFEPAYRALATQLRARDLNGDGLPEHGDA 766
Query: 676 DGFPDQTYDTWSVSG-ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQ-KAKVVY 733
D YD W VSG S+Y GGLW+AAL+A + +AR++ + + + AK
Sbjct: 767 DAPTLSGYDRWVVSGNASSYGGGLWLAALRAGAGIARDLDETDARTSLEETLRIAAKSFD 826
Query: 734 EKLWNG------------------------SYFNYDNSGSSQSSSIQADQLAGQW---YA 766
E LW G Y+ D SG+ + A Q+ G+W
Sbjct: 827 EALWRGGGGGGGGGVRKSQSRSRRGTHALEGYYAADASGTDAGDVVLAGQVMGEWALGMI 886
Query: 767 RACGLLPIVDEDKARSALKKVYNYNVLKVMGGKR-----------------GAVNGMLPD 809
RA G+L D K R+AL Y NV K G R GAVNG D
Sbjct: 887 RAPGVL---DPRKVRAALSTTYERNV-KAFGRARGGSGSGSDDEPSTYVPSGAVNGARVD 942
Query: 810 GRVDMSSM----------QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTG 859
D +++ Q+RE G +YA+A+ +I +D G+ A G Y +
Sbjct: 943 DATDAAALGVGAGDGIPAQARESRVGQSYALASHLILAGFSDEGWDVARGAYRVTYED-- 1000
Query: 860 LGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
G+AF+TPE ++ + ++R R A+WA++ +L
Sbjct: 1001 -GFAFRTPEIFDAERRFRGAISGRAGAVWAIERSL 1034
>gi|45552032|ref|NP_788055.2| CG33090 [Drosophila melanogaster]
gi|74876618|sp|Q7KT91.1|C3390_DROME RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase
gi|45445134|gb|AAO41192.2| CG33090 [Drosophila melanogaster]
gi|201065481|gb|ACH92150.1| FI02015p [Drosophila melanogaster]
Length = 948
Score = 206 bits (523), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 112/263 (42%), Positives = 160/263 (60%), Gaps = 12/263 (4%)
Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
+ KA++ S M ++D+D DG+IEN PDQTYD+W + G SAY GLW+AALQA
Sbjct: 687 YLKAMYASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 746
Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
SA+A + + +K K + EKLWNGSY+ +D S S +I ADQL G WY
Sbjct: 747 SAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLS-HSHRDTIMADQLCGHWY 805
Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
++CG I ++ R+ALK++Y+ NV+ G GA NG + + G VD S++
Sbjct: 806 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVDNSNI 865
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
Q+ E+W GV YA+AA+MI E + + FQTA G+Y+ G+ F+TPEA + +YR
Sbjct: 866 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMN--FETPEALYGEKRYR 923
Query: 878 SLCYMRPLAIWAMQWALTRPKPK 900
S+ YMRPL+IW+MQ AL R + +
Sbjct: 924 SIGYMRPLSIWSMQVALERRRAQ 946
Score = 200 bits (508), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 127/425 (29%), Positives = 211/425 (49%), Gaps = 42/425 (9%)
Query: 55 KETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGE 114
++T+ + P+ R + + +G+R ++D + + +GVP+GG+G G+IGR Y GE
Sbjct: 76 RQTLPMVPLVCRYAAYYWKVSREGRRVYMDYYYMENGKQIYGVPIGGIGGGTIGRGYAGE 135
Query: 115 FQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAG-------- 166
F R+Q+ P + E VLANQF V + G L K K + G
Sbjct: 136 FCRFQMRPGIYEYNVVLANQFIVTIKDPKGCTIFQSLLSKCSTRDKTSDPDGDPDGERTK 195
Query: 167 -------------IGSWDWNLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISP 209
+ +W N++ + +Y LYPR+WT H G +R+ CRQ+SP
Sbjct: 196 CQLPNCSSRAKQPLSAWHSNIEDTRCSYTGLYPRSWTEYDLSHYG-----VRLTCRQVSP 250
Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
+IPH Y+ESS P +VF +++ N +++ FT+ N G + S+
Sbjct: 251 VIPHEYRESSLPCAVFVWSVENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLISEGN 310
Query: 270 VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGS 329
V + + S P ++ LA + + ++ CP F +GN + +W ++KEHG
Sbjct: 311 AKGVSIRQKISEM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHGQ 363
Query: 330 FDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRR 388
+ E T + IG A+ V + P + + F LAWD P++ F +T+ R
Sbjct: 364 LSEHPTSEALKTKD----IGVAVCGQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRY 419
Query: 389 YTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNA 448
YTK++ ++ I A+ ++ +WE I+AWQRPIL D+ LP+WY +FN+LY+++
Sbjct: 420 YTKYFDDSGDSGPRICEYALRQYSTWERLIDAWQRPILNDETLPDWYKCAIFNQLYFISD 479
Query: 449 GGAVW 453
GG +W
Sbjct: 480 GGTIW 484
>gi|288558766|gb|ADC53514.1| MIP16949p [Drosophila melanogaster]
Length = 432
Score = 206 bits (523), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 112/263 (42%), Positives = 160/263 (60%), Gaps = 12/263 (4%)
Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
+ KA++ S M ++D+D DG+IEN PDQTYD+W + G SAY GLW+AALQA
Sbjct: 171 YLKAMYASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 230
Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
SA+A + + +K K + EKLWNGSY+ +D S S +I ADQL G WY
Sbjct: 231 SAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLS-HSHRDTIMADQLCGHWY 289
Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
++CG I ++ R+ALK++Y+ NV+ G GA NG + + G VD S++
Sbjct: 290 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVDNSNI 349
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
Q+ E+W GV YA+AA+MI E + + FQTA G+Y+ G+ F+TPEA + +YR
Sbjct: 350 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMN--FETPEALYGEKRYR 407
Query: 878 SLCYMRPLAIWAMQWALTRPKPK 900
S+ YMRPL+IW+MQ AL R + +
Sbjct: 408 SIGYMRPLSIWSMQVALERRRAQ 430
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 54/125 (43%), Positives = 78/125 (62%), Gaps = 2/125 (1%)
Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVP 601
M+NTYDVHFY+S AL L+P +Q+S+Q DF A+ K+L DG+ + RKV VP
Sbjct: 1 MYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELNDTRKMLYDGKVMPRKVKNCVP 60
Query: 602 HDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAM 659
HD+G D P+ +N Y ++D WKDLN KFVLQVYRD + A++ S + ++
Sbjct: 61 HDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQAQSDNASKFSSI 120
Query: 660 AYMDQ 664
++D+
Sbjct: 121 EFIDK 125
>gi|195338407|ref|XP_002035816.1| GM15538 [Drosophila sechellia]
gi|194129696|gb|EDW51739.1| GM15538 [Drosophila sechellia]
Length = 947
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 111/263 (42%), Positives = 160/263 (60%), Gaps = 12/263 (4%)
Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
+ KA++ S + ++D+D DG+IEN PDQTYD+W + G SAY GLW+AALQA
Sbjct: 686 YLKAMYASCKAILERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 745
Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
SA+A + + +K K + EKLWNGSY+ +D S S +I ADQL G WY
Sbjct: 746 SAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLS-HSHRDTIMADQLCGHWY 804
Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
++CG I ++ R+ALK++Y+ NV+ G GA NG + + G VD S++
Sbjct: 805 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDNSNI 864
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
Q+ E+W GV YA+AA+MI E + + FQTA G+Y+ G+ F+TPEA + +YR
Sbjct: 865 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMN--FETPEALYGEKRYR 922
Query: 878 SLCYMRPLAIWAMQWALTRPKPK 900
S+ YMRPL+IW+MQ AL R + +
Sbjct: 923 SIGYMRPLSIWSMQVALERRRAQ 945
Score = 197 bits (501), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 129/425 (30%), Positives = 212/425 (49%), Gaps = 42/425 (9%)
Query: 55 KETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGE 114
++T+ L P+ R + + +G+R ++D + + +GVP+GG+G G+IGR Y GE
Sbjct: 75 RQTLPLVPLVCRYAAYYWKVSREGRRVYMDYYYMENGKQIYGVPIGGIGGGTIGRGYAGE 134
Query: 115 FQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK--TPEVLKDTTAAGIG---- 168
F R+Q+ P + E V ANQF V + G L K T + D A G
Sbjct: 135 FCRFQMRPGIYEYNVVQANQFIVTIKDPKGCTIFQSLLSKCSTRDKTSDPDADPDGERTK 194
Query: 169 ---------------SWDWNLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISP 209
+W N++ + +Y LYPR+WT H G +R+ CRQ+SP
Sbjct: 195 CQLPNCSSRAKQPLSAWHSNIEDSRCSYTGLYPRSWTEYDLSHYG-----VRLTCRQVSP 249
Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
+IPH Y+ESS P +VF +++ N +++ FT+ N G + S+
Sbjct: 250 VIPHEYRESSQPCAVFVWSVENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLISEGN 309
Query: 270 VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGS 329
V + + + P ++ LA + + ++ CP F +GN + +W ++KEHG
Sbjct: 310 AKGVSIRQKIADM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHGQ 362
Query: 330 FDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRR 388
+ E T + IG A+ A V + P + + F LAWD P++ F +T+ R
Sbjct: 363 LSEQPTSEALKTKD----IGVAVCAQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRY 418
Query: 389 YTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNA 448
YTK++ ++ I A+ ++ +WE I+AWQRPIL D+ LP+WY +FN+LY+++
Sbjct: 419 YTKYFDDSGDSGPRICEYALRQYSTWERLIDAWQRPILNDETLPDWYKCAIFNQLYFISD 478
Query: 449 GGAVW 453
GG +W
Sbjct: 479 GGTIW 483
Score = 120 bits (302), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 68/157 (43%), Positives = 93/157 (59%), Gaps = 3/157 (1%)
Query: 511 VALNSSFGTNL-LQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQR 569
+ SS G L D G+F YLEG EY M+NTYDVHFY+S AL L+P +Q+S+Q
Sbjct: 484 LKCESSLGKELSYDDPRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQY 543
Query: 570 DFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDL 627
DF A+ K+L DG+ + RKV VPHD+G D P+ +N Y ++D WKDL
Sbjct: 544 DFKDAIAAELNDTRKMLYDGKVMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDL 603
Query: 628 NPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
N KFVLQVYRD + A+A S + ++ ++D+
Sbjct: 604 NTKFVLQVYRDYYVLNELAQAQADNASKFSSIEFIDK 640
>gi|198473190|ref|XP_002133205.1| GA28784 [Drosophila pseudoobscura pseudoobscura]
gi|198139343|gb|EDY70607.1| GA28784 [Drosophila pseudoobscura pseudoobscura]
Length = 865
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 113/261 (43%), Positives = 158/261 (60%), Gaps = 12/261 (4%)
Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
+ KA++ + M ++D+D DG+IEN PDQTYD+W + G SAY GLW+AALQA
Sbjct: 604 YLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 663
Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
SA+A + F +K K + EKLWNGSY+ +D S S SI ADQL G WY
Sbjct: 664 SAMATILDQPNDCLRFQDILEKGKRSLEEKLWNGSYYRFDLS-PSHRDSIMADQLCGHWY 722
Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
++CG I ++ R+ALK +Y+ NV+ G GA NG + + G VD S++
Sbjct: 723 LKSCGFDYEIYPKENVRTALKTIYDNNVMGFHEGNIGAANGFIANAADPSKPGHVDNSNI 782
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
Q+ E+W GV YA+AA+MI E + + FQTA G+Y+ G+ F+TPEA + +YR
Sbjct: 783 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSERIGMN--FETPEALYGEKRYR 840
Query: 878 SLCYMRPLAIWAMQWALTRPK 898
S+ YMRPL+IW+MQ AL R +
Sbjct: 841 SIGYMRPLSIWSMQVALERRR 861
Score = 189 bits (479), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 122/404 (30%), Positives = 202/404 (50%), Gaps = 43/404 (10%)
Query: 77 KGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFS 136
+G+R ++D + + +GVP+GG+G G+IGR + GEF R+Q+ P + E V ANQF
Sbjct: 14 EGRRVYMDYYYMENGKQIYGVPIGGIGGGTIGRGFAGEFCRFQMRPGIYEYNVVQANQFI 73
Query: 137 VFVSRSNGQKYSSVLCPKTPEVLK-----DTTAAG-----------------IGSWDWNL 174
V + G L K + + D A G + +W N+
Sbjct: 74 VTIKDQKGCTIFQSLLSKCSTMSRGNNGSDPDADGEKTKCQLPNCSSRPKQPLSAWHSNI 133
Query: 175 KGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIY 230
+ + +Y LYPR+WT H G +R+VCRQ+SP+IPH Y++SS P +VF +++
Sbjct: 134 EDARCSYTGLYPRSWTEYDLSHYG-----VRLVCRQVSPVIPHEYRDSSLPCAVFVWSVE 188
Query: 231 NSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFAL 290
N +++ F++ N G + S+ V + + + P ++ L
Sbjct: 189 NVCDQERIVSITFSFKNGTGNKKQDAEGGAESQLISAGNAKGVAIRQKIADM--PCSYNL 246
Query: 291 AAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGA 350
A + + ++ CP F +G + + +W ++KEHG + ET T + IG
Sbjct: 247 ACRVLPEISITRCPQFDPAG-----SGEQLWAQLKEHGQLSEQPTGETLKTKD----IGV 297
Query: 351 AIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAIL 409
A+ A + + P + + F LAWD P + F +T+ R YTK++ + I A+
Sbjct: 298 AVCAQLALKPQASHDLEFVLAWDMPRIQFPRKMQTHMRYYTKYFDDSGESGPKICEYALK 357
Query: 410 EHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVW 453
++ SWE I+AWQRPIL D LP+WY +FN+LY+++ GG +W
Sbjct: 358 QYPSWERLIDAWQRPILSDDTLPDWYKCAIFNQLYFISDGGTIW 401
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 67/157 (42%), Positives = 94/157 (59%), Gaps = 3/157 (1%)
Query: 511 VALNSSFGTNLLQDGEE-NIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQR 569
+ +SS G L D G+F YLEG EY M+NTYDVHFY+S AL L+P +Q+S+Q
Sbjct: 402 LKCDSSLGKELAYDDPRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQY 461
Query: 570 DFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDL 627
DF A+ K+L DG+ + RKV VPHD+G D P+ +N Y ++D WKDL
Sbjct: 462 DFKDAIAAELSDTRKMLYDGKVMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDL 521
Query: 628 NPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
N KFVLQVYRD + ++A S + ++ ++D+
Sbjct: 522 NTKFVLQVYRDYYVLNELAQSQADNASKFSSIEFIDK 558
>gi|195164866|ref|XP_002023267.1| GL21266 [Drosophila persimilis]
gi|194105352|gb|EDW27395.1| GL21266 [Drosophila persimilis]
Length = 867
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 113/261 (43%), Positives = 158/261 (60%), Gaps = 12/261 (4%)
Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
+ KA++ + M ++D+D DG+IEN PDQTYD+W + G SAY GLW+AALQA
Sbjct: 606 YLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 665
Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
SA+A + F +K K + EKLWNGSY+ +D S S SI ADQL G WY
Sbjct: 666 SAMATILDQPNDCLRFQDILEKGKRSLEEKLWNGSYYRFDLS-PSHRDSIMADQLCGHWY 724
Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
++CG I ++ R+ALK +Y+ NV+ G GA NG + + G VD S++
Sbjct: 725 LKSCGFDYEIYPKENVRTALKTIYDNNVMGFHEGNIGAANGFIANAADPSKPGHVDNSNI 784
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
Q+ E+W GV YA+AA+MI E + + FQTA G+Y+ G+ F+TPEA + +YR
Sbjct: 785 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSERIGMN--FETPEALYGEKRYR 842
Query: 878 SLCYMRPLAIWAMQWALTRPK 898
S+ YMRPL+IW+MQ AL R +
Sbjct: 843 SIGYMRPLSIWSMQVALERRR 863
Score = 188 bits (478), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 122/404 (30%), Positives = 202/404 (50%), Gaps = 43/404 (10%)
Query: 77 KGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFS 136
+G+R ++D + + +GVP+GG+G G+IGR + GEF R+Q+ P + E V ANQF
Sbjct: 16 EGRRVYMDYYYMENGKQIYGVPIGGIGGGTIGRGFAGEFCRFQMRPGIYEYNVVQANQFI 75
Query: 137 VFVSRSNGQKYSSVLCPKTPEVLK-----DTTAAG-----------------IGSWDWNL 174
V + G L K + + D A G + +W N+
Sbjct: 76 VTIKDQKGCTIFQSLLSKCSTMSRGNNGSDPDADGEKTKCQLPNCSSRPKQPLSAWHSNI 135
Query: 175 KGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIY 230
+ + +Y LYPR+WT H G +R+VCRQ+SP+IPH Y++SS P +VF +++
Sbjct: 136 EDARCSYTGLYPRSWTEYDLSHYG-----VRLVCRQVSPVIPHEYRDSSLPCAVFVWSVE 190
Query: 231 NSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFAL 290
N +++ F++ N G + S+ V + + + P ++ L
Sbjct: 191 NVCDQERIVSITFSFKNGTGNKKQDAEGGAESQLISAGNAKGVAIRQKIADM--PCSYNL 248
Query: 291 AAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGA 350
A + + ++ CP F +G + + +W ++KEHG + ET T + IG
Sbjct: 249 ACRVLPEISITRCPQFDPAG-----SGEQLWAQLKEHGQLSEQPTGETLKTKD----IGV 299
Query: 351 AIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAIL 409
A+ A + + P + + F LAWD P + F +T+ R YTK++ + I A+
Sbjct: 300 AVCAQLALKPQASHDLEFVLAWDMPRIQFPRKMQTHMRYYTKYFDDSGESGPKICEYALK 359
Query: 410 EHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVW 453
++ SWE I+AWQRPIL D LP+WY +FN+LY+++ GG +W
Sbjct: 360 QYPSWERLIDAWQRPILSDDTLPDWYKCAIFNQLYFISDGGTIW 403
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 67/157 (42%), Positives = 94/157 (59%), Gaps = 3/157 (1%)
Query: 511 VALNSSFGTNLLQDGEE-NIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQR 569
+ +SS G L D G+F YLEG EY M+NTYDVHFY+S AL L+P +Q+S+Q
Sbjct: 404 LKCDSSLGKELAYDDPRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQY 463
Query: 570 DFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDL 627
DF A+ K+L DG+ + RKV VPHD+G D P+ +N Y ++D WKDL
Sbjct: 464 DFKDAIAAELSDTRKMLYDGKVMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDL 523
Query: 628 NPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
N KFVLQVYRD + ++A S + ++ ++D+
Sbjct: 524 NTKFVLQVYRDYYVLNELAQSQADNASKFSSIEFIDK 560
>gi|195474003|ref|XP_002089281.1| GE19029 [Drosophila yakuba]
gi|194175382|gb|EDW88993.1| GE19029 [Drosophila yakuba]
Length = 947
Score = 204 bits (518), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 111/261 (42%), Positives = 158/261 (60%), Gaps = 12/261 (4%)
Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
+ KA++ M ++D+D DG+IEN PDQTYD+W + G SAY GLW+AALQA
Sbjct: 686 YLKAMYAPCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 745
Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
SA+A + + +K K + EKLWNGSY+ +D S S +I ADQL G WY
Sbjct: 746 SAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLS-HSHRDTIMADQLCGHWY 804
Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
++CG I ++ R+ALK++Y+ NV+ G GA NG + + G VD S++
Sbjct: 805 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDNSNI 864
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
Q+ E+W GV YA+AA+MI E + + FQTA G+Y+ G+ F+TPEA + +YR
Sbjct: 865 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMN--FETPEALYGEKRYR 922
Query: 878 SLCYMRPLAIWAMQWALTRPK 898
S+ YMRPL+IW+MQ AL R +
Sbjct: 923 SIGYMRPLSIWSMQVALERRR 943
Score = 195 bits (495), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 124/425 (29%), Positives = 211/425 (49%), Gaps = 42/425 (9%)
Query: 55 KETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGE 114
++T+ + P+ R + + +G+R ++D + + +GVP+GG+G G+IGR Y GE
Sbjct: 75 RQTLPMVPLVCRYAAYYWKVSREGRRVYMDYYYMENGKQIYGVPIGGIGGGTIGRGYAGE 134
Query: 115 FQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKT--------PEVLKD----- 161
F R+Q+ P + E V ANQF V + G L K P+ D
Sbjct: 135 FCRFQMRPGIYEYNVVQANQFIVTIKDPKGCTIFQSLLSKCSTRDKSSDPDADPDGERTK 194
Query: 162 --------TTAAGIGSWDWNLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISP 209
+ +W N++ + +Y LYPR+WT H G +R+ CRQ+SP
Sbjct: 195 CQLPNCSSRAKQPLSAWHSNIEDSRCSYTGLYPRSWTEYDLSHYG-----VRLTCRQVSP 249
Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
+IPH Y++SS P +VF +++ N +++ F++ N G + S+
Sbjct: 250 VIPHEYRDSSLPCAVFVWSVENVCDQERKVSITFSFKNGTGNKKQDAEGGAESQLISEGN 309
Query: 270 VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGS 329
V + + + P ++ LA + + ++ CP F +GN + +W ++KEHG
Sbjct: 310 AKGVSIRQKIAEM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHGQ 362
Query: 330 FDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRR 388
+ E T + IG A+ A V + P + + F LAWD P++ F +T+ R
Sbjct: 363 LSEQPTSEALKTKD----IGVAVCAQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRY 418
Query: 389 YTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNA 448
YTK++ ++ I A+ ++ +WE I+AWQRPIL D+ LP+WY +FN+LY+++
Sbjct: 419 YTKYFDASGDSGPKICEYALRQYSTWERLIDAWQRPILNDESLPDWYKCAIFNQLYFISD 478
Query: 449 GGAVW 453
GG +W
Sbjct: 479 GGTIW 483
Score = 122 bits (306), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 68/157 (43%), Positives = 94/157 (59%), Gaps = 3/157 (1%)
Query: 511 VALNSSFGTNLLQDGEE-NIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQR 569
+ +SS G L D G+F YLEG EY M+NTYDVHFY+S AL L+P +Q+S+Q
Sbjct: 484 LKCDSSLGKELTYDDPRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQY 543
Query: 570 DFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDL 627
DF A+ K+L DG+ + RKV VPHD+G D P+ +N Y ++D WKDL
Sbjct: 544 DFKDAIAAELSDTRKMLYDGKVMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDL 603
Query: 628 NPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
N KFVLQVYRD + A+A S + ++ ++D+
Sbjct: 604 NTKFVLQVYRDYYVLNELAQAQADNASKFSSIEFIDK 640
>gi|194760205|ref|XP_001962332.1| GF15415 [Drosophila ananassae]
gi|190616029|gb|EDV31553.1| GF15415 [Drosophila ananassae]
Length = 894
Score = 204 bits (518), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 111/263 (42%), Positives = 159/263 (60%), Gaps = 12/263 (4%)
Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
+ KA++ S M ++D+D DG+IEN PDQTYD+W + G S+Y GLW+AALQA
Sbjct: 633 YLKAMYASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSSYCSGLWLAALQAM 692
Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
SA+A + + +K K + EKLWNGSY+ +D S +I ADQL G WY
Sbjct: 693 SAMATILDQPNDCMRYQDILEKGKRSLEEKLWNGSYYRFDLS-HGHRDTIMADQLCGHWY 751
Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
++CG I ++ R+ALK++Y+ NV+ G GA NG + + G VD S++
Sbjct: 752 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANASDPSKPGHVDNSNI 811
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
QS E+W GV YA+AA+MI E + + FQTA G+Y+ G+ F+TPEA + +YR
Sbjct: 812 QSEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTISQRIGMN--FETPEALYGEKRYR 869
Query: 878 SLCYMRPLAIWAMQWALTRPKPK 900
S+ YMRPL+IW+MQ AL R + +
Sbjct: 870 SIGYMRPLSIWSMQVALERRRAQ 892
Score = 198 bits (504), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 129/406 (31%), Positives = 206/406 (50%), Gaps = 45/406 (11%)
Query: 77 KGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFS 136
+G+R ++D + + +GVP+GG+G G+IGR Y GEF R+Q+ P + E V ANQF
Sbjct: 41 EGRRVYMDYYYMENGKQIYGVPIGGIGGGTIGRGYAGEFCRFQMRPGIYEYNVVQANQFI 100
Query: 137 VFVSRSNGQK-YSSVL--C----------PKTPEV-----------LKDTTAAGIGSWDW 172
V + G + S+L C P P+ T + +W
Sbjct: 101 VTIKDPKGCTIFQSLLSKCSTRGSSGGGNPADPDADGEKTKCQLPNCSSRTKQPLSAWHS 160
Query: 173 NLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYT 228
N++ + Y LYPR+WT H G +R+VCRQ+SP+IPH Y+ESS P +VF +
Sbjct: 161 NIEDSRCQYTGLYPRSWTEYDLSHYG-----VRLVCRQVSPVIPHEYRESSLPCAVFVWD 215
Query: 229 IYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTF 288
+ N +++ FT+ N G + S+ V + + + P ++
Sbjct: 216 VENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLISEGNAKGVAIRQKIAEM--PCSY 273
Query: 289 ALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSI 348
LA + + +S CP F +GN + +W ++KEHG + + ET T + I
Sbjct: 274 NLACRVRPEISISRCPQFDPAGN-----GEQLWAQLKEHGQLNEQPTPETLKTKD----I 324
Query: 349 GAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG-KTYYRRYTKFYGTHQNAAANIARDA 407
G A+ A V + P + ++ F LAWD P++ F +T+ R YTK++ + I A
Sbjct: 325 GVAVCAQVALKPKASHKLEFVLAWDMPKIQFPRKLQTHTRYYTKYFDDSGESGPKICEYA 384
Query: 408 ILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVW 453
+ ++ SWE I+AWQRPIL D+ LP+WY +FN+LY+++ GG +W
Sbjct: 385 LKQYPSWERLIDAWQRPILNDETLPDWYKCAIFNQLYFISDGGTIW 430
Score = 119 bits (299), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 63/137 (45%), Positives = 87/137 (63%), Gaps = 2/137 (1%)
Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDG 589
G+F YLEG EY M+NTYDVHFY+S AL L+P +Q+S+Q DF A+ K+L DG
Sbjct: 451 GRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAADLTDTRKMLYDG 510
Query: 590 QWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKF 647
+ + RKV VPHD+G D P+ +N Y ++D WKDLN KFVLQVYRD +
Sbjct: 511 KVMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQ 570
Query: 648 AKAVWPSVYVAMAYMDQ 664
A+A S + ++ ++D+
Sbjct: 571 AQADNASKFSSIEFIDK 587
>gi|194860149|ref|XP_001969520.1| GG23916 [Drosophila erecta]
gi|190661387|gb|EDV58579.1| GG23916 [Drosophila erecta]
Length = 947
Score = 203 bits (516), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 111/261 (42%), Positives = 158/261 (60%), Gaps = 12/261 (4%)
Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
+ KA++ M ++D+D DG+IEN PDQTYD+W + G SAY GLW+AALQA
Sbjct: 686 YLKAMYAPCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 745
Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
SA+A + + +K K + EKLWNGSY+ +D S S +I ADQL G WY
Sbjct: 746 SAMATILDQPNDCLRYQDILEKGKRSLDEKLWNGSYYRFDLS-HSHRDTIMADQLCGHWY 804
Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
++CG I ++ R+ALK++Y+ NV+ G GA NG + + G VD S++
Sbjct: 805 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDNSNI 864
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
Q+ E+W GV YA+AA+MI E + + FQTA G+Y+ G+ F+TPEA + +YR
Sbjct: 865 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMN--FETPEALYGEKRYR 922
Query: 878 SLCYMRPLAIWAMQWALTRPK 898
S+ YMRPL+IW+MQ AL R +
Sbjct: 923 SIGYMRPLSIWSMQVALERRR 943
Score = 198 bits (503), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 128/425 (30%), Positives = 213/425 (50%), Gaps = 42/425 (9%)
Query: 55 KETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGE 114
++T+ + P+ R + + +G+R ++D + + +GVP+GG+G G+IGR Y GE
Sbjct: 75 RQTLPMVPLVCRYAAYYWKVSREGRRVYMDYYYMENGKQIYGVPIGGIGGGTIGRGYAGE 134
Query: 115 FQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK--TPEVLKDTTAAGIG---- 168
F R+Q+ P + E V ANQF V + G L K T + D A G
Sbjct: 135 FCRFQMRPGIYEYNVVQANQFIVTIKDPKGCTIFQSLLSKCSTRDKTSDPDADPDGERTK 194
Query: 169 ---------------SWDWNLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISP 209
+W N++ + +Y LYPR+WT H G +R++CRQ+SP
Sbjct: 195 CQLPNCSSRAKQPLSAWHSNIEDSRCSYTGLYPRSWTEYDLSHYG-----VRLICRQVSP 249
Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
+IPH Y+ESS P +VF +++ N +++ F++ N G + S+
Sbjct: 250 VIPHEYRESSLPCAVFVWSVENVCDQERKVSITFSFKNGTGNKKQDAEGGAESQLISEGN 309
Query: 270 VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGS 329
V + + + P ++ LA + + ++ CP F +GN + +W ++KEHG
Sbjct: 310 AKGVSIRQKIAEM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEPLWAQLKEHGQ 362
Query: 330 FDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRR 388
+ E T + IG A+ A V + P + + F LAWD P++ F +T+ R
Sbjct: 363 LSEQPTSEALKTKD----IGVAVCAQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRY 418
Query: 389 YTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNA 448
YTK++ +A I A+ ++ +WE I+AWQRPIL D+ LP+WY +FN+LY+++
Sbjct: 419 YTKYFDDSGDAGPKICEYALRQYSTWERLIDAWQRPILNDESLPDWYKCAIFNQLYFISD 478
Query: 449 GGAVW 453
GG +W
Sbjct: 479 GGTIW 483
Score = 121 bits (304), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 68/157 (43%), Positives = 94/157 (59%), Gaps = 3/157 (1%)
Query: 511 VALNSSFGTNLLQDGEE-NIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQR 569
+ +SS G L D G+F YLEG EY M+NTYDVHFY+S AL L+P +Q+S+Q
Sbjct: 484 LKCDSSLGKELAYDDPRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQY 543
Query: 570 DFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDL 627
DF A+ K+L DG+ + RKV VPHD+G D P+ +N Y ++D WKDL
Sbjct: 544 DFKDAIAAELGDTRKMLYDGKVMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDL 603
Query: 628 NPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
N KFVLQVYRD + A+A S + ++ ++D+
Sbjct: 604 NTKFVLQVYRDYYVLNELAQAQADNASKFSSIEFIDK 640
>gi|87125468|ref|ZP_01081313.1| hypothetical protein RS9917_01806 [Synechococcus sp. RS9917]
gi|86166768|gb|EAQ68030.1| hypothetical protein RS9917_01806 [Synechococcus sp. RS9917]
Length = 832
Score = 202 bits (515), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 127/383 (33%), Positives = 196/383 (51%), Gaps = 25/383 (6%)
Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL---L 586
G+F LE I+Y + + DV Y S AL+ L+P++ ++ R FA A+ D ++ +
Sbjct: 442 GRFGVLECIDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFARAIPAADATQRPIGWYF 501
Query: 587 DDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR--DV 639
G+ RKV GA PHD+G + PW N D WKDL +VLQV+R +
Sbjct: 502 TQGKGRVEADRKVKGATPHDLGAPNENPWDATNYTAYQDCNLWKDLASDYVLQVWRTFKL 561
Query: 640 VATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
+G D F WP+ A+ Y+ FD + DG+ +N G PDQT+D W + G+SAY G L
Sbjct: 562 SPSGEDINFLAECWPAAVEALTYLKGFDTNNDGLPDNGGAPDQTFDDWPLKGVSAYCGAL 621
Query: 699 WVAALQAASALAREVG-----DRGSEDY-FLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQ 751
W+AAL+AA A+ +++ D +E + F ++++ ++ LWNG Y++ D S
Sbjct: 622 WIAALEAALAMGQQLQLATGLDTAAEQHTFSTWLEQSRANFDTLLWNGEYYDID--AESG 679
Query: 752 SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
+ + ADQL G +YAR L +V + RS LK V GG G NG+ DG
Sbjct: 680 TPVVMADQLCGDFYARLLKLPAVVSDANCRSTLKAVKESCFEAFQGGSLGVANGLRRDGT 739
Query: 812 -VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAW 870
+D E+W+G+ + +A+ D + ++G G F+TPEA
Sbjct: 740 PLDPQGTHPLEVWTGINFGIASYYRLMDDTSTALAITSAVVNQVYAG---GLQFRTPEAI 796
Query: 871 NTDDQYRSLCYMRPLAIWAMQWA 893
+ +R+ Y+R +AIW + WA
Sbjct: 797 TAVNTFRACHYLRAMAIWGL-WA 818
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 116/408 (28%), Positives = 188/408 (46%), Gaps = 44/408 (10%)
Query: 85 PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
P+ R+ ++ +HG+PLGG G+G GR+ G F W + + QF++F
Sbjct: 40 PYTVRYASNLDDGPNHGMPLGGFGAGCFGRAPDGSFNLWHLDGGEHWFGALPDCQFALFE 99
Query: 140 SRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGE 195
+ + ++ + + + +WDW + TY A YP +WT ++G
Sbjct: 100 RNGSSTRAHALAVKPEADASRPDGGEPLAAWDWYPASTPERSTGTYAARYPLSWTHYDGV 159
Query: 196 PDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----- 250
D E+R C SPI+P +Y+ +SYPV+VF +T+ N D++LL +W N+ G
Sbjct: 160 YDAEVR--CEAFSPILPGDYQRTSYPVAVFVWTLRNPTPKPLDLSLLLSWRNTSGWFTNT 217
Query: 251 --------GDSEFTGQHYNSKTKMNDG----------VHAVLLHHRTSHQLPPV--TFAL 290
D +Y DG + V+L S+ + + +
Sbjct: 218 DASAEVHFRDDGSPEHNYAPAIGTTDGQRNRWIDDGNLKGVVLEGNVSNPIAEGEGQWCI 277
Query: 291 AAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGA 350
A E GV + C + G+ ++W + GS N+ S ++P + A
Sbjct: 278 ATAEQPGVTIQRCSRWNPHGD-----GNELWSSFRADGSIPDSNNDRRSGHNDP---LSA 329
Query: 351 AIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILE 410
A+A + P ++ ++WD P F SG RRYT F+G + AA IA +A+ +
Sbjct: 330 ALAVQCQLAPGQSLEIPVVISWDLPVTGFASGSQALRRYTDFFGAEGHQAAAIAAEALRD 389
Query: 411 HGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
W QIEAWQ+P+LE LPE + LFNELY L +GG++W+ +P
Sbjct: 390 WRDWRQQIEAWQQPVLERSDLPEPVRMALFNELYDLCSGGSLWSAATP 437
>gi|195579186|ref|XP_002079443.1| GD23959 [Drosophila simulans]
gi|194191452|gb|EDX05028.1| GD23959 [Drosophila simulans]
Length = 947
Score = 200 bits (508), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 110/263 (41%), Positives = 159/263 (60%), Gaps = 12/263 (4%)
Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
+ KA++ S + ++D+D DG+IEN PDQTYD+W + G SAY GLW+AALQA
Sbjct: 686 YLKAMYASCKAILERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 745
Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
SA+A + + +K K + EKLWNGSY+ +D S S +I ADQL G WY
Sbjct: 746 SAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLS-HSHRDTIMADQLCGHWY 804
Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
++CG I ++ R+ALK++Y+ NV+ G GA NG + + G VD S++
Sbjct: 805 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDNSNI 864
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
Q+ E+W GV YA+AA+MI E + + FQTA G+Y+ G+ F+TPEA + +YR
Sbjct: 865 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMN--FETPEALYGEKRYR 922
Query: 878 SLCYMRPLAIWAMQWALTRPKPK 900
S+ YMRPL+I +MQ AL R + +
Sbjct: 923 SIGYMRPLSILSMQVALERRRAQ 945
Score = 197 bits (501), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 129/425 (30%), Positives = 212/425 (49%), Gaps = 42/425 (9%)
Query: 55 KETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGE 114
++T+ L P+ R + + +G+R ++D + + +GVP+GG+G G+IGR Y GE
Sbjct: 75 RQTLPLVPLVCRYAAYYWKVSREGRRVYMDYYYMENGKQIYGVPIGGIGGGTIGRGYAGE 134
Query: 115 FQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK--TPEVLKDTTAAGIG---- 168
F R+Q+ P + E V ANQF V + G L K T + D A G
Sbjct: 135 FCRFQMRPGIYEYNVVQANQFIVTIKDPKGCTIFQSLLSKCSTRDKTSDPDADPDGERTK 194
Query: 169 ---------------SWDWNLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISP 209
+W N++ + +Y LYPR+WT H G +R+ CRQ+SP
Sbjct: 195 CQLPNCSSRAKQPLSAWHSNIEDSRCSYTGLYPRSWTEYDLSHYG-----VRLTCRQVSP 249
Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
+IPH Y+ESS P +VF +++ N +++ FT+ N G + S+
Sbjct: 250 VIPHEYRESSQPCAVFVWSVENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLISEGN 309
Query: 270 VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGS 329
V + + + P ++ LA + + ++ CP F +GN + +W ++KEHG
Sbjct: 310 AKGVSIRQKIADM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHGQ 362
Query: 330 FDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRR 388
+ E T + IG A+ A V + P + + F LAWD P++ F +T+ R
Sbjct: 363 LSEQPTSEALKTKD----IGVAVCAQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRY 418
Query: 389 YTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNA 448
YTK++ ++ I A+ ++ +WE I+AWQRPIL D+ LP+WY +FN+LY+++
Sbjct: 419 YTKYFDDSGDSGPRICEYALRQYSTWERLIDAWQRPILNDETLPDWYKCAIFNQLYFISD 478
Query: 449 GGAVW 453
GG +W
Sbjct: 479 GGTIW 483
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 68/157 (43%), Positives = 93/157 (59%), Gaps = 3/157 (1%)
Query: 511 VALNSSFGTNL-LQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQR 569
+ SS G L D G+F YLEG EY M+NTYDVHFY+S AL L+P +Q+S+Q
Sbjct: 484 LKCESSLGKELSYDDPRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQY 543
Query: 570 DFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDL 627
DF A+ K+L DG+ + RKV VPHD+G D P+ +N Y ++D WKDL
Sbjct: 544 DFKDAIAAELNDTRKMLYDGKVMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDL 603
Query: 628 NPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
N KFVLQVYRD + A+A S + ++ ++D+
Sbjct: 604 NTKFVLQVYRDYYVLNELAQAQADNASKFSSIEFIDK 640
>gi|85857488|gb|ABC86280.1| RE08574p [Drosophila melanogaster]
Length = 544
Score = 199 bits (505), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 127/425 (29%), Positives = 211/425 (49%), Gaps = 42/425 (9%)
Query: 55 KETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGE 114
++T+ + P+ R + + +G+R ++D + + +GVP+GG+G G+IGR Y GE
Sbjct: 76 RQTLPMVPLVCRYAAYYWKVSREGRRVYMDYYYMENGKQIYGVPIGGIGGGTIGRGYAGE 135
Query: 115 FQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAG-------- 166
F R+Q+ P + E VLANQF V + G L K K + G
Sbjct: 136 FCRFQMRPGIYEYNVVLANQFIVTIKDPKGCTIFQSLLSKCSTRDKTSDPDGDPDGERTK 195
Query: 167 -------------IGSWDWNLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISP 209
+ +W N++ + +Y LYPR+WT H G +R+ CRQ+SP
Sbjct: 196 CQLPNCSSRAKQPLSAWHSNIEDTRCSYTGLYPRSWTEYDLSHYG-----VRLTCRQVSP 250
Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
+IPH Y+ESS P +VF +++ N +++ FT+ N G + S+
Sbjct: 251 VIPHEYRESSLPCAVFVWSVENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLISEGN 310
Query: 270 VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGS 329
V + + S P ++ LA + + ++ CP F +GN + +W ++KEHG
Sbjct: 311 AKGVSIRQKISEM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHGQ 363
Query: 330 FDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRR 388
+ E T + IG A+ V + P + + F LAWD P++ F +T+ R
Sbjct: 364 LSEHPTSEALKTKD----IGVAVCGQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRY 419
Query: 389 YTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNA 448
YTK++ ++ I A+ ++ +WE I+AWQRPIL D+ LP+WY +FN+LY+++
Sbjct: 420 YTKYFDDSGDSGPRICEYALRQYSTWERLIDAWQRPILNDETLPDWYKCAIFNQLYFISD 479
Query: 449 GGAVW 453
GG +W
Sbjct: 480 GGTIW 484
>gi|411117353|ref|ZP_11389840.1| putative bile acid beta-glucosidase [Oscillatoriales cyanobacterium
JSC-12]
gi|410713456|gb|EKQ70957.1| putative bile acid beta-glucosidase [Oscillatoriales cyanobacterium
JSC-12]
Length = 907
Score = 199 bits (505), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 128/409 (31%), Positives = 192/409 (46%), Gaps = 70/409 (17%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVS-RSNGQKYSSVLCP 153
HG+PLGG G+G IGRS++G+F W I +P+ A QFS+F + Q Y+ LC
Sbjct: 41 HGMPLGGFGAGCIGRSHQGDFNLWHIDGGEHVFQPMPACQFSIFEQVEQDAQAYA--LCT 98
Query: 154 KTPEVLKDTTAAGIGSWDWN--------------------LKGDKSTYHALYPRAWTVHE 193
+ P T + +W W L TYHALYPR+W ++
Sbjct: 99 EAP------TDGSLAAWSWYPTVEDADELGETSLDQARSLLSPHTGTYHALYPRSWFTYQ 152
Query: 194 GEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS 253
+L C Q SPI+PHNY+E+SYPV++F +T++N ++L+ TW N G
Sbjct: 153 NVFLADL--TCEQFSPILPHNYQETSYPVAIFEWTVHNPTNQPLTLSLMLTWQNMAGW-- 208
Query: 254 EFTGQHYNSKTKMND-------------------------GVH-AVLLHHRTSHQLPP-- 285
FT + +M D G H A+++ + +L
Sbjct: 209 -FTNASKSPDVQMRDDGSPVYDYQPRLYESAGNWNQIVKEGQHVAIVMAGARNQELEEGD 267
Query: 286 VTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPG 345
+A+A + V H+ G+ D+W ++GS +++ + E
Sbjct: 268 GQWAIAVKLPTQAEVFYHTHWNPVGD-----GADIWQSFSQNGSLTNVSNNTPAAKDE-- 320
Query: 346 SSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIAR 405
IG AIA + P Q+ F L+WD P + F G TY++RYT F+ A IA+
Sbjct: 321 -QIGGAIAVRFMLQPGETWQIPFCLSWDFPVIEFAEGVTYFKRYTDFFDRSGQNAWTIAQ 379
Query: 406 DAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
A+ + +W+ QI WQ+PILE LP+W+ + LFNELY L GG +WT
Sbjct: 380 TALNHYKTWQEQIHTWQQPILERNDLPDWFKMALFNELYNLTDGGTLWT 428
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 107/295 (36%), Positives = 159/295 (53%), Gaps = 38/295 (12%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLD- 587
+GQF LE I+Y + + DV Y SFAL+MLFP+++ ++ R FA A+ D + +++
Sbjct: 436 VGQFAVLECIDYRWYESLDVRLYGSFALLMLFPELEKAVIRAFARAIPASD-DRTRIIGY 494
Query: 588 ------DGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDV 639
+ RKV GA PHD+G + W + N D WKDL FVLQVYRD
Sbjct: 495 YYTIGAESPQAVRKVAGATPHDLGAPNEHVWQKTNYTSYQDCNLWKDLPCDFVLQVYRDF 554
Query: 640 VATGDKK--FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGG 697
+ TG K F + W ++ + Y+ FD D DG+ EN G PDQT+D W + GISAY GG
Sbjct: 555 LYTGSKDIAFLQECWSAIAQTLHYLKTFDYDFDGIPENSGAPDQTFDDWKLDGISAYCGG 614
Query: 698 LW-----------------VAALQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNG 739
LW ++ + L+ ++G+ F ++A+ +Y + LWNG
Sbjct: 615 LWIAALEAALAIGRILAEYTTNMRQLATLSNDIGN------FQVWLEQARSIYHDTLWNG 668
Query: 740 SYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
Y+ D+ +S S + ADQL GQ+YAR L +VD + A++ALK +Y+ +K
Sbjct: 669 QYYRLDS--NSCSEVVMADQLCGQFYARLLKLPDVVDLNCAQTALKTIYDTCFIK 721
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 49/91 (53%), Gaps = 4/91 (4%)
Query: 801 GAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTG 859
GA NG+ PDG + +S E+W+G+ + +AA ++ + D + + + ++
Sbjct: 811 GAANGVKPDGSPENPNSTHPLEVWTGINFGLAAFLVQMGMRDEAMELTQAVVQQIYTN-- 868
Query: 860 LGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
G F+TPEA +R+ Y+R LAIWA+
Sbjct: 869 -GLQFRTPEAITPSGTFRACHYLRALAIWAI 898
>gi|148240031|ref|YP_001225418.1| hypothetical protein SynWH7803_1695 [Synechococcus sp. WH 7803]
gi|147848570|emb|CAK24121.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
Length = 833
Score = 197 bits (500), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 138/388 (35%), Positives = 210/388 (54%), Gaps = 25/388 (6%)
Query: 525 GEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMK 584
G++ G+F LE ++Y + + DV Y SFAL+ L+P++ ++ R FA A+ D ++
Sbjct: 438 GKDPHGRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVLRSFARAIPAADATQRP 497
Query: 585 L---LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVY 636
+ G+ RKV GA PHD+G + P+ N D WKDL FVLQV+
Sbjct: 498 IGWYFTQGRGRVEADRKVKGATPHDLGAPNEVPFDATNYTAYQDCNLWKDLASDFVLQVW 557
Query: 637 R--DVVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISA 693
R + TG D KF WP+ A+ Y+ +FD + DG+ +N G PDQT+D W + G+SA
Sbjct: 558 RTFKLAPTGEDLKFLADCWPAAVQALHYLKRFDVNNDGLPDNGGAPDQTFDDWPLKGVSA 617
Query: 694 YSGGLWVAALQAASA----LAREVG-DRGSEDY-FLFKFQKAKVVYEK-LWNGSYFNYDN 746
Y G LW+AAL+AA A L +E+G D GS+ + F ++++ +++ LWNG Y++ D
Sbjct: 618 YCGALWIAALEAALAMAQRLQQELGLDTGSDQHTFSAWLEQSRSNFDRLLWNGEYYDID- 676
Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
S + + ADQL G +YAR GL P+V E +RS LK V K GG G NG+
Sbjct: 677 -AESGTPVVMADQLCGDFYARLLGLEPVVSEANSRSTLKAVKESCFEKFQGGSLGVANGL 735
Query: 807 LPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
DG +D + E+W+G+ + +A+ + + +SG G F+
Sbjct: 736 RRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGEGQTAEAICSAVVDQVYSG---GLQFR 792
Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAMQWA 893
TPEA + +R+ Y+R +AIW + WA
Sbjct: 793 TPEAITAVNTFRACHYLRAMAIWGL-WA 819
Score = 179 bits (453), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 122/393 (31%), Positives = 190/393 (48%), Gaps = 40/393 (10%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W + + QF++F S + Q+ ++
Sbjct: 55 HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHWYGTIPDCQFALFESNGSSQRAHALAVKP 114
Query: 155 TPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
+ + +A + +WDW + TY A YP +WT +EG D E + C SPI
Sbjct: 115 EADASRPNSAEPLAAWDWYPASTPERSTGTYAARYPLSWTHYEGVYDAE--VSCEAFSPI 172
Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSE 254
+P +Y+ +SYPV+VF +T+ N D++LL +W N+VG G E
Sbjct: 173 LPGDYQRTSYPVAVFRWTLRNPSDQPLDLSLLLSWRNTVGWFFNTDPSAEVHFRDDGSPE 232
Query: 255 FT-----GQHYNSKTKMND--GVHAVLLHHRTSHQLPPVT--FALAAQET-DGVHVSLCP 304
GQ + + D GV ++L + S + + LA + +GV V C
Sbjct: 233 HNYAPAIGQGEGQRNRWLDQEGVRGIVLEGQRSEPIGEGQGQWCLAVPDALEGVEVLRCS 292
Query: 305 HFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEG 364
+ +G+ +++W G R+ + +S G AAIA T+ P
Sbjct: 293 RWDPTGD-----GQELWQSFAAEG---RIPNSNNDRSSRAGEQASAAIALKFTLAPGESR 344
Query: 365 QVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRP 424
++ +++WD P F +G RRYT F+G AA +A +A+ + W QI+AWQ P
Sbjct: 345 EIPIAISWDLPVTAFATGVRDLRRYTDFFGADGCNAAALAAEALRDWRQWRDQIDAWQAP 404
Query: 425 ILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
+L + LPE + LFNELY L +GG++WT S
Sbjct: 405 VLARQELPEALRMALFNELYDLASGGSLWTAAS 437
>gi|318042148|ref|ZP_07974104.1| hypothetical protein SCB01_10580 [Synechococcus sp. CB0101]
Length = 836
Score = 195 bits (496), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 134/387 (34%), Positives = 205/387 (52%), Gaps = 25/387 (6%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
E+ +G+F LE ++Y + + DV Y S AL+ L+P++ ++ R FA A+ DPS +
Sbjct: 442 EDPVGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFARAIPAADPSPRPI 501
Query: 586 ---LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
G+ +RKV GA PHD+G + PW N D WKDL +VLQV+R
Sbjct: 502 GWYFTQGKGRVEAARKVKGATPHDLGAPNERPWDATNYTAYQDCNLWKDLASDYVLQVWR 561
Query: 638 --DVVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAY 694
+ G D F WP+ A+ Y+ FD + DG+ +N G PDQT+D W + G+SAY
Sbjct: 562 TFKLAPNGEDINFLAECWPAAVEALHYLKGFDANNDGLPDNGGAPDQTFDDWPLKGVSAY 621
Query: 695 SGGLWVAALQAASALAREVG-----DRGSEDY-FLFKFQKAKVVYEK-LWNGSYFNYDNS 747
G LW+AAL+AA A+A+++ D SE + F ++++ ++ LWNG Y++ D
Sbjct: 622 CGALWIAALEAALAIAQQLQLSTGLDTASEQHTFSSWLEQSRANFDHLLWNGEYYDID-- 679
Query: 748 GSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGML 807
S + + ADQL G +YAR GL +V E +RS LK V + GGK G NG+
Sbjct: 680 AESGTPVVMADQLCGDFYARLLGLPAVVSEANSRSTLKAVKEACFEQFEGGKLGVANGLR 739
Query: 808 PDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQT 866
DG +D + E+W+G+ + +A+ D + ++G G F+T
Sbjct: 740 RDGTPLDPNGTHPLEVWTGINFGIASYYRLMGDTDTALAITAAVVNQVYAG---GLQFRT 796
Query: 867 PEAWNTDDQYRSLCYMRPLAIWAMQWA 893
PEA + +R+ Y+R +AIW + WA
Sbjct: 797 PEAITAVNTFRACHYLRAMAIWGV-WA 822
Score = 177 bits (448), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 124/412 (30%), Positives = 198/412 (48%), Gaps = 54/412 (13%)
Query: 85 PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
P+ R+ ++ +HG+PLGG G+G IGR+ G F W + + QF++F
Sbjct: 44 PYTVRYASNLDDGPNHGMPLGGFGAGCIGRAPDGSFNLWHLDGGEHWFGVLPDCQFALF- 102
Query: 140 SRSNGQKYSSVLCPKTPE--VLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHE 193
SNGQ+ + PE + A + SW W + TY A YP +W+ ++
Sbjct: 103 -ESNGQQSRAHALAVQPERDASRPDAGAPLPSWSWYPASTAERTTGTYAARYPLSWSHYQ 161
Query: 194 GEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG--- 250
G D E + C+ SPI+P NY+++SYP++VF +T++N + D++LL +W N+ G
Sbjct: 162 GVFDAE--VSCQAFSPILPGNYQQTSYPLAVFVWTLHNPTRKPLDLSLLLSWRNTSGWFT 219
Query: 251 -------------GDSEFT-----GQHYNSKTK-MNDG-VHAVLLHHRTSHQLPPVT--- 287
G E G + + ++DG + V+L S+ PV
Sbjct: 220 NTDSSAEVHFRDDGSPEHNYAPAIGTTEGQRNRWVDDGPLKGVVLERDVSN---PVAEGE 276
Query: 288 --FALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPG 345
+ +A E GV + C + G+ ++W +GS N+ S S+P
Sbjct: 277 GQWCIATGEQPGVRIQRCSRWNPHGD-----GSELWSSFSANGSIPDSNNDRRSSQSDPA 331
Query: 346 SSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIAR 405
S AA+A + P ++ ++WD P F +G + RRYT F+G+ A IA
Sbjct: 332 S---AALAVQCRLEPGQSIEIPVVISWDLPVTAFATGTSALRRYTDFFGSSGGNATAIAA 388
Query: 406 DAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
+ + + W QIEAWQ+P+LE LPE + LFNELY L +GG++W+ S
Sbjct: 389 EGLRDWRRWREQIEAWQQPVLERSDLPEPLRMALFNELYDLCSGGSLWSAAS 440
>gi|87301122|ref|ZP_01083963.1| hypothetical protein WH5701_14591 [Synechococcus sp. WH 5701]
gi|87284090|gb|EAQ76043.1| hypothetical protein WH5701_14591 [Synechococcus sp. WH 5701]
Length = 830
Score = 192 bits (489), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 135/386 (34%), Positives = 206/386 (53%), Gaps = 25/386 (6%)
Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL- 585
+ +G+F LE ++Y + + DV Y SFAL+ L+P++ ++ R FA A+ D + +
Sbjct: 436 DPVGRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVLRSFARAIPAADATPRPIG 495
Query: 586 --LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR- 637
G+ RKV GA PHD+G + P+ N D WKDL FVLQV+R
Sbjct: 496 WYFTQGRGRVEAPRKVAGATPHDLGAPNERPFDATNYTAYQDCNLWKDLASDFVLQVWRT 555
Query: 638 -DVVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYS 695
+ TG D +F WP+V A+ Y+ QFD + DG+ +N G PDQT+D W + G+SAY
Sbjct: 556 FSLAPTGEDLRFLADCWPAVVEALRYLKQFDVNDDGLPDNGGAPDQTFDDWPLQGVSAYC 615
Query: 696 GGLWVAALQAASA----LAREVG-DRGSEDY-FLFKFQKAKVVYEK-LWNGSYFNYDNSG 748
G LW+AAL+AA A L ++G D G+E F + ++ +++ LWNG Y+ D
Sbjct: 616 GALWIAALEAALAMAQRLQLDLGLDTGAEQREFGSWLEPSRANFDRLLWNGEYYRID--A 673
Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
S + + ADQL G +YAR GL +V +++A S+LK + + GG+ G NG+
Sbjct: 674 GSGTPVVMADQLCGDFYARLLGLPQVVSDERALSSLKAIREACFERFEGGRLGVANGLRR 733
Query: 809 DGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTP 867
DG +D E+W+G+ + +AA D + ++G G F+TP
Sbjct: 734 DGTPLDPKGTHPLEVWTGINFGLAAYYRLMGQTDTALAITSAVVGQVYAG---GLQFRTP 790
Query: 868 EAWNTDDQYRSLCYMRPLAIWAMQWA 893
EA +R+ Y+R +AIWA+ WA
Sbjct: 791 EAITAVGTFRACHYLRAMAIWAL-WA 815
Score = 176 bits (445), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 123/405 (30%), Positives = 191/405 (47%), Gaps = 43/405 (10%)
Query: 85 PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
P+ R+ ++ +HG PLGG G+G IGR G F W + + QF++F
Sbjct: 40 PYTVRYASNLDDGPNHGAPLGGFGAGCIGRGPDGSFNLWHLDGGEHWFGVLPDCQFALFE 99
Query: 140 SRSNGQKYSSV-LCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
+ ++ + P+ + + + +G W W L+G+ TY A YP W ++
Sbjct: 100 HDGTASRARALAMAPERDDSRPEASERPLGGWQW-LEGEAGTYAARYPLHWFDYQDNFHT 158
Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG-------- 250
E + C SPI+P +Y+ +SYPV+VF + + NS K D++LL +W N+VG
Sbjct: 159 E--VSCEAFSPILPGDYQRTSYPVAVFRWRLANSTKKPLDLSLLLSWRNTVGWFTNTDPS 216
Query: 251 --------GDSEFT-----GQHYNSKTKMND--GVHAVLLHHRTSHQLPPV--TFALAA- 292
G E G + ++ D G+ VLL S + + LA
Sbjct: 217 AAVTFRDDGSPEHNYLPAIGAGVGQRNRLIDAPGLKGVLLEGAPSQPIAEGEGQWCLAVP 276
Query: 293 QETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAI 352
+ +GV V C + SG+ ++W GS N+ S SE S AAI
Sbjct: 277 DQLEGVEVMRCSRWDPSGD-----GAEIWEPFARDGSIPDSNNDRRSTGSEQAS---AAI 328
Query: 353 AASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHG 412
A + + P +E ++ ++WD P F +G + RRYT F+G +A IA +A+ +
Sbjct: 329 ALRLRLEPGAEIELPVVISWDLPVTAFATGTSALRRYTDFFGAAGTSAVAIAAEALRDWR 388
Query: 413 SWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
W IEAWQ+P+LE LPE + L NELY L +GG++WT S
Sbjct: 389 QWRDAIEAWQQPVLERSDLPEPLRMALLNELYDLTSGGSLWTAAS 433
>gi|427702545|ref|YP_007045767.1| bile acid beta-glucosidase [Cyanobium gracile PCC 6307]
gi|427345713|gb|AFY28426.1| putative bile acid beta-glucosidase [Cyanobium gracile PCC 6307]
Length = 835
Score = 192 bits (487), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 127/386 (32%), Positives = 198/386 (51%), Gaps = 25/386 (6%)
Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL- 585
+ +G+F LE ++Y + + DV Y SFAL+ L+P++ ++ R FA A+ DP+ +
Sbjct: 441 DPVGRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVLRSFARAIPAADPTPRPIG 500
Query: 586 --LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR- 637
G+ +RKV GA PHD+G + P+ N D WKDL +VLQV+R
Sbjct: 501 WYFTQGRGRVEAARKVAGATPHDLGAPNERPFDATNYTAYQDCNLWKDLASDYVLQVWRT 560
Query: 638 -DVVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYS 695
+ +G D +F WP+ A+ Y+ QFD + DG+ +N G PDQT+D W + G+SAY
Sbjct: 561 YSLAPSGPDIRFLADCWPAAVTALDYLKQFDVNDDGLPDNGGAPDQTFDDWPLQGVSAYC 620
Query: 696 G----GLWVAALQAASALAREVGDRGSEDYFLFK--FQKAKVVYEK-LWNGSYFNYDNSG 748
G AAL L E+G +E+ F ++++ ++ LWNG Y+ D
Sbjct: 621 GALWIAALEAALAIGQRLQLELGLDTAEEQRRFGGWLEQSRSHFDTLLWNGEYYAID--A 678
Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
S + + ADQL G +YAR L P+V +++A S+LK + GG+ G NG+
Sbjct: 679 GSGTPVVMADQLCGDFYARLLALPPVVADERALSSLKAIRQACFESFEGGRLGVANGLRR 738
Query: 809 DGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTP 867
DG +D E+W+G+ + +AA + F + + G G F+TP
Sbjct: 739 DGTPLDPEGTHPLEVWTGINFGLAAYYRLMGQSSTAFAITGAVVRQVYEG---GLQFRTP 795
Query: 868 EAWNTDDQYRSLCYMRPLAIWAMQWA 893
EA +R+ Y+R +AIWA+ WA
Sbjct: 796 EAITAAGTFRACHYLRAMAIWAL-WA 820
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 107/413 (25%), Positives = 179/413 (43%), Gaps = 49/413 (11%)
Query: 85 PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
P+ R+ ++ +HG+PLGG G+G +GR G F W + + QF++F
Sbjct: 37 PYTVRYASNLDDGPNHGMPLGGFGAGCLGRGLDGAFNLWHLDGGEHWFGVLPDCQFALFE 96
Query: 140 SRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGE 195
+G++ ++ + + + + SW+W + T YP + H
Sbjct: 97 QDGDGRRAHALATAPQADDSRPGSGPPLASWNWYPAATPEQPSGTVAVRYPI--SSHHYA 154
Query: 196 PDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----- 250
+ C SPI+P +Y+ +SYPV+VF +T N +++LL +W N+VG
Sbjct: 155 AVFSADVRCEAFSPILPGDYRRTSYPVAVFAWTFSNPTNQPLELSLLLSWRNTVGWFTNT 214
Query: 251 -----------GDSEFT-----GQHYNSKTKMNDG--VHAVLLHHRTSHQLPP------- 285
G E G+ + + DG + +LL + L
Sbjct: 215 DPAAEVHFRDDGSPEHNYVPAIGRGKGQRNRWVDGAGLKGLLLEGERTQPLAEGEGQWCL 274
Query: 286 VTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPG 345
AA GV V C + SG+ ++W GS ++ S +P
Sbjct: 275 AVPDEAALNHPGVEVFRCSRWDPSGD-----GAELWEPFSRDGSIPDSDNDRRSTGDDP- 328
Query: 346 SSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIAR 405
+ AA+A + P + ++ ++WD P F +G RRYT F+ ++AA IA
Sbjct: 329 --LSAALAVRFRLEPGASLEIPVVISWDLPVTAFATGTRALRRYTDFFSAEGDSAAAIAG 386
Query: 406 DAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
+A+ + W+ IEAWQ P+++ L E + L NELY L +GG++WT SP
Sbjct: 387 EALADWRDWQAAIEAWQAPVVQRHDLAEPLRMALLNELYDLASGGSLWTAASP 439
>gi|317969095|ref|ZP_07970485.1| hypothetical protein SCB02_06140 [Synechococcus sp. CB0205]
Length = 837
Score = 191 bits (484), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 130/387 (33%), Positives = 199/387 (51%), Gaps = 25/387 (6%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
E+ +G+F LE ++Y + + DV Y S AL+ L+P++ ++ R FA A+ D + +
Sbjct: 443 EDPVGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFARAIPAADATPRPI 502
Query: 586 ---LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
G+ +RKV GA PHD+G + PW N D WKDL +VLQV+R
Sbjct: 503 GWYFTQGKGRVEAARKVKGATPHDLGAPNEKPWDATNYTAYQDCNLWKDLASDYVLQVWR 562
Query: 638 --DVVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAY 694
+ G D +F WP+ A+ Y+ FD + DG+ +N G PDQT+D W + G+SAY
Sbjct: 563 TFKLAPNGEDLRFLADCWPAAVEALRYLKTFDINNDGLPDNGGAPDQTFDDWPLKGVSAY 622
Query: 695 SGGLWVAALQAASA------LAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNS 747
G LW+AAL AA A L+ + + F +A+ +++ LWNG Y++ D
Sbjct: 623 CGALWIAALDAALAIAQQLQLSTGLDTSAEQQEFSAWLAQARGNFDRLLWNGEYYDID-- 680
Query: 748 GSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGML 807
S + + ADQL G +YAR GL P+V E +RS L V + K GG G NG+
Sbjct: 681 AESGTPVVMADQLCGDFYARLLGLEPVVSEANSRSTLNAVKDSCFEKFEGGSLGVANGLR 740
Query: 808 PDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQT 866
DG +D + E+W+G+ + +A+ + + +SG G F+T
Sbjct: 741 RDGTPLDPNGTHPLEVWTGINFGIASYFRLMGETETALAITSAVVNQVYSG---GLQFRT 797
Query: 867 PEAWNTDDQYRSLCYMRPLAIWAMQWA 893
PEA + +R+ Y+R +AIW + WA
Sbjct: 798 PEAITAVNTFRACHYLRAMAIWGV-WA 823
Score = 182 bits (463), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 126/424 (29%), Positives = 201/424 (47%), Gaps = 54/424 (12%)
Query: 75 AAKGKRAF----IDPFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVC 125
AA RAF P+ R+ ++ +HG+PLGG G+G IGR+ G F W +
Sbjct: 31 AASWSRAFGQGWASPYTVRYASNLDDGPNHGMPLGGFGAGCIGRAPDGSFNLWHLDGGEH 90
Query: 126 EDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDW----NLKGDKSTY 181
+ QFS+F S Q+ ++ + + A + SW W + + TY
Sbjct: 91 WFGVLPDCQFSLFESNGTEQRAHALAVKPNRDDSRPEAADPLSSWSWYPASSEQRQTGTY 150
Query: 182 HALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITL 241
A YP +W+ ++G + E + C SPI+P NY+++SYP++ F +T+ N + + D++L
Sbjct: 151 AARYPLSWSHYQGVFEAE--VSCEAFSPILPGNYQQTSYPLATFVWTLRNPTRQALDLSL 208
Query: 242 LFTWTNSVG----------------GDSEFTGQHYNSKTK------MNDG-VHAVLLHHR 278
+ +W N+ G G E K++ ++DG + V+L
Sbjct: 209 MLSWRNTCGWFTNTDSSAEVHFRDDGSPEHNYAPAIGKSEGHRNDWIDDGSLKGVVLGGD 268
Query: 279 TSHQLPPVT-----FALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRL 333
S PV + +A QE GV + C + G+ D+W GS
Sbjct: 269 VSS---PVAEGEGQWCIATQEQAGVSIQRCSRWNPQGD-----GSDLWGSFSRDGSIPES 320
Query: 334 NSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFY 393
N+ S +EP + AA+A + P ++ ++WD P +F +G + RRYT F+
Sbjct: 321 NNNRRSGAAEP---VSAALAVRCRLEPGQSIEIPVVISWDLPVTSFATGTSALRRYTDFF 377
Query: 394 GTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVW 453
G AA IA + + + SW QIEAWQ+P+LE LPE + LFNELY L +GG++W
Sbjct: 378 GASGKNAAAIAAEGLRDWRSWRQQIEAWQQPVLERSDLPEPLRMALFNELYDLCSGGSLW 437
Query: 454 TDGS 457
+ S
Sbjct: 438 SAAS 441
>gi|88809081|ref|ZP_01124590.1| hypothetical protein WH7805_05296 [Synechococcus sp. WH 7805]
gi|88787023|gb|EAR18181.1| hypothetical protein WH7805_05296 [Synechococcus sp. WH 7805]
Length = 849
Score = 190 bits (483), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 135/388 (34%), Positives = 205/388 (52%), Gaps = 25/388 (6%)
Query: 525 GEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMK 584
G++ G+F LE ++Y + + DV Y S L+ L+P++ ++ R FA A+ D ++
Sbjct: 453 GKDPHGRFGVLECLDYAWYESLDVRLYGSLGLLQLWPELDKAVLRSFARAIPAADATQRP 512
Query: 585 L---LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVY 636
+ G+ RKV GA PHD+G + P+ N D WKDL FVLQV+
Sbjct: 513 IGWYFTQGRGRVEADRKVEGATPHDLGAPNEVPFDATNYTAYQDCNLWKDLASDFVLQVW 572
Query: 637 R--DVVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISA 693
R + TG D F WP+ A+ Y+ +FD + DG+ +N G PDQT+D W + G+SA
Sbjct: 573 RTFKLAPTGEDLTFLADCWPAAVQALHYLKRFDVNNDGLPDNGGAPDQTFDDWPLKGVSA 632
Query: 694 YSGGLWVAALQAASA----LAREVG-DRGSEDY-FLFKFQKAKVVYEK-LWNGSYFNYDN 746
Y G LW+AAL+AA A L +E+G D GSE + F ++++ ++ LWNG Y++ D
Sbjct: 633 YCGALWIAALEAALAMAQRLQQELGLDTGSEQHTFSNWLEQSRANFDALLWNGEYYDID- 691
Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
S + + ADQL G +YAR GL P+V + +RS LK V K GG G NG+
Sbjct: 692 -AESGTPVVMADQLCGDFYARLLGLEPVVSDANSRSTLKAVKESCFEKFQGGSLGVANGL 750
Query: 807 LPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
DG +D + E+W+G+ + +A+ + +SG G F+
Sbjct: 751 RRDGTPLDPNGTHPLEVWTGINFGIASYYQLMGEGQTAEAICSAVVHQVYSG---GLQFR 807
Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAMQWA 893
TPEA + +R+ Y+R +AIW + WA
Sbjct: 808 TPEAITAVNTFRACHYLRAMAIWGL-WA 834
Score = 189 bits (479), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 122/393 (31%), Positives = 194/393 (49%), Gaps = 40/393 (10%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG+PLGG G+G IGRS RG+F W + + QFS+F + + ++ ++
Sbjct: 70 HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHWYGTIPDCQFSLFENNGSSKRAHALALKP 129
Query: 155 TPEVLKDTTAAGIGSWDW--NLKGDKST--YHALYPRAWTVHEGEPDPELRIVCRQISPI 210
+ + + + +W+W G++ST Y A YP +WT +EG D E++ C SPI
Sbjct: 130 EADASRSNSGEPLAAWNWYPASTGEQSTGTYAARYPLSWTHYEGVFDAEVQ--CEAFSPI 187
Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSE 254
+P +Y+ +SYPV+VF +T+ N D++LL +W N+VG G E
Sbjct: 188 LPGDYQRTSYPVAVFRWTLRNPTDQPLDLSLLLSWRNTVGWFFNTDPSAEVHFRDDGSPE 247
Query: 255 FT-------GQHYNSKTKMNDGVHAVLLHHRTSHQLPPVT--FALAAQET-DGVHVSLCP 304
G+ ++ +G+ ++L + S + + LA +T DGV V C
Sbjct: 248 HNYAPAIGHGEGQRNRWLDQEGIRGIVLEGQRSEPIGEGQGQWCLAVPDTLDGVEVLRCS 307
Query: 305 HFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEG 364
+ +G+ +++W G R+ + S G AAIA + T+ P
Sbjct: 308 RWDPTGD-----GQELWQPFAAEG---RIPNSNNDRASRAGEHASAAIALTFTLAPGESR 359
Query: 365 QVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRP 424
++ +++WD P F SG RRYT F+G AA +A +A+ + W QI+AWQ P
Sbjct: 360 EIPIAISWDLPVTAFASGVRDLRRYTDFFGADGRNAAAMATEALRDWRHWRDQIDAWQAP 419
Query: 425 ILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
+L K LPE + LFNELY L +GG++WT S
Sbjct: 420 VLARKELPEALRMALFNELYDLASGGSLWTAAS 452
>gi|113953483|ref|YP_730032.1| hypothetical protein sync_0818 [Synechococcus sp. CC9311]
gi|113880834|gb|ABI45792.1| conserved hypothetical protein [Synechococcus sp. CC9311]
Length = 833
Score = 189 bits (481), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 135/383 (35%), Positives = 202/383 (52%), Gaps = 25/383 (6%)
Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL---L 586
G+F LE ++Y + + DV Y SFAL+ L+P++ ++ R FA A+ D ++ +
Sbjct: 443 GRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVIRSFARAIPAADATQRPIGWYF 502
Query: 587 DDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR--DV 639
G+ RKV GA PHD+G + P+ N D WKDL +VLQV+R +
Sbjct: 503 TQGRGRVEADRKVKGATPHDLGAPNEVPFDATNYTAYQDCNLWKDLASDYVLQVWRTFKL 562
Query: 640 VATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TG D F WP+ A+ Y+ QFD + DG+ +N G PDQT+D W + G+SAY G L
Sbjct: 563 APTGEDLSFLAECWPAAVQALHYLKQFDVNDDGLPDNGGAPDQTFDDWPLKGVSAYCGAL 622
Query: 699 WVAALQAASA----LAREVG-DRGSEDY-FLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQ 751
W+AAL+AA A L E+G D G E + F ++++ ++K LWNG Y++ D S
Sbjct: 623 WIAALEAALAMAQRLQLELGLDTGDEQHTFSGWLEQSRANFDKLLWNGEYYDID--AESG 680
Query: 752 SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
+ + ADQL G +YAR GL P+V E +RS LK V GG G NG+ DG
Sbjct: 681 TPVVMADQLCGDFYARLLGLPPVVSEANSRSTLKAVKEACFDNFAGGSLGVANGLRRDGT 740
Query: 812 -VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAW 870
+D + E+W+G+ + +A+ + ++G G F+TPEA
Sbjct: 741 PLDPNGTHPLEVWTGINFGIASYYRLMGEGKTAEAICSAVVTQVYAG---GLQFRTPEAI 797
Query: 871 NTDDQYRSLCYMRPLAIWAMQWA 893
+ +R+ Y+R +AIW + WA
Sbjct: 798 TAVNTFRACHYLRAMAIWGL-WA 819
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 134/447 (29%), Positives = 206/447 (46%), Gaps = 65/447 (14%)
Query: 60 LAPIGVRILC--LIREEAAKG--------KRAF----IDPFIKRHLTSS-----HGVPLG 100
+AP+G+ L L R + KG R+F PF R+ ++ HG+PLG
Sbjct: 1 MAPLGLSALRSRLSRGSSGKGWQPPEASWSRSFGLGWQTPFTVRYASNLDDGPWHGMPLG 60
Query: 101 GVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLK 160
G G+G IGRS RG+F W + + QF+++ + + + ++ + +
Sbjct: 61 GFGAGCIGRSSRGDFNLWHLDGGEHWYGSIPDCQFALWEKQGDQVRVHALATEPARDDSR 120
Query: 161 DTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYK 216
+ + SW W + TY A YP +W ++G E + C SPI+P +Y+
Sbjct: 121 PESGKPLSSWQWYPASTEENSTGTYAARYPLSWNHYKGVFRAE--VSCEAFSPILPGDYQ 178
Query: 217 ESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFT---- 256
SSYPV+VF +T+ N +++LL +W N+VG G E
Sbjct: 179 RSSYPVAVFRWTLTNPTNKPLEVSLLLSWRNTVGWFTNTDASAEVHFRDDGSPEHNYAPA 238
Query: 257 ---GQHYNSKTKMNDGVHAVLLHHRTSHQLPPVT-----FALAAQET-DGVHVSLCPHFV 307
G+ +++ DG+ VLL + S PV + LA +T +GV + C +
Sbjct: 239 IGDGEGQSNRWIDGDGLSGVLLDGKRST---PVAEGEGQWCLALPDTLEGVELMRCSRWD 295
Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
S + ++W G N+ S E S AAIA T+ P ++
Sbjct: 296 PSSD-----GAELWQPFAADGVIPDSNNDRASRKGEHAS---AAIAVKFTLAPGETREIP 347
Query: 368 FSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
+++WD P +F +G RRY FYG AA IA +A+ + SW QIEAWQ P+L
Sbjct: 348 VAISWDLPVTSFATGVRDLRRYCDFYGVDGCHAAAIASEALRDWRSWHQQIEAWQEPVLA 407
Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWT 454
K LPE + LFNELY L +GG++WT
Sbjct: 408 RKELPEELRMALFNELYDLASGGSLWT 434
>gi|295828610|gb|ADG37974.1| AT1G33700-like protein [Capsella grandiflora]
Length = 105
Score = 189 bits (481), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 88/105 (83%), Positives = 95/105 (90%)
Query: 608 DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDR 667
DPWFEVNAY L++T RWKDLN KFVLQVYRDVVATGD FAKAVWPSVY A+AY+DQFD+
Sbjct: 1 DPWFEVNAYNLFNTNRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 60
Query: 668 DGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALARE 712
DGDGMIEN+GFPDQTYD WS SG+SAY GGLWVAALQA SALARE
Sbjct: 61 DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALARE 105
>gi|295828604|gb|ADG37971.1| AT1G33700-like protein [Capsella grandiflora]
gi|295828608|gb|ADG37973.1| AT1G33700-like protein [Capsella grandiflora]
Length = 105
Score = 189 bits (480), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 88/105 (83%), Positives = 95/105 (90%)
Query: 608 DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDR 667
DPWFEVNAY L++T RWKDLN KFVLQVYRDVVATGD FAKAVWPSVY A+AY+DQFD+
Sbjct: 1 DPWFEVNAYNLFNTBRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 60
Query: 668 DGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALARE 712
DGDGMIEN+GFPDQTYD WS SG+SAY GGLWVAALQA SALARE
Sbjct: 61 DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALARE 105
>gi|295828602|gb|ADG37970.1| AT1G33700-like protein [Capsella grandiflora]
gi|295828606|gb|ADG37972.1| AT1G33700-like protein [Capsella grandiflora]
gi|295828612|gb|ADG37975.1| AT1G33700-like protein [Capsella grandiflora]
gi|295828614|gb|ADG37976.1| AT1G33700-like protein [Neslia paniculata]
Length = 105
Score = 189 bits (479), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 88/105 (83%), Positives = 95/105 (90%)
Query: 608 DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDR 667
DPWFEVNAY L++T RWKDLN KFVLQVYRDVVATGD FAKAVWPSVY A+AY+DQFD+
Sbjct: 1 DPWFEVNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 60
Query: 668 DGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALARE 712
DGDGMIEN+GFPDQTYD WS SG+SAY GGLWVAALQA SALARE
Sbjct: 61 DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALARE 105
>gi|116073260|ref|ZP_01470522.1| hypothetical protein RS9916_32457 [Synechococcus sp. RS9916]
gi|116068565|gb|EAU74317.1| hypothetical protein RS9916_32457 [Synechococcus sp. RS9916]
Length = 832
Score = 188 bits (477), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 129/391 (32%), Positives = 196/391 (50%), Gaps = 33/391 (8%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
E+ G+F LE ++Y + + DV Y S AL+ L+P++ ++ R FA A+ D ++ +
Sbjct: 438 EDPHGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFARAIPAADATQRPI 497
Query: 586 ---LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
G+ RKV GA PHD+G + PW N D WKDL +VLQV+R
Sbjct: 498 GWYFTQGKGRVEADRKVKGATPHDLGAPNELPWDATNYTAYQDCNLWKDLGSDYVLQVWR 557
Query: 638 --DVVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAY 694
+ +G D +F WPS A+ Y+ FD + DG+ +N G PDQT+D W + G+SAY
Sbjct: 558 TFKLSPSGEDVRFLADCWPSAVEALRYLKTFDVNDDGLPDNGGAPDQTFDDWPLKGVSAY 617
Query: 695 SGGLWVA----------ALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFN 743
G LW+A LQ + L R F ++++ +++ LWNG Y++
Sbjct: 618 CGALWIAALEAALAIAQTLQLKTGLDTSAEQR----EFSGWLEQSRGNFDRLLWNGEYYD 673
Query: 744 YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAV 803
D S + + ADQL G +YAR GL P+V E +RS LK V K GGK G
Sbjct: 674 ID--AESGTPVVMADQLCGDFYARLLGLEPVVSEANSRSTLKAVREACFEKFDGGKLGVA 731
Query: 804 NGMLPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
NG+ DG +D + E+W+G+ + +A+ + ++G G
Sbjct: 732 NGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGEGQTAEAICSAVVNQVYAG---GL 788
Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAMQWA 893
F+TPEA + +R+ Y+R +AIW + WA
Sbjct: 789 QFRTPEAITAVNTFRACHYLRAMAIWGL-WA 818
Score = 181 bits (460), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 129/450 (28%), Positives = 214/450 (47%), Gaps = 64/450 (14%)
Query: 60 LAPIGVRILCLIREEAAKGKR--------------AFIDPFIKRHLTS-----SHGVPLG 100
+AP+G+ L + + KR + P+ R+ ++ +HG+PLG
Sbjct: 1 MAPLGLSALKSLLGRGKQAKRWQPPEACWSRPFGLGWDKPYTVRYASNLDDGPNHGMPLG 60
Query: 101 GVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLK 160
G G+G IGR+ G W + + QF++F S +G++ ++ + + +
Sbjct: 61 GFGAGCIGRAPDGNINLWHLDGGEHWFGVLPDCQFALFESNGSGKRAHALAVKPSADASR 120
Query: 161 DTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYK 216
+ + +WDW + TY A YP +WT +EG D E+R C SPI+P +Y+
Sbjct: 121 PESGEPLPAWDWYPASTAERSTGTYAARYPLSWTSYEGVYDAEVR--CEAFSPILPGDYQ 178
Query: 217 ESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFT---- 256
+SYPV+VF +T++N D++LL +W N+ G G E
Sbjct: 179 RTSYPVAVFVWTLHNPTDQPLDLSLLLSWRNTTGWFTNTDASAEVHFRDDGSPEHNYAPA 238
Query: 257 -GQHYNSKTK-MNDG-VHAVLLHHRTSHQLPPVT-----FALAAQETDGVHVSLCPHFVI 308
G+ + + ++DG + V+L S+ PV + +A E GV + C +
Sbjct: 239 IGKTVGQRNRWVDDGNLKGVVLEGNVSN---PVAEGEGQWCIATTEQPGVSIQRCSRWNP 295
Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
SG+ ++W+ GS N+ TS ++P + AA+A + P ++
Sbjct: 296 SGD-----GSELWNSFSADGSIPESNNDRTSGANDP---LSAALAVQCQLAPGQSLEIPV 347
Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
++WD P F +G RRYT F+GT + AA IA +A+ + W QI AWQ+P+L+
Sbjct: 348 MISWDLPVTAFATGSQALRRYTDFFGTGGDQAAAIAAEALRDWKQWREQIGAWQQPVLQR 407
Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
LPE + LFNELY L +GG++W+ SP
Sbjct: 408 SELPEPLRMALFNELYDLCSGGSLWSAASP 437
>gi|324503709|gb|ADY41606.1| Non-lysosomal glucosylceramidase [Ascaris suum]
Length = 261
Score = 186 bits (472), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 106/240 (44%), Positives = 142/240 (59%), Gaps = 7/240 (2%)
Query: 665 FDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLF 724
+D DGD +IEN G PDQTYD WS+ G SAY GGLW+ AL+ +A +G+ F
Sbjct: 23 WDVDGDDLIENAGQPDQTYDVWSMHGSSAYCGGLWLCALECVRRMALTLGEVVDAQKFAN 82
Query: 725 KFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARAC-GLLP--IVDEDKA 780
K A+ YE KLWNG YF++D S+ SI ADQL G W+ G + I+ +
Sbjct: 83 KLNNARKAYERKLWNGKYFDFDEH-STDHKSIMADQLCGFWFMCITDGKVDDVIITRQQI 141
Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
++LK ++ YNV K G+ G VN M+P G VD + +QS E+W GV YA+A+ + +
Sbjct: 142 CASLKTIFEYNVEKFANGQLGPVNAMMPSGVVDSTGIQSEEVWGGVAYALASFHLLVEEN 201
Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPK 900
+ F+TA G Y + W GL Y Q+PEA N YR++ YMRPLAIWAMQ AL + K
Sbjct: 202 ESAFKTAEGWYRSCWERYGLQY--QSPEAINESSYYRAIGYMRPLAIWAMQSALDALRNK 259
>gi|260434763|ref|ZP_05788733.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
gi|260412637|gb|EEX05933.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
Length = 832
Score = 185 bits (470), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 128/391 (32%), Positives = 195/391 (49%), Gaps = 33/391 (8%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
E+ G+F LE ++Y + + DV Y S AL+ L+P++ ++ R FA A+ D ++ +
Sbjct: 438 EDPYGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFARAIPAADATQRPI 497
Query: 586 ---LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
G+ RKV GA PHD+G + PW N D WKDL FVLQV+R
Sbjct: 498 GWYFTQGKGRVEADRKVKGATPHDLGAPNEIPWDATNYTAYQDCNLWKDLGSDFVLQVWR 557
Query: 638 D--VVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAY 694
+ +G D +F WP+ A+ ++ FD + DG+ +N G PDQT+D W + G+SAY
Sbjct: 558 SFKLAPSGEDIRFLADCWPAAVEALRFLKTFDVNNDGLPDNGGAPDQTFDDWPLKGVSAY 617
Query: 695 SGGLWVA----------ALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFN 743
G LW+A LQ ++ L R F ++++ ++K LWNG Y++
Sbjct: 618 CGALWIAALEAALAIAQTLQLSTGLDTSAEQR----EFSGWLEQSRGNFDKLLWNGEYYD 673
Query: 744 YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAV 803
D S + + ADQL G +YAR GL P+V + +RS LK V GG G
Sbjct: 674 ID--AESGTPVVMADQLCGDFYARLLGLPPVVSDSNSRSTLKVVKEACFEAFDGGSLGVA 731
Query: 804 NGMLPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
NG+ DG +D + E+W+G+ + +A+ + E +SG G
Sbjct: 732 NGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGDKQTAQAICSAVVEQVYSG---GL 788
Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAMQWA 893
F+TPEA + +R+ Y+R +AIW WA
Sbjct: 789 QFRTPEAITAVNTFRACHYLRAMAIWGF-WA 818
Score = 178 bits (451), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 118/411 (28%), Positives = 192/411 (46%), Gaps = 50/411 (12%)
Query: 85 PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
P+ R+ ++ +HG+PLGG G+G GR+ G W + + QF++F
Sbjct: 40 PYTVRYASNLDDGPNHGMPLGGFGAGCFGRAPDGNINLWHLDGGEHWFGVLPDCQFALFE 99
Query: 140 SRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKG--DKST--YHALYPRAWTVHEGE 195
G++ ++ + + + +W+W D+ST Y A YP +WT +EG
Sbjct: 100 GNGRGKRAHALAVQPDKDASRQEAGQPLKAWEWYPASTPDRSTGTYAARYPFSWTSYEGV 159
Query: 196 PDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----- 250
D E+R C SPI+P +Y+ +SYPV+VF +T+ N D++LL W N+ G
Sbjct: 160 YDAEVR--CEAFSPILPGDYQRTSYPVAVFIWTLRNPTDQPLDLSLLLCWRNTTGWFTNT 217
Query: 251 -----------GDSEFT-----GQHYNSKTKMND--GVHAVLLHHRTSHQLPPVT----- 287
G E G + + D + V+L S+ PV
Sbjct: 218 DASAEVHFRDDGSPEHNYAPAIGSTAGQRNRCIDYGSLKGVVLEGNASN---PVAEGEGQ 274
Query: 288 FALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSS 347
+ +AA E GV + C + SG+ +++W GS N+ S + +P
Sbjct: 275 WCIAAAEQPGVTIQRCSRWNPSGD-----GRELWDSFSADGSIPESNNDRRSGSDDP--- 326
Query: 348 IGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDA 407
+ AA+A + P ++ ++WD P F +G RRYT F+ N A IA ++
Sbjct: 327 LSAALAVQCQLAPGQSIEIPLVISWDLPVTGFATGSQALRRYTDFFAADANQAVAIAAES 386
Query: 408 ILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
+ + SW +I+AWQ P+L+ + LPE + LFNELY L +GG++W+ SP
Sbjct: 387 LRDWRSWRQKIDAWQEPVLQRQDLPEPLRMALFNELYDLCSGGSLWSAASP 437
>gi|352096901|ref|ZP_08957615.1| Glucosylceramidase [Synechococcus sp. WH 8016]
gi|351675485|gb|EHA58646.1| Glucosylceramidase [Synechococcus sp. WH 8016]
Length = 833
Score = 183 bits (465), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 133/383 (34%), Positives = 200/383 (52%), Gaps = 25/383 (6%)
Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL---L 586
G+F LE ++Y + + DV Y SFAL+ L+P++ ++ R FA A+ D ++ +
Sbjct: 443 GRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVIRSFARAIPAADATQRPIGWYF 502
Query: 587 DDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR--DV 639
G+ RKV GA PHD+G + P+ N D WKDL +VLQV+R +
Sbjct: 503 TQGRGRVEADRKVKGATPHDLGAPNEVPFDATNYTAYQDCNLWKDLASDYVLQVWRTFKL 562
Query: 640 VATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TG D F WP+ A+ Y+ QFD + DG+ +N G PDQT+D W + G+SAY G L
Sbjct: 563 APTGEDLSFLAECWPAAVQALHYLKQFDVNDDGLPDNGGAPDQTFDDWPLKGVSAYCGAL 622
Query: 699 WVAALQAASA----LAREVG-DRGSEDY-FLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQ 751
W+AAL+AA A L E+G D G E + F ++++ ++ LWNG Y++ D S
Sbjct: 623 WIAALEAALAMAQRLQLELGLDTGDEQHTFSGWLEQSRANFDTLLWNGEYYDID--AESG 680
Query: 752 SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
+ + ADQL G +YAR L P+V E +RS LK V GG G NG+ DG
Sbjct: 681 TPVVMADQLCGDFYARLLELPPVVSEANSRSTLKAVKEACFDNFAGGLLGVANGLRRDGT 740
Query: 812 -VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAW 870
+D + E+W+G+ + +A+ + ++G G F+TPEA
Sbjct: 741 PLDPNGTHPLEVWTGINFGIASYYRLMGEGPTAEAICSAVVTQVYAG---GLQFRTPEAI 797
Query: 871 NTDDQYRSLCYMRPLAIWAMQWA 893
+ +R+ Y+R +AIW + WA
Sbjct: 798 TAVNTFRACHYLRAMAIWGL-WA 819
Score = 172 bits (435), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 136/444 (30%), Positives = 208/444 (46%), Gaps = 59/444 (13%)
Query: 60 LAPIGVRILC--LIREEAAKG--------KRAF----IDPFIKRHLTSS-----HGVPLG 100
+AP+G+ L L R + KG R+F PF R+ ++ HG+PLG
Sbjct: 1 MAPLGLSALRSRLSRGSSGKGWQPPEASWSRSFGLGWSSPFTVRYASNLDDGPWHGMPLG 60
Query: 101 GVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLK 160
G G+G IGRS RG+F W + + QF+++ + + + ++ T + +
Sbjct: 61 GFGAGCIGRSSRGDFNLWHLDGGEHWYGSIPDCQFALWEKQDDQVRVHALATEPTRDDSR 120
Query: 161 DTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYK 216
+ + SW W + TY A YP +W +EG E + C SPI+P +Y+
Sbjct: 121 PESGKPLSSWQWYPASTEESSTGTYAARYPLSWNHYEGVFRAE--VSCEAFSPILPGDYQ 178
Query: 217 ESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFT---- 256
SSYPV+VF +T+ N +++LL +W N+VG G E
Sbjct: 179 RSSYPVAVFRWTLTNPTTKPLEVSLLLSWRNTVGWFTNTDSSAEVHFRDDGSPEHNYAPA 238
Query: 257 ---GQHYNSKTKMNDGVHAVLLHHRTSHQLPPV--TFALAAQET-DGVHVSLCPHFVISG 310
G+ +++ DG+ VLL + S L + LA + DGV + C + SG
Sbjct: 239 IGVGEGQSNRWIDGDGLCGVLLDGKRSTPLAEGEGQWCLALPDRLDGVELMRCSRWDPSG 298
Query: 311 NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSL 370
+ ++W G N+ S E S AAIA +T+ P ++ ++
Sbjct: 299 D-----GAELWQPFAADGLIPDSNNDRASRAGEHAS---AAIAVKLTLAPGETREIPVAI 350
Query: 371 AWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKR 430
+WD P +F +G RRY+ FYG AA IA +A+ + SW QIEAWQ P+L K
Sbjct: 351 SWDLPVTSFATGVRDLRRYSDFYGADGCHAAAIAAEALRDWRSWHEQIEAWQAPVLARKE 410
Query: 431 LPEWYPITLFNELYYLNAGGAVWT 454
LPE + LFNELY L +GG++WT
Sbjct: 411 LPEELRMALFNELYDLASGGSLWT 434
>gi|78212458|ref|YP_381237.1| hypothetical protein Syncc9605_0920 [Synechococcus sp. CC9605]
gi|78196917|gb|ABB34682.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 832
Score = 182 bits (461), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 127/391 (32%), Positives = 194/391 (49%), Gaps = 33/391 (8%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
E+ G+F LE ++Y + + DV Y S AL+ L+P++ ++ R FA A+ D ++ +
Sbjct: 438 EDPYGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFARAIPAADATQRPI 497
Query: 586 ---LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
G+ RKV GA PHD+G + PW N D WKDL FVLQV+R
Sbjct: 498 GWYFTQGKGRVEADRKVKGATPHDLGAPNEIPWDATNYTAYQDCNLWKDLGSDFVLQVWR 557
Query: 638 --DVVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAY 694
+ +G D +F WP+ A+ Y+ FD + DG+ +N G PDQT+D W + G+SAY
Sbjct: 558 TFKLAPSGEDIRFLADCWPAAVEALRYLKTFDVNNDGLPDNGGAPDQTFDDWPLKGVSAY 617
Query: 695 SGGLWVA----------ALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFN 743
G LW+A LQ ++ L R F ++++ ++K LWNG Y++
Sbjct: 618 CGALWIAALEAALAIAQTLQLSTGLDTSAEQR----EFSGWLEQSRGNFDKLLWNGEYYD 673
Query: 744 YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAV 803
D S + + ADQL G +YAR L P+V + + S LK V GG G
Sbjct: 674 ID--AESGTPVVMADQLCGDFYARLLALPPVVSDANSLSTLKAVKEACFEAFDGGSLGVA 731
Query: 804 NGMLPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
NG+ DG +D ++ E+W+G+ + +A+ + E +S G
Sbjct: 732 NGLRRDGTPLDPNATHPLEVWTGINFGIASYYRLMGDKQTAQSICSAVVEQVYSS---GL 788
Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAMQWA 893
F+TPEA + YR+ Y+R +AIW + WA
Sbjct: 789 QFRTPEAITAVNTYRACHYLRAMAIWGL-WA 818
Score = 176 bits (445), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 118/414 (28%), Positives = 189/414 (45%), Gaps = 56/414 (13%)
Query: 85 PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
P+ R+ ++ +HG+PLGG G+G GR+ G W + + QF++F
Sbjct: 40 PYTVRYASNLDDGPNHGMPLGGFGAGCFGRAPDGNINLWHLDGGEHWFGVLPDCQFALFE 99
Query: 140 SRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKG--DKST--YHALYPRAWTVHEGE 195
G++ ++ + + + W+W D+ST Y A YP +WT +EG
Sbjct: 100 GNGRGKRAHALAVQPEQDASRPEAGQPLKVWEWYPASTPDRSTGTYAARYPLSWTSYEGV 159
Query: 196 PDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEF 255
D E+R C SPI+P +Y+ +SYPV VF +T+ N D++LL +W N+ G F
Sbjct: 160 YDAEVR--CEAFSPILPGDYQRTSYPVVVFVWTLRNPTDQPLDLSLLLSWRNTTG---WF 214
Query: 256 TGQHYNSKTKMND--------------------------GVHAVLLHHRTSHQLPPVT-- 287
T +++ D + VLL S PV
Sbjct: 215 TNTDASAEVHFRDDGSPEHNYAPAIGSTAGQRNCCIDDGSLKGVLLEGNVSD---PVAEG 271
Query: 288 ---FALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEP 344
+ +A E GV + C + SG+ +++W GS N+ S + +P
Sbjct: 272 EGQWCIATAEQPGVTIQRCSRWNPSGD-----GRELWDSFSADGSIPESNNDRRSGSDDP 326
Query: 345 GSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIA 404
+ AA+A + P ++ ++WD P F +G RRYT F+ N A IA
Sbjct: 327 ---LSAALAVQCQLAPGQSIEIPLVISWDLPVTAFATGSQALRRYTDFFAADANQAVAIA 383
Query: 405 RDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
+A+ + W QIEAWQ+P+L+ + LP+ + LFNELY L +GG++W+ SP
Sbjct: 384 AEALRDWRRWRQQIEAWQQPVLQRQDLPKPLRMALFNELYDLCSGGSLWSAASP 437
>gi|33862646|ref|NP_894206.1| hypothetical protein PMT0373 [Prochlorococcus marinus str. MIT
9313]
gi|33634562|emb|CAE20548.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313]
Length = 837
Score = 181 bits (459), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 132/385 (34%), Positives = 204/385 (52%), Gaps = 27/385 (7%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL--- 585
+G+F LE +Y + + DV Y S AL+ L+P++ ++ R FA A+ D + +
Sbjct: 442 VGRFGVLECFDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFARAIPAADATPRPIGWY 501
Query: 586 LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
G+ RK A PHD+G + P+ N D WKDL +VLQV+R +
Sbjct: 502 FTQGRGRVEAPRKRAAATPHDLGAPNESPFDATNYTAYQDCNLWKDLASDYVLQVWRTFL 561
Query: 641 ATG---DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGG 697
+ D F WP+V A++Y+ +FD + DG+ +N G PDQT+D W + G+SAY G
Sbjct: 562 LSPNGEDLSFLAECWPAVVQALSYLKRFDVNHDGLPDNGGAPDQTFDDWPLQGVSAYCGA 621
Query: 698 LWVAALQAASA----LAREVGDRGSEDYFLFK--FQKAKVVYEK-LWNGSYFNYDNSGSS 750
LW+AAL+AA A L ++G +E+ F ++++V +++ LWNG Y+ D S
Sbjct: 622 LWIAALEAALAMAQRLQLDLGLNTAEEQHQFSGWLEQSRVNFDRLLWNGEYYKID--AES 679
Query: 751 QSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDG 810
+ + ADQL G +YAR GL +V ++++RS+L V GG+ G NG+ DG
Sbjct: 680 GTPVVMADQLCGDFYARLLGLPSVVADERSRSSLNAVKEACFEGFEGGRLGVANGLCRDG 739
Query: 811 R-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGT-GLGYAFQTPE 868
+D E+W+G+ + +AA + + D TA I A + G G F+TPE
Sbjct: 740 MPLDPKGTHPLEVWTGINFGLAA--YYRLMGDA--TTATAICSAVVNQVYGGGLQFRTPE 795
Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWA 893
A YR+ Y+R +AIWA+ WA
Sbjct: 796 AITAVKTYRACHYLRAMAIWAL-WA 819
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 128/428 (29%), Positives = 201/428 (46%), Gaps = 59/428 (13%)
Query: 75 AAKGKRAF----IDPFIKRHLTSS-----HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVC 125
AA RAF P+ R+ ++ HG+PLGG G+G IGRS RG+F W +
Sbjct: 26 AASWSRAFGLPWQQPYTVRYASNLDDGPWHGMPLGGFGAGCIGRSPRGDFNLWNLDGGEH 85
Query: 126 EDKPVLANQFSVFVSRSNG-QKYSSVLCPKT----PEVLKDTTAAGIGSWDW----NLKG 176
+ QF++F + + + ++ P+T PE+ K + SW W +
Sbjct: 86 WQGSIPDCQFALFERQGDQVRAHALATAPQTDDSQPELDKP-----LSSWSWYPASTEQR 140
Query: 177 DKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTS 236
+Y A YP +WT ++G EL VC SPI+P +Y+ +SYP++VF + + N S
Sbjct: 141 KTGSYSARYPLSWTHYKGVFAAEL--VCEAFSPILPGDYRRTSYPLAVFRWQLQNPTSQS 198
Query: 237 ADITLLFTWTNSVG----------------GDSEFT-------GQHYNSKTKMNDGVHAV 273
+++LL +W N+ G G E + G+ ++ G+ V
Sbjct: 199 LELSLLLSWRNTCGWFTNTDPSASVHFRDDGSPEHSYVPAIGRGEGQCNRWIDQPGLIGV 258
Query: 274 LLHHRTSHQLPPVT--FALAAQE-TDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSF 330
L+ + L + LA + GV V C + SG+ ++W G+
Sbjct: 259 LMEGERADPLAEGQGQWCLAVPDHLPGVEVMRCSRWDPSGD-----GSELWSSFASEGTI 313
Query: 331 DRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYT 390
N+ S+ E S AA+A VT+ P ++ ++WD P F +G RRYT
Sbjct: 314 PNSNTAHKSLAGEQTS---AALAVKVTLAPGESLEIPVVISWDLPVTAFATGVRDLRRYT 370
Query: 391 KFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGG 450
F+G+ AAA +A +A+ + W QI+AWQ P+L + LPE + LFNELY L +GG
Sbjct: 371 DFHGSDGQAAAALAAEALSDWSDWREQIDAWQAPVLAREDLPERLRMALFNELYDLASGG 430
Query: 451 AVWTDGSP 458
++WT P
Sbjct: 431 SLWTAARP 438
>gi|33866127|ref|NP_897686.1| hypothetical protein SYNW1593 [Synechococcus sp. WH 8102]
gi|33639102|emb|CAE08108.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 833
Score = 180 bits (456), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 130/383 (33%), Positives = 201/383 (52%), Gaps = 25/383 (6%)
Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL---L 586
G+F LE ++Y + + DV Y S AL+ L+P++ ++ R FA A+ D S+ +
Sbjct: 443 GRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFARAIPAADGSQRPIGWYF 502
Query: 587 DDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRD--V 639
G+ RKV GA PHD+G + P+ N D WKDL ++LQV+R +
Sbjct: 503 TQGRGRVEADRKVAGATPHDLGAPNERPFDATNYTAYQDCNLWKDLASDYMLQVWRTFRL 562
Query: 640 VATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TG D +F WPS A+ Y+ FD + DG+ +N G PDQT+D W + G+SAY G L
Sbjct: 563 APTGEDLRFLAECWPSAVQALDYLKGFDVNNDGLPDNGGAPDQTFDDWPLQGVSAYCGAL 622
Query: 699 WVAALQAASA----LAREVG-DRGSED-YFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQ 751
W+AAL+AA A L ++G D +E F ++++ ++ LWNG Y+ D S
Sbjct: 623 WIAALEAALAMAQRLQLDLGLDTAAEQRRFGAWLEQSRANFDDLLWNGEYYRID--AESG 680
Query: 752 SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
+ + ADQL G +YAR GL P+V + +ARS L+ V + G+ G NG+ DG
Sbjct: 681 TPVVMADQLCGDFYARLLGLPPVVSDARARSTLQAVKEACFERFESGRLGVANGLRRDGS 740
Query: 812 -VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAW 870
+D + E+W+G+ + +A+ + + +SG G F+TPEA
Sbjct: 741 PLDPNGTHPLEVWTGINFGLASYYRLMGESKTALAICSAVVGQVYSG---GLQFRTPEAI 797
Query: 871 NTDDQYRSLCYMRPLAIWAMQWA 893
+ +R+ Y+R +AIW + WA
Sbjct: 798 TAVNTFRACHYLRAMAIWGL-WA 819
Score = 161 bits (408), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 119/409 (29%), Positives = 188/409 (45%), Gaps = 45/409 (11%)
Query: 85 PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
P+ R+ ++ HG+PLGG G+G +GRS G F W + + QF+++
Sbjct: 40 PYTVRYASNLDDGPDHGMPLGGFGAGCLGRSPSGAFNLWNLDGGEHWFGSIPDCQFALWE 99
Query: 140 SRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDW---NLKGDKS-TYHALYPRAWTVHEGE 195
+ + ++ + + + +W W + +G + TY A YP +WT HE
Sbjct: 100 QQGEDVRAHALATAPDRDDTDPSGGTPLSAWQWYPASTQGRSTGTYAARYPLSWTHHEDV 159
Query: 196 PDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----- 250
++C SPI+P +Y+ +SYPV+VF + N +++L+ W N+VG
Sbjct: 160 Y--RAGVLCEAFSPILPGDYQRTSYPVAVFRWQFSNPTDQPLELSLMLCWRNTVGWFTNT 217
Query: 251 -----------GDSEFT-------GQHYNSKTKMNDGVHAVLLHHRTSHQLPPV--TFAL 290
G E G+ ++ G VLL + S L + L
Sbjct: 218 DAAAEVHFRDDGSPEHNYAPAIGRGEGQRNRQVDQPGFSGVLLEGQPSMPLAEGEGQWCL 277
Query: 291 A-AQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIG 349
A + DGVHV C + SG+ ++W G R+ S G
Sbjct: 278 ALPDDLDGVHVMRCSRWDPSGD-----GSELWTPFAADG---RIPDSNNDRASRSGEQAS 329
Query: 350 AAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAIL 409
AAIA T+ P + ++ ++WD P F SG + RRYT F+G+ + AA IA +A+
Sbjct: 330 AAIAVKFTLAPGASREIPLVISWDLPVTAFASGSSALRRYTDFFGSSGSNAAAIAAEALR 389
Query: 410 EHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
+ W QI+AWQ+P+L +LPE + LFNELY L +GG++WT SP
Sbjct: 390 DWRQWREQIDAWQQPVLARMQLPEPLRMALFNELYDLCSGGSLWTAASP 438
>gi|124023625|ref|YP_001017932.1| bile acid beta-glucosidase [Prochlorococcus marinus str. MIT 9303]
gi|123963911|gb|ABM78667.1| Predicted bile acid beta-glucosidase [Prochlorococcus marinus str.
MIT 9303]
Length = 837
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 130/385 (33%), Positives = 202/385 (52%), Gaps = 27/385 (7%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL--- 585
+G+F LE +Y + + DV Y S AL+ L+P++ ++ R FA A+ D + +
Sbjct: 442 VGRFGVLECFDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFARAIPAADATPRPIGWY 501
Query: 586 LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
G+ RK A PHD+G + P+ N D WKDL +VLQV+R +
Sbjct: 502 FTQGRGRVEAPRKRAAATPHDLGAPNESPFDATNYTAYQDCNLWKDLASDYVLQVWRTFL 561
Query: 641 ATG---DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGG 697
+ D F WP+ A++Y+ +FD + DG+ +N G PDQT+D W + G+SAY G
Sbjct: 562 LSPNGEDLSFLAECWPAAVQALSYLKRFDVNHDGLPDNGGAPDQTFDDWPLQGVSAYCGA 621
Query: 698 LWVAALQAASA----LAREVGDRGSEDYFLFK--FQKAKVVYEK-LWNGSYFNYDNSGSS 750
LW+AAL+AA A L ++G +E+ F ++++ +++ LWNG Y+ D S
Sbjct: 622 LWIAALEAALAMAQRLQLDLGLNTAEEQHQFSGWLEQSRANFDRLLWNGEYYKID--AES 679
Query: 751 QSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDG 810
+ + ADQL G +YAR GL +V ++++RS+L V GG+ G NG+ DG
Sbjct: 680 GTPVVMADQLCGDFYARLLGLPSVVADERSRSSLNAVKEACFEGFEGGRLGVANGLCRDG 739
Query: 811 R-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGT-GLGYAFQTPE 868
+D E+W+G+ + +AA + + D TA I A + G G F+TPE
Sbjct: 740 MPLDPKGTHPLEVWTGINFGLAA--YYRLMGDA--TTATAICSAVVNQVYGGGLQFRTPE 795
Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWA 893
A YR+ Y+R +AIWA+ WA
Sbjct: 796 AITAVKTYRACHYLRAMAIWAL-WA 819
Score = 148 bits (373), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 128/428 (29%), Positives = 200/428 (46%), Gaps = 59/428 (13%)
Query: 75 AAKGKRAF----IDPFIKRHLTSS-----HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVC 125
AA RAF P+ R+ ++ HG+PLGG G+G IGRS RG+F W +
Sbjct: 26 AASWSRAFGLAWQQPYTVRYASNLDDGPWHGMPLGGFGAGCIGRSPRGDFNLWNLDAGEH 85
Query: 126 EDKPVLANQFSVFVSRSNG-QKYSSVLCPKT----PEVLKDTTAAGIGSWDW----NLKG 176
+ QF++F + + + ++ P+T PE+ K + SW W +
Sbjct: 86 WQGSIPDCQFALFERQGDQVRAHALATAPQTDDSQPELDKP-----LSSWSWYPASTEQR 140
Query: 177 DKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTS 236
+Y A YP +WT ++G EL VC SPI+P +Y+ +SYP++VF + + N S
Sbjct: 141 TTGSYSARYPLSWTHYKGVFAAEL--VCEAFSPILPGDYRRTSYPLAVFRWQLQNPTSQS 198
Query: 237 ADITLLFTWTNSVG----------------GDSEFT-------GQHYNSKTKMNDGVHAV 273
+++LL +W N+ G G E + G+ ++ G+ V
Sbjct: 199 LELSLLLSWRNTCGWFTNTDPSASVHFRDDGSPEHSYVPAIGRGEGQCNRWIDQPGLIGV 258
Query: 274 LLHHRTSHQLPPVT--FALAAQE-TDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSF 330
L+ + L + LA + GV V C + SG+ ++W G+
Sbjct: 259 LMEGERADPLAEGQGQWCLAVPDHLPGVEVMRCSRWDPSGD-----GSELWSSFASEGTI 313
Query: 331 DRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYT 390
N S+ E S AA+A VT+ P ++ ++WD P F +G RRYT
Sbjct: 314 PNSNDTHKSLAGEQTS---AALAVKVTLAPGESLEIPVVISWDLPVTAFATGVRDLRRYT 370
Query: 391 KFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGG 450
F+G+ AAA +A +A+ + W QI+AWQ P+L + LPE + LFNELY L +GG
Sbjct: 371 DFHGSDGQAAAALAAEALRDWPDWREQIDAWQAPVLAREDLPERLRMALFNELYDLASGG 430
Query: 451 AVWTDGSP 458
++WT P
Sbjct: 431 SLWTAARP 438
>gi|116072378|ref|ZP_01469645.1| hypothetical protein BL107_11341 [Synechococcus sp. BL107]
gi|116064900|gb|EAU70659.1| hypothetical protein BL107_11341 [Synechococcus sp. BL107]
Length = 820
Score = 177 bits (449), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 127/387 (32%), Positives = 191/387 (49%), Gaps = 33/387 (8%)
Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL---L 586
G+F LE ++Y + + DV Y SFAL+ L+P++ ++ R F+ A+ D ++ +
Sbjct: 430 GRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVLRSFSRAIPARDATQRPIGWYF 489
Query: 587 DDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR--DV 639
G+ RKV GA PHD+G + P+ N D WKDL FVLQV+R +
Sbjct: 490 TQGRGRVEADRKVAGATPHDLGAPNEVPFDATNYTAYQDCNLWKDLASDFVLQVWRTFQL 549
Query: 640 VATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
TG D F WPS A+ Y+ FD + DG+ +N G PDQT+D W + G+SAY G L
Sbjct: 550 APTGEDLNFLAECWPSAVQALDYLKGFDVNDDGLPDNGGAPDQTFDDWPLKGVSAYCGAL 609
Query: 699 WVAA----------LQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNS 747
W+AA LQ L V R F ++++ +++ LWNG Y+ D
Sbjct: 610 WIAALEAALAMAQRLQLDLGLDTTVEQR----RFGQWLEQSRSNFDRLLWNGEYYEID-- 663
Query: 748 GSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGML 807
S + + ADQL G +YAR GL +V + A S L V + G G NG+
Sbjct: 664 AESGTPVVMADQLCGDFYARLLGLPSVVSDANALSTLNVVREACFERFEAGTLGVANGLR 723
Query: 808 PDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQT 866
DG +D + E+W+G+ + +A+ +D + +SG G F+T
Sbjct: 724 RDGTPLDPNGTHPLEVWTGINFGIASYFRLMGQSDTALAITGAVVNQVYSG---GLQFRT 780
Query: 867 PEAWNTDDQYRSLCYMRPLAIWAMQWA 893
PEA + +R+ Y+R +AIW + WA
Sbjct: 781 PEAITAVNTFRACHYLRAMAIWGL-WA 806
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 116/393 (29%), Positives = 181/393 (46%), Gaps = 44/393 (11%)
Query: 94 SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCP 153
+HG+PLGG G+G GR+ G W + + QFS++ + + + ++
Sbjct: 41 NHGMPLGGFGAGCFGRAPDGNTNLWHLDGGEHWFGTLPDCQFSLWEKQGDDLRAHALAVS 100
Query: 154 KTPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELR--IVCRQI 207
+ + AA + +W W + TY A YP +WT ++G R ++C
Sbjct: 101 PEGDASRPDAAASLPAWSWYPASTDQCSTGTYAARYPLSWTHYDGV----FRSGVLCEAF 156
Query: 208 SPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------G 251
SPI+P +Y+ SSYPV+VF + + N +++L+ +W N+VG G
Sbjct: 157 SPILPGDYQRSSYPVAVFRWQLANPTDQPLELSLMVSWRNTVGWFTNTDAAAEVHFRDDG 216
Query: 252 DSEFT-------GQHYNSKTKMNDGVHAVLLHHRTSHQLPPV--TFALAA-QETDGVHVS 301
E G+ ++ GV VLL R+S + + LA + DGV V
Sbjct: 217 SPEHNYSPAIGRGEGQRNRAINEPGVTGVLLEGRSSEPIAEGEGQWCLAVPDDLDGVDVL 276
Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
C + G+ D+W G R+ + S G AA+ T+ P
Sbjct: 277 RCSRWDPCGD-----GSDIWTPFAAEG---RIPESDNDRESRAGEQASAAMVVKFTLAPG 328
Query: 362 SEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAW 421
++ ++WD P F SG RRYT F+G+ AA IA +A+ + W +IEAW
Sbjct: 329 ESREIPVVISWDLPITAFASGCADRRRYTDFFGSDGRNAAAIAAEALRDWSDWRGRIEAW 388
Query: 422 QRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
Q+P+LE K LPE + LFNELY L +GG++WT
Sbjct: 389 QQPVLERKALPEPVRMALFNELYDLCSGGSLWT 421
>gi|344251774|gb|EGW07878.1| Non-lysosomal glucosylceramidase [Cricetulus griseus]
Length = 226
Score = 159 bits (401), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 86/185 (46%), Positives = 108/185 (58%), Gaps = 7/185 (3%)
Query: 709 LAREVGDRGSEDYFLFKFQKAKVVYEKL-WNGSYFNYDNSGSSQSSSIQADQLAGQWYAR 767
+A G + +D F + + YE+L WNG Y+NYD+S QS SI +DQ AGQW+ R
Sbjct: 1 MAVLCGAQDVQDKFSSILCRGREAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLR 60
Query: 768 ACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIW 823
ACGL + AL+ ++ NV GG GAVNGM P G D SS+QS E+W
Sbjct: 61 ACGLGEGDTEVFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVW 120
Query: 824 SGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMR 883
GV Y +AA+MI E L GF+TA G Y W LG AFQTPEA+ +RSL YMR
Sbjct: 121 VGVVYGLAATMIQEGLTREGFRTAEGCYRTVWE--RLGLAFQTPEAYCQQQVFRSLAYMR 178
Query: 884 PLAIW 888
PL+IW
Sbjct: 179 PLSIW 183
>gi|16183333|gb|AAL13692.1| GH26380p [Drosophila melanogaster]
Length = 391
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 85/203 (41%), Positives = 122/203 (60%), Gaps = 10/203 (4%)
Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
+ KA++ S M ++D+D DG+IEN PDQTYD+W + G SAY GLW+AALQA
Sbjct: 171 YLKAMYASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 230
Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
SA+A + + +K K + EKLWNGSY+ +D S S +I ADQL G WY
Sbjct: 231 SAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLS-HSHRDTIMADQLCGHWY 289
Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
++CG I ++ R+ALK++Y+ NV+ G GA NG + + G VD S++
Sbjct: 290 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVDNSNI 349
Query: 818 QSREIWSGVTYAVAASMIHEDLA 840
Q+ E+W GV YA+AA+MI E ++
Sbjct: 350 QAEEVWPGVVYALAATMIQEGIS 372
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 54/125 (43%), Positives = 78/125 (62%), Gaps = 2/125 (1%)
Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVP 601
M+NTYDVHFY+S AL L+P +Q+S+Q DF A+ K+L DG+ + RKV VP
Sbjct: 1 MYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELNDTRKMLYDGKVMPRKVKNCVP 60
Query: 602 HDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAM 659
HD+G D P+ +N Y ++D WKDLN KFVLQVYRD + A++ S + ++
Sbjct: 61 HDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQAQSDNASKFSSI 120
Query: 660 AYMDQ 664
++D+
Sbjct: 121 EFIDK 125
>gi|226444526|gb|ACO57837.1| unknown [Helianthus annuus]
gi|226444528|gb|ACO57838.1| unknown [Helianthus annuus]
gi|226444560|gb|ACO57854.1| unknown [Helianthus petiolaris]
gi|226444568|gb|ACO57858.1| unknown [Helianthus petiolaris]
Length = 103
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 66/103 (64%), Positives = 76/103 (73%)
Query: 794 KVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEA 853
KV GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A
Sbjct: 1 KVRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDKAFTTAEGIFIA 60
Query: 854 AWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
WS G GYAFQTPE W D YRSL YMRPLAIW MQ AL +
Sbjct: 61 GWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 103
>gi|226444514|gb|ACO57831.1| unknown [Helianthus annuus]
gi|226444516|gb|ACO57832.1| unknown [Helianthus annuus]
gi|226444518|gb|ACO57833.1| unknown [Helianthus annuus]
gi|226444520|gb|ACO57834.1| unknown [Helianthus annuus]
gi|226444522|gb|ACO57835.1| unknown [Helianthus annuus]
gi|226444530|gb|ACO57839.1| unknown [Helianthus annuus]
gi|226444532|gb|ACO57840.1| unknown [Helianthus annuus]
gi|226444534|gb|ACO57841.1| unknown [Helianthus annuus]
gi|226444536|gb|ACO57842.1| unknown [Helianthus annuus]
gi|226444538|gb|ACO57843.1| unknown [Helianthus annuus]
gi|226444540|gb|ACO57844.1| unknown [Helianthus annuus]
gi|226444542|gb|ACO57845.1| unknown [Helianthus annuus]
gi|226444544|gb|ACO57846.1| unknown [Helianthus annuus]
gi|226444546|gb|ACO57847.1| unknown [Helianthus petiolaris]
gi|226444548|gb|ACO57848.1| unknown [Helianthus petiolaris]
gi|226444550|gb|ACO57849.1| unknown [Helianthus petiolaris]
gi|226444552|gb|ACO57850.1| unknown [Helianthus petiolaris]
gi|226444554|gb|ACO57851.1| unknown [Helianthus petiolaris]
gi|226444556|gb|ACO57852.1| unknown [Helianthus petiolaris]
gi|226444558|gb|ACO57853.1| unknown [Helianthus petiolaris]
gi|226444562|gb|ACO57855.1| unknown [Helianthus petiolaris]
gi|226444564|gb|ACO57856.1| unknown [Helianthus petiolaris]
gi|226444566|gb|ACO57857.1| unknown [Helianthus petiolaris]
gi|226444570|gb|ACO57859.1| unknown [Helianthus argophyllus]
gi|226444572|gb|ACO57860.1| unknown [Helianthus argophyllus]
gi|226444576|gb|ACO57862.1| unknown [Helianthus argophyllus]
gi|226444578|gb|ACO57863.1| unknown [Helianthus argophyllus]
gi|226444580|gb|ACO57864.1| unknown [Helianthus argophyllus]
gi|226444582|gb|ACO57865.1| unknown [Helianthus argophyllus]
gi|226444584|gb|ACO57866.1| unknown [Helianthus argophyllus]
gi|226444586|gb|ACO57867.1| unknown [Helianthus argophyllus]
gi|226444588|gb|ACO57868.1| unknown [Helianthus argophyllus]
gi|226444590|gb|ACO57869.1| unknown [Helianthus argophyllus]
gi|226444592|gb|ACO57870.1| unknown [Helianthus argophyllus]
Length = 103
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 66/103 (64%), Positives = 76/103 (73%)
Query: 794 KVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEA 853
KV GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A
Sbjct: 1 KVRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIA 60
Query: 854 AWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
WS G GYAFQTPE W D YRSL YMRPLAIW MQ AL +
Sbjct: 61 GWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 103
>gi|226444524|gb|ACO57836.1| unknown [Helianthus annuus]
Length = 103
Score = 140 bits (353), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 66/101 (65%), Positives = 75/101 (74%)
Query: 794 KVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEA 853
KV GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A
Sbjct: 1 KVRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIA 60
Query: 854 AWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
WS G GYAFQTPE W D YRSL YMRPLAIW MQ AL
Sbjct: 61 GWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQAL 101
>gi|119578750|gb|EAW58346.1| glucosidase, beta (bile acid) 2, isoform CRA_b [Homo sapiens]
Length = 370
Score = 139 bits (349), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 81/211 (38%), Positives = 123/211 (58%), Gaps = 12/211 (5%)
Query: 72 REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
R+ + K FID L +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 127 RKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186
Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
A+QF+V + R Y VL + P VL+ SW+W L G + YHALYPRAWTV
Sbjct: 187 ADQFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 238
Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
++ P + + CRQI+PI+PH+Y++SS PV VF + + N G + D++++F+ N +GG
Sbjct: 239 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 297
Query: 252 DSEFTGQHYNSK---TKMNDGVHAVLLHHRT 279
+ G +N + + V +LLHH T
Sbjct: 298 GDDAPGGLWNEPFCLERSGETVRGLLLHHPT 328
>gi|449691194|ref|XP_004212588.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Hydra
magnipapillata]
Length = 169
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 72/162 (44%), Positives = 100/162 (61%), Gaps = 7/162 (4%)
Query: 740 SYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDE-----DKARSALKKVYNYNVLK 794
+Y+NYD S + + I A Q AGQWY AC L D+ ++ S LK ++ NV++
Sbjct: 1 TYYNYDGSENDYHNCIMAGQCAGQWYLHACDLGQTEDDLVFRPEQVHSTLKVIFQNNVMR 60
Query: 795 VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
G GAVNGM DG VD +S+Q+ E+W+GVTYA+AA+MI + GF+TA GIY
Sbjct: 61 FEDGFMGAVNGMRADGTVDKNSLQAEEVWTGVTYALAATMIQHGMVREGFRTAEGIYRTV 120
Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
+ GLG FQTPEA + +RS YMRPL+IW++ A ++
Sbjct: 121 YEKWGLG--FQTPEAITANKTFRSRGYMRPLSIWSIYHAYSK 160
>gi|226444574|gb|ACO57861.1| unknown [Helianthus argophyllus]
Length = 103
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 65/103 (63%), Positives = 75/103 (72%)
Query: 794 KVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEA 853
KV GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH + D F TA GI+ A
Sbjct: 1 KVRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIA 60
Query: 854 AWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
WS G GYAFQTPE W D YRSL YMRPLAIW Q AL +
Sbjct: 61 GWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGKQQALEK 103
>gi|148242847|ref|YP_001228004.1| hypothetical protein SynRCC307_1748 [Synechococcus sp. RCC307]
gi|147851157|emb|CAK28651.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
Length = 606
Score = 135 bits (341), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 111/393 (28%), Positives = 172/393 (43%), Gaps = 40/393 (10%)
Query: 95 HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
HG PLGG G+G IGRS +G+F W I + QF++F + + ++
Sbjct: 54 HGAPLGGFGAGCIGRSSKGDFNLWNIDGGEHWFGSLPDCQFALFEDNGSDTRAHALAVAP 113
Query: 155 TPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
+ K A + SW W + +Y A YP W + G E C SPI
Sbjct: 114 EADSSKPEGEAPLSSWSWYPASTAEKPTGSYSARYPLNWFHYSGVFGCE--ASCESFSPI 171
Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSE 254
IP NY++++ PV+VF + N + ++LL +W N+ G G E
Sbjct: 172 IPGNYRDTALPVAVFRWRFRNPSRKPLKLSLLLSWRNTTGWFCNTDPSAAVHFRDDGSPE 231
Query: 255 FT-------GQHYNSKTKMNDGVHAVLLHHRTSHQLPPVT--FALAA-QETDGVHVSLCP 304
Q ++ G+ V++ S + + +A +E +GV + C
Sbjct: 232 HNYLPAIGATQGQRNRAINAPGLRGVVMEGPMSEPIAEGQGQWCIAVPEELEGVELMRCS 291
Query: 305 HFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEG 364
+ G+ ++W GS N+ S ++P + AA+A + P +E
Sbjct: 292 RWNPHGD-----GAEIWQRFAADGSVPESNNDRRSGNNDP---VSAAMALRFELAPGAEL 343
Query: 365 QVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRP 424
++ ++WD P F +G RRYT F+G + AA IA DA+L W QIE WQ P
Sbjct: 344 ELPVVISWDLPVTAFATGSQCLRRYTDFFGPSGDQAAAIAADALLRWPRWRQQIEEWQAP 403
Query: 425 ILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
+L LPE + L NELY L +GG +W+ S
Sbjct: 404 VLLRDDLPEELRMALCNELYDLASGGTLWSAAS 436
Score = 47.4 bits (111), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 34/57 (59%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
+G+F LE ++Y + + DV Y SFAL+ L+P + S+ R FA A+ D ++ +
Sbjct: 441 VGRFGVLECLDYAWYESLDVRLYGSFALLQLWPALDRSVLRSFARAIPAADATQRPI 497
>gi|332796021|ref|YP_004457521.1| hypothetical protein Ahos_0332 [Acidianus hospitalis W1]
gi|332693756|gb|AEE93223.1| conserved hypothetical protein [Acidianus hospitalis W1]
Length = 652
Score = 129 bits (324), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 95/346 (27%), Positives = 160/346 (46%), Gaps = 37/346 (10%)
Query: 553 SFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFE 612
SFAL+ FP++ + F + + G VPHD+G + E
Sbjct: 334 SFALLEYFPELVKKMDEHFGNFI--------------------IDGEVPHDLG--EESIE 371
Query: 613 VNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGM 672
Y W DL P ++L +YRD T D F K + + + ++ + D D DG+
Sbjct: 372 NPIYGASYPYSWNDLGPTWILMIYRDYKFTNDLSFLKRNYNKMKEVIDWLIKKDEDNDGI 431
Query: 673 IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFK--FQKAK 730
++ G D +YD + G S+Y G L++ AL+A ++ + S DY ++ KAK
Sbjct: 432 PDSKGGFDNSYDGTYMYGTSSYIGSLFLCALKAFIESSKIL----SYDYSKYEEILNKAK 487
Query: 731 VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNY 790
E LWNG YF ++ S QL G+++ GL ++DEDK ++ALK +Y +
Sbjct: 488 SSLESLWNGRYFINWKYKDQKNESCLNSQLLGEFWCNLLGLGNVIDEDKIKTALKYIYEH 547
Query: 791 NVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGI 850
N K VN + PDG +D S+ Q + W +++A+A+ MI + + + G + A
Sbjct: 548 N---GKASKYCLVNSVNPDGSIDESTDQMKSCWPRISFAIASHMIMKGMINEGIEIA--- 601
Query: 851 YEAAWSGTGLGYAFQTPEAWN--TDDQYRSLCYMRPLAIWAMQWAL 894
W Y + P N + + Y+ L+I+ +++A+
Sbjct: 602 -RKEWETISSRYPYNQPSKINAFNGEHFGLPYYIGSLSIYLVKYAI 646
>gi|156333782|ref|XP_001619412.1| hypothetical protein NEMVEDRAFT_v1g224207 [Nematostella vectensis]
gi|156202556|gb|EDO27312.1| predicted protein [Nematostella vectensis]
Length = 173
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 63/103 (61%), Positives = 75/103 (72%), Gaps = 2/103 (1%)
Query: 798 GKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSG 857
G GAVNG+ PDG++D SS+Q+ E+W+GVTYAVAASMI E L D GF+TA GIY +
Sbjct: 52 GTMGAVNGIRPDGQLDTSSLQAEEVWTGVTYAVAASMIQEGLVDEGFKTASGIYNTCFE- 110
Query: 858 TGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPK 900
LG FQTPEA + YRSL YMRPL+IWAMQWAL + K K
Sbjct: 111 -RLGMNFQTPEAIVANGNYRSLAYMRPLSIWAMQWALEKRKNK 152
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 55/109 (50%), Gaps = 14/109 (12%)
Query: 609 PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRD 668
PW VNAY ++DT++WKDLN KFVLQVYRD V T D + + +WP ++
Sbjct: 3 PWDHVNAYHIHDTSKWKDLNLKFVLQVYRDYVFTNDVYYLQDMWP-----------ITKE 51
Query: 669 GDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
G N PD DT S+ ++G V AAS + + D G
Sbjct: 52 GTMGAVNGIRPDGQLDTSSLQAEEVWTG---VTYAVAASMIQEGLVDEG 97
>gi|422908697|ref|ZP_16943374.1| hypothetical protein VCHE09_0171 [Vibrio cholerae HE-09]
gi|341639159|gb|EGS63785.1| hypothetical protein VCHE09_0171 [Vibrio cholerae HE-09]
Length = 981
Score = 127 bits (318), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 167/640 (26%), Positives = 261/640 (40%), Gaps = 101/640 (15%)
Query: 104 SGSIGRSYRGEFQRWQIF------PRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPE 157
SG + + F+RWQI + D L Q V + NG +L +
Sbjct: 94 SGLFYLNNKSNFERWQIEFSDKTQQAITNDPSSLETQLLVLIDFFNG-----LLVNNSAR 148
Query: 158 VLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQ-ISPIIPHNYK 216
L T A GS ++ Y ALYP A ++G E + RQ ISPI+ ++
Sbjct: 149 ALSLT--ANNGSSVESIDPSDIEYRALYPIAEYEYQG---LESVALTRQVISPIVKNDPM 203
Query: 217 ESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLH 276
S P+ + + N + +TL+ N +G ++ K +GV H
Sbjct: 204 LCSLPMHWNHFEMSNRSDKTQLVTLVQPLRNLIG----------STYKKAREGVQDSCCH 253
Query: 277 ---HRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRL 333
+ S Q PVT + A E G VS+ + G+ G + H F+
Sbjct: 254 LVQNAISQQHQPVTISGAGFEFHG--VSMTSNSPYDGDISGEVLYGV------HTQFEG- 304
Query: 334 NSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFY 393
GAA +VTV P QV S A E +G+T Y
Sbjct: 305 ---------------GAAEQVTVTVKP----QVYCSKADQQVERALKTGRTNAHFDRGIY 345
Query: 394 GTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVW 453
+ +A + L+ G + + Q L W + + +
Sbjct: 346 SGREPLSALVCVQVTLKPGE-TIALRFAQVMDHSKILLDGWESEKAYARYFDAQSRAQSI 404
Query: 454 TDGSPP----VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYT 509
+ + P + + R+F + SQ+ VD ND+A+ +
Sbjct: 405 LNQTLPRLSEIEQNIISQQREFLDNASQT----FVD----NDSALR------------FA 444
Query: 510 PVALNS-SFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQ 568
+A+NS SF +EN+ L E ++Y +N+ DV+FY SF+L+ L P++ +
Sbjct: 445 TMAMNSLSFLAESTVWDKENV--LLVKECVDYPFFNSLDVYFYGSFSLLYLLPELDGCVM 502
Query: 569 RDFAAAVMMHDPSKMK-----------LLDDGQWVSRKVLGAVPHDIGICDPW-FEVNAY 616
+ FA A++ D + L+D+ SR GAV HD+G P+ + +AY
Sbjct: 503 KHFAKAILNSDEKLRRYWEYEDKPYAELVDEKYQGSRAERGAVIHDLG--SPFDIDPDAY 560
Query: 617 CLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIEND 676
++ WKDL PKF+L VYR TGD++ ++ W SV ++ Y+ DGD +
Sbjct: 561 SWHNVKEWKDLAPKFILMVYRHWQKTGDREVVESCWSSVKESIDYLINLIEDGDTLPLTR 620
Query: 677 GFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR 716
G D T+D + G+S Y G LWVA L+AAS LA VG+
Sbjct: 621 G-TDDTFDNLASHGVSIYCGSLWVAGLRAASELALLVGEE 659
>gi|84386772|ref|ZP_00989797.1| hypothetical protein V12B01_19101 [Vibrio splendidus 12B01]
gi|84378300|gb|EAP95158.1| hypothetical protein V12B01_19101 [Vibrio splendidus 12B01]
Length = 908
Score = 126 bits (316), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 93/309 (30%), Positives = 154/309 (49%), Gaps = 41/309 (13%)
Query: 486 VPNQNDTAVNILERMSSILEQI------------YTPVALNS-SF-GTNLLQDGEENIGQ 531
+P I+E+ ++ LEQ Y +A+NS SF + + D E+ +
Sbjct: 337 LPKLETIEQQIVEQQTTFLEQAQNKISQPESALRYATMAMNSLSFLAESTVWDKED---K 393
Query: 532 FLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMK------- 584
FL E ++Y +N+ DV+FY SF L+ L P++ + ++F+ A++ D ++ +
Sbjct: 394 FLVKECVDYPFFNSLDVYFYGSFTLLYLLPELDGCVMKEFSKAILAEDFTERRYWEYEAT 453
Query: 585 ----LLDDGQWVSRKVLGAVPHDIGICDPW-FEVNAYCLYDTARWKDLNPKFVLQVYRDV 639
L+DD R + GAV HD+G P+ + +AY ++ WKDL PK++L VYR
Sbjct: 454 PNAELIDDKYQGVRAIRGAVIHDLG--SPFDIQPDAYSWHNVKEWKDLAPKYILMVYRHY 511
Query: 640 VATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
T D K W +V ++ ++ +GD + G D T+D + GIS Y LW
Sbjct: 512 QNTQDISVVKECWQAVTESIDFLSNLIAEGDDLPLTRG-TDDTFDNLASHGISIYCASLW 570
Query: 700 VAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGS--YFNYDNSGSSQSSSIQ 756
VA LQAAS LA+ +G+ + +L + +KA E+ LW+ Y+++ + +Q
Sbjct: 571 VAGLQAASELAQLMGESERANGYLTRSKKALATVEQSLWDDKEGYYHF------FVTPVQ 624
Query: 757 ADQLAGQWY 765
A L G Y
Sbjct: 625 AKHLTGAGY 633
>gi|148976285|ref|ZP_01813009.1| hypothetical protein VSWAT3_18783 [Vibrionales bacterium SWAT-3]
gi|145964379|gb|EDK29634.1| hypothetical protein VSWAT3_18783 [Vibrionales bacterium SWAT-3]
Length = 982
Score = 126 bits (316), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 92/299 (30%), Positives = 153/299 (51%), Gaps = 41/299 (13%)
Query: 496 ILERMSSILEQI------------YTPVALNS-SF-GTNLLQDGEENIGQFLYLEGIEYL 541
I+E+ ++ LEQ Y +A+NS SF + + D E+ +FL E ++Y
Sbjct: 421 IVEQQTAFLEQAQSKISQPESALRYATMAMNSLSFLAESTVWDKED---KFLVKECVDYP 477
Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMK-----------LLDDGQ 590
+N+ DV+FY SF+L+ L P++ + ++F+ A++ D ++ + L+D+
Sbjct: 478 FFNSLDVYFYGSFSLLYLLPELDGCVMKEFSKAILAEDFTQRRYWEYEATPNAELIDEKY 537
Query: 591 WVSRKVLGAVPHDIGICDPW-FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAK 649
R + GAV HD+G P+ + +AY ++ WKDL PK++L VYR T D K
Sbjct: 538 QGVRAIRGAVIHDLG--SPFDIQPDAYSWHNVKEWKDLAPKYILMVYRHYQNTQDMSVVK 595
Query: 650 AVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASAL 709
W +V ++ ++ DGD + G D T+D + GIS Y LWVA LQAAS L
Sbjct: 596 ECWQAVTESIDFLSNLIADGDDLPLTRG-TDDTFDNLASHGISIYCASLWVAGLQAASEL 654
Query: 710 AREVGDRGSEDYFLFKFQKAKVVYEK-LWNGS--YFNYDNSGSSQSSSIQADQLAGQWY 765
A+ +G+ +L + +KA E+ LW+ Y+++ + +QA L G+ Y
Sbjct: 655 AKLMGEHDLGAGYLTRSKKALATVEQSLWDEKEGYYHF------FVTPVQAKHLTGEGY 707
>gi|86144926|ref|ZP_01063258.1| hypothetical protein MED222_10953 [Vibrio sp. MED222]
gi|85837825|gb|EAQ55937.1| hypothetical protein MED222_10953 [Vibrio sp. MED222]
Length = 1024
Score = 125 bits (313), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 92/301 (30%), Positives = 152/301 (50%), Gaps = 31/301 (10%)
Query: 508 YTPVALNS-SF-GTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQL 565
Y +A+NS SF + + D E+ +FL E ++Y +N+ DV+FY SF+L+ L P++
Sbjct: 487 YATMAMNSLSFLAESTVWDKED---KFLVKECVDYPFFNSLDVYFYGSFSLLYLLPELDG 543
Query: 566 SIQRDFAAAVMMHDPSKMK-----------LLDDGQWVSRKVLGAVPHDIGICDPW-FEV 613
+ ++F+ A++ D ++ + L+DD R + GAV HD+G P+ +
Sbjct: 544 CVMKEFSKAILAEDFTERRYWEYEATPNAELIDDKYQGVRAIRGAVIHDLG--SPFDIQP 601
Query: 614 NAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMI 673
+AY ++ WKDL PK++L VYR T D K W +V ++ ++ +GD +
Sbjct: 602 DAYSWHNVKEWKDLAPKYILMVYRHYQNTQDISVVKECWQAVTESIDFLSNLIAEGDDLP 661
Query: 674 ENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY 733
G D T+D + GIS Y LWVA LQAAS LA+ +G+ +L + +KA
Sbjct: 662 LTRG-TDDTFDNLASHGISIYCASLWVAGLQAASELAQLMGESELASGYLTRSKKALATV 720
Query: 734 EK-LWNGS--YFNYDNSGSSQSSSIQADQLAGQWYA--RACGLLPIVDEDKARSALKKVY 788
E+ LW+ Y+++ + +QA L G+ Y GL+ D ++ L
Sbjct: 721 EQSLWDDKEGYYHF------FVTPVQAKHLTGEGYQALETLGLILTGDAIADKNTLNTYL 774
Query: 789 N 789
N
Sbjct: 775 N 775
>gi|218677125|ref|YP_002395944.1| hypothetical protein VS_II1384 [Vibrio splendidus LGP32]
gi|218325393|emb|CAV27494.1| hypothetical protein VS_II1384 [Vibrio splendidus LGP32]
Length = 1069
Score = 124 bits (311), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 91/299 (30%), Positives = 152/299 (50%), Gaps = 41/299 (13%)
Query: 496 ILERMSSILEQI------------YTPVALNS-SF-GTNLLQDGEENIGQFLYLEGIEYL 541
I+E+ ++ LEQ Y +A+NS SF + + D E+ +FL E ++Y
Sbjct: 508 IVEQQTTFLEQAQSKISQPESALRYATMAMNSLSFLAESTVWDKED---KFLVKECVDYP 564
Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMK-----------LLDDGQ 590
+N+ DV+FY SF+L+ L P++ + ++F+ A++ D ++ + L+DD
Sbjct: 565 FFNSLDVYFYGSFSLLYLLPELDGCVMKEFSKAILAEDFTQRRYWEYEATPNAELIDDKY 624
Query: 591 WVSRKVLGAVPHDIGICDPW-FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAK 649
R + GAV HD+G P+ + +AY ++ WKDL PK++L VYR T D K
Sbjct: 625 QGVRAIRGAVIHDLG--SPFDIQPDAYSWHNVKEWKDLAPKYILMVYRHYQNTQDISVVK 682
Query: 650 AVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASAL 709
W +V ++ ++ +GD + G D T+D + GIS Y LWVA LQAAS L
Sbjct: 683 ECWQAVTESIDFLSNLIAEGDDLPLTRG-TDDTFDNLASHGISIYCASLWVAGLQAASEL 741
Query: 710 AREVGDRGSEDYFLFKFQKAKVVYEK-LWNGS--YFNYDNSGSSQSSSIQADQLAGQWY 765
A+ + + +L + +KA E+ LW+ Y+++ + +QA L G+ Y
Sbjct: 742 AKLMNESELAAGYLTRSKKALATVEQSLWDDKEGYYHF------FVTPVQAKHLTGEGY 794
>gi|417950561|ref|ZP_12593680.1| hypothetical protein VISP3789_13285 [Vibrio splendidus ATCC 33789]
gi|342806239|gb|EGU41472.1| hypothetical protein VISP3789_13285 [Vibrio splendidus ATCC 33789]
Length = 1046
Score = 124 bits (310), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 91/299 (30%), Positives = 153/299 (51%), Gaps = 41/299 (13%)
Query: 496 ILERMSSILEQI------------YTPVALNS-SF-GTNLLQDGEENIGQFLYLEGIEYL 541
I+E+ ++ LEQ Y +A+NS SF + + D E+ +FL E ++Y
Sbjct: 485 IVEQQTAFLEQAQSKISQPESALRYATMAMNSLSFLAESTVWDKED---KFLVKECVDYP 541
Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMK-----------LLDDGQ 590
+N+ DV+FY SF+L+ L P++ + ++F+ A++ D ++ + L+D+
Sbjct: 542 FFNSLDVYFYGSFSLLYLLPELDGCVMKEFSKAILAEDFTQRRYWEYEATPNAELIDEKY 601
Query: 591 WVSRKVLGAVPHDIGICDPW-FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAK 649
R + GAV HD+G P+ + +AY ++ WKDL PK++L VYR T D K
Sbjct: 602 QGVRAIRGAVIHDLG--SPFDIQPDAYSWHNVKEWKDLAPKYILMVYRHYQKTQDISVVK 659
Query: 650 AVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASAL 709
W +V ++ ++ +GD + G D T+D + GIS Y LWVA LQAAS L
Sbjct: 660 ECWQAVTESIDFLSNLIAEGDDLPLTRG-TDDTFDNLASHGISIYCASLWVAGLQAASEL 718
Query: 710 AREVGDRGSEDYFLFKFQKAKVVYEK-LWNGS--YFNYDNSGSSQSSSIQADQLAGQWY 765
A+ +G+ +L + +KA E+ LW+ Y+++ + +QA L G+ Y
Sbjct: 719 AKLMGEHDLGAGYLTRSKKALATVEQSLWDEKEGYYHF------FVTPVQAKHLTGEGY 771
>gi|47196687|emb|CAF89164.1| unnamed protein product [Tetraodon nigroviridis]
Length = 239
Score = 123 bits (308), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 91/156 (58%), Gaps = 20/156 (12%)
Query: 534 YLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVS 593
+ G EY M+NTYDVH+Y+SFALIML+PK+ LS+Q D A +V+ DP++ L +G+
Sbjct: 1 FFPGQEYRMYNTYDVHYYASFALIMLWPKLALSLQYDIAGSVVQQDPTERLYLMNGRCSP 60
Query: 594 RKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWP 653
K G VPHDIG DP + W+ +VYRD T D ++ + +WP
Sbjct: 61 VKTKGVVPHDIG--DP----------EDEPWQ--------RVYRDFHLTQDAQYLRDMWP 100
Query: 654 SVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVS 689
+ QFD DGDG+IEN G+ DQTYD W+ S
Sbjct: 101 ICEMVTESELQFDLDGDGLIENSGYADQTYDGWAAS 136
Score = 72.8 bits (177), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 32/61 (52%), Positives = 46/61 (75%)
Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
++K ++ALK +++ NV+ GG+ GAVNGM P+G D SS+QS E+W GV Y +AA+MIH
Sbjct: 178 KEKIQTALKSIFDLNVMSFAGGQMGAVNGMRPEGVPDRSSVQSDEVWIGVVYGLAATMIH 237
Query: 837 E 837
E
Sbjct: 238 E 238
>gi|15922862|ref|NP_378531.1| hypothetical protein ST2526 [Sulfolobus tokodaii str. 7]
gi|15623653|dbj|BAB67640.1| putative glycosidase [Sulfolobus tokodaii str. 7]
Length = 646
Score = 122 bits (307), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 79/252 (31%), Positives = 127/252 (50%), Gaps = 11/252 (4%)
Query: 598 GAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYV 657
G VPHD+G + E Y W DL +VL +YRD T D F K + +
Sbjct: 354 GEVPHDLG--EESIEAPIYGASYGYPWTDLGSTWVLMIYRDYKFTNDLAFLKRNYRKMKE 411
Query: 658 AMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
+ ++ D+D D + ++ G D +YD + G S+Y G +++ AL+A + ++ +G
Sbjct: 412 VIDWLISLDKDKDCIPDSKGGFDNSYDGTYMYGASSYVGSMFLCALRAFISASKILG--- 468
Query: 718 SEDYFLFK--FQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIV 775
+Y +++ ++ + + LWNG YF S + + S + Q+ GQ++ GL PI+
Sbjct: 469 -MEYSIYEDCLRRGIMTFNSLWNGKYFIAWKSDNRKKESCMSSQILGQFWCDILGLEPII 527
Query: 776 DEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMI 835
DEDK AL+ +Y N K VN + PDG +D + Q R WS V +AV+A MI
Sbjct: 528 DEDKIVQALRSIYELN---GKASKFCLVNSVNPDGSIDTETDQMRSCWSRVAFAVSAHMI 584
Query: 836 HEDLADIGFQTA 847
L + G + A
Sbjct: 585 IRGLKNEGIEIA 596
>gi|374921957|gb|AFA26156.1| non-lysosomal glucosylceramidase-like protein, partial [Lolium
perenne]
Length = 109
Score = 122 bits (307), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 72/109 (66%), Positives = 84/109 (77%), Gaps = 1/109 (0%)
Query: 657 VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR 716
AM YM+QFDRD D MIENDGFPDQTYD W+V G+SAY G LW+A+LQAA+A+AR +G
Sbjct: 1 TAMEYMEQFDRDDDSMIENDGFPDQTYDAWTVLGVSAYCGCLWLASLQAAAAMARSLGHA 60
Query: 717 GSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQW 764
+ + KF KAK V+E KLWNGSYFNYD+ S S SIQADQLAGQW
Sbjct: 61 DYAERCMVKFAKAKHVFEAKLWNGSYFNYDSGTSYSSRSIQADQLAGQW 109
>gi|355689818|gb|AER98955.1| glucosidase, beta 2 [Mustela putorius furo]
Length = 157
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 64/166 (38%), Positives = 98/166 (59%), Gaps = 12/166 (7%)
Query: 117 RWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKG 176
RWQ+ P + + + V+A+QF+V + R Y VL + P VL+ SW+W L G
Sbjct: 1 RWQLNPGMYQHRTVIADQFTVCLRREGKTVYQQVLSVERPSVLR--------SWNWGLCG 52
Query: 177 DKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTS 236
+ YHALYPRAWTV++ P + + CRQI+PI+PH+Y++SS PV VF + + N G +
Sbjct: 53 YFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEA 111
Query: 237 ADITLLFTWTNSVGGDSEFTGQHYNSKTKM-NDG--VHAVLLHHRT 279
D++++F+ N +G + G +N + DG V +LLHH T
Sbjct: 112 LDVSIMFSMRNGLGVGDDAPGGLWNEPFCLERDGETVQGLLLHHPT 157
>gi|284173198|ref|ZP_06387167.1| hypothetical protein Ssol98_00875 [Sulfolobus solfataricus 98/2]
Length = 647
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 87/304 (28%), Positives = 143/304 (47%), Gaps = 18/304 (5%)
Query: 598 GAVPHDIG---ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
G PHD+G I DP + +Y + W DL P VL +YRD V T +++ + +
Sbjct: 355 GETPHDVGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDYVFTSNREILEKNYNK 409
Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
+ + ++ + D D D + ++ G D +YD + G S+Y ++++AL A ++ +
Sbjct: 410 IKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYIASMFLSALTAFIKMSEILD 469
Query: 715 DRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPI 774
+ + Y+ F + K + LWNG YF +++S QL GQ++ GL PI
Sbjct: 470 VKIDDKYYRF-LECGKKTFNSLWNGKYFILWKKNDEENTSCLNSQLLGQFWCDILGLPPI 528
Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASM 834
D DK +AL+ +Y N K N + DG VD S+ Q R W V++AVAA M
Sbjct: 529 TDHDKINTALRSIYELN---FKASKYCLTNAVREDGSVDSSTAQLRSCWPRVSFAVAAHM 585
Query: 835 IHEDLADIGFQTACGIYEAAWSGTGLGYAF-QTPEAWNTDDQYRSL-CYMRPLAIWAMQW 892
I + G + A + W F Q+ D +Y L Y+ ++W ++
Sbjct: 586 ILRGMVKEGIEVA----KREWETIKELNPFDQSSRIDAIDGKYVGLMSYIGSTSVWLLKL 641
Query: 893 ALTR 896
AL +
Sbjct: 642 ALDK 645
>gi|15899397|ref|NP_344002.1| hypothetical protein SSO2674 [Sulfolobus solfataricus P2]
gi|384433011|ref|YP_005642369.1| hypothetical protein [Sulfolobus solfataricus 98/2]
gi|13815992|gb|AAK42792.1| Conserved hypothetical protein [Sulfolobus solfataricus P2]
gi|261601165|gb|ACX90768.1| protein of unknown function DUF608 [Sulfolobus solfataricus 98/2]
Length = 648
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 87/304 (28%), Positives = 143/304 (47%), Gaps = 18/304 (5%)
Query: 598 GAVPHDIG---ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
G PHD+G I DP + +Y + W DL P VL +YRD V T +++ + +
Sbjct: 356 GETPHDVGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDYVFTSNREILEKNYNK 410
Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
+ + ++ + D D D + ++ G D +YD + G S+Y ++++AL A ++ +
Sbjct: 411 IKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYIASMFLSALTAFIKMSEILD 470
Query: 715 DRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPI 774
+ + Y+ F + K + LWNG YF +++S QL GQ++ GL PI
Sbjct: 471 VKIDDKYYRF-LECGKKTFNSLWNGKYFILWKKNDEENTSCLNSQLLGQFWCDILGLPPI 529
Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASM 834
D DK +AL+ +Y N K N + DG VD S+ Q R W V++AVAA M
Sbjct: 530 TDHDKINTALRSIYELN---FKASKYCLTNAVREDGSVDSSTAQLRSCWPRVSFAVAAHM 586
Query: 835 IHEDLADIGFQTACGIYEAAWSGTGLGYAF-QTPEAWNTDDQYRSL-CYMRPLAIWAMQW 892
I + G + A + W F Q+ D +Y L Y+ ++W ++
Sbjct: 587 ILRGMVKEGIEVA----KREWETIKELNPFDQSSRIDAIDGKYVGLMSYIGSTSVWLLKL 642
Query: 893 ALTR 896
AL +
Sbjct: 643 ALDK 646
>gi|159042199|ref|YP_001541451.1| hypothetical protein Cmaq_1638 [Caldivirga maquilingensis IC-167]
gi|157921034|gb|ABW02461.1| protein of unknown function DUF608 [Caldivirga maquilingensis
IC-167]
Length = 661
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 90/298 (30%), Positives = 138/298 (46%), Gaps = 32/298 (10%)
Query: 553 SFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG---ICDP 609
SFAL+ LFP L+++ D +H+ G VPHD+G I DP
Sbjct: 339 SFALLDLFP--DLTVKMDELLGFYIHN------------------GEVPHDLGEESIEDP 378
Query: 610 WFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDG 669
+ A LY W DL ++L +YRD + TG+ + + + + ++ D DG
Sbjct: 379 IY--GASYLYP---WNDLGSTWILMIYRDYLLTGNVEVLRRNIDKMREVIDWLISRDYDG 433
Query: 670 DGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA 729
D + ++ G D +YD ++ G S+Y L++ +LQA A +G R S D + K
Sbjct: 434 DCIPDSRGGFDNSYDGTNMYGASSYIASLFLCSLQAFIKSAEVLGVRLS-DRYESCLSKG 492
Query: 730 KVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
+ LWNG YF S + + S QL GQ++ L P+VDEDK + AL+ +Y
Sbjct: 493 RETLNSLWNGRYFMAWKSSGNSNESCMNSQLLGQFWCDFLKLPPVVDEDKIKVALRSIYE 552
Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTA 847
N N + P G +D SS Q R W V++ V A M+ + + G + A
Sbjct: 553 LN---HKSSPHCLPNSVKPSGEIDTSSGQMRSCWPRVSFVVTAHMVLRGMVNEGLEIA 607
>gi|407069309|ref|ZP_11100147.1| hypothetical protein VcycZ_07137 [Vibrio cyclitrophicus ZF14]
Length = 1024
Score = 119 bits (299), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 85/275 (30%), Positives = 142/275 (51%), Gaps = 29/275 (10%)
Query: 508 YTPVALNS-SF-GTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQL 565
Y +A+NS SF + + D E+ +FL E ++Y +N+ DV+FY SF+L+ L P++
Sbjct: 487 YATMAMNSLSFLAESTVWDKED---KFLVKECVDYPFFNSLDVYFYGSFSLLYLLPELDG 543
Query: 566 SIQRDFAAAVMMHDPSKMK-----------LLDDGQWVSRKVLGAVPHDIGICDPW-FEV 613
+ ++F+ A++ D ++ + L+D+ R + GAV HD+G P+ +
Sbjct: 544 CVMKEFSKAILAEDFTQRRYWEYEATPNAELIDEKYQGVRAIRGAVIHDLG--SPFDIQP 601
Query: 614 NAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMI 673
+AY ++ WKDL PK++L VYR T D K W +V ++ ++ +GD +
Sbjct: 602 DAYSWHNVKEWKDLAPKYILMVYRHYQNTQDLSVVKECWQAVTESIEFLSNLVAEGDDLP 661
Query: 674 ENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY 733
G D T+D + GIS Y LWVA LQAA+ LA + + +L + +KA
Sbjct: 662 LTRG-TDDTFDNLASHGISIYCASLWVAGLQAANELALLMNENELAADYLARSKKALATV 720
Query: 734 EK-LWNGS--YFNYDNSGSSQSSSIQADQLAGQWY 765
E+ LW+ Y+++ + +QA L GQ Y
Sbjct: 721 EQSLWDDKEGYYHF------FVTPVQAKHLTGQGY 749
Score = 43.5 bits (101), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 56/117 (47%), Gaps = 10/117 (8%)
Query: 758 DQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSM 817
D L Y + GL I ED+ + AL VY +N ++ K G N L DG +
Sbjct: 821 DALLADSYLKLIGLEGIFPEDRIQRALDYVYKHN-FEINSPKLGVANMTLADGS-PHEAF 878
Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTAC---GIYEAAWSGTGLGYAFQTPEAWN 871
Q++++W GV ++VA ++ +LA Q +Y A + + + +A PE +N
Sbjct: 879 QAQDVWIGVQFSVATAL---NLAGKSQQAETLMDTVYTALYDYSKIPFA--APEGFN 930
>gi|284997264|ref|YP_003419031.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
gi|284445159|gb|ADB86661.1| protein of unknown function DUF608 [Sulfolobus islandicus L.D.8.5]
Length = 740
Score = 117 bits (293), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 72/235 (30%), Positives = 121/235 (51%), Gaps = 6/235 (2%)
Query: 598 GAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYV 657
G VPHD+GI V+ +WKD NP F+L VYR T D F K ++P +
Sbjct: 450 GYVPHDLGIHSLDHPVDGTT--SPPKWKDTNPTFILLVYRYYKFTNDLDFLKEMYPKMLK 507
Query: 658 AMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
AM + D+D DG+ E +G D +D SV GI +Y+ +++A++ A A+ + D
Sbjct: 508 AMKWELTQDKDKDGVPELEGQGDTGFDAMSVKGIDSYTTSIYIASIIALKETAKILKDND 567
Query: 718 SEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDE 777
+ + +K++ ++ +L+NG YF+ G + I QL G+W++ GL PI++E
Sbjct: 568 TLNEMTILLEKSRKIFSRLFNGKYFD-PWIGEPEIKGIFLGQLVGEWWSEILGLEPIIEE 626
Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAA 832
+K SAL+ + + N N + DG++ S QS W + +A+++
Sbjct: 627 EKISSALEAMLSIN---ANSSNFCTPNIVSEDGKIIDISPQSYSSWPRLVFAISS 678
Score = 47.0 bits (110), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 72/360 (20%), Positives = 130/360 (36%), Gaps = 48/360 (13%)
Query: 96 GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKT 155
GVPLGG+G+G I + +G+ IF P + F + + +G +
Sbjct: 74 GVPLGGIGAGKIEINNKGKVVNVTIFNNKGFPIPNIRG-FHILIKPDDGNPFF------- 125
Query: 156 PEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNY 215
+ KD + +L+ D Y LYP A+ + G D ++R SP+IP N
Sbjct: 126 --LEKDLPFISLK----HLESDILIYEGLYPFAYLI--GIKD-KIRAKLTAFSPLIPRNI 176
Query: 216 KESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLL 275
+S+ P F+ I G S I + F G + + +D +
Sbjct: 177 NDSTLPAVGFSLKI--EGSNSGLIAISF---------PNIVGSNPIGRVNRSDEDKIIFT 225
Query: 276 HHRTSHQLP-PVTFALAAQETDGV---HVSLCPHFVISGNSL--GLTAKDMWHEIKEHGS 329
+ R+ P L TD + ++++ P + S +D W I
Sbjct: 226 NLRSIDTDPRKGEIVLMGDTTDLITQYNINVKPEIALKNKSWKEKFENEDPWKSIINGEK 285
Query: 330 FDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRY 389
+ + + +P A + + + ++ F L+W ++ GK Y
Sbjct: 286 YKEDVHEVSGLWDDP--------AGMLITRYNKDSEIKFVLSW------YIRGKHVLYPY 331
Query: 390 TKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAG 449
+Y + + A IA Q + + + + LPEW + N Y L++
Sbjct: 332 GTYYQRNFSNAIEIAEYFYKNFERLRNQSTNFHKIVYNNTDLPEWLKDAIINSAYILSSN 391
>gi|327310049|ref|YP_004336946.1| hypothetical protein TUZN_0133 [Thermoproteus uzoniensis 768-20]
gi|326946528|gb|AEA11634.1| hypothetical protein TUZN_0133 [Thermoproteus uzoniensis 768-20]
Length = 660
Score = 117 bits (292), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 88/299 (29%), Positives = 138/299 (46%), Gaps = 32/299 (10%)
Query: 552 SSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG---ICD 608
+ FAL+ LFP + + FA + G +PHD G I D
Sbjct: 334 AGFALLELFPDLVVRADEVFAMY--------------------RRGGEIPHDYGEESIED 373
Query: 609 PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRD 668
P + Y T WKDL +VL +YRD TGD + + ++ A+ ++ DRD
Sbjct: 374 PIYGAT----YLTP-WKDLASTWVLMIYRDYYYTGDVEILRRNIDAMKDAVDWLMSLDRD 428
Query: 669 GDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK 728
GD + ++ G D +YD ++ G AY L++ AL A A +G + E +L ++
Sbjct: 429 GDCVPDSRGRNDNSYDGSNMYGRVAYVASLFLCALTAYIKAAERLGLK-PEPKYLECLER 487
Query: 729 AKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVY 788
K ++LWNG Y+ G + + + QL GQ++ L PI +DK +AL+ +Y
Sbjct: 488 GKRSLDELWNGRYYVAWADGPRRKDACMSSQLLGQFWCDMLDLPPIAPDDKVAAALRSIY 547
Query: 789 NYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTA 847
L + K N + PDG+ D + Q R W+ V +AVAA M+ + G + A
Sbjct: 548 E---LGMRASKYCIPNSVTPDGKPDEETPQLRSCWTRVNFAVAAHMLLRGMEREGMEVA 603
>gi|385774575|ref|YP_005647144.1| hypothetical protein [Sulfolobus islandicus HVE10/4]
gi|323478691|gb|ADX83929.1| conserved hypothetical protein [Sulfolobus islandicus HVE10/4]
Length = 661
Score = 115 bits (289), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 77/253 (30%), Positives = 120/253 (47%), Gaps = 12/253 (4%)
Query: 598 GAVPHDIG---ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
G PHDIG I DP + +Y + W DL P VL +YRD V T D+ + +
Sbjct: 369 GEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDYVFTSDRGILEKNYNK 423
Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
+ + ++ + D D D + ++ G D +YD + G S+Y ++++AL A ++ +
Sbjct: 424 IKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMFLSALAAFIKISEILD 483
Query: 715 DRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPI 774
+ Y+ F + K + LWNG YF ++ S QL GQ++ GL PI
Sbjct: 484 VKIDTKYYRF-LECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQLLGQFWCDILGLPPI 542
Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASM 834
D DK +AL+ +Y N K N + D +D S+ Q R W V++AVAA M
Sbjct: 543 TDNDKINTALRSIYELN---FRASKYCLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHM 599
Query: 835 IHEDLADIGFQTA 847
I + G + A
Sbjct: 600 ILRGMVKEGMEVA 612
>gi|385777223|ref|YP_005649791.1| hypothetical protein [Sulfolobus islandicus REY15A]
gi|323475970|gb|ADX86576.1| conserved hypothetical protein [Sulfolobus islandicus REY15A]
Length = 661
Score = 115 bits (289), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 77/253 (30%), Positives = 120/253 (47%), Gaps = 12/253 (4%)
Query: 598 GAVPHDIG---ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
G PHDIG I DP + +Y + W DL P VL +YRD V T D+ + +
Sbjct: 369 GEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDYVFTSDRGILEKNYNK 423
Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
+ + ++ + D D D + ++ G D +YD + G S+Y ++++AL A ++ +
Sbjct: 424 IKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMFLSALAAFIKISEILD 483
Query: 715 DRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPI 774
+ Y+ F + K + LWNG YF ++ S QL GQ++ GL PI
Sbjct: 484 VKIDTKYYRF-LECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQLLGQFWCDILGLPPI 542
Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASM 834
D DK +AL+ +Y N K N + D +D S+ Q R W V++AVAA M
Sbjct: 543 TDNDKINTALRSIYELN---FRASKYCLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHM 599
Query: 835 IHEDLADIGFQTA 847
I + G + A
Sbjct: 600 ILRGMVKEGMEVA 612
>gi|227831601|ref|YP_002833381.1| hypothetical protein LS215_2801 [Sulfolobus islandicus L.S.2.15]
gi|229580553|ref|YP_002838953.1| hypothetical protein YG5714_2812 [Sulfolobus islandicus Y.G.57.14]
gi|238621059|ref|YP_002915885.1| hypothetical protein M164_2625 [Sulfolobus islandicus M.16.4]
gi|284999153|ref|YP_003420921.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
gi|227458049|gb|ACP36736.1| protein of unknown function DUF608 [Sulfolobus islandicus L.S.2.15]
gi|228011269|gb|ACP47031.1| protein of unknown function DUF608 [Sulfolobus islandicus
Y.G.57.14]
gi|238382129|gb|ACR43217.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.4]
gi|284447049|gb|ADB88551.1| protein of unknown function DUF608 [Sulfolobus islandicus L.D.8.5]
Length = 661
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 77/253 (30%), Positives = 120/253 (47%), Gaps = 12/253 (4%)
Query: 598 GAVPHDIG---ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
G PHDIG I DP + +Y + W DL P VL +YRD V T D+ + +
Sbjct: 369 GEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDYVFTSDRGILEKNYNK 423
Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
+ + ++ + D D D + ++ G D +YD + G S+Y ++++AL A ++ +
Sbjct: 424 IKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMFLSALAAFIKISEILD 483
Query: 715 DRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPI 774
+ Y+ F + K + LWNG YF ++ S QL GQ++ GL PI
Sbjct: 484 VKIDTKYYRF-LECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQLLGQFWCDILGLPPI 542
Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASM 834
D DK +AL+ +Y N K N + D +D S+ Q R W V++AVAA M
Sbjct: 543 TDNDKINTALRSIYELN---FRASKYCLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHM 599
Query: 835 IHEDLADIGFQTA 847
I + G + A
Sbjct: 600 ILRGMVKEGMEVA 612
>gi|227828867|ref|YP_002830647.1| hypothetical protein M1425_2640 [Sulfolobus islandicus M.14.25]
gi|227460663|gb|ACP39349.1| protein of unknown function DUF608 [Sulfolobus islandicus M.14.25]
Length = 661
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 77/253 (30%), Positives = 120/253 (47%), Gaps = 12/253 (4%)
Query: 598 GAVPHDIG---ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
G PHDIG I DP + +Y + W DL P VL +YRD V T D+ + +
Sbjct: 369 GEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDYVFTSDRGILEKNYNK 423
Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
+ + ++ + D D D + ++ G D +YD + G S+Y ++++AL A ++ +
Sbjct: 424 IKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMFLSALAAFIKISEILD 483
Query: 715 DRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPI 774
+ Y+ F + K + LWNG YF ++ S QL GQ++ GL PI
Sbjct: 484 VKIDTKYYRF-LECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQLLGQFWCDILGLPPI 542
Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASM 834
D DK +AL+ +Y N K N + D +D S+ Q R W V++AVAA M
Sbjct: 543 TDNDKINTALRSIYELN---FRASKYCLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHM 599
Query: 835 IHEDLADIGFQTA 847
I + G + A
Sbjct: 600 ILRGMVKEGMEVA 612
>gi|229586074|ref|YP_002844576.1| hypothetical protein M1627_2694 [Sulfolobus islandicus M.16.27]
gi|228021124|gb|ACP56531.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.27]
Length = 661
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 77/253 (30%), Positives = 120/253 (47%), Gaps = 12/253 (4%)
Query: 598 GAVPHDIG---ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
G PHDIG I DP + +Y + W DL P VL +YRD V T D+ + +
Sbjct: 369 GEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDYVFTSDRGILEKNYNK 423
Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
+ + ++ + D D D + ++ G D +YD + G S+Y ++++AL A ++ +
Sbjct: 424 IKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMFLSALAAFIKISEILD 483
Query: 715 DRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPI 774
+ Y+ F + K + LWNG YF ++ S QL GQ++ GL PI
Sbjct: 484 VKIDTKYYRF-LECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQLLGQFWCDILGLPPI 542
Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASM 834
D DK +AL+ +Y N K N + D +D S+ Q R W V++AVAA M
Sbjct: 543 TDNDKINTALRSIYELN---FRASKYCLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHM 599
Query: 835 IHEDLADIGFQTA 847
I + G + A
Sbjct: 600 ILRGMVKEGMEVA 612
>gi|229583411|ref|YP_002841810.1| hypothetical protein YN1551_3001 [Sulfolobus islandicus Y.N.15.51]
gi|228014127|gb|ACP49888.1| protein of unknown function DUF608 [Sulfolobus islandicus
Y.N.15.51]
Length = 661
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 76/253 (30%), Positives = 120/253 (47%), Gaps = 12/253 (4%)
Query: 598 GAVPHDIG---ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
G PHDIG I DP + +Y + W DL P VL +YRD + T D+ + +
Sbjct: 369 GEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDYIFTSDRGILEKNYNK 423
Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
+ + ++ + D D D + ++ G D +YD + G S+Y ++++AL A ++ +
Sbjct: 424 IKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMFLSALAAFIKISEILD 483
Query: 715 DRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPI 774
+ Y+ F + K + LWNG YF ++ S QL GQ++ GL PI
Sbjct: 484 IKIDTKYYRF-LECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQLLGQFWCDILGLPPI 542
Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASM 834
D DK +AL+ +Y N K N + D +D S+ Q R W V++AVAA M
Sbjct: 543 TDNDKINTALRSIYELN---FRASKYCLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHM 599
Query: 835 IHEDLADIGFQTA 847
I + G + A
Sbjct: 600 ILRGMVKEGMEVA 612
>gi|194359640|gb|ACF57667.1| bile acid beta-glucosidase [Ctenopharyngodon idella]
Length = 89
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 52/89 (58%), Positives = 63/89 (70%)
Query: 609 PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRD 668
PW VNAY +YDTA WKDLN KFVLQV+RD T D+++ K +WP M +FD+D
Sbjct: 1 PWVRVNAYLIYDTADWKDLNLKFVLQVFRDYHLTQDEQYLKDMWPICQTVMETELKFDKD 60
Query: 669 GDGMIENDGFPDQTYDTWSVSGISAYSGG 697
GDG+IEN G+ DQTYD W V+G SAY GG
Sbjct: 61 GDGLIENSGYADQTYDGWKVTGPSAYCGG 89
>gi|159041841|ref|YP_001541093.1| hypothetical protein Cmaq_1276 [Caldivirga maquilingensis IC-167]
gi|157920676|gb|ABW02103.1| protein of unknown function DUF608 [Caldivirga maquilingensis
IC-167]
Length = 688
Score = 110 bits (275), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 80/270 (29%), Positives = 128/270 (47%), Gaps = 35/270 (12%)
Query: 530 GQFLYLEGIEY------LMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM 583
G+F LEG+E L Y+ S ++++FP+++ S+ R F A M D
Sbjct: 336 GRFSILEGVEVCPCHGTLAGACYET---GSLPVVLMFPELEKSLLRQFTEA-MRSD---- 387
Query: 584 KLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTA--RWKDLNPKFVLQVYRDVVA 641
G +PH +GI + ++ TA RWKDLN ++L V+R
Sbjct: 388 --------------GYIPHSLGI----YSLDHIEDGTTAPPRWKDLNSTYILLVHRYFKR 429
Query: 642 TGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
+ D +F K ++P + A ++ D+DGDG+ E G D +D SV G +Y+ LW+A
Sbjct: 430 SNDVEFIKEIYPKLIKAFEWVLVQDKDGDGVPELSGDGDTGFDAMSVKGFDSYTTSLWIA 489
Query: 702 ALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLA 761
AL LA+ +GD+ + KA+ Y + W G F + +S A Q+
Sbjct: 490 ALMVMGELAKLMGDQATLSKVESTLLKARDSYNRRWLGDRFKAWDEPDMGKASFLA-QIW 548
Query: 762 GQWYARACGLLPIVDEDKARSALKKVYNYN 791
G+W++ GL I DEDK ++A+ + N
Sbjct: 549 GEWWSLMLGLGHITDEDKVKAAMGTIIRVN 578
>gi|148242848|ref|YP_001228005.1| hypothetical protein SynRCC307_1749 [Synechococcus sp. RCC307]
gi|147851158|emb|CAK28652.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
Length = 220
Score = 108 bits (270), Expect = 1e-20, Method: Composition-based stats.
Identities = 72/211 (34%), Positives = 112/211 (53%), Gaps = 14/211 (6%)
Query: 691 ISAYSGGLWVAALQAASA----LAREVG-DRGSEDY-FLFKFQKAKVVYEKL-WNGSYFN 743
+SAY G LW+AAL+AA A LA E+G D +E F Q+++ +++L WNG ++
Sbjct: 1 MSAYCGALWIAALEAALAMGQTLALELGVDTAAEQRDFSQWLQQSRGNFDQLLWNGEFYK 60
Query: 744 YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAV 803
D S + + ADQL G +YAR GL P+V E ARS+L+ + GG+ G
Sbjct: 61 ID--AESGTPVVMADQLCGDFYARLLGLPPVVSEANARSSLQAIKEACFEGFAGGRLGVA 118
Query: 804 NGMLPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
NG+ DG +D + E+W+G+ + +AA + + ++G G
Sbjct: 119 NGLRRDGTPLDPNGTHPLEVWTGINFGLAAYYRLMGETNTAVAITQAVVNQVYAG---GL 175
Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAMQWA 893
F+TPEA + +R+ Y+R +AIWA+ WA
Sbjct: 176 QFRTPEALTGQNTFRACHYLRAMAIWAL-WA 205
>gi|15922946|ref|NP_378615.1| hypothetical protein ST2609 [Sulfolobus tokodaii str. 7]
gi|15623737|dbj|BAB67724.1| putative glycosidase [Sulfolobus tokodaii str. 7]
Length = 661
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 76/248 (30%), Positives = 129/248 (52%), Gaps = 24/248 (9%)
Query: 598 GAVPHDIGICDPWFEVNAYCLYDTA--RWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSV 655
G VPHD+G C +++ TA +WKDLNP ++L +YR TGD +F K+V+ V
Sbjct: 364 GYVPHDLGYCS----LDSPTDGTTAPPKWKDLNPTYILLIYRYYKLTGDIEFLKSVYDKV 419
Query: 656 YVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD 715
A + +F R G +G D +D + GI++Y+ L++A+L A +++ GD
Sbjct: 420 KKAFEWELKFSRYG-----LEGKMDSAFDVTPIKGINSYTLSLYIASLFAMREISKTAGD 474
Query: 716 RGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIV 775
+ D + ++AK +EK++NG YF + ++ Q+ G+W+ GL PI
Sbjct: 475 NLNLDE---QIKEAKEAFEKMFNGKYF---IAWEGMEDAVFLAQVFGEWWTTLLGLEPIA 528
Query: 776 DEDKARSALKKVYNYNVLKVMGG--KRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAAS 833
DE+K +SAL+ ++KV G K N + DG+V S Q+ W + +A+
Sbjct: 529 DEEKIKSALRW-----IIKVNGNASKYCTPNLVKEDGKVVSLSPQTYSSWPRLVFAICWL 583
Query: 834 MIHEDLAD 841
I + +++
Sbjct: 584 SIEKGISE 591
>gi|261417086|ref|YP_003250769.1| glucosylceramidase [Fibrobacter succinogenes subsp. succinogenes
S85]
gi|385791890|ref|YP_005823013.1| hypothetical protein FSU_3272 [Fibrobacter succinogenes subsp.
succinogenes S85]
gi|261373542|gb|ACX76287.1| Glucosylceramidase [Fibrobacter succinogenes subsp. succinogenes
S85]
gi|302325406|gb|ADL24607.1| conserved domain protein [Fibrobacter succinogenes subsp.
succinogenes S85]
Length = 1047
Score = 107 bits (266), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 76/229 (33%), Positives = 114/229 (49%), Gaps = 18/229 (7%)
Query: 531 QFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMK------ 584
+FL E +Y +N+ DV+FY SF+L+ L P++ + + F A++ + ++ +
Sbjct: 521 RFLVRECADYPFFNSLDVYFYGSFSLLALMPRLDGVVMKRFGDAILAVNNNRRRHHEFVN 580
Query: 585 -----LLDDGQWVSRKVLGAVPHDIGICDPW-FEVNAYCLYDTARWKDLNPKFVLQVYRD 638
L D R V GAV HD+G P+ E +AY ++ WKDL PK+VL VYR
Sbjct: 581 LPYADLPDPKLEGPRAVRGAVIHDLG--SPFDAEPDAYDWHNVKEWKDLAPKYVLMVYRH 638
Query: 639 VVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
T D + +VY AM Y+++ G+ G D T+D GIS Y G L
Sbjct: 639 YHKTKDMQCLADCKEAVYAAMEYLEKMVNPGENFPLTHG-TDDTFDNLCSYGISVYCGSL 697
Query: 699 WVAALQAASALAREVGDRGSEDYFLFKFQKA-KVVYEKLW--NGSYFNY 744
W+A L+AA+ +A +GD + K + A K E LW N YF +
Sbjct: 698 WIAGLRAAAKIAELLGDNEQAAKWNEKSEAANKEFTESLWDENEGYFRF 746
Score = 47.0 bits (110), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 39/162 (24%), Positives = 70/162 (43%), Gaps = 17/162 (10%)
Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
++ + AD + Y R L PI D KA+S L +++N N K GA N +
Sbjct: 830 ANDCDDVFADTMLADTYLRLLDLEPITDSAKAKSNLLRIFNTN-YKANSPLIGAANLVRK 888
Query: 809 DGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYA---F 864
DG +D + Q+ ++W G+ Y++ +M+ G + + + G A F
Sbjct: 889 DGSPLDEFNFQAHDVWIGIQYSIMTAMMFH-----GLEKEASVLADSMIGNLYDEARVPF 943
Query: 865 QTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQM 906
PE +N C + P A+ A ++ L+ + K++
Sbjct: 944 AAPEGFNGS------CRLHPEALVA-KFGLSATAADKMHKEL 978
>gi|15921102|ref|NP_376771.1| hypothetical protein ST0868 [Sulfolobus tokodaii str. 7]
gi|15621887|dbj|BAB65880.1| putative glycosidase [Sulfolobus tokodaii str. 7]
Length = 683
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 72/239 (30%), Positives = 118/239 (49%), Gaps = 23/239 (9%)
Query: 553 SFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFE 612
S ++++FP ++ S F + + +DG +PHD+G F+
Sbjct: 360 SLPVVLMFPMLEKSTIEQFI----------LNMREDGY---------IPHDLGTYS--FD 398
Query: 613 VNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGM 672
+ +WKD N FVL VYR + T DK+F K+V+P V AM+++ D+DGDG+
Sbjct: 399 APSDGTTAPPKWKDTNTTFVLMVYRYYLRTKDKEFLKSVYPYVKKAMSWIISKDKDGDGL 458
Query: 673 IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVV 732
E DG DQ +D + G+ +Y +++AAL+A +A VGD S L K+ ++
Sbjct: 459 PEVDGSTDQGFDCVPIEGVCSYISTVYIAALEAMIKIAEIVGDSTSYYSSLLSKAKSSLM 518
Query: 733 YEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYN 791
KL++G F + ++ + Q+ GQW+A L + D SAL ++Y N
Sbjct: 519 --KLFDGKKFIPWTGKPNHHKAVFSAQIFGQWWAYLLDLDVVADRSAILSALDEIYRVN 575
>gi|332796035|ref|YP_004457535.1| bile acid beta-glucosidase [Acidianus hospitalis W1]
gi|332693770|gb|AEE93237.1| bile acid beta-glucosidase [Acidianus hospitalis W1]
Length = 840
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 98/407 (24%), Positives = 165/407 (40%), Gaps = 96/407 (23%)
Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAV-------- 575
D + Q+ Y +G + NT DV YS ++ LFP++ I F +
Sbjct: 336 DKRKTANQYPYTDGPLHTALNTIDVLTYSLPTILNLFPELAEKIILQFKDKLIEENTPEH 395
Query: 576 ----MMHDPSKMKLLD----------DGQWVSRKV-----------LGAVPHDIGICDPW 610
+ + ++ K L+ DG+ + + G VPH +
Sbjct: 396 VVYSLSFEENREKFLEKLSKDPSLPTDGEKLYSTINEIVKETGKDPKGRVPH-------F 448
Query: 611 FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSV------------YVA 658
F N D DLNP+F+L Y TGD F K ++P + Y
Sbjct: 449 FTDNLRV--DEYHRVDLNPEFILMTYLIAKTTGDLNFLKEIFPKMKEALESTMKTQTYDG 506
Query: 659 MAY------------------MDQFDRDGDGMIENDGFP--DQTYDTWSVSGISAYSGGL 698
+ Y + + D + ++ ++ P QT+D WS+ GI++++ L
Sbjct: 507 LIYHTLPAGLEWLRYVNNKLNLPRGDNNSASILGHNLIPLSMQTFDDWSMIGITSFTSIL 566
Query: 699 WVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFN--YDNSGSSQSSSIQ 756
W++++QA + + S DY K + + LWNG YF+ YD + +
Sbjct: 567 WISSIQAVNDACSNLKINCSYDY----ESLVKKLIDYLWNGEYFDLWYDPKSKMRDKACN 622
Query: 757 ADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-- 814
A Q+ G WY+ G L +D+ ++ LK + YN + + G +NG P+G +
Sbjct: 623 ASQILGHWYSTLLG-LRFLDDSLVKTTLKSIVKYN----LKEEEGLLNGAYPNGYRPLKR 677
Query: 815 ---------SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYE 852
++ Q WSGV + VA+ +I+E L D G + IYE
Sbjct: 678 NYQNQLNLPATTQIDTPWSGVEFYVASHLIYEKLRDEGEKILRNIYE 724
>gi|327309977|ref|YP_004336874.1| hypothetical protein TUZN_0057 [Thermoproteus uzoniensis 768-20]
gi|326946456|gb|AEA11562.1| hypothetical protein TUZN_0057 [Thermoproteus uzoniensis 768-20]
Length = 694
Score = 100 bits (250), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 86/286 (30%), Positives = 132/286 (46%), Gaps = 32/286 (11%)
Query: 553 SFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFE 612
S ++++FP+++ R F + H + DDG VPHD+G
Sbjct: 364 SLPVVLMFPQLE----RSFLEQLARH------MRDDGY---------VPHDLGT----HS 400
Query: 613 VNAYCLYDTA--RWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGD 670
++A TA RWKD NP F+L VYR TGD F K+V+ + A +M DRDGD
Sbjct: 401 LDAPSDGTTAPPRWKDTNPTFILLVYRYYKRTGDIDFVKSVYGRLVAAYRWMRGQDRDGD 460
Query: 671 GMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAK 730
G+ E +G D +D + G SAY+ LW+AAL A LA VGD + + ++A+
Sbjct: 461 GLPELEGSGDSGFDCAKIRGPSAYTSSLWIAALVALRDLAEAVGDSKTAEEAGSDLERAR 520
Query: 731 VVYEKLWNG-SYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
+E+ ++G SY +D G+ + + A + W A D+ SAL V
Sbjct: 521 NSFERSFDGVSYKAWD--GAPEGAVFAALAVGDWWLYLAGSREGAARRDRLASALSAVLR 578
Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREI---WSGVTYAVAA 832
N A G P+ ++S R++ W + ++VAA
Sbjct: 579 VNGSASANCAPNASTGR-PEADGEISERVGRQLTASWPRLIFSVAA 623
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 79/372 (21%), Positives = 133/372 (35%), Gaps = 57/372 (15%)
Query: 96 GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKT 155
GVPLGG+G+G + +G F F VFV + K+ L +
Sbjct: 13 GVPLGGIGAGKVELDNKGRLVN-ATFANNWSRPIRELRGFHVFVYAEDLDKH---LFAQD 68
Query: 156 PEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNY 215
VL G + ++ Y +P W L + S ++P +
Sbjct: 69 VSVLGTLPTLGSAGFV------EAEYEGRWPFVWLRGR---RGGLEVEVEGFSAVVPGDL 119
Query: 216 KESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVL- 274
K+SS P + + S + A +++ N VG N+ ++N+ V
Sbjct: 120 KDSSLPAAGLAVRVRGSRRGIAAVSI----PNVVGA---------NNVGRINEPVPGGFK 166
Query: 275 LHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGN--------SLGLTAKDMWHEIKE 326
+ +H+ P LA D V++ P F + +LG + W +
Sbjct: 167 ARNARAHEYDPQNGELALLAKD--PVAVIPQFNLDVPRSEAPKVCALGPEDERPWLYLMG 224
Query: 327 HGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAW--DCPEVNFMSGKT 384
F + + +P + A+ G+ F L+W D P + G
Sbjct: 225 KADFGPRVAEASGGGEQPAGVVAASYEG---------GEARFVLSWYFDKPWALYPYGH- 274
Query: 385 YYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELY 444
YY + K AA++ R + E+ + WQ ++ D LPEW + N Y
Sbjct: 275 YYENFFK-------DAADVGRYFLDEYDRLRRRTAEWQEGLV-DPALPEWLRDAVINSAY 326
Query: 445 YLNAGGAVWTDG 456
L+AG + DG
Sbjct: 327 ILSAGTWLTKDG 338
>gi|284045431|ref|YP_003395771.1| hypothetical protein Cwoe_3979 [Conexibacter woesei DSM 14684]
gi|283949652|gb|ADB52396.1| protein of unknown function DUF608 [Conexibacter woesei DSM 14684]
Length = 870
Score = 97.8 bits (242), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 72/234 (30%), Positives = 119/234 (50%), Gaps = 25/234 (10%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ-FDRDGDGMIENDGFPDQTYDTWSVSG 690
VL+ YR+ G + + W ++ M ++D+ ++ GDG++ G TYD ++ G
Sbjct: 509 VLKTYREARQGGGEAWLADRWDALVRLMDHVDRTWNSAGDGVLR--GEQPCTYDI-ALHG 565
Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSS 750
+ GGLW+AAL++ +A +G G+ Y + +F++A+ YEKL+NG Y+ +G
Sbjct: 566 PNLLIGGLWLAALRSMEEIALRLGVAGAAGYRV-RFEQARGGYEKLFNGEYYAQPVTGEP 624
Query: 751 Q--SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGK--------- 799
+DQL GQW+A L ++D ++ RSAL+ + +N+ + G +
Sbjct: 625 HDFGDGCLSDQLLGQWWAHQLELGHLLDPERVRSALRAIVAHNLREGFGARVADEQPPGH 684
Query: 800 ----RGAVNGML-----PDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
G +G++ GR D+ E+WSGV YAVAA I E L D G
Sbjct: 685 RVFADGEDSGLVVCSWPRGGRPDVPLRYCDEVWSGVEYAVAAHCIDEGLEDEGL 738
>gi|238619389|ref|YP_002914214.1| hypothetical protein M164_0933 [Sulfolobus islandicus M.16.4]
gi|238380458|gb|ACR41546.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.4]
Length = 661
Score = 97.1 bits (240), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 133/295 (45%), Gaps = 31/295 (10%)
Query: 551 YSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPW 610
+ S +I++FP+++ S + + V G VPHD+G
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYVRND-------------------GYVPHDLGFHSLD 390
Query: 611 FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGD 670
++ +WKD+NP VL VYR T D F K V+P++ M + + RDG
Sbjct: 391 SPIDGTT--SPPKWKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMNWELRQCRDGL 448
Query: 671 GMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAK 730
+E G D +D + G +Y+ L++A+L A +A+ VGD + K A+
Sbjct: 449 PFME--GEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLSVAR 506
Query: 731 VVYEKLWNGSYFN-YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
+ K++NG YF +D G ++S + QL G+W+ L IVDE+ +SAL+ +
Sbjct: 507 EAFRKMFNGKYFKAWD--GVDKASFLA--QLYGEWFTTLLELENIVDENMIKSALESIIR 562
Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
N N + +G++ S+Q+ W + +A+ + + D+ F
Sbjct: 563 LN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYKKGVGDLSF 614
>gi|229584458|ref|YP_002842959.1| hypothetical protein M1627_0991 [Sulfolobus islandicus M.16.27]
gi|228019507|gb|ACP54914.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.27]
Length = 661
Score = 97.1 bits (240), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 133/295 (45%), Gaps = 31/295 (10%)
Query: 551 YSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPW 610
+ S +I++FP+++ S + + V G VPHD+G
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYVRND-------------------GYVPHDLGFHSLD 390
Query: 611 FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGD 670
++ +WKD+NP VL VYR T D F K V+P++ M + + RDG
Sbjct: 391 SPIDGTT--SPPKWKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQCRDGL 448
Query: 671 GMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAK 730
+E G D +D + G +Y+ L++A+L A +A+ VGD + K A+
Sbjct: 449 PFME--GEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLSVAR 506
Query: 731 VVYEKLWNGSYFN-YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
+ K++NG YF +D G ++S + QL G+W+ L IVDE+ +SAL+ +
Sbjct: 507 EAFRKMFNGKYFKAWD--GVDKASFLA--QLYGEWFTTLLELENIVDENMIKSALESIIR 562
Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
N N + +G++ S+Q+ W + +A+ + + D+ F
Sbjct: 563 LN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYKKGVGDLSF 614
>gi|238620760|ref|YP_002915586.1| hypothetical protein M164_2319 [Sulfolobus islandicus M.16.4]
gi|238381830|gb|ACR42918.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.4]
Length = 818
Score = 97.1 bits (240), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 106/437 (24%), Positives = 181/437 (41%), Gaps = 80/437 (18%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPS------- 581
+G++ Y +G E NT DV Y+ +++LFP + +I +D + + D
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406
Query: 582 ---------KMKLLDDGQWVS--RKVLGAVP---HDIGICDPWFEVNAYCLY----DTAR 623
K +++ D + +K+ G + ++ G DP + Y Y DT
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIANETG-KDPKGRMPHYIRYSLTVDTYE 465
Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVWPSV-----------------YVAMAYMDQFD 666
D+NP+FVL Y TGD++ K+V+ Y+ + ++
Sbjct: 466 RIDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTVDGLPYLTLPSGIEWI 525
Query: 667 RDGDGMIE-ND----------GFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD 715
R+ + M+ ND QT D WS G S Y LW++AL+A + ++ + +
Sbjct: 526 RNVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLNN 585
Query: 716 RGSEDYFLFKFQKAKVVYEK----LWNGSYFN--YDNSGSSQSSSIQADQLAGQWYARAC 769
++Y + K + EK LWNG Y+ YD + + S A Q+ G WY +
Sbjct: 586 --PQNY------EVKELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQML 637
Query: 770 GLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYA 829
L +D ++ +S + YN G + G+ N + V + S+QS+ WSGV Y
Sbjct: 638 DLPEFLDYERRKSVFSSIMKYNYSAEEGVRNGSSNDDITPLGVKL-SIQSKAPWSGVEYY 696
Query: 830 VAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWA 889
+A+ M + + +YE + F W YMRPL +
Sbjct: 697 LASHMFSSGFDEYAKKILRNVYERY----EIAGNFWNHIEWGAR-------YMRPLVALS 745
Query: 890 MQWALTRPKPKTLEKQM 906
M +A+ K L K++
Sbjct: 746 MIYAIEGMKVNMLNKEV 762
>gi|227827237|ref|YP_002829016.1| hypothetical protein M1425_0924 [Sulfolobus islandicus M.14.25]
gi|227459032|gb|ACP37718.1| protein of unknown function DUF608 [Sulfolobus islandicus M.14.25]
Length = 661
Score = 97.1 bits (240), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 133/295 (45%), Gaps = 31/295 (10%)
Query: 551 YSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPW 610
+ S +I++FP+++ S + + V G VPHD+G
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYVRND-------------------GYVPHDLGFHSLD 390
Query: 611 FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGD 670
++ +WKD+NP VL VYR T D F K V+P++ M + + RDG
Sbjct: 391 SPIDGTT--SPPKWKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQCRDGL 448
Query: 671 GMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAK 730
+E G D +D + G +Y+ L++A+L A +A+ VGD + K A+
Sbjct: 449 PFME--GEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLSVAR 506
Query: 731 VVYEKLWNGSYFN-YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
+ K++NG YF +D G ++S + QL G+W+ L IVDE+ +SAL+ +
Sbjct: 507 EAFRKMFNGKYFKAWD--GVDKASFLA--QLYGEWFTTLLELENIVDENMIKSALESIIR 562
Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
N N + +G++ S+Q+ W + +A+ + + D+ F
Sbjct: 563 LN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYKKGVGDLSF 614
>gi|15899746|ref|NP_344351.1| hypothetical protein SSO3039 [Sulfolobus solfataricus P2]
gi|13816440|gb|AAK43141.1| Hypothetical protein SSO3039 [Sulfolobus solfataricus P2]
Length = 803
Score = 96.7 bits (239), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 93/390 (23%), Positives = 159/390 (40%), Gaps = 83/390 (21%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHD--------- 579
+G++ Y G E NT DV Y+ +++LFP++ +I RD + + D
Sbjct: 332 VGKYPYTGGPENTALNTIDVLLYALPGVMLLFPELAKNIVRDLSNRALKEDTPEYVLFSL 391
Query: 580 --PSKM------------------KLLDDGQWVSRKV----LGAVPHDIGICDPWFEVNA 615
P + KL D + + ++ G +PH I
Sbjct: 392 AFPENLMKYKEEVKKDPTISTDLKKLYDTIKKIIKETGKDPKGRMPHYIRHS-------- 443
Query: 616 YCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSV-----------------YVA 658
DT D+NP+FVL Y TGD++F +V+ Y+
Sbjct: 444 -LTVDTYERVDINPEFVLLYYLIAKYTGDREFLTSVYEVARNAIESIMRTQTLDGLPYLT 502
Query: 659 MAYMDQFDRDGDGMIEND-----------GFPDQTYDTWSVSGISAYSGGLWVAALQAAS 707
+ ++ R + M++ QT D WS G S Y LW+AAL+A +
Sbjct: 503 LPSGIEWMRHVNNMLKVSDAHKILGYHTLALSMQTLDDWSWLGFSPYVSFLWIAALEALN 562
Query: 708 ALAREVGDRGSEDYFLFKFQKAKV---VYEKLWNGSYFN--YDNSGSSQSSSIQADQLAG 762
++ + + L ++ ++ V + LWNG Y+ YD + + S+ A Q+ G
Sbjct: 563 EASKLLNN-------LHNYEVKELTEKVNKYLWNGEYYINWYDPISNLRDDSLNASQMTG 615
Query: 763 QWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREI 822
WY + GL +D +K +S + YN + G K G+ N + V + S+QS+
Sbjct: 616 DWYVQLLGLPEFLDHEKRKSIFSSIIKYNYTEEEGVKNGSSNKEITPLGVKL-SVQSKTP 674
Query: 823 WSGVTYAVAASMIHEDLADIGFQTACGIYE 852
WSG+ Y +A+ M + + + +YE
Sbjct: 675 WSGIEYYLASHMFYNGFDEYAKKILRNVYE 704
>gi|284174574|ref|ZP_06388543.1| hypothetical protein Ssol98_07941 [Sulfolobus solfataricus 98/2]
gi|356934758|gb|AET42959.1| hypothetical protein [Sulfolobus solfataricus 98/2]
Length = 803
Score = 96.7 bits (239), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 93/390 (23%), Positives = 159/390 (40%), Gaps = 83/390 (21%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHD--------- 579
+G++ Y G E NT DV Y+ +++LFP++ +I RD + + D
Sbjct: 332 VGKYPYTGGPENTALNTIDVLLYALPGVMLLFPELAKNIVRDLSNRALKEDTPEYVLFSL 391
Query: 580 --PSKM------------------KLLDDGQWVSRKV----LGAVPHDIGICDPWFEVNA 615
P + KL D + + ++ G +PH I
Sbjct: 392 AFPENLMKYKEEVKKDPTISTDLKKLYDTIKKIIKETGKDPKGRMPHYIRHS-------- 443
Query: 616 YCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSV-----------------YVA 658
DT D+NP+FVL Y TGD++F +V+ Y+
Sbjct: 444 -LTVDTYERVDINPEFVLLYYLIAKYTGDREFLTSVYEVARNAIESIMRTQTLDGLPYLT 502
Query: 659 MAYMDQFDRDGDGMIEND-----------GFPDQTYDTWSVSGISAYSGGLWVAALQAAS 707
+ ++ R + M++ QT D WS G S Y LW+AAL+A +
Sbjct: 503 LPSGIEWMRHVNNMLKVSDAHKILGYHTLALSMQTLDDWSWLGFSPYVSFLWIAALEALN 562
Query: 708 ALAREVGDRGSEDYFLFKFQKAKV---VYEKLWNGSYFN--YDNSGSSQSSSIQADQLAG 762
++ + + L ++ ++ V + LWNG Y+ YD + + S+ A Q+ G
Sbjct: 563 EASKLLNN-------LHNYEVKELTEKVNKYLWNGEYYINWYDPISNLRDDSLNASQMTG 615
Query: 763 QWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREI 822
WY + GL +D +K +S + YN + G K G+ N + V + S+QS+
Sbjct: 616 DWYVQLLGLPEFLDHEKRKSIFSSIIKYNYTEEEGVKNGSSNKEITPLGVKL-SVQSKTP 674
Query: 823 WSGVTYAVAASMIHEDLADIGFQTACGIYE 852
WSG+ Y +A+ M + + + +YE
Sbjct: 675 WSGIEYYLASHMFYNGFDEYAKKILRNVYE 704
>gi|149196161|ref|ZP_01873217.1| hypothetical protein LNTAR_23484 [Lentisphaera araneosa HTCC2155]
gi|149141008|gb|EDM29405.1| hypothetical protein LNTAR_23484 [Lentisphaera araneosa HTCC2155]
Length = 1052
Score = 96.3 bits (238), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 63/193 (32%), Positives = 96/193 (49%), Gaps = 15/193 (7%)
Query: 531 QFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPS--------- 581
+FL E +Y +N+ DV+FY SF+++ L P++ + R F A+M +
Sbjct: 519 KFLVRECSDYPFFNSLDVYFYGSFSMMWLLPQVDTNTMRCFRDAIMAENDELRRFYVYLE 578
Query: 582 --KMKLLDDGQWVSRKVLGAVPHDIGI-CDPWFEVNAYCLYDTARWKDLNPKFVLQVYRD 638
KL R GAV HD+G DP +AY ++ A WKDL PK++L + R+
Sbjct: 579 LPNAKLPHPKYEGPRARKGAVIHDLGSPFDP--RPDAYNWHNVAEWKDLAPKYILMLLRN 636
Query: 639 VVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
T D + W SV ++ Y+ +G + +G D T+D S GI+ Y G L
Sbjct: 637 YHFTKDVYLLEECWDSVQASLQYLKDMILEGHSIPLTNG-TDDTFDNLSSFGITLYCGSL 695
Query: 699 WVAALQAASALAR 711
WVA L+AA +A+
Sbjct: 696 WVAGLKAAGEIAK 708
>gi|385774246|ref|YP_005646813.1| bile acid beta-glucosidase [Sulfolobus islandicus HVE10/4]
gi|323478361|gb|ADX83599.1| bile acid beta-glucosidase [Sulfolobus islandicus HVE10/4]
Length = 818
Score = 95.9 bits (237), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 105/437 (24%), Positives = 179/437 (40%), Gaps = 80/437 (18%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPS------- 581
+G++ Y +G E NT DV Y+ +++LFP + +I +D + + D
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVLLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406
Query: 582 ---------KMKLLDDGQWVS--RKVLGAVP---HDIGICDPWFEVNAYCLY----DTAR 623
K +++ D + +K+ G + ++ G DP + Y Y DT
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIANETG-KDPKGRMPHYIRYSLTVDTYE 465
Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVW------------------------PSVYVAM 659
D+NP+FVL Y TGD++ K+V+ PS +
Sbjct: 466 RIDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPYLTLPSGIEWI 525
Query: 660 AYMDQFDRDGDGM----IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD 715
Y++ R D + QT D WS G S Y LW++AL+A + ++ + +
Sbjct: 526 RYVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLNN 585
Query: 716 RGSEDYFLFKFQKAKVVYEK----LWNGSYFN--YDNSGSSQSSSIQADQLAGQWYARAC 769
++Y + K + EK LWNG Y+ YD + + S A Q+ G WY +
Sbjct: 586 --PQNY------EVKELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQML 637
Query: 770 GLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYA 829
L +D ++ +S + YN G + G+ N + V + S+QS+ WSGV Y
Sbjct: 638 DLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLGVKL-SIQSKAPWSGVEYY 696
Query: 830 VAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWA 889
+A+ M + + + +YE + +F W YMRPL +
Sbjct: 697 LASHMFYSGFDEYAKKILRNVYERY----EIAGSFWNHIEWGAR-------YMRPLVALS 745
Query: 890 MQWALTRPKPKTLEKQM 906
M +A+ K L ++
Sbjct: 746 MIYAIEGMKVNMLNNEV 762
>gi|385776902|ref|YP_005649470.1| bile acid beta-glucosidase [Sulfolobus islandicus REY15A]
gi|323475650|gb|ADX86256.1| bile acid beta-glucosidase [Sulfolobus islandicus REY15A]
Length = 818
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 105/437 (24%), Positives = 179/437 (40%), Gaps = 80/437 (18%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPS------- 581
+G++ Y +G E NT DV Y+ +++LFP + +I +D + + D
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406
Query: 582 ---------KMKLLDDGQWVS--RKVLGAVP---HDIGICDPWFEVNAYCLY----DTAR 623
K +++ D + +K+ G + ++ G DP + Y Y DT
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIANETG-KDPKGRMPHYIRYSLTVDTYE 465
Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVW------------------------PSVYVAM 659
D+NP+FVL Y TGD++ K+V+ PS +
Sbjct: 466 RIDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPYLTLPSGIEWI 525
Query: 660 AYMDQFDRDGDGM----IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD 715
Y++ R D + QT D WS G S Y LW++AL+A + ++ + +
Sbjct: 526 RYVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLNN 585
Query: 716 RGSEDYFLFKFQKAKVVYEK----LWNGSYFN--YDNSGSSQSSSIQADQLAGQWYARAC 769
++Y + K + EK LWNG Y+ YD + + S A Q+ G WY +
Sbjct: 586 --PQNY------EVKELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQML 637
Query: 770 GLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYA 829
L +D ++ +S + YN G + G+ N + V + S+QS+ WSGV Y
Sbjct: 638 DLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLGVKL-SIQSKAPWSGVEYY 696
Query: 830 VAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWA 889
+A+ M + + + +YE + +F W YMRPL +
Sbjct: 697 LASHMFYSGFDEYAKKILRNVYERY----EIAGSFWNHIEWGAR-------YMRPLVALS 745
Query: 890 MQWALTRPKPKTLEKQM 906
M +A+ K L ++
Sbjct: 746 MIYAIEGMKVNMLNNEV 762
>gi|284998829|ref|YP_003420597.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
gi|284446725|gb|ADB88227.1| hypothetical protein LD85_2606 [Sulfolobus islandicus L.D.8.5]
Length = 803
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 104/436 (23%), Positives = 175/436 (40%), Gaps = 78/436 (17%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPS------- 581
+G++ Y +G E NT DV Y+ +++LFP + +I +D + + D
Sbjct: 332 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 391
Query: 582 ---------KMKLLDDGQWVS--RKVLGAVPHDIGIC--DPWFEVNAYCLY----DTARW 624
K +++ D + +K+ G + + DP + Y Y DT
Sbjct: 392 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIVNETGKDPKGRMPHYIRYSLTVDTYER 451
Query: 625 KDLNPKFVLQVYRDVVATGDKKFAKAVW------------------------PSVYVAMA 660
D+NP+FVL Y TGD++ K+V+ PS +
Sbjct: 452 IDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPYLTLPSGIEWIR 511
Query: 661 YMDQFDRDGDGM----IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR 716
Y++ R D + QT D WS G S Y LW++AL+A + ++ + +
Sbjct: 512 YVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLSN- 570
Query: 717 GSEDYFLFKFQKAKVVYEK----LWNGSYFN--YDNSGSSQSSSIQADQLAGQWYARACG 770
++Y + K + EK LWNG Y+ YD + + S A Q+ G WY +
Sbjct: 571 -PQNY------EVKELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQMLD 623
Query: 771 LLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAV 830
L +D ++ +S + YN G + G+ N + V + S+QS+ WSGV Y +
Sbjct: 624 LPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLGVKL-SIQSKAPWSGVEYYL 682
Query: 831 AASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
A M + + + +YE + F W YMRPL +M
Sbjct: 683 ALHMFYSGFDEYAKKILRNVYERY----EIAGNFWNHIEWGAR-------YMRPLVALSM 731
Query: 891 QWALTRPKPKTLEKQM 906
+A+ K L ++
Sbjct: 732 IYAIEGMKVNMLNNEV 747
>gi|227831316|ref|YP_002833096.1| hypothetical protein LS215_2489 [Sulfolobus islandicus L.S.2.15]
gi|229580221|ref|YP_002838621.1| hypothetical protein YG5714_2457 [Sulfolobus islandicus Y.G.57.14]
gi|227457764|gb|ACP36451.1| protein of unknown function DUF608 [Sulfolobus islandicus L.S.2.15]
gi|228010937|gb|ACP46699.1| protein of unknown function DUF608 [Sulfolobus islandicus
Y.G.57.14]
Length = 818
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 104/436 (23%), Positives = 175/436 (40%), Gaps = 78/436 (17%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPS------- 581
+G++ Y +G E NT DV Y+ +++LFP + +I +D + + D
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406
Query: 582 ---------KMKLLDDGQWVS--RKVLGAVPHDIGIC--DPWFEVNAYCLY----DTARW 624
K +++ D + +K+ G + + DP + Y Y DT
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIVNETGKDPKGRMPHYIRYSLTVDTYER 466
Query: 625 KDLNPKFVLQVYRDVVATGDKKFAKAVW------------------------PSVYVAMA 660
D+NP+FVL Y TGD++ K+V+ PS +
Sbjct: 467 IDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPYLTLPSGIEWIR 526
Query: 661 YMDQFDRDGDGM----IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR 716
Y++ R D + QT D WS G S Y LW++AL+A + ++ + +
Sbjct: 527 YVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLSN- 585
Query: 717 GSEDYFLFKFQKAKVVYEK----LWNGSYFN--YDNSGSSQSSSIQADQLAGQWYARACG 770
++Y + K + EK LWNG Y+ YD + + S A Q+ G WY +
Sbjct: 586 -PQNY------EVKELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQMLD 638
Query: 771 LLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAV 830
L +D ++ +S + YN G + G+ N + V + S+QS+ WSGV Y +
Sbjct: 639 LPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLGVKL-SIQSKAPWSGVEYYL 697
Query: 831 AASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
A M + + + +YE + F W YMRPL +M
Sbjct: 698 ALHMFYSGFDEYAKKILRNVYERY----EIAGNFWNHIEWGAR-------YMRPLVALSM 746
Query: 891 QWALTRPKPKTLEKQM 906
+A+ K L ++
Sbjct: 747 IYAIEGMKVNMLNNEV 762
>gi|227829616|ref|YP_002831395.1| hypothetical protein LS215_0672 [Sulfolobus islandicus L.S.2.15]
gi|227456063|gb|ACP34750.1| protein of unknown function DUF608 [Sulfolobus islandicus L.S.2.15]
Length = 661
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 71/250 (28%), Positives = 121/250 (48%), Gaps = 16/250 (6%)
Query: 598 GAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYV 657
G VPHD+G ++ +WKD+NP +L VYR T D F K V+P++
Sbjct: 378 GYVPHDLGFHSLDSPIDGTT--SPPKWKDMNPSLILLVYRYFKFTNDIDFLKEVYPTIVK 435
Query: 658 AMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
M + + RDG +E G D +D + G +Y+ L++A+L A +A+ VGD
Sbjct: 436 VMDWELRQCRDGLPFME--GEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSN 493
Query: 718 SEDYFLFKFQKAKVVYEKLWNGSYFN-YDNSGSSQSSSIQADQLAGQWYARACGLLPIVD 776
+ K A+ + K++NG YF +D G ++S + QL G+W+ L IVD
Sbjct: 494 YVGFINEKLNVAREAFRKMFNGKYFKAWD--GVDKASFLA--QLYGEWFTTLLELENIVD 549
Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD--GRVDMSSMQSREIWSGVTYAVAASM 834
E+ +SAL+ +++++ G L D G++ S+Q+ W + +A+
Sbjct: 550 ENMIKSALE-----SIIRLNGNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLA 604
Query: 835 IHEDLADIGF 844
+ + D+ F
Sbjct: 605 YKKGVGDLSF 614
>gi|160934323|ref|ZP_02081710.1| hypothetical protein CLOLEP_03194 [Clostridium leptum DSM 753]
gi|156866996|gb|EDO60368.1| hypothetical protein CLOLEP_03194 [Clostridium leptum DSM 753]
Length = 806
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 168/796 (21%), Positives = 294/796 (36%), Gaps = 151/796 (18%)
Query: 99 LGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEV 158
LGG+G+G++ RG + W+IF + + F+++ N Q +L +
Sbjct: 22 LGGIGTGNVSVGTRGNYCDWEIFGQPGKGIHFPYTFFALWAKSKNTQPIVRILESRIQPP 81
Query: 159 LKDTTAAGIGSWDWNLKG----DKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
T + G S D L G ++S++ A YP E P L + +P IP N
Sbjct: 82 Y--TCSHGFISAD--LAGLPRFERSSFSAAYPFVNVALEDSKLP-LSVELEAFTPFIPLN 136
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVL 274
K+S P +V Y + N+ + + T+ + N+VG F G + V
Sbjct: 137 AKDSGIPGAVIRYKVKNNSGEALEATVASSVANAVG----FNGYDLFDNLLVKRPVKNQY 192
Query: 275 LHHRTSHQLPPVTFALAAQE-TDGVHVSLCPHFVISGNSLGLTA------KDMWHEIKEH 327
L L + LA + T+G + ++ L+ D W + E
Sbjct: 193 LDQDQMKGLLYTSPELAEDDFTNGSIAVMTGEDSVTVKEEWLSGAWWDGIHDFWDDFSED 252
Query: 328 GSFD---RLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKT 384
G + E+ IG+ + T+P E F L W
Sbjct: 253 GRLEFHSEFQGKESKFNDLSDLRIGS-LGVCKTIPAYGEAVFEFILTW------------ 299
Query: 385 YYRRYTKFYGTHQNAAANIARDAILEHGSW-ELQIEAWQRPILEDKRLPEWYPITLFNEL 443
Y+ K++G H ++ +EH + + Q AW Y + L+ L
Sbjct: 300 YFPNRPKYWGGHVCPENRNKKE--IEHNYYADFQKNAWD------------YGVYLYKNL 345
Query: 444 YYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSI 503
L R F+ +S L V +E +++
Sbjct: 346 ESLEKAS------------------RSFTKALYESTLPDYV------------IEALANN 375
Query: 504 LEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKI 563
+ + +P S GT L +G + Q EG +WN Y+ A LFP++
Sbjct: 376 ITVLRSPTCFRISDGTFLSWEG--SFPQHGSCEGNCTHVWN------YAQTA-AFLFPEL 426
Query: 564 QLSIQRDFAAAVMMHDPSKM-----KLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCL 618
+ S++R + D +M ++ D +W +++ AV +G
Sbjct: 427 EQSMRRT-EFLLETDDTGRMAFRTLQVFGDKKW---EMIPAVDGQMGT------------ 470
Query: 619 YDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIENDG 677
+L++YRD K W +A+ + + D D D +++
Sbjct: 471 -------------ILRLYRDWKLGAGDALLKETWEKAALALDFAFTYWDSDKDCVLD--- 514
Query: 678 FPDQTYDTWSVS--GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK 735
DQ ++T+ + G ++ S L++AAL+A +A +GD + + F+K +
Sbjct: 515 --DQQHNTYDIEFYGPNSLSNSLFLAALKAGKEMAAYLGDTAHAEKYAEAFRKGSEKMDS 572
Query: 736 L-WNGSYF-----NYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
L W G Y+ + + + ADQ+ GQ A L ++ E+ + A+ V+
Sbjct: 573 LLWGGDYYIQVIDDINAYKYQYGTGCLADQVFGQMLAHLNHLGYVLPEEHIKKAIHSVFQ 632
Query: 790 YNVLKVM------------GGKRGAVNGMLPDG-RVDMSSMQSREIWSGVTYAVAASMIH 836
YN + M ++G V P G R + + S E+WSG+ Y VAA +I
Sbjct: 633 YNFKETMEDHVNVQRTYALNQEKGLVLCSWPQGGRPKLPFVYSDEVWSGIEYQVAAHLIF 692
Query: 837 EDLADIGFQTACGIYE 852
E + G + E
Sbjct: 693 EGFVEEGLAVVKAVRE 708
>gi|229581136|ref|YP_002839535.1| hypothetical protein YN1551_0454 [Sulfolobus islandicus Y.N.15.51]
gi|228011852|gb|ACP47613.1| protein of unknown function DUF608 [Sulfolobus islandicus
Y.N.15.51]
Length = 816
Score = 95.1 bits (235), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 104/436 (23%), Positives = 175/436 (40%), Gaps = 78/436 (17%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPS------- 581
+G++ Y +G E NT DV Y+ +++LFP + +I +D + + D
Sbjct: 345 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 404
Query: 582 ---------KMKLLDDGQWVS--RKVLGAVPHDIGIC--DPWFEVNAYCLY----DTARW 624
K +++ D + +K+ G + + DP + Y Y DT
Sbjct: 405 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIVNETGKDPKGRMPHYIRYSLTVDTYER 464
Query: 625 KDLNPKFVLQVYRDVVATGDKKFAKAVW------------------------PSVYVAMA 660
D+NP+FVL Y TGD++ K+V+ PS +
Sbjct: 465 IDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPYLTLPSGIEWIR 524
Query: 661 YMDQFDRDGDGM----IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR 716
Y++ R D + QT D WS G S Y LW++AL+A + ++ + +
Sbjct: 525 YVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLSN- 583
Query: 717 GSEDYFLFKFQKAKVVYEK----LWNGSYFN--YDNSGSSQSSSIQADQLAGQWYARACG 770
++Y + K + EK LWNG Y+ YD + + S A Q+ G WY +
Sbjct: 584 -PQNY------EVKELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQMLD 636
Query: 771 LLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAV 830
L +D ++ +S + YN G + G+ N + V + S+QS+ WSGV Y +
Sbjct: 637 LPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLGVKL-SIQSKAPWSGVEYYL 695
Query: 831 AASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
A M + + + +YE + F W YMRPL +M
Sbjct: 696 ALHMFYSGFDEYAKKILRNVYERY----EIAGNFWNHIEWGAR-------YMRPLVALSM 744
Query: 891 QWALTRPKPKTLEKQM 906
+A+ K L ++
Sbjct: 745 IYAIEGMKVNMLNNEV 760
>gi|15898746|ref|NP_343351.1| hypothetical protein SSO1948 [Sulfolobus solfataricus P2]
gi|229578524|ref|YP_002836922.1| hypothetical protein YG5714_0710 [Sulfolobus islandicus Y.G.57.14]
gi|229582969|ref|YP_002841368.1| hypothetical protein YN1551_2495 [Sulfolobus islandicus Y.N.15.51]
gi|284173504|ref|ZP_06387473.1| hypothetical protein Ssol98_02460 [Sulfolobus solfataricus 98/2]
gi|384435081|ref|YP_005644439.1| hypothetical protein [Sulfolobus solfataricus 98/2]
gi|13815223|gb|AAK42141.1| Hypothetical protein SSO1948 [Sulfolobus solfataricus P2]
gi|228009238|gb|ACP45000.1| protein of unknown function DUF608 [Sulfolobus islandicus
Y.G.57.14]
gi|228013685|gb|ACP49446.1| protein of unknown function DUF608 [Sulfolobus islandicus
Y.N.15.51]
gi|261603235|gb|ACX92838.1| protein of unknown function DUF608 [Sulfolobus solfataricus 98/2]
Length = 661
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 70/248 (28%), Positives = 118/248 (47%), Gaps = 12/248 (4%)
Query: 598 GAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYV 657
G VPHD+G ++ +WKD+NP +L VYR T D F K V+P++
Sbjct: 378 GYVPHDLGFHSLDSPIDGTT--SPPKWKDMNPSLILLVYRYFKFTNDIDFLKEVYPTIVK 435
Query: 658 AMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
M + + RDG +E G D +D + G +Y+ L++A+L A +A+ VGD
Sbjct: 436 VMDWELRQCRDGLPFME--GEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSN 493
Query: 718 SEDYFLFKFQKAKVVYEKLWNGSYFN-YDNSGSSQSSSIQADQLAGQWYARACGLLPIVD 776
+ K A+ + K++NG YF +D G ++S + QL G+W+ L IVD
Sbjct: 494 YVGFINEKLNVAREAFRKMFNGKYFKAWD--GVDKASFLA--QLYGEWFTTLLELENIVD 549
Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
E+ +SAL+ + N N + +G++ S+Q+ W + +A+
Sbjct: 550 ENMIKSALESIIRLN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYK 606
Query: 837 EDLADIGF 844
+ + D+ F
Sbjct: 607 KGVGDLSF 614
>gi|227828566|ref|YP_002830346.1| hypothetical protein M1425_2317 [Sulfolobus islandicus M.14.25]
gi|227460362|gb|ACP39048.1| protein of unknown function DUF608 [Sulfolobus islandicus M.14.25]
Length = 818
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 107/440 (24%), Positives = 177/440 (40%), Gaps = 86/440 (19%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPS------- 581
+G++ Y +G E NT DV Y+ +++LFP + +I +D + + D
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406
Query: 582 ---------KMKLLDDGQWVS--RKVLGAVP---HDIGICDPWFEVNAYCLY----DTAR 623
K +++ D + +K+ G + ++ G DP + Y Y DT
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIANETG-KDPKGRMPHYIRYSLTVDTYE 465
Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVW------------------------PSVYVAM 659
D+NP+FVL Y TGD++ K+V+ PS +
Sbjct: 466 RIDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPYLTLPSGIEWI 525
Query: 660 AYMDQFDRDGDGM----IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD 715
Y++ R D + QT D WS G S Y LW++AL+A + ++ + +
Sbjct: 526 RYVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLNN 585
Query: 716 RGSEDYFLFKFQKAKVVYEK----LWNGSYFN--YDNSGSSQSSSIQADQLAGQWYARAC 769
++Y + K + EK LWNG Y+ YD + + S A Q+ G WY +
Sbjct: 586 --PQNY------EVKELIEKVDKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQML 637
Query: 770 GLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMS---SMQSREIWSGV 826
L +D ++ +S + YN G+ G NG D + S+QS+ WSGV
Sbjct: 638 DLPEFLDYERRKSVFSSIMKYN----YSGEEGVRNGSSNDDITPLGIKLSIQSKAPWSGV 693
Query: 827 TYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLA 886
Y +A+ M + + +YE + F W YMRPL
Sbjct: 694 EYYLASHMFSSGFDEYAKKILRNVYERY----EIAGNFWNHIEWGAR-------YMRPLV 742
Query: 887 IWAMQWALTRPKPKTLEKQM 906
+M +A+ K L K++
Sbjct: 743 ALSMIYAIEGMKVNMLNKEV 762
>gi|229585796|ref|YP_002844298.1| hypothetical protein M1627_2396 [Sulfolobus islandicus M.16.27]
gi|228020846|gb|ACP56253.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.27]
Length = 818
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 107/440 (24%), Positives = 177/440 (40%), Gaps = 86/440 (19%)
Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPS------- 581
+G++ Y +G E NT DV Y+ +++LFP + +I +D + + D
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406
Query: 582 ---------KMKLLDDGQWVS--RKVLGAVP---HDIGICDPWFEVNAYCLY----DTAR 623
K +++ D + +K+ G + ++ G DP + Y Y DT
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIANETG-KDPKGRMPHYIRYSLTVDTYE 465
Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVW------------------------PSVYVAM 659
D+NP+FVL Y TGD++ K+V+ PS +
Sbjct: 466 RIDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPYLTLPSGIEWI 525
Query: 660 AYMDQFDRDGDGM----IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD 715
Y++ R D + QT D WS G S Y LW++AL+A + ++ + +
Sbjct: 526 RYVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLNN 585
Query: 716 RGSEDYFLFKFQKAKVVYEK----LWNGSYFN--YDNSGSSQSSSIQADQLAGQWYARAC 769
++Y + K + EK LWNG Y+ YD + + S A Q+ G WY +
Sbjct: 586 --PQNY------EVKELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQML 637
Query: 770 GLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMS---SMQSREIWSGV 826
L +D ++ +S + YN G+ G NG D + S+QS+ WSGV
Sbjct: 638 DLPEFLDYERRKSVFSSIMKYN----YSGEEGVRNGSSNDDITPLGIKLSIQSKAPWSGV 693
Query: 827 TYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLA 886
Y +A+ M + + +YE + F W YMRPL
Sbjct: 694 EYYLASHMFSSGFDEYAKKILRNVYERY----EIAGNFWNHIEWGAR-------YMRPLV 742
Query: 887 IWAMQWALTRPKPKTLEKQM 906
+M +A+ K L K++
Sbjct: 743 ALSMIYAIEGMKVNMLNKEV 762
>gi|385775519|ref|YP_005648087.1| hypothetical protein [Sulfolobus islandicus REY15A]
gi|323474267|gb|ADX84873.1| conserved hypothetical protein [Sulfolobus islandicus REY15A]
Length = 661
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 76/295 (25%), Positives = 132/295 (44%), Gaps = 31/295 (10%)
Query: 551 YSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPW 610
+ S +I++FP+++ S + + V G VPHD+G
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYVRND-------------------GYVPHDLGFHSLD 390
Query: 611 FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGD 670
++ +WKD+NP VL VYR T D F K V+P++ M + + RDG
Sbjct: 391 SPIDGTT--SPPKWKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQCRDGL 448
Query: 671 GMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAK 730
+E G D +D + G +Y+ L++A+L A +A+ VGD + K A+
Sbjct: 449 PFME--GEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLSVAR 506
Query: 731 VVYEKLWNGSYFN-YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
+ K++NG YF +D G ++S + QL G+W+ L I DE+ +SAL+ +
Sbjct: 507 EAFRKMFNGKYFKAWD--GVDKASFLA--QLYGEWFTTLLELENIADENMIKSALESIIR 562
Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
N N + +G++ S+Q+ W + +A+ + + D+ F
Sbjct: 563 LN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYKKGVGDLSF 614
>gi|15898194|ref|NP_342799.1| hypothetical protein SSO1353 [Sulfolobus solfataricus P2]
gi|13814563|gb|AAK41589.1| Hypothetical protein SSO1353 [Sulfolobus solfataricus P2]
Length = 663
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 78/297 (26%), Positives = 137/297 (46%), Gaps = 35/297 (11%)
Query: 551 YSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPW 610
+ S +I++FP+++ + F ++ H + +DG VPHD+G
Sbjct: 352 FGSLPVILMFPELE----KSFLKLLIRH------IREDGY---------VPHDLGYHSLD 392
Query: 611 FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGD 670
++ RWKD+NP +L VYR T D +F K V+P + + MD R
Sbjct: 393 SPIDGTT--SPPRWKDMNPSLILLVYRYFKFTNDIEFLKEVYP---ILVKVMDWELRQCK 447
Query: 671 GMIE-NDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA 729
G + +G D +D + G +Y+ L++ +L A +A+ VGD D+ K A
Sbjct: 448 GNLPFMEGEMDNAFDATIIKGHDSYTSSLFIGSLIAMREIAKLVGDSNYVDFISEKLSSA 507
Query: 730 KVVYEKLWNGSYFN-YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVY 788
+ + +++NG YF +D S ++S A QL G+W+ GL IV+ED + AL+ +
Sbjct: 508 REAFRRMFNGRYFKAWD---SVDNASFLA-QLYGEWFTTLVGLEDIVEEDIIKKALESII 563
Query: 789 NYNVLKVMGGKRGAVNGMLPD-GRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
N V ++ D G++ S+Q+ W + +A+ + + D+ F
Sbjct: 564 RLN----GNASPHCVPNLVDDNGKIVGLSVQTYSSWPRMVFAICWLAYKKGVGDLSF 616
>gi|302348409|ref|YP_003816047.1| trehalase [Acidilobus saccharovorans 345-15]
gi|302328821|gb|ADL19016.1| Putative trehalase [Acidilobus saccharovorans 345-15]
Length = 928
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 93/365 (25%), Positives = 161/365 (44%), Gaps = 69/365 (18%)
Query: 547 DVHFYSSFALIMLFPKIQLSIQRDFAAA--VMMHDPSKMK--LLDDGQWVSRKVLGAVPH 602
D H+Y + L L +S+ R+ + +DP+K + + + + G VPH
Sbjct: 433 DKHYY--YELFALGFHENMSLYREMLRKDPSIQNDPAKFSEAIAEVVRRTGKDPTGRVPH 490
Query: 603 DIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAM-AY 661
++ L D+ DL P+F+L ATGD+++ +++W V +
Sbjct: 491 SF--------RASFNLIDSYDRNDLMPEFLLIALLPYYATGDREYLRSLWNDVLTVIDGT 542
Query: 662 MDQFDRDGDGMIEN----------------------DGF------------PD------Q 681
Q D G G+IE+ GF P+
Sbjct: 543 RRQHDSRGTGLIEHYMPSDYEGLSRVAQDAAEKGLVPGFYGGNVVKLLFSGPEFVMNSVN 602
Query: 682 TYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSY 741
T+DT+S+ G+++++G LW A+L+AA+ A G +E+ + + + + LWNGSY
Sbjct: 603 TFDTFSLLGVASFTGDLWAASLRAAAEAASREGLARAEELRQLSLRAYESMVKLLWNGSY 662
Query: 742 FN--YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGK 799
F+ YD + +I + QL G+WY + GL VD +K RSAL++VY N + G
Sbjct: 663 FDDWYDPESGLRDRAILSAQLTGEWYLQVLGLGDGVDSEKVRSALREVYRANFRRWEGLL 722
Query: 800 RGAVNGM----------LPDGR--VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTA 847
G G P+G ++ S Q+ W+GV + VA+ M++E + + +
Sbjct: 723 NGTYPGTPRPSMVGDVEEPNGTKILNRVSSQADTPWTGVEFGVASQMLYEGMVEEAMEVL 782
Query: 848 CGIYE 852
I++
Sbjct: 783 RSIHD 787
>gi|402589212|gb|EJW83144.1| hypothetical protein WUBG_05948, partial [Wuchereria bancrofti]
Length = 174
Score = 93.6 bits (231), Expect = 4e-16, Method: Composition-based stats.
Identities = 57/159 (35%), Positives = 84/159 (52%), Gaps = 14/159 (8%)
Query: 97 VPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDK--PVLANQFSVFVSRSNGQKYSSVLCPK 154
VP G +GSGSIGR +RG F ++ + P + E K V ANQF+ + Y VLC +
Sbjct: 1 VPCGTIGSGSIGRDFRGGFCKFGLRPGITEQKIGVVKANQFNP--NNLWETVYQKVLCAR 58
Query: 155 TPEVLKDTTAAGIG---SWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPII 211
+ + ++G G SWD++ DK Y LYPR WT + P+ + +Q+SP+I
Sbjct: 59 S------SVSSGRGELTSWDFSFPPDKLIYRGLYPRFWTYYS-IPEFNFILCIQQVSPVI 111
Query: 212 PHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG 250
P+NY+ES PV++F N + FT+ N G
Sbjct: 112 PNNYEESLLPVTLFIIDAENQSSVDLQAAVAFTFRNGTG 150
>gi|385772401|ref|YP_005644967.1| hypothetical protein [Sulfolobus islandicus HVE10/4]
gi|323476515|gb|ADX81753.1| conserved hypothetical protein [Sulfolobus islandicus HVE10/4]
Length = 661
Score = 93.2 bits (230), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 75/295 (25%), Positives = 132/295 (44%), Gaps = 31/295 (10%)
Query: 551 YSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPW 610
+ S +I++FP+++ S + + V G VPHD+G
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYVRND-------------------GYVPHDLGFHSLD 390
Query: 611 FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGD 670
++ +WKD+NP VL VYR T D F K V+P++ M + + RDG
Sbjct: 391 SPIDGTT--SPPKWKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQCRDGL 448
Query: 671 GMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAK 730
+E G D +D + G +Y+ L++A+L A +A+ VGD + K A+
Sbjct: 449 PFME--GEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLSVAR 506
Query: 731 VVYEKLWNGSYFN-YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
+ K++NG YF +D G ++S + QL G+W+ L I +E+ +SAL+ +
Sbjct: 507 EAFRKMFNGKYFKAWD--GVDKASFLA--QLYGEWFTTLLELENIANENMIKSALESIIR 562
Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
N N + +G++ S+Q+ W + +A+ + + D+ F
Sbjct: 563 LN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYKKGVGDLSF 614
>gi|297725625|ref|NP_001175176.1| Os07g0444000 [Oryza sativa Japonica Group]
gi|255677726|dbj|BAH93904.1| Os07g0444000 [Oryza sativa Japonica Group]
Length = 87
Score = 93.2 bits (230), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 44/77 (57%), Positives = 57/77 (74%), Gaps = 2/77 (2%)
Query: 63 IGVRILCLIREEAAKGKRAFIDPFIKR--HLTSSHGVPLGGVGSGSIGRSYRGEFQRWQI 120
+G+R+ IREEA+ G++A IDPF + ++S GVPLGG+GSGSI R +RGEF+ WQI
Sbjct: 3 LGLRLWSYIREEASHGRKAPIDPFTRESDKPSASQGVPLGGMGSGSISRGFRGEFKHWQI 62
Query: 121 FPRVCEDKPVLANQFSV 137
P CE PV+ANQFSV
Sbjct: 63 IPGSCEMSPVMANQFSV 79
>gi|374815490|ref|ZP_09719227.1| hypothetical protein TpriZ_16630 [Treponema primitia ZAS-1]
Length = 814
Score = 90.1 bits (222), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 87/323 (26%), Positives = 140/323 (43%), Gaps = 51/323 (15%)
Query: 551 YSSFALIMLFPKIQLSIQR-DFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDP 609
Y + A+ LFP+++ S++R DF + + + + S L AV +G
Sbjct: 421 YYAQAMAYLFPELEQSMRRTDFLTETNDEGEMQFRAMRELNGASWGFLPAVDGQMGT--- 477
Query: 610 WFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRD 668
++++YRD + +GD F K +WP +A+ Y + +D D
Sbjct: 478 ----------------------IVRLYRDWLISGDDAFLKELWPKAKLALDYGIRTWDTD 515
Query: 669 GDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK 728
GD ++ DG YD GI ++AAL+AA +A + D G+ + + +
Sbjct: 516 GDFVL--DGMKHVDYDV-EFYGIDPLGNLCYLAALKAAVKIAEYLKDEGAHNRYQTIYDT 572
Query: 729 AKVVYEK-LWN--GSYF-----NYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKA 780
A V +K +W+ YF + D + ADQL GQ+YA GL P+VD
Sbjct: 573 ASVQADKFMWDSKAGYFIQVLKDVDEFKYQHGTGCLADQLIGQFYAHLTGLGPLVDAGHI 632
Query: 781 RSALKKVYNYNVLKVM----GGKRG-AVN---GMLP-----DGRVDMSSMQSREIWSGVT 827
+ A + ++ YN + +RG AVN G+L GR S E WS
Sbjct: 633 KQAAQSIFKYNFISDFTDHPNMQRGYAVNDDKGLLMTTWPLGGRPRYPFFYSEEAWSRTE 692
Query: 828 YAVAASMIHEDLADIGFQTACGI 850
Y VA+++I+E D G A +
Sbjct: 693 YHVASTLIYEGFLDEGLTIAKAV 715
>gi|76156398|gb|AAX27603.2| SJCHGC03819 protein [Schistosoma japonicum]
Length = 192
Score = 89.7 bits (221), Expect = 7e-15, Method: Composition-based stats.
Identities = 62/157 (39%), Positives = 93/157 (59%), Gaps = 8/157 (5%)
Query: 71 IREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRV-CED-K 128
I++ + + FID +GVP+GG+G GSIGR +RGEF R + P + C D +
Sbjct: 34 IKKRYIEHRLPFIDAISHVPWRPIYGVPIGGIGCGSIGRGFRGEFCRSSLIPGIYCYDIQ 93
Query: 129 PVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRA 188
PV +QF V V ++N Y+ VL P T + G+ W W + + Y LYPR+
Sbjct: 94 PV--DQFIVTVRKNNVIIYNQVLSPLTK---PPSNGKGLRKWIWGFRPENGYYIGLYPRS 148
Query: 189 WTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVF 225
WTV+E P+ +L +VC+QISP+IPH+Y+ + PV++F
Sbjct: 149 WTVYE-IPELQLTLVCQQISPVIPHDYQVTCLPVAIF 184
>gi|327311335|ref|YP_004338232.1| hypothetical protein TUZN_1447 [Thermoproteus uzoniensis 768-20]
gi|326947814|gb|AEA12920.1| hypothetical protein TUZN_1447 [Thermoproteus uzoniensis 768-20]
Length = 892
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 74/228 (32%), Positives = 104/228 (45%), Gaps = 35/228 (15%)
Query: 681 QTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGS 740
QT+DTW+ G+S+Y LW AAL+A A+ G R E Y ++A E+LWNG
Sbjct: 576 QTFDTWAFYGVSSYVLFLWAAALKAMVEGAKLTG-RSPEKYAEL-LRRAFEGLERLWNGE 633
Query: 741 YFN--YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGG 798
YF+ +D + + A QL GQ A L + D+ + SAL+ V YN +
Sbjct: 634 YFDLWWDPVTGERDRASMAAQLFGQLLAHVADLGYLSDKQRVISALRAVAKYN----LAP 689
Query: 799 KRGAVNGMLPDGRVD----------------MSSMQSREIWSGVTYAVAASMIHEDLADI 842
G +NGM PD R + + Q W+GV YAVA M +E L +
Sbjct: 690 DEGLINGMYPDRRRPSFVGPTLYENFTRGPYLPTWQMDTPWTGVEYAVAGHMFYEGLVEE 749
Query: 843 GFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
G ++E G G+ + E W T YMRPL+ WA+
Sbjct: 750 GTAVLKALHERYERG---GHYWNHIE-WGTH-------YMRPLSAWAV 786
>gi|269925332|ref|YP_003321955.1| hypothetical protein Tter_0211 [Thermobaculum terrenum ATCC
BAA-798]
gi|269788992|gb|ACZ41133.1| protein of unknown function DUF608 [Thermobaculum terrenum ATCC
BAA-798]
Length = 811
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 69/229 (30%), Positives = 107/229 (46%), Gaps = 22/229 (9%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIENDGFPDQTYDTWSVSG 690
V++VYR+ +GD F K +WP V A+ Y + D+D D ++E G TYD G
Sbjct: 471 VIRVYREWKFSGDDAFLKELWPKVKSAVNYSSLYWDKDQDFILE--GRQHNTYDI-EFYG 527
Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF-----NYD 745
+ +G L++ AL+AA +A+ +G + Y +K + + LWNG Y+ + +
Sbjct: 528 PNPLTGILFLGALRAAEEMAKYLGSESASSYAQAFEASSKKLDQLLWNGEYYIQKLDDPN 587
Query: 746 NSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNV------------L 793
+S +DQL GQ A GL ++ ++ AL+ ++ YN +
Sbjct: 588 EHRYQHASGCLSDQLFGQTLASLTGLGYLLPKEHISRALESIFAYNFKPNFWNHTNTQRV 647
Query: 794 KVMGGKRGAVNGMLPDG-RVDMSSMQSREIWSGVTYAVAASMIHEDLAD 841
+G G V P G R S E+WSG Y VA MI+E L D
Sbjct: 648 YALGDDAGLVMCTWPFGDRPSFPFPYSDEVWSGTEYQVATLMIYEGLLD 696
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 67/295 (22%), Positives = 115/295 (38%), Gaps = 23/295 (7%)
Query: 93 SSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVL- 151
S+ PLGG+G+G++ RGE + W+IF R + + F+++ + VL
Sbjct: 22 SAVAFPLGGIGTGNVSLGARGELRDWEIFNRPAKGFSLPNTYFALWARPEGESPVTKVLQ 81
Query: 152 -CPKTPEV----LKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQ 206
P P T+ G+ + L + YP AW E + D +++
Sbjct: 82 SLPPPPHTQSHGYHPTSGLGLPHFTSTL------FRGEYPIAWVYFE-DSDLPIKVTLES 134
Query: 207 ISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG--DSEFTGQHYNSKT 264
+P +P N ++S P + Y + N D+ + + TN VG F G
Sbjct: 135 FTPFVPLNAEDSGIPGAYLIYRVENISPKRIDVAIAASITNPVGEVRRDRFGGFSVERGG 194
Query: 265 KMND----GVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDM 320
ND G L S+ L + + + T HV+ ++ S + + ++
Sbjct: 195 NANDFRRQGRILGLFLRSNSYPLYDLRYGDLSLVTMNSHVTFKRVWLPSRWACD-SVEEF 253
Query: 321 WHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCP 375
W + G +L + SE G + +IAA T+ P F L+W P
Sbjct: 254 WRDFSTDG---KLEDLGIEDPSEEGQTYTGSIAAMETIAPGEVKDFVFILSWFFP 305
>gi|10440618|gb|AAG16856.1|AC069145_5 hypothetical protein [Oryza sativa Japonica Group]
Length = 143
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 65/118 (55%), Gaps = 21/118 (17%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
D P W+R+L++ L +F++ + E +++ +A IDPF
Sbjct: 31 DGGSPPEQAWRRRLNSHANLLKEFSVTFMEAMRM-------------------KAPIDPF 71
Query: 87 IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRS 142
K ++S GVPLGG+GSGSI R +RGEF+ W I P +CE PV+ NQFSV +S S
Sbjct: 72 TKEKCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCETSPVMENQFSVMLSSS 129
>gi|297741933|emb|CBI33368.3| unnamed protein product [Vitis vinifera]
Length = 108
Score = 82.8 bits (203), Expect = 8e-13, Method: Composition-based stats.
Identities = 35/56 (62%), Positives = 39/56 (69%)
Query: 843 GFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
F TA GI+ A WS G GY FQTPE W D +RSL YMRPLAIW MQWAL+ P+
Sbjct: 5 AFTTAEGIFTAGWSEEGYGYWFQTPEGWTIDGHFRSLIYMRPLAIWGMQWALSMPR 60
>gi|160931643|ref|ZP_02079037.1| hypothetical protein CLOLEP_00474 [Clostridium leptum DSM 753]
gi|156869288|gb|EDO62660.1| hypothetical protein CLOLEP_00474 [Clostridium leptum DSM 753]
Length = 804
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 65/235 (27%), Positives = 106/235 (45%), Gaps = 24/235 (10%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIENDGFPDQTYDTWSVSG 690
+L+VYR+ +GD F K +W V A+ + + D + D ++ DG TYD G
Sbjct: 477 ILRVYREWKISGDDDFLKKIWNQVVSALEFSASYWDSNQDCVL--DGQQHNTYDI-EFYG 533
Query: 691 ISAYSGGLWVAALQAASALAREVGD-RGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGS 749
+++ ++ AAL+A + +A +G+ S+ + + K + E L+NG Y+ G
Sbjct: 534 VNSLGNSIYYAALKAGAEMAEYLGEHERSQKWRSMEQAGCKRMDEMLFNGEYYRQVTDGD 593
Query: 750 SQSSSIQ------ADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAV 803
Q +DQL GQ A GL + ED +SA+ +Y YN + MG +
Sbjct: 594 IDEYKYQYGEGCLSDQLLGQTLAHLYGLGHLFPEDHVKSAVFAIYKYNFKERMGSHKSLQ 653
Query: 804 NGML-------------PDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQ 845
G GR + S E+W+G+ Y VAA +I+E G +
Sbjct: 654 RGYAYQDEPGLLLCSWPSGGRPKQPFVYSDEVWTGIEYQVAAGLIYEGFLQEGLE 708
Score = 53.1 bits (126), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 116/291 (39%), Gaps = 21/291 (7%)
Query: 99 LGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEV 158
LGG+G+G+I + RGE ++IF R + + + F+++ S +G+ + VL K P+
Sbjct: 24 LGGIGTGNISLNARGELCDFEIFNRQSKGLKLPYSFFALWYSLPSGESDALVLEAK-PQG 82
Query: 159 LKDTTAAGIGSWDWNL-KGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKE 217
L D S L + D S + + YP A + + P + + +P IP +
Sbjct: 83 LCDAPMGVAPSLVPGLPRFDSSCFSSAYPFANIQLKKDGLP-FEVTLQAYTPFIPLDDVN 141
Query: 218 SSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFT---------GQHYNSKTKMND 268
S P F Y + N + T N+ G DS T G+ N K N
Sbjct: 142 SGIPGIRFVYRLKNLLDQPVKAAVSATMPNACGFDSFSTDGLNKLYLKGRPENRVVKKNG 201
Query: 269 GVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHG 328
QL T AL E +G S P ++ G G A++ W ++ E G
Sbjct: 202 LQGICYSTDLEKEQLGFGTMALLTDEQEG--FSAKPRWLNEGWWDG--AEEYWQDLMEDG 257
Query: 329 SFDRLNSMETSVT----SEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCP 375
+ S V+ S S+IG +IA + P F L W P
Sbjct: 258 CLGEVVSGGKPVSRINPSGEESAIG-SIAVQKVLQPGESKDFVFYLTWHFP 307
>gi|15922864|ref|NP_378533.1| hypothetical protein ST2529 [Sulfolobus tokodaii str. 7]
gi|15623655|dbj|BAB67642.1| putative glycosidase [Sulfolobus tokodaii str. 7]
Length = 836
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 90/401 (22%), Positives = 161/401 (40%), Gaps = 89/401 (22%)
Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
E +G+ Y +G NT DV Y + L +L+P++ ++ + + + +D
Sbjct: 333 ERKVGEHPYTDGPVNTALNTIDVSTYFIYTLTVLYPQLAKNLLLTSSRSALSYDNPLYIF 392
Query: 586 L------DDGQWVSRKVLG-AVPHDIGIC-------------DPWFEVNAYCL----YDT 621
+ ++V R + ++P I DP + Y +D
Sbjct: 393 YSLAMPENRSKYVERVIKDPSIPSSIEKLLQTVKEIAKETGKDPKGRIAHYIYRDLKFDE 452
Query: 622 ARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ--------FDRDGDGM- 672
DLNP+FVL TGD +F K+++P AM + + + R G
Sbjct: 453 YGRNDLNPEFVLMWGLVSKYTGDIEFMKSLYPVAKEAMESVLRTHSYEGLIYSRLPSGFE 512
Query: 673 --------------------------IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
I++ T+D W+ GI+++ L ++AL+
Sbjct: 513 WNRQVFSYFKDVNIYDNLFLVVSLLGIDSFHMSVNTFDDWTTIGINSFVSLLGISALKIL 572
Query: 707 SALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFN--YDNSGSSQSSSIQADQLAGQ 763
+ L GD+ + + A +YE LWNG YF+ YD + + QA QL G+
Sbjct: 573 NELG---GDK-------YNVENALSLYESMLWNGEYFDLWYDPISGYRDKTCQASQLLGE 622
Query: 764 WYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDG------------R 811
+Y G ++D +K R L + YN+ + + G +NG PDG +
Sbjct: 623 FYLNLLGY-SLLDREKTRKTLLSIVKYNLKE----EEGVINGAYPDGYRPLMREYENPLK 677
Query: 812 VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYE 852
+ +S+ WSGV + +A+ +I+E + D + +Y+
Sbjct: 678 IKEASIHQDTPWSGVEFYLASHLIYEKMIDEAKKVLKEVYD 718
>gi|150009713|ref|YP_001304456.1| hypothetical protein BDI_3128 [Parabacteroides distasonis ATCC
8503]
gi|149938137|gb|ABR44834.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 901
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 93/421 (22%), Positives = 161/421 (38%), Gaps = 69/421 (16%)
Query: 70 LIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKP 129
I + G R D + + GVPLGG+G G++ + G F R + +
Sbjct: 34 FIPSKGLLGGRTDFDYLKLENHPGNSGVPLGGIGVGNVEFAPDGRFARIGLNNIHLPIRK 93
Query: 130 VLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYH-ALYPRA 188
A+ FS++ G K ++V+ +++D+ + L G + Y+ L+PR
Sbjct: 94 STASFFSLWYK---GAKETAVV-----RLVRDSLP------QYGLLGVEDIYYTGLFPRV 139
Query: 189 WTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNS 248
+ G P + R SP+IPH+ K SS PV F + + + ++++ F+W +
Sbjct: 140 -EMDFGNSIPGINATIRAFSPLIPHDVKNSSLPVVFFDVDLVS--QEGGEVSIAFSWEDF 196
Query: 249 VGG-----------DSEFTGQHYNSKTK------------------------MNDGVHAV 273
+G D + GQ NS +
Sbjct: 197 IGRGIKEPDDIKGMDGQLFGQERNSLCNGEAWPERPLEQTYSESWSCGKMQGIRQFASGP 256
Query: 274 LLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRL 333
L R + Q A+ A+ + V VS+ P F I + W K+ G F+ +
Sbjct: 257 LKPKRANFQNYVNEVAVLAESREDVEVSVLPSFNIK------ETHEAWDLFKQEGRFEDI 310
Query: 334 NSMETSVTSEPGSSI-GAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK---TYY--- 386
S + S+ S PG + G+A+A + + V F L W PE+ YY
Sbjct: 311 -SKDVSLLSIPGKQMGGSAVALKTKLKAKEKKTVRFMLVWFYPEMMIDRENDPLEYYWVG 369
Query: 387 -RRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYY 445
Y +++ + N + + + WQ P+L + +P+WY L N Y
Sbjct: 370 GSDYGRYFHNYFNRLEQLVDYGYGNRERFREKTTEWQLPVL-NSTMPDWYKFKLINSGYV 428
Query: 446 L 446
+
Sbjct: 429 I 429
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 42/168 (25%), Positives = 81/168 (48%), Gaps = 10/168 (5%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
++Q+ +D TGD + VY M ++ +G N TYD ++ +
Sbjct: 523 IIQLVKDYEQTGDFHYLSRYAGRVYNGMDFLKSLMPEG----LNIPIGGTTYDDFTHPPV 578
Query: 692 SAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA-KVVYEKLWNGSYFNY----DN 746
+Y +++A L+AA +A+ + D E+ + +F K K + LWNG +F+Y D
Sbjct: 579 YSYGASIYLATLRAAQVVAQAMKDTVRENEYKEQFDKTQKDMIRLLWNGRFFSYGCELDG 638
Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
SG + + + QL GQ+ +R CG ++ + +++L + ++ K
Sbjct: 639 SGR-KDNLLFTGQLGGQFVSRYCGWGDVIPMEMTKASLVSQFKISLSK 685
>gi|332796016|ref|YP_004457516.1| hypothetical protein Ahos_0327 [Acidianus hospitalis W1]
gi|332693751|gb|AEE93218.1| conserved hypothetical protein [Acidianus hospitalis W1]
Length = 662
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 57/194 (29%), Positives = 84/194 (43%), Gaps = 16/194 (8%)
Query: 598 GAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYV 657
G VPHD+GI F+ RWKDLNP F+L VYR TGD F + +P +
Sbjct: 383 GYVPHDLGIWS--FDSPIEGTTAPPRWKDLNPTFILLVYRYFKFTGDVDFLRETYPKMLK 440
Query: 658 AMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
A +M R IE +G D +D + G + L++A+ A + + ++
Sbjct: 441 AYEWM--LTRS----IEAEGSGDTAFDVLPIKGKNPMLLTLFIASALALRETKKVLNEKD 494
Query: 718 SEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDE 777
K + + L+NG YF I QL G+W+ GL + DE
Sbjct: 495 EST----DLSKLREMLNSLYNGKYF----IAWEGQEGIFMAQLLGEWWTELLGLENVTDE 546
Query: 778 DKARSALKKVYNYN 791
+K SAL+ + N
Sbjct: 547 EKISSALRYMLEVN 560
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 78/368 (21%), Positives = 148/368 (40%), Gaps = 59/368 (16%)
Query: 96 GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQ-FSVFVSRSNGQKYSSVLCPK 154
GVPLGG+G+G + + RG+ +F + P+ + + F +F+S G +
Sbjct: 13 GVPLGGIGTGKLEINNRGKLVNITVFGNLS--SPIKSVRGFHIFISPDEGDPFF------ 64
Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
+ KD + + D+ Y +YP + + E+++ SPIIP N
Sbjct: 65 ---LEKDLRIRNMR------EVDELYYEGVYPFCYLTGK---KGEIKVTLEAFSPIIPRN 112
Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVL 274
K+S+ P + + S K T+ ++ N VG S G+ K+ +G+ +
Sbjct: 113 IKDSNIPAVGLSIKVSGSKKG----TIYLSFPNFVGSTS--IGR---INRKVENGI--IF 161
Query: 275 LHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEI--KEHGSFDR 332
+ RT+ P + G+ S P +IS ++ + A++ + K+ D
Sbjct: 162 TNIRTNEYDP-------RRGEVGIFSSY-PTLIISQYNINVDAREAFKTFIYKDLYEDDN 213
Query: 333 LNSMETSVTSEPGSSIG-----AAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYR 387
+ V S+ +G A I AS + ++ F +AW + K+Y
Sbjct: 214 VWRNINKVKSDEHEVVGFWDDPAGIIASNY---EENKEIRFVIAW------YSRAKSYQY 264
Query: 388 RYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLN 447
Y +Y + +A+ + + + W+ I+ + LP+W + N Y L
Sbjct: 265 PYGFYYHNRFSGVLEVAQYFMKNYEELRKRTIEWKERIM-NINLPDWLKDAIINSSYILT 323
Query: 448 AGGAVWTD 455
+ + W D
Sbjct: 324 S--STWLD 329
>gi|326436882|gb|EGD82452.1| hypothetical protein PTSG_11966 [Salpingoeca sp. ATCC 50818]
Length = 1720
Score = 73.9 bits (180), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 86/320 (26%), Positives = 128/320 (40%), Gaps = 42/320 (13%)
Query: 97 VPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTP 156
PLG G G I + G ++WQI R+C D V + F+++ + G + VL
Sbjct: 885 CPLGPCGGGCIALAGDGGLRQWQILNRICHDAHVPDSFFALYTNDGKGGVDARVL----- 939
Query: 157 EVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYK 216
++LKD ++G Y +++ G ++ + P IP + K
Sbjct: 940 QLLKDLPHLSA----IEIEGKYPVMDVDYT---SLNNG-----IKASMECMCPCIPLDSK 987
Query: 217 ESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGD--------SEFTGQHYNSKTKMN- 267
S PV F +T+ N+ + D++LL + N VG D +EF G N KM
Sbjct: 988 NSGLPVIFFNFTLTNTTSDAVDVSLLMSQHNLVGWDGETAIENETEFAGYGGNVNAKMTM 1047
Query: 268 DGVHAVLLHHRTSHQLPPVT--FALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIK 325
DGV AV + P A+AA D VS + K +W
Sbjct: 1048 DGVTAVDMTSTQFRDTDPQAGHIAIAALTGDSSEVSTMLQYD--------AHKTLWSHFA 1099
Query: 326 EHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPE--VNFMSGK 383
+ G +L S S G + A+ S T+ + +TF LAW P VN+ S K
Sbjct: 1100 KTG---KLPGQGQSTPSPSGKTWNGAVCVSRTLAAGATTTITFVLAWHFPNRYVNW-SQK 1155
Query: 384 TYYRRYTKFYGTHQNAAANI 403
+Y TK NA AN
Sbjct: 1156 GFYIDDTKSQYWLGNAYANF 1175
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/180 (25%), Positives = 85/180 (47%), Gaps = 14/180 (7%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMA-YMDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
V++ YR+V+ + K + V M ++ D GDG++ G TYD ++ G
Sbjct: 1328 VIKTYREVLMGAPSSWFKDKYHQVLKIMERWLIIMDAQGDGVVR--GAQPSTYDV-ALHG 1384
Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLW--NGSYFN----- 743
+++ G L++ AL+AA +A D S F +F+ + Y+KL NG ++
Sbjct: 1385 ANSFIGSLYMCALRAAEEMAVLQNDPDSAAKFRARFKLSVANYDKLCFTNGKWYTQVVDP 1444
Query: 744 -YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGA 802
+D + +S+ + D L G+W+A+ G+ ++ S L+ + YN + R A
Sbjct: 1445 AHDTNIIGESTFV--DSLLGEWWAQFLGIGAMLPPAHVLSTLQNCFKYNHVDAFDPARQA 1502
>gi|158316208|ref|YP_001508716.1| hypothetical protein Franean1_4431 [Frankia sp. EAN1pec]
gi|158111613|gb|ABW13810.1| protein of unknown function DUF608 [Frankia sp. EAN1pec]
Length = 934
Score = 73.6 bits (179), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 70/309 (22%), Positives = 129/309 (41%), Gaps = 37/309 (11%)
Query: 172 WN-LKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIY 230
WN LK +Y L+P + + G + ++ + SPI+ + + +S PV+ F +
Sbjct: 184 WNTLKPGDGSYAVLFPFGYMTYSGF---QSKVSTKIWSPIVANEDERTSMPVAFFDMLMN 240
Query: 231 NSGKTSADITLLFTWTNS--VGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVT- 287
N I+++ T+ N+ S TG + G+ V L + P
Sbjct: 241 NPTAKPIKISVMLTFPNAPAFATGSVRTGLYSRFDRDSASGIGGVTLGSDSPENTPDTVT 300
Query: 288 --FALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPG 345
+ +AA G ++ C + SG+ D++ G+ +L + + ++
Sbjct: 301 SEWTIAAHPFAGQTLTYCTSWDGSGD-----GSDIYAPFSAAGADGKLPNGDIDQSAS-- 353
Query: 346 SSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-----MSGKTYYRRYTKFYGTHQNAA 400
A+A ++T+ PD V F+L+WD P+V + + + RRYT F G +
Sbjct: 354 ---AGAVAVALTLEPDQTQTVRFALSWDFPQVYYDGEDATTRAVWMRRYTAFLGGKTSRT 410
Query: 401 AN-------------IARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLN 447
+ IAR + + +E+W +PI E+ ++P W NELY++
Sbjct: 411 NDYVQDSYPFRQGFTIARKELARYDDSLAAVESWWKPIAENPQVPPWLRKASLNELYHMI 470
Query: 448 AGGAVWTDG 456
G+ W G
Sbjct: 471 FNGSFWESG 479
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 80/319 (25%), Positives = 128/319 (40%), Gaps = 52/319 (16%)
Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGF--PDQ 681
++D K + + Y + D + + + A+ Y F G + + P
Sbjct: 613 FRDCPHKLIYRTYALIKFYDDDDLLEYGYAPMLKALTYSQFFRPTGSHLPADPASNNPPN 672
Query: 682 TYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK--------AKVVY 733
T D V+G Y+ GL++ +LQ S L + G D + QK AK +
Sbjct: 673 TMDQAVVNGHGIYNCGLYLLSLQILSTLTPQAARLGV-DEATPEIQKELDEELAAAKEEF 731
Query: 734 EKL-WN---GSYFNYDNSGS--SQSSSIQA-------------DQLAGQWYARACGLLPI 774
E++ WN G Y D +G ++ +I+ + AGQ A GL +
Sbjct: 732 ERIFWNPATGRYRYCDGTGGIGDRTGTIRGRFKPVPPPDAIWLESFAGQLVAMELGLPDV 791
Query: 775 VDEDKARSALKKVYNYNVL--KVMGGKRGAVNGMLPDGRVDMSSMQSREI---WSGVTYA 829
VD D AR+ LK + V G G + PD + SS+++ EI G+ +
Sbjct: 792 VDLDHARTHLKNTLDSFVRFRDPEGNLMGGPIILKPDFSIYPSSLRTTEINEVIPGIAFL 851
Query: 830 VAASM-----------IHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQY-- 876
AA I E +G A IY+ +G YAF TPE+W DD +
Sbjct: 852 AAAGAFRIGAKVKDKDITEKALKLGEGCALQIYDIESNG----YAFATPESWFVDDHHIS 907
Query: 877 RSLCYMRPLAIWAMQWALT 895
R Y R ++W++ A++
Sbjct: 908 RFPGYTRTRSVWSLYDAVS 926
>gi|269839491|ref|YP_003324183.1| hypothetical protein Tter_2468 [Thermobaculum terrenum ATCC
BAA-798]
gi|269791221|gb|ACZ43361.1| protein of unknown function DUF608 [Thermobaculum terrenum ATCC
BAA-798]
Length = 807
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 68/295 (23%), Positives = 119/295 (40%), Gaps = 35/295 (11%)
Query: 98 PLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVL-CPKTP 156
PLGG+G+G++ RG+ + W+I R + + F+++ ++G+ + VL P P
Sbjct: 18 PLGGIGTGNVSLGARGDLRDWEICNRPAKGNRLPNTFFALWTRTASGRCVTRVLEGPIQP 77
Query: 157 -----EVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPII 211
+T AG+ + S + YP AW V +P +R+ +P+I
Sbjct: 78 PHDGSHGYHPSTGAGLP------RCRHSRFRGEYPLAW-VELADPALPVRVSLEAYTPLI 130
Query: 212 PHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS-----EFTGQHYNSKTKM 266
P ++S P ++ +YT+ N+ +TL+ + TN VGG + + G N
Sbjct: 131 PLCPEDSGIPCAILSYTVENTSDEVVALTLVGSLTNPVGGVTFDRFGNYRGAGGNQNEYR 190
Query: 267 NDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTA------KDM 320
V+ L T+ + + TD H VI+ L ++
Sbjct: 191 ESEVYRGLWLRSTTLPSDDLRYGDMTLATD--------HPVITYKRAWLRGAWWDYLREF 242
Query: 321 WHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCP 375
W ++ E G RL + SEPG + ++ + P F L W P
Sbjct: 243 WQDLSEDG---RLEDLGYESPSEPGKTDTCSLGLRDELAPGETHTYRFVLTWYFP 294
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 72/232 (31%), Positives = 107/232 (46%), Gaps = 20/232 (8%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
+L+V R+ +GD ++ + VWP V A+AY DGDG DG TYD G
Sbjct: 462 ILRVLREWRLSGDMEWLRRVWPGVKSALAYASA-QWDGDGDGVLDGRQHNTYDV-EFYGP 519
Query: 692 SAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK-AKVVYEKLWNGSYF----NYDN 746
+ G ++AAL+A LA +G+ F++ + + E LWNG Y+ + D
Sbjct: 520 NPLCGLYYLAALRAVEELAEVLGEANLARRCREVFERGSHRLDELLWNGEYYEQLGDEDA 579
Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL-----KVMGGKRG 801
ADQL GQ A G+ +V E+ R AL+ ++ YN V +
Sbjct: 580 WPYQHGKGCLADQLLGQLQAHVLGMGYLVPEEHVRQALRSIWRYNFRADLSEHVNCQRTF 639
Query: 802 AVN---GML-----PDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQ 845
A+N G+L G+ S E+++G Y VAA MI+E L + G Q
Sbjct: 640 ALNDEAGLLMCTWPRGGQPTFPFPYSDEVFTGSEYQVAAHMIYEGLLEEGLQ 691
>gi|383764296|ref|YP_005443278.1| hypothetical protein CLDAP_33410 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381384564|dbj|BAM01381.1| hypothetical protein CLDAP_33410 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 905
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 72/272 (26%), Positives = 117/272 (43%), Gaps = 60/272 (22%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY------MDQFDRDGDGMIENDGFPDQTYDT 685
VL+ YRD GD ++ + +WPSV ++ + +D++D + G++ G T D
Sbjct: 494 VLKTYRDWKICGDNEWLRRLWPSVKKSLEFAWHPNNIDRWDPEKTGVLW--GRQHHTLDM 551
Query: 686 WSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSY--- 741
+ G +++ +++AAL+A + +A +GD + + F + K + L+NG Y
Sbjct: 552 -ELFGPNSWLTSMYLAALKAGAEMAEYLGDEAAAIEYREIFARGKKWADANLFNGEYYIQ 610
Query: 742 ------------FNYDNSGSSQSSSIQA-------------------DQLAGQWYARACG 770
F D S S+++A DQ+ GQW+A G
Sbjct: 611 RINLHDRSIVEAFADDESVLIGGSTLEAYWDDEHGEIKYQIGEGSSIDQVLGQWHASLYG 670
Query: 771 LLPIVDEDKARSALKKVYNYNVLKVMG------------GKRGAVNGMLPD--GRVDMSS 816
L I D + RSAL +Y YN + VMG + G V P + + +
Sbjct: 671 LGEIFDPTQVRSALAAIYKYNFIPVMGEVYNPCRIYCLNDEGGLVICAWPPHVAKPTIPA 730
Query: 817 MQSREIWSGVTYAVAASMIHEDLADIGFQTAC 848
S+E +G YA A+ MI L D G T C
Sbjct: 731 PYSQETMNGFEYAAASHMIMNGLVDEG--TTC 760
>gi|410096859|ref|ZP_11291844.1| hypothetical protein HMPREF1076_01022 [Parabacteroides goldsteinii
CL02T12C30]
gi|409225476|gb|EKN18395.1| hypothetical protein HMPREF1076_01022 [Parabacteroides goldsteinii
CL02T12C30]
Length = 847
Score = 70.5 bits (171), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 77/324 (23%), Positives = 129/324 (39%), Gaps = 49/324 (15%)
Query: 97 VPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTP 156
+PLGG+G+G++ RGE + W+I V A ++S + +N ++ P+
Sbjct: 43 LPLGGIGTGTVSLGGRGELRDWEIMN-------VPAKKYSTVTTGNNAPFFAIYAKPQNQ 95
Query: 157 EVLKDTTAAGIGSWDW-NLKGDKSTYHAL-----------YPRAWTVHEGEPDPELRIVC 204
E A + ++ + +G +H L YP VH + D +++
Sbjct: 96 EATTTLLAGPLYPQEYLHYEGRPVNHHGLPRFAQASFDAAYPFG-QVHLSDKDLPVKVTI 154
Query: 205 RQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGD-SEF-------- 255
+ +P+IP + + S PV+V +Y + N +I + + N +G D S+F
Sbjct: 155 KGFNPLIPGDAEASGLPVAVLSYEVTNITSQPMEIAICGSMRNFIGKDGSKFRTDWKGDY 214
Query: 256 --TGQHYN-SKTKMNDGVHAVLLHHRTSHQLPPV--TFALAAQETDGVHVSLCPHFVISG 310
TG N +K N G+ + L+ + P T AL Q T GV
Sbjct: 215 IPTGVKDNKNKYVENKGIKGIYLYSDGVDKNDPAWGTVALTTQATSGVSYRTSSKSDNWN 274
Query: 311 NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSL 370
N + + W + G N E +P A+++ TV P S TF +
Sbjct: 275 NGI----LNFWDDFSADGMLTERNKQEDE---DPM----ASLSVKKTVKPQSTETFTFYI 323
Query: 371 AWDCPEVNFMS----GKTYYRRYT 390
W+ P S G Y R+YT
Sbjct: 324 TWNFPNRKAWSSTVVGNYYSRQYT 347
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 59/249 (23%), Positives = 109/249 (43%), Gaps = 51/249 (20%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM---DQFDRDGDGMIENDGFPDQTYDTWSV 688
++++YR+ +GD F K W + ++Y +D + DG++E G T D +
Sbjct: 486 IMKIYREWQLSGDNDFLKNNWGQIKKVLSYAWTEKGWDGNQDGVME--GSQHNTMDV-NY 542
Query: 689 SGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK--------LWNGS 740
G + G ++ AL+AA +A + D+G QK + ++E+ L+NG
Sbjct: 543 FGPNPQMGFWYMGALKAAEKMAIAMKDKGFA-------QKCRTLFEQGSNWMDKNLFNGE 595
Query: 741 YFNYD------------NSGSSQSSSIQ------ADQLAGQWYARACGLLPIVDEDKARS 782
Y+ + N+ + + S Q DQL GQ+ + CGL + ++D ++
Sbjct: 596 YYEHKITDPETFEYLDMNNPNVKIPSFQLGPGCLVDQLVGQYMSHLCGLGYLGNKDHIQT 655
Query: 783 ALKKVYNYNVLK------------VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAV 830
+ + YN + VMG + G + P GR+++ E+ +G Y
Sbjct: 656 TMNSIMKYNYVSDFSRHFNNMRSYVMGNESGLLMASWPKGRLEVPFPYFAEVMTGFEYCA 715
Query: 831 AASMIHEDL 839
A MI+E +
Sbjct: 716 AVGMIYEGM 724
>gi|224025536|ref|ZP_03643902.1| hypothetical protein BACCOPRO_02276 [Bacteroides coprophilus DSM
18228]
gi|224018772|gb|EEF76770.1| hypothetical protein BACCOPRO_02276 [Bacteroides coprophilus DSM
18228]
Length = 846
Score = 70.1 bits (170), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 107/251 (42%), Gaps = 51/251 (20%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM---DQFDRDGDGMIENDGFPDQTYDTWSV 688
+++ YRD +GD +F K W V ++Y +D + DG++E G T D +
Sbjct: 484 IMKFYRDWQLSGDSRFLKENWAQVKKVLSYAWTEHGWDDNQDGVME--GSQHNTMDV-NY 540
Query: 689 SGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK--------LWNGS 740
G + G ++ AL+AA +A + DR E +K + ++E L+NG
Sbjct: 541 FGPNPQMGFWYLGALKAAEKMAHAMKDRAFE-------KKCRALFENGSEWLDNNLFNGE 593
Query: 741 YFNYD------------NSGSSQSSSIQ------ADQLAGQWYARACGLLPIVDEDKARS 782
Y+ + N ++ Q DQL GQ+ A CGL + ++ R+
Sbjct: 594 YYEHKITDPETFEFLDMNDPDTKIPDFQLGKGCLVDQLVGQYMAHICGLGYLAKKEHIRT 653
Query: 783 ALKKVYNYNVLK------------VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAV 830
LK + YN + V+G + G + P GR+++ E+ +G Y
Sbjct: 654 TLKSIMRYNFKEDFSRHFNNMRSYVLGDEAGLLMASWPKGRLEVPFPYFAEVMTGFEYCA 713
Query: 831 AASMIHEDLAD 841
A MI+E + +
Sbjct: 714 ATGMIYEGMEE 724
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 65/342 (19%), Positives = 134/342 (39%), Gaps = 59/342 (17%)
Query: 87 IKRHLTSSH----GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRS 142
+ RH H +PLGG+G+G++ RGE + W+I V ++S + +
Sbjct: 27 VLRHYDQQHLYQIALPLGGIGTGTVSLGGRGELRDWEIMN-------VPGKKYSTVTTGN 79
Query: 143 NGQKYSSVLCPKTPEVLKDTTAAGIGSWDW--------NLKG----DKSTYHALYPRAWT 190
N ++ P+ E A + ++ N G S++ A YP
Sbjct: 80 NAPFFAIYTKPEKGEAQTTLLAGALYDQEYLHYEGRPVNHHGLPRFKNSSFDAAYPFG-Q 138
Query: 191 VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG 250
VH + + + + +P++P + + S P++V Y + N+ +T D+++ + N +G
Sbjct: 139 VHLSDKGIPVEVTVKGFNPLVPGDAEASGIPIAVLCYEVTNTSQTPVDVSICGSIRNFIG 198
Query: 251 GD-----SEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTF-ALAAQETDGVHVSLCP 304
D +++ G + K N + ++++ L + F + ++TD + ++
Sbjct: 199 KDGSKFVTDWKGDYIPVGCKDNRNI------YKSAQGLQGIYFCSEGVEKTDPAYGNMT- 251
Query: 305 HFVISGNSLGLTAKD-------------MWHEIKEHGSFDRLNSMETSVTSEPGSSIGAA 351
+++ G+T + W + G L M ++P A+
Sbjct: 252 --LVTNAPEGVTYRTSSKADDWSNGILGFWDDFSADG---ELTEMSKQYDADP----MAS 302
Query: 352 IAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFY 393
+A + P TF L W+ P S Y++ Y
Sbjct: 303 LAVKKKLAPGETRSFTFFLTWNFPNRKAWSSSVIGNYYSQQY 344
>gi|320109368|ref|YP_004184958.1| hypothetical protein AciPR4_4219 [Terriglobus saanensis SP1PR4]
gi|319927889|gb|ADV84964.1| hypothetical protein AciPR4_4219 [Terriglobus saanensis SP1PR4]
Length = 898
Score = 70.1 bits (170), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 56/167 (33%), Positives = 86/167 (51%), Gaps = 10/167 (5%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
V+Q+ +D TGD F K + AYM RD D I G QTYD + I
Sbjct: 532 VIQLAKDYQQTGDDTFIKKNADHIRRGFAYMKAQIRD-DSFIPVGG---QTYDDFPHPEI 587
Query: 692 SAYSGGLWVAALQAASALAREVGDR--GSEDYFLFKFQKAKVVYEKLWNGSYFNY--DNS 747
S Y+G +++AAL+A L +GD E FK +A +++ LWNG +F Y D
Sbjct: 588 SVYTGTVYLAALRAGVVLGDALGDTQMSHEAETQFKQTQAGLIH-ALWNGRFFAYGTDIG 646
Query: 748 GS-SQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL 793
G+ + + + QLAGQ+ +R G ++ D+A+SA+ + + +VL
Sbjct: 647 GAHRRDDRLFSGQLAGQFLSRYAGWGDVLPHDQAKSAIYEQLHTSVL 693
>gi|283835266|ref|ZP_06355007.1| conserved hypothetical protein [Citrobacter youngae ATCC 29220]
gi|291068424|gb|EFE06533.1| conserved hypothetical protein [Citrobacter youngae ATCC 29220]
Length = 896
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 68/272 (25%), Positives = 119/272 (43%), Gaps = 67/272 (24%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM------DQFDRDGDGMIENDGFPDQTYDT 685
VL +YRD + GD+ + + W +V + + DQ+D + G++ G T D
Sbjct: 470 VLAIYRDWLLCGDRAWLEEKWSAVRDMLRFAWNPDNPDQWDPEQRGVLT--GRQHHTLDM 527
Query: 686 WSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFK--FQKAKV-VYEKLWNGSYF 742
+ G + + GL++AAL+A A+A +G ++D L++ ++ +V + E+L+NG YF
Sbjct: 528 -ELFGPNPWLTGLYIAALEAGCAIANHLGH--NDDAMLWQDIARRGRVTLRERLFNGEYF 584
Query: 743 ----------------------NYDNSGS---------------SQSSSIQADQLAGQWY 765
+Y SG S DQL GQW+
Sbjct: 585 QQVIDLANRDTLAPWRTSAESADYGLSGDVWQQYWCEENHEVKYQFGSGCHIDQLMGQWF 644
Query: 766 ARACGLLPIVDEDKARSALKKVYNYNVLKV-----------MGGKRGAVNGMLPDGRVDM 814
A CGL P+ D + + AL+ ++ YN + + ++G P G DM
Sbjct: 645 AGLCGLPPVFDPGQTQRALRALWQYNFVSARHHTNPCRLYGLNDEQGVSICSWPTG--DM 702
Query: 815 SSMQ---SREIWSGVTYAVAASMIHEDLADIG 843
++ ++E+ +G Y +AA M+ +A G
Sbjct: 703 PAIPVPYAQEMMTGFEYQLAALMVASGMATEG 734
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 68/294 (23%), Positives = 117/294 (39%), Gaps = 31/294 (10%)
Query: 98 PLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVL------ 151
PLGG+G+G I + G +IF + F+V R+ + +L
Sbjct: 15 PLGGIGTGCISLAGNGHLTDVEIFNHPDKGYHHGYTHFAVKAERNGALVDARILQGDWQG 74
Query: 152 --CPK---TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGE-PDPELRIVCR 205
C + + K T G+G L G H+ + + + + DPE R
Sbjct: 75 NACGQWQIHDRLRKRHTGYGVGPDAETLAGMPHFAHSEFDGEFPMAQMSFSDPEFPGNIR 134
Query: 206 QIS--PIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
++ P IP + SS P++ F + N+ + D TL + N + DS +++
Sbjct: 135 MLAFNPFIPCDEDNSSLPLAWFELEVENTTDAALDYTLCLSLCNGLAPDST------HNR 188
Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISG--NSLGLTAKDMW 321
++ + H + L R P +A A TD VS ++ +SL + +D
Sbjct: 189 HEIREDRHMMTLTQRHYPADDP-RYAELATTTDAQDVSWQTYWYRGDWFDSLTMFWRDFC 247
Query: 322 HEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCP 375
+ RL E + S+PG + A++AA + + G+V F W CP
Sbjct: 248 GQ-------PRLPEREYT-QSQPGFADHASLAAHLHLAAGERGRVRFWTGWYCP 293
>gi|224535965|ref|ZP_03676504.1| hypothetical protein BACCELL_00829 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522420|gb|EEF91525.1| hypothetical protein BACCELL_00829 [Bacteroides cellulosilyticus
DSM 14838]
Length = 899
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 91/396 (22%), Positives = 159/396 (40%), Gaps = 76/396 (19%)
Query: 96 GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKT 155
GVPLGG+G G+I + G+F R + K + FS++ + N ++ +V +
Sbjct: 64 GVPLGGIGVGNINFAPSGKFTRIGMNNIHTPIKRSEHSFFSLWTRKGNEKE--AVRLVRD 121
Query: 156 PEVLKDTTAAGIGSWDWNLKGDKST-YHALYPRAWTVHEGEPDPELRI--VCRQISPIIP 212
VL + +KG + T Y L+P T D L++ V R S ++P
Sbjct: 122 NHVL------------YGMKGVEHTQYKGLFP---TAELSYADNNLKVTPVIRAYSGLVP 166
Query: 213 HNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG--GDSEF-----TGQHYNS-KT 264
HN K+SS PV F + + A L F+W + +G D + GQ + +
Sbjct: 167 HNVKDSSLPVVWFEVDLISQEDMEA--ALAFSWEDFIGLFNDPKSLEGFDNGQLLSEGRA 224
Query: 265 KMNDG--------------------VHAVLLHHRTSHQLPPVTF-------ALAAQETDG 297
+N+G + ++ + S Q +TF +A +E
Sbjct: 225 NINNGENWPLREKAKTYVEPYQMGSLKGLIQYAADSLQPRKLTFQNYVNQVVIAVEEEK- 283
Query: 298 VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVT 357
+VS P + + + W + + +G F + + +T + +S +A+A
Sbjct: 284 -NVSYLPTY--------RSNSEAWDQFRNNGEFTS-SLTKNVLTEQSQTSSASALAVKTQ 333
Query: 358 VPPDSEGQVTFSLAWDCPEVNFMSGK----TYY---RRYTKFYGTHQNAAANIARDAILE 410
+ + + F LAW PE+ + +Y+ Y K+Y + N ++ A+
Sbjct: 334 LKAGQKKTIRFMLAWYAPELQIDAAALPIGSYWPCGADYNKYYHNYFNCMNSMVSYAVSN 393
Query: 411 HGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYL 446
Q WQ P+LE LP+WY L N Y +
Sbjct: 394 RARIARQTTEWQIPVLESS-LPDWYKFKLINSGYVI 428
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 44/157 (28%), Positives = 78/157 (49%), Gaps = 8/157 (5%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
++Q+ +D TGD ++ KA + AM ++ + R G G TYD ++ +
Sbjct: 522 IIQLVKDYEQTGDTEYLKAHLTGLKRAMKFL--YSRMPQGSTIPVG--PTTYDDFTHPPL 577
Query: 692 SAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA-KVVYEKLWNGSYFNY--DNSG 748
+Y G+W+ L+A A+ + +GD +F + K EKLWNG +F Y + G
Sbjct: 578 YSYYAGVWLTTLKAYEAIGKAIGDESIVKQAQQQFATSQKEALEKLWNGRFFAYGCEPDG 637
Query: 749 SSQSSSIQ-ADQLAGQWYARACGLLPIVDEDKARSAL 784
S + ++ QLAGQ+ +R CG + D ++++
Sbjct: 638 SKRLDNVLFTGQLAGQFLSRYCGWGDVYPMDIVKASM 674
>gi|89068658|ref|ZP_01156050.1| hypothetical protein OG2516_16434 [Oceanicola granulosus HTCC2516]
gi|89045845|gb|EAR51906.1| hypothetical protein OG2516_16434 [Oceanicola granulosus HTCC2516]
Length = 876
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 94/375 (25%), Positives = 144/375 (38%), Gaps = 54/375 (14%)
Query: 92 TSSHGV-PLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRS-------- 142
TS + V PLGG+G+G++G G Q W+IF R + + F+V R+
Sbjct: 13 TSPYAVFPLGGIGAGTVGIGADGRLQDWEIFNRPAKGSVNGFSHFAVRAERAGEVVDTRV 72
Query: 143 -----NGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKG----DKSTYHALYPRAWTVHE 193
G + S L P + G G + G S+ YP A E
Sbjct: 73 LHGPFTGNRTGSFLAP-------EFNGYGFGPRRQQMTGFPSFRSSSLAGPYPAADLTFE 125
Query: 194 GEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS 253
+P + R +SP +P ++SS PV++F T N G T D TL
Sbjct: 126 -DPAFPGAVDMRALSPFVPLEDRDSSMPVAMFEVTFNNPGDTPLDYTLFGCLA------F 178
Query: 254 EFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPP--VTFALAAQETDGVHVSLCPHFVISGN 311
+F N++ + G +L T + P +LA D H H + GN
Sbjct: 179 DFVDLEGNAEVDIATGDRTILTGRTTLGEDHPDHRQLSLATDHDDTSH----QHHLYRGN 234
Query: 312 SLGLTAKDMWHEIKEHGSF-DR-LNSMETSVTSEPGSSI-GAAIAASVTVPPDSEGQVTF 368
T + W +I G DR S ++SV + +I + +AA VTVP V
Sbjct: 235 WFD-TMEVYWADIARGGRLRDRKYPSAKSSVANNQADAIEHSLLAAHVTVPAGESRTVRM 293
Query: 369 SLAWDCPEVNFMSGKTY-----------YRRYTKFYGTHQNAAANIARDAILEHGSWELQ 417
+L+W P T+ + +Y T ++A +AR A+ +
Sbjct: 294 ALSWYVPNFRKYWLTTFPMACCKPSAEVQTHWKNYYATQWSSAEEVARVALDGWADLRDR 353
Query: 418 IEAWQRPILEDKRLP 432
A+ R +L D LP
Sbjct: 354 TLAF-RNVLSDTTLP 367
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 48/209 (22%), Positives = 92/209 (44%), Gaps = 46/209 (22%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM------DQFDRDGDGMIENDGFPDQTYDT 685
V++ YRD +GD + +++WPS+ A+ + D++D D G++ G T D
Sbjct: 474 VMKAYRDWKLSGDDAWLRSLWPSIKSAIEFAWHPDNPDRWDPDQTGVLS--GRQHHTLDM 531
Query: 686 WSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKV-VYEKLWNGSYFN- 743
+ G +++ G ++ AL A + +A VG+ + + F+K K V E+L+NG +F
Sbjct: 532 -ELFGPNSWLAGFYLGALSAGAEMAEFVGEADTAAQYRALFEKGKAWVGEELFNGEWFAQ 590
Query: 744 ------------YDNSGSSQS-----------------------SSIQADQLAGQWYARA 768
+ G S + DQ+ +W+AR
Sbjct: 591 KIDIADRDTLAPFQEPGGDASHISGSIFEQYWSEEHGELKYQIGAGCAIDQVLAEWHARL 650
Query: 769 CGLLPIVDEDKARSALKKVYNYNVLKVMG 797
GL + D + +A++ +Y +N ++ +G
Sbjct: 651 YGLDHLYDPNDFATAVRAIYEHNFVERLG 679
>gi|423347418|ref|ZP_17325105.1| hypothetical protein HMPREF1060_02777 [Parabacteroides merdae
CL03T12C32]
gi|409217877|gb|EKN10851.1| hypothetical protein HMPREF1060_02777 [Parabacteroides merdae
CL03T12C32]
Length = 837
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 62/243 (25%), Positives = 106/243 (43%), Gaps = 39/243 (16%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM---DQFDRDGDGMIENDGFPDQTYDTWSV 688
V+++YRD +GD +F + W V +AY +D + DG++E G T D +
Sbjct: 476 VMKIYRDWQLSGDDEFLQKNWGQVKKVLAYAWTDKGWDGNQDGVME--GLQHNTMDV-NY 532
Query: 689 SGISAYSGGLWVAALQAASALAREVGDR--GSEDYFLFKFQKAKVVYEKLWNGSYFNYDN 746
G + G ++ AL+AA +A + D+ + LF+ Q + + L+NG Y+ +
Sbjct: 533 FGPNPQMGFWYMGALKAAEKMALAMKDKTFAKKCNTLFR-QGSTWMDANLFNGEYYEHKI 591
Query: 747 SGSSQSSSIQ------------------ADQLAGQWYARACGLLPIVDEDKARSALKKVY 788
+ + DQL GQ+ A CGL + D++ R+ L +
Sbjct: 592 TDPETFEYLDMRNPDVKVPPFQLGKGCLVDQLVGQYMAHICGLGYLGDKEHIRTTLGSIM 651
Query: 789 NYNVLK------------VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
YN +K VMG + G + P GR+++ E+ +G Y A MI+
Sbjct: 652 KYNYVKDFSRHFNNMRSYVMGDESGLLMASWPKGRLEVPFPYFAEVMTGFEYCAAVGMIY 711
Query: 837 EDL 839
E +
Sbjct: 712 EGM 714
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 64/305 (20%), Positives = 121/305 (39%), Gaps = 45/305 (14%)
Query: 97 VPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTP 156
+PLGG+G+G++ RGE + W+I V ++S + +N ++ P+T
Sbjct: 33 LPLGGIGTGTVSLGGRGELRDWEIM-------NVPGKKYSTVTTGNNAPFFAIYAKPQTG 85
Query: 157 EVLKDTTAAGIGSWDW--------NLKG----DKSTYHALYPRAWTVHEGEPDPELRIVC 204
E + A + ++ N G +T+ A YP VH + +++
Sbjct: 86 EAMTTLLAGPLYPQEYLHYEGRPVNHHGMPRFADATFEAAYPFG-QVHLSDRQLPVKVTV 144
Query: 205 RQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGD-----SEFTGQH 259
+ +P++P + S P++V Y + N+ ++ + + N +G D +++ G +
Sbjct: 145 KGFNPLVPGDADASGMPIAVLAYEVTNTTDRPLEVAVCGSMRNFIGKDGSKFRTDWKGDY 204
Query: 260 YNSKTKMN-------DGVHAVLLHHRTSHQLPPV--TFALAAQETDGVHVSLCPHFVISG 310
+ K N +G+ + L+ + P T AL Q +GV
Sbjct: 205 IPTGAKDNKNQFRKKNGLQGLYLYSDGVDKNDPAYGTVALTTQAVEGVTYRTSSRADDWN 264
Query: 311 NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSL 370
N + + W + G + E +P +S +A TV P S TF L
Sbjct: 265 NGI----LNFWDDFSADGELTERDRQE---DEDPMAS----LAVKKTVGPGSTETFTFYL 313
Query: 371 AWDCP 375
W+ P
Sbjct: 314 TWNFP 318
>gi|154490957|ref|ZP_02030898.1| hypothetical protein PARMER_00874 [Parabacteroides merdae ATCC
43184]
gi|423724888|ref|ZP_17699030.1| hypothetical protein HMPREF1078_02927 [Parabacteroides merdae
CL09T00C40]
gi|154088705|gb|EDN87749.1| hypothetical protein PARMER_00874 [Parabacteroides merdae ATCC
43184]
gi|409236060|gb|EKN28869.1| hypothetical protein HMPREF1078_02927 [Parabacteroides merdae
CL09T00C40]
Length = 837
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 62/243 (25%), Positives = 106/243 (43%), Gaps = 39/243 (16%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM---DQFDRDGDGMIENDGFPDQTYDTWSV 688
V+++YRD +GD +F + W V +AY +D + DG++E G T D +
Sbjct: 476 VMKIYRDWQLSGDDEFLQKNWGQVKKVLAYAWTDKGWDGNQDGVME--GLQHNTMDV-NY 532
Query: 689 SGISAYSGGLWVAALQAASALAREVGDR--GSEDYFLFKFQKAKVVYEKLWNGSYFNYDN 746
G + G ++ AL+AA +A + D+ + LF+ Q + + L+NG Y+ +
Sbjct: 533 FGPNPQMGFWYMGALKAAEKMALAMKDKTFAKKCNTLFR-QGSTWMDANLFNGEYYEHKI 591
Query: 747 SGSSQSSSIQ------------------ADQLAGQWYARACGLLPIVDEDKARSALKKVY 788
+ + DQL GQ+ A CGL + D++ R+ L +
Sbjct: 592 TDPETFEYLDMRNPDVKVPPFQLGKGCLVDQLVGQYMAHICGLGYLGDKEHIRTTLGSIM 651
Query: 789 NYNVLK------------VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
YN +K VMG + G + P GR+++ E+ +G Y A MI+
Sbjct: 652 KYNYVKDFSRHFNNMRSYVMGDESGLLMASWPKGRLEVPFPYFAEVMTGFEYCAAVGMIY 711
Query: 837 EDL 839
E +
Sbjct: 712 EGM 714
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 64/305 (20%), Positives = 121/305 (39%), Gaps = 45/305 (14%)
Query: 97 VPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTP 156
+PLGG+G+G++ RGE + W+I V ++S + +N ++ P+T
Sbjct: 33 LPLGGIGTGTVSLGGRGELRDWEIM-------NVPGKKYSTVTTGNNAPFFAIYAKPQTG 85
Query: 157 EVLKDTTAAGIGSWDW--------NLKG----DKSTYHALYPRAWTVHEGEPDPELRIVC 204
E + A + ++ N G +T+ A YP VH + +++
Sbjct: 86 EAMTTLLAGPLYPQEYLHYEGRPVNHHGMPRFADATFEAAYPFG-QVHLSDRQLPVKVTV 144
Query: 205 RQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGD-----SEFTGQH 259
+ +P++P + S P++V Y + N+ ++ + + N +G D +++ G +
Sbjct: 145 KGFNPLVPGDADASGMPIAVLAYEVTNTTDRPLEVAVCGSMRNFIGKDGSKFRTDWKGDY 204
Query: 260 YNSKTKMN-------DGVHAVLLHHRTSHQLPPV--TFALAAQETDGVHVSLCPHFVISG 310
+ K N +G+ + L+ + P T AL Q +GV
Sbjct: 205 IPTGAKDNKNQFRKKNGLQGLYLYSDGVDKNDPAYGTVALTTQAVEGVTYRTSSRADNWN 264
Query: 311 NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSL 370
N + + W + G + E +P +S +A TV P S TF L
Sbjct: 265 NGI----LNFWDDFSADGELTERDRQE---DEDPMAS----LAVKKTVGPGSTETFTFYL 313
Query: 371 AWDCP 375
W+ P
Sbjct: 314 TWNFP 318
>gi|348173841|ref|ZP_08880735.1| hypothetical protein SspiN1_25464 [Saccharopolyspora spinosa NRRL
18395]
Length = 995
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 73/254 (28%), Positives = 122/254 (48%), Gaps = 41/254 (16%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYD-TWSVSG 690
+L++YR+ + D ++ + VWP A+ Y+ D + DG++ DG T D TW G
Sbjct: 608 ILRIYREHQMSPDSQWLQRVWPRTRQAVEYLIASDAEPDGIL--DGAQPNTQDATW--FG 663
Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK-----AKVVYEKLWNGSYFNYD 745
+++ GL++AAL+A +A+A EVGD D F + + +V+ L+NG YF ++
Sbjct: 664 RNSWLSGLYLAALRAGAAMAGEVGD----DAFARRCTELATSGTEVIVRDLFNGEYFVHE 719
Query: 746 ----NSGSSQSS-SIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL------- 793
+ GS ++ ADQL G +A GL ++ + RSAL+ ++ +N +
Sbjct: 720 LDPAHPGSVNTNRGCFADQLLGPSWATQLGLPRVLPPEPTRSALRSIWRHNFVPRPMEYR 779
Query: 794 ---KVMGGK-------RGAVNGMLPDGRVDM-----SSMQSREIWSGVTYAVAASMIHED 838
+ GG+ V P+G D S E W G+ Y VAA +I E
Sbjct: 780 QNSPIEGGRIFYDADVPALVMCTWPNGGGDEAGDNWSVSYFNEAWHGIEYQVAAHLIAEG 839
Query: 839 LADIGFQTACGIYE 852
+ D G A +++
Sbjct: 840 MVDEGLAVARSVHD 853
Score = 46.2 bits (108), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 36/158 (22%), Positives = 63/158 (39%), Gaps = 17/158 (10%)
Query: 96 GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKT 155
G+P+GG+G G + G W + +++ AN N ++ + P +
Sbjct: 28 GLPIGGIGCGQLHLGGDGRLWLWDV-----DNRTAPAN--------INDLHFTRPMLPSS 74
Query: 156 PE----VLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPII 211
P ++ T + + + +G A P A TV +P +++ SP I
Sbjct: 75 PVEQGFAVRVTDGDDVRTRWLDARGFPDATFAGRPPAATVDYADPQEPVQVRLDACSPFI 134
Query: 212 PHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSV 249
P +SSYP YT N+G + D+ + N V
Sbjct: 135 PTEIDDSSYPAVFLDYTATNTGPATVDVAIAGFLANPV 172
>gi|332665583|ref|YP_004448371.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332334397|gb|AEE51498.1| protein of unknown function DUF608 [Haliscomenobacter hydrossis DSM
1100]
Length = 857
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 80/312 (25%), Positives = 129/312 (41%), Gaps = 53/312 (16%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM---DQFDRDGDGMIENDGFPDQTYDTWSV 688
V++ YR+ + +GD ++ K W V AM+Y +D + DG+ E G T D
Sbjct: 491 VMRFYREWILSGDNEWLKKHWARVKAAMSYAWVPGGWDANQDGVQE--GKQHNTMDV-DY 547
Query: 689 SGISAYSGGLWVAALQAASALAREVGD----RGSEDYFLFKFQKAKVVYEKLWNGSYFNY 744
G + + AL+A++A+AR + D + E + +K V E L+NG Y+ +
Sbjct: 548 YGPNPQMQFWYFGALKASAAMARAMNDPVFAKQCEQVLA---KGSKWVDENLFNGEYYEH 604
Query: 745 -----------DNSGSSQS-------SSIQADQLAGQWYARACGLLPIVDEDKARSALKK 786
D S S + DQ+ GQ+ A CGL + ++ ++AL+
Sbjct: 605 KITDPKTFEFLDRSNPSTAIPNFQLGEGCLLDQVVGQYMAHVCGLGYLAKKENIQTALQT 664
Query: 787 VYNYNVLK------------VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASM 834
V YN + VM + G + P GR+ + E SG YA A M
Sbjct: 665 VMKYNFVPRFDNVFNNMRSYVMDKESGLIMASWPKGRLKVPFPYFAESMSGFEYAAAIGM 724
Query: 835 IHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
++E + G Q I + + G F PE + Y R +A W+ AL
Sbjct: 725 LYEGQTEAGLQCIQSIRD-RFDGEKRN-PFDEPECGHH--------YARAMASWSAVLAL 774
Query: 895 TRPKPKTLEKQM 906
+ + +EK M
Sbjct: 775 SGFQYSGVEKSM 786
Score = 47.8 bits (112), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 65/317 (20%), Positives = 124/317 (39%), Gaps = 45/317 (14%)
Query: 87 IKRHLTSSH----GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL--ANQ---FSV 137
+ +H H +PLGG+G+G++ RGE + W+I + + + NQ F+V
Sbjct: 25 VLKHYDQDHLLNIALPLGGIGTGTVSLGGRGELRDWEIMNKPGKKFSTITDGNQAPFFAV 84
Query: 138 FVSRSNGQKYS-SVLCPKTPEVLKDTTAAGIGSWDWNLKG----DKSTYHALYPRAWTVH 192
+V S ++L P P + + N G +++ A YP V+
Sbjct: 85 YVKPIGKTAQSKALLGPFHPSEYQHYEGRPV-----NHHGMPRFSVASFDAAYPFG-QVN 138
Query: 193 EGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG-- 250
+ + + + +P+IP + SS P+++ Y + N+ ++ + T N +G
Sbjct: 139 LSDDVLPISVKIKGFNPLIPGDVAASSIPLAILKYEVTNNTDAPLEVAICGTMRNFIGID 198
Query: 251 ---------GDSEFTGQHYN-SKTKMNDGVHAVLLHHRTSHQLPPV--TFALAAQETDGV 298
GD + G N ++ + + + + + + P T AL+ E + V
Sbjct: 199 GSKQSKNWKGDLVYLGAKNNQNEYRESPDLKGIFMSSQGVDNQDPAWGTIALSTAERELV 258
Query: 299 HVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTV 358
+ N + D W + G L+ + V P A++A T+
Sbjct: 259 SFRRSS----TPNDWENSILDFWDDFSADGV---LSDKDKLVDDNP----MASLAVKKTI 307
Query: 359 PPDSEGQVTFSLAWDCP 375
P S TF ++W P
Sbjct: 308 PARSSLTFTFYISWHFP 324
>gi|374373788|ref|ZP_09631448.1| hypothetical protein NiasoDRAFT_2604 [Niabella soli DSM 19437]
gi|373234761|gb|EHP54554.1| hypothetical protein NiasoDRAFT_2604 [Niabella soli DSM 19437]
Length = 897
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 86/176 (48%), Gaps = 16/176 (9%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
VL YR+ + D F K W + A ++ DR+ DGM E+ + W G
Sbjct: 504 VLGCYREHQMSADDGFLKRNWDKIKRATQFIINQDRNKDGM-EDTPLENTLDAIWD--GE 560
Query: 692 SAYSGGLWVAALQAASALAREVGDRG----SEDYFLFKFQKAKVVYEKLWNGSYFNY--- 744
A+ GL +AA++A A+A E+ D +DY + ++ + ++L+NG YF +
Sbjct: 561 IAWLVGLCIAAVKAGQAMAEEMNDTAFAAVCKDYVQ---KGSRHMSDQLFNGEYFIHRPD 617
Query: 745 ---DNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMG 797
G ++ DQ+ GQ +A GL ++D+ + SALK ++ YN +K MG
Sbjct: 618 PVKGRKGLGSYNTCHIDQVLGQSWAFQVGLGRVIDQQQTLSALKALWKYNFIKDMG 673
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 83/376 (22%), Positives = 136/376 (36%), Gaps = 69/376 (18%)
Query: 96 GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCED--------KPVLANQFSVFVSRSNGQKY 147
G+P+GG+ +G + S G W +F E K + + SR G
Sbjct: 69 GMPVGGINAGGMYLSGDGRLWLWDVFNLNQEGINPVDIPWKTSVQGELRKVRSRDGG--- 125
Query: 148 SSVLCPKTPEVL--------------KDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHE 193
S + PK E+ K+ T + DW D+ + A YP A T+
Sbjct: 126 SYIAPPKAKELRPLDQGFALKLEYEGKEQTRQ-LREEDW----DEVRFEATYPVA-TITY 179
Query: 194 GEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS 253
D ++ V + SP IP + S PVSVFT TI N+GK TL N +
Sbjct: 180 SSKDIPVQAVLQAYSPFIPLDTDRSGLPVSVFTITIKNAGKHPVKATLAGWLENKAALHT 239
Query: 254 EFTGQHYNSKTKMND----GVHAVLLHHRTSHQLPPVTF-----ALAAQETDGVHVSLCP 304
+ T + GV++ L +Q +F LAA + D V ++
Sbjct: 240 AAVNRLGRKNTTFKEEQITGVYSTLEKTAFGNQQADKSFDYGTLCLAALDRDAVAIA--- 296
Query: 305 HFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEG 364
D+ I F ++ +V + + G+ I +VT +
Sbjct: 297 --------------DI-EAIDNARLFTSTTKVDQAVKAAQQTLTGSVI-TTVTAAANESA 340
Query: 365 QVTFSLAWDCPEVNFMSGKTYYRR-----YTKFYGTHQNAAANIARDAILEHGSWELQIE 419
V++ ++W P + GK + Y + + A+ +A + G+ L +
Sbjct: 341 TVSYIISWHTPNLMIGDGKVLPEKDRGQYYDNQFKNAKEVASYVAANFKELSGTTLLWRD 400
Query: 420 AWQRPILEDKRLPEWY 435
W D LP W+
Sbjct: 401 TWY-----DATLPYWF 411
>gi|423226023|ref|ZP_17212489.1| hypothetical protein HMPREF1062_04675 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392630880|gb|EIY24861.1| hypothetical protein HMPREF1062_04675 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 899
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 92/397 (23%), Positives = 161/397 (40%), Gaps = 78/397 (19%)
Query: 96 GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKT 155
GVPLGG+G G+I + G+F R + K + FS++ + N ++ +V +
Sbjct: 64 GVPLGGIGVGNINFAPSGKFTRIGMNNIHTPIKRSEHSFFSLWTRKGNEKE--AVRLVRD 121
Query: 156 PEVLKDTTAAGIGSWDWNLKGDKST-YHALYPRAWTVHEGEPDPELRI--VCRQISPIIP 212
VL + +KG + T Y L+P T D L++ V R S ++P
Sbjct: 122 NHVL------------YGMKGVEHTQYKGLFP---TAELSYADNNLKVTPVIRAYSGLVP 166
Query: 213 HNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG--GDSEF-----TGQHYNS-KT 264
HN K+SS PV F + + + L F+W + +G D + GQ + +
Sbjct: 167 HNVKDSSLPVVWFEVDLI--AQEDMEAALAFSWEDFIGLFNDPKSLEGFDNGQLLSEGRA 224
Query: 265 KMNDG--------------------VHAVLLHHRTSHQLPPVTF-------ALAAQETDG 297
+N+G + ++ + S Q +TF +A +E
Sbjct: 225 NINNGENWPLREKAKTYVEPYQMGSLKGLIQYAADSLQPRKLTFQNYVNQVVIAVEEEK- 283
Query: 298 VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVT 357
+VS P + + + W + + +G F +S+ +V +E + A++ A T
Sbjct: 284 -NVSYLPAY--------RSNSEAWEQFRNNGEFT--SSLTKNVLTEQSQTSSASVLAVKT 332
Query: 358 -VPPDSEGQVTFSLAWDCPEVNFMSGK----TYY---RRYTKFYGTHQNAAANIARDAIL 409
+ + + F LAW PE+ + +Y+ Y K+Y + N ++ A+
Sbjct: 333 QLKAGQKKTIRFMLAWYAPELQIDAAALPIGSYWPCGADYNKYYHNYFNNMNSMVSYAVS 392
Query: 410 EHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYL 446
Q WQ P+LE LP+WY L N Y +
Sbjct: 393 NRARIARQTTEWQIPVLESS-LPDWYKFKLINSGYVI 428
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/157 (28%), Positives = 78/157 (49%), Gaps = 8/157 (5%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
++Q+ +D TGD ++ KA + AM ++ + R G G TYD ++ +
Sbjct: 522 IIQLVKDYEQTGDTEYLKAHLTGLKRAMKFL--YSRMPQGSTIPVG--PTTYDDFTHPPL 577
Query: 692 SAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA-KVVYEKLWNGSYFNY--DNSG 748
+Y G+W+ L+A A+ + +GD +F + K EKLWNG +F Y + G
Sbjct: 578 YSYYAGVWLTTLKAYEAIGKAIGDESIVKQAQQQFATSQKEALEKLWNGRFFAYGCEPDG 637
Query: 749 SSQSSSIQ-ADQLAGQWYARACGLLPIVDEDKARSAL 784
S + ++ QLAGQ+ +R CG + D ++++
Sbjct: 638 SKRLDNVLFTGQLAGQFLSRYCGWGDVYPMDIVKASM 674
>gi|116624646|ref|YP_826802.1| hypothetical protein Acid_5570 [Candidatus Solibacter usitatus
Ellin6076]
gi|116227808|gb|ABJ86517.1| protein of unknown function DUF608 [Candidatus Solibacter usitatus
Ellin6076]
Length = 844
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 52/192 (27%), Positives = 83/192 (43%), Gaps = 30/192 (15%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM---DQFDRDGDGMIENDGFPDQTYDTWSV 688
++ Y D +GD + + +WP V A+ + +D D DG++E G TYD
Sbjct: 480 IMHAYLDWSLSGDDAWLREMWPRVKKAIEFAWIPGGWDGDRDGVLE--GAQHNTYDV-EF 536
Query: 689 SGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKV-VYEKLWNGSYFNYDNS 747
G + G ++ AL+A +AR VGD S + F+ + + KL+NG Y+
Sbjct: 537 YGPNPMCGIYYLGALRAGEEMARAVGDTASAAEYRRLFESGRQWIDAKLFNGEYYIQQIR 596
Query: 748 GSSQSSSIQA-----------------------DQLAGQWYARACGLLPIVDEDKARSAL 784
G ++ +A DQL GQ+ A GL P+V + R L
Sbjct: 597 GVAKDKVAKAIIGTMGSDDTENPQYQVGAGCLVDQLVGQYMAEVAGLGPLVSPENIRKTL 656
Query: 785 KKVYNYNVLKVM 796
+ +Y YN + M
Sbjct: 657 QSIYRYNYKRSM 668
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 55/226 (24%), Positives = 99/226 (43%), Gaps = 22/226 (9%)
Query: 86 FIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ 145
F R LT+ PLGGV +GSI RG+ + W+IF R + +++V +
Sbjct: 36 FSGRKLTAI-AFPLGGVCAGSISLGGRGQLRDWEIFNRSDKGNAPSYAFPAIWVQSGARK 94
Query: 146 KYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCR 205
+ VL + + G S ++ +T+ +P A P +++
Sbjct: 95 PVARVLESRIQPPYEGQNGLGTRSAPGLVRLAGATFTGEFPLARIDFTDAALP-VKVSLE 153
Query: 206 QISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTK 265
+P IPH +S PV+V Y + N G +A +++ ++ + V G+ + T+
Sbjct: 154 AFTPFIPHEPDDSGLPVAVLRYRVSNPGAAAAKVSIAWSIDSPV-------GRTQAADTR 206
Query: 266 MND-----GVHAVLLHHRTSHQLPPV-----TFALAAQETDGVHVS 301
N+ G+ +++ T+ +LP +FALAA T G V+
Sbjct: 207 TNEYREAGGLAGLMM---TNAELPAADALKGSFALAAMGTAGARVT 249
>gi|405382777|ref|ZP_11036555.1| putative bile acid beta-glucosidase [Rhizobium sp. CF142]
gi|397320811|gb|EJJ25241.1| putative bile acid beta-glucosidase [Rhizobium sp. CF142]
Length = 863
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 65/274 (23%), Positives = 109/274 (39%), Gaps = 59/274 (21%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM------DQFDRDGDGMIENDGFPDQTYDT 685
+++ YRD GD + + WP+V A+ Y D++D D G++ G QT D
Sbjct: 475 IIKTYRDWKLCGDTDWLRRYWPNVKRAIEYAWSSENPDRWDPDQTGILS--GRQHQTLDM 532
Query: 686 WSVSGISAYSGGLWVAALQAASALAREVGDRG-SEDYFLFKFQKAKVVYEKLWNGSYF-- 742
+ G +++ ++VAAL S +A +GD SE A + +L+NG +F
Sbjct: 533 -ELFGPNSWLASMYVAALLGVSEMAAVLGDTELSEKTARMGKAGAAYINSELFNGRWFIQ 591
Query: 743 ---------------------------------NYDNSGSSQSSSIQADQLAGQWYARAC 769
+ +DQ+ GQW+A
Sbjct: 592 KVDLSDKGVLIPFDVGRAAGVLADGFMETYWSEEFQELKYQMGEGCISDQILGQWHAEVA 651
Query: 770 GLLPIVDEDKARSALKKVYNYNVLKVM------------GGKRGAVNGMLPDG-RVDM-S 815
G+ +D DK R+ALK V+ N + + G + P+G R M +
Sbjct: 652 GIGAFLDADKVRTALKSVHRNNFRPTLEDHFNPCRNYAYEDEAGLLIATYPEGIRQPMVA 711
Query: 816 SMQSREIWSGVTYAVAASMIHEDLADIGFQTACG 849
+ + E+W+G+ Y A+ +I L + G CG
Sbjct: 712 APYAEEVWTGIEYMAASHLIMHGLIEEGMDIVCG 745
>gi|253761387|ref|XP_002489099.1| hypothetical protein SORBIDRAFT_0073s002040 [Sorghum bicolor]
gi|241947398|gb|EES20543.1| hypothetical protein SORBIDRAFT_0073s002040 [Sorghum bicolor]
Length = 175
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 50/79 (63%), Gaps = 2/79 (2%)
Query: 27 DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
D P W+RKLS+ L +F + ++E +++ +G+R+ IREEA+ G++A IDPF
Sbjct: 31 DDGSPPEHAWRRKLSSHANRLKEFNVTFREAIRMMKLGLRLWSYIREEASHGRKAPIDPF 90
Query: 87 IKR--HLTSSHGVPLGGVG 103
+ ++S GVPLGG+G
Sbjct: 91 TREGNRPSASQGVPLGGMG 109
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 23/29 (79%), Positives = 27/29 (93%)
Query: 166 GIGSWDWNLKGDKSTYHALYPRAWTVHEG 194
GI SWDW LKGD+STYHAL+PRAWTV++G
Sbjct: 115 GISSWDWKLKGDRSTYHALFPRAWTVYDG 143
>gi|423217835|ref|ZP_17204331.1| hypothetical protein HMPREF1061_01104 [Bacteroides caccae
CL03T12C61]
gi|392627338|gb|EIY21373.1| hypothetical protein HMPREF1061_01104 [Bacteroides caccae
CL03T12C61]
Length = 851
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 57/244 (23%), Positives = 106/244 (43%), Gaps = 37/244 (15%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM---DQFDRDGDGMIENDGFPDQTYDTWSV 688
++++YR+ +GD F K W V ++Y +D + DG++E G T D +
Sbjct: 488 IMKIYREWQLSGDNDFLKNNWEQVKKVLSYAWIEKGWDGNQDGVME--GSQHNTMDV-NY 544
Query: 689 SGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK-AKVVYEKLWNGSYFNYDNS 747
G + G ++ AL+AA ++ + D+ F+K ++ + E L+NG Y+ + +
Sbjct: 545 FGPNPQMGFWYMGALKAAEKMSIAMKDKNFAKKCRTLFEKGSEWMDENLFNGEYYEHKIT 604
Query: 748 ------------------GSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
G DQL GQ+ A CGL + D+ ++ +K +
Sbjct: 605 DPKTFEFLDMNDPDVKIPGFQLGQGCLVDQLVGQYMAHLCGLGYLGDKKNIQTTMKSIMK 664
Query: 790 YNVLK------------VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
YN ++ VMG + G + P GR+++ E+ +G Y A M++E
Sbjct: 665 YNFVEDFSRHFNNMRSYVMGDEAGLLMASWPKGRLEVPFPYFSEVMTGFEYCAAVGMLYE 724
Query: 838 DLAD 841
+ +
Sbjct: 725 GMEE 728
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 75/373 (20%), Positives = 145/373 (38%), Gaps = 49/373 (13%)
Query: 60 LAPIGVRILCLIREEAAKGKRA---FIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQ 116
L IG + +I +A + K + + K HL +PLGG+G+G++ RGE +
Sbjct: 6 LIAIGALLSGIITVQAQEWKPQQWPVLKKYDKEHLYQI-ALPLGGIGTGTVSLGGRGELR 64
Query: 117 RWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDW---- 172
W+I V A ++S + +N ++ P+ + A + S ++
Sbjct: 65 DWEIM-------NVPAKKYSTVTTGNNAPFFAIYAKPQNQKATTTLLAGPLYSHEYLHYE 117
Query: 173 ----NLKG----DKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
N G +++++ A YP V + + +++ + +P IP + S PV+V
Sbjct: 118 GRPVNHHGLPRFEQASFEAAYPFG-QVSLSDSNLPVKVKVKGFNPFIPGDADASGLPVAV 176
Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGD-----SEFTGQHYNSKTKMN-------DGVHA 272
+Y + N+ ++++ + N +G D +++ G + + K N + +
Sbjct: 177 LSYEVTNTTDQPMNVSICGSIRNFIGKDGSKFRTDWKGDYIPTGAKNNKNKYVESNNIKG 236
Query: 273 VLLHHRTSHQLPPV--TFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSF 330
+ L+ + T AL Q +GV NS+ + W + G
Sbjct: 237 IYLYSDGVDKDDSAWGTVALTTQAVNGVTYRTSSKADNWNNSI----LNFWDDFSADG-- 290
Query: 331 DRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYT 390
M T + + A+++ + T+ P S TF + W P S Y+
Sbjct: 291 -----MLTERSKQEDEDPMASLSVNKTIAPQSTETFTFYITWSFPNRKAWSSVVVGNYYS 345
Query: 391 KFYGTHQNAAANI 403
Y AA I
Sbjct: 346 NQYPDAWKAAETI 358
>gi|153807893|ref|ZP_01960561.1| hypothetical protein BACCAC_02179 [Bacteroides caccae ATCC 43185]
gi|149129502|gb|EDM20716.1| hypothetical protein BACCAC_02179 [Bacteroides caccae ATCC 43185]
Length = 851
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 57/244 (23%), Positives = 106/244 (43%), Gaps = 37/244 (15%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM---DQFDRDGDGMIENDGFPDQTYDTWSV 688
++++YR+ +GD F K W V ++Y +D + DG++E G T D +
Sbjct: 488 IMKIYREWQLSGDNDFLKNNWEQVKKVLSYAWIEKGWDGNQDGVME--GSQHNTMDV-NY 544
Query: 689 SGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK-AKVVYEKLWNGSYFNYDNS 747
G + G ++ AL+AA ++ + D+ F+K ++ + E L+NG Y+ + +
Sbjct: 545 FGPNPQMGFWYMGALKAAEKMSIAMKDKNFAKKCRTLFEKGSEWMDENLFNGEYYEHKIT 604
Query: 748 ------------------GSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
G DQL GQ+ A CGL + D+ ++ +K +
Sbjct: 605 DPKTFEFLDMNDPDVKIPGFQLGQGCLVDQLVGQYMAHLCGLGYLGDKKNIQTTMKSIMK 664
Query: 790 YNVLK------------VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
YN ++ VMG + G + P GR+++ E+ +G Y A M++E
Sbjct: 665 YNFVEDFSRHFNNMRSYVMGDEAGLLMASWPKGRLEVPFPYFSEVMTGFEYCAAVGMLYE 724
Query: 838 DLAD 841
+ +
Sbjct: 725 GMEE 728
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 63/319 (19%), Positives = 128/319 (40%), Gaps = 46/319 (14%)
Query: 83 IDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRS 142
+ + K HL +PLGG+G+G++ RGE + W+I V A ++S + +
Sbjct: 32 LKKYDKEHLYQI-ALPLGGIGTGTVSLGGRGELRDWEIM-------NVPAKKYSTVTTGN 83
Query: 143 NGQKYSSVLCPKTPEVLKDTTAAGIGSWDW--------NLKG----DKSTYHALYPRAWT 190
N ++ P+ + A + S ++ N G +++++ A YP
Sbjct: 84 NAPFFAIYAKPQNQKATTTLLAGPLYSHEYLHYEGRPVNHHGLPRFEQASFEAAYPFG-Q 142
Query: 191 VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG 250
V + + +++ + +P IP + S PV+V +Y + N+ ++++ + N +G
Sbjct: 143 VSLSDSNLPVKVKVKGFNPFIPGDADASGLPVAVLSYEVTNTTDQPMNVSICGSIRNFIG 202
Query: 251 GD-----SEFTGQHYNSKTKMN-------DGVHAVLLHHRTSHQLPPV--TFALAAQETD 296
D +++ G + + K N + + + L+ + T AL Q +
Sbjct: 203 KDGSKFRTDWKGDYIPTGAKNNKNKYVESNNIKGIYLYSDGVDKDDSAWGTVALTTQAVN 262
Query: 297 GVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASV 356
GV NS+ + W + G M T + + A+++ +
Sbjct: 263 GVTYRTSSKADNWNNSI----LNFWDDFSADG-------MLTERSKQEDEDPMASLSVNK 311
Query: 357 TVPPDSEGQVTFSLAWDCP 375
T+ P S TF + W P
Sbjct: 312 TIAPQSTETFTFYITWSFP 330
>gi|333382634|ref|ZP_08474302.1| hypothetical protein HMPREF9455_02468 [Dysgonomonas gadei ATCC
BAA-286]
gi|332828576|gb|EGK01276.1| hypothetical protein HMPREF9455_02468 [Dysgonomonas gadei ATCC
BAA-286]
Length = 859
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 63/262 (24%), Positives = 108/262 (41%), Gaps = 53/262 (20%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
++++YR+ +GD + K ++P V +M Y + +D G IE + W G
Sbjct: 495 IMKIYREWRISGDNNWLKEIYPFVKASMDYCIKTWDSREVGAIEEPHHNTYDIEFWGADG 554
Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYF------- 742
+ + ALQA + +G+ S L K+K+ E KL++G YF
Sbjct: 555 MIT---SFYTGALQAMCLMGEYLGEDISRYEAL--MDKSKIYMETKLFDGEYFIQNIQWK 609
Query: 743 --NYD--------NSGSSQSS---------------SIQADQLAGQWYARACGLLPIVDE 777
N D NSG S + +D + G W + CG+ I+D+
Sbjct: 610 GLNADDPTKVQAFNSGYSDEALKILEAEGPKYQYGKGCLSDGILGSWMSLVCGMPEIIDK 669
Query: 778 DKARSALKKVYNYNVLK-------------VMGGKRGAVNGMLPD-GRVDMSSMQSREIW 823
K S L V+ YN+ +G + G + P G++ + + S E+W
Sbjct: 670 TKVTSHLNAVHKYNLKHDLREHANPQRPTFALGNEGGLLLCSWPKGGKLQLPFVYSDEVW 729
Query: 824 SGVTYAVAASMIHEDLADIGFQ 845
+G+ Y VAA ++ E + G +
Sbjct: 730 TGIEYQVAAHLMFEGEVEKGLE 751
>gi|336426603|ref|ZP_08606612.1| hypothetical protein HMPREF0994_02618 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336010619|gb|EGN40601.1| hypothetical protein HMPREF0994_02618 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 850
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 69/257 (26%), Positives = 112/257 (43%), Gaps = 48/257 (18%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM------DQFDRDGDGMIENDGFPDQTYDT 685
V+ YRD +GD+++ + W SV A+AY D +DR G++ DG T D
Sbjct: 452 VMLTYRDYKISGDREWLEGKWESVKKAIAYAWSEENPDAWDRKKQGVL--DGRQHHTLDM 509
Query: 686 WSVSGISAYSGGLWVAALQAASALAREVGDRGSE-DYFLFKFQKAKVVYEKLWNGSYF-- 742
+ G +A+ G +V AL+AA+ +A +GD +E +Y + ++L+NG YF
Sbjct: 510 -ELFGPNAWLQGFYVGALEAAAEMAAILGDGEAEAEYRALAAGGKAYLNKELFNGEYFYQ 568
Query: 743 -----------------NYDNSGSSQ-----SSSIQADQLAGQWYARACGLLPIVDEDKA 780
Y N Q + DQ+ QW+ CGL I +++K
Sbjct: 569 KIDVCDRSAVDRFNASKEYWNEEKGQIKYQIAEGCGIDQVLAQWHGNLCGLPEIFEKEKV 628
Query: 781 RSALKKVYNYNVLKVMGG------------KRGAVNGMLPDG--RVDMSSMQSREIWSGV 826
R AL ++ YN +G + GA+ P+G + + ++E G+
Sbjct: 629 RKALASIFRYNYKPALGDIYNPWRIFAIDEESGAIMCEWPEGTDKPAIPLTYAQECMDGM 688
Query: 827 TYAVAASMIHEDLADIG 843
Y AA MI E + + G
Sbjct: 689 QYQAAAHMIQEGMTEEG 705
>gi|160932020|ref|ZP_02079412.1| hypothetical protein CLOLEP_00853 [Clostridium leptum DSM 753]
gi|156869062|gb|EDO62434.1| hypothetical protein CLOLEP_00853 [Clostridium leptum DSM 753]
Length = 822
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/233 (27%), Positives = 112/233 (48%), Gaps = 23/233 (9%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
V+++YR+ +GD + + W V +M Y + +D DGD + D TYDT + G
Sbjct: 491 VVRLYREWRLSGDNRLIEDCWDGVVRSMEYAIRTWDTDGDFV--PDSAQHVTYDT-ELYG 547
Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK-AKVVYEKLWNGSYF-----NY 744
+++ L+ AAL AA+ +A +GD+ + +K AK++ EK WNG Y+ +
Sbjct: 548 MTSMVSTLFFAALIAAAEMAEYMGDQERARRYRDGAEKGAKLMDEKCWNGEYYIQLIDDV 607
Query: 745 DNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK---------- 794
D +DQL GQ+ A+ GL ++ ++ + A + ++ YN ++
Sbjct: 608 DKYRYQYGKGCLSDQLLGQFLAQEAGLGYVLPKNHVKKAAESIFRYNFIERASDFGHVQR 667
Query: 795 --VMGGKRGAVNGMLPDG-RVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
+ + G P G R + E+W+G+ Y VAA +I EDL + G
Sbjct: 668 AYIFNDEMGLTPCTWPLGERPRFPFIYFGEVWTGIEYEVAALLIREDLLEEGL 720
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 58/254 (22%), Positives = 97/254 (38%), Gaps = 28/254 (11%)
Query: 93 SSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLC 152
S+ LGG+G+G++ RGE + WQIF + + F++ G+ S +L
Sbjct: 26 SAAAFLLGGIGTGNVSVGARGELRSWQIFNEPGQFNFLPYTFFALRAEVEGGEAVSKILE 85
Query: 153 PKT------PEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQ 206
K PE G+ + +KST A YP E + P +++
Sbjct: 86 SKLNPPFTHPEGTLRCELGGLPRF------EKSTMSAEYPFVRVKLEDKQVP-VQVEMEA 138
Query: 207 ISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS-------EFTGQH 259
+P IP N +S P + Y + N +++ + N+VG + + G
Sbjct: 139 FTPFIPLNADDSGIPGAYLNYRVKNPTNQPVTVSIAGSLANAVGFEKYDIWNHMKLNGNP 198
Query: 260 YNSKTKMNDGVHAVLLHHRT--SHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTA 317
N + N G+ + + T LA E D V+ P + + G A
Sbjct: 199 VNEYQERN-GLKGLFFRGENVPEKHIANGTMCLATSEDD---VTAKPLWFQGEWTDG--A 252
Query: 318 KDMWHEIKEHGSFD 331
+D W + GS D
Sbjct: 253 QDFWDDFISDGSLD 266
>gi|160890984|ref|ZP_02071987.1| hypothetical protein BACUNI_03431 [Bacteroides uniformis ATCC 8492]
gi|156859205|gb|EDO52636.1| hypothetical protein BACUNI_03431 [Bacteroides uniformis ATCC 8492]
Length = 858
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 63/261 (24%), Positives = 104/261 (39%), Gaps = 51/261 (19%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
+++VYR+ GD + K ++P+V +M Y + +D G +E + W +G
Sbjct: 498 IMKVYREWRIMGDINWLKMMYPAVKKSMDYCIRTWDPREVGAVEEPHHNTYDIEFWGANG 557
Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF--NYDNSG 748
+ +V ALQA + + VGD S L KA + KL++G YF N +G
Sbjct: 558 MIT---SFYVGALQAICLMGKTVGDDISRYEELMNKSKA-YLESKLFDGEYFIQNIQWTG 613
Query: 749 SSQSSSIQA------------------------------DQLAGQWYARACGLLPIVDED 778
+ A D + G W CG+ +VD
Sbjct: 614 LDAPDPVDAQSFVTHYTPEALKILQEEGPKYQYGKGCLSDGILGSWMTLVCGMPEVVDRL 673
Query: 779 KARSALKKVYNYNVLKVMGGK---------RGAVNGML-----PDGRVDMSSMQSREIWS 824
K +S L V+ YN K + G G+L G++ + + S E+W+
Sbjct: 674 KVKSHLISVHKYNFKKSLSNHVNPQRSTFALGEDGGLLLCTWPKGGKLKLPFVYSNEVWT 733
Query: 825 GVTYAVAASMIHEDLADIGFQ 845
G+ Y VAA ++ E + G +
Sbjct: 734 GIEYQVAAHLMFEGEVEKGLE 754
>gi|270294306|ref|ZP_06200508.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270275773|gb|EFA21633.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 851
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 63/261 (24%), Positives = 104/261 (39%), Gaps = 51/261 (19%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
+++VYR+ GD + K ++P+V +M Y + +D G +E + W +G
Sbjct: 491 IMKVYREWRIMGDINWLKMMYPAVKKSMDYCIRTWDPREVGAVEEPHHNTYDIEFWGANG 550
Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF--NYDNSG 748
+ +V ALQA + + VGD S L KA + KL++G YF N +G
Sbjct: 551 MIT---SFYVGALQAICLMGKTVGDDISRYEELMNKSKA-YLESKLFDGEYFIQNIQWTG 606
Query: 749 SSQSSSIQA------------------------------DQLAGQWYARACGLLPIVDED 778
+ A D + G W CG+ +VD
Sbjct: 607 LDAPDPVDAQSFVTHYTPEALKILQEEGPKYQYGKGCLSDGILGSWMTLVCGMPEVVDRL 666
Query: 779 KARSALKKVYNYNVLKVMGGK---------RGAVNGML-----PDGRVDMSSMQSREIWS 824
K +S L V+ YN K + G G+L G++ + + S E+W+
Sbjct: 667 KVKSHLISVHKYNFKKSLSNHVNPQRSTFALGEDGGLLLCTWPKGGKLKLPFVYSNEVWT 726
Query: 825 GVTYAVAASMIHEDLADIGFQ 845
G+ Y VAA ++ E + G +
Sbjct: 727 GIEYQVAAHLMFEGEVEKGLE 747
>gi|317480931|ref|ZP_07940011.1| hypothetical protein HMPREF1007_03130 [Bacteroides sp. 4_1_36]
gi|316902824|gb|EFV24698.1| hypothetical protein HMPREF1007_03130 [Bacteroides sp. 4_1_36]
Length = 851
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 63/261 (24%), Positives = 104/261 (39%), Gaps = 51/261 (19%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
+++VYR+ GD + K ++P+V +M Y + +D G +E + W +G
Sbjct: 491 IMKVYREWRIMGDINWLKMMYPAVKKSMDYCIRTWDPREVGAVEEPHHNTYDIEFWGANG 550
Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF--NYDNSG 748
+ +V ALQA + + VGD S L KA + KL++G YF N +G
Sbjct: 551 MIT---SFYVGALQAICLMGKTVGDDISRYEELMNKSKA-YLESKLFDGEYFIQNIQWTG 606
Query: 749 SSQSSSIQA------------------------------DQLAGQWYARACGLLPIVDED 778
+ A D + G W CG+ +VD
Sbjct: 607 LDAPDPVDAQSFVTHYTPEALKILQEEGPKYQYGKGCLSDGILGSWMTLVCGMPEVVDRL 666
Query: 779 KARSALKKVYNYNVLKVMGGK---------RGAVNGML-----PDGRVDMSSMQSREIWS 824
K +S L V+ YN K + G G+L G++ + + S E+W+
Sbjct: 667 KVKSHLISVHKYNFKKSLSNHVNPQRSTFALGEDGGLLLCTWPKGGKLKLPFVYSNEVWT 726
Query: 825 GVTYAVAASMIHEDLADIGFQ 845
G+ Y VAA ++ E + G +
Sbjct: 727 GIEYQVAAHLMFEGEVEKGLE 747
>gi|284041275|ref|YP_003391205.1| hypothetical protein Slin_6449 [Spirosoma linguale DSM 74]
gi|283820568|gb|ADB42406.1| protein of unknown function DUF608 [Spirosoma linguale DSM 74]
Length = 857
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 62/252 (24%), Positives = 100/252 (39%), Gaps = 42/252 (16%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM---DQFDRDGDGMIENDGFPDQTYDTWSV 688
+++ YRD +GD F + WP V A+A+ +D + DG++E G T D
Sbjct: 484 IMKFYRDWQLSGDTDFLRRYWPQVKQALAFAWVPGGWDANRDGVME--GAQHNTMDV-EY 540
Query: 689 SGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNY--- 744
G + ++ AL+AASA+A + DR FQK + L+NG Y+
Sbjct: 541 FGPNPQMQLWYLGALKAASAMATAMNDRSFAQTCDQIFQKGSTWTDANLFNGDYYEQQVI 600
Query: 745 --------------------DNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSAL 784
++ + DQL GQ+ A CGL +V+ D ++ L
Sbjct: 601 SPAGHLIAKGTFSGFNKVEQNDPAYQLAKGCLVDQLVGQFMAHVCGLGYLVNPDHIKTTL 660
Query: 785 KKVYNYNVLKVM------------GGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAA 832
K + YN M G + + P GR + E+ +G Y A
Sbjct: 661 KSILKYNYRASMTDHFNNMRSYALGDEASLLMASWPKGRPKVPFPYFSEVMTGFEYTAAI 720
Query: 833 SMIHEDLADIGF 844
M++E+ + G
Sbjct: 721 GMLYENQTEPGL 732
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 75/347 (21%), Positives = 136/347 (39%), Gaps = 43/347 (12%)
Query: 85 PFIKR----HLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL-----ANQF 135
P +KR HLT +PLGG+G+G++ RG+ Q W+I R + + A F
Sbjct: 29 PVLKRYDQQHLTRI-ALPLGGIGTGTVSLGGRGQLQDWEIMNRPAKGYSTVLTGNTAPFF 87
Query: 136 SVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGE 195
++ V + +K ++L P + + + S + YP V+ +
Sbjct: 88 AIHVLET--KKTKALLGPLESHEYQHMEGRPVDHHGLPRFANAS-FETAYPFG-QVNLSD 143
Query: 196 PDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----- 250
+ + + +P++P + S P++V TY + N+G +++ + N +G
Sbjct: 144 ATMPVSVRIKGFNPLVPTDADASGLPIAVLTYEVTNTGNQPITVSVCGSMHNFIGRDGSK 203
Query: 251 ------GDSEFTGQHYN-SKTKMNDGVHAVLLHHRTSHQLPPV--TFALAAQETDGVHVS 301
G+ E G N ++ + +G+ + ++ + T AL E+ GV
Sbjct: 204 NHKSWKGELEPQGAKQNRNQFRQENGLTGIYMYSDGVDKADAAWGTVALTTAESQGVSYR 263
Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
N+L D W + + G +L T +P A++A T+ P+
Sbjct: 264 TSSVTNAWENAL----LDFWDDFSDDG---KLTEKTTPADEDP----MASLAVQKTIGPN 312
Query: 362 SEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAI 408
TF L W P F K R +Y T N A +A +
Sbjct: 313 QTQAFTFFLTWHFPN-RFGWSK---ERVGNYYTTQYNDAWAVADKTV 355
>gi|284997210|ref|YP_003418977.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
gi|284445105|gb|ADB86607.1| hypothetical protein LD85_0902 [Sulfolobus islandicus L.D.8.5]
Length = 498
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 75/164 (45%), Gaps = 23/164 (14%)
Query: 551 YSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPW 610
+ S +I++FP+++ S + + V + G VPHD+G
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYV-------------------RSDGYVPHDLGFHSLD 390
Query: 611 FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGD 670
++ +WKD+NP +L VYR T D F K V+P++ M + + RDG
Sbjct: 391 SPIDGTT--SPPKWKDMNPSLILLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQCRDGL 448
Query: 671 GMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
+E G D +D + G +Y+ L++A+L A +A+ VG
Sbjct: 449 PFME--GEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVG 490
>gi|326798039|ref|YP_004315858.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326548803|gb|ADZ77188.1| hypothetical protein Sph21_0609 [Sphingobacterium sp. 21]
Length = 877
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 61/266 (22%), Positives = 111/266 (41%), Gaps = 53/266 (19%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ-FDRDGDGMIENDGFPDQTYDTWSVSG 690
+++VYRD +GD ++ K ++P V ++ Y Q +D +G G++E + W +G
Sbjct: 517 IMKVYRDWRISGDDEWMKKLYPMVQKSLDYCIQTWDPNGKGILEEPHHNTYDIEFWGPNG 576
Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF-------- 742
+ ++ AL+A + R L+ +A + ++L+NG YF
Sbjct: 577 MCT---SFYLGALKALIEMGRYQRKNMKRYETLYDSGRA-YMEDQLYNGEYFIQKIQWTG 632
Query: 743 -NYDNSGSSQSSSIQ-----------------------ADQLAGQWYARACGLLPIVDED 778
+ +QS Q +D + G W A CGL +D++
Sbjct: 633 LKAPDPTKAQSFHTQYTPEAIEILKKEGPKYQYGTGCLSDGVIGAWMAAMCGLENPLDQE 692
Query: 779 KARSALKKVYNYNVLK-------------VMGGKRGAVNGMLPD-GRVDMSSMQSREIWS 824
K S L+ V+ YN+ MG + G + P G + + + S E+W+
Sbjct: 693 KVNSHLRAVHQYNLKNDLTDHINPQRPTYAMGHEGGLLLCTWPKGGMLSLPFVYSNEVWT 752
Query: 825 GVTYAVAASMIHEDLADIGFQ--TAC 848
G+ Y VA+ ++ D G + +AC
Sbjct: 753 GIEYQVASHLMQIGETDKGLEIISAC 778
>gi|402589209|gb|EJW83141.1| hypothetical protein WUBG_05945 [Wuchereria bancrofti]
Length = 90
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/88 (42%), Positives = 51/88 (57%), Gaps = 12/88 (13%)
Query: 745 DNSGSSQSSSIQADQLAGQW--YARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGA 802
+NSG+ ADQ W +RA LLP+ +SAL ++Y+ N+L GG+ GA
Sbjct: 3 ENSGT-------ADQTYDAWKLLSRARALLPL---SHIKSALNQIYSSNILNFAGGRLGA 52
Query: 803 VNGMLPDGRVDMSSMQSREIWSGVTYAV 830
VNGM DG V +Q+ E+W GVTYA+
Sbjct: 53 VNGMRKDGTVGRRHLQADEMWVGVTYAL 80
>gi|410638454|ref|ZP_11349015.1| hypothetical protein GLIP_3609 [Glaciecola lipolytica E3]
gi|410141863|dbj|GAC16220.1| hypothetical protein GLIP_3609 [Glaciecola lipolytica E3]
Length = 1124
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 62/287 (21%), Positives = 127/287 (44%), Gaps = 27/287 (9%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
+L YR+ + + + K+ WP V A+ + ++++D + G + N T D +++G
Sbjct: 726 ILNTYREHQCSANDSWLKSQWPRVKKAVEWGINEWDPNRIGFMSNT--QHNTLDG-NITG 782
Query: 691 ISAYSGGLWVAALQAASALAREVGD-RGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGS 749
S++ G L++ +L+AA+ +A VG+ + +++Y + K+ ++LWNG Y+ +
Sbjct: 783 CSSWIGSLYLTSLEAAARMADIVGEPQTAKEYRAIRESGKKLQNKRLWNGEYYIQEEGKE 842
Query: 750 SQSSSIQA---DQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKR------ 800
+ DQL G+W+A + ++A+ A+ + YN G+
Sbjct: 843 RFQDYLDGCHIDQLLGEWWADQLNIDRNYPRERAQQAMSALLKYNFRADFYGQSLKPRQY 902
Query: 801 -----GAVNGML-PDGRVDMSSMQ-SREIWSGVTYAVAASMIHEDLADIG---FQTACGI 850
G + + P G+ + M+ E+ +G Y A +M+ + D G +
Sbjct: 903 CEIEDGGMKMITWPRGQQPIPGMKYGDEVMTGFEYGAAVTMMQNGMIDEGLMLLKVVADR 962
Query: 851 YEAAWSGTGLGYAFQTPEAWNTDDQYRSLC---YMRPLAIWAMQWAL 894
Y+ G+ P ++ + C Y R L++W+ AL
Sbjct: 963 YDGRLRTEGVSNVVNGPWGYSGNPYGDDECGKYYGRSLSVWSALTAL 1009
Score = 49.3 bits (116), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 35/171 (20%), Positives = 69/171 (40%), Gaps = 14/171 (8%)
Query: 202 IVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYN 261
+ +P IP + + SS P +++ T+ N+ ++L + N++G G+ +
Sbjct: 409 VALEVFNPFIPMDLQNSSIPCAIYQVTVKNTSTDEVTTSVLSSQKNALGYTEGEKGEFGS 468
Query: 262 SKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMW 321
++ + + LLH + P L Q D +G + + +
Sbjct: 469 NQNTLLKKGNTTLLHMTRAGSHPKADMVLLTQAADA-----------TGTTNWTSPAEFK 517
Query: 322 HEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAW 372
E ++G N+ S S G ++ A++A +T+ P VTF L W
Sbjct: 518 REFTDNGGLTGPNN---SPISRAGQTVNGALSAPLTLAPGESKSVTFVLTW 565
>gi|156383694|ref|XP_001632968.1| predicted protein [Nematostella vectensis]
gi|156220031|gb|EDO40905.1| predicted protein [Nematostella vectensis]
Length = 1641
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 54/188 (28%), Positives = 87/188 (46%), Gaps = 19/188 (10%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGM---IEN--DGFPD 680
D+NP F+ QVY + TG+KK +WP++ M + +G + + N D
Sbjct: 676 DVNPAFLSQVYHLYMWTGNKKLLDDLWPAIKDVTHAMVTWTTNGTSLPLRMTNTYDILQF 735
Query: 681 QTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWN- 738
Q YD +S Y+ + + L A A+ + +R + +KA ++ LW+
Sbjct: 736 QRYD------VSMYNSVMHLLGLTVARAIGQVRKERKFVKDLTLRIRKATSAIDRLLWDE 789
Query: 739 --GSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYN----V 792
G Y + + +S++ AD L GQ +A GL ++DE+K S L+K N
Sbjct: 790 KRGYYRAWWDHAQPSTSALMADSLYGQVWAYTLGLGNLLDENKMASHLRKEAELNDTPFG 849
Query: 793 LKVMGGKR 800
LKVM G R
Sbjct: 850 LKVMSGLR 857
>gi|229820204|ref|YP_002881730.1| hypothetical protein Bcav_1712 [Beutenbergia cavernae DSM 12333]
gi|229566117|gb|ACQ79968.1| protein of unknown function DUF608 [Beutenbergia cavernae DSM
12333]
Length = 865
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 67/256 (26%), Positives = 111/256 (43%), Gaps = 40/256 (15%)
Query: 632 VLQVYRDVV--ATGDKKFAKAVWPSVYVAMAYM-DQFDRDGDGMIENDGFPDQTYDTWSV 688
+L+ YR+V A G A A WP + + + ++ ++D G G++ G T+D +
Sbjct: 509 LLKTYREVRSGAIGLGHLA-ASWPRLMLLLDHVRGRWDPSGSGVLR--GVQPSTHDI-DL 564
Query: 689 SGISAYSGGLWVAALQAASALA-REVGDRGSEDYFLFK-------------FQKAKVVYE 734
G++ + G LW+AAL+A +A R D D F++ Y+
Sbjct: 565 RGVNTFMGTLWLAALRACEEMALRLAADPHHADALGGATLGRELAAELHALFERGSAAYD 624
Query: 735 KL-WNGSYFNY----DNSGSSQ-SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVY 788
++ + G Y+ D Q +DQL GQW+A L ++ RSAL+ V
Sbjct: 625 EICFTGEYYRQVLEPDEPTQFQWGDGCLSDQLIGQWWAHLLDLGHVLPAAHVRSALRAVV 684
Query: 789 NYNVLKVMGG--------KRGAVNGML-----PDGRVDMSSMQSREIWSGVTYAVAASMI 835
+N+ + G G G+L GR ++++ E+WSGV Y VAA +
Sbjct: 685 GHNLRRGFDGWEHTQRVYADGDDVGLLMCTWPHGGRPEVATRYCDEVWSGVEYEVAAHCL 744
Query: 836 HEDLADIGFQTACGIY 851
E L G G++
Sbjct: 745 LEGLTAEGRAILDGLW 760
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 86/353 (24%), Positives = 133/353 (37%), Gaps = 58/353 (16%)
Query: 97 VPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSR-----SNGQKYSSVL 151
+PLGG+G+G++ G ++WQ+ + + F+V VS+ + +
Sbjct: 17 MPLGGIGTGNVAIGADGGLRQWQLHNIGNHHGDLPGSLFAVRVSQWEPPLDEVRVLQAPA 76
Query: 152 CP----KTPEVLKDTTAAGIGSWDWNLKG----DKSTYHALYPRAWTVHEGEPDPELRIV 203
P +TP V D + W L +T A YP A ++ P L I
Sbjct: 77 APAGNTRTPMVDDDH----VPDWQRELAAAHGVTATTISATYPVARVAYDLGEVP-LAIE 131
Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGD-------SEFT 256
+P++P + SS PV++FT+T+ N+G L N+VG D + F
Sbjct: 132 LEATTPLVPLDEAASSLPVAMFTFTLRNAGTFPVHGYLAGALQNAVGYDGVTPIDGTRFR 191
Query: 257 GQHYNSKTKMNDGVHAVLLHHRTSHQLPPV-----TFALAAQETDGVHVSLCPHFVISGN 311
G N+ L+ T+ LPP + ALA D S P + +
Sbjct: 192 GYGGNTNRVRRRASWTDLV--MTNDSLPPEHPGAGSMALAC---DDPRASALPAYSGTEE 246
Query: 312 SLG-LTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIG---------AAIAASVTVPPD 361
L L ++ M + F S PG++ G A+A+ V P
Sbjct: 247 LLAFLRSRSMGGDTAR---FTGAPWAADSHAEGPGAARGPSPAAETWLGALASRFHVEPG 303
Query: 362 SEGQVTFSLAWDCPE--VNFMS--------GKTYYRRYTKFYGTHQNAAANIA 404
V F +AW P VNF G T + + H +AAA IA
Sbjct: 304 ESTTVRFLMAWHFPNRYVNFTQFGPERPEWGPTRFWIGNHYTLAHDDAAAVIA 356
>gi|406663141|ref|ZP_11071211.1| putative bile acid beta-glucosidase [Cecembia lonarensis LW9]
gi|405552803|gb|EKB48138.1| putative bile acid beta-glucosidase [Cecembia lonarensis LW9]
Length = 937
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 74/352 (21%), Positives = 143/352 (40%), Gaps = 24/352 (6%)
Query: 96 GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLA--NQFSVFVSRSNGQKYSSVLCP 153
G+P+GG+ +G++ G+ W IF R E + N V +G Y + P
Sbjct: 108 GMPVGGLCTGTVYIGGDGKLWLWDIFNRQKEGVKSVTHDNWHGQRVRPRDGANYIFPMSP 167
Query: 154 KTPEV----LKDTTAAGIGSWDWNLKG-DKSTYHALYPRAWTVHEGEPDPELRIVCRQIS 208
+ P L+ + I + KG + ++ A YP A + P + + + S
Sbjct: 168 EYPFEQGFGLRISQNNKIWERSLDFKGFEDISFKAQYPIAEISYRDSQMP-IEVDLQAFS 226
Query: 209 PIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMND 268
P IP + K SS P + +TI N+ K L N++ ++ + S + +
Sbjct: 227 PFIPLDVKNSSNPAAFMHFTITNTSKEEVKAELSGWLENAILRFTKDAEETELSNSIEKE 286
Query: 269 GVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHG 328
+ VL S + + + + D +SL +I +S + + ++ +K +
Sbjct: 287 SGNTVLTCSARSKNIHNFSTTIVQNQRDFGTMSL---MLIGNDSSTVGSAEI--PVKSNL 341
Query: 329 SFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-----MSGK 383
F N+ ET+ T+E G + ++ +V + P ++TF ++W P + +G+
Sbjct: 342 LFPE-NTTETA-TAEFGEQLCGGVSKTVRLNPGESQKITFIISWHFPNIELPQNDRSTGE 399
Query: 384 TYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWY 435
R YT + + A +A + + W+ E+ LP W+
Sbjct: 400 NKGRHYTNHFDDSREVAMELAAN----YQKLSYLTNLWREAFYENSSLPHWF 447
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 72/259 (27%), Positives = 111/259 (42%), Gaps = 46/259 (17%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRD----GDGMIENDGFPDQTYDTWS 687
VL+ YRD + D F K W ++ +A+ Y+ D++ +GMI G T D
Sbjct: 540 VLRAYRDYQMSEDDTFLKNNWDNIKLALKYLIDLDKEYGETANGMIY--GEQHNTLDAEW 597
Query: 688 VSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF----- 742
I A + L++AAL AA +A+ GD SE +L + + E L+NG YF
Sbjct: 598 FGYIPAITS-LYLAALAAAVEMAKATGDISSEKEYLSILESGRKNIESLFNGEYFIQEED 656
Query: 743 --NYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMG--- 797
+ D G + I DQ+ GQ +A L + ++ +S+L ++ YN + MG
Sbjct: 657 PDHLDAIGIGKGCYI--DQVFGQGWAFQLNLGRLYNKKMIQSSLDSLWKYNFVPDMGPLR 714
Query: 798 --------GKRGAVNGMLPDGRVDMSSMQ---SREIW-------------SGVTYAVAAS 833
G+ A++G D + M + RE W +G Y A+
Sbjct: 715 DSLPPQIAGRPYAIHG---DAGLVMCTWPLGGKREDWQRHWQFGYFNECMTGFEYQAASH 771
Query: 834 MIHEDLADIGFQTACGIYE 852
MI ED D G I+E
Sbjct: 772 MIWEDKLDEGLSIIKAIHE 790
>gi|402583425|gb|EJW77369.1| hypothetical protein WUBG_11725, partial [Wuchereria bancrofti]
Length = 109
Score = 58.5 bits (140), Expect = 1e-05, Method: Composition-based stats.
Identities = 28/60 (46%), Positives = 39/60 (65%), Gaps = 3/60 (5%)
Query: 771 LLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAV 830
LLP+ +SAL ++Y+ N+L GG+ GAVNGM DG V +Q+ E+W GVTYA+
Sbjct: 1 LLPL---SHIKSALNQIYSSNILNFAGGRLGAVNGMRKDGTVGRRHLQADEMWVGVTYAL 57
>gi|429739681|ref|ZP_19273428.1| hypothetical protein HMPREF9151_01886 [Prevotella saccharolytica
F0055]
gi|429156127|gb|EKX98766.1| hypothetical protein HMPREF9151_01886 [Prevotella saccharolytica
F0055]
Length = 989
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 56/262 (21%), Positives = 107/262 (40%), Gaps = 52/262 (19%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
+++ YRD +GD ++ K ++P++ ++ Y + +D G IE + W G
Sbjct: 633 IMKAYRDWRISGDTEWLKKLYPAIKASLDYCIRTWDPRQKGCIEEPHHNTYDIEFWGADG 692
Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF-------- 742
+ +V AL+A ++ + R S Y + +++ +L+NG Y+
Sbjct: 693 MCT---SFYVGALEAFIRMSDAL-HRPSTPYRKLLKKSVELMNTQLFNGEYYIQKTQWTG 748
Query: 743 --------------NYDNSGSSQ-----------SSSIQADQLAGQWYARACGLLPIVDE 777
NY N + Q + +D + G W A CGL +++
Sbjct: 749 LRAKSPTEVLSFGSNYSNPEALQLLQAEGPKYQYGTGCLSDGVIGMWMASVCGLPEPLNQ 808
Query: 778 DKARSALKKVYNYNVLK-------------VMGGKRGAVNGMLPDGRV-DMSSMQSREIW 823
+ S L VY YNV + +G + G + P G+ + + S E+W
Sbjct: 809 AQTLSHLNAVYKYNVREDLYRHYNTQRPSYALGHEGGLLLCTWPKGKKPTLPFVYSNEVW 868
Query: 824 SGVTYAVAASMIHEDLADIGFQ 845
+G+ Y VA+ ++ + D G +
Sbjct: 869 TGIEYQVASHLLMKGEIDKGLK 890
>gi|282877348|ref|ZP_06286172.1| conserved hypothetical protein [Prevotella buccalis ATCC 35310]
gi|281300533|gb|EFA92878.1| conserved hypothetical protein [Prevotella buccalis ATCC 35310]
Length = 997
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 53/262 (20%), Positives = 107/262 (40%), Gaps = 52/262 (19%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
+++VYRD +G+ ++ K+++P+V ++ Y + +D G + + W G
Sbjct: 638 IMKVYRDWRISGNDEWLKSLYPAVKASLDYCIRTWDPHEKGSLTEPHHNTYDIEFWGADG 697
Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF-------- 742
+ + AL+A ++ + + + L K + + + L+NG Y+
Sbjct: 698 MCT---SFYTGALEAFVQMSEHLNKQAGKYKKLLK-RSISCMDKDLFNGEYYFQKTQWTG 753
Query: 743 --------------NYDNSGSSQ-----------SSSIQADQLAGQWYARACGLLPIVDE 777
NY ++ + + + +D + G W A CGL +++
Sbjct: 754 LEAKSPTEIPSFGSNYRHAEAQELLKTEGPKYQYDTGCLSDGILGMWMASVCGLPEPLNQ 813
Query: 778 DKARSALKKVYNYNVLK-------------VMGGKRGAVNGMLPDG-RVDMSSMQSREIW 823
+S L VY YN+ K MGG G + P G + + + S E+W
Sbjct: 814 THVKSHLNAVYQYNLRKNMYNHYNTQRPSYAMGGDGGLLLCTWPKGAKPTLPFVYSNEVW 873
Query: 824 SGVTYAVAASMIHEDLADIGFQ 845
+G+ Y VA+ +I + + G +
Sbjct: 874 TGIEYQVASHLIMKGEVEKGLE 895
>gi|332981217|ref|YP_004462658.1| hypothetical protein Mahau_0633 [Mahella australiensis 50-1 BON]
gi|332698895|gb|AEE95836.1| protein of unknown function DUF608 [Mahella australiensis 50-1 BON]
Length = 880
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 66/276 (23%), Positives = 114/276 (41%), Gaps = 58/276 (21%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM------DQFDRDGDGMIENDGFPDQTYDT 685
V++ YR+ +GD ++ K++WP V ++ Y D++D + G++ G T D
Sbjct: 469 VIKTYREWKISGDDEWLKSLWPRVKKSIEYAWVDTNEDRWDPEQKGVLT--GRQHHTLDM 526
Query: 686 WSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKV-VYEKLWNGSY--- 741
+ G +++ G ++AAL+A + +A VGD + + F++ K L+NG Y
Sbjct: 527 -ELFGPNSWLTGFYLAALKAGAEMASYVGDEQAAQRYTDIFRRGKAWADSNLFNGEYYYH 585
Query: 742 ------------FNYDNSGS-SQSSSIQA-------------------DQLAGQWYARAC 769
F+ D +G+ S+I+A DQ+ QW+A
Sbjct: 586 LIDLQDKSILERFDTDKTGTLIGDSTIKAYWDEERGQIKYQVGEGCIIDQVLAQWHANLI 645
Query: 770 GLLPIVDEDKARSALKKVYNYNVLKV-----------MGGKRGAVNGMLPDGRVD--MSS 816
GL I D R A+ VY N K+ + + G + P GR +S
Sbjct: 646 GLGEIFDPGNTRKAVYAVYKNNFKKMREVFNPCRVFSLNDESGMIICSYPAGRKKPAISV 705
Query: 817 MQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYE 852
++E +G Y A +I + G + I E
Sbjct: 706 PYAQETMNGFEYQAAEHLIQKGFIREGEEVVRAIRE 741
>gi|156383620|ref|XP_001632931.1| predicted protein [Nematostella vectensis]
gi|156219994|gb|EDO40868.1| predicted protein [Nematostella vectensis]
Length = 1090
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 57/233 (24%), Positives = 101/233 (43%), Gaps = 23/233 (9%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMA-YMDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
VL+ YR+V K + +WP V M +M D +G G+I G TYD ++ G
Sbjct: 640 VLKTYREVRQGAPKTWFNKMWPQVKRIMKRWMTDLD-NGTGVIP--GPQPNTYDC-TLYG 695
Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYF----NYD 745
++ Y G ++A L+AA +A+ + + +F+ +K L+NGS++ + +
Sbjct: 696 VNVYISGYYLAGLRAAEKMAKIQNEPELAATYHQRFELGSAKIDKTLFNGSWYIQEVDSE 755
Query: 746 NSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNG 805
+ + + D L GQW+A A GL I+ + +S L+ ++ N ++ A
Sbjct: 756 HPVNVVGDATWVDCLVGQWWAHALGLGYILSQKNIQSTLQNIFLKNHVERFVPATQAPRQ 815
Query: 806 MLPDGRVDMSSM-------------QSREIWSGVTYAVAASMIHEDLADIGFQ 845
+ + S W+G+ Y A +E L DIG +
Sbjct: 816 FFDERDAGLRICVYPNDLPKSPLLYHSEGAWTGLEYEYAELCFYEGLNDIGLK 868
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 44/175 (25%), Positives = 74/175 (42%), Gaps = 27/175 (15%)
Query: 96 GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK- 154
+PLG +G GS+ + G ++WQI V V + F++ V ++N K ++
Sbjct: 162 ALPLGALGGGSVALAGDGGLRQWQISGAVNHTAHVPNSFFAIRVEQTNSVKAVALQSNTW 221
Query: 155 ----------------TPEVLKDTTAAGIGSWDWNLKGDKS-TYHALYPRAWTVHEGEPD 197
PE KD + G K+ T A YP A + +
Sbjct: 222 YDQTGFKPAAYISDHVVPEASKDLLKT--------IPGVKTITVGAKYPIAEVSYLSDEV 273
Query: 198 PELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGD 252
P +++ SP IP + K S PV +F +++ N+ + A ++LL + N G D
Sbjct: 274 P-VQVHLEAFSPCIPLDSKNSGIPVIMFNFSVTNTTSSDAQVSLLGSLQNIAGWD 327
>gi|340616465|ref|YP_004734918.1| hypothetical protein zobellia_454 [Zobellia galactanivorans]
gi|339731262|emb|CAZ94527.1| Conserved hypothetical protein [Zobellia galactanivorans]
Length = 923
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 62/274 (22%), Positives = 110/274 (40%), Gaps = 61/274 (22%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGM-IENDGFPDQTYD 684
D +VL+ YR+ + + + +F VW V +A+ Y +DG + +E +G +
Sbjct: 495 DAQSGYVLKAYREHLNSPNNEFLDTVWEKVKMAIGY--HIFKDGAEIGLEPNGVLEGEQT 552
Query: 685 TWSVS--GISAYSGGLWVAALQAASALAREVG--DRGSEDYFLFKFQKAKVVYEKLWNGS 740
W G + Y+ L++AAL+AA +A+ G + S + LF+ K + E++WNG
Sbjct: 553 FWDPMWYGPNPYNNTLYLAALRAAEEMAKVKGEPELASRYHKLFE-SGQKFMNEEMWNGE 611
Query: 741 Y-----------------------------------FNYDNSGSSQSSSIQADQLAGQWY 765
+ FN SG ++ A QL GQ +
Sbjct: 612 FYIHLYPVGLNGNVGIKNGFKLPSEIDDNAQSYIKAFNEGRSGYFPGTACDAQQLFGQNW 671
Query: 766 ARACGLLPIVDEDKARSALKKVYNYN----VLKVMGGKRGAVNGMLPDGRVDM------- 814
A GL I+ +K+ A K ++ YN + + ++ + G M
Sbjct: 672 ANQLGLGYILSPEKSEIAAKSIFKYNWTPDIATIYDFEKPNARVLAAKGEAAMINGGWTK 731
Query: 815 -------SSMQSREIWSGVTYAVAASMIHEDLAD 841
++ +W+G+ Y A MI+E L +
Sbjct: 732 EKPKSFENTHDKTNVWTGLEYEAACDMINEGLLE 765
>gi|410867532|ref|YP_006982143.1| hypothetical protein PACID_30230 [Propionibacterium acidipropionici
ATCC 4875]
gi|410824173|gb|AFV90788.1| hypothetical protein PACID_30230 [Propionibacterium acidipropionici
ATCC 4875]
Length = 845
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 88/361 (24%), Positives = 151/361 (41%), Gaps = 54/361 (14%)
Query: 526 EENIGQFLYLEGI---EYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK 582
++ G+F EG+ MW T DV + ++ ++A A PS
Sbjct: 396 RDHEGRFFGFEGVLGASTAMW-TGDVGYSCPLNCTHVW---------NYAQAASALWPSL 445
Query: 583 MKLLDDGQW-VSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKF------VLQV 635
+ D ++ V + G++PH + I W E L+D +P VL+
Sbjct: 446 EASMRDTEFEVMQAPDGSLPHRV-IAPVWLEQ----LWDVEIGGPDHPALDGMLGAVLKT 500
Query: 636 YRDVV--ATGDKKFAKAVWPSVYVAMAYMD-QFDRDGDGMIENDGFPDQTYDTWSVSGIS 692
YR+V A G + + W ++ M ++ ++D + G++ G T+D S+ G +
Sbjct: 501 YREVRRGAVG-AGWLRDRWNNLCRLMDHVAARWDPECSGLLT--GEQPSTHDI-SLYGAN 556
Query: 693 AYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYF-----NYDN 746
+ G LW+AAL+A+ +A +G + +F + Y+ L+ G Y+
Sbjct: 557 MFMGSLWLAALRASEEMALILGATDRAMGWRERFVLSSAAYDTALFRGGYYVQAADETHG 616
Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNV-----------LKV 795
ADQL GQW+A L ++ + RSAL+ V +N+
Sbjct: 617 DDDEFGDGCLADQLIGQWWAHLLDLGHLLPAEHVRSALQAVVAHNLNPADETRHSQRAYA 676
Query: 796 MGGKRGAVNGMLP-DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
+ G++G V P GR ++ E+W+G Y VAA + E L G + I +A
Sbjct: 677 VRGEQGLVMCSWPRGGRPARATRYCDEVWTGCEYEVAALCLAEGLEKDGHR----ILDAI 732
Query: 855 W 855
W
Sbjct: 733 W 733
>gi|312086024|ref|XP_003144914.1| hypothetical protein LOAG_09338 [Loa loa]
gi|307759923|gb|EFO19157.1| hypothetical protein LOAG_09338, partial [Loa loa]
Length = 84
Score = 57.0 bits (136), Expect = 5e-05, Method: Composition-based stats.
Identities = 28/76 (36%), Positives = 41/76 (53%)
Query: 670 DGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA 729
DGMIEN G DQTYDT ++G+S Y G LW+ L A + A E+ D S + + +A
Sbjct: 9 DGMIENSGTADQTYDTLKMNGVSTYCGSLWLTLLHAEVSFASEIDDIRSGNLYTGFLLRA 68
Query: 730 KVVYEKLWNGSYFNYD 745
++E + Y +
Sbjct: 69 GDIFESEYYNIYLQME 84
>gi|312373134|gb|EFR20945.1| hypothetical protein AND_18252 [Anopheles darlingi]
Length = 734
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 40/68 (58%)
Query: 77 KGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFS 136
+G++ +D + +G PLGG+G+G+IGR + GEF R+Q+ P + E V ANQF
Sbjct: 655 EGRQVLMDYLYTENAKQIYGAPLGGIGAGTIGRGFAGEFCRYQLRPGLYEYNTVHANQFI 714
Query: 137 VFVSRSNG 144
V + G
Sbjct: 715 VTIKDETG 722
>gi|340616467|ref|YP_004734920.1| hypothetical protein zobellia_456 [Zobellia galactanivorans]
gi|339731264|emb|CAZ94529.1| Conserved hypothetical protein [Zobellia galactanivorans]
Length = 822
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 53/192 (27%), Positives = 87/192 (45%), Gaps = 27/192 (14%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDT 685
D + F+L VY + GD++F +WP V A + Q +R G PD+ T
Sbjct: 476 DCSTSFILSVYASHIWDGDEEFVNEMWPHVKRAAQW--QINRS-----RKFGLPDRLAAT 528
Query: 686 WSVSG-----ISAYSGGLWVAALQAASALAREVGDRGSEDYF-LFKFQKA---KVVYEKL 736
+ +SG + +Y+ + +AA++AA+ LA+E R ED L+K A +
Sbjct: 529 YDLSGFGKKDLVSYNAFMHLAAMKAATELAKE---REKEDLVSLYKKNIAAAQNTLKAHF 585
Query: 737 WNGSYF-NYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALK-------KVY 788
W G+YF N+ N ++ I D GQ ++ GL +++ + +S L+ Y
Sbjct: 586 WTGTYFRNWWNKDEPKNDDIHIDTQFGQVWSYLLGLGDVIEPELMKSHLEAEVVAADSPY 645
Query: 789 NYNVLKVMGGKR 800
VL G R
Sbjct: 646 GMKVLAYAGNDR 657
>gi|338212680|ref|YP_004656735.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336306501|gb|AEI49603.1| hypothetical protein Runsl_3226 [Runella slithyformis DSM 19594]
Length = 855
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 59/258 (22%), Positives = 99/258 (38%), Gaps = 42/258 (16%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM---DQFDRDGDGMIENDGFPDQTYDTWSV 688
+++ YRD +G+ F + WP + A+A+ D +D + DG++ DG T D
Sbjct: 484 IMKFYRDWQLSGNTAFLQKNWPKIKRALAFAWVPDGWDGNLDGVM--DGAQHNTMDV-EY 540
Query: 689 SGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKV-VYEKLWNGSYFNYD-- 745
G + ++ AL+AA +A + D FQ ++L+NG Y+
Sbjct: 541 FGPNPQMQIWYLGALKAAEKMAEALHDAAFAQQCKTLFQSGSTWADQQLFNGEYYEQQVM 600
Query: 746 ---------------NSGSSQSSSIQ------ADQLAGQWYARACGLLPIVDEDKARSAL 784
NS Q DQL GQ+ A CGL +V +++L
Sbjct: 601 SPVGRKFAKGTVSGWNSTQGDFPPFQLAKGCLVDQLVGQFMAHVCGLGYLVKPQNVQTSL 660
Query: 785 KKVYNYNVLKV------------MGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAA 832
K + YN + + + P GR + E+ +G Y A
Sbjct: 661 KSILKYNYRATLTEHFNNMRSYALADEPALLMASWPKGRPTVPFPYFAEVMTGFEYTAAI 720
Query: 833 SMIHEDLADIGFQTACGI 850
M++E++ D G + I
Sbjct: 721 GMLYENMTDDGLKCIQNI 738
>gi|361067543|gb|AEW08083.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
gi|383162873|gb|AFG64132.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
gi|383162874|gb|AFG64133.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
gi|383162875|gb|AFG64134.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
gi|383162876|gb|AFG64135.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
gi|383162877|gb|AFG64136.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
gi|383162878|gb|AFG64137.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
gi|383162879|gb|AFG64138.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
gi|383162880|gb|AFG64139.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
gi|383162881|gb|AFG64140.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
gi|383162882|gb|AFG64141.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
Length = 79
Score = 55.1 bits (131), Expect = 2e-04, Method: Composition-based stats.
Identities = 30/62 (48%), Positives = 35/62 (56%), Gaps = 1/62 (1%)
Query: 875 QYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEE 934
YRSL YMRPLAIWAMQWAL PK LE + E ++HAG + VA K +
Sbjct: 2 HYRSLIYMRPLAIWAMQWALNPPK-AVLEAPKINLMEREHPSQFHAGLAAVAEAFKHVHQ 60
Query: 935 QG 936
G
Sbjct: 61 NG 62
>gi|156376779|ref|XP_001630536.1| predicted protein [Nematostella vectensis]
gi|156217559|gb|EDO38473.1| predicted protein [Nematostella vectensis]
Length = 666
Score = 54.3 bits (129), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 48/175 (27%), Positives = 79/175 (45%), Gaps = 15/175 (8%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDT 685
D + FV++VY TGD + +WP+V A+ ++ G G+ P + T
Sbjct: 277 DSSALFVVEVYHYFKWTGDSALLRRLWPNVKRAVTWLIGESTKGTGL------PFRKVST 330
Query: 686 WSVSGISA-----YSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA-KVVYEKLWNG 739
+ + GI Y G+++ AL+AA LA + D+ +A ++V LWN
Sbjct: 331 YDLVGIDKYDHVMYEAGVFLLALRAAQHLAVHLKDKPFNATTTHALIRATQLVDVTLWNE 390
Query: 740 S---YFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYN 791
Y + +S S I +D L GQ +A GL ++ E+K + L+ YN
Sbjct: 391 ERQFYRAWWDSEYGASDWIMSDSLYGQVWANTLGLGKLLPEEKMKVHLQNEIKYN 445
>gi|270294313|ref|ZP_06200515.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|317480939|ref|ZP_07940019.1| hypothetical protein HMPREF1007_03138 [Bacteroides sp. 4_1_36]
gi|270275780|gb|EFA21640.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|316902832|gb|EFV24706.1| hypothetical protein HMPREF1007_03138 [Bacteroides sp. 4_1_36]
Length = 853
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/261 (21%), Positives = 105/261 (40%), Gaps = 53/261 (20%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
+++VYRD G+ ++ K ++ V ++ Y ++ +D G+IE + W SG
Sbjct: 491 IMKVYRDWHIYGNDEWLKLIYSYVQNSLDYCINTWDPKRKGVIEEPHHNTYDIEFWGPSG 550
Query: 691 -ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF------- 742
I++Y + ALQA A+ + ++ +Y + + +L++G YF
Sbjct: 551 MINSY----YTGALQAFVAMGEHL-EKDMTEYRELLDKSIDYMENQLYDGEYFIQNIRWK 605
Query: 743 ---------------NYDNSG----------SSQSSSIQADQLAGQWYARACGLLPIVDE 777
NY G +D + G W + CGL +D
Sbjct: 606 ELQASDPTKVQSVNSNYSKEGLDLLEKEGPKYQYGKGCLSDGVVGAWLSLVCGLDEAIDR 665
Query: 778 DKARSALKKVYNYNVLK-------------VMGGKRGAVNGMLPD-GRVDMSSMQSREIW 823
K S L V+ YN+ + +G + G + P G++ + + S E+W
Sbjct: 666 KKILSHLLSVHKYNLKRNLRKHVNPQRSTFALGDEGGLLLCSWPKGGKLQLPFVYSNEVW 725
Query: 824 SGVTYAVAASMIHEDLADIGF 844
+G+ Y VA+ ++ E + G
Sbjct: 726 TGIEYQVASHLMFEGEVEKGL 746
>gi|160890992|ref|ZP_02071995.1| hypothetical protein BACUNI_03439 [Bacteroides uniformis ATCC 8492]
gi|156859213|gb|EDO52644.1| hypothetical protein BACUNI_03439 [Bacteroides uniformis ATCC 8492]
Length = 831
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 56/261 (21%), Positives = 105/261 (40%), Gaps = 53/261 (20%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
+++VYRD G+ ++ K ++ V ++ Y ++ +D G+IE + W SG
Sbjct: 469 IMKVYRDWHIYGNDEWLKLIYSYVQNSLDYCINTWDPKRKGVIEEPHHNTYDIEFWGPSG 528
Query: 691 -ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF------- 742
I++Y + ALQA A+ + ++ +Y + + +L++G YF
Sbjct: 529 MINSY----YTGALQAFVAMGEHL-EKDMTEYRELLDKSIDYMENQLYDGEYFIQNIRWK 583
Query: 743 ---------------NYDNSG----------SSQSSSIQADQLAGQWYARACGLLPIVDE 777
NY G +D + G W + CGL +D
Sbjct: 584 ELQASDPTKVQSVNSNYSKEGLDLLEKEGPKYQYGKGCLSDGVVGAWLSLVCGLDEAIDR 643
Query: 778 DKARSALKKVYNYNVLK-------------VMGGKRGAVNGMLPD-GRVDMSSMQSREIW 823
K S L V+ YN+ + +G + G + P G++ + + S E+W
Sbjct: 644 KKILSHLLSVHKYNLKRNLRKHVNPQRSTFALGDEGGLLLCSWPKGGKLQLPFVYSNEVW 703
Query: 824 SGVTYAVAASMIHEDLADIGF 844
+G+ Y VA+ ++ E + G
Sbjct: 704 TGIEYQVASHLMFEGEVEKGL 724
>gi|160887666|ref|ZP_02068669.1| hypothetical protein BACUNI_00066 [Bacteroides uniformis ATCC 8492]
gi|156862844|gb|EDO56275.1| hypothetical protein BACUNI_00066 [Bacteroides uniformis ATCC 8492]
Length = 992
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/260 (20%), Positives = 107/260 (41%), Gaps = 51/260 (19%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
+++VYR+ +GD ++ K ++P+V ++ Y + +D G +E + W G
Sbjct: 634 IMKVYREWRISGDTQWMKDLFPAVKKSLDYCIRTWDPLHKGYLEEPHHNTYDIEFWGPDG 693
Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF-------- 742
+ ++ AL A + +E+ + ++Y + K + L++G YF
Sbjct: 694 MCT---SFYLGALTAFIEMGKEL-KQPVKEYTALLSKGKKYMETALFDGEYFIQKIQWEG 749
Query: 743 ----------NYDNSGSSQS--------------SSIQADQLAGQWYARACGLLPIVDED 778
++ S S ++ + +D + G W A CGL ++D +
Sbjct: 750 LQAPNPVDVMSFGGSYSEEALKLLKEEGPKYQYGTGCLSDGILGMWMASVCGLDEVLDNE 809
Query: 779 KARSALKKVYNYNV-------------LKVMGGKRGAVNGMLPD-GRVDMSSMQSREIWS 824
K RS L V+ YN+ + G G + P G + + + S E+W+
Sbjct: 810 KVRSHLVAVHKYNLKHDLIDHFNPQRPVYACGKDGGLLLCTWPKGGMLSLPFVYSNEVWT 869
Query: 825 GVTYAVAASMIHEDLADIGF 844
G+ Y VA+ ++ + + G
Sbjct: 870 GIEYQVASHLMMKGEVEKGL 889
>gi|317478064|ref|ZP_07937244.1| hypothetical protein HMPREF1007_00360 [Bacteroides sp. 4_1_36]
gi|316905767|gb|EFV27541.1| hypothetical protein HMPREF1007_00360 [Bacteroides sp. 4_1_36]
Length = 970
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 54/260 (20%), Positives = 109/260 (41%), Gaps = 51/260 (19%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
+++VYR+ +GD ++ K ++P+V ++ Y + +D G +E + W G
Sbjct: 612 IMKVYREWRISGDTQWMKDLFPAVKKSLDYCIRTWDPLHKGYLEEPHHNTYDIEFWGPDG 671
Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF-------- 742
+ ++ AL A + +E+ + ++Y + K + L++G YF
Sbjct: 672 MCT---SFYLGALTAFIEMGKEL-KQPVKEYTALLSKGKKYMETALFDGEYFIQKIQWEG 727
Query: 743 ----------NYDNSGSSQS--------------SSIQADQLAGQWYARACGLLPIVDED 778
++ S S ++ + +D + G W A CGL ++D +
Sbjct: 728 LQAPNPVDVMSFGGSYSEEALKLLKEEGPKYQYGTGCLSDGILGMWMASVCGLDEVLDNE 787
Query: 779 KARSALKKVYNYN----VLKVMGGKR-----GAVNGML-----PDGRVDMSSMQSREIWS 824
K RS L V+ YN ++ +R G G+L G + + + S E+W+
Sbjct: 788 KVRSHLVAVHKYNLKHDLIDHFNPQRPVYACGKDGGLLLCTWPKGGMLSLPFVYSNEVWT 847
Query: 825 GVTYAVAASMIHEDLADIGF 844
G+ Y VA+ ++ + + G
Sbjct: 848 GIEYQVASHLMMKGEVEKGL 867
>gi|270295078|ref|ZP_06201279.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270274325|gb|EFA20186.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 970
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/260 (20%), Positives = 107/260 (41%), Gaps = 51/260 (19%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
+++VYR+ +GD ++ K ++P+V ++ Y + +D G +E + W G
Sbjct: 612 IMKVYREWRISGDTQWMKDLFPAVKKSLDYCIRTWDPLHKGYLEEPHHNTYDIEFWGPDG 671
Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF-------- 742
+ ++ AL A + +E+ + ++Y + K + L++G YF
Sbjct: 672 MCT---SFYLGALTAFIEMGKEL-KQPVKEYTALLSKGKKYMETALFDGEYFIQKIQWEG 727
Query: 743 ----------NYDNSGSSQS--------------SSIQADQLAGQWYARACGLLPIVDED 778
++ S S ++ + +D + G W A CGL ++D +
Sbjct: 728 LQAPNPVDVMSFGGSYSDEALKLLKEEGPKYQYGTGCLSDGILGMWMASVCGLDEVLDNE 787
Query: 779 KARSALKKVYNYNV-------------LKVMGGKRGAVNGMLPD-GRVDMSSMQSREIWS 824
K RS L V+ YN+ + G G + P G + + + S E+W+
Sbjct: 788 KVRSHLVAVHKYNLKHDLVDHFNPQRPVYACGKDGGLLLCTWPKGGMLSLPFVYSNEVWT 847
Query: 825 GVTYAVAASMIHEDLADIGF 844
G+ Y VA+ ++ + + G
Sbjct: 848 GIEYQVASHLMMKGEVEKGL 867
>gi|384263211|ref|YP_005418399.1| amylo-alpha-16-glucosidase [Rhodospirillum photometricum DSM 122]
gi|378404313|emb|CCG09429.1| Amylo-alpha-16-glucosidase [Rhodospirillum photometricum DSM 122]
Length = 728
Score = 48.9 bits (115), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 36/62 (58%), Gaps = 6/62 (9%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE-----NDGFP 679
D P FV V TGD FA+A+WP+V A+A++D DRDGDG +E +DG
Sbjct: 397 DATPLFVALAGAYVERTGDGAFARALWPAVDAALAWIDHHGDRDGDGFVEYGRRCDDGLV 456
Query: 680 DQ 681
+Q
Sbjct: 457 NQ 458
>gi|29348317|ref|NP_811820.1| hypothetical protein BT_2908 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340221|gb|AAO78014.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 848
Score = 48.1 bits (113), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 57/270 (21%), Positives = 105/270 (38%), Gaps = 61/270 (22%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
+++VYR+ +G+ +F +++P V ++ Y + +D G IE + W G
Sbjct: 490 IMKVYREWRISGENEFLISMYPKVKKSLDYCISTWDPRRVGSIEEPHHNTYDIEFWGPDG 549
Query: 691 I--SAYSGGLWVAALQAASALAR--EVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF---- 742
+ S Y G L SA R E D+ +Y + K L+NG YF
Sbjct: 550 MHNSFYYGAL--------SAFIRMSEFLDKDVTEYKKLLKKGRKFTETGLFNGEYFIQKI 601
Query: 743 --------------NYDNSGSSQSSSIQ--------------ADQLAGQWYARACGLLPI 774
++ +S S ++ I +D + G W +R CG+
Sbjct: 602 EWRGLNAKDPTVAQSFHSSYSPEAKEILEKEGPKYQYGNGCLSDGVLGSWLSRMCGMEET 661
Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP--------------DGRVDMSSMQSR 820
++ +K +S L V+ YN K + G++ + + S
Sbjct: 662 LNTEKVKSHLLSVHRYNFKKDLTDHANPQRSPYALGKEGGLLLGSWPKGGKLSLPFVYSN 721
Query: 821 EIWSGVTYAVAASMIHEDLADIGFQT--AC 848
E+W+G+ Y VA+ ++ + + G + AC
Sbjct: 722 EVWTGIEYQVASHLMLQGEVEKGLEIVRAC 751
>gi|255013072|ref|ZP_05285198.1| hypothetical protein B2_04148 [Bacteroides sp. 2_1_7]
gi|410102535|ref|ZP_11297461.1| hypothetical protein HMPREF0999_01233 [Parabacteroides sp. D25]
gi|409238607|gb|EKN31398.1| hypothetical protein HMPREF0999_01233 [Parabacteroides sp. D25]
Length = 620
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 38/145 (26%), Positives = 64/145 (44%), Gaps = 8/145 (5%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDT 685
D+N F+L++ R +K+ WP++ AM ++ D D G+ F D
Sbjct: 346 DINAFFILRIARFYRYYHNKQDLINYWPAMKNAMNWLISRDTDKIGLPAQVSFWGDWKDV 405
Query: 686 WSVSG--ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK------LW 737
+ G S ++ ++++AL+ LA E GD+ +Y+ +QK K LW
Sbjct: 406 KGIEGRKYSPFTCLIYLSALKEMVYLAEECGDQAGANYYQSAYQKGYKTMNKDVREGGLW 465
Query: 738 NGSYFNYDNSGSSQSSSIQADQLAG 762
NG+Y+ S + I DQ G
Sbjct: 466 NGNYYCQIWKDGSVNDRILQDQTIG 490
>gi|428168440|gb|EKX37385.1| hypothetical protein GUITHDRAFT_145081 [Guillardia theta CCMP2712]
Length = 887
Score = 47.0 bits (110), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 53/226 (23%), Positives = 86/226 (38%), Gaps = 62/226 (27%)
Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC 607
+H Y S L +LFP++ R +L P D
Sbjct: 528 LHLYRSIPLALLFPQL-----------------------------VRNILDTGPID---- 554
Query: 608 DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDR 667
+P F + + D+ NP FVL V ++TGD++ V + +++
Sbjct: 555 EPKFPCGSRMMGDS------NPGFVLSVLNLFMSTGDRR----------VLLDFLEPV-- 596
Query: 668 DGDGMIENDGFPDQTYDTWS-----VSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
M G P +T+ + +SAY+ L++A L AA +A V D + Y
Sbjct: 597 ----MSSRFGLPRHLQNTYDWFHFHLKDVSAYNAVLYLAQLAAAERIAALVQDDSFQHYC 652
Query: 723 LFKFQKAK-VVYEKLWNGSYFN-YDNSGSSQSSSIQADQLAGQWYA 766
KA+ V E LWNG + + N ++ + L GQ +A
Sbjct: 653 QLHLSKAQEKVKELLWNGRFLRAWWNEEGELVEALHTETLYGQLWA 698
>gi|427384889|ref|ZP_18881394.1| hypothetical protein HMPREF9447_02427 [Bacteroides oleiciplenus YIT
12058]
gi|425728150|gb|EKU91009.1| hypothetical protein HMPREF9447_02427 [Bacteroides oleiciplenus YIT
12058]
Length = 998
Score = 47.0 bits (110), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 50/254 (19%), Positives = 106/254 (41%), Gaps = 57/254 (22%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
+++VYR+ +G+ ++ K ++P+V ++ Y + +D G +E + W G
Sbjct: 640 IMKVYREWRISGNTQWMKDLFPAVKKSLDYCIRTWDPLQKGYLEEPHHNTYDIEFWGPDG 699
Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF-------- 742
+ ++ AL A +++ + + ++ Y + + K + L++G YF
Sbjct: 700 MCT---SFYLGALTAFVEMSKAL-KQSTKKYDILLAKGKKYMENTLFDGEYFIQKIQWEG 755
Query: 743 ----------NYDNSGSSQS--------------SSIQADQLAGQWYARACGLLPIVDED 778
++ S ++ S +D + G W A CGL +++ +
Sbjct: 756 LQAPNPVDVISFGGGYSDEAMKLLKKEGPKYQYGSGCLSDGILGMWMASVCGLNEVLNNE 815
Query: 779 KARSALKKVYNYNVLKVM------------GGKRGAVNGML-----PDGRVDMSSMQSRE 821
K +S L V+ YN+ + GK G G+L G + + + S E
Sbjct: 816 KVKSHLVSVHKYNLKDNLVDHFNPQRPAYACGKEG---GLLLCTWPKGGMLSLPFVYSNE 872
Query: 822 IWSGVTYAVAASMI 835
+W+G+ Y VA+ ++
Sbjct: 873 VWTGIEYQVASHLM 886
>gi|383124512|ref|ZP_09945175.1| hypothetical protein BSIG_1742 [Bacteroides sp. 1_1_6]
gi|251841336|gb|EES69417.1| hypothetical protein BSIG_1742 [Bacteroides sp. 1_1_6]
Length = 848
Score = 46.6 bits (109), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 56/270 (20%), Positives = 105/270 (38%), Gaps = 61/270 (22%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
+++VYR+ +G+ +F +++P V ++ Y + +D G IE + W G
Sbjct: 490 IMKVYREWRISGENEFLISMYPKVKKSLDYCISTWDPRRVGSIEEPHHNTYDIEFWGPDG 549
Query: 691 I--SAYSGGLWVAALQAASALAR--EVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF---- 742
+ S Y G L SA R E D+ +Y + K L+NG YF
Sbjct: 550 MHNSFYYGAL--------SAFIRMSEFLDKDVTEYKKLLKKGRKFTETGLFNGEYFIQKI 601
Query: 743 --------------NYDNSGSSQSSSIQ--------------ADQLAGQWYARACGLLPI 774
++ +S S ++ I +D + G W +R CG+
Sbjct: 602 EWRGLNAKDPTVAQSFHSSYSPEAKEILEKEGPKYQYGNGCLSDGVLGSWLSRMCGMEET 661
Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP--------------DGRVDMSSMQSR 820
++ +K +S L V+ YN K + G++ + + S
Sbjct: 662 LNTEKVKSHLLSVHRYNFKKDLTDHANPQRSPYALGKEGGLLLGSWPKGGKLSLPFVYSN 721
Query: 821 EIWSGVTYAVAASMIHEDLADIGFQT--AC 848
++W+G+ Y VA+ ++ + + G + AC
Sbjct: 722 KVWTGIEYQVASHLMLQGEVEKGLEIVRAC 751
>gi|220925399|ref|YP_002500701.1| amylo-alpha-16-glucosidase [Methylobacterium nodulans ORS 2060]
gi|219950006|gb|ACL60398.1| Amylo-alpha-16-glucosidase [Methylobacterium nodulans ORS 2060]
Length = 765
Score = 46.6 bits (109), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 59/130 (45%), Gaps = 18/130 (13%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE---------- 674
D P FV+ + TGDK +A+WP++ A+ ++D + DRDGDG +E
Sbjct: 413 DATPLFVMLAGQYFEQTGDKDTIQAIWPNILAALEWIDTYGDRDGDGFVEYARETDKGLA 472
Query: 675 NDGFPDQTYDTWSVSGISAYS-------GGLWVAALQAASALAREVGDRGSEDYFLFKFQ 727
N G+ D + G A G AA +AAS LA +G+ + +
Sbjct: 473 NQGWKDSHDSIFHADGRMAQGPIALCEVQGYVYAAKRAASKLAATLGNAALAERLKDAAE 532
Query: 728 KAKVVYEKLW 737
+ +V +++ +
Sbjct: 533 RLRVAFDEAF 542
>gi|374372806|ref|ZP_09630467.1| hypothetical protein NiasoDRAFT_3452 [Niabella soli DSM 19437]
gi|373234882|gb|EHP54674.1| hypothetical protein NiasoDRAFT_3452 [Niabella soli DSM 19437]
Length = 872
Score = 46.2 bits (108), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 57/252 (22%), Positives = 104/252 (41%), Gaps = 53/252 (21%)
Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
+++V+R+ +G+ + K ++P V ++ Y ++ +D G++E + W G
Sbjct: 517 IMKVHREWRISGNHTWLKKIYPLVKRSLDYCIETWDPGKKGVLEEPHHNTYDIEFW---G 573
Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYF------- 742
++ ALQ+ A+ + + + + L K K K+ E +L+NG +F
Sbjct: 574 PDPMCTTFYLGALQSIIAMGQFLKEDITGYSSLLK--KGKMFLETRLFNGEFFFQQIQWE 631
Query: 743 --NYDNSGSSQS-----------------------SSIQADQLAGQWYARACGLLPIVDE 777
N +QS +D + G W A CGL VD+
Sbjct: 632 GLKAQNPARAQSFGGNYSQEAIEILKKEGPKYQYGKGCLSDGVLGSWIAAMCGLEDPVDK 691
Query: 778 DKARSALKKVYNYNV---LK----------VMGGKRGAVNGMLPD-GRVDMSSMQSREIW 823
K L V+ YN+ LK MG + G + P G++ + + S E+W
Sbjct: 692 RKITGHLLSVHKYNLKATLKDHVNPQRPAYAMGHEGGLLLCTWPKGGKLSLPFVYSNEVW 751
Query: 824 SGVTYAVAASMI 835
+G+ Y VAA ++
Sbjct: 752 TGIEYQVAAHLM 763
>gi|154246650|ref|YP_001417608.1| amylo-alpha-16-glucosidase [Xanthobacter autotrophicus Py2]
gi|154160735|gb|ABS67951.1| Amylo-alpha-16-glucosidase [Xanthobacter autotrophicus Py2]
Length = 733
Score = 46.2 bits (108), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 67/140 (47%), Gaps = 30/140 (21%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE---------- 674
D P FV+ V TGD A+ +WP+V A+ ++ + DRDGDG +E
Sbjct: 381 DSTPLFVMLAGAYVERTGDLDTARLLWPAVAAALGWITDYGDRDGDGFVEYGRRSGDGLV 440
Query: 675 NDGFPDQTYDTWSVSG-----------ISAYSGGLWVAALQAASALAREVGDRGSEDYFL 723
N G+ D + G + AY G W QAA+++AR +G + D +
Sbjct: 441 NQGWKDSHDSVFHADGRLARGPIALVEVQAYVFGAW----QAAASIARRLGQ--APDAMV 494
Query: 724 FKFQKAKVVYEKLWNGSYFN 743
F+ +KA ++ + ++ ++F+
Sbjct: 495 FE-RKADLLRDA-FDAAFFD 512
>gi|393766406|ref|ZP_10354962.1| amylo-alpha-16-glucosidase [Methylobacterium sp. GXF4]
gi|392728187|gb|EIZ85496.1| amylo-alpha-16-glucosidase [Methylobacterium sp. GXF4]
Length = 780
Score = 46.2 bits (108), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 33/50 (66%), Gaps = 1/50 (2%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE 674
D P FV+ + TGD + +A+WPS+ +A+ +MD++ DRDGDG +E
Sbjct: 428 DGTPLFVMLAWEYYAVTGDLETIRAIWPSLELALEWMDRYGDRDGDGFVE 477
>gi|170749555|ref|YP_001755815.1| amylo-alpha-16-glucosidase [Methylobacterium radiotolerans JCM
2831]
gi|170656077|gb|ACB25132.1| Amylo-alpha-16-glucosidase [Methylobacterium radiotolerans JCM
2831]
Length = 781
Score = 45.4 bits (106), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 33/50 (66%), Gaps = 1/50 (2%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE 674
D P FV+ + TGD + +A+WP++ +A+ +MD++ DRDGDG +E
Sbjct: 429 DGTPLFVMLAWEYYAVTGDLETVRAIWPALELALEWMDRYGDRDGDGFVE 478
>gi|431796995|ref|YP_007223899.1| glycogen debranching protein [Echinicola vietnamensis DSM 17526]
gi|430787760|gb|AGA77889.1| glycogen debranching enzyme [Echinicola vietnamensis DSM 17526]
Length = 1213
Score = 44.7 bits (104), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 65/141 (46%), Gaps = 16/141 (11%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ-FDRDGDGMIENDGFPDQTYD 684
D+N F Q+ R TGDK + +AVWPS+ +A+ + FD DGDG+ +
Sbjct: 419 DMNQVFFDQLLRHFDWTGDKAYLQAVWPSIERHLAWEKRNFDPDGDGLY------NAYAS 472
Query: 685 TWSVSGISAYSGGLWVA------ALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLW- 737
W+ + SGG+ A A + A+ LA +G+ G + Y K + E LW
Sbjct: 473 IWASDALQYNSGGVAHASAYNYFANKKAAELAAFIGEDGKQ-YAAEAEAILKAMNETLWL 531
Query: 738 -NGSYFNYDNSGSSQSSSIQA 757
+G Y Y + Q+ QA
Sbjct: 532 PSGWYAEYKDFLGPQNLHPQA 552
>gi|217978466|ref|YP_002362613.1| amylo-alpha-16-glucosidase [Methylocella silvestris BL2]
gi|217503842|gb|ACK51251.1| Amylo-alpha-16-glucosidase [Methylocella silvestris BL2]
Length = 729
Score = 44.3 bits (103), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 47/108 (43%), Gaps = 18/108 (16%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE---------- 674
D P FV+ TGD + +WP++ A+ ++D+F DRDGDG +E
Sbjct: 377 DATPLFVMLAGMYFERTGDSETIAKIWPNILAALRWIDEFGDRDGDGFVEYLRETENGLA 436
Query: 675 NDGFPDQTYDTWSVSGISAYS-------GGLWVAALQAASALAREVGD 715
N G+ D + G A G AA AA LAR +GD
Sbjct: 437 NQGWKDSHDSIFHADGSLAEGPIALCEVQGYVYAAKSAAGRLARLMGD 484
>gi|170060875|ref|XP_001865995.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167879232|gb|EDS42615.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 183
Score = 44.3 bits (103), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 45/110 (40%), Gaps = 31/110 (28%)
Query: 604 IGICDPWFE--VNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYV---- 657
+ I + W V AY ++D WKDLN KF+LQVYRD ++A ++
Sbjct: 43 LKIVEVWLRGTVRAYPIHDVYEWKDLNSKFILQVYRDYYTQLKAEYAGKFSSIEFINKES 102
Query: 658 ------------------------AMAYMDQFDRDGDGMIENDGFPDQTY 683
A Y ++D D DG+IEN PD TY
Sbjct: 103 FRNRMSRTERDVLLEENVSGLQAGAGEYPLEWDMDNDGLIENKP-PDLTY 151
>gi|384915661|ref|ZP_10015873.1| Glycogen debranching enzyme [Methylacidiphilum fumariolicum SolV]
gi|384526964|emb|CCG91744.1| Glycogen debranching enzyme [Methylacidiphilum fumariolicum SolV]
Length = 733
Score = 43.9 bits (102), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 67/310 (21%), Positives = 120/310 (38%), Gaps = 56/310 (18%)
Query: 565 LSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARW 624
I+ D ++H+ + ++ + G+ RK G++ +A CL
Sbjct: 345 FDIEADAEPGKILHETREGEMANLGEIPFRKYYGSI-------------DATCL------ 385
Query: 625 KDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE--------- 674
F+L T D+KF + +WPS+ +A+ ++D++ D DGDG +E
Sbjct: 386 ------FLLLAAEYYQRTADRKFIEEIWPSIELALLWIDRYGDVDGDGFVEYRRRSSRGL 439
Query: 675 -NDGFPD------QTYDTWSVSGISAYS-GGLWVAALQAASALAREVGDRGSEDYFLFKF 726
N G+ D Q ++ V I+ G AA + + LA +G D L +
Sbjct: 440 TNQGWKDSFDSVFQKDGSFPVGPIALCEVQGFVFAAKKKIAYLASVLGQISLHDRLLKEA 499
Query: 727 QKAKVVYEK-LWNGSYFNYDNS--GSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSA 783
+ K +EK W +Y + G + + A A C I +DKA
Sbjct: 500 EILKEKFEKEFWVEELSSYALALDGRKEKCKVLASN------AGLCLYTGIAKQDKAELI 553
Query: 784 LKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIG 843
K + + G + A + R + S + +W T +AA M+ + ++
Sbjct: 554 GKLLMSDKFFTGWGIRTVASTEI----RYNPISYHNGSVWPFDTAFIAAGMMSYGMKEMA 609
Query: 844 FQTACGIYEA 853
G++EA
Sbjct: 610 INLFKGLFEA 619
>gi|170743420|ref|YP_001772075.1| amylo-alpha-16-glucosidase [Methylobacterium sp. 4-46]
gi|168197694|gb|ACA19641.1| Amylo-alpha-16-glucosidase [Methylobacterium sp. 4-46]
Length = 764
Score = 43.5 bits (101), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 49/107 (45%), Gaps = 18/107 (16%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE---------- 674
D P FV+ + TGD+ +A+WP++ A+ ++D + DRDGDG +E
Sbjct: 412 DATPLFVMLAGQYFEQTGDRATVQAIWPNILAALEWIDTYGDRDGDGFVEYARETDKGLA 471
Query: 675 NDGFPDQTYDTWSVSGISAYS-------GGLWVAALQAASALAREVG 714
N G+ D + G A G AA +AAS LA +G
Sbjct: 472 NQGWKDSHDSIFHADGRMAQGPIALCEVQGYVYAAKRAASKLAGVLG 518
>gi|86604990|ref|YP_473753.1| glycogen debranching enzyme family protein [Synechococcus sp.
JA-3-3Ab]
gi|86553532|gb|ABC98490.1| glycogen debranching enzyme family protein [Synechococcus sp.
JA-3-3Ab]
Length = 721
Score = 42.7 bits (99), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 32/53 (60%), Gaps = 1/53 (1%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIENDG 677
D P FV+ R V +GD + + +WP++ A+++MD + D DGDG +E G
Sbjct: 370 DATPLFVVLAGRYVRRSGDLELLRQLWPNIEAALSWMDTYGDPDGDGFLEYAG 422
>gi|383768111|ref|YP_005447094.1| hypothetical protein PSMK_30380 [Phycisphaera mikurensis NBRC
102666]
gi|381388381|dbj|BAM05197.1| hypothetical protein PSMK_30380 [Phycisphaera mikurensis NBRC
102666]
Length = 823
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 53/211 (25%), Positives = 82/211 (38%), Gaps = 33/211 (15%)
Query: 541 LMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAV 600
L+ +T D + SS + L+P + R +A + H S +L
Sbjct: 410 LLHSTVDAEYNSSLFALCLWPDLLALQLRQWAERLEPHAASGGAILQ------------- 456
Query: 601 PHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMA 660
HD+G + Y A ++ N +LQ Y TGD+ A+A+ + A
Sbjct: 457 -HDLGSGND--ASGQASRYRMAVEENSNFLLLLQTY--TRWTGDRSVARALAATASSLTA 511
Query: 661 YMDQFDRDGDGMIENDGFPDQTYDTWSVSGISA--------YSGGLWVAALQAASALARE 712
Y+ DR ENDGFPD + + G +A Y +AALQA++ L R
Sbjct: 512 YLLWCDR------ENDGFPDDPGENTMIDGPAALRLARRPTYPAVKRLAALQASADLLRL 565
Query: 713 VG-DRGSEDYFLFKFQKAKVVYEKLWNGSYF 742
G D + D A + W G ++
Sbjct: 566 AGQDDAARDTESIADADAAKIESAAWLGDHY 596
>gi|326797904|ref|YP_004315723.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326548668|gb|ADZ77053.1| hypothetical protein Sph21_0471 [Sphingobacterium sp. 21]
Length = 1132
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 58/123 (47%), Gaps = 22/123 (17%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ-FDRDGDGMIE-------NDG 677
D+N F Q+Y TGD F KA WP++ +A+ + FD+DGDG+ + +D
Sbjct: 416 DMNLVFFDQLYNHFQYTGDTAFIKASWPAIKRHLAWEKRNFDQDGDGLYDAYCAIWASDA 475
Query: 678 FPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKV---VYE 734
Y V+ SAY+ + A+ LA VG ED +K + AK+ V +
Sbjct: 476 L---QYSGGGVTHTSAYN----YRGFEMAAKLATIVG----EDPRPYKQEAAKIKEAVNK 524
Query: 735 KLW 737
LW
Sbjct: 525 HLW 527
>gi|16263864|ref|NP_436656.1| hypothetical protein SM_b20116 [Sinorhizobium meliloti 1021]
gi|334319999|ref|YP_004556628.1| amylo-alpha-16-glucosidase [Sinorhizobium meliloti AK83]
gi|384539727|ref|YP_005723811.1| hypothetical protein SM11_pD1478 [Sinorhizobium meliloti SM11]
gi|407724161|ref|YP_006843822.1| hypothetical protein BN406_06540 [Sinorhizobium meliloti Rm41]
gi|15139988|emb|CAC48516.1| CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti 1021]
gi|334097738|gb|AEG55748.1| Amylo-alpha-16-glucosidase [Sinorhizobium meliloti AK83]
gi|336038380|gb|AEH84310.1| conserved hypothetical protein [Sinorhizobium meliloti SM11]
gi|407324221|emb|CCM72822.1| hypothetical protein BN406_06540 [Sinorhizobium meliloti Rm41]
Length = 734
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 55/107 (51%), Gaps = 18/107 (16%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE-----NDGFP 679
D P F++ + TGD + +WP+V A+ ++D+F DRDGDG +E G
Sbjct: 382 DSTPLFIMLAGAYLDRTGDLDTVRNLWPNVVAALDWIDRFGDRDGDGFVEYGSRTAKGLV 441
Query: 680 DQ----TYDTWSVSGISAYSGGLWVA--------ALQAASALAREVG 714
+Q ++D+ +G + SG + A A QAA+ L+R++G
Sbjct: 442 NQCWKDSHDSIFHAGGNLASGPIATAEVQAYVFGAWQAAARLSRKLG 488
>gi|384533984|ref|YP_005716648.1| amylo-alpha-16-glucosidase [Sinorhizobium meliloti BL225C]
gi|333816160|gb|AEG08827.1| Amylo-alpha-16-glucosidase [Sinorhizobium meliloti BL225C]
Length = 734
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 55/107 (51%), Gaps = 18/107 (16%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE-----NDGFP 679
D P F++ + TGD + +WP+V A+ ++D+F DRDGDG +E G
Sbjct: 382 DSTPLFIMLAGAYLDRTGDLDTVRNLWPNVVAALDWIDRFGDRDGDGFVEYGSRTAKGLV 441
Query: 680 DQ----TYDTWSVSGISAYSGGLWVA--------ALQAASALAREVG 714
+Q ++D+ +G + SG + A A QAA+ L+R++G
Sbjct: 442 NQCWKDSHDSIFHAGGNLASGPIATAEVQAYVFGAWQAAARLSRKLG 488
>gi|418404986|ref|ZP_12978419.1| amylo-alpha-16-glucosidase [Sinorhizobium meliloti CCNWSX0020]
gi|359501068|gb|EHK73697.1| amylo-alpha-16-glucosidase [Sinorhizobium meliloti CCNWSX0020]
Length = 734
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 55/107 (51%), Gaps = 18/107 (16%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE-----NDGFP 679
D P F++ + TGD + +WP+V A+ ++D+F DRDGDG +E G
Sbjct: 382 DSTPLFIMLAGAYLDRTGDLDTVRNLWPNVVAALDWIDRFGDRDGDGFVEYGSRTAKGLV 441
Query: 680 DQ----TYDTWSVSGISAYSGGLWVA--------ALQAASALAREVG 714
+Q ++D+ +G + SG + A A QAA+ L+R++G
Sbjct: 442 NQCWKDSHDSIFHAGGNLASGPIATAEVQAYVFGAWQAAARLSRKLG 488
>gi|300770983|ref|ZP_07080860.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33861]
gi|300762256|gb|EFK59075.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33861]
Length = 1205
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 38/73 (52%), Gaps = 8/73 (10%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ-FDRDGDGMIENDGFPDQTYD 684
D+N F Q++ TGD+ F K VWP++ +A+ + FD DGDG+ D
Sbjct: 418 DMNLVFFDQLFTHFQYTGDRTFLKEVWPAIKRHLAWEKRNFDMDGDGLY------DAYCA 471
Query: 685 TWSVSGISAYSGG 697
W+ G+ YSGG
Sbjct: 472 IWASDGLQ-YSGG 483
>gi|227537591|ref|ZP_03967640.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33300]
gi|227242593|gb|EEI92608.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33300]
Length = 1205
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 38/73 (52%), Gaps = 8/73 (10%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ-FDRDGDGMIENDGFPDQTYD 684
D+N F Q++ TGD+ F K VWP++ +A+ + FD DGDG+ D
Sbjct: 418 DMNLVFFDQLFTHFQYTGDRTFLKEVWPAIKRHLAWEKRNFDMDGDGLY------DAYCA 471
Query: 685 TWSVSGISAYSGG 697
W+ G+ YSGG
Sbjct: 472 IWASDGLQ-YSGG 483
>gi|433611718|ref|YP_007195179.1| Glycogen debranching enzyme [Sinorhizobium meliloti GR4]
gi|429556660|gb|AGA11580.1| Glycogen debranching enzyme [Sinorhizobium meliloti GR4]
Length = 734
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 54/107 (50%), Gaps = 18/107 (16%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE-----NDGFP 679
D P F++ + TGD + +WP+V A+ ++D+F DRDGDG +E G
Sbjct: 382 DSTPLFIMLAGAYLDRTGDLDTVRNLWPNVVAALDWIDRFGDRDGDGFVEYGSRTAKGLV 441
Query: 680 DQ----TYDTWSVSGISAYSGGLWVA--------ALQAASALAREVG 714
+Q ++D+ +G + SG + A A QAA+ L+R +G
Sbjct: 442 NQCWKDSHDSIFHAGGNLASGPIATAEVQAYVFGAWQAAARLSRRLG 488
>gi|429221453|ref|YP_007173779.1| glycogen debranching protein [Deinococcus peraridilitoris DSM
19664]
gi|429132316|gb|AFZ69330.1| glycogen debranching enzyme [Deinococcus peraridilitoris DSM 19664]
Length = 430
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 31/121 (25%), Positives = 54/121 (44%), Gaps = 5/121 (4%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDT 685
D N FVL Y T D F +A WP + A+ ++ D + G++E D D
Sbjct: 99 DANMWFVLGHYACHRRTNDTAFLRAAWPRIQAALLWLRYQDMNDCGLLEAPEAADWA-DL 157
Query: 686 WSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVV----YEKLWNGSY 741
++ + Y+ L+V L+AA L+ +G+ G+ + K+ ++ W+G
Sbjct: 158 YATRYNTLYANALYVGTLEAAGYLSAALGEDGAAHRVRANDVRRKINLLLWLDRPWDGQR 217
Query: 742 F 742
F
Sbjct: 218 F 218
>gi|398838157|ref|ZP_10595439.1| glycogen debranching enzyme [Pseudomonas sp. GM102]
gi|398116719|gb|EJM06477.1| glycogen debranching enzyme [Pseudomonas sp. GM102]
Length = 1066
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 31/127 (24%), Positives = 55/127 (43%), Gaps = 26/127 (20%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE---------- 674
D P FV+ + T D + +WP++ A+++++++ DRDGDG +E
Sbjct: 714 DSTPLFVMLAAAYLERTNDLGTLRQLWPNIEAALSWIEEYGDRDGDGFVEYGRQSAEGLI 773
Query: 675 NDGFPDQTYDTWSVSG-----------ISAYSGGLWVAALQAASALAREVGDRGSEDYFL 723
N G+ D + +G + AY G W A+ +AR +GD
Sbjct: 774 NQGWKDSHDSVFHANGQLAVGPIAIVEVQAYVYGAWT----GAAKIARRLGDPDRAQRLK 829
Query: 724 FKFQKAK 730
+K Q+ +
Sbjct: 830 YKAQRLR 836
>gi|268317317|ref|YP_003291036.1| amylo-alpha-16-glucosidase [Rhodothermus marinus DSM 4252]
gi|262334851|gb|ACY48648.1| Amylo-alpha-16-glucosidase [Rhodothermus marinus DSM 4252]
Length = 868
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 55/128 (42%), Gaps = 19/128 (14%)
Query: 598 GAVPHDI----GICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWP 653
G +PH+I + D WFE Y D P F++ A+GD + + W
Sbjct: 363 GKIPHEISQSAALID-WFEDYPYPWASA----DATPLFIIAHADYWQASGDLDYIREHWD 417
Query: 654 SVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI------SAYSGGLWVAALQAAS 707
++ A + D DG+ ++EN G W G+ Y G+W+AAL+
Sbjct: 418 ALVRAYRFTAGTDTDGNDLVENTG----VGHGWVEGGLLYPPHEELYQQGVWLAALEGME 473
Query: 708 ALAREVGD 715
A+A +G+
Sbjct: 474 AMATALGE 481
>gi|427409761|ref|ZP_18899963.1| hypothetical protein HMPREF9718_02437 [Sphingobium yanoikuyae ATCC
51230]
gi|425711894|gb|EKU74909.1| hypothetical protein HMPREF9718_02437 [Sphingobium yanoikuyae ATCC
51230]
Length = 739
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 61/140 (43%), Gaps = 30/140 (21%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE---------- 674
D P FV+ + TGD ++ + P++ + ++D DRDGDG +E
Sbjct: 387 DSTPLFVMLAGAYLERTGDLEYLAGLLPNIEAGLRWIDTHGDRDGDGFVEYGRLTDQGLQ 446
Query: 675 NDGFPDQTYDTWSVSG-----------ISAYSGGLWVAALQAASALAREVGDRGSEDYFL 723
N G+ D + V G + AY G W QAAS + R +GD + +L
Sbjct: 447 NQGWKDSHDSIFHVDGSIARGPIALAEVQAYVYGAW----QAASLIYRAMGDLDRSETYL 502
Query: 724 FKFQKAKVVYEKLWNGSYFN 743
+ V ++ ++ S+F+
Sbjct: 503 AR----AVDLQQRFDASFFD 518
>gi|398384505|ref|ZP_10542535.1| glycogen debranching enzyme [Sphingobium sp. AP49]
gi|397722664|gb|EJK83200.1| glycogen debranching enzyme [Sphingobium sp. AP49]
Length = 738
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 52/129 (40%), Gaps = 25/129 (19%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE---------- 674
D P FV+ + TGD F A+ P V A+A++D DRDGDG +E
Sbjct: 386 DSTPLFVMLAGAYLERTGDAAFLAAILPHVEAALAWIDAHGDRDGDGFVEYGRLTDQGLQ 445
Query: 675 NDGFPDQTYDTWSVSG-----------ISAYSGGLWVAALQAASALARE---VGDRGSED 720
N G+ D + G + AY G W AA +A+ +E G R D
Sbjct: 446 NQGWKDSHDSIFHADGSIARGPIALAEVQAYVYGAWQAAGAIFAAVGQEDKATGYRARAD 505
Query: 721 YFLFKFQKA 729
+F A
Sbjct: 506 DLRTRFDAA 514
>gi|159040682|ref|YP_001539934.1| glycoside hydrolase 15-like protein [Caldivirga maquilingensis
IC-167]
gi|157919517|gb|ABW00944.1| glycoside hydrolase 15-related [Caldivirga maquilingensis IC-167]
Length = 604
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 49/119 (41%), Gaps = 9/119 (7%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDT 685
D VL Y T D++F K W V A Y+ + +DG T
Sbjct: 346 DETASVVLATYVHFKLTLDRQFLKNAWVMVRKAAEYLA-------ANVSDDGLTTPTVGP 398
Query: 686 WSVS-GISAYSGGLWVAALQAASALAREVGDRG-SEDYFLFKFQKAKVVYEKLWNGSYF 742
W G+ Y+ AAL +AS LA E+GDR + ++ + + WNGS+F
Sbjct: 399 WEEHLGVHTYTNASVYAALSSASYLAGEIGDRNRAVEWGRYASVIRSTTLNQAWNGSFF 457
>gi|334345035|ref|YP_004553587.1| amylo-alpha-16-glucosidase [Sphingobium chlorophenolicum L-1]
gi|334101657|gb|AEG49081.1| Amylo-alpha-16-glucosidase [Sphingobium chlorophenolicum L-1]
Length = 744
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 31/57 (54%), Gaps = 1/57 (1%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIENDGFPDQ 681
D P FV+ + TGD F ++ P V A+A++D+ DRDGDG +E DQ
Sbjct: 392 DSTPLFVMLAGAYLERTGDSAFIASILPHVRAALAWIDEHGDRDGDGFVEYGRLTDQ 448
>gi|345303570|ref|YP_004825472.1| Amylo-alpha-16-glucosidase [Rhodothermus marinus SG0.5JP17-172]
gi|345112803|gb|AEN73635.1| Amylo-alpha-16-glucosidase [Rhodothermus marinus SG0.5JP17-172]
Length = 868
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 55/128 (42%), Gaps = 19/128 (14%)
Query: 598 GAVPHDI----GICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWP 653
G +PH+I + D WFE Y D P F++ A+GD + + W
Sbjct: 363 GKIPHEISQSAALID-WFEDYPYPWASA----DATPLFIIAHADYWQASGDLDYIREHWD 417
Query: 654 SVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI------SAYSGGLWVAALQAAS 707
++ A + D DG+ ++EN G W G+ Y G+W+AAL+
Sbjct: 418 ALVRAYRFTAGTDTDGNDLVENTG----VGHGWVEGGLLYPPHEELYQQGVWLAALEGME 473
Query: 708 ALAREVGD 715
A+A +G+
Sbjct: 474 AMATALGE 481
>gi|337288158|ref|YP_004627630.1| alpha-L-rhamnosidase [Thermodesulfobacterium sp. OPB45]
gi|334901896|gb|AEH22702.1| alpha-L-rhamnosidase [Thermodesulfobacterium geofontis OPF15]
Length = 707
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 52/110 (47%), Gaps = 21/110 (19%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIENDGFPDQ--- 681
D P FV+ + TGD F K++W ++ +A++++D + D DGDG +E FP +
Sbjct: 358 DATPLFVILASKYFERTGDIDFIKSIWKNIVLAISWIDNYGDVDGDGFVEY--FPSKKGL 415
Query: 682 TYDTWSVSGISA-YSGGLWV--------------AALQAASALAREVGDR 716
T W S S Y G V A + AS LAR +G++
Sbjct: 416 TNKGWKDSEDSIFYEDGTLVKPPVALVEVQGYVYKAKREASKLARILGEK 465
>gi|300787634|ref|YP_003767925.1| alpha-L-rhamnosidase [Amycolatopsis mediterranei U32]
gi|384151022|ref|YP_005533838.1| alpha-L-rhamnosidase [Amycolatopsis mediterranei S699]
gi|399539517|ref|YP_006552179.1| alpha-L-rhamnosidase [Amycolatopsis mediterranei S699]
gi|299797148|gb|ADJ47523.1| alpha-L-rhamnosidase [Amycolatopsis mediterranei U32]
gi|340529176|gb|AEK44381.1| alpha-L-rhamnosidase [Amycolatopsis mediterranei S699]
gi|398320287|gb|AFO79234.1| alpha-L-rhamnosidase [Amycolatopsis mediterranei S699]
Length = 726
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 72/151 (47%), Gaps = 13/151 (8%)
Query: 642 TGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSG-ISAYSGGLWV 700
TGDK + AV P++ +AY D F R+ D ++ ++ D+ Y + +G ++ YS +
Sbjct: 384 TGDKAYVTAVLPALRKELAYNDSF-RNADNLVVSN---DRDYWQATQTGEVTKYSIDYSI 439
Query: 701 AALQAASALAREVGDRGSEDYFLFKFQKAKV-VYEKLWNGSYFNYDNSGSSQSSSIQ-AD 758
L+ S L R VG D + K K V KLWN + Y S + ++ A+
Sbjct: 440 -VLREMSWLERAVGSAALADGYDTKADAIKTAVMSKLWNPALGAYGQSSAKPDVLVEDAN 498
Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYN 789
LA Q+ G +P E K +ALK ++
Sbjct: 499 ALALQY-----GFVPAGQEAKVLAALKTLWT 524
>gi|86608269|ref|YP_477031.1| glycogen debranching enzyme family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86556811|gb|ABD01768.1| glycogen debranching enzyme family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 721
Score = 41.6 bits (96), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 31/53 (58%), Gaps = 1/53 (1%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIENDG 677
D P FV+ R V +GD + + +WP++ A+ +MD + D DGDG +E G
Sbjct: 370 DSTPLFVVLAGRYVRRSGDLELLRQLWPNIEAALHWMDTYGDADGDGFLEYAG 422
>gi|373456251|ref|ZP_09548018.1| Amylo-alpha-16-glucosidase [Caldithrix abyssi DSM 13497]
gi|371717915|gb|EHO39686.1| Amylo-alpha-16-glucosidase [Caldithrix abyssi DSM 13497]
Length = 1123
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 35/138 (25%), Positives = 60/138 (43%), Gaps = 8/138 (5%)
Query: 619 YDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGF 678
YD A D P +V+ + A+GD F + W + A+ ++ D DGD +IEN
Sbjct: 647 YDAA---DATPLYVILAAHYLKASGDLTFIQQSWSHIKRAIDFLYSTDTDGDMLIENKNV 703
Query: 679 PDQTYDTWSVSGISA--YSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA-KVVYEK 735
+ + G + Y GLW L+ A+ LA+++ + + +K +++
Sbjct: 704 GHGWVEGGKLWGANPTFYLAGLWAQTLKDAAYLAKKLKLQDLSRRYAADAEKVIRILNRD 763
Query: 736 LWN--GSYFNYDNSGSSQ 751
WN +F Y S Q
Sbjct: 764 FWNEKDDFFYYGKIKSCQ 781
>gi|374624040|ref|ZP_09696525.1| Amylo-alpha-1,6-glucosidase [Ectothiorhodospira sp. PHS-1]
gi|373943126|gb|EHQ53671.1| Amylo-alpha-1,6-glucosidase [Ectothiorhodospira sp. PHS-1]
Length = 722
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 34/62 (54%), Gaps = 6/62 (9%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE-----NDGFP 679
D P F++ R T D+ F +WP + A+A++D++ D DGDG +E +DG
Sbjct: 370 DATPLFIVLAERYYRHTADRSFIAGLWPHIERALAWIDRYGDVDGDGFVEYHRESDDGLV 429
Query: 680 DQ 681
+Q
Sbjct: 430 NQ 431
>gi|150376166|ref|YP_001312762.1| amylo-alpha-16-glucosidase [Sinorhizobium medicae WSM419]
gi|150030713|gb|ABR62829.1| Amylo-alpha-16-glucosidase [Sinorhizobium medicae WSM419]
Length = 734
Score = 41.2 bits (95), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE 674
D P F++ + TGD + +WP++ A+ ++D+F DRDGDG +E
Sbjct: 382 DSTPLFIMLAGAYLDRTGDTDTVRGLWPNIVAALDWIDRFGDRDGDGFVE 431
>gi|75676297|ref|YP_318718.1| amylo-alpha-1,6-glucosidase [Nitrobacter winogradskyi Nb-255]
gi|74421167|gb|ABA05366.1| amylo-alpha-1,6-glucosidase [Nitrobacter winogradskyi Nb-255]
Length = 741
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 30/106 (28%), Positives = 45/106 (42%), Gaps = 22/106 (20%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE---------- 674
D P FV+ + TGD + +WP + A+ +++Q+ DRDGDG +E
Sbjct: 389 DSTPLFVVLAGAYLDRTGDVVTIRRLWPHIEAALTWIEQYGDRDGDGFVEYHRQTSEGLI 448
Query: 675 NDGFPDQTYDTWSVSG-----------ISAYSGGLWVAALQAASAL 709
N G+ D + G + AY G W AA + A L
Sbjct: 449 NQGWKDSYDSVFHADGTFAKGPIALVEVQAYVYGAWCAAAKIAEQL 494
>gi|398384993|ref|ZP_10543020.1| glycogen debranching enzyme [Sphingobium sp. AP49]
gi|397721792|gb|EJK82339.1| glycogen debranching enzyme [Sphingobium sp. AP49]
Length = 739
Score = 40.8 bits (94), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 60/140 (42%), Gaps = 30/140 (21%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE---------- 674
D P FV+ + TGD +F + P++ A+ ++D DRDGDG +E
Sbjct: 387 DSTPLFVMLAGAYLERTGDLQFLARILPNIKSALQWIDNHGDRDGDGFVEYGRLTDQGLQ 446
Query: 675 NDGFPDQTYDTWSVSG-----------ISAYSGGLWVAALQAASALAREVGDRGSEDYFL 723
N G+ D + G + AY G W QAAS++ R GD + +L
Sbjct: 447 NQGWKDSHDSIFHADGSIAKGPIALAEVQAYVYGAW----QAASSIYRAKGDLDRSESYL 502
Query: 724 FKFQKAKVVYEKLWNGSYFN 743
+ V + ++ ++F+
Sbjct: 503 AR----AVDLQDRFDATFFD 518
>gi|163851052|ref|YP_001639095.1| amylo-alpha-16-glucosidase [Methylobacterium extorquens PA1]
gi|163662657|gb|ABY30024.1| Amylo-alpha-16-glucosidase [Methylobacterium extorquens PA1]
Length = 778
Score = 40.4 bits (93), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 31/50 (62%), Gaps = 1/50 (2%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE 674
D P FV+ + TGD + +A+WP + +A+ ++D + DRDGDG +E
Sbjct: 426 DGTPLFVMLAAQYHEVTGDLETIRAIWPQLKLALEWIDTYGDRDGDGFVE 475
>gi|110636016|ref|YP_676224.1| amylo-alpha-1,6-glucosidase [Chelativorans sp. BNC1]
gi|110287000|gb|ABG65059.1| Amylo-alpha-1,6-glucosidase [Chelativorans sp. BNC1]
Length = 734
Score = 40.4 bits (93), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 33/50 (66%), Gaps = 1/50 (2%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE 674
D P FV+ + TGD + K++WP++ A+++++++ DRDGDG +E
Sbjct: 382 DATPLFVMLAGAYLDRTGDLETLKSLWPNIEAALSWIEEYGDRDGDGFVE 431
>gi|323495701|ref|ZP_08100771.1| glycogen debranching enzyme [Vibrio sinaloensis DSM 21326]
gi|323319168|gb|EGA72109.1| glycogen debranching enzyme [Vibrio sinaloensis DSM 21326]
Length = 924
Score = 40.0 bits (92), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 74/173 (42%), Gaps = 22/173 (12%)
Query: 573 AAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFV 632
A ++ + + M+L DD ++ V I +Y+T D P +
Sbjct: 443 AKAIIENFASMQLQDDTSLNDGRIPNRVTSKTNI-----------IYNTT---DGTPWMI 488
Query: 633 LQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIEN---DGFPDQTYD---TW 686
++ + +GD FAK ++P V +A +++ D DG+I++ D + D D W
Sbjct: 489 REILEYINYSGDIDFAKRIYPVVQRFIAGVERNYLDQDGLIKHRHPDTWMDAKIDGQIPW 548
Query: 687 SVSGISAYS-GGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKLW 737
S G A LW AL A LA+ D D+ + Q+AK + +K W
Sbjct: 549 SPRGPKANDIQALWFDALHVAQTLAQLNQDDVYADHVVSLAQQAKQSFIDKFW 601
>gi|449672634|ref|XP_002169551.2| PREDICTED: uncharacterized protein LOC100201428 [Hydra
magnipapillata]
Length = 502
Score = 40.0 bits (92), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 42/166 (25%), Positives = 73/166 (43%), Gaps = 9/166 (5%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDT 685
D+N F++Q + D F ++P AM ++ Q G G+ PD TYD
Sbjct: 107 DVNVGFIVQALLMYKWSNDTVFFDEIYPFTVRAMNWLMQDATKGTGLPYRK--PD-TYDL 163
Query: 686 WSVSGI--SAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA-KVVYEKLWN---G 739
+ + AY+ ++ AL A ++A D + +A K + ++WN G
Sbjct: 164 FDLEKYDHCAYNSISYLLALHAMKSMASMQKDNNTIKNVNAALARATKQLEVEMWNEKKG 223
Query: 740 SYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALK 785
Y + + + + +D L GQ +A GL ++D+DK + LK
Sbjct: 224 FYHAWFDDEWGSPTWLMSDVLYGQVWAYTLGLGDLLDKDKMKKHLK 269
>gi|15921050|ref|NP_376719.1| hypothetical protein ST0815 [Sulfolobus tokodaii str. 7]
gi|342306303|dbj|BAK54392.1| glucan 1,4-alpha-glucosidase [Sulfolobus tokodaii str. 7]
Length = 615
Score = 40.0 bits (92), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 36/147 (24%), Positives = 64/147 (43%), Gaps = 20/147 (13%)
Query: 634 QVYRDVVATGD--KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVS-G 690
Q+Y+D+ D K F K P++ M Y++ DG P ++D W G
Sbjct: 378 QIYKDIDELIDIFKNFLK---PAIRFLMRYVE------------DGLPKPSFDLWEERYG 422
Query: 691 ISAYSGGLWVAALQAASALAREVGDR--GSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSG 748
I Y+ AL +AS LAR++GD S+ + ++ K +V+ + NG + +
Sbjct: 423 IHIYTVSTVYGALVSASELARDIGDEVLASDMLDVAEYMKEEVLRRMVHNGRFIRRIDEN 482
Query: 749 SSQSSSIQADQLAGQWYARACGLLPIV 775
++ I A + ++ PI+
Sbjct: 483 GNKDLVIDASMYSPYFFGMVDVRDPIM 509
>gi|240138186|ref|YP_002962658.1| hypothetical protein MexAM1_META1p1519 [Methylobacterium extorquens
AM1]
gi|418061390|ref|ZP_12699252.1| Amylo-alpha-16-glucosidase [Methylobacterium extorquens DSM 13060]
gi|240008155|gb|ACS39381.1| conserved hypothetical protein, putative glycosyltransferase
[Methylobacterium extorquens AM1]
gi|373565054|gb|EHP91121.1| Amylo-alpha-16-glucosidase [Methylobacterium extorquens DSM 13060]
Length = 778
Score = 39.7 bits (91), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE 674
D P FV+ + TGD +A+WP + +A+ ++D + DRDGDG +E
Sbjct: 426 DGTPLFVMLAAQYHEVTGDLDTIRAIWPQLKLALEWIDTYGDRDGDGFVE 475
>gi|218529882|ref|YP_002420698.1| amylo-alpha-16-glucosidase [Methylobacterium extorquens CM4]
gi|218522185|gb|ACK82770.1| Amylo-alpha-16-glucosidase [Methylobacterium extorquens CM4]
Length = 778
Score = 39.7 bits (91), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE 674
D P FV+ + TGD +A+WP + +A+ ++D + DRDGDG +E
Sbjct: 426 DGTPLFVMLAAQYHEVTGDLDTIRAIWPQLKLALEWIDTYGDRDGDGFVE 475
>gi|294012573|ref|YP_003546033.1| amylo-alpha-1,6-glucosidase [Sphingobium japonicum UT26S]
gi|292675903|dbj|BAI97421.1| amylo-alpha-1,6-glucosidase [Sphingobium japonicum UT26S]
Length = 749
Score = 39.7 bits (91), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 1/57 (1%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIENDGFPDQ 681
D P FV+ + TGD F ++ P V A++++D+ DRDGDG +E DQ
Sbjct: 397 DSTPLFVMLAGAYLERTGDSGFIASILPHVEAALSWIDEHGDRDGDGFVEYGRLTDQ 453
>gi|254560746|ref|YP_003067841.1| hypothetical protein METDI2293 [Methylobacterium extorquens DM4]
gi|254268024|emb|CAX23895.1| conserved hypothetical protein, putative glycosyltransferase
[Methylobacterium extorquens DM4]
Length = 778
Score = 39.7 bits (91), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE 674
D P FV+ + TGD +A+WP + +A+ ++D + DRDGDG +E
Sbjct: 426 DGTPLFVMLAAQYHEVTGDLDTIRAIWPQLKLALEWIDTYGDRDGDGFVE 475
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.134 0.417
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 16,363,448,587
Number of Sequences: 23463169
Number of extensions: 722419741
Number of successful extensions: 1540419
Number of sequences better than 100.0: 552
Number of HSP's better than 100.0 without gapping: 412
Number of HSP's successfully gapped in prelim test: 140
Number of HSP's that attempted gapping in prelim test: 1536609
Number of HSP's gapped (non-prelim): 1273
length of query: 954
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 802
effective length of database: 8,792,793,679
effective search space: 7051820530558
effective search space used: 7051820530558
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 82 (36.2 bits)